Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
MUSCLE-SPECIFIC NUCLEIC ACID REGULATORY ELEMENTS AND METHODS AND USE THEREOF
Document Type and Number:
WIPO Patent Application WO/2021/165353
Kind Code:
A1
Abstract:
The present invention relates to nucleic acid regulatory elements that are able to enhance muscle-specific expression of genes, in particular expression in muscle cells and/or tissues such as in diaphragm, smooth muscle, heart and/or skeletal muscle, comprising at least two diaphragm-specific regulatory elements and a heart- and skeletal muscle-specific regulatory element. The present invention further relates to expression cassettes and vectors containing these nucleic acid regulatory elements, as well as uses thereof. The present invention is particularly useful for applications using gene therapy of muscle-related disorders, more particularly diaphragm, heart and/or skeletal muscle-directed gene therapy, and for vaccination purposes.

Inventors:
VANDENDRIESSCHE THIERRY (BE)
CHUAH LAY KHIM (BE)
Application Number:
PCT/EP2021/053945
Publication Date:
August 26, 2021
Filing Date:
February 18, 2021
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV BRUSSEL VRIJE (BE)
International Classes:
C07K14/47; A61K48/00; C12N15/86
Domestic Patent References:
WO2018178067A12018-10-04
WO2015110449A12015-07-30
WO2015110449A12015-07-30
WO2018178067A12018-10-04
WO2015110449A12015-07-30
WO2018178067A12018-10-04
Other References:
POZSGAI E R ET AL: "[beta]-Sarcoglycan gene transfer decreases fibrosis and restores force in LGMD2E mice", GENE THERAPY, vol. 23, no. 1, 2016, pages 57 - 66, XP037325556, ISSN: 0969-7128, DOI: 10.1038/GT.2015.80
B WANG ET AL: "Construction and analysis of compact muscle-specific promoters for AAV vectors", GENE THERAPY, vol. 15, no. 22, 19 November 2008 (2008-11-19), pages 1489 - 1499, XP055078593, ISSN: 0969-7128, DOI: 10.1038/gt.2008.104
PACAK ET AL., CIRC RES., vol. 99, no. 4, 2006, pages e3 - 9
VANDENDRIESSCHE ET AL., J THROMB HAEMOST., vol. 5, no. 1, 2007, pages 16 - 24
INAGAKI ET AL., MOL THER. 2006, vol. 14, no. 1, 2006, pages 45 - 53
GREENSAMBROOK: "Molecular Cloning: A Laboratory Manual", 2012, COLD SPRING HARBOR LABORATORY PRESS
AUSUBEL ET AL., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, 1987
ALTSCHUL ET AL., J MOL BIOL, vol. 215, 1990, pages 403 - 10
TATUSOVAMADDEN, FEMS MICROBIOL LETT, vol. 174, 1999, pages 247 - 250
LI ET AL., NAT BIOTECHNOL., vol. 17, 1999, pages 241 - 245
WANG ET AL., GENE THER, vol. 15, 2008, pages 1489 - 1499
SALVA ET AL., MOL THER, vol. 15, 2007, pages 320 - 9
LEVITT ET AL., GENES DEV, vol. 3, 1989, pages 1019 - 1025
TULALAMBA W ET AL.: "Distinct transduction of muscle tissue in mice after systemic delivery of AAVpo1 vectors", GENE THER., 2019, Retrieved from the Internet
CHAHAL ET AL.: "Production of adeno-associated virus (AAV) serotypes by transient transfection of HEK293 cell suspension cultures for gene delivery", JOURNAL OF VIROLOGICAL METHODS, vol. 196, 2014, pages 163 - 173, XP055685164, DOI: 10.1016/j.jviromet.2013.10.038
GRIEGER ET AL.: "Production of recombinant adeno-associated virus vectors using suspension HEK293 cells and continuous harvest of vector from the culture media for GMP FIX and FLT1 clinical vector", MOLECULAR THERAPY, vol. 24, 2016, pages 287 - 297, XP055436946, DOI: 10.1038/mt.2015.187
BLESSING ET AL.: "Scalable Production of AAV Vectors in Orbitally Shaken HEK293 Cells", MOLECULAR THERAPY METHODS & CLINICAL DEVELOPMENT, vol. 13, 2019, pages 14 - 26
KOTIN ET AL.: "Manufacturing Clinical Grade Recombinant Adeno-Associated Virus Using Invertebrate Cell Lines", HUMAN GENE THERAPY, vol. 28, 2017, pages 350 - 360, XP055690044, DOI: 10.1089/hum.2017.042
SEMIN RESPIR CRIT CARE MED., vol. 23, no. 3, June 2002 (2002-06-01), pages 191 - 200
VANDENDRIESSCHE ET AL., J THROMB HAEMOST, vol. 5, 2007, pages 16 - 24
Attorney, Agent or Firm:
DE CLERCQ & PARTNERS (BE)
Download PDF:
Claims:
CLAIMS

1. A nucleic acid regulatory element for enhancing muscle-specific gene expression comprising at least two diaphragm-specific regulatory elements selected from a diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02) or a functional fragment thereof, a diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:3 (e.g. Dph-CRE04) or a functional fragment thereof, and a diaphragm-specific regulatory consisting of the nucleotide sequence set forth in SEQ ID NO:4 (e.g. Dph-CRE06) or a functional fragment thereof; and a heart- and skeletal muscle-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5) or a functional fragment thereof.

2. The nucleic acid regulatory element according to claim 1 comprising at least: a) the diaphragm-specific regulatory consisting of the nucleotide sequence set forth in SEQ ID NO:4 (e.g. Dph-CRE06) or a functional fragment thereof; b) the heart- and skeletal muscle-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5) or a functional fragment thereof; and cl) either the diaphragm-specific regulatory elements consisting of the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02) or a functional fragment thereof, or c2) the diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:3 (e.g. Dph-CRE04) or a functional fragment thereof, or c3) both diaphragm-specific regulatory elements of cl) and c2).

3. The nucleic acid regulatory element according to claim 1 or 2 comprising a diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:7 (Dph-CRE04- CRE06) or SEQ ID NO:8 (Dph-CRE06-CRE04), and the heart- and skeletal muscle-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof.

4. The nucleic acid regulatory element according to claim 3, comprising a diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:9 (Dph-CRE02- CRE04-CRE06) and the heart- and skeletal muscle-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof.

5. The nucleic acid regulatory element according to claim 4, comprising the nucleotide sequence set forth in SEQ ID NO:14 (Dph-CRE02 - Dph-CRE04 - Dph-CRE06 - CSk-SH5).

6. In vitro or ex vivo use of the nucleic acid regulatory element according to any one of claims 1 to 5 for enhancing gene expression in muscle cells or tissue, preferably in diaphragm, smooth muscle, heart and/or skeletal muscle cells or tissue, more preferably in diaphragm, heart and/or skeletal muscle cells or tissue.

7. A nucleic acid expression cassette comprising a nucleic acid regulatory element according to any one of claims 1 to 5, operably linked to a promoter and a transgene, preferably a codon-optimized transgene.

8. The nucleic acid expression cassette according to claim 7, wherein the promoter is a muscle- specific promoter, preferably the Spc5-12 promoter as defined by SEQ ID NO:15.

9. The nucleic acid expression cassette according to any one of claims 7 or 8, wherein the transgene encodes a sarcoglycan, preferably b-sarcoglycan, more preferably human b-sarcoglycan.

10. The nucleic acid expression cassette according to any one of claims 7 to 9, wherein the nucleic acid regulatory element comprises the nucleotide sequence set forth in SEQ ID NO:14 (Dph-CRE02 - Dph-CRE04 - Dph-CRE06 - CSk-SH5), wherein the promoter is the Spc5-12 promoter as defined by SEQ ID NO:15, and wherein the transgene encodes b-sarcoglycan, preferably wherein the expression cassette further comprises the Minute Virus of Mouse (MVM) intron as set forth in SEQ ID NO: 16 and a polyadenylation signal as defined by SEQ ID NO:17.

11. A vector, preferably an adeno-associated viral (AAV) vector, more preferably an AAV9 vector, comprising the nucleic acid regulatory element according to any one of claims 1 to 5, or the nucleic acid expression cassette according to any one of claims 7 to 10.

12. A pharmaceutical composition comprising the nucleic acid expression cassette according to any one of claims 7 to 10, or the vector according to claim 11, and a pharmaceutically acceptable carrier.

13. The nucleic acid regulatory element according to any one of claims 1 to 5, the nucleic acid expression cassette according to any one of claims 7 to 10, the vector according to claim 11, or the pharmaceutical composition according to claim 12 for use in medicine, preferably in gene therapy, more preferably muscle-directed gene therapy.

14. The nucleic acid regulatory element according to any one of claims 1 to 5, the nucleic acid expression cassette according to any one of claims 7 to 10, the vector according to claim 11, or the pharmaceutical composition according to claim 12 for use in the treatment of limb girdle muscular dystrophy, preferably limb girdle muscular dystrophy type 2E (LGMD2E).

15. A method, preferably an in vitro or ex vivo method, for expressing a transgene product in muscle cells, preferably in diaphragm, smooth muscle, heart and/or skeletal muscle cells, more preferably in diaphragm, heart and/or skeletal muscle cells comprising:

- introducing the nucleic acid expression cassette according to any one of claims 7 to 10, or the vector according to claim 11 into said muscle cells;

- expressing the transgene product in the muscle cells.

Description:
MUSCLE-SPECIFIC NUCLEIC ACID REGULATORY ELEMENTS AND METHODS AND USE THEREOF

FIELD

The present invention relates to nucleic acid regulatory elements that are able to enhance muscle- specific expression of genes, methods employing these regulatory elements and use thereof. More particularly, the nucleic acid regulatory elements are able to enhance gene expression in diaphragm, cardiac and skeletal muscle and smooth muscle, preferably in diaphragm, cardiac and skeletal muscle. The invention encompasses expression cassettes, vectors and pharmaceutical compositions comprising these regulatory elements. The present invention is particularly useful for applications using gene therapy, more particularly muscle-directed gene therapy, and for vaccination purposes.

BACKGROUND

There are many hereditary disorders that are due to a gene defect that impairs the function of smooth muscle, skeletal muscle, heart and/or diaphragm (e.g. Pompe disease, muscular dystrophies etc.). This gene defect may ultimately result in severe muscle weakness and paralysis or cardiopulmonary failure with life-threatening consequences. For example, Limb Girdle Muscular Dystrophy Type 2E (LGMD2E) is one of the most severe LGMD disorders with a worldwide incidence of 1 in 200,000 to 350,000. It is a type of autosomal recessive inheritable muscle dystrophy caused by mutation of the b-sarcoglycan (b-sg or bsg) gene, resulting in depletion of the functional b- sarcoglycan protein. Consequently, LGMD2E patients exhibit severe and widespread progressive muscle wasting, pelvic and shoulder girdle weakness, muscle fibrosis and force reduction, cardiomyopathy, joint diseases, and diaphragm dysfunction. Patients often die from cardiopulmonary failure, since standard clinical modalities do not provide a cure to the disease. Hence, developing an effective clinical therapy for LGMD2E represents an urgent unmet medical need.

Gene therapy provides an unprecedented opportunity to simultaneously treat dysfunction and degeneration of the smooth muscle, skeletal muscle, heart and diaphragm, thanks to its ability to deliver therapeutic genes to the affected tissues in order to obtain a lasting therapeutic response. Despite its promise, the drawback is that a relatively high virus vector dose is required to achieve a desirable therapeutic effect, thus impeding possible clinical translation. More particularly, gene therapy to the smooth muscle, skeletal muscle, heart and/or diaphragm, more particularly towards skeletal muscle, heart and/or diaphragm, is relatively inefficient due to limitations in gene delivery and gene expression. Moreover, immune responses specific for the therapeutic gene product curtail the efficiency of gene therapy applications directed towards smooth muscle, skeletal muscle, heart and/or diaphragm, more particularly towards skeletal muscle, heart and/or diaphragm.

The challenges that hamper clinical translation and preclude the development of an effective cure for LGMD2E by gene therapy also relate to: (i) insufficient expression of the therapeutic transgene in the affected skeletal muscles, heart and diaphragm; and (ii) the potential toxicity and untoward immune responses due to very high doses of conventional vectors needed to reach the main muscle groups (i.e. skeletal muscle, heart and diaphragm) affected by this life-threatening disease to effectively treat the different clinical manifestations of this diseases including the skeletal muscle weakness/wasting, cardiomyopathy and diaphragm dysfunction.

Efforts to deliver transgenes to muscle have focused on vectors derived from adenoviruses, retroviruses, lentiviruses, and adeno-associated viruses (AAV), and plasmids. Adeno-associated viral vector (AAV) is by far the most promising gene delivery vehicle for muscle-directed gene therapy. AAV's natural tropism to muscle cells, their long-term persistent transgene expression, their multiple serotypes, as well as their minimal immune response have made AAV vectors well suited for muscle-directed gene therapy. AAV9 is known among the most efficient vectors for cardiac gene delivery (Pacak et al. 2006. Circ Res. 99(4):e3-9; VandenDriessche et al. 2007. Thromb Haemost. 5(l):16-24; Inagaki et al. 2006. Mol Ther. 2006 14(l):45-53), and is also well suited for skeletal muscle gene delivery. AAV vector can be delivered into skeletal muscle cardiac muscle, smooth muscle and diaphragm by means of local, regional, and systemic administrations.

There remain however concerns regarding the efficacy and safety of some gene delivery approaches. The major limiting factors are: insufficient and/or transient transgene expression levels, and inappropriate expression of the transgene in unwanted cell types. In particular, it has been shown that inadvertent transgene expression in antigen-presenting cells (APCs) increases the risk of untoward immune responses against the gene-modified cells and/or the therapeutic transgene product that consequently curtails long-term gene expression.

This problem has been addressed by boosting gene expression using muscle- and diaphragm- specific cis-regulatory elements (CREs) (also designated as cis-regulatory modules (CRM)). These novel and robust human cis-regulatory elements were obtained via genome-wide data-mining and yielded robust specific transgene expression levels in diaphragm and/or heart and skeletal muscle while avoiding expression in non-target tissues (WO 2015/110449 Al; WO 2018/178067 Al).

However, just expressing the therapeutic protein in the muscle, heart or diaphragm using one of these CRMs may not be sufficient, particularly since there is a need to further diminish the vector doses to levels that do not provoke any unwanted toxicity, such as the well-documented liver toxicity that occurred in most if not all of the gene therapy trials that were based on high vector dose that were systemically administered to the patients.

Thus, there remains a need in the art for safe and efficient gene delivery to muscle. For example, it is imperative to further improve the efficacy and safety of tissue-targeted gene therapy applications for LGMD2E, ideally by developing more robust gene therapy vectors that allow for high and widespread diaphragm, smooth muscle, heart and skeletal muscle-specific expression, preferably diaphragm, heart and skeletal muscle-specific expression, of the therapeutic b-sarcoglycan transgene (b -SG) at lower and thus safer vector doses.

SUMMARY

The present inventors addressed the challenges with current gene therapy applications by developing approaches to maximize expression in muscle cells and tissues such as diaphragm, heart and/or skeletal muscle based on novel combinations of transcriptional cis-regulatory elements or modules (CREs or CRMs) that confer unexpectedly high expression in these specific cells and tissues. To this end, AAV vectors were designed that express human b-sarcoglycan cDNA (f^sg or f^SG) using combinations of CRE elements or modules that direct expression to diaphragm (also referred to herein as Dph-CRE) or heart and skeletal muscle (also referred to herein as CSk(-)CRE or CSk(- )SH(-)CRE or CSk(-)SH or Csk(-)CRE or Csk(-)SH(-)CRE or Csk(-)SH or CSK(-)CRE or CSK(-)SH(-)CRE or CSK(-)SH) in conjunction with a potent muscle-specific promotor as a gene therapy strategy for LGMD2E. These novel regulatory elements were subsequently validated in vivo in mice yielding unexpectedly high and tissue-specific gene expression in heart, smooth muscle, skeletal muscle and diaphragm, preferably in heart, skeletal muscle and diaphragm. This approach hence, allows for the use of lower and thus safer vector doses, while maximizing therapeutic efficacy.

The invention therefore provides the following aspects:

Aspect 1: a nucleic acid regulatory element for enhancing muscle-specific gene expression comprising, consisting essentially of, or consisting of at least two diaphragm-specific regulatory elements selected from a diaphragm-specific regulatory element comprising, consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:2, preferably the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02), or a functional fragment thereof; a diaphragm-specific regulatory element comprising, consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:3, preferably the nucleotide sequence set forth in SEQ ID NO:3 (e.g. Dph- CRE04), or a functional fragment thereof; and a diaphragm-specific regulatory element comprising, consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:4, preferably the nucleotide sequence set forth in SEQ ID NO:4 (e.g. Dph-CRE06), or a functional fragment thereof; and a heart- and skeletal muscle-specific regulatory element comprising, consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:l, preferably the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5), or a functional fragment thereof.

Aspect 2: the nucleic acid regulatory element according to aspect 1 comprising, consisting essentially of or consisting of a diaphragm-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:2, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph-CRE02), or a functional fragment thereof; a diaphragm-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:3, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:3 (Dph-CRE04), or a functional fragment thereof; and a heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:l, preferably the heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5), or a functional fragment thereof.

Aspect 3: the nucleic acid regulatory element according to aspect 2 comprising, consisting essentially of or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:5 (Dph-CRE02-CRE04) ;and the heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5), or a functional fragment thereof.

Aspect 4: the nucleic acid regulatory element according to aspect 2 or 3, comprising, consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:10 (Dph-CRE02 - Dph- CRE04-CSk-SH5).

Aspect 5: the nucleic acid regulatory element according to any one of aspects 2 to 4, further comprising a diaphragm-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:4, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph-CRE06), or a functional fragment thereof. Aspect 6: the nucleic acid regulatory element according to aspect 1 comprising, consisting essentially of or consisting of a diaphragm-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:2, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph-CRE02), or a functional fragment thereof; a diaphragm-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:4, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph-CRE06), or a functional fragment thereof; and a heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:l, preferably the heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5), or a functional fragment thereof.

Aspect 7: the nucleic acid regulatory element according to aspect 6 comprising, consisting essentially of or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:6 (Dph-CRE02-CRE06) and the heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof.

Aspect 8: the nucleic acid regulatory element according to aspect 6 or 7, comprising, consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:ll (Dph-CRE02 - Dph- CRE06 - CSk-SH5).

Aspect 9: the nucleic acid regulatory element according to any one of aspects 6 to 8, further comprising a diaphragm-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:3, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:3 (Dph-CRE04), or a functional fragment thereof.

Aspect 10: the nucleic acid regulatory element according to aspect 1 comprising, consisting essentially of or consisting of a diaphragm-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:3, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:3 (Dph-CRE04), or a functional fragment thereof; a diaphragm-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:4, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph-CRE06), or a functional fragment thereof; and a heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:l, preferably the heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5), or a functional fragment thereof.

Aspect 11: the nucleic acid regulatory element according to aspect 10 comprising, consisting essentially of or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:7 (Dph-CRE04-CRE06) and the heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof.

Aspect 12: the nucleic acid regulatory element according to aspect 10 comprising, consisting essentially of or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:8 (Dph-CRE06-CRE04) and the heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof.

Aspect 13: the nucleic acid regulatory element according to aspect 10 or 11, comprising, consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:12 (Dph-CRE04 - Dph- CRE06 - CSk-SH5).

Aspect 14: the nucleic acid regulatory element according to aspect 10 or 12, comprising, consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:13 (Dph-CRE06 - Dph- CRE04 - CSk-SH5).

Aspect 15: the nucleic acid regulatory element according to any one of aspects 10 to 14, further comprising a diaphragm-specific regulatory element consisting essentially of or consisting of a nucleotide sequence having at least 95% identity to the sequence set forth in SEQ ID NO:2, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph-CRE02), or a functional fragment thereof.

Aspect 16: the nucleic acid regulatory element according to any one of aspects 10, 11, 13 or 15 comprising, consisting essentially of or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:9 (Dph- CRE02-CRE04-CRE06), ; and the heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5), or a functional fragment thereof.

Aspect 17: the nucleic acid regulatory element according to any one of aspects 10, 11, 13, 15 or 16 comprising, consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:14 (Dph-CRE02 - Dph-CRE04 - Dph-CRE06 - CSk-SH5).

Aspect 18: use, preferably an in vitro or ex vivo use, of the nucleic acid regulatory element according to any one of aspects 1 to 17 for enhancing gene expression in muscle, in particular in diaphragm, smooth muscle, heart and skeletal muscle, more particularly in diaphragm, heart and skeletal muscle.

Aspect 19: a nucleic acid expression cassette comprising a nucleic acid regulatory element according to any one of aspects 1 to 17, operably linked to a promoter.

Aspect 20: the nucleic acid expression cassette according to aspect 19, wherein the nucleic acid regulatory element is operably linked to a promoter and a transgene.

Aspect 21: the nucleic acid expression cassette according to any one of aspects 19 or 20, wherein the promoter is a muscle-specific promoter, such as a muscle-specific promoter selected from the group comprising or consisting of: the desmin (DES) promoter; the synthetic SPc5-12 promoter (SPc5-12); the alpha-actinl promoter (ACTA1); the Creatine kinase, muscle (CKM) promoter; the Four and a half LIM domains protein 1 (FHL1) promoter; the alpha 2 actinin (ACTN2) promoter; the filamin-C (FLNC) promoter; the sarcoplasmic/endoplasmic reticulum calcium ATPase 1 (ATP2A1) promoter; the Troponin I Type 1 (TNNI1) promoter; the Troponin I Type 2 (TNNI2) promoter; the Troponin T Type 1 (TNNT1) promoter; the Troponin T Type 3 (TNNT3) promoter; the myosin-1 (MYFI1) promoter; the myosin-2 (MYFI2) promoter; the sarcolipin (SLN) promoter; the Myosin Binding Protein Cl (MYBPC1) promoter; the enolase (EN03) promoter; the Carbonic Anhydrase 3 (CA3) promoter; the phosphorylatable, fast skeletal muscle myosin light chain (MYLPF) promoter; the Tropomyosin 1 (TPM1) promoter; the Tropomyosin 2 (TPM2) promoter; the alpha-3 chain tropomyosin (TPM3) promoter; the ankyrin repeat domain-containing protein 2 (ANKRD2) promoter; the myosin heavy-chain (MFIC) promoter; the alpha myosin heavy-chain promoter (otMHC) promoter; the myosin light-chain (MLC) promoter; the muscle creatine kinase (MCK) promoter; the Myosin, Light Chain 1 (MYL1) promoter; the Myosin, Light Chain 2 (MYL2) promoter; the Myoglobin (MB) promoter; the Troponin T type 2 (TNNT2) promoter; the Troponin C type 2 (TNNC2) promoter; the Troponin C Type 1 (TNNC1) promoter; the Titin-Cap (TCAP) promoter; the Myosin, Heavy Chain 7 (MYH7) promoter; the Aldolase A (ALDOA) promoter; the dMCK promoter, the tMCK promoter; the MHCK7 promoter; the myosin heavy chain 11 (Myhll) promoter; the transgelin (Tagln) promoter and the actin alpha 2 smooth muscle (Acta2) promoter.

Aspect 22: the nucleic acid expression cassette according any one of aspects 19 to 21, wherein the promoter is the SPc5-12 promoter as defined by SEQ ID NO:15. Aspect 23: the nucleic acid expression cassette according any one of aspects 19 to 21, wherein the promoter is the MHCK7 promoter as defined by SEQ ID NO:56.

Aspect 24: the nucleic acid expression cassette according any one of aspects 19 to 21, wherein the promoter is the desmin promoter as defined by SEQ ID NO:57.

Aspect 25: the nucleic acid expression cassette according to any one of aspects 19 to 24, wherein the transgene encodes a therapeutic protein.

Aspect 26: the nucleic acid expression cassette according to any one of aspects 19 to 25, wherein the transgene encodes a sarcoglycan, preferably b-sarcoglycan, more preferably human b- sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19). Aspect 27: the nucleic acid expression cassette according to any one of aspects 19 to 25, wherein the transgene encodes acid a-glucosidase (GAA), preferably human acid a-glucosidase (hGAA) (e.g. the transgene set forth in SEQ ID NO:58, preferably the codon-optimized transgene set forth in SEQ ID NO:59).

Aspect 28: the nucleic acid expression cassette according to any one of aspects 19 to 24, wherein the transgene is a reporter gene such as a luciferase transgene (e.g. the transgene set forth in SEQ ID NO:20).

Aspect 29: the nucleic acid expression cassette according to any one of aspects 19 to 28, wherein the transgene is codon-optimized.

Aspect 30: the nucleic acid expression cassette according to any one of aspects 19 to 29, further comprising an intron, preferably the Minute Virus of Mouse (MVM) intron (as set forth in SEQ ID NO: 16).

Aspect 31: the nucleic acid expression cassette according to any one of aspects 19 to 30, further comprising a polyadenylation signal, preferably a synthetic polyadenylation signal such as the one defined by SEQ ID NO:17. Aspect 32: a vector comprising the nucleic acid regulatory element according to any one of aspects 1 to 17, or the nucleic acid expression cassette according to any one of aspects 19 to 31. Aspect 33: the vector according to aspect 32, which is a viral vector, preferably an adeno-associated viral (AAV) vector, more preferably an AAV9 vector or an AAV8 vector.

Aspect 34: a pharmaceutical composition comprising the nucleic acid expression cassette according to any one of aspects 19 to 31, or the vector according to any one of aspects 32 or 33, and a pharmaceutically acceptable carrier.

Aspect 35: the nucleic acid regulatory element according to any one of aspects 1 to 17, the nucleic acid expression cassette according to any one of aspects 19 to 31, the vector according to any one of aspects 32 or 33, or the pharmaceutical composition according to aspect 34 for use in medicine.

Aspect 36: the nucleic acid regulatory element according to any one of aspects 1 to 17, the nucleic acid expression cassette according to any one of aspects 19 to 31, the vector according to any one of aspects 32 or 33, or the pharmaceutical composition according to aspect 34 for use in gene therapy, preferably muscle-directed gene therapy. For example, the gene therapy may be for a disease or disorder selected from lysosomal storage diseases (e.g. Fabry disease) including glycogen storage disorders (e.g. Pompe disease glycogen storage disorder (GSD) type II, Danon disease, GSD type Mb, GSD III or GSD 3 (also known as Cori's disease or Forbes' disease), GSD IV or GSD4 (also known as Andersen disease), GSD V or GSD5 (also known as McArdle disease), GSD VII or GSD7 (also known as Tarui's disease), GSD X or GSD10, GSD XII or GSD 12 (also known as Aldolase A deficiency), GSD XIII or GSD13, GSD XV or GSD15) and mucopolysaccharidosis disorders (e.g. Flunter syndrome, Sanfilippo syndrome, mucopolyssacharidose (MPS) I, MPS II, MPS III, MPS IMA, MPS IIIB, MPS MIC, MPS IV, MPS VI, MPS VII, MPS IX); mitochondrial disorders (e.g. Barth syndrome); channelopathy (e.g. Brugada syndrome); metabolic disorders; myotubular myopathy (MTM); muscular dystrophy (e.g. Duchenne muscular dystrophy (DMD), Becker muscular dystrophy (BMD)); myotonic dystrophy; Myotonic Muscular Dystrophy (DM); Miyoshi myopathy; Fukuyama type congenital; dysferlinopathies; neuromuscular disease; motor neuron diseases (MND) (e.g. Charcot-Marie- Tooth disease (CMT)), spinal muscular atrophy (SMA) or amyotrophic lateral sclerosis (ALS)); Emery- Dreifuss muscular dystrophy; facioscapulohumeral muscular dystrophy (FSHD); congenital muscular dystrophies; congenital myopathie; limb girdle muscular dystrophy (e.g. Limb Girdle Muscular Dystrophy type 2E (LGMD2E) Limb Girdle Muscular Dystrophy type 2D (LGMD2D), Limb Girdle Muscular Dystrophy type 2C (LGMD2C), Limb Girdle Muscular Dystrophy type 2B (LGMD2B), Limb Girdle Muscular Dystrophy type 2L (LGMD2L), Limb Girdle Muscular Dystrophy type 2A (LGMD2A)); metabolic myopathies; muscle inflammatory diseases; myasthenia; mitochondrial myopathies; anomalies of ionic channels; nuclear envelop diseases; cardiomyopathies; cardiac hypertrophy; heart failure; distal myopathies, hemophilia (e.g. hemophilia A and B); diabetes; cardiovascular diseases and heart diseases.

Aspect 37: the nucleic acid regulatory element, the nucleic acid expression cassette, the vector, or the pharmaceutical composition for use according to aspect 36, wherein the gene therapy is for treating muscle-related disorders in general, alleviating the symptoms of myopathies in general or restoring the function of muscle cells in general.

Aspect 38: the nucleic acid regulatory element, the nucleic acid expression cassette, the vector, or the pharmaceutical composition for use according to aspect 36, wherein the gene therapy is for treating cardiovascular diseases. Non-limiting examples of cardiovascular diseases include atherosclerosis, arteriosclerosis, coronary heart disease, coronary artery disease, peripheral arterial disease, congenital heart disease, congestive heart failure, heart failure (also known as cardiac insufficiency), myocardial infarction (also known as heart attack), cardiac ischemia, acute coronary syndrome, unstable angina, stable angina, cardiomyopathy, hypertrophic cardiomyopathy, dilated cardiomyopathy, restrictive cardiomyopathy, primary cardiomyopathies caused by genetic mutations such as Brugada syndrome, Pompe disease, Danon disease and Fabry disease, cardiac amyloidosis (also known as stiff heart syndrome), myocarditis (also known as inflammatory cardiomyopathy), valvular heart disease, valvular stenosis, valvular insufficiency, endocarditis, rheumatic heart disease, pericarditis (i.e. a disease caused by an inflammation and/or infection of the pericardium), cardiac tamponade (also known as pericardial tamponade), endocarditis, cardiac arrhythmia, hypertension, hypotension, vessel stenosis, valve stenosis, or restenosis.

Aspect 39: a nucleic acid regulatory element according to any one of aspects 1 to 17, a nucleic acid expression cassette according to any one of aspects 19 to 26 or 29 to 31, a vector according to any one of aspects 32 or 33, or a pharmaceutical composition according to aspect 34 for use in the treatment of limb girdle muscular dystrophy, more preferably limb girdle muscular dystrophy type 2E (LGMD2E).

Aspect 40: a nucleic acid regulatory element according to any one of aspects 1 to 17; a nucleic acid expression cassette according to any one of aspects 19 to 25 or 29 to 31, wherein the transgene encodes a lysosomal protein, preferably a lysosomal protein selected from the group consisting of acid a-galactosidase (GAA), alpha-galactosidase A and LAMP2; a vector according to any one of aspects 32 or 33 comprising said nucleic acid expression cassette; or a pharmaceutical composition according to aspect 34 comprising said nucleic acid expression cassette or said vector, for use in the treatment of a lysosomal storage disease. Aspect 41: a nucleic acid regulatory element according to any one of aspects 1 to 17, a nucleic acid expression cassette according to any one of aspects 19 to 25, 27, or 29 to 31, a vector according to any one of aspects 32 or 33, or a pharmaceutical composition according to aspect 34 for use in the treatment of Pompe disease.

Aspect 42: A method, preferably an in vitro or ex vivo method, for expressing a transgene product in muscle cells such as diaphragm, smooth muscle, heart and/or skeletal muscle cells, in particular in diaphragm, heart and/or skeletal muscle cells, comprising:

- introducing the nucleic acid expression cassette according to any one of aspects 19 to 31, or the vector according to any one of aspects 32 or 33 into the muscle cells;

- expressing the transgene product in the muscle cells.

BRIEF DESCRIPTION OF DRAWINGS

FIG 1: Nucleic acid regulatory elements. (A) CSk-SH5 (SEQ ID NO: 1). (B) Dph-CRE02 (SEQ ID NO:2). (C) Dph-CRE04 (SEQ ID NO:3). (D) Dph-CRE06 (SEQ ID NO:4). (E) Dph-CRE02-CRE04 (SEQ ID NO:5), which is a combination of Dph-CRE02 and Dph-CRE04 connected through a flanking sequence (underlined). (F) Dph-CRE02-CRE06 (SEQ ID NO:6), which is a combination of Dph-CRE02 and Dph- CRE06 connected through a flanking sequence (underlined). (G) Dph-CRE04-CRE06 (SEQ ID NO:7), which is a combination of Dph-CRE04 and Dph-CRE06 connected through a flanking sequence (underlined). (H) Dph-CRE06-CRE04 (SEQ ID NO:8), which is a combination of Dph-CRE06 and Dph- CRE04 connected through a flanking sequence (underlined). (I) Dph-CRE02-CRE04-CRE06 (SEQ ID NO:9), which is a combination of Dph-CRE02, Dph-CRE04 and Dph-CRE06 connected with each other through a flanking sequence (underlined). (J) Dph-CRE02-CRE04-CSk-SH5 (SEQ ID NO:10). (K) Dph-CRE02-CRE06-CSk-SH5 (SEQ ID NO:ll). (L) Dph-CRE04-CRE06-CSk-SH5 (SEQ ID NO:12). (M) Dph-CRE06-CRE04-CSk-SH5 (SEQ ID NO:13). (N) Dph-CRE02-CRE04-CRE06-CSk-SH5 (SEQ ID NO:14).

FIG 2: Design of AAV vectors. AAV vectors were designed to express human b-sarcoglycan (hpsg) or luciferase (Luc). The vectors contain either codon-optimized (hpsgco, SEQ ID NO:19) or non codon-optimized hpsg (SEQ ID NO:18) transgene. The vectors contain a synthetic muscle promoter designated as SPc5-12 (SEQ ID NO:15). The vectors all contain a minute virus of mouse intron (MVM, SEQ ID NO:16) and a synthetic polyadenylation site (pA, SEQ ID NO:17). The vectors are flanked by a 5' and 3' AAV inverted terminal repeat (ITR). The vectors contain a cardiac and skeletal muscle- specific CRE (CSk-SH5) (A, B, Al), in combination with one or more diaphragm-specific CREs (CRE02, CRE04 or CRE06) (C-W), or are devoid of any CRE element (X, Y, Z). FIG 3: Effect of CRE combinations on luciferase expression. Bioluminescence analysis (BLI) of gastrocnemius and quadriceps of CB17-SCID mice that were injected with AAV vectors expressing luciferase from a synthetic muscle promoter designated as SPc5-12 (SPc). The vectors contained a cardiac/skeletal muscle CRE (CSk-SH5) alone, or in combination with one or more diaphragm- specific CRE (CRE02, CRE04 and/or CRE06) and/or a cardiac/skeletal muscle CRE (CSk-SH5).

FIG 4: Effect of CRE combinations on expression of codon-optimized hpsc gene in gastrocnemius (A), quadriceps (B), diaphragm (C) and heart (D). AAV vectors expressing codon-optimized (hpsgco) from the SPc5-12 promoter were injected into CB17-SCID. The vectors contained a heart and skeletal muscle-specific CRE (CSk-SH5), either alone or in combination with one or two diaphragm- specific CREs (Dph-CRE02, Dph-CRE04 and/or Dph-CRE06). Expression of hpsgco gene was determined by quantitative real-time reverse transcriptase PCR (qRT-PCR).

FIG 5: Effect of CRE combinations on expression of codon-optimized hbsc gene in gastrocnemius (A), quadriceps (B), diaphragm (C) and heart (D). AAV vectors expressing codon-optimized (hpsgco) from the SPc5-12 promoter were injected into CB17-SCID. The vectors contained no CRE element (SPc), a heart and skeletal muscle-specific CRE (CSk-SH5-SPC), a combination of two diaphragm- specific CREs and a heart and skeletal muscle-specific CRE (CRE04-CRE06-CSk-SH5), or a combination of three diaphragm-specific CREs and a heart and skeletal muscle-specific CRE (CRE02- CRE04-CRE06-CSk-SH5). Expression of hpsgco gene was determined by quantitative real-time reverse transcriptase PCR (qRT-PCR).

FIG 6: Treadmill endurance assay to assess phenotypic correction of gene therapy treated Sgcb- null mice. Three different groups of mice were subjected to a treadmill endurance assay. Sgcb-null mice injected with AAV-SPc5-12-MVM-hR-SGco-pA, AAV-CRE02-CRE04-CRE06-CSk-SH5-SPc5-12- MVM-hR-SGco-pA, or PBS as negative control. The average distance run by each mouse group is indicated (n=3).

DESCRIPTION

As used herein, the singular forms "a", "an", and "the" include both singular and plural referents unless the context clearly dictates otherwise.

The terms "comprising", "comprises" and "comprised of" as used herein are synonymous with "including", "includes" or "containing", "contains", and are inclusive or open-ended and do not exclude additional, non-recited members, elements or method steps. The terms also encompass "consisting of" and "consisting essentially of", which enjoy well-established meanings in patent terminology.

The recitation of numerical ranges by endpoints includes all numbers and fractions subsumed within the respective ranges, as well as the recited endpoints.

The terms "about" or "approximately" as used herein when referring to a measurable value such as a parameter, an amount, a temporal duration, and the like, are meant to encompass variations of and from the specified value, such as variations of +/-10% or less, preferably +/-S% or less, more preferably +/-!% or less, and still more preferably +/-0.1% or less of and from the specified value, insofar such variations are appropriate to perform in the disclosed invention. It is to be understood that the value to which the modifier "about" refers is itself also specifically, and preferably, disclosed.

Whereas the terms "one or more" or "at least one", such as one or more members or at least one member of a group of members, is clear per se, by means of further exemplification, the term encompasses inter alia a reference to any one of said members, or to any two or more of said members, such as, e.g., any >3, >4, >5, >6 or >7 etc. of said members, and up to all said members. In another example, "one or more" or "at least one" may refer to 1, 2, 3, 4, 5, 6, 7 or more.

The discussion of the background to the invention herein is included to explain the context of the invention. This is not to be taken as an admission that any of the material referred to was published, known, or part of the common general knowledge in any country as of the priority date of any of the claims.

Throughout this disclosure, various publications, patents and published patent specifications are referenced by an identifying citation. All documents cited in the present specification are hereby incorporated by reference in their entirety. In particular, the teachings or sections of such documents herein specifically referred to are incorporated by reference.

Unless otherwise defined, all terms used in disclosing the invention, including technical and scientific terms, have the meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. By means of further guidance, term definitions are included to better appreciate the teaching of the invention. When specific terms are defined in connection with a particular aspect of the invention or a particular embodiment of the invention, such connotation is meant to apply throughout this specification, i.e., also in the context of other aspects or embodiments of the invention, unless otherwise defined. In the following passages, different aspects or embodiments of the invention are defined in more detail. Each aspect or embodiment so defined may be combined with any other aspect(s) or embodiment(s) unless clearly indicated to the contrary. In particular, any feature indicated as being preferred or advantageous may be combined with any other feature or features indicated as being preferred or advantageous.

Reference throughout this specification to "one embodiment", "an embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases "in one embodiment" or "in an embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment, but may. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to a person skilled in the art from this disclosure, in one or more embodiments. Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention, and form different embodiments, as would be understood by those in the art. For example, in the appended claims, any of the claimed embodiments can be used in any combination.

For general methods relating to the invention, reference is made inter alia to well-known textbooks, including, e.g., "Molecular Cloning: A Laboratory Manual, 4 th Ed." (Green and Sambrook, 2012, Cold Spring Flarbor Laboratory Press), "Current Protocols in Molecular Biology" (Ausubel et al., 1987).

In an aspect, the invention relates to a nucleic acid regulatory element for enhancing muscle- specific gene expression, in particular for enhancing diaphragm, smooth muscle, cardiac and skeletal muscle-specific gene expression, more particularly for enhancing diaphragm, cardiac and skeletal muscle-specific gene expression comprising, consisting essentially of (i.e., the regulatory element may for instance additionally comprise sequences used for cloning purposes, but the indicated sequences make up the essential part of the regulatory element), or consisting of at least two, such as two, three or more, diaphragm-specific regulatory elements and a heart and skeletal muscle-specific regulatory element, wherein the at least two diaphragm-specific regulatory elements are selected from the group consisting of: a diaphragm-specific regulatory element comprising, consisting essentially of (i.e., the regulatory element may for instance additionally comprise sequences used for cloning purposes, but the indicated sequences make up the essential part of the regulatory element, e.g. they do not form part of a larger regulatory region such as a promoter), or consisting of a nucleotide sequence having at least 95% identity to SEQ ID NO:2, preferably the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02,) or a functional fragment thereof; a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to SEQ ID NO:3, preferably the nucleotide sequence set forth in SEQ ID NO:3 (e.g. Dph-CRE04), or a functional fragment thereof; and a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to SEQ ID NO:4, preferably the nucleotide sequence set forth in SEQ ID NO:4 (e.g. Dph-CRE06), or a functional fragment thereof, and wherein the heart and skeletal muscle-specific regulatory element comprises, consists essentially of, or consists of a nucleotide sequence having at least 95% identity to SEQ ID NO:l, preferably the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5), or a functional fragment thereof.

A 'regulatory element' or 'cis-regulatory element (CRE)' or 'cis-regulatory module (CRM)' or “SH” as used herein refers to a transcriptional control element, in particular a non-coding cis-acting transcriptional control element, capable of regulating and/or controlling transcription of a gene, in particular tissue-specific transcription of a gene. Regulatory elements comprise at least one transcription factor binding site (TFBS), more in particular at least one binding site for a tissue- specific transcription factor. Regulatory elements alone are typically not sufficient to initiate transcription, but require a promoter to this end. Typically, regulatory elements as used herein increase or enhance promoter-driven gene expression when compared to the transcription of the gene from the promoter alone, without the regulatory elements. Thus, regulatory elements particularly comprise enhancer sequences, although it is to be understood that the regulatory elements enhancing transcription are not limited to typical far upstream enhancer sequences, but may occur at any distance of the gene they regulate or even within the gene or open reading frame itself. Indeed, it is known in the art that sequences regulating transcription may be situated either upstream (e.g. in the promoter region) or downstream (e.g. in the 3'UTR) of the gene they regulate in vivo, and may be located in the immediate vicinity of the gene or further away. Regulatory elements as disclosed herein encompass naturally occurring sequences, as well as variants thereof and combinations of such regulatory elements or several copies of such a regulatory element, i.e. regulatory elements comprising non-naturally occurring sequences. Regulatory elements as disclosed herein may comprise part of a larger sequence involved in transcriptional control, e.g. part of a promoter sequence. The regulatory elements of the present invention are non-naturally occurring sequences. The regulatory elements disclosed herein are provided as nucleic acid molecules. The term "nucleic acid" as used herein typically refers to an oligomer or polymer (preferably a linear polymer) of any length composed essentially of nucleotides. A nucleotide unit commonly includes a heterocyclic base, a sugar group, and at least one, e.g. one, two, or three, phosphate groups, including modified or substituted phosphate groups. Heterocyclic bases may include inter alia purine and pyrimidine bases such as adenine (A), guanine (G), cytosine (C), thymine (T) and uracil (U) which are widespread in naturally-occurring nucleic acids, other naturally-occurring bases (e.g., xanthine, inosine, hypoxanthine) as well as chemically or biochemically modified (e.g., methylated), non-natural or derivatised bases. Sugar groups may include inter alia pentose (pentofuranose) groups such as preferably ribose and/or 2-deoxyribose common in naturally-occurring nucleic acids, or arabinose, 2-deoxyarabinose, threose or hexose sugar groups, as well as modified or substituted sugar groups. Nucleic acids as intended herein may include naturally occurring nucleotides, modified nucleotides or mixtures thereof. A modified nucleotide may include a modified heterocyclic base, a modified sugar moiety, a modified phosphate group or a combination thereof. Modifications of phosphate groups or sugars may be introduced to improve stability, resistance to enzymatic degradation, or some other useful property. The term "nucleic acid" further preferably encompasses DNA, RNA and DNA/RNA hybrid molecules, specifically including hnRNA, pre-mRNA, mRNA, cDNA, genomic DNA, amplification products, oligonucleotides, and synthetic (e.g., chemically synthesised) DNA, RNA or DNA/RNA hybrids. A nucleic acid can be naturally occurring, e.g., present in or isolated from nature; or can be non-naturally occurring, e.g., recombinant, i.e., produced by recombinant DNA technology, and/or partly or entirely, chemically or biochemically synthesised. A "nucleic acid" can be double-stranded, partly double stranded, or single-stranded. Where single-stranded, the nucleic acid can be the sense strand or the antisense strand. In addition, nucleic acid can be circular or linear.

As used herein "transcription factor binding site", "transcription factor binding sequence" or "TFBS" refers to a sequence of a nucleic acid region to which transcription factors bind. Non-limiting examples of TFBS include binding sites for transcription factor 3, also known asTCF3 or E2A; binding sites for nuclear factor I, also known as NF1; binding sites for CCAAT-enhancer-binding protein, also known as C/EBP; binding sites for myogenic differentiation, also known as MyoD; binding sites for sterol regulatory element-binding protein, also known as SREBP; binding sites for leukemia/lymphoma-related factor, also known as LRF; binding sites for protein 53, also known as p53; binding sites for hepatocyte nuclear factor 3-alpha, also known as HNF3a; binding sites for hepatocyte nuclear factor 3-beta, also known as HNF3b; binding sites for hepatocyte nuclear factor 4, also known as HNF4; binding sites for myocyte-specific enhancer factor 2A, also known as MEF2A or RSRFC4; binding sites for peroxisome proliferator-activated receptor gamma 1/2, also known as PPAR-gammal/2; binding sites for serum response factor, also known as SRF; binding sites for transcription activator-like protein lb, also known asTall_b or Tail-beta; binding sites for enhancer of zeste homolog 2, also known as EZH2; binding sites for Polycom Repressive Complex 2 Subunit, also known as SUZ12; binding sites for TATA-binding protein, also known as TBP; binding sites for folate receptor alpha, also known as FOLR2A; binding sites for RE-1 silencing transcription factor, also known as REST; binding sites for TEA domain transcription factor 4, also known as TEAD4; transcription factor binding sites for Retinablastoma-Binding Protein 5, also known as RBBP5; transcription factor binding sites for Msh Flomeobox 1, also known as Msx-1; transcription factor binding sites for SIN3 Transcription Regulator Family Member A, also known as SIN3A; transcription factor binding sites for lunD, also known as 1UND; transcription factor binding sites for Zinc Finger ZZ-Type Containing 3, also known as ZZZ3; transcription factor binding sites for Zinc Finger E-Box Binding Flomeobox 1, also known as AREB6; transcription factor binding sites for transcription factor E2F; transcription factor binding sites for Transcription Factor 4, also known as ITF2; transcription factor binding sites for estrogen receptor alpha, also known as ER-alpha; Myb-related protein B, also known as MYBL2; forkhead box A1 also known as FOXA1; forkhead box A2, also known as FOXA2; Transcription initiation factor TFIID subunit 7, also known as TAF7; SIN3AK20; MAX transcription factor, also known as MAX; Zinc finger and BTB domain-containing protein 7A, also known as ZBTB7A. Transcription factor binding sites may be found in databases such as Transfac®.

As used herein, the terms "identity" and "identical" and the like refer to the sequence similarity between two polymeric molecules, e.g., between two nucleic acid molecules, e.g., two DNA molecules. Sequence alignments and determination of sequence identity can be done, e.g., using the Basic Local Alignment Search Tool (BLAST) originally described by Altschul et al. 1990 (1 Mol Biol 215: 403-10), such as the "Blast 2 sequences" algorithm described by Tatusova and Madden 1999 (FEMS Microbiol Lett 174: 247-250). Typically, the percentage sequence identity is calculated over the entire length of the sequence. As used herein, the term "substantially identical" denotes at least 90%, preferably at least 95%, such as 95%, 96%, 97%, 98% or 99%, sequence identity.

The term "functional fragment" as used in the application with respect to the nucleic acid regulatory elements disclosed herein refers to fragments of said regulatory element sequences that retain the capability of regulating muscle-specific expression, i.e. they can still confer tissue specificity and they are capable of regulating expression of a (trans)gene in the same way (although possibly not to the same extent) as the sequence from which they are derived. Functional fragments may preferably comprise at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 120, at least 150, at least 200, at least 250, at least 300, at least 350, at least 400 or at least 450 contiguous nucleotides from the sequence from which they are derived. Also preferably, functional fragments may comprise at least 1, more preferably at least 2, at least 3, or at least 4, even more preferably at least 5, at least 10, or at least 15, of the transcription factor binding sites (TFBS) that are present in the sequence from which they are derived. Functional fragments as defined herein preferably have at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% or about 100% of the nucleic acids regulatory capacities of the regulatory element wherefrom they are derived.

A "cardiac and skeletal muscle-specific regulatory element" or "heart and skeletal muscle-specific regulatory element" as used herein refers to a regulatory element that is capable of enhancing cardiac and skeletal muscle-specific expression. Non-limiting examples of cardiac and skeletal muscle-specific regulatory elements are the regulatory elements denoted as "CSk-SH" in WO 2015/110449. In particular, a cardiac and skeletal muscle-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to SEQ ID NO:l, preferably the nucleotide sequence set forth in SEQ ID NO:l, or a functional fragment thereof is used herein. In embodiments, the cardiac and skeletal muscle-specific regulatory element consisting essentially of consisting of the nucleotide sequence set forth in SEQ ID NO:l, or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:l, also denoted herein as "CSk-SH5" or "CSkSH5" or "Csk-SH5" or "CskSH5" or "CSK-SH5" or "CSKSH5", is used.

As used herein "cardiac and skeletal muscle-specific expression" refers to the preferential or predominant expression of a (trans)gene in heart, in particular heart muscle, and skeletal muscle. At least 50%, more particularly at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99% or 100% of the (trans)gene expression may occur within heart and skeletal muscle. Less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, less than 2% or even less than 1% of the (trans)gene expression may occur in an organ or tissue other than heart and skeletal muscle. Throughout the application, where cardiac and skeletal muscle-specific is mentioned in the context of expression, cardiomyocyte and skeletal myocyte-specific expression, cardiac myoblast and skeletal myoblast-specific expression or cardiac and muscle stem/progenitor cell or satellite cell-specific expression are also explicitly envisaged.

As used herein, the terms "heart muscle" or "cardiac muscle" refer to the autonomically regulated, striated muscle type found in the heart. As used herein, the term "skeletal muscle" refers to the voluntarily controlled, striated muscle type that is attached to the skeleton. Non-limiting examples of skeletal muscle include the biceps, the triceps, the quadriceps, the tibialis interior, and the gastrocnemius muscle.

The term "myocyte," as used herein, refers to a cell that has been differentiated from a progenitor muscle stem/progenitor cell, satellite cell or myoblast such that it is capable of expressing muscle- specific phenotype under appropriate conditions. Terminally differentiated myocytes fuse with one another to form myotubes, a major constituent of muscle fibers. The term "myocyte" also refers to myocytes that are de-differentiated. The term includes cells in vivo and cells cultured ex vivo regardless of whether such cells are primary or passaged.

The terms "muscle stem/progenitor cell", "satellite cell" or "myoblast" as used herein, refer to an embryonic cell in the mesoderm that differentiates to give rise to a muscle cell or myocyte. The term includes cells in vivo and cells cultured ex vivo regardless of whether such cells are primary or passaged.

A "diaphragm-specific regulatory element" as used herein refers to a regulatory element that is capable of enhancing diaphragm and skeletal muscle-specific expression. Non-limiting examples of diaphragm-specific regulatory elements are the regulatory elements denoted as Dph-CREOl to Dph- CRE065 in WO 2018/178067. In particular, a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to SEQ ID NO:2, preferably the nucleotide sequence set forth in SEQ ID NO:2, or a functional fragment thereof; a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to SEQ ID NO:3, preferably the nucleotide sequence set forth in SEQ ID NO:3, or a functional fragment thereof; and/or a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to SEQ ID NO:4, preferably the nucleotide sequence set forth in SEQ ID NO:4, or a functional fragment thereof, is used herein. In embodiments, the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:2 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:2, also denoted herein as "Dph-CRE02" or "CRE-02" or "CRE02", the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:3 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:3, also denoted herein as "Dph-CRE04" or "CRE-04" or "CRE04", and/or the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID N04 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:4, also denoted herein as "Dph-CRE06" or "CRE-06" or "CRE06", is/are used. Table 1: Nucleic acid regulatory elements

In embodiments, the diaphragm-specific regulatory element is a combination of at least 2 diaphragm-specific regulatory elements selected from the group consisting of the diaphragm- specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:2 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:2 (Dph-CRE02), or a functional fragment thereof; the diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:3 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:3 (Dph-CRE04), or a functional fragment thereof; and the diaphragm- specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:4 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:4 (Dph-CRE06), or a functional fragment thereof. More particularly, a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:5 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:5 (Dph- CRE02-CRE04), a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:6 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:6 (Dph-CRE02-CRE06), a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:7 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:7 (Dph-CRE04-CRE06), a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:8 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:8 (Dph-CRE06-CRE04) or a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:9 or a sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, such as 95%, 96%, 97%, 98%, or 99%, identity to SEQ ID NO:9 (Dph-CRE02-CRE04-CRE06), is used herein.

"Diaphragm and skeletal muscle-specific expression" as used herein, refers to the preferential or predominant expression of a (trans)gene (as RNA and/or polypeptide) in diaphragm and/or skeletal muscle cells or diaphragm and/or skeletal muscle tissue, as compared to other (i.e. non-diaphragm and skeletal muscle) cells or tissue. At least 50%, more particularly at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99% or 100% of the (trans)gene expression may occur within diaphragm and/or skeletal muscle cells or tissue. Diaphragm and skeletal muscle specific expression may entail that there is less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, less than 2% or even less than 1% 'leakage' of expressed gene product to other organs or tissue than diaphragm and skeletal muscle, such as lung, liver, brain, kidney and/or spleen.

"Muscle-specific expression" as used herein, refers to the preferential or predominant expression of a (trans)gene (as RNA and/or polypeptide) in muscle cells, as compared to other (i.e. non- muscle) cells or tissue. At least 50%, more particularly at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99% or 100% of the (trans)gene expression may occur within muscle cells or tissue. Muscle specific expression may entail that there is less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, less than 2% or even less than 1% 'leakage' of expressed gene product to other organs or tissue than muscle, such as lung, liver, brain, kidney and/or spleen. "Smooth muscle-specific expression" as used in the application, refers to the preferential or predominant expression of a (trans)gene (as RNA and/or polypeptide) in smooth muscle cells and tissue, as compared to other (i.e. non-smooth muscle) cells or tissues. According to particular embodiments, at least 50%, more particularly at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99% or 100% of the (trans)gene expression occurs within smooth muscle cells or tissue. According to a particular embodiment, smooth muscle specific expression entails that there is less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, less than 2% or even less than 1% "leakage" of expressed gene product to other organs or tissue than smooth muscle.

The nucleic acid regulatory elements of the present invention comprise an artificial sequence by combining two or more diaphragm-specific regulatory elements disclosed herein and a cardiac and skeletal muscle-specific regulatory element disclosed herein. As shown in the experimental section, the combination of two diaphragm-specific regulatory elements disclosed herein and a cardiac and skeletal muscle-specific regulatory element as disclosed herein led to robust transgene expression in diaphragm, smooth muscle, heart and skeletal muscle, more particularly in diaphragm, heart and skeletal muscle. More particularly, the two diaphragm-specific regulatory elements and the cardiac and skeletal muscle-specific regulatory element act synergistically as a new regulatory element for enhancing diaphragm, smooth muscle, cardiac and skeletal muscle-specific transgene expression, more particularly for enhancing diaphragm, cardiac and skeletal muscle-specific transgene expression. It has further been shown that the addition of a third diaphragm-specific regulatory element disclosed herein to this combination could further increase diaphragm, cardiac, smooth muscle and skeletal muscle-specific transgene expression, more particularly diaphragm, cardiac and skeletal muscle-specific transgene expression, in a synergistic way.

As used herein "diaphragm, cardiac, smooth muscle and skeletal muscle-specific expression" refers to the preferential or predominant expression of a (trans)gene in diaphragm, smooth muscle, heart and/or skeletal muscle cells or tissue. According to particular embodiments, at least 50%, more particularly at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99% or 100% of the (trans)gene expression occurs within diaphragm, smooth muscle, heart and/or skeletal muscle cells and tissue. Thus, according to particular embodiments, less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, less than 2% or even less than 1% of the (trans)gene expression occurs in an organ or tissue other than diaphragm, smooth muscle, heart and skeletal muscle. As used herein "diaphragm, cardiac and skeletal muscle-specific expression" refers to the preferential or predominant expression of a (trans)gene in diaphragm, heart and/or skeletal muscle cells or tissue. According to particular embodiments, at least 50%, more particularly at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99% or 100% of the (trans)gene expression occurs within diaphragm, heart and/or skeletal muscle cells and tissue. Thus, according to particular embodiments, less than 50%, less than 40%, less than 30%, less than 20%, less than 10%, less than 5%, less than 2% or even less than 1% of the (trans)gene expression occurs in an organ or tissue other than diaphragm, heart and skeletal muscle.

In embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle- specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle- specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression comprising, consisting essentially of, or consisting of a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02), or a functional fragment thereof, a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:3 (e.g. Dph-CRE04), or a functional fragment thereof, and a heart and skeletal muscle-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5), or a functional fragment thereof. In particular embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle-specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle-specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression, comprising, consisting essentially of, or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph-CRE02), or a functional fragment thereof, the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:3 (Dph-CRE04), or a functional fragment thereof, and the heart and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof. In further embodiments, the nucleic acid regulatory element further comprises a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:4 (e.g. Dph-CRE06), or a functional fragment thereof, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph-CRE06), or a functional fragment thereof.

In embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle- specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle- specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression, comprising, consisting essentially of, or consisting of a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02), or a functional fragment thereof; a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:4 (e.g. Dph-CRE06), or a functional fragment thereof; and a heart and skeletal muscle-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5), or a functional fragment thereof. In particular embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle-specific gene expression, in particular diaphragm, smooth muscle cardiac and skeletal muscle-specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression, comprising, consisting essentially of, or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph-CRE02), or a functional fragment thereof; the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph-CRE06), or a functional fragment thereof; and the heart and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5), or a functional fragment thereof. In further embodiments, the nucleic acid regulatory element further comprises a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:3 (e.g. Dph-CRE04), or a functional fragment thereof, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:3 (Dph-CRE04) or a functional fragment thereof.

In embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle- specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle- specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression, comprising, consisting essentially of, or consisting of a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:3 (e.g. Dph-CRE04), or a functional fragment thereof; a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:4 (e.g. Dph-CRE06), or a functional fragment thereof; and a heart and skeletal muscle-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5), or a functional fragment thereof. In particular embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle-specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle-specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression, comprising, consisting essentially of, or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID N03 (Dph-CRE04), or a functional fragment thereof; the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph-CRE06), or a functional fragment thereof; and the heart and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5), or a functional fragment thereof. In further embodiments, the nucleic acid regulatory element further comprises a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02), or a functional fragment thereof, preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph-CRE02) or a functional fragment thereof.

The at least two diaphragm-specific regulatory elements and the cardiac and skeletal muscle- specific regulatory element can be combined in any order in the nucleic acid regulatory elements disclosed herein. The at least two diaphragm-specific regulatory elements and the cardiac and skeletal muscle-specific regulatory element can be combined in tandem or the regulatory elements can be combined with one or more intervening or flanking nucleotides (e.g. nucleotides used for cloning purposes) between one or more of the regulatory elements.

In embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:5 (e.g. Dph-CRE02-CRE04) and a heart and skeletal muscle-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5) or a functional fragment thereof. In particular embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle-specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle-specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression, comprising, consisting essentially of, or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:5 (Dph-CRE02-CRE04) and the heart and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof. In further embodiments, the nucleic acid regulatory element further comprises a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:4 (e.g. Dph-CRE06), preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph- CRE06), or a functional fragment thereof.

In embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:6 (e.g. Dph-CRE02-CRE06) and a heart and skeletal muscle-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5) or a functional fragment thereof. In particular embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle-specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle-specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression, comprising, consisting essentially of, or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:6 (Dph-CRE02-CRE06) and the heart and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof. In further embodiments, the nucleic acid regulatory element further comprises a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:3 (e.g. Dph-CRE04), preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:3 (Dph- CRE04), or a functional fragment thereof. In embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:7 (e.g. Dph-CRE04-CRE06) and a heart and skeletal muscle-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5) or a functional fragment thereof. In particular embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle-specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle-specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression comprising, consisting essentially of, or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:7 (Dph-CRE04-CRE06) and the heart and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof. In further embodiments, the nucleic acid regulatory element further comprises a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02), preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph- CRE02), or a functional fragment thereof. In specific embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:9 (e.g. Dph-CRE02-CRE04-CRE06) and a heart and skeletal muscle-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5) or a functional fragment thereof. In particular embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle-specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle-specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression, comprising, consisting essentially of, or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:9 (Dph-CRE02-CRE04-CRE06) and the heart and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof.

In embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:8 (e.g. Dph-CRE06-CRE04) and a heart and skeletal muscle-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:l (e.g. CSk-SH5) or a functional fragment thereof. In particular embodiments, the invention relates to a nucleic acid regulatory element for enhancing muscle-specific gene expression, in particular diaphragm, smooth muscle, cardiac and skeletal muscle-specific gene expression, more particularly diaphragm, cardiac and skeletal muscle-specific gene expression, comprising, consisting essentially of, or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:8 (Dph-CRE06-CRE04) and the heart and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof. In further embodiments, the nucleic acid regulatory element further comprises a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of a nucleotide sequence having at least 95% identity to the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02), preferably the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph- CRE02), or a functional fragment thereof.

In specific embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of the nucleotide sequence set forth in SEQ ID NO:10 (Dph-CRE02 - Dph-CRE04- CSk-SH5). In further embodiments, the nucleic acid regulatory element further comprises a diaphragm- specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:4 (e.g. Dph-CRE06), preferably the diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph-CRE06), or a functional fragment thereof. In specific embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of the nucleotide sequence set forth in SEQ ID NO:ll (Dph-CRE02 - Dph- CRE06 - CSk-SH5). In further embodiments, the nucleic acid regulatory element further comprises a diaphragm-specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:3 (e.g. Dph-CRE04), preferably the diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:3 (Dph-CRE04), or a functional fragment thereof.

In specific embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of the nucleotide sequence set forth in SEQ ID NO:12 (Dph-CRE04 - Dph-CRE06 - CSk-SH5). In other specific embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of the nucleotide sequence set forth in SEQ ID NO:13 (Dph-CRE06 - Dph-CRE04 - CSk- SH5). In further embodiments, the nucleic acid regulatory element further comprises a diaphragm- specific regulatory element comprising, consisting essentially of, or consisting of the nucleotide sequence set forth in SEQ ID NO:2 (e.g. Dph-CRE02), preferably the diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph-CRE02), or a functional fragment thereof. In specific embodiments, the nucleic acid regulatory element comprises, consists essentially of, or consists of the nucleotide sequence set forth in SEQ ID NO:14 (Dph-CRE02 - Dph- CRE04 - Dph-CRE06 - CSk-SH5).

The nucleic acid regulatory elements disclosed herein may be used in a nucleic acid expression cassette. Accordingly, in an aspect the invention provides for the use of the nucleic acid regulatory elements as described herein in a nucleic acid expression cassette.

In an aspect the invention provides a nucleic acid expression cassette comprising a nucleic acid regulatory element as described herein, operably linked to a promoter. In embodiments, the nucleic acid expression cassette does not contain a transgene. Such nucleic acid expression cassette may be used to drive expression of an endogenous gene. In preferred embodiments, the nucleic acid expression cassette comprises a nucleic acid regulatory element as described herein, operably linked to a promoter and a transgene.

As used herein, the term 'nucleic acid expression cassette' refers to nucleic acid molecules that include one or more transcriptional control elements (such as, but not limited to promoters, enhancers and/or regulatory elements, polyadenylation sequences, and introns) that direct (trans)gene expression in one or more desired cell types, tissues or organs. Typically, they will also contain a transgene, although it is also envisaged that a nucleic acid expression cassette directs expression of an endogenous gene in a cell into which the nucleic acid cassette is inserted.

The term 'operably linked' as used herein refers to the arrangement of various nucleic acid molecule elements relative to each such that the elements are functionally connected and are able to interact with each other. Such elements may include, without limitation, a promoter, an enhancer and/or a regulatory element, a polyadenylation sequence, one or more introns and/or exons, and a coding sequence of a gene of interest to be expressed (i.e., the transgene). The nucleic acid sequence elements, when properly oriented or operably linked, act together to modulate the activity of one another, and ultimately may affect the level of expression of the transgene. By modulate is meant increasing, decreasing, or maintaining the level of activity of a particular element. The position of each element relative to other elements may be expressed in terms of the 5' terminus and the 3' terminus of each element, and the distance between any particular elements may be referenced by the number of intervening nucleotides, or base pairs, between the elements. As understood by the skilled person, operably linked implies functional activity, and is not necessarily related to a natural positional link. Indeed, when used in nucleic acid expression cassettes, the regulatory elements will typically be located immediately upstream of the promoter (although this is generally the case, it should definitely not be interpreted as a limitation or exclusion of positions within the nucleic acid expression cassette), but this needs not be the case in vivo. E.g., a regulatory element sequence naturally occurring downstream of a gene whose transcription it affects is able to function in the same way when located upstream of the promoter. Hence, according to a specific embodiment, the regulatory or enhancing effect of the regulatory element is position-independent.

As used in the application, the term 'promoter' refers to nucleic acid sequences that regulate, either directly or indirectly, the transcription of corresponding nucleic acid coding sequences to which they are operably linked (e.g. a transgene or endogenous gene). A promoter may function alone to regulate transcription or may act in concert with one or more other regulatory sequences (e.g. enhancers or silencers, or regulatory elements). In the context of the present application, a promoter is typically operably linked to a regulatory element as disclosed herein to regulate transcription of a (trans)gene. When a regulatory element as described herein is operably linked to both a promoter and a transgene, the regulatory element can (1) confer a significant degree of muscle-specific, in particular diaphragm, smooth muscle, cardiac and skeletal muscle-specific more particularly diaphragm, cardiac and skeletal muscle-specific expression in vivo (and/or in myoblasts, myocytes, or muscle-derived cell lines, in particular cardiac and skeletal myoblasts, cardiac and skeletal myocytes, or cardiac and skeletal muscle-derived cell lines in vitro) of the transgene, and/or (2) can increase the level of expression of the transgene in muscle, in particular diaphragm, smooth muscle, cardiac and skeletal muscle, more particularly diaphragm, cardiac and skeletal muscle (and/or in myoblasts, myocytes, or muscle-derived cell lines, in particular smooth, cardiac and skeletal myoblasts, smooth, cardiac and skeletal myocytes, or smooth, cardiac and skeletal muscle- derived cell lines in vitro, more particularly cardiac and skeletal myoblasts, cardiac and skeletal myocytes, or cardiac and skeletal muscle-derived cell lines in vitro).

The promoter may be homologous (i.e. from the same species as the animal, in particular mammal, to be transfected with the nucleic acid expression cassette) or heterologous (i.e. from a source other than the species of the animal, in particular mammal, to be transfected with the expression cassette). As such, the source of the promoter may be any virus (e.g. CMV or cytomegalovirus), any unicellular prokaryotic or eukaryotic organism, any vertebrate or invertebrate organism, or any plant, or may even be a synthetic promoter (i.e. having a non-naturally occurring sequence), provided that the promoter is functional in combination with the regulatory elements described herein. In preferred embodiments, the promoter is a mammalian promoter, in particular a murine or human promoter. The promoter may be an inducible or constitutive promoter.

The enrichment in muscle-specific TFBS in the nucleic acid regulatory elements disclosed herein in principle allows the regulatory elements to direct muscle-specific expression even from a promoter that itself is not muscle-specific (e.g. CAG promoter, CMV promoter). Hence, the regulatory elements disclosed herein can be used in nucleic acid expression cassettes in conjunction with any promoter, in particular the promoter may either be tissue-specific, e.g. muscle-specific, or ubiquitously expressed. Non-limiting examples of ubiquitously expressed promoters include polymerase II (pol II) promoters, polymerase III (pol III) promoters (e.g. U6) and chimeric pol III promoters. These promoters may be suitable for expressing e.g. non-coding RNAs. Preferably, the nucleic acid expression cassettes disclosed herein comprise a muscle-specific promoter, in particular a diaphragm, smooth muscle, heart, and/or skeletal muscle-specific promoter, more particularly a diaphragm, heart, and/or skeletal muscle-specific promoter, in order to increase muscle-specificity, in particular diaphragm, smooth muscle, heart, and/or skeletal muscle- specificity, more particularly diaphragm, heart, and/or skeletal muscle-specificity, and/or reduce leakage of expression in other tissues.

Non-limiting examples of muscle-specific promoters include the desmin (DES) promoter; the synthetic SPc5-12 promoter (SPc5-12); the alpha-actinl promoter (ACTA1); the Creatine kinase, muscle (CKM) promoter; the Four and a half LIM domains protein 1 (FHL1) promoter; the alpha 2 actinin (ACTN2) promoter; the filamin-C (FLNC) promoter; the sarcoplasmic/endoplasmic reticulum calcium ATPase 1 (ATP2A1) promoter; the Troponin I type 1 (TNNI1) promoter; the Troponin I type 2 (TNNI2) promoter; the Troponin T type 1 (TNNT1) promoter; the Troponin T type 2 (TNNT2) promoter; the Troponin T type 3 (TNNT3) promoter; the myosin-1 (MYH1) promoter; the myosin-2 (MYH2) promoter; the sarcolipin (SLN) promoter; the Myosin Binding Protein Cl (MYBPC1) promoter; the enolase (EN03) promoter; the Carbonic Anhydrase 3 (CA3) promoter; the phosphorylatable, fast skeletal muscle myosin light chain (MYLPF) promoter; the Tropomyosin 1 (TPM1) promoter; the Tropomyosin 2 (TPM2) promoter; the alpha-3 chain tropomyosin (TPM3) promoter; the ankyrin repeat domain-containing protein 2 (ANKRD2) promoter; the myosin heavy- chain (MHC) promoter; the alpha myosin heavy chain promoter (otMHC) promoter; the myosin light-chain (MLC) promoter; the muscle creatine kinase (MCK) promoter; the Myosin, Light Chain 1 (MYL1) promoter; the Myosin, Light Chain 2 (MYL2) promoter; the Myoglobin (MB) promoter; the Troponin C type 1 (TNNC1) promoter; the Troponin C Type 2 (TNNC2) promoter; the Titin-Cap (TCAP) promoter; the Myosin, Heavy Chain 7 (MYH7) promoter; the Aldolase A (ALDOA) promoter; the myosin heavy chain 11 (Myhll) promoter; the transgelin (Tagln) promoter (also known as SM22a promoter); the actin alpha 2, smooth muscle (Acta2) promoter; synthetic promoters as described in Li et al. (1999, Nat Biotechnol. 17:241-245), such as the SPc5-12 promoter, the dMCK promoter and the tMCK promoter consisting of respectively, a double or triple tandem of the MCK enhancer to the MCK basal promoter as described in Wang et al. (2008, Gene Ther, 15:1489-1499), and the MHCK7 promoter. The MHCK7 promoter is a synthetic skeletal and cardiac muscle-specific promoter and has been described in Salva et al. (2007. Mol Ther 15: 320-9). In preferred embodiments, the promoter is a muscle-specific promoter selected from the group consisting of: the SPc5-12 promoter, the DES promoter and the MHCK7 promoter.

In particularly preferred embodiments, the promoter is a mammalian muscle-specific promoter, in particular a murine or human muscle-specific promoter.

In embodiments, the promoter is the synthetic SPc5-12 promoter. The SPc5-12 promoter is a synthetic muscle-specific promoter and has been described in Li et al. (1999. Nat Biotechnol. 17:241-245). In embodiments, the promoter is the SPc5-12 promoter as defined by SEQ ID NO:15. In embodiments, the promoter is the MHCK7 promoter, preferably the MHCK7 promoter as defined by SEQ ID NO:56. In embodiments, the promoter is the desmin promoter, preferably the desmin promoter as defined by SEQ ID NO:57.

Furthermore, the promoter does not need to be the promoter of the transgene in the nucleic acid expression cassette, although it is possible that the transgene is transcribed from its own promoter.

To minimize the length of the nucleic acid expression cassette, the regulatory elements may be linked to minimal promoters, or shortened versions of the promoters described herein. A 'minimal promoter' (also referred to as basal promoter or core promoter) as used herein is part of a full-size promoter still capable of driving expression, but lacking at least part of the sequence that contributes to regulating (e.g. tissue-specific) expression. This definition covers both promoters from which (tissue-specific) regulatory elements have been deleted- that are capable of driving expression of a gene but have lost their ability to express that gene in a tissue-specific fashion and promoters from which (tissue-specific) regulatory elements have been deleted that are capable of driving (possibly decreased) expression of a gene but have not necessarily lost their ability to express that gene in a tissue-specific fashion.

The term 'transgene' as used herein refers to particular nucleic acid sequences encoding a polypeptide or a portion of a polypeptide to be expressed in a cell into which the nucleic acid sequence is introduced. However, it is also possible that transgenes are expressed as RNA, typically to control (e.g. lower) the amount of a particular polypeptide in a cell into which the nucleic acid sequence is inserted. These RNA molecules include but are not limited to molecules that exert their function through RNA interference (shRNA, RNAi), micro-RNA regulation (miR) (which can be used to control expression of specific genes), non-coding RNA (ncRNA), long non-coding RNA (IncRNA), guide RNA (gRNA), catalytic RNA, antisense RNA, RNA aptamers, etc. How the nucleic acid sequence is introduced into a cell is not essential to the invention, it may for instance be through integration in the genome or as an episomal plasmid. Of note, expression of the transgene may be restricted to a subset of the cells into which the nucleic acid sequence is introduced. The term 'transgene' is meant to include (1) a nucleic acid sequence that is not naturally found in the cell (i.e., a heterologous nucleic acid sequence); (2) a nucleic acid sequence that is a mutant form of a nucleic acid sequence naturally found in the cell into which it has been introduced; (3) a nucleic acid sequence that serves to add additional copies of the same (i.e., homologous) or a similar nucleic acid sequence naturally occurring in the cell into which it has been introduced ; or (4) a silent naturally occurring or homologous nucleic acid sequence whose expression is induced in the cell into which it has been introduced.

The transgene may be homologous or heterologous to the promoter (and/or to the animal, in particular mammal, in which it is introduced, e.g. in cases where the nucleic acid expression cassette is used for gene therapy).

The transgene may be a full length cDNA or genomic DNA sequence, or any fragment, subunit or mutant thereof that has at least some biological activity. In particular, the transgene may be a minigene, i.e. a gene sequence lacking part, most or all of its intronic sequences. The transgene thus optionally may contain intron sequences. Optionally, the transgene may be a hybrid nucleic acid sequence, i.e., one constructed from homologous and/or heterologous cDNA and/or genomic DNA fragments. By 'mutant form' is meant a nucleic acid sequence that contains one or more nucleotides that are different from the wild-type or naturally occurring sequence, i.e., the mutant nucleic acid sequence contains one or more nucleotide substitutions, deletions, and/or insertions. The nucleotide substitution, deletion, and/or insertion can give rise to a gene product (i.e. e., protein or nucleic acid) that is different in its amino acid/nucleic acid sequence from the wild type amino acid/nucleic acid sequence. Preparation of such mutants is well known in the art. In some cases, the transgene may also include a sequence encoding a leader peptide or signal sequence such that the transgene product will be secreted from the cell.

In particular embodiments, the transgene is codon-optimized. As used herein the terms "codon- optimization", "codon-optimized" and the like refer to changes in the codon composition of a nucleic acid sequence, in particular a transgene, without altering the amino acid sequence, e.g. for optimal expression in a host cell or organism. As shown in the experimental section, codon- optimization of the transgene can further enhance muscle-specific, in particular diaphragm, smooth muscle, heart and skeletal muscle-specific, more particularly diaphragm, heart and skeletal muscle- specific expression of the transgene.

The transgene that may be contained in the nucleic acid expression cassettes described herein typically encodes a gene product such as RNA or a polypeptide (protein). The transgene may also include members of the CRISPR/Cas system, such as Cas and/or one or more gRNAs.

In embodiments, the transgene encodes a therapeutic protein.

The therapeutic protein may be a secretable protein or a non-secreted protein. Non-limiting examples of secretable proteins, in particular secretable therapeutic proteins, include clotting factors, such as factor VIII or factor IX, insulin, erythropoietin, lipoprotein lipase, antibodies or nanobodies, growth factors, angiogenic factors, cytokines, chemokines, plasma factors, etc. Non limiting examples of non-secreted proteins include metabolic enzymes (e.g. tafazzin), lysosomal proteins, nuclear proteins, etc. The therapeutic protein may also be a structural protein. Non limiting examples of structural proteins, in particular structural therapeutic proteins, include dystrophin and sarcoglycans.

A non-exhaustive and non-limiting list of transgenes envisaged in the application includes transgenes encoding angiogenic factors for therapeutic angiogenesis (e.g. VEGF, PIGF, or guidance molecules such as ephrins, semaphorins, Slits and netrins or their cognate receptors); transgenes encoding clotting factors (e.g. factor VIII or factor IX), transgenes encoding insulin, transgenes encodinglipoprotein lipase, transgenes encoding plasma factors, transgenes encoding cytokines, chemokines and/or growth factors (e.g. erythropoietin (EPO), interferon-a, interferon-b, interferon-y, interleukin 1 (IL-1), interleukin 2 (IL-2), interleukin 3 (IL-3), interleukin 4 (IL-4), interleukin 5 (IL-5), interleukin 6 (IL-6), interleukin 7 (IL-7), interleukin 8 (IL-8), interleukin 9 (IL-9), interleukin 10 (IL-10), interleukin 11 (IL-11), interleukin 12 (IL-12), chemokine (C-X-C motif) ligand 5 (CXCL5), granulocyte-colony stimulating factor (G-CSF), granulocyte-macrophage colony stimulating factor (GM-CSF), macrophage colony stimulating factor (M-CSF), stem cell factor (SCF), keratinocyte growth factor (KGF), monocyte chemoattractant protein-1 (MCP-1) and tumor necrosis factor (TNF)); transgenes encoding proteins involved in calcium handling (e.g. Sarco/Endoplasmic Reticulum Ca2+-ATPase (SERCA), phospholamban, calsequestrin, sodium- calcium exchanger, L-type calcium's channel, ryanodine receptors), transgenes encoding calcineurin; transgenes encoding microdystrophin; transgenes encoding follistatin (FST); transgenes encoding myotubularin 1 (MTM1); transgenes encoding dysferlin; transgenes encoding dystrophin; transgenes encoding metabolic enzymes: transgenes encoding nuclear proteins; transgenes encoding mitochondrial proteins (e.g. tafazzin); transgenes encoding lysosomal proteins (e.g. acid a-glucosidase (GAA) (as a secreted or native form), alpha-galactosidase A, LAMP2); transgenes encoding ion channels (e.g. SCN5A); transgenes encoding enzymes involved in glycogen metabolism (e.g. Glycogen synthase (GYS2), Glycogen debranching enzyme (AGL), Glycogen branching enzyme (GBE1), Muscle glycogen phosphorylase (PYGM), Muscle phosphofructokinase (PKFM), Phosphoglycerate mutase (PGAM2), Aldolase A (ALDOA), b-enolase (EN03) or Glycogenin-1 (GYG1)); transgenes encoding enzymes deficient in mucopolysaccharidosis (e.g. a-L-iduronidase, Iduronate sulfatase, Heparan sulfamidase, N-acetylglucosaminidase, Heparan-a-glucosaminide N- acetyltransferase, N-acetylglucosamine 6-sulfatase, Galactose-6-sulfate sulfatase, b-galactosidase, N-acetylgalactosamine-4-sulfatase, b-glucuronidase or Hyaluronidase); transgenes encoding sarcoglycan (e.g. alpha-sarcoglycan, beta-sarcoglycan and gamma-sarcoglycan); transgenes encoding anoctamin 5; transgenes encoding calpain 3; transgenes encoding antibodies, transgenes encoding nanobodies, transgenes encoding anti-viral dominant-negative proteins; and transgenes fragments, subunits or mutants thereof.

In particular embodiments, the transgene encodes a sarcoglycan, preferably b-sarcoglycan, more preferably human b-sarcoglycan such as the transgene defined by SEQ ID NO:18. In further embodiments, the transgene encoding human b-sarcoglycan is codon-optimized, such as the transgene defined by SEQ ID NO:19.

In particular embodiments, the transgene encodes a lysosomal protein, preferably the transgene encodes a lysosomal protein selected from the group consisting of acid a-glucosidase (GAA) (e.g. GAA as a secreted or native form), alpha-galactosidase A and LAMP2. In further particular embodiments, the transgene encodes acid a-glucosidase (GAA), preferably human GAA such as the transgene defined by SEQ ID NO:58. In further embodiments, the transgene encoding human GAA is codon-optimized, such as the transgene defined by SEQ ID NO:59. The transgene may also be a reporter gene, i.e. the transgene encodes a reporter such as a luciferase enzyme. In particular embodiments, the transgene encodes a luciferase, e.g. the transgene may have the nucleotide sequence set forth in SEQ ID NO:20.

The transgene may also encode an immunogenic protein. Non-limiting examples of immunogenic proteins include epitopes and antigens derived from a pathogen.

As used herein, the term "immunogenic" refers to a substance or composition capable of eliciting an immune response. Other sequences may be incorporated in the nucleic acid expression cassette disclosed herein as well, typically to further increase or stabilize the expression of the transgene product (e.g. introns and/or polyadenylation sequences).

Any intron can be utilized in the expression cassettes described herein. The term "intron" encompasses any portion of a whole intron that is large enough to be recognized and spliced by the nuclear splicing apparatus. Typically, short, functional, intron sequences are preferred in order to keep the size of the expression cassette as small as possible which facilitates the construction and manipulation of the expression cassette. In some embodiments, the intron is obtained from a gene that encodes the protein that is encoded by the coding sequence within the expression cassette. The intron can be located 5' to the coding sequence, 3' to the coding sequence, or within the coding sequence. An advantage of locating the intron 5' to the coding sequence is to minimize the chance of the intron interfering with the function of the polyadenylation signal. In embodiments, the nucleic acid expression cassette disclosed herein further comprises an intron. Non-limiting examples of suitable introns are Minute Virus of Mice (MVM) intron, beta-globin intron (betalVS- II), factor IX (FIX) intron A, Simian virus 40 (SV40) small-t intron, and beta-actin intron. Preferably, the intron is MVM intron (SEQ ID NO:16).

Any polyadenylation signal that directs the synthesis of a polyA tail is useful in the expression cassettes described herein, examples of those are well known to one of skill in the art. Exemplary polyadenylation signals include, but are not limited to, polyA sequences derived from the Simian virus 40 (SV40) late gene, the bovine growth hormone (BGH) polyadenylation signal, the minimal rabbit b-globin (mRBG) gene, and the synthetic polyA site (SPA) site as described in Levitt et al. (1989, Genes Dev 3:1019-1025).

Preferably, the polyadenylation signal is the polyadenylation signal defined by SEQ ID NO:17.

In particular embodiments, the invention provides a nucleic acid expression cassette comprising a nucleic acid regulatory element consisting essentially of or consisting of the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:3 (Dph-CRE04) or a functional fragment thereof, the diaphragm-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph-CRE06) or a functional fragment thereof, and the heart- and skeletal muscle-specific regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof, preferably a nucleic acid regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:12 (Dph-CRE04 - Dph-CRE06 - CSk-SH5), operably linked to a promoter, preferably the SPc5-12 promoter, and a transgene encoding human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19). In preferred embodiments, the transgene is codon-optimized. In embodiments, the nucleic acid expression cassette further comprises an MVM intron. In embodiments the nucleic acid expression cassette further comprises a polyadenylation signal, preferably a polyadenylation signal defined by SEQ ID NO:17.

In particular embodiments, the invention provides a nucleic acid expression cassette comprising a nucleic acid regulatory element consisting essentially of or consisting of the diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:2 (Dph-CRE02) or a functional fragment thereof, the diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:3 (Dph-CRE04) or a functional fragment thereof, the diaphragm-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:4 (Dph-CRE06) or a functional fragment thereof, and the heart- and skeletal muscle-specific regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:l (CSk-SH5) or a functional fragment thereof, preferably a nucleic acid regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:14 (Dph-CRE02 - Dph-CRE04 - Dph- CRE06-CSk-SH5), operably linked to a promoter, preferably the SPc5-12 promoter, and a transgene encoding human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon- optimized transgene set forth in SEQ ID NO:19). In preferred embodiments, the transgene is codon- optimized. In embodiments, the nucleic acid expression cassette further comprises an MVM intron. In embodiments the nucleic acid expression cassette further comprises a polyadenylation signal, preferably a polyadenylation signal defined by SEQ ID NO:17.

The nucleic acid regulatory element and the nucleic acid expression cassette disclosed herein may be used as such, or typically, they may be part of a nucleic acid vector. Accordingly, a further aspect relates to the use of a nucleic acid regulatory element as described herein or a nucleic acid expression cassette as described herein in a vector, in particular a nucleic acid vector.

In an aspect, the invention also provides a vector comprising a nucleic acid regulatory element as disclosed herein. In further embodiments, the vector comprises a nucleic acid expression cassette as disclosed herein.

The term 'vector' as used in the application refers to nucleic acid molecules, e.g. double-stranded DNA, which may have inserted into it another nucleic acid molecule (the insert nucleic acid molecule) such as, but not limited to, a cDNA molecule. The vector is used to transport the insert nucleic acid molecule into a suitable host cell. A vector may contain the necessary elements that permit transcribing the insert nucleic acid molecule, and, optionally, translating the transcript into a polypeptide. The insert nucleic acid molecule may be derived from the host cell, or may be derived from a different cell or organism. Once in the host cell, the vector can replicate independently of, or coincidental with, the host chromosomal DNA, and several copies of the vector and its inserted nucleic acid molecule may be generated. The vectors can be episomal vectors (i.e., that do not integrate into the genome of a host cell), or can be vectors that integrate into the host cell genome. The term 'vector' may thus also be defined as a gene delivery vehicle that facilitates gene transfer into a target cell. This definition includes both non-viral and viral vectors. Non-viral vectors include but are not limited to cationic lipids, liposomes, nanoparticles, PEG, PEI, plasmid vectors (e.g. pUC vectors, bluescript vectors (pBS) and pBR322 or derivatives thereof that are devoid of bacterial sequences (minicircles)) transposons-based vectors (e.g. PiggyBac (PB) vectors or Sleeping Beauty (SB) vectors), etc. Viral vectors are derived from viruses and include but are not limited to retroviral, lentiviral, adeno-associated viral, adenoviral, herpes viral, hepatitis viral vectors or the like. Typically, but not necessarily, viral vectors are replication-deficient as they have lost the ability to propagate in a given cell since viral genes essential for replication have been eliminated from the viral vector. However, some viral vectors can also be adapted to replicate specifically in a given cell, such as e.g. a cancer cell, and are typically used to trigger the (cancer) cell-specific (onco)lysis. Virosomes are a non-limiting example of a vector that comprises both viral and non-viral elements, in particular they combine liposomes with an inactivated HIV or influenza virus (Yamada et al., 2003). Another example encompasses viral vectors mixed with cationic lipids.

In preferred embodiments, the vector is a viral vector, such as a retroviral, lentiviral, adenoviral, or adeno-associated viral (AAV) vector, more preferably an AAV vector. AAV vectors are preferably used as self-complementary, double-stranded AAV vectors (scAAV) in order to overcome one of the limiting steps in AAV transduction (i.e. single-stranded to double-stranded AAV conversion) (McCarty, 2001, 2003; Nathwani et al, 2002, 2006, 2011; Wu et al., 2008), although the use of single- stranded AAV vectors (ssAAV) are also encompassed herein.

In embodiments, the vector is an AAV serotype 9 (AAV9) vector, more particularly a self complementary AAV9 vector (scAAV9). In embodiments, the vector is an AAV serotype 8 (AAV8) vector such as a self-complementary AAV8 vector (scAAV8) or a single-stranded AAV8 vector (ssAAV8).

The vector may be an AAV vector of which the AAV capsid is engineered to direct the vector specifically to muscle cells. For example, the vector may be a AAVpol vector as described in Tulalamba W et al. (Tulalamba W, et al. Distinct transduction of muscle tissue in mice after systemic delivery of AAVpol vectors. Gene Ther. (2019) https://doi.org/10.1038/s41434-019-0106-3).

Production of AAV vector particles can be achieved e.g. by transient transfection of suspension- adapted mammalian HEK293 cells, as described (Chahal et al. Production of adeno-associated virus (AAV) serotypes by transient transfection of HEK293 cell suspension cultures for gene delivery, Journal of Virological Methods. 196: 163-173 (2014); Grieger et al., Production of recombinant adeno-associated virus vectors using suspension HEK293 cells and continuous harvest of vector from the culture media for GMP FIX and FLT1 clinical vector. Molecular Therapy. 24: 287-297 (2016); Blessing et al., Scalable Production of AAV Vectors in Orbitally Shaken FIEK293 Cells. Molecular Therapy Methods & Clinical Development. 13: 14-26 (2019)), or by infection of Spodoptera frugiperda (Sf9) insect cells using the baculovirus expression vector system (BEVS), as described (Kotin et al. Manufacturing Clinical Grade Recombinant Adeno-Associated Virus Using Invertebrate Cell Lines. Fluman Gene Therapy. 28: 350-360 (2017)), followed by a purification step. Purification may be based on cesium chloride (CsCI) density gradient ultracentrifugation, as described (VandenDriessche et al., 2007), or using chromatographic techniques or columns or by immunoaffinity as known in the art. The vector can also be a non-viral vector, preferably a plasmid, a minicircle, or a transposon-based vector, such as a Sleeping Beauty(SB)-based vector or piggyBac(PB)-based vector.

The vector may also comprise viral and non-viral elements.

In particular embodiments, the invention provides a vector comprising a nucleic acid expression cassette comprising a nucleic acid regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:10 (Dph-CRE02 - Dph-CRE04 - CSk-SH5), a promoter, preferably the SPc5-12 promoter, an MVM intron, a transgene, preferably a transgene encoding human b-sarcoglycan, and a polyadenylation signal, preferably the polyadenylation signal defined by SEQ ID NO:17. In preferred embodiments, the transgene encoding human b-sarcoglycan is codon-optimized such as the transgene set forth in SEQ ID NO:19. In particular embodiments, said vector has SEQ ID NO: 26 or 27.

In particular embodiments, the invention provides a vector comprising a nucleic acid expression cassette comprising a nucleic acid regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:13 (Dph-CRE06 - Dph-CRE04 - CSk-SH5), a promoter, preferably the SPc5-12 promoter, an MVM intron, a transgene, preferably a transgene encoding human b-sarcoglycan, and a polyadenylation signal, preferably the polyadenylation signal defined by SEQ ID NO:17. In preferred embodiments, the transgene encoding human b-sarcoglycan is codon-optimized such as the transgene set forth in SEQ ID NO:19. In particular embodiments, said vector has SEQ ID NO: 30 or 31.

In particular embodiments, the invention provides a vector comprising a nucleic acid expression cassette comprising a nucleic acid regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:ll (Dph-CRE02 - Dph-CRE06 - CSk-SH5), a promoter, preferably the SPc5-12 promoter, an MVM intron, a transgene, preferably a transgene encoding human b-sarcoglycan, and a polyadenylation signal, preferably the polyadenylation signal defined by SEQ ID NO:17. In preferred embodiments, the transgene encoding human b-sarcoglycan is codon-optimized such as the transgene set forth in SEQ ID NO:19. In particular embodiments, said vector has SEQ ID NO: 32 or 33.

In particular embodiments, the invention provides a vector comprising a nucleic acid expression cassette comprising a nucleic acid regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:12 (Dph-CRE04 - Dph-CRE06 - CSk-SH5), a promoter, preferably the SPc5-12 promoter, an MVM intron, a transgene, preferably a transgene encoding human b-sarcoglycan, and a polyadenylation signal, preferably the polyadenylation signal defined by SEQ ID NO:17. In preferred embodiments, the transgene encoding human b-sarcoglycan is codon-optimized such as the transgene set forth in SEQ ID NO:19. In particular embodiments, said vector has SEQ ID NO: 28 or 29.

In particular embodiments, the invention provides a vector comprising a nucleic acid expression cassette comprising a nucleic acid regulatory element consisting of the nucleotide sequence set forth in SEQ ID NO:14 (Dph-CRE02 - Dph-CRE04 - Dph-CRE06 - CSk-SH5), a promoter, preferably the SPc5-12 promoter, an MVM intron, a transgene, preferably a transgene encoding human b- sarcoglycan, and a polyadenylation signal, preferably the polyadenylation signal defined by SEQ ID NO:17. In preferred embodiments, the transgene encoding human b-sarcoglycan is codon- optimized such as the transgene set forth in SEQ ID NO:19. In particular embodiments, said vector has SEQ ID NO: 34 or 35.

The nucleic acid expression cassettes and vectors disclosed herein may be used, for example, to express proteins that are normally expressed and utilized in muscle (i.e. structural proteins), or to express proteins that are expressed in muscle and that are then exported to the blood stream for transport to other portions of the body (i.e. secretable proteins). For example, the expression cassettes and vectors disclosed herein may be used to express a therapeutic amount of a gene product (such as a polypeptide, in particular a therapeutic protein, or RNA) for therapeutic purposes, in particular for gene therapy. Typically, the gene product is encoded by the transgene within the expression cassette or vector, although in principle it is also possible to increase expression of an endogenous gene for therapeutic purposes. In an alternative example, the expression cassettes and vectors disclosed herein may be used to express an immunological amount of a gene product (such as a polypeptide, in particular an immunogenic protein, or RNA) for vaccination purposes.

The nucleic acid expression cassettes and vectors as taught herein may be formulated in a pharmaceutical composition with a pharmaceutically acceptable excipient, i.e., one or more pharmaceutically acceptable carrier substances and/or additives, e.g., buffers, carriers, excipients, stabilisers, etc. The pharmaceutical composition may be provided in the form of a kit.

The term "pharmaceutically acceptable" as used herein is consistent with the art and means compatible with the other ingredients of the pharmaceutical composition and not deleterious to the recipient thereof.

Accordingly, a further aspect of the invention relates to a pharmaceutical composition comprising a nucleic acid expression cassette or a vector described herein.

The use of nucleic acid regulatory elements and vectors described herein for the manufacture of these pharmaceutical compositions is also disclosed herein.

For example, the pharmaceutical composition may be a vaccine. The vaccine may further comprise one or more adjuvants for enhancing the immune response. Suitable adjuvants include, for example, but without limitation, saponin, mineral gels such as aluminium hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil or hydrocarbon emulsions, bacilli Calmette-Guerin (BCG), Corynebacterium parvum, and the synthetic adjuvant QS- 21. Optionally, the vaccine may further comprise one or more immunostimulatory molecules. Non limiting examples of immunostimulatory molecules include various cytokines, lymphokines and chemokines with immunostimulatory, immunopotentiating, and pro-inflammatory activities, such as interleukins (e.g., IL-1, IL-2, IL-3, IL-4, IL-12, IL-13); growth factors (e.g., granulocyte-macrophage (GM)-colony stimulating factor (CSF)); and other immunostimulatory molecules, such as macrophage inflammatory factor, Flt3 ligand, B7.1; B7.2, etc.

In a further aspect, the invention relates to the nucleic acid regulatory elements, the nucleic acid expression cassettes, the vectors, or the pharmaceutical compositions described herein for use in medicine.

As used herein, the terms "treat" or "treatment" refer to both therapeutic treatment and prophylactic or preventative measures. Beneficial or desired clinical results include, but are not limited to, prevention of an undesired clinical state or disorder, reducing the incidence of a disorder, alleviation of symptoms associated with a disorder, diminishment of extent of a disorder, stabilized (i.e., not worsening) state of a disorder, delay or slowing of progression of a disorder, amelioration or palliation of the state of a disorder, remission (whether partial or total), whether detectable or undetectable, or combinations thereof. "Treatment" can also mean prolonging survival as compared to expected survival if not receiving treatment.

As used herein, the terms "therapeutic treatment" or "therapy" and the like, refer to treatments wherein the object is to bring a subjects body or an element thereof from an undesired physiological change or disorder to a desired state, such as a less severe or unpleasant state (e.g., alleviation of symptoms, amelioration of the condition or palliation), or back to its normal, healthy state (e.g., restoring the health, the physical integrity and the physical well-being of a subject), to keep it at said undesired physiological change or disorder (e.g., stabilization, or not worsening), or to prevent or slow down progression to a more severe or worse state compared to said undesired physiological change or disorder.

As used herein the terms "prevention", "preventive treatment" or "prophylactic treatment" and the like encompass preventing the onset of a disease or disorder, including reducing the severity of a disease or disorder or symptoms associated therewith prior to affliction with said disease or disorder. Such prevention or reduction prior to affliction refers to administration of the nucleic acid regulatory elements, the nucleic acid expression cassettes, the vectors, or the pharmaceutical compositions described herein to a patient that is not at the time of administration afflicted with clear symptoms of the disease or disorder. "Preventing" also encompasses preventing the recurrence or relapse-prevention of a disease or disorder for instance after a period of improvement.

In embodiments, the nucleic acid regulatory elements, the nucleic acid expression cassettes, the vectors, or the pharmaceutical compositions described herein may be for use in gene therapy, in particular muscle-directed gene therapy, in particular diaphragm-, smooth muscle-, heart- and skeletal muscle-directed gene therapy, more particularly diaphragm-, heart- and skeletal muscle- directed gene therapy.

Also disclosed herein is the use of the nucleic acid regulatory elements, the nucleic acid expression cassettes, the vectors, or the pharmaceutical compositions described herein for the manufacture of a medicament for gene therapy, in particular muscle-directed gene therapy, in particular diaphragm-, smooth muscle-, heart- and skeletal muscle-directed gene therapy, more particularly diaphragm-, heart- and skeletal muscle-directed gene therapy. Also disclosed herein is a method for gene therapy, in particular muscle-directed gene therapy, in particular diaphragm-, smooth muscle-, heart- and skeletal muscle-directed gene therapy, more particularly diaphragm-, heart- and skeletal muscle-directed gene therapy, in a subject in need of said gene therapy comprising: introducing in the subject, in particular in muscle tissue or cells of the subject, in particular in diaphragm, smooth muscle, heart, and/or skeletal muscle tissue or cells, more particularly in diaphragm, heart, and/or skeletal muscle tissue or cells of the subject, a nucleic acid expression cassette, a vector or a pharmaceutical composition described herein, wherein the nucleic acid expression cassette, the vector or the pharmaceutical composition comprises a nucleic acid regulatory element described herein, operably linked to a promoter and a transgene; and expressing a therapeutically effective amount of the transgene product in the subject, in particular in muscle cells or tissue of the subject, in particular in diaphragm, smooth muscle, heart and/or skeletal muscle cells or tissue, more particularly in diaphragm, heart and/or skeletal muscle cells or tissue of the subject.

The transgene product may be a polypeptide, in particular a structural protein such as, e.g., dystrophin or a sarcoglycan; a secretable protein such as, e.g., a clotting factor, e.g., factor IX or factor VIII, a cytokine, a growth factor, an antibody or nanobody, a chemokine, a plasma factor, insulin, erythropoietin, lipoprotein lipase; or a non-secreted protein such as a nuclear protein, metabolic enzyme or a lysosomal protein. Further, non-limiting examples of transgene products have been disclosed above in connection with the transgene and include, without limitation, angiogenic factors; cytokines and/or growth factors; proteins involved in calcium handling; lysosomal proteins; ion channels antibodies, nanobodies, anti-viral dominant-negative proteins, and fragments, subunits or mutants thereof; and enzymes (e.g. enzymes involved in glycogen metabolism and enzymes deficient in mucopolysaccharidosis). In particular embodiments, the transgene product is a sarcoglycan, preferably b-sarcoglycan. In particular embodiments, the transgene product is a lysosomal protein, preferably acid a-glucosidase (GAA) (e.g. GAA as a secreted or native form), alpha-galactosidase A or LAMP2, more preferably acid alpha-glucosidase (GAA). Alternatively, the transgene product may be RNA, such as siRNA or non-coding RNA (ncRNA).

Exemplary diseases and disorders that may benefit from gene therapy using the nucleic acid regulatory elements, the nucleic acid expression cassettes, the vectors, or the pharmaceutical compositions described herein include: - lysosomal storage diseases (e.g. Fabry disease) including glycogen storage disorders (e.g. Pompe disease glycogen storage disorder (GSD) type II, Danon disease, glycogen storage disorder (GSD) type Mb, GSD III or GSD 3 (also known as Cori's disease or Forbes' disease), GSD IV or GSD4 (also known as Andersen disease), GSD V or GSD5 (also known as McArdle disease), GSD VII or GSD7 (also known as Tarui's disease), GSD X or GSD10, GSD XII or GSD 12 (also known as Aldolase A deficiency), GSD XIII or GSD13, GSD XV or GSD15) and mucopolysaccharidosis disorders (e.g. Flunter syndrome, Sanfilippo syndrome, mucopolyssacharidose (MPS) I, MPS II, MPS III, MPS IMA, MPS IIIB, MPS MIC, MPS IV, MPS VI, MPS VII, MPS IX); mitochondrial disorders (e.g. Barth syndrome); channelopathy (e.g. Brugada syndrome); metabolic disorders; myotubular myopathy (MTM); muscular dystrophy (e.g. Duchenne muscular dystrophy (DMD), Becker muscular dystrophy

(BMD)); myotonic dystrophy;

Myotonic Muscular Dystrophy (DM);

Miyoshi myopathy;

Fukuyama type congenital; dysferlinopathies; neuromuscular disease; motor neuron diseases (MND) (e.g. Charcot-Marie-Tooth disease (CMT), spinal muscular atrophy (SMA) or amyotrophic lateral sclerosis (ALS));

Emery-Dreifuss muscular dystrophy; facioscapulohumeral muscular dystrophy (FSHD); congenital muscular dystrophies; congenital myopathies; limb girdle muscular dystrophy (e.g. Limb Girdle Muscular Dystrophy type 2E (LGMD2E), Limb Girdle Muscular Dystrophy type 2D (LGMD2D), Limb Girdle Muscular Dystrophy type 2C (LGMD2C), Limb Girdle Muscular Dystrophy type 2B (LGMD2B), Limb Girdle Muscular Dystrophy type 2L (LGMD2L), Limb Girdle Muscular Dystrophy type 2A (LGMD2A)); metabolic myopathies; muscle inflammatory diseases; myasthenia; mitochondrial myopathies; anomalies of ionic channels; nuclear envelop diseases; cardiomyopathies; cardiac hypertrophy; heart failure; distal myopathies; hemophilia (e.g. hemophilia A and B); diabetes; and

- cardiovascular diseases and heart diseases (e.g. atherosclerosis, arteriosclerosis, coronary heart disease, coronary artery disease, peripheral arterial disease, congenital heart disease, congestive heart failure, heart failure (also known as cardiac insufficiency), myocardial infarction (also known as heart attack), cardiac ischemia, acute coronary syndrome, unstable angina, stable angina, cardiomyopathy, hypertrophic cardiomyopathy, dilated cardiomyopathy, restrictive cardiomyopathy, primary cardiomyopathies caused by genetic mutations such as Brugada syndrome, Pompe disease, Danon disease and Fabry disease, cardiac amyloidosis (also known as stiff heart syndrome), myocarditis (also known as inflammatory cardiomyopathy), valvular heart disease, valvular stenosis, valvular insufficiency, endocarditis, rheumatic heart disease, pericarditis (i.e. a disease caused by an inflammation and/or infection of the pericardium), cardiac tamponade (also known as pericardial tamponade), endocarditis, cardiac arrhythmia, hypertension, hypotension, vessel stenosis, valve stenosis, or restenosis).

In addition, many neuromuscular disorders affect respiratory function due to weakening of the diaphragm and respiratory muscles (www.medscape.com/viewarticle/805299_3) Semin Respir Crit Care Med. 2002 Jun;23(3):191-200). Causes of diseases of the diaphragm vary, but they can be due to gene defects that influence diaphragm function directly. In particular, there are multiple genetic disorders that are due to mutations in genes that affect the function of the diaphragm, often in combination with abnormalities at the level of skeletal muscles and/or heart. For example, myotubular myopathy (MTM) is due to mutations in the myotubularin gene and affects the skeletal muscle and diaphragm. Patients suffering from MTM typically present with hypotonia, generalized muscle weakness and respiratory failure at birth. Survival beyond the postnatal period requires intensive support, often including gastrostomy feeding and mechanical ventilation. Because of their severe breathing problems, patients suffering from MTM typically do not live past age 2. For MTM, muscle-directed gene therapy is currently the only clinically relevant option. Alternatively, Pompe's disease (also referred to as glycogen storage disorder type II or GSD II) mainly affects skeletal muscle, diaphragm and heart. GSD II results in deficiency of the lysosomal enzyme acid a- glucosidase (GAA) that leads to a lysosomal storage defect. In GSD II patients, glycogen cannot be broken down effectively into glucose. The accumulation of glycogen in GSD II patients causes myopathy with progressive muscle weakness. Without medical intervention, patients suffering from the most severe form of GSD II die because of respiratory failure within the first year of life. Other muscle diseases such as Duchenne muscular dystrophy (DMD) afflicts approximately one in 3500 live male births. The disease leads to a progressive destruction of skeletal muscles, including the diaphragm, the most affected individuals die of ventilatory failure in the third decade of life. Many other myopathies also affect pulmonary function, including -but not limited to- polymyositis/dermatomyositis, hereditary channel disorders, mitochondrial encephalomyopathies, acid maltase deficiency, and congenital myopathy, disuse atrophy. Other diseases affecting diaphragm include Congenital Muscular Dystrophy (CMD), Becker Muscular Dystrophy (BMD), Facioscapulohumeral Muscular Dystrophy (FSHD), Limb Girdle Muscular Dystrophy (LGMD), Myotonic Muscular Dystrophy (DM), Miyoshi myopathy, Fukuyama type congenital muscular dystrophy, dysferlinopathies. Also many neuropathic disorders weaken the diaphragm and respiratory muscles. This includes amyotrophic lateral sclerosis, poliomyelitis, postpolio syndrome, Kennedy syndrome, stroke, multiple sclerosis, spinal muscular atrophy, syringomyelia, neuralgic neuropathy, and motor neuron diseases. Brachial plexitis and isolated unilateral or bilateral phrenic neuropathies can also weaken the diaphragm significantly. Peripheral neuropathies affecting respiration are primarily acute disorders such as Guillain-Barre syndrome, porphyria, and critical illness neuropathy, but chronic diseases such as chronic inflammatory demyelinating polyneuropathy (CIDP) and Charcot-Marie-Tooth disease (CMT) can also cause respiratory insufficiency. Disorders of neuromuscular transmission such as Lambert-Eaton syndrome, and myasthenia gravis often affect respiration. Alternatively, diaphragm dysfunction can be the result of congenital defects resulting in anatomical abnormalities (e.g. Arnold-Chiari malformation) or acquired defects, which occur as the result of an injury, trauma, infection (e.g. West Nile virus, botulism), exposure to, organophosphates, radiation therapy, malnutrition, tumour compression or surgery. Cold cardioplegia used in cardiac surgery is another common cause of phrenic nerve injury. In addition, radiation therapy can affect the phrenic nerve resulting in diaphragmatic dysfunction. Obstructive airway diseases that affect the lungs, such as chronic obstructive pulmonary disease (COPD) and asthma, can result in significant hyperinflation resulting in diaphragmatic disadvantage and weakness. Finally, it is known that lupus and thyroid disorders can also contribute to diaphragm dysfunction.

In embodiments, the nucleic acid expression cassettes, the vectors, or the pharmaceutical compositions described herein are for use in the treatment of Limb Girdle Muscular Dystrophy, in particular Limb Girdle Muscular Dystrophy type 2E (LGMD2E), wherein the nucleic acid expression cassette, the vector or the pharmaceutical composition comprises a nucleic acid regulatory element described herein operably linked to a promoter and a transgene encoding b-sarcoglycan, preferably human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19).

Also disclosed herein is the use of the nucleic acid expression cassettes, the vectors, or the pharmaceutical compositions described herein for the manufacture of a medicament for the treatment of Limb Girdle Muscular Dystrophy, in particular Limb Girdle Muscular Dystrophy type 2E (LGMD2E), wherein the nucleic acid expression cassette, the vector or the pharmaceutical composition comprises a nucleic acid regulatory element described herein operably linked to a promoter and a transgene encoding b-sarcoglycan, preferably human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19).

Also disclosed herein is a method for treating Limb Girdle Muscular Dystrophy, in particular Limb Girdle Muscular Dystrophy type 2E (LGMD2E), in a subject comprising: introducing in the subject, in particular in muscle tissue or cells of the subject, preferably in diaphragm, smooth muscle, skeletal muscle, and heart tissue or cells of the subject, more preferably in diaphragm, skeletal muscle-and heart tissue or cells of the subject, a nucleic acid expression cassettes, a vector, or a pharmaceutical compositions described herein, wherein the nucleic acid expression cassette, the vector or the pharmaceutical composition comprises a nucleic acid regulatory element described herein operably linked to a promoter and a transgene encoding b- sarcoglycan, preferably human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19); and expressing a therapeutically effective amount of b-sarcoglycan, preferably human b- sarcoglycan, in the subject, in particular in the muscle tissue or cells of the subject, preferably in the diaphragm, skeletal muscle, smooth muscle and heart tissue or cells of the subject, more preferably in the diaphragm-, skeletal muscle- and heart tissue or cells of the subject.

In particular embodiments, the invention relates to a nucleic acid expression cassette comprising a regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:12 (Dph-CRE04 - Dph-CRE06 - CSk-SH5), operably linked to a promoter, preferably a promoter selected from the group consisting the SPc5-12 promoter, the MHCK7 promoter and the desmin promoter, and a transgene encoding human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19), a vector comprising said nucleic acid expression cassette, or a pharmaceutical composition comprising said nucleic acid expression cassette or said vector, for use in the treatment of Limb Girdle Muscular Dystrophy, preferably Limb Girdle Muscular Dystrophy type 2E (LGMD2E). In particular embodiments, the invention provides a nucleic acid expression cassette comprising a regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:14 (Dph-CRE02 - Dph-CRE04 - Dph-CRE06 - CSk-SH5), operably linked to a promoter, preferably a promoter selected from the group consisting the SPc5-12 promoter, the MHCK7 promoter and the desmin promoter, and a transgene encoding human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19), a vector comprising said nucleic acid expression cassette, or a pharmaceutical composition comprising said nucleic acid expression cassette or said vector, for use in the treatment of Limb Girdle Muscular Dystrophy type 2E (LGMD2E). In particular embodiments, the invention relates to a nucleic acid expression cassette comprising a regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:ll (Dph-CRE02 - Dph-CRE06-CSk-SH5), operably linked to a promoter, preferably a promoter selected from the group consisting the SPc5-12 promoter, the MHCK7 promoter and the desmin promoter, and a transgene encoding human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19), a vector comprising said nucleic acid expression cassette, or a pharmaceutical composition comprising said nucleic acid expression cassette or said vector, for use in the treatment of Limb Girdle Muscular Dystrophy type 2E (LGMD2E). In particular embodiments, the invention relates to a nucleic acid expression cassette comprising a regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:13 (Dph-CRE06 - Dph-CRE04 - CSk-SH5), operably linked to a promoter, preferably a promoter selected from the group consisting the SPc5-12 promoter, the MHCK7 promoter and the desmin promoter, and a transgene encoding human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19), a vector comprising said nucleic acid expression cassette, or a pharmaceutical composition comprising said nucleic acid expression cassette or said vector, for use in the treatment of Limb Girdle Muscular Dystrophy type 2E (LGMD2E). In particular embodiments, the invention relates to a nucleic acid expression cassette comprising a regulatory element consisting essentially of or consisting of the nucleotide sequence set forth in SEQ ID NO:10 (Dph-CRE02 - Dph-CRE04-CSk-SH5), operably linked to a promoter, preferably a promoter selected from the group consisting the SPc5-12 promoter, the MHCK7 promoter and the desmin promoter, and a transgene encoding human b-sarcoglycan (e.g. the transgene set forth in SEQ ID NO:18, preferably the codon-optimized transgene set forth in SEQ ID NO:19), a vector comprising said nucleic acid expression cassette, or a pharmaceutical composition comprising said nucleic acid expression cassette or said vector, for use in the treatment of Limb Girdle Muscular Dystrophy, preferably Limb Girdle Muscular Dystrophy type 2E (LGMD2E).

In further embodiments of the nucleic acid expression cassette, the vector or the pharmaceutical composition for use in the treatment of Limb Girdle Muscular Dystrophy, preferably Limb Girdle Muscular Dystrophy type 2E (LGMD2E) as disclosed herein, the transgene is codon-optimized. In embodiments, the nucleic acid expression cassette further comprises an MVM intron. In embodiments the nucleic acid expression cassette further comprises a polyadenylation signal, preferably a polyadenylation signal defined by SEQ ID NO:17.

Also disclosed herein is a nucleic acid expression cassette, a vector, or a pharmaceutical compositions described herein for use in treating Pompe disease, wherein the nucleic acid expression cassette, the vector or the pharmaceutical composition comprises a nucleic acid regulatory element described herein operably linked to a promoter and a transgene encoding acid a-glucosidase (GAA), preferably human acid a-glucosidase (hGAA) (e.g. the transgene set forth in SEQ ID NO:58, preferably the codon-optimized transgene set forth in SEQ ID NO:59).

Also disclosed herein is the use of a nucleic acid expression cassette, a vector, or a pharmaceutical compositions described herein, wherein the nucleic acid expression cassette, the vector or the pharmaceutical composition comprises a nucleic acid regulatory element described herein operably linked to a promoter and a transgene encoding acid a-glucosidase (GAA), preferably human acid a- glucosidase (hGAA) (e.g. the transgene set forth in SEQ ID NO:58, preferably the codon-optimized transgene set forth in SEQ ID NO:59), for the manufacture of a medicament for treating Pompe disease.

Also disclosed herein is a method for treating Pompe disease in a subject comprising: introducing in the subject, in particular in muscle tissue or cells of the subject, preferably in diaphragm, skeletal muscle, smooth muscle and heart tissue or cells of the subject, more preferably in diaphragm-, skeletal muscle- and heart tissue or cells of the subject, a nucleic acid expression cassette, a vector, or a pharmaceutical compositions described herein, wherein the nucleic acid expression cassette, the vector or the pharmaceutical composition comprises a nucleic acid regulatory element described herein operably linked to a promoter and a transgene encoding acid a-glucosidase (GAA), preferably human acid a-glucosidase (hGAA) (e.g. the transgene set forth in SEQ ID NO:58, preferably the codon-optimized transgene set forth in SEQ ID NO:59); and expressing a therapeutically effective amount of GAA, preferably hGAA, in the subject, in particular in the muscle tissue or cells of the subject, preferably in the diaphragm, skeletal muscle, smooth muscle and heart tissue or cells of the subject, more preferably in the diaphragm-, skeletal muscle- and heart tissue or cells of the subject.

Gene therapy protocols have been extensively described in the art. These include, but are not limited to, intramuscular injection of plasmid (naked or in liposomes), hydrodynamic gene delivery in various tissues, including muscle, interstitial injection, instillation in airways, application to endothelium, intra-hepatic parenchyme, and intravenous or intra-arterial administration. Various devices have been developed for enhancing the availability of DNA to the target cell. A simple approach is to contact the target cell physically with catheters or implantable materials containing DNA. Another approach is to utilize needle-free, jet injection devices which project a column of liquid directly into the target tissue under high pressure. These delivery paradigms can also be used to deliver vectors. Another approach to targeted gene delivery is the use of molecular conjugates, which consist of protein or synthetic ligands to which a nucleic acid-or DNA-binding agent has been attached for the specific targeting of nucleic acids to cells (Cristiano et al., 1993).

The nucleic acid regulatory elements, the nucleic acid expression cassettes, the vectors, or the pharmaceutical compositions described herein may also be for use as a vaccine, more particularly for use as a prophylactic vaccine.

Also disclosed herein is the use of the nucleic acid regulatory elements, the nucleic acid expression cassettes, the vectors, or the pharmaceutical compositions described herein for the manufacture of a vaccine, in particular for the manufacture of a prophylactic vaccine.

Also disclosed herein is a method of vaccination, in particular prophylactic vaccination, of a subject in need of said vaccination comprising: introducing in the subject, in particular in muscle of the subject, more particularly in diaphragm, smooth muscle, heart and/or skeletal muscle tissue or cells, even more particularly in diaphragm, heart and/or skeletal muscle tissue or cells of the subject, a nucleic acid expression cassette, a vector or a pharmaceutical composition described herein, wherein the nucleic acid expression cassette, the vector or the pharmaceutical composition comprises a nucleic acid regulatory element described herein operably linked to a promoter and a transgene; and expressing an immunologically effective amount of the transgene product in the subject, in particular in muscle of the subject, more particularly in diaphragm, smooth muscle, heart and/or skeletal muscle tissue or cells, even more particularly in diaphragm, heart and/or skeletal muscle cells or tissue of the subject.

As used herein, a phrase such as "a subject in need of treatment" includes subjects that would benefit from treatment of a recited disease or disorder. Such subjects may include, without limitation, those that have been diagnosed with said disease or disorder, those prone to contract or develop said disease or disorder and/or those in whom said disease or disorder is to be prevented.

The terms "subject" and "patient" are used interchangeably herein and refer to animals, preferably vertebrates, more preferably mammals, and specifically include human patients and non-human mammals. "Mammalian" subjects include, but are not limited to, humans, domestic animals, commercial animals, farm animals, zoo animals, sport animals, pet and experimental animals such as dogs, cats, guinea pigs, rabbits, rats, mice, horses, cattle, cows; primates such as apes, monkeys, orang-utans, and chimpanzees; canids such as dogs and wolves; felids such as cats, lions, and tigers; equids such as horses, donkeys, and zebras; food animals such as cows, pigs, and sheep; ungulates such as deer and giraffes; rodents such as mice, rats, hamsters and guinea pigs; and so on. Preferred patients or subjects are human subjects.

A 'therapeutic amount' or 'therapeutically effective amount' as used herein refers to the amount of gene product effective to treat a disease or disorder in a subject, i.e., to obtain a desired local or systemic effect. The term thus refers to the quantity of gene product that elicits the biological or medicinal response in a tissue, system, animal, or human that is being sought by a researcher, veterinarian, medical doctor or other clinician. Such amount will typically depend on the gene product and the severity of the disease, but can be decided by the skilled person, possibly through routine experimentation.

An "immunologically effective amount" as used herein refers to the amount of (trans)gene product effective to enhance the immune response of a subject against a subsequent exposure to the immunogen encoded by the (trans)gene. Levels of induced immunity can be determined, e.g. by measuring amounts of neutralizing secretory and/or serum antibodies, e.g., by plaque neutralization, complement fixation, enzyme-linked immunosorbent, or microneutralization assay.

Typically, the amount of (trans)gene product expressed when using an expression cassette or vector as described herein (i.e., with at least one muscle-specific nucleic acid regulatory element) are higher than when an identical expression cassette or vector is used but without a nucleic acid regulatory element therein. More particularly, the expression is at least double as high, at least five times as high, at least ten times as high, at least 20 times as high, at least 30 times as high, at least 40 times as high, at least 50 times as high, or even at least 60 times as high as when compared to the same nucleic acid expression cassette or vector without nucleic acid regulatory element. Moreover, the higher expression remains specific to muscle, in particular diaphragm, smooth muscle, heart and skeletal muscle tissues or cells, more particularly diaphragm, heart and skeletal muscle tissues or cells. Furthermore, the expression cassettes and vectors described herein direct the expression of a therapeutic amount of the gene product for an extended period. Typically, therapeutic expression is envisaged to last at least 20 days, at least 50 days, at least 100 days, at least 200 days, or at least 300 days or more such as at least 1 year, at least 2 years, at least 3 years, at least 5 years, at least 6 years, at least 7 years, at least 8 years, at least 9 years, or even at least 10 years or more. Expression of the gene product (e.g. polypeptide) can be measured by any art- recognized means, such as by antibody-based assays, e.g. a Western Blot or an ELISA assay, for instance to evaluate whether therapeutic expression of the gene product is achieved. Expression of the gene product may also be measured in a bioassay that detects an enzymatic or biological activity of the gene product.

Also disclosed herein is the use of the nucleic acid regulatory elements, the nucleic acid expression cassettes, or the vectors disclosed herein for transfecting or transducing muscle cells, preferably diaphragm, smooth muscle, heart and/or skeletal muscle cells, more preferably diaphragm, heart and/or skeletal muscle cells.

Further disclosed herein is the use of the nucleic acid expression cassettes or the vectors disclosed herein for expressing a transgene product in muscle cells, preferably diaphragm, smooth muscle, heart and/or skeletal muscle cells, more preferably diaphragm, heart and/or skeletal muscle cells, wherein the nucleic acid expression cassette or the vector comprises a nucleic acid regulatory element disclosed herein operably linked to a promoter and a transgene.

Further disclosed herein is a method for expressing a transgene product in muscle cells, preferably diaphragm, smooth muscle, heart and/or skeletal muscle cells, more preferably diaphragm, heart and/or skeletal muscle cells, comprising: transfecting or transducing the muscle cells with a nucleic acid expression cassette or a vector disclosed herein, wherein the nucleic acid expression cassette or the vector comprises a nucleic acid regulatory element disclosed herein operably linked to a promoter and a transgene; and expressing the transgene product in the muscle cells. Non-viral transfection or viral vector-mediated transduction of muscle cells may be performed by in vitro, ex vivo or in vivo procedures. The in vitro approach requires the in vitro transfection or transduction of muscle cells, e.g. muscle cells previously harvested from a subject, muscle cell lines or muscle cells differentiated from e.g. induced pluripotent stem cells or embryonic cells. The ex vivo approach requires harvesting of the muscle cells from a subject, in vitro transfection or transduction, and optionally re-introduction of the transfected muscle cells into the subject. The in vivo approach requires the administration of the nucleic acid expression cassette or the vector disclosed herein into a subject. In preferred embodiments, the transfection of the muscle cells is performed in vitro or ex vivo.

It is understood by the skilled person that the use of the nucleic acid regulatory elements, the nucleic acid expression cassettes and vectors disclosed herein has implications beyond gene therapy, e.g. coaxed differentiation of stem cells into diaphragm, smooth muscle, heart and skeletal muscle cells, transgenic models for over-expression of proteins in muscle cells or tissue such as in diaphragm, smooth muscle heart and/or skeletal muscle cells and/or tissue, etc.

The invention is further explained by the following non-limiting examples.

EXAMPLES

Example 1 : Design and construction of novel AAV vectors containing de novo computationally-designed regulatory elements (CREs)

AAV9 vectors were designed that express either a codon-optimized transgene encoding human b- sacoglycan (Iib-sgco or f^sgco) (SEQ ID NO: 19) or the wild type Iib-sg (wt) gene (SEQ ID NO:18), or a luciferase reporter gene (Luc) (SEQ ID NO:20) from a synthetic SPc5-12 promoter (SEQ ID NO:15) combined with (combinations of) CRE elements. The individual CRE elements are known to mediate high transgene expression in diaphragm and skeletal muscle (denoted as Dph-CRE herein) or in heart and skeletal muscle (denoted as CSk-CRE or CSk-SH herein). The vectors also contained a Minute Virus of Mouse (MVM) intron (SEQ ID NO: 16) and a synthetic polyadenylation site (pA) (SEQ ID NO:17).

AAV vectors devoid of any CRE element were constructed as control (designated herein as AAV- SPc5-12-MVM-f^sgco-pA and AAV-SPc5-12-MVM-l^sg-pA).

Figure 2 depicts the different AAV vectors that were designed and constructed. The vectors were constructed by conventional cloning. Briefly, the wild-type human beta-sarcoglycan gene (hpsg) (SEQ ID NO: 18), the codon-optimized hpsgco gene (SEQ ID NO: 19), or a luciferase encoding reporter gene (SEQ ID NO:20), all flanked by Hind III and BstBI restriction sites at the 5' and 3' ends, were cloned downstream from the SPc5-12 promoter, which was operably linked to the regulatory element CSk-SH5 (SEQ ID NO: 1). The human beta-sarcoglycan gene (hpsg) gene was codon-optimized using the Gene optimizer (GeneArt, Life technologies, Germany).

The CSk-SH5 regulatory element (SEQ ID NO: 1) operably linked to the SPc5-12 promoter (SEQ ID NO:15), were cloned upstream of the MVM intron (SEQ ID NO:16) in the context of a single stranded adeno-associated viral vector (AAVss) backbone. The vector also contained a 49 bp synthetic polyadenylation site (Levitt N et al, 1989) (SEQ ID NO:17).

The vectors depicted in Fig. 2A (AAV-CSk-SH5-SPc5-12-MVM-hpsgco-pA), Fig. 2B (AAV-CSk-SH5- SPc5-12-MVM-hpsg-pA) and Fig. 2A1 (AAV-CSk-SH5-SPc5-12-MVM-Luc-pA) served as a backbone for cloning the different Dph-CREs.

One, two or three diaphragm CREs (Dph-CRE) were cloned upstream of the CSk-SH5 CRE to generate constructs expressing hpsgco gene (one CRE: Fig. 2C (AAV-CRE02-CSk-SH5-SPc5-12-MVM-hpsgco- pA), Fig. 2D (AAV-CRE04-CSk-SH5-SPc5-12-MVM-hpsgco-pA), Fig. 2E (AAV-CRE06-CSk-SH5-SPc5-12- MVM-hpsgco-pA); two CREs: Fig. 2F (AAV-CRE02-CRE04-CSk-SH5-SPc5-12-MVM-hpsgco-pA), Fig. 2H (AAV-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hpsgco-pA), Fig. 2J (AAV-CRE06-CRE04-CSk-SH5- SPc5-12-MVM-hpsgco-pA), Fig. 2L (AAV-CRE02-CRE06-CSk-SH5-SPc5-12-MVM-hpsgco-pA); and three CREs: Fig. 2N (AAV-CRE02-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hpsgco-pA)).

Similarly, two or three diaphragm CREs were cloned upstream of the CSk-SH5 CRE to generate constructs expressing the wild type hpsg gene (two CREs: Fig. 2G (AAV-CRE02-CRE04-CSk-SH5-SPc5- 12-MVM-hpsg-pA), Fig. 21 (AAV-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hpsg-pA), Fig. 2K (AAV- CRE06-CRE04-CSk-SH5-SPc5-12-MVM-hpsg-pA), Fig. 2M (AAV-CRE02-CRE06-CSk-SH5-SPc5-12- MVM-hpsg-pA); and three CREs: Fig. 20 (AAV-CRE02-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hpsg- pA)).

In addition, two or three diaphragm CREs were cloned upstream of the CSk-SH5 CRE to generate constructs expressing Luc reporter gene (one CRE: Fig. 2R (AAV-CRE02-CSk-SH5-SPc5-12-MVM-Luc- pA), Fig. 2Q (AAV-CRE04-CSk-SH5-SPc5-12-MVM-Luc-pA), Fig. 2P (AAV-CRE06-CSk-SH5-SPc5-12- MVM-Luc-pA); two CREs: Fig. 2S (AAV-CRE06-CRE04-CSk-SH5-SPc5-12-MVM-Luc-pA, Fig. 2T (AAV- CRE04-CRE06-CSk-SH5-SPc5-12-MVM-Luc-pA), Fig. 2U (AAV-CRE02-CRE06-CSk-SH5-SPc5-12-MVM- Luc-pA), Fig. 2V (AAV-CRE02-CRE04-CSk-SH5-SPc5-12-MVM-Luc-pA); and three CREs: Fig. 2W (AAV- CRE02-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-Luc-pA)).

Lastly, three control vectors devoid of CRE were generated namely, AAV-SPc5-12-MVM-hpsgco-pA (Fig. 2Y), AAV-SPc5-12-MVM-hpsg-pA (Fig. 2X) and AAV-SPc5-12-MVM-Luc-pA (Fig. 2Z).

Example 2: In vivo validation of the regulatory elements via a reporter gene

To evaluate which CRE combinations were most potent, AAV vectors expressing luciferase gene (Luc) from the Spc5-12 promoter and containing different CRE combinations were injected into CB17-SCID mice for comparative analysis. Different CRE combinations were tested for their ability to augment luciferase gene expression. More particularly, the following vectors were injected: AAV-SPc5-12-MVM-Luc-pA (Fig. 21, SEQ ID NO: 46)

AAV- CSk-SH5-SPc5-12-MVM-Luc-pA (Fig. 2A1, SEQ ID NO: 47) AAV-CRE06-CSk-SH5-SPc5-12-MVM-Luc-pA (Fig. 2P, SEQ ID NO:36) AAV-CRE04-CSk-SH5-SPc5-12-MVM-Luc-pA (Fig. 2Q, SEQ ID NO:37) AAV-CRE02-CSk-SH5-SPc5-12-MVM-Luc-pA (Fig. 2R, SEQ ID NO:38) AAV-CRE06-CRE04-CSk-SH5-SPc5-12-MVM-Luc-pA (Fig. 2S, SEQ ID NO:39) AAV-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-Luc-pA (Fig. 2T, SEQ ID NO:40) AAV-CRE02-CRE06-CSk-SH5-SPc5-12-MVM-Luc-pA (Fig. 2U, SEQ ID NO:41) AAV-CRE02-CRE04-CSk-SH5-SPc5-12-MVM-Luc-pA (Fig. 2V, SEQ ID NO:42) AAV-CRE02-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-Luc-pA (Fig. 2W, SEQ ID NO:43)

Experimental procedures

AAV vector production and purification

AAV9 vectors were produced by calcium phosphate (Invitrogen Corp, Carlsbad, CA) co-transfection of 293T human embryonic kidney cells with the pAAV plasmid of interest, an adenoviral helper plasmid and a chimeric packaging construct that delivers the AAV2 rep gene together with the AAV9 cap gene, as described in Vandendriessche et al. (2007. Thromb Haemost 5:16-24), which is specifically incorporated by reference herein.

Briefly, two days post transfection, cells were harvested and vector particles were purified using isopycnic centrifugation methods. Harvested cells were lysed by successive freeze/thaw cycles and sonication, treated with benzonase (Novagen, Madison, Wl) and deoxycholic acid (Sigma-Aldrich, St. Louis, MO) and subsequently subjected to 3 successive rounds of cesium chloride (Invitrogen Corp, Carlsbad, CA) density gradient ultracentrifugation. Fractions containing the AAV vector were collected, concentrated in ImM MgCh in Dulbecco's phosphate buffered saline (PBS) (Gibco, BRL) and stored at -80°C.

Vector titers (in viral genomes (vg)/ml) were determined by quantitative real-time PCR using SYBR Green mix (which included SYBR Green dye, Taqman polymerase, ROX and dNTP's all in one) and luciferase specific primers on an ABI 7500 Real-Time PCR System (Applied Biosystem, Foster city, CA, USA). The forward and reverse primers used were 5'-CCCACCGTCGTATTCGTGAG-3' (SEQ ID NO: 48) and 5'-TCAGGGCGATGGTTTTGTCCC-3' (SEQ ID NO: 49), respectively.

Known copy numbers (10 2 -10 7 ) of the respective vector plasmids used to generate the corresponding AAV vectors, carrying the appropriate cDNAs, were used to generate the standard curves.

Animal studies

Adult 4-5 weeks old CB17-SCID mice were intravenously injected with a dose of 10 vg per mouse. Luciferase activity was assessed by whole body bioluminescence analysis (BLI), 7 or 10 days post injection. 15 weeks post-injection, one mouse per cohort was sacrificed and individual organ BLI was performed to quantify the level of luciferase expression in each individual organ. Fig. 3 shows the fold difference of the luciferase expression in gastrocnemius and quadriceps of the constructs tested.

Results

Whole body bioluminescence analysis revealed that the combination of the CSk-Sh5-CRE, and the dual Dph-CRE combinations CRE02 and CRE04, CRE04 and CRE06, in any order, and CRE02 and CRE06, or the triple Dph-CRE combination CRE02, CRE04 and CRE06 resulted in an unexpected and disproportionate increase in luciferase expression, compared to the control Spc5-12 promoter devoid of any CRE (data not shown). The combination of CRE elements increased the expression of the luciferase reporter gene to a higher level compared to the individual CRE elements. In addition, we dissected out the gastrocnemius and quadriceps of the injected mice upon euthanasia and examined the bioluminescence activity using BLI. These results confirmed the whole body BLI results by showing an unexpected increase upon combining the CSk-SH5-CRE with one Dph-CRE (i.e. CRE-02), two Dph-CREs (i.e. CRE-02 + CRE-04) or three Dph-CREs (i.e. CRE-02 + CRE-04 + CRE- 06) (Figure 3).

Example 3: In vivo validation of the regulatory elements via a therapeutic transgene

To determine which CRE combinations were most potent, AAV vectors containing different CRE combinations were injected into CB17-SCID mice for comparative analysis. Different CRE combinations were evaluated for codon-optimized hpsg gene expression. More particularly, the following vectors were injected:

AAV-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2A, SEQ ID NO:21) AAV-CRE02-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2C, SEQ ID NO:23) AAV-CRE04-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2D, SEQ ID NO:24) AAV-CRE06-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2E, SEQ ID NO:25) AAV-CRE02-CRE04-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2F, SEQ ID NO:26) AAV-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2H, SEQ ID NO:28) AAV-CRE06-CRE04-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2J, SEQ ID NO:30) AAV-CRE02-CRE06-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2L, SEQ ID NO:32)

Experimental procedures

AAV vector production and purification

AAV vector production and purification was conducted as described in Example 2. The forward and reverse primers used to determine the vector titers by quantitative real-time PCR were 5'- AGGGATGGTTGGTTGGTGG-3' (SEQ ID NO: 50) and 5'- GGCAGGTGCTCCAGGTAAT -3' (SEQ ID NO: 51), respectively.

Animal studies

Adult 4-5 week old CB17-SCID mice were intravenously injected with a dose of 3xlO vg per mouse. 2-3 mice were injected per cohort. 26 days post-injection, one mouse per cohort was sacrificed and different organs were isolated from the sacrificed mice (isolation of muscles such as quadriceps, gastrocnemius, tibialis, triceps, biceps, diaphragm, heart, as well as non-muscle such as liver, kidney, brain, spleen). Q-RT-PCR was performed to quantify hpsgco gene expression. mRNA analysis

Total RNA was extracted from different organs of the mice by a silica-membrane based purification kit according to the manufacturer's instructions (Invitrogen Corp, Carlsbad, CA, USA). Subsequently, 200 ng of total RNA from each sample was subjected to reverse transcription (RT) using a cDNA synthesis kit (Invitrogen Corp, Carlsbad, CA, USA). Next, a cDNA amount corresponding to 10 ng of total RNA was amplified by quantitative(q) PCR on an ABI 7700 (Applied Biosystems, Foster City, CA, USA), using 5' -CAT C AC AAGTG AC ATCG G C A-3' (SEQ ID NO: 52) as a forward primer and 5'- TGGCAGCCCATGTTCTGGC-3' (SEQ ID NO: 53) as reverse primer (amplicon 217 bp). The hpsgco mRNA levels were normalized to mRNA levels of the endogenous murine glyceraldehyde-3- phosphate dehydrogenase (mGAPDH) gene, using 5'- T GT GT CCGT CGT GG AT CTG A-3' (SEQ ID NO:54) as forward primer and 5'- GCCTGCTTCACCACCTTCTTGA-3' (SEQ ID NO:55) as the reverse primer (amplicon 82 bp). RNA samples were amplified with and without reverse transcriptase to exclude DNA amplification. The size of the amplified PCR fragments was verified on a 1.8% agarose gel.

Results

The combination of the CSk-SH5-CRE with the dual Dph-CRE combinations CRE02 and CRE04, CRE04 and CRE06, in any order, and CRE02 and CRE06, or the tripe Dph-CRE combination CRE02, CRE04 and CRE06 resulted in an unexpected and disproportionate increase in hpsgco gene expression, compared to the CSk-SH5-CRE alone, or the combination of CSk-SH5-CRE with a single Dph-CRE (Fig. 4). Certain CRE combinations are more potent than others for enhancing hpsgco gene expression.

Example 4: In vivo validation of the regulatory elements via a therapeutic transgene

To determine which CRE combinations were most potent, AAV vectors containing different CRE combinations were injected into CB17-SCID mice for comparative analysis of hpsgco gene expression. The following vectors were injected:

-AAV-SPc5-12-MVM-hB-SGco-pA (Fig. 2Y, SEQ ID NO:45) (SPc5-12) -AAV-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2A, SEQ ID NO:21) (CSkSH5-SPc5-12) -AAV-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2H, SEQ ID NO:28) (Dph- CRE04-Dph-CRE06-CSkSH5-SpC5-12)

-AAV-CRE02-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hB-SGco-pA (Fig. 2N, SEQ ID NO:34) (Dph-CRE02-Dph-CRE04-Dph-CRE06-CSkSH5-SpC5-12)

Experimental procedures

The experimental procedures are the same as described in Example 3 with some modification in the animal study design. Briefly, adult 4-5 week old CB17-SCID mice were intravenously injected with a dose of 3xlO vg per mouse. 4-5 mice were injected per cohort. Two months post-injection, one mouse per cohort was sacrificed and different organs were isolated from the sacrificed mice (isolation of muscles such as quadriceps, gastrocnemius, tibialis, triceps, biceps, diaphragm, heart, as well as non-muscle such as liver, kidney, brain, spleen). Q-RT-PCR was performed to quantify hpsgco gene expression.

Results

Tables 2-4 show hBSGco expression in gastrocnemius, quadriceps and heart of the tested vectors relative to each other. Relative hBSGco expression is shown as fold difference.

Table 2: Relative hBSGco expression in gastrocnemius.

Table 3: Relative hBSGco expression in quadriceps.

Table 4: Relative hBSGco expression in heart.

The results shown in Fig. 5 and Tables 2-4 show a significant increase in skeletal muscle or heart- specific mRNA expression of the hpsgco gene, whenever the vectors contained a CRE compared to the control vector (devoid of any CRE, designated as SPc5-12 in Fig. 5 and Tables 1-3). Most importantly, the combination of the CSk-SH5-CRE and the dual Dph-CRE combination CRE04 and CRE06 or the triple Dph-CRE combination CRE02, CRE04 and CRE06 resulted in an unexpected and disproportionate increase in hpsgco mRNA expression levels, up to approximately 100-fold or 30- fold in skeletal muscle and heart, respectively, compared to the control Spc5-12 promoter devoid of any CRE.

Example 5: In vivo evaluation of effect of codon-optimization of the transgene

To determine if codon-optimization of the hpsg gene can further increase hpsg protein levels when compared to the non-codon-optimized gene, three pairs of AAV vectors containing either codon- optimized or wild type hpsg gene are compared side by side. The three pairs of vectors are: i) pAAV-SPc5-12-MVM-hp-SGwt-SynthpA (Fig. 2X, SEQ ID NO:44) versus pAAV-SPc5-12- MVM- hp-SGco-SynthpA (Fig. 2Y; SEQ ID NO:45) ii) pAAV-CSk-SH5-SPc5-12-MVM-hp-SGwt-SynthpA (Fig. 2B, SEQ ID NO:22) versus pAAV-CSk- SH5-SPC5-12-MVM- hp-SGco-SynthpA (Fig. 2A, SEQ ID NO:21) iii) pAAV-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hp-SGwt-SynthpA (Fig. 2H, SEQ ID NO:28) versus pAAV-CRE04-CRE06-CSk-SH5-SPc5-12 -MVM-hp-SGco-SynthpA (Fig. 21, SEQ ID NO:29)

The AAV vectors are produced, titered and injected into 4-5 weeks old male CB17 mice as described in Example 3. Each mouse is injected with a dose of 3xlO vg; 4 to 5 mice are injected per cohort. Two months post injection, mice are sacrificed and different organs are isolated from the sacrificed mice (isolation of muscles such as quadriceps, gastrocnemius, tibialis, triceps, biceps, diaphragm, heart, as wells as non-muscle tissues such as liver, kidney, brain, spleen). A Western blot is performed on the different isolated tissues of mice injected with AAV vectors encoding either wt or codon-optimized hpsg gene.

Example 6: In vivo evaluation of the regulatory elements in SGCB null mice

Therapeutic efficacy of the CRE combination Dph-CRE02-Dph-CRE04-Dph-CRE06-CSkSFI5 (SEQ ID NO:14) was evaluated in homozygous sarcoglycan, beta (dystrophin-associated glycoprotein) (Sgcb) targeted mutant mice by gene therapy using a vector comprising p-sarcoglycan-encoding transgene. Sgcb-null mice, with knocked-out p-sarcoglycan, develop severe muscular dystrophy as in type 2E human limb girdle muscular dystrophy.

2-3 days old neonatal Sgcb-null male mice were injected with experimental vector AAV-CRE02- CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hR-SGco-pA (Fig. 2N, SEQ ID NO: 34) or the control vector AAV-SPc5-12-MVM-hR-SGco-pA devoid of CRE (Fig. 2Y, SEQ ID NO: 45) at a vector dose of lxlO 11 vg with 3 mice per cohort.

6 months post vector injection, mice were subjected to a treadmill endurance assay to monitor muscle performance. Phosphate-buffered saline (PBS) -injected Sgcb-null mice were used as controls that were not treated by gene therapy. The running distance was determined using a Bioseb (France) treadmill using the TREAD SOFTWARE. Results are shown in Figure 6.

Mice injected with the experimental vector AAV-CRE02-CRE04-CRE06-CSk-SH5-SPc5-12-MVM-hR- SGco-pA run a longer distance (average distance 364 meter) as compared to mice injected with the control vector devoid of CREs (designated as AAV-SPc5-12-MVM-hR-SGco-pA) (average distance 148 meter). Mice injected with the experimental vector also run a 5 times longer distance than control mice that were injected with PBS (average distance 71 meter).

These results show that the gene therapy was efficient in Sgcb-null mice, and that the CRE combination increased therapeutic efficacy.