METHODS OF TREATING DISEASES ASSOCIATED WITH REPEAT INSTABILITY

Title:

METHODS OF TREATING DISEASES ASSOCIATED WITH REPEAT INSTABILITY

Document Type and Number:

WIPO Patent Application WO/2018/029660

Kind Code:

Abstract:

Methods of treating diseases caused by repeat DNA instability are described herein. The methods described herein can inhibit the further expansion of repeat DNA and, in some instances, reduce the size of the repeat DNA (e.g., reduce the number of repeats).

More Like This:

WO/2009/105717	COMPOUNDS AND COMPOSITIONS AS MODULATORS OF GPR119 ACTIVITY
WO/2004/012671	SUBSTITUTED FURO [2,3-B] PYRIDINE DERIVATIVES
WO/2007/024182	A COMBINATION OF COMPOUNDS, WHICH CAN BE USED IN THE TREATMENT OF RESPIRATORY DISEASES, ESPECIALLY CHRONIC OBSTRUCTIVE PULMONARY DISEASE (COPD) AND ASTHMA.

Inventors:

PEARSON CHRISTOPHER E (CA)
NAKAMORI MASAYUKI (JP)
NAKATANI KAZUHIKO (JP)

Application Number:

PCT/IB2017/054932

Publication Date:

February 15, 2018

Filing Date:

August 12, 2017

Export Citation:

Click for automatic bibliography generation Help

Assignee:

HOSPITAL FOR SICK CHILDREN (CA)
UNIV OSAKA (JP)
PEARSON CHRISTOPHER E (CA)
NAKAMORI MASAYUKI (JP)
NAKATANI KAZUHIKO (JP)

International Classes:

A61K31/4375; A61P25/18; A61P25/28

Other References:

JINXING LI ET AL.: "NAPHTHYRIDINE-BENZOAZAQUINOLONE: EVALUATION OF A TRICYCLIC SYSTEM FOR THE BINDING TO (CAG)n REPEAT DNA AND RNA", CHEMISTRY AN ASIAN JOURNAL, vol. 11, no. 13, 5 July 2016 (2016-07-05), pages 1971 - 1981, XP002776072
HAGIHARA M. ET AL.: "SMALL MOLECULE MODULATES HAIRPIN STRUCTURES IN CAG TRINUCLEOTIDE REPEATS", CHEMBIOCHEM, vol. 12, 2011, pages 1686 - 1689, XP002776073
"Genetic Modifiers of Huntington's Disease (GeM-HD) Consortium", CELL, vol. 162, 2015, pages 516 - 526
BETTENCOURT ET AL., ANNALS NEUROL., 2016
PEARSON ET AL., NUC. ACIDS RES., vol. 30, 2002, pages 4534 - 47
LEE ET AL., BMC SYST. BIOL., vol. 4, 2010, pages 29
NIELSEN ET AL., BIOCHEM., vol. 27, 1988, pages 67 - 73
BARCEIO ET AL., BIOCHEM., vol. 30, 1991, pages 4863 - 73
CARLSSON ET AL., NUC. ACIDS RES., vol. 23, 1995, pages 2413 - 20
FOX ET AL., NUC. ACIDS RES., vol. 16, 1988, pages 2489 - 507
PANIGRAHI ET AL., J. BIOL. CHEM., vol. 277, 2002, pages 13926 - 34
CLEARY ET AL., NAT. GENET., vol. 31, no. 1, 2002, pages 37 - 46
LI ET AL., CHEM. ASIAN J., vol. 11, 2016, pages 1971 - 1981
PANIGRAHI ET AL., PNAS USA, vol. 107, 2010, pages 12593 - 9
HOU ET AL., NATURE STRUCT. & MOL. BIOL., vol. 16, 2009, pages 869 - 75
PLUCIENNIK ET AL., PNAS USA, vol. 110, 2013, pages 12277 - 82
NAKATANI ET AL., NAT. CHEM. BIOI., vol. 1, 2005, pages 39 - 43
HAGIHARA ET AL., NUC. ACIDS SYMP. SER. (OXF, vol. 50, 2006, pages 147 - 8
HAGIHARA ET AL., CHEMBIOCHEM., vol. 12, 2011, pages 1686 - 9
"Remington: The Science and Practice of Pharmacy", 2005, LIPPINCOTT WILLIAMS & WILKINS, article ED. 21TH
"The Pharmacological Basis of Therapeutics", 2001, MCGRAW-HILL CO, article 12TH ED.
"Handbook of Pharmaceutical Additives", 2002, SYNAPSE INFORMATION RESOURCES, INC.
LOPEZ CASTEL ET AL., HUM. MOL. GENET., vol. 20, 2011, pages 1 - 15
SERIOLA ET AL., HUM. MOL. GENET., vol. 20, 2011, pages 176 - 85
DE ROOIJ ET AL., HUM. GENET., vol. 95, 1995, pages 270 - 4
TELENIUS ET AL., NAT. GENET., vol. 6, 1994, pages 409 - 14
AOKI ET AL., CLIN. GENET., vol. 50, 1996, pages 199 - 201
TANAKA ET AL., J. NEUROL. SCI., vol. 135, 1996, pages 43 - 50
CANCEL ET AL., HUM. MUTAT., vol. 11, 1998, pages 23 - 7
YOON ET AL., BRAIN, vol. 139, 2016, pages e20
CLEARY; PEARSON, CYTOGENET. GENOME RES., vol. 100, 2003, pages 25 - 55
ABELIOVICH ET AL., AM. J. HUM. GENET., vol. 52, 1993, pages 1175 - 81
WOHRLE ET AL., HUM. MOL. GENET., vol. 4, 1995, pages 1147 - 53
PETERLIN ET AL., PFLUGERS ARCH., vol. 431, no. 6, 1996, pages R199 - 200
ANVRET ET AL., HUM. MOL. GENET, vol. 2, 1993, pages 1397 - 400
THORNTON ET AL., ANN. NEUROL., vol. 35, 1994, pages 104 - 7
ISHII ET AL., HUM. GENET., vol. 98, 1996, pages 138 - 40
NAKATANI ET AL., NAT. CHEM. BIOL., vol. 1, 2005, pages 39 - 43
HAGIHARA, CHEM. BIO. CHEM., vol. 12, 2011, pages 1686 - 9
SATHASIVAM ET AL., HUM. GENET., vol. 99, 1997, pages 692 - 5
SATHASIVAM ET AL., HUM. GENET., vol. 99, 1997, pages 6692 - 5
TOME ET AL., PLOS GENET., vol. 9, 2013, pages el003280
NAKAMORI ET AL., NEUROMUSCUL. DISORD., vol. 19, 2009, pages 759 - 62
NAKAMORI ET AL., MOL. THER., vol. 19, 2011, pages 2222 - 7
KREMER ET AL., AM. J. HUM. GENET., vol. 57, 1995, pages 343 - 50
BROOK ET AL., CELL, vol. 69, 1992, pages 385
KOOB ET AL., CELL, vol. 21, 1999, pages 379 - 84
DIETMAIER, CANCER RES., vol. 57, 1997, pages 4749 - 56
KABBARAH ET AL., MOL. CARCINOGEN., vol. 38, 2003, pages 155 - 9
PEARSON ET AL., HUM. MOL. GENET., vol. 6, 1997, pages 1117 - 23
AXFORD ET AL., PLOS GENET., vol. 9, 2013, pages e1003866
PEARSON ET AL., BIOCHEM., vol. 35, 1996, pages 5041 - 53
NUC. ACIDS RES., vol. 26, no. 3, 1998, pages 816 - 23
PEARSONS ET AL., NUC. ACIDS RES., vol. 30, 2002, pages 4534 - 47
AXFORD ET AL., PLOS GENET., vol. 9, 2013, pages e1 003866
NAKATAM ET AL., NAT. CHEM. BIOL., vol. 1, 2005, pages 39 - 43
PANIGRAHI ET AL., NAT. STRUCT. & MOL. BIOL., vol. 12, 2005, pages 654 - 62
PANIGRAHI ET AL., PNAS USA, vol. 107, 2010, pages 12593 - 8
REDDY ET AL., NUC. ACIDS RES., vol. 42, 2014, pages 10473 - 87
DEARY ET AL., NAT. GENETICS, vol. 31, 2002, pages 37 - 46
PAMGRAHI ET AL., J. BIOL. CHEM., vol. 277, 2002, pages 13926 - 34
BUISSON ET AL., NAT. STRUCT. MOL. BIOL., vol. 17, no. 10, 2010, pages 1247 - 54
LIU ET AL., SCIENCE, vol. 329, 2010, pages 693 - 6
ZHOU ET AL., PLOS ONE, vol. 7, no. 6, 2012, pages e39156
BINZ ET AL., METHODS ENZYMOL., vol. 409, 2006, pages 11 - 38
MASON ET AL., BIOCHEM., vol. 49, 2010, pages 5919 - 28
HAGIHARA ET AL., CHEM. BIO. CHEM., vol. 12, 2011, pages 1686 - 9
TOME ET AL., PLOS GENET., vol. 9, 2013, pages el 003280
AXFORD ET AL., PLOS GENET., vol. 9, 2013, pages el003866
PEARSON ET AL., NUC. ACIDS RES., vol. 26, 1998, pages 816 - 23
NEILSEN ET AL., BIOCHEM., vol. 27, 1988, pages 67 - 73
BARCELO ET AL., BIOCHEM., vol. 30, 1991, pages 4863 - 73
FOX; WOOLLEY, BIOCHEM. PHARMACOL., vol. 39, 1990, pages 941 - 8
NAKAMORI ET AL., HUMAN MOL. GENET., vol. 20, 2011, pages 580 - 8
LIN ET AL., PNAS USA, vol. 107, 2010, pages 692 - 7
HOU ET AL., NATURE STRUCT. MOL. BIOL., vol. 16, 2009, pages 869 - 75
LOPEZ CASTEL ET AL., NATURE REVIEWS, MOL. CELL. BIOL., vol. 11, 2010, pages 165 - 170
KENNEDY ET AL., HUMAN MOL. GENET., vol. 12, 2003, pages 3359 - 67
GOULA ET AL., PLOS GENET., vol. 5, 2009, pages e1000749
LARSON ET AL., NEUROBIOL. DIS., vol. 76, 2015, pages 98 - 111
KOVALENKO ET AL., PLOS ONE, vol. 7, 2012, pages e44273
LEE ET AL., PLOS ONE, vol. 6, 2011, pages e23647
MOLLERSEN ET AL., PLOS GENET., vol. 6, 2010, pages el 001242
YANG, AM. J. HUM. GENET., vol. 73, 2003, pages 1092 - 105
GOMES-PEREIRA ET AL., MUTAT. RES., vol. 598, 2006, pages 15 - 34
BERNAT ET AL., NEURON, vol. 87, 2015, pages 28 - 46

Attorney, Agent or Firm:

DEYOUNG, Janice Kugler et al. (US)

Download PDF:

View/Download PDF PDF Help

Claims:

WHAT IS CLAIMED IS:

1. A method of inhibiting the expansion of a repeat DNA sequence in a cell, comprising:

contacting the cell with naphthyridine-azaquinolone (NA).

2. The method of claim 1 , wherein the contacting is in vivo.

3. The method of claim 1 or 2, wherein the contacting step is performed a plurality of times.

4. The method of any of the preceding claims, further comprising, prior to the contacting step, determining the number of repeats within the repeat DNA sequence.

5. The method of any of the preceding claims, further comprising, after the contacting step, determining the number of repeats within the repeat DNA sequence.

6. The method of any of the preceding claims, wherein the cells are contacted with an amount of NA that is dependent on the number of repeats within the repeat DNA sequence.

7. The method of any of the preceding claims, wherein the NA is a modified NA.

8. A method of reducing the number of repeats within a repeat DNA sequence in the genome of an individual, comprising:

administering at least one dose of a therapeutic amount of naphthyridine- azaquinolone (NA) to the individual.

9. The method of claim 8, wherein the NA is administered directly into the affected tissue.

10. The method of claim 8, wherein the NA is administered systemically.

11. The method of claim 8 or 9 or 10, wherein the administration is via injection.

12. The method of any of claims 8 - 11, wherein the NA is administered a plurality of times.

13. The method of any of claims 8 - 12, wherein a plurality of doses of NA are administered.

14. The method of any of claims 8 - 13, further comprising repeating the administering step a plurality of times.

15. The method of any of claims 8 - 14, wherein the therapeutic amount of NA is based on the number of repeats.

16. The method of any of claims 8 - 15, wherein the therapeutic amount of NA is about 0.01 uM to about 1 M

17. The method of any of claims 8 - 16, further comprising identifying an individual having a repeat DNA sequence.

18. The method of any of claims 8 - 17, further comprising determining the number of repeats in one or more cells from the individual.

19. The method of any of claims 8 - 18, further comprising monitoring the number of repeats in one or more cells from the individual.

20. The method of any of claims 8 - 19, wherein the NA is modified to increase its in vivo stability.

21. The method of any of claims 8 - 20, wherein the NA is delivered via a liposome or an intracranial pump.

22. A method of treating or preventing a disease in an individual caused by expansion of a repeat DNA sequence, comprising:

administering at least one dose of a therapeutic amount of naphthyridine- azaquinolone (NA) to the individual.

23. The method of claim 22, further comprising identifying an individual having a disease caused by repeat DNA instability.

24. The method of claim 22 or 23, wherein the disease caused by expansion of a repeat DNA sequence is selected from the group consisting of Huntington's disease (HD), Huntington's disease-like 2 (HDL2), myotonic dystrophy (DM1), Spinocerebellar ataxia type 1 (SCA1), SCA2, SCA3, SCA6, SCA7, SCA8, SCA12, SCA17, Spinal and bulbar muscular atrophy (SBMA), Dentatorubropallidoluysian atrophy (DRPLA), Fuch's Endothelial Corneal Dystrophy 2 (FECD2), schizophrenia, bipolar disorder (KCNN3), and breast cancer risk factor AIBl.

25. The method of any of claims 22-24, wherein the NA is administered prior to expansion of the repeat DNA sequence.

26. The method of any of claims 22- 24, wherein the administering step occurs prior to expansion of the repeat DNA sequence.

27. The method of any of claims 22-26, wherein the NA is administered to the individual prior to birth (in utero).

28. The method of any of claims 22 - 24, wherein the NA is administered to the individual following expansion of the repeat DNA sequence.

Description:

METHODS OF TREATING DISEASES ASSOCIATED WITH

REPEAT INSTABILITY

TECHNICAL FIELD

This disclosure generally relates to methods of treating diseases associated with repeat expansions.

BACKGROUND

Gene-specific CAG/CTG trinucleotide repeat expansions are responsible for at least 16 of the >40 neurodegenerative diseases caused by unstable repeats, including Huntington's disease (HD) and myotonic dystrophy (DM1). Ongoing repeat expansions occurring in affected tissues correlate with disease age-of-onset, severity, and progression. Dramatic repeat length variations exist between tissues of the same individual, with differences >5,000 repeats, with the largest expansions in heart, cerebral cortex and striatum The considerably larger expansions in the clinically affected tissues of individuals further correlate ongoing somatic expansions with disease onset, severity and progression. Recent studies reveal that, for at least six of the sixteen CAG diseases (HD, SCA1, SCA2, SCA3, SCA7, & SCA17), DNA repair proteins are major modifiers of age-of-onset (Genetic Modifiers of Huntington's Disease (GeM-HD) Consortium, 201S, Cell, 162:516-526; Bettencourt et al., 2016, Annals Neurol., in press), lending further support to the correlation between ongoing somatic expansions and age-of-onset This association is likely to be true for all 16 CAG diseases, each of which show somatic expansions, as well as other diseases associated with repeat instabilities.

Thus, methods of arresting or reversing somatic repeat expansions could be used to arrest or reverse disease progression and would be extremely beneficial in a therapeutic setting.

SUMMARY

This disclosure provides for methods of treating diseases caused by repeat DNA instability. This disclosure also provides for methods of inhibiting the further expansion of repeat DNA and, in some instances, reducing the size of the expanded repeat DNA (e.g., reducing the number of repeats).

In one aspect, a method of inhibiting the expansion of a repeat DNA sequence in a cell is provided. Such a method typically includes contacting the cell with naphtnyridine- azaquinolone (NA).

In some embodiments, the contacting is in vivo. In some embodiments, the contacting step is performed a plurality of times. In some embodiments, the method further includes, prior to the contacting step, determining the number of repeats within the repeat DNA sequence. In some embodiments, the method further includes, after the contacting step, determining the number of repeats within the repeat DNA sequence.

In some embodiments, the cells are contacted with an amount of NA that is dependent on the number of repeats within the repeat DNA sequence. In some embodiments, the NA is a modified NA

In another aspect, a method of reducing the number of repeats within a repeat DNA sequence in the genome of an individual is provided. Such a method typically includes administering at least one dose of a therapeutic amount of naphthyridine-azaquinolone (NA) to the individual.

In some embodiments, the NA is administered directly into the affected tissue. In some embodiments, the NA is administered systemicalry. In some embodiments, the administration is via injection.

In some embodiments, the NA is administered a plurality of times. In some embodiments, a plurality of doses of NA are administered. In some embodiments, the method further includes repeating the administering step a plurality of times.

In some embodiments, the therapeutic amount of NA is based on the number of repeats. In some embodiments, the therapeutic amount of NA is about 0.01 uM to about 1 M.

In some embodiments, the method further includes identifying an individual having a repeat DNA sequence. In some embodiments, the method further includes determining the number of repeats in one or more cells from the individual. In some embodiments, the method further includes monitoring the number of repeats in one or more cells from the individual. In some embodiments, the NA is modified to increase its in vivo stability. In some embodiments, the NA is delivered via a liposome or an intracranial pump.

In still another aspect, a method of treating or preventing a disease in an individual caused by expansion of a repeat DNA sequence is a provided Such a method typically includes administering at least one dose of a therapeutic amount of naphthyridine- azaquinolone (NA) to the individual.

In some embodiments, such a method further includes identifying an individual having a disease caused by repeat DNA instability. Representative diseases caused by expansion of a repeat DNA sequence include, without limitation, Huntington's disease (HD), Huntington's disease-like 2 (HDL2), myotonic dystrophy (DM1), Spinocerebellar ataxia type 1 (SCA1), SCA2, SCA3, SCA6, SCA7, SCA8, SCA12, SCA17, Spinal and bulbar muscular atrophy (SBMA), Dentatorubropallidoluysian atrophy (DRPLA), Fuch's Endothelial Corneal Dystrophy 2 (FECD2), schizophrenia, bipolar disorder (KCNN3), and breast cancer risk factor AIBl.

In some embodiments, the NA is administered prior to expansion of the repeat DNA sequence. In some embodiments, the administering step occurs prior to expansion of the repeat DNA sequence. In some embodiments, the NA is administered to the individual prior to birth (in utero). In some embodiments, the NA is administered to the individual following expansion of the repeat DNA sequence.

In one aspect, a method of treating an individual having a disease caused by an expanded repeat DNA sequence is provided. Such a method typically includes administering at least one dose of a therapeutic amount of mphthyridine-azaquinolone (NA) to the individual.

In another aspect, a method of reducing the number of repeats within a repeat DNA sequence in an individual is provided. Such a method typically includes administering at least one dose of a therapeutic amount of naphthyridine-azaquinolone (NA) to the individual

In still another aspect, a method of inhibiting the expansion of a repeat DNA sequence in an individual is provided. Such a method typically includes administering at least one dose of a therapeutic amount of naphthyridine-azaquinolone (NA) to the individual. In some embodiments, the administration In some embodiments, the administration is via injection. In some embodiments, the administration is systemic.

In some embodiments, the dose of a therapeutic amount of NA is administered more man once. In some embodiments, the dose of a therapeutic amount of N A is administered in a dose-dependent manner based upon the number of repeats. In some embodiments, the dose of a therapeutic amount of NA is about 0.01 μΜ to about 1 M In some embodiments, any of the methods described herein further includes repeating the administering step a plurality of times.

In some embodiments, any of the methods described herein further includes identifying an individual having a disease caused by repeat DNA instability. In some embodiments, any of the methods described herein further includes identifying an individual having a repeat expansion. In some embodiments, any of the methods described herein further includes determining the size of the repeat in cells from the individual. In some embodiments, any of the methods described herein further includes monitoring the size of the repeat in cells from the individual.

In some embodiments, the NA is modified to increase its vr ^'vo stability. In some embodiments, the NA is delivered via a liposome or an intracranial pump.

In some embodiments, the disease caused by trinucleotide repeat DNA instability includes Huntington's disease (HD), Huntington's disease-like 2 (HDL2), myotonic dystrophy (DM1), Spinocerebellar ataxia type 1 (SCA1), SCA2, SCA3, SCA6, SCA7, SCA8, SCA12, SCA17, Spinal and bulbar muscular atrophy (SBMA), Dentatorubropallidoluysian atrophy (DRPLA), Fuch's Endothelial Corneal Dystrophy 2 (FECD2), schizophrenia, bipolar disorder (KCNN3), or breast cancer risk factor AD31.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the methods and compositions of matter belong. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the methods and compositions of matter, suitable methods and materials are described below. In addition, the materials, methods, and examples are illustrative only and not intended to be timiting. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety.

DESCRIPTION OF DRAWINGS

Figure 1 A shows the structure of naphthyridine-azaquinolone (NA).

Figure IB shows the chemical reactions that were used to label NA with NBD.

Figure 2 are the results of experiments showing that NA binds to long CAG slip-outs. Panel A of Figure 2 shows the structure of NA comprising of two heterocycles,

naphthyridine (in red) and 8-azaquinolone moiety (in blue). Panel B of Figure 2 shows a schematic illustration of the NA-(CAG)●(CAG) triad complex. The (CAG)n DNA sequence (left) can fold into an hairpin structure involving mismatched A-A pair flanking C-G base pairs (middle). NA molecules can intercalate into the DNA helix, with the 2-amino-l,8- naphthyridine moiety (in red) hydrogen bonding to guanine and 8-azaquinolone moiety (in blue) hydrogen bonding to adenine. Panel C of Figure 2 demonstrates the binding of NA to gel-purified DNA fragments with repeats in both strands flanked by 59 bp and 54 bp of non- repetitive DNA upstream and downstream of the CTG tract ³²P-labeled on both strands, where the repeats are (CAG)50*(CTG)50 in the fully-duplexed form, clustered short slip-outs on (CAG)50-(CTG)50, long (CAG)20 slip-outs from (CAG)50-(CTG)30, or long (CTG)20 slip-outs from (CAG)3O(CTG)50. Long slip-outs extruded as a single slip-out from a fully base-paired backbone of (CTG)n»(CAG)n, as described (Pearson et aL, 2002, Nuc. Acids Res., 30:4534-47). DNAs were mixed with increasing concentrations of NA (0.15 μM, 0.75 μM, 7.5 uM and 50 uM) and resolved on 4% polyacrylamide gels. All lanes are from the same gel, and they were separated for clarity. NA-DNA complexes are shown by brackets, free DNA is indicated by arrowheads. For both the linear and the (CTG)20 slip-out DNA, there was no NA binding (white arrowheads); at the higher concentration of NA, a decrease of the clustered short slip-outs DNA was observed (grey arrowheads), but it was not possible to define a clear shifted band; band-shift was evident for the (CAG)20 slip-out DNA (black arrowheads). Panel D of Figure 2 shows the quantification of NA binding. The relative migration was measured for the linear and the three slip-outs substrates as the ratio of the migration distance of each NA-DNA complex to the migration distance of the free DNA The migration distance was measured from the well to a consistent center point of each band. Densitometry analysis was performed for the S-DNA substrate. Graphs indicate the mean of three independent experiments and the corresponding standard deviatioa Panel E of Figure 2 shows that the gel-purified DNA heteroduplex fragment with a long (CAG)20 slip-out from (CAG)50»(CTG)30, ³²P-labeled on both strands, was heat denatured, snap-cooled on ice (to enhance intra-strand structure formation and inhibit inter-strand hybridization) and incubated in the presence or absence of 7.5 uMN A (left panel). The same DNAs were denatured by alkaline treatment and then rehybridized in presence or absence of 7.5 uM NA (right panel). DNAs were then resolved on a 4% polyacrylamide gel. Each DNA species is schematically indicated. NA-DNA complexes formed with both the (CAG)50 strand (both panels) and the heteroduplexed (CAG)5O(CTG)30 (right panel) are shown by brackets; free DNA is indicated by arrowheads. NA did not bind the (CTG)30 hairpin fragment in either experiment (white arrowheads). Notably, NA did not inhibit re-hybridization of

complementary strands.

Figure 3 shows that NA specifically inhibited repair of long CAG slip-outs by human cell extracts (HeLa cells). Panel A of Figure 4 is a schematic of the in vitro repair assay. Starting DNAs and repair products released the repeat-containing fragment, which is resolved on PAGE and assessed on a molar level by Southern blotting and densitometry. Repair of different DNA substrates containing a long (CAG)20 slip-out (Panel B of Figure 3) long (CTG)20 slip-out (Panel C of Figure 3), a single CAG slip-out (Panel D of Figure 3) or a single G-T mismatch (Panel E of Figure 3), in the absence or in the presence of NA

Slipped-DNA substrates were prepared from hybrids of (CAG)5O(CTG)30,

(CAG)3O(CTG)50 or of (CAG)50(CTG)49. Repair of the G-T mismatch reconstitutes a Hindm restriction she, as shown schematically. Graphs show percentage repair efficiencies to repaired product relative to all repeat-containing fragments in the lane, values are normalized to the NA-free efficiency. Values represent the mean of three to five independent experiments ± standard deviatioa NA inclusion inhibits the repair of the long (CAG)20 slip- out (Panel B of Figure 3), but doesn't affect the repair efficiency of all the other substrates (Panels C, D, and E of Figure 3).

Figure 4 are results of experiments showing NA cellular distribution, non-toxicity, and effects on repeat instability in HD patient cells. Panel A of Figure 4 shows the cellular distribution of NA in HT1080 cell model with (C AG)850. NBD-labeled NA (green) was distributed throughout the nuclei and cytoplasm. Nuclei and cell membranes were stained with DAH (blue) and Cell Light Plasma Membrane-RFP (red), respectively. Scale bar represents 20 urn. Panel B of Figure 4 shows the cell toxicity of HT108O-(CAG)850 cells treated with NA for 72 hr. The cell viability was estimated with WST-1 assays. Values represent the means of three independent measures ±SD. Panel C of Figure 4 shows the growth curves of HT1080-(CAG)850 cells treated with or without SO μΜ NA. Error bars indicate the SD of triplicate experiments. Panels D & E of Figure 4 show that repeat instability was analyzed by small-pool-PCR across the HD repeat tract (see Table 2).

Histograms show repeat-length distributions in human HD primary fibroblasts GM09197 with (CAG)180 or GM02191 with (CAG)43, after 40 days growth with or without NA. The frequency distribution of repeat alleles is indicated as gray bars. The dashed line indicates the peak C AG size. Allele lengths are grouped in bins spanning 10 repeats. More than 230 alleles were sized per group. Percentage of repeat population was calculated by dividing the number of alleles grouped in bins spanning 10 repeats by the number of total alleles. Shown is a summary of 3 independent experiments (see Figures 10 & 11). Panels G& H of Figure 4 show the average repeat size in HD fibroblasts after 40 days incubation with or without NA. * P < 0.001, Student's t test Panel H shows repeat-tract lengths of the CASK, HTT (normal allele), and Mfdl 5 loci in HD fibroblasts (after 40 days incubation with or without NA) (see also Figure 12A & 12C). Length variation was not observed at any of these non-expanded.

Figure S are results from experiments showing that NA induces CAG contractions in HT1080-(CAG)8S0 cells, independent of proliferation, dependent upon rCAG transcriptioa Panel A of Figure 5 shows a schematic transcription bubble, and representative data showing small-pool-PCR CAG/CTG repeat length analysis of HT1080-(CAG)850 cells (initial cell clone and cells after 30 days incubation with or without NA). Scale at left shows molecular weight markers (M) converted into repeat number for CAG-repeat fragments of equivalent size. Panel B of Figure 5 are histograms showing repeat length distributions in HT1080- (CAG)850 cells. Frequency distribution of unstable alleles is shown by gray bars.

Frequency of stable alleles is shown by black bars. To facilitate comparisons, a dashed line indicates the unchanged CAG size. Allele lengths are grouped in bins spanning SO repeats. Percentage of repeat population was calculated by dividing the number of alleles grouped in bins spanning SO repeats by the number of total alleles. More than SO alleles were sized for each group. P-value was calculated by chi-square test (Table 2). Shown is a summary of 3 independent experiments (see Figures 10 & 11). Panel C of Figure S shows the average repeat size change in HT1080-(CAG)850 cells after 30 days incubation with or without NA. P-value was calculated by Student's t-test Panel D shows the repeat length distributions in non-proliferative HT108O-(CAG)850 cells with and without NA treatment, with average repeat size change shown below (see also Figure ISA & 1SB). NA still has an effect, demonstrating that its activity is independent of cell proliferation. Panel E of Figure S shows proliferating HT1080-(CAG)850 cells that do not transcribe the repeat (Figure 16A) after 30 days with and without NA. Repeat length distributions and average repeat size changes are shown. NA has no effect, showing dependence upon transcription. Panel F of Figure 5 shows that NA does not block transcription across the expanded CAG tract (see also Figure 10A & 10B). RNA transcript levels of transgene (transcript in the CAG-direction) in HT1080-(CAG)850 cells treated with or without NA (SO μΜ). Quantitative reverse transcription (RT)-PCR was performed using TaqMan Gene Expression assays. NA (SO uM) did not affect transcription of the CAG-repeats containing transcript Data are the mean ± SD of triplicates. Panel G of Figure S shows the repeat-tract lengths of the CASK, ATXN8, and MfdlS loci in HT1080-(CAG)8S0 cells (initial clone and cells after 30 days incubation with or without NA). Length variation was not observed at any of these repeats of normal length, loci in HT1080-(CAG)850 cells (after 30 days incubation with or without NA) (see also Figure 12A & 12C).

Figure 6 are the results of experiments showing that NA induces CAG contractions in R672 mouse striatum. Panel A of Figure 6 is an analysis of CAG length and shows the distribution of CAG repeat lengths in striatum from six representative R6/2 mice with one (mouse i), two (mice ii and iii) or four injections of NA (mice iv, v, vi) over a four-week period. NA, dissolved in saline, was injected into left striatum (blue) and saline only was injected into right striatum (red) (see Figure 24). NA does not affect transcription across the CAG transgene (Figure 1 SB). All mice treated 4- times are shown in Figure 25A-25H Panel B of Figure 6 shows the effect of one, two or four NA injections, reflected by Contraction and Expansions instability indices, calculated as described (Lee et al., 2010, BMC Syst Biol., 4:29) (see, also, Figure 27). Panel C of Figure 6 shows the effect of one, two or four NA injections reflected by Relative composition of contractions and expansions, calculated as described (see also Figure 27). Panel D of Figure 6 is CAG length analysis showing the distribution of CAG repeat lengths in striatum, frontal cortex, cerebellum and tail from one representative R6/2 mouse (mouse vi) following four injections of NA into the left striatum over a four-week period. DN As were isolated from the left (blue) and right (red) sides of striatum, frontal cortex, and cerebellum, and from the tail before (red) and after (green) NA treatment Notably, CAG repeats in the NA-treated half of the striatum were shorter than the inherited length in the tail (estimated progenitor allele length based upon the CAG length in the tail, as this length does not change from birth over the life of the mice; ePAL=l 59). Repeat size change brackets, with the first number representing the NA-induced contractions of the major peak relative to the somatic expansions without NA, and the second number representing the contractions relative to the inherited allele. These brackets do not account for the size changes in the second mode of the bimodal distribution in the striatum (see also Figure 25A-25H). Panel E of Figure 6 shows the Instability Indices in various tissues shown in D, where red and blue diamonds represent values of saline-treated/right side and NA- treated/left side of the striatum, respectively (see also Figure 25A-25H & Figure 27B). Panel F of Figure 6 shows the repeat-tract lengths of the Mapkapl and Fgd4 loci in both sides of striatum from R6/2 mouse with four injections (see also Figure 12A & 12C). Length variation was not observed at any of these repeats of normal length.

Figure 7 shows high-resolution polyacrylamide gel electrophoretic separation of NA- bound CAG slip-outs. 1 pmol of gel-purified DNA heteroduplex with a long (CAG)20 slip- out, from (CAG)50*(CTG)30, ³²P-labeled on both strands, was mixed with increasing concentrations of NA (0.7S μM, 7.S μM, SO μΜ, 100 μM 2, S0 μΜ, 500 μΜ, 1000 uM and 1500 uM) and resolved on 4% polyacrylamide gels (left panels) or 8% polyacrylamide gels (right panels). NA-DNA complexes are shown by brackets, free DNA is indicated by arrowheads. The relative mobility shift (Rm) was measured (graph panel) as the ratio of the migration distance of each NA-DNA complex to the migration distance of the free DNA. The migration distance was measured with a ruler from the well to a consistent centre point of each band. Histograms indicate the mean of three independent experiments and the corresponding standard deviation. NA-binding to (CAG)S0*(CTG)30 caused a band- broadening, previously observed for other DNA-binding ligands (Nielsen et al., 1988, Biochem., 27:67-73; Barcelo et al., 1991, Biochem., 30:4863-73; Carlsson et al., 199S, Nuc. Acids Res., 23:2413-20; Fox etal., 1988, Nuc. Acids Res., 16:2489-507).

Figure 8 shows binding of NA to DNA substrates with either no repeats (left panel) or (CAG)n repeats in one strand, ³²P-labeled on the common strand. Slip-outs of 3, 5, 11 or 15 CAG repeats extruded as a single slip-out from a fully base-paired backbone (see top panel, DNAs sequence and structure). One pmol of each substrate was mixed with increasing concentrations of NA (IS uM and SO uM) and resolved on 4% poly aery lamide gels. All lanes are from the same gel, and they were separated for clarity. NA-DNA complexes are shown by brackets, free DNA is indicated by arrowheads. For the linear substrate, there is no NA binding (white arrowheads); at the higher concentration of NA, a small decrease of the short slip-outs DNAs (3 and S repeats) is observed (grey arrowheads), but it is not possible to define a clear shifted band; band-shift is evident for the (CAG)n and (CAG) is slip-out DNAs (black arrowheads). The relative mobility shift was measured (graph panel) as the ratio of the migration distance of each NA-DNA complex to the migration distance of the free DNA. The migration distance was measured from the well to a consistent centre point of each band. Histograms indicate the mean of three independent experiments and the corresponding standard deviatioa P-values were calculated by Student's t test

Figure 9 shows that NA does not affect replication efficiency or replication fork progression, regardless of the presence of a CAG/CTG tract or replication direction. Three circular plasmids containing the SV40 origin of replication, and an expanded

(CAG)79»(CTG)79 repeat tract (pDM79EF and pDM79HF) or no repeats (pKN16), were replicated in vitro by human HeLa cell extracts and added recombinant SV40 T-Ag, without or with NA (7.S uM or 15 uM) treatment. The location of SV40-on determines the replication direction and which strand will be used as the leading or the lagging strand template. pDM79HF uses the CAG strand as the lagging strand template, while pDM79EF uses the CTG strand as the lagging strand template (schematic on the top of the gel panel). Replication products were purified and linearized with BamHL An equal portion of the reaction material was also digested with BamHl and DpnI; Dpnl digests un-replicated and partially-replicated material, as shown in the schematic (top figure). The digestion products were electrophoresed on a 1% agarose gel to resolve completely replicated and un-replicated material, as previously described (Panigrahi et al., 2002, J. BioL Chem., 277:13926-34; Geary et al., 2002, Nat Genet, 31(1): 37-46; bottom figure). An equal amount of unreplicated plasmid DNA was digested with Dpnl and stained with etfaidium bromide to show the complete digestion of unreplicated plasmid DNA (bottom panel). Panel I, ethidium bromide stained, Panel II, autorad: marker (lane 1); Dpnl undigested plasmid DNA (lane 2); Dpnl digested unreplicated plasmid DNA (lane 3-4); replicated plasmid DNA, Dpnl resistant (lane S). No difference in Dpnl resistant material was observed between replication in the presence or absence of NA, in all the three templates tested (panel III, IV, and V).

Figure 10 shows an independent analyses of the effects of NA on repeat instability in HD fibroblast cells with a (CAG)180 expansion from 3 independent experiments. Small- pool PCR across the HD repeat tract was used to assess repeat instability. Histograms show the repeat-length distributions in the HD primary fibroblast cells after 40-days incubation with or without NA. The frequency distribution of repeat alleles is indicated as gray bars. Dashed lines indicate the peak C AG size.

Figure 11 shows the effects of NA on repeat instability in HD fibroblast cells with a (CAG)43 expansion from 3 independent experiments. Small-pool PCR across the HD repeat tract was used to assess repeat instability. Histograms show the repeat-length distributions in the HD primary fibroblast cells after 40-days incubation with or without NA. The frequency distribution of repeat alleles is indicated as gray bars. Dashed lines indicate the peak CAG size.

Figure 12 are representative data showing ultra-sensitive small-pool PCR for the non- expanded CAG/CTG repeat length of CASK in HD primary fibroblast cells (Panel A of Figure 12) and HT1080-(CAG)850 cells (Panel B of Figure 12). Even under the highly sensitive mutation detection capacity of spPCR, length variation was not observed in either NA treated- and untreated-cells. Some reactions did not show any product, which is typical of the low genomic DNA template dilutions used in small-pool PCR Repeat length variability in non-expanded CAG tracts in TBP alleles in HD patient fibroblasts treated with or without NA for 40 days (Panel C of Figure 12) and in HD R6V2 mouse striatum with four injections of NA or saline (Panel D of Figure 12). Repeat lengths were analyzed by Agilent BioAnaryzer.

Figure 13 shows data that demonstrates that NA is not a general mutagen. Towards assessing whether NA-treatment acted as a general mutagen to sequences other than CAG slip-outs, the high read accuracy and depth of single molecule, real time, circular consensus sequencing (SMRT-CCS) was harnessed Single-molecule sequencing was done on the

HPRT1 gene - widely used as a surrogate indicator of the global effect of induced genetic variation. Single-molecule sequencing can detect mutations at frequencies below 0.5% with no false positives. Panel A of Figure 13 is a schematic of ΗΡΚΓ1 sequencing for mutation detectioa Briefly, cells were grown under identical conditions differing only by the addition of NA (SO uM) or saline, DNAs were isolated, HPRT1 exons 2 and 3 PCR amplified and sequenced. Panel B of Figure 13 are graphs showing the comparison of sequence variations between NA-treated and saline-treated samples. We chose to compare the single-molecule sequence reads of individual X chromosome-linked HPRT1 alleles (exons 2 and 3, 2,897 nucleotides) from the male HD patient-derived cells that had been NA- or saline-treated. In this manner, each read would represent a single cell Graphs show the distribution of sequence variants by relative mutation rate between three experimental replicates of NA- treated and saline-treated cells sequenced with PacBio single-molecule long reads.

Comparison of up to 2,402 individual HPRT1 alleles did not reveal any sequence differences. While there was minimal sequence variation, this did not differ significantly between NA- and saline-treated cells (p = 0.1083, two-sample Kolmogorov-Smirnov test). Panel C shows that SMRT long read sequencing was ideal for this application because it can sequence the whole length of the amphcon in a single contiguous read, using multiple passes through the read to generate high quality consensus sequence. Thus, the absence of sequence differences between the separate replicates is evidence that NA does not act as a general mutagen.

Figure 14 shows independent analyses of the effects of NA on repeat instability in HT1080-(CAG)8S0, producing an rCAG transcript Histograms showing repeat length distributions in HT1080-(CAG)8S0 cells. Repeat lengths were analyzed by small-pool PCR in both untreated and NA-treated cells, in three independent experiments. The frequency distribution of unstable alleles is shown by gray bars. The frequency of stable alleles is shown by black bars. A reduction of the CAG length tracts is evident in the NA-treated cells in each experiment To facilitate comparisons, a dashed line indicates the starting unchanged CAG size. P- values were calculated by chi-square test.

Figure IS shows the effect of NA in (CAG)8S0 cells is independent of proliferation. The effect of NA was tested in non-proliferating cells maintained in contact inhibition for the same interval as in Figure 4. Panel A of Figure IS shows the contact inhibition and serum- starvation arrest proliferation, as measured by the proportion of BrdU positive cells after 24h in BrdU-containing media. Panel B of Figure IS shows histograms showing repeat length distributions in HT108O-(CAG)850 cells under contact inhibition. Repeat lengths were analyzed by small-pool PCR in both untreated and NA-treated cells, in three independent experiments. The frequency distribution of unstable alleles is shown by gray bars. The frequency of stable alleles is shown by black bars. The effect of NA was independent of cell proliferation. NA induced a shift toward contraction of expanded repeats in the contact- inhibited cells (PO.05). To facilitate comparisons, a dashed line indicates the unchanged CAG size. P- values ware calculated by chi-square test

Figure 16 shows that NA does not affect transcription across HTT. Panel A of Figure 16 shows RT-PCR analysis of transgene expression in HT1080-(CAG)850 and HT1080-non transcribing (CAG)850 cells. Results demonstrate that the CAG repeat is not transcribed in HTl 080-non transcribing cells and that it is integrated as a single copy. Strand specific RT- PCR (1 weeks). AttB-PhiC31 system have been widely used for single copy integration. Panel B of Figure 16 was determined by quantitative real-time reverse transcriptase (qRT)- PCR and normalized to U6 RNA, expressed as the ratio of NA-treated vs. PBS-treated R6/2 striatum. Data are indicated as the mean ± SD of triplicates.

NA-induced displacement of the RNA component from the R-loop was not observed, which one might expect were NA to competitively bind for RNA away from the R-loop (Figure 17E). The inability of NA to compete for the binding to RNA repeats is likely due to its low binding to these RNAs. A study of NA interacting with RNAs of one repeat length, r(CAG)9 (Li et aL, 2016, Chem. Asian J., 11 : 1971-1981). Notably, the affinity of NA for RNA (CAG)9 versus DNA (CAG)9 was several fold different. Moreover, the binding stoichiometry of NA to d(CAG)9 and r(CAG)9 was quite different According to the surface plasmon resonance (SPR) assay for d(CAG)9 (amount of immobilization was 497 RU) and r(CAG)9 (S24 RU), NA binding to d(CAG)9 produced significant changes in the SPR intensity (140 RU) whereas only modest changes were observed for r(CAG)9 (8 RU).

Electrospray ionization time-of-flight mass spectrometry clearly suggested the much higher affinity of NA to d(CAG)9 as compared to r(CAG)9. Due to the ambiguity in the stoichiometry of NA-binding to d(CAG)9 and r(CAG)9, it is scientifically difficult to accurately measure the binding constant for NA-binding to d(CAG)9 and r(CAG)9. In addition, all these experiments were conducted in vitro, where there were only nucleic acids and NA in the buffer solution. It seems reasonable that N A binding to r(CAG)9 is not significantly effective in the cell The absence of strong binding of NA to RNA repeats is consistent with its inability to compete away the RNA from the R-loop (Fig. 17E).

Figure 17 are the results of experiments that show that NA does not affect transcription, R-loop formation, processing by RNAses, or R-loop biophysical stability. Panel A of Figure 17 is a graph of RNA transcript levels of transgene (transcript in the CAG- direction) in HT108O-(CAG)850 cells treated with or without NA (SO uM). Quantitative reverse transcription (RT)-PCR was performed using TaqMan Gene Expression assays. NA (SO uM) did not affect transcription of the CAG-repeats containing transcript Data are the mean ± SD of triplicates. Panel B of Figure 17 is a schematic for R-loop formation and processing. Arrows identify the steps at which NA may disrupt either transcription, R-loop formation, RNaseA/H digestion, R-loop biophysical stability, or R-loop processing, tested in panels A, D, C and E, respectively, of Figure 17. Panel C of Figure 17 shows a plasmid harboring (CAG)79»(CTG)79 repeats that was in vitro transcribed on either strand with T3 or T7 RNA polymerases, in the presence or in the absence of NA (120 uM). Reaction products were then treated with RNase A (labeled "A"), to cleave all single-stranded RNAs, and resolved on 1% agarose gel. Control treatment with RNaseH (labeled "IT), diminating RNA:DNA hybrids, was performed to reveal the RNA-free super coiled DNAs. The presence of NA (SO uM) during transcription did not affect R-loop formation. Panel D of Figure 17 shows in vitro transcription of a supercoiled plasmid containing an expanded tract of (CAG)79*(CTG)79 repeats was performed using T3 or T7 RNA polymerases transcription products and treated with RNaseA (labeled "A") or with both RNaseA and RNaseH (labeled "H"), in the presence or in the absence of NA (120 uM). Reaction products then were resolved on 1% agarose gel. NA (SO uM) didn't affect the cleavage of ssRNAs by RNaseA, nor did NA affect the processing of RNA:DNA hybrids by RNaseH Panel E of Figure 17 shows supercoiled DNAs with (CAG)79»(CTG)79 (lane 1) were transcribed in vitro across the repeats on both strands, with T3 and T7 RNA polymerases to yield double-R-loops. Double-R-loops were then treated with RNaseA or H (labeled "A" or ΤΓ) and incubated with NA (SO μΜ). Reaction products were then resolved on 1% agarose gel. Control reactions with double-R-loops treated with both RNaseA and RNaseH (labeled "H") (lane S) and the supercoiled plasmid itself (lane 6) were unaltered by NA (500 μM n, ot shown). The position of supercoiled plasmid in dimer or monomer form is indicated by "SC"". The top "SC" represents linked dimers and the bottom "SC"" represents monomers. In Panels C, D, and E, N A was either included in the transcription reaction (Panel C), only in the RNAse A/H reaction, only following the RNAse A/H clean-up. NA does not inhibit transcription, R- loop formation, digestion of R- loops by RNase A/H or disruption of the RNA from the R- loop.

Figure 18 shows the effect of NA on R-loop formation, processing, and instability. Panel A of Figure 18 shows supercoiled DNAs with (CAG)79*(CTG)79 (lane 1) were transcribed in vitro across the repeats on both strands with T3 and T7 RNA polymerases to yield double-R-loops. Double-R-loops were then treated with RNaseA or H (labeled "A" or " H") and incubated with SO μM of NA Reaction products were then resolved on 1% agarose gel. At the higher concentration of NA, the R-loop smear is reduced (lane 4). Control reactions with double-R-loops treated with both RNaseA and RNaseH (labeled "H") (lane S) and the supercoiled plasmid alone (lane 6) were unaltered by the higher NA concentration. Positions of supercoiled plasmid in dimer or monomer forms are indicated by "SC"". Panel B of Figure 18 is a schematic of R-loop formation, processing, and analysis. Pre-formed double-R-loops were processed by terminally differentiated (retinoic acid) human neuron- like cell extracts (SH-SY5Y) in the absence or presence of NA (SO μΜ), as described and DNA repeat lengths were assessed as expansions, contractions, or stable, by the STRIP assay. Panel C of Figure 18 is a graph showing a representative example of STRIP analysis. Transcription products were isolated, processed and transformed in E.coli cells, previously shown to stably maintain the (CAG)79»(CTG)79 lengths. Plasmids isolated from individual bacterial colonies were digested with restriction enzymes to release the repeat containing fragment, resolved on 4% polyacrylamide gels and scored for instability.

Figure 19 shows a possible mechanism through which NA may induce contractions of expanded CAG tracts. Panel A of Figure 19 shows transcription-induced R-loop formation with a r(CAG)n in a hybrid with the CTG DNA template strand will displace the CAG strand, making it available to be bound by NA NA may bind varying amounts, making shorter or longer hairpins. Intra-strand DNA repeat structures formed at R-loops, or at the reannealed DNA strands upon RNA removal, could be processed to repeat expansions and contractions by unclear mechanisms. Slipped-out CAG and slipped-out CTG repeats were predominantly in the random-coil and hairpin conformations, respectively. Repeat instability following the processing of R-loops has been shown to occur through the formation of slipped-DNAs from the R-loops, and their subsequent aberrant repair (Panigrahi et al., 2010, PNAS USA, 107: 12593-9). It is possible that during formation of the rCAG-dCTG R-loop, NA may bind the intra-strand hairpins formed on the displaced dCAG strand and possibly further extend these hairpins as demonstrated in Panel D of Figure 2. Panel B of Figure 19 shows that, similarly, NA may bind to CAG slip-outs formed by out-of-register reannealing of DNA strands following removal of RNA (Panel B of Figure 19). These NA-bound structures may shift processing to contractions over expansions. Processing of slipped- CAG/CTG repeats is complex, and has been shown to involve multiple endonucleolytic incisions (see arrows in panel B of Figure 19) by unknown nucleases (Hou et al., 2009, Nature Struct & Mol. Biol., 16:869-75; Pluciennik et al., 2013, PNAS USA, 110: 12277-82). NA also can bind to gaps of single-stranded (CAG) 10 repeats, which can arise during the removal of CTG slip-outs. That NA blocked the repair of a (CAG)20 slip-out is consistent with the inability of human and other DNA polymerases to extend primers along NA-bound (CAG) 10 templates. These results suggest that the effect of NA upon transcription-enhanced repeat instability could have arisen by NA perturbation of R-loop processing.

Figure 20 shows that MutS beta was allowed to bind radio-labelled slipped-DNA, containing a single slip-out of (CAG)20, at room temperature for 30 min in a buffer containing 10 mM Hepes-KOH pH 7.S, 110 mM KC1, 1 mM EDTA, and 1 raM DTT.

Addition of ATP (+/- Mg+) into this reaction disrupts binding of MutS beta to the DNA, as previously demonstrated. Addition of NA has no effect on binding or dissociation of MutS beta with this substrate.

Figure 21 shows FANl cleavage of supercoiled DNA, linear DNA and transcription- induced R-loops. A supercoiled plasmid harboring (CAG)79*(CTG)79 repeats, was linearized with BamHI, then incubated at 37°C for 15 min with 200 nM FANl (left panel, first two lanes). The same supercoiled plasmid was treated directly with 200 nM FANl (left panel, second two lanes). The same plasmid was in vitro transcribed across the CTG strand with T3 RNA polymerase (middle panel). Transcription reaction products were then treated with RNase A, to cleave all single-stranded RNAs, or both RNase A and with RNaseH, to also eliminate RNA:DNA hybrids. These reactions were purified by extraction with phenol/chloroform/isoamyl alcohol (25:24:1, v/v/v) and ethanol precipitation, and incubated at 37°C for 15 min with 200 nM FAN1 (right panel). All reaction products were analyzed on 1% agarose gels run in lx TBE buffer at 80 V for 3 h. Gels were subsequently stained with ethidium bromide (0.S mg/ml) to allow visualization of total nucleic acid under ultraviolet (UV) light FAN1 cleaves supercoiled and linear DNAs (left panel), as well as the transcription-induced R-loop containing DNAs, releasing the supercoil-dependent RNA portion (right panel). The position of supercoiled plasmid in monomer or multimer form is indicated by 'SC. 'OC' indicates the open circular form. 'LN' indicates the linearized plasmid.

Figure 22 shows thatFANl cleavage is altered by the presence of a CAG hairpin in the flap-DNA substrate and this activity is inhibited by NA-binding to the CAG hairpin. Flap-DNA substrates (see sequences in Panels D-F of Figure 22) are schematically shown in the top panel: S'-flap DNA with no repeats (Panel A of Figure 22), 5 '-flap DNA with (CAG)20 repeats in the flap (Panel B of Figure 22), S'-flap DNA with (CAG)20 repeats in the duplex region (Panel C of Figure 22). All DNA structures were ³²P radiolabeled at the S'-end of the flap strand. 1 pmol of each substrate was incubated with increasing

concentration of NA (7.5 μΜ, 15 μΜ, SO uM) and then treated with 200 nM FAN1.

Nuclease reactions were analyzed on a native 8% polyacrylamide gel at 200V for 1 h. The amount of digested product was quantified by densitometric analysis. All lanes are from the same gel, and they were separated for clarity. Histograms indicate the mean of three independent experiments and the corresponding standard deviation. To map FAN1 digestion sites, reaction products were separated on a 8% denaturing gel, together with Maxam-Gilbert sequencing reactions. Cut sites are indicated by arrowheads (going from white to black to indicate the increasing digestion efficacy). The different digestion fragments are indicated by letters (a,b,c). V indicates the starting substrate; 's+NA' indicates the shifted substrate due to NA binding to the CAG slip-out The cut sites are indicated by the nucleotide location relative to either the end of the DNA, the junction point, or the repeat unit number. Panels D— F of Figure 22: cleavage sites were mapped for the three substrates; Panel D of Figure 22 corresponds to Panel A of Figure 22; Panel E of Figure 22 corresponds to Panel B of Figure 22; Panel F of Figure 22 corresponds to Panel C of Figure 22.

Figure 23 shows that NA competes with RPA binding and blocks the enhanced progression of pol delta along a CAG template. Panel A of Figure 23 shows the binding of RPA (250 nM) to a DNA substrate containing a slip-out of (CAG)20 (lane 2), ³²P labelled on both strands (schematic on top). The substrate was incubated with NA (SO μΜ) for 10 min at RT, prior addition of RPA NA competes with RPA binding to the (CAG)20 substrate (lane 3). The percentage of DNA bound to RPA was quantified by densitometric analysis.

Histograms indicate the mean of three independent experiments and the corresponding standard deviation. Panel B of Figure 23 shows that a polymerase extension assay was performed as previously described. 0.1 uM of a (CAG) 10 template oligo was annealed with 0.1 uM of a ³²P labelled primer and incubated with or without NA (SO uM) for 30 min at RT. 2S0 nM RPA and/or 20 nM Pol delta was added to the reaction and incubated for 15 min at 37°C. Reaction products were separated on a 6% sequencing gel together with Maxam- Gilbert sequencing reactions (lane 1). Primer only is in lane 2. Pol delta alone cannot synthesize through the CAG tract (lane 3) but its activity is enhanced by the addition of RPA (lane 4). NA blocks the enhanced progression of pol delta along the CAG template (lane S and 6), a result consistent with NA binding competitively against RPA for the CAG tract Panel C of Figure 23 shows a schematic of a possible mechanism for NA-induction of contraction through an inability of polymerases to synthesize across NA-bound CAG templates.

Figure 24 shows the NA and saline injection regimen for R6/2 HD mice.

Figure 25 show CAG repeat length analysis and instability indices for the striatum, cortex, cerebellum and tail of eight of the ten mice treated with 4 injections (mice vi-xii) (Panels A-H of Figure 25). The CAG scans of the other two mice (mice iv and v) are shown in Figure 6A

Figure 26 shows histological study, mouse striatum, with saline, NA in saline, or no injection, followed by H&E staining.

Figure 27 shows that, most, if not all alleles in the NA-treated striatum had incurred repeat contractions, indicating that NA had affected most cells. Panel A of Figure 27 shows the quantification of the somatic "instability index" from the gene scan traces of CAG repeat sizes using an established method. This calculation corrects background using a relative height threshold and normalizes data efficiently, generating an "instability index" that represents the mean CAG length change from the main allele per cell in a population of cells: greater indices reflect greater expansions, lower indices— lower expansions. Analysis of striatum treated with and without NA revealed instability indexes that were progressively reduced with subsequent NA injections of 1 , 2 and 4 administrations (Figure 6B, Figure 25A- 2SH). Panel B shows the Instability Indices summarized for all ten mice treated 4-times with NA The repeat size distributions in striatum treated four times with NA were significantly different from the mock-treated striatum (Mann-Whitney, p=0.00035). The relative peak height correction permitted quantification of a contraction and expansion index and the relative composition of contracted/expanded peaks, as described - both of which revealed increasing contractions with subsequent NA treatments. The effect of NA was localized to the site of injection, as the CAG tract in the cerebral cortex and cerebellum from the same mice that had intra-striatal injections, showed identical patterns of CAG length heterogeneity in the right and left sides (Figure 6C-D, Figure 25A-25H).

Figure 28 shows NMR data to confirm the structures of NBD-labeled NA

DETAILED DESCRIPTION

Small molecules that specifically target genetically unstable disease-causing repeat DNA sequences, or structures that form as a result of these sequences, can have the potential to control their genetic instability, as ongoing somatic repeat expansions can contribute to disease onset and progressioa Ligands designed for therapeutic application must bind their target with high specificity for the mutant allele, and arrest expansions or induce contractions with little to no overall toxicity. This disclosure describes small molecules designed to bind specifically to slipped-CAG/CTG repeat structures (Nakatani et al., 2005, Nat Chem. BioL, 1 :39-43; Hagihara et al., 2006, Nuc. Acids Symp. Ser. (Oxf.), 50: 147-8; and Hagihara et al., 2011 , Chembiochem, 12: 1686-9), and modify instability of the expanded repeat, leading to repeat contractions. This disclosure describes the first potential therapy that specifically reduces repeat expansions. Specifically, it is reported herein that NA can arrest expansions, which may slow or arrest disease onset, progression or severity, as well as induce

contractions of the expanded repeat, possibly even below the inherited length. This disclosure describes methods for reducing the number of trinucleotide repeats within a trinucleotide repeat DNA sequence in an individual This disclosure also describes methods that can be used to inhibit the expansion, or further expansion, of a trinucleotide repeat DNA sequence in an individual. The methods described herein can be used to treat an individual having a disease caused by trinucleotide repeat DNA instability. As used herein, treating a disease caused by a trinucleotide repeat DNA instability can refer to arresting or halting the progression of the expansion and, hence, the symptoms associated with expansion of the trinucleotide repeat DNA sequence. Also as used herein, treating a disease caused by a trinucleotide repeat DNA instability can refer to reversing progression of the disease. For example, if the number of trinucleotide repeats within a trinucleotide repeat DNA sequence can be reduced to within or below a threshold number, the symptoms associated with the disease can be halted, or prevented, or possibly even reversed to some extent given early intervention (e.g., prior to significant degeneration of cells and tissues).

The methods described herein include administering at least one dose of a therapeutic amount of naphthyridine-azaquinolone (NA) to the individual. NA consists of a 2-amino- 1,8-naphthyridine moiety and an 8-azaquinolone moiety (see, for example Figure 1 A). Two molecules of NA intercalate into the DNA helix, with the 2-amino-l,8-naphthyridine moiety available for hydrogen bonding to guanine and the 8-azaquinolone moiety available for hydrogen bonding to adenine, which results in two cytosines being extruded or pushed out of the helix. Without being bound by any particular theory, the results presented herein indicate that NA acts in post-mitotic cells during transcription of the mutant repeat, is specific for the expanded mutant allele, and is mediated by modulating the repair of CAG slip-outs.

Diseases that are caused by trinucleotide repeat instability are known in the art, and currently number in the dozens. The trinucleotide repeat instability diseases that can be treated with NA as described herein include the diseases associated with the CAG / CTG trinucleotide repeats. Such diseases include, without limitation, Huntington's disease (HD), Huntington's disease-like 2 (HDL2), myotonic dystrophy (DM1), Spinocerebellar ataxia type 1 (SCA1), SCA2, SCA3, SCA4, SCA6, SCA7, SCA8, SCA12, SCA17, Spinal and bulbar muscular atrophy (SBMA), Dentatorubropallidoluysian atrophy (DRFLA), Fuch's

Endothelial Corneal Dystrophy 2 (FECD2), schizophrenia, bipolar disorder (KCNN3), breast cancer risk factor AIB1 (also known as NCOA3, SRC-3, ACTR, pCIP, RAC3, and TRAMl). Individuals having a disease caused by a trinucleotide repeat DNA instability typically are identified using genetic analysis (e.g., PCR amplification, sequencing, restriction digest analysis, restriction fragment length polymorphisms) to determine the size of the repeat (e.g., the number of trinucleotide repeats) and/or the presence of an expanded region (e.g., an expansion).

NA can be formulated with a pharmaceutically acceptable carrier for delivery to an individual in a therapeutic (or effective) amount The particular formulation and the therapeutic amount is dependent upon a variety of factors including the route of

administration, the dosage and dosage interval of the NA, the sex, age, and weight of the individual being treated, the severity of the affliction, and the judgment of the individual's physician.

Determining the developmental timing of somatic expansions in a given tissue (skeletal muscle, heart, brain regions, etc.) may enhance the ability to administer NA in a way that prevents the onset of somatic expansions and/or more easily induces contractions to lengths closer to the non-affected lengths.

As used herein, "pharmaceutically acceptable carrier" is intended to include any and all excipients, solvents, dispersion media, coatings, antibacterial and anti-fungal agents, isotonic and absorption delaying agents, and the like, compatible with administration. The use of such media and agents for pharmaceutically acceptable carriers is well known in the art Except insofar as any conventional media or agent is incompatible with a compound, use thereof is contemplated.

Pharmaceutically acceptable carriers for delivering compounds are well known in the art See, for example Remington: The Science and Practice of Pharmacy, University of the Sciences in Philadelphia, Ed, 21* Edition, 2005, Lippincott Williams & Wilkins; and The Pharmacological Basis of Therapeutics, Goodman and Gilman, Eds., 12 ^th Ed, 2001 , McGraw-Hill Co. The type of pharmaceutically acceptable carrier used in a particular formulation can depend on various factors, such as, for example, the physical and chemical properties of the compound, the route of administration, and the manufacturing procedure. Pharmaceutically acceptable carriers are available in the art, and include those listed in various pharmacopoeias. See, for example, the U.S. Pharmacopeia (USP), Japanese

Pharmacopoeia (JP), European Pharmacopoeia (EP), and British pharmacopeia (BP); the U.S. Food and Drug Administration (FDA) Center for Drug Evaluation and Research (CDER) publications (e.g., Inactive Ingredient Guide (1996)); and Ash and Ash, Eds. (2002) Handbook of Pharmaceutical Additives, Synapse Information Resources, Inc., Endicott, NY.

A pharmaceutical composition that includes NA as described herein is typically formulated to be compatible with its intended route of administration. Suitable routes of administration include, for example, oral, rectal, topical, nasal, pulmonary, ocular, intestinal, and parenteral administratioa Routes of parenteral administration include intravenous, intramuscular, and subcutaneous administration, as well as intraperitoneal, intra-arterial, intra-articular, intracardiac, intracisternal, intradermal, intralesional, intraocular, intrapleural, intrathecal, intrauterine, and intraventricular administratioa

Simply by way of example, for intravenous injection, the NA may be formulated as an aqueous solution using physiologically compatible buffers, including, for example, phosphate, histidine, or citrate for adjustment of the formulation pH, and a tonicity agent, such as, for example, sodium chloride or dextrose. For oral administration, NA can be formulated in liquid or solid dosage forms, and also formulated as an instant release or controlled / sustained release formulations. Suitable forms for oral ingestion by an individual include tablets, pills, hard and soft shell capsules, liquids, gels, syrups, slurries, suspensions, and emulsions. Solid oral dosage forms can be obtained using excipients, which can include fillers, disintegrants, binders (dry and wet), dissolution retardants, lubricants, glidants, anti- adherants, cationic exchange resins, wetting agents, antioxidants, preservatives, coloring, and flavoring agents. Examples of such excipients include cellulose derivatives, citric acid, dicalcium phosphate, gelatine, magnesium carbonate, magnesium / sodium lauryl sulfate, mannitol, polyethylene glycol, polyvinyl pyrrolidone, silicates, silicium dioxide, sodium benzoate, sorbitol, starches, stearic acid or a salt thereof, sugars (eg., dextrose, sucrose, lactose), talc, tragacanth mucilage, vegetable oils (hydrogenated), and waxes.

NA can be administered topically, such as through a skin patch, a semi-solid, or a liquid formulation, for example a gel, a (micro-) emulsion, an ointment, a solution, a (nano / micro)-suspension, or a foam. Penetration of NA into the skin and underlying affected tissues can be regulated, for example, using penetration enhancers; the appropriate choice and combination of lipophilic, hydrophilic, and amphophilic excipients, including water, organic solvents, waxes, oils, synthetic and natural polymers, surfactants, emulsifiers; by pH adjustment; and/or the use of complexing agents.

NA can be administered in a therapeutic amount to an individual having a disease caused by trinucleotide repeat DNA instability. Typically, a therapeutic amount or dose of N A refers to the amount of NA that results in either arresting somatic expansions, or a reduction in the inherited size of the trinucleotide repeat DNA sequence (e.g., a reduction in the number of trinucleotide repeats) and, eventually, a reduction in, or amelioration of, one or more symptoms without inducing any adverse effects. In some instances, the therapeutic amount correlates with the number of repeats present Toxicity and therapeutic efficacy of NA can be determined by standard pharmaceutical procedures in cell cultures or

experimental animals, eg., by determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose ratio of toxic to therapeutic effects is the therapeutic index, which can be expressed as the ratio LD50/ED50.

A therapeutic amount of NA can be administered in a single dose (i.e., a one-time dose) or a therapeutic amount of NA can be administered in more than one dose (i.e., a plurality of times) or even in a slow, continuous manner using, for example, a pump or a patch. As used herein, a therapeutic amount of NA refers to about 0.01 uM to about 1 M (e.g., about 0.05 uM to about 0.75 M; about 0.1 uM to about 0.5 M; about 0.5 uM to about 0.1 M; about 1 uM to about 50 mM; about 50 uM to about 1 mM; or about 100 uM to about 0.5 mM) of NA In some embodiments, the concentration of NA in a composition as described herein can be between about 1% and about 50% (e.g., between about 5% and about 40%; between about 10% and about 30%; between about 15% and about 25%; or about 20%). In some instances, NA can be administered in a dose-dependent manner depending upon the number of trinucleotide repeats a person has inherited, or the rate of somatic expansions in a given tissue. It would be appreciated that, depending upon the disease, the severity of the symptoms, and the number of trinucleotide repeats, a therapeutic amount of NA (e.g., in one or more doses) can be administered once a week, once a month, once a year, or more or less frequently as needed (e.g., chronically with an intracranial pump).

It also would be appreciated that, given the tissue-specific expansion exhibited in several of the trinucleotide repeat diseases, the therapeutic amount of N A may differ for different tissues within the same individual. For example, in HD, the primary tissues that exhibit the most expansion (e.g., the largest expansion, the most rapid expansion, or combinations thereof) includes the striatum, the cerebral cortex, the basal ganglia, medium spiny neurons, and the male germline; in myotonic dystrophy type I, the primary somatic tissues that exhibit expansion includes the brain, heart, and the cerebral cortex. For DM1, see, for example, Lopez Castel et al. (2011, Hum Mol. Genet, 20:1-15) and Seriola et al. (2011, Hum. Mol. Genet, 20: 176-85); for Huntington's disease, see, for example, De Rooij et al. (1995, Hum Genet, 95:2704) and Telenius et al. (1994, Nat Genet, 6:409-14); For DRPLA, see, for example, Aoki et al. (1996, Clin. Genet, 50:199-201); for spinal and bulbar muscular atrophy (SBMA), see, for example, Tanaka et al. (1996, J. Neurol. Sci., 135:43-50); for SCA1 and SCA3, see, for example, Cancel et al. (1998, Hum. Mutat, 11:23-7); for SCA7, see, for example, Yoon et al. (2016, Brain, 139(Pt 3):e20). See, also, Cleary & Pearson (2003, Cytogenet Genome Res., 100:25-55), Abeliovich et al. (1993, Am J. Hum Genet, 52:1175-81), Wohrle etal. (1995, Hum Mol. Genet, 4:1147-53), Peterlin etal.

(1996, Pflugers Arch., 431(6 Suppl 2):R199-200), Anvret et al. (1993, Hum Mol. Genet, 2:1397-400), Thornton et al. (1994, Ana Neurol., 35:104-7), and Ishii et al. (1996, Hum Genet, 98:138-40).

The methods described herein also can include the step of monitoring individuals that have received NA For example, using routine methods in the art (e.g., PCR amplification, sequencing, restriction digest analysis, restriction fragment length polymorphisms), the size of the trinucleotide repeat can be monitored in cells from the individual. The monitoring step can occur with the desired frequency; for example, the individual, or cells from one or more tissues from the individual, can be monitored with the same frequency as the NA is administered (eg., before or after the NA is administered) or can be monitored at another desired frequency (e.g., weekly, monthly, or yearly). In some instances, the results of the monitoring step can determine, or help determine, the tissue(s) that require treatment, the therapeutic amount, and/or the appropriate frequency with which the therapeutic amount should be delivered to the individual (or to the tissue).

It would be understood that the N A can be modified in a number of ways to increase its stability, tissue-selectivity, uptake, or combinations thereof in vivo. In some instances, the NA can be delivered via, for example, liposomes, an intracranial pump, intramuscular diffusion, a blood-brain barrier "key", or another formulation that stabilizes and/or provides protection for the NA during administration and delivery to the affected cells / tissues.

In accordance with the present invention, there may be employed conventional molecular biology, microbiology, biochemical, and recombinant DNA techniques within the skill of the art Such techniques are explained fully in the literature. The invention will be further described in the following examples, which do not limit the scope of the methods and compositions of matter described in the claims.

EXAMPLES

Example l— Characterisation of NA-CAG/CAG Complex

Previous characterizations of the NA-CAG used UV-melting, electrospray ionization time-of-flight mass spectrometry (ESI-TOF MS), isothermal titration calorimetry (TTC), surface plasmon resonance (SPR) assay, circular dichroism (CD) spectroscopy, SPR-imaging assay, and nuclear magnetic resonance (NMR) spectroscopy (Hagihara et al., 2006, Nuc. Acids Symp. Ser. (Oxf.), 50:147-8; Nakatani et al. 2005, Nat Chem. Biol., 1:39-43;

Hagihara et aL, 2011, Chem Bio. Chem., 12:1686-9). These studies revealed a unique NA- (CAG)*(CAG) structure of a distorted intra-strand hairpin. The naphthyridine and azaquinolone moieties in NA exhibit complementary hydrogen bonding to guanine and adenine, respectively, causing two cytosine bases to flip-out from the CAG hairpin (Figure 2A and 2B). The affinity of each NA molecule/CAG-C AG was estimated to be 1.8 x 10 ⁶M- ¹ as Ka or 0.56 x 10 ^-6 M as Kd (Hagihara et al., 2006, Nuc. Acids Symp. Ser. (Oxf), 50: 147- 8). NA increased the melting temperature of (CAG)io hairpins from 47.1 (±0.7) °C to 78.8 (±0.1) °C at 50 uM NA, surpassing the melting temperature of (CTG)io hairpins (54.4 (±0.3) °C). A large structural change of d(CAG)io shown by CD spectroscopy and SPR-imaging assay was induced by multiple NA-binding (at least 6 NA molecules) to the repeat as confirmed by ESI-TOF MS. NA-binding increased for longer repeats with isolated short oligonucleotides of (CAG)10-30 (Nakatani et aL, 2005, Nat Chem. Biol., 1:39-43). The NA- CAG/CAG bound structure was determined by NMR spectroscopy, and this structure was deposited in the Protein Data Bank (PDB) and assigned Accession Number 1X26. These features explain the remarkable selectivity of N A for CAG hairpins; both sequence- and hairpin-specificity likely target the CAG/CTG expanded allele.

Example 2— NA Synthesis and Labeling

Solvents and starting materials were purchased by the standard suppliers and used without further purification. Analytical thin- layer chromatography (TLC) was performed on 0.2 mm silica 60 coated on plates with F254 indicator. Flash column chromatography was performed on Wako gel C-200 silica gel. High performance liquid chromatography (HPLC) was performed by a Gilson 811C Dynamic Mixer system with a UV detector set at 254 nm using a Cosmosil SCI 8-MS-II column (150x20 mm) with a dual solvent system of 0.1% ΗΟΑc/H ₂Ο (Solvent A) and MeCN (Solvent B). Nuclear magnetic resonance (NMR) spectra were recorded on a Bruker Avance ΙΠ 700 spectrometer at 21±3 °C unless otherwise indicated. Chemical shifts (δ) are reported in parts per million (ppm). Coupling constants (J) were reported in Hertz. ¹H NMR chemical shifts were referenced to the residual solvent peak at 3.31 ppm in CD30D-d4. ¹³C NMR chemical shifts were referenced to the center solvent peak at 49.00 ppm for CD30D. ESI mass spectra were recorded on a Thermo LTQ Orbitrap XL mass spectrometer.

NA (40 mg, 0.09 mmol) and teri-butyl (4-oxo-4-((3- oxopropyl)amino)butyl)carbamate (34 mg, 0.13 mmol) was stirred in methanol (2 mL). Acetic acid was added to adjust the pH of the mixture about to 6. Then, sodium

triacetoxyborohydride (31 mg, 0.13 mmol) was added to the mixture. The mixture was stirred at room temperature for 12 h. The solvent was evaporated to dryness, and the residue was purified by column chromatography on silica gel eluted with cMoroform:methanol = SO: 1 to give Boc-NA-Linker (IS mg, 25%) as pale yellow solids. To a solution of Boc-NA- Linker (15 mg, 0.02 mmol) in chloroform (1 mL) was added ethyl acetate containing 4 M HC1 (0.5 mL), and the reaction mixture was stirred at room temperature for 0.5 h. The solvent was evaporated to dryness. The residue was further purified by HPLC to give NA- Linker (11 mg, 86%) as white solids. Ή NMR (CD3OD, 700 MHz): delta = 8.20 (d, J = 8.6 Hz, lH), 8.06 (dd, J = 8.4, 2.4 Hz, 2H), 7.76-7.60 (m, 2H), 7.33 (d, J = 8.2 Hz, lH), 7.02 (d, J = 7.7 Hz, lH), 6.43 (d, J = 9.5 Hz, lH), 4.42 (s, 2H), 3.26 (t, J = 6.9 Hz, 2H), 3.01 (t, J = 7.3 Hz, 2H), 2.92 (t, J = 6.0 Hz, 2H), 2.87 (t, J = 6.0 Hz, 2H), 2.70 (t, J = 6.0 Hz, 2H), 2.69 (s, 3H), 2.63 (t, J = 7.1 Hz, 2H), 2.56 (t, J = 6.0 Hz, 2H), 2.37 (t, J = 6.9 Hz, 2H), 1.93 (t, J = 7.3 Hz, 2H), 1.76 (t, J = 7.1 Hz, 2H). ¹³C NMR (CDsOD, 175 MHz): delta = 175.4, 174.1, 173.2, 172.9, 164.4, 162.8, 159.8, 153.7, 153.6, 148.3, 139.5, 138.8, 137.3, 136.6, 121.5, 120.9, 118.3, 115.8, 114.1, 113.1, 51.3, 50.4, 49.4, 44.3, 39.0, 37.2, 34.5, 33.7, 32.4, 26.3, 23.6,

NA-Linker (15 mg, 0.02 mmol) and NBD-X, SE (Succinimidyl 6-(N-(7-nitrobenz-2- oxa-1, 3-diazol-4-yl)amjno)hexanoate) (12 mg, 0.03 mmol) was stirred in

dimetnylfonnamide (3 mL). Then, triethylamine (5 mg, 0.05 mmol) was added to the mixture. The mixture was stirred at room temperature for 6 h. The solvent was evaporated to dryness, and the residue was purified by column chromatography on silica gel eluted with chloroform: methanol = 20: 1 to give crude product The crude product was further purified by HPLC to give NA-NBD (13 mg, 59%) as white solids. ¹H NMR (CD3OD, 700 MHz): delta = 8.41 (d, J = 8.6 Hz, lH), 8.18 (bs, 1H), 8.04 (q, J = 8.3 Hz, 2H), 7.73 (d, J = 8.2 Hz, 1H), 7.66 (d, J = 9.5 Hz, lH), 7.29 (d, J = 8.2 Hz, lH), 7.02 (d, J = 8.2 Hz, lH), 6.44 (d, J = 9.5 Hz, lH), 6.22 (d, J = 9.0 Hz, lH), 4.43 (s, 2H), 3.44 (s, 2H), 3.26 (t, J = 6.9 Hz, 2H), 3.14 (t, J = 7.1 Hz, 2H), 3.09-2.97 (4H), 2.77 (s, 4H), 2.66 (s, 3H), 2.64 (t, J = 6.3 Hz, 2H), 2.18 (t, J = 7.5 Hz, 4H), 1.83 (t, J = 7.1 Hz, 2H), 1.76-1.70 (4H), 1.65 (t, J = 7.5 Hz, 2H), 1.47-1.37 (2H). ¹³C NMR (CD30D, 175 MHz): delta = 174.6, 174.2, 174.1, 173.5, 172.3, 164.4, 162.8, 159.8, 153.7, 153.6, 148.3, 145.1, 144.4, 144.0, 139.5, 138.7, 137.2, 136.6, 121.4, 121.0, 118.3, 115.8, 114.2, 113.1, 98.1, 51.2, 50.0, 49.3, 44.3, 43.1, 38.5, 38.3, 36.9, 35.4, 33.5, 33.0, 32.6, 27.5, 26.1, 25.8, 25.4, 25.1, 23.6. HRMS (ESI) m/z: calcd for

900.3883; found, 900.3875.

These reactions are shown in Figure IB, and the structure of the NBD-labeled NA was confirmed by NMR (Figure 28). Example 3— Cell Culture

The construction of the HT1080-(CAG)850 cell model was described previously (Sathasivam et al., 1997, Hum. Genet, 99:692-5). Briefly, HT1080 (ATCC) human fibrosarcoma cells were co-transfected with a plasmid (LC15-R) containing 800-850 (CTG>(CAG) repeals (Nakatani et al., 2005, Nat Chem. Biol., 1 :39-43) and a plasmid encoding PhiC31 integrase. Transfection was performed with a Nucleofector (Lonza, Basel, Switzerland), and stably transfected clones were selected with puromycin. The HT1080-non transcribing (CAG)HO cell model was established by transfecting a plasmid modified from pLC16 by adding a SV40 polyadenylation signal downstream of the repeats, instead of LC15-R. The HT1080- non-transcribing (CAG)850 cell model was established by transfecting a plasmid modified from pLC16 by inserting an additional SV40

polyadenylation signal downstream of the repeats (in addition to the original

polyadeniylation signals located upstream of the repeats), so that CAG/CTG repeats are floxed by transcription-terminator elements. The AttB-PhiC31 system has been widely used for single copy integration. As expected, this AttB-PhiC31 -mediated integration resulted in a confirmed single integration of the transgene, that was only transcribed only when induced (Figure 16A). The HT1080 model cells and an HD primary fibroblast cell line, GM09197 (Coriell Biorepository) with a large expanded allele of 180 CAG repeats and a non-expanded allele of 23 repeats (Sathasivam et al., 1997, Hum. Genet, 99:6692-5), the HD primary fibroblast cell line, GM02191 (Coreill Biorepository) with an expanded allele of 43 CAG repeats and a non-expanded allele of 19 repeats, were cultured at 37°C with 5% CCh in Dulbecco's modified Eagle's medium supplemented with 10% fetal bovine serum and 1% penicillin/ streptomycin. Cells were treated with or without continuous exposure to 50-μΜ NA for 30 days (HT1080 cells) or for 40 days (fibroblast cells). For experiments on non- proliferating cells, HT1080 cells were grown to confluence and then treated for the same period. WST-1 assay was performed according to the manufacturer's instructions (Roche, Basel, Switzerland). All cell lines were tested to be free of mycoplasma. Three biological replicates were performed for each experiment

Proliferation was inhibited by growth under contact inhibition and serum starvation, as previously performed. Degree of arrest from proliferation was assessed in living cells by counting BrdU positive cells after 24 hrs incubation with BrdU in proliferating or contact- inhibited (and serum starved) HT1080-(CAG)850 cells. The percentage of BrdU positive cells was 92.6% in proliferating cells and 8.17% in contact-inhibited cells. Although a small portion of HT1080-(CAG)850 cells were still proliferating even under contact-inhibition and serum starvation, the ratio is very small. Based on a calculation how many cells derive from a single cell after 30 days (the period that was studied) (proliferating, 1.926 ^Λ30 =

348653546.6; contact inhibition, 1.0817 ^Λ30 = 10.SS), the effect of proliferation in contact- inhibited cells can be ignored.

Fxample 4— Microscopy

HT1080-(CAG)850 cells were incubated with 50-μΜ NBD-labeled NA for 48 hr and Cell Light Plasma Membrane-RFP, BacMam 2.0 (Life Technologies, Carlsbad, CA), then fixed for 15 min at RT with 4% paraformaldehyde and washed 2 times for 10 min in phosphate buffered saline (PBS). Cells were mounted with Vectashield hard-set mounting media that contains DAH (Vector Laboratories, Burlingame, CA). Fluorescence images were obtained using Olympus FV1000D confocal laser scanning microscope (Olympus, Tokyo, Japan).

Example 5— Repeat I-ength Analysis

Genomic DNA was extracted from HT1080-(CAG)850 clones and HD primary fibroblasts using the Gentra Puregene Cell Kit (Qiagen, Valencia, CA). The expanded (CTG>(CAG) repeats were sized by small-pool PCR (sp-PCR) followed by Southern blot as described previously (Sathasivam et al., 1997, Hum. Genet, 99:692-5). For HT1080 models, small-pool PCR (spPCR) was performed with the input of 1.4-1.7 genome equivalents. The repeat size difference in the models is, at most, 3000 base-pairs, therefore, being strictly conservative, a bias toward amplifying the shorter allele can be possible, even under optimized PCR conditions described hereia For HD primary fibroblasts, sp-PCR &

Southern blots were performed as described previously with slight modifications (Tome et al., 2013, PLoS Genet, 9: el 003280). Briefly, the PCR primers listed in Table 1 were used, and blots were hybridized with a DIG-labeled (CAG)7 locked nucleic acid probe (Nakamori et al. 2009, Neuromuscul. Disord., 19:759-62). At least 50 alleles were analyzed for each of the three experiments (more than 150 alleles for a total study). Repeat analyses are summarized in Table 2, compiled from complete small-pool-PCR datasets for each repetition of each experiment For the histograms in Figures 4D, 4E, 10, 11, SB, SD, SE, 14 and 1SB, the Y-axis, "% of repeat population" was calculated by determining the proportion of >50 individual small-pool PCR reactions across the CAG repeat tract, that harbored a certain size of repeat product with a certain length. Cell clones were grown 30 or 40 days with or without NA (see, for example, Sathasivam et al., 1997, Hum. Genet, 99:692-5).

Specifically, the histograms of repeat length distributions were from at least two different cell clones. Length distributions were determined as follows: Small-pool PCR followed by Southern blot for analysis of CAG/CTG repeat length. DNA from fibroblast clones was diluted to produce, on average, one amplifiable expanded repeat allele per PCR tube. The repeat tract was amplified by 24 cycles of PCR Amplified products were electrophoretically resolved, detected by Southern blot probing with a CAG repeat oligonucleotide, and repeat lengths determined relative to a DNA size marker. The scale on the left shows molecular weight markers (M) converted into repeat number for C AG-repeat fragments of equivalent size. To facilitate comparisons, a dashed line indicates the unchanged CAG size. The distribution of unstable alleles is shown by gray bars (left vertical axis). The frequency of stable alleles is shown by black bars (right vertical axis). Allele lengths are grouped in bins spanning 50 repeats along the X-axis. For the statistical analysis, x ²-tests were performed to compare the frequencies of expanded, unchanged, and contracted alleles in each set of experiments as reported previously (Nakamori et al., 2011, Mol. Ther., 19:2222-7). The trinucleotide and dinucleotide tract lengths of the HTT, CASK, ATXN8, Μβ15, TOP, and murine Mapkapl, Fgd4, and tdp loci were PCR amplified through use of primers (listed in Table 1) and amplification conditions described elsewhere (see, for example, Kroner et al., 1995, Am. J. Hum. Genet, 57:343-50; Brook et al., 1992, Cell, 69:385; Koob et al., 1999, 21:379-84; Dietmaier et al., 1997, Cancer Res., 57:4749-56; Kabbarah et al., 2003, Mol. Carcinogen., 38: 155-9). Repeat length variability was studied in non-expanded CAG alleles and TBP alleles by PCR (electrophoresed in high resolution gel) and Agilent BioAnalyzer, respectively. PCR products were separated on 6% polyacrylamide gels supplemented with EnhancelT polymer (Elchrom Scientific, Switzerland), stained by GelRed (Biotium Inc., Hayward, CA), and scanned on a Typhoon fluorescence imager (GE Life Sciences, Piscataway, NJ). Repeat analyses are summarized in Table 2, compiled from complete small-pool-PCR datasets for each repetition of each experiment

Example 6— mRNA Quantification

RNA was harvested from the HT1080-(CAG)850 ceils at 72 hr after adding or not adding NA using the RNeasy Plus Micro Kit (Qiagen). Total RN A was primed with random hexamers and reverse transcribed with Superscript III (Life Technologies, Carlsbad, CA), followed by treatment with RNase H. Quantitative reverse transcription (RT)-PCR was performed using TaqMan Gene Expression assays or PrimeTime qPCR assays on an ABI PRISM 7900HT Sequence Detection System (Life Technologies) and the primer sequences shown in Table 1. The level of transgene- derived mRNA was normalized to 18S rRNA. The results were statistically analyzed using paired t-tests. Primer sequences for transgene mRNA in the CAG-repeat direction were 5'-AGA GAA TAG GAA CTT CGG AAT AGG - 3' (SEQ ID NO: 21) and 5'-CCA TGT TCA TGC CTT CTT CTT T-3' (SEQ ID NO: 22). The probe sequence was 5'-ACA GCA CAA TAA CCA GCA CGT TGC-3' (SEQ ID NO: 23).

Example 7— NA-DNA Binding

Structures for band-shift assays were made as previously described (Pearson et al, 1997, Hum. Mol. Genet, 6: 1117-23; Pearson et. nL 2002, Nuc, Acids Res., 30:4534-47; Axford et al, 2013, PLos Genet., 9:el 003866), with minor changes. Plasmids containing human DM1 genomic (CTG)n ^»(CAG)n repeats (n = 30 or 50) were linearized by H; ^'«dIII digestion. Homoduplex slipped structure (S-DNA) of 50 repeats were formed by alkaline denaturation/renaturation, as described in Pearson et al. (1996, Biochem., 35:5041-53; 1998, Nuc, Acids Res., 26(3): 816-23). Heteroduplex SI-DNA with long (CAG)20 or (CTG)20 slip-outs were prepared as previously described (Pearsons et al., 2002, Nuc. Acids Res., 30:4534-47; Axford et al, 2013, PLos Genet, 9:el 003866), Briefly, DNAs of (CAG)50 and (CTG)30 repeats, or DNAs of (CAG)30 and (CTG)50 repeats, were mixed in equimolar amount and then heteroduplexed by alkaline denaturation/renaturation. Repeat containing fragments were released by EcoRl digestion, electrophoretically resolved on a 4%

polyacryiamide gel and gel purified. Purified fragments were radiolabeled with [a- ³²P]dNTPs on both strands by fill-in reaction. The radioactivity of each structure was determined using Cerenkov counting, and an equivalent radioactive concentration of each structure was incubated with increasing concentration of NA for 30 minutes at room temperature with IX hypotonic buffer. Binding products were resolved by electrophoresis on a 4% (w/v) polyacrylamide gel in 1 x TBE buffer at a constant 200 V for 2.5 hours. The affinity of each NA molecule/CAG-CAG was estimated as 1.8 x 10 ⁶ M ^-1 as Ka or 0.56 >< 10- ⁶ M as Kd (Nakatam et al., 2005, Nat. Chem. Biol., 1.39-43 ;·.

Example 8— -Replication Assay

For in vitro replication, templates were designed as previously described (Panigrahi et al., 2002, J. Biol. Chem., 277: 13926-34; Cleary et al., 2002, Nat. Genet, 31(l):37-46).

Briefly, genomic clones containing EcoRl/Hindlll (CTG)79●(CAG)79 fragments were subcloned into pBluescript KSII. The SV40-ori was cloned as a blunted Xbal fragment into either the Hindlll or EcoRl sites of pBluescript KSII placing the SV40 origin of bidirectional replication 103 and 98 bp 5' and 3' of the C AG repeat, respectively. These templates containing repeats (pDM79EF and pDM79HF) and another substrate with SV40-ori and no- repeats (pKN16) were replicated in vitro by HeLa cell extract adding [a- ^j2P]dCTP and T- Antigen, as described in detail in Panigrahi et al. (2002, J. Biol. Chem., 277: 13926-34) and Cleary et al. (2002, Nat. Genet, 31(l):37-46). Replication reaction was performed without or with NA (7.5 μΜ and 15 μΜ). Radioactive replication products were purified, linearized with BamHI, and treated with Dpnl (for details see Figure 9). An equal amount of unrepUcated pKNl 6 plasmid DNA was taken and treated with Dpnl to show the complete digestion of unrepUcated plasmid DNA. Equal quantities of reaction products were resolved by electrophoresis on a 15-cm 1% agarose gel. The gel was run for 16 h at 4 V/cm in TBE buffer, dried, and exposed to Kodak film.

Example 9— Repair Assay

To determine how the binding of NA to slipped DNA structures affects repair, a series of circular slipped heteroduplex substrates was made with an excess of repeats with a nick located either 5' or 3' of the slip-out (Figure 3). As described in detail (Panigrahi et al., 2005, Nat. Struct. & Mol. Biol., 12:654-62; Panigrahi et al., 2010, PNAS USA, 107: 12593- 8), substrates were prepared by hybridizing single-stranded circular plasmids with their complementary linearized strand of differing repeat lengths, resulting in nicks located at the sites of linearization. Substrates were identical except for a single slip-out that varied only in the number of excess repeats, or in the location of the nick. In all substrates, the slip-out extruded at a unique point from a fully base-paired backbone of (CTG)n*(CAG)n, where n could equal 30, 47, 48, or SO repeats. G-T mismatched substrate was prepared as described previously (Panigrahi et aL, 2005, Nat Struct & Mol. BioL, 12:654-62; Panigrahi et al., 2010, PNAS USA, 107:12593-8). All these substrates were processed in vitro by HeLa extracts and then were assessed for repair. Repair products were analyzed by Southern blotting, probed for the repeat-containing fragment, and compared with starting material, which permitted quantitative assessment of repair efficiency at a molar level. Each repair assay was performed in three to five independent experiments.

Example 10— R-Loop Formation and Processing

Plasmids bearing an expanded (CAG)79*(CTG)79 repeat tract with convergent T3 and T7 RNA polymerase promoters have been previously described in detail (Panigrahi et al., 2002, J. Biol. Chem., 277:13926-34). Transcription reactions were performed as previously described (Reddy et al., 2014, Nuc. Acids Res., 42:10473-87). Briefly, 500 ng of template DNA in lx transcription buffer (Roche) and lx bovine serum albumin (NEB) were mixed for 1 h with 20 U of the appropriate RNA polymerase: Ί7, T3 or T7+T3 (Roche), with or without NA 120 μΜ. Samples were purified and then treated with either 1 g of RNase A (Roche) alone or with 1 g of RNase A (Roche) and 1 U of K coli RNase H (Roche), at room temperature for 30 min, in the presence or in the absence of NA (120 μΜ). All in vitro transcription reaction products were analyzed on 1% agarose gels run in lx ΊΒΕ buffer at 80 V for 3 h. Gels were subsequently stained with ethidium bromide (0.5 mg/ml) to allow visualization of total nucleic acid under ultraviolet (UV) light

R-loop templates prepared from in vitro transcription and RNase A treatments were incubated with NA and then processed by extracts of HeLa or SH-SY5Y neuroblastoma cells, where the latter were terminally differentiated by retinoic acid, as previously described (Panigrahi et al., 2005, Nat Struct & Mol. BioL, 12:654-62). These cell extracts are functional, processing slipped-strand DNAs formed by CAG/CTG repeats, and are capable of inducing replication-mediated CAG/CTG expansions and contractions. Nucleic acid material was subsequently extracted. Samples were further purified using QIAquick enzyme clean-up kits as per manufacturer's instructions prior to transforming into bacteria for Stability of Trinucleotide Repeats by analysis of Individual Products (STRIP) analysis. The STRIP assay has been previously described in detail (Panigrahi et aL, 2002, J. Biol. Chem.,

277:13926-34; Cleary et al., 2002, Nat Genetics, 31:37-46). Briefly, products of human cell extract processing were transformed into E.coli XLl-MutS (Agilent). Individual bacterial colonies, each representing one processed template, were picked and cultured for a limited growth period (maximum of 6 h, 4-6 generations). Miniprep DNA was analyzed for changes in repeat length by analysis of the repeat-containing fragment on 4% polyacrylamide gels. The magnitudes of repeat length changes were determined by electrophoretic sizing of the repeat-containing fragments on 4% polyacrylamide gels relative to the starting length material and a known set of size markers.

The non-transcribed DNA template (no transcription from T3 or T7 promoter) that was treated with human cell extract and subj ected to STRIP served as the cell extract processing control to assess basal levels of length heterogeneity present in the starting material (instability during preparation in bacteria), as well as any instability that may be incurred by exposure of the fully-paired DNA repeat to the HeLa extract As has been published, some level of length heterogeneity is expected for the DNA template due to its unstable starting length of 79 repeat units, resulting from processing of endogenous DNA damage (potentially including single strand breaks, oxidative damage, nucleotide

mismatches, etc.) by human cell extract, repeat length heterogeneity present in the starting plasmid, as well as from bacterial cuhuring during the STRIP procedure. The tract length heterogeneity in this starting template serves as the background level of tract length instability above which any potential R-loop processing instability must rise. Only values that were significantly greater than background are reported. Instability analysis of products from R-loop processing by human cell extract (A) Percentage of total unstable products following processing. Products were characterized as either stable (having 79 repeats) or unstable (having fewer than or greater than 79 repeats), based on electrophoretic migration and plotted. Data are derived from three independent in vitro transcription and human cell extract processing reactions with ~1 SO colonies (~50 colonies per replicate) representing 1 SO individual products of cell extract treatment analyzed for each R-loop configuration. Individual experiments were compared with each other within a triplicate using the χ2 test to ensure there were no significant differences between experiments and then data were pooled for each experimental condrtioa As described (Reddy et aL, 2014, Nuc. Acids Res., 42: 10473-87), products of R-loop processing were compared to the DNA control processing products using the x ²- test. Percentage of contractions and expansions from processing. Unstable products were further separated into contractions (fewer than 79 repeats) and expansions (greater than 79 repeats) and plotted. The distribution of contractions and expansions were compared between R-loop products and DNA control products using the χ2 test Distribution of unstable products of R-loop processing. Sizes were estimated for each unstable product of processing from electrophoretic migration position relative to known size markers as previously described (Panigrahi et al., 2002, J. Biol. Chem., 277: 13926-34) and plotted.

Example 1 1— MutS beta Binding Assay

MutS beta was purified from baculovirus-infected Sf9 cells expressing his-tagged hMSH2 and hMSH3 as previously described (Panigrahi et al., 2010, PNAS USA, 107: 12593- 8). Binding reactions were performed at room temperature. Heteroduplex SI-DNA with long (CAG)20 was prepared and end-labelled as described above. Protein was incubated with DNA for 30 minutes in a buffer containing lOmM HEPES-KOH pH 7.5, llOmMKCl, lmM EDTA, and lmM DTT with or without ATP in the buffer as indicated. Reactions were loaded onto a 4% native polyacrylamide gel with non-denaturing loading dye (20mM Tris- HC1 pH 7.4, 4% glycerol, bromophenol blue). Gel was run in IX TBE buffer for 2 h.

Example 12— FAN1 Nuclease Assay

FAN1 nuclease was purified from baculovirus-infected Sf9 cells using a double- affinity purification strategy as described previously for PALB2 (Buisson et aL, 2010, Nat Struct Mol. Biol., 17(10): 1247-54) . Briefly, FAN1 was tagged at the N-terminus by a GST- tag and FLAG/His 10-tagged at the C-terminus. Following GST-pull-down, the GST tag was removed by incubation with PreScission Protease and affinity purified on Talon beads (GE Healthcare).

1 pmol of 32P labelled, gel-purified, DNA structures containing a 5'- flap and either no repeats, or a (CAG)20 slip-outs in the flap or in the duplex region (see sequences in Figure 22D-22F) were treated or non-treated with increasing concentration of NA for lOmin at RT, and incubated with 200 nM FAN1 for 30 min at 37°C. The reaction was performed in IX nuclease buffer, as previously described (Liu et aL, 2010, Science, 329:693-6), in 10 μΐ reaction volume. Nuclease reactions were stopped by adding 20 mM EDTA and purified by extraction with phenol/chloroformAsoamyl alcohol (25:24: 1 , v/v/v) following by ethanol precipitation. Reaction products were run on a 8% polyacrylamide gel at 200 V for 1 h. To identify FANl digestion sites, Maxam-Gilbert sequencing (OA and T reactions) of the ³²P labelled oligos was performed. Digestion fragments were separated on a 8% sequencing gel at 2500V and 82W for 75 min.

Fxample 13— Polymerase Extension Assay

Human DNA polymerase beta (Pol beta) was isolated from E.coli clones and purified by ion exchange and affinity chromatography (Chimera, catalog#1077, Lot#2203007).

Protein concentrations were determined using a Bradford assay.

Recombinant human polymerase delta (Pol delta) was prepared in insect cells using a recombinant baculovirus vector and purified by immunoaffinity column chromatography, as described (Zhou et al., 2012, PLoS ONE, 7(6):e39156). Protein concentrations determined using a Bradford assay.

Purification of recombinant RPA complexes were expressed in BL21(DE3) cells and purified as described in Binz et al (2006, Methods EnzymoL, 409:11-38).

Pol delta extension assay was performed as previously described (Mason et al., 2010, Biochem., 49:5919-28), using an oligo containing (CAG)10 repeats. Oligo's sequence and primer condition were previously described (Hagihara et aL, 2011, Chem. Bio. Chem., 12: 1686-9). Briefly, 0.1 uM primer and 0.1 uM oligo were denatured at 95°C for 3 min, annealed for 30 min at RT, and incubated with NA for 30 mia at RT. RPA and/or Pol delta were added and the reactions was started by adding 0.1 mMdNTPs in 10 μΐ reaction volume, and incubated at 37°C for 15 mia The reaction was stopped by adding 20 mM EDTA and purified by extraction with phenol/chloroform/isoamyl alcohol (25:24: 1, v/v/v) following by ethanol precipitation. Pellets were resuspended in formamide buffer, denatured at 95°C for 10 min and run on a 6% sequencing gel at 2000 V and 90 W for 40 mia Example 14— Stereotaxic Injections into R6/2 Mice

Mouse handling and experimental procedures were conducted in accordance with the Osaka University guidelines for the welfare of animals. A single drug application involved six separate stereotactic injections, three injections of drug or saline into three different striatal regions of either the left or right striatum, respectively (outlined in Figure 24). Under sterile conditions, 6-week-old R6/2 mice were anesthetized with SO mg/kg pentobarbital sodium, and stereotaxically injected with 5 ul of saline (right side) or 500 uM NA dissolved in saline (left side) into three different sites of the striatum The mice received injections once or twice bi-weekly, or weekly for 4 consecutive weeks. Stereotaxic injections were delivered to three sites within the striatum with the following coordinates: (anterior-posterior (AP) = 0.0 mm, medial-lateral (ML) = l.S mm from bregma, dorsal-ventral (DV) = 2.5 mm below the dural surface; AP = 1.0 mm, ML = 1.5 mm, DV = 2.5 mm; and AP = 0.5 mm, ML = 1.5 mm, DV = 2.5mm), using a 10-ul Hamilton microsyringe at a rate of 0.5 uL/min.

Refer to Figure 24 outlining dosing regimen. Each mouse had an internal control, as both sides of the brain were stereotaxically injected with either saline, in the right striatum, and NA, in the left striatum The control of our NA treatment is therefore the contralateral side of the striatum of each mouse. Both right and left striatum were assessed for repeat length at the HD CAG transgene and at endogenous, long C AG tracts. For the mice treated four times, DNAs from the tail prior to and following NA administration, as well as left and right frontal cortex and left and right cerebellum were harvested.

Example 15— Genescan Analysis

At 4 weeks after the first injection, DNA was isolated from mouse brain tissue as previously described (Nakamori et al. 2009, Neuromuscul. Disord., 19:759-62). PCR was performed as described previously (Tome et al., 2013, PLoS Genet, 9: el 003280), and PCR products were sized on an ABB 10 Gene Analyser using GENESCAN 3.1 software (Life Technologies).

Example 16— Instability Index Calculation

The procedure for instability index calculation was done as previously described (Lee et al., 2010, BMC Syst BioL, 4:29) and is illustrated and outlined in Figure 27A Example 17— ΗΡΚΓ1 Sequence Analysis bv SMRT-CCS Sequencing

DNAs were extracted from three independent NA treatments of the HD primary fibroblast cell line GM09197 and from three independent untreated control cells. HPRT1 exons 2 and 3 were amplified using the high fidelity Platinum Taq DNA Polymerase

(Invitrogen, Cat #11304) and the primers shown in Table 1. Amplification products were analyzed using single molecule, real-time (SMRT) sequencing on the PacBio RSI1 instrument (see Figure 13 A).

Specifically, a -2.8 kb PCR product spanning exons two and three of HPRT1 was sequenced for three replicates of male HD patient-derived cells in both NA- and saline- treated colonies on a Pacific Biosciences (PacBio) RSII with P6-C4v2 chemistry, 360 minute movies, and one SMRT cell per replicate/treatment combination. Individual molecules from the original cell colony were represented by single long reads from the corresponding SMRT cell. PacBio 's SMRTbell construct allows a single molecule to be sequenced one or more times in a single long read based on the number of "passes" the sequencing polymerase makes around the SMRTbell insert An average of 99,023 distinct reads were sequenced per SMRT cell The resulting long reads contained an average of ~7 passes through the insert sequence. The average median insert length per SMRT cell was 2,873 bp. Using PacBio' s "reads of insert" pipeline (SMRT Analysis 2.3.0), high-quality consensus sequences were created for each long read that contained more than 18 copies of the original molecule and eliminated consensus sequences with either an incomplete PCR molecule (<2.S kbp) or with incorrectly detected PacBio adapters that resulted in overly long consensus sequences (>3 kbp) (Figure 13C). Based on these filters, an average of 1,343 consensus sequences per replicate/treatment pair were produced with a range of up to 2,402 sequences. Consensus sequences were aligned to the corresponding region of the human reference (GRCh38, chrX: 134,473,062- 134,475,958) with BLASR (SMRT Analysis 2.3.0; parameters: - affineAlign -affineOpen 8 -affineExtend 0 -bestn 1 -maxMatch 30 -sdpTupleSize 13 -m 5) and all single base pair differences were identified between sequences and the reference (mismatches, insertions, and deletions) for sequences with full-length alignments (2.5-3 kbp). The average alignment identity of the consensus sequences was 99.698% +/- 0.3321% SD with a range of 94.7679% to 100%. To estimate the mutation rate for each replicate/treatment combination, the number of times a specific variant occurred was calculated between high-quality PacBio consensus reads at a specific position in the human reference (GRCh38) divided by the total number of high-quality reads aligned across each position. For each replicate, the relative mutation rate was calculated between NA- and saline-treated cells as the mutation rate for NA-treated cells minus the rate for saline-treated cells and identified excess mutation rates based on an absolute relative rate X).5%. Of the 15,277 variants identified across all three replicates, there were extreme mutation rates in 149 (1.0%) NA-treated sequences and 101 (0.7%) saline-treated sequences. Across the 2,897 positions assessed in the human reference, extreme relative mutation rates were identified at 113 distinct sites in treated sequences (3.9%) and 84 sites in control sequences (2.9%).

Mutation rates were not significantly different between NA- and saline- sequences for variants with extreme relative mutation rates (p = 0.1083, two-sample Kolmogorov- Smirnov test) (Figure 13B).

Example 18— Results

A CAG- specific DNA binding compound, Naphthyridine-.4zaquinolone (NA) (Nakatani et al., 2005, Nat Chem. Biol., 1:39-43; Hagihara et aL, 2006, Nuc. Acids Symp. Ser. (Oxf), 50:147-8; Hagihara et al., 2011, Chembiochera, 12:1686-9) previously was designed. Detailed characterization revealed NA bound a distorted intra-strand CAG hairpin, where the naphthyridine and azaquinolone moieties in NA exhibit complementary hydrogen bonding to guanine and adenine, respectively, causing two cytosine bases to flip-out from the CAG hairpin (Figure 2A and 2B). NA bound with high affinity and increased the melting temperature of (CAG) 10 hairpins by >30°C (Hagihara etal., 2011, Chembiochenx, 12:1686- 9). NA also bound with a preference for longer CAG hairpins, where the greatest binding was to (CAG)30, followed by binding for (CAG)20, and (CAG)l 0 (Nakatani et al., 2005, Nat. Chem. Biol., 1:39-43). This remarkable selectivity of NA for longer CAG hairpin structures coupled with the presence of slipped-DNAs at a mutant repeat locus (Axford et al., 2013, FLos Genet, 9: el 003866) may permit targeting of the expanded allele.

To determine if NA can bind to disease-relevant (CAG)«(CTG) duplexes with and without slip-outs, band-shift analysis was performed using slipped-DNAs, which are models of mutagenic intermediates of repeat instability (Pearson et aL, 2002, Nuc. Acids Res., 30:4534-47). Detailed biophysical characterization of these in vitro structures show that they reflect the slipped-DNAs recently identified at the mutant DM1 locus in DM1 patient tissues (Axford et al., 2013, PLos Genet, 9: el 003866). NA did not bind to fully-duplexed DNA fragments containing (CAG)50●(CTG)50 repeats, and high concentrations of NA did not induce structural alterations of the fully-duplexed repeat (Figure 2C-2D). The same

(CAG)50»(CTG)50 molecules could be induced to harbor clustered short slip-outs, called S- DNAs, which comprise a heterogeneous mixture of molecules, each containing 2-62 short slip-outs of mostly 1 to 5 repeat units each, extruding at various locations from both strands along the repeat tract (Pearson et al., 1998, Nuc. Acids Res., 26:816-23; Panigrahi et al., 2010, PNAS USA, 107:12593-9). S-DNAs can bind NA to a limited degree, but their short slip-outs and electrophoretic heterogeneity may not permit detection of a single-shifted species. Slipped-heteroduplexes with an excess of twenty CAG repeats in the slip-out of (CAG)50»(CTG)30 were extensively bound by NA (Figure 2C-2D, see also Figure 7). NA- binding caused a band-broadening, previously observed for other DNA-binding ligands (Neilsen et al., 1988, Biochem., 27:67-73; Carlsson et al., 1995, Nuc. Acids Res., 23:2413- 20; Barcelo etal., 1991, Biochem., 30:4863-73; Fox etal., 1988, Nuc. Acids Res., 16:2489- 507; Fox & Woolley, 1990, Biochem. Pharmacol., 39:941-8). The progressively slower migration of the slipped-DNA in the presence of NA is consistent with NA binding to additional (CAG)●(CAG) pairs in the hairpin (Figure 7; Nakatani et al., 2005, Nat Chem. Biol, 1 : 39-43). In contrast, heteroduplexes with long slip-outs of CTG repeats,

(CAG)30*(CTG)50, were not bound by NA (Figure 2C-2D, the multiple panels of Figure 2C were all derived from a single gel, and were separated for clarity). Thus, NA bound CAG but not CTG slip-outs. Binding was quantitatively specific for the CAG slipped-DNA structures (Figure 2D). Denaturation of purified slipped (CAG)50*(CTG)30 to isolated (CAG)50 and (CTG)30 single-strands (without renaturation) in the presence of NA also showed NA binding specifically to the single-stranded (CAG)50 strand (Figure 2E, left panel).

Renaturation of individual (CAG)50 and (CTG)30 single-strands in the presence of NA revealed that NA did not block the ability of complementary strand hybridization, but did bind specifically to the s lipped-out CAG strand and not the CTG strand (Figure 2E, right panel). The degree of electrophoretic shift of the isolated single-stranded (CAG)50 tract is greater than for the heteroduplex (CAG)50*(CTG)30, which has a slip-out of only 20 excess CAG repeats, whereas, once rendered single-stranded, the full SO C AG repeats is forced into a long hairpin of all SO repeats (Figure 2, compare panels E) - which can bind more NA molecules. The CTG hairpin strand is not bound by NA (Figure 2C-2E). Binding was significantly better for longer than shorter CAG slip-outs (Figure 8). Based upon these results and the established binding mode of NA, a ratio of 2 NA molecules to 1 CAG-CAG was previously estimated, wherein a slip-out of (CAG)20 would bind 20 NA molecules. Together, these findings support the structure-specificity of NA for long CAG repeat slip- outs of disease-relevant tract lengths, but do not support the contention that NA will induce slip-out extrusion from an expanded fully duplexed molecule.

NA blocks processing of slipped-DNAs, suggesting it may modify repeat instability. NA specifically inhibits repair of slipped-DNAs with long (CAG)20 slip-outs, but not (CTG)20 slip-outs (Figure 3A-3C), consistent with its designed specificity for CAG hairpins (Figure 2). That NA blocked the repair of a (CAG)20 slip-out, is consistent with the inability of human and other DNA polymerases to extend primers along NA-bound (CAG) 10 - (CAG)20 templates. The repair of long (CAG)20 slip-outs is independent of mismatch repair (MMR), indicating that the effect of NA does not require an involvement of MMR. In contrast, single repeat slip-outs, too small to bind NA, require the mismatch repair MutS beta (MSH2-MSH3) complex. The effects of NA upon repair of a single extra CAG repeat slip- out, too small to be bound by NA, were assessed. Repair of a single repeat CAG or CTG slip-out was unaffected by NA, indicating that MutS beta function in this process was unaffected by NA (Figure 3D).

To explore the specificity of NA for CAG/CTG slipped-heteroduplexes, the effect of NA on the repair of the most frequent base-base mismatch, a G-T mispair, was assessed. Repair of this mispair depends upon the mismatch repair MutS beta complex (MSH2- MSH6). NA had no effect upon G-T repair, further indicating that NA does not block MMR and is unlikely to cause genome-wide mutations known to occur in the genetic absence of MMR (Figure 3E). Thus, the preference of NA to block repair of long CAG slip-outs may forcast the ability to modulate repeat instability.

To address questions of potential therapeutic use of NA, it was determined that NA is cell-permeable and can enter the nuclei without causing acute cytotoxicity or slowed proliferation to human cells (Figure 2E-2G) or DNA replication (Figure 8). Treatment of HD patient cells with NA induced contractions of expanded repeats. NA is cell-permeable and can enter the nuclei without causing acute cytotoxicity or slowing proliferation of human cells (Figure 4A-4C) or altering DNA replication (Figure 9). Tract lengths >200 repeats are frequent amongst brain cells that have incurred somatic expansions in HD individuals that inherited the common mutations of (CAG)40-50. NA induced a significant shift in repeat population towards contraction of a (CAG)18O(CTG)180 tract at the Huntington's disease locus in primary fibroblasts derived from an HD patient NA enhanced the number of contractions of the expanded HD repeat (Figure 4D; P=7.25E-06, Figure 10, repeat analyses are summarized in Table 2, compiled from complete small-pool- PCR datasets for each repetition of each experiments), incurring significant losses of repeats (Figure 4F; P=0.0003). NA also reduced the number of expansions of the expanded HD repeat (Figure 4D; P=4.339E-05, Figure 10, Table 2). An HD patient fibroblast also was treated with an expanded allele of 43 CAG repeats, a mutation common to the majority of patients. A significant number of NA-treated cells incurred contractions of the expanded repeat, with contractions as low as 20 repeats, below the HD disease threshold of 35 units (Figure 4E & 4G; P=3.28E-03, Figure 11, Table 2). NA also reduced the number of expansions (Figure 4E; P=8.32E-05, Figure 4E; Figure 11, Table 2). Thus, NA can induce contractions of expanded tract lengths common to inherited and somatically expanded alleles. In contrast, NA does not affect either the non-expanded HD repeat or other microsatellite repeats known to be prone to instability under stressed conditions (Figure 4H & Figure 12), suggesting that NA is specific for the expanded CAG/CTG repeat and will not deleteriously affect other repeats. As an additional control for the specificity of NA to slipped-DNA structures, the effect of NA was assessed at the very long, but not genetically unstable, CAG tract of the TBP gene in the NA-treated HD patient cells. The normal range of repeats in TBP is 25-41, while stretches of >49 repeats cause fully-penetrant disease (SCA17) and stretches of 42-48 are lead to reduced penetrance. In the HD patient cells, the non-mutant TBP gene has lengths of 38 and 34 repeats, above and just below the HD disease CAG length threshold of 34/35 repeats. Importantly, NA would only be expected to have an effect if the TBP repeat were actively undergoing events of instability that would lead to the formation of slipped-DNA structures - to which NA binds. Using established spPCR methods, NA did not change the length distribution of the TBP repeats (Figure 12C, Table 2). Thus, the fact that NA has no effect upon the large, but not mutant repeat lengths at the TBP gene (Figure 12C, Table 2), but did affect mutant HD tracts of (CAG)43-180 (Figure 4D-4G), supports the specificity of NA for repeats actively undergoing instability, presumably involving slipped- DNAs. The inability of NA to affect a non-expanded repeat tract is likely due to the poor ability of short tracts to form slipped-DNAs, which is consistent with the absence of these structures at non-expanded loci. Moreover, sequence analysis of up to 2,402 individual alleles of the HPRT1 gene (exons 2 to 3), often used as a surrogate of mutation induction, did not show sequence changes for cells treated with NA (Figure 13 - see panel B) arguing against NA being a general mutagen. This finding is consistent with the inability of NA to affect base-base mismatch repair (Figure 3E). Together, these results suggest that NA is specific for expanded repeat lengths that are commonly inherited and lengths that are frequent amongst brain cells that incurred somatic expansions, the effect of NA is specific for tracts that are actively unstable and will not deleteriously affect other repeats or sequences.

NA induced a significant shift in repeat population towards contraction of a

(CAG)850*(CTG)850 tract in human cells, expressing r(CAG)850 (Nakamori et al., 2011, Human Mol. Genet, 20:580-8) (Figure SA and SB, and Figure 14; P=4.78E-05, repeat analyses are summarized in Table 2). NA significantly shortened the average size of the repeat tract, where the magnitude of repeat units lost was as great as 790 repeats (Figure 4C; P=6.44E-03). The effect of NA was independent of cell proliferation and DNA replication (Figure SD, Figure IS), consistent with NA's lack of an effect upon cell proliferation (Figure 4C) or replication progression (Figure 9). The effect of NA depended upon transcription across the repeat (Figure SE, Figure 16A-16C), yet NA did not alter transcription across the expanded repeat (Figure SF). NA does not affect non-expanded CAG/CTG tracts or other microsatellite repeats (Figure SG and Figure 12). Together, these results suggest that NA is effective independent of proliferation and dependent upon transcription across the repeat

Transcriptional ly-induced R-loops can lead to CAG/CTG instability. The

transcription dependency of NA upon repeat instability that was observed above (Figure SE) might suggest NA may affect either transcription, R-loop formation, biophysical stability of R-loops, and/or R-loop processing to instability. These alternatives were tested and found that NA did not alter transcript levels of the expanded repeat in cells (Figure SF, Figure 16A- 16C), nor did it affect transcription in vitro (Figure 17B). NA (120 uM) did not affect R- loop formation (Figure 17B), nor RNaseA or RNaseH processing of pre-formed R-loops (Figure 17C). At NA concentrations similar to that used on cells (SO uM) or higher (500 uM), R-loops were still detectable suggesting that NA did not affect the biophysical stability of R-loops (Figure 17D). However, NA altered the processing of CAG/CTG R-loops by extracts of neuron-like human SH-SYS Y cells, significantly increasing the number of contraction products but not expansion products (Figure 18C, p=0.002). Results from these model systems suggest NA causes preferential R-loop processing to repeat contractions.

A plausible mechanism for NA to enhance contractions of expanded repeats is through the aberrant repair of slipped-DNAs that arise from transcription-induced R-loops - a path that is supported by the data presented herein as well as other reports (Nakamori et al. , 2011, Human Mol. Genet, 20:580-8; Lin et al., 2010, FNAS USA, 107:692-7; Reddy et al., 2014, Nuc. Acids Res., 42: 10473-87). As outlined in Figure 19, during transcription of the CTG strand to produce a C AG transcript, as occurs at the HIT gene, there is a predisposition to form slipped-DNAs on the displaced CAG DNA strand, NA may bind the intra-strand hairpins formed on the displaced dCAG strand and possibly further extend these to longer hairpins (Figure 19A). These NA-bound slipped-DNAs would escape repair and lead to contractions. When retaining or removing slip-outs, by endonucleolytic incisions (Hou et al., 2009, Nature Struct Mol. Biol., 16:869-75; Pluciennik etal., 2013, FNAS USA, 110, 12277- 82) (see arrows in Figure 19B), DNA repair synthesis on NA-bound (CAG)n templates will be blocked (Hagihara et al., 2006, Nuc. Acids Symp. Ser. (Oxf), 50: 147-8; Hagihara et al., 2011, Chembiochem., 12:1686-9), thereby favouring contractions over expansions. This model is consistent with NA inducing repeat contractions in the absence of DNA replication, in a transcription-dependent manner, where NA exerts its effect via perturbing repair of repeat structures (Figures 3-5).

There are many paths by which repeat instability can arise, and various ways through which NA may act (DNA replication, transcription, epigenetic changes, DNA-damage, etc), all of which involve slipped-DNAs. That the effect of NA upon CAG instability was independent of proliferation/replication (Figure 5D) and dependent upon transcription through the repeat (Figure 5E), coupled with the dependence of CAG instability upon transcription, the formation of slipped-DNAs through R-loop processing, the likely involvement of slipped-DNAs in instability, the specificity of NA for CAG hairpins (Figure 2), and NA's ability to block repair of CAG contraction intermediates (Figure 3B) all support this proposed model.

Might NA act by blocking the interaction of DNA repair proteins with slipped- DNAs? While the proteins required for large slip-out repair are not known, towards testing this hypothesis we assessed the effect of NA upon four candidate proteins MutS beta, FANl, and RPA-pol delta. Many mouse models demonstrate that the mismatch repair MutS beta complex with a functioning ATPase, drives CAG/CTG expansions. While NA blocks the repair of a large CAG slip-out of 20 excess repeats, a process that is independent of MutS beta, we, and others have suggested that MutS beta may be involved in the formation of slipped-DNAs which subsequently lead to expansions. This process, expected to involve MutS beta binding to DNA, may be affected by NA Here, it was found that MutS beta can bind long CAG slip-outs repeats, a complex that can be dissociated by ATP (Figure 20), similar to previous reports. NA did not block binding of MutS beta to CAG slip-outs or block ATP-mediated dissociation of this complex (Figure 20). Since NA does not block MutS beta from binding slip-outs, and since NA does not disallow formation of slip-outs from denatured repeat-containing DNAs (Figure 2E), does not support a role of NA in blocking the formation of slipped-DNAs either alone or enhanced by MutS beta. MutS beta, if involved in the action of NA upon repeat instability, likely acts upstream, possibly forming slipped-DNAs during transcription, as suggested.

FAN1/KIAA1018/MTMR15, initially identified in brain, is a DNA repair nuclease with a preference for DNA structures. FANl was recently identified as the top in several searches for modifiers of age-of-onset of HD and five other CAG diseases (SCA1, SCA2, SCA3, SCA7, & SCA17). While the manner by which FANl modulates age-of-onset is unknown, it could be speculated to do so by modulating somatic repeat instability. The action of FANl upon repeat DNAs was tested, for which there is no precedent As previously demonstrated, FANl can cleave super coiled and linear DNAs (Figure 21, left panel), and can cleave transcription-induced R-loop containing DNAs - thereby releasing the supercoil-dependent RNA portion (Figure 21, right panel). Slipped-DNAs may arise from misaligned DNA annealing following loss of the RNA from R-loops. FANl cleavage was mapped on three different slipped-DNA junctions: a control junction devoid of repeats; one with a CAG hairpin in the flap; and one with a CAG hairpin in the duplex region beyond the junction (Figure 22, top panels; Table 1). As with other studies, FAN1 cuts in the 5' single- strand flap and in the duplex beyond the flap. Cleavage at both sites is approximately equal for the control substrate (Figure 22A & 22D). However, for the CAG-substrates, cleavage is reduced at the duplex and hairpin regions (Figure 22B-22C & 22E-22F, denaturing gels, see arrowheads). This suggests that the presence of a CAG hairpin may affect FAN1 activity. Inclusion of increasing amounts of NA in the reaction markedly reduced nuclease activity on the CAG in-duplex substrate, specifically affecting cleavage proximal to and within the CAG hairpin (Figure 22C & 22F). The demonstration herein that FAN1 cleavage of a slipped- CAG substrate can be altered by NA, a drug that has been shown to modify CAG instability (Figures 3, 5 & 6), supports the possibility that NA can modify the action of a DNA structure-specific nuclease, FAN1. The manner by which FAN1 may modulate HD age-of- onset is a focus of future studies.

Replication Protein A (RPA), a key player in many DNA repair processes, inhibits the formation of unusual DNA structures, like hairpins, through binding and stabilizing single-stranded regions. RPA has been reported to enhance DNA polymerase progression through difficult DNA templates by melting the structured template. The effect of NA upon RPA-binding to slipped-DNAs was assessed and upon enhancing progression by polymerase delta (pol delta, a polymerase implicated in CAG repeat instability) and active in brains in a repair capacity. NA competitively blocked the interaction of RPA to slipped-CAG repeats (Figure 23 A). Pol delta was unable to synthesize across a CAG tract and this was rescued by the addition of RPA, likely through its ability to melt the impeding hairpin (Figure 23B, lanes 2-4). NA blocked the enhanced progression of pol delta along the CAG template (Figure 23 A, lanes 5-6), a result consistent with NA binding competitively against RPA for the CAG tract (Figure 23 A). These results support a mechanism of NA inducing CAG contractions through an inability of polymerases to synthesize through NA-bound CAG templates (Figure 23C). This result supports a process by which NA may induce contractions (see lower portion of Figure 19). Together, our FAN1, pol delta, and RPA results provide proof-of- principle supporting the concept that NA may act by disrupting interaction or activity of repair proteins on CAG slip-outs. Defining the exact players and steps that NA can alter to induce CAG contractions is to be determined. Would NA be effective in vivo in neural tissues that show rampant CAG expansions and are susceptible to degeneration? In both mice and patients, the largest CAG expansions and most degeneration occur in the striatum (Lopez Castel et al., 2010, Nature Reviews, Mol. Cell. Biol, 11:165-170; Kennedy et al., 2003, Human MoL Genet, 12:3359-67; Goula et al., 2009, FLoS Genet, 5: el 000749; Larson etal., 2015, NeurobioL Dis., 76:98-111; Kovalenko et al., 2012, FLoS One, 7:e44273), with the medium-spiny neurons being the most vulnerable and incur the greatest CAG expansions (Kovalenko et al., 2012, PLoS One, 7:e44273).

Therefore, the effect of NA on the instability of the expanded (CTG)150 in Huntington's disease R6/2 transgenic mice was tested, each inheriting around 150-160 CAG repeast, focusing upon the striatum (Goula et al., 2009, PLoS Genet, 5:el000749; Larson et al., 2015, Neurobiol. Dis., 76:98-111), where each mouse was injected with either NA+saline on the left striatum or with saline only on the right, as an internal control, spanning 4- weeks (Figure 24). CAG expansions are evident in the striatum of these mice as early as a few weeks, which continue as the mice age (Goula et al., 2009, PLoS Genet, 5:el000749; Larson et al., 2015, NeurobioL Dis., 76:98-111). In one model, the CAG repeats have been quantified to gain broadly distributed sizes of additional repeats at a rate of -3.5 CAG unit per month per cell (Lee et al., 2011, PLoS One, 6:e23647; Mollersen et al., 2010, PLoS Genet,

6:el001242). Repeated intra-striatal injections of NA (dissolved in saline) in 6-week-old R6/2 mice led to seeming contractions of the expanded repeat, relative to saline-only injections into contralateral striatum of the same mice (Figure 6 A, compare blues scans NA- treated with red scans saline-treated striatal portions, injections were once, twice, or four administrations spanning four weeks when DNAs were harvested (Figure 24)). The effect of NA was localized to the injection site, so only the NA-injected (blue scans), but not the saline injected striatal regions (red scans), showed repeat contractions, thereby permitting the saline injected half of the same mouse brain to serve as an internal control. In the absence of any treatment, for a given mouse the CAG length distributions between left and right halves of the striatum were indistinguishable. That the NA (in saline) treated half has shorter CAG tracts than the saline only treated half supports the interpretation that NA induced the CAG length differences (Figure 6A).

Continued contractions were induced with additional NA administrations over a four- week period (this was highly reproducible for a total of 13 mice (one for 1 injection, two for 2 injections, and ten for 4 injections), see Figure 25A-25H, NA injection did not alter brain morphology, Figure 26). The effect of NA is consistent between mice, regardless of inherited repeat length differences between mice. A bimodal distribution of repeat sizes is present in HD patient brains. This second peak of larger expansions is evident in the striatum of most CAG mouse models and was found to consist of the most vulnerable HD cells, the medium spiny neurons and to have greater levels of expanded CAG transcript Interestingly, N A had a greater effect upon the larger expansions of the bimodal repeat distribution (Figure 6D). This portion of larger sized expansions in the striatum may have arisen by mutation events involving large slip-outs, large enough to be bound by NA. Instability analyses in HD individuals suggests that the number of CAG units gained or lost at each mutation event are predominantly changes of one repeat unit, but may include changes of 5-15 repeat units, sizes that could be bound by NA. It is noteworthy that slipped-DNAs at the DM1 locus, isolated from various tissues of DM1 patients, including brain, presented a bimodal distribution of slip-out sizes, with peaks of -30 and <10 repeats, where the former could be bound by NA.

Most, if not all, alleles in the NA-treated striatum had incurred repeat contractions, indicating that NA had affected most cells (Figure 6 A, Figure 25A-H). Using an established method, the somatic "instability index" was quantified: greater indices reflect greater expansions, lower indices reflect lower expansions or greater contractions (Figure 27). The number of contracted versus expanded peaks (Figure 6B) or the relative composition (%) of contracted versus expanded peaks (Figure 6C) was greater with subsequent treatments. The repeat size distributions in striatum treated four times with NA were significantly different from the mock-treated striatum (Mann-Whitney, p=0.00034, or paired t-test, p-0.000019). The effect of NA was localized to the site of injection, as the CAG tract in the cerebral cortex and cerebellum from the same mice that had intra-striatal injections showed identical patterns of CAG length heterogeneity in the right and left sides (Figure 6C and 6D, Figure 25A-25H).

That NA induced contractions, rather than only arresting expansions, is supported by the fact that the peak repeat length in the NA-treated striatum was shorter than the inherited tract length in the tail of the same mouse, which does not change through the life of the mouse (Figure 5D and Figure 25). Importantly, for a given mouse, the repeat size distributions in the half of the striatum treated with NA were shifted towards contraction, relative to the inherited length in the tail, while the saline-treated half of the striatum continued to incur expansions (Figure 6B-6E, Figure 25A-25H, see delta of main peaks where the saline - versus the NA-treated versus the tail CAG lengths differ by 4-7 CAG units). This suggests that injections of NA into the striatum induced contractions of the expanded CAG tract, and can do so countering the expansion-bias in that tissue (showing a spontaneous expansion rate of -5.75 CAG units/month), a finding that is consistent with the NA-induced contractions observed in cells (Figure 3 and Figure S). The effect of NA in vivo was specific for the expanded repeat, having no effect upon non-expanded repeats (Figure 6E and 12D). The effect of NA likely involved genome maintenance rather than DNA replication, as the 6-week-old treated mice had passed the postnatal 'brain growth spurt', where replication peaks 6-17 days after murine birth. It is likely that it involved

transcription as CAG-RNA levels were greatest in striatal cells with the largest expansions and NA did not alter transcription across the repeat (Figure 16C). This is consistent with NA inducing contractions independent of cell proliferation (Figure SE). Thus, NA could induce contractions of expanded CAG repeats in most cells in a tissue, which, in HD individuals, show selective neuronal vulnerability.

These findings in mice with about 1 SO CAG repeats over a four-week period of NA treatment shows neurons en masse incurred contractions of ~0.5 repeats lost/week.

Extrapolating this to an HD-affected human, applying a drug like NA prior to the rapid onset of somatic CAG expansions could effectively block expansions and induce contractions of the inherited expanded allele to shorter lengths, where treatments spanning one year could contract the repeat by 5-25 repeats. For an above average HD allele of 60-70 repeats, this could be significant and modify disease onset and progression.

Data from diverse approaches using multiple model systems led to harmonious interpretations. For example, the NA-enhanced contractions in various cell models are consistent with the increased contractions in vivo, and the absence of an effect of NA at replication forks and an independence from replication but a requirement of transcription are all consistent with the absence of a requirement of proliferation in cells or striatum where transcription occurs. In addition, that NA affects R-loop processing to contractions is consistent with NA affecting instability in cell lines and striatum undergoing transcription producing rC AG-dCTG R-loops, displacing the CAG DNA strand and allowing it to be bound by NA. Further, that NA binds to and inhibits repair of long, but not short, CAG slip- outs is consistent with NA preferentially affecting the larger repeat sizes in striatum, which undergo large saltatory expansions.

Previous studies using cell models of CAG/CTG instability demonstrated that exogenously added compounds can modulate levels of repeat instability (Yang et aL, 2003, Am J. Hum Genet, 73:1092-105; Gomes-Pereira et al., 2006, Mutat Res., 598:15-34). However, the effective compounds were DNA damaging agents lacking specificity for the expanded repeat (emylmethanesulphonate, ethidium bromide, mitomycin C and DNA polymerase inhibitors), and, thus, would induce deleterious mutations throughout the genome. In a separate strategy, a (CAG)6 antisense oligonucleotide was able to reduce instability of a (CTG)800, but did not induce contractions of an expanded repeat (Nakamori et al., 2011, Mol. Ther., 19:2222-7).

A rnitochondria-directed compound, XJB-5-131, by unknown processes, was reported to mildly suppress CAG expansions rather than induce contractions; this compound also suppressed contractions. A sequence-specific polyamide directed to the duplex GAA repeat that is expanded in Friedreich's ataxia patients prevents GAA triple-stranded structure formation and suppressed GAA repeat expansions in FRD A cells, but did not induce contractions. CRISPR/Cas9 treatment of HD cells, to target the mutant allele, deleted ~44 kb DNA spanning promoter region, transcription start site, and the CAG expansion of the mutant HIT allele, resulting in haplo-insufficiency with a functional non-mutant allele. A small molecule approach may overcome some of the in vivo hurdles (delivery, specificity, etc.) of enzyme-mediated paths.

In contrast to these previous approaches, our results indicate that NA, a DNA ligand that binds slipped-strand DNA structures comprised of CAG repeats, can shift the dynamic of repeat instability in favor of contractions rather than further expansions. NA likely acts by inhibiting cycles of aberrant repair of slipped-DNA, possibly by perturbing the interaction of DNA repair proteins with the slipped-DNA. Other nucleic acid binding compounds demonstrated to be effective towards attacking deleterious repeat r(CUG) RNA-protein interactions or CAG/CTG transcription (Bernat et al. , 2015, Neuron, 87:28-46) may also affect repeat instability. The attributes of NA - its sequence- and structure-specificity, its preferential effect upon expanded repeat harboring slipped-DNAs, its ability to induce contractions in vivo in an affected brain region - make it a first-in-class example showing the potential of small molecule DNA-binding compounds to impact CAG repeat instability by inducing contractions and inhibiting expansions of disease-causing expanded repeats.

Administration of such small molecules, once optimized for therapy, to human brains might effectively target the root cause and address all downstream effects caused by the expansion mutation.

Table 2. Results from Small Pool PCR

Data summarized from analysis of HD patient fibroblasts (GM09197; (CAG)180 and GM02191, (CAG)43), and HT1080-(CAG)850, presented in Figures 5, 6, and Figures 10 and 11, 14, and 15.

5 ^■ A cut-off point of ± 25 repeats was used to determine expansion and contraction.

b P-values were calculated using the χ ² test to compare the proportions of expanded, unchanged, and contracted alleles within the population of HT1080-(CAG)850 cells.

c for all alleles (expanded + unchanged + contracted), the average change in the repeat size is expressed as the number of repeats. Note that the average change in the repeat size was 10 biased toward contraction because of the preferential amplification of shorter alleles by small pool PCT.

d A cut-off point of ± 10 repeats was used to determine expansion and contractions. ^e P- values were calculated using the χ test to compare the proportions of expanded, peak repeat, and contracted alleles within the populations of HD fibroblasts.

f P-values were calculated by Student's t test

It is to be understood that, while the methods and compositions of matter have been described herein in conjunction with a number of different aspects, the foregoing description of the various aspects is intended to illustrate and not limit the scope of the methods and compositions of matter. Other aspects, advantages, and modifications are within the scope of the following claims.

Disclosed are methods and compositions that can be used for, can be used in conjunction with, can be used in preparation for, or are products of the disclosed methods and compositions. These and other materials are disclosed herein, and it is understood that combinations, subsets, interactions, groups, etc. of these methods and compositions are disclosed. That is, while specific reference to each various individual and collective combinations and permutations of these compositions and methods may not be explicitly disclosed, each is specifically contemplated and described herein. For example, if a particular composition of matter or a particular method is disclosed and discussed and a number of compositions or methods are discussed, each and every combination and permutation of the compositions and the methods are specifically contemplated unless specifically indicated to the contrary. Likewise, any subset or combination of these is also specifically contemplated and disclosed.

Previous Patent: LBT PARAMETERS FOR UPLINK IN UNLICENSED SPECTRUM

Next Patent: PROCESS FOR REDUCING THE AMOUNT OF CARBON DISULPHIDE (CS 2) IN A HYDROCARBON FEED