Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
COMPOSITIONS AND METHODS FOR TREATMENT OF MICROSATELLITE DNA EXPANSION DISORDERS
Document Type and Number:
WIPO Patent Application WO/2023/225495
Kind Code:
A2
Abstract:
The present disclosure provides single- or double-stranded interfering RNA molecules (e.g.,siRNA) that target a MutS Homolog 3 (MSH3) gene. The interfering RNA molecules may contain specific patterns of nucleoside modifications and internucleoside linkage modifications, as pharmaceutical compositions including the same. The siRNA molecules may be branched siRNA molecules, such as di-branched, tri-branched, ortetra-branched siRNA molecules. The disclosed siRNA molecules may further feature a 5' phosphorus stabilizing moiety and/or a hydrophobic moiety. Additionally, the disclosure provides methods for delivering the siRNA molecule of the disclosure to the central nervous system of a subject, such as a subject identified as having Huntington's Disease.

Inventors:
GALLANT-BEHM CORRIE (US)
HASSLER MATTHEW (US)
CURTIS DANIEL (US)
GODINHO BRUNO MIGUEL DA (US)
Application Number:
PCT/US2023/067039
Publication Date:
November 23, 2023
Filing Date:
May 16, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
ATALANTA THERAPEUTICS INC (US)
International Classes:
A61K31/712; C12N15/11
Attorney, Agent or Firm:
BELLIVEAU, Michael, J. et al. (US)
Download PDF:
Claims:
Claims

1 . A small interfering RNA (siRNA) molecule comprising an antisense strand and sense strand having complementarity to the antisense strand, wherein the antisense strand is from 10 to 30 nucleotides in length and has complementarity sufficient to hybridize to a region within a MutS Homolog 3 (MSH3) mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

2. The siRNA molecule of claim 1 , wherein the antisense strand has at least 70% complementarity to a region of 21 contiguous nucleobases within the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

3. The siRNA molecule of claim 2, wherein the antisense strand has at least 75% complementarity to a region of 21 contiguous nucleobases within the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408, optionally wherein the antisense strand has at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% complementarity to the region within the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID Nos: 1-408.

4. The siRNA molecule of any one of claims 1-3, wherein the antisense strand comprises at least 10, at least 11 , at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21 , at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, or 30 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

5. The siRNA molecule of claim 4, wherein the antisense strand comprises from 10 to 30 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

6. The siRNA molecule of claim 5, wherein the antisense strand comprises from 12 to 30 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

7. The siRNA molecule of claim 6, wherein the antisense strand comprises from 15 to 30 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

8. The siRNA molecule of claim 7, wherein the antisense strand comprises from 18 to 30 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

9. The siRNA molecule of claim 8, wherein the antisense strand comprises from 18 to 25 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

10. The siRNA molecule of any one of claims 1-9, wherein the antisense strand comprises from 18 to 21 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

11 . The siRNA molecule of claim 10, wherein the antisense strand comprises 21 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

12. The siRNA molecule of any one of claims 1-11 , wherein the antisense strand comprises 9 or fewer nucleotide mismatches relative to a region of 21 contiguous nucleobases of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408, optionally wherein the antisense strand comprises 8 or fewer, 7 or fewer, 6 or fewer, 5 or fewer, 4 or fewer, 3 or fewer, 2 or fewer, or only 1 mismatch relative to the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408.

13. The siRNA molecule of any one of claims 1-12, wherein the region of the MSH3 RNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1 -24, 40-84, 100, 118-139, 173-181 , 193- 205, 220-222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408.

14. The siRNA molecule of any one of claims 1-12, wherein the region ofthe MSH3 RNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362, optionally wherein the region of the MSH3 RNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196.

15. The siRNA molecule of claim 14, wherein the region of the MSH3 RNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362, optionally wherein the region of the MSH3 RNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

16. The siRNA molecule of any one of claims 1-15, wherein the antisense strand has a nucleic acid sequence that is at least 85% identical to the nucleic acid sequence of any one of SEQ ID NOs: 817- 1224.

17. The siRNA molecule of claim 16, wherein the antisense strand has a nucleic acid sequence that is at least 90% identical to the nucleic acid sequence of any one of SEQ ID NOs: 817-1224.

18. The siRNA molecule of claim 17, wherein the antisense strand has a nucleic acid sequence that is at least 95% identical to the nucleic acid sequence of SEQ ID NOs: 817-1224, optionally wherein the antisense strand has a nucleic acid sequence that is at least 96%, 97%, 98%, or 99% identical to the nucleic acid sequence of any one of SEQ ID NOs: 817-1224.

19. The siRNA molecule of claim 18, wherein the antisense strand has the nucleic acid sequence of any one of SEQ ID NOs: 817-1224.

20. The siRNA molecules of any one of claims 16-19, wherein the nucleic acid sequence is any one of SEQ ID NOs: 817-840, 856-900, 916, 934-955, 989-997, 1009-1021 , 1036-1038, 1056-1058, 1065- 1124, 1135-1146, 1152-1193, 1200-1202, and 1214-1224.

21. The siRNA molecule of any one of claims 16-19, wherein the nucleic acid sequence is any one of SEQ ID NOs: 821 , 833-836, 849, 858, 860, 918, 919, 921 , 922, 924, 925, 929, 942, 945, 946, 974, 977, 982, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1033, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134, optionally wherein the nucleic acid sequence is any one of SEQ ID NOs: 834- 836, 918, 921 , 922, 924, 925, 999, 1010, and 1012.

22. The siRNA molecule of claim 21 , wherein the nucleic acid sequence is any one of SEQ ID NOs: 821 , 833-836, 858, 860, 942, 945, 946, 974, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134, optionally wherein the nucleic acid sequence is any one of SEQ ID NOs: 834-836, 999, 1010, and 1012.

23. The siRNA molecule of any one of claims 1 -22, wherein the sense strand has a nucleic acid sequence that is at least 85% identical to the nucleic acid sequence of any one of SEQ ID NOs: 409- 816.

24. The siRNA molecule of claim 23, wherein the sense strand has a nucleic acid sequence that is at least 90% identical to the nucleic acid sequence of any one of SEQ ID NOs: 409-816.

25. The siRNA molecule of claim 24, wherein the sense strand has a nucleic acid sequence that is at least 95% identical to the nucleic acid sequence of SEQ ID NOs: 409-816, optionally wherein the sense strand has a nucleic acid sequence that is at least 96%, 97%, 98%, or 99% identical to the nucleic acid sequence of any one of SEQ ID NOs: 409-816.

26. The siRNA molecule of claim 25, wherein the sense strand has the nucleic acid sequence of any one of SEQ ID NOs: 409-816.

27. The siRNA molecule of any one of claims 23-26, wherein the nucleic acid sequence is any one of SEQ ID NOs: 409-432, 448-492, 508, 526-547, 581-589, 601-613, 628-630, 648-650, 657-716, 727- 738, 744-785, 792-794, and 806-816.

28. The siRNA molecule of any one of claims 23-26, wherein the nucleic acid sequence is any one of SEQ ID NOs: 413, 425-428, 441 , 450, 452, 510, 511 , 513, 514, 516, 517, 521 , 534, 537, 538, 566, 569, 574, 585, 591 , 601 , 602, 604, 605, 610, 617, 625, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770, optionally wherein the nucleic acid sequence is any one of SEQ ID NOs: 426-428, 513, 516, 514, 516, 517, 521 , 591 , 602, and 604.

29. The siRNA molecule of claim 28, wherein the nucleic acid sequence is any one of SEQ ID NOs: 413, 425-428, 450, 452, 534, 537, 538, 566, 585, 591 , 601 , 602, 604, 605, 610, 617, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770, optionally wherein the nucleic acid sequence is any one of SEQ ID NOs: 426-428, 591 , 602, and 604.

30. The siRNA molecule of any one of claims 1-29, wherein the antisense strand comprises a structure represented by Formula I, wherein Formula I is, in the 5’-to-3’ direction:

A-B-(A’)j-C-P2-D-P1-(C’-P1)k-C’

Formula I; wherein A is represented by the formula C-P1-D-P1; each A’ is represented by the formula C-P2-D-P2;

B is represented by the formula C-P2-D-P2-D-P2-D-P2; each C is a 2’-O-methyl (2’-O-Me) ribonucleoside; each C’, independently, is a 2’-O-Me ribonucleoside or a 2’-fluoro (2’-F) ribonucleoside; each D is a 2’-F ribonucleoside; each P1 is a phosphorothioate internucleoside linkage; each P2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7; and k is an integer from 1 to 7.

31 . The siRNA molecule of claim 30, wherein the antisense strand comprises a structure represented by Formula A1 , wherein Formula A1 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-B-O-B-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-S-A-S-A-S-A-S-B-S-A

Formula A1 ; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

32. The siRNA molecule of any one of claims 1-29, wherein the antisense strand comprises a structure represented by Formula II, wherein Formula II is, in the 5’-to-3’ direction:

A-B-(A’)j-C-P2-D-P1-(C-P1)k-C’

Formula II; wherein A is represented by the formula C-P1-D-P1; each A’ is represented by the formula C-P2-D-P2;

B is represented by the formula C-P2-D-P2-D-P2-D-P2; each C is a 2’-O-methyl (2’-O-Me) ribonucleoside; each C’, independently, is a 2’-O-Me ribonucleoside or a 2’-fluoro (2’-F) ribonucleoside; each D is a 2’-F ribonucleoside; each P1 is a phosphorothioate internucleoside linkage; each P2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7; and k is an integer from 1 to 7.

33. The siRNA molecule of claim 32, wherein the antisense strand comprises a structure represented by Formula A2, wherein Formula A2 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-B-O-B-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-S-A-S-A-S-A-S-A-S-A

Formula A2; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

34. The siRNA molecule of any one of claims 1-33, wherein the sense strand comprises a structure represented by Formula III, wherein Formula III is, in the 5’-to-3’ direction:

E-(A’)m-F

Formula III; wherein E is represented by the formula (C-P1)2;

F is represented by the formula (C-P2)3-D-P1-C-P1-C, (C-P2)3-D-P2-C-P2-C, (C-P2)3-D-P1-C-P1-D, or (C-P2)3-D-P2-C-P2-D;

A’, C, D, P1, and P2 are as defined in Formula II; and m is an integer from 1 to 7.

35. The siRNA molecule of claim 34, wherein the sense strand comprises a structure represented by Formula S1 , wherein Formula S1 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-S-A-S-A

Formula S1 ; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

36. The siRNA molecule of claim 34, wherein the sense strand comprises a structure represented by Formula S2, wherein Formula S2 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-O-A-O-A

Formula S2; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

37. The siRNA molecule of claim 34, wherein the sense strand comprises a structure represented by Formula S3, wherein Formula S3 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-S-A-S-B

Formula S3; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

38. The siRNA molecule of claim 34, wherein the sense strand comprises a structure represented by Formula S4, wherein Formula S4 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-O-A-O-B

Formula S4; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

39. The siRNA molecule of any one of claims 1-32 and 34-38, wherein the antisense strand comprises a structure represented by Formula IV, wherein Formula IV is, in the 5’-to-3’ direction:

A-(A’)j-C-P2-B-(C-P1)k-C’

Formula IV; wherein A is represented by the formula C-P1-D-P1; each A’ is represented by the formula C-P2-D-P2;

B is represented by the formula D-P1-C-P1-D-P1; each C is a 2’-O-Me ribonucleoside; each C’, independently, is a 2’-O-Me ribonucleoside or a 2’-F ribonucleoside; each D is a 2’-F ribonucleoside; each P1 is a phosphorothioate internucleoside linkage; each P2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7; and k is an integer from 1 to 7.

40. The siRNA molecule of claim 39, wherein the antisense strand comprises a structure represented by Formula A3, wherein Formula A3 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-S-A-S-B-S-A-S-A-S-A Formula A3; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

41. The siRNA molecule of any one of claims 1-33, 39, and 40, wherein the sense strand comprises a structure represented by Formula V, wherein Formula V is, in the 5’-to-3’ direction:

E-(A’)m-C-P2-F

Formula V; wherein E is represented by the formula (C-P1)2;

F is represented by the formula D-P1-C-P1-C, D-P2-C-P2-C, D-P1-C-P1-D, or D-P2-C-P2-D;

A’, C, D, P1 and P2 are as defined in Formula IV; and m is an integer from 1 to 7.

42. The siRNA molecule of claim 41 , wherein the sense strand comprises a structure represented by Formula S5, wherein Formula S5 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-S-A-S-A

Formula S5; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

43. The siRNA molecule of claim 41 , wherein the sense strand comprises a structure represented by

Formula S6, wherein Formula S6 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A

Formula S6; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

44. The siRNA molecule of claim 41 , wherein the sense strand comprises a structure represented by

Formula S7, wherein Formula S7 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-S-A-S-B

Formula S7; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

45. The siRNA molecule of claim 41 , wherein the sense strand comprises a structure represented by

Formula S8, wherein Formula S8 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B

Formula S8; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

46. The siRNA molecule of any one of claims 1-32, 34-38 and 41-45, wherein the antisense strand comprises a structure represented by Formula VI, wherein Formula VI is, in the 5’-to-3’ direction: A-Bj-E-Bk-E-F-Gi-D-P1-C’

Formula VI; wherein A is represented by the formula C-P1-D-P1; each B is represented by the formula C-P2; each C is a 2’-O-Me ribonucleoside; each O’, independently, is a 2’-O-Me ribonucleoside or a 2’-F ribonucleoside; each D is a 2’-F ribonucleoside; each E is represented by the formula D-P2-C-P2;

F is represented by the formula D-P1-C-P1; each G is represented by the formula C-P1; each P1 is a phosphorothioate internucleoside linkage; each P2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7; k is an integer from 1 to 7; and

I is an integer from 1 to 7.

47. The siRNA molecule of claim 46, wherein the antisense strand comprises a structure represented by Formula A4, wherein Formula A4 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-A-O-A-O-B-O-A-O-A-O-A-O-A-O-A-O-A-O-A-O-B-O-A-O-B-S-A-S-A-S-A-S-B-S-A

Formula A4; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

48. The siRNA molecule of any one of claims 1 -33, 39, 40, 46, and 47, wherein the sense strand comprises a structure represented by Formula VII, wherein Formula VII is, in the 5’-to-3’ direction:

H-Bm-ln-A’-Bo-H-C

Formula VII; wherein A’ is represented by the formula C-P2-D-P2; each H is represented by the formula (C-P1)2; each I is represented by the formula (D-P2);

B, C, D, P1 and P2 are as defined in Formula VI; m is an integer from 1 to 7; n is an integer from 1 to 7; and o is an integer from 1 to 7.

49. The siRNA molecule of claim 48, wherein the sense strand comprises a structure represented by Formula S9, wherein Formula S9 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-A-O-A-O-B-O-B-O-B-O-A-O-B-O-A-O-A-O-A-O-A-S-A-S-A

Formula S9; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

50. The siRNA molecule of any one of claims 1-49, wherein the antisense strand further comprises a 5’ phosphorus stabilizing moiety at the 5’ end of the antisense strand.

51 . The siRNA molecule of any one of claims 1 -50, wherein the sense strand further comprises a 5’ phosphorus stabilizing moiety at the 5’ end of the sense strand.

52. The siRNA molecule of claim 50 or 51 , wherein each 5’ phosphorus stabilizing moiety is, independently, represented by any one of Formulas IX-XVI:

Formula XIII Formula XIV Formula XV Formula XVI wherein Nuc represents a nucleobase selected from the group consisting of adenine, uracil, guanine, thymine, and cytosine, and R represents an optionally substituted alkyl, optionally substituted alkenyl, optionally substituted alkynyl, phenyl, benzyl, hydroxy, or hydrogen.

53. The siRNA molecule of claim 52, wherein the nucleobase is an adenine, uracil, guanine, thymine, or cytosine.

54. The siRNA molecule of any one of claims 50-53, wherein the 5’ phosphorus stabilizing moiety is (E)-vinylphosphonate represented by Formula XI.

55. The siRNA molecule of any one of claims 1-54, wherein the siRNA molecule further comprises a hydrophobic moiety at the 5’ or the 3’ end of the siRNA molecule.

56. The siRNA molecule of claim 55, wherein the hydrophobic moiety is selected from a group consisting of cholesterol, vitamin D, or tocopherol.

57. The siRNA molecule of any one of claims 1-56, wherein the length of the sense strand is between 12 and 30 nucleotides.

58. The siRNA molecule of any one of claims 1-57, wherein the siRNA molecule is a branched siRNA molecule.

59. The siRNA molecule of claim 58, wherein the branched siRNA molecule is di-branched, tribranched, ortetra-branched.

60. The siRNA molecule of claim 59, wherein the siRNA molecule is a di-branched siRNA molecule, optionally wherein the di-branched siRNA molecule is represented by any one of Formulas XVII-XIX:

Formula XVII; Formula XVIII; Formula XIX; wherein each RNA is, independently, an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety.

61 . The siRNA molecule of claim 59, wherein the siRNA molecule is a tri-branched siRNA molecule, optionally wherein the tri-branched siRNA molecule is represented by any one of Formulas XX-XXIII: Formula XX; Formula XXI; Formula XXII; Formula XXIII; wherein each RNA is, independently, an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety.

62. The siRNA molecule of claim 59, wherein the siRNA molecule is a tetra-branched siRNA molecule, optionally wherein the tetra-branched siRNA molecule is represented by any one of Formulas XXI V-XXVI II:

Formula XXIV; Formula XXV; Formula XXVI; Formula XXVII; Formula XXVIII: wherein each RNA is, independently, an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety.

63. The siRNA molecule of any one of claims 60-62, wherein the linker is selected from a group consisting of one or more contiguous subunits of an ethylene glycol, alkyl, carbohydrate, block copolymer, peptide, RNA, and DNA.

64. The siRNA molecule of claim 63, wherein the one or more contiguous subunits is 2 to 20 contiguous subunits.

65. A pharmaceutical composition comprising the siRNA molecule of any one of claims 1 -64 and a pharmaceutically acceptable excipient, carrier, or diluent.

66. A method of delivering an siRNA molecule to a subject diagnosed as having a microsatellite repeat expansion disorder, the method comprising administering a therapeutically effective amount of the siRNA molecule of any one of claims 1 -64 or the pharmaceutical composition of claim 65 to the subject.

67. A method of treating a microsatellite repeat expansion disorder in a subject in need thereof, the method comprising administering a therapeutically effective amount of the siRNA molecule of any one of claims 1-64 or the pharmaceutical composition of claim 65 to the subject.

68. The method of claim 67, wherein the microsatellite repeat expansion disorder is Huntington’s disease.

69. The method of claim 67, wherein the microsatellite repeat expansion disorder is a spinocerebellar ataxia.

70. The method of claim 67, wherein the microsatellite repeat expansion disorder is a Fragile X syndrome.

71 . The method of claim 67, wherein the microsatellite repeat expansion disorder is a myotonic dystrophy.

72. A method of reducing MSH3 expression in a subject in need thereof, the method comprising administering a therapeutically effective amount of the siRNA molecule of any one of claims 1 -64 or the pharmaceutical composition of claim 65 to the subject.

73. The method of any one of claims 66-72, wherein the siRNA molecule or the pharmaceutical composition is administered to the subject by way of intracerebroventricular, intrastriatal, intraparenchymal, or intrathecal injection.

74. The method of any one of claims 66-73, wherein the siRNA molecule or the pharmaceutical composition is administered to the subject by way of intravenous, intramuscular, or subcutaneous injection.

75. The method of any one of claims 66-74, wherein the subject is a human.

76. A kit comprising the siRNA molecule of any one of claims 1-64, or the pharmaceutical composition of claim 65, and a package insert, wherein the package insert instructs a user of the kit to perform the method of any one of claims 66-75.

Description:
COMPOSITIONS AND METHODS FOR TREATMENT OF MICROSATELLITE DNA EXPANSION DISORDERS

Sequence Listing

The instant application contains a Sequence Listing which has been submitted electronically in XML format and is hereby incorporated by reference in its entirety. Said XML copy, created on May 15, 2023, is named “51436-034WO2_Sequence_Listing_5_15_23” and is 1 ,089,536 bytes in size.

Technical Field

This disclosure relates to small interfering RNA (siRNA) molecules, and compositions containing the same, that target RNA transcripts (e.g., mRNA) of a MutS Homolog 3 (MSH3) gene. The disclosure further describes methods for silencing of MSH3 and the treatment of diseases that may benefit from the silencing of MSH3 (e.g., diseases caused by pathological microsatellite DNA expansions, such as Huntington’s Disease and myotonic dystrophy type 1 , among others) by delivering MS/73-targeting siRNA molecules to a target tissue of a subject in need.

Background

MSH3 (MutS Homolog 3) encodes a protein that is important in the DNA mismatch repair system that may play a role in the age of onset and rate of progression of diseases caused by pathological microsatellite DNA expansions, including Huntington’s Disease and myotonic dystrophy type 1 . Recent studies show that individuals with a loss of function MSH3 mutation have delayed onset of Huntington's disease compared to individuals with normal forms of the gene. Currently, there are no treatments that can alter the course of Huntington’s Disease. Accordingly, there is a need for therapeutics capable of selectively diminishing MSH3 activity in a manner that provides effective treatment for Huntington’s Disease or other MS/73-related diseases or disorders.

Summary of the Invention

The present disclosure provides compositions and methods for reduction of MutS homolog 3 (MSH3) expression by way of small interfering RNA (siRNA)-mediated silencing of MSH3 transcripts. The compositions and methods provide the benefit of exhibiting high selectivity toward MSH3 over other genes.

The siRNA molecules of the disclosure can be used to silence the MSH3 gene, thereby preventing the translation of the corresponding mRNA transcript and reducing MSH3 protein expression. This reduction of MSH3 levels thus prevents disease onset or progression, as lengthening of DNA microsatellite repeats to pathological lengths depends on MSH3 function. The siRNA molecules of the disclosure can be delivered directly to a subject in need of MSH3 silencing by way of, for example, injection intrathecally, intracerebroventricularly, intrastriatally, intraparenchymally, intra-cisterna magna injection, such as by catheterization, intravenous injection, subcutaneous injection, or intramuscular injection. In an aspect, the disclosure provides an siRNA molecule containing an antisense strand and sense strand having complementarity to the antisense strand. The antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs 1-24, 40-84, 100, 118-139, 173-181 , 193-205, 220-222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

The antisense strand may be, for example, from 10 to 50 nucleotides in length (e.g., from 10 to 45 nucleotides in length, from 10 to 40 nucleotides in length, from 10 to 35 nucleotides in length, from 10 to 30 nucleotides in length, from 10 to 29 nucleotides in length, from 10 to 28 nucleotides in length, from 10 to 27 nucleotides in length, from 10 to 26 nucleotides in length, from 10 to 25 nucleotides in length, from 10 to 24 nucleotides in length, from 10 to 23 nucleotides in length, from 10 to 22 nucleotides in length, from 10 to 21 nucleotides in length, or from 10 to 20 nucleotides in length). In some embodiments, the antisense strand is 10 nucleotides in length, 11 nucleotides in length, 12 nucleotides in length, 13 nucleotides in length, 14 nucleotides in length, 15 nucleotides in length, 16 nucleotides in length, 17 nucleotides in length, 18 nucleotides in length, 19 nucleotides in length, 20 nucleotides in length, 21 nucleotides in length, 22 nucleotides in length, 23 nucleotides in length, 24 nucleotides in length, 25 nucleotides in length, 26 nucleotides in length, 27 nucleotides in length, 28 nucleotides in length, 29 nucleotides in length, 30 nucleotides in length, or more.

In some embodiments, the antisense strand has at least 70% complementarity to the region within the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1- 408. In some embodiments, the region within the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region within the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region within the MSH3 transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118-139, 173-181 , 193-205, 220-222, 240-242, 249-308, 319-330, 336- 377, 384-386, and 398-408. In some embodiments, the region within the MSH3 transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region within the MSH3 transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

In some embodiments, the antisense strand has at least 75% complementarity to the region within the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1- 408. For example, the antisense strand may have at least 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% complementarity to the region within the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. In some embodiments, the region within the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region within the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region within the MSH3 transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118-139, 173-181 , 193-205, 220-222, 240- 242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region within the MSH3 transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region within the MSH3 transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

In some embodiments, the antisense strand has at least 10, at least 11 , at least 12, at least 13, at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21 , at least 22, at least 23, at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, or 30 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118-139, 173-181 , 193-205, 220-222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196. In some embodiments, the antisense strand has from 10 to 30 contiguous nucleotides (e.g., 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, or 30 contiguous nucleotides) that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118-139, 173-181 , 193-205, 220-222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

In some embodiments, the antisense strand has from 12 to 30 contiguous nucleotides (e.g., 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, or 30 contiguous nucleotides) that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118-139, 173-181 , 193-205, 220-222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

In some embodiments, the antisense strand has from 15 to 30 contiguous nucleotides (e.g., 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, or 30 contiguous nucleotides) that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118- 139, 173-181 , 193-205, 220-222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

In some embodiments, the antisense strand has from 18 to 30 contiguous nucleotides (e.g., 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, or 30 contiguous nucleotides) that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118- 139, 173-181 , 193-205, 220-222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

In some embodiments, the antisense strand has from 18 to 25 contiguous nucleotides (e.g., 18, 19, 20, 21 , 22, 23, 24, or 25 contiguous nucleotides) that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID Nos: 1-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17- 20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118-139, 173-181 , 193-205, 220- 222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196. In some embodiments, the antisense strand has from 18 to 21 contiguous nucleotides (e.g., 18, 19, 20, or 21 contiguous nucleotides) that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103,

105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118-139, 173-181 , 193-205, 220-222, 240- 242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105,

106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

In some embodiments, the antisense strand has 21 contiguous nucleotides that are fully complementary to a contiguous polynucleotide segment of equal length within the region of the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118- 139, 173-181 , 193-205, 220-222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

In some embodiments, the antisense strand has 9 or fewer nucleotide mismatches relative to a region of 21 contiguous nucleobases of the MSH3 RNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408, optionally wherein the antisense strand contains 8 or fewer, 7 or fewer, 6 or fewer, 5 or fewer, 4 or fewer, 3 or fewer, 2 or fewer, or only 1 mismatch relative to the region of the MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1- 408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 1-24, 40-84, 100, 118-139, 173-181 , 193-205, 220-222, 240-242, 249-308, 319-330, 336-377, 384-386, and 398-408. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 102, 105, 106, 108, 109, 113, 183, 194, and 196. In some embodiments, the region of the MSH3 mRNA transcript has the nucleic acid sequence of any one of SEQ ID NOs: 18-20, 183, 194, and 196.

In some embodiments, the antisense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 817-1224. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 821 , 833-836, 849, 858, 860, 918, 919, 921 , 922, 924, 925, 929, 942, 945, 946, 974, 977, 982, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1033, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 821 , 833-836, 858, 860, 942, 945, 946, 974, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 817-840, 856-900, 916, 934-955, 989-997, 1009-1021 , 1036-1038, 1056-1058, 1065-1124, 1135-1146, 1152-1193, 1200-1202, and 1214-1224. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 834-836, 918, 921 , 922, 924, 925, 999, 1010, and 1012. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 834-836, 999, 1010, and 1012.

In some embodiments, the antisense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 817-1224. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 821 , 833-836, 849, 858, 860, 918, 919, 921 , 922, 924, 925, 929, 942, 945, 946, 974, 977, 982, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1033, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 821 , 833-836, 858, 860, 942, 945, 946, 974, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 817-840, 856-900, 916, 934-955, 989-997, 1009- 1021 , 1036-1038, 1056-1058, 1065-1124, 1135-1146, 1152-1193, 1200-1202, and 1214-1224. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 834-836, 918, 921 , 922, 924, 925, 999, 1010, and 1012. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 834-836, 999, 1010, and 1012.

In some embodiments, the antisense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 817-1224, optionally wherein the antisense strand has a nucleic acid sequence that is at least 96%, 97%, 98%, or 99% identical to the nucleic acid sequence of any one of SEQ ID NOs: 817-1224. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 821 , 833-836, 849, 858, 860, 918, 919, 921 , 922, 924, 925, 929, 942, 945, 946, 974, 977, 982, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1033, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 821 , 833-836, 858, 860, 942, 945, 946, 974, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 817-840, 856-900, 916, 934-955, 989-997, 1009-1021 , 1036-1038, 1056-1058, 1065- 1124, 1135-1146, 1152-1193, 1200-1202, and 1214-1224. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 834-836, 918, 921 , 922, 924, 925, 999, 1010, and 1012. In some embodiments, the antisense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 834-836, 999, 1010, and 1012.

In some embodiments, the antisense strand has the nucleic acid sequence of any one of SEQ ID NOs: 817-1224. In some embodiments, the antisense strand has the nucleic acid sequence of any one of SEQ ID NOs: 821 , 833-836, 849, 858, 860, 918, 919, 921 , 922, 924, 925, 929, 942, 945, 946, 974, 977, 982, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1033, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134. In some embodiments the antisense strand has the nucleic acid sequence of any one of SEQ ID NOs: 821 , 833-836, 858, 860, 942, 945, 946, 974, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134. In some embodiments, the antisense strand has the nucleic acid sequence of any one of SEQ ID NOs: 817- 840, 856-900, 916, 934-955, 989-997, 1009-1021 , 1036-1038, 1056-1058, 1065-1124, 1135-1 146, 1152-1193, 1200-1202, and 1214-1224. In some embodiments, the antisense strand has the nucleic acid sequence of any one of SEQ ID NOs: 834-836, 918, 921 , 922, 924, 925, 999, 1010, and 1012. In some embodiments, the antisense strand has the nucleic acid of any one of SEQ ID NOs: 834-836, 999, 1010, and 1012.

In some embodiments, the sense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 409-816. In some embodiments, the sense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 413, 425-428, 441 , 450, 452, 510, 511 , 513, 514, 516, 517, 521 , 534, 537, 538, 566, 569, 574, 585, 591 , 601 , 602, 604, 605, 610, 617, 625, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. In some embodiments, the sense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 413, 425-428, 450, 452, 534, 537, 538, 566, 585, 591 , 601 , 602, 604, 605, 610, 617, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. In some embodiments, the sense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 409-432, 448-492, 508, 526-547, 581-589, 601-613, 628-630, 648-650, 657- 716, 727-738, 744-785, 792-794, and 806-816. In some embodiments, the sense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 426-428, 510, 513, 514, 516, 517, 521 , 591 , 602, and 604. In some embodiments, the sense strand has a nucleic acid sequence that is at least 85% identical (e.g., 85%, 86%, 87%, 88%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 426-428, 591 , 602, and 604.

In some embodiments, the sense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 409-816. In some embodiments, the sense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 413, 425-428, 441 , 450, 452, 510, 511 , 513, 514, 516, 517, 521 , 534, 537, 538, 566, 569, 574, 585, 591 , 601 , 602, 604, 605, 610, 617, 625, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. In some embodiments, the sense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 413, 425-428, 450, 452, 534, 537, 538, 566, 585, 591 , 601 , 602, 604, 605, 610, 617, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. In some embodiments, the sense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 409-432, 448-492, 508, 526-547, 581-589, 601-613, 628-630, 648-650, 657- 716, 727-738, 744-785, 792-794, and 806-816. In some embodiments, the sense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 426-428, 510, 513, 514, 516, 517, 521 , 591 , 602, and 604. In some embodiments, the sense strand has a nucleic acid sequence that is at least 90% identical (e.g., 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 426-428, 591 , 602, and 604.

In some embodiments, the sense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 409-816, optionally wherein the sense strand has a nucleic acid sequence that is at least 96%, 97%, 98%, or 99% identical to the nucleic acid sequence of any one of SEQ ID NOs: 409- 816. In some embodiments, the sense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 413, 425-428, 441 , 450, 452, 510, 511 , 513, 514, 516, 517, 521 , 534, 537, 538, 566, 569, 574, 585, 591 , 601 , 602, 604, 605, 610, 617, 625, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. In some embodiments, the sense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 413, 425-428, 450, 452, 534, 537, 538, 566, 585, 591 , 601 , 602, 604, 605, 610, 617, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. In some embodiments, the sense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 409-432, 448-492, 508, 526-547, 581-589, 601-613, 628-630, 648-650, 657-716, 727-738, 744-785, 792-794, and 806-816. In some embodiments, the sense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 426-428, 510, 513, 514, 516, 517, 521 , 591 , 602, and 604. In some embodiments, the sense strand has a nucleic acid sequence that is at least 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 426-428, 591 , 602, and 604.

In some embodiments, the siRNA molecule has a sense strand having the nucleic acid sequence of any one of SEQ ID NOs: 409-816. In some embodiments, the sense strand has a nucleic acid sequence of any one of SEQ ID NOs: 413, 425-428, 441 , 450, 452, 510, 511 , 513, 514, 516, 517, 521 , 534, 537, 538, 566, 569, 574, 585, 591 , 601 , 602, 604, 605, 610, 617, 625, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. In some embodiments, the sense strand has a nucleic acid sequence of any one of SEQ ID NOs: 413, 425-428, 450, 452, 534, 537, 538, 566, 585, 591 , 601 , 602, 604, 605, 610, 617, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. In some embodiments, the siRNA molecule has a sense strand having the nucleic acid sequence of any one of SEQ ID NOs: 409-432, 448-492, 508, 526-547, 581-589, 601-613, 628-630, 648-650, 657-716, 727- 738, 744-785, 792-794, and 806-816. In some embodiments, the sense strand has the nucleic acid sequence of any one of SEQ ID NOs: 426-428, 510, 513, 514, 516, 517, 521 , 591 , 602, and 604. In some embodiments, the sense strand has the nucleic acid sequence of any one of SEQ ID NOs: 426- 428, 591 , 602, and 604.

In some embodiments, the antisense strand has a structure represented by Formula I, wherein Formula I is, in the 5’-to-3’ direction:

A-B-(A’)j-C-P 2 -D-P 1 -(C’-P 1 )k-C’

Formula I; wherein A is represented by the formula C-P 1 -D-P 1 ; each A’ is represented by the formula C-P 2 -D-P 2 ;

B is represented by the formula C-P 2 -D-P 2 -D-P 2 -D-P 2 ; each C is a 2’-O-methyl (2’-O-Me) ribonucleoside; each C’, independently, is a 2’-O-Me ribonucleoside or a 2’-fluoro (2’-F) ribonucleoside; each D is a 2’-F ribonucleoside; each P 1 is a phosphorothioate internucleoside linkage; each P 2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and k is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7).

In some embodiments, the antisense strand has a structure represented by Formula A1 , wherein Formula A1 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-B-O-B-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-B-S-A-S-A-S-A-S-B-S-A

Formula A1 ; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the antisense strand has a structure represented by Formula II, wherein Formula II is, in the 5’-to-3’ direction:

A-B-(A’)j-C-P 2 -D-P 1 -(C-P 1 )k-C’

Formula II; wherein A is represented by the formula C-P 1 -D-P 1 ; each A’ is represented by the formula C-P 2 -D-P 2 ;

B is represented by the formula C-P 2 -D-P 2 -D-P 2 -D-P 2 ; each C is a 2’-O-methyl (2’-O-Me) ribonucleoside; each C’, independently, is a 2’-O-Me ribonucleoside or a 2’-fluoro (2’-F) ribonucleoside; each D is a 2’-F ribonucleoside; each P 1 is a phosphorothioate internucleoside linkage; each P 2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and k is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7).

In some embodiments, antisense strand has a structure represented by Formula A2, wherein Formula A2 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-B-O-B-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-B-S-A-S-A-S-A-S-A-S-A

Formula A2; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the sense strand has a structure represented by Formula III, wherein Formula III is, in the 5’-to-3’ direction:

E-(A’)m-F Formula III; wherein E is represented by the formula (C-P 1 )2;

F is represented by the formula (C-P 2 ) 3 -D-P 1 -C-P 1 -C, (C-P 2 ) 3 -D-P 2 -C-P 2 -C, (C-P 2 ) 3 -D-P 1 -C-P 1 -D, or (C-P 2 ) 3 -D-P 2 -C-P 2 -D;

A’, C, D, P 1 , and P 2 are as defined in Formula II; and m is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7).

In some embodiments, the sense strand has a structure represented by Formula S1 , wherein Formula S1 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-S-A -S-A

Formula S1 ; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the sense strand has a structure represented by Formula S2, wherein Formula S2 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-O-A -O-A

Formula S2; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the sense strand has a structure represented by Formula S3, wherein Formula S3 is, in the 5’-to-3’ direction: A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-S-A-S- B

Formula S3; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the sense strand has a structure represented by Formula S4, wherein

Formula S4 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-O-A -O-B

Formula S4; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the antisense strand has a structure represented by Formula IV, wherein Formula IV is, in the 5’-to-3’ direction:

A-(A’)j-C-P 2 -B-(C-P 1 )k-C’

Formula IV; wherein A is represented by the formula C-P 1 -D-P 1 ; each A’ is represented by the formula C-P 2 -D-P 2 ;

B is represented by the formula D-P 1 -C-P 1 -D-P 1 ; each C is a 2’-O-Me ribonucleoside; each C’, independently, is a 2’-O-Me ribonucleoside or a 2’-F ribonucleoside; each D is a 2’-F ribonucleoside; each P 1 is a phosphorothioate internucleoside linkage; each P 2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and k is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7).

In some embodiments, the antisense strand has a structure represented by Formula A3, wherein Formula A3 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-B-S-A-S-B-S-A-S-A-S-A

Formula A3; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the sense strand has a structure represented by Formula V, wherein

Formula V is, in the 5’-to-3’ direction: E-(A’) m -C-P 2 -F

Formula V; wherein E is represented by the formula (C-P 1 )2;

F is represented by the formula D-P 1 -C-P 1 -C, D-P 2 -C-P 2 -C, D-P 1 -C-P 1 -D, or D-P 2 -C-P 2 -D;

A’, C, D, P 1 and P 2 are as defined in Formula IV; and m is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7).

In some embodiments, the sense strand has a structure represented by Formula S5, wherein

Formula S5 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-S-A -S-A

Formula S5; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the sense strand has a structure represented by Formula S6, wherein Formula S6 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-A

Formula S6; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the sense strand has a structure represented by Formula S7, wherein Formula S7 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-S-A -S-B

Formula S7; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the sense strand has a structure represented by Formula S8, wherein Formula S8 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-B

Formula S8; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage. In some embodiments, the antisense strand has a structure represented by Formula VI, wherein Formula VI is, in the 5’-to-3’ direction:

A-Bj-E-Bk-E-F-Gi-D-P 1 -C’

Formula VI; wherein A is represented by the formula C-P 1 -D-P 1 ; each B is represented by the formula C-P 2 ; each C is a 2’-O-Me ribonucleoside; each O’, independently, is a 2’-O-Me ribonucleoside or a 2’-F ribonucleoside; each D is a 2’-F ribonucleoside; each E is represented by the formula D-P 2 -C-P 2 ;

F is represented by the formula D-P 1 -C-P 1 ; each G is represented by the formula C-P 1 ; each P 1 is a phosphorothioate internucleoside linkage; each P 2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); k is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and

I is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7).

In some embodiments, the antisense strand has a structure represented by Formula A4, wherein Formula A4 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-A-O-A-O-B-O-A-O-A-O-A-O-A-O-A-O-A-O-A-O-B-O-A -O-B-S-A-S-A-S-A-S-B-S-A

Formula A4; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the sense strand has a structure represented by Formula VII, wherein Formula VII is, in the 5’-to-3’ direction:

H-Bm-ln-A’-Bo-H-C

Formula VII; wherein A’ is represented by the formula C-P 2 -D-P 2 ; each H is represented by the formula (C-P 1 )2; each I is represented by the formula (D-P 2 );

B, C, D, P 1 and P 2 are as defined in Formula VI; m is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); n is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and o is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7). In some embodiments, the sense strand has a structure represented by Formula S9, wherein Formula S9 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-A-O-A-O-B-O-B-O-B-O-A-O-B-O-A-O-A-O-A-O-A-S-A -S-A

Formula S9; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments, the antisense strand also has a 5’ phosphorus stabilizing moiety at the 5’ end of the antisense strand.

In some embodiments, the sense strand also has a 5’ phosphorus stabilizing moiety at the 5’ end of the sense strand.

In some embodiments, each 5’ phosphorus stabilizing moiety is, independently, represented by any one of Formulas IX, XX, XI, XII, XIII, XIV, XV, or XVI:

Formula XIII Formula XIV Formula XV Formula XVI wherein Nuc represents a nucleobase selected from the group consisting of adenine, uracil, guanine, thymine, and cytosine, and R represents an optionally substituted alkyl, optionally substituted alkenyl, optionally substituted alkynyl, phenyl, benzyl, hydroxy, or hydrogen.

In some embodiments, the nucleobase is an adenine, uracil, guanine, thymine, or cytosine.

In some embodiments, the 5’ phosphorus stabilizing moiety is (E)-vinylphosphonate represented by Formula XI.

In some embodiments, the siRNA molecule also has a hydrophobic moiety at the 5’ or the 3’ end of the siRNA molecule.

In some embodiments, the hydrophobic moiety is selected from a group consisting of cholesterol, vitamin D, or tocopherol. In some embodiments, the length of the sense strand is between 12 and 30 nucleotides (e.g., 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides).

In some embodiments, the siRNA molecule is a branched siRNA molecule.

In some embodiments, the branched siRNA molecule is di-branched, tri-branched, or tetrabranched.

In some embodiments, the siRNA molecule is di-branched, optionally wherein the di-branched siRNA molecule is represented by any one of Formulas XVII, XVIII, or XIX:

RNA RNA RNA X-L-X X-L-X

RNA-L-RNA RNA Z RNA RNA' RNA

Formula XVII; Formula XVIII; Formula XIX; wherein each RNA is, independently, an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety.

In some embodiments, the siRNA molecule is tri-branched, optionally wherein the tribranched siRNA molecule is represented by any one of Formulas XX, XXI, XXII, or XXIII: Formula XX; Formula XXI; Formula XXII; Formula XXIII; wherein each RNA is, independently, an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety.

In some embodiments, the siRNA molecule is tetra-branched, optionally wherein the tetrabranched siRNA molecule is represented by any one of Formulas XXIV, XXV, XXVI, XXVII, or XXVIII:

Formula XXIV; Formula XXV; Formula XXVI; Formula XXVII; Formula XXVIII; wherein each RNA is, independently, an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety.

In some embodiments, the linker is selected from a group consisting of one or more contiguous subunits of an ethylene glycol, alkyl, carbohydrate, block copolymer, peptide, RNA, and DNA.

In some embodiments, the one or more contiguous subunits is 2 to 20 contiguous subunits (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, or 20 contiguous subunits). In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript that is within the open reading frame of the MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript that is within the 3’-untranslated region.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 1 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1 -4.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 2 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 5-20. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 5-15.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 3 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 16-47. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 21 -47.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 4 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 48-84.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 5 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 85- 103. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 85-100.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 6 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 101- 120. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 104-120. In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 7 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 121- 133.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 8 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 134- 139.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 9 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 140- 162. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 140-154.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 10 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 155- 187. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 163-182.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 11 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 183- 200. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 188-192.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 12 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 193- 205.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 13 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 206- 223. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 206-222.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 14 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 223- 242. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 224-242.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 15 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 243- 262. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 243-255.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 16 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 256- 276. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 263-268

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 17 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 269- 296. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 277-289.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 18 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 290- 302. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 297-302.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 19 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 303- 304.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 20 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 303- 324. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 305-318.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 21 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 319- 339. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 325-336.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 22 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 337- 344. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 340-344.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 23 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 345- 358.

In some embodiments of any of the siRNA molecules described herein, the antisense strand has complementarity sufficient to hybridize to a region within exon 24 of an MSH3 mRNA transcript. In some embodiments, the antisense strand has complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 359- 408.

In a further aspect, the disclosure provides a pharmaceutical composition containing an siRNA molecule of any of the preceding aspects or embodiments of the disclosure, and a pharmaceutically acceptable excipient, carrier, or diluent.

In a further aspect, the disclosure provides a method of delivering an siRNA molecule to a subject diagnosed as having a microsatellite repeat expansion disorder by administering a therapeutically effective amount of the siRNA molecule or a pharmaceutical composition of any of the preceding aspects or embodiments of the disclosure to the subject. In some embodiments, the microsatellite repeat expansion disease is Huntington’s Disease. In some embodiments, the microsatellite repeat expansion disease is a spinocerebellar ataxia. In some embodiments, the microsatellite repeat expansion disease is a Fragile X syndrome. In some embodiments, the microsatellite repeat expansion disease is a myotonic dystrophy.

In a further aspect, the disclosure provides a method of treating a microsatellite repeat expansion disorder in a subject in need thereof by administering a therapeutically effective amount of an siRNA molecule or a pharmaceutical composition of any of the preceding aspects or embodiments of the disclosure to the subject. In some embodiments, the microsatellite repeat expansion disease is Huntington’s disease.

In another aspect, the disclosure provides a method of reducing MSH3 expression in a subject in need thereof by administering a therapeutically effective amount of an siRNA or pharmaceutical composition of any of the preceding aspects or embodiments of the disclosure.

In some embodiments, the siRNA molecule or the pharmaceutical composition is administered to the subject by way of intracerebroventricular, intrastriatal, intraparenchymal or intrathecal injection. In some embodiments, the siRNA molecule or the pharmaceutical composition is administered to the subject by way of intravenous, intramuscular, or subcutaneous injection.

In some embodiments, the subject is a human.

In another aspect, the disclosure provides a kit containing an siRNA molecule or pharmaceutical composition of any of the preceding aspects or embodiments of the disclosure, and a package insert that instructs a user of the kit to perform the method of any of the preceding aspects or embodiments of the disclosure.

Brief Description of the Figures

FIG. 1 is a graph showing reduction in MSH3 protein in FVB mice that were administered siRNA molecules of the disclosure. The siRNA identifiers are shown in Table 6, below. Protein was quantified in the motor cortex, 28 days following di-siRNA treatment at 1 nmol or 5 nmol dose levels.

FIG. 2 is a graph showing reduction in MSH3 protein in FVB mice that were administered siRNA molecules of the disclosure. The siRNA identifiers are shown in Table 6, below. Protein was quantified in the striatum, 28 days following di-siRNA treatment at 1 nmol or 5 nmol dose levels.

FIG. 3 is a graph showing reduction in MSH3 protein in FVB mice that were administered siRNA molecules of the disclosure. The siRNA identifiers are shown in Table 6, below. Protein was quantified in the hippocampus, 28 days following di-siRNA treatment at 1 nmol or 5 nmol dose levels.

FIG. 4 is a graph showing reduction in MSH3 mRNA in FVB mice that were administered siRNA molecules of the disclosure. The siRNA identifiers are shown in Table 6, below. mRNA was quantified in the motor cortex, 28 days following di-siRNA treatment at 1 nmol or 5 nmol dose levels.

FIG. 5 is a graph showing reduction in MSH3 mRNA in FVB mice that were administered siRNA molecules of the disclosure. The siRNA identifiers are shown in Table 6, below. mRNA was quantified in the striatum, 28 days following di-siRNA treatment at 1 nmol or 5 nmol dose levels. FIG. 6 is a graph showing reduction in MSH3 mRNA in FVB mice that were administered siRNA molecules of the disclosure. The siRNA identifiers are shown in Table 6, below. mRNA was quantified in the hippocampus, 28 days following di-siRNA treatment at 1 nmol or 5 nmol dose levels.

FIG. 7 is a graph showing reduction in MSH3 protein in FVB mice that were administered siRNA molecules of the disclosure. The siRNA identifiers are shown in Table 6, below. Protein was quantified in the motor cortex, striatum, and hippocampus, 28 days following di-siRNA treatment at a 5 nmol dose level.

FIG. 8 is a graph showing reduction in MSH3 mRNA in FVB mice that were administered siRNA molecules of the disclosure. The siRNA identifiers are shown in Table 6, below. mRNA was quantified in the motor cortex, striatum, and hippocampus, 28 days following di-siRNA treatment at a 5 nmol dose level.

FIG. 9 is a graph showing reduction in MSH3 protein in FVB mice that were administered siRNA molecules of the disclosure. The siRNA identifiers are shown in Table 6, below. Protein was quantified in the motor cortex, striatum, and hippocampus, 28 days following di-siRNA treatment at 0.5 nmol, 1 nmol, 2.5 nmol, 5 nmol, and 20 nmol dose levels.

Definitions

Unless otherwise defined herein, scientific, and technical terms used herein have the meanings that are commonly understood by those of ordinary skill in the art. In the event of any latent ambiguity, definitions provided herein take precedent over any dictionary or extrinsic definition. Unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. The use of "or" means "and/or" unless stated otherwise. The use of the term "including," as well as other forms, such as "includes" and "included," is not limiting.

As used herein, the term "nucleic acids" refers to RNA or DNA molecules consisting of a chain of ribonucleotides or deoxyribonucleotides, respectively.

As used herein, the term "therapeutic nucleic acid" refers to a nucleic acid molecule (e.g., ribonucleic acid) that has partial or complete complementarity to, and interacts with, a disease- associated target mRNA and mediates silencing of expression of the mRNA.

As used herein, the term "carrier nucleic acid" refers to a nucleic acid molecule (e.g., ribonucleic acid) that has sequence complementarity with, and hybridizes with, a therapeutic nucleic acid. As used herein, the term "3' end" refers to the end of the nucleic acid that contains an unmodified hydroxyl group at the 3' carbon of the ribose ring.

As used herein, the term "nucleoside" refers to a molecule made up of a heterocyclic base and its sugar.

As used herein, the term "nucleotide" refers to a nucleoside having a phosphate group, or a variant thereof, on its 3' or 5' sugar hydroxyl group. Examples of phosphate group variants include, but are not limited to, saturated alkyl phosphonates, unsaturated alkenyl phosphonates, phosphorothioates, and phosphoramidites.

In the context of this disclosure, the term "oligonucleotide" refers to an oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or mimetics thereof. This term includes oligonucleotides composed of naturally-occurring nucleobases, sugars and covalent internucleoside (backbone) linkages as well as oligonucleotides having non-naturally-occurring (e.g., modified) portions that function similarly. Such modified or substituted oligonucleotides are often preferred over native forms because of desirable properties such as, for example, enhanced cellular uptake, enhanced affinity for nucleic acid target and increased stability in the presence of nucleases.

As used herein, the term "siRNA" refers to small interfering RNA duplexes that induce the RNA interference (RNAi) pathway. siRNA molecules may vary in length (generally, between 10 and 30 base pairs) and may contain varying degrees of complementarity to their target mRNA. The term "siRNA" includes duplexes of two separate strands, as well as single strands that optionally form hairpin structures including a duplex region.

As used herein, the term "antisense strand" refers to the strand of the siRNA duplex that contains some degree of complementarity to the target gene.

As used herein, the term "sense strand" refers to the strand of the siRNA duplex that contains complementarity to the antisense strand.

The term “interfering RNA molecule” refers to an RNA molecule, such as a small interfering RNA (siRNA), microRNA (miRNA), short hairpin RNA (shRNA), or an antisense oligonucleotide (ASO) that suppresses the endogenous function of a target RNA transcript.

As used herein, the terms "express" and “expression” refer to one or more of the following events: (1) production of an RNA template from a DNA sequence (e.g., by transcription); (2) processing of an RNA transcript (e.g., by splicing, editing, 5' cap formation, and/or 3' end processing); and (3) translation of an RNA into a polypeptide or protein. In the context of a gene that encodes a protein product, the terms “gene expression” and the like are used interchangeably with the terms “protein expression” and the like. A change in expression of a gene or protein of interest in a patient can be detected, for example, by: a change in the quantity or concentration of mRNA encoding corresponding protein (as assessed, e.g., using RNA detection procedures described herein or known in the art, such as quantitative polymerase chain reaction (qPCR) and RNA-seq techniques), a change in the quantity or concentration of the corresponding protein (as assessed, e.g., using protein detection methods described herein or known in the art, such as enzyme-linked immunosorbent assays (ELISA), among others), and/or a change in the activity of the corresponding protein (e.g., in the case of an enzyme, as assessed using an enzymatic activity assay described herein or known in the art) in a sample obtained from the patient. As used herein, a cell is considered to “express” a gene or protein of interest if one or more, or all, of the above events can be detected in the cell or in a medium in which the cell resides. For example, a gene or protein of interest is considered to be “expressed” by a cell or population of cells if one can detect (i) production of a corresponding RNA transcript, such as an mRNA template, by the cell or population of cells (e.g., using RNA detection procedures described herein); (ii) processing of the RNA transcript (e.g., splicing, editing, 5’ cap formation, and/or 3’ end processing, such as using RNA detection procedures described herein); (iii) translation of the RNA template into a protein product (e.g., using protein detection procedures described herein); and/or (iv) post-translational modification of the protein product (e.g., using protein detection procedures described herein). As used herein, the terms “target,” “targeting,” and “targeted,” in the context of the design of an siRNA, refers to generating an antisense strand so as to anneal the antisense strand to a region within the mRNA transcript of interest in a manner that results in a reduction in translation of the mRNA into the protein product.

As used herein, the terms "chemically modified nucleotide," "nucleotide analog," "altered nucleotide," and "modified nucleotide" refer to a non-standard nucleotide, including non-naturally occurring ribonucleotides or deoxyribonucleotides. Exemplary nucleotide analogs are modified at any position so as to alter certain chemical properties of the nucleotide yet retain the ability of the nucleotide analog to perform its intended function.

As used herein, the term "metabolically stabilized" refers to RNA molecules that contain ribonucleotides that have been chemically modified in order to decrease the rate of metabolism of an RNA molecule that is administered to a subject. Exemplary modifications include 2’-hydroxy to 2’-O- methoxy or 2’-fluoro, and phosphodiester to phosphorothioate.

As used herein, the term "phosphorothioate" refers to a phosphate group of a nucleotide that is modified by substituting one or more of the oxygens of the phosphate group with sulfur.

As used herein, the terms “internucleoside linkage,” “internucleoside bond,” and the like refer to the bonds between nucleosides in a nucleic acid molecule.

As used herein, the term "antagomirs" refers to nucleic acids that can function as inhibitors of miRNA activity.

As used herein, the term "gapmers" refers to chimeric antisense nucleic acids that contain a central block of deoxynucleotide monomers sufficiently long to induce RNase H cleavage. The deoxynucleotide block is flanked by ribonucleotide monomers or ribonucleotide monomers containing modifications.

As used herein, the term "mixmers" refers to nucleic acids that contain a mix of locked nucleic acids (LNAs) and DNA.

As used herein, the term "guide RNAs" refers to nucleic acids that have sequence complementarity to a specific sequence in the genome immediately or 1 base pair upstream of the protospacer adjacent motif (PAM) sequence as used in CRISPR/Cas9 gene editing systems. Alternatively, “guide RNAs” may refer to nucleic acids that have sequence complementarity (e.g., are antisense) to a specific messenger RNA (mRNA) sequence. In this context, a guide RNA may also have sequence complementarity to a “passenger RNA” sequence of equal or shorter length, which is identical or substantially identical to the sequence of mRNA to which the guide RNA hybridizes.

As used herein, the term “branched siRNA” refers to a compound containing two or more double-stranded siRNA molecules covalently bound to one another. Branched siRNA molecules may be “di-branched,” also referred to herein as “di-siRNA,” wherein the siRNA molecule includes 2 siRNA molecules covalently bound to one another, e.g., by way of a linker. Branched siRNA molecules may be “tri-branched,” also referred to herein as “tri-siRNA,” wherein the siRNA molecule includes 3 siRNA molecules covalently bound to one another, e.g., by way of a linker. Branched siRNA molecules may be “tetra-branched,” also referred to herein as “tetra-siRNA,” wherein the siRNA molecule includes 4 siRNA molecules covalently bound to one another, e.g., by way of a linker. As used herein, the term “branch point moiety” refers to a chemical moiety of a branched siRNA structure of the disclosure that may be covalently linked to a 5’ end or a 3’ end of an antisense strand or a sense strand of an siRNA molecule and which may support the attachment of additional single- or double-stranded siRNA molecules. Non-limiting examples of branch point moieties suitable for use in conjunction with the disclosed methods and compositions include, e.g., phosphoroamidite, tosylated solketal, 1 ,3-diaminopropanol, pentaerythritol, and any one of the branch point moieties described in US 10,478,503.

The term “phosphate moiety” as used herein, refers to a terminal phosphate group that includes phosphates as well as modified phosphates. The phosphate moiety may be located at either terminus but is preferred at the 5'-terminal nucleoside. In one aspect, the terminal phosphate is unmodified having the formula — O — P(=O)(OH)OH. In another aspect, the terminal phosphate is modified such that one or more of the O and OH groups are replaced with H, O, S, N(R’) or alkyl where R’ is H, an amino protecting group or unsubstituted or substituted alkyl. In some embodiments, the 5' and or 3' terminal group may include from 1 to 3 phosphate moieties that are each, independently, unmodified (di- or tri-phosphates) or modified.

As used herein, the term “5' phosphorus stabilizing moiety” refers to a terminal phosphate group that includes phosphates as well as modified phosphates (e.g., phosphorothioates, phosphodiesters, phosphonates). The phosphate moiety may be located at either terminus but is preferred at the 5'-terminal nucleoside. In one aspect, the terminal phosphate is unmodified having the formula -O-P(=O)(OH)OH. In another aspect, the terminal phosphate is modified such that one or more of the O and OH groups are replaced with H, O, S, N(R’), or alkyl where R’ is H, an amino protecting group, or unsubstituted or substituted alkyl. In some embodiments, the 5' and or 3' terminal group may include from 1 to 3 phosphate moieties that are each, independently, unmodified (di- or triphosphates) or modified.

The phosphate group of the nucleotide may also be modified, e.g., by substituting one or more of the oxygens of the phosphate group with sulfur (e.g., phosphorothioates), or by making other substitutions which allow the nucleotide to perform its intended function such as described in, for example, Eckstein, Antisense Nucleic Acid Drug Dev. 10:117-21 , 2000; Rusckowski et al., Antisense Nucleic Acid Drug Dev. 10:333-45, 2000; Stein, Antisense Nucleic Acid Drug Dev. 11 :317-25, 2001 ; Vorobjev et al., Antisense Nucleic Acid Drug Dev. 11 :77-85, 2001 ; and US 5,684,143. Certain of the above- referenced modifications (e.g., phosphate group modifications) preferably decrease the rate of hydrolysis of, for example, polynucleotides including said analogs in vivo or in vitro.

As used herein, the term “complementary” refers to two nucleotides that form canonical Watson-Crick base pairs. For the avoidance of doubt, Watson-Crick base pairs in the context of the present disclosure include adenine-thymine, adenine-uracil, and cytosine-guanine base pairs. A proper Watson-Crick base pair is referred to in this context as a “match,” while each unpaired nucleotide, and each incorrectly paired nucleotide, is referred to as a “mismatch.” Alignment for purposes of determining percent nucleic acid sequence complementarity can be achieved in various ways that are within the capabilities of one of skill in the art, for example, using publicly available computer software such as BLAST, BLAST-2, or Megalign software. “Percent (%) sequence complementarity” with respect to a reference polynucleotide sequence is defined as the percentage of nucleic acids in a candidate sequence that are complementary to the nucleic acids in the reference polynucleotide sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence complementarity. A given nucleotide is considered to be “complementary” to a reference nucleotide as described herein if the two nucleotides form canonical Watson-Crick base pairs. For the avoidance of doubt, Watson-Crick base pairs in the context of the present disclosure include adenine-thymine, adenine-uracil, and cytosine-guanine base pairs. A proper Watson-Crick base pair is referred to in this context as a “match,” while each unpaired nucleotide, and each incorrectly paired nucleotide, is referred to as a “mismatch.” Alignment for purposes of determining percent nucleic acid sequence complementarity can be achieved in various ways that are within the capabilities of one of skill in the art, for example, using publicly available computer software such as BLAST, BLAST-2, or Megalign software. Those skilled in the art can determine appropriate parameters for aligning sequences, including any algorithms needed to achieve maximal complementarity over the full length of the sequences being compared. As an illustration, the percent sequence complementarity of a given nucleic acid sequence, A, to a given nucleic acid sequence, B, (which can alternatively be phrased as a given nucleic acid sequence, A that has a certain percent complementarity to a given nucleic acid sequence, B) is calculated as follows:

100 multiplied by (the fraction X/Y) where X is the number of complementary base pairs in an alignment (e.g., as executed by computer software, such as BLAST) in that program’s alignment of A and B, and where Y is the total number of nucleic acids in B. It will be appreciated that where the length of nucleic acid sequence A is not equal to the length of nucleic acid sequence B, the percent sequence complementarity of A to B will not equal the percent sequence complementarity of B to A. As used herein, a query nucleic acid sequence is considered to be “completely complementary” to a reference nucleic acid sequence if the query nucleic acid sequence has 100% sequence complementarity to the reference nucleic acid sequence.

“Percent (%) sequence identity” with respect to a reference polynucleotide or polypeptide sequence is defined as the percentage of nucleic acids or amino acids in a candidate sequence that are identical to the nucleic acids or amino acids in the reference polynucleotide or polypeptide sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent nucleic acid or amino acid sequence identity can be achieved in various ways that are within the capabilities of one of skill in the art, for example, using publicly available computer software such as BLAST, BLAST-2, or Megalign software. Those skilled in the art can determine appropriate parameters for aligning sequences, including any algorithms needed to achieve maximal alignment over the full length of the sequences being compared. For example, percent sequence identity values may be generated using the sequence comparison computer program BLAST. As an illustration, the percent sequence identity of a given nucleic acid or amino acid sequence, A, to, with, or against a given nucleic acid or amino acid sequence, B, (which can alternatively be phrased as a given nucleic acid or amino acid sequence, A that has a certain percent sequence identity to, with, or against a given nucleic acid or amino acid sequence, B) is calculated as follows:

100 multiplied by (the fraction X/Y) where X is the number of nucleotides or amino acids scored as identical matches by a sequence alignment program (e.g., BLAST) in that program’s alignment of A and B, and where Y is the total number of nucleic acids in B. It will be appreciated that where the length of nucleic acid or amino acid sequence A is not equal to the length of nucleic acid or amino acid sequence B, the percent sequence identity of A to B will not equal the percent sequence identity of B to A.

The term “complementarity sufficient to hybridize,” as used herein, refers to a nucleic acid sequence or a portion thereof that need not be fully complementary (e.g., 100% complementary) to a target region or a nucleic acid sequence or a portion thereof that has one or more nucleotide mismatches relative to the target region but that is still capable of hybridizing to the target region under specified conditions. For example, the nucleic acid may be, e.g., 95% complementary, 90%, complementary, 85% complementary, 80% complementary, 75% complementary, 70% complementary, 65% complementary, 60% complementary, 55% complementary, 50% complementary, or less, but still form sufficient base pairs with the target so as to hybridize across its length. A sequence that has complementarity sufficient to hybridize to a target region or nucleic acid region within a specific exon includes a target region contained entirely within the specific exon or a target region that includes one or more nucleotides within the specific exon but also crosses an exonexon junction.

“Hybridization” or “annealing” of nucleic acids is achieved when one or more nucleoside residues within a polynucleotide base pairs with one or more complementary nucleosides to form a stable duplex. The base pairing is typically driven by hydrogen bonding events. Hybridization includes Watson-Crick base pairs formed from natural and/or modified nucleobases. The hybridization can also include non-Watson-Crick base pairs, such as wobble base pairs (guanosine-uracil, hypoxanthineuracil, hypoxanthine-adenine, and hypoxanthine-cytosine) and Hoogsteen base pairs. Nucleic acids need not be 100% complementary to undergo hybridization. For example, one nucleic acid may be, e.g., 95% complementary, 90%, complementary, 85% complementary, 80% complementary, 75% complementary, 70% complementary, 65% complementary, 60% complementary, 55% complementary, 50% complementary, or less, relative to another nucleic acid, but the two nucleic acids may still form sufficient base pairs with one another so as to hybridize.

The "stable duplex" formed upon the annealing/hybridization of one nucleic acid to another is a duplex structure that is not denatured by a stringent wash. Exemplary stringent wash conditions are known in the art and include temperatures of about 5° C less than the melting temperature of an individual strand of the duplex and low concentrations of monovalent salts, such as monovalent salt concentrations (e.g., NaCI concentrations) of less than 0.2 M (e.g., 0.2 M, 0.19 M, 0.18 M, 0.17 M, 0.16 M, 0.15 M, 0.14 M, 0.13 M, 0.12 M, 0.11 M, 0.1 M, 0.09 M, 0.08 M, 0.07 M, 0.06 M, 0.05 M, 0.04 M, 0.03 M, 0.02 M, 0.01 M, or less).

The term “gene silencing” refers to the suppression of gene expression, e.g., endogenous gene expression of MSH3, which may be mediated through processes that affect transcription and/or through processes that affect post-transcriptional mechanisms. In some embodiments, gene silencing occurs when an RNAi molecule initiates the inhibition or degradation of the mRNA transcribed from a gene of interest in a sequence-specific manner by way of RNA interference, thereby preventing translation of the gene's product.

The phrase “overactive disease driver gene,” as used herein, refers to a gene having increased activity and/or expression that contributes to or causes a disease state in a subject (e.g., a human). The disease state may be caused or exacerbated by the overactive disease driver gene directly or by way of an intermediate gene(s).

As used herein, the term "ethylene glycol chain" refers to a carbon chain with the formula ((CH 2 OH) 2 ).

As used herein, “alkyl” refers to a saturated hydrocarbon group. Alkyl groups may be acyclic or cyclic and contain only C and H when unsubstituted. When an alkyl residue having a specific number of carbons is named, all geometric isomers having that number of carbons are intended to be encompassed and described; thus, for example, “butyl” is meant to include n-butyl, sec-butyl, and isobutyl. Examples of alkyl include ethyl, propyl, butyl, pentyl, hexyl, heptyl, octyl, cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl, cyclooctyl, and the like. In some embodiments, alkyl may be substituted. Suitable substituents that may be introduced into an alkyl group include, for example, hydroxy, alkoxy, amino, alkylamino, and halo, among others.

As used herein, “alkenyl” refers to an acyclic or cyclic unsaturated hydrocarbon group having at least one site of olefinic unsaturation (i.e., having at least one moiety of the formula C=C). Alkenyl groups contain only C and H when unsubstituted. When an alkenyl residue having a specific number of carbons is named, all geometric isomers having that number of carbons are intended to be encompassed and described; thus, for example, “butenyl” is meant to include n-butenyl, sec-butenyl, and /so-butenyl. Examples of alkenyl include -CH=CH 2 , -CH 2 -CH=CH 2 , and -CH 2 -CH=CH-CH=CH 2 . In some embodiments, alkenyl may be substituted. Suitable substituents that may be introduced into an alkenyl group include, for example, hydroxy, alkoxy, amino, alkylamino, and halo, among others.

As used herein, “alkynyl” refers to an acyclic or cyclic unsaturated hydrocarbon group having at least one site of acetylenic unsaturation (i.e., having at least one moiety of the formula CEC) . Alkynyl groups contain only C and H when unsubstituted. When an alkynyl residue having a specific number of carbons is named, all geometric isomers having that number of carbons are intended to be encompassed and described; thus, for example, “pentynyl” is meant to include n-pentynyl, secpentynyl, /so-pentynyl, and te/Y-pentynyl. Examples of alkynyl include -CECH and -CEC-CHS. In some embodiments, alkynyl may be substituted. Suitable substituents that may be introduced into an alkynyl group include, for example, hydroxy, alkoxy, amino, alkylamino, and halo, among others.

As used herein the term "phenyl" denotes a monocyclic arene in which one hydrogen atom from a carbon atom of the ring has been removed. A phenyl group may be unsubstituted or substituted with one or more suitable substituents, wherein the substituent replaces an H of the phenyl group.

As used herein, the term “benzyl” refers to monovalent radical obtained when a hydrogen atom attached to the methyl group of toluene is removed. A benzyl group generally has the formula of phenyl-CH2-. A benzyl group may be unsubstituted or substituted with one or more suitable substituents. For example, the substituent may replace an H of the phenyl component and/or an H of the methylene (-CH2-) component.

As used herein, the term "amide" refers to an alkyl, alkenyl, alkynyl, or aromatic group that is attached to an amino-carbonyl functional group.

As used herein, the term "triazole" refers to heterocyclic compounds with the formula (C2H3N3), having a five-membered ring of two carbons and three nitrogens, the positions of which can change resulting in multiple isomers.

As used herein, the term "terminal group" refers to the group at which a carbon chain or nucleic acid ends.

As used herein, an "amino acid" refers to a molecule containing amine and carboxyl functional groups and a side chain specific to the amino acid.

In some embodiments the amino acid is chosen from the group of proteinogenic amino acids. In some embodiments, the amino acid is an L-amino acid or a D-amino acid. In some embodiments, the amino acid is a synthetic amino acid (e.g., a beta-amino acid).

As used herein, the term "lipophilic amino acid" refers to an amino acid including a hydrophobic moiety (e.g., an alkyl chain or an aromatic ring).

As used herein, the term "target of delivery" refers to the organ or part of the body to which it is desired to deliver the branched oligonucleotide compositions.

As used herein, the term “between X and Y” is inclusive of the values of X and Y. For example, “between X and Y” refers to the range of values between the value of X and the value of Y, as well as the value of X and the value of Y.

As used herein, the terms “subject’ and “patient” are used interchangeably and refer to an organism, such as a mammal (e.g., a human), that experiences a neurodegenerative disease or disorder (e.g., a microsatellite repeat expansion disorder, such as Huntington’s Disease, myotonic dystrophies, or spinocerebellar ataxias) and/or contains a gain-of-function MSH3 variant allele.

As used herein, the term “MSH3" refers to the gene encoding MutS Homolog 3, including any native MSH3 gene from any source. MSH3 is a DNA mismatch repair protein that forms a heterodimer with another mismatch repair protein, MutS Homolog 2 (MSH2) to form the complex MutSp. MutSp corrects insertion/deletion loops and base-base mispairs in microsatellites during DNA synthesis or DNA strand reannealing after transcription. The term encompasses “full-length,” unprocessed MSH3 as well as any form of MSH3 that results from processing in the cell. The term also encompasses naturally occurring variants of MSH3, e.g., splice variants or allelic variants. The nucleic acid sequence of an exemplary MSH3 gene is shown in European Nucleotide Archive (ENA) Accession No. J04810.1 . The amino acid sequence of an exemplary protein encoded by an MSH3 ger\e is shown in UNIPROT™ Accession No. P20585. As used herein, the terms “microsatellite repeat expansion disorder,” “microsatellite repeat expansion disease,” “nucleotide repeat expansion disorder,” and “nucleotide repeat expansion disease” are used interchangeably to refer to any disease or disorder caused by the instability and expansion of specific microsatellites. “Microsatellites” are coding or non-coding DNA sequences that contain tandem repeats of base pairs. Exemplary microsatellite repeat expansion disorders include, but are not limited to, Fragile X syndrome, Fragile XE syndrome, Fragile X-associated tremor/ataxia syndrome, Fragile X-primary ovarian insufficiency, progressive myoclonic epilepsy type 1/Unverricht- Lundborg disease, spinocerebellar ataxia (SCA) 12, neuronal intranuclear inclusion disease, glutaminase deficiency, Huntington’s disease, SCA1 , SCA2, SCA3, SCA6, SCA7, SCA17, dentatorubral-pallidoluysian atrophy, spinal-bulbar muscular atrophy, oculopharyngeal muscular dystrophy, Huntington disease-like 2, amyotrophic lateral sclerosis, myotonic dystrophy type 2 (DM2), Friedreich ataxia, Fuchs endothelial corneal dystrophy, SCAW, SCA31 , SCA36, SCA37, cerebellar ataxia, neuropathy, and vestibular areflexia syndrome (CANVAS), benign adult familial myoclonic epilepsy, SCA8, and myotonic dystrophy type 1 (DM1). Other microsatellite repeat expansion disorders are discussed in Rodriguez et al., Neurobiology of Disease, 130:104515, 2019, the disclosure of which is incorporated herein by reference.

As used herein, the terms “treat,” “treated,” and “treating” mean both therapeutic treatment and prophylactic or preventative measures wherein the object is to prevent, ameliorate, or slow down (lessen) an undesired physiological condition, disorder, or disease, or obtain beneficial or desired clinical results. Beneficial or desired clinical results include, but are not limited to, a reduction in a patient’s reliance on pharmacological treatments; alleviation of symptoms; diminishment of the extent of a condition, disorder, or disease; stabilized (i.e., not worsening) state of condition, disorder, or disease; delay in onset or slowing of condition, disorder, or disease progression; amelioration of the condition, disorder, or disease state or remission (whether partial or total), whether detectable or undetectable; an amelioration of at least one measurable physical, cognitive, or behavioral (e.g., depressive behavior or apathy) parameter, not necessarily discernible by the patient; or enhancement or improvement of condition, disorder, or disease. Treatment includes eliciting a clinically significant response without excessive levels of side effects. Treatment also includes prolonging survival as compared to expected survival if not receiving treatment.

As used herein, the terms “benefit” and “response” are used interchangeably in the context of a subject undergoing therapy for the treatment of, for example, a microsatellite repeat expansion disease, such as Huntington’s Disease or myotonic dystrophy type 1 . For example, clinical benefits in the context of a subject having Huntington’s disease administered an siRNA molecule or siRNA composition of the disclosure include, without limitation, a reduction in involuntary movements, memory lapses, mood swings, or symptoms of anxiety and depression, and/or; a reduction in wild type MSH3 transcripts, mutant MSH3 transcripts, variant MSH3 transcripts, splice isoforms of MSH3 transcripts, and/or overexpressed MSH3 transcripts; and/or an arrest or reduction of the length of the microsatellite repeat in the HTT gene. Detailed Description

The present disclosure provides compositions of small interfering RNA (siRNA) molecules with sequence homology to a MutS Homolog 3 (MSH3) gene and methods for administering said siRNA molecules to a subject. Furthermore, the siRNA molecules described herein may be composed as branched siRNA structures, such as di-branched, tri-branched, and tetra-branched siRNA structures and may further include specific patterns of chemical modifications (e.g., 2’ ribose modifications or internucleoside linkage modifications) to improve resistance against nuclease enzymes, toxicity profile, and physicochemical properties (e.g., thermostability). Small interfering RNA molecules are short, double-stranded RNA molecules. They are capable of mediating RNA interference (RNAi) by degrading mRNA with a complementary nucleotide sequence, thus preventing the translation of the target gene.

The siRNA molecules of the disclosure may exhibit, for example, robust gene-specific suppression of MSH3, relative to other genes in the MutS family (e.g., MSH2, MSH4, MSH5, and MSH6) and other human genes.

The siRNA molecules of the disclosure may feature an antisense strand having a nucleic acid sequence that is complementary to a region of a MSH3 mRNA transcript having the nucleic acid sequence of any one of SEQ ID NOs: 1-408. The degree of complementarity of the antisense strand to the region of the MSH3 mRNA transcript may be sufficient for the antisense strand to anneal over the full length of the region of the MSH3 mRNA transcript. For example, the antisense strand may have a nucleic acid sequence that is at least 60% complementary (e.g., 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% complementary) to the region of the MSH3 mRNA transcript. In some embodiments, the region of the MSH3 RNA transcript has the sequence of any one of SEQ ID NOs: 5, 17-20, 33, 42, 44, 102, 103, 105, 106, 108, 109, 113, 126, 129, 130, 158, 161 , 166, 177, 183, 193, 194, 196, 197, 202, 209, 217, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362. In some embodiments, the region of the MSH3 RNA transcript has the sequence of any one of SEQ ID NOs: 5, 17-20, 42, 44, 126, 129, 130, 158, 177, 183, 193, 194, 196, 197, 202, 209, 231 , 271 , 273, 291 , 294, 305, 307, 317, 318, and 362.

In some embodiments, the siRNA molecules of the disclosure feature an antisense strand having the nucleic acid sequence of any one of SEQ ID NOs: 817-1224, or a nucleic acid sequence that is at least 60% identical thereto. For example, the siRNA molecules of the disclosure may feature an antisense strand having a nucleic acid sequence that is at least 60% identical (e.g., 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 817-1224. In some embodiments, the nucleic acid sequence is any one of SEQ ID NOs: 821 , 833- 836, 849, 858, 860, 918, 919, 921 , 922, 924, 925, 929, 942, 945, 946, 974, 977, 982, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1033, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134. In some embodiments, the nucleic acid sequence is any one of SEQ ID NOs: 821 , 833-836, 858, 860, 942, 945, 946, 974, 993, 999, 1009, 1010, 1012, 1013, 1018, 1025, 1047, 1087, 1089, 1107, 1110, 1121 , 1123, 1133, and 1134.

In some embodiments, the siRNA molecules of the disclosure feature a sense strand having the nucleic acid sequence of any one of SEQ ID NOs: 409-816, or a nucleic acid sequence that is at least 60% identical thereto. For example, the siRNA molecules of the disclosure may feature a sense strand having a nucleic acid sequence that is at least 60% identical (e.g., 60%, 61 %, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical) to the nucleic acid sequence of any one of SEQ ID NOs: 409-816. In some embodiments, the nucleic acid sequence is any one of SEQ ID NOs: 413, 425-428, 441 , 450, 452, 510, 511 , 513, 514, 516, 517, 521 , 534, 537, 538, 566, 569, 574, 585, 591 , 601 , 602, 604, 605, 610, 617, 625, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. In some embodiments, the nucleic acid sequence is any one of SEQ ID NOs: 413, 425-428, 450, 452, 534, 537, 538, 566, 585, 591 , 601 , 602, 604, 605, 610, 617, 639, 679, 681 , 699, 702, 713, 715, 725, 726, and 770. Exemplary siRNA molecules of the disclosure are those shown in Table 1A, below. Table 1A summarizes the antisense strands, sense strands, and corresponding regions of an MSH3 mRNA transcript that are targeted by each antisense strand.

Table 1A. Nucleotide sequences for gene-specific IMS H3-targeting siRNA

In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript that is within the open reading frame of the MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within an MSH3 mRNA transcript that is within the 3’-untranslated region.

In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 1 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 2 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 3 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 4 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 5 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 6 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 7 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 8 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 9 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 10 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 11 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 12 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 13 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 14 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 15 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 16 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 17 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 18 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 19 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 20 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 21 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 22 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 23 of an MSH3 mRNA transcript. In some embodiments, the siRNA molecules of the disclosure have complementarity sufficient to hybridize to a region within exon 24 of an MSH3 mRNA transcript.

Table 1 B, below, discloses which position of the gene in which each target region is contained (e.g., open reading frame or 3’-untranslated region) and the exon in which each target region is contained.

Table 1B. Location of mRNA sequences for gene-specific IMS H3-targeting siRNA siRNA Structure

The siRNA molecules of the disclosure may be in the form of a single-stranded (ss) or doublestranded (ds) oligonucleotide structure. In some embodiments, the siRNA molecules may be dibranched, tri-branched, ortetra-branched molecules. Furthermore, the siRNA molecules of the disclosure may contain one or more phosphodiester internucleoside linkages and/or an analog thereof, such as a phosphorothioate internucleoside linkage. The siRNA molecules of the disclosure may further contain chemically modified nucleosides having 2’ sugar modifications.

The simplest siRNAs consist of a ribonucleic acid, including a ss- or ds- structure, formed by a first strand (i.e., antisense strand), and in the case of a ds-siRNA, a second strand (i.e., sense strand). The first strand includes a stretch of contiguous nucleotides that is at least partially complementary to a target nucleic acid. The second strand also includes a stretch of contiguous nucleotides where the second stretch is at least partially identical to a target nucleic acid. The first strand and said second strand may be hybridized to each other to form a double-stranded structure. The hybridization typically occurs by Watson Crick base pairing.

Depending on the sequence of the first and second strand, the hybridization or base pairing is not necessarily complete or perfect, which means that the first and second strand are not 100% basepaired due to mismatches. One or more mismatches may also be present within the duplex without necessarily impacting the siRNA RNAi activity.

The first strand contains a stretch of contiguous nucleotides which is essentially complementary to a target nucleic acid. Typically, the target nucleic acid sequence is, in accordance with the mode of action of interfering ribonucleic acids, a ss-RNA, preferably an mRNA. Such hybridization occurs most likely through Watson Crick base pairing but is not necessarily limited thereto. The extent to which the first strand has a complementary stretch of contiguous nucleotides to a target nucleic acid sequence may be between 80% and 100%, e.g., 80%, 85%, 90%, 95%, or 100% complementary.

The siRNA molecules described herein may employ modifications to the nucleobase, phosphate backbone, ribose core, 5'- and 3'-ends, and branching, wherein multiple strands of siRNA may be covalently linked.

Lengths of Small Interfering RNA Molecules

It is within the scope of the disclosure that any length, known and previously unknown in the art, may be employed for the current invention. As described herein, potential lengths for an antisense strand of the siRNA molecules of the present disclosure is between 10 and 30 nucleotides (e.g., 10 nucleotides, 11 nucleotides, 12 nucleotides, 13 nucleotides, 14 nucleotides, 15 nucleotides, 16 nucleotides, 17 nucleotides, 18 nucleotides, 19 nucleotides, 20 nucleotides, 21 nucleotides, 22 nucleotides, 23 nucleotides, 24 nucleotides, 25 nucleotides, 26 nucleotides, 27 nucleotides, 28 nucleotides, 29 nucleotides, or 30 nucleotides), 15 and 25 nucleotides (e.g., 15 nucleotides, 16 nucleotides, 17 nucleotides, 18 nucleotides, 19 nucleotides, 20 nucleotides, 21 nucleotides, 22 nucleotides, 23 nucleotides, 24 nucleotides, or 25 nucleotides), or 18 and 23 nucleotides (e.g., 18 nucleotides, 19 nucleotides, 20 nucleotides, 21 nucleotides, 22 nucleotides, or 23 nucleotides). In some embodiments, the antisense strand is 20 nucleotides. In some embodiments, the antisense strand is 21 nucleotides. In some embodiments, the antisense strand is 22 nucleotides. In some embodiments, the antisense strand is 23 nucleotides. In some embodiments, the antisense strand is 24 nucleotides. In some embodiments, the antisense strand is 25 nucleotides. In some embodiments, the antisense strand is 26 nucleotides. In some embodiments, the antisense strand is 27 nucleotides. In some embodiments, the antisense strand is 28 nucleotides. In some embodiments, the antisense strand is 29 nucleotides. In some embodiments, the antisense strand is 30 nucleotides.

In some embodiments, the sense strand of the siRNA molecules of the present disclosure is between 12 and 30 nucleotides (e.g., 12 nucleotides, 13 nucleotides, 14 nucleotides, 15 nucleotides, 16 nucleotides, 17 nucleotides, 18 nucleotides, 19 nucleotides, 20 nucleotides, 21 nucleotides, 22 nucleotides, 23 nucleotides, 24 nucleotides, 25 nucleotides, 26 nucleotides, 27 nucleotides, 28 nucleotides, 29 nucleotides, or 30 nucleotides), or 14 and 23 nucleotides (e.g., 14 nucleotides, 15 nucleotides, 16 nucleotides, 17 nucleotides, 18 nucleotides, 19 nucleotides, 20 nucleotides, 21 nucleotides, 22 nucleotides, or 23 nucleotides). In some embodiments, the sense strand is 15 nucleotides. In some embodiments, the sense strand is 16 nucleotides. In some embodiments, the sense strand is 17 nucleotides. In some embodiments, the sense strand is 18 nucleotides. In some embodiments, the sense strand is 19 nucleotides. In some embodiments, the sense strand is 20 nucleotides. In some embodiments, the sense strand is 21 nucleotides. In some embodiments, the sense strand is 22 nucleotides. In some embodiments, the sense strand is 23 nucleotides. In some embodiments, the sense strand is 24 nucleotides. In some embodiments, the sense strand is 25 nucleotides. In some embodiments, the sense strand is 26 nucleotides. In some embodiments, the sense strand is 27 nucleotides. In some embodiments, the sense strand is 28 nucleotides. In some embodiments, the sense strand is 29 nucleotides. In some embodiments, the sense strand is 30 nucleotides.

2' Sugar Modifications

The present disclosure may include ss- and ds- siRNA molecule compositions including at least one (e.g., at least 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , or more) nucleosides having 2’ sugar modifications. Possible 2'-modifications include all possible orientations of OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or unsubstituted C1 to C10 alkyl or C2 to C10 alkenyl and alkynyl. In some embodiments, the modification includes a 2’-O-methyl (2’-O-Me) modification. Other potential sugar substituent groups include: C1 to C10 lower alkyl, substituted lower alkyl, alkenyl, alkynyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH 3 , OCN, Cl, Br, CN, CF 3 , OCF 3 , SOCH 3 , SO 2 CH 3 , ONO 2 , NO 2 , N 3 , NH 2 , heterocycloalkyl, heterocycloalkaryl, aminoalkylamino, polyalkylamino, substituted silyl, a group for improving the pharmacokinetic properties of an oligonucleotide, or a group for improving the pharmacodynamic properties of an oligonucleotide, and other substituents having similar properties. In some embodiments, the modification includes 2'-methoxyethoxy (2'-O-CH 2 CH 2 OCH 3 , also known as 2'-0-(2-methoxyethyl) or 2'-MOE). In some embodiments, the modification includes 2'- dimethylaminooxyethoxy, i.e., a O(CH 2 ) 2 ON(CH 3 ) 2 group, also known as 2'-DMAOE, and 2'- dimethylaminoethoxyethoxy (also known in the art as 2'-0-dimethylamino-ethoxy-ethyl or 2'- DMAEOE), i.e., 2'-O-CH 2 OCH 2 N(CH 3 ) 2 . Other potential sugar substituent groups include, e.g., aminopropoxy (-OCH 2 CH 2 CH 2 NH 2 ), allyl (-CH 2 -CH=CH 2 ), -O-allyl (-O-CH 2 -CH=CH 2 ) and fluoro (F). 2'- sugar substituent groups may be in the arabino (up) position or ribo (down) position. In some embodiments, the 2'-arabino modification is 2'-F. Similar modifications may also be made at other positions on the siRNA molecule, particularly the 3' position of the sugar on the 3' terminal nucleoside or in 2'-5' linked oligonucleotides and the 5' position of 5' terminal nucleotide. Oligonucleotides may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar.

Nucleobase Modifications

The siRNA molecules of the disclosure may also include nucleosides or other surrogate or mimetic monomeric subunits that include a nucleobase (often referred to in the art simply as "base" or "heterocyclic base moiety"). The nucleobase is another moiety that has been extensively modified or substituted and such modified and or substituted nucleobases are amenable to the present disclosure. As used herein, "unmodified" or "natural" nucleobases include the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified nucleobases also referred herein as heterocyclic base moieties include other synthetic and natural nucleobases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl (-C=C-CH3) uracil and cytosine and other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-th iouracil, 8-halo, 8-amino, 8- thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5- bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7- methyladenine, 2-F-adenine, 2-amino-adenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Nucleobases may also include those in which the purine or pyrimidine base is replaced with other heterocycles, for example 7-deaza- adenine, 7-deazaguanosine, 2-aminopyridine and 2-pyridone. Further nucleobases include those disclosed in US 3,687,808, those disclosed in Kroschwitz, J. I., ed. The Concise Encyclopedia of Polymer Science and Engineering, New York, John Wiley & Sons, 1990, pp. 858-859; those disclosed by Englisch et al., Angewandte Chemie, International Edition 30:613, 1991 ; and those disclosed by Sanghvi, Y.S., Chapter 16, Antisense Research and Applications, CRC Press, Gait, M.J. ed., 1993, pp. 289-302. The siRNA molecules of the present disclosure may also include polycyclic heterocyclic compounds in place of one or more heterocyclic base moieties. A number of tricyclic heterocyclic compounds have been previously reported. These compounds are routinely used in antisense applications to increase the binding properties of the modified strand to a target strand.

Representative cytosine analogs that make three hydrogen bonds with a guanosine in a second strand include 1 ,3-diazaphenoxazine-2-one (Kurchavov et al., Nucleosides and Nucleotides, 16:1837-46, 1997), 1 ,3-diazaphenothiazine-2-one (Lin et al. Am. Chem. Soc., 117:3873-4, 1995), and 6,7,8,9-tetrafluoro-l,3-diazaphenoxazine-2-one (Wang et al., Tetrahedron Lett., 39:8385-8, 1998). Incorporated into oligonucleotides, these base modifications were shown to hybridize with complementary guanine and the latter was also shown to hybridize with adenine and to enhance helical thermal stability by extended stacking interactions (also see US 10/155,920 and US 10/013,295, both of which are herein incorporated by reference in their entirety). Further helixstabilizing properties have been observed when a cytosine analog/substitute has an aminoethoxy moiety attached to the rigid 1 ,3-diazaphenoxazine-2-one scaffold (Lin et al., Am. Chem. Soc., 120:8531-2, 1998).

Internucleoside Linkage Modifications

Another variable in the design of the present disclosure is the internucleoside linkage making up the phosphate backbone of the siRNA molecule. Although the natural RNA phosphate backbone may be employed here, derivatives thereof may be used which enhance desirable characteristics of the siRNA molecule. Although not limiting, of particular importance in the present disclosure is protecting parts, or the whole, of the siRNA molecule from hydrolysis. One example of a modification that decreases the rate of hydrolysis is phosphorothioates. Any portion or the whole of the backbone may contain phosphate substitutions (e.g., phosphorothioates). For instance, the internucleoside linkages may be between 0 and 100% phosphorothioate, e.g., between 0 and 100%, 10 and 100%, 20 and 100%, 30 and 100%, 40 and 100%, 50 and 100%, 60 and 100%, 70 and 100%, 80 and 100%, 90 and 100%, 0 and 90%, 0 and 80%, 0 and 70%, 0 and 60%, 0 and 50%, 0 and 40%, 0 and 30%, 0 and 20%, 0 and 10%, 10 and 90%, 20 and 80%, 30 and 70%, 40 and 60%, 10 and 40%, 20 and 50%, 30 and 60%, 40 and 70%, 50 and 80%, or 60 and 90% phosphorothioate linkages. Similarly, the internucleoside linkages may be between 0 and 100% phosphodiester linkages, e.g., between 0 and 100%, 10 and 100%, 20 and 100%, 30 and 100%, 40 and 100%, 50 and 100%, 60 and 100% 70 and 100%, 80 and 100%, 90 and 100%, 0 and 90%, 0 and 80%, 0 and 70%, 0 and 60%, 0 and 50%, 0 and 40%, 0 and 30%, 0 and 20%, 0 and 10%, 10 and 90%, 20 and 80%, 30 and 70%, 40 and 60%, 10 and 40%, 20 and 50%, 30 and 60%, 40 and 70%, 50 and 80%, or 60 and 90% phosphodiester linkages.

Specific examples of some potential siRNA molecules useful in this invention include oligonucleotides containing modified e.g., non-naturally occurring internucleoside linkages. As defined in this specification, oligonucleotides having modified internucleoside linkages include internucleoside linkages that retain a phosphorus atom and internucleoside linkages that do not have a phosphorus atom. For the purposes of this specification, and as sometimes referenced in the art, modified oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can also be considered to be oligonucleosides. A preferred phosphorus containing modified internucleoside linkage is the phosphorothioate internucleoside linkage. In some embodiments, the modified oligonucleotide backbones containing a phosphorus atom therein include, for example, phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3'-alkylene phosphonates, 5'-alkylene phosphonates, phosphinates, phosphoramidates including 3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates, and boranophosphates having normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity wherein one or more internucleotide linkages is a 3' to 3', 5' to 5' or 2' to 2' linkage. Exemplary U.S. patents describing the preparation of phosphorus-containing linkages include but are not limited to, U.S. Pat. Nos. 3,687,808; 4,469,863; 4,476,301 ; 5,023,243; 5,177,195; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321 ,131 ; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821 ; 5,541 ,316; 5,550,111 ; 5,563,253; 5,571 ,799; 5,587,361 ; 5,625,050; 6,028,188; 6,124,445; 6,160,109; 6,169,170; 6,172,209; 6,239,265; 6,277,603; 6,326,199; 6,346,614; 6,444,423; 6,531 ,590; 6,534,639; 6,608,035; 6,683,167; 6,858,715; 6,867,294; 6,878,805; 7,015,315; 7,041 ,816; 7,273,933; 7,321 ,029; and U.S. Pat. RE39464, the entire contents of each of which are hereby incorporated herein by reference.

In some embodiments, the modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These include those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; riboacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component parts. Non-limiting examples of U.S. patents that teach the preparation of non-phosphorus backbones include, but are not limited to, U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141 ; 5,235,033; 5,64,562; 5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541 ,307; 5,561 ,225; 5,596,086; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439, the entire contents of each of which are hereby incorporated herein by reference.

Patterns of Modifications of siRNA Molecules

The following section provides a set of exemplary scaffolds into which the siRNA molecules of the disclosure may be incorporated.

In some embodiments of the disclosure, the siRNA may contain an antisense strand including a region represented by Formula I, wherein Formula I is, in the 5’-to-3’ direction

A-B-(A’)j-C-P 2 -D-P 1 -(C’-P 1 )k-C’ Formula I; wherein A is represented by the formula C-P 1 -D-P 1 ; each A’ is represented by the formula C-P 2 -D-P 2 ; B is represented by the formula C-P 2 -D-P 2 -D-P 2 -D-P 2 ; each C is a 2’-O-methyl (2’-O-Me) ribonucleoside; each C’, independently, is a 2’-O-Me ribonucleoside or a 2’-fluoro (2’-F) ribonucleoside; each D is a 2’-F ribonucleoside; each P 1 is a phosphorothioate internucleoside linkage; each P 2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and k is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7). In some embodiments, j is 4. In some embodiments, k is 4. In some embodiments, j is 4 and k is 4. The antisense is complementary (e.g., fully or partially complementary) to a target nucleic acid sequence.

In some embodiments, the antisense strand includes a structure represented by Formula A1 , wherein Formula A1 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-B-O-B-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-B-S-A-S-A-S-A-S-B-S-A

Formula A1 ; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the siRNA may contain an antisense strand including a region represented by Formula II, wherein Formula II is, in the 5’-to-3’ direction:

A-B-(A’)j-C-P 2 -D-P 1 -(C-P 1 )k-C’ Formula II; wherein A is represented by the formula C-P 1 -D-P 1 ; each A’ is represented by the formula C-P 2 -D-P 2 ; B is represented by the formula C-P 2 -D-P 2 -D-P 2 -D-P 2 ; each C is a 2’-O-methyl (2’-O-Me) ribonucleoside; each C’, independently, is a 2’-O-Me ribonucleoside or a 2’-fluoro (2’-F) ribonucleoside; each D is a 2’-F ribonucleoside; each P 1 is a phosphorothioate internucleoside linkage; each P 2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and k is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7). In some embodiments, j is 4. In some embodiments, k is 4. In some embodiments, j is 4 and k is 4. The antisense is complementary (e.g., fully or partially complementary) to a target nucleic acid sequence.

In some embodiments of the disclosure, the antisense strand includes a structure represented by Formula A2, wherein Formula A2 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-B-O-B-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-B-S-A-S-A-S-A-S-A-S-A

Formula A2; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula III, wherein Formula III is, in the 5’-to-3’ direction:

E-(A’)m-F Formula III; wherein E is represented by the formula (C-P 1 )2; F is represented by the formula (C-P 2 )3-D-P 1 -C-P 1 -C, (C-P 2 ) 3 -D-P 2 -C-P 2 -C, (C-P 2 ) 3 -D-P 1 -C-P 1 -D, or (C-P 2 ) 3 -D-P 2 -C-P 2 -D; A’, C, D, P 1 , and P 2 are as defined in Formula I; and m is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7). In some embodiments, m is 4. The sense strand is complementary (e.g., fully or partially complementary) to the antisense strand.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula S1 , wherein Formula S1 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-S-A -S-A

Formula S1 ; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula S2, wherein Formula S2 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-O-A -O-A

Formula S2; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula S3, wherein Formula S3 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-S-A -S-B Formula S3; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula S4, wherein Formula S4 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-A-O-A-O-B-O-A -O-B

Formula S4; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the siRNA may contain an antisense strand including a region represented by Formula IV, wherein Formula IV is, in the 5’-to-3’ direction:

A-(A’)j-C-P 2 -B-(C-P 1 )k-C’ Formula IV; wherein A is represented by the formula C-P 1 -D-P 1 ; each A’ is represented by the formula C-P 2 -D-P 2 ; B is represented by the formula D-P 1 -C-P 1 -D-P 1 ; each C is a 2’-O-Me ribonucleoside; each C’, independently, is a 2’-O-Me ribonucleoside or a 2’-F ribonucleoside; each D is a 2’-F ribonucleoside; each P 1 is a phosphorothioate internucleoside linkage; each P 2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and k is an integer from 1 to7 (e.g., 1 , 2, 3, 4, 5, 6, or 7). In some embodiments, j is 6. In some embodiments, k is 4. In some embodiments, j is 6 and k is 4. The antisense strand is complementary (e.g., fully or partially complementary) to a target nucleic acid.

In some embodiments of the disclosure, the antisense strand includes a structure represented by Formula A3, wherein Formula A3 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-B-S-A-S-B-S-A-S-A-S-A Formula A3; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the siRNA of the disclosure may have a sense strand represented by Formula V, wherein Formula V is, in the 5’-to-3’ direction:

E-(A’) m -C-P 2 -F

Formula V; wherein E is represented by the formula (C-P 1 )2; F is represented by the formula D-P 1 -C-P 1 -C, D-P 2 - C-P 2 -C, D-P 1 -C-P 1 -D, or D-P 2 -C-P 2 -D; A’, C, D, P 1 , and P 2 are as defined in Formula IV; and m is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7). In some embodiments, m is 5. The sense strand is complementary (e.g., fully or partially complementary) to the antisense strand.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula S5, wherein Formula S5 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-S-A -S-A

Formula S5; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula S6, wherein Formula S6 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-A

Formula S6; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula S7, wherein Formula S7 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-S-A -S-B

Formula S7; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula S8, wherein Formula S8 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A-O-B-O-A -O-B

Formula S8; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the siRNA may contain an antisense strand including a region represented by Formula VI, wherein Formula VI is, in the 5’-to-3’ direction: A-Bj-E-B k -E-F-Gi-D-P 1 -C’

Formula VI; wherein A is represented by the formula C-P 1 -D-P 1 ; each B is represented by the formula C-P 2 ; each C is a 2’-O-Me ribonucleoside; each O’, independently, is a 2’-O-Me ribonucleoside or a 2’-F ribonucleoside; each D is a 2’-F ribonucleoside; each E is represented by the formula D-P 2 -C-P 2 ; F is represented by the formula D-P 1 -C-P 1 ; each G is represented by the formula C-P 1 ; each P 1 is a phosphorothioate internucleoside linkage; each P 2 is a phosphodiester internucleoside linkage; j is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); k is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and I is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7). In some embodiments, j is 3. In some embodiments, k is 6. In some embodiments, I is 2. In some embodiments, j is 3, k is 6, and I is 2. The antisense strand is complementary (e.g., fully or partially complementary) to a target nucleic acid.

In some embodiments of the disclosure, the antisense strand includes a structure represented by Formula A4, wherein Formula A4 is, in the 5’-to-3’ direction:

A-S-B-S-A-O-A-O-A-O-B-O-A-O-A-O-A-O-A-O-A-O-A-O-A-O-B-O-A -O-B-S-A-S-A-S-A-S-B-S-A Formula A4; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage.

In some embodiments of the disclosure, the siRNA may contain a sense strand including a region represented by Formula VII, wherein Formula VII is, in the 5’-to-3’ direction:

H-Bm-ln-A’-Bo-H-C

Formula VII; wherein A’ is represented by the formula C-P 2 -D-P 2 ; each H is represented by the formula (C-P 1 )2; each I is represented by the formula (D-P 2 ); B, C, D, P 1 , and P 2 are as defined in Formula VI; m is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); n is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7); and o is an integer from 1 to 7 (e.g., 1 , 2, 3, 4, 5, 6, or 7). In some embodiments, m is 3. In some embodiments, n is 3. In some embodiments, o is 3. In some embodiments, m is 3, n is 3, and o is 3. The sense strand is complementary (e.g., fully or partially complementary) to the antisense strand.

In some embodiments of the disclosure, the sense strand includes a structure represented by Formula S9, wherein Formula S9 is, in the 5’-to-3’ direction:

A-S-A-S-A-O-A-O-A-O-B-O-B-O-B-O-A-O-B-O-A-O-A-O-A-O-A-S-A -S-A

Formula S9; wherein A represents a 2’-O-Me ribonucleoside, B represents a 2’-F ribonucleoside, O represents a phosphodiester internucleoside linkage, and S represents a phosphorothioate internucleoside linkage. In some embodiments of the disclosure, the siRNA may contain an antisense strand including a region that is represented by Formula VIII:

Z-((A-P-)n(B-P-)m)q;

Formula VIII wherein Z is a 5’ phosphorus stabilizing moiety; each A is a 2’-O-methyl (2'-O-Me) ribonucleoside; each B is a 2'-fluoro-ribonucleoside; each P is, independently, an internucleoside linkage selected from a phosphodiester linkage and a phosphorothioate linkage; n is an integer from 1 to 5 (e.g., 1 , 2, 3, 4, or 5); m is an integer from 1 to 5 (e.g., 1 , 2, 3, 4, or 5); and q is an integer between 1 and 30 (1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, or 30).

Methods of siRNA Synthesis

The siRNA molecules of the disclosure can be synthesized by standard methods known in the art as further discussed below, e.g., by use of an automated DNA synthesizer, such as are commercially available from, for example, Biosearch, Applied Biosystems, Inc.

The siRNA agent can be prepared using solution-phase or solid-phase organic synthesis or both. Organic synthesis offers the advantage that the oligonucleotide including unnatural or modified nucleotides can be easily prepared. siRNA molecules of the disclosure can be prepared using solution-phase or solid-phase organic synthesis or both.

Further, it is contemplated that for any siRNA agent disclosed herein, further optimization could be achieved by systematically either adding or removing linked nucleosides to generate longer or shorter sequences. Further still, such optimized sequences can be adjusted by, e.g., the introduction of modified nucleosides, and/or modified internucleoside linkages as described herein or as known in the art, including alternative nucleosides, alternative sugar moieties, and/or alternative internucleoside linkages as known in the art and/or discussed herein to further optimize the molecule (e.g., increasing serum stability or circulating half-life, increasing thermal stability, enhancing transmembrane delivery, and/or targeting to a particular location or cell type).

Divalent Cations

The siRNA molecules of the disclosure may contain one or more phosphodiester internucleoside linkages and/or an analog thereof, such as a phosphorothioate internucleoside linkage, in which oxyanion moieties are electrostatically neutralized by ionic bonding to a divalent metal cation, such as Ba 2+ , Be 2+ , Ca 2+ , Cu 2+ , Mg 2+ , Mn 2+ , Ni 2+ , or Zn 2+ .

The siRNA molecules of the disclosure may include a plurality of cationic binding sites (e.g., electron-dense sites) that are saturated by one or more divalent cations (e.g., Ba 2+ , Be 2+ , Ca 2+ , Cu 2+ , Mg 2+ , Mn 2+ , Ni 2+ , or Zn 2+ , or a combination thereof). Because of their positive charge, divalent cations are typically reactive with negatively charges atoms (e.g., oxyanion from a phosphate group or phosphorothioate group carrying a unit or partial negative charge). The one or more divalent cations may have an ionic radius, when measured in the form of a crystal lattice, of about 30 picometers to about 150 picometers (e.g., from about 30 picometers to about 140 picometers, from about 40 picometers to about 130 picometers, from about 50 picometers to about 120 picometers, from about 60 picometers to about 110 picometers, from about 60 picometers to about 100 picometers, or from about 60 picometers to about 90 picometers). The calculated crystal radii of the divalent cations disclosed by R. D. Shannon, Acta Crystallographica A. 32:751-767, 1976, are herein incorporated by reference.

The degree of saturation of a siRNA molecule’s cationic binding sites by the one or more divalent cations may range from about 10% to about 100% (e.g., from about 20% to about 100%, from about 30% to about 100%, from about 40% to about 100%, from about 50% to about 100%, from about 60% to about 100%, from about 70% to about 100%, from about 80% to about 100%, or from about 90% to about 100%).

In some embodiments, the antisense strand of the siRNA molecule may have a length of from 10 to 30 nucleotides and may be ionically bound to a total of from 10 to 30 divalent cations. For example, the molar ratio of antisense strand nucleotides to divalent cations in the siRNA molecule could range from 1 :3 to 3:1 (e.g., 1 :3, 1 .1 :3, 1 .2:3, 1 .3:3, 1 .4:3, 1 .5:3, 1 .6:3, 1 .7:3, 1 .8:3, 1 .9:3, 2:3, 2.1 :3, 2.2:3, 2.3:3, 2.4:3, 2.5:3, 2.6:3, 2.7:3, 2.8:3, 2.9:3, 1 :1 , 3:2.9, 3:2.8, 3:2.7, 3:2.6, 3:2.5, 3:2.4, 3:2.3, 3:2.2, 3:2.1 , 3:2, 3:1.9, 3:1.8, 3:1.7, 3:1.6, 3:1.5, 3:1.4, 3:1.3, 3:1.2, 3:1 .1 , or 3:1).

In some embodiments, the sense strand of the siRNA molecule may have a length of from 10 to 30 nucleotides and may be ionically bound to a total of from 10 to 30 divalent cations. For example, the molar ratio of sense strand nucleotides to divalent cations in the siRNA molecule could range from 1 :3 to 3:1 (e.g., 1 :3, 1.1 :3, 1.2:3, 1.3:3, 1.4:3, 1 .5:3, 1.6:3, 1.7:3, 1.8:3, 1.9:3, 2:3, 2.1 :3, 2.2:3, 2.3:3, 2.4:3, 2.5:3, 2.6:3, 2.7:3, 2.8:3, 2.9:3, 1 :1 , 3:2.9, 3:2.8, 3:2.7, 3:2.6, 3:2.5, 3:2.4, 3:2.3, 3:2.2, 3:2.1 , 3:2, 3:1.9, 3:1.8, 3:1.7, 3:1.6, 3:1.5, 3:1.4, 3:1.3, 3:1.2, 3:1.1 , or 3:1).

The siRNA molecules of the disclosure may be combined with one or more divalent cations in a specific molar ratio. The specific molar ratio of siRNA molecule to divalent cation may be relevant to the toxicity benefit achieved by the divalent cation. For example, the molar ratio of siRNA molecule to divalent cation may range from 1 :10 to 1 :50 (e.g., 1 :10, 1 :11 , 1 :12, 1 :13, 1 :14, 1 :15, 1 :16, 1 :17, 1 :18, 1 :19, 1 :20, 1 :21 , 1 :22, 1 :23, 1 :24, 1 :25, 1 :26, 1 :27, 1 :28, 1 :29, 1 :30, 1 :31 , 1 :32, 1 :33, 1 :34, 1 :35, 1 :36, 1 :37, 1 :38, 1 :39, 1 :40. 1 :41 , 1 :42, 1 :43. 1 :44, 1 :45, 1 :46, 1 :47, 1 :48, 1 :49, or 1 :50). In some embodiments, the molar ratio of siRNA molecule to divalent cation may range from 1 :18 to 1 :38 (e.g., 1 :18, 1 :19, 1 :20, 1 :21 , 1 :22, 1 :23, 1 :24, 1 :25, 1 :26, 1 :27, 1 :28, 1 :29, 1 :30, 1 :31 , 1 :32, 1 :33, 1 :34, 1 :35, 1 :36, 1 :37, or 1 :38). In some embodiments, the molar ratio of siRNA molecule to divalent cation may range from 1 :20 to 1 :25 (e.g., 1 :20, 1 :21 , 1 :22, 1 :23, 1 :24, or 1 :25). In some embodiments, the molar ratio of siRNA molecule to divalent cation may be 1 :20. In some embodiments, the molar ratio of siRNA molecule to divalent cation may be 1 :25.

The siRNA molecules of the disclosure may be combined with one or more divalent cations in which the divalent cation is present in a specific concentration or range of concentrations. The concentration of the divalent cation may be relevant to the toxicity benefit achieved by the divalent cation. For example, the concentration of the divalent cation may be from 20 mM to 150 mM (e.g. ,20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 , 32, 33, 34, 35, 36, 37, 38, 39, 40, 41 , 42, 43, 44, 45, 46, 47, 48, 49, 50, 51 , 52, 53, 54, 55, 56, 57, 58, 59, 60, 61 , 62, 63, 64, 65, 66, 67, 68, 69, 70, 71 , 72, 73, 74, 75, 76, 77, 78, 79, 80, 81 , 82, 83, 84, 85, 86, 87, 88, 89, 90, 91 , 92, 93, 94, 95, 96, 97, 98, 99, 100, 101 , 102, 103, 104, 105, 106, 107, 108, 109, 110, 111 , 112, 113, 114, 115, 116, 117, 118, 119, 120, 121 , 122, 123, 124, 125, 126, 127, 128, 129, 130, 131 , 132, 133, 134, 135, 136, 137, 138, 139, 140, 141 , 142, 143, 144, 145, 146, 147, 148, 149, or 150 mM). In some embodiments, the concentration of the divalent cation is from 20 mM to 100 mM (e.g., 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 , 32, 33, 34, 35, 36, 37, 38, 39, 40, 41 , 42, 43, 44, 45, 46, 47, 48, 49, 50, 51 , 52, 53, 54, 55, 56, 57, 58, 59,

60, 61 , 62, 63, 64, 65, 66, 67, 68, 69, 70, 71 , 72, 73, 74, 75, 76, 77, 78, 79, 80, 81 , 82, 83, 84, 85, 86,

87, 88, 89, 90, 91 , 92, 93, 94, 95, 96, 97, 98, 99, or 100 mM). In some embodiments, the concentration of the divalent cation is from 35 mM to 75 mM (e.g., 35, 36, 37, 38, 39, 40, 41 , 42, 43,

44, 45, 46, 47, 48, 49, 50, 51 , 52, 53, 54, 55, 56, 57, 58, 59, 60, 61 , 62, 63, 64, 65, 66, 67, 68, 69, 70,

71 , 72, 73, 74, or 75 mM). In some embodiments, the concentration of the divalent cation may be from 40 mM to 70 mM (e.g., 40, 41 , 42, 43, 44, 45, 46, 47, 48, 49, 50, 51 , 52, 53, 54, 55, 56, 57, 58, 59, 60, 61 , 62, 63, 64, 65, 66, 67, 68, 69, or 70 mM).

The siRNA molecule may include one or more atoms having a negative charge and the divalent cation may include a positive charge. In some embodiments, the siRNA molecule and divalent cation are present in an amount so that there is a specific ratio of negative to positive charge present within the composition. Methods of determining the negative to positive charge ratio are known in the art, for example, in Furst et al., Electrophoresis., 37:2685-2691 , 2016, the disclosure of which is hereby incorporated by reference. In some embodiments, the ratio of negative charge to positive charge is from 0.75 to 7.5 (e.g., 0.76, 0.77, 0.78, 0.79, 0.80, 0.81 , 0.82, 0.83, 0.84, 0.85, 0.86, 0.87, 0.88, 0.89, 0.90, 0.91 , 0.92, 0.93, 0.94, 0.95, 0.96, 0.97, 0.98, 0.99, 1.0, 1.1 , 1.2, 1.3, 1.4, 1.5,

1.6, 1.7, 1.8, 1.9, 2.0, 2.1 , 2.2, 2.3, 2.4, 2.5, 2.6, 2.7, 2.8, 2.9, 3.0, 3.1 , 3.2, 3.3, 3.4, 3.5, 3.6, 3.7, 3.8,

3.9, 4.0, 4.1 , 4.2, 4.3, 4.4, 4.5, 4.6, 4.7, 4.8, 4.9, 5.0, 5.1 , 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1 ,

6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1 , 7.2, 7.3, 7.4, or 7.5). In some embodiments, the ratio of negative charge to positive charge is from 1 .0 to 2.0 (e.g., from 1 .0 to 1 .9, from 1 .0 to 1 .8, from 1.0 to

1.7, from 1.0 to 1.6, from 1.0 to 1.5, from 1.0 to 1.4, from 1.0 to 1.3, from 1.0 to 1.2, from 1.0 to 1.1 , from 1 .1 to 2.0, from 1 .2 to 2.0, from 1 .3 to 2.0, from 1 .4 to 2.0, from 1 .5 to 2.0, from 1 .6 to 2.0, from 1 .7 to 2.0, from 1 .8 to 2.0, or from 1 .9 to 2.0). In some embodiments, the ratio of negative charge to positive charge is from 0.75 to 6.5 (e.g., from 0.75 to 5.5, from 0.75 to 4.5, from 0.75 to 3.5, from 0.75 to 2.5, from 0.75 to 1 .5, or from 0.75 to 1). In some embodiments, the ratio of negative charge to positive charge is from 1 to 7.5 (e.g., from 1 .5 to 7.5, from 2.5 to 7.5, from 3.5 to 7.5, from 4.5 to 7.5, from 5.5 to 7.5, or from 6.5 to 7.5).

5' Phosphorus Stabilizing Moieties

To further protect the siRNA molecules of this disclosure from degradation, a 5'-phosphorus stabilizing moiety may be employed. A 5'-phosphorus stabilizing moiety replaces the 5'-phosphate to prevent hydrolysis of the phosphate. Hydrolysis of the 5'-phosphate prevents binding to RISC, a necessary step in gene silencing. Any replacement for phosphate that does not impede binding to RISC is contemplated in this disclosure. In some embodiments, the replacement for the 5'-phosphate is also stable to in vivo hydrolysis. Each strand of a siRNA molecule may independently and optionally employ any suitable 5'-phosphorus stabilizing moiety.

Formula XIII Formula XIV Formula XV Formula XVI

Some exemplary endcaps are demonstrated in Formulas IX-XVI. Nuc in Formulas IX-XVI represents a nucleobase or nucleobase derivative or replacement as described herein. X in formula IX-XVI represents a 2’-modification as described herein. Some embodiments employ hydroxy as in Formula IX, phosphate as in Formula X, vinylphosphonates as in Formula XI and XIV, 5’-methyl- substitued phosphates as in Formula XII, XIII, and XVI, methylenephosphonates as in Formula XV, or vinyl 5'-vinylphsophonate as a 5'-phosphorus stabilizing moiety as demonstrated in Formula XI.

Hydrophobic Moieties

The present disclosure further provides siRNA molecules having one or more hydrophobic moieties attached thereto. The hydrophobic moiety may be covalently attached to the 5’ end or the 3’ end of the siRNA molecules of the disclosure. Non-limiting examples of hydrophobic moieties suitable for use with the siRNA molecules of the disclosure may include cholesterol, vitamin D, tocopherol, phosphatidylcholine (PC), docosahexaenoic acid, docosanoic acid, PC-docosanoic acid, eicosapentaenoic acid, lithocholic acid or any combination of the aforementioned hydrophobic moieties with PC. siRNA Branching

The siRNA molecules of the disclosure may be branched. For example, the siRNA molecules of the disclosure may have one of several branching patterns, as described herein.

According to the present disclosure, the siRNA molecules disclosed herein may be branched siRNA molecules. The siRNA molecule may not be branched, or may be di-branched, tri-branched, or tetra-branched, connected through a linker. Each main branch may be further branched to allow for 2, 3, 4, 5, 6, 7, or 8 separate RNA single- or double-strands. The branch points on the linker may stem from the same atom, or separate atoms along the linker. Some exemplary embodiments are listed in Table 2. Table 2. Branched siRNA structures

In some embodiments, the siRNA molecule is a branched siRNA molecule. In some embodiments, the branched siRNA molecule is di-branched, tri-branched, ortetra-branched. In some embodiments, the di-branched siRNA molecule is represented by any one of Formulas XVII-XIX, wherein each RNA, independently, is an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety (e.g., phosphoroamidite, tosylated solketal, 1 ,3-diaminopropanol, pentaerythritol, or any one of the branch point moieties described in US 10,478,503). In some embodiments, the tri-branched siRNA molecule represented by any one of Formulas XX-XXIII, wherein each RNA, independently, is an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety.

In some embodiments, the tetra-branched siRNA molecule represented by any one of Formulas XXIV-XXVIII, wherein each RNA, independently, is an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety.

Linkers

Multiple strands of siRNA described herein may be covalently attached by way of a linker. The effect of this branching improves, inter alia, cell permeability allowing better access into cells (e.g., neurons or glial cells) in the CNS. Any linking moiety may be employed which is not incompatible with the siRNAs of the present invention. Linkers include ethylene glycol chains of 2 to 10 subunits (e.g., 2, 3, 4, 5, 6, 7, 8, 9, or 10 subunits), alkyl chains, carbohydrate chains, block copolymers, peptides, RNA, DNA, and others. In some embodiments, any carbon or oxygen atom of the linker is optionally replaced with a nitrogen atom, bears a hydroxyl substituent, or bears an oxo substituent. In some embodiments, the linker is a poly-ethylene glycol (PEG) linker. The PEG linkers suitable for use with the disclosed compositions and methods include linear or non-linear PEG linkers. Examples of non-linear PEG linkers include branched PEGs, linear forked PEGs, or branched forked PEGs.

PEG linkers of various weights may be used with the disclosed compositions and methods. For example, the PEG linker may have a weight that is between 5 and 500 Daltons. In some embodiments, a PEG linker having a weight that is between 500 and 1 ,000 Dalton may be used. In some embodiments, a PEG linker having a weight that is between 1 ,000 and 10,000 Dalton may be used. In some embodiments, a PEG linker having a weight that is between 200 and 20,000 Dalton may be used. In some embodiments, the linker is covalently attached to a sense strand of the siRNA. In some embodiments, the linker is covalently attached to an antisense strand of the siRNA. In some embodiments, the PEG linker is a triethylene glycol (TrEG) linker. In some embodiments, the PEG linker is a tetraethylene glycol (TEG) linker.

In some embodiments, the linker is an alkyl chain linker. In some embodiments, the linker is a peptide linker. In some embodiments, the linker is an RNA linker. In some embodiments, the linker is a DNA linker.

Linkers may covalently link 2, 3, 4, or 5 unique siRNA strands. The linker may covalently bind to any part of the siRNA oligomer. In some embodiments, the linker attaches to the 3' end of nucleosides of each siRNA strand. In some embodiments, the linker attaches to the 5' end of nucleosides of each siRNA strand. In some embodiments, the linker attaches to a nucleoside of an siRNA strand (e.g., sense or antisense strand) by way of a covalent bond-forming moiety. In some embodiments, the covalent-bond-forming moiety is selected from the group consisting of an alkyl, ester, amide, carbonate, carbamate, triazole, urea, formacetal, phosphonate, phosphate, and phosphate derivative (e.g., phosphorothioate, phosphoramidate, etc.).

In some embodiments, the linker has a structure of Formula L1 :

(Formula L1)

In some embodiments, the linker has a structure of Formula L2:

(Formula L2)

In some embodiments, the linker has a structure of Formula L3:

(Formula L3)

In some embodiments, the linker has a structure of Formula L4:

(Formula L4)

In some embodiments, the linker has a structure of Formula L5:

(Formula L5)

In some embodiments, the linker has a structure of Formula L6:

(Formula L6)

In some embodiments, the linker has a structure of Formula L7, as is shown below:

(Formula L7)

In some embodiments, the linker has a structure of Formula L8:

(Formula L8)

In some embodiments, the linker has a structure of Formula L9:

(Formula L9)

In some embodiments, the selection of a linker for use with one or more of the branched siRNA molecules disclosed herein may be based on the hydrophobicity of the linker, such that, e.g., desirable hydrophobicity is achieved for the one or more branched siRNA molecules of the disclosure. For example, a linker containing an alkyl chain may be used to increase the hydrophobicity of the branched siRNA molecule as compared to a branched siRNA molecule having a less hydrophobic linker or a hydrophilic linker.

The siRNA agents disclosed herein may be synthesized and/or modified by methods well established in the art, such as those described in Beaucage, S. L. et al. (edrs.), Current Protocols in Nucleic Acid Chemistry, John Wiley & Sons, Inc., New York, N.Y., 2000, which is hereby incorporated herein by reference.

Methods of Treatment

The MS/73-targeting siRNA molecules of the disclosure may be delivered to a subject, thereby treating a microsatellite repeat expansion disorder and/or mitigating phenotypes associated with the disorder. For example, the siRNA molecules may be delivered to a subject, thereby treating Huntington’s disease and/or mitigating Huntington’s disease associated phenotypes (e.g., movement disorders such as chorea and dystonia, cognitive impairments, and psychiatric disorders such as depression and anxiety). Alternatively, the MS/73-targeting siRNA molecules of the disclosure may be delivered to a subject, thereby treating myotonic dystrophy type 1 and/or mitigating myotonic dystrophy type 1-associated phenotypes (e.g., muscle wasting and weakness). As a further alternative, the MS/7-targeting siRNA molecules of the disclosure may be delivered to a subject, thereby treating and mitigating the effects of spinocerebellar ataxias or any one of the many syndromes caused by expansions of DNA microsatellite repeats. Exemplary microsatellite repeat expansion disorders are described herein and discussed in Rodriguez et al., Neurobiology of Disease, 130:104515, 2019, the disclosure of which is incorporated herein by reference. Furthermore, the siRNA molecules of the disclosure may also be delivered to a subject having a variant of the MSH3 gene for which siRNA-mediated gene silencing of the MSH3 variant gene reduces the expression level of MSH3 transcript, thereby treating Huntington’s Disease, myotonic dystrophies, spinocerebellar ataxias, or another microsatellite repeat expansion disorder.

The disclosure provides methods of treating a subject by way of MSH3 gene silencing with one or more of the siRNA molecules described herein. The gene silencing may be performed in a subject to silence wild type MSH3 transcripts, mutant MSH3 transcripts, splice isoforms of MSH3 transcripts, and/or overexpressed MSH3 transcripts thereof, relative to a healthy subject. The method may include delivering to the CNS or affected tissues of the subject (e.g., a human) the siRNA molecules of the disclosure or a pharmaceutical composition containing the same by any appropriate route of administration (e.g., intracerebroventricular, intrathecal injection, intrastriatal injection, intracisterna magna injection by catheterization, intraparenchymal injection, intravenous injection, subcutaneous injection, or intramuscular injection). The active compound can be administered in any suitable dose. The actual dosage amount of a composition of the present disclosure administered to a patient can be determined by physical and physiological factors such as body weight, severity of condition, previous or concurrent therapeutic interventions, idiopathy of the patient and on the route of administration. Depending upon the dosage and the route of administration, the number of administrations of a preferred dosage and/or an effective amount may vary according to the response of the subject. The practitioner responsible for administration will, in any event, determine the concentration of active ingredient(s) in a composition and appropriate dose(s) for the individual subject. Administration may occur any suitable number of times per day, and for as long as necessary. Subjects may be adult or pediatric humans, with or without comorbid diseases.

Selection of Subjects

Subjects that may be treated with the siRNA molecules disclosed herein are subjects in need of treatment of, for example, a microsatellite repeat expansion disorder, such as Huntington’s Disease, and/or any other medical risk(s) associated with DNA microsatellite repeat expansions or a gain of function mutation in the MSH3 gene. Subjects that may be treated with the siRNA molecules disclosed herein may include, for example, humans, monkeys, rats, mice, pigs, and other mammals containing at least one orthologous copy of the MSH3 gene. Subjects may be adult or pediatric humans, with or without comorbid diseases.

Pharmaceutical Compositions

The siRNA molecules in the present disclosure may be formulated into a pharmaceutical composition for administration to a subject in a biologically compatible form suitable for administration in vivo. Accordingly, the present disclosure provides a pharmaceutical composition containing a siRNA molecule of the disclosure in admixture with a suitable diluent, carrier, or excipient. The siRNA molecules may be administered, for example, directly into the CNS or affected tissues of the subject (e.g., by way of intracerebroventricular, intrastriatally, intrathecal injection, intra-cisterna magna injection by catheterization, intraparenchymal injection, intravenous injection, subcutaneous injection, or intramuscular injection).

Conventional procedures and ingredients for the selection and preparation of suitable formulations are described, for example, in Remington, J.P. The Science and Practice of Pharmacy, Easton, PA. Mack Publishers, 2012, 22 nd ed. and in The United States Pharmacopeial Convention, The National Formulary, United States Pharmacopeial, 2015, USP 38 NF 33).

Under ordinary conditions of storage and use, a pharmaceutical composition may contain a preservative, e.g., to prevent the growth of microorganisms. Pharmaceutical compositions may include sterile aqueous solutions, dispersions, or powders, e.g., for the extemporaneous preparation of sterile solutions or dispersions. In all cases the form may be sterilized using techniques known in the art and may be fluidized to the extent that may be easily administered to a subject in need of treatment.

A pharmaceutical composition may be administered to a subject, e.g., a human subject, alone or in combination with pharmaceutically acceptable carriers, as noted herein, the proportion of which may be determined by the solubility and/or chemical nature of the compound, chosen route of administration, and standard pharmaceutical practice.

Dosing Regimens

A physician having ordinary skill in the art can readily determine an effective amount of the siRNA molecule for administration to a mammalian subject (e.g., a human) in need thereof. For example, a physician could start prescribing doses of one the siRNA molecules of the disclosure at levels lower than that required to achieve the desired therapeutic effect and gradually increase the dosage until the desired effect is achieved. Alternatively, a physician may begin a treatment regimen by administering one of the siRNA molecules of the disclosure at a high dose and subsequently administer progressively lower doses until reaching a minimal dosage at which a therapeutic effect is achieved (e.g., a reduction in expression of a target gene sequence). In general, a suitable daily dose of one of the siRNA molecules of the disclosure will be an amount of the siRNA molecule which is the lowest dose effective to produce a therapeutic effect. The ss- or ds-siRNA molecules of the disclosure may be administered by injection, e.g., intrathecally, intracerebroventricularly, by intra-cisterna magna injection by catheterization, intraparenchymally, intravenously, subcutaneously, or intramuscularly. A daily dose of a therapeutic composition of the siRNA molecules of the disclosure may be administered as a single dose or as two, three, four, five, six or more doses administered separately at appropriate intervals throughout the day, week, month, or year, optionally, in unit dosage forms. While it is possible for the siRNA molecules of the disclosure to be administered alone, it may also be administered as a pharmaceutical formulation in combination with excipients, carriers, and optionally, additional therapeutic agents.

Routes of Administration

The method of the disclosure contemplates any route of administration tolerated by the therapeutic composition. Some embodiments of the method include injection intrathecally, intracerebroventricularly, intrastriatally, intraparenchymally, or by intra-cisterna magna injection by catheterization.

Intrathecal injection is the direct injection into the spinal column or subarachnoid space. By injecting directly into the CSF of the spinal column the siRNA molecules of the disclosure have direct access to cells (e.g., neurons and glial cells) in the spinal column and a route to access the cells in the brain by bypassing the blood brain barrier.

Intracerebroventricular (ICV) injection is a method to directly inject into the CSF of the cerebral ventricles. Similar to intrathecal injection, ICV is a method of injection which bypasses the blood brain barrier. Using ICV allows the advantage of access to the cells of the brain and spinal column without the danger of the therapeutic being degraded in the blood.

Intrastriatal injection is the direct injection into the striatum, or corpus striatum. The striatum is an area in the subcortical basal ganglia in the brain. Injecting into the striatum bypasses the blood brain barrier and the pharmacokinetic challenges of injection into the blood stream and allows for direct access to the cells of the brain.

Intraparenchymal administration is the direct injection into the parenchyma (e.g., the brain parenchyma). Injection into the brain parenchyma allows for injection directly into brain regions affected by a disease or disorder while bypassing the blood brain barrier.

Intra-cisterna magna injection by catheterization is the direct injection into the cisterna magna. The cisterna magna is the area of the brain located between the cerebellum and the dorsal surface of the medulla oblongata. Injecting into the cisterna magna results in more direct delivery to the cells of the cerebellum, brainstem, and spinal cord.

In some embodiments of the methods described herein, the therapeutic composition may be delivered to the subject by way of systemic administration, e.g., intravenously, intramuscularly, or subcutaneously.

Intravenous (IV) injection is a method to directly inject into the bloodstream of a subject. The IV administration may be in the form of a bolus dose or by way of continuous infusion, or any other method tolerated by the therapeutic composition.

Intramuscular (IM) injection is injection into a muscle of a subject, such as the deltoid muscle or gluteal muscle. IM may allow for rapid absorption of the therapeutic composition.

Subcutaneous injection is injection into subcutaneous tissue. Absorption of compositions delivered subcutaneously may be slower than IV or IM injection, which may be beneficial for compositions requiring continuous absorption.

Examples

The following examples are put forth so as to provide those of ordinary skill in the art with a description of how the compositions and methods described herein may be used, made, and evaluated, and are intended to be purely exemplary of the disclosure and are not intended to limit the scope of what the inventors regard as their disclosure. Example 1. MSH3 Knockdown

Objective

This Example describes the results of a series of experiments undertaken to investigate the ability of siRNA molecules complementary to specific regions within a human MSH3 mRNA transcript to effectuate reduced expression of the MSH3 ger\e.

Materials and Methods

MeWo cells (metastatic malignant melanoma cells, ATCC cat# HTB-65) were seeded at 30,000 cells per well in a 96-well tissue culture plate in Modified Eagle’s Medium alpha (MEM-alpha) with 6% fetal bovine serum without antibiotics or antimycotics. An equal volume of siRNA at either 4 .M or 1 pM in serum-free Opti-MEM was added to the cells, resulting in a final assay condition of 3% fetal bovine serum and either 2 pM or 0.5 pM siRNA. Non-siRNA treated cells (untreated) exposed to the same media composition were used as controls. Cells were grown in a standard cell culture incubator at 37°C. and 5% CO2. At 72 hours after siRNA treatment, the TaqMan Fast Advanced Cells- to-CT Kit (Thermo) was used to lyse all cells, digest the genomic DNA, perform reverse-transcription on the resulting RNA, and perform quantitative polymerase chain reaction (qPCR) on the resulting cDNA. Exon spanning MSH3-specific primer/probes (Thermo) were multiplexed with primer/probes for a housekeeping gene (ATP5B, Thermo), and all reactions were run on a Quantstudio 7 Flex (Thermo). MSH3 mRNA expression was calculated relative to untreated control cells using the delta- delta-Ct method. Biological duplicates and technical (qPCR) duplicates were run for each condition, and all results are expressed as the mean of all technical and biological duplicates, and as residual mRNA expression as a percent of untreated control cells.

Results

Cells were treated with an siRNA molecule of the disclosure having an antisense strand and a sense strand as shown in Tables 3 and 4, below. The knockdown efficiency of the siRNA molecule was measured as the residual mRNA expression as a percent of untreated at 2 pM and 0.5 pM. The knockdown results are reported in Table 3 (2 pM) and Table 4 (0.5 pM).

Table 3. MSH3 knockdown with siRNA molecules of the disclosure at 2 pM

Table 4. MSH3 knockdown with siRNA molecules of the disclosure at 0.5 pM

Example 2. IC50 Determination of IMS H3-targeting siRNA Molecules

Objective

This Example describes the results of a series of experiments undertaken to determine the IC50 of siRNA molecules complementary to specific regions within a human MSH3 mRNA.

Materials and Methods

MeWo cells (metastatic malignant melanoma cells, ATCC cat# HTB-65) were actively transfected at concentrations of 1 fM to 100 nM using 0.05% RNAiMax as the transfection reagent. Non-siRNA treated cells (untreated) exposed to the same media composition were used as controls. Cells were grown in a standard cell culture incubator at 37°C. and 5% CO2. At 72 hours after siRNA treatment, the TaqMan Fast Advanced Cells-to-CT Kit (Thermo) was used to lyse all cells, digest the genomic DNA, perform reverse-transcription on the resulting RNA, and perform quantitative polymerase chain reaction (qPCR) on the resulting cDNA. Exon spanning MSH3-specific primer/probes (Thermo) were multiplexed with primer/probes for a housekeeping gene (ATP5B, Thermo), and all reactions were run on a Quantstudio 7 Flex (Thermo). MSH3 mRNA expression was calculated relative to untreated control cells using the delta-delta-Ct method. Biological duplicates and technical (qPCR) duplicates were run for each condition, and all results are expressed as the mean of all technical and biological duplicates, and as residual mRNA expression as a percent of untreated control cells.

Results

Cells were treated with an siRNA molecule of the disclosure having an antisense strand and a sense strand as shown in Table 5, below. The IC50 was calculated and the knockdown results are reported in Table 5.

Table 5. IC50 of IMS H 3-T argeting siRNA Molecules

Example 3. In vivo inhibition of MSH3 gene expression using di-siRNA sequence variants in mouse

In vivo administration of di-siRNA 12 siRNA sequences targeting MSH3 were synthesized as di-siRNAs having the structure of

Formula XVII. Test articles are shown in Table 6, below.

Table 6. MSH3-targeting siRNA molecules used for in vivo experiments Di-siRNAs were formulated in PBS. siRNAs at the 20nmol dose were formulated as a divalent cation salt. In vivo testing was performed in FVB/NJ female mice with intracerebroventricular (ICV) dosing, in groups of 8 animals. On day 1 , stereotactic injection was performed, wherein bilateral ICV injections of test article or PBS control (5 pL per side, 10 pL total) were performed at 0.5-2 pL/min after needle placement at the following coordinates from bregma: -0.45 mm or -0.25 mm anteriorposterior, +/- 1 mm mediolateral and -2.5 mm dorsoventral. Dose levels of 0.5 nmol, 1 nmol, 2.5 nmol, 5 nmol, and/or 20 nmol were utilized. At 28 days after injection, animals were perfused with cold 1X PBS and brains were collected and sliced. Tissue punches of fixed diameter and thickness were collected from different brain regions (motor cortex, hippocampus and striatum) and snap frozen on dry ice.

Protein expression analysis

To evaluate MSH3 protein expression levels, mouse brain tissue punches were homogenized in Cell Lysis Buffer with protease/phosphatase inhibitor cocktail (Cell Signaling Technologies) using a TissueLyser II. Tissue homogenate was centrifuged for 5 minutes at 1000 rpm to remove bubbles. Total protein content was normalized to using BCA (Pierce) to 0.125 mg/mL in 16 pL.

For MSH3 protein expression analysis, Western blot was performed using the Jess (Protein Simple) system and reagents as recommended by the manufacturer. The 12-230 kD separation cartridge was utilized, with Fluorescent Master Mix 1 and the anti-mouse detection module (all from Protein Simple). Samples were probed for mouse total MSH3 expression (MABE324 Antibody, EMD Millipore, 1 :100 dilution) and were normalized against the multiplexed housekeeping protein reference mouse actin (MAB8929, R&D Systems, 1 :20,000 dilution). Comparative analysis was performed for relative MSH3 quantitation against samples taken from PBS treated mice. Results are shown in Figures 1 , 2, 3, 7, and 9.

RNA expression analysis

To evaluate MSH3 mRNA expression levels, total RNA was extracted from mouse brain tissue punches using phenofchloroform extraction. This method was performed by first disrupting tissue samples in TRIzol reagent (Invitrogen) using a TissueLyser II (Qiagen) and adding chloroform to the homogenized samples at a ratio of 5:1 TRIzofchloroform. Tubes were then shaken vigorously and spun at 12,000 x g for 15 mins and the resulting upper aqueous phase containing total RNA was carefully removed and added to a clean tube. An equal volume of 70% ethanol was added to each sample and mixed gently. The sample was further purified using Qiagen RNeasy column purification according to standard kit protocol. Samples were eluted in 40 pL RNase-free water. Following elution, RNA was analyzed on a Tapestation 4200 Bioanalyzer (Agilent) to assess concentration and quality. All samples were normalized for total RNA. cDNA synthesis was performed in a 20 pL reaction volume using the Applied Biosystems High-Capacity cDNA Reverse Transcription kit. Following RT- PCR, nuclease-free water was added to cDNA for a final sample volume of 50 pL.

For Htt gene expression analysis, qPCR was performed using TaqMan reagents on a QuantStudio7 Real-Time PCR instrument (Life Technologies). Samples were probed for mouse total MSH3 expression (assay ID Mm00487756_m1) and were normalized against the multiplexed housekeeping gene reference mouse TBP (Assay ID Mm01277042_m1). Comparative analysis was performed for relative MSH3 quantitation (DeltaDeltaCt) against samples taken from PBS treated mice. Results are shown in Figures 4, 5, 6, and 8. Statistical assessment

Statistical significance between PBS control and each treatment group was determined using two-way ANOVA and Dunnett’s multiple comparisons test, except for Figure 9, which utilized Sidak’s multiple comparisons test.

Example 4. Generating MSH3-targeting siRNA Molecules

The siRNA molecules of the disclosure can be synthesized by standard methods known in the art as further discussed below, e.g., by use of an automated DNA synthesizer, such as are commercially available from, for example, Biosearch, Applied Biosystems, Inc.

The siRNA agent can be prepared using solution-phase or solid-phase organic synthesis or both. Organic synthesis offers the advantage that the oligonucleotide including unnatural or modified nucleotides can be easily prepared. Specific examples of siRNA molecules, with the nucleotide sequence of the sense and antisense strand, as well as the MutS Homolog 3 (MSH3) mRNA target sequence, are shown in Table 1A, above. It is appreciated that one of skill in the art could anneal the antisense (AS) strand to the corresponding sense (S) strand to yield a ds-siRNA molecule. Alternatively, one of skill in the art could derive a ss-siRNA molecule using antisense strand only.

Example 5. Optimizing MSH3-targeting siRNA Molecules

It is contemplated that for any small interfering RNA (siRNA) agent disclosed herein, modifications to the siRNA may further optimize the molecule’s efficacy or biophysical properties (e.g., increasing serum stability or circulating half-life, increasing thermal stability, enhancing transmembrane delivery, and/or targeting to a particular location or cell type). Such optimization could be achieved by systematically either adding or removing linked nucleosides to generate longer or shorter sequences. Further siRNA optimization could include the incorporation of, for example, one or more alternative nucleosides, alternative 2’ sugar moieties, and/or alternative internucleoside linkages. Further still, such optimized siRNA molecules may include the introduction of hydrophobic and/or stabilizing moieties at the 5’ and/or 3’ ends. siRNA Optimization with Alternative Nucleosides

Optimization of the siRNA molecules of the disclosure may include one or more of the following nucleoside modifications: 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5- halouracil and cytosine, 5-propynyl (-C=C-CH3) uracil and cytosine and other alkynyl derivatives of pyrimidine bases, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo, 8- amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7-methyladenine, 2-F-adenine, 2-amino-adenine, 8-azaguanine and 8-azaadenine, 7- deazaguanine and 7-deazaadenine, and/or 3-deazaguanine and 3-deazaadenine. The siRNA molecules may also include nucleobases in which the purine or pyrimidine base is replaced with other heterocycles, for example 7-deaza-adenine, 7-deazaguanosine, 2-aminopyridine, and/or 2-pyridone. Further optimization of the siRNA molecules of the disclosure may include nucleobases disclosed in US 3,687,808; Kroschwitz, J. I., ed. The Concise Encyclopedia of Polymer Science and Engineering, New York, John Wiley & Sons, 1990, pp. 858-859; Englisch et al., Angewandte Chemie, International Edition 30:613, 1991 ; and Sanghvi, Y.S., Chapter 16, Antisense Research and Applications, CRC Press, Gait, M.J. ed., 1993, pp. 289-302. siRNA Optimization with Alternative Sugar Modifications

Optimization of the siRNA molecules of the disclosure may include one or more of the following 2’ sugar modifications: 2’-O-methyl (2’-O-Me), 2'-methoxyethoxy (2'-O-CH2CH2OCH3, also known as 2'-0-(2-methoxyethyl) or 2'-MOE), 2'-dimethylaminooxyethoxy, i.e., a O(CH2)2ON(CH3)2 group, also known as 2'-DMAOE, and/or 2'-dimethylaminoethoxyethoxy (also known in the art as 2'-O- dimethylamino-ethoxy-ethyl or 2'-DMAEOE), i.e., 2'-O-CH2OCH2N(CH3)2. Other possible 2'- modifications that can optimize the siRNA molecules of the disclosure include all possible orientations of OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-alkynyl; or O-alkyl-O-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or unsubstituted C1 to C10 alkyl or C2 to C10 alkenyl and alkynyl. Other potential sugar substituent groups include, e.g., aminopropoxy (- OCH2CH2CH2NH2), allyl (-CH 2 -CH=CH 2 ), -O-allyl (-O-CH 2 -CH=CH 2 ) and fluoro (F). 2'-sugar substituent groups may be in the arabino (up) position or ribo (down) position. In some embodiments, the 2'-arabino modification is 2'-F. Similar modifications may also be made at other positions on the siRNA molecule, particularly the 3' position of the sugar on the 3' terminal nucleoside or in 2'-5' linked oligonucleotides and the 5' position of 5' terminal nucleotide. Oligonucleotides may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar. siRNA Optimization with Alternative Internucleoside Linkages

Optimization of the siRNA molecules of the disclosure may include one or more of the following internucleoside modifications: phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3'-alkylene phosphonates, 5'-alkylene phosphonates, phosphinates, phosphoramidates including 3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, selenophosphates, and boranophosphates having normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity wherein one or more internucleotide linkages is a 3' to 3', 5' to 5' or 2' to 2' linkage. siRNA Optimization with Hydrophobic Moieties

Optimization of the siRNA molecules of the disclosure may include hydrophobic moieties covalently attached to the 5’ end or the 3’ end. Non-limiting examples of hydrophobic moieties suitable for use with the siRNA molecules of the disclosure may include cholesterol, vitamin D, tocopherol, phosphatidylcholine (PC), docosahexaenoic acid, docosanoic acid, PC-docosanoic acid, eicosapentaenoic acid, lithocholic acid or any combination of the aforementioned hydrophobic moieties with PC. siRNA Optimization with Stabilizing Moieties

Optimization of the siRNA molecules of the disclosure may include a 5’-phosphorous stabilizing moiety that protects the siRNA molecules from degradation. A 5'-phosphorus stabilizing moiety replaces the 5'-phosphate to prevent hydrolysis of the phosphate. Hydrolysis of the 5'- phosphate prevents binding to RISC, a necessary step in gene silencing. Any replacement for phosphate that does not impede binding to RISC is contemplated in this disclosure. In some embodiments, the replacement for the 5'-phosphate is also stable to in vivo hydrolysis. Each siRNA strand may independently and optionally employ any suitable 5'-phosphorus stabilizing moiety. Nonlimiting examples of 5’ stabilizing moieties suitable for use with the siRNA molecules of the disclosure may include those demonstrated by Formulas IX-XVI above. siRNA Optimization with Branched siRNA

Optimization of the siRNA molecules of the disclosure may include the incorporation of branching patterns, such as, for example, di-branched, tri-branched, ortetra-branched siRNAs connected by way of a linker. Each main branch may be further branched to allow for 2, 3, 4, 5, 6, 7, or 8 separate RNA single- or double-strands. The branch points on the linker may stem from the same atom, or separate atoms along the linker. Some exemplary embodiments are listed in Table 2, above.

The siRNA composition of the disclosure may be optimized to be in the form of: di-branched siRNA molecules, as represented by any one of Formulas XVII-XIX; tri-branched siRNA molecules, as represented by any one of Formulas XX-XXIII; and/or tetra-branched siRNA molecules, as represented by any one of Formulas XXIV-XXVIII, wherein each RNA, independently, is an siRNA molecule, L is a linker, and each X, independently, represents a branch point moiety (e.g., phosphoroamidite, tosylated solketal, 1 ,3-diaminopropanol, pentaerythritol, or any one of the branch point moieties described in US 10,478,503).

Example 6. Preparation and Administrating MSH3-targeting siRNA Molecules

The siRNA molecules in the present disclosure may be formulated into a pharmaceutical composition for administration to a subject in a biologically compatible form suitable for administration in vivo. For example, the siRNA molecules of the disclosure may be administered in a suitable diluent, carrier, or excipient, and may further contain a preservative, e.g., to prevent the growth of microorganisms. Conventional procedures and ingredients for the selection and preparation of suitable formulations are described, for example, in Remington, J.P. The Science and Practice of Pharmacy, Easton, PA. Mack Publishers, 2012, 22 nd ed. and in The United States Pharmacopeial Convention, The National Formulary, United States Pharmacopeial, 2015, USP 38 NF 33).

The method of the disclosure contemplates any route of administration to the subject that is tolerated by the siRNA compositions of the disclosure. Non-limiting examples of siRNA injections into the CNS include intrathecally, intracerebroventricularly, intrastriatally, intraparenchymally, or intracisterna magna injection by catheterization. Examples of systemic administration include intravenous, intramuscular, and subcutaneous injection. A physician having ordinary skill in the art can readily determine an effective route of administration. Example 7. Methods for the Treatment of Microsatellite Repeat Expansion Diseases Using MSH3-targeting siRNA Molecules

A subject in need of treatment of a microsatellite repeat expansion disease, such as Huntington’s Disease, is treated with a dosage of the siRNA molecule or siRNA composition of the disclosure, formulated as a salt, at frequency determined by a practitioner. A physician having ordinary skill in the art can readily determine an effective amount of the siRNA molecule for administration to a mammalian subject (e.g., a human) in need thereof. For example, a physician could start prescribing doses of one of the siRNA molecules of the disclosure at levels lower than that required to achieve the desired therapeutic effect and gradually increase the dosage until the desired effect is achieved. Alternatively, a physician may begin a treatment regimen by administering one of the siRNA molecules of the disclosure at a high dose and subsequently administer progressively lower doses until a minimum dose that produces a therapeutic effect (e.g., a reduction in expression of MSH3 mRNA or suitable biomarker) is achieved. In general, a suitable daily dose of one of one of the siRNA molecules of the disclosure will be an amount which is the lowest dose effective to produce a therapeutic effect. The ss- or ds-siRNA molecules of the disclosure may be administered by injection, e.g., intrathecally, intracerebroventricularly, intrastriatally, intraparenchymally, intravenously, intramuscularly, subcutaneously, or by intra-cisterna magna injection via catheterization. A daily dose of a therapeutic composition of one of the siRNA molecules of the disclosure may be administered as a single dose or as two, three, four, five, six or more doses administered separately at appropriate intervals throughout the day, week, month, or year, optionally, in unit dosage forms. While it is possible for any of the siRNA molecules of the disclosure to be administered alone, it may also be administered as a pharmaceutical formulation in combination with excipients, carriers, and optionally, additional therapeutic agents. Dosage and frequency are determined based on the subject’s height, weight, age, sex, and other disorders.

The siRNA molecule(s) of the disclosure is selected by the practitioner for compatibility with the subject. Single- or double-stranded siRNA molecules (e.g., non-branched siRNA, di-branched siRNA, tri-branched siRNA, tetra-branched siRNA) are available for selection. The siRNA molecule chosen has an antisense strand and may have a sense strand with a sequence and RNA modifications (e.g., natural and non-natural internucleoside linkages, modified sugars, 5'-phosphorus stabilizing moieties, hydrophobic moieties, and/or branching structures) best suited to the patient.

The siRNA molecule is delivered by the route best suited the patient (e.g., intrathecally, intracerebroventricularly, intrastriatally, intraparenchymally, intravenously, intramuscularly, subcutaneously, or by intra-cisterna magna injection via catheterization) and condition at a rate tolerable to the patient until the subject has reached a maximum tolerated dose, or until symptoms are ameliorated satisfactorily.

Other Embodiments

All publications, patents, and patent applications mentioned in this specification are incorporated herein by reference to the same extent as if each independent publication or patent application was specifically and individually indicated to be incorporated by reference. While the invention has been described in connection with specific embodiments thereof, it will be understood that it is capable of further modifications and this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the invention that come within known or customary practice within the art to which the invention pertains and may be applied to the essential features hereinbefore set forth, and follows in the scope of the claims.

Other embodiments are within the claims.