Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
COMPOSITIONS AND METHODS FOR MODULATING HEMOGLOBIN GENE FAMILY EXPRESSION
Document Type and Number:
WIPO Patent Application WO/2013/173599
Kind Code:
A1
Abstract:
Aspects of the invention provide single stranded oligonucleotides for activating or enhancing expression of hemoglobin genes (HBB, HBD, HBEl, HBG1 or HBG2). Further aspects provide compositions and kits comprising single stranded oligonucleotides for activating or enhancing expression of hemoglobin genes. Methods for modulating expression of hemoglobin genes using the single stranded oligonucleotides are also provided. Further aspects of the invention provide methods for selecting a candidate oligonucleotide for activating or enhancing expression of hemoglobin genes.

Inventors:
KRIEG ARTHUR M (US)
SUBRAMANIAN ROMESH (US)
MCSWIGGEN JAMES (US)
LEE JEANNIE T (US)
Application Number:
PCT/US2013/041382
Publication Date:
November 21, 2013
Filing Date:
May 16, 2013
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
RANA THERAPEUTICS INC (US)
GEN HOSPITAL CORP (US)
International Classes:
C12N15/11; A61K31/7125; A61P7/06; C07H21/00
Domestic Patent References:
WO2008103763A22008-08-28
Foreign References:
US20090155910A12009-06-18
Other References:
SANKARAN, VIJAY G. ET AL.: "MicroRNA-15a and -16-1 act via MYB to elevate fetal hemoglobin expression in human trisomy 13", PNAS, vol. 108, no. 4, 25 January 2011 (2011-01-25), pages 1519 - 1524, XP055177794
FU, YAN-FANG ET AL.: "Mir-144 selectively regulates embryonic a-hemoglobin synthesis during primitive erythropoiesis", BLOOD, vol. 113, no. 6, 5 February 2009 (2009-02-05), pages 1340 - 1349, XP055177799
SANKARAN, VIJAY G.: "Targeted therapeutic strategies for fetal hemoglobin induction", HEMATOLOGY, vol. 2011, no. 1, 10 December 2011 (2011-12-10), pages 459 - 465, XP055108964
KHALIL, ABMAD M. ET AL.: "Many human large intergenic noncoding RNAs associate with chromatin-modifying complexes and affect gene expression", PNAS, vol. 106, no. 28, 14 July 2009 (2009-07-14), pages 11667 - 11672, XP055073351
See also references of EP 2850188A4
Attorney, Agent or Firm:
YOUNG, Daniel, W. (Greenfield & Sacks P.C.,600 Atlantic Avenu, Boston MA, US)
Download PDF:
Claims:
CLAIMS

What is claimed is:

1. A single stranded oligonucleotide having a sequence 5'-X-Y-Z, wherein X is any nucleotide, Y is a nucleotide sequence of 6 nucleotides in length that is not a seed sequence of a human microRNA, and Z is a nucleotide sequence of 1-23 nucleotides in length, wherein the single stranded oligonucleotide is complementary with at least 8 consecutive nucleotides of a PRC2-associated region of a hemoglobin gene.

2. The single stranded oligonucleotide of claiml, wherein the oligonucleotide does not comprise three or more consecutive guanosine nucleotides.

3. The single stranded oligonucleotide of claim 1 or 2, wherein the

oligonucleotide does not comprise four or more consecutive guanosine nucleotides.

4. The single stranded oligonucleotide of any one of claims 1 to 3, wherein the oligonucleotide is 8 to 30 nucleotides in length.

5. The single stranded oligonucleotide of any one of claims 1 to 4, wherein the oligonucleotide is 8 to 10 nucleotides in length and all but 1, 2, or 3 of the nucleotides of the complementary sequence of the PRC2-associated region are cytosine or guanosine nucleotides.

6. The single stranded oligonucleotide of any one of claims 1 to 5, wherein at least one nucleotide of the oligonucleotide is a nucleotide analogue.

7. The single stranded oligonucleotide of claim 6, wherein the at least one nucleotide analogue results in an increase in Tm of the oligonucleotide in a range of 1 to 5 °C compared with an oligonucleotide that does not have the at least one nucleotide analogue.

8. The single stranded oligonucleotide of any one of claims 1 to 7, wherein at least one nucleotide of the oligonucleotide comprises a 2' O-methyl.

9. The single stranded oligonucleotide of any one of claims 1 to 8, wherein each nucleotide of the oligonucleotide comprises a 2' O-methyl.

10. The single stranded oligonucleotide of any one of claims 1 to 8, wherein the oligonucleotide comprises at least one ribonucleotide, at least one deoxyribonucleotide, or at least one bridged nucleotide.

11. The single strand oligonucleotide of claim 11, wherein the bridged nucleotide is a LNA nucleotide, a cEt nucleotide or a ENA modified nucleotide.

12. The single stranded oligonucleotide of any one of claims 1 to 6, wherein each nucleotide of the oligonucleotide is a LNA nucleotide.

13. The single stranded oligonucleotide of any one of claims 1 to 6, wherein the nucleotides of the oligonucleotide comprise alternating deoxyribonucleotides and 2'-fluoro- deoxyribonucleotides.

14. The single stranded oligonucleotide of any one of claims 1 to 6, wherein the nucleotides of the oligonucleotide comprise alternating deoxyribonucleotides and 2'-0- methyl nucleotides.

15. The single stranded oligonucleotide of any one of claims 1 to 6, wherein the nucleotides of the oligonucleotide comprise alternating deoxyribonucleotides and ENA nucleotide analogues.

16. The single stranded oligonucleotide of any one of claims 1 to 6, wherein the nucleotides of the oligonucleotide comprise alternating deoxyribonucleotides and LNA nucleotides.

17. The single stranded oligonucleotide of any one of claims 13 to 16, wherein the 5' nucleotide of the oligonucleotide is a deoxyribonucleotide.

18. The single stranded oligonucleotide of any one of claims 1 to 6, wherein the nucleotides of the oligonucleotide comprise alternating LNA nucleotides and 2' -O-methyl nucleotides.

19. The single stranded oligonucleotide of claim 18, wherein the 5' nucleotide of the oligonucleotide is a LNA nucleotide.

20. The single stranded oligonucleotide of any one of claims 1 to 8, wherein the nucleotides of the oligonucleotide comprise deoxyribonucleotides flanked by at least one LNA nucleotide on each of the 5' and 3' ends of the deoxyribonucleotides.

21. The single stranded oligonucleotide of any one of claims 1 to 20, further comprising phosphorothioate internucleotide linkages between at least two nucleotides.

22. The single stranded oligonucleotide of claim 21, further comprising phosphorothioate internucleotide linkages between all nucleotides.

23. The single stranded oligonucleotide of any one of claims 1 to 22, wherein the nucleotide at the 3' position of the oligonucleotide has a 3' hydroxyl group.

24. The single stranded oligonucleotide of any one of claims 1 to 22, wherein the nucleotide at the 3' position of the oligonucleotide has a 3' thiophosphate.

25. The single stranded oligonucleotide of any one of claims 1 to 24, further comprising a biotin moiety conjugated to the 5' nucleotide.

26. A single stranded oligonucleotide comprising a region of complementarity that is complementary with at least 8 consecutive nucleotides of a PRC2-associated region of a hemoglobin gene, wherein the oligonucleotide has at least one of: a) a sequence that is 5'X-Y-Z, wherein X is any nucleotide and wherein X is anchored at the 5' end of the oligonucleotide, Y is a nucleotide sequence of 6 nucleotides in length that is not a human seed sequence of a microRNA, and Z is a nucleotide sequence of 1 to 23 nucleotides in length; b) a sequence that does not comprise three or more consecutive guanosine

nucleotides; c) a sequence that has less than a threshold level of sequence identity with every sequence of nucleotides, of equivalent length to the second nucleotide sequence, that are between 50 kilobases upstream of a 5 '-end of an off-target gene and 50 kilobases downstream of a 3 '-end of the off-target gene; d) a sequence that is complementary to a PRC2-associated region that encodes an RNA that forms a secondary structure comprising at least two single stranded loops; and/or e) a sequence that has greater than 60% G-C content.

27. The single stranded oligonucleotide of claim 26, wherein the oligonucleotide has the sequence 5'X-Y-Z and wherein the oligonucleotide is 8-50 nucleotides in length.

28. A composition comprising a single stranded oligonucleotide of any one of claims 1 to 27 and a carrier.

29. A composition comprising a single stranded oligonucleotide of any one of claims 1 to 27 in a buffered solution. 30. A composition of claim 29, wherein the oligonucleotide is conjugated to the carrier.

31. The composition of claim 30, wherein the carrier is a peptide.

32. The composition of claim 30, wherein the carrier is a steroid.

33. A pharmaceutical composition comprising a composition of any one of claims 28 to 32 and a pharmaceutically acceptable carrier.

34. A kit comprising a container housing the composition of any one of claims 28 to 33.

35. A method of increasing expression of a hemoglobin gene in a cell, the method comprising delivering the single stranded oligonucleotide of any one of claims 1 to 27 into the cell.

36. The method of claim 35, wherein delivery of the single stranded

oligonucleotide into the cell results in a level of expression of the hemoglobin gene that is at least 50% greater than a level of expression of the hemoglobin gene in a control cell that does not comprise the single stranded oligonucleotide.

37. A method increasing levels of a hemoglobin gene in a subject, the method comprising administering the single stranded oligonucleotide of any one of claims 1 to 26 to the subject.

38. A method of treating a condition associated with decreased levels of a hemoglobin gene in a subject, the method comprising administering the single stranded oligonucleotide of any one of claims 1 to 27 to the subject.

39. The method of claim 38, wherein the condition is anemia, sickle cell anemia and/or thalassemia.

Description:
COMPOSITIONS AND METHODS FOR MODULATING HEMOGLOBIN GENE

FAMILY EXPRESSION

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application No. 61/785,956, entitled "COMPOSITIONS AND METHODS FOR

MODULATING HEMOGLOBIN GENE FAMILY EXPRESSION", filed March 14, 2013 and U.S. Provisional Application No. 61/647,901, entitled "COMPOSITIONS AND

METHODS FOR MODULATING HEMOGLOBIN GENE FAMILY EXPRESSION", filed May 16, 2012, each of which is incorporated herein by reference in its entirety.

FIELD OF THE INVENTION

The invention relates to oligonucleotide based compositions, as well as methods of using oligonucleotide based compositions for treating disease.

BACKGROUND OF THE INVENTION

Red blood cells are essential for transporting oxygen throughout the body. Red blood cells are made up of hemoglobin, which is a multi-subunit oxygen-transport metalloprotein. During development, embryonic hemoglobin is composed of epsilon chains (encoded by HBE1) and zeta chains and is produced by the embryonic yolk sac. In human infants, hemoglobin is made up of alpha chains (encoded by HBA1 and HBA2) and gamma chains (encoded by HBG1 and HBG2), with the gamma chains gradually replaced by beta chains over time. The majority of hemoglobin in adults is made up of alpha chains and beta chains (encoded by HBB) with a small percentage (about 3%) made up of alpha and delta chains (encoded by HBD). Several disorders are caused by mutations in hemoglobin subunits and affect red blood cell function or production, resulting in anemia. Two major diseases that affect red blood cells include sickle cell anemia and thalassemia.

Sickle cell anemia is a recessive disorder caused by the absence of a polar amino acid at position six of the beta-globin chain due to a point mutation in HBB. The absence of this amino acid causes aggregation of hemoglobin and results in red blood cells having a stiff, sickle shape. The rigidity of these red blood cells results in vessel occlusion and ischaemia as the cells pass through capillary beds. Anemia is also a symptom, due to the excessive lysis of sickle- shaped red blood cells. Mouse models of sickle cell anemia have shown that expression of other hemoglobin subunits can alleviate symptoms. In adult sickle cell anemia mice, for example, expression of HBEl, which is normally not expressed in adults but serves a similar function as beta-chains during embryonic development, restores the mice to a normal phenotype.

Thalassemia is a group of hereditary blood disorders characterized by a reduced amount of hemoglobin and fewer red blood cells. There are several types of thalassemia, including alpha-thalassemia, beta-thalassemia, delta thalassemia. Alpha-thalassemia is caused by mutations in the HBA1 or HBA2 gene. These mutations cause reduction in alpha- globin production and formation of beta-chain tetramers with altered oxygen profiles and anemia. Delta-thalassemia is caused by a reduction in the synthesis of delta chains of hemoglobin, which is encoded by HBD. Beta-thalassemia, the most severe form of thalassemia, is caused by a reduction in the synthesis of the beta chains of hemoglobin, which is encoded by HBB. Beta-thalassemia is classified into three types, thalassemia minor, thalassemia intermedia, and thalassemia major, depending on the number of mutations and disease severity. Thalassemia minor occurs when only one beta globin allele is mutated and results in microcytic anemia. When more than one allele is mutated, thalassemia intermedia or thalassemia major can occur depending on the severity of the mutation. Patients with thalassemia major require blood transfusions or bone marrow transplantation, otherwise anemia, splenomegaly, and severe bone deformities occur. Patients with thalassemia intermedia may require blood transfusions depending on the severity of the disease.

SUMMARY OF THE INVENTION

Aspects of the invention disclosed herein provide methods and compositions that are useful for upregulating hemoglobin genes (HBB, HBD, HBEl, HBG1, and/or HBG2) in cells. In some embodiments, single stranded oligonucleotides are provided that target a PRC2-associated region of a hemoglobin gene (e.g., human HBB, human HBD, human HBEl, human HBG1, human HBG2) and thereby cause upregulation of the gene. In some embodiments, single stranded oligonucleotides are provided that target a PRC2-associated region of the gene encoding a hemoglobin gene. In some embodiments, these single stranded oligonucleotides activate or enhance expression of a hemoglobin gene by relieving or preventing PRC2 mediated repression of the hemoglobin gene. Upregulation of hemoglobin subunits is provided as a treatment for anemia, sickle cell anemia and/or thalassemia. Aspects of the invention disclosed herein provide methods and compositions that are useful for upregulating a hemoglobin gene, e.g., HBB, HBD, HBEl, HBGl, and/or HBG2, for the treatment and/or prevention of sickle cell anemia or

thalassemia. In certain aspects, the invention provides methods for upregulating HBEl,

HBGl, and/or HBG2 for treatment and/or prevention of sickle cell anemia. In other aspects, the invention provides methods for upregulating HBB for treatment and/or prevention of beta-thalassemia. In certain aspects, the invention provides methods for upregulating HBD for treatment and/or prevention of delta-thalassemia.

Further aspects of the invention provide methods for selecting oligonucleotides for activating or enhancing expression of a hemoglobin gene, e.g., HBB, HBD, HBEl, HBGl or HBG2. In some embodiments, methods are provided for selecting a set of oligonucleotides that is enriched in candidates (e.g. , compared with a random selection of oligonucleotides) for activating or enhancing expression of a hemoglobin gene, e.g., HBB, HBD, HBEl, HBGl or HBG2. Accordingly, the methods may be used to establish sets of clinical candidates that are enriched in oligonucleotides that activate or enhance expression of a hemoglobin gene, e.g., HBB, HBD, HBEl, HBGl or HBG2. Such libraries may be utilized, for example, to identify lead oligonucleotides for developing therapeutics to treat a disease associated with decreased levels of a hemoglobin gene, e.g., HBB, HBD, HBEl, HBGl or HBG2. Furthermore, in some embodiments, oligonucleotide chemistries are provided that are useful for controlling the pharmacokinetics, biodistribution, bioavailability and/or efficacy of the single stranded oligonucleotides for activating expression of a hemoglobin gene, e.g., HBB, HBD, HBEl, HBGl or HBG2.

According to some aspects of the invention single stranded oligonucleotides are provided that have a region of complementarity that is complementarty with (e.g. , at least 8 consecutive nucleotides of ) a PRC2-associated region of a hemoglobin gene, e.g., a PRC2- associated region of the nucleotide sequence set forth as any one of SEQ ID NOS: 1- 10. In some embodiments, the oligonucleotide has at least one of the following features: a) a sequence that is 5'X-Y-Z, in which X is any nucleotide and in which X is at the 5' end of the oligonucleotide, Y is a nucleotide sequence of 6 nucleotides in length that is not a human seed sequence of a microRNA, and Z is a nucleotide sequence of 1 to 23 nucleotides in length; b) a sequence that does not comprise three or more consecutive guanosine

nucleotides; c) a sequence that has less than a threshold level of sequence identity with every sequence of nucleotides, of equivalent length to the second nucleotide sequence, that are between 50 kilobases upstream of a 5 '-end of an off-target gene and 50 kilobases downstream of a 3 '-end of the off-target gene; d) a sequence that is complementary to a PRC2-associated region that encodes an RNA that forms a secondary structure comprising at least two single stranded loops; and e) a sequence that has greater than 60% G-C content. In some

embodiments, the single stranded oligonucleotide has at least two of features a), b), c), d), and e), each independently selected. In some embodiments, the single stranded

oligonucleotide has at least three of features a), b), c), d), and e), each independently selected. In some embodiments, the single stranded oligonucleotide has at least four of features a), b), c), d), and e), each independently selected. In some embodiments, the single stranded oligonucleotide has each of features a), b), c), d), and e). In certain embodiments, the oligonucleotide has the sequence 5'X-Y-Z, in which the oligonucleotide is 8-50 nucleotides in length.

According to some aspects of the invention, single stranded oligonucleotides are provided that have a sequence X-Y-Z, in which X is any nucleotide, Y is a nucleotide sequence of 6 nucleotides in length that is not a seed sequence of a human microRNA, and Z is a nucleotide sequence of 1 to 23 nucleotides in length, in which the single stranded oligonucleotide is complementary with a PRC2- associated region of a hemoglobin gene, e.g., a PRC2-associated region of the nucleotide sequence set forth as any one of SEQ ID NOS: 1- 10. In some aspects of the invention, single stranded oligonucleotides are provided that have a sequence 5' -X-Y-Z, in which X is any nucleotide, Y is a nucleotide sequence of 6 nucleotides in length that is not a seed sequence of a human microRNA, and Z is a nucleotide sequence of 1 to 23 nucleotides in length, in which the single stranded

oligonucleotide is complementary with at least 8 consecutive nucleotides of a PRC2- associated region of a hemoglobin gene, e.g., a PRC2-associated region of the nucleotide sequence set forth as any one of SEQ ID NOS: 1-10. In some embodiments, Y is a sequence selected from Table 1. In some embodiments, the PRC2-associated region is a sequence listed in SEQ ID NO: 17 or 18. In some embodiments, the single stranded oligonucleotide comprises a nucleotide sequence as set forth in any one of SEQ ID NOS: 19 to 3788 or 3797-3916, or a fragment thereof that is at least 8 nucleotides. In some embodiments, the single stranded

oligonucleotide comprises a nucleotide sequence as set forth in any one of SEQ ID NOS: 19 to 3788 or 3797-3916, in which the 5' end of the nucleotide sequence provided is the 5' end of the oligonucleotide. In some embodiments, the region of complementarity (e.g. , the at least 8 consecutive nucleotides) is also present within the nucleotide sequence set forth as any one of SEQ ID NOS: 11- 16.

In some embodiments, the single stranded oligonucleotide comprises a nucleotide sequence as set forth in any one of SEQ ID NOS: 19 to 3788 or 3797-3916. In some embodiments, the oligonucleotide is up to 50 nucleotides in length. In some embodiments, the single stranded oligonucleotide comprises a fragment of at least 8 nucleotides of a nucleotide sequence as set forth in any one of SEQ ID NOS: 19 to 3788or 3797-3916.

In some embodiments, the at least 8 consecutive nucleotides are also present within the nucleotide sequence set forth as SEQ ID NO: 1 1, 13, or 15. In some embodiments, the PRC2-associated region is a sequence listed in SEQ ID NO: 17 or 18. In some embodiments, the single stranded oligonucleotide comprises a nucleotide sequence as set forth in any one of SEQ ID NOS: 19-3788, 3820-3821, 3825-3826, or 3850-3916 or a fragment thereof that is at least 8 nucleotides. In some embodiments, the at least 8 consecutive nucleotides are present within the nucleotide sequence set forth as SEQ ID NO: 12, 14, or 16. In some embodiments, the single stranded oligonucleotide comprises a nucleotide sequence as set forth in any one of SEQ ID NOS: 3797-3819, 3822-3824, or 3827-3849 or a fragment thereof that is at least 8

nucleotides.

In some embodiments, a single stranded oligonucleotide comprises a nucleotide sequence as set forth in Table 3. In some embodiments, the single stranded oligonucleotide comprises a fragment of at least 8 nucleotides of a nucleotide sequence as set forth in Table 3. In some embodiments, a single stranded oligonucleotide consists of a nucleotide sequence as set forth in Table 3. In some embodiments, the single stranded oligonucleotide does not comprise three or more consecutive guanosine nucleotides. In some embodiments, the single stranded oligonucleotide does not comprise four or more consecutive guanosine nucleotides.

In some embodiments, the single stranded oligonucleotide is 8 to 30 nucleotides in length. In some embodiments, the single stranded oligonucleotide is up to 50 nucleotides in length. In some embodiments, the single stranded oligonucleotide is 8 to 10 nucleotides in length and all but 1, 2, or 3 of the nucleotides of the complementary sequence of the PRC2- associated region are cytosine or guanosine nucleotides.

In some embodiments, the single stranded oligonucleotide is complementary with at least 8 consecutive nucleotides of a PRC2-associated region of a hemoglobin gene, e.g., a PRC2 region of a nucleotide sequence set forth as any one of SEQ ID NOS: 1-10, in which the nucleotide sequence of the single stranded oligonucleotide comprises one or more of a nucleotide sequence selected from the group consisting of

(a) (X)Xxxxxx, (X)xXxxxx, (X)xxXxxx, (X)xxxXxx, (X)xxxxXx and (X)xxxxxX, (b) (X)XXxxxx, (X)XxXxxx, (X)XxxXxx, (X)XxxxXx, (X)XxxxxX, (X)xXXxxx,

(X)xXxXxx, (X)xXxxXx, (X)xXxxxX, (X)xxXXxx, (X)xxXxXx, (X)xxXxxX, (X)xxxXXx, (X)xxxXxX and (X)xxxxXX,

(c) (X)XXXxxx, (X)xXXXxx, (X)xxXXXx, (X)xxxXXX, (X)XXxXxx, (X)XXxxXx, (X)XXxxxX, (X)xXXxXx, (X)xXXxxX, (X)xxXXxX, (X)XxXXxx, (X)XxxXXx

(X)XxxxXX, (X)xXxXXx, (X)xXxxXX, (X)xxXxXX, (X)xXxXxX and (X)XxXxXx,

(d) (X)xxXXX, (X)xXxXXX, (X)xXXxXX, (X)xXXXxX, (X)xXXXXx,

(X)XxxXXXX, (X)XxXxXX, (X)XxXXxX, (X)XxXXx, (X)XXxxXX, (X)XXxXxX, (X)XXxXXx, (X)XXXxxX, (X)XXXxXx, and (X)XXXXxx,

(e) (X)xXXXXX, (X)XxXXXX, (X)XXxXXX, (X)XXXxXX, (X)XXXXxX and (X)XXXXXx, and

(f) XXXXXX, XxXXXXX, XXxXXXX, XXXxXXX, XXXXxXX, XXXXXxX and XXXXXXx, wherein "X" denotes a nucleotide analogue, (X) denotes an optional nucleotide analogue, and "x" denotes a DNA or RNA nucleotide unit.

In some embodiments, at least one nucleotide of the oligonucleotide is a nucleotide analogue. In some embodiments, the at least one nucleotide analogue results in an increase in Tm of the oligonucleotide in a range of 1 to 5 °C compared with an oligonucleotide that does not have the at least one nucleotide analogue.

In some embodiments, at least one nucleotide of the oligonucleotide comprises a 2' O-methyl. In some embodiments, each nucleotide of the oligonucleotide comprises a 2' O- methyl. In some embodiments, the oligonucleotide comprises at least one ribonucleotide, at least one deoxyribonucleotide, or at least one bridged nucleotide. In some embodiments, the bridged nucleotide is a LNA nucleotide, a cEt nucleotide or a ENA modified nucleotide. In some embodiments, each nucleotide of the oligonucleotide is a LNA nucleotide.

In some embodiments, the nucleotides of the oligonucleotide comprise alternating deoxyribonucleotides and 2'-fluoro-deoxyribonucleotides. In some embodiments, the nucleotides of the oligonucleotide comprise alternating deoxyribonucleotides and 2'-0- methyl nucleotides. In some embodiments, the nucleotides of the oligonucleotide comprise alternating deoxyribonucleotides and ENA nucleotide analogues. In some embodiments, the nucleotides of the oligonucleotide comprise alternating deoxyribonucleotides and LNA nucleotides. In some embodiments, the 5' nucleotide of the oligonucleotide is a

deoxyribonucleotide. In some embodiments, the nucleotides of the oligonucleotide comprise alternating LNA nucleotides and 2' -O-methyl nucleotides. In some embodiments, the 5' nucleotide of the oligonucleotide is a LNA nucleotide. In some embodiments, the nucleotides of the oligonucleotide comprise deoxyribonucleotides flanked by at least one LNA nucleotide on each of the 5' and 3' ends of the deoxyribonucleotides.

In some embodiments, the single stranded oligonucleotide comprises modified internucleotide linkages (e.g. , phosphorothioate internucleotide linkages or other linkages) between at least two, at least three, at least four, at least five or more nucleotides. In some embodiments, the single stranded oligonucleotide comprises modified internucleotide linkages (e.g. , phosphorothioate internucleotide linkages or other linkages) between between all nucleotides.

In some embodiments, the nucleotide at the 3' position of the oligonucleotide has a 3' hydroxyl group. In some embodiments, the nucleotide at the 3' position of the

oligonucleotide has a 3' thiophosphate. In some embodiments, the single stranded oligonucleotide has a biotin moiety or other moiety conjugated to its 5' or 3' nucleotide. In some embodiments, the single stranded oligonucleotide has cholesterol, Vitamin A, folate, sigma receptor ligands, aptamers, peptides, such as CPP, hydrophobic molecules, such as lipids, ASGPR or dynamic polyconjugates and variants thereof at its 5' or 3' end.

According to some aspects of the invention compositions are provided that comprise any of the oligonucleotides disclosed herein, and a carrier. In some embodiments, compositions are provided that comprise any of the oligonucleotides in a buffered solution. In some embodiments, the oligonucleotide is conjugated to the carrier. In some embodiments, the carrier is a peptide. In some embodiments, the carrier is a steroid. According to some aspects of the invention pharmaceutical compositions are provided that comprise any of the oligonucleotides disclosed herein, and a pharmaceutically acceptable carrier.

According to other aspects of the invention, kits are provided that comprise a container housing any of the compositions disclosed herein.

According to some aspects of the invention, methods of increasing expression of HBB, HBD, HBEl, HBGl or HBG2 in a cell are provided. In some embodiments, the methods involve delivering any one or more of the single stranded oligonucleotides disclosed herein into the cell. In some embodiments, delivery of the single stranded oligonucleotide into the cell results in a level of expression of HBB, HBD, HBEl, HBGl or HBG2 that is greater (e.g. , at least 50% greater) than a level of expression of HBB, HBD, HBEl, HBGl or HBG2 in a control cell that does not comprise the single stranded oligonucleotide.

According to some aspects of the invention, methods of increasing levels of HBB, HBD, HBEl, HBGl or HBG2 in a subject are provided. According to some aspects of the invention, methods of treating a condition (e.g. , anemia, sickle cell anemia, thalassemia) associated with decreased levels of HBB, HBD, HBEl, HBGl or HBG2 in a subject are provided. In some embodiments, the methods involve administering any one or more of the single stranded oligonucleotides disclosed herein to the subject.

BRIEF DESCRIPTION OF TABLES

Table 1: Hexamers that are not seed sequences of human miRNAs.

Table 2: A listing of oligonucleotide modifications.

Table 3: Formatted oligonucleotide sequences made for testing showing nucleotide modifications. The table shows the sequence of the modified nucleotides, where InaX represents an LNA nucleotide with 3' phosphorothioate linkage, omeX is a 2'-0-methyl nucleotide, dX is a deoxy nucleotide. An s at the end of a nucleotide code indicates that the nucleotide had a 3' phosphorothioate linkage. The "-Sup" at the end of the sequence marks the fact that the 3' end lacks either a phosphate or thiophosphate on the 3' linkage. The Formatted Sequence column shows the sequence of the oligonucleotide, including modified nucleotides.

DETAILED DESCRIPTION OF CERTAIN EMBODIMENTS OF THE INVENTION

Aspects of the invention provided herein relate to the discovery of polycomb repressive complex 2 (PRC2)-interacting RNAs. Polycomb repressive complex 2 (PRC2) is a histone methyltransferase and a known epigenetic regulator involved in silencing of genomic regions through methylation of histone H3. Among other functions, PRC2 interacts with long noncoding RNAs (IncRNAs), such as Rep A, Xist, and Tsix, to catalyze

trimethylation of histone H3-lysine27. PRC2 contains four subunits, Eed, Suzl2, RbAp48, and Ezh2. Aspects of the invention relate to the recognition that single stranded

oligonucleotides that bind to PRC2-associated regions of RNAs (e.g. , IncRNAs) that are expressed from within a genomic region that encompasses or that is in functional proximity to the HBB, HBD, HBEl, HBGl or HBG2 gene can induce or enhance expression of HBB, HBD, HBEl, HBGl or HBG2. In some embodiments, this upregulation is believed to result from inhibition of PRC2 mediated repression of HBB, HBD, HBEl, HBGl or HBG2.

As used herein, the term "PRC2-associated region" refers to a region of a nucleic acid that comprises or encodes a sequence of nucleotides that interact directly or indirectly with a component of PRC2. A PRC2-associated region may be present in a RNA (e.g. , a long non- coding RNA (IncRNA)) that that interacts with a PRC2. A PRC2-associated region may be present in a DNA that encodes an RNA that interacts with PRC2. In some cases, the PRC2- associated region is equivalently referred to as a PRC2-interacting region.

In some embodiments, a PRC2-associated region is a region of an RNA that crosslinks to a component of PRC2 in response to in situ ultraviolet irradiation of a cell that expresses the RNA, or a region of genomic DNA that encodes that RNA region. In some embodiments, a PRC2-associated region is a region of an RNA that immunoprecipitates with an antibody that targets a component of PRC2, or a region of genomic DNA that encodes that RNA region. In some embodiments, a PRC2-associated region is a region of an RNA that immunoprecipitates with an antibody that binds specifically to SUZ12, EED, EZH2 or RBBP4 (which as noted above are components of PRC2), or a region of genomic DNA that encodes that RNA region.

In some embodiments, a PRC2-associated region is a region of an RNA that is protected from nucleases (e.g. , RNases) in an RNA-immunoprecipitation assay that employs an antibody that targets a component of PRC2, or a region of genomic DNA that encodes that protected RNA region. In some embodiments, a PRC2-associated region is a region of an RNA that is protected from nucleases (e.g. , RNases) in an RNA-immunoprecipitation assay that employs an antibody that targets SUZ12, EED, EZH2 or RBBP4, or a region of genomic DNA that encodes that protected RNA region.

In some embodiments, a PRC2-associated region is a region of an RNA within which occur a relatively high frequency of sequence reads in a sequencing reaction of products of an RNA-immunoprecipitation assay that employs an antibody that targets a component of PRC2, or a region of genomic DNA that encodes that RNA region. In some embodiments, a PRC2- associated region is a region of an RNA within which occur a relatively high frequency of sequence reads in a sequencing reaction of products of an RNA-immunoprecipitation assay that employs an antibody that binds specifically to SUZ12, EED, EZH2 or RBBP4, or a region of genomic DNA that encodes that protected RNA region. In such embodiments, the PRC2-associated region may be referred to as a "peak."

In some embodiments, a PRC2-associated region comprises a sequence of 40 to 60 nucleotides that interact with PRC2 complex. In some embodiments, a PRC2-associated region comprises a sequence of 40 to 60 nucleotides that encode an RNA that interacts with PRC2. In some embodiments, a PRC2-associated region comprises a sequence of up to 5kb in length that comprises a sequence (e.g. , of 40 to 60 nucleotides) that interacts with

PRC2. In some embodiments, a PRC2-associated region comprises a sequence of up to 5kb in length within which an RNA is encoded that has a sequence (e.g. , of 40 to 60 nucleotides) that is known to interact with PRC2. In some embodiments, a PRC2-associated region comprises a sequence of about 4kb in length that comprise a sequence (e.g. , of 40 to 60 nucleotides) that interacts with PRC2. In some embodiments, a PRC2-associated region comprises a sequence of about 4kb in length within which an RNA is encoded that includes a sequence (e.g., of 40 to 60 nucleotides) that is known to interact with PRC2. In some embodiments, a PRC2-associated region has a sequence as set forth in SEQ ID NO: 17 or 18.

In some embodiments, single stranded oligonucleotides are provided that specifically bind to, or are complementary to, a PRC2-associated region in a genomic region that encompasses or that is in proximity to the HBB, HBD, HBE1, HBG1 or HBG2 gene. In some embodiments, single stranded oligonucleotides are provided that specifically bind to, or are complementary to, a PRC2-associated region that has a sequence as set forth in SEQ ID NO: 17 or 18. In some embodiments, single stranded oligonucleotides are provided that specifically bind to, or are complementary to, a PRC2-associated region that has a sequence as set forth in SEQ ID NO: 17 or 18 combined with up to 2kb, up to 5kb, or up to lOkb of flanking sequences from a corresponding genomic region to which these SEQ IDs map (e.g., in a human genome). In some embodiments, single stranded oligonucleotides have a sequence as set forth in any one of SEQ ID NOS: 19 to 3788 or 3797-3916. In some embodiments, a single stranded oligonucleotide has a sequence as set forth in Table 3.

Without being bound by a theory of invention, these oligonucleotides are able to interfere with the binding of and function of PRC2, by preventing recruitment of PRC2 to a specific chromosomal locus. For example, a single administration of single stranded oligonucleotides designed to specifically bind a PRC2-associated region IncRNA can stably displace not only the IncRNA, but also the PRC2 that binds to the IncRNA, from binding chromatin. After displacement, the full complement of PRC2 is not recovered for up to 24 hours. Further, IncRNA can recruit PRC2 in a cis fashion, repressing gene expression at or near the specific chromosomal locus from which the IncRNA was transcribed.

Methods of modulating gene expression are provided, in some embodiments, that may be carried out in vitro, ex vivo, or in vivo. It is understood that any reference to uses of compounds throughout the description contemplates use of the compound in preparation of a pharmaceutical composition or medicament for use in the treatment of condition (e.g., anemia, sickle cell anemia, thalassemia) associated with decreased levels or activity of HBB, HBD, HBE1, HBG1 or HBG2. Thus, as one nonlimiting example, this aspect of the invention includes use of such single stranded oligonucleotides in the preparation of a medicament for use in the treatment of disease, wherein the treatment involves upregulating expression of HBB, HBD, HBE1, HBG1 or HBG2. In further aspects of the invention, methods are provided for selecting a candidate oligonucleotide for activating expression of HBB, HBD, HBEl, HBGl or HBG2. The methods generally involve selecting as a candidate oligonucleotide, a single stranded oligonucleotide comprising a nucleotide sequence that is complementary to a PRC2- associated region (e.g., a nucleotide sequence as set forth in SEQ ID NO: 17 or 18). In some embodiments, sets of oligonucleotides may be selected that are enriched (e.g., compared with a random selection of oligonucleotides) in oligonucleotides that activate expression of HBB, HBD, HBEl, HBGl or HBG2. Single Stranded Oligonucleotides for Modulating Expression of HBB, HBD, HBEl, HBGl or HBG2

In one aspect of the invention, single stranded oligonucleotides complementary to the PRC2-associated regions are provided for modulating expression of HBB, HBD, HBEl, HBGl or HBG2 in a cell. In some embodiments, expression of HBB, HBD, HBEl, HBGl or HBG2 is upregulated or increased. In some embodiments, single stranded oligonucleotides complementary to these PRC2-associated regions inhibit the interaction of PRC2 with long RNA transcripts such that gene expression is upregulated or increased. In some

embodiments, single stranded oligonucleotides complementary to these PRC2-associated regions inhibit the interaction of PRC2 with long RNA transcripts, resulting in reduced methylation of histone H3 and reduced gene inactivation, such that gene expression is upregulated or increased. In some embodiments, this interaction may be disrupted or inhibited due to a change in the structure of the long RNA that prevents or reduces binding to PRC2. The oligonucleotide may be selected using any of the methods disclosed herein for selecting a candidate oligonucleotide for activating expression of HBB, HBD, HBEl, HBGl or HBG2.

The single stranded oligonucleotide may comprise a region of complementarity that is complementary with a PRC2-associated region of a nucleotide sequence set forth in any one of SEQ ID NOS: 1 to 16. The region of complementarity of the single stranded

oligonucleotide may be complementary with at least 6, e.g., at least 7, at least 8, at least 9, at least 10, at least 15 or more consecutive nucleotides of the PRC2-associated region. The PRC2-associated region may map to a position in a chromosome between 50 kilobases upstream of a 5'-end of the HBB, HBD, HBEl, HBG1 or HBG2 gene and 50 kilobases downstream of a 3'-end of the HBB, HBD, HBEl, HBG1 or HBG2 gene. The PRC2-associated region may map to a position in a chromosome between 25 kilobases upstream of a 5'-end of the HBB, HBD, HBEl, HBG1 or HBG2 gene and 25 kilobases downstream of a 3'-end of the HBB, HBD, HBEl, HBG1 or HBG2 gene. The PRC2- associated region may map to a position in a chromosome between 12 kilobases upstream of a 5'-end of the HBB, HBD, HBEl, HBG1 or HBG2 gene and 12 kilobases downstream of a 3'-end of the HBB, HBD, HBEl, HBG1 or HBG2 gene. The PRC2-associated region may map to a position in a chromosome between 5 kilobases upstream of a 5'-end of the HBB, HBD, HBEl, HBG1 or HBG2 gene and 5 kilobases downstream of a 3'-end of the HBB, HBD, HBEl, HBG1 or HBG2 gene.

The genomic position of the selected PRC2-associated region relative to the HBB, HBD, HBEl, HBG1 or HBG2 gene may vary. For example, the PRC2-associated region may be upstream of the 5' end of the HBB, HBD, HBEl, HBG1 or HBG2 gene. The PRC2- associated region may be downstream of the 3' end of the HBB, HBD, HBEl, HBG1 or HBG2 gene. The PRC2- associated region may be within an intron of the HBB, HBD, HBEl, HBG1 or HBG2 gene. The PRC2-associated region may be within an exon of the HBB, HBD, HBEl, HBG1 or HBG2 gene. The PRC2- associated region may traverse an intron- exon junction, a 5'-UTR-exon junction or a 3'-UTR-exon junction of the HBB, HBD, HBEl, HBG1 or HBG2 gene.

The single stranded oligonucleotide may comprise a sequence having the formula X- Y-Z, in which X is any nucleotide, Y is a nucleotide sequence of 6 nucleotides in length that is not a human seed sequence of a microRNA, and Z is a nucleotide sequence of varying length. In some embodiments X is the 5' nucleotide of the oligonucleotide. In some embodiments, when X is anchored at the 5' end of the oligonucleotide, the oligonucleotide does not have any nucleotides or nucleotide analogs linked 5' to X. In some embodiments, other compounds such as peptides or sterols may be linked at the 5' end in this embodiment as long as they are not nucleotides or nucleotide analogs. In some embodiments, the single stranded oligonucleotide has a sequence 5'X-Y-Z and is 8-50 nucleotides in length.

Oligonucleotides that have these sequence characteristics are predicted to avoid the miRNA pathway. Therefore, in some embodiments, oligonucleotides having these sequence characteristics are unlikely to have an unintended consequence of functioning in a cell as a miRNA molecule. The Y sequence may be a nucleotide sequence of 6 nucleotides in length set forth in Table 1.

The single stranded oligonucleotide may have a sequence that does not contain guanosine nucleotide stretches (e.g. , 3 or more, 4 or more, 5 or more, 6 or more consecutive guanosine nucleotides). In some embodiments, oligonucleotides having guanosine nucleotide stretches have increased non-specific binding and/or off-target effects, compared with oligonucleotides that do not have guanosine nucleotide stretches.

The single stranded oligonucleotide may have a sequence that has less than a threshold level of sequence identity with every sequence of nucleotides, of equivalent length, that map to a genomic position encompassing or in proximity to an off-target gene. For example, an oligonucleotide may be designed to ensure that it does not have a sequence that maps to genomic positions encompassing or in proximity with all known genes (e.g. , all known protein coding genes) other than HBB, HBD, HBE1, HBG1 or HBG2. In a similar embodiment, an oligonucleotide may be designed to ensure that it does not have a sequence that maps to any other known PRC2-associated region, particularly PRC2-associated regions that are functionally related to any other known gene (e.g. , any other known protein coding gene). In either case, the oligonucleotide is expected to have a reduced likelihood of having off-target effects. The threshold level of sequence identity may be 50%, 60%, 70%, 80%, 85%, 90%, 95%, 99% or 100% sequence identity.

The single stranded oligonucleotide may have a sequence that is complementary to a PRC2-associated region that encodes an RNA that forms a secondary structure comprising at least two single stranded loops. In has been discovered that, in some embodiments, oligonucleotides that are complementary to a PRC2-associated region that encodes an RNA that forms a secondary structure comprising one or more single stranded loops (e.g. , at least two single stranded loops) have a greater likelihood of being active (e.g. , of being capable of activating or enhancing expression of a target gene) than a randomly selected

oligonucleotide. In some cases, the secondary structure may comprise a double stranded stem between the at least two single stranded loops. Accordingly, the region of

complementarity between the oligonucleotide and the PRC2-associated region may be at a location of the PRC2 associated region that encodes at least a portion of at least one of the loops. In some cases, the region of complementarity between the oligonucleotide and the PRC2-associated region may be at a location of the PRC2-associated region that encodes at least a portion of at least two of the loops. In some cases, the region of complementarity between the oligonucleotide and the PRC2-associated region may be at a location of the

PRC2 associated region that encodes at least a portion of the double stranded stem. In some embodiments, a PRC2-associated region (e.g. , of an IncRNA) is identified (e.g. , using RIP- Seq methodology or information derived therefrom). In some embodiments, the predicted secondary structure RNA (e.g. , IncRNA) containing the PRC2-associated region is determined using RNA secondary structure prediction algorithms, e.g. , RNAfold, mfold. In some embodiments, oligonucleotides are designed to target a region of the RNA that forms a secondary structure comprising one or more single stranded loop (e.g. , at least two single stranded loops) structures which may comprise a double stranded stem between the at least two single stranded loops.

The single stranded oligonucleotide may have a sequence that is has greater than 30%

G-C content, greater than 40% G-C content, greater than 50% G-C content, greater than 60% G-C content, greater than 70% G-C content, or greater than 80% G-C content. The single stranded oligonucleotide may have a sequence that has up to 100% G-C content, up to 95% G-C content, up to 90% G-C content, or up to 80% G-C content. In some embodiments in which the oligonucleotide is 8 to 10 nucleotides in length, all but 1, 2, 3, 4, or 5 of the nucleotides of the complementary sequence of the PRC2-associated region are cytosine or guanosine nucleotides. In some embodiments, the sequence of the PRC2- associated region to which the single stranded oligonucleotide is complementary comprises no more than 3 nucleotides selected from adenine and uracil.

The single stranded oligonucleotide may be complementary to a chromosome of a different species (e.g. , a mouse, rat, rabbit, goat, monkey, etc.) at a position that encompasses or that is in proximity to that species' homolog of HBB, HBD, HBE1, HBG1 or HBG2. The single stranded oligonucleotide may be complementary to a human genomic region encompassing or in proximity to the HBB, HBD, HBE1, HBG1 or HBG2 gene and also be complementary to a mouse genomic region encompassing or in proximity to the mouse homolog of HBB, HBD, HBE1, HBG1 or HBG2. For example, the single stranded oligonucleotide may be complementary to a sequence as set forth as any one of SEQ ID NOS: 1-10, which is a human genomic region encompassing or in proximity to the HBB, HBD, HBE1, HBG1 or HBG2 gene, and also be complementary to a sequence as set forth in any one of SEQ ID NOS: 1-10, which is a mouse genomic region encompassing or in proximity to the mouse homolog of the HBB, HBD, HBE1, HBG1 or HBG2 gene.

Oligonucleotides having these characteristics may be tested in vivo or in vitro for efficacy in multiple species (e.g., human and mouse). This approach also facilitates development of clinical candidates for treating human disease by selecting a species in which an appropriate animal exists for the disease.

In some embodiments, the region of complementarity of the single stranded oligonucleotide is complementary with at least 8 to 15, 8 to 30, 8 to 40, or 10 to 50, or 5 to 50, or 5 to 40 bases, e.g., 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 consecutive nucleotides of a PRC2-associated region. In some embodiments, the region of complementarity is complementary with at least 8 consecutive nucleotides of a PRC2-associated region. In some embodiments the sequence of the single stranded oligonucleotide is based on an RNA sequence that binds to PRC2, or a portion thereof, said portion having a length of from 5 to 40 contiguous base pairs, or about 8 to 40 bases, or about 5 to 15, or about 5 to 30, or about 5 to 40 bases, or about 5 to 50 bases.

Complementary, as the term is used in the art, refers to the capacity for precise pairing between two nucleotides. For example, if a nucleotide at a certain position of an

oligonucleotide is capable of hydrogen bonding with a nucleotide at the same position of PRC2-associated region, then the single stranded nucleotide and PRC2-associated region are considered to be complementary to each other at that position. The single stranded nucleotide and PRC2-associated region are complementary to each other when a sufficient number of corresponding positions in each molecule are occupied by nucleotides that can hydrogen bond with each other through their bases. Thus, "complementary" is a term which is used to indicate a sufficient degree of complementarity or precise pairing such that stable and specific binding occurs between the single stranded nucleotide and PRC2-associated region. For example, if a base at one position of a single stranded nucleotide is capable of hydrogen bonding with a base at the corresponding position of a PRC2-associated region, then the bases are considered to be complementary to each other at that position. 100% complementarity is not required.

The single stranded oligonucleotide may be at least 80% complementary to

(optionally one of at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% complementary to) the consecutive nucleotides of a PRC2-associated region. In some embodiments the single stranded oligonucleotide may contain 1, 2 or 3 base mismatches compared to the portion of the consecutive nucleotides of a PRC2-associated region. In some embodiments the single stranded oligonucleotide may have up to 3 mismatches over 15 bases, or up to 2 mismatches over 10 bases.

It is understood in the art that a complementary nucleotide sequence need not be

100% complementary to that of its target to be specifically hybridizable. In some

embodiments, a complementary nucleic acid sequence for purposes of the present disclosure is specifically hybridizable when binding of the sequence to the target molecule (e.g. , IncRNA) interferes with the normal function of the target (e.g. , IncRNA) to cause a loss of activity (e.g. , inhibiting PRC2-associated repression with consequent up-regulation of gene expression) and there is a sufficient degree of complementarity to avoid non-specific binding of the sequence to non-target sequences under conditions in which avoidance of non-specific binding is desired, e.g. , under physiological conditions in the case of in vivo assays or therapeutic treatment, and in the case of in vitro assays, under conditions in which the assays are performed under suitable conditions of stringency.

In some embodiments, the single stranded oligonucleotide is 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50 or more nucleotides in length. In a preferred embodiment, the oligonucleotide is 8 to 30 nucleotides in length.

In some embodiments, the PRC2-associated region occurs on the same DNA strand as a gene sequence (sense). In some embodiments, the PRC2-associated region occurs on the opposite DNA strand as a gene sequence (anti-sense). Oligonucleotides complementary to a PRC2-associated region can bind either sense or anti-sense sequences. Base pairings may include both canonical Watson-Crick base pairing and non-Watson-Crick base pairing (e.g. , Wobble base pairing and Hoogsteen base pairing). It is understood that for complementary base pairings, adenosine-type bases (A) are complementary to thymidine-type bases (T) or uracil-type bases (U), that cytosine-type bases (C) are complementary to guanosine-type bases (G), and that universal bases such as 3-nitropyrrole or 5-nitroindole can hybridize to and are considered complementary to any A, C, U, or T. Inosine (I) has also been considered in the art to be a universal base and is considered complementary to any A, C, U or T.

In some embodiments, any one or more thymidine (T) nucleotides (or modified nucleotide thereof) or uridine (U) nucleotides (or a modified nucleotide thereof) in a sequence provided herein, including a sequence provided in the sequence listing, may be replaced with any other nucleotide suitable for base pairing (e.g., via a Watson-Crick base pair) with an adenosine nucleotide . In some embodiments, any one or more thymidine (T) nucleotides (or modified nucleotide thereof) or uridine (U) nucleotides (or a modified nucleotide thereof) in a sequence provided herein, including a sequence provided in the sequence listing, may be suitably replaced with a different pyrimidine nucleotide or vice versa. In some embodiments, any one or more thymidine (T) nucleotides (or modified nucleotide thereof) in a sequence provided herein, including a sequence provided in the sequence listing, may be suitably replaced with a uridine (U) nucleotide (or a modified nucleotide thereof) or vice versa.

In some embodiments, GC content of the single stranded oligonucleotide is preferably between about 30-60 %. Contiguous runs of three or more Gs or Cs may not be preferable in some embodiments. Accordingly, in some embodiments, the oligonucleotide does not comprise a stretch of three or more guanosine nucleotides.

In some embodiments, the single stranded oligonucleotide specifically binds to, or is complementary to an RNA that is encoded in a genome (e.g., a human genome) as a single contiguous transcript (e.g., a non-spliced RNA). In some embodiments, the single stranded oligonucleotide specifically binds to, or is complementary to an RNA that is encoded in a genome (e.g., a human genome), in which the distance in the genome between the 5 'end of the coding region of the RNA and the 3' end of the coding region of the RNA is less than 1 kb, less than 2 kb, less than 3 kb, less than 4 kb, less than 5 kb, less than 7 kb, less than 8 kb, less than 9 kb, less than 10 kb, or less than 20 kb.

It is to be understood that any oligonucleotide provided herein can be excluded. In some embodiments, a single stranded oligonucleotide is not complementary to SEQ ID NO: 3917. In some embodiments, it has been found that single stranded oligonucleotides disclosed herein may increase expression of mRNA corresponding to the gene by at least about 50% (i.e. 150% of normal or 1.5 fold), or by about 2 fold to about 5 fold. In some embodiments, expression may be increased by at least about 15 fold, 20 fold, 30 fold, 40 fold, 50 fold or 100 fold, or any range between any of the foregoing numbers. It has also been found that increased mRNA expression has been shown to correlate to increased protein expression.

In some or any of the embodiments of the oligonucleotides described herein, or processes for designing or synthesizing them, the oligonucleotides will upregulate gene expression and may specifically bind or specifically hybridize or be complementary to the PRC2 binding RNA that is transcribed from the same strand as a protein coding reference gene. The oligonucleotide may bind to a region of the PRC2 binding RNA that originates within or overlaps an intron, exon, intron exon junction, 5' UTR, 3' UTR, a translation initiation region, or a translation termination region of a protein coding sense strand of a reference gene (refGene).

In some or any of the embodiments of oligonucleotides described herein, or processes for designing or synthesizing them, the oligonucleotides will upregulate gene expression and may specifically bind or specifically hybridize or be complementary to a PRC2 binding RNA that transcribed from the opposite strand (the antisense strand) of a protein coding reference gene. The oligonucleotide may bind to a region of the PRC2 binding RNA that originates within or overlaps an intron, exon, intron exon junction, 5' UTR, 3' UTR, a translation initiation region, or a translation termination region of a protein coding antisense strand of a reference gene

The oligonucleotides described herein may be modified, e.g., comprise a modified sugar moiety, a modified internucleoside linkage, a modified nucleotide and/or combinations thereof. In addition, the oligonucleotides can exhibit one or more of the following properties: do not induce substantial cleavage or degradation of the target RNA; do not cause

substantially complete cleavage or degradation of the target RNA; do not activate the RNAse H pathway; do not activate RISC; do not recruit any Argonaute family protein; are not cleaved by Dicer; do not mediate alternative splicing; are not immune stimulatory; are nuclease resistant; have improved cell uptake compared to unmodified oligonucleotides; are not toxic to cells or mammals; may have improved endosomal exit; do interfere with interaction of IncRNA with PRC2, preferably the Ezh2 subunit but optionally the Suzl2, Eed, RbAp46/48 subunits or accessory factors such as Jarid2; do decrease histone H3 lysine27 methylation and/or do upregulate gene expression.

Oligonucleotides that are designed to interact with RNA to modulate gene expression are a distinct subset of base sequences from those that are designed to bind a DNA target (e.g., are complementary to the underlying genomic DNA sequence from which the RNA is transcribed).

Any of the oligonucleotides disclosed herein may be linked to one or more other oligonucleotides disclosed herein by a linker, e.g., a cleavable linker.

Method for Selecting Candidate Oligonucleotides for Activating Expression of Hemoglobin Genes

Methods are provided herein for selecting a candidate oligonucleotide for activating or enhancing expression of HBB, HBD, HBEl, HBGl or HBG2. The target selection methods may generally involve steps for selecting single stranded oligonucleotides having any of the structural and functional characteristics disclosed herein. Typically, the methods involve one or more steps aimed at identifying oligonucleotides that target a PRC2-associated region that is functionally related to HBB, HBD, HBEl, HBGl or HBG2, for example a PRC2-associated region of a IncRNA that regulates expression of HBB, HBD, HBEl, HBGl or HBG2 by facilitating (e.g., in a -regulatory manner) the recruitment of PRC2 to the HBB, HBD, HBEl, HBGl or HBG2 gene. Such oligonucleotides are expected to be candidates for activating expression of HBB, HBD, HBEl, HBGl or HBG2 because of their ability to hybridize with the PRC2-associated region of a nucleic acid (e.g., a IncRNA). This hybridization event is likely to disrupt interaction of PRC2 with the nucleic acid (e.g., a IncRNA) and as a result disrupt recruitment of PRC2 and its associated co-repressors (e.g., chromatin remodeling factors) to the HBB, HBD, HBEl, HBGl or HBG2 gene locus.

Methods of selecting a candidate oligonucleotide may involve selecting a PRC2- associated region (e.g., a nucleotide sequence as set forth in SEQ ID NO: 17 or 18) that maps to a chromosomal position encompassing or in proximity to the HBB, HBD, HBEl, HBGl or HBG2 gene (e.g., a chromosomal position having a sequence as set forth in any one of SEQ ID NOS: 1 to 16). The PRC2-associated region may map to the strand of the chromosome comprising the sense strand of the HBB, HBD, HBEl, HBGl or HBG2 gene, in which case the candidate oligonucleotide is complementary to the sense strand of the HBB, HBD, HBEl, HBGl or HBG2 gene (i.e. , is antisense to the HBB, HBD, HBEl, HBGl or HBG2 gene). Alternatively, the PRC2-associated region may map to the strand of the first chromosome comprising the antisense strand of the HBB, HBD, HBEl, HBGl or HBG2 gene, in which case the oligonucleotide is complementary to the antisense strand (the template strand) of the HBB, HBD, HBEl, HBGl or HBG2 gene (i.e. , is sense to the HBB, HBD, HBEl, HBGl or HBG2 gene).

Methods for selecting a set of candidate oligonucleotides that is enriched in oligonucleotides that activate expression of HBB, HBD, HBEl, HBGl or HBG2 may involve selecting one or more PRC2-associated regions that map to a chromosomal position that encompasses or that is in proximity to the HBB, HBD, HBEl, HBGl or HBG2 gene and selecting a set of oligonucleotides, in which each oligonucleotide in the set comprises a nucleotide sequence that is complementary with the one or more PRC2-associated regions. As used herein, the phrase, "a set of oligonucleotides that is enriched in oligonucleotides that activate expression of refers to a set of oligonucleotides that has a greater number of oligonucleotides that activate expression of a target gene (e.g. , HBB, HBD, HBEl, HBGl or HBG2) compared with a random selection of oligonucleotides of the same physicochemical properties (e.g. , the same GC content, T m , length etc.) as the enriched set.

Where the design and/or synthesis of a single stranded oligonucleotide involves design and/or synthesis of a sequence that is complementary to a nucleic acid or PRC2- associated region described by such sequence information, the skilled person is readily able to determine the complementary sequence, e.g., through understanding of Watson Crick base pairing rules which form part of the common general knowledge in the field.

In some embodiments design and/or synthesis of a single stranded oligonucleotide involves manufacture of an oligonucleotide from starting materials by techniques known to those of skill in the art, where the synthesis may be based on a sequence of a PRC2- associated region, or portion thereof.

Methods of design and/or synthesis of a single stranded oligonucleotide may involve one or more of the steps of: Identifying and/or selecting PRC2-associated region;

Designing a nucleic acid sequence having a desired degree of sequence identity or complementarity to a PRC2-associated region or a portion thereof;

Synthesizing a single stranded oligonucleotide to the designed sequence;

Purifying the synthesized single stranded oligonucleotide; and

Optionally mixing the synthesized single stranded oligonucleotide with at least one pharmaceutically acceptable diluent, carrier or excipient to form a pharmaceutical

composition or medicament.

Single stranded oligonucleotides so designed and/or synthesized may be useful in method of modulating gene expression as described herein.

Preferably, single stranded oligonucleotides of the invention are synthesized chemically. Oligonucleotides used to practice this invention can be synthesized in vitro by well-known chemical synthesis techniques.

Oligonucleotides of the invention can be stabilized against nucleolytic degradation such as by the incorporation of a modification, e.g., a nucleotide modification. For example, nucleic acid sequences of the invention include a phosphorothioate at least the first, second, or third internucleotide linkage at the 5' or 3' end of the nucleotide sequence. As another example, the nucleic acid sequence can include a 2'-modified nucleotide, e.g., a 2'-deoxy, 2'- deoxy-2'-fluoro, 2'-0-methyl, 2'-0-methoxyethyl (2'-0-MOE), 2'-0-aminopropyl (2'-0-AP), 2'-0-dimethylaminoethyl (2'-0-DMAOE), 2'-0-dimethylaminopropyl (2'-0-DMAP), 2'-0- dimethylaminoethyloxyethyl (2'-0-DMAEOE), or 2'-0-N-methylacetamido (2'-0-NMA). As another example, the nucleic acid sequence can include at least one 2'-0-methyl-modified nucleotide, and in some embodiments, all of the nucleotides include a 2'-0-methyl modification. In some embodiments, the nucleic acids are "locked," i.e., comprise nucleic acid analogues in which the ribose ring is "locked" by a methylene bridge connecting the 2'- O atom and the 4'-C atom.

It is understood that any of the modified chemistries or formats of single stranded oligonucleotides described herein can be combined with each other, and that one, two, three, four, five, or more different types of modifications can be included within the same molecule.

In some embodiments, the method may further comprise the steps of amplifying the synthesized single stranded oligonucleotide, and/or purifying the single stranded oligonucleotide (or amplified single stranded oligonucleotide), and/or sequencing the single stranded oligonucleotide so obtained.

As such, the process of preparing a single stranded oligonucleotide may be a process that is for use in the manufacture of a pharmaceutical composition or medicament for use in the treatment of disease, optionally wherein the treatment involves modulating expression of a gene associated with a PRC2-associated region.

In the methods described above a PRC2-associated region may be, or have been, identified, or obtained, by a method that involves identifying RNA that binds to PRC2.

Such methods may involve the following steps: providing a sample containing nuclear ribonucleic acids, contacting the sample with an agent that binds specifically to PRC2 or a subunit thereof, allowing complexes to form between the agent and protein in the sample, partitioning the complexes, synthesizing nucleic acid that is complementary to nucleic acid present in the complexes.

Where the single stranded oligonucleotide is based on a PRC2-associated region, or a portion of such a sequence, it may be based on information about that sequence, e.g., sequence information available in written or electronic form, which may include sequence information contained in publicly available scientific publications or sequence databases.

Nucleotide Analogues

In some embodiments, the oligonucleotide may comprise at least one ribonucleotide, at least one deoxyribonucleotide, and/or at least one bridged nucleotide. In some

embodiments, the oligonucleotide may comprise a bridged nucleotide, such as a locked nucleic acid (LNA) nucleotide, a constrained ethyl (cEt) nucleotide, or an ethylene bridged nucleic acid (ENA) nucleotide. Examples of such nucleotides are disclosed herein and known in the art. In some embodiments, the oligonucleotide comprises a nucleotide analog disclosed in one of the following United States Patent or Patent Application Publications: US 7,399,845, US 7,741,457, US 8,022,193, US 7,569,686, US 7,335,765, US 7,314,923, US 7,335,765, and US 7,816,333, US 20110009471, the entire contents of each of which are incorporated herein by reference for all purposes. The oligonucleotide may have one or more 2' O-methyl nucleotides. The oligonucleotide may consist entirely of 2' O-methyl nucleotides. Often the single stranded oligonucleotide has one or more nucleotide analogues. For example, the single stranded oligonucleotide may have at least one nucleotide analogue that results in an increase in T m of the oligonucleotide in a range of 1°C, 2 °C, 3°C, 4 °C, or 5°C compared with an oligonucleotide that does not have the at least one nucleotide analogue. The single stranded oligonucleotide may have a plurality of nucleotide analogues that results in a total increase in T m of the oligonucleotide in a range of 2 °C, 3 °C, 4 °C, 5 °C, 6 °C, 7 °C, 8 °C, 9 °C, 10 °C, 15 °C, 20 °C, 25 °C, 30 °C, 35 °C, 40 °C, 45 °C or more compared with an oligonucleotide that does not have the nucleotide analogue.

The oligonucleotide may be of up to 50 nucleotides in length in which 2 to 10, 2 to 15 5 2 to 16, 2 to 17, 2 to 18, 2 to 19, 2 to 20, 2 to 25, 2 to 30, 2 to 40, 2 to 45, or more nucleotides of the oligonucleotide are nucleotide analogues. The oligonucleotide may be of 8 to 30 nucleotides in length in which 2 to 10, 2 to 15 5 2 to 16, 2 to 17, 2 to 18, 2 to 19, 2 to 20, 2 to 25, 2 to 30 nucleotides of the oligonucleotide are nucleotide analogues.

The oligonucleotide may be of 8 to 15 nucleotides in length in which 2 to 4, 2 to 5, 2 to 6, 2 to 7, 2 to 8, 2 to 9, 2 to 10, 2 to 11, 2 to 12, 2 to 13, 2 to 14 nucleotides of the oligonucleotide are nucleotide analogues. Optionally, the oligonucleotides may have every nucleotide except 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides modified.

The oligonucleotide may consist entirely of bridged nucleotides (e.g. , LNA nucleotides, cEt nucleotides, ENA nucleotides). The oligonucleotide may comprise alternating deoxyribonucleotides and 2'-fluoro-deoxyribonucleotides. The oligonucleotide may comprise alternating deoxyribonucleotides and 2'-0-methyl nucleotides. The oligonucleotide may comprise alternating deoxyribonucleotides and ENA nucleotide analogues. The oligonucleotide may comprise alternating deoxyribonucleotides and LNA nucleotides. The oligonucleotide may comprise alternating LNA nucleotides and 2'-0- methyl nucleotides. The oligonucleotide may have a 5' nucleotide that is a bridged nucleotide (e.g. , a LNA nucleotide, cEt nucleotide, ENA nucleotide). The oligonucleotide may have a 5' nucleotide that is a deoxyribonucleotide.

The oligonucleotide may comprise deoxyribonucleotides flanked by at least one bridged nucleotide (e.g. , a LNA nucleotide, cEt nucleotide, ENA nucleotide) on each of the 5' and 3' ends of the deoxyribonucleotides. The oligonucleotide may comprise

deoxyribonucleotides flanked by 1, 2, 3, 4, 5, 6, 7, 8 or more bridged nucleotides (e.g. , LNA nucleotides, cEt nucleotides, ENA nucleotides) on each of the 5' and 3' ends of the deoxyribonucleotides. The 3' position of the oligonucleotide may have a 3' hydroxyl group. The 3' position of the oligonucleotide may have a 3' thiophosphate.

The oligonucleotide may be conjugated with a label. For example, the

oligonucleotide may be conjugated with a biotin moiety, cholesterol, Vitamin A, folate, sigma receptor ligands, aptamers, peptides, such as CPP, hydrophobic molecules, such as lipids, ASGPR or dynamic polyconjugates and variants thereof at its 5' or 3' end.

Preferably the single stranded oligonucleotide comprises one or more modifications comprising: a modified sugar moiety, and/or a modified internucleoside linkage, and/or a modified nucleotide and/or combinations thereof. It is not necessary for all positions in a given oligonucleotide to be uniformly modified, and in fact more than one of the

modifications described herein may be incorporated in a single oligonucleotide or even at within a single nucleoside within an oligonucleotide.

In some embodiments, the single stranded oligonucleotides are chimeric

oligonucleotides that contain two or more chemically distinct regions, each made up of at least one nucleotide. These oligonucleotides typically contain at least one region of modified nucleotides that confers one or more beneficial properties (such as, for example, increased nuclease resistance, increased uptake into cells, increased binding affinity for the target) and a region that is a substrate for enzymes capable of cleaving RNA:DNA or RNA:RNA hybrids. Chimeric single stranded oligonucleotides of the invention may be formed as composite structures of two or more oligonucleotides, modified oligonucleotides,

oligonucleosides and/or oligonucleotide mimetics as described above. Such compounds have also been referred to in the art as hybrids or gapmers. Representative United States patents that teach the preparation of such hybrid structures comprise, but are not limited to, US patent nos. 5,013,830; 5,149,797; 5, 220,007; 5,256,775; 5,366,878; 5,403,711; 5,491,133;

5,565,350; 5,623,065; 5,652,355; 5,652,356; and 5,700,922, each of which is herein incorporated by reference.

In some embodiments, the single stranded oligonucleotide comprises at least one nucleotide modified at the 2' position of the sugar, most preferably a 2'-0-alkyl, 2'-0-alkyl-0- alkyl or 2'-fluoro-modified nucleotide. In other preferred embodiments, RNA modifications include 2'-fluoro, 2'-amino and 2' O-methyl modifications on the ribose of pyrimidines, abasic residues or an inverted base at the 3' end of the RNA. Such modifications are routinely incorporated into oligonucleotides and these oligonucleotides have been shown to have a higher Tm (i.e., higher target binding affinity) than 2'-deoxyoligonucleotides against a given target.

A number of nucleotide and nucleoside modifications have been shown to make the oligonucleotide into which they are incorporated more resistant to nuclease digestion than the native oligodeoxynucleotide; these modified oligos survive intact for a longer time than unmodified oligonucleotides. Specific examples of modified oligonucleotides include those comprising modified backbones, for example, phosphorothioates, phosphotriesters, methyl phosphonates, short chain alkyl or cycloalkyl intersugar linkages or short chain heteroatomic or heterocyclic intersugar linkages. Most preferred are oligonucleotides with

phosphorothioate backbones and those with heteroatom backbones, particularly CH 2 -NH-O- CH 2 , CH,~N(CH 3 )~0~CH 2 (known as a methylene(methylimino) or MMI backbone, CH 2 - O-N (CH 3 )-CH 2 , CH 2 -N (CH 3 )-N (CH 3 )-CH 2 and O-N (CH 3 )- CH 2 -CH 2 backbones, wherein the native phosphodiester backbone is represented as O- P— O- CH,); amide backbones (see De Mesmaeker et al. Ace. Chem. Res. 1995, 28:366-374); morpholino backbone structures (see Summerton and Weller, U.S. Pat. No. 5,034,506); peptide nucleic acid (PNA) backbone (wherein the phosphodiester backbone of the oligonucleotide is replaced with a polyamide backbone, the nucleotides being bound directly or indirectly to the aza nitrogen atoms of the polyamide backbone, see Nielsen et al., Science 1991, 254, 1497). Phosphorus-containing linkages include, but are not limited to, phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, aminoalkylphosphotriesters, methyl and other alkyl phosphonates comprising 3'alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates comprising 3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3 -5' to 5'-3' or 2 -5' to 5'-2'; see US patent nos. 3,687,808; 4,469,863;

4,476,301; 5,023,243; 5, 177,196; 5,188,897; 5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455, 233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563, 253; 5,571,799; 5,587,361; and 5,625,050. Morpholino-based oligomeric compounds are described in Dwaine A. Braasch and David R. Corey, Biochemistry, 2002, 41(14), 4503-4510); Genesis, volume 30, issue 3, 2001 ; Heasman, J., Dev. Biol., 2002, 243, 209-214; Nasevicius et al., Nat. Genet., 2000, 26, 216- 220; Lacerra et al., Proc. Natl. Acad. Sci., 2000, 97, 9591-9596; and U.S. Pat. No. 5,034,506, issued Jul. 23, 1991. In some embodiments, the morpholino-based oligomeric compound is a phosphorodiamidate morpholino oligomer (PMO) (e.g. , as described in Iverson, Curr. Opin. Mol. Ther., 3:235-238, 2001 ; and Wang et al., J. Gene Med., 12:354-364, 2010; the disclosures of which are incorporated herein by reference in their entireties).

Cyclohexenyl nucleic acid oligonucleotide mimetics are described in Wang et al., J. Am. Chem. Soc, 2000, 122, 8595-8602.

Modified oligonucleotide backbones that do not include a phosphorus atom therein have backbones that are formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic internucleoside linkages. These comprise those having morpholino linkages (formed in part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino backbones;

sulfonate and sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH2 component parts; see US patent nos. 5,034,506; 5, 166,315; 5,185,444; 5,214,134; 5,216, 141 ; 5,235,033; 5,264, 562; 5, 264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596, 086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623, 070; 5,663,312; 5,633,360; 5,677,437; and 5,677,439, each of which is herein incorporated by reference.

Modified oligonucleotides are also known that include oligonucleotides that are based on or constructed from arabinonucleotide or modified arabinonucleotide residues.

Arabinonucleosides are stereoisomers of ribonucleosides, differing only in the configuration at the 2'-position of the sugar ring. In some embodiments, a 2'-arabino modification is 2'-F arabino. In some embodiments, the modified oligonucleotide is 2'-fluoro-D-arabinonucleic acid (FANA) (as described in, for example, Lon et al., Biochem., 41 :3457-3467, 2002 and Min et al., Bioorg. Med. Chem. Lett., 12:2651-2654, 2002; the disclosures of which are incorporated herein by reference in their entireties). Similar modifications can also be made at other positions on the sugar, particularly the 3' position of the sugar on a 3' terminal nucleoside or in 2'-5' linked oligonucleotides and the 5' position of 5' terminal nucleotide.

PCT Publication No. WO 99/67378 discloses arabinonucleic acids (ANA) oligomers and their analogues for improved sequence specific inhibition of gene expression via association to complementary messenger RNA.

Other preferred modifications include ethylene-bridged nucleic acids (ENAs) (e.g. , International Patent Publication No. WO 2005/042777, Morita et al., Nucleic Acid Res., Suppl 1 :241-242, 2001 ; Surono et al., Hum. Gene Ther., 15:749-757, 2004; Koizumi, Curr. Opin. Mol. Ther., 8: 144- 149, 2006 and Horie et al., Nucleic Acids Symp. Ser (Oxf), 49: 171- 172, 2005; the disclosures of which are incorporated herein by reference in their entireties). Preferred ENAs include, but are not limited to, 2'-0,4'-C-ethylene -bridged nucleic acids.

Examples of LNAs are described in WO/2008/043753 and include compounds of the following general formula.

where X and Y are independently selected among the groups -0-,

-S-, -N(H)-, N(R)-, -CH 2 - or -CH- (if part of a double bond), -CH 2 -O-, -CH 2 -S-, -CH 2 -N(H)-, -CH 2 -N(R)-, -CH 2 -CH 2 - or -CH 2 -CH- (if part of a double bond),

-CH=CH-, where R is selected from hydrogen and Ci-4-alkyl; Z and Z* are independently selected among an internucleoside linkage, a terminal group or a protecting group; B constitutes a natural or non-natural nucleotide base moiety; and the asymmetric groups may be found in either orientation. Preferably, the LNA used in the oligonucleotides described herein comprises at least one LNA unit according any of the formulas

wherein Y is -0-, -S-, -NH-, or N(R ); Z and Z* are independently selected among an intemucleoside linkage, a terminal group or a protecting group; B constitutes a natural or non-natural nucleotide base moiety, and RH is selected from hydrogen and Ci-4-alkyl.

In some embodiments, the Locked Nucleic Acid (LNA) used in the oligonucleotides described herein comprises at least one Locked Nucleic Acid (LNA) unit according any of the formulas shown in Scheme 2 of PCT/DK2006/000512.

In some embodiments, the LNA used in the oligomer of the invention comprises intemucleoside linkages selected from -0-P(O) 2 -O-, -0-P(0,S)-0-, -0-P(S) 2 -O-, -S-P(0) 2 -0-, -S-P(0,S)-0-, -S-P(S) 2 -0-, -0-P(O) 2 -S-, -0-P(0,S)-S-, -S-P(0) 2 -S-, -0-PO(R H )-0-, o- PO(OCH 3 )-0-, -0-PO(NR H )-0-, -0-PO(OCH 2 CH 2 S-R)-O-, -0-PO(BH 3 )-0-, -0-PO(NHR H )- 0-, -0-P(0) 2 -NR H -, -NR H -P(0) 2 -0-, -NR H -CO-0-, where R H is selected from hydrogen and Ci-4-alkyl.

Specifically preferred LNA units are shown in scheme 2:

fS-D-oxy-LNA

g-D-awiino-tfiA

Scheme 2

The term "thio-LNA" comprises a locked nucleotide in which at least one of X or Y in the general formula above is selected from S or -CH 2 -S-. Thio-LNA can be in both beta-D and alpha-L-configuration.

The term "amino-LNA" comprises a locked nucleotide in which at least one of X or Y in the general formula above is selected from -N(H)-, N(R)-, CH 2 -N(H)-, and -CH 2 -N(R)- where R is selected from hydrogen and Ci-4-alkyl. Amino-LNA can be in both beta-D and alpha-L-configuration.

The term "oxy-LNA" comprises a locked nucleotide in which at least one of X or Y in the general formula above represents -O- or -CH 2 -0-. Oxy-LNA can be in both beta-D and alpha-L-configuration.

The term "ena-LNA" comprises a locked nucleotide in which Y in the general formula above is -CH 2 -0- (where the oxygen atom of -CH 2 -0- is attached to the 2'-position relative to the base B).

LNAs are described in additional detail herein. One or more substituted sugar moieties can also be included, e.g. , one of the following at the 2' position: OH, SH, SCH 3 , F, OCN, OCH 3 OCH 3 , OCH 3 0(CH 2 )n CH 3 , 0(CH 2 )n NH 2 or 0(CH 2 )n CH 3 where n is from 1 to about 10; CI to CIO lower alkyl, alkoxyalkoxy, substituted lower alkyl, alkaryl or aralkyl; CI; Br; CN; CF ; OCF ; 0-, S-, or N-alkyl; 0-, S-, or N-alkenyl; SOCH 3 ; S0 2 CH 3 ; ON0 2 ; N0 2 ; N 3 ; NH2; heterocycloalkyl; heterocyclo alkaryl; aminoalkylamino; polyalkylamino; substituted silyl; an RNA cleaving group; a reporter group; an intercalator; a group for improving the pharmacokinetic properties of an oligonucleotide; or a group for improving the pharmacodynamic properties of an oligonucleotide and other substituents having similar properties. A preferred modification includes 2'-methoxyethoxy [2'-0-CH 2 CH 2 OCH 3 , also known as 2'-0-(2-methoxyethyl)] (Martin et al, Helv. Chim. Acta, 1995, 78, 486). Other preferred modifications include 2'- methoxy (2'-0-CH 3 ), 2'-propoxy (2'-OCH 2 CH 2 CH 3 ) and 2'-fluoro (2'-F). Similar

modifications may also be made at other positions on the oligonucleotide, particularly the 3' position of the sugar on the 3' terminal nucleotide and the 5' position of 5' terminal nucleotide. Oligonucleotides may also have sugar mimetics such as cyclobutyls in place of the pentofuranosyl group.

Single stranded oligonucleotides can also include, additionally or alternatively, nucleobase (often referred to in the art simply as "base") modifications or substitutions. As used herein, "unmodified" or "natural" nucleobases include adenine (A), guanine (G), thymine (T), cytosine (C) and uracil (U). Modified nucleobases include nucleobases found only infrequently or transiently in natural nucleic acids, e.g. , hypoxanthine, 6-methyladenine, 5-Me pyrimidines, particularly 5-methylcytosine (also referred to as 5-methyl-2'

deoxycytosine and often referred to in the art as 5-Me-C), 5-hydroxymethylcytosine (HMC), glycosyl HMC and gentobiosyl HMC, isocytosine, pseudoisocytosine, as well as synthetic nucleobases, e.g. , 2-aminoadenine, 2- (methylamino)adenine, 2-(imidazolylalkyl)adenine, 2- (aminoalklyamino)adenine or other hetero substituted alkyladenines, 2-thiouracil, 2- thiothymine, 5-bromouracil, 5-hydroxymethyluracil, 5-propynyluracil, 8-azaguanine, 7- deazaguanine, N6 (6-aminohexyl)adenine, 6-aminopurine, 2-aminopurine, 2-chloro-6- aminopurine and 2,6-diaminopurine or other diaminopurines. See, e.g. , Kornberg, "DNA Replication," W. H. Freeman & Co., San Francisco, 1980, pp75-77; and Gebeyehu, G., et al. Nucl. Acids Res., 15:4513 (1987)). A "universal" base known in the art, e.g. , inosine, can also be included. 5-Me-C substitutions have been shown to increase nucleic acid duplex stability by 0.6-1.2°C. (Sanghvi, in Crooke, and Lebleu, eds., Antisense Research and Applications, CRC Press, Boca Raton, 1993, pp. 276-278) and may be used as base substitutions.

It is not necessary for all positions in a given oligonucleotide to be uniformly modified, and in fact more than one of the modifications described herein may be

incorporated in a single oligonucleotide or even at within a single nucleoside within an oligonucleotide.

In some embodiments, both a sugar and an internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced with novel groups. The base units are maintained for hybridization with an appropriate nucleic acid target compound. One such oligomeric compound, an oligonucleotide mimetic that has been shown to have excellent hybridization properties, is referred to as a peptide nucleic acid (PNA). In PNA compounds, the sugar- backbone of an oligonucleotide is replaced with an amide containing backbone, for example, an aminoethylglycine backbone. The nucleobases are retained and are bound directly or indirectly to aza nitrogen atoms of the amide portion of the backbone. Representative United States patents that teach the preparation of PNA compounds include, but are not limited to, US patent nos. 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference. Further teaching of PNA compounds can be found in Nielsen et al, Science, 1991, 254, 1497-1500.

Single stranded oligonucleotides can also include one or more nucleobase (often referred to in the art simply as "base") modifications or substitutions. As used herein, "unmodified" or "natural" nucleobases comprise the purine bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U). Modified

nucleobases comprise other synthetic and natural nucleobases such as 5-methylcytosine (5- me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudo-uracil), 4-thiouracil, 8-halo, 8-amino, 8-thiol, 8- thioalkyl, 8-hydroxyl and other 8- substituted adenines and guanines, 5-halo particularly 5- bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylquanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7-deazaguanine and 7-deazaadenine and 3- deazaguanine and 3-deazaadenine.

Further, nucleobases comprise those disclosed in United States Patent No. 3,687,808, those disclosed in "The Concise Encyclopedia of Polymer Science And Engineering", pages 858-859, Kroschwitz, ed. John Wiley & Sons, 1990;, those disclosed by Englisch et al., Angewandle Chemie, International Edition, 1991, 30, page 613, and those disclosed by Sanghvi, Chapter 15, Antisense Research and Applications," pages 289- 302, Crooke, and Lebleu, eds., CRC Press, 1993. Certain of these nucleobases are particularly useful for increasing the binding affinity of the oligomeric compounds of the invention. These include 5-substituted pyrimidines, 6-azapyrimidines and N-2, N-6 and 0-6 substituted purines, comprising 2-aminopropyladenine, 5-propynyluracil and 5- propynylcytosine. 5- methylcytosine substitutions have been shown to increase nucleic acid duplex stability by 0.6- 1.2<0>C (Sanghvi, et al., eds, "Antisense Research and Applications," CRC Press, Boca Raton, 1993, pp. 276-278) and are presently preferred base substitutions, even more particularly when combined with 2'-0-methoxyethyl sugar modifications. Modified nucleobases are described in US patent nos. 3,687,808, as well as 4,845,205; 5,130,302; 5,134,066; 5, 175, 273; 5, 367,066; 5,432,272; 5,457, 187; 5,459,255; 5,484,908; 5,502,177; 5,525,711 ; 5,552,540; 5,587,469; 5,596,091 ; 5,614,617; 5,750,692, and 5,681,941, each of which is herein incorporated by reference.

In some embodiments, the single stranded oligonucleotides are chemically linked to one or more moieties or conjugates that enhance the activity, cellular distribution, or cellular uptake of the oligonucleotide. For example, one or more single stranded oligonucleotides, of the same or different types, can be conjugated to each other; or single stranded

oligonucleotides can be conjugated to targeting moieties with enhanced specificity for a cell type or tissue type. Such moieties include, but are not limited to, lipid moieties such as a cholesterol moiety (Letsinger et al., Proc. Natl. Acad. Sci. USA, 1989, 86, 6553-6556), cholic acid (Manoharan et al., Bioorg. Med. Chem. Let., 1994, 4, 1053-1060), a thioether, e.g. , hexyl-S- tritylthiol (Manoharan et al, Ann. N. Y. Acad. Sci., 1992, 660, 306-309; Manoharan et al., Bioorg. Med. Chem. Let., 1993, 3, 2765-2770), a thiocholesterol (Oberhauser et al., Nucl. Acids Res., 1992, 20, 533-538), an aliphatic chain, e.g. , dodecandiol or undecyl residues (Kabanov et al., FEBS Lett., 1990, 259, 327-330; Svinarchuk et al., Biochimie, 1993, 75, 49- 54), a phospholipid, e.g. , di-hexadecyl-rac-glycerol or triethylammonium 1,2- di-O-hexadecyl- rac-glycero-3-H-phosphonate (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654; Shea et al., Nucl. Acids Res., 1990, 18, 3777-3783), a polyamine or a polyethylene glycol chain (Mancharan et al., Nucleosides & Nucleotides, 1995, 14, 969-973), or adamantane acetic acid (Manoharan et al., Tetrahedron Lett., 1995, 36, 3651-3654), a palmityl moiety (Mishra et al., Biochim. Biophys. Acta, 1995, 1264, 229-237), or an octadecylamine or hexylamino-carbonyl-t oxycholesterol moiety (Crooke et al., J. Pharmacol. Exp. Ther., 1996, 277, 923-937). See also US patent nos. 4,828,979; 4,948,882; 5,218, 105; 5,525,465; 5,541,313; 5,545,730; 5,552, 538; 5,578,717, 5,580,731 ; 5,580,731 ; 5,591,584; 5,109, 124; 5, 118,802; 5,138,045; 5,414,077; 5,486, 603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 4,762, 779; 4,789,737; 4,824,941 ; 4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082, 830; 5,112,963; 5,214,136; 5,082,830; 5, 112,963; 5,214, 136; 5, 245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 5,317,098; 5,371,241, 5,391, 723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 5, 565,552; 5,567,810; 5,574, 142; 5,585,481 ; 5,587,371 ; 5,595,726; 5,597,696; 5,599,923; 5,599, 928 and 5,688,941, each of which is herein incorporated by reference.

These moieties or conjugates can include conjugate groups covalently bound to functional groups such as primary or secondary hydroxyl groups. Conjugate groups of the invention include intercalators, reporter molecules, polyamines, polyamides, polyethylene glycols, polyethers, groups that enhance the pharmacodynamic properties of oligomers, and groups that enhance the pharmacokinetic properties of oligomers. Typical conjugate groups include cholesterols, lipids, phospholipids, biotin, phenazine, folate, phenanthridine, anthraquinone, acridine, fluoresceins, rhodamines, coumarins, and dyes. Groups that enhance the pharmacodynamic properties, in the context of this invention, include groups that improve uptake, enhance resistance to degradation, and/or strengthen sequence- specific hybridization with the target nucleic acid. Groups that enhance the pharmacokinetic properties, in the context of this invention, include groups that improve uptake, distribution, metabolism or excretion of the compounds of the present invention. Representative conjugate groups are disclosed in International Patent Application No. PCT/US92/09196, filed Oct. 23, 1992, and U.S. Pat. No. 6,287,860, which are incorporated herein by reference. Conjugate moieties include, but are not limited to, lipid moieties such as a cholesterol moiety, cholic acid, a thioether, e.g. , hexyl-5-tritylthiol, a thiocholesterol, an aliphatic chain, e.g. , dodecandiol or undecyl residues, a phospholipid, e.g. , di-hexadecyl-rac- glycerol or triethylammonium 1,2- di-O-hexadecyl-rac-glycero-3-H-phosphonate, a polyamine or a polyethylene glycol chain, or adamantane acetic acid, a palmityl moiety, or an octadecylamine or hexylamino-carbonyl-oxy cholesterol moiety. See, e.g. , U.S. Pat. Nos. 4,828,979; 4,948,882; 5,218, 105; 5,525,465; 5,541,313; 5,545,730; 5,552,538; 5,578,717, 5,580,731 ; 5,580,731 ; 5,591,584; 5,109, 124; 5,118,802; 5, 138,045; 5,414,077; 5,486,603; 5,512,439; 5,578,718; 5,608,046; 4,587,044; 4,605,735; 4,667,025; 4,762,779; 4,789,737; 4,824,941 ; 4,835,263; 4,876,335; 4,904,582; 4,958,013; 5,082,830; 5,112,963; 5,214,136; 5,082,830; 5, 112,963; 5,214, 136; 5,245,022; 5,254,469; 5,258,506; 5,262,536; 5,272,250; 5,292,873; 5,317,098; 5,371,241, 5,391,723; 5,416,203, 5,451,463; 5,510,475; 5,512,667; 5,514,785; 5,565,552; 5,567,810; 5,574, 142; 5,585,481 ; 5,587,371 ; 5,595,726; 5,597,696; 5,599,923; 5,599,928 and 5,688,941.

In some embodiments, single stranded oligonucleotide modification include modification of the 5' or 3' end of the oligonucleotide. In some embodiments, the 3' end of the oligonucleotide comprises a hydroxyl group or a thiophosphate. It should be appreciated that additional molecules (e.g. a biotin moiety or a fluorophor) can be conjugated to the 5' or 3' end of the single stranded oligonucleotide. In some embodiments, the single stranded oligonucleotide comprises a biotin moiety conjugated to the 5' nucleotide.

In some embodiments, the single stranded oligonucleotide comprises locked nucleic acids (LNA), ENA modified nucleotides, 2'-0-methyl nucleotides, or 2'-fluoro- deoxyribonucleotides. In some embodiments, the single stranded oligonucleotide comprises alternating deoxyribonucleotides and 2'-fluoro-deoxyribonucleotides. In some embodiments, the single stranded oligonucleotide comprises alternating deoxyribonucleotides and 2'-0- methyl nucleotides. In some embodiments, the single stranded oligonucleotide comprises alternating deoxyribonucleotides and ENA modified nucleotides. In some embodiments, the single stranded oligonucleotide comprises alternating deoxyribonucleotides and locked nucleic acid nucleotides. In some embodiments, the single stranded oligonucleotide comprises alternating locked nucleic acid nucleotides and 2'-0-methyl nucleotides.

In some embodiments, the 5' nucleotide of the oligonucleotide is a

deoxyribonucleotide. In some embodiments, the 5' nucleotide of the oligonucleotide is a locked nucleic acid nucleotide. In some embodiments, the nucleotides of the oligonucleotide comprise deoxyribonucleotides flanked by at least one locked nucleic acid nucleotide on each of the 5' and 3' ends of the deoxyribonucleotides. In some embodiments, the nucleotide at the 3' position of the oligonucleotide has a 3' hydroxyl group or a 3' thiophosphate.

In some embodiments, the single stranded oligonucleotide comprises

phosphorothioate internucleotide linkages. In some embodiments, the single stranded oligonucleotide comprises phosphorothioate internucleotide linkages between at least two nucleotides. In some embodiments, the single stranded oligonucleotide comprises phosphorothioate internucleotide linkages between all nucleotides.

It should be appreciated that the single stranded oligonucleotide can have any combination of modifications as described herein.

The oligonucleotide may comprise a nucleotide sequence having one or more of the following modification patterns.

(a) (X)Xxxxxx, (X)xXxxxx, (X)xxXxxx, (X)xxxXxx, (X)xxxxXx and (X)xxxxxX,

(b) (X)XXxxxx, (X)XxXxxx, (X)XxxXxx, (X)XxxxXx, (X)XxxxxX, (X)xXXxxx, (X)xXxXxx, (X)xXxxXx, (X)xXxxxX, (X)xxXXxx, (X)xxXxXx, (X)xxXxxX, (X)xxxXXx,

(X)xxxXxX and (X)xxxxXX,

(c) (X)XXXxxx, (X)xXXXxx, (X)xxXXXx, (X)xxxXXX, (X)XXxXxx, (X)XXxxXx, (X)XXxxxX, (X)xXXxXx, (X)xXXxxX, (X)xxXXxX, (X)XxXXxx, (X)XxxXXx

(X)XxxxXX, (X)xXxXXx, (X)xXxxXX, (X)xxXxXX, (X)xXxXxX and (X)XxXxXx,

(d) (X)xxXXX, (X)xXxXXX, (X)xXXxXX, (X)xXXXxX, (X)xXXXXx,

(X)XxxXXXX, (X)XxXxXX, (X)XxXXxX, (X)XxXXx, (X)XXxxXX, (X)XXxXxX, (X)XXxXXx, (X)XXXxxX, (X)XXXxXx, and (X)XXXXxx,

(e) (X)xXXXXX, (X)XxXXXX, (X)XXxXXX, (X)XXXxXX, (X)XXXXxX and (X)XXXXXx, and

(f) XXXXXX, XxXXXXX, XXxXXXX, XXXxXXX, XXXXxXX, XXXXXxX and

XXXXXXx, in which "X" denotes a nucleotide analogue, (X) denotes an optional nucleotide analogue, and "x" denotes a DNA or RNA nucleotide unit. Each of the above listed patterns may appear one or more times within an oligonucleotide, alone or in combination with any of the other disclosed modification patterns.

Methods for Modulating Gene Expression In one aspect, the invention relates to methods for modulating gene expression in a cell (e.g. , a cell for which HBB, HBD, HBE1, HBG1 or HBG2 levels are reduced) for research purposes (e.g. , to study the function of the gene in the cell). In another aspect, the invention relates to methods for modulating gene expression in a cell (e.g. , a cell for which HBB, HBD, HBE1, HBG1 or HBG2 levels are reduced) for gene or epigenetic therapy. The cells can be in vitro, ex vivo, or in vivo (e.g. , in a subject who has a disease resulting from reduced expression or activity of HBB, HBD, HBE1, HBG1 or HBG2. In some

embodiments, methods for modulating gene expression in a cell comprise delivering a single stranded oligonucleotide as described herein. In some embodiments, delivery of the single stranded oligonucleotide to the cell results in a level of expression of gene that is at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200% or more greater than a level of expression of gene in a control cell to which the single stranded oligonucleotide has not been delivered. In certain embodiments, delivery of the single stranded oligonucleotide to the cell results in a level of expression of gene that is at least 50% greater than a level of expression of gene in a control cell to which the single stranded oligonucleotide has not been delivered.

In another aspect of the invention, methods comprise administering to a subject (e.g. a human) a composition comprising a single stranded oligonucleotide as described herein to increase protein levels in the subject. In some embodiments, the increase in protein levels is at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 100%, 200%, or more, higher than the amount of a protein in the subject before administering.

As another example, to increase expression of HBB, HBD, HBE1, HBG1 or HBG2 in a cell, the methods include introducing into the cell a single stranded oligonucleotide that is sufficiently complementary to a PRC2-associated region (e.g. , of a long non-coding RNA) that maps to a genomic position encompassing or in proximity to the HBB, HBD, HBE1, HBG1 or HBG2 gene.

In another aspect of the invention provides methods of treating a condition (e.g. , anemia, sickle cell anemia, thalassemia) associated with decreased levels of expression of HBB, HBD, HBE1, HBG1 or HBG2 in a subject, the method comprising administering a single stranded oligonucleotide as described herein. A subject can include a non-human mammal, e.g. mouse, rat, guinea pig, rabbit, cat, dog, goat, cow, or horse. In preferred embodiments, a subject is a human. Single stranded oligonucleotides have been employed as therapeutic moieties in the treatment of disease states in animals, including humans. Single stranded oligonucleotides can be useful therapeutic modalities that can be configured to be useful in treatment regimes for the treatment of cells, tissues and animals, especially humans.

For therapeutics, an animal, preferably a human, suspected of having anemia, sickle cell anemia and/or thalassemia is treated by administering single stranded oligonucleotide in accordance with this invention. For example, in one non-limiting embodiment, the methods comprise the step of administering to the animal in need of treatment, a therapeutically effective amount of a single stranded oligonucleotide as described herein.

Formulation, Delivery, And Dosing

The oligonucleotides described herein can be formulated for administration to a subject for treating a condition (e.g., anemia, sickle cell anemia, thalassemia) associated with decreased levels of HBB, HBD, HBE1, HBG1 or HBG2. It should be understood that the formulations, compositions and methods can be practiced with any of the oligonucleotides disclosed herein.

The formulations may conveniently be presented in unit dosage form and may be prepared by any methods well known in the art of pharmacy. The amount of active ingredient (e.g., an oligonucleotide or compound of the invention) which can be combined with a carrier material to produce a single dosage form will vary depending upon the host being treated, the particular mode of administration, e.g., intradermal or inhalation. The amount of active ingredient which can be combined with a carrier material to produce a single dosage form will generally be that amount of the compound which produces a therapeutic effect, e.g. tumor regression.

Pharmaceutical formulations of this invention can be prepared according to any method known to the art for the manufacture of pharmaceuticals. Such formulations can contain sweetening agents, flavoring agents, coloring agents and preserving agents. A formulation can be admixtured with nontoxic pharmaceutically acceptable excipients which are suitable for manufacture. Formulations may comprise one or more diluents, emulsifiers, preservatives, buffers, excipients, etc. and may be provided in such forms as liquids, powders, emulsions, lyophilized powders, sprays, creams, lotions, controlled release formulations, tablets, pills, gels, on patches, in implants, etc.

A formulated single stranded oligonucleotide composition can assume a variety of states. In some examples, the composition is at least partially crystalline, uniformly crystalline, and/or anhydrous (e.g. , less than 80, 50, 30, 20, or 10% water). In another example, the single stranded oligonucleotide is in an aqueous phase, e.g. , in a solution that includes water. The aqueous phase or the crystalline compositions can, e.g. , be incorporated into a delivery vehicle, e.g. , a liposome (particularly for the aqueous phase) or a particle (e.g. , a microparticle as can be appropriate for a crystalline composition). Generally, the single stranded oligonucleotide composition is formulated in a manner that is compatible with the intended method of administration.

In some embodiments, the composition is prepared by at least one of the following methods: spray drying, lyophilization, vacuum drying, evaporation, fluid bed drying, or a combination of these techniques; or sonication with a lipid, freeze-drying, condensation and other self-assembly.

A single stranded oligonucleotide preparation can be formulated or administered (together or separately) in combination with another agent, e.g. , another therapeutic agent or an agent that stabilizes a single stranded oligonucleotide, e.g. , a protein that complexes with single stranded oligonucleotide. Still other agents include chelators, e.g. , EDTA (e.g. , to remove divalent cations such as Mg 2+ ), salts, RNAse inhibitors (e.g. , a broad specificity RNAse inhibitor such as RNAsin) and so forth.

In one embodiment, the single stranded oligonucleotide preparation includes another single stranded oligonucleotide, e.g. , a second single stranded oligonucleotide that modulates expression of a second gene or a second single stranded oligonucleotide that modulates expression of the first gene. Still other preparation can include at least 3, 5, ten, twenty, fifty, or a hundred or more different single stranded oligonucleotide species. Such single stranded oligonucleotides can mediated gene expression with respect to a similar number of different genes. In one embodiment, the single stranded oligonucleotide preparation includes at least a second therapeutic agent (e.g. , an agent other than an oligonucleotide). Route of Delivery

A composition that includes a single stranded oligonucleotide can be delivered to a subject by a variety of routes. Exemplary routes include: intravenous, intradermal, topical, rectal, parenteral, anal, intravaginal, intranasal, pulmonary, ocular. The term "therapeutically effective amount" is the amount of oligonucleotide present in the composition that is needed to provide the desired level of HBB, HBD, HBE1, HBG1 or HBG2 expression in the subject to be treated to give the anticipated physiological response. The term "physiologically effective amount" is that amount delivered to a subject to give the desired palliative or curative effect. The term "pharmaceutically acceptable carrier" means that the carrier can be administered to a subject with no significant adverse toxicological effects to the subject.

The single stranded oligonucleotide molecules of the invention can be incorporated into pharmaceutical compositions suitable for administration. Such compositions typically include one or more species of single stranded oligonucleotide and a pharmaceutically acceptable carrier. As used herein the language "pharmaceutically acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, compatible with pharmaceutical administration. The use of such media and agents for pharmaceutically active substances is well known in the art. Except insofar as any conventional media or agent is incompatible with the active compound, use thereof in the compositions is contemplated. Supplementary active compounds can also be incorporated into the compositions.

The pharmaceutical compositions of the present invention may be administered in a number of ways depending upon whether local or systemic treatment is desired and upon the area to be treated. Administration may be topical (including ophthalmic, vaginal, rectal, intranasal, transdermal), oral or parenteral. Parenteral administration includes intravenous drip, subcutaneous, intraperitoneal or intramuscular injection, or intrathecal or

intraventricular administration.

The route and site of administration may be chosen to enhance targeting. For example, to target muscle cells, intramuscular injection into the muscles of interest would be a logical choice. Lung cells might be targeted by administering the single stranded oligonucleotide in aerosol form. The vascular endothelial cells could be targeted by coating a balloon catheter with the single stranded oligonucleotide and mechanically introducing the oligonucleotide.

Topical administration refers to the delivery to a subject by contacting the formulation directly to a surface of the subject. The most common form of topical delivery is to the skin, but a composition disclosed herein can also be directly applied to other surfaces of the body, e.g. , to the eye, a mucous membrane, to surfaces of a body cavity or to an internal surface. As mentioned above, the most common topical delivery is to the skin. The term encompasses several routes of administration including, but not limited to, topical and transdermal. These modes of administration typically include penetration of the skin's permeability barrier and efficient delivery to the target tissue or stratum. Topical administration can be used as a means to penetrate the epidermis and dermis and ultimately achieve systemic delivery of the composition. Topical administration can also be used as a means to selectively deliver oligonucleotides to the epidermis or dermis of a subject, or to specific strata thereof, or to an underlying tissue.

Formulations for topical administration may include transdermal patches, ointments, lotions, creams, gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers, aqueous, powder or oily bases, thickeners and the like may be necessary or desirable. Coated condoms, gloves and the like may also be useful.

Transdermal delivery is a valuable route for the administration of lipid soluble therapeutics. The dermis is more permeable than the epidermis and therefore absorption is much more rapid through abraded, burned or denuded skin. Inflammation and other physiologic conditions that increase blood flow to the skin also enhance transdermal adsorption. Absorption via this route may be enhanced by the use of an oily vehicle

(inunction) or through the use of one or more penetration enhancers. Other effective ways to deliver a composition disclosed herein via the transdermal route include hydration of the skin and the use of controlled release topical patches. The transdermal route provides a potentially effective means to deliver a composition disclosed herein for systemic and/or local therapy. In addition, iontophoresis (transfer of ionic solutes through biological membranes under the influence of an electric field), phonophoresis or sonophoresis (use of ultrasound to enhance the absorption of various therapeutic agents across biological membranes, notably the skin and the cornea), and optimization of vehicle characteristics relative to dose position and retention at the site of administration may be useful methods for enhancing the transport of topically applied compositions across skin and mucosal sites.

Both the oral and nasal membranes offer advantages over other routes of

administration. For example, oligonucleotides administered through these membranes may have a rapid onset of action, provide therapeutic plasma levels, avoid first pass effect of hepatic metabolism, and avoid exposure of the oligonucleotides to the hostile gastrointestinal (GI) environment. Additional advantages include easy access to the membrane sites so that the oligonucleotide can be applied, localized and removed easily.

In oral delivery, compositions can be targeted to a surface of the oral cavity, e.g. , to sublingual mucosa which includes the membrane of ventral surface of the tongue and the floor of the mouth or the buccal mucosa which constitutes the lining of the cheek. The sublingual mucosa is relatively permeable thus giving rapid absorption and acceptable bioavailability of many agents. Further, the sublingual mucosa is convenient, acceptable and easily accessible.

A pharmaceutical composition of single stranded oligonucleotide may also be administered to the buccal cavity of a human being by spraying into the cavity, without inhalation, from a metered dose spray dispenser, a mixed micellar pharmaceutical

formulation as described above and a propellant. In one embodiment, the dispenser is first shaken prior to spraying the pharmaceutical formulation and propellant into the buccal cavity.

Compositions for oral administration include powders or granules, suspensions or solutions in water, syrups, slurries, emulsions, elixirs or non-aqueous media, tablets, capsules, lozenges, or troches. In the case of tablets, carriers that can be used include lactose, sodium citrate and salts of phosphoric acid. Various disintegrants such as starch, and lubricating agents such as magnesium stearate, sodium lauryl sulfate and talc, are commonly used in tablets. For oral administration in capsule form, useful diluents are lactose and high molecular weight polyethylene glycols. When aqueous suspensions are required for oral use, the nucleic acid compositions can be combined with emulsifying and suspending agents. If desired, certain sweetening and/or flavoring agents can be added.

Parenteral administration includes intravenous drip, subcutaneous, intraperitoneal or intramuscular injection, intrathecal or intraventricular administration. In some embodiments, parental administration involves administration directly to the site of disease (e.g. injection into a tumor).

Formulations for parenteral administration may include sterile aqueous solutions which may also contain buffers, diluents and other suitable additives. Intraventricular injection may be facilitated by an intraventricular catheter, for example, attached to a reservoir. For intravenous use, the total concentration of solutes should be controlled to render the preparation isotonic.

Any of the single stranded oligonucleotides described herein can be administered to ocular tissue. For example, the compositions can be applied to the surface of the eye or nearby tissue, e.g. , the inside of the eyelid. For ocular administration, ointments or droppable liquids may be delivered by ocular delivery systems known to the art such as applicators or eye droppers. Such compositions can include mucomimetics such as hyaluronic acid, chondroitin sulfate, hydroxypropyl methylcellulose or poly(vinyl alcohol), preservatives such as sorbic acid, EDTA or benzylchronium chloride, and the usual quantities of diluents and/or carriers. The single stranded oligonucleotide can also be administered to the interior of the eye, and can be introduced by a needle or other delivery device which can introduce it to a selected area or structure.

Pulmonary delivery compositions can be delivered by inhalation by the patient of a dispersion so that the composition, preferably single stranded oligonucleotides, within the dispersion can reach the lung where it can be readily absorbed through the alveolar region directly into blood circulation. Pulmonary delivery can be effective both for systemic delivery and for localized delivery to treat diseases of the lungs.

Pulmonary delivery can be achieved by different approaches, including the use of nebulized, aerosolized, micellular and dry powder-based formulations. Delivery can be achieved with liquid nebulizers, aerosol-based inhalers, and dry powder dispersion devices. Metered-dose devices are preferred. One of the benefits of using an atomizer or inhaler is that the potential for contamination is minimized because the devices are self-contained. Dry powder dispersion devices, for example, deliver agents that may be readily formulated as dry powders. A single stranded oligonucleotide composition may be stably stored as lyophilized or spray-dried powders by itself or in combination with suitable powder carriers. The delivery of a composition for inhalation can be mediated by a dosing timing element which can include a timer, a dose counter, time measuring device, or a time indicator which when incorporated into the device enables dose tracking, compliance monitoring, and/or dose triggering to a patient during administration of the aerosol medicament.

The term "powder" means a composition that consists of finely dispersed solid particles that are free flowing and capable of being readily dispersed in an inhalation device and subsequently inhaled by a subject so that the particles reach the lungs to permit penetration into the alveoli. Thus, the powder is said to be "respirable." Preferably the average particle size is less than about 10 μιη in diameter preferably with a relatively uniform spheroidal shape distribution. More preferably the diameter is less than about 7.5 μ m and most preferably less than about 5.0 μ m. Usually the particle size distribution is between about 0.1 μ m and about 5 μ m in diameter, particularly about 0.3 μ m to about 5 μ m.

The term "dry" means that the composition has a moisture content below about 10% by weight (% w) water, usually below about 5% w and preferably less it than about 3% w. A dry composition can be such that the particles are readily dispersible in an inhalation device to form an aerosol.

The types of pharmaceutical excipients that are useful as carrier include stabilizers such as human serum albumin (HSA), bulking agents such as carbohydrates, amino acids and polypeptides; pH adjusters or buffers; salts such as sodium chloride; and the like. These carriers may be in a crystalline or amorphous form or may be a mixture of the two.

Suitable pH adjusters or buffers include organic salts prepared from organic acids and bases, such as sodium citrate, sodium ascorbate, and the like; sodium citrate is preferred. Pulmonary administration of a micellar single stranded oligonucleotide formulation may be achieved through metered dose spray devices with propellants such as tetrafluoroethane, heptafluoroethane, dimethylfluoropropane, tetrafluoropropane, butane, isobutane, dimethyl ether and other non-CFC and CFC propellants.

Exemplary devices include devices which are introduced into the vasculature, e.g. , devices inserted into the lumen of a vascular tissue, or which devices themselves form a part of the vasculature, including stents, catheters, heart valves, and other vascular devices. These devices, e.g. , catheters or stents, can be placed in the vasculature of the lung, heart, or leg.

Other devices include non- vascular devices, e.g. , devices implanted in the peritoneum, or in organ or glandular tissue, e.g. , artificial organs. The device can release a therapeutic substance in addition to a single stranded oligonucleotide, e.g. , a device can release insulin.

In one embodiment, unit doses or measured doses of a composition that includes single stranded oligonucleotide are dispensed by an implanted device. The device can include a sensor that monitors a parameter within a subject. For example, the device can include pump, e.g. , and, optionally, associated electronics.

Tissue, e.g. , cells or organs can be treated with a single stranded oligonucleotide, ex vivo and then administered or implanted in a subject. The tissue can be autologous, allogeneic, or xenogeneic tissue. E.g. , tissue can be treated to reduce graft v. host disease . In other embodiments, the tissue is allogeneic and the tissue is treated to treat a disorder characterized by unwanted gene expression in that tissue. E.g. , tissue, e.g. , hematopoietic cells, e.g. , bone marrow hematopoietic cells, can be treated to inhibit unwanted cell proliferation. Introduction of treated tissue, whether autologous or transplant, can be combined with other therapies. In some implementations, the single stranded oligonucleotide treated cells are insulated from other cells, e.g. , by a semi-permeable porous barrier that prevents the cells from leaving the implant, but enables molecules from the body to reach the cells and molecules produced by the cells to enter the body. In one embodiment, the porous barrier is formed from alginate.

In one embodiment, a contraceptive device is coated with or contains a single stranded oligonucleotide. Exemplary devices include condoms, diaphragms, IUD

(implantable uterine devices, sponges, vaginal sheaths, and birth control devices.

Dosage

In one aspect, the invention features a method of administering a single stranded oligonucleotide (e.g., as a compound or as a component of a composition) to a subject (e.g. , a human subject). In one embodiment, the unit dose is between about 10 mg and 25 mg per kg of bodyweight. In one embodiment, the unit dose is between about 1 mg and 100 mg per kg of bodyweight. In one embodiment, the unit dose is between about 0.1 mg and 500 mg per kg of bodyweight. In some embodiments, the unit dose is more than 0.001, 0.005, 0.01, 0.05, 0.1, 0.5, 1, 2, 5, 10, 25, 50 or 100 mg per kg of bodyweight. The defined amount can be an amount effective to treat or prevent a disease or disorder, e.g. , a disease or disorder associated with the HBB, HBD, HBE1, HBG1 or HBG2. The unit dose, for example, can be administered by injection (e.g. , intravenous or

intramuscular), an inhaled dose, or a topical application.

In some embodiments, the unit dose is administered daily. In some embodiments, less frequently than once a day, e.g. , less than every 2, 4, 8 or 30 days. In another embodiment, the unit dose is not administered with a frequency (e.g. , not a regular frequency). For example, the unit dose may be administered a single time. In some embodiments, the unit dose is administered more than once a day, e.g. , once an hour, two hours, four hours, eight hours, twelve hours, etc.

In one embodiment, a subject is administered an initial dose and one or more maintenance doses of a single stranded oligonucleotide. The maintenance dose or doses are generally lower than the initial dose, e.g. , one-half less of the initial dose. A maintenance regimen can include treating the subject with a dose or doses ranging from 0.0001 to 100 mg/kg of body weight per day, e.g. , 100, 10, 1, 0.1, 0.01, 0.001, or 0.0001 mg per kg of bodyweight per day. The maintenance doses may be administered no more than once every 1, 5, 10, or 30 days. Further, the treatment regimen may last for a period of time which will vary depending upon the nature of the particular disease, its severity and the overall condition of the patient. In some embodiments the dosage may be delivered no more than once per day, e.g. , no more than once per 24, 36, 48, or more hours, e.g. , no more than once for every 5 or 8 days. Following treatment, the patient can be monitored for changes in his condition and for alleviation of the symptoms of the disease state. The dosage of the oligonucleotide may either be increased in the event the patient does not respond significantly to current dosage levels, or the dose may be decreased if an alleviation of the symptoms of the disease state is observed, if the disease state has been ablated, or if undesired side-effects are observed.

The effective dose can be administered in a single dose or in two or more doses, as desired or considered appropriate under the specific circumstances. If desired to facilitate repeated or frequent infusions, implantation of a delivery device, e.g. , a pump, semi- permanent stent (e.g. , intravenous, intraperitoneal, intracisternal or intracapsular), or reservoir may be advisable. In some embodiments, the oligonucleotide pharmaceutical composition includes a plurality of single stranded oligonucleotide species. In another embodiment, the single stranded oligonucleotide species has sequences that are non- overlapping and non-adjacent to another species with respect to a naturally occurring target sequence (e.g. , a PRC2-associated region). In another embodiment, the plurality of single stranded oligonucleotide species is specific for different PRC2-associated regions. In another embodiment, the single stranded oligonucleotide is allele specific. In some cases, a patient is treated with a single stranded oligonucleotide in conjunction with other therapeutic modalities.

Following successful treatment, it may be desirable to have the patient undergo maintenance therapy to prevent the recurrence of the disease state, wherein the compound of the invention is administered in maintenance doses, ranging from 0.0001 mg to 100 mg per kg of body weight.

The concentration of the single stranded oligonucleotide composition is an amount sufficient to be effective in treating or preventing a disorder or to regulate a physiological condition in humans. The concentration or amount of single stranded oligonucleotide administered will depend on the parameters determined for the agent and the method of administration, e.g. nasal, buccal, pulmonary. For example, nasal formulations may tend to require much lower concentrations of some ingredients in order to avoid irritation or burning of the nasal passages. It is sometimes desirable to dilute an oral formulation up to 10- 100 times in order to provide a suitable nasal formulation.

Certain factors may influence the dosage required to effectively treat a subject, including but not limited to the severity of the disease or disorder, previous treatments, the general health and/or age of the subject, and other diseases present. Moreover, treatment of a subject with a therapeutically effective amount of a single stranded oligonucleotide can include a single treatment or, preferably, can include a series of treatments. It will also be appreciated that the effective dosage of a single stranded oligonucleotide used for treatment may increase or decrease over the course of a particular treatment. For example, the subject can be monitored after administering a single stranded oligonucleotide composition. Based on information from the monitoring, an additional amount of the single stranded

oligonucleotide composition can be administered. Dosing is dependent on severity and responsiveness of the disease condition to be treated, with the course of treatment lasting from several days to several months, or until a cure is effected or a diminution of disease state is achieved. Optimal dosing schedules can be calculated from measurements of HBB, HBD, HBEl, HBGl or HBG2 expression levels in the body of the patient. Persons of ordinary skill can easily determine optimum dosages, dosing methodologies and repetition rates. Optimum dosages may vary depending on the relative potency of individual compounds, and can generally be estimated based on EC50s found to be effective in in vitro and in vivo animal models. In some embodiments, the animal models include transgenic animals that express a human HBB, HBD, HBEl, HBGl or HBG2. In another embodiment, the composition for testing includes a single stranded oligonucleotide that is complementary, at least in an internal region, to a sequence that is conserved between HBB, HBD, HBEl, HBGl or HBG2 in the animal model and the HBB, HBD, HBEl, HBGl or HBG2 in a human.

In one embodiment, the administration of the single stranded oligonucleotide composition is parenteral, e.g. intravenous (e.g. , as a bolus or as a diffusible infusion), intradermal, intraperitoneal, intramuscular, intrathecal, intraventricular, intracranial, subcutaneous, transmucosal, buccal, sublingual, endoscopic, rectal, oral, vaginal, topical, pulmonary, intranasal, urethral or ocular. Administration can be provided by the subject or by another person, e.g. , a health care provider. The composition can be provided in measured doses or in a dispenser which delivers a metered dose. Selected modes of delivery are discussed in more detail below.

Kits

In certain aspects of the invention, kits are provided, comprising a container housing a composition comprising a single stranded oligonucleotide. In some embodiments, the composition is a pharmaceutical composition comprising a single stranded oligonucleotide and a pharmaceutically acceptable carrier. In some embodiments, the individual components of the pharmaceutical composition may be provided in one container. Alternatively, it may be desirable to provide the components of the pharmaceutical composition separately in two or more containers, e.g. , one container for single stranded oligonucleotides, and at least another for a carrier compound. The kit may be packaged in a number of different configurations such as one or more containers in a single box. The different components can be combined, e.g. , according to instructions provided with the kit. The components can be combined according to a method described herein, e.g. , to prepare and administer a pharmaceutical composition. The kit can also include a delivery device.

The present invention is further illustrated by the following Examples, which in no way should be construed as further limiting. The entire contents of all of the references (including literature references, issued patents, published patent applications, and co-pending patent applications) cited throughout this application are hereby expressly incorporated by reference.

EXAMPLES

The invention is further described in the following examples, which do not limit the scope of the invention described in the claims.

MATERIALS AND METHODS:

Real Time PCR

RNA is harvested from the cells using Promega SV 96 Total RNA Isolation system or Trizol omitting the DNAse step. RNA harvested from cells is normalized so that equal RNA is input to each reverse transcription reaction. Reverse transcriptase reactions are performed using the Superscript II kit and real time PCR performed on cDNA samples using icycler SYBR green chemistry (Biorad). A baseline level of mRNA expression for HBB, HBD, HBE1, HBG1 or HBG2 is determined through quantitative PCR as outlined above. Baseline levels are also determined for mRNA of various housekeeping genes which are constitutively expressed. A "control" housekeeping gene with approximately the same level of baseline expression as the target gene is chosen for comparison purposes.

Cell Culture Human hepatocyte Hep3B, human hepatocyte HepG2 cells, mouse hepatoma Hepal-6 cells, and human renal proximal tubule epithelial cells (RPTEC) are cultured using conditions known in the art (see, e.g. Current Protocols in Cell Biology). Oligonucleotide design

Oligonucleotides were designed within PRC2-interacting regions in order to upregulate HBB or HBG2. The sequence and structure of each oligonucleotide is shown in Table 3. A description of the nucleotide analogs, modifications and intranucleotide linkages used for certain oligonucleotides described is provided in Table 2.

SEQ ID NOs: 3789 to 3796 are control oligos. SEQ ID NO: 3789 is a splice- altering oligo targeting a site in intron 2 of HBB. SEQ ID NOs: 3790-3793 are 15-mer subsequences of SEQ ID NO: 3789. SEQ ID NO: 3795 is a dimer of SEQ ID NO: 3789. SEQ ID NOs: 3794 and 3796 are a control oligo and a dimer version of the control oligo, respectively. In vitro transfection of cells with oligonucleotides

Oligonucleotides are designed within PRC2-interacting regions in order to upregulate HBB, HBD, HBE1, HBG1 or HBG2. Cells are seeded into each well of 24-well plates at a density of 25,000 cells per 500uL and transfections are performed with Lipofectamine and the single stranded oligonucleotides. Control wells contain Lipofectamine alone. At 48 hours post-transfection, approximately 200 uL of cell culture supernatants are stored at -80 C for ELISA. At 48 hours post-transfection, RNA is harvested from the cells and quantitative PCR is carried out as outlined above. The percent induction of target mRNA expression by each oligonucleotide is determined by normalizing mRNA levels in the presence of the

oligonucleotide to the mRNA levels in the presence of control (Lipofectamine alone). This is compared side -by-side with the increase in mRNA expression of the "control" housekeeping gene.

In vitro delivery of single stranded oligonucleotides

Oligonucleotides are designed as candidates for upregulating HBB, HBD, HBE1, HBG1 or HBG2 expression. The single stranded oligonucleotides are designed to be complementary to a PRC2-interacting region within a sequence as set forth as any one of SEQ ID NOS: 1-10. The oligonucleotides are tested in at least duplicate. Briefly, cells are transfected in vitro with each of the oligonucleotides as described above. HBB, HBD, HBEl, HBG1 or HBG2 expression in cells following treatment is evaluated by qRT-PCR.

Oligonucleotides that upregulate HBB, HBD, HBEl, HBG1 or HBG2 expression are identified.

Table 1: Hexamers that are not seed sequences of human miRNAs

AAAAAA, AAAAAG, AAAACA, AAAAGA, AAAAGC, AAAAGG, AAAAUA, AAACAA, AAACAC, AAACAG, AAACAU, AAACCC, AAACCU, AAACGA, AAACGC, AAACGU, AAACUA, AAACUC, AAACUU, AAAGAU, AAAGCC, AAAGGA, AAAGGG, AAAGUC, AAAUAC, AAAUAU, AAAUCG, AAAUCU, AAAUGC, AAAUGU, AAAUUA, AAAUUG, AACAAC, AACAAG, AACAAU, AACACA, AACACG, AACAGA, AACAGC, AACAGG, AACAUC, AACAUG, AACCAA, AACCAC, AACCAG, AACCAU, AACCCC, AACCCG, AACCGA, AACCGC, AACCGG, AACCUA, AACCUU, AACGAA, AACGAC, AACGAG, AACGAU, AACGCU, AACGGG, AACGGU, AACGUA, AACGUC, AACGUG, AACGUU, AACUAU, AACUCA, AACUCC, AACUCG, AACUGA, AACUGC, AACUGU, AACU UA, AACUUC, AACU UG, AACUUU, AAGAAA, AAGAAG, AAGAAU, AAGACG, AAGAGA, AAGAGC, AAGAGG, AAGAGU, AAGAUU, AAGCAA, AAGCAC, AAGCAG, AAGCAU, AAGCCA, AAGCCC, AAGCCG, AAGCCU, AAGCGA, AAGCGG, AAGCGU, AAGCUA, AAGGAA, AAGGAC, AAGGCU, AAGGGC, AAGGGU, AAGGUU, AAGUAA, AAGUAC, AAGUAU, AAGUCC, AAGUCG, AAGUGA, AAGUGG, AAGUUA, AAGU UU, AAUAAA, AAUAAC, AAUAAG, AAUAAU, AAUACA, AAUACC, AAUACG, AAUAGA, AAUAGC, AAUAGG, AAUAGU, AAUAUC, AAUAU U, AAUCAA, AAUCAU, AAUCCA, AAUCCC, AAUCCG, AAUCGA, AAUCGC, AAUCGU, AAUCUA, AAUCUG, AAUCUU, AAUGAA, AAUGAC, AAUGAG, AAUGAU, AAUGCG, AAUGCU, AAUGGA, AAUGGU, AAUGUA, AAUGUC, AAUGUG, AAUUAA, AAUUAC, AAUUAG, AAU UCC, AAU UCG, AAUUGA, AAUUGG, AAU UGU, AAUUUC, AAU UUG, ACAAAA, ACAAAC, ACAAAG, ACAAAU, ACAACC, ACAACG, ACAACU, ACAAGA, ACAAGC, ACAAGU, ACAAUC, ACAAUG, ACAAUU, ACACAG, ACACCA, ACACCC, ACACCG, ACACCU, ACACGA, ACACGC, ACACGU, ACACUC, ACACUG, ACACUU, ACAGAA, ACAGAC, ACAGCC, ACAGCG, ACAGCU, ACAGGG, ACAGUC, ACAGUG, ACAGU U, ACAUAA, ACAUAC, ACAUCC, ACAUCG, ACAUCU, ACAUGA, ACAUGC, ACAUGU, ACAU UG, ACAU UU, ACCAAA, ACCAAC, ACCAAG, ACCAAU, ACCACC, ACCACG, ACCAGA, ACCAGU, ACCAUA, ACCAUG, ACCAU U, ACCCAA, ACCCAC, ACCCCA, ACCCCG, ACCCGA, ACCCGC, ACCCUA, ACCCUC, ACCCUU, ACCGAA, ACCGAC, ACCGAU, ACCGCA, ACCGCC, ACCGCG, ACCGCU, ACCGGA, ACCGGC, ACCGGU, ACCGUA, ACCGUC, ACCGUG, ACCGUU, ACCUAA, ACCUAC, ACCUAG, ACCUAU, ACCUCA, ACCUCC, ACCUCG, ACCUCU, ACCUGA, ACCUGC, ACCUGU, ACCUUA, ACCUUC, ACCUU U, ACGAAA, ACGAAC, ACGAAG, ACGAAU, ACGACA, ACGACC, ACGACG, ACGACU, ACGAGA, ACGAGC, ACGAGG, ACGAGU, ACGAUA, ACGAUC, ACGAUG, ACGAUU, ACGCAA, ACGCAG, ACGCAU, ACGCCC, ACGCCG, ACGCCU, ACGCGA, ACGCGG, ACGCGU, ACGCUA, ACGCUG, ACGCUU, ACGGAA, ACGGAC, ACGGAG, ACGGAU, ACGGCC, ACGGCG, ACGGCU, ACGGGC, ACGGGG, ACGGGU, ACGGUA, ACGGUC, ACGGUG, ACGGUU, ACGUAA, ACGUAC, ACGUAU, ACGUCC, ACGUCG, ACGUCU, ACGUGA, ACGUGC, ACGUGG, ACGUGU, ACGUUA, ACGU UC, ACGUUG, ACGUU U, AC U AAA, ACUAAG, ACUAAU, ACUACA, ACUACC, ACUACG, ACUACU, ACUAGG, ACUAUC, ACUAUG, ACUAUU, ACUCAU, ACUCCC, ACUCCG, ACUCCU, ACUCGA, ACUCGC, ACUCGG, ACUCUC, ACUCUU, ACUGAG, ACUGAU, ACUGCC, ACUGCG, ACUGCU, ACUGGG, ACUGGU, ACUGUC, ACUUAA, ACU UAC, ACUUAU, ACU UCA, ACUUCC, ACUUCG, ACUUCU, ACUUGA, ACU UGC, ACU UGU, ACUUUA, ACU UUC, ACUU UG, AGAAAA, AGAAAC, AGAAAG, AGAACC, AGAACG, AGAACU, AGAAGC, AGAAGU, AGAAUA, AGAAUC, AGAAUG, AGAAUU, AGACAA, AGACAC, AGACAU, AGACCA, AGACCC, AGACCG, AGACCU, AGACGA, AGACGC, AGACGU, AGACUA, AGACUC, AGACUU, AGAGAC, AGAGAG, AGAGAU, AGAGCC, AGAGCG, AGAGCU, AGAGGC, AGAGGG, AGAGGU, AGAGUA, AGAGUU, AGAUAC, AGAUAG, AGAUAU, AGAUCC, AGAUCG, AGAUCU, AGAUGA, AGAUGC, AGAUGG, AGAU UA, AGAU UC, AGAUUG, AGAUU U, AGCAAC, AGCACA, AGCACG, AGCACU, AGCAGA, AGCAUA, AGCAUC, AGCAUG, AGCCAA, AGCCAU, AGCCCA, AGCCGA, AGCCGC, AGCCGG, AGCCGU, AGCCUA, AGCCUC, AGCGAA, AGCGAG, AGCGAU, AGCGCA, AGCGCC, AGCGCG, AGCGCU, AGCGGA, AGCGGC, AGCGGU, AGCGUA, AGCGUC, AGCGUG, AGCGUU, AGCUAA, AGCUAC, AGCUAG, AGCUAU, AGCUCA, AGCUCC, AGCUCG, AGCUCU, AGCUGA, AGCUGG, AGCUGU, AGCU UC, AGCUU U, AGGAAU, AGGACC, AGGACG, AGGAGA, AGGAGU, AGGAUA, AGGCAA, AGGCAU, AGGCCG, AGGCGA, AGGCGC, AGGCGG, AGGCUA, AGGCUC, AGGCUU, AGGGAC, AGGGAU, AGGGGA, AGGGGU, AGGGUA, AGGGUG, AGGUAA, AGGUAC, AGGUCA, AGGUCC, AGGUCU, AGGUGA, AGGUGC, AGGUGG, AGGUGU, AGGU UC,

AGGUUG, AGUAAA, AGUAAG, AGUAAU, AGUACA, AGUACG, AGUAGC, AGUAGG, AGUAUA, AGUAUC, AGUAUG, AGUAUU, AGUCAA, AGUCAC, AGUCAG, AGUCAU, AGUCCA, AGUCCG, AGUCCU, AGUCGA, AGUCGC, AGUCGG, AGUCGU, AGUCUA, AGUCUC, AGUCUG, AGUCU U, AGUGAA, AGUGAC, AGUGCG, AGUGGG, AGUGUC, AGU UAA, AGU UAC, AGUUAG, AGUUCC, AGU UCG, AGUUGA, AGUUGC,

AGUUGU, AGUUUA, AGUUUC, AGUU UG, AGU UU U, AUAAAC, AUAAAU, AUAACA, AUAACC, AUAACG, AUAACU, AUAAGA, AUAAGC, AUAAGG, AUAAGU, AUAAUC, AUAAUG, AUAAUU, AUACAC, AUACAG, AUACAU, AUACCA, AUACCC, AUACCG, AUACGA, AUACGC, AUACGG, AUACGU, AUACUA, AUACUC, AUACUG, AUACUU, AUAGAA, AUAGAC, AUAGAU, AUAGCA, AUAGCG, AUAGCU, AUAGGA, AUAGGU, AUAGUA, AUAGUC, AUAGUG, AUAGUU, AUAUAC, AUAUAG, AUAUCC, AUAUCG, AUAUCU, AUAUGA, AUAUGC, AUAUGG, AUAUGU, AUAUUC, AUAU UG, AUAUU U, AUCAAA, AUCAAC, AUCAAG, AUCAAU, AUCACA, AUCACC, AUCACG, AUCAGC, AUCAGG, AUCCAA, AUCCAU, AUCCCC, AUCCCG, AUCCGA, AUCCGC, AUCCGG, AUCCUA, AUCCUC, AUCCUG, AUCGAA, AUCGAC, AUCGAG, AUCGAU, AUCGCA, AUCGCC, AUCGCG, AUCGCU, AUCGGC, AUCGGG, AUCGGU, AUCGUC, AUCGUG, AUCGU U, AUCUAA, AUCUAC, AUCUAG, AUCUAU, AUCUCC, AUCUCG, AUCUGU, AUCUUG, AUCUU U, AUGAAA, AUGAAC, AUGAAG, AUGAAU, AUGACC, AUGACU, AUGAGG, AUGAGU, AUGAUA, AUGAUC, AUGAU U, AUGCAA, AUGCAG, AUGCCA, AUGCCC, AUGCCG, AUGCGA, AUGCGG, AUGCGU, AUGCUC, AUGCUU, AUGGAC, AUGGCC, AUGGGA, AUGGGC, AUGGGU, AUGGUC, AUGGUG, AUGUAC, AUGUAU, AUGUCA, AUGUCC, AUGUCG, AUGUGU, AUGUUA, AUGUUC, AU UAAA, AUUAAC, AU UAAG, AU UAAU, AU UACA, AUUACC, AUUACG, AUUACU, AU UAGA, AUUAGC, AUUAGG, AUUAGU, AU UAUA, AUUAUC, AUUAUG, AUUCAC, AUUCCA, AU UCCG, AU UCCU, AUUCGA, AUUCGC, AUUCGG, AU UCGU, AUUCUA, AUUCUC, AUUCU U, AUUGAA, AUUGAC, AUUGAU, AU UGCC, AUUGCG, AU UGCU, AUUGGA, AUUGGC,

AUUGGG, AU UGGU, AUUGUA, AUUGUC, AUUGUG, AU UGU U, AU UUAA, AU UUAG, AU UUAU, AUU UCC, AUU UCG, AUU UCU, AU UUGA, AU UUGC, AU UUGU, AU UUUA, AU UU UC, AUU UUG, AUU UU U, CAAAAG, CAAACA, CAAACC, CAAACG, CAAACU, CAAAGA, CAAAGG, CAAAUA, CAAAUU, CAACAC, CAACAU, CAACCA, CAACCC, CAACCG, CAACGA, CAACGC, CAACGG, CAACGU, CAACUA, CAACUC, CAACUG, CAACUU, CAAGAA, CAAGAC, CAAGAU, CAAGCA, CAAGCC, CAAGCG, CAAGCU, CAAGGA, CAAGGG, CAAGUC, CAAGUG, CAAGU U, CAAUAA, CAAUAC, CAAUAG, CAAUCC, CAAUCG, CAAUCU, CAAUGA, CAAUGC, CAAUGG, CAAUGU, CAAU UC, CAAU UG, CAAU UU, CACAAU, CACACA, CACACG, CACACU, CACAGA, CACAGC, CACAGG, CACAUA, CACAUC, CACAUU, CACCAA, CACCAC, CACCAU, CACCCA, CACCCC, CACCCG, CACCGA, CACCGC, CACCGG, CACCGU, CACCUA, CACCU U, CACGAA, CACGAC, CACGAG, CACGAU, CACGCA, CACGCC, CACGCU, CACGGA, CACGGC, CACGGG, CACGG U, CACGUA, CACGUC, CACGUG, CACGU U, CACUAA, CACUAG, CACUAU, CACUCA, CACUCG, CACUGA, CACUGC, CACUGG, CACUUA, CACU UC, CACU UU, CAGAAA, CAGAAG, CAGAAU, CAGACC, CAGACG, CAGAGC, CAGAUA, CAGAUC, CAGCCG, CAGCCU, CAGCGA, CAGCGC, CAGCGG, CAGCGU, CAGCUC, CAGCUU, CAGGAU, CAGGGG, CAGGGU, CAGGUA, CAGGUC, CAGGUU, CAGUAC, CAGUCG, CAGU UG, CAUAAA, CAUAAC, CAUAAG, CAUAAU, CAUACA, CAUACC, CAUACG, CAUACU, CAUAGA, CAUAGG, CAUAGU, CAUAUA, CAUAUC, CAUAUG, CAUCAA, CAUCAC, CAUCAG, CAUCAU, CAUCCA, CAUCCC, CAUCCG, CAUCGA, CAUCGC, CAUCGG, CAUCGU, CAUCUA, CAUCUC, CAUCUG, CAUCUU, CAUGAA, CAUGAC, CAUGAG, CAUGAU, CAUGCA, CAUGCC, CAUGCG, CAUGCU, CAUGGC, CAUGGG, CAUGGU, CAUGUA, CAUGUC, CAUGUU, CAU UAA, CAUUAC, CAUUAG, CAUUCA, CAU UCC, CAU UCG, CAU UCU, CAU UGA, CAU UGG, CAUU UC, CAU UUG, CAUU UU, CCAAAA, CCAAAC, CCAAAG, CCAAAU, CCAACA, CCAACC, CCAACG, CCAACU, CCAAGA, CCAAGC, CCAAGG, CCAAUC, CCAAUG, CCAAU U, CCACAA, CCACAC, CCACAG, CCACAU, CCACCA, CCACCC, CCACCG, CCACCU, CCACGA, CCACGC, CCACGG, CCACGU, CCACUA, CCACUC, CCACUU, CCAGAA, CCAGAC, CCAGAG, CCAGCC, CCAGGU, CCAGUC, CCAGUU, CCAUAA, CCAUAC, CCAUAG, CCAUAU, CCAUCA, CCAUCC, CCAUCU, CCAUGA, CCAUGC, CCAUGG, CCAUUC, CCAUUG, CCAUU U, CCCAAC, CCCAAG, CCCAAU, CCCACA, CCCAGA, CCCAGC, CCCAGU, CCCAUA, CCCAUC, CCCAUG, CCCAUU, CCCCAA, CCCCAG, CCCCAU, CCCCCC, CCCCCG, CCCCCU, CCCCGA, CCCCGC, CCCCGU, CCCCUA, CCCCUC, CCCGAA, CCCGAC, CCCGAU, CCCGCA, CCCGCU, CCCGGA, CCCGGC, CCCGUA, CCCGUG, CCCGU U, CCCUAA, CCCUAG, CCCUCA, CCCUCU, CCCUGC, CCCUUA, CCCU UC, CCCU UU, CCGAAA, CCGAAC, CCGAAU, CCGACA, CCGACC, CCGACG, CCGACU, CCGAGA, CCGAGG, CCGAGU, CCGAUA, CCGAUC, CCGAUG, CCGAU U, CCGCAA, CCGCAC, CCGCAG, CCGCAU, CCGCCA, CCGCCC, CCGCCG, CCGCCU, CCGCGA, CCGCGC, CCGCGG, CCGCGU, CCGCUA, CCGCUC, CCGCUG, CCGCU U, CCGGAA, CCGGAU, CCGGCA, CCGGCC, CCGGCG, CCGGCU, CCGGGA, CCGGGC, CCGGGG, CCGGGU, CCGGUA, CCGGUC, CCGGUG, CCGUAA, CCGUAG, CCGUAU, CCGUCA, CCGUCC, CCGUCG, CCGUGA, CCGUGU, CCGUUA, CCGUUC, CCGU UG, CCGUU U, CCUAAC, CCUAAG, CCUAAU, CCUACA, CCUACC, CCUACG, CCUACU, CCUAGA, CCUAGC, CCUAGG, CCUAGU, CCUAUA, CCUAUC, CCUAUG, CCUAUU, CCUCAA, CCUCAC, CCUCAG, CCUCAU, CCUCCA, CCUCCC, CCUCCG, CCUCGA, CCUCGC, CCUCGG, CCUCGU, CCUCUA, CCUCUG, CCUGAC, CCUGAU, CCUGCA, CCUGGG, CCUGGU, CCUGU U, CCUUAA, CCUUAC, CCUUAG, CCUUAU, CCUUCG, CCUUGA, CCUUGU, CCU UUA, CCUU UC, CCU UU U, CGAAAA, CGAAAC, CGAAAG, CGAAAU, CGAACA, CGAACC, CGAACG, CGAACU, CGAAGA, CGAAGC, CGAAGG, CGAAGU, CGAAUA, CGAAUC, CGAAUG, CGAAUU, CGACAA, CGACAC, CGACAU, CGACCA, CGACCU, CGACGA, CGACGC, CGACGG, CGACGU, CGACUA, CGACUG, CGACU U, CGAGAA, CGAGAC, CGAGAG, CGAGAU, CGAGCA, CGAGCC, CGAGCG, CGAGCU, CGAGGC, CGAGGG, CGAGGU, CGAGUA, CGAGUC, CGAGUG, CGAGUU, CGAUAA, CGAUAC, CGAUAG, CGAUAU, CGAUCA, CGAUCC, CGAUCG, CGAUCU, CGAUGA, CGAUGC, CGAUGG, CGAUGU, CGAUUA, CGAU UC, CGAUUG, CGAUU U, CGCAAA, CGCAAC, CGCAAG, CGCAAU, CGCACA, CGCACC, CGCACG, CGCAGA, CGCAGC, CGCAGG, CGCAGU, CGCAUA, CGCAUC, CGCAUG, CGCAU U, CGCCAA, CGCCAC, CGCCAG, CGCCAU, CGCCCA, CGCCCC, CGCCCG, CGCCGA, CGCCGC, CGCCGG, CGCCGU, CGCCUA, CGCCUG, CGCCUU, CGCGAA, CGCGAC, CGCGAG, CGCGAU, CGCGCA, CGCGCC, CGCGCG, CGCGCU, CGCGGA, CGCGGC, CGCGGG, CGCGGU, CGCGUA, CGCGUC, CGCGUG, CGCGU U, CGCUAA, CGCUAC, CGCUAG, CGCUAU, CGCUCA, CGCUCC, CGCUCG, CGCUCU, CGCUGA, CGCUGC, CGCUGG, CGCUGU, CGCUUA, CGCU UC, CGCU UG, CGGAAA, CGGAAC, CGGAAG, CGGACA, CGGACC, CGGACG, CGGACU, CGGAGC, CGGAGG, CGGAGU, CGGAUA, CGGAU U, CGGCAA, CGGCAC, CGGCAG, CGGCCA, CGGCCC, CGGCCG, CGGCGC, CGGCGG, CGGCGU, CGGCUA, CGGCUC, CGGCUG, CGGCU U, CGGGAA, CGGGAC, CGGGAG, CGGGAU, CGGGCA, CGGGCC, CGGGCG, CGGGCU, CGGGGU, CGGGUA, CGGGUC, CGGGUG, CGGUAA, CGGUAC, CGGUAG, CGGUAU, CGGUCA, CGGUCG, CGGUCU, CGGUGA, CGGUGG, CGGUGU, CGGU UA, CGGU UC, CGGUUG, CGGUUU, CGUAAA, CGUAAC, CGUAAG, CGUAAU, CGUACA, CGUACG, CGUACU, CGUAGA, CGUAGC, CGUAGG, CGUAGU, CGUAUA, CGUAUC, CGUAUG, CGUAUU, CGUCAA, CGUCAC, CGUCAG, CGUCAU, CGUCCA, CGUCCC, CGUCCG, CGUCCU, CG UCGA, CGUCGG, CGUCGU, CGUCUA, CGUCUC, CGUCUG, CGUCU U, CGUGAA, CGUGAC, CGUGAG, CGUGAU, CGUGCC, CGUGCG, CGUGCU, CGUGGA, CGUGGG, CGUGGU, CGUGUA, CGUGUG, CGUUAA, CGUUAC, CGU UAG,

CGU UAU, CGUUCA, CGUUCC, CGUUCG, CGU UCU, CGUUGA, CGUUGC, CGU UGU, CGU UUA, CGUUUC, CGU U UU, CUAAAA, CUAAAC, CUAAAU, CUAACA, CUAACC, CUAACG, CUAACU, CUAAGA, CUAAGC, CUAAGU, CUAAUA, CUAAUC, CUAAUG, CUACAC, CUACAU, CUACCA, CUACCC, CUACCG, CUACCU, CUACGA, CUACGC, CUACGG, CUACGU, CUACUA, CUACUC, CUACUG, CUAGAA, CUAGAG, CUAGAU, CUAGCA, CUAGCC, CUAGCG, CUAGCU, CUAGGA, CUAGGG, CUAGGU, CUAGUG, CUAGUU, CUAUAA, CUAUAG, CUAUAU, CUAUCA, CUAUCC, CUAUCG, CUAUCU, CUAUGA, CUAUGC, CUAUGG, CUAUGU, CUAUUA, CUAUUG, CUCAAC, CUCAAG, CUCAAU, CUCACC, CUCACG, CUCAGC, CUCAUA, CUCAUC, CUCAUG, CUCAU U, CUCCAC, CUCCCC, CUCCCG, CUCCGA, CUCCGC, CUCCGG, CUCCUA, CUCCUC, CUCCU U, CUCGAA, CUCGAC, CUCGAG, CUCGAU, CUCGCA, CUCGCC, CUCGCG, CUCGGG, CUCGGU, CUCGUA, CUCGUC, CUCGUG, CUCGU U, CUCUAA, CUCUAC, CUCUAU, CUCUCA, CUCUCC, CUCUCU, CUCUGC, CUCUGU, CUCUUA, CUCU UG, CUGAAG, CUGACC, CUGACG, CUGAGC, CUGAUA, CUGAUC, CUGCCG, CUGCCU, CUGCGA, CUGCUA, CUGCUU, CUGGAG, CUGGAU, CUGGCG, CUGGGU, CUGUAC, CUGUCA, CUGUCC, CUGUCG, CUGUGG, CUGUGU, CUGUUA, CUGU UU, CUUAAC, CUUAAG, CU UAAU, CU UACC, CUUACG, CUUAGA, CU UAGC, CU UAGG, CUUAGU, CUUAUA, CU UAUC, CU UAUG, CUUAUU, CU UCAG, CU UCAU, CUUCCA, CUUCCC, CU UCCG, CU UCCU, CUUCGA, CUUCGC, CU UCGG, CUUCGU, CU UCUA, CUUGAC, CUUGAG, CUUGAU, CUUGCA, CUUGCC, CUUGCG, CU UGCU, CUUGGC, CUUGGU, CU UGUU, CUU UAC, CU UUAG, CU UUAU, CUU UCA, CUU UCG, CU UUCU, CUUUGA, CU UUGC, CU UUGU, CU U UUA, CUUU UC, CUU UUG, CUU UUU, GAAAAA, GAAAAG, GAAAAU, GAAACC, GAAACG, GAAAGA, GAAAGC, GAAAGU, GAAAUA, GAAAUC, GAAAUG, GAAAUU, GAACAA, GAACAC, GAACAG, GAACAU, GAACCA, GAACCC, GAACCG, GAACCU, GAACGA, GAACGC, GAACGG, GAACGU, GAACUA, GAACUG, GAACUU, GAAGAC, GAAGAG, GAAGCA, GAAGCG, GAAGCU, GAAGUC, GAAUAA, GAAUAC, GAAUAG, GAAUAU, GAAUCC, GAAUCG, GAAUCU, GAAUGA, GAAUGC, GAAUGU, GAAU UA, GAAU UC, GAAUU U, GACAAA, GACAAG, GACAAU, GACACC, GACAGA, GACAGG, GACAUA, GACAUG, GACAUU, GACCAA, GACCAC, GACCAG, GACCCA, GACCCC, GACCCG, GACCGC, GACCGG, GACCGU, GACCUA, GACCUC, GACCU U, GACGAA, GACGAC, GACGAG, GACGAU, GACGCA, GACGCC, GACGCG, GACGCU, GACGGA, GACGGC, GACGGG, GACGGU, GACGUA, GACGUC, GACGUG, GACGUU, GACUAA, GACUAC, GACUAG, GACUAU, GACUCA, GACUCC, GACUCG, GACUGG, GACUGU, GACUUA, GACUUG, GACUU U, GAGAAU, GAGAGA, GAGAGC, GAGAGG, GAGAUA, GAGAUC, GAGCAA, GAGCAU, GAGCCA, GAGCGA, GAGCGG, GAGCGU, GAGGGU, GAGGUC, GAGGUG, GAGUAA, GAGUAG, GAGUCC, GAGUUC, GAGU UU, GAUAAA, GAUAAC, GAUAAG, GAUAAU, GAUACA, GAUACC, GAUACG, GAUACU, GAUAGA, GAUAGC, GAUAGG, GAUAGU, GAUAUA, GAUCAA, GAUCAC, GAUCAU, GAUCCA, GAUCCC, GAUCCU, GAUCGC, GAUCGG, GAUCGU, GAUCUA, GAUCUG, GAUCU U, GAUGAA, GAUGAC, GAUGAG, GAUGCA, GAUGCC, GAUGCG, GAUGCU, GAUGGC, GAUGGG, GAUGGU, GAUGUG, GAUGUU, GAUUAA, GAUUAC, GAUUAG, GAUUAU, GAUUCA, GAUUCG, GAU UCU, GAUUGA, GAUUGC, GAU UUA, GAU UUC, GAUU UG, GAUUU U, GCAAAC, GCAAAG, GCAAAU, GCAACA, GCAACC, GCAAGC, GCAAGU, GCAAUA, GCAAUC, GCAAUG, GCAAUU, GCACAA, GCACAC, GCACAG, GCACCC, GCACCG, GCACCU, GCACGA, GCACGC, GCACGU, GCACUA, GCACUC, GCACUG, GCACUU, GCAGAU, GCAGCC, GCAGCG, GCAGGC, GCAGUA, GCAGUC, GCAGUG, GCAGUU, GCAUAA, GCAUAG, GCAUAU, GCAUCG, GCAUCU, GCAUGA, GCAUGC, GCAUGG, GCAUGU, GCAU UA, GCAU UC, GCAUUG, GCAUU U, GCCAAA, GCCAAC, GCCAAU, GCCACA, GCCACC, GCCACG, GCCAGA, GCCAGU, GCCAUA, GCCAUC, GCCAUG, GCCAUU, GCCCAA, GCCCAC, GCCCAG, GCCCCG, GCCCGA, GCCCGG, GCCCGU, GCCGAA, GCCGAC, GCCGAG, GCCGAU, GCCGCA, GCCGCU, GCCGGA, GCCGGC, GCCGGG, GCCGGU, GCCGUA, GCCGUC, GCCGUG, GCCGU U, GCCUAA, GCCUAU, GCCUCA, GCCUCC, GCCUCG, GCCUGA, GCCU UA, GCCU UU, GCGAAA, GCGAAC, GCGAAG, GCGAAU, GCGACC, GCGACG, GCGACU, GCGAGA, GCGAGC, GCGAGG, GCGAGU, GCGAUA, GCGAUC, GCGAUG, GCGAUU, GCGCAA, GCGCAC, GCGCAG, GCGCAU, GCGCCA, GCGCCC, GCGCCU, GCGCGA, GCGCGU, GCGCUA, GCGCUC, GCGCUG, GCGCUU, GCGGAA, GCGGAC, GCGGAU, GCGGCA, GCGGCC, GCGGCU, GCGGGA, GCGGUA, GCGGUC, GCGGUU, GCGUAA, GCGUAC, GCGUAG, GCGUAU, GCGUCA, GCGUCC, GCGUCG, GCGUCU, GCGUGA, GCGUGC, GCGUGG, GCGUGU, GCGU UA, GCGUUC, GCGU UG, GCGUU U, GCUAAA, GCUAAC, GCUAAG, GCUAAU, GCUACC, GCUACG, GCUACU, GCUAGA, GCUAGG, GCUAGU, GCUAUA, GCUAUC, GCUAUU, GCUCAA, GCUCAC, GCUCAG, GCUCAU, GCUCCA, GCUCCC, GCUCCG, GCUCGA, GCUCGC, GCUCGU, GCUCUA, GCUCUC, GCUCU U, GCUGAA, GCUGAC, GCUGAU, GCUGCA, GCUGCC, GCUGCG, GCUGCU, GCUGUG, GCUGUU, GCUUAC, GCUUAG, GCUUAU, GCU UCA, GCUUCG, GCU UGA, GCUUGG, GCUUGU, GCUU UA, GCU UUG, GGAAAG, GGAACA, GGAACC, GGAACG, GGAACU, GGAAGU, GGAAUA, GGAAUC, GGAAU U, GGACAA, GGACAC, GGACAG, GGACAU, GGACCG, GGACGA, GGACGC, GGACGU, GGACUA, GGACUC, GGACU U, GGAGAC, GGAGCA, GGAGCG, GGAGGG, GGAGUA, GGAUAA, GGAUAC, GGAUCA, GGAUCC, GGAUCG, GGAUCU, GGAUGC, GGAUUA, GGAUUG, GGCAAU, GGCACA, GGCACU, GGCAGA, GGCAUA, GGCAUC, GGCCAC, GGCCAG, GGCCCC, GGCCGA, GGCCGC, GGCCGU, GGCCUA, GGCCUG, GGCCU U, GGCGAA, GGCGAG, GGCGAU, GGCGCA, GGCGCU, GGCGGU, GGCGUA, GGCGUC, GGCGUG, GGCGU U, GGCUAA, GGCUAC, GGCUAG, GGCUAU, GGCUCC, GGCUCG, GGCUGA, GGCU UA, GGCUUC, GGCUUG, GGGAAU, GGGACA, GGGAGA, GGGAGU, GGGAUA, GGGAU U, GGGCAA, GGGCAC, GGGCAG, GGGCCG, GGGCGG, GGGGCC, GGGGGG,

GGGGGU, GGGGUA, GGGUAC, GGGUAU, GGGUCA, GGGUCC, GGGUCG, GGGUGA, GGGUGC, GGGU UA, GGGU UG, GGUAAA, GGUAAC, GGUAAG, GGUAAU, GGUACA, GGUACC, GGUACG,

GGUACU, GGUAGC, GGUAGG, GGUAGU, GGUAUA, GGUAUC, GGUAUG, GGUCAA, GGUCAC, GGUCAG, GGUCAU, GGUCCA, GGUCCG, GGUCCU, GGUCGA, GGUCGC, GG UCGG, GGUCGU, GGUCUC, GGUCU U, GGUGAA, GGUGAC, GGUGAU, GGUGCA, GGUGCC, GGUGGC, GGUGUA, GGUGUC, GGU UAA, GGU UAG, GGU UAU, GGUUCA, GGU UCC, GGU UCG, GGU UGC, GGU UUC, GGUU UU, GUAAAA, GUAAAG, GUAAAU, GUAACC, GUAACG, GUAACU, GUAAGA, GUAAGC, GUAAGG, GUAAGU, GUAAUA, GUAAUC, GUAAUG, GUAAUU, GUACAA, GUACAC, GUACAG, GUACAU, GUACCA, GUACCC, GUACCG, GUACCU, GUACGA, GUACGC, GUACGG, GUACGU, GUACUA, GUACUC, GUACUG, GUACUU, GUAGAA, GUAGAC, GUAGCA, GUAGCC, GUAGCG, GUAGCU, GUAGGA, GUAGGC, GUAGGG,

GUAGGU, GUAGUA, GUAGUC, GUAUAA, GUAUAC, GUAUAG, GUAUAU, GUAUCA, GUAUCG, GUAUCU, GUAUGA, GUAUGC, GUAUGG, GUAUUA, GUAU UG, G UAU UU, GUCAAA, GUCAAG, GUCAAU, GUCACA, GUCACC, GUCACG, GUCAGA, GUCAGC, GUCAGG, GUCAUA, GUCAUC, GUCAUG, GUCCAA, GUCCAC, GUCCAU, GUCCCC, GUCCCU, GUCCGA, GUCCGC, GUCCGG, GUCCGU, GUCCUA, GUCCUG, GUCCU U, GUCGAA, GUCGAC, GUCGAG, GUCGAU, GUCGCA, GUCGCC, GUCGCG, GUCGCU, GUCGGA, GUCGGC, GUCGGG, GUCGGU, GUCGUA, GUCGUC, GUCGU U, GUCUAA, GUCUAG, GUCUCA, GUCUCC, GUCUCG, GUCUGA, GUCUGG, GUCUGU, GUCU UC, GUCU UU, GUGAAA, GUGAAC, GUGAAG, GUGACC, GUGACG, GUGAGA, GUGAGC, GUGAGU, GUGAUC, GUGAUG, GUGAUU, GUGCAC,

GUGCAU, GUGCCC, GUGCCG, GUGCGA, GUGCGG, GUGCGU, GUGCUA, GUGCUC, GUGCUG,

GUGGAG, GUGGCG, GUGGCU, GUGGGU, GUGGUC, GUGGUG, GUGUAA, GUGUAG, GUGUCG, GUGUGA, GUGUGC, GUGUGU, GUGUUG, GUGU UU, GU UAAA, GUUAAC, GUUAAG, GU UACA, GU UACC, GUUACG, GU UACU, GU UAGA, GUUAGC, GUUAGU, GUUAUA, GUUAUC, GUUAUG, GU UAUU, GUUCAA, GUUCAC, GUUCAG, GU UCCA, GUUCCG, GUUCGA, GU UCGC, GU UCGG, GU UCGU, GU UCUA, GUUCUG, GUUGAA, GU UGAC, GUUGAG, GUUGAU, GUUGCG, GUUGCU, GUUGGA, GU UGGC, GU UGGU, GU UGUC, GUUGUG, GUUGU U, GUU UAA, GU UUAC, GU UUAG, GUU UAU, GU UUCA, GUUUCC, GU UUCU, GU UUGA, GUU UGC, GUU UGG, GUU UGU, GU UUUA, GU UU UC,

GU UU UU, UAAAAA, UAAAAC, UAAAAG, UAAAAU, UAAACA, UAAACC, UAAACG, UAAACU, UAAAGA, UAAAGG, UAAAGU, UAAAUA, UAAAUC, UAAAUG, UAAAU U, U A AC A A, UAACAC, UAACAG, UAACCA, UAACCC, UAACCG, UAACCU, UAACGA, UAACGC, UAACGG, UAACGU, UAACUA, UAACUG, UAACUU, UAAGAG, UAAGAU, UAAGCA, UAAGCC, UAAGCG, UAAGCU, UAAGGA, UAAGGC, UAAGGG, UAAGGU, UAAGUA, UAAGUC, UAAGUG, UAAGUU, UAAUAA, UAAUCA, UAAUCC, UAAUCG, UAAUCU, UAAUGA, UAAUGG, UAAUGU, UAAUUA, UAAU UC, UAAUUG, UACAAC, UACAAG, UACAAU, UACACC, UACACG, UACACU, UACAGA, UACAGC, UACAUA, UACAUC, UACAU U, UACCAA, UACCAC, UACCAG, UACCAU, UACCCC, UACCCG, UACCCU, UACCGA, UACCGC, UACCGG, UACCGU, UACCUA, UACCUG, UACGAA, UACGAC, UACGAG, UACGAU, UACGCA, UACGCC, UACGCG, UACGCU, UACGGC, UACGGG, UACGGU, UACGUA, UACGUC, UACGUG, UACGUU, UACUAA, UACUAC, UACUAG, UACUAU, UACUCA, UACUCC, UACUCG, UACUCU, UACUGA, UACUGC, UACUGG, UACU UA, UACU UG, UACU UU, UAGAAA, UAGAAG, UAGAAU, UAGACA, UAGACG, UAGAGA, UAGAGC, UAGAGU, UAGAUA, UAGAUC, UAGAUG, UAGCAU, UAGCCC, UAGCCG, UAGCCU, UAGCGA, UAGCGC, UAGCGU, UAGCUA, UAGCUC, UAGCUG, UAGGAA, UAGGAU, UAGGCG, UAGGCU, UAGGGU, UAGGUC, UAGGUG, UAGGUU, UAGUAA, UAGUAC, UAGUAG, UAGUAU, UAGUCA, UAGUCG, UAGUGU, UAGUUA, UAGU UC, UAGU UG, UAGUU U,

UAUAAC, UAUAAG, UAUACU, UAUAGA, UAUAGC, UAUAGG, UAUAGU, UAUAUA, UAUAUC, UAUAUG, UAUAU U, UAUCAA, UAUCAC, UAUCAU, UAUCCA, UAUCCC, UAUCCG, UAUCCU, UAUCGA, UAUCGC, UAUCGG, UAUCGU, UAUCUA, UAUCUC, UAUCUG, UAUCUU, UAUGAA, UAUGAC, UAUGAG,

UAUGAU, UAUGCA, UAUGCG, UAUGCU, UAUGGA, UAUGGC, UAUGUC, UAUGUG, UAUGU U, UAU UAG, UAUUCA, UAU UCC, UAUUCG, UAUUCU, UAUUGA, UAUUGG, UAUU UA, UAU UUC,

UAU UUG, UAUUU U, UCAAAA, UCAAAC, UCAAAG, UCAACC, UCAACU, UCAAGA, UCAAGC, UCAAUA, UCAAUC, UCAAUG, UCAAUU, UCACCC, UCACCG, UCACCU, UCACGA, UCACGC, UCACGG, UCACGU, UCACUA, UCACUC, UCACUU, UCAGAA, UCAGAC, UCAGAG, UCAGCG, UCAGCU, UCAGGA, UCAGGC, UCAGGU, UCAGUC, UCAGU U, UCAUAA, UCAUCA, UCAUCC, UCAUCG, UCAUGC, UCAUGG, UCAUGU, UCAUUA, UCAUUG, UCCAAA, UCCAAC, UCCAAG, UCCAAU, UCCACA, UCCACC, UCCACG, UCCAGC, UCCAGG, UCCAUA, UCCAUC, UCCAU U, UCCCAA, UCCCAG, UCCCAU, UCCCCC, UCCCCG, UCCCCU, UCCCGA, UCCCGC, UCCCGG, UCCCGU, UCCCUA, UCCCUC, UCCGAA, UCCGAC, UCCGAG, UCCGAU, UCCGCA, UCCGCC, UCCGGA, UCCGGC, UCCGGU, UCCGUA, UCCGUC, UCCGUG, UCCUAA, UCCUCA, UCCUCG, UCCUCU, UCCUGC, UCCUGU, UCCU UA, UCCU UC, UCCUU U, UCGAAA, UCGAAC, UCGAAG, UCGAAU, UCGACA, UCGACC, UCGACG, UCGACU, UCGAGA, UCGAGC, UCGAGG, UCGAUA, UCGAUC, UCGAUG, UCGAU U, UCGCAA, UCGCAC, UCGCAG, UCGCAU, UCGCCA, UCGCCC, UCGCCG, UCGCCU, UCGCGA, UCGCGC, UCGCGU, UCGCUA, UCGCUC, UCGGAA, UCGGAC, UCGGAG, UCGGAU, UCGGCA, UCGGCU, UCGGGG, UCGGGU, UCGGUC, UCGGUG, UCGGU U, UCGUAA, UCGUAC, UCGUAG,

UCGUAU, UCGUCA, UCGUCC, UCGUCG, UCGUCU, UCGUGA, UCGUGU, UCGUUA, UCGU UC, UCGU UG, UCGU UU, UCUAAC, UCUAAG, UCUAAU, UCUACA, UCUACC, UCUACG, UCUACU, UCUAGC, UCUAGG, UCUAGU, UCUAUA, UCUAUC, UCUAUG, UCUAUU, UCUCAG, UCUCAU, UCUCCG, UCUCGC, UCUCGG, UCUCGU, UCUCUC, UCUGAA, UCUGAU, UCUGCA, UCUGCG, UCUGCU, UCUGGC, UCUGGU, UCUGUC, UCUGUG, UCUGU U, UCUUAA, UCU UAC, UCUUAG, UCU UAU, UCU UCA, UCUUCC, UCUUCG, UCUUCU, UCU UGC, UCU UGG, UCUUGU, UCUU UA, UCU UUC, UCU UUG, UCU UU U, UGAAAA, UGAAAC, UGAACA, UGAACC, UGAAGG, UGAAUC, UGAAUG, UGACAA, UGACAC, UGACAG, UGACCA, UGACCC, UGACCG, UGACGA, UGACGC, UGACGG, UGACGU, UGACUA, UGACUC, UGACU U, UGAGAG, UGAGAU, UGAGCA, UGAGCC, UGAGCU, UGAGGC, UGAGGU, UGAGUA, UGAGUU, UGAUAC, UGAUAG, UGAUAU, UGAUCA, UGAUCG, UGAUCU, UGAUGA, UGAUGC, UGAUGG, UGAUGU, UGAUUA, UGAU UC, UGAUUG, UGAUU U, UGCAAC, UGCAAG, UGCACA, UGCACG, UGCAGG, UGCAGU, UGCAUC, UGCCCA, UGCCCC, UGCCCG, UGCCGA, UGCCGC, UGCCGG, UGCCGU, UGCCUA, UGCCUC, UGCCUG, UGCCU U, UGCGAA, UGCGAC, UGCGAU, UGCGCC, UGCGCG, UGCGCU, UGCGGC, UGCGGG, UGCGGU, UGCGUA, UGCGUC, UGCGUG, UGCGUU, UGCUAC, UGCUAU, UGCUCC, UGCUCG, UGCUGC, UGCUGG, UGCUGU, UGCUUA, UGCUUU, UGGAAC, UGGAAG, UGGAGC, UGGAUC, UGGAUU, UGGCAA, UGGCAC, UGGCAG, UGGCCG, UGGCCU, UGGCGA, UGGCGC, UGGCGU, UGGCUA, UGGCUC, UGGCUU, UGGGAA, UGGGCA, UGGGCC, UGGGGC, UGGGUC, UGGUAA, UGGUAG, UGGUAU, UGGUCC, UGGUCG, UGGUCU, UGGUGA, UGGUGC, UGGUGG, UGGUGU, UGGU UA, UGGUUG, UGUAAA, UGUAAC, UGUAAG, UGUACC, UGUACG, UGUACU, UGUAGA, UGUAGC, UGUAGU, UGUAUC, UGUAUU, UGUCAA, UGUCAC, UGUCAG, UGUCAU, UGUCCA, UGUCCC, UG UCCG, UGUCGA, UGUCGC, UGUCGG, UGUCGU, UGUCUA, UGUCUC, UGUGAC, UGUGAG, UGUGAU, UGUGCA, UGUGGU, UGUGUA, UGUGUU, UGUUAC, UGUUAG, UGU UAU, UGUUCA, UGUUCC, UGUUCG, UGU UGG, UGUUGU, UGU UUA, UGU UUC, UGU UUG, UGUUU U, UUAAAA, U UAAAC, UUAAAG, UUAAAU,

U UAACC, U UAACG, U UAACU, UUAAGU, UUAAUA, UUAAUC, UUAAUG, UUAAU U, UUACAA, UUACAC, U UACAG, UUACAU, U UACCA, UUACCC, UUACCG, UUACCU, UUACGA, UUACGC, UUACGG, U UACGU, U UACUA, U UACUC, UUACUG, UUACUU, U UAGAA, UUAGAC, U UAGCC, U UAGCG, U UAGCU, U UAGGC, U UAGGU, U UAGUA, U UAGUC, U UAGUU, U UAUAA, U UAUAC, U UAUAG, UUAUAU, U UAUCC, U UAUCG, UUAUCU, U UAUGA, U UAUGG, UUAUGU, UUAU UA, UUAUUC, UUAU UG, UUAU UU,

U UCAAC, U UCAAU, UUCACA, U UCACC, UUCACG, UUCACU, U UCAGC, U UCAGG, UUCAGU, UUCAUA, UUCAUC, UUCAUG, UUCAUU, UUCCAA, UUCCCA, UUCCCG, UUCCGA, UUCCGU, UUCCUU, UUCGAA, UUCGAC, UUCGAG, UUCGAU, UUCGCA, UUCGCC, UUCGCG, UUCGCU, UUCGGA, UUCGGC, UUCGGG, UUCGGU, UUCGUA, UUCGUC, UUCGUG, UUCGUU, UUCUAC, UUCUAG, UUCUCA, UUCUCG,

UUCUGG, UUCUUA, UUCUUU, UUGAAA, UUGAAC, UUGAAG, UUGAAU, UUGACC, UUGACG, UUGACU, UUGAGA, UUGAGC, UUGAGU, UUGAUA, UUGAUC, UUGAUG, UUGAUU, UUGCAA,

UUGCAC, UUGCAG, UUGCAU, UUGCCC, UUGCCG, UUGCGA, UUGCGC, UUGCGG, UUGCGU, UUGCUA, UUGCUC, UUGCUG, UUGCUU, UUGGAA, UUGGAG, UUGGCC, UUGGCG, UUGGCU, UUGGGC, UUGGGU, UUGGUA, UUGGUG, UUGUAA, UUGUAC, UUGUCA, UUGUCG, UUGUCU, UUGUGC, UUGUGG, UUGUUA, UUGUUG, UUGUUU, UUUAAA, UUUAAC, UUUAAG, UUUAAU, UUUACA, UUUACC, UUUACG, UUUACU, UUUAGA, UUUAGC, UUUAGG, UUUAGU, UUUAUA, UUUAUC, UUUAUG, UUUAUU, UUUCAU, UUUCCA, UUUCCG, UUUCCU, UUUCGA, UUUCGC, UUUCGG, UUUCGU, UUUCUA, UUUCUC, UUUCUG, UUUCUU, UUUGAA, UUUGAC, UUUGAG, UUUGAU, UUUGCC, UUUGCU, UUUGGA, UUUGGC, UUUGGG, UUUGGU, UUUGUA, UUUGUC, UUUGUU, UUUUAA, UUUUAG, UUUUAU, UUUUCC, UUUUCG, UUUUCU, UUUUGA, UUUUGC, UUUUGG, UUUUGU, UUUUUA, UUUUUC, UUUUUU

Table 2: A listing of oligonucleotide modifications.

Symbol Feature Description

bio 5' biotin

dAs DNA w/3' thiophosphate

dCs DNA w/3' thiophosphate

dGs DNA w/3' thiophosphate

dTs DNA w/3' thiophosphate

dG DNA w/3' phosphate

dT DNA w/3' phosphate

dU deoxyuridine w/3' phosphate

d5mCs deoxy-5-methylcytidine w/3' thiophosphate

enaAs ENA w/3' thiophosphate

enaCs ENA w/3' thiophosphate

enaGs ENA w/3' thiophosphate

enaTs ENA w/3' thiophosphate

fluAs 2'-fluoro w/3' thiophosphate

fluCs 2'-fluoro w/3' thiophosphate

fluGs 2'-fluoro w/3' thiophosphate

fluUs 2'-fluoro w/3' thiophosphate

InaAs LNA w/3' thiophosphate

InaCs LNA w/3' thiophosphate InaGs LNA w/3' thiophosphate

InaTs LNA w/3' thiophosphate

omeAs 2'-OMe w/3' thiophosphate

omeCs 2'-OMe w/3' thiophosphate

omeGs 2'-OMe w/3' thiophosphate

omeTs 2'-OMe w/3' thiophosphate

InaAs-Sup LNA w/3' thiophosphate at 3' terminus

InaCs-Sup LNA w/3' thiophosphate at 3' terminus

InaGs-Sup LNA w/3' thiophosphate at 3' terminus

InaTs-Sup LNA w/3' thiophosphate at 3' terminus

InaA-Sup LNA w/3' OH at 3' terminus

InaC-Sup LNA w/3' OH at 3' terminus

InaG-Sup LNA w/3' OH at 3' terminus

InaT-Sup LNA w/3' OH at 3' terminus

omeA-Sup 2'-OMe w/3' OH at 3' terminus

omeC-Sup 2'-OMe w/3' OH at 3' terminus

omeG-Sup 2'-OMe w/3' OH at 3' terminus

omeU-Sup 2'-OMe w/3' OH at 3' terminus

dAs-Sup DNA w/3' thiophosphate at 3' terminus

dCs-Sup DNA w/3' thiophosphate at 3' terminus

dGs-Sup DNA w/3' thiophosphate at 3' terminus

dTs-Sup DNA w/3' thiophosphate at 3' terminus

dA-Sup DNA w/3' OH at 3' terminus

dC-Sup DNA w/3' OH at 3' terminus

dG-Sup DNA w/3' OH at 3' terminus

dT-Sup DNA w/3' OH at 3' terminus

dU deoxyuridine w/3' OH at 3' terminus

rA NA w/3' phosphate

rC RNA w/3' phosphate

rG RNA w/3' phosphate

rU RNA w/3' phosphate

Table 3: Formatted oligonucleotide sequences made for testing in the lab showing nucleotide modifications.

OligolD Base Sequence Formatted Sequence SeqID

HBB-01 G7A mOl GCTATTACCTTAACCC lnaGs;omeCs;lnaTs;omeAs;lnaTs 3789

AG ;omeUs;lnaAs;omeCs;lnaCs;ome Us;lnaTs;omeAs;lnaAs;omeCs;ln

aCs;omeCs;lnaAs;omeG-Sup

HBB-02 mOl GCTATTACCTTAACC lnaGs;omeCs;lnaTs;omeAs;lnaTs 3790

;omeUs;lnaAs;omeCs;lnaCs;ome

Us;lnaTs;omeAs;lnaAs;omeCs;ln aC-Sup

HBB-03 mOl CTATTACCTTAACCC lnaCs;omeUs;lnaAs;omeUs;lnaTs 3791

;omeAs;lnaCs;omeCs;lnaTs;ome

Us;lnaAs;omeAs;lnaCs;omeCs;ln aC-Sup

HBB-04 mOl TATTACCTTAACCCA lnaTs;omeAs;lnaTs;omeUs;lnaAs 3792

;omeCs;lnaCs;omeUs;lnaTs;ome

As;lnaAs;omeCs;lnaCs;omeCs;ln aA-Sup

HBB-05 mOl ATTACCTTAACCCAG lnaAs;omeUs;lnaTs;omeAs;lnaCs 3793

;omeCs;lnaTs;omeUs;lnaAs;ome

As;lnaCs;omeCs;lnaCs;omeAs;ln aG-Sup

HBB-06 705 CCTCTTACCTCAGTTA lnaCs;omeCs;lnaTs;omeCs;lnaTs; 3794 control mOl CA omeUs;lnaAs;omeCs;lnaCs;ome

Us;lnaCs;omeAs;lnaGs;omeUs;ln aTs;omeAs;lnaCs;omeA-Sup

HBB-01_G7A m01 GCTATTACCTTAACCC lnaGs;omeCs;lnaTs;omeAs;lnaTs 3789

AG ;omeUs;lnaAs;omeCs;lnaCs;ome

Us;lnaTs;omeAs;lnaAs;omeCs;ln aCs;omeCs;lnaAs;omeG-Sup

HBB-06 705 CCTCTTACCTCAGTTA lnaCs;omeCs;lnaTs;omeCs;lnaTs; 3794 control m02 CA omeUs;lnaAs;omeCs;lnaCs;ome

+bioTEGs +heg Us;lnaCs;omeAs;lnaGs;omeUs;ln

aTs;omeAs;lnaCs;omeA-Sup

HBB-01_G7A m01 GCTATTACCTTAACCC lnaGs;omeCs;lnaTs;omeAs;lnaTs 3789

AG ;omeUs;lnaAs;omeCs;lnaCs;ome

Us;lnaTs;omeAs;lnaAs;omeCs;ln aCs;omeCs;lnaAs;omeG-Sup

HBB-01_G7A GCTATTACCTTAACCC lnaGs;omeCs;lnaTs;omeAs;lnaTs 3795 dimer mlOOO AG 1 1 1 1 GCTATTACCT ;omeUs;lnaAs;omeCs;lnaCs;ome

TAACCCAG Us;lnaTs;omeAs;lnaAs;omeCs;ln

aCs;omeCs;lnaAs;omeG;dT;dT;d

T;dT;lnaGs;omeCs;lnaTs;omeAs;l naTs;omeUs;lnaAs;omeCs;lnaCs;

omeUs;lnaTs;omeAs;lnaAs;ome

Cs;lnaCs;omeCs;lnaAs;omeG-

Sup

HBB-06 705 CCTCTTACCTCAGTTA lnaCs;omeCs;lnaTs;omeCs;lnaTs; 3794 control mlOOO CA omeUs;lnaAs;omeCs;lnaCs;ome

Us;lnaCs;omeAs;lnaGs;omeUs;ln aTs;omeAs;lnaCs;omeA-Sup HBB-06 705 CCTCTTACCTCAGTTA lnaCs;omeCs;lnaTs;omeCs;lnaTs; 3796 control dimer CA I 1 1 I CCTCTTACCT omeUs;lnaAs;omeCs;lnaCs;ome mlOOO CAGTTACA Us;lnaCs;omeAs;lnaGs;omeUs;ln

aTs;omeAs;lnaCs;omeA;dT;dT;d

T;dT;lnaCs;omeCs;lnaTs;omeCs;l naTs;omeUs;lnaAs;omeCs;lnaCs;

omeUs;lnaCs;omeAs;lnaGs;ome

Us;lnaTs;omeAs;lnaCs;omeA-

Sup

HBB-28 m08 ACTTTTATGCCCAGC lnaAs;lnaCs;lnaTs;dTs;dTs;dTs;d 3797

As;dTs;dGs;dCs;dCs;dCs;lnaAs;ln aGs;lnaC-Sup

HBB-29 m08 TTTTATGCCCAGCCC lnaTs;lnaTs;lnaTs;dTs;dAs;dTs;d 3798

Gs;dCs;dCs;dCs;dAs;dGs;lnaCs;ln aCs;lnaC-Sup

HBB-30 m08 GTAGATTGGCCAACC lnaGs;lnaTs;lnaAs;dGs;dAs;dTs;d 3799

Ts;dGs;dGs;dCs;dCs;dAs;lnaAs;ln aCs;lnaC-Sup

HBB-31 m08 TAGATTGGCCAACCC lnaTs;lnaAs;lnaGs;dAs;dTs;dTs;d 3800

Gs;dGs;dCs;dCs;dAs;dAs;lnaCs;ln aCs;lnaC-Sup

HBB-32 m08 GATTGGCCAACCCTA lnaGs;lnaAs;lnaTs;dTs;dGs;dGs;d 3801

Cs;dCs;dAs;dAs;dCs;dCs;lnaCs;ln aTs;lnaA-Sup

HBB-33 m08 ATTGGCCAACCCTAG lnaAs;lnaTs;lnaTs;dGs;dGs;dCs;d 3802

Cs;dAs;dAs;dCs;dCs;dCs;lnaTs;ln aAs;lnaG-Sup

HBB-34 m08 TGATGACAGCCGTAC lnaTs;lnaGs;lnaAs;dTs;dGs;dAs;d 3803

Cs;dAs;dGs;dCs;dCs;dGs;lnaTs;ln aAs;lnaC-Sup

HBB-35 m08 CAGCCGTACCTGTCC lnaCs;lnaAs;lnaGs;dCs;dCs;dGs;d 3804

Ts;dAs;dCs;dCs;dTs;dGs;lnaTs;ln aCs;lnaC-Sup

HBB-36 m08 CCGTACCTGTCCTTG lnaCs;lnaCs;lnaGs;dTs;dAs;dCs;d 3805

Cs;dTs;dGs;dTs;dCs;dCs;lnaTs;ln aTs;lnaG-Sup

HBB-37 m08 CGTACCTGTCCTTGG lnaCs;lnaGs;lnaTs;dAs;dCs;dCs;d 3806

Ts;dGs;dTs;dCs;dCs;dTs;lnaTs;ln aGs;lnaG-Sup

HBB-38 m08 TTCTGGCACTGGCTT lnaTs;lnaTs;lnaCs;dTs;dGs;dGs;d 3807

Cs;dAs;dCs;dTs;dGs;dGs;lnaCs;ln aTs;lnaT-Sup

HBB-39 m08 GCACTGGCTTAGGAG lnaGs;lnaCs;lnaAs;dCs;dTs;dGs;d 3808

Gs;dCs;dTs;dTs;dAs;dGs;lnaGs;ln aAs;lnaG-Sup

HBB-40 m08 G CTCTTCTG G C ACTG lnaGs;lnaCs;lnaTs;dCs;dTs;dTs;d 3809

GCTTAGGAGT Cs;dTs;dGs;dGs;dCs;dAs;dCs;dTs ;dGs;dGs;dCs;dTs;dTs;dAs;dGs;d

Gs;lnaAs;lnaGs;lnaT-Sup

HBB-41 m08 TCTG G C ACTG G CTT A lnaTs;lnaCs;lnaTs;dGs;dGs;dCs;d 3810

GGAGTTGGAC As;dCs;dTs;dGs;dGs;dCs;dTs;dTs

;dAs;dGs;dGs;dAs;dGs;dTs;dTs;d

Gs;lnaGs;lnaAs;lnaC-Sup

HBB-42 m08 GCACTGGCTTAGGAG lnaGs;lnaCs;lnaAs;dCs;dTs;dGs;d 3811

TTGGACTTCA Gs;dCs;dTs;dTs;dAs;dGs;dGs;dAs

;dGs;dTs;dTs;dGs;dGs;dAs;dCs;d

Ts;lnaTs;lnaCs;lnaA-Sup

HBB-43 m08 A I LG I 1 1 I LLLAA I 1 lnaAs;lnaTs;lnaCs;dGs;dTs;dTs;d 3812

Ts;dTs;dCs;dCs;dCs;dAs;lnaAs;ln aTs;lnaT-Sup

HBB-44 m08 A AA ACTCTACCTCG G lnaAs;lnaAs;lnaAs;dAs;dCs;dTs;d 3813

Cs;dTs;dAs;dCs;dCs;dTs;lnaCs;ln aGs;lnaG-Sup

HBB-45 m08 ACCTCG GTTCT AAG C lnaAs;lnaCs;lnaCs;dTs;dCs;dGs;d 3814

Gs;dTs;dTs;dCs;dTs;dAs;lnaAs;ln aGs;lnaC-Sup

HBB-46 m08 G CTCTGTG C ATT AGT lnaGs;lnaCs;lnaTs;dCs;dTs;dGs;d 3815

Ts;dGs;dCs;dAs;dTs;dTs;lnaAs;ln aGs;lnaT-Sup

HBB-47 m08 AATAGGAGGTTAACT lnaAs;lnaAs;lnaTs;dAs;dGs;dGs;d 3816

As;dGs;dGs;dTs;dTs;dAs;lnaAs;ln aCs;lnaT-Sup

HBB-48 m08 GGTCTTCTACTTGGC lnaGs;lnaGs;lnaTs;dCs;dTs;dTs;d 3817

Cs;dTs;dAs;dCs;dTs;dTs;lnaGs;ln aGs;lnaC-Sup

HBB-49 m08 G CTG C A AG CTAT ACT lnaGs;lnaCs;lnaTs;dGs;dCs;dAs;d 3818

As;dGs;dCs;dTs;dAs;dTs;lnaAs;ln aCs;lnaT-Sup

HBB-50 m08 G GTG C AG GTCTATTC lnaGs;lnaGs;lnaTs;dGs;dCs;dAs;d 3819

Gs;dGs;dTs;dCs;dTs;dAs;lnaTs;ln aTs;lnaC-Sup

HBB-51 m08 CCTCTAAGTATACCC lnaCs;lnaCs;lnaTs;dCs;dTs;dAs;d 3820

As;dGs;dTs;dAs;dTs;dAs;lnaCs;ln aCs;lnaC-Sup

HBB-52 m08 GCTTGACCTAGGAAC lnaGs;lnaCs;lnaTs;dTs;dGs;dAs;d 3821

Cs;dCs;dTs;dAs;dGs;dGs;lnaAs;ln aAs;lnaC-Sup

HBB-53 m08 AACCAGAACCCATAG lnaAs;lnaAs;lnaCs;dCs;dAs;dGs;d 3822

As;dAs;dCs;dCs;dCs;dAs;lnaTs;ln aAs;lnaG-Sup

HBB-54 m08 GAAACAAACCGCACA lnaGs;lnaAs;lnaAs;dAs;dCs;dAs;d 3823

As;dAs;dCs;dCs;dGs;dCs;lnaAs;ln aCs;lnaA-Sup

HBB-55 m08 CGAATACCACACAAG lnaCs;lnaGs;lnaAs;dAs;dTs;dAs;d 3824 Cs;dCs;dAs;dCs;dAs;dCs;lnaAs;ln

aAs;lnaG-Sup

HBB-56 m08 TACTTGTGACCTCTC lnaTs;lnaAs;lnaCs;dTs;dTs;dGs;d 3825

Ts;dGs;dAs;dCs;dCs;dTs;lnaCs;ln aTs;lnaC-Sup

HBB-57 m08 CTATACCCCATCAAG lnaCs;lnaTs;lnaAs;dTs;dAs;dCs;d 3826

Cs;dCs;dCs;dAs;dTs;dCs;lnaAs;ln aAs;lnaG-Sup

HBB-58 m08 GTTGGAGTTTAATCG lnaGs;lnaTs;lnaTs;dGs;dGs;dAs;d 3827

Gs;dTs;dTs;dTs;dAs;dAs;lnaTs;ln aCs;lnaG-Sup

HBB-59 m08 TTGGAGTTTAATCGT lnaTs;lnaTs;lnaGs;dGs;dAs;dGs;d 3828

Ts;dTs;dTs;dAs;dAs;dTs;lnaCs;ln aGs;lnaT-Sup

HBB-60 m08 GGAGTTTAATCGTAG lnaGs;lnaGs;lnaAs;dGs;dTs;dTs;d 3829

Ts;dAs;dAs;dTs;dCs;dGs;lnaTs;ln aAs;lnaG-Sup

HBB-61 m08 GAGTTTAATCGTAGC lnaGs;lnaAs;lnaGs;dTs;dTs;dTs;d 3830

As;dAs;dTs;dCs;dGs;dTs;lnaAs;ln aGs;lnaC-Sup

HBB-62 m08 AGTTT AATCGT AG C A lnaAs;lnaGs;lnaTs;dTs;dTs;dAs;d 3831

As;dTs;dCs;dGs;dTs;dAs;lnaGs;ln aCs;lnaA-Sup

HBB-63 m08 GTTTAATCGTAGCAT lnaGs;lnaTs;lnaTs;dTs;dAs;dAs;d 3832

Ts;dCs;dGs;dTs;dAs;dGs;lnaCs;ln aAs;lnaT-Sup

HBB-64 m08 TT AATCGT AGCATTA lnaTs;lnaTs;lnaAs;dAs;dTs;dCs;d 3833

Gs;dTs;dAs;dGs;dCs;dAs;lnaTs;ln aTs;lnaA-Sup

HBB-65 m08 TA ATCGT AG C ATT AC lnaTs;lnaAs;lnaAs;dTs;dCs;dGs;d 3834

Ts;dAs;dGs;dCs;dAs;dTs;lnaTs;ln aAs;lnaC-Sup

HBB-66 m08 AATCGTAGCATTACC lnaAs;lnaAs;lnaTs;dCs;dGs;dTs;d 3835

As;dGs;dCs;dAs;dTs;dTs;lnaAs;ln aCs;lnaC-Sup

HBB-67 m08 ATCGTAGCATTACCC lnaAs;lnaTs;lnaCs;dGs;dTs;dAs;d 3836

Gs;dCs;dAs;dTs;dTs;dAs;lnaCs;ln aCs;lnaC-Sup

HBB-68 m08 CGTAGCATTACCCTT lnaCs;lnaGs;lnaTs;dAs;dGs;dCs;d 3837

As;dTs;dTs;dAs;dCs;dCs;lnaCs;ln aTs;lnaT-Sup

HBB-69 m08 GTAGCATTACCCTTG lnaGs;lnaTs;lnaAs;dGs;dCs;dAs;d 3838

Ts;dTs;dAs;dCs;dCs;dCs;lnaTs;ln aTs;lnaG-Sup

HBB-70 m08 GTTGAGGTCTTCCCT lnaGs;lnaTs;lnaTs;dGs;dAs;dGs;d 3839

Gs;dTs;dCs;dTs;dTs;dCs;lnaCs;ln aCs;lnaT-Sup HBB-71 m08 TTAATGCCTTGTACG lnaTs;lnaTs;lnaAs;dAs;dTs;dGs;d 3840

Cs;dCs;dTs;dTs;dGs;dTs;lnaAs;ln aCs;lnaG-Sup

HBB-72 m08 TAATGCCTTGTACGG lnaTs;lnaAs;lnaAs;dTs;dGs;dCs;d 3841

Cs;dTs;dTs;dGs;dTs;dAs;lnaCs;ln aGs;lnaG-Sup

HBB-73 m08 TGCCTTGTACGGTTC lnaTs;lnaGs;lnaCs;dCs;dTs;dTs;d 3842

Gs;dTs;dAs;dCs;dGs;dGs;lnaTs;ln aTs;lnaC-Sup

HBB-74 m08 GCCTTGTACGGTTCC lnaGs;lnaCs;lnaCs;dTs;dTs;dGs;d 3843

Ts;dAs;dCs;dGs;dGs;dTs;lnaTs;ln aCs;lnaC-Sup

HBB-75 m08 CCTTGTACGGTTCCC lnaCs;lnaCs;lnaTs;dTs;dGs;dTs;d 3844

As;dCs;dGs;dGs;dTs;dTs;lnaCs;ln aCs;lnaC-Sup

HBB-76 m08 CTTGTACGGTTCCCT lnaCs;lnaTs;lnaTs;dGs;dTs;dAs;d 3845

Cs;dGs;dGs;dTs;dTs;dCs;lnaCs;ln aCs;lnaT-Sup

HBB-77 m08 TGTACGGTTCCCTTG lnaTs;lnaGs;lnaTs;dAs;dCs;dGs;d 3846

Gs;dTs;dTs;dCs;dCs;dCs;lnaTs;ln aTs;lnaG-Sup

HBB-78 m08 GTACGGTTCCCTTGC lnaGs;lnaTs;lnaAs;dCs;dGs;dGs;d 3847

Ts;dTs;dCs;dCs;dCs;dTs;lnaTs;lna

Gs;lnaC-Sup

HBB-79 m08 CAAGTGCTCCCTATC lnaCs;lnaAs;lnaAs;dGs;dTs;dGs;d 3848

Cs;dTs;dCs;dCs;dCs;dTs;lnaAs;ln aTs;lnaC-Sup

HBB-80 m08 GCTCCCTATCTGTAG lnaGs;lnaCs;lnaTs;dCs;dCs;dCs;d 3849

Ts;dAs;dTs;dCs;dTs;dGs;lnaTs;ln aAs;lnaG-Sup

HBB-07 ml3 GCCATCTATTGCTTAC rG;rC;rC;rA;rU;rC;rU;rA;rU;rU;rG 3850

ATT ;rC;rU;rU;rA;rC;rA;rU;rU;dU;dU- Sup

HBB-08 ml3 CCATCTATTGCTTACA rC;rC;rA;rU;rC;rU;rA;rU;rU;rG;rC; 3851

TTT rU;rU;rA;rC;rA;rU;rU;rU;dU;dU-

Sup

HBB-09 ml3 C ATCTATTG CTT AC AT rC;rA;rU;rC;rU;rA;rU;rU;rG;rC;rU 3852

TTG ;rU;rA;rC;rA;rU;rU;rU;rG;dU;dU- Sup

HBB-10 ml3 ATCTATTG CTTAC ATT rA;rU;rC;rU;rA;rU;rU;rG;rC;rU;rU 3853

TGC ;rA;rC;rA;rU;rU;rU;rG;rC;dU;dU-

Sup

HBB-11 ml3 TCTATTG CTT AC ATTT rU;rC;rU;rA;rU;rU;rG;rC;rU;rU;rA 3854

GCT ;rC;rA;rU;rU;rU;rG;rC;rU;dU;dU-

Sup

HBB-12 ml3 CT ATTG CTTAC ATTTG rC;rU;rA;rU;rU;rG;rC;rU;rU;rA;rC 3855

CTT ;rA;rU;rU;rU;rG;rC;rU;rU;dU;dU- Sup

HBB-13 ml3 TATTG CTT AC ATTTG C rU;rA;rU;rU;rG;rC;rU;rU;rA;rC;rA 3856

TTC ;rU;rU;rU;rG;rC;rU;rU;rC;dU;dU- Sup

HBB-14 ml3 ATTG CTTAC ATTTG CT rA;rU;rU;rG;rC;rU;rU;rA;rC;rA;rU 3857

TCT ;rU;rU;rG;rC;rU;rU;rC;rU;dU;dU- Sup

HBB-15 ml3 TTG CTT AC ATTTG CTT rU;rU;rG;rC;rU;rU;rA;rC;rA;rU;rU 3858

CTG ;rU;rG;rC;rU;rU;rC;rU;rG;dU;dU- Sup

HBB-16 ml3 TG CTT AC ATTTG CTTC rU;rG;rC;rU;rU;rA;rC;rA;rU;rU;rU 3859

TGA ;rG;rC;rU;rU;rC;rU;rG;rA;dU;dU- Sup

HBB-17 ml3 G CTTAC ATTTG CTTCT rG;rC;rU;rU;rA;rC;rA;rU;rU;rU;rG 3860

GAC ;rC;rU;rU;rC;rU;rG;rA;rC;dU;dU-

Sup

HBB-18 ml3 CTTAC ATTTG CTTCTG rC;rU;rU;rA;rC;rA;rU;rU;rU;rG;rC 3861

ACA ;rU;rU;rC;rU;rG;rA;rC;rA;dU;dU- Sup

HBB-19 ml3 TTACATTTGCTTCTGA rU;rU;rA;rC;rA;rU;rU;rU;rG;rC;rU 3862

CAC ;rU;rC;rU;rG;rA;rC;rA;rC;dU;dU-

Sup

HBB-20 ml3 TACATTTGCTTCTGAC rU;rA;rC;rA;rU;rU;rU;rG;rC;rU;rU 3863

ACA ;rC;rU;rG;rA;rC;rA;rC;rA;dU;dU-

Sup

HBB-21 ml3 ACATTTGCTTCTGAC rA;rC;rA;rU;rU;rU;rG;rC;rU;rU;rC 3864

ACAA ;rU;rG;rA;rC;rA;rC;rA;rA;dU;dU-

Sup

HBB-22 ml3 CATTTGCTTCTGACAC rC;rA;rU;rU;rU;rG;rC;rU;rU;rC;rU 3865

AAC ; rG ; r A; rC; r A; rC; r A; r A; rC; d U ; d U - Sup

HBB-23 ml3 ATTTGCTTCTGACAC rA;rU;rU;rU;rG;rC;rU;rU;rC;rU;rG 3866

AACT ; r A; rC; r A; rC; r A; r A; rC; r U ; d U ; d U - Sup

HBB-24 ml3 TTTGCTTCTGACACA rU;rU;rU;rG;rC;rU;rU;rC;rU;rG;rA 3867

ACTG ;rC;rA;rC;rA;rA;rC;rU;rG;dU;dU-

Sup

HBB-25 ml3 TTGCTTCTGACACAA rU;rU;rG;rC;rU;rU;rC;rU;rG;rA;rC 3868

CTGT ;rA;rC;rA;rA;rC;rU;rG;rU;dU;dU-

Sup

HBB-26 ml3 TGCTTCTGACACAAC rU;rG;rC;rU;rU;rC;rU;rG;rA;rC;rA 3869

TGTG ;rC;rA;rA;rC;rU;rG;rU;rG;dU;dU- Sup

HBB-27 ml3 CACAACTGTGTTCAC rC;rA;rC;rA;rA;rC;rU;rG;rU;rG;rU; 3870

TAGC rU;rC;rA;rC;rU;rA;rG;rC;dU;dU-

Sup HBG2-01 ml3 AGGGCAAAGAACAG r A; rG ; rG; rG ; rC; r A; r A; r A; rG ; r A; r A; 3871 GTGGG rC;rA;rG;rG;rU;rG;rG;rG;dU;dU-

Sup

HBG2-02 ml3 GGCAAAGAACAGGT rG ; rG ; rC; r A; r A; r A; rG ; r A; r A; rC; r A; 3872

GGGAA rG;rG;rU;rG;rG;rG;rA;rA;dU;dU-

Sup

HBG2-03 ml3 GCAAAGAACAGGTG rG ; rC; r A; r A; r A; rG ; r A; r A; rC; r A; rG ; 3873

GGAAT rG;rU;rG;rG;rG;rA;rA;rU;dU;dU-

Sup

HBG2-04 ml3 AGAACAGGTGGGAA rA;rG;rA;rA;rC;rA;rG;rG;rU;rG;rG 3874

TGTAT ;rG;rA;rA;rU;rG;rU;rA;rU;dU;dU- Sup

HBG2-05 ml3 GGGAATGTATCCAGA rG;rG;rG;rA;rA;rU;rG;rU;rA;rU;rC 3875

GAAT ;rC;rA;rG;rA;rG;rA;rA;rU;dU;dU-

Sup

HBG2-06 ml3 GGAATGTATCCAGAG rG;rG;rA;rA;rU;rG;rU;rA;rU;rC;rC 3876

AATC ;rA;rG;rA;rG;rA;rA;rU;rC;dU;dU-

Sup

HBG2-07 ml3 GAATGTATCCAGAGA rG;rA;rA;rU;rG;rU;rA;rU;rC;rC;rA 3877

ATCA ;rG;rA;rG;rA;rA;rU;rC;rA;dU;dU- Sup

HBG2-08 ml3 AATGTATCCAGAGAA rA;rA;rU;rG;rU;rA;rU;rC;rC;rA;rG 3878

TCAA ; r A; r G ; r A; r A; r U ; r C; r A; r A; d U ; d U - Sup

HBG2-09 ml3 ATGTATCCAGAGAAT rA;rU;rG;rU;rA;rU;rC;rC;rA;rG;rA 3879

CAAC ;rG;rA;rA;rU;rC;rA;rA;rC;dU;dU-

Sup

HBG2-10 ml3 TGTATCCAGAGAATC rU;rG;rU;rA;rU;rC;rC;rA;rG;rA;rG 3880

AACA ; r A; r A; r U ; rC; r A; r A; rC; r A; d U ; d U - Sup

HBG2-11 ml3 GTATCCAGAGAATCA rG;rU;rA;rU;rC;rC;rA;rG;rA;rG;rA 3881

ACAT ;rA;rU;rC;rA;rA;rC;rA;rU;dU;dU-

Sup

HBG2-12 ml3 TATCCAGAGAATCAA r U ; r A; r U ; rC; rC; r A; rG ; r A; rG ; r A; r A; 3882

CATA rU;rC;rA;rA;rC;rA;rU;rA;dU;dU-

Sup

HBG2-13 ml3 ATCCAGAGAATCAAC rA;rU;rC;rC;rA;rG;rA;rG;rA;rA;rU; 3883

ATAC rC; r A; r A; rC; r A; r U ; r A; rC; d U ; d U - Sup

HBG2-14 ml3 TCCAGAGAATCAACA r U ; rC; rC; r A; rG ; r A; rG ; r A; r A; r U ; rC; 3884

TACA r A; r A; rC; r A; r U ; r A; rC; r A; d U ; d U - Sup

HBG2-15 ml3 AGAGAATCAACATAC r A; rG ; r A; rG ; r A; r A; r U ; rC; r A; r A; rC; 3885

AATG rA;rU;rA;rC;rA;rA;rU;rG;dU;dU-

Sup

HBG2-16 ml3 AGAATCAACATACAA r A; rG ; r A; r A; r U ; rC; r A; r A; rC; r A; r U ; 3886

TGAG rA;rC;rA;rA;rU;rG;rA;rG;dU;dU- Sup

HBG2-17 ml3 GAATCAACATACAAT rG ; r A; r A; r U ; rC; r A; r A; rC; r A; r U ; r A; 3887

GAGC rC;rA;rA;rU;rG;rA;rG;rC;dU;dU-

Sup

HBG2-18 ml3 AATCAACATACAATG r A; r A; r U ; rC; r A; r A; rC; r A; r U ; r A; rC; 3888

AGCA rA;rA;rU;rG;rA;rG;rC;rA;dU;dU-

Sup

HBG2-19 ml3 ATCAACATACAATGA r A; r U ; rC; r A; r A; rC; r A; r U ; r A; rC; r A; 3889

GCAA rA;rU;rG;rA;rG;rC;rA;rA;dU;dU-

Sup

HBG2-20 ml3 TCAACATACAATGAG r U ; rC; r A; r A; rC; r A; r U ; r A; rC; r A; r A; 3890

CAAA rU;rG;rA;rG;rC;rA;rA;rA;dU;dU-

Sup

HBG2-21 ml3 CAACATACAATGAGC rC; r A; r A; rC; r A; r U ; r A; rC; r A; r A; r U ; 3891

AAAA rG ; r A; rG; rC; r A; r A; r A; r A; d U ; d U - Sup

HBG2-22 ml3 AACATACAATGAGCA r A; r A; rC; r A; r U ; r A; rC; r A; r A; r U ; rG ; 3892

AAAG r A; rG ; rC; r A; r A; r A; r A; rG ; d U ; d U - Sup

HBG2-23 ml3 GGAAGCACCCTTCAG rG ; rG ; r A; r A; rG; rC; r A; rC; rC; rC; r U ; 3893

CAGT rU;rC;rA;rG;rC;rA;rG;rU;dU;dU-

Sup

HBG2-24 ml3 GAAGCACCCTTCAGC rG;rA;rA;rG;rC;rA;rC;rC;rC;rU;rU; 3894

AGTT rC;rA;rG;rC;rA;rG;rU;rU;dU;dU-

Sup

HBG2-25 ml3 AGCACCCTTCAGCAG rA;rG;rC;rA;rC;rC;rC;rU;rU;rC;rA; 3895

TTCC rG;rC;rA;rG;rU;rU;rC;rC;dU;dU-

Sup

HBG2-26 ml3 GCACCCTTCAGCAGT rG;rC;rA;rC;rC;rC;rU;rU;rC;rA;rG; 3896

TCCA rC;rA;rG;rU;rU;rC;rC;rA;dU;dU-

Sup

HBG2-27 ml3 CACCCTTCAGCAGTT rC;rA;rC;rC;rC;rU;rU;rC;rA;rG;rC; 3897

CCAC rA;rG;rU;rU;rC;rC;rA;rC;dU;dU-

Sup

HBG2-28 ml3 ACCCTTC AG C AGTTC rA;rC;rC;rC;rU;rU;rC;rA;rG;rC;rA; 3898

CACA rG;rU;rU;rC;rC;rA;rC;rA;dU;dU-

Sup

HBG2-29 ml3 CCCTTCAGCAGTTCC rC;rC;rC;rU;rU;rC;rA;rG;rC;rA;rG; 3899

ACAC rU;rU;rC;rC;rA;rC;rA;rC;dU;dU-

Sup

HBG2-30 ml3 CCTTCAGCAGTTCCA rC;rC;rU;rU;rC;rA;rG;rC;rA;rG;rU; 3900

CACA r U ; rC; rC; r A; rC; r A; rC; r A; d U ; d U - Sup

HBG2-31 ml3 CTTC AG C AGTTCC AC rC;rU;rU;rC;rA;rG;rC;rA;rG;rU;rU 3901

ACAC ; rC; rC; r A; rC; r A; rC; r A; rC; d U ; d U - Sup HBG2-32 ml3 TTCAGCAGTTCCACA rU;rU;rC;rA;rG;rC;rA;rG;rU;rU;rC 3902 CACT ;rC;rA;rC;rA;rC;rA;rC;rU;dU;dU-

Sup

HBG2-33 ml3 C AG C AGTTCC AC AC A rC;rA;rG;rC;rA;rG;rU;rU;rC;rC;rA; 3903

CTCG rC;rA;rC;rA;rC;rU;rC;rG;dU;dU-

Sup

HBG2-34 ml3 AGCAGTTCCACACAC rA;rG;rC;rA;rG;rU;rU;rC;rC;rA;rC; 3904

TCGC rA;rC;rA;rC;rU;rC;rG;rC;dU;dU-

Sup

HBG2-35 ml3 G C AGTTCC AC AC ACT rG;rC;rA;rG;rU;rU;rC;rC;rA;rC;rA; 3905

CGCT rC;rA;rC;rU;rC;rG;rC;rU;dU;dU-

Sup

HBG2-36 ml3 CAGTTCCACACACTC rC;rA;rG;rU;rU;rC;rC;rA;rC;rA;rC; 3906

GCTT rA;rC;rU;rC;rG;rC;rU;rU;dU;dU-

Sup

HBG2-37 ml3 AGTTCCACACACTCG rA;rG;rU;rU;rC;rC;rA;rC;rA;rC;rA; 3907

CTTC rC;rU;rC;rG;rC;rU;rU;rC;dU;dU-

Sup

HBG2-38 ml3 GTTCCACACACTCGC rG ; r U ; r U ; rC; rC; r A; rC; r A; rC; r A; rC; 3908

TTCT rU;rC;rG;rC;rU;rU;rC;rU;dU;dU-

Sup

HBG2-39 ml3 TTCC AC AC ACTCG CTT r U ; r U ; rC; rC; r A; rC; r A; rC; r A; rC; r U ; 3909

CTG rC;rG;rC;rU;rU;rC;rU;rG;dU;dU-

Sup

HBG2-40 ml3 TCCACACACTCGCTTC r U ; rC; rC; r A; rC; r A; rC; r A; rC; r U ; rC; 3910

TGG rG;rC;rU;rU;rC;rU;rG;rG;dU;dU- Sup

HBG2-41 ml3 CCACACACTCGCTTCT rC; rC; r A; rC; r A; rC; r A; rC; r U ; rC; rG; 3911

GGA rC;rU;rU;rC;rU;rG;rG;rA;dU;dU-

Sup

HBG2-42 ml3 ACACACTCGCTTCTG rA;rC;rA;rC;rA;rC;rU;rC;rG;rC;rU; 3912

GAAC rU;rC;rU;rG;rG;rA;rA;rC;dU;dU-

Sup

HBG2-43 ml3 C AC ACTCG CTTCTG G rC;rA;rC;rA;rC;rU;rC;rG;rC;rU;rU; 3913

AACG rC;rU;rG;rG;rA;rA;rC;rG;dU;dU-

Sup

HBG2-44 ml3 ACACTCGCTTCTGGA rA;rC;rA;rC;rU;rC;rG;rC;rU;rU;rC; 3914

ACGT rU;rG;rG;rA;rA;rC;rG;rU;dU;dU-

Sup

HBG2-45 ml3 CACTCGCTTCTGGAA rC;rA;rC;rU;rC;rG;rC;rU;rU;rC;rU; 3915

CGTC rG;rG;rA;rA;rC;rG;rU;rC;dU;dU-

Sup

HBG2-46 ml3 ACTCGCTTCTGGAAC rA;rC;rU;rC;rG;rC;rU;rU;rC;rU;rG 3916

GTCT ;rG;rA;rA;rC;rG;rU;rC;rU;dU;dU- Sup

HBB-01_G7A GCTATTACCTTAACCC lnaGs;omeCs;lnaTs;omeAs;lnaTs 3795 dimer mlOOO AG 1 1 1 1 GCTATTACCT ;omeUs;lnaAs;omeCs;lnaCs;ome TAACCCAG Us;lnaTs;omeAs;lnaAs;omeCs;ln

aCs;omeCs;lnaAs;omeG;dT;dT;d

T;dT;lnaGs;omeCs;lnaTs;omeAs;l naTs;omeUs;lnaAs;omeCs;lnaCs;

omeUs;lnaTs;omeAs;lnaAs;ome

Cs;lnaCs;omeCs;lnaAs;omeG-

Sup

HBB-06 705 CCTCTTACCTCAGTTA lnaCs;omeCs;lnaTs;omeCs;lnaTs; 3796 control dimer CA I 1 1 I CCTCTTACCT omeUs;lnaAs;omeCs;lnaCs;ome

CAGTTACA Us;lnaCs;omeAs;lnaGs;omeUs;ln

aTs;omeAs;lnaCs;omeA;dT;dT;d

T;dT;lnaCs;omeCs;lnaTs;omeCs;l naTs;omeUs;lnaAs;omeCs;lnaCs;

omeUs;lnaCs;omeAs;lnaGs;ome

Us;lnaTs;omeAs;lnaCs;omeA-

Sup

BRIEF DESCRIPTION OF THE SEQUENCE LISTING

Single Strand Oligonucleotides (Sense Strand of Target Gene) SeqlD range: 19-3788, 3820-3821, 3825-3826, 3850-3916

SeqIDs w/o G Runs:

19-50, 64-77, 112, 126-187, 201-253, 266-322, 327-358, 368-382, 401 445, 473-475, 489-808, 822-856, 870-955, 969-1021, 1035-1148, 1162- 1232, 1246-1402, 1431-1508, 1530-1568, 1582-1956, 1971-2079, 2093- 2481, 2495-2511, 2525-2580, 2594-3075, 3101-3114, 3128-3136, 3150- 3309, 3323-3344, 3358-3399, 3414-3443, 3461-3467, 3481-3532, 3546- 3788, 3820-3821, 3825-3826, 3850-3870, 3876-3916

SeqIDs w/o miR Seeds:

19, 22-26, 28-29, 31-32, 34-37, 39, 44, 46-47, 49, 51-52, 54, 56-60 62, 64-65, 68-76, 78, 90, 99, 109, 111-113, 115, 119-120, 125, 128- 136, 138-141, 143-149, 156, 160-162, 164-165, 172, 175-176, 178, 181-189, 194, 199, 204-206, 208-211, 215-216, 221-224, 227, 230-234 237, 239, 242-243, 245, 247, 251-252, 254-258, 261, 263, 266, 270, 275-276, 279, 281-284, 286, 288-294, 296, 299, 301-303, 305, 308, 310-315, 318, 320, 326-332, 334, 336, 338-340, 342, 345-348, 352- 353, 355-356, 358-361, 363-364, 366, 368-370, 372-373, 376-381, 383 385-387, 389, 392, 394-395, 397, 400-408, 410-415, 419, 421, 424- 428, 430-431, 434-438, 442-447, 449-451, 456, 458-463, 466-473, 475 477-482, 485-486, 490-502, 504-505, 507, 516-517, 519-521, 523-524, 526-550, 552, 554-579, 581, 583-586, 588-599, 601, 604-610, 612-613 615-619, 621-628, 632-635, 639, 641, 643, 646-649, 651-655, 658-662 668-673, 675-679, 681-684, 686-691, 694-699, 701, 703, 705, 708-709 711, 713-716, 718-722, 724, 726-727, 729-730, 733-736, 738, 741-746 748, 750, 753, 755-759, 761, 764, 766, 768-801, 803, 805-806, 808- 809, 812-816, 818, 820-824, 828, 830-831, 834-838, 841-844, 847, 850, 853-860, 864, 866-867, 869-870, 873-874, 876, 878-882, 884, 887-892, 895-898, 900-901, 903, 905-910, 913-916, 918-919, 921-928, 930-935, 937-938, 941-943, 947-951, 956-961, 963, 967-977, 979-986, -989, 991, 995-996, 998, 1001, 1003-1005, 1009, 1012-1016, 1018,

1023-1024, 1026-1029, 1031, 1033-1035, 1037, 1039-1044, 1047, 1049-

1051, 1054-1055, 1058-1059, 1062-1069, 1071, 1074-1075, 1078-1088,

1090, 1093-1097, 1100-1103, 1106, 1108-1109, 1111-1112, 1114-1115, 1120, 1123-1126, 1131-1134, 1136-1137, 1141-1142, 1144-1152, 1155, 1160-1165, 1170-1172, 1174-1175, 1181-1189, 1195-1200, 1202-1205, 1207-1210, 1212, 1214, 1217, 1219-1223, 1225-1226, 1229, 1232-1237, 1243-1244, 1247-1252, 1255, 1257-1259, 1261, 1263-1264, 1270-1272, 1274-1276, 1278, 1280, 1282-1283, 1286-1287, 1289-1293, 1296, 1298, 1304-1306, 1308, 1311-1314, 1316, 1318-1319, 1322, 1330-1332, 1334, 1336-1339, 1341-1345, 1347-1349, 1353-1354, 1356-1359, 1361-1369, 1371-1372, 1374-1376, 1378-1380, 1382-1383, 1385-1386, 1388, 1391- 1401, 1403-1408, 1411, 1416, 1421-1424, 1427, 1430, 1432-1439, 1443 1445, 1449-1455, 1457-1458, 1460-1461, 1463, 1465-1469, 1471-1473, 1476, 1478-1480, 1482, 1484-1492, 1495-1498, 1500, 1503-1507, 1509- 1517, 1522-1523, 1530-1532, 1537, 1539, 1541-1546, 1548-1549, 1551, 1553, 1555, 1557-1558, 1560-1562, 1564-1567, 1569, 1573-1574, 1579, 1582-1587, 1589, 1591-1594, 1596, 1598-1601, 1605-1607, 1609, 1611, 1615, 1618, 1622, 1625-1629, 1631-1633, 1637, 1639-1642, 1644, 1646 1648-1649, 1655, 1657-1658, 1661-1663, 1666-1671, 1673-1675, 1677- 1679, 1681-1686, 1690, 1699, 1701-1707, 1709-1710, 1715-1718, 1720- 1721, 1723-1726, 1728-1729, 1731, 1734-1737, 1739-1741, 1743-1749, 1751-1759, 1761, 1764, 1766, 1768-1772, 1775-1777, 1780-1781, 1783- 1784, 1787-1800, 1802-1804, 1806-1809, 1811, 1814-1818, 1820, 1822- 1823, 1826, 1832-1838, 1840-1851, 1855, 1859, 1865, 1867-1868, 1870, 1874, 1876-1877, 1879-1882, 1884-1886, 1888-1889, 1892-1894, 1897- 1901, 1907, 1909-1910, 1912, 1914, 1916-1924, 1927-1928, 1930-1932, 1935-1938, 1941-1943, 1946, 1948-1949, 1951, 1954, 1957, 1960, 1962- 1967, 1969-1973, 1977-1979, 1981-1983, 1985-1986, 1989-1991, 1995, 1999-2000, 2003, 2005-2010, 2014-2015, 2020, 2022-2025, 2027-2028,

2030-2032, 2034, 2036, 2040, 2042, 2045-2046, 2048-2058, 2060, 2062, 2064-2065, 2067, 2070, 2072, 2075-2077, 2079-2080, 2084, 2087-2088, 2092-2102, 2105-2106, 2113, 2120-2125, 2130, 2132-2133, 2136, 2138- 2139, 2141-2142, 2145-2148, 2151-2154, 2156-2157, 2159, 2163-2165, 2167-2169, 2173-2175, 2177, 2182, 2184-2187, 2189, 2192, 2195-2197, 2200-2204, 2206-2207, 2209-2210, 2212-2213, 2216, 2218-2220, 2222- 2226, 2228, 2230-2231, 2233-2234, 2238-2241, 2245, 2247-2249, 2254, 2257-2259, 2262-2264, 2266, 2268-2272, 2274-2277, 2279-2280, 2286- 2287, 2289, 2292-2293, 2295-2296, 2299-2302, 2304-2309, 2312-2315, 2317-2334, 2336-2345, 2347-2350, 2352-2355, 2358-2359, 2364-2366,

2369-2370, 2372-2373, 2375, 2380, 2382, 2384-2386, 2388-2390, 2393- 2394, 2396, 2399, 2401-2404, 2406, 2411, 2413-2415, 2418-2420, 2422- 2423, 2425, 2430, 2432-2438, 2440, 2442, 2445-2448, 2451, 2454, 2456-2457, 2461-2464, 2467-2468, 2473-2479, 2481-2482, 2484-2488, 2491-2492, 2494-2500, 2504-2505, 2507-2510, 2512, 2514-2516, 2518- 2528, 2530-2532, 2534, 2538-2539, 2541, 2543-2544, 2546, 2550-2552, 2554, 2556, 2558, 2561, 2563-2572, 2574, 2577, 2579-2581, 2583-2587, 2590, 2592-2596, 2599-2605, 2607-2608, 2611-2613, 2615-2616, 2619- 2622, 2624, 2629-2634, 2636, 2638, 2640-2644, 2646, 2649-2651, 2655, 2657-2658, 2660, 2662-2668, 2670, 2672-2679, 2681-2683, 2685-2690, 2692-2693, 2697-2699, 2701-2702, 2705, 2707, 2710-2714, 2717, 2719, 2721, 2723, 2725-2726, 2728, 2731, 2733-2734, 2736-2737, 2739-2740, 2742-2743, 2745-2748, 2752-2754, 2756-2758, 2761, 2763, 2765-2766, 2768, 2770-2778, 2780, 2783-2784, 2789-2798, 2800, 2805-2806, 2808- 2810, 2812, 2814, 2817-2828, 2830-2838, 2840-2841, 2843, 2845-2850, 2852-2853, 2856, 2858-2859, 2861-2862, 2864-2867, 2870, 2872-2874, 2876-2881, 2883-2885, 2887, 2889, 2892-2896, 2898, 2900-2902, 2905- 2906, 2908, 2910-2912, 2915-2921, 2923, 2926-2928, 2932-2933, 2937- 2938, 2940-2943, 2945-2946, 2948-2949, 2951-2958, 2962, 2970-2972, 2974, 2976-2977, 2979-2981, 2983, 2985-2988, 2990-2991, 2993, 2997, 2999-3000, 3003-3007, 3009-3012, 3016, 3018, 3021-3022, 3024-3025, 3027-3029, 3032-3033, 3035, 3037-3038, 3040-3042, 3046, 3048, 3050- 3054, 3060-3062, 3071, 3074-3075, 3079-3081, 3083, 3085-3086, 3088, 3093, 3099, 3101-3102, 3105-3106, 3111-3114, 3116-3118, 3121-3125, 3127-3128, 3131, 3134, 3137-3138, 3141-3143, 3150-3151, 3153-3156, 3159-3160, 3162-3164, 3166-3173, 3175-3176, 3179-3184, 3187, 3190, 3192-3193, 3195-3201, 3205-3217, 3220-3221, 3223, 3225-3226, 3228- 3229, 3231-3232, 3236-3250, 3252-3253, 3258-3259, 3261-3263, 3265- 3271, 3273, 3275, 3281-3282, 3285, 3288, 3290-3296, 3298, 3301, 3304-3305, 3308-3310, 3312, 3314, 3319, 3322-3323, 3325-3327, 3329, 3331, 3333, 3335-3337, 3339-3344, 3346-3352, 3356-3358, 3360, 3364, 3366-3369, 3371-3381, 3384-3385, 3389-3391, 3393-3401, 3404-3406, 3408-3410, 3413-3415, 3421-3423, 3426-3428, 3431, 3433, 3437-3440, 3443-3448, 3450-3453, 3459-3461, 3463, 3466-3471, 3473-3475, 3477- 3478, 3480, 3484-3488, 3490-3491, 3494-3496, 3498, 3500-3502, 3504, 3506, 3509-3511, 3513, 3515-3521, 3524-3531, 3533-3536, 3538-3540, 3542, 3545, 3547-3548, 3551-3555, 3558-3561, 3563-3565, 3567-3571, 3574, 3577, 3584-3585, 3587-3588, 3590, 3592, 3595-3600, 3602, 3604- 3605, 3607, 3609, 3611-3616, 3618-3620, 3623-3624, 3627-3629, 3631- 3637, 3642-3647, 3650-3655, 3657, 3660-3662, 3666-3668, 3670-3671, 3673-3676, 3679-3684, 3687-3688, 3691-3693, 3695-3700, 3703, 3705- 3708, 3710, 3712, 3714-3716, 3719, 3722-3725, 3727-3729, 3731-3737, 3741, 3743-3745, 3748-3753, 3758-3760, 3763-3775, 3777, 3779-3785, 3787-3788, 3820-3821, 3825, 3850-3854, 3856-3859, 3861-3867, 3870- 3874, 3876-3879, 3881, 3884-3885, 3887-3888, 3890, 3892-3895, 3897- 3898, 3900-3901, 3904, 3906-3907, 3909-3916

Single Strand Oligonucleotides (Antisense Strand of Target Gene)

SeqID range: 3797-3819, 3822-3824, 3827-3849

SeqIDs w/o G Runs:

3797-3819, 3822-3824, 3827-3849

SeqIDs w/o miR Seeds:

3797-3798, 3800-3804, 3806-3808, 3811-3817, 3822-3824, 3827, 3829- 3834, 3836-3838, 3843, 3845-3847

The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice the invention. The present invention is not to be limited in scope by examples provided, since the examples are intended as a single illustration of one aspect of the invention and other functionally equivalent embodiments are within the scope of the invention. Various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description and fall within the scope of the appended claims. The advantages and objects of the invention are not necessarily encompassed by each embodiment of the invention.