Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
AGENTS FOR STIMULATING ACTIVITY OF METHYL MODIFYING ENZYMES AND METHODS OF USE THEREOF
Document Type and Number:
WIPO Patent Application WO/2011/011366
Kind Code:
A2
Abstract:
Agents for stimulating activity of methyl modifying enzymes and methods of using the enzymes in assays to identify enzyme modulators are provided herein.

Inventors:
TROJER PATRICK (US)
LAN FEI (US)
Application Number:
PCT/US2010/042527
Publication Date:
January 27, 2011
Filing Date:
July 20, 2010
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
CONSTELLATION PHARMACEUTICALS (US)
TROJER PATRICK (US)
LAN FEI (US)
International Classes:
C40B30/00; C12Q1/68; G01N33/566
Foreign References:
US20080070257A12008-03-20
US20070274912A12007-11-29
US20060115839A12006-06-01
US20020081638A12002-06-27
Other References:
TAN ET AL.: 'Pharmacologic disruption of Polycomb-repressive complex-2 mediated gene repression selectively induces apoptosis in cancer cells' GENES DEV. vol. 21, 2007, pages 1050 - 1063
Attorney, Agent or Firm:
CLOUSE, Katherine, Nicole et al. (Hall & Stewart LLPTwo International Plac, Boston MA, US)
Download PDF:
Claims:
We claim:

1. A method of evaluating a test compound, the method comprising:

contacting a methyl modifying enzyme and a substrate with a test compound in the presence of a stimulating agent;

evaluating activity of the methyl modifying enzyme on the substrate in the presence of the test compound, relative to a control, wherein a change in activity of the methyl modifying enzyme in the presence of the test compound, relative to a control, indicates that the test compound is a modulator of the methyl modifying enzyme.

2. The method of claim 1, wherein the methyl modifying enzyme comprises a histone methyl modifying enzyme.

3. The method of claim 1, wherein the methyl modifying enzyme comprises a methylase.

4. The method of claim 1, wherein the methyl modifying enzyme comprises a demethylase.

5. The method of claim 1, wherein the substrate comprises a peptide.

6. The method of claim 1, wherein the substrate comprises a histone polypeptide.

7. The method of claim 1, wherein the substrate comprises a nucleosome.

8. The method of claim 1, wherein the substrate comprises an oligonucleosome.

9. The method of claim 1, wherein the stimulating agent comprises a peptide.

10. The method of claim 9, wherein the peptide comprises a methylated peptide.

11. The method of claim 10, wherein the methylated peptide is 4-60 amino acids in length.

12. The method of claim 10, wherein the methylated peptide comprises one or more methylated lysine residues.

13. The method of claim 12, wherein the methylated peptide comprises one or more tri- methylated lysine residues.

14. The method of claim 12, wherein the methylated peptide comprises one or more di- methylated lysine residues.

15. The method of claim 12, wherein the methylated peptide comprises one or more mono- methylated lysine residues.

16. The method of claim 9, wherein the stimulating agent comprises a histone peptide.

17. The method of claim 16, wherein the stimulating agent comprises a methylated histone peptide.

18. The method of claim 17, wherein the methylated histone peptide comprises a methylated histone H3 peptide.

19. The method of claim 17, wherein the methylated histone peptide comprises a methylated histone H4 peptide.

20. The method of claim 17, wherein the methylated histone peptide comprises a methylated histone Hl peptide.

21. The method of claim 17, wherein the methylated histone peptide comprises one or more tri-methylated lysine residues.

22. The method of claim 17, wherein the methylated histone peptide comprises one or more di-methylated lysine residues.

23. The method of claim 17, wherein the methylated histone peptide comprises one or more mono-methylated lysine residues.

24. The method of claim 18, wherein the methylated histone peptide comprises at least four consecutive amino acids of the following H3 histone peptide sequence:

ARTKQTARKSTGGKAPRKQLATKAARKSAPATGESKKPHRYRPGTAALREIRRYQ KSTEL (SEQ ID NO :1).

25. The method of claim 24, wherein the H3 histone peptide is methylated on one or more of the following lysine residues: K4, K9, K27, and K36.

26. The method of claim 25, wherein the H3 histone peptide is methylated on K27.

27. The method of claim 25, wherein the H3 histone peptide is methylated on K9.

28. The method of claim 19, wherein the methylated histone peptide comprises at least four consecutive amino acids of the following H4 histone peptide sequence:

SGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRG VLKV (SEQ ID NO:2).

29. The method of claim 28, wherein the H4 histone peptide is methylated on K20.

30. The method of claim 20, wherein the methylated histone peptide comprises at least four consecutive amino acids of the following Hl histone peptide sequence:

SETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELITKAVAASKERSGV SLAA (SEQ ID NO:3).

31. The method of claim 30, wherein the Hl histone peptide is methylated on K26.

32. The method of claim 1 , wherein the stimulating agent is present in an amount which stimulates activity of the methyl modifying enzyme at least 2-fold.

33. The method of claim 1, wherein the stimulating agent is present in an amount which stimulates activity of the methyl modifying enzyme at least 5 -fold.

34. The method of claim 1 , wherein the stimulating agent is present in an amount which stimulates activity of the methyl modifying enzyme at least 10-fold.

35. The method of claim 1, wherein the test compound comprises a small molecule, a peptide, or a nucleic acid.

36. The method of claim 1, wherein the methyl modifying enzyme and substrate are contacted with a library of test compounds, and wherein a change in activity of the methyl modifying enzyme in the presence of the library, relative to a control, indicates that the library comprises a modulator of the methyl modifying enzyme.

37. The method of claim 36, further comprising selecting the modulator from the library.

38. The method of claim 3, wherein the methylase comprises a human histone methylase.

39. The method of claim 38, wherein the human histone methylase comprises a Poly comb Repressive Complex 2 polypeptide complex.

40. A reaction mixture comprising:

a methyl modifying enzyme;

a substrate; and

a stimulating agent, wherein the stimulating agent is present in an amount sufficient to increase activity of the methyl modifying enzyme.

41. The reaction mixture of claim 40, wherein the methyl modifying enzyme comprises a histone methyl modifying enzyme.

42. The reaction mixture of claim 40, wherein the methyl modifying enzyme comprises a methylase.

43. The reaction mixture of claim 40, wherein the methyl modifying enzyme comprises a demethylase.

44. The reaction mixture of claim 40, wherein the substrate comprises a peptide.

45. The reaction mixture of claim 40, wherein the substrate comprises a histone

polypeptide.

46. The reaction mixture of claim 40, wherein the substrate comprises a nucleosome.

47. The reaction mixture of claim 40, wherein the substrate comprises an oligonucleosome.

48. The reaction mixture of claim 40, wherein the stimulating agent comprises a peptide.

49. The reaction mixture of claim 48, wherein the peptide comprises a methylated peptide.

50. The reaction mixture of claim 49, wherein the methylated peptide is 4-60 amino acids in length.

51. The reaction mixture of claim 49, wherein the methylated peptide comprises one or more methylated lysine residues.

52. The reaction mixture of claim 51 , wherein the methylated peptide comprises one or more tri-methylated lysine residues.

53. The reaction mixture of claim 51 , wherein the methylated peptide comprises one or more di-methylated lysine residues.

54. The reaction mixture of claim 51 , wherein the methylated peptide comprises one or more mono-methylated lysine residues.

55. The reaction mixture of claim 48, wherein the stimulating agent comprises a histone peptide.

56. The reaction mixture of claim 55, wherein the stimulating agent comprises a methylated histone peptide.

57. The reaction mixture of claim 56, wherein the methylated histone peptide comprises a methylated histone H3 peptide.

58. The reaction mixture of claim 56, wherein the methylated histone peptide comprises a methylated histone H4 peptide.

59. The reaction mixture of claim 56, wherein the methylated histone peptide comprises a methylated histone Hl peptide.

60. The reaction mixture of claim 56, wherein the methylated histone peptide comprises one or more tri-methylated lysine residues.

61. The reaction mixture of claim 56, wherein the methylated histone peptide comprises one or more di-methylated lysine residues.

62. The reaction mixture of claim 56, wherein the methylated histone peptide comprises one or more mono-methylated lysine residues.

63. The reaction mixture of claim 57, wherein the methylated histone peptide comprises at least four consecutive amino acids of the following H3 histone peptide sequence:

ARTKQTARKSTGGKAPRKQLATKAARKSAPATGESKKPHRYRPGTAALREIRRYQ KSTEL (SEQ ID NO :1).

64. The reaction mixture of claim 63, wherein the H3 histone peptide is methylated on one or more of the following lysine residues: K4, K9, K27, and K36.

65. The reaction mixture of claim 64, wherein the H3 histone peptide is methylated on K27.

66. The reaction mixture of claim 64, wherein the H3 histone peptide is methylated on K9.

67. The reaction mixture of claim 58, wherein the methylated histone peptide comprises at least four consecutive amino acids of the following H4 histone peptide sequence: SGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRG VLKV (SEQ ID NO:2).

68. The reaction mixture of claim 67, wherein the H4 histone peptide is methylated on K20.

69. The reaction mixture of claim 59, wherein the methylated histone peptide comprises at least four consecutive amino acids of the following Hl histone peptide sequence:

SETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELITKAVAASKERSGV SLAA (SEQ ID NO:3).

70. The reaction mixture of claim 69, wherein the Hl histone peptide is methylated on K26.

71. The reaction mixture of claim 40, wherein the stimulating agent is present in an amount which stimulates activity of the methyl modifying enzyme at least 2-fold.

72. The reaction mixture of claim 40, wherein the stimulating agent is present in an amount which stimulates activity of the methyl modifying enzyme at least 5 -fold.

73. The reaction mixture of claim 40, wherein the stimulating agent is present in an amount which stimulates activity of the methyl modifying enzyme at least 10-fold.

Description:
AGENTS FOR STIMULATING ACTIVITY OF METHYL MODIFYING ENZYMES

AND METHODS OF USE THEREOF

Related Applications

[0001] This application is related to and claims priority under 35 U. S. C. § 119(e) to United States provisional patent application number 61/227,031, filed July 20, 2009 ("the '031 application"). The entire contents of the '031 application are incorporated herein by reference.

Background of the Invention

[0002] Eukaryotic chromatin is composed of macromolecular complexes called nucleosomes. A nucleosome has 147 base pairs of DNA wrapped around a protein octamer having two subunits of each of histone protein H2A, H2B, H3, and H4. Histone proteins are subject to post-translational modifications which in turn affect chromatin structure and gene expression. One type of post-translational modification found on histones is methylation of lysine and arginine residues. Histone methylation plays a critical role in the regulation of gene expression in eukaryotes. Methylation affects chromatin structure and has been linked to both activation and repression of transcription (Zhang and Reinberg, Genes Dev. 15:2343- 2360, 2001). Enzymes that catalyze attachment and removal of methyl groups from histones are implicated in gene silencing, embryonic development, cell proliferation, and other processes.

Summary of the Invention

[0003] The present disclosure encompasses the recognition that methyl modifying enzymes are an attractive target for modulation, given their role in the regulation of diverse biological processes. The present disclosure provides methods and compositions to facilitate identification of modulators of these enzymes by enhancing their activity in vitro. For example, according to the present disclosure, it has been discovered that methylase and demethylase activity can be stimulated by addition of peptides to enzymatic reactions or by introducing particular modifications on substrate molecules, thereby stimulating enzymatic activity and/or changing target site specificity, and in this context providing a more robust platform for evaluating candidate agents for inhibition and/or activation of enzymatic activity. In particular embodiments, the present disclosure provides agents that stimulate activity of histone methyl modifying enzymes, including histone methylases and histone demethylases. Stimulating agents for histone methylases and demethylases include methylated histone peptides (e.g., synthetic peptides composed of amino acids mimicking the sequence of distinct regions of histone proteins).

[0004] Accordingly, in one aspect, the present disclosure features a method of evaluating a test compound including, for example: contacting a methyl modifying enzyme and a substrate with a test compound in the presence of a stimulating agent; evaluating activity of the methyl modifying enzyme on the substrate in the presence of the test compound, relative to a control, wherein a change in activity of the methyl modifying enzyme in the presence of the test compound, e.g., relative to a control, indicates that the test compound is a modulator of the methyl modifying enzyme. In some embodiments, the invention provides high- throughput formats for performing such methods, for example allowing simultaneous assessment of at least 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 100 or more (and in some embodiments several 100s or 1000s) of reactions.

[0005] In some embodiments, a methyl modifying enzyme comprises a histone methyl modifying enzyme. In some embodiments, a methyl modifying enzyme comprises a methylase (e.g., a human histone methylase, e.g., a human histone methylase in Table IA). In some embodiments, a methyl modifying enzyme comprises a demethylase (e.g., a human histone demethylase, e.g., a human histone demethylase in Table IB).

[0006] A substrate can include a peptide (e.g., a histone peptide), a polypeptide (e.g., histone polypeptide), a histone dimer (e.g., an H2A-H2B dimer), a histone tetramer (e.g., an H3-H4 tetramer), a histone octamer, a nucleosome, an oligonucleosome, chromatin (e.g., in the presence or absence of histone Hl isotypes), or a combination thereof.

[0007] A stimulating agent can include a peptide, e.g., a methylated peptide. In some embodiments, a stimulating agent comprises a peptide 4-60 amino acids in length. In some embodiments, a methylated peptide comprises one or more methylated lysine residues. In some embodiments, a methylated peptide comprises one or more tri-methylated lysine residues. In some embodiments, a methylated peptide comprises one or more di-methylated lysine residues. In some embodiments, a methylated peptide comprises one or more mono- methylated lysine residues.

[0008] In some embodiments, a stimulating agent comprises a histone peptide, e.g., a methylated histone peptide. In some embodiments, a methylated histone peptide comprises a methylated histone H3 peptide, a methylated histone H4 peptide, or a methylated histone Hl peptide. [0009] In some embodiments, a methylated histone peptide comprises one or more tri- methylated lysine residues, one or more di-methylated lysine residues, and/or one or more mono-methylated lysine residues.

[0010] In some embodiments, a methylated histone peptide comprises at least four consecutive amino acids of the following H3 histone peptide sequence:

ARTKQTARKSTGGKAPRKQLATKAARKSAPATGESKKPHRYRPGTAALREIRRYQK

STEL (SEQ ID NO:1). In some embodiments, an H3 histone peptide is methylated on one or more of the following lysine residues: K4, K9, K27, and K36. In some embodiments, a H3 histone peptide is methylated on K27. In some embodiments, an H3 histone peptide is methylated on K9.

[0011] In some embodiments, a methylated histone peptide comprises at least four consecutive amino acids of the following H4 histone peptide sequence:

SGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGV

LKV (SEQ ID NO:2). In some embodiments, an H4 histone peptide is methylated on K20.

[0012] In some embodiments, a methylated histone peptide comprises at least four consecutive amino acids of the following Hl histone peptide sequence:

SETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELITKAVAASKERSGVS

LAA (SEQ ID NO:3).

[0013] In some embodiments, an Hl histone peptide is methylated on K26.

[0014] In some embodiments, a stimulating agent is present in an amount which stimulates activity of the methyl modifying enzyme at least 2-fold, at least 5-fold, or at least

10-fold.

[0015] A test compound can include a small molecule, a peptide, and/or a nucleic acid.

[0016] In some embodiments of a method provided herein, a methyl modifying enzyme and substrate are contacted with a library of test compounds, and a change in activity of the methyl modifying enzyme in the presence of the library, relative to a control, indicates that the library comprises a modulator of the methyl modifying enzyme. A method can further include selecting the modulator from the library.

[0017] In another aspect, the present disclosure features reaction mixture including, for example: a substrate of a methyl modifying enzyme; and a stimulating agent, wherein the stimulating agent is present in an amount sufficient to increase activity of a methyl modifying enzyme. The reaction mixture can further include a methyl modifying enzyme.

[0018] A methyl modifying enzyme can include a histone methyl modifying enzyme. A methyl modifying enzyme can include a methylase or a demethylase. [0019] In some embodiments, a substrate comprises a peptide (e.g., a histone peptide), a polypeptide (e.g., a histone polypeptide), a nucleosome, an oligonucleosome, chromatin, or a combination thereof.

[0020] A stimulating agent can include a peptide, e.g., a methylated peptide. In some embodiments, a stimulating agent comprises a peptide 4-60 amino acids in length. In some embodiments, a methylated peptide comprises one or more methylated lysine residues. In some embodiments, a methylated peptide comprises one or more tri-methylated lysine residues. In some embodiments, a methylated peptide comprises one or more di-methylated lysine residues. In some embodiments, a methylated peptide comprises one or more mono- methylated lysine residues.

[0021] In some embodiments, a stimulating agent comprises a histone peptide, e.g., a methylated histone peptide. In some embodiments, a methylated histone peptide comprises a methylated histone H3 peptide, a methylated histone H4 peptide, a methylated histone Hl peptide.

[0022] In some embodiments, a methylated histone peptide comprises one or more tri- methylated lysine residues, one or more di-methylated lysine residues, and/or one or more mono-methylated lysine residues.

[0023] In some embodiments, a methylated histone peptide comprises at least four consecutive amino acids of the following H3 histone peptide sequence:

ARTKQTARKSTGGKAPRKQLATKAARKSAPATGESKKPHRYRPGTAALREIRRYQK STEL (SEQ ID NO:1). In some embodiments, an H3 histone peptide is methylated on one or more of the following lysine residues: K4, K9, K27, and K36. In some embodiments, a H3 histone peptide is methylated on K27. In some embodiments, an H3 histone peptide is methylated on K9.

[0024] In some embodiments, a methylated histone peptide comprises at least four consecutive amino acids of the following H4 histone peptide sequence:

SGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGV LKV (SEQ ID NO:2). In some embodiments, an H4 histone peptide is methylated on K20.

[0025] In some embodiments, a methylated histone peptide comprises at least four consecutive amino acids of the following Hl histone peptide sequence:

SETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELITKAVAASKERSGVS LAA (SEQ ID NO:3).

[0026] In some embodiments, an Hl histone peptide is methylated on K26. [0027] In some embodiments, a stimulating agent is present in an amount which stimulates activity of the methyl modifying enzyme at least 2-fold, at least 5-fold, or at least 10-fold.

[0028] In another aspect, the present disclosure provides a composition comprising a stimulating agent described herein.

[0029] According to the present disclosure, stimulating agents confer various benefits. For example, the presence of a stimulating agent can increase sensitivity of an assay.

Alternatively or additionally, the presence of a stimulating agent can allow one to use less enzyme in assays (e.g., five, 10, 25, 50, 100 fold less than needed in the absence of a stimulating agent), thereby reducing costs and/or facilitating adaptation to high throughput formats. In some embodiments, a stimulating agent mimics an interaction encountered by an enzyme in vivo. In such embodiments, modulation of enzyme activity in the presence of a stimulating agent can reflect modulation in a more physiologically relevant state.

Compounds identified under such conditions may be found to have greater specificity and/or superior activity in vivo.

[0030] Details of one or more embodiments of the invention are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the invention will be apparent from the description and drawings, and from the claims. All cited patents, patent applications, and references (including references to public sequence database entries) are incorporated by reference in their entireties for all purposes.

Brief Description of the Drawing

[0031] Figure IA is a schematic depiction of a recombinant Po Iy comb Repressive

Complex 2 (rPRC2) complex, including EZH2, EED, SUZ 12, RBAP46, and RBAP48 subunits.

[0032] Figure IB shows silver staining and Western blot analysis of rPRC2 preparation used in examples described herein.

[0033] Figure 1C shows analysis of H3, H2A/H2B, H4, and [ 3 H]-FB labeled substrate from reactions with rPRC2 and wild type histone H3 (wt) or H3 having a K27A substitution

(H3K27A). Fluorographic analysis is shown in the top panel. Coomassie staining is shown in the bottom panel.

[0034] Figure 2 A shows fluorographic analysis of [ 3 H]-FB in wild type histone H3

(H3wt), H3K27A, biotin/avidin labeled H3 (Bio/Avi-H3), wild type octamers (octamers wt), octamers containing H3 K27A, and Bio/A vi-octamers incubated with rPRC2. Coomassie staining is shown in the bottom panel.

[0035] Figure 2B shows TopCount analysis of methylase reaction products shown in

Figure 2A.

[0036] Figure 3 A shows fluorographic analysis of [ H]-Bio/Avi-H3 in Bio/Avi- oligonucleosomes incubated with rPRC2. Coomassie staining is shown in the bottom panel.

[0037] Figure 3B shows TopCount analysis of methylase reaction products shown in

Figure 3A.

[0038] Figure 3C shows quantitative information for oligonucleosome substrates used in reactions shown in Figures 3 A and 3B.

[0039] Figure 3D is a graph of [ 3 H]-cpm in methylase reactions shown in Figures 3A-3C using increasing concentrations of oligonucleosomes.

[0040] Figures 4 A and 4B are graphs showing [ H]-cpm (Figure 4A) and Michaelis-

Menten data (Figure 4B) for increasing concentrations of oligonucleosomes in methylase reactions.

[0041] Figure 5 is a graph showing stimulation of rPRC2 methylase activity in the presence of unmodified H3 or the following: H3K4me3, H3K9me3, H3K27me3, H3K36me3,

H3K79me3, H4K20me3, and H1.4K26me3 peptides.

[0042] Figure 6A shows fluorographic analysis of [ 3 H]-EZH2 and [ 3 H]-rAvi-H3 in a methylase assay using rPRC2 in the presence of H3K27me3, H3K27meO, H3K9me3,

H4K20me3, or no stimulating agent. Bio/Avi-H3 was used as substrate. Coomassie staining is shown in the bottom panel.

[0043] Figure 6B is a graph of TopCount analysis of reactions shown in Figure 6C.

[0044] Figure 6C shows fluorographic analysis of [ 3 H]-EZH2 and [ 3 H]-rAvi-H3 in methylase assays using rPRC2 in the presence of H3K27me3, H3K27meO, H3K9me3,

H4K20me3, or no stimulating agent. Bio/Avi-oligonucleosomes were used as substrate.

Coomassie staining is shown in the bottom panel.

[0045] Figure 6D is a graph of photostimulated luminescence (PSL) for reactions shown in Figure 6C.

[0046] Figure 7 A shows fluorographic analysis of [ H]-Bio/Avi-H3 in methylase assays using rPRC2 in the presence of H3K27me3, H3K27me2, H3K27mel, H3K27meO,

H3K9me3, or H4K20me3 peptides. Coomassie staining is shown in the bottom panel.

[0047] Figure 7 B is a graph of TopCount analysis of reactions shown in Figure 7 A. [0048] Figure 8A is a graph showing a time course of methylation in an assay using rPRC2 in the presence of an excess amount of a stimulating agent, H3K27me3.

[0049] Figure 8B is a graph showing a time course of methylation in an assay using rPRC2 in the presence of a limiting amount of a stimulating agent, H3K27me3 (1.24 μM).

[0050] Figure 8C shows conditions used for time course assays shown in Figures 8A and

8B.

[0051] Figure 9 is a graph showing a time course of methylation in an assay using rPRC2.

[0052] Figure WA shows conditions used for methylase assays depicted in Figures 1OA and 1OB.

[0053] Figures WB and WC are graphs showing titration of rPRC2 enzyme using oligonucleosomes as a substrate. Figure 1OB shows results from Day 1, using robotics.

Figure 1OC shows results from Day 2, using robotics.

[0054] Figure HA is an analysis of Nuclear SET domain-containing 2 (NSD2) protein from 293 cells.

[0055] Figure HB shows fluorographic analysis of [ 3 H] -H3 in methylase assays using

NSD2 enzyme and octamers or nucleosomes as a substrate. Coomassie staining is shown in the bottom panel.

[0056] Figure 12 A is a graph showing counts per minute of labeled SAM from methylase assays using NSD2 in the presence of various histone peptides.

[0057] Figure 12B is a graph showing fold increase in NSD2 activity in the presence of different concentrations H3K36me2 or H3K79me2.

Definitions

[0058] Characteristic sequence element: As used herein, the term "characteristic sequence element" or "sequence element" refers to a stretch of contiguous amino acids, typically 5 amino acids, e.g., at least 5-50, 5-25, 5-15, or 5-10 amino acids, that shows at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% identity with another polypeptide. In some embodiments, a characteristic sequence element participates in or confers function on a polypeptide.

[0059] Corresponding to: As used herein, the term "corresponding to" is often used to designate the position/identity of an amino acid residue in a peptide or polypeptide (e.g., in a histone peptide). Those of ordinary skill will appreciate that, for purposes of simplicity, a canonical numbering system (based on wild type histone polypeptides) is utilized herein, so that an amino acid "corresponding to" a lysine residue at position 4 (K4) of histone H3 (also referred to as "H3K4"), for example, need not actually be the 4th amino acid in a particular histone peptide amino acid chain but rather corresponds to the residue found at position 4 in a wild type polypeptide (e.g., in a wild type histone polypeptide); those of ordinary skill in the art readily appreciate how to identify corresponding amino acids.

[0060] Demethylase: A "demethylase", as used herein, refers to an enzyme that removes a methyl group or multiple methyl groups from a substrate. The term refers to catalytic demethylase subunits as well as protein complexes containing the catalytic subunits. In some embodiments, a demethylase is a protein demethylase, i.e., an enzyme that removes methyl groups from a polypeptide substrate. In some embodiments, a demethylase is a histone demethylase, i.e., an enzyme that removes methyl groups from a histone polypeptide substrate.

[0061] Histone peptide: The term "histone peptide" as used herein, refers to a peptide that has structural and/or functional similarity to a portion of a wild type histone polypeptide (and includes portions of histone polypeptides)(i.e., a histone peptide has a sequence that is not a full-length histone polypeptide sequence). In some embodiments, a histone peptide has an amino acid sequence that is substantially identical to that of a portion of a wild type histone polypeptide. In some embodiments, a histone peptide has an amino acid sequence that is substantially identical to that of an N-terminal portion of a histone polypeptide. In some embodiments, a histone peptide is less than 60, 50, 40, 30, 20, 10, or fewer amino acids long. In some embodiments, a histone peptide is more than 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more amino acids long. In some embodiments, a histone peptide is between about 20 and about 60 amino acids long. In some embodiments, a histone peptide is between about 10 and about 50 amino acids long. In some embodiments, a histone peptide has an amino acid sequence that includes one or more lysine residues. In some embodiments, a histone peptide has an amino acid sequence that includes one or more methylated (e.g., mono-, di-, and/or tri-methylated) lysine residues. In some embodiments, a histone peptide has an amino acid sequence that includes a plurality of sequence elements, each of which is found in a natural histone polypeptide. In some embodiments, a histone peptide has an amino acid sequence that includes a plurality of sequence elements that are found in (or share substantially identity with sequence elements that are found in) a plurality of different natural histone polypeptides.

[0062] Methyl modifying enzyme: The term "methyl modifying enzyme", as used herein, refers to an enzyme that catalyzes transfer of a methyl group from one molecule to another. Methyl modifying enzymes include methylases (e.g., methylases that attach methyl groups to polypeptide substrates) and demethylases (e.g., demethylases that remove methyl groups from polypeptide substrates). Methyl modifying enzymes include enzymes having a full length sequence, enzymes having a portion of a full length sequence, and/or partial enzyme complexes that retain enzymatic activity.

[0063] Methylase: A "methylase", as used herein, refers to an enzyme that attaches a methyl group to a substrate. The term refers to catalytic methylase subunits as well as protein complexes containing the catalytic subunits. Methylases are also referred to as

methyltransferases. In some embodiments, a methylase is a protein methylase, i.e., an enzyme that attaches methyl groups to polypeptide substrate. In some embodiments, a methylase is a histone methylase, i.e., an enzyme that attaches methyl groups to a histone polypeptide substrate.

[0064] Methylated: The term "methylated", as used herein, refers to the presence of one or more methyl groups on a molecule (e.g., peptide). In some embodiments, a methylated peptide has one methylated amino acid. In some embodiments, a methylated peptide has more than one methylated amino acid. In some embodiments, an amino acid residue on a methylated peptide has one or more methyl groups (i.e., a residue is di- or tri-methylated).

[0065] Polypeptide: The term "polypeptide", as used herein, generally has its art- recognized meaning of a polymer of at least three amino acids. However, the term is also used to refer to specific functional classes of polypeptides, such as, for example, methylase polypeptides, demethylase polypeptides, histone polypeptides, etc. For each such class, the present specification provides several examples of known sequences of such polypeptides. Those of ordinary skill in the art will appreciate, however, that the term "polypeptide" is intended to be sufficiently general as to encompass not only polypeptides having the complete sequence recited herein (or in a reference or database specifically mentioned herein), but also to encompass polypeptides that represent functional fragments (i.e., fragments retaining at least one activity) of such complete polypeptides. Moreover, those of ordinary skill in the art understand that protein sequences generally tolerate some substitution without destroying activity. Thus, any polypeptide that retains activity and shares at least about 30-40% overall sequence identity, often greater than about 50%, 60%, 70%, or 80%, and further usually including at least one region of much higher identity, often greater than 90% or even 95%, 96%, 97%, 98%, or 99% in one or more highly conserved regions, usually encompassing at least 3-4 and often up to 20 or more amino acids, with another polypeptide of the same class, is encompassed within the relevant term "polypeptide" as used herein. Other regions of similarity and/or identity can be determined by those of ordinary skill in the art by analysis of the sequences of various polypeptides described herein.

[0066] Stimulating agent: The term "stimulating agent", as used herein, refers to an agent that increases activity of a methyl modifying enzyme. A stimulating agent of a methylase enzyme increases methylase activity of the enzyme. A stimulating agent of a demethylase enzyme increases demethylase activity of the enzyme. In some embodiments, a stimulating agent is a peptide 4-60 amino acids in length. In some embodiments, a stimulating agent is a methylated peptide 4-60 amino acids in length. A stimulating agent can include, or consist of, a peptide sequence (e.g., a methylated peptide sequence) of a histone polypeptide, such as an H3, Hl, or H4 polypeptide. Stimulating agents can include peptides (e.g., methylated peptides) having natural and/or non-natural amino acids. Stimulating agents can include modifications such one or more labels. In some embodiments, a stimulating agent is biotinylated. In some embodiments, enzyme activity is stimulated two, three, four, five, ten, twenty, fifty- fold, or more, in the presence of a stimulating agent.

[0067] Substantial identity: The term "substantial identity" of amino acid sequences (and of polypeptides having these amino acid sequences) typically means sequence identity of at least 40% compared to a reference sequence as determined by comparative techniques known in the art. For example, a variety of computer software programs are well known for particular sequence comparisons. In some embodiments, the BLAST is utilized, using standard parameters, as described. In some embodiments, the preferred percent identity of amino acids can be any integer from 40% to 100%. In some embodiments, sequences are substantially identical if they show at least 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical residues in corresponding positions. In some embodiments, polypeptides are considered to be "substantially identical" when they share amino acid sequences as noted above except that residue positions which are not identical differ by conservative amino acid changes. Conservative amino acid substitutions refer to the interchangeability of residues having similar side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic- hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains is cysteine and methionine. Preferred conservative amino acids substitution groups are: valine-leucine-isoleucine, phenylalanine -tyrosine, lysine-arginine, alanine -valine, aspartic acid-glutamic acid, and asparagine-glutamine.

[0068] As mentioned above, one example of an algorithm that is suitable for determining percent sequence identity and sequence similarity is the BLAST algorithm, which is described in Altschul et al, 1977, Nuc. Acids Res. 25:3389-3402. BLAST is used, with the parameters described herein, to determine percent sequence identity for the nucleic acids and proteins of the present disclosure. Software for performing BLAST analysis is publicly available through the National Center for Biotechnology Information (available at the following internet address: ncbi.nlm.nih.gov). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always>0) and N (penalty score for mismatching residues; always<0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative- scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11 , an expectation (E) or 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA, 89:10915 (1989)) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.

[0069] The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA, 90:5873- 5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than about 0.01, and most preferably less than about 0.001.

[0070] Substrate: A "substrate" as used herein to describe substrates of a methyl modifying enzyme, refers to any peptide, polypeptide, or molecular complex that can be modified by activity of the enzyme. In general, a "substrate" of an enzyme, is an entity with which the enzyme specifically interacts (e.g., in the presence of other entities). Substrates of methyl modifying enzymes include peptides or polypeptides that have a site to which a methyl can be attached and/or removed. In some embodiments, a substrate of a methyl modifying enzyme comprises a histone peptide or histone polypeptide. In some

embodiments, a substrate of a methyl modifying enzyme comprises a nucleosome. In some embodiments, a substrate of a methyl modifying enzyme comprises an oligonucleosome. In some embodiments, a substrate of a methyl modifying enzyme comprises chromatin.

[0071] Test compound: A "test compound" can be any chemical compound, for example, a macromolecule (e.g., a polypeptide, a protein complex, or a nucleic acid) or a small molecule (e.g., an amino acid, a nucleotide, an organic or inorganic compound). The test compound can have a formula weight of less than about 10,000 grams per mole, less than 5,000 grams per mole, less than 1,000 grams per mole, or less than about 500 grams per mole, e.g., between 5,000 to 500 grams per mole. The test compound can be naturally occurring (e.g., a herb or a nature product), synthetic, or both. Examples of macromolecules are proteins (e.g., antibodies, antibody fragments), protein complexes, and glycoproteins, nucleic acids, e.g., DNA, RNA (e.g., siRNA), and PNA (peptide nucleic acid). Examples of small molecules are peptides, peptidomimetics (e.g., peptoids), amino acids, amino acid analogs, polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, organic or inorganic compounds e.g., heteroorganic or organometallic compounds.

[0072] Wild type: The term "wild-type", when applied to a polypeptide (e.g., a histone polypeptide) has its art understood meaning and refers to a polypeptide whose primary amino acid sequence is identical to that of a polypeptide found in nature. As will be appreciated by those skilled in the art, a wild type polypeptide is one whose amino acid sequence is found in normal (i.e., non-mutant) polypeptides. Detailed Description of Certain Embodiments

Histone methyl modifying enzymes

[0073] The present disclosure provides methods and compositions for identifying compounds that modulate activity of histone methyl modifying enzymes. Histone methyl modifying enzymes are key regulators of cellular and developmental processes. Such enzymes have modules that mediate binding to methylated residues. For example, multiple demethylases contain a Tudor domain (e.g., JMJD2C/GASC1) or a PHD domain (e.g.,

JARID 1C/SMCX, PHF8). In some embodiments, stimulating agents described herein present one or more modifications recognized by a methyl binding domain of an enzyme of interest and provide a more physiological environment for the enzyme, thereby increasing its activity (e.g., by increasing substrate affinity).

[0074] One class of histone methylases is characterized by the presence of a SET domain, named after proteins that share the domain, Su(var)3-9, enhancer of zeste [E(Z)], and trithorax. A SET domain includes about 130 amino acids. SET domain-containing methylase families include SUV39H1, SETl, SET2, EZH2, RIZl, SMYD3, SUV4-20H1,

SET7/9, and PR-SET7/SET8 families (reviewed in Dillon et al, Genome Biol. 6:227, 2005).

Members of a family typically include similar sequence motifs in the vicinity of and within the SET domain. The human genome encodes over 50 SET domain-containing histone protein methylases, any of which can be used in an assay described herein.

[0075] EZH2 is an example of a human SET-domain containing methylase. EZH2 associates with EED (Embryonic Ectoderm Development) and SUZ 12 (suppressor of zcstc

12 homo log) to form a complex known as PRC2 (Poly comb Group Repressive Complex 2) having the ability to tri-methylate histone H3 at lysine 27 (Cao and Zhang, MoI. Cell 15:57-

67, 2004). PRC2 complexes can also include RBAP46 and RBAP48 subunits. EZH2 overexpression is associated with aggressiveness of certain cancers such as breast cancer

(Kleer et al, Proc. Nat. Acad. Sci. USA 100:11606-11611, 2003).

[0076] The lysine specificities of many histone methyltransferases have been

characterized. For example SET7/9, SMYD3, and MLL1-5 are specific for H3K4.

SUV39H1, DIM-5, and G9a are specific for H3K9. SET8 is specific for H4K20.

[0077] DOTl is an example of a non-SET domain containing histone methylase. DOTl methylates H3 on lysine 79.

[0078] Just as histone methylases have been shown to regulate transcriptional activity, chromatin structure, and gene silencing, demethylases have also been discovered which impact gene expression. LSDl was the first histone lysine demethylase to be characterized. This enzyme displays homology to FAD-dependent amine oxidases and acts as a transcriptional corepressor of neuronal genes (Shi et al., Cell 119:941-953, 2004). Additional demethylases defining separate demethylase families have been discovered, including JHDMl (or KDM2), JHDM2 (or KDM3), JMJD2 (or KDM4), JARID (or KDM5), JMJD3 (or KDM6), and JMJD6 families (Lan et al., Curr. Opin. Cell Biol. 20(3):316-325, 2008).

[0079] Demethylases act on specific lysine residues within substrate sequences and discriminate between the degree of methylation present on a given residue. For example, LSDl removes mono- or dimethyl- groups from H3K4. Members of the JARIDlA-D family remove trimethyl groups from H3K4. UTX and JMJD3 demethylate H3K27, counteracting effects of EZH2 methylase activity. Substrate specificities of other demethylases have been characterized (see Shi, Nat. Rev. 8:829-833, 2007).

[0080] Histone methyl modifying enzymes can be produced recombinantly or purified from a natural source. Histone methyl modifying enzymes are also commercially available. In some embodiments, a histone methyl modifying enzyme used in a method or composition described herein is a human enzyme. In some embodiments, a histone methyl modifying enzyme used in a method or composition described herein is a non-human enzyme (e.g., a murine, rat, bovine, equine, porcine, canine, chicken, zebrafϊsh, chimpanzee, macaque, Drosophila, C. elegans, Xenopus, ox Anopheles enzyme). Examples of human histone methylases and demethylases that can be used according to the present disclosure are listed in Tables IA and IB. Non-human homologs of enzymes shown in Tables IA and IB, as well as additional human and non-human methyl modifying enzymes are known and can also be used in/part of methods and compositions described herein.

Table IA. Exemplary Methylases

Table IB. Exemplary Demethylases

Substrates of histone methyl modifying enzymes

[0081] Any substrate for histone methyl-modifying activity can be used according to methods of the present disclosure. In some embodiments, a substrate comprises a full length histone polypeptide or portion thereof. In some embodiments, a substrate comprises a nucleosome. In some embodiments, a substrate comprises an oligonucleosome. In some embodiments, a substrate comprises a reconstituted nucleosome. In some embodiments, a substrate comprises a nucleosome purified from a cell (see, e.g., Ausio and van Holde, Biochem. 25:1421-1428, 1986; and Fang et al, Meth. Enzymol. 377:213-226, 2003). In some embodiments, a substrate comprises chromatin. Histones used as substrates can include histones from one or more species.

[0082] In some embodiments, a substrate comprises a peptide (e.g., a histone peptide). For example, analysis of activity demethylase enzyme may utilize a histone peptide (e.g., as shown in Examples herein).

Stimulating agents

[0083] Stimulating agents provided herein comprise peptides. In some embodiments, stimulating agents are 4-60 amino acids in length. For example, a stimulating agent can include 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, or 60 amino acids. In some embodiments, a stimulating agent includes 4-60 amino acids of a histone polypeptide (e.g., an H3, H4, Hl, H2A, or H2B histone polypeptide). In some embodiments, a stimulating agent comprises 4-60 amino acids from the N-terminus of a histone polypeptide. In some embodiments, a stimulating agent comprises a methylated peptide.

[0084] Methylation of a stimulating agent can include mono-, di-, and/or tri-methylation. A stimulating agent can include methylation of one or more residues (e.g., one or more lysine residues).

[0085] In some embodiments, a stimulating agent comprises at least four amino acids from an N-terminal region (e.g., an N-terminal region comprising the N-terminal 60 amino acids) of a histone polypeptide, wherein the agent comprises a methylated lysine. In some embodiments, a stimulating agent has a sequence derived from a natural methylase substrate, and is methylated at a position in which the natural methylase substrate is methylated. For example, enzymes methylate histone H3 on lysines 4, 9, 27, 36, and 79 (H3K4, H3K9, H3K27, H3K36, and H3K79). H4 is methylated on lysine 20 (H4K20). Thus, for example, a stimulating agent can include a methylated histone peptide comprising one or more of H3K4 ("H3K4 "refers to lysine 4 of an H3 histone polypeptide, wherein the numbering corresponds to position of lysine 4 in a wild type H3 histone polypeptide sequence), H3K9, H3K27, H3K36, H3K79, or H4K20.

[0086] In some embodiments, a stimulating agent comprises at least four amino acids of the following sequence of a histone H3 polypeptide, wherein the agent comprises a methylated lysine:

ARTKQTARKSTGGKAPRKQLATKAARKSAPATGESKKPHRYRPGTAALREIRRYQK

STEL (SEQ ID NO: 1). A stimulating agent can also include a peptide having one or more amino acid substitutions relative to SEQ ID NO:1 (e.g, substitutions at one, two, three, four, or five positions) other than at the methylated lysine residue. In some embodiments, a substitution is a substitution found in a histone H3 sequence of a non-human species. In some embodiments, a stimulating agent includes at least four amino acids of SEQ ID NO: 1 having an alanine to serine substitution at residue 31 (A3 IS).

[0087] In some embodiments, a stimulating agent comprises at least four amino acids of

SEQ ID NO:1, wherein the sequence includes one or more of H3 K4, K9, K27, K36, or K79.

For example, a stimulating agent comprising H3K4 can include one of the following sequences: ARTK (SEQ ID NO:6); ARTKQ (SEQ ID NO:7); ARTKQT (SEQ ID NO:8);

ARTKQTA (SEQ ID NO:9); ARTKQTAR (SEQ ID NO: 10); ARTKQTARK (SEQ ID

NO: 11); RTKQ (SEQ ID NO: 12); RTKQT (SEQ ID NO: 13); TKQT (SEQ ID NO: 14);

KQTA (SEQ ID NO: 15).

[0088] In some embodiments, a stimulating agent comprises an H3K27 sequence as follows: RKQLATKAAR(KMe3)SAPATGGVKKP (SEQ ID NO: 16). ("me3" denotes the presence of trimethylation on a lysine residue.)

[0089] In some embodiments, a stimulating agent comprises an H3K9 sequence as follows: ARTKQTAR[Rme3]STGGKAPRKQLA (SEQ ID NO: 17).

[0090] In some embodiments, a stimulating agent comprises at least four amino acids of the following sequence of a histone H4 polypeptide, wherein the agent comprises a methylated lysine:

SGRGKGGKGLGKGGAKRHRKVLRDNIQGITKPAIRRLARRGGVKRISGLIYEETRGV

LKV (SEQ ID NO:2). A stimulating agent can also include a peptide having one or more amino acid substitutions relative to SEQ ID NO:2 (e.g, substitutions at one, two, three, four, or five positions) other than at the methylated lysine residue. In some embodiments, a substitution is a substitution found in a histone H4 sequence of a non-human species (e.g., a

V21A or V21I substitution).

[0091] In some embodiments, a stimulating agent comprises an H4K20 sequence as follows: LGKGGAKRHR[Rme3]VLRDNIQGIT (SEQ ID NO: 18).

[0092] In some embodiments, a stimulating agent comprises at least four amino acids of the following sequence of a histone H 1.4 (HIe) polypeptide, wherein the agent comprises a methylated lysine: SETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELITKAVAASKERSGVS

LAA (SEQ ID NO:3). A stimulating agent can also include a peptide having one or more amino acid substitutions relative to SEQ ID NO:3 (e.g, substitutions at one, two, three, four, or five positions) other than at the methylated lysine residue. In some embodiments, a substitution is a substitution found in a histone HIe sequence of a non-human species.

In some embodiments, a stimulating agent comprises an H1.4K26 sequence as follows:

VKKKAR[Rme2] SAGAAKRKASG (SEQ ID NO: 19).

[0093] In some embodiments, a stimulating agent comprises at least four amino acids of the following sequence of a histone HIe polypeptide, wherein the agent comprises a methylated lysine:

SETAPAAPAAPAPAEKTPVKKKARKAAGGAKRKTSGPPVSELITKAVAASKERSGVS

LAA (SEQ ID NO:4).

[0094] A stimulating agent can also include a peptide having one or more amino acid substitutions relative to SEQ ID NO:4 (e.g, substitutions at one, two, three, four, or five positions) other than at the methylated lysine residue. In some embodiments, a substitution is a substitution found in a histone HIe sequence of a non-murine species.

[0095] Additional stimulating agents are described in the Examples herein.

[0096] Peptides can be produced by chemical synthesis or recombinant expression.

Peptides can be methylated by any available means (e.g., by chemical or enzymatic methods).

[0097] Stimulating agents can include modifications in addition (or alternative) to methylation, such as acetylation, phosphorylation, hydroxylation, glycosylation, sulfation, or lipidation.

[0098] A stimulating agent can be labeled, e.g., at its N-terminus, C-terminus, or internally. A label can be coupled to a stimulating agent directly or via a linker or spacer.

Useful labels include radioactive moieties, enzymes, and fluorescent moieties. In some embodiments, a stimulating agent is labeled with biotin.

Assays

[0099] Test compounds

[00100] The present disclosure provides assays for screening for a test compound, or more typically, a library or collection of test compounds, to evaluate an effect of the test compound on activity of a histone methyl modifying enzyme in vitro (e.g., on a methylase or a demethylase). [00101] A test compound can be the only substance assayed by a method described herein. Alternatively, a collection of test compounds can be assayed either consecutively or concurrently by methods described herein. Members of a collection of test compounds can be evaluated individually or in a pool, e.g., using a split-and-pool method.

[00102] In one embodiment, high throughput screening methods are used to screen a combinatorial chemical or peptide library, or other collection, containing a large number of potential HMME modulatory compounds (test compounds). Such "combinatorial chemical libraries" are then screened in one or more assays to identify those library members

(particular chemical species or subclasses) that display a desired characteristic activity.

Compounds thus identified can serve as conventional "lead compounds" or can themselves be used as potential or actual modulators (e.g., as therapeutics).

[00103] A combinatorial chemical library typically includes a collection of diverse chemical compounds, for example, generated by either chemical synthesis or biological synthesis, by combining a number of chemical "building blocks" such as reagents. For example, a linear combinatorial chemical library such as a polypeptide library may be formed by combining a set of chemical building blocks (amino acids), e.g., in particular specified arrangements or in every possible way for a given compound length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical compounds can be synthesized through such combinatorial mixing of chemical building blocks.

[00104] Preparation and screening of combinatorial chemical libraries is well known to those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, peptide libraries (see, e.g., U.S. Pat. No. 5,010,175, Furka, Int. J. Pept. Prot. Res. 37:487-493 (1991) and Houghton et al, Nature 354:84-88 (1991)). Other chemistries for generating chemical diversity libraries can also be used. Such chemistries include, but are not limited to: peptoids (e.g., PCT Publication No. WO 91/19735), encoded peptides (e.g., PCT Publication No. WO 93/20242), random bio-oligomers (e.g., PCT Publication No. WO 92/00091), benzodiazepines (e.g., U.S. Pat. No. 5,288,514), diversomers such as hydantoins,

benzodiazepines and dipeptides (Hobbs et al., Proc. Nat. Acad. Sci. USA 90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al., J. Amer. Chem. Soc. 114:6568 (1992)), nonpeptidal peptidomimetics with glucose scaffolding (Hirschmann et al., J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses of small compound libraries (Chen et al., J. Amer. Chem. Soc. 116:2661 (1994)), oligocarbamates (Cho et al., Science 261 :1303 (1993)), and/or peptidyl phosphonates (Campbell et al., J. Org. Chem. 59:658 (1994)), nucleic acid libraries (see Ausubel, Berger and Sambrook, all supra), peptide nucleic acid libraries (see, e.g., U.S. Pat. No. 5,539,083), antibody libraries (see, e.g., Vaughn et al, Nature Biotechnology, 14(3):309-314 (1996) and PCT/US96/10287), carbohydrate libraries (see, e.g., Liang et al., Science, 274:1520-1522 (1996) and U.S. Pat. No. 5,593,853), small organic molecule libraries (see, e.g., benzodiazepines, Baum C&EN, January 18, page 33 (1993); isoprenoids, U.S. Pat. No. 5,569,588; thiazolidinones and metathiazanones, U.S. Pat. No. 5,549,974; pyrrolidines, U.S. Pat. Nos. 5,525,735 and 5,519,134; morpholino

compounds, U.S. Pat. No. 5,506,337; benzodiazepines, U.S. Pat. No. 5,288,514, and the like). Additional examples of methods for the synthesis of molecular libraries can be found in the art, for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et al. (1994) Proc. Natl. Acad. Sci. USA 91 :11422; Zuckermann et al. (1994). J. Med. Chem.

37:2678; Cho et al. (1993) Science 261 :1303; Carrell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2059; Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop et al. (1994) J. Med. Chem. 37:1233.

[00105] Some exemplary libraries are used to generate variants from a particular lead compound. One method includes generating a combinatorial library in which one or more functional groups of the lead compound are varied, e.g., by derivatization. Thus, the combinatorial library can include a class of compounds which have a common structural feature (e.g., scaffold or framework).

[00106] Devices for the preparation of combinatorial libraries are commercially available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville Ky., Symphony, Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, Calif, 9050 Plus, Millipore, Bedford, MA). In addition, numerous combinatorial libraries are commercially available (see, e.g., ComGenex, Princeton, NJ, Asinex, Moscow, Ru, Tripos, Inc., St. Louis, MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, Pa., Martek Biosciences, Columbia, MD, etc.).

[00107] Test compounds can also be obtained from: biological libraries; peptoid libraries (libraries of molecules having the functionalities of peptides, but with a novel, non-peptide backbone which are resistant to enzymatic degradation but which nevertheless remain bioactive; see, e.g., Zuckermann, R. N. et al. (1994) J. Med. Chem. 37:2678-85); spatially addressable parallel solid phase or solution phase libraries; synthetic library methods requiring deconvolution; the "one-bead one-compound" library method; synthetic library methods using affinity chromatography selection, or any other source, including assemblage of sets of compounds having a structure and/or suspected activity of interest. Biological libraries include libraries of nucleic acids and libraries of proteins. Some nucleic acid libraries provide, for example, functional RNA and DNA molecules such as nucleic acid aptamers or ribozymes. A peptoid library can be made to include structures similar to a peptide library. (See also Lam (1997) Anticancer Drug Des. 12:145). A library of proteins may be produced by an expression library or a display library (e.g., a phage display library).

[00108] Libraries of compounds may be presented in solution (e.g., Houghten (1992) Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor (1993) Nature 364:555-556), bacteria (Ladner, U.S. Pat. No. 5,223,409), spores (Ladner U.S. Pat. No. 5,223,409), plasmids (Cull et al. (1992) Proc Natl Acad Sci USA 89:1865-1869) or on phage (Scott and Smith (1990) Science 249:386-390; Devlin (1990) Science 249:404-406; Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378-6382; Felici (1991) J. MoI. Biol. 222:301- 310; Ladner supra.).

Assay methods

[00109] Any assay herein, e.g., an in vitro assay, can be performed individually, e.g., just with the test compound, or with other appropriate controls. A "control" reaction is typically a reaction identical to a test reaction except for the change of a single parameter (or, in some cases, a small number of parameters). For example, a control reaction may be a parallel assay without a test compound, or a other parallel assay without one or more other reaction components, e.g., without a target or without a substrate. In some embodiments, it is possible to compare assay results to a reference, e.g., a reference value, e.g., obtained from the literature, a prior assay, and so forth. Appropriate correlations and art known statistical methods can be used to evaluate an assay result.

[00110] Once a compound is identified as having a desired effect (e.g., modulation of activity of a histone methyl modifying enzyme), production quantities of the compound can be synthesized, e.g., producing at least 50 mg, 500 mg, 5 g, or 500 g of the compound. The compound can be formulated, e.g., for administration to a subject, and may also be administered to the subject.

[00111] Activity of histone methyl modifying enzymes can be evaluated in an in vitro system. The effect of a test compound can be evaluated, for example, by measuring methylation of a substrate in the presence of a stimulating agent at the beginning of a time course, and then comparing such levels after a predetermined time (e.g., 0.1, 0.25, 0.5, 1, 1.5, 2, 2.5, 3, or more hours) in a reaction that includes the test compound and in a parallel control reaction that does not include the test compound. This is one example of a method for determining the effect of a test compound on enzyme activity in vitro using a stimulating agent as provided by the present disclosure. [00112] In general, an assay involves preparing a reaction mixture of a histone methyl modifying enzyme, a substrate, a stimulating agent, and one or more test compounds under conditions and for a time sufficient to allow components to interact. Methylation can be evaluated directly or indirectly.

[00113] In some embodiments, a component of an assay reaction mixture (e.g., a substrate) is anchored onto a solid phase. A component anchored on the solid phase can be detected at the end of a reaction, e.g., a methylase reaction. Any vessel suitable reactants can be used. Examples of suitable vessels include microtiter plates, test tubes, and micro-centrifuge tubes.

[00114] Activity of methyl modifying enzymes can be evaluated by any available means. In some embodiments, a methylation state of a substrate is evaluated by mass spectrometric analysis of a substrate. In some embodiments, methylation of a substrate is evaluated with an antibody specific for a methylated or demethylated substrate. Such antibodies are

commercially available (e.g., from Upstate Group, NY, or Abeam Ltd., UK). Suitable immunoassay techniques for detecting methylation state of a substrate include

immunoblotting, ELISA, and immunoprecipitation.

[00115] Methylation reactions can be carried out in the presence of a labeled methyl donor (e.g., a S- adenosyl-[methyl- 14 C]-L-methionine, or S-adenosyl-[methyl-Η]-L-methionine), allowing detection of label into a methylase substrate, or release of label from a demethylase substrate.

[00116] In some embodiments, activity of a methyl modifying enzyme is evaluated using fluorescence energy transfer (FET or FRET for fluorescence resonance energy transfer) (see, for example, Lakowicz et al., U.S. Pat. No. 5,631,169; Stavrianopoulos, et al., U.S. Pat. No. 4,868,103). A fluorophore label on a 'donor' (e.g., a DNA molecule of a nucleosome) is selected such that its emitted fluorescent energy will be absorbed by a fluorescent label on an 'acceptor' (e.g., an antibody specific for a histone methyl modification of interest), which in turn is able to fluoresce due to the absorbed energy. A reaction can be carried out using an unlabelled substrate, and histone modification is determined by detecting antibody binding using a fiuorimeter (see, U.S. Pat. Pub. 2008/0070257).

[00117] In some embodiments, demethylation is evaluated by direct or indirect detection of release of a reaction product such as formaldehyde and/or succinate. In some

embodiments, release of formaldehyde is detected. Release of formaldehyde can be detected using a formaldehyde dehydrogenase assay in which formaldehyde dehydrogenase converts released formaldehyde to formic acid using NAD + as electron acceptor. Reduction OfNAD + can be detected spectrophotometrically (Lizcano et al., Anal. Biochem. 286:75-79, 2000). In some embodiments, release of formaldehyde is detected by converting formaldehyde to 3,5- diacethyl- 1 ,4-dihydrolutidine (DDL) and detecting the DDL, for example, by detecting radiolabeled DDL (e.g., 3 H-DDL). A substrate can be labeled so that a labeled reaction product is released (e.g., formaldehyde and/or succinate) by a demethylation reaction. In some embodiments, a substrate is methylated with 3 H-SAM (S-adenosylmethionine), demethylation of which releases H-formaldehyde, which can detected directly, or which can be converted to 3 H-DDL, which is detected. Methods of detecting reaction products such as formaldehyde and/or succinate include mass spectrometry, gas chromatography, liquid chromatography, immunoassay, electrophoresis, and the like, and combinations thereof. Demethylase assays are also described in Shi et al, Cell 119:941-953, 2004.

[00118] An alternative means for detecting demethylase activity employs analysis of release of radioactive carbon dioxide. See, e.g., Pappalardi et al., Biochem. 47(43): 11165- 11167, 2008, and Supporting Information, which describes use of a radioactive assay in which capture of 14 CO 2 is captured and detected following release from α-[l- 14 C]- ketoglutaric acid coupled to hydroxylation reactions. Such methods can also be employed for detection of demethylation.

[00119] Detection of enzyme activity can include use of fluorescent, radioactive, scintillant, or other type of reagents. In some embodiments, a scintillation proximity assay is used for evaluating enzyme activity. Such assays can involve use of an immobilized scintillant (e.g., immobilized on a bead or microplate) and a radioactive methyl donor. In some embodiments, a scintillation proximity assay employs scintillant-coated microplates such as FlashPlates® (Perkin Elmer).

[00120] In some embodiments, components of an assay reaction mixture are conjugated to biotin and streptavidin. Biotinylated components (e.g., biotinylated substrate or biotinylated stimulating agent) can be prepared, e.g., using biotin-NHS (N-hydroxy-succinimide) according to known techniques (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL). Biotinylated components can be captured using streptavidin-coated beads or immobilized in the wells of streptavidin-coated plates (Pierce Chemical).

[00121] As would be appreciated by those of skill in the art, assays can employ any of a number of standard techniques for preparation and/or analysis of enzymatic activity, including but not limited to: differential centrifugation (see, for example, Rivas, G., and Minton, A. P., (1993) Trends Biochem Sci 18:284-7); chromatography (gel filtration chromatography, ion-exchange chromatography); electrophoresis (see, e.g., Ausubel, F. et al., eds. Current Protocols in Molecular Biology 1999, J. Wiley: New York.); and immunoprecipitation (see, for example, Ausubel, F. et al, eds. (1999) Current Protocols in Molecular Biology, J. Wiley: New York). Such resins and chromatographic techniques are known to one skilled in the art (see, e.g., Heegaard, N. H., (1998) J MoI Recognit 11 :141-8; Hage, D. S., and Tweed, S. A. (1997) J Chromatogr B Biomed Sci Appl. 699:499-525).

Further, fluorescence energy transfer may also be conveniently utilized, as described herein, to detect activity of histone methyl modifying enzymes.

[00122] Test compounds identified as enzyme modulators using in vitro assays described herein can be further evaluated in an animal model. An animal model can include a mammal (e.g., a mouse, rat, primate, or other non-human), or other organism (e.g., Xenopus, zebrafϊsh, or an invertebrate such as a fly or nematode). In some cases, an animal model uses a transgenic organism, e.g., an organism which includes a heterologous histone methyl modifying enzyme. A test compound can be administered to an animal once or as a regimen (regular or irregular). A parameter of the animal is then evaluated, e.g., a parameter of a pathway regulated by the histone methyl modifying enzyme, such as cell proliferation or differentiation. Test compounds that are indicated as of interest result in a change in the parameter relative to a reference, e.g., a parameter of a control animal. Other parameters (e.g., related to toxicity, clearance, and pharmacokinetics) can also be evaluated.

[00123] In some embodiment, a test compound is evaluated using an animal that has a particular disorder, e.g., a cell proliferative disorder, or using an animal that is otherwise sensitized to developing a particular disorder, e.g., a cell proliferative disorder.

[00124] Screening assays or any information described herein can be evaluated using standard statistical methods. For example, data can be expressed as mean ± SEM. Differences can be analyzed by ANOVA; significance can be assessed at the 95% and 99% significance levels by the Fisher PLSD statistical test or by the paired 2-tailed t test. Data involving more than 2 repeated measures can be assessed by repeated-measures ANOVA. Non-normally distributed data can be compared using the Mann- Whitney U test.

Exemplification

Example 1: High throughput demethylase assays

[00125] High throughput demethylase assays can be performed in the presence of a stimulating agent according to the following exemplary protocol.

[00126] Materials and Reagents: 1. E. coli BL21 (DE3) expressed Human GASCl (aal-350) enzyme (In House prep)

2. H3K9me3 peptide (New England Peptide, Gardner, MA)

3. TrisHCl (pH 7.4, at room temperature) (Cat# 4109-07, J.T. Baker Phillipsburg, NJ)

4. α-Ketoglutaric acid, sodium salt (Cat# K2010, Sigma Aldrich, St. Louis, MO)

5. (+)-Sodium L-ascorbate (Cat# A4034, Sigma Aldrich, St. Louis, MO)

6. Ammonium iron (II) sulfate hexahydrate (Cat# F1543, Sigma Aldrich, St. Louis, MO)

7. Glycerol (Cat# BP229, Fisher Scientific, Fair Lawn, NJ)

8. Triton X-IOO (Cat# T9284, Sigma Aldrich, St. Louis, MO)

9. Reaction Stop Mix - IN Hydrochloric acid (Cat# BDH3202-1, VWR, West Chester, PA)

10. Multidrop Combi (Cat# 5840300, Thermo Fisher Scientific, Waltham, MA)

[00127] General Procedure for use with Multidrop:

1. Prepare fresh co-factor stocks each day that reactions are to be run

2. Prepare the Reaction Mix with ascorbate (135ml), keep on ice:

11.25ml IM TrisHCl, pH 7.4

2.25ml 10OmM Ascorbate (made fresh)

22.5ml 50% Glycerol

1.125ml 2% Triton X-IOO

93.78ml deionized water

3. Prepare the Initiation Mix (100ml), keep at room temperature

250μl 2OmM Ammonium iron (II) sulfate hexahydrate

125μl 1OmM α-Ketoglutaric acid, sodium salt

99.625ml deionized water

4. Rinse the Multidrop by priming with 50ml Milli-Q water

5. Rinse the Multidrop by priming with 10ml IN HCl

6. Using the Multidrop, pre-quench any MIN control wells, dispense 25μl of IN HCl to each MIN well on the Thermo matrix 384 polypro 4314 plate.

7. Split the above Reaction Mix with ascorbate solution into 2 bottles, keep both on ice.

Use one 65ml aliquot of the Reaction Mix with ascorbate to wash the Multidrop machine prior to plating the reaction mix.

8. Rinse the Multidrop by priming with 100ml Milli-Q water

9. Rinse the Multidrop by priming with 50ml chilled Reaction Mix with ascorbate 10. To the other 65mL aliquot of the Reaction Mix with ascorbate add 1.0ml of 2.0ug/ul GASCl enzyme and 217ul of 1OmM H3K9me3 peptide just prior to dispensing the reaction mix to theplate. 65ml of Reaction mix with ascorbate, enzyme and

H3K9me3 peptide yields -4250 reactions.

11. Using the Multidrop dispense 15μl of Reaction Mix with ascorbate, enzyme, and peptide to each well on the plate. Once dispensing is complete, shake the plate for 5 seconds. Repeat with the next plate every 20 seconds. Be sure to make note of the order in which the plates are run through the Multidrop as this needs to be the same order in which the plates are initiated and quenched.

12. Rinse the Multidrop by priming with 50ml Milli-Q water.

13. Rinse the Multidrop by priming with 30ml Initiation Mix.

14. Start a timer at the initiation of Initiation Mix dispense. Using the Multidrop dispense lOμl of Initiation Mix to each well on the plate. Once dispensing is complete, shake the plate for 5 seconds. Repeat with the next plate every 20seconds. Be sure to make note of the order in which the plates are run through the Multidrop as this needs to be the same order in which the plates are quenched.

15. Let the initiated reaction run for 45 minutes.

16. Rinse the Multidrop by priming with 50ml Milli-Q water.

17. Rinse the Multidrop by priming with 10ml IN HCl

18. Once the reaction has completed, use the Multidrop to quench all of the wells except for the MIN control well previously quenched, dispense 25uL of IN HCl to each well on the plate. Be sure to quench each plate in the same order as they were initiated and in 20 second intervals.

19. Heat seal the plate and store at -8O 0 C.

[00128] Demethylation can be analyzed by mass spectrometry. The removal of a methyl group from a substrate such as H3K9me3 results in the loss of 15 mass units, to produce H3K9me2. Further demethylations of H3K9me2 yield losses of 14 mass units each. The difference in mass allows for quantitative determination of concentrations of analytes in complex mixtures.

Example 2: Stimulation of KDM4/JMJD2 demethylase family members

[00129] JMJD2A, JMJD2B and JMJD2C/GASC 1 proteins contain double PHD and Tudor domains in its C-terminus. The double Tudor domain of JMJD2A has been shown to specifically recognize H3K4me3 and H4K20me3 marks on histone H3 and H4 tails. It is likely that the double Tudor domains of JMJD2B and JMJD2C/GASC 1 preserve the same binding specificity. All JMJD2 family members have been shown to be H3K9me3 demethylases and JMJD2A and JMJD2B has also been shown to catalyze H3K36me3 demethylation in vitro.

[00130] In a peptide demethylation reaction, JMJD2C/GASC1 can utilize H3K9me3 and

H3K36me3 peptide as substrates and produce di-methylated lysine preferentially. The enzyme can also catalyze di to mono demethylation, but to a less robust extent. Since the H3 lysine 4 residue localizes in the same H3 polypeptide of H3 lysine 9 and H3 lysine 36, it was examined whether inclusion of an H3K4me3 mark on the peptide substrates stimulates

JMJD2C/GASC1 activity by promoting enzyme and substrate recognition.

[00131] Experiments: Flag tagged full length JMJD2C/GASC1 was purified from insect cells. The peptide substrates contain the amino acid sequence of 1-21 residues of Histone

H3, and trimethylation groups were introduced into the peptide substrates by chemical synthesis.

[00132] The following peptide was used as a substrate of JMJD2C/GASC1 enzyme activity:

[00133] H3 (1-21) K9me3 peptide: H2N-ARTKQTAR(KMe3)STGGKAPRKQLA-OH

(SEQ ID NO:20)

[00134] The following peptide was used as a candidate stimulating agent of

JMJD2C/GACS1 enzyme activity:

[00135] H3 (1-21) K4me3K9me3 peptide: H2N-

ART(KMe3)QTAR(KMe3)STGGKAPRKQLA-OH (SEQ ID NO:21)

[00136] Assays were performed as described in Example 1.

[00137] Result: A stimulating effect on JMJD2C/GASC1 demethylase activity was observed in the presence of a peptide having a trimethyl group on H3 lysine 4.

Example 3: Stimulation ofKDM5/JARIDl demethylase family members

[00138] JARID IA-D proteins contain multiple PHD domains, and the N-terminus PHD domain of JARID 1C/SMCX has been shown to specifically recognize H3K9me3 mark. It is likely that the corresponding PHD domains of the other family members preserve the same binding specificity. JARIDl family enzymes have been shown to be H3K4me3 demethylases in vitro. [00139] In a peptide demethylation reaction, JARID 1 C/SMCX can utilize H3K4me3 peptide as substrate and produce di-methylated lysine preferentially. The enzyme can also catalyze di to mono demethylation, but to a less robust extent. Since the H3 lysine 9 residue localizes in the same H3 polypeptide of H3 lysine 4, it was examined whether the presence of an H3K9me3 mark on the peptide substrates stimulates JARIDl C/SMCX activity by promoting enzyme and substrate recognition.

[00140] Experiments: Flag tagged full length JARID 1A/SMCX was purified from insect cells. The peptide substrates contain the amino acid sequence of 1-21 residues of Histone

H3, and trimethylation groups were introduced into the peptide substrates by chemical synthesis.

[00141] The following peptide was used as a substrate of JARIDl C/SMCX enzyme activity:

[00142] H3 1-21 H3K4me3 peptide: H2N-ART(KMe3)QTARKSTGGKAPRKQLA-OH

(SEQ ID NO:22)

[00143] The following peptide was used as a candidate stimulating agent:

[00144] H3 1-21 H3K4me3K9me3 peptide: H2N-

ART(KMe3)QTAR(KMe3)STGGKAPRKQLA-OH (SEQ ID NO:23)

[00145] Demethylation reactions were performed as described in Example 1.

[00146] Result: A stimulating effect on JARID 1 C/SMCX activity was observed in the presence of a peptide having a trimethyl group on H3 Iysine9.

Example 4: Stimulation of PHF 2, PHF 8 and KIAAl 718 demethylase family members

[00147] PHF2, PHF8 and KIAAl 718 proteins contain one N-terminus PHD domain. The N-terminus PHD domains are likely to bind H3K4me3 mark in histone H3 due to sequence similarity to known PHD domain recognizing H3K4me3 mark, such as BPTF and ING2. PHF8 and KIAA1718 has been shown to be H3K9me2 and H3K27me2 demethylases in vitro, respectively (unpublished observations).

[00148] In a peptide demethylation reaction, PHF8 can utilize H3K9meme2 peptide as substrate and produce mono-methylated lysine preferentially. The enzyme can also catalyze mono to zero demethylation, but to a less robust extent. Since the H3 lysine 4 residue localizes in the same H3 polypeptide of H3 lysine 9, it was examined whether inclusion of an H3K4me3 mark on the peptide substrates stimulates PHF8 activity by promoting enzyme and substrate recognition. [00149] Experiments: Flag tagged full length PHF8 was purified from insect cells. The peptide substrates contain the amino acid sequence of 1-21 residues of Histone H3, and trimethylation groups were introduced into the peptide substrates by chemical synthesis.

[00150] The following peptide was used as a substrate of PHF2/PHF8 enzyme activity:

[00151] H3 1-21 H3K9me2 peptide: H2N-ARTKQTAR(KMe2)STGGKAPRKQLA-OH

(SEQ ID NO:24)

[00152] The following peptide was used as a candidate stimulating agent:

[00153] H3 1-21 H3K4me3K9me2 peptide: H2N-

ART(KMe3)QTAR(KMe2)STGGKAPRKQLA-OH (SEQ ID NO:25)

[00154] Demethylation reactions were performed as described in Example 1.

[00155] Result: A stimulating effect on PHF2/PHF8 demethylase activity was observed in the presence of a peptide having a trimethyl group on H3 Iysine9.

Example 5: High throughput methylase assays

[00156] Poly comb repressive complex 2 (PRC2) is a multisubunit methylase complex that includes EZH2 (Enhancer of Zeste Homolog 2), EED, SUZ12, Rbap46, and Rbap48 subunits. Reconstituted PRC2 complexes (MW=600 kDa) were used in vitro assays to determine methylase activity in the presence of novel stimulating agents. A schematic depiction of a reconstituted PRC2 complex is shown in Figure IA. Silver staining and Western blot analysis are shown in Figure IB. Methylation of wt and K27A H3 substrates are shown in Figure 1C.

High throughput methylase assay

[00157] The following reaction mix was used for high throughput methylase assays:

[00158] 30 ul reaction mix

6.0 μl 5x HMT buffer

0.45 μl DTT 0.2M = 3 mM

1.0 μl 3 H-SAM (Perkin Elmer, 0.55 μCi/ μl) = 0.24 μM

1.0 μl H3K27me3 peptide [0. lmg/ml] = 1.24 μM

1.0 μl rPRC2 [0.319 mg/ml] = 17.7 nM

0.11 μl Bio/ Avi-oligonucleosomes [1.0 mg/ml] = 1.5 nM

0.24 ul DMSO/compounds = 0.79%

9.80 μl

20.2 μl water [00159] Reaction mixtures were incubated for 60 min. at 30 0 C in 384-well Black and White Microplates, Polystyrene (Greiner Bio-One Black FLUOTRAC 200 Medium Binding Nonsterile Greiner Bio-One No.781096, VWR Catalog# 82051-294). For detection, Streptavidin FlashPlate HTS PLUS were used (High Capacity, 384 well, Perkin Elmer product # SMP41000 IPK).

[00160] A workflow used for high throughput assays was as follows:

1. Preparation of assay plates using the ECHO ® (Labcyte)(240 nl of DMSO or compound per well)

2. Preparation of Mix I + II; I: 1202 x 15 μl; II: 1202 x 15.0 μl (dead volume -50 reactions)

Mix I

6.0 μl 5x HMT buffer

0.45 μl DTT 0.2M

1.0 μl rPRC2

1.0 μl H3K27me3 peptide

0.11 μl Bio/Avi-oligonuc . 1 :10

6.44 μl water

15.0 μl

Mix II

1.0 μl 3 H-SAM

14.0 μl water

15.0 μl

3. Adding Mix I to assay plate using the Multidrop (15 μl per well)

4. Adding Mix II to assay plate using the Multidrop (15 μl per well)

5. Mix content of wells in the assay plate

6. Spin plate for 1 min at 400 RPM

7. Incubate assay plate for 60 min at 30 0 C

8. Quench reaction by adding 30 μl of SAH [1.25 mM] to each well (mix); final cone. = 568 μM

9. Spin plate for 1 min at 400 RPM

10. Transfer reactions to FLASHplate using the BRAVO

11. Spin plate for 1 min at 400 RPM

12. Incubate FLASHplate at RT for 20 min under agitation

13. Remove liquid from all wells using the plate washer

14. Wash FLASHplate 2x with 60 μl wash buffer (20 mM Tris, pH8, 20OmM NaCl, 0.5% NP40) using the Multidrop 15. Remove wash buffer using plate washer

16. Analyze FLASHplate using the Topcount (2 min per well)

[00161] Methylase assays were performed in the presence and absence of rPRC2 using the following substrates: wt H3, H3K27A, Bio/Avi-H3, wt octamers, K27A octamers, and different concentrations of Bio/Avi-octamers. H3 methylation was analyzed by fluorography and TopCount, which is a scintillation proximity assay (SPA). The results for assays using these substrates are shown in Figures 2 A and 2B. The greatest degree of methylation was observed with the lower concentration of Bio/Avi-octamers, followed by Bio/Avi-H3, H3 wt, and higher concentrations of Bio/A vi-H3 octamers.

Oligonucleosome titration

[00162] Methylase assays were performed as described above, using Bio/Avi- oligonucleosomes at increasing concentrations. The results are shown in Figures 3A, 3B, and

3D. Km measurements of oligonucleosome methylase activity are shown in Figures 4A and

4B.

Example 6: Stimulation ofrPRC2 methylase family members

[00163] Stimulation of rPRC2 methylase activity was determined in the presence of unmodified H3, H3K4me3, H3K9me3, H3K27me3, H3K36me3, H3K79me3, H4K20me3, and Hl .4K26me3. Reactions had the following components:

Enzyme: 12.15 nM

[ 3 H]-SAM: 0.24 μM

DTT: 3 mM

Oligonucleosomes: 14.95 nM

Histone peptides: -1.86 μM

[00164] Peptides used in stimulation assays included the following:

[00165] H3K27me3 : H2N-RKQLATKAAR(KMe3)SAPATGGVKKP-COOH (SEQ ID NO:26)

[00166] H3K9me3-Bio ARTKQTAR[Kme3]STGGKAPRKQLA(-Biotin) (SEQ ID NO:27)

[00167] H4K20me3-Bio LGKGGAKRHR[Kme3]VLRDNIQGIT(-Biotin) (SEQ ID NO:28)

[00168] Hl .4K26me3-Bio VKKKAR[Kme2]SAGAAKRKASG(-Biotin) (SEQ ID NO:29) [00169] Reactions were incubated for 45 min. at 30 0 C. Reactions were stopped by the addition of 450 μM SAH (final concentration; total vol. = 60 μl). Reactions were incubated on FLASHplates for 45 min. and washed twice with 60 μl wash buffer.

[00170] As shown in Figure 5, H3K27me3 stimulated rPRC2 methylase activity toward oligonucleosomes over 15-fold (relative to unmodified H3). H3K9me3 stimulated activity over 9-fold. Stimulation by H3K4me3, H3K36me3, and H1.4K26me3 was also observed.

[00171] Additional reactions were performed in which methylase activity towards Bio/Avi-H3 or Bio-Avi-oligonucleosomes in the presence of H3K27me3, H3K27meO, H3K9me3, and H4K20me3 was compared. The results, depicted in Figures 6A-6D, show that H3K27me3 potently stimulated rPRC2 activity.

Stimulation by tri-, di-, and mono-methylated peptides

[00172] Results of a further experiment are shown in Figures 7 A and 7B. In this experiment, rPRC2 methylase activity toward Bio/Avi-H3 in the presence of H3K27me3, H3K27me2, H3K26mel, H3K27meO, H3K9me3, and H4K20me3 were compared. The results show that H3K27me3 peptides stimulate rPRC2 methylase activity approximately 11- fold. Dimethylated H3K27 peptide stimulates activity approximately 6-fold.

Monomethylated H3K27 stimulates activity approximately 4-fold. Other trimethylated H3 peptides also stimulate activity. Maximal H3K27me3 activity was observed between 0.1-1.0 mg/ml.

Enzyme titration and time course of stimulation

[00173] The time course of stimulation of rPRC2 methylation of Bio/Avi- oligonucleosomes by excess and limiting concentrations H3K27me3 peptides was examined. Results are shown in Figures 8 A and 8B, respectively. Reaction components and conditions are listed in Figure 8C. Km of rPRC2 was measured and is shown in Figure 9. Conditions and results of enzyme titration experiments using oligonucleosomes as substrate in the presence of H2K27me3 peptides are shown in Figures 1OA, 1OB and 1OC.

Example 7: Stimulation of NSD2 family members

[00174] Native NSD2 was purified from 293 cells (Figure 1 IA). Methylase activity of NSD2 was evaluated as described in Examples above for EZH2, and it was shown that NSD2 is active towards H3K36 (Figure 1 IB). NSD2 methylase activity was next tested in the presence of various histone peptides, including H3K9me2, H3K9me3, H3K18me3, H3K36me2, H3K36me3, HlK26me2, HlK26me3, and H3K79me2. Data are shown in Figures 12A and 12B. It was discovered that H3K36me2 and H3K36me3 stimulate NSD2 activity.

Equivalents

[00175] Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. The scope of the present invention is not intended to be limited to the above Description. Alternative methods and materials and additional applications will be apparent to one of skill in the art, and are intended to be included within the following claims: