Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
KRAB-CONTAINING ZINC FINGER PROTEIN AND CANCER
Document Type and Number:
WIPO Patent Application WO/2022/237974
Kind Code:
A1
Abstract:
The present invention provides methods for detecting cancer as well as agents and compositions for treating cancer by modulating the expression and/or activity of one or more i) KRAB-containing zinc finger protein (KZFP), ii) mRNA encoding a KZFP, and/or iii) KZFP gene.

Inventors:
TRONO DIDIER (CH)
MARTINS FILIPE (CH)
TURELLI PRISCILLA (CH)
Application Number:
PCT/EP2021/062606
Publication Date:
November 17, 2022
Filing Date:
May 12, 2021
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
ECOLE POLYTECHNIQUE FED LAUSANNE EPFL (CH)
International Classes:
C12N15/113; A61K31/7088
Domestic Patent References:
WO2020061391A12020-03-26
Foreign References:
US20160200829A12016-07-14
EP2757157A12014-07-23
US5075109A1991-12-24
US4452775A1984-06-05
US4675189A1987-06-23
US5736152A1998-04-07
US4667013A1987-05-19
US4748034A1988-05-31
US5239660A1993-08-24
US3832253A1974-08-27
US3854480A1974-12-17
US5133974A1992-07-28
US5407686A1995-04-18
US5539083A1996-07-23
US5578832A1996-11-26
US5556752A1996-09-17
US5510270A1996-04-23
US6028189A2000-02-22
US5545522A1996-08-13
US5891636A1999-04-06
US5716785A1998-02-10
Other References:
JOSEPH B. ADDISON ET AL: "KAP1 Promotes Proliferation and Metastatic Progression of Breast Cancer Cells", CANCER RESEARCH, vol. 75, no. 2, 24 November 2014 (2014-11-24), US, pages 344 - 355, XP055566880, ISSN: 0008-5472, DOI: 10.1158/0008-5472.CAN-14-1561
ITOKAWA Y ET AL: "KAP1-independent transcriptional repression of SCAN-KRAB-containing zinc finger proteins", BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, ELSEVIER, AMSTERDAM NL, vol. 388, no. 4, 30 October 2009 (2009-10-30), pages 689 - 694, XP026601501, ISSN: 0006-291X, [retrieved on 20090818], DOI: 10.1016/J.BBRC.2009.08.065
MEDUGNO L ET AL: "The Kruppel-like zinc-finger protein ZNF224 represses aldolase A gene transcription by interacting with the KAP-1 co-repressor protein", GENE, ELSEVIER AMSTERDAM, NL, vol. 359, 10 October 2005 (2005-10-10), pages 35 - 43, XP025394710, ISSN: 0378-1119, [retrieved on 20051010]
REDDY, ANUPAMA ET AL.: "Genetic and Functional Drivers of Diffuse Large B Cell Lymphoma", CELL, vol. 171, no. 2, 2017, pages 481 - 494
QIAN Z. M. ET AL.: "Targeted drug delivery via the transferrin receptor-mediated endocytosis pathway", PHARMACOLOGICAL REVIEWS, vol. 54, 2002, pages 561, XP008054257, DOI: 10.1124/pr.54.4.561
IOANNIDES C.G. ET AL.: "Inhibition of IL-2 receptor induction and IL-2 production in the human leukemic cell line Jurkat by a novel peptide inhibitor of protein kinase C", CELL IMMUNOL., vol. 131, 1990, pages 242, XP024005944, DOI: 10.1016/0008-8749(90)90250-U
KOLE H.K. ET AL.: "A peptide-based protein-tyrosine phosphatase inhibitor specifically enhances insulin receptor function in intact cells", J. BIOL. CHEM., vol. 271, 1996, pages 14302
FUTAKI S. ET AL.: "Arginine-rich peptides. An abundant source of membrane-permeable peptides having potential as carriers for intracellular protein delivery", J. BIOL. CHEM., vol. 276, 2001, pages 5836, XP002236577, DOI: 10.1074/jbc.M007540200
SUNG, Y.KIM, S.: "Recent advances in the development of gene delivery systems", BIOMATER RES, vol. 23, 2019, pages 8
SAMBROOK ET AL., MOLECULAR CLONING: A LABORATORY MANUAL, 2001
INNIS ET AL.: "PCR Protocols: A Guide To Methods And Applications", 1990, ACADEMIC PRESS INC.
FROEHLER ET AL., NUCLEIC ACID RES., vol. 14, 1986, pages 5399 - 5407
MCBRIDE ET AL., TETRAHEDRON LETT., vol. 24, 1983, pages 246 - 248
REDDY ET AL., GENETIC AND FUNCTIONAL DRIVERS OF DIFFUSE LARGE B CELL LYMPHOMA
Attorney, Agent or Firm:
KATZAROV S.A. (CH)
Download PDF:
Claims:
CLAIMS

1. An agent modulating the expression and/or activity of one or more i) KRAB- containing zinc finger protein (KZFP), ii) mRNA encoding a KZFP, and/or iii) KZFP gene, for use in the treatment and/or prevention of cancer and/or cancer metastasis in a subject in need thereof.

2. The agent for use of claim 1, wherein said KZFP is selected from the group comprising ZNF671, ZNF776, ZNF586, ZNF552, ZNF587B, ZNF587, ZNF814 and ZNF417, or a combination of one or more thereof.

3. The agent for use of claim 1 or 2, wherein said agent inhibits the translation of one or more RNA encoding a KZFP.

4. The agent for use of claim 1 or 2, wherein the agent inhibits the transcription of one or more KZFP gene encoding a KZFP.

5. The agent for use of claim 1 or 2, wherein the agent modulates the activity of one or more KZFP protein by enhancing and/or favoring the degradation of said one or more KZFP, by preventing the recognition of its targets or by any other mean resulting in a loss of function.

6. The agent for use of any one of the preceding claims, wherein the agent is selected from the group comprising a nucleic acid, a chemical compound, a peptide or analog thereof, an antibody or an antigen-binding fragment thereof, and an antibody mimetic, or a combination of one or more thereof.

7. The agent of claim 6, wherein the nucleic acid is selected from the group comprising a nucleic acid encoding an siRNA, an miRNA, a piRNA, an hnRNA, an snRNA, an sg RNA, a CRISPR-based loss-of-function system, an esiRNA, an shRNA, and an antisense oligonucleotide, or a combination of one or more thereof.

8. A plasmid or a vector comprising one or more nucleic acid(s) encoding the siRNA, miRNA , piRNA, hnRNA, snRNA, sg RNA, CRISPR-based loss-of-function system, esiRNA, shRNA, and antisense oligonucleotide, or combination of one or more thereof, of claim 6 or 7.

9. A host cell comprising, or modified by the introduction of, i) a plasmid or vector of claim 8, or ii) one or more nucleic acid(s) encoding the siRNA, miRNA , piRNA, hnRNA, snRNA, sg RNA, CRISPR-based loss-of-function system, esiRNA, shRNA, and antisense oligonucleotide, or combination of one or more thereof, of claim 7.

10. A pharmaceutical composition comprising i) a therapeutically effective amount of an agent modulating the expression and/or activity of one or more i) KZFP, ii) mRNA encoding a KZFP and/or iii) KZFP gene, of anyone of claims 1 to 6, or ii) a plasmid or a vector of claim 8, or iii) a host cell of claim 9, and a pharmaceutically acceptable carrier or diluent.

11. The pharmaceutical composition of claim 10, for use in the treatment and/or prevention of cancer and/or cancer metastasis in a subject in need thereof.

12. The pharmaceutical composition of claim 10 or 11, further comprising one or more anti -cancer therapy.

13. A method of diagnosing cancer and/or cancer metastasis in a subject comprising:

(a) detecting, directly or indirectly, and measuring the level of transcription and/or expression and/or activity of one or more KZFP in a sample obtained from said subject;

(b) comparing the level of transcription and/or expression and/or activity of said one or more KZFP to a control biological sample for the same one or more KZFP; wherein a differential transcription and/or expression and/or activity of one or more one or more KZFP in said biological sample, relative to the level of corresponding said one or more KZFP in a control biological sample of a cancer-free subject, is indicative of the subject having cancer and/or cancer metastasis.

14. The method of claim 13, wherein the differential transcription and/or expression and/or activity level of one or more one or more KZFP corresponds to an upregulated expression of said one or more KZFP. 15. The method of claim 14, wherein the upregulated transcription and/or expression and/or activity of said one or more KZFP in a biological sample corresponds to an increase equal or superior to about 5 %, preferably equal or superior to about 20 %, more preferably equal or superior to about 40 %, most preferably equal or superior to about 60 %, more preferably equal or superior to about 500%, even more preferably equal or superior to about 1000 %, in particular equal or superior to about 5000 % when compared to the level of corresponding said one or more KZFP in a control biological sample of a cancer-free subject.

16. The method of any one of claims 13-15, wherein the biological sample is selected from the group comprising whole blood, serum, plasma, semen, saliva, tears, urine, fecal material, sweat, buccal smears, skin, and cancer cells or a combination of one or more of these biological samples.

Description:
KRAB-containing zinc finger protein and Cancer

FIELD OF THE INVENTION

The present invention provides methods for detecting cancer as well as agents and compositions for treating cancer by modulating the expression and/or activity of one or more i) KRAB-containing zinc finger protein (KZFP), ii) mRNA encoding a KZFP, and/or iii) KZFP gene.

SEQUENCE LISTING

The instant application contains a Sequence Listing, named PAT7576PC00 ST25, which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety.

BACKGROUND OF THE INVENTION

DLBCL (diffuse large B cell lymphoma) is the most frequently diagnosed aggressive non-Hodgkin lymphoma (NHL). It comprises several histological variants which can be grouped into three main molecular subtypes based on gene expression signatures, namely germinal center (GCB), activated B-cell (ABC) and unclassified (UNCL) DLBCL. The distinction between these three groups carries prognostic and therapeutic value. Indeed, ABC DLBCL has a worse prognosis than its GCB or UNCL counterparts as it is often refractory to first line standard treatments and prone to relapse. The standard of care for first-line therapy in DLBCL is the R-CHOP (rituximab, cyclophosphamide, doxorubicin, vincristine and prednisone) regimen.

Research on drug resistance has so far mainly focused on genetic alterations, but recent data points to the role of epigenetic changes. For instance, anthracycline-resistance has been linked to silencing of the transcription factor SMADl by DNA methylation of its promoter, resulting in impaired activation of DNA damage response genes. Accordingly, inhibitors of DNA methylation, such as 5-azacytidine, are currently coupled with R-CHOP in some clinical trials.

Although advances in therapy over the last 20 years have led to improvements in cancer survival, additional strategies are still urgently needed. SUMMARY OF THE INVENTION

The present invention provides an agent modulating the expression and/or activity of one or more i) KRAB -containing zinc finger protein (KZFP), ii) mRNA encoding a KZFP, and/or iii) KZFP gene, for use in the treatment and/or prevention of cancer and/or cancer metastasis in a subject in need thereof.

The present invention further provides a plasmid or a vector comprising one or more nucleic acid(s) encoding the siRNA, miRNA, piRNA, hnRNA, snRNA, sg RNA, CRISPR- based loss-of-function system, esiRNA, shRNA, and antisense oligonucleotide, or combination of one or more thereof, of the invention.

The present invention provides a host cell comprising, or modified by the introduction of, i) a plasmid or vector of the invention, or ii) one or more nucleic acid(s) encoding the siRNA, miRNA , piRNA, hnRNA, snRNA, sg RNA, CRISPR-based loss-of-function system, esiRNA, shRNA, and antisense oligonucleotide, or combination of one or more thereof, of the invention.

The present invention provides a pharmaceutical composition comprising i) a therapeutically effective amount of an agent modulating the expression and/or activity of one or more i) KZFP, ii) mRNA encoding a KZFP and/or iii) KZFP gene, of anyone of the invention, or ii) a plasmid or a vector of the invention, or iii) a host cell of the invention, and a pharmaceutically acceptable carrier or diluent.

The present invention provides a method of diagnosing cancer and/or cancer metastasis in a subject comprising:

(a) detecting, directly or indirectly, and measuring the level of expression of one or more KZFP in a sample obtained from said subject;

(b) comparing the level of expression of said one or more KZFP to a control biological sample for the same one or more KZFP; wherein an alteration in the expression level of one or more one or more KZFP in said biological sample, relative to the level of corresponding said one or more KZFP in a control biological sample of a cancer-free subject, is indicative of the subject having cancer and/or cancer metastasis.

DESCRIPTION OF THE FIGURES

Figure 1. (a) Results of a transcriptome-wide association study (TWAS) using univariate Cox regression analysis to correlate the expression of genes/pseudogenes with an average expression >1 counts per million (CPM) with survival, in a cohort 630 DLBCL patients whose lymphoma samples were sequenced at the Duke University (North Carolina, Reddy et al. Cell. 2017). Out of 23’560 included genes, 1530 were significantly associated with survival (p adj <0.05, light grey dots). Amongst the latter, 41 were KRAB-ZnF genes (darkred dots) with a skewing of these towards an increased risk of death (b) KRAB-ZFP genes carry a significantly higher proportion of adverse prognosis-associated genes compared to other TFs and protein coding genes.

Figure 2. (a) 38 out of the 39 KRAB-ZnF genes associated with poor prognosis were upregulated in ABC DLBCL aggressive disease subtype. Although upregulated in ABC DLBCL, these KRAB ZnF genes conserved their significancy after including cell of origin as a covariate in our Cox regression analysis, suggesting that they are not solely a surrogate feature of the intrinsic aggressivity of the ABC subtype (b) 32 (82%) of these KRAB-ZnF genes were located on chromosome 19, suggesting a non-random distribution, as only 56% of the members of KRAB-ZnF gene family are present on this chromosome. Most highly expressed poor prognosis KRAB ZnF genes were enriched in the subtelomeric region of the long arm of chromosome 19 (Chrl9q), a region reported as recurrently amplified in ABC disease subtype (Lem et al ., 2008) and associated to an increased risk of relapse {Dubois et al ., 2019). A 200 kb region defining a hub of closely distributed poor prognosis-associated KRAB ZnF genes embedding the most highly expressed ones (ZNF586, ZNF587, ZNF587B) together with the respective paralogs (ZNF417, ZNF814) of the latter were selected for further in vitro studies.

Figure 3. (a) The downregulation of ZNF586 in U2932 ABC and OCI-Ly7 GCB cell lines using two different shRNAs has a clear negative effect on cell growth (b) The tandem downregulation of the ZNF587 and ZNF417 paralog pair also decreases lymphoma cell proliferation. The importance of this negative effect doesn’t seem to be related to the origin of lymphoma cells, suggesting that its targeting may also be active in GCB DLBCL. (c)

After 2 days of puromycin selection in the case of OCI-Ly7 (total of 4 days of knock down) and 3 days in the case of U2932 (total of 5 days of knock down) transduced cells w ere seeded for another 5-6 days of proliferation assessment. These assays showed a durable and profound arrest of cell proliferation.

Figure 4. Apoptotic studies on the U2932 cell line revealed that, in parallel to cell proliferation arrest, knocked down cells demonstrated an increase in cell death in comparison to control cells (Fig. 4 A & B). Annexin V/propidium iodide (PI) staining assessed by flux cytometry revealed that this the burden of this cell death, w hich apparently of the necrotic form, started to surge after 4 days of gene silencing (Fig. 4 C).

Figure 5. (a) Gene set expression analysis (GSEA) of the genes upregulated in arrested cells surviving to ZNF587/ZNF417 conjugated knock down compared to control cells revealed enrichments in pathways implicated in proteolysis, endoplasmic reticulum-associated protein degradation (ERAD), antigen processing, Major histocompatibility complex (MCH) class I presentation and cytokine production, suggestive of an increased immunogenicity of these cells (b) Volcano and bar-/plots showing a significant upregulation of MHC class I A, B, C, E, F, G subunits, together with their common beta2-microglobulm (B2M) subunit in knocked down U2932 lymphoma cells compared to controls. In addition, CD80, a surface transmembrane protein activator of cytotoxic CD8 T cells is also upregulated in this context (c) In human DLBCL samples (n=630), MHC class I components expression tend to be negatively correlated to ZNF587/417 gene expression, suggesting that ZNF587 and ZNF417 may contribute to immune escape of lymphoma cells.

Figure 6. (a) The transcriptome of U2932 (and OCT-Ly7, data not shown) KD cells revealed features of partial cell differentiation, such as an upregulation of immunoglobulin heavy chains and of the master regulator of plasma cell differentiation PRDM1 (BLIMP-1), suggesting that ZNF587/417 may contribute to differentiation blockade of lymphoma cells (b) Western blotting of phosphorylated gH2AX showing its increase upon Z9NF587/417 conjugated KD. This suggests that their silencing generates replicative stress, thus contributing to cell cycle arrest and cell death phenotypes.

Figure 7. (a) The negative effect of ZNF586 KD is not restricted to lymphoma cell lines. When tested on HL60 and HEK293T cell lines, these were also impacted. However, the HT29 colon cancer cell line was spared from this effect (b) In the same way, the negative effect of ZNF587/417 conjugated KD impacted also HL60 and HEK293T cells, but in this case, the negative effect was present in all 3 colon cancer cell lines tested (i.e. Lovo, HT29 and HCT116) cell lines. Interestingly, the K562 CML cell line was not impacted by the silencing of none of these KZF genes, suggesting that this cell line possesses intrinsic resistance mechanisms that will deserve further exploration.

DESCRIPTION OF THE INVENTION

Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. The publications and applications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. In addition, the materials, methods, and examples are illustrative only and are not intended to be limiting.

In the case of conflict, the present specification, including definitions, will control. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in art to which the subject matter herein belongs. As used herein, the following definitions are supplied in order to facilitate the understanding of the present invention.

The term "comprise/comprising" is generally used in the sense of include/including, that is to say permitting the presence of one or more features or components. The terms "comprise(s)" and "comprising" also encompass the more restricted ones "consist(s)", "consisting" as well as "consist/consi sting essentially of', respectively.

As used in the specification and claims, the singular form "a", "an" and "the" include plural references unless the context clearly dictates otherwise.

As used herein, "at least one" means "one or more", "two or more", "three or more", etc.

As used herein the terms "subject"/" subject in need thereof, or "patienfV'patient in need thereof " are well-recognized in the art, and, are used interchangeably herein to refer to a mammal, including dog, cat, rat, mouse, monkey, cow, horse, goat, sheep, pig, camel, and, most preferably, a human. In some cases, the subject is a subject in need of treatment or a subject with a disease or disorder. However, in other aspects, the subject can be a normal subject. The term does not denote a particular age or sex. Thus, adult and newborn subjects, whether male or female, are intended to be covered. Preferably, the subject is a human, most preferably a human suffering from cancer and/or cancer metastasis or a human that might be at risk of suffering from cancer and/or cancer metastasis.

The terms "nucleic acid", "polynucleotide," and "oligonucleotide" are used interchangeably and refer to any kind of deoxyribonucleotide (e.g. DNA, cDNA, ...) or ribonucleotide (e.g. RNA, mRNA, ...) polymer or a combination of deoxyribonucleotide and ribonucleotide (e.g. DNA/RNA) polymer, in linear or circular conformation, and in either single - or double - stranded form. These terms are not to be construed as limiting with respect to the length of a polymer and can encompass known analogues of natural nucleotides, as well as nucleotides that are modified in the base, sugar and/or phosphate moieties (e.g. phosphorothioate backbones). In general, an analogue of a particular nucleotide has the same base-pairing specificity, i.e., an analogue of A will base-pair with T.

The term "vector", as used herein, refers to a viral vector or to a nucleic acid (DNA or RNA) molecule such as a plasmid or other vehicle, which contains one or more heterologous nucleic acid sequence(s) of the invention and, preferably, is designed for transfer between different host cells. The terms "expression vector", “gene delivery vector” and "gene therapy vector" refer to any vector that is effective to incorporate and express one or more nucleic acid(s) of the invention, in a cell, preferably under the regulation of a promoter. A cloning or expression vector may comprise additional elements, for example, regulatory and/or post- transcriptional regulatory elements in addition to a promoter.

The term “about,” particularly in reference to a given quantity, number or percentage, is meant to encompass deviations of plus or minus ten percent (± 10). For example, about 5% encompasses any value between 4.5% to 5.5%, such as 4.5, 4.6, 4.7, 4.8, 4.9, 5, 4.1, 5.2, 5.3, 5.4, or 5.5.

According to the present invention, the cancer is a solid or a non-solid cancer. The cancer will be selected from the non-limiting group comprising carcinoma, sarcoma, melanoma, lymphoma, and leukemia. Preferably, the cancer is selected from the group comprising Acute Lymphoblastic Leukemia (ALL), Acute Myeloid Leukemia (AML), Adrenocortical Carcinoma, Adult, Childhood Adrenocortical Carcinoma, AIDS-Related Cancers, Kaposi Sarcoma (Soft Tissue Sarcoma), AIDS-Related Lymphoma (Lymphoma), Primary CNS Lymphoma (Lymphoma), Anal Cancer, Astrocytomas, Childhood (Brain Cancer), Atypical Teratoid/Rhabdoid Tumor, Childhood, Central Nervous System (Brain Cancer), Basal Cell Carcinoma of the Skin, Bile Duct Cancer, Bladder Cancer, Bone Cancer (includes Ewing Sarcoma and Osteosarcoma and Malignant Fibrous Histiocytoma), Brain Tumors, Breast Cancer, Childhood Breast Cancer, Childhood Bronchial Tumors, Burkitt Lymphoma, Carcinoid Tumor (Gastrointestinal), Childhood Carcinoid Tumors, Carcinoma of Unknown Primary, Cardiac (Heart) Tumors, Central Nervous System, Childhood Atypical Teratoid/Rhabdoid Tumor, Childhood Embryonal Tumors, Childhood Germ Cell Tumor, Primary CNS Lymphoma, Cervical Cancer, Childhood Cervical Cancer, Cholangiocarcinoma, Chordoma, Childhood, Chronic Lymphocytic Leukemia (CLL), Chronic Myelogenous Leukemia (CML), Chronic Myeloproliferative Neoplasms, Colorectal Cancer, Childhood Colorectal Cancer, Childhood Craniopharyngioma, Cutaneous T-Cell Lymphoma, Ductal Carcinoma In Situ (DCIS), Embryonal Tumors, Endometrial Cancer (Uterine Cancer), Childhood Ependymoma, Esophageal Cancer, Childhood Esophageal Cancer, Esthesioneuroblastoma, Ewing Sarcoma (Bone Cancer), Childhood Extracranial Germ Cell Tumor, Extragonadal Germ Cell Tumor, Eye Cancer, Childhood Intraocular Melanoma, Intraocular Melanoma, Retinoblastoma, Fallopian Tube Cancer, Fibrous Histiocytoma of Bone (Malignant, and Osteosarcoma), Gallbladder Cancer, Gastric, (Stomach) Cancer, Childhood Gastric (Stomach) Cancer, Gastrointestinal Carcinoid Tumor, Gastrointestinal Stromal Tumors (GIST) (Soft Tissue Sarcoma), Childhood Gastrointestinal Stromal Tumors, Germ Cell Tumors, Childhood Central Nervous System Germ Cell Tumors (Brain Cancer), Childhood Extracranial Germ Cell Tumors, Extragonadal Germ Cell Tumors, Ovarian Germ Cell Tumors, Testicular Cancer, Gestational Trophoblastic Disease, Hairy Cell Leukemia, Head and Neck Cancer, Childhood Head and Neck Cancers, Childhood Heart Tumors, Hepatocellular (Liver) Cancer, Langerhans Cell Histiocytosis, Hodgkin Lymphoma, Hypopharyngeal Cancer (Head and Neck Cancer), Intraocular Melanoma, Childhood Intraocular Melanoma, Islet Cell Tumors, Pancreatic Neuroendocrine Tumors, Kaposi Sarcoma (Soft Tissue Sarcoma), Kidney (Renal Cell) Cancer, Langerhans Cell Histiocytosis, Laryngeal Cancer (Head and Neck Cancer), Childhood Laryngeal Cancer, Papillomatosis, Leukemia, Lip and Oral Cavity Cancer (Head and Neck Cancer), Liver Cancer, Lung Cancer (Non-Small Cell and Small Cell), Childhood Lung Cancer, Lymphoma, Male Breast Cancer, Malignant Fibrous Histiocytoma of Bone and Osteosarcoma, Melanoma, Childhood Melanoma, Intraocular (Eye)Melanoma, Childhood Intraocular Melanoma, Merkel Cell Carcinoma (Skin Cancer), Malignant Mesothelioma, Childhood Mesothelioma , Metastatic Squamous Neck Cancer with Occult Primary (Head and Neck Cancer), Midline Tract Carcinoma Involving NUT Gene, Mouth Cancer (Head and Neck Cancer), Multiple Endocrine Neoplasia Syndromes, Multiple Myeloma/Plasma Cell Neoplasms, Mycosis Fungoides (Lymphoma), Myelodysplasia Syndromes, Myelodysplastic/Myeloproliferative Neoplasms, Myelogenous Leukemia, Chronic (CML), Myeloid Leukemia, Acute (AML), Chronic Myeloproliferative Neoplasms, Nasal Cavity and Paranasal Sinus Cancer (Head and Neck Cancer), Nasopharyngeal Cancer (Head and Neck Cancer), Childhood Nasopharyngeal Cancer, Neuroblastoma, Non-Hodgkin Lymphoma (NHL) such as diffuse large B cell lymphoma, Non-Small Cell Lung Cancer, Oral Cancer, Lip and Oral Cavity Cancer and Oropharyngeal Cancer (Head and Neck Cancer), Childhood Oral Cavity Cancer, Osteosarcoma and Malignant Fibrous Histiocytoma of Bone, Ovarian Cancer, Childhood Ovarian Cancer, Pancreatic Cancer, Childhood Pancreatic Cancer, Pancreatic Neuroendocrine Tumors (Islet Cell Tumors), Papillomatosis, Paraganglioma, Childhood Paraganglioma, Paranasal Sinus and Nasal Cavity Cancer (Head and Neck Cancer), Parathyroid Cancer, Penile Cancer, Pharyngeal Cancer (Head and Neck Cancer), Pheochromocytoma, Childhood Pheochromocytoma, Pituitary Tumor, Plasma Cell Neoplasm/Multiple Myeloma, Pleuropulmonary Blastoma, Pregnancy and Breast Cancer, Primary Central Nervous System (CNS) Lymphoma, Primary Peritoneal Cancer, Prostate Cancer, Rectal Cancer, Recurrent Cancer, Renal Cell (Kidney) Cancer, Retinoblastoma, Childhood Rhabdomyosarcoma (Soft Tissue Sarcoma), Salivary Gland Cancer (Head and Neck Cancer), Childhood Salivary Gland Tumors, Sarcoma, Childhood Rhabdomyosarcoma (Soft Tissue Sarcoma), Childhood Vascular Tumors (Soft Tissue Sarcoma), Ewing Sarcoma (Bone Cancer), Kaposi Sarcoma (Soft Tissue Sarcoma), Osteosarcoma (Bone Cancer), Uterine Sarcoma, Sezary Syndrome (Lymphoma), Skin Cancer, Childhood Skin Cancer, Small Cell Lung Cancer, Small Intestine Cancer, Soft Tissue Sarcoma, Squamous Cell Carcinoma of the Skin, Squamous Neck Cancer with Occult Primary, Metastatic (Head and Neck Cancer), Stomach (Gastric) Cancer, Childhood Stomach (Gastric) Cancer, T-Cell Lymphoma, Cutaneous, Testicular Cancer, Childhood Testicular Cancer, Throat Cancer (Head and Neck Cancer), Nasopharyngeal Cancer, Oropharyngeal Cancer, Hypopharyngeal Cancer, Thymoma and Thymic Carcinoma, Thyroid Cancer, Childhood Thyroid Tumors, Transitional Cell Cancer of the Renal Pelvis and Ureter (Kidney (Renal Cell) Cancer), Carcinoma of Unknown Primary, Childhood Cancer of Unknown Primary, Unusual Cancers of Childhood, Ureter and Renal Pelvis, Transitional Cell Cancer (Kidney (Renal Cell) Cancer, Urethral Cancer, Endometrial Uterine Cancer, Uterine Sarcoma, Vaginal Cancer, Childhood Vaginal Cancer, Vascular Tumors (Soft Tissue Sarcoma), Vulvar Cancer, and Wilms Tumor or a combination of one of more of these cancers

More preferably, the cancer is a non-solid/hematological cancer , most preferably, the cancer is aggressive non-Hodgkin lymphoma.

KRAB-containing zinc finger proteins (KZFPs) constitute a large family of transcription factors notably involved in controlling the expression of transposable element-embedded regulator sequences (TEeRS). KZFPs and their genomic targets contribute to regulating many biological events, from early embryogenesis to brain development, and from immune responses to liver metabolism.

While focusing on the role of these KZFP, the Inventors surprisingly discovered the role of several KZFP, specifically the role of a set of 8 KZFP genes (ZNF671, ZNF776, ZNF586, ZNF552, ZNF587B, ZNF587, ZNF814 and ZNF417), clustering over a 200 kb region on the sub-telomeric region of human Chrl9q (chrl9:57, 716, 203-57, 919,117), the expression of which, individually or in combination, were associated with poor cancer outcome.

Table 1

In one aspect, the KZFP will be selected from the group comprising ZNF671, ZNF776, ZNF586, ZNF552, ZNF587B, ZNF587, ZNF814 and ZNF417, or a combination of one or more thereof, e.g. two or more, three or more, four or more, five or more, six or more, seven or more, or the complete set of the 8 KZFPs.

Non-limiting examples of a combination of one or more KZFPs comprise a combination of i) ZNF586, ZNF587, ii) ZNF417, ZNF587, iii) ZNF552-ZNF587B-ZNF814, and iv) ZNF814, ZNF671, ZNF552, ZNF776.

The present invention thus provides an agent modulating the expression and/or activity of one or more i) KRAB -containing zinc finger protein (KZFP), for use in the treatment and/or prevention of cancer and/or cancer metastasis in a subject in need thereof. Preferably, the agent of the invention modulates the activity of the one or more KZFP protein by enhancing and/or favoring the degradation of said one or more KZFP (Sievers et al.,

2018). Usually, the one or more KZFP protein will be selected from the group comprising SEQ ID No. 2, SEQ ID No. 4, SEQ ID No. 6, SEQ ID No. 8, SEQ ID No. 10, SEQ ID No.

12, SEQ ID No. 14, and SEQ ID No. 16, a fragment or a variant thereof or a combination of one or more thereof. Non-limiting examples of a combination of one or more KZFPs comprise a combination of i) ZNF586 protein and ZNF587 protein, ii) ZNF417 protein and ZNF587 protein, iii) ZNF552 protein, ZNF587B protein and ZNF814 protein, and iv) ZNF814 protein, ZNF671 protein, ZNF552 protein, and ZNF776 protein, as well as a fragment or a variant of any one of these combinations.

In other aspects, the agent inhibits and/or impairs the binding of the KZFP to a target genomic DNA (e.g. by targeting the DNA comprising transposable element-embedded regulator sequences (TEeRS) or the zinc-finger motif of the C2H2 zinc finger motif and/or leading to proteasomal degradation), and/or the agent inhibits or impairs the binding of KZFP to its interacting protein(s) or other molecule(s) via which it controls heterochromatin and DNA methylation (e.g. scaffold protein KAPl). In other aspects, the agent is an antibody, or antigenbinding fragment thereof, that i) inhibits and/or impairs the binding of the KZFP to a target genomic DNA (e.g. by specifically targeting the zinc-finger motif of the C2H2 zinc finger motif), and/or the binding of KZFP to its interacting protein(s) or other molecule(s).

As used herein, an “antibody” is a protein molecule that reacts with a specific antigenic determinant or epitope and belongs to one or five distinct classes based on structural properties: IgA, IgD, IgE, IgG and IgM. The antibody may be a polyclonal (e.g. a polyclonal serum) or a monoclonal antibody, including but not limited to fully assembled antibody, single chain antibody, antibody fragment, and chimeric antibody, humanized antibody as long as these molecules are still biologically active and still bind to at least one peptide or protein of the invention. Preferably the antibody is a monoclonal antibody. Preferably also the monoclonal antibody will be selected from the group comprising the IgGl, IgG2, IgG2a, IgG2b, IgG3 and IgG4 or a combination thereof. Most preferably, the monoclonal antibody is selected from the group comprising the IgGl, IgG2, IgG2a, and IgG2b, or a combination thereof.

An “antigen binding fragment" comprises a portion of a full-length antibody. Examples of antigen binding fragments include Fab, Fab', F(ab')2, and Fv fragments; diabodies; linear antibodies; single-chain antibody molecules; and multispecific antibodies formed from antibody fragments.

The present invention also provides an agent modulating the expression and/or activity of one or more transcript, preferably one or more mRNA, encoding a KZFP, for use in the treatment and/or prevention of cancer and/or cancer metastasis in a subject in need thereof. Preferably, the agent of the invention modulates the expression and/or activity by inhibiting the translation of said one or more mRNA encoding a KZFP.

As used herein, a messenger RNA (mRNA) refers to a single-stranded RNA molecule that is complementary to one of the DNA strands of a gene, e.g. a gene encoding a KZFP. The mRNA is an RNA version of the gene that leaves the cell nucleus and moves to the cytoplasm where proteins are made.

Since a single gene often has more than one transcript, one or ordinary skill in the art will appreciate that the term mRNA also encompasses fragments and variants of an mRNA of the invention. Non-limiting examples of mRNA variants comprise precursor mRNA and splicing variants. Alternative splicing is a widely used mechanism for the formation of isoforms or splicing variants. In this process, which occurs during gene expression, the exons of a gene may be included or excluded in the processed mRNA.

Preferably, the at least one transcript encoding a KZFP of the invention is selected from the group comprising cDNA sequences: SEQ ID No. 1, SEQ ID No. 3, SEQ ID No. 5, SEQ ID No. 7, SEQ ID No. 9, SEQ ID No. 11, SEQ ID No. 13, and SEQ ID No. 15, a fragment or a variant thereof or a combination of one or more thereof. Non-limiting examples of a combination of one or more KZFP transcripts comprise a combination of i) a transcript encoding ZNF586, or a part thereof, and a transcript, or a part thereof, encoding ZNF587, ii) a transcript, or a part thereof, encoding ZNF417 and a transcript, or a part thereof, encoding ZNF587, iii) a transcript, or a part thereof, encoding ZNF552, a transcript, or a part thereof, encoding ZNF587B, and a transcript, or a part thereof, encoding ZNF814, and iv) a transcript, or a part thereof, encoding ZNF814, a transcript, or a part thereof, encoding ZNF671, a transcript, or a part thereof, encoding , and a transcript, or a part thereof, encoding ZNF552, ZNF776, , as well as a fragment or a variant of any one of these combinations.

As used herein, the term “variant” refers to biologically active derivatives of a nucleic acid or peptide sequence. In general, the term “variant” refers to molecules having a native sequence and structure with one or more additions, substitutions (generally conservative in nature) and/or deletions (e.g. splice variants), relative to the native molecule, so long as the modifications do not destroy biological activity and which are “substantially homologous” to the reference molecule. In general, the sequences of such variants are functionally, i.e. biologically, active variants and will have a high degree of sequence homology to the reference sequence, e.g., sequence homology of more than 50%, generally more than 60%- 70%, even more particularly 80% or more, such as at least 90% or 95% or more, when the two sequences are aligned.

Alternatively, the term “variant” also refers to post-transcriptionally modified nucleic acid sequences of the invention, i.e. methylation, phosphorylation, etc...

As used herein, a “fragment” of one or more nucleic acid sequence or polypeptide sequence of the invention refers to a sequence containing less nucleotides or amino acids in length than the respective sequences of the invention while retaining the biological activity described herein. Preferably, this fragment contains, at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, or 90% of the entire length of the reference nucleic acid sequence or polypeptide sequence.

As used herein, “Homology” refers to the percent identity between two polynucleotide or two polypeptide sequences. Two nucleic acid, or two polypeptide sequences are “substantially homologous” to each other when the sequences exhibit at least about 50% sequence identity, preferably at least about 75% sequence identity, more preferably at least about 80% sequence identity, more preferably at least about 90% sequence identity, and most preferably at least about 95%-98% sequence identity over a defined length of the molecules. As used herein, substantially homologous also refers to sequences showing complete identity to the specified sequence.

In general, “identity” refers herein to an exact nucleotide to nucleotide or amino acid to amino acid correspondence of two polynucleotides or polypeptide sequences, respectively. Percent identity can be determined by a direct comparison of the sequence information between two molecules by aligning the sequences, counting the exact number of matches between the two aligned sequences, dividing by the length of the shorter sequence, and multiplying the result by 100. Sequence identity is typically measured using sequence analysis software (for example, Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis.53705, BLAST, BESTFIT, GAP, or PILEUP/PRETTYBOX programs). Such software matches identical or similar sequences by assigning degrees of homology to various substitutions, deletions, and/or other modifications.

Alternatively, homology can be determined by readily available computer programs or by hybridization of polynucleotides under conditions which form stable duplexes between homologous regions, followed by digestion with single stranded specific nuclease(s), and size determination of the digested fragments. DNA sequences that are substantially homologous can be identified in a Southern hybridization experiment under, for example, stringent conditions, as defined for that particular system. Defining appropriate hybridization conditions is within the skill of the art. The present invention further provides an agent modulating the expression and/or activity of one or more KZFP gene, for use in the treatment and/or prevention of cancer and/or cancer metastasis in a subject in need thereof.

Preferably, the agent of the invention modulates the expression and/or activity by inhibiting the transcription of one or more KZFP gene encoding a KZFP.

Also envisioned are combinations of one or more approaches or agents described herein for use in the treatment and/or prevention of cancer and/or cancer metastasis in a subject in need thereof.

Preferably, the agent of the invention is selected from the group comprising a nucleic acid, a chemical compound, a peptide or analog thereof, an antibody or an antigen-binding fragment thereof, and an antibody mimetic, or a combination of one or more thereof.

As used herein, a “chemical agent” is a compound that produces change by virtue of its chemical composition and its effects on living tissues and organisms. The chemical agent may be a small molecule inhibitor (SMI) and is preferably a non-peptidyl molecule inhibitor of KZFPs.

The chemical agents of the invention can be tested using a number of techniques known to those of skill in the art. For example, tagged KZFPs or cell lines transfected and expressing a KZFP coupled to a reporter gene such as Luciferase or green fluorescent protein may be used in assay screens for inhibitors of KZFPs. Computational zinc finger docking and biochemical analysis can be conducted in order to identify molecules able to bring ZNF proteins to the Cereblon complex for subsequent ubiquitination and proteasomal degradation Non-limiting examples of chemical agents modulating the expression and/or activity of one or more KZFP of the invention are selected among the non-limiting group comprising agents inducing ubiquitination and proteasomal degradation. Non-limiting examples of such agents comprise thalidomide and its analogues and derivatives (e.g. lenalidomide and pomalidomide).

The terms "polypeptide," "peptide" and "protein" are used interchangeably to refer to a polymer of amino acid residues. The term also applies to amino acid polymers in which one or more amino acids are chemical analogues or modified derivatives of corresponding naturally-occurring amino acids.

In case the agent of the invention is a peptide, then it will preferably be conjugated to an agent that increases the accumulation of the peptide in the cancer cell. Such an agent can be a compound which induces receptor mediated endocytosis such as for example the membrane transferrin receptor mediated endocytosis of transferrin conjugated to therapeutic drugs (Qian Z. M. et al., “Targeted drug delivery via the transferrin receptor-mediated endocytosis pathway” Pharmacological Reviews, 54, 561, 2002) or a cell membrane permeable carrier which can, be selected e. g. among the group of fatty acids such as decanoic acid, myristic acid and stearic acid, which have already been used for intracellular delivery of peptide inhibitors of protein kinase C (Ioannides C.G. et al., “Inhibition of IL-2 receptor induction and IL-2 production in the human leukemic cell line Jurkat by a novel peptide inhibitor of protein kinase C” Cell Immunol., 131, 242, 1990) and protein-tyrosine phosphatase (Kole H.K. et al., “A peptide-based protein-tyrosine phosphatase inhibitor specifically enhances insulin receptor function in intact cells” J. Biol. Chem. 271, 14302, 1996) or among peptides. Preferably, cell membrane permeable carriers are used, more preferably a cell membrane permeable carrier peptide is used.

In case the cell membrane permeable carrier is a peptide then it will preferably be a positively charged amino acid rich peptide. Preferably such positively charged amino acid rich peptide is an arginine rich peptide. It has been shown in Futaki et al. (Futaki S. et al., “Arginine-rich peptides. An abundant source of membrane-permeable peptides having potential as carriers for intracellular protein delivery” J. Biol. Chem., 276, 5836, 2001), that the number of arginine residues in a cell membrane permeable carrier peptide has a significant influence on the method of internalization and that there seems to be an optimal number of arginine residues for the internalization, preferably they contain more than 6 arginines, more preferably they contain 9 arginines (R9).

The peptide may be conjugated to the cell membrane permeable carrier by a spacer (e.g. two glycine residues). In this case, the cell membrane permeable carrier is preferably a peptide. Usually, arginine rich peptides are selected from the group comprising the HIV-TAT 48-57 peptide, the FHV-coat 35-49 peptide, the HTLV-II Rex 4-16 peptide and the BMV gag 7-25 peptide. Preferably, the arginine rich peptide is HIV-TAT 48-57 peptide.

Since an inherent problem with native peptides (in L-form) is degradation by natural proteases, the peptide, as well as the cell membrane permeable peptide, of the invention may be prepared to include D-forms and/or "retro-inverso isomers" of the peptide. In this case, retro-inverso isomers of fragments and variants of the peptide, as well as of the cell membrane permeable peptide, of the invention are prepared according to techniques known in the art.

The agent of the invention may also be a "nucleic acid". The terms "nucleic acid," "polynucleotide," and "oligonucleotide" are used interchangeably and refer to a deoxyribonucleotide or ribonucleotide polymer, in linear or circular conformation, and in either single- or double-stranded form. For the purposes of the present disclosure, these terms are not to be construed as limiting with respect to the length of a polymer. The terms can encompass known analogues of natural nucleotides, as well as nucleotides that are modified in the base, sugar and/or phosphate moieties (e.g., phosphorothioate backbones). In general, an analogue of a particular nucleotide has the same base-pairing specificity; i.e., an analogue of A will base-pair with T.

Examples of nucleic acid agents of the invention comprise those selected among the group comprising an acid nucleic encoding an miRNA, an siRNA, a piRNA, an hnRNA, an snRNA, an sg RNA, an esiRNA, an shRNA, and an antisense oligonucleotide (e.g. modified ASO), or a combination thereof.

The terms “microRNA,” “miRNA,” and “MiR” are interchangeable and refer to endogenous or artificial non-coding RNAs that are capable of regulating gene expression. It is believed that miRNAs function via RNA interference.

The terms “siRNA” and “short interfering RNA” are interchangeable and refer to single- stranded or double-stranded RNA molecules that are capable of inducing RNA interference. SiRNA molecules typically have a duplex region that is between 18 and 30 base pairs in length.

The terms “piRNA” and “Piwi-interacting RNA” are interchangeable and refer to a class of small RNAs involved in gene silencing. PiRNA molecules typically are between about 26 and about 31 nucleotides in length.

Examples of antisense oligonucleotides (ASOs) include the GapmeRs. As used herein, a GapmeR is a chimeric antisense oligonucleotide that contains a central block of deoxynucleotide monomers sufficiently long to induce RNase H cleavage. Usually, the GapmeRs of the invention are directed against one or more mRNA encoding a KZFP of the invention. Modified ASOs such as modified GapmeRs are also encompassed in the present invention and comprise, e.g. GapmeRs with fixed chemical modification architectures selected from the group comprising i) a gapmer with five 2' '-O-m ethoxy ethyl (MOE) modifications in each flank, and a central gap of 10 unmodified dans (e.g. 5-10-5 MOE design), and ii) a gapmer employing three or four locked nucleic acid (LNA) modifications in each flank (e.g. 3-10-3 or 4-8-4 LNA designs), as well as a combination of one or more thereof.

The terms “sgRNA” and “guideRNA” are interchangeable and refer to a specific RNA sequence that recognizes the target DNA region of interest and directs the endonuclease there for editing. The gRNA is usually made up of two parts: crispr RNA (crRNA), a 17-20 nucleotide sequence complementary to the target DNA, and a tracr RNA, which serves as a binding scaffold for the Cas nuclease.

Any suitable engineered sgRNA, or crRNA and tracrRNA, can be employed as long as it is effective for recognizing a target DNA of the invention. The design of such sgRNA, or crRNA and tracrRNA, is within the skill of ordinary artisans.

The terms “shRNA” as used herein refers to a nucleic acid molecule comprising at least two complementary portions hybridized or capable of specifically hybridizing to form a duplex structure sufficiently long to mediate RNAi (typically between about 15 to about 29 nucleotides in length), and at least one single-stranded portion, typically between approximately 1 and about 10 nucleotides in length that forms a loop connecting the ends of the two sequences that form the duplex. Exemplary shRNA of the invention may, e.g. be selected from the non-limiting group comprising those listed in table 2 (represented by their DNA sequence) that were designed to be uniquely and selectively recognizing a target sequence within an mRNA encoding a KZFP, or a combination of KZFP, of the invention. Selecting a suitable shRNA is well within the competences of one of ordinary skill in the art using routine experimentation, several commercial and noncommercial web sites available for shRNA design as well as the information are provided herein.

Non-limiting examples of target sequences within an mRNA encoding a KZFP, or a combination of KZFP, of the invention are also given in Table 2. One of ordinary skill in the art using routine experimentation, several available commercial and noncommercial web sites for determining said target sequences as well as relevant information are provided herein. Table 2

The terms “snRNA” and “small nuclear RNA” are interchangeable and refer to a class of small RNAs involved in a variety of processes including RNA splicing and regulation of transcription factors. The subclass of small nucleolar RNAs (snoRNAs) is also included. The term is also intended to include artificial snRNAs, such as antisense derivatives of snRNAs.

In particular, the invention therefore provides isolated siRNA comprising short double- stranded RNA from about 18 to about 30 nucleotides in length, that are targeted to the mRNA encoding a KZFP of the invention.

The term “isolated” means altered or removed from the natural state through human intervention. For example, an siRNA naturally present in a living animal is not “isolated,” but a synthetic siRNA, or an siRNA partially or completely separated from the coexisting materials of its natural state is “isolated.” An isolated siRNA can exist in substantially purified form, or can exist in a non-native environment such as, for example, a cell into which the siRNA has been delivered. The siRNAs of the invention can comprise partially purified RNA, substantially pure RNA, synthetic RNA, or recombinantly produced RNA, as well as altered RNA that differs from naturally occurring RNA by the addition, deletion, substitution and/or alteration of one or more nucleotides. Such alterations can include addition of non-nucleotide material, such as to the end(s) of the siRNA or to one or more internal nucleotides of the siRNA, including modifications that make the siRNA resistant to nuclease digestion.

One or both strands of the siRNA of the invention can also comprise a 3' overhang. A “3' overhang” refers to at least one unpaired nucleotide extending from the 3 '-end of an RNA strand. Thus, in one aspect, the siRNA of the invention comprises at least one 3' overhang of from one to about six nucleotides (which includes ribonucleotides or deoxynucleotides) in length, preferably from one to about five nucleotides in length, more preferably from one to about four nucleotides in length, and particularly preferably from about one to about two nucleotides in length.

In the case both strands of the siRNA molecule comprise a 3' overhang, the length of the overhangs can be the same or different for each strand. In a most preferred embodiment, the 3' overhang is present on both strands of the siRNA, and is two nucleotides in length. In order to enhance the stability of the present siRNAs, the 3' overhangs can also be stabilized against degradation. In one embodiment, the overhangs are stabilized by including purine nucleotides, such as adenosine or guanosine nucleotides.

Alternatively, substitution of pyrimidine nucleotides by modified analogues, e.g., substitution of uridine nucleotides in the 3' overhangs with 2'-deoxythymidine, is tolerated and does not affect the efficiency of RNAi degradation. In particular, the absence of a 2' hydroxyl in the 2'-deoxythymidine significantly enhances the nuclease resistance of the 3' overhang in tissue culture medium.

The siRNAs of the invention can be targeted to any stretch of approximately about 18-30, preferably about 19-25 contiguous nucleotides in any of the target mRNA sequences (including the mRNA encoding a KZFP of the invention). Techniques for selecting target sequences for siRNA are well known in the art. Thus, the sense strand of the present siRNA comprises a nucleotide sequence identical to any contiguous stretch of about 18 to about 30 nucleotides in the target mRNA.

The siRNAs of the invention can be obtained using a number of techniques known to those of skill in the art. For example, the siRNAs can be chemically synthesized or recombinantly produced using methods known in the art. Preferably, the siRNA of the invention are chemically synthesized using appropriately protected ribonucleoside phosphoramidites and a conventional DNA/RNA synthesizer. The siRNA can be synthesized as two separate, complementary RNA molecules, or as a single RNA molecule with two complementary regions. Commercial suppliers of synthetic RNA molecules or synthesis reagents include Proligo (Hamburg, Germany), Dharmacon Research (Lafayette, Colo., USA), Pierce Chemical (part of Perbio Science, Rockford, Ill., USA), Glen Research (Sterling, Va., USA), Chem Genes (Ashland, Mass., USA), Qiagen (Hilden, Germany) and Cruachem (Glasgow, UK).

Alternatively, siRNA can also be expressed from recombinant circular or linear DNA plasmids using any suitable promoter. Suitable promoters for expressing siRNA of the invention from a plasmid include, for example, the U6 or HI RNA pol III promoter sequences and the cytomegalovirus promoter. Selection of other suitable promoters is within the skill in the art. The recombinant plasmids of the invention can also comprise inducible or regulatable promoters for expression of the siRNA in a particular tissue or in a particular intracellular environment. The siRNA expressed from recombinant plasmids either can be isolated from cultured cell expression systems by standard techniques or can be expressed intracellularly.

The siRNAs of the invention can also be expressed from recombinant viral vectors intracellularly in cells. The recombinant viral vectors comprise sequences encoding the siRNAs of the invention and any suitable promoter for expressing the siRNA sequences. Suitable promoters include, for example, the U6 or HI RNA pol III promoter sequences and the cytomegalovirus promoter. Selection of other suitable promoters is within the skill in the art. The recombinant viral vectors of the invention can also comprise inducible or regulatable promoters for expression of the siRNA in a particular tissue. Any RNA-targeting oligonucleotide can be considered as ASOs, although they may act through different mechanisms. Specific and strong AON recognition and binding to the mRNA target is accomplished through Watson-Crick base pairing, which may or may not be augmented by chemical modifications to the AON internucleotide phosphate linkages, backbone sugars, or nucleobases. The majority of ASOs can be divided into one of two groups: those that direct cleavage of the target mRNA, and those that alter mRNA translation without causing mRNA cleavage.

The present invention also contemplates a gene delivery vector, preferably in the form of a plasmid (circular or linear plasmid) or a vector, that comprises one or more nucleic acid(s) encoding an agent modulating the expression and/or activity of one or more KZFP of the invention.

Preferably, said one or more nucleic acid(s) encodes the siRNA, miRNA , piRNA, hnRNA, snRNA, sg RNA, CRISPR-based loss-of-function system, esiRNA, shRNA, and antisense oligonucleotide, or combination of one or more thereof, of the invention. Most preferably, the nucleic acid including an antisense oligonucleotide is selected from the group comprising an antisense oligonucleotide (ASO) and a modified ASO.

As used herein, a "vector" or a " gene delivery vector" is capable of transferring nucleic acid sequences to target cells (e.g., viral vectors, non-viral vectors, particulate carriers, and liposomes).

Suitable vectors include derivatives of SV40 and known bacterial plasmids, e. g., E. coli plasmids col El, pCRl, pBR322, pMB9 and their derivatives, plasmids such as RP4; phage DNAs, e. g., the numerous derivatives of phage X, e. g., NM989, and other phage DNA, e. g., Ml 3 and filamentous single stranded phage DNA; yeast plasmids such as the 2m plasmid or derivatives thereof; vectors useful in eukaryotic cells, such as vectors useful in insect or mammalian cells; vectors derived from combinations of plasmids and phage DNAs, such as plasmids that have been modified to employ phage DNA or other expression control sequences; and the like (for a review, Sung, Y., Kim, S. Recent advances in the development of gene delivery systems. Biomater Res 23, 8 (2019)). Various viral vectors are used for delivering nucleic acids to cells in vitro or in vivo. Non limiting examples are vectors based on Herpes Viruses, Pox- viruses, Adeno-associated virus, Lentivirus, and others. In principle, all of them are suited to deliver the expression cassette comprising an expressible nucleic acid molecule that codes for an agent modulating the expression and/or activity of one or more KZFP of the invention. In a preferred aspect, said viral vector is an adenoviral vector, preferably a replication competent adenovirus.

The present invention also contemplates a host cell comprising i) a plasmid or vector of the invention, or ii) one or more nucleic acid(s) encoding the siRNA, miRNA , piRNA, hnRNA, snRNA, sg RNA, CRISPR-based loss-of-function system, esiRNA, shRNA, and antisense oligonucleotide, or combination of one or more thereof, of the invention.

The gene delivery vector (e.g. plasmid or vector) comprising one or more nucleic acid(s) the siRNA, miRNA , piRNA, hnRNA, snRNA, sg RNA, CRISPR-based loss-of-function system, esiRNA, shRNA, and antisense oligonucleotide, or combination of one or more thereof, of the invention, or the one or more nucleic acid(s) of the invention, can be introduced to host cell via one or more methods known in the art. These one or more methods include, without limitation, microinjection, electroporation, calcium phosphate-mediated transfection, cationic transfection, liposome transfection, dendrimer transfection, heat shock transfection, nucleofection transfection, magnetofection, lipofection, optical transfection, proprietary agent-enhanced uptake of nucleic acids, and delivery via liposomes, immunoliposomes, virosomes, or artificial virions.

It will be appreciated that in the present method the modification following the introduction of the gene delivery vector (plasmid or vector), or the one or more nucleic acid(s) of the invention, to the host cell may occur ex vivo or in vitro, for instance in a cell culture and in some instances not in vivo. In other aspects, it may occur in vivo.

The present invention further contemplates one or more pharmaceutical compositions.

In one aspect, the pharmaceutical composition comprises: a therapeutically effective amount of an agent modulating the expression and/or activity of one or more i) KZFP, ii) mRNA encoding a KZFP and/or iii) KZFP gene, of the invention, or a plasmid or a vector of the invention, or a host cell of the invention, and a pharmaceutically acceptable carrier or diluent.

In one aspect, the pharmaceutical composition of the invention is for use in the treatment and/or prevention of cancer and/or cancer metastasis.

The term "therapeutically effective amount" as used herein means an amount of an agent of the invention high enough to significantly positively modify the symptoms and/or condition to be treated, but low enough to avoid serious side effects (at a reasonable risk/benefit ratio), within the scope of sound medical judgment. The therapeutically effective amount of the agent of the invention is selected in accordance with a variety of factors including type, species, age, weight, sex and medical condition of the patient; the severity of the condition to be treated; the route of administration; the renal and hepatic function of the patient. A physician of ordinary skill in the art can readily determine and prescribe the effective amount of the agent required to prevent, counter or arrest the progress of the cancer.

“Pharmaceutically acceptable carrier or diluent” means a carrier or diluent that is useful in preparing a pharmaceutical composition that is generally safe, non-toxic, and desirable, and includes carriers or diluents that are acceptable for human pharmaceutical use.

Such pharmaceutically acceptable carriers can be sterile liquids, such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the like. Water is a preferred carrier when the pharmaceutical composition is administered intravenously. Saline solutions and aqueous dextrose and glycerol solutions can also be employed as liquid carriers, particularly for injectable solutions.

Pharmaceutically acceptable excipients include starch, glucose, lactose, sucrose, sodium stearate, glycerol monostearate, talc, sodium chloride, dried skim milk, glycerol, propylene glycol, water, ethanol and the like.

The pharmaceutical compositions may further contain one or more pharmaceutically acceptable salts such as, for example, a mineral acid salt such as a hydrochloride, a hydrobromide, a phosphate, a sulfate, etc.; and the salts of organic acids such as acetates, propionates, malonates, benzoates, etc. Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering substances, gels or gelling materials, flavorings, colorants, microspheres, polymers, suspension agents, etc. may also be present herein. In addition, one or more other conventional pharmaceutical ingredients, such as preservatives, humectants, suspending agents, surfactants, antioxidants, anticaking agents, fillers, chelating agents, coating agents, chemical stabilizers, etc. may also be present, especially if the dosage form is a reconstitutable form. Suitable exemplary ingredients include macrocrystalline cellulose, carboxymethyf cellulose sodium, polysorbate 80, phenyletbyl alcohol, chiorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens, ethyl vanillin, glycerin, phenol, parachlorophenol, gelatin, albumin and a combination thereof. A thorough discussion of pharmaceutically acceptable excipients is available in REMINGTON'S PHARMACEUTICAL SCIENCES (Mack Pub. Co., N.J. 1991) which is incorporated by reference herein.

The pharmaceutical composition can also further comprise, or comprise the administration of, at least one additional anticancer agent or therapy selected from the group comprising radiotherapy, chemotherapy, immunotherapy and hormone therapy, or a combination of one of more thereof.

A chemotherapy of the present invention can concern as well agents that damage DNA and / or prevent cells from multiplying, such as genotoxins.

Genotoxins can be selected from the group comprising alkylating agents, antimetabolites, DNA cutters, DNA binders, topoisomerase poisons and spindle poisons. Examples of alkylating agents are lomustine, carmustine, streptozocin, mechlorethamine, melphalan, uracil nitrogen mustard, chlorambucil, cyclosphamide, iphosphamide, cisplatin, carboplatin, mitomycin, thiotepa, dacarbazin, procarbazine, hexamefhyl melamine, triethylene melamine, busulfan, pipobroman, mitotane and other platine derivatives or a combination of one of more thereof.

An example of DNA cutters is bleomycin.

Topoisomerases poisons can be selected from the group comprising topotecan, irinotecan, camptothecin sodium salt, daorubicin, doxorubicin, idarubicin, mitoxantrone teniposide, adriamycin and etoposide or a combination of one of more thereof.

Examples of DNA binders are dactinomycin and mithramycin whereas spindle poisons can be selected among the group comprising vinblastin, vincristin, navelbin, paclitaxel and docetaxel or a combination of one of more thereof.

A chemotherapy of the present invention can concern as well antimetabolites selected among the following non-limiting list of compounds: methotrexate, trimetrexate, pentostatin, cytarabin, ara-CMP, fludarabine phosphate, hydroxyurea, fluorouracyl, fioxuridine, chlorodeoxyadenosine, gemcitabine, thioguanine and 6-mercaptopurine or a combination of one of more thereof. Radiotherapy refers to the use of high-energy radiation to shrink tumors and kill cancer cells. Examples of radiation therapy include, without limitation, external radiation therapy and internal radiation therapy (also called brachytherapy) or a combination of one of more thereof. External radiation therapy is most common and typically involves directing a beam of direct or indirect ionizing radiation to a tumor or cancer site. While the beams of radiation, the photons, the Cobalt or the particule therapy are focused to the tumor or cancer site, it is nearly impossible to avoid exposure of normal, healthy tissue. Energy source for external radiation therapy is selected from the group comprising direct or indirect ionizing radiation (for example: x-rays, gamma rays and particle beams or combination thereof).

Internal radiation therapy involves implanting a radiation-emitting source, such as beads, wires, pellets, capsules, etc., inside the body, at, or near to the tumor site. Energy source for internal radiation therapy is selected from the group of radioactive isotopes comprising: iodine (iodinel25 or iodinel31), strontium89, radioisotopes of phosphorous, palladium, cesium, indium, phosphate, or cobalt, and combination thereof. Such implants can be removed following treatment, or left in the body inactive. Types of internal radiation therapy include, but are not limited to, interstitial, and intracavity brachytherapy (high dose rate, low dose rate, pulsed dose rate).

Another internal radiation therapy involves biological carriers of radioisotopes, such as with radio-immunotherapy wherein tumor-specific antibodies bound to radioactive material are administered to a patient. The antibodies bind tumor antigens, thereby effectively administering a dose of radiation to the relevant tissue.

Methods of administering radiation therapy are well known to those of skill in the art.

Immunotherapy refers to the use of monoclonal antibodies, immune checkpoint inhibitors, cancer vaccines, cytokines, immunomodulators and CAR-T cell or adoptive cell therapies. In a preferred aspect, the immunotherapy is selected from the group comprising monoclonal antibodies (e.g. targeting the protein CD20 such as Rituximab), immune checkpoint inhibitors or a combination of one or more thereof. Examples of immune checkpoint proteins targeted by inhibitors include PD-1/PD-L1 and CTLA-4/B7-1/B7-2.

The present invention further contemplates methods of treatment of cancer, and/or cancer metastasis.

In one aspect, the method of treatment of cancer, and/or cancer metastasis comprises: (a) providing an agent of the invention and (b) administering said agent to a subject in need thereof.

Preferably, said agent is in the form of a pharmaceutical composition comprising a therapeutically effective amount of said agent as described herein.

In another aspect, the method comprises (a) modifying a host cell as described herein, and (b) reintroducing the host cell (e.g. single cell or population of cells), into the patient in need thereof, i.e. suffering from cancer and/or cancer metastasis. Preferably, said host cell is in the form of a pharmaceutical composition comprising a therapeutically effective amount of said host cell as described herein.

Administration of the agents and/or pharmaceutical compositions described herein may be accomplished by any acceptable method which allows the agents modulating the expression and/or activity of one or more i) KRAB -containing zinc finger protein (KZFP), ii) mRNA encoding a KZFP, and/or iii) KZFP gene to reach its target. The particular mode selected will depend of course, upon factors such as the particular formulation, the severity of the state of the subject being treated, and the dosage required for therapeutic efficacy. The actual effective amounts of agent (the "drug") can vary according to the specific drug or combination thereof being utilized, the particular composition formulated, the mode of administration, and the age, weight, condition of the patient, and severity of the symptoms or condition being treated.

Any acceptable method known to one of ordinary skill in the art may be used to administer the agents and/or pharmaceutical compositions to the subject. The administration may be localized (i.e., to a particular region, physiological system, tissue, organ, or cell type) or systemic, depending on the condition being treated.

Injections can be e.g., intravenous, intradermal, subcutaneous, intramuscular, or intraperitoneal. The agents and/or pharmaceutical compositions can be injected intradermally for treatment cancer, for example.

The injections can be given at multiple locations. Implantation includes inserting implantable drug delivery systems, e.g., microspheres, hydrogels, polymeric reservoirs, cholesterol matrixes, polymeric systems, e.g., matrix erosion and/or diffusion systems and non-polymeric systems, e.g., compressed, fused, or partially-fused pellets. Inhalation includes administering the composition with an aerosol in an inhaler, either alone or attached to a carrier that can be absorbed. For systemic administration, it may be preferred that the agents and/or pharmaceutical compositions are encapsulated in liposomes.

Preferably, the agents and/or pharmaceutical compositions delivery systems are provided in a manner which enables tissue-specific uptake of the agents and/or pharmaceutical compositions delivery systems. Techniques include using tissue or organ localizing devices, such as wound dressings or transdermal delivery systems, using invasive devices such as vascular or urinary catheters, and using interventional devices such as stents having drug delivery capability and configured as expansive devices or stent grafts.

The agents and/or pharmaceutical compositions may be delivered using a bio-erodible implant by way of diffusion or by degradation of the polymeric matrix.

The administration of the agents and/or pharmaceutical compositions may be designed so as to result in sequential exposures to the agents and/or pharmaceutical compositions over a certain time period, for example, hours, days, weeks, months or years. This may be accomplished, for example, by repeated administrations of a formulation or by a sustained or controlled release delivery system in which the agents and/or pharmaceutical compositions is/are delivered over a prolonged period without repeated administrations. Administration of the formulations using such a delivery system may be, for example, by oral dosage forms, bolus injections, transdermal patches or subcutaneous implants. Maintaining a substantially constant concentration of the composition may be preferred in some cases.

Other delivery systems suitable include, but are not limited to, time-release, delayed release, sustained release, or controlled release delivery systems. Such systems may avoid repeated administrations in many cases, increasing convenience to the subject and the physician. Many types of release delivery systems are available and known to those of ordinary skill in the art. They include, for example, polymer-based systems such as polylactic and/or polyglycolic acids, polyanhydrides, polycaprolactones, copolyoxalates, polyesteramides, polyorthoesters, polyhydroxybutyric acid, and/or combinations of these. Microcapsules of the foregoing polymers containing nucleic acids are described in, for example, U.S. Patent No. 5,075,109. Other examples include nonpolymer systems that are lipid-based including sterols such as cholesterol, cholesterol esters, and fatty acids or neutral fats such as mono-, di- and triglycerides; hydrogel release systems; liposome-based systems; phospholipid based- systems; silastic systems; peptide based systems; wax coatings; compressed tablets using conventional binders and excipients; or partially fused implants. Specific examples include, but are not limited to, erosional systems in which the agent and/or pharmaceutical composition contained in a formulation within a matrix (for example, as described in U.S. Patent Nos. 4,452,775, 4,675,189, 5,736,152, 4,667,013, 4,748,034 and 5,239,660), or diffusional systems in which an active component controls the release rate (for example, as described in U.S. Patent Nos. 3,832,253, 3,854,480, 5,133,974 and 5,407,686). The formulation may be as, for example, microspheres, hydrogels, polymeric reservoirs, cholesterol matrices, or polymeric systems. The system may allow sustained or controlled release of the composition to occur, for example, through control of the diffusion or erosion/degradation rate of the formulation containing the agent and/or pharmaceutical composition. In addition, a pump-based hardware delivery system may be used for delivery.

Also envisioned is a tumor-specific delivery of an agent and/or pharmaceutical composition - coupled super-paramagnetic iron oxide nanoparticles as known in the art.

Further contemplated is a method of diagnosing cancer and/or cancer metastasis in a subject comprising:

(a) detecting, directly or indirectly, and measuring the level of transcription and/or expression and/or activity of one or more KZFP in a sample obtained from said subject;

(b) comparing the level of transcription and/or expression and/or activity of said one or more KZFP to a control biological sample for the same one or more KZFP; wherein a differential transcription and/or expression and/or activity of one or more KZFP in said biological sample, relative to the level of corresponding said one or more KZFP in a control biological sample of a cancer-free subject, is indicative of the subject having cancer and/or cancer metastasis.

It is understood that the detection and measurement of the level of transcription and/or expression and/or activity of one or more KZFP in a sample obtained can be direct or indirect. For example, the abundance levels of mRNAs can be directly quantitated. Alternatively, the amount of one or more KZFP can be determined indirectly by measuring abundance levels of cDNAs, amplified RNAs or DNAs, or by measuring quantities or activities of RNAs, or other molecules that are indicative of the expression level of the one or more KZFP. Preferably, the detection and measurement of the level of transcription and/or expression and/or activity of one or more KZFP is determined indirectly by measuring abundance levels of cDNAs.

Transcripts (such as mRNAs), amplified RNAs or DNAs (such as cDNAs) can be detected and quantitated by a variety of methods including, but not limited to, microarray analysis, polymerase chain reaction (PCR), reverse transcriptase polymerase chain reaction (RT-PCR), Northern blot, serial analysis of gene expression (SAGE), immunoassay, and mass spectrometry, any sequencing-based methods known in the art.

In one aspect, microarrays are used to measure the levels of one or more KZFP of the invention. An advantage of microarray analysis is that the expression of each of the one or more KZFP of the invention can be measured simultaneously, and microarrays can be specifically designed to provide a diagnostic expression profile for a particular disease or condition (e.g., cancer).

Microarrays are prepared by selecting probes which comprise a polynucleotide sequence, and then immobilizing such probes to a solid support or surface. For example, the probes may comprise DNA sequences, RNA sequences, or copolymer sequences of DNA and RNA. The polynucleotide sequences of the probes may also comprise DNA and/or RNA analogues, or combinations thereof. For example, the polynucleotide sequences of the probes may be full or partial fragments of genomic DNA. The polynucleotide sequences of the probes may also be synthesized nucleotide sequences, such as synthetic oligonucleotide sequences. The probe sequences can be synthesized either enzymatically in vivo, enzymatically in vitro (e.g., by PCR), or non-enzymatically in vitro.

Probes used in the methods of the invention are preferably immobilized to a solid support which may be either porous or non-porous. For example, the probes may be polynucleotide sequences which are attached to a nitrocellulose or nylon membrane or filter covalently at either the 3' or the 5' end of the polynucleotide. Such hybridization probes are well known in the art (see, e.g., Sambrook, et ah, Molecular Cloning: A Laboratory Manual (3rd Edition, 2001). Alternatively, the solid support or surface may be a glass or plastic surface. In one embodiment, hybridization levels are measured to microarrays of probes consisting of a solid phase on the surface of which are immobilized a population of polynucleotides, such as a population of DNA or DNA mimics, or, alternatively, a population of RNA or RNA mimics. The solid phase may be a nonporous or, optionally, a porous material such as a gel.

In one embodiment, the microarray comprises a support or surface with an ordered array of binding (e.g., hybridization) sites or “probes” each representing one of the biomarkers described herein. Preferably the microarrays are addressable arrays, and more preferably positionally addressable arrays. More specifically, each probe of the array is preferably located at a known, predetermined position on the solid support such that the identity (i.e., the sequence) of each probe can be determined from its position in the array (i.e., on the support or surface). Each probe is preferably covalently attached to the solid support at a single site. Microarrays can be made in a number of ways, of which several are described below. However they are produced, microarrays share certain characteristics. The arrays are reproducible, allowing multiple copies of a given array to be produced and easily compared with each other. Preferably, microarrays are made from materials that are stable under binding (e.g., nucleic acid hybridization) conditions. Microarrays are generally small, e.g., between 1 cm2 and 25 cm2; however, larger arrays may also be used, e.g., in screening arrays. Preferably, a given binding site or unique set of binding sites in the microarray will specifically bind (e.g., hybridize) to the product of a single gene in a cell (e.g., to a specific mRNA, or to a specific cDNA derived therefrom). However, in general, other related or similar sequences will cross hybridize to a given binding site.

As noted above, the “probe” to which a particular polynucleotide molecule specifically hybridizes contains a complementary polynucleotide sequence. The probes of the microarray typically consist of nucleotide sequences of no more than 1,000 nucleotides. In some embodiments, the probes of the array consist of nucleotide sequences of 10 to 1,000 nucleotides. In one aspect, the nucleotide sequences of the probes are in the range of 10-200 nucleotides in length and are genomic sequences of one species of organism, such that a plurality of different probes is present, with sequences complementary and thus capable of hybridizing to the genome of such a species of organism, sequentially tiled across all or a portion of the genome. In other aspects, the probes are in the range of 10-30 nucleotides in length, in the range of 10-40 nucleotides in length, in the range of 20-50 nucleotides in length, in the range of 40-80 nucleotides in length, in the range of 50-150 nucleotides in length, in the range of 80-120 nucleotides in length, or are 60 nucleotides in length. The probes may comprise DNA or DNA “mimics” (e.g., derivatives and analogues) corresponding to a portion of an organism's genome. In another aspect, the probes of the microarray are complementary RNA or RNA mimics. DNA mimics are polymers composed of subunits capable of specific, Watson-Crick-like hybridization with DNA, or of specific hybridization with RNA. The nucleic acids can be modified at the base moiety, at the sugar moiety, or at the phosphate backbone (e.g., phosphorothioates). DNA can be obtained, e.g., by polymerase chain reaction (PCR) amplification of genomic DNA or cloned sequences. PCR primers are preferably chosen based on a known sequence of the genome that will result in amplification of specific fragments of genomic DNA.

Computer programs that are well known in the art are useful in the design of primers with the required specificity and optimal amplification properties, such as Oligo version 5.0 (National Biosciences). Typically, each probe on the microarray will be between 10 bases and 50,000 bases, usually between 300 bases and 1,000 bases in length. PCR methods are well known in the art, and are described, for example, in Innis et ah, eds., PCR Protocols: A Guide To Methods And Applications, Academic Press Inc., San Diego, Calif. (1990); herein incorporated by reference in its entirety. It will be apparent to one skilled in the art that controlled robotic systems are useful for isolating and amplifying nucleic acids.

An alternative, preferred means for generating polynucleotide probes is by synthesis of synthetic polynucleotides or oligonucleotides, e.g., using N-phosphonate or phosphoramidite chemistries (Froehler et ah, Nucleic Acid Res. 14:5399-5407 (1986); McBride et ah, Tetrahedron Lett. 24:246-248 (1983)). Synthetic sequences are typically between about 10 and about 500 bases in length, more typically between about 20 and about 100 bases, and most preferably between about 40 and about 70 bases in length. In some embodiments, synthetic nucleic acids include non-natural bases, such as, but by no means limited to, inosine. As noted above, nucleic acid analogues may be used as binding sites for hybridization. An example of a suitable nucleic acid analogue is peptide nucleic acid (see, e.g., U.S. Pat. No. 5,539,083).

Probes are preferably selected using an algorithm that takes into account binding energies, base composition, sequence complexity, cross-hybridization binding energies, and secondary structure.

A skilled artisan will also appreciate that positive control probes, e.g., probes known to be complementary and hybridizable to sequences in the target polynucleotide molecules, and negative control probes, e.g., probes known to not be complementary and hybridizable to sequences in the target polynucleotide molecules, should be included on the array. In one embodiment, positive controls are synthesized along the perimeter of the array. In another embodiment, positive controls are synthesized in diagonal stripes across the array. In still another embodiment, the reverse complement for each probe is synthesized next to the position of the probe to serve as a negative control. In yet another embodiment, sequences from other species of organism are used as negative controls or as “spike-in” controls.

The probes are attached to a solid support or surface, which may be made, e.g., from glass, plastic (e.g., polypropylene, nylon), polyacrylamide, nitrocellulose, gel, or other porous or nonporous material. One method for attaching nucleic acids to a surface is by printing on glass plates, as known in the art. This method is especially useful for preparing microarrays of cDNA. A second method for making microarrays produces high-density oligonucleotide arrays. Techniques are known for producing arrays containing thousands of oligonucleotides complementary to defined sequences, at defined locations on a surface using photolithographic techniques for synthesis in situ (see, U.S. Pat. Nos. 5,578,832; 5,556,752; and 5,510,270; herein incorporated by reference in their entireties) or other methods for rapid synthesis and deposition of defined oligonucleotides. When these methods are used, oligonucleotides (e.g., 60-mers) of known sequence are synthesized directly on a surface such as a derivatized glass slide. Usually, the array produced is redundant, with several oligonucleotide molecules per RNA.

Other methods for making microarrays, e.g., by masking, may also be used. In principle, any type of array known in the art, for example, dot blots on a nylon hybridization membrane could be used. However, as will be recognized by those skilled in the art, very small arrays will frequently be preferred because hybridization volumes will be smaller.

Microarrays can also be manufactured by means of an inkjet printing device for oligonucleotide synthesis, e.g., using the methods and systems described by Blanchard in U.S. Pat. No. 6,028,189. Specifically, the oligonucleotide probes in such microarrays are synthesized in arrays, e.g., on a glass slide, by serially depositing individual nucleotide bases in “microdroplets” of a high surface tension solvent such as propylene carbonate. The microdroplets have small volumes (e.g., 100 pL or less, more preferably 50 pL or less) and are separated from each other on the microarray (e.g., by hydrophobic domains) to form circular surface tension wells which define the locations of the array elements (i.e., the different probes). Microarrays manufactured by this inkjet method are typically of high density, preferably having a density of at least about 2,500 different probes per 1 cm2. The polynucleotide probes are attached to the support covalently at either the 3' or the 5' end of the polynucleotide. KZFP polynucleotides which may be measured by microarray analysis can be expressed mRNAs or a nucleic acid derived therefrom (e.g., cDNA or amplified RNA derived from cDNA that incorporates an RNA polymerase promoter), including naturally occurring nucleic acid molecules, as well as synthetic nucleic acid molecules. In one embodiment, the target polynucleotide molecules comprise RNA, including, but by no means limited to, total cellular RNA, poly(A)+ messenger RNA (mRNA) or a fraction thereof, cytoplasmic mRNA, or RNA transcribed from cDNA (i.e., cRNA; see, e.g., U.S. Pat. No. 5,545,522, 5,891,636, or 5,716,785). Methods for preparing total and poly(A)+ RNA are well known in the art, and are described generally, e.g., in Sambrook, et ah, Molecular Cloning: A Laboratory Manual (3rd Edition, 2001). RNA can be extracted from a cell of interest using guanidinium thiocyanate lysis followed by CsCl centrifugation, a silica gel-based column (e.g., RNeasy (Qiagen, Valencia, Calif.) or StrataPrep (Stratagene, La Jolla, Calif.)), or using phenol and chloroform, as known in the art. Poly(A)+ RNA can be selected, e.g., by selection with oligo-dT cellulose or, alternatively, by oligo-dT primed reverse transcription of total cellular RNA. RNA can be fragmented by methods known in the art, e.g., by incubation with ZnC12, to generate fragments of RNA.

In one aspect, total RNA, mRNAs, or nucleic acids derived therefrom (such as cDNA), are isolated from a sample taken from a patient having cancer or a cancer tissue undergoing surgical and/or pharmacological therapies. KZFPs of the invention that are poorly expressed in particular cells may be enriched using normalization techniques known in the art.

As described above, the KZFP polynucleotides can be detectably labeled at one or more nucleotides. Any method known in the art may be used to label the target polynucleotides. Preferably, this labeling incorporates the label uniformly along the length of the RNA, and more preferably, the labeling is carried out at a high degree of efficiency. For example, polynucleotides can be labeled by oligo-dT primed reverse transcription. Random primers (e.g., 9-mers) can be used in reverse transcription to uniformly incorporate labeled nucleotides over the full length of the polynucleotides. Alternatively, random primers may be used in conjunction with PCR methods or T7 promoter-based in vitro transcription methods in order to amplify polynucleotides.

The detectable label may be a luminescent label. For example, fluorescent labels, bioluminescent labels, chemiluminescent labels, and colorimetric labels may be used in the practice of the invention. Fluorescent labels that can be used include, but are not limited to, fluorescein, a phosphor, a rhodamine, or a polymethine dye derivative. Additionally, commercially available fluorescent labels including, but not limited to, fluorescent phosphoramidites such as FluorePrime (Amersham Pharmacia, Piscataway, N. J.), Fluoredite (Miilipore, Bedford, Mass.), FAM (ABI, Foster City, Calif.), and Cy3 or Cy5 (Amersham Pharmacia, Piscataway, N.J.) can be used. Alternatively, the detectable label can be a radiolabeled nucleotide.

In one aspect, KZFP polynucleotide molecules from a patient sample are labeled differentially from the corresponding polynucleotide molecules of a reference sample. The reference can comprise mRNAs from a normal biological sample (i.e., control sample, e.g., biopsy from a subject not having a cancer or a cancer tissue undergoing surgical and/or pharmacological therapies) or from a reference biological sample, (e.g., sample from a subject not having a cancer or a cancer tissue undergoing surgical and/or pharmacological therapies).

Nucleic acid hybridization and wash conditions are chosen so that the target polynucleotide molecules specifically bind or specifically hybridize to the complementary polynucleotide sequences of the array, preferably to a specific array site, wherein its complementary DNA is located. Arrays containing double-stranded probe DNA situated thereon are preferably subjected to denaturing conditions to render the DNA single-stranded prior to contacting with the target polynucleotide molecules. Arrays containing single-stranded probe DNA (e.g., synthetic oligodeoxyribonucleic acids) may need to be denatured prior to contacting with the target polynucleotide molecules, e.g., to remove hairpins or dimers which form due to self complementary sequences.

Optimal hybridization conditions will depend on the length (e.g., oligomer versus polynucleotide greater than 200 bases) and type (e.g., RNA, or DNA) of probe and target nucleic acids. One of skill in the art will appreciate that as the oligonucleotides become shorter, it may become necessary to adjust their length to achieve a relatively uniform melting temperature for satisfactory hybridization results. General parameters for specific (i.e., stringent) hybridization conditions for nucleic acids are described in Sambrook, et ah, Molecular Cloning: A Laboratory Manual (3rd Edition, 2001) . Typical hybridization conditions for the cDNA microarrays of Schena et al. are hybridization in 5><SSC plus 0.2% SDS at 65° C. for four hours, followed by washes at 25° C. in low stringency wash buffer (lxSSC plus 0.2% SDS), followed by 10 minutes at 25° C. in higher stringency wash buffer (O.lxSSC plus 0.2% SDS). Particularly preferred hybridization conditions include hybridization at a temperature at or near the mean melting temperature of the probes (e.g., within 51° C., more preferably within 21° C.) in 1 M NaCl, 50 mM MES buffer (pH 6.5), 0.5% sodium sarcosine and 30% formamide.

When fluorescently labeled gene products are used, the fluorescence emissions at each site of a microarray may be, preferably, detected by scanning confocal laser microscopy. In one embodiment, a separate scan, using the appropriate excitation line, is carried out for each of the two fluorophores used. Alternatively, a laser may be used that allows simultaneous specimen illumination at wavelengths specific to the two fluorophores and emissions from the two fluorophores can be analyzed simultaneously. Arrays can be scanned with a laser fluorescent scanner with a computer controlled X-Y stage and a microscope objective. Sequential excitation of the two fluorophores is achieved with a multi-line, mixed gas laser and the emitted light is split by wavelength and detected with two photomultiplier tubes. Fluorescence laser scanning devices are known in the art. Alternatively, a fiber-optic bundle, may be used to monitor RNA, mRNA, DNA or cDNA abundance levels at a large number of sites simultaneously.

As used herein, the wording “differential transcription and/or expression and/or activity of one or more KZFP” and their synonyms, which are used interchangeably, refer to one or more KZFP gene(s) or KZFP gene product(s) (e.g. transcripts or proteins) whose transcription and/or expression and/or activity is/are activated to a higher or lower level in a subject suffering from a disease, specifically cancer, relative to its/their transcription and/or expression and/or activity in a normal or control subject, or reference data. The terms also include KZFP genes or gene products whose transcription and/or expression and/or activity is altered to a higher or lower level at different stages of the same disease. It is also understood that a differentially transcripted and/or expressed gene may be either activated or inhibited at the nucleic acid level or protein level, or may be subject to alternative splicing to result in a different polypeptide product. Such differences may be evidenced by a change in mRNA levels, surface expression, secretion or other partitioning of a polypeptide, for example.

Differential gene transcription and/or expression and/or activity may include a comparison of transcription and/or expression and/or activity between two or more KZFP genes or their gene products, or a comparison of the ratios of the transcription and/or expression and/or activity between two or more KZFP genes or their gene products, or even a comparison of two differently processed products of the same KZFP gene, which differ between normal subjects and subjects suffering from a disease, specifically cancer, or between various stages of the same disease. In particular, the reference (“control”) transcription and/or expression and/or activity level can be the transcription and/or expression and/or activity level of the KZFP gene product in a control sample. The control sample can be a normal sample, that is, a non-tumoural sample, preferably from the same tissue than the cancer sample, or a basal level of transcription and/or expression and/or activity.

The normal sample may be obtained from the subject affected with the cancer or from another subject, such as a normal or healthy subject, i.e. a subject who does not suffer from a cancer. Additionally, or alternatively, the control sample can be obtained from data repositories of cancer patient expression studies; accordingly, the control sample could be in this case a virtual sample obtained from such a data repository for a given cancer type.

Transcription and/or expression and/or activity levels obtained from cancer and normal samples may be normalized by using e.g. expression levels of proteins which are known to have stable expression such as RPLPO (acidic ribosomal phosphoprotein PO), TBP (TATA box binding protein), GAPDH (glyceraldehyde 3 -phosphate dehydrogenase) or b-actin. Additionally, or alternatively, as it will be appreciated by a person skilled in the art, it is possible to use statistical methods based on whole-transcriptome expression distribution for normalization of cancer and control/normal samples.

Differential transcription and/or expression and/or activity includes both quantitative, as well as qualitative, differences in the temporal or cellular transcription and/or expression and/or activity pattern in a gene or its products among, for example, normal and diseased cells, or among cells which have undergone different disease events or disease stages. For the purpose of this description, “differential gene transcription and/or expression and/or activity” is considered to be present when there is at least a two-fold difference between the transcription and/or expression and/or activity of a given gene or gene product in normal and diseased subjects, or in various stages of disease development in a diseased subject.

In an aspect of the invention, the differential gene transcription and/or expression and/or activity of one or more KZFP corresponds to an upregulated expression, transcription and/or activity of said one or more KZFP.

Usually, the upregulated expression , transcription and/or activity of said one or more KZFP in a biological sample corresponds to an increase equal or superior to about 5 %, preferably equal or superior to about 20 %, more preferably equal or superior to about 40 %, most preferably equal or superior to about 60 %, more preferably equal or superior to about 500%, even more preferably equal or superior to about 1000 %, in particular equal or superior to about 5000 % when compared to the level of corresponding said one or more KZFP in a control biological sample of a cancer-free subject.

The biological sample is selected, in the context of the present application, from the group comprising whole blood, serum, plasma, semen, saliva, tears, urine, fecal material, sweat, buccal smears, skin, and cancer cells, or a combination of one or more of these biological samples.

The present invention also encompasses a method of stratifying a cancer in a subject comprising:

(a) detecting, directly or indirectly, and measuring the level of transcription and/or expression and/or activity of one or more KZFP in a sample obtained from said subject;

(b) comparing the level of transcription and/or expression and/or activity of said one or more KZFP to a control biological sample for the same one or more KZFP; wherein an alteration in the transcription and/or expression and/or activity level of one or more one or more KZFP in said biological sample, relative to the level of corresponding said one or more KZFP in a control biological sample of a cancer-free subject, is indicative of the disease stage or grade.

Also encompassed is a method of determining the progression or regression, of a cancer in a subject suffering therefrom, said method comprising:

(a) detecting, directly or indirectly, and measuring the level of transcription and/or expression and/or activity of one or more KZFP in a sample obtained from said subject;

(b) periodically comparing the level of transcription and/or expression and/or activity of said one or more KZFP, wherein an alteration in the level of transcription and/or expression and/or activity of one or more KZFP in a sample obtained from the same subject, relative to the level of transcription and/or expression and/or activity of one or more KZFP, determined previously, is indicative of the progression or regression of said cancer.

In an aspect of the invention, when the alteration in the transcription and/or expression and/or activity level (s) of one or more one or more KZFP corresponds to an enhanced transcription and/or expression and/or activity of said one or more KZFP, then this is indicative of the progression of said cancer.

In the same aspect of the invention, when the alteration in the transcription and/or expression and/or activity level (s) of one or more one or more KZFP corresponds to a decreased transcription and/or expression and/or activity of said one or more KZFP, then this is indicative of the regression of said cancer.

Alternatively, this method can also be used to determine the response to a cancer therapy. In this aspect of the invention, when the alteration in the transcription and/or expression and/or activity level of one or more one or more KZFP corresponds to an enhanced transcription and/or expression and/or activity of said one or more KZFP, then this is indicative that the cancer does not respond to said cancer therapy.

In the same aspect of the invention, when the alteration or differential transcription and/or expression and/or activity level of one or more one or more KZFP corresponds to a decreased transcription and/or expression and/or activity of said one or more KZFP, then this is indicative that the cancer responds to said cancer therapy.

Also provided is a kit for performing a method according to the invention of for use in the treatment of cancer, said kit comprising

(a) one or more agents and/or pharmaceutical compositions, and

(b) instructions for use.

The disclosure is further illustrated by the following examples. The examples below are non limiting and are merely representative of various aspects of the disclosure.

EXAMPLES

Primary tumors data

The RNA-seq dataset analyzed here has been granted access by the development assistance committee (DAC) of the University of Duke and accessed through The European Genome- phenome Archive (EGA) at the European Bioinformatics Institute. It includes RNA-seq data from 775 DLBCL samples sequenced on Illumina Hiseq 2500 platform generating 125 PE length reads with an average of 10 million reads per sample. The paired metadata used for survival analysis was extracted from the supplementary data of the study published by Reddy et al. Genetic and Functional Drivers of Diffuse Large B Cell Lymphoma. Reddy, Anupama et al. “Genetic and Functional Drivers of Diffuse Large B Cell Lymphoma.” Cell vol. 171,2 (2017): 481-494. el5. doi:10.1016/j.cell.2017.09.027

RNA-seq analysis

RNA-Seq bam files of the 775 DLBCL samples were downloaded from the EGA under the accession number EGAS00001002606. Read summarization on genes were generated using featureCounts with parameters -s 2 -t exon -g gene id -Q 10. Sequencing depth normalization was performed using the TMM method as implemented in the voom function from the R package LIMMA from Bioconductor. One-hundred and forty -two samples with low coverage due to potential artefacts were omitted. The genes with an average expression <1 counts per million (CPM) across samples were filtered.

RNA extraction, quantification, and sequencing

Duplicates from two independent experiments were collected per sample. Total RNA was extracted with NucleoSpin RNA Plus (Macherey -Nagel) with an on-column deoxyribonuclease treatment. The sequencing libraries were generated using the TruSeq stranded mRNA sample preparation kit from Illumina. Libraries were then sequenced using the Illumina Hi-Seq technology with stranded 75-base single or paired-end reads. RNA-seq reads were mapped to the human genome (hgl9) using HISAT2 (v2.1.0). Reads that were not uniquely mapped were discarded from the analysis using bamtools filter v2.4.1 with parameters -tag "NH:1".

RNA-seq analysis

RNA-Seq bam files of the 775 DLBCL samples were downloaded from the EGA under the accession number EGAS00001002606. Read summarization on genes were generated using featureCounts with parameters -s 2 -t exon -g gene id -Q 10. Sequencing depth normalization was performed using the TMM method as implemented in the voom function from the R package LIMMA from Bioconductor. One-hundred and forty -two samples with low coverage due to potential artefacts were omitted. The genes with an average expression <1 counts per million (CPM) across samples were filtered.

Gene-set enrichment analysis from differentially expressed genes in the KD experiments were performed using the Metascape webtool (www.metascape.org).

Cell culture

OCI-Ly7 and U2932 human lymphoma cell lines were obtained from DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany (www.dsmz.de). The other cell lines used were obtained from ATCC collection, namely SU-DHL-4 (CRL- 2957 ™) HL60 (CCL-240™), LoVo (CCL-229 ™), HCT116 (CCL-247 ™), HT-29 (HTB-38 ™)293T (CRL-11268 ™), K562 (CCL-243 ™). HBL-1 were obtained from ABM (CAT N° T8204). All lymphoma cell lines were grown in RPMI 1640 (Gibco®) containing 10-20% fetal calf serum (FCS) and 1% penicillin, with the exception of OCI-Ly7 which were grown in EMDM containing 20% FCS and 1% penicillin. All cell lines were cultured at 37°C and 5% CO2. Cells were transduced, selected with 2-3 ug/mL puromycin and collected after 6 days for RNA-sequencing.

Plasmids and lentiviral vectors pLKO.puro shRNA vectors were used for ZNF586, ZNF587 and ZNF417 knock downs. The hairpin sequences against ZNF417/587 (GCAGCATATTGGAGAGAAATT ; AGTCGAAAGAGCAGCCTTATT) were designed using the Genetic Perturbation Portal of the Broad Institute and inserted with Age I/Eco RI into pLKO.1.puro LV (TRCN0000018002; Sigma- Aldrich). The short hairpin RNAs (shRNAs) targeting the sequence ZNF586 (TRCN0000015373 and TRCN0000015376) were obtained from Sigma- Aldrich. Lentiviral vectors production protocols are detailed at http://tronolab.epfl.ch. The cell proliferation after LV transduction was determined using the 3-[4,5-dimethylthiazol-2- yl]-2,5- diphenyltetrazolium bromide (MTT) assay. The cells were transduced at a multiplicity of infection (MOI) of 20 in every experiment. Results

Diffuse large B cell lymphoma (DLBCL) is the most frequently diagnosed aggressive non- Hodgkin lymphoma (NHL). It comprises several histological variants, which can be grouped into three main molecular subtypes based on gene expression signatures: germinal center (GCB), activated B-cell (ABC) and unclassified (UNCL) DLBCL. The distinction between these three groups holds prognostic and therapeutic value, ABC DLBCL has a worse prognosis than its GCB or UNCL counterparts as it is more frequently refractory to first line standard treatments and prone to relapse. The standard of care for first-line therapy in DLBCL is the R-CHOP (rituximab, cyclophosphamide, doxorubicin, vincristine and prednisone) regimen. Research on drug resistance has so far mainly focused on genetic alterations, but recent data points to the role of epigenetic changes. For instance, anthracycline-resi stance has been linked to silencing of the transcription factor SMADl by DNA methylation of its promoter, resulting in impaired activation of DNA damage response genes. Accordingly, inhibitors of DNA methylation, such as 5-azacytidine, are currently coupled with R-CHOP in some clinical trials. Although advances in therapy over the last 20 years have led to improvements in caner survival, additional strategies are still urgently needed.

We used our transcriptome analytical pipeline on a cohort of 630 DCBL patients (Fig. 1 A). Out of 23,560 genes, 1,530 were significantly associated with survival. Amongst these, 41 encoded for KZFPs (dark dots), which were associated with an increased risk of death compared with other TFs and protein coding genes (Fig. IB).

Of 39 KZFP genes associated with poor prognosis, 38 were upregulated in the ABC DCBL subtype, 82% of which located on the long arm of chromosome 19 (Chrl9q), a region frequently amplified in forms of DLBCL associated with disease relapse.

ZNF586, ZNF587 and its paralog ZNF417 were amongst the most highly expressed genes of this region, along with 5 others KZFP genes (namely ZNF587B, ZNF814, ZNF552, ZNF776 and ZNF671) also associated to poor prognosis clustered over a 200 kb segment (Fig. 2B). Gene-silencing experiments of ZNF586, ZNF587 and ZNF417, across a panel of DLBCL cell lines, demonstrated a stringent proliferation arrest phenotype, which was also noticed in leukemia and colon cancer cell lines (Fig 3-C).

These experiments also revealed that ZNF586, ZNF587 and ZNF417 depletion was able to trigger a necrotic form of cell death (Fig. 4A-B), the mechanisms of which is currently being investigated. At the same time, the KD of these KZFP were also able to generate replicative stress and DNA damage (Fig. 6B).

Transcriptomic analysis of arrested cells revealed enrichments in pathways implicated in proteolysis, endoplasmic reticulum-associated protein degradation (ERAD), antigen processing, major histocompatibility complex (MCH) class I presentation and cytokine production, suggesting increased sensitivity to immune recognition and killing (Fig. 5A-C). Additionally, proteome analysis of cell supernatant detected the presence of CXCL10, a chemokine implicated in T-cell, NK cell and macrophage migration, also suggesting that maneuvers aimed at blocking the effect of the KZFPs might enhance anti-tumor immune responses.

Based on these results, ZNF586, ZNF587 and ZNF417, along with other members of their 200 kb cluster, are new potential targets for the treatment of aggressive non-Hodgkin lymphoma and other tumors.