BIOMARKERS FOR LUNG CANCER

Document Type and Number:

WIPO Patent Application WO/2012/160177

Abstract:

The present invention relates, in part, to methods for determining a prognosis of early stage lung cancer in an individual using one or more biomarkers.

Inventors:

MISSIAGLIA EDOARDO (CH)
WIRAPATI PRATYAKSHA (CH)
ROSSI SIMONA (CH)

Application Number:

PCT/EP2012/059784

Publication Date:

November 29, 2012

Filing Date:

May 24, 2012

Assignee:

NOVARTIS AG (CH)
MISSIAGLIA EDOARDO (CH)
WIRAPATI PRATYAKSHA (CH)
ROSSI SIMONA (CH)

International Classes:

C12Q1/68

Domestic Patent References:

WO2010063121A1

2010-06-10

Other References:

SUZANNE K LAU ET AL: "Three-gene prognostic classifier for early-stage non-small-cell lung cancer", JOURNAL OF CLINICAL ONCOLOGY, AMERICAN SOCIETY OF CLINICAL ONCOLOGY, US, vol. 25, no. 35, 10 December 2007 (2007-12-10), pages 5562 - 5569, XP008145645, ISSN: 0732-183X, DOI: 10.1200/JCO.2007.12.0352
BOUTROS PAUL C ET AL: "Prognostic gene signatures for non-small-cell lung cancer", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, vol. 106, no. 8, 24 February 2009 (2009-02-24), pages 2824 - 2828, XP002679415, ISSN: 1091-6490, PMID: 19196983
KADARA HUMAM ET AL: "Identification of Gene Signatures and Molecular Markers for Human Lung Cancer Prognosis using an In vitro Lung Carcinogenesis System", CANCER PREVENTION RESEARCH, vol. 2, no. 8, August 2009 (2009-08-01), pages 702 - 711, XP002679416
PALLANTE P ET AL: "The loss of the CBX7 gene expression represents an adverse prognostic marker for survival of colon carcinoma patients", EUROPEAN JOURNAL OF CANCER, PERGAMON PRESS, OXFORD, GB, vol. 46, no. 12, 1 August 2010 (2010-08-01), pages 2304 - 2313, XP027189167, ISSN: 0959-8049, [retrieved on 20100609]
KARAMITOPOULOU EVA ET AL: "Loss of the CBX7 protein expression correlates with a more aggressive phenotype in pancreatic cancer", EUROPEAN JOURNAL OF CANCER, vol. 46, no. 8, May 2010 (2010-05-01), pages 1438 - 1444, XP002679417, ISSN: 0959-8049
SALOMON ET AL., CRIT REV ONCOL HEMATOL., vol. 19, 1995, pages 183 - 232
GOLDSTRAW ET AL., J. THORAC. ONCOL., vol. 2, 2007, pages 706 - 714
KUTIKOVA ET AL., LUNG CANCER, vol. 50, no. 2, 2005, pages 143 - 154
MENDELSSOHN ET AL., NCCN-GUIDELINES NSCLC, vol. 1, 2000, pages 2010
D'ADDARIO ET AL., ANNALS OF ONCOLOGY, vol. 20, no. suppl 4, 2009, pages iv68 - iv70
ARRIAGADA ET AL., NEJM, vol. 350, 2004, pages 351 - 360
CHIRGWIN ET AL., BIOCHEMISTRY, vol. 18, 1979, pages 5294 - 5299
DULAC, CURR. TOP. DEV. BIOL., vol. 36, 1998, pages 245
JENA ET AL., J. IMMUNOL. METHODS, vol. 190, 1996, pages 199
WANG ET AL., PROC. NATL. ACAD. SCI. USA, vol. 86, 1989, pages 9717
"Current Protocols in Molecular Biology", 1989, JOHN WILEY AND SONS, N.Y, pages: 6.3.1 - 6.3.6
KARLIN; ALTSCHUL, PROC. NATL. ACAD. SCI. USA, vol. 87, 1990, pages 2264 - 68
KARLIN; ALTSCHUL, PROC. NATL. ACAD. SCI. USA, vol. 90, 1993, pages 5873 - 77
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 10
ALTSCHUL ET AL., NUCLEIC ACIDS RESEARCH, vol. 25, no. 17, 1997, pages 3389 - 3402
MYERS; MILLER, CABIOS, 1989
KRICKA: "Nonisotopic DNA Probe Techniques", 1992, ACADEMIC PRESS
DYMECKI ET AL., J. BIOL. CHEM., vol. 267, 1992, pages 4815
BOERSMA; VAN LEEUWEN, J. NEUROSCI. METHODS, vol. 51, 1994, pages 317
GREEN ET AL., CELL, vol. 28, 1982, pages 477
ARNHEITER ET AL., NATURE, vol. 294, 1981, pages 278
HELD ET AL., GENOME RESEARCH, vol. 6, 1996, pages 986 - 994
VELCULESCU ET AL., SCIENCE, vol. 270, 1995, pages 484 - 487
VELCULESCU ET AL., CELL, vol. 88, 1997, pages 243 - 51
BHATTACHARJEE A ET AL., PROC NATL ACAD SCI USA, vol. 98, 2001, pages 13790 - 5
TAKEUCHI ET AL., J CLIN ONCOL, vol. 24, 2006, pages 1679 - 88
RAPONI ET AL., CANCER RES, vol. 66, 2006, pages 7466 - 72
LU ET AL., PLOS MED, vol. 3, 2006, pages 467
SHEDDEN ET AL., NAT MED, vol. 14, 2008, pages 822 - 7
HOU ET AL., PLOS ONE, vol. 5, 2010, pages 10312
WILKERSON ET AL., CLIN CANCER RES, vol. 16, 2010, pages 4864 - 75
ZHU ET AL., J CLIN ONCOL, vol. 28, 2010, pages 4417 - 24
WIRAPATI, BREAST CANCER RESEARCH, vol. 10, 2008, pages R65
KAPLAN; MEIER, J AM. STATIST. ASSOC., vol. 53, 1958, pages 457 - 481
POPOVICI ET AL.: "Selecting control genes for RT-QPCR using public microarray data", BMC BIOINFORMATICS, vol. 10, 2 February 2009 (2009-02-02), pages 42, XP021047304, DOI: doi:10.1186/1471-2105-10-42
MCCALL ET AL., BIOSTATISTICS, vol. 11, no. 2, 22 January 2010 (2010-01-22), pages 242 - 53
SOTIRIOU ET AL., J NATL CANCER INST., vol. 98, no. 4, 15 February 2006 (2006-02-15), pages 262 - 72
THERNEAU TM; GRAMBSCH PM: "Modelling survival data: extending the Cox model", 2000, SPRINGER
RAVDIN ET AL., J CLIN ONCOL., vol. 19, no. 4, 15 February 2001 (2001-02-15), pages 980 - 91
XIE ET AL., CLIN CANCER RES, vol. 17, 2011, pages 5705 - 14

Attorney, Agent or Firm:

DIDELON, Frederic (Patent Department, Basel, CH)

Claims:

What is claimed is:

1. A method for prognosing a subject with non-small cell lung cancer (NSCLC) comprising: obtaining a test sample from a subject suffering from NSCLC following surgical resection; determining the expression level of at least three or more biomarkers identified in each of Table 1, Table 2 or Table 3; and analyzing the expression level to generate a risk score, wherein the risk score can be used to provide a prognosis of the subject.

2. A method for prognosing a subject with non-small cell lung cancer (NSCLC) comprising: obtaining a test sample from a subject suffering from NSCLC following surgical resection; determining the expression level of at least one biomarker from Table 1, Table 2 and Table 3 in the test sample; and analyzing the expression level to generate a risk score, wherein the risk score can be used to provide a prognosis of the subject.

3. The method of claim 1 or claim 2 wherein the risk score classifies the subject in a high risk group and would benefit from receiving adjuvant chemotherapy or in a low risk group and would not benefit from receiving adjuvant chemotherapy.

4. The method of claim 1 or claim 2, wherein the subject has a prognosis of having poor survival or a prognosis of having good survival.

5. The method of claim 2, wherein the at least one biomarker identified in Table 1, Table 2 and Table 3 comprise CBX7, STX1A, and TPX2.

6. The method of claim 2, wherein the at least one biomarker identified in Table 1, Table 2 and Table 3 comprise CBX7, TMPRSS2, STX1A, KLK6, TPX2 and UCK2.

7. The method of claim 2, wherein the at least one biomarker identified in Table 1, Table 2 and Table 3 comprises CBX7, TMPRSS2, GPR116, STX1A, KLK6, SLC16A3, TPX2, UCK2, PHKA1.

8. The method of claim 2, wherein the at least one biomarker identified in Table 1, Table 2 and Table 3 comprises CBX7, TMPRSS2, GPR116, KCNJ15, STX1A, KLK6, SLC16A3, PYGL, TPX2, UCK2, PHKA1, or EIF4A3.

9. The method of claim 2, wherein the at least one biomarker identified in Table 1, Table 2 and Table 3 comprises CBX7, TMPRSS2, GPR116, KCNJ15, PTPN13, STX1A, KLK6, SLC16A3, PYGL, LDHA, TPX2, UCK2, PHKA1, EIF4A3 or TK1.

10. The method of claim 2, wherein the at least one biomarker identified in Table 1, Table 2 and Table 3 comprises CBX7, TMPRSS2, GPR116, KCNJ15, PTPN13, CTSH, STX1A, KLK6, SLC16A3, PYGL, LDHA, ITGA5, TPX2, UCK2, PHKA1, EIF4A3, TK1, or CCNA2.

11. A method of predicting prognosis in a subject with non-small cell lung cancer (NSCLC) following surgical resection, comprising determining expression of one or more biomarkers listed in Table 1, Table 2 and/or Table 3, wherein an increase in expression of one or more biomarkers listed in Table 2 and/or Table 3 and a decrease in expression of one or more of the biomarkers listed in Table 1 compared to a control is used to predict whether the subject is in a high risk group having poor survival or a low risk group having good survival.

12. The method of claim 11, wherein the subject in the high risk group is selected for adjuvant chemotherapy and the subject in the low risk group is not selected for adjuvant chemotherapy.

13. A method of selecting a therapy for a subject with NSCLC, comprising obtaining a test sample from a subject suffering from NSCLC who has undergone a resection; determining the expression level of at least two or more biomarkers identified in Table 2 in the test sample to generate an expression value for each gene; and analyzing the expression value to generate a risk score, wherein the risk score can be used to classify whether the subject is selected to receive an angiogenesis inhibitor.

14. The method of claim 13, wherein the angiogenesis inhibitor is avastin.

15. A method of selectively treating a subject having NSCLC cancer, comprising: obtaining a test sample from a subject suffering from NSCLC following surgical resection; determining the expression level of at least one or more biomarkers identified in Table 1, Table 2 or Table 3, or any combination of biomarkers identified in Table 1, Table 2 or Table 3 in the test sample to generate a risk score; classifying the subject based on the risk score into a high risk group or a low risk group; and administering adjuvant therapy to the subject classified as belonging to the high risk group or administering no adjuvant therapy to the subject classified as belonging to the low risk group.

16. The method of claim 11, claim 13 or claim 15 comprising determining expression of at least three biomarkers identified in Table 1, Table 2 and/or Table 3.

17. The method of claim 1, claim 11, claim 13 or claim 15 comprising determining expression of at least four biomarkers identified in Table 1, Table 2 and/or Table 3.

18. The method of claim 1, claim 11, claim 13 or claim 15 comprising determining expression of at least five biomarkers identified in Table 1, Table 2 and/or Table 3.

19. The method of claim 1, claim 11, claim 13 or claim 15 comprising determining expression of at least six biomarkers identified in Table 1, Table 2 and/or Table 3.

20. The method of any of claims 1-19 wherein the NSCLC is stage I NSCLC or stage II NSCLC, or a combination thereof.

21. The method of any one of claims 1-20, wherein the NSCLC is identified in the group consisting of squamous cell carcinoma and adenocarcinoma.

22. The method of any one of claims 1-21, wherein the test sample is fresh, frozen, paraffin fixed embedded cells.

23. The method of any one of claims 1-22, wherein the expression level is determined using qNPA, nanoString, quantitative PCR or an array.

24. The method of claim 1 or claim 2, wherein analyzing expression to generate a risk score is performed using statistical analysis.

25. The method of claim 24, wherein the statistical analysis comprises Cox regression analysis or parametric survival predictors.

26. The method of claims 1-25, wherein the subject is a human.

27. A kit comprising a plurality of agents for measuring the expression of one or more biomarkers identified in Table 1, Table 2 and/or Table 3 and instructions for use.

28. A kit for predicting whether a subject with lung cancer would benefit from adjuvant therapy, the kit comprising: a plurality of agents for measuring the expression of one or more biomarkers identified in Table 1, Table 2 and/or Table 3; means for analyzing the expression and generating a risk score to predict whether a patient would benefit from adjuvant therapy.

29. The kit of claim 28, wherein the agents for measuring expression comprise an array of polynucleotides complementary to the mRNAs of the identified genes.

30. The kit of claim 28, wherein the agents for measuring expression comprise a plurality of PCR probes and/or primers for qRT-PCR.

31. The kit of claim 28, wherein the at least one biomarker identified in Table 1, Table 2 and Table 3 comprises CBX7, STX1A, and TPX2.

32. An array comprising one or more polynucleotide probes complementary and hybridizable to an expression product of at least two biomarkers shown in Table 1, Table 2 and/or Table 3.

33. A composition comprising a plurality of isolated nucleic acid sequences, wherein each isolated nucleic acid sequence hybridizes to an RNA product of the biomarkers CBX7, STX1A, and TPX2, wherein the composition is used to measure the level of RNA expression of the three genes.

34. A computer product for predicting a prognosis of a subject with NSCLC comprising: means for receiving data corresponding to the expression level of one or more biomarkers in a sample from a subject having NSCLC, wherein the one or more biomarkers are identified in Table 1, Table 2 and/or Table 3, means for generating an expression value for each gene; and means for generating a risk score based on inputting the expression value into a database comprising a reference expression profile associated with a prognosis, wherein the risk score predicts a prognosis of survival or classifies the subject into a high risk group or a low risk group.

35. A computer product of claim 34 for use with the method of any one of claims 1-25.

Description:

BIOMARKERS FOR LUNG CANCER

FIELD OF THE INVENTION

The present invention relates to a method of prognosis and personalized therapy.

BACKGROUND OF THE INVENTION

Lung cancer is the most common cancer diagnosis in the world, with 1.5 million new cases in 2007 (Salomon et al., Crit Rev Oncol Hematol., 19:183-232, 1995, SEER-database 05.2010). The high incidence and mortality rates make it the leading cause of cancer-related death, with more than 975,000 deaths per year and a 5-year survival rate of 15% (Salomon et al., supra). Lung cancer can be classified as small cell lung cancer or non-small cell lung cancer (NSCLC). NSCLC accounts for about 80% of the cases. NSCLC can be further subdivided into several histological types, the most common of which are adenocarcinoma (40%) and squamous cell carcinoma (25%).

The current treatment of NSCLC is mainly based on tumor morphology and the tumor-node-metastasis (TNM)-based staging system, which classifies tumors in graduated categories (Stage IA, IB, IIA, IIB, IIIA, IIIB and IV) corresponding to the extent of tumor progression. Many staging systems exist (e.g., clinical vs. pathological staging, as well as various editions of staging guidelines such as those issued by the International Association for the Study of Lung Cancer (IASLC)) (Goldstraw et al., J. Thorac. Oncol. 2, 706-714, 2007). The frequencies of the stages, for example according to clinical staging and the IASLC 6th edition of the TNM staging recommendation, are 23% stage I, 19% stage II, 37% stage III and 21% stage IV (Goldstraw et al., supra). The relative proportions may change substantially depending on the guidelines and whether clinical or pathological staging is used.

Surgery is the standard treatment for early stage NSCLC (stage I and II), followed by adjuvant therapy such as radiation therapy, chemotherapy (for stage II and later), and bevacizumab and epidermal growth factor receptor (EGFR) tyrosine kinase inhibitors (TKIs) for the advanced NSCLC stages (Kutikova et al., Lung Cancer; 50(2):143-154, 2005). Clinical guidelines in the United States and Europe for treatment of NSCLC support these treatment options (Mendelssohn et al., 2000; NCCN-Guidelines NSCLC V1, 2010).

Under the current TNM-based staging system for early lung cancer, Stage I NSCLC patients have a 35% chance of relapse within 5 years after surgery (SEER-Database, 2008). Current treatment guidelines do not recommend adjuvant chemotherapy for these patients. Conversely, 30% of patients with TNM-based Stage II disease will not experience a relapse even without adjuvant chemotherapy (SEER-Database), meaning that these patients are overtreated under the current treatment guidelines (i.e., ESMO (D'Addario et al., Annals of Oncology. 2009; 20 (suppl 4):iv68-iv70, 2009), NCCN V1-2010). This is paralleled by reports stating that 60% of patients with early NSCLC will have no relapse after surgery (Arriagada et al., NEJM, 350:351-360, 2004).

Current clinical data on adjuvant chemotherapy treatment of early NSCLC indicate that the median benefit of adjuvant chemotherapy is 4% (NSCLC Meta-Analysis Collaborative Group), with survival improving from 60% to 64% at 5 years.

Based on the current shortcomings, there is a medical rationale for a prognostic and/or predictive genomic signature for patients with NSCLC.

SUMMARY OF THE INVENTION

The present invention relates, in part, to methods for determining a prognosis of early stage lung cancer in an individual using one or more biomarkers described herein. These findings may be used to help to determine appropriate treatments for patients with early stage lung cancer such as identifying those patients who would benefit from receiving adjuvant therapy.

In one aspect, the invention includes a method for prognosing or classifying a subject with non-small cell lung cancer (NSCLC) including obtaining a test sample from a subject suffering from NSCLC following surgical resection; determining the expression level of at least one or more biomarker identified in Table 1, Table 2 and/or Table 3, or any combination of biomarkers identified in Table 1, Table 2 and/or Table 3 in the test sample; and analyzing the expression level to generate a risk score, wherein the risk score can be used to provide a prognosis or classify the subject.

In another aspect, the invention includes a method for prognosing or classifying a subject with non-small cell lung cancer (NSCLC) comprising: obtaining a test sample from a subject suffering from NSCLC following surgical resection; determining the expression level of at least one biomarker from Table 1, Table 2 and Table 3 in the test sample; and analyzing the expression level to generate a risk score, wherein the risk score can be used to provide a prognosis or classify the subject. In one embodiment, the at least one biomarker identified in Table 1, Table 2 and Table 3 includes CBX7, STX1A, and TPX2. In another embodiment, the at least one biomarker identified in Table 1, Table 2 and Table 3 includes CBX7, TMPRSS2, STX1A, KLK6, TPX2 and UCK2. In yet another embodiment, the at least one biomarker identified in Table 1, Table 2 and Table 3 includes CBX7, TMPRSS2, GPR116, STX1A, KLK6, SLC16A3, TPX2, UCK2, PHKA1. In still yet another embodiment, the at least one biomarker identified in Table 1, Table 2 and Table 3 comprises CBX7, TMPRSS2, GPR116, KCNJ15, STX1A, KLK6, SLC16A3, PYGL, TPX2, UCK2, PHKA1, or EIF4A3. In yet another embodiment, the at least one biomarker identified in Table 1, Table 2 and Table 3 includes CBX7, TMPRSS2, GPR116, KCNJ15, PTPN13, STX1A, KLK6, SLC16A3, PYGL, LDHA, TPX2, UCK2, PHKA1, EIF4A3 or TK1. In yet another embodiment, the at least one biomarker identified in Table 1, Table 2 and Table 3 comprises CBX7, TMPRSS2, GPR116, KCNJ15, PTPN13, CTSH, STX1A, KLK6, SLC16A3, PYGL, LDHA, ITGA5, TPX2, UCK2, PHKA1, EIF4A3, TK1, or CCNA2.

In one embodiment, the risk score of the invention can be used for prognosis by mapping subjects to time-specific probability of death due to lung cancer, distant metastasis or local relapse.
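As an illustration only, the mapping from a risk score to a time-specific probability of death could be sketched as follows, assuming a proportional-hazards relationship in which survival at time t is the baseline survival raised to the power exp(risk score). The baseline survival values, the name `BASELINE_SURVIVAL` and the function `mortality_probability` are hypothetical and are not taken from this disclosure.

```python
import math

# Hypothetical baseline survival S0(t) at 1 to 5 years for a subject whose
# risk score is 0. These are placeholder numbers, not values from the text.
BASELINE_SURVIVAL = {1: 0.95, 2: 0.88, 3: 0.82, 4: 0.77, 5: 0.72}


def mortality_probability(risk_score: float, years: int) -> float:
    """Probability of death within `years`, assuming proportional hazards.

    Under this assumption S(t | r) = S0(t) ** exp(r), so mortality is
    1 - S(t | r). A higher risk score gives a higher mortality.
    """
    baseline = BASELINE_SURVIVAL[years]
    return 1.0 - baseline ** math.exp(risk_score)


if __name__ == "__main__":
    for score in (-1.0, 0.0, 1.0):
        print(score, round(mortality_probability(score, 5), 3))
```

With these placeholder values, a risk score of 0 simply reproduces the baseline 5-year mortality, and mortality rises monotonically with the score.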

In another embodiment, the risk score can classify the subject into a high risk group that would benefit from receiving adjuvant chemotherapy or in a low risk group that would not benefit from receiving adjuvant chemotherapy.

In another aspect, the invention includes a method of predicting prognosis in a subject with non-small cell lung cancer (NSCLC) following surgical resection, comprising determining the expression profile of mRNA from tumor samples, either from fresh frozen (FF) or formalin fixed paraffin embedded (FFPE) material. The profile comprises one or more biomarkers listed in Table 1, Table 2 and/or Table 3, wherein an increase in expression of one or more biomarkers listed in Table 2 and/or Table 3 and a decrease in expression of one or more of the biomarkers listed in Table 1 compared to a control is used to predict whether the subject is in a high risk group having poor survival or a low risk group having good survival. In the method of the invention, a subject in the high risk group is selected for adjuvant chemotherapy and the subject in the low risk group is not selected for adjuvant chemotherapy and then treated accordingly.

In yet another embodiment, the invention includes a method of selecting a therapy for a subject with NSCLC, including obtaining a test sample from a subject suffering from NSCLC who has undergone a resection; determining the expression level of at least two or more biomarkers identified in Table 2 in the test sample to generate an expression value for each gene; and analyzing the expression value to generate a risk score, wherein the risk score can be used to classify whether the subject is selected to receive an angiogenesis inhibitor such as avastin.

In the methods of the invention, the invention includes determining expression of at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14 or at least 15 biomarkers identified in Table 1, Table 2 and/or Table 3, or any combination thereof. For example, the expression of at least any 5 biomarkers from each of Table 1, Table 2 and Table 3 are selected (the signature in this embodiment would include at least 15 biomarkers). In one embodiment, the NSCLC is stage I NSCLC, stage II NSCLC, or a combination thereof. The NSCLC can be identified in the group consisting of squamous cell carcinoma and/or adenocarcinoma.

In one embodiment, the subject is human. In another embodiment, the test sample can be fresh, frozen, or FFPE cells. In another embodiment, the expression level is determined using quantitative PCR or an array.

In yet another embodiment, analyzing expression to generate a risk score is performed using statistical analysis such as Cox regression or parametric survival predictors.
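A minimal sketch of how a Cox-type risk score and the resulting high/low risk classification might be computed is given below. It assumes the risk score is a linear predictor (a weighted sum of standardized expression values) and that a single cutoff separates the groups. The coefficient values, the cutoff, and the names `COEFFICIENTS`, `risk_score` and `classify` are hypothetical; only the signs follow the text (lower expression of a Table 1 gene such as CBX7 and higher expression of Table 2 and Table 3 genes such as STX1A and TPX2 indicate higher risk).

```python
# Hypothetical Cox regression coefficients (log hazard ratios). The values
# are placeholders; only their signs reflect the direction described in the
# text: negative for a Table 1 gene, positive for Table 2 / Table 3 genes.
COEFFICIENTS = {"CBX7": -0.5, "STX1A": 0.4, "TPX2": 0.6}

# Hypothetical cutoff separating the high risk and low risk groups.
CUTOFF = 0.0


def risk_score(expression: dict) -> float:
    """Linear predictor: sum of coefficient * standardized expression."""
    return sum(COEFFICIENTS[gene] * expression[gene] for gene in COEFFICIENTS)


def classify(score: float, cutoff: float = CUTOFF) -> str:
    """Assign a subject to the high risk or low risk group."""
    return "high risk" if score > cutoff else "low risk"


if __name__ == "__main__":
    # Standardized expression values for one illustrative subject.
    subject = {"CBX7": -1.0, "STX1A": 1.0, "TPX2": 1.0}
    score = risk_score(subject)
    print(score, classify(score))
```

In practice the coefficients and cutoff would be estimated from a training cohort with follow-up data; the sketch only shows how fitted values would be applied to a new sample.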

In another aspect, the invention includes a method of selectively treating a subject having NSCLC cancer including obtaining a test sample from a subject suffering from NSCLC following surgical resection, determining the expression level of at least one or more biomarkers identified in Table 1, Table 2 and/or Table 3, or any combination of biomarkers identified in Table 1, Table 2 and/or Table 3 in the test sample to generate a risk score; classifying the subject based on the risk score into a high risk group or a low risk group; and administering adjuvant therapy to the subject classified as belonging to the high risk group or administering no adjuvant therapy to the subject classified as belonging to the low risk group.

In yet another aspect, the invention includes a kit including a plurality of agents for measuring the expression of one or more biomarkers identified in Table 1, Table 2 and/or Table 3 and instructions for use. In yet another aspect, the invention includes a kit for predicting whether a subject with lung cancer would benefit from adjuvant therapy, the kit including a plurality of agents for measuring the expression of one or more biomarkers identified in Table 1, Table 2 and/or Table 3; means for analyzing the expression and generating a risk score to predict whether a patient would benefit from adjuvant therapy. The agents for measuring expression can include an array of polynucleotides complementary to the mRNAs of the identified biomarkers. The agents that measure expression can include a plurality of PCR probes and/or primers for qRT-PCR. The kit can include agents for measuring at least one biomarker identified in Table 1, Table 2 and Table 3 such as CBX7, STX1A, or TPX2.

In another aspect, the invention includes an array comprising one or more polynucleotide probes complementary and hybridizable to an expression product of at least two biomarkers shown in Table 1, Table 2 and/or Table 3. In yet another aspect, the invention includes a composition comprising a plurality of isolated nucleic acid sequences, wherein each isolated nucleic acid sequence hybridizes to an RNA product of a biomarker shown in Table 1, e.g., the biomarkers CBX7, STX1A and TPX2, wherein the composition is used to measure the level of RNA expression of the three genes.

In yet another aspect, the invention includes a computer product for predicting a prognosis, or classifying a subject with NSCLC including means for receiving data corresponding to the expression level of one or more biomarkers in a sample from a subject having NSCLC, wherein the one or more biomarkers are identified in Table 1, Table 2 and/or Table 3, means for generating an expression value for each gene; and means for generating a risk score based on inputting the expression value into a database comprising a reference expression profile associated with a prognosis, wherein the risk score predicts a prognosis of survival or classifies the subject into a high risk group or a low risk group.

In yet another aspect, the invention includes a computer product for use with any one of the methods described above.

A "biomarker" is a molecule useful as an indicator of a biologic state in a subject. With reference to the present subject matter, the biomarkers disclosed herein can be molecules that exhibit a change in expression and whose presence can be used for prognosis or to predict whether a subject would benefit from receiving a particular treatment. The biomarkers of interest can be determined by detecting a change in expression of the biomarker. Expression describes the conversion of the DNA gene sequence information into transcribed RNA (the initial unspliced RNA transcript or the mature mRNA) or the encoded protein product. The biomarkers disclosed herein include any, or any combination of the biomarkers listed in Tables 1, 2 and 3 and can be transcribed RNA or encoded protein product.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 depicts pairwise scatter plots of the standardized scores of modules 1, 2 and 3, demonstrating they convey different information about the tumor.

Fig. 2A-D depict Kaplan-Meier overall survival analysis curves showing the prognostic performance of the three signatures (module 1, 2, and 3) when they are applied separately, or combined, for patients up to five years after surgery.

Fig. 3A-F depict Kaplan-Meier overall survival analysis curves showing the prognostic performance of module 1 under stratification by stage, histology and age, for patients up to five years after surgery.

Fig. 4A-F depict Kaplan-Meier survival analysis curves showing the prognostic performance of module 2 under stratification by stage, histology and age, for patients up to five years after surgery.

Fig. 5A-F depict Kaplan-Meier survival analysis curves showing the prognostic performance of module 3 under stratification by stage, histology and age, for patients up to five years after surgery.

Fig. 6A-F depict Kaplan-Meier survival analysis curves showing the prognostic performance of the combined score under stratification by stage, histology and age, for patients up to five years after surgery.

Fig. 7A-F depict Kaplan-Meier survival analysis curves showing the prognostic performance of a number of example signatures constructed as subsets of the 38-gene list.

Fig. 8A depicts the lung cancer percentage mortality (equivalent to 100% minus survival) as a function of the risk score of the claimed signature and shows such prognosis at 1-, 2-, 3-, 4- and 5-year follow-up time. Fig. 8B depicts the lung cancer percentage mortality (equivalent to 100% minus survival) as a function of the risk score of the claimed signature and shows the 5-year mortality as a function of risk score, stratified by tumor stage 1 and 2.

Fig. 9 depicts a Kaplan-Meier survival analysis curve showing the survival performance in a publicly available dataset from M. D. Anderson Cancer Center, which was obtained from FFPE material.

Fig. 10A-B depict pairwise scatter plots comparing FF/Affymetrix vs FFPE/qNPA, as well as FF/Affymetrix vs FFPE/nanoString.
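For orientation, the Kaplan-Meier curves referred to in the figures above are product-limit estimates of survival. The following is a minimal sketch of that estimator, not the code used to produce the figures; the function name `kaplan_meier` and the toy follow-up data are hypothetical.

```python
def kaplan_meier(times, events):
    """Product-limit estimate of survival.

    times  -- follow-up time for each subject
    events -- 1 if the subject died at that time, 0 if censored
    Returns a list of (time, survival) pairs, one per distinct death time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        leaving = 0
        # Gather all subjects whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            leaving += 1
            i += 1
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        # Deaths and censored subjects both leave the risk set after t.
        at_risk -= leaving
    return curve


if __name__ == "__main__":
    # Toy data: four subjects, the second one censored at year 2.
    print(kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1]))
```

Censored subjects reduce the number at risk without lowering the curve, which is why the estimate drops only at observed death times.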

DETAILED DESCRIPTION OF THE INVENTION

The present invention is based, in part, on methods which can be used for the prognosis or classification of individuals having early stage lung cancer. The invention further includes identifying those patients who are at high risk for disease recurrence and for whom adjuvant therapy might be recommended, as well as patients with a low recurrence risk, who might not benefit from adjuvant therapy. In one example, the prognosis and prediction methods described herein are based upon the differential expression of a plurality of biomarkers in a lung cancer test sample. The biomarkers of the invention can include 38 genes (CBX7, TMPRSS2, GPR116, KCNJ15, PTPN13, CTSH, PPFIBP2, CD302, SFTPB, HSD17B6, DLC1, ADRB2, PARM1, KLRB1, MS4A1, STX1A, KLK6, SLC16A3, PYGL, LDHA, ITGA5, VEGFC, EEF1A2, TPX2, UCK2, PHKA1, EIF4A3, TK1, CCNA2, GGH, CCNB1, MELK, HMMR, EIF2S1, TEAD4, HMGA1, RIMS2, H2AFZ), or a combination thereof, which can be broken up into three modules (Table 1, 2, and 3) based on criteria including biological function. Table 1 (also referred to herein as Module 1) includes genes involved in tumor suppression, Table 2 (also referred to herein as Module 2) includes genes involved in angiogenesis, and Table 3 (also referred to herein as Module 3) includes genes involved in proliferation.

It was discovered that some biomarkers are over-expressed in early stage lung cancer such as those markers involved in angiogenesis or proliferation (Table 2 and Table 3, respectively), whereas other biomarkers involved in tumor suppression are under-expressed (Table 1) as compared to a control (e.g., the average expression of these genes in patients with early stage lung cancer (stage I and II)).
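The three-module organization described above can be written down as a simple lookup. This is a minimal sketch: the gene symbols are those of the 38-gene list, but the assignment of each gene to a module is an assumption inferred from the order of that list and from the module sizes implied by the text (fifteen, eight and fifteen genes). The dictionary keys and the helper `module_of` are illustrative names only.

```python
# Assumed split of the 38-gene list into the three modules, following the
# order of the list. The tables themselves are the authoritative source.
MODULES = {
    "module_1_tumor_suppression": [
        "CBX7", "TMPRSS2", "GPR116", "KCNJ15", "PTPN13", "CTSH", "PPFIBP2",
        "CD302", "SFTPB", "HSD17B6", "DLC1", "ADRB2", "PARM1", "KLRB1",
        "MS4A1",
    ],
    "module_2_angiogenesis": [
        "STX1A", "KLK6", "SLC16A3", "PYGL", "LDHA", "ITGA5", "VEGFC",
        "EEF1A2",
    ],
    "module_3_proliferation": [
        "TPX2", "UCK2", "PHKA1", "EIF4A3", "TK1", "CCNA2", "GGH", "CCNB1",
        "MELK", "HMMR", "EIF2S1", "TEAD4", "HMGA1", "RIMS2", "H2AFZ",
    ],
}


def module_of(gene: str) -> str:
    """Return the name of the module that contains `gene`."""
    for name, genes in MODULES.items():
        if gene in genes:
            return name
    raise KeyError(f"{gene} is not in the 38-gene list")


if __name__ == "__main__":
    print(sum(len(genes) for genes in MODULES.values()))  # total gene count
    print(module_of("STX1A"))
```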

Biomarker

The biomarker(s) of the invention includes one or more biomarkers listed in Table 1, Table 2, and/or Table 3, or their gene products. The present invention is based on the finding that the biomarkers listed in Table 1, Table 2, and/or Table 3 are differentially expressed. By analyzing the expression profile levels of one or more biomarkers identified in Table 1, Table 2, and/or Table 3 it is possible to determine the prognosis of an individual with early stage lung cancer.

In one example, the method of the invention includes measuring one or more biomarkers from Table 1. For example, the method of the invention measures at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen or at least fifteen, biomarkers from Table 1. In one example, the level of expression of one gene CBX7 from Table 1 is measured. In another example, the level of expression of two biomarkers CBX7 and TMPRSS2 from Table 1 are measured. In yet another example, the level of expression of three biomarkers CBX7, TMPRSS2 and GPR116 from Table 1 are measured. In yet another example, the level of expression of four biomarkers CBX7, TMPRSS2, GPR116 and KCNJ15 from Table 1 are measured. In yet another example, the level of expression of five biomarkers CBX7, TMPRSS2, GPR116, KCNJ15 and PTPN13 from Table 1 are measured.

Table 1

In another example, the method of the invention includes measuring one or more biomarkers from Table 2. For example, the method of the invention measures the expression of at least one, at least two, at least three, at least four, at least five, at least six, at least seven or at least eight biomarkers from Table 2. In one example, the level of expression of one gene STX1A from Table 2 is measured. In one example, the level of expression of two biomarkers STX1A and KLK6 from Table 2 are measured. In another example, the level of expression of three biomarkers STX1A, KLK6 and SLC16A3 from Table 2 are measured. In another example, the level of expression of four biomarkers STX1A, KLK6, SLC16A3 and PYGL from Table 2 are measured. In yet another example, the level of expression of five biomarkers STX1A, KLK6, SLC16A3, PYGL and LDHA from Table 2 are measured.

Table 2

In another example, the method of the invention includes measuring one or more biomarkers from Table 3. For example, the method of the invention measures at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten, at least eleven, at least twelve, at least thirteen, at least fourteen or at least fifteen, biomarkers from Table 3. In one example, the level of expression of one gene TPX2 from Table 3 is measured. In another example, the level of expression of two biomarkers TPX2 and UCK2 from Table 3 are measured. In another example, the level of expression of three biomarkers TPX2, UCK2 and PHKA1 from Table 3 are measured. In another example, the level of expression of four biomarkers TPX2, UCK2, PHKA1 and EIF4A3 from Table 3 are measured. In yet another example, the level of expression of five biomarkers TPX2, UCK2, PHKA1, EIF4A3 and TK1 from Table 3 are measured.

Table 3

The biomarkers of the invention can also include any combination of biomarkers identified in Table 1, Table 2 and Table 3 whose level of expression or gene product serves as a predictive marker or biomarker for prognosis of an individual with early stage lung cancer. In one example, the level of expression of one gene selected from each Table, Table 1, Table 2 and Table 3 is measured, e.g., CBX7, STX1A and TPX2. In another example, the level of expression of two biomarkers selected from each of the Tables, Table 1, Table 2 and Table 3 is measured, e.g., CBX7 and TMPRSS2 from Table 1, STX1A and KLK6 from Table 2 and TPX2 and UCK2 from Table 3. See Table 4 below for examples of various combinations of biomarkers from Tables 1, 2 and 3. The combinations shown in Table 4 are not meant to be construed as limiting and any combination of biomarkers shown in Tables 1-3 can be made.
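The nested examples given above (one gene from each table, then two from each table, and so on) can be enumerated with a short sketch. It assumes that the leading genes of each table appear in the order used in the examples and claims of this document; the names `TABLE_1`, `TABLE_2`, `TABLE_3` and `signature` are illustrative, and the output is not intended to reproduce Table 4.

```python
# Leading genes of each table, in the order they are named in the examples.
TABLE_1 = ["CBX7", "TMPRSS2", "GPR116", "KCNJ15", "PTPN13", "CTSH"]
TABLE_2 = ["STX1A", "KLK6", "SLC16A3", "PYGL", "LDHA", "ITGA5"]
TABLE_3 = ["TPX2", "UCK2", "PHKA1", "EIF4A3", "TK1", "CCNA2"]


def signature(k: int) -> list:
    """Take the first k genes from each table, giving a 3*k-gene signature."""
    if not 1 <= k <= len(TABLE_1):
        raise ValueError("k must be between 1 and 6 in this sketch")
    return TABLE_1[:k] + TABLE_2[:k] + TABLE_3[:k]


if __name__ == "__main__":
    for k in range(1, 7):
        genes = signature(k)
        print(len(genes), ", ".join(genes))
```

Any other combination across the three tables is equally possible; taking the first k genes from each table is just one convenient way to generate examples.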