Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
CANCER BIOMARKERS TO PREDICT RECURRENCE AND METASTATIC POTENTIAL
Document Type and Number:
WIPO Patent Application WO/2011/082198
Kind Code:
A2
Abstract:
Described herein are methods for predicting recurrence, progression, and metastatic potential of a prostate cancer in a subject. In certain embodiments, the methods comprise analyzing a sample from a subject for aberrant expression patters of one or more biomarkers disclosed herein. An increase or decrease in one or more biomarkers as compared to a standard indicates a recurrent, progressive, or metastatic prostate cancer.

Inventors:
MORENO CARLOS
LONG QI
BARWICK BENJAMIN G
Application Number:
PCT/US2010/062293
Publication Date:
July 07, 2011
Filing Date:
December 29, 2010
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV EMORY
MORENO CARLOS
LONG QI
BARWICK BENJAMIN G
International Classes:
C12Q1/68; C12N15/12; C40B40/08
Other References:
LAPOINTE, J. ET AL.: 'Gene expression profiling identifies clinically relevant subtypes of prostate cancer' PNAS vol. 101, no. 3, 20 January 2004, pages 811 - 816
SUN, Y. ET AL.: 'Optimizing molecular signatures for predicting prostate cancer recurrence' PROSTATE. vol. 69, no. 10, 01 July 2009, pages 1119 - 1127
NAM, R. K. ET AL.: 'Expression of TMPRSS2:ERG gene fusion in prostate cancer cells is an important prognostic factor for cancer progression' CANCER BIOLOGY AND THERAPY. vol. 6, no. 1, January 2007, pages 40 - 45
SARAMAKI, O. R. ET AL.: 'TMPRSS2:ERG fusion identifies a subgroup of prostate cancers with a favorable prognosis' CLINICAL CANCER RESEARCH. vol. 14, no. 11, 01 June 2008, pages 3395 - 3400
Attorney, Agent or Firm:
MASON, James, C. et al. (Office Of Technology Transfer1599 Clifton Road Ne, 4th Floo, Atlanta GA, US)
Download PDF:
Claims:
Emory Ref: 10048 PCT

WHAT IS CLAIMED IS:

1. A method of predicting the recurrence, progression, and metastatic potential of a cancer in a subject, the method comprising detecting in a sample from the subject one or more biomarkers selected from the group consisting of CSPG2, WNTIOB, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, LAF4, CTNNA1, XPOl, PTGDS, SOX9, RELA, EPB49, SIM2, EDNRA, RAD23B, FBP1, TNFRSF1A, CCNG2, LETMDl, NOTCH3, ETV1, BID, SIM2, ANXAl, BCL2, miR-519d, miR-647, FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMDl, RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2, TMPRSS2_ETV1 FUSION, CSPG2, WNTIOB, E2F3, CDKN2A, TYMS, miR-103, miR-339, miR-183, miR-182, miR- 136, and/or miR-221, wherein an increase or decrease in one or more of the biomarkers as compared to a standard indicating a recurrent, progressive, or metastatic cancer.

2. The method of claim 1, wherein the sample comprises prostate tumor tissue.

3. The method of claim 1, wherein the cancer comprises a TMPRSS2-ERG fusion-positive prostate cancer.

4. The method of claim 1, wherein the detecting step comprises detecting mRNA and miRNA expression level patterns of the biomarkers.

5. The method of claim 4, wherein the RNA detection comprises reverse-transcription polymerase chain reaction (RT-PCR) assay; quantitative real-time-PCR (qRT-PCR); Northern analysis; microarray analysis; or cDNA-mediated annealing, selection, extension, and ligation (DASL) assay.

6. The method of claim 1, further comprising detecting in a sample from the subject two, three, four, five , six , seven, eight or more biomarkers selected from the group consisting of CSPG2, WNTIOB, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, and LAF4. Emory Ref: 10048 PCT

7. The method of claim 1, wherein the detected biomarkers comprise two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty or more biomarkers selected from the group consisting of FOXOIA, SOX9, CLNS1A, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2, TMPRS S2 ETV 1 FUSION, miR-103, miR-339, miR-183, miR-182, miR-136, and miR-221.

8. The method of claim 1, wherein the detected biomarkers are selected from the group consisting of miR-519d and/or miR-647 and two, three, four, five, six, seven, eight, nine or more markers selected from the group consisting of RAD23B, FBP1, TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETVl, BID, SIM2, and ANXA1.

9. A method of treating a subject with cancer comprising modifying a treatment regimen of the subject based on the results of the method of any one of the claims 1-9.

10. The method of claim 9, wherein the treatment regimen is modified to be aggressive based on an increase in one or more biomarkers selected from the group consisting of CSPG2, WNT10B, E2F3, CDKN2A, and TYMS as compared to a standard, and a decrease in one or more biomarkers selected from the group consisting of TGFB3, ALOX12, CD44, and LAF4 as compared to a standard.

11. The method of claim 9, wherein the treatment regimen is further modified to be aggressive based on an increase in one or more biomarkers selected from the group consisting of CLNS1A, XPOl, LETMD1, RAD23B, TMPRS S2 ET VI FUSION, ABCC3, SPC, CHES1, FRZB, HSPG2, miR-103, miR-339, miR-183, and miR-182 as compared to a standard, and a decrease in one or more biomarkers selected from the group consisting of FOXOIA, SOX9, PTGDS, EDNRA, miR-136, and miR-221 as compared to a standard. Emory Ref: 10048 PCT

12. The method of claim 9, wherein the treatment regimen is further modified to be aggressive based on an increased expression of RAD23B, FBP1, CCNG2, LETMD1 ,

NOTCH3, ETVl, BID, SIM2, miR-519d and the decreased expression of TNFRSFIA, miR- 647, and ANXA1.

13. A method of predicting the recurrence, progression, and metastatic potential of a prostate cancer in a subject, the method comprising analyzing a sample from the subject for an aberrant expression pattern of four or more biomarkers wherein at least one of the biomarkers is a microRNA selected from miR-519d, miR-647, miR-103, miR-339, miR-183, and miR-182 miR- 136, and/ or miR-221.

14. A method of predicting the recurrence, progression, and metastatic potential of a cancer in a subject, the method comprising detecting in a sample from the subject an increase in miR- 519d.

15. A method of predicting the recurrence, progression, and metastatic potential of a cancer in a subject, the method comprising detecting in a sample from the subject a decrease in miR- 647.

Description:
CANCER BIOMARKERS TO PREDICT RECURRENCE AND

METASTATIC POTENTIAL

This application claims priority to U.S. provisional application number 61/291,681 filed December 31 st , 2009 and U.S. provisional application number 61/329,387 filed April 29 th , 2010 both hereby incorporated by reference.

BACKGROUND

Prostate cancer is the most commonly diagnosed noncutaneous neoplasm and second most common cause of cancer-related mortality in Western men. One of the important challenges in current prostate cancer research is to develop effective methods to determine whether a patient is likely to progress to the aggressive, metastatic disease in order to aid clinicians in deciding the appropriate course of treatment.

Various approaches using clinical parameters including prostate specific antigen (PSA) levels at time of initial diagnosis have been explored to predict disease progression. Although these models work well for men with extreme levels of PSA, the majority of men fall within an intermediate range characterized by a PSA level between 4-10 ng/ml and a Gleason score of 6 or 7. Current prognostic models of prostate cancer, including PSA, Gleason score and clinical stage fail to accurately predict disease progression, especially for men with intermediate disease. Thus there is a need for additional tests to complement and improve upon these existing approaches.

Technologies have been developed to exploit formalin- fixed paraffin-embedded (FFPE) tumor tissue samples for gene expression analysis. The DASL (cDNA-mediated Annealing, Selection, extension and Ligation) assay is a unique expression profiling platform based upon massively multiplexed RT-PCR applied in a microarray format allowing for the determination of expression of RNA isolated from FFPE tumor tissue samples in a high throughput format. See Bibikova et al., Am J Pathol 2004, 165: 1799-1807 and Fan et al. Genome Res 2004, 14:878-885. The DASL assay has been used to identify a 16-gene set that correlates with prostate cancer relapse. Bibikova et al., Genomics 2007, 89:666-672. However, diagnosis of the progression for prostate cancer using molecular biomarkers is challenging because Emory Ref: 10048 PCT molecular expression may be limited by sampling at time of initial diagnosis, may not be present at time of initial diagnosis, or may occur as the disease progresses. See Sboner et al., BMC Med Genomics 2010, 3:8 and Nakagawa et al, PLoS ONE 2008, 3:e2318.

SUMMARY

Provided are methods of predicting the recurrence, progression, and metastatic potential of a cancer in a subject, typically prostate cancer. The methods comprise selecting a subject at risk of recurrence, progression, or metastasis of cancer, detecting in a sample from the subject one or more biomarkers selected from the group consisting of CSPG2, WNT10B, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, LAF4, CTNNAl, XPOl, PTGDS, SOX9, RELA, EPB49, SIM2, EDNRA, RAD23B, FBP1, TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, BID, SIM2, ANXA1, BCL2, miR-519d, miR-647, FOXOIA, SOX9, CLNS1A, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2,

TMPRSS2_ETV1 FUSION, CSPG2, WNT10B, E2F3, CDKN2A, TYMS, miR-103, miR-339, miR-183, miR-182, miR-136, and/or miR-221 to create a biomarker profile, analyzing the biomarker, and correlating an aberrant expression pattern to a heightened potential for recurrence, progression or metastasis of cancer.

In another embodiment, the biomarkers are selected from one or more of CTNNAl, XPOl, PTGDS, SOX9, RELA, EPB49, SIM2, and EDNRA. In certain embodiments, the panel includes at least two of the biomarkers, and typically includes at least three or at least four, or at least five, or at least six, or at least seven or at least eight biomarkers and includes at least one biomarker selected from CTNNAl, XPOl, PTGDS, SOX9, RELA, EPB49, SIM2, and

EDNRA. In another embodiment, the biomarkers are selected from one or more, or two or more, or three or more, or four or more, or five or more, or six or more, or seven or more, or eight or more of RAD23B, FBP1, TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, BID, SIM2, and ANXA1, and miR-519d, and/or miR-647. Typically, one analyzes a sample from a subject for the presence of mRNA of one or more protein-coding genes RAD23B, FBP1, TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, BID, SIM2, ANXA1 and one or both microRNA of miR-519d and/or miR-647.

In certain embodiments, the panel includes at least two of the biomarkers, and typically includes at least three or at least four, or at least five, or at least six, or at least seven, or at least Emory Ref: 10048 PCT eight, or at least nine biomarkers and includes at least one biomarker selected from RAD23B, FBP1, TNFRSFIA, CCNG2, LETMD1, NOTCH3, ETVl, BID, SIM2, and ANXA1, and miR- 519d, and/or miR-647.

An increase or decrease in one or more of the biomarkers as compared to a standard indicates a prostate cancer that is prone to recur, progress, and/or metastasize. Optionally, the methods further comprise detecting one or more biomarkers selected from the group consisting of FOXOIA, SOX9, CLNS1A, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2, TMPRS S2 ETV 1 FUSION, miR-103, miR-339, miR-183, miR-182, miR-136, and miR-221.

Also provided are methods of treating a subject diagnosed with prostate cancer comprising modifying the treatment regimen of the subject based on the results of the method of predicting the recurrence, progression, and/or metastatic potential of a prostate cancer in a subject. The treatment regimen is modified to be aggressive based on an increase in one or more biomarkers selected from the group consisting of CSPG2, WNTIOB, E2F3, CDKN2A, and TYMS as compared to a standard, and a decrease in one or more biomarkers selected from the group consisting of TGFB3, ALOX12, CD44 and LAF4 as compared to a standard. The treatment regimen is further modified to be aggressive based on an increase in one or more biomarkers selected from the group consisting of CLNS1A, XPOl, LETMD1, RAD23B, TMPRS S2 ETV 1 FUSION, ABCC3, SPC, CHES1, FRZB, HSPG2, miR-103, miR-339, miR- 183, and miR-182 as compared to a standard, and a decrease in one or more biomarkers selected from the group consisting of FOXOIA, SOX9, PTGDS, EDNRA, miR-136, and miR- 221 as compared to a standard. The treatment regimen is further modified to be aggressive based on an increased expression of RAD23B, FBP1, CCNG2, LETMD1, NOTCH3, ETVl, BID, SIM2, miR-519d and the decreased expression of TNFRSFIA, miR-647, and ANXA1.

Also provided are kits comprising one or more primers to detect expression of biomarkers selected from the group consisting of CSPG2, WNTIOB, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, and LAF4. The kits can further comprise one or more primers to detect expression of biomarkers selected from the group consisting of FOXOIA, SOX9, CLNS1A, PTGDS, XPOl, LETMD1 , RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2, TMPRS S2 ETV 1 FUSION, miR-103, miR-339, miR-183, miR-182, miR-136, and miR-221. The kits can further comprise one or more primers to detect expression of Emory Ref: 10048 PCT biomarkers selected from the group consisting of RAD23B, FBP1, TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, BID, SIM2, ANXA1, miR-519d, and miR-647.

In certain embodiments, the disclosure relates to methods of predicting the recurrence, progression, and metastatic potential of a prostate cancer in a subject, the method comprising analyzing a sample from the subject for an aberrant expression pattern of four, five, six, seven, eight, nine or more biomarkers wherein at least one of the biomarkers is a microRNA. In certain embodiments, the mircoRNA is miR-519d, miR-647, miR-103, miR-339, miR-183, miR-182, miR-136, and/or miR-221.

In some embodiments, the panel includes at least two of the biomarkers, and typically includes at least three or at least four, or at least five, or at least six, or at least seven or at least eight biomarkers and correlates expression levels to the recurrence, progression, and potential of prostate cancer. Typically, one analyzes a sample from a subject for the presence of mRNA of one or more protein-coding genes and one or more miRNA. Typically, the subject previously had a partial or total prostate removal by surgery including portions of the prostate that contained cancerous cells.

In certain embodiments, the disclosure relates to analyzing biomarkers disclosed herein and correlating aberrant expression patterns to a likelihood of prostate cancer recurrence.

Typically, analyzing comprises detecting mRNA or detecting protein levels directly such as, but not limited to, moving the samples through a separation medium and exposing fractions to antibodies with epitopes to certain sequences on the proteins, or identifying the biomarker using mass spectroscopy. Typically the mRNA or microRNA (miRNA) may be detected by amplification using primers and hybridization to a suitably labeled complimentary nucleic acid probe. Typically, the label is a fluorescent dye conjugated to the nucleic acid probe.

DESCRIPTION OF DRAWINGS

Figure 1 shows data on the time to recurrence survival analysis of Prostate cancer patients. (A) Kaplan-Meier analysis of the training set of 61 patients with complete clinical data that were separated based on the expression of RAD23B, FBP1, TNFRSF1A, LETMD1, NOTCH3, ETV1, BID, SIM2, ANXA1, and BCL2. (B) Kaplan-Meier analysis on the 35 validation cases with complete clinical data using this mRNA panel. (C) Kaplan-Meier analysis of the training set using the combined mRNA and miRNA panel of RAD23B, FBP 1 , Emory Ref: 10048 PCT

TNFRSF1A, CCNG2, hsa-miR-647, LETMD1, NOTCH3, ETV1, hsa-miR-519d, BID, SIM2, and ANXA1. (D) Kaplan-Meier analysis of the validation set using the combined mRNA and miRNA panel.

Figures 2A-2C show characteristics of prostate cancer patients with and without TMPRSS2-ERG fusion. Figure 2 A shows a graph demonstrating that patients with TMPRSS2- ERG fusion positive tumors experienced a higher rate of biochemical recurrence opposed to those that did not have the gene fusion (log rank p-value = 3.54 x 10 " ). Figure 2B shows a graph demonstrating that ERG expression was upregulated in TMPRSS2-ERG fusion positive tumors by 3.07-fold (p = 3.48 x 10 "11 , Student's t-test). Figure 2C shows a graph confirming the microarray results presented in Figure 2B with an RT-PCR assay. The RT-PCR assay confirmed increased ERG expression in TMPRSS2-ERG fusion positive tumors (p = 8.13 x 10- 10, Student's two-sided t-test).

Figure 3 shows validated genes differentially expressed in TMPRSS2-ERG fusion positive tumors. Significance testing of genes differentially regulated in TMPRSS2-ERG fusion positive prostate tumors in the Toronto cohort of a 139 patients characterized on 502 genes (solid black line) was validated in a Swedish cohort of 455 patients characterized for 6,144 genes (dashed black line). Nine genes upregulated with TMPRSS2-ERG fusion in both cohorts are shown on top, while six genes downregulated in both cohorts are shown on the bottom

Figures 4A-4D show permutation testing of genes associated with TMPRSS2-ERG fusion. To determine significant differentially regulated genes associated with TMPRSS2-ERG fusion, 1,000 permutations of random class assignment estimated genes were performed with a false discovery rate (FDR) less than 5%. Figures 4 A and 4B show Q-q plots of the Toronto cohort of 139 patients (Figure 4A) and of the Swedish cohorts of 455 patients (Figure 4B). In both cases ERG was distinctly the most overrepresented gene in TMPRSS2-ERG fusion positive tumors as depicted by box plots of ERG expression intensities for the Toronto (Figure 4C) and Swedish cohorts (Figure 4D).

Figure 5 shows common genes prognostic of biochemical recurrence. Univariate Cox proportional hazards regression determined genes associated with biochemical recurrence in the Toronto cohort of 139 patients and a Minnesota cohort of 596 patients. Seven genes were Emory Ref: 10048 PCT identified in common; five genes were associated with recurrence, and two genes were associated with non-recurrence.

Figures 6A and 6B show Kaplan-Meier survival analysis of the Toronto cohort. Figure

6A shows a Kaplan-Meier plot demonstrating the seven-gene expression recurrence score used to segregate patients into good and poor prognostic categories. (p=0.000167) Figure 6B shows a Kaplan-Meier plot demonstrating that a mixed clinical model composed of Gleason score,

TMPRSS2-ERG fusion status, and the seven-gene expression recurrence score is better able to prognosticate recurrence (p=4.15 x 10 "7 ).

Figure 7 shows data using Kaplan-Meier survival analysis. (A) Kaplan-Meier analysis of the training set of 42 Gleason 7 cases with complete clinical data using the mRNA panel of

RAD23B, FBP1, TNFRSF1A, LETMD1, NOTCH3, ETV1, BID, SIM2, ANXA1, and BCL2.

(B) Kaplan-Meier analysis of the 19 Gleason 7 cases in the validation set using the mRNA panel. (C) Kaplan-Meier analysis of the Gleason 7 cases in the training set using the combined mRNA and miRNA panel or RAD23B, FBP1, TNFRSF1A, CCNG2, hsa-miR-647, LETMD1, NOTCH3, ETV1, hsa-miR-519d, BID, SIM2, and ANXA1. (D) Kaplan-Meier analysis of the

Gleason 7 cases in the validation set using the combined mRNA and miRNA panel.

DETAILED DESCRIPTION

Described herein are methods for predicting the recurrence, progression, and/or metastatic potential of a cancer in a subject. The methods comprise selecting a subject at risk of recurrence, progression, or metastasis of prostate cancer, and detecting in a sample from a subject one or more biomarkers selected from the group consisting of CSPG2, WNT10B, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, LAF4, CTNNAl, XPOl, PTGDS, SOX9, RELA, EPB49, SIM2, EDNRA, RAD23B, FBP1, TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, BID, SIM2, ANXA1, miR-519d, miR-647, FOXOIA, SOX9, CLNS1A, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2, TMPRS S2 ET V 1 FUSION, CSPG2, WNT10B, E2F3, CDKN2A, TYMS, miR-103, miR-339, miR-183, miR- 182, miR-136, and miR-221 to create a biomarker profile. It is understood that detection of biomarker may be by detection of the gene, mRNA, translated protein, microRNA or other indicator that suggests gene expression. Emory Ref: 10048 PCT

In certain embodiments, one analyzes a sample from the subject for aberrant expression of RAD23B, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, BID, SIM2, ANXAl, miR-519d, miR-647, and correlating such expression to a likelihood of recurrence, progression, or metastasis of prostate cancer. In certain embodiments, the aberrant expression is increased expression of RAD23B, FBPl, CCNG2, LETMDl, NOTCH3, ETVl, BID, SIM2, miR-519d and the decreased expression of TNFRSFIA, miR-647, and ANXAl .

An increase or decrease in one or more of the biomarkers as compared to a standard indicates a prostate cancer that is prone to recur, progress, and/or metastasize. Optionally, the sample comprises prostate tumor tissue. Optionally, the prostate cancer comprises a

TMPRSS2-ERG fusion-positive prostate cancer.

Optionally, the detected biomarkers comprise two or more, three or more, four or more, five or more, six or more, seven or more, eight or more biomarkers selected from the group consisting of RAD23B, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, BID, SIM2, ANXAl, combination with miR-519d and/or miR-647. For example, the detected biomarkers can comprise detecting miR-519 and/or miR-647 in combination with RAD23, FBPl, CCNG2, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2,

LETMDl, NOTCH3, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, SIM2, and ANXAl ; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, BID, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, BID, and SIM2; or TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, CCNG2, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, ETVl, BID, SIM2, and ANXA; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, and BID; or FBPl, CCNG2, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or Emory Ref: 10048 PCT

FBPl, TNFRSFIA, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or FBPl,

TNFRSFIA, CCNG2, NOTCH3, ETVl, BID, SIM2, and ANXAl; or FBPl, TNFRSFIA, CCNG2, LETMDl, ETVl, BID, SIM2, and ANXAl; or FBPl, TNFRSFIA, CCNG2,

LETMDl, NOTCH3, BID, SIM2, and ANXAl; or FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl , SIM2, ANXAl; or FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, BID, and ANXAl; or FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, BID, and SIM2; or RAD23, TNFRSFIA, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23TNFRSF1A, CCNG2 , NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, TNFRSFIA, CCNG2, LETMDl, ETVl, BID, SIM2, and ANXAl; or RAD23, TNFRSFIA, CCNG2, LETMDl, NOTCH3, BID, SIM2, and ANXAl; or RAD23, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, SIM2, and ANXAl; or RAD23, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, BID, and ANXAl; or RAD23, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, BID, and SIM2; or RAD23, FBPl, CCNG2, NOTCH3, ETVl, BID, SIM2, and

ANXAl; or RAD23, FBPl, CCNG2, LETMDl, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, CCNG2, LETMDl, NOTCH3, BID, SIM2, and ANXAl; or RAD23, FBPl, CCNG2, LETMDl, NOTCH3, ETVl, SIM2, and ANXAl; or RAD23, FBPl, CCNG2, LETMDl, NOTCH3, ETVl, BID, and SIM2; or RAD23, FBPl, TNFRSFIA, LETMDl, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, LETMDl, NOTCH3, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, LETMDl, NOTCH3, ETVl, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, LETMDl, NOTCH3, ETVl, BID, and ANXAl; or RAD23, FBPl, TNFRSFIA, LETMDl, NOTCH3, ETVl, BID, and SIM2; or RAD23, FBPl,

TNFRSFIA, CCNG2, NOTCH3, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, NOTCH3, ETVl, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, NOTCH3, ETVl, BID, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, NOTCH3, ETVl, BID, and SIM2; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, ETVl, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, ETVl, BID, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, ETVl, BID, and SIM2; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, BID, and ANXAl; or RAD23, FBPl,

TNFRSFIA, CCNG2, LETMDl, NOTCH3, BID, and SIM2; or CCNG2, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, Emory Ref: 10048 PCT

TNFRSFIA, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, and ETVl; or FBPl, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or FBPl, TNFRSFIA, NOTCH3, ETVl, BID, SIM2, and ANXAl; or FBPl, TNFRSFIA, CCNG2, LETMDl, BID, SIM2, and ANXAl; or FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, SIM2, and ANXAl; or FBPl, TNFRSFIA, CCNG2, LETMDl,

NOTCH3, ETVl, and ANXAl; or FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, and BID; or RAD23, TNFRSFIA, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, TNFRSFIA, CCNG2, ETVl, BID, SIM2, and ANXAl; or RAD23, TNFRSFIA, CCNG2, LETMDl, NOTCH3, SIM2, and ANXAl; or RAD23, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, and ANXAl; or RAD23, TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, and BID; or RAD23, FBPl, CCNG2, LETMDl, NOTCH3, SIM2, and ANXAl; or RAD23, FBPl, CCNG2, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, CCNG2,

LETMDl, NOTCH3, ETVl, and ANXAl; or RAD23, FBPl, CCNG2, LETMDl, NOTCH3, ETVl, and BID; or TNFRSFIA, LETMDl, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, LETMDl, NOTCH3, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, LETMDl, NOTCH3, ETVl, and ANXAl; or RAD23, FBPl, TNFRSFIA, LETMDl, NOTCH3, ETVl, and BID; or TNFRSFIA, CCNG2, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, CCNG2, NOTCH3, ETVl, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, NOTCH3, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, NOTCH3, ETVl, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, NOTCH3, ETVl, and BID; or TNFRSFIA, CCNG2, LETMDl, ETVl , BID, SIM2, and ANXAl; or RAD23, CCNG2, LETMDl, ETVl , BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, ETVl, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, ETVl, and BID; or TNFRSFIA, CCNG2, LETMDl, NOTCH3, BID, SIM2, and ANXAl; or RAD23, CCNG2, LETMDl, NOTCH3, BID, SIM2, and ANXAl; or RAD23, FBPl,

LETMDl, NOTCH3, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, NOTCH3, BID, SIM2, and ANXAl; or RAD23, FBPl, TNFRSFIA, CCNG2, LETMDl, NOTCH3, and BID; or TNFRSFIA, CCNG2, LETMDl, NOTCH3, ETVl, SIM2, and ANXAl; or RAD23, Emory Ref: 10048 PCT

CCNG2, LETMD1, NOTCH3, ETV1, SIM2, and ANXA1; or RAD23, FBP1, LETMD1, NOTCH3, ETV1, SIM2, and ANXA1 ; or RAD23, FBP1, TNFRSF1A, NOTCH3, ETV1, SIM2, and ANXA1; or RAD23, FBP1, TNFRSF1A, CCNG2, ETV1, SIM2, and ANXA1; or RAD23, FBP1, TNFRSF1A, CCNG2, LETMD1, ETV1, and SIM2; or TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, BID, and ANXA1; or RAD23, CCNG2, LETMD1, NOTCH3, ETV1, BID, and ANXA1; or RAD23, FBP1, LETMD1, NOTCH3, ETV1, BID, and ANXA1; or RAD23, FBP1, TNFRSF1A, NOTCH3, ETV1, BID, and ANXA1; or RAD23, FBP1, TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, BID, and ANXA1; or RAD23, FBP1, TNFRSF1A, CCNG2, ETV1, BID, SIM2, and ANXA1; or RAD23, FBP1, TNFRSF1A, CCNG2, LETMD1, BID, and ANXA1; or RAD23, FBP1, TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, and BID; or TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, BID, and SIM2; or RAD23, CCNG2, LETMD1, NOTCH3, ETV1, BID, and SIM2; or RAD23, FBP1, LETMD1, NOTCH3, ETV1, BID, and SIM2; or RAD23, FBP1, TNFRSF1A, NOTCH3, ETV1, BID, and SIM2; or RAD23, FBP1, TNFRSF1A, CCNG2, ETV1, BID, SIM2; or RAD23, FBP1, TNFRSF1A, CCNG2, LETMD1, BID, and SIM2.

Optionally, multiple biomarkers are detected. Detection can comprise identifying an RNA expression pattern. An increase in one or more of the biomarkers selected from the group consisting of CSPG2, WNT10B, E2F3, CDKN2A, and TYMS as compared to a standard indicates a prostate cancer that is prone to recur, progress, and/or metastasize, whereas a decrease indicates a prostate cancer that is unlikely to recur and is slow to progress and/or metastasize. A decrease in one or more of the biomarkers selected from the group consisting of TGFB3, ALOX12, CD44, and LAF4 as compared to a standard indicates a prostate cancer that is prone to recur, progress, and/or metastasize, whereas an increase indicates a prostate cancer that is unlikely to recur and is slow to progress and/or metastasize. Optionally, the detected biomarkers comprise two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, or all nine biomarkers selected from the group consisting of CSPG2, WNT10B, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, and LAF4. For example, the detected biomarkers can comprise CSPG2 and E2F3. For example, the detected biomarkers can comprise CDKN2A, TGFB3, and LAF4. For example, the detected biomarkers can comprise WNT10B, E2F3, ALOX12, and CD44. For example, the detected biomarkers can comprise CSPG2, CDKN2A, TYMS, TGFB3, and LAF4. For example, the detected Emory Ref: 10048 PCT biomarkers can comprise CSPG2, WNT10B, E2F3, TYMS, ALOX12, and CD44. For example, the detected biomarkers can comprise CSPG2, WNT10B, E2F3, CDKN2A, TYMS, CD44, and LAF4. For example, the detected biomarkers can comprise WNT10B, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, and LAF4. Optionally, the detected biomarkers comprise biomarkers from the group consisting of CSPG2, WNT10B, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, and LAF4.

Optionally, the methods further comprise detecting in a sample from the subject one or more biomarkers selected from the group consisting of FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2,

TMPPvS S2_ET V 1 FUSION, miR-103, miR-339, miR-183, miR-182, miR-136, and miR-221.

Optionally, multiple biomarkers are detected. Detection can comprise identifying an RNA expression pattern. An increase in one or more biomarkers selected from the group consisting of CLNSIA, XPOl, LETMD1, RAD23B, TMPRS S2 ET V 1 FUSION, ABCC3, APC, CHES1, FRZB, HSPG2, miR-103, miR-339, miR-183, and miR-182 as compared to a control indicates a prostate cancer that is prone to recur, progress, and/or metastasize, whereas a decrease indicates the opposite. A decrease in one or more biomarkers selected from the group consisting of FOXOIA, SOX9, EDNRA, PTGDS, miR-136, and miR-221 as compared to a standard indicates a prostate cancer that is prone to recur, progress, and/or metastasize, whereas an increase indicates the opposites. Optionally, the detected biomarkers comprise two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten or more, eleven or more, twelve or more, thirteen or more, fourteen or more, fifteen or more, sixteen or more, seventeen or more, eighteen or more, nineteen or more, or all twenty biomarkers selected from the group consisting of FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2, TMPRSS2 ETV1 FUSION, miR-103, miR-339, miR-183, miR-182, miR-136, and miR-221. For example, the detected biomarkers can comprise FOXOIA and SOX9. For example, the detected biomarkers can comprise SOX9, CLNSIA, and miR-136. For example, the detected biomarkers can comprise FOXOIA, PTGDS, XPOl, and RAD23B. For example, the detected biomarkers can comprise CLNSIA, LETMD1, FRZB, miR-136, and miR-182. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, miR-339, and miR-183. For example, the selected biomarkers can comprise FOXOIA, CLNSIA, PTGDS, XPOl , FRZB, Emory Ref: 10048 PCT miR-182, and miR-183. For example, the selected biomarkers can comprise FOXOIA, CLNSIA, PTGDS, XPOl, LETMD1, miR-103, miR-339, and miR-183. For example, the selected biomarkers can comprise SOX9, CLNSIA, PTGDS, XPOl, LETMD1, RAD23B, TMPRS S2 ETV 1 FUSION, miR-103, miR-339, and miR-182. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, XPOl, RAD23B, ABCC3, EDNRA, FRZB, TMPRS S2 ETV1 FUSION, and miR-339. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, miR-339, miR-183, miR-182, miR-136, and miR-221. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, and FRZB. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1,

RAD23B, ABCC3, APC, EDNRA, HSPG2, and TMPRS S2 ET VI FUSION. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDRNA, FRZB, and HSPG2. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDRNA, FRZB, HSPG2, and TMPRS S2 ETV 1 FUSION. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDRNA, FRZB, HSPG2, TMPRS S2 ETV 1 FUSION, and miR-221. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1 , RAD23B, ABCC3, APC, CHES1, EDRNA, FRZB, HSPG2, TMPRS S2 ETV 1 FUSION, miR-103, and miR-339. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDRNA, FRZB, HSPG2, TMPRSS2 ETV1 FUSION, miR-103, miR- 339, and miR-183. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1 , RAD23B, ABCC3, APC, CHES1, EDRNA, FRZB, HSPG2, TMPRS S2 ETV 1 FUSION, miR-103, miR-339, miR-183, and miR-182. For example, the selected biomarkers can comprise FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDRNA, FRZB, HSPG2, TMPRS S2 ETV 1 FUSION, miR-339, miR-183, miR-182, miR-136, and miR-221. Optionally, the selected biomarkers comprise biomarkers selected from the group consisting of FOXOIA, SOX9, CLNSIA, PTGDS, XPOl, LETMD1 , RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, Emory Ref: 10048 PCT

HSPG2, TMPPvS S2_ET V 1 FUSION, miR-103, miR-339, miR-183, miR-182, miR-136, and miR-221.

Optionally, the detecting step comprises detecting mRNA levels of the biomarker. The mRNA detection can, for example, comprise reverse-transcription polymerase chain reaction (RT-PCR), quantitative real-time PCR (qRT-PCR), Northern analysis, microarray analysis, and cDNA-mediated annealing, selection, extension, and ligation (DASL) assay (Illumina, Inc.; San Diego, CA). Preferably, the RNA detection comprises the cDNA-mediated annealing, selection, extension, and ligation (DASL) assay (Illumina, Inc.). Optionally, the detecting step comprises detecting miRNA levels of the biomarker. The miRNA detection can, for example, comprise miRNA chip analysis, Northern analysis, RNase protection assay, in situ

hybridization, miRNA expression profiling panels designed for the DASL assay (Illumina, Inc.), or a modified reverse transcription quantitative real-time polymerase chain reaction assay (qRT-PCR). Preferably the miRNA detection comprises the miRNA expression profiling panels designed for the DASL assay (Illumina, Inc.). Optionally, the detecting step comprises detecting mRNA and miRNA levels of the biomarker. The analytical techniques used to determine mRNA and miRNA expression are known. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, 3 rd Ed., Cold Spring Harbor Press, Cold Spring Harbor, NY (2001), Yin et al, Trends Biotechnol. 26:70-6 (2008); Wang and Cheng, Methods Mol. Biol. 414: 183-90 (2008); Einat, Methods Mol. Biol. 342: 139-57 (2006).

Comparing the mRNA or miRNA biomarker content with a biomarker standard includes comparing mRNA or miRNA content from the subject with the mRNA or miRNA content of a biomarker standard. Such comparisons can be comparisons of the presence, absence, relative abundance, or combination thereof of specific mRNA or miRNA molecules in the sample and the standard. Many of the analytical techniques discussed above can be used alone or in combination to provide information about the mRNA or miRNA content (including presence, absence, and/or relative abundance information) for comparison to a biomarker standard. For example, the DASL assay can be used to establish a mRNA or miRNA profile for a sample from a subject and the abundances of specific identified molecules can be compared to the abundances of the same molecules in the biomarker standard.

Optionally, the detecting step comprises detecting the protein expression levels of the protein-coding gene biomarkers. The protein-coding gene biomarkers can comprise CSPG2, Emory Ref: 10048 PCT

WNTIOB, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, LAF4, FOX01A, SOX9, CLNS1A, PTGDS, XPOl, LETMD1 , RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2, and TMPRSS2 ETV1 FUSION. The protein detection can, for example, comprise an assay selected from the group consisting of Western blot, enzyme-linked immunosorbent assay (ELISA), enzyme immunoassay (EIA), radioimmunoassay (RIA), immunohistochemistry, and protein array. The analytical techniques used to determine protein expression are known. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual, 3 rd Ed., Cold Spring Harbor Press, Cold Spring Harbor, NY (2001).

Biomarker standards can be predetermined, determined concurrently, or determined after a sample is obtained from the subject. Biomarker standards for use with the methods described herein can, for example, include data from samples from subjects without prostate cancer, data from samples from subjects with prostate cancer that is not a progressive, recurrent, and/or metastatic prostate cancer, and data from samples from subjects with prostate cancer that is a progressive, recurrent, and/or metastatic prostate cancer. Comparisons can be made to multiple biomarker standards. The standards can be run in the same assay or can be known standards from a previous assay.

Also provided herein are methods of treating a subject with prostate cancer. The methods comprise modifying a treatment regimen of the subject based on the results of any of the methods of predicting the recurrence, progression, and metastatic potential of a prostate cancer in a subject. Optionally, the treatment regimen is modified to be aggressive based on an increase in one or more biomarkers selected from the group consisting of CSPG2, WNTIOB, E2F3, CDKN2A, and TYMS as compared to a standard. Optionally, the treatment regimen is modified to be aggressive based on a decrease in one or more biomarkers selected from the group consisting of TGFB3, ALOX12, CD44, and LAF4 as compared to the standard.

Optionally, the treatment regimen is modified to be aggressive based on a combination of an increase in one or more biomarkers selected from the group consisting of CSPG2, WNTIOB, E2F3, CDKN2A, and TYMS as compared to a standard, and a decrease in one or more biomarkers selected from the group consisting of TGFB3, ALOX12, CD44, and LAF4 as compared to a standard. Optionally, the treatment regimen is further modified to be aggressive based on an increase in one or more biomarkers selected from the group consisting of

CLNS1A, XPOl, LETMD1, RAD23B, TMPRS S2 ETV 1 FUSION, ABCC3, APC, CHES1, Emory Ref: 10048 PCT

FRZB, HSPG2, miR-103, miR-339, miR-183 and miR-182 as compared to a standard.

Optionally, the treatment regimen is further modified to be aggressive based on a decrease in one or more biomarkers selected from the group consisting of FOXOIA, SOX9, PTGDS, EDNRA, miR-136, and miR-221 as compared to a standard. Optionally, the treatment regimen is further modified to be aggressive based on a combination of an increase in one or more biomarkers selected from the group consisting of CLNS1A, XPOl, LETMD1, RAD23B, TMPRS S2 ETV 1 FUSION, ABCC3, APC, CHES1, FRZB, HSPG2, miR-103, miR-339, miR- 183, and miR-182 and a decrease in one or more biomarkers selected from the group consisting of FOXOIA, SOX9, PTGDS, EDNRA, miR-136, and miR-221 as compared to a standard.

In certain embodiments, the treatment regimen is further modified to be aggressive based on an aberrant pattern of expression when analyzing miR-519d and/or miR-647 and four, five, six, seven, eight or more markers selected from the group consisting of RAD23B, FBP1, TNFRSF1A, CCNG2, LETMD1, NOTCH3, ETV1, BID, SIM2, and ANXA1.

Also provided are kits comprising primers to detect the expression of one or more biomarkers selected from the group consisting of CSPG2, WNTIOB, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, and LAF4. Optionally, the kits further comprise primers to detect the expression of one or more biomarkers selected from the group consisting of FOXOIA, SOX9, CLNS1A, PTGDS, XPOl, LETMD1, RAD23B, and TMPRSS2 ETV1, and primers to detect the expression of one or more biomarkers selected from the group consisting of miR- 103, miR-339, miR-183, miR-182, miR-136, and miR-221. Optionally, directions to use the primers provided in the kit to predict the progression and metastatic potential of prostate cancer in a subject, materials needed to obtain RNA in a sample from a subject, containers for the primers, or reaction vessels are included in the kit.

Also provided are arrays consisting of probes to one or more of the biomarkers selected from the group consisting of CSPG2, WNTIOB, E2F3, CDKN2A, TYMS, TGFB3, ALOX12, CD44, and LAF4. Optionally, the arrays further consist of probes to one or more biomarkers selected from the group consisting of FOXOIA, SOX9, CLNS1A, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2, TMPRS S2 ETV 1 FUSION, miR- 103, miR-339, miR-183, miR-182, miR-136, and miR-221.

The arrays provided herein can be a DNA microarray, an RNA microarray, a miRNA microarray, or an antibody array. Arrays are known in the art. See, e.g., Dufva, Methods Mol. Emory Ref: 10048 PCT

Biol. 529: 1-22 (2009); Plomin and Schalkwyk, Dev. Sci. 10: l): 19-23 (2007); Kopf and Zharhary, Int. J. Biochem. Cell Biol. 39(7-8): 1305-17 (2007); Haab, Curr. Opin. Biotechnol. 17(4):415-21 (2006); Thomson et al, Nat. Methods 1 :47-53 (2004).

As used herein, subject can be a vertebrate, more specifically a mammal (e.g., a human, horse, cat, dog, cow, pig, sheep, goat mouse, rabbit, rat, and guinea pig), birds, reptiles, amphibians, fish, and any other animal. The term does not denote a particular age. Thus, adult and newborn subjects are intended to be covered. As used herein, patient or subject may be used interchangeably and can refer to a subject afflicted with a disease or disorder (e.g., prostate cancer). The term patient or subject includes human and veterinary subjects.

As used herein a subject at risk for recurrence, progression, or metastasis of prostate cancer refers to a subject who currently has prostate cancer, a subject who previously has had prostate cancer, or a subject at risk of developing prostate cancer. A subject at risk of developing prostate cancer can be genetically predisposed to prostate cancer, e.g., a family history or have a mutation in a gene that causes prostate cancer. Alternatively a subject at risk of developing prostate cancer can show early signs or symptoms of prostate cancer, such as hyperplasia. A subject currently with prostate cancer has one or more of the symptoms of the disease and may have been diagnosed with prostate cancer.

As used herein, the terms treatment, treat, or treating refers to a method of reducing the effects of a disease or condition (e.g., prostate cancer) or symptom of the disease or condition. Thus, in the disclosed method, treatment can refer to a 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100% reduction in the severity of an established disease or condition or symptom of the disease or condition. For example, a method of treating a disease is considered to be a treatment if there is a 10% reduction in one or more symptoms of the disease in a subject as compared to a control. Thus, the reduction can be a 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%), 90%), 100%) or any percent reduction between 10 and 100%) as compared to native or control levels. It is understood that treatment does not necessarily refer to a cure or complete ablation of the disease, condition, or symptoms of the disease or condition.

Disclosed are materials, compositions, and components that can be used for, can be used in conjunction with, can be used in preparation for, or are products of the disclosed methods and compositions. These and other materials are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that Emory Ref: 10048 PCT while specific reference of each various individual and collective combinations and

permutations of these compounds may not be explicitly disclosed, each is specifically contemplated and described herein. For example, if a method is disclosed and discussed and a number of modifications that can be made to a number of molecules including the method are discussed, each and every combination and permutation of the method, and the modifications that are possible are specifically contemplated unless specifically indicated to the contrary. Likewise, any subset or combination of these is also specifically contemplated and disclosed. This concept applies to all aspects of this disclosure including, but not limited to, steps in methods using the disclosed compositions. Thus, if there are a variety of additional steps that can be performed, it is understood that each of these additional steps can be performed with any specific method steps or combination of method steps of the disclosed methods, and that each such combination or subset of combinations is specifically contemplated and should be considered disclosed.

Publications cited herein and the materials for which they are cited are hereby specifically incorporated by reference in their entireties.

Examples

EXAMPLE 1: IDENTIFICATION OF BIOMARKER PREDICTORS FOR THE RECURRENCE OF PROSTATE CANCER ASSOCIATED WITH TMPRSS2-ERG

GENE FUSION.

RNA Samples.

Total RNA samples from frozen prostate tumor specimens used in this study were prepared previously (Nam et al., Br. J. Cancer 97: 1690-5 (2007)). Aliquoted RNA samples were used in the cDNA-mediated annealing, selection, extension, and ligation assay (DASL assay). RNA concentration was quantified by Nanodrop spectrophotometry and quality was assessed using the Agilent Bioanalyzer (Agilent Technologies; Santa Clara, CA) for which RNA integrity number (RIN) of more than 7 was used as a quality criteria. Emory Ref: 10048 PCT

DASL Assay Performance, Reporducibility, and Data Normalization.

The DASL assay was performed on Illumina's (Illumina, Inc.; San Diego, CA) 502- gene Human Cancer Panel (HCP) using 200 nanograms (ng) of input RNA. The

manufacturer's instructions were followed without any changes. Samples were hybridized on two Universal Array Matrices (UAMs). The hybridized UAMs were scanned using the

BeadStation 500 Instrument (Illumina Inc.). The data were interpreted and quantile normalized using GenomeStudio vl .0.2 (Illumina Inc.). Experimental replicates (same RNA assayed twice) were assessed for reproducibility and subsequently averaged so as to represent each patient's tumor sample with one gene expression profile.

Data Analysis and Meta-analysis.

Differential mRNA expression of TMPRSS2-ERG T1/E4 fusion-positive versus fusion- negative tumors was assessed using significance analysis of microarrays (SAM) (Tusher et al., Proc. Natl. Acad. Sci. USA 98:5116-21 (2001)) for which 1,000 random class assignment permutations estimated a false discovery rate (FDR) less than or equal to 5%. Hierarchical clustering was generated in R using the heatmap2 package where distance was computed using a Euclidean dissimilarity metric with an average linkage clustering algorithm. Data was displayed with mRNA intensities Z-score normalized. Gene Ontology analysis was conducted using the GOstats package with a significance value of p < 0.01 of overrepresentation computed by the hypergeometric test using the lumiHumanAll.db annotation file. Univariate Cox proportional hazards regression was conducted in R using the Cox proportional hazards survival package (CoxPH) and was conducted on each gene expression profile and clinical factor independently. Multivariate Cox analysis considered clinical factors that were significant (p < 0.05) in univariate analysis as well as a recurrence predictor built as a weighted average of the expression level of genes, which were significant in univariate analysis in both the Toronto data set and that from Nakagawa et al. (Nakagawa et al., PLoS ONE 3(5):e2318 (2008)). Kaplan-Meier curves were generated in R using the survival package and significance testing utilized the survdiff function for which the log-rank test determined the p-value. Meta- Analysis utilized expression profiles from both Setlur et al. (Setlur et al., J. Natl. Cancer Inst. 100(11):815-25 (2008)) and Nakagawa et al. (Nakagawa et al, PLoS ONE 3(5):e2318 (2008)) Emory Ref: 10048 PCT studies, which were downloaded from Gene Expression Omnibus (GEO; located on the National Center for Biotechnology Information website) and had the series numbers GSE8402 and GSE 10645, respectively. The same differential, annotation, and prognostic analyses methods described above were employed on the meta-analysis sets. Results

After RNA and assay quality control, 139 patient tumors were characterized on the DASL assay for 502 cancer-related genes (GEO series GSE 18655). Seven samples were run as experimental replicates to estimate assay reproducibility for which an average Pearson R 2 of 0.965 indicated highly reproducible data (Figure 1). Moreover, unsupervised hierarchical clustering of all samples and probes resulted in experimental replicates clustering together without exception. The Toronto cohort, a subset of that previously characterized for clinical markers (Nam et al, Br. J. Cancer 97(12): 1690-5 (2007)), includes 69 patients with TMPRSS2- ERG T1/E4 fusion-positive tumors and 70 prostate tumors that were TMPRSS2-ERG fusion- negative. Fusion status indicated a significantly worse outcome with respect to biochemical recurrence (Figure 2A, p = 3.54 x 10 " log-rank test) similar to that observed in the entire cohort (Nam et al., Br. J. Cancer 97(12): 1690-5 (2007)). As previously reported, patients with TMPRSS2-ERG fusion-positive tumors had a significantly higher expression of ERG transcripts (Figure 2B, p = 3.48 x 10 "11 , Student's two-sided t-test) likely a result of androgen- responsive promoter elements in TMPRSS2 driving expression (Tomlins et al., Science 310(5748):644-8 (2005)). ERG overexpression was validated using a reverse transcription- polymerase chain reaction (RT-PCR) assay, which corroborated ERG overexpression found by microarray results (Figure 2C, p = 8.13 x 10 "10 , Student's two-sided t-test).

To investigate molecular biomarkers differentially regulated in TMPRSS2-ERG fusion- positive tumors, significance testing was conducted using SAM (Tusher et al., Proc. Natl.

Acad. Sci. USA 98(9):5116-21 (2001)) for both the Toronto cohort and that of the 455 patient Swedish cohort (Setlur et al, J. Natl. Cancer Inst. 100(11):815-25 (2008)). Using a FDR equal to or less than 5% yielded 51 genes differentially regulated in TMPRSS2-ERG fusion-positive tumors in the Toronto cohort (Table 1). Nine upregulated genes and six downregulated genes were validated by replicating the analysis on the Swedish cohort (Setlur et al., J. Natl. Cancer Inst. 100(11):815-25 (2008)), which was characterized for expression of 6,144 transcripts Emory Ref: 10048 PCT

(Figure 3, FDR < 5%). In both the Toronto and Swedish cohorts ERG was uniquely the most significant differentially regulated transcript in TMPRSS2-ERG fusion-positive tumors (Figure 4). Genes annotated for mismatch base repair and histone deacetylation functions were overrepresented in Gene Ontology analysis of common upregulated genes TMPRSS2-ERG fusion positive tumors. Downregulated genes were overrepresented for annotations that included the insulin-like growth factor and Jak-Stat signaling pathways suggesting that these pathways may be attenuated in TMPRSS2-ERG fusion-positive tumors (Table 2, p < 0.01). Hierarchical clustering of tumor expression profiles across common differentially regulated genes resulted in segregation of TMPRSS2-ERG fusion-positive tumors (Figure 3), suggesting that TMPRSS2-ERG fusion-positive tumors have a distinct molecular metabolism that is replicated in multiple cohorts.

Table 1 : Differentially regulated mRNAs with TMPRSS2-ERG fusion. Differential regulated genes associated with TMPRSS2-ERG fusion were determined using Significance Analysis of Microarrays (SAM) which permutated 1,000 assignments of class assignment to determine differential targets (Tusher et al, Proc. Natl. Acad. Sci. USA 98(9):5116-21 (2001)). mRNAs were validated in a 455 patient Swedish cohort that was characterized for expression of 6,144 transcripts (Setlur et al, J. Natl. Cancer Inst. 100(11):815-25 (2008)).

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Table 2: Gene Ontology Annotation of mRNAs associated with TMPRSS2-ERG T1/E4 fusion in Prostate Cancer. The 15 mRNAs associated with TMPRSS2-ERG T1/E4 fusion (Figure 3, Table 1 in bold) in both our Toronto-139 cohort and a Swedish 455 patient cohort (Setlur et al., J. Natl. Cancer Inst. 100(11):815-25 (2008)) were annotated for gene ontology terms. Several terms annotated from the nine upregulated mRNAs in T1/E4 fusion positive tumors were related to DNA damage & repair mechanisms and histone deacetylation. Conversly, overrepresented terms for the six mRNAs downregulated in T1/E4 positive tumors were associated with insulin-like growth factor (IGF) activity and JAK-STAT tyrosine

phosporylation signaling. P-values were calculated using a hypergeometric test in the R package GOstats.

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

To determine molecular factors associated with biochemical recurrence, defined as a PSA increase of > 0.2 ng/ml on at least two consecutive measurements that are at least 3 months apart, univariate Cox proportional hazards regression was conducted in the Toronto cohort and replicated in a 596 patient Minnesota cohort (Nakagawa et al., PLoS ONE

3(5):e2318 (2008)). The Toronto dataset yielded 16 genes associated with recurrence and 11 genes associated with non-recurrence (Table 3, p < 0.05). Repeating this analysis in the Minnesota cohort validated five genes associated with biochemical recurrence (CSPG2, WNT10B, E2F3, CDKN2A, and TYMS) and four genes associated with non-recurrence (TGFB3, ALOX12, CD44, and LAF4) (Figure 5, p < 0.05). Gene Ontology functional annotation of genes commonly associated with recurrence yielded overrepresentation of deoxyribosylthymine monophosphate (dTMP) biosynthesis, negative regulation of leukocyte activation, specifically T and B cell lymphocytes, as well as inhibition of cell-matrix adhesion. Conversely, annotation of genes associated with non-recurrence resulted in cell-matrix adhesion and collagen binding (Table 4, p < 0.01). Common genes prognostic of recurrence were used to build a recurrence score calculated as the sum product of each gene's expression intensity by its Cox coefficient determined by regression analysis. Ordering samples by the recurrence score in a supervised heatmap produced a trend whereby patients that did not have recurrence were separated from those who did in both the Toronto and Swedish cohorts. More importantly, the recurrence score was significant in univariate Cox regression and remained significant in a multivariate model considering clinical factors that were significant (p < 0.05) in the univariate analysis, namely pre-operative PSA level, Gleason score, and TMPRSS2-ERG fusion status (Table 5, Toronto cohort). Furthermore, the nine-gene expression recurrence score was significantly associated with biochemical recurrence by itself (Figure 6A, p = 0.000167) and in a multivariate model considering with Gleason score and TMPRSS2-ERG fusion status (i.e., those clinical data significant in univariate analysis; Figure 6B, p = 4.15 x 10 "7 ).

Table 3: mRNAs Associated with Biochemical Recurrence. mRNAs associated with biochemical recurrence were determined using a cox proportional hazards regression of mRNA expression. mRNAs were validated in a 596 Minnesota cohort characterized for the same 502 mRNA transcripts (Nakagawa et al., 2008). Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Table 4: Gene Ontology Annotation of mRNAs associated with Biochemical Recurrence. The nine mRNAs associated with biochemical recurrence in both the Toronto- 139 and

Minnesota-596 (Nakagawa et al., 2008) were cohorts (Figure 5, Table 3) were annotated for gene ontology terms. Several terms were found overrepresented in the five mRNAs associated with recurrence including deoxyribosylthymine monophosphate (dTMP) metabolism, negative regulation of B and T-cell leukocyte proliferation, and negative regulation of cell adhesion. Overrepresented terms for the four mRNAs associated with non-recurrence T1/E4 positive tumors were associated with cell adhesion and hydrolase and oxide activity. P-values were calculated using a hypergeometric test in the R package GOstats.

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Table 5: Clinical and Molecular Factors for the Toronto- 139. Cohort clinical characteristics for the 139 prostate cancer patients in the Toronto cohort are listed out for TMPRSS2-ERG T1/E4 fusion positive and fusion negative patients. Factors were assessed for their association with biochemical recurrence when relevant (indicated by a univariate p-value). Factors prognostic of recurrence (p < 0.05) were used in a multivariate model of recurrence. The nine- gene recurrence score (composed of the genes listed in Figure 5) is composed of mRNAs replicated as prognostic of recurrence in this experiment and a 596 patient Minnesota experiment (Nakagawa et al., 2008).

Emory Ref: 10048 PCT

EXAMPLE 2: IDENTIFICATION OF BIOMARKER PREDICTORS FOR THE PROGRESSION AND METASTATIC POTENTIAL OF PROSTATE CANCER.

RNA Isolation. RNA is isolated from formalin-fixed paraffin-embedded (FFPE) tissue according to the methods described in Abramovitz et al., Biotechniques 44(3):417-23 (2008). In brief, three 5 μιη sections per block were cut and placed into a 1.5mL sterile microfuge tube. The tissue section was deparaffinized with 100% xylene for 3 minutes at 50°C. The tissue section was centrifuged, washed twice with ethanol, and allowed to air dry. The tissue section was digested with Proteinase K for 24 hours at 50°C. RNA was isolated using an Ambion Recover All Kit (Ambion; Austin, TX). cDNA-mediated annealing, selection, extension, and ligation assay (DASL assay).

Upon the completion of RNA isolation, the isolated RNA is used in the DASL assay. The DASL assay is performed according to the protocols supplied by the manufacturer (Illumina, Inc.; San Diego, CA). The primer sequences for the fourteen biomarker genes are shown in Table 6. The probe sequences for the fourteen biomarker genes are shown in Table 7. Emory Ref: 10048 PCT

Table 6: DASL assay Primer Sequences for Fourteen Biomarker Genes

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Table 7: Probe Sequences for Detection of Fourteen Biomarker Genes in DASL assay

Emory Ref: 10048 PCT

For each of the genes in the predictive nine-gene score, the signal is obtained by the average of three probes. The sets of DASL assay primer sequences are given in Table 8, and the DASL probe sequences are given in Table 9. Table 8: DASL assay Primer Sequences for Nine Biomarker Genes

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

To compute the predictive nine-gene score, DASL signal levels are quantile normalized across the array and the signal for each of the three probes is averaged to produce a gene signal. The nine-gene score is then computed using the following formula:

NINE GENE SCORE = (C C SPG2 X CSPG2 A vgGeneSignal) + (CcDKN2A X CDKN2A A vgGeneSignal) +

(CwNTlOB X WNTlOBAvgGeneSignal) + (CTYMS X TYMSAvgGeneSignal) + (CE2F3 X E2F3AvgGeneSignal) + (CLAF4 X LAF4AvgGeneSignal) + (CALOX12 X ALOX 12AvgGeneSignal) + (CcD44 X CD44AvgGeneSignal) + (CxGFB3 X TGFB3AvgGeneSignal)-

The coefficients for the predictive nine-gene score are as follows: C CSPG 2 = 0.000295, CCDKN2A = 0.00024, CwNTioB = 0.001528, C TY MS = 0.000219 C E2 F3 = 0.000585, C L AF4 = -8.8e- 05, CALOXI2 = -0.00291, C CD 44 = -0.00012, CTGFBS = -0.00025.

To compute the predictive fourteen-gene score, DASL signal levels are quantile normalized across the array, and then Z-score normalized across the samples. (Z-score = (signal - average(signal))/stdev(signal)). Once the predictive scores are computed, samples are separated based on whether they are greater or less than the median score. If a sample has a score greater than the median, the subject is predicted to not have recurrence. If the score is less than the median, the subject is predicted to have recurrence. For this predictive score, the higher the score, the less likely the subject is to have recurrence.

The predictive fourteen-gene score can be calculated using the following formula:

FOURTEEN GENE SCORE = (C FO XOIA X FOX01A ZsC ore) + (C SO x 9 x SOX9 ZsC ore) + (CCLNSIA X

CLNS1 Azscore) + (C PTG DS X PTGDS Zs core) + (C XP0 1 X XP01 Zs core) + (CRA D 23B X RAD23B ZsC ore) +

(CTMPRSS2_ETVI FUSION X TMPRSS2_ETV1 FUSION ZsC ore) + (C A BCC3 X ABCC3 ZsC ore) + (CAPC X APC Zscore ) + (CcHESi x CHESl Zscore ) + (CEDNRA X EDNRA Zscore ) + X FRZB Zscore ) + (CHSPG2 X HSPG2 Zscore ). Emory Ref: 10048 PCT

The coefficients for the predictive fourteen-gene score are as follows: CFOXOIA = 0.687, Csox9 = 0.35 1 , CcLNSiA = 0.1 12, CPTGDS = 0.058, CXPOI = -0.208, CLETMDI = -0.019, CRAD23B = - 0.065, CTMPRSS2 ETvi FUSION = -0. 168, CABCC3 = -0.202, CAPC = -0. 128, CFRZB = 0.3 10, CHSPG2 = -0.048, CEDNRA = 0.539, and CCHESI = -0. 143. The coefficients for the predictive seven-gene score are as follows:

CFOXOIA = 0.625, Csox9 = 0.253, QXNSIA = 0.0, CPTGDS = 0.056, CXPOI = -0.092, CLETMDI = - 0. 140, CRAD23B = -0.045, and CTMPRSS2 ETVI FUSION = -0. 137. miRNA expression profiling

The isolated RNA is additionally used in the Illumina Human Version 2 MicroRNA Expression Profiling kit (Illumina, Inc.; San Diego, CA) in conjunction with the DASL assay. The miRNA expression profiling is performed according to the manufacturer's protocol. The mature miRNA sequence for the six miRNA biomarkers are shown in Table 10. The probe sequences for the six miRNA biomarkers are shown in Table 1 1 . Table 10: Mature miRNA Sequences for Six miRNA Biomarkers

Emory Ref: 10048 PCT

To compute a predictive miRNA score, DASL signal levels are quantile normalized across the array, and then Z-score normalized across the samples. (Z-score = (signal - average(signal))/stdev(signal)). The more positive the predictive score, the more likely the subject will recur. The more negative the score, the less likely the patient will recur.

The predictive six miRNA gene score can be calculated using the following formula: SIX miRNA SCORE = miR-103 ZsC ore + miR-339 ZsC ore + miR-183 ZsC ore + miR-182 Zscore - miR136 Zscore - miR221 Zscore .

Results

A highly predictive set of 520 genes was determined through analysis of multiple publicly available gene expression datasets (Dhanasekaran et al., Nature 412:822-6 (2001); Lapointe et al, Proc. Natl. Acad. Sci. USA 101 :811-6 (2004); LaTulippe et al, Cancer Res. 62:4499-506 (2002); Varambally et al, Cancer Cell 8:393-406 (2005)), datasets from gene expression profiling analysis of 58 prostate cancer patient samples (Liu et al., Cancer Res. 66:4011-9 (2006)), and genes involved in prostate cancer progression based on state of the art understanding of the disease (Tomlins et al., Science 310:644-8 (2005); Varambally et al., Cancer Cell 8:393-406 (2005)). The predictive set of 520 genes were optimized for

performance in the cDNA-mediated annealing, selection, extension, and ligation (DASL) assay (Illumina, Inc.; San Diego, CA). The DASL assay is based upon multiplexed reverse transcription-polymerase chain reaction (RT-PCR) applied in a microarray format and enables the quantitation of expression of up to 1536 probes using RNA isolated from archived formalin- fixed paraffin embedded (FFPE) tumor tissue samples in a high throughput format (Bibokova et al, Am. J. Pathol. 165: 1799-807 (2004); Fan et al, Genome Res. 14:878-85 (2004)). RNAwas isolated from 71 patient samples with definitive clinical outcomes and was analyzed using the Emory Ref: 10048 PCT

DASL assay. Based on the data from 71 patients, a subset of fourteen protein encoding genes were found to be capable of separating Gleason 7 subjects with and without recurrence, and thus were found to be good predictors of recurrent, progressive, or metastatic prostate cancers. The fourteen protein encoding genes included: FOXOIA, SOX9, CLNS1A, PTGDS, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, EDNRA, FRZB, HSPG2, and the

TMPPvS S2_ET V 1 FUSION. The expression of CLNS1A, XPOl, LETMD1, RAD23B, ABCC3, APC, CHES1, FRZB, HSPG2, and TMPRS S2 ET V 1 FUSION was increased in recurrent, progressive, or metastatic prostate cancers, while the expression of FOXOIA, SOX9, EDNRA, and PTGDS was decreased in recurrent, progressive, or metastatic prostate cancers. Additionally, based on data obtained from the 71 patients using the MicroRNA Expression

Profiling Panels (Illumin, Inc.; San Diego, CA) designed for the DASL assay, it was found that six miRNA genes were found to be good predictors of recurrent, progressive, or metastatic prostate cancers. The six miRNA genes included: miR-103, miR-339, miR-183, miR-182, miR-136, and miR-221. The expression of miR-103, miR-339, miR-183, and miR-182 was increased in recurrent, progressive, or metastatic prostate cancers, while the expression of miR- 136 and miR-221 was decreased in recurrent, progressive, or metastatic prostate cancers.

EXAMPLE 3

To identify biomarkers predictive of recurrence, FFPE tissue blocks from 73

prostatectomy patient samples were assembled to perform DASL expression profiling with our custom-designed panel of 522 prostate cancer relevant genes. This training set of samples included 29 cases with biochemical PSA recurrence, and 44 cases without recurrence. A lasso Cox PH models was fit to identify the probes that achieved the optimal prediction performance, with the tuning parameter for Lasso selected using a leave-one-out cross-validation technique. This approach identified a panel of eight protein-coding genes (CTNNA1, XPOl, PTGDS, SOX9, RELA, EPB49, SIM2, and EDNRA) that could be used to predict recurrence following radical prostatectomy.

Emory Ref: 10048 PCT

Kaplan-Meier analysis demonstrated that these probes could significantly discriminate patients with and without recurrence by the log rank test (p = 1.16e-06). This predictive model was applied to a separate DASL profiling experiment on 40 prostate cancer cases (27 without recurrence and 13 with recurrence). Kaplan-Meier analysis on this validation set determined that the model could significantly discriminate patients with and without recurrence (p = 0.000153).

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

Emory Ref: 10048 PCT

In addition, comprehensive DASL miRNA profiling of these same 73 FFPE cases was performed using the MicroRNA Expression Profiling Panels (Illumina, Inc.) designed for the DASL assay. MicroRNA probes were filtered to retain only those that were present on the microRNA microarrays used for both the training and validation sets, reducing the total number of probes examined to 403 miRNA probes. A panel of five microRNAs (hsa-miR-103, hsa-miR- 340, hsa-miR-136, HS 168, HS l 11) was identified correlated with prostate cancer recurrence. Emory Ref: 10048 PCT

Kaplan-Meier analysis and the log-rank test determined that this panel could

significantly discriminate patients with and without recurrence in the training set (p = 1.63E- 05). However, in the independent validation set, this panel was borderline significant in its ability to discriminate patients with and without recurrence (p = 0.056).

An additional analysis was performed using combined data from both the 1536 protein- coding and 403 miRNA DASL probes. Combined analysis of both biomarker panels identified seven protein-coding and one miRNA gene (XPOl, hsa-miR- 103, PTGDS, SOX9, RELA, EPB49, EDNRA, FOXOIA), and this combined panel was also significant in both the training set (p = 1.41E-07) and the validation set (p = 0.009).

Next we applied the three biomarker panels to the subset of cases in the training (n = 46) and validation sets (n = 18) that had a Gleason score of seven. Of the three panels, only the mRNA panel was significant (p = 0.00927) at discriminating Gleason score seven cases in both the training and validation sets (see below). Emory Ref: 10048 PCT

Hierarchical clustering of the patient samples using this set of eight genes performed well in separating Gleason seven patients with and without recurrence. While the trend in the combined panel of mRNA and miRNA was towards significance (p = 0.164) for the validation set, and could possibly achieve significance with a larger sample set, it did not perform as well as the mRNA panel alone.

EXAMPLE 4

Panel of ten protein-coding genes and two miRNA genes (RAD23B, FBP1,

TNFRSFIA, CCNG2, NOTCH3, ETVl, BID, SIM2, ANXAl, miR-519d, and miR-647) were identified that could be used to separate patients with and without biochemical recurrence (p < 0.001), as well as for the subset of 42 Gleason score 7 patients (p < 0.001). an independent validation analysis on 40 samples was performed and it was found that the biomarker panel was also significant at prediction of recurrence for all cases (p = 0.013) and for a subset of 19 Gleason score 7 cases (p = 0.010), both of which were adjusted for relevant clinical information including T-stage, PSA and Gleason score. Importantly, these biomarkers could significantly predict clinical recurrence for Gleason 7 patients. These biomarkers may increase the accuracy of prognostication following radical prostatectomy using formalin- fixed specimens.

Patient Samples

In the initial training set, 70 cases were used (29 with biochemical recurrence and 41 controls), 45 patients from Sunnybrook Health Science Center (Toronto, ON), and 25 patients Emory Ref: 10048 PCT from Emory University. The 45 cases of paraffin-embedded tissue samples from Toronto were drawn from men who underwent radical prostatectomy as the sole treatment for clinically localized prostate cancer (PCa) between 1998 and 2006. The clinical data includes multiple clinicopathologic variables such as prostate specific antigen (PSA) levels, histologic grade (Gleason score), tumor stage (pathologic stage category for example; organ confined, pT2; or with extra-prostatic extension, pT3a; or with seminal vesicle invasion, pT3b), and biochemical recurrence rates. For the cases from Emory University, both the training set (25 cases) and validation set (40 cases)

FFPE samples were also selected from a screen of over a thousand patients through an IRB-approved retrospective study at Emory University of men who had undergone radical prostatectomy. Those who were included met specific inclusion criteria, had available tissue specimens, documented long term follow-up and consented to participate or were included by IRB waiver. The cases were assigned prostate ID numbers to protect their identities. These patients did not receive neo-adjuvant or concomitant hormonal therapy. Their demographic, treatment and long-term clinical outcome data have been collected and recorded in an electronic database. Clinical data recorded include PSA measurements, radiological studies and findings, clinical findings, tissue biopsies and additional therapies that the subjects may have received.

RNA Preparation

Tissue cores (1 mm) were used for RNA preparation rather than sections because of the heterogeneity of samples and the opportunity for obtaining cores with very high percentage tumor content. H&E stained slides were reviewed by a board certified urologic pathologist (AOO) to identify regions of cancer to select corresponding areas for cutting of cores from paraffin blocks. Total RNA was prepared at the Emory Biomarker Service Center from FFPE cores using the Ambion Recoverall MagMax methodology in 96-well format on a MagMax 96 Liquid Handler Robot (Life Technologies, Carlsbad, CA). FFPE RNA was quantitated by nanodrop spectrophotometry, and tested for RNA integrity and quality by Taqman analysis of the RPL13a ribosomal protein on a HT7900 real-time PCR instrument (Applied Biosystems, Foster City, CA). Samples with sufficient yield (>500 ng), A260/A280 ratio > 1.8 and RPL13a CT values less than 30 cycles were used for miRNA and DASL profiling. Emory Ref: 10048 PCT

Custom Prostate Cancer DASL Assay Pool (DAP)

The DASL assay enables quantitation of expression using RNA isolated from archived FFPE tumor tissue samples in a high throughput format. Data from multiple publicly available gene expression datasets, along with genes involved in prostate cancer progression based on state-of-the-art understanding of the disease, were distilled to develop a highly predictive set of 522 genes for use in the DASL assay. Due to specific probe design considerations, this panel had three probes for 497 genes, two probes for 20 genes, and a single probe for five genes, two of which were specific to TMPRSS2-ERG and TMPRSS2-ETV1 fusions transcripts. The unique combination of genes was optimized for performance in the DASL assay using stringent criteria that predicts performance of the primer sets. The panel includes genes found to be correlated with Gleason score. It also includes prognostic markers, and genes associated with metastasis. In addition, a number of genes known from other studies to be critical in prostate cancer such as NKX3.1, PTEN, and the Androgen Receptor are all included in the panel. Other genes that play important roles in the Wnt, Hedgehog, TGFp, Notch, MAPK and PI3K pathways are also present in this gene set. Finally, primer sets that detect chromosomal translocations in ERG 9, ETV1 15, and ETV4 16 are also included in this panel. The optimal oligonucleotide sequence for each gene probe was determined using an oligonucleotide scoring algorithm. The oligonucleotide pool or DASL Assay Pool (DAP) was synthesized by Illumina for use with the 96-well Universal Array Matrix (UAM). Data Analysis

DASL fluorescent intensities were interpreted in GenomeStudio, quantile normalized, and exported for meta-analysis. Average signal intensity, genes detected (p-value = 0.01), background, and noise (standard deviation of background) were analyzed for trends by plate, row, and column. The two endpoints of interest were postoperative biochemical recurrence, defined as two detectable PSA readings (>0.2 ng/ml), and clinical recurrence, defined as evidence of local or metastatic disease. The primary outcome of interest was time to biochemical recurrence following surgery. A local recurrence was defined as recurrence of cancer in the prostatic bed that was detected by either a palpable nodule on digital rectal examination (DRE) and subsequently verified by a positive biopsy, and/or a positive imaging study (prostascint or CT scan) accompanied by a detectable postoperative PSA result and lack Emory Ref: 10048 PCT of evidence for metastases. Also, patients whose PSA level decreased following adjuvant pelvic radiation therapy for elevated postoperative PSA were considered as local recurrence cases. A recurrence with metastases was defined as a positive imaging study indicating presence of a tumor outside of the prostatic bed.

To identify important probes and build and evaluate prediction models for prostate cancer biochemical recurrence, the following strategy was adopted. In the training step, the prediction model was built based on the time to biochemical recurrence. Specifically, we first fit a univariate Cox proportional hazard (PH) model for each individual probe using the training data set, and a set of important mRNA and miRNA probes were then preselected based on a false discovery rate (FDR) threshold of 0.30. Next, to identify the optimal prediction score based on the preselected probes, we fit a lasso Cox PH model using the training data set, where the tuning parameter for lasso was selected using a leave-one-out cross-validation technique. See Goeman, Biom J 2010, 52:70-84. The lasso Cox PH model was fitted first using the set of preselected mRNA probes only and then using the complete set of preselected mRNA and miRNA probes resulting in an optimal mRNA panel and an optimal combined mRNA/miRNA panel, respectively. Based on each biomarker panel, a final prediction model for recurrence was built to also incorporate relevant clinical biomarkers, namely, T-stage, PSA and Gleason score, through fitting Cox PH models.

To evaluate and validate the final prediction models obtained from the training phase, 79 samples from 40 patients were used and replicate samples from the same patient were again averaged to generate a single average signal for each patient. Each prediction model from the training phase was used to generate a predictive score for each subject in the validation data set, and subjects were subsequently divided into high and low scoring groups using the median predictive score. Kaplan Meier analysis was performed to compare the time to biochemical recurrence, between high (poor score) and low (good score) risk groups, and the statistical significance was determined using the log-rank test. Similarly, the final model that included both mRNA and miRNA probes for predicting time to clinical recurence in both training and validation data sets was evaluated. The available-case approach was adopted in our analyses and the sample sizes used in each step of building and evaluating prediction models may be less than the total sample size. Emory Ref: 10048 PCT

Custom Prostate DASL profiling

DASL expression profiling with a custom-designed prostate cancer panel (see Materials and Methods section) and the Illumina DASL microRNA (miRNA) panel were performed on 70 prostatectomy patient samples to identify biomarkers predictive of recurrence. An independent validation profiling experiment was performed on 40 additional samples.

MicroRNA probes were filtered to retain only those that were present on the miRNA microarrays used for both the training and validation sets, reducing the total number of probes examined to 403 microRNA probes. The training set included 29 cases with observed biochemical PSA recurrence (median time to recurrence = 19 months), and 41 cases censored, i.e., without observed recurrence during the follow-up (median follow-up time = 83.0 months).

Integrated DASL biomarker analysis

After fitting a univariate Cox proportional hazard (PH) model for each individual probe using the training data, a set of 27 important probes were preselected based on an FDR threshold of 0.30. Next, to identify the optimal prediction score based on the preselected probes, a lasso Cox proportional hazard (PH) model was first fit using the set of 25 preselected mRNA probes only, resulting in a panel of nine protein-coding genes shown in the Table below (RAD23B, FBP1, TNFRSF1A, NOTCH3, ETV1, BID, SIM2, ANXA1, and BCL2).

Emory Ref: 10048 PCT

A final prediction model was then built to include the predictive score based on this panel of nine mRNA biomarkers as well as the relevant clinical biomarkers including T-stage, PSA and Gleason score, which could be used to predict recurrence following radical prostatectomy. Kaplan-Meier analysis (Figure 1 A) demonstrated that these probes could significantly discriminate patients with and without recurrence by the log rank test (p < 0.001). The final predictive model developed on the training set was applied to the validation set, a separate, independent DASL profiling experiment performed on a different day. Kaplan-Meier analysis (Figure IB) on this validation set determined that the model could discriminate patients with and without recurrence (p = 0.010).

Subsequently, the above training procedure was repeated using the complete set of 27 preselected mRNA and miRNA probes, and an optimal panel of ten mRNAs and two microRNAs (additional oligonucleotides below) was identified and built as a prediction model for prostate cancer biochemical recurrence, which again included relevant clinical biomarkers. Kaplan-Meier analysis and the log-rank test determined that this panel could significantly discriminate patients with and without recurrence both in the training set (p < 0.001, Figure 1C) and in the validation set (p = 0.013, Figure ID).

FBP1

5 '-ACTTCGTCAGTAACGGACTGGCATTGCTGGTTCTACCAAC-3 ' (SEQ ID NO: 178)

5 '-GAGTCGAGGTCATATCGTTGGCATTGCTGGTTCTACCAAC-3 ' (SEQ ID NO: 179)

5 ' -TGACAGGTGATCAAGTTAAGAAGTCGAGCGTTCGGAGCACTTAATCGTCT

GCCTATAGTGAGTC-3 ' (SEQ ID NO: 180)

TNFRSF1A

5 '-ACTTCGTCAGTAACGGACTCCCCAAGGAAAATATATCCACCC-3 ' (SEQ ID NO: 181) 5 '-GAGTCGAGGTCATATCGTTCCCCAAGGAAAATATATCCACCC-3 ' (SEQ ID NO: 182)

5 '-CAAAATAATTCGATTTGCTGTACAGTAGCCCAGGTAGCGGAGCTTGTCTGC

CTATAGTGAGTC-3 ' (SEQ ID NO: 183) NOTCH3

5 '-ACTTCGTCAGTAACGGACGTTCACAGGAACCTATTGCGAGGT-3 ' (SEQ ID NO: 184) 5 '-GAGTCGAGGTCATATCGTGTTCACAGGAACCTATTGCGAGGT-3' (SEQ ID NO: 185) Emory Ref: 10048 PCT

5'-GACATTGACGAGTGTCAGAGTAGCTGACTCTTGTAGTATTGCGCGAAGTC

TGCCTATAGTGAGTC-3 ' (SEQ ID NO: 186)

ETV1

5 '-ACTTCGTCAGTAACGGACTATGTTTGAAAAGGGCCCCAGG-3 ' (SEQ ID NO: 187)

5 '-GAGTCGAGGTCATATCGTTATGTTTGAAAAGGGCCCCAGG-3 ' (SEQ ID NO: 188)

5 '- AGTTTTATGATGACACCTGTGTTTACGGATGGCAACAAGTACGGATTGTCT

GCCTATAGTGAGTC-3 ' (SEQ ID NO: 189) BID

5 '-ACTTCGTCAGTAACGGACGTTCCAGCCTCAGGGATGAGTG-3 ' (SEQ ID NO: 190)

5 '-GAGTCGAGGTCATATCGTGTTCCAGCCTCAGGGATGAGTG-3 ' (SEQ ID NO: 191)

5'-ATCACAAACCTACTGGTGTTTGGCGCTAGGTTAATAAGCGGATGCGTCTGC

CTATAGTGAGTC-3 ' (SEQ ID NO: 192)

ANXA1

5 '-ACTTCGTCAGTAACGGACGATCAGAATTCCTCAAGCAGGCC-3 ' (SEQ ID NO: 193)

5 '-GAGTCGAGGTCATATCGTGATCAGAATTCCTCAAGCAGGCC-3 ' (SEQ ID NO: 194)

5 ' -GGTTTATTGAAAATGAAGAGCAAGGGTTCTATGTTTGGACGCCATGGTCTG

CCTATAGTGAGTC-3 ' (SEQ ID NO: 195)

BCL2

5'-ACTTCGTCAGTAACGGACCGTGCCTCATGAAATAAAGATCCG-3' (SEQ ID NO: 196) 5'-GAGTCGAGGTCATATCGTCGTGCCTCATGAAATAAAGATCCG-3' (SEQ ID NO: 197) 5 '-AAGGAATTGGAATAAAAATTTCCGGATGACGACCGAATACCGTTGGTCTG

CCTATAGTGAGTC-3' (SEQ ID NO: 198)

CCNG2

5 '-ACTTCGTCAGTAACGGACGCCACTCATGATGTGATCCGGATT-3 ' (SEQ ID NO: 199)

5 '-GAGTCGAGGTCATATCGTGCCACTCATGATGTGATCCGGATT-3 ' (SEQ ID NO:200)

5'-GTCAGTGTAAATGTACTGCTTCTGGTGCTCTGAGACGGCAAAGATTCGTC

TGCCTATAGTGAGTC-3' (SEQ ID NO:201) Emory Ref: 10048 PCT hsa-miR-647

ProbeSeq 5'-GTGGCTGCACTCACTTC-3' (SEQ ID NO:202)

TargetMatureSeqs 5'-GTGGCTGCACTCACTTCCTTC-3' (SEQ ID NO:203)

Oligo 5 ' - ACTTCGTCAGTAACGGACTTGAGCGGACCCAGA

TGTACCGGTGGCTGCACTCACTTC-3 ' (SEQ ID NO:204) hsa-miR-519d

ProbeSeq 5 ' -AGTGCCTCCCTTTAGAGTG-3 ' (SEQ ID NO:205)

TargetMatureSeqs 5 '-CAAAGTGCCTCCCTTTAGAGTG-3 ' (SEQ ID NO:206)

Oligo 5 ' - ACTTCGTCAGTAACGGACCAGAGTGTCCCCGT

GGCGATAC AGTGCCTCCCTTTAGAGTG-3' (SEQ ID NO:207)

RAD23B

5'-ACTTCGTCAGTAACGGACAATCCTTCCTTGCTTCCAGCG-3' (SEQ ID NO: 13)

5 '-GAGTCGAGGTCATATCGTAATCCTTCCTTGCTTCCAGCG-3 ' (SEQ ID NO: 14)

5 '-TACTACAGCAGATAGGTCGAGAGTAGGGTTCGGGTTCAGACACCGGTCTGCCTAT AGTGAGTC-3' (SEQ ID NO:208)

Prediction of Cases with a Gleason Score 7

Prediction of recurrence for patients with a Gleason score 7 is particularly difficult. In order to address this issue, we applied the biomarker panels to the subset of cases in the training and validation sets that had a Gleason score 7. The prediction model based on the nine-mRNA panel was significant at discriminating biochemical recurrence in Gleason score 7 cases in both the training set (p < 0.001, Figure 7A) and the validation set (p = 0.027, Figure 7B). For the prediction model based on the combined panel of ten mRNAs and two miRNAs in the tables below, the predictive value was again significant for both the training set (p = < 0.001, Figure 7C) and the validation set (p = 0.010, Figure 7D).

Emory Ref: 10048 PCT

Analysis of clinical recurrence

Although most patients who have clinical recurrence following prostatectomy also have biochemical recurrence, there is a significant population of patients with biochemical recurrence who do not have clinically significant recurrences observed during their follow-ups. To evaluate our biomarker panel of biochemical recurrence for predicting the clinical recurrence, the prediction model was tested based on the combined mRNA/miRNA panel in the same training and validation samples using their clinical recurrence outcome data.

Unfortunately, clinical recurrence data was lacking on some of the samples, and the total number of samples used in the training set was reduced. In the training data, the combined mRNA/miRNA panel was highly significant for predicting recurrence in all patients (p=0.002) as well as in the subset of patients with a Gleason score 7 (p=0.004); in the validation data, it Emory Ref: 10048 PCT was also significant for predicting recurrence in patients with a Gleason score 7 (p=0.023) and trended towards significance in all patients (p=0.078).

An analysis was also performed to construct a predictive set of biomarkers based on the clinical recurrence data instead of biochemical recurrence. Only three probes passed the initial preselection step for the univariate Cox PH modeling, all corresponding to the ETV1 gene. Furthermore, the prediction model built on clinical recurrence did not perform as well as the model built on biochemical recurrence, which is likely due to the considerably less number of clinical recurrences in the training set as well as the smaller total sample size.

Discussion

The DASL assay has been used to identify a 16-gene set that correlates with prostate cancer relapse. Bibikova et al., Genomics 2007, 89:666-672. Overlap between our panel of ten mRNA and two miRNA biomarkers described here and the previously described 16-gene panel was limited to FBPl even though ten of the genes in the 16-gene panel reported were included in our 522 custom prostate DASL panel. When the performance of the probes corresponding to those ten mRNAs was analyzed in our dataset, they were not able to significantly discriminate patients at higher and lower risk of recurrence. The gene signature selection and prediction model building were performed in separate steps and the signature selection was based on the correlation between the gene expression and Gleason score rather than between the gene expression and time to biochemical recurrence; our analytic approach overcomes these limitations. Specifically, our approach of building (training) prediction models takes advantage of recent advancement in regularized regression models for survival outcomes; regularized Emory Ref: 10048 PCT regression models can achieve simultaneous feature selection and model estimation and avoid model overfitting leading to better prediction performance.

Two other studies have employed DASL profiling to prostate cancer, but not detected any signature that improved upon clinical models in validation sets. Sboner et al., BMC Med Genomics 2010, 3:8 and Nakagawa et al., PLoS ONE, 2008, 3:e2318. While these studies used large cohorts with long-term follow-up, they did not include probes corresponding to microRNA genes. Moreover, these earlier studies suggested that tumor heterogeneity may play an important role in confounding signature identification. For our study of prostatectomy specimens, the most prominent tumor lesion were identified, and used a tissue core sample from that region to minimize stromal contributions and tumor heterogeneity.

In our twelve-gene predictive biomarker panel, nine of the genes are positively associated with recurrence, and three are negatively associated with recurrence. The nine genes positively associated with recurrence included miR-519d, Notch homo log 3 (Notch3),

Fructose- 1 ,6-bisphosphatase 1 (FBP1), ETS variant gene 1 (ETV1), BH3 interacting domain death agonist (BID), Single-Minded homolog 2 (SIM2), RAD23 homolog B (RAD23B), LETM1 domain containing 1 (LETMD1), and Cyclin G2 (CCNG2). Little is known about miPv-519d other than it may be associated with obesity. Martinelli et al., miR-519d

Overexpression Is Associated With Human Obesity, Obesity (Silver Spring) 2010. NOTCH3 is one of four Notch family receptors in humans, and Notch signaling has been shown to be important for prostate cancer cell growth, migration, and invasion as well as normal prostate development. FBP1 is expressed in the prostate and is involved in gluconeogenesis. The identification of this metabolic enzyme as a biomarker of recurrence is initially surprising. FBP1 was overexpressed in independent microarray analyses of prostate cancers. ETV1 is one of the recurrent translocations found in prostate cancers, and has been used in clinical models of recurrence following prostatectomy. Cheville et al., J Clin Oncol 2008, 26:3930-3936. BID is a pro-apoptotic protein that binds to BCL2 and potentiates apoptotic responses upon cleavage in response to tumor necrosis factor alpha (TNFa) and other death receptors. SIM2 was identified as a potential biomarker of prostate cancer. Halvorsen et al., Clin Cancer Res 2007, 13:892-897. SIM2 functions as a transcription factor that represses the proapoptotic gene BNIP3. RAD23B plays a role in DNA damage recognition and nucleotide excision repair, as well as inhibiting MDM2 mediated degradation of the p53 tumor suppressor. LETMD1 (also Emory Ref: 10048 PCT known as HCCR) is an oncogene that is induced by Wnt and PI3K/AKT signaling, inhibits p53 function, and is a biomarker for hepatocellular and breast cancers. Cyclin G2 is an atypical cyclin that is induced by DNA damage in a p53 -independent manner, as well as by

PI3K/AKT/FOXO signals, and induces p53-dependent cell cycle arrest.

The three genes in the predictive biomarker panel negatively associated with recurrence were miR-647, the TNFa receptor (TNFRSFl A), and annexin Al (ANXAl). While little is known about miR-647, TNFRSFl A (also known as TNFRl) mediates pro-apoptotic responses to TNFa ligand. Annexin A 1 expression is reduced in early onset prostate cancer and high- grade prostatic intraepithelial neoplasia. ANXAl plays roles in vesicle trafficking and reduced ANXAl promotes EMT and metastasis, and upregulates autocrine IL-6 signaling.