Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHODS AND MATERIALS FOR IDENTIFYING AND TREATING MAMMALS HAVING HER2-POSITIVE BREAST CANCER
Document Type and Number:
WIPO Patent Application WO/2015/164238
Kind Code:
A1
Abstract:
This document provides methods and materials involved in identifying mammals having breast cancer (e.g., HER2-positive breast cancer) responsive to trastuzumab as well as methods and materials involved in treating mammals having breast cancer (e.g., HER2-positive breast cancer) responsive to trastuzumab. For example, methods and materials for using expression level profiles to identify mammal having HER2-positive breast cancer with an increased likelihood of being responsive to trastuzumab are provided.

Inventors:
PEREZ EDITH A (US)
THOMPSON E AUBREY (US)
BALLMAN KARLA V (US)
Application Number:
PCT/US2015/026620
Publication Date:
October 29, 2015
Filing Date:
April 20, 2015
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
MAYO FOUNDATION (US)
International Classes:
C12Q1/68
Domestic Patent References:
WO2011056688A22011-05-12
Foreign References:
US20110217297A12011-09-08
US20140056881A12014-02-27
US20130251710A12013-09-26
Other References:
See also references of EP 3134549A4
Attorney, Agent or Firm:
FINN III, J. Patrick (P.O. Box 1022Minneapolis, Minnesota, US)
Download PDF:
Claims:
WHAT IS CLAIMED IS: 1. A method for identifying a mammal as having breast cancer with an increased likelihood of being responsive to trastuzumab, wherein said method comprises 5 determining whether or not cancer cells from said mammal contain an elevated level of expression for at least nine of the nucleic acids listed in Table 9, wherein the presence of said elevated levels indicates that said mammal has breast cancer with an increased likelihood of being responsive to trastuzumab. 10 2. The method of claim 1, wherein said mammal is a human. 3. The method of claim 1, wherein said elevated levels are determined using a cDNA-mediated annealing, selection, extension, and ligation (DASL) assay. 15 4. The method of claim 1, wherein said breast cancer is an HER2-positive breast cancer. 5. A method for identifying a mammal as having breast cancer with an increased likelihood of being responsive to trastuzumab, wherein said method comprises:

20 (a) determining whether or not a breast cancer cells from said mammal contain an elevated level of expression for at least nine of the nucleic acids listed in Table 9, and

(b) classifying said mammal as having breast cancer with an increased likelihood of being responsive to trastuzumab if said sample contains said elevated 25 levels of said at least nine nucleic acids. 6. The method of claim 5, wherein said mammal is a human. 7. The method of claim 5, wherein said elevated levels are determined using a 30 cDNA-mediated annealing, selection, extension, and ligation (DASL) assay.

8. The method of claim 5, wherein said breast cancer is an HER2-positive breast cancer. 9. A method for identifying a mammal as having breast cancer with an increased 5 likelihood of being responsive to trastuzumab, wherein said method comprises:

(a) detecting the presence of an elevated level of expression for at least nine of the nucleic acids listed in Table 9 in breast cancer cells from said mammal, and

(b) classifying said mammal as having breast cancer with an increased likelihood of being responsive to trastuzumab based at least in part on said presence 10 of said elevated levels. 10. The method of claim 9, wherein said mammal is a human. 11. The method of claim 9, wherein said elevated levels are determined using a 15 cDNA-mediated annealing, selection, extension, and ligation (DASL) assay. 12. The method of claim 9, wherein said breast cancer is an HER2-positive breast cancer. 20 13. A method for treating breast cancer, wherein said method comprises:

(a) detecting the presence of an elevated level of expression for at least nine of the nucleic acids listed in Table 9 in breast cancer cells from a mammal, and

(b) administering a taxane compound and trastuzumab to said mammal under conditions wherein the number of breast cancer cells within said mammal is reduced. 25

14. The method of claim 13, wherein said mammal is a human. 15. The method of claim 13, wherein said elevated levels are determined using a cDNA-mediated annealing, selection, extension, and ligation (DASL) assay.

30

16. The method of claim 13, wherein said breast cancer is an HER2-positive breast cancer. 17. The method of claim 13, wherein said taxane compound is paclitaxel. 5

Description:
METHODS AND MATERIALS FOR IDENTIFYING AND TREATING MAMMALS HAVING HER2-POSITIVE BREAST CANCER CROSS REFERENCE TO RELATED APPLICATIONS

5 This application claims the benefit of U.S. Provisional Serial No. 61/982,251 filed April 21, 2014. This disclosure of the prior application is considered part of (and is incorporated by reference in) the disclosure of this application. BACKGROUND

10 1. Technical Field

This document relates to methods and materials involved in identifying mammals having breast cancer (e.g., HER2-positive breast cancer) responsive to trastuzumab as well as methods and materials involved in treating mammals having breast cancer (e.g., HER2-positive breast cancer). For example, this document 15 provides methods and materials for using expression level profiles to identify a

mammal as having breast cancer (e.g., HER2-positive breast cancer) responsive to trastuzumab. 2. Background Information

20 Clinical trials demonstrated the efficacy of trastuzumab in an adjuvant setting.

20-25 percent of patients with HER2-positive breast tumors, however, relapse despite HER2-targeted therapy. A number of potential mechanisms were proposed to account for differential response to HER2-targeted therapy, including overexpression of EGFR, cMYC, or ERBB3, mutational activation of PI3K, and mutational loss of 25 PTEN (Arteaga et al., Nat. Rev. Clin. Oncol., 9(1):16-32 (2012)). SUMMARY

This document provides methods and materials involved in identifying mammals having breast cancer (e.g., HER2-positive breast cancer) responsive to 30 trastuzumab as well as methods and materials involved in treating mammals having breast cancer (e.g., HER2-positive breast cancer) responsive to trastuzumab. For example, this document provides methods and materials for using expression level profiles to identify mammal having HER2-positive breast cancer with an increased likelihood of being responsive to trastuzumab. As described herein, the presence of an elevated level of expression of at least nine of the nucleic acids listed in Table 9 within a HER2-positive breast cancer sample from a mammal can indicate that that mammal (e.g., a human) has HER2-positive breast cancer with an increased

5 likelihood of being responsive to trastuzumab. As also described herein, a mammal with breast cancer can be treated by detecting the presence of an elevated level of expression of at least nine of the nucleic acids listed in Table 9 within a HER2- positive breast cancer sample from a mammal and administering trastuzumab to that mammal.

10 Having the ability to identify mammals as having breast cancer (e.g., HER2- positive breast cancer) with an increased likelihood of being responsive to trastuzumab

as described herein can allow those breast cancer patients to be properly identified and treated in an effective and reliable manner. For example, the breast cancer 15 treatments provided herein can be used to treat breast cancer patients identified as having breast cancer (e.g., HER2-positive breast cancer) with an increased likelihood of being responsive to trastuzumab.

In general, one aspect of this document features a method for identifying a mammal as having breast cancer with an increased likelihood of being responsive to 20 trastuzumab. The method comprises, or consists essentially of, determining whether or not cancer cells from the mammal contain an elevated level of expression for at least nine of the nucleic acids listed in Table 9, wherein the presence of the elevated levels indicates that the mammal has breast cancer with an increased likelihood of being responsive to trastuzumab. The mammal can be a human. The elevated levels 25 can be determined using a cDNA-mediated annealing, selection, extension, and ligation (DASL) assay. The breast cancer can be an HER2-positive breast cancer.

In another aspect, this document features a method for identifying a mammal as having breast cancer with an increased likelihood of being responsive to trastuzumab. The method comprises, or consists essentially of, (a) determining 30 whether or not a breast cancer cells from the mammal contain an elevated level of expression for at least nine of the nucleic acids listed in Table 9, and (b) classifying the mammal as having breast cancer with an increased likelihood of being responsive to trastuzumab if the sample contains the elevated levels of the at least nine nucleic acids. The mammal can be a human. The elevated levels can be determined using a cDNA-mediated annealing, selection, extension, and ligation (DASL) assay. The breast cancer can be an HER2-positive breast cancer.

In another aspect, this document features a method for identifying a mammal as having breast cancer with an increased likelihood of being responsive to

5 trastuzumab. The method comprises, or consists essentially of, (a) detecting the presence of an elevated level of expression for at least nine of the nucleic acids listed in Table 9 in breast cancer cells from the mammal, and (b) classifying the mammal as having breast cancer with an increased likelihood of being responsive to trastuzumab based at least in part on the presence of the elevated levels. The mammal can be a 10 human. The elevated levels can be determined using a cDNA-mediated annealing, selection, extension, and ligation (DASL) assay. The breast cancer can be an HER2- positive breast cancer.

In another aspect, this document features a method for treating breast cancer. The method comprises, or consists essentially of, (a) detecting the presence of an 15 elevated level of expression for at least nine of the nucleic acids listed in Table 9 in breast cancer cells from a mammal, and (b) administering a taxane compound and trastuzumab to the mammal under conditions wherein the number of breast cancer cells within the mammal is reduced. The mammal can be a human. The elevated levels can be determined using a cDNA-mediated annealing, selection, extension, and 20 ligation (DASL) assay. The breast cancer can be an HER2-positive breast cancer.

The taxane compound can be paclitaxel.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. Although methods and materials similar or equivalent 25 to those described herein can be used to practice the invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended 30 to be limiting.

The details of one or more embodiments of the invention are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the invention will be apparent from the description and drawings, and from the claims. DESCRIPTION OF THE DRAWINGS

Figure 1: The N9831 multi-site phase III trial (NCT00005970) had three arms. Patients randomized to Arm A received doxorubicin and cyclophosphamide 5 (AC) followed by weekly paclitaxel for 12 weeks (chemotherapy alone), whereas patients in Arms B and C received chemotherapy plus 12 months of trastuzumab. Arms B and C differed in that paclitaxel was given concurrently for the first month of trastuzumab treatment in Arm C, whereas trastuzumab was started after completion of paclitaxel therapy in Arm B. Women randomly assigned to the trastuzumab arms B 10 and C had a significantly increased DFS (p<0.001) and overall survival (OS)

(p<0.001) compared with women assigned to the control (chemotherapy alone) arm.

Figure 2: Consort diagram describing the process whereby 1282 samples were selected for downstream analyses. The N9831 trial registered 3505 patients of whom 1282 (Arm A: 433, Arm B: 477, Arm C: 372) were evaluable for DASL gene 15 expression profiling. The median follow-up time was 6 years, 11 months. All tumors included in this figure were tested for HER2 protein overexpression by

immunohistochemistry (IHC) and/or gene amplification by fluorescent in situ hybridization (FISH) at a central laboratory (Mayo Clinic, Rochester, MN), and some tumors were excluded after central review of HER2 status. The largest cause of 20 exclusion was insufficient tissue. Quality control (QC) failure after DASL analysis eliminated a small number of samples.

Figure 3: Kaplan-Meier analysis of RFS in 1282 patients included in downstream analysis. In the N9831 comparison of sequential versus concurrent trastuzumab chemotherapy, there was an increase in DFS with concurrent

25 trastuzumab (Arm C) compared to sequential trastuzumab (Arm B). Although

outcome from the concurrent arm (Arm C) was slightly better than that from the sequential arm (Arm B), the significance did not cross the pre-specified O’Brien- Fleming boundary (p=0.00116) for the interim analysis of these two arms (Perez et al., J. Clin. Oncol., 29(34):4491-7 (2011)). The data shown in this figure indicate that 30 outcome among the 1282 patients used to analyze gene expression recapitulates the outcome described elsewhere for all of the patients enrolled in N9831.

Figure 4: Surface mapping reveals optimum values of q and m. A five-fold cross-validation (CV) using 100 iterations was used to identify the optimum values of q and m (number of m-genes with at least one probe above the q-quantile). For each of the 500 CV-iteration training sets, all values of m from 4 to 10 were paired with q- values from 0.25 to 0.75 by 0.01. The resulting 357 pairs of q/m values were used to determine enriched and not enriched tumors. Kaplan-Meier curves and log-rank tests were used to determine the hazard ratio and p-value for the difference between the 5 arms for enriched tumors. Panel A shows the resulting contours of the HR and Panel B shows the p-values for one representative of the 500 CV-iterations. The optimum q/m pair was chosen via the minimum p-value. The dashed-lines in both panels show the HR and p-value for optimum q/m value for this CV-iteration.

Figure 5: Network models reveal functional connections between genes 10 associated with outcome in N9831. The Cytoscape Functional Interactome tool integrates functional relationships defined by multiple bioinformatics tools, including protein-protein and gene-gene interaction datasets. This tool was used to define networks associated with either decreased RFS (Panels A and C) or increased RFS (Panels B and D) in Arm A (Panels A and B) or Arms B/C (Panels C and D).

15 Networks were constructed using genes with significant HRs (p<0.01), identified in Tables 4 and 5. Insertion of a single linker gene was allowed in network construction.

Figure 6: A cohort of immune function genes is strongly associated with outcome after trastuzumab treatment, but has no effect on RFS following

chemotherapy alone. Tumors in Arm A and Arms B/C were“binned” in to immune- 20 enriched (IRE) and not immune-enriched (NIRE) using the voting model in which enrichment was defined by the m9q41 model. Panel A shows relapse-free survival (RFS) in years for enriched and not enriched subsets of tumors from both arms. Panel B shows relapse-free survival (RFS) in years for the enriched subset of tumors from both arms. Panel C shows relapse-free survival (RFS) in years for the non-enriched 25 subset of tumors from both arms.

Figure 7: Cross-validation of the immune function score model. The data were randomly split into 5 cohorts, and the optimal q/m combination was selected based on 4 cohorts. This q/m relationship was then used to determine whether a tumor was immune-enriched (IRE) or not enriched (NIRE) in the remaining cohort. 30 Each tumor is classified 100 times (once for each cross-validation). The curves showed the results of the observed RFS based on these 100 different cross-validation sets, hence there are a total of n = 128200 observations (Arm A.IRE 18117, Arm A.NIRE 25183, Arms B/C.IRE 36877, and Arms B/C.NIRE 48023). DETAILED DESCRIPTION

This document provides methods and materials involved in identifying mammals having breast cancer (e.g., HER2-positive breast cancer) responsive to trastuzumab as well as methods and materials involved in treating mammals having 5 breast cancer (e.g., HER2-positive breast cancer) responsive to trastuzumab. For example, this document provides methods and materials for identifying a mammal as having HER2-positive breast cancer with an increased likelihood of being responsive to trastuzumab by determining whether or not a breast cancer sample from a mammal has an elevated level of expression for at least nine of the nucleic acids listed in Table 10 9. As described herein, if a mammal contains breast cancer cells (e.g., HER2-positive breast cancer cells) with an elevated level of expression for at least nine of the nucleic acids listed in Table 9, then that mammal can be classified as having HER2-positive breast cancer with an increased likelihood of being responsive to trastuzumab.

The term“elevated level” as used herein is in reference to the abundance of an 15 individual mRNA in a given sample as compared to the abundance of that mRNA in a population of samples. A level is“elevated” when an mRNA abundance equals or is greater than 0.40 quantile for the population of samples for that specific mRNA. In general, the range of expression for the nucleic acids listed in Table 9 is defined for all tested samples and expressed as a range of 0 to 1.0 with 0 being the lowest and 1.0 20 being the highest quantile. The expression of each nucleic acid within a given sample is then referred to the distribution of expression within that population and defined as “elevated” when that expression level falls within the range of 0.40 to 1.0.

As described herein, the level of expression of nine or more of the nucleic acids listed in Table 9 within breast cancer cells can be used to determine whether or 25 not a particular mammal has breast cancer (e.g., HER2-positive breast cancer) with an increased likelihood of being responsive to trastuzumab. Any appropriate breast cancer sample can be used as described herein to identify mammals having breast cancer (e.g., HER2-positive breast cancer) with an increased likelihood of being responsive to trastuzumab. For example, breast cancer tissue samples, breast cancer 30 cell samples, and breast cancer needle biopsy specimen can be used to determine whether or not a mammal has breast cancer (e.g., HER2-positive breast cancer) with an increased likelihood of being responsive to trastuzumab. In addition, any appropriate method can be used to obtain breast cancer cells. For example, a breast cancer sample can be obtained by a tissue biopsy or following a surgical resection. Once obtained, a sample can be processed prior to measuring a level of expression. For example, a breast cancer sample can be processed to extract RNA from the sample. Once obtained, the RNA can be evaluated to determine the level of an mRNA of interest. In some cases, nucleic acids present within a sample can be 5 amplified (e.g., linearly amplified) prior to determining the level of expression (e.g., using array technology). In another example, a breast cancer sample can be frozen, and sections of the frozen tissue sample can be prepared on glass slides. The frozen tissue sections can be stored (e.g., at -80ºC) prior to analysis, or they can be analyzed immediately (e.g., by immunohistochemistry with an antibody specific for a particular 10 polypeptide of interest).

Any appropriate methods can be used to determine the level of expression of one or more of the nucleic acids listed in Table 9 within breast cancer cells. For example, quantitative real time PCR, in situ hybridization, or microarray technology can be used to determine whether or not a particular sample contains an elevated level 15 of mRNA expression for a particular nucleic acid or lacks an elevated level of mRNA expression for a particular nucleic acid. In some cases, the level of expression can be determined using polypeptide detection methods such as immunochemistry techniques. For example, antibodies specific for FYN polypeptides can be used to determine the polypeptide level in a sample. In some cases, polypeptide-based 20 techniques such as ELISAs and immunocytochemistry techniques can be used to determine whether or not a particular sample contains an elevated level of polypeptide expression for a particular nucleic acid or lacks an elevated level of polypeptide expression for a particular nucleic acid.

Once the levels of expression for at least nine of the nucleic acids listed in 25 Table 9 within breast cancer cells from a mammal are determined, the levels can be compared to reference levels and used to classify the mammal as having or lacking breast cancer (e.g., HER2-positive breast cancer) with an increased likelihood of being responsive to trastuzumab as described herein.

This document also provides methods and materials for treating breast cancer 30 (e.g., HER2-positive breast cancer). In some cases, a taxane compound (e.g.,

paclitaxel, Abraxane ® , Taxol ® , or docetaxel) and trastuzumab can be administered to a mammal (e.g., a human) having breast cancer (e.g., HER2-positive breast cancer) with an increased likelihood of being responsive to trastuzumab under conditions wherein the number of breast cancer cells or the progression of the breast cancer is reduced. For example, paclitaxel can be administered to a human having breast cancer at a dose of 80-100 mg/m 2 per week, while trastuzumab is administered to that same human at a dose of 2 mg/kg every week or 6 mg/kg every 3 weeks (after loading doses). In some cases, a non-taxane compound (e.g., eribulin, carboplatin, or 5 vinorelbine) and trastuzumab can be administered to a mammal (e.g., a human)

having breast cancer (e.g., HER2-positive breast cancer) with an increased likelihood of being responsive to trastuzumab under conditions wherein the number of breast cancer cells or the progression of the breast cancer is reduced.

In some cases, a mammal (e.g., a human) with breast cancer can be treated by 10 detecting the presence of an elevated level of expression of at least nine of the nucleic acids listed in Table 9 within a HER2-positive breast cancer sample from a mammal and administering trastuzumab alone or combination with a taxane compound to that mammal.

The invention will be further described in the following examples, which do 15 not limit the scope of the invention described in the claims.

EXAMPLES

Example 1– Elevated expression levels of a panel of nucleic acids can be used to identify patients with breast cancer that is responsive to trastuzumab Patients

20 There were 3505 patients enrolled in N9831 (NCT 00005970) and randomized to 3 arms. All patients received anthracycline plus cyclophosphamide (AC). Arm A patients received paclitaxel alone; Arm B patients received paclitaxel followed by trastuzumab; and Arm C patients received paclitaxel and concurrent trastuzumab (Figure 1) after completion of the AC therapy. From these patients, 1282 samples 25 (Arm A-433, Arm B-477, Arm C-372) were evaluable for DASL gene expression profiling (Figure 2). There were some differences in the clinical-pathological characteristics between the 1282 patients included in this analysis and the 2223 patients who were excluded (Table 1); however, the differences in outcomes among the three arms for the 1282 included patients (Figure 3) were similar to those reported 30 for the trial as a whole (Perez et al., J. Clin. Oncol., 29(25):3366-73 (2011)). Since the interest of this analysis is the biological basis that underlies trastuzumab response, the 849 patients who received trastuzumab (Arms B and C, denoted Arms B/C) were pooled. Table 1. Patient demographics of 1282 samples included in the DASL analysis vs. remaining patients registered on N9831.

5 downstream analyses. The clinical-pathological characteristics and outcomes of the 1282 patients enrolled on Arms A, B, and C reported herein were similar to those of the 2223 patients on Arms A, B, and C excluded from analysis. There was a small but significant increase in representation of ER-negative patients among those included for DASL analysis.

10 DASL analysis of mRNA abundance

Individual tumor blocks were examined microscopically, and tissue punches were obtained from demarcated areas of invasive tumor using a 1 mm biopsy punch with plunger (Fisher Scientific). Total RNA was extracted from at least one 1 mm 5 tissue punch. Punches were deparaffinized in Citrisolv (Fisher Scientific) at room temperature for 30 minutes. The Citrisolv was aspirated, and the tissue was washed with 100% ethanol, vortexed, and centrifuged twice. Ethanol was removed, and the tissue was dried at 37°C for 10 minutes. The samples were then incubated in Proteinase K Digestion (PKD) buffer and proteinase K (1 μg/μL) for overnight (at 10 least 8 hours) at 56°C. The digested tissue was incubated for 15 minutes at 80°C and centrifuged (14000 rpm) for 2 minutes at room temperature. The supernatant was collected, and the RNA extraction, including DNase I treatment, was completed using the RNeasy FFPE kit on an automated QIAcube platform according to the manufacturer’s instructions (QIAGEN, Valencia, CA). The concentration of the 15 purified RNA was determined using a NanoDrop ND-1000 spectrophotometer

(Nanodrop Technologies; Wilmington, DE). Purified total RNA was stored at -80°C. Labeling and hybridizations to BeadChips (HumanRef v4 Beadchip, Illumina) were performed as described elsewhere (Ton et al., Breast Cancer Research and

Treatment, 125(3):879-83 (2011), Bibikova et al., Am. J. Pathol., 165(5):1799-807 20 (2004), Li et al., Cancer Res., 66(8):4079-88 (2006), and Reinholz et al., BMC Med.

Genomics, 3:60 (2010)) with slight modifications. Samples (200 ng RNA) were randomized across 17 plates and subsequently to 136 chips according to date and order of RNA extraction, clinicopathologic characteristics, year on study, and treatment arm. The non-background corrected expression values from BeadStudio 25 underwent a quality-control evaluation using the metrics of 1) proportion of probes detected at p<0.05, 2) inter-quartile range, and 3) skewness (Mahoney et al., BMC Res. Notes, 6(1):33 (2013)). In addition, a Stress metric, which quantified the amount of transformation that is required for an array to be normalized, was applied. The replicated patient sample with the lowest Stress value was used for analysis. Samples 30 with a Stress value>log 2 (1.5) were deemed to be poor quality and removed. The remaining data were normalized using quantile normalization. A detailed description of the quality assessment protocols that were applied to these samples is described elsewhere (Mahoney et al., BMC Res. Notes, 6(1):33 (2013)). Intra- and inter-plate technical replicates were performed using randomly selected N9831 patient samples and Universal Human Reference RNA (UHRR) control samples (Ambion Life Technologies). UHRR samples were analyzed in duplicate on every plate, with correlation coefficients of >0.9 for both UHRR and 5 patient samples (Table 2 and Table 3, respectively), well within FDA and NCI

guidelines that recommend CV values be less than 15% to be considered a precise assay.

Table 2. Replicate analyses using UHRR and patient samples indicates a high degree of analytical precision on the DASL platform.

1 Table 3.

egar ng a es an , pa rw se pearman rank correlation coefficients of quintile-normalized, log2 transformed data from 34 UHRR samples (identified as UHR01-UHR17, with duplicates assayed on different plates designated UHR.1 vs UHR.2) were analyzed. Table 2 shows correlation coefficients for all samples against all other samples. The correlation coefficients for duplicate samples on the same plate 10 averaged 0.994 (S.D.=0.006, 95%C.I. 0.98-1.00, range 0.97-1.0), whereas correlation coefficients for duplicates run on separate plates averaged 0.989 (S.D.=0.006, 95%C.I. 0.989-0.990. range 0.97-1.0). Likewise, 23 duplicate FFPE patient samples were analyzed in duplicate on the same plate and twice on two different plates (Table 3). The correlation coefficients for patient samples run in duplicate (identified by .115 and .3) on the same plate averaged 0.91 (S.D.=0.052, 95%C.I. 0.74-0.97, range 0.73- 0.97), whereas correlations for duplicate patient samples run on two different plates averaged 0.903 (S.D.=0.05 95%C.I. 0.75- 0.96, range 0.75-0.96).

Table 4. Genes with significant adjusted HRs in Arm A.

5



ıĶ







20

Regarding Table 4, genes with adjusted HRs >1 (p<0.01) are shown in the top section, whereas genes with HRs <1 (p<0.01) are shown in bottom section. CoxPH analysis (adjusted for significant clinical/pathological variables) was carried out using 5 gene expression data from the DASL arrays and RFS as a continuous variable.

Filtering was conducted to identify probes which had a median expression across all arms that were above the lowest 20% and below the highest 2%. Statistical analysis of Cox hazard ratios (HR)

10 The primary endpoint was relapse-free survival (RFS), which was defined as the time from randomization to first local, regional, or distant recurrence, or the development of a new contralateral primary breast cancer. Multivariable Cox models (adjusting for nodal status, tumor size, hormone receptor status, age, and tumor grade) 21 were used to evaluate the association between RFS and probe expression for all genes. The association was assessed separately within each patient group to understand biological processes that might be involved with response to trastuzumab. Probes meeting the filtering criteria and having an adjusted-model p<0.01 were considered to 5 be significantly associated with RFS for the purpose of the functional analysis. Cox proportional models, which included the prognostic factors listed above as adjusting variables, were evaluated on the set of all patients and included probe, treatment group, and probe-treatment group interaction terms to identify probes that were potentially predictive of trastuzumab response.

10

Functional analysis

Cox hazard ratios were determined for all genes from the DASL analysis using time to event (RFS) as a continuous variable, as described herein. The Cytoscape Functional Interactome tool (Matthews et al., Nucleic Acids Res., 37(Database 15 issue):D619-22 (2009)) was used to define networks associations among genes with Cox hazard ratios with adjusted-model p<0.01. Functional processes associated with network components were deduced from the pathway enrichment statistics function within the Cytoscape Functional Interactome tool. 20 Enrichment of Gene Ontology Biological Process terms

Functional ontology enrichment was determined by analysis of Gene Ontology Biological Process (GO:BP) terms using Fisher’s exact test. Individual GO terms apply to many genes, and individual genes may have many associated GO terms. This one-to-many relationship between genes and Gene Ontology (GO) terms was 25 downloaded from the BioMart portal at Ensembl

(http://useast.ensembl.org/biomart/martview/). The Ensembl human gene annotation version 70 (v70) was used to identify genes. A developed script was used to assign each gene into all possible GO terms to which it belongs. This was done on both the genes with significant hazard ratios (HR), as well as all genes in the v70 annotation. 30 For each of the GO terms, a Fisher’s exact test was performed on a two-by-two

contingency table with: (1) the number of genes with significant HR belonging to the GO term from Arm A; (2) the number of genes with significant HR belonging to the GO term from Arms B/C; (3) the numbers of genes, excluding those in (1), from all v70 genes that assigned to the GO term; and (4) the numbers of genes, excluding those in (2), from all v70 genes that assigned to the GO Term. 5 Statistical Analysis

A decision was made not to split the samples into separate training and validation sets for the signature development due to the limited power in the overall dataset (204 recurrence events, with 89 in Arm A and 115 in Arms B/C). A split- sample approach, in which the data are divided into two cohorts for training and 10 validation, fails to use all the information in the sample for signature development, yielding a noisy signature (Subramanian and Simon, Stat. Med., 30(6):642-53 (2011)). For a preliminary validation of the signature, cross-validation was used as described below.

The analyses focused on genes that had a plausible biological function with 15 respect to trastuzumab response, as identified by network and functional ontology analysis. A voting scheme was used to develop a signature from a cohort of genes with HR<1.0, adjusted-model p<0.01, and interaction p<0.05. Since it is likely that the contribution of individual genes within the biological process might vary from tumor to tumor, a voting scheme was used to develop a signature. A tumor was 20 designated as enriched for a biological function if m or more of the genes in the

functional group had one or more probes expressed above a quantile q threshold. To determine the best pair of m and q values, a response surface was searched that consisted of all quantile values of q, between 0.25 and 0.75 by increments of 0.01. For each q/m pair, tumors were classified as enriched if they had m or more genes 25 with at least one probe having an expression value above the q quantile for that probe across all samples. The q/m pair that was selected as best had the smallest p-value for a comparison of RFS between women with enriched tumors (as determined by the voting scheme based on q/m values) who were treated with trastuzumab compared to women with enriched tumors that were not treated with trastuzumab.

30

Cross-validation of the signature development

A cross-validation method was used to assess whether the observed predictive nature of the signature was generalizable. Since the feature selection was based on identified biological processes that differed between Arms A and B/C, it was not possible to do a complete cross-validation of the entire process starting from feature selection. However, the development of the signature was cross-validated based on the selected probes.

A five-fold cross validation was replicated 100 times for determining the 5 performance of the voting scheme for classifying tumors as enriched or not enriched and whether the resulting signature appears predictive of RFS. During each cross validation replicate, all patients were randomly assorted into five different cohorts. Four of the cohorts were then used to define the best set of q/m pairs, searching the q/m grid (Figure 4). The q/m pairs determined in this fashion were then used to 10 define the immune enrichment scores of the“left out” 1/5 of the tumors. This

procedure was repeated five times leaving out one of the cohorts each time.

Replicating this analysis 100 times determined each tumor as immune enriched or not-enriched. 15 Final Voting Scheme Values and Analysis

Using the selected q/m values, patients were grouped into enriched and non- enriched groups. Kaplan-Meier curves were used to summarize the RFS for each group and compared with a logrank test. Multivariable Cox models adjusted for the prognostic factors (listed above) and with treatment group, enriched status

20 (determined by the voting scheme), and the treatment group-enriched status

interaction term were used to determine whether the signature was potentially predictive. Gene Expression and Outcome Association

25 Multivariable Cox regression was used to identify genes significantly

associated with RFS in Arm A and Arms B/C. 473 genes were identified that were associated with RFS at adjusted-model p<0.01 in Arm A (Table 4). We identified 510 genes significantly associated with RFS at adjusted-model p<0.01 in Arms B/C (Table 5).

30 Table 5. Genes with significant adjusted HRs in Arms B/C (chemotherapy plus trastuzumab) of N9831.

5

Regarding Table 5, adjusted HRs for genes associated with decreased RFS (top section) and increased RFS (bottom section) at p<0.01 were determined as described herein.

5

Functional Analyses

Cytoscape Functional Interactome tools were used to construct four interactome models using genes significantly associated with outcome (Figure 5). Each interactome map contained 10-12 highly interconnected modules (color coded) 10 that were connected to other modules within the networks. Pathway enrichment statistics were used to assess the biological significance of these four network models. The top-scoring pathways for each network are provided in Table 6. The most significant pathways associated with decreased RFS (HR>1.0) in Arm A were integrin signaling, co-regulation of androgen receptor activity, and vascular smooth muscle 15 contraction (Table 6, panel A). Pathways associated with increased RFS (HR<1.0) in Arm A included formation and maturation of mRNA transcript, ribosome, neuroactive ligand-receptor interaction, homologous recombination, and innate immunity signaling (Table 6, panel B). 20 Table 6. Pathway enrichment statistics from Cytoscape networks. Significant pathways were filtered for p<0.001 and FDR<0.1. Pathways were ranked on number of genes from network in the individual pathways.

34

Among the trastuzumab-treated patients (Arms B/C), integrin signaling, Alzheimer disease-presenilin pathway, and M/G1 cell cycle transition pathways were 5 the most significant pathways linked to decreased RFS (HR>1.0) (Table 6, pane C).

The most significant pathways associated with increased RFS (HR<1.0) after adjuvant trastuzumab (Table 6, panel D) included cytokine-cytokine receptor interaction, T-cell receptor signaling in CD8 + T-cells, INF-gamma pathway, TNF receptor signaling pathway, cell surface interaction at the vascular endothelium, and class 1 PI3K 10 signaling events. The observation that 4/6 significant pathways are linked to

immunological functions strongly suggests an association between immune response and increased RFS in trastuzumab-treated patients with HER2-positive breast tumors.

Gene Ontology Biological Process terms were defined for each gene with a significant HR (adjusted-model p<0.01). Fisher’s exact test was used to identify 13 15 GO biological process descriptors that exhibited significantly different distribution in Arms A and B/C at p<0.01 (Table 7); the most significant was immune response (GO:0006955_BP). Ten of 13 biological processes were linked to immune functions, including T-cell and B-cell responses, chemokine signaling and chemotaxis, and inflammation. These results suggest that a major immunological component is predictive of RFS among trastuzumab-treated patients with early stage HER2-postive breast cancer.

5 87 immune function genes, defined by the 10 immune function GO terms that were enriched in Arms B/C (Table 7) and associated with increased RFS (HR<1.0) at adjusted-model p<0.01, were identified (Table 8). To find which of these probes were potentially predictive, probes among the 87 immune function genes that had a significant interaction term (p<0.05) were selected. This resulted in a list of 14 genes 10 (Table 9).

R

Table 7. Analysis of biological process defined by gene ontology (GO) terms reveals enrichment of immune function terms in Arms B/C. Thirteen GO biological process terms were enriched in Arms B/C, relative to Arm A. Ten of these, labeled with“R,” 15 were linked to various immune functions. GO terms associated with signal

transduction or response to drug are labeled with“G” and“B,” respectively.

Table 8. A cohort of 87 immune function genes are associated with RFS in N9831. As listed in Table 7, 10 GO terms associated with various immune functions were identified as enriched in a comparison of Arm A versus Arms B/C. All genes with significant HRs (p<0.01) in either arm were then used to generate a list of 87 immune 5 function genes that are significantly associated with RFS in either or both arms.

Table 9. Interaction p-values. The table displays the hazard ratios (HRs) for the probe expression effect (HR.exprs), treatment arm effect (HR.rand.arm), and the interaction of probe and treatment arm (HR interaction exprs:arm) in a multivariable Cox model that also contained prognostic variables (nodal status, tumor size, hormone 5 receptor status, age, and tumor grade) as adjusting variables. The prognostic

adjusting variables are not shown in the table. It also includes the p-values for the probe expression, treatment arm, and the probe-treatment arm interaction variables: p.exprs, p.rand.arm, and p interaction exprs:arm, respectively.

10

Voting scheme parameters

The response surface analysis resulted in two unique sets of q/m values. The first set q=40 and m=9 (q40m9) occurred 235 times (47%) and identified 226 (52.2%) enriched patients in Arm A and 441 (51.9%) enriched patients in Arms B/C. The 15 second set q=58 and m=8 (q58m8) occurred 265 times (53%) and identified 139 (32.1%) enriched patients in Arm A and 310 (36.5%) enriched patients in Arms B/C. Since both sets of optimum q/m values occurred about evenly, q/m pair q40m9 was selected as the optimum. 20 Final Signature Analysis

Based on the optimum set of q/m values, a tumor was designated as immune- enriched if any 9 (m) or more of the 14 immune function genes were expressed at or above the 0.40 quantile (q) expression value for one or more probes. This signature was used to“bin” tumors in Arm A and Arms B/C into immune response enriched 25 (IRE) and non-immune response enriched (NIRE) groups. The difference in RFS between the IRE and NIRE tumors in Arm A was not statistically significant (HR=0.90, p=0.64, black and red labeled curves, Figure 6A). Patients with IRE tumors exhibited significantly increased RFS after adjuvant trastuzumab (green labeled curve), compared to IRE patients who did not receive trastuzumab (black labeled curve; HR=0.35, p<0.0001). Furthermore, the RFS of trastuzumab-treated 5 patients whose tumors were NIRE (blue labeled curve) was not significantly different from RFS of IRE patients who received chemotherapy alone (HR=0.89, p=0.53). A multivariable Cox model was evaluated that included the prognostic factors as adjusting variables, immune-enrichment status, treatment group, and an immune- enrichment status and treatment group interaction group term. In this model, the 10 interaction term value was significant (p<0.0001). Figures 6B and 6C show the effect of the interaction on trastuzumab response. There is a difference in RFS for patients with IRE tumors treated with trastuzumab compared to those who received chemotherapy alone (Figure 6B; HR=0.36, p<0.0001). There is no difference in RFS for patients with NIRE tumors treated with trastuzumab and those who received 15 chemotherapy alone (Figure 6C; HR=0.98; p=0.91). Cross-validation results

To validate the signature, a five-fold cross validation was performed. The immune enrichment status of the tumors in the“left out” groups for each iteration20 were combined, so that all samples within the study were assigned as enriched or non- enriched. Figure 7 shows the RFS curves for each enrichment status and treatment group combination obtained from cross-validation. There is no difference in RFS between Arm A and Arms B/C for NIRE tumors (HR = 0.93), but there is a difference in RFS between Arm A and Arms B/C for IRE tumors (HR = 0.32). The p-value for 25 the enrichment status-treatment group interaction was less than 0.0001 in the

multivariable Cox model that adjusted for known prognostic factors.

These results demonstrate that when nine or more of the fourteen immune function genes listed in Table 9 are at or above 0.40 quantile for the population for a particular patient, then that patient has an increased likelihood of remission free 30 survival following treatment with adjuvant trastuzumab. Example 2– Treating HER2-positive breast cancer with trastuzumab A patient with HER2-positive breast cancer is identified as having an increased level of expression of nine or more of the fourteen genes listed in Table 9 and is administered a taxane agent (e.g., paclitaxel) and trastuzumab. The taxane 5 agent is administered at a dose that is between 80 and 100 mg/m 2 per week.

Trastuzumab is administered at a dose that is 2 mg/kg every week or 6 mg/kg every 3 weeks (after loading doses). OTHER EMBODIMENTS

10 It is to be understood that while the invention has been described in

conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.

15