Title:
METHODS AND COMPOSITIONS FOR CLASSIFYING DLBCL
Document Type and Number:
WIPO Patent Application WO/2017/194670
Kind Code:
A1
Abstract:
Provided herein are methods and compositions to classify DLBCL subtypes using quantitative RT-PCR.

Inventors:
DUA RAJIV (US)
FONTECHA MARCEL (US)
LI YAN (US)
LIU WEI-MIN (US)
SANTINI CHRISTOPHER (US)
STEINER LORI (US)
TAI YU CHUAN (US)
Application Number:
PCT/EP2017/061313
Publication Date:
November 16, 2017
Filing Date:
May 11, 2017
Assignee:
ROCHE DIAGNOSTICS GMBH (DE)
HOFFMANN LA ROCHE (CH)
ROCHE MOLECULAR SYSTEMS INC (US)
International Classes:
C12Q1/68
Domestic Patent References:
WO2015135935A12015-09-17
WO2015069790A12015-05-14
Foreign References:
US20050079513A12005-04-14
US20140170730A12014-06-19
US20140051126A12014-02-20
US7363168B22008-04-22
Other References:
YU-DONG CAI ET AL: "A Unified 35-Gene Signature for both Subtype Classification and Survival Prediction in Diffuse Large B-Cell Lymphomas", PLOS ONE, vol. 5, no. 9, 13 September 2010 (2010-09-13), pages e12726, XP055389620, DOI: 10.1371/journal.pone.0012726
MATTHEW A. CARE ET AL: "A Microarray Platform-Independent Classification Tool for Cell of Origin Class Allows Comparative Analysis of Gene Expression in Diffuse Large B-cell Lymphoma", PLOS ONE, vol. 8, no. 2, 12 February 2013 (2013-02-12), pages e55895, XP055389622, DOI: 10.1371/journal.pone.0055895
WRIGHT G ET AL: "A gene expression-based method to diagnose clinically distinct subgroups of diffuse large B cell lymphoma", PROCEEDINGS NATIONAL ACADEMY OF SCIENCES PNAS, NATIONAL ACADEMY OF SCIENCES, US, vol. 100, no. 17, 19 August 2003 (2003-08-19), pages 9991 - 9996, XP002322638, ISSN: 0027-8424, DOI: 10.1073/PNAS.1732008100
INNIS ET AL.: "PCR Strategies", 1995, ACADEMIC PRESS
INNIS ET AL.: "PCR Protocols : A Guide to Methods and Applications", 1990, ACADEMIC PRESS
LACKIE: "DICTIONARY OF CELL AND MOLECULAR BIOLOGY", 2007, ELSEVIER
SAMBROOK ET AL.: "MOLECULAR CLONING, A LABORATORY MANUAL", 1989, COLD SPRING HARBOR LABORATORY PRESS
DUNLEAVY ET AL., ONCOLOGY, 15 April 2014 (2014-04-15)
NOWAKOWSKI; CZUCZMAN: "Am. Soc. Clin. Oncol. Educ. Book", 2015, pages: E449
PFAFFL, METHODS: THE ONGOING EVOLUTION OF QPCR, vol. 50, 2010
Attorney, Agent or Firm:
SCHWARZ, Ralf et al. (DE)
Claims:
1. A method of identifying an individual with diffuse large B cell lymphoma (DLBCL) comprising:

(a) obtaining a sample from the individual (DLBCL sample);

(b) detecting by qRT-PCR the expression of germinal center B cell (GCB) markers ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and LRMP in the DLBCL sample;

(c) detecting by qRT-PCR the expression of activated B cell (ABC) markers ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and FAM46C in the DLBCL sample; and

(d) detecting by qRT-PCR the expression of a control gene in the DLBCL sample;

(i) wherein the ratio of GCB marker expression to ABC marker expression being higher than a GCB threshold value in said individual's sample indicates sensitivity of said individual to the administration of R-CHOP (rituximab or etoposide; cyclophosphamide; doxorubicin; vincristine; and prednisolone); or

(ii) wherein the ratio of ABC marker expression to GCB marker expression being higher than an ABC threshold value in said individual's sample indicates sensitivity of said individual to an alternative administration.

2. The method according to claim 1 comprising steps (a) to (d), wherein (b) and/or (c) are carried out as follows:

(b) detecting by qRT-PCR the expression of germinal center B cell (GCB) markers ZNF318, PTK2, and SSBP2 in the DLBCL sample;

(c) detecting by qRT-PCR the expression of activated B cell (ABC) markers CCND2, FOXP1, and JADE3 in the DLBCL sample.

3. The method of claim 1 or 2, wherein the GCB threshold value is set based on the ratio of GCB marker expression to ABC marker expression in a GCB positive control.

4. The method of any one of the preceding claims, wherein the ABC threshold value is set based on ABC marker expression to GCB marker expression in an ABC positive control.

5. The method of any one of the preceding claims, wherein the sample is from a lung biopsy or bronchoalveolar lavage.

6. The method of any one of the preceding claims, wherein the sample is formalin-fixed paraffin embedded tissue (FFPET).

7. The method of any one of claims 1-4, wherein the sample is blood, plasma, or serum.

8. The method of any one of the preceding claims, wherein the detecting of (b) and (c) are carried out in multiplex in multiple vessels.

9. The method of any one of the preceding claims, wherein each GCB and ABC marker is individually detected.

10. The method of any one of claims 1-8, wherein the detecting of (b) is carried out in a single vessel for each sample.

11. The method of any one of claims 1-8 and 10, wherein the detecting of (c) is carried out in a single vessel for each sample.

12. The method of any of the preceding claims, wherein the detecting of (d) is carried out in the same vessel(s) as the detecting of (b) and (c).

13. The method of any one of the preceding claims, wherein the alternative administration includes a BTK inhibitor, SYK inhibitor, NFkB inhibitor, or immunomodulatory agent.

14. The method of claim 13, wherein the alternative administration further includes R-CHOP.

15. The method of any one of the preceding claims, comprising adjusting the level of expression detected for the genes in steps (b) and (c) based on the expression detected of the control gene in (d).

16. A method of determining the cell of origin (COO) subtype for an individual with diffuse large B cell lymphoma (DLBCL) comprising:

(a) obtaining a sample from the individual (DLBCL sample);

(b) detecting by qRT-PCR the expression of germinal center B cell (GCB) markers ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and LRMP in the DLBCL sample;

(c) detecting by qRT-PCR the expression of activated B cell (ABC) markers ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and FAM46C in the DLBCL sample;

(d) detecting by qRT-PCR the expression of a control gene in the DLBCL sample; and

(e) determining that the COO subtype of the individual is

(i) GCB if the ratio of GCB marker expression to ABC marker expression is higher than a GCB threshold value; or

(ii) ABC if the ratio of ABC marker expression to GCB marker expression is higher than an ABC threshold value.

17. The method according to claim 16 comprising steps (a) to (d), wherein (b) and/or (c) are carried out as follows:

(b) detecting by qRT-PCR the expression of germinal center B cell (GCB) markers ZNF318, PTK2, and SSBP2 in the DLBCL sample;

(c) detecting by qRT-PCR the expression of activated B cell (ABC) markers CCND2, FOXP1, and JADE3 in the DLBCL sample.

18. The method of claim 16 or 17, wherein the GCB threshold value is set based on the ratio of GCB marker expression to ABC marker expression in a GCB positive control.

19. The method of any one of claims 16-18, wherein the ABC threshold value is set based on ABC marker expression to GCB marker expression in an ABC positive control.

20. The method of any one of claims 16-19, wherein the sample is from a lung biopsy or bronchoalveolar lavage.

21. The method of any one of claims 16-20, wherein the sample is formalin-fixed paraffin embedded tissue (FFPET).

22. The method of any one of claims 16-19, wherein the sample is blood, plasma, or serum.

23. The method of any one of claims 16-22, wherein the detecting of (b) and (c) are carried out in multiplex in multiple vessels.

24. The method of any one of claims 16-23, wherein each GCB and ABC marker is individually detected.

25. The method of any one of claims 16-23, wherein the detecting of (b) is carried out in a single vessel for each sample.

26. The method of any one of claims 16-23 and 25, wherein the detecting of (c) is carried out in a single vessel for each sample.

27. The method of any one of claims 16-26, wherein the detecting of (d) is carried out in the same vessel as the detecting of (b) and (c).

28. The method of any one of claims 16-27, comprising adjusting the level of expression detected for the genes in steps (b) and (c) based on the expression detected of the control gene in (d).

29. A kit comprising:

(a) a mixture comprising a primer set and a fluorescently labeled probe that specifically amplifies and detects each of germinal center B cell (GCB) marker ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and LRMP gene products or each of germinal center B cell (GCB) marker ZNF318, PTK2, and SSBP2 gene products; and

(b) a mixture comprising a primer set and a fluorescently labeled probe that specifically amplifies and detects each of activated B cell (ABC) marker ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and FAM46C gene products or each of activated B cell (ABC) marker CCND2, FOXP1, and JADE3 gene products.

30. The kit of claim 29, wherein the mixtures of (a) and (b) each further comprise a primer set and a fluorescently labeled probe that specifically amplifies and detects a control gene product, wherein the fluorescently labeled probe that specifically detects the control gene product is differently labeled than the fluorescently labeled probes in mixture (a) and mixture (b).

31. The kit of claim 29 or 30, wherein the fluorescently labeled probes in mixture (a) are all labeled with the same fluorescent label.

32. The kit of any one of claims 29-31, wherein the fluorescently labeled probes in mixture (b) are all labeled with the same fluorescent label.

33. The kit of any one of claims 29-32, further comprising reverse transcriptase and/or thermostable DNA polymerase.

34. The kit of any one of claims 29-33, further comprising an enzyme with both reverse transcriptase and DNA polymerase activity.

35. The kit of any one of claims 29-34, further comprising at least one control sample.

36. A kit comprising: a plurality of mixtures that comprise a primer set and a fluorescently labeled probe that specifically amplifies and individually detects each of GCB marker ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and LRMP gene products; each of ABC marker ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and FAM46C gene products; and a control gene product, wherein the primer set and fluorescently labeled probe that specifically amplifies and individually detects the control gene product are present in each of the plurality of mixtures.

37. The kit of claim 36, wherein the kit comprises 3-15 mixtures.

38. The kit of claim 36 or 37, wherein the kit comprises 5 mixtures.

39. The kit of any one of claims 36-38, wherein the kit comprises 5 mixtures with primer sets and fluorescently labeled probes specific for GCB and ABC markers as follows:

Mixture 1    Mixture 2    Mixture 3    Mixture 4    Mixture 5
ARID3A       TCF4         PDK3         SSBP2        JADE3
CCND2        ZNF318       HMGN1        BCL6         PIM2
FOXP1        KIAA0226L    PTK2         LRMP         FAM46C
Control      Control      Control      Control      Control

40. The kit of any one of claims 36-39, further comprising reverse transcriptase and/or thermostable DNA polymerase.

41. The kit of any one of claims 36-40, further comprising an enzyme having both reverse transcriptase and DNA polymerase activity.

42. The kit of any one of claims 36-41, further comprising at least one control sample.

43. The kit of any one of claims 36-42, comprising a GCB positive control and an ABC positive control.

44. The kit of any one of claims 36-43, comprising a negative control.

Description:
METHODS AND COMPOSITIONS FOR CLASSIFYING DLBCL

BACKGROUND OF THE INVENTION

Diffuse Large B-Cell Lymphoma (DLBCL) accounts for 30-35% of all non-Hodgkin lymphomas. DLBCL is biologically aggressive, but can be cured in >50% of cases. However, up to one third of patients develop resistance and are refractory to treatment. The standard treatment is CHOP chemotherapy or chemotherapy plus rituximab (R-CHOP). DLBCL can be classified into three different molecular cell-of-origin (COO) subtypes: germinal center B-cell (GCB), activated B-cell (ABC), and primary mediastinal B-cell lymphoma (PMBCL). Retrospective analysis by the Lymphoma/Leukemia molecular profiling project demonstrated that DLBCL patients with GCB subtype have better prognosis than those with ABC subtype when treated with R-CHOP, and drug candidates to improve ABC subtype prognosis are in development. Current methods for distinguishing GCB and ABC subtypes include immunohistochemistry (IHC) and gene expression profiling. IHC and gene expression profiling technologies are time consuming, and have additional drawbacks for subtype classification. For example, gene expression technology uses frozen samples and not the formaldehyde fixed paraffin embedded tissue (FFPET) specimens that are typically collected in clinical laboratories. Nanostring Technologies (Seattle, WA) has developed a gene expression profiling signature that classifies DLBCL subtypes using FFPET samples, but the Nanostring platform is not widely adopted in the marketplace and it is expensive. IHC also uses FFPET samples but shows high assay variability across laboratories.

SUMMARY OF THE INVENTION

Provided herein are methods and compositions for determining diffuse large B cell lymphoma (DLBCL) subtype and treating DLBCL patients. In some embodiments, provided are methods of identifying an individual with DLBCL comprising: (a) obtaining a sample from the individual (DLBCL sample); (b) detecting by qRT-PCR the expression of GCB markers ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and/or LRMP in the DLBCL sample; (c) detecting by qRT-PCR the expression of ABC markers ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and/or FAM46C in the DLBCL sample and the control sample; and (d) detecting by qRT-PCR the expression of a control gene (e.g., internal control) in the DLBCL sample; wherein the ratio of GCB marker expression to ABC marker expression being higher than a GCB threshold value in said individual's sample indicates sensitivity of said individual to the administration of R-CHOP (rituximab or etoposide; cyclophosphamide; doxorubicin; vincristine; and prednisolone). In some embodiments, the ratio of ABC marker expression to GCB marker expression being higher than an ABC threshold value in the individual's sample indicates sensitivity of the individual to an alternative administration. In some embodiments, the method further comprises adjusting the level of expression detected for the genes in steps (b) and (c) based on the expression detected of the control gene in (d). In some embodiments, the administration is provided directly to the patient.

In some embodiments, provided are methods of providing treatment for an individual with DLBCL comprising: (a) obtaining a sample from the individual (DLBCL sample); (b) detecting by qRT-PCR the expression of GCB markers ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and/or LRMP in the DLBCL sample; (c) detecting by qRT-PCR the expression of ABC markers ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and/or FAM46C in the DLBCL sample and the control sample; (d) detecting by qRT-PCR the expression of a control gene (e.g., internal control) in the DLBCL sample; and (e) providing treatment for the individual. In some embodiments, the treatment comprises administration of R-CHOP (rituximab or etoposide; cyclophosphamide; doxorubicin; vincristine; and prednisolone) if the ratio of GCB marker expression to ABC marker expression is higher than a GCB threshold value. In some embodiments, the treatment comprises an alternative therapy if the ratio of ABC marker expression to GCB marker expression is higher than an ABC threshold value. In some embodiments, the method further comprises adjusting the level of expression detected for the genes in steps (b) and (c) based on the expression detected of the control gene in (d). In some embodiments, the treatment is provided directly to the patient.

In some embodiments, 1, 2, 3, 4, 5, or 6 GCB markers are detected in step (b) in any combination. In some embodiments, all 7 GCB markers are detected in step (b). In some embodiments, 1, 2, 3, 4, 5, 6, or 7 ABC markers are detected in step (c) in any combination. In some embodiments, all 8 ABC markers are detected in step (c). In some embodiments, step (b) comprises detecting the expression of ZNF318, SSBP2, and PTK2. In some embodiments, step (c) comprises detecting the expression of CCND2, FOXP1, and JADE3.

In some embodiments, the methods further comprise carrying out steps (b)-(d) on a GCB positive control, and the result used to set the GCB threshold value. In some embodiments, the GCB positive control comprises 51-100% known GCB sample, e.g., 55-85%, 55-65%, 60-70% known GCB sample. In some embodiments, the remaining GCB positive control is comprised of known ABC sample. In some embodiments, the method further comprises carrying out steps (b)-(d) on an ABC positive control, and the result used to set the ABC threshold value. In some embodiments, the ABC positive control comprises 51-100% known ABC sample, e.g., 55-85%, 55-65%, 60-70% known ABC sample. In some embodiments, the remaining ABC positive control is comprised of known GCB sample. In some embodiments, the method further comprises carrying out steps (b)-(d) on a negative control sample, e.g., a sample lacking nucleic acids, a non-cancer sample, or a sample substantially lacking the recited ABC and GCB marker nucleic acids. In some embodiments, the sample is from lung biopsy (e.g., tumor tissue) or bronchoalveolar lavage. In some embodiments, the sample is formalin-fixed paraffin embedded tissue (FFPET), e.g., from a tumor sample, either in the lung or metastasized. In some embodiments, the sample is blood, plasma, serum, urine, mucous, mucosal tissue, or saliva. In some embodiments, the detecting of (b) and (c) are carried out in multiplex in multiple vessels. For example, the detecting of (b) can be carried out in 1-6 vessels, with each of the GCB markers detected using a different label for each GCB marker probe, or using the same label on two or more GCB marker probes. Similarly, the detecting of (c) can be carried out in 1-7 vessels, with each of the ABC markers detected using a different label for each ABC marker probe, or using the same label on two or more ABC marker probes. In some embodiments, each GCB and ABC marker is individually detected. In some embodiments, the detecting of (b) is carried out in a single vessel for each sample. In some embodiments, the detecting of (c) is carried out in a single vessel for each sample. In some embodiments, the detecting of (d) is carried out in the same vessel(s) as the detecting of (b) and (c).

In some embodiments, the alternative administration or therapy includes a BTK inhibitor, SYK inhibitor, NFkB inhibitor, or immunomodulatory agent. In some embodiments, the alternative administration or therapy comprises R-CHOP, alone or in combination with a BTK inhibitor, SYK inhibitor, NFkB inhibitor, or immunomodulatory agent.

Further provided are methods for determining the cell of origin (COO) subtype for an individual with DLBCL comprising (a) obtaining a sample from the individual (DLBCL sample); (b) detecting by qRT-PCR the expression of GCB markers ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and/or LRMP in the DLBCL sample; (c) detecting by qRT-PCR the expression of ABC markers ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and/or FAM46C in the DLBCL sample; (d) detecting by qRT-PCR the expression of a control gene in the DLBCL sample; and (e) determining that the COO subtype of the individual is (i) germinal center B cell (GCB) if the ratio of GCB marker expression to ABC marker expression is higher than a GCB threshold value, or (ii) activated B cell (ABC) if the ratio of ABC marker expression to GCB marker expression is higher than an ABC threshold value. In some embodiments, the method further comprises adjusting the level of expression detected for the genes in steps (b) and (c) based on the expression detected of the control gene in (d).
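The ratio-and-threshold determination described above can be sketched in Python. This is only an illustration under stated assumptions: the text does not specify here how marker expression is summarized, so the sketch assumes qRT-PCR cycle threshold (Ct) values, normalizes each marker to the control gene as 2^(Ct_control - Ct_marker), and averages within each marker group. The function names, the Ct values, the threshold values, and the "unclassified" outcome are all hypothetical.

```python
from statistics import mean

# Marker lists as recited in the text.
GCB_MARKERS = ("ZNF318", "PDK3", "HMGN1", "PTK2", "SSBP2", "BCL6", "LRMP")
ABC_MARKERS = ("ARID3A", "CCND2", "FOXP1", "KIAA0226L",
               "JADE3", "PIM2", "TCF4", "FAM46C")


def relative_expression(ct_marker, ct_control):
    """Assumed normalization: expression relative to the control gene."""
    return 2.0 ** (ct_control - ct_marker)


def group_expression(ct_by_gene, markers, ct_control):
    """Assumed summary: mean relative expression of the measured markers."""
    values = [relative_expression(ct_by_gene[m], ct_control)
              for m in markers if m in ct_by_gene]
    if not values:
        raise ValueError("no markers from this group were measured")
    return mean(values)


def classify_coo(ct_by_gene, ct_control, gcb_threshold, abc_threshold):
    """Return 'GCB', 'ABC', or 'unclassified' (the last is an assumption)."""
    gcb = group_expression(ct_by_gene, GCB_MARKERS, ct_control)
    abc = group_expression(ct_by_gene, ABC_MARKERS, ct_control)
    if gcb / abc > gcb_threshold:
        return "GCB"
    if abc / gcb > abc_threshold:
        return "ABC"
    return "unclassified"


# Hypothetical Ct values for the six-marker subset named in the text.
sample = {"ZNF318": 24.0, "SSBP2": 24.0, "PTK2": 24.0,
          "CCND2": 26.0, "FOXP1": 26.0, "JADE3": 26.0}
print(classify_coo(sample, ct_control=25.0,
                   gcb_threshold=2.0, abc_threshold=2.0))  # prints GCB
```

In practice the two thresholds would come from the GCB and ABC positive controls rather than from fixed constants as in this sketch.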

In some embodiments, 1, 2, 3, 4, 5, or 6 GCB markers are detected in step (b) in any combination. In some embodiments, all 7 GCB markers are detected in step (b). In some embodiments, 1, 2, 3, 4, 5, 6, or 7 ABC markers are detected in step (c) in any combination. In some embodiments, all 8 ABC markers are detected in step (c). In some embodiments, step (b) comprises detecting the expression of ZNF318, SSBP2, and PTK2. In some embodiments, step (c) comprises detecting the expression of CCND2, FOXP1, and JADE3.

In some embodiments, the methods further comprise carrying out steps (b)-(d) on a GCB positive control, and the result used to set the GCB threshold value. In some embodiments, the GCB positive control comprises 51-100% known GCB sample, e.g., 55-85%, 55-65%, 60-70% known GCB sample. In some embodiments, the remaining GCB positive control is comprised of known ABC sample. In some embodiments, the method further comprises carrying out steps (b)-(d) on an ABC positive control, and the result used to set the ABC threshold value. In some embodiments, the ABC positive control comprises 51-100% known ABC sample, e.g., 55-85%, 55-65%, 60-70% known ABC sample. In some embodiments, the remaining ABC positive control is comprised of known GCB sample. In some embodiments, the method further comprises carrying out steps (b)-(d) on a negative control sample. In some embodiments, the sample is from lung biopsy (e.g., tumor tissue) or bronchoalveolar lavage. In some embodiments, the sample is formalin-fixed paraffin embedded tissue (FFPET), e.g., from a tumor sample, either in the lung or metastasized. In some embodiments, the sample is blood, plasma, serum, urine, mucous, mucosal tissue, or saliva. In some embodiments, the detecting of (b) and (c) are carried out in multiplex in multiple vessels. For example, the detecting of (b) can be carried out in 1-6 vessels, with each of the GCB markers detected using a different label for each GCB marker probe, or using the same label on two or more GCB marker probes. Similarly, the detecting of (c) can be carried out in 1-7 vessels, with each of the ABC markers detected using a different label for each ABC marker probe, or using the same label on two or more ABC marker probes. In some embodiments, each GCB and ABC marker is individually detected. In some embodiments, the detecting of (b) is carried out in a single vessel for each sample. In some embodiments, the detecting of (c) is carried out in a single vessel for each sample. 
In some embodiments, the detecting of (d) is carried out in the same vessel(s) as the detecting of (b) and (c).

In some embodiments, the method further comprises providing treatment for the individual depending on the COO subtype.

Further provided are kits for determining the COO subtype of an individual with DLBCL. In some embodiments, the kit comprises (a) a mixture comprising a primer set and a fluorescently labeled probe that specifically amplifies and detects at least one of GCB marker ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and LRMP gene products (e.g., 2, 3, 4, 5, 6, or all 7); and (b) a mixture comprising a primer set and a fluorescently labeled probe that specifically amplifies and detects at least one of ABC marker ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and FAM46C gene products (e.g., 2, 3, 4, 5, 6, 7, or all 8). In some embodiments, the kit includes primer sets and probes to specifically amplify and detect all 7 GCB markers and all 8 ABC markers. In some embodiments, mixture (a) comprises a primer set and a fluorescently labeled probe that specifically amplifies and detects ZNF318, SSBP2, and PTK2. In some embodiments, mixture (b) comprises a primer set and a fluorescently labeled probe that specifically amplifies and detects CCND2, FOXP1, and JADE3. In some embodiments, the mixtures of (a) and (b) each further comprise a primer set and a fluorescently labeled probe that specifically amplifies and detects a control gene product, wherein the fluorescently labeled probe that specifically detects the control gene product is differently labeled than the fluorescently labeled probes in mixture (a) and mixture (b). In some embodiments, the fluorescently labeled probes in mixture (a) are all labeled with the same fluorescent label. In some embodiments, the fluorescently labeled probes in mixture (b) are all labeled with the same fluorescent label.

In some embodiments, the kit comprises a plurality of mixtures that comprise a primer set and a fluorescently labeled probe that specifically amplifies and individually detects (i) each of GCB marker ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and LRMP gene products; (ii) each of ABC marker ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and FAM46C gene products; and (iii) a control gene product, wherein the primer set and fluorescently labeled probe that specifically amplifies and individually detects the control gene product are present in each of the plurality of mixtures. In some embodiments, the kit comprises 3-15 mixtures, e.g., 5 mixtures. In some embodiments, the kit comprises a plurality of mixtures that comprise a primer set and a fluorescently labeled probe that specifically amplifies and individually detects (i) each of ZNF318, PTK2, and SSBP2; (ii) each of CCND2, FOXP1, and JADE3; and (iii) a control gene product, wherein the primer set and fluorescently labeled probe that specifically amplifies and individually detects the control gene product are present in each of the plurality of mixtures.

In some embodiments, the kit further comprises reverse transcriptase and/or thermostable DNA polymerase. In some embodiments, the kit further comprises an enzyme with reverse transcriptase and DNA polymerase activity. In some embodiments, the kit further comprises at least one control sample, e.g., an ABC positive control and/or GCB positive control, as described herein. In some embodiments, the kit further includes a negative control (e.g., non-cancer sample).

In some embodiments, the primer set that specifically amplifies ZNF318 is a forward and reverse primer having sequences selected from SEQ ID NOs:193-208, and the sequence of the probe that individually detects ZNF318 is selected from SEQ ID NOs:302-304. In some embodiments, the sequence of the probe that individually detects ZNF318 is SEQ ID NO:304. In some embodiments, the primer set that specifically amplifies PDK3 is a forward and reverse primer having sequences selected from SEQ ID NOs:177-192, and the sequence of the probe that individually detects PDK3 is selected from SEQ ID NOs:299-301. In some embodiments, the sequence of the probe that individually detects PDK3 is SEQ ID NO:300. In some embodiments, the primer set that specifically amplifies HMGN1 is a forward and reverse primer having sequences selected from SEQ ID NOs:209-220, and the sequence of the probe that individually detects HMGN1 is selected from SEQ ID NOs:305-307. In some embodiments, the sequence of the probe that individually detects HMGN1 is SEQ ID NO:305. In some embodiments, the primer set that specifically amplifies PTK2 is a forward and reverse primer having sequences selected from SEQ ID NOs:1-24, and the sequence of the probe that individually detects PTK2 is selected from SEQ ID NOs:253-258. In some embodiments, the sequence of the probe that individually detects PTK2 is SEQ ID NO:253. In some embodiments, the primer set that specifically amplifies SSBP2 is a forward and reverse primer having sequences selected from SEQ ID NOs:161-176, and the sequence of the probe that individually detects SSBP2 is selected from SEQ ID NOs:297 and 298. In some embodiments, the sequence of the probe that specifically detects SSBP2 is SEQ ID NO:297. In some embodiments, the primer set that specifically amplifies BCL6 is a forward and reverse primer having sequences selected from SEQ ID NOs:49-64, and the sequence of the probe that individually detects BCL6 is selected from SEQ ID NOs:266-268. 
In some embodiments, the probe that individually detects BCL6 is SEQ ID NO:266. In some embodiments, the primer set that specifically amplifies LRMP is a forward and reverse primer having sequences selected from SEQ ID NOs:25-48, and the sequence of the probe that individually detects LRMP is selected from SEQ ID NOs:259-265. In some embodiments, the sequence of the probe that individually detects LRMP is SEQ ID NO:262. In some embodiments, the primer set that specifically amplifies ARID3A is a forward and reverse primer having sequences selected from SEQ ID NOs:81-96, and the sequence of the probe that individually detects ARID3A is selected from SEQ ID NOs:276-280. In some embodiments, the sequence of the probe that individually detects ARID3A is SEQ ID NO:279. In some embodiments, the primer set that specifically amplifies CCND2 is a forward and reverse primer having sequences selected from SEQ ID NOs:97-112, and the sequence of the probe that individually detects CCND2 is selected from SEQ ID NOs:281-283. In some embodiments, the sequence of the probe that individually detects CCND2 is SEQ ID NO:281. In some embodiments, the primer set that specifically amplifies FOXP1 is a forward and reverse primer having sequences selected from SEQ ID NOs:221-236, and the sequence of the probe that individually detects FOXP1 is selected from SEQ ID NOs:308 and 309. In some embodiments, the sequence of the probe that individually detects FOXP1 is SEQ ID NO:309. In some embodiments, the primer set that specifically amplifies KIAA0226L is a forward and reverse primer having sequences selected from SEQ ID NOs:237-252, and the sequence of the probe that individually detects KIAA0226L is selected from SEQ ID NOs:310-314. In some embodiments, the sequence of the probe that individually detects KIAA0226L is SEQ ID NO:313. 
In some embodiments, the primer set that specifically amplifies JADE3 is a forward and reverse primer having sequences selected from SEQ ID NOs:145-160, and the sequence of the probe that individually detects JADE3 is selected from SEQ ID NOs:290-296. In some embodiments, the sequence of the probe that individually detects JADE3 is SEQ ID NO:292. In some embodiments, the primer set that specifically amplifies PIM2 is a forward and reverse primer having sequences selected from SEQ ID NOs:65-80, and the sequence of the probe that individually detects PIM2 is selected from SEQ ID NOs:269-275. In some embodiments, the sequence of the probe that individually detects PIM2 is SEQ ID NO:275. In some embodiments, the primer set that specifically amplifies TCF4 is a forward and reverse primer having sequences selected from SEQ ID NOs:129-144, and the sequence of the probe that individually detects TCF4 is selected from SEQ ID NOs:287-289. In some embodiments, the sequence of the probe that individually detects TCF4 is SEQ ID NO:287. In some embodiments, the primer set that specifically amplifies FAM46C is a forward and reverse primer having sequences selected from SEQ ID NOs:113-128, and the sequence of the probe that individually detects FAM46C is selected from SEQ ID NOs:284-286. In some embodiments, the sequence of the probe that individually detects FAM46C is SEQ ID NO:284.

DETAILED DESCRIPTION OF THE INVENTION

I. Introduction

Provided herein is a novel multiplex real-time, quantitative reverse transcription (qRT)-PCR classifier to determine cell-of-origin (COO) subtype of Diffuse Large B-cell Lymphoma (DLBCL). The classifier uses a qRT-PCR multiplex reaction to quantify 16 gene targets (15 determinative and 1 control) and assign a COO subtype of DLBCL. In some embodiments, the assay is a five-tube qRT-PCR. The feasibility and accuracy of the qRT-PCR classifier in formalin-fixed paraffin embedded tissues (FFPET) from DLBCL are shown herein.
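The five-tube arrangement can be written down as a simple data structure. The sketch below uses the mixture assignments recited in claim 39; the variable names and the consistency check are illustrative additions and not part of the disclosed assay. The control gene is left unnamed because the text does not identify it.

```python
# Marker sets as recited in the claims.
GCB = {"ZNF318", "PDK3", "HMGN1", "PTK2", "SSBP2", "BCL6", "LRMP"}
ABC = {"ARID3A", "CCND2", "FOXP1", "KIAA0226L",
       "JADE3", "PIM2", "TCF4", "FAM46C"}

# Mixture layout from claim 39; every mixture also carries the control.
MIXTURES = {
    1: ["ARID3A", "CCND2", "FOXP1"],
    2: ["TCF4", "ZNF318", "KIAA0226L"],
    3: ["PDK3", "HMGN1", "PTK2"],
    4: ["SSBP2", "BCL6", "LRMP"],
    5: ["JADE3", "PIM2", "FAM46C"],
}


def count_targets(mixtures):
    """Check each marker is assigned exactly once; return total targets."""
    assigned = [m for markers in mixtures.values() for m in markers]
    if len(assigned) != len(set(assigned)):
        raise ValueError("a marker appears in more than one mixture")
    if set(assigned) != GCB | ABC:
        raise ValueError("layout does not cover the 15 markers")
    return len(assigned) + 1  # 15 determinative markers + 1 control gene


print(count_targets(MIXTURES))  # prints 16
```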

The presently described assays rely on proven, widely adopted technology and provide accurate, reproducible, and rapid results.

II. Definitions

The term "multiplex" refers to an assay in which more than one target is detected.

The terms "receptacle," "vessel," "tube," "well," "chamber," "microchamber," etc. refer to a container that can hold reagents or an assay. If the receptacle is in a kit and holds reagents, or is being used for an amplification reaction, it can be closed or sealed to avoid contamination or evaporation. If the receptacle is being used for an assay, it can be open or accessible, at least during set up of the assay.

The terms "individually detected" or "individual detection," referring to a marker gene or marker gene product, indicate that each marker in a multiplex reaction is detected separately. That is, each marker is associated with a different label (detected by a differently labeled probe).

Unless otherwise labeled, the terms "COO classifier," "subtype classifier," "COO subtype signature," "subtype determination signature," and like terms are used to refer to the 15-gene signature that can be used to classify the cell of origin subtype of a DLBCL patient. The terms "6-gene COO classifier," "6-gene subtype classifier," "6-gene COO subtype signature," "6-gene subtype determination signature," and like terms refer to the classifier that includes CCND2, FOXP1, JADE3, ZNF318, SSBP2, and PTK2.

The terms "nucleic acid," "polynucleotide," and "oligonucleotide" refer to polymers of nucleotides (e.g., ribonucleotides or deoxyribonucleotides) and include naturally-occurring (adenosine, guanosine, cytosine, uracil and thymidine), non-naturally occurring, and modified nucleic acids. The term is not limited by length (e.g., number of monomers) of the polymer. A nucleic acid may be single-stranded or double-stranded and will generally contain 5'-3' phosphodiester bonds, although in some cases, nucleotide analogs may have other linkages. Monomers are typically referred to as nucleotides. The term "non-natural nucleotide" or "modified nucleotide" refers to a nucleotide that contains a modified nitrogenous base, sugar or phosphate group, or that incorporates a non-natural moiety in its structure. Examples of non-natural nucleotides include dideoxynucleotides, biotinylated, aminated, deaminated, alkylated, benzylated and fluorophore-labeled nucleotides.

The term "primer" refers to a short nucleic acid (an oligonucleotide) that acts as a point of initiation of polynucleotide strand synthesis by a nucleic acid polymerase under suitable conditions. Polynucleotide synthesis and amplification reactions typically include an appropriate buffer, dNTPs and/or rNTPs, and one or more optional cofactors, and are carried out at a suitable temperature. A primer typically includes at least one target-hybridized region that is at least substantially complementary to the target sequence (e.g., having 0, 1, 2, or 3 mismatches). This region is typically about 8 to about 40 nucleotides in length, e.g., 12-25 nucleotides. A "primer set" refers to a forward and reverse primer that are oriented in opposite directions relative to the target sequence, and that produce an amplification product in amplification conditions. The primer set can further include an additional forward or reverse primer, e.g., to carry out allele-specific amplification.

As used herein, "probe" means any molecule that is capable of selectively binding to a specifically intended target biomolecule, for example, a nucleic acid sequence of interest that hybridizes to the probes. The probe is detectably labeled with at least one non-nucleotide moiety. In some embodiments, the probe is labeled with a fluorophore and quencher.

The words "complementary" or "complementarity" refer to the ability of a nucleic acid in a polynucleotide to form a base pair with another nucleic acid in a second polynucleotide. For example, the sequence A-G-T (A-G-U for RNA) is complementary to the sequence T-C-A (U-C-A for RNA). Complementarity may be partial, in which only some of the nucleic acids match according to base pairing, or complete, where all the nucleic acids match according to base pairing. A probe or primer is considered "specific for" a target sequence if it is at least partially complementary to the target sequence. Depending on the conditions, the degree of complementarity to the target sequence is typically higher for a shorter nucleic acid such as a primer (e.g., greater than 80%, 90%, 95%, or higher) than for a longer sequence.
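As an illustration of partial versus complete complementarity, the following minimal sketch computes base-by-base complementarity for DNA sequences. The function names are hypothetical and are not part of the disclosed assay. The sketch assumes ungapped, equal-length sequences and standard A-T and G-C pairing. `complement` reproduces the position-by-position pairing of the A-G-T / T-C-A example, while `percent_complementarity` assumes both sequences are written 5' to 3' and therefore compares against the reverse complement.

```python
# Hypothetical helper names, for illustration only.
PAIR = {"A": "T", "C": "G", "G": "C", "T": "A"}


def complement(seq: str) -> str:
    """Position-by-position complement (pairs A-G-T with T-C-A)."""
    return "".join(PAIR[base] for base in seq.upper())


def reverse_complement(seq: str) -> str:
    """Complementary strand of a 5'->3' sequence, also written 5'->3'."""
    return complement(seq)[::-1]


def percent_complementarity(oligo: str, target_site: str) -> float:
    """Percentage of oligo positions that base-pair with the target site.

    Both inputs are written 5'->3' and must have the same length.
    """
    if len(oligo) != len(target_site):
        raise ValueError("oligo and target site must have the same length")
    expected = reverse_complement(target_site)
    matches = sum(a == b for a, b in zip(oligo.upper(), expected))
    return 100.0 * matches / len(oligo)
```

Under these assumptions, a 20-nucleotide primer with a single mismatch to its target site would score 95%.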

The term "specifically amplifies" indicates that a primer set amplifies a target sequence more than non-target sequence at a statistically significant level. The term "specifically detects" indicates that a probe will detect a target sequence more than non-target sequence at a statistically significant level. As will be understood in the art, specific amplification and detection can be determined using a negative control, e.g., a sample that includes the same nucleic acids as the test sample, but not the target sequence or a sample lacking nucleic acids. For example, primers and probes that specifically amplify and detect a target sequence result in a Ct that is readily distinguishable from background (non-target sequence), e.g., a Ct that is at least 2, 3, 4, 5, 5-10, 10-20, or 10-30 cycles less than background.
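The Ct-based specificity criterion can be expressed as a simple comparison. The sketch below is illustrative only: the function name, the default margin of 2 cycles (the smallest value listed above), and the convention of passing `None` when the negative control never crosses the threshold are all assumptions.

```python
from typing import Optional


def is_specific(ct_target: float,
                ct_background: Optional[float],
                min_delta_ct: float = 2.0) -> bool:
    """Return True if the target Ct is at least `min_delta_ct` cycles below background.

    `ct_background` is the Ct observed in the negative control; None is taken
    to mean that no signal crossed the threshold in that control.
    """
    if ct_background is None:
        return True
    return (ct_background - ct_target) >= min_delta_ct
```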

The terms "identical" or "percent identity," in the context of two or more nucleic acids, or two or more polypeptides, refer to two or more sequences or subsequences that are the same or have a specified percentage of nucleotides, or amino acids, that are the same (e.g., about 60% identity, e.g., at least any of 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and aligned for maximum correspondence over a comparison window or designated region) as measured using the BLAST or BLAST 2.0 sequence comparison algorithms with default parameters, or by manual alignment and visual inspection. See e.g., the NCBI web site at ncbi.nlm.nih.gov/BLAST. Such sequences are then said to be "substantially identical." Percent identity is typically determined over optimally aligned sequences, so that the definition applies to sequences that have deletions and/or additions, as well as those that have substitutions. The algorithms commonly used in the art account for gaps and the like. Typically, identity exists over a region comprising a sequence that is at least about 8-25 amino acids or nucleotides in length, or over a region that is 50-100 amino acids or nucleotides in length, or over the entire length of the reference sequence.

The term "kit" refers to any manufacture (e.g., a package or a container) including at least one reagent, such as a nucleic acid probe or probe pool or the like, for specifically amplifying, capturing, tagging/converting or detecting RNA or DNA as described herein.
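For sequences that have already been aligned (e.g., manually), percent identity as defined in this section can be computed as in the sketch below. This is not a BLAST implementation and it does not perform alignment. The function name and the use of "-" as the gap character are assumptions, and identity is reported as identical columns divided by the total number of alignment columns, so gapped columns count as non-identical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Ignore columns in which both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("alignment contains no comparable columns")
    identical = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * identical / len(columns)
```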

The term "amplification conditions" refers to conditions in a nucleic acid amplification reaction (e.g., PCR amplification) that allow for hybridization and template-dependent extension of the primers. The term "amplicon" or "amplification product" refers to a nucleic acid molecule that contains all or a fragment of the target nucleic acid sequence and that is formed as the product of in vitro amplification by any suitable amplification method. One of skill will understand that a forward and reverse primer (primer pair) define the borders of an amplification product. The term "generate an amplification product," when applied to primers, indicates that the primers, under appropriate conditions (e.g., in the presence of a nucleotide polymerase and NTPs), will produce the defined amplification product. Various PCR conditions are described in PCR Strategies (Innis et al., 1995, Academic Press, San Diego, CA) at Chapter 14, and in PCR Protocols: A Guide to Methods and Applications (Innis et al., Academic Press, NY, 1990).

The term "amplification product" refers to the product of an amplification reaction. The amplification product includes the primers used to initiate each round of polynucleotide synthesis. An "amplicon" is the sequence targeted for amplification, and the term can also be used to refer to amplification product. The 5' and 3' borders of the amplicon are defined by the forward and reverse primers.

The term "sample" or "biological sample" refers to any composition containing or presumed to contain nucleic acid. The term includes purified or separated components of cells, tissues, or blood, e.g., DNA, RNA, proteins, cell-free portions, or cell lysates. In the context of the presently disclosed assay, the sample is typically FFPET, e.g., from a tumor or metastatic lesion. The sample can also be from frozen or fresh tissue, or from a liquid sample, e.g., blood or a blood component (plasma or serum), urine, semen, saliva, sputum, mucus, tears, lymph, cerebral spinal fluid, mouth/throat rinse, bronchial alveolar lavage, material washed from a swab, etc. Samples also may include constituents and components of in vitro cultures of cells obtained from an individual, including cell lines. The sample can also be partially processed from a sample directly obtained from an individual, e.g., cell lysate or blood depleted of red blood cells. The term "obtaining a sample from an individual" means that a biological sample from the individual is provided for testing. The obtaining can be directly from the individual, or from a third party that directly obtained the sample from the individual.

A "control" sample or value refers to a value that serves as a reference, usually a known reference, for comparison to a test sample or test conditions. For example, a test sample can be taken from a test condition, e.g., from an individual suspected of having cancer, and compared to samples from known conditions, e.g., from a cancer-free individual (negative control), or from an individual known to have cancer (positive control). In the context of the present disclosure, the test sample is typically from a DLBCL patient. A control can also represent an average value or a range gathered from a number of tests or results. A control can also be prepared for reaction conditions. For example, a control for the presence, quality, and/or quantity of nucleic acid (e.g., internal control) can include primers or probes that will detect a sequence known to be present in the sample (e.g., a housekeeping gene such as beta actin, beta globin, glyceraldehyde 3-phosphate dehydrogenase (GAPDH), ribosomal protein L37 and L38, PPIase, EIF3, eukaryotic translation elongation factor 2 (eEF2), DHFR, or succinate dehydrogenase). A known polynucleotide, e.g., having a designated length, can also be added. An example of a negative control is one free of nucleic acids, or one including primers or probes specific for a sequence that would not be present in the sample, e.g., from a different species. One of skill will understand that the selection of controls will depend on the particular assay, e.g., so that the control is cell type and organism-appropriate. One of skill in the art will recognize that controls can be designed for assessment of any number of parameters. For example, a control can be devised to compare therapeutic benefit based on pharmacological data (e.g., half-life) or therapeutic measures (e.g., comparison of benefit and/or side effects). Controls can be designed for in vitro applications.
One of skill in the art will understand which controls are valuable in a given situation and be able to analyze data based on comparisons to control values. Controls are also valuable for determining the significance of data. For example, if values for a given parameter are widely variant in controls, variation in test samples will not be considered as significant.

The terms "label," "tag," "detectable moiety," and like terms refer to a composition detectable by spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical means. For example, useful labels include fluorescent dyes (fluorophores), luminescent agents, radioisotopes (e.g., 32P, 3H), electron-dense reagents, or an affinity-based moiety, e.g., a poly-A (interacts with poly-T) or poly-T tag (interacts with poly-A), a His tag (interacts with Ni), or a streptavidin tag (separable with biotin).

The term "identifying an individual" means determining based on a sample derived from an individual (e.g., a patient) whether the respective individual is actually sensitive to an administration or treatment. The term "providing treatment for an individual" means that the treatment is actually administered to the individual (e.g., an in-patient injection), or that it is made available to the individual, so that the individual or third party actually administers the treatment.

Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by a person of ordinary skill in the art. See, e.g., Lackie, DICTIONARY OF CELL AND MOLECULAR BIOLOGY, Elsevier (4th ed. 2007); Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL, Cold Spring Harbor Press (Cold Spring Harbor, N.Y. 1989). The term "a" or "an" is intended to mean "one or more." The terms "comprise," "comprises," and "comprising," when preceding the recitation of a step or an element, are intended to mean that the addition of further steps or elements is optional and not excluded.

III. Nucleic acid samples

Samples for nucleic acid amplification can be obtained from any source suspected of containing nucleic acid. Samples can be taken from formalin fixed paraffin embedded tissue (FFPET), tissue biopsy, bronchoalveolar lavage, or cultured cells (e.g., obtained from a patient, or representing a control). In the context of the present disclosure, the sample is typically taken from lung tissue or a cell population that includes lung cells, e.g., lung cancer cells. In some embodiments, the sample is obtained in a non-invasive manner, e.g., from urine, skin, swab, saliva, blood or a blood fraction.

In a sample that includes cells, the cells can be separated out (e.g., using size-based filtration or centrifugation), thereby leaving cell free nucleic acids (cfNA), including nucleic acids in exosomes, microvesicles, viral particles, or those circulating freely. Alternatively, the cells can be lysed to obtain cellular nucleic acids, either in the presence of magnetic glass particles (MGPs) or before addition of the cellular lysate to the MGPs. Methods for isolating nucleic acids from biological samples are known, e.g., as described in Sambrook, and several kits are commercially available (e.g., High Pure RNA Isolation Kit, High Pure Viral Nucleic Acid Kit, and MagNA Pure LC Total Nucleic Acid Isolation Kit, DNA Isolation Kit for Cells and Tissues, DNA Isolation Kit for Mammalian Blood, High Pure FFPET DNA Isolation Kit, available from Roche). In the context of the presently disclosed methods, RNA is collected, though in some embodiments, the classifier can be used on previously prepared cDNA.

IV. Diffuse Large B Cell Lymphoma (DLBCL) and Therapies

Diffuse large B-cell lymphoma (DLBCL) is the most common subtype of non-Hodgkin lymphoma. Approximately 40% of patients have refractory disease or disease that will relapse after an initial response, and the majority of patients with relapsed DLBCL will succumb to the disease. There are two major biologically distinct molecular subtypes of DLBCL: germinal center B-cell (GCB) and activated B-cell (ABC). ABC DLBCL is associated with substantially worse outcomes when treated with standard chemotherapy.

GCB patients typically benefit from standard chemotherapy. This can include CHOP (cyclophosphamide; doxorubicin; vincristine; and prednisolone) or R-CHOP, which further includes rituximab and/or etoposide. The cocktail can be administered periodically for a set period of time, or until reduction in tumor size and/or symptoms are detected. For example, the CHOP or R-CHOP can be administered every 2 or 3 weeks. Treatment or administration typically begins with a low dose so that side effects can be determined, and the dose is then increased until side effects appear or the limit of the patient's tolerance is reached.

A number of additional agents (alternative therapies) are in development for ABC patients. These can be administered in combination with CHOP or R-CHOP, simultaneously or in separate doses. These alternative therapies can include BTK inhibitors (e.g., ibrutinib), SYK inhibitors (e.g., fostamatinib), NFkB inhibitors (e.g., bortezomib), or immunomodulatory agents (e.g., structural and functional analogs of thalidomide, e.g., lenalidomide).

Additional appropriate therapies for DLBCL GCB and ABC subtypes are described in Dunleavy et al. (April 15, 2014) Oncology and Nowakowski & Czuczman (2015) Am. Soc. Clin. Oncol. Educ. Book e449.

V. Amplification and detection

A nucleic acid sample can be used for detection and quantification, e.g., using nucleic acid amplification, e.g., using any primer-dependent method. In some embodiments, a preliminary reverse transcription step is carried out (also referred to as RT-PCR, not to be confused with real time PCR). See, e.g., Hierro et al. (2006) 72:7148. The term "qRT-PCR" as used herein refers to reverse transcription followed by quantitative PCR. Both reactions can be carried out in a single tube without interruption, e.g., to add reagents. For example, a polyT primer can be used to reverse transcribe all mRNAs in a sample with a polyA tail, random oligonucleotides can be used, or a primer can be designed that is specific for a particular target transcript that will be reverse transcribed into cDNA. The cDNA can form the initial template strand for quantitative amplification (real time or quantitative PCR, i.e., RTPCR or qPCR). qPCR allows for reliable detection and measurement of products generated during each cycle of the PCR process. Such techniques are well known in the art, and kits and reagents are commercially available, e.g., from Roche Molecular Systems, Life Technologies, Bio-Rad, etc. See, e.g., Pfaffl (2010) Methods: The ongoing evolution of qPCR vol. 50.

A separate reverse transcriptase and thermostable DNA polymerase can be used, e.g., in a two-step (reverse transcription followed by addition of DNA polymerase and amplification) or combined reaction (with both enzymes added at once). In some embodiments, the target nucleic acid is amplified with a thermostable polymerase with both reverse transcriptase activity and DNA template-dependent activity. Exemplary enzymes include Tth DNA polymerase, the C. therm Polymerase system, and those disclosed in US20140170730 and US20140051126.

Probes for use as described herein can be labeled with a fluorophore and quencher (e.g., TaqMan, LightCycler, Molecular Beacon, Scorpion, or Dual Labeled probes).

Appropriate fluorophores include FAM, JOE, TET, Cal Fluor Gold 540, HEX, VIC, Cal Fluor Orange 560, TAMRA, Cyanine 3, Quasar 570, Cal Fluor Red 590, Rox, Texas Red, Cyanine 5, Quasar 670, and Cyanine 5.5. Appropriate quenchers include TAMRA (for FAM, JOE, and TET), DABCYL, and BHQ1-3. Detection devices are known in the art and can be selected as appropriate for the selected labels. Detection devices appropriate for quantitative PCR include the cobas® and LightCycler® systems (Roche), PRISM 7000 and 7300 real-time PCR systems (Applied Biosystems), etc. Six-channel detection is available on the CFX96 Real Time PCR Detection System (Bio-Rad) and Rotorgene Q (Qiagen), allowing for a higher degree of multiplexing.

Results can be expressed in terms of a threshold cycle (abbreviated as Ct, and in some instances Cq or Cp). A lower Ct value reflects the rapid achievement of a predetermined threshold level, e.g., because of higher target nucleic acid concentration or a more efficient amplification. A higher Ct value may reflect lower target nucleic acid concentration, or inefficient or inhibited amplification. The threshold cycle is generally selected to be in the linear range of amplification for a given target. In some embodiments, the Ct is set as the cycle at which the growth signal exceeds a pre-defined threshold line, e.g., in relation to the baseline, or by determining the maximum of the second derivative of the growth curve. Determination of Ct is known in the art, and described, e.g., in US Patent No. 7363168.

VI. Kits

Provided herein are kits for multiplex qRT-PCR assays to classify the COO subtype of a DLBCL patient. In some embodiments, the kit includes mixtures of primers and probes for amplification, detection, and quantification of GCB and ABC marker gene products (RNA). GCB markers include ZNF318, PDK3, HMGN1, PTK2, SSBP2, BCL6, and LRMP, and transcripts of these genes are present at a higher level in samples from GCB patients than in samples from non-cancer or ABC patients. ABC markers include ARID3A, CCND2, FOXP1, KIAA0226L, JADE3, PIM2, TCF4, and FAM46C, and transcripts of these genes are present at a higher level in samples from ABC patients than in samples from non-cancer or GCB patients.

Kits for multiplex qRT-PCR assays to classify the 6-gene COO subtype of a DLBCL patient are also included herein. In some embodiments, the kit includes mixtures of primers and probes for amplification, detection, and quantification of GCB and ABC marker gene products (RNA). GCB markers include ZNF318, PTK2, and SSBP2, and transcripts of these genes are present at a higher level in samples from GCB patients than in samples from non-cancer or ABC patients. ABC markers include CCND2, FOXP1, and JADE3, and transcripts of these genes are present at a higher level in samples from ABC patients than in samples from non-cancer or GCB patients.

The marker-specific primer sets and probes can be mixed and matched in any combination. For example, each marker can be individually detected. In a detection system having 6 channels, up to 5 markers can be detected in a single vessel, along with the internal control. In this case, only 3 primer set and probe mixtures are required to include all 15 markers. In a detection system having 4 channels, up to 3 markers can be detected in a single vessel, along with the internal control. In this case, 5 primer set and probe mixtures are required. Alternatively, the assay can be carried out with a lower degree of multiplexing, or in non-multiplex fashion, so that more primer set and probe mixtures are required to test expression of all 15 markers in a sample. An example of a 5-tube multiplex assay is shown in the Examples. Thus in some embodiments, the kit includes 5 mixtures (e.g., master mixes), each comprising a primer set and probe specific for up to three GCB and ABC markers, and a primer set and probe specific for an internal control gene. For the 6-gene COO signature, the kit can include 2 mixtures, e.g., (i) a mixture including primers and probes that specifically amplify and detect the GCB markers ZNF318, PTK2, and SSBP2 (and an internal control) and (ii) a mixture including primers and probes that specifically amplify and detect the ABC markers CCND2, FOXP1, and JADE3 (and an internal control). In some embodiments, the probe for each gene in mixtures (i) and (ii) has a different label so that the expression of each gene can be individually detected. In some embodiments, the probe for each of the determinative genes (not an internal control) in mixtures (i) and (ii) has the same label. In some embodiments, the kit includes 6 different mixtures, one for each gene in the 6-gene COO signature. In some embodiments, markers are not detected individually.
For example, all of the probes specific for GCB markers can be labeled with the same label, and all of the probes specific for ABC markers can be labeled with the same label (different from that on the GCB probes). In this case, all 15 markers can be massively multiplexed in a single vessel for detection with only 3 channels (one for GCB marker probes, one for ABC marker probes, and one for the control probe).
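The relationship between detection channels and the number of primer set and probe mixtures for individually detected markers is a ceiling division, sketched below. The function name is hypothetical, and the sketch assumes that one channel per vessel is reserved for the internal control, as in the examples given in this section.

```python
import math


def mixtures_required(n_markers: int, n_channels: int, control_channels: int = 1) -> int:
    """Number of vessels needed when every marker gets its own channel.

    One channel per vessel (by default) is reserved for the internal control.
    """
    markers_per_vessel = n_channels - control_channels
    if markers_per_vessel < 1:
        raise ValueError("no channel left for marker detection")
    return math.ceil(n_markers / markers_per_vessel)
```

With 15 markers this gives 3 mixtures for a 6-channel system and 5 mixtures for a 4-channel system, and 2 mixtures for the 6-gene signature on a 4-channel system, in line with the figures stated in this section.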

In some embodiments, the mixtures further comprise buffers, dNTPs, and other elements (e.g., cofactors or aptamers) appropriate for reverse transcription and amplification. Typically, the mixture is concentrated, so that an aliquot is added to the final reaction volume, along with sample (e.g., RNA), enzymes, and/or water. In some embodiments, the kit further comprises reverse transcriptase (or an enzyme with reverse transcriptase activity), and/or DNA polymerase (e.g., thermostable DNA polymerase such as Taq, Z05, and derivatives thereof). In some embodiments, the kit further includes components for RNA purification from a sample, e.g., an FFPET sample. For example, the kit can include components from High Pure or MagNA Pure FFPE RNA Isolation Kits (Roche), RNeasy FFPE Kit (Qiagen), PureLink FFPE RNA Isolation Kit (Thermo Fisher), etc. In some embodiments, the kit further includes at least one control sample, e.g., nucleic acids from non-cancer sample (or pooled samples), or from a known ABC or GCB sample (or pooled samples). In some embodiments, the kit includes an ABC positive control and/or a GCB positive control. In some embodiments, the kit includes a negative control, e.g., lacking nucleic acids, or lacking ABC and/or GCB marker nucleic acids. In some embodiments, the kit further includes consumables, e.g., plates or tubes for nucleic acid preparation, tubes for sample collection, etc. In some embodiments, the kit further includes instructions for use, reference to a website, or software.

VII. Examples

Design of COO subtype determination signature

A set of commercially acquired DLBCL FFPET specimens (Training cohort 1; n=32) was used to select the classifier genes (Table 1). The samples were prepared using the FFPET RNA kit from Roche.

Gene targets in the qRT-PCR classifier were derived from a collection of genes (n=76) screened in a cohort of DLBCL specimens (n=32; Training cohort). We used the Affymetrix microarray platform as a "gold standard" for confirmation.

Table 1. qRT-PCR COO classifier for DLBCL with GenBank Accession Numbers

Once the genes were selected, a qRT-PCR assay was designed to be performed in 5 separate wells. 200 ng of test and control sample RNA (40 ng/well) was used. Reaction conditions were as follows for each reaction:

25 ul RNA + 25 ul reaction mix

Reaction mix: 5 ul manganese acetate + 10 ul RNA master mix stock + 10 ul primer/probe mix (final concentration 100-300 nM)

Reactions were run in a cobas® LC480 with four filters to detect probes as indicated in Table 2.
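The arithmetic implied by the listed volumes can be checked with the short sketch below. The names are hypothetical. It assumes that the stated 100-300 nM refers to each primer or probe in the final 50 ul reaction, in which case the 10 ul primer/probe mix would need to be 5-fold concentrated, and that the 40 ng of RNA per well is delivered in the full 25 ul RNA volume.

```python
# Volumes per reaction, in microliters, as listed in the text.
RNA_VOLUME_UL = 25.0
REACTION_MIX_UL = {
    "manganese acetate": 5.0,
    "RNA master mix stock": 10.0,
    "primer/probe mix": 10.0,
}


def reaction_mix_volume_ul() -> float:
    """Total volume of the reaction mix components."""
    return sum(REACTION_MIX_UL.values())


def total_reaction_volume_ul() -> float:
    """RNA volume plus reaction mix volume."""
    return RNA_VOLUME_UL + reaction_mix_volume_ul()


def stock_concentration_nm(final_nm: float) -> float:
    """Primer/probe mix concentration needed to reach `final_nm` in the reaction."""
    dilution = total_reaction_volume_ul() / REACTION_MIX_UL["primer/probe mix"]
    return final_nm * dilution


def rna_input_ng_per_ul(ng_per_well: float) -> float:
    """RNA concentration of the input needed to deliver `ng_per_well`."""
    return ng_per_well / RNA_VOLUME_UL
```

Under these assumptions the reaction mix components sum to the stated 25 ul, the total reaction volume is 50 ul, the primer/probe mix would be 500-1500 nM, and the RNA input would be 1.6 ng/ul.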

Table 2. Exemplary assay lay-out

Table 2 shows an exemplary assay design, and allows individual detection and quantification of each of the marker genes in a minimal number of wells.

More or fewer reaction vessels can be used. For example, a one-tube assay having all of the GCB markers labeled with the same label (fluorophore 1), all of the ABC markers labeled with the same label (fluorophore 2), and an internal control (IC) labeled with a different label (fluorophore 3) can be used. On the other end of the spectrum, each classifier gene can be detected in a separate well to determine the COO subtype of the test sample.

The test is carried out by comparing the expression level of the GCB markers to the expression level of the ABC markers within a sample. If the ratio of GCB marker expression to ABC marker expression is higher than a threshold (e.g., GCB threshold), the result indicates that the sample is from an individual with GCB DLBCL. If the ratio of ABC marker expression to GCB marker expression is higher than a threshold (e.g., ABC threshold), the result indicates that the sample is from an individual with ABC DLBCL. The internal control is used to standardize expression levels based on the amount or quality of nucleic acid in the sample. The threshold levels are based on the probability that the GCB and ABC expression levels in a sample from an individual accurately classify the individual's DLBCL COO subtype. For example, a GCB threshold level can be set using a sample from an individual (or group of individuals) known to have GCB subtype. A GCB positive control can then be prepared with the known GCB sample. In some embodiments, the GCB positive control is prepared from the known GCB sample mixed with a sample known to be from an individual with ABC so that > 50% of the nucleic acids in the GCB positive control are from the known GCB sample to provide a minimum GCB:ABC expression level ratio. If a sample has a GCB:ABC expression ratio above that ratio (GCB threshold), the result is considered an accurate call of GCB COO subtype.
The GCB positive control can be prepared with 51-100% known GCB sample, e.g., about 55, 58, 60, 62, 65, 68, 70, 75% or higher, with higher percentages resulting in a more stringent confidence level in the GCB threshold. If a sample has a GCB:ABC expression ratio below the GCB threshold, the result is either not determined, or an ABC COO subtype. The ABC threshold is set similarly. For example, the ABC positive control can be prepared with 51-100% known ABC sample, e.g., about 55, 58, 60, 62, 65, 68, 70, 75% or higher, with higher percentages resulting in a more stringent confidence level in the ABC threshold. If a sample has an ABC:GCB expression ratio above the ABC threshold, the result is considered an accurate call of ABC COO subtype, while for a ratio below the ABC threshold, the result is either not determined or a GCB COO subtype. In some embodiments, the GCB and ABC positive controls are prepared by mixing a known amount of GCB marker nucleic acids and ABC marker nucleic acids. The GCB and ABC positive controls also act as controls for assay performance, e.g., to ensure reagents are added and that the instrument is performing properly.
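The ratio-and-threshold logic described above can be sketched as follows. This is a minimal illustration, not the disclosed scoring algorithm: the text does not state how per-gene signals are combined, so the sketch assumes that each marker is normalized to the internal control in the same well as 2^(Ct_IC - Ct_marker) (i.e., 100% amplification efficiency) and that the GCB and ABC levels are arithmetic means over the respective markers. The function names and the threshold values passed in are hypothetical.

```python
from statistics import mean
from typing import Dict, Tuple

# Marker lists as given for the 15-gene classifier.
GCB_MARKERS = ("ZNF318", "PDK3", "HMGN1", "PTK2", "SSBP2", "BCL6", "LRMP")
ABC_MARKERS = ("ARID3A", "CCND2", "FOXP1", "KIAA0226L", "JADE3", "PIM2", "TCF4", "FAM46C")


def relative_expression(ct_marker: float, ct_control: float) -> float:
    """Expression relative to the internal control (assumes doubling per cycle)."""
    return 2.0 ** (ct_control - ct_marker)


def classify_coo(ct_values: Dict[str, Tuple[float, float]],
                 gcb_threshold: float,
                 abc_threshold: float) -> str:
    """Classify a sample as 'GCB', 'ABC', or 'not determined'.

    `ct_values` maps each gene to (marker Ct, internal control Ct in the same well).
    """
    gcb_level = mean([relative_expression(*ct_values[g]) for g in GCB_MARKERS])
    abc_level = mean([relative_expression(*ct_values[g]) for g in ABC_MARKERS])
    if gcb_level / abc_level > gcb_threshold:
        return "GCB"
    if abc_level / gcb_level > abc_threshold:
        return "ABC"
    return "not determined"
```

In practice, the two thresholds would be derived from the GCB and ABC positive controls as described above rather than chosen arbitrarily.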

Tables 3 and 4 show the sequences of primers and probes, respectively, that can be used for the present classifier.

Table 3: Primer sequences

Gene Forward 5' to 3' Sequence (SEQ ID NO) Reverse 5' to 3' Sequence (SEQ ID

primer Primer NO)

PTK2 CS_PTK2_F1 GGCAGTATTGACAGGGAGGA (1) CS_PTK2_R1 TGGTTTACCCACAGGCTGA

(2)

PTK2 CS_PTK2_F2 GGAGAAGGCCAATTTGGAGAT (3) CS_PTK2_R2 ACAG I 1 1 1 I ACA I G I 1 1 I AAT

TGCAACC (4)

PTK2 CS_PTK2_F3 GGACAGAAAAGGAATGCTACAACT CS_PTK2_R3 CGCAATGGTTAGGGATGGTG

A (5) (6)

PTK2 CS_PTK2_F4 AGCTTAGTACAGCTCTTGCAT (7) CS_PTK2_R4 TCATTTGAGGACACCAGAAC

ATT (8)

PTK2 CS_PTK2_F5 GCCCAGAAGAAGGAATCAGTT (9) CS_PTK2_R5 GGTTTGCACTTGAGTGAAGT

C (10)

PTK2 CS_PTK2_F6 ACCATTCCCCTCCTACCAG (11) CS_PTK2_R6 CTCACCCAGGTCAGAGTTCA

(12)

PTK2 CS_PTK2_F7 GACCTCAGGAGATAGCAATGT (13) CS_PTK2_R7 CACTTGCCCAATCCCTCG

(14)

PTK2 CS_PTK2_F8 GAATGGAACCTCGCAGTCAT (15) CS_PTK2_R8 GGCCAACTTTGGTATTGATG

G (16)

PTK2 CS_PTK2_F9 TTCGACG 1 1 1 I ACCTCAGCT (17) CS_PTK2_R9 GGCTTCACACCATGCATCAG

(18)

PTK2 CS_PTK2_F1 GAAAGAAGGTGAACGGGCTT (19) CS_PTK2_R10 GTGTGTCCGCATGCCTTG

0 (20)

PTK2 CS_PTK2_F1 CCTGTCTGGATAATCATGGAGC CS_PTK2_R11 GCTAGATCCAAACTGTATTTC

1 (21) CTTAC (22)

PTK2 CS_PTK2_F1 CGGCCCAGGTTTACTGAAC (23) CS_PTK2_R12 TCTTCTTGCTGAGCCTTCTCT

2 (24)

LRMP LRMP_F1 CAGGCTGCATCAGGATGAAT (25) LRMP_R1 CAGCAGGCTCTCAGGACA

(26) Gene Forward 5' to 3' Sequence (SEQ ID NO) Reverse 5' to 3' Sequence (SEQ ID primer Primer NO)

BCL6 | BCL6_F1 | GAAGAGCCACCTGCGAATC (53) | BCL6_R3 | GCTGGC I 1 1 1 GTGACGGAA (54)
BCL6 | BCL6_F2 | CCACCTGCGAATCCACAC (55) | BCL6_R1 | GCTGGC I 1 1 1 GTGACGGAAA (56)
BCL6 | BCL6_F3 | CATGTTGTGGACACTTGCC (57) | BCL6_R4 | CTTCACGAGGAGGCTTGATG (58)
BCL6 | BCL6_F4 | GGAGCATGTTGTGGACACTT (59) | BCL6_R5 | CTCTTCACGAGGAGGCTTGA (60)
BCL6 | BCL6_F5 | ATGGAGCATGTTGTGGACAC (61) | BCL6_R6 | CGAGGAGGCTTGATGGCA (62)
BCL6 | BCL6_F6 | GGACTCCACCATCCCACAA (63) | BCL6_R7 | TAGAGTGGTGAGTGGCTCTC (64)
PIM2 | PIM2_FP1 | GTGCCCTGCTTCATGATG (65) | PIM2_RP1 | CTGGTGTCGAGAGATCCACTC (66)
PIM2 | PIM2_FP2 | GGCTGTGCCAAACTCATT (67) | PIM2_RP2 | GGGCTGTACACCCTTGT (68)
PIM2 | PIM2_FP1 | GTGCCCTGCTTCATGATG (69) | PIM2_RP3 | CATGGTACTGGTGTCGAGAGA (70)
PIM2 | PIM2_FP1 | GTGCCCTGCTTCATGATG (71) | PIM2_RP4 | CCGGGAGTGCATGGTACT (72)
PIM2 | PIM2_FP3 | GGACACCGCCTCACAGAT (73) | PIM2_RP5 | AGTGGGCATGTGACTGAGTC (74)
PIM2 | PIM2_FP3 | GGACACCGCCTCACAGAT (75) | PIM2_RP6 | CTTCGAGTGGGCATGTGA (76)
PIM2 | PIM2_FP4 | CGCCTCACAGATCGACTC (77) | PIM2_RP7 | GCAGTGCGACTTCGAGTG (78)
PIM2 | PIM2_FP5 | ACCGTCTTCGCAGGACAC (79) | PIM2_RP8 | GGGCATGTGACTGAGTCTG (80)
ARID3A | ARID3A_FP1 | GGCGACTGGACTTACGAGG (81) | ARID3A_RP1 | CCAGGAATTCCTTCCTCTTGG (82)
ARID3A | ARID3A_FP2 | TCCTGGATGACTTGTTCAGC (83) | ARID3A_RP2 | AGGACCTGTTTGGCCATGAT (84)
ARID3A | ARID3A_FP3 | ACGTCCATCACCAGTGCA (85) | ARID3A_RP3 | CTTCTCACACTCGTAGGGGT (86)
ARID3A | ARID3A_FP3 | ACGTCCATCACCAGTGCA (87) | ARID3A_RP4 | CCCGCTTCTCACACTCGTA (88)
ARID3A | ARID3A_FP4 | CAGCTGCCCATGAGCATTC (89) | ARID3A_RP5 | TCAGGTTCACAGCAGAGTCC (90)
ARID3A | ARID3A_FP4 | CAGCTGCCCATGAGCATTC (91) | ARID3A_RP6 | CGTCAGGTTCACAGCAGAG (92)
ARID3A | ARID3A_FP5 | AGCATGTCGGTGGAGATCAA (93) | ARID3A_RP7 | TTGGGAGCAGAGGTTGGC (94)
ARID3A | ARID3A_FP6 | ATCAGCATGTCGGTGGAGAT (95) | ARID3A_RP8 | TTTGTTGGGAGCAGAGGTTG (96)
CCND2 | CCND2_FP1 | AGGACATCCAACCCTACATGC (97) | CCND2_RP1 | GAAGACCTCTTCTTCGCACT (98)
CCND2 | CCND2_FP2 | CTTCATTGCTCTGTGTGCCA (99) | CCND2_RP2 | TGCTCCCACACTTCCAGTT (100)
CCND2 | CCND2_FP2 | CTTCATTGCTCTGTGTGCCA (101) | CCND2_RP3 | CCACACTTCCAGTTGCGATC (102)
CCND2 | CCND2_FP3 | GACTGAGCTGCTGGCTAAGA (103) | CCND2_RP4 | GAGCACCGCCTCAATCTG (104)
CCND2 | CCND2_FP4 | GGACATCCAACCCTACATGC (105) | CCND2_RP5 | AGAGGGAAGACCTCTTCTTCG (106)
CCND2 | CCND2_FP4 | GGACATCCAACCCTACATGC (107) | CCND2_RP6 | GGAAGACCTCTTCTTCGCACT (108)
CCND2 | CCND2_FP2 | CTTCATTGCTCTGTGTGCCA (109) | CCND2_RP7 | CTGCTCCCACACTTCCAGT (110)
CCND2 | CCND2_FP5 | ACCTTCATTGCTCTGTGTGC (111) | CCND2_RP8 | GCTCCCACACTTCCAGTTG (112)
FAM46C | FAM46C_FP1 | AAGGACCTGCCTCTGTCG (113) | FAM46C_RP1 | TCTCCTCTGCCATCTTCAGG (114)
FAM46C | FAM46C_FP2 | CCAAGGACCTGCCTCTGT (115) | FAM46C_RP2 | CTCCTCTGCCATCTTCAGGG (116)
FAM46C | FAM46C_FP3 | CCAAGGACCTGCCTCTGTC (117) | FAM46C_RP1 | TCTCCTCTGCCATCTTCAGG (118)
FAM46C | FAM46C_FP4 | CCAAGGACCTGCCTCTGTC (119) | FAM46C_RP2 | CTCCTCTGCCATCTTCAGGG (120)
FAM46C | FAM46C_FP5 | CAAGGACCTGCCTCTGTCG (121) | FAM46C_RP1 | TCTCCTCTGCCATCTTCAGG (122)
FAM46C | FAM46C_FP2 | CCAAGGACCTGCCTCTGT (123) | FAM46C_RP3 | CTCTGCCATCTTCAGGGGAT (124)
FAM46C | FAM46C_FP1 | AAGGACCTGCCTCTGTCG (125) | FAM46C_RP4 | TGCTCTCCTCTGCCATCTTC (126)
FAM46C | FAM46C_FP3 | CCAAGGACCTGCCTCTGTC (127) | FAM46C_RP5 | CTCTGCCATCTTCAGGGGAT (128)
TCF4 | TCF4_FP1 | AAACCAGCAACCAGCACTTT (129) | TCF4_RP1 | GAGGAGCTCCAAGGGTCAC (130)

HMGN1 | HMGN1_FP1 | AAGACTTACCTGCGGAAAACG (209) | HMGN1_RP1 | TGGCTTCTTTCTCTCCTGCT (210)
HMGN1 | HMGN1_FP2 | AGACTTACCTGCGGAAAACG (211) | HMGN1_RP2 | TTGGCTTCTTTCTCTCCTGC (212)
HMGN1 | HMGN1_FP1 | AAGACTTACCTGCGGAAAACG (213) | HMGN1_RP3 | CTTGGCTTCTTTCTCTCCTGC (214)
HMGN1 | HMGN1_FP3 | GACTTACCTGCGGAAAACGG (215) | HMGN1_RP2 | TTGGCTTCTTTCTCTCCTGC (216)
HMGN1 | HMGN1_FP2 | AGACTTACCTGCGGAAAACG (217) | HMGN1_RP1 | TGGCTTCTTTCTCTCCTGCT (218)
HMGN1 | HMGN1_FP1 | AAGACTTACCTGCGGAAAACG (219) | HMGN1_RP2 | TTGGCTTCTTTCTCTCCTGC (220)
FOXP1 | FOXP1_FP1 | CAACGAGAGTGACAGCAGTC (221) | FOXP1_RP1 | GGCTCTTCTTTGACGTGTACA (222)
FOXP1 | FOXP1_FP1 | CAACGAGAGTGACAGCAGTC (223) | FOXP1_RP2 | GGGCTCTTCTTTGACGTGTA (224)
FOXP1 | FOXP1_FP2 | CGCCTACTGCACACCTCTC (225) | FOXP1_RP3 | CATGGAAGCGGTAGTGTATAGAG (226)
FOXP1 | FOXP1_FP2 | CGCCTACTGCACACCTCTC (227) | FOXP1_RP4 | CCATGGAAGCGGTAGTGTATAG (228)
FOXP1 | FOXP1_FP2 | CGCCTACTGCACACCTCTC (229) | FOXP1_RP5 | CCATGGAAGCGGTAGTGTA (230)
FOXP1 | FOXP1_FP3 | AGAGCAGCCACGCCTACT (231) | FOXP1_RP3 | CATGGAAGCGGTAGTGTATAGAG (232)
FOXP1 | FOXP1_FP4 | CGAAGGCCACAAAAGATCA (233) | FOXP1_RP6 | GCATTGAGAGGTGTGCAGTA (234)
FOXP1 | FOXP1_FP5 | ATGGACAGTGGATGAAGTAGAATTC (235) | FOXP1_RP7 | GCTGCTCTGCATG 1 1 1 1 I AATAA (236)
KIAA0226L | KIAA0226L_F1 | TCTTCCCATTTCAGACAGCA (237) | KIAA0226L_R1 | GAGGACTGGAAGCACTGTTT (238)
KIAA0226L | KIAA0226L_F1 | TCTTCCCATTTCAGACAGCA (239) | KIAA0226L_R2 | GGAGGACTGGAAGCACTGT (240)
KIAA0226L | KIAA0226L_F2 | AGCAAGAGTCTGGGTCTTCTA (241) | KIAA0226L_R3 | GTTTCAGTCACTGGGCTGAC (242)
KIAA0226L | KIAA0226L_F3 | AGCAAGAGTCTGGGTCTTCT (243) | KIAA0226L_R4 | TTTCAGTCACTGGGCTGAC (244)
KIAA0226L | KIAA0226L_F4 | ACAGAAACCTGTAGCTGTTCC (245) | KIAA0226L_R5 | GCTCTTTGGCTAATAGTTCTGCA (246)
KIAA0226L | KIAA0226L_F4 | ACAGAAACCTGTAGCTGTTCC (247) | KIAA0226L_R6 | GCTCTTTGGCTAATAGTTCTGC (248)
KIAA0226L | KIAA0226L_F5 | AATTCTCAGCTGGCAGGTTC (249) | KIAA0226L_R7 | GATTCAAAGTC 1 1 1 I CGGACACA (250)
KIAA0226L | KIAA0226L_F6 | TGGGCTCCTCCTAGATTTCA (251) | KIAA0226L_R8 | AGAAAAAATTCTGGGCTGCCA (252)

Table 4: Probe sequences

Gene | Probe label | 5' to 3' Sequence (SEQ ID NO)
PTK2 | CS_PTK2_JA270_5B | CAAGGGCTGCAATCCCACACATCTTGC (253)
PTK2 | CS_PTK2_JA270_1C | AAGTCTTCAGGGTCCGATTGGAAACCAACA (254)
PTK2 | CS_PTK2_JA270_2C | AGGCATTTATATGAGTCCAGAGAATCCAGCTTTGG (255)
PTK2 | CS_PTK2_JA270_4B | AGGTGCACCCGAGCCTCTGACAG (256)
PTK2 | CS_PTK2_JA270_3A | CAAAAGATTTGTACACAGGGACATTGCTGCTCG (257)
PTK2 | CS_PTK2_JA270_7 | TAACGGACAAGGGCTGCAATCCCACAC (258)
LRMP | LRMP_JA270_1 | TGACCCAAGTATGGAAGAGAATGGTGTTGAACG (259)
LRMP | LRMP_JA270_2 | AGAGGCCCAAGGCACAAGTCCAG (260)
LRMP | LRMP_JA270_3 | ACATGCTTCAGGAGACTCTGTGGTTTCCC (261)
LRMP | LRMP_JA270_4 | AGCCATCAATCAGGAAAGCCGGGTTAGTA (262)
LRMP | LRMP_JA270_5 | GCTTCTCTAAACTCCAAGCCATCTTCTCTACGAAGAG (263)
LRMP | LRMP_JA270_6 | GTGGGATGTCTCTTCAGTTTATGACACAATAGCTTCC (264)
LRMP | LRMP_JA270_7 | ACTATTAGAGTCTTTAACACCTCTGTGTGAAGATGACA (265)
BCL6 | BCL6_HEX1 | AGGAGAGAAACCTTACCATTGTGAGAAGTGTAACCT (266)
BCL6 | BCL6_HEX2 | GGAAGTTTATTAAGGCCAGTGAAGCAGAGATGGTTT (267)
BCL6 | BCL6_HEX3 | AATAACATCGTTAACAGGTCCATGACGGGCTC (268)
PIM2 | PIM2_HEX1 | ACAGATCGACTCCAGGTGGCCATCAAAG (269)
PIM2 | PIM2_HEX2 | GCTGGTCCCCCTTGTCAGACTCAGT (270)
PIM2 | PIM2_HEX3 | CTACACTGACTTTGATGGGACAAGGGTGTACA (271)
PIM2 | PIM2_HEX7 | TCACATGCCCACTCGAAGTCGCA (272)
PIM2 | PIM2_HEX8 | CTGACTTTGATGGGACAAGGGTGTACA (273)
PIM2 | PIM2_HEX9 | CTGGTGCCCTGCTTCATGATGAACC (274)
PIM2 | PIM2_HEX10 | CACTGACTTTGATGGGACAAGGGTG (275)
ARID3A | ARID3A_FAM1 | AGCAGTTTAAGCAGCTCTACGAACTCGACG (276)
ARID3A | ARID3A_FAM2 | TGCAGAAGCGAGGGACACCTGTGA (277)
ARID3A | ARID3A_FAM3 | CCCTGCGGACCCAATACATGAAGTACCT (278)
ARID3A | ARID3A_FAM6 | CAACAGCCAAGCCTCCGAAAGCCG (279)
ARID3A | ARID3A_FAM7 | CGGCATCATGTACACAGGAGTTCTGTTTGCTCA (280)
CCND2 | CCND2_HEX1 | AGTTTGCCATGTACCCACCGTCGA (281)
CCND2 | CCND2_HEX2 | CTTTAAGTTTGCCATGTACCCACCGTCG (282)
CCND2 | CCND2_HEX3 | TTGCCATGTACCCACCGTCGATGAT (283)
FAM46C | FAM46C_JA270_1 | TTCTATTGCCCAGTTTCCCCAGCCAGAA (284)
FAM46C | FAM46C_JA270_2 | CTCTTCTATTGCCCAGTTTCCCCAGCCAG (285)
FAM46C | FAM46C_JA270_3 | CTCCTCTTCTATTGCCCAGTTTCCCCAGC (286)
TCF4 | TCF4_FAM1 | TCCTTCTTCATGCAAGATGGCCATCACAGC (287)
TCF4 | TCF4_FAM2 | AGGACCCTTACAGAGGCATGCCACC (288)
TCF4 | TCF4_FAM3 | TGGAGCAGCAAGTCCGAGAAAGGAATCTGAA (289)
JADE3 | JADE3_FAM1 | AGAAACCTGCTGAGGTATTCCGGAAGGAC (290)
JADE3 | JADE3_FAM2 | CACAGCCTTCTCTCAGGATTATAGCTGAGAAGGT (291)
JADE3 | JADE3_FAM3 | CTTGCAGAAATGGGTTGTGGGCCAGTT (292)
JADE3 | JADE3_FAM5 | TGTGTGCATCAGGCCTGCTATGGC (293)
JADE3 | JADE3_FAM6 | TGTGGATCCCAGAGGTCAGCAUGCTTG (294)
JADE3 | JADE3_FAM7 | AGGACCTGGAGAGGGTCCGAAATCTGT (295)
JADE3 | JADE3_FAM8 | CCAAGAAATTGATGCAGGGCTTCCTTTGACAAATG (296)
SSBP2 | SSBP2_FAM1 | CATGCCTAGTCCAGCAGATTCAACCAACTCT (297)
SSBP2 | SSBP2_FAM2 | CCTGGACCTAACAGACCTAA 1 1 1 1 CCAATGGG (298)
PDK3 | PDK3_FAM1 | TTGAAGAATTCAATGCCAAAGCGCCAGACAAA (299)
PDK3 | PDK3_FAM2 | CGGATGTGGTGAAAGATGCATATGAAACAGCC (300)
PDK3 | PDK3_FAM3 | CAATTCCTGGACTTCGGGAGAGATAATGCATGTG (301)
ZNF318 | ZNF318_HEX1 | TGCTTCCCAGAAGCAAAAGGTTATTGAAGAGAGG (302)
ZNF318 | ZNF318_HEX2 | TAACTGTTCCTGCAAAAGGCTCTGAGTTTCTGG (303)
ZNF318 | ZNF318_HEX3 | CGGCTTCATAAACAACAAGGAGAAATGCTGCGC (304)
HMGN1 | HMGN1_HEX1 | CGAAGACTGAGGAGAGTCCAGCCTCT (305)
HMGN1 | HMGN1_HEX2 | AACGAAGACTGAGGAGAGTCCAGCCTC (306)
HMGN1 | HMGN1_HEX3 | AAGACTGAGGAGAGTCCAGCCTCTGATG (307)
FOXP1 | FOXP1_JA270_1 | CCACAAAAGATCAGTGGTAACCCTTCCC (308)
FOXP1 | FOXP1_JA270_2 | TCCTATGCAAGCCGTGCATCC (309)
KIAA0226L | KIAA0226L_JA270_1 | TGTAGAAGATGTTCAGCGTGCAGGGCTT (310)
KIAA0226L | KIAA0226L_JA270_2 | CATACAGTGGCTATGAAGGTTGTGCTGTGTTACA (311)
KIAA0226L | KIAA0226L_JA270_3 | TGCAGCTCCTCTAAGAGTGTCACTTATGAGCC (312)
KIAA0226L | KIAA0226L_JA270_4 | TGCAGCTGGCTCGATAGTCGTAAATGAAGA (313)
KIAA0226L | KIAA0226L_JA270_5 | TTCATCCACCACTCAAGAGGGACCTTGTG (314)

Validation of the 15-gene signature

The qRT-PCR classifier was validated in commercially acquired DLBCL FFPET specimens (validation cohort 2, n=29; validation cohort 3, n=46). The concordance rate between the qRT-PCR and Affymetrix microarray-based classifiers was 97.1% (Tables 5 and 6).

Table 5: Validation of qRT-PCR COO subtype classifier (Cohort 2, n=29)

Table 6: Validation of qRT-PCR COO subtype classifier (Cohort 3, n=46)

The high concordance of the DLBCL subtype classification signature in two independent DLBCL cohorts is surprising, especially given the relatively small number of genes in the signature. These results show that the DLBCL classifier can be used for quick-turnaround, simple, inexpensive, and accurate determination of COO subtype.
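The concordance rate reported above can be read as the fraction of specimens for which the qRT-PCR classifier and the microarray-based classifier assign the same COO subtype. The following is a minimal sketch of that calculation, not the procedure used in the validation study. The function name, the label strings, the sample calls, and the choice to exclude specimens that either method leaves unclassified are all assumptions made for illustration.

```python
from typing import Iterable, Sequence


def concordance_rate(
    qpcr_calls: Sequence[str],
    array_calls: Sequence[str],
    exclude: Iterable[str] = ("Unclassified",),
) -> float:
    """Fraction of specimens with identical subtype calls from both methods.

    Specimens whose call from either method is in `exclude` are dropped
    before counting (an assumption, not taken from the source text).
    """
    if len(qpcr_calls) != len(array_calls):
        raise ValueError("both call lists must cover the same specimens")
    excluded = set(exclude)
    pairs = [
        (q, a)
        for q, a in zip(qpcr_calls, array_calls)
        if q not in excluded and a not in excluded
    ]
    if not pairs:
        raise ValueError("no specimens left after exclusion")
    agreements = sum(1 for q, a in pairs if q == a)
    return agreements / len(pairs)


# Hypothetical calls for six specimens (illustrative only, not study data).
qpcr = ["ABC", "GCB", "GCB", "ABC", "Unclassified", "GCB"]
array = ["ABC", "GCB", "ABC", "ABC", "GCB", "GCB"]

rate = concordance_rate(qpcr, array)
print(f"Concordance: {rate:.1%}")  # 4 of 5 comparable specimens agree
```

Whether unclassified specimens are dropped or counted as disagreements changes the denominator, so the handling should be stated alongside any reported rate.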

Validation of the 6-gene signature

The qRT-PCR classifier with 6 of the genes was validated in commercially acquired DLBCL FFPET specimens (validation cohort, n=50). The 6-gene signature includes the ABC genes CCND2, FOXP1, and JADE3, and the GCB genes ZNF318, SSBP2, and PTK2. The concordance rate between the qRT-PCR and Affymetrix microarray-based classifiers was 95% (Table 7).

Table 7: Validation of qRT-PCR COO subtype classifier (6-gene classifier)

The high concordance of a small 6-gene DLBCL subtype classification signature is surprising. These results show that the 6-gene DLBCL classifier can be used for quick-turnaround, simple, inexpensive, and accurate determination of COO subtype.