PROTEOMIC ANALYSIS OF HOST CELL PROTEINS

Title:

PROTEOMIC ANALYSIS OF HOST CELL PROTEINS

Document Type and Number:

WIPO Patent Application WO/2018/031858

Kind Code:

A1

Abstract:

Disclosed herein are methods and compositions useful for detecting and/or quantifying host cell proteins during the production of a product, e.g., a recombinant protein, e.g., an antibody.

Inventors:

GRAHAM JAMES (CH)

Application Number:

PCT/US2017/046440

Publication Date:

February 15, 2018

Filing Date:

August 11, 2017

Export Citation:

Click for automatic bibliography generation Help

Assignee:

LONZA AG (US)
LONZA AG (US)

International Classes:

G01N33/68; G06F19/00; G16B45/00

Domestic Patent References:

WO2015051310A2	2015-04-09
WO2001096584A2	2001-12-20
WO2001029058A1	2001-04-26

Foreign References:

US6838284B2	2005-01-04
US5272071A	1993-12-21
US6326193B1	2001-12-04
US5633162A	1997-05-27
EP0481791A2	1992-04-22
US20130280797A1	2013-10-24
US20120077429A1	2012-03-29
US20110280797A1	2011-11-17
US20090305626A1	2009-12-10
US8298054B2	2012-10-30
US7629167B2	2009-12-08
US5656491A	1997-08-12

Other References:

HANNE KOLSRUD HUSTOFT ET AL: "A Critical Review of Trypsin Digestion for LC-MS Based Proteomics", 1 February 2012 (2012-02-01), XP002708428, ISBN: 978-953-51-0070-6, Retrieved from the Internet [retrieved on 20130730]
PIERO GIANSANTI ET AL: "Six alternative proteases for mass spectrometry-based proteomics beyond trypsin", NATURE PROTOCOLS, vol. 11, no. 5, 28 April 2016 (2016-04-28), GB, pages 993 - 1006, XP055413585, ISSN: 1754-2189, DOI: 10.1038/nprot.2016.057
WANG XING ET AL: "Host cell proteins in biologics development: Identification, quantitation and risk assessment", BIOTECHNOLOGY AND BIOENGINEERING, WILEY ETC, vol. 103, no. 3, 15 June 2009 (2009-06-15), pages 446 - 458, XP002546192, ISSN: 0006-3592, [retrieved on 20090422], DOI: 10.1002/BIT.22304
CATALIN DONEANU ET AL: "Analysis of host-cell proteins in biotherapeutic proteins by comprehensive online two-dimensional liquid chromatography/mass spectrometry", MABS, vol. 4, no. 1, 1 January 2012 (2012-01-01), US, pages 24 - 44, XP055297897, ISSN: 1942-0862, DOI: 10.4161/mabs.4.1.18748
ANNE LUISE TSCHELIESSNIG ET AL: "Host cell protein analysis in therapeutic protein bioprocessing - methods and applications", BIOTECHNOLOGY JOURNAL, vol. 8, no. 6, 22 June 2013 (2013-06-22), DE, pages 655 - 670, XP055378442, ISSN: 1860-6768, DOI: 10.1002/biot.201200018
KATRIN BOMANS ET AL: "Identification and Monitoring of Host Cell Proteins by Mass Spectrometry Combined with High Performance Immunochemistry Testing", PLOS ONE, vol. 8, no. 11, 27 November 2013 (2013-11-27), pages e81639, XP055413596, DOI: 10.1371/journal.pone.0081639
DANIEL G. BRACEWELL ET AL: "The future of host cell protein (HCP) identification during process development and manufacturing linked to a risk-based management for their control", BIOTECHNOLOGY AND BIOENGINEERING, vol. 112, no. 9, 1 September 2015 (2015-09-01), pages 1727 - 1737, XP055368145, ISSN: 0006-3592, DOI: 10.1002/bit.25628
BATZER ET AL., NUCLEIC ACID RES., vol. 19, 1991, pages 5081
OHTSUKA ET AL., J. BIOL. CHEM., vol. 260, 1985, pages 2605 - 2608
ROSSOLINI ET AL., MOL. CELL. PROBES, vol. 8, 1994, pages 91 - 98
UWE GOTTSCHALK: "Process Scale Purification of Antibodies", 2011, JOHN WILEY & SONS
G. SUBRAMANIAN: "Antibodies Vol 1 Production and Purification", 2013, SPRINGER SCIENCE & BUSINESS MEDIA
GARY C. HOWARD: "Basic Methods in Antibody Production and Characterization", 2000, CRC PRESS
JULIAN WHITELEGGE: "Protein Mass Spectrometry", 2008, ELSEVIER
MICHAEL KINTER: "Protein Sequencing and Identification Using Tandem Mass Spectrometry", 2005, JOHN WILEY & SONS
GUODONG CHEN: "Characterization of Protein Therapeutics using Mass Spectrometry", 2014, SPRINGER SCIENCE & BUSINESS MEDIA
"Antibody Expression and Production", 2011, SPRINGER PUBLISHING
LEADER ET AL.: "Protein therapeutics: a summary and pharmacological classification", NATURE REVIEWS DRUG DISCOVERY, vol. 7, 2008, pages 21 - 39
SAMBROOK ET AL.: "MOLECULAR CLONING: A LABORATORY MANUAL", vol. 1 -4, 2012, COLD SPRING HARBOR PRESS
FAN ET AL., PHARM. BIOPROCESS., vol. 1, no. 5, 2013, pages 487 - 502
MORRE, G., THE JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, vol. 199, 1967, pages 519 f
LEIBOVITZ, A. ET AL., AMER. J. OF HYGIENE, vol. 78, 1963, pages 173 ff
HAM, R. ET AL., PROC. NATL. ACAD. SC., vol. 53, 1965, pages 288 ff
ISCOVES ET AL., J. EXP. MED., vol. 1, 1978, pages 923
R. IAN FRESNEY: "Culture of Animal cells, a manual", 2000, WILEY-LISS
SAMBROOK ET AL.: "MOLECULAR CLONING: A LABORATORY MANUAL", vol. 1 - 4, 2012, COLD SPRING HARBOR PRESS
"Biological Sciences", vol. 556, BACILLUS GENETIC STOCK CENTER, pages: 484
ZHANG, J ET AL.: "PEAKS DB: de novo sequencing assisted database search for sensitive and accurate peptide identification", MOL.CELL PROTEOMICS, vol. 4, no. 11, 2012, pages 111
WALLE VAN, I ET AL.: "Immunogenicity screening in protein drug development", EXPERT OPIN BIOL THER, vol. 7, no. 3, 2007, pages 405, XP009174848, DOI: doi:10.1517/14712598.7.3.405
J. C. SILVA ET AL.: "Absolute quantification of proteins by LCMSE: a virtue of parallel MS acquisition", MOL. CELL PROTEOMICS., vol. 5, no. 1, 2006, pages 144, XP002682650, DOI: doi:10.1074/mcp.m500230-mcp200
C. E. DONEANU ET AL.: "Analysis of host-cell proteins in biotherapeutic proteins by comprehensive online two-dimensional liquid chromatography/mass spectrometry", MABS, vol. 4, no. 1, 2012, pages 24, XP055297897, DOI: doi:10.4161/mabs.4.1.18748
A. FARRELL ET AL.: "Quantitative Host Cell Protein Analysis Using Two Dimensional Data Independent LC-MS(E", ANAL. CHEM., vol. 87, no. 18, 2015, pages 9186
M. R. SCHENAUER; G. C. FLYNN; A. M. GOETZE: "Identification and quantification of host cell protein impurities in biotherapeutics using mass spectrometry", ANAL. BIOCHEM., vol. 428, no. 2, 2012, pages 150, XP055297904, DOI: doi:10.1016/j.ab.2012.05.018
Q. ZHANG ET AL.: "Comprehensive tracking of host cell proteins during monoclonal antibody purifications using mass spectrometry", MABS, vol. 6, no. 3, 2014, pages 659
N. E. LEVY ET AL.: "Identification and characterization of host cell protein product-associated impurities in monoclonal antibody bioprocessing", BIOTECHNOL. BIOENG., vol. 111, no. 5, 2014, pages 904
V. N. SISODIYA ET AL.: "Studying host cell protein interactions with monoclonal antibodies using high throughput protein A chromatography", BIOTECHNOL. J., vol. 7, no. 10, 2012, pages 1233, XP055073925, DOI: doi:10.1002/biot.201100479
R. D. TARRANT ET AL.: "Host cell protein adsorption characteristics during protein A chromatography", BIOTECHNOL. PROG., vol. 28, no. 4, 2012, pages 1037, XP055073820, DOI: doi:10.1002/btpr.1581
J. C. SILVA ET AL.: "Absolute quantification of proteins by LCMSE: a virtue of parallel MS acquisition", MOL. CELL PROTEOMICS, vol. 5, no. 1, 2006, pages 144, XP002682650, DOI: doi:10.1074/mcp.m500230-mcp200
NIAN, R. ET AL.: "Advance chromatin extraction improves capture performance of protein A affinity chromatography", J CHROMATOGR A, vol. 1431, 2016, pages 1 - 7, XP029388321, DOI: doi:10.1016/j.chroma.2015.12.044

Attorney, Agent or Firm:

COLLAZO, Diana M. et al. (US)

Download PDF:

View/Download PDF PDF Help

Claims:

WHAT IS CLAIMED IS:

1. A simple method of rapidly analyzing a sample, e.g., to provide an assessment of the risk of a protein (e.g., the risk the protein presents if present as a contaminant in a

preparation, e.g., a preparation to be administered to a subject, e.g., a pharmaceutical

preparation), the method comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the sample, which comprises the protein and optionally a product (e.g., a recombinant polypeptide, e.g., an antibody, enzyme, or cytokine), produced via a process; and

ii) a denaturant, e.g., deoxycholate and urea,

under conditions, e.g., concentrations of denaturant, that denatures the protein in the sample at temperature of between 10 and 30 °C, e.g., 18-26 °C, e.g., 20+3°C, 20+2°C, 20+l°C, or 20°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture (e.g., an aliquot of sample mixture from (a)); and ii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products;

c) separating the protein digestion products using chromatography, e.g., 1 -dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products; and

d) assigning a protein risk score to a protein identified in the sample,

thereby analyzing the sample and providing an assessment of the risk of a protein.

2. A simple method of rapidly analyzing a sample to provide an assessment of the risk of a protein, the method comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the sample, which comprises the protein (e.g., a HCP) and optionally a therapeutic product (e.g., a recombinant polypeptide), produced via a process; and

ii) a first denaturant, e.g., guanidine hydrochloride,

under conditions, e.g., concentrations of a first denaturant, that denatures the protein in the sample at temperature of between 30 and 60 °C, e.g., 45-55 °C, e.g., 50+3°C, 50+2°C, 50+1 °C, or 50°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture;

ii) a second denaturant, e.g., urea; and

iii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products;

c) separating the protein digestion products using chromatography, e.g., 1 -dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products;

d) assigning a protein risk score to a protein identified in the sample,

thereby analyzing the sample and providing an assessment of the risk of a protein.

3. The method of either of claims 1 or 2, further comprising evaluating a plurality of different samples, each made by a different process, e.g., evaluating at least 2, 4, 8, 10, 50, 96, 100, 192, 200, 500 or 1,000, different samples.

4. The method of claim 3, further comprising comparing the assessment of risk for a first and a second different sample.

5. The method of claim 4, further comprising, responsive to the comparison, selecting, a process for producing the product.

6. The method of claim 4, further comprising, responsive to the comparison, selecting, classifying, or further processing one of the samples.

7. A method of evaluating a process of making a product, e.g., an evaluation that incorporates assessment of the risk presented by a protein other than the product, e.g., a contaminant, produced by the process, comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the protein and optionally a product (e.g., a recombinant polypeptide, e.g., an antibody, enzyme, or cytokine), produced by the process; and

ii) a denaturant, e.g., deoxycholate and urea

under conditions, e.g., concentrations of denaturant, that denatures the protein in the sample, e.g., at temperature of between 10 and 30 °C, e.g., 18-26 °C, e.g., 20+3°C, 20+2°C, 20+1 °C, or 20°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture (e.g., an aliquot of sample mixture from a)); and ii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products; c) separating the protein digestion products, e.g., by using chromatography, e.g., 1- dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products; and

d) assigning a protein risk score to a protein identified in the sample,

thereby evaluating a process of making a product, e.g., an evaluation that incorporates assessment of the risk presented by a protein other than the product, e.g., a contaminant, produced by the process.

8. A method of evaluating a process of making a product, e.g., an evaluation that incorporates assessment of the risk presented by a protein other than the product, e.g., a contaminant, produced by the process, comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the protein and optionally a product (e.g., a recombinant polypeptide, e.g., an antibody, enzyme, or cytokine), produced by the process; and

ii) a first denaturant, e.g., guanidine hydrochloride,

under conditions, e.g., concentrations of denaturant, that denatures the protein in the sample, e.g., at temperature of between 30 and 60 °C, e.g., 45-55 °C, e.g., 50+3°C, 50+2°C, 50+1 °C, or 50°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture (e.g., an aliquot of sample mixture from a));

ii) a second denaturant, e.g., urea; and

iii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products; c) separating the protein digestion products, e.g., by using chromatography, e.g., 1- dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products; and

d) assigning a protein risk score to a protein identified in the sample,

thereby evaluating a process of making a product, e.g., an evaluation that incorporates assessment of the risk presented by a protein other than the product, e.g., a contaminant, produced by the process.

9. The method of either of claims 7 or 8, further comprising evaluating a plurality of different processes of making a product, e.g., evaluating at least 2, 4, 8, 10, 50, 96, 100, 192, 200, 500 or 1,000, different processes.

10. The method of claim 9, further comprising comparing the evaluation for a first and a second different process.

11. The method of claim 10, further comprising, responsive to the comparison, selecting a process of making the product.

12. The method of any of claims 1-11, wherein the protein is a contaminant or other undesirable component (e.g., a fragment, denatured, or mis-folded version of a product being produced by a set of conditions, or a host cell protein (HCP) or fragment thereof).

13. The method of any of claims 1-12, wherein the denaturant, first denaturant, or second denaturant comprises, consists of, or consists essentially of deoxycholate and urea, guanidine hydrochloride, or urea and guanidine hydrochloride.

14. The method of any of claims 1-13, wherein the concentration of denaturant in the sample mixture is higher than the concentration of denaturant in the sample/enzyme mixture.

I l l

15. The method of any of claims 1-14, wherein:

the concentration of denaturant in sample mixture is sufficiently high to denature the protein, e.g., wherein at least 50, 60, 70, 80, 90, 95, 96, 97, 98, 99, or 100% of the protein is denatured; and

the concentration of denaturant in sample/enzyme mixture is sufficiently low to not denature the enzyme, e.g., wherein less than 50, 40, 30, 20, 10, 5, 4, 3, 2, or 1% of the enzyme is denatured.

16. The method of any of claims 1-15, wherein the concentration of the denaturant in the sample mixture is:

i) at least 1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.5, or 8;

ii) 1-10 M, 2-9 M, 3-8 M, 4-7 M, 5-7 M, 6-6.6 M, 6 M, 6.6 M, or 8 M;

iii) 0-10 M, 2-9 M, 3-8 M, 4-7 M, 5-7 M, 0.5-5 M, 0.5-2 M, 0.5 M, 1 M, or 2 M; or iv) 0.01%-50%, l%-40%, l%-20%, 0.5%-10%, 0.01%-5%, or 0.1%-2% (m/v).

17. The method of any of claims 1-16, wherein the sample mixture comprises a first denaturant (i.e. the denaturant of (a)(ii)), and the sample/enzyme mixture comprises the first denaturant and a second denaturant.

18. The method of claim 17, wherein the first denaturant is guanidine hydrochloride and the second denaturant is urea.

19. The method of either of claims 17 or 18, wherein the concentration of the first denaturant in the sample mixture is 1-10 M, 2-9 M, 3-8 M, 4-7 M, 5-7 M, 6-6.6 M, 6 M, 6.6 M, or 8 M.

20. The method of any of claims 17-19, wherein the concentration of the first denaturant in the sample/enzyme mixture is:

i) 0-10 M, 2-9 M, 3-8 M, 4-7 M, 5-7 M, 0.5-5 M, 0.5-2 M, 0.5 M, 1 M, or 2 M;

ii) less than or equal to 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.1 M; or

iii) ii) less than or equal to 0.5 or 0.1 M, e.g., essentially 0 M.

21. The method of any of claims 17-20, wherein the concentration of the second denaturant in the sample/enzyme mixture is:

i) less than or equal to 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.1 M;

ii) less than or equal to 0.5 or 0.1 M, e.g., essentially 0 M; or

iii) less than or equal to 2 M or 0.5 M..

22. The method of any of claims 1-21, wherein:

the pH of the sample mixture is sufficiently low that deamidation reactions are substantially inhibited e.g., wherein at least 50, 60, 70, 80, 90, 95, 96, 97, 98, 99, or 100% of the asparagine and glutamine side chains of the protein are unaltered, and

the pH of the sample/enzyme mixture is sufficiently high that the enzyme is active, e.g., wherein the enzyme is at least 50, 60, 70, 80, 90, or 100% active, e.g., operating at 50, 60, 70, 80, 90, or 100% efficiency compared to maximum efficiency..

23. The method of any of claims 1-22, wherein the pH of the sample mixture is 5.5+ 1, 0.75, 0.5, or 0.25 (e.g., 5.5+0.5) and the pH of the sample/enzyme mixture is 7.3+ 1, 0.75, 0.5, or 0.25 (e.g., 7.3+0.5).

24. The method of any of claims 1-23, wherein the pH of the sample mixture is 5.5 and the pH of the sample/enzyme mixture is 7.3.

25. The method of any of claims 1-24, wherein the method does not comprise alkylation of cysteine residues of the protein or protein digestion products.

26. The method of any of claims 1-25, wherein the sample mixture and/or

sample/enzyme mixture comprise a reducing agent, e.g., Tris(2-carboxyethyl)phosphine (TCEP), dithiothreitol (DTT), or beta-mercaptoethanol.

27. The method of claim 26, wherein the concentration of reducing agent, e.g., Tris(2- carboxyethyl)phosphine (TCEP), dithiothreitol (DTT), or beta-mercaptoethanol, in the sample mixture is higher than in sample/enzyme mixture.

28. The method of either of claims 26 or 27, wherein:

the concentration of reducing agent, e.g., Tris(2-carboxyethyl)phosphine (TCEP), dithiothreitol (DTT), or beta-mercaptoethanol, in the sample mixture is sufficiently high to substantially reduce the cysteines of the protein, e.g., wherein at least 50, 60, 70, 80, 90, 95, 96, 97, 98, 99, or 100% of the cysteine residues of the protein are reduced; and

the concentration of reducing agent, e.g., Tris(2-carboxyethyl)phosphine (TCEP), dithiothreitol (DTT), or beta-mercaptoethanol, in the sample/enzyme mixture is sufficiently low to not interfere with other steps of the method or method of manufacturing, e.g., wherein the reducing agent does not significantly accumulate in equipment (e.g., mass spectrometer or analytical column) or produce additional signal in data (e.g., mass spectrometry data).

29. The method of any of claims 26-28, wherein the concentration of the reducing agent in the sample mixture is at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 mM, e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 mM.

30. The method of any of claims 26-29, wherein the concentration of the reducing agent in the sample/enzyme mixture is less than or equal to 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0.5, or 0.1 mM, e.g., 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0.5, or 0.1 mM.

31. The method of any of claims 26-30, wherein the reducing agent is Tris(2- carboxyethyl)phosphine (TCEP).

32. The method of any of claims 1-31, wherein the protein digestion products are separated on the basis of one or more (e.g., one, two, three or more) of size, charge, or affinity.

33. The method of any of claims 1-32, wherein separating the protein digestion products comprises using chromatography, e.g., 1-dimensional chromatography, e.g., affinity chromatography, gel filtration chromatography, ion exchange chromatography, reversed phase chromatography, hydrophobic interaction chromatography, high performance liquid

chromatography (HPLC), gas chromatography (GC), capillary electrophoresis, ion mobility, or any chromatographic method described herein.

34. The method of any of claims 1-33, wherein (c) further comprises providing the identity of a protein digestion product, e.g., by mass spectroscopy, e.g., LC/MS, tandem mass spectrometry, or RP-LCMS².

35. The method of any of claims 1-34, wherein (c) comprises separating the protein digestion products using chromatography, e.g., 1-dimensional chromatography, and providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, tandem mass spectrometry, or ID RP-LCMS .

36. The method of any of claims 1-35, wherein a plurality, e.g., at least 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100% of protein digestion products in at least 2, 10, 20, 96, 100, 192,

1,000, or 10,000 samples are classified or assigned an identity, structure or composition.

37. The method of any of claims 1-36, wherein the protein risk score is a function of more of: an unwanted, e.g., off-target, property in a subject to receiving a preparation comprising the protein and, optionally, a product, e.g., immunogenicity;

an unwanted effect of the protein in a preparation of the product, e.g., a preparation of a drug, e.g., the propensity to cause denaturation, precipitation, or color; and

a value for the abundance of the protein present in the sample.

38. The method of any of claims 1-37, wherein step (d) is repeated to provide a protein risk score for one or more (e.g., at least 2, 10, 50, 100, 200, 500, 1000, or all) proteins identified in the sample.

39. The method of any of claims 1-38, wherein step (d) comprises providing a protein risk score, e.g., an immunogenicity risk score, e.g., as generated by the Epibase® platform.

40. The method of any of claims 1-39, wherein step (d) comprises providing a immunogenicity risk score as generated by the Epibase® platform.

41. The method of any of claims 1-40, further comprising providing a process risk score to the sample.

42. The method of claim 41, wherein the process risk score is a function of the one or more protein risk scores of the sample's proteins.

43. The method of either of claims 41 or 42, wherein the process risk score is calculated based upon the formula: Process Risk Score = ^([Protein Abundance] x [Immunogenicity Risk Score])

44. The method of any of claims 41-43, wherein the method is repeated to analyze a plurality of samples, e.g., at least 2, 10, 50, 96, 10, 192, or 1000, and

wherein two or more (e.g., all) of the samples are provided using a different process or method of manufacturing, and wherein a process risk score is provided for a plurality of samples (e.g., all samples), or

wherein two or more (e.g., all) of the samples are provided at different time points during a process or method of manufacturing, and wherein a process risk score is provided for a plurality of samples (e.g., all samples).

45. The method of any of claims 41-44, comprising comparing the process risk score of a process or method of manufacturing, e.g., the process or method of manufacturing used to provide the sample, with a reference.

46. The method of any of claims 41-45, comprising comparing the process risk score of a first process or method of manufacturing with a process risk score for a second process or method of manufacturing.

47. The method of any of claims 41-45, comprising comparing the process risk score of a process at a first time point with the process risk score of a process at a second time point.

48. The method of claim 46, comprising, responsive to the comparison, selecting one of the processes or methods of manufacturing, e.g., for further analysis or for further use, e.g., to make the product, e.g., recombinant polypeptide.

49. A database (e.g., memorialized or recorded on a computer readable medium) comprising a library of identifying characteristics for HCPs or protein digestion products and protein risk scores derived from cell culture supernatant of a cell culture (e.g., a CHO, eg., a GS- CHO, cell culture).

Description:

PROTEOMIC ANALYSIS OF HOST CELL PROTEINS

RELATED APPLICATIONS

This application claims priority to U.S. Serial No.: 62/374,489, filed August 12, 2016, the entire contents of which is incorporated herein by reference.

FIELD OF THE INVENTION

The present disclosure relates to methods of detecting and/or quantifying host cell protein impurities during the production of a product, e.g., a recombinant protein, e.g., an antibody.

BACKGROUND

Host cell protein (HCP) is an unwanted complex mixture of host proteins which may present in the final product after various manufacturing process. Those HCPs can pose risks to, inter alia, product efficacy and patient safety.

Previously, HCP impurity testing has been accomplished using ELISA-based methods and/or advances in the field of proteomics. HCP impurity testing utilizing proteomics has been based on 2-dimensional chromatography separations. This approach is powerful, but lacks sufficient throughput to be used as a routine process development tool. Lack of such a tool means that proteomic HCP assessment has often been reactive and limited to analysis of a product following process development rather than as a source of information on which development decisions can be based. Therefore, a need exists for methods of detecting and quantifying HCPs in biopharmaceutical products in a simple, rapid, high throughput manner.

SUMMARY

The invention pertains, in part, to the development of methods of rapidly analyzing a sample, e.g., a plurality of samples, comprising a protein, e.g., proteins, e.g., HCPs or fragments thereof, produced by processes or methods of manufacturing a product, e.g., recombinant polypeptide, to assess the risk the protein poses as a contaminant in a final formulated product, e.g., recombinant polypeptide. For example, a sample of recombinant polypeptide produced by a process or method of manufacturing may be analyzed by a method described herein to quickly assess the risk any protein, e.g., proteins, e.g., HCPs or fragments thereof, might pose as a contaminant. In such an exemplary method, many samples can be quickly and accurately assessed to assess the risk of many proteins, in such a way to allow for the comparison of the risk of contaminants associated with a process or method of manufacturing. A method of the invention may thus be useful to evaluate, differentiate, and select between processes or methods of manufacturing. Method of the invention may further be useful to evaluate, differentiate, and select between samples based on the assessment of the risk associated with proteins, as well as for monitoring a process or method of manufacturing to determine the ongoing development of proteins and associated contaminant risks.

In one aspect, the invention provides a simple method of rapidly analyzing a sample, e.g., to provide an assessment of the risk of a protein (e.g., the risk the protein presents if present as a contaminant in a preparation, e.g., a preparation to be administered to a subject, e.g., a pharmaceutical preparation), the method comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the sample, which comprises the protein and optionally a product (e.g., a recombinant polypeptide, e.g., an antibody, enzyme, or cytokine), produced via a process; and

ii) a denaturant, e.g., deoxycholate and urea,

under conditions, e.g., concentrations of denaturant, that denatures the protein in the sample at temperature of between 10 and 30 °C, e.g., 18-26 °C, e.g., 20+3°C, 20+2°C, 20+l°C, or 20°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture (e.g., an aliquot of sample mixture from (a)); and ii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture, under conditions in which the enzyme maintains substantial activity and reacts with, e.g cleaves, the protein to provide protein digestion products;

c) separating the protein digestion products using chromatography, e.g., 1 -dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products; and

d) assigning a protein risk score to a protein identified in the sample,

thereby analyzing the sample and providing an assessment of the risk of a protein.

In another aspect, the invention provides a simple method of rapidly analyzing a sample to provide an assessment of the risk of a protein, the method comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the sample, which comprises the protein (e.g., a HCP) and optionally a therapeutic product (e.g., a recombinant polypeptide), produced via a process; and

ii) a first denaturant, e.g., guanidine hydrochloride,

under conditions, e.g., concentrations of a first denaturant, that denatures the protein in the sample at temperature of between 30 and 60 °C, e.g., 45-55 °C, e.g., 50+3°C, 50+2°C, 50+1 °C, or 50°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture;

ii) a second denaturant, e.g., urea; and

iii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products; c) separating the protein digestion products using chromatography, e.g., 1 -dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products;

d) assigning a protein risk score to a protein identified in the sample,

thereby analyzing the sample and providing an assessment of the risk of a protein.

In another aspect, the invention provides a method of evaluating a process of making a product, e.g., an evaluation that incorporates assessment of the risk presented by a protein other than the product, e.g., a contaminant, produced by the process, comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the protein and optionally a product (e.g., a recombinant polypeptide, e.g., an antibody, enzyme, or cytokine), produced by the process; and

ii) a denaturant, e.g., deoxycholate and urea, or guanidine hydrochloride, under conditions, e.g., concentrations of denaturant, that denatures the protein in the sample, e.g., at temperature of between 10 and 30 °C, e.g., 18-26 °C, e.g., 20+3°C, 20+2°C, 20+1 °C, or 20°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture (e.g., an aliquot of sample mixture from a)); and ii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products;

c) separating the protein digestion products, e.g., by using chromatography, e.g., 1- dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products; and d) assigning a protein risk score to a protein identified in the sample,

thereby evaluating a process of making a product, e.g., an evaluation that incorporates assessment of the risk presented by a protein other than the product, e.g., a contaminant, produced by the process.

In another aspect, the invention provides a method of evaluating a process of making a product, e.g., an evaluation that incorporates assessment of the risk presented by a protein other than the product, e.g., a contaminant, produced by the process, comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the protein and optionally a product (e.g., a recombinant polypeptide, e.g., an antibody, enzyme, or cytokine), produced by the process; and

ii) a first denaturant, e.g., guanidine hydrochloride,

under conditions, e.g., concentrations of denaturant, that denatures the protein in the sample, e.g., at temperature of between 30 and 60 °C, e.g., 45-55 °C, e.g., 50+3°C, 50+2°C, 50+1 °C, or 50°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture (e.g., an aliquot of sample mixture from a));

ii) a second denaturant, e.g., urea; and

iii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products;

c) separating the protein digestion products, e.g., by using chromatography, e.g., 1- dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products; and d) assigning a protein risk score to a protein identified in the sample,

thereby evaluating a process of making a product, e.g., an evaluation that incorporates assessment of the risk presented by a protein other than the product, e.g., a contaminant, produced by the process.

In another aspect, the invention provides a method of evaluating a method of

manufacturing a product, e.g., a recombinant polypeptide, e.g., an antibody, enzyme, or cytokine, to provide an assessment of risk, (e.g., the risk presented by inclusion of a protein other than the product in a preparation of the product) comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) one or more proteins and the product, e.g., recombinant polypeptide, produced via the method of manufacturing; and

ii) a denaturant, e.g., deoxycholate and urea,

under conditions, e.g., concentrations of denaturant, that denatures the protein in the sample at temperature of between 10 and 30 °C, e.g., 18-26 °C, e.g., 20+3°C, 20+2°C, 20+l°C, or 20°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture (e.g., an aliquot of sample mixture from (a)); and ii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products;

c) separating the protein digestion products using chromatography, e.g., 1 -dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products;

d) assigning a protein risk score to a protein identified in the sample; optionally wherein (d) is repeated for a plurality of proteins, e.g., all proteins identified by the protein digestion products; and

e) assigning a process risk score to the method of manufacturing,

thereby evaluating the method of manufacturing a product, e.g., a recombinant polypeptide, to provide an assessment of risk.

In another aspect, the invention provides a method of evaluating a method of

manufacturing a product, e.g., a recombinant polypeptide, e.g., an antibody, enzyme, or cytokine, to provide an assessment of risk, (e.g., the risk presented by inclusion of a protein other than the product in a preparation of the product) comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) one or more proteins and the product, e.g., recombinant polypeptide, produced via the method of manufacturing; and

ii) a first denaturant, e.g., guanidine hydrochloride,

under conditions, e.g., concentrations of denaturant, that denatures the protein in the sample at temperature of between 30 and 60 °C, e.g., 45-55 °C, e.g., 50+3°C, 50+2°C, 50+l°C, or 50°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture (e.g., an aliquot of sample mixture from (a));

ii) a second denaturant, e.g., urea, and

iii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products;

c) separating the protein digestion products using chromatography, e.g., 1 -dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products;

d) assigning a protein risk score to a protein identified in the sample;

optionally wherein (d) is repeated for a plurality of proteins, e.g., all proteins identified by the protein digestion products; and

e) assigning a process risk score to the method of manufacturing,

thereby evaluating the method of manufacturing a product, e.g., a recombinant polypeptide, to provide an assessment of risk.

In another aspect, the invention provides a method of manufacturing a product, e.g., a recombinant polypeptide, comprising providing a sample comprising the product, wherein the sample is analyzed by a method of analyzing a sample described herein.

In another aspect, the invention provides a database (e.g., memorialized or recorded on a computer readable medium) comprising a library of identifying characteristics for HCPs or protein digestion products and protein risk scores derived from cell culture supernatant of a cell culture (e.g., a CHO, eg., a GS-CHO, cell culture).

Among the advantages of the invention is that the methods disclosed herein allow for a simple, rapid risk assessment, e.g., immunogenicity assessment, dissociation, product stability, to be performed for any production system where the genome is known, or for specific variants of the production system (e.g., GS CHO specifically as a subset of CHO). Risk, e.g.,

immunogenicity, assessment can be performed for different patient populations (e.g. by geographic area or ethnicity). This is important since an overall average score for a protein contaminant, e.g., HCP, for the global population may mask a high score for a single particularly susceptible group. The risk, e.g., immunogenicity, calculation is fully integrated into the development process.

BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a top level overview of an analytical process according to the invention. FIG. 2 is a graphical representation showing pi values for each protein quantified to determine if pDADMAC-mediated HCP removal was correlated with protein pi.

FIG. 3 is an LC-MS profile for the Protein SET.

FIG. 4 is a graph showing a comparison of overall product degradation risk score for different products and cell lines processed under different purification conditions.

FIG. 5 is a graph showing a comparison of phospholipase B abundance for different products and cell lines processed under different purification conditions.

FIG. 6 is a graph showing a comparison of cathepsin D abundance for different products and cell lines processed under different purification conditions.

FIG. 7 is a graph showing a comparison of overall immunogenicity risk score for different products and cell lines processed under different purification conditions.

FIG. 8 is a graph showing a comparison of total HCP abundance in culture supernatant treated with alternative protein removal methods.

FIG. 9 is a graph showing a comparison of product dissociation risk scores in culture supernatant treated with alternative protein removal methods.

FIG. 10 is a graph showing a comparison of product degradation risk scores in culture supernatant treated with alternative protein removal methods.

FIG. 11 is a graph showing a comparison of example specific HCP abundances in culture supernatant treated with alternative protein removal methods in comparison to total HCP levels.

FIG. 12 is a schematic showing methanol detoxification pathways and changes in protein expression in pAOX induction.

DETAILED DESCRIPTION

For recombinant biopharmaceutical proteins to be acceptable for administration to human patients, it is important that residual contaminants resulting from the manufacture and purification process are removed from the final biological product, e.g., recombinant

polypeptide. These process contaminants include culture medium proteins, immunoglobulin affinity ligands, viruses, endotoxin, DNA, and proteins, e.g., host cell proteins (HCPs).

Contaminant proteins, e.g., HCPs, may generate a range of undesirable effects that may impact on the safety profile of a product, including immune response, adjuvant activity, direct biological activity or product interaction/degradation. These host cell contaminants include process- specific proteins, e.g., HCPs, which are process-related impurities/contaminants in the biologies derived from recombinant DNA technology (e.g., recombinant polypeptides).

U.S. and foreign regulations often require removal of such contaminants. For example, the U.S. Food and Drug Administration (FDA) requires that biopharmaceuticals intended for in vivo human use should be as free as possible of extraneous immunoglobulin and non- immunoglobulin impurities, and requires tests for detection and quantitation of potential impurities, such as proteinss, e.g., HCPs. As well, the International Conference on

Harmonization (ICH) provides guidelines on test procedures and acceptance criteria for bio technologic al/biologic al products .

Based mainly on the analytical challenge in detection and quantitation of impurity proteins present at trace levels, measurement of contaminant proteins, e.g., HCPs, in

bio therapeutics has historically been performed on an aggregate basis using ELISA

methodology, with results reported as a total grammage of proteins, e.g., HCPs, without information on identities and relative levels within the population. Contaminant protein, e.g., HCP, ELISAs are developed based on combinations of total protein and protein fractions from null-transfected cell lines and are performed on the inherent assumption that the contaminant proteins, e.g., HCP, load in the analyte varies only in abundance and not in composition.

Current proteomics based methods are limited in application as they do not have sufficient throughput to be used as a routine tool for manufacturing and process development, product monitoring, and analysis. Regulatory agencies require risk assessment to be associated with each contaminating protein, which existing proteomics based methods have not addressed in a rapid, scalable manner. Lack of such a rapid, high throughput tool means that proteomic protein, e.g., HCP, contaminant assessment is often reactive and limited to analysis of a small number of samples in process development rather than as a routine tool on which development decisions can be based.

A particular problem that manufacturers of biopharmaceuticals face is that each protein, e.g., HCP, has a specific, individual activity resulting in highly variable risk profiles for different proteins, e.g., HCP, populations at similar total abundances. The mixture of proteins, e.g., HCPs, that are present as impurities can vary substantially between products, especially as it is know that proteins, e.g., HCPs, can "piggyback" through purification by direct binding to the protein therapeutic. This can result in a number of issues including product and surfactant degradation, adjuvant activity and adverse immune response when the therapeutic is administered.

The invention allows the identification of which specific proteins, e.g., HCPs, are present in a protein therapeutic at all manufacturing scales, as well as all manufacturing and purification stages of the therapeutic protein and with sufficient throughput to support routine process development.

The present disclosure describes a proteomic analysis of proteins, e.g., HCPs, in products, e.g., purified therapeutic products or recombinant polypeptides, using proteomics in a format compatible with commercial demands, including e.g., manufacturing process development, production monitoring, and analysis of final products.

This process incorporates an integrated clinical risk assessment for each protein, e.g., HCP, based on in silico prediction of immunogenicity. These predictions are based on the specific production system and can be made against defined patient sub-populations that may be particularly susceptible to certain protein, e.g., HCP, epitopes (e.g. by geographic area or ethnicity). The specific combination of high-throughput analysis methods and integrated impurity risk assessment allows routine decisions to be made based on calculated clinical risk during the manufacturing process development. These conclusions can be made with high confidence as both the specific genome of the production system and the full range of potential patient responses are taken into account.

The proteomic based analysis of proteins, e.g., HCPs, disclosed herein can be used throughout the entire manufacturing process, thus avoiding development of different assays to detect proteins, e.g., HCPs, during scalablity of manufacturing, i.e., the method is independent of scale. The use of the same monitoring method throughout the entire manufacturing process from small scale to large scale avoids multiple issues such as the need to develop multiple monitoring methods at different manufacturing scales, along with concomitant errors in translation of the output from one scale to another scale. Rather, the proteomic method provides a simple, consistent, reproducible, rapid method that can be used independent of manufacturing scale. It also ensures that the same set of proteins, e.g., HCPs, are monitored at different stages of the process to show the changes in proteins, e.g., HCPs, for example, during cell culture, protein expression, as well as before and after purification. The present disclosure describes, inter alia, methods of analyzing hundreds of samples (e.g., samples derived from processes and methods of manufacturing products, e.g., recombinant polypeptides or therapeutic products) quickly by optimizing key steps of proteomic analysis, such as, for example:

applying particular combinations and levels of denaturing agents, e.g., deoxycholate, urea, and guanidine hydrochloride;

using a number of different enzymes, e.g., proteolytic enzymes, at select pH (e.g., low pH, e.g., acidic conditions), as opposed to only individual enzymes, e.g., proteolytic enzyme; and omission of alkylation step(s) without loss of result quality.

The methods of the present disclosure are capable of detecting, identifying, and quantifying the abundance of thousands of proteins (e.g., HCPs) present at very low levels in sample mixtures. The methods of the present disclosure are applicable at the start of manufacturing and processes, e.g., using cell supernatant, rather than only being suitable for analyzing purified product, e.g., recombinant polypeptide.

The methods of the present disclosure can be performed significantly faster and on a larger number of samples than commonly used methods. Exemplary methods of the present disclosure can analyze 100 samples in 3 to 5 days, whereas commonly used methods require 5- 10 days to analyze 6 samples. Conservatively this represents at least a 10-fold throughput increase. In one embodiment, throughput is improved at least by about 10-fold, 20-fold, 30-fold, 40-fold, 50-fold, 60-fold, 70-fold, 80-fold, or 90-fold.

DEFINITIONS

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although any methods and materials similar or equivalent to those described herein can be used in the practice of and/or for the testing of the present invention, the preferred materials and methods are described herein. In describing and claiming the present invention, the following terminology will be used according to how it is defined, where a definition is provided.

It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting. The articles "a" and "an" are used herein to refer to one or to more than one (i.e. , to at least one) of the grammatical object of the article. By way of example, "a cell" can mean one cell or more than one cell.

As used herein, the term "protein" or "proteins" in the context of a sample refers to any protein in a sample produced by a process or method of manufacturing that is not the desired product, e.g., the desired recombinant product, e.g., therapeutic product. In some embodiments, proteins may be host cell proteins (HCPs) or fragments thereof.

As used herein, the term "host cell protein" or "HCP" refers to any protein produced or encoded by the organism used to produce a recombinant polypeptide product and unrelated to the intended recombinant product. HCPs are undesirable in the final drug substance.

As used herein, the term "semi-quantitative" refers to the comparative assessment of different chemical species by mass spectrometry without reference to specific standards for each individual species.

As used herein, the term "endogenous" refers to any material from or naturally produced inside an organism, cell, tissue or system.

As used herein, the term "exogenous" refers to any material introduced to or produced outside of an organism, cell, tissue or system. Accordingly, "exogenous nucleic acid" refers to a nucleic acid that is introduced to or produced outside of an organism, cell, tissue or system. In an embodiment, sequences of the exogenous nucleic acid are not naturally produced, or cannot be naturally found, inside the organism, cell, tissue, or system that the exogenous nucleic acid is introduced into. In one embodiment, the sequences of the exogenous nucleic acids are non- naturally occurring sequences, or encode non-naturally occurring products.

As used herein, the term "heterologous" refers to any material from one species, when introduced to an organism, cell, tissue or system from a different species.

As used herein, the terms "nucleic acid," "polynucleotide," or "nucleic acid molecule" are used interchangeably and refers to deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), or a combination of a DNA or RNA thereof, and polymers thereof in either single- or double- stranded form. The term "nucleic acid" includes, but is not limited to, a gene, cDNA, or an mRNA. In one embodiment, the nucleic acid molecule is synthetic (e.g., chemically synthesized or artificial) or recombinant. Unless specifically limited, the term encompasses molecules containing analogues or derivatives of natural nucleotides that have similar binding properties as the reference nucleic acid and are metabolized in a manner similar to naturally or non-naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions), alleles, orthologs, SNPs, and complementary sequences as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted with mixed-base and/or deoxyinosine residues (Batzer et al., Nucleic Acid Res. 19:5081 (1991); Ohtsuka et al., J. Biol. Chem. 260:2605-2608 (1985); and Rossolini et al., Mol. Cell. Probes 8:91-98 (1994)).

As used herein, the terms "peptide," "polypeptide," and "protein" (e.g., protein when not used in the context of a method of the present invention) are used interchangeably, and refer to a compound comprised of amino acid residues covalently linked by peptide bonds, or by means other than peptide bonds. A protein or peptide must contain at least two amino acids, and no limitation is placed on the maximum number of amino acids that can comprise a protein's or peptide's sequence. In one embodiment, a protein may comprise of more than one, e.g., two, three, four, five, or more, polypeptides, in which each polypeptide is associated to another by either covalent or non-covalent bonds/interactions. Polypeptides include any peptide or protein comprising two or more amino acids joined to each other by peptide bonds or by means other than peptide bonds. As used herein, the term refers to both short chains, which also commonly are referred to in the art as peptides, oligopeptides and oligomers, for example, and to longer chains, which generally are referred to in the art as proteins, of which there are many types. "Polypeptides" include, for example, biologically active fragments, substantially homologous polypeptides, oligopeptides, homodimers, heterodimers, variants of polypeptides, modified polypeptides, derivatives, analogs, fusion proteins, among others.

As used herein, "product" refers to a molecule, nucleic acid, polypeptide, or any hybrid thereof, that is produced, e.g., expressed, by a cell which has been modified or engineered to produce the product. In one embodiment, the product is a naturally occurring product or a non- naturally occurring product, e.g., a synthetic product. In one embodiment, a portion of the product is naturally occurring, while another portion of the product is non-naturally occurring. In one embodiment, the product is a polypeptide, e.g., a recombinant polypeptide. In one embodiment, the product is suitable for diagnostic or pre-clinical use. In another embodiment, the product is suitable for therapeutic use, e.g., for treatment of a disease. In one embodiment, the product is selected from Table 1 or Table 2. In one embodiment, the modified or engineered cells comprise an exogenous nucleic acid that controls expression or encodes the product. In other embodiments, the modified or engineered cells comprise other molecules, e.g., that are not nucleic acids, that controls the expression or construction of the product in the cell.

In one embodiment, the modification of the cell comprises the introduction of an exogenous nucleic acid comprising a nucleic acid sequence that controls or alters, e.g., increases, the expression of an endogenous nucleic acid sequence, e.g., endogenous gene. In such embodiments, the modified cell produces an endogenous polypeptide product that is naturally or endogenously expressed by the cell, but the modification increases the production of the product and/or the quality of the product as compared to an unmodified cell, e.g., as compared to endogenous production or quality of the polypeptide.

In another embodiment, the modification of the cell comprises the introduction of an exogenous nucleic acid encoding a recombinant polypeptide as described herein. In such embodiments, the modified cell produces a recombinant polypeptide product that can be naturally occurring or non-naturally occurring. In such embodiments, the modified cell produces a recombinant polypeptide product that can also be endogenously expressed by the cell or not. In embodiments where the recombinant polypeptide product is also endogenously expressed by the cell, the modification increases the production of the product and/or the quality of the product as compared to an unmodified cell, e.g., as compared to endogenous production or quality of the polypeptide.

As used herein, "recombinant polypeptide" or "recombinant protein" refers to a polypeptide that can be produced by a cell described herein. A recombinant polypeptide is one for which at least one nucleotide of the sequence encoding the polypeptide, or at least one nucleotide of a sequence which controls the expression of the polypeptide, was formed by genetic engineering (of the cell or of a precursor cell). E.g., at least one nucleotide was altered, e.g., it was introduced into the cell or it is the product of a genetically engineered rearrangement. In an embodiment, the sequence of a recombinant polypeptide does not differ from a naturally occurring isoform of the polypeptide or protein. In an embodiment, the amino acid sequence of the recombinant polypeptide differs from the sequence of a naturally occurring isoform of the polypeptide or protein. In an embodiment, the recombinant polypeptide and the cell are from the same species. In an embodiment, the recombinant polypeptide is endogenous to the cell, in other words, the cell is from a first species and the recombinant polypeptide is native to that first species. In an embodiment, the amino acid sequence of the recombinant polypeptide is the same as or is substantially the same as, or differs by no more than 1%, 2%, 3%, 4%, 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% from, a polypeptide encoded by the endogenous genome of the cell. In an embodiment, the recombinant polypeptide and the cell are from different species, e.g., the recombinant polypeptide is a human polypeptide and the cell is a non-human, e.g., a rodent, e.g., a CHO, or an insect cell. In an embodiment, the recombinant polypeptide is exogenous to the cell, in other words, the cell is from a first species and the recombinant polypeptide is from a second species. In one embodiment, the polypeptide is a synthetic polypeptide. In one embodiment, the polypeptide is derived from a non-naturally occurring source. In an embodiment, the

recombinant polypeptide is a human polypeptide or protein which does not differ in amino acid sequence from a naturally occurring isoform of the human polypeptide or protein. In an embodiment, the recombinant polypeptide differs from a naturally occurring isoform of the human polypeptide or protein at no more than 1, 2, 3, 4, 5, 10, 15 or 20 amino acid residues. In an embodiment, the recombinant polypeptide differs from a naturally occurring isoform of the human polypeptide by no more than 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or 15% of its amino acid residues.

"Acquire" or "acquiring" as the terms are used herein, refer to obtaining possession of a physical entity, or a value, e.g., a numerical value, by "directly acquiring" or "indirectly acquiring" the physical entity or value. "Directly acquiring" means performing a process (e.g., performing a synthetic or analytical method) to obtain the physical entity or value. "Indirectly acquiring" refers to receiving the physical entity or value from another party or source (e.g., a third party laboratory that directly acquired the physical entity or value). Directly acquiring a physical entity includes performing a process that includes a physical change in a physical substance, e.g., a starting material. Exemplary changes include making a physical entity from two or more starting materials, shearing or fragmenting a substance, separating or purifying a substance, combining two or more separate entities into a mixture, performing a chemical reaction that includes breaking or forming a covalent or non-covalent bond. Directly acquiring a value includes performing a process that includes a physical change in a sample or another substance, e.g., performing an analytical process which includes a physical change in a substance, e.g., a sample, analyte, or reagent (sometimes referred to herein as "physical analysis"), performing an analytical method, e.g., a method which includes one or more of the following: separating or purifying a substance, e.g., an analyte, or a fragment or other derivative thereof, from another substance; combining an analyte, or fragment or other derivative thereof, with another substance, e.g., a buffer, solvent, or reactant; or changing the structure of an analyte, or a fragment or other derivative thereof, e.g., by breaking or forming a covalent or non- covalent bond, between a first and a second atom of the analyte; or by changing the structure of a reagent, or a fragment or other derivative thereof, e.g., by breaking or forming a covalent or non- covalent bond, between a first and a second atom of the reagent.

"Acquiring a sample" as the term is used herein, refers to obtaining possession of a sample, e.g., a tissue sample or nucleic acid sample, by "directly acquiring" or "indirectly acquiring" the sample. "Directly acquiring a sample" means performing a process (e.g., performing a physical method such as a surgery or extraction) to obtain the sample. "Indirectly acquiring a sample" refers to receiving the sample from another party or source (e.g., a third party laboratory that directly acquired the sample). Directly acquiring a sample includes performing a process that includes a physical change in a physical substance, e.g., a starting material, such as a tissue, e.g., a tissue in a human patient or a tissue that has was previously isolated from a patient. Exemplary changes include making a physical entity from a starting material, dissecting or scraping a tissue; separating or purifying a substance (e.g., a sample tissue or a nucleic acid sample); combining two or more separate entities into a mixture; performing a chemical reaction that includes breaking or forming a covalent or non-covalent bond. Directly acquiring a sample includes performing a process that includes a physical change in a sample or another substance, e.g., as described above.

As used herein, "protein digestion products" refer to fragments of proteins, e.g., peptides, produced by the action of an enzyme, e.g., a proteolytic enzyme, for which the protein is a substrate. Examples of enzymes include trypsin, lysC, GluC, AspN, and others known in the art.

As used herein, a "protein risk score" comprises an assessment of the risk associated with a protein, e.g., a HCP or fragment thereof, as a contaminant or impurity in a product, e.g, a therapeutic product, e.g., a final formulation of a therapeutic product. The protein risk score may be a function of one or more of: an unwanted, e.g., off-target, property in a subject receiving a preparation comprising the protein and the product, e.g., immunogenicity, e.g., an unwanted immune response; an unwanted effect of the protein in a preparation of the product, e.g., a preparation of a drug, e.g., the propensity to cause denaturation, precipitation, color, or odor; and a value for the abundance of the protein present in the sample. For example, a sample, optionally comprising product, produced by a process or method of manufacturing, comprises one or more protein contaminants. A method of the invention may analyze the proteins or fragments thereof within the sample, and assign one or more protein risk scores to each protein identified in the sample. In some embodiments, a protein risk score is or comprises a value produced by

Epibase®, e.g., a immunogenicity score. In some embodiments, a protein risk score is a value of the protein abundance in the sample.

As used herein a "process risk score" comprises an assessment of the risk associated with a process or method of manufacturing, and is a function of the protein risk scores of proteins present in samples produced by the process or method of manufacturing. For example, sample A could be produced by process A and sample B could be produced by process B. Using the methods of the invention, samples A and B could be analyzed, the protein risk scores of their protein contaminants assessed, and the process risk scores of processes A and B determined, allowing for the rapid comparison of processes A and B.

As used herein, a "process" is a series of one or more operations and/or conditions that produces a sample comprising, inter alia, a protein or plurality of proteins, e.g., a HCP or fragment thereof. In some embodiments, the sample further comprises a product, e.g., a recombinant polypeptide, or a therapeutic product. For example, an exemplary process could comprise culturing a plurality of cells under conditions conducive to the expression of a recombinant polypeptide, thus producing a sample, e.g., cell culture, supernatant, or cell lysate, comprising one or more proteins, e.g., HCPs or fragments thereof, and product, e.g., recombinant polypeptide.

As used herein, a "method of manufacturing" is a series of one or more operations and/or conditions that produces a sample comprising a product, e.g., a recombinant polypeptide or a therapeutic product, and a protein or plurality of proteins, e.g., a HCP or fragment thereof. For example, an exemplary method of manufacturing could comprise culturing a plurality of cells under conditions conducive to the expression of a recombinant polypeptide, thus producing a sample, e.g., cell culture, supernatant, or cell lysate, comprising one or more proteins, e.g., HCPs or fragments thereof, and product, e.g., recombinant polypeptide. As used herein, MS means mass spectrometry.

As used herein, MS means tandem mass spectrometry.

As used herein, "substantially active" or "substantial activity" describes enzyme, e.g., a plurality of enzymes, e.g., in a sample under a set of conditions or in a step of a method or process described herein, that are at least 50, 60, 70, 80, 90, or 100% active, e.g., operating at 50, 60, 70, 80, 90, or 100% efficiency/reaction rate compared to a reference efficiency/reaction rate, e.g., the highest efficiency/reaction rate of that enzyme under ideal conditions, e.g., conditions recommended by the enzyme supplier or conditions without denaturant, reducing agent, or non- recommended pH present.

The disclosures of each and every patent, patent application, and publication cited herein are hereby incorporated herein by reference in their entirety. While this invention has been disclosed with reference to specific aspects, it is apparent that other aspects and variations of this invention may be devised by others skilled in the art without departing from the true spirit and scope of the invention. The appended claims are intended to be construed to include all such aspects and equivalent variations.

SAMPLE PREPARATION

The methods described herein, in part, recite methods for analyzing samples, e.g., samples comprising proteins, e.g. HCPs, generated by a process or method of manufacturing, e.g., samples comprising cell culture, supernatant, or cell lysate. In some embodiments, the methods described herein recite preparing the sample for analysis. In some embodiments, the analysis comprises mass spectrometry, identification of the proteins, e.g. HCPs within the sample, and/or assignment of risk scores to the proteins, e.g. HCPs and/or the process/method of manufacturing that produced the sample.

In some embodiments, preparing the sample for analysis may comprise exposing the proteins, e.g. HCPs to a denaturant. In some embodiments, the denaturant is a chaotropic agent, an acid, a base, a reducing agent, or a detergent. In some embodiments, the denaturant is selected from guanidine hydrochloride, urea, and deoxycholate. In some embodiments, the denaturant is a combination of multiple types of denaturants or multiple denaturants, e.g., urea and deoxycholate, or guanidine hydrochloride and urea. In some embodiments, preparing the sample for analysis comprises multiple steps wherein the denaturant provided in one step is different from the denaturant provided in a second step, or the concentration of a denaturant changes, e.g., is altered, from step to step. In some embodiments, the multiple steps of sample preparation may alter, e.g., decrease, e.g., by dilution, the concentration of a first denaturant, e.g., guanidine hydrochloride, and introduce or increase the concentration of a second denaturant, e.g., urea. In some embodiments, the concentration of denaturant in the sample mixture is at least 1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 6.6, 6.7, 6.8, 6.9, 7, 7.5, or 8 M. In some embodiments, the concentration of denaturant, e.g., guanidine hydrochloride, in the sample mixture is 1-10 M, 2-9 M, 3-8 M, 4-7 M, 5-7 M, 6-6.6 M, 6 M, 6.6 M, or 8 M. In some embodiments, the concentration of denaturant, e.g., urea, in the sample mixture is 0-10 M, 2-9 M, 3-8 M, 4-7 M, 5-7 M, 0.5-5 M, 0.5-2 M, 0.5 M, 1 M, or 2 M. In some embodiments, the concentration of denaturant, e.g., deoxycholate, in the sample mixture is at least 0.01%, 0.05%, 0.1%, 0.2%, 0.5%, 0.7%, 0.9%, 1%, 1.2%, 1.5%, 2%, 5%, 10%, 15%, 20%, 30%, 40%, or 50% (m/v), e,g, 0.1%-2%. In some embodiments, the concentration of denaturant, e.g., deoxycholate, in the sample mixture is 0.01%-50%, l%-40%, l%-20%, 0.5%-10%, 0.01%-5%, or 0.1%-2% (m/v). In some embodiments, the sample (e.g., sample mixture) comprises both urea and deoxycholate. In some embodiments, the sample (e.g., sample mixture) comprises both urea and deoxycholate and the concentration of urea is at least 8 M (e.g., 8 M) and the concentration of deoxycholate is at least 1% (e.g., 1%) (m/v). In some embodiments, the sample (e.g., sample mixture) comprises both urea and deoxycholate, and the concentration of urea is at least 8 M (e.g., 8 M) and the concentration of deoxycholate is at least 0.01% (e.g., 0.01%) (m/v). In some embodiments, the concentration of denaturant in the sample/enzyme mixture is less than or equal to 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.1 M. In some embodiments, the concentration of denaturant, e.g., guanidine hydrochloride, in the sample/enzyme mixture is less than or equal to 0.5 or 0.1 M, e.g., essentially 0 M. In some embodiments, the concentration of denaturant, e.g., urea, in the sample/enzyme mixture is less than or equal to 4, 3.5, 3, 2.5, 2, 1.5, 1, 0.5, or 0.1 M, e.g., less than or equal to 2 M or less than or equal to 0.5 M. In some embodiments, the concentration of denaturant, e.g., deoxycholate, in the sample/enzyme mixture is less than or equal to 2%, 1%, 0.1%, 0.05%, 0.01%, 0.005%, 0.0025%, 0.001%, or 0.0001%, e.g., essentially 0% (m/v). In some embodiments, the sample/enzyme mixture comprises both urea and deoxycholate. In some embodiments, the sample/enzyme mixture comprises both urea and deoxycholate, and the concentration of urea is less than or equal to 2 M (e.g., 2 M or 0.5 M) and the concentration of deoxycholate is less than or equal to 0.0025% (m/v). In some embodiments, samples, e.g., sample mixtures and/or sample/enzyme mixtures, comprising deoxycholate also comprise acetonitrile, e.g., at least 10, 20, 30, 40, 50, 60 70, 80, or 90% acetonitrile (e.g., 80% acetonitrile) (m/v).

In some embodiments, preparing the sample for analysis may comprise exposing the protein to a reducing agent. Disulfide bonds present in the protein may need to be reduced to denature the protein. Reducing agents contemplated are those known in the art and include Tris(2-carboxyethyl)phosphine (TCEP), dithiothreitol (DTT), or beta-mercaptoethanol (i.e. 2- mercaptoethanol). In some embodiments, the concentration of reducing agent in the sample mixture is at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 mM, e.g., at least 10 mM, e.g., 10 mM. In some embodiments, the concentration of reducing agent in the sample/enzyme mixture is less than or equal to 10, 9, 8, 7, 6, 5, 4, 3, 2, 1, 0.5, or 0.1 mM, e.g., less than or equal to 1 mM, e.g., 1 mM.

In some embodiments, preparing the sample for analysis may not comprise exposing the protein to an alkylating agent. Alkylating agents, e.g., iodoacetamide, can be used to alkylate cysteine thiol groups as one way of preventing reformation of disulfide bonds. While not wishing to be bound by theory, maintaining the presence of a reducing agent, e.g., a low concentration of a reducing agent (e.g., 1 mM reducing agent, e.g., 1 mM TCEP), throughout sample preparation may mitigate the need for an alkylating agent, e.g., to contribute to denaturation of a protein. In addition, skipping an alkylating step may expedite sample preparation time.

In some embodiments, preparing the sample for analysis may comprise providing a specific pH in the sample mixture or sample/enzyme mixture. In some embodiments, the pH of the sample mixture is 5, 5.25, 5.5, 5.75, 6, 6.25, 6.5, 6.75, 7, 7.25, 7.5, or 8. In some

embodiments, the pH of the sample mixture is 5.5. In some embodiments, the pH of the sample/enzyme mixture is 6, 6.25, 6.5, 6.75, 6.8, 6.9, 7, 7.1, 7.2, 7.3, 7.4, 7.5, 7.75, 8, 8.25, or 8.5. In some embodiments, the pH of the sample/enzyme mixture is 7, 7.2, 7.3, or 8. In some embodiments, the pH of the sample/enzyme mixture is 7.3. Applicant unexpectedly discovered that in using the methods of the invention a less than optimum pH for an enzyme reaction could be used with minimum effect on the reaction of the enzyme but an increased speed in the overall analysis, e.g., from 6 sample runs to 96 or 192 sample runs, as well as performing the entire analysis in 3-5 days (e.g., 3 days, 4 days, or 5 days).

In some embodiments, sample preparation is conducted at room temperature. In some embodiments, sample preparation, e.g., conditions for denaturation, is conducted at between 30 and 60 °C, e.g., 45-55 °C, e.g., 50+3°C, 50+2°C, 50+l°C, or 50°C. In some embodiments, sample preparation, e.g., conditions for denaturation, is conducted at between 10 and 30 °C, e.g., 18-26 °C, e.g., 20+3°C, 20+2°C, 20+l°C, or 20°C.

In some embodiments, preparing the sample for analysis may comprise providing an enzyme, e.g., a proteolytic enzyme, in the sample/enzyme mixture. The enzyme, e.g., proteolytic enzyme, may be trypsin, lysC, GluC, AspN, or other enzymes known in the art; methods of use and characteristics of said enzymes are also available in the art. In some embodiments, the enzyme, e.g., proteolytic enzyme, is present in a 1: 100, 1:90, 1:80, 1:70, 1:60, 1:50, 1:40, 1:30, 1:20, 1: 15, 1: 10, 1:8, 1:6, 1:5, 1:4, 1:3, 1:2, or 1: 1 ratio (enzyme to protein). In some

embodiments, the enzyme, e.g., proteolytic enzyme, is present in a 1:20 ratio (enzyme to protein). In some embodiments, the enzyme, e.g., proteolytic enzyme, is present in a 1:40 ratio (enzyme to protein).

ONE-DIMENSIONAL CHROMATOGRAPHY

Methods of 1-dimensional (ID) chromatography suitable for use in the methods described here are known to one of skill in the art and include, e.g., affinity chromatography, gel filtration chromatography, ion exchange chromatography, reversed phase chromatography, hydrophobic interaction chromatography. In some embodiments, the one-dimensional chromatography method is HPLC reversed phase chromatography. Chromatography can include high performance liquid chromatography (HPLC), gas chromatography (GC), capillary electrophoresis, ion mobility. See also, e.g., Process Scale Purification of Antibodies, Uwe Gottschalk 2011 John Wiley & Sons ISBN: 1118210743; Antibodies Vol 1 Production and Purification, G. Subramanian 2013 Springer Science & Business Media; Basic Methods in Antibody Production and Characterization, Gary C. Howard 2000 CRC Press.

Additional exemplary chromatographic methods include, but are not limited to, Strong

Anion Exchange chromatography (SAX), liquid chromatography (LC), high performance liquid chromatography (HPLC), ultra performance liquid chromatography (UPLC), thin layer chromatography (TLC), amide column chromatography, and combinations thereof. Exemplary mass spectrometry (MS) include, but are not limited to, tandem MS, LC-MS, LC-MS/MS, matrix assisted laser desorption ionisation mass spectrometry (MALDI-MS), Fourier transform mass spectrometry (FTMS), ion mobility separation with mass spectrometry (IMS-MS), electron transfer dissociation (ETD-MS), and combinations thereof. Exemplary electrophoretic methods include, but are not limited to, capillary electrophoresis (CE), CE-MS, gel electrophoresis, agarose gel electrophoresis, acrylamide gel electrophoresis, SDS-polyacrylamide gel

electrophoresis (SDS-PAGE) followed by Western blotting using antibodies that recognize specific glycan structures, and combinations thereof. Exemplary nuclear magnetic resonance (NMR) include, but are not limited to, one-dimensional NMR (1D-NMR), two-dimensional NMR (2D-NMR), correlation spectroscopy magnetic-angle spinning NMR (COSY-NMR), total correlated spectroscopy NMR (TOCSY-NMR), heteronuclear single-quantum coherence NMR (HSQC-NMR), heteronuclear multiple quantum coherence (HMQC-NMR), rotational nuclear overhauser effect spectroscopy NMR (ROESY-NMR), nuclear overhauser effect spectroscopy (NOESY-NMR), and combinations thereof.

MASS SPECTROMETRY

Mass spectrometry methods suitable for use in the methods described herein are known to one of skill in the art and include, e.g., electrospray ionization MS, matrix-assisted laser desportion/ionization MS, time of flight MS, fourier-transform ion cyclotron resonance MS, quadrupole time of flight MS, linear quadrupole, quadrupole ion trap MS, orbitrap, cylindrical ion trap, three dimensional ion trap, quadruple mass filter, tandem mass spectrometry. In some embodiments, the mass spectrometry is tandem mass spectrometry. See also, e.g., Protein Mass Spectrometry, Julian Whitelegge 2008, Elsevier; Protein Sequencing and Identification Using Tandem Mass Spectrometry, Michael Kinter 2005, John Wiley & Sons; Characterization of Protein Therapeutics using Mass Spectrometry, Guodong Chen 2014, Springer Science & Business Media. PRODUCTION PARAMETERS A production parameter as used herein is a parameter or element in a production process. Production parameters that can be selected include, e.g., the cell or cell line used to produce the glycoprotein preparation, the culture medium, culture process or bioreactor variables (e.g., batch, fed-batch, or perfusion), purification process and formulation of a glycoprotein preparation.

Primary production parameters include: 1) the types of host; 2) genetics of the host; 3) media type; 4) fermentation platform; 5) purification steps; and 6) formulation. Secondary production parameter, as used herein, is a production parameter that is adjustable or variable within each of the primary production parameters. Examples include: selection of host subclones based on desired glycan properties; regulation of host gene levels constitutive or inducible;

introduction of novel genes or promoter elements; media additives (e.g. partial list on Table IV); physiochemical growth properties; growth vessel type (e.g. bioreactor type, T flask); cell density; cell cycle; enrichment of product with a desired glycan type (e.g. by lectin or antibody-mediated enrichment, ion-exchange chromatography, CE, or similar method); or similar secondary production parameters clear to someone skilled in the art.

Media

The methods described herein can include determining and/or selecting a media component and/or the concentration of a media component that has a positive correlation to a desired glycan property or properties. A media component can be added in or administered over the course of glycoprotein production or when there is a change media, depending on culture conditions. Media components include components added directly to culture as well as components that are a byproduct of cell culture.

Media components include, e.g., buffer, amino acid content, vitamin content, salt content, mineral content, serum content, carbon source content, lipid content, nucleic acid content, hormone content, trace element content, ammonia content, co-factor content, indicator content, small molecule content, hydrolysate content and enzyme modulator content.

Examples of various media components are provided below:

amino acids sugar precursors

Vitamins Indicators

Carbon source (natural and Nucleosides or nucleotides

unnatural)

Salts butyrate or organics

Sugars DMSO Sera Animal derived products

Plant derived hydroly sates Gene inducers

sodium pyruvate Non natural sugars

Surfactants Regulators of intracellular pH

Ammonia Betaine or osmoprotectant

Lipids Trace elements

Hormones or growth factors minerals

Buffers Non natural amino acids

Non natural amino acids Non natural vitamins

Exemplary buffers include Tris, Tricine, HEPES, MOPS, PIPES, TAPS, bicine, BES, TES, cacodylate, MES, acetate, MKP, ADA, ACES, glycinamide and acetamidoglycine. The media can be serum free or can include animal derived products such as, e.g., fetal bovine serum (FBS), fetal calf serum (FCS), horse serum (HS), human serum, animal derived serum substitutes (e.g., Ultroser G, SF and HY; non-fat dry milk; Bovine EX-CYTE), fetuin, bovine serum albumin (BSA), serum albumin, and transferrin. When serum free media is selected lipids such as, e.g., palmitic acid and/or steric acid, can be included.

Lipids components include oils, saturated fatty acids, unsaturated fatty acids, glycerides, steroids, phospholipids, sphingolipids and lipoproteins. Exemplary amino acid that can be included or eliminated from the media include alanine, arginine, asparagine, aspartic acid, cysteine, glutamic acid, glutamine, glycine, histidine, proline, isoleucine, leucine, lysine, methionine, phenylalanine, proline, serine, threonine, tryptophan, tyrosine and valine. Examples of vitamins that can be present in the media or eliminated from the media include vitamin A (retinoid), vitamin B l (thiamine), vitamin B2 (riboflavin), vitamin B3 (niacin), vitamin B5 (pantothenic acid), vitamin B6 (pyroxidone), vitamin B7 (biotin), vitamin B9 (folic acid), vitamin. B 12 (cyanocobalamin), vitamin C (ascorbic acid), vitamin D, vitamin E, and vitamin K.

Minerals that can be present in the media or eliminated from the media include bismuth, boron, calcium, chlorine, chromium, cobalt, copper, fluorine, iodine, iron, magnesium, manganese, molybdenum, nickel, phosphorus, potassium, rubidium, selenium, silicon, sodium, strontium, sulfur, tellurium, titanium, tungsten, vanadium, and zinc. Exemplary salts and minerals include CaC12 (anhydrous), CuS04 5H20, Fe(N03).9H20, KC1, KN03, KH2P04, MgS04 (anhydrous), NaCl, NaH2P04H20, NaHC03, Na2SE3 (anhydrous), ZnS04.7H20; linoleic acid, lipoic acid, D-glucose, hypoxanthine 2Na, phenol red, putrescine 2HC1, sodium pyruvate, thymidine, pyruvic acid, sodium succinate, succinic acid, succinic acid.Na.hexahydrate, glutathione (reduced), para-aminobenzoic acid (PABA), methyl linoleate, bacto peptone G, adenosine, cytidine, guanosine, 2 '-deoxy adenosine HC1, 2'-deoxycytidine HC1, 2'-deoxyguanosine and uridine. When the desired glycan characteristic is decreased fucosylation, the production parameters can include culturing a cell, e.g., CHO cell, e.g., dhfr deficient CHO cell, in the presence of manganese, e.g., manganese present at a concentration of about 0.1 μΜ to 50 μΜ. Decreased fucosylation can also be obtained, e.g., by culturing a cell (e.g., a CHO cell, e.g., a dhfr deficient CHO cell) at an osmolality of about 350 to 500 mOsm. Osmolality can be adjusted by adding salt to the media or having salt be produced as a byproduct as evaporation occurs during production.

Hormones include, for example, somatostatin, growth hormone-releasing factor (GRF), insulin, prolactin, human growth hormone (hGH), somatotropin, estradiol, and progesterone. Growth factors include, for example, bone morphogenic protein (BMP), epidermal growth factor (EGF), basic fibroblast growth factor (bFGF), nerve growth factor (NGF), bone derived growth factor (BDGF), transforming growth factor-betal (TGF-betal), [Growth factors from U.S. Pat. No. 6,838,284 B2], hemin and NAD. Examples of surfactants that can be present or eliminated from the media include Tween-80 and pluronic F-68. Small molecules can include, e.g., butyrate, ammonia, non natural sugars, non natural amino acids, chloroquine, and betaine.

Physiochemical Parameters

Production parameters can also include physiochemical parameters. Such conditions can include temperature, pH, osmolality, shear force or agitation rate, oxidation, spurge rate, growth vessel, tangential flow, DO, C0 ₂, nitrogen, fed batch, redox, cell density and feed strategy.

Examples of physiochemical parameters that can be selected include, e.g., pH, osmolality, shear force or agitation rate, oxidation, spurge rate, growth vessel, tangential flow, batch dissolved 02, C0 ₂, nitrogen, fed batch, redox, cell density, perfusion culture, feed strategy, temperature and time of culture.

Additional production parameters are known to one of skill in the art, see e.g., Antibody Expression and Production (2011) Ed. Mohamed Al-Rubeai; Springer Publishing.

PRODUCTS AND NUCLEIC ACIDS ENCODING THEM Provided herein are methods for identifying, selecting, or making a cell or cell line capable of producing a product, e.g., cells and products as recited in the methods of the invention. The products encompassed by the present disclosure include, but are not limited to, molecules, nucleic acids, polypeptides (e.g., recombinant polypeptides, e.g., antibodies, bispecific antibodies, multispecific antibodies), or hybrids thereof, that can be produced by, e.g., expressed in, a cell. In some embodiments, the cells are engineered or modified to produce the product. Such modifications include the introducing molecules that control or result in production of the product. For example, a cell is modified by introducing an exogenous nucleic acid that encodes a polypeptide, e.g., a recombinant polypeptide, and the cell is cultured under conditions suitable for production, e.g., expression and secretion, of the polypeptide, e.g., recombinant polypeptide.

In embodiments, the cultured cells are used to produce proteins e.g., antibodies, e.g., monoclonal antibodies, and/or recombinant proteins, for therapeutic use. In embodiments, the cultured cells produce peptides, amino acids, fatty acids or other useful biochemical

intermediates or metabolites. For example, in embodiments, molecules having a molecular weight of about 4000 daltons to greater than about 140,000 daltons can be produced. In embodiments, these molecules can have a range of complexity and can include posttranslational modifications including glycosylation.

In embodiments, the polypeptide is, e.g., BOTOX, Myobloc, Neurobloc, Dysport (or other serotypes of botulinum neurotoxins), alglucosidase alpha, daptomycin, YH-16,

choriogonadotropin alpha, filgrastim, cetrorelix, interleukin-2, aldesleukin, teceleulin, denileukin diftitox, interferon alpha-n3 (injection), interferon alpha-nl, DL-8234, interferon, Suntory (gamma-la), interferon gamma, thymosin alpha 1, tasonermin, DigiFab, ViperaTAb, EchiTAb, CroFab, nesiritide, abatacept, alefacept, Rebif, eptoterminalfa, teriparatide, calcitonin, etanercept, hemoglobin glutamer 250 (bovine), drotrecogin alpha, collagenase, carperitide, recombinant human epidermal growth factor, DWP401, darbepoetin alpha, epoetin omega, epoetin beta, epoetin alpha, desirudin, lepirudin, bivalirudin, nonacog alpha, Mononine, eptacog alpha (activated), recombinant Factor VIII+VWF, Recombinate, recombinant Factor VIII, Factor VIII (recombinant), Alphnmate, octocog alpha, Factor VIII, palifermin,Indikinase, tenecteplase, alteplase, pamiteplase, reteplase, nateplase, monteplase, follitropin alpha, rFSH, hpFSH, micafungin, pegfilgrastim, lenograstim, nartograstim, sermorelin, glucagon, exenatide, pramlintide, iniglucerase, galsulfase, Leucotropin, molgramostirn, triptorelin acetate, histrelin (Hydron), deslorelin, histrelin, nafarelin, leuprolide (ATRIGEL), leuprolide (DUROS), goserelin, Eutropin, somatropin, mecasermin, enlfavirtide, Org-33408, insulin glargine, insulin glulisine, insulin (inhaled), insulin lispro, insulin deternir, insulin (RapidMist), mecasermin rinfabate, anakinra, celmoleukin, 99 mTc-apcitide, myelopid, Betaseron, glatiramer acetate, Gepon, sargramostim, oprelvekin, human leukocyte-derived alpha interferons, Bilive, insulin (recombinant), recombinant human insulin, insulin aspart, mecasenin, Roferon-A, interferon- alpha 2, Alfaferone, interferon alfacon-1, interferon alpha, Avonex' recombinant human luteinizing hormone, dornase alpha, trafermin, ziconotide, taltirelin, diboterminalfa, atosiban, becaplermin, eptifibatide, Zemaira, CTC-111, Shanvac-B, octreotide, lanreotide, ancestirn, agalsidase beta, agalsidase alpha, laronidase, prezatide copper acetate, rasburicase, ranibizumab, Actimmune, PEG-Intron, Tricomin, recombinant human parathyroid hormone (PTH) 1-84, epoetin delta, transgenic antithrombin III, Granditropin, Vitrase, recombinant insulin, interferon- alpha, GEM-21S, vapreotide, idursulfase, omnapatrilat, recombinant serum albumin,

certolizumab pegol, glucarpidase, human recombinant CI esterase inhibitor, lanoteplase, recombinant human growth hormone, enfuvirtide, VGV-1, interferon (alpha), lucinactant, aviptadil, icatibant, ecallantide, omiganan, Aurograb, pexigananacetate, ADI-PEG-20, LDI-200, degarelix, cintredelinbesudotox, Favld, MDX-1379, ISAtx-247, liraglutide, teriparatide, tifacogin, AA4500, T4N5 liposome lotion, catumaxomab, DWP413, ART-123, Chrysalin, desmoteplase, amediplase, corifollitropinalpha, TH-9507, teduglutide, Diamyd, DWP-412, growth hormone, recombinant G-CSF, insulin, insulin (Techno sphere), insulin (AERx), RGN- 303, DiaPep277, interferon beta, interferon alpha-n3, belatacept, transdermal insulin patches, AMG-531, MBP-8298, Xerecept, opebacan, AIDSVAX, GV-1001, LymphoScan, ranpirnase, Lipoxysan, lusupultide, MP52, sipuleucel-T, CTP-37, Insegia, vitespen, human thrombin, thrombin, TransMID, alfimeprase, Puricase, terlipressin, EUR-1008M, recombinant FGF-I,

BDM-E, rotigaptide, ETC-216, P-113, MBI-594AN, duramycin, SCV-07, OPI-45, Endostatin, Angiostatin, ABT-510, Bowman Birk Inhibitor, XMP-629, 99 mTc-Hynic-Annexin V, kahalalide F, CTCE-9908, teverelix, ozarelix, romidepsin, BAY-504798, interleukin4, PRX-321, Pepscan, iboctadekin, rhlactoferrin, TRU-015, IL-21, ATN-161, cilengitide, Albuferon,

Biphasix, IRX-2, omega interferon, PCK-3145, CAP-232, pasireotide, huN901-DMI, SB- 249553, Oncovax-CL, OncoVax-P, BLP-25, CerVax-16, MART-1, gplOO, tyrosinase, nemifitide, rAAT, CGRP, pegsunercept, thymosinbeta4, plitidepsin, GTP-200, ramoplanin, GRASPA, OBI-1, AC- 100, salmon calcitonin (eligen), examorelin, capromorelin, Cardeva, velafermin, 131I-TM-601, KK-220, T-10, ularitide, depelestat, hematide, Chrysalin, rNAPc2, recombinant Factor VI 11 (PEGylated liposomal), bFGF, PEGylated recombinant staphylokinase variant, V-10153, SonoLysis Prolyse, NeuroVax, CZEN-002, rGLP-1, BIM-51077, LY-548806, exenatide (controlled release, Medisorb), AVE-0010, GA-GCB, avorelin, ACM-9604, linaclotid eacetate, CETi-1, Hemospan, VAL, fast-acting insulin (injectable, Viadel), insulin (eligen), recombinant methionyl human leptin, pitrakinra, Multikine, RG-1068, MM-093, NB 1-6024, AT- 001, PI-0824, Org-39141, CpnlO, talactoferrin, rEV-131, rEV-131, recombinant human insulin, RPI-78M, oprelvekin, CYT-99007 CTLA4-Ig, DTY-001, valategrast, interferon alpha-n3, IRX- 3, RDP-58, Tauferon, bile salt stimulated lipase, Merispase, alaline phosphatase, EP-2104R, Melanotan-II, bremelanotide, ATL-104, recombinant human microplasmin, AX-200, SEMAX, ACV-1, Xen-2174, CJC-1008, dynorphin A, SI-6603, LAB GHRH, AER-002, BGC-728, ALTU-135, recombinant neuraminidase, Vacc-5q, Vacc-4x, Tat Toxoid, YSPSL, CHS-13340, PTH(l-34) (Novasome), Ostabolin-C, PTH analog , MBRI-93.02, MTB72F, MVA-Ag85A,

FARA04, BA-210, recombinant plague FIV, AG-702, OxSODrol, rBetVl, Der-pl/Der-p2/Der- p7, PR1 peptide antigen , mutant ras vaccine, HPV-16 E7 lipopeptide vaccine, labyrinthin, WT1- peptide, IDD-5, CDX-110, Pentrys, Norelin, CytoFab, P-9808, VT-111, icrocaptide, telbermin, rupintrivir, reticulose, rGRF, HA, alpha-galactosidase A, ACE-011, ALTU-140, CGX-1160, angiotensin, D-4F, ETC-642, APP-018, rhMBL, SCV-07, DRF-7295, ABT-828, ErbB2-specific immunotoxin, DT3SSIL-3, TST- 10088, PRO- 1762, Combotox, cholecystokinin-B/gastrin- receptor binding peptides, l l lln-hEGF, AE-37, trasnizumab-DMl, Antagonist G, IL-12, PM- 02734, IMP-321, rhIGF-BP3, BLX-883, CUV-1647, L-19 based ra, Re-188-P-2045, AMG-386, DC/1540/KLH, VX-001, AVE-9633, AC-9301, NY-ESO-1 (peptides), NA17.A2 peptides, CBP- 501, recombinant human lactoferrin, FX-06, AP-214, WAP-8294A, ACP-HIP, SUN-11031, peptide YY [3-36], FGLL, atacicept, BR3-Fc, BN-003, BA-058, human parathyroid hormone 1- 34, F-18-CCR1, AT-1100, JPD-003, PTH(7-34) (Novasome), duramycin, CAB-2, CTCE-0214, GlycoPEGylated erythropoietin, EPO-Fc, CNTO-528, AMG-114, JR-013, Factor XIII, aminocandin, PN-951, 716155, SUN-E7001, TH-0318, BAY-73-7977, teverelix, EP-51216, hGH, OGP-I, sifuvirtide, TV4710, ALG-889, Org-41259, rhCCIO, F-991, thymopentin, r(m)CRP, hepatoselective insulin, subalin, L19-IL-2 fusion protein, elafin, NMK-150, ALTU- 139, EN-122004, rhTPO, thrombopoietin receptor agonist, AL-108, AL-208, nerve growth factor antagonists, SLV-317, CGX-1007, INNO-105, teriparatide (eligen), GEM-OS 1, AC-162352, PRX-302, LFn-p24 fusion, EP-1043, gpEl, gpE2, MF-59, hPTH(l-34) , 768974, SYN-101, PGN-0052, aviscumnine, BIM-23190, multi-epitope tyrosinase peptide, enkastim, APC-8024, GI-5005, ACC-001, TTS-CD3, vascular-targeted TNF, desmopressin, onercept, and TP-9201.

In some embodiments, the polypeptide is adalimumab (HUMIRA), infliximab

(REMICADE™), rituximab (RITUXAN™/MAB THERA™) etanercept (ENBREL™), bevacizumab (AVASTIN™), trastuzumab (HERCEPTIN™), pegrilgrastim (NEULASTA™), or any other suitable polypeptide including bio similars and biobetters.

Other suitable polypeptides are those listed below and in Table 1 of US2016/0097074:

Table

- ^otelidns ^vBlent immune Fab, ovine CroFab'"

In embodiments, the polypeptide is a hormone, blood clotting/coagulation factor, cytokine/growth factor, antibody molelcule, fusion protein, protein vaccine, or peptide as in Table 2. Table 2. Exemplary Products

Cytokine/Growth Type I alpha-interferon Infergen factor Interferon- n3 (IFN n3) Alferon N

Interferon- la (rIFN- β) Avonex, Rebif

Interferon- lb (rIFN- β) Betaseron

Interferon-ylb (IFN γ) Actimmune

Aldesleukin (interleukin Proleukin

2(IL2), epidermal

theymocyte activating factor; ETAF

Kepivance

Palifermin (keratinocyte

Regranex growth factor; KGF)

Becaplemin (platelet-

Anril, Kineret derived growth factor;

PDGF)

Anakinra (recombinant ILl

antagonist)

Antibody molecules Bevacizumab (VEGFA Avastin

mAb) Erbitux

Cetuximab (EGFR mAb) Vectibix

Panitumumab (EGFR mAb) Campath

Alemtuzumab (CD52 mAb) Rituxan

Rituximab (CD20 chimeric Herceptin

Ab)

Orencia

Trastuzumab (HER2/Neu

Humira mAb)

Enbrel

Abatacept (CTLA Ab/Fc fusion) Remicade

Adalimumab (TNFa mAb) Amevive

Etanercept (TNF Raptiva

receptor/Fc fusion) Tysabri

Infliximab (TNFa chimeric Soliris

mAb) Orthoclone, OKT3

Alefacept (CD2 fusion

protein)

Efalizumab (CD1 la mAb)

Natalizumab (integrin a4

subunit mAb)

Eculizumab (C5mAb)

Muromonab-CD3

Other: Insulin Humulin, Novolin

Fusion Hepatitis B surface antigen Engerix, Recombivax HB proteins/Protein (HBsAg)

vaccines/Peptides HPV vaccine Gardasil

OspA LYMErix

Anti-Rhesus(Rh) Rhophylac

immunoglobulin G

Fuzeon

Enfuvirtide

Spider silk, e.g., fibrion

QMONOS

In embodiments, the protein is a multispecific protein, e.g., a bispecific antibody as shown in Table 3.

Table 3: Bispecific Formats

BAY2010112 BiTE CD3, PSMA Retargeting of T Phase I Prostate cancer

(Merrimack) ErbB3

Pharmaceuticals)

domain cytokines

(Chugai, Roche) factor X coagulation Table 4

Table 4

In some embodiments, the polypeptide is an antigen expressed by a cancer cell. In some embodiments the recombinant or therapeutic polypeptide is a tumor-associated antigen or a tumor- specific antigen. In some embodiments, the recombinant or therapeutic polypeptide is selected from HER2, CD20, 9-0-acetyl-GD3, phCG, A33 antigen, CA19-9 marker, CA-125 marker, calreticulin, carboanhydrase IX (MN/CA IX), CCR5, CCR8, CD19, CD22, CD25, CD27, CD30, CD33, CD38, CD44v6, CD63, CD70, CC123, CD138, carcinoma embryonic antigen (CEA; CD66e), desmoglein 4, E-cadherin neoepitope, endosialin, ephrin A2 (EphA2), epidermal growth factor receptor (EGFR), epithelial cell adhesion molecule (EpCAM), ErbB2, fetal acetylcholine receptor, fibroblast activation antigen (FAP), fucosyl GMl, GD2, GD3, GM2, ganglioside GD3, Globo H, glycoprotein 100, HER2/neu, HER3, HER4, insulin-like growth factor receptor 1, Lewis- Y, LG, Ly-6, melanoma- specific chondroitin- sulfate proteoglycan (MCSCP), mesothelin, MUC1, MUC2, MUC3, MUC4,

MUC5A _C, MUC5 _b, MUC7, MUC16, Mullerian inhibitory substance (MIS) receptor type II, plasma cell antigen, poly SA, PSCA, PSMA, sonic hedgehog (SHH), SAS, STEAP, sTn antigen, TNF-alpha precursor, and combinations thereof.

In some embodiments, the polypeptide is an activating receptor and is selected from 2B4 (CD244), α ₄βι integrin, β ₂ integrins, CD2, CD16, CD27, CD38, CD96, CDIOO, CD160, CD137, CEACAMl (CD66), CRTAM, CSl (CD319), DNAM-1 (CD226), GITR (TNFRSF18), activating forms of KIR, NKG2C, NKG2D, NKG2E, one or more natural cytotoxicity receptors, NTB-A, PEN-5, and combinations thereof, optionally wherein the β ₂ integrins comprise CD1 la-CD 18, CD11 b-CD 18, or CD1 lc-CD 18, optionally wherein the activating forms of KIR comprise K1R2DS1, KIR2DS4, or KIR-S, and optionally wherein the natural cytotoxicity receptors comprise NKp30, NKp44, NKp46, or NKp80.

In some embodiments, the polypeptide is an inhibitory receptor and is selected from KIR, ILT2/LIR-l/CD85j, inhibitory forms of KIR, KLRG1, LAIR-1, NKG2A, NKR-P1A, Siglec-3, Siglec-7, Siglec-9, and combinations thereof, optionally wherein the inhibitory forms of KIR comprise KIR2DL1, KIR2DL2, KIR2DL3, KIR3DL1, KIR3DL2, or KIR-L.

In some embodiments, the polypeptide is an activating receptor and is selected from CD3, CD2 (LFA2, 0X34), CD5, CD27 (TNFRSF7), CD28, CD30 (TNFRSF8), CD40L, CD84 (SLAMF5), CD137 (4-lBB), CD226, CD229 (Ly9, SLAMF3), CD244 (2B4, SLAMF4), CD319 (CRACC, BLAME), CD352 (Lyl08, NTBA, SLAMF6), CRTAM (CD355), DR3 (TNFRSF25), GITR (CD357), HVEM (CD270), ICOS, LIGHT, LTpR (TNFRSF3), OX40 (CD134), NKG2D, SLAM (CD150, SLAMF1), TCRa, TCRp, TCR5y, TIMl (HA VCR, KIM1), and combinations thereof.

In some embodiments, the polypeptide is an inhibitory receptor and is selected from PD-1

(CD279), 2B4 (CD244, SLAMF4), B71 (CD80), B7H1 (CD274, PD-L1), BTLA (CD272), CD160 (BY55, NK28), CD352 (Lyl08, NTBA, SLAMF6), CD358 (DR6), CTLA-4 (CD152),

LAG3, LAIR1, PD-1H (VISTA), TIGIT (VSIG9, VSTM3), TIM2 (TIMD2), TIM3 (HAVCR2,

KIM3), and combinations thereof.

Other exemplary proteins include, but are not limited to any protein described in Tables

1-10 of Leader et al., "Protein therapeutics: a summary and pharmacological classification", Nature Reviews Drug Discovery, 2008, 7:21-39 (incorporated herein by reference); or any conjugate, variant, analog, or functional fragment of the recombinant polypeptides described herein.

Other recombinant protein products include non-antibody scaffolds or alternative protein scaffolds, such as, but not limited to: DARPins, affibodies and adnectins. Such non-antibody scaffolds or alternative protein scaffolds can be engineered to recognize or bind to one or two, or more, e.g., 1, 2, 3, 4, or 5 or more, different targets or antigens. Also provided herein are nucleic acids, e.g., exogenous nucleic acids that encode the products, e.g., polypeptides, e.g., recombinant polypeptides described herein. The nucleic acid sequences coding for the desired recombinant polypeptides can be obtained using recombinant methods known in the art, such as, for example by screening libraries from cells expressing the desired nucleic acid sequence, e.g., gene, by deriving the nucleic acid sequence from a vector known to include the same, or by isolating directly from cells and tissues containing the same, using standard techniques. Alternatively, the nucleic acid encoding the recombinant polypeptide can be produced synthetically, rather than cloned. Recombinant DNA techniques and technology are highly advanced and well established in the art. Accordingly, the ordinarily skilled artisan having the knowledge of the amino acid sequence of a recombinant polypeptide described herein can readily envision or generate the nucleic acid sequence that would encode the recombinant polypeptide.

In some embodiments, the exogenous nucleic acid controls the expression of a product that is endogenously expressed by the host cell. In such embodiments, the exogenous nucleic acid comprises one or more nucleic acid sequences that increase the expression of the endogenous product (also referred to herein as "endogenous product transactivation sequence"). For example, the nucleic acid sequence that increases the expression of an endogenous product comprises a constitutively active promoter or a promoter that is stronger, e.g., increases transcription at the desired site, e.g., increases expression of the desired endogenous gene product. After introduction of the exogenous nucleic acid comprising the endogenous product transactivation sequence, said exogenous nucleic acid is integrated into the chromosomal genome of the cell, e.g., at a preselected location proximal to the genomic sequence encoding the endogenous product, such that the endogenous product transactivation sequence increases the transactivation or expression of the desired endogenous product. Other methods for modifying a cell, e.g., introducing an exogenous nucleic acid, for increasing expression of an endogenous product is described, e.g., in U.S. Patent No. 5,272,071; hereby incorporated by reference in its entirety.

The expression of a product described herein is typically achieved by operably linking a nucleic acid encoding the recombinant polypeptide or portions thereof to a promoter, and incorporating the construct into an expression vector. The vectors can be suitable for replication and integration eukaryotes or prokaryotes. Typical cloning vectors contain other regulatory elements, such as transcription and translation terminators, initiation sequences, and promoters useful for regulation of the expression of the desired nucleic acid sequence.

The nucleic acid sequences described herein encoding a product, e.g., a recombinant polypeptide, or comprising a nucleic acid sequence that can control the expression of an endogenous product, can be cloned into a number of types of vectors. For example, the nucleic acid can be cloned into a vector including, but not limited to a plasmid, a phagemid, a phage derivative, an animal virus, and a cosmid. Vectors of particular interest include expression vectors, replication vectors, probe generation vectors, and sequencing vectors. In embodiments, the expression vector may be provided to a cell in the form of a viral vector. Viral vector technology is well known in the art and is described, for example, in Sambrook et al., 2012, MOLECULAR CLONING: A LABORATORY MANUAL, volumes 1 -4, Cold Spring Harbor Press, NY), and in other virology and molecular biology manuals. Viruses, which are useful as vectors include, but are not limited to, retroviruses, adenoviruses, adeno- associated viruses, herpes viruses, and lentiviruses. In general, a suitable vector contains an origin of replication functional in at least one organism, a promoter sequence, convenient restriction endonuclease sites, and one or more selectable markers, (e.g., WO 01/96584; WO 01/29058; and U.S. Pat. No. 6,326,193). Vectors derived from viruses are suitable tools to achieve long-term gene transfer since they allow long-term, stable integration of a transgene and its propagation in daughter cells.

A vector may also include, e.g., a signal sequence to facilitate secretion, a

polyadenylation signal and transcription terminator (e.g., from Bovine Growth Hormone (BGH) gene), an element allowing episomal replication and replication in prokaryotes (e.g. SV40 origin and ColEl or others known in the art) and/or elements to allow selection, e.g., a selection marker or a reporter gene.

In one embodiment, the vector comprising a nucleic acid sequence encoding a

polypeptide, e.g., a recombinant polypeptide, further comprises a promoter sequence responsible for the recruitment of polymerase to enable transcription initiation for expression of the polypeptide, e.g., the recombinant polypeptide. In one embodiment, promoter sequences suitable for the methods described herein are usually associated with enhancers to drive high amounts of transcription and hence deliver large copies of the target exogenous mRNA. In an embodiment, the promoter comprises cytomegalovirus (CMV) major immediate early promoters (Xia, Bringmann et al. 2006) and the SV40 promoter (Chernajovsky, Mory et al. 1984), both derived from their namesake viruses or promoters derived therefrom. Several other less common viral promoters have been successfully employed to drive transcription upon inclusion in an expression vector including Rous Sarcoma virus long terminal repeat (RSV-LTR) and Moloney murine leukaemia virus (MoMLV) LTR (Papadakis, Nicklin et al. 2004). In another embodiment,specific endogenous mammalian promoters can be utilized to drive constitutive transcription of a gene of interest (Pontiller, Gross et al. 2008). The CHO specific Chinese Hamster elongation factor 1 -alpha (CHEFla) promoter has provided a high yielding alternative to viral based sequences (Deer, Allison 2004). In addition to promoters, the vectors described herein further comprise an enhancer region as described above; a specific nucleotide motif region, proximal to the core promoter, which can recruit transcription factors to upregulate the rate of transcription (Riethoven 2010). Similar to promoter sequences, these regions are often derived from viruses and are encompassed within the promoter sequence such as hCMV and SV40 enhancer sequences, or may be additionally included such as adenovirus derived sequences (Gaillet, Gilbert et al. 2007).

In one embodiment, the vector comprising a nucleic acid sequence encoding a product, e.g., a polypeptide, e.g, a recombinant polypeptide, described herein further comprises a nucleic acid sequence that encodes a selection marker. In one embodiment, the selectable marker comprises glutamine synthetase (GS); dihydrofolate reductase (DHFR) e.g., an enzyme which confers resistance to methotrexate (MTX); or an antibiotic marker, e.g., an enzyme that confers resistance to an antibiotic such as: hygromycin, neomycin (G418), zeocin, puromycin, or blasticidin. In another embodiment, the selection marker comprises or is compatible with the Selexis selection system (e.g., SUREtechnology Platform™ and Selexis Genetic Elements™, commercially available from Selexis SA) or the Catalant selection system.

In one embodiment, the vector comprising a nucleic acid sequence encoding a recombinant product described herein comprises a selection marker that is useful in identifying a cell or cells comprise the nucleic acid encoding a recombinant product described herein. In another embodiment, the selection marker is useful in identifying a cell or cells that comprise the integration of the nucleic acid sequence encoding the recombinant product into the genome, as described herein. The identification of a cell or cells that have integrated the nucleic acid sequence encoding the recombinant protein can be useful for the selection and engineering of a cell or cell line that stably expresses the product.

Suitable vectors for use are commercially available, and include vectors associated with the GS Expression System™, GS Xceed™ Gene Expression System, or Potelligent® CHOK1SV technology available from Lonza Biologies, Inc, e.g., vectors as described in Fan et al., Pharm. Bioprocess. (2013); l(5):487-502, which is incorporated herein by reference in its entirety. GS expression vectors comprise the GS gene, or a functional fragment thereof (e.g., a GS mini- gene), and one or more, e.g., 1, 2, or 3, or more, highly efficient transcription cassettes for expression of the gene of interest, e.g., a nucleic acid encoding a recombinant polypeptide described herein. A GS mini-gene comprises, e.g., consists of, intron 6 of the genomic CHO GS gene. In one embodiment, a GS vector comprises a GS gene operably linked to a SV40L promoter and one or two polyA signals. In another embodiment, a GS vector comprises a GS gene operably linked to a SV40E promoter, SV40 splicing and polyadenylation signals. In such embodiments, the transcription cassette, e.g., for expression of the gene of interest or recombinant polypeptide described herein, includes the hCMV-MIE promoter and 5' untranslated sequences from the hCMV-MIE gene including the first intron. Other vectors can be constructed based on GS expression vectors, e.g., wherein other selection markers are substituted for the GS gene in the expression vectors described herein.

Vectors suitable for use in the methods described herein include, but are not limited to, other commercially available vectors, such as, pcDNA3.1/Zeo, pcDNA3.1/CAT,

pcDNA3.3TOPO (Thermo Fisher, previously Invitrogen); pTarget, HaloTag (Promega); pUC57 (GenScript); pFLAG-CMV (Sigma- Aldrich); pCMV6 (Origene); pEE12 or pEE14 (Lonza Biologies), or pBK-CMV/ pCMV-3Tag-7/ pCMV-Tag2B (Stratagene).

CELLS AND CELL CULTURE

In embodiments, the cell is a mammalian cell. In other embodiments, the cell is a cell other than a mammalian cell. In an embodiment, the cell is a mouse, rat, Chinese hamster, Syrian hamster, monkey, ape, dog, horse, ferret, or cat. In embodiments, the cell is a mammalian cell, e.g., a human cell or a rodent cell, e.g., a hamster cell, a mouse cell, or a rat cell. In another embodiment, the cell is from a duck, parrot, fish, insect, plant, fungus, or yeast. In one embodiment, the cell is an Archaebacteria. In an embodiment, the cell is a species of

Actinobacteria, e.g., Mycobacterium tuberculosis). In one embodiment, the cell is a Chinese hamster ovary (CHO) cell. In one embodiment, the cell is a CHO-K1 cell, a CHO-K1 SV cell, a DG44 CHO cell, a DUXB 11 CHO cell, a CHOS, a CHO GS knock-out cell, a CHO FUT8 GS knock-out cell, a CHOZN, or a CHO- derived cell. The CHO GS knock-out cell (e.g., GSKO cell) is, for example, a CHO-KISV GS knockout cell (Lonza Biologies, Inc.). The CHO FUT8 knockout cell is, for example, the Potelligent® CHOK1 SV (Lonza Biologies, Inc.).

In another embodiment, the cell is a Hela, HEK293, HT1080, H9, HepG2, MCF7, Jurkat, NIH3T3, PC12, PER.C6, BHK (baby hamster kidney cell), VERO, SP2/0, NSO, YB2/0, Y0, EB66, C127, L cell, COS, e.g., COS 1 and COS7, QCl-3, CHOK1, CHOK1SV, Potelligent CHOK1SV, CHO GS knockout, CHOK1SV GS-KO, CHOS, CHO DG44, CHO DXB 11, and CHOZN, or any cells derived therefrom. In one embodiment, the cell is a stem cell. In one embodiment, the cell is a differentiated form of any of the cells described herein. In one embodiment, the cell is a cell derived from any primary cell in culture.

In an embodiment, the cell is any one of the cells described herein that comprises an exogenous nucleic acid encoding a recombinant polypeptide, e.g., expresses a recombinant polypeptide, e.g., a recombinant polypeptide selected from Table 1 or 2.

Large scale production

The devices, facilities and methods described herein are suitable for culturing any desired cell line including prokaryotic and/or eukaryotic cell lines. Further, in embodiments, the devices, facilities and methods are suitable for culturing suspension cells or anchorage-dependent (adherent) cells and are suitable for production operations configured for production of pharmaceutical and biopharmaceutical products— such as polypeptide products, nucleic acid products (for example DNA or RNA), or cells and/or viruses such as those used in cellular and/or viral therapies.

In embodiments, the cells express or produce a product, such as a recombinant therapeutic or diagnostic product. Examples of products produced by cells include, but are not limited to, antibody molecules (e.g., monoclonal antibodies, bispecific antibodies), antibody mimetics (polypeptide molecules that bind specifically to antigens but that are not structurally related to antibodies such as e.g. DARPins, affibodies, adnectins, or IgNARs), fusion proteins (e.g., Fc fusion proteins, chimeric cytokines), other recombinant proteins (e.g., glycosylated proteins, enzymes, hormones), viral therapeutics (e.g., anti-cancer oncolytic viruses, viral vectors for gene therapy and viral immunotherapy), cell therapeutics (e.g., pluripotent stem cells, mesenchymal stem cells and adult stem cells), vaccines or lipid-encapsulated particles (e.g., exosomes, virus-like particles), RNA (such as e.g. siRNA) or DNA (such as e.g. plasmid DNA), antibiotics or amino acids. In embodiments, the devices, facilities and methods can be used for producing bio similars.

As mentioned, in embodiments, devices, facilities and methods allow for the production of eukaryotic cells, e.g., mammalian cells or lower eukaryotic cells such as for example yeast cells or filamentous fungi cells, or prokaryotic cells such as Gram-positive or Gram-negative cells and/or products of the eukaryotic or prokaryotic cells, e.g., proteins, peptides, antibiotics, amino acids, nucleic acids (such as DNA or RNA), synthesised by the eukaryotic cells in a large- scale manner. Unless stated otherwise herein, the devices, facilities, and methods can include any desired volume or production capacity including but not limited to bench-scale, pilot-scale, and full production scale capacities.

In embodiments, devices and methods allow for the production of ceils and products of the cells, especially proteins, peptides (discussed in detail above), antibiotics or amino acids, synthesized by cells, e.g., mammalian cells, in a large-scale manner.

A wide array of flasks, bottles, reactors, and controllers allow the production and scale up of cell culture systems. The system can be chosen based, at least in part, upon its correlation with a desired glycan property or properties. Cells can be grown, for example, as batch, fed-batch, perfusion, or continuous cultures. Production parameters that can be selected include, e.g., addition or removal of media including when (early, middle or late during culture time) and how often media is harvested; increasing or decreasing speed at which cell cultures are agitated; increasing or decreasing temperature at which cells are cultured; adding or removing media such that culture density is adjusted; selecting a time at which cell cultures are started or stopped; and selecting a time at which cell culture parameters are changed. Such parameters can be selected for any of the batch, fed-batch, perfusion and continuous culture conditions.

In embodiments, the cultivated cells for large scale production are eukaryotic cells, e.g., animal cells, e.g., mammalian cells. The mammalian cells can be, for example, human cell lines, mouse myeloma (NSC))- cell lines, Chinese hamster ovary (CHO)-cell lines or hybri-doma- cell lines. Preferably the mammalian cells are CHO-cell lines. In embodiments, the cultivated cells for large scale production re used to produce antibodies discussed in detail above, e.g., monoclonal antibodies, and/or recombinant proteins, e.g., recombinant proteins for therapeutic use. In embodiments, the cells produce peptides, amino acids, fatty acids or other useful biochemical intermediates or metabolites.

In embodiments, the cells for large scale production are eukaryotic cells, biochemical markers, recombinant peptides or nucleotide sequences of interest, proteins, yeast, insect cells, stable or viral infected, avian cells or mammalian cells such as CHO cells, monkey cells, lytic products and the like for medical, research or commercial purposes.

In embodiments, the cells for large scale production are prokaryotic cells, strains of Gram-positive cells such as Bacillus and Streptomyces. In embodiments, the host cell is of phylum Firmicutes, e.g., the host cell is Bacillus. BSacillus that can be used are, e.g. the strains B.subtilis, B.amyloliquefaciens, B.licheniformis, B.natto, B.megaterium, etc. In embodiments, the host cell is B.subtilis, such as B.subtilis 3NA and B.subtilis 168. Bacillus is obtainable from, e.g., the Bacillus Genetic Stock Center , Biological Sciences 556, 484 West 12 ^th Avenue, Columbus OH 43210-1214.

In embodiments, the prokaryotic cells for large scale production are Gram negative cells, such as Salmonella spp. or E.coli, e.g., the strains TGI, W3110, DH1, XLl-Blue and Origami, which are commercially available.

Suitable host cells are commercially available, for example, from culture collections such as the DSMZ (Deutsche Sammlung von Mikroorganismen and Zellkulturen GmbH,

Braunschweig, Germany).

In an embodiment, the cell culture is carried out as a batch culture, fed-batch culture, draw and fill culture, or a continuous culture. In an embodiment, the cell culture is a suspension culture. In one embodiment, the cell or cell culture is placed in vivo for expression of the recombinant polypeptide, e.g., placed in a model organism or a human subject.

In one embodiment, the culture media is free of serum. Serum-free and protein-free media are commercially available, e.g., Lonza Biologies.

Suitable media and culture methods for mammalian cell lines are well-known in the art, as described in U.S. Pat. No. 5,633,162, for instance. Examples of standard cell culture media for laboratory flask or low density cell culture and being adapted to the needs of particular cell types are for instance: Roswell Park Memorial Institute (RPMI) 1640 medium (Morre, G., The Journal of the American Medical Association, 199, p. 519 f. 1967), L-15 medium (Leibovitz, A. et al., Amer. J. of Hygiene, 78, lp. 173 ff, 1963), Dulbecco's modified Eagle's medium (DMEM), Eagle's minimal essential medium (MEM), Ham's F12 medium (Ham, R. et al., Proc. Natl. Acad. Sc.53, p288 ff. 1965) or Iscoves' modified DMEM lacking albumin, transferrin and lecithin (Iscoves et al., J. Exp. med. 1, p. 923 ff., 1978). For instance, Ham's F10 or F12 media were specially designed for CHO cell culture. Other media specially adapted to CHO cell culture are described in EP-481 791. It is known that such culture media can be supplemented with fetal bovine serum (FBS, also called fetal calf serum FCS), the latter providing a natural source of a plethora of hormones and growth factors. The cell culture of mammalian cells is nowadays a routine operation well-described in scientific textbooks and manuals, it is covered in detail e.g. in R. Ian Fresney, Culture of Animal cells, a manual, 4 ^th edition, Wiley-Liss/N.Y., 2000.

Other suitable cultivation methods are known to the skilled artisan and may depend upon the recombinant polypeptide product and the host cell utilized. It is within the skill of an ordinarily skilled artisan to determine or optimize conditions suitable for the expression and production of the recombinant polypeptide to be expressed by the cell.

In one aspect, the cell or cell line for large scale production comprises an exogenous nucleic acid that encodes a product, e.g., a recombinant polypeptide. In an embodiment, the cell or cell line expresses the product, e.g., a therapeutic or diagnostic product. Methods for genetically modifying or engineering a cell to express a desired polypeptide or protein are well known in the art, and include, for example, transfection, transduction (e.g., viral transduction), or electroporation.

Physical methods for introducing a nucleic acid, e.g., an exogenous nucleic acid or vector described herein, into a host cell include calcium phosphate precipitation, lipofection, particle bombardment, microinjection, electroporation, and the like. Methods for producing cells comprising vectors and/or exogenous nucleic acids are well-known in the art. See, for example, Sambrook et al., 2012, MOLECULAR CLONING: A LABORATORY MANUAL, volumes 1 - 4, Cold Spring Harbor Press, NY).

Chemical means for introducing a nucleic acid, e.g., an exogenous nucleic acid or vector described herein, into a host cell include colloidal dispersion systems, such as macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes. An exemplary colloidal system for use as a delivery vehicle in vitro and in vivo is a liposome (e.g., an artificial membrane vesicle). Other methods of state-of-the-art targeted delivery of nucleic acids are available, such as delivery of polynucleotides with targeted nanoparticles or other suitable sub-micron sized delivery system.

In embodiments, the integration of the exogenous nucleic acid into a nucleic acid of the host cell, e.g., the genome or chromosomal nucleic acid of the host cell is desired. Methods for determining whether integration of an exogenous nucleic acid into the genome of the host cell has occurred can include a GS/MSX selection method. The GS/MSX selection method uses complementation of a glutamine auxotrophy by a recombinant GS gene to select for high-level expression of proteins from cells. Briefly, the GS/MSX selection method comprises inclusion of a nucleic acid encoding glutamine synthetase on the vector comprising the exogenous nucleic acid encoding the recombinant polypeptide product. Administration of methionine sulfoximine (MSX) selects cells that have stably integrated into the genome the exogenous nucleic acid encoding both the recombinant polypeptide and GS. As GS can be endogenously expressed by some host cells, e.g., CHO cells, the concentration and duration of selection with MSX can be optimized to identify high producing cells with stable integration of the exogenous nucleic acid encoding the recombinant polypeptide product into the host genome. The GS selection and systems thereof is further described in Fan et al., Pharm. Bioprocess. (2013); l(5):487-502, which is incorporated herein by reference in its entirety.

Other methods for identifying and selecting cells that have stably integrated the exogenous nucleic acid into the host cell genome can include, but are not limited to, inclusion of a reporter gene on the exogenous nucleic acid and assessment of the presence of the reporter gene in the cell, and PCR analysis and detection of the exogenous nucleic acid.

In one embodiment, the cells selected, identified, or generated using the methods described herein are capable of producing higher yields of protein product than cells that are selected using only a selection method for the stable expression, e.g., integration of exogenous nucleic acid encoding the recombinant polypeptide. In an embodiment, the cells selected, identified, or generated using the methods described herein produce 2-fold, 3-fold, 4-fold, 5-fold, 6-fold, 7- fold, 8-fold, 9-fold, or 10-fold or more of the product, e.g., recombinant polypeptide, as compared to cells that were not contacted with an inhibitor of protein degradation, or cells that were only selected for stable expression, e.g., integration, of the exogenous nucleic acid encoding the recombinant polypeptide. METHODS FOR CELL LINE AND RECOMBINANT POLYPEPTIDE PRODUCTION

The current state of the art in both mammalian and microbial selection systems is to apply selective pressure at the level of the transcription of DNA into RNA. The gene of interest is tightly linked to the selection marker making a high level of expression of the selective marker likely to result in the high expression of the gene of interest. Cells which express the selection marker at high levels are able to survive and proliferate, those which do not are less likely to survive and proliferate, e.g., apoptose and/or die. In this way a population of cells can be enriched for cells expressing the selection marker and by implication the gene of interest at high levels. This method has proved very successful for expressing straightforward proteins. In embodiments, the process described herein provides a substantially pure protein product. As used herein, "substantially pure" is meant substantially free of pyrogenic materials, substantially free of nucleic acids, and/or substantially free of endogenous cellular proteins enzymes and components from the host cell, such as polymerases, ribosomal proteins, and chaperone proteins. A substantially pure protein product contains, for example, less than 25%, 20%, 15%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1% of contaminating endogenous protein, nucleic acid, or other macromolecule from the host cell.

Methods for recovering and purification of a product, e.g., a recombinant polypeptide, are well established in the art. For recovering the recombinant polypeptide product, a physical or chemical or physical-chemical method is used. The physical or chemical or physical-chemical method can be a filtering method, a centrifugation method, an ultracentrifugation method, an extraction method, a lyophilization method, a precipitation method, a crystallization method, a chromatography method or a combination of two or more methods thereof. In an embodiment, the chromatography method comprises one or more of size-exclusion chromatography (or gel filtration), ion exchange chromatography, e.g., anion or cation exchange chromatography, affinity chromatography, hydrophobic interaction chromatography, and/or multimodal chromatography.

The devices, facilities and methods described herein are suitable for culturing any desired cell including prokaryotic cells and/or eukaryotic cells. The methods can be performed in, e.g., a reactor, e.g., a bioreactor. Further, in embodiments, the devices, facilities and methods are suitable for culturing suspension cells or anchorage-dependent (adherent) cells and are suitable for production operations configured for production of molecular products— such as polypeptide products - or cells and/or viruses such as those used in cellular and/or viral therapies.

In embodiments, the cells express or produce a product, such as a recombinant therapeutic or diagnostic product. As described in more detail below, examples of products produced by cells include, but are not limited to, antibody molecules (e.g., monoclonal antibodies, bispecific antibodies), fusion proteins (e.g., Fc fusion proteins, chimeric cytokines), other recombinant proteins (e.g., glycosylated proteins, enzymes, hormones), or lipid- encapsulated particles (e.g., exosomes, virus-like particles). In embodiments, the devices, facilities and methods can be used for producing biosimilars.

In embodiments, devices, facilities and methods allow for the production of eukaryotic cells, e.g., mammalian cells, and/or products of the eukaryotic cells, e.g., proteins, peptides, antibiotics or amino acids, synthesized by the eukaryotic cells in a large-scale manner. Unless stated otherwise herein, the devices, facilities, and methods can include any desired volume or production capacity including but not limited to bench-scale, pilot-scale, and full production scale capacities.

Moreover and unless stated otherwise herein, the devices, facilities, and methods can include any suitable reactor(s) including but not limited to stirred tank, airlift, fiber, microfiber, hollow fiber, ceramic matrix, fluidized bed, fixed bed, spouted bed, and/or stirred tank

bioreactors. For example, in some aspects, an example bioreactor unit can perform one or more, or all, of the following: feeding of nutrients and/or carbon sources, injection of suitable gas (e.g., oxygen), flow of fermentation or cell culture medium, separation of gas and liquid phases, maintenance of temperature, maintenance of pH level, agitation (e.g., stirring), and/or cleaning/sterilizing. Example reactor units, such as a fermentation unit, may contain 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, or 100, or more bioreactors. In various embodiments, the bioreactor can be suitable for batch, semi fed-batch, fed-batch, perfusion, and/or continuous fermentation processes. Any suitable reactor diameter can be used. In embodiments, the bioreactor can have a volume between about 100 mL and about 50,000 L. Non-limiting examples include a volume of 100 mL, 250 mL, 500 mL, 750 mL, 1 liter, 2 liters, 3 liters, 4 liters, 5 liters, 6 liters, 7 liters, 8 liters, 9 liters, 10 liters, 15 liters, 20 liters, 25 liters, 30 liters, 40 liters, 50 liters, 60 liters, 70 liters, 80 liters, 90 liters, 100 liters, 150 liters, 200 liters, 250 liters, 300 liters, 350 liters, 400 liters, 450 liters, 500 liters, 550 liters, 600 liters, 650 liters, 700 liters, 750 liters, 800 liters, 850 liters, 900 liters, 950 liters, 1000 liters, 1500 liters, 2000 liters, 2500 liters, 3000 liters, 3500 liters, 4000 liters, 4500 liters, 5000 liters, 6000 liters, 7000 liters, 8000 liters, 9000 liters, 10,000 liters, 15,000 liters, 20,000 liters, and/or 50,000 liters. Additionally, suitable reactors can be multi-use, single-use, disposable, or non-disposable and can be formed of any suitable material including metal alloys such as stainless steel (e.g., 316L or any other suitable stainless steel) and Inconel, plastics, and/or glass. In some embodiments, suitable reactors can be round, e.g., cylindrical. In some embodiments, suitable reactors can be square, e.g., rectangular. Square reactors may in some cases provide benefits over round reactors such as ease of use (e.g., loading and setup by skilled persons), greater mixing and homogeneity of reactor contents, and lower floor footprint.

In embodiments and unless stated otherwise herein, the devices, facilities, and methods described herein can also include any suitable unit operation and/or equipment not otherwise mentioned, such as operations and/or equipment for separation, purification, and isolation of such products. Any suitable facility and environment can be used, such as traditional stick-built facilities, modular facilities, or any other suitable construction, facility, and/or layout. For example, in some embodiments modular clean-rooms can be used. Additionally and unless otherwise stated, the devices, systems, and methods described herein can be housed and/or performed in a single location or facility or alternatively be housed and/or performed at separate or multiple locations and/or facilities.

By way of non-limiting examples and without limitation, U.S. Publication Nos.

2013/0280797; 2012/0077429; 2011/0280797; 2009/0305626; and U.S. Patent Nos. 8,298,054; 7,629,167; and 5,656,491, which are hereby incorporated by reference in their entirety, describe example facilities, equipment, and/or systems that may be suitable.

In embodiments, the cells are eukaryotic cells, e.g., mammalian cells. The mammalian cells can be for example human or rodent or bovine cell lines or cell strains. Examples of such cells, cell lines or cell strains are e.g. mouse myeloma (NSO)-cell lines, Chinese hamster ovary (CHO)-cell lines, HT1080, H9, HepG2, MCF7, MDBK Jurkat, NIH3T3, PC12, BHK (baby hamster kidney cell), VERO, SP2/0, YB2/0, Y0, C127, L cell, COS, e.g., COS 1 and COS7, QCl-3,HEK-293, VERO, PER.C6, HeLA, EB1, EB2, EB3, oncolytic or hybridoma-cell lines. Preferably the mammalian cells are CHO-cell lines. In one embodiment, the cell is a CHO cell. In one embodiment, the cell is a CHO-K1 cell, a CHO-K1 SV cell, a DG44 CHO cell, a DUXB l l CHO cell, a CHOS, a CHO GS knock-out cell, a CHO FUT8 GS knock-out cell, a CHOZN, or a CHO-derived cell. The CHO GS knock-out cell (e.g., GSKO cell) is, for example, a CHO-Kl SV GS knockout cell. The CHO FUT8 knockout cell is, for example, the

Potelligent® CHOK1 SV (Lonza Biologies, Inc.). Eukaryotic cells can also be avian cells, cell lines or cell strains, such as for example, EBx® cells, EB 14, EB24, EB26, EB66, or EBvl3.

In one embodiment, the eukaryotic cells are stem cells. The stem cells can be, for example, pluripotent stem cells, including embryonic stem cells (ESCs), adult stem cells, induced pluripotent stem cells (iPSCs), tissue specific stem cells (e.g., hematopoietic stem cells) and mesenchymal stem cells (MSCs).

In embodiments, the cultivated cells are eukaryotic cells, e.g., mammalian cells. The mammalian cells can be for example human cell lines, mouse myeloma (NSO)- cell lines, Chinese hamster ovary (CHO)-cell lines or hybridoma-cell lines. Preferably the mammalian cells are CHO-cell lines. In one embodiment, the cell is a CHO cell. In one embodiment, the cell is a CHO-Kl cell, a CHO-Kl SV cell, a DG44 CHO cell, a DUXB l l CHO cell, a CHOS, a CHO GS knock-out cell, a CHO FUT8 GS knock-out cell, a CHOZN, or a CHO-derived cell. The CHO GS knock-out cell (e.g., GSKO cell) is, for example, a CHO-Kl SV GS knockout cell. The CHO FUT8 knockout cell is, for example, the Potelligent® CHOK1 SV (Lonza Biologies, Inc.).

In embodiments, the cell is a yeast cell (e.g., S. cerevisae, T. reesei), an insect cell (e.g., Sf9), an algae cell (e.g., cyanobacteria), or a plant cell (e.g., tobacco, alfalfa, Physcomitrella patens). In one embodiment, the cell is a rodent cell. In another embodiment, the cell is a HeLa, HEK293, HT1080, H9, HepG2, MCF7, Jurkat, NIH3T3, PC12, PER.C6, BHK (baby hamster kidney cell), VERO, SP2/0, NSO, YB2/0, Y0, EB66, C127, L cell, COS, e.g., COS 1 and COS7, QCl-3, CHO-Kl.

In embodiments, the cell is a stem cell. In one embodiment, the cell is a differentiated form of any of the cells described herein. In one embodiment, the cell is a cell derived from any primary cell in culture.

In embodiments, the cell is a hepatocyte such as a human hepatocyte, animal hepatocyte, or a non-parenchymal cell. For example, the cell can be a plateable metabolism qualified human hepatocyte, a plateable induction qualified human hepatocyte, plateable Qualyst Transporter Certified™ human hepatocyte, suspension qualified human hepatocyte (including 10-donor and 20-donor pooled hepatocytes), human hepatic kupffer cells, human hepatic stellate cells, dog hepatocytes (including single and pooled Beagle hepatocytes), mouse hepatocytes (including CD-I and C57BI/6 hepatocytes), rat hepatocytes (including Sprague-Dawley, Wistar Han, and Wistar hepatocytes), monkey hepatocytes (including Cynomolgus or Rhesus monkey

hepatocytes), cat hepatocytes (including Domestic Shorthair hepatocytes), and rabit hepatocytes (including New Zealand White hepatocytes). Example hepatocytes are commercially available from Triangle Research Labs, LLC, 6 Davis Drive Research Triangle Park, North Carolina, USA 27709.

In one embodiment, the eukaryotic cell is a lower eukaryotic cell such as e.g. a yeast cell (e.g., Pichia genus (e.g. Pichia pastoris, Pichia methanolica, Pichia kluyveri, and Pichia angusta), Komagataella genus (e.g. Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii), Saccharomyces genus (e.g. Saccharomyces cerevisae, cerevisiae, Saccharomyces kluyveri, Saccharomyces uvarum), Kluyveromyces genus (e.g. Kluyveromyces lactis,

Kluyveromyces marxianus), the Candida genus (e.g. Candida utilis, Candida cacaoi, Candida boidinii,), the Geotrichum genus (e.g. Geotrichum fermentans), Hansenula polymorpha,

Yarrowia lipolytica, or Schizosaccharomyces pombe, . Preferred is the species Pichia pastoris. Examples for Pichia pastoris strains are X33, GS 115, KM71, KM71H; and CBS7435.

In one embodiment, the eukaryotic cell is a fungal cell (e.g. Aspergillus (such as A. niger, A. fumigatus, A. orzyae, A. nidula), Acremonium (such as A. thermophilum), Chaetomium (such as C. thermophilum), Chrysosporium (such as C. thermophile), Cordyceps (such as C. militaris), Corynascus, Ctenomyces, Fusarium (such as F. oxysporum), Glomerella (such as G.

graminicola), Hypocrea (such as H. jecorina), Magnaporthe (such as M. orzyae), Myceliophthora (such as M. thermophile), Nectria (such as N. heamatococca), Neurospora (such as N. crassa), Penicillium, Sporotrichum (such as S. thermophile), Thielavia (such as T. terrestris, T.

heterothallica), Trichoderma (such as T. reesei), or Verticillium (such as V. dahlia)).

In one embodiment, the eukaryotic cell is an insect cell (e.g., Sf9, Mimic™ Sf9, Sf21,

High Five™ (BT1-TN-5B 1-4), or BT1-Ea88 cells), an algae cell (e.g., of the genus Amphora, Bacillariophyceae, Dunaliella, Chlorella, Chlamydomonas, Cyanophyta (cyanobacteria), Nannochloropsis, Spirulina,or Ochromonas), or a plant cell (e.g., cells from monocotyledonous plants (e.g., maize, rice, wheat, or Setaria), or from a dicotyledonous plants (e.g., cassava, potato, soybean, tomato, tobacco, alfalfa, Physcomitrella patens or Arabidopsis).

In one embodiment, the cell is a bacterial or prokaryotic cell. In embodiments, the prokaryotic cell is a Gram-positive cells such as Bacillus,

Streptomyces Streptococcus, Staphylococcus or Lactobacillus. Bacillus that can be used is, e.g. the B.subtilis, B.amyloliquefaciens, B.licheniformis, B.natto, or B.megaterium. In embodiments, the cell is B.subtilis, such as B.subtilis 3NA and B.subtilis 168. Bacillus is obtainable from, e.g., the Bacillus Genetic Stock Center , Biological Sciences 556, 484 West 12 ^th Avenue, Columbus OH 43210-1214.

In one embodiment, the prokaryotic cell is a Gram- negative cell, such as Salmonella spp. or Escherichia coli, such as e.g., TGI, TG2, W3110, DH1, DHB4, DH5a, HMS 174, HMS 174 (DE3), NM533, C600, HB 101, JM109, MC4100, XLl-Blue and Origami, as well as those derived from E.coli B-strains, such as for example BL-21 or BL21 (DE3), all of which are commercially available.

Suitable host cells are commercially available, for example, from culture collections such as the DSMZ (Deutsche Sammlung von Mikroorganismen and Zellkulturen GmbH,

Braunschweig, Germany) or the American Type Culture Collection (ATCC).

In embodiments, the cultured cells are used to produce proteins e.g., antibodies, e.g., monoclonal antibodies, and/or recombinant proteins, for therapeutic use. In embodiments, the cultured cells produce peptides, amino acids, fatty acids or other useful biochemical

intermediates or metabolites. For example, in embodiments, molecules having a molecular weight of about 4000 daltons to greater than about 140,000 daltons can be produced. In embodiments, these molecules can have a range of complexity and can include posttranslational modifications including glycosylation.

The methods and process disclosed herein includes an immunogenicity calculation that is fully integrated into the development process, providing a number of advantages, including, but not limited to, (1) a method that allows immunogenicity to be performed for any production system where the genome is known, or for specific variants of the production system (e.g., GS CHO specifically as a subset of CHO), and (2) immunogenicity assessment to be performed for different patient populations (e.g. by geographic area or ethnicity). This is important as an overall average score for an HCP for the global population may mask a high score for a single particularly susceptible group. NUMBERED EMBODIMENTS

1. A simple method of rapidly analyzing a sample, e.g., to provide an assessment of the risk of a protein (e.g., the risk the protein presents if present as a contaminant in a preparation, e.g., a preparation to be administered to a subject, e.g., a pharmaceutical

preparation), the method comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the sample, which comprises the protein and optionally a product (e.g., a recombinant polypeptide, e.g., an antibody, enzyme, or cytokine), produced via a process and

ii) a denaturant, e.g., deoxycholate and urea,

under conditions, e.g., concentrations of denaturant, that denatures the protein in the sample at temperature of between 10 and 30 °C, e.g., 18-26 °C, e.g., 20+3°C, 20+2°C, 20+l°C, or 20°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture (e.g., an aliquot of sample mixture from (a)); and ii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products;

c) separating the protein digestion products using chromatography, e.g., 1 -dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products; and

d) assigning a protein risk score to a protein identified in the sample,

thereby analyzing the sample and providing an assessment of the risk of a protein. 2. A simple method of rapidly analyzing a sample to provide an assessment of the risk of a protein, the method comprising:

a) providing, e.g., forming and/or maintaining, a sample mixture comprising:

i) the sample, which comprises the protein (e.g., a HCP) and optionally a therapeutic product (e.g., a recombinant polypeptide), produced via a process; and

ii) a first denaturant, e.g., guanidine hydrochloride,

under conditions, e.g., concentrations of a first denaturant, that denatures the protein in the sample at temperature of between 30 and 60 °C, e.g., 45-55 °C, e.g., 50+3°C, 50+2°C, 50+1 °C, or 50°C;

b) providing, e.g., forming and or maintaining, a sample/enzyme mixture comprising: i) sample mixture;

ii) a second denaturant, e.g., urea; and

iii) an enzyme preparation comprising an enzyme for which the protein is a substrate, e.g., a proteolytic enzyme, e.g., an enzyme which cleaves proteins at a preselected or defined target site, e.g., trypsin, lysC, GluC, or AspN, with the sample mixture,

under conditions in which the enzyme maintains substantial activity and reacts with, e.g., cleaves, the protein to provide protein digestion products;

c) separating the protein digestion products using chromatography, e.g., 1 -dimensional chromatography, providing the identity of the protein digestion products, e.g., by mass spectrometry, e.g., LC/MS, and using one or more protein digestion products to provide the identity of a protein associated with the protein digestion products;

d) assigning a protein risk score to a protein identified in the sample,

thereby analyzing the sample and providing an assessment of the risk of a protein.

3. The method of either of paragraphs 1 or 2, further comprising evaluating a plurality of different samples, each made by a different process, e.g., evaluating at least 2, 4, 8, 10, 50, 96, 100, 192, 200, 500 or 1,000, different samples. 4. The method of paragraph 3, further comprising comparing the assessment of risk for a first and a second different sample.

5. The method of paragraph 4, further comprising, responsive to the comparison, selecting, a process for producing the product.

6. The method of paragraph 4, further comprising, responsive to the comparison, selecting, classifying, or further processing one of the samples.