Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
VECTOR FOR CO-EXPRESSING GENES OF INTEREST
Document Type and Number:
WIPO Patent Application WO/2019/215303
Kind Code:
A1
Abstract:
The present invention relates to an expression vector for the co-expression of different polynucleotides in microalgal cells and to the expression cassettes comprising both polynucleotides. The invention also relates to a host cell, a method for expressing the proteins of interest and a method for selecting a cell co-expressing two proteins of interest in a stoichiometric manner.

Inventors:
DURANY TURK OLGA (ES)
SEGURA DE YEBRA JORDI (ES)
MERCADÉ ROCA JAUME (ES)
LÓPEZ CERRO MARÍA TERESA (ES)
LÓPEZ PAZ CRISTINA (ES)
Application Number:
PCT/EP2019/061981
Publication Date:
November 14, 2019
Filing Date:
May 09, 2019
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
GREENALTECH S L (ES)
International Classes:
C12N15/82
Domestic Patent References:
WO2017129723A12017-08-03
Foreign References:
CN105316356A2016-02-10
CN104630255A2015-05-20
Other References:
MASAYUKI ONISHI ET AL: "Robust Transgene Expression from Bicistronic mRNA in the Green Alga Chlamydomonas reinhardtii", G3: GENES|GENOMES|GENETICS, vol. 6, no. 12, 21 October 2016 (2016-10-21), pages 4115 - 4125, XP055489434, DOI: 10.1534/g3.116.033035
BARAHIMIPOUR ROUHOLLAH ET AL: "Efficient expression of nuclear transgenes in the green algaChlamydomonas: synthesis of an HIV antigen and development of a new selectable marker", PLANT MOLECULAR BIOLOGY, SPRINGER, DORDRECHT, NL, vol. 90, no. 4, 8 January 2016 (2016-01-08), pages 403 - 418, XP036145533, ISSN: 0167-4412, [retrieved on 20160108], DOI: 10.1007/S11103-015-0425-8
CRISTINA LÓPEZ-PAZ ET AL: "Identification of Chlamydomonas reinhardtii endogenous genic flanking sequences for improved transgene expression", THE PLANT JOURNAL, vol. 92, no. 6, 18 November 2017 (2017-11-18), GB, pages 1232 - 1244, XP055489718, ISSN: 0960-7412, DOI: 10.1111/tpj.13731
YUBING LI ET AL: "A new class of cyclin dependent kinase in Chlamydomonas is required for coupling cell size to cell division", ELIFESCIENCES.ORG, 25 March 2016 (2016-03-25), pages 1 - 28, XP055591328
MARKUS HEITZER ET AL: "Construction of modular tandem expression vectors for the green alga Chlamydomonas reinhardtii using the Cre/lox-system", BIOTECHNIQUES, vol. 43, no. 3, 1 September 2007 (2007-09-01), pages 324 - 332, XP055104012, ISSN: 0736-6205, DOI: 10.2144/000112556
DONG BIN ET AL: "A novel bicistronic expression system composed of the intraflagellar transport protein geneift25and FMDV 2A sequence directs robust nuclear gene expression inChlamydomonas reinhardtii", APPLIED MICROBIOLOGY AND BIOTECHNOLOGY, SPRINGER, DE, vol. 101, no. 10, 25 February 2017 (2017-02-25), pages 4227 - 4245, XP036219441, ISSN: 0175-7598, [retrieved on 20170225], DOI: 10.1007/S00253-017-8177-9
BIN JIA ET AL: "A vector for multiple gene co-expression in Chlamydomonas reinhardtii", ALGAL RESEARCH, vol. 20, 1 December 2016 (2016-12-01), NL, pages 53 - 56, XP055489613, ISSN: 2211-9264, DOI: 10.1016/j.algal.2016.09.013
FRANCISCO J. ROMERO-CAMPERO ET AL: "ChlamyNET: a Chlamydomonas gene co-expression network reveals global properties of the transcriptome and the early setup of key co-expression patterns in the green lineage", BMC GENOMICS, vol. 17, no. 1, 12 March 2016 (2016-03-12), XP055591314, DOI: 10.1186/s12864-016-2564-y
YUICHI AOKI ET AL: "ALCOdb: Gene Coexpression Database for Microalgae", PLANT AND CELL PHYSIOLOGY, vol. 57, no. 1, 7 December 2015 (2015-12-07), UK, pages e3 - e3, XP055591316, ISSN: 0032-0781, DOI: 10.1093/pcp/pcv190
MAYFIELD STEPHEN P ET AL: "Expression and assembly of a fully active antibody in algae", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, NATIONAL ACADEMY OF SCIENCES, US, vol. 100, no. 2, 21 January 2003 (2003-01-21), pages 438 - 442, XP002440629, ISSN: 0027-8424, DOI: 10.1073/PNAS.0237108100
RASALA ET AL., PLOS ONE, vol. 7, 2012
LOPEZ-PAZ ET AL., PLANT J., vol. 92, 2017, pages 1232 - 1244
ONISHIPRINGLE, G3, vol. 6, 2016, pages 4115 - 4125
"Molecular cloning: a laboratory manual", 2001, COLD SPRING HARBOR
"Current protocols in molecular biology", vol. 2, September 2006, GREENE PUBLISHING ASSOCIATES AND WILEY INTERSCIENCE
FISCHER NROCHAIX JD, MOL GEN GENET, vol. 265, 2001, pages 888 - 894
KABAT, E.A. ET AL.: "The United States Department of Health and Human Services", 1991, NIH PUBLICATION, article "Sequences of Proteins of Immunological Interest"
HOTHIA, C. ET AL., J. MOL. BIOL., vol. 196, 1987, pages 901 - 917
K. WISING ET AL., ANN. REV. GENETICS, vol. 22, 1988, pages 421
JEFFERSON ET AL., BIOCHEM. SOC. TRANS., vol. 15, 1987, pages 17 - 19
GARRICK ET AL., NAT GENET., vol. 18, no. 1, 1998, pages 56 - 9
HEITZERZSCHOERNIG, BIOTECHNIQUES, vol. 43, 2007, pages 324 - 328
NEUPERT ET AL., THE PLANT JOURNAL, vol. 57, 2009, pages 1140 - 1150
Attorney, Agent or Firm:
ALCONADA RODRÍGUEZ, Agustín (ES)
Download PDF:
Claims:
CLAIMS

1- An expression vector for the co-expression of a first and second polynucleotides of interest in microalgal cells, wherein said first and second polynucleotides of interest are provided respectively in a first and second expression cassettes, each expression cassette comprising a 5’ cis regulatory region and a 3’ cis regulatory region operatively linked with the first and second polynucleotides and wherein both the 5’ and 3’ cis regulatory regions of the first expression cassette are different from the 5’ and 3’ cis regulatory regions of the second expression cassette, wherein the 5’ cis regulatory regions comprise a sequence selected from the group consisting of a HSP70A/ RBCS2 chimeric promoter, the RPL23 gene promoter, the FDX gene promoter and the 3’ cis regulatory regions comprise a sequence selected from the 3 'UTR of the RBCS2 gene, the 3 'UTR of the RPL23 gene and 3 'UTR of the FDX gene.

2- The vector according to claim 1 further comprising a third expression cassette comprising a 5’ and a 3’ cis regulatory regions wherein said 5’ and 3’ cis regulatory regions are each different to the 5’ and 3’ cis regulatory regions of the first and second expression cassette.

3-The vector according to any of claims 1 or 2 wherein the HSP70A/ RBCS2 chimeric promoter and the 3 'UTR of the RBCS2 are in one of the cassettes, the RPL23 gene promoter and the 3 'UTR of the RPL23 gene are in a different cassette, and the FDX gene promoter and the 3 'UTR of the FDX gene are in a cassette different from the other two.

4- The vector according to any of claims 2 to 3, wherein the third expression cassette comprises a selection gene. 5- The vector according to claim 4 wherein the selection gene is bleomycin resistance gene. 6- The vector according to any of claims 4 or 5 wherein the third expression cassette comprises the HSP70A/ RBCS2 chimeric promoter and the 3'UTR of the RBCS2 gene as cis regulatory regions.

7- The vector according to any of claims 1 to 6 wherein the 5’ and 3’ cis regulatory regions in the first expression cassette comprise respectively the RPL23 gene promoter and the 3 TJTR of the RPL23 gene and the 5’ and 3’ regulatory regions of the second expression cassette comprise respectively the FDX gene promoter and the 3 TJTR of the FDX gene.

8- The vector according to any of claims 1 to 7 wherein the polynucleotide of interest within at least one of the expression cassettes encodes a protein of interest and wherein said expression cassette comprises an intron sequence within the region encoding the protein of interest or the protein encoded by the selection gene and/or a nucleotide sequence encoding a signal peptide in the same open reading frame at position 5 ' respect to the nucleotide sequence encoding the protein of interest or the protein encoded by the selection gene.

9- The vector according to claim 8 wherein the signal peptide is a murine signal peptide.

10- The vector according to any of claims 1 to 9 wherein the first and second polynucleotides of interest encode two different subunits of a heterodimeric protein.

11 -The vector according to claim 7 wherein the first and second subunits are codon optimized for expression in microalgal cells.

12- The vector according to claim 11 wherein the first and second subunit are the heavy chain and the light chain of an antibody.

13- The vector according to claim 12 wherein the antibody is m5c3. 14- The vector according to claim 7 wherein the first expression cassette comprises a polynucleotide encoding the light chain of an antibody, the second expression cassette comprises polynucleotide encoding the heavy chain of the antibody and the third expression cassette comprises the bleomycin resistance gene, the HSP70A/ RBCS2 chimeric promoter and the 3'UTR of the RBCS2 gene as cis regulatory regions.

15- A host cell comprising a vector according to any of claims 1 to 14.

16- The host according to claim 15 wherein the host cell is a microalga.

17- The host cell according to claim 16 wherein the microalga is from genus Chlamydomonas.

18- The host cell according to claim 17 wherein the microalga is C. reinhardtii.

19- Use of a vector according to any of claims 1 to 14 or a host cell according to any of claims 15 to 18 for simultaneously co-expressing the first and second polynucleotides of interest.

20- A method for co-expressing two polynucleotides of interest which comprises growing a cell carrying a vector according to any of claims 1 to 14 in conditions suitable for allowing the expression of the polynucleotides of interest.

21- The method according to claim 20 wherein the cell comprises a second expression vector comprising the first or second expression cassette used in the vector according to any of claims 1 to 14.

22- The method according to claim 21 wherein the second expression vector comprises an expression cassette comprising a nucleotide sequence of a selection gene different from the selection gene used in the vector used for the expression of the first and second proteins of interest. 23- The method according to claim 22 wherein the selection gene found in the second vector is the paromomycin resistance gene.

24- A method for selecting a cell co-expressing in a stoichiometric manner two proteins of interest, comprising

a) transforming a cell with a vector according to any of claims 1 to 14 b) detecting the expression of the two proteins of interest, and

c) selecting a cell expressing the two proteins of interest in a stoichiometric manner.

25- The method according to claim 24, wherein the two proteins of interest are heavy chain and light chain of an antibody, and wherein step b) is performed by an antibody specific for Fc region and an antibody specific for F(ab')2 region.

Description:
VECTOR FOR CO-EXPRESSING GENES OF INTEREST

TECHNICAL FIELD OF THE INVENTION

The present invention relates to a method for improving co-expression of more than one polynucleotide or protein in microalgae.

BACKGROUND OF THE INVENTION

Several strategies may be employed in order to achieve multiple recombinant gene expression such as use of multiple vectors with different selection markers, a single vector with a polycistronic mR A or a single vector producing multiple R As. Multiple vectors results in very different expression patterns since random genome insertion results in transformants with different expression levels. Large screenings need to be done since probability of having both genes expressed at the same level is low and this is particularly true for Chlamydomonas where high frequency of transgene silencing is observed. The use of a single vector may involve having a duplicate strong promoter/UTR, but this strategy introduces the risk of silencing and recombination. Production of two separate proteins from dicistronic genes has been achieved with viral FMDV2A sequences (Rasala et al. 2012. PLoS One 7), however, the presence in some cases of not processed full-length proteins could become a limitation (Lopez-Paz et al. 2017. Plant J. 92:1232-1244). Bicistronic mRNAs containing a non-structured junction sequence between the two ORFs has also been described, but in this case the expression of downstream gene was lower than upstream gene (Onishi and Pringle 2016. G3 (Bethesda) 6:4115-4125). Therefore, when high levels of expression of a Gene of Interest (GOI) are desired, the use of a single promoter for each gene is probably the preferred option.

Co-expression of subunits is an important requirement in the study and production of protein complexes. When subunits of protein complex are expressed separately they may not be soluble, active or stable, therefore a system to co-express proteins in a stoichiometric manner is a highly desirable tool.

The complex nature of the mechanisms leading to protein co-expression needs designing new strategies that allow for successful expression, assembly and secretion of complex proteins. SUMMARY OF THE INVENTION

The authors of the present invention have developed a microalgae expression vector containing different promoter and regulatory regions to co-express different polynucleotides and/or proteins simultaneously from the same vector. In the case of the study and production of protein complexes, it is an important requirement that the different subunits of the complex are coexpressed at the same time and with similar efficiency. When subunits of protein complex are expressed separately they may not be soluble, active or stable, therefore the inventor's system to co-express proteins in a stoichiometric manner is a highly desirable tool.

In a first aspect, the invention relates to an expression vector for the co- expression of a first and second polynucleotides of interest in microalgal cells, wherein said first and second polynucleotides of interest are provided respectively in a first and second expression cassettes, each expression cassette comprising a 5’ cis regulatory region and a 3’ cis regulatory region operatively linked with the first and second polynucleotides and wherein both the 5’ and 3’ cis regulatory regions of the first expression cassette are different from the 5’ and 3’ cis regulatory regions of the second expression cassette, wherein the 5’ cis regulatory regions comprise a sequence selected from the group consisting of a HSP70A/ RBCS2 chimeric promoter, the RPL23 gene promoter and the FDX gene promoter and the 3’ cis regulatory regions comprise a sequence selected from the 3'UTR of the RBCS2 gene, the 3'UTR of the RPL23 gene and 3 'UTR of the FDX gene.

In a second aspect, the invention relates to a host cell comprising a vector according to the invention.

In a third aspect, the invention relate to the use of a vector according to the invention or a host cell according to the invention for simultaneously co-expressing the first and second polynucleotides of interest.

In a fourth aspect, the invention relates to a method for co-expressing two polynucleotides of interest which comprises growing a cell carrying a vector according to the invention in conditions suitable for allowing the expression of the polynucleotides of interest. In a fifth aspect, the invention relates to a method for selecting a cell co- expressing in a stoichiometric manner two proteins of interest, comprising

a) transforming a cell with a vector according to the invention

b) detecting the expression of the two proteins of interest, and

c) selecting a cell expressing the two proteins of interest in a stoichiometric manner.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1. Schematic representation of the vector used for the expression in Chlamydomonas of murine 5C3 monoclonal antibody (mAB). (A) FM-zeo r vector map. Schematics of (B) heavy chain (HC) and (C) light chain (LC). 5’cis regulatory regions (promoter+5’UTR) are represented as rectangles with arrowheads and 3'cis regulatory regions (3’UTR and flanking 3’regions, including terminators) are represented as rectangles. Indicated genic regions are AR: HSP70A-RBCS2 chimeric promoter (includes also 5’UTR and first intron of RBCS2 ); RPL23 : cis regulatory regions of ribosomal protein RPL23; FDX Ferredoxinl cis regulatory elements; CDRS: complementarity-determining regions; ss: signal sequence, 3’T: 3’UTR + terminator), ir: intron 1 RBCS2.

Figure 2. Sandwich ELISA results of positive selected Chlamydomonas clones expressing m5C3 monoclonal antibody. Results of initially identified (A) CC-124 and (B) UVM4 positive transformants are expressed as RLU (Relative Luminiscence Units). Mean background levels is represented as a thick line.

Figure 3. Immunoblot evaluation of mAh expressed by CC-124 and UVM4 selected transformants. (A) 12 mΐ (1700c concentrated clarified media from 19 A3 transformant and different amounts of a mouse monoclonal antibody (C+) were run under reducing conditions to detect heavy and light chains. Two different exposure times are shown. (B) 28 mΐ of 80x concentrated culture from different UVM4 positive transformants was loaded and ran under reducing conditions and (C) non reducing conditions (Left panel). A more accurate expression of both Light (C, middle panel) and heavy chain (C, right panel) was performed with clone UVM4-2. Purified m5C3 (produced in mammalian cells) was used as a control- (HC+LC)2: complete mAh. HC: heavy chain. LC: light chain. For detection of light chain a primary antibody anti-mouse IgG, f(ab’)2 fragment specific was used. For detection of heavy chain an anti-mouse IgG, was used.

Figure 4. HC-paro R vector map. AR: HSP70-RBCS2 chimeric promoter (including 5’UTR and intron of RBCS2), IR: intron 1 of RBCS2. FDX: ferredoxin 1 cis regulatory elements. Aph8: paromomycin resistant gene.

Figure 5. Analysis of antibody expression and assembly of positive identified retransformants. (A) Confirmation by Sandwich ELISA of mAh increased expression level of the six selected 19A3 transformants. Cells were grown to late log phase and assay was performed only with culture media. Standard deviations of five technical replicates are presented as error bars. (B) Immunoblot of 19A3 clone and selected retransformantS (19A3-1 and 19A3-6) under reducing (left and middle pannel) and non reducing conditions (right pannel). Non transformed wild type control (CC-124) and m5C3 purified antibody are included as a positive and negative controls respectively. * indicates non specific binding.

Figure 6. Quantification of secreted and intracellular mAh concentration in selected clones. (A) Cells were grown to late log phase of growth and Sandwich ELISA assay was performed with cell extracts (white) or culture media (black) (A). Results are expressed in RLU. Standard deviations of three technical replicates are presented as error bars. (B) A standard curve with purified m5C3 was included to estimate expression in transformants.

Figure 7. Time course of mAh expression and assembly during exponential and stationary phases. (A) Sandwich ELISA of selected clones (19A3-1, 19A3-6, UVM4-2) was performed to quantify extracellular (top panel) and intracellular concentration (lower panel) of mAh at different stages of cell culture, time shown in hours (h). (B) Inmmnoblot of 19A3-1, 19A-6 and UVM4 -2 selected transformants at different culture stages under non-reducing (Bl) or reducing conditions (B2). Primary antibody anti mouse IgG, F(ab’)2 fragment specific. 12 mΐ of 80X concentrated culture media were loaded in each lane. A low exposure (top) and high exposure (bottom) is shown for each immunoblot. (HC+LC) 2 : complete mAh. HC: heavy chain. LC: light chain. (C) Growth curves obtained by OD750 nm monitoring every 24 h.

DETAILED DESCRIPTION OF THE INVENTION The inventors have developed a microalgae expression vector containing different regulatory regions to co-express different proteins simultaneously from the same vector.

Expression vector

In a first aspect the invention relates to an expression vector for the co- expression of a first and second polynucleotides of interest in microalgal cells, wherein said first and second polynucleotides of interest are provided respectively in a first and second expression cassettes, each expression cassette comprising a 5’ cis regulatory region and a 3’ cis regulatory region operatively linked with the first and second polynucleotides and wherein both the 5’ and 3’ cis regulatory regions of the first expression cassette are different from the 5’ and 3’ cis regulatory regions of the second expression cassette, wherein the 5’ cis regulatory regions comprise a sequence selected from the group consisting of a HSP70A/ RBCS2 chimeric promoter, the RPL23 gene promoter and the FDX gene promoter and the 3’ cis regulatory regions comprise a sequence selected from the 3'UTR of the RBCS2 gene, the 3'UTR of the RPL23 gene and 3 'UTR of the FDX gene.

As it is used herein, the term“vector” or“expression vector” refers to a replicative DNA construct used for expressing at least one polynucleotide in a cell, preferably a eukaryotic cell, more preferably a microalga. The choice of expression vector will depend upon the choice of host. A wide variety of expression host/vector combinations can be employed. Useful expression vectors for eukaryotic hosts include, for example, vectors comprising expression control sequences from SV40, bovine papilloma virus, adenovirus and cytomegalovirus. Useful expression vectors for bacterial hosts include known bacterial plasmids, such as plasmids from Escherichia coli, including pCR 1, pBR322, pMB9 and their derivatives, wider host range plasmids, such as M13 and filamentous single-stranded DNA phages. In a preferred embodiment, the vector is suitable for expression in microalga. Preferred vectors for this invention are vectors developed for algae such as the vectors commonly known by the skilled person such as aspChlamy_4 vector (Invitrogen), or vectors available through Chlamydomonas center. These vectors may contain an additional independent cassette to express a selectable marker that will be used to initially selecting clones that have incorporated the exogenous DNA during the transformation protocol. The expression vector preferably contains an origin of replication in microalga. The expression vector can also contain one or more multiple cloning sites.

The expression vector may also contain an origin of replication in prokaryotes, necessary for vector propagation in bacteria. Additionally, the expression vector can also contain a selection gene for bacteria, for example, a gene encoding a protein conferring resistance to an antibiotic, for example, ampicillin, kanamycin, chloramphenicol, etc. The expression vector can also contain one or more multiple cloning sites. A multiple cloning site is a polynucleotide sequence comprising one or more unique restriction sites. Non-limiting examples of the restriction sites include EcoRI, Sacl, Kpnl, Smal, Xmal, BamHI, Xbal, HincII, Pstl, Sphl, Hindlll, Aval, or any combination thereof.

As it is used herein, the term“polynucleotide” refers to a single-stranded or double-stranded polymer having deoxyribonucleotide or ribonucleotide bases. In a preferred embodiment, the polynucleotide has ribonucleotide bases. In another preferred embodiment, the polynucleotide has deoxyribonucleotide bases.

The polynucleotide or polynucleotides expressed in the vector of the invention as well as the RNA or DNA constructs necessary for preparing the expression vector of the invention can be obtained by means of conventional molecular biology methods included in general laboratory manuals, for example, in “Molecular cloning: a laboratory manual” (Joseph Sambrook, David W. Russel Eds. 2001, 3rd ed. Cold Spring Harbor, New York) or in“Current protocols in molecular biology” (F. M. Ausubel, R. Brent, R. E. Kingston, D. D. Moore, J. A. Smith, J. G. Seidman and K. Struhl Eds, vol. 2. Greene Publishing Associates and Wiley Interscience, New York, N. Y. Updated in September 2006). Useful vectors for microalgal cells are for example RPL23 :Luc:RPL23 (Lopez-Paz et al. 2017. Plant J. 92:1232-1244), pGenD-Ble (Fischer N, Rochaix JD 2001 Mol Gen Genet 265:888-894), pChlamy_4vector (invitrogen) pChlamy_4 vector (Invitrogen), or vectors available through Chlamydomonas center.

As disclosed herein the term “co-expression” refers to the simultaneous expression of at least two polynucleotides of interest. The term“polynucleotide of interest” refers to any polynucleotides the expression of which in a cell is to be achieved. The expression of the polynucleotides of interest may take place at similar or different levels. In a preferred embodiment the first polynucleotide of interest is expressed at the same level as the second polynucleotide of interest. In another preferred embodiment one of the polynucleotides of interest is expressed at least 1%, at least 2%, at least 3%, at least 4%, at least 5%, at least 6%, at least 7%, at least 8%, at least 9%, at least 10%, at least 20%, at least 25%, at least 30%, at least 35 %, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90% or more than the second polynucleotide of interest. In a more preferred embodiment, the two polynucleotides of interest are co-expressed stoichiometrically. In a particular embodiment at least one of the polynucleotides of interest encodes a protein of interest. In another embodiment the two polynucleotides of interest encode a protein of interest. The co-expression of the proteins of interest may take place at similar or different levels. In a preferred embodiment the first protein of interest is expressed at the same level as the second protein of interest. In another preferred embodiment one of the proteins of interest is expressed at least 1%, at least 2%, at least 3%, at least 4%, at least 5%, at least 6%, at least 7%, at least 8%, at least 9%, at least 10%, at least 20%, at least 25%, at least 30%, at least 35 %, at least 40%, at least 45%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90% or more than the second protein of interest. In a preferred embodiment, the two proteins of interest are co-expressed stoichiometrically.

Non-limiting techniques to measure the level of expression of a gene of interest include any known in the art for detecting the expression of a gene and can be based on detecting mRNA or protein. Methods for detecting mRNA are well known in the art and include without limitation, standard assays for determining mRNA expression levels such as qPCR, RT-PCR, RNA protection analysis, Northern blot, RNA dot blot, in situ hybridization, microarray technology, tag based methods such as serial analysis of gene expression (SAGE) including variants such as LongSAGE and SuperSAGE, microarrays, fluorescence in situ hybridization (FISH), including variants such as Flow- FISH, qFiSH and double fusion FISH (D-FISH), and the like. Preferably quantitative or semi-quantitative RT-PCR is preferred. Real-time quantitative or semi-quantitative RT- PCR is particularly advantageous. Virtually any conventional method can be used within the frame of the invention to detect and quantify the levels of proteins. By way of a non-limiting illustration, the expression levels are determined by means of antibodies with the capacity for binding specifically to the protein to be determined (or to fragments thereof containing the antigenic determinants) and subsequent quantification of the resulting antigen-antibody complexes. The antibodies that are going to be used in this type of assay can be, for example, polyclonal sera, hybridoma supernatants or monoclonal antibodies, antibody fragments, Fv, Fab, Fab’ and F(ab’)2, scFv, diabodies, triabodies, tetrabodies and humanized antibodies. At the same time, the antibodies may or may not be labeled. Illustrative, but non-exclusive, examples of markers that can be used include radioactive isotopes, enzymes, fluorophores, chemoluminescent reagents, enzyme cofactors or substrates, enzyme inhibitors, particles, dyes, etc. There is a wide variety of well-known assays that can be used in the present invention, using non-labeled antibodies (primary antibody), labeled antibodies (secondary antibodies); these techniques include Western- blot or immunoblot, ELISA (enzyme-linked immunosorbent assay), RIA (radioimmunoassay), competitive EIA (enzyme immunoassay), DAS-ELISA (double antibody sandwich ELISA), immunocytochemical and immunohistochemical techniques, immunofluorescence, techniques based on the use of biochips or protein microarrays including specific antibodies or assays based on the colloidal precipitation in formats such as reagent strips. Other forms of detecting and quantifying the proteins include affinity chromatography techniques, ligand-binding assays, etc.

In a preferred embodiment the polynucleotide of interest encodes a protein of interest. The term“protein of interest” refers to any protein the expression of which in a cell is to be achieved. In another preferred embodiment, the protein of interest is heterologous. Heterologous sequence could be a sequence that is derived from a different gene or from the same host, from a different strain of host cell, or from an organism of a different taxonomic group (e.g., different kingdom, phylum, class, order, family genus, or species, or any subgroup within one of these classifications). The term "heterologous" is also used synonymously herein with the term "exogenous." In a preferred embodiment, the protein of interest is in the form of a precursor. The term “precursor” refers to a polypeptide which, once processed, can give rise to a protein of interest. In a particular embodiment, the precursor of the protein of interest is a polypeptide comprising a signal sequence or signal peptide. In a preferred embodiment, the vector co-expresses two different proteins of interest. In another preferred embodiment, the first and second proteins of interest are the two different subunits of a heterodimeric protein.

“Heterodimeric protein”, as used herein relates to a macromolecular complex formed by two different protein monomers, or single proteins, which are usually non- covalently bound.

In another preferred embodiment, the first and second proteins of interest are the two subunits of a homodimeric protein, complex formed by two identical proteins.

In a more preferred embodiment the first and second subunits are the heavy chain and the light chain of an antibody. As it is used herein, the term“antibody” refers to a protein including at least one immunoglobulin variable region, for example, an amino acid sequence providing an immunoglobulin variable domain or a sequence of the immunoglobulin variable domain. An antibody can include, for example, a variable heavy chain (H) region (herein abbreviated as VH) and a variable light chain (L) region (herein abbreviated as VL). Typically, an antibody includes two variable heavy chain regions and two variable light chain regions. The term“antibody” encompasses antigen binding antibody fragments (for example, single-chain antibodies, Fab fragments, F(ab’) 2 fragments, Fd fragments, Fv fragments and dAb fragments) as well as whole antibodies, for example, intact and/or full length immunoglobulins of the IgA, IgG types (for example, IgGl, IgG2, IgG3, IgG4), IgE, IgD, IgM (as well as subtypes thereof). The variable heavy and light chain regions can additionally be subdivided into hypervariability regions, referred to as “complementarity determining regions” (“CDR”), mixed together with more conserved regions, referred to as“framework regions” (FR). The extension of FRs and CDRs has been precisely defined (see Rabat, E.A., et al. (1991) Sequences of Proteins of Immunological Interest, Fifth Edition, The United States Department of Health and Human Services, NIH Publication No. 91- 3242; and Chothia, C. et al. (1987) J. Mol. Biol. 196:901-917). Rabat definitions are used in the present document. Each variable heavy and light chain region is typically made up of three CDRs and four FRs, organized from the amino end to the carboxyl end in the following order: FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4. The antibody VH or VL chain can furthermore include all or part of a heavy chain or light chain constant region to thereby form a heavy chain (HC) or light chain (LC immunoglobulin, respectively. Immunoglobulin light and heavy chains can be bound by disulfide bridges. The heavy chain constant region typically includes three constant domains, CH1, CH2 and CH3. The light chain constant region typically includes a CL domain. The variable heavy and light chain region contains a binding domain interacting with an antigen. The constant regions of the antibodies typically mediate the binding of the antibody to host tissues or factors, including various cells of the immune system (for example, effector cells) and the first component (Clq) of the conventional complement system. The term antibody encompasses both antibodies formed by heavy chains and light chains and single-chain antibodies. Therefore, in a particular embodiment the gene of interest encodes a single-chain antibody or a precursor thereof. In another preferred embodiment, the gene of interest encodes an antibody heavy chain or a precursor thereof. In another preferred embodiment, the gene of interest encodes an antibody light chain or a precursor thereof.

As it is used herein, the term“heavy chain” or“HC” encompasses both a full length heavy chain and fragments thereof. A full length heavy chain includes a variable region domain, VH, and three constant region domains, Cid, CH2 and CH3. The VH domain is at the amino terminal end of the polypeptide, and the CH3 domain is at the carboxyl terminal end.

As it is used herein, the term“light chain” encompasses a full length light chain and fragments thereof. A full length light chain includes a variable region domain, VL, and a constant region domain, CL. Like the heavy chain, the variable light chain region domain is at the amino terminal end of the polypeptide.

As it is used herein, the term“single-chain antibody” refers to a molecule modified by means of genetic engineering containing the variable light chain region and the variable heavy chain region bound by means of a suitable peptide linker, formed as a genetically fused single-chain molecule.

In a still more preferred embodiment the antibody is m5c3 which specifically binds to S1004.

In a preferred embodiment the first and second subunits are codon optimized for expression in microalgal cells. The term“codon optimised” as referred to herein relates to the alteration of codons in nucleic acid molecules to reflect the typical codon usage of the host organism, in the present case humans, without altering the polypeptide encoded by the DNA, to improve expression.

A plethora of methods and software tools for codon optimisation are known to the skilled person. Codon-optimised nucleic acids for use according to the present invention can be prepared by replacing the codons of the nucleic acid encoding the immunogen by "microalgae" codons, i.e., the codons are those that appear frequently in highly expressed microalgae genes.

In a preferred embodiment of the protein of interest, the heavy chain of the antibody has the sequence shown in SEQ ID NO: 1 and the light chain of the antibody has the sequence shown in SEQ ID NO: 2. As disclosed herein, SEQ ID NO 1 or 2 may contain a sequence for a restriction site at their 5 'or 3 'end.

The expression vector of the invention is suitable for the co-expression of a first and second protein of interest in microalgae cells. The term“microalgae cell” or “microalga” is used such that it refers not only to the particular subject cell, but to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein. Microalga as used herein relates to large and diverse group of simple, typically autotrophic organisms, ranging from unicellular to multicellular forms, microscopic algae, typically found in freshwater and marine systems. Examples of suitable microalgae for expressing the vector of the invention, include microalgae from the phylums Cyanophyta, Chlorophyta, Rhodophyta, Heterokontophyta, and Haptophyta. The algae from the phylum Cyanophyta can be Spirulina ( Arthrospira ), Aphanizomenon flos-aquae, Anabaena cylindrica or Lyngbya majuscule. The algae from the phylum Chlorophyta can be Chlorella, Scenedesmus, Dunaliella, Tetraselmis, Haematococcus, Ulva, Codium, Botryococcus or Caulerpa spp. the algae from the phylum Rhodophyta can be Porphyridium cruentum, Gracilaria sp., Grateloupia sp, Palmaria sp. Corallina sp., Chondrus crispus, Porphyra sp. or Rhodosorus sp. The algae from the phylum Heterokontophyta can be Nannochlorropsis oculata, Odontella aurita, Phaeodactylum tricornutum. Fucus sp. Sargassum sp. Padina sp., Undaria pinnatifida, or Laminaria sp. The algae from the phylum Haptophyta can be Isochrysis sp. Tisochrysis sp. or Pavlova sp. The algae can be Chrypthecodinium cohnii, Schizochytrium, Ulkenia or Euglena gracilis. The algae can be a green microalga such as Chlorella, Scenedesmus, Dunialiella, Haematococcusand Bracteacoccus haptophyte microalgae such as Isochrysis, and heterokontophyta microalgae such as Phaeodactylum, Ochromonas and Odontella.

In a particular embodiment, the microalga is a green alga. Suitable examples of green alga are Chlorella or Haematyococcus or Chlamydomonas . In another preferred embodiment, the microalga is from genus Chlamydomonas.

Chlamydomonas, as used herein relates to a genus of green algae consisting of about 325 species all unicellular flagellates, found in stagnant water and on damp soil, in freshwater, seawater, and even in snow as "snow algae". In a more preferred embodiment, the microalga is Chlamydomonas reinhardtii. In another preferred embodiment, the microalga is Botryococcus braunii.

Chlamydomonas reinhardtii, as used herein is a single-cell green alga about 10 micrometres in diameter that swims with two flagella. It has a cell wall made of hydroxyproline-rich glycoproteins, a large cup-shaped chloroplast, a large pyrenoid, and an "eyespof ' that senses light.

The first and second polynucleotides of interest expressed by the vector of the invention are provided respectively in a first and second expression cassettes, each expression cassette comprising a 5’ cis regulatory region and a 3’ cis regulatory region operatively linked with the first and second polynucleotides and wherein both the 5’ and 3’ cis regulatory regions of the first expression cassette are different from the 5’ and 3’ cis regulatory regions of the second expression cassette.

“Operatively linked” as disclosed herein refers to different DNA fragments joined together such that the amino acid sequences encoded by the DNA fragments remain in- frame. In a still more preferred embodiment, both the 5’ and 3’ cis regulatory regions of the first expression cassette are different from the 5’ and 3’ cis regulatory regions of the second expression cassette.

“An expression cassette” as disclosed herein refers to a component of an expression vector comprising one or more polynucleotides of interest and the sequences controlling their expression. Non-limiting basic components of an expression cassette include promoter elements, the gene(s) of interest, and an appropriate mRNA stabilizing polyadenylation signal. Other frequently employed cis-acting elements include internal ribosome entry site (IRES) sequences to allow expression of two or more genes without the need for an additional promoter, introns and post-transcriptional regulatory elements to improve transgene expression. In a preferred embodiment, the vector according to the inventions comprises at least one, at least two or at least three expression cassettes. In a preferred embodiment the vector comprises two expression cassettes.

In a preferred embodiment, the vector of the invention further comprises a third expression cassette comprising a 5’ and a 3’ cis regulatory regions different from the 5’ and 3’ cis regulatory regions of the first and second expression cassettes.

In a preferred embodiment, the third expression cassette comprises a selection gene. As it is used herein, the term“selection gene” is a gene, the expression of which creates a detectable phenotype and which facilitates detection of host cells that contain an expression cassette having the selection marker. Non-limiting examples of selection genes include drug resistance genes and nutritional markers. For example, the selection gene can be a gene that confers resistance to an antibiotic selected from the group consisting of: ampicillin, kanamycin, erythromycin, chloramphenicol, gentamycin, kasugamycin, rifampicin, spectinomycin, D-Cycloserine, nalidixic acid, streptomycin, or tetracycline, or to herbicides such as acetoliasa synthase gene (ALS) which confers resistance to the herbicide silfonilurea, or the BAR gene conferring resistence to the herbicide phosphinothricin (PPT). Other non-limiting examples of selection genes include adenosine deaminase, aminoglycoside phosphotransferase, dihydrofolate reductase, hygromycin-B-phosphotransferase, thymidine kinase, and xanthine-guanine phosphoribosyltransferase. A single expression cassette can comprise one or more selection genes such as a nucleotide sequence of shBle gene that codes for bleomycin resistance and can be selected for using bleomycin, a neo gene that codes for kanamycin resistance and can be selected for using kanamycin, G418, etc. Non-limiting-examples of selection genes also include nucleotide sequences encoding a reporter protein. A “reporter protein” as used herein refers to a protein that typically is not present in the recipient organism and typically encodes for proteins resulting in some phenotypic change or enzymatic property which may allow for the selection of transformed cells. Examples of such genes are provided in K. Wising et al. Ann. Rev. Genetics, 22, 421 (1988). Non-limiting examples of reporter genes include the beta-glucuronidase (GUS) of the uidA locus of E. coli, the chloramphenicol acetyl transferase gene from Tn9 of E. coli, the green fluorescent protein from the bio luminescent jellyfish Aequorea victoria, and the luciferase due) genes from firefly Photinus pyralis. An assay for detecting reporter gene expression may then be performed at a suitable time after said gene has been introduced into recipient cells. One preferred such assay entails the use of the gene encoding beta-glucuronidase (GUS) of the uidA locus of E. coli as described by Jefferson et al., (Biochem. Soc. Trans. 15, 17-19 (1987) to identify transformed cells, referred to herein as GUST .

In a preferred embodiment, the expression cassette of the invention comprises as a selection gene a nucleotide sequence of shBle gene (SEQ ID NO: 3) that codes for bleomycin resistance. As disclosed herein, the shBle gene may contain an intron sequence, preferably RBCS2 intron sequence SEQ ID NO 10. More preferably, the shBle contains the RBCS2 intron sequence as disclosed in SEQ ID 22.

The vector of the invention may comprise more than three expression cassettes. Each expression cassette of the vector of the invention comprises a 5’ cis regulatory region and a 3’ cis regulatory region. A“cis regulatory region” as disclosed herein refers to regions of non-coding DNA which regulate the transcription of neighboring genes. The regulatory region may be located upstream (5' non-coding sequences), within, or downstream (3' non-coding sequences) of a coding region, and which influence the transcription, RNA processing or stability, or translation of the associated coding region. Regulatory regions may include promoters, translation leader sequences, RNA processing site, effector binding site and stem-loop structure. The boundaries of the coding region are determined by a start codon at the 5’ (amino) terminus and a translation stop codon at the 3’ (carboxyl) terminus. A coding region can include, but is not limited to, prokaryotic regions, cDNA from mRNA, genomic DNA molecules, synthetic DNA molecules, or RNA molecules. If the coding region is intended for expression in a eukaryotic cell, a polyadenylation signal and transcription termination sequence will usually be located 3' to the coding region. As disclosed herein any known in the art cis regulatory region may be included in the expression cassette. In a preferred embodiment the 5 'cis regulatory region comprises: a promoter, a 5'UTR and/or any flanking sequence. In a preferred embodiment the 3 'cis regulatory region comprises: 3'UTR, terminator sequences and/or flanking sequences. As used herein, the 5' and/or 3 'cis regulatory regions may contain any restriction site at their 5 'or 3 'end.

As it is used herein, the term“5’-UTR”, refers to the sequence at the 5’ end of the expression cassette which is not translated and which contains the region necessary for replication, i.e., the sequence which is recognized by the polymerase during synthesis of the RNA molecule from the RNA template. In a preferred embodiment, the 5’ untranslated sequence is selected from the group consisting of RPL23, FDX1, HSP70 A, RBCS2, PSAD or any other constitutive highly expressed Chlamydomonas gene. Non-limiting examples of 5'UTR include RPL23-5’UTR (SEQ ID NO: 23), FDX- 5’UTR (SEQ ID NO: 24) and RBCS2-5’UTR (SEQ ID NO: 25).

As it is used herein, the term“promoter” refers to a nucleic acid sequence which is structurally characterized by the presence of a binding site for the DNA-dependent RNA polymerase, transcription start sites and any other DNA sequence including, but without being limited to, transcription factor binding sites, repressor and activator protein binding sites and any other nucleotide sequence known in the state of the art capable of directly or indirectly regulating transcription from a promoter. Promoter refers to a DNA fragment capable of controlling the expression of a coding sequence or functional RNA. In general, a coding region is located 3’ to a promoter. Promoters may be derived in their entirety from a native gene, or be composed of different elements derived from different promoters found in nature, or even comprise synthetic DNA segments. It is understood by those skilled in the art that different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental or physiological conditions. Promoters which cause a gene to be expressed in most cell types at most times are commonly referred to as "constitutive promoters". It is further recognized that since in most cases the exact boundaries of regulatory sequences have not been completely defined, DNA fragments of different lengths may have identical promoter activity. A promoter is generally bounded at its 3’ terminus by the transcription initiation site and extends upstream (5’ direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter will be found a transcription initiation site (conveniently defined for example, by mapping with nuclease Sl), as well as protein binding domains (consensus sequences) responsible for the binding of RNA polymerase. In a preferred embodiment, flanking sequences may be operatively linked to the promoter sequences. The term “flanking sequence” refers to a DNA sequence extending on either side of a specific gene sequence. Flanking sequences may be preferably located upstream of the promoter. In the vector of the invention, the 5'cis regulatory region of the expression cassette comprises a sequence selected from the group consisting of the RPL23 promoter, also known as PL (SEQ ID NO: 4), the ferredoxin 1 FDX promoter (SEQ ID NO: 5) and the HSP70- RBCS2chimerie promoter, also known as AR (SEQ ID NO: 6).

In a preferred embodiment the 5 'cis regulatory region comprises RPL23 promoter (SEQ ID NO: 4) and RPL23- 5’UTR (SEQ ID NO: 23). In a more preferred embodiment the 5'cis regulatory region comprises the RPL23 promoter and RPL23- 5’UTR as disclosed in SEQ ID NO 29. In another preferred embodiment, the 5'cis regulatory region is the sequence shown in SEQ ID NO 29.

In a preferred embodiment the 5'cis regulatory region comprises FDX promoter (SEQ ID NO: 5) and FDX 5’UTR (SEQ ID NO: 24). In a more preferred embodiment the 5'cis regulatory region comprises the FDX promoter and FDX - 5’UTR as disclosed in SEQ ID NO 31. In another preferred embodiment, the 5'cis regulatory region is the sequence shown in SEQ ID NO: 31.

In a preferred embodiment the 5'cis regulatory region comprises AR chimeric promoter (SEQ ID NO: 6) and 5’UTR RBCS2 (SEQ ID NO: 25). In a more preferred embodiment the 5'UTR comprises additionally the RBCS2 intron sequence (SEQ ID NO 10). In a still more preferred embodiment the 5'cis regulatory region comprises the AR promoter and 5’UTR as disclosed in SEQ ID NO 33. In another preferred embodiment, the 5'cis regulatory region is the sequence shown in SEQ ID NO 33.

As it is used herein, the term“3’-UTR”, refers to an untranslated region which appears after the end codon. The 3’ untranslated region typically contains a polyadenine tag which allows increasing RNA stability, and therefore the amount of products resulting from the translation of said RNA. As disclosed herein, the 3 'UTR sequences may also contain other 3 'cis sequences, such as: terminator sequences, flanking sequences and/or any restriction site at their 5 'or 3 'end. Non-limiting examples of 3 'cis sequences include: 3’cis RPL23 (SEQ ID NO 26), 3’cis FDX1 (SEQ ID NO 27) and 3 'cis RBCS2 (SEQ ID NO 28). Flanking sequences may be located downstream of 3 'UTR. In the vector of the invention the 3’ cis regulatory regions comprise a sequence selected from the 3 'UTR of the RBCS2 gene, the 3 'UTR of the RPL23 gene and 3 'UTR of the FDX gene.

In a preferred embodiment the 3 'cis regulatory region comprises the 3 'UTR RPL23 (SEQ ID NO: 7). In a more preferred embodiment the 3 'cis regulatory region comprises additionally the RPL23 terminator and flanking regions (SEQ ID NO: 26). In a still more preferred embodiment the 3 'cis regulatory region comprises the 3 'UTR RPL23 containing 3’UTR terminator and flanking region as disclosed in SEQ ID NO 30. In another preferred embodiment, the 3 'cis regulatory region is the sequence shown in SEQ ID NO: 30.

In a preferred embodiment the 3 'cis regulatory region comprises the 3’UTR FDX (SEQ ID NO 9). In a more preferred embodiment the 3 'cis regulatory region comprises additionally the FDX terminator and flanking regions (SEQ ID NO: 27). In a still more preferred embodiment the 3 'cis regulatory region comprises the 3 'UTR FDX containing the 3’UTR terminator and flanking region as disclosed in SEQ ID NO 32. In another preferred embodiment, the 3 'cis regulatory region is the sequence shown in SEQ ID NO: 32.

In a preferred embodiment the 3 'cis regulatory region comprises the 3 'UTR RBCS2 (SEQ ID NO 8). In a more preferred embodiment the 3 'cis regulatory region comprises additionally the RBCS2 flanking regions (SEQ ID NO: 28). In a still more preferred embodiment the 3 'cis regulatory region comprises the 3 'UTR RBCS2 containing the 3’UTR terminator and flanking region as disclosed in SEQ ID NO 34. In another preferred embodiment, the 3 'cis regulatory region is the sequence shown in SEQ ID NO: 34.

The poly(A) tag can be of any size provided that it is sufficient to increase stability in the cytoplasm of the molecule of the vector of the invention. In a preferred embodiment, the 3’ cis regulatory sequence comprises a sequence selected from the group consisting of 3 'UTR of RPL23 (SEQ ID NO: 7), the 3 'UTR of the RBCS2 gene (SEQ ID NO: 8) and the 3 UTR of the FDX gene promoter (SEQ ID NO: 9).

In a preferred embodiment, the cis regulatory region comprises a sequence selected from the group consisting of a HSP70A/ RBCS2 chimeric promoter (SEQ ID NO: 6), the RPL23 gene promoter (SEQ ID NO: 4), the FDX gene promoter (SEQ ID NO: 5), the 3'UTR of the RBCS2 gene (SEQ ID NO: 8), the 3 'UTR of the RPL23 gene (SEQ ID NO: 7), 3'UTR of the FDX gene (SEQ ID NO: 9) and any combination thereof.

In a more preferred embodiment, the 5' cis regulatory region comprises a sequence selected from the group consisting of the HSP70A/ RBCS2 chimeric promoter (SEQ ID NO: 6), the RPL23 gene promoter (SEQ ID NO: 4) and the FDX gene promoter (SEQ ID NO: 5), and the 3' cis regulatory region comprises a sequence selected from the group consisting of 3'UTR of the RBCS2 gene (SEQ ID NO: 8), the 3'UTR of the RPL23 gene (SEQ ID NO: 7) and the 3'UTR of the FDX gene (SEQ ID NO: 9).

In a preferred embodiment, the cis regulatory regions in one of the expression cassettes comprises a sequence selected from the group consisting of HSP70A/ RBCS2 chimeric promoter (SEQ ID NO: 6) and 3 'UTR of the RBCS2 gene (SEQ ID NO: 8), HSP70A/ RBCS2 chimeric promoter (SEQ ID NO: 6) and the 3 'UTR of the RPL23 gene (SEQ ID NO: 7); and HSP70A/ RBCS2 chimeric promoter (SEQ ID NO: 6) and 3'UTR of the FDX gene (SEQ ID NO: 9).

In another preferred embodiment, the cis regulatory regions in one expression cassette comprises a sequence selected from the group consisting of RPL23 gene promoter (SEQ ID NO: 4) and 3'UTR of the RBCS2 gene (SEQ ID NO: 8), RPL23 gene promoter (SEQ ID NO: 4) and the 3 'UTR of the RPL23 gene (SEQ ID NO: 7); and RPL23 gene promoter (SEQ ID NO: 4) and 3'UTR of the FDX gene (SEQ ID NO:

9)·

In another preferred embodiment, the cis regulatory regions in one expression cassette comprises a sequence selected from the group consisting of FDX gene promoter (SEQ ID NO: 5), and 3'UTR of the RBCS2 gene (SEQ ID NO: 8), FDX gene promoter (SEQ ID NO: 5) and the 3 'UTR of the RPL23 gene (SEQ ID NO: 7); and FDX gene promoter (SEQ ID NO: 5), and 3'UTR of the FDX gene (SEQ ID NO: 9).

In another preferred embodiment, the 5 'cis regulatory regions in one expression cassette is selected from the sequences shown in SEQ ID NO 29, SEQ ID NO: 31 and SEQ ID NO: 33. In another preferred embodiment, the 3 'cis regulatory regions in one expression cassette is selected from the sequences shown in SEQ ID NO 30, SEQ ID NO: 32 and SEQ ID NO: 34. In a still more preferred embodiment the sequence comprising HSP70A/ RBCS2 chimeric promoter and the sequence comprising the 3'UTR of the RBCS2 are in one of the cassettes, the sequence comprising RPL23 gene promoter and the sequence comprising 3'UTR of the RPL23 gene are in a different cassette, and the sequence comprising FDX gene promoter and sequence comprising the 3 'UTR of the FDX gene are in a cassette different from the other two.

In a preferred embodiment, the third expression cassette comprises the HSP70A/ RBCS2 chimeric promoter and the 3 'UTR of the RBCS2 gene as cis regulatory regions.

In a preferred embodiment, the 5’ and 3’ cis regulatory regions in the first expression cassette are respectively a sequence comprising the RPL23 gene promoter (SEQ ID NO: 4) and the 3'UTR of the RPL23 gene (SEQ ID NO: 7) and the 5’ and 3’ regulatory regions of the second expression cassette are respectively a sequence comprising the FDX gene promoter (SEQ ID NO: 5) and the 3'UTR of the FDX gene (SEQ ID NO: 9).

In another preferred embodiment of the vector of the invention, the polynucleotide of interest within at least one of the expression cassettes encodes a protein of interest and said expression cassette comprises an intron sequence within the region encoding the protein of interest or the protein encoded by the selection gene and/or a nucleotide sequence encoding a signal peptide in the same open reading frame at position 5' respect to the nucleotide sequence encoding the protein of interest or the protein encoded by the selection gene.

In a preferred embodiment, the vector comprises a first expression cassette comprising a polynucleotide encoding the light chain of an antibody, the RPL23 gene promoter and the 3 'UTR of the RPL23 gene, the second expression cassette comprises the FDX gene promoter, the 3 'UTR of the FDX gene and a polynucleotide encoding the heavy chain of the antibody, and the third expression cassette comprises the bleomycin resistance gene, the HSP70A/ RBCS2 chimeric promoter and the 3'UTR of the RBCS2 gene as cis regulatory regions.

As it is used herein, the term “intron” or“intron sequence” refers to any nucleotide sequence within a gene that is removed by RNA splicing during maturation of the final RNA product. The term intron refers to both the DNA sequence within a gene and the corresponding sequence in RNA transcripts. In a preferred embodiment, the polynucleotide of interest within at least one of the expression cassettes comprises an intron within any of the expression cassette components. In a more preferred embodiment, at least one of the expression cassettes comprises an intron within the polynucleotide of interest, the region encoding the protein of interest or the selection gene. In a still more preferred embodiment the intron is a RBCS2 intron having the sequence shown in SEQ ID NO: 10.

In another preferred embodiment, at least one of the expression cassettes comprises a nucleotide sequence encoding a signal peptide in the same open reading frame at position 5 ' respect to the nucleotide sequence encoding the protein of interest or the protein encoded by the selection gene.

As it is used herein, the term“signal peptide” or“secretory signal peptide” refers to a peptide of a relatively short length, generally between 5 and 30 amino acid residues, directing proteins synthesized in the cell towards the secretory pathway. The signal peptide usually contains a series of hydrophobic amino acids adopting a secondary alpha helix structure. Additionally, many peptides include a series of positively-charged amino acids that can contribute to the protein adopting the suitable topology for its translocation. The signal peptide tends to have at its carboxyl end a motif for recognition by a peptidase, which is capable of hydrolyzing the signal peptide giving rise to a free signal peptide and a mature protein. The signal peptide can be cleaved once the protein of interest has reached the appropriate location. Any secretory signal peptide may be used in the present invention. As a way of illustrative non limitative examples signal peptide from Chlamydomonas reinhardtii carbonic anhydrase (CAH1) (SEQ ID NO: 12), signal peptide from Chlamydomonas reinhardtii periplasmic arylsulfatase (ARS1) (SEQ ID NO: 14) or the signal peptide from Chlamydomonas reinhardtii Gametolysin Ml l (SEQ ID NO: 16) may be used. In a preferred embodiment CAH1 signal peptide is encoded by SEQ ID NO: 11, ARS1 signal peptide is encoded by SEQ ID NO: 13 and Gametolysin Ml l signal peptide is encoded by SEQ ID NO: 15.

In a preferred embodiment the sequence encoding the signal peptide is in the same open reading frame at position 5' respect to the nucleotide sequence encoding the polynucleotide of interest. In a more preferred embodiment, the polynucleotide of interest encodes a protein of interest. In another preferred embodiment the sequence encoding the signal peptide is in the same open reading frame at position 5 ' respect to the selection gene. In a preferred embodiment the signal peptide is a murine signal peptide. In a more preferred embodiment, the murine signal peptide is selected from SEQ ID NO 20 and SEQ ID NO 21. In a still more preferred embodiment, the sequence encoding the murine signal peptide is selected from SEQ ID 17 and SEQ ID 18. Fusion of signal peptide to the protein of interest results in secretion of the fusion protein to media, which is the preferred strategy since it permits easy and efficient purification from the extracellular medium. In addition, the secretory production of recombinant proteins has the advantage that proteolytic degradation may be avoided and that there is a better chance of correct protein folding.

Host cell

In another aspect the invention relates to a host cell comprising a vector as described previously.

The term“host cell” is used such that it refers not only to the particular subject cell, but to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein. A host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., yeast or plant cells).

In a preferred embodiment the host cell is a microalga. In a more preferred embodiment, the host cell is a green microalga. In a more preferred embodiment the host cell is from genus Chlamydomonas.

In a more preferred embodiment the host cell is C. reinhardtii.

The invention also relates to the use of the vector according to the invention or a host cell according to the invention for simultaneously co-expressing the first and second polynucleotide of interest.

In a preferred embodiment, the co-expression of the first and second polynucleotide of interest is stoichiometric.

In another preferred embodiment the first polynucleotide of interest encodes a protein of interest. In another preferred embodiment the first and second polynucleotide of interest encode a protein of interest. In a more preferred embodiment the first and second protein of interest are co-expressed. In a still more preferred embodiment the co- expression of the first and second protein of interest is stoichiometric. As disclosed herein“stoichiometric manner” or“stoichiometric” means that the two polynucleotides and/or proteins of interest are expressed at the same level (1 :1).

All the terms and embodiments previously described are equally applicable to this aspect of the invention.

A method for co-expressing two polynucleotides of interest

In another aspect the invention relates to a method for co-expressing two polynucleotides of interest which comprises growing a cell carrying a vector according to the invention in conditions suitable for allowing the expression of the polynucleotides of interest.

The method of the invention may comprise a first step of introducing in a microalga cell a vector according to the invention. The vector of the invention may be introduced into a microalga by means of well-known Chlamydomonas nuclear transformation techniques such as transfection (that may be chemical or non-chemical such as glass beads), electroporation and particle bombardment using the vector of the invention that has been isolated. In a preferred embodiment the vector is introduced by transformation. The transformed algae may be recovered on a solid nutrient media or in liquid media.

In addition, the method of the invention may comprise growing said cell in conditions suitable for allowing the co-expression of the polynucleotides interest. Culture conditions suitable for the growth of the microalga and for the expression of the protein of interest may be different for each type of microalga. However, those conditions are known by skilled workers and are readily determined. In a particular embodiment, the microalga is grown under mixotrophic conditions. In a particular embodiment, the microalga is cultured in a photobioreactor in a suitable medium, under a suitable luminous intensity, at a suitable temperature. Practically any medium suitable for growing microalgae can be used; nevertheless, illustrative, non-limitative examples of said media include TAP media. The luminous intensity can vary widely, nevertheless, in a particular embodiment, the luminous intensity is comprised between 25 and 150 pmol photons m-2 s-l, particularly 100 mE. The temperature can vary usually between about l7°C and about 30°C, particularly 25°C. The culture can be performed in the absence of aeration or with aeration. Similarly, the duration of maintenance can differ with the microalga and with the amount of protein desired to be prepared. Again, those conditions are well known and can readily be determined in specific situations. In a preferred embodiment the microalga is a green alga, more particularly from genus Chlamydomonas, and more particularly Chlamydomonas reinhardtii.

In a preferred embodiment the first polynucleotide of interest encodes a protein of interest. In another preferred embodiment the first and second polynucleotide of interest encode a protein of interest.

In a particular embodiment, the method of the invention further comprises purifying the protein of interest. Suitable purification can be carried out by methods known to the person skilled in the art such as by using lysis methods, extraction, ion exchange resins, electrodialysis, nanofiltration, etc.

In a preferred embodiment, the co-expression of the first and second proteins of interest is stoichiometric. Thus, if stoichiometric levels of the two proteins are not detected, an additional vector may be introduced into the cell having a vector of the invention. The detection of the two proteins of interest can be done within a purified sample, cell culture media or whole cell extracts (total culture).

Thus, in a particular embodiment, the method of the invention further comprises growing a cell carrying a second expression vector comprising the first or second expression cassette used in the first expression vector according to the invention.

In a preferred embodiment when the expression and/or secretion of one of the proteins of interest is not stoichiometric (1 :1) in relation to the expression and/or secretion of the other protein of interest, the selected cell is retransformed with a second expression vector comprising an expression cassette encoding the protein of interest with the lowest expression and/or secretion. As used herein, the non- stoichiometric expression may lead to an insoluble, inactive or unstable heterodimer.

In a preferred embodiment the second expression vector comprises an expression cassette comprising a nucleotide sequence of a selection gene different from the selection gene used in the vector used for the expression of the first and second proteins of interest. In a preferred embodiment the selection gene found in the second vector is the paromomycin (paro ) resistance gene (SEQ ID 19). In a preferred embodiment the second transformants with an expression vector according to the invention are selected on paromomycin (paro ) containing plates.

All the terms and embodiments previously described are equally applicable to this aspect of the invention.

A method for selecting a cell

In another aspect the invention relates to a method for selecting a cell co- expressing in a stoichiometric manner two proteins of interest, comprising

a) transforming a cell with a vector according to the invention,

b) detecting the expression of the two proteins of interest, and

c) selecting a cell expressing the two proteins of interest in a stoichiometric manner.

In a preferred embodiment the method for detecting the expression of the two proteins of interest is performed upon total culture, which includes media and cells that were previously disrupted by a cycle of freeze/thaw. In a preferred embodiment the expression of the two proteins of interest is detected by sandwich ELISA.

In a preferred embodiment the two proteins of interest are the heavy chain and the light chain of an antibody. In a more preferred embodiment the two proteins of interest are detected by an antibody specific for Fc region and an antibody specific for F(ab')2 region.

All the terms and embodiments previously described are equally applicable to this aspect of the invention.

The invention will be described by way of the following examples which are to be considered as merely illustrative and not limitative of the scope of the invention.

EXAMPLES

Example 1- Vector construction

Expression strategy is based on the design of a single vector (Figure 1) containing three cassettes: one for the expression of an antibiotic resistance for initial selection of transformants (Ble R ), one for the expression of the heavy chain (HC) and one for the expression of the light chain (LC). Because use of multiple copies of transgene arranged in tandem are associated with low transgene expression (Garrick et al. 1998 Nat Genet. 18(l):56-9) use of repetitive elements in one single vector, such as same cis regulatory elements to drive multiple gene expression, may also cause a decreased transgene expression. To increase probability of multiple high transgene expression three different cis regulatory regions (5’ and 3’) were included in a single vector to drive expression of a monoclonal antibody as an example of dimeric protein: HSP70A/ RBCS2 (AR) promoter and 5’UTR RBCS2 (containing RBCS2 intron as previously described and 3’UTR RBCS2 to drive expression of bleomycin resistance, Ferredoxin 1 ( FDX1 , also called PETF) (Crel4.g626700) cis regulatory regions (5’ and 3’) to drive expression of the heavy chain (HC) and Chlamydomonas Ribosomal protein L23 ( RPL23 ) (Cre04.g2l 1800) cis regulatory regions to drive light chain (LC) expression. The FDX1 and RPL23 cis regulatory regions have been recently described (Lopez-Paz et al. 2017). AR quimeric promoter, including also an intron of RBCS2 has been also previously used (Heitzer and Zschoemig 2007 BioTechniques 43: 324-328. HC and LC sequences were codon adapted with IDT web tool (https://eu.idtdna.com/CodonOpt) and murine signal sequence was taken from the original murine antibody sequence.

Examples 2- Screening of mAh production in Chlamydomonas transformants

The high variability in expression levels among Chlamydomonas transformants makes necessary the development of a high throughput screening to select the highest expressing transformants, therefore, a rapid sandwich ELISA was developed to identify clones expressing both heavy and light chain. Briefly, initial transformants were selected on zeocin containing plates, individually picked and grown in 96 well plates until reaching stationary phase. Since we had no information if the used murine signal peptide would be functional in Chlamydomonas our screening was performed with total culture (this includes media and cells that were previously disrupted by a cycle of freeze/thaw). Presence of fully assembled antibody was determined by a sandwich

ELISA where capture antibody was an Anti-Mouse IgG, Fc-g Fragment Specific antibody, and detection antibody was a conjugated anti-mouse IgG, F(ab’)2 fragment specific. As a positive control and as a reference of assay sensitivity serial dilutions of purified 5mC3 (CHO produced) were used to create a standard trend line. The inclusion in the assay of standard curves containing wild type cell extract or culture media showed that neither the crude cell extract nor culture media prevented the mAh from being recognized by the used antibodies. Values obtained with non transformed strains (media or total culture extracts) served as references of background signal.

Two different strains were used to test the expression of recombinant m5C3 antibody: wild type CC-124 and mutant strain UVM4. For the case of CC-124 strain, four out of 184 transformants showed, at least, 5 -fold signal increase relative to background (represented by a line in Figure 2) and were selected for further analysis (18B1, 19A1, 19A2 and 19A3). Results of two transformants considered not positive are also shown (Figure 2: 18B2 and 19B1). For the case of UVM4 (Neupert et al, 2009. The Plant Journal. 57: 1140-1150), a mutant reported to express transgenes to higher levels), 9 out of 96 initial transformants (resistant to l5pg/ml zeocin) showed, at least, a 50-fold signal increase relative to background (represented by a line in Figure 2). Four transformants were selected with the highest expression that showed, at least, a lOO-fold signal increase relative to mean value.

Expression of both chains was analyzed by western blot since the used anti mouse IgG that reacts with F(ab’)2 region may also react with single heavy chain- although with less affinity than for the light chain or Fab2 domain- and therefore it is possible to obtain signal in clones that express only heavy chain. The immunoblot was performed using an anti-mouse IgG that preferentially detects the heavy chain, and an anti-mouse F(ab’)2 region (Figure 3). Clone 19A3 was selected for western blot analysis since it showed maximum signal by Sandwich ELISA. Surprisingly, whereas, light chain was detected by immunoblot in concentrated clarified media, no heavy chain was detected in clone 19A3. In the case of UVM4 positive transformants, it was possible to detect both light and heavy chain, although expression of light chain was in all cases significantly higher than expression of heavy chain. This result may not be due to lower transcription/translational effectiveness of FDX1 promoter/regulatory regions compared to RPL23 regions but may account for the lower unstability or secretion of heavy chain vs light chain. (Figure 3). Importantly, no band or signal was detected in UVM4 or CC124 negative controls with any of the antibodies used. Under reducing conditions both antibody chains arc detected separately, whereas in non-rcduccd SDS- PAGE a high molecular weight signal is detected, which corresponds to the fully- assembled antibody (Figure 3C). Under both conditions (denaturing and non denaturing) it can be observed that stoichiometry differs from 5C3 antibody expressed in mammalian cells: LC is overexpressed relative to HC, presumably due to poor expression/secretion or instability of this chain.

In order to increase expression of the heavy chain antibody, and consequently, levels of fully assembled antibody, clone 19A3 was retransformed with a vector containing only heavy chain plus a different selectable marker. A similar ELISA sandwich screening was performed with this second round of transformants and clones with signal above the mean value (signal in the range of 19A3 signal) were selected for further characterization.

Example 3 -Retransformation and screening of transformants.

HC-paro R vector (Figure 4) was transformed into 19 A3 clone and 376 initial transformants (resistant to 25 gg/ml paromomycin) were analyzed by sandwich ELISA as reported previously. In this second ELISA, the median RLU value of all the initial transformants (that was similar to 19A3 signal) was taken as background. Clones that showed a 3 -fold RLU signal above background were considered positives and were selected for further analysis (19A3#1 and 19A3#6) (Figure 5). Quantification by Sandwich ELISA of mAb expression in both cells and clarified media revealed that in both transformants accumulation occurred mainly in the media (only 1% of total mAb was found in cell extracts) and therefore, clarified media was used in the following analysis. The standard curve included in the assay (m5c3 purified mAb) allowed the estimation of the mAb expressed (Figure 6).

To evaluate expression of fully assembled mAb in the selected transformants (19A3#1, 19A#6 and 19 A3) an immunoblot analysis was performed. Immunoblots were performed using an anti-mouse IgG and an anti-mouse IgG specific for F(ab’)2 region under reducing and non-reducing conditions (Figure 5B). Results showed that increase on RLU expression correlated with increased heavy chain expression in retransformed clones and, accordingly, with an increase in the expression of fully assembled mAb. ELISA sandwich and Inmmunoblot assays validate our high-throughput screening method and demonstrates that Chlamydomonas may express and secrete fully assemble mammalian monoclonal antibodies.

Expression of mAh in selected clones was estimated on the base of these immunoblots. Based on the non-reducing immunoblot approximately 0.4 pg/L of complete mAh was estimated in clones 19A#1 and 19A#6 and approximately 0.1 pg/F was estimated in 19A3 transformant. Based on the reducing immunoblot using anti-Fab, approximately 1.5 pg/L of light chain were estimated in 19A#1, 19A3#6 and 19A3 transformants. In the case of heavy chain, based on the reducing immunoblot with anti- Fc region around a 0.2 pg/F were estimated for clones #1 and #6 whereas the heavy chain was too low to be quantifiable in 19A3 transformant. That estimation of expression corroborates that the increase on heavy chain copy number resulted in an increase on the expression of complete mAh. Note that since insertion of transgene during transformation occurs randomly in the genome, probability of second copy being inserted next to first insertion is low and so it is the probability of silencing due to multiple copy arrangement.

Examples 4-mAb characterization in selected transformants.

mAh expression in selected tranformants (19A3#1, 19A3#6 and UVM4#2) was monitored at different stages of growth. Quantification in samples (extracellular and intracellular) was done by sandwich EFISA, using a standard curve of purified mouse m5C3. Growth rate was monitored by OD 7 50nm and no significant differences were observed between transformed and wild type strains (Figure 7).