Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
HETEROLOGOUS PRODUCTION OF PATCHOULOL, BETA-SANTALENE, AND SCLAREOL IN MOSS CELLS
Document Type and Number:
WIPO Patent Application WO/2014/206412
Kind Code:
A1
Abstract:
The present invention relates to methods for preparing patchoulol, β-santalene, and sclareol in transgenic moss cells comprising heterologous nucleic acid molecules encoding a polypeptide or synthasecapable of using FPP or GGPP as substrate, methods for producing the transgenic moss cell, as well as the transgenic moss cell itself.

Inventors:
SIMONSEN HENRIK (DK)
CHEN DONG-FANG (GB)
ZHAN XIN (DK)
PAN XIWU (CN)
KING BRIAN CHRISTOPHER (DK)
Application Number:
PCT/DK2014/050130
Publication Date:
December 31, 2014
Filing Date:
May 07, 2014
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
KØBENHAVNS UNI (DK)
FIRMENICH & CIE (CH)
International Classes:
C12N15/82; C12P5/00; C12P7/02
Domestic Patent References:
WO2005052163A22005-06-09
Foreign References:
US20100297722A12010-11-25
US8058046B22011-11-15
US20110281257A12011-11-17
US20110041218A12011-02-17
US20100297722A12010-11-25
US20030157592A12003-08-21
Other References:
SCHAEFER D G ET AL: "EFFICIENT GENE TARGETING IN THE MOSS PHYSCOMITRELLA PATENS", THE PLANT JOURNAL, BLACKWELL SCIENTIFIC PUBLICATIONS, OXFORD, GB, vol. 11, no. 6, 1 June 1997 (1997-06-01), pages 1195 - 1206, XP000885764, ISSN: 0960-7412, DOI: 10.1046/J.1365-313X.1997.11061195.X
DECKER ET AL: "Moss bioreactors producing improved biopharmaceuticals", CURRENT OPINION IN BIOTECHNOLOGY, LONDON, GB, vol. 18, no. 5, 1 October 2007 (2007-10-01), pages 393 - 398, XP022350909, ISSN: 0958-1669, DOI: 10.1016/J.COPBIO.2007.07.012
SIMONSEN HENRIK TOFT ET AL: "Perspectives on using physcomitrella patens as an alternative production platform for thapsigargin and other terpenoid drug candidates.", PERSPECTIVES IN MEDICINAL CHEMISTRY 2009, vol. 3, 2009, pages 1 - 6, XP002727138, ISSN: 1177-391X
BACH SØREN SPANNER ET AL: "Heterologous stable expression of terpenoid biosynthetic genes using the moss Physcomitrella patens.", METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2014, vol. 1153, 2014, pages 257 - 271, XP008170582, ISSN: 1940-6029
TATIANA ET AL., FEMS MICROBIOL LETT., vol. 174, 1999, pages 247 - 250
LINK ET AL., NATURE REVIEWS, vol. 5, no. 9, 2007, pages 680 - 688
SANGER ET AL., GENE, vol. 97, no. 1, 1991, pages 119 - 123
GHADESSY ET AL., PROC NATL ACAD SCI USA, vol. 98, no. 8, 2001, pages 4552 - 4557
KNIGHT ET AL., MOLECULAR PLANT BIOLOGY, vol. 2, 2002, pages 285 - 301
COVE ET AL.: "Emerging Model Organisms: A Laboratory Manual", vol. 1, 2009, CSHL PRESS
KIRBY J; KEASLING JD: "Biosynthesis of plant isoprenoids: Perspectives for microbial engineering", ANNU REV PLANT BIOL, vol. 60, 2009, pages 335 - 355, XP009143317, DOI: doi:10.1146/annurev.arplant.043008.091955
PICKEL B; DREW DP; MANCZAK T; WEITZEL C; SIMONSEN HT; RO DK: "Identification and characterization of a kunzeaol synthase from Thapsia garganica: implications for the biosynthesis of the pharmaceutical thapsigargin", BIOCHEMICAL JOURNAL, vol. 448, no. 2, 2012, pages 261 - 271
MARTIN DM; FALDT J; BOHLMANN J: "Functional characterization of nine Norway spruce TPS genes and evolution of gymnosperm terpene synthases of the TPS-d subfamily", PLANT PHYSIOL, vol. 135, no. 4, 2004, pages 1908 - 1927, XP002579008, DOI: doi:10.1104/PP.104.042028
HAMBERGER B; OHNISHI T; HAMBERGER B; SÉGUIN A; BOHLMANN J: "Evolution of diterpene metabolism: Sitka spruce CYP720B4 catalyzes multiple oxidations in resin acid biosynthesis of conifer defense against insects", PLANT PHYSIOL, vol. 157, no. 4, 2011, pages 1677 - 1695, XP055119176, DOI: doi:10.1104/pp.111.185843
FISCHER MJ; MEYER S; CLAUDEL P; PERRIN M; GINGLINGER JF; GERTZ C; MASSON JE; WERCK-REINHARDT D; HUGUENEY P; KARST F: "Specificity of Ocimum basilicum geraniol synthase modified by its expression in different heterologous systems", J BIOTECHNOL, vol. 163, no. 1, 2012, pages 24 - 29
RENSING SA; FRITZOWSKY D; LANG D; RESKI R: "Protein encoding genes in an ancient plant: analysis of codon usage, retained genes and splice sites in a moss", PHYSCOMITRELLA PATENS. BMC GENOMICS, vol. 6, 2005, pages 43, XP021002307, DOI: doi:10.1186/1471-2164-6-43
VIETOR R; LOUTELIER-BOURHIS C; FITCHETTE AC; MARGERIE P; GONNEAU M; FAYE L; LEROUGE P: "Protein N-glycosylation is similar in the moss Physcomitrella patens and in higher plants", PLANTA, vol. 218, no. 2, 2003, pages 269 - 275
KOPRIVOVA A; ALTMANN F; GORR G; KOPRIVA S; RESKI R; DECKER EL: "N-glycosylation in the moss Physcomitrella patens is organized similarly to that in higher plants", PLANT BIOL, vol. 5, no. 6, 2003, pages 582 - 591, XP002422344, DOI: doi:10.1055/s-2003-44721
MEGA T: "Plant-type N-glycans containing fucose and xylose in Bryophyta (mosses) and Tracheophyta (ferns", BIOSCI BIOTECHNOL BIOCHEM, vol. 71, no. 12, 2007, pages 2893 - 2904
SIMONSEN HT; DREW DP; LUNDE C: "Perspectives on using Physcomitrella patens as an alternative production platform for thapsigargin and other terpenoid drug candidates", PERSPECT MEDICIN CHEM, vol. 3, 2009, pages 1 - 6, XP002727138
ANTEROLA A; SHANLE E; PERROUD PF; QUATRANO R: "Production of taxa-4(5),11(12)-diene by transgenic Physcomitrella patens", TRANSGEIC RES, vol. 18, no. 4, 2009, pages 655 - 660, XP019689774
HAMBERGER B; BAK S: "Plant P450s as versatile drivers for evolution of speciesspecific chemical diversity", PHIL TRANS R SOC B (IN PRESS), 2013
YONEKURA-SAKAKIBARA K; HANADA K: "An evolutionary view of functional diversity in family 1 glycosyltransferases", PLANT J, vol. 66, no. 1, 2011, pages 182 - 193
CHEN F; THOLL D; BOHLMANN J; PICHERSKY E: "The family of terpene synthases in plants: a mid-size family of genes for specialized metabolism that is highly diversified throughout the kingdom", PLANT J, vol. 66, no. 1, 2011, pages 212 - 229
SAKAMOTO T; MIURA K; ITOH H; TATSUMI T; UEGUCHI-TANAKA M; ISHIYAMA K; KOBAYASHI M; AGRAWAL GK; TAKEDA S; ABE K ET AL.: "An overview of gibberellin metabolism enzyme genes and their related mutants in rice", PLANT PHYSIOL, vol. 134, no. 4, 2004, pages 1642 - 1653, XP003003256, DOI: doi:10.1104/pp.103.033696
ANTEROLA A; SHANLE E: "Genomic insights in moss gibberellin biosynthesis", BRYOLOGIST, vol. 111, no. 2, 2008, pages 218 - 230
JOHRI MM: "Hormonal regulation in green plant lineage families", PHYSIOL MOL BIOL PLANTS, vol. 14, no. 1-2, 2008, pages 23 - 38
VON SCHWARTZENBERG K; SCHULTZE W; KASSNER H: "The moss Physcomitrella patens releases a tetracyclic diterpene", PLANT CELL REP, vol. 22, no. 10, 2004, pages 780 - 786
SCHAEFER DG; ZRYD JP: "Efficient gene targeting in the moss Physcomitrella patens", PLANT J, vol. 11, no. 6, 1997, pages 1195 - 1206, XP000885764, DOI: doi:10.1046/j.1365-313X.1997.11061195.x
HAYASHI K ET AL.: "Endogenous diterpenes derived from ent-daurene, a common gibberellin precursor, regulate protonema differentiation of the moss Physcomitrella patens", PLANT PHYSIOL, vol. 153, no. 3, 2010, pages 1085 - 1097
BACH SS; KING BC; ZHAN X; SIMONSEN HT; HAMBERGER B: "Methods in Molecular Biology", 2013, article "Heterologous stable expression of terpenoid biosynthetic genes using the moss Physcomitrella patens"
RESKI R: "Molecular genetics of Physcomitrella", PLANTA, vol. 208, 1999, pages 301 - 309, XP000946448, DOI: doi:10.1007/s004250050563
RESKI R: "Development, genetics and molecular biology of mosses", BOTANICA ACTA, vol. 111, 1998, pages 1 - 15
ASHTON NW; GRIMSLEY NH; COVE DJ: "Analysis of Gametophytic Development in the Moss, Physcomitrella-Patens, Using Auxin and Cytokinin Resistant Mutants", PLANTA, vol. 144, no. 5, 1979, pages 427 - 435
Attorney, Agent or Firm:
RASMUSSEN, Torben Ravn et al. (Rigensgade 11, København K, DK)
Download PDF:
Claims:
C L A I M S

1. A method for producing a transgenic moss cell comprising introducing into a moss cell capable of producing farnesyl-pyrophosphate (FPP) or geranylgeranyl- pyrophosphate (GGPP) a heterologous nucleic acid molecule encoding a polypeptide, the polypeptide having a patchouloi synthase activity or β-santalene synthase activity to catalyze the conversion of FPP to pathoulol or β-santalene, or a sclareol synthase activity to catalyse the conversion of GGPP to sclareol, wherein the heterologous nucleic acid molecule is at least 70% identical to SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3.

2. A method for preparing pathoulol, β-santalene, or sclareol in a transgenic moss cell comprising;

a) introducing into a moss cell capable of producing farnesyl-pyrophosphate (FPP) or geranylgeranyl-pyrophosphate (GGPP) a heterologous nucleic acid molecule encoding a polypeptide, the polypeptide having a patchouloi synthase activity or β-santalene synthase activity to catalyze the conversion of FPP to pathoulol or β-santalene, or a sclareol syn- thase activity to catalyse the conversion of GGPP to sclareol, wherein the heterologous nucleic acid molecule is at least 70% identical to SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3,

b) culturing the transgenic moss cell to express or overexpress the polypeptide encoded by the heterologous nucleic acid molecule, and

c) isolating pathoulol, β-santalene, and/or sclareol produced in step b).

3 A method according to claim 2, wherein step c) is performed by steam distillation and/or vacuum distillation.

4. A method according to claim 1 , 2 or 3 wherein the moss cell is selected from the group comprising of Takakiopsida, Sphagnopsida, Andreaeopsida, Andreaeobryopsida, Oedipodiopsida, Polytrichopsida, Tetraphidopsidaan and Bryopsida.

5. A method according to claim 4, wherein the moss cell is Physcomitrella patens.

6. A method according to any of the claims 1 , 2, 3, 4, or 5, wherein the heterologous nucleic acid molecule is at least 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96%, 97%, 98%, or 99% identical to SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3.

7. A transgenic moss cell capable of producing FPP or GGPP, the transgenic moss cell comprises a heterologous nucleic acid molecule encoding one or more polypeptides, the polypeptide having a patchouloi synthase activity or β-santalene synthase activity to catalyze the conversion of FPP to pathoulol or β-santalene, or a sclareol synthase activity to catalyse the conversion of GGPP to sclareol, wherein the het- erologous nucleic acid molecule is at least 70% identical to SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3.

8. A transgenic moss cell to claim 7, wherein the transgenic moss cell is selected from the group comprising of Takakiopsida, Sphagnopsida, Andreaeopsida, Andreaeobryopsida, Oedipodiopsida, Polytrichopsida, Tetraphidopsidaan and Bryopsida.

9. A transgenic moss cell according to claim 8, wherein the transgenic moss cell is Physcomitrella patens.

10. A transgenic moss cell according to any of the claims 7, 8, or 9, wherein the heterologous nucleic acid molecule is at least 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96%, 97%, 98%, or 99% identical to SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3.

Description:
Heterologous production of patchoulol, β-santalene, and sclareol in moss cells Field of the invention

The present invention relates to methods for production of patchoulol, β-santalene, and sclareol in transgenic moss (Physcomitrella patens) cells comprising heterologous nucleic acid molecules encoding a synthase capable of using FPP or GGPP as substrate, methods for producing the transgenic moss cells, as well as the transgenic moss cells itself. Background

Terpenoids or terpenes represent a family of natural products found in all organisms (bacteria, fungi, animals, plants) and these compounds are made up of five carbon units called isoprene units, which are classified by the number of units present in their structure. Thus, monoterpenes, sesquiterpenes and diterpenes are terpenes containing 10, 15 and 20 carbon atoms respectively. The common five-carbon precursor to all terpenes is isopentenyl pyrophosphate (IPP). IPP forms the acyclic prenyl pyrophosphate terpene precursors for each class of terpenes, e.g. farnesyl-pyrophosphate (FPP) for the sesquiterpenes, and geranylgeranyl-pyrophosphate (GGPP) for the diterpenes. These precursors serve as substrate for the terpene synthatases or cyclases, which are specific for each subclass of terpene, e.g. monoterpene, sesquiterpene or diterpene synthases. Some terpene synthases produce a single product, but most of them produce multiple products. The synthases are responsible for the extremely large number of terpene skeletons. Finally, in the last stage of terpenoid biosynthesis, the terpene molecules may undergo several steps of secondary enzymatic transformations such as hydroxylations, isomerisations, oxido-reductions or acylations, leading to the tens of thousand of different terpene molecules.

Patchoulol and β-santalene are classified as sesquiterpenes whereas sclareol is classified as a diterpene.

The biosynthesis of terpenes in plants has been extensively studied and heterologous expression, in vivo, and in vitro testing are important tools when characterizing genes involved in terpenes biosynthesis. Until now, most plant genes encoding isoprenoid biosynthetic synthases have been characterized using bacterial, yeast, or insect cell-based expression systems [1 ,2]. However, synthase activity is highly dependent on the assay conditions and choice of heterologous expression hosts. For example, it has been demonstrated that acidification of the yeast growth media led to rearrangement of enzymatic products [2]. Similarly, a diterpene synthase from Norway spruce was shown to be a multi-product synthase when purified and characterized in vitro, but when expressed in yeast and characterized in vivo only a single product accumulated [3, 4]. It was also recently shown that a monoterpene synthase from sweet basil generated different product profiles when characterized using microbial systems compared to in planta characterization [5]. Hence, the biochemical function of a plant synthatase expressed in bacterial or fungal hosts may differ from the endogenous in planta function.

US8058046 relates to sesquiterpene synthases derived from patchouli plants, and methods of their production in suitable host cells such prokaryotes, yeast or higher eukar- yotic cells. US8058046 provides nucleic acid molecules identified in Pogostemon cablin comprising a nucleotide sequence that encodes for at least one sesquiterpene synthase, which may be used to convert e.g. farnesyl-pyrophosphate (FPP) to various sesquiterpenes, including patchoulol. US8058046 exemplifies the expression of sesquiterpene synthases from patchouli plants in bacteria, E. coli.

US20110281257 relates to sesquiterpene synthases derived from Santalum species, and methods of their production in suitable host cells such prokaryotes, yeast or higher eukaryotic cells. US201 10281257 exemplifies expression of sesquiterpene synthases from Santalum species in at least S. cerevisiae. US201 10281257 provides nucleic acid molecules and variants thereof comprising a nucleotide sequence that encodes for at least one sesquiterpene synthase, which may be used to convert e.g. farnesyl-pyrophosphate (FPP) to various sesquiterpenes including β-santalene.

Genes encoding diterpene synthases have been identified and cloned and the corresponding recombinant enzymes characterized. Amongst other, US2011041218 relates to sclareol synthases, a diterpeniod, from Salvia sclarea species, and methods of their production in suitable host cells such as prokaryotes, yeast or higher eukaryotic cells. US201 1041218 exemplifies expression of the diterpene synthases from Salvia sclarea in at least S. cerevisiae. US2011041218 provides nucleic acid molecules and variants thereof comprising a nucleotide sequence that encodes for various diterpene synthases, which may be used to convert e.g. geranylgeranyl-pyrophosphate (GGPP) to prepare sclareol. US201 1041218 describes amongst other two synthases wherein GGPP is first converted to labdenediol diphosphate (LPP) and then converted to sclareol.

US20100297722 relates to the provision of compositions and processes for production of terpenoids from transgenic moss cells. US 20100297722 specifically relates to overexpression of various taxadiene synthases having activities towards geranylgeranyl- pyrophosphate and the derivatives thereof to produce different intermediates and end- products of diterpeniod compounds. In particular, expression or overexpression of specific polypeptides in Physcomitrella patens resulting in the production of terpenoid compounds such as various substituted taxadienes, 10-deacetylbaccatin III, abietadiene, abietic acid, steviol, steviolmonoside; stevioside; rebaudioside A, kaurenoic acid, a cembranoid, momilactone A-B, oryzalexins A-F, oryzalexin S and phytocassanes A-E, are exemplified.

Sesquiterpenes and diterpenes, including patchoulol, β-santalene, and sclareol, accumulates in plants and can be extracted by different means such as steam distillation or solvent extraction that produces the so-called essential oil containing the concentrated terpenes. Such natural plant extracts are important components for the flavor and perfum- ery industry due to their flavour and fragrance properties, and some sesquiterpenes and diterpenes may even possess cosmetic, medicinal and antimicrobial effects. Extracted terpene molecules are often used as such, but in some cases chemical reactions are used to transform the terpenes into even higher valued molecules

Because of the complexity of the terpene structure, production of individual terpene molecules by chemical synthesis is often limited by the cost of the process and may not always be chemically or financially feasible. The price and availability of the plant natural extracts is dependent on the abundance, the oil yield and the geographical origin of the plants. The recent progress in understanding terpene biosynthesis in plants and the use of modern biotechnology techniques opens new opportunities for the production of terpene molecules. Thus, there exist a continuously need to provide improved biological production of terpenes and products derived therefrom, such as patchoulol, sclareol, and β-santalene, at more environmental friendly conditions and/or higher efficiency.

Summary of the invention

The present invention provides means and methods to biosynthetically prepare patchoulol, β-santalene, and sclareol from transgenic moss cells.

Thus, it is an object of the present invention in a first aspect to provide a method for producing a transgenic moss cell comprising introducing into a moss cell capable of producing farnesyl-pyrophosphate (FPP) or geranylgeranyl-pyrophosphate (GGPP) a heterol- ogous nucleic acid molecule encoding a polypeptide, the polypeptide having a patchoulol synthase activity or β-santalene synthase activity to catalyze the conversion of FPP to pathoulol or β-santalene, or a sclareol synthase activity to catalyse the conversion of GGPP to sclareol, wherein the heterologous nucleic acid molecule is at least 70% identical to SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3. In an aspect of the invention the method above comprise a polypeptide, which is at least 70% identical to the polypeptide encoded by SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3.

In another aspect of the invention, a method for preparing pathoulol, β-santalene, or sclareol in a transgenic moss cell comprising;

a) introducing into a moss cell capable of producing farnesyl-pyrophosphate (FPP) or geranylgeranyl-pyrophosphate (GGPP) a heterologous nucleic acid molecule encoding a polypeptide, the polypeptide having a patchoulol synthase activity or β-santalene synthase activity to catalyze the conversion of FPP to pathoulol or β-santalene, or a sclareol synthase activity to catalyse the conversion of GGPP to sclareol, wherein the heterologous nucleic acid molecule is at least 70% identical to SEQ I D NO 1 , SEQ ID NO 2, or SEQ ID NO 3.

b) culturing the transgenic moss cell to express or overexpress the polypeptide encoded by the heterologous nucleic acid molecule, and

c) isolating pathoulol, β-santalene, and/or sclareol produced in step b), is provided. In one embodiment, isolation of pathoulol, β-santalene, and/or sclareol in the step c), may be performed by steam distillation and/or vacuum distillation.

In an asepct of the invention the method for producing pathoulol, β-santalene, or sclareol comprises a polypeptide which is at least 70% identical to the polypeptide encoded by SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3.

In yet another aspect of the invention, a transgenic moss cell capable of producing FPP or GGPP, the transgenic moss cell comprises a heterologous nucleic acid molecule encoding one or more polypeptides, the polypeptide having a patchoulol synthase activity or β-santalene synthase activity to catalyze the conversion of FPP to pathoulol or β- santalene, or a sclareol synthase activity to catalyse the conversion of GGPP to sclareol, wherein the heterologous nucleic acid molecule is at least 70% identical to SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3, or the polypeptide is at least 70% identical to the polypeptide encoded by SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3, is provided.

Further, in one embodiment, the moss cell or transgenic moss cell is selected from the group comprising of Takakiopsida, Sphagnopsida, Andreaeopsida, Andreaeobryopsida, Oedipodiopsida, Polytrichopsida, Tetraphidopsidaan and Bryopsida. Preferably, the moss cell or transgenic moss cell is Physcomitrella patens.

In yet another embodiment, the heterologous nucleic acid molecule is at least 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96%, 97%, 98%, or 99% identical to SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3, or the polypeptide is at least 70%, 75%, 80%, 85%, 90%, 92%, 94%, 96%, 97%, 98%, or 99% identical to the polypeptide encoded by SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3. It has been found be the present invention that moss cells can be transformed to produce terpenes such as patchoulol, β-santalene or sclareol. This is beneficial as these terpenes can be produced more environmental friendly and/or more efficiently due to e.g. higher expression yields and/or more convinient purification steps.

Figures

Figure 1. Volatile metabolite profile of the transgenic moss lines producing patchoulol obtained by HS-SPME GC-MS analysis (X-axis is in min.). 1 denotes β- patchoulene; 2 denotes β-elemene; 3 denotes (£)^-caryophyllene; 4 denotes a-guaiene; 5 denotes seychellene; 6 denotes a-patchoulene; 7 denotes γ-patchoulene; 8 denotes guai-4,1 1-diene; 9 denotes an unidentified sesquiterpene; 10 denotes a-bulnesene; 11 denotes an unidentified sesquiterpene; 12 denotes (-)-patchoulol; A denotes an unidentified diterpene; B denotes an unidentified diterpene; C denotes ent-15-kaurene; and D denotes ent-16-kaurene. The WT moss line produces primarily the following products; A, B, C and D, whereas the transgenic line primarily produces the following products; 1 , 2, 3, 4,

5, 6, 7, 8, 9, 10, 1 1 and 12 [(-)-patchoulol].

Figure 2. Volatile metabolite profile of the transgenic moss lines using dodecane overlays (X-axis is in min.). 3 denotes (E)^-caryophyllene; 4 denotes a-guaiene; 5 denotes seychellene; 6 denotes a-patchoulene; 7 denotes γ-patchoulene; C denotes ent-15- kaurene; and D denotes ent-16-kaurene. The PpCPS/KS knock-out patchoulol producing moss line produces the lowest amount of the different measured terpen products, whereas the patchoulol producing transgenic line primarily produces the following products; 3, 4, 5,

6, 7, 12 [(-)-patchoulol], C and D.

Figure 3. The yields of patchoulol ( g/mg DW).

Figure 4. Volatile metabolites profile of the transgenic line producing β-santalene by

HS-SPME analysis (X-axis is in min.). 1 denotes a-santalene; 2 denotes (E)-a- bergamotene; 3 denotes epi^-santalene; 4 denotes β-santalene; C denotes ent-15- kaurene; and D denotes ent-16-kaurene. The transgenic line produces amongst other the following products; 1 , 2, 3, and 4 [β-santalene], whereas the WT moss line does not pro- duces any significant amounts of these when compared to the transgenic line.

Figure 5. Volatile metabolite profile of the transgenic line producing β-santalene with plastidic STS enzyme (X-axis is in min.). 1 denotes a-santalene; 2 denotes (E)-a- bergamotene; 3 denotes epi^-santalene; 4 denotes β-santalene; C denotes ent-15- kaurene; and D denotes ent-16-kaurene. The transgenic line produces amongst other the following products; 1 , 2, 3, and 4 [β-santalene], whereas the WT moss line does not pro- duces any significant amounts of these when compared to the transgenic line.

Figure 6. Formula I represents patchoulol, formula II represents β-sanatalene and formula III represents sclareol.

Figure 7. Volatile metabolite profile of the transgenic line producing sclareol (X-axis is in min.). A denotes ent-15-kaurene, B denotes ent-16-kaurene, C+D are unknown compounds, E denotes sclareol, F denotes 16-OH-ent-kaurene. WT moss line produces amongst other product A, B and F. PpCPS/KS-KO produces amongst other product C and D. The transgenic sclareol producing line produces amongst other product E [Sclareol] and F.

Figure 8. Time course measurement of sclareol production im KO-pUNI33-TPSsa3-

"A-TPS1 132, leading to 280 μg/g dry weight of Sclareol.

Figure 9. Map of pJET1.2. See also SEQ ID NO 6. pJET1.2 is an artificial DNA sequence from Thermo Scientific designed for biological research.

Figure 10. Map of pLIFE33: See also SEQ ID NO 7. pLIFE33 is an artificial DNA se- quence designed for biological research. The sequence was created on the basis of pCAMBIA 1300U, which is designed by CAMBIA, a non-profit scientific organization.

Figure 11. Map of pUNI33: See also SEQ ID NO 8. pUNI33 is an artificial DNA sequence designed for biological research. The sequence was created on the basis of pLIFE33.

Figure 12. Map of pUNI33 PTS: See also SEQ ID NO 9. pUNI33-PTS is an artificial

DNA sequence designed for patchoulol biosynthesis in moss. The sequence was created on the basis of pUNI33.

Figure 13. Map of pUNI33 STS: See also SEQ ID NO 10. pUNI33-STS is an artificial DNA sequence designed for β-santalene biosynthesis in moss. The sequence was created on the basis of pUNI33.

Figure 14. Map of pUNI33-TPSsa3-2A-TPS1 132. See also SEQ ID NO 11. pUNI33- TPSsa3-2A-TPS1 132 is an artificial DNA sequence designed for sclareol biosynthesis in moss . The sequence was created on the basis of pUNI33.

Figure 15. Map of pCL755: See also SEQ ID NO 14. pCL755 is an artificial DNA se- quence designed for disrupting the moss endogenous enzyme PpCPS/KS.

Figure 16. Map of pUNI33 tpPTS: See also SEQ ID NO 15. pUNI33-tpPTS is an artificial DNA sequence designed for sclareol biosynthesis in moss. The sequence was created on the basis of pUNI33.

SEQ ID NO 1 is a gene having pathcoulol synthase activity (PTS). The gene is de- rived from the plant Pogostemon cablin and is equivalent to GenBank: AAS86323.1. SEQ ID NO 2 is a gene having β-santalene synthese activity (STS). The gene is derived from the plant Santalum album and is equivalent to GenBank: ADO87000.1.

SEQ ID NO 3 comprises two genes having sclareol synthese activity (TPSsa3 / TPS1 132). The genes are derived from the plant Salvia sclarea and is equivalent to GenBank: AET21246.1 / AET21247.1.

SEQ ID NO 6: DNA sequence of pj ET1.2. pJET1.2 is a cloning vector from Thermo Scientific

SEQ ID NO 7: DNA sequence of pLIFE33. pLIFE33 is a binary vector derived from the pCAMBIA1300u vector containing CaMV 35S promoter followed by empty standard PAC (USER: Uracil-Specific Excision Reagent) cassette

SEQ ID NO 8: DNA sequence of pUNI33. pUNI33 is an artificial DNA sequence designed for biological research and was created on the basis of pLIFE33.

SEQ ID NO 9: DNA sequence of pUNI33-PTS. pUNI33-PTS is an artificial DNA sequence designed for biological research and was created on the basis of pLIFE33.

SEQ ID NO 10: DNA sequence of pUNI33-STS. pUNI33-STS is an artificial DNA sequence designed for biological research and was created on the basis of pLIFE33.

SEQ ID NO 1 1 : DNA sequence of pUNI33-TPSsa3-2A-TPS1132. pUNI33-TPSsa3- 2A-TPS1 132 is an artificial DNA sequence designed for biological research and was created on the basis of pLIFE33.

SEQ ID NO 14: DNA sequence of pCL755. pCL755 is an artificial DNA sequence designed for disrupting the moss endogenous enzyme PpCPS/KS.

SEQ ID NO 15: DNA sequence of pUNI33-tpPTS. pUNI33-tpPTS is an artificial DNA sequence designed for biological research and was created on the basis of pLIFE33.

SEQ ID NO 16: DNA sequence of pUNI33-tpSTS. pUNI33-tpSTS is an artificial DNA sequence designed for biological research and was created on the basis of pLIFE33.

Detailed description of the invention

In the following, the present invention is described in more detail. All individual features and details can be individually applied to each embodiment described.

Mosses, as such, are grouped as a single division within the plant kingdom,

Bryophyta. Further, Bryophyta can be subdivided into eight classes; Takakiopsida, Sphagnopsida, Andreaeopsida, Andreaeobryopsida, Oedipodiopsida, Polytrichopsida, Tetraphidopsidaan and Bryopsida. Hence, by the term moss cells are meant moss cells belonging to Bryophyta and its subclasses. In one embodiment of the present invention, moss cells belonging to the class of Bryopsida are preferred, in particularly Physcomitrella patens.

A moss cell is according to the present invention meant to be "capable of producing FPP or GGPP" when it produces FPP or GGPP.

By the term, "polypeptide", is meant, unless specifically limited, an enzyme capable of catalyzing the synthesis of a terpene, starting from FPP or GPPP or from any intermediate products thereof provided by a polypeptide.

In the present context, a synthase is defined as enzyme (polypeptide) that catalyzes the synthesis of a specific or unspecific reaction whether or not the synthase uses nucleoside triphosphates during the reacton. This definition is in correspondance to the dictate of the Joint Commission on Biochemical Nomenclature (JCBN).

By the term, "terpene synthase", is meant, unless specifically limited, a polypeptide capable of catalyzing the synthesis of a sesquiterpene or a diterpene in the form of any of its stereoisomers or a mixture thereof, starting from FPP or GGPP, respectively.

By the term, "sesquiterpene synthase", is meant, unless specifically limited, a poly- peptide capable of catalyzing the synthesis of a sesquiterpene in the form of any of its stereoisomers or a mixture thereof, starting from FPP.

By the term, "patchoulol synthase", is meant, unless specifically limited, a polypeptide capable of catalyzing the synthesis of patchoulol in the form of any of its stereoisomers or a mixture thereof, starting from FPP. Patchoulol may be the only product or may be part of a mixture of sesquiterpenes. Patchoulol is defined by the way of its structure, as represented by formula I in Figure 6.

By the term, "β-santalene synthase" is meant, unless specifically limited, a polypeptide capable of catalyzing the synthesis of β-santalene in the form of any of its stereoisomers or a mixture thereof starting from FPP. β-Santalene may be the only product or may be part of a mixture of sesquiterpenes. β-Santalene is defined by the way of its structure, as represented by formula II in Figure 6.

By the term, "diterpene synthase", is meant, unless specifically limited, a polypeptide capable of catalyzing the synthesis of a diterpene in the form of any of its stereoisomers or a mixture thereof, starting from the acyclic terpene precursor GGPP or from a diterpene diphosphate ester such as LPP.

By the term, "sclareol synthase", is meant, unless specifically limited, one or more polypeptides capable of catalyzing the synthesis of sclareol in the form of any of its stereoisomers or a mixture thereof starting from labdenediol diphosphate (LPP) or GGPP. Sclareol may be the only product or may be part of a mixture of diterpenes. Sclareol is defined by the way of its structure, as represented by formula III in Figure 6. For the purpose of the present invention, the term "nucleic acid molecules" and nucleic acid sequence" is meant to be the same, unless otherwise provided.

The percentage of identity between two nucleic acid sequences is a function of the number of nucleic acid residues that are identical in the two sequences when an alignment of these two sequences has been generated. Identical residues are defined as residues that are the same in the two sequences in a given position of the alignment. The percentage of sequence identity, as used herein, is calculated from the optimal alignment by taking the number of residues identical between two sequences dividing it by the total number of residues in the shortest sequence and multiplying by 100. The optimal alignment is the alignment in which the percentage of identity is the highest possible. Gaps may be introduced into one or both sequences in one or more positions of the alignment to obtain the optimal alignment. These gaps are then taken into account as non-identical residues for the calculation of the percentage of sequence identity.

Alignment for the purpose of determining the percentage of nucleic acid sequence identity can be achieved in various ways using computer programs and for instance publicly available computer programs available on the world wide web. Preferably, the BLAST program (Tatiana et al, FEMS Microbiol Lett., 1999, 174:247- 250, 1999) set to the default parameters, available from the National Center for Biotechnology Information (NCBI) at http://www.ncbi.nlm.nih.gov/BLAST/bl2seq/wblast2.cgi, can be used to obtain an optimal alignment of peptidic or nucleotidic sequences and to calculate the percentage of sequence identity.

According to the present invention, a moss cell can be genetically modified (transgenic moss cell) to provide for the preparation or accumulation of patchoulol, β-santalene, or sclareol. The transgenic moss cell may thus been transformed with a heterologous nu- cleic acid molecule encoding a sesquiterpene or diterpene synthase capable of performing the conversions required or involved in the metabolism of patchoulol, β-santalene, or sclareol. The transgenic moss cell may be transformed to comprise one or more heterologous nucleic acids molecules, SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3, encoding the synthase polypeptides, which may have a substantial effect on the production of the de- sired end compounds; patchoulol, β-santalene, and/or sclareol, or in the preparation of the relevant precursors in the transgenic moss cell.

A heterologous nucleic acid molecule encoding a terpene synthase or a combination of several hetereologous nucleic acid molecules encoding terpene synthases can be transformed into a moss cell, whereby the hetereologous nucleic acid molecules(s) can be modified either in their activity or number in the transgenic moss cell. A hetereologous nucleic acid molecule encoding a sesquiterpene or diterpene synthase can be isolated from any suitable organism, e.g. prokaryotes, plant cells, or eukaryotes, which comprises an endogenous nucleic acid sequence encoding a sesquiterpene or diterpene synthase.

A moss cell can be transformed with a heterologous nucleic acid molecule having at least about 70% sequence identity to a heterologous nucleotide sequence, SEQ ID NO 1 , SEQ ID NO 2, or SEQ ID NO 3, which encodes a polypeptide having a specified enzymatic activity, or a complementary sequence to any of these sequences.

According to the present invention, a moss cell can be engineered by introducing a heterologous nucleic acid molecule encoding the sequence of SEQ ID NO 1 to express or overexpress a patchoulol synthase for the preparation of patchoulol starting from FPP.

According to the present invention, a moss cell can be engineered by introducing a nucleic acid molecule encoding the sequence of SEQ ID NO 2 to express or overexpress a β-santalene synthase for the preparation of β-santalene starting from FPP.

According to the present invention, a moss cell can be engineered by introducing a nucleic acid molecule encoding the sequence of SEQ ID NO 3 to express or overexpress a diterpene synthase(s) for the preparation of sclareol and/or intermediates thereof starting from GGPP.

Factors such as pH, cofactors, codon usage, and posttranslational modifications contribute to expression and functionality of the peptides.

Physcomitrella contains all the cellular compartments relevant for terpenoid biosynthesis found in higher plants, such as the endoplasmic reticulum (ER) and plastids. In addition, codon usage is conserved between Physcomitrella and higher plants such as Arabidopsis thaliana, as are several posttranslational modifications such as N- glycosylation [6-9]. Thus, Physcomitrella has strong potential to heterologously express functional enzymes with the same functionality as the endogenous higher plant [10]. This was demonstrated by successful expression of a diterpene synthase in Physcomitrella using the native plastid targeting signal and codons from Taxus brevifolia [11].

According to the present invention, a transgenic moss cell can be transformed so as to reduce an endogenous terpenoid compound. Reduction of an endogenous terpenoid compound may increase availability for engineered pathways associated with production of patchoulol, β-santalene, or sclareol compounds. Reduction of an endogenous terpenoid compound can increase production or accumulation of a target terpenoid compound in a moss cell. Thus, a transgenic moss cell may be transformed by reducing or eliminating one or more polypeptides in order to reduce the amounts of terpenoid compounds the transgenic moss cell can produce. Non-limiting examples for reduction or removal of terpene synthases would be mevalonate diphosphate decarboxylase, Mevalonate kinase, 4-hydroxyphenylpyruvate dioxygenase; geranylgeranyl pyrophosphate synthase; ent- Kaurene synthetase; farnesyl pyrophosphate synthase; etc.

In a preferred embodiment of the present invention, the transgenic moss cell belongs to the genus Physcomitrella. An example of a Physcomitrella moss that can be transformed to accumulate a terpenoid includes, but is not limited to, Physcomitrella patens.

Physcomitrella represents an ancient lineage of land plants, and its metabolic and chemical diversity is low compared to higher plants. This is illustrated by the number of cytochromes P450 (P450s) and UDP glycosyltransferases (UGTs) found in the genome. The genomes of Arabidopsis thaliana and Oryza sativa contain 246 and 343 P450s respectively, while the genome of Physcomitrella only contains 71 P450s [12]. Similarly the genome of Physcomitrella contains a low number of UGTs compared to other land plants [13]. The low number of P450s and UGTs found in Physcomitrella and the correspondingly lower chemical diversity reduces the risk of un-specific modifications by endogenous enzymes, through pathways used in higher plants for detoxification of xenobiotics. In addition to this Physcomitrella has a simple terpenoid profile and the genome of Physcomitrella only contains a single functional terpene synthase (TPS) [14]. This gene encodes a bifunctional copalyl synthase/kaurene synthase (PpCPS/KS), responsible for producing of enf-kaurene, a common precursor for the phytohormone gibberellic acid (GA) which is essential for growth and development in vascular plants [15]. Despite the importance of GAs in vascular plants, they have not been detected in Physcomitrella and enf-kaurene as well as enf-kaurenoic acid do not have a clear role in Physcomitrella [16, 17]. Although the role of kaurene-type molecules in Physcomitrella is unclear, they are produced in impressively large quantities, indicating a high native capacity to produce terpenoids [18]. Gene editing by efficient homologous recombination in Physcomitrella provides a very powerful tool for metabolic engineering [19]. Targeted knockouts of PpCPS/KS result in viable kaurene-free moss lines [20]. Knocking out moss terpenoid synthesis genes can be accomplished by homologous recombination, see e.g.

[21]. In addition to having a terpenoid free-background, FPP or GGPP (the universal precursors for sesquiterpenes and diterpenes biosynthesis) could be redirected into heterologously expressed terpenoid pathways.

A heterologous nucleic acid molecule encoding a terpene synthase can be operably linked to a promoter for transformation of the moss cell. According to the present invention, the promoter can be any promoter functional in a moss cell.

There are several methods known in the art for the creation of transgenic plants and also in moss cells. These are well described in the art [21-23].

Protoplast transformation is the most commonly used method for Physcomitrella transformation, and is well described in the literature [21]. The method requires careful handling and regeneration of fragile protoplasts and must be done under sterile conditions. This method can be very efficient and yield a large number of stable transformants, however, several attempts may be needed to successfully recover stable Physcomitrella lines. Another robust alternative to e.g. PEG mediated transformation of protoplasts is biolistic transformation, i.e. direct gene transfer by particle bombardment may be utilized. In another embodiment, agrobacterium-mediated transformation may be utilized.

The DNA vectors used for transmation may additional contain various transit peptides targeting the peptides for specific cellular compartments, such as a transit peptides from Arabidopsis thaliana, RuBisCO.

In one embodiment, the transformation may be introduced into the cytosol, or into the chloroplasts, of the transgenic moss cell.

Direct gene transfer by particle bombardment provides an example for transforming plant tissue. In this technique a particle, or microprojectile, coated with DNA is shot through the physical barriers of the cell. Particle bombardment can be used to introduce DNA into any target tissue that is penetrable by DNA coated particles, but for stable transformation, it is imperative that regenerable cells be used. Typically, the particles are made of gold or tungsten. The particles are coated with DNA using either CaCI 2 or ethanol precipitation methods which are commonly known in the art. DNA coated particles are shot out of a particle gun. A suitable particle gun can be purchased from Bio-Rad Laboratories (Hercules, Calif.). Particle penetration is controlled by varying parameters such as the intensity of the explosive burst, the size of the particles, or the distance particles must travel to reach the target tissue. The DNA used for coating the particles may comprise an expression cassette suitable for driving the expression of the gene of interest that will comprise a promoter operably linked to the gene of interest. For example, moss transformation protocols are described in US App Pub No. 2003/0157592.

An important tool for transforming moss cells suitable to carry out the method of the invention in vivo is an expression vector comprising a nucleic acid molecule according to any embodiment of the invention. The skilled person is capable of selecting a suitable vector according to the expression system. In one embodiment, the expression vector includes the nucleic acid molecule of the invention operably linked to at least one regulatory sequence, which controls transcription, translation, initiation and termination, such as a transcriptional promoter, operator or enhancer, or an mRNA ribosomal binding site and, optionally, including at least one selection marker. Nucleotide sequences are "operably linked" when the regulatory sequence functionally relates to the nucleic acid molecule of the invention.

The expression vector may be used in the methods for preparing a transgenic moss cell to comprise the heterologous nucleic acid molecules of the present invention, in the methods for producing polypeptides having a patchoulol synthase activity, β-santalene synthase activity, or sclareol synthase activity, or in the methods for preparing patchoulol, β-santalene, or sclareol, as disclosed further below.

Generation of variant nucleotides having the above required percent identities is common general knowledge. For example, directed evolution and rapid isolation of mutants can be according to methods described in references including, but not limited to, Link et al. (2007) Nature Reviews 5(9), 680-688; Sanger et al. (1991) Gene 97(1), 119- 123; Ghadessy et al. (2001) Proc Natl Acad Sci USA 98(8) 4552-4557. Thus, a person skilled in the art could generate a large number of nucleotide and/or polypeptide variants having, for example, at least 70%-99% identity to the reference sequence described herein and screen such for desired phenotypes according to methods routine in the art.

In one embodiment of the present invention, methods for metabolite production using simple and inexpensive photo-bioreactors as well as the basic tools for performing gene editing by homologous recombination are provided. Processes for culturing moss cells are known in the art (see e.g., Decker and Reski 2008; Knight et al. 2002 Molecular Plant Biology 2, 285-301 ; Cove et al. 2009 Emerging Model Organisms: A Laboratory Manual, Vol. 1. CSHL Press, Cold Spring Harbor, N.Y., USA, 2009). Thus, the culturing of a transgenic moss cell described herein can be carried out in accordance with such processes.

Patchoulol, β-santalene, or sclareol compounds prepared in a transgenic moss cell may be accumulated, converted to another terpenoid compound, or both. Generally, patchoulol, β-santalene, or sclareol compounds will have higher levels of accumulation if no additional terpenoid synthases with specificity for a terpenoid compound has been engineered into the moss cell.

In one embodiment, the essential oils produced by the transgenic moss cell may be isolated and/or purified further. The essential oils produced may decompose at high temperatures, and normal distillation is therefore not necessarily optimal. The essential oils may thus be purified by distillation, often by using steam. Other processes include expression or solvent extraction.

If the essential oils produced by the transgenic moss is intracellular, the transgenic moss cell may be disrupted by applying steam scalding, drying, high stress shear, or other suitable means.

In one embodiment steam distillation is preferred. By adding water or steam, the boiling points of the compounds may be depressed, allowing them to evaporate at lower temperatures, preferably below the temperatures at which the deterioration of the material becomes appreciable or neglible.

In one embodiment vaccume distillation is preferred. Alternatively, if the essential oils are very sensitive to heat, steam distillation can be combined with vacuum distillation.

After distillation the vapors are condensed as usual, usually yielding a two-phase system of water and the essential oils, allowing for decantation.

All methods described herein can be performed in any suitable order unless otherwise indicated or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language provided with respect to certain embodiments herein is intended merely to better illustrate the invention and does not pose a limitation on the scope of the invention otherwise claimed.

All publications, patents, patent applications, and other references cited in this application are incorporated herein by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application or other reference was specifically and individually indicated to be incorporated by reference in its entirety for all purposes. Citation of a reference herein shall not be construed as an admission that such is prior art to the present invention.

Examples

The following examples are intended to illustrate the various embodiments of the present invention. Thus, each individual example should not be construed as limiting the scope of the present invention. The examples provide techniques for establishing Physcomitrella as a heterologous moss cell for terpenoid biosynthesis. This includes two protocols for transformation and recovery of transgenic Physcomitrella lines as well as methods for metabolic profiling and characterization.

Plant mate rial, growth conditions and transformation

1. Wild type P. patens (Gransden ecotype) was obtained from the International Moss Stock Center at the University of Freiburg (http://www.moss-stock-center.org/) and a PpCPS/KS KO (bifunctional Copalyl Diphosphate/Kaurene Synthase KnockOut) moss line was created. 2. Growth conditions and transformation are described in detail [21]. Growth media for Physcomitrella patens

1. Phy B media, modified from minimal media [24]: For one liter mix 800 mg Ca(N0 3 ) 2 , 250 mg MgS0 4 « 7H 2 0, 12.5 mg FeSO 4 « 7H 2 0, 10 mL KH 2 P0 4 buffer (25 g

KH 2 P0 4 per liter and adjusted to pH 6.5 with 4M KOH) and 0.25 mL trace element solution (1 10 mg CuS0 4 « 5H 2 0, 110 mg ZnS0 4 « 7H 2 0, 1228 mg H 3 B0 3 , 778 mg MnCI 2 « 4H 2 0, 110 mg CoCI 2 « 6H 2 0, 53 mg Kl, 50 mg Na 2 Mo0 4 « 2 H 2 0 per liter). The minimal medium was supplemented with 0.5 g ammonium tartrate per liter. The medium can be solidified with 0.7 % (w/v) Agar A, and was sterilized by autoclaving at 121 °C.

2. BCD media: For one liter mix 12.5 mg FeS0 4 « 7H 2 0, 10 mL solution B (25 g/L MgS0 4 « 7H 2 0), 10 mL solution C (25 g/L KH 2 P0 4 adjusted to pH 6.5 with 4 M KOH), 10 ml solution D (101 g/L KN0 3 ), 1 mL trace element solution (614 mg H 3 B0 3 , 389 mg MnCI 2 « 4H 2 0, 55 mg AI 2 (S0 4 ) 3 « K 2 S0 4 « 24H 2 0, 55 mg CoCI 2 « 6H 2 0, 55 mg CuS0 4 « 5H 2 0, 55 mg ZnS0 4 « 7H 2 0, 28 mg KBr, 28 mg Kl, 28 mg LiCI, 28 mg and SnCI 2 « 2H 2 0 per liter). The medium was solidified with 0.7 % (w/v) Agar A and was sterilized by autoclaving at 121 °C. After autoclaving, 10 ml sterile 1 M CaCI 2 solution was added (10 mM final concentration).

Example 1

DNA expression vectors construction for PEG-mediated transformation

DNA expression vector containing Patchoulol synthase (PTS), SEQ ID NO 1 , or β- santalene synthase (STS), SEQ ID NO 2, genes for random integration into the Physcomitrella genome was constructed as the following procedures:

1. pLIFE33 was double-digested using Psil and Pmel to generate the frag- ments containing the hygromycin-resistant cassette and the CaMV 35S promoter-driven operon containing the USER cassette.

2. The digested fragment was ligated into pJET1.2 in a blunted-end way and the resultant construct was referred to as pUNI33.

3. The PTS and STS genes were amplified by PCR using the primer pairs with USER-compatible overhangs, respectively.

4. The PTS and STS genes were integrated into the USER cassette sites of pUNI33 by USER cloning and referred to as pUNI33 PTS & pUNI33 STS, respectively.

The DNA expression vectors which targeted the PTS or STS enzyme into the plas- tids were constructed by adding the transit peptide of Arabidopsis thaliana RuBisCO en- zyme at the 5' end of the PTS or STS gene by USER fusion. They were referred to as pUNI33 tpPTS & pUNI33 tpSTS.

The expression vector containing the two sclareol synthase (TPSsa3 and TPS1 132, SEQ ID NO 3) genes was constructed as the following procedures:

1. pLIFE33 was double-digested by using Psil and Pmel to generate a frag- ment containing the hptll expression cassette, and the CaMV 35S promoter-driven expression cassette for gene-of-interest.

2. The digested fragment was ligated into pJET1.2 in a blunted-end way and the resultant construct was referred to as pUNI33.

3. The TPSsa3 and TPS1 132 gene were amplified by PCR using the primer pairs with USER-compatible overhangs to fit pUNI33 cloning.

4. The two sclareol genes, which were linked by 2A (Ma, Chonglie and Mitra, Amitava 2002) through fusion PCR, were integrated to the USER cassette of pUNI33. The final construct was named pUNI33-TPSsa3-2A-TPS1 132.

pCL755, which was used to knockout PpCPS/KS gene, was constructed as follow- ings:

1. Part of PpCPS/KS gene was amplified by nested PCR (the sequence shown in the supplementary materials);

2. The PCR product was cloned into pDONR201 and digested with Xhol;

3. The nptll cassette was obtained from pMBL6 by Xhol digestion and ligated into the backbone vector from step 3;

4. The resulting vector was referred to as pCL755.

Example 2

PEG-mediated transformation of Physcomitrella

Preparation of DNAs for transformation

The DNA expression vectors were subjected to linearization before PEG-mediated transformation. pUNI33 PTS was taken here as an example and all the other vectors were subjected to the same treatment.

1. pUNI33 PTS was prepared from E. coli cultures using standard miniprep kits.

2. 200 ul (-30 μg) of pUNI33 was digested using the restriction enzyme Notl- HF overnight. The restriction enzyme was inactivated by heating at 60°C for 20 minutes.

3. The next day, 1-2 μΙ of digestion mix was loaded on the agarose gel to check if the digestion was completed. Linearized pUNI33 PTS was precipitated by adding 0.7 volumes of room temperature isopropyl alcohol, mixing and centrifuging at 15k rcf 4°C for 30 minutes. The supernatant was carefully decanted, and the pellet was washed with 70 % ethanol. DNA was centrifuged again at 15k rcf for 15 minutes and the supernatant was carefully decanted. The pellet was air-dried and DNA was re-dissolved 30 μΙ_ sterile H 2 0. Final DNA concentration should be between 600-1000 ng/μ-..

Preparation and transformation of protoplasts

I . On the day of transformation, new PEG solution was prepared, and allowed to stand for 2 hours before use. The solution was subsequently sterilized by filtration by passing it through a 0.22 μηι syringe filter.

2. 5-7 day old moss protonema tissue was harvested by scraping it off cellophane-overlaid agar plates and placed in a sterile 50 ml_ plastic tube. 1 ml_ of a 0.5% Driselase solution was prepared for every 40 mg tissue. The Driselase powder was dissolved in 8.5% D-mannitol, and sterilized using a syringe filter and Physcomitrella tissue was added subsequently. The mixture was incubated for 30-60 minutes with occasional inversion of tube until the tissue had been thoroughly digested.

3. The Driselase-treated tissue was poured through a sterile 100 μηι stainless steel mesh screen, and the protoplasts were recovered in a sterile beaker. Undigested tissue and cellular debris do not pass through the mesh.

4. The protoplasts were centrifuged at 200 rcf for 5 minutes, with gentle break- ing.

5. The supernatant was decanted using a serological pipette.

6. The pellet was re-suspended in protoplast wash solution using the same volume as driselase in step 2.

7. The steps 4 and 5 were repeated.

8. The protoplasts were re-suspended in half the original volume of 8.5% D- mannitol and the density was estimated using a hemocytometer.

9. The solution was centrifuged at 200 rcf for 5 minutes, and the supernatant removed. The pellet was re-suspended in sterile MMM solution yielding a protoplast concentration of 1.5-2 x 10 6 protoplasts/mL.

10. 30 μΙ (approximately 20-30 μg) of linearized DNA was added to the bottom of a 15 mL conical tube. 300 μΙ_ of protoplast suspension and 300 μΙ_ of sterile PEG solution were added and mixed with the DNA by flicking the tube.

I I . The mixture was incubated at 45°C in a water bath for 5 minutes followed by 5 minutes at room temperature.

12. The protoplast suspension was diluted 5 times with 300 μΙ_ of 8.5% D- mannitol, followed by an additional 5 times dilution with 1 ml_ of 8.5% D-mannitol.

13. The transformed protoplasts were centrifuged at 200 rcf with gentle braking for 5 minutes, and the supernatant was removed.

14. The protoplasts was resuspended in 500 μΙ_ 8.5% D-mannitol, and 2.5 ml_ of molten PRMT was added.

15. 1 ml of the protoplast suspension was added to a PRMB Petri dish overlaid with sterile cellophane. At least 3 plates were maded from each transformation event.

16. The plates were sealed with 3M tape, and placed in a growth chamber under standard conditions.

17. The protoplasts were allowed to regenerate their cell walls for 5-7 days, and then the cellophane and regenerating plants were to the selective media and selection of positive transformants as proceeded as described in section 2.7.

Example 3

Selection procedure in order to obtain stable moss lines.

1. Cellophane discs with transformed moss were transferred onto solid PhyB media with 30 μg/mL hygromycin and incubated for two weeks under standard conditions.

2. The cellophane discs were transferred with recovered transformants to solid PhyB media and incubated for another 2 weeks for relaxation. Unstable or transient transformants lose the plasmid and the ability to survive on selective media in this period.

3. Step 1 and 2 was repeated. Moss lines obtained after two rounds of selection were considered to be stably transformed moss lines.

Example 4

Metabolite profiling of stable moss lines using GC-MS

Analysis of volatile sesquiterpenoid products

20 ml_ sterile GC vials were used to analyze the volatile metabolites using the HS- SPME (Headspace-Solid Phase Micro Extraction) technique.

1 . 3 mL solid Phy B media was added into each 20 ml_ GC vial and inoculated with moss.

2. The GC vials were incubated for 2-3 weeks in the growth chamber as described.

3. The volatiles in the headspace were sampled on a SPME fiber for 30 minutes and then analyzed by GC-MS.

HS-SPME conditions A 23 Gauge, 50/30 μηι, DVB/CAR/PDMS SPME fiber (Sigma 57298-U) was used to sample the volatiles. The fiber was penetrated into the 20 mL GC vial and exposed in the headspace for 30 min at the room temperature, followed by 1 min desorption in the injection. The GC-MS is described below.

Quantification of volatile sesquiterpenoid using dodecane overlaved liquid cultures

1 . A 100 mL Erlenmeyer flask containing 20 ml BCD media was inoculated with 2 mL of one week old plate blended for 30 seconds in 10 mL sterile water. The flasks were sealed with a sterile flask sponge cap. After 4 days 1.5 mL (7.5 % v/v) dodecane was added under sterile conditions, and cultivation was continued for 2 weeks.

2. 10 ml of culture containing dodecane was centrifuged at 4000 rpm for 5 min in a glass tube.

3. 100 of dodecane was diluted in 900 hexane and analyzed by GC-MS.

4. The concentrations of the metabolite of interest in each moss line were calculated based on the external authentic standard and the dry weight of the moss. Each moss line was tested in triplicate.

GC-MS analysis conditions

The GC-MS analysis was performed with the following conditions: Injection volume was 1 μΙ using a splitless injection mode, and the injector temperature was 250°C. The analysis was performed in pressure hold mode with He as carrier gas at 160.0 kPa. The coloumn was a SLB®-5ms capillary GC column (Sigma 28466-U, 14 m [L] χ 0.10 mm [D] x 0.10 μηι [thickness]). The GC temperature program was: 0-3 min 45°C, 10 °C/min to 300°C and hold 5 min, the interface temperature was 150°C, and the MS-ion source temperature 260°C, the voltage was 70 eV, relavtive to tuning. The MS analysis was per- formed with a scan range from m/z 50 to m/z 350, a solvent cut time at 5 min, and with a threshold of 100.

Quantification of sclareol

1. 300 mg fresh weight of moss was loaded into a Crimp neck vial N1 1 (1 1.6 x 32 mm, usage volume 1.5 ml).

2. 1 ml ethyl acetate was added, and extraction was performed for 2 hours, using ultrasonic treatment to facilitate the extraction.

3. The EtOAc was filtrated and transferred in to a new vial, then analysied on the GC-MS as described.

4. The moss left in the first vial was dried at 75 °C for 48 hours, and the dry weight was established.

5. The concentrations of sclareol in each moss line were calculated based on the external authentic standard and the dry weight of the moss. Example 5

Cultivation of Physcomitrella

Cultivation of Physcomitrellla was performed according to standard protocols [21]. All work with Physcomitrella was done under sterile conditions using sterile materials and standard sterile techniques. All moss handling was performed with sterile forceps.

Cultivation on solid media using Petri dishes.

1 . A lump (approximately 2-5 mm in diameter) of Physcomitrella gametophyte tissue was placed on a Petri dish with Phy B solid media. The plates were sealed with 3M surgical tape

2. The cultures were incubated at standard conditions in a 25 °C growth chamber. Cultivation on solid media overlaid with cellophane discs.

The growing Physcomitrella on top of cellophane the tissue did not adhere to the solid agar and was therefore easy to handle.

1 . A 9 mm Petri dish filled with Phy B solid media was overlayed with a sterilized disc of cellophane. The lid was closed to let the cellophane disc absorb moisture for 10 min, then straightened to avoid air bubbles and wrinkles.

2. A lump of approximately one week old tissue (around 10 mm in diameter) was added to 10 mL sterile water in a 50 mL tube and homogenized using a Polytron tissue disruptor.

3. A serological pipette was used to transfer 1-3 mL of homogenized suspension to Petri dishes with cellophane overlaid Phy B media and the plates were carefully dried uncovered.

4. The plates were sealed carefully with surgical tape and the cultures were incubated one week in a growth chamber at standard conditions.

5. The tissue was harvested by scraping it off the cellophane.

6. For further biomass generation step 1 - 5 was repeated using a plate of protonemal tissue blended in 10 mL H 2 0.

Cultivation in liquid medium using simple medium scale photo-bioreactors.

1 . Tissue from a plate of one week old protonemal tissue was added to 5-10 mL sterile water and homogenized by a Polytron. 2. The solution was inoculated in 50-100 ml_ BCD liquid media. In addition to BCD media, liquid Phy B and KNOP media may be used for liquid growth. In order to obtain very high growth rates approximately 1.5% of C0 2 may be supplemented to the moss culture and the light intensity may be increased.

3. The flask using a sterile BugStopper was sealed and incubated for up to 3 months on a rotary shaker in a growth chamber.

4. The tissue was homogenized every week.

Results

The transgenic moss lines producing patchoulol

29 transgenic lines producing patchoulol were obtained after 2 rounds of selection. Among them, 5 transgenic lines producing the highest amounts of patchoulol were selected according to HS-SPME results. The volatile metabolites were captured by dodecane overlays (Fig.2) and the yields of patchoulol were quantified.

5 transgenic moss lines with plastid-targeted PTS enzyme were also obtained and they showed the identical volatile metabolites profile to that of cytosolic-targeted ones. 3 transgenic moss lines with plastid-targeted PTS enzymes in the PpCPS/KS KO (bifunctional Copalyl Diphosphate/Kaurene Synthase KnockOut) moss line were also obtained and they showed the identical volatile metabolites profile to that of cytosolic- targeted ones. 16 and 10 stable lines were obtained with PpCPS/KS gene knockout in WT-PTS35 and WT-tpPTS9, respectively. Patchoulol was identified and no kaurene products were detected using HS-SPME analysis.

The transgenic moss lines producing 3-santalene

2 transgenic lines producing β-santalene with cytosolic STS enzyme in the WT moss line were obtained (Fig.4).

1 transgenic line producing β-santalene with cytosolic STS enzyme in the PpCPS/KS KO line was obtained and the volatile metabolites profile was identical to that in the WT line.

1 transgenic line producing β-santalene with plastidic STS enzyme was obtained.

The transgenic moss lines producing sclareol

2 transgenic lines producing Sclareol were obtained both with WT and PpCPS/KS KO background line. The stable transgenic lines were named WT-pUNI33-TPSsa3-2A- TPS1132 and KO-pUNI33-TPSsa3-2A-TPS1132 respectively. The volatile metabolites profile demonstrated sclareol production in transformed moss (Fig 7).

A time-course measurement of scalreol production in KO-pUNI33-TPSsa3-2A- TPS1 132 was conducted and the yield was dependent on moss age, with the highest production around 22 days after subculture to yield 280 μg/g dry weight (Fig 8).

Example 6

The DNA expression vector containing PTS (patchoulol synthase gene, SEQ ID NO 1) or STS (α/β-santalene, SEQ ID NO 2) gene for random integration into the Physcomitrella genome was constructed as the following procedures:

1. pLIFE33 was double-digested using Psil and Pml to generate the fragments containing the hygromycin-resistant cassette and the CaMV 35S promoter-driven operon containing the USER cassette.

2. The digested blunt-end fragment was ligated into pJET1.2 and the resultant construct was referred to as pUNI33.

3. The PTS and STS genes were amplified by PCR using the primer pairs PTS-F & R and STS-F & R with USER-compatible overhangs, respectively.

4. The PTS and STS genes were integrated into the USER cassette sites of pUNI33 by USER cloning and referred to as pUNI33 PTS & pUNI33 STS, respectively.

The DNA expression vectors, which targeted the PTS or STS enzyme into the plas- tids, were constructed by adding the transit peptide of Arabidopsis thaliana RuBisCO small subunit enzyme at the N-terminus of the PTS or STS enzyme by USER fusion. They were referred to as pUNI33 tpPTS & pUNI33 tpSTS.

pCL755, used to knockout PpCPS/KS gene, was constructed as followings:

1. A 3 kb fragment of the PpCPS/KS was amplified by PCR using the primers oCL252 and oCL254 and the nested primers oCL253 and oCL255 including attB1 and attB2 overhangs;

2. The PCR product was cloned into pDONR201 and digested with Xhol;

3. The nptll cassette was obtained from pMBL6 by Xhol digestion and ligated into the backbone vector from step 3;

4. The resulting vector was referred to as pCL755.

The primer sequences and vector descriptions are shown in Table S1 and S2.

Before PEG-mediated transformation, the vectors were linearized using Notl-HF (New England Biolabs), concentrated by isopropanol and approximately 20 μg of DNA was prepared. For CPS/KS disruption, the vector pCL755 was double digested by EcoRI and Ndel and concentrated by isopropanol as well. Linearized DNA was subsequently mixed with moss protoplasts using the PEG-mediated transformation method. After 5-7 days of regeneration, the moss tissues were transferred to antibiotic selection to recover stable transgenic lines using appropriate antibiotics.

The volatile metabolites profile of all the transgenic lines was determined by HS-

SPME (Headspace-Solid Phase Micro-Extraction) and GC-MS (Gas Chromatography- Mass Spectrum) analysis. Specification of HS-SPME and GC-MS for qualitative study is described above.

The volatile metabolites were putatively identified by comparing the mass spectra in the NIST and WILEY library and their retention index (Rl). Quantification of volatile metabolites was conducted in liquid culture overlaid with dodecane (Sigma-Aldrich).

Fresh moss tissues were blended and inoculated (each inoculum approximately 10 mg DW) into 50 ml PhyB medium in 250 ml shake flasks, and cultivated on a rotary shaker at standard conditions. After 2 weeks, the media together with moss tissues were extract- ed using 50 (for the PTS lines) or 25 ml heptane (for the STS lines) by hand shaking for 1 min. After phase separation, 1 ml of the organic phase was taken into the GC vial and 1 ul was injected into GC-MS. Quantification of patchoulol and a-santalene was achieved based on a standard curve.

The moss tissue was filtered using a vacuum pump and dried at 60°C overnight, and the dry weights were measured the next day.

GC-MS analysis was performed on a GCMS 7890/5975C (Agilent) equipped with a LTM column module (DB-1 MS) (10 m x 0.18 mm i.d. x 0.18 μηι). Samples (1 μΙ) were injected with a split ratio of 25: 1 into LTM (DB-1 MS) column using the following temperature program: 50 °C (held for 1 min), 50-320 °C (30°C /min, held for 1 min), and the total time is 1 1 min (2 min solvent delay). The oven temperature was 200°C (held for 1 1 min). The injector temperature of GC was 250°C. The ion source temperature of the mass spectrometer was 230 °C and the transfer line temperature was set at 250°C. Helium was used as carrier gas at constant flow rate 0.7 ml/min. Data were acquired by EI+ with SIM (Selected Ion Monitor) mode. The detail of SIM method to quantify patchoulol and a-santalene was as follows:

RT (retention time) 2.00 to 6.00 min diagnostic ions 98, 138, 161 , 222 were selected to monitor patchoulol and 93, 94, 107, 121 , 122, 204 for α/β-santalene;

RT 6.00 to 1 1.00 min diagnostic ions 232, 257, 272, 290 were selected to monitor 16a-hydroxykaurane.

16a-hydroxykaurane was quantified in approximate level according to the external standards (patchoulol or a-santalene) curves. The statistical significance was calculated using Student's t-test defining the significant level as P value<0.05.

The transcripts of the regulatory genes in the terpenoid biosynthetic pathway were quantified by RT-qPCR (Real Time-quantitative PCR). Three biological replicates were selected for each 2-week old transgenic line and their total RNAs were extracted using Spectrum™ Total RNA Kit (Sigma) and the concentrations and quality (A260/A28O1.8; A260/A23O2.0) were measured using Nanodrop 1000D.

The first-strand cDNA was subsequently generated using iScript™ cDNA synthesis Kit (Bio-Rad). cDNA was used as templates in RT-qPCR using the dyNAmo SYBR Green qPCR kit (Thermo Scientific).

P. patens actin2 (NP_188508) was used as the reference gene here, and the transcripts level was calculated using the 2 _ΔΔι method. The nucleotide sequences of the qPCR primers are listed in Table S3 in the supporting information. Results:

The metabolite profile of the cytosolic PTS lines.

DNA for cytosolic expression of PTS was introduced into wild type moss by random integration. After two rounds of antibiotic selection, 44 independent stable lines named WT-PTS1 to WT-PTS44 were obtained. Of these, 29 emitted the patchoulol and other sesquiterpene products as analyzed by HS-SPME and GC-MS analysis (Figure 18 and 19). In addition to the native diterpene metabolites in the moss, at least 16 different sesquiterpene products including patchoulol were detected in the headspace (Figure 18 and 19). Among them, seychellene was the most abundant volatile followed by a- patchoulene and a-guaiene.

Five cytosolic PTS lines, WT-PTS10, 14, 19, 27 & 35, were subsequently selected according to their relatively larger peak area of seychellene for metabolite quantification. Patchoulol was confirmed to be the major sesquiterpene product in the WT-PTS lines in liquid cultures overlaid with dodecane along with several other sesquiterpene hydrocar- bons detected (Figure 18 and 19). The five selected cytosolic PTS lines produced 0.2-0.8 mg patchoulol/g dry weight (DW), cf. Table S4. The amount of the native diterpene metabolite 16a-hydroxykaurane was significantly reduced compared to the WT moss, with the exception of the line WT-PTS10 (Table S4).

The metabolites profile in the KO lines.

The CPS/KS gene was disrupted in both WT and WT-PTS35 to augment the IPP pool and redirect the carbon flux from native diterpenes to sesquiterpene patchoulol production in P. patens.

The CPS/KS knockout line designated as KO was first constructed by disrupting the corresponding gene CPS/KS by homologous recombination. Using KO as the background line, one transgenic line named KO-tpPTS1 was obtained by introducing the plastid- targeted PTS enzyme. However, the yield of patchoulol in KO-tpPTS1 was below detection (<1 μg/g DW) compared to 0.02 mg patchoulol/g DW in WT-tpPTS2, one of the plastidic PTS lines in the WT background (Table S4). To eliminate the positional effect of random integration events, the effect of CPS/KS disruption on patchoulol production was evaluated by knocking out CPS/KS in the previously characterized line WT-PTS35.

The CPS/KS gene was knocked out in the cytosolic PTS line using the identical approach as previously described. The headspace volatile analysis identified one line from the transformants, which showed presence of patchoulol but absence of the diterpene metabolites, PTS35-KO. However, the yield of patchoulol was appearently reduced to 0.4 mg/g DW in PTS35-KO (Table S4). Disruption of CPS/KS in one of the plastidic PTS lines, WT-tpPTS2, was also attempted but no transgenic lines with a positive chemotype lacking kaurenoids was obtained. In summary, both studies showed that disruption of the CPS/KS gene neither improved the patchoulol yield in the cytosolic nor in the plastidic PTS line. The metabolites profile of the plastidic PTS lines.

PTS enzyme was targeted into the moss plastids by fusing the transit peptide of Ar- abidopsis RuBisCO small subunit at the N-terminus of the enzyme. Both the WT and KO line were used as the background lines to test the effect of the plastidic targeting of the PTS enzyme on patchoulol production. Besides the native diterpenes and heterologous sesquiterpene products (including patchoulol), the headspace volatiles analysis showed emission of monoterpene products in both the WT and KO plastidic lines (Figure 20).

Genotyping of the cytosolic PTS lines.

In the cytosolic PTS lines, the PTS gene transcripts level in the five WT-PTS lines were quantified by RT-qPCR, cf. Table G.

Five WT-PTS lines showed in general very high PTS transcript level, among which WT-PTS27 was the highest. WT-PTS27 was about 10-100 folds higher than the endogenous terpenoid biosynthetic genes, indicating the high expression activity of the PTS gene in the moss genome (Table S6). Moreover, the expression level of the HMGR gene was investigated and, it was found that this rate-limiting gene in the MEV pathway was up- regulated in all the five WT-PTS lines. However, unlike HMGR, both FPS and SQS showed differentiated expression profiles in the five WT-PTS lines. The FPS expression was strongly reduced in WT-PTS10 and WT-PTS14, whereas highly overexpressed in the rest of the three lines WT-PTS19, 27 and 35. Similarly for SQS, its expression was de- creased in WT-PTS10 and 14, but remained constant or slightly increased in WT-PTS19, 27 and 35. With respect to the MEP pathway and its downstream routes towards diterpenes synthesis, the genes DXS, DXR, GGPS, and CPS/KS were examined in transcripts level. DXS and GGPS showed a very similar expression pattern with each other in the 5 cytosolic PTS lines. Intriguingly, DXS and GGPS were down-regulated in the three higher patchoulol-producers WT-PTS19, 27 & 35, in contrast to the slight up-regulation in WT-PTS10 & 14. In addition, DXR either remained stable or was overexpressed in the 5 cytosolic PTS lines. Regarding to the bi-functional diterpene synthase gene CPS/KS in P. patens, it showed much higher expression level in WT-PTS10 than the WT line, while the rest four lines remained either constant or strongly reduced.

Genotyping of the CPS/KS KO lines.

Disruption of the CPS/KS gene was conducted to redirect the diterpenes metabolism to the sesquiterpenes production in P. patens, cf. Table G.

One of the patchoulol-producing moss lines WT-PTS35 was selected to disrupt the CPS/KS gene by homologous recombination, PTS35-KO. The genotyping work confirmed the reduction of the CPS/KS transcript level in PTS35-KO compared to WT but was slightly increased against WT-PTS35, probably because the qPCR primers of the CPS/KS gene annealed to the 5'-UTR. The genomic diagnostic work showed that a 2-kb PCR product was successfully amplified when using primer pair oSSB94 & oSSB95, but no PCR prod- uct was amplified when the primer pair oCL77 and oCL252 was used, indicating a possible unexpected integration way of the nptll cassette into the moss genome of PTS35-KO. Regarding the change in transcripts level, all the regulatory genes in the MEV pathway and its downstream route, PTS, HMGR, FPS and SQS, were transcriptional down-regulated. Unlike the expression pattern of the regulatory genes in the MEV pathway, the expression patterns in the MEP and its downstream pathway seemed to be differentially regulated. In PTS35-KO, the activity of DXS and GGPS was significantly up-regulated but the transcript level of DXR was obviously reduced, the same trend as CPS/KS disruption in WT. A similar phenomenon was observed in GGR and PSY, which catalyze the first committed step in phytol and carotenoid synthesis, respectively. GGR activity was decreased but PSY was boosted in PTS35-KO, indicating the rebalance of different terpneoid products from the MEP and its downstream pathway in P. patens. Genotyping of the plastidic PTS lines.

Several plastidic PTS lines in WT or KO background were obtained, and the genetic background of two of these lines, WT-tpPTS2 and KO-tpPTS1 , were investigated in terms of the regulatory gene transcripts level, cf. Table G.

PTS transcript levels of WT-tpPTS2 and KO-tpPTS1 were found to be at the same magnitude as some cytosolic PTS lines. For example WT-PTS14. HMGR, FPS and SQS transcripts level were all down-regulated after the PTS enzyme was targeted into the plas- tids of the WT line, while in contrast they were all up-regulated after the PTS enzyme was targeted into the plastids of the KO line. Likewise, DXS, DXR and GGPS were all down- regulated, when the PTS enzyme was targeted into the plastids of the WT line, while in contrast, they were stable or overexpressed when the PTS enzyme was targeted into the plastids of the KO line. In addition, CPS/KS, GGR and PSY were all down-regulated in WT-tpPTS2, being especially significant for GGR, while no significant GGR and PSY transcripts level change was observed between KO and KO-tpPTS1. Very low amounts of CPS/KS transcript level was detected in KO and KO-tpPTS1 , i.e. the lines where the CPS/KS gene was assumed to be disrupted. This could also be ascribed to the 5'-UTR annealing sites of the qPCR primers of the CPS/KS gene.

The metabolites profile of the cytosolic STS lines.

STS (α/β-santalene synthase) gene from the sandalwood S. album was introduced into the moss genome by random integration. Four stable transgenic lines WT-STS3, 6, 11 & 13 were obtained, and HS-SPME coupled with GC-MS analysis showed that a- santalene, a-bergamotene, epi^-santalene and β-santalene were emitted into the head- space besides the endogenous diterpene metabolites (Figure 21). The yields of α/β- santalene were - attempted to be - quantified in liquid cultures overlaid with dodecane, but none of the four transgenic lines appears to produce sufficient amounts of detectable α/β- santalene, indicating that the culture produced about or below 1 μg β-santalene/g DW in P. patens (Table S5). However, the 16a-hydroxykaurane amounts varied a lot between the four transgenic lines (Table S7). WT-STS3 showed same level of 16a-hydroxykaurane as WT, but the rest three lines WT-STS6, 11 and 13 produced less amounts of 16a- hydroxykaurane (Table S7). Especially in WT-STS1 1 , the amount of 16a-hydroxykaurane was reduced to 0.05 mg/g DW (Table S7).

The metabolites profile in the CPS/KS-KO STS lines. One cytosolic STS line in the KO background was obtained, KO-STS1. The volatile metabolites analysis confirmed emission of α/β-santalene into the headspace and no detection of native diterpene products in KO-STS1 (data not shown). The yield of α/β- santalene was investigated as well but again it was not sufficient high to reach the detect- able level (Table S5).

The metabolites profile of the plastidic STS lines.

Like the PTS enzyme, the STS enzyme was targeted into the plastids of the WT line using the transit peptide of Arabidopsis RuBisCO small subunit. Four plastidic STS lines were obtained, and the HS-SPME GC-MS analysis showed that one of the four lines, named WT-tpSTS1 , emitted monoterpenes besides the sesquiterpene products and the native diterpene metabolites (Figure 22). However, 3 additional plastidic STS lines WT- tpSTS8, 9, & 10 produced low amounts of α/β-santalene and no monoterpenes was detected (data not shown). The yield of α/β-santalene in WT-tpSTS1 was subsequently de- termined to be 0.039±0.008 and 0.035±0.001 mg/g DW respectively, which were the highest α/β-santalene yields achieved in P. patens (Table S5).

Genotyping of the cytosolic & plastidic STS lines.

In order to understand why the yields of α/β-santalene were low in the cytosolic STS lines, the transcripts level of the selected key genes in the terpenoid biosynthetic pathway were investigated, cf. Table G.

The transcripts profile showed an extremely low expression levels of the STS gene in WT-STS6 and WT-STS13 compared to WT-tpSTS1 , of which the STS transcript level was in the same magnitude as the cytosolic PTS lines. In addition, HMGR, FPS and SQS behaved differently after the STS enzyme was introduced into the cytosol of the WT line. Thus, HMGR was overexpressed but FPS and SQS were down-regulated. However, all 3 genes were up-regulated in the plastidic line WT-tpSTS1. Regarding the genes in the MEP and its downstream pathway, different expression profiles were observed for the different regulatory genes. DXS and GGPS were up-regulated in both the cytosolic and plastidic STS lines, while the DXR level in these 3 selected lines were either reduced or improved. Moreover, the expression level of CPS/KS was reduced in both the cytosolic and plastidic STS lines. With respect to the pigments biosynthetic genes, the GGR transcript level remained constant, but the transcript level of PSY was up-regulated in WT-tpSTS1. Genotyping of the CPS/KS KO lines. Compared to the STS lines in the WT background, the STS line obtained in the KO background showed an 8-fold higher STS transcript level, which apeear to be too low to produce sufficient detectable amounts of α/β-santalene by GC-MS, cf. table G. The CPS/KS transcript level study confirmed the disruption of the transcriptional activity of the CPS/KS gene in KO-STS1. Likewise, the HMGR gene was up-regulated in both KO and KO-STS1 , but both FPS and SQS were down-regulated. In the MEP and its downstream pathway, DXS, DXR, GGPS and PSY were all up-regulated, while GGR remained constant after the introduction of the STS gene in the KO line. When comparing between WT and KO, all regulatory genes except DXR in the MEP pathway were over-expressed after disruption of the CPS/KS gene in WT.

Abbreviations

"WT" equals "wildtype"

Table G. Fold change of gene expression in mutant lines compared to WT of P. patens

native native

MEV genes MEP genes

PpHM FPS SQS DXS DXR GGPS CPS/K GGR PSY GR S

WT 1 1 1 1 1 1 1 1 1

KO 1,683 0,635 0,670 1,299 0,352 1,022 0,234 1,731 2,499

WT- 1,487 0,449 0,701 2,086 1,386 2,131 5,609

PTS10

WT- 4,205 0,229 0,706 1,656 2,001 1,174 0,579

PTS14

WT- 2,741 1,430 0,988 0,924 1,027 0,730 1,082

PTS19

WT- 3,339 1,243 0,962 0,898 1,025 0,558 1,211

PTS27

WT- 3,006 1,595 1,240 0,512 1,697 0,330 0,181 0,702 0,556 PTS35

WT- 0,769 0,276 0,716 0,498 0,101 0,305 0,662 0,376 0,699 tpPTS2

KO- 3,104 0,879 0,772 1,262 1,773 1,380 0,308 0,468 1,468 tpPTSI

PTS35- 2,213 0,548 0,572 1,096 1,040 2,258 0,377 0,351 1,837 KO

WT- 1,648 0,241 0,692 2,193 0,637 1,728 0,607

STS6

WT- 3,731 0,512 0,555 1,698 1,232 2,670 0,403

STS13

WT- 4,529 2,327 1,076 2,348 1,794 1,911 0,532 0,978 1,553 tpSTSI

KO- 2,070 0,694 0,715 2,706 1,139 2,192 0,331 1,740 3,891 STS1 Table S1 the nucleotide sequences of the primers used for vectors construction

Primer Nucleotide sequence (5'->3')

PTS-F GGCTTAAUATGGAGTTGTATGCCCAAAG

PTS-R G GTTTA A U TTAATATGG A AC AG G GTG A AG

STS-F GGCTTAAUATGGATTCTTCCACCGCCAC

STS-R GGTTTAAUCTACTCCTCGCCGAGAGGAA

tp-F GGCTTAAUATGGCTTCCTCTATGCTCTCCTC

PTS-R GGTTTAAUTTAATATGGAACAGGGTGAAGGTACAA

C

tp-R AGATCTUCCCCCGTTGCTTGC

tp-STS-F AAGATCUAGCTGCATGAAGGAGCTCGGCGCGCCT

ATGGATTCTTCCACCGCCACC

oCL.252 CTCTGTCTCTCCCACAATCCTCC

oCL77 CCTGTGCAAGGTAAGAAGATGG

OSSB94 AAGCCGACTTCAACATGTG

OSSB95 AACCCGAAGCTCTTCCAC

Table S2 the vectors used and constructed for PEG-mediated moss transformation

vector description

pCAMBIA230035Su pCAMBIA2300 with USER cassette, hygromycin resistant pCAMBIA130035Su pCAMBIAI 300 with USER cassette, kanamycin resistant pUNI33 pJET1.2 backbone with the essential compartment from pCAMBIA130035Su (plant antibiotic resistant cassette, 35S promoter-USER cloning site-35S terminator) pUNI6 pJET1.2 backbone with the essential compartment from pCAMBIA230035Su (plant antibiotic resistant cassette, 35S promoter-USER cloning site-35S terminator) pUNI33 PTS pUNI33 containing the coding region of the PTS gene

pUNI33 STS pUNI33 containing the coding region of the STS gene

pUNI33 tpPTS pUNI33 containing the PTS gene with transit peptide sequence of Arabidopsis RuBisCO small subunit pUNI33 tpSTS pUNI33 containing the STS gene with transit peptide sequence of Arabidopsis RuBisCO small subunit pCL755 pDONR201 ::CPS/KS containing a cassette containing p35S- nptll-CamVter between two 1 ,5 kb homologous recombination arms from the CPS/KS gene

Table S3 the selected regulatory genes and the nucleotide sequences of the qPCR primers

gene Primer Nucleotide sequence

PTS PTS-QF AAGCACAAACCCACAACCAAGGAG

PTS-QR AGAGCTTACATGGAAGAGGCCCAA

STS STS-QF CCTTCCTGATCTTCTGCACTAC

STS-QR ATTATCGCCTCTTGCCATCTC

PpHMGR P100 CCATTCGTCCAACATTTTGATGA

P101 GGGAGCAAGCAAGTTTAACACTG

FPS P134 CAGGACGACTATCTTGACTGTTACG

P135 TTCCTTCAATAACTGTTTCTGGG

SQS SQS-QF CAGTTGCAATCTTCAGCAGTTC

SQS-QR CACCTCAACCTGCTAGTCTAATC

DXS P70 GGAGTGAGGTTGTTGTTGTTGTTG

P71 CAAGGCAGCCATAGTGAGAAAC

DXR P92 GGTCTCAGTTGCTTCCGAAATAC

P93 GAAAGAAAAGGAGAGCGTTGG

GGPS P112 GCTTCTCTCCCTTCGTCTCTCATAG

P113 CCAGCACACCACACAACTCAAC

CPS/KS P136 GCTTCCAGCACCTTGATACAGA

P137 CGACGCTTCTTTAGGCATTGA

GGR P120 AGGGTTCTTATCCGTTGATACTGC

P121 GATGTCCTGTTTTACAAGTGAATGAG

PSY P124 TCTTACAATCTGCTGGGATGGTG

P125 CCTCAAGTGCCTGGACTGGTCA

Actin2 P84 GCGAAGAGCGAGTATGACGAG

P85 CTCCATAACCCCACCTGACAA Table S4 Quantification of patchoulol and 16a-hydroxykaurane (mg/g DW) in all the PTS lines

Table S5 the α/β-santalene and 16a-hydroxykaurane amounts (mg/g DW) in the STS lines

a-santalene β-santalene 16a-hydroxykaurane

WT 0 0 2,42±0,01

KO 0 0 0

WT-STS3 n.d. n.d. 3,21 ±0,55

WT-STS6 n.d. n.d. 1 ,69±0,37

WT-STS1 1 n.d. n.d. 0,04±0,02

WT-STS13 n.d. n.d. 1 ,51 ±0,50

KO-STS1 n.d. n.d. 0

WT-tpSTS1 0,039±0,008 0,035±0,005 16,557±5,034

Table S6 the transcripts level of the selected regulatory genes in the WT, KO and all the transgenic moss lines. They were all normalized by the actin2 gene in P. patens.

PTS HMGR FPS SQS

WT - 0,107±0,006 0,212±0,033 0,069±0,005

KO - 0,180±0,070 0,134±0,005 0,047±0,003

WT-PTS10 9,159±0,547 0,159±0,014 0,095±0,002 0,049±0,001

WT-PTS14 3,169±1 ,002 0,449±0, 121 0,048±0,019 0,049±0,003

WT-PTS19 1 1 ,553±0, 160 0,292±0,032 0,303±0,007 0,069±0,005

WT-PTS27 22,045±2,633 0,356±0,053 0,263±0,022 0,067±0,007

WT-PTS35 5,731±2,413 0,321±0,027 0,338±0,017 0,086±0,006

PTS35-KO 2,460±0,801 0,236±0,005 0,1 16±0,013 0,040±0,001

WT-tpPTS2 2,985±0,267 0,082±0,008 0,058±0,014 0,050±0,004

KO-tpPTS1 3,192±0,289 0,331±0,037 0,186±0,004 0,054±0,003

DXS DXR GGPS CPS/KS

WT 0,067±0,014 0,166±0,008 0,1 17±0,013 0,098±0,004

KO 0,086±0,018 0,058±0,030 0,120±0,039 0,023±0,004

WT-PTS10 0,139±0,006 0,230±0,005 0,250±0,079 0,552±0, 140

WT-PTS14 0,1 10±0,01 1 0,332±0,035 0,138±0,025 0,057±0,027

WT-PTS19 0,062±0,014 0,170±0,019 0,086±0,016 0,106±0,012

WT-PTS27 0,060±0,01 1 0,170±0,010 0,065±0,007 0,1 19±0,01 1

WT-PTS35 0,034±0,010 0,281±0,030 0,039±0,005 0,018±0,002

PTS35-KO 0,073±0,006 0,172±0,030 0,265±0,032 0,037±0,010

WT-tpPTS2 0,033±0,004 0,017±0,006 0,036±0,027 0,065±0,009

KO-tpPTS1 0,084±0,015 0,294±0,024 0,162±0,039 0,030±0,007

STS HMGR FPS SQS

WT-STS6 0,010±0,003 0,176±0,022 0,051±0,004 0,048±0,003

WT-STS13 0,006±0,003 0,398±0,01 1 0,108±0,035 0,039±0,007

WT-tpSTS1 6,670±2,040 0,483±0,210 0,493±0, 112 0,075±0,007

KO-STS1 0,083±0,012 0,221±0,016 0,147±0,001 0,050±0,004

DXS DXR GGPS CPS/KS

WT-STS6 0,146±0,022 0,106±0,016 0,202±0,016 0,060±0,012

WT-STS13 0,1 13±0,009 0,204±0,086 0,313±0,025 0,040±0,008 WT-tpSTS1 0,156±0,029 0,297±0, 114 0,224±0,016 0,052±0,01 1

KO-STS1 0,180±0,039 0,189±0,023 0,257±0,052 0,033±0,01 1

GGR PSY

WT 0,908±0,047 0,133±0,01 1

KO 1 ,572±0,031 0,333±0,010

WT-PTS35 0,638±0,306 0,074±0,017

PTS35-KO 0,319±0,043 0,244±0,021

WT-tpPTS2 0,341±0,047 0,093±0,009

KO-tpPTS1 0,425±0,031 0,195±0,029

WT-tpSTS1 0,888±0,219 0,207±0,023

KO-STS1 1 ,580±0, 109 0,518±0,079

Table S7. Production of levels of Patchoulol, santalene and 16-hydroxykaurene

nd: not detected.

The leves are given in mg/g dry weigth.

References

1. Kirby J, Keasling JD: Biosynthesis of plant isoprenoids: Perspectives for microbial engineering. Annu Rev Plant Biol 2009, 60:335-355.

2. Pickel B, Drew DP, Manczak T, Weitzel C, Simonsen HT, Ro DK: Identification and characterization of a kunzeaol synthase from Thapsia garganica: implications for the biosynthesis of the pharmaceutical thapsigargin. Biochemical Journal 2012, 448(2):261-271.

3. Martin DM, Faldt J, Bohlmann J: Functional characterization of nine Norway spruce TPS genes and evolution of gymnosperm terpene synthases of the TPS-d subfamily. Plant Physiol 2004, 135(4): 1908-1927.

4. Hamberger B, Ohnishi T, Hamberger B, Seguin A, Bohlmann J: Evolution of diterpene metabolism: Sitka spruce CYP720B4 catalyzes multiple oxidations in resin acid biosynthesis of conifer defense against insects. Plant Physiol 2011 , 157(4): 1677-1695.

5. Fischer MJ, Meyer S, Claudel P, Perrin M, Ginglinger JF, Gertz C, Masson

JE, Werck-Reinhardt D, Hugueney P, Karst F: Specificity of Ocimum basilicum geraniol synthase modified by its expression in different heterologous systems. J Biotechnol lOM, 163(1):24-29.

6. Rensing SA, Fritzowsky D, Lang D, Reski R: Protein encoding genes in an ancient plant: analysis of codon usage, retained genes and splice sites in a moss, Physcomitrella patens. Bmc Genomics 2005, 6:43.

7. Vietor R, Loutelier-Bourhis C, Fitchette AC, Margerie P, Gonneau M, Faye L, Lerouge P: Protein N-glycosylation is similar in the moss Physcomitrella patens and in higher plants. Planta 2003, 218(2):269-275.

8. Koprivova A, Altmann F, Gorr G, Kopriva S, Reski R, Decker EL: N- glycosylation in the moss Physcomitrella patens is organized similarly to that in higher plants. Plant Biol 2003, 5(6):582-591.

9. Mega T: Plant-type N-glycans containing fucose and xylose in Bryophyta (mosses) and Tracheophyta (ferns). Biosci Biotechnol Biochem 2007, 71 (12):2893-2904.

10. Simonsen HT, Drew DP, Lunde C: Perspectives on using Physcomitrella patens as an alternative production platform for thapsigargin and other terpenoid drug candidates. Perspect Medici n Chem 2009, 3: 1-6.

1 1. Anterola A, Shanle E, Perroud PF, Quatrano R: Production of taxa- 4(5),11 (12)-diene by transgenic Physcomitrella patens. Transgeic Res 2009, 18(4):655-660.

12. Hamberger B, Bak S: Plant P450s as versatile drivers for evolution of speciesspecific chemical diversity In: Phil Trans R Soc B (in press). 2013.

13. Yonekura-Sakakibara K, Hanada K: An evolutionary view of functional diversity in family 1 glycosyltransferases. Plant J 201 1 , 66(1): 182-193.

14. Chen F, Tholl D, Bohlmann J, Pichersky E: The family of terpene synthases in plants: a mid-size family of genes for specialized metabolism that is highly diversified throughout the kingdom. Plant J 201 1 , 66(1):212-229.

15. Sakamoto T, Miura K, Itoh H, Tatsumi T, Ueguchi-Tanaka M, Ishiyama K, Kobayashi M, Agrawal GK, Takeda S, Abe K et al: An overview of gibberellin metabolism enzyme genes and their related mutants in rice. Plant Physiol 2004, 134(4): 1642-1653.

16. Anterola A, Shanle E: Genomic insights in moss gibberellin biosynthesis. Bryologist 2008, 111 (2):218-230.

17. Johri MM: Hormonal regulation in green plant lineage families. Physiol

Mol Biol Plants 2008, 14(1-2):23-38.

18. von Schwartzenberg K, Schultze W, Kassner H: The moss Physcomitrella patens releases a tetracyclic diterpene. Plant Cell Rep 2004, 22(10):780-786.

19. Schaefer DG, Zryd JP: Efficient gene targeting in the moss Physcomitrella patens. Plant J 1997, 11 (6): 1195-1206.

20. Hayashi K, et al: Endogenous diterpenes derived from enf-daurene, a common gibberellin precursor, regulate protonema differentiation of the moss Physcomitrella patens. Plant Physiol 2010, 153(3): 1085-1097.

21. Bach SS, King BC, Zhan X, Simonsen HT, Hamberger B: Heterologous stable expression of terpenoid biosynthetic genes using the moss Physcomitrella patens. In: Methods in Molecular Biology Edited by Rodriguez-Concepcion M, vol. The Plant Isoprenoids; 2013: in press.

22. Reski R: Molecular genetics of Physcomitrella. Planta 1999, 208:301-

309.

23. Reski R: Development, genetics and molecular biology of mosses.

Botanica Acta 1998, 111 : 1-15.

24. Ashton NW, Grimsley NH, Cove DJ: Analysis of Gametophytic Development in the Moss, Physcomitrella-Patens, Using Auxin and Cytokinin Resistant Mutants. Planta 1979, 144(5):427-435.