Title:
SESQUITERPENE SYNTHASES FOR PRODUCTION OF DRIMENOL AND MIXTURES THEREOF
Document Type and Number:
WIPO Patent Application WO/2018/202578
Kind Code:
A1
Abstract:
The present application relates to a method of producing drimenol and/or drimenol derivatives comprising contacting at least one polypeptide with farnesyl diphosphate (FPP). The method may be performed in vitro or in vivo. Also provided are amino acid sequences of polypeptides useful in the methods and nucleic acids encoding the polypeptides described. The method further provides host cells or organisms genetically modified to express the polypeptides and useful to produce drimenol and/or derivatives of drimenol.

Inventors:
LI PAN (CN)
WANG QI (CN)
HE XIU-FENG (CN)
HAEFLIGER OLIVIER (CH)
Application Number:
PCT/EP2018/060889
Publication Date:
November 08, 2018
Filing Date:
April 27, 2018
Assignee:
FIRMENICH & CIE (CH)
International Classes:
C12N9/02; C12N9/10; C12N9/16; C12N9/88; C12P5/00; C12P7/04
Domestic Patent References:
WO2015086885A1 2015-06-18
WO2014150599A1 2014-09-25
WO2015169871A2 2015-11-12
WO2015176959A1 2015-11-26
WO2013058655A1 2013-04-25
Other References:
DATABASE GenBank [online] 16 December 2016 (2016-12-16), CHU,F.H., WEN,C.H., LEE,Y.R. AND CHUANG,L: "Identification and characterization of five terpene synthases from Liquidambar formosana", XP002773517, Database accession no. KF874661
"Laboratory Manuals", 2002, COLD SPRING HARBOR LAB PRESS
TATIANA ET AL., FEMS MICROBIOL LETT., vol. 174, 1999, pages 247 - 250
Attorney, Agent or Firm:
BAUMGARTNER HARRIS, Pauline (CH)
Claims:
CLAIMS

1. A method for producing drimenol or a mixture comprising drimenol comprising:

a. contacting an acyclic farnesyl diphosphate (FPP) precursor with a polypeptide having sesquiterpene synthase activity, wherein the polypeptide comprises an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 2 or comprises the amino acid sequence of SEQ ID NO: 2, to produce drimenol or a mixture comprising drimenol and one or more terpenes; and

b. optionally isolating the drimenol.

2. The method of claim 1, comprising transforming a host cell or non-human host organism with a nucleic acid encoding a polypeptide having drimenol synthase activity, wherein the polypeptide comprises an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 2 or comprises SEQ ID NO: 2.

3. The method as recited in claim 1 or 2, further comprising culturing a non-human host organism or a host cell capable of producing FPP and transformed to express a polypeptide, wherein the polypeptide comprises an amino acid sequence having at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99%, or 100% sequence identity to SEQ ID NO: 2 under conditions that allow for the production of the polypeptide.

4. The method as recited in any one of claims 1 to 3, comprising converting the drimenol to a drimenol derivative using chemical synthesis or biochemical synthesis.

5. An isolated polypeptide having sesquiterpene synthase activity comprising an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 2 or comprising the amino acid sequence of SEQ ID NO: 2.

6. An isolated nucleic acid molecule

a. comprising a nucleotide sequence encoding the polypeptide of claim 5 ;

b. comprising a nucleotide sequence having at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 3; or

c. comprising the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 3.

7. A vector comprising

a. the nucleic acid molecule of claim 6; or

b. a nucleic acid encoding the polypeptide of claim 5.

8. The vector of claim 7, wherein the vector is a prokaryotic vector, viral vector or a eukaryotic vector.

9. A host cell or a non-human host organism comprising

a. the isolated nucleic acid of claim 6; or

b. the vector of claim 7 or 8.

10. The method of claim 2 or 3, wherein the cell or non-human host organism is a plant, a prokaryote, a fungus, or a microorganism.

11. The method of claim 10, wherein the microorganism is a bacterium or yeast.

12. The method of claim 11, wherein the bacterium is E. coli and the yeast is Saccharomyces cerevisiae.

13. Use of the polypeptide of claim 7 for producing drimenol or a mixture comprising drimenol and one or more terpenes.

14. The use of claim 13, wherein the mixture comprises drimenol and nerolidol.

15. The method of claim 1, wherein the mixture comprises drimenol and nerolidol.

Description:
SESQUITERPENE SYNTHASES FOR PRODUCTION OF DRIMENOL AND MIXTURES THEREOF

Technical field

Provided herein are biochemical methods of producing drimenol and related compounds and derivatives and mixtures comprising drimenol, which method comprises the use of novel polypeptides.

Background

Terpenes are found in most organisms (microorganisms, animals and plants). These compounds are made up of five carbon units called isoprene units and are classified by the number of these units present in their structure. Thus monoterpenes, sesquiterpenes and diterpenes are terpenes containing 10, 15 and 20 carbon atoms, respectively. Sesquiterpenes, for example, are widely found in the plant kingdom. Many sesquiterpene molecules are known for their flavor and fragrance properties and their cosmetic, medicinal and antimicrobial effects. Numerous sesquiterpene hydrocarbons and sesquiterpenoids have been identified.
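The classification rule above (five carbon atoms per isoprene unit) can be illustrated with a minimal Python sketch. The function name and lookup table are hypothetical and serve only as an illustration; they are not part of the application.

```python
# Illustrative only: derive the terpene class from the carbon count, using
# the rule that each isoprene unit contributes five carbon atoms.
TERPENE_CLASSES = {2: "monoterpene", 3: "sesquiterpene", 4: "diterpene"}


def terpene_class(carbon_atoms: int) -> str:
    """Return the terpene class for a skeleton with the given carbon count."""
    units, remainder = divmod(carbon_atoms, 5)
    if remainder != 0 or units not in TERPENE_CLASSES:
        raise ValueError(f"no class defined here for {carbon_atoms} carbons")
    return TERPENE_CLASSES[units]


print(terpene_class(15))  # sesquiterpene
```

Only the three classes named in the paragraph are covered; any other carbon count raises an error rather than guessing.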

Biosynthetic production of terpenes involves enzymes called terpene synthases. There are numerous sesquiterpene synthases present in the plant kingdom, all using the same substrate (farnesyl diphosphate, FPP), but having different product profiles. Genes and cDNAs encoding sesquiterpene synthases have been cloned and the corresponding recombinant enzymes characterized.

Currently the main sources for drimenol are plants naturally containing drimenol; however, the contents of drimenol in these natural sources are low. Chemical synthesis approaches have been developed but are still complex and not cost-effective. There still remains a need for the discovery of new terpenes, terpene synthases and more cost-effective methods of producing drimenol and derivatives therefrom and mixtures comprising drimenol.

Summary

Provided herein is a method for producing drimenol or a mixture comprising drimenol comprising:

a. contacting an acyclic farnesyl diphosphate (FPP) precursor with a polypeptide having sesquiterpene synthase activity, wherein the polypeptide comprises an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 2 or comprises the amino acid sequence of SEQ ID NO: 2, to produce drimenol or a mixture comprising drimenol and one or more terpenes; and

b. optionally isolating the drimenol.

Also provided is a method that comprises transforming a host cell or non-human host organism with a nucleic acid encoding a polypeptide having drimenol synthase activity, wherein the polypeptide comprises an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 2 or comprises SEQ ID NO: 2.

Also provided is a method that further comprises culturing a non-human host organism or a host cell capable of producing FPP and transformed to express a polypeptide, wherein the polypeptide comprises an amino acid sequence having at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99%, or 100% sequence identity to SEQ ID NO: 2 under conditions that allow for the production of the polypeptide.

In one aspect, the drimenol produced from the above methods is isolated.

In another aspect, the method further comprises contacting the drimenol with at least one enzyme to produce a drimenol derivative.

In a further aspect, the method comprises converting the drimenol to a drimenol derivative using chemical synthesis or biochemical synthesis.

Also provided herein is an isolated polypeptide having drimenol synthase activity comprising an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 2 or comprising the amino acid sequence of SEQ ID NO: 2.

Further provided herein is an isolated nucleic acid molecule comprising a nucleotide sequence encoding a polypeptide having drimenol synthase activity and comprising an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 2 or comprising the amino acid sequence of SEQ ID NO: 2.

Further provided is an isolated nucleic acid comprising a nucleotide sequence having at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 1 or SEQ ID NO: 3 or comprising the nucleotide sequence of SEQ ID NO: 1 and SEQ ID NO: 3. Further provided is an isolated nucleic acid molecule encoding a polypeptide provided herein.

In one aspect provided herein is a vector comprising the nucleic acid molecules described herein. In another aspect, the vector is an expression vector. In a further aspect, the vector is a prokaryotic vector, viral vector or a eukaryotic vector.

Also provided is a non-human host organism or a host cell comprising (1) a nucleic acid molecule described above, or (2) an expression vector comprising said nucleic acid molecule. In one aspect the non-human organism or host cell is a prokaryotic or eukaryotic cell. In another aspect the host cell is a bacterial cell, a plant cell, a fungal cell or a yeast. In a further aspect, the bacterial cell is E. coli and the yeast cell is Saccharomyces cerevisiae.

Further provided is the use of a polypeptide described herein for producing drimenol or a mixture comprising drimenol and one or more terpenes.

In one aspect, the mixture produced in the above methods or uses comprises drimenol and nerolidol.

In a further aspect, the drimenol and/or nerolidol is isolated.

Description of the drawings.

Figure 1. Structure of (-)-drimenol.

Figure 2. Mass spectrum of authentic (-)-drimenol.

Figure 3. 13C NMR spectrum of authentic (-)-drimenol.

Figure 4. X-Ray (Cu Kα radiation) structure of authentic (-)-drimenol.

Figure 5. Shows GC/MS chromatogram of Paeonia anomala root dichloromethane extract. Arrow denotes the peak of drimenol.

Figure 6. Shows GC/MS chromatogram of the E. coli expression experiment of PaTPS3 (only the zone for sesquiterpene is displayed).

Figure 7. Shows mass spectrum of the peak of drimenol in Figure 6.

Figure 8. Shows GC/MS chromatogram of the in vitro assay of PaTPS3 (only the zone for sesquiterpene is displayed).

Figure 9. Shows mass spectrum of the peak of drimenol in Figure 8.

Abbreviations used

bp base pair

kb kilo base

DNA deoxyribonucleic acid

cDNA complementary DNA

DTT dithiothreitol

FPP farnesyl diphosphate

GC gas chromatograph

IPTG isopropyl-D-thiogalactopyranoside

LB lysogeny broth

MS mass spectrometer / mass spectrometry

MVA mevalonic acid

PCR polymerase chain reaction

RNA ribonucleic acid

mRNA messenger ribonucleic acid

miRNA micro RNA

siRNA small interfering RNA

rRNA ribosomal RNA

tRNA transfer RNA

Definitions

The term "polypeptide" means an amino acid sequence of consecutively polymerized amino acid residues, for instance, at least 15 residues, at least 30 residues, at least 50 residues. In some embodiments herein, a polypeptide comprises an amino acid sequence that is an enzyme, or a fragment, or a variant thereof.

The term "protein" refers to an amino acid sequence of any length wherein amino acids are linked by covalent peptide bonds, and includes oligopeptide, peptide, polypeptide and full length protein whether naturally occurring or synthetic.

The term "isolated" polypeptide refers to an amino acid sequence that is removed from its natural environment by any method or combination of methods known in the art and includes recombinant, biochemical and synthetic methods.

The terms "sesquiterpene synthase" or "polypeptide having sesquiterpene synthase activity" relate to a polypeptide capable of catalyzing the synthesis of a sesquiterpene or a mixture comprising one or more sesquiterpenes, for example, drimenol and/or nerolidol, starting from an acyclic terpene pyrophosphate, particularly farnesyl diphosphate (FPP).

The terms "drimenol synthase" or "polypeptide having a drimenol synthase activity" or "drimenol synthase protein" relate to a polypeptide capable of catalyzing the synthesis of drimenol, in the form of any of its stereoisomers or a mixture thereof, starting from an acyclic terpene pyrophosphate, particularly farnesyl diphosphate (FPP). Drimenol may be the only product or may be part of a mixture of sesquiterpenes.

The terms "biological function," "function," "biological activity" or "activity" refer to the ability of the drimenol synthase to catalyze the formation of drimenol or a mixture of compounds comprising drimenol and one or more terpenes.

The terms "mixture of terpenes comprising drimenol" or "mixture of sesquiterpenes comprising drimenol" refer to a mixture of terpenes or sesquiterpenes that comprises drimenol and one or more additional terpenes or sesquiterpenes.

The terms "nucleic acid sequence," "nucleic acid," "nucleic acid molecule" and "polynucleotide" are used interchangeably meaning a sequence of nucleotides. A nucleic acid sequence may be a single-stranded or double-stranded deoxyribonucleotide, or ribonucleotide of any length, and includes coding and non-coding sequences of a gene, exons, introns, sense and anti-sense complementary sequences, genomic DNA, cDNA, miRNA, siRNA, mRNA, rRNA, tRNA, recombinant nucleic acid sequences, isolated and purified naturally occurring DNA and/or RNA sequences, synthetic DNA and RNA sequences, fragments, primers and nucleic acid probes. The skilled artisan is aware that the nucleic acid sequences of RNA are identical to the DNA sequences with the difference of thymine (T) being replaced by uracil (U). The term "nucleotide sequence" should also be understood as comprising a polynucleotide molecule or an oligonucleotide molecule in the form of a separate fragment or as a component of a larger nucleic acid.
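The DNA/RNA correspondence noted above (thymine replaced by uracil) can be sketched in a few lines of Python. The function name and the example sequence are hypothetical and are not taken from the sequence listing.

```python
def dna_to_rna(dna: str) -> str:
    """Return the RNA form of a DNA sequence by replacing T with U."""
    return dna.upper().replace("T", "U")


# Hypothetical six-nucleotide example, not a sequence from the application.
print(dna_to_rna("ATGGCT"))  # AUGGCU
```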

An "isolated nucleic acid" or "isolated nucleic acid sequence" relates to a nucleic acid or nucleic acid sequence that is in an environment different from that in which the nucleic acid or nucleic acid sequence naturally occurs and can include those that are substantially free from contaminating endogenous material. The term "naturally-occurring" as used herein as applied to a nucleic acid refers to a nucleic acid that is found in a cell of an organism in nature and which has not been intentionally modified by a human in the laboratory.

"Recombinant nucleic acid sequences" are nucleic acid sequences that result from the use of laboratory methods (for example, molecular cloning) to bring together genetic material from more than one source, creating or modifying a nucleic acid sequence that does not occur naturally and would not be otherwise found in biological organisms.

"Recombinant DNA technology" refers to molecular biology procedures to prepare a recombinant nucleic acid sequence as described, for instance, in Laboratory Manuals edited by Weigel and Glazebrook, 2002, Cold Spring Harbor Lab Press; and Sambrook et al, 1989, Cold Spring Harbor, NY, Cold Spring Harbor Laboratory Press.

The term "gene" means a DNA sequence comprising a region, which is transcribed into an RNA molecule, e.g., an mRNA in a cell, operably linked to suitable regulatory regions, e.g., a promoter. A gene may thus comprise several operably linked sequences, such as a promoter, a 5' leader sequence comprising, e.g., sequences involved in translation initiation, a coding region of cDNA or genomic DNA, introns, exons, and/or a 3' non-translated sequence comprising, e.g., transcription termination sites.

A "chimeric gene" refers to any gene which is not normally found in nature in a species, in particular, a gene in which one or more parts of the nucleic acid sequence are present that are not associated with each other in nature. For example the promoter is not associated in nature with part or all of the transcribed region or with another regulatory region. The term "chimeric gene" is understood to include expression constructs in which a promoter or transcription regulatory sequence is operably linked to one or more coding sequences or to an antisense, i.e., reverse complement of the sense strand, or inverted repeat sequence (sense and antisense, whereby the RNA transcript forms double stranded RNA upon transcription). The term "chimeric gene" also includes genes obtained through the combination of portions of one or more coding sequences to produce a new gene.

A "3' UTR" or "3' non-translated sequence" (also referred to as "3' untranslated region," or "3' end") refers to the nucleic acid sequence found downstream of the coding sequence of a gene, which comprises, for example, a transcription termination site and (in most, but not all eukaryotic mRNAs) a polyadenylation signal such as AAUAAA or variants thereof. After termination of transcription, the mRNA transcript may be cleaved downstream of the polyadenylation signal and a poly(A) tail may be added, which is involved in the transport of the mRNA to the site of translation, e.g., cytoplasm.

"Expression of a gene" encompasses "heterologous expression" and "over-expression" and involves transcription of the gene and translation of the mRNA into a protein. Overexpression refers to the production of the gene product as measured by levels of mRNA, polypeptide and/or enzyme activity in transgenic cells or organisms that exceeds levels of production in non-transformed cells or organisms of a similar genetic background.

"Expression vector" as used herein means a nucleic acid molecule engineered using molecular biology methods and recombinant DNA technology for delivery of foreign or exogenous DNA into a host cell. The expression vector typically includes sequences required for proper transcription of the nucleotide sequence. The coding region usually codes for a protein of interest but may also code for an RNA, e.g., an antisense RNA, siRNA and the like.

An "expression vector" as used herein includes any linear or circular recombinant vector including but not limited to viral vectors, bacteriophages and plasmids. The skilled person is capable of selecting a suitable vector according to the expression system. In one embodiment, the expression vector includes the nucleic acid of an embodiment herein operably linked to at least one regulatory sequence, which controls transcription, translation, initiation and termination, such as a transcriptional promoter, operator or enhancer, or an mRNA ribosomal binding site and, optionally, including at least one selection marker. Nucleotide sequences are "operably linked" when the regulatory sequence functionally relates to the nucleic acid of an embodiment herein.

"Regulatory sequence" refers to a nucleic acid sequence that determines expression level of the nucleic acid sequences of an embodiment herein and is capable of regulating the rate of transcription of the nucleic acid sequence operably linked to the regulatory sequence. Regulatory sequences comprise promoters, enhancers, transcription factors, promoter elements and the like.

"Promoter" refers to a nucleic acid sequence that controls the expression of a coding sequence by providing a binding site for RNA polymerase and other factors required for proper transcription including without limitation transcription factor binding sites, repressor and activator protein binding sites. The meaning of the term promoter also includes the term "promoter regulatory sequence". Promoter regulatory sequences may include upstream and downstream elements that may influence transcription, RNA processing or stability of the associated coding nucleic acid sequence. Promoters include naturally-derived and synthetic sequences. The coding nucleic acid sequence is usually located downstream of the promoter with respect to the direction of the transcription starting at the transcription initiation site.

The term "constitutive promoter" refers to an unregulated promoter that allows for continual transcription of the nucleic acid sequence it is operably linked to.

As used herein, the term "operably linked" refers to a linkage of polynucleotide elements in a functional relationship. A nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence. For instance, a promoter, or rather a transcription regulatory sequence, is operably linked to a coding sequence if it affects the transcription of the coding sequence. Operably linked means that the DNA sequences being linked are typically contiguous. The nucleotide sequence associated with the promoter sequence may be of homologous or heterologous origin with respect to the plant to be transformed. The sequence also may be entirely or partially synthetic. Regardless of the origin, the nucleic acid sequence associated with the promoter sequence will be expressed or silenced in accordance with promoter properties to which it is linked after binding to the polypeptide of an embodiment herein. The associated nucleic acid may code for a protein that is desired to be expressed or suppressed throughout the organism at all times or, alternatively, at a specific time or in specific tissues, cells, or cell compartment. Such nucleotide sequences particularly encode proteins conferring desirable phenotypic traits to the host cells or organism altered or transformed therewith. More particularly, the associated nucleotide sequence leads to the production of drimenol or a mixture comprising drimenol and one or more terpenes in the cell or organism. Particularly, the nucleotide sequence encodes a polypeptide having drimenol synthase activity.

"Target peptide" refers to an amino acid sequence which targets a protein, or polypeptide to intracellular organelles, i.e., mitochondria, or plastids, or to the extracellular space (secretion signal peptide). A nucleic acid sequence encoding a target peptide may be fused to the nucleic acid sequence encoding the amino terminal end, e.g., N-terminal end, of the protein or polypeptide, or may be used to replace a native targeting polypeptide.

The term "primer" refers to a short nucleic acid sequence that is hybridized to a template nucleic acid sequence and is used for polymerization of a nucleic acid sequence complementary to the template.

As used herein, the term "host cell" or "transformed cell" refers to a cell (or organism) altered to harbor at least one nucleic acid molecule, for instance, a recombinant gene encoding a desired protein or nucleic acid sequence which upon transcription yields a drimenol synthase protein useful to produce drimenol or a mixture comprising drimenol and one or more terpenes. The host cell is particularly a bacterial cell, a fungal cell or a plant cell. The host cell may contain a recombinant gene which has been integrated into the nuclear or organelle genomes of the host cell. Alternatively, the host may contain the recombinant gene extra-chromosomally.

Homologous sequences include orthologous or paralogous sequences. Methods of identifying orthologs or paralogs including phylogenetic methods, sequence similarity and hybridization methods are known in the art and are described herein.

Paralogs result from gene duplication that gives rise to two or more genes with similar sequences and similar functions. Paralogs typically cluster together and are formed by duplications of genes within related plant species. Paralogs are found in groups of similar genes using pair-wise Blast analysis or during phylogenetic analysis of gene families using programs such as CLUSTAL. In paralogs, consensus sequences can be identified characteristic to sequences within related genes and having similar functions of the genes.

Orthologs, or orthologous sequences, are sequences similar to each other because they are found in species that descended from a common ancestor. For instance, plant species that have common ancestors are known to contain many enzymes that have similar sequences and functions. The skilled artisan can identify orthologous sequences and predict the functions of the orthologs, for example, by constructing a phylogenetic tree for a gene family of one species using CLUSTAL or BLAST programs. A method for identifying or confirming similar functions among homologous sequences is to compare the transcript profiles of host cells or organisms, such as plants, overexpressing or lacking (in knockouts/knockdowns) related polypeptides. The skilled person will understand that genes having similar transcript profiles, with greater than 50% regulated transcripts in common, or with greater than 70% regulated transcripts in common, or greater than 90% regulated transcripts in common will have similar functions. Homologs, paralogs, orthologs and any other variants of the sequences herein are expected to function in a similar manner by causing host cells or organisms, such as plants, to produce drimenol synthase proteins.

The term "selectable marker" refers to any gene which upon expression may be used to select a cell or cells that include the selectable marker. Examples of selectable markers are described below. The skilled artisan will know that different antibiotic, fungicide, auxotrophic or herbicide selectable markers are applicable to different target species.

"Drimenol" for purposes of this application refers to (-)-drimenol (CAS: 468-68-8) (see also Figures 1-4).

The term "organism" refers to any non-human multicellular or unicellular organism such as a plant, or a microorganism. Particularly, a micro-organism is a bacterium, a yeast, an alga or a fungus.

The term "plant" is used interchangeably to include plant cells including plant protoplasts, plant tissues, plant cell tissue cultures giving rise to regenerated plants, or parts of plants, or plant organs such as roots, stems, leaves, flowers, pollen, ovules, embryos, fruits and the like. Any plant can be used to carry out the methods of an embodiment herein.

A particular organism or cell is meant to be "capable of producing FPP" when it produces FPP naturally or when it does not produce FPP naturally but is transformed to produce FPP, either prior to the transformation with a nucleic acid as described herein or together with said nucleic acid. Organisms or cells transformed to produce a higher amount of FPP than the naturally occurring organism or cell are also encompassed by the "organisms or cells capable of producing FPP".

For the descriptions herein and the appended claims, the use of "or" means "and/or" unless stated otherwise. Similarly, "comprise," "comprises," "comprising", "include," "includes," and "including" are interchangeable and not intended to be limiting.

It is to be further understood that where descriptions of various embodiments use the term "comprising," those skilled in the art would understand that in some specific instances, an embodiment can be alternatively described using language "consisting essentially of" or "consisting of."

Detailed Description

The present invention particularly refers to the following embodiments:

1. A method for producing drimenol or a mixture comprising drimenol comprising:

a. contacting an acyclic farnesyl diphosphate (FPP) precursor with a polypeptide having sesquiterpene synthase activity, wherein the polypeptide comprises an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 2 or comprises the amino acid sequence of SEQ ID NO: 2, to produce drimenol or a mixture comprising drimenol and one or more terpenes; and

b. optionally isolating the drimenol.

2. The method of embodiment 1, comprising transforming a host cell or non-human host organism with a nucleic acid encoding a polypeptide having drimenol synthase activity, wherein the polypeptide comprises an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 2 or comprises SEQ ID NO: 2.

3. The method of one of the preceding embodiments, further comprising culturing a non-human host organism or a host cell capable of producing FPP and transformed to express a polypeptide, wherein the polypeptide comprises an amino acid sequence having at least 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, 99%, or 100% sequence identity to SEQ ID NO: 2 under conditions that allow for the production of the polypeptide.

4. The method of embodiment 1, wherein the drimenol is isolated.

5. The method of one of the preceding embodiments, comprising contacting the drimenol with at least one enzyme to produce a drimenol derivative.

6. The method of one of the preceding embodiments, comprising converting the drimenol to a drimenol derivative using chemical synthesis or biochemical synthesis.

7. An isolated polypeptide having sesquiterpene synthase activity comprising an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 2 or comprising the amino acid sequence of SEQ ID NO: 2.

8. An isolated nucleic acid molecule

a. comprising a nucleotide sequence encoding the polypeptide of embodiment 7;

b. comprising a nucleotide sequence having at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 3; or

c. comprising the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 3.

9. A vector comprising

a. the nucleic acid molecule of embodiment 8; or

b. a nucleic acid encoding the polypeptide of embodiment 7.

10. The vector of embodiment 9, wherein the vector is a prokaryotic vector, viral vector or a eukaryotic vector.

11. The vector of one of the preceding embodiments, where the vector is an expression vector.

12. A host cell or a non-human host organism comprising

a. the isolated nucleic acid of embodiment 8; or

b. the vector of any one of embodiments 9 to 11.

13. The method of any one of embodiments 2 or 3, wherein the cell is a prokaryotic cell.

14. The method of embodiment 13, wherein the prokaryotic cell is a bacterial cell.

15. The method of embodiment 14, wherein the bacterial cell is E. coli.

16. The method of any one of embodiments 2 or 3, wherein the cell is a eukaryotic cell.

17. The method of embodiment 16, wherein the eukaryotic cell is a yeast cell or a plant cell.

18. The method of embodiment 17, wherein the yeast cell is Saccharomyces cerevisiae.

19. Use of the polypeptide of embodiment 7 for producing drimenol or a mixture comprising drimenol and one or more terpenes.

20. The use of embodiment 19, wherein the mixture comprises drimenol and nerolidol.

21. The method of embodiment 1, wherein the mixture comprises drimenol and nerolidol.

22. The method of embodiment 21, wherein the drimenol and/or the nerolidol produced is isolated.

Additionally, provided herein is a nucleic acid molecule comprising a nucleotide sequence having at least 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 1 or SEQ ID NO: 3 or comprising the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 3 or the reverse complement thereof.

According to one embodiment, the nucleic acid molecule consists of a nucleotide sequence SEQ ID NO: 1 or SEQ ID NO: 3 or the reverse complement thereof.
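For readers unfamiliar with the term, the reverse complement of a DNA sequence is obtained by complementing each base (A with T, C with G) and reading the result in the opposite direction. A minimal Python sketch follows; the function name and the example sequence are hypothetical and are not SEQ ID NO: 1 or SEQ ID NO: 3.

```python
# Translation table that maps each base to its Watson-Crick complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Complement every base, then reverse the strand orientation."""
    return dna.translate(_COMPLEMENT)[::-1]


# Hypothetical example sequence, for illustration only.
print(reverse_complement("ATGGCC"))  # GGCCAT
```

Applying the function twice returns the original sequence, which is a convenient sanity check.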

In one embodiment, the nucleic acid of an embodiment herein can be either present naturally in Paeonia plants or in other plant species, or be obtained by modifying SEQ ID NO: 1 or SEQ ID NO: 3 or the reverse complement thereof.

In another embodiment, the nucleic acid is isolated or is derived from a plant of the Paeoniaceae family. In a further embodiment the nucleic acid is isolated or derived from Paeonia anomala.

Further provided is a nucleotide sequence obtained by modifying SEQ ID NO: 1 or SEQ ID NO: 3 or the reverse complement thereof which encompasses any sequence that has been obtained by modifying the sequence of SEQ ID NO: 1 or SEQ ID NO: 3, or of the reverse complement thereof using any method known in the art, for example, by introducing any type of mutations such as deletion, insertion and/or substitution mutations. The nucleic acids comprising a sequence obtained by mutation of SEQ ID NO: 1 or SEQ ID NO: 3 or the reverse complement thereof are encompassed by an embodiment herein, provided that the sequences they comprise share at least the defined sequence identity of SEQ ID NO: 1 or SEQ ID NO: 3 or the reverse complement thereof and provided that they encode a polypeptide having a drimenol synthase activity, as defined in any of the above embodiments. Mutations may be any kind of mutations of these nucleic acids, for example, point mutations, deletion mutations, insertion mutations and/or frame shift mutations of one or more nucleotides of the DNA sequence of SEQ ID NO: 1 or SEQ ID NO: 3. In one embodiment, the nucleic acid of an embodiment herein may be truncated provided that it encodes a polypeptide as described herein.

A variant nucleic acid may be prepared in order to adapt its nucleotide sequence to a specific expression system. For example, bacterial expression systems are known to more efficiently express polypeptides if amino acids are encoded by particular codons.

Due to the degeneracy of the genetic code, more than one codon may encode the same amino acid, so multiple nucleic acid sequences can code for the same protein or polypeptide; all these DNA sequences are encompassed by an embodiment herein. Where appropriate, the nucleic acid sequences encoding the drimenol synthase may be optimized for increased expression in the host cell. For example, nucleotides of an embodiment herein may be synthesized using codons particular to a host for improved expression.
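By way of illustration only, the degeneracy described above can be sketched in a few lines of Python. The sketch assumes the standard genetic code; the function names and the short peptide and DNA strings are hypothetical and are not taken from the sequence listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def synonymous_codons(amino_acid):
    """Return all codons that encode the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


def count_encodings(peptide):
    """Count the distinct DNA sequences that encode the peptide."""
    total = 1
    for aa in peptide:
        total *= len(synonymous_codons(aa))
    return total


def translate(dna):
    """Translate a DNA coding sequence (length divisible by 3) to a peptide."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Two different (hypothetical) DNA sequences encoding the same tripeptide.
print(translate("ATGTTAGAT"), translate("ATGCTGGAC"))
print(count_encodings("MLD"))
```

Both example sequences translate to the same peptide, which is the property that codon optimization for a given host relies on. Choosing among synonymous codons for a particular host would additionally require a host-specific codon usage table, which is not shown here.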

In one embodiment provided herein is an isolated, recombinant or synthetic nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 3 encoding for a polypeptide having drimenol synthase activity comprising the amino acid sequence of SEQ ID NO: 2 or fragments thereof that catalyze production of drimenol or a mixture comprising drimenol and one or more terpenes in a cell from a FPP precursor.

Provided herein are also cDNA, genomic DNA and RNA sequences. Any nucleic acid sequence encoding the drimenol synthase or variants thereof is also referred herein as a drimenol synthase encoding sequence. According to one embodiment, the nucleic acid of SEQ ID NO: 1 or SEQ ID NO: 3 is the coding sequence of a drimenol synthase gene encoding a drimenol synthase obtained as described in the Examples.

A fragment of a polynucleotide of SEQ ID NO: 1 or SEQ ID NO: 3 refers to a stretch of contiguous nucleotides of the polynucleotide of an embodiment herein that is particularly at least 15 bp, at least 30 bp, at least 40 bp, at least 50 bp and/or at least 60 bp in length. Particularly the fragment of a polynucleotide comprises at least 25, more particularly at least 50, more particularly at least 75, more particularly at least 100, more particularly at least 150, more particularly at least 200, more particularly at least 300, more particularly at least 400, more particularly at least 500, more particularly at least 600, more particularly at least 700, more particularly at least 800, more particularly at least 900, more particularly at least 1000 contiguous nucleotides of the polynucleotide of an embodiment herein. Without being limited, the fragment of the polynucleotides herein may be used as a PCR primer, and/or as a probe, or for anti-sense gene silencing or RNAi.

It is clear to the person skilled in the art that genes, including the polynucleotides of an embodiment herein, can be cloned on the basis of the available nucleotide sequence information, such as found in the attached sequence listing, by methods known in the art. These include e.g. the design of DNA primers representing the flanking sequences of such a gene, of which one is generated in sense orientation and initiates synthesis of the sense strand and the other is created in reverse complementary fashion and generates the antisense strand. Thermostable DNA polymerases such as those used in polymerase chain reaction are commonly used to carry out such experiments. Alternatively, DNA sequences representing genes can be chemically synthesized and subsequently introduced in DNA vector molecules that can be multiplied by e.g. compatible bacteria such as e.g. E. coli.
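As a minimal sketch of the primer design principle described above, the following Python snippet derives a sense primer from the 5' end of a template and a reverse-complementary primer from its 3' end. The template string, the primer length and the function names are hypothetical placeholders; they are not SEQ ID NO: 1 or SEQ ID NO: 3, and practical primer design would also consider melting temperature, GC content and secondary structure.

```python
# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def flanking_primers(template, length=20):
    """Return (forward, reverse) primers taken from the ends of the template.

    The forward primer is in sense orientation; the reverse primer is the
    reverse complement of the 3' end, so it primes the antisense strand.
    """
    if len(template) < 2 * length:
        raise ValueError("template too short for the requested primer length")
    forward = template[:length]
    reverse = reverse_complement(template[-length:])
    return forward, reverse


# Hypothetical 21-nt template used only to show the orientation of each primer.
fwd, rev = flanking_primers("ATGGCCAAATTTGGGCCCTAA", length=6)
print(fwd, rev)
```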

In a related embodiment provided herein, PCR primers and/or probes for detecting nucleic acid sequences encoding a drimenol synthase are provided. The skilled artisan will be aware of methods to synthesize degenerate or specific PCR primer pairs to amplify a nucleic acid sequence encoding the drimenol synthase or fragments thereof, based on SEQ ID NO: 1 or SEQ ID NO: 3. A detection kit for nucleic acid sequences encoding the drimenol synthase may include primers and/or probes specific for nucleic acid sequences encoding the drimenol synthase, and an associated protocol to use the primers and/or probes to detect nucleic acid sequences encoding the drimenol synthase in a sample. Such detection kits may be used to determine whether a plant, organism or cell has been modified, i.e., transformed with a sequence encoding the drimenol synthase.

To test a function of variant DNA sequences according to an embodiment herein, the sequence of interest is operably linked to a selectable or screenable marker gene and expression of the reporter gene is tested in transient expression assays with protoplasts or in stably transformed plants. The skilled artisan will recognize that DNA sequences capable of driving expression are built as modules. Accordingly, expression levels from shorter DNA fragments may be different than the one from the longest fragment and may be different from each other. Provided herein are also functional equivalents of the nucleic acid sequence coding the drimenol synthase proteins provided herein, i.e., nucleotide sequences that hybridize under stringent conditions to the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 3.

The skilled artisan will be aware of methods to identify homologous sequences in other organisms and methods to determine the percentage of sequence identity between homologous sequences. Such newly identified DNA molecules then can be sequenced and the sequence can be compared with the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 3.

The percentage of identity between two peptide or nucleotide sequences is a function of the number of amino acids or nucleotide residues that are identical in the two sequences when an alignment of these two sequences has been generated. Identical residues are defined as residues that are the same in the two sequences in a given position of the alignment. The percentage of sequence identity, as used herein, is calculated from the optimal alignment by taking the number of residues identical between two sequences, dividing it by the total number of residues in the shortest sequence and multiplying by 100. The optimal alignment is the alignment in which the percentage of identity is the highest possible. Gaps may be introduced into one or both sequences in one or more positions of the alignment to obtain the optimal alignment. These gaps are then taken into account as non-identical residues for the calculation of the percentage of sequence identity. Alignment for the purpose of determining the percentage of amino acid or nucleic acid sequence identity can be achieved in various ways using computer programs and for instance publicly available computer programs available on the world wide web. Preferably, the BLAST program (Tatiana et al., FEMS Microbiol Lett., 1999, 174:247-250) set to the default parameters, available from the National Center for Biotechnology Information (NCBI) website at ncbi.nlm.nih.gov/BLAST/bl2seq/wblast2.cgi, can be used to obtain an optimal alignment of protein or nucleic acid sequences and to calculate the percentage of sequence identity.
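The calculation defined above can be illustrated with a short Python sketch. It assumes that the two sequences have already been aligned (for example by BLAST) and are supplied as equal-length strings with "-" marking gaps; it does not itself search for the optimal alignment. The function name and the example strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage identity of two pre-aligned sequences.

    Identical residues are counted position by position; a position holding
    a gap in either sequence counts as non-identical. The count is divided
    by the number of residues in the shortest (ungapped) sequence.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    shortest = min(
        len(aligned_a.replace(gap, "")), len(aligned_b.replace(gap, ""))
    )
    if shortest == 0:
        raise ValueError("sequences must contain at least one residue")
    return 100.0 * identical / shortest


# Hypothetical alignment: 6 identical positions, shortest sequence has 7 residues.
print(round(percent_identity("ACGTACGT", "ACG-ACGA"), 1))
```

In this example the shorter sequence has 7 residues and 6 aligned positions are identical, so the sketch returns 6/7 x 100, i.e. about 85.7%.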

A related embodiment provided herein provides a nucleic acid sequence which is complementary to the nucleic acid sequence according to SEQ ID NO: 1 or SEQ ID NO: 3 such as inhibitory RNAs, or nucleic acid sequence which hybridizes under stringent conditions to at least part of the nucleotide sequence according to SEQ ID NO: 1 or SEQ ID NO: 3. An alternative embodiment of an embodiment herein provides a method to alter gene expression in a host cell. For instance, the polynucleotide of an embodiment herein may be enhanced or overexpressed or induced in certain contexts (e.g. upon exposure to a certain temperature or culture conditions) in a host cell or host organism.

Alteration of expression of a polynucleotide provided herein may also result in ectopic expression which is a different expression pattern in an altered and in a control or wild-type organism. Alteration of expression occurs from interactions of polypeptide of an embodiment herein with exogenous or endogenous modulators, or as a result of chemical modification of the polypeptide. The term also refers to an altered expression pattern of the polynucleotide of an embodiment herein which is altered below the detection level or completely suppressed activity.

In one embodiment, provided herein is also an isolated, recombinant or synthetic polynucleotide encoding a polypeptide or variant polypeptide provided herein.

In one embodiment is provided an isolated nucleic acid molecule encoding a polypeptide having drimenol synthase activity and comprising an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 2 or comprising the amino acid sequence of SEQ ID NO: 2.

In one embodiment provided herein is an isolated polypeptide having sesquiterpene synthase activity and/or drimenol synthase activity and comprising an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 2 or comprising the amino acid sequence of SEQ ID NO: 2.

According to one embodiment, the polypeptide consists of the amino acid sequence of SEQ ID NO: 2. In one embodiment, the polypeptide of an embodiment herein can be present naturally in Paeonia plants or in other plant species, or comprises an amino acid sequence that is a variant of SEQ ID NO: 2, either obtained by genetic engineering or found naturally in Paeonia plants or in other plant species.

According to another embodiment, the polypeptide is isolated or derived from a plant of the Paeoniaceae family. In a further embodiment, the polypeptide is isolated or derived from Paeonia anomala.

In one embodiment, the at least one polypeptide having sesquiterpene synthase activity and/or a drimenol synthase activity used in any of the herein-described embodiments or encoded by the nucleic acid used in any of the herein-described embodiments comprises an amino acid sequence that is a variant of SEQ ID NO: 2, obtained by genetic engineering. In one embodiment the polypeptide comprises an amino acid sequence encoded by a nucleotide sequence that has been obtained by modifying SEQ ID NO: 1 or SEQ ID NO: 3 or the reverse complement thereof.

Polypeptides are also meant to include variants and truncated polypeptides provided that they have sesquiterpene synthase activity and/or drimenol synthase activity.

According to another embodiment, the at least one polypeptide having a drimenol synthase activity used in any of the herein-described embodiments or encoded by the nucleic acid used in any of the herein-described embodiments comprises an amino acid sequence that is a variant of SEQ ID NO: 2, obtained by genetic engineering, provided that said variant has drimenol synthase activity and has the required percentage of identity to SEQ ID NO: 2 as described herein.

According to another embodiment, the at least one polypeptide having a drimenol synthase activity used in any of the herein-described embodiments or encoded by the nucleic acid used in any of the herein-described embodiments is a variant of SEQ ID NO: 2 that can be found naturally in other organisms, such as other plant species, provided that it has a drimenol synthase activity. As used herein, the polypeptide includes a polypeptide or peptide fragment that encompasses the amino acid sequences identified herein, as well as truncated or variant polypeptides provided that they have sesquiterpene synthase activity and/or a drimenol synthase activity and that they share at least the defined percentage of identity with the corresponding fragment of SEQ ID NO: 2. Examples of variant polypeptides are naturally occurring proteins that result from alternate mRNA splicing events or from proteolytic cleavage of the polypeptides described herein. Variations attributable to proteolysis include, for example, differences in the N- or C- termini upon expression in different types of host cells, due to proteolytic removal of one or more terminal amino acids from the polypeptides of an embodiment herein. Polypeptides encoded by a nucleic acid obtained by natural or artificial mutation of a nucleic acid of an embodiment herein, as described thereafter, are also encompassed by an embodiment herein.

Polypeptide variants resulting from a fusion of additional peptide sequences at the amino and carboxyl terminal ends can also be used in the methods of an embodiment herein. In particular such a fusion can enhance expression of the polypeptides, be useful in the purification of the protein or improve the enzymatic activity of the polypeptide in a desired environment or expression system. Such additional peptide sequences may be signal peptides, for example. Another aspect encompasses methods using variant polypeptides, such as those obtained by fusion with other oligo- or polypeptides and/or those which are linked to signal peptides. Polypeptides resulting from a fusion with another functional protein, such as another protein from the terpene biosynthesis pathway, can also be advantageously used in the methods of an embodiment herein.

A variant may also differ from the polypeptide of an embodiment herein by attachment of modifying groups which are covalently or non-covalently linked to the polypeptide backbone. The variant also includes a polypeptide which differs from the polypeptide provided herein by introduced N-linked or O-linked glycosylation sites, and/or an addition of cysteine residues. The skilled artisan will recognize how to modify an amino acid sequence and preserve biological activity.

In addition to the gene sequences shown in the sequences disclosed herein, it will be apparent for the person skilled in the art that DNA sequence polymorphisms may exist within a given population, which may lead to changes in the amino acid sequence of the polypeptides disclosed herein. Such genetic polymorphisms may exist in cells from different populations or within a population due to natural allelic variation. Allelic variants may also include functional equivalents.

Further embodiments also relate to the molecules derived by such sequence polymorphisms from the concretely disclosed nucleic acids. These natural variations usually bring about a variance of about 1 to 5% in the nucleotide sequence of a gene or in the amino acid sequence of the polypeptides disclosed herein. As mentioned above, the nucleic acid encoding the polypeptide or variants thereof of an embodiment herein is a useful tool to modify non-human host organisms or cells and to modify non-human host organisms or cells intended to be used in the methods described herein.

An embodiment provided herein provides amino acid sequences of drimenol synthase proteins including orthologs and paralogs as well as methods for identifying and isolating orthologs and paralogs of the drimenol synthases in other organisms. Particularly, so identified orthologs and paralogs of the drimenol synthase retain drimenol synthase activity and are capable of producing drimenol or a mixture comprising drimenol and one or more terpenes starting from an acyclic terpene pyrophosphate precursor, e.g. FPP.

The polypeptide to be contacted with an acyclic terpene pyrophosphate, e.g. FPP, in vitro can be obtained by extraction from any organism expressing it, using standard protein or enzyme extraction technologies. If the host organism is a unicellular organism or cell releasing the polypeptide of an embodiment herein into the culture medium, the polypeptide may simply be collected from the culture medium, for example by centrifugation, optionally followed by washing steps and re-suspension in suitable buffer solutions. If the organism or cell accumulates the polypeptide within its cells, the polypeptide may be obtained by disruption or lysis of the cells and optionally further extraction of the polypeptide from the cell lysate. Intact cells, the cell lysate or the extracted polypeptide can be used to contact the acyclic terpene pyrophosphate for production of a terpene or a mixture of terpenes.

The polypeptide having a drimenol synthase activity, either in an isolated form or together with other proteins, for example in a crude protein extract obtained from cultured cells or microorganisms, may then be suspended in a buffer solution at optimal pH. If adequate, salts, DTT, inorganic cations and other kinds of enzymatic co-factors, may be added in order to optimize enzyme activity. The precursor FPP is added to the polypeptide suspension, which is then incubated at optimal temperature, for example between 15 and 40°C, particularly between 25 and 35°C, more particularly at 30°C. After incubation, the drimenol produced may be isolated from the incubated solution by standard isolation procedures, such as solvent extraction and distillation, optionally after removal of polypeptides from the solution. According to another embodiment, the at least one polypeptide having a drimenol synthase activity can be used for production of drimenol or mixtures of terpenes comprising drimenol. In another embodiment, the mixture comprising drimenol may also comprise nerolidol.

One particular tool to carry out the method of an embodiment herein is the polypeptide itself as described herein.

According to a particular embodiment, the polypeptide is capable of producing a mixture of sesquiterpenes comprising drimenol. In a further embodiment, the synthase is capable of producing a mixture of sesquiterpenes comprising drimenol, wherein drimenol represents at least 20%, particularly at least 30%, particularly at least 35%, particularly at least 90%, particularly at least 95%, more particularly at least 98% of the sesquiterpenes produced. In another aspect provided here, the drimenol is produced with greater than or equal to 95%, more particularly 98% selectivity.

According to another embodiment, the sesquiterpene synthase is capable of producing a mixture of sesquiterpenes comprising drimenol and nerolidol. In a further embodiment, the synthase is capable of producing a mixture of sesquiterpenes comprising drimenol and nerolidol, wherein nerolidol represents at least 5% to about 80%, particularly at least 10% to about 80%, particularly at least 15% to about 80%, particularly at least 16% to about 80%, particularly at least 50% to about 80%, particularly at least 60% to about 80%, particularly at least 70% to about 80%, particularly about 79% of the sesquiterpenes produced.
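For illustration, the percentages referred to above can be computed as each product's share of the total sesquiterpenes produced. The Python sketch below assumes that product amounts are available as GC peak areas and that all products have equal detector response; the peak areas and function names are hypothetical and are not experimental results.

```python
def product_distribution(peak_areas):
    """Return each product's share (in %) of the total sesquiterpenes."""
    total = sum(peak_areas.values())
    if total <= 0:
        raise ValueError("total peak area must be positive")
    return {name: 100.0 * area / total for name, area in peak_areas.items()}


def meets_threshold(peak_areas, product, minimum_percent):
    """Check whether a product represents at least the given percentage."""
    return product_distribution(peak_areas)[product] >= minimum_percent


# Hypothetical peak areas (arbitrary units), for illustration only.
areas = {"drimenol": 300.0, "nerolidol": 600.0, "other sesquiterpenes": 100.0}
print(product_distribution(areas))
print(meets_threshold(areas, "drimenol", 20.0))
```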

The functionality or activity of any sesquiterpene synthase or drimenol synthase protein, variant or fragment, may be determined using various methods. For example, transient or stable overexpression in plant, bacterial or yeast cells can be used to test whether the protein has activity, i.e., produces one or more sesquiterpenes such as drimenol or a mixture of sesquiterpenes comprising drimenol or comprising drimenol and nerolidol from an acyclic terpene pyrophosphate precursor, e.g. FPP precursor. Drimenol synthase activity may be assessed in a microbial expression system, such as the assay described in Example 2 herein on the production of drimenol, indicating functionality. A variant or derivative of a drimenol synthase polypeptide of an embodiment herein retains an ability to produce drimenol or a mixture comprising drimenol from FPP precursors. Amino acid sequence variants of the drimenol synthases provided herein may have additional desirable biological functions including, e.g., altered substrate utilization, reaction kinetics, product distribution or other alterations.

The ability of a polypeptide to catalyze the synthesis of a particular sesquiterpene (for example drimenol) can be simply confirmed, for example, by performing the enzyme assay as detailed in Examples 1 and 2.

Further provided is at least one vector comprising the nucleic acid molecules described herein.

Also provided herein is a vector selected from the group of a prokaryotic vector, viral vector and a eukaryotic vector.

Further provided here is a vector that is an expression vector.

In one embodiment, several drimenol synthase encoding nucleic acid sequences are co-expressed in a single host, particularly under control of different promoters. In another embodiment, several drimenol synthase protein encoding nucleic acid sequences can be present on a single transformation vector or be co-transformed at the same time using separate vectors and selecting transformants comprising both chimeric genes. Similarly, one or more drimenol synthase encoding genes may be expressed in a single plant, cell, organism, or microorganism together with other chimeric genes.

The nucleic acid sequences of an embodiment herein encoding drimenol synthase proteins can be inserted in expression vectors and/or be contained in chimeric genes inserted in expression vectors, to produce drimenol synthase proteins in a host cell or non-human host organism. The vectors for inserting transgenes into the genome of host cells are well known in the art and include plasmids, viruses, cosmids and artificial chromosomes. Binary or co-integration vectors into which a chimeric gene is inserted can also be used for transforming host cells.

An embodiment provided herein provides recombinant expression vectors comprising a nucleic acid sequence of a drimenol synthase gene, or a chimeric gene comprising a nucleic acid sequence of a drimenol synthase gene, operably linked to associated nucleic acid sequences such as, for instance, promoter sequences. For example, a chimeric gene comprising a nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 3 or a variant thereof may be operably linked to a promoter sequence suitable for expression in plant cells, bacterial cells or fungal cells, optionally linked to a 3' non-translated nucleic acid sequence. Alternatively, the promoter sequence may already be present in a vector so that the nucleic acid sequence which is to be transcribed is inserted into the vector downstream of the promoter sequence. Vectors can be engineered to have an origin of replication, a multiple cloning site, and a selectable marker.

In one embodiment, an expression vector comprising a nucleic acid as described herein can be used as a tool for transforming non-human host organisms or host cells suitable to carry out the method of an embodiment herein in vivo.

The expression vectors provided herein may be used in the methods for preparing a genetically transformed non-human host organism and/or host cell, in non-human host organisms and/or host cells harboring the nucleic acids of an embodiment herein and in the methods for making polypeptides having a drimenol synthase activity, as described herein.

Recombinant non-human host organisms and host cells transformed to harbor at least one nucleic acid of an embodiment herein so that it heterologously expresses or over-expresses at least one polypeptide of an embodiment herein are also very useful tools to carry out the method of an embodiment herein. Such non-human host organisms and host cells are therefore provided herein.

In one embodiment is provided a host cell or non-human host organism comprising at least one of the nucleic acid molecules described herein or comprising at least one vector comprising at least one of the nucleic acid molecules.

A nucleic acid according to any of the above-described embodiments can be used to transform the non-human host organisms and cells and the expressed polypeptide can be any of the above-described polypeptides.

In one embodiment, the non-human host organism or host cell is a prokaryotic cell. In another embodiment, the non-human host organism or host cell is a bacterial cell. In a further embodiment, the non-human host organism or host cell is Escherichia coli.

In one embodiment, the non-human host organism or host cell is a eukaryotic cell. In another embodiment, the non-human host organism or host cell is a yeast cell. In a further embodiment, the non-human host organism or cell is Saccharomyces cerevisiae.

In a further embodiment, the non-human organism or host cell is a plant cell.

In one embodiment the non-human host organism or host cell expresses a polypeptide, provided that the organism or cell is transformed to harbor a nucleic acid encoding said polypeptide, this nucleic acid is transcribed to mRNA and the polypeptide is found in the host organism or cell.

Suitable methods to transform a non-human host organism or a host cell have been previously described and are also provided herein.

To carry out an embodiment herein in vivo, the host organism or host cell is cultivated under conditions conducive to the production of sesquiterpenes such as drimenol or a mixture comprising drimenol. Accordingly, if the host is a transgenic plant, optimal growth conditions can be provided, such as optimal light, water and nutrient conditions, for example. If the host is a unicellular organism, conditions conducive to the production of drimenol or a mixture comprising drimenol may comprise addition of suitable cofactors to the culture medium of the host. In addition, a culture medium may be selected, so as to maximize drimenol synthesis. Examples of optimal culture conditions are described in a more detailed manner in the Examples.

Non-human host organisms suitable to carry out the method of an embodiment herein in vivo may be any non-human multicellular or unicellular organisms. In one embodiment, the non-human host organism used to carry out an embodiment herein in vivo is a plant, a prokaryote or a fungus. Any plant, prokaryote or fungus can be used. Particularly useful plants are those that naturally produce high amounts of terpenes. In another embodiment the non-human host organism used to carry out the method of an embodiment herein in vivo is a microorganism. Any microorganism can be used, for example, the microorganism can be a bacterium or yeast, such as E. coli or Saccharomyces cerevisiae.

Some of these organisms do not produce FPP naturally. To be suitable to carry out the method of an embodiment herein, organisms or cells that do not produce an acyclic terpene pyrophosphate precursor, e.g. FPP, naturally are transformed to produce said precursor. They can be so transformed either before the modification with the nucleic acid described according to any of the above embodiments or simultaneously, as explained above. Methods to transform organisms, for example microorganisms, so that they produce an acyclic terpene pyrophosphate precursor, e.g. FPP, are already known in the art.

Isolated higher eukaryotic cells can also be used, instead of complete organisms, as hosts to carry out the method of an embodiment herein in vivo. Suitable eukaryotic cells may be any non-human cell, such as plant or fungal cells.

Further provided herein is a method of producing drimenol or a mixture comprising drimenol comprising:

i) contacting an acyclic terpene pyrophosphate, particularly farnesyl diphosphate (FPP) with a polypeptide having drimenol synthase activity and comprising an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 2 or comprising the amino acid sequence of SEQ ID NO: 2 to produce drimenol or a mixture comprising drimenol and one or more terpenes; and

ii) optionally isolating the drimenol.

In one aspect, the drimenol is isolated.

In another aspect provided here, the drimenol is produced with greater than or equal to 20%, 30%, 35%, 40%, 50%, 60%, 80%, or 90% or even 95% selectivity of the sesquiterpenes produced.

Further provided here is a method comprising transforming a host cell or a non-human host organism with a nucleic acid encoding a polypeptide having drimenol synthase activity and comprising an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 2 or comprising the amino acid sequence of SEQ ID NO: 2.

In one embodiment, a method provided herein comprises cultivating a non-human host organism or a host cell capable of producing FPP and transformed to express a polypeptide wherein the polypeptide comprises a sequence of amino acids that has at least 75%, 80%, 85%, 90%, 95%, 98%, 99% or 100% sequence identity to SEQ ID NO: 1 or SEQ ID NO: 3 under conditions that allow for the production of the polypeptide.

In another embodiment, a method provided herein comprises contacting a sesquiterpene such as drimenol with at least one enzyme to produce a sesquiterpene derivative. Examples of such derivatives of drimenol include, but are not limited to, drimenyl acetate (CAS 40266-93-1), drimenal (CAS 105426-71-9) and drimenic acid (CAS 111319-84-7).

According to another particular embodiment, the method of any of the above-described embodiments is carried out in vivo. In such a case, step a) comprises cultivating a non-human host organism or a host cell capable of producing FPP and transformed to express at least one polypeptide comprising an amino acid sequence comprising SEQ ID NO: 2 or a functional variant thereof and having a drimenol synthase activity, under conditions conducive to the production of one or more sesquiterpenes such as drimenol or a mixture comprising drimenol. Drimenol may be the only product or may be part of a mixture of sesquiterpenes. In one embodiment, the mixture of sesquiterpenes comprises drimenol and nerolidol.

According to a further embodiment, the method further comprises, prior to step a), transforming a non-human organism or cell capable of producing FPP with at least one nucleic acid encoding a polypeptide comprising an amino acid comprising SEQ ID NO: 2 or encoding a polypeptide having drimenol synthase activity and comprising an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity to SEQ ID NO: 2, so that said organism expresses said polypeptide.

These embodiments of an embodiment herein are particularly advantageous since it is possible to carry out the method in vivo without previously isolating the polypeptide. The reaction occurs directly within the organism or cell transformed to express said polypeptide.

An embodiment herein provides polypeptides of an embodiment herein to be used in a method to produce drimenol or a mixture comprising drimenol by contacting an FPP precursor with the polypeptides of an embodiment herein either in vitro or in vivo.

Further provided is the use of a polypeptide as described herein for producing drimenol or a mixture comprising drimenol and one or more terpenes or a mixture comprising drimenol and nerolidol. In one embodiment, the drimenol and/or nerolidol produced is isolated.

The following examples are illustrative only and are not intended to limit the scope of the claims and embodiments described herein.

Examples

Example 1

Paeonia anomala plant material sourcing and root transcriptome sequencing.

Paeonia anomala plant material was obtained from Datong in Qinghai, China. To establish whether Paeonia anomala contained drimenol, its roots were collected and extracted fresh with dichloromethane for chemical analysis. The extract was analyzed by GC-MS using the following parameters. An Agilent 6890 series GC system equipped with a DB1-ms column (30 m x 0.25 mm x 0.25 μm film thickness, P/N 122-0132; J&W Scientific Inc., Folsom, CA) and coupled with a 5975 series mass spectrometer was used. The carrier gas was helium at a constant flow of 0.7 mL/min. Injection was in split (1:5) mode with the injector temperature set at 250°C. The oven temperature was programmed from 50°C (5 min hold) to 300°C at 5°C/min, then to 340°C at 50°C/min and held for 3 min. Identification of products was based on mass spectra and retention index. The roots of Paeonia anomala contained a small amount of drimenol (Figure 5).
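The oven program above implies a fixed run time per injection. The following minimal sketch computes it from the stated hold times and ramp rates; the variable and function names are illustrative only, and instrument equilibration or cool-down time is not included.

```python
# Oven program taken from the text: start at 50 °C with a 5 min hold,
# ramp to 300 °C at 5 °C/min, then to 340 °C at 50 °C/min with a 3 min hold.
initial_temp_c = 50.0
initial_hold_min = 5.0
# Each ramp: (target temperature in °C, rate in °C/min, hold at target in min)
ramps = [(300.0, 5.0, 0.0), (340.0, 50.0, 3.0)]


def run_time_min(start_temp, start_hold, ramp_steps):
    """Total oven program time in minutes."""
    total = start_hold
    temp = start_temp
    for target, rate, hold in ramp_steps:
        total += (target - temp) / rate + hold  # ramp duration plus hold
        temp = target
    return total


total_min = run_time_min(initial_temp_c, initial_hold_min, ramps)
print(f"Oven program length: {total_min:.1f} min")
```

The breakdown is 5 min initial hold, 50 min for the first ramp, 0.8 min for the second ramp and 3 min final hold.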

Fresh roots of Paeonia anomala were used for transcriptome analysis. Total RNA was extracted using the Column Plant RNAout kit (TIANDZ, China). This total RNA was processed using the Illumina Total RNA-Seq technique and sequenced on an Illumina MiSeq sequencer. A total of 9 million paired-end reads of 2x251 bp were generated. The reads were assembled using the Trinity software (http://trinityrnaseq.sf.net/), yielding 26457 unigenes with an average size of 1109 bp. The unigenes were annotated by NCBI Blast (http://www.ncbi.nlm.nih.gov/) as well as InterProScan software (http://www.ebi.ac.uk/Tools/pfa/iprscan/). This approach provided the sequences for 7 new putative sesquiterpene synthases including PaTPS3. The enzymatic activity of PaTPS3 was evaluated as described in the following example.

Example 2

Functional expression and characterization of PaTPS3.

The total RNA extracted with the Column Plant RNAout kit was first reverse transcribed into cDNA using the Superscript III First-Strand Synthesis kit (Invitrogen, Shanghai, China). The resulting cDNA was then used as the template, and the forward primer (5'-ATGTCTGTCAAAGTTCCTCAATC-3') (SEQ ID NO: 4) and reverse primer (5'-TCACATTGCAATAGGATCGGTG-3') (SEQ ID NO: 5) were used to amplify the gene from the cDNA library of P. anomala.
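The relationship between the two primers and the PaTPS3 open reading frame can be checked with a short script. The sketch below compares the primers of SEQ ID NO: 4 and SEQ ID NO: 5 with the first and last 30 nucleotides of SEQ ID NO: 1, copied from the sequence listing; the helper name reverse_complement is illustrative and not part of the original disclosure.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


forward_primer = "ATGTCTGTCAAAGTTCCTCAATC"  # SEQ ID NO: 4
reverse_primer = "TCACATTGCAATAGGATCGGTG"   # SEQ ID NO: 5

# Terminal fragments of SEQ ID NO: 1 (first and last 30 nt of the cDNA).
cdna_5prime = "ATGTCTGTCAAAGTTCCTCAATCTCAGAAT"
cdna_3prime = "GCATTGTTCACCGATCCTATTGCAATGTGA"

# The forward primer should be identical to the start of the coding strand.
forward_ok = cdna_5prime.startswith(forward_primer)
# The reverse primer anneals to the coding strand, so its reverse complement
# should be identical to the end of the coding strand (including the stop codon).
reverse_ok = cdna_3prime.endswith(reverse_complement(reverse_primer))

print("forward primer matches 5' end:", forward_ok)
print("reverse primer matches 3' end:", reverse_ok)
```

Both checks pass for the sequences as listed, which is consistent with the primer pair amplifying the full open reading frame from the ATG start codon to the TGA stop codon.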

The sequence of PaTPS3 was codon-optimized according to the codon usage frequency of E. coli and synthesized. An NdeI restriction site was added to the 5' end of PaTPS3 and a KpnI restriction site was added to the 3' end. PaTPS3 was subcloned into the pJ401 (DNA 2.0) plasmid for subsequent expression in E. coli.
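Codon optimization changes the nucleotide sequence while leaving the encoded protein unchanged. The following minimal sketch illustrates this by translating the first 57 nucleotides of the native cDNA (SEQ ID NO: 1) and of the codon-optimized sequence (SEQ ID NO: 3), both copied from the sequence listing, using the standard genetic code. The function and variable names are illustrative only.

```python
from itertools import product

# Standard genetic code, built in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence whose length is a multiple of 3."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 57 nt of SEQ ID NO: 1 (native cDNA) and SEQ ID NO: 3 (codon optimized).
native_5prime = "ATGTCTGTCAAAGTTCCTCAATCTCAGAATGCTCCTACAGAGGTTGGACGTCGGTCC"
optimized_5prime = "ATGTCCGTTAAAGTTCCGCAAAGCCAAAATGCCCCTACCGAAGTTGGCCGTCGTTCC"

native_protein = translate(native_5prime)
optimized_protein = translate(optimized_5prime)
changed_codons = sum(
    native_5prime[i:i + 3] != optimized_5prime[i:i + 3]
    for i in range(0, len(native_5prime), 3)
)

print(native_protein)
print(optimized_protein)
print("codons changed:", changed_codons)
```

Both fragments translate to the same N-terminal residues as SEQ ID NO: 2, even though a number of codons differ between the two nucleotide sequences.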

KRX E. coli cells (Promega) were co-transformed with the plasmid pACYC/ScMVA, containing the genes encoding a heterologous mevalonate pathway, and the plasmid pJ401-PaTPS3. To construct the pACYC/ScMVA plasmid, we divided the eight biosynthetic genes into 2 synthetic operons referred to as the 'upper' and 'lower' mevalonate (MVA) pathway. As an 'upper' MVA pathway, we created a synthetic operon consisting of an acetoacetyl-CoA thiolase from E. coli encoded by atoB, a HMG-CoA synthase and a truncated version of HMG-CoA reductase from Saccharomyces cerevisiae encoded by ERG13 and ERG19, respectively. This operon transforms the primary metabolite acetyl-CoA into (R)-mevalonate. As a 'lower' mevalonate pathway, we created a second synthetic operon encoding a mevalonate kinase (ERG12, S. cerevisiae), a phosphomevalonate kinase (ERG8, S. cerevisiae), a phosphomevalonate decarboxylase (MVD1, S. cerevisiae), an isopentenyl diphosphate isomerase (idi, E. coli) and a farnesyl pyrophosphate (FPP) synthase (IspA, E. coli). Finally, a second FPP synthase from S. cerevisiae (ERG20) was introduced into the upper pathway operon to improve the conversion of the isoprenoid C5 units (IPP and DMAPP) into farnesyl pyrophosphate (FPP). Each operon was subcloned into one of the multiple-cloning sites of a low-copy expression plasmid under the control of a bacteriophage T7 promoter (pACYCDuet-1, Invitrogen).

The co-transformed cells were selected on LB-agar plates containing kanamycin (50 μg/mL final) and chloramphenicol (34 μg/mL final). Single colonies were used to inoculate 5 mL liquid LB medium containing kanamycin (25 μg/mL final) and chloramphenicol (34 μg/mL final). Cultures were incubated overnight at 37°C and 200 rpm shaking. The next day 6 mL of TB medium supplemented with the same antibiotics and glycerol (3% w/v final) were inoculated with 0.6 mL of the overnight cultures. After 4 hours of incubation at 37°C and shaking at 200 rpm, the cultures were cooled down to 25°C for an hour. The volume of the cultures was adjusted to 2 mL for each tube, IPTG (0.1 mM final) was added, and the cultures were overlaid with 200 μL of dodecane. The cultures were incubated for another 48 hours at 25°C and 200 rpm shaking. The cultures were then extracted with 1 mL ethyl acetate, and 50 μL of isolongifolene at 2 mg/mL was added as internal standard before analysing the samples by GC/MS. GC/MS analysis used the same method as described in Example 1. The carrier gas was helium at a constant flow of 0.7 mL/min. Injection was in split (1:5) mode with the injector temperature set at 250°C. The oven temperature was programmed from 50°C (5 min hold) to 300°C at 5°C/min, then to 340°C at 50°C/min and held for 3 min. Identification of products was based on mass spectra and retention indices. GC/MS analysis revealed that PaTPS3 produced drimenol as the main product with a selectivity of 73% (not including farnesol and farnesyl acetate) and a titer of 18.4 mg/L (Figures 6 and 7).

An in vitro assay was performed to confirm the above in vivo characterization of PaTPS3. BL21 (DE3) E. coli cells were transformed with the plasmid pJ401-PaTPS3. The transformed cells were selected on LB-agar plates containing kanamycin (50 μg/mL final). Single colonies were used to inoculate 25 mL liquid LB medium supplemented with the same antibiotic. Cultures were incubated at 37°C and 200 rpm shaking until turbid (OD around 0.5). After 5 hours of incubation, the cultures were cooled down to 20°C for 30 min and IPTG (0.1 mM final) was added. The cultures were incubated at 20°C and 200 rpm overnight, and then centrifuged and re-suspended in 5 mL of 50 mM MOPSO buffer (containing 10% glycerol w/v and 5 mM DTT, pH 7). The re-suspended cells were broken by sonication on ice (3 times for 10 sec each) and centrifuged at 4°C and 12000 rpm for 30 min. The supernatant (containing the crude protein) was used in the in vitro assay. A total of 2 mL of 50 mM MOPSO reaction buffer (containing 10% glycerol w/v, 15 mM MgCl2, 0.1 mM MnCl2, 1 mM DTT, 6 mM Na3VO4, pH 7), 10 of the 145 μM FPP solution and 1 mL crude protein were mixed and overlaid with 1 mL of heptane, then incubated for 16 hours for the in vitro reaction. The reaction was then extracted with 1 mL ethyl acetate, and 50 μL of isolongifolene at 2 mg/mL was added as internal standard before analysing the samples by GC/MS. GC/MS analysis used the same method as described in Example 1. The carrier gas was helium at a constant flow of 0.7 mL/min. Injection was in split (1:5) mode with the injector temperature set at 250°C. The oven temperature was programmed from 50°C (5 min hold) to 300°C at 5°C/min, then to 340°C at 50°C/min and held for 3 min. Identification of products was based on mass spectra and retention indices. GC/MS analysis revealed that PaTPS3 produced drimenol with a selectivity of 21% along with 79% (E)-nerolidol (Figures 8 and 9).
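The selectivity and titer values reported in this example are derived from GC/MS peak areas and the internal standard. The exact calculation is not given in the text, so the following is only a minimal sketch under stated assumptions: a detector response factor of 1 between drimenol and isolongifolene, complete recovery of products into the analysed extract, and selectivity defined as the drimenol peak area divided by the summed areas of all counted products. The peak areas are hypothetical placeholders, not measured data, and the function names are illustrative.

```python
IS_VOLUME_UL = 50.0       # isolongifolene solution added (from the text)
IS_CONC_MG_PER_ML = 2.0   # concentration of that solution (from the text)
CULTURE_VOLUME_ML = 2.0   # in vivo culture volume per tube (from the text)


def internal_standard_ug() -> float:
    """Mass of internal standard added; μL x mg/mL gives μg."""
    return IS_VOLUME_UL * IS_CONC_MG_PER_ML


def titer_mg_per_l(analyte_area: float, is_area: float,
                   response_factor: float = 1.0) -> float:
    """Titer from the analyte/internal-standard area ratio (assumed RF = 1)."""
    analyte_ug = analyte_area / is_area * internal_standard_ug() / response_factor
    return analyte_ug / CULTURE_VOLUME_ML  # μg/mL is numerically equal to mg/L


def selectivity_percent(areas: dict, target: str, excluded=()) -> float:
    """Share of the target peak among all counted peaks, in percent."""
    counted = {name: a for name, a in areas.items() if name not in excluded}
    return 100.0 * counted[target] / sum(counted.values())


# Hypothetical peak areas in arbitrary units (assumption, for illustration only).
peak_areas = {
    "drimenol": 600.0,
    "other sesquiterpenes": 200.0,
    "farnesol": 300.0,
    "farnesyl acetate": 100.0,
}
is_peak_area = 1000.0  # hypothetical internal standard peak area

selectivity = selectivity_percent(
    peak_areas, "drimenol", excluded=("farnesol", "farnesyl acetate")
)
titer = titer_mg_per_l(peak_areas["drimenol"], is_peak_area)
print(f"selectivity: {selectivity:.1f}%  titer: {titer:.1f} mg/L")
```

Excluding farnesol and farnesyl acetate from the denominator mirrors the note in the in vivo result; with a measured response factor, the value would replace the default of 1.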

Sequence Listings

SEQ ID NO: 1

cDNA sequence of PaTPS3:

ATGTCTGTCAAAGTTCCTCAATCTCAGAATGCTCCTACAGAGGTTGGACGTCGGTCC GTAAATTTTCATCCTACTGTTTGGGGAGATCGGTTTATCACATACAATAACCAGTCA GTTGATGATGATGTGGAAAAGAGATTAACAAAAGAACTAAAATCCCAAGTGAGGAG AAAGTTGGTGGATGCTGCTGAAAATACATGTCAGAAGCTTAACACAATCGATGCAA TCGAGAGATTAGGCTTGGCTTATCATTTCGAAACAGAGATTGAAGAAGCACTGCAA AATATTTATAATTCCTCTCAGGTTGTTGGAAATAATGTGGAAGAAGATGACCTCTAC TCTGTTGCCTTACGCTTTAGGCTTCTCAGACAACAGGGCTACAATATTTCATCTGATG TGTTTAACAAATTCAAAGATGATAAAGATAACTTCAAGGTATCTTTAATTGGTGATG CATCAAGCTTGCTAAGCCTATATGAAGCTGCACACCTTCGAGTACACGGAGAACAC ATACTGGATGAAGCTCTAACTTTCTCAGTTAATAATCTGGAATCAATGGCAACCCAA TTAAGTCCACCCCTTGCAACACATGTAACCCATGCACTAAACAGACCACTTCGAAAG GGCATTCCAAGGCTAGAAGCAAGGCACTACATTTCTGTCTACGAACAAGATCCTTTA CACGATGAAGATCTATTGAAGCTCGCAAAGTTAGATTTCAACCAATTACAGAAAATT CACCAGAAGGAGCTAAGCGAGATCTCAAAGTGGTGGAAAGATATAAACTTTGTATC AAAGCTACCTTTTGCAAGGGACAGAGTGGTGGAGTGCTACTTTTGGATAATGTCAGT GCATAGCGAGCCCGAGAACTGGCTTGCACGAAGGACAGCTGCAAAAATAGCTGCGG TAACCTCCATTATAGATGATATCTATGATGTGCATGGTACAATTGACGAACTGACGC TATTTACAGAAGCCGTCAACAGGTGGGATATAAACAACATTGATCAACTCCCGGAG TACATGAAAATATGTTATAAGGCGCTCTTGGGCGTTTTTAGTGAATTAGGGGAAGAG TTGGAAAAACAAGGAAGATCTTACCGCCTCGATCATACAATTGAACTTATGAAAGA TCTAGTTGGGAACTATTTTACTGAATCGAAATGGTTAAGCGAAAAATATGTGCCCAC AATAGAGGAGTATATGCGTGCTGCAGAAGTCACCATAGGTTACAACAATGCTATAA CTGCATCTTTTGCCACAGCCAAAGCCGGAGATATTGCAACCAAGGAGACCTTTGAAT GGGTGTTGAGTGAACCTAAAATTGTTAAGGCTTCCTCAGTAATTTGCAGGTTGATGG ATGACTTATCATCCCACAAGTTTGAGCAAAAGAGAGGACATGTTGCATCTGCTATTG AATGCTACATGAAGCAACATGATGCTACAGAGGAAAAGGTGCGTGCGGAGTTTAAT AAACAAGTCACCGACGCCTGGAAGGTGATAAATCAAGAATGTCTCCACCCAACAGC CATTCCAATGCCTCTTCTTACATGTGTTCTCAACTATGCACGTGTGGCTGATGTCATG TACAAGGATGGAGATGCTTATACATTTGCCCAGATCTTACTGAAAGATCATTTATCG GCATTGTTCACCGATCCTATTGCAATGTGA

SEQ ID NO: 2

Amino acid sequence of PaTPS3:

MSVKVPQSQNAPTEVGRRSVNFHPTVWGDRFITYNNQSVDDDVEKRLTKELKSQVRRK LVDAAENTCQKLNTIDAIERLGLAYHFETEIEEALQNIYNSSQVVGNNVEEDDLYSVALR FRLLRQQGYNISSDVFNKFKDDKDNFKVSLIGDASSLLSLYEAAHLRVHGEHILDEALTF SVNNLESMATQLSPPLATHVTHALNRPLRKGIPRLEARHYISVYEQDPLHDEDLLKLAKL DFNQLQKIHQKELSEISKWWKDINFVSKLPFARDRVVECYFWIMSVHSEPENWLARRTA AKIAAVTSIIDDIYDVHGTIDELTLFTEAVNRWDINNIDQLPEYMKICYKALLGVFSELGE ELEKQGRSYRLDHTIELMKDLVGNYFTESKWLSEKYVPTIEEYMRAAEVTIGYNNAITA SFATAKAGDIATKETFEWVLSEPKIVKASSVICRLMDDLSSHKFEQKRGHVASAIECYM KQHDATEEKVRAEFNKQVTDAWKVINQECLHPTAIPMPLLTCVLNYARVADVMYKDG DAYTFAQILLKDHLSALFTDPIAM

SEQ ID NO: 3

codon optimized cDNA sequence of PaTPS3 for expression in E. coli:

ATGTCCGTTAAAGTTCCGCAAAGCCAAAATGCCCCTACCGAAGTTGGCCGTCGTTCC GTCAACTTCCACCCGACGGTCTGGGGTGATCGTTTCATTACCTACAATAACCAGAGC GTTGACGACGATGTGGAAAAGCGTTTGACCAAAGAATTGAAGTCCCAGGTCCGTCG TAAACTGGTTGACGCTGCAGAGAACACTTGCCAGAAACTGAACACCATCGACGCGA TCGAGCGCCTGGGTCTGGCTTACCATTTCGAGACTGAGATTGAAGAGGCACTGCAGA ACATCTACAATTCCAGCCAAGTCGTGGGCAATAATGTAGAGGAAGATGATTTATATA GCGTGGCGCTGCGTTTTCGTCTGCTGCGTCAACAGGGTTATAACATCAGCTCCGATG TCTTTAACAAGTTCAAAGATGATAAAGACAATTTCAAGGTTAGCCTGATCGGTGACG CAAGCTCTTTGTTATCTCTGTATGAAGCCGCGCATCTGCGCGTGCATGGCGAGCATA TCTTGGATGAAGCGCTGACCTTTAGCGTTAATAATCTGGAATCGATGGCAACCCAGC TGAGCCCGCCGCTGGCAACGCACGTTACGCACGCGTTGAACCGCCCGCTGCGCAAG GGTATCCCGCGTCTGGAAGCGCGTCATTACATTTCTGTGTACGAACAAGATCCACTG CACGACGAAGATTTGCTTAAACTGGCGAAACTGGATTTTAATCAACTGCAAAAGATT CACCAGAAAGAACTGAGCGAGATTAGCAAATGGTGGAAAGACATTAATTTCGTCAG CAAGCTGCCGTTCGCCCGCGACCGTGTTGTGGAGTGCTATTTCTGGATTATGAGCGT TCACAGCGAGCCTGAGAACTGGCTGGCGCGCCGCACCGCGGCTAAGATTGCGGCAG TCACGTCGATTATCGACGATATCTATGACGTCCACGGCACCATCGATGAACTGACGC TGTTCACCGAAGCCGTTAACCGCTGGGACATCAACAACATTGATCAGCTGCCGGAAT ACATGAAGATCTGCTACAAAGCGCTGCTGGGCGTGTTCAGCGAGCTGGGTGAAGAA CTGGAGAAACAGGGTCGTAGCTATCGCTTGGATCATACCATTGAGCTGATGAAAGA TCTGGTCGGTAATTACTTCACCGAGTCCAAGTGGCTGAGCGAGAAATACGTTCCGAC GATCGAAGAGTACATGCGTGCTGCCGAAGTGACCATCGGTTACAACAATGCCATTA

GGGTGCTGAGCGAACCGAAGATTGTCAAAGCCTCCAGCGTTATTTGTCGTCTGATGG ACGATTTGAGCAGCCATAAGTTTGAGCAAAAGCGTGGCCACGTCGCGAGCGCGATC GAGTGCTATATGAAACAGCACGACGCGACCGAGGAAAAAGTTCGTGCAGAGTTCAA TAAACAAGTCACCGATGCGTGGAAAGTCATTAACCAAGAGTGCTTGCACCCGACGG CGATCCCGATGCCACTGCTGACCTGTGTGCTCAATTATGCACGTGTTGCGGACGTTA TGTATAAGGATGGTGACGCGTATACCTTTGCGCAAATTCTGCTGAAAGACCACCTGA GCGCACTGTTCACGGACCCGATCGCGATGTAA

SEQ ID NO: 4

forward primer

ATGTCTGTCAAAGTTCCTCAATC

SEQ ID NO: 5

reverse primer

TCACATTGCAATAGGATCGGTG