Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
LARGE SCALE PRODUCTION OF OLIVETOL, OLIVETOLIC ACID AND OTHER ALKYL RESORCINOLS BY FERMENTATION
Document Type and Number:
WIPO Patent Application WO/2021/225952
Kind Code:
A1
Abstract:
Provided herein are processes, such as commercially viable processes, of producing alkyl resorcinols, such as olivetol and olivetolic acid, and analogs of each thereof. Certain of these processes utilize a recombinant, heterologous host microorganism. Certain of the heterologous microorganisms include a Cannabis sativa olivetol synthase (which is a tetraketide synthase, csOLS). Certain of the heterologous microorganisms include a Cannabis sativa olivetolic acid cyclase (csOAC). Certain of the heterologous microorganisms include a Cannabis sativa acyl activating enzyme (csAAE), such as, without limitation, csAAE1. In certain of these processes, glucose is fermented. In certain of these processes, the fermentation further comprises a carboxylic acid, RCO2H where R is defined as herein, or a salt thereof. Certain of these processes provide olivetol and olivetolic acid in a combined amount of at least 3 g/liter.

Inventors:
OHLER NICK (US)
CONLEY ANDREW (US)
FARINA ANTHONY (US)
OUELLET MARIO (US)
LENNEN REBECCA (US)
ALIKHANI AZADEH (US)
MELIS DAVID (US)
HELD MARK (US)
Application Number:
PCT/US2021/030452
Publication Date:
November 11, 2021
Filing Date:
May 03, 2021
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
LYGOS INC (US)
International Classes:
C12N15/81; C07C63/04; C12N9/10; C12N9/12; C12N9/88; C12N15/52
Domestic Patent References:
WO2018209143A12018-11-15
WO2020069142A12020-04-02
WO2019190945A12019-10-03
WO2018148848A12018-08-23
Foreign References:
US20190300888A12019-10-03
CN111019874A2020-04-17
CN105838632B2019-05-10
Attorney, Agent or Firm:
MORSHEDIAN, Shaghayegh et al. (US)
Download PDF:
Claims:
CLAIMS

1. A process comprising: contacting an aqueous phase comprising glucose and hexanoic acid or a salt thereof and an organic phase immiscible with the aqueous phase with a recombinant, heterologous microorganism comprising one or more of a polypeptide having: at least 95% sequence identity with Cannabis sativa olivetol synthase (which is a tetraketide synthase, csOLS), at least 95% sequence identity with Cannabis sativa olivetolic acid cyclase (csOAC), and at least 95% sequence identity with a Cannabis sativa acyl activating enzyme (csAAE) to produce olivetol and olivetolic acid or a salt thereof, wherein the olivetol and olivetolic acid or the salt thereof are produced in a combined amount of at least about 2 g per liter of total liquid broth (comprising both aqueous and immiscible liquid phases) after 1-7 days of operation.

2. The process of claim 1, wherein the fermenting is performed in the absence of galactose.

3. The process of claim 1, wherein the aqueous phase comprises galactose.

4. The process of claim 1, wherein the organic phase comprises an alkane, an alcohol with carbon number greater than 4, an ester (such as isopropyl myristate), a triglyceride (including commercially available vegetable oils such as sunflower oil, soybean oil, or olive oil), a diester, a ketone, or a polyether (such as a polyglyme).

5. The process of claim 1, wherein the aqueous phase further comprises histidine.

6. The process of claim 1, wherein the pH of the aqueous phase is at a pH of about 4 to about 8.

7. The process of claim 1, wherein the microorganism is Socchoromyces cerevisiae.

8. The process of claim 1, wherein the fermentation is performed in a semi-continuous mode ("fill-and-draw"), or a continuous mode, for a prolonged duration, and the overall combined productivity of olivetol and olivetolate is > 0.3 g per L of total volume (including aqueous and immiscible liquid phases) per day of operation.

9. A process comprising: contacting an aqueous phase comprising glucose and butyric acid (CHsiCI-h CC H) or a salt thereof and an organic phase immiscible with the aqueous phase with a recombinant, heterologous microorganism comprising a polypeptide having: at least 95% sequence identity with a one or more of a Cannabis sativa olivetol synthase (which is a tetraketide synthase, csOLS), at least 95% sequence identity with a Cannabis sativa olivetolic acid cyclase (csOAC), and at least 95% sequence identity with a Cannabis sativa acyl activating enzyme (csAAE) to produce divarin and/or divarinic acid or a salt thereof.

10. The process of claim 9, wherein the fermenting is performed in the absence of galactose.

11. The process of claim 9, wherein the aqueous phase comprises galactose.

12. The process of claim 9, wherein the organic phase comprises an alkane, an alcohol with carbon number greater than 4, an ester (such as isopropyl myristate), a triglyceride (including commercially available vegetable oils such as sunflower oil, soybean oil, or olive oil), a diester, or a ketone.

13. The process of claim 9, wherein the aqueous phase further comprises histidine.

14. The process of claim 9, wherein the pH of the aqueous phase is at a pH of about 4 to about 8.

15. The process of claim 9, wherein the microorganism is Saccharomyces cerevisiae.

16. The process of claim 9, wherein the fermentation is performed in a semi-continuous mode ("fill-and draw"), or a continuous mode, for a prolonged duration.

17. A process comprising: contacting an aqueous phase comprising glucose and a carboxylic acid of formula RCO2H or a salt thereof, wherein R is optionally substituted Ci-Cs alkyl, optionally substituted C2-C6 alkenyl, or optionally substituted C2-C8 alkynyl and an organic phase immiscible with the aqueous phase with a recombinant, heterologous microorganism comprising one or more of a polypeptide having: at least 95% sequence identity with a Cannabis sativa olivetol synthase (which is a tetraketide synthase, csOLS), at least 95% sequence identity with a Cannabis sativa olivetolic acid cyclase (csOAC), and at least 95% sequence identity with a a Cannabis sativa acyl activating enzyme (csAAE) to produce a compound of formula (IA) and/or (IB):

IA IB or a salt thereof wherein R is defined as above.

18. The process of claim 17, wherein the fermenting is performed in the absence of galactose.

19. The process of claim 17, wherein the aqueous phase comprises galactose.

20. The process of claim 17, wherein the organic phase comprises an alkane, an alcohol with carbon number greater than 4, an ester (such as isopropyl myristate), a triglyceride (including commercially available vegetable oils such as sunflower oil, soybean oil, or olive oil), a diester, a ketone, or a polyether (such as a polyglyme).

Description:
LARGE SCALE PRODUCTION OF OLIVETOL, OLIVETOLIC ACID AND OTHER ALKYL RESORCINOLS

BY FERMENTATION

PRIORITY CLAIM

This application claims priority to US provisional application nos. US 63/122,369 filed on December 7, 2020; US 63/089,736 filed on October 9, 2020; 63/079,390 filed on September 16, 2020; US 63/070,513 filed on August 26, 2020; and US 63/022,038 filed on May 8, 2020, each of which is incorporated herein in its entirety by reference.

STATEMENT ABOUT FEDERAL FUNDING

Not applicable.

FIELD

Provided herein are processes, preferably scalable, commercially relevant processes, of producing olivetol and olivetolic acid, or an analog thereof, or a salt of each thereof by fermentation employing a recombinant heterologous host microorganism.

BACKGROUND

Olivetol and olivetolic acid are key gateway molecules for preparing cannabinoids. And yet, there is very little if any report of a scalable, commercially viable, production of olivetol and olivetolic acid by fermentation.

Further, divarin, which is l,3-dihydroxy-5-propylbenzene and divarinic acid, which is 2,4- dihydroxy-6-propylbenzoic acid, are key gateway molecules for preparing certain minor cannabinoids. Minor cannabinoids are naturally obtained in quantities smaller than major cannabinoids such as tetrahydrocannabinol (THC). And yet, there is very little if any report of producing divarin and divarinic acid by fermentation.

SUMMARY

In one aspect provided herein are processes of producing a compound of formula (IA) and/or (IB) or a salt thereof wherein R is optionally substituted C 1 -C 8 alkyl, optionally substituted C 2 -C 6 alkenyl, or optionally substituted C 2 -C 8 alkynyl. In certain embodiments, other R groups such as optionally substituted cycloalkyl, preferably optionally substituted C 3 -C 8 cycloalkyl; optionally substituted heterocyclyl; optionally substituted aryl, preferably optionally substituted phenyl; and optionally substituted heteroaryl are contemplated as employed according to the present invention. In one embodiment, the compound produced is of formula IA. In another embodiment, the compound produced is of formula IB.

Certain of these processes utilize a recombinant, heterologous, host cells or host microorganism. Certain of the host microorganisms comprise a recombinant olivetol synthase (OLS or OS), which is a tetraketide synthase (TKS). Certain of the host microorganisms comprise a recombinant Cannabis sativa olivetol synthase (which is a tetraketide synthase, csOLS).

Certain of the host microorganisms comprise a recombinant olivetolic acid cyclase (OAC). Certain of the heterologous microorganisms comprise a recombinant Cannabis sativa olivetolic acid cyclase (csOAC). Certain of the host microorganisms comprise an acyl activating enzyme (AAE). Certain of the heterologous microorganisms comprise a recombinant Cannabis sativa acyl activating enzyme (csAAE), such as, without limitation, csAAEl.

In another embodiment, the recombinant host microorganism comprises 4-20, or 6-16 copies of csOLS. In another embodiment, the recombinant host microorganism comprises 4-20, or 6-16 copies of csOAC. In another embodiment, the recombinant host microorganism comprises 4-20, or 6-16 copies of csAAEl.

In certain of these processes, glucose is fermented. In certain of these processes, galactose is fermented. The process further comprises a carboxylic acid of formula R-CO2H or a salt thereof. In certain of the processes, the microorganism is a yeast. In certain of the processes, the microorganism is Socchoromyces cerevisiae. In certain of the processes, the microorganism is a bacteria. In certain of the processes, the microorganism is Escherichia coli.

In some embodiments, the process further comprises contacting: an aqueous phase comprising glucose and RCO2H or a salt thereof and an organic phase immiscible with the aqueous phase.

In one embodiment, R is Ci-Cs alkyl. In another embodiment, R is C1-C4 alkyl. In another embodiment, R is C 6 -C 8 alkyl. In another embodiment, R is substituted Ci-Cs alkyl.

In another embodiment, R is C 2 -C 8 alkenyl. In another embodiment, R is substituted C 2 - C 8 alkenyl.

In another embodiment, R is C 2 -C 8 alkynyl. In another embodiment, R is substituted C 2 - Cg alkynyl.

In another embodiment, the compounds of formula IA and IB are provided in a combined amount of at least about 2 g/liter over about 4 to about 7 days. In another embodiment, the compounds of formula IA and IB are provided in a combined amount of at least about 3 g/liter over about 4 to about 7 days. In another embodiment, the compounds of formula IA and IB are provided in a combined amount of at least about4 g/liter over about 4 to about 7 days. In another embodiment, the compounds of formula IA and IB are provided in a combined amount of at least about 5 g/liter over about 4 to about 7 days. In another embodiment, the compounds of formula IA and IB are provided in a combined amount of at least about 10 g/liter over about 4 to about 7 days.

This invention arises in part from the surprising discovery that recombinant host microorganisms produce commercially relevant amounts of olivetol and olivetolic acid by fermentation. In some aspects, provided herein are processes of producing olivetol, olivetolic acid, or a salt thereof. Certain of these processes are commercially viable for producing olivetol, olivetolic acid or a salt thereof, which are key gateway compounds for preparing a variety of cannabinoids. Certain of these processes utilize a recombinant, heterologous, host microorganism. Certain of the host microorganisms include a recombinant Cannabis sativa olivetol synthase (which is a tetraketide synthase, csOLS). Certain of the heterologous microorganisms include a recombinant Cannabis sativa olivetolic acid cyclase (csOAC). Certain of the heterologous microorganisms include a recombinant Cannabis sativa acyl activating enzyme (csAAE), such as, without limitation, csAAEl. In another embodiment, the recombinant host microorganism comprises 4-20, or 6-16 copies of csOLS. In another embodiment, the recombinant host microorganism comprises 4-20, or 6-16 copies of csOAC. In another embodiment, the recombinant host microorganism comprises 4-20, or 6-16 copies of csAAEl.

In certain of these processes, glucose is fermented. In certain of these processes, galactose is fermented. In certain of these processes, the fermentation further comprises hexanoic acid or a salt thereof. Certain of these processes provide olivetol and olivetolic acid in a combined amount of at least 3 g/liter. In certain of the processes, the microorganism is Saccharomyces cerevisiae.

This invention arises in another part from the surprising discovery that recombinant host microorganisms produce divarin and divarinic acid by fermentation. In some aspects, provided herein are processes of producing divarin and/or divarinic acid or a salt thereof. Certain of these processes are commercially viable for producing divarin and divarinic acid, which are key gateway compounds for preparing a variety of minor cannabinoids. Certain of these processes utilize a recombinant, heterologous, host microorganism. Certain of the host microorganisms include a recombinant Cannabis sativa olivetol synthase (which is a tetraketide synthase, csOLS). Certain of the heterologous microorganisms include a recombinant Cannabis sativa olivetolic acid cyclase (csOAC). Certain of the heterologous microorganisms include a recombinant Cannabis sativa acyl activating enzyme (csAAE), such as, without limitation, csAAEl. In another embodiment, the recombinant host microorganism comprises 4-20, or 6-16 copies of csOLS. In another embodiment, the recombinant host microorganism comprises 4-20, or 6-16 copies of csOAC. In another embodiment, the recombinant host microorganism comprises 4-20, or 6-16 copies of csAAEl. In certain of these processes, glucose is fermented. In certain of these processes, the fermentation further comprises butyric acid or a salt thereof. Certain of these processes provide divarin and/or divarinic acid or a salt thereof in a combined amount of at least about 0.25 - about 8 g/liter, about 1 - about 7 g/liter, about 0.25 - about 2 g/liter, about 0.25 - about 2 g/liter, about 0.5 - about 1 g/liter, or about 2 - about 4 g/liter. Certain of these processes provide divarin and/or divarinic acid or a salt thereof in a combined amount of at least about 2 - about 5, preferably about 3 - about 4 g/liter. In one embodiment, the combined amount of divarin and/or divarinic acid is provided over 2-7 or 4-7 days, such as 2,3, 4, 5, 6, or 7 days. In certain of the processes, the microorganism is Saccharomyces cerevisiae.

In one embodiment, the fermentation is performed as a batch/fed batch fermentation with a fixed batch duration. In one embodiment, the fermentation is performed as a "semi- continuous" fermentation operating mode. In one embodiment, the fermentation is performed as a continuous fermentation operating mode. The continuous mode may be a fill-and-draw, or a true continuous operation.

An illustrative and non-limiting process of isolating olivetol or another compound of formula IA or IB is schematically illustrated in Figure 1.

In one embodiment, a mixture of compounds of formula IA and IB provided by fermentation is extracted from a fermentation media by alkaline extraction. In some embodiments, the alkaline extraction is an aqueous alkaline extraction. In some embodiments, the alkaline extraction is performed at a pH of about 12 - about 14. In some embodiments, the alkaline extraction is performed at a pH of about 13. In some embodiments, the alkaline extraction is performed under milder alkaline conditions. In some embodiments, the alkaline extraction is performed at a pH of about 7 - about 12. In some embodiments, the alkaline extraction is performed at a pH of about 7 - about 10. Without being bound by theory, under the milder alkaline extraction, a compound of formula IB is preferentially extracted. The compound of formula IA can thereafter be extracted under stronger alkaline conditions, e.g., as described herein.

In some embodiments, the extracted mixture of compounds of formula IA and IB are decarboxylated to provide a compound of formula IA. In some embodiments, the decarboxylation is performed by heating. In some embodiments, the heating is performed at about 100°C - about 140°C, or preferably at about 110°C - about 130°C. In some embodiments, the heating is performed at about 120°C. Post decarboxylation, the compound of formula IA provided, comprises by weight about 2% or less, or preferably about 1% or less of a compound of formula IB or a salt thereof. In some embodiments, the extracted mixture of compounds of formula IA and IB are acidified before decarboxylation. In some embodiments, the decarboxylation is performed at a pH of about 5 - about 8. In some embodiments, the decarboxylation is performed at a pH of about 6.5.

In one embodiment, the compound of formula IA provided by decarboxylation is extracted into an organic solvent (e.g., a water immiscible organic solvent) to provide a solution of the compound of formula IA in the organic solvent. In some embodiments, the organic solvent is a solvent capable of dissolving a compound of formula IA; formula IA comprises an aromatic ring and polar hydroxy groups. In one embodiment, the organic solvent comprises an aromatic hydrocarbon solvent. In one embodiment, the organic solvent comprises toluene. In one embodiment, the organic solvent is toluene. In some embodiments, the organic solvent comprises aliphatic or alicyclic hydrocarbon solvents.

In some embodiments, the compound of formula IA, present as a solution in the organic solvent, is reacted with a terpene alcohol, a terpenal (i.e., a terpene aldehyde), and the likes. In some embodiments, the solution of the compound of formula IA in the organic solvent is employed for reacting the compound of formula IA with a terpene alcohol. In some embodiments, the solution of the compound of formula IA in the organic solvent is employed for reacting the compound of formula IA with a terpenal. In one embodiment, the terpene alcohol is geraniol. In one embodiment, the terpene alcohol is farnesol. In one embodiment, the terpene alcohol is menthadienol (trans 2,8-menthadienol or PMD). In one embodiment, the terpene alcohol is or a diastereomer thereof, or an ester of each thereof. In one embodiment, the the hydroxy form (unesterified) is employed. In one embodiment, the terpene alcohol is:

(lR,4R)-4-lsopropenyl-l-methyl-2-cyclohexen-l-ol In one embodiment, the terpenal is citral. In some embodiments, the reaction with a terpenal further comprises a primary amine. In one embodiment, the primary amine is tertiary butyl amine.

In some embodiments, the reaction of a compound of formula IA with a terpene alcohol, a terpenal, or the likes provides a cannabinoid. In one embodiment, the cannabinoid is cannabigerol (CBG). In another embodiment, the cannabinoid is cannabichromene (CBC). In another embodiment, the cannabinoid is cannabidiol (CBD). In another embodiment, the cannabinoid is tetrahydrocannabinol (THC). In another embodiment, the cannabinoid is cannabinol (CBN). In another embodiment, the cannabinoid is the varin analog (CBGV, CBCV, CBDV, THCV, CBNV) of CBG, CBC, CBD, THC, CBN. A varin analog is a compound where the n- pentyl chain of a cannabinoid, e.g., and without limitation, CBG, CBC, CBD, or THC is replaced by an n-propyl chain. The cannabinoids obtained are purified by a variety of purification methods. In one embodiment, the purification method comprises chromatography. In one embodiment the purification method comprises distillation. In one embodiment, the chromatography comprises a reverse phase chromatography.

In one embodiment, R is n-pentyl. In another embodiment, R is n-propyl. In another embodiment, R is n-heptyl.

A non-limiting example of reacting (prenylating) olivetol with the terpene alcohol, geraniol, is schematically illustrated in Figure 2.

A non-limiting example of reacting (prenylating) olivetol with the terpenal, citral, is schematically illustrated in Figure 3.

The initial engineering of Saccharomyces cerevisiae was done by introducing a gene fragment containing csOLS, csOAC, and csAAEl under the control of galactose regulatable elements called promoters. The csOLS and csOAC were physically linked to each other on the gene with a genetic element called T2A in all examples.

To select for Saccharomyces cerevisiae cells that efficiently incorporated the foreign DNA, but removed Saccharomyces cerevisiae that had no foreign genes, a standard protocol method was utilized that allows growth on nutrient preferred media. One way to construct gene fragments that are functional in an organism is to generate individual gene fragments by a polymerase chain reaction (PCR). This creates individual gene fragments from simple smaller DNA sequences and a well defined DNA fragment called a 'template' that contains pieces of your final gene fragment. The smaller pieces of DNA are called 'primers'. These primers flank the DNA you want to generate from various templates to generate the final product you desire by PCR. These final products are called 'amplicons'. One process to 'stitch' together various amplicons is called Gibson Assembly. Many gene fragments disclosed in this method were first generated by Gibson Assembly of several amplicons generated by PCR. These assembled gene fragments were then allowed to be uptaken iteratively into a wild type Saccharomyces cerevisiae cell called JK9-3d. In some embodiments, CEN.PK is useful as a wild type Saccharomyces cerevisiae.

The process by which Saccharomyces cerevisiae uptakes foreign DNA and stably utilize the foreign DNA is called recombination. The final Saccharomyces cerevisiae strains that took up the foreign DNA and utilized the DNA are called recombinants. Saccharomyces cerevisiae that did not undergo recombination are the wild type. The process of selecting recombinants in a preferred media is termed prototrophy rescue. To separate recombinants from wild type prototrophy rescue was utilized.

Examples herein below provide a method to create recombinants that produce various levels of O/OA by varying how many of those gene fragments are uptaken by JK9-3d. In one example, recombinants that produce O/OA under the control of galactose are disclosed. Another example discloses, how the number of exogenous fragments taken up from Saccharomyces cerevisiae correlates with O/OA concentrations in the media. The number of genetic fragments recombinants contain can be determined by sequencing the DNA of the recombinant and by using the PCR method to quantify the number of amplicons generated. Amplicons are quantified by quantitative real time polymerase chain reaction (qPCR). qPCR and direct sequencing, and how those quantitative values of the genetic elements relate to the quantitative levels of O and OA are exemplified and provided herein.

DESCRIPTION OF THE FIGURES

Figure 1 schematically illustrates recovery of olivetol and other compounds of formula IA in accordance with the present invention.

Figure 2 schematically illustrates the semisynthesis of cannabinoids (CBG) by prenylation of fermented olivetol.

Figure 3 schematically illustrates the semisynthesis of cannabinoids (CBC) by prenylation of fermented olivetol.

Figure 4A graphically illustrates the time course of total product titer (olivetol and olivetolic acid) in g/L.

Figure 4B graphically illustrates the time course of titer/time (g/L/day).

Figure 5A graphically illustrates the time course of total product titer (divarin and divarinic acid) in g/L.

Figure 5B graphically illustrates the time course of titer/time (g/L/day).

DETAILED DESCRIPTION

While the present invention is described herein with reference to aspects and specific embodiments thereof, those skilled in the art will recognize that various changes may be made and equivalents may be substituted without departing from the invention. The present invention is not limited to particular nucleic acids, expression vectors, enzymes, host microorganisms, or processes, as such may vary. The terminology used herein is for purposes of describing particular aspects and embodiments only, and is not to be construed as limiting. In addition, many modifications may be made to adapt a particular situation, material, composition of matter, process, process step or steps, in accordance with the invention. All such modifications are within the scope of the claims appended hereto. Headers are used solely for readers' convenience, and disclosure found under any header is understood in the context of and applicable to the entire disclosure.

Definitions

In this specification and in the claims that follow, reference will be made to a number of terms that shall be defined to have the following meanings.

As used in the specification and the appended claims, the singular forms "a", "an", and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to an "expression vector" includes a single expression vector as well as a plurality of expression vectors, either the same ( e.g the same operon) or different; reference to "cell" includes a single cell as well as a plurality of cells; and the like.

As used herein, the term "comprising" is intended to mean that the compounds, compositions and processes include the recited elements, but not exclude others.

"Consisting essentially of" when used to define compounds, compositions and processes, shall mean excluding other elements of any essential significance to the combination.

Thus, a composition consisting essentially of the elements as defined herein would not exclude trace contaminants, e.g., from the isolation and purification method. "Consisting of" shall mean excluding more than trace elements of other ingredients. Embodiments defined by each of these transition terms are within the scope of this technology.

All numerical designations, e.g., pH, temperature, time, concentration, and molecular weight, including ranges, are approximations which are varied (+) or (-) by increments of 1, 5, or 10%, e.g., by using the prefix, "about." It is to be understood, although not always explicitly stated that all numerical designations are preceded by the term "about." It also is to be understood, although not always explicitly stated, that the reagents described herein are merely exemplary and that equivalents of such are known in the art.

"Alkyl" refers to monovalent saturated aliphatic hydrocarbyl groups having from 1 to 10 carbon atoms and preferably 1 to 6 carbon atoms. Higher carbon atom containing alkyl groups are also contemplated in certain embodiments, as the context will indicate. This term includes, by way of example, linear and branched hydrocarbyl groups such as methyl (CH3-), ethyl (CH3CH2), -n- propyl- (CH3CH2CH2-), isopropyl ((CH3)CH), -n-butyl- (CH3CH2CH2CH2-), isobutyl ((CH 3 )2CHCH 2 -), sec-butyl ((CH3)(CH 3 CH 2 )CH), -t-butyl- ((CH 3 ) 3 C), -n- pentyl- (CH3CH2CH2CH2CH2- ), and neopentyl ((CH 3 ) 3 CCH 2 -).

"Alkenyl" refers to monovalent straight or branched hydrocarbyl groups having from 2 to 10 carbon atoms and preferably 2 to 6 carbon atoms or preferably 2 to 4 carbon atoms and having at least 1 and preferably from 1 to 2 sites of vinyl (>C=C<) unsaturation. Higher carbon atom containing alkenyl groups are also contemplated in certain embodiments, as the context will indicate. Such groups are exemplified, for example, by vinyl, allyl, and but-3-en-lyl.

Included within this term are the cis and trans isomers or mixtures of these isomers.

“Alkynyl” refers to straight or branched monovalent hydrocarbyl groups having from 2 to 10 carbon atoms and preferably 2 to 6 carbon atoms or preferably 2 to 3 carbon atoms and having at least 1 and preferably from 1 to 2 sites of acetylenic (-CºC-) unsaturation. Higher carbon atom containing alkynyl groups are also contemplated in certain embodiments, as the context will indicate. Examples of such alkynyl groups include acetylenyl (-CºCH), and propargyl (-CH2CºCH).

"Substituted alkyl" refers to an alkyl group having from 1 to 5, preferably 1 to 3, or more preferably 1 to 2 substituents selected from the group consisting of alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl ester)amino, (carboxyl ester)oxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxy, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, SO3H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein.

"Heteroalkyl" refers to an alkyl group one or more carbons is replaced with -0-, -S-, SO2, a phosphorous (P) containing moiety, or -NR Q - moieties where R Q is H or C1-C6 alkyl. Substituted heteroalkyl refers to a heteroalkyl group having from 1 to 5, preferably 1 to 3, or more preferably 1 to 2 substituents selected from the group consisting of alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl ester)amino, (carboxyl ester)oxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxy, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, SO3H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein.

"Substituted alkenyl" refers to alkenyl groups having from 1 to 3 substituents, and preferably 1 to 2 substituents, selected from the group consisting of alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl ester)amino, (carboxyl ester)oxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxyl, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, SO 3 H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein and with the proviso that any hydroxyl or thiol substitution is not attached to a vinyl (unsaturated) carbon atom.

"Heteroalkenyl" refers to an alkenyl group where one or more carbons is replaced with one or more -0-, -S-, SO 2 , P containing moiety, or -NR Q - moieties where R Q is H or C 1 -C 6 alkyl. Substituted heteroalkenyl refers to a heteroalkenyl group having from 1 to 5, preferably 1 to 3, or more preferably 1 to 2 substituents selected from the group consisting of alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl ester)amino, (carboxyl ester)oxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxy, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, SO 3 H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein.

"Substituted alkynyl" refers to alkynyl groups having from 1 to 3 substituents, and preferably 1 to 2 substituents, selected from the group consisting of alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl ester)amino, (carboxyl ester)oxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxy, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, SO3H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein and with the proviso that any hydroxyl or thiol substitution is not attached to an acetylenic carbon atom.

"Heteroalkynyl" refers to an alkynyl group one or more carbons is replaced with -O- , -S- , SO2, P containing moiety, or -NR Q - moieties where R Q is H or C1-C6 alkyl. Substituted heteroalkynyl refers to a heteroalkynyl group having from 1 to 5, preferably 1 to 3, or more preferably 1 to 2 substituents selected from the group consisting of alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl ester)amino, (carboxyl ester)oxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxy, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, SO3H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein.

"Alkylene" refers to divalent saturated aliphatic hydrocarbyl groups having from 1 to 10 carbon atoms, preferably having from 1 to 6 and more preferably 1 to 3 carbon atoms that are either straight-chained- or branched. Higher carbon atom containing alkenyl groups are also contemplated in certain embodiments, as the context will indicate. This term is exemplified by groups such as methylene (-CH 2 -), ethylene (-CH2CH2-), n-propylene (-CH2CH2CH2-), iso- propylene (-CH 2 CH(CH 3 )- or -CH(CH3)CH 2 -), butylene (-CH2CH2CH2CH2-), isobutylene (- CH2 C H(CH 3 )CH 2 -), sec-butylene (-CH 2 CH 2 (CH3)CH), and the like. Similarly, "alkenylene" and "alkynylene" refer to an alkylene moiety containing respective 1 or 2 carbon-carbon double bonds or a carbon-carbon triple bond.

"Substituted alkylene" refers to an alkylene group having from 1 to 3 hydrogens replaced with substituents selected from the group consisting of alkyl, substituted alkyl, alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminoacyl, aryl, substituted aryl, aryloxy, substituted aryloxy, cyano, halogen, hydroxyl, nitro, carboxyl, carboxyl ester, cycloalkyl, substituted cycloalkyl, heteroaryl, substituted heteroaryl, heterocyclic, substituted heterocyclic, and oxo wherein said substituents are defined herein. In some embodiments, the alkylene has 1 to 2 of the aforementioned groups, or having from 1-3 carbon atoms replaced with -0-, -S-, SO2, P containing moiety or -NR Q - moieties where R Q is H or C1-C6 alkyl. It is to be noted that when the alkylene is substituted by an oxo group, 2 hydrogens attached to the same carbon of the alkylene group are replaced by "=0." "Substituted alkenylene" and "substituted alkynylene" refer to alkenylene and alkynylene moieties substituted with substituents as described for substituted alkylene.

"Alkynylene ” refers to straight or branched divalent hydrocarbyl groups having from 2 to 10 carbon atoms and preferably 2 to 6 carbon atoms or preferably 2 to 3 carbon atoms and having at least 1 and preferably from 1 to 2 sites of acetylenic (-CºC-) unsaturation. Higher carbon atom containing alkynylene groups are also contemplated in certain embodiments, as the context will indicate. Examples of such alkynylene groups include -CºC- and -CH2CºC-.

"Substituted alkynylene" refers to alkynylene groups having from 1 to 3 substituents, and preferably 1 to 2 substituents, selected from the group consisting of alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl ester)amino, (carboxyl ester)oxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxy, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, SO 3 H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein and with the proviso that any hydroxyl or thiol substitution is not attached to an acetylenic carbon atom.

"Heteroalkylene" refers to an alkylene group wherein one or more carbons is replaced with -0-, -S-, SO 2 , a P containing moiety, or -NR Q - moieties where R Q is H or C 1 -C 6 alkyl. "Substituted heteroalkylene" refers to heteroalkynylene groups having from 1 to 3 substituents, and preferably 1 to 2 substituents, selected from the substituents disclosed for substituted alkylene.

"Heteroalkenylene" refers to an alkenylene group wherein one or more carbons is replaced with -0-, -S-, SO 2 , a P containing moiety, or -NR Q - moieties where R Q is H or C 1 -C 6 alkyl. "Substituted heteroalkenylene" refers to heteroalkynylene groups having from 1 to 3 substituents, and preferably 1 to 2 substituents, selected from the substituents disclosed for substituted alkenylene.

"Heteroalkynylene" refers to an alkynylene group wherein one or more carbons is replaced with -0-, -S-, SO 2 , a P containing moiety, or -NR Q - moieties where R Q is H or C 1 -C 6 alkyl. "Substituted heteroalkynylene" refers to heteroalkynylene groups having from 1 to 3 substituents, and preferably 1 to 2 substituents, selected from the substituents disclosed for substituted alkynylene.

"Alkoxy" refers to the group -O-alkyl wherein alkyl is defined herein. Alkoxy includes, by way of example, methoxy-, ethoxy, n-propoxy, isopropoxy, n-butoxy, t- butoxy, -sec-butoxy, and- n-pentoxy. "Substituted alkoxy" refers to the group -0-(substituted alkyl) wherein substituted alkyl is defined herein.

"Acyl" refers to the groups H-C(O), -alkyl-C-(O)-, substituted alkyl-C(O)-, alkenyl-C(O)-, substituted alkenyl-C(O)-, alkynyl-C(O)-, substituted alkynyl-C(O)-, cycloalkyl-C(O)-, substituted cycloalkyl-C(O)-, cycloalkenyl-C(O)-, substituted cycloalkenyl-C(O)-, aryl-C(O)-, substituted aryl- C(O)-, heteroaryl-C(O)-, substituted heteroaryl-C(O)-, heterocyclic-C(O), and substituted- heterocyclic-C-(O)-, wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein. Acyl includes the "acetyl" group CH 3 C(0)-.

"Acylamino" refers to the groups -NR 40 C(O)alkyl, -NR 40 C(O)substituted alkyl, - NR 40 C(O)cycloalkyl, -NR 40 C(O)substituted cycloalkyl, -NR 40 C(O)cycloalkenyl, - NR 40 C(O)substituted cycloalkenyl, -NR 40 C(O)alkenyl, -NR 40 C(O)substituted alkenyl, - NR 40 C(O)alkynyl, -NR 40 C(O)substituted alkynyl, -NR 40 C(O)aryl, -NR 40 C(O)substituted aryl, - NR 40 C(O)heteroaryl, -NR 40 C(O)substituted heteroaryl, -NR 40 C(O)heterocyclic, and - NR 40 C(O)substituted heterocyclic wherein R 40 is hydrogen or alkyl and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Acyloxy" refers to the groups alkyl-C-(0)0, substituted-alkyl-C-(0)0-, alkenyl-C(0)0-, substituted alkenyl-C(0)0-, alkynyl-C(0)0-, substituted alkynyl-C(0)0-, aryl-C(0)0, substituted- aryl-C-(0)0-, cycloalkyl-C(0)0-, substituted cycloalkyl-C(0)0-, cycloalkenyl-C(0)0-, substituted cycloalkenyl-C(0)0-, heteroaryl-C(0)0-, substituted heteroaryl-C(0)0, -heterocyclic-C-(0)0, and substituted-heterocyclic-C-(0)0- wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Amino" refers to the group -IMH2. "Substituted amino" refers to the group -NR 41 R 42 where R 41 and R 40 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, substituted heterocyclic, -SCh-alkyl, -SCh-substituted alkyl, -SCh-alkenyl, -SCh-substituted alkenyl, -SCh-cycloalkyl, -SCh-substituted cylcoalkyl, -SCh-cycloalkenyl, -S0 2 -substituted cylcoalkenyl, -SCh-aryl, -S0 2 -substituted aryl, -S0 2 -heteroaryl, -S0 2 -substituted heteroaryl, -SO2- heterocyclic, and -S0 2 -substituted heterocyclic and wherein R 41 and R 42 are optionally joined, together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, provided that R 41 and R 42 are both not hydrogen, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein. When R 41 is hydrogen and R 42 is alkyl, the substituted amino group is sometimes referred to herein as alkylamino. When R 41 and R 42 are alkyl, the substituted amino group is sometimes referred to herein as dialkylamino. When referring to a monosubstituted amino, it is meant that either R 41 or R 42 is hydrogen but not both. When referring to a disubstituted amino, it is meant that neither R 41 nor R 42 are hydrogen.

"Aminocarbonyl" refers to the group -C(O)NR 50 R 51 where R 50 and R 51 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R 50 and R 51 are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Aminothiocarbonyl" refers to the group -C(S)NR 50 R 51 where R 50 and R 51 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R 50 and R 51 are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Aminocarbonylamino" refers to the group -NR 40 C(O)NR 50 R 51 where R 40 is hydrogen or alkyl and R 50 and R 51 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic, and where R 50 and R 51 are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Aminothiocarbonylamino" refers to the group -NR 40 C(S)NR 50 R 51 where R 40 is hydrogen or alkyl and R 50 and R 51 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R 50 and R 51 are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein. "Aminocarbonyloxy" refers to the group -O-C(O)NR 50 R 51 where R 50 and R 51 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R50 and R51 are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Aminosulfonyl" refers to the group -SO2NR 50 R 51 where R 50 and R 51 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R 50 and R 51 are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Aminosulfonyloxy" refers to the group -O-SO2NR 50 R 51 where R 50 and R 51 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R 50 and R 51 are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein. "Aminosulfonylamino" refers to the group -NR 40 SO2NR 50 R 51 where R 40 is hydrogen or alkyl and R 50 and R 51 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R 50 and R 51 are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Amidino" refers to the group -C(=NR 52 )NR 50 R 51 where R 50 , R 51 , and R 52 are independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, aryl, substituted aryl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic and where R50 and R51 are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Aryl" or "Ar" refers to an aromatic carbocyclic group of from 6 to 14 carbon atoms having a single ring (e.g., phenyl) or multiple condensed rings (e.g., naphthyl or anthryl) which condensed rings may or may not be aromatic (e.g., 2-benzoxazolinone, 2H-l,4-benzoxazin- 3(4H)-one-7-yl, and the like) provided that the point of attachment is at an aromatic carbon atom. Certain, preferred aryl groups include phenyl and naphthyl.

"Substituted aryl" refers to aryl groups which are substituted with 1 to 5, preferably 1 to 3, or more preferably 1 to 2 substituents selected from the group consisting of alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl ester)amino, (carboxyl ester)oxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxy, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, SO3H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein.

"Arylene" refers to a divalent aromatic carbocyclic group of from 6 to 14 carbon atoms having a single ring or multiple condensed rings. "Substituted arylene" refers to an arylene having from 1 to 5, preferably 1 to 3, or more preferably 1 to 2 substituents as defined for aryl groups.

"Heteroarylene" refers to a divalent aromatic group of from 1 to 10 carbon atoms and 1 to 4 heteroatoms selected from the group consisting of oxygen, nitrogen and sulfur within the ring. "Substituted heteroarylene" refers to heteroarylene groups that are substituted with from 1 to 5, preferably 1 to 3, or more preferably 1 to 2 substituents selected from the group consisting of the same group of substituents defined for substituted aryl. Unless otherwise noted, the context will clearly indicate, whether an aryl or heteroaryl moiety is monovalent or divalent.

"Aryloxy" refers to the group-O-aryl-, where aryl is as defined herein, that includes, by way of example, phenoxy and naphthoxy.

"Substituted aryloxy" refers to the group -0-(substituted aryl) where substituted aryl is as defined herein.

"Arylthio" refers to the group-S-aryl-, where aryl is as defined herein. "Substituted arylthio" refers to the group -S-(substituted aryl), where substituted aryl is as defined herein.

"Carbonyl" refers to the divalent group -C(O)- which is equivalent to -C(=0)-.

"Carboxyl" or"carboxy" refers to -COOH or salts thereof.

"Carboxyl ester" or "carboxy ester" refers to the group -C(0)(0)-alkyl, -C(0)(0)- substituted alkyl, -C(0)0-alkenyl, -C(0)(0)-substituted alkenyl, -C(0)(0)-alkynyl, - C(0)(0)- substituted alkynyl, -C(0)(0)-aryl, -C(0)(0)-substituted-aryl, -C(0)(0)-cycloalkyl, -C(0)(0)- substituted cycloalkyl, -C(0)(0)-cycloalkenyl, -C(0)(0)-substituted cycloalkenyl, -C(0)(0)- heteroaryl, -C(0)(0)-substituted heteroaryl, -C(0)(0)-heterocyclic, and -C(0)(0)- substituted heterocyclic wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"(Carboxyl ester)amino" refers to the group-NR 40 C(O)(O)-alkyl, -NR 40 C(O)(O)- substituted alkyl, -NR 40 C(O)O-alkenyl, -NR 40 C(O)(O)-substituted alkenyl, -NR 40 C(O)(O)- alkynyl, - NR 40 C(O)(O)-substituted alkynyl, -NR 40 C(O)(O)-aryl, -NR 40 C(O)(O)- substituted-aryl, - NR 40 C(O)(O)-cycloalkyl, -NR 40 C(O)(O)-substituted cycloalkyl, -NR 40 C(O)(O)-cycloalkenyl, - NR 40 C(O)(O)-substituted cycloalkenyl, -NR 40 C(O)(O)- heteroaryl, -NR 40 C(O)(O)-substituted heteroaryl, -NR 40 C(O)(O)-heterocyclic, and -NR 40 C(O)(O)-substituted heterocyclic wherein R 40 is alkyl or hydrogen, and wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"(Carboxyl ester)oxy" refers to the group -0-C(0)0-alkyl, -0-C(0)0-substituted alkyl, -O- C(0)0-alkenyl, -0-C(0)0-substituted alkenyl, -0-C(0)0-alkynyl, -0-C(0)(0)- substituted alkynyl, -0-C(0)0-aryl, -0-C(0)0-substituted-aryl, -0-C(0)0-cycloalkyl, - 0-C(0)0-substituted cycloalkyl, -0-C(0)0-cycloalkenyl, -0-C(0)0-substituted cycloalkenyl, -0-C(0)0-heteroaryl, -0C(0)-0- substituted heteroaryl, -0C(0)-0- heterocyclic-, and -0C(0)-0-substituted heterocyclic wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Cyano" refers to the group -CN.

"Cycloalkyl" refers to cyclic alkyl groups of from 3 to 10 carbon atoms having single or multiple cyclic rings including fused, bridged, and spiro ring systems, and further includes cycloalkenyl. The fused ring can be an aryl ring provided that the non aryl part is joined to the rest of the molecule. Examples of suitable cycloalkyl groups include, for instance, adamantyl, cyclopropyl, cyclobutyl, cyclopentyl, and cyclooctyl.

"Cycloalkenyl" refers to nonaromatic- cyclic alkyl groups of from 3 to 10 carbon atoms having single or multiple cyclic rings and having at least one >C=C< ring unsaturation and preferably from 1 to 2 sites of >C=C< ring unsaturation.

"Substituted cycloalkyl" and "substituted cycloalkenyl" refers to a cycloalkyl or cycloalkenyl group having from 1 to 5 or preferably 1 to 3 substituents selected from the group consisting of oxo, thioxo, alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl esterjamino, (carboxyl esterjoxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxy, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, S03H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein.

"Cycloalkyloxy" refers to-O-cycloalkyl-.

"Substituted cycloalkyloxy refers to -0-(substituted cycloalkyl). "Cycloalkylthio" refers to -S-cycloalkyk

"Substituted cycloalkylthio" refers to -S-(substituted cycloalkyl).

"Cycloalkenyloxy" refers to-O-cycloalkenyl-.

"Substituted cycloalkenyloxy" refers to -0-(substituted cycloalkenyl).

"Cycloalkenylthio" refers to-S-cycloalkenyl-.

"Substituted cycloalkenylthio" refers to -S-(substituted cycloalkenyl).

"Guanidino" refers to the group -NHC(=NH)NH2.

Substituted guanidino" refers to -NR 53 C(=NR 53 )N(R 53 )2 where each R 53 is independently selected from the group consisting of hydrogen, alkyl, substituted alkyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, cycloalkyl, substituted cycloalkyl, heterocyclic, and substituted heterocyclic and two R 53 groups attached to a common guanidino nitrogen atom are optionally joined together with the nitrogen bound thereto to form a heterocyclic or substituted heterocyclic group, provided that at least one R 53 is not hydrogen, and wherein said substituents are as defined herein.

"Halo" or "halogen" refers to fluoro, chloro, bromo and iodo.

"Hydroxy" or "hydroxyl" refers to the group -OH.

"Heteroaryl" refers to an aromatic group of from 1 to 10 carbon atoms and 1 to 4 heteroatoms selected from the group consisting of oxygen, nitrogen and sulfur within the ring. Such heteroaryl groups can have a single ring (e.g., pyridinyl orfuryl) or multiple condensed rings (e.g., indolizinyl or benzothienyl) wherein the condensed rings may or may not be aromatic and/or contain a heteroatom provided that the point of attachment is through an atom of the aromatic heteroaryl group. In one embodiment, the nitrogen and/or the sulfur ring atom(s) of the heteroaryl group are optionally oxidized to provide for the N-oxide (N—O), sulfinyl, or sulfonyl moieties. Certain non-limiting examples include pyridinyl, pyrrolyl, indolyl, thiophenyl, oxazolyl, thizolyl, and- furanyl.

"Substituted heteroaryl" refers to heteroaryl groups that are substituted with from 1 to 5, preferably 1 to 3, or more preferably 1 to 2 substituents selected from the group consisting of the same group of substituents defined for substituted aryl.

"Heteroaryloxy" refers to -O-heteroaryl. "Substituted heteroaryloxy" refers to the group -0-(substituted heteroaryl).

"Heteroarylthio" refers to the group-S-heteroaryl-.

"Substituted heteroarylthio" refers to the group -S-(substituted heteroaryl).

"Heterocycle" or "heterocyclic" or "heterocycloalkyl" or "heterocyclyl" refers to a saturated or partially saturated, but not aromatic, group having from 1 to 10 ring carbon atoms and from 1 to 4 ring heteroatoms selected from the group consisting of nitrogen, sulfur, or oxygen. Heterocycle encompasses single ring or multiple condensed rings, including fused bridged and spiro ring systems. In fused ring systems, one or more the rings can be cycloalkyl, aryl, or heteroaryl provided that the point of attachment is through a nonaromatic ring. In one embodiment, the nitrogen and/or sulfur atom(s) of the heterocyclic group are optionally oxidized to provide for the-N-oxide, sulfinyl-, or sulfonyl moieties.

"Substituted heterocyclic" or "substituted heterocycloalkyl" or "substituted heterocyclyl" refers to heterocyclyl groups that are substituted with from 1 to 5 or preferably 1 to 3 of the same substituents as defined for substituted cycloalkyl.

"Heterocyclyloxy" refers to the group -O-heterocycyl.

"Substituted heterocyclyloxy" refers to the group -0-(substituted heterocycyl).

"Heterocyclylthio" refers to the group -S-heterocycyl.

"Substituted heterocyclylthio" refers to the group -S-(substituted heterocycyl).

Examples of heterocycle and heteroaryls include, but are not limited to, azetidine, pyrrole, furan, thiophene, imidazole, pyrazole, pyridine, pyrazine, pyrimidine, pyridazine, indolizine, isoindole, indole, dihydroindole, indazole, purine, quinolizine, isoquinoline, quinoline, phthalazine, naphthylpyridine, quinoxaline, quinazoline, cinnoline, pteridine, carbazole, carboline, phenanthridine, acridine, phenanthroline, isothiazole, phenazine, isoxazole, phenoxazine, phenothiazine, imidazolidine, imidazoline, piperidine, piperazine, indoline, phthalimide, 1,2,3,4-tetrahydroisoquinoline, 4,5,6,7-tetrahydrobenzo-[b]thiophene, thiazole, thiazolidine, thiophene, benzo[b]thiophene, morpholinyl, thiomorpholinyl (also referred to as thiamorpholinyl), 1,1-dioxothiomorpholinyl, piperidinyl, pyrrolidine, and tetrahydrofuranyl.

"Nitro" refers to the group -NO2. "Oxo" refers to the atom (=0).

Phenylene refers to a divalent aryl ring, where the ring contains 6 carbon atoms.

Substituted phenylene refers to phenylenes which are substituted with 1 to 4, preferably 1 to 3, or more preferably 1 to 2 substituents selected from the group consisting of alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, alkoxy, substituted alkoxy, acyl, acylamino, acyloxy, amino, substituted amino, aminocarbonyl, aminothiocarbonyl, aminocarbonylamino, aminothiocarbonylamino, aminocarbonyloxy, aminosulfonyl, aminosulfonyloxy, aminosulfonylamino, amidino, aryl, substituted aryl, aryloxy, substituted aryloxy, arylthio, substituted arylthio, carboxyl, carboxyl ester, (carboxyl ester)amino, (carboxyl ester)oxy, cyano, cycloalkyl, substituted cycloalkyl, cycloalkyloxy, substituted cycloalkyloxy, cycloalkylthio, substituted cycloalkylthio, cycloalkenyl, substituted cycloalkenyl, cycloalkenyloxy, substituted cycloalkenyloxy, cycloalkenylthio, substituted cycloalkenylthio, guanidino, substituted guanidino, halo, hydroxy, heteroaryl, substituted heteroaryl, heteroaryloxy, substituted heteroaryloxy, heteroarylthio, substituted heteroarylthio, heterocyclic, substituted heterocyclic, heterocyclyloxy, substituted heterocyclyloxy, heterocyclylthio, substituted heterocyclylthio, nitro, SO3H, substituted sulfonyl, substituted sulfonyloxy, thioacyl, thiol, alkylthio, and substituted alkylthio, wherein said substituents are as defined herein.

"Spirocycloalkyl" and "spiro ring systems" refers to divalent cyclic groups from 3 to 10 carbon atoms having a cycloalkyl or heterocycloalkyl ring with a spiro union (the union formed by a single atom which is the only common member of the rings).

"Sulfonyl" refers to the divalent group -S(0) 2 -.

"Substituted sulfonyl" refers to the group -SCh-alkyl-, -SO2- substituted-alkyl, -50 2 - alkenyl, -S0 2 -substituted alkenyl, -S0 2 -cycloalkyl, -S0 2 -substituted cylcoalkyl, -S0 2 -cycloalkenyl, -S0 2 -substituted cylcoalkenyl, -S0 2 -aryl, -S0 2 -substituted aryl, -S0 2 -heteroaryl, -S0 2 -substituted heteroaryl, -S0 2 -heterocyclic, -S0 2 -substituted-heterocyclic, wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein. Substituted sulfonyl includes groups such as methyl-SCh-, phenyl-SCh-, and 4-methylphenyl-S02-.

"Substituted sulfonyloxy" refers to the group -OSCh-alkyl, -OSCh-substituted- alkyl, - 0S0 2 -alkenyl, -OSCh-substituted alkenyl, -OSCh-cycloalkyl, -0S0 2 -substituted cylcoalkyl, -OSO 2 - cycloalkenyl, -OSCh-substituted cylcoalkenyl,-OS0 2 -aryl, -0S0 2 -substituted aryl, -OSO 2 - heteroaryl, -OSCh-substituted heteroaryl, -OS0 2 -heterocyclic, -0S0 2 -substituted heterocyclic, wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substituted heterocyclic are as defined herein.

"Thioacyl" refers to the groups H-C(S)-, alkyl-C(S)-, substituted alkyl-C(S), -alkenyl-C-(S), substituted-alkenyl-C-(S)-, alkynyl-C(S)-, substituted alkynyl-C(S)-, cycloalkyl-C(S)-, substituted cycloalkyl-C(S)-, cycloalkenyl-C(S)-, substituted cycloalkenyl-C(S)-, aryl-C(S)-, substituted aryl- C(S)-, heteroaryl-C(S)-, substituted heteroaryl-C(S)-, heterocyclic-C(S), and substituted- heterocyclic-C-(S)-, wherein alkyl, substituted alkyl, alkenyl, substituted alkenyl, alkynyl, substituted alkynyl, cycloalkyl, substituted cycloalkyl, cycloalkenyl, substituted cycloalkenyl, aryl, substituted aryl, heteroaryl, substituted heteroaryl, heterocyclic, and substitutedheterocyclic are as defined herein.

"Thiol" refers to the group -SH.

"Thiocarbonyl" refers to the divalent group -C(S)- which is equivalent to -C(=S)-.

"Thioxo" refers to the atom (=S).

"Alkylthio" refers to the group-S-alkyl- wherein alkyl is as defined herein.

"Substituted alkylthio" refers to the group -S-(substituted alkyl) wherein substituted alkyl is as defined herein.

"Optionally substituted" refers to a group selected from that group and a substituted form of that group. Substituents are such as those defined hereinabove. E.g., and without limitation, substituents can be selected from monovalent and divalent groups, such as, C 1 -C 10 or C 1 -C 6 alkyl, substituted C 1 -C 10 or C 1 -C 6 alkyl, C 2 -C 6 alkenyl, C 2 -C 6 alkynyl, C 6 -C 10 aryl, C 3 -C 8 cycloalkyl, C 2 -C 10 heterocyclyl, C 1 -C 10 heteroaryl, substituted C 2 -C 6 alkenyl, substituted C 2 -C 6 alkynyl, substituted C 6 -C 10 aryl, substituted C 3 -C 8 cycloalkyl, substituted C 2 -C 10 heterocyclyl, substituted C 1 -C 10 heteroaryl, halo, nitro, cyano, oxo (=0), -CO 2 H or a C 1 -C 6 alkyl ester thereof.

Unless indicated otherwise, the nomenclature of substituents that are not explicitly defined herein are arrived at by naming the terminal portion of the functionality followed by the adjacent functionality toward the point of attachment. For example, the substituent "alkoxycarbonylalkyl" refers to the group (alkoxy)-C(0)-(alkyl)-.

It is understood that in all substituted groups defined above, polymers arrived at by defining substituents with further substituents to themselves (e.g., substituted aryl having a substituted aryl group as a substituent which is itself substituted with a substituted aryl group, etc.) are not intended for inclusion herein. In such cases, the maximum number of such substituents is three. That is to say that each of the above definitions is constrained by a limitation that, for example, substituted aryl groups are limited to-substituted aryl- (substituted aryl)-substituted aryl.

It is understood that the above definitions are not intended to include impermissible substitution patterns (e.g., methyl substituted with 5 fluoro groups). Such impermissible substitution patterns are well known to the skilled artisan.

A "salt" is derived from a variety of organic and inorganic counter ions well known in the art and include, when the compound has an acidic functionality, by way of example only, sodium, potassium, calcium, magnesium, ammonium, and tetraalkylammonium; and when the molecule has a basic functionality, salts of organic or inorganic acids, such as hydrochloride, hydrobromide, tartrate, mesylate, acetate, maleate, and oxalate. Salts include acid addition salts formed with inorganic acids or organic acids. Inorganic acids suitable for forming acid addition salts include, by way of example and not limitation, hydrohalide acids (e.g., hydrochloric acid, hydrobromic acid, hydroiodic acid, etc.), sulfuric acid, nitric acid, phosphoric acid, and the like.

Organic acids suitable for forming acid addition salts include, by way of example and not limitation, acetic acid, trifluoroacetic acid, propionic acid, hexanoic acid, cyclopentanepropionic acid, glycolic acid, oxalic acid, pyruvic acid, lactic acid, malonic acid, succinic acid, malic acid, maleic acid, fumaric acid, tartaric acid, citric acid, palmitic acid, benzoic acid, 3-(4- hydroxybenzoyl) benzoic acid, cinnamic acid, mandelic acid, alkylsulfonic acids (e.g., methanesulfonic acid, ethanesulfonic acid, 1,2- ethane-disulfonic acid, 2-hydroxyethanesulfonic acid, etc.), arylsulfonic acids (e.g., benzenesulfonic acid, 4-chlorobenzenesulfonic acid, 2- naphthalenesulfonic acid, 4- toluenesulfonic acid, camphorsulfonic acid, etc.), glutamic acid, hydroxynaphthoic- acid, salicylic acid, stearic acid, muconic acid, and the like.

Salts also include salts formed when an acidic proton present in the parent compound is either replaced by a metal ion (e.g., an alkali metal ion, an alkaline earth metal ion, or an aluminum ion) or by an ammonium ion (e.g., an ammonium ion derived from an organic base, such as, ethanolamine, diethanolamine, triethanolamine, morpholine, piperidine, dimethylamine, diethylamine, triethylamine, and ammonia).

Amino acids in a protein coding sequence are identified herein by the following abbreviations and symbols. Specific amino acids are identified by a three-letter abbreviation, as follows: Ala is alanine, Arg is arginine, Asn is asparagine, Asp is aspartic acid, Cys is cysteine, Gin is glutamine, Glu is glutamic acid, Gly is glycine, His is histidine, Leu is leucine, lie is isoleucine, Lys is lysine, Met is methionine, Phe is phenylalanine, Pro is proline, Ser is serine, Thr is threonine, Trp is tryptophan, Tyr is tyrosine, and Val is valine, or by a one-letter abbreviation, as follows: A is alanine, R is arginine, N is asparagine, D is aspartic acid, C is cysteine, Q is glutamine, E is glutamic acid, G is glycine, H is histidine, L is leucine, I is isoleucine, K is lysine, O is pyrrolysine, M is methionine, F is phenylalanine, P is proline, S is serine, T is threonine, W is tryptophan, Y is tyrosine, and V is valine. A dash (-) in a consensus sequence indicates that there is no amino acid at the specified position. A plus (+) in a consensus sequence indicates any amino acid may be present at the specified position. Thus, a plus in a consensus sequence herein indicates a position at which the amino acid is generally non-conserved; a homologous enzyme sequence, when aligned with the consensus sequence, can have any amino acid at the indicated "+" position. Specific amino acids in a protein coding sequence are identified by their respective one-letter abbreviation followed by the amino acid position in the protein coding sequence where 1 corresponds to the amino acid (typically methionine) at the N-terminus of the protein. For example, G204 in C. sativa wild type OLS refers to the glycine at position 204 from the OLS N-terminal methionine (/.e., Ml). Amino acid substitutions (i.e., point mutations) are indicated by identifying the mutated (i.e., progeny) amino acid after the one-letter code and number in the parental protein coding sequence; for example, G204A in C. sativa OLS refers to substitution of glycine by alanine at position 204 in the OLS protein coding sequence. The mutation may also be identified in parentheticals, for example OLS (G204A). Multiple point mutations in the protein coding sequence are separated by a backslash (/); for example, OLS G204A/Q205N indicates that mutations G204A and Q205N are both present in the OLS protein coding sequence. The number of mutations introduced into some examples has been annotated by a dash followed by the number of mutations, preceding the parenthetical identification of the mutation (e.g., B1Q2B6-1 (G204A)). The Uniprot IDs with and without the dash and number are used interchangeably herein (i.e., B1Q2B6-1 (G204A) = B1Q2B6 (G204A)).

As used herein, the term "express", when used in connection with a nucleic acid encoding an enzyme or an enzyme itself in a cell, means that the enzyme, which may be an endogenous or exogenous (heterologous) enzyme, is produced in the cell. The term "overexpress", in these contexts, means that the enzyme is produced at a higher level, i.e., enzyme levels are increased, as compared to the wild type, in the case of an endogenous enzyme. Those skilled in the art appreciate that overexpression of an enzyme can be achieved by increasing the strength or changing the type of the promoter used to drive expression of a coding sequence, increasing the strength of the ribosome binding site or Kozak sequence, increasing the stability of the mRNA transcript, altering the codon usage, increasing the stability of the enzyme, and the like.

The term "expression vector" or "vector" refer to a nucleic acid and/or a composition comprising a nucleic acid that can be introduced into a host cell, e.g., by transduction, transformation, or infection, such that the cell then produces ("expresses") nucleic acids and/or proteins other than those native to the cell, or in a manner not native to the cell, that are contained in or encoded by the nucleic acid so introduced. Thus, an "expression vector" contains nucleic acids (ordinarily DNA) to be expressed by the host cell. Optionally, the expression vector can be contained in materials to aid in achieving entry of the nucleic acid into the host cell, such as the materials associated with a virus, liposome, protein coating, or the like. Expression vectors suitable for use in various aspects and embodiments include those into which a nucleic acid sequence can be, or has been, inserted, along with any preferred or required operational elements. Thus, an expression vector can be transferred into a host cell and, typically, replicated therein (although, one can also employ, in some embodiments, non- replicable vectors that provide for "transient" expression). In some embodiments, an expression vector that integrates into chromosomal, mitochondrial, or plastid DNA is employed. In other embodiments, an expression vector that replicates extrachromosomally is employed. Typical expression vectors include plasmids, and expression vectors typically contain the operational elements required for transcription of a nucleic acid in the vector. Such plasmids, as well as other expression vectors, are described herein or are well known to those of ordinary skill in the art.

The terms "ferment", "fermentative", and "fermentation" are used herein to describe culturing host cells and microbes under conditions to produce useful chemicals, including but not limited to conditions under which microbial growth, be it aerobic or anaerobic, occurs.

The term "heterologous" as used herein refers to a material that is non-native to a cell. For example, a nucleic acid is heterologous to a cell, and so is a "heterologous nucleic acid" with respect to that cell, if at least one of the following is true: (a) the nucleic acid is not naturally found in that cell (that is, it is an "exogenous" nucleic acid); (b) the nucleic acid is naturally found in a given host cell (that is, "endogenous to"), but the nucleic acid or the RNA or protein resulting from transcription and translation of this nucleic acid is produced or present in the host cell in an unnatural ( e.g ., greater or lesser than naturally present) amount; (c) the nucleic acid comprises a nucleotide sequence that encodes a protein endogenous to a host cell but differs in sequence from the endogenous nucleotide sequence that encodes that same protein (having the same or substantially the same amino acid sequence), typically resulting in the protein being produced in a greater amount in the cell, or in the case of an enzyme, producing a mutant version possessing altered (e.g., higher or lower or different) activity; and/or (d) the nucleic acid comprises two or more nucleotide sequences that are not found in the same relationship to each other in the cell. As another example, a protein is heterologous to a host cell if it is produced by translation of RNA or the corresponding RNA is produced by transcription of a heterologous nucleic acid; a protein is also heterologous to a host cell if it is a mutated version of an endogenous protein, and the mutation was introduced by genetic engineering.

The terms "host cell" and "host microorganism" are used interchangeably herein to refer to a living cell that can perform one or more steps of the cannabinoid pathway, e.g. and without limitation, converting malonyl-CoA and hexanoyl-CoA (or another acyl-CoA) to olivetol and olivetolic acid. A host cell can be (or is) transformed via insertion of an expression vector. A host microorganism or cell as described herein may be a prokaryotic cell (e.g., a microorganism of the kingdom Eubacteria) or a eukaryotic cell. As will be appreciated by one of skill in the art, a prokaryotic cell lacks a membrane-bound nucleus, while a eukaryotic cell has a membrane- bound nucleus. In certain instances, a host cell is part of a multi-cellular organism.

Polyketide synthases (PKSs) are a family of multi-domain enzymes or enzyme complexes that produce polyketides, a large class of secondary metabolites, in bacteria, fungi, plants, and a few animal lineages. The terms "polyketide synthase", "PKS", "olivetol synthase" ("OLS" or "OS"), "tetraketide synthase", TKS, and olivetolic synthase as described herein or elsewhere typically refers to any enzyme capable of converting three molecules of malonyl-CoA and one molecule of hexanoyl-CoA or another acyl-CoA to olivetol or an olivetol analog. A wild type example of an OLS is the native C. sotivo OLS enzyme (UniProt ID: B1Q2B6; SEQ ID NO: 1).

Sequence ID 1: OS

MNHLRAEGPASVLAIGTANPENIUQDEFPDYYFRVTKSEHMTQLKEKFRKICDKSMI RKRNCFLNEEHLKQN

PRLVEHEMQTLDARQDMLVVEVPKLGKDACAKAIKEWGQPKSKITHLIFTSASTTDM PGADYHCAKLLGLSP

SVKRVMMYQLGCYGGGTVLRIAKDIAENNKGARVLAVCCDIMACLFRGPSDSDLELL VGQAIFGDGAAAVI

VGAEPDESVGERPIFELVSTGQTILPNSEGTIGGHIREAGUFDLHKDVPMLISNNIE KCLIEAFTPIGISDWNSIF

WITHPGGKI LDKVEEKLDLKKEKFVDSRHVLSEHGNMSSSTVLFVMDELRKRSLEEGKSTTGDGFEWGV LF

G FG PG LTVE RVVVRSVPI KY

Olivetolic acid cyclase ("OAC", EC: 4.4.1.26) is a polyketide cyclase derived from C. sativa which functions in concert with an OLS enzyme or a tetraketide synthase ("TKS") to form OLA. See, e.g.:

Sequence ID 2A: OAC MAVKHLIVLKFKDE!TEAQKEEFFKTYVNLVN!!PAMKDVYWGKDVTQKKEEGYTHIVEV TFESVETIQDYI!HP

AHVGFGDVYRSFWEKLUFDYTPRK

The terms "cannabinoid pathway", "cannabinoid production", "cannabinoid compound production", "cannabinoid synthesis", "THC synthesis", and the like, refer gene rally to a biosynthetic pathway that facilitates the synthesis and production of olivetol, olivetolic acid, and olivetolic acid-derived compounds. This biosynthetic pathway utilizes a variety of enzymes, catalysts, and intermediate compounds. For example, cannabigerolic acid synthase (EC: 2.5.1.102) is used to convert OA to cannabigerolic acid, which is a key intermediate acted upon by a variety of enzymes during THC synthesis. Cannabidiolic acid synthase (EC: 1.21.3.7) is used to convert cannabigerolic acid into cannabidiolic acid. Tetrahydrocannabinolic acid synthase (EC: 1.21.3.8) is used to convert cannabigerolic acid into D 9 -tetrahydrocannabinolic acid. A cannabichromenic acid synthase is used to convert cannabigerolic acid into cannabichromenic acid (CAS# 20408-52-0). These three olivetolic acid-derived compounds (i.e., cannabidiolic acid, D 9 -tetrahydrocannabinolic acid, and cannabichromenic acid) are themselves converted to even more diverse cannabinoids via a combination of oxidation, decarboxylation, and isomerization reactions, which can be catalyzed using either biological or synthetic catalysts, or can also occur spontaneously following heating and/or application of UV light. For example, cannabidiol results from cannabidiolic acid decarboxylation, D 9 -tetrahydrocannabinol results from D 9 - tetrahydrocannabinolic acid decarboxylation, and subsequent isomerization of D 9 - tetrahydrocannabinol results in A 6 -tetrahydrocannabinol.

As used herein, "recombinant" refers to the alteration of genetic material by human intervention. Typically, recombinant refers to the manipulation of DNA or RNA in a cell or virus or expression vector by molecular biology (recombinant DNA technology) methods, including cloning and recombination. Recombinant can also refer to manipulation of DNA or RNA in a cell or virus by random or directed mutagenesis. A "recombinant" cell or nucleic acid can typically be described with reference to how it differs from a naturally occurring counterpart (the "wild type"). In addition, any reference to a cell or nucleic acid that has been "engineered" or "modified" and variations of those terms, is intended to refer to a recombinant cell or nucleic acid. The terms "transduce", "transform", "transfect", and variations thereof as used herein refers to the introduction of one or more nucleic acids into a cell. For practical purposes, the nucleic acid must be stably maintained or replicated by the cell for a sufficient period of time to enable the function(s) or product(s) it encodes to be expressed for the cell to be referred to as "transduced", "transformed", or "transfected". As will be appreciated by those of skill in the art, stable maintenance or replication of a nucleic acid may take place either by incorporation of the sequence of nucleic acids into the cellular chromosomal DNA, e.g., the genome, as occurs by chromosomal integration, or by replication extrachromosomally, as occurs with a freely- replicating plasmid. A virus can be stably maintained or replicated when it is "infective": when it transduces a host microorganism, replicates, and (without the benefit of any complementary virus or vector) spreads progeny expression vectors, e.g., viruses, of the same type as the original transducing expression vector to other microorganisms, wherein the progeny expression vectors possess the same ability to reproduce.

Descriptive Embodiments

In one aspect, provided herein is a process comprising: contacting an aqueous phase comprising glucose and hexanoic acid or a salt thereof and an organic phase immiscible with the aqueous phase with a heterologous microorganism comprising a Cannabis sativa olivetol synthase (which is a tetraketide synthase, csOLS), Cannabis sativa olivetolic acid cyclase (csOAC), and a Cannabis sativa acyl activating enzyme (csAAE) to provide olivetol and olivetolic acid or a salt thereof, wherein the olivetol and olivetolic acid is provided in a combined amount of at least about 3 g/liter over about 4 to about 7 days.

In accordance with certain embodiments of this process, other microorganisms, such as those utilized herein are useful.

In one aspect, provided herein is a process comprising: contacting an aqueous phase comprising glucose and butyric acid or a salt thereof and an organic phase immiscible with the aqueous phase with a heterologous microorganism comprising a Cannabis sativa olivetol synthase (which is a tetraketide synthase, csOLS), Cannabis sativa olivetolic acid cyclase (csOAC), and a Cannabis sativa acyl activating enzyme (csAAE) to provide divarin and/or divarinic acid or a salt thereof.

Other microorganisms, such as those utilize herein, utilized herein are useful in accordance with certain embodiments of this process.

In one embodiment, the divarin and/or divarinic acid is provided in a combined amount of at least about 0.25 - about 2 g/liter, or about 0.5 - about lg/liter over about 4 to about 7 days.

In one embodiment, the fermenting is performed in the absence of galactose. In another embodiment, the aqueous phase comprises galactose.

In one embodiment, the fermenting is performed in the absence of galactose. In another embodiment, the aqueous phase comprises galactose.

Organic Phase Immiscible With Aqueous Phase

In one embodiment, the organic phase immiscible with aqueous phase, or simply the organic phase, comprises an alkane, an alcohol with carbon number greater than 4, an ester (such as isopropyl myristate), a triglyceride (including commercially available vegetable oils such as sunflower oil, soybean oil, or olive oil), a diester (such as dialkyl malonate), a ketone, or a glyme. Other organic solvents immiscible with water or the aqueous phase employed can be utilized. In another embodiment, the organic phase comprises isopropyl myristate. Suitable solvents include without limitation, other esters, aromatic solvents, and the likes. In one embodiment, the organic phase comprises an aromatic solvent. Non limiting examples of aromatic hydrocarbon solvents include benzene, toluene, other alkylated benzenes, anisole and the likes, and mixtures thereof. In one embodiment, the organic phase comprises toluene.

In-situ liquid-liquid extraction (biphasic fermentation) is a strategy that can be employed in accordance with the present invention for physical separation of product from microorganisms via partitioning into the water immiscible organic liquid phase from an aqueous culture phase. The organic liquid phase or organic phase is present as either an overlay if its density is less than that of the aqueous phase, or an underlay if its density is greater than that of the aqueous phase. Certain properties of the overlay or underlay are considered for production of olivetolic acid/olivetol and other resorcinols such as of formulas IA and IB: (1) non-toxic or low toxicity for growth of the host strain, and/or (2) a favorable partition coefficient of the product in the organic phase vs. the aqueous phase, and/or (3) preferably a lower partition coefficient for fed hexanoic acid (for olivetolic acid/olivetol) or other fatty acid such as RCO2H (for other resorcinols) in the organic phase vs. the aqueous phase. Additional properties of the organic phase enhance its suitability for downstream conversion, e.g. and without limitation, to cannabigerol and other cannabinoid compounds, including suitability as a solvent or co-solvent during downstream prenylation or other reactions, and boiling point if downstream separation by distillation is employed.

The performance of various classes of organic phase compounds are provided herein. Among the diesters tested, certain may be toxic to growth under the test conditions. Certain diethyl esters were toxic under the test conditions with the exception of modest growth by most strains in the presence of diethyl sebacate and diethyl diethylmalonate (with glucose only, with galactose strains appeared to exhibit substantial lag). For malonate diesters, under the test conditions, di-tert-butyl malonate supported growth of all strains with glucose addition, again appearing toxic or to induce substantial lag with galactose addition.

Increasing the dialkyl ester chain length from diethyl to diisopropyl to dibutyl in a dialkyl adipate series reduced toxicity. Growth was observed with diisopropyl adipate and no apparent toxicity observed in dibutyl adipate. Dibutyl sebacate was also completely non-inhibitory to growth and accordingly, non-toxic. In certain embodiments, the minimum non-toxic internal alkyl chain length of diethyl diesters is sebacate. In certain embodiments, shorter internal alkyl chain length down to adipate is possible with diisopropyl diesters.

For monoester compounds, under the test conditions, octyl acetate was toxic and for the hexanoate series, growth was only observed starting with hexyl hexanoate, which was moderately non-toxic. Isopropyl octanoate was moderately inhibitory but allowed for some growth. For the decanoate series, methyl decanoate was moderately inhibitory to growth but still allowed for growth. Texanol, a monoester alcohol (2,2,4-trimethyl-l,3-pentanediol monoisobutyrate), was inhibitory to growth under the conditions tested. However, ethyl decanoate and higher alkyl chains were increasingly non-toxic. Both ethyl and butyl laurate were non-toxic, as well as methyl and ethyl myristate. In certain embodiments, growth-suitable monoester overlays for resorcinol or cannabinoid production include hexyl hexanoate or any higher chain length alkyl hexanoate ester, C 3 chain-length or higher (e.g., and without limitation C 6 -Cs or higher) alkyl octanoate esters, and methyl (Ci) or higher (e.g., and without limitation C 6 -Cs or higher) alkyl decanoates, laurates, or myristates.

In various embodiments, esters and diesters are employed as the organic phase in accordance with the present invention.

Fatty alcohols are mostly solids above C 10 saturated chain length. Decanol, a liquid, was toxic to growth under test conditions. However, oleyl alcohol supported robust growth. In certain embodiments, longer chain length (C 12 or higher) unsaturated fatty alcohols can be suitable overlays supporting S. cerevisiae or another fermenting organism's growth. In various embodiments, fatty alcohols, preferably C 12 or higher alcohols, are employed as the organic phase in accordance with the present invention.

In certain embodiments, alkanes and paraffins support robust growth. Lack of toxicity was observed for dodecane, tetradecane, hexadecane, light and heavy paraffin oils, and isopar M. In certain embodiments, C 12 and higher paraffins are suitable overlays supporting S. cerevisiae or another fermenting organism's growth. In various embodiments, fatty alcohols, preferably C 12 or higher alcohols, are employed as the organic phase in accordance with the present invention.

Certain triacylglycerols were tested, including tricaprylin, coconut oil, and canola oil (vegetable oils having different average chain length compositions of fatty acid chains, with coconut oil being predominantly C 12 -C 14 saturated fatty acids, and canola being predominantly Ci6-Ci8and a mixture of saturated and unsaturated fatty acids). Tricaprylin, a synthetic oil containing three Cs fatty acid chains, was fairly toxic, however allowed some growth of strains. In certain embodiments, coconut and canola oil were non-toxic to growth.

Mixtures of isopropyl myristate (IPM) and isopar M with different diesters - dibasic esters (DBE), diethyl sebacate, and di-tert-butyl malonate were explored to investigate if lower percentage mixtures of these compounds in non-toxic IPM or isopar M would mitigate their toxicity toward growth of a microorganism such as S. cerevisioe, as they may also advantageously alter partitioning properties of olivetolic acid, olivetol, and other analogues into the overlay and could offer advantages with alternative downstream separations processes. For example, and without limitation, a DBE, which may be toxic by itself as an underlay, was much less toxic at concentrations of between 1 and 2.5% (v/v) in IPM and especially isopar M. Di-tert- butyl malonate also exhibited much lower toxicity at 1-10% (v/v), and particularly 1-2.5% (v/v), in IPM and isopar M. In certain embodiments, mixtures of longer chain monoesters or paraffins with moderately to very toxic diesters are useful according to the present invention.

In another embodiment, the aqueous phase further comprises histidine. In another embodiment, the pH of the aqueous phase is at a pH of about 4 to about 8.

In another embodiment, the olivetol and olivetolic acid is provided in a combined amount of at least about 4 g/liter over about 4 to about 7 days. In another embodiment, the olivetol and olivetolic acid is provided in a combined amount of at least about 4.5 g/liter over about 4 to about 7 days. In another embodiment, the olivetol and olivetolic acid is provided in a combined amount of at least about 5 g/liter over about 4 to about 7 days. In another embodiment, the olivetol and olivetolic acid is provided in a combined amount of at least about 7 g/liter over about 4 to about 7 days. In another embodiment, the olivetol and olivetolic acid is provided in a combined amount of at least about 9 g/liter over about 4 to about 7 days. In another embodiment, the combined amount of olivetol and olivetolic acid provided herein, is provided over 4 days. In another embodiment, the combined amount of olivetol and olivetolic acid provided herein, is provided over 5 days. In another embodiment, the combined amount of olivetol and olivetolic acid provided herein, is provided over 6 days. In another embodiment, the combined amount of olivetol and olivetolic acid provided herein, is provided over 7 days.

In one embodiment, the fermentation is performed in a semi-continuous mode ("fill- and-draw"). In another embodiment, the fermentation is performed in a continuous mode. In one embodiment, the overall combined productivity of olivetol and olivetolic acid is greater than 0.3 g per L of total volume (including aqueous and immiscible liquid phases) per day of operation. In another embodiment, the fermentation is performed in a total volume of 15 liters or a larger volume such as 1,000 liters, 10,000 liters, 20,000 or 50,000 liters, or an even larger volume. The combined yield of O and OA obtained in such large scale fermentation performed according to the present invention over 2-7 or 4-7 days, such as over 2, 3, 4, 5, 6, or 7 days is unexpectedly high. In some embodiment, the combined amount of O/OA obtained, even in large scale fermentations, e.g., and without limitation in 10,000 liters fermentations, is about 7 - about 10 g/liter. In some embodiment, the combined amount of O/OA obtained, even in large scale fermentations, e.g., and without limitation in 20,000 liters fermentations, is about 7 - about 10 g/liter.

In one embodiment, the functional OLS has a Sequence ID 1. In another embodiment, the functional OLS has an at least 95% sequence identity with Sequence ID 1. In another embodiment, the functional olivetolic acid cyclase has at least 50%, at least 75%, or at least 95% sequence identity to SEQ ID 1.

In one embodiment, the functional OAC has a Sequence ID 2A. In another embodiment, the functional OAC has an at least 95% sequence identity with Sequence ID 2A. In another embodiment, the functional olivetolic acid cyclase has at least 50%, at least 75%, or at least 95% sequence identity to SEQ ID NO: 2A. In another embodiment, the functional olivetolic acid cyclase is of SEQ ID NO: 2B. In another embodiment, the functional olivetolic acid cyclase has at least 50%, at least 75%, or at least 95% sequence identity to SEQ ID NO: 2B.

SEQ ID NO: 2B

MAVKHLIVLKFKDEITEAQKEEFFKTYVNLVNIIPAMKDVYWGKDVTQKNKEEGYTH IVEVTFESVETIQDYII

HPAHVGFGDVYRSFWEKLLIFDYTPRK.

In one embodiment, the functional AAE has a Sequence ID 3A. In another embodiment, the functional AAE has an at least 95% sequence identity with Sequence ID 3A.

Sequence ID 3A: Cannabis sativa acyl activating enzyme (CsAAEl)

MGKNYKSLDSVVASDFIALGITSEVAETLHGRLAEIVCNYGAATPQTWINIANHILS PDLPFSLHQMLFYGCYK

DFGPAPPAWIPDPEKVKSTNLGALLEKRGKEFLGVKYKDPISSFSHFQEFSVRNPEV YWRTVLMDEMK ISFSK

DPECILRRDDINNPGGSEWLPGGYLNSAKNCLNVNSNKKLNDTMIVWRDEGNDDLPL NKLTLDQLRKRVW

LVGYALEEMGLEKGCALAIDMPMHVDAVVIYLAIVLAGYVVVSIADSFSAPELSTRL RLSKAKAIFTQDHIIRGK

KRIPLYSRVVEAKSPMAIVIPCSGSNIGAELRDGDISWDYFLERAKEFKNCEFTARE QPVDAYTNILFSSGTTGE

PKAIPWTQATPLKAAADGWSHLDIRKGDVIVWPTNLGWMMGPWLVYASLLNGASIAL YNGSPLVSGFAKF VQDAKVTMLGVVPSIVRSWKSTNCVSGYDWSTIRCFSSSGEASNVDEYLWLMGRANYKPV IEMCGGTEIG

GAFSAGSFLQAQSLSSFSSQCMGCTLYILDKNGYPMPKNKPGIGELALGPVMFGASK TLLNGNHHDVYFKG

MPTLNGEVLRRHGDIFELTSNGYYHAHGRADDTMN!GGIKISS!EIERVCNEVDDRV FETTA!GVPPLGGGPE

QLVIFFVLKDSNDTTIDLNQLRLSFNLGLQKKLNPLFKVTRVVPLSSLPRTATNKIM RRVLRQQFSHFE

In another embodiment, the functional AAE polypeptide comprises an amino acid sequence SEQ ID NO: 3B. In another embodiment, the functional AAE polypeptide comprises an amino acid sequence that has at least 50%, at least 75%, or at least 95% sequence identity to SEQ ID NO: 3B. In another embodiment, the functional AAE polypeptide comprises an amino acid sequence SEQ ID NO: 3C. In another embodiment, the functional AAE polypeptide comprises an amino acid sequence that has at least 50%, at least 75%, or at least 95% sequence identity to SEQ ID NO: 3C. In another embodiment, the functional AAE polypeptide comprises an amino acid sequence that is SEQ ID NO: 3D. In another embodiment, the functional AAE polypeptide has at least 50%, at least 75%, or at least 95% sequence identity to SEQ ID NO: 3D.

SEQ ID NO 3B: Cannabis sativa acyl activating enzyme (CsAAE3) MEKSGYGRDGIYRSLRPPLHLPNNNNLSMVSFLFRNSSSYPQKPALIDS ETNQILSFSHFKSTVIKVSHGFLNLGIKKNDWLIYAPNSIHFPVCFLGIIA SGAIATTSNPLYTVSELSKQVKDSNPKLIITVPQLLEKVKGFNLPTILIGP DSEQESSSDKVMTFNDLVNLGGSSGSEFPIVDDFKQSDTAALLYSSGTT GMSKGWLTHKNFIASSLMVTMEQDLVGEMDNVFLCFLPMFHVFGLAI ITYAQLQRGNTVISARFDLEKMLKDVEKYVTHLWWPPVILALSKNSM VKFNLSSIKYIGSGAAPLGKDLMEECSKWPYGIVAQGYGMTETCGIVS MEDIRGGKRNSGSAGMLASGVEAQIVSVDTLKPLPPNQLGEIWVKGPN MMQGYFNNPQATKLTIDKKGWVHTGDLGYFDEDGHLYWDRIKELIK YKGFQVAPAELEGLLVSHPEILDAWIPFPDAEAGEVPVAYWRSPNSSL TENDVKKFIAGQV ASFKRLRKVTFINSVPKSASGKILRRELIQKVRSNM

SEQ ID NO 3C: Truncated Cannabis sativa acyl activating enzyme MEKSGYGRDGIYRSLRPPLHLPNNNNLSMVSFLFRNSSSYPQKPALIDS ETNQILSFSHFKSTVIKVSHGFLNLGIKKNDWLIYAPNSIHFPVCFLGIIA SGAIATTSNPLYTVSELSKQVKDSNPKLIITVPQLLEKVKGFNLPTILIGP DSEQESSSDKVMTFNDLVNLGGSSGSEFPIVDDFKQSDTAALLYSSGTT

GMSKGWLTHKNFIASSLMVTMEQDLVGEMDNVFLCFLPMFHVFGLAI

ITYAQLQRGNTVISARFDLEKMLKDVEKYVTHLWWPPVILALSKNSM

VKFNLSSIKYIGSGAAPLGKDLMEECSKWPYGIVAQGYGMTETCGIVS

MEDIRGGKRNSGSAGMLASGVEAQIVSVDTLKPLPPNQLGEIWVKGPN

MMQGYFNNPQATKLTIDKKGWVHTGDLGYFDEDGHLYWDRIKELIK

YKGFQVAPAELEGLLVSHPEILDAWIPFPDAEAGEVPVAYWRSPNSSL

TENDVKKFIAGQVASFKRLRKVTFINSVPKSASGKIL.

SEQ ID NO 3D: Escherichia coli hexanoyl-CoA synthetase amino acid sequence MHPTGPHLGPDVLFRESNMKVTLTFNEQRRAAYRQQGLWGDASLADYWQQTARAMPDKIA VVDNHGA SYTYSALDHAASCLANWMLAKGIESGDRIAFQLPGWCEFTVIYLACLKIGAVSVPLLPSW REAELVWVLNKC QAKMFFAPTLFKQTRPVDLILPLQNQLPQLQQIVGVDKLAPATSSLSLSQIIADNTSLTT AITTHGDELAAVLFT SGTEGLPKGVMLTHNNILASERAYCARLNLTWQDVFMMPAPLGHATGFLHGVTAPFLIGA RSVLLDIFTPDA CLALLEQQRCTCMLGATPFVYDLLNVLEKQPADLSALRFFLCGGTTIPKKVARECQQRGI KLLSVYGSTESSPH AVVNLDDPLSRFMHTDGYAAAGVEIKVVDDARKTLPPGCEGEEASRGPNVFMGYFDEPEL TARALDEEGW YYSGDLCRMDEAGYIKITGRKKDIIVRGGENISSREVEDILLQHPKIHDACVVAMSDERL GERSCAYVVLKAPH HSLSLEEVVAFFSRKRVAKYKYPEHIVVIEKLPRTTSGKIQKFLLRKDIMRRLTQDVCEE IE

In one embodiment, the sequence identity is at least 50%. In another embodiment, the sequence identity is at least 75%. In another embodiment, the sequence identity is at least 95%.

In another embodiment, the sequence identity is at least 99% with a protein sequence utilized herein. In another embodiment, the sequence identity is at least 99% with a nucleic acid sequence utilized herein.

In another embodiment, the heterologous microorganism is antibiotic-marker free. In another embodiment, the heterologous microorganism is an FAA2 (peroxisomal medium chain fatty acyl-CoA synthetase) knock out or has a lowered FAA2 activity. In another embodiment, the heterologous microorganism is a PXA1 (part of the heterodimeric peroxisomal fatty acid and/or acyl-CoA ABC transport complex with PXA2) knockout or has a lowered PXA2 activity. In another embodiment, the heterologous microorganism is a PEX11 (peroxisomal protein required for medium-chain fatty acid oxidation) knockout or has a lowered PEX11 activity. In another embodiment, the heterologous microorganism is an ANTI (peroxisomal adenine nucleotide transporter, which exchanges AMP generated in peroxisomes by acyl-CoA synthetases for ATP, that is consumed in that reaction, from the cytosol) knockout or has a lowered ANTI activity.

In another embodiment, the microorganism is Saccharomyces cerevisiae. In another embodiment, the Saccharomyces cerevisiae comprises galactose regulatable promoters for the heterologous genes (csOLS, csOAC, csAAE, and the likes). In another embodiment, the Saccharomyces cerevisiae does not include galactose regulatable promoters for the heterologous genes. In another embodiment, the Saccharomyces cerevisiae is haploid. In another embodiment, the Saccharomyces cerevisiae is diploid.

Initial construction of an olivetol/olivetolic acid (O/OA) producing line was done by introducing 3 genes (cannabis olivetol synthase csOLS, cannabis olivetolic acid cyclase csOAC, and a cannabis acyl-activating enzyme csAAEl) from the Cannabis sativa plant under the control of genetic elements that are regulated by a galactose carbon source. It is well known that galactose regulates gene expression in Saccharomyces cerevisiae. Introduction of foreign genes into Saccharomyces cerevisiae can be toxic to Saccharomyces cerevisiae for a variety of reasons. One way to regulate toxicity of foreign genes is to produce their expression under the control of galactose. Glucose is used under normal Saccharomyces cerevisiae growth conditions. However, during the course of growth or at the beginning of growth, glucose can then be exchanged with galactose to tightly control the expression of foreign genes.

The initial engineering of Saccharomyces cerevisiae was done by introducing a gene fragment containing csOLS, csOAC, and csAAEl under the control of galactose regulatable elements called promoters. The csOLS and csOAC were physically linked to each other on the gene with a genetic element called T2A in all examples. In order to select for Saccharomyces cerevisiae cells that efficiently incorporated the foreign DNA, but removed Saccharomyces cerevisiae that had no foreign genes, a method that allows growth on nutrient preferred media was utilized. The process by which Saccharomyces cerevisiae uptake foreign DNA and stably utilize the foreign DNA is called recombination. The final Saccharomyces cerevisiae strains that took up the foreign DNA and utilized the DNA are called recombinants. Saccharomyces cerevisiae that did not undergo recombination is called wild type. The process of selecting in preferred media is defined as prototrophy rescue. In order to separate recombinants from wild type we utilized prototrophy rescue. In some cases, recombinants we added foreign genes that contained genetic elements that controlled the resistance to antibiotics. Antibiotics such as G418 or hygromycin will normally kill Saccharomyces cerevisiae. In some instances, foreign genes were introduced into O/OA producing lines in order to rescue survival of G418 or hygromycin for the purpose of removing genes native to the Saccharomyces cerevisiae and allowing them to survive in antibiotics while decreasing the need for galactose utilization.

Expression Vectors

In various aspects, provided herein is a recombinant host cell modified by genetic engineering as disclosed herein. In one embodiment, a recombinant polyketide synthase, such as an OLS enzyme, is introduced. In another embodiment, an aromatic prenyltransferase is introduced. In another embodiment, the modification increases the production of malonyl-CoA, hexanoyl-CoA or a R-CoA. In some embodiments, the host cell is engineered via recombinant DNA technology to express heterologous nucleic acids that encode a cannabinoid pathway enzyme such as an OLS enzyme, which is either a mutated version of a naturally occurring enzyme, or a non-naturally occurring enzyme as provided herein.

In one preferred embodiment, the invention includes methods of generating a polynucleotide that expresses one or more of the SEQ IDs related to a mutant or modified OLS provided or utilized herein. In certain preferred embodiments, the proteins of the invention are expressed using any of a number of systems, such as in whole plants, as well as plant cell and/or yeast suspension cultures. E.g., the polynucleotide that encodes the OLS is placed under the control of a promoter that is functional in the desired host cell. An extremely wide variety of promoters may be available and can be used in the expression vectors of the invention, depending on the particular application. Ordinarily, the promoter selected depends on the cell in which the promoter is to be active. Other expression control sequences such as ribosome binding sites, transcription termination sites and the like are also optionally included.

Nucleic acid constructs provided and utilized herein include expression vectors that comprise nucleic acids encoding one or more polyketide synthase enzymes. The nucleic acids encoding the enzymes are operably linked to promoters and optionally other control sequences such that the subject enzymes are expressed in a host cell containing the expression vector when cultured under suitable conditions. The promoters and control sequences employed depend on the host cell selected for the production of olivetol, olivetolic acid (OLA or OA), OLA- derived compound, or another cannabinoid or cannabinoid derivative. Thus, the invention provides not only expression vectors but also nucleic acid constructs useful in the construction of expression vectors. Methods for designing and making nucleic acid constructs and expression vectors generally are well known to those skilled in the art and so are only briefly reviewed herein.

Nucleic acids encoding the polyketide synthase enzymes can be prepared by any suitable method known to those of ordinary skill in the art, including, for example, direct chemical synthesis and cloning. Further, nucleic acid sequences for use in the invention can be obtained from commercial vendors that provide de novo synthesis of the nucleic acids.

A nucleic acid encoding the desired enzyme can be incorporated into an expression vector by known methods that include, for example, the use of restriction enzymes to cleave specific sites in an expression vector, e.g., plasmid, thereby producing an expression vector of the invention. Some restriction enzymes produce single stranded ends that may be annealed to a nucleic acid sequence having, or synthesized to have, a terminus with a sequence complementary to the ends of the cleaved expression vector. The ends are then covalently linked using an appropriate enzyme, e.g., DNA ligase. DNA linkers may be used to facilitate linking of nucleic acids sequences into an expression vector.

A set of individual nucleic acid sequences can also be combined by utilizing polymerase chain reaction (PCR)-based methods known to those of skill in the art. For example, each of the desired nucleic acid sequences can be initially generated in a separate PCR. Thereafter, specific primers are designed such that the ends of the PCR products contain complementary sequences. When the PCR products are mixed, denatured, and reannealed, the strands having the matching sequences at their 3' ends overlap and can act as primers for each other.

Extension of this overlap by DNA polymerase produces a molecule in which the original sequences are "spliced" together. In this way, a series of individual nucleic acid sequences may be joined and subsequently transduced into a host cell simultaneously. Thus, expression of each of the plurality of nucleic acid sequences is affected.

A typical expression vector contains the desired nucleic acid sequence preceded and optionally followed by one or more control sequences or regulatory regions, including a promoter and, when the gene product is a protein, ribosome binding site, e.g., a nucleotide sequence that is generally 3-9 nucleotides in length and generally located 3-11 nucleotides upstream of the initiation codon that precede the coding sequence, which is followed by a transcription terminator in the case of E. coli or other prokaryotic hosts. See Shine et a!., Nature 254:34 (1975) and Steitz, in Biological Regulation and Development: Gene Expression (ed. R. F. Goldberger), vol. 1, p. 349 (1979) Plenum Publishing, N.Y. In the case of eukaryotic hosts like yeast, a typical expression vector contains the desired nucleic acid coding sequence preceded by one or more regulatory regions, along with a Kozak sequence to initiate translation and followed by a terminator. See Kozak, Nature 308:241-246 (1984).

Regulatory regions or control sequences include, for example, those regions that contain a promoter and an operator. A promoter is operably linked to the desired nucleic acid coding sequence, thereby initiating transcription of the nucleic acid sequence via an RNA polymerase. An operator is a sequence of nucleic acids adjacent to the promoter, which contains a protein-binding domain where a transcription factor can bind. Transcription factors activate or repress transcription initiation from a promoter. In this way, control of transcription is accomplished, based upon the particular regulatory regions used and the presence or absence of the corresponding transcription factor. Non-limiting examples for prokaryotic expression include lactose promoters (Lad repressor protein changes conformation when contacted with lactose, thereby preventing the Lad repressor protein from binding to the operator) and tryptophan promoters (when complexed with tryptophan, TrpR repressor protein has a conformation that binds the operator; in the absence of tryptophan, the TrpR repressor protein has a conformation that does not bind to the operator). Non-limiting examples of promoters to use for eukaryotic expression include pTDH3, pTEFl, pTEF2, pRNR2, pRPL18B, pREVl, pGALl, pGALlO, pGAPDH, pCUPl, pMET3, pPGKl, pPYKl, pHXT7, pPDCl, pFBAl, pTDH2, pPGIl, pPDCl, pTPIl, pEN02, pADHl, and pADH2. As will be appreciated by those of ordinary skill in the art, a variety of expression vectors and components thereof are useful.

Although any suitable expression vector are useful to incorporate the desired sequences, readily available expression vectors include, without limitation: plasmids, such as pESC, pTEF, p414CYCl, p414GALS, pSClOl, pBR322, pBBRlMCS-3, pUR, pEX, pMRIOO, pCR4, pBAD24, pUC19, pRS series; and bacteriophages, such as M13 phage and l phage. Of course, such expression vectors may only be suitable for particular host cells or for expression of particular polyketide synthases. One of ordinary skill in the art, however, can readily determine through routine experimentation whether any particular expression vector is suited for any given host cell or protein. For example, the expression vector can be introduced into the host cell, which is then monitored for viability and expression of the sequences contained in the vector. In addition, relevant texts and literature describe expression vectors and their suitability to any particular host cell. In addition to the use of expression vectors, strains are built where expression cassettes are directly integrated into the host genome.

The expression vectors are introduced or transferred, e.g., by transduction, transfection, or transformation, into the host cell. Such methods for introducing expression vectors into host cells are well known to those of ordinary skill in the art. For example, one method for transforming P. kudriavzevii with an expression vector involves a calcium chloride treatment wherein the expression vector is introduced via a calcium precipitate.

For identifying whether a nucleic acid has been successfully introduced or into a host cell, a variety of methods are available. For example, potentially transformed host cells in a culture are separated, using a suitable dilution, into individual cells and thereafter individually grown and tested for expression of a desired gene product of a gene contained in the introduced nucleic acid. For example, an often-used practice involves the selection of cells based upon antibiotic resistance that has been conferred by antibiotic resistance-conferring genes in the expression vector, such as the beta lactamase (amp), aminoglycoside phosphotransferase (neo), and hygromycin phosphotransferase (hyg, hph, hpt) genes.

In one embodiment, a host cell of the disclosure is transformed with at least one expression vector. When only a single expression vector is used, the vector will typically contain a polyketide synthase gene. Once the host cell has been transformed with the expression vector, the host cell is cultured in a suitable medium containing a carbon source, such as a sugar ( e.g ., glucose). As the host cell is cultured, expression of the polyketide synthase enzyme(s) occurs. Once expressed, these OLS(s) and other enzymes provided and utilized herein convert three molecules of malonyl-CoA and one molecule of hexanoyl-CoA or R-CoA, wherein R is defined as herein, to olivetol or a compound of formula (I).

If a host cell of the invention is to include more than one heterologous gene, the multiple genes can be expressed from one or more vectors. For example, a single expression vector can comprise one, two, or more genes encoding one, two, or more mutant OLS enzyme(s), other enzymes of the cannabinoid pathway, e.g., improved malonyl-CoA production, hexanoyl-CoA, or R-CoA production, etc. The heterologous genes can be contained in a vector replicated episomally or in a vector integrated into the host cell genome, and where more than one vector is employed, then all vectors may replicate episomally (extrachromasomally), or all vectors may integrate, or some may integrate and some may replicate episomally. While a "gene" is generally composed of a single promoter and a single coding sequence, in certain host cells, two or more coding sequences are controlled by one promoter in an operon. In some embodiments, a two or three operon system is used.

In some embodiments, the coding sequences employed have been modified, relative to some reference sequence, to reflect the codon preference of a selected host cell. Codon usage tables for numerous organisms are readily available and can be used to guide sequence design. The use of prevalent codons of a given host organism generally improves translation of the target sequence in the host cell. As one non-limiting example, in some embodiments the subject nucleic acid sequences will be modified for yeast codon preference (see, for example, Bennetzen et al., J. Biol. Chem. 257: 3026-3031 (1982)). In some embodiments, the nucleotide sequences will be modified for P. kudriavzevii codon preference (see, for example, Nakamura et al., Nucleic Acids Res. 28:292 (2000)). In other embodiments, the nucleotide sequences are modified to include codons optimized forS. cerevisiae codon preference.

Nucleic acids can be prepared by a variety of routine recombinant techniques. Briefly, the subject nucleic acids can be prepared from genomic DNA fragments, cDNAs, and RNAs, all of which can be extracted directly from a cell or recombinantly produced by various amplification processes including but not limited to PCR and rt-PCR. Subject nucleic acids can also be prepared by a direct chemical synthesis.

The nucleic acid transcription levels in a host microorganism can be increased (or decreased) using numerous techniques. For example, the copy number of the nucleic acid can be increased through use of higher copy number expression vectors comprising the nucleic acid sequence, or through integration of multiple copies of the desired nucleic acid into the host microorganism's genome. Non-limiting examples of integrating a desired nucleic acid sequence onto the host chromosome include recA-mediated recombination, lambda phage recombinase- mediated recombination and transposon insertion. Nucleic acid transcript levels can be increased by changing the order of the coding regions on a polycistronic mRNA or breaking up a polycistronic operon into multiple poly- or mono-cistronic operons each with its own promoter. RNA levels can be increased (or decreased) by increasing (or decreasing) the strength of the promoter to which the protein-coding region is operably linked.

The translation level of a desired polypeptide sequence in a host microorganism can also be increased in a number of ways. Non-limiting examples include increasing the mRNA stability, modifying the ribosome binding site (or Kozak) sequence, modifying the distance or sequence between the ribosome binding site (or Kozak sequence) and the start codon of the nucleic acid sequence coding for the desired polypeptide, modifying the intercistronic region located 5' to the start codon of the nucleic acid sequence coding for the desired polypeptide, stabilizing the 3'-end of the mRNA transcript, modifying the codon usage of the polypeptide, altering expression of low-use/rare codon tRNAs used in the biosynthesis of the polypeptide. Determination of preferred codons and low-use/rare codon tRNAs can be based on a sequence analysis of genes derived from the host microorganism.

The polypeptide half-life, or stability, can be increased through mutation of the nucleic acid sequence coding for the desired polypeptide, resulting in modification of the desired polypeptide sequence relative to the control polypeptide sequence. When the modified polypeptide is an enzyme, the activity of the enzyme in a host is altered due to increased solubility in the host cell, improved function at the desired pH, removal of a domain inhibiting enzyme activity, improved kinetic parameters (lower Km or higher k cat values) for the desired substrate, removal of allosteric regulation by an intracellular metabolite, and the like. Altered/modified enzymes can also be isolated through random mutagenesis of an enzyme, such that the altered/modified enzyme can be expressed from an episomal vector or from a recombinant gene integrated into the genome of a host microorganism.

Host Cells

Provided herein are host cells, preferably recombinant host cells, more preferably heterologous recombinant host cells for performing one or more steps of the cannabinoid pathway. In some embodiments, the recombinant host cell is a eukaryote. In various embodiments, the eukaryote is a yeast strain selected from the non-limiting list of example genera: Candida, Cryptococcus, Hansenula, Issatchenkia, Kluyveromyces, Komagataella, Lipomyces, Pichia, Rhodosporidium, Rhodotorula, Saccharomyces, or Yarrowia. Those skilled in the art will recognize that these genera broadly encompass yeast, including those distinguished as oleaginous yeast. In some embodiments, the host cell is Saccharomyces cerevisiae. In other embodiments, the host cell is Pichia kudriavzevii. In other embodiments of the invention, the eukaryotic host cell is a fungus or algae. In yet other embodiments, the recombinant host cell is a prokaryote selected from the non-limited example genera: Bacillus, Clostridium, Corynebacterium, Escherichia, Pseudomonas, Rhodobacter, and Streptomyces. In various embodiments, the host cell is P. kudriavzevii.

In one embodiment, the host cell is part of a multicellular organism. In one embodiment, the multicellular organism is a plant. In one embodiment, the plant is a cannabis plant. In one embodiment, the plant is a tobacco plant.

As utilized herein, a number of genetic modifications are further useful for increasing microbial biosynthesis of malonyl-CoA. For example, in some embodiments a host cell provided or utilized herein is further engineered to include a genetic modification useful for converting pyruvate to malonyl-CoA, wherein the genetic modification produces and/or provides a pyruvate decarboxylase, an acetaldehyde dehydrogenase, an acetyl-CoA synthetase, an acetyl- CoA carboxylase, and a carbonic anhydrase. In some embodiments, an engineered host cell provided or utilized herein is a Saccharomyces cerevisiae host cell. In some embodiments, an engineered host cell comprises heterologous enzymes that are overexpressed to increase malonyl-CoA production, thereby facilitating production of olivetol, OLA, OLA-derived compound, or another cannabinoid or cannabinoid derivative. In some embodiments, the engineered host cell comprises heterologous enzymes selected from the group consisting of an acetyl Co-A carboxylase, such as P. kudriavzevii acetyl-CoA carboxylase, S. cerevisiae aldehyde dehydrogenase, Yarrowia lipolytica acetyl-CoA synthetase, and S. cerevisiae pyruvate decarboxylase.

In some embodiments, the host cell is a Saccharomyces cerevisiae host cell. In some embodiments, a yeast host cell expressing an OLS is used to produce olivetol, OLA, OLA-derived compound, or another cannabinoid or cannabinoid derivative. In some embodiments, an oleaginous yeast host cell expressing an OLS is used to produce olivetol, OLA, OLA-derived compound, or another cannabinoid or cannabinoid derivative.

Also provided herein is a mutated OLS comprising a mutated active site, vectors for expressing the mutant, and host cells that express the mutant. In another embodiment, the host cell further produces olivetol, OLA, OLA-derived compound, or another cannabinoid or cannabinoid derivative. Introduction of mutations in the region comprising D198 to G209 of OLA increases the turnover rate (i.e., k cat values) of the mutated OLS. One or more point mutations at amino acid positions D198 to G209 can be introduced alone or in any desired combination. In these embodiments, the recombinant host cell can be, without limitation, a P. kudriavzevii or yeast, including but not limited to S. cerevisiae or other yeast, host cell.

In some aspects, provided herein are recombinant host cells, preferably host cells suitable for producing olivetol (including OLA and/or OLA-derived compounds) and other cannabinoids and cannabinoid derivatives in accordance with the methods provided herein, the host cells comprising one or more heterologous OLS enzymes, preferably OLS enzymes having an increased k cat value as compared to wild type or homologous OLS enzymes, wherein the recombinant host cells provide increase olivetol titer, yield, and/or productivity relative to a host cell not comprising a heterologous OLS enzyme. In some aspects, provided herein are recombinant host cells suitable for producing olivetol in accordance with the methods of the invention comprising increased malonyl-CoA biosynthesis. In some aspects, provided herein are recombinant host cells suitable for producing olivetol in accordance with the methods of the invention comprising increased hexanoyl-CoA synthetase biosynthesis. In some aspects, provided herein are recombinant host cells suitable for producing olivetol in accordance with the methods of the invention comprising increased pyruvate dehydrogenase biosynthesis. In some aspects, provided herein are recombinant host cells suitable for producing olivetol in accordance with the methods of the invention comprising increased acetaldehyde dehydrogenase biosynthesis. In some aspects, provided herein are recombinant host cells suitable for producing olivetol in accordance with the methods of the invention comprising increased acetyl-CoA synthetase biosynthesis. In some aspects, provided herein are recombinant host cells suitable for producing olivetol in accordance with the methods of the invention comprising increased acetyl-CoA carboxylase biosynthesis. In some aspects, provided herein are recombinant host cells suitable for producing olivetol in accordance with the methods of the invention comprising increased carbonic anhydrase biosynthesis.

In accordance with the invention, increased olivetol titer, yield, and/or productivity can be achieved through increased OLS enzymatic activity, which may require increased malonyl- CoA biosynthesis, and the invention provides host cells, vectors, enzymes, and methods relating thereto. Malonyl-CoA is produced in host cells through the activity of an acetyl-CoA carboxylase (EC 6.4.1.2) catalyzing the formation of malonyl-CoA from acetyl-CoA and carbon dioxide. The invention provides recombinant host cells for producing olivetol that express a heterologous acetyl-CoA carboxylase (ACC). In some embodiments, the host cell is a S. cerevisiae cell comprising a heterologous S. cerevisiae acetyl-CoA carboxylase ACC1 or an enzyme homologous thereto. In some embodiments, the host cell modified for heterologous expression of an ACC such as S. cerevisiae ACC1 is further modified to eliminate ACC1 post-translational regulation by genetic modification of S. cerevisiae SN FI protein kinase or an enzyme homologous thereto.

The disclosure also provides a recombinant host cell suitable for producing olivetol in accordance with the invention that is an E. coli cell that comprises a heterologous nucleic acid coding for expression of E. coli acetyl-CoA carboxylase complex proteins AccA, AccB, AccC and AccD or one or more enzymes homologous thereto. Thus, in one aspect of the invention, the recombinant host cell comprises a heterologous nucleic acid encoding a mutant OLS enzyme or another mutant cannabinoid pathway enzyme, that results in increased production of olivetol, OLA, OLA-derived compound, or another cannabinoid or cannabinoid derivative relative to host cells not comprising the mutant OLS enzyme and/or an OLS enzyme.

Thus, in accordance with the invention an OLS enzyme other than, or in addition to, OLS derived from C. sativa can be used for biological synthesis of olivetol, OLA, OLA-derived compound, or another cannabinoid or cannabinoid derivative in a recombinant host. In some embodiments, the recombinant host is P. kudriavzevii. In some embodiments, the recombinant host is S. cerevisiae. In other embodiments, the recombinant host is E. coli. In other embodiments, the recombinant host is a yeast other than P. kudriavzevii. In various embodiments, the host is modified to express a mutated OLS enzyme and/or an OLS enzyme provided or utilized herein. In various embodiments, the host is further modified to express one or more heterologous enzymes that are overexpressed to increase malonyl-CoA production. In various embodiments, the host is further modified to express or overexpress a functional hexanoyl-CoA synthetase.

Moreover, additional enzymes and catalysts other than those specifically disclosed herein can be utilized in mutated or heterologously expressed form. It will be well understood to those skilled in the art in view of this disclosure how other appropriate enzymes can be identified, modified, and expressed to achieve the desired olivetol, OLA, OLA-derived compound, or another cannabinoid or cannabinoid derivative production, as disclosed herein.

In one aspect, provided herein are recombinant host cells suitable for biological production of cannabinoids and derivatives, such as without limitation olivetol, OLA, OLA- derived compound, or another cannabinoid or cannabinoid derivative. Any suitable host cell is useful in practice of the methods provided herein. In some embodiments, the host cell is a recombinant host microorganism in which nucleic acid molecules have been inserted, deleted or modified (i.e., mutated; e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), either to produce olivetol, or to increase yield, titer, and/or productivity of olivetol relative to a "wild type", "control cell", "parental cell", or "reference cell". A "control cell" can be used for comparative purposes, and is typically a wild type or recombinant parental cell that does not contain one or more of the modification(s) made to the host cell of interest.

In some embodiments, the invention provides a recombinant host cell that has been modified to produce one or more enzymes that facilitate malonyl-CoA production. In some embodiments, the invention provides a recombinant host cell that has been modified to produce one or more enzymes of the cannabinoid pathway. In some embodiments, the invention provides a recombinant host cell that has been modified to produce an OLS, such as, without limitation, an engineered or modified OLS, for example, olivetol synthase, having improved k cat values. In some embodiments, the invention provides a recombinant host cell that has been modified to produce an OLS, such as, without limitation, an engineered or modified OLS having improved solubility in the host. In some embodiments, the invention provides a recombinant host cell that has been modified to produce an OLS, such as, without limitation, an engineered or modified OLS or having improved stability in the host. Thus, various embodiments of the invention provide recombinant host cells capable of producing increased amounts of olivetol, OLA, OLA-derived compound, or another cannabinoid or cannabinoid derivative (i.e., product) per unit time. Accordingly, various embodiments of the invention provide recombinant host cells capable of achieving higher titers of product over shorter fermentation run times.

With respect to production titer levels, the recombinant host cells provided or utilized herein produce titer levels that exceed production titer levels of control cells. In some embodiments, the recombinant host cells provided or utilized herein produce titer levels that are suitable for commercial production, for example approximately 1-20 g/L, such as 2-10 g/L or 3-8 g/L, or greater. The recombinant host cells described herein promote high titer levels of product(s) in at least two ways. First, the recombinant host cells produce mutated OLS enzymes having improved synthetase kinetics (i.e., an increase in k cat ), which allows for faster product production, thereby increasing the rate and ease at which a desired titer level can be achieved. Secondly, the materials and methods provided or utilized herein provide and facilitate in situ extraction of the product into an organic phase, as described below. By adding the organic phase directly to the broth during the fermentation, the product can be quickly and continuously separated from the fermentation process, thereby decreasing undesirable effects of the product on the fermentation process, such as toxicity and product inhibition feedback on the pathway enzymes, thereby further increasing the titer levels of the product(s). Additionally, various genetic modifications provided or utilized herein are useful for increasing the provision of malonyl-CoA, which is a substrate for OLS.

In one embodiment, provided herein are recombinant yeast cells suitable for the production of cannabinoids and derivatives such as, without limitation, olivetol, at levels sufficient for subsequent purification and use as described herein. Yeast host cells are excellent host cells for construction of recombinant metabolic pathways comprising heterologous enzymes catalyzing production of small molecule products. There are established molecular biology techniques and nucleic acids encoding genetic elements necessary for construction of yeast expression vectors, including, but not limited to, promoters, origins of replication, antibiotic resistance markers, auxotrophic markers, terminators, and the like. Second, techniques for integration of nucleic acids into the yeast chromosome are well established. Yeast also offers a number of advantages as an industrial fermentation host. Yeast can tolerate high concentrations of organic acids and maintain cell viability at low pH and can grow under both aerobic and anaerobic culture conditions, and there are established fermentation broths and fermentation protocols. The ability of a strain to propagate and/or produce desired product under low pH provides a number of advantages. First, this characteristic provides tolerance to the environment created by the production of malonic acid. Second, from a process standpoint, the ability to maintain a low pH environment limits the number of organisms that are able to contaminate and spoil a batch.

In some embodiments of the invention, the recombinant host cell comprising a heterologous nucleic acid provided or utilized herein is a eukaryote. In various embodiments, the eukaryote is a yeast selected from the non-limiting list of genera; Saccharomyces, Candida, Cryptococcus, Hansenula, Issatchenki, Kluyveromyces, Komagataella, Lipomyces, Pichia, Rhodosporidium, Rhodotorula, or Yarrowia species. In various embodiments, the yeast is of a species selected from the group consisting of Candida albicans, Candida ethanolica, Candida krusei, Candida methanosorbosa, Candida sonorensis, Candida tropicalis, Cryptococcus curvotus, Hansenula polymorpha, Issotchenkio orientalis, Kluyveromyces loctis, Kluyveromyces marxianus, Kluyveromyces thermotolerans, Komagataella pastoris, Lipomyces starkeyi, Pichia angusta, Pichia deserticola, Pichia galeiformis, Pichia kodamae, Pichia kudriavzevii, Pichia membranaefaciens, Pichia methanolica, Pichia pastoris, Pichia salictaria, Pichia stipitis, Pichia thermotolerans, Pichia trehalophila, Rhodosporidium toruloides, Rhodotorula glutinis, Rhodotorula graminis, Saccharomyces bayanus, Saccharomyces boulardi, Saccharomyces cerevisiae, Saccharomyces kluyveri, and Yarrowia lipolytica. One skilled in the art will recognize that this list encompasses yeast in the broadest sense, including both oleaginous and non- oleaginous strains.

Other recombinant host cells provided or utilized herein include without limitation, eukaryotic, prokaryotic, and archaea cells. Illustrative examples of eukaryotic cells include, but are not limited to: Aspergillus niger, Aspergillus oryzae, Crypthecodinium cohnii,

Cunninghamella japonica, Entomophthora coronata, Mortierella alpina, Mucor circinelloides, Neurospora crassa, Pythium ultimum, Schizochytrium limacinum, Thraustochytrium aureum, Trichoderma reesei and Xanthophyllomyces dendrorhous. In general, if a eukaryotic cell is used, a non-pathogenic strain is employed. Illustrative examples of non-pathogenic strains include but are not limited to: Pichia pastoris and Saccharomyces cerevisiae. In addition, certain strains, including Saccharomyces cerevisiae, have been designated by the Food and Drug Administration as Generally Regarded As Safe (or GRAS) and so can be conveniently employed in various embodiments of the methods of the invention.

Illustrative and non-limiting examples of recombinant prokaryotic host cells provided or utilized herein include, Bacillus subtilis, Brevibacterium ammoniagenes, Clostridium beigerinckii, Corynebacterium glutamicum, Escherichia coli, Enterobacter sakazakii, Lactobacillus acidophilus, Lactococcus lactis, Mesorhizobium loti, Pseudomonas aeruginosa, Pseudomonas putida, Rhodobacter capsulatus, Rhodobacter sphaeroides, Salmonella enterica, Salmonella typhi, Salmonella typhimurium, Shigella flexneri, Staphylococcus aureus, Streptomyces ambofaciens, Streptomyces aureofaciens, Streptomyces aureus, Streptomyces fungicidicus, Streptomyces griseochromogenes, Streptomyces griseus, Streptomyces lividans, Streptomyces olivogriseus, Streptomyces rameus, Streptomyces tanashiensis, and Streptomyces vinaceus. Certain of these cells, including Bacillus subtilis, Corynebacterium glutamicum, and Lactobacillus acidophilus, have been designated by the Food and Drug Administration as Generally Regarded As Safe (or GRAS) and so are employed in various embodiments of the methods of the invention. While desirable from public safety and regulatory standpoints, GRAS status does not impact the ability of a host strain to be used in the practice of this invention; hence, non-GRAS and even pathogenic organisms are included in the list of illustrative host strains suitable for use in the practice of this invention.

Escherichia coli and Corynebacterium glutamicum are suitable prokaryotic host cells for metabolic pathway construction. Wild type E. coli can catabolize both pentose and hexose sugars as carbon sources. Provided herein are variety of E. coli host cells suitable for the production of malonate as described herein. In various embodiments, the recombinant host cell comprising a heterologous nucleic acid provided or utilized herein is an E. coli cell. In various embodiments of the methods of the invention, the recombinant host cell comprising a heterologous nucleic acid provided or utilized herein is a C. glutamicum cell.

Fermentation

In one embodiment, the fermentation is performed at a pH of about 5-6, preferably at about 5.5. In one embodiment, the fermentation is performed at a temperature of about 30°C. In one embodiment, the organic solvent immiscible with aqueous phase employed in the fermentation is loaded at about 26% of total fermentation tank volume. In one embodiment, the organic solvent immiscible with aqueous phase employed in the fermentation is loaded at about 40% of initial fermentation tank volume. In one embodiment, Isopropyl myristate is the aqueous phase immiscible organic solvent. In one embodiment, the aqueous phase immiscible organic solvent is added about 12 to about 36 hours post inoculation.

In one embodiment, the fermentation is performed wherein the compound of formula RCO2H or a salt thereof is present in an amount of 0.1-0.3 moles/500 g of glucose in feed. In one embodiment, the fermentation is performed wherein the compound of formula RCO2H or a salt thereof is present in an amount of 0.14-0.25 moles/500 g of glucose in feed. In one embodiment, the fermentation is performed wherein the compound of formula RCO2H or a salt thereof is present in an amount of 0.14-0.21 moles/500 g of glucose in feed. In one embodiment, the fermentation is performed wherein the sodium hexanoate/glucose in feed ratio was in the range of about 20 to about 28 g sodium hexanoate/ 500 g glucose. In one embodiment, the fermentation is performed wherein the sodium hexanoate/glucose in feed ratio was in the range of about 23 to about 28 g sodium hexanoate/ 500 g glucose.

In one embodiment, the oxygen transmission rate (OTR) is about 60 - about 80 mmoles/L/hr. In one embodiment, an oxygen uptake rate (OUR) of about 100 - about 110 mmoles/L/hr is achieved. In one embodiment, the pulse parameter was about 1.7 g glucose/L initial tank volume/pulse with a feed rate of about 10 g/L of initial tank volume/hr. In one embodiment, the batch glucose concentration employed in the fermentation was about 10 - about 20 g/L.

Synthesis, Utilization, and Purification of Cannabinoids and Derivatives

In some aspects, provided herein are methods of producing a cannabinoid, a cannabinoid derivative, a cannabinoid precursor, or a cannabinoid precursor derivative. In some embodiments, the methods may involve culturing a genetically modified host cell of the present disclosure in a suitable medium and recovering the produced cannabinoid, the cannabinoid precursor, the cannabinoid precursor derivative, or the cannabinoid derivative. The methods may also involve cell-free production of cannabinoids, cannabinoid precursors, cannabinoid precursor derivatives, or cannabinoid derivatives using one or more polypeptides disclosed herein expressed or overexpressed by a genetically modified host cell of the disclosure.

In some embodiments, provided herein are methods of producing a cannabinoid or a cannabinoid derivative. The methods may involve culturing a genetically modified host cell of the present disclosure in a suitable medium and recovering the produced cannabinoid or cannabinoid derivative. The methods may also involve cell-free production of cannabinoids or cannabinoid derivatives using one or more polypeptides disclosed herein expressed or overexpressed by a genetically modified host cell of the disclosure.

Cannabinoids, cannabinoid derivatives, cannabinoid precursors, or cannabinoid precursor derivatives that can be produced according to the present disclosure may include, but are not limited to, cannabichromene (CBC) type (e.g., cannabichromenic acid), cannabigerol (CBG) type (e.g., cannabigerolic acid), cannabidiol (CBD) type (e.g., cannabidiolic acid), D 9 -trans- tetrahydrocannabinol (D 9 -THC) type (e.g., A 9 -tetrahydrocannabinolic acid), A 8 -trans- tetrahydrocannabinol (D 8 -THC) type, cannabicyclol (CBL) type, cannabielsoin (CBE) type, cannabinol (CBN) type, cannabinodiol (CBND) type, cannabitriol (CBT) type, olivetolic acid, GPP, derivatives of any of the foregoing, and others as listed in Elsohly M.A. and Slade D., Life Sci.2005 Dec 22;78(5):539-48. Epub 2005 Sep 30.

Cannabinoids or cannabinoid derivatives that can be produced with the methods or genetically modified host cells of the present disclosure may also include, but are not limited to, cannabigerolic acid (CBGA), cannabigerolic acid monomethylether, (CBGAM), cannabigerol (CBG), cannabigerol monomethylether (CBGM), cannabigerovarinic acid (CBGVA), cannabigerovarin (CBGV), cannabichromenic acid (CBCA), cannabichromene (CBC), cannabichromevarinic acid (CBCVA), cannabichromevarin (CBCV), cannabidiolic acid (CBDA), cannabidiol (CBD), cannabidiol monomethylether (CBDM), cannabidiol-C4 (CBD-C4), cannabidivarinic acid (CBDVA), cannabidivarin (CBDV), cannabidiorcol (CBD-Ci), D 9 - tetrahydrocannabinolic acid A (THCA-A), A 9 -tetrahydrocannabinolic acid B (THCA-B), D 9 - tetrahydrocannabinol (THC), A 9 -tetrahydrocannabinolic acid-C4 (THCA-C4), D 9 - tetrahydrocannabinol-C4 (THC-C4), D 9 -tetrahydrocannabivarinic acid (THCVA), D 9 - tetrahydrocannabivarin (THCV), D 9 - tetrahydrocannabiorcolic acid (THCA-Ci), D 9 - tetrahydrocannabiorcol (THC-Ci), A 7 -cis- iso-tetrahydrocannabivarin, D 8 - tetrahydrocannabinolic acid (A 8 -THCA), A 8 -tetrahydrocannabinol (A 8 -THC), cannabicyclolic acid (CBLA), cannabicyclol (CBL), cannabicyclovarin (CBLV), cannabielsoic acid A (CBEA-A), cannabielsoic acid B (CBEA- B), cannabielsoin (CBE), cannabielsoinic acid, cannabicitranic acid, cannabinolic acid (CBNA), cannabinol (CBN), cannabinol methylether (CBNM), cannabinol-C4, (CBN-C4), cannabivarin (CBV), cannabinol-C2 (CNB-C2), cannabiorcol (CBN-Ci), cannabinodiol (CBND), cannabinodivarin (CBVD), cannabitriol (CBT), 10-ethyoxy-9-hydroxy-delta-6a- tetrahydrocannabinol, 8,9-dihydroxyl-delta-6a-tetrahydrocannabinol, cannabitriolvarin (CBTVE), dehydrocannabifuran (DCBF), cannabifuran (CBF), cannabichromanon (CBCN), cannabicitran (CBT), 10-oxo-delta-6a-tetrahydrocannabinol (OTHC), delta-9-cis- tetrahydrocannabinol (cis-THC), 3,4,5,6-tetrahydro-7-hydroxy-alpha-alpha-2-trimethyl-9-n- propyl-2,6-methano-2H-l-benzoxocin-5-methanol (OH-iso-HHCV), cannabiripsol (CBR), trihydroxy-delta-9-tetrahydrocannabinol (triOH-THC), and derivatives of any of the foregoing.

Additional cannabinoid derivatives that can be produced with the methods or genetically modified host cells of the present disclosure may also include, but are not limited to, 2-geranyl-5-pentyl-resorcylic acid, 2-geranyl-5-(4-pentynyl)-resorcylic acid, 2-geranyl-5- (trans- 2-pentenyl)-resorcylic acid, 2-geranyl-5-(4-methylhexyl)-resorcylic acid, 2-geranyl-5- (5-hexynyl) resorcylic acid, 2-geranyl-5-(trans-2-hexenyl)-resorcylic acid, 2-geranyl-5-(5- hexenyl)-resorcylic acid, 2-geranyl-5-heptyl-resorcylic acid, 2-geranyl-5-(6-heptynoic)- resorcylic acid, 2-geranyl-5- octyl-resorcylic acid, 2-geranyl-5-(trans-2-octenyl)-resorcylic acid, 2-geranyl-5-nonyl-resorcylic acid, 2-geranyl-5-(trans-2-nonenyl) resorcylic acid, 2- geranyl-5-decyl-resorcylic acid, 2-geranyl- 5-(4-phenylbutyl)-resorcylic acid, 2-geranyl-5-(5- phenylpentyl)-resorcylic acid, 2-geranyl-5-(6- phenylhexyl)-resorcylic acid, 2-geranyl-5-(7- phenylheptyl)-resorcylic acid, (6aR,10aR)-l- hydroxy-6,6,9-trimethyl-3-propyl-6a,7,8,10a- tetrahydro-6H-dibenzo[b,d]pyran-2-carboxylic acid, (6aR,10aR)-l-hydroxy-6, 6, 9-trimethyl- 3-(4-methylhexyl)-6a,7,8,10a-tetrahydro-6H- dibenzo[b,d]pyran-2-carboxylic acid, (6aR,10aR)-l-hydroxy-6,6,9-trimethyl-3-(5-hexenyl)- 6a,7,8,10a-tetrahydro-6H- dibenzo[b,d]pyran-2-carboxylic acid, (6aR,10aR)-l-hydroxy-6,6,9- trimethyl-3-(5-hexenyl)- 6a,7,8,10a-tetrahydro-6H-dibenzo[b,d]pyran-2-carboxylic acid, (6aR,10aR)-l-hydroxy- 6,6,9-trimethyl-3-(6-heptynyl)-6a,7,8,10a-tetrahydro-6H- dibenzo[b,d]pyran-2-carboxylic acid, 3-[(2E)-3,7-dimethylocta-2,6-dien-l-yl]-6-(hexan-2-yl)-2,4- dihydroxybenzoic acid, 3- [(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6-(2- methylpentyl)benzoic acid, 3- [(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6-(3- methylpentyl)benzoic acid, 3- [(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6-(4- methylpentyl)benzoic acid, 3- [(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6-[(lE)- pent- l-en-l-yl]benzoic acid, 3- [(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6-[(2E)- pent-2-en- l-yl]benzoic acid, 3- [(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6-[(2E)- pent-3-en-l- yl] benzoic acid, 3- [(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6-(pent- 4-en-l-yl)benzoic acid, 3- [(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6-propyl benzoic acid, 3-[(2E)-3,7- dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6-butylbenzoic acid, 3-[(2E)-3,7-dimethylocta- 2,6- dien-l-yl]-2,4-dihydroxy-6-hexylbenzoic acid, 3-[(2E)-3,7-dimethylocta-2,6-dien-l-yl]- 2,4- dihydroxy-6-heptylbenzoic acid, 3-[(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2, 4-dihydroxy- 6- octylbenzoic acid, 3-[(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6- nonanyl benzoic acid, 3-[(2E)-3,7-dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6- decanylbenzoic acid, 3-[(2E)-3,7- dimethylocta-2,6-dien-l-yl]-2,4-dihydroxy-6- undecanylbenzoic acid, 6-(4-chlorobutyl)-3-[(2E)- 3,7-dimethylocta-2,6-dien-l-yl]-2,4- dihydroxybenzoic acid, 3-[(2E)-3,7-dimethylocta-2,6-dien- l-yl]-2,4-dihydroxy-6-[4- (methylsulfanyl)butyl]benzoic acid, and others as listed in Bow, E. W. and Rimoldi, J. M., "The Structure-Function Relationships of Classical Cannabinoids: CB1/CB2 Modulation," Perspectives in Medicinal Chemistry 2016:8, 17-39 doi: 10.4137/PMC.S32171, incorporated herein by reference. Methods of determining the activity and properties of cannabinoids and cannabinoid derivatives are well known (see, e.g., Bow and Rimoldi, supra), and can be adapted in view of the present disclosure by the skilled artisan.

Certain non-limiting processes for producing certain compounds of formulas IA and IB are exemplified. A skilled artisan will be able to prepare other compounds of formulas IA and IB based on the disclosure provided herein.

EXAMPLES

These examples illustrate but do not limit the disclosed invention. Methods and strains useful in accordance with this invention can be adapted by a skilled artisan from US 10,392,635 (incorporated herein by reference).

Strain Construction Examples

Certain strains may be renumbered over time for convenience, as will be apparent to the skilled artisan.

Example 1A: Construction of LSC3-16 and LSC3-2

LSC3-2 was iteratively constructed by transforming chemically competent JK9-3d (LSC3- 1) with pAG304GalllOOSOACCSAAEl, pAG305GalllOOSOACCSAAEl and pAG306GalllOOSOACCSAAEl and controlling their genomic copy number at specific genomic loci. pAG304GalllOOSOACCSAAEl and pAG306GalllOOSOACCSAAEl were constructed by first amplifying the yeast shuttle vectors designated as pAG304 and pAG306 with the primers pAG304_fwd and pAG304_rev to amplify the Saccharomyces cerevisiae prototrophy genetic elements in addition, E. coli origins of replication and an ampicillin resistant expression cassette. The dual promoter system pGall and pGallO was amplified from genomic DNA of JK9- 3d with primers Gall_10_fwd and Gall_10_rev. The csAAEl and OS-T2A-OAC fragments were amplified from sequences that were stored in pUC19 subcloning vectors. Amplified DNA fragments were mixed at equimolar concentrations with their respective shuttle vector sequences (pAG304 or pAG306) and preassembled by Gibson Assembly using the NEBuilder HiFi DNA Assembly Mix (NEB E5520S). The final sequences pAG304GalllOOSOACCSAAEl and pAG306GalllOOSOACCSAAEl. To generate the DNA fragment pAG305GalllOOSOACCSAAEl, the template GalllOCBGA was amplified with Gall_10_fwd and Gall_10_rev to generate the yeast shuttle vector containing leucine prototrophy, dual expression promoter pGall and pGallO, and the OS-T2A-OAC fragment. The csAAEl fragment was amplified from a pUC19 subcloning containing the csAAEl gene fragment using primers CB_CSAAEl_fwd and CB_CSAAEl_rev. The amplified sequences were mixed at equimolar concentrations and assembled by Gibson Assembly using the NEBuilder HiFi DNA Assembly Mix.

First a parental strain to LSC3-2, designated as LSC3-16, was generated. LSC3-16 was generated by transforming 2 microgram (ug) of Aflll (NEB R0520S) linearized pAG305GalllOOSOACCSAAEl into chemically competent JK9-3d mating type alpha cells that are auxotrophic to leucine, histidine, tryptophan, and uracil. Selection for pAG305GalllOOSOACCSAAEl integration was done with leucine prototrophy rescue on yeast nitrogen base agar plates with dropout amino acid mixes deficient in leucine supplemented with 100 mg/L glucose. Genetic copy number of pAG305GalllOOSOACCSAAEl integrated at chromosome III was initially quantitated by qPCR through isolation of sister clones from the transformation. The highest copy integrant was taken and designated as LSC3-16.

Chemically competent LSC3-16 were co-transformed with both pAG304GalllOOSOACCSAAEl and pAG306GalllOOSOACCSAAEl and selected on yeast nitrogen base agar plates with dropout amino acid mixes deficient in leucine, uracil and tryptophan supplemented with 100 mg/L glucose. Genetic copy number for p AG 304Ga 1110OSO ACCS AAE 1, pAG305GalllOOSOACCSAAEl and pAG306GalllOOSOACCSAAEl integrated at chromosomes 4, 3 and 5 were quantitated by qPCR through isolation of genomic DNA of sister clones on selection plates. A correlation of both total polyketide and 0A:0 molar ratio was observed. Whole genomic sequencing was done on LSC3-16 and LSC3-2 which had achieved titers of 150 mg/L and 350 mg/L in shake flask experiments, respectively. Genetic copy numbers were determined to be 6 (LSC3-16) and 16 (LSC3-2).

Table of Primers Used

Table of Sequences

Example IB: LSC3-4 strain

Table of primers used in examples IB through IF

Table of strain names and parents/differential genotypes used in examples IB through IF

LSC3-4 was generated from LSC3-2 by integrating a cassette containing PkHIS4 (HIS4 from Pichia kudriavzevii) preceded by the TEF1 promoter from Saccharomyces cerevisiae (pScTEFl) into the GAL80 locus using PCR fragments amplified from Pichia kudriavzevii genomic DNA using primers Y011334 and Y011335, and from S. cerevisiae genomic DNA using primers YO11307 and YO11308. The two PCR fragments were transformed into chemically competent LSC3-2, and colonies were selected on defined media containing Complete Supplement Mixture (CSM; Formedium, Hunstanton, UK) without histidine (CSM-His). Integration of the cassette at the GAL80 locus was confirmed by colony PCR.

Example 1C: LSC3-13 strain

LSC3-13 was generated from LSC3-4 by integrating a cassette (HygR) containing the hph gene encoding hygromycin-B 4-O-kinase from Escherichia coli (with a GGT codon inserted immediately after the start codon) flanked by the 379 bp TEF1 promoter and 240 bp TEF1 terminator from Ashbya gossypii, into the MIG1 locus. A PCR fragment containing the HygR cassette was amplified from an in-house plasmid (pLYG-001) using primers Y011337 and Y011338. The PCR fragment was transformed into chemically competent LSC3-4, and colonies were selected on YPD medium containing 300 mg/mL hygromycin B. Integration of the cassette at the MIG1 locus was confirmed by colony PCR.

Example ID: LSC3-18 strain

LSC3-18 was generated from LSC3-4 by integrating the HygR cassette into the GAL1 locus. A PCR fragment containing the HygR cassette was amplified from an in-house plasmid using primers Y011436 and Y011437. The PCR fragment was transformed into chemically competent LSC3-4, and colonies were selected on YPD medium containing 300 mg/mL hygromycin. Integration of the cassette at the GAL1 locus was confirmed by colony PCR.

Example IE: LSC3-13 gall A strain (LSC3-64. LSC3-65)

LSC3-64 and LSC3-65 were generated by transforming two PCR fragments into chemically competent LSC3-13 that together compose a split KanMX marker (with a TEF1 promoter and terminator from Ashbya gossypii flanking a kanR gene encoding an aminoglycoside phosphotransferase conferring G418 resistance) flanked by lox66 and lox71 recombination sites, replacing the GAL1 gene region. This cassette is hereafter referred to as a loxKanMX cassette. The first fragment was amplified from an internal plasmid containing the assembled KanMX cassette flanked by lox sites (pLOA-058) using primers Y011791 and Y0316, binding internally in the KanMX cassette. The second PCR fragment, was generated by PCR from the same internal plasmid template using primers Y0949, binding internally in the KanMX cassette, and Y011792. Colonies were selected on YPD agar plates containing 200 mg/mL G418, and two isolates (LSC3-64 and LSC3-65) containing the full length integration at the desired locus were confirmed by colony PCR.

Example IF: Additional strain construction examples

An antibiotic marker-free version of LSC3-13 (LSC3-133 and LSC3-134) was generated by first integrating a cassette containing the HIS4 gene from S. cerevisiae CEN.PK2-1C MATa, together with its native upstream promoter and downstream terminator regions, and flanked by lox66 and lox71 recombination sites, into the GAL80 locus. This "loxHIS4" cassette was amplified in two fragments from an in-house constructed vector (pLOA-027) using primers Y011438 and Y011431, and Y011432 and Y011439. The PCR fragments were transformed into chemically competent LSC3-2, and colonies were selected on CSM-His agar plates. Integration of the cassette at the GAL80 locus was confirmed by colony PCR, and the strain was designated LSC3-46. Subsequently, the integrated functional HIS4 marker was looped out by transforming an in-house vector (pLYG-005) expressing Cre recombinase and harboring the CEN/ARS origin of replication. Transformants were selected on YPD plates containing 200 mg/mL G418 and up to 50 colonies were restruck on both G418 and YPD plates to screen for colonies that were spontaneously cured of pLYG-005 (grow on YPD but not on YPD plus G418). Cured isolates were then confirmed for loss of HIS4 by colony PCR and checking for lack of growth on CSM-His plates. One confirmed isolate was designated LSC3-103. Following this, a cassette consisting of the PkHIS4 marker with promoter and terminator as previously described, flanked with lox66 and lox71 recombination sites (hereafter referred to as a loxPkHIS4 cassette), was amplified from an in-house vector (pLOA-093) in two PCR fragments and integrated into the MIG1 locus of LSC3-103. The first fragment was amplified using primers YO12096 and YO12098, and the second fragment was amplified using primers YO12018 and YO12097. Both fragments were transformed into the chemically competent loopout strain and colonies were selected on CSM- His agar plates. Integration of the cassette into the MIG1 locus was confirmed by colony PCR.

An alternative marker-free version of LSC3-13 (LSC3-133A and LSC3-134A) was generated by integrating a cassette containing the HygR cassette previously described, flanked by the lox66 and lox71 recombination sites (hereafter referred to as a loxHygR cassette). Two PCR fragments were amplified from an in-house vector containing the loxHygR cassette (pLOA- 094) using primers YO12096 and Y0343, and Y0189 and YO12097. The two PCR fragments were transformed into chemically competent LSC3-4 and colonies selected on YPD medium containing 300 mg/mL hygromycin B. Integration of the cassette at the MIG1 locus was confirmed by colony PCR. Subsequently, the HygR cassette was looped out by transforming the resulting strain above with pLYG-005, expressing Cre recombinase and harboring the CEN/ARS origin of replication. Transformants were selected on YPD plates containing 200 mg/mL G418 and up to 50 colonies were restruck on both YPD plus G418 and YPD plates to screen for colonies that were spontaneously cured of pLYG-005. Cured isolates were confirmed for loss of HygR by colony PCR and checking for lack of growth on YPD plus 300 mg/mL hygromycin-B plates.

LSC3-89 and LSC3-90 were generated by transforming two PCR fragments into chemically competent LSC3-13 that together comprise a split loxKanMX cassette as described above into the GPD1 locus. The first fragment was amplified as described above using primers YO11970 and Y0316, and the second PCR fragment was amplified as described above using primers Y0949 and Y011971. Similarly, LSC3-91 and LSC3-92 were generated in an identical way except into chemically competent LSC3-18. Colonies were selected on YPD medium containing 200 mg/mL G418 and integration in the GPD1 locus was confirmed by colony PCR.

To prevent degradation of hexanoic acid and hexanoyl-CoA through native peroxisomal b-oxidation pathways, genes were individually disrupted and tested in the LSC3-2 background. These included FAA2 (peroxisomal medium chain fatty acyl-CoA synthetase), PXA1 (part of the heterodimeric peroxisomal fatty acid and/or acyl-CoA ABC transport complex with PXA2),

PEX11 (peroxisomal protein required for medium-chain fatty acid oxidation), and ANTI (peroxisomal adenine nucleotide transporter, which exchanges AMP generated in peroxisomes by acyl-CoA synthetases for ATP, that is consumed in that reaction, from the cytosol). LSC3-48 and LSC3-49 (FAA2 knockouts) were generated by integrating a loxHIS4 cassette as 2 PCR fragments in the 3' portion (starting at nucleotide position 412) of the FAA2 locus. The immediate 5' portion of the gene containing the first 411 nucleotides of FAA2 and its upstream region were preserved due to overlap with the BUD25 locus transcribed from the complement strand. The two PCR fragments were amplified from pLOA-027 using primers Y011478 and Y011431, and Y011432 and Y011479, transformed into chemically competent LSC3-2, and colonies were selected CSM-His agar plates. LSC3-63 (PXA1 knockout) was generated by integrating a loxHIS4 cassette as 2 PCR fragments in the PXA1 locus. The two PCR fragments were amplified from pLOA-027 using primers Y011795 and Y011431, and Y011432 and Y011796, transformed into chemically competent LSC3-2, and colonies were selected on CSM- His agar plates. To generate the PEX11 and ANTI knockouts, the native non-functional HIS4 locus was first knocked out in LSC3-2 by integrating a HygR cassette, generating strain LSC3-52. A PCR fragment was amplified from pLYG-001 using primers YO11709 and YO11710, transformed into chemically competent LSC3-2, and colonies were selected on YPD plus 300 mg/mL hygromycin B, with integration at the HIS4 locus confirmed by colony PCR. This strain exhibited enhanced efficiency of desired integrations using the loxHIS4 cassette, due to reduced homology with the native HIS4 locus. LSC3-74 and LSC3-75 (PEX11 knockouts) were subsequently generated by integrating a loxHIS4 cassette as 2 PCR fragments into the PEX11 locus of LSC3-52. The two PCR fragments were amplified from pLOA-027 using primers Y011498 and Y011431, and Y011432 and Y011499, transformed into chemically competent LSC3-52, and colonies were selected on CSM-His agar plates. LSC3-76 and LSC3-77 (ANTI knockouts) were generated by integrating a loxHIS4 cassette as 2 PCR fragments into the ANTI locus of LSC3-52. The two PCR fragments were amplified from pLOA-027 using primers YO11680 and Y011431, and Y011432 and Y011681, transformed into chemically competent LSC3-52, and colonies were selected on CSM-His agar plates. Integrations of cassettes into the desired loci were all confirmed by colony PCR for all strains.

To reduce proteolysis of the heterologously expressed pathway proteins, common proteases were additionally deleted in the LSC3-2 background. LSC3-47 (harboring a knockout of PRB1, encoding vacuolar proteinase B) was generated by integrating a loxHIS4 cassette in the PRB1 locus using 2 PCR fragments. The two PCR fragments were amplified from pLOA-027 using primers Y011488 and Y011431, and Y011432 and Y011489, transformed into chemically competent LSC3-2, and colonies were selected on CSM-His agar plates. LSC3-87 and LSC3-88 (harboring knockouts of PEP4, encoding vacuolar aspartyl protease/proteinase A) were generated by integrating a loxHIS4 cassette at the PEP4 locus in LSC3-52 using two PCR fragments. The two PCR fragments were amplified from pLOA-027 using primers Y011687 and Y011431, and Y011432 and Y011688, transformed into chemically competent LSC3-52, and colonies were selected on CSM-His agar plates. Integrations of cassettes into both desired loci were confirmed by colony PCR.

Additional knockouts can subsequently be combined from any combination of integrated lox66/lox71 flanked cassettes by transforming into strains where the previous marker was looped out by transforming pLYG-005, isolating colonies spontaneously cured for pLYG-005 with confirmed loopout by colony PCR and phenotypic checks, and integrating the next lox site flanked marker into a new locus. For example, a strain can harbor knockouts in modifications that allow production from glucose (GAL80 knockout in combination with either MIG1 or GAL1 knockouts), knockouts in genes involved in hexanoic acid or hexanoyl-CoA degradation (e.g. FAA2 and ANTI knockouts), and/or knockouts in one or multiple proteases involved in degradation of expressed heterologous pathway proteins (e.g. PRB1 and PEP4 knockouts).

Example 1G: Small-scale strain screening examples

Strains were tested either in shake flasks or an adapted protocol scaling down to 96 well plates. For shake flask testing, precultures were grown overnight (approximately 16-24 hours) in 15 or 30 mL of YP + 2% (w/v) glucose in 250 mL baffled shake flasks at 30°C with 200 rpm shaking and 80% humidity. Main cultures were inoculated using between 1 to 3 mL of preculture in 250 mL baffled shake flasks containing 30 mL of YP + 0.02-0.04% (w/v) hexanoic acid + 2% (w/v) galactose or 2% (w/v) glucose + 5 mL of isopropyl myristate (IPM). In some experiments, the percentage of galactose or glucose was altered, the percent hexanoic acid added was modified, or the overlay was intentionally not added or was replaced with alternative overlay candidates, such as diethyl sebacate, di-tert-butyl malonate, or methyl soyate. Sampling time was between 24 to 50 hours as indicated. The shake flask experiments were scaled down to 96 well deepwell plate format. Precultures from colonies of each strain were grown in 300 pL YP + 2% (w/v) glucose. Main cultures containing 300 pL YP + 2% (w/v) galactose (or glucose or combinations of galactose and glucose) + 0.04% (w/v) hexanoic acid + 20% (v/v) IPM (60 pL) or alternative overlay candidates, were grown at 30°C with 950 rpm shaking and 80% humidity in an Infors Multitron plate shaker, and the IPM or diethyl sebacate overlay was sampled at different elapsed times between 18 and 48 hours post-inoculation, following acidification of the media with 10 pL of 5 M phosphoric acid. Overlay from the cultures was diluted 2:1 with methanol prior to HPLC analysis.

Additional defined media optimization and production experiments were conducted with YNB or Delft medium base. YNB medium was initially optimized and consisted of 100 mL/L of a 10X YNB stock solution (containing 68 g/L yeast nitrogen base without amino acids from Sigma-Aldrich, product number Y0626), optionally 1 mL/L of 10% Bacto™ casamino acids (BD Biosciences), 300 mL/L of 1 M MES buffer (pH 6.5), optionally 3.6 mL/L of a trace element solution (containing 130 g/L citric acid monohydrate, 0.574 g/L copper (II) sulfate pentahydrate, 8.07 g/L iron (III) chloride hexahydrate, 0.5 g/L boric acid, 0.333 g/L manganese (II) chloride, 0.2 g/L sodium molybdate, and 4.67 g/L zinc sulfate heptahydrate), and optionally 1 mL/L of a vitamin solution (containing 0.008 g/L biotin, 1.6 g/L calcium pantothenate, 0.008 g/L folic acid, 8 g/L myo-inositol, 1.6 g/L nicotinic acid, 0.8 g/L p-aminobenzoic acid, 1.6 g/L pyridoxal hydrochloride, 0.8 g/L riboflavin, 1.6 g/L thiamine hydrochloride, adjusted to pH 10.5 with sodium hydroxide.

Delft CSM medium, consisted of (per liter solution) 7.5 g ammonium sulfate, 14.4 g potassium phosphate monobasic, 0.5 g magnesium sulfate heptahydrate (with these first three components prepared as an 0.9X solution and adjusted to pH 6.5 with sodium hydroxide prior to autoclaving), 3.6 mL of a trace metal solution (consisting of 130 g/L citric acid monohydrate, 0.574 g/L copper (II) sulfate pentahydrate, 8.07 g/L iron (III) chloride hexahydrate, 0.5 g/L boric acid, 0.333 g/L manganese (II) chloride, 0.2 g/L sodium molybdate, and 4.67 g/L zinc sulfate heptahydrate), 1.0 mL of a vitamin solution (0.008 g/L biotin, 1.6 g/L calcium pantothenate, 0.008 g/L folic acid, 8 g/L myo-inositol, 1.6 g/L nicotinic acid, 0.8 g/L p-aminobenzoic acid, 1.6 g/L pyridoxal hydrochloride, 0.8 g/L riboflavin, 1.6 g/L thiamine hydrochloride, adjusted to pH 10.5 with sodium hydroxide), 0.79 g of Complete Supplement Mixture (Formedium, Norfolk, UK), and 2% (w/v) of either galactose or glucose where specified. The final media was filter- sterilized, and hexanoic acid was added to 0.04% (w/v) for production.

Titers for both screening methods are presented as mg/L values on the basis of the entire volume of broth and overlay (total volume of 35 mL for shake flasks and 0.36 mL for 96 well deep well plates). In some plots, "olivetol equivalents" are depicted, which is the titer of olivetol, plus the titer of olivetolic acid multiplied by the molecular weight of olivetol divided by the molecular weight of olivetolic acid. In some plots, "olivetolic acid equivalents" are depicted, which is the titer of olivetolic acid, plus the titer of olivetol multiplied by molecular weight of olivetolic acid divided by the molecular weight of olivetol.

50 hour shake flask production of olivetolic acid and olivetol from LSC3-2 ("3X"), LSC3-4 ("3X gal80 A ::HIS4"), and LSC3-13 ("3X gal80 A ::HIS4 migl A ") from YP + 2% (w/v) galactose + 0.02% (w/v) hexanoic acid and YP + 2% (w/v) glucose + 0.02% (w/v) hexanoic acid for LSC3-2 were determined.

Example 1J

50 hour 96 well deepwell plate production of olivetolic acid equivalents (total olivetolates) and corresponding measured optical density (600 nm) values from the aqueous phase of the culture for LSC3-2 (pink) and LSC3-4 (blue) cultivated in YP + 2% or 4% (w/v) galactose + varying concentrations of hexanoic acid were determined. A tradeoff between hexanoic acid toxicity and conversion efficiency of hexanoic acid to end product could be observed, with an optimum between 0.08 to 0.1% (w/v) for 50 hour sampling. Lower concentrations of hexanoic acid can be employed to minimize cellular toxicity with earlier sampling points.

Example IK

50 hour 96 well deepwell plate production of olivetolic acid equivalents (total olivetolates) and corresponding measured optical density (600 nm) values from the aqueous phase of the culture for, from left to right in each column, LSC3-13 (pink), LSC3-18 (green), LSC3-2 (blue), and LSC3-4 (purple) cultivated in YNB base medium plus 2% (w/v) galactose ("gal") or 2% (w/v) glucose ("glu"), with or without casamino acids ("a. a") or casamino acids plus trace element and vitamin solutions ("a.a_v.t") + 0.04% (w/v) hexanoic acid. Optimal production levels were observed in YNB medium supplemented with casamino acids and vitamin plus trace element solutions.

Example 1L

Further defined media optimization with alternative defined amino acid compositions and vitamin/trace solutions for LSC3-4 and LSC3-18, with total olivetolate equivalents after 24 hours and optical density (600 nm), were determined. Glucose was added to 2% (w/v), galactose to 0.05% (w/v), and hexanoic acid to 0.04% (w/v). For these fully amino acid prototrophic strains, optimal growth and production were observed in Delft medium base containing CSM supplement and the trace and vitamin solution (T05 and V01) for which the composition is described above.

Example 1M

18 hour and 48 hour 96 well deepwell plate titers of olivetolic acid and olivetol (and byproducts PDAL and HTAL) for strains LSC3-2, LSC3-48 and LSC3-49 (FAA2 knockouts in LSC3- 2), LSC3-63 (PXA1 knockout in LSC3-2), LSC3-74 and LSC3-75 (PEX11 knockouts in LSC3-2), LSC3- 76 and LSC3-77 (ANTI knockouts in LSC3-2), and LSC3-47 (PRB1 knockout in LSC3-2) in YP medium + 2% (w/v) galactose + 0.04% (w/v) hexanoic acid + 20% (v/v) IPM. The sampling time at 18 hours is indicative of productivity/rate of product formation due to hexanoic acid not yet being depleted. The sampling time at 48 hours represents a total conversion of hexanoic acid after hexanoic acid is fully utilized. For LSC3-48, LSC3-74, LSC3-76, and LSC3-77 in particular, both improved 18 and 48 hour titers were observed, indicating more efficient incorporation of hexanoic acid into olivetolic acid and olivetol. LSC3-48 and LSC3-76/77 had the highest overall conversions of hexanoic acid to olivetolic acid and olivetol, therefore these FAA2 and ANTI were selected for further combinatorial knockouts and introduction into galactose independent strains. LSC3-47 had a higher 48 hour titer of olivetolic acid, indicating a potential role in proteolysis of CsOAC. Example IN

24 hour and 48 hour 96 well deepwell plate titers of olivetolic acid and olivetol (and byproducts PDAL and HTAL) for strains LSC3-2, LSC3-50 and LSC3-51 (LSC3-2 his4::loxHIS4 as HIS4 prototrophic controls), LSC3-48 and LSC3-49 (FAA2 knockouts in LSC3-2), LSC3-77 (ANTI knockout in LSC3-2), LSC3-47 (PRB1 knockout in LSC3-2), and LSC3-87 and LSC3-88 (PEP4 knockouts in LSC3-2) in YP medium + 2% (w/v) galactose + 0.04% (w/v) hexanoic acid + 20%

(v/v) IPM. The sampling time at 24 hours is indicative of productivity/rate of product formation due to hexanoic acid not yet being fully depleted. Higher 24 hour productivities were again observed for LSC3-48 and LSC3-77, indicating more efficient incorporation of hexanoic acid into olivetolic acid and olivetol. LSC3-87 and LSC3-88 also had higher 24 hour titers, than LSC3-2 or LSC3-50/51, indicating a higher pathway flux to olivetolic acid and olivetol from the PEP4 knockout. PEP4 and PRB1 are additionally selected for further combinatorial knockouts and introduction into galactose-independent strain.

Example 10

24 and 48 hour total olivetolate titers for strains LSC3-2, LSC3-4, LSC3-46, LSC3-18, and LSC3-64 and LSC3-65 (GAL80, MIG1, GAL1 triple knockout strains) tested in YP medium + 2% (w/v) glucose + different indicated galactose concentrations (0, 0.05, 0.25, or 1.0% (w/v)). LSC3- 64 and LSC3-65 combine the features of LSC3-13 and LSC3-18, with greatly enhanced productivities up to at least 24 hours compared to LSC3-13 in YP + 2% glucose that are more similar to productivities from LSC3-18, as well as reducing the galactose-dependent inhibition of production of LSC3-13 after 48 hours.

Example IP

(Left) 24 and 48 hour total olivetolate titers for strains LSC3-13, LSC3-18, LSC3-89 and LSC3-90 (GPD1 knockouts in LSC3-13), and LSC3-91 and LSC3-92 (GPD1 knockouts in LSC3-18) in YP + 2% (w/v) glucose + 0.04% (w/v) hexanoic acid + 20% (v/v) IPM (left side), or the same but with an additional 0.05% (w/v) galactose (right side). In the presence of 0.05% galactose, LSC3- 89 and LSC3-90 have slightly increased final titers compared to LSC3-13. (Right) glycerol titers after 48 hours indicate greatly reduced glycerol formation in all GPD1 knockout strains. Example IQ

48 hour 96 well deepwell plate production of LSC3-2 in YP + 2% (w/v) galactose + 0.04% (w/v) hexanoic acid + 20% (v/v) of different overlays (IPM, di-tert-butyl malonate, diethyl sebacate, and methyl soyate). Enhanced production was observed with the diethyl sebacate overlay compared to IPM.

Sequences for Examples IB through IF pLYG-001 tcgcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtca cagcttgtctgtaagcggatgccgggagca gacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggctggcttaactatg cggcatcagagcagattgtactgagagtg caeca cgcttttcaattcaattcatcattttttttttattcttttttttgatttcggtttctttg aaatttttttgattcggtaatctccgaacagaa ggaagaacgaaggaaggagcacagacttagattggtatatatacgcatatgtagtgttga agaaacatgaaattgcccagtattcttaa cccaactgcacagaacaaaaacctgcaggaaacgaagataaatcatgtcgaaagctacat ataaggaacgtgctgctactcatcctagt cctgttgctgccaagctatttaatatcatgcacgaaaagcaaacaaacttgtgtgcttca ttggatgttcgtaccaccaaggaattactgg agttagttgaagcattaggtcccaaaatttgtttactaaaaacacatgtggatatcttga ctgatttttccatggagggcacagttaagccg ctaaaggcattatccgccaagtacaattttttactcttcgaagacagaaaatttgctgac attggtaatacagtcaaattgcagtactctgc gggtgtatacagaatagcagaatgggcagacattacgaatgcacacggtgtggtgggccc aggtattgttagcggtttgaagcaggcgg cagaagaagtaacaaaggaacctagaggccttttgatgttagcagaattgtcatgcaagg gctccctatctactggagaatatactaag ggtactgttgacattgcgaagagcgacaaagattttgttatcggctttattgctcaaaga gacatgggtggaagagatgaaggttacgat tggttgattatgacacccggtgtgggtttagatgacaagggagacgcattgggtcaacag tatagaaccgtggatgatgtggtctctaca ggatctgacattattattgttggaagaggactatttgcaaagggaagggatgctaaggta gagggtgaacgttacagaaaagcaggctg ggaagcatatttgagaagatgcggccagcaaaactaaaaaactgtattataagtaaatgc atgtatactaaactcacaaattagagctt caatttaattatatcagttattaccctgcggtgtgaaataccgcacagatgcgtaaggag aaaataccgcatcaggaaattgtaaacgtt aatattttgttaaaattcgcgttaaatttttgttaaatcagctcattttttaaccaatag gccgaaatcggcaaaatcccttataaatcaaaa gaatagaccgagatagggttgagtgttgttccagtttggaacaagagtccactattaaag aacgtggactccaacgtcaaagggcgaaa aaccgtctatcagggcgatggcccactacgtgaaccatcaccctaatcaagttttttggg gtcgaggtgccgtaaagcactaaatcggaa ccctaaagggagcccccgatttagagcttgacggggaaagccggcgaacgtggcgagaaa ggaagggaagaaagcgaaaggagcgg gcgctagggcgctggcaagtgtagcggtcacgctgcgcgtaaccaccacacccgccgcgc ttaatgcgccgctacagggcgcgtcgcgc cattcgccattcaggctgcgcaactgttgggaagggcgatcggtgcgggcctcttcgcta ttacgccagctggcgaaggggggatgtgct gcaaggcgattaagttgggtaacgccagggttttcccagtcacgacgttgtaaaacgacg gccagtgaattgtaatacgactcactatag ggcgaattggagctccaccgcggtggcggccgcataggccactagtggatctgatatcat cgatgaattcgagctcgttttcgacactgg atggcggcgttagtatcgaatcgacagcagtatagcgaccagcattcacatacgattgac gcatgatattactttctgcgcacttaacttc gcatctgggcagatgatgtcgaggcgaaaaaaaatataaatcacgctaacatttgattaa aatagaacaactacaatataaaaaaact atacaaatgacaagttcttgaaaacaagaatctttttattgtcagtactgattattcctt tgccctcggacgagtgctggggcgtcggtttcc actatcggcgagtacttctacacagccatcggtccagacggccgcgcttctgcgggcgat ttgtgtacgcccgacagtcccggctccggat cggacgattgcgtcgcatcgaccctgcgcccaagctgcatcatcgaaattgccgtcaacc aagctctgatagagttggtcaagaccaatg cggagcatatacgcccggagccgcggcgatcctgcaagctccggatgcctccgctcgaag tagcgcgtctgctgctccatacaagccaa ccacggcctccagaagaagatgttggcgacctcgtattgggaatccccgaacatcgcctc gctccagtcaatgaccgctgttatgcggcc attgtccgtcaggacattgttggagccgaaatccgcgtgcacgaggtgccggacttcggg gcagtcctcggcccaaagcatcagctcatc gagagcctgcgcgacggacgcactgacggtgtcgtccatcacagtttgccagtgatacac atggggatcagcaatcgcgcatatgaaat cacgccatgtagtgtattgaccgattccttgcggtccgaatgggccgaacccgctcgtct ggctaagatcggccgcagcgatcgcatccat ggcctccgcgaccggctgcagaacagcgggcagttcggtttcaggcaggtcttgcaacgt gacaccctgtgcacggcgggagatgcaat aggtcaggctctcgctgaattccccaatgtcaagcacttccggaatcgggagcgcggccg atgcaaagtgccgataaacataacgatctt tgtagaaaccatcggcgcagctatttacccgcaggacatatccacgccctcctacatcga agctgaaagcacgagattcttcgccctccg agagctgcatcaggtcggagacgctgtcgaacttttcgatcagaaacttctcgacagacg tcgcggtgagttcaggctttttacccatggt tgtttatgttcggatgtgatgtgagaactgtatcctagcaagattttaaaaggaagtata tgaaagaagaacctcagtggcaaatcctaac cttttatatttctctacaggggcgcggcgtggggacaattcaacgcgtctgtgaggggag cgtttccctgctcgcaggtctgcagcgagga gccgtaatttttgcttcgcgccgtgcggccatcaaaatgtatggatgcaaatgattatac atggggatgtatgggctaaatgtacgggcga cagtcacatcatgcccctgagctgcgcacgtcaagactgtcaaggagggtattctgggcc tccatgtcgctggccgggtgacccggcggg gacaaggcaagctaaacagatctggcgcgccttaattaacccggggatccgtcgacctgc agcgtacgaagcttcagctggcggccgct ctagccagcttttgttccctttagtgagggttaattccgagcttggcgtaatcatggtca tagctgtttcctgtgtgaaattgttatccgctca caattccacacaacataggagccggaagcataaagtgtaaagcctggggtgcctaatgag tgaggtaactcacattaattgcgttgcgct cactgcccgctttccagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaac gcgcggggagaggcggtttgcgtattgggc gctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgagcgg tatcagctcactcaaaggcggtaatacggtta tccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggcc aggaaccgtaaaaaggccgcgttgct ggcgtttttccataggctcggcccccctgacgagcatcacaaaaatcgacgctcaagtca gaggtggcgaaacccgacaggactataaa gataccaggcgttcccccctggaagctccctcgtgcgctctcctgttccgaccctgccgc ttaccggatacctgtccgcctttctcccttcgg gaagcgtggcgctttctcaatgctcacgctgtaggtatctcagttcggtgtaggtcgttc gctccaagctgggctgtgtgcacgaacccccc gttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaaga cacgacttatcgccactggcagcagccactg gtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggc ctaactacggctacactagaaggacagta tttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttga tccggcaaacaaaccaccgctggtagcggtg gtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctt tgatcttttctacggggtctgacgctcagtgg aacgaaaactcacgttaagggattttggtcatgagattatcaaaaaggatcttcacctag atccttttaaattaaaaatgaagttttaaat caatctaaagtatatatgagtaaacttggtctgacagttaccaatgcttaatcagtgagg cacctatctcagcgatctgtctatttcgttcat ccatagttgcctgactgcccgtcgtgtagataactacgatacgggagggcttaccatctg gccccagtgctgcaatgataccgcgagacc cacgctcaccggctccagatttatcagcaataaaccagccagccggaagggccgagcgca gaagtggtcctgcaactttatccgcctcc atccagtctattaattgttgccgggaagctagagtaagtagttcgccagttaatagtttg cgcaacgttgttgccattgctacaggcatcgt ggtgtcacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcg agttacatgatcccccatgttgtgaaaaaaag cggttagctccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcac tcatggttatggcagcactgcataattctctta ctgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattct gagaatagtgtatgcggcgaccgagttgctctt gcccggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatca ttggaaaacgttcttcggggcgaaaactct caaggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaactgat cttcagcatcttttactttcaccagcgtttctg ggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaat gttgaatactcatactcttcctttttca atattattgaagcatttatcagggttattgtctcatgagcggatacatatttgaatgtat ttagaaaaataaacaaataggggttccgcgca catttccccgaaaagtgccacctgggtccttttcatcacgtgctataaaaataattataa tttaaattttttaatataaatatataaattaaa aatagaaagtaaaaaaagaaattaaagaaaaaatagtttttgttttccgaagatgtaaaa gactctagggggatcgccaacaaatacta ccttttatcttgctcttcctgctctcaggtattaatgccgaattgtttcatcttgtctgt gtagaagaccacacacgaaaatcctgtgattttac attttacttatcgttaatcgaatgtatatctatttaatctgcttttcttgtctaataaat atatatgtaaagtacgctttttgttgaaattttttaa acctttgtttatttttttttcttcattccgtaactcttctaccttctttatttactttct aaaatccaaatacaaaacataaaaataaataaacac agagtaaattcccaaattattccatcattaaaagatacgaggcgcgtgtaagttacaggc aagcgatccgtcctaagaaaccattattat catgacattaacctataaaaataggcgtatcacgaggccctttcgtc pLYG-005 agatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaagg gattttggtcatgagattatcaaaaaggatct tcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatatatgagt aaacttggtctgacagttaccaatgcttaatc agtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactgccc gtcgtgtagataactacgatacgggagggctt accatctggccccagtgctgcaatgataccgcgagacccacgctcaccggctccagattt atcagcaataaaccagccagccggaaggg ccgagcgcagaagtggtcctgcaactttatccgcctccatccagtctattaattgttgcc gggaagctagagtaagtagttcgccagttaat agtttgcgcaacgttgttgccattgctacaggcatcgtggtgtcacgctcgtcgtttggt atggcttcattcagctccggttcccaacgatca aggcgagttacatgatcccccatgttgtgaaaaaaagcggttagctccttcggtcctccg atcgttgtcagaagtaagttggccgcagtgt tatcactcatggttatggcagcactgcataattctcttactgtcatgccatccgtaagat gcttttctgtgactggtgagtactcaaccaagt cattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggata ataccgcgccacatagcagaactttaaaag tgctcatcattggaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttga gatccagttcgatgtaacccactcgtgcaccc aactgatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaagg caaaatgccgcaaaaaagggaataagggc gacacggaaatgttgaatactcatactcttcctttttcaatattattgaagcatttatca gggttattgtctcatgagcggatacatatttga atgtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacc tgggtccttttcatcacgtgctataaaaata attataatttaaattttttaatataaatatataaattaaaaatagaaagtaaaaaaagaa attaaagaaaaaatagtttttgttttccgaa gatgtaaaagactctagggggatcgccaacaaatactaccttttatcttgctcttcctgc tctcaggtattaatgccgaattgtttcatcttgt ctgtgtagaagaccacacacgaaaatcctgtgattttacattttacttatcgttaatcga atgtatatctatttaatctgcttttcttgtctaat aaatatatatgtaaagtacgctttttgttgaaattttttaaacctttgtttatttttttt tcttcattccgtaactcttctaccttctttatttacttt ctaaaatccaaatacaaaacataaaaataaataaacacagagtaaattcccaaattattc catcattaaaagatacgaggcgcgtgtaa gttacaggcaagcgatcatccgtcctaagaaaccattattatcatgacattaacctataa aaataggcgtatcacgaggccctttcgtctc gcgcgtttcggtgatgacggtgaaaacctctgacacatgcagctcccggagacggtcaca gcttgtctgtaagcggatgccgggagcag acaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggggctggcttaactatgc ggcatcagagcagattgtactgagagtgc accacgcttttcaattcaattcatcattttttttttattcttttttttgatttcggtttc tttgaaatttttttgattcggtaatctccgaacagaag gaagaacgaaggaaggagcacagacttagattggtatatatacgcatatgtagtgttgaa gaaacatgaaattgcccagtattcttaac ccaactgcacagaacaaaaacctgcaggaaacgaagataaatcgaaacatcatgaaaact gtttcaccctctgtgaagcataaacact agaaagccaatgaagagctctacaagcctcttatgggttcaatgggtctgcaatgaccgc atacgggcttggacaattaccttctattgaa tttctgagaagagatacatctcaccagcaatgtaagcagacaatcccaattctgtaaaca acctctttgtccataattccccatcagaaga gtgaaaaatgccctcaaaatgcatgcgccacacccatctttcaactgcactgcgccacct ctgagggtcttttcaggggtcgactaccccg gacacctcgcagaggagcgaggtcacgtacttttaaaatggcagagacgcgcagtttctt gaagaaaggataaaaatgaaatggtgcg gaaatgcgaaaatgatgaaaaattttcttggtggcgaggaaattgagtgcaataattggc acgaggttgttgccacccgagtgtgagtat atatcctagtttctgcacttttcttcttcttttctttaccttttcttttcaacttttttt tactttttccttcaacagacaaatctaacttatatatca caATGGGTAAGG A A A AG ACT C ACGTTT CGAGGCCGCG ATT A A ATT CCA AC AT G G ATG CT G ATTT AT AT G

GGTATAAATGGGCTCGCGATAATGTCGGGCAATCAGGTGCGACAATCTATCGATTGT ATGGGAAGCCC

GATGCGCCAGAGTTGTTTCTGAAACATGGCAAAGGTAGCGTTGCCAATGATGTTACA GATGAGATGGTC

AGACTAAACTGGCTGACGGAATTTATGCCTCTTCCGACCATCAAGCATTTTATCCGT ACTCCTGATGATGC

ATGGTTACTCACCACTGCGATCCCCGGCAAAACAGCATTCCAGGTATTAGAAGAATA TCCTGATTCAGGT

G A A A AT ATT GTTG ATG CGCTGG CAGTGTTCCTG CG CCG GTTG CATTCG ATT CCT GTTT GT AATT GTCCTTT

TAACAGCGATCGCGTATTTCGTCTCGCTCAGGCGCAATCACGAATGAATAACGGTTT GGTTGATGCGAG

TGATTTTGATGACGAGCGTAATGGCTGGCCTGTTGAACAAGTCTGGAAAGAAATGCA TAAGCTTTTGCC

ATTCTCACCGGATTCAGTCGTCACTCATGGTGATTTCTCACTTGATAACCTTATTTT TGACGAGGGGAAAT

TAATAGGTTGTATTGATGTTGGACGAGTCGGAATCGCAGACCGATACCAGGATCTTG CCATCCTATGGA

ACTGCCT CG GT G AGTTTT CT CCTT CATT AC AG AAACGG CTTTTT CAAAAAT ATGGTATT GAT AAT CCTG AT

ATGAATAAATTGCAGTTTCATTTGATGCTCGATGAGTTTTTCTAAgtgaatttactt taaatcttgcatttaaataaatt ttctttttatagctttatgacttagtttcaatttatatactattttaatgacattttcga ttcattgattgaaagctttgtgttttttcttgatgcgc tattgcattgttcttgtctttttcgccacatgtaatatctgtagtagatacctgatacat tgtggataaaactgtattataagtaaatgcatgt atactaaactcacaaattagagcttcaatttaattatatcagttattaccctgcggtgtg aaataccgcacagatgcgtaaggagaaaata ccgcatcaggaaattgtaaacgttaatattttgttaaaattcgcgttaaatttttgttaa atcagctcattttttaaccaataggccgaaatc ggcaaaatcccttataaatcaaaagaatagaccgagatagggttgagtgttgttccagtt tggaacaagagtccactattaaagaacgtg gactccaacgtcaaagggcgaaaaaccgtctatcagggcgatggcccactacgtgaacca tcaccctaatcaagttttttggggtcgagg tgccgtaaagcactaaatcggaaccctaaagggagcccccgatttagagcttgacgggga aagccggcgaacgtggcgagaaaggaa gggaagaaagcgaaaggagcgggcgctagggcgctggcaagtgtagcggtcacgctgcgc gtaaccaccacacccgccgcgcttaat gcgccgctacagggcgcgtcgcgccattcgccattcaggctgcgcaactgttgggaaggg cgatcggtgcgggcctcttcgctattacgc cagctggcgaaggggggatgtgctgcaaggcgattaagttgggtaacgccagggttttcc cagtcacgacgttgtaaaacgacggccag tgaattgtaatacgactcactatagggcgaattggagctccaccgcggtggcggccgcat aggccactagtggatctgatatcatcgatg aattcgagctcgtttgggcccgctacttagcttctatagttagttaatgcactcacgata ttcaaaattgacacccttcaactactccctact attgtctactactgtctactactcctctttactatagctgctcccaataggctccaccaa taggctctgccaatacattttgcgccgccacctt tcaggttgtgtcactcctgaaggaccatattgggtaatcgtgcaatttctggaagagagt ccgcgagaagtgaggcccccactgtaaatc ctcgagggggcatggagtatggggcatggaggatggaggatggggggggggcgaaaaata ggtagcaaaaggacccgctatcacccc acccggagaactcgttgccgggaagtcatatttcgacactccggggagtctataaaaggc gggttttgtcttttgccagttgatgttgctga aaggacttgtttgccgtttcttccgatttaacagtatagaaatcaaccactgttaattat acacgttatactaacacaacaaaaacaaaaa ca a cga ca a ca a ca a ca a ca AT GT CC AATTT ACT G ACCGTAC ACC AAAATTT G CCT G CATT ACCG GT CG AT G CA

ACGAGTGATGAGGTTCGCAAGAACCTGATGGACATGTTCAGGGATCGCCAGGCGTTT TCTGAGCATACC

TGGAAAATGCTTCTGTCCGTTTGCCGGTCGTGGGCGGCATGGTGCAAGTTGAATAAC CGGAAATGGTTT

CCCGCAGAACCTGAAGATGTTCGCGATTATCTTCTATATCTTCAGGCGCGCGGTCTG GCAGTAAAAACTA

TCCAGCAACATTTGGGCCAGCTAAACATGCTTCATCGTCGGTCCGGGCTGCCACGAC CAAGTGACAGCA

ATGCTGTTTCACTGGTTATGCGGCGGATCCGAAAAGAAAACGTTGATGCCGGTGAAC GTGCAAAACAG

GCTCTAGCGTTCGAACGCACTGATTTCGACCAGGTTCGTTCACTCATGGAAAATAGC GATCGCTGCCAG

GATATACGTAATCTGGCATTTCTGGGGATTGCTTATAACACCCTGTTACGTATAGCC GAAATTGCCAGGA

TCAGGGTTAAAGATATCTCACGTACTGACGGTGGGAGAATGTTAATCCATATTGGCA GAACGAAAACGC

TGGTTAGCACCGCAGGTGTAGAGAAGGCACTTAGCCTGGGGGTAACTAAACTGGTCG AGCGATGGATT

TCCGTCTCTGGTGTAGCTGATGATCCGAATAACTACCTGTTTTGCCGGGTCAGAAAA AATGGTGTTGCCG

CGCCATCTGCCACCAGCCAGCTATCAACTCGCGCCCTGGAAGGGATTTTTGAAGCAA CTCATCGATTGAT

TTACGGCGCTAAGGATGACTCTGGTCAGAGATACCTGGCCTGGTCTGGACACAGTGC CCGTGTCGGAGC

CGCGCGAGATATGGCCCGCGCTGGAGTTTCAATACCGGAGATCATGCAAGCTGGTGG CTGGACCAATG

TAAATATTGTCATGAACTATATCCGTACCCTGGATAGTGAAACAGGGGCAATGGTGC GCCTGCTGGAAG

ATGGCGATTAGtcatgtaattagttatgtcacgcttacattcacgccctccccccac atccgctctaaccgaaaaggaaggagttag acaacctgaagtctaggtccctatttatttttttatagttatgttagtattaagaacgtt atttatatttcaaatttttcttttttttctgtacaga cgcgtgtacgcatgtaacattatactgaaaaccttgcttgagaaggttttgggacgctcg aaggctttaatttgcggccggcgcgccttaat taacccggggatccgtcgacctgcagcgtacgaagcttcagctggcggccgctctagcca gcttttgttccctttagtgagggttaattccg agcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgctcacaatt ccacacaacataggagccggaagcataaagt gtaaagcctggggtgcctaatgagtgaggtaactcacattaattgcgttgcgctcactgc ccgctttccagtcgggaaacctgtcgtgcca gctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttc cgcttcctcgctcactgactcgctgcgctcg gtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccaca gaatcaggggataacgcaggaaagaacat gtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgttttt ccataggctcggcccccctgacgagcatc acaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccagg cgttcccccctggaagctccctcgtgcg ctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaag cgtggcgctttctcaatgctcacgctgtaggtat ctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcag cccgaccgctgcgccttatccggtaactatcg tcttgagtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacag gattagcagagcgaggtatgtaggcggtg ctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtatttggta tctgcgctctgctgaagccagttaccttcgg aaaaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttt tgtttgcaagcagcagattacgcgcagaaa aaaaggatctcaaga pLOA-027 gacgaaagggcctcgtgatacgcctatttttataggttaatgtcatgataataatggttt cttagacgtcaggtggcacttttcggggaaat gtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatg agacaataaccctgataaatgcttcaataatat tgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcg gcattttgccttcctgtttttgctcacccagaaa cgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaac tggatctcaacagcggtaagatccttgag agttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggc gcggtattatcccgtattgacgccgggcaaga gcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcac agaaaagcatcttacggatggcatgacagt aagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttct gacaacgatcggaggaccgaaggagctaa ccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagc tgaatgaagccataccaaacgacgagcgtg acaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactac ttactctagcttcccggcaacaattaatag actggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggct ggtttattgctgataaatctggagccggtga gcgtgggtcCcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgt agttatctacacgacggggagtcaggcaac tatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggta actgtcagaccaagtttactcatatatactt tagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgat aatctcatgaccaaaatcccttaacgtgagtttt cgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatccttttt ttctgcgcgtaatctgctgcttgcaaacaaaa aaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccg aaggtaactggcttcagcagagcgcagatac caaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcac cgcctacatacctcgctctgctaatcctgttac cagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagt taccggataaggcgcagcggtcgggctgaa cggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacc tacagcgtgagctatgagaaagcgccac gcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggaga gcgcacgagggagcttccagggggaa acgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttt tgtgatgctcgtcaggggggcggagcctatgga aaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcaca tgttctttcctgcgttatcccctgattctgtggat aaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgc agcgagtcagtgagcgaggaagcggaag agcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcaggttta aacAGGTGGTAATAATCGCGCGAT

T C AATT GC ATT CATT AAAG ACAG AT AATT CG CAAG ACCTT CT CCCT CCAG AT C AACTT GTAT CAAT GATT C

ACTTGTTCATCAACGATGAAAGGTTTACCTCCGGTATAACGAGTTTTGACATTGATT TTTCTAGAATGAAA

ATGCCATAGAAATTTCTAAATTTAGACTGAATCCCTACGTCACTGGTTTAAAAATTG AGTGGTGCTTACTA

ATT ATT ACATT CG G AAACGTCT CAT CAAGT GTTTCCG AAAAAATG AG GGTTTTT CT AAAGCTT CTTT CTTT

CACG G AT AT CACCG G GTTT AAG AT GT ATTTTTTTTTTCCAC AG AAATT AAAGTT CCAG CGTTT ACCAAAGT

AGATCGTTCAATAATATGGATGGTGTTATAAGAAGACGACCACTATCCCCCATGAAT TCTCACATGATAC

TTTCTTTTACTTTATTTACAGAGGCAGTAACATCCAAGAAGAAtaccgttcgtataa tgtatgctatacgaagttataac cggcgttgccagcgataaacggCCCATCACAATCCTGACAACCAGCAGTTCTTCTAGGCA GTCGAACTGACTCTA

AT AGT G ACT CCGGTAA ATT AGTTA ATT AATTGCT AAACCC AT G CACAGTG ACT C ACGTTTTTTT AT CAGTC

ATT CG AT AT AG A AG GTA AG A A A AG G ATATG ACTATG A AC AGTAGTAT ACTGTGT ATAT A AT AG ATATG G

AACGTT AT ATT C ACCTCCG AT GTGTGTTGT ACAT AC AT AAAAAT AT CAT AGC ACA ACT GCG CT GT GT AAT

AGTAAT ACAAT AGTTTACAAAATTTTTTTT CT G AAT AATGGTTTT GCCG ATT CT ACCGTT AATT GAT GAT CT

GGCCTCATGGAATAGTAAGAAGGAATACGTTTCACTTGTTGGTCAGGTACTTTTGGA TGGCTCGAGCCT

GAGTAATGAAGAGATTCTCCAGTTCTCCAAAGAGGAAGAAGTTCCATTGGTGGCTTT GTCCTTGCCAAG

TGGTAAATTCAGCGATGATGAAATCATTGCCTTCTTGAACAACGGAGTTTCTTCTCT GTTCATTGCTAGCC

AAGATGCTAAAACAGCCGAACACTTGGTTGAACAATTGAATGTACCAAAGGAGCGTG TTGTTGTGGAA

G AG AACGGT GTTTT CTCCAAT CAATT CAT G GT AAAACA AAAATT CT CG CAAG AT AAAATT GT GT CC AT AA AGAAATTAAGCAAGGATATGTTGACCAAAGAAGTGCTTGGTGAAGTACGTACAGACCGTC CTGACGGTT

TATATACCACCCTAGTTGTCGACCAATATGAGCGTTGTCTAGGGTTGGTGTATTCTT CGAAGAAATCTAT

AGCAAAGGCCATCGATTTGGGTCGTGGCGTTTATTATTCTCGTTCTAGGAATGAAAT CTGGATCAAGGG

T G A A ACTT CTG G C A AT G G CCA A A AG CTTTT AC A A AT CTCTACTG ACTGT G ATTCG G AT G CCTT A A AGTTT

ATCGTTGAACAAGAAAACGTTGGATTTTGCCACTTGGAGACCATGTCTTGCTTTGGT GAATTCAAGCATG

GTTT GGTGGGGCTAG A AT CTTT ACT A A A AC A A AG GCTACAGGACGCTCCAGAGG A AT CTT ATACTAG A A

G ACT ATT C A ACG ACTCTG C ATT GTTAG ATG CC A AG ATC A AG G A AG A AG CTG A AG A ACTG ACTG AG G C A A

AGGGTAAGAAGGAG CTTT CTT GGGAGGCTGCCG ATTT GTT CT ACTTT G C ACTG G CCA A ATT AGTGGCCA

ACGATGTTTCATTGAAGGACGTCGAGAATAATCTGAATATGAAGCATCTGAAGGTTA CAAGACGGAAAG

GT G ATGCTAAGCCAAAGTTT GTTGG ACAACCAAAGGCTG AAG AAG AAAAACT G ACCGGT CCAATT CACT

TGGACGTGGTGAAGGCTTCCGACAAAGTTGGTGTGCAGAAGGCTTTGAGCAGACCAA TCCAAAAGACT

T CTG AAATT AT G CATTT AGT CAAT CCG AT CAT CG AAAAT GTT AG AG AC AAAG GT A ACT CT G CCCTTTT G G

AGTAC ACAG AAAAGTTT G ATG GT GT AAAATT AT CC AAT CCT GTT CTT AAT GCT CC ATT CCCAG AAG AAT A

CTTTGAAGGTTTAACCGAGGAAATGAAGGAAGCTTTGGACCTTTCAATTGAAAACGT CCGCAAATTCCAT

G CTG CT CAATT G CC AACAG AG ACT CTT G AAGTT G AAACCCA ACCTGGT GTCTT GTGTT CC AG ATTCCCT C

GTCCTATTGAAAAAGTTGGTTTGTATATCCCTGGTGGCACTGCCATTTTACCAAGTA CTGCATTAATGCTT

GGTGTTCCAGCACAAGTTGCCCAATGTAAGGAGATTGTGTTTGCATCTCCACCAAGA AAATCTGATGGT

AAAGTTTCACCCGAAGTTGTTTATGTCGCAGAAAAAGTTGGCGCTTCCAAGATTGTT CTAGCTGGTGGTG

CCCAAGCCGTTGCTGCTATGGCTTACGGGACAGAAACTATTCCTAAAGTGGATAAGA TCTTGGGTCCAG

GTAAT CAATTT GT G ACTGCCG CC AAAAT GTAT GTT C AAAATG AC ACT C AAG CT CT AT GTTCCATTG AT AT G

CCAGCTGGCCCAAGTGAAGTTTTGGTTATTGCCGATGAAGATGCCGATGTGGATTTT GTTGCAAGTGAT

TTGCTATCG CAAG CTG AAC ACG GT ATT GACTCCC AAGTT AT CCTT GTT G GTGTTAACTTG AG CG AAAAG A

AAATT C AAG AG ATT CAAG AT GCT GTCCAC AAT C AAGCTTT AC AACT GCCACGT GTG G AT ATT GTT CGTAA

AT GT ATT G CT CAC AGT ACG AT CGTT CTTT GT G ACGGTT ACG AAG AAGCCCTT G A AAT GTCCAACCAAT AT

GCACCAGAACATTTGATTCTACAAATCGCCAATGCTAACGATTATGTTAAATTGGTT GACAATGCAGGGT

CCGTATTTGTGGGTGCTTACACTCCAGAATCGTGCGGTGACTATTCAAGTGGTACTA ACCATACATTACC

AACCTATG GTTACG CTAGG CAGTACAGTG GTG CC AACACTGC AACCTTCC AAA AGTTT ATCACTG CCCAA

AACATT ACCCCT G AAGGTTT AG AAAACAT CGGT AG AGCT GTT AT GTGCGTTGCCAAG AAGG AGGGT CT A

GACGGTCACAGAAACGCTGTGAAAATCAGAATGAGTAAGCTTGGGTTGATCCCAAAG GATTTCCAGTA

G ATT ATTT CT A ACTT G G AA ACCG A AC ACTA ACG AAAAT AAT ATGT AT AT ATAC AT AT AT AT AT C A A AC A A

AATACAGTCTTGAATGAATAGAGATACACTATGTAATGAATGGTAACGTAAAAATTG TAATTTTGGATTA

AAAGAGAGGTAGcgcctggcagcagggcgataacctcataacttcgtataatgtatg ctatacgaacggtaTTTGGTGTTGTT

TTCTATTGCATACGAATTAGAATGCCCAGACTTGTTTATATACTACGCTGAATGTTT GTACATTTATACTTA

A A AC A A A AT G CT AGT C AG CC AT ATT A A AC AG AG CCGTTT AG C A AC ATTT CAAT AG C ACCTT CC AC AG ATC

CACCG CT ACGTYT CAAT G CG GC AAT GTT ACG GT CG AAGT CAAAG AAG CCCAT AT CGTTT AATTG ACGTAA

TTGTGTTGCATAGACTTCTTCTGGAGGCCTTGTATCGGAAGCAGAAGGTGCAGTTGA ACCAGTACCAGT

ACTGG CACCACCAA AG AG ATTC ATTAAATTAG G ATTTG CC AAAATG GGATTACCTCCTAAACCTG CTCCA

GGAACACCTGCTCCAAATAGTGACGCAAATGGATTGCTTGGTACAGAGGAACCTGAA GAGTTTGAAGTT

GAATTACGTGGCGAATCCGTGTTTGCAGTGTTAGAGTCAGATGGATTTCCGGGTGAT GGGAAGTgtttaa acctggcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctg aatggcgaatggcgcctgatgcggtatttt ctccttacgcatctgtgcggtatttcacaccgcatatggtgcactctcagtacaatctgc tctgatgccgcatagttaagccagccccgaca cccgccaacacccgctgacgcgccctgacgggcttgtctgctcccggcatccgcttacag acaagctgtgaccgtctccgggagctgcat gtgtcagaggttttca ccgtca tea ccga a a egegega pLOA-058 atgaccatgattacgccactagtccgaggcctcgagatccgatatcgccgtggcggccgc cagctgaagcttaattatcctgggcacgag tgaaacaaagctaaaacctttatttagcatggccattgaatgtaacaattatatatatcg caagcacaaaaaatcaaggagagagaact accactttgttcatgtgtacaatgttcattatctccataagcaaaaaaaaaaaaatagaa aacatatgctataaggttgatattctcacga gtaagcggcacttgctacttattgacattgcagatttttggctacagaaatagtatatta gagattataattgctaatcaaatcaaaatata aaattagtaaaccaaaccatttatacccttccttagtagttatggattgttttttaatga tatttctgcaaaccaaagaaagattgttatcca gatagaatttagttttgatattcatttttttgttgaagattgaacgccatatctgggcct cataattcaaaagacggtgccattatcggtagc gtttcgcattgtactggatttcagaaatttcacagttgatgaatcgaaaagaatggtctc attgcaacacgtaaggttaagatgtcccttttt accattataggcaataaatgaatcataaaacgaccgtatactggtgaaatagtagggaga acgagtacctgtagtaaaaagtataaatc atagttaatcgggcaatgtccctcgatcaaggagtattgtgtcatgttcgagacaaacgc caacatttttgtttcttttggacaaatgttgtt tgcatttatgatccgttatattttgatctaatgtagagttgcacgtagttcttactggca aagaaatcgatgcataccaaaaaagaataaa ggtgatatttgatctttaccgtttagttccaacgtaaaattgtgcctttggacttaaaat ggcgtcgtacgctgcaggtcgacggatccccg ggttaattaaggcgcgccagatctgtttagcttgcctcgtccccgccgggtcactaccgt tcgtataatgtatgctatacgaagttatgaca tggaggcccagaataccctccttgacagtcttgacgtgcgcagctcaggggcatgatgtg actgtcgcccgtacatttagcccatacatcc ccatgtataatcatttgcatccatacattttgatggccgcacggcgcgaagcaaaaatta cggctcctcgctgcagacctgcgagcaggg aaacgctcccctcacagacgcgttgaattgtccccacgccgcgcccctgtagagaaatat aaaaggttaggatttgccactgaggttcttc tttcatatacttccttttaaaatcttgctaggatacagttctcacatcacatccgaacat aaacaaccatgggtaaggaaaagactcacgtt tcgaggccgcgattaaattccaacatggatgctgatttatatgggtataaatgggctcgc gataatgtcgggcaatcaggtgcgacaatct atcgattgtatgggaagcccgatgcgccagagttgtttctgaaacatggcaaaggtagcg ttgccaatgatgttacagatgagatggtca gactaaactggctgacggaatttatgcctcttccgaccatcaagcattttatccgtactc ctgatgatgcatggttactcaccactgcgatc cccggcaaaacagcattccaggtattagaagaatatcctgattcaggtgaaaatattgtt gatgcgctggcagtgttcctgcgccggttgc attcgattcctgtttgtaattgtccttttaacagcgatcgcgtatttcgtctcgctcagg cgcaatcacgaatgaataacggtttggttgatgc gagtgattttgatgacgagcgtaatggctggcctgttgaacaagtctggaaagaaatgca taagcttttgccattctcaccggattcagtc gtcactcatggtgatttctcacttgataaccttatttttgacgaggggaaattaataggt tgtattgatgttggacgagtcggaatcgcaga ccgataccaggatcttgccatcctatggaactgcctcggtgagttttctccttcattaca gaaacggctttttcaaaaatatggtattgataa tcctgatatgaataaattgcagtttcatttgatgctcgatgagtttttctaatcagtact gacaataaaaagattcttgttttcaagaacttgt catttgtatagtttttttatattgtagttgttctattttaatcaaatgttagcgtgattt atattttttttcgcctcgacatcatctgcccagatgc gaagttaagtgcgcagaaagtaatatcatgcgtcaatcgtatgtgaatgctggtcgctat actgataacttcgtataatgtatgctatacg aacggtaaattcctgggggaacaacttcacagaatgttttgtcatattgtcgaagtggtc acaaaacaagagaagttccgccaattataa aaagggaacccgtatatttcagcttcacggatgatttccagggtgagagtactgtatatg ggcttacgatagaaggccataaaaatttctt gcttggcaacaaaatagaagtgaaatcatgtcgaggctgctgtgtgggagaacagcataa aatatcacaaaaaaagaatctaaaacac tgtgttgcttgtcccagaaagggaatcaagtatttttataaagattggagtggtaaaaat cgagtatgtgctagatgctatggaagataca aattcagcggtcatcactgtataaattgcaagtatgtaccagaagcacgtgaagtgaaaa aggcaaaagacaaaggcgaaaaattggg cattacgcccgaaggtttgccagttaaaggaccagagtgtataaaatgtggcggaatctt acagtggcctatgcggccgctctagaacta gtggatcgatccccaattcgccctatagtgagtcgtattacaattcactggccgtcgttt tacaacgtcgtgactgggaaaaccctggcgtt acccaacttaatcgccttgcagcacatccccctttcgccagctggcgtaatagcgaagag gcccgcaccgatcgcccttcccaacagttgc gcagcctgaatggcgaatggcgcctgatgcggtattttctccttacgcatctgtgcggta tttcacaccgcatacgtcaaagcaaccatag tacgcgccctgtagcggcgcattaagcgcggcgggtgtggtggttacgcgcagcgtgacc gctacacttgccagcgccctagcgcccgct cctttcgctttcttcccttcctttctcgccacgttcgccggctttccccgtcaagctcta aatcggggtgggccatcgccctgatagacggtttt tcgccctttgacgttggagtccacgttctttaatagtggactcttgttccaaactggaac aacactcaaccctatctcgggctattcttttgat ttataagggattttgccgatttcggcctattggttaaaaaatgagctgatttaacaaaaa tttaacgcgaattttaacaaaatattaacgtt tacaattttatggtgcactctcagtacaatctgctctgatgccgcatagttaagccagcc ccgacacccgccaacacccgctgacgcgccc tgacgggcttgtctgctcccggcatccgcttacagacaagctgtgaccgtctccgggagc tgcatgtgtcagaggttttcaccgtcatcacc gaaacgcgcgagacgaaagggcctcgtgatacgcctatttttataggttaatgtcatgat aataatggtttcttagacgtcaggtggcact tttcggggaaatgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatg tatccgctcatgagacaataaccctgataaatg cttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttatt cccttttttgcggcattttgccttcctgtttttgc tcacccagaaacgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtggg ttacatcgaactggatctcaacagcggta agatccttgagagttttcgccccgaagaacgttttccaatgatgagcacttttaaagttc tgctatgtggcgcggtattatcccgtattgacg ccgggcaagagcaactcggtcgccgcatacactattctcagaatgacttggttgagtact caccagtcacagaaaagcatcttacggatg gcatgacagtaagagaattatgcagtgctgccataaccatgagtgataacactgcggcca acttacttctgacaacgatcggaggaccg aaggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgg gaaccggagctgaatgaagccataccaaac gacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaact ggcgaactacttactctagcttcccggcaa caattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggccctt ccggctggctggtttattgctgataaatctg gagccggtgagcgtgggtctcgcggtatcattgcagcactggggccagatggtaagccct cccgtatcgtagttatctacacgacgggga gtcaggcaactatggatgaacgaaatagacagatcgctgagataggtgcctcactgatta agcattggtaactgtcagaccaagtttact catatatactttagattgatttaaaacttcatttttaatttaaaaggatctaggtgaaga tcctttttgataatctcatgaccaaaatccctta acgtgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttg agatcctttttttctgcgcgtaatctgctgctt gcaaacaaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaac tctttttccgaaggtaactggcttcagcag agcgcagataccaaatactgtccttctagtgtagccgtagttaggccaccacttcaagaa ctctgtagcaccgcctacatacctcgctctg ctaatcctgttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggac tcaagacgatagttaccggataaggcgcagc ggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccg aactgagatacctacagcgtgagcattg agaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggt cggaacaggagagcgcacgagggagc ttccagggggaaacgcctggtatctttatagtcctgtcgggtttcgccacctctgacttg agcgtcgatttttgtgatgctcgtcaggggggc ggagccta tgga a a a a cgccagca a cgcggccttttta cggttcctggccttttgctggccttttgctca ca tgttctttcctgcgttatcccc tgattctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccg aacgaccgagcgcagcgagtcagtgagcga ggaagcggaagagcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcatta atgcagctggcacgacaggtttcccgact ggaaagcgggcagtgagcgcaacgcaattaatgtgagttagctcactcattaggcacccc aggctttacactttatgcttccgcggctcgt atgttgtgtggaattgtgagcggataacaatttcacacaggaaacagct pLOA-093 gacgaaagggcctcgtgatacgcctatttttataggttaatgtcatgataataatggttt cttagacgtcaggtggcacttttcggggaaat gtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatg agacaataaccctgataaatgcttcaataatat tgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcg gcattttgccttcctgtttttgctcacccagaaa cgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaac tggatctcaacagcggtaagatccttgag agttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggc gcggtattatcccgtattgacgccgggcaaga gcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcac agaaaagcatcttacggatggcatgacagt aagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttct gacaacgatcggaggaccgaaggagctaa ccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagc tgaatgaagccataccaaacgacgagcgtg acaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactac ttactctagcttcccggcaacaattaatag actggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggct ggtttattgctgataaatctggagccggtga gcgtgggtcCcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgt agttatctacacgacggggagtcaggcaac tatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggta actgtcagaccaagtttactcatatatactt tagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgat aatctcatgaccaaaatcccttaacgtgagtttt cgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatccttttt ttctgcgcgtaatctgctgcttgcaaacaaaa aaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccg aaggtaactggcttcagcagagcgcagatac caaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcac cgcctacatacctcgctctgctaatcctgttac cagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagt taccggataaggcgcagcggtcgggctgaa cggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacc tacagcgtgagctatgagaaagcgccac gcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggaga gcgcacgagggagcttccagggggaa acgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttt tgtgatgctcgtcaggggggcggagcctatgga aaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcaca tgttctttcctgcgttatcccctgattctgtggat aaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgc agcgagtcagtgagcgaggaagcggaag agcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcaggttta aacAGGTGGTAATAATCGCGCGAT

T C AATT GC ATT CATT AAAG ACAG AT AATT CG CAAG ACCTT CT CCCT CCAG AT C AACTT GTAT CAAT GATT C

ACTTGTTCATCAACGATGAAAGGTTTACCTCCGGTATAACGAGTTTTGACATTGATT TTTCTAGAATGAAA

ATGCCATAGAAATTTCTAAATTTAGACTGAATCCCTACGTCACTGGTTTAAAAATTG AGTGGTGCTTACTA

ATT ATT ACATT CG G AAACGTCT CAT CAAGT GTTTCCG AAAAAATG AG GGTTTTT CT AAAGCTT CTTT CTTT

CACG G AT AT CACCG G GTTT AAG AT GT ATTTTTTTTTTCCAC AG AAATT AAAGTT CCAG CGTTT ACCAAAGT

AGATCGTTCAATAATATGGATGGTGTTATAAGAAGACGACCACTATCCCCCATGAAT TCTCACATGATAC

TTTCTTTTACTTTATTTACAGAGGCAGTAACATCCAAGAAGAAtaccgttcgtataa tgtatgctatacgaagttataac cggcgttgccagcgataaacggatagcttcaaaatgtttctactccttttttactcttcc agattttctcggactccgcgcatcgccgtacca cttcaaaacacccaagcacagcatactaaatttcccctctttcttcctctagggtgtcgt taattacccgtactaaaggtttggaaaagaaa aaagagaccgcctcgtttctttttcttcgtcgaaaaaggcaataaaaatttttatcacgt ttctttttcttgaaaattttttttttgatttttttct ctttcgatgacctcccattgatatttaagttaataaacggtcttcaatttctcaagtttc agtttcatttttcttgttctattacaactttttttac ttcttgctcattagaaagaaagcatagcaatctaatctaagtttATGGTATTTCCTGTAC TCCCACTTTATGAGTCCACAG

G C A A ACCT CTGTTGTCTGTTGTTG G CC A AG CT CTTT ACAG ATTT A AT G G ATCT AAT ACTG ATG CAAT AGTT

C A A ATTT CCAAGTACACT CCA AACTT G A AT GT GTTT GTCG AATT G G CG GT CAAT G AG AT ATCTG ATG CT A

TT GTTG AACAATT ACT CT CATT AT ACAAC AAT G G AGTTT GTTCGGTTTT AG CA ACTCCCG AACAAG G AAG

TGTTATTCTTGAAAAAATCCCAAATGCAAGAATTACATATAAGGCATCCGAGAACAA ACAATATCAGTCT

ATTGCCTACGTTTTGGGATCATCATTACCGCAGACCATTGATGAAAAGATTACTGCT TTTGTTTATGTTGA

AG ACACGTT AT CCCTT G AG G AATT AC AAAAG CT CGTTAAGTC AGGTT AT ATTCCG ATT GT CAAAT CAG AC

TTATTGACAAATGAGTACGAAGATGTCAAGGGTCAATATCCATTAGTTGATTTTATT ATCCCTAAAATTGT

CACCGATCGTGCAGATGGACTATACACTACTCTGGTTGTGGATTCATCAAATCAATC TTTGGGTCTCGTG

TATTCATCTGTAACTTCTATTTCCGAATCAATTAGAACCGGTACAGGTGTTTATCAA TCCAGGAAACATGG

TTTATGGTATAAGGGCAAAACCTCTGGTGCAACCCAAAAATTAATTTCTTTTGACCT AGACTGCGATTCC

GACTGTTTGAAGGCAATAGTCGAGCAAACTGGATCTGGATTTTGCCACCTGAGTACT AACTCATGTTTTG

GCAATTTTACAGGCTTGAAAGCCCTGGAAGCGACTCTATTTCAACGTAAAACAGATG CACCAGAAGGTT

CGTAC ACT AAACGT CTTTTT G ATG AT G AAT CGTT GTTG AACG CT AAG AT CA AAG AG G AAG CAG AAG AGT

TAGCAGATGCCAAAACCAAGGAAGAAATTGCTTGGGAGGCTGCTGATCTGTTTTATT TTGCATTGGCTA

GATGTGCGAAATATAATGTTACCTTAGCTGATATTGAAAAGAACTTGGACATGAAAG CATTGAAAGTTT

CAAGAAGAAAGGGCGATGCGAAACCTAAGTTTATTGAAAAGAAGCAATCAACAGAGA AGTCGGAAATT

AAT G AAAG ACAT ATT G GCCC AG AT G AC AAG ATTT ATTT G AAT AG AAT CAAT G CTGC AACT GC AT C AAAG

GAAGAAGTTGAAGCGTGCTTGGAAAGACCAATCCAGAAATCGGCAGATATTATGTCA TTGGTTACTCCG

ATTGTTGAGAATGTCAAGGCGAACGGAGATAAGGCTCTATTGGAGTTAACTGCTAAG TTTGACAGGGTT

CAGTT AG ATT C ACCCGTTTT ATTT G CTCCTTACAAG CCTG AT ATG ATG CAAATCTC AG AG AAG CTAAAAA

AGGCGATCGATGTATCATTTGAAAATATCAGGATTTTCCACGAAGCTCAAAATCAAA AGGATATTCTAAC

GGTGGAAACATCGCCAGGAGTTTACTGTTCTAGATTTGCTAGGCCTATCGAGAAGGT TGGTTGTTATATT

CCAGGTGGAACTGCT GTTTT G CC AT C A AC AT CGTTGATGTTATCTGTTCCAG CATT AGTTGCTGGTTGCA

AG GAG ATT AT CTTT G CTT CT CC ACCT G GT AAGG AT G GT AAACT AACT CCAG AGGTT GTTT ATGTAG CAC A

CAAGGTTGGCGCCAAGTGTATTGTTATGGCAGGTGGAGCACAGGCTGTAGCAGCTAT GGCTTATGGTAC

CGAGAGTGTTCCAAAATGTGATAAAATTATGGGTCCGGGTAATCAATTTGTCACTGC TGCTAAGATGTTA

GTCTCTAATGATTCCAATGCATTATGTGCCATTGATATGCCAGCAGGTCCATCTGAA GTATTAGTCATTGC TGATAAGCATGCCGATCCTGATTTTGTTGCCAGCGATTTACTCTCACAAGCTGAACATGG TATCGATTCCC

AG GT C ATT CT ACTG G CTGTTG AC AT G ACTG ACG C AG A AGTTG ATG CC ATT GACGAAGCTGT CC AT AG AC

AAG CTTT AG CG CT ACCG AG AGTCG AC ATT GTT AG AAAAT GT ATT GC ACATT CC ACT AC AATT GT AGT CAA

AACGCTGGATGAGGCATTTGAAATGTCCAACAAATATGCTCCAGAGCATTTGATTTT GCAGATTGAGAA

CGCAGAAGAATGGGTTCCTAAGGTTGACAATGCAGGTTCTGTCTTTGTTGGCGCATT ATCGCCAGAATCT

TGTGGTGATTATTCCTCCGGTACTAACCATACATTACCTACGTATGGTTATGCTAGG ATGTACAGCGGAG

TG A AC AC AG C A ACCTTCC A A A AGTT C ATC ACCTCTC AG GTT GTC AC A AG G G A AG GTTT G A AG A AC AT CG

GTCCTGCAGTTATGGATTTGGCTGAGGTTGAAGGTCTTGATGGCCACCGTAACGCCG TTAGGGTGAGAA

TGGATAAACTTGGTATGCTCCCTGAAGGATACTGAcgcctggcagcagggcgataac ctcataacttcgtataatgtatg ctatacgaacggtaTTTGGTGTTGTTTTCTATTGCATACGAATTAGAATGCCCAGACTTG TTTATATACTACGC

TG AAT GTTT GT ACATTT AT ACTT AAAACAAA AT G CT AGTCAGCCAT ATT AAACAG AGCCGTTT AG CAAC A

TTTCAATAGCACCTTCCACAGATCCACCGCTACGTYTCAATGCGGCAATGTTACGGT CGAAGTCAAAGAA

G CCC AT AT CGTTT AATT G ACGT AATT GT GTT G CATAG ACTTCTTCTG G AG G CCTTGTATCG G AAG CAG AA

G GTG CAGTT G A ACCAGT ACCAGT ACT G G CACCACCA AAG AG ATTC ATTAAATTAG G ATTTGCCAAA ATG

G G ATTACCTCCTAAACCTG CTCC AGG AACACCTG CTCC AAATAGTG ACG CAAATGGATTG CTTGGTAC AG

AGGAACCTGAAGAGTTTGAAGTTGAATTACGTGGCGAATCCGTGTTTGCAGTGTTAG AGTCAGATGGAT

TTCCGGGTGATGGGAAGTgtttaaacctggcgtaatagcgaagaggcccgcaccgat cgcccttcccaacagttgcgcagcctg aatggcgaatggcgcctgatgcggtattttctccttacgcatctgtgcggtatttcacac cgcatatggtgcactctcagtacaatctgctct gatgccgcatagttaagccagccccgacacccgccaacacccgctgacgcgccctgacgg gcttgtctgctcccggcatccgcttacaga caagctgtgaccgtctccgggagctgcatgtgtcagaggttttcaccgtcatcaccgaaa cgcgcga pLOA-094 gacgaaagggcctcgtgatacgcctatttttataggttaatgtcatgataataatggttt cttagacgtcaggtggcacttttcggggaaat gtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccgctcatg agacaataaccctgataaatgcttcaataatat tgaaaaaggaagagtatgagtattcaacatttccgtgtcgcccttattcccttttttgcg gcattttgccttcctgtttttgctcacccagaaa cgctggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaac tggatctcaacagcggtaagatccttgag agttttcgccccgaagaacgttttccaatgatgagcacttttaaagttctgctatgtggc gcggtattatcccgtattgacgccgggcaaga gcaactcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcac agaaaagcatcttacggatggcatgacagt aagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttct gacaacgatcggaggaccgaaggagctaa ccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagc tgaatgaagccataccaaacgacgagcgtg acaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactac ttactctagcttcccggcaacaattaatag actggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggct ggtttattgctgataaatctggagccggtga gcgtgggtcCcgcggtatcattgcagcactggggccagatggtaagccctcccgtatcgt agttatctacacgacggggagtcaggcaac tatggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggta actgtcagaccaagtttactcatatatactt tagattgatttaaaacttcatttttaatttaaaaggatctaggtgaagatcctttttgat aatctcatgaccaaaatcccttaacgtgagtttt cgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatccttttt ttctgcgcgtaatctgctgcttgcaaacaaaa aaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactctttttccg aaggtaactggcttcagcagagcgcagatac caaatactgttcttctagtgtagccgtagttaggccaccacttcaagaactctgtagcac cgcctacatacctcgctctgctaatcctgttac cagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacgatagt taccggataaggcgcagcggtcgggctgaa cggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacc tacagcgtgagctatgagaaagcgccac gcttcccgaagggagaaaggcggacaggtatccggtaagcggcagggtcggaacaggaga gcgcacgagggagcttccagggggaa acgcctggtatctttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttt tgtgatgctcgtcaggggggcggagcctatgga aaaacgccagcaacgcggcctttttacggttcctggccttttgctggccttttgctcaca tgttctttcctgcgttatcccctgattctgtggat aaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaacgaccgagcgc agcgagtcagtgagcgaggaagcggaag agcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcaggttta aacAGGTGGTAATAATCGCGCGAT

T C AATT GC ATT CATT AAAG ACAG AT AATT CG CAAG ACCTT CT CCCT CCAG AT C AACTT GTAT CAAT GATT C

ACTTGTTCATCAACGATGAAAGGTTTACCTCCGGTATAACGAGTTTTGACATTGATT TTTCTAGAATGAAA

ATGCCATAGAAATTTCTAAATTTAGACTGAATCCCTACGTCACTGGTTTAAAAATTG AGTGGTGCTTACTA

ATT ATT ACATT CG G AAACGTCT CAT CAAGT GTTTCCG AAAAAATG AG GGTTTTT CT AAAGCTT CTTT CTTT

CACG G AT AT CACCG G GTTT AAG AT GT ATTTTTTTTTTCCAC AG AAATT AAAGTT CCAG CGTTT ACCAAAGT

AGATCGTTCAATAATATGGATGGTGTTATAAGAAGACGACCACTATCCCCCATGAAT TCTCACATGATAC

TTTCTTTTACTTTATTTACAGAGGCAGTAACATCCAAGAAGAAtaccgttcgtataa tgtatgctatacgaagttataac cggcgttgccagcgataaacggagcttgccttgtccccgccgggtcacccggccagcgac atggaggcccagaataccctccttgacagt cttgacgtgcgcagctcaggggcatgatgtgactgtcgcccgtacatttagcccatacat ccccatgtataatcatttgcatccatacatttt gatggccgcacggcgcgaagcaaaaattacggctcctcgctgcagacctgcgagcaggga aacgctcccctcacagacgcgttgaattg tccccacgccgcgcccctgtagagaaatataaaaggttaggatttgccactttttaaaat cttgctaggatacagttctcacatcacatccg aacataaacaaccatgggtaaaaagcctgaactcaccgcgacgtctgtcgagaagtttct gatcgaaaagttcgacagcgtctccgacc tgatgcagctctcggagggcgaagaatctcgtgctttcagcttcgatgtaggagggcgtg gatatgtcctgcgggtaaatagctgcgccg atggtttctacaaagatcgttatgtttatcggcactttgcatcggccgcgctcccgattc cggaagtgcttgacattggggaattcagcgag agcctgacctattgcatctcccgccgtgcacagggtgtcacgttgcaagacctgcctgaa accgaactgcccgctgttctgcagccggtcg cggaggccatggatgcgatcgctgcggccgatcttagccagacgagcgggttcggcccat tcggaccgcaaggaatcggtcaatacact acatggcgtgatttcatatgcgcgattgctgatccccatgtgtatcactggcaaactgtg atggacgacaccgtcagtgcgtccgtcgcgc aggctctcgatgagctgatgctttgggccgaggactgccccgaagtccggcacctcgtgc acgcggatttcggctccaacaatgtcctgac ggacaatggccgcataacagcggtcattgactggagcgaggcgatgttcggggattccca atacgaggtcgccaacatcttcttctggag gccgtggttggcttgtatggagcagcagacgcgctacttcgagcggaggcatccggagct tgcaggatcgccgcggctccgggcgtatat gctccgcattggtcttgaccaactctatcagagcttggttgacggcaatttcgatgatgc agcttgggcgcagggtcgatgcgacgcaatc gtccgatccggagccgggactgtcgggcgtacacaaatcgcccgcagaagcgcggccgtc tggaccgatggctgtgtagaagtactcgc cgatagtggaaaccgacgccccagcactcgtccgagggcaaaggaataatcagtactgac aataaaaagattcttgttttcaagaactt gtcatttgtatagtttttttatattgtagttgttctattttaatcaaatgttagcgtgat ttatattttttttcgcctcgacatcatctgcccagat gcgaagttaagtgcgcagaaagtaatatcatgcgtcaatcgtatgtgaatgctggtcgct atactgctgtcgattcgatactaacgccgcc atccagtgtcgacgcctggcagcagggcgataacctcataacttcgtataatgtatgcta tacgaacggtaTTTGGTGTTGTTTTCT

ATTGCATACGAATTAGAATGCCCAGACTTGTTTATATACTACGCTGAATGTTTGTAC ATTTATACTTAAAA

CAAAATG CTAGTCAG CC ATATTAAAC AG AG CCGTTTAG CAAC ATTTC AATAG CACCTTCCAC AG AT CC AC

CG CT ACGTYT CAAT G CG GC AAT GTT ACG GT CG AAGT C AAAG A AGCCCAT ATCGTTT AATT G ACGT AATT G

TGTTGCATAGACTTCTTCTGGAGGCCTTGTATCGGAAGCAGAAGGTGCAGTTGAACC AGTACCAGTACT

GGCACCACCAAAGAGATTCATTAAATTAGGATTTGCCAAAATGGGATTACCTCCTAA ACCTGCTCCAGGA

ACACCTGCTCCAAATAGTGACGCAAATGGATTGCTTGGTACAGAGGAACCTGAAGAG TTTGAAGTTGAA

TTACGTGGCGAATCCGTGTTTGCAGTGTTAGAGTCAGATGGATTTCCGGGTGATGGG AAGTgtttaaacctg gcgtaatagcgaagaggcccgcaccgatcgcccttcccaacagttgcgcagcctgaatgg cgaatggcgcctgatgcggtattttctcctt acgcatctgtgcggtatttcacaccgcatatggtgcactctcagtacaatctgctctgat gccgcatagttaagccagccccgacacccgc caacacccgctgacgcgccctgacgggcttgtctgctcccggcatccgcttacagacaag ctgtgaccgtctccgggagctgcatgtgtca gaggttttca ccgtca tea ccga a a egegega

Fermentation Examples Example 2A

This example demonstrates producing a surprisingly high and commercially relevant yield of a combined amount of olivetol and olivetolic acid of about 4.5 g/liter over 4-5 days.

• Strain: LSC3-4

• Genotype: gal80 A ::pScTEFl>PkHIS4<tScGAL80

• Parent strain: LSC3-2

• Genotype of parent strain: o pGallO-CsAAEl-tCycl-pGall-OST2AOAC-tCycl::leu2-3, pGa 110-CsAAE 1-tCycl-pGa I l-OST2AOAC-tCycl : : u ra 3-52, pGallO-CsAAEl-tCycl-pGall-OST2AOAC-tCycl::trpl, pGallO-HMGK2R-tADHl-pGall-IDIl-tCycl-KanMX::YORWA22

Fermentation Process Summary:

Seed Train:

A shake flask containing 50 mL YPD with 20 g/L glucose was inoculated with freshly streaked LSC3-4. Strain grew at 30 C for 24 hours to an OD600 of 8. 40 mL of this culture was used to inoculate the fermentation tank.

Media:

Seed Media: YP with 20 g/L glucose Batch media:

• lx YP + 55g/L glucose + 500mg/L histidine+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+0.6 mg/L biotin+12 mg/L p-aminobenzoic acid+0.15 g/L EDTA+7.8 mg/L CUS0 4 -5H 2 0+0.0512 g/L FeS0 4 -7H20+0.0032 g/L MnCI 2 +4.77 mg/L Na2MoO 4 +0.102 g/L ZnS0 4 -7H 2 0+0.0086 g/L CoCI2-6H 2 O+0.0384 g/L CaCI 2 -2H 2 0+5.5 g/L KH 2 P04+2.9 g/L MgS0 4 -7H20+45.1 g/L (NH 4 ) 2 S0 4

Growth media:

• 600 g/L glucose+500 mg/L histidine Production Media: • 650 g/L glucose+10 g/L hexanoic acid Base (for pH Control)

• 5M NH40H Galactose Addition

• 4 g galactose was added to tank at 24, 48 and 120 hours, respectively.

Overlay

• 100 mL isopropyl myristate (25% V/V0) was added to tank at 24 hours. Additionally, 10 mL (2.5% V/Vo) isopropyl myristate was added to tank at 48 and 120 hours, respectively.

Antifoam

• Struktol SB2121 (0.1 mL/L) at the beginning of the run

• Struktol SB509 (0.5 mL/day)

Fermentation Run Condition:

Pulse feeding was used for both growth phase and production phase during the run. Fermentation batch was inoculated with 40 mL of inoculum. Feeding was triggered at the end of batch phase when batch glucose was completely exhausted and p02 was increased by 20% (or more). Feed media (growth or production media) was delivered in pulses. Each pulse delivered 2 g glucose /starting batch volume with maximum feed rate not exceeding 20 g glucose/L/hr. pH was maintained at 6 throughout the run. Temperature of fermenter was maintained at 30 C. Airflow rate was maintained at 1.25 L/min. Agitation was 800 rpm. Summary of Metrices:

• Maximum Olivetolic acid titer: 3.48 g/L

• Maximum Olivetol: 0.9 g/L

• Total product titer: 4.36 g/L • Titer/time at 119 hours: 0.86 g/L/day

• Cumulative yield of olivetolic acid /HA consumed: 0.49 mol/mol

• Cumulative yield of olivetol/HA consumed: 0.15 mol/mol

Example 2B

In this example, a different strain was used compared to that used in example 2A, and no galactose was added in this run. Surprisingly, despite the absence of galactose, the combined amount of olivetol and olivetolic acid obtained was about 3.5 g/liter over a period of 4-5 days.

• Strain: LSC3-13

• Genotype: migl A ::HygR

• Parent strain: LSC3-5 (sister clone of LSC3-4)

• Genotype of parent strain: See example 2A

Fermentation Process Summary:

Seed Train:

A shake flask containing 50 mL YPD with 20 g/L glucose was inoculated with freshly streaked LSC3-13. Strain grew at 30 C for 24 hours to an OD600 of 8. 40 mL of this culture was used to inoculate the fermentation tank.

Media:

Seed Media: YP with 20 g/L glucose Batch media:

• lx YP + 55g/L glucose + 500mg/L histidine+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+0.6 mg/L biotin+12 mg/L p-aminobenzoic acid+0.15 g/L EDTA+7.8 mg/L CUS0 4 -5H 2 0+0.0512 g/L FeS0 4 -7H 2 0+0.0032 g/L MnCI 2 +4.77 mg/L Na 2 MoO 4 +0.102 g/L ZnS0 4 -7H 2 0+0.0086 g/L CoCI 2 -6H 2 O+0.0384 g/L CaCI 2 -2H 2 0+5.5 g/L KH 2 P0 4 +2.9 g/L MgS0 4 -7H 2 0+45.1 g/L (NH 4 ) 2 S0 4

Growth media:

• 600 g/L glucose+500 mg/L histidine Production Media: • 650 g/L glucose+10 g/L hexanoic acid Base (for pH Control)

• 5M NH40H Overlay:

• 100 mL isopropyl myristate (25% V/V0) was added to tank at 24 hours. Additionally, 10 mL (2.5% V/Vo) isopropyl myristate was added to tank at 48 and 120 hours, respectively.

Antifoam

• Struktol SB2121 (0.1 mL/L) at the beginning of the run

• Struktol SB509 (0.5 mL/day)

Fermentation Run Condition:

Pulse feeding was used for both growth phase and production phase during the run. Fermentation batch was inoculated with 40 mL of inoculum. Feeding was triggered at the end of batch phase when batch glucose was completely exhausted and p02 was increased by 20% (or more). Feed media (growth or production media) was delivered in pulses. Each pulse delivered 2 g glucose /starting batch volume with maximum feed rate not exceeding 20 g glucose/L/hr. pH was maintained at 6 throughout the run. Temperature of fermenter was maintained at 30 C. Airflow rate was maintained at 1.25 L/min. Agitation was 800 rpm. Summary of Metrices:

• Maximum Olivetolic acid titer: 2.92 g/L

• Maximum Olivetol: 0.65 g/L

• Total product titer: 3.53 g/L

• Titer/time at 119 hours: 0.71 g/L/day

• Cumulative yield of olivetolic acid /HA consumed: 0.49 mol/mol Cumulative yield of olivetol/HA consumed: 0.10 mol/mol

Example 2C

This example demonstrates a high yielding fermentation of O and OA.

• Strain: LSC3-13

• Genotype: migl A ::HygR

• Parent strain: LSC3-5 (sister clone of LSC3-4)

• Genotype of parent strain: See example 2A

Fermentation Process Summary:

Seed Train:

A shake flask containing 50 mL YPD with 20 g/L glucose was inoculated with freshly streaked LSC3-13. Strain grew at 30 C for 24 hours to an OD600 of 4. 40 mL of this culture was used to inoculate the fermentation tank.

Media:

Seed Media: YP with 20 g/L glucose

Batch media:

• lx YP + 55g/L glucose + 500mg/L histidine+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+0.6 mg/L biotin+12 mg/L p-aminobenzoic acid+0.15 g/L EDTA+7.8 mg/L CuS04-5H20+0.0512 g/L FeS04-7H20+0.0032 g/L MnCI2+4.77 mg/L Na2MoO4+0.102 g/L ZnS04-7H20+0.0086 g/L CoCI2-6H2O+0.0384 g/L CaCI2-2H20+5.5 g/L KH2P04+2.9 g/L MgS04-7H20+45.1 g/L (NH4)2S04

Production Media:

• 650 g/L glucose +438 mg/L citric acid monohydrate+2 mg/L H3B03+1.3 mg/L CuS04- 5H20+22.4 mg/L FeCI3-6H20+1.33 mg/L MnCI2+0.8 mg/L Na2MoO4+10.8 mg/L ZnS04- 7H20+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+12 mg/L biotin+12 mg/L p-aminobenzoic acid+12 mg/L folic acid+12mg/L riboflavin+2.5 g/L KH2P04+1 g/L MgSO4-7H2O+20 g/L (NH4)2S04+17.8 g/L sodium hexanoate

• Base (for pH Control): 5M NH40H Overlay: • 100 mL isopropyl myristate (25% V/VO) was added to tank at 24 hours. Additionally, 10 mL (2.5% V/Vo) isopropyl myristate was added to tank at 96, 120 and 144 hours, respectively.

Antifoam

• Struktol SB2121 (0.1 mL/L) at the beginning of the run

• Struktol SB509 (0.5 mL/day)

Fermentation run condition:

We used pulse feeding for both growth phase and production phase during the run. Fermentation batch was inoculated with 40 mL of inoculum. Feeding was triggered at the end of batch phase when batch glucose was completely exhausted and p02 was increased by 20% (or more). Feed media (growth or production media) was delivered in pulses. Each pulse delivered 2 g glucose /starting batch volume with maximum feed rate not exceeding 40 g glucose/L/hr. pH was maintained at 6 throughout the run. Temperature of fermenter was maintained at 30C. Air flow rate was maintained at 1.25 L/min. Agitation was 800 rpm. This fermentation works at pH range of 5-6. It is contemplated that in some embodiments, the fermentation is carried out at about pH 5.0. It is contemplated that in some embodiments, a pulse rate of about 1.7 g/L/pulse, with maximum feed rate of about 10 g/L/hr is employed. It is contemplated that in some embodiments, a pulse rate of about 1.7 g/L/pulse is employed. It is contemplated that in some embodiments a maximum feed rate of about 10 g/L/hr is employed.

Summary of metrics:

• Maximum Olivetolic acid titer: 6.07 g/L

• Maximum Olivetol: 2.03 g/L

• Total product titer: 8 g/L (see Figure 4A)

• Titer/time at 119 hours: 1.5 g/L/day (see Figure 4B)

• Cumulative yield of olivetolic acid /HA consumed: 0.93 g/g

• Cumulative yield of olivetol/HA consumed: 0.27 g/g Example 2D

• Strain: LSC3-134

Genotype: gal80 A ::(loxHIS4)/ his4 A / migl A ::(loxPkHIS4)

• Parent strain: LSC300002

• Genotype of parent strain: leu2 A ::ScLEU2<pScLEU2/tScCYCl>CsAEEKpScGAL10/pScGALl& gt;CsTkS-T2A-CsOAC<tScCY

Cl/pAG3G5-baac.kbone/leu2(defective)_ura3 A ::pScURA3>ScURA3/tScCYCl>CsAEEl<p5cG

AL10/pScGALl>CsTkS-T2A-CsOAC<tScCYCl/pAG306-backbon e/ura3{defective)_trpl A ::p

ScTRPl>ScTRPl/tScCYCl>CsAEEKpScGAL10/pScGALl>CsT KS-T2A-CsOAC<tScCYCl/pAG

304-backbone/trpl(defective)_yorWdelta22 A :tScADHl>HMGK2R<pScGAL10/pScGALl>l

DIKtScCYCl/KanMX

Fermentation Process Summary:

Seed Train:

A shake flask containing 50 mL YPD (10 g/L yeast extract, 20 g/L peptones and 20 g/L glucose) was inoculated with freshly streaked LSC3-134. Strain grew at 30 C for 24 hours to an OD600 of 4. 17 mL of this culture was used to inoculate the fermentation tank (3.5% of initial tank volume).

Media:

Seed Media: YPD (10 g/L yeast extract† 20 g/L peptones† 20 g/L glucose)

Batch media:

• 10 g/L yeast extract† 20 g/L peptones† 20 g/L glucose † 500mg/L histidine†12 mg/L myo-inositol†12 mg/L thiamin hydrochloride†12 mg/L pyridoxal hydrochloride†12 mg/L nicotinic acid†12 mg/L calcium pantothenate†0.6 mg/L biotin†12 mg/L p-aminobenzoic acid†0.15 g/L EDTA†7.8 mg/L CuS04-5H20†0.0512 g/L FeS04-7H20†0.0032 g/L MnCI2+4.77 mg/L Na2MoO4+0.102 g/L ZnS04-7H20+0.0086 g/L CoCI2-6H2O+0.0384 g/L CaCI2-2H20+5.5 g/L KH2P04+2.9 g/L MgS04-7H20+45.1 g/L (NH4)2S04 Production Media:

• 650 g/L glucose +438 mg/L citric acid monohydrate+2 mg/L H3B03+1.3 mg/L CuS04- 5H20+22.4 mg/L FeCI3-6H20+1.33 mg/L MnCI2+0.8 mg/L Na2MoO4+10.8 mg/LZnS04- 7H20+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+12 mg/L biotin+12 mg/L p-aminobenzoic acid+12 mg/L folic acid+12mg/L riboflavin+2.5 g/L KH2P04+1 g/L MgSO4-7H2O+20 g/L (NH4)2S04+36 g/L sodium hexanoate

• Base (for pH Control): 5M NH40H

Overlay:

• 182 mL isopropyl myristate (IPM, 40% V/VO) was added to tank at 24 hours. Stir rate was reduced to 500 rpm before addition of IPM at 24 hours and was increased to 800 rpm at around 48 hours. This step was performed to eliminate the risk of foaming after IPM addition. 30% to 50% positive p02 was maintained between 24 and 48 hours runs time. 1.6% )V/V) antifoam was added to IPM before addition to tank.

• Antifoam: Struktol SB2121 (0.1 mL/L) at the beginning of the run

Fermentation run condition:

We used pulse feeding for both growth phase and production phase during the run. Fermentation batch was inoculated with 17 mL of inoculum. Feeding was triggered at the end of batch phase when batch glucose was completely exhausted and p02 was increased by 10% (or more). Feed media (production media) was delivered in pulses. Each pulse delivered 1.7 g glucose /starting batch volume with maximum feed rate not exceeding 10 g glucose/L/hr. pH was maintained at 5.5 throughout the run. Temperature of fermenter was maintained at 30C. Air flow rate was maintained at 1.25 L/min. Agitation was 800 rpm. Median oxygen uptake rate (OUR) was 60-80 mmoles/L/hr. A maximum OUR of 100-110 moles/L/hr was achieved during the process.

Summary of metrics:

• Maximum Olivetolic acid titer: 6.95±0.22

• Maximum Olivetol: 2.68±0.24

• Titer/time at 96 hours: 2.2 g/L/day (OA+O)

• Cumulative yield of olivetolic acid /Hexanoic Acid consumed:1.4 g/g

• Cumulative yield of olivetol/Hexanoic Acid consumed:0.56 g/g

Effect of pH on process metrics

We found that optimum pH for our process is 5.5 (+/-) 0.3.

Effect of temperature on process metrics

We found that optimum temperature for our process is 30(+/-) 2

Optimum time to add IPM: We found that the optimum time to add IPM is between 12 and 36 hours post inoculation.

Effect of sodium hexanoate/glucose ratio in feed: In a series of experiments, we tested sensitivity of metrics (titer and productivity) to the ratio of sodium hexanoate to glucose in feed. We found that the maximum olivetol equivalent titer was achieved when sodium hexanoate/glucose in feed ratio was in the range of 20 to 28 g sodium hexanoate/ 500 g glucose. Maximum productivity was achieved at a range of 23 to 28 g sodium hexanoate/500 g glucose in feed.

Effect of Oxygen transfer rate on metrics: We found that the optimum median OTR for our process is 60-80 mmoles/L/hr and a maximum OUR of 100-110 mmoles/L/hr is achieved in our process.

Pulse metric parameters: We found that the optimum pulse parameters for our process was 1.7 g glucose/L initial tank volume/pulse with a maximum feed rate of 10 g/L of initial tank volume/hr. Effect of overlay: Isopropyl myristate is used as an overlay in our process. The optimum IPM loading for our process at pH 5.5 is 26% of total tank volume or 40% of initial tank volume.

There was a clear negative effect when no IPM was used.

Note: Percentages of IPM reported here in this figure are based on total tank volume.

Effect of batch glucose concentration: We found that the optimum batch glucose concentration for our process was 10-20 g/L.

Seed train condition: We found that the optimum seed train condition was to inoculate an initial flask containing YPD (10 g/L yeast extract, 20 g/L peptones and 20 g/L glucose) with 1 mL seed vial. All subsequent seed tanks would be inoculated with 2% inoculum and will run as batch tanks with pH control (pH set at 5.5). The production tank will be inoculated with 3.5% inoculum from the last seed train stage.

• Batch media composition for seed tanks: lx YP + 55g/L glucose + 500mg/L histidine+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+0.6 mg/L biotin+12 mg/L p- aminobenzoic acid+0.15 g/L EDTA+7.8 mg/L CuS04-5H20+0.0512 g/L FeS04- 7H20+0.0032 g/L MnCI2+4.77 mg/L Na2MoO4+0.102 g/L ZnS04-7H20+0.0086 g/L CoCI2-6H2O+0.0384 g/L CaCI2-2H20+5.5 g/L KH2P04+2.9 g/L MgS04-7H20+45.1 g/L (NH4)2S04

• Batch media composition for production tanks: lx YP + 17-20g/L glucose + 500mg/L histidine+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+0.6 mg/L biotin+12 mg/L p-aminobenzoic acid+0.15 g/L EDTA+7.8 mg/L CuS04-5H20+0.0512 g/L FeS04-7H20+0.0032 g/L MnCI2+4.77 mg/L Na2MoO4+0.102 g/L ZnS04-7H20+0.0086 g/L CoCI2-6H2O+0.0384 g/L CaCI2-2H20+5.5 g/L KH2P04+2.9 g/L MgS04-7H20+45.1 g/L (NH4)2S04

• Production Media (for production tank): 650 g/L glucose +438 mg/L citric acid monohydrate+2 mg/L H3B03+1.3 mg/L CuS04-5H20+22.4 mg/L FeCI3-6H20+1.33 mg/L MnCI2+0.8 mg/L Na2MoO4+10.8 mg/L ZnS04-7H20+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+12 mg/L biotin+12 mg/L p-aminobenzoic acid+12 mg/L folic acid+12mg/L riboflavin+2.5 g/L KH2P04+1 g/L MgSO4-7H2O+20 g/L (NH4)2S04+36 g/L sodium hexanoate

• Base (for pH Control): 5-10 M NH40H

Example 3: Divarinic acid/Divarin Production in LSC3-2 and derived strains

Precultures of LSC3-2 were grown in 50 mL tubes containing 10 mL of YP glucose overnight. These were used to inoculate 50 mL tubes containing 10 mL of YP + 2% (w/v) galactose + 2 mM (176 mg/L) butyric acid (BA) + 20% (v/v) isopropyl myristate (IPM) and were grown for 48 hours with 180 rpm shaking at 30 C. D/DA present in IPM layer only, is tabulated below based on standard curves run for DA and D, and titers are based on the whole volume of broth plus overlay (12 mL total).

Additionally, the same experiment was conducted with 50 mL of YP + 2% (w/v) galactose + 2 mM BA + 20% (v/v) IPM in 250 mL shake flasks. D/DA was quantified in both the IPM and aqueous layers in two biological replicates, and summed between the phases: In another experiment, the same growth conditions were employed with 50 mL of YP + 2% (w/v) galactose + 2 mM BA with no overlay in 250 mL shake flasks. D/DA was quantified in the aqueous culture broth:

In a final shake flask/tube experiment, the same growth conditions were employed with 10 mL of YP + 2% (w/v) galactose + 4 mM BA + 20% (v/v) IPM in 50 mL Falcon tubes. D/DA was quantified in both the IPM and aqueous layers, and summed between phases:

The shake flask/tube experiments were scaled down to 96 well deepwell plate format. Precultures from colonies of each strain were grown in 300 pL YP + 2% (w/v) glucose. Main cultures containing 300 pL YP + 2% (w/v) galactose + 0.02-0.08% (w/v) butyric acid + 20% (v/v) IPM (60 pL) or 20% (v/v) diethyl sebacate (60 pL) for strain LSC3-2, or the same but with 2% (w/v) glucose instead of galactose for strains LSC3-4, LSC3-13, and LSC3-18, were grown for 48 hours and the IPM or diethyl sebacate overlay was sampled at 24 and 48 hours following acidification of the media with 10 pL of 5 M phosphoric acid. Three replicates of each strain/media condition were tested and averages and standard deviations for detected divarinic acid, divarin, and total divarin equivalents are reported in the table below for 24 hour and 48 hour sampling:

24 hour sampling:

8 hour sampling:

Example 3A: Divarinic acid/divarin production in LSC3-2 and derived strains in defined media with varying pH

One step in optimizing divarinic acid/divarin production was to identify the optimum pH for production and to investigate titers in alternative media. pH was adjusted using a defined medium with buffers added that were pre-adjusted to different pH values. A defined medium, Delft CSM medium, consisted of (per liter solution) 7.5 g ammonium sulfate, 14.4 g potassium phosphate monobasic, 0.5 g magnesium sulfate heptahydrate (with these first three components prepared as an 0.9X solution and adjusted to pH 6.5 with sodium hydroxide prior to autoclaving), 3.6 mL of a trace metal solution (consisting of 130 g/L citric acid monohydrate, 0.574 g/L copper (II) sulfate pentahydrate, 8.07 g/L iron (III) chloride hexahydrate, 0.5 g/L boric acid, 0.333 g/L manganese (II) chloride, 0.2 g/L sodium molybdate, and 4.67 g/L zinc sulfate heptahydrate), 1.0 mL of a vitamin solution (0.008 g/L biotin, 1.6 g/L calcium pantothenate, 0.008 g/L folic acid, 8 g/L myo-inositol, 1.6 g/L nicotinic acid, 0.8 g/L p-aminobenzoic acid, 1.6 g/L pyridoxal hydrochloride, 0.8 g/L riboflavin, 1.6 g/L thiamine hydrochloride, adjusted to pH

10.5 with sodium hydroxide), 0.79 g of Complete Supplement Mixture (Formedium, Norfolk, UK), and 2% (w/v) of either galactose or glucose where specified. The final media was filter- sterilized, and butyric acid was added to 0.04% (w/v) for production. Media with different pHs were prepared according to the same recipe, only with 3.75 g/L ammonium sulfate, 7.2 g/L potassium phosphate monobasic, and 0.25 g magnesium sulfate heptahydrate, adjusted to pH

6.5 and autoclaved (the pH the next day was measured to be 6.63), with other components the same as above, plus 300 mM of 2-(N-morpholino)ethanesulfonic acid (MES) from a 1 M stock adjusted to pH 5.0, 5.75, 6.0, 6.25, or 6.5. The final pH of each media formulation with different pH MES buffers added are shown in the table below:

LSC3-2, LSC3-4, LSC3-13, and LSC3-18 were then tested in 96 well deepwell plates as described in the last experiment in Example 2, with precultures containing 2% (w/v) glucose as carbon source, and production cultures containing 2% (w/v) galactose for LSC3-2, 2% (w/v) glucose for LSC3-4, LSC3-13, and LSC3-18, 20% (v/v) IPM overlay, plus 0.04% (w/v) butyric acid as substrate. After 48 hours, production cultures were acidified with 10 pL of 5 M phosphoric acid, and the IPM layer was sampled. Divarinic acid and divarin were quantified by HPLC and detected whole broth + overlay titers, averaged across 3 biological replicates. Up to 59.6 mg/L divarinic acid plus 19.0 mg/L divarin were produced by strain LSC3-13 in full-strength Delft CSM medium plus 2% (w/v) glucose. In reduced salt strength buffered media, addition of MES pH 5.0 (leading to a final medium pH of 5.69) appeared optimal for most strains, with less of a pH dependence in production observed for LSC3-13. When normalized to optical density (600 nm), a measure of cell density, it is clear that the medium with MES pH 5.0 also resulted in the highest yield of divarinic acid/divarin (expressed as "divarin equivalents", which are equal to the titer of divarin plus the titer of divarinic acid multiplied by the ratio of the molecular weight of divarin to divarinic acid) per unit of biomass. LSC3-13 exhibited the highest yield per unit of biomass of the 4 tested strains.

Example 3B

This example provides a process of producing divarin and/or divarinic acid .

• Strain: LSC3-4

• Genotype: gal80 A ::pScTEFl>PkHIS4<tScGAL80

• Parent strain: LSC3-2

• Genotype of parent strain: o pGallO-CsAAEl-tCycl-pGall-OST2AOAC-tCycl::leu2-3, pGa 110-CsAAE 1-tCycl-pGa I l-OST2AOAC-tCycl : : u ra 3-52, pGallO-CsAAEl-tCycl-pGall-OST2AOAC-tCycl::trpl, pGallO-HMGK2R-tADHl-pGall-IDIl-tCycl-KanMX::YORWA22

Fermentation Process Summary:

Seed Train:

A shake flask containing 50 mL YPD with 20 g/L glucose is inoculated with freshly streaked LSC3- 4. Strain grows at 30 C for 24 hours to an OD600 of 8. 40 mL of this culture is used to inoculate the fermentation tank.

Media:

Seed Media: YP with 20 g/L glucose Batch media:

• lx YP + 55g/L glucose + 500mg/L histidine+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+0.6 mg/L biotin+12 mg/L p-aminobenzoic acid+0.15 g/L EDTA+7.8 mg/L CUS0 4 -5H 2 0+0.0512 g/L FeS0 4 -7H20+0.0032 g/L MnCI 2 +4.77 mg/L Na2MoO 4 +0.102 g/L ZnS0 4 -7H 2 0+0.0086 g/L CoCI2-6H 2 O+0.0384 g/L CaCI 2 -2H 2 0+5.5 g/L KH 2 P04+2.9 g/L MgS0 4 -7H20+45.1 g/L (NH 4 ) 2 S0 4

Growth media:

• 600 g/L glucose+500 mg/L histidine Production Media:

• 650 g/L glucose+10 g/L hexanoic acid Base (for pH Control)

• 5M NH40H Galactose Addition

• 4 g galactose was added to tank at 24, 48 and 120 hours, respectively. Overlay

• 100 mL isopropyl myristate (25% V/VO) is added to tank at 24 hours. Additionally, 10 mL (2.5% V/Vo) isopropyl myristate is added to tank at 48 and 120 hours, respectively.

Antifoam

• Struktol SB2121 (0.1 mL/L) at the beginning of the run

• Struktol SB509 (0.5 mL/day)

Fermentation Run Condition:

Pulse feeding is used for both growth phase and production phase during the run. Fermentation batch is inoculated with 40 mL of inoculum. Feeding is triggered at the end of batch phase when batch glucose is completely exhausted and p02 is increased by 20% (or more). Feed media (growth or production media) is delivered in pulses. Each pulse delivered 2 g glucose /starting batch volume with maximum feed rate not exceeding 20 g glucose/L/hr. pH is maintained at 6 throughout the run. Temperature of fermenter is maintained at 30C. Air flow rate is maintained at 1.25 L/min. Agitation is 800 rpm.

Example 3C

In this example, a different strain is used compared to that used in example 3A, and no galactose is added in this run.

• Strain: LSC3-13

• Genotype: migl A ::HygR

• Parent strain: LSC3-5 (sister clone of LSC3-4)

• Genotype of parent strain: See example 3A

Fermentation Process Summary: Seed Train:

A shake flask containing 50 mL YPD with 20 g/L glucose is inoculated with freshly streaked LSC3- 13. Strain grew at 30 C for 24 hours to an OD600 of 8. 40 mL of this culture is used to inoculate the fermentation tank.

Media:

Seed Media: YP with 20 g/L glucose Batch media:

• lx YP + 55g/L glucose + 500mg/L histidine+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+0.6 mg/L biotin+12 mg/L p-aminobenzoic acid+0.15 g/L EDTA+7.8 mg/L CUS0 4 -5H 2 0+0.0512 g/L FeS0 4 -7H 2 0+0.0032 g/L MnCI 2 +4.77 mg/L Na 2 MoO 4 +0.102 g/L ZnS0 4 -7H 2 0+0.0086 g/L CoCI 2 -6H 2 O+0.0384 g/L CaCI 2 -2H 2 0+5.5 g/L KH 2 P0 4 +2.9 g/L MgS0 4 -7H 2 0+45.1 g/L (NH 4 ) 2 S0 4

Growth media:

• 600 g/L glucose+500 mg/L histidine Production Media:

• 650 g/L glucose+10 g/L hexanoic acid Base (for pH Control)

• 5M NH40H Overlay:

• 100 mL isopropyl myristate (25% V/V0) is added to tank at 24 hours. Additionally, 10 mL (2.5% V/Vo) isopropyl myristate is added to tank at 48 and 120 hours, respectively.

Antifoam

• Struktol SB2121 (0.1 mL/L) at the beginning of the run

Struktol SB509 (0.5 mL/day) Fermentation Run Condition:

Pulse feeding is used for both growth phase and production phase during the run. Fermentation batch is inoculated with 40 mL of inoculum. Feeding is triggered at the end of batch phase when batch glucose is completely exhausted and p02 is increased by 20% (or more). Feed media (growth or production media) is delivered in pulses. Each pulse delivered 2 g glucose /starting batch volume with maximum feed rate not exceeding 20 g glucose/L/hr. pH is maintained at 6 throughout the run. Temperature of fermenter is maintained at 30 C. Air flow rate is maintained at 1.25 L/min. Agitation is 800 rpm.

In a fermentation experiment run substantially as Example 3B, and employing strain LSC3-134A or LSC3-134 as disclosed here, surprisingly, a titer of 2 g/L of divarin equivalent was obtained.

Example 4A: Growth of 5. cerevisiae strains in overlay/underlay candidates ln-situ liquid-liquid extraction (biphasic fermentation) is a strategy that can be employed in accordance with the present invention for physical separation of product from cells via partitioning into a second liquid phase from an aqueous culture phase. The second or organic liquid phase is present as either an overlay if its density is less than that of the aqueous phase, or underlay if its density is greater than that of the aqueous phase. Certain properties of the overlay or underlay are considered for production of olivetolic acid/olivetol and other resorcinols such as formulas IA and IB: (1) non-toxic or low toxicity for growth of the host strain, (2) a favorable partition coefficient of the product in the organic phase vs. the aqueous phase, and (3) preferably a lower partition coefficient for fed hexanoic acid (for olivetolic acid/olivetol) or other fatty acid such as RCO2H (for other resorcinols) in the organic phase vs. the aqueous phase. Additional properties of the organic phase enhance its suitability for downstream conversion, e.g. and without limitation, to cannabigerol and other cannabinoid compounds, including suitability as a solvent or co-solvent during downstream prenylation or other reactions, and boiling point if downstream separation by distillation is employed.

To test non-toxic organic overlays/underlays, different non-production background and production strains of S. cerevisiae (Table 1) were inoculated from colonies on YPD agar plates and grown for approximately 24 hours in 96 well deepwell plates containing 300 mί. YP + 2% (w/v) glucose at 30°C with 950 rpm shaking (3 mm throw) and maintained at 85% humidity. These cultures were then used to inoculate 96 well deepwell plates containing 300 mί. YP + 2% (w/v) galactose (YP gal) or glucose (YP glu), with or without addition of 0.04% (w/v) hexanoic acid (HA) plus 20% (v/v) (60 mί) of different overlays/underlays or no organic phase, under the same conditions described for precultures. At three different times (specified in plots), biological triplicate cultures were sampled and optical density at 600 nm (OD600) was measured as a proxy for biomass growth on a SpectraMax plate reader, with dilution in water to allow measurements to be within the linear range of instrumental readings. Averaged OD600s across at least three biological replicates of each strain/overlay condition at each sampling time. Multiple sets of measurements were performed on large groups of overlay and underlay candidates in different batches.

Table 1: Strain IDs and description/genotype features. Double colons (::) indicate replacement of the indicated locus to the left of the colons with the integration cassette to the right of the colons. PkHIS4 is the HIS4 gene from Pichia kudriavzevii under control of a TEF1 promoter from S. cerevisiae. HygR is a hygromycin resistance cassette. Defective genes that generate auxotrophies are indicated in parentheses and strains that have none listed are fully prototrophic.

The performance of various classes of organic phase compounds are provided herein. Among the diesters tested, certain were toxic to growth under the test conditions. Diethyl esters were toxic under the test conditions with the exception of modest growth by most strains in the presence of diethyl sebacate and diethyl diethylmalonate (with glucose only, with galactose strains appeared to exhibit substantial lag). For malonate diesters, under the test conditions, di-tert-butyl malonate supported growth of all strains with glucose addition, again appearing toxic or to induce substantial lag with galactose addition.

Increasing the dialkyl ester chain length from diethyl to diisopropyl to dibutyl in a dialkyl adipate series reduced toxicity. Some growth was observed with diisopropyl adipate and no apparent toxicity in dibutyl adipate. Dibutyl sebacate was also completely non-inhibitory to growth and accordingly, non-toxic. In certain embodiments, the minimum non-toxic internal alkyl chain length of diethyl diesters is sebacate. In certain embodiments, shorter internal alkyl chain length down to adipate is possible with diisopropyl diesters.

For monoester compounds, under the test conditions, octyl acetate was toxic and for the hexanoate series, growth was only observed starting with hexyl hexanoate, which was moderately non-toxic. Isopropyl octanoate was moderately inhibitory but allowed for some growth. For the decanoate series, methyl decanoate was moderately inhibitory to growth but still allowed for growth. Texanol, a monoester alcohol (2,2,4-trimethyl-l,3-pentanediol monoisobutyrate), was inhibitory to growth under the conditions tested.

However, ethyl decanoate and higher alkyl chains were increasingly non-toxic. Both ethyl and butyl laurate were non-toxic, as well as methyl and ethyl myristate. In certain embodiments, growth-suitable monoester overlays for resorcinol or cannabinoid production include hexyl hexanoate or any higher chain length alkyl hexanoate ester, C 3 chain-length or higher alkyl octanoate esters, and methyl (Ci) or higher alkyl decanoates, laurates, or myristates.

In various embodiments, esters and diesters are employed as the organic phase in accordance with the present invention.

Fatty alcohols are mostly solids above C 10 saturated chain length. Decanol is a liquid however it was toxic to growth. However, oleyl alcohol supports robust growth. In certain embodiments, longer chain length (C 12 or higher) unsaturated fatty alcohols can be suitable overlays supporting S. cerevisiae or another fermenting organism's growth. In various embodiments, fatty alcohols, preferably C 12 or higher alcohols, are employed as the organic phase in accordance with the present invention. In certain embodiments, alkanes and paraffins support robust growth. Lack of toxicity was observed for dodecane, tetradecane, hexadecane, light and heavy paraffin oils, and isopar M. In certain embodiments, all C12 and higher paraffins are suitable overlays supporting S. cerevisiae or another fermenting organism's growth. In various embodiments, fatty alcohols, preferably C12 or higher alcohols, are employed as the organic phase in accordance with the present invention.

Certain triacylglycerols were tested, including tricaprylin, coconut oil, and canola oil (vegetable oils having different average chain length compositions of fatty acid chains, with coconut oil being predominantly C12-C14 saturated fatty acids, and canola being predominantly Ci 6 -Ci 8 and a mixture of saturated and unsaturated fatty acids). Tricaprylin, a synthetic oil containing three Cs fatty acid chains, was fairly toxic, however allowed some growth of all strains in YP + 2% glucose. In certain embodiments, coconut and canola oil were non-toxic to growth.

Mixtures of IPM and isopar M with different diesters - dibasic esters (DBE), diethyl sebacate, and di-tert-butyl malonate were explored to investigate if lower percentage mixtures of these compounds in non-toxic IPM or isopar M would mitigate their toxicity toward growth of S. cerevisiae, as they may also advantageously alter partitioning properties of olivetolic acid, olivetol, and other analogues into the overlay and could offer advantages with alternative downstream separations processes. DBE, which is highly toxic by itself as an underlay, was much less toxic at concentrations of between 1 and 2.5% (v/v) in IPM and especially isopar M. Di-tert-butyl malonate also exhibited much lower toxicity at 1-10% (v/v), and particularly 1- 2.5% (v/v), in IPM and isopar M. In certain embodiments, mixtures of longer chain monoesters or paraffins with moderately to very toxic diesters are useful according to the present invention.

Example 4B: Olivetolic acid and olivetol production with organic solvent overlays

Precultures of LSC3-2 and LSC3-18 were inoculated from YPD streak plates and grown in 30 mL of YP + 2% (w/v) glucose in baffled 250 mL shake flasks for approximately 18 hours overnight at 30°C with 200 rpm shaking. Baffled 250 mL shake flasks containing 30 mL of YP + 2% (w/v) galactose, 5 mL of IPM or diethyl sebacate, and 0.04% (w/v) hexanoic acid, were inoculated with 1 mL of preculture and grown for at 30°C with 200 rpm shaking. After 24 and 48 hours, samples of cell culture broth plus overlay were sampled into microcentrifuge tubes and stored at -20°C at least overnight. Sample tubes were thawed and aqueous sample and overlay sample were pipetted into plates for HPLC analysis. Overlay samples were diluted 1:1 v/v with methanol in the HPLC plate prior to analysis.

Total olivetol equivalents are defined as the concentration of olivetol in mg/L, plus the concentration of olivetolic acid in mg/L multiplied by the ratio of the molecular weight of olivetol to olivetolic acid. Lower total olivetol equivalents were observed with diethyl sebacate overlayer as compared to IPM. With IPM overlay, OA partitioned between the IPM and aqueous phases, with a substantial amount of OA remaining in the aqueous phase in these culturing conditions. By contrast, OA entirely partitioned into diethyl sebacate with none present in the aqueous phase. The reduction in total production levels with a diethyl sebacate overlay may occur due to a reduction in Oϋeoo due to moderately growth inhibitory properties of diethyl sebacate.

In another experiment, LSC3-2 precultures were inoculated from YPD streak plates and grown in 300 mί YP + 2% (w/v) glucose in round-bottom square well 96 well deepwell plates for approximately 18 hours overnight at 30°C with 950 rpm shaking in an Infors plate shaker. 96 well deepwell plate wells containing 300 mί of YP + 2% (w/v) galactose, 60 mί of IPM, diethyl sebacate, di-tert-butyl malonate, or methyl soyate, and 0.04% (w/v) hexanoic acid, were inoculated with 10 mί of preculture and grown for 30°C with 950 rpm shaking. After 48 hours, cultures were acidified with 10 mί of 5 M phosphoric acid to enhance partitioning of olivetolic acid into the organic phase (as the free acid), and overlays were sampled on a Bravo automated liquid handling platform (Agilent) by first adding 120 mί of IPM, mixing on a shaking platform for several minutes, centrifuging the plate at 3000 rpm for 5 minutes to separate phases, and pipetting 100 mί of overlay from each well into an HPLC plate. Overlay samples were diluted 1:1 v/v with methanol in the HPLC plate, sealed and analyzed by HPLC.

Under these conditions, higher production levels were observed with a diethyl sebacate overlayer as compared with IPM. No product was observed in the overlayer (aqueous samples were not measured) with a di-tert-butyl malonate overlayer. Substantial production was observed in methyl soyate, however at slightly lower levels than with IPM.

In another experiment, several monoester overlay candidates and one diester candidate were compared to IPM. LSC3-13 precultures were inoculated from YPD streak plates and grown in 300 mί Delft medium + 0.79 g/L complete supplement mixture (CSM) (ForMedium, Norfolk, UK) + 2% (w/v) glucose in round-bottom square well 96 well deepwell plates for approximately 18 hours overnight at 30°C with 950 rpm shaking in an Infors plate shaker. 96 well deepwell plate wells containing 300 mί of Delft medium + CSM + 2% (w/v) glucose, 60 mί of IPM, diethyl sebacate, di-tert-butyl malonate, or methyl soyate, and 0.04% (w/v) hexanoic acid, were inoculated with 10 mί of preculture and grown for 30°C with 950 rpm shaking. Delft medium contains (per liter solution) 7.5 g ammonium sulfate, 14.4 g potassium phosphate monobasic (added from a 1 M stock solution adjusted to pH 6.5 with sodium hydroxide), 0.5 g magnesium sulfate heptahydrate, 3.6 mL of a trace metal solution (consisting of 130 g/L citric acid monohydrate, 0.574 g/L copper (II) sulfate pentahydrate, 8.07 g/L iron (III) chloride hexahydrate, 0.5 g/L boric acid, 0.333 g/L manganese (II) chloride, 0.2 g/L sodium molybdate, and 4.67 g/L zinc sulfate heptahydrate), and 1.0 mL of a vitamin solution (0.008 g/L biotin, 1.6 g/L calcium pantothenate, 0.008 g/L folic acid, 8 g/L myo-inositol, 1.6 g/L nicotinic acid, 0.8 g/L p-aminobenzoic acid, 1.6 g/L pyridoxal hydrochloride, 0.8 g/L riboflavin, 1.6 g/L thiamine hydrochloride, adjusted to pH 10.5 with sodium hydroxide). After 48 hours, the aqueous layer and overlay were sampled on a Bravo automated liquid handling platform (Agilent) by first removing 200 mί of aqueous sample into a 96 well filter plate, adding 180 mί of IPM to each well, mixing on a shaking platform for several minutes, centrifuging the plate at 3000 rpm for 5 minutes to separate phases, and pipetting 100 mί of overlay from each well into an HPLC plate. Overlay samples were diluted 1:1 v/v with methanol in the HPLC plate, sealed and analyzed by HPLC. Aqueous samples were centrifuged in the 96 well filter plate at 3000 rpm for 5 minutes into an HPLC plate, sealed, and analyzed by HPLC.

Titers were calculated in each phase on the basis of the volume of the full broth plus overlay, thus concentrations reported correspond to actual concentrations in the full liquid volume of each production well. Ethyl myristate exhibited approximately equal aqueous phase concentrations of olivetolic acid and olivetol product as IPM, but with slightly higher overlay concentrations. Other monoesters also supported robust production slightly lower than that of IPM, including methyl decanoate and hexyl hexanoate. The results demonstrate that monoester overlays that are not inhibitory to growth support robust production of olivetolic acid and olivetol.

Example 4C: Divarinic acid and divarin production with organic solvent overlays

Several monoester and one diester overlay candidate were compared to IPM for production of divarinic acid and divarin. LSC3-13 precultures were inoculated from YPD streak plates and grown in 300 mί. Delft medium + 0.79 g/L complete supplement mixture (CSM) (ForMedium, Norfolk, UK) + 2% (w/v) glucose in round-bottom square well 96 well deepwell plates for approximately 18 hours overnight at 30°C with 950 rpm shaking in an Infors plate shaker. 96 well deepwell plate wells containing 300 mί. of Delft medium + CSM + 2% (w/v) glucose, 60 mί. of different overlay solvents, and 0.08% (w/v) butyric acid, were inoculated with 10 mί. of preculture and grown for 30°C with 950 rpm shaking. After 48 hours, the aqueous layer and overlay were sampled on a Bravo automated liquid handling platform (Agilent) and samples from the aqueous and organic overlay phases were subjected to HPLC analysis to measure divarinic acid and divarin production.

Multiple overlays support production of divarinic acid and divarin. Under the test conditions, divarinic acid and divarin partition less effectively into monoester overlays than olivetolic acid and olivetol. IPM allowed for higher production levels than other tested monesters without branched chain substituents.

Example 5: Fermentation of glucose to produce Divarin and Divarinic Acid

• Strain: LSC3-134

• Genotype: gal80 A ::(loxHIS4)/his4 A /migl A ::(loxPkHIS4)

• Parent strain: LSC300002

• Genotype of parent strain: leu2 A ::ScLEU2<pScLEU2/tScCYCl>CsAEEKpScGAL10/pScGALl& gt;CsTKS-T2A-CsOAC<tScCY

Cl/pAG305-backbone/leu2(defective)_ura3 A ::pScURA3>ScURA3/tScCYCl>CsAEEKpScG AL10/pScGALl>CsTKS-T2A-CsOAC<tScCYCl/pAG306-backbone/u ra3(defective)_trpl A ::p

ScTRPl>ScTRPl/tScCYCl>CsAEEKpScGAL10/pScGALl>CsT KS-T2A-CsOAC<tScCYCl/pAG

304-backbone/trpl(defective)_yorWdelta22 A :tScADHl>HMGK2R<pScGAL10/pScGALl>l

DIKtScCYCl/KanMX

Fermentation Process Summary:

Seed Train:

A shake flask containing 50 mL YPD with 20 g/L glucose was inoculated with a seed vial containing LSC3-134. Strain grew at 30 C for 24 hours to an OD600 of 3-4. 15 mL of this culture was used to inoculate the fermentation tank.

Media:

Seed Media: YP with 20 g/L glucose

Batch media:

• 10 g/L yeast extract+20g/L peptones + 20g/L glucose + 500mg/L histidine+12 mg/L myo- inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+0.6 mg/L biotin+12 mg/L p-aminobenzoic acid+0.15 g/L EDTA+7.8 mg/L CuS04-5H20+0.0512 g/L FeS04-7H20+0.0032 g/L MnCI2+4.77 mg/L Na2MoO4+0.102 g/L ZnS04-7H20+0.0086 g/L CoCI2-6H2O+0.0384 g/L CaCI2-2H20+5.5 g/L KH2P04+2.9 g/L MgS04-7H20+45.1 g/L (NH4)2S04

Production Media:

• 650 g/L glucose +438 mg/L citric acid monohydrate+2 mg/L H3B03+1.3 mg/L CuS04- 5H20+22.4 mg/L FeCI3-6H20+1.33 mg/L MnCI2+0.8 mg/L Na2MoO4+10.8 mg/L ZnS04- 7H20+12 mg/L myo-inositol+12 mg/L thiamin hydrochloride+12 mg/L pyridoxal hydrochloride+12 mg/L nicotinic acid+12 mg/L calcium pantothenate+12 mg/L biotin+12 mg/L p-aminobenzoic acid+12 mg/L folic acid+12mg/L riboflavin+2.5 g/L KH2P04+1 g/L MgSO4-7H2O+20 g/L (NH4)2SO4+20 g/L sodium butyrate

Base (for pH Control) • 5M NH40H

Overlay

• 170 mL isopropyl myristate (40% V/VO) was added to tank at 24 hours.

• Antifoam Struktol SB2121 (0.1 mL/L) at the beginning of the run

Fermentation run condition:

We used pulse feeding during the run. Fermentation batch was inoculated with 15 mL of inoculum. Feeding was triggered at the end of batch phase when batch glucose was completely exhausted and p02 was increased by 10% (or more). Feed media (growth or production media) was delivered in pulses. Each pulse delivered 1.7 g glucose /starting batch volume with maximum feed rate not exceeding 10 g glucose/L/hr. pH was maintained at 5.5 throughout the run. Temperature of fermenter was maintained at 30 C. Air flow rate was maintained at 1.25 L/min. Agitation was 800 rpm.

Summary of metrics:

• Total product titer: 5 g/L (D+DA; See Figure 5A)

• Titer/time: 0.74 g Divarin equivalent/L/day (see Figure 5B)

• Maximum Divarinic acid titer: 2.5 g/L

• Maximum Divarin: 2.5 g/L

Also evaluated was the effect of sodium butyrate concentration in feed in a set of three runs (10, 14.3 and 20 g/L sodium butyrate in feed, all other nutrients remained the same as stated above) and we observed that titer and productivity increased as we increased sodium butyrate concentration in feed. The highest titer (and productivity) was observed in the run with 20g/L sodium butyrate in feed. lOg/L sodium butyrate, while producing an appreciable amount of the products, demonstrated the lowest titer.