Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD FOR THE MICROBIAL PRODUCTION OF 8-METHYL NONANOIC ACID
Document Type and Number:
WIPO Patent Application WO/2018/119343
Kind Code:
A1
Abstract:
The present disclosure relates to the production of 8-methyl nonanoic acid and medium chain branched fatty acids, e.g., via microbial fermentation.

Inventors:
CHEN HUI (US)
YU XIAODAN (US)
Application Number:
PCT/US2017/068085
Publication Date:
June 28, 2018
Filing Date:
December 22, 2017
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
CONAGEN INC (US)
International Classes:
C12N9/04; C07C59/01; C11D3/20; C12N9/02; C12N9/10; C12N9/16; C12N9/88; C12N15/11; C12N15/52; C12N15/70
Foreign References:
US20130029387A12013-01-31
US20160138061A12016-05-19
US9371549B22016-06-21
US20160168603A12016-06-16
US20130005003A12013-01-03
US20030145350A12003-07-31
US20150011729W2015-01-16
US5094782A1992-03-10
US4493848A1985-01-15
US20140063952W2014-11-04
Other References:
MAZOUREK, M ET AL.: "A Dynamic Interface for Capsaicinoid Systems Biology", BIOINFORMATICS, vol. 150, no. 4, 1 August 2009 (2009-08-01), pages 1806 - 1821, XP055016385, ISSN: 0032-0889, DOI: 10.1104/pp.109.136549
QIN, C ET AL.: "Whole-genome sequencing of cultivated and wild peppers provides insights into Capsicum domestication and specialization", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE USA., vol. 111, no. 14, 8 April 2014 (2014-04-08), pages 5135 - 5140, XP055446995, ISSN: 0027-8424, DOI: 10.1073/pnas.1400975111
ANONYMOUS: "S.O.C. Medium - Catalog No. 15544-034", INVITROGEN , 2002, pages 1, XP009515419
KOZIK, A ET AL.: "CLPY5434.b1_C15.ab1 CLP(XYZ) lettuce perennis Lactuca perennis cDNA clone CLPY5434, mRNA sequence", 6 October 2006 (2006-10-06), XP055612260, Retrieved from the Internet
SIMBAQUEBA, J ET AL.: "Development and Characterization of Microsatellite Markers for the Cape Gooseberry Physalis peruvian a", PLOS ONE, vol. 6, no. 10, e26719, 21 October 2011 (2011-10-21), pages 2, XP055612263, DOI: 10.1371/journal.pone.0026719
ASHRAFI, H ET AL.: "De novo assembly of the pepper transcriptome (Capsicum annuum): a benchmark for in silico discovery of SNPs, SSRs and candidate genes", BMC GENOMICS, vol. 13, no. 571, 30 October 2012 (2012-10-30), pages 1 - 15, XP021134724, ISSN: 1471-2164, DOI: 10.1186/1471-2164-13-571
ZHANG ET AL., METABOLIC ENGINEERING, vol. 13, 2011, pages 713 - 722
STEWART C ET AL., J. EXP. BOT., vol. 58, 2007, pages 979 - 991
ALURU M. ET AL.: "Differential expression of fatty acid synthase genes, Acl, Fat and Kas, in Capsicum fruit", J EXPBOT., vol. 54, no. 388, 2003, pages 1655 - 64
AZA-GONZALEZ C.: "Molecular biology of capsaicinoid biosynthesis in chili pepper (Capsicum spp.)", PLANT CELL REP, vol. 30, 2011, pages 695 - 706, XP055336530, DOI: 10.1007/s00299-010-0968-8
BELZA A, JESSEN AB: "Bioactive food stimulants of sympathetic activity: effect on 24-h energy expenditure and fat oxidation", EMT J CLIN NUTR., vol. 59, 2005, pages 733 - 41, XP037766329, DOI: 10.1038/sj.ejcn.1602121
BENTLEY G.J. ET AL.: "Engineering Escherichia coli to produce branchedchain fatty acids in high percentages", METAB ENG, vol. 38, 2016, pages 148 - 58, XP055557153, DOI: 10.1016/j.ymben.2016.07.003
YAO: "Supercritical carbon dioxide extraction of Scotch Bonnet (Capsicum annuum) and quantification of capsaicin and dihydrocapsaicin", J. AGR. FOOD CHEM., vol. 42, 1994, pages 1303 - 05, XP000453336, DOI: 10.1021/jf00042a010
ZHANG: "Efficient free fatty acid production in Escherichia coli using plant acyl-ACP thioesterases", METABOLIC. ENG., vol. 13, pages 713 - 722
See also references of EP 3559219A4
Attorney, Agent or Firm:
MCMAHON, Amy J. et al. (US)
Download PDF:
Claims:
CLAIMS

What is claimed is:

1. A biosynthetic method of making the capsaicinoid 8-methyl nonanoic acid (8M)

comprising:

a) expressing KASIIIa, KASIIIb and a thioesterase in a transformed cellular system; b) producing isobutryl-CoA in said transformed cellular system; and,

c) producing 8-methyl nonanoic acid.

2. The biosynthetic method of claim 1, wherein the thioesterase is FATB or FATB2.

3. The biosynthetic method of claim 1 or 2, wherein the cellular system also comprises a glucose medium for the transformed cellular system.

4. The biosynthetic method of any one of claims 1 to 3, wherein the KASIIIa is cloned from a plant of the Capsicum genus.

5. The biosynthetic method of any one of claims 1 to 4, wherein the KASIIIb is cloned from a plant of the Capsicum genus.

6. The biosynthetic method of claim 2, wherein the thioesterase is FATB2 and it is cloned from a plant of the Capsicum genus.

7. The biosynthetic method of claim 2, wherein the thioesterase is FATB and it is cloned from a plant of the Capsicum genus.

8. The biosynthetic method of any one of claims 4 to 7, wherein the Capsicum genus plant is the ghost chili.

9. The biosynthetic method of any one of claims 1 to 8, wherein the cellular system also comprises a pyruvate medium for the transformed cellular system.

10. The biosynthetic method of any one of claims 1 to 9, wherein said transformed cellular system is selected from the group including yeast, non-capsaicinoid producing plants, algae and bacteria.

11. The biosynthetic method of claim 10, wherein said cellular system is E. Coli.

12. The biosynthetic method of any one of claims 1 to 11, wherein said transformed cellular system further comprises acyl-CoA synthetase.

13. The biosynthetic method of any one of claims 1 to 12, wherein said transformed cellular system further comprises propionate CoA transferase.

14. The biosynthetic method of any one of claims 1 to 13, wherein the cellular system is selected from the group consisting of bacteria, yeast, plant cells, animal cells, an in vitro translation system and a combination thereof.

15. The biosynthetic method of any one of claims 1 to 14, wherein said 8M is purified.

16. The biosynthetic method of claim 15, wherein said 8M is greater than 70% pure.

17. The biosynthetic method of claim 16, wherein said 8M content is greater than about 85% by weight on a dry basis.

18. The biosynthetic method of any one of claims 1 to 14, wherein step c) comprises: i) purifying crude product; and, ii) removing solvents under vacuum to provide a concentrated capsaicinoid product.

19. The biosynthetic method of claim 18, wherein said crude product is purified by column chromatography.

20. The biosynthetic method of claim 18, wherein said crude product is purified by acid-base extraction.

21. The biosynthetic method of claim 18, wherein said crude product is purified by vacuum distillation.

22. The biosynthetic method of claim 18, wherein said crude product is purified by semi- preparative HPLC.

23. The biosynthetic method of any one of claims 1 to 22, wherein the transformed cellular system is a microbe culture of yeast or bacteria.

24. A thioesterase gene having the nucleic acid sequence of SEQ ID NO. 5.

25. A modified KasIIIa gene having the nucleic acid sequence of SEQ ID NO. 6.

26. A modified KasIIIb gene having the nucleic acid sequence of SEQ ID NO. 7.

27. A modified CCL4 gene having the nucleic acid sequence of SEQ ID NO. 8.

28. A modified PCT gene having the nucleic acid sequence of SEQ ID NO. 9.

29. A biosynthetic method of making a medium chain BCFA (branched-chain fatty acid) comprising: a) expressing KASIIIa, KASIIIb and a thioesterase in a transformed cellular system; b) producing isobutryl-CoA in said transformed cellular system; and, c) producing a C8 to C14 branched chain fatty acid.

30. The biosynthetic method of claim 29, wherein the branched chain fatty acid produced is used for biofuel production.

31. The biosynthetic method of claim 30, wherein the branched chain fatty acid produced used in biofuel production and has enhanced cold flow properties.

32. The biosynthetic method of claim 29, further comprising the use of FabH and isobuytyl- CoA to produce a fatty acid synthase (FAS) system for elongation of branched chain fatty acids.

33. The biosynthetic method of claim 29, further comprising the feeding of a-ketoisovalerate or a sodium salt thereof to the transformed cellular system to express 8M.

34. The biosynthetic method of claim 33, further comprising expressing bkd in the

transformed cellular system.

35. The biosynthetic method of claim 29, further comprising the feeding of isobutyric acid and/or salts thereof to the transformed cellular system to express 8M.

36. The biosynthetic method of claim 35, further comprising expressing CCL4 in the

transformed cellular system.

37. The biosynthetic method of claim 29, further comprising the feeding of isobutyric acid and/or salts thereof to the transformed cellular system to express 8M.

38. The biosynthetic method of claim 37, further comprising expressing PCT in the

transformed cellular system.

39. The biosynthetic method of claim 29, further comprising the feeding of isobutyric acid and/or salts thereof to the transformed cellular system to express 8M.

40. The biosynthetic method of claim 39, further comprising expressing alsS, ilvC, ilvD and bkd in the transformed cellular system.

41. A Fat2B protein having the amino acid sequence of SEQ ID NO. 2.

42. A protein comprising an amino acid sequence derived from the translation the nucleic acid sequence of SEQ ID NO. 5.

Description:
METHOD FOR THE MICROBIAL PRODUCTION OF 8-METHYL NONANOIC ACID

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Provisional Application No. 62/437,792, filed on

December 22, 2016, the contents of which are incorporated by reference herein in their entirety.

FIELD OF THE INVENTION

The field of the invention relates to methods and processes useful in the microbial production of 8-methyl nonanoic acid (CAS No. 5963-14-4) (hereinafter "8M"), an immediate precursor to capsaicin. Importantly, the current disclosure provides a method to produce 8M in yields not possible in plant-based systems and the sequence for the FatB2 protein. The present disclosure provides a genetically modified E. coli strain that has been modified to biosynthesize 8M, a method of producing the genetically modified microorganism to produce 8M, and a production method to produce 8M in high yield and purity levels starting from a specific starting material. In this disclosure, the KASIIIa, KASIIIb, FatB, and FatB2 genes were identified and characterized, including their involvement in the biosynthesis of 8M.

BACKGROUND OF THE INVENTION

The present disclosure is directed, in some embodiments, to a method for modifying a microbial strain to produce 8-methyl nonanoic acid (also known as "isocapric acid"), a desirable branched medium chain fatty acid compound in high yield, using a glucose or isobutyric acid as a feeding source depending on different pathways exploited for isobutryl-CoA formation for the microbial production strain. Capsaicinoid compounds (including without limitation: capsaicin (CP), dihydrocapsaicin (DHCP), nonivamide (NV)), are the "spicy" components of hot peppers, and may be effective for preventing a number of pathologies by promoting energy metabolism, stimulating the central nervous system and helping in weight loss via activating lipolytic enzymes. In addition, these compounds can act to sterilize the Gastrointestinal tract, improve immunity when consumed and are also effective for activating immunity and relieving fatigue. In the production of capsaicin, the 8M compound is an essential intermediate and degradation product of CP (FIG. 1).

Capsaicinoid Chemistry

"Spiciness" or pungency is a unique characteristic of hot peppers in the Capsicum genus that produce alkaloids called capsaicinoids. In nature, it is believed that capsaicinoids are present in plants to deter mammals from consuming their fruits and destroying seeds. Humans can sense capsaicinoids via a receptor structurally related to members of the transient receptor potential channels (or "TRP" family of ion channels). Similar to other receptors important in sensory neurons, the capsaicin receptor ("TRPV1" also referred to as the vanilloid TRP, vanilloid 1 Receptor or the TRPV1) mediates the pungent odor and pain/hot sensations associated with capsaicin and piperine by reversibly losing sensitivity to capsaicin as well as other pain and heat stimuli when it is under prolonged exposure to the same stimulus (Caterina et al., 1997). This phenomenon may partially explain why humans can tolerate and even enjoy spicy foods, as pain sensors may be turned off after extensive exposure.

Capsaicin (CP), 8-methyl-N-vanillyl-trans-6-nonenamide, and dihydrocapsaicin (DHCP),

8-methyl-N-vanillylnonanamide, are the two major capsaicinoids typically found in plants together they are responsible for up to 90% of the pungency (Garces-Claver et al., 2007) of the pepper family. Relative to the current disclosure 8M is an immediate biosynthetic precursor to CP but is generally only available in vanishingly small amounts when sourced from plant-based materials.

Speaking generally, the capsaicin content of chili peppers ranges from 0.1 to 1% w/w (Govindarajan and Sathyanarayana 1991) and for many uses requires that methods be used to concentrate the CP and/or DHCP compounds for any industrial or commercial use. As for 8M, it significantly increases the yield of capsaicin when added to immobilized and freely suspended cells of C. frutescens but is almost impossible to extract in appreciable amounts from plant-based sources.

In addition to their primary use in food as an ingredient or minor additive, capsaicinoids have many pharmaceutical and medical uses. They have been found to exert a series of physiological and pharmacological effects, including analgesia, anticancer, anti-inflammatory, anti-oxidative and anti-obesity activities and are used as the main components of ointments, patches, oils and creams designed to relieve pain caused by several diseases such as vasomotor rhinitis, osteoarthritis and rheumatoid arthritis (Aza-Gonzalez et al., 2011). Capsaicinoids are currently used as the main active ingredient in self -protection aerosol sprays on the market (Reilly et al., 2001 popularly called "pepper spray"). In addition, recently capsaicinoid consumption was reported to lower plasma cholesterol and improve endothelial function in hamsters (Liang et al., 2013). Due to these diverse and growing uses of capsaicinoids including 8M in food, medicine and defense, the demand in the international market for the capsaicin and capsaicinoids is increasing. The Capsicum genus includes over 20 species of peppers, from which C. annuum, C. frutescens, C. chinense, C. baccatum, and C. pubescens have been domesticated (Walsh and Hoot, 2001). There is wide genetic variation in pungency or spiciness levels across the various

Capsicum species due to varying levels and types of capsaicinoids present. For example, the non-pungent sweet bell pepper from C. annuum scored 0.0 SHU (Scoville Heat Unit; a scale that indicates the amount of capsaicin), while the "Bhut Jolokia" or ghost chili is a hybrid between C. chinense and C. frutescens from Northeastern India, scored up to 1,001,304 SHUs (Bosland and Baral, 2007) (see Table 1). Table 1.

In vivo, capsaicin is synthesized by capsaicin synthase (CS/AT3/Punl), an acyltransferase that transfers the 8-methyl nonenoyl moiety from 8-methylnonenoyl-CoA to vanillylamine to form an amide conjugate (FIG. 1, Chen et al. 2015 and Ogawa et al. 2015). Vanillylamine is formed from the phenylpropanoid pathway whereas the branched-chain fatty acid is derived from a branched-chain amino acid, e.g. valine (Curry et al., 1999; Stewart et al. 2007; Mazourek et al., 2009). Limitations in the production and use of capsaicins primarily stems from the prior need to extract it from plants. Typically, pure CP extracted from plants typically rates at approximately 16,000,000 Scoville units on the heat index and in this concentration can sell for over $5000 USD per gram (Batchelor, 2000)(see Table 1). However, the content of capsaicinoids actually present in hot peppers is generally very low and can be greatly affected by environmental and growth conditions leading to problems of sustainability and consistency. The production of 8M can help solve this issue through its use as a capsaicin content enhancer when incubated with intact C. frutescens cells as well as its direct use in various pharmacological and commercial formulations. According to the current disclosure, in some embodiments, the identification of genes related to the biosynthesis of 8M will help to increase capsaicinoid production through bioconversion in modified microbes.

When extracted from plants, typically solid-liquid extraction using solvents like hexane, chloroform, and ethanol are commonly employed for capsaicinoid recovery (Catchpole et al., 2003). However, solvent extraction is itself energy intensive, leads to problems of toxic waste disposal, requires extensive acreage for the plants themselves to be grown and yields a product that requires further purification for minor constituents such as 8M to be recovered. Thus, new production methods are needed to reduce costs of pure capsaicin or 8M production and/or the recovery of other capsaicinoids and lessen the environmental impact of large scale cultivation and processing (Yao et al., 1994). Genetic manipulation of selected microbial strains has the potential to address these needed improvements and increase the selectivity, abundance and production of specific capsaicinoids.

It is estimated that about 60% of pure capsaicin currently produced is used as a raw material or ingredient in pharmaceuticals and about 15% is used in pesticides with the remainder being used as a food additive. As of 2016, capsaicin extract is priced at about $5,000 per kilogram in the current international market with 1,200 tons of global annual production.

In addition to the above, while consumers approve and actively seek natural and biological sources for food, fragrance, flavor or medicinal components they are also concerned about sourcing, consistent potency and environmentally sustainable production. Into this situation, the microbial fermentation and production methods of the current disclosure provide compounds produced by modified microbial strains that have the capability to produce products biologically in quantities useful for a variety of industries and research while doing so in a more natural fashion than organic chemical synthesis. In the last three decades it has become evident that the world's fossil fuel reserves are decreasing and will eventually be depleted and will be unable to meet global need and/or will be cost prohibitive to recover. In response, the development of renewable resources has become a focus of governments and scientists alike. In this vein, the current disclosure provides for the production a medium chain (C8-C14) branched chain fatty acids which, if produced in volume, would provide a base for enhanced biofuels along with the other benefits provided herein.

Therefore, a component of the current disclosure is in the focus on microbial fatty acid biosynthesis with the aim to produce medium-chain fatty acids or derivatives thereof for use in the production of diesel.

As an industrially important organism and with the best studied microbial fatty acid biosynthesis, E. coli has often been the organism of choice for studies into fatty acid biosynthesis. Plant-based production of fatty acids for fuel use is available but as world population grows a debate concerning the social and environmental consequences of the use of food and agricultural land for biofuel production has also grown. Relative to plant-based biofuel production the methods herein, in some embodiments, provide for the microbial production of medium chain (C8-C14) free fatty acids (FFA) useful in the production of biofuels. Such production would be beneficial for the quick production time and renewable nature. They also would avoid using agricultural land that could be otherwise occupied growing food for an increasing population.

Accordingly, a need exists for the development of a novel method of producing specific capsaicin intermediates, such as 8M, for consumers or for industrial needs. Specifically, the current disclosure provides, in some embodiments, methods to produce 8M from a fermentation process in commercially relevant amounts.

SUMMARY OF THE INVENTION

The present disclosure encompasses improved methods of producing 8-methyl nonanoic acid. The current disclosure provides, in some embodiments, a method to produce 8-methyl nonanoic acid in modified microbes that comprise a cellular system, such as yeast or bacteria. According to the current disclosure, a microbial fermentation process is provided that synthesizes 8M. In some embodiments, the cultures of the disclosure can produce 8M when the modified microbial strains are fed from natural precursors such as glucose or isobutyric acid. The identification of genes related to 8M production e. g. FatB and KASIII is also an essential part of this disclosure.

Due to the low-temperature operability of branched-chain fatty acids which are superior to straight chain fuels, several microbial fermentation processes to produce branched long-chain fatty acids have been described in the field of biofuel research (Howard et al. 2013; Haushalter et al. 2014; Jiang et al. 2015; Tao et al. 2015; and, Bentley et al. 2016). However, there has been no report on the ability to produce branched medium-chain fatty acid production using microbial fermentation techniques or methods, such as provided herein.

As shown in FIG. 1, an acyl-ACP thioesterase controls the chain length of fatty acid and is one of the key enzymes in 8M biosynthetic pathway. However, such an 8M-specific thioesterase has not been characterized thus far. Aluru et al. (2003), using a differential expression technique isolated a FatA gene (GenBank: AF318288.1) from pepper fruits and it was theorized that this thioesterase could be involved in 8M biosynthesis (FIG. 1, Stewart et al. 2007). However, Aluru et al., did not provide the biochemical activity of FatA and no study since has reported on this feature of FatA. Using bioinformatics tools, Mazourek et al. (2009) isolated another acyl-ACP thioesterase, FatB (GenBank: EU616562) from hot peppers. Likewise, FatB has not been previously characterized thus far and its role in 8M biosynthesis is still unknown. In this application, we reported the identification of a third acyl-ACP thioesterase gene from the genome of C. annuum L_Zunla-l (Qin et al. 2014) and named it FatB2 (NCBI Reference Sequence:

XM_016708605.1). FatB2 has 372 aa which contains 64 aa of signal peptide that designate it as chloroplast targeted. (See SEQ IDS NOs 1 through 5).

In some embodiments, the current method provides an approach for the development of a bacterial strain that can produce significant volumes of 8-methyl nonanoic acid by genetic modification and targeted feeding of specific starting molecules that are more cost effective and easier to obtain. According to some embodiments of the current disclosure the production of isobutyryl-CoA is the key intermediate. Once it is produced, KASIIIa/KASIIIb are involved in its elongation, the thioesterase (FatB/FatB2) controls the length of elongated fatty acids. 8M is produced after 3 rounds of elongation with the cutting by FatB/FatB2. (See SEQ IDS NOs 6 through 9). Additional rounds can be included to further elongate the fatty acids produced hereby.

An embodiment of the present disclosure is a biosynthetic method of developing 8-methyl nonanoic acid by making isobutyryl-CoA from glucose or isobutyric acid.

In the current disclosure we demonstrated three ways of making isobutyryl-CoA: 1) de novo from glucose/pyruvate; 2) from ACS (AAE, acyl-CoA synthetase); and, 3) from PCT (propionate CoA-transferase). In some embodiments, this intermediate is itself key to producing 8M microbially.

In an alternative embodiment, the feeding of glucose to the cellular system comprises expressing the genes for de novo biosynthesis of isobutyryl-CoA in the cellular system, the feeding of isobutyric acid to the cellular system expressing a isobutyryl-CoA synthetase (ACS) gene or a propionate/isobutyrate-CoA transferase (PCT) gene.

The current disclosure provides, in part, three pathways for isobutyryl-CoA formation (see the publications cited below for additional pathway information) and thereafter the production of branched medium-chain fatty acids in engineered microbes:

1. For de novo using alsS, ilvC, ilvD and bkd genes (Jiang et al.,);

2. Using the ACS (AAE) pathway, (Zhang et al.,); and,

3. Using the PCT pathway, (McMahon MD et al.,)

In terms of product/commercial utility there are several dozen products containing capsaicin or close analogs such as 8M on the market in the United States and can be used in everything from analgesics to pest repellents as well as in foods and as an exercise supplement. Products containing 8M can be aerosols, liquids, or granular formulations.

According to some embodiments of the current disclosure, medium chain (C8 through C14) branched-chain fatty acids (BCFAs) can be produced for use in biofuel production. These molecules are important in the development of biofuels as they enhance the operating temperature range of the fuel, especially with regard to developing fuels that have superior cold-flow properties. Single chain fatty acids (SCFAs) by contrast appear much easier to produce via microbial bioconversion. Cellular synthesis of both SCFAs and BCFAs requires the FabH enzyme that can catalyze condensation reactions between malonyl-ACP and a short-chain acyl- CoA.

In some aspects, the disclosure provides a biosynthetic method of making the capsaicinoid 8-methyl nonanoic acid (8M) comprising (a) expressing KASIIIa, KASIIIb and a thioesterase in a transformed cellular system; (b) producing isobutryl-CoA in the transformed cellular system; and, (c) producing 8-methyl nonanoic acid.

In some embodiments of the method, the thioesterase is FATB or FATB2. In some embodiments of the method, the KASIIIa is cloned from a plant of the Capsicum genus (e.g., ghost chili). In some embodiments of the method, the KASIIIb is cloned from a plant of the Capsicum genus (e.g., ghost chili). In some embodiments of the method, the thioesterase is FATB2 and it is cloned from a plant of the Capsicum genus (e.g., ghost chili). In some embodiments of the method, the thioesterase is FATB and it is cloned from a plant of the

Capsicum genus (e.g., ghost chili).

In some embodiments of the method, the cellular system also comprises a glucose medium for the transformed cellular system. In some embodiments of the method, the cellular system also comprises a pyruvate medium for the transformed cellular system. In some embodiments of the method, the transformed cellular system further comprises acyl-CoA synthetase. In some embodiments of the method, the transformed cellular system further comprises propionate CoA transferase.

In some embodiments of the method, the transformed cellular system is selected from the group including yeast, non-capsaicinoid producing plants, algae and bacteria. In some

embodiments of the method, the cellular system is E. Coli. In some embodiments of the method, the cellular system is selected from the group consisting of bacteria, yeast, plant cells, animal cells, an in vitro translation system and a combination thereof. In some embodiments of the method, the transformed cellular system is a microbe culture of yeast or bacteria.

In some embodiments of the method, the 8M is purified. In some embodiments of the method, the 8M is greater than 70% (e.g., greater than 70%, greater than 80%, greater than 90%, greater than 95%, greater than 98%, or greater than 99%) pure. In some embodiments of the method, the 8M content is greater than about 85% (e.g., greater than about 85%, greater than about 90%, greater than about 95%, greater than about 98%, greater than about 99%) by weight on a dry basis.

In some embodiments of the method, step c) comprises: i) purifying crude product; and, ii) removing solvents under vacuum to provide a concentrated capsaicinoid product. In some embodiments of the method, the crude product is purified by column chromatography. In some embodiments of the method, the crude product is purified by acid-base extraction. In some embodiments of the method, the crude product is purified by vacuum distillation. In some embodiments of the method, the crude product is purified by semi-preparative HPLC.

In other aspects the disclosure provides a thioesterase gene having the nucleic acid sequence of SEQ ID NO. 5 or a nucleic acid sequence that is at least 80% (e.g., at least 80%, at least 90%, at least 95%, at least 98%, or at least 99%) identical to SEQ ID NO. 5. In some embodiments, the gene is in a host cell (e.g., a bacteria cell, a yeast cell, a plant cell, or an animal cell). In some embodiments, the gene is in an E. coli cell.

In other aspects the disclosure provides a modified KasIIIa gene having the nucleic acid sequence of SEQ ID NO. 6 or a nucleic acid sequence that is at least 80% (e.g., at least 80%, at least 90%, at least 95%, at least 98%, or at least 99%) identical to SEQ ID NO. 6. In some embodiments, the gene is in a host cell (e.g., a bacteria cell, a yeast cell, a plant cell, or an animal cell). In some embodiments, the gene is in an E. coli cell.

In other aspects the disclosure provides a modified KasIIIb gene having the nucleic acid sequence of SEQ ID NO. 7 or a nucleic acid sequence that is at least 80% (e.g., at least 80%, at least 90%, at least 95%, at least 98%, or at least 99%) identical to SEQ ID NO. 7. In some embodiments, the gene is in a host cell (e.g., a bacteria cell, a yeast cell, a plant cell, or an animal cell). In some embodiments, the gene is in an E. coli cell.

In other aspects the disclosure provides a modified CCL4 gene having the nucleic acid sequence of SEQ ID NO. 8 or a nucleic acid sequence that is at least 80% (e.g., at least 80%, at least 90%, at least 95%, at least 98%, or at least 99%) identical to SEQ ID NO. 8. In some embodiments, the gene is in a host cell (e.g., a bacteria cell, a yeast cell, a plant cell, or an animal cell). In some embodiments, the gene is in an E. coli cell.

In other aspects the disclosure provides a modified PCT gene having the nucleic acid sequence of SEQ ID NO. 9 or a nucleic acid sequence that is at least 80% (e.g., at least 80%, at least 90%, at least 95%, at least 98%, or at least 99%) identical to SEQ ID NO. 9. In some embodiments, the gene is in a host cell (e.g., a bacteria cell, a yeast cell, a plant cell, or an animal cell). In some embodiments, the gene is in an E. coli cell.

In other aspects, the disclosure provides a Fat2B protein having the amino acid sequence of SEQ ID NO. 2 or an amino acid sequence that is at least 80% (e.g., at least 80%, at least 90%, at least 95%, at least 98%, or at least 99%) identical to SEQ ID NO. 2. In some embodiments, the protein is in a host cell (e.g., a bacteria cell, a yeast cell, a plant cell, or an animal cell). In some embodiments, the protein is in an E. coli cell.

In other aspects, the disclosure provides a protein comprising an amino acid sequence derived from the translation the nucleic acid sequence of SEQ ID NO. 5. In some embodiments, the protein is in a host cell (e.g., a bacteria cell, a yeast cell, a plant cell, or an animal cell). In some embodiments, the protein is in an E. coli cell.

In other aspects the disclosure provides a biosynthetic method of making a medium chain

BCFA (branched-chain fatty acid) comprising (a) expressing KASIIIa, KASIIIb and a thioesterase in a transformed cellular system; (b) producing isobutryl-CoA in the transformed cellular system; and, (c) producing a C8 to C14 branched chain fatty acid. In some embodiments of the method, the branched chain fatty acid produced is used for biofuel production. In some embodiments of the method, the branched chain fatty acid produced is used in biofuel production and has enhanced cold flow properties. In some embodiments of the method, the method further comprises the use of FabH and isobuytyl-CoA to produce a fatty acid synthase (FAS) system for elongation of branched chain fatty acids. In some embodiments of the method, the method further comprises the feeding of a-ketoisovalerate or a sodium salt thereof to the transformed cellular system to express 8M. In some embodiments of the method, the method further comprises expressing bkd in the transformed cellular system. In some embodiments of the method, the method further comprises the feeding of isobutyric acid and/or salts thereof to the transformed cellular system to express 8M. In some embodiments of the method, the method further comprises expressing CCL4 in the transformed cellular system. In some embodiments of the method, the method further comprises the feeding of isobutyric acid and/or salts thereof to the transformed cellular system to express 8M. In some embodiments of the method, the method further comprises expressing PCT in the transformed cellular system. In some embodiments of the method, the method further comprises the feeding of isobutyric acid and/or salts thereof to the transformed cellular system to express 8M. In some embodiments of the method, the method further comprises expressing alsS, ilvC, ilvD and bkd in the transformed cellular system.

In some embodiments of the method, the transformed cellular system is selected from the group including yeast, non-capsaicinoid producing plants, algae and bacteria. In some

embodiments of the method, the cellular system is E. Coli. In some embodiments of the method, the cellular system is selected from the group consisting of bacteria, yeast, plant cells, animal cells, an in vitro translation system and a combination thereof. In some embodiments of the method, the transformed cellular system is a microbe culture of yeast or bacteria.

In some embodiments of the method, the branched chain fatty acid and/or 8M is purified.

In some embodiments of the method, the branched chain fatty acid and/or 8M is greater than 70% (e.g., greater than 70%, greater than 80%, greater than 90%, greater than 95%, greater than 98%, or greater than 99%) pure. In some embodiments of the method, the branched chain fatty acid and/or 8M content is greater than about 85% (e.g., greater than about 85%, greater than about 90%, greater than about 95%, greater than about 98%, greater than about 99%) by weight on a dry basis.

In some embodiments of the method, step c) comprises: i) purifying crude product; and, ii) removing solvents under vacuum to provide a concentrated branched chain fatty acid and/or 8M product. In some embodiments of the method, the crude product is purified by column chromatography. In some embodiments of the method, the crude product is purified by acid-base extraction. In some embodiments of the method, the crude product is purified by vacuum distillation. In some embodiments of the method, the crude product is purified by semi- preparative HPLC.

As for the cellular system, in some embodiments, it is selected from the group consisting of bacteria, yeast, and a combination thereof, or any cellular system that would allow the genetic transformation with the selected genes and thereafter the biosynthetic production of the desired 8- methyl nonanoic acid from a feeding of glucose (de novo pathway) or isobutyric acid (ACS or PCT pathway) precursors. In a most preferred microbial system, E. coli is used to produce the desired 8M. While the disclosure is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail. It should be understood, however, that the drawings and detailed description presented herein are not intended to limit the disclosure to a particular embodiment disclosed, but on the contrary, the intention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the present disclosure as defined by the appended claims.

Other features and advantages of this invention will become apparent in the following detailed description of preferred embodiments of this invention, taken with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1. The Capsaicinoid Biosynthetic Pathway. The aminotransferase (pAMT) catalyzes the formation of vanillyamine from vanillin. Pal, phenylalanine ammonia lyase; Ca4H, cinnamic acid 4-hydroxylase; 4CL, 4-coumarate CoA ligase; HCT, hydroxycinnamoyl transferase; C3H, coumaroyl shikimate/quinate 3-hydroxylase; COMT, caffeic acid O-methyltransferase; pAMT, aminotransferase; BCAT, branched-chain amino acid transferase; Kas, 3-keto-acyl ACP synthase; ACL, acyl carrier protein; Fat, acyl-ACP thioesterase; ACS, acyl-CoA synthetase; CS, capsaicin synthase (Stewart C et al. 2007, J. Exp. Bot. 58:979-991).

FIG. 2. The activities of purified FatA (TE1, left bar of each pair of bars) and FatB (TE2, right bar of each pair of bars) proteins against various acyl-CoAs. Acyl-CoAs were purchased from Sigma (St Louis, MO). The activity of thioesterase (TE) was assayed in 100 mM Tris buffer, pH7.5 containing 100 mg/L of acyl-CoA and 5 μΐ of purified enzyme at 30°C. Free CoA and acyl- CoAs were analyzed by HPLC. HPLC was performed with Dionex - UltiMate® 3000 LC

Systems (Thermo Scientific) using an Acclaim® 120 CI 8 reversed-phase column (Thermo Scientific; 3 μ, 120 A, 150 χ 3 mm). The mobile phase consisted of solvent A (0.1%

trifluoroacetic acid) and solvent B (acetonitrile). The gradient elution procedure was as follows: 0 to 5 min, 5% of B; 5 to 9 min, a linear gradient from 5 to 80% of B; 9 to 1 1 min, 80% of B; 1 1 to 12 min, 5% of B. The flow rate was 0.6 ml/min. The diode array detector collected data in the 200- to 400-nm range. For detection and quantification of substrate and products, peak areas were measured at 257 nm. Due to the unavailability of commercial acyl-ACPs, various acyl-CoAs from Sigma were used as the substrates of thioesterases.

FIG. 3. GC/MS analysis of supernatants from IPTG-induced BL21(DE3) cultures.

GC/MS analysis of supernatants from IPTG-induced BL21(DE3) cultures. CK, BL21(DE3) with the empty vector; TEl, FatA; TE2, FatB; TE3, FatB2. The cultures were grown at 16°C after the induction by 1 mM IPTG and one day later, fatty acids were extracted from the supernatants.

FIG. 4. Cis-5-dodecenoic acid (CAS No. 2430-94-6).

FIG. 5. Production of 8M in strains with different FabH genes. KBB, KAS Illb; KAB, KAS Ilia; KBsB l, FabH2 from Bacillus subtilis 168; KSaB, FabH from Staphylococcus aureus. KBsB and KSaB are two positive controls (Jiang et al. 2015). The experiment was performed in triplicate.

FIG. 6. MS spectra of putative 8M peak and 8M standard and GC profile of 8M Standard. The lower spectrum is the library match. FIG. 7. Production of 8M in strains with CCL4, FaB2 and different FabH genes. BBA,

KAS Illb; ABA, KAS Ilia; BsBAl, FabH2 from Bacillus subtilis 168; SaBA, FabH from

Staphylococcus aureus. The experiment was performed in triplicate.

FIG. 8. De Novo production of 8M in E. coli. De Novo production of 8M in E. coli. Colored genes are those engineered ones. alsS and bkd are from Bacillus subtilis 168; ilvC and ilvD are from E. coli; KAS IIIa/KAS Illb and FatB/FatB2 are from hot peppers. This figure was adapted from FIG. 1 of Jiang et al. (2015). (See SEQ IDS NOs 1 through 7).

DESCRIPTION OF THE PREFERRED EMBODIMENT Explanation of Terms Used Herein:

Capsaicin or CP is a colorless irritant phenolic amide C 18 H 27 NO 3 and is one of a series of phenolic amides found in various Capsicum species and hybrids thereof that gives hot peppers their hotness or pungency and that is used for food, medicine, and security applications. Pure CP is a volatile, hydrophobic, colorless, odorless, crystalline to waxy compound.

Capsaicinoid as used herein this refers to a class of irritant compounds, related to

Capsaicin, that are responsible for the heat of chili peppers. They are irritants for mammals, including humans, and produce a sensation of burning in any tissue with which they come into contact. The capsaicinoids including 8M are produced as secondary metabolites by chili peppers, probably as deterrents against certain mammals and fungi.

Cellular system is any cells that provide for the expression of ectopic proteins. It includes bacteria, yeast, plant cells and animal cells. It includes both prokaryotic and eukaryotic cells. It also includes the in vitro expression of proteins based on cellular components, such as ribosomes. Fatty Acids, C8-C14. According to the current disclosure a variety of fatty acids can be used as starting source materials. The source materials include vanillin, vanillylamine or their derivatives with modifications at the aromatic ring such as methylation, ethylation, or glycosylation; and more particularly 8-14 carbon straight chain or branched chain fatty acids or their derivatives such as hydroxy fatty acids (Ex: Octanoic acid; Nonanoic acid; Decanoic acid; Undecanoic acid; and, Dodecanoic acid) can be straight chain fatty acids or branched chain fatty acids and be used to make the 8-methyl nonanoic acid of the current disclosure.

Growing the Cellular System. Growing includes providing an appropriate medium that would allow cells to multiply and divide. It also includes providing resources so that cells or cellular components can translate and make recombinant proteins.

Protein Expression. Protein production can occur after gene expression. It consists of the stages after DNA has been transcribed to messenger RNA (mRNA). The mRNA is then translated into polypeptide chains, which are ultimately folded into proteins. DNA is present in the cells through transfection - a process of deliberately introducing nucleic acids into cells. The term is often used for non-viral methods in eukaryotic cells. It may also refer to other methods and cell types, although other terms are preferred: "transformation" is more often used to describe non-viral DNA transfer in bacteria, non-animal eukaryotic cells, including plant cells. In animal cells, transfection is the preferred term as transformation is also used to refer to progression to a cancerous state (carcinogenesis) in these cells. Transduction is often used to describe virus- mediated DNA transfer. Transformation, transduction, and viral infection are included under the definition of transfection for this application.

Yeast. According to the current disclosure yeast as claimed herein are eukaryotic, single- celled microorganisms classified as members of the fungus kingdom. Yeast are unicellular organisms which evolved from multicellular ancestors but with some species useful for the current disclosure being those that can develop multicellular characteristics by forming strings of connected budding cells known as pseudo hyphae or false hyphae.

Acronyms:

AAE, acyl activating enzyme

PCT, propionate/isobutyrate-CoA transferase TE, acyl-ACP thioesterase

Pal, phenylalanine ammonia lyase;

Ca4H, cinnamic acid 4-hydroxylase; 4CL, 4-coumarate CoA ligase;

HCT, hydroxycinnamoyl transferase;

C3H, coumaroyl shikimate/quinate 3-hydroxylase;

COMT, caffeic acid O-methyltransferase; AMT, aminotransferase;

BCAT, branched-chain amino acid transferase;

KAS, 3-keto-acyl ACP synthase;

ACL, acyl carrier protein;

FatA, acyl-ACP thioesterase;

ACS, acyl-CoA synthetase; and,

CS, capsaicin synthase.

DETAILED DESCRIPTION

The present disclosure provides, in part, a microbial fermentation system for the production of 8M.

Due to the low-temperature operability of branched-chain fatty acids which are superior to straight chain fuels, several microbial fermentation processes for the production of branched long- chain fatty acids have been described in the field of biofuel research (Howard et al. 2013;

Haushalter et al. 2014; Jiang et al. 2015; Tao et al. 2015; Bentley et al. 2016). However, there has been no report on branched medium-chain fatty acid production using microbial fermentation.

As shown in FIG. 1, an acyl-ACP thioesterase controls the chain length of fatty acid and is one of the key enzymes in 8M biosynthetic pathway. However, such an 8M-spectic thioesterase has not been characterized so far. Aluru et al. (2003), based on differential expression, once isolated FatA gene (GenBank: AF318288.1) from pepper fruits and people tend to believe this thioesterase is involved in 8M biosynthesis (FIG. 1, Stewart et al. 2007). However, the biochemical activity of FatA has not been reported so far. Using bioinformatics tools, Mazourek et al. (2009) isolated another acyl-ACP thioesterase, FatB (GenBank: EU616562) from hot peppers. However, FatB has not been characterized so far and its role in 8M biosynthesis is still unknown. In this application, we reported the identification of the third acyl-ACP thioesterase gene from the genome of C. annuum L_Zunla-l (Qin et al. 2014) and named it as FatB2 (NCBI Reference Sequence: XM_016708605.1). FatB2 has 372 aa which contains 64 aa of signal peptide for chloroplast targeting. (See SEQ IDS NOs 1 through 5).

The 8M produced herein was synthesized in modified E. coli cultures that were modified to carry the KASIIIa/KASIIIb and FatB/FatB2 genes from hot peppers and other genes related to the production of isobutyryl-CoA. (See SEQ IDS NOs 1 through 6). Isobutyryl-CoA could be generated de novo from glucose by the overexpression of alsS, ilvC, ilvD and bkd genes.

Alternatively, isobutyryl-CoA could be produced from fed isobutyric acid by the overexpression of an ACS (isobutyryl-CoA synthetase) or a PCT (propionate CoA-transferase) gene. These genes allowed the properly fed selected strains to synthesize the 8-methyl nonanoic acid (FIG. 8).

Organic or non-biological processes for the synthesis of capsaicin and analogues thereof have been reported, for example, by Crombie et al., (J. CHEM SOC, 1025-27 (1955)) describes an inorganic synthesis of capsaicin, N-(4-hydroxy-3-methoxybenzyl)-8-methylnon-trans-6-enamide, the active principle in red pepper. Other organic pathways have been shown in U.S. Pat. No. 4,493,848 issued to LaHann et al., and U.S. Pat. No. 5,094,782 issued to Chen et al.

Capsaicinoids, have long been used as an experimental tool because of their selective action on the small diameter afferent nerve fibers C-fibers and A-delta fibers that are believed to signal pain. From studies in animals, capsaicinoids appears to trigger C-fiber membrane depolarization by opening cation channels permeable to calcium and sodium. Recently one of the receptors for capsaicinoid effects has been cloned.

In most chili peppers, vanillylamine is formed from phenylalanine via ferulic acid, vanillin and related compounds, and capsaicinoid is produced from vanillyamine and branched chain fatty acid by capsaicin synthase (FIG. 1).

Synthetic Biology

Genetically engineered microbes have become an increasingly important platform for the production of drugs, chemicals, and biofuels from renewable resources (Du et al., 2011). These bio technological products, when used in food, can be labeled 'natural' in food sector according to current regulations (Hausler and Miinch, 1997).

Exemplary capsaicinoids include, but are not limited to: nonivamide, N- vanillylnonanamides, N-vanillylsulfonamides, N-vanillylureas, N-vanillylcarbamates,

N[(substituted phenyl)methyl]alkylamides, methylene substituted N[(substituted

phenyl)methyl]alkanamides, N[(substituted phenyl)methyl]-cis-monosaturated alkenamides, N[(substituted phenyl)methyl]diunsaturated amides, 8M, 3-hydroxyacetanilide,

hydroxyphenylacetamides, pseudocapsaicin, dihydrocapsaicin, nordihydrocapsaicin, homocapsaicin, homodihydrocapsaicin I, anandamide, piperine, zingerone, warburganal, polygodial, aframodial, cinnamodial, cinnamosmolide, cinnamolide, civamde, nonivamide, olvanil, N-oleyl-homovanillamidia, isovelleral, scalaradial, ancistrodial, and any combinations or mixtures thereof.

Determination of Sequence Similarity Using Hybridization Techniques

Nucleic acid hybridization is a technique well known to those of skill in the art of DNA manipulation. The hybridization properties of a given pair of nucleic acids are an indication of their similarity or identity.

The term "hybridization" refers generally to the ability of nucleic acid molecules to join via complementary base strand pairing. Such hybridization may occur when nucleic acid molecules are contacted under appropriate conditions. "Specifically hybridizes" refers to the ability of two nucleic acid molecules to form an anti-parallel, double- stranded nucleic acid structure. A nucleic acid molecule is said to be the "complement" of another nucleic acid molecule if they exhibit "complete complementarity," i.e., each nucleotide in one sequence is complementary to its base pairing partner nucleotide in another sequence. Two molecules are said to be "minimally complementary" if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under at least conventional "low- stringency" conditions. Similarly, the molecules are said to be "complementary" if they can hybridize to one another with sufficient stability to permit them to remain annealed to one another under conventional "high-stringency" conditions. Nucleic acid molecules that hybridize to other nucleic acid molecules, e.g., at least under low stringency conditions are said to be "hybridizable cognates" of the other nucleic acid molecules. Conventional low stringency and high stringency conditions are described herein and by Sambrook et al., MOLECULAR CLONING, A LABORATORY MANUAL, 2nd Ed., Cold Spring Harbor Press, Cold Spring Harbor, New York (1989) and by

Haymes et al., NUCLEIC ACID HYBRIDIZATION, A PRACTICAL APPROACH, IRL Press, Washington, DC (1985). Departures from complete complementarity are permissible, as long as such departures do not completely preclude the capacity of the molecules to form a double- stranded structure.

Low stringency conditions may be used to select nucleic acid sequences with lower sequence identities to a target nucleic acid sequence. One may wish to employ conditions such as about 0.15 M to about 0.9 M sodium chloride, at temperatures ranging from about 20°C to about 55°C. High stringency conditions may be used to select for nucleic acid sequences with higher degrees of identity to the disclosed nucleic acid sequences (Sambrook et al., 1989). High stringency conditions typically involve nucleic acid hybridization in about 2X to about 10X SSC (diluted from a 20X SSC stock solution containing 3 M sodium chloride and 0.3 M sodium citrate, pH 7.0 in distilled water), about 2.5X to about 5X Denhardt's solution (diluted from a 50X stock solution containing 1% (w/v) bovine serum albumin, 1% (w/v) ficoll, and 1% (w/v) polyvinylpyrrolidone in distilled water), about 10 mg/mL to about 100 mg/mL fish sperm DNA, and about 0.02% (w/v) to about 0.1% (w/v) SDS, with an incubation at about 50°C to about 70°C for several hours to overnight. High stringency conditions are preferably provided by 6X SSC, 5X Denhardt's solution, 100 mg/mL fish sperm DNA, and 0.1% (w/v) SDS, with an incubation at 55°C for several hours. Hybridization is generally followed by several wash steps. The wash compositions generally comprise 0.5X to about 10X SSC, and 0.01% (w/v) to about 0.5% (w/v) SDS with a 15 minute incubation at about 20°C to about 70°C. Preferably, the nucleic acid segments remain hybridized after washing at least one time in 0.1X SSC at 65°C.

Analysis of Sequence Similarity Using Identity Scoring

As used herein "sequence identity" refers to the extent to which two optimally aligned polynucleotide or peptide sequences are invariant throughout a window of alignment of components, e.g., nucleotides or amino acids. An "identity fraction" for aligned segments of a test sequence and a reference sequence is the number of identical components which are shared by the two aligned sequences divided by the total number of components in reference sequence segment, i.e., the entire reference sequence or a smaller defined part of the reference sequence.

As used herein, the term "percent sequence identity" or "percent identity" refers to the percentage of identical nucleotides in a linear polynucleotide sequence of a reference ("query") polynucleotide molecule (or its complementary strand) as compared to a test ("subject") polynucleotide molecule (or its complementary strand) when the two sequences are optimally aligned (with appropriate nucleotide insertions, deletions, or gaps totaling less than 20 percent of the reference sequence over the window of comparison). Optimal alignment of sequences for aligning a comparison window are well known to those skilled in the art and may be conducted by tools such as the local homology algorithm of Smith and Waterman, the homology alignment algorithm of Needleman and Wunsch, the search for similarity method of Pearson and Lipman, and preferably by computerized implementations of these algorithms such as GAP, BESTFIT, FASTA, and TFASTA available as part of the GCG® Wisconsin Package® (Accelrys Inc., Burlington, MA). An "identity fraction" for aligned segments of a test sequence and a reference sequence is the number of identical components which are shared by the two aligned sequences divided by the total number of components in the reference sequence segment, i.e., the entire reference sequence or a smaller defined part of the reference sequence. Percent sequence identity is represented as the identity fraction multiplied by 100. The comparison of one or more polynucleotide sequences may be to a full-length polynucleotide sequence or a portion thereof, or to a longer polynucleotide sequence. For purposes of this disclosure "percent identity" may also be determined using BLASTX version 2.0 for translated nucleotide sequences and BLASTN version 2.0 for polynucleotide sequences.

The percent of sequence identity is preferably determined using the "Best Fit" or "Gap" program of the Sequence Analysis Software Package™ (Version 10; Genetics Computer Group, Inc., Madison, WI). "Gap" utilizes the algorithm of Needleman and Wunsch (Needleman and Wunsch, JOURNAL OF MOLECULAR BIOLOGY 48:443-453, 1970) to find the alignment of two sequences that maximizes the number of matches and minimizes the number of gaps. "BestFit" performs an optimal alignment of the best segment of similarity between two sequences and inserts gaps to maximize the number of matches using the local homology algorithm of Smith and Waterman (Smith and Waterman, ADVANCES IN APPLIED MATHEMATICS, 2:482-489, 1981, Smith et al., NUCLEIC ACIDS RESEARCH 11:2205-2220, 1983). The percent identity is most preferably determined using the "Best Fit" program.

Useful methods for determining sequence identity are also disclosed in the Basic Local Alignment Search Tool (BLAST) programs which are publicly available from National Center Biotechnology Information (NCBI) at the National Library of Medicine, National Institute of Health, Bethesda, Md. 20894; see BLAST Manual, Altschul et al., NCBI, NLM, NIH; Altschul et al., J. MOL. BIOL. 215:403-410 (1990); version 2.0 or higher of BLAST programs allows the introduction of gaps (deletions and insertions) into alignments; for peptide sequence BLASTX can be used to determine sequence identity; and, for polynucleotide sequence BLASTN can be used to determine sequence identity.

As used herein, the term "substantial percent sequence identity" refers to a percent sequence identity of at least about 70% sequence identity, at least about 80% sequence identity, at least about 85% identity, at least about 90% sequence identity, or even greater sequence identity, such as about 98% or about 99% sequence identity. Thus, one embodiment of the disclosure is a polynucleotide molecule that has at least about 70% sequence identity, at least about 80% sequence identity, at least about 85% identity, at least about 90% sequence identity, or even greater sequence identity, such as about 98% or about 99% sequence identity with a

polynucleotide sequence described herein. Polynucleotide molecules that have the activity of the ACS and CS genes of the current disclosure are capable of directing the production of a variety of capsaicinoids and have a substantial percent sequence identity to the polynucleotide sequences provided herein and are encompassed within the scope of this disclosure.

"Homology" refers to the level of similarity between two or more nucleic acid or amino acid sequences in terms of percent of positional identity (i.e., sequence similarity or identity). Homology also refers to the concept of similar functional properties among different nucleic acids or proteins.

In an alternative embodiment, the nucleic acid molecule comprises a nucleic acid sequence that exhibits 70% or greater identity, and more preferably at least 80 or greater, 85 or greater, 87 or greater, 88 or greater, 89 or greater, 90 or greater, 91 or greater, 92 or greater, 93 or greater, 94 or greater, 95 or greater, 96 or greater, 97 or greater, 98 or greater, or 99% or greater identity to a nucleic acid molecule selected from the group consisting of SEQ ID NO: 1 through and SEQ ID NO: 9 or any other nucleic acid sequence described herein, any complements thereof, any fragments thereof, or any cis elements thereof. The nucleic acid molecule preferably comprises a nucleic acid sequence that exhibits a 75% or greater sequence identity with a polynucleotide selected from the group consisting of SEQ ID NO: 1 through and SEQ ID NO: 9 or any other nucleic acid sequence described herein, any complements thereof, any fragments thereof, or any cis elements thereof. The nucleic acid molecule more preferably comprises a nucleic acid sequence that exhibits an 80% or greater sequence identity with a polynucleotide selected from the group consisting of SEQ ID NO: 1 through SEQ ID NO: 9 or any other nucleic acid sequence described herein, any complements thereof, any fragments thereof, or any cis elements thereof. The nucleic acid molecule most preferably comprises a nucleic acid sequence that exhibits an 85% or greater sequence identity with a polynucleotide selected from the group consisting of SEQ ID NO: 1 through SEQ ID NO: 9 or any other nucleic acid sequence described herein, any complements thereof, any fragments thereof, or any cis elements thereof.

For purposes of this disclosure "percent identity" may also be determined using BLASTX version 2.0 for translated nucleotide sequences and BLASTN version 2.0 for polynucleotide sequences. In a preferred embodiment of the present disclosure, the presently disclosed corn genomic promoter sequences comprise nucleic acid molecules or fragments having a BLAST score of more than 200, preferably a BLAST score of more than 300, and even more preferably a BLAST score of more than 400 with their respective homologues.

According to another embodiment of the current disclosure the efficiency of heterologous protein production in a microbial system can be enhanced by codon changes that alter the DNA sequences to one that may be preferred by the cellular system being used for expression but that varies from the original gene source organism without changing the eventual polypeptide produced. Approaches normally used to overcome this problem include targeted mutagenesis to remove rare codons or the addition of rare codon tRNAs in specific cell lines to move towards a codon sequence preferred by a host organism that will produce the polypeptide of interest.

Recently, improvements in such "codon optimization" technology have enabled cost-effective production of synthetic genes, making this a feasible alternative and potentially useful for the current disclosure.

Identity and Similarity

Identity is the fraction of amino acids that are the same between a pair of sequences after an alignment of the sequences (which can be done using only sequence information or structural information or some other information, but usually it is based on sequence information alone), and similarity is the score assigned based on an alignment using some similarity matrix. The similarity index can be any one of the following BLOSUM62, PAM250, or GONNET, or any matrix used by one skilled in the art for the sequence alignment of proteins.

Identity is the degree of correspondence between two sub-sequences (no gaps between the sequences). An identity of 25% or higher implies similarity of function, while 18- 25% implies similarity of structure or function. Keep in mind that two completely unrelated or random sequences (that are greater than 100 residues) can have higher than 20% identity. Similarity is the degree of resemblance between two sequences when they are compared. This is dependent on their identity.

As is evident from the foregoing description, certain aspects of the present disclosure are not limited by the details of the examples illustrated herein, and it is therefore contemplated that other modifications and applications, or equivalents thereof, will occur to those skilled in the art. It is accordingly intended that the claims shall cover all such modifications and applications that do not depart from the spirit and scope of the present disclosure.

Moreover, unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the disclosure belongs. Although any methods and materials similar to or equivalent to or those described herein can be used in the practice or testing of the present disclosure, the preferred methods and materials are described above.

Although the foregoing invention has been described in some detail by way of illustration and example for purposes of understanding, it will be apparent to those skilled in the art that certain changes and modifications may be practiced. Therefore, the description and examples should not be construed as limiting the scope of the invention, which is delineated by the appended claims.

Accordingly, it is to be understood that the embodiments of the invention herein providing for the production of 8-methyl nonanoic acid is illustrative of the application of the principles of the invention. It will be evident from the foregoing description that changes in the form, methods of use, and applications of the elements of the disclosed production methods and selected microbial strains may be resorted to without departing from the spirit of the invention, or the scope of the appended claims. EXAMPLES

Example 1

Cloning and Characterization of Different Pathways

Previously, an E. coli fermentation platform was developed where various capsaicinoids could be produced upon the feeding of fatty acids and vanillylamine/vanillin (Chen et al. 2015). The production of nonivamide was also demonstrated by this system. In the current disclosure, this system was used to fully sequence and characterize three Capsicum acyl-ACP thioesterases and thereby elucidate their roles in 8M biosynthesis. To facilitate their expression in E. coli, the nucleotide sequences encoding the matured proteins without transient peptides were codon optimized against E. coli genome and synthesized by GenScript (Piscataway, NJ). The synthesized genes were cloned into pCDFDuet-1 vector (Spect+) and transformed into

BL21(DE3) for both in vitro and in vivo studies.

As shown in FIG. 2, FatA (TE1 in FIG. 2) is a long chain acyl-CoA thioesterase whereas FatB prefers medium chain acyl-CoAs (TE2 in FIG. 2). It seems that FatB rather than FatA is involved in 8M biosynthesis. This conclusion is supported by the analysis of fatty acid composition in cell cultures (FIG. 3). Peak 1 was identified as cis-5 dodecenoic acid (FIG. 4) by the comparison of this retention time and mass spectrum with that of cis-5 dodecenoic acid standard from Sigma. As shown in FIG. 3, cis-5-dodecenoic acid is produced by FatB and FatB2 confirming involvement in 8M production.

In plants as well as in bacteria, KAS III (β-ketoacyl-ACP synthase III, FabH) catalyzes the condensation of acetyl-CoA and malonyl-CoA, an initial step in fatty acid biosynthetic pathway whereas KAS I is involved in fatty acid elongation. In 8M biosynthetic pathway, a KAS III enzyme must be able to accept isobutyryl-CoA as a substrate (FIG. 1). However, such an enzyme has not been identified in the plant kingdom although the bacterial KASIII enzymes using isobutyryl-CoA as a substrate have been well characterized (Howard et al. 2013; Haushalter et al. 2014; Jiang et al. 2015; Tao et al. 2015; Bentley et al. 2016).

Using a bioinformatics approach, Mazourek et al. (2009) isolated KAS Ilia (EU616569) and KAS Illb (EU616570) genes from hot peppers. However, their enzyme activity and roles in 8M biosynthesis have not been reported. In the present disclosure, these two KAS III enzymes were characterized in order to understand their roles in 8M biosynthesis. To facilitate their expression in E. coli, the nucleotide sequences encoding the matured proteins without transient peptides were codon optimized against E. coli genome and synthesized by GenScript (SEQ ID. NOS 6 and 7, respectively). The synthesized genes were cloned into pRSFDuet-1 vector (Kan+) and transformed into BL21(DE3) containing pCDFDuet-FatB or pCDFDuet-FatB2 for in vivo studies.

However, when KAS IIIa/KAS Illb co-expressed with FatB/FatB2 in BL21(DE3) cells, upon the induction by 1 mM IPTG, no 8M was produced (data not shown), suggesting that the endogenous isobutyryl-CoA concentration might be too low to support 8M biosynthesis. The bkd operon was cloned from the genome of Bacillus subtilis 168 which encode an ketoisovalerate dehydrogenase (FIG. 1) into pETDuet-1 vector (AMP+) and bkd was co-expressed with KAS IIIa/KAS Illb and FatB/FatB2 in BL21(DE3). After IPTG-induction, upon the feeding of lg/L of a-ketoisovalerate (FIG. 1), 8M was produced (FIGS 5 and 6).

In another example, the CCL4 gene (GenBank: AGA17921.1) from Humulus lupulus was synthesized with codon optimization against E. coli genome. CCL4 encodes an isobutyryl-CoA synthetase which converts isobutyric acid into isobutyryl-CoA (Xu et al., 2013). CCL4 was cloned into pETDuet-1 vector (AMP+) and CCL4 was co-expressed with KAS IIIa/KAS Illb and FatB/FatB2 in BL21(DE3). After IPTG-induction, upon the feeding of lg/L of isobutyric acid, 8M was produced (FIG. 7). The SEQ IDS NOs 6 and 7 represent optimized KAS nucleic acid sequences from hot peppers. KAS Ilia (EU616569) and KAS Illb (EU616570) are genes from hot peppers.

In another example, a PCT gene (NCBI Reference Sequence: WP_014015705.1) from Megasphaera elsdenii was synthesized with codon optimization against E. coli genome. PCT encodes an acetate CoA-transferase which converts isobutyric acid and acetyl-CoA into isobutyryl-CoA and acetate (McMahon and Prather, 2014). PCT was cloned into pETDuet-1 vector (AMP+) and PCT was co-expressed with KAS IIIa/KAS Illb and FatB/FatB2 in

BL21(DE3). After IPTG-induction, upon the feeding of lg/L of isobutyric acid, 8M was produced. In another example, based on the work of Jiang et al. (2015), alsS and bkd genes from the genome of Bacillus subtilis 168 were cloned into pETDuet-1 vector together with KAS IIIa/KAS Illb pRSFDuet-1 vector respectively, and ilvC and ilvD genes from the genome of E. coli were cloned into the two multiple cloning sites of pACYCDuet-1 vector (CM+), respectively (FIG. 8). When alsS, ilvC, ilvD, bkd and KAS IIIa/KAS Illb genes were co-expressed with FatB/FatB2 in BL21(DE3), after IPTG-induction, 8M was produced from a de novo pathway.

Expression

As described above, to produce 8M, applicants first used three methods to make isobutyryl-CoA namely, de novo, ACS (AAE) or PCT pathways. Then isobutyryl-CoA was used by KAS Ilia or KAS Illb for entering fatty acid elongation cycles of microbial host e.g. E. coli. After three rounds of elongation, 8M was released by FatB or FatB2. pRSFDuet-1 was used to express KAS Ilia or KAS Illb; pCDFDuet-1 was used to express FatB or FatB2; pETDuet-1 was used to express ACS (AAE) or PCT; pACYCDuet was used to express ilvC and ilvD; pETDuet to express bkd; pRSFDuet-1 to express alsS and KAS Ilia. (See SEQ IDS Nos 1 through 9). The transformed culture in BL21star(DE3) was induced by IPTG. In the case of ACS or PCT pathway were exploited for 8M production, lg/L of isobutyric acid was fed into the culture. Fatty acids were extracted with ethyl acetate and methylated with 2.5% sulfuric acid in methanol. Methyl esters of fatty acids were extracted with hexanes and analyzed by GC/MS.

Product Analysis

GC/MS was used to analyze fatty acid composition after the methylation. GC/MS analysis was conducted on Shimadzu GC-2010 system, which with GCMS-QP2010S detector analytical column is: SHRXI-5MS (Thickness 0.25um; length30m; diameter0.25mm), injection temperature 265 0 C. Injection mode was split, furnace temperature 50 0 C, the temperature gradient: 0-4 min 50 0 C; 4-7.3 min 50 0 C -100 0 C, a gradient of 15; 7.3-15.3 min, 100 0 C -260 0 C, a gradient of 20. The identity and presence of the 8M compound in the fermentation product was confirmed by comparing its retention time and mass spectrum with those of an 8M standard purchased from Cayman Chemical (Ann Arbor, MI). They were identical.

Production of 8 -Methyl Nonanoic Acid

As shown in FIG. 5, the culture system had a titer of ca. 1 mg/L. To produce 8M, cultures from glycerol stock of the E. coli BL21(DE3) strain overexpressing ACS or PCT, KASIIIa or KASIIIb, and FatB or FatB2 were streaked on LB (AMP+Kan+Spect+) plates and incubated at 37° C overnight. Single colonies were picked to inoculate 5 ml of LB (AMP+Kan+Spect+) liquid medium and the cultures were shaken at 250 rpm at 37 0 C overnight. 20 ml of LB

(AMP+Kan+Spect+) liquid medium were inoculated with 2% of overnight culture and the culture were shaken at 250 rpm at 37 0 C until OD600 reached 0.6 (usually 3 hours). Then the cultures were cooled down to 30 0 C and 1 mM IPTG was added to induce protein expression. Three hours later, 1 g/L of isobutyric acid was added and samples were taken for 8M analysis.

To analyze 8M production, 0.5 ml culture was extracted with 0.5 ml ethyl acetate with shaking for 30 minutes. After centrifugation at 14,000 rpm for 10 min, 100 μΐ of ethyl acetate phase were transfer to a glass tube and dried over nitrogen gas. 1ml of 2.5% sulfuric acid in methonal and the tube was incubated in a 65°C water bath for 60 min. Then 2 ml of 0.9% NaCl and 2 ml of hexanes were added to extract methyl esters of fatty acids. After centrifugation at 2500 rpm for 10 min, the hexane phase was injected into GC/MS for the analysis of 8M concentrations as described above.

STATEMENT OF INDUSTRIAL APPLICABILITY / TECHNICAL FIELD

This disclosure has applicability in the food, medicinal, and pharmacological industries. This disclosure relates generally to a method for the biosynthetic production of 8-methyl nonanoic acid via a modified microbial strain.

SEQUENCES OF INTEREST

Nucleic Acid & Amino Acid Sequences of FAT2B

SEQ ID Atgtcgttgatgatcagggatttgagcagtttacattacacagataatttctggaaaaca gagaagcatgtgatggaa NO: 1 tgccggagtttgaaatttgattgtaacgcgaagaagaaatggagagcgataactgctagt gcagacagtagcggaa gcagaagcattgatacaattaatgggaagaagataaatggtgttcatgttgaggggcact cgcaatcaggacaaag

Fat2B - gggaaatgtagttgaatcaggatcatcatcatcaccgaaacattcatatatgttagggaa ttttgtggaggataaggtt Nucleic gtgtataggcagtcatttgtgattaggtcttatgaaattggacctgataaaactgctact atggaaactatcatgaatctc Acid cttcaggagacagctctaaatcatgtggcgaactcaggggttggtagtagtggattcggg gctacacgagagatga gccttaggaaactcatatgggtagtcactcgcatacatatacaaattgaacaatatagct cttggggagatgtggtag agatcgatacatgggtagatgcagcaggcaaaaatggaatgagaagggattggatcattc gcgactccaacactc gcaaaatcatcactagagcaacaagtaaatgggtgataatgaacatagaaacaagaaggt tatccaaaatcccaga gcaggtcaaagcagaagttcgacctttctacatcaacagattcgcgatccctactgcaca aattgactctgaaaagat tgagaaactcaacgatgaaactgcccaaatcatctcttccggcttagctccgcgatggag cgacatggatgctaatc aacatgtcaacaatgtcaaatatattggatggattttggagagtgtgcccataaatgttt tagaagactactacttaatg agcttgacattagagtatagacgtgaatgtcaattatcaaatgtgctgcaatccatgaca actatgcgagaaatagca acatcagcaagtgataaaaattgtggacttgaatgcacacatctgattcgaatggaggct gatcgagctgaggttgtt cgagcaaggtccctatggcagccaaaacagtga

-Nucleotide sequence of FatB2, NCBI Reference Sequence: XM_016708605.1 SEQ ID MSLMIRDLSSLHYTDNFWKTEKHVMECRSLKFDCNAKKKWRAITASAD NO: 2 S S GS RS IDTINGKKING VH VEGHS QS GQRGN V VES GS S S S PKHS YMLGNF

VEDKVVYRQSFVIRSYEIGPDKTATMETIMNLLQETALNHVANSGVGSS

Fat2B - GFGATREMS LRKLIW VVTRIHIQIEQYS S WGD VVEIDTWVD A AGKNGM

Amino RRDWIIRDSNTRKIITRATSKWVIMNIETRRLSKIPEQVKAEVRPFYINRF

Acid AIPTAQIDSEKIEKLNDETAQIISSGLAPRWSDMDANQHVNNVKYIGWIL

ES VPIN VLED Y YLMS LTLE YRRECQLS N VLQS MTTMREIATS AS DKNC G LECTHLIRMEADRAEVVRARSLWQPKQ

- NCBI Reference Sequence: XP_016564091.1

Nucleic Acid Sequences of 8M Synthesis Related Thioesterases

SEQ ID Gcgaccaacgaaagagaatctaaaagtaaacagcaggtcagtcacgaaccatcacttgct gatcgtttacgt NO: 3 ctggggaccatgagtgaagatggtatgtcatataaagagaaattcatcgttcgctgctat gaagttggcgttaataaa acgctgacggttgaaacaattgctaatctgctgcaggaagtaggttgtaaccatgcacag agcgtgggcttcagca

FatA - TP cagatggctttgcgactacccatagcatgcgtaaattacatttaatttgggttactgcgc gtatgtatatcgaaatttata aatatccggcttggtctgatgtcgttgagattgagacatggtgccagagtgaaggacgca tcggtactcgccgcga ttggatcctgaaagattgtgccacgggggaggtcatcggacgcgccacctcaaaatggct gatgatgaatcagga tacacgtcgtctgcagaaagttacggatgaggtaagagatgagttagagctgtatttccc gaaagaacttcgcctgg cttttcctgaagaaaacaatggctcactgaaaaaaattcctaaattagaagatccggctg aatatagtaaacttggcct ggtgcctagacgtgcagatctggatatgaatcagcatgtgaacaatgttacatatattgg ttgggtcttggaaagtatg ccgcaggagatcatcgatacgcatgagttagggaccattacgattgattatagaagagag tgccagcacgatgatg ttgtggattctttaacctcagtggaaccgatcgaggatactgatgctttgggtgcaaatg ggagcgcgacagcagca aaagatgttaataaatcagtgctgcattttctgcgtcttagtagtgatggcctggagatc aatagatgtcgcacagagt ggcgtaaaaaaccaagccggatttaa

SEQ ID GCGAAAGCCAACGCACAGGCTCCGCCGAAAGTGAATGGCACCAAAG NO: 4 TCGGTGTGATGGATGGCTTTAAAACGGATGACGAAGTTATTAGCTCT

CATCACCCGCGTACCTTCATCAACCAGCTGCCGGATTGGAGCATGCT

FatB - TP GCTGGCGGCCATTACCACGATCTTTCTGGCAGCTGAAAAACAATGGA

TGATGCTGGATTGGAAACCGAAACGTCCGGACATGCTGGTCGATCCG

TTTGGCCTGGGTAAAATTGTGGAAGATGGTTTTATCTTCCGTCAGAA

CTTCTCAATTCGCTCGTATGAAATCGGCGCAGATCGCACCGCTAGTA

TTGAAACGATGATGAACCATCTGCAAGAAACCGCGCTGAATCACGTG

AAATCCGCCGGTCTGATGCATGGCGGTTTCGGCAGTACCCCGGAAAT

GTCCAAACGTAATCTGATCTGGGTGGTTACGAAAATGCAGGTCGTGC

TGGATCGCTACCCGACCTGGGGTGACGTTGTCCAAGTGGATACGTGG

GTTGCGGCCAGCGGCAAAAACGGTATGCGTCGCGACTGGCTGATTCG

TGATTCTACCACGGGCGACGTTCTGATGCGCGCAAGTTCCCAGTGGG

TCATGATGAATAAAGAAACCCGTCGCCTGAGCAAAATTCCGGATGA

AGCGCGTGCCGAAATCGAAGGTTATTTTGTCGACTCACCGCCGGTGA

TCGATGACGATTCGCGCAAACTGCCGAAACTGGATGAAACCACGGC

AGACTATACCCGCACCGGTCTGACCCCGCGCTGGTCAGACCTGGATG

TGAACCAGCACGTTAACAATGTCAAATACATTGGTTGGATTCTGGAA

TCGGCTCCGATGCAAATTCTGGAAGGCTGCGAACTGGCAGCTATGAC

GCTGGAATACCGTCGCGAATGTCGTCGCGATAGCGTTCTGCAGAGCC

TGACCTCTGTTCTGGATAAAGAAGTCGGCGGTGACCTGACGAACTTT

GGTCATGTGGAATGCCAACACGTTCTGCGCCTGGAAAATGGCGGTGA

AGTGGTTAAAGGCCGTACCGAATGGCGCCCGAAACTGATTAATGGC

ATCGGTTCTCTGGGCGGTTTCCCGGCCTAA SEQ ID GGTGTCCATGTGGAAGGCCACAGTCAGTCCGGCCAACGCGGTAACGT NO: 5 GGTTGAATCAGGTAGTTCCTCATCGCCGAAACATTCGTATATGCTGG

GCAATTTTGTGGAAGATAAAGTCGTGTATCGCCAGAGCTTCGTTATC

FatB2 -TP CGTTCTTACGAAATTGGTCCGGACAAAACCGCGACGATGGAAACCAT

TATGAACCTGCTGCAAGAAACGGCGCTGAACCATGTTGCCAATAGCG

GCGTCGGTAGCTCTGGCTTCGGTGCGACCCGTGAAATGTCACTGCGC

AAACTGATCTGGGTTGTCACGCGTATTCACATCCAGATTGAACAATA

TAGTTCCTGGGGCGATGTGGTTGAAATTGATACCTGGGTGGACGCGG

CCGGCAAAAACGGTATGCGTCGCGATTGGATTATCCGTGACAGCAAT

ACCCGCAAAATTATCACCCGTGCTACGTCTAAATGGGTTATCATGAA

TATTGAAACCCGTCGCCTGAGCAAAATCCCGGAACAGGTGAAAGCC

GAAGTTCGCCCGTTTTACATCAACCGTTTCGCAATTCCGACCGCTCAG

ATCGATAGTGAAAAAATCGAAAAACTGAACGACGAAACGGCGCAAA

TTATCTCATCGGGTCTGGCGCCGCGTTGGTCTGATATGGACGCAAAC

CAGCATGTTAACAACGTCAAATACATCGGCTGGATTCTGGAAAGTGT

CCCGATTAATGTGCTGGAAGATTATTACCTGATGTCCCTGACCCTGG

AATACCGTCGCGAATGTCAGCTGAGTAACGTGCTGCAATCCATGACC

ACGATGCGCGAAATCGCGACCTCAGCCTCGGATAAAAATTGCGGCCT

GGAATGTACGCACCTGATTCGTATGGAAGCGGACCGTGCGGAAGTG

GTGCGTGCTCGTTCTCTGTGGCAGCCGAAACAATGA

SEQ ID agtagtaccgttgagggtgcggataaattgagcacggatcagtcacgcgtaagtcgtttg gttagtcgcggatgtaa NO: 6 actgattggctgtggcagtgcggtgccggcattgaaaatctccaacgatgatctggctaa aattgtagatactaatga tgaatggatcagcgttcgtaccgggatccgtaaccgccgcgtgctgtcaggtaaagataa tctgacggatttagctg

KAS Ilia - cggaagccgcacgcaaagcactggaaatggccgaggtcgatcctaacgatattgatttga ttctgctttgttcatcca TP cccctgaggatctgtttggcagcgcacctcagattcagaaagctttgggttgcaaaagta atccactggcgtttgata optimized ttacagccgcgtgtagcggatttatgctgggtttggtaagcgccgcctgttatattcgcg gaggagggttcaaaaac gtattagtggttggcgcagatgcgctgtcacggtatgtagattggaccgatcgcgggacg tgtattctttttggtgatg cggcaggggcggttgttatgcaggcctgtgatattggagaggatggtcttttcgggtttg atctgcattctgatggcg agggtcagcgccatctgaacgcctcctttaaagagaacgagtcagatcgggcctgcggta ccaacggtagtgttat tggattccctccgaaaacgagctcatattcctgtattcagatgaacggtaaagaagtgtt cagatttgctgtgcgggt ggtgcctcagagcatcgaggcagcactggagaatgctggcctgcctcagtctaaaattga ttggcttctgttacacc aggcgaatcagcgcatcattgatgcagtcgccacgcgtctggaggtgccgtccgagcatg tgatttctaatttatca aattatggaaatacatcagcagcgagtattcctctggctctggatgaagcagttcgtagc ggtaaagttcaggctgg gcatgtcattgcagccgctggttttggtgcaggactgacctggggaagcgctattttgag atggggctaa

SEQ ID GCCAAAGGCGCAGTGGAACTGAGCCGTCTGGTTAATACCGGTTGCAA NO: 7 ACTGGTGGGCTGTGGTTCAGCTGTGCCGTCGCTGCGCGTTAGCAACA

ATGATCTGGCAAAAATCGTCGATACCAATGACGAATGGATTTCTGTG

KAS Illb - CGTACGGGCATCCGCCATCGTCGCGTTCTGAGTGGTAAAGAAAACCT TP GACGGATCTGGCCATCGAAGCGGCCTGGAAAGCTCTGGAAATGGCG

optimized GACGTCCCGCCGGAAGATGTGGACCTGATTCTGATGTGCAGCTCTAC

CGGCGATGACCTGTTTGGTTCTGCCCCGGTGATCCAGAAAGCACTGG

GCTGTAAACGTAATCCGCTGGCGTTCGATATTACGGCAGCTTGCAGT

GGCTTTCTGCTGGGTCTGTTCTCAGCTTCGTGTTATATTAAAGCGGGC

GGTTTTAAAAACGTCCTGGTGATTGGCGCCGATGCAGTTTCCCGTTTT

ATCGATTGGACCGACCGCGGTTCATGCATTCTGTTCGGCGATGCGGC

CGGTGCTGTTCTGGTCCAGGCGTGTGATATCGGTGAAGACGGCCTGT

TTGGTTTCGATCTGCATTCGGATGGCGACGGTAAACGCCACCTGATT

AGCACCTTTAAAGAAAATGAAACGGATGACGCCTCGAACGAAAATC

ACAGCGTCACCTCTTTCCCGCCGAAATGCAGTTCCTATTCATACCTGC

AGATGAACGGCAAAGAAATCTTTAAATTCGCGGTTCGTGTGGTTCCG

CAATCCATTGAAGCAGCTCTGGAAAAAGCCGGCCTGGATGGTTCAAA

CAATTTTGACTGGCTGCTGCTGCATCAGGCCAACCAACGCATTATCG

ATGGCATCGCAACCCGTCTGGAAGTTCCGTCTGAACGCGTCATTAGT

AACCTGGCAAATTACGGTAACACGAGTGCCGCATCCATTCCGCTGGC

ACTGGATGAAGCAGTGCAGGACGGCAAAGTTCAACCGGGTCACGTC

ATTGCAGCGGCCGGTTTTGGTGCCGGTCTGACCTGGGCTAGCGCGAT

TTTCCGTTGGGGTTAA

SEQ ID ATGGAAGACCTGAAACCGCGTCCGGCATCGTCCTCCCCGCTGACCCC NO: 8 GCTGGGCTTTCTGGAACGTGCTGCGACCGTGTATGGTGACTGCACCT

CTGTGGTTTATGATGCGGTTAGCTACACCTGGTCTCAGACGCATCGTC CCL4 - GCTGCCTGTGTCTGGCGAGCTCTATTGCCTCACTGGGCATCGAAAAC optimized GGTCATGTCGTGTCGGTCCTGGCCCCGAATGTGCCGCAAATGTATGA

ACTGCACTTTGCAGTCCCGATGGCAGGTGCTATTCTGAACGCAGTGA ATCTGCGTCTGGATGCTCGCACCATTTCAATCCTGCTGCATCACAGTG AATCCAAACTGATCTTCGTGGATCACCTGTCGCGTGACCTGATTCTG GAAGCGATCGCCCTGTTTCCGAAACAGGCACCGGTGCCGCGCCTGGT TTTCATGGCTGATGAATCAGAATCGGGCAACAGTTCCGAACTGGGTA AAGAATTTTTCTGCTCTTACAAAGATCTGATTGACCGTGGTGATCCG GACTTTAAATGGGTGATGCCGAAAAGCGAATGGGACCCGATGATCCT GAATTACACCTCTGGCACCACGTCATCGCCGAAAGGTGTTGTCCATT GTCACCGCGGCATTTTCATCATGACCGTTGATAGTCTGATTGACTGG GGCGTTCCGAAACAGCCGGTCTATCTGTGGACGCTGCCGATGTTTCA TGCGAATGGTTGGAGCTATCCGTGGGGTATGGCCGCGGTGGGCGGTA CCAATATTTGCCTGCGTAAATTCGATTCTGAAATCATCTACGACATG ATCAAACGCCATGGCGTTACGCACATGTGTGGTGCGCCGGTGGTTCT GAACATGCTGTCAAATGCCCCGGGTTCGGAACCGCTGAAAACCACG GTGCAAATTATGACCGCAGGTGCACCGCCGCCGAGCGCAGTTCTGTT TCGTACGGAAAGCCTGGGTTTCGCTGTCTCTCATGGCTACGGTCTGA CCGAAACGGCGGGCCTGGTCGTGAGTTGTGCCTGGAAGAAAGAATG GAACCACCTGCCGGCAACCGAACGTGCTCGCCTGAAATCCCGCCAGG GCGTTGGTACCGTCATGCAAACGAAAATTGATGTTGTCGACCCGGTG ACCGGTGCAGCTGTTAAACGTGACGGCAGTACGCTGGGTGAAGTGGT TCTGCGCGGCGGTTCCGTGATGCTGGGTTATCTGAAAGATCCGGAAG GCACCGCGAAATCCATGACGGCCGATGGTTGGTTTTATACCGGCGAC GTGGGTGTTATGCATCCGGATGGCTACCTGGAAATCAAAGATCGTAG TAAAGACGTTATCATCTCCGGCGGTGAAAATCTGAGCTCTGTCGAAG TGGAAAGTATTCTGTACTCCCATCCGGACATTCTGGAAGCCGCGGTT GTGGCCCGTCCGGATGAATTTTGGGGTGAAACCCCGTGCGCCTTCGT GTCACTGAAGAAAGGCCTGACCAAAAAACCGACGGAAAAAGAAATT GTGGAATATTGTCGTTCGAAACTGCCGCGCTACATGGTTCCGAAAAC CGTTGTCTTTAAAGAAGAACTGCCGAAAACCAGCACGGGCAAAGTG CAGAAATTCATCCTGCGTGATATGGCTCGTGGTATGGGCTCGGCGAC CGCAGGTGCTTCCCGCTCCCGTATGTAA

SEQ ID ATGCGTAAAGTGGAAATTATCACCGCGGAACAGGCGGCCCAACTGG NO: 9 TTAAAGATAACGACACCATTACGAGCATCGGTTTTGTCAGCTCTGCA

CATCCGGAAGCTCTGACGAAAGCGCTGGAAAAACGTTTCCTGGATAC PCT - CAACACGCCGCAGAATCTGACCTATATTTACGCCGGTTCTCAAGGCA optimized AACGTGATGGCCGCGCAGCTGAACATCTGGCTCACACGGGTCTGCTG

AAACGCGCGATTATCGGCCACTGGCAGACCGTGCCGGCTATTGGTAA ACTGGCGGTTGAAAACAAAATCGAAGCATACAACTTCAGTCAGGGC ACCCTGGTTCATTGGTTCCGTGCACTGGCTGGTCACAAACTGGGCGT GTTTACGGATATTGGTCTGGAAACCTTCCTGGACCCGCGCCAGCTGG GCGGTAAACTGAACGATGTTACGAAAGAAGACCTGGTTAAACTGAT CGAAGTCGATGGCCATGAACAGCTGTTTTATCCGACCTTCCCGGTTA ACGTCGCTTTTCTGCGCGGCACGTACGCGGATGAAAGTGGCAATATT ACCATGGACGAAGAAATCGGTCCGTTCGAATCAACCTCGGTCGCGCA GGCCGTGCATAATTGCGGCGGTAAAGTGGTTGTCCAAGTGAAAGATG TGGTTGCCCACGGTTCCCTGGACCCGCGTATGGTCAAAATTCCGGGC ATCTATGTGGATTACGTCGTGGTTGCGGCCCCGGAAGACCATCAGCA AACGTATGATTGTGAATACGATCCGAGCCTGAGCGGTGAACATCGTG CACCGGAAGGTGCAACCGATGCAGCTCTGCCGATGTCAGCGAAGAA AATTATTGGTCGTCGCGGCGCCCTGGAACTGACCGAAAACGCAGTCG TGAATCTGGGTGTGGGCGCACCGGAATATGTGGCGTCGGTTGCCGGT GAAGAAGGCATTGCGGATACCATCACGCTGACCGTCGAAGGCGGTG CCATTGGCGGTGTGCCGCAGGGCGGTGCACGTTTTGGTAGTTCCCGC AACGCAGATGCTATTATCGACCATACCTATCAGTTTGATTTCTACGAC GGCGGTGGCCTGGATATTGCCTATCTGGGCCTGGCACAATGCGACGG TAGTGGCAACATCAATGTTTCCAAATTTGGTACGAATGTCGCCGGCT GCGGTGGCTTCCCGAACATTTCTCAGCAAACCCCGAATGTTTACTTTT GTGGCACGTTCACCGCGGGCGGCCTGAAAATTGCGGTGGAAGATGG TAAAGTTAAAATCCTGCAGGAAGGCAAAGCCAAAAAATTCATCAAA GCAGTGGATCAAATCACCTTCAACGGTAGCTATGCGGCCCGTAATGG CAAACATGTTCTGTACATTACGGAACGCTGTGTCTTTGAACTGACCA AAGAAGGTCTGAAACTGATTGAAGTGGCTCCGGGCATTGATATCGAA AAAGACATCCTGGCCCACATGGATTTTAAACCGATTATCGACAATCC GAAACTGATGGATGCGCGTCTGTTCCAGGACGGTCCGATGGGCCTGA AAAAATAA

Literature Cited and Incorporated by Reference:

1. Aluru M. et al., Differential expression of fatty acid synthase genes, Acl, Fat and Kas, in Capsicum fruit. J EXP BOT.; 54(388): 1655-64 (2003).

2. Aza-Gonzalez C. et al., (201 1), Molecular biology of capsaicinoid biosynthesis in chili pepper (Capsicum spp.). PLANT CELL REP. 30: 695-706.

3. Belza A, Jessen AB. (2005), Bioactive food stimulants of sympathetic activity: effect on 24-h energy expenditure and fat oxidation. EUR J CLIN NUTR., 59:733-41.

4. Bentley G.J. et al., Engineering Escherichia coli to produce branched-chain fatty acids in high percentages, METAB ENG. 38: 148-58 (2016). 5. Bosland and Baral, (2007), 'Bhut Jolokia '— The world's hottest known chile pepper is a putative naturally occurring interspecific hybrid, HORTSCIENCE 42:222- 24.

6. Batchelor and Jones, (2000), Determination of the Scoville Heat Value for Hot Sauces and Chilies: An HPLC Experiment, J. CHEM. EDUC, 77 (2) p 266.

7. Catchpole et al., (2003), Extraction of chili, black pepper, and ginger with near critical

C02, propane, and dimethyl ether: analysis of the extracts by quantitative nuclear magnetic resonance, J. AGRICULTURAL AND FOOD CHEMISTRY 51: 4853 - 60.

8. Caterina MJ., et al., (1997), The capsaicin receptor: a heat-activated ion channel in the pain pathway, NATURE 389, 816-24.

9. Constant HL et al., (1996), Nonivamide, a constituent of Capsicum oleoresin, J. NATURAL

PRODUCTS, 59: 425-26.

10. Crombie et al., (1955). Amides of vegetable origin. Part VI. Synthesis of capsaicin, J. CHEM SOC, 1025-27.

11. Curry, J. et al., (1999), Transcripts for possible capsaicinoid biosynthetic genes are different accumulated in pungent and non-pungent Capsicum spp., PLANT SCI. 148: 47-57.

12. Du J et al., (2011), Engineering microbial factories for synthesis of value-added products, J IND MICROBIOL BIOTECHNOL. 38: 873-90.

13. Flagan S .F., Utilization of capsaicin and vanillylamine as growth substrates by Capsicum (hot pepper)-associated bacteria, ENV. MICROBIOLOGY 8(3) 560-65 (2006).

14. Garces-Claver A. et al., (2007). Identification, validation and survey of a single nucleotide polymorphism (SNP) associated with pungency in Capsicum spp. THEOR APPL GENET 115: 907-16.

15. Govindarajan and Sathyanarayana (1991), Capsicum-production, technology, chemistry, and quality. Part V. Impact on physiology, pharmacology, nutrition, and metabolism;

structure, pungency, pain, and de sensitization sequences, CRITICAL REVIEWS IN FOOD

SCIENCE AND NUTRITION 29(6):435-74.

16. Haushalter R.W., et al., Production of anteiso -branched fatty acids in Escherichia coli; next generation biofuels with improved cold-flow properties, METAB . ENG. 26: 111-18 (2014).

17. Hausler A, and Miinch T., (1997), Microbial production of natural flavors, ASM NEWS 63:551-59.

18. Higashiguchi F., et al., (2006) Purification and structure determination of glucosides of capsaicin and dihydrocapsaicin from various Capsicum fruits, J AGRIC FOOD CHEM. 54: 5948-5953. 19. Howard T.P., et al., Synthesis of customized petroleum-replica fuel molecules by targeted modification of free fatty acid pools in Escherichia coli. PROC NATL ACAD SCI USA

110:7636-41 (2013).

20. Jiang W., et al., Enhanced production of ' branched- chain fatty acids by replacing β- ketoacyl-(acyl-carrier-protein) synthase III (FabH), BIOTECHNOL BIOENG. 112: 1613-22

(2015).

21. Jordt SE. and Julius D., (2002), Molecular basis for species-specific sensitivity to "hot" chili peppers. CELL. 108: 421-30.

22. Kaminaga Y. et al., (2004) Molecular cloning and characterization of a

glucosyltransferase catalyzing glucosylation of curcumin in cultured Catharanthus roseus cells. FEBS LETT. 567: 197-202.

23. Kometani, T. et al., (1993) Glucosylation of capsaicin by cell suspension cultures of Coffea arabica. BlOSCl. BIOTECHNOL. BlOCHEM. 57, 2192-2193.

24. Liang Y.T., et al., Capsaicinoids lower plasma cholesterol and improve endothelial function in hamsters, EUR J NUTR. 52: 379-88 (2013).

25. Lindsey K. and Yeoman M., The synthetic potential of immobilised cells of Capsicum frutescens mill cv. annuum. PLANTA 162 495-501 (1984).

26. McMahon M.D., and Prather K.L., Functional screening and in vitro analysis reveal thioesterases with enhanced substrate specificity profiles that improve short-chain fatty acid production in Escherichia coli. APPL ENVIRON MICROBIOL. 80: 1042-50 (2014).

27. Mazourek, et al., (2009). A Dynamic Interface for Capsaicinoid Systems Biology, PLANT PHYS., 150: 1806-21.

28. Ogawa K., et al., Evidence of capsaicin synthase activity of the Punl-encoded protein and its role as a determinant of capsaicinoid accumulation in pepper, BMC PLANT BlOL. 15: 93 (2015).

29. Prasad BC, (2006), Characterization of capsaicin synthase and identification of its gene (csyl)for pungency factor capsaicin in pepper (Capsicum sp.). PROC NATL Ac AD SCI U S A. 103: 13315-20.

30. Prasad BC. et al., (2008.), Characterization of capsaicin synthase and identification of its gene (csyl) for pungency factor capsaicin in pepper ( Capsicum sp.). PROC NATL ACAD SCI U

S A. 105: 20558.

31. Qin C, et al., Whole-genome sequencing of cultivated and wild peppers provides insights into Capsicum domestication and specialization, PROC. NAT'L ACAD. Sci. USA. I l l : 5135-40 (2014). 32. Reilly CA. et al., (2001), Determination of capsaicin, dihydrocapsaicin, and nonix -amide in self -defense weapons by liquid chromatography-mass spectrometry and liquid

chromatography-tandem mass spectrometry. J. CHROMATOGR A. 912: 259-67.

33. Stew an C. et al.. (2005) The Punl gene for pungency in pepper encodes a putative acyltransferase, PLANT J. 42: 675-88.

34. Stewart C. et al., (2007) Genetic control of pungency in C. chinense via the Punl locus. J EXP BOT. 58: 979-91.

35. Shimoda K., et al., (2007) Glycosylation of capsaicin and 8-nordihydrocapsaicin by cultured cells of Catharanthus roseus. PHYTOCHEMISTRY 68: 1391-96.

36. Shockey JM. Et a., (2003), Arabidopsis contains a large superfamily of acyl-activating enzymes: phylogenetic and biochemical analysis reveals a new class of acyl-coenzyme A synthetases. PLANT PHYSIOL 132 1065-76.

37. Stewart C, et al., The Punl gene for pungency in pepper encodes a putative

acyltransferase, PLANT J. 42: 675-88 (2005).

38. Suzuki T. et al., (1980) Intracellular localization of capsaicin and its analogs

capsaicinoid in Capsicum fruit, microscopic investigation of the structure of the placenta of Capsicum annuum var annuum cv. Karayatsubusa. PLANT CELL PHYSIOL 21 839-53.

39. Suzuki T and Iwai K, (1984), Constituents of red pepper spices: chemistry, biochemistry, pharmacology and food science of the pungent principle of Capsicum species. In A Brossi, ed, THE ALKALOIDS : CHEMISTRY AND PHARMACOLOGY, Vol 23. Academic Press,

Orlando, FL, pp 227-99.

40. Tao H., et al., Metabolic engineering of microbes for branched- chain biodiesel production with low-temperature property, BIOTECHNOL BIOFUELS. 8:92 (2015).

41. Thomas BV. Et al., (1998), Simple method for quantitation of capsaicinoids in peppers using capillary gas chromatography. J AGRIC FOOD CHEM., 46:2655-63.

42. Tominaga M. and Tominaga T., (2005), Structure and function of TRPVl, PFLUGERS ARCH. EUR J PHYSIOL.; 451: 143-50.

43. Xu H., et al., Characterization of the formation of branched short-chain fatty acid: CoAs for bitter acid biosynthesis in hop glandular trichome, MOL., PLANT. 6: 1301-17 (2013). 44. Walsh and Hoot, (2001), Phylogenetic relationships of Capsicum (Solanaceae) using DNA sequences from two noncoding regions: The chloroplast atpB-rbcL spacer region and nuclear waxy introns, INT. J. PLANT Sci. 162: 1409-18.

45. Weber N. et al., (2014), Biocatalytic potential of vanillin aminotransferase from Capsicum chinense. BMC BIOTECHNOL. 14:25. 46. Yao et al., (1994), Supercritical carbon dioxide extraction of Scotch Bonnet (Capsicum annuum) and quantification of capsaicin and dihydrocapsaicin, J. AGR. FOOD CHEM.

42: 1303-05.

Patents Cited and Incorporated by Reference:

1. Chen H. et al., (2015) Methods of using capsaicin synthase for the microbial production of capsaicinoids. PCT/US2015/011729.

2. Chen et al., U.S. Pat. No. 5,094,782.

3. LaHann et al., U.S. Pat. No. 4,493,848.

4. Zhou R., and Yu X., (2014) Methods of making vanillin via the microbial fermentation of ferulic acid from eugenol using a plant dehydrogenase. PCT/US2014/063952.