Title:
METHODS OF PRODUCING ACETOIN AND 2,3-BUTANEDIOL USING PHOTOSYNTHETIC MICROORGANISMS
Document Type and Number:
WIPO Patent Application WO/2014/052920
Kind Code:
A2
Abstract:
The present disclosure provides recombinant cyanobacteria having acetolactate synthase activity and acetolactate decarboxylase activity, as well as recombinant cyanobacteria having acetolactate synthase activity, acetolactate decarboxylase activity, and secondary alcohol dehydrogenase activity. Methods for the production of the recombinant cyanobacteria, as well as methods for use thereof for production of acetoin and 2,3-butanediol from carbon dioxide and light are also provided.

Inventors:
ATSUMI SHOTA (US)
OLIVER JOHN W K (US)
MACHADO IARA M P (US)
Application Number:
PCT/US2013/062459
Publication Date:
April 03, 2014
Filing Date:
September 27, 2013
Assignee:
UNIV CALIFORNIA (US)
International Classes:
C12N1/21
Foreign References:
US20110212498A12011-09-01
US20120184002A12012-07-19
US20100112655A12010-05-06
US6699696B22004-03-02
US20090239275A12009-09-24
Other References:
YAN ET AL.: 'Enantioselective synthesis of pure (R,R)-2,3-butanediol in Escherichia coli with stereospecific secondary alcohol dehydrogenases' ORG. BIOMOL. CHEM vol. 7, 2009, pages 3914 - 3917
MIRHENDI ET AL.: 'Molecular screening for Candida orthopsilosis and Candida metapsilosis among Danish Candida parapsilosis group blood culture isolates: proposal of a new RFLP profile for differentiation.' J MED MICROBIOL. vol. 59, no. 4, 2010, pages 414 - 20
Attorney, Agent or Firm:
LEKUTIS, Christine et al. (425 Market Street, San Francisco, CA, US)
Claims:
CLAIMS

We claim:

1. A cyanobacterium comprising a recombinant polynucleotide encoding an acetolactate synthase (ALS) and an acetolactate decarboxylase (ALDC), wherein expression of the ALS and the ALDC results in an increase in production of acetoin as compared to a corresponding cyanobacterium lacking the polynucleotide.

2. A cyanobacterium comprising a recombinant polynucleotide encoding an acetolactate synthase (ALS), an acetolactate decarboxylase (ALDC) and a secondary alcohol dehydrogenase (sADH), wherein expression of the ALS, the ALDC and the sADH results in an increase in production of one or both of acetoin and 2,3-butanediol (23BD) as compared to a corresponding cyanobacterium lacking the polynucleotide.

3. The cyanobacterium of claim 1, wherein the ALS is a bacterial ALS and the ALDC is a bacterial ALDC or a fungal ALDC; or the cyanobacterium of claim 2 wherein the ALS is a bacterial ALS, the ALDC is a bacterial ALDC or a fungal ALDC, and the sADH is a bacterial sADH or a fungal sADH.

4. The cyanobacterium of claim 3, wherein the ALS is a Bacillus sp. ALS.

5. The cyanobacterium of claim 3, wherein the ALDC is selected from the group consisting of an Enterobacter sp. ALDC, a Bacillus sp. ALDC, an Aeromonas sp. ALDC, and a Gluconacetobacter sp. ALDC.

6. The cyanobacterium of claim 3, wherein the ALDC is selected from the group consisting of an Enterobacter aerogenes ALDC, an Enterobacter cloacae ALDC, a Bacillus licheniformis ALDC, a Bacillus subtilis ALDC, an Aeromonas hydrophila ALDC, and a Gluconacetobacter xylinus ALDC.

7. The cyanobacterium of claim 3, wherein the sADH is selected from the group consisting of a Candida sp. sADH, a Leuconostoc sp. sADH, a Clostridium sp. sADH, and a Thermoanaerobacter sp. sADH.

8. The cyanobacterium of claim 3, wherein the sADH is selected from the group consisting of a Candida parapsilosis sADH, a Leuconostoc pseudomesenteroides sADH, a Clostridium beijerinckii sADH, and a Thermoanaerobacter brockii sADH.

9. The cyanobacterium of claim 3, wherein the expression of the ALS and the ALDC, or the expression of the ALS, the ALDC and the sADH is driven by an inducible promoter.

10. The cyanobacterium of claim 3, wherein the polynucleotide is stably integrated into the genome of the cyanobacterium.

11. The cyanobacterium of claim 3, wherein the cyanobacterium is selected from the group consisting of Nostoc sp., Synechococcus sp., Synechocystis sp., and Thermosynechococcus sp.

12. The cyanobacterium of claim 11, wherein the cyanobacterium is a Synechococcus sp.

13. The cyanobacterium of claim 12, wherein the cyanobacterium is a Synechococcus elongatus cell.

14. The cyanobacterium of claim 3, wherein the production of one or both of acetoin and 2,3-butanediol (23BD) occurs as a result of culturing the cyanobacterium under constant light.

15. The cyanobacterium of claim 3, wherein the production of one or both of acetoin and 2,3-butanediol (23BD) occurs as a result of culturing the cyanobacterium in the presence of bicarbonate.

16. The cyanobacterium of claim 3, wherein the ALDC is essentially insensitive to oxygen.

17. The cyanobacterium of claim 3, wherein the sADH is essentially insensitive to oxygen and is NADPH-dependent.

18. A method of producing acetoin, the method comprising:

a) providing the cyanobacterium of claim 1,

b) culturing the cyanobacterium in a photosynthetic environment comprising CO2 and light whereby the expression of the ALS and the ALDC results in production of acetoin.

19. The method of claim 18, wherein the production of acetoin occurs at a higher level than that produced by culturing the corresponding cyanobacterium lacking the polynucleotide under the same conditions.

20. A method of producing 2,3-butanediol, the method comprising:

a) providing a cyanobacterium of claim 2;

b) culturing the cyanobacterium in a photosynthetic environment comprising CO2 and light whereby expression of the ALS, the ALDC and the sADH results in production of 2,3-butanediol.

21. The method of claim 20, wherein the production of 2,3-butanediol occurs at a higher level than that produced by culturing the corresponding cyanobacterium lacking the polynucleotide under the same conditions.

Description:
METHODS OF PRODUCING ACETOIN AND 2,3-BUTANEDIOL USING PHOTOSYNTHETIC MICROORGANISMS

CROSS-REFERENCE TO RELATED APPLICATION(S)

[0001] The present application claims priority from U.S. Provisional Application No. 61/707,866, filed September 28, 2012, which is incorporated herein by reference in its entirety for all purposes.

FIELD

[0002] The present disclosure provides recombinant cyanobacteria having acetolactate synthase activity and acetolactate decarboxylase activity, as well as recombinant cyanobacteria having acetolactate synthase activity, acetolactate decarboxylase activity, and secondary alcohol dehydrogenase activity. Methods for the production of the recombinant cyanobacteria, as well as for use thereof for production of acetoin and 2,3-butanediol from carbon dioxide and light are also provided.

BACKGROUND

[0003] Amid rising global energy demands, interest is growing in the production of fuels and chemicals from renewable resources. Petroleum consumption reached 37.1 quadrillion BTU in the United States in 2008, of which a large majority (71%) was liquid fuel in the transportation sector. Petroleum and natural gas account for 99% of the feedstocks for chemicals such as plastics, fertilizers and pharmaceuticals in the chemical industry (McFarlane et al., "Survey of Alternative Feedstocks for Commodity Chemical Manufacturing," Oak Ridge National Laboratory, 2007). Considering the rapidly increasing world population and the exhaustion of fossil fuels, the development of sustainable processes for energy and carbon capture (ECC) to produce fuels and chemicals is crucial for human society.

[0004] In addition to increasing energy demands, renewable energy resources are of interest to address growing environmental issues. According to the United States Energy Information Administration (Serferlein, "Annual Energy Review," USEIA, 2008), world energy-related CO2 emissions in 2006 were 29 billion metric tons, which is an increase of 35% from 1990. Accelerating accumulation of atmospheric CO2 is due not only to increased emissions from world growth and intensifying carbon use, but also to a possible attenuation in the efficiency of the world's natural carbon sinks (Raupach et al., Proc Natl Acad Sci USA, 104:10288-10293, 2007). As a result, atmospheric levels of CO2 have increased by ~25% over the past 150 years. Thus, it has become increasingly important to develop new technologies to reduce CO2 emissions.

[0005] Previous methods of producing renewable energies have involved converting terrestrial plant biomass into biochemicals. However, these methods present undesirable complications, such as harsh chemical pretreatments of the biomass resulting in toxic byproducts and large land-use requirements to grow the plants. Photosynthetic microorganisms possess many advantages over traditional terrestrial plants with regard to biochemical production. For example, the photosynthetic efficiency of photosynthetic microorganisms is higher than that of plants, and photosynthetic microorganisms can be cultivated in locations that do not compete with traditional agricultural crops (Scharlemann et al., Science, 281:237-240, 2008).

[0006] Cyanobacteria are an example of photosynthetic microorganisms with potential for biochemical production. Cyanobacteria are collectively responsible for almost 50% of global photosynthesis and are found in a wide range of environments (Field et al., Science, 281:237-240, 1998). While cyanobacteria share many features with algae in this context, many cyanobacterial species feature simpler genetic structures and faster growth rates (Ruffing, Bioeng Bugs, 2:136-149, 2011). As a result, genetic engineering methods for cyanobacteria are also more advanced than those for algae (Golden et al., Methods Enzymol, 153:215-231, 1987; Huang et al., Nucleic Acids Res, 38:2577-2593, 2010; and Heidorn et al., Methods Enzymol, 497:539-579, 2011).

[0007] Cyanobacteria have the biochemical machinery required to fix CO2, but lack the critical components to generate fuels and chemicals efficiently. Thus, to produce valuable chemicals, cyanobacterial host strains must be equipped with new biosynthetic pathways (Keasling, ACS Chem Biol, 3:64-76, 2008; Ducat et al., Trends Biotechnol, 29:95-103, 2011; and Machado and Atsumi, J Biotechnol, 2012). Unfortunately, this approach is significantly less developed in cyanobacteria than in a model organism such as Escherichia coli. Further, results in E. coli cannot be directly translated into cyanobacteria. For example, an engineered E. coli strain containing the 1-butanol pathway produced more than 30 g/L 1-butanol (Shen et al., Appl Environ Microbiol, 77:2905-2915, 2011), while a cyanobacterial strain with the same pathway produced only trace amounts of 1-butanol (Lan et al., Metab Eng, 13:353-363, 2011). Thus, there exists a need for construction of a biosynthetic pathway in cyanobacteria leading to significant production of a commodity chemical from CO2.

BRIEF SUMMARY

[0008] The present disclosure provides recombinant cyanobacteria having acetolactate synthase activity and acetolactate decarboxylase activity, as well as recombinant cyanobacteria having acetolactate synthase activity, acetolactate decarboxylase activity, and secondary alcohol dehydrogenase activity. Methods for the production of the recombinant cyanobacteria, as well as methods for use thereof for production of acetoin and 2,3-butanediol from carbon dioxide and light are also provided.

[0009] The present disclosure provides cyanobacteria comprising a recombinant (e.g., heterologous) polynucleotide encoding an acetolactate synthase (ALS) and an acetolactate decarboxylase (ALDC), wherein expression of the ALS and the ALDC results in an increase in production of acetoin as compared to a corresponding cyanobacterium lacking the polynucleotide. The present disclosure further provides cyanobacteria comprising a recombinant (e.g., heterologous) polynucleotide encoding an acetolactate synthase (ALS), an acetolactate decarboxylase (ALDC) and a secondary alcohol dehydrogenase (sADH), wherein expression of the ALS, the ALDC and the sADH results in an increase in production of one or both of acetoin and 2,3-butanediol (23BD) as compared to a corresponding cyanobacterium lacking the polynucleotide. In some embodiments, the corresponding cyanobacterium is a control cyanobacterium such as a parent cyanobacterium or a cell of the same genus, or the same species. In some embodiments, the ALS is a bacterial ALS. In some preferred embodiments, the ALS is a Bacillus sp. ALS (e.g., a Bacillus subtilis ALS). In some embodiments, the ALDC is a bacterial ALDC or a fungal ALDC. In some preferred embodiments, the ALDC is selected from the group consisting of an Enterobacter sp. ALDC, a Bacillus sp. ALDC, an Aeromonas sp. ALDC, and a Gluconacetobacter sp. ALDC. In a subset of these embodiments, the ALDC is selected from the group consisting of an Enterobacter aerogenes ALDC, an Enterobacter cloacae ALDC, a Bacillus licheniformis ALDC, a Bacillus subtilis ALDC, an Aeromonas hydrophila ALDC, and a Gluconacetobacter xylinus ALDC. In some embodiments, the sADH is a bacterial sADH or a fungal sADH. In some embodiments, the sADH is selected from the group consisting of an ascomycetes sADH, a firmicutes sADH and a saccharomycetes sADH. In some preferred embodiments, the sADH is selected from the group consisting of a Candida sp. sADH, a Leuconostoc sp. sADH, a Clostridium sp. sADH, and a Thermoanaerobacter sp. sADH. In a subset of these embodiments, the sADH is selected from the group consisting of a Candida parapsilosis sADH, a Leuconostoc pseudomesenteroides sADH, a Clostridium beijerinckii sADH, and a Thermoanaerobacter brockii sADH. In some embodiments, the expression of the ALS and the ALDC, or the expression of the ALS, the ALDC and the sADH is driven by an inducible promoter. In some embodiments, the expression of the ALS and the ALDC, or the expression of the ALS, the ALDC and the sADH is driven by a constitutive promoter. In some embodiments, the polynucleotide is stably integrated into the genome of the cyanobacterium. In some embodiments, the polynucleotide is transiently maintained (e.g., as a plasmid) within the cyanobacterium. In some embodiments, the cyanobacterium is of an order selected from the group consisting of Chroococcales and Nostocales. In some preferred embodiments, the cyanobacterium is selected from the group consisting of Nostoc sp., Synechococcus sp., Synechocystis sp., and Thermosynechococcus sp. In a subset of these embodiments, the cyanobacterium is selected from the group consisting of Nostoc punctiforme ACCS 074, Synechococcus elongatus PCC 7942, Synechococcus sp. WH 8102, Synechocystis sp. PCC 6803, and Thermosynechococcus elongatus BP-1. In some embodiments, the production of one or both of acetoin and 2,3-butanediol (23BD) occurs as a result of culturing the cyanobacterium under constant light. In some embodiments, the production of one or both of acetoin and 2,3-butanediol (23BD) occurs as a result of culturing the cyanobacterium in the presence of bicarbonate. In some embodiments, the ALDC is essentially insensitive to oxygen (undetectable or low oxygen sensitivity). In some embodiments, the sADH is essentially insensitive to oxygen (undetectable or low oxygen sensitivity) and is NADPH-dependent.

[0010] Additionally, the present disclosure provides methods of producing acetoin, comprising: providing a cyanobacterium; and culturing the cyanobacterium in a photosynthetic environment comprising CO2 and light whereby the expression of the ALS and the ALDC results in production of acetoin. In some embodiments, the production of acetoin occurs at a higher level than that produced by culturing the corresponding (e.g., control) cyanobacterium lacking the polynucleotide under the same conditions. The present disclosure further provides methods of producing 2,3-butanediol, comprising: providing a cyanobacterium; and culturing the cyanobacterium in a photosynthetic environment comprising CO2 and light whereby expression of the ALS, the ALDC and the sADH results in production of 2,3-butanediol (23BD). In some embodiments, the production of 23BD occurs at a higher level than that produced by culturing the corresponding (e.g., control) cyanobacterium lacking the polynucleotide under the same conditions. The cyanobacterium that produces one or both of acetoin and 23BD through the use of these methods is a cyanobacterium of the preceding paragraph or a recombinant cyanobacterium as otherwise provided in the description of the present disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

[0011] FIG. 1 illustrates how CO2 can be used in the biological synthesis of acetoin and 2,3-butanediol, as well as other industrially important chemicals. ALS: acetolactate synthase, ALDC: acetolactate decarboxylase, sADH: secondary alcohol dehydrogenase.

[0012] FIG. 2 illustrates how various concentrations of acetoin and 2,3-butanediol impact the growth rate of S. elongatus.

[0013] FIG. 3A provides a schematic representation of recombination to integrate alsS and alsD genes into the S. elongatus chromosome. FIG. 3B shows acetoin production in modified E. coli cells grown for 16 h (dark) and 40 h (light). FIG. 3C shows acetoin production in modified S. elongatus cells grown for 72 h. alsS indicates inclusion (+) or absence (-) of the alsS (B.s.) gene. alsD indicates the source organism for the alsD gene (Table 1). FIG. 3D shows the effect of IPTG (1 mM) on the production of acetoin in S. elongatus containing alsS (B.s.) and alsD (E. a.) under PLlacO1. Error bars indicate s.d. (n = 3).

[0014] FIG. 4A provides a schematic representation of recombination to integrate alsS, alsD, and adh genes into the S. elongatus chromosome. FIG. 4B shows acetoin and 23BD production in modified E. coli cells grown for 40 hours. Light bars show acetoin, while dark bars show butanediol (middle bars: meso-23BD, right bars: (R,R)-23BD). alsS indicates inclusion of (+) alsS (B.s.). alsD and adh rows indicate the source organism for the gene (Table 1). Activity is that of sADH expressed in E. coli and measured in cell extract (nmol NADPH min⁻¹ mg⁻¹). FIG. 4C shows 23BD production in modified S. elongatus cells grown for 72 h. FIG. 4D shows specific activities of ALS and ADH in cell extracts from modified S. elongatus strains. Error bars indicate standard deviation.

[0015] FIG. 5A-D provides a summary of long-term 2,3-butanediol production by recombinant S. elongatus in continuous cultures. Squares: S. elongatus containing alsS (B. s.), alsD (A. h.) and adh (C. b.). Circles: S. elongatus containing alsS (B. s.), alsD (A. h.) and adh (T. b.). Triangles: S. elongatus containing alsS (B. s.) and alsD (E. a.). Diamonds: S. elongatus without alsS, alsD or adh. FIG. 5A shows time courses for growth. FIG. 5B shows total 23BD production. FIG. 5C shows photosynthetic efficiency. FIG. 5D shows total biomass production per day. Error bars indicate standard deviation (n = 3).

[0016] FIG. 6A shows daily 2,3-butanediol production by AL757 (with integrated alsS, alsD (A. h.), and adh (C. b.)). FIG. 6B shows acetoin concentration during long-term 23BD production experiments. Squares: AL757 (with integrated alsS (B. s.), alsD (A. h.) and adh (C. b.)). Circles: AL756 (with integrated alsS (B. s.), alsD (A. h.) and adh (T. b.)). FIG. 6C shows long-term acetoin production by AL763 (with integrated alsS and alsD (E. a.)).

[0017] FIG. 7 shows a comparison of productivities for various chemicals produced from exogenous pathways in cyanobacteria. Source data are obtained from: 23BD (this disclosure), isobutyraldehyde and isobutanol (Atsumi et al., Nat Biotechnol, 27:1177-1180, 2009), fatty acid (Liu et al., Proc Natl Acad Sci USA, 108:6899-6904, 2011), ethanol (Dexter and Pengcheng, Energy Environ Sci, 2:857-864, 2009), acetone (Zhou et al., Metab Eng, 14:394-400, 2012), ethylene (Takahama et al., J Biosci Bioeng, 95:302-305, 2003), butanol (Lan and Liao, Proc Natl Acad Sci USA, 109:6018-6023, 2012), fatty alcohol (Tan et al., Metab Eng, 13:169-176, 2011). The detailed calculations are described in Example 1.

DESCRIPTION

[0018] There exists a need to curb CO2 emissions into the atmosphere and/or reduce the CO2 concentrations already in the atmosphere. Photosynthetic microorganisms have the capability of removing this compound from the atmosphere, but generally lack the capability to synthesize commodity chemicals using CO2 as a starting material. A useful commodity chemical is the compound acetoin. Acetoin is cited as one of the top 30 most valuable chemical building blocks suitable for use by biorefineries (U.S. Department of Energy). Additionally, acetoin is a widely used perfuming agent and flavorant with important applications in the food industry. Prior to development of the present disclosure, there were no known methods for producing acetoin from CO2, nor were there any known methods for producing acetoin from CO2 and sunlight with the use of photosynthetic microorganisms.

[0019] Another useful commodity chemical is 2,3-butanediol (23BD), a versatile building block molecule for the synthesis of valuable chemicals. 23BD can be converted by dehydrogenation to methyl ethyl ketone (MEK), which is a liquid fuel additive and useful industrial solvent (Tran et al., Biotechnol Bioeng, 29:343-351, 1987). Furthermore, the catalytic conversion of 23BD to 1,3-butadiene, which is a precursor for a diversity of polymer and copolymer materials, has been well established (van Haveren et al., Biofuels Bioprod Bioref, 2:41-57, 2008). 23BD has also been used in the manufacturing of plasticizers, inks, fumigants, and explosives (Syu, Appl Microbiol Biotechnol, 55:10-18, 2001). Importantly, 23BD can be synthesized from the starting compound acetoin. While microbial fermentation to produce 23BD has been under development for many years (Ji et al., Biotechnol Adv, 29:351-364, 2011; and Celinska and Grajek, Biotechnol Adv, 27:715-725, 2009), existing methods fail to additionally address the need to reduce CO2 emissions into the atmosphere.

[0020] Conversion of CO2 by photosynthetic organisms is an attractive target for establishment of fossil fuel reserve-independent synthesis of chemicals. As described herein, a 2,3-butanediol (23BD) biosynthetic pathway has been systematically developed in Synechococcus elongatus PCC7942. This model system demonstrates that cyanobacteria can be employed for efficient production of commodity chemicals. 23BD was selected as a target chemical with low host toxicity, which permitted the design of an oxygen-insensitive, cofactor-matched biosynthetic pathway coupled with irreversible enzymatic steps to create a driving force toward 23BD production. Exemplary methods resulted in the production of 23BD from CO2 to a level of 2.38 g/L, which is a significant increase for chemical production from exogenous pathways in cyanobacteria.
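To put the reported 2.38 g/L titer in molar terms, the following is a minimal sketch of the unit conversion. The atomic masses are standard rounded values, and the CO2 figure counts only the carbon that ends up in 23BD (four carbon atoms per molecule), not carbon fixed into biomass; the function and variable names are illustrative and do not come from the disclosure.

```python
# Standard atomic masses in g/mol (rounded); not taken from the disclosure.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}


def molar_mass(formula):
    """Molar mass in g/mol for a composition given as {element: count}."""
    return sum(ATOMIC_MASS[element] * n for element, n in formula.items())


BUTANEDIOL = {"C": 4, "H": 10, "O": 2}  # 2,3-butanediol, C4H10O2
CO2 = {"C": 1, "O": 2}

titer_g_per_l = 2.38  # 23BD titer reported in the text

# Convert the mass titer to a molar concentration.
mol_per_l = titer_g_per_l / molar_mass(BUTANEDIOL)

# Each 23BD molecule contains four carbon atoms, so at least four CO2
# molecules must have been fixed per molecule of product.
co2_g_per_l = 4 * mol_per_l * molar_mass(CO2)

print(f"23BD concentration: {mol_per_l * 1000:.1f} mM")
print(f"CO2 carbon captured in product: {co2_g_per_l:.2f} g CO2 per litre")
```

Under these assumptions the titer corresponds to roughly 26 mM 23BD.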

Recombinant Polynucleotides, Vectors, Source, and Host Organisms

[0021] The present disclosure provides cyanobacteria for use in the production of acetoin and 2,3-butanediol (23BD). These cyanobacteria contain a recombinant polynucleotide encoding an acetolactate synthase (ALS) and an acetolactate decarboxylase (ALDC). Expression of the ALS and ALDC results in an increase in production of acetoin as compared to a cyanobacterium lacking this polynucleotide.

[0022] The present disclosure further provides cyanobacteria for use in the production of acetoin and/or 2,3-butanediol. These cyanobacteria contain a recombinant polynucleotide encoding an acetolactate synthase (ALS), an acetolactate decarboxylase (ALDC), and a secondary alcohol dehydrogenase (sADH). Expression of the ALS, ALDC, and sADH results in an increase in production of acetoin and/or 2,3-butanediol as compared to a cyanobacterium lacking this polynucleotide.

[0023] Also provided herein are recombinant vectors, expression cassettes, and polynucleotides comprising coding sequences of one or more of ALS, ALDC, and sADH. In some embodiments, these nucleic acids are used to produce bacteria (e.g., cyanobacteria), which in turn can be used to produce one or both of acetoin and 2,3-butanediol.

[0024] Provided herein are polynucleotides comprising coding sequences for an acetolactate synthase and an acetolactate decarboxylase. Also provided herein are polynucleotides comprising coding sequences for an acetolactate synthase, an acetolactate decarboxylase, and a secondary alcohol dehydrogenase. In some embodiments, the coding sequences are operably linked to a promoter as part of a vector or an expression cassette.

[0025] As used herein, "vector" refers to a polynucleotide compound and/or composition that, once introduced into a host cell, transforms the host cell, thereby causing the cell to express polynucleotides and/or polypeptides other than those native to the organism, or in a manner not native to the organism. Preferred vectors are plasmids or similar genetic constructs, particularly those with well-documented restriction sites that ease the cloning step of introducing polynucleotide sequences of interest into the plasmid. Such plasmids, as well as other vectors, are well known to those of ordinary skill in the art and may be used in the compositions and methods of the present disclosure as appropriate.

[0026] As used herein, "coding sequence" refers to a polynucleotide sequence that, when expressed, can be translated into a polypeptide.

[0027] As used herein, "operably linked" refers to a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of the polynucleotide sequence such that the control sequence directs expression of the polynucleotide.

[0028] In preferred embodiments, the acetolactate synthase coding sequence of the present disclosure corresponds to an alsS polynucleotide or a homolog thereof. In preferred embodiments, the acetolactate decarboxylase coding sequence corresponds to an alsD polynucleotide or a homolog thereof. In preferred embodiments, the secondary alcohol dehydrogenase coding sequence corresponds to an adh polynucleotide or a homolog thereof.

[0029] In preferred embodiments, the alsS polynucleotide produces a polypeptide having acetolactate synthase activity. In preferred embodiments, the alsD polynucleotide produces a polypeptide having acetolactate decarboxylase activity. In preferred embodiments, the adh polynucleotide produces a polypeptide having secondary alcohol dehydrogenase activity.

[0030] Acetolactate synthase (ALS) is part of an amino acid biosynthesis pathway responsible for the synthesis of valine, leucine, and isoleucine. Overall, acetolactate synthase enzymes catalyze the conversion of two pyruvate molecules into 2-acetolactate and CO2. Specifically, the enzyme catalyzes the aldo condensation of two molecules of pyruvate to 2-acetolactate. The overall reaction catalyzed by ALS is irreversible because of CO2 evolution. The first step in catalysis is the ionization of the thiazolium ring of thiamine pyrophosphate (TPP). Overall, TPP is involved in the linkage of the two pyruvate molecules. The highly reactive tricyclic intermediate forms first and reacts with the first pyruvate, which then decarboxylates to give the relatively non-reactive enamine. Because this intermediate is stable, the enzyme can pause midway through the catalytic cycle while releasing CO2 and admitting the second molecule of pyruvate. The tricyclic carbanion then forms and reacts with the second pyruvate. Deprotonation followed by carbon-carbon bond breakage completes the reaction, producing 2-acetolactate (see, e.g., US 2011/0262982).

[0031] Enzymatic reactions can be classified according to their Enzyme Commission (EC) number. The EC number associated with a given enzyme specifies the classification of the type of enzymatic reaction that a given enzyme is capable of catalyzing. EC numbers do not specify identities of enzymes, but instead specify the identity of the chemical reaction that a given enzyme catalyzes. Similarly, proteins can also be assigned Gene Ontology (GO) terms. GO terms attempt to further define the given role and/or function of a protein in a living organism by specifying protein function in terms of a cellular component, a biological process, and/or a molecular function. For example, two enzymes from two different species of organisms that catalyze the same chemical reaction could be assigned the same EC classification and GO term annotation, even though the respective enzymes are endogenous to different organisms. EC and GO term classifications are helpful to those skilled in the art in identifying the molecular function and/or activity of a given protein independently of its organism-specific identifier, such as its NCBI (National Center for Biotechnology Information) identifier.

[0032] Acetolactate synthase enzymes catalyze the enzymatic reaction belonging to the classification EC 2.2.1.6 (acetolactate synthase activity) and gene ontology (GO) term ID of GO:0003984. The GO term ID specifies that any protein characterized as having this associated GO term encodes an enzyme with catalytic acetolactate synthase activity.
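The hierarchical structure of EC numbers and the fixed layout of GO term IDs can be illustrated with a short sketch. The top-level class names follow the IUBMB enzyme nomenclature; the helper names `parse_ec`, `ec_class` and `normalize_go` are hypothetical and are not part of the disclosure or of any library.

```python
# Top-level EC classes as defined by the IUBMB enzyme nomenclature.
EC_CLASSES = {
    1: "oxidoreductases",
    2: "transferases",
    3: "hydrolases",
    4: "lyases",
    5: "isomerases",
    6: "ligases",
    7: "translocases",
}


def parse_ec(ec):
    """Split 'EC 2.2.1.6' (or '2.2.1.6') into its four integer levels."""
    levels = ec.replace("EC", "").strip().split(".")
    if len(levels) != 4:
        raise ValueError(f"expected four levels, got {ec!r}")
    return tuple(int(level) for level in levels)


def ec_class(ec):
    """Return the name of the top-level class for an EC number."""
    return EC_CLASSES[parse_ec(ec)[0]]


def normalize_go(go_id):
    """Return a GO term ID in the canonical 'GO:' plus seven digits form."""
    number = go_id.upper().replace("GO:", "").strip()
    if not (number.isdigit() and len(number) == 7):
        raise ValueError(f"not a GO term ID: {go_id!r}")
    return f"GO:{number}"


# Values quoted in this document for the two pathway enzymes.
for name, ec in [("ALS", "EC 2.2.1.6"), ("ALDC", "EC 4.1.1.5")]:
    print(name, parse_ec(ec), ec_class(ec))
print(normalize_go("GO: 0003984"))
```

The first digit alone places ALS among the transferases and ALDC among the lyases, which is why two enzymes from unrelated organisms can share one EC number.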

[0033] Various acetolactate synthase (alsS) genes, which encode acetolactate synthase enzymes, are known in the art. Examples of alsS genes include but are not limited to gil83644996lreflYP_433431. ll acetolactate synthase [Hahella chejuensis KCTC 2396]; gil32141318lreflNP_733719. ll acetolactate synthase [Streptomyces coelicolor A3(2)];

gi|238917299|ref|YP_002930816.1| acetolactate synthase [Eubacterium eligens ATCC 27750]; gi|312136848|ref|YP_004004185.1| acetolactate synthase [Methanothermus fervidus DSM 2088]; gi|311224567|gb|ADP77423.1| Acetolactate synthase [Methanothermus fervidus DSM 2088]; gi|238872659|gb|ACR72369.1| acetolactate synthase [Eubacterium eligens ATCC 27750]; gi|119671178|emb|CAL95091.1| acetolactate synthase [Azoarcus sp. BH72]; gi|384250777|gb|EIE24256.1| acetolactate synthase [Coccomyxa subellipsoidea C-169]; gi|365857129|ref|ZP_09397126.1| acetolactate synthase [Acetobacteraceae bacterium AT-5844]; gi|363716653|gb|EHM00051.1| acetolactate synthase [Acetobacteraceae bacterium AT-5844]; gi|357547224|gb|EHJ29116.1| acetolactate synthase [Lactobacillus rhamnosus ATCC 21052]; gi|312898921|ref|ZP_07758309.1| acetolactate synthase [Megasphaera micronuciformis F0359]; gi|310620083|gb|EFQ03655.1| acetolactate synthase [Megasphaera micronuciformis F0359]; gi|392978028|ref|YP_006476616.1| acetolactate synthase [Enterobacter cloacae subsp. dissolvens SDM]; gi|387878229|ref|YP_006308533.1| acetolactate synthase [Mycobacterium sp. MOTT36Y]; gi|387619850|ref|YP_006127477.1| acetolactate synthase [Escherichia coli DH1]; gi|386760231|ref|YP_006233448.1| acetolactate synthase [Bacillus sp. JS]; gi|386732742|ref|YP_006206238.1| acetolactate synthase [Listeria monocytogenes 07PF0776]; gi|386035198|ref|YP_005955111.1| acetolactate synthase [Klebsiella pneumoniae KCTC 2242]; gi|384259770|ref|YP_005403704.1| acetolactate synthase [Rahnella aquatilis HX2]; gi|384267136|ref|YP_005422843.1| acetolactate synthase [Bacillus amyloliquefaciens subsp. plantarum YAU B9601-Y2]; gi|384170312|ref|YP_005551690.1| acetolactate synthase [Bacillus amyloliquefaciens XH7]; gi|379741437|ref|YP_005333406.1| acetolactate synthase [Vibrio cholerae IEC224]; gi|375364038|ref|YP_005132077.1| acetolactate synthase [Bacillus amyloliquefaciens subsp. plantarum CAU B946]; gi|375261263|ref|YP_005020433.1| acetolactate synthase [Klebsiella oxytoca KCTC 1686]; gi|375009759|ref|YP_004983392.1| acetolactate synthase [Geobacillus thermoleovorans CCB_US3_UF5]; gi|374323653|ref|YP_005076782.1| acetolactate synthase [Paenibacillus terrae HPL-003]; gi|344207293|ref|YP_004792434.1| acetolactate synthase [Stenotrophomonas maltophilia JV3]; gi|338534039|ref|YP_004667373.1| acetolactate synthase [Myxococcus fulvus HW-1]; gi|336249979|ref|YP_004593689.1| acetolactate synthase [Enterobacter aerogenes KCTC 2190]; gi|334144738|ref|YP_004537894.1| acetolactate synthase [Thioalkalimicrobium cyclicum ALM1]; gi|333895383|ref|YP_004469258.1| acetolactate synthase [Alteromonas sp. SN2]; gi|332297456|ref|YP_004439378.1| Acetolactate synthase [Treponema brennaborense DSM 12168]; gi|330838389|ref|YP_004412969.1| Acetolactate synthase [Selenomonas sputigena ATCC 35185]; gi|328950467|ref|YP_004367802.1| acetolactate synthase [Marinithermus hydrothermalis DSM 14884]; gi|326780667|ref|ZP_08239932.1| Acetolactate synthase [Streptomyces griseus XylebKG-1]; gi|325298954|ref|YP_004258871.1| Acetolactate synthase [Bacteroides salanitronis DSM 18170]; gi|321313145|ref|YP_004205432.1| acetolactate synthase [Bacillus subtilis BSn5]; gi|311070109|ref|YP_003975032.1| acetolactate synthase [Bacillus atrophaeus 1942]; gi|53721452|ref|YP_110437.1| acetolactate synthase [Burkholderia pseudomallei K96243]; gi|21221221|ref|NP_627000.1| acetolactate synthase [Streptomyces coelicolor A3(2)]. Each sequence associated with the foregoing accession numbers is incorporated herein by reference.
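For readers who wish to process such accession lists programmatically, the following is a minimal sketch only. It assumes each entry follows the legacy pipe-delimited NCBI layout `gi|<number>|<database>|<accession>|` followed by a description; the type `NcbiEntry` and the function `parse_entry` are hypothetical names introduced for illustration and are not part of the disclosure.

```python
from typing import NamedTuple, Optional


class NcbiEntry(NamedTuple):
    gi: str           # numeric GenInfo identifier
    database: str     # source database tag, e.g. "ref", "gb", "emb"
    accession: str    # accession with version, e.g. "YP_006127477.1"
    description: str  # free text that follows the identifier


def parse_entry(entry: str) -> Optional[NcbiEntry]:
    """Parse 'gi|<number>|<db>|<accession>| <description>'; return None if malformed."""
    fields = entry.strip().rstrip(";").split("|")
    if len(fields) < 5 or fields[0] != "gi" or not fields[1].isdigit():
        return None
    gi, database, accession = fields[1], fields[2], fields[3]
    description = "|".join(fields[4:]).strip()
    return NcbiEntry(gi, database, accession, description)


entry = parse_entry(
    "gi|387619850|ref|YP_006127477.1| acetolactate synthase [Escherichia coli DH1];"
)
print(entry.accession if entry else "unparseable")
```

Entries that do not match the assumed layout return `None` rather than a guessed value, so damaged identifiers are flagged instead of silently repaired.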

[0034] Acetolactate decarboxylase (ALDC) is an enzyme that belongs to the family of carboxy-lyases, which cleave carbon-carbon bonds. Acetolactate decarboxylase catalyzes the conversion of 2-acetolactate (also known as 2-hydroxy-2-methyl-3-oxobutanoate) to acetoin, releasing CO2.

[0035] Acetolactate decarboxylase enzymes catalyze the enzymatic reaction belonging to the classification EC 4.1.1.5 (acetolactate decarboxylase activity) and gene ontology (GO) term ID of GO:0047605. The GO term ID specifies that any protein characterized as having this associated GO term is an enzyme with catalytic acetolactate decarboxylase activity.

[0036] Various acetolactate decarboxylase (alsD) genes, which encode acetolactate decarboxylase enzymes, are known in the art. Examples of alsD genes include but are not limited to gi|375143627|ref|YP_005006068.1| Acetolactate decarboxylase [Niastella koreensis GR20-10]; gi|361057673|gb|AEV96664.1| Acetolactate decarboxylase [Niastella koreensis GR20-10]; gi|218763415|gb|ACL05881.1| Acetolactate decarboxylase [Desulfatibacillum alkenivorans AK-01]; gi|220909520|ref|YP_002484831.1| acetolactate decarboxylase [Cyanothece sp. PCC 7425]; gi|218782031|ref|YP_002433349.1| acetolactate decarboxylase [Desulfatibacillum alkenivorans AK-01]; gi|213693090|ref|YP_002323676.1| Acetolactate decarboxylase [Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222]; gi|189500297|ref|YP_001959767.1| Acetolactate decarboxylase [Chlorobium phaeobacteroides BS1]; gi|189423787|ref|YP_001950964.1| acetolactate decarboxylase [Geobacter lovleyi SZ]; gi|172058271|ref|YP_001814731.1| acetolactate decarboxylase [Exiguobacterium sibiricum 255-15]; gi|163938775|ref|YP_001643659.1| acetolactate decarboxylase [Bacillus weihenstephanensis KBAB4]; gi|158522304|ref|YP_001530174.1| acetolactate decarboxylase [Desulfococcus oleovorans Hxd3]; gi|157371670|ref|YP_001479659.1| acetolactate decarboxylase [Serratia proteamaculans 568]; gi|150395111|ref|YP_001317786.1| acetolactate decarboxylase [Staphylococcus aureus subsp. aureus JH1]; gi|150394715|ref|YP_001317390.1| acetolactate decarboxylase [Staphylococcus aureus subsp. aureus JH1]; gi|146311679|ref|YP_001176753.1| acetolactate decarboxylase [Enterobacter sp. 638]; gi|109900061|ref|YP_663316.1| acetolactate decarboxylase [Pseudoalteromonas atlantica T6c]; gi|219866131|gb|ACL46470.1| Acetolactate decarboxylase [Cyanothece sp. PCC 7425]; gi|213524551|gb|ACJ53298.1| Acetolactate decarboxylase [Bifidobacterium longum subsp. infantis ATCC 15697 = JCM 1222]; gi|189420046|gb|ACD94444.1| Acetolactate decarboxylase [Geobacter lovleyi SZ]; gi|158511130|gb|ABW68097.1| Acetolactate decarboxylase [Desulfococcus oleovorans Hxd3]; gi|157323434|gb|ABV42531.1| Acetolactate decarboxylase [Serratia proteamaculans 568]; gi|145318555|gb|ABP60702.1| Acetolactate decarboxylase [Enterobacter sp. 638]; gi|149947563|gb|ABR53499.1| Acetolactate decarboxylase [Staphylococcus aureus subsp. aureus JH1]; gi|149947167|gb|ABR53103.1| Acetolactate decarboxylase [Staphylococcus aureus subsp. aureus JH1]; gi|163860972|gb|ABY42031.1| Acetolactate decarboxylase [Bacillus weihenstephanensis KBAB4]; gi|109702342|gb|ABG42262.1| Acetolactate decarboxylase [Pseudoalteromonas atlantica T6c]; gi|189495738|gb|ACE04286.1| Acetolactate decarboxylase [Chlorobium phaeobacteroides BS1]; gi|171990792|gb|ACB61714.1| Acetolactate decarboxylase [Exiguobacterium sibiricum 255-15]; gi|223932563|ref|ZP_03624564.1| Acetolactate decarboxylase [Streptococcus suis 89/1591]; gi|194467531|ref|ZP_03073518.1| Acetolactate decarboxylase [Lactobacillus reuteri 100-23]; gi|223898834|gb|EEF65194.1| Acetolactate decarboxylase [Streptococcus suis 89/1591]; gi|194454567|gb|EDX43464.1| Acetolactate decarboxylase [Lactobacillus reuteri 100-23]; gi|384267135|ref|YP_005422842.1| acetolactate decarboxylase [Bacillus amyloliquefaciens subsp. plantarum YAU B9601-Y2]; gi|375364037|ref|YP_005132076.1| acetolactate decarboxylase [Bacillus amyloliquefaciens subsp. plantarum CAU B946]; gi|340793231|ref|YP_004758694.1| acetolactate decarboxylase [Corynebacterium variabile DSM 44702]; gi|336325119|ref|YP_004605085.1| acetolactate decarboxylase [Corynebacterium resistens DSM 45100]; gi|148269032|ref|YP_001247975.1| acetolactate decarboxylase [Staphylococcus aureus subsp. aureus JH9]; gi|148268650|ref|YP_001247593.1| acetolactate decarboxylase [Staphylococcus aureus subsp. aureus JH9]; gi|148543372|ref|YP_001270742.1| acetolactate decarboxylase [Lactobacillus reuteri DSM 20016]; gi|380500488|emb|CCG51526.1| acetolactate decarboxylase [Bacillus amyloliquefaciens subsp. plantarum YAU B9601-Y2]; gi|371570031|emb|CCF06881.1| acetolactate decarboxylase [Bacillus amyloliquefaciens subsp. plantarum CAU B946]; gi|340533141|gb|AEK35621.1| acetolactate decarboxylase [Corynebacterium variabile DSM 44702]; gi|336101101|gb|AEI08921.1| acetolactate decarboxylase [Corynebacterium resistens DSM 45100]; gi|148530406|gb|ABQ82405.1| Acetolactate decarboxylase [Lactobacillus reuteri DSM 20016]; gi|147742101|gb|ABQ50399.1| Acetolactate decarboxylase [Staphylococcus aureus subsp. aureus JH9]; gi|147741719|gb|ABQ50017.1| Acetolactate decarboxylase [Staphylococcus aureus subsp. aureus JH9]; gi|392529510|ref|ZP_10276647.1| acetolactate decarboxylase [Carnobacterium maltaromaticum ATCC 35586]; gi|366054074|ref|ZP_09451796.1| acetolactate decarboxylase [Lactobacillus suebicus KCTC 3549]; gi|339624147|ref|ZP_08659936.1| acetolactate decarboxylase [Fructobacillus fructosus KCTC 3544]; gi|336393727|ref|ZP_08575126.1| acetolactate decarboxylase [Lactobacillus coryniformis subsp. torquens KCTC 3535]. Each sequence associated with the foregoing accession numbers is incorporated herein by reference.

[0037] Alcohol dehydrogenase (ADH) is an enzyme that catalyzes the reduction of aldehydes and/or ketones into alcohols. Secondary alcohol dehydrogenases specifically have catalytic activity on substrates capable of producing secondary alcohols. Secondary alcohols are those alcohols in which the carbon bound to the hydroxyl (alcohol) functional group is covalently bonded to two other carbon atoms. Depending on the specific ADH, some alcohol dehydrogenases can also catalyze the opposite reaction, the oxidation of alcohols into aldehydes and/or ketones. Alcohol dehydrogenases typically use NAD-containing molecules (such as NAD+, NADH, and NADPH) as cofactors in the enzymatic redox reaction with their substrates. With regard to the methods of the present disclosure, the secondary alcohol dehydrogenase catalyzes the reduction of an aldehyde and/or ketone into an alcohol.

[0038] Alcohol dehydrogenase enzymes catalyze the enzymatic reaction belonging to the classification EC 1.1.1.1 (alcohol dehydrogenase activity) and gene ontology (GO) term ID of GO:0004025. The GO term ID specifies that any protein characterized as having this associated GO term is an enzyme with catalytic alcohol dehydrogenase activity.

[0039] Various alcohol dehydrogenase (adh) genes, which encode alcohol dehydrogenase enzymes, are known in the art. Examples of adh genes include but are not limited to gi|223587866|emb|CAX36647.1| alcohol dehydrogenase [Arthrobacter sp. JEK-2009]; gi|343083017|ref|YP_004772312.1| alcohol dehydrogenase [Cyclobacterium marinum DSM 745]; gi|375146863|ref|YP_005009304.1| Alcohol dehydrogenase [Niastella koreensis GR20-10]; gi|332296134|ref|YP_004438057.1| Alcohol dehydrogenase [Thermodesulfobium narugense DSM 14796]; gi|327403757|ref|YP_004344595.1| Alcohol dehydrogenase (NADP(+)) [Fluviicola taffensis DSM 16823]; gi|325287739|ref|YP_004263529.1| alcohol dehydrogenase [Cellulophaga lytica DSM 7489]; gi|325295143|ref|YP_004281657.1| Alcohol dehydrogenase [Desulfurobacterium thermolithotrophum DSM 11699]; gi|325289753|ref|YP_004265934.1| alcohol dehydrogenase [Syntrophobotulus glycolicus DSM 8271]; gi|325289499|ref|YP_004265680.1| alcohol dehydrogenase [Syntrophobotulus glycolicus DSM 8271]; gi|325110410|ref|YP_004271478.1| alcohol dehydrogenase [Planctomyces brasiliensis DSM 5305]; gi|325108594|ref|YP_004269662.1| alcohol dehydrogenase [Planctomyces brasiliensis DSM 5305]; gi|319955560|ref|YP_004166827.1| alcohol dehydrogenase [Cellulophaga algicola DSM 14237]; gi|325065591|gb|ADY73598.1| Alcohol dehydrogenase [Desulfurobacterium thermolithotrophum DSM 11699]; gi|332179237|gb|AEE14926.1| Alcohol dehydrogenase [Thermodesulfobium narugense DSM 14796]; gi|324970678|gb|ADY61456.1| Alcohol dehydrogenase [Planctomyces brasiliensis DSM 5305]; gi|324968862|gb|ADY59640.1| Alcohol dehydrogenase [Planctomyces brasiliensis DSM 5305]; gi|324965154|gb|ADY55933.1| Alcohol dehydrogenase [Syntrophobotulus glycolicus DSM 8271]; gi|324964900|gb|ADY55679.1| Alcohol dehydrogenase [Syntrophobotulus glycolicus DSM 8271]; gi|374373821|ref|ZP_09631481.1| Alcohol dehydrogenase [Niabella soli DSM 19437]; gi|373234794|gb|EHP54587.1| Alcohol dehydrogenase [Niabella soli DSM 19437]; gi|361060909|gb|AEV99900.1| Alcohol dehydrogenase [Niastella koreensis GR20-10]; gi|375144658|ref|YP_005007099.1| alcohol dehydrogenase [Niastella koreensis GR20-10]; gi|332981006|ref|YP_004462447.1| alcohol dehydrogenase zinc-binding domain-containing protein [Mahella australiensis 50-1 BON]; gi|332666040|ref|YP_004448828.1| alcohol dehydrogenase [Haliscomenobacter hydrossis DSM 1100]; gi|330837587|ref|YP_004412228.1| alcohol dehydrogenase [Sphaerochaeta coccoides DSM 17374]; gi|330837463|ref|YP_004412104.1| alcohol dehydrogenase [Sphaerochaeta coccoides DSM 17374]; gi|330836315|ref|YP_004410956.1| alcohol dehydrogenase [Sphaerochaeta coccoides DSM 17374]; gi|327405514|ref|YP_004346352.1| Alcohol dehydrogenase zinc-binding domain-containing protein [Fluviicola taffensis DSM 16823]; gi|325298359|ref|YP_004258276.1| Alcohol dehydrogenase [Bacteroides salanitronis DSM 18170]; gi|325291396|ref|YP_004267577.1| alcohol dehydrogenase [Syntrophobotulus glycolicus DSM 8271]; gi|325281283|ref|YP_004253825.1| Alcohol dehydrogenase (NADP(+)) [Odoribacter splanchnicus DSM 20712]; gi|325299524|ref|YP_004259441.1| Alcohol dehydrogenase [Bacteroides salanitronis DSM 18170]; gi|325106236|ref|YP_004275890.1| alcohol dehydrogenase [Pedobacter saltans DSM 12145]; gi|320333315|ref|YP_004170026.1| alcohol dehydrogenase [Deinococcus maricopensis DSM 21211]; gi|329749366|gb|AEC02722.1| Alcohol dehydrogenase [Sphaerochaeta coccoides DSM 17374]; gi|324319077|gb|ADY36968.1| Alcohol dehydrogenase [Bacteroides salanitronis DSM 18170]; gi|324317912|gb|ADY35803.1| Alcohol dehydrogenase [Bacteroides salanitronis DSM 18170]; gi|384129895|ref|YP_005512508.1| alcohol dehydrogenase [Hydrogenobacter thermophilus TK-6]; gi|357419011|ref|YP_004932003.1| alcohol dehydrogenase [Thermovirga lienii DSM 17291]; gi|332981068|ref|YP_004462509.1| alcohol dehydrogenase zinc-binding domain-containing protein [Mahella australiensis 50-1 BON]; gi|330837556|ref|YP_004412197.1| alcohol dehydrogenase [Sphaerochaeta coccoides DSM 17374]; gi|330836856|ref|YP_004411497.1| alcohol dehydrogenase [Sphaerochaeta coccoides DSM 17374]; gi|327398952|ref|YP_004339821.1| Alcohol dehydrogenase [Hippea maritima DSM 10411]; gi|325290821|ref|YP_004267002.1| alcohol dehydrogenase [Syntrophobotulus glycolicus DSM 8271]; gi|322833567|ref|YP_004213594.1| alcohol dehydrogenase [Rahnella sp. Y9602]; gi|308049202|ref|YP_003912768.1| alcohol dehydrogenase [Ferrimonas balearica DSM 9799]; gi|302392706|ref|YP_003828526.1| alcohol dehydrogenase [Acetohalobium arabaticum DSM 5501]; gi|302336054|ref|YP_003801261.1| alcohol dehydrogenase [Olsenella uli DSM 7084]; gi|302335092|ref|YP_003800299.1| alcohol dehydrogenase [Olsenella uli DSM 7084]; gi|152966730|ref|YP_001362514.1| alcohol dehydrogenase [Kineococcus radiotolerans SRS30216]. Each sequence associated with the foregoing accession numbers is incorporated herein by reference.

[0040] In preferred embodiments, the acetolactate decarboxylase polypeptide is insensitive to oxygen. In this embodiment, the acetolactate decarboxylase would have enzyme activity on its substrates even in the presence of oxygen in the environment. In some embodiments, the secondary alcohol dehydrogenase is essentially insensitive to oxygen. In this embodiment, the secondary alcohol dehydrogenase would have enzyme activity on its substrates even in the presence of oxygen in the environment.

[0041] In some embodiments, the secondary alcohol dehydrogenase polypeptide requires NADPH as a cofactor. This means that the secondary alcohol dehydrogenase (sADH) is NADPH-dependent.

[0042] In some embodiments, the secondary alcohol dehydrogenase polypeptide induces S-installing chirality on its substrates. In some embodiments, the secondary alcohol dehydrogenase polypeptide induces R-installing chirality on its substrates.

[0043] As used herein, "chiral" refers to a molecular compound that is not superimposable on its mirror image. The molecule and its mirror image are thus referred to in terms of their "chirality," often designated S or R. Different enzymes have the capacity to induce an S configuration on their substrate to form an S-product or to induce an R configuration on their substrate to form an R-product. If a given molecule has more than one asymmetric carbon, then different enzymes can induce multiple S and R configurations on the different asymmetric carbons in the target molecule.

[0044] The present disclosure identifies specific polynucleotides/genes useful in the methods, compositions and organisms of the disclosure. However, it should be recognized that absolute identity to such genes is not necessary, as substantially similar polynucleotides/genes that perform substantially similar functions can also be used in the compositions and methods of the present disclosure. For example, changes in a particular gene or polynucleotide containing a sequence encoding a polypeptide or enzyme can be performed and screened for activity. Typically such changes include conservative mutations and silent mutations. Such modified or mutated polynucleotides and polypeptides can be screened for expression or function of enzymes using methods known in the art. Additionally, homologs of the polynucleotides/genes of the present disclosure are suitable for use in the compositions and methods disclosed herein.

[0045] Due to the inherent degeneracy of the genetic code, polynucleotides which encode substantially the same or a functionally equivalent polypeptide can also be used to clone and express the same polypeptides or enzymes. As will be understood by those of skill in the art, it can be advantageous to modify a coding sequence to enhance its expression in a particular host. The genetic code is redundant with 64 possible codons, but most organisms typically use a subset of these codons. The codons that are utilized most often in a species are called optimal codons, and those not utilized very often are classified as rare or low-usage codons. Codons can be substituted to reflect the preferred codon usage of E. coli or S. elongatus, a process sometimes called "codon optimization" or "controlling for species codon bias" (see Murray et al. 1989 Nucl. Acids Res. 17:477-508). An example of codon optimization of a polynucleotide is presented in Example 1 of the present disclosure (see also Table 4).
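By way of illustration only, codon substitution of the kind described above can be sketched as follows. The sketch assumes the standard genetic code; the `PREFERRED` mapping is a hypothetical preference table invented for this example and does not reflect measured codon usage of E. coli, S. elongatus, or any other host, nor the procedure of Example 1.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical preferred codons for an imaginary host (illustration only).
PREFERRED = {"L": "CTG", "K": "AAA", "S": "AGC"}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna: str, preferred: dict) -> str:
    """Swap each codon for the preferred synonymous codon, if one is listed."""
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    recoded = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        choice = preferred.get(amino_acid, codon)  # keep codon if no preference
        if CODON_TABLE[choice] != amino_acid:
            raise ValueError(f"{choice} does not encode {amino_acid}")
        recoded.append(choice)
    return "".join(recoded)


original = "ATGTTAAAGTCTTAA"
optimized = recode(original, PREFERRED)
print(optimized, translate(optimized))
```

The nucleotide sequence changes while the encoded polypeptide does not, which is the property that codon optimization relies upon.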

[0046] Those of skill in the art will recognize that, due to the degenerate nature of the genetic code, a variety of DNA compounds differing in their nucleotide sequences can be used to encode a given polypeptide or enzyme of the disclosure. The present disclosure includes DNA compounds of any sequence that encode the amino acid sequences of the polypeptides and proteins of the enzymes utilized in the compositions and methods of the disclosure. In similar fashion, a polypeptide can typically tolerate one or more amino acid substitutions, deletions, and insertions in its amino acid sequence without loss or significant loss of a desired activity.
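As a rough illustration of how many DNA compounds can encode a single polypeptide, the sketch below counts synonymous coding sequences under the standard genetic code. The function name `count_encodings` is hypothetical, and the count ignores any constraints other than codon degeneracy.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of codons that encode each amino acid (or stop, written "*").
DEGENERACY = Counter(CODON_TABLE.values())


def count_encodings(peptide: str) -> int:
    """Count distinct DNA sequences that encode the given peptide."""
    unknown = set(peptide) - set(DEGENERACY)
    if unknown:
        raise ValueError(f"unrecognized residues: {sorted(unknown)}")
    return prod(DEGENERACY[aa] for aa in peptide)


print(count_encodings("MLKS"))  # Met (1) x Leu (6) x Lys (2) x Ser (6)
```

Even a four-residue peptide admits dozens of encodings, and the number grows multiplicatively with polypeptide length.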

[0047] Homologs of polypeptides or enzymes useful for generating metabolites are encompassed by the microorganisms and methods provided herein. The term "homologs" used with respect to an original enzyme or gene of a first family or species refers to distinct enzymes or genes of a second family or species which are determined by functional, structural or genomic analyses to be an enzyme or gene of the second family or species which corresponds to the original enzyme or gene of the first family or species. Most often, homologs will have functional, structural or genomic similarities. Techniques are known by which homologs of an enzyme or gene can readily be cloned using genetic probes and PCR. Homologs can be identified by reference to various databases, and the identity of cloned sequences as homologs can be confirmed using functional assays and/or by genomic mapping of the genes.

[0048] A protein has "homology" or is "homologous" to a second protein if the nucleic acid sequence that encodes the protein has a similar sequence to the nucleic acid sequence that encodes the second protein. Alternatively, a protein has homology to a second protein if the two proteins have "similar" amino acid sequences. Thus, the term "homologous proteins" is defined to mean that the two proteins have similar amino acid sequences.

[0049] As used herein, two proteins (or a region of the proteins) are substantially homologous when the amino acid sequences have at least about 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity. Similarly, two polynucleotides (or a region of the polynucleotides) are substantially homologous when the nucleic acid sequences have at least about 30%, 40%, 50%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity. To determine the percent identity of two amino acid sequences, or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein amino acid or nucleic acid "identity" is equivalent to amino acid or nucleic acid "homology"). The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences.
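The comparison described above can be sketched as follows for two sequences that have already been aligned. This is one common convention, assumed here for illustration: identical positions are divided by the number of alignment columns, so every gap position lowers the score. The disclosure does not fix a particular formula, and the function names and example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over alignment columns; gap columns count as mismatches."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap; they hold no information.
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("alignment has no comparable positions")
    identical = sum(1 for a, b in columns if a == b)
    return 100.0 * identical / len(columns)


def is_substantially_homologous(identity: float, threshold: float = 30.0) -> bool:
    """Apply one of the 'at least about N%' identity thresholds."""
    return identity >= threshold


# Toy pre-aligned peptides (invented for illustration).
score = percent_identity("MKT-AYLG", "MKTLAF-G")
print(score, is_substantially_homologous(score, threshold=60.0))
```

Here five of eight columns match, so the two toy peptides would satisfy a 60% threshold but not a 65% threshold.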

[0050] In some embodiments of the disclosure, the coding sequences of the polynucleotides are operably linked to a promoter. In some embodiments, the promoter is an inducible promoter. In some embodiments, the promoter is a constitutive promoter.

[0051] As used herein, "inducible promoter" refers to a promoter that drives expression of a polynucleotide to which it is operably linked upon cellular perception of a stimulus. Likewise, inducible promoters can terminate expression of a polynucleotide to which they are operably linked upon removal of a stimulus. An example of an inducible promoter in the present disclosure is the isopropyl-β-D-thiogalactoside (IPTG) inducible promoter, which drives expression of a polynucleotide to which it is operably linked upon perception of IPTG, an exogenous chemical. Any appropriate inducible promoter that has use in the compositions and methods of the present disclosure may be used accordingly. One of skill in the art will recognize that many characterized inducible promoters exist and can be used according to the compositions and methods disclosed herein.

[0052] Constitutive promoters are those promoters that are substantially insensitive to regulation by external stimuli and promote expression of a given polynucleotide in an essentially constant manner.

[0053] The present disclosure provides recombinant vectors containing recombinant polynucleotides for use in host microorganisms such as cyanobacteria.

[0054] As used herein, "recombinant" or "heterologous" or "heterologous polynucleotide" or "recombinant polynucleotide" refers to a polynucleotide wherein the exact nucleotide sequence of the polynucleotide is foreign to (i.e., not naturally found in) a given host. These terms may also refer to a polynucleotide sequence that may be naturally found in a given host, but in an unnatural (e.g., greater than or less than expected) amount, or additionally if the sequence of a polynucleotide comprises two or more subsequences that are not found in the same relationship to each other in nature. For example, regarding the latter, a recombinant polynucleotide could have two or more sequences from unrelated polynucleotides or from homologous nucleotides arranged to make a new polynucleotide. Specifically, the present disclosure describes the introduction of a recombinant vector into a microorganism, wherein the vector contains a polynucleotide coding for a polypeptide that is not normally found in the microorganism or contains a foreign polynucleotide coding for a substantially homologous polypeptide that is normally found in the host organism. With reference to the host cell's genome, then, the polynucleotide sequence that encodes the polypeptide is recombinant or heterologous.

[0055] In some embodiments, the recombinant polynucleotides of the present disclosure are stably integrated into the genome of the host organism. In some embodiments, the host organism is a cyanobacterium that has had the recombinant polynucleotides of the present disclosure stably integrated into its genome. As used herein, "stably integrated," as used with reference to the stable integration of a recombinant polynucleotide into a genome, refers to the phenomenon where the recombinant polynucleotide has become physically integrated into the organism's genomic DNA such that mitotic and reproductive events that require genomic DNA replication pass on the genetic information contained in the recombinant polynucleotide as a physical unit of the host genome.

[0056] In other embodiments, the recombinant polynucleotides of the present disclosure are not stably integrated in the host organism such as a cyanobacterium. In this sense, the recombinant polynucleotide is expressed in a host cell without becoming stably integrated into the host genome.

[0057] In some embodiments, the recombinant vector or expression cassette includes coding sequences for an acetolactate synthase and an acetolactate decarboxylase operably linked to a promoter. In preferred embodiments, the acetolactate synthase coding sequence of the present disclosure corresponds to an alsS polynucleotide or a homolog thereof. In preferred embodiments, the acetolactate decarboxylase coding sequence corresponds to an alsD polynucleotide or a homolog thereof.

[0058] It should be noted that with regard to recombinant molecules, the polynucleotides and coding sequences are always associated with their respective polypeptides. An acetolactate synthase alsS polynucleotide will produce an ALS polypeptide. An acetolactate decarboxylase alsD polynucleotide will produce an ALDC polypeptide. A secondary alcohol dehydrogenase adh polynucleotide will produce a sADH polypeptide. By definition, a polynucleotide or coding sequence from a given organism also implies the respective polypeptide is from the given organism.

[0059] In preferred embodiments, the alsS polynucleotide is derived from Bacillus subtilis. In some embodiments, the alsD polynucleotide is derived from a source organism selected from Enterobacter aerogenes, Enterobacter cloacae, Bacillus licheniformis, Bacillus subtilis, Aeromonas hydrophila, and Gluconacetobacter xylinus.

[0060] In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis and an alsD polynucleotide from Enterobacter aerogenes. In some embodiments, the recombinant vector contains an alsS polynucleotide from Bacillus subtilis and an alsD polynucleotide from Enterobacter cloacae. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis and an alsD polynucleotide from Bacillus licheniformis. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis and an alsD polynucleotide from Bacillus subtilis. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis and an alsD polynucleotide from Aeromonas hydrophila. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis and an alsD polynucleotide from Gluconacetobacter xylinus.

[0061] In some embodiments, the recombinant polynucleotide includes coding sequences for an acetolactate synthase, an acetolactate decarboxylase, and a secondary alcohol dehydrogenase. In preferred embodiments, the acetolactate synthase coding sequence corresponds to an alsS polynucleotide or a homolog thereof. In preferred embodiments, the acetolactate decarboxylase coding sequence corresponds to an alsD polynucleotide or a homolog thereof. In preferred embodiments, the secondary alcohol dehydrogenase coding sequence corresponds to an adh polynucleotide or a homolog thereof.

[0062] In preferred embodiments, the alsS polynucleotide is derived from Bacillus subtilis. In some embodiments, the alsD polynucleotide is derived from a source organism selected from Enterobacter aerogenes, Enterobacter cloacae, Bacillus licheniformis, Bacillus subtilis, Aeromonas hydrophila, and Gluconacetobacter xylinus. In some embodiments, the adh polynucleotide is derived from a source organism selected from Candida parapsilosis, Leuconostoc pseudomesenteroides, Clostridium beijerinckii, and Thermoanaerobacter brockii.

[0063] In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Enterobacter aerogenes, and an adh polynucleotide from Candida parapsilosis. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Enterobacter aerogenes, and an adh polynucleotide from Leuconostoc pseudomesenteroides. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Enterobacter aerogenes, and an adh polynucleotide from Clostridium beijerinckii. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Enterobacter aerogenes, and an adh polynucleotide from Thermoanaerobacter brockii.

[0064] In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Enterobacter cloacae, and an adh polynucleotide from Candida parapsilosis. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Enterobacter cloacae, and an adh polynucleotide from Leuconostoc pseudomesenteroides. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Enterobacter cloacae, and an adh polynucleotide from Clostridium beijerinckii. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Enterobacter cloacae, and an adh polynucleotide from Thermoanaerobacter brockii.

[0065] In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Bacillus licheniformis, and an adh polynucleotide from Candida parapsilosis. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Bacillus licheniformis, and an adh polynucleotide from Leuconostoc pseudomesenteroides. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Bacillus licheniformis, and an adh polynucleotide from Clostridium beijerinckii. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Bacillus licheniformis, and an adh polynucleotide from Thermoanaerobacter brockii.

[0066] In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Bacillus subtilis, and an adh polynucleotide from Candida parapsilosis. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Bacillus subtilis, and an adh polynucleotide from Leuconostoc pseudomesenteroides. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Bacillus subtilis, and an adh polynucleotide from Clostridium beijerinckii. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Bacillus subtilis, and an adh polynucleotide from Thermoanaerobacter brockii.

[0067] In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Aeromonas hydrophila, and an adh polynucleotide from Candida parapsilosis. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Aeromonas hydrophila, and an adh polynucleotide from Leuconostoc pseudomesenteroides. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Aeromonas hydrophila, and an adh polynucleotide from Clostridium beijerinckii. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Aeromonas hydrophila, and an adh polynucleotide from Thermoanaerobacter brockii.

[0068] In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Gluconacetobacter xylinus, and an adh polynucleotide from Candida parapsilosis. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Gluconacetobacter xylinus, and an adh polynucleotide from Leuconostoc pseudomesenteroides. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Gluconacetobacter xylinus, and an adh polynucleotide from Clostridium beijerinckii. In some embodiments, the recombinant polynucleotide contains an alsS polynucleotide from Bacillus subtilis, an alsD polynucleotide from Gluconacetobacter xylinus, and an adh polynucleotide from Thermoanaerobacter brockii.
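The three-gene embodiments recited in paragraphs [0063] through [0068] are the combinations of one alsS source, six alsD sources, and four adh sources named in paragraph [0062]. A minimal sketch that enumerates them is given below; the variable names are hypothetical and the sketch implies nothing about which combinations were actually constructed or tested.

```python
from itertools import product

# Source organisms as listed in paragraph [0062].
ALSS_SOURCES = ["Bacillus subtilis"]
ALSD_SOURCES = [
    "Enterobacter aerogenes",
    "Enterobacter cloacae",
    "Bacillus licheniformis",
    "Bacillus subtilis",
    "Aeromonas hydrophila",
    "Gluconacetobacter xylinus",
]
ADH_SOURCES = [
    "Candida parapsilosis",
    "Leuconostoc pseudomesenteroides",
    "Clostridium beijerinckii",
    "Thermoanaerobacter brockii",
]

# Every (alsS, alsD, adh) source combination, in the order the text recites them.
combinations = list(product(ALSS_SOURCES, ALSD_SOURCES, ADH_SOURCES))

for alss, alsd, adh in combinations:
    print(f"alsS: {alss} | alsD: {alsd} | adh: {adh}")
print(len(combinations), "combinations")
```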

[0069] In some embodiments, recombinant polynucleotides of the present disclosure are introduced into a host organism. In some embodiments, the host organism is E. coli. In preferred embodiments, the host organism is a cyanobacterium. In preferred embodiments, the host organism is S. elongatus. Introduction of the recombinant polynucleotides into the host organism results in transformation of the host, producing a transformed organism.

[0070] As used herein, "transformed" organisms are those organisms that have been provided a recombinant polynucleotide molecule. Transformed organisms therefore differ from wild-type organisms in that they contain exogenous genetic information. Transformed organisms, by this definition, are also recombinant organisms. Methods of transforming host organisms are elaborated upon in Example 1. However, one of skill in the art will recognize that additional methods of transformation may exist and may be used in the methods and compositions of the present disclosure where appropriate.

[0071] In preferred embodiments of the disclosure, the transformed organisms produce acetoin. In preferred embodiments, the transformed organisms produce higher levels of acetoin as compared to a wild-type organism. In preferred embodiments of the disclosure, the transformed organisms produce 2,3-butanediol. In preferred embodiments, the transformed organisms produce higher levels of 2,3-butanediol as compared to a wild-type organism.

Methods of Producing Acetoin and 2,3-Butanediol

[0072] In some embodiments, a method of producing acetoin is provided in the disclosure. The method involves the step of providing a cyanobacterium with a recombinant vector including coding sequences for an acetolactate synthase (ALS) and an acetolactate decarboxylase (ALDC) operably linked to a promoter to form a transformed cyanobacterium. The method further involves culturing the transformed cyanobacterium in a photosynthetic environment including CO2 and light whereby expression of the ALS and the ALDC results in the production of acetoin.

[0073] In some embodiments, a method of producing 2,3-butanediol is provided in the disclosure. The method involves the step of providing a cyanobacterium with a recombinant vector including coding sequences for an acetolactate synthase (ALS), an acetolactate decarboxylase (ALDC), and a secondary alcohol dehydrogenase (sADH) operably linked to a promoter to form a transformed cyanobacterium. The method further involves culturing the transformed cyanobacterium in a photosynthetic environment including CO2 and light whereby expression of the ALS, the ALDC, and the sADH results in the production of 2,3-butanediol.

[0074] In some embodiments, the acetolactate synthase coding sequence, the acetolactate decarboxylase coding sequence, and the secondary alcohol dehydrogenase coding sequence in the recombinant polynucleotides of the present disclosure are from a bacterial or a fungal source. In some embodiments, the acetolactate synthase (ALS) is a bacterial ALS or a fungal ALS. In some embodiments, the acetolactate decarboxylase (ALDC) is a bacterial ALDC or a fungal ALDC. In some embodiments, the secondary alcohol dehydrogenase (sADH) is a bacterial sADH or a fungal sADH.

[0075] In some embodiments, the ALS is of bacterial origin. In some embodiments, the ALS is an ALS from a Bacillus sp. In some embodiments, the ALS is an ALS from Bacillus subtilis.

[0076] "Bacteria", or "eubacteria", refers to a domain of prokaryotic organisms. The recombinant polynucleotides and coding sequences of the present disclosure may be of bacterial origin. Bacteria include at least 11 distinct groups as follows: (1) Gram-positive (gram+) bacteria, of which there are two major subdivisions: (a) high G+C group (Actinomycetes, Mycobacteria, Micrococcus, others) and (b) low G+C group (Bacillus, Clostridia, Lactobacillus, Staphylococci, Streptococci, Mycoplasmas); (2) Proteobacteria, e.g., purple photosynthetic and non-photosynthetic Gram-negative bacteria (includes most "common" Gram-negative bacteria); (3) Cyanobacteria, e.g., oxygenic phototrophs; (4) Spirochetes and related species; (5) Planctomyces; (6) Bacteroides, Flavobacteria; (7) Chlamydia; (8) Green sulfur bacteria; (9) Green non-sulfur bacteria (also anaerobic phototrophs); (10) Radioresistant micrococci and relatives; (11) Thermotoga and Thermosipho thermophiles (see, e.g., US 2011/0250060).

[0077] "Fungi" or "fungal" refers to members of a large group of eukaryotic organisms including yeasts, molds, and mushrooms. Fungi are abundant and diverse organisms that fall under the classification of Eumycota. The recombinant polynucleotides and coding sequences of the present disclosure may be of fungal origin. The fungal origin of the recombinant polynucleotides may be from fungi including species from the groups Blastocladiomycetes, Chytridiomycetes, Glomeromycetes, Microsporidia, Neocallimastigomycetes, Ascomycetes, and Basidiomycetes.

[0078] In some embodiments, the sADH is a fungal sADH. In some embodiments, the sADH is an ascomycete sADH, a firmicutes sADH, or a saccharomycete sADH.

[0079] The present disclosure relates to cyanobacteria, which are a type of photoautotrophic bacteria. Photoautotrophic bacteria are typically Gram-negative rods which obtain their energy from sunlight through the processes of photosynthesis. In this process, sunlight energy is used in the synthesis of carbohydrates, which in recombinant photoautotrophs can be further used as intermediates in the synthesis of biofuels. In other embodiments, the photoautotrophs serve as a source of carbohydrates for use by a nonphotosynthetic microorganism (e.g., recombinant E. coli) to produce biofuels by a metabolically engineered microorganism. Certain photoautotrophs called anoxygenic photoautotrophs grow only under anaerobic conditions and neither use water as a source of hydrogen nor produce oxygen from photosynthesis. Other photoautotrophic bacteria are oxygenic photoautotrophs. These bacteria are typically cyanobacteria. They use chlorophyll pigments and photosynthesis in photosynthetic processes resembling those in algae and complex plants. During the process, they use water as a source of hydrogen and produce oxygen as a product of photosynthesis (see, e.g., US 2011/0250060).

[0080] In some embodiments, the present disclosure provides cyanobacteria that contain recombinant polynucleotides for use in the production of acetoin and 2,3-butanediol. In some embodiments, the cyanobacteria are a Synechococcus sp. In some embodiments, the Synechococcus sp. is Synechococcus elongatus. In some embodiments, the Synechococcus elongatus is Synechococcus elongatus PCC7942. One of skill in the art will recognize that other cyanobacteria can be used according to the present disclosure. Other exemplary cyanobacteria include marine cyanobacteria such as Synechococcus sp. WH8102, thermostable cyanobacteria such as Thermosynechococcus elongatus BP-1, photoheterotrophic cyanobacteria such as Synechocystis sp. PCC6803, and filamentous cyanobacteria such as Nostoc punctiforme.

[0081] Cyanobacteria include various types of bacterial rods and cocci, as well as certain filamentous forms. The cells contain thylakoids, which are cytoplasmic, platelike membranes containing chlorophyll. The organisms produce heterocysts, which are specialized cells believed to function in the fixation of nitrogen compounds (see, e.g., US 2011/0250060).

[0082] As used herein, "photosynthetic environment" refers to any environment where the environmental conditions are suitable to allow a photosynthetic cell to perform photosynthesis. Notably, such an environment would contain sufficient quantities of carbon dioxide (CO2) and light to allow photosynthesis to occur.

[0083] The CO2 may be provided to the organism by a variety of means. CO2 is naturally present in the atmosphere, so no additional supplementation of this compound to the organism may be necessary as long as the environment surrounding the organism contains sufficient quantities of CO2 to allow a photosynthetic organism to perform photosynthesis. This is preferred, as this method removes CO2 from the environment and uses it as a starting molecule in the biological synthesis of a commodity chemical.

[0084] The light used in the methods of the present disclosure may be of any quality, quantity, and from any source known in the art sufficient to allow a photosynthetic organism to perform photosynthesis. The quality of light used may be of any wavelength in the visible light spectrum, such as from 400 nm to 800 nm, or any other quality of light suitable to promote photosynthesis. The quantity of light in the photosynthetic environment may be of any quantity as long as the quantity of light is sufficient to allow photosynthesis to occur. Quantities of light suitable for use in the methods of the present disclosure are provided in Example 1. However, these quantities of light are merely exemplary and are in no way limiting of the quantity of light that could be used in the disclosed methods. One of skill in the art would readily understand the scope of light quantities suitable for use in the disclosed methods.

[0085] The source of light may be from any light-emitting source such that the light is sufficient to promote photosynthesis in a photosynthetic organism. The light may be from a fluorescent bulb or a light-emitting diode. Alternatively, the light may be from natural sunlight. One of skill would readily understand that many different light sources could be used in the methods of the present disclosure. Possible light sources are provided in Example 1, but these are mere possibilities and in no way limit the scope of light sources applicable to the methods of the present disclosure.

[0086] In some embodiments, the production of one or more of acetoin and 2,3-butanediol occurs as a result of culturing recombinant cyanobacteria of the present disclosure under constant light. One of skill will readily recognize that constant light energy is only one method of providing light to drive photosynthesis. Other light duration regimes also have use in the present disclosure and will be apparent to those of skill in the art.

[0087] In some embodiments, the production of one or more of acetoin and 2,3-butanediol occurs as a result of culturing recombinant cyanobacteria of the present disclosure in the presence of bicarbonate.

[0088] In preferred embodiments, the methods of the disclosure produce transformed cyanobacteria that produce higher levels of acetoin as compared to control cyanobacteria lacking the recombinant polynucleotide under the same conditions. In preferred embodiments, the methods of the disclosure produce transformed cyanobacteria that produce higher levels of 2,3-butanediol as compared to control cyanobacteria lacking the recombinant polynucleotide under the same conditions.

[0089] As used herein, control cyanobacteria refer to cyanobacteria that are substantially the same as the recombinant cyanobacteria, except that the control cyanobacteria lack the recombinant polynucleotides of the recombinant cyanobacteria described in the present disclosure. Examples of control cyanobacteria include wild-type cyanobacteria of the same species as the recombinant cyanobacteria. Such wild-type cyanobacteria could include the parent of the recombinant cyanobacteria. Other control cyanobacteria include those carrying the vectors of the present disclosure but without the recombinant polynucleotides located in the vector. Other types of control cyanobacteria or organisms will be apparent to those skilled in the art.

[0090] Exemplary methods of detecting acetoin and 2,3-butanediol concentrations from organisms are provided in Example 1. However, one of skill in the art would recognize that many different methods of detecting metabolite concentrations are known and practiced. Any method that can quantify the concentration of acetoin and/or 2,3-butanediol in a sample could have application in the present disclosure.

[0091] The 2,3-butanediol molecule has two asymmetric carbons, and thus can have differential chirality at each stereocenter. The 2,3-butanediol produced in the methods of the present disclosure may be (R,R)-2,3-butanediol, it may be (S,S)-2,3-butanediol, or it may be meso-2,3-butanediol in which one asymmetric carbon of the molecule has an S-configuration and one asymmetric carbon of the molecule has an R-configuration.
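By way of illustration only, the three stereoisomers named above can be enumerated from the two stereocenters. The following is a minimal sketch, not part of the disclosed methods; the function name is hypothetical, and the sketch relies on the symmetry of the 2,3-butanediol molecule, under which the (R,S) and (S,R) assignments describe the same meso compound.

```python
from itertools import product


def butanediol_stereoisomers():
    """List the distinct stereoisomers of 2,3-butanediol (hypothetical helper).

    Each of the two asymmetric carbons is either R or S. Because the two
    halves of the molecule are identical, (R,S) and (S,R) are the same
    meso compound, so four assignments collapse to three stereoisomers.
    """
    names = set()
    for c2, c3 in product("RS", repeat=2):
        if c2 == c3:
            names.add(f"({c2},{c3})-2,3-butanediol")
        else:
            names.add("meso-2,3-butanediol")
    return sorted(names)


if __name__ == "__main__":
    for name in butanediol_stereoisomers():
        print(name)
```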

Supplemental Information

[0092] The practice of the present disclosure will employ, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry and immunology, which are within the skill of the art. Such techniques are explained fully in the literature, such as, Molecular Cloning: A Laboratory Manual, second edition (Sambrook et al., 1989); Oligonucleotide Synthesis (Gait, ed., 1984); Animal Cell Culture (Freshney, ed., 1987); Handbook of Experimental Immunology (Weir & Blackwell, eds.); Gene Transfer Vectors for Mammalian Cells (Miller & Calos, eds., 1987); Current Protocols in Molecular Biology (Ausubel et al., eds., 1987); PCR: The Polymerase Chain Reaction (Mullis et al., eds., 1994); Current Protocols in Immunology (Coligan et al., eds., 1991); The Immunoassay Handbook (Wild ed., Stockton Press NY, 1994); Bioconjugate Techniques (Hermanson, ed., Academic Press, 1996); and Methods of Immunological Analysis (Masseyeff, Albert, and Staines, eds., Weinheim: VCH Verlagsgesellschaft mbH, 1993).

[0093] The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences. For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters. When comparing two sequences for identity, it is not necessary that the sequences be contiguous, but any gap would carry with it a penalty that would reduce the overall percent identity. For blastn, the default parameters are Gap opening penalty=5 and Gap extension penalty=2. For blastp, the default parameters are Gap opening penalty=11 and Gap extension penalty=1.
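The relationship between identical positions, gaps, and percent identity can be sketched as follows. This is an illustrative example only: it assumes the two sequences have already been aligned by one of the programs discussed herein, it uses "-" as the gap character, and it counts every gapped column in the denominator, which is one common convention rather than the formula of any particular program. The function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pairwise alignment.

    Assumption: gapped columns count toward the alignment length, so each
    gap position lowers the resulting percent identity.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap character.
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if not (a == gap and b == gap)
    ]
    if not columns:
        return 0.0
    # A column is identical only when both residues match and neither is a gap.
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Five identical columns out of six aligned columns (one gap).
    print(round(percent_identity("ACGT-A", "ACGTTA"), 1))
```

Under this convention a single gap in a six-column alignment with no mismatches yields five identical columns out of six.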

[0094] A "comparison window", as used herein, includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150, in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison can be conducted using known algorithms (e.g., by the local homology algorithm of Smith and Waterman, Adv Appl Math, 2:482, 1981; by the homology alignment algorithm of Needleman and Wunsch, J Mol Biol, 48:443, 1970; by the search for similarity method of Pearson and Lipman, Proc Natl Acad Sci USA, 85:2444, 1988; by computerized implementations of these algorithms, such as FASTDB (Intelligenetics), BLAST (National Center for Biotechnology Information), and GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package (Genetics Computer Group, Madison, WI); or by manual alignment and visual inspection).

[0095] A preferred example of an algorithm that is suitable for determining percent sequence identity and sequence similarity is the FASTA algorithm (Pearson and Lipman, Proc Natl Acad Sci USA, 85:2444, 1988; and Pearson, Methods Enzymol, 266:227-258, 1996). Preferred parameters used in a FASTA alignment of DNA sequences to calculate percent identity are optimized, BL50 Matrix 15:-5, k-tuple=2; joining penalty=40, optimization=28; gap penalty=-12, gap length penalty=-2; and width=16.

[0096] Other preferred examples of algorithms suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms (Altschul et al., Nuc Acids Res, 25:3389-3402, 1997; and Altschul et al., J Mol Biol, 215:403-410, 1990, respectively). BLAST and BLAST 2.0 are used, with the parameters described herein, to determine percent sequence identity for the nucleic acids and proteins of the disclosure. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information website. This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold. These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction is halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a word length (W) of 11, an expectation (E) of 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a word length of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (Henikoff and Henikoff, Proc Natl Acad Sci USA, 89:10915, 1992), alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
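The three halting conditions for word-hit extension described above can be sketched for nucleotide sequences as follows. This is a simplified, ungapped, rightward-only illustration and is not the BLAST implementation. It assumes the seed word is an exact match, uses the M=5 and N=-4 scores given above, and uses an arbitrary illustrative value for X. The function and parameter names are hypothetical.

```python
def extend_right(query, subject, q_start, s_start, word_len,
                 match=5, mismatch=-4, x_drop=20):
    """Extend an exact seed hit to the right without gaps.

    Returns (best_score, best_length), where best_length counts residues
    from the start of the seed. Extension halts when the score falls
    x_drop below its maximum, when it reaches zero or below, or when
    either sequence ends.
    """
    # Score of the seed word itself (assumed to be an exact match).
    score = best = match * word_len
    best_len = word_len
    i, j = q_start + word_len, s_start + word_len
    while i < len(query) and j < len(subject):  # end of either sequence
        score += match if query[i] == subject[j] else mismatch
        if score > best:
            best, best_len = score, i - q_start + 1
        elif best - score >= x_drop or score <= 0:
            break  # X-drop condition or non-positive cumulative score
        i += 1
        j += 1
    return best, best_len


if __name__ == "__main__":
    print(extend_right("ACGTACGTAC", "ACGTACGTAC", 0, 0, 4))
    print(extend_right("ACGTTTTT", "ACGTAAAA", 0, 0, 4, x_drop=8))
```

A full implementation would also extend to the left of the seed and, for amino acid sequences, would replace the match and mismatch scores with a lookup in a scoring matrix.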

[0097] The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin and Altschul, Proc Natl Acad Sci USA, 90:5873-5877, 1993). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than about 0.01, and most preferably less than about 0.001.

[0098] Another example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment from a group of related sequences using progressive, pairwise alignments to show relationship and percent sequence identity. It also plots a tree or dendrogram showing the clustering relationships used to create the alignment. PILEUP uses a simplification of the progressive alignment method (Feng and Doolittle, J Mol Evol, 35:351-360, 1987), employing a method similar to a published method (Higgins and Sharp, CABIOS 5:151-153, 1989). The program can align up to 300 sequences, each of a maximum length of 5,000 nucleotides or amino acids. The multiple alignment procedure begins with the pairwise alignment of the two most similar sequences, producing a cluster of two aligned sequences. This cluster is then aligned to the next most related sequence or cluster of aligned sequences. Two clusters of sequences are aligned by a simple extension of the pairwise alignment of two individual sequences. The final alignment is achieved by a series of progressive, pairwise alignments. The program is run by designating specific sequences and their amino acid or nucleotide coordinates for regions of sequence comparison and by designating the program parameters. Using PILEUP, a reference sequence is compared to other test sequences to determine the percent sequence identity relationship using the following parameters: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be obtained from the GCG sequence analysis software package, e.g., version 7.0 (Devereux et al., Nuc Acids Res, 12:387-395, 1984).

[0099] Another preferred example of an algorithm that is suitable for multiple DNA and amino acid sequence alignments is the CLUSTALW program (Thompson et al., Nucl Acids Res, 22:4673-4680, 1994). ClustalW performs multiple pairwise comparisons between groups of sequences and assembles them into a multiple alignment based on homology. Gap open and Gap extension penalties were 10 and 0.05 respectively. For amino acid alignments, the BLOSUM algorithm can be used as a protein weight matrix (Henikoff and Henikoff, Proc Natl Acad Sci USA, 89:10915-10919, 1992).

[00100] Polynucleotides of the disclosure further include polynucleotides that encode conservatively modified variants of the polypeptides of Table 4 and the nucleic acid and amino acid sequences of SEQ ID NOS:1-23. "Conservatively modified variants" as used herein include individual mutations that result in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the disclosure. The following eight groups contain amino acids that are conservative substitutions for one another: 1. Alanine (A), Glycine (G); 2. Aspartic acid (D), Glutamic acid (E); 3. Asparagine (N), Glutamine (Q); 4. Arginine (R), Lysine (K); 5. Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6. Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 7. Serine (S), Threonine (T); and 8. Cysteine (C), Methionine (M).
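The eight groups listed above can be encoded directly to check whether a given amino acid substitution is conservative. The following is a minimal sketch using one-letter amino acid codes; the names CONSERVATIVE_GROUPS and is_conservative are hypothetical and are introduced only for illustration.

```python
# The eight conservative substitution groups listed above, as one-letter codes.
CONSERVATIVE_GROUPS = [
    set("AG"),    # 1. Alanine, Glycine
    set("DE"),    # 2. Aspartic acid, Glutamic acid
    set("NQ"),    # 3. Asparagine, Glutamine
    set("RK"),    # 4. Arginine, Lysine
    set("ILMV"),  # 5. Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6. Phenylalanine, Tyrosine, Tryptophan
    set("ST"),    # 7. Serine, Threonine
    set("CM"),    # 8. Cysteine, Methionine
]


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if both residues share at least one of the eight groups."""
    original, replacement = original.upper(), replacement.upper()
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(
        original in group and replacement in group for group in CONSERVATIVE_GROUPS
    )


if __name__ == "__main__":
    print(is_conservative("I", "V"))  # both in group 5
    print(is_conservative("A", "D"))  # no shared group
```

Methionine appears in both group 5 and group 8, so it is treated as a conservative replacement for members of either group.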

[00101] The terms "derived from" or "of" when used in reference to a nucleic acid or protein indicate that its sequence is identical or substantially identical to that of an organism of interest.

[00102] The term "corresponding" when used in reference to a cyanobacterium, refers to a cyanobacterium of the same genus and species as the cyanobacterium of interest. For instance, in regard to an S. elongatus comprising a recombinant polynucleotide encoding an ALS and an ALDC, a "corresponding cyanobacterium" is an S. elongatus cell (wild type, parental, or otherwise comparable) lacking the recombinant polynucleotide.

[00103] The terms "decrease," "reduce" and "reduction" as used in reference to biological function (e.g., enzymatic activity, production of compound, expression of a protein, etc.) refer to a measurable lessening in the function by preferably at least 10%, more preferably at least 50%, still more preferably at least 75%, and most preferably at least 90%. Depending upon the function, the reduction may be from 10% to 100%. The term "substantial reduction" and the like refers to a reduction of at least 50%, 75%, 90%, 95% or 100%.

[00104] The terms "increase," "elevate" and "elevation" as used in reference to biological function (e.g., enzymatic activity, production of compound, expression of a protein, etc.) refer to a measurable augmentation in the function by preferably at least 10%, more preferably at least 50%, still more preferably at least 75%, and most preferably at least 90%. Depending upon the function, the elevation may be from 10% to 100%; or at least 10-fold, 100-fold, or 1000-fold up to 100-fold, 1000-fold or 10,000-fold or more. The term "substantial elevation" and the like refers to an elevation of at least 50%, 75%, 90%, 95% or 100%.

[00105] The terms "isolated" and "purified" as used herein refer to a material that is removed from at least one component with which it is naturally associated (e.g., removed from its original environment). The term "isolated," when used in reference to a biosynthetically-produced chemical, refers to a chemical that has been removed from the culture medium of the bacteria that produced the chemical. As such, an isolated chemical is free of extraneous or unwanted compounds (e.g., substrate molecules, bacterial components, etc.).

[00106] As used herein, the singular forms "a", "an", and "the" include plural references unless indicated otherwise. For example, "an" ALD includes one or more ALDs.

[00107] The phrase "comprising" as used herein is open-ended, indicating that such embodiments may include additional elements. In contrast, the phrase "consisting of" is closed, indicating that such embodiments do not include additional elements (except for trace impurities). The phrase "consisting essentially of" is partially closed, indicating that such embodiments may further comprise elements that do not materially change the basic characteristics of such embodiments. It is understood that aspects and embodiments described herein as "comprising" include "consisting of" and/or "consisting essentially of" aspects and embodiments.

EXAMPLES

[00108] To better facilitate an understanding of the embodiments of the disclosure, the following examples are presented. The following examples are merely illustrative and are not meant to limit any embodiments of the present disclosure in any way.

[00109] Abbreviations: ALDC (acetolactate decarboxylase); ALS (acetolactate synthase); sADH (secondary alcohol dehydrogenase); and 23BD (2,3-butanediol).

EXAMPLE 1 - Production of Acetoin and 2,3-Butanediol in Recombinant Microorganisms

[00110] The following example describes the engineering of heterologous acetoin and 2,3-butanediol biosynthetic pathways in an exemplary photosynthetic microorganism (e.g., Synechococcus elongatus), as well as the successful production of these commodity chemicals from carbon dioxide and light energy.

Materials and Methods

[00111] Reagents. The chemicals (R,R)-2,3-butanediol, meso-2,3-butanediol, (S,S)-2,3-butanediol and acetoin were obtained from Sigma-Aldrich (St. Louis, MO). NADH and IPTG were obtained from Fisher Scientific (Hanover Park, IL). Phusion polymerase was purchased from NEB (Ipswich, MA). KOD polymerase and NADPH were purchased from EMD4Biosciences (San Diego, CA). Gentamicin was purchased from Teknova (Hollister, CA). Oligonucleotides were synthesized by Integrated DNA Technologies, Inc. (San Diego, CA).

[00112] Strains and Plasmids. Strains described herein are listed in Table 1, while plasmids described herein are listed in Table 2. All plasmids except pAL60 were constructed using sequence and ligase independent cloning (SLIC) (50) in E. coli XL1-Blue (Agilent Technologies, Santa Clara, CA). Primers for construction and genotype verification are listed in Table 3.

Table 1: Microorganism Strains

All strains are from this study except XL1-Blue (Agilent Technologies) and PCC7942 (Golden et al., Methods Enzymol, 153:215-231, 1987).

Table 2: Plasmids

Plasmid    Description (a)
pAL306     As pAL60, but PLlacO1:: alsS (B. s.)-alsD (A. h.); GentR
pAL307     As pAL60, but PLlacO1:: alsS (B. s.)-alsD (G. x.); GentR
pAL308     As pAL60, but PLlacO1:: alsS (B. s.)-alsD (E. a.)-adh (C. p.); GentR
pAL309     As pAL60, but PLlacO1:: alsS (B. s.)-alsD (E. a.)-adh (L. p.); GentR
pAL310     As pAL60, but PLlacO1:: alsS (B. s.)-alsD (E. a.)-adh (C. b.); GentR
pAL312     As pAL60, but PLlacO1; GentR
pAL315     As pAL60, but PLlacO1:: alsS (B. s.)-alsD (E. a.)-adh (T. b.); GentR

(a) All plasmids are from this study except pSA69 (Atsumi et al., Nature, 451:86-89, 2008).

Table 3: Oligonucleotides used in Plasmid Construction and Verification of Integration

Name     Sequence (5' -> 3')                         Plasmid
IM44     GTACCTTTCTCCTCTTCTAACTTTCTACTGAACGGATGGC    pAL308, pAL309, pAL310, pAL315
IM45     TAGAAGAGGAGAAAGGTACATGAAAGGTTTTGCCA         pAL310
IM46     CAGGTCGACTCTAGAGGATCTCTACAGGATTACGAC        pAL310
IM47     GTTAGAAGAGGAGAAAGGTACATGAAGGGTTTCGC         pAL315
IM48     CAGGTCGACTCTAGAGGATCTCTATGCCAAAATGAC        pAL315
IM49     TTAGAAGAGGAGAAAGGTACATGGGGGAGATTGAG         pAL308
IM50     CAGGTCGACTCTAGAGGATCTCTAGGGGCATGTGTAA       pAL308
IM51     TTAGAAGAGGAGAAAGGTACATGACAAAGAAAGT          pAL309
IM52     AGGTCGACTCTAGAGGATCTCTAGTGAAACTGCATG        pAL309
IM114    GGTCGACTCTAGAGGATCTTGTACCTTTCTCCTCTTTAA     pAL312
IM125    GTACCTTTCTCCTCTTCTAACCCTCAGCCGCACGGATAGC    pAL299, pAL300
IM11     AGATCCTCTAGAGTCGACCTG                       pAL299, pAL300, pAL312

[00113] A neutral site (NS) located between Synpcc7942_0893 (903,564 - 904,283 bp) and Synpcc7942_0894 (904,845 - 905,417 bp) in the S. elongatus chromosome was used for insertion of an expression cassette. This region was amplified with primers MC173 and MC176. PCR products were digested with AatII and AvrII and cloned into pZE12-luc (51) cut with the same enzymes, creating pAL60.

[00114] The fragment containing PLlacO1 and alsS (B. s.) genes was amplified with primers IM103 and IM11 and lacP was amplified with primers IM39 and AK3 from pSA69 (52). The resulting fragments were inserted into pAL60 by SLIC, creating pAL301.

[00115] To clone alsD (E. a.), we used genomic DNA of E. aerogenes ATCC 13048 (ATCC) as a PCR template with primers IM16 and IM17. To clone alsD (E. c.), genomic DNA of E. cloacae ATCC 13047 (ATCC) was used as a PCR template with primers IM19 and IM20. To clone alsD (B. l.), genomic DNA of B. licheniformis ATCC 14580 (ATCC) was used as a PCR template with primers IM23 and IM24. To clone alsD (G. x.), genomic DNA of G. xylinus (NBRC 3288) was used as a PCR template with primers IM21 and IM22. alsD (B. s.) and alsD (A. h.) were chemically synthesized by DNA2.0 Inc. (Menlo Park, CA) to optimize codon usage for S. elongatus. Each alsD gene was cloned downstream of alsS (B. s.) on pAL301 by SLIC, creating pAL302, pAL303, pAL304, pAL305, pAL306 and pAL307. To construct plasmid pAL312, we used plasmid pAL301 as a PCR template and primers IM114 and IM11 to amplify the entire plasmid, without the alsS gene. The resulting fragment was assembled by SLIC. All four adh genes were chemically synthesized by DNA2.0 Inc. (Menlo Park, CA) to optimize codon usage for S. elongatus. Each adh gene was cloned downstream of alsD (E. a.) on pAL302 by SLIC, creating pAL308, pAL309, pAL310 and pAL315. The adh (T. b.) and adh (C. b.) genes were cloned downstream of alsD (A. h.) on pAL306 by SLIC, creating pAL299 and pAL300, respectively.

[00116] Transformation of S. elongatus. Transformation of S. elongatus was performed as described (53). Strains were segregated several times by transferring colonies to fresh selective plates. Correct recombinants were confirmed by PCR to verify integration of targeting genes into the chromosome. The strains used and constructed are listed in Table 1. The NS between Synpcc7942_0893 (903,564 - 904,283 bp) and Synpcc7942_0894 (904,845 - 905,417 bp) on the S. elongatus chromosome was used as a targeting site for recombination. It was confirmed that insertion of the gentamicin resistance gene at this site does not affect the growth of cells.

[00117] Oxygen Evolution. Evolution of O2 was measured using a Clark-type electrode with the Oxygraph system (Hansatech Instruments Ltd, Norfolk, UK). Under ambient light conditions, 1 ml of cells was transferred to the 4 ml borosilicate glass chamber and headspace gas was expelled using a center bored contact plunger with rubber cap. Cells were stirred at 100 rpm using a magnetic flea, and subjected to 2 minutes of darkness to allow the cells to equilibrate with the surrounding water jacket to 25 °C. A constant negative rate over at least 30 s was recorded after equilibration. Cells were then subjected to excess light (60 μE s⁻¹ m⁻²) and allowed to equilibrate for at least 2 minutes until a constant rate could be measured over at least 60 s.

[00118] Enzyme Assays. S. elongatus cells were collected 72 h after induction by centrifugation (4000g, 5 min), washed in 50 mM potassium phosphate buffer (pH 7.5) and resuspended in the same buffer. Crude extracts were prepared with 0.1-mm glass beads and a Mini Bead Beater 8 (BioSpec Products, Inc., Bartlesville, OK). Total protein was determined using Advanced Protein Assay Reagent from Cytoskeleton, Inc. (Denver, CO).

[00119] Acetolactate synthase (ALS) activity was determined as described (54). The concentration of acetoin produced was measured by a standard curve using pure acetoin. One specific unit of AlsS activity corresponds to the formation of 1 nmol of acetoin per mg of protein per minute.

[00120] Alcohol dehydrogenase (ADH) activity was determined by measuring the oxidation of NAD(P)H. The reaction mixture contained 50 mM 3-(N-morpholino) propanesulfonic acid (MOPS) pH 7.0, 25 mM acetoin and 0.2 mM NAD(P)H. The consumption of NAD(P)H was monitored at 340 nm. One specific unit of ADH activity corresponds to the oxidation of 1 nmol of NAD(P)H per minute per mg of protein.
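The unit definition above can be illustrated with a short calculation. The following is a minimal sketch, not the procedure used in the disclosure: it assumes the commonly used molar extinction coefficient of NAD(P)H at 340 nm (6220 M⁻¹ cm⁻¹) and a 1 cm path length, and the function name, reaction volume, and protein amount are hypothetical values chosen for illustration.

```python
# Molar extinction coefficient of NAD(P)H at 340 nm, in M^-1 cm^-1.
# This standard value is an assumption; the source text does not state it.
EPSILON_340 = 6220.0


def adh_specific_activity(dA340_per_min, reaction_volume_ml, protein_mg,
                          path_length_cm=1.0):
    """Return nmol NAD(P)H oxidized per minute per mg of protein."""
    # Beer-Lambert law: concentration change (mol/L per min) = dA / (epsilon * l).
    rate_molar_per_min = abs(dA340_per_min) / (EPSILON_340 * path_length_cm)
    # mol/L/min multiplied by litres gives mol/min; 1e9 converts mol to nmol.
    nmol_per_min = rate_molar_per_min * (reaction_volume_ml / 1000.0) * 1e9
    return nmol_per_min / protein_mg


# Hypothetical reading: absorbance falls by 0.0622 per minute in a 1 ml
# reaction containing 0.1 mg of crude extract protein.
example_activity = adh_specific_activity(-0.0622, 1.0, 0.1)
print(f"{example_activity:.1f} nmol min^-1 mg^-1")
```

The absolute value is taken because NAD(P)H consumption appears as a decreasing absorbance.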

[00121] Calculation of Productivities. The following narrative provides a summary of how productivity calculations were made for specific compounds (FIG. 7).

[00122] For 2,3-butanediol, productivity per day was averaged over 3 days. Per day yields were 175 mg L⁻¹, 297 mg L⁻¹, and 237 mg L⁻¹ for the time periods from 24 h - 48 h, 48 h - 72 h, and 72 h - 96 h respectively. By converting units this becomes 7292 μg L⁻¹ h⁻¹, 12375 μg L⁻¹ h⁻¹, and 9875 μg L⁻¹ h⁻¹ respectively. The average of these rates is 9847 μg L⁻¹ h⁻¹.
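The averaging described above can be reproduced with a few lines of Python. This is an illustrative sketch only; the variable names are ours, and the three daily yields are the values stated in the paragraph.

```python
HOURS_PER_DAY = 24
UG_PER_MG = 1000

# Daily 2,3-butanediol yields (mg per litre) for 24-48 h, 48-72 h and 72-96 h.
daily_yields_mg_per_l = [175, 297, 237]

# Convert each daily yield to an hourly rate in micrograms per litre per hour.
hourly_rates = [y * UG_PER_MG / HOURS_PER_DAY for y in daily_yields_mg_per_l]

# The reported productivity is the mean of the three hourly rates.
average_rate = sum(hourly_rates) / len(hourly_rates)

for rate in hourly_rates:
    print(f"{rate:.0f} ug L^-1 h^-1")
print(f"3 day average: {average_rate:.0f} ug L^-1 h^-1")
```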

[00123] For isobutyraldehyde, the rate is calculated in literature (55).

[00124] For fatty acids, the apparent maximum titer was 197 mg L⁻¹ which was produced over a minimum of 2 days. Converting units gives 4104.2 μg L⁻¹ h⁻¹ (56).

[00125] For ethanol, the apparent maximum is 13 millimoles L⁻¹ produced in 145 hours. Using the molar mass of ethanol (46.07 g mol⁻¹), this becomes 575.9 mg L⁻¹ over 145 hours. Converting units gives 3972 μg L⁻¹ h⁻¹ (57).

[00126] For isobutanol, published titer is 450 mg L⁻¹ over 6 days. Converting units gives 3125 μg L⁻¹ h⁻¹ (55).

[00127] For acetone, the apparent maximum titer was 36 mg L⁻¹ over 4 days. Converting units gives 375 μg L⁻¹ h⁻¹ (58).

[00128] For ethylene, the apparent maximum rate was 240 nl ml⁻¹ h⁻¹. This can be approximated as 9.81 μmol L⁻¹ h⁻¹ using the molar volume of an ideal gas at ambient temperature and pressure (24.465 L mol⁻¹ at 25°C and 1 atm) and converting units. Using the molar mass of ethylene (28.05 g mol⁻¹) this becomes 275.17 μg L⁻¹ h⁻¹ (59).

[00129] For 1-butanol, the apparent maximum titer was 19 mg L⁻¹ over 10 days. Converting units gives 79.2 μg L⁻¹ h⁻¹ (60).
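The ethylene conversion from a gas volume rate to a mass rate can be sketched as follows. The constants are those stated in the ethylene paragraph; the variable names are illustrative assumptions.

```python
# Values stated in the text.
ethylene_rate_nl_per_ml_h = 240.0      # nl of ethylene per ml of culture per hour
molar_volume_l_per_mol = 24.465        # ideal gas at 25 degrees C and 1 atm
molar_mass_g_per_mol = 28.05           # ethylene

# 1 nl per ml equals 1 ul per litre, so the numeric value is unchanged.
rate_ul_per_l_h = ethylene_rate_nl_per_ml_h

# ul divided by (L/mol) gives umol, because 1 ul = 1e-6 L.
rate_umol_per_l_h = rate_ul_per_l_h / molar_volume_l_per_mol

# umol multiplied by g/mol gives ug.
rate_ug_per_l_h = rate_umol_per_l_h * molar_mass_g_per_mol

print(f"{rate_umol_per_l_h:.2f} umol L^-1 h^-1")
print(f"{rate_ug_per_l_h:.2f} ug L^-1 h^-1")
```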

[00130] For fatty alcohol, the apparent maximum titer was 137.63 μg L⁻¹ over 4 days. Converting units gives 0.48 μg L⁻¹ h⁻¹ (61).
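Several of the comparisons above reduce to dividing a final titer by the production time. A minimal sketch of that conversion is shown below for four of the compounds, using the titers and durations stated in the text. The function name is hypothetical, and the result is an average rate over the whole period, not a peak rate.

```python
def average_rate_ug_per_l_h(titer_mg_per_l, days):
    """Average productivity in ug L^-1 h^-1 from a titer (mg/L) reached in `days`."""
    return titer_mg_per_l * 1000.0 / (days * 24.0)


# (titer in mg per litre, production time in days), as stated in the text.
published = {
    "fatty acids": (197, 2),
    "isobutanol": (450, 6),
    "acetone": (36, 4),
    "1-butanol": (19, 10),
}

rates = {name: average_rate_ug_per_l_h(t, d) for name, (t, d) in published.items()}

for name, rate in rates.items():
    print(f"{name}: {rate:.1f} ug L^-1 h^-1")
```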

[00131] Culture Conditions. Unless otherwise specified, S. elongatus strains were cultured in BG-11 medium (45) with the addition of 50 mM NaHCO₃. Cells were grown at 30°C with rotary shaking (100 rpm) and light (55 μE s⁻¹ m⁻²) provided by four 86 cm 20 W fluorescent tubes 5 cm above the cell cultures. Cell growth was monitored by measuring OD₇₃₀.

[00132] For acetoin and 23BD production in S. elongatus, cells in exponential phase were diluted to an OD₇₃₀ of 0.1 in 25 ml BG-11 medium including 50 mM NaHCO₃, 10 mg/L thiamine, and 10 mg/L gentamicin in 125 ml baffled shake flasks. Cultures were grown to an OD₇₃₀ of 0.4-0.6 before induction with 1 mM IPTG. Every 24 hours, the pH was adjusted to 7.5 +/- 0.4 with 10 N HCl. 10% of the culture volume was removed and an equal volume of BG-11 containing 0.5 M NaHCO₃ was added, achieving a final concentration of 50 mM NaHCO₃ in the culture.
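The daily bicarbonate replacement is a simple dilution calculation. The sketch below is illustrative only: the function name and the `residual_mM` parameter are assumptions, and the stated 50 mM final concentration corresponds to the case in which the bicarbonate already in the culture has been consumed.

```python
def bicarbonate_after_feed(culture_ml, residual_mM,
                           removed_fraction=0.10, feed_mM=500.0):
    """Return the NaHCO3 concentration (mM) after the daily medium exchange."""
    removed_ml = culture_ml * removed_fraction
    # Bicarbonate left in the flask after removing part of the culture (mmol).
    remaining_mmol = residual_mM * (culture_ml - removed_ml) / 1000.0
    # Bicarbonate supplied by the equal volume of 0.5 M feed (mmol).
    added_mmol = feed_mM * removed_ml / 1000.0
    return (remaining_mmol + added_mmol) / culture_ml * 1000.0


# 25 ml culture with no residual bicarbonate: the feed alone supplies 50 mM.
depleted_case = bicarbonate_after_feed(25.0, 0.0)
# Hypothetical case with 20 mM still present before the exchange.
partial_case = bicarbonate_after_feed(25.0, 20.0)
print(depleted_case, partial_case)
```

Any bicarbonate remaining before the exchange adds to the 50 mM supplied by the feed, scaled by the 90% of the culture that is retained.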

[00133] For acetoin and 2,3-butanediol production in E. coli, overnight cultures were diluted 1:100 into 5 ml of modified M9 medium (33) containing 50 g/L glucose, 5 g/L yeast extract and 5 mg/L gentamicin in a 30-ml test tube. Cells were grown at 37°C to an OD of 0.2-0.4 followed by addition of 0.1 mM IPTG. Production was continued at 30°C on a rotary shaker (250 rpm) for 40 hours.

[00134] Acetoin Quantification. Acetoin was quantified by the method of Voges and Proskauer (46-47), adapted to small volume on 96 well plates. Sample concentration was varied between 1-10% of final volume to achieve a result within the linear range of detection. This was achieved by dilution in H₂O to 100 μl initial volume. For an assay containing 2% sample (most common), 98 μl water and 2 μl of the supernatant were added to wells and mixed. To this was added 100 μl of a solution, prepared at the time of use, consisting of one part 5% naphthol dissolved in 2.5 N NaOH and one part 0.5% creatine in water. The assay was monitored every 5 minutes and final readings were taken after 40 min, when the slope of the absorbance curve matched the background oxidation rate of naphthol. Triplicate measurements of no less than 3 standards, including at least one value each above, below and within the desired range, were included in every assay.
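Reading a sample against the standards and correcting for the dilution can be sketched as below. This is a hedged illustration, not the analysis used in the disclosure: the standard concentrations and absorbance values are invented for the example, a simple least-squares line is assumed to describe the linear range, and the function names are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def acetoin_in_supernatant(absorbance, slope, intercept, sample_fraction):
    """Back-calculate the undiluted concentration from a diluted well."""
    in_well = (absorbance - intercept) / slope
    # A 2% sample (2 ul in 100 ul) corresponds to a 50-fold dilution.
    return in_well / sample_fraction


# Hypothetical standards: acetoin (mg/L in the 100 ul well) and absorbance.
standard_conc = [0.0, 2.0, 4.0, 8.0]
standard_abs = [0.05, 0.25, 0.45, 0.85]
slope, intercept = fit_line(standard_conc, standard_abs)

# Hypothetical sample read at 0.35 absorbance units in a 2% assay.
sample_mg_per_l = acetoin_in_supernatant(0.35, slope, intercept, 0.02)
print(f"{sample_mg_per_l:.0f} mg/L acetoin in the supernatant")
```

Standards are assumed to receive the same reagent addition as samples, so the reagent volume cancels out of the calculation.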

[00135] 2,3-Butanediol Quantification. Supernatant samples from cultures were analyzed with gas chromatography (GC) (Shimadzu) equipped with flame ionization detector and an HP-chiral 20b column (30 m, 0.32-mm internal diameter, 0.25-mm film thickness; Agilent Technologies). Samples were prepared by mixing 9 parts supernatant (diluted as necessary in H₂O) with 1 part internal standard. For each analysis the GC oven temperature was held at 40°C for 4 min, increased with a gradient of 15°C min⁻¹ until 235°C, and held for 4 min. Ultra high purity helium was used as the carrier gas. The temperatures of the injector and detector were set at 250°C. The stereoisomers were identified by matching retention time to standards for (R,R)-23BD, meso-23BD and (S,S)-23BD.
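For planning purposes, the length of one GC run follows directly from the oven program described above. The short sketch below uses only the stated temperatures, ramp rate, and hold times; the variable names are illustrative, and cool-down and equilibration time between injections are not included.

```python
initial_temp_c = 40.0
final_temp_c = 235.0
ramp_c_per_min = 15.0
initial_hold_min = 4.0
final_hold_min = 4.0

# Time spent on the linear temperature ramp.
ramp_min = (final_temp_c - initial_temp_c) / ramp_c_per_min

# Total oven program time per injection.
total_run_min = initial_hold_min + ramp_min + final_hold_min
print(f"ramp: {ramp_min:.0f} min, total: {total_run_min:.0f} min")
```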

Results

[00136] 23BD exhibits low toxicity in S. elongatus. To increase the titer and duration of chemical production, low toxicity or constant removal of the product is necessary. Because constant removal and purification of small concentrations during production is not cost-effective on an industrial scale, it was a prerequisite of this study that the chemical target be tolerated at an acceptable volume of greater than 1% (10 g/L) by production strains.

[00137] To evaluate acetoin and 23BD toxicity, we tested the growth of S. elongatus over 72 h in the presence of 23BD or acetoin. Growth decreased approximately 50% in the presence of 0.2 g/L acetoin, and stopped at 1.0 g/L (FIG. 2A), indicating acute toxicity for this precursor. This is comparable to isobutyraldehyde and isobutanol, which prevent growth of S. elongatus at 1 g/L (14). Conversely, growth of S. elongatus was barely inhibited in the presence of 10 g/L 23BD (FIG. 2B), and cells still exhibited growth in the presence of 30 g/L 23BD, surpassing our benchmark goal for product tolerance. These results indicate that 23BD is a suitable target for high titer and long-term cyanobacterial production, as long as high flux through acetoin can be maintained to prevent accumulation of the toxic intermediate.

[00138] Construction of the Acetoin Biosynthetic Pathway. Acetoin can be produced by the decarboxylation of 2-acetolactate. In this pathway (FIG. 1) two pyruvate molecules are converted into 2-acetolactate by acetolactate synthase (ALS) encoded by alsS. 2-Acetolactate is then decarboxylated to yield acetoin by 2-acetolactate decarboxylase (ALDC) encoded by alsD. Pyruvate, the source of carbon for the pathway, is produced naturally through the fixation of three CO₂ molecules in the Calvin-Benson-Bassham cycle (32). Conversion of pyruvate to 2-acetolactate occurs naturally during valine/leucine biosynthesis, albeit in low amounts (33). Previously the alsS gene which encodes ALS from Bacillus subtilis (B. s.) was overexpressed to increase carbon flux to 2-acetolactate for the production of isobutyraldehyde and was reported to have relatively high activity (14).
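The carbon accounting of the two-step pathway can be checked with a small atom balance. The following is an illustrative sketch that is not part of the original disclosure. It assumes the neutral (protonated) molecular formulas of the metabolites, and the dictionary and function names are hypothetical. The CO₂ released by ALS when two pyruvate molecules condense is standard biochemistry that the paragraph above leaves implicit.

```python
from collections import Counter

# Molecular formulas of the neutral (protonated) species; an assumption made
# so that each reaction balances without tracking protons or charge.
FORMULAS = {
    "pyruvate": Counter(C=3, H=4, O=3),
    "2-acetolactate": Counter(C=5, H=8, O=4),
    "acetoin": Counter(C=4, H=8, O=2),
    "CO2": Counter(C=1, O=2),
}


def atoms(side):
    """Sum the atoms on one side of a reaction given {species: coefficient}."""
    total = Counter()
    for species, coefficient in side.items():
        for element, count in FORMULAS[species].items():
            total[element] += coefficient * count
    return total


def is_balanced(substrates, products):
    return atoms(substrates) == atoms(products)


# ALS: 2 pyruvate -> 2-acetolactate + CO2
als_ok = is_balanced({"pyruvate": 2}, {"2-acetolactate": 1, "CO2": 1})
# ALDC: 2-acetolactate -> acetoin + CO2
aldc_ok = is_balanced({"2-acetolactate": 1}, {"acetoin": 1, "CO2": 1})

# Each pyruvate derives from three fixed CO2, so six carbons are fixed per acetoin.
carbons_fixed = 2 * 3
carbons_retained = FORMULAS["acetoin"]["C"]
print(als_ok, aldc_ok, carbons_retained, "of", carbons_fixed, "fixed carbons retained")
```

Under these assumptions, four of the six fixed carbons end up in acetoin and two are released again as CO₂.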

[00139] To identify strong ALDC candidates, we used the bioinformatics tool BRaunschweig ENzyme DAtabase (BRENDA) (34) and a comprehensive literature review. We limited our search to O₂-insensitive enzymes, and looked for reports of strong acetoin production. We were further restricted by the need to match pre-sequencing literature reports to chronologically consistent strain names, which now match currently available gene sequences. Based on these criteria six alsD genes were selected (Table 4).

Table 4: Acetolactate Decarboxylase (ALDC) and Secondary Alcohol Dehydrogenase (ADH) Genes

Genes were synthesized with codon optimization for expression in S. elongatus.

[00140] To test acetoin production, each alsD gene was overexpressed with alsS (B. s.) under the isopropyl-β-D-thiogalactoside (IPTG) inducible promoter PLlacO1 (35) in E. coli. The cells were cultured in modified M9 medium, containing 50 g/L of glucose, at 30°C for 16 and 40 h. A control strain expressing only alsS (B. s.) produced 0.2 g/L acetoin, indicating that 2-acetolactate decomposes to acetoin in small amounts, which is consistent with previous observations (36-37). When alsD was coexpressed, more than 20 g/L of acetoin was produced, indicating that autodecarboxylation is not a major contributor to 2-acetolactate conversion (FIG. 3B). All ALDC except that from Enterobacter cloacae (E. c.) were active in E. coli, and displayed a pattern of activity that was consistent through 16 h and 40 h of production (FIG. 3B). The strain expressing alsD from Aeromonas hydrophila (A. h.) was the highest producer (21.0 g/L) followed by the strains expressing alsD from Gluconacetobacter xylinus (G. x.) (17.8 g/L), alsD from Bacillus licheniformis (B. l.) (16.7 g/L) and alsD from Enterobacter aerogenes (E. a.) (16.0 g/L) (FIG. 3B). The strain expressing codon optimized alsD (B. s.), which is the natural gene partner to the alsS (B. s.) used in the production operon, produced the least acetoin (6.6 g/L), which demonstrates that native pathways do not necessarily maintain their integrity when transferred to new hosts. Thus screening of multiple candidates revealed the optimal genes for pathway optimization in each new host (FIG. 3B).

[00141] Acetoin production in S. elongatus from CO₂. Following our screening strategy for pathway optimization, ALDC activity was compared in the photosynthetic cell environment of S. elongatus, based on production of acetoin during heterologous alsS and alsD expression. Each strain was cultured in 125 ml shake flasks with 25 ml BG-11 containing 50 mM NaHCO₃ in constant light (55 μE s⁻¹ m⁻²) at 30°C for 72 h (FIG. 3C). Strains expressing alsD from E. a., B. l., B. s., A. h., and G. x. produced 108 mg/L, 62 mg/L, 35 mg/L, 203 mg/L, and 14 mg/L respectively (FIG. 3C). Control strains, and the strain expressing alsD (E. c.), did not produce a measurable amount of acetoin in this host. Based on these results, we had two alsD genes (from E. a. and A. h.) capable of moderate and high production of acetoin respectively in S. elongatus. To avoid excessive acetoin toxicity we chose alsD (E. a.) as a starting point for sADH analysis.

[00142] In order to have an inducible expression system, lacIq, which encodes the E. coli lac repressor, was cloned upstream of PLlacO1 (FIG. 3A). The efficiency of LacI repression in the S. elongatus strain containing alsS (B. s.) and alsD (E. a.) was investigated by testing acetoin production with or without 1 mM of IPTG (FIG. 3D). Interestingly, acetoin production without IPTG was similar to that with 1 mM IPTG (FIG. 3D), suggesting that PLlacO1 was not repressed well by LacI in this construct. The promoter and coding region of lacIq were verified by Sanger sequencing. This phenomenon has been reported with other IPTG-inducible promoters in S. elongatus PCC7942 (38-39) and Synechocystis sp. PCC6803 (40).

[00143] Constructing the 23BD biosynthetic pathway. Acetoin can be reduced by a secondary alcohol dehydrogenase (sADH) to produce 23BD (FIG. 1). Identification of strong sADH candidates followed the same method used for ALDC, but in addition to low oxygen sensitivity, two more criteria were added. First, we limited our search to NADPH-dependent sADH as this cofactor is expected to have higher bioavailability during photosynthesis (22). Second, reduction of acetoin by sADH is a diastereoselective reaction, allowing us to choose enzymes to install either an R or S stereocenter. Two NADPH utilizing sADH with R-installing reaction sites had been characterized previously in E. coli (41). The availability of sADH with S-installing reaction sites and NADPH as a cofactor, however, was limited. In the end we chose four adh genes, two with R-installing reaction sites and two with S-installing reaction sites (Table 4). Plasmids were constructed harboring alsS (B. s.), alsD (E. a.) and each of the four adh under PLlacO1 (FIG. 4A).

[00144] The resulting E. coli strains were cultured in modified M9 medium, containing 50 g/L glucose, at 30°C for 40 h (FIG. 4B). The concentration of acetoin remained high for three out of the four strains, indicating that sADH activity is limiting in this cell environment. The fourth strain, expressing adh (C. b.), maintained a relatively low acetoin concentration (less than 6% of total production) and produced 13.8 g/L total 23BD as a mixture of (R,R)-23BD and meso-23BD stereoisomers forming 74% and 21% of total production respectively (FIG. 4B). The strains expressing adh (T. b.) and adh (C. p.) produced 2.4 g/L and 14.2 g/L 23BD respectively with both isomers formed in roughly equal amounts in each. High stereoselectivity was achieved in the strain expressing adh (L. p.), which produced 9.1 g/L meso-23BD exclusively. Enzyme activities measured from crude cell lysate isolated during production were high for both sADH (C. p.) and sADH (C. b.), at 265 and 440 nmol min⁻¹ mg⁻¹ respectively, when excess substrate was used (FIG. 4B). However, for the strain expressing adh (C. p.), accumulation of acetoin in the supernatant during production indicates that the enzyme turnover rate at the substrate concentrations present within the cell is slower than the rate of secretion. Relatively low activity of sADH (T. b.) was consistent with accumulation of acetoin in the strain, indicating that sADH activity is a bottleneck for production in this E. coli strain (FIG. 4B). The major 23BD product of each of the adh expressing strains matched the stereochemistry predicted by previous characterization (41-43).

[00145] 23BD production in S. elongatus from CO₂. To screen the differences in 23BD productivity in S. elongatus, each of the plasmids used for 23BD production in E. coli was used for transformation of S. elongatus. The engineered strains were cultured in 125 ml baffled shake flasks with 25 ml BG-11 containing 50 mM NaHCO₃ in constant light (55 μE s⁻¹ m⁻²) at 30°C.

[00146] 23BD production was detected in three out of four S. elongatus strains (FIG. 4C). Measurement of sADH performance in S. elongatus was made by comparison of acetoin and 23BD concentrations after 72 hours of growth, using the less active ALDC (E. a.) to lower toxicity in cases when acetoin conversion was low. The strain expressing adh (T. b.) produced 301 mg/L (R,R)-23BD with trace amounts of meso-23BD but also allowed for accumulation of acetoin (FIG. 4C). The strain expressing adh (C. b.) produced 270 mg/L (R,R)-23BD (major) and undetectable levels of acetoin, indicating high flux through the intermediate. The strain expressing adh (C. p.) produced 65 mg L⁻¹ 23BD, with meso-23BD as the primary product, and accumulated toxic levels of acetoin. The remaining S-installing adh (L. p.) was not active in S. elongatus, resulting only in accumulation of acetoin. Enzyme activities measured in crude cell lysate isolated during production showed a roughly 10-fold higher activity for adh (C. b.) than for adh (T. b.), 56.3 and 6.3 nmol min⁻¹ mg⁻¹ respectively, which could explain the accumulation of acetoin in the less active strain (FIG. 4C and 4D). Activity for adh (C. p.) was roughly 5-fold higher again than adh (C. b.); however, low production and acetoin accumulation were observed, similar to the result in E. coli. Enzyme activity could not be detected for the strain expressing adh (L. p.), indicating that the sADH enzyme is responsible for lack of production (FIG. 4C and 4D). The two sADH with highest production and lowest acetoin accumulation were further tested with the stronger ALDC gene alsD (A. h.). Both strains increased production, yielding 568 mg L⁻¹ from adh (T. b.) and 952 mg L⁻¹ from adh (C. b.), the latter of which is 3-fold higher than production with alsD (E. a.) over 72 hours (FIG. 4C). Both strains also showed increased acetoin concentrations, although neither reached toxic levels, accumulating 59 mg L⁻¹ and 61 mg L⁻¹ respectively.

[00147] Long-term production of 23BD in S. elongatus. The stability of the highest producing strains was verified by maintaining continuous production in 25 ml cultures at 30°C in the presence of constant light. The strain containing adh (C. b.) reached a total yield of 2.38 g/L (R,R)-23BD and a maximum production rate of 9847 μg L⁻¹ h⁻¹ (3 day average) (FIG. 5B). Production was sustained for 21 days. The strain containing adh (T. b.) showed similar results, reaching a total yield of 1.97 g L⁻¹, a maximum production rate of 7757 μg L⁻¹ h⁻¹ (3 day average), and sustaining production for 21 days (FIG. 5B). After 21 days, production in both cultures dropped off sharply and was not restored when cells were resuspended in fresh medium, indicating that changes in the culture population such as spontaneous mutations, which restore flux to metabolism, fundamentally impair production over time. Strains containing the 23BD biosynthetic pathway showed reduced growth compared to control strains, as expected mirroring the rate of carbon redirection from central metabolism (FIG. 5A, 5B and 5D). A second control strain containing only alsS (B. s.) and alsD (E. a.) produced acetoin up to toxic levels after the stationary phase was reached and showed impaired growth beyond what is attributable to carbon redirection.

[00148] Evaluating the photosynthetic efficiency of production strains. Evolution of O₂ from illuminated cells during continuous production was measured to verify whether the 23BD overproduction pathway could affect the photosynthetic system (FIG. 5C). Both strains expressing the 23BD biosynthetic pathway displayed a slightly higher rate of O₂ evolution per μg of chlorophyll compared to control strains (FIG. 5C). This rate increased during late stages of production. Both control strains, one with no production pathway and one with only the acetoin production pathway expressed, displayed similar rates of O₂ evolution (FIG. 5C). This trend follows the amount of fixed carbon diverted away from central metabolism, indicating that the burden placed on the cell by overproduction could stimulate a positive effect on the cell's photosynthetic efficiency (FIG. 5C and 5D).

Conclusions

[00149] As described herein for the first time, production of 23BD and acetoin directly from CO₂ and light was accomplished through engineering of the cyanobacterium S. elongatus. Pathway design was approached so as to match the production pathway to a photosynthetic host. Engineered strains achieved a production rate of 9847 μg L⁻¹ h⁻¹ and final titer of 2.4 g L⁻¹, with sustained production lasting for 21 days. These values, achieved during continuous production from CO₂ and light, compare favorably with other studies. The rate is 1.6-fold higher than that for isobutyraldehyde (6,230 μg L⁻¹ h⁻¹), and significantly higher than other products overproduced from exogenous pathways (FIG. 7). The percentage of biomass produced as 23BD ranges from 30% to 60% (FIG. 5D), which compares favorably to the maximum of 80% achieved during endogenous sucrose overproduction (38).

[00150] To construct the 23BD pathway, low toxicity was a priority for improved culture sustainability and thus commodity chemical production in S. elongatus. The negative effect of toxicity on pathway flux is reinforced by low production of acetoin, which is toxic above 0.1 g/L in S. elongatus (FIG. 2A), from the 23BD pathway without coexpression of adh (FIG. 6C). Addition of a strong adh to the operon to convert acetoin to 23BD increases total production of the pathway 10-fold (FIG. 5B and 6C), even though reduction by Adh is not an irreversible step, while production of acetoin is. Thus matching genes to their host provides a means for optimizing pathway function. All genes were screened for production in E. coli concurrently with cyanobacteria using identical operons. The patterns of production exhibited by the genes were different between hosts. The productivities of strains expressing alsD (B. l.) and alsD (G. x.) in S. elongatus were much lower (30% and 7% of top production respectively) (FIG. 3C) than strains overexpressing the same genes in E. coli (80% and 85% of top production respectively) (FIG. 3B). Conversely, sADH (T. b.), which displayed severely attenuated production in E. coli, achieved significant production in S. elongatus. Additionally, the enzyme encoded by adh (L. p.) was entirely inactive in S. elongatus despite production of 9.1 g/L meso-23BD in E. coli.
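The host-to-host comparison of alsD performance can be recomputed from the acetoin titers reported in the Results. The sketch below is illustrative: the dictionary and function names are assumptions, and alsD (E. c.) is left out because it gave no measurable acetoin in either host.

```python
# Acetoin titers reported in the Results (40 h in E. coli, 72 h in S. elongatus).
ACETOIN_E_COLI_G_PER_L = {
    "A. h.": 21.0, "G. x.": 17.8, "B. l.": 16.7, "E. a.": 16.0, "B. s.": 6.6,
}
ACETOIN_S_ELONGATUS_MG_PER_L = {
    "A. h.": 203, "E. a.": 108, "B. l.": 62, "B. s.": 35, "G. x.": 14,
}


def percent_of_top(titers):
    """Express each titer as a percentage of the best producer in that host."""
    top = max(titers.values())
    return {gene: 100.0 * value / top for gene, value in titers.items()}


e_coli_pct = percent_of_top(ACETOIN_E_COLI_G_PER_L)
cyano_pct = percent_of_top(ACETOIN_S_ELONGATUS_MG_PER_L)

for gene in ("B. l.", "G. x."):
    print(f"alsD ({gene}): {e_coli_pct[gene]:.1f}% of top in E. coli, "
          f"{cyano_pct[gene]:.1f}% of top in S. elongatus")
```

Normalizing within each host avoids comparing absolute titers obtained under different culture conditions and time scales.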

[00151] Using 23BD production as a model system allowed for inclusion of stereoselectivity as part of the pathway design (FIG. 1). Chirality can be costly to install in chemical synthesis; however, biological control offers a much simpler route to these products. In all known cases in nature, acetoin is generated from 2-acetolactate containing an R-stereocenter, resulting in (S,S)-23BD not being observed. However, autodecarboxylation of 2-acetolactate, or enolate racemization of acetoin, could possibly form (S)-acetoin in the cell and result in (S,S)-23BD production in the presence of S-installing sADH enzymes. Two pathways were designed, one for each stereoisomer (Table 4). In S. elongatus both R-installing strains tested in long-term production consistently produced (R,R)-23BD as the major product, although a trace amount of meso-23BD was observed in production by the strain expressing adh (T. b.). Additionally, the strain expressing adh (C. p.), which produced mixed isomers in E. coli, produced only meso-23BD in S. elongatus. In this study no production of (S,S)-23BD was detected, indicating that degradation products do not contribute significantly to the pathway.

[00152] During long-term production, evolution of O₂ per μg of chlorophyll increased in production strains relative to a control containing the recombination cassette but no alsS, alsD, or adh genes (FIG. 6C). This indicates that the stress imposed on metabolism by production elicits an increase in photosynthetic efficiency. Chlorophyll and O₂ production have been seen to increase in roughly equal amounts during similar overproduction of sucrose in engineered S. elongatus. Defining the engineering principles for photosynthetic organisms is an important landmark in the search for sustainable technologies. Biological production of 23BD by heterotrophic microbes has attracted attention for many years because of the existence of natural fermentative producers, and the chemical's potential as a versatile carbon feedstock for plastics, solvents, and fuel. The biosynthetic production rate and titer achieved using the tools of the present disclosure mark a large increase in cyanobacterial yields.

[00153] References

1. McFarlane J & Robinson S (2007) Survey of Alternative Feedstocks for Commodity Chemical Manufacturing. Oak Ridge National Laboratory.

2. Serferlein KE (2008) Annual Energy Review (E.I Administration).

3. Raupach MR, et al. (2007) Global and regional drivers of accelerating CO2 emissions. Proc Natl Acad Sci U S A 104(24):10288-10293.

4. Herzog H & Golomb D (2004) In Encyclopedia of Energy. Edited by Cleveland CJ. New York. 277-287.

5. Ruffing AM (2011) Engineered cyanobacteria: teaching an old bug new tricks. Bioeng Bugs 2(3):136-149.

6. Machado IM & Atsumi S (2012) Cyanobacterial biofuel production. J Biotechnol doi: 10.1016/j.jbiotec.2012.03.005.

7. Ducat DC, Way JC, & Silver PA (2011) Engineering cyanobacteria to generate high-value products. Trends Biotechnol 29(2):95-103.

8. Scharlemann JP & Laurance WF (2008) Environmental science. How green are biofuels? Science 319(5859):43-44.

9. Field CB, Behrenfeld MJ, Randerson JT, & Falkowski P (1998) Primary production of the biosphere: integrating terrestrial and oceanic components. Science 281(5374):237-240.

10. Golden SS, Brusslan J, & Haselkorn R (1987) Genetic engineering of the cyanobacterial chromosome. Methods Enzymol 153:215-231.

11. Heidorn T, et al. (2011) Synthetic biology in cyanobacteria engineering and analyzing novel functions. Methods Enzymol 497:539-579.

12. Huang HH, Camsund D, Lindblad P, & Heidorn T (2010) Design and characterization of molecular tools for a Synthetic Biology approach towards developing cyanobacterial biotechnology. Nucleic Acids Res 38(8):2577-2593.

13. Keasling JD (2008) Synthetic biology for synthetic chemistry. ACS Chem Biol 3(1):64-76.

14. Atsumi S, Higashide W, & Liao JC (2009) Direct photosynthetic recycling of carbon dioxide to isobutyraldehyde. Nat Biotechnol 27(12):1177-1180.

15. Lan EI & Liao JC (2012) ATP drives direct photosynthetic production of 1-butanol in cyanobacteria. Proc Natl Acad Sci U S A 109(16):6018-6023.

16. Takahama K, Matsuoka M, Nagahama K, & Ogawa T (2003) Construction and analysis of a recombinant cyanobacterium expressing a chromosomally inserted gene for an ethylene-forming enzyme at the psbAI locus. J Biosci Bioeng 95(3):302-305.

17. Lindberg P, Park S, & Melis A (2010) Engineering a platform for photosynthetic isoprene production in cyanobacteria, using Synechocystis as the model organism. Metab Eng 12(1):70-79.

18. Zhou J, Zhang H, Zhang Y, Li Y, & Ma Y (2012) Designing and creating a modularized synthetic pathway in cyanobacterium Synechocystis enables production of acetone from carbon dioxide. Metab Eng 14(4):394-400.

19. Liu X, Sheng J, & Curtiss R, 3rd (2011) Fatty acid production in genetically modified cyanobacteria. Proc Natl Acad Sci U S A 108(17):6899-6904.

20. Tan X, et al. (2011) Photosynthesis driven conversion of carbon dioxide to fatty alcohols and hydrocarbons in cyanobacteria. Metab Eng 13(2):169-176.

21. Shen CR, et al. (2011) Driving forces enable high-titer anaerobic 1-butanol synthesis in Escherichia coli. Appl Environ Microbiol 77(9):2905-2915.

22. Lan EI & Liao JC (2011) Metabolic engineering of cyanobacteria for 1-butanol production from carbon dioxide. Metab Eng 13(4):353-363.

23. Bond-Watts BB, Bellerose RJ, & Chang MC (2011) Enzyme mechanism as a kinetic control element for designing synthetic biofuel pathways. Nat Chem Biol 7(4):222-227.

24. Tran AV & Chambers RP (1987) The dehydration of fermentative 2,3-butanediol into methyl ethyl ketone. Biotechnol Bioeng 29(3):343-351.

25. van Haveren J, Scott EL, & Sanders J (2008) Bulk chemicals from biomass. Biofuels Bioprod Bioref 2:41-57.

26. Syu MJ (2001) Biological production of 2,3-butanediol. Appl Microbiol Biotechnol 55(1):10-18.

27. Ji XJ, Huang H, & Ouyang PK (2011) Microbial 2,3-butanediol production: a state-of-the-art review. Biotechnol Adv 29(3):351-364.

28. Celinska E & Grajek W (2009) Biotechnological production of 2,3-butanediol— current state and prospects. Biotechnol Adv 27 (6):715-725.

29. Wijffels RH & Barbosa MJ (2010) An outlook on microalgal biofuels. Science 329(5993):796-799.

30. Greenwell HC, Laurens LM, Shields RJ, Lovitt RW, & Flynn KJ (2010) Placing microalgae on the biofuels priority list: a review of the technological challenges. J R Soc Interface 7(46):703-726.

31. Eiteman MA & Gainer JL (1989) In situ extraction versus the use of an external column in fermentation. Appl Microbiol Biotechnol 30:614-618.

32. Blankenship RE (2002) Carbon Metabolism. Molecular Mechanisms of Photosynthesis, (Blackwell Science Ltd), pp 172-203.

33. Atsumi S, Hanai T, & Liao JC (2008) Non-fermentative pathways for synthesis of branched-chain higher alcohols as biofuels. Nature 451(7174):86-89.

34. Scheer M, et al. (2011) BRENDA, the enzyme information system in 2011. Nucleic Acids Res. 39:D670-676.

35. Lutz R & Bujard H (1997) Independent and tight regulation of transcriptional units in Escherichia coli via the LacR/O, the TetR/O and AraC/I1-I2 regulatory elements. Nucleic Acids Res 25(6):1203-1210.

36. Aristidou AA, San KY, & Bennett GN (1994) Modification of central metabolic pathway in Escherichia coli to reduce acetate accumulation by heterologous expression of the Bacillus subtilis acetolactate synthase gene. Biotechnol Bioeng 44(8):944-951.

37. Park HS, Xing R, & Whitman WB (1995) Nonenzymatic acetolactate oxidation to diacetyl by flavin, nicotinamide and quinone coenzymes. Biochim Biophys Acta 1245(3):366-370.

38. Ducat DC, Avelar-Rivas JA, Way JC, & Silver PA (2012) Rerouting carbon flux to enhance photosynthetic productivity. Appl Environ Microbiol 78(8):2660-2668.

39. Mutsuda M, Michel KP, Zhang X, Montgomery BL, & Golden SS (2003) Biochemical properties of CikA, an unusual phytochrome-like histidine protein kinase that resets the circadian clock in Synechococcus elongatus PCC 7942. J Biol Chem 278(21):19102-19110.

40. Ng WO, Zentella R, Wang Y, Taylor JS, & Pakrasi HB (2000) PhrA, the major photoreactivating factor in the cyanobacterium Synechocystis sp. strain PCC 6803 codes for a cyclobutane-pyrimidine-dimer-specific DNA photolyase. Arch Microbiol 173(5-6):412-417.

41. Yan Y, Lee CC, & Liao JC (2009) Enantioselective synthesis of pure (R,R)-2,3-butanediol in Escherichia coli with stereospecific secondary alcohol dehydrogenases. Org Biomol Chem 7(19):3914-3917.

42. Zhang R, et al. (2008) Crystal structure of a carbonyl reductase from Candida parapsilosis with anti-Prelog stereospecificity. Protein Sci 17(8):1412-1423.

43. Rattray FP, Walfridsson M, & Nilsson D (2000) Purification and characterization of a diacetyl reductase from Leuconostoc pseudomesenteroides. International Dairy Journal 10.

44. Najmudin S, et al. (2003) Purification, crystallization and preliminary X-ray crystallographic studies on acetolactate decarboxylase. Acta Crystallogr D Biol Crystallogr 59(Pt 6):1073-1075.

45. Rippka R, Deruelles J, Waterbury JB, Herdman M, & Stanier RY (1979) Generic Assignments, Strain Histories and Properties of Pure Cultures of Cyanobacteria. Journal of General Microbiology 111:1-61.

46. Voges O & Proskauer B (1898) Beitraege zur Ernaehrungsphysiologie und zur Differential Diagnose der Bakterien der hemmorrhagischen Septicamie. Z. Hyg. 28.

47. Westerfeld WW (1945) A colorimetric determination of blood acetoin. J Biol Chem 161:495-502.

48. Godtfredsen SE, Lorck H, & Sigsgaard P (1983) On the occurrence of α-acetolactate decarboxylases among microorganisms. Carlsberg Res. Commun. 48:239-247.

49. Dexter J & Pengcheng F (2009) Metabolic engineering of cyanobacteria for ethanol production. Energy & Environ. Sci. 2:857-864.

50. Li MZ & Elledge SJ (2007) Harnessing homologous recombination in vitro to generate

recombinant DNA via SLIC. Nat Methods 4(3):251-256.

51. Lutz R & Bujard H (1997) Independent and tight regulation of transcriptional units in Escherichia coli via the LacR/O, the TetR/O and AraC/Il-I2 regulatory elements. Nucleic Acids Res 25(6): 1203- 1210.

52. Atsumi S, Hanai T, & Liao JC (2008) Non-fermentative pathways for synthesis of branched-chain higher alcohols as biofuels. Nature 451(7174):86-89.

53. Golden SS, Brusslan J, & Haselkorn R (1987) Genetic engineering of the cyanobacterial chromosome. Methods Enzymol 153:215-231.

54. Yang YT, Peredelchuk M, Bennett GN, & San KY (2000) Effect of variation of Klebsiella pneumoniae acetolactate synthase expression on metabolic flux redistribution in Escherichia coli. Biotechnol Bioeng 69(2):150-159.

55. Atsumi S, Higashide W, & Liao JC (2009) Direct photosynthetic recycling of carbon dioxide to isobutyraldehyde. Nat Biotechnol 27(12):1177-1180.

56. Liu X, Sheng J, & Curtiss R, 3rd (2011) Fatty acid production in genetically modified cyanobacteria. Proc Natl Acad Sci U S A 108(17):6899-6904.

57. Dexter J & Pengcheng F (2009) Metabolic engineering of cyanobacteria for ethanol production. Energy & Environ. Sci. 2:857-864.

58. Zhou J, Zhang H, Zhang Y, Li Y, & Ma Y (2012) Designing and creating a modularized synthetic pathway in cyanobacterium Synechocystis enables production of acetone from carbon dioxide. Metab Eng 14(4):394-400.

59. Takahama K, Matsuoka M, Nagahama K, & Ogawa T (2003) Construction and analysis of a recombinant cyanobacterium expressing a chromosomally inserted gene for an ethylene-forming enzyme at the psbAI locus. J Biosci Bioeng 95(3):302-305.

60. Lan EI & Liao JC (2012) ATP drives direct photosynthetic production of 1-butanol in cyanobacteria. Proc Natl Acad Sci U S A 109(16):6018-6023.

61. Tan X, et al. (2011) Photosynthesis driven conversion of carbon dioxide to fatty alcohols and hydrocarbons in cyanobacteria. Metab Eng 13(2):169-176.

SEQUENCES

SEQ ID NO: 1: PLlacO1 promoter

AATTGTGAGCGGATAACAATTGACATTGTGAGCGGATAACAAGATACTGAGCACATCAGC AGGACGCA CTGACC

SEQ ID NO: 2: Bacillus subtilis acetolactate synthase - alsS gene nucleotide sequence

ATGTTGACAAAAGCAACAAAAGAACAAAAATCCCTTGTGAAAAACAGAGGGGCGGAGCTT GTTGTTGA TTGCTTAGTGGAGCAAGGTGTCACACATGTATTTGGCATTCCAGGTGCAAAAATTGATGC GGTATTTG ACGCTTTACAAGATAAAGGACCTGAAATTATCGTTGCCCGGCACGAACAAAACGCAGCAT TCATGGCC CAAGCAGTCGGCCGTTTAACTGGAAAACCGGGAGTCGTGTTAGTCACATCAGGACCGGGT GCCTCTAA CTTGGCAACAGGCCTGCTGACAGCGAACACTGAAGGAGACCCTGTCGTTGCGCTTGCTGG AAACGTGA TCCGTGCAGATCGTTTAAAACGGACACATCAATCTTTGGATAATGCGGCGCTATTCCAGC CGATTACA AAATACAGTGTAGAAGTTCAAGATGTAAAAAATATACCGGAAGCTGTTACAAATGCATTT AGGATAGC GTCAGCAGGGCAGGCTGGGGCCGCTTTTGTGAGCTTTCCGCAAGATGTTGTGAATGAAGT CACAAATA CGAAAAACGTGCGTGCTGTTGCAGCGCCAAAACTCGGTCCTGCAGCAGATGATGCAATCA GTGCGGCC ATAGCAAAAATCCAAACAGCAAAACTTCCTGTCGTTTTGGTCGGCATGAAAGGCGGAAGA CCGGAAGC AATTAAAGCGGTTCGCAAGCTTTTGAAAAAGGTTCAGCTTCCATTTGTTGAAACATATCA AGCTGCCG GTACCCTTTCTAGAGATTTAGAGGATCAATATTTTGGCCGTATCGGTTTGTTCCGCAACC AGCCTGGC GATTTACTGCTAGAGCAGGCAGATGTTGTTCTGACGATCGGCTATGACCCGATTGAATAT GATCCGAA ATTCTGGAATATCAATGGAGACCGGACAATTATCCATTTAGACGAGATTATCGCTGACAT TGATCATG CTTACCAGCCTGATCTTGAATTGATCGGTGACATTCCGTCCACGATCAATCATATCGAAC ACGATGCT GTGAAAGTGGAATTTGCAGAGCGTGAGCAGAAAATCCTTTCTGATTTAAAACAATATATG CATGAAGG TGAGCAGGTGCCTGCAGATTGGAAATCAGACAGAGCGCACCCTCTTGAAATCGTTAAAGA GTTGCGTA ATGCAGTCGATGATCATGTTACAGTAACTTGCGATATCGGTTCGCACGCCATTTGGATGT CACGTTAT TTCCGCAGCTACGAGCCGTTAACATTAATGATCAGTAACGGTATGCAAACACTCGGCGTT GCGCTTCC TTGGGCAATCGGCGCTTCATTGGTGAAACCGGGAGAAAAAGTGGTTTCTGTCTCTGGTGA CGGCGGTT TCTTATTCTCAGCAATGGAATTAGAGACAGCAGTTCGACTAAAAGCACCAATTGTACACA TTGTATGG AACGACAGCACATATGACATGGTTGCATTCCAGCAATTGAAAAAATATAACCGTACATCT GCGGTCGA TTTCGGAAATATCGATATCGTGAAATATGCGGAAAGCTTCGGAGCAACTGGCTTGCGCGT AGAATCAC CAGACCAGCTGGCAGATGTTCTGCGTCAAGGCATGAACGCTGAAGGTCCTGTCATCATCG ATGTCCCG GTTGACTACAGTGATAACATTAATTTAGCAAGTGACAAGCTTCCGAAAGAATTCGGGGAA CTCATGAA AACGAAAGCTCTC

SEQ ID NO: 3: Bacillus subtilis acetolactate synthase - alsS amino acid sequence

MLTKATKEQKSLVKNRGAELVVDCLVEQGVTHVFGIPGAKIDAVFDALQDKGPEI IVARHEQNAAFMA QAVGRLTGKPGVVLVTSGPGASNLATGLLTANTEGDPVVALAGNVIRADRLKRTHQSLDN AALFQPIT KYSVEVQDVKNIPEAVTNAFRIASAGQAGAAFVSFPQDVVNEVTNTKNVRAVAAPKLGPA ADDAISAA IAKIQTAKLPVVLVGMKGGRPEAIKAVRKLLKKVQLPFVETYQAAGTLSRDLEDQYFGRI GLFRNQPG DLLLEQADVVLTIGYDPIEYDPKFWNINGDRTIIHLDEIIADIDHAYQPDLELIGDIPST INHIEHDA VKVEFAEREQKILSDLKQYMHEGEQVPADWKSDRAHPLEIVKELRNAVDDHVTVTCDIGS HAIWMSRY FRSYEPLTLMISNGMQTLGVALPWAIGASLVKPGEKVVSVSGDGGFLFSAMELETAVRLK APIVHIVW NDSTYDMVAFQQLKKYNRTSAVDFGNIDIVKYAESFGATGLRVESPDQLADVLRQGMNAE GPVIIDVP VDYSDNINLASDKLPKEFGELMKTKAL
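Each coding sequence in this listing is paired with the amino acid sequence it encodes, for example SEQ ID NO: 2 and SEQ ID NO: 3. The following is a minimal sketch of how such a pair could be cross-checked. It assumes that the standard genetic code applies and that the spaces inside the printed sequences are layout artifacts rather than part of the sequence. The helper names `clean` and `translate` are hypothetical and are not part of the disclosure.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# standard table applies to the listed coding sequences).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon.

    A trailing partial codon (one or two bases) is ignored.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First printed block of SEQ ID NO: 2, copied with its layout space.
prefix = "ATGTTGACAAAAGCAACAAAAGAACAAAAATCCCTTGTGAAAAACAGAGGGGCGGAGCTT GTTGTTGA"
print(translate(prefix))  # MLTKATKEQKSLVKNRGAELVV
```

The printed residues match the beginning of SEQ ID NO: 3. Running the same comparison over a full nucleotide and amino acid pair would flag positions where a character was lost in reproduction.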

SEQ ID NO: 4: Enterobacter aerogenes KCTC 2190 / ATCC 13048 acetolactate decarboxylase - alsD gene nucleotide sequence

ATGAATCATGCTTCAGATTGCACCTGTGAAGAGAGTCTGTGTGAAACGCTACGCGCGTTT TCCGCTCA

GCATCCCGATAGCGTGCTGTATCAAACTTCGCTGATGAGCGCCCTGCTCAGCGGCGT CTACGAAGGTA

CCACCACCATTGCGGACCTGCTGAAGCACGGTGATTTCGGGCTCGGCACTTTTAATG AACTCGACGGC

GAGCTGATCGCGTTTAGCAGCCAGGTTTATCAACTGCGTGCCGACGGCAGCGCGCGT AAAGCGCGTCC

GGAACAGAAAACGCCGTTTGCGGTGATGACCTGGTTTCAGCCGCAGTACCGTAAAAC CTTTGACCATC

CGGTCAGCCGCCAGCAGCTGCATGAGGTTATTGACCAGCAAATTCCTTCCGACAATC TGTTCTGCGCG

CTGCGAATCGATGGTCATTTCCGCCACGCCCATACCCGCACCGTGCCTCGTCAGACG CCGCCCTACCG

GGCGATGACCGACGTGCTCGACGATCAGCCGGTTTTCCGCTTTAACCAGCGTGACGG CGTACTGGTCG

GTTTTCGTACCCCGCAGCATATGCAGGGAATTAACGTCGCCGGCTATCACGAACACT TCATTACCGAT

GACCGCCAGGGCGGCGGCCACCTGCTGGACTACCAGCTCGACCATGGGGTATTGACC TTCGGCGAAAT

TCATAAGCTGATGATCGACCTTCCCGCCGACAGCGCGTTCCTGCAGGCCAATTTGCA TCCCGATAATC

TCGATGCCGCCATCCGTTCAGTAGAAAGTTAG

SEQ ID NO: 5: Enterobacter aerogenes KCTC 2190 / ATCC 13048 acetolactate decarboxylase - alsD amino acid sequence

MNHASDCTCEESLCETLRAFSAQHPDSVLYQTSLMSALLSGVYEGTTTIADLLKHGDFGL GTFNELDG ELIAFSSQVYQLRADGSARKARPEQKTPFAVMTWFQPQYRKTFDHPVSRQQLHEVIDQQI PSDNLFCA LRIDGHFRHAHTRTVPRQTPPYRAMTDVLDDQPVFRFNQRDGVLVGFRTPQHMQGINVAG YHEHFITD DRQGGGHLLDYQLDHGVLTFGEIHKLMIDLPADSAFLQANLHPDNLDAAIRSVES

SEQ ID NO: 6: Enterobacter cloacae subsp. cloacae ATCC 13047 acetolactate decarboxylase - alsD gene nucleotide sequence

ATGAGCGCCCTGCTAAGCGGTGTCTACGAAGGGGACACCACCATCGCCGATCTGCTGGCA CATGGTGA

TTTTGGTCTGGGCACCTTCAACGAGCTGGACGGCGAAATGATTGCCTTCAGCAGCCA GGTGTACCAGC

TGCGCGCCGACGGCAGCGCACGCGCCGCGAAGCCAGAGCAGAAAACGCCGTTCGCGG TGATGACCTGG

TTCCAGCCGCAGTACCGCAAAACCTTTGATGCGCCGGTCAGCCGTCAGCAGATCCAC GACGTGATCGA

CCAGCAAATTCCCTCGGATAACCTGTTCTGCGCGCTGCGCATCGACGGCAACTTCCG CCACGCCCACA

CCCGTACCGTACCGCGTCAGACGCCGCCATACCGCGCGATGACCGACGTGCTGGACG ACCAGCCGGTG

TTCCGCTTTAACCAGCGTGAAGGGGTGCTGGTTGGGTTCCGCACGCCGCAGCATATG CAGGGCATCAA

CGTGGCCGGCTATCACGAACATTTCATTACCGACGACCGTCAGGGCGGGGGACATCT GCTGGATTACC

AGCTGGAGAGCGGCGTGCTCACCTTTGGCGAAATACACAAGCTAATGATTGACCTGC CCGCCGACAGC

GCGTTTTTACAGGCCAACCTTCACCCCAGCAACCTTGATGCAGCGATCCGTTCCGTC GAAAACTAA

SEQ ID NO: 7: Enterobacter cloacae subsp. cloacae ATCC 13047 acetolactate decarboxylase - alsD amino acid sequence

MSALLSGVYEGDTTIADLLAHGDFGLGTFNELDGEMIAFSSQVYQLRADGSARAAKPEQK TPFAVMTW FQPQYRKTFDAPVSRQQIHDVIDQQIPSDNLFCALRIDGNFRHAHTRTVPRQTPPYRAMT DVLDDQPV FRFNQREGVLVGFRTPQHMQGINVAGYHEHFITDDRQGGGHLLDYQLESGVLTFGEIHKL MIDLPADS AFLQANLHPSNLDAAIRSVEN

SEQ ID NO: 8: Bacillus licheniformis ATCC 14580 acetolactate decarboxylase - alsD gene nucleotide sequence

ATGAAAAGTGCAAGCAAACAAAAAATAATTCAGCCCGTTGATAAGAACCTCGATCAAGTC TATCAGGT CTCAACGATGGTATCTTTATTGGACGGAATTTACGACGGGGATTTTTATATGTCCGAAGC GAAGGAGC ACGGAGACTTCGGGATCGGAACGTTCAACCGGCTCGACGGCGAGCTGATCGGTTTTGACG GTGAGTTT TACCGTCTCCGTTCCGATGGAAAAGCCTACCCAGTTCAAGGAAGCGATTGTTCTCCATTT TGCTCGCT GGCTTTCTTCCGGCCGGATATCTATCACGAAATCAAGCAGCGGATGCCGCTTGAGGCGTT CGAAGAAG AAATGAAACGGATCATGCCGAGTGAAAACCTGTTTTACGCGATTCGCATGGACGGAACCT TTAAGAAA GTCAAAACGAGAACAGTTGAACTTCAGGAAAAACCGTATGTGCCGATGGTTGATGCGGTA AAATCACA GCCGATCTTTGATTTTAATGATATTACGGGGACGATCGTCGGCTTTTGGACACCGCAATA TGCCAACG GAATCGCAGTTTCCGGCTTCCATCTTCACTTTATAGATGAAGACCGCAATGTCGGCGGAC ACGTTTTC GATTATGAAATCGAAGAATGCACGGTGCAAATTTCTCAAAAACTCAATATGAACCTCAGA TTGCCGAA TACGCAAGATTTCTTTCAAGCGGATTTCAATAAACACGATCTTGCAGCCGGAATTGAAGC GGCCGAAG GCAATCCCGAGTAA

SEQ ID NO: 9: Bacillus licheniformis ATCC 14580 acetolactate decarboxylase - alsD amino acid sequence

MKSASKQKI IQPVDKNLDQVYQVSTMVSLLDGIYDGDFYMSEAKEHGDFGIGTFNRLDGELIGFDGEF YRLRSDGKAYPVQGSDCSPFCSLAFFRPDIYHEIKQRMPLEAFEEEMKRIMPSENLFYAI RMDGTFKK VKTRTVELQEKPYVPMVDAVKSQPIFDFNDITGTIVGFWTPQYANGIAVSGFHLHFIDED RNVGGHVF DYEIEECTVQISQKLNMNLRLPNTQDFFQADFNKHDLAAGIEAAEGNPE

SEQ ID NO: 10: Bacillus subtilis acetolactate decarboxylase - alsD gene nucleotide sequence (codon usage is optimized for S. elongatus)

ATGAAACGTGAGTCGAACATTCAAGTCTTGAGCCGAGGCCAAAAGGACCAACCAGTC TCCCAGATCTA

CCAAGTGAGCACTATGACAAGTCTCTTGGACGGAGTCTACGATGGCGATTTTGAGCT CTCGGAAATTC

CGAAATATGGGGATTTCGGCATTGGGACCTTTAACAAACTGGACGGTGAACTGATCG GCTTTGATGGT

GAGTTCTACCGCCTGCGCAGTGATGGGACCGCCACGCCGGTTCAAAATGGCGACCGG AGCCCGTTTTG

CAGCTTTACATTCTTCACCCCCGACATGACACACAAGATTGATGCTAAAATGACTCG CGAAGATTTCG

AGAAAGAAATCAATTCGATGTTGCCTAGTCGTAATTTGTTTTATGCCATTCGCATCG ACGGTCTGTTT

AAGAAGGTGCAGACCCGCACGGTTGAACTCCAGGAGAAGCCGTACGTTCCTATGGTG GAAGCAGTCAA

GACGCAGCCCATCTTTAACTTCGACAATGTGCGCGGGACCATTGTCGGCTTCCTGAC GCCCGCTTATG

CGAACGGCATCGCTGTCTCTGGTTACCATCTCCACTTTATCGATGAAGGCCGAAATT CCGGAGGCCAT

GTTTTTGATTATGTGCTCGAAGATTGTACGGTGACCATCAGCCAGAAAATGAACATG AACTTGCGGCT

GCCAAATACCGCGGATTTCTTCAATGCAAACCTGGATAACCCCGATTTTGCCAAAGA TATTGAAACGA

CTGAGGGTTCTCCCGAGTAG

SEQ ID NO: 11: Bacillus subtilis acetolactate decarboxylase - alsD amino acid sequence

MKRESNIQVLSRGQKDQPVSQIYQVSTMTSLLDGVYDGDFELSEIPKYGDFGIGTFNKLD GELIGFDG EFYRLRSDGTATPVQNGDRSPFCSFTFFTPDMTHKIDAKMTREDFEKEINSMLPSRNLFY AIRIDGLF KKVQTRTVELQEKPYVPMVEAVKTQPIFNFDNVRGTIVGFLTPAYANGIAVSGYHLHFID EGRNSGGH VFDYVLEDCTVTISQKMNMNLRLPNTADFFNANLDNPDFAKDIETTEGSPE

SEQ ID NO: 12: Aeromonas hydrophila acetolactate decarboxylase - alsD gene nucleotide sequence (codon usage is optimized for S. elongatus)

ATGGAAACTAATAGCTCGTGCGATTGTGCAATCGAAATCTCGCAGCAATTTGCGCGCTGG CAGGCCCG TCAAGGTGGGGGCGAGGTCTACCAGTCCAGCCTGATGTCGGCACTGCTGGCGGGTGTTTA CGAAGGCG AAACCACAATGGCCGATCTGCTCCGCCACGGGGACTTTGGTCTGGGCACGTTTAACCGGC TGGACGGC GAACTCATTGCCTTTGAGCGGCAAATCCATCAGTTGAAAGCGGATGGATCTGCCCGACCC GCTCGCGC AGAACAGAAAACGCCGTTTGCCGTGATGACGCACTTCCGGCCGTGCTTGCAACGCCGGTT CGCTCATC CGCTGTCCCGCGAAGAAATTCACCAATGGGTCGATCGCCTCGTGGGCACTGACAACGTTT TCGTTGCA TTTCGACTGGATGGCTTGTTTGAGCAAGCGCAGGTCCGCACCGTCCCCTGTCAGAGCCCA CCCTATAA GCCCATGTTGGAGGCCATTGAAGCCCAGCCTCTGTTCAGTTTCAGTTTGCGGCGTGGGAC CCTCGTCG GCTTTCGCTGCCCACCCTTCGTGCAAGGCATTAACGTGGCTGGCTATCATGAACATTTCA TTACCGAG GATCGCCGAGGTGGGGGTCATATCTTGGATTACGCTATGGGACACGGCCAGCTCCAACTG AGCGTGGT TCAACACCTCAACATCGAGTTGCCTCGAAATCCTGCCTTTCAACAGGCAGACCTCAATCC GGCGGATC TGGACCGCGCTATCCGTGCGGCTGAGGGTTAG

SEQ ID NO: 13: Aeromonas hydrophila acetolactate decarboxylase - alsD amino acid sequence

METNSSCDCAIEISQQFARWQARQGGGEVYQSSLMSALLAGVYEGETTMADLLRHGDFGL GTFNRLDG ELIAFERQIHQLKADGSARPARAEQKTPFAVMTHFRPCLQRRFAHPLSREEIHQWVDRLV GTDNVFVA FRLDGLFEQAQVRTVPCQSPPYKPMLEAIEAQPLFSFSLRRGTLVGFRCPPFVQGINVAG YHEHFITE DRRGGGHILDYAMGHGQLQLSVVQHLNIELPRNPAFQQADLNPADLDRAIRAAEG

SEQ ID NO: 14: Acetobacter aceti, ssp. xylinum, NBRC 3288 (Gluconacetobacter xylinus) acetolactate decarboxylase - alsD gene nucleotide sequence

ATGGAAATAGGCTTTAATATATATTGGACGTACGAACCTGCCTGCATCACCATTAGTCTG CAATCACA AATGACCGGGTTGAGGCGATGCCATGTGCCGCATTGTCCCCCGATGCAGGAGACTGAGGT CGTGAAGC TTAAATGCTACTCGGTAGGGGATGTTGATACCCGGTCCAGCGCTGCTGATTCGACTGGCG TGCGTCCG CGCATGAACCGCCTGTACCAGACATCGACCATGGCCGCGCTGCTTGATGCGGTCTATGAT GGCGAGAC CACGCTTGATGAACTGCTGATGCACGGCAATTTCGGGCTGGGCACGTTCAACGGCCTTGA TGGCGAGA TGATCGTCAATGACAGCGTAATCCACCAGTTCCGTGCAGACGGGCAGGCCGGTCGTGTGC CGGGCGAC CTCAGGACTCCGTTCGCCTGCGTTACCTTCTTCAACCCGGAGAAGGAATACATGATCGAC ACCGCGCA GGATAAGGAAGGCTTCGAGGCGATCGTGGATCACCTCGTCAACAATCCCAACCTGTTCGC CGCCGTGC GCTTTACCGGCATGTTCGAGCGGGTCGAGACCCGCACCGTGTTCTGCCAGTGCCAGCCCT ACCCACCC ATGCTGGAAGTGGTGGCCCGCCAGCCCACCATGCAGCTTGGTGCCTCCACCGGCACCATG CTTGGTTT CCGCACGCCGGGCTACATGCAGGGCGTGAACGTGGCGGGTTATCACCTGCACTTCCTGAC TGAGGACG GACGCCGTGGCGGCCATGTGACCGATTACGGCGTGCTGCGCGGTCGGCTTGAGGTGGGCG TGATTTCC GATGTGGAAATCCAGCTGCCCCGCACCGAACAGTTCGCGCGCGCCAACCTGTCCCCCGAA AACATTCA CGAGGCCATTCGCGTGGCCGAGGGCGGCTGA

SEQ ID NO: 15: Acetobacter aceti, ssp. xylinum, NBRC 3288 (Gluconacetobacter xylinus) acetolactate decarboxylase - alsD amino acid sequence

MEIGFNIYWTYEPACITI SLQSQMTGLRRCHVPHCPPMQETEVVKLKCYSVGDVDTRSSAADSTGVRP RMNRLYQTSTMAALLDAVYDGETTLDELLMHGNFGLGTFNGLDGEMIVNDSVIHQFRADG QAGRVPGD LRTPFACVTFFNPEKEYMIDTAQDKEGFEAIVDHLVNNPNLFAAVRFTGMFERVETRTVF CQCQPYPP MLEVVARQPTMQLGASTGTMLGFRTPGYMQGVNVAGYHLHFLTEDGRRGGHVTDYGVLRG RLEVGVIS DVEIQLPRTEQFARANLSPENIHEAIRVAEGG

SEQ ID NO: 16: Candida parapsilosis M203011 alcohol dehydrogenase - adh gene nucleotide sequence (codon usage is optimized for S. elongatus)

ATGGGGGAGATTGAGTCCTATTGTAACAAAGAGCTAGGTCCCTTACCTACTAAAGCACCG ACCCTGTC CAAAAACGTGCTCGATCTATTCTCCTTGAAGGGAAAGGTGGCTTCGGTGACGGGCAGCAG CGGAGGCA TTGGTTGGGCCGTAGCCGAAGCATACGCCCAGGCCGGTGCAGACGTCGCGATTTGGTACA ACAGCCAT CCTGCGGACGAGAAGGCGGAACACCTGCAGAAAACTTATGGCGTGCACAGTAAAGCCTAT AAGTGCAA TATCAGTGATCCGAAAAGTGTGGAAGAAACGATCTCACAGCAAGAGAAGGATTTTGGCAC GATCGACG TGTTCGTCGCTAATGCCGGGGTCACTTGGACACAGGGCCCAGAGATCGATGTTGACAACT ATGATTCG TGGAACAAAATCATTAGTGTTGATTTGAATGGCGTTTACTATTGCAGCCACAATATTGGC AAGATCTT CAAGAAGAATGGGAAAGGTTCTCTGATTATCACGTCGAGTATTTCTGGGAAAATCGTTAA CATTCCGC AGTTGCAAGCGCCCTACAATACCGCGAAGGCCGCGTGTACCCATCTCGCGAAATCGCTCG CCATTGAA TGGGCTCCGTTTGCACGCGTCAACACGATTAGCCCCGGATACATCGACACCGATATCACC GATTTTGC CTCTAAAGACATGAAAGCTAAGTGGTGGCAACTTACTCCACTGGGACGGGAAGGCCTCAC CCAAGAAC TGGTCGGCGGCTACTTGTATCTGGCTAGCAATGCTTCGACATTTACCACCGGTTCAGATG TCGTTATC GATGGCGGTTACACATGCCCC

SEQ ID NO: 17: Candida parapsilosis M203011 alcohol dehydrogenase - adh amino acid sequence

MGEIESYCNKELGPLPTKAPTLSKNVLDLFSLKGKVASVTGSSGGIGWAVAEAYAQAGAD VAIWYNSH PADEKAEHLQKTYGVHSKAYKCNISDPKSVEETISQQEKDFGTIDVFVANAGVTWTQGPEIDVDNYDS WNKIISVDLNGVYYCSHNIGKIFKKNGKGSLIITSSISGKIVNIPQLQAPYNTAKAACTHLAKSLAIE WAPFARVNTISPGYIDTDITDFASKDMKAKWWQLTPLGREGLTQELVGGYLYLASNASTFTTGSDVVI DGGYTCP

SEQ ID NO: 18: Leuconostoc pseudomesenteroides CHCC2114 alcohol dehydrogenase - adh gene nucleotide sequence (codon usage is optimized for S. elongatus)

ATGACAAAGAAAGTGGCTATGGTGACGGGTGGCGCCCAAGGGATCGGAGAGGCCATTGTG CGACGTTT GTCGGCAGATGGCTTTGCGGTCGCAGTGGCTGATCTGAACGAAGCCAAATCCAAGGAAGT CGCCACGG ACATCGAAAAGAATGGGGGCACTGCGATTGCGGTTAAGCTCGATGTTTCAGATCGCGAGG GGTTTTTC GCCGCTGTTAAAGAAGTCGCGGAGAAGCTTGGAGGTTTTGACGTCTTGGTAAACAACGCT GGGCTCGG CCCCACCACCCCCATCGATACTATTACCCCGGAACTATTTGATAAGGTCTATCACATCAA CGTCGCTG GCGACATTTGGGGGATTCAAGCAGCCGTGGAGCAGTTCAAGAAAAATGGCAACGGCGGCA AAATCATT AACGCGACAAGTCAAGCCGGCGTGGTCGGCAATCCTAATTTGTCTCTGTATAGCTCGACT AAGTTCGC AGTGCGCGGACTGACGCAGGTCGCAGCGCGCGATCTAGCCGAACAGAATATCACGGTTAA TGCGTACG CTCCAGGAATTGTGAAAACGCCGATGATGTTTGATATTGCCCATGAAGTTGGCAAAAACG CCGGTAAG GATGACGAGTGGGGTATGCAAACCTTCGCGAAAGATATCGCTCTCAAACGGCTGAGCGAA CCCGAGGA CGTCGCCGCTGCTGTTGCGTTCTTAGCAGGTCCTGACAGTAATTACATCACCGGTCAGAC CATCGAAG TGGATGGTGGCATGCAGTTTCACTAG

SEQ ID NO: 19: Leuconostoc pseudomesenteroides CHCC2114 alcohol dehydrogenase - adh amino acid sequence

MTKKVAMVTGGAQGIGEAIVRRLSADGFAVAVADLNEAKSKEVATDIEKNGGTAIAVKLD VSDREGFF AAVKEVAEKLGGFDVLVNNAGLGPTTPIDTITPELFDKVYHINVAGDIWGIQAAVEQFKK NGNGGKI I NATSQAGVVGNPNLSLYSSTKFAVRGLTQVAARDLAEQNITVNAYAPGIVKTPMMFDIAH EVGKNAGK DDEWGMQTFAKDIALKRLSEPEDVAAAVAFLAGPDSNYITGQTIEVDGGMQFH

SEQ ID NO: 20: Clostridium beijerinckii NRRLB593 alcohol dehydrogenase - adh gene nucleotide sequence

ATGAAAGGTTTTGCCATGCTTGGAATCAATAAGCTAGGCTGGATTGAAAAAGAACGCCCA GTGGCGGG TAGTTATGATGCCATCGTTCGCCCACTGGCAGTGTCGCCCTGTACGAGCGATATCCACAC CGTATTTG AGGGCGCGTTAGGGGATCGTAAGAATATGATTCTGGGCCATGAAGCGGTTGGTGAGGTCG TAGAGGTG GGGTCTGAAGTGAAAGATTTCAAACCCGGCGATCGCGTCATCGTTCCATGCACTACGCCG GACTGGCG AAGCCTTGAAGTGCAGGCGGGATTCCAACAGCACTCAAACGGCATGCTCGCGGGGTGGAA GTTTTCCA ACTTTAAGGACGGGGTGTTCGGGGAATACTTCCACGTCAACGATGCGGATATGAATTTAG CCATTTTG CCTAAGGACATGCCCCTCGAAAATGCTGTTATGATTACCGATATGATGACAACGGGTTTC CACGGCGC TGAGCTCGCTGATATCCAAATGGGCAGCAGTGTGGTTGTCATCGGGATTGGTGCAGTCGG CCTGATGG GCATTGCGGGAGCCAAATTGCGGGGAGCTGGCCGCATCATTGGTGTTGGTAGCCGGCCTA TTTGTGTC GAAGCTGCAAAGTTTTATGGGGCCACAGACATTTTGAACTACAAGAATGGTCATATTGTG GACCAGGT GATGAAGCTCACGAATGGCAAGGGAGTCGATCGCGTTATCATGGCTGGCGGTGGCTCTGA AACCCTGA GCCAAGCAGTCTCGATGGTCAAACCGGGAGGCATTATCTCGAACATCAACTATCATGGCA GTGGCGAT GCGCTGCTGATCCCCCGAGTGGAGTGGGGCTGCGGCATGGCCCATAAGACCATTAAAGGC GGCCTCTG CCCGGGTGGCCGTTTGCGCGCAGAGATGCTACGCGATATGGTGGTCTACAACCGGGTGGA TCTGTCCA AACTGGTCACTCACGTTTACCATGGTTTTGATCACATCGAAGAGGCCTTGTTGTTGATGA AAGACAAA CCGAAAGACCTCATTAAAGCCGTCGTAATCCTGTAG

SEQ ID NO: 21: Clostridium beijerinckii NRRLB593 alcohol dehydrogenase - adh amino acid sequence

MKGFAMLGINKLGWIEKERPVAGSYDAIVRPLAVSPCTSDIHTVFEGALGDRKNMILGHE AVGEVVEV GSEVKDFKPGDRVIVPCTTPDWRSLEVQAGFQQHSNGMLAGWKFSNFKDGVFGEYFHVND ADMNLAIL PKDMPLENAVMITDMMTTGFHGAELADIQMGSSVVVIGIGAVGLMGIAGAKLRGAGRIIGVGSRPICV EAAKFYGATDILNYKNGHIVDQVMKLTNGKGVDRVIMAGGGSETLSQAVSMVKPGGIISNINYHGSGD ALLIPRVEWGCGMAHKTIKGGLCPGGRLRAEMLRDMVVYNRVDLSKLVTHVYHGFDHIEE ALLLMKDK PKDLIKAVVIL

SEQ ID NO: 22: Thermoanaerobacter brockii HTD4 alcohol dehydrogenase - adh gene nucleotide sequence

ATGAAGGGTTTCGCAATGCTGAGCATCGGGAAAGTCGGTTGGATCGAAAAAGAGAAACCT GCGCCAGG CCCGTTCGACGCGATTGTTCGGCCCTTAGCTGTCGCGCCGTGCACGTCGGACATTCACAC GGTATTCG AAGGGGCTATCGGCGAGCGGCACAATATGATCCTGGGCCATGAAGCTGTCGGCGAGGTTG TCGAAGTG GGCTCTGAGGTGAAAGACTTTAAACCCGGCGATCGCGTCGTAGTTCCGGCGATCACTCCC GACTGGCG GACGAGTGAAGTGCAACGCGGCTATCACCAGCACAGCGGGGGAATGTTGGCGGGCTGGAA GTTTTCAA ACGTCAAGGATGGGGTTTTTGGCGAATTCTTTCATGTGAACGATGCCGATATGAATCTGG CCCATCTT CCTAAAGAGATTCCGCTGGAGGCTGCTGTGATGATTCCGGATATGATGACCACAGGCTTC CATGGTGC TGAACTCGCGGATATCGAACTGGGTGCCACCGTGGCAGTGTTGGGCATCGGCCCTGTTGG CCTCATGG CGGTCGCAGGTGCTAAGTTACGTGGGGCTGGCCGTATTATCGCCGTCGGCTCGCGCCCAG TGTGTGTT GATGCGGCTAAGTACTATGGAGCCACTGATATCGTGAACTACAAGGATGGTCCCATCGAG AGTCAGAT TATGAATTTGACCGAGGGAAAGGGTGTCGACGCCGCCATTATCGCAGGTGGAAACGCCGA TATCATGG CCACCGCCGTCAAAATCGTTAAACCTGGAGGTACAATTGCAAATGTCAACTATTTTGGTG AAGGCGAA GTGCTGCCCGTTCCACGCCTGGAATGGGGATGTGGGATGGCCCATAAGACCATTAAAGGC GGCCTCTG CCCCGGTGGCCGACTGCGCATGGAACGCCTAATTGATCTCGTGTTCTACAAACGAGTCGA TCCGTCCA AGTTGGTGACGCACGTGTTTCGCGGTTTTGACAATATTGAGAAGGCATTTATGCTAATGA AGGACAAA CCGAAAGATCTCATCAAACCCGTGGTCATTTTGGCA

SEQ ID NO: 23: Thermoanaerobacter brockii HTD4 alcohol dehydrogenase - adh amino acid sequence

MKGFAMLS IGKVGWIEKEKPAPGPFDAIVRPLAVAPCTSDIHTVFEGAIGERHNMILGHEAVGEVVEV GSEVKDFKPGDRVVVPAITPDWRTSEVQRGYHQHSGGMLAGWKFSNVKDGVFGEFFHVND ADMNLAHL PKEIPLEAAVMIPDMMTTGFHGAELADIELGATVAVLGIGPVGLMAVAGAKLRGAGRI IAVGSRPVCV DAAKYYGATDIVNYKDGPIESQIMNLTEGKGVDAAI IAGGNADIMATAVKIVKPGGTIANVNYFGEGE VLPVPRLEWGCGMAHKTIKGGLCPGGRLRMERLIDLVFYKRVDPSKLVTHVFRGFDNIEK AFMLMKDK PKDLIKPVVILA