Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ENGINEERING OF MULTI-CARBON SUBSTRATE UTILIZATION PATHWAYS IN METHANOTROPHIC BACTERIA
Document Type and Number:
WIPO Patent Application WO/2014/066670
Kind Code:
A1
Abstract:
The present disclosure relates to genetically engineered methanotrophic bacteria with the capability of growing on a multi-carbon substrate as a primary or sole carbon source and methods for growing methanotrophic bacteria on a multi-carbon substrate.

Inventors:
SILVERMAN JOSHUA (US)
LEONARD EFFENDI (US)
SAVILLE RENEE (US)
Application Number:
PCT/US2013/066665
Publication Date:
May 01, 2014
Filing Date:
October 24, 2013
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
CALYSTA ENERGY INC (US)
International Classes:
C12N1/21; C12N15/52
Foreign References:
US20090253192A12009-10-08
Other References:
ZHAO, SHU-JIE ET AL.: "Variants of the obligate methanotroph isolate 761M capable of growth on glucose in the absence of methane", APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 48, no. 4, 1984, pages 807 - 812
DEDYSH, SVETLANA N. ET AL.: "Methylocella species are facultatively methanotrophic", JOURNAL OF BACTERIOLOGY, vol. 187, no. 13, 2005, pages 4665 - 4670
IM, JEONGDAE ET AL.: "Characterization of a novel facultative Methylocystis species capable of growth on methane, acetate and ethanol", ENVIRONMENTAL MICROBIOLOGY REPORTS, vol. 3, no. 2, 2011, pages 174 - 181
THEISEN, ANDREAS R. ET AL.: "Regulation of methane oxidation in the facultative methanotroph Methylocella silvestris BL2", MOLECULAR MICROBIOLOGY, vol. 58, no. 3, 2005, pages 682 - 692
Attorney, Agent or Firm:
PEPE, Jeffrey, C. et al. (Suite 5400701 Fifth Avenu, Seattle Washington, US)
Download PDF:
Claims:
CLAIMS

1. A recombinant obligate methanotrophic bacterium or a recombinant facultative methanotrophic bacterium including at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component,

wherein the multi-carbon substrate is not utilized as a carbon source by a reference facultative methanotrophic bacterium;

wherein the at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component is expressed in a sufficient amount to permit growth of the recombinant obligate methanotrophic bacterium on the multi-carbon substrate as a primary carbon source or the recombinant facultative methanotrophic bacterium on the multi- carbon substrate as a sole carbon source.

2. The recombinant methanotrophic bacterium of claim 1, wherein the multi-carbon substrate is glucose, glycerol, acetate, lactate, arabinose, citrate, or succinate.

3. The recombinant methanotrophic bacterium of claim 1 or 2, wherein the multi-carbon substrate is glucose and the recombinant methanotrophic bacterium includes an exogenous nucleic acid encoding a glucose transporter.

4. The recombinant methanotrophic bacterium of claim 1 or 2, wherein the multi-carbon substrate is acetate and the recombinant methanotrophic bacterium includes an exogenous nucleic acid encoding an acetate transporter.

5. The recombinant methanotrophic bacterium of claim 4, wherein the recombinant methanotrophic bacterium is further modified to overexpress acetyl-CoA synthase.

6. The recombinant methanotrophic bacterium of claim 1 or 2, wherein the multi-carbon substrate is lactate and the recombinant methanotrophic bacterium includes an exogenous nucleic acid encoding a lactate transporter and an exogenous nucleic acid encoding a lactate dehydrogenase.

7. The recombinant methanotrophic bacterium of claim 1 or 2, wherein the multi-carbon substrate is arabinose and the recombinant methanotrophic bacterium includes an exogenous nucleic acid encoding an L-arabinose isomerase, an exogenous nucleic acid encoding an L-ribulose kinase, an exogenous nucleic acid encoding an L-ribulose-5- phosphate epimerase, and an exogenous nucleic acid encoding an arabinose transporter.

8. The recombinant methanotrophic bacterium of claim 7, wherein the L- arabinose isomerase is AraA, the L-ribulose kinase is AraB, the L-ribulose-5 -phosphate epimerase is AraD, and the arabinose transporter is AraE, AraFGH, or AraP.

9. The recombinant methanotrophic bacterium of claim 1 or 2, wherein the multi-carbon substrate is citrate and the recombinant methanotrophic bacterium includes an exogenous nucleic acid encoding a citrate transporter.

10. The recombinant methanotrophic bacterium of claim 1 or 2, wherein the multi-carbon substrate is succinate and the recombinant methanotrophic bacterium includes an exogenous nucleic acid encoding a succinate transporter.

11. The recombinant methanotrophic bacterium of claim 1 or 2, wherein the multi-carbon substrate is glycerol and the recombinant methanotrophic bacterium includes at least two exogenous nucleic acids encoding glycerol utilization pathway components.

12. The recombinant methanotrophic bacterium of claim 11 , wherein the at least two glycerol utilization pathway components comprise glycerol kinase and glycerol-3 - phosphate dehydrogenase.

13. The recombinant methanotrophic bacterium of claim 12, wherein the glycerol kinase is GlpK and the glycerol-3 -phosphate dehydrogenase is GlpD.

14. The recombinant methanotrophic bacterium of claim 11 , wherein the recombinant methanotrophic bacterium includes three exogenous nucleic acids encoding glycerol utilization pathway components.

15. The recombinant methanotrophic bacterium of claim 14, wherein the three glycerol utilization pathway components comprise glycerol uptake facilitator, glycerol kinase, and glycerol-3 -phosphate dehydrogenase.

16. The recombinant methanotrophic bacterium of claim 15, wherein the glycerol uptake facilitator is GlpF, the glycerol kinase is GlpK, and the glycerol-3 -phosphate dehydrogenase is GlpD.

17. The recombinant methanotrophic bacterium of any one of the preceding claims, wherein the recombinant methanotrophic bacterium is Methylomonas, Methylobacter, Methylococcus, Methylosinus, Methylomicrobium, or Methanomonas.

18. The recombinant methanotrophic bacterium of claim 17, wherein the recombinant methanotrophic bacterium is Methylosinus trichosporium strain OB3b,

Methylococcus capsulatus Bath strain, Methylomonas methanica 16A strain, Methylosinus trichosporium (NRRL B-11,196), Methylosinus sporium (NRRL B-11,197), Methylocystis parvus (NRRL B-11,198), Methylomonas methanica (NRRL B-11,199), Methylomonas albus (NRRL B-11,200), Methylobacter capsulatus (NRRL B-11,201), Methylomonas sp AJ-3670 (FERM P-2400), Methylacidiphilum infernorum, or Methylomicrobium alcaliphilum 20Z.

19. The recombinant methanotrophic bacterium of any one of claims 1-16, wherein the recombinant methanotrophic bacterium is Methylocella silvestris, Methylocella palustris, Methylocella tundra, Methylocystis daltona, Methylocystis bryophila,

Methylobacterium organophilum (ATCC 27,886), or Methylocapsa aurea.

20. The recombinant methanotrophic bacterium of any one of the preceding claims, wherein at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component is codon optimized for high expression in the methanotrophic bacterium.

21. A method for growing methanotrophic bacteria, comprising: culturing the recombinant methanotrophic bacteria of any one of claims 1-20 in the presence of a multi-carbon substrate, wherein the multi-carbon substrate is used as a primary carbon source by the recombinant obligate methanotrophic bacteria or wherein the multi-carbon substrate is used as a sole carbon source by the recombinant facultative methanotrophic bacteria.

Description:
ENGINEERING OF MULTI-CARBON SUBSTRATE UTILIZATION PATHWAYS

IN METHANOTROPHIC BACTERIA

STATEMENT REGARDING SEQUENCE LISTING

The Sequence Listing associated with this application is provided in text format in lieu of a paper copy, and is hereby incorporated by reference into the specification. The name of the text file containing the Sequence Listing is

200206_402WO_SEQUENCE_LISTING.txt. The text file is 389 KB, was created on October 22, 2013, and is being submitted electronically via EFS-Web.

BACKGROUND

Technical Field

The present disclosure relates to genetically engineered methanotrophic bacteria capable of growing on multi-carbon substrates and methods for growing methanotrophic bacteria on multi-carbon substrates.

Description of the Related Art

Methanotrophic bacteria generally rely on methane as their sole carbon and energy source. The low solubility of methane is a major limiting factor in achieving high cell density and rapid growth in methanotrophic bacteria. The slow growth of methanotrophs and their requirement for methane, a potentially explosive substrate, has hampered their industrial application. For many industrial applications, such as chemical catalysis or other biological transformations, it is desirable to achieve high amounts of methanotroph biomass, regardless of the carbon source used.

In view of the limitations associated with methanotrophic bacteria growth, there is a need in the art for methanotrophs that can utilize alternative, preferably inexpensive, substrates as carbon and energy sources. The present disclosure provides a solution by providing genetically engineered methanotrophic bacteria that can utilize multi-carbon substrates, including glycerol.

SUMMARY OF INVENTION

In one aspect, the present disclosure provides for recombinant obligate methanotrophic bacteria including at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component, wherein the at least one exogenous nucleic acid is expressed in a sufficient amount to permit growth of the non- naturally occurring methanotrophic bacteria on the multi-carbon substrate as a primary carbon source. In certain embodiments, the multi-carbon substrate is a sole carbon source. In certain embodiments, the multi-carbon substrate may be glucose, acetate, lactate, arabinose, citrate, succinate, or glycerol.

In another aspect, the present disclosure provides for recombinant facultative methanotrophic bacteria including at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component, wherein the multi- carbon substrate is not utilized as a carbon source by a reference facultative

methanotrophic bacterium, wherein the at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component is expressed in a sufficient amount to permit growth of the recombinant facultative methanotrophic bacteria on the multi-carbon substrate as a sole carbon source. In certain embodiments, the multi- carbon substrate may be glucose, glycerol, lactate, arabinose, acetate, succinate, or citrate.

In certain embodiments wherein the multi-carbon substrate is glucose, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a glucose transporter.

In certain embodiments wherein the multi-carbon substrate is acetate, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding an acetate transporter. In further embodiments, the non-naturally occurring

methanotrophic bacteria are further modified to overexpress acetyl-CoA synthase.

In certain embodiments wherein the multi-carbon substrate is lactate, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a lactate transporter and an exogenous nucleic acid encoding a lactate dehydrogenase.

In certain embodiments wherein the multi-carbon substrate is arabinose, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding an L-arabinose isomerase, an exogenous nucleic acid encoding an L-ribulose kinase, an exogenous nucleic acid encoding an L-ribulose-5 -phosphate epimerase, and an exogenous nucleic acid encoding an arabinose transporter. In further embodiments, the L-arabinose isomerase is AraA, the L-ribulose kinase is AraB, the L-ribulose-5- phosphate epimerase is AraD, and the arabinose transporter is AraE, AraFGH, or AraP.

In certain embodiments wherein the multi-carbon substrate is citrate, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a citrate transporter. In certain embodiments wherein the multi-carbon substrate is succinate, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a succinate transporter.

In certain embodiments wherein the multi-carbon substrate is glycerol, the recombinant methanotrophic bacteria include at least two exogenous nucleic acids encoding glycerol utilization pathway components. In further embodiments, the at least two glycerol utilization components comprise glycerol kinase and glycerol-3 -phosphate dehydrogenase. In a specific embodiment, the glycerol kinase is GlpK and glycerol-3- phosphate dehydrogenase is GlpD.

In further embodiments, the recombinant methanotrophic bacteria include three exogenous nucleic acids encoding glycerol utilization pathway

components. In still further embodiments, the three glycerol utilization pathway components comprise glycerol uptake facilitator, glycerol kinase, and glycerol-3 - phosphate dehydrogenase. In a specific embodiment, the glycerol uptake facilitator is GlpF, the glycerol kinase is GlpK, and the glycerol-3 -phosphate dehydrogenase is GlpD.

In certain embodiments, the recombinant obligate methanotrophic bacteria are Methylomonas, Methylobacter, Methylococcus, Methylosinus,

Methylomicrobium, or Methanomonas. In further embodiments, the recombinant obligate methanotrophic bacteria are Methylosinus trichosporium strain OB3b,

Methylococcus capsulatus Bath strain, Methylomonas methanica 16 A strain,

Methylosinus trichosporium (NRRL B- 11,196), Methylosinus sporium (NRRL B- 11,197), Methylocystis parvus (NRRL B- 11,198), Methylomonas methanica (NRRL B- 11,199), Methylomonas albus (NRRL B-l 1,200), Methylobacter capsulatus (NRRL B- 11,201), Methylomonas sp AJ-3670 (FERM P-2400), Methylacidiphilum infernorum, or Methylomicrobium alcaliphilum 20Z.

In certain embodiments, the recombinant facultative methanotrophic bacteria are Methylocella silvestris, Methylocella palustris, Methylocella tundrae, Methylocystis daltona strain SB2, Methylocystis bryophila, Methylocapsa aurea KYG, or Methylobacterium organophilum (ATCC 27,886).

In certain embodiments, at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component is codon optimized for high expression in methanotrophic bacteria.

Additionally, the present disclosure provides methods for growing methanotrophic bacteria, comprising culturing the recombinant obligate methanotrophic bacteria according to any of the embodiments provided herein in the presence of a multi-carbon substrate, wherein the multi-carbon substrate is used as a primary carbon source by the recombinant obligate methanotrophic bacteria. In certain embodiments, the present disclosure provides methods for growing methanotrophic bacteria, comprising culturing the recombinant faculatative methanotrophic bacteria according to any of the embodiments provided herein in the presence of a multi-carbon substrate, wherein the multi-carbon substrate is used as a sole carbon source by the recombinant faculatative methanotrophic bacteria.

BRIEF DESCRIPTION OF THE DRAWING

FIGURE 1 shows an exemplary glycerol utilization pathway (respiratory) for a genetically modified methanotrophic bacterium. Glycerol crosses the cell membrane (e.g., via a glycerol uptake facilitator such as GlpF), where it is phosphorylated by a glycerol kinase (GK) (e.g., GlpK) to form glycerol-3 -phosphate (G3P), which is then oxidized by glycerol-3 -phosphate dehydrogenase (e.g., GlpD) to dihydroxyacetone phosphate (DHAP), which is then isomerized by triose phosphate isomerase and may then enter endogenous sugar metabolism or gluconeogenesis pathways.

DETAILED DESCRIPTION

Prior to setting forth this disclosure in more detail, it may be helpful to an understanding thereof to provide definitions of certain terms to be used herein.

Additional definitions are set forth throughout this disclosure.

In the present description, the term "about" means ± 20% of the indicated range, value, or structure, unless otherwise indicated. The term "consisting essentially of limits the scope of a claim to the specified materials or steps and those that do not materially affect the basic and novel characteristics of the claimed invention. It should be understood that the terms "a" and "an" as used herein refer to "one or more" of the enumerated components. The use of the alternative (e.g., "or") should be understood to mean either one, both, or any combination thereof of the alternatives. As used herein, the terms "include" and "have" are used synonymously, which terms and variants thereof are intended to be construed as non-limiting. The term "comprise" means the presence of the stated features, integers, steps, or components as referred to in the claims, but that it does not preclude the presence or addition of one or more other features, integers, steps, components, or groups thereof.

As used herein, the term "recombinant" or "non-natural" refers to an organism, microorganism, cell, nucleic acid molecule, or vector that has at least one genetic alternation or has been modified by the introduction of an exogenous nucleic acid, or refers to a cell that has been altered such that the expression of an endogenous nucleic acid molecule or gene can be controlled, where such alterations or

modifications are introduced by genetic engineering. Genetic alterations include, for example, modifications introducing expressible nucleic acid molecules encoding proteins or enzymes, other nucleic acid additions, nucleic acid deletions, nucleic acid substitutions, or other functional disruption of the cell's genetic material. Such modifications include, for example, coding regions and functional fragments thereof for heterologous or homologous polypeptides for the referenced species. Additional modifications include, for example, non-coding regulatory regions in which the modifications alter expression of a gene or operon. Exemplary proteins or enzymes include proteins or enzymes (i.e., components) within a multi-carbon substrate utilization pathway (e.g., a glycerol utilization pathway). Genetic modifications to nucleic acid molecules encoding enzymes, or functional fragments thereof, can confer a biochemical reaction capability or a metabolic pathway capability to the recombinant cell that is altered from its naturally occurring state.

As used herein, the term "host bacterium" or "host" refers to a microorganism (e.g., methanotrophic bacterium) that has not yet been genetically modified with the capability to utilize a multi-carbon substrate (e.g., glycerol) as a carbon and energy source. A host methanotrophic bacterium is selected for

transformation with at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component to yield a recombinant methanotrophic bacterium with the capability to utilize a multi-carbon substrate as a carbon and energy source. A host methanotrophic bacterium may already possess other genetic modifications conferring it with desired properties, unrelated to the multi-carbon substrate utilization pathway disclosed herein. For example, a host methanotrophic bacterium may possess genetic modifications conferring high growth, tolerance of contaminants or particular culture conditions, ability to metabolize additional carbon substrates, or ability to synthesize desirable products or intermediates (e.g., propylene, crotonate, or crotonyl CoA, see, e.g., International Application Number

PCT/US 13/60460, incorporated herein by reference, in its entirety).

As used herein, the term "methanotrophic bacterium" refers to a methylotrophic bacterium that has the ability to oxidize methane as its sole carbon and energy source. As used herein, "methanotrophic bacteria" include "obligate methanotrophic bacteria" that can only utilize CI substrates for carbon and energy sources and "facultative methanotrophic bacteria" that are naturally able to use multi- carbon substrates, such as acetate, pyruvate, succinate, malate, or ethanol, in addition to CI substrates as their sole carbon and energy source. Facultative methanotrophs include some species of Methylocella, Methylocystis, and Methylocapsa {e.g.,

Methylocella silvestris, Methylocella palustris, Methylocella tundrae, Methylocystis daltona strain SB2, Methylocystis bryophila, and Methylocapsa aurea KYG), and Methylobacterium organophilum (ATCC 27,886).

As used herein, the term "reference facultative methanotrophic bacterium", as known as "wild type facultative methanotrophic bacterium" or "parent facultative methanotrophic bacterium", refers to a facultative methanotrophic bacterium that has not been genetically engineered with the capability to use an additional multi- carbon substrate other than its native substrates.

As used herein, the term "not utilized as a carbon source" means that the referenced carbon substrate cannot be used as a sole carbon source by the referenced bacteria.

As used herein, the term "CI substrate" or "CI compound" refers to any organic compound that lacks a carbon to carbon bond. C 1 substrates include methane, methanol, formaldehyde, formic acid (formate), carbon monoxide, carbon dioxide, methylated amines (e.g., methylamine, dimethylamine, trimethylamine, etc.), methylated thiols, methyl halogens (e.g., bromomethane, chloromethane, iodomethane, dichloromethane, etc.), and cyanide.

As used herein, the term "multi-carbon substrate" or "multi-carbon compound" refers to an organic compound that contains at least one carbon to carbon bond. A multi-carbon substrate includes organic acids and carbohydrates. Exemplary multi-carbon substrates include glucose, acetate, lactate, arabinose, citrate, succinate, and glycerol.

As used herein, the term "glucose", also known as "D-glucose" or "dextrose", refers to a colorless, water soluble organic compound having the formula C 6 Hi 2 0 6 and that is the D-isomer or "right-handed form" of glucose. As used herein, glucose refers to both the open-chain form as well as cyclic isomers (e.g., a-D- glucopyranose, β-D-glucopyranose, a-D-glucofuranose, and β-D-glucofuranose).

As used herein, the term "acetate", also known as "ethanoate", refers to an organic compound that is a salt or ester of acetic acid (chemical formula CH 3 CO 2 H (also written as CH 3 COOH)). The formula for acetate anion is CH 3 CO 2 , C 2 H 3 0 2 ~ , or CH 3 COO . Acetate may also be abbreviated as OAc " or AcO " .

As used herein, the term "lactate" refers to a salt or ester of lactic acid. Lactic acid, also known as 2-hydroxypropanoic acid or sarcolactic acid, is a carboxylic acid with the chemical formula C 3 H 6 O 3 . The lactate anion has the chemical formula CH 3 CH(OH)COO ~ . Lactate includes L-lactate and D-lactate optical isomers.

As used herein, the term "arabinose", also known as "pectinose", refers to a monosaccharide containing five carbon atoms including an aldehyde functional group (aldopentose). Arabinose has the chemical formula C5H10O5. As used herein, arabinose includes L-arabinose and D-arabinose.

As used herein, the term "citrate" refers to a salt or ester of citric acid. Citric acid, also known as 2-hydroxypropane-l ,2,3-tricarboxylic acid, is a weak organic acid that has the chemical formula CeHgOy. The citrate anion has the chemical formula C 3 H 5 0(COO) 3 3 .

As used herein, the term "succinate" refers to a salt or ester of succinic acid. Succinic acid, also known as butanedioic acid or ethane- 1 ,2-dicarboxylic acid, has the chemical formula C4H6O4.

As used herein, the term "glycerol", also known as glycerine, glycerin, 1 ,2,3-propanetriol, glyceritol, glycyl alcohol, trihydroxypropane, propanetriol, osmoglyn, or 1 ,2,3-trihydroxypropane, refers to a tri-hydroxy sugar alcohol with the formula C 3 H 8 0 3 . It is a colorless, odorless, viscous liquid. Glycerol is an intermediate in carbohydrate and lipid metabolism and is often used as a solvent, emollient, pharmaceutical agent, and sweetening agent. As used herein, glycerol includes both purified glycerol, the form used in pharmaceutical, food, and cosmetic industries, and crude glycerol. Crude glycerol, or g-phase, is a heavier separate liquid phase composed mainly of glycerol that is the by-product of biodiesel production. Crude glycerol generated by homogeneous base-catalyzed transesterification contains approximately 50-60% of glycerol, 12-16% of alkalies, especially in the form of alkali soaps and hydroxides, 15-18%) of methyl esters, 8-12% of methanol, 2-3% water, and further components. Crude glycerol also contains a variety of elements, such as calcium, magnesium, phosphorus, or sulfur, originating from the primary oil. Larger quantities of sodium or potassium are also present, derived from the catalyst.

As used herein, "exogenous" means that the referenced molecule (e.g., nucleic acid) or referenced activity (e.g., enzyme activity or membrane transport) is introduced into a host methanotrophic bacterium by genetic engineering. The molecule can be introduced, for example, by introduction of a nucleic acid into the host genetic material such as by integration into a host chromosome or by introduction of a nucleic acid as non-chromosomal genetic material, such as on a plasmid. When the term is used in reference to expression of an encoding nucleic acid, it refers to introduction of the encoding nucleic acid in an expressible form into the host methanotrophic bacterium. When used in reference to an enzymatic or protein activity, the term refers to an activity that is introduced into the host reference bacterium. Therefore, the term "endogenous" or "native" refers to a referenced molecule or activity that is present in the host bacterium. The term "chimeric" when used in reference to a nucleic acid refers to any nucleic acid that is not endogenous, comprising sequences that are not found together in nature. For example, a chimeric nucleic acid may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences that are derived from the same source, but arranged in a manner different than that found in nature. The term "heterologous" refers to a molecule or activity that is derived from a source other than the referenced species or strain whereas "homologous" refers to a molecule or activity derived from the host bacterium. Accordingly, a methanotrophic bacterium comprising an exogenous nucleic acid as provided in the present disclosure can utilize either a heterologous or homologous nucleic acid or both.

It is understood that when more than one exogenous nucleic acid is included in a bacterium that the more than one exogenous nucleic acid refers to the referenced encoding nucleic acid or protein activity, as discussed above. It is also understood that such more than one exogenous nucleic acid can be introduced into the host bacterium on separate nucleic acid molecules, on a polycistronic nucleic acid molecule, on a single nucleic acid molecule encoding a fusion protein, or a combination thereof, and still be considered as more than one exogenous nucleic acid. For example, as disclosed herein, a methanotrophic bacterium can be modified to express two or more exogenous nucleic acids encoding a desired multi-carbon substrate utilization pathway component (e.g., glycerol utilization pathway components). Where two exogenous nucleic acids encoding glycerol utilization pathway components are introduced into a host methanotrophic bacterium, it is understood that the two exogenous nucleic acids can be introduced as a single nucleic acid molecule, for example, on a single plasmid, on separate plasmids, can be integrated into the host chromosome at a single site or multiple sites, and still be considered two exogenous nucleic acids. Similarly, it is understood that more than two exogenous nucleic acid molecules can be introduced into a host bacterium in any desired combination, for example, on a single plasmid, on separate plasmids, can be integrated into the host chromosome at a single site or multiple sites, and still be considered as two or more exogenous nucleic acids. Thus, the number of referenced exogenous nucleic acids or enzymatic activities refers to the number of encoding nucleic acids or the number of protein activities, not the number of separate nucleic acid molecules introduced into the host bacterium.

As used herein, "nucleic acid", also known as polynucleotide, refers to a polymeric compound comprised of covalently linked subunits called nucleotides.

Nucleic acids include polyribonucleic acid (RNA), polydeoxyribonucleic acid (DNA), both of which may be single or double stranded. DNA includes cDNA, genomic DNA, synthetic DNA, and semi-synthetic DNA.

As used herein, "multi-carbon substrate utilization pathway component" refers to any enzyme or protein that is involved (e.g., a protein or enzyme involved in transport or catalyzing an enzymatic reaction) in the ability of an organism to utilize the selected multi-carbon substrate as a carbon and energy source. Exemplary multi-carbon substrate utilization pathways include glucose, acetate, lactate, arabinose, citrate, succinate, and glycerol utilization pathways. Sources of nucleic acids encoding multi- carbon substrate utilization pathway components are well known in the art and may be derived from a variety of species, including bacteria, yeast, or other microorganisms.

As used herein, a "glucose transporter" refers to transport protein or transport system that transports glucose into a microorganism. Exemplary glucose transporters include phosphoenolpyruvate: sugar phosphotransferase system (PTS), which is involved in the uptake and concomitant phosphorylation of a variety of hexose sugars in bacteria, and members of the major facilitator family (e.g., glucose/galactose transporter (GluP), and galactose permease (GalP)).

As used herein, an "acetate transporter", also known as "monocarboxylic acid transporter", refers to a transport protein or transport system that transports acetate into a microorganism. Exemplary acetate transporters include acetate permeases and proton-linked monocarboxylate transporters of the sodium/solute symporter family (e.g., MctC, MctP, ActP).

As used herein, "acetyl-CoA synthase", also known as "acetyl-CoA synthetase", refers to an enzyme that ligates acetate to coenzyme A to form acetyl coenzyme A (acetyl-CoA).

As used herein, a "lactate transporter", also known as "monocarboxylic acid transporter", refers to a transport protein or transport system that transports lactate into a microorganism. Exemplary lactate transporters include lactate permease (e.g., LctP (also known as LacP)), and proton-linked monocarboxylate transporters of the sodium/solute symporter family (e.g., MctC, MctP, ActP). As used herein, a "lactate dehydrogenase", refers to an NAD- independent enzyme that catalyzes the oxidation of lactate to pyruvate. As used herein, lactate dehydrogenase includes L-lactate and D-lactate dehydrogenase.

As used herein, an "arabinose operon", also known as "L-arabinose operon" or "ara operon", refers to a gene sequence encoding enzymes needed for the catabolism of arabinose to D-xylulose-5 -phosphate, an intermediate of the pentose phosphate pathway.

As used herein, an "arabinose transporter" refers to a membrane transport protein or system that transports L-arabinose into a microorganism. An exemplary arabinose transporter includes the low affinity AraE transport protein and the AraFGH ATP -binding cassette (ABC) transporter system. AraF is a periplasmic arabinose-binding protein, AraG is an ATP-binding component, and AraH is a membrane-bound component.

As used herein, "glycerol utilization pathway component", also known as "glycerol metabolism pathway enzyme" or "glycerol fermentation pathway enzyme" refers to any enzyme or protein that is involved (e.g., transport or catalyzing enzymatic reaction) in the ability of an organism to utilize glycerol as a carbon and energy source. A glycerol utilization pathway component may be from an anaerobic pathway or aerobic pathway. A glycerol utilization pathway component includes, for example, glycerol uptake facilitators, glycerol kinase, glycerol-3 -phosphate dehydrogenase, glycerol dehydrogenase, ATP- or phosphoenolpyruvate-dependent dihydroxyacetone kinase. Sources of nucleic acids encoding glycerol utilization pathway components are well known in the art and may be derived from a variety of species, including bacteria, yeast, or other microorganisms.

As used herein, "glycerol uptake facilitator", also known as "glycerol facilitator" refers to a cytoplasmic membrane protein that transports glycerol into a cell. It may be a member of the major intrinsic protein (MIP) family of transmembrane channel proteins. As used herein, a glycerol uptake facilitator may refer to a membrane protein that facilitates diffusion of or actively transports glycerol into a cell. The activity of glycerol uptake facilitator can be measured by a transport assay (see, e.g., Voegele and Boos, 1993, J. Bacteriol. 175: 1087-1094).

As used herein, "glycerol kinase", also known as glycerokinase, refers to an enzyme that catalyzes the phosphorylation of glycerol to glycerol-3 -phosphate (G3P). The activity of glycerol kinase can be measured as described in Lin et al, 1962, Proc. Natl. Acad. Sci. USA 48:2145-2150. As used herein, "glycerol-3 -phosphate dehydrogenase" refers to an enzyme that catalyzes the oxidation of glycerol-3 -phosphate (G3P) to dihydroxyacetone phosphate. The activity of glycerol-3 -phosphate dehydrogenase can be measured by the method of Spector and Pizer (1975, Methods Enzymol. 41 :249-254). Glycerol-3- phosphate dehydrogenase includes both aerobic and anaerobic versions. In certain embodiments, the glycerol-3 -phosphate is an aerobic enzyme.

As used herein, "transformation" refers to the transfer of a nucleic acid (e.g. , exogenous nucleic acid) into the genome of a host bacterium, resulting in genetically stable inheritance. Host bacteria containing the transformed nucleic acids are referred to as "recombinant" or "non-naturally occurring" or "genetically engineered" or "transformed" or "transgenic" bacteria.

Methanotrophic Bacteria

In certain embodiments, obligate methanotrophic bacteria are genetically engineered with the capability to utilize a multi-carbon substrate as a carbon and energy source. In other embodiments, facultative methanotrophic bacteria are genetically engineered with the capacity to utilize a non-native multi-carbon substrate.

Methanotrophic bacteria have the ability to oxidize methane as a carbon and energy source. Methanotrophic bacteria are classified into three groups based on their carbon assimilation pathways and internal membrane structure: type I (gamma proteobacteria), type II (alpha proteobacteria, and type X (gamma proteobacteria). Type I

methanotrophs use the ribulose monophosphate (RuMP) pathway for carbon

assimilation whereas type II methanotrophs use the serine pathway. Type X

methanotrophs use the RuMP pathway but also express low levels of enzymes of the serine pathway. Methanotrophic bacteria are grouped into several genera:

Methylomonas, Methylobacter, Methylococcus, Methylocystis, Methylosinus,

Methylomicrobium, Methanomonas, and Methylocella. Methanotrophic bacteria include obligate methanotrophs, which can only utilize CI substrates for carbon and energy sources, and facultative methanotrophs, which naturally have the ability to utilize some multi-carbon substrates as a sole carbon and energy source. Facultative methanotrophs include some species of Methylocella, Methylocystis, and Methylocapsa (e.g., Methylocella silvestris, Methylocella palustris, Methylocella tundrae,

Methylocystis daltona strain SB2, Methylocystis bryophila, and Methylocapsa aurea KYG), and Methylobacterium organophilum (ATCC 27,886). Exemplary obligate methanotrophic bacteria include: Methylococcus capsulatus Bath strain, Methylomonas 16a (ATCC PTA 2402), Methylosinus trichosporium OB3b (NRRL B-l 1,196), Methylosinus sporium (NRRL B-l 1,197), Methylocystis parvus (NRRL B-l 1,198), Methylomonas methanica (NRRL B-l 1,199), Methylomonas albus (NRRL B-l 1,200), Methylobacter capsulatus (NRRL B-l 1,201), Methylomonas flagellata sp AJ-3670 (FERM P-2400), Methylacidiphilum infernorum and Methylomicrobium alcaliphilum 20Z.

A selected methanotrophic host bacteria may also undergo strain adaptation under selective conditions to identify variants with improved properties for production. Improved properties may include increased growth rate, yield of desired products, and tolerance of likely process contaminants. In a particular embodiment, a high growth variant methanotrophic bacteria, which is an organism capable of growth on methane as the sole carbon and energy source and which possesses an exponential phase growth rate that is faster {i.e., shorter doubling time) than its parent, reference, or wild-type bacteria, is selected (see, e.g., U.S. 6,689,601).

Each of the methanotrophic bacteria of this disclosure may be grown as an isolated pure culture, with a heterologous organism(s) that may aid with growth, or one or more different strains of methanotrophic bacteria may be combined to generate a mixed culture.

Multi-carbon Substrate Utilization Pathway Enzymes

Glycerol Utilization Pathway Enzymes

Glycerol can be used as a source of carbon and energy by many microorganisms. As used herein, "glycerol utilization pathway component", also known as "glycerol metabolism pathway enzyme" or "glycerol fermentation pathway enzyme" refers to any enzyme or protein that is involved in the ability of an organism to utilize glycerol as a carbon and energy source. A glycerol utilization pathway component may include a number of enzymes and transport proteins from multiple glycerol utilization pathways, including for example, glycerol uptake facilitators, glycerol kinase, glycerol-3 -phosphate dehydrogenase, glycerol dehydrogenase, and dihydroxy acetone kinase.

The initial step of glycerol utilization is its uptake by the microorganism. Glycerol can passively diffuse through membranes without a transport system.

However, many microorganisms possess specific glycerol transporters. Glycerol transport can be mediated by a glycerol uptake facilitator (facilitated diffusion), or by an active glycerol transporter {e.g. , ATP dependent transporter or proton glycerol symporter). Genes that encode glycerol transporters include, for example, glpF, glpT, aqp, gupl, gupl, mip, gtsA, gtsB, gtsC, and stll. An exemplary amino acid sequence for a glycerol uptake facilitator comprises any one of SEQ ID NOS:l-21.

In most microorganisms that utilize glycerol, once glycerol has entered the cell, it is phosphorylated by glycerol kinase (GK) into glycerol-3 -phosphate (G3P), which is then oxidized by glycerol-3 -phosphate dehydrogenase to form

dihydroxyacetone phosphate (DHAP) {i.e., in presence of electron acceptors, respiratory metabolism) {see, Figure 1). In the respiratory glycerol utilization pathway, there are two forms of glycerol-3-phosphate dehydrogenase, aerobic and anaerobic {e.g., GlpD and GlpABC, respectively). Genes that encode glycerol kinase include, for example, glpK, gutl, and gykA. An exemplary amino acid sequence for a glycerol kinase comprises any one of SEQ ID NOS:22-42. Genes that encode glycerol-3- phosphate dehydrogenase include, for example, glpD, glpA, glpB, glpC, gpsA, glyC, and gpdA2. An exemplary amino acid sequence for a glycerol-3-phosphate

dehydrogenase comprises any one of SEQ ID NOS:43-63. A small number of yeast and bacteria, however, use an alternative pathway in the absence of electron acceptors {i.e., fermentative metabolism), where glycerol is oxidized by glycerol dehydrogenase into dihydroxyacetone, which is then phosphorylated by dihydroxyacetone kinase into dihydroxyacetone phosphate.

Glucose Utilization Pathway Enzymes

Glucose utilization in bacteria is well characterized {see, e.g., Hugh and Leifson, 1953, J. Bacteriol. 66:24-26; Tomlinson et al., 1974, Canadian J. Microbiol. 20: 1085-1091; Takahashi and Yamada, 1999, Crit. Rev. Oral Biol. Med. 10:487-503; Fuhrer et al, 2005, J. Bacteriol. 187: 1581-1590; Goldman and Blumenthal, 1963, J. Bacteriol. 86:303-311; Fraenkel and Levisohn, 1967, J. Bacteriol. 93: 1571-1578; Hua et al, 2003, J. Bacteriol. 185:7053-7067). The main pathway for glucose catabolism is the Embden-Meyerhof-Parnas (EMP) pathway, a type of glycolysis that converts glucose into pyruvate and is widely distributed in saccharolytic bacteria. Some bacteria may use another glycolytic pathway, the Entner-Doudoroff pathway, to convert glucose to pyruvate. An alternative pathway to glycolysis is the pentose phosphate pathway which converts glucose to ribulose-5 -phosphate in an oxidative phase. A non-oxidative phase of the pentose phosphate pathway catalyzes the interconversion of

phosphorylated sugars to xylulose-5 -phosphate, ribulose-5 -phosphate, and ribose-5- phosphate. Glucose-6-phosphate is a reaction component in the glycolytic pathways and pentose phosphate pathway. Phosphoenolpyruvate: sugar phosphotransferase systems (PEP-sugar- PTS) are multi-component systems involving enzymes of the plasma membrane and in the cytoplasm that catalyze the concomitant transport and phosphorylation of hexose sugars (e.g., glucose) to hexose-6-phosphate (see, e.g., Herzberg and Klevit, 1994, Curr. Biol. 4:814-822; Takahashi and Yamada, 1999, Crit. Rev. Oral Biol. Med. 10:487-503; Saier, 1977, Bacteriological Rev. 41 :856-871, disclosures of which are incorporated herein by reference, in their entirety). Cytoplasmic proteins enzyme EI and Histidine Protein (HPr) initiate phosphoryl transfer reactions and function in the transport and phosphorylation of all sugar substrates of the system. Enzymes II are sugar-specific permeases or transporters, commonly consisting two cytoplasmic domains EIIA and EIIB and an integral membrane domain EIIC. An exemplary amino acid sequence for components of PEP-glucose-PTS systems from E. coli comprises any one of SEQ ID NOS:64-67. Exemplary amino acid sequences for EI and HPr and EIIA components are provided in SEQ ID NOs: 64 and 65. Exemplary EIIB and EIIC components are provided in SEQ ID NOs: 66 and 67.

Alternative mechanisms for glucose uptake include glucose ion symporters (see, e.g., Sarker et al, 1997, J. Bacteriol. 179: 1805-1808; Essenberg et al, 1997, Microbiology 143: 1549-1555; Henderson et al, 1977, Biochem. J. 162:309-320, disclosures of which are incorporated herein by reference, in their entirety) and ABC transporters (see, e.g., Albers et al, 1999, J. Bacteriol. 14:4285-4291; Chevance et al, 2006, J. Bacteriol. 188:6561-6571; Wanner and Soppa, 1999, Genetics 152: 1417-1428, disclosures of which are incorporated herein by reference, in their entirety). An exemplary amino acid sequence for a glucose ion symporter or ABC transporter comprises any one of SEQ ID NOS:68-71. Glucose may be then be phosphorylated to glucose 6-phosphate by a separate gluco-kinase. An exemplary amino acid sequence for a gluco-kinase comprises SEQ ID NO: 72 or 73.

Acetate Utilization Pathway Enzymes

The utilization of acetate as a carbon and energy source has been previously described (see, e.g., Gerstmeir et al, 2003, J. Biotechnol. 104:99-122). To grow on acetate, bacteria activate it to acetyl-CoA. Acetate may be converted to acetyl- phosphate and then to acetyl-CoA via acetate kinase and phosphotransacetylase enzymes, respectively. Acetate may also be converted directly to acetyl-CoA by acetyl- CoA synthase. An exemplary amino acid sequence for acetyl-CoA synthase comprises SEQ ID NO:74. Bacterial transport systems for uptake of acetate include monocarboxylic transporters. Exemplary acetate transporters include ActP of E. coli (Gimenez et al., 2003, J. Bacteriol. 185:6448-6455, incorporated herein by reference, in its entirety), MctP ofRhizobium leguminosarum (Hosie et al, 2002, J. Bacteriol. 184:5436-5448, incorporated herein by reference, in its entirety), and MctC of Cory b acterium glutamicum (Jokver et al, 2009, J. Bacteriol. 191 :940-948, incorporated herein by reference, in its entirety). An exemplary amino acid sequence for an acetate transporter comprises SEQ ID NO: 75.

Lactate Utilization Pathway Enzymes

Many bacteria are able to utilize D- or L-lactate as a sole source of carbon and energy (see, e.g., Chai et al, 2009, J. Bacteriol. 191 :2423-2430; Pinchuk et al, 2009, Proc. Natl. Acad. Sci. USA 106:2874-2879; Bryant et al, 1977, Appl.

Environ. Microbiol. 33: 1162-1169; Erwin and Gotschlich, 1993, J. Bacteriol. 175:6382- 6391; Myers and Nealson, 1988, 240: 1319-1321; Garvie, 1980, Microbiol. Rev.

44: 106-139). To use lactate as a source of carbon, it is oxidized to pyruvate by lactate dehydrogenase. An exemplary amino acid sequence for a lactate dehydrogenase comprises SEQ ID NO: 76.

Monocarboxylic transporters that transport acetate may also be capable of transporting lactate. Lactate transporters have been described and include, for example, LutP (formerly YvfH) (Chai et al, 2009, J. Bacteriol. 191 :2423-2430, incorporated herein by reference, in its entirety), MctP (Hosie et al., 2002, J. Bacteriol. 184:5436-5448, incorporated herein by reference, in its entirety), GlcA (YghK) and LctP (LldP) (Nunez et al, 2001, Microbiol. 147: 1069-1077, incorporated herein by reference, in its entirety). An exemplary amino acid sequence for a lactate transporter comprises SEQ ID NO:77 or 78.

Arabinose Utilization Pathway Enzymes

The utilization of arabinose as a carbon and energy source has been well characterized in a number of bacteria (see, e.g., Engelsberg et al., 1962, J. Bacteriol. 84: 137-146; Brown et al, 1972, J. Bacteriol. 111 :606-613; Stoner et al, 1983, J. Mol. Biol. 170: 1049-1053; Gallegos et al, 1997, Microbiol. Mol. Biol. Rev. 61 :393-410; Schleif R., 2000, Trends Genetc. 16:559-565; Sa-Nogueira et al, 1997, Microbiol. 143:957-969; Kawaguchi et al. 2009, Appl. Environ. Microbiol. 75:3419-3429; Vlieg et al, 2006, Curr. Opin. Biotechnol. 17: 183-190; U.S. Patent Publication 2011/0143408, disclosures of which are incorporated herein by reference, in their entirety). The arabinose operon, also known as the L-arabinose operon or ara operon, is a gene sequence encoding enzymes needed for the catabolism of L-arabinose to D-xylulose 5- phosophate, an intermediate of the pentose phosphate pathway. The ara operon has both positive and negative regulation. In E. coli, the ara operon comprises of a regulator gene AraC, pC and pBAD promoters, and enzymes AraB, AraA, and AraD. AraA is an L-arabinose isomerase that converts arabinose to L-ribulose. AraB is a kinase that phosphorylates L-ribulose. AraD is an epimerase that converts L-ribulose- 5 -phosphate to D-xylulose-5 -phosphate. AraE is a low affinity transporter that is bound to the inner membrane and uses the electrochemical potential to transport arabinose. AraFGH genes encode arabinose-specific components of a high-affinity ABC transporter. AraF is the periplasmic arabinose-binding protein. AraG is the ATP- binding component, and AraH is the membrane bound component. AraC regulates the arabinose catabolic genes (AraBAD) through interactions with the pBAD and pC promoter regions and is itself under arabinose-induced control (reviewed in Schleif R., 2002, SGM symposium 61 : Signals, switches, regulons and cascades: control of bacterial gene expression. Ed. D. A. Hodgson, CM. Thomas. Cambridge Univ. Press). Additional arabinose transporters include an arabinose permease AraP and arabinose transporter AraT. Exemplary amino acid sequences for AraB, AraA, AraD, AraE, AraF, AraG, and AraH comprise SEQ ID NOS: 79, 80, 81, 82, 83, 84, and 85, respectively.

Citrate Utilization Pathway Enzymes

Citrate metabolism pathways in bacteria have been well characterized (see, e.g., Martin et al, 2005, J. Bacteriol. 187:5146-5155; Drider et al, 2004, Genet. Mol. Res. 3:273-281; Bott, 1997, Arch. Microbiol. 167:78-88; Bott et al, 1995, Mol Microbiol. 18:533-546; Yamamoto et al, 2000, Mol. Microbiol. 37:898-912;

Korithoski et al, 2005, J. Bacteriol. 2005, 187:4451-4456; Vlieg et al, 2006, Curr. Opin. Biotechnol. 17:183-190). Citrate uptake is mediated by citrate transporters including, for example, CitM (Warner et al, 2000, J. Bacteriol. 182:6099-6105;

Korithoski et al, 2005, J. Bacteriol. 187:4451-4456, disclosures of which are incorporated herein by reference, in their entirety), CitS (van der Rest et al., 1992, J. Biol. Chem. 267:8971-8976; Lolkema et al, 1994, Eur. J. Biochem. 220:469-475, disclosures of which are incorporated herein by reference, in their entirety), CitP (Magni et al, 1996, FEMS Microbiol. Lett. 142:265-269, incorporated herein by reference, in its entirety), CitC (Ishiguro et al, 1992, J. Bio. Chem. 267:9559-9564, incorporated herein by reference, in its entirety), and CitH (Lolkema et al., 1994, Eur. J. Biochem. 220:469-475, incorporated herein by reference, in its entirety). An exemplary amino acid sequence for a citrate transporter comprises SEQ ID NO:86.

Succinate Utilization Pathway Enzymes

Bacterial utilization of succinate as a carbon and energy source has been described for a variety of bacteria (see, e.g., Janssen and Liesack, 1995, Arch.

Microbiol. 164:29-35; Denger and Schink, 1990, Arch. Microbiol. 154:550-555;

Jansen, 1991, Arch. Microbiol. 155:288-293; Gylswiyk et al, 1997, Int. J. Syst.

Bacteriol. 47: 155-9; Duetz et al, 1994, J. Bacteriol. 176:2354-2361).

Succinate is a component of the citric acid cycle or glyoxylate cycle for generating energy. Succinate is oxidized by succinate dehydrogenase to fumarate. In some bacteria, succinate is decarboxylated to propionate and C0 2 by methylmalonyl- CoA decarboxylase (see, e.g., Bott et al, 1997, Eur. J. Biochem. 250:590-599; Ruiz- Herrera and Garcia, 1972, J. Gen. Microbiol. 72:29-35).

Transport systems for C 4 -dicarboxylates (e.g., succinate) have been described in a number of bacteria. For example, E. coli use anaerobic DcuA and DcuB and aerobic Dct dicarboxylate transport systems (Lo et al, 1977, J. Supramol. Struct. 7:463-480; Six et al, 1994, J. Bacteriol. 176:6470-6478, disclosures of which are incorporated herein by reference, in their entirety). YdbFG sensor-regulator and YdbH C 4 -dicarboxylate transport protein have been described in Bacillus subtilis (Asai et al, 2000, Microbiol. 146:263-271, incorporated herein by reference, in its entirety).

Rhodobacter capsulatus has a Dct transport system which consists of three proteins: C 4 - dicarboxylate periplasmic binding protein, DctP, and two integral membrane proteins, DctQ and DctM (Forward et al, 1994, J. Bacteriol. 179:5482-5493, incorporated herein by reference, in its entirety). DcsT mediates uptake of C4 dicarobxylates, including succinate, in Cory neb acterium glutamicum (Teramoto et al, 2008, Appl. Environ. Microbiol. 74:5290-5296, incorporated herein by reference, in its entirety). An exemplary amino acid sequence for a succinate transporter comprises SEQ ID NO: 87 or 88.

Recombinant Methanotrophic Bacteria

Provided in the present disclosure are recombinant methanotrophic bacteria that may be produced by introducing (e.g. , by transformation) into the host bacteria at least one expressible exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component. Alternatively, if a selected methanotrophic host bacterium exhibits endogenous expression of one of more genes of a multi-carbon substrate utilization pathway, but is deficient in others, then an encoding nucleic acid is needed for the deficient component(s) to achieve the desired multi-carbon substrate utilization capability. Thus, a recombinant methanotrophic bacterium of the invention can be produced by introducing exogenous component activities to obtain a desired multi-carbon substrate utilization pathway or a desired multi-carbon substrate utilization pathway can be obtained by introducing one or more exogenous component activities which, together with one or more endogenous components, allow use of a multi-carbon substrate as a carbon source. However, it is understood that even if a host methanotrophic bacterium contains at least one multi-carbon substrate utilization pathway component, introduction of exogenous nucleic acids encoding components of a complete multi-carbon substrate utilization pathway may be included. In some embodiments, a recombinant methanotrophic bacterium as described herein can also include other genetic modifications that facilitate or optimize a multi-carbon substrate utilization pathway or that confer other useful functions onto the host. For example, if a selected host methanotrophic bacteria exhibits endogenous expression of a protein or enzyme that inhibits or competes with a multi-carbon substrate utilization pathway, then the host may be genetically modified so that it does not produce a functional protein or enzyme or a substantial amount of a functional protein or enzyme that inhibits or competes with the desired multi-carbon substrate utilization. In another example, selected host methanotrophic bacteria may be genetically modified to increase expression of an endogenous gene that enhances utilization of a desired multi-carbon substrate. Additionally, a host methanotrophic bacterium may possess other genetic modifications conferring it with other desirable properties, unrelated to multi-carbon substrate utilization. For example, a host methanotrophic bacterium may possess genetic modifications conferring high growth, tolerance of contaminants or particular culture conditions, ability to metabolize additional carbon substrates, or ability to synthesize desirable products or intermediates (e.g., propylene, crotonate, or crotonyl CoA, see, e.g., International Application Number PCT/US 13/60460, incorporated herein by reference, in its entirety).

In certain embodiments, the present disclosure provides recombinant obligate methanotrophic bacteria or recombinant facultative methanotrophic bacteria including at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component, wherein the multi-carbon substrate is not utilized as a carbon source by a reference facultative methanotrophic bacterium, wherein the at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component is expressed in a sufficient amount to permit growth of the recombinant obligate methanotrophic bacteria on the multi-carbon substrate as a primary carbon source or the recombinant facultative methanotrophic bacteria on the multicarbon substrate as a sole carbon source. A multi-carbon substrate may be a carbohydrate or organic acid, including, for example, glucose, acetate, lactate, arabinose, citrate, succinate, and glycerol. Recombinant obligate methanotrophic bacteria use a selected multi-carbon substrate as a primary carbon source if the selected multi-carbon substrate is the source of at least 50% or more of carbon usage for the bacteria. In certain embodiments, the at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component is expressed in a sufficient amount to permit growth of the recombinant obligate methanotrophic bacteria on the multi-carbon substrate as a sole carbon source.

A reference facultative methanotrophic bacterium, also known as parent or wildtype facultative methanotrophic bacterium, is one that has not been genetically engineered with the capability to use an additional multi-carbon substrate other than its native substrates. A reference facultative methanotrophic bacterium may be selected for genetic engineering to introduce at least one exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component for a multi-carbon substrate it does not naturally utilize (i.e., cannot grow on the selected multi-carbon substrate as sole carbon source) and becomes a recombinant facultative methanotrophic bacterium.

In certain embodiments wherein the multi-carbon substrate is glucose, recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a glucose transporter. A number of methanotrophic bacterial genomes encode the enzymes of a complete glycolysis pathway necessary for utilization of D-glucose (see, e.g., Stein et al, 2010, J. Bacteriol. 192:6497-6498; U.S. Patent No. 6,555,353; Ward et al, 2004, PLoS Biol. 2:e303; VuiUeumier et al, 2012, J. Bacteriol. 194:551-552). Methanotrophs have also been found to possess pentose phosphate pathway genes (Dedysh et al, 2000, Int. J. Syst. Evol. Microbiol. 50:955-969; VuiUeumier et al, 2012, J. Bacteriol. 194:551-552). However, methanotrophic bacteria lack glucose transporters that allow them to bring extracellular sugars into the cell, as well as enzymes to phosphorylate glucose. A glucose transporter suitable for introduction into a methanotrophic bacteria may be a phosphoenolpyruvate: glucose phosphotransferase system, a glucose ion symporter, or an ABC transporter. Exemplary amino acid sequences for phosphoenolpyruvate: glucose phosphotransferase system components comprise a sequence provided by SEQ ID NOS:64-67. Exemplary EI, HPr, and EIIA components comprise amino acid sequences provided by SEQ ID NO: 64 or 65;

exemplary EIIC and EIIB components comprise amino acid sequences provided by SEQ ID NOs: 66 or 67. An exemplary amino acid sequence for glucose ion symporters and ABC transporters comprises SEQ ID NO:68, 69, 70, or 71. If the glucose transporter is not a phosphoenolpyruvate: glucose phosphotransferase system, then the recombinant methanotrophic bacterium may further include an exogenous nucleic acid encoding a gluco-kinase to phosphorylate imported glucose. An exemplary amino acid sequence for gluco-kinase comprises SEQ ID NO: 72 or 73.

In certain embodiments wherein the multi-carbon substrate is acetate, recombinant methanotrophic bacteria include an exogenous nucleic acid encoding an acetate transporter. Examples of acetate transporters include ActP, MctP, and MctC. An exemplary amino acid sequence for an acetate transporter comprises SEQ ID NO: 75. Methanotrophic bacteria lack acetate transporters but possess an acetyl-CoA synthase gene, which activates acetate to acetyl-CoA. Metabolism of acetate may be feasible if it is transported into the cell. In certain embodiments, the recombinant methanotrophic bacteria comprising an exogenous nucleic acid encoding an acetate transporter is further modified to overexpress acetyl-CoA synthase. Up-regulation or overexpression of an endogenous or exogenous nucleic acid encoding acetyl-CoA synthase may improve methanotrophic bacterial growth rate on exogenous acetate, as the endogenous enzyme may not be expressed at optimal levels for growth on this non- native substrate. An exemplary amino acid sequence for an acetyl-CoA synthase comprises SEQ ID NO: 74.

In certain embodiments, wherein the multi-carbon substrate is lactate, recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a lactate transporter and an exogenous nucleic acid encoding a lactate dehydrogenase. Examples of lactate transporters include LutP (formerly YvfH), MctP, GlcA (YghK), and LctP (LldP). A number of monocarboxylic acid transporters that transport acetate are capable of transporting lactate also. An exemplary amino acid sequence for lactate transporter comprises SEQ ID NO:77 or 78. A nucleic acid encoding lactate dehydrogenase is introduced into recombinant methanotrophic bacteria to convert imported lactate to pyruvate, which may then enter endogenous pyruvate metabolic pathways of the recombinant methanotrophic bacteria. An exemplary amino acid sequence for lactate dehydrogenase comprises SEQ ID NO: 76.

In certain embodiments, wherein the multi-carbon substrate is arabinose, recombinant methanotrophic bacteria include an exogenous nucleic acid encoding an L- arabinose isomerase (e.g., AraA), an exogenous nucleic acid encoding an L-ribulose kinase (e.g., AraB), an exogenous nucleic acid encoding a L-ribulose-5 -phosphate epimerase (e.g., AraD), and an exogenous nucleic acid encoding an arabinose transporter. Exemplary amino acid sequences for AraB, AraA, and AraD comprise SEQ ID NOs: 79, 80, and 81 , respectively. An arabinose transporter, for example, AraE, AraFGH, or AraP, is used to transport arabinose into the recombinant methanotrophic bacteria. Exemplary AraE, AraF, and AraG, and AraH amino acid sequences comprise SEQ ID NOs: 82, 83, 84, and 85, respectively. AraA is an L- arabinose isomerase that converts arabinose to L-ribulose. AraB is a kinase that phosphorylates L-ribulose. AraD is an epimerase that converts L-ribulose-5 -phosphate to D-xylulose-5-phosphate. D-xylulose-5 -phosphate is a pre-cursor to ribulose-5- phosphate, a key intermediate in the ribulose monophosphate (RuMP) pathway employed by Type I and Type X methanotrophic bacteria, and is likely to be used efficiently by recombinant methanotrophic bacteria. In further embodiments, the L- arabinose isomerase is AraA, the L-ribulose kinase is AraB, and L-ribulose-5 -phosphate epimerase is AraD, and the arabinose transporter is AraE, AraFGH, or AraP.

In certain embodiments, wherein the multi-carbon substrate is citrate, recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a citrate transporter. Examples of citrate transporters include CitM, CitS, CitP, CitC, and CitH. An exemplary amino acid sequence for a citrate transporter comprises SEQ ID NO:86. Once citrate is imported into the cell, recombinant methanotrophs are expected to grow, as they possess enzymes for utilizing citrate.

In certain embodiments, wherein the multi-carbon substrate is succinate, recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a succinate transporter. Examples of succinate transporters include DcuA, DcuB, Dct, YdbH, Dct, and DcsT. An exemplary amino acid sequence for a succinate transporter comprises SEQ ID NO:87 or 88. Once succinate is imported into the cell, recombinant methanotrophs are expected to grow, as they possess enzymes for succinate utilization.

In certain embodiments wherein the multi-carbon substrate is glycerol, the recombinant methanotrophic bacteria includes at least two exogenous nucleic acids encoding glycerol utilization pathway components. In certain embodiments the at least two exogenous nucleic acids are expressed in an amount sufficient to permit growth of the recombinant methanotrophic bacteria on glycerol as a sole carbon source. In certain embodiments, the at least two glycerol utilization pathway components comprise components that are from the respiratory glycerol metabolism pathway. In certain embodiments, the at least two glycerol utilization pathway components comprise glycerol kinase and glycerol-3 -phosphate dehydrogenase. The glycerol-3 -phophate dehydrogenase may be preferably, aerobic, or anaerobic. In a specific embodiment, the glycerol kinase is GlpK and the glycerol-3 -phosphate dehydrogenase is GlpD.

Exemplary glycerol kinase and glycerol-3 -phosphate dehydrogenase amino acid sequences encoded by exogenous nucleic acids that may be used to transform host methanotrophic bacteria comprise any one of SEQ ID NOS:22-42 and SEQ ID

NOS:43-63, respectively. In certain embodiments, the glycerol kinase comprises an amino acid sequence of SEQ ID NO:22 and the glycerol-3 -phosphate dehydrogenase comprises an amino acid sequence of SEQ ID NO:43.

In certain embodiments, recombinant methanotrophic bacteria comprise three exogenous nucleic acids encoding glycerol utilization pathway components. The three exogenous nucleic acids encoding glycerol utilization pathway components may be expressed in an amount sufficient to permit growth of the recombinant

methanotrophic bacterium on glycerol as a primary carbon source or as a sole carbon source. In certain embodiments, the three exogenous nucleic acids encoding glycerol utilization pathway components comprise glycerol uptake facilitator, glycerol kinase, and glycerol-3 -phosphate dehydrogenase. In a specific embodiment, the glycerol uptake facilitator is GlpF, the glycerol kinase is GlpK, and the glycerol-3 -phosphate dehydrogenase is GlpD. Exemplary glycerol uptake facilitator, glycerol kinase, and glycerol-3 -phosphate dehydrogenase amino acid sequences encoded by exogenous nucleic acids that may be used to transform host methanotrophic bacteria comprise a sequence selected from SEQ ID NOS: l-21 , SEQ ID NOS:22-42, and SEQ ID NOS:43- 63, respectively. In certain embodiments, the glycerol uptake facilitator comprises an amino acid sequence of SEQ ID NO: l , the glycerol kinase comprises an amino acid sequence of SEQ ID NO:22, and the glycerol-3 -phosphate dehydrogenase comprises an amino acid sequence of SEQ ID NO:43.

Recombinant methanotrophic bacteria comprising at least two exogenous nucleic acids encoding glycerol utilization pathway components, as described herein, are expected to exhibit rapid and efficient growth in the presence of glycerol. However, in the absence of glycerol (e.g., during growth on methane as a sole carbon source), it is possible for a glycerol utilization pathway to run in reverse, where intracellular DHAP from gluconeogenesis is reduced to glycerol-3 -phosphate by glycerol-3 -phosphate dehydrogenase, which is capable of catalyzing a reversible redox reaction. Glycerol-3 -phosphate may then be de-phosphorylated by glycerol-3 - phosphatase into glycerol. Glycerol may then be secreted from the cell, thereby lowering the cellular growth rate. Therefore, in certain embodiments, expression of nucleic acids encoding glycerol utilization pathway components may be regulated (e.g., via inducible or repressible promoter) to provide for optimal bacterial growth under a variety conditions (e.g., presence of a particular carbon source). For example, expression of nucleic acids encoding glycerol utilization pathway components may be regulated so that they are not expressed in the absence of glycerol and expressed in the presence of glycerol.

In certain embodiments, recombinant methanotrophic bacteria of any of the embodiments disclosed herein comprise two, three, four, five, six, or more exogenous nucleic acids encoding multi-carbon substrate utilization pathway components, wherein the exogenous nucleic acids are expressed in an amount sufficient to permit growth of the recombinant methanotrophic bacteria on the multi-carbon substrate as a primary or sole carbon source. Each exogenous nucleic acid may encode a different type of multi-carbon substrate utilization pathway component (i.e., catalyze different enzymatic reactions or processes), or one or more nucleic acids may encode the same type of multi-carbon substrate utilization pathway component. For example, a recombinant methanotrophic bacterium may comprise two or more exogenous nucleic acids encoding a glycerol uptake facilitator, with each having a different sequence, in order to increase glycerol import capability of the bacterium. It is apparent to one of skill in the art that any combination of two or more nucleic acids encoding multi-carbon substrate utilization pathway components may be used to constitute a multi-carbon substrate utilization pathway in a recombinant methanotrophic bacterium, provided that the genetically engineered metabolic pathway provides the recombinant methanotrophic bacterium with the capability to metabolize the selected multi-carbon substrate (e.g., glycerol into dihydroxyacetone phosphate, arabinose to D-xylulose 5-phosophate).

In certain embodiments, the recombinant methanotrophic bacteria according to any of the embodiments disclosed herein is Methylomonas, Methylobacter, Methylococcus, Methylosinus, Methylomicrobium, or Methanomonas. In further embodiments, the recombinant methanotrophic bacteria is Methylosinus trichosporium strain OB 3b, Methylococcus capsulatus Bath strain, Methylomonas methanica 16 A strain, Methylosinus trichosporium (NRRL B- 11,196), Methylosinus sporium (NRRL B- 11,197), Methylocystis parvus (NRRL B- 11,198), Methylomonas methanica (NRRL B- 11,199), Methylomonas albus (NRRL B-l 1,200), Methylobacter capsulatus (NRRL B- 11,201), Methylomonas sp AJ-3670 (FERM P-2400), Methylacidiphilum infernorum, or Methylomicrobium alcaliphilum 20Z.

In certain embodiments, the recombinant methanotrophic bacteria according to any of the embodiments disclosed herein is Methylocella silvestris, Methylocella palustris, Methylocella tundra, Methylocystis daltona, Methylocystis bryophila, Methylobacterium organophilum (ATCC 27,886), or Methylocapsa aurea.

Sources of encoding nucleic acids for multi-carbon substrate utilization pathway components may include any bacterial, yeast, or other microorganism species where the encoded gene product is capable of catalyzing the referenced reaction in the multi-carbon substrate utilization pathway. Exemplary species for such sources are well known in the art.

Exemplary sources of encoding nucleic acids for glucose utilization pathway components include: Escherichia coli, Bacillus subtilis, Corynebacterium glutamicum, Saccharomyces cerevisiae, Zymomonas mobilis; Agrobacterium tumefaciens, Sinorhizobium meliloti; Rhodobacter sphaeroides; Paracoccus versutus; Pseudomonas fluorescens, Pseudomonas putida, Salmonella enterica, Escherichia fergusonii, Salmonella enteric, Yersinia pestis, Yersinia pseudotuberculosis, Yersinia enterocolitica, Shigella flexneri, Shigella sonnei, Shigella boydii, Shigella dysenteriae, Pectobacterium atrosepticum, Pectobacterium wasabiae, Erwinia tasmaniensis, Erwinia pyrifoliae, Erwinia amylovora, Erwinia billingiae, Buchnera aphidicola, Enterobacter sp. 638, Enterobacter cloacae, Enterobacter asburiae, Enterobacter aerogenes, Cronobacter sakazakii, Cronobacter turicensis, Klebsiella pneumoniae, Klebsiella variicola, Klebsiella oxytoca, Citrobacter koseri, Citrobacter rodentium, Serratia proteamaculans, Serratia sp. AS12, Proteus mirabilis, Edwardsiella ictaluri, Edwardsiella tarda, Candidatus Hamiltonella defense, Dickeya dadantii, Dickeya zeae, Pantoea anantis, Pantoea sp. At-9b, Pantoea vagans, Rahnella sp. Y9602, Haemophilus parasuis, Haemophilus parainfluenzae, Pasteurella multocida, Aggregatibacter aphrophilus, Aggregatibacter actinomycetemcomitans, Vibrio cholerae, Vibrio vulnificus, Vibrio parahaemolyticus, Vibrio harveyi, Vibrio splendidus, Photobacterium profundum, Vibrio anguillarum, Shewanella oneidensis, Shewanella denitrificans, Shewanella frigidimarina, Shewanella amazonensis, Shewanella baltica, Shewanella loihica, Shewanella sp. ANAS, Shewanella sp. MR- 7, Shewanella putrefaciens, Shewanella sediminis, Shewanella sp. MR-4, Shewanella sp. W3-18-1, Shewanella woodyi, Psychromonas ingrahamii, Ferrimonas balearica, Aeromonas hydrophila, Aeromonas salmonicida, Aeromonas veronii, Tolumonas auensis, Chromobacterium violaceum, Burkholderia sp. CCGE1002, Azospirillum sp. B510, Bacillus anthracis, Bacillus cereus, Bacillus cytotoxicus, Bacillus thuringiensis, Bacillus

weihenstephanensis, Bacillus pseudofirmus, Bacillus megaterium, Staphylococcus aureus, Exiguobacterium sibiricum, Exiguobacterium sp. AT lb, Macrococcus caseolyticus, Paenibacillus polymyxa, Streptococcus pyogenes, Streptococcus pneumoniae, Streptococcus agalactiae, Streptococcus mutans, Streptococcus thermophilus, Streptococcus sanguinis, Streptococcus suis, Streptococcus gordonii, Streptococcus equi, Streptococcus uberis, Streptococcus dysgalactiae, Streptococcus gallolyticus, Streptococcus mitis, Streptococcus pseudopneumoniae, Lactobacillus johnsonii, Lactobacillus gasseri, Enterococcus faecalis, Aerococcus urinae,

Carnobacterium sp. 17-4, Clostridium acetobutylicum, Clostridium perfringens, Clostridium tetani, Clostridium novyi, Clostridium botulinum, Desulfotomaculum reducens, Clostridium lentocellum, Erysipelothrix rhusiopathiae, Mycoplasma genitalium, Mycoplasma pneumoniae, Mycoplasma pulmonis, Mycoplasma penetrans, Mycoplasma gallisepticum, Mycoplasma mycoides, Mycoplasma synoviae, Mycoplasma capricolum, Mycoplasma crocodyli, Mycoplasma leachii, Mesoplasma florum, Propionibacterium acnes, Nakamurella multipartita, Borrelia burgdorferi, Borrelia garinii, and Borrelia afzelii.

Exemplary sources of encoding nucleic acids for acetate utilization pathway components include: Corynebacterium glutamicum, Escherichia coli, and Rhizobium leguminosarum.

Exemplary sources of encoding nucleic acids for lactate utilization pathway components include: Rhizobium leguminosarum, Bacillus subtilis,

Staphylococcus aureus, Escherichia coli, Enterobacteriaceae, Propionibacterium pentosaceum, Pseudomonas aeruginosa, Acetobacter peroxydans, Selenomonas ruminantium, Pseudomonas natriegens, Aerobacter aerogenes, Lactobacillus casei, Lactobacillus plantarum, Serratia, Aerobacter cloacae, Proteus vulgaris, Escherichia freundii, Klebsiella sp., Hafnia sp., Butyribacterium rettgeri, Streptococcus faecium, Streptococcus lactis, Pediococcus pentosaceum, Salmonella typhimurium,

Aggregatibacter actinomycetemcomitans, and Neisseria gonorrhoeae.

Sources of encoding nucleic acids for arabinose utilization pathway components include: Escherichia coli, Bacillus subtilis, Scheffersomyces stipitis, Corynebacterium glutamicum, Lactococcus lactis, Pichia stipitis, Shigella flexneri, Shigella boydii, Shigella dysenteriae, Salmonella typhimurium, Salmonella enterica, Klebsiella pneumoniae, Klebsiella oxytoca, Enterobacter cancerogenus, Bacillus amyloliquefaciens, Rhizobium, and Agrobacterium.

Sources of encoding nucleic acids for citrate utilization pathway components include: Lactococcus lactis, Enterococcus, Lactobacillus plantarum, Oenococcus oeni, Leuconostoc mesenteroides, Weissella, Salmonella dublin,

Salmonella pulloru, Salmonella enteritidis, Klebsiella pneumoniae, Salmonella typhimurium, and Bacillus subtilis. Sources of encoding nucleic acids for succinate utilization pathway components include: Escherichia coli, Corynebacterium glutamicum, Rhodobacter capsulatus, and Bacillus subtilis.

Sources of encoding nucleic acids for glycerol utilization pathway components include, for example: Escherichia coli, Acinetobacter baumannii,

Fusobacterium nucleatum subsp. vincentii, Pantoea sp. Scl, Pseudomonas aeruginosa, Shigella flexneri, Shewanella baltica OS155, Actinobacillus pleuropneumoniae serovar 3 str. JL03, Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29, Yersinia bercovieri, Aeromonas veronii B565, Pseudomonas fluorescens, Serratia sp. AS 12, Vibrio fischeri SR5, Haemophilus haemolyticus, Vibrio harveyi, Vibrio cholera, Pseudomonas putida SI 6, Pectobacterium carotovorum subsp. carotovorum PCI, Pseudomonas syringae, Acinetobacter sp. ATCC 27244, Photobacterium profundum SS9, Citrobacter freundii, Klebsiella pneumoniae, Enterobacter sp., Enterococcus casseliflavus, Enterococcus faecalis, Bacillus stearothermophilus, Bacillus subtilis, Streptococcus pyogenes, Haemophilus, influezae, Mycoplasma genitalium, Mycoplasma pneumonia, Mycoplasma mycoides, Yersinia mollaretii, Shigella dysenteriae, Shigella boydii, Shigella sonnei, Yersinia pestis, Yersinia intermedia, Yersinia frederiksenii, Serratia proteamaculans, Erwinia carotovora, Pseudomonas tolaasii, Yersinia enterolitica, Photorhabdus luminesens, Azotobacter vinelandii, Haemophilus ducreyi, Actinobacillus pleuropneumoniae, Aeromonas hydrophila, Photobacterium profundum, Aeromonas salmonicida, Vibrio angustum, Vibrio vulnificus, Vibrionales bacterium, Vibrio splendidus, Vibrio sp. Ex25, Vibrio alginolyticus, Vibrio parahaemolyticus, Shewanella sp. W3-18-1, Alteromonas macleodii, Sodalis glossinidius, Pasteurella multocida, Salmonella typhimurium, Lactobacillus casei, Rhadopseudomonas,

Propionibacterium, Nocardia asteroides, Klebsiella aerogenes, Halobacterium cutirubrum, Gluconobacter oxydans, Staphylococcus aureus, Candida utilis, Candia mycodema, Saccharomyces cerevisiae, Pichia pastoris, Kluyveromyces lactis, Ashbya gossypii, Lodderomyces elongisporus, Debaryomyces hansenii, Candida albicans, Pichia guilliermondii, Pichia stipitis, and Fusarium oxysporum.

However, with the complete genome sequence available for hundreds of microorganisms, the identification of genes encoding the requisite multi-carbon substrate utilization pathway in related or distant species, including for example, homo logs, orthologs, paralogs, etc., is routine and well known in the art. Accordingly, exogenous nucleic acids encoding multi-carbon substrate utilization pathway components described herein with reference to particular nucleic acids from a particular organism can readily include other nucleic acids encoding multi-carbon substrate utilization pathway components from other microorganisms. For recombinant methanotrophic bacteria comprising at least two exogenous nucleic acids encoding multi-carbon substrate utilization components, each nucleic acid may be derived from the same microorganism or from different microorganisms.

Polypeptide sequences and encoding nucleic acids for proteins, protein domains, and fragments thereof described herein, such as a component of a multi- carbon substrate utilization pathway, may also include natural and recombinantly engineered variants. A nucleic acid variant refers to a nucleic acid that may contain one or more substitutions, additions, deletions, insertions, or may be or comprise

fragment(s) of a reference nucleic acid. A reference nucleic acid refers to a selected nucleic acid encoding a multi-carbon substrate utilization pathway component. A variant nucleic acid may have 40%, 45%, 50%, 55%, 60%, 65%, 70%, 71%, 72%, 73%, 74%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to a reference nucleic acid, as long as the variant nucleic acid can still perform its requisite function or biological activity in the multi-carbon substrate utilization pathway (e.g., membrane transport). A variant polypeptide may have 50%, 55%, 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity to a reference protein, as long as the variant polypeptide can still perform its requisite function or biological activity (e.g., membrane transport). In certain embodiments, a multi-carbon substrate utilization pathway component that is introduced into recombinant methanotrophs as provided herein encodes an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%,86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity to an amino acid sequence selected from SEQ ID NOS: l-88. In certain embodiments, an exogenous nucleic acid encoding a multi-carbon substrate utilization pathway component that is introduced into recombinant methanotrophs comprises a nucleic acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 81%, 82%, 83%, 84%, 85%,86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%), 98%o or 99% identity to a nucleic acid sequence provided in Tables 14-22. These variants may have improved function and biological activity (e.g., higher enzymatic activity or improved specificity for substrate) than the parent (or wildtype) protein. Due to redundancy in the genetic code, nucleic acid variants may or may not affect amino acid sequence. A nucleic acid variant may also encode an amino acid sequence comprising one or more conservative substitutions compared to a reference amino acid sequence. A conservative substitution may occur naturally in the polypeptide (e.g., naturally occurring genetic variants) or may be introduced when the polypeptide is recombinantly produced. A conservative substitution is where one amino acid is substituted for another amino acid that has similar properties, such that one skilled in the art would expect that the secondary structure and hydropathic nature of the polypeptide to be substantially unchanged. Amino acid substitutions may generally be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, and/or the amphipathic nature of the residues, and is known in the art. Amino acid substitutions, deletions, and additions may be introduced into a polypeptide using well-known and routinely practiced mutagenesis methods (see, e.g., Sambrook et al. Molecular

Cloning: A Laboratory Manual, 3d ed., Cold Spring Harbor Laboratory Press, NY 2001). Oligonucleotide-directed site-specific (or segment specific) mutagenesis procedures may be employed to provide an altered polynucleotide that has particular codons altered according to the substitution, deletion, or insertion desired. Deletion or truncation variants of proteins may also be constructed by using convenient restriction endonuclease sites adjacent to the desired deletion. Alternatively, random mutagenesis techniques, such as alanine scanning mutagenesis, error prone polymerase chain reaction mutagenesis, and oligonucleotide-directed mutagenesis may be used to prepare polypeptide variants (see, e.g., Sambrook et al, supra). Variant nucleic acids may be naturally occurring or genetically engineered.

Nucleic acids encoding multi-carbon substrate utilization pathway components may be combined with other nucleic acid sequences, such as promoters, polyadenylation signals, restriction enzyme sites, multiple cloning sites, other coding segments, and the like.

Differences between a wild type (or parent or reference) nucleic acid or polypeptide and the variant thereof, may be determined by methods routinely practiced in the art to determine identity, which are designed to give the greatest match between the sequences tested. Methods to determine sequence identity can be applied from publicly available computer programs. Computer program methods to determine identity between two sequences include, for example, BLASTP, BLASTN (Altschul, S.F. et al, J. Mol. Biol. 215: 403-410 (1990), and FASTA (Pearson and Lipman Proc. Natl. Acad. Sci. USA 85; 2444-2448 (1988). The BLAST family of programs is publicly available from NCBI and other sources (BLAST Manual, Altschul, S., et al, NCBI NLM NIH Bethesda, MD.

Assays for determining whether a polypeptide variant folds into a conformation comparable to the non-variant polypeptide or fragment include, for example, the ability of the protein to react with mono- or polyclonal antibodies that are specific for native or unfolded epitopes, the retention of ligand-binding functions, the retention of enzymatic activity (if applicable), and the sensitivity or resistance of the mutant protein to digestion with proteases (see Sambrook et al, supra). Polypeptides, variants and fragments thereof, can be prepared without altering a biological activity of the resulting protein molecule (i.e., without altering one or more functional activities in a statistically significant or biologically significant manner). For example, such substitutions are generally made by interchanging an amino acid with another amino acid that is included within the same group, such as the group of polar residues, charged residues, hydrophobic residues, and/or small residues, and the like. The effect of any amino acid substitution may be determined empirically merely by testing the resulting modified protein for the ability to function in a biological assay, or to bind to a cognate ligand or target molecule.

Codon Optimization

Expression of recombinant proteins is often difficult outside their original host. For example, variation in codon usage bias has been observed across different species of bacteria (Sharp et al, 2005, Nucl. Acids. Res. 33: 1141-1153).

Over-expression of recombinant proteins even within their native host may also be difficult. In certain embodiments, at least one nucleic acid encoding a multi-carbon substrate utilization pathway component that is to be introduced into host

methanotrophic bacteria according to any of the embodiments described herein is codon optimized to enhance protein expression in the methanotrophic bacteria. Codon optimization refers to alteration of codons in genes or coding regions of nucleic acids for transformation of a methanotrophic bacterium to reflect the typical codon usage of the host bacteria species without altering the polypeptide for which the DNA encodes. Codon optimization methods for optimum gene expression in heterologous hosts have been previously described (see, e.g., Welch et al, 2009, PLoS One 4:e7002; Gustafsson et al, 2004, Trends Biotechnol. 22:346-353; Wu et al, 2007, Nucl. Acids Res. 35:D76- 79; Villalobos et al, 2006, BMC Bioinformatics 7:285; U.S. Patent Publication

2011/0111413; U.S. Patent Publication 2008/0292918; disclosure of which are incorporated herein by reference, in their entirety). One, two, three, or more nucleic acids encoding multi-carbon substrate utilization pathway components may be codon optimized. For example, wherein a recombinant methanotrophic bacterium comprises two glycerol utilization pathway components, one nucleic acid molecule (e.g., glycerol kinase) may be codon optimized, while the second nucleic acid molecule (e.g. , glycerol- 3 -phosphate dehydrogenase) is not codon optimized, or vice versa. Alternatively, all nucleic acids encoding multi-carbon substrate (e.g., glycerol) utilization pathway components may be codon optimized.

Transformation Methods

Any of the recombinant methanotrophic bacteria described herein may be transformed to comprise at least one exogenous nucleic acid to provide the host bacterium with a new or enhanced activity (e.g., enzymatic activity) or may be genetically modified to remove or substantially reduce an endogenous gene function using a variety of methods known in the art.

Transformation refers to the transfer of a nucleic acid (e.g., exogenous nucleic acid) into the genome of a host bacterium, resulting in genetically stable inheritance. Host bacteria containing the transformed nucleic acid molecules are referred to as "non-naturally occurring" or "recombinant" or "transformed" or

"transgenic" bacteria.

Expression systems and expression vectors useful for the expression of heterologous nucleic acids in methanotrophic bacteria are known.

Electroporation of CI metabolizing bacteria has been previously described in Toyama et al, 1998, FEMS Microbiol. Lett. 166: 1-7; Kim and Wood, 1997, Appl. Microbiol. Biotechnol. 48: 105-108; Yoshida et al, 2001 , Biotechnol. Lett. 23 :787-791 , and US2008/0026005.

Bacterial conjugation, which refers to a particular type of transformation involving direct contact of donor and recipient cells, is more frequently used for the transfer of nucleic acids into CI metabolizing bacteria. Bacterial conjugation involves mixing "donor" and "recipient" cells together in close contact with each other.

Conjugation occurs by formation of cytoplasmic connections between donor and recipient bacteria, with unidirectional transfer of newly synthesized donor nucleic acid molecules into the recipient cells. A recipient in a conjugation reaction is any cell that can accept nucleic acids through horizontal transfer from a donor bacterium. A donor in a conjugation reaction is a bacterium that contains a conjugative plasmid, conjugative transposon, or mobilized plasmid. The physical transfer of the donor plasmid can occur through a self-transmissible plasmid or with the assistance of a "helper" plasmid.

Conjugations involving CI metabolizing bacteria have been previously described in Stolyar et al, 1995, Mikrobiologiya 64:686-691 ; Motoyama et al, 1994, Appl. Micro. Biotech. 42:67-72; Lloyd et al, 1999, Archives of Microbiology 171 :364-370; and Odom et al, PCT Publication WO 02/18617; Ali et al, 2006, Microbiol. 152:2931- 2942. Expression of heterologous nucleic acids in CI metabolizing bacteria is known in the art (see, e.g., U.S. 6,818,424, US2003/0003528). Mu transposon based transformation of methylotrophic bacteria has been described (Akhverdyan et al., 2011, Appl. Microbiol. Biotechnol. 91 :857-871). A mini-Tn7 transposon system for single and multicopy expression of heterologous genes without insertional inactivation of host genes in Methylobacterium has been described (US2008/0026005).

Various methods for inactivating, knocking-out, or deleting endogenous gene function in CI metabolizing bacteria may be used. Allelic exchange using suicide vectors to construct deletion/insertional mutants in slow growing C 1 metabolizing bacteria have also been described in Toyama and Lidstrom, 1998, Microbiol. 144: 183- 191; Stolyar et al, 1999, Microbiol. 145:1235-1244; Ali et al, 2006, Microbiology 152:2931-2942; Van Dien et al, 2003, Microbiol. 149:601-609.

Suitable homologous or heterologous promoters for high expression of exogenous nucleic acids may be utilized. For example, U.S. 7,098,005 describes the use of promoters that are highly expressed in the presence of methane or methanol for heterologous gene expression in C 1 metabolizing bacteria. Additional promoters that may be used include deoxy-xylulose phosphate synthase methanol dehydrogenase operon promoter (Springer et al, 1998, FEMS Microbiol. Lett. 160: 119-124); the promoter for PHA synthesis (Foellner et al. 1993, Appl. Microbiol. Biotechnol. 40:284- 291); or promoters identified from native plasmid in methylotrophs (EP296484). Non- native promoters include the lac operon Plac promoter (Toyama et al, 1997,

Microbiology 143:595-602) or a hybrid promoter such as Ptrc (Brosius et al, 1984, Gene 27: 161-172). In certain embodiments, promoters or codon optimization are used for high constitutive expression of exogenous nucleic acids encoding glycerol utilization pathway enzymes in host methanotrophic bacteria. Regulated expression of an exogenous nucleic acid in the host methanotrophic bacterium may also be utilized. In particular, regulated expression of exogenous nucleic acids encoding glycerol utilization enzymes may be desirable to optimize growth rate of the non-naturally occurring methanotrophic bacteria. It is possible that in the absence of glycerol {e.g., during growth on methane as sole carbon source), for the glycerol utilization pathway to run in reverse, resulting in secretion of glycerol from the bacteria, thereby lowering growth rate. Controlled expression of nucleic acids encoding glycerol utilization pathway enzymes in response to the presence of glycerol may optimize bacterial growth in a variety of carbon source conditions. For example, an inducible/regulatable system of recombinant protein expression in methylotrophic and methanotrophic bacteria as described in US2010/0221813 may be used. Regulation of glycerol utilization genes in bacteria is well established (Schweizer and Po, 1996, J. Bacteriol. 178:5215-5221; Abram et al, 2008, Appl. Environ. Microbiol. 74:594-604; Darbon et al, 2002, Mol. Microbiol. 43: 1039-1052; Weissenborn et al, 1992, J. Biol. Chem. 267:6122-6131). Glycerol utilization regulatory elements may also be introduced or inactivated in host methanotrophic bacteria for desired expression levels of exogenous nucleic acid molecules encoding glycerol utilization pathway enzymes.

Methods of screening are disclosed in Brock, supra. Selection methods for identifying allelic exchange mutants are known in the art (see, e.g., U.S. Patent Publication No. 2006/0057726, Stolyar et al, 1999, Microbiol. 145: 1235-1244; and Ali et al, 2006, Microbiology 152:2931-2942.

Culture Methods

A variety of culture methodologies may be used for the recombinant methanotrophic bacteria described herein. For example, methanotrophic bacteria may be grown by batch culture and continuous culture methodologies. In certain

embodiments, the cultures are grown in a controlled culture unit, such as a fermentor, bioreactor, hollow fiber membrane bioreactor, or the like.

A classical batch culturing method is a closed system where the composition of the media is set at the beginning of the culture and not subject to external alterations during the culture process. Thus, at the beginning of the culturing process, the media is inoculated with the desired methanotrophic bacteria and growth or metabolic activity is permitted to occur without adding anything to the system.

Typically, however, a "batch" culture is batch with respect to the addition of carbon source and attempts are often made at controlling factors such as pH and oxygen concentration. In batch systems, the metabolite and biomass compositions of the system change constantly up to the time the culture is terminated. Within batch cultures, cells moderate through a static lag phase to a high growth logarithmic phase and finally to a stationary phase where growth rate is diminished or halted. If untreated, cells in the stationary phase will eventually die. Cells in logarithmic growth phase are often responsible for the bulk production of end product or intermediate in some systems. Stationary or post-exponential phase production can be obtained in other systems.

The Fed-Batch system is a variation on the standard batch system. Fed- Batch culture processes comprise a typical batch system with the modification that the substrate is added in increments as the culture progresses. Fed-Batch systems are useful when catabolite repression is apt to inhibit the metabolism of the cells and where it is desirable to have limited amounts of substrate in the media. Measurement of the actual substrate concentration in Fed-Batch systems is difficult and is therefore estimated on the basis of the changes of measureable factors, such as H, dissolved oxygen, and the partial pressure of waste gases such as C0 2 . Batch and Fed-Batch culturing methods are common and known in the art (see, e.g., Thomas D. Brock, Biotechnology: A Textbook of Industrial Microbiology, 2 nd Ed. (1989) Sinauer Associates, Inc.,

Sunderland, MA; Deshpande, 1992, Appl. Biochem. Biotechnol. 36:227, herein incorporated by reference in its entirety).

Continuous cultures are "open" systems where a defined culture media is added continuously to a bioreactor and an equal amount of conditioned media is removed simultaneously for processing. Continuous cultures generally maintain the cells at a constant high liquid phase density where cells are primarily in logarithmic phase growth. Alternatively, continuous culture may be practiced with immobilized cells where carbon and nutrients are continuously added and valuable products, byproducts, and waste products are continuously removed from the cell mass. Cell immobilization may be performed using a wide range of solid supports composed of natural and/or synthetic materials.

Continuous or semi-continuous culture allows for the modulation of one factor or any number of factors that affect cell growth or end product concentration. For example, one method will maintain a limited nutrient, such as the carbon source or nitrogen level, at a fixed rate and allow all other parameters to modulate. In other systems, a number of factors affecting growth can be altered continuously while the cell concentration, measured by media turbidity, is kept constant. Continuous systems strive to maintain steady state growth conditions and thus the cell loss due to media being drawn off must be balanced against the cell growth rate in the culture. Methods of modulating nutrients and growth factors for continuous culture processes, as well as techniques for maximizing the rate of product formation, are well known in the art, and a variety of methods are detailed by Brock, supra.

Culture media must contain suitable carbon substrates for the

methanotrophic bacteria. A culture media may comprise a selected multi-carbon substrate (e.g., glycerol) as the sole carbon source for the non-naturally occurring methanotrophic bacteria as described herein. Alternatively, a culture media may comprise two or more carbon substrates (mixed carbon substrates). Mixed carbon substrates may comprise a mixture of a CI substrate and a multi-carbon substrate. For example, the culture media may comprise a mixture of glycerol and methane.

Alternatively, mixed carbon substrates may comprise a mixture of more than one multi- carbon substrate or more than one CI substrate or a combination thereof. For cultures containing mixed carbon substrates, a selected multi-carbon substrate (e.g., glycerol) is used as a primary carbon source by the recombinant methanotrophic bacteria as described herein. A carbon source, whether a multi-carbon substrate alone or a mixed composition, may be added to culture media initially, provided to culture media intermittently, or supplied continuously. Alternatively, recombinant methanotrophic bacteria may be initially grown in culture with methane as a sole carbon source and then a multi-carbon substrate added at a later time point to make a mixed carbon source or methane may be substituted by a multi-carbon substrate.

Glycerol compositions added to the culture may be purified glycerol or crude glycerol. Purified glycerol, the refined form used in pharmaceutical, food, and cosmetic industries, is at least 90%, 95%, 96%, 97%, 98%, 99%, or 99.5% pure. Crude glycerol is a by-product of biodiesel production that contains approximately 50-60%) of glycerol, 12-16%) of alkalies, especially in the form of alkali soaps and hydroxides, 15- 18%) of methyl esters, 8-12% of methanol, 2-3% water, and further components. Crude glycerol also contains a variety of elements, such as calcium, magnesium, phosphorus, or sulfur, originating from the primary oil. Larger quantities of sodium or potassium are also present, derived from the catalyst. Purified or crude glycerol may be added directly to the culture. Alternatively, impurities may be removed from crude glycerol by conventional separation techniques prior to addition to culture in order to increase the concentration of glycerol in solution to at least 50%>, 55%, 60%, 65%, 70%, 75%, 80%, 85%, or 90%.

Methods for Growing Recombinant Methanotrophic Bacteria

In certain embodiments, methods for growing recombinant methanotrophic bacteria are provided, comprising: culturing any of the embodiments of recombinant methanotrophic bacteria disclosed herein in the presence of a multi-carbon substrate, wherein the multi-carbon substrate is used as a primary carbon source by the recombinant obligate methanotrophic bacteria or wherein the multi-carbon substrate is used as a sole carbon source by the recombinant facultative methanotrophic bacteria. A multi-carbon substrate may include glucose, acetate, lactate, arabinose, citrate, succinate, and glycerol. In certain embodiments, a selected multi-carbon substrate is used as a sole carbon source. In certain embodiments, the multi-carbon substrate is used as a sole carbon source by the recombinant obligate methanotrophic bacteria. In certain embodiments wherein the multi-carbon substrate is glucose, the recombinant methanotrophic bacteria include at least one exogenous nucleic acid encoding a glucose transporter.

In certain embodiments wherein the multi-carbon substrate is acetate, the recombinant methanotrophic bacteria include at least one exogenous nucleic acid encoding an acetate transporter. In further embodiments, the recombinant

methanotrophic bacterium is further modified to overexpress acetyl-CoA synthase.

In certain embodiments wherein the multi-carbon substrate is lactate, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a lactate transporter and an exogenous nucleic acid encoding a lactate dehydrogenase.

In certain embodiments wherein the multi-carbon substrate is arabinose, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding an L-arabinose isomerase (e.g., AraA), an exogenous nucleic acid encoding an L- ribulose kinase (e.g., AraB), an exogenous nucleic acid encoding an L-ribulose-5- phosphate epimerase (e.g., AraD). An arabinose transporter includes AraE, AraFGH, and AraP.

In certain embodiments wherein the multi-carbon substrate is citrate, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a citrate transporter.

In certain embodiments wherein the multi-carbon substrate is succinate, the recombinant methanotrophic bacteria include an exogenous nucleic acid encoding a succinate transporter.

In certain embodiments wherein the multi-carbon substrate is glycerol, the recombinant methanotrophic bacteria include at least two exogenous nucleic acids encoding glycerol utilization pathway components. The at least two exogenous nucleic acids encoding glycerol utilization pathway components may comprise glycerol kinase and glycerol-3 -phosphate dehydrogenase. In a specific embodiment, glycerol kinase is GlpK and glycerol-3 -phosphate dehydrogenase is GlpD. In another specific embodiment, glycerol kinase comprises an amino acid sequence of SEQ ID NO:22 and glycerol-3 -phosphate dehydrogenase comprises an amino acid sequence of SEQ ID NO:43.

In certain embodiments, the recombinant methanotrophic bacteria includes three exogenous nucleic acids encoding glycerol utilization pathway components. The three exogenous nucleic acids encoding glycerol utilization pathway components may comprise glycerol uptake facilitator, glycerol kinase, and glycerol-3 - phosphate dehydrogenase. In a specific embodiment, glycerol uptake facilitator is GlpF, glycerol kinase is GlpK, and glycerol-3 -phosphate dehydrogenase is GlpD. In another specific embodiment, glycerol uptake facilitator comprises an amino acid sequence of SEQ ID NO: l, glycerol kinase comprises an amino acid sequence of SEQ ID NO:22, and glycerol-3 -phosphate dehydrogenase comprises an amino acid sequence of SEQ ID NO:43.

Obligate methanotrophic bacteria for use according to any of the embodied methods disclosed herein include Methylomonas, Methylobacter,

Methylococcus, Methylosinus, Methylocystis, Methylomicrobium, or Methanomonas. Exemplary obligate methanotrophic bacteria include: Methylococcus capsulatus Bath strain, Methylomonas 16a (ATCC PTA 2402), Methylosinus trichosporium OB3b (NRRL B-11,196), Methylosinus sporium (NRRL B-11,197), Methylocystis parvus (NRRL B-11,198), Methylomonas methanica (NRRL B-11,199), Methylomonas albus (NRRL B-11,200), Methylobacter capsulatus (NRRL B-11,201), Methylomonas sp AJ- 3670 (FERM P -2400), Methylacidiphilum infernorumQ, and Methylomicrobium alcaliphilum 20Z. Exemplary facultative methanotrophic bacteria for use according to any of the embodied methods disclosed herein include Methylocella silvestris, Methylocella palustris, Methylocella tundra, Methylocystis daltona, Methylocystis bryophila, Methylobacterium organophilum (ATCC 27,886), and Methylocapsa aurea.

In certain embodiments, one or more exogenous nucleic acids encoding a multi-carbon substrate utilization pathway component is codon optimized for high expression in methanotrophic bacteria.

Measuring Utilization of Multi-Carbon Substrates

Utilization of a selected multi-carbon substrate as a sole carbon source may be measured by determining growth rate, biomass yield, or increase in cell density using standard methods known in the art during culture of recombinant bacteria with the selected multi-carbon substrate as the sole carbon source.

Measuring Glycerol Utilization

Glycerol utilization by recombinant methanotrophic bacteria described herein may be determined by various methods known in the art. For example, enzyme activity assays of glycerol utilization pathway enzymes, e.g., glycerol kinase and glycerol-3 -phosphate dehydrogenase, have been previously described (Rawls et al, 2011, J. Bacteriol. 193:4469-4476; Rittman et al, 2008, Applied Environmental Microbiol. 74:6216-6222; Darbon et al, 1999, Microbiol. 145:3205-3212; Charrier et al, 1997, J. Biol. Chem. 272: 14166-14174). Dihydroxyacetone phosphate (DHAP) levels may also be assayed via a coupled enzyme system: reduction with NADH-consuming glycerol-3 -phosphate dehydrogenase enables determination of DHAP by measuring NADH concentration with UV spectroscopy (WO2007/003574).

The growth rate or biomass yield of recombinant methanotrophic using glycerol as a carbon source may also be measured as an indicator of glycerol utilization (Rittman et al., 2008, Applied Environmental Microbiol. 74:6216-6222; Muraka et al., 2008, Applied Environmental Microbiol. 74: 1124-1135).

Glycerol depletion from the culture medium may also be measured over time. At various time points, aliquots of culture broth containing glycerol and non- naturally occurring methanotrophic bacteria described herein may be withdrawn, centrifuged, filtered, and analyzed by high-performance liquid chromatography to determine consumption of glycerol during bacterial growth in the medium (Rawls et al., 2011, J. Bacteriol. 193:4469-4476; Gonzalez et al, 2008, Metabolic Engineering 10:234-245; Rittman et al, 2008, Applied Environmental Microbiol. 74:6216-6222). Alternatively, radio-labeled glycerol substrate may be added to the culture media, and after different time points, radioactivity remaining in the culture or incorporated into bacteria may be measured (Sher et al., 2004, FEMS Microbiol. Lett. 232:211-215; Muraka et al. 2008, Applied Environmental Microbiol. 74: 1124-1135). Glycerol concentration in bacterial cell culture supernatant may also be measured at various time points using a free glycerol assay kit (see, e.g., Product #FG0100 from Sigma, St.

Louis, MO or Product #ab65337 from Abeam, Cambridge MA).

Measuring Glucose Utilization

Glucose consumption may be measured by sampling culture medium over time for glucose or its corresponding fermentation by-products using HPLC (see, e.g., Garcia Sanchez et al, 2010, Biotechnology for Biofuels 3:13). Coupled enzyme assays, such as the one described in U.S. Patent 4,490,465 or Amplex® Red

Glucose/Glucose Oxidase Assay Kit (Invitrogen, Catalog #A22189), may be used to determine the amount of glucose in solution.

Measuring Acetate Utilization

Acetate agar may be used to test recombinant methanotrophic bacteria's ability to utilize acetate. The medium contains sodium acetate as the sole carbon source and inorganic ammonium salts as the sole nitrogen source. Bacterial growth is indicative of acetate utilization. When the bacteria metabolize acetate, the ammonium salts are broken down to ammonia, increasing alkalinity. The resulting increase in pH turns the bromothymol blue indicator in the medium from green to blue (see, e.g., Weyant et al. 1995, Identification of Unusual Pathogenic Gram-Negative Aerobic and Facultatively Anaerobic Bacteria, 2 nd ed., pp. 6-7, Williams & Wilkins, Baltimore, MD).

Acetate utilization may also be measuring using a method as described in Dedysh et al. (2005, J. Bacteriol. 187:4665-4670). Briefly, recombinant

methanotrophic bacteria are grown in basal salts medium DNMS (dilute nitrate mineral salts, pH 5.8) with sodium acetate at 0.04% wt/vol. Cultures are grown to an OD600 of >0.1 from an inoculation of <0.001 at 25°C on a rotary shaker at 120rpm. Samples are taken daily for determination of acetate concentrations, direct microscopic cell counts, OD600, and DNA extraction and quantitative real-time PCR to determination acetate utilization and bacterial cell growth. Acetate is measured on a Sykam high- performance liquid chromatography system with a refraction index detector.

Measurement of growth yield and carbon conversion efficiency on acetate substrate has also been described in U.S. Patent Publication 2012/0034594. Assay kits for detecting acetate are commercially available (e.g., Catalog #K-ACETRM from Megazyme, Wicklow, Ireland)

Measuring Lactate Utilization

Lactate uptake assays may be used to measure lactate utilization.

Briefly, radiolabeled lactate is added to cell culture and after different time intervals, samples are taken, filtered, washed, and counted on a scintillator (see, e.g., Nunez et al, 2001, Microbiol. 147: 1069-1077; Exley et al, 2007, Infect. Immun. 75: 1318-1324).

Lactate utilization may also be measuring by testing for conversion of lactate to pyruvate by lactate dehydrogenase and performing kinetic analysis (see, e.g., Garvie, 1980, Microbiol. Rev. 44: 106-139; Brown and Whiteley, 2009, PLoS One 4:e7864; Futai, 1973, Biochemistry 12:2468-2474; Futai and Kimura, 1977, J. Biol. Chem. 252:5820-5827; Kohn and Kaback, 1973, J. Biol. Chem. 248:7012-7017;

Molinari and Lara, 1960, Biochem. J. 75:57-65; Kline and Mahler, 1965, Ann. N.Y. Acad. Sci. 119:905-919). Coupled enzyme assays, such as the one described in U.S. Patent 4,490,465 may be used to determine the amount of lactate in solution.

Measuring Arabinose Utilization

Arabinose utilization may be tested using an arabinose uptake assay (see, e.g. Subtil and Boles, 2011, Biotechnology for Biofuels, 4:38; U.S. Patent Publication 2012/0129241 ; Poysti et al, 2001 , Microbiol. 153 :727-736). Briefly, recombinant methanotrophic bacteria are incubated with radiolabeled arabinose, and after different time intervals, cells are collected. The suspension is immediately filtered, washed, and radioactivity of the filtrate is measured in a scintillation counter.

Alternatively, arabinose utilization may be measured by determining L- arabinose isomerase (e.g., AraA), L-ribulose kinase (e.g., AraB), or L-ribulose 5- phosphate epimerase (e.g., AraD) enzyme activities (see, e.g., Sedlak and Ho, 2001 , Enzyme and Microbial Technol. 28: 16-24; Shamanna and Sanderson, 1979, J.

Bacteriol. 139:64-70; Lee et al, 1968, J. Biol. Chem. 243 :4700-4705;).

Measuring Citrate Utilization

Citrate utilization as a sole carbon source may be detecting using a medium containing sodium citrate, a pH indicator (e.g., bromothymol blue), and inorganic ammonium salts as a sole nitrogen source. During its metabolism, citrate is converted to oxaloacetate and acetate. Production of NaHC0 3 and NH 3 from the use of sodium citrate and ammonium salts results in alkaline conditions, which is detected by a change of the medium's color from green to blue.

Measuring Succinate Utilization

Succinate utilization may be measuring by a succinate uptake by cells during culture using 14 C-labeled succinate (see, e.g., Weiss, 1970 J. Bacteriol. 101 : 133- 137; Gutowski and Rosenberg, 1975, Biochem. J. 152:647-654; or Glenn et al, 1980, Microbiol. 1 19:267-271).

EXAMPLES

EXAMPLE 1 : RECOMBINANT METHYLOSINUS TRICHOSPORIUM ENGINEERED TO GROW ON GLYCEROL

Preparation of NMS Media.

MgS0 4 · 7H 2 0 l .O g

CaCl 2 · 6H 2 0 0.20 g

Chelated Iron Solution (see below) 2.0 ml

KN0 3 l .O g

Trace Element Solution (see below) 0.5 ml

KH 2 P0 4 0.272 g

Na2HP0 4 - 12H 2 0 0.717 g

Purified Agar (e.g., Oxoid L28) 12.5 g Distilled deionized water 1.0 L

Adjust pH to 6.8. Autoclave at 121°C for 15 minutes.

Chelated Iron Solution:

Ferric (III) ammonium citrate* 0.1 g

EDTA, sodium salt 0.2 g

HC1 (concentrated) 0.3 ml

Distilled deionized water 100.0 ml

*0.5 g of Ferric (III) chloride may be substituted.

Use 2.0 ml of this chelated iron solution per liter of final medium.

Trace Element Solution:

EDTA 500.0 mg

FeS0 4 · 7H 2 0 200.0 mg

ZnS0 4 · 7H 2 0 lO.O mg

MnCl 2 · 4H 2 0 3.0 mg

CoCl 2 · 6H 2 0 20.0 mg

CaCl 2 · 2H 2 0 1.0 mg

NiCl 2 · 6H 2 0 2.0 mg

Na 2 Mo0 4 · 2H 2 0 3.0 mg

Distilled water 1.0 L

Autoclave at 121 °C for 15 minutes.

Growth and Conjugations. The procedure for conjugating plasmids from E. coli into methanotrophs was based on the method developed by Martin, H. & Murrell, J. C. (1995). Methane monooxygenase mutants of Methylosinus trichosporium constructed by marker exchange mutagenesis. FEMS Microbiol. Lett. 127:243-248.

Briefly, a mobilizing plasmid to be conjugated was first transformed into E. coli SI 7-1 using standard electroporation methods. Transformation was confirmed by selection of kanamycin-resistant colonies on LB-agar containing 20 ug/mL kanamycin. Transformed colonies were inoculated into LB media containing 20 ug/mL kanamycin and shaken overnight at 37°C. A 10 mL aliquot of the overnight culture was then collected on a sterile 47 mm nitrocellulose filter (0.2 mm pore size). The E. coli donor strain was washed on the filter with 50 mL sterile NMS media to remove residual media and antibiotic.

In parallel, a sample of the M. trichosporium OB3b recipient strain was inoculated into lOOmL serum bottles containing 20-50mL NMS media. The headspace of the bottles was then flushed with a 1 : 1 mixture of oxygen and methane, and the bottles were sealed with butyl butyl rubber septa and crimped. The bottles were shaken continuously in a 30°C incubator until reaching an OD600 of approximately 0.3. The cells were then collected on the same filter as the E. coli donor strain. The filter was again washed with 50 mL of sterile NMS media. The filter was placed (cells up) on an NMS agar plate containing 0.02% (w/v) proteose peptone and incubated for 24 h at 30°C in the presence of methane and oxygen. After 24 h, cells were resuspended in 10 mL sterile (NMS) medium before being concentrated by centrifugation. The pellet was resuspended in 1 mL sterile NMS media. Aliquots (100 μΐ) were spread onto NMS agar plates containing 10 ug/mL kanamycin.

The plates were incubated in sealed chambers containing a 1 : 1 mixture of methane and oxygen maintained at 30°C. The gas mixture was replenished every 2 days until colonies formed, typically after 7-14 days. Colonies were streaked onto NMS plates containing kanamycin to confirm kanamycin resistance as well as to further isolate transformed methanotroph cells from residual E. coli donor cells.

Introduction of Glycerol Utilization Pathway. Nucleic acids encoding GlpK, GlpD, and GlpF from E. coli were codon optimized for expression in

Methylosinus trichosporium. The codon optimized nucleic acids encoding GlpK, GlpD, and GlpF are synthesized as an operon (SEQ ID NO:95; see Table 14 for components of operon) under control of an mdh promoter with appropriate intergenic regions (CAPITALIZED sequence) incorporating ribosome binding sequences.

Table 14: Glycerol Utilization Pathway Operon Codon Optimized for Methylosinus trichosporium

SEQ ID Gene Name Nucleotide Sequence

NO:#

ggcaccaaggtcaagtggatcctggaccatgtcgagggctcgcgcgagcgc gcgcgccgcggcgagctgctgttcggcaccgtcgacacctggctgatctgga agatgacccagggccgcgtccatgtcaccgactataccaacgcgtcgcgcac catgctgttcaacatccataccctggactgggacgacaagatgctggaggtcc tggacatcccgcgcgagatgctgccggaggtccgccgctcgtcggaggtcta tggccagaccaacatcggcggcaagggcggcacccgcatcccgatctcgg gcatcgcgggcgaccagcaggcggcgctgttcggccagctgtgcgtcaagg agggcatggcgaagaacacctatggcaccggctgcttcatgctgatgaacac cggcgagaaggcggtcaagtcggagaacggcctgctgaccaccatcgcgtg cggcccgaccggcgaggtcaactatgcgctggagggcgcggtcttcatggc gggcgcgtcgatccagtggctgcgcgacgagatgaagctgatcaacgacgc gtatgactcggagtatttcgcgaccaaggtccagaacaccaacggcgtctatg tcgtcccggcgttcaccggcctgggcgcgccgtattgggacccgtatgcgcg cggcgcgatcttcggcctgacccgcggcgtcaacgcgaaccatatcatccgc gcgaccctggagtcgatcgcgtatcagacccgcgacgtcctggaggcgatg caggcggactcgggcatccgcctgcatgcgctgcgcgtcgacggcggcgc ggtcgcgaacaacttcctgatgcagttccagtcggacatcctgggcacccgcg tcgagcgcccggaggtccgcgaggtcaccgcgctgggcgcggcgtatctgg cgggcctggcggtcggcttctggcagaacctggacgagctgcaggagaagg cggtcatcgagcgcgagttccgcccgggcatcgagaccaccgagcgcaact atcgctatgcgggctggaagaaggcggtcaagcgcgcgatggcgtgggag gagcatgactga

SEQ ID Intergenic TCATTCTTGGAGGAGACAC

NO:91 region

SEQ ID GlpD atggagaccaaggacctgatcgtcatcggcggcggcatcaacggcgcgggc NO:92 atcgcggcggacgcggcgggccgcggcctgtcggtcctgatgctggaggc gcaggacctggcgtgcgcgacctcgtcggcgtcgtcgaagctgatccatggc ggcctgcgctatctggagcattatgagttccgcctggtctcggaggcgctggc ggagcgcgaggtcctgctgaagatggcgccgcatatcgcgttcccgatgcgc ttccgcctgccgcatcgcccgcatctgcgcccggcgtggatgatccgcatcg gcctgttcatgtatgaccatctgggcaagcgcacctcgctgccgggctcgacc ggcctgcgcttcggcgcgaactcggtcctgaagccggagatcaagcgcggc ttcgagtattcggactgctgggtcgacgacgcgcgcctggtcctggcgaacg cgcagatggtcgtccgcaagggcggcgaggtcctgacccgcacccgcgcg acctcggcgcgccgcgagaacggcctgtggatcgtcgaggcggaggacat cgacaccggcaagaagtattcgtggcaggcgcgcggcctggtcaacgcgac cggcccgtgggtcaagcagttcttcgacgacggcatgcatctgccgtcgccgt atggcatccgcctgatcaagggctcgcatatcgtcgtcccgcgcgtccatacc cagaagcaggcgtatatcctgcagaacgaggacaagcgcatcgtcttcgtcat cccgtggatggacgagttctcgatcatcggcaccaccgacgtcgagtataag ggcgacccgaaggcggtcaagatcgaggagtcggagatcaactatctgctg aacgtctataacacccatttcaagaagcagctgtcgcgcgacgacatcgtctg gacctattcgggcgtccgcccgctgtgcgacgacgagtcggactcgccgca ggcgatcacccgcgactataccctggacatccatgacgagaacggcaaggc gccgctgctgtcggtcttcggcggcaagctgaccacctatcgcaagctggcg SEQ ID Gene Name Nucleotide Sequence

NO:#

gagcatgcgctggagaagctgaccccgtattatcagggcatcggcccggcgt ggaccaaggagtcggtcctgccgggcggcgcgatcgagggcgaccgcga cgactatgcggcgcgcctgcgccgccgctatccgttcctgaccgagtcgctg gcgcgccattatgcgcgcacctatggctcgaactcggagctgctgctgggca acgcgggcaccgtctcggacctgggcgaggacttcggccatgagttctatga ggcggagctgaagtatctggtcgaccatgagtgggtccgccgcgcggacga cgcgctgtggcgccgcaccaagcagggcatgtggctgaacgcggaccagc agtcgcgcgtctcgcagtggctggtcgagtatacccagcagcgcctgtcgctg gcgtcgtga

SEQ ID Intergenic TCATTCTTGGAGGAGACAC

NO:93 region

SEQ ID GlpF atgtcgcagacctcgaccctgaagggccagtgcatcgcggagttcctgggca NO:94 ccggcctgctgatcttcttcggcgtcggctgcgtcgcggcgctgaaggtcgcg ggcgcgtcgttcggccagtgggagatctcggtcatctggggcctgggcgtcg cgatggcgatctatctgaccgcgggcgtctcgggcgcgcatctgaacccggc ggtcaccatcgcgctgtggctgttcgcgtgcttcgacaagcgcaaggtcatcc cgttcatcgtctcgcaggtcgcgggcgcgttctgcgcggcggcgctggtctat ggcctgtattataacctgttcttcgacttcgagcagacccatcatatcgtccgcg gctcggtcgagtcggtcgacctggcgggcaccttctcgacctatccgaaccc gcatatcaacttcgtccaggcgttcgcggtcgagatggtcatcaccgcgatcct gatgggcctgatcctggcgctgaccgacgacggcaacggcgtcccgcgcg gcccgctggcgccgctgctgatcggcctgctgatcgcggtcatcggcgcgtc gatgggcccgctgaccggcttcgcgatgaacccggcgcgcgacttcggccc gaaggtcttcgcgtggctggcgggctggggcaacgtcgcgttcaccggcgg ccgcgacatcccgtatttcctggtcccgctgttcggcccgatcgtcggcgcgat cgtcggcgcgttcgcgtatcgcaagctgatcggccgccatctgccgtgcgac atctgcgtcgtcgaggagaaggagaccaccaccccgtcggagcagaaggc gtcgctgtga

SEQ ID Glycerol tttgcctcgatcggcggtccttgtgacagggagatattcccgacggatccggg NO:95 Utilization gcattcgagcggaaccgcccgccgtgggagtttttccagcgagcattcgaga

Pathway gtttttcaaggcggcttcgaggggttattccgtaacgccgccgacatgatctgtc Operon ccagaatctccgccgctgttcgtagagcgccgatgcagggtcggcatcaatca ttcttggaggagacacatgaccgagaagaagtatatcgtcgcgctggaccag ggcaccacctcgtcgcgcgcggtcgtcatggaccatgacgcgaacatcatct cggtctcgcagcgcgagttcgagcagatctatccgaagccgggctgggtcga gcatgacccgatggagatctgggcgacccagtcgtcgaccctggtcgaggtc ctggcgaaggcggacatctcgtcggaccagatcgcggcgatcggcatcacc aaccagcgcgagaccaccatcgtctgggagaaggagaccggcaagccgat ctataacgcgatcgtctggcagtgccgccgcaccgcggagatctgcgagcat ctgaagcgcgacggcctggaggactatatccgctcgaacaccggcctggtca tcgacccgtatttctcgggcaccaaggtcaagtggatcctggaccatgtcgag ggctcgcgcgagcgcgcgcgccgcggcgagctgctgttcggcaccgtcga cacctggctgatctggaagatgacccagggccgcgtccatgtcaccgactata ccaacgcgtcgcgcaccatgctgttcaacatccataccctggactgggacgac aagatgctggaggtcctggacatcccgcgcgagatgctgccggaggtccgc SEQ ID Gene Name Nucleotide Sequence

NO:#

cgctcgtcggaggtctatggccagaccaacatcggcggcaagggcggcacc cgcatcccgatctcgggcatcgcgggcgaccagcaggcggcgctgttcggc cagctgtgcgtcaaggagggcatggcgaagaacacctatggcaccggctgc ttcatgctgatgaacaccggcgagaaggcggtcaagtcggagaacggcctg ctgaccaccatcgcgtgcggcccgaccggcgaggtcaactatgcgctggag ggcgcggtcttcatggcgggcgcgtcgatccagtggctgcgcgacgagatg aagctgatcaacgacgcgtatgactcggagtatttcgcgaccaaggtccagaa caccaacggcgtctatgtcgtcccggcgttcaccggcctgggcgcgccgtatt gggacccgtatgcgcgcggcgcgatcttcggcctgacccgcggcgtcaacg cgaaccatatcatccgcgcgaccctggagtcgatcgcgtatcagacccgcga cgtcctggaggcgatgcaggcggactcgggcatccgcctgcatgcgctgcg cgtcgacggcggcgcggtcgcgaacaacttcctgatgcagttccagtcggac atcctgggcacccgcgtcgagcgcccggaggtccgcgaggtcaccgcgctg ggcgcggcgtatctggcgggcctggcggtcggcttctggcagaacctggac gagctgcaggagaaggcggtcatcgagcgcgagttccgcccgggcatcga gaccaccgagcgcaactatcgctatgcgggctggaagaaggcggtcaagcg cgcgatggcgtgggaggagcatgactgatcattcttggaggagacacatgga gaccaaggacctgatcgtcatcggcggcggcatcaacggcgcgggcatcgc ggcggacgcggcgggccgcggcctgtcggtcctgatgctggaggcgcagg acctggcgtgcgcgacctcgtcggcgtcgtcgaagctgatccatggcggcct gcgctatctggagcattatgagttccgcctggtctcggaggcgctggcggagc gcgaggtcctgctgaagatggcgccgcatatcgcgttcccgatgcgcttccgc ctgccgcatcgcccgcatctgcgcccggcgtggatgatccgcatcggcctgtt catgtatgaccatctgggcaagcgcacctcgctgccgggctcgaccggcctg cgcttcggcgcgaactcggtcctgaagccggagatcaagcgcggcttcgagt attcggactgctgggtcgacgacgcgcgcctggtcctggcgaacgcgcagat ggtcgtccgcaagggcggcgaggtcctgacccgcacccgcgcgacctcgg cgcgccgcgagaacggcctgtggatcgtcgaggcggaggacatcgacacc ggcaagaagtattcgtggcaggcgcgcggcctggtcaacgcgaccggccc gtgggtcaagcagttcttcgacgacggcatgcatctgccgtcgccgtatggca tccgcctgatcaagggctcgcatatcgtcgtcccgcgcgtccatacccagaag caggcgtatatcctgcagaacgaggacaagcgcatcgtcttcgtcatcccgtg gatggacgagttctcgatcatcggcaccaccgacgtcgagtataagggcgac ccgaaggcggtcaagatcgaggagtcggagatcaactatctgctgaacgtct ataacacccatttcaagaagcagctgtcgcgcgacgacatcgtctggacctatt cgggcgtccgcccgctgtgcgacgacgagtcggactcgccgcaggcgatca cccgcgactataccctggacatccatgacgagaacggcaaggcgccgctgct gtcggtcttcggcggcaagctgaccacctatcgcaagctggcggagcatgcg ctggagaagctgaccccgtattatcagggcatcggcccggcgtggaccaag gagtcggtcctgccgggcggcgcgatcgagggcgaccgcgacgactatgc ggcgcgcctgcgccgccgctatccgttcctgaccgagtcgctggcgcgccat tatgcgcgcacctatggctcgaactcggagctgctgctgggcaacgcgggca ccgtctcggacctgggcgaggacttcggccatgagttctatgaggcggagct gaagtatctggtcgaccatgagtgggtccgccgcgcggacgacgcgctgtg gcgccgcaccaagcagggcatgtggctgaacgcggaccagcagtcgcgcg SEQ ID Gene Name Nucleotide Sequence

NO:#

tctcgcagtggctggtcgagtatacccagcagcgcctgtcgctggcgtcgtga tcattcttggaggagacacatgtcgcagacctcgaccctgaagggccagtgca tcgcggagttcctgggcaccggcctgctgatcttcttcggcgtcggctgcgtcg cggcgctgaaggtcgcgggcgcgtcgttcggccagtgggagatctcggtcat ctggggcctgggcgtcgcgatggcgatctatctgaccgcgggcgtctcgggc gcgcatctgaacccggcggtcaccatcgcgctgtggctgttcgcgtgcttcga caagcgcaaggtcatcccgttcatcgtctcgcaggtcgcgggcgcgttctgcg cggcggcgctggtctatggcctgtattataacctgttcttcgacttcgagcagac ccatcatatcgtccgcggctcggtcgagtcggtcgacctggcgggcaccttct cgacctatccgaacccgcatatcaacttcgtccaggcgttcgcggtcgagatg gtcatcaccgcgatcctgatgggcctgatcctggcgctgaccgacgacggca acggcgtcccgcgcggcccgctggcgccgctgctgatcggcctgctgatcg cggtcatcggcgcgtcgatgggcccgctgaccggcttcgcgatgaacccgg cgcgcgacttcggcccgaaggtcttcgcgtggctggcgggctggggcaacg tcgcgttcaccggcggccgcgacatcccgtatttcctggtcccgctgttcggcc cgatcgtcggcgcgatcgtcggcgcgttcgcgtatcgcaagctgatcggccg ccatctgccgtgcgacatctgcgtcgtcgaggagaaggagaccaccacccc gtcggagcagaaggcgtcgctgtga

The synthetic operon is then cloned and transformed as described above. Transformation is confirmed by resistance of the cells to antibiotic selection, and gene expression is confirmed by northern blot (to confirm RNA transcription), western blot, or ELISA methods (to confirm protein expression).

Growth on glycerol as a sole carbon source. Recombinant M.

trichosporium transformed with a vector containing the synthetic operon encoding genes for glycerol utilization are inoculated into 1 OOmL shake flasks containing 20- 50mL NMS media, 1% glycerol and 10 ug/mL kanamycin. The flasks are then shaken continuously while being incubated at 30°C. Growth is confirmed by monitoring optical density of the culture over time. Note that because glycerol is the only carbon source provided to the cells, all cell mass produced must have been derived from glycerol.

EXAMPLE 2 : RECOMBINANT METHYLOCOCCUS CAPSULATUS BATH STRAIN

ENGINEERED TO GROW ON GLYCEROL.

Growth and Conjugations. The procedure for conjugating plasmids from E. coli into M. capsulatus was based on the method reported in Ali, H. & Murrell, J. C. (2009). Development and validation of promoter-probe vectors for the study of methane monooxygenase gene expression in Methylococcus capsulatus Bath.

Microbiology (2009), 155:761-771. Briefly, a mobilizing plasmid to be conjugated was first transformed into E. coli SI 7-1 using standard electroporation methods. Transformation was confirmed by selection of kanamycin-resistant colonies on LB-agar containing 20 ug/mL kanamycin. Transformed colonies were inoculated into LB media containing 20 ug/mL kanamycin and shaken overnight at 37°C. A 10 mL aliquot of the overnight culture was then collected on a sterile 47 mm nitrocellulose filter (0.2 mm pore size). The E. coli donor strain was washed on the filter with 50 mL sterile NMS to remove residual media and antibiotic.

In parallel, a sample of the M. capsulatus recipient strain was inoculated into lOOmL serum bottles containing 20-50mL NMS media. The headspace of the bottles was then flushed with a 1 : 1 mixture of oxygen and methane, and the bottles were sealed with butyl rubber septa and crimped. The bottles were shaken continuously in a 45°C incubator until reaching an OD 6 oo of approximately 0.3. The cells were then collected on the same filter as the E. coli donor strain. The filter was again washed with 50 mL of sterile NMS media. The filter was placed (cells up) on an NMS agar plate containing 0.02% (w/v) proteose peptone and incubated for 24 h at 37°C in the presence of methane and oxygen. After 24 h, cells were resuspended in 10 mL sterile (NMS) medium before being concentrated by centrifugation. The pellet was resuspended in 1 mL sterile NMS media. Aliquots (100 ul) were spread onto NMS agar plates containing 10 ug/mL kanamycin.

The plates were incubated in sealed chambers containing a 1 : 1 mixture of methane and oxygen maintained at 45°C. The gas mixture was replenished every 2 days until colonies formed, typically after 7-14 days. Colonies were streaked onto NMS plates containing kanamycin to confirm kanamycin resistance as well as to further isolate transformed methanotroph cells from residual E. coli donor cells.

Introduction of glycerol utilization pathway. Nucleic acid sequences encoding GlpK, GlpD, and GlpF from E. coli were codon optimized for expression in M. capsulatus. The codon optimized nucleic acids encoding GlpK, GlpD, and GlpF are synthesized as an operon (SEQ ID NO: 102; see Table 15 for components of operon) under control of an mdh promoter with appropriate intergenic regions (CAPITALIZED sequence) incorporating ribosome binding sequences.

Table 15: Glycerol Utilization Pathway Operon Codon Optimized for M. capsulatus SEQ ID Gene Nucleotide Sequence

NO:# Name

AACCGCCCGCCGTGGGAGTTTTTCCAGCGAGCA

TTCGAGAGTTTTTCAAGGCGGCTTCGAGGGGTT

ATTCCGTAACGCCGCCGACATGATCTGTCCCAG

AATCTCCGCCGCTGTTCGTAGAGCGCCGATGCA

GGGTCGGCATCAATCATTCTTGGAGGAGACAC

SEQ ID GlpK atgaccgagaagaagtacatcgtcgccctggaccagggcaccaccagcagc NO:97 cgcgccgtcgtcatggaccacgacgccaacatcatcagcgtcagccagcgc gagttcgagcagatctacccgaagccgggctgggtcgagcacgacccgatg gagatctgggccacccagagcagcaccctggtcgaggtcctggccaaggcc gacatcagcagcgaccagatcgccgccatcggcatcaccaaccagcgcga gaccaccatcgtctgggagaaggagaccggcaagccgatctacaacgccat cgtctggcagtgccgccgcaccgccgagatctgcgagcacctgaagcgcga cggcctggaggactacatccgcagcaacaccggcctggtcatcgacccgta cttcagcggcaccaaggtcaagtggatcctggaccacgtcgagggcagccg cgagcgcgcccgccgcggcgagctgctgttcggcaccgtcgacacctggct gatctggaagatgacccagggccgcgtccacgtcaccgactacaccaacgc cagccgcaccatgctgttcaacatccacaccctggactgggacgacaagatg ctggaggtcctggacatcccgcgcgagatgctgccggaggtccgccgcagc agcgaggtctacggccagaccaacatcggcggcaagggcggcacccgcat cccgatcagcggcatcgccggcgaccagcaggccgccctgttcggccagct gtgcgtcaaggagggcatggccaagaacacctacggcaccggctgcttcat gctgatgaacaccggcgagaaggccgtcaagagcgagaacggcctgctga ccaccatcgcctgcggcccgaccggcgaggtcaactacgccctggagggc gccgtcttcatggccggcgccagcatccagtggctgcgcgacgagatgaag ctgatcaacgacgcctacgacagcgagtacttcgccaccaaggtccagaaca ccaacggcgtctacgtcgtcccggccttcaccggcctgggcgccccgtactg ggacccgtacgcccgcggcgccatcttcggcctgacccgcggcgtcaacgc caaccacatcatccgcgccaccctggagagcatcgcctaccagacccgcga cgtcctggaggccatgcaggccgacagcggcatccgcctgcacgccctgcg cgtcgacggcggcgccgtcgccaacaacttcctgatgcagttccagagcgac atcctgggcacccgcgtcgagcgcccggaggtccgcgaggtcaccgccct gggcgccgcctacctggccggcctggccgtcggcttctggcagaacctgga cgagctgcaggagaaggccgtcatcgagcgcgagttccgcccgggcatcg agaccaccgagcgcaactaccgctacgccggctggaagaaggccgtcaag cgcgccatggcctgggaggagcacgacgagtga

SEQ ID Intergenic TCATTCTTGGAGGAGACAC

NO:98 region

SEQ ID GlpD atggagaccaaggacctgatcgtcatcggcggcggcatcaacggcgccgg NO:99 catcgccgccgacgccgccggccgcggcctgagcgtcctgatgctggagg cccaggacctggcctgcgccaccagcagcgccagcagcaagctgatccac ggcggcctgcgctacctggagcactacgagttccgcctggtcagcgaggcc ctggccgagcgcgaggtcctgctgaagatggccccgcacatcgccttcccg atgcgcttccgcctgccgcaccgcccgcacctgcgcccggcctggatgatcc gcatcggcctgttcatgtacgaccacctgggcaagcgcaccagcctgccgg gcagcaccggcctgcgcttcggcgccaacagcgtcctgaagccggagatca SEQ ID Gene Nucleotide Sequence

NO:# Name

agcgcggcttcgagtacagcgactgctgggtcgacgacgcccgcctggtcct ggccaacgcccagatggtcgtccgcaagggcggcgaggtcctgacccgca cccgcgccaccagcgcccgccgcgagaacggcctgtggatcgtcgaggcc gaggacatcgacaccggcaagaagtacagctggcaggcccgcggcctggt caacgccaccggcccgtgggtcaagcagttcttcgacgacggcatgcacctg ccgagcccgtacggcatccgcctgatcaagggcagccacatcgtcgtcccg cgcgtccacacccagaagcaggcctacatcctgcagaacgaggacaagcg catcgtcttcgtcatcccgtggatggacgagttcagcatcatcggcaccaccg acgtcgagtacaagggcgacccgaaggccgtcaagatcgaggagagcgag atcaactacctgctgaacgtctacaacacccacttcaagaagcagctgagccg cgacgacatcgtctggacctacagcggcgtccgcccgctgtgcgacgacga gagcgacagcccgcaggccatcacccgcgactacaccctggacatccacga cgagaacggcaaggccccgctgctgagcgtcttcggcggcaagctgaccac ctaccgcaagctggccgagcacgccctggagaagctgaccccgtactacca gggcatcggcccggcctggaccaaggagagcgtcctgccgggcggcgcc atcgagggcgaccgcgacgactacgccgcccgcctgcgccgccgctaccc gttcctgaccgagagcctggcccgccactacgcccgcacctacggcagcaa cagcgagctgctgctgggcaacgccggcaccgtcagcgacctgggcgagg acttcggccacgagttctacgaggccgagctgaagtacctggtcgaccacga gtgggtccgccgcgccgacgacgccctgtggcgccgcaccaagcagggca tgtggctgaacgccgaccagcagagccgcgtcagccagtggctggtcgagt acacccagcagcgcctgagcctggccagctga

SEQ ID Intergenic TCATTCTTGGAGGAGACAC

NO: 100 region

SEQ ID GlpF atgagccagaccagcaccctgaagggccagtgcatcgccgagttcctgggc NO: 101 accggcctgctgatcttcttcggcgtcggctgcgtcgccgccctgaaggtcgc cggcgccagcttcggccagtgggagatcagcgtcatctggggcctgggcgt cgccatggccatctacctgaccgccggcgtcagcggcgcccacctgaaccc ggccgtcaccatcgccctgtggctgttcgcctgcttcgacaagcgcaaggtca tcccgttcatcgtcagccaggtcgccggcgccttctgcgccgccgccctggtc tacggcctgtactacaacctgttcttcgacttcgagcagacccaccacatcgtc cgcggcagcgtcgagagcgtcgacctggccggcaccttcagcacctacccg aacccgcacatcaacttcgtccaggccttcgccgtcgagatggtcatcaccgc catcctgatgggcctgatcctggccctgaccgacgacggcaacggcgtccc gcgcggcccgctggccccgctgctgatcggcctgctgatcgccgtcatcggc gccagcatgggcccgctgaccggcttcgccatgaacccggcccgcgacttc ggcccgaaggtcttcgcctggctggccggctggggcaacgtcgccttcacc ggcggccgcgacatcccgtacttcctggtcccgctgttcggcccgatcgtcg gcgccatcgtcggcgccttcgcctaccgcaagctgatcggccgccacctgcc gtgcgacatctgcgtcgtcgaggagaaggagaccaccaccccgagcgagc agaaggccagcc t gt ga

SEQ ID Glycerol tttgcctcgatcggcggtccttgtgacagggagatattcccgacggatccggg NO: 102 Utilization gcattcgagcggaaccgcccgccgtgggagtttttccagcgagcattcgaga

Pathway gtttttcaaggcggcttcgaggggttattccgtaacgccgccgacatgatctgt Operon cccagaatctccgccgctgttcgtagagcgccgatgcagggtcggcatcaat SEQ ID Gene Nucleotide Sequence

NO:# Name

cattcttggaggagacacatgaccgagaagaagtacatcgtcgccctggacc agggcaccaccagcagccgcgccgtcgtcatggaccacgacgccaacatc atcagcgtcagccagcgcgagttcgagcagatctacccgaagccgggctgg gtcgagcacgacccgatggagatctgggccacccagagcagcaccctggtc gaggtcctggccaaggccgacatcagcagcgaccagatcgccgccatcgg catcaccaaccagcgcgagaccaccatcgtctgggagaaggagaccggca agccgatctacaacgccatcgtctggcagtgccgccgcaccgccgagatctg cgagcacctgaagcgcgacggcctggaggactacatccgcagcaacaccg gcctggtcatcgacccgtacttcagcggcaccaaggtcaagtggatcctgga ccacgtcgagggcagccgcgagcgcgcccgccgcggcgagctgctgttcg gcaccgtcgacacctggctgatctggaagatgacccagggccgcgtccacg tcaccgactacaccaacgccagccgcaccatgctgttcaacatccacaccctg gactgggacgacaagatgctggaggtcctggacatcccgcgcgagatgctg ccggaggtccgccgcagcagcgaggtctacggccagaccaacatcggcgg caagggcggcacccgcatcccgatcagcggcatcgccggcgaccagcagg ccgccctgttcggccagctgtgcgtcaaggagggcatggccaagaacacct acggcaccggctgcttcatgctgatgaacaccggcgagaaggccgtcaaga gcgagaacggcctgctgaccaccatcgcctgcggcccgaccggcgaggtc aactacgccctggagggcgccgtcttcatggccggcgccagcatccagtgg ctgcgcgacgagatgaagctgatcaacgacgcctacgacagcgagtacttcg ccaccaaggtccagaacaccaacggcgtctacgtcgtcccggccttcaccgg cctgggcgccccgtactgggacccgtacgcccgcggcgccatcttcggcct gacccgcggcgtcaacgccaaccacatcatccgcgccaccctggagagcat cgcctaccagacccgcgacgtcctggaggccatgcaggccgacagcggca tccgcctgcacgccctgcgcgtcgacggcggcgccgtcgccaacaacttcct gatgcagttccagagcgacatcctgggcacccgcgtcgagcgcccggaggt ccgcgaggtcaccgccctgggcgccgcctacctggccggcctggccgtcg gcttctggcagaacctggacgagctgcaggagaaggccgtcatcgagcgcg agttccgcccgggcatcgagaccaccgagcgcaactaccgctacgccggct ggaagaaggccgtcaagcgcgccatggcctgggaggagcacgacgagtg atcattcttggaggagacacatggagaccaaggacctgatcgtcatcggcgg cggcatcaacggcgccggcatcgccgccgacgccgccggccgcggcctg agcgtcctgatgctggaggcccaggacctggcctgcgccaccagcagcgcc agcagcaagctgatccacggcggcctgcgctacctggagcactacgagttcc gcctggtcagcgaggccctggccgagcgcgaggtcctgctgaagatggccc cgcacatcgccttcccgatgcgcttccgcctgccgcaccgcccgcacctgcg cccggcctggatgatccgcatcggcctgttcatgtacgaccacctgggcaag cgcaccagcctgccgggcagcaccggcctgcgcttcggcgccaacagcgt cctgaagccggagatcaagcgcggcttcgagtacagcgactgctgggtcga cgacgcccgcctggtcctggccaacgcccagatggtcgtccgcaagggcg gcgaggtcctgacccgcacccgcgccaccagcgcccgccgcgagaacgg cctgtggatcgtcgaggccgaggacatcgacaccggcaagaagtacagctg gcaggcccgcggcctggtcaacgccaccggcccgtgggtcaagcagttctt cgacgacggcatgcacctgccgagcccgtacggcatccgcctgatcaaggg cagccacatcgtcgtcccgcgcgtccacacccagaagcaggcctacatcctg SEQ ID Gene Nucleotide Sequence

NO:# Name

cagaacgaggacaagcgcatcgtcttcgtcatcccgtggatggacgagttca gcatcatcggcaccaccgacgtcgagtacaagggcgacccgaaggccgtca agatcgaggagagcgagatcaactacctgctgaacgtctacaacacccacttc aagaagcagctgagccgcgacgacatcgtctggacctacagcggcgtccgc ccgctgtgcgacgacgagagcgacagcccgcaggccatcacccgcgacta caccctggacatccacgacgagaacggcaaggccccgctgctgagcgtctt cggcggcaagctgaccacctaccgcaagctggccgagcacgccctggaga agctgaccccgtactaccagggcatcggcccggcctggaccaaggagagc gtcctgccgggcggcgccatcgagggcgaccgcgacgactacgccgcccg cctgcgccgccgctacccgttcctgaccgagagcctggcccgccactacgcc cgcacctacggcagcaacagcgagctgctgctgggcaacgccggcaccgt cagcgacctgggcgaggacttcggccacgagttctacgaggccgagctgaa gtacctggtcgaccacgagtgggtccgccgcgccgacgacgccctgtggcg ccgcaccaagcagggcatgtggctgaacgccgaccagcagagccgcgtca gccagtggctggtcgagtacacccagcagcgcctgagcctggccagctgat cattcttggaggagacacatgagccagaccagcaccctgaagggccagtgc atcgccgagttcctgggcaccggcctgctgatcttcttcggcgtcggctgcgtc gccgccctgaaggtcgccggcgccagcttcggccagtgggagatcagcgtc atctggggcctgggcgtcgccatggccatctacctgaccgccggcgtcagcg gcgcccacctgaacccggccgtcaccatcgccctgtggctgttcgcctgcttc gacaagcgcaaggtcatcccgttcatcgtcagccaggtcgccggcgccttct gcgccgccgccctggtctacggcctgtactacaacctgttcttcgacttcgagc agacccaccacatcgtccgcggcagcgtcgagagcgtcgacctggccggc accttcagcacctacccgaacccgcacatcaacttcgtccaggccttcgccgt cgagatggtcatcaccgccatcctgatgggcctgatcctggccctgaccgac gacggcaacggcgtcccgcgcggcccgctggccccgctgctgatcggcct gctgatcgccgtcatcggcgccagcatgggcccgctgaccggcttcgccatg aacccggcccgcgacttcggcccgaaggtcttcgcctggctggccggctgg ggcaacgtcgccttcaccggcggccgcgacatcccgtacttcctggtcccgct gttcggcccgatcgtcggcgccatcgtcggcgccttcgcctaccgcaagctg atcggccgccacctgccgtgcgacatctgcgtcgtcgaggagaaggagacc accaccccgagcgagcagaaggccagcctgtga

The synthetic operon is then cloned and transformed into M. capsulatus as described above. Transformation is confirmed by resistance of the cells to antibiotic selection, and gene expression is confirmed by northern blot (to confirm RNA transcription), western blot, or ELISA methods (to confirm protein expression).

Growth on glycerol as a sole carbon source. Recombinant M.

capsulatus transformed with a vector containing the synthetic operon encoding genes for glycerol utilization are inoculated into lOOmL shake fiasks containing 20-5 OmL NMS media, 1% glycerol and 10 ug/mL kanamycin. The flasks are then shaken continuously while being incubated at 42°C. Growth is confirmed by monitoring optical density of the culture over time. Note that because glycerol is the only carbon source provided to the cells, all cell mass produced must have been derived from glycerol.

EXAMPLE 3: RECOMBINANT METHYLOMONAS METHANICA ENGINEERED TO GROW ON GLYCEROL

Growth and Conjugations. The procedure for growth and conjugation of Methylomonas methanica was performed essentially identically to the procedures described for M. capsulatus (above).

Introduction of glycerol utilization pathway. Nucleic acid sequences encoding GlpK, GlpD, and GlpF from E. coli were codon optimized for expression in M. methanica. The codon optimized nucleic acids encoding GlpK, GlpD, and GlpF are synthesized as an operon (SEQ ID NO: 109; see Table 16 for components of operon) under control of an hps promoter with appropriate intergenic regions (CAPITALIZED sequence) incorporating ribosome binding sequences.

Table 16: Glycerol Utilization Pathway Operon Codon Optimized for M. methanica

SEQ ID Gene Name Nucleotide Sequence

NO:#

cggcaaaccgatctataacgcgatcgtctggcaatgccgccgcaccg cggaaatctgcgaacacttgaaacgcgatggcttggaagattatatcc gcagcaacaccggcttggtcatcgatccgtatttcagcggcaccaaa gtcaaatggatcttggatcacgtcgaaggcagccgcgaacgcgcgc gccgcggcgaattgttgttcggcaccgtcgatacctggttgatctgga aaatgacccaaggccgcgtccacgtcaccgattataccaacgcgagc cgcaccatgttgttcaacatccacaccttggattgggatgataaaatgtt ggaagtcttggatatcccgcgcgaaatgttgccggaagtccgccgca gcagcgaagtctatggccaaaccaacatcggcggcaaaggcggca cccgcatcccgatcagcggcatcgcgggcgatcaacaagcggcgtt gttcggccaattgtgcgtcaaagaaggcatggcgaaaaacacctatg gcaccggctgcttcatgttgatgaacaccggcgaaaaagcggtcaaa agcgaaaacggcttgttgaccaccatcgcgtgcggcccgaccggcg aagtcaactatgcgttggaaggcgcggtcttcatggcgggcgcgagc atccaatggttgcgcgatgaaatgaaattgatcaacgatgcgtatgata gcgaatatttcgcgaccaaagtccaaaacaccaacggcgtctatgtcg tcccggcgttcaccggcttgggcgcgccgtattgggatccgtatgcgc gcggcgcgatcttcggcttgacccgcggcgtcaacgcgaaccacat catccgcgcgaccttggaaagcatcgcgtatcaaacccgcgatgtctt ggaagcgatgcaagcggatagcggcatccgcttgcacgcgttgcgc gtcgatggcggcgcggtcgcgaacaacttcttgatgcaattccaaagc gatatcttgggcacccgcgtcgaacgcccggaagtccgcgaagtca ccgcgttgggcgcggcgtatttggcgggcttggcggtcggcttctgg caaaacttggatgaattgcaagaaaaagcggtcatcgaacgcgaattc cgcccgggcatcgaaaccaccgaacgcaactatcgctatgcgggct ggaaaaaagcggtcaaacgcgcgatggcgtgggaagaacacgatg i it i i

SEQ ID Intergenic TAATAATTAAAAACTTTCGGAGCACATCAC NO: 105 region

SEQ ID GlpD atggaaaccaaagatttgatcgtcatcggcggcggcatcaacggcgc NO: 106 gggcatcgcggcggatgcggcgggccgcggcttgagcgtcttgatg ttggaagcgcaagatttggcgtgcgcgaccagcagcgcgagcagca aattgatccacggcggcttgcgctatttggaacactatgaattccgcttg gtcagcgaagcgttggcggaacgcgaagtcttgttgaaaatggcgcc gcacatcgcgttcccgatgcgcttccgcttgccgcaccgcccgcactt gcgcccggcgtggatgatccgcatcggcttgttcatgtatgatcacttg ggcaaacgcaccagcttgccgggcagcaccggcttgcgcttcggcg cgaacagcgtcttgaaaccggaaatcaaacgcggcttcgaatatagc gattgctgggtcgatgatgcgcgcttggtcttggcgaacgcgcaaatg gtcgtccgcaaaggcggcgaagtcttgacccgcacccgcgcgacca gcgcgcgccgcgaaaacggcttgtggatcgtcgaagcggaagatat cgataccggcaaaaaatatagctggcaagcgcgcggcttggtcaac gcgaccggcccgtgggtcaaacaattcttcgatgatggcatgcacttg ccgagcccgtatggcatccgcttgatcaaaggcagccacatcgtcgt cccgcgcgtccacacccaaaaacaagcgtatatcttgcaaaacgaag SEQ ID Gene Name Nucleotide Sequence

NO:#

ataaacgcatcgtcttcgtcatcccgtggatggatgaattcagcatcatc ggcaccaccgatgtcgaatataaaggcgatccgaaagcggtcaaaat cgaagaaagcgaaatcaactatttgttgaacgtctataacacccacttc aaaaaacaattgagccgcgatgatatcgtctggacctatagcggcgtc cgcccgttgtgcgatgatgaaagcgatagcccgcaagcgatcaccc gcgattataccttggatatccacgatgaaaacggcaaagcgccgttgtt gagcgtcttcggcggcaaattgaccacctatcgcaaattggcggaac acgcgttggaaaaattgaccccgtattatcaaggcatcggcccggcgt ggaccaaagaaagcgtcttgccgggcggcgcgatcgaaggcgatc gcgatgattatgcggcgcgcttgcgccgccgctatccgttcttgaccg aaagcttggcgcgccactatgcgcgcacctatggcagcaacagcga attgttgttgggcaacgcgggcaccgtcagcgatttgggcgaagattt cggccacgaattctatgaagcggaattgaaatatttggtcgatcacgaa tgggtccgccgcgcggatgatgcgttgtggcgccgcaccaaacaag gcatgtggttgaacgcggatcaacaaagccgcgtcagccaatggttg gtcgaatatacccaacaacgcttgagcttggcgagctaa

SEQ ID Intergenic TAATAATTAAAAACTTTCGGAGCACATCAC NO: 107 region

SEQ ID GlpF atgagccaaaccagcaccttgaaaggccaatgcatcgcggaattcttg NO: 108 ggcaccggcttgttgatcttcttcggcgtcggctgcgtcgcggcgttga aagtcgcgggcgcgagcttcggccaatgggaaatcagcgtcatctg gggcttgggcgtcgcgatggcgatctatttgaccgcgggcgtcagcg gcgcgcacttgaacccggcggtcaccatcgcgttgtggttgttcgcgt gcttcgataaacgcaaagtcatcccgttcatcgtcagccaagtcgcgg gcgcgttctgcgcggcggcgttggtctatggcttgtattataacttgttct tcgatttcgaacaaacccaccacatcgtccgcggcagcgtcgaaagc gtcgatttggcgggcaccttcagcacctatccgaacccgcacatcaac ttcgtccaagcgttcgcggtcgaaatggtcatcaccgcgatcttgatgg gcttgatcttggcgttgaccgatgatggcaacggcgtcccgcgcggc ccgttggcgccgttgttgatcggcttgttgatcgcggtcatcggcgcga gcatgggcccgttgaccggcttcgcgatgaacccggcgcgcgatttc ggcccgaaagtcttcgcgtggttggcgggctggggcaacgtcgcgtt caccggcggccgcgatatcccgtatttcttggtcccgttgttcggcccg atcgtcggcgcgatcgtcggcgcgttcgcgtatcgcaaattgatcggc cgccacttgccgtgcgatatctgcgtcgtcgaagaaaaagaaaccac caccccgagcgaacaaaaagcgagct t gt aa

SEQ ID Glycerol TTCGGAATCCCTGACGGGAATTGGCCCGA NO: 109 Utilization AGAAGGCAGATGCCATCGTTCAGTATCGA

Pathway AAGGAACATGGGGATTTTCAGTCATTGAA Operon GGATCTGGAGAATGTCAGCGGCATTGGCG

AGAAAACCCTTCAGGCCAATGAAAAAGAC

ATTCGCTTCACGGATGATTTGAGCGATAAG

TCATCCGCGGAAAAAGGTGCGGTAGCTGT

GGATAAAAAAGGCGCCAGATAGTAAGCGC

TAAGGATTGGGGTGCGTCGCCGGTCGCGG SEQ ID Gene Name Nucleotide Sequence

NO:#

CGGCGCTCCTCGACGGCAGAGTTGGTGCC

AGGTTGGCGGATGATTGATGCCGAATATT

ACGCGACCAATTCTCGAGGCAAATGAACT

GTGAGCTACTGAGTTGCAGGCATTGACAG

CCATCCCATTTCTATCATACAGTTACGGAC

GC ATC AC G AGT AGGTG AT AAGC CT AGC AG

ATTGCGGCAGTTGGCAAAATCAGCTATTAC

TAATAATTAAAAACTTTCGGAGCACATCAC

atgaccgaaaaaaaatatatcgtcgcgttggatcaaggcaccaccag cagccgcgcggtcgtcatggatcacgatgcgaacatcatcagcgtca gccaacgcgaattcgaacaaatctatccgaaaccgggctgggtcgaa cacgatccgatggaaatctgggcgacccaaagcagcaccttggtcga agtcttggcgaaagcggatatcagcagcgatcaaatcgcggcgatcg gcatcaccaaccaacgcgaaaccaccatcgtctgggaaaaagaaac cggcaaaccgatctataacgcgatcgtctggcaatgccgccgcaccg cggaaatctgcgaacacttgaaacgcgatggcttggaagattatatcc gcagcaacaccggcttggtcatcgatccgtatttcagcggcaccaaa gtcaaatggatcttggatcacgtcgaaggcagccgcgaacgcgcgc gccgcggcgaattgttgttcggcaccgtcgatacctggttgatctgga aaatgacccaaggccgcgtccacgtcaccgattataccaacgcgagc cgcaccatgttgttcaacatccacaccttggattgggatgataaaatgtt ggaagtcttggatatcccgcgcgaaatgttgccggaagtccgccgca gcagcgaagtctatggccaaaccaacatcggcggcaaaggcggca cccgcatcccgatcagcggcatcgcgggcgatcaacaagcggcgtt gttcggccaattgtgcgtcaaagaaggcatggcgaaaaacacctatg gcaccggctgcttcatgttgatgaacaccggcgaaaaagcggtcaaa agcgaaaacggcttgttgaccaccatcgcgtgcggcccgaccggcg aagtcaactatgcgttggaaggcgcggtcttcatggcgggcgcgagc atccaatggttgcgcgatgaaatgaaattgatcaacgatgcgtatgata gcgaatatttcgcgaccaaagtccaaaacaccaacggcgtctatgtcg tcccggcgttcaccggcttgggcgcgccgtattgggatccgtatgcgc gcggcgcgatcttcggcttgacccgcggcgtcaacgcgaaccacat catccgcgcgaccttggaaagcatcgcgtatcaaacccgcgatgtctt ggaagcgatgcaagcggatagcggcatccgcttgcacgcgttgcgc gtcgatggcggcgcggtcgcgaacaacttcttgatgcaattccaaagc gatatcttgggcacccgcgtcgaacgcccggaagtccgcgaagtca ccgcgttgggcgcggcgtatttggcgggcttggcggtcggcttctgg caaaacttggatgaattgcaagaaaaagcggtcatcgaacgcgaattc cgcccgggcatcgaaaccaccgaacgcaactatcgctatgcgggct ggaaaaaagcggtcaaacgcgcgatggcgtgggaagaacacgatg aataaT AAT AATT AAAAACTTTC GG AGC AC AT

CACatggaaaccaaagatttgatcgtcatcggcggcggcatcaac ggcgcgggcatcgcggcggatgcggcgggccgcggcttgagcgtc ttgatgttggaagcgcaagatttggcgtgcgcgaccagcagcgcgag cagcaaattgatccacggcggcttgcgctatttggaacactatgaattc SEQ ID Gene Name Nucleotide Sequence

NO:#

cgcttggtcagcgaagcgttggcggaacgcgaagtcttgttgaaaatg gcgccgcacatcgcgttcccgatgcgcttccgcttgccgcaccgccc gcacttgcgcccggcgtggatgatccgcatcggcttgttcatgtatgat cacttgggcaaacgcaccagcttgccgggcagcaccggcttgcgctt cggcgcgaacagcgtcttgaaaccggaaatcaaacgcggcttcgaat atagcgattgctgggtcgatgatgcgcgcttggtcttggcgaacgcgc aaatggtcgtccgcaaaggcggcgaagtcttgacccgcacccgcgc gaccagcgcgcgccgcgaaaacggcttgtggatcgtcgaagcgga agatatcgataccggcaaaaaatatagctggcaagcgcgcggcttgg tcaacgcgaccggcccgtgggtcaaacaattcttcgatgatggcatgc acttgccgagcccgtatggcatccgcttgatcaaaggcagccacatc gtcgtcccgcgcgtccacacccaaaaacaagcgtatatcttgcaaaac gaagataaacgcatcgtcttcgtcatcccgtggatggatgaattcagca tcatcggcaccaccgatgtcgaatataaaggcgatccgaaagcggtc aaaatcgaagaaagcgaaatcaactatttgttgaacgtctataacaccc acttcaaaaaacaattgagccgcgatgatatcgtctggacctatagcg gcgtccgcccgttgtgcgatgatgaaagcgatagcccgcaagcgatc acccgcgattataccttggatatccacgatgaaaacggcaaagcgcc gttgttgagcgtcttcggcggcaaattgaccacctatcgcaaattggcg gaacacgcgttggaaaaattgaccccgtattatcaaggcatcggccc ggcgtggaccaaagaaagcgtcttgccgggcggcgcgatcgaagg cgatcgcgatgattatgcggcgcgcttgcgccgccgctatccgttcttg accgaaagcttggcgcgccactatgcgcgcacctatggcagcaaca gcgaattgttgttgggcaacgcgggcaccgtcagcgatttgggcgaa gatttcggccacgaattctatgaagcggaattgaaatatttggtcgatca cgaatgggtccgccgcgcggatgatgcgttgtggcgccgcaccaaa caaggcatgtggttgaacgcggatcaacaaagccgcgtcagccaat ggttggtcgaatatacccaacaacgcttgagcttggcgagctaaTA

ATAATTAAAAACTTTCGGAGCACATCACatg agccaaaccagcaccttgaaaggccaatgcatcgcggaattcttggg caccggcttgttgatcttcttcggcgtcggctgcgtcgcggcgttgaaa gtcgcgggcgcgagcttcggccaatgggaaatcagcgtcatctggg gcttgggcgtcgcgatggcgatctatttgaccgcgggcgtcagcggc gcgcacttgaacccggcggtcaccatcgcgttgtggttgttcgcgtgc ttcgataaacgcaaagtcatcccgttcatcgtcagccaagtcgcgggc gcgttctgcgcggcggcgttggtctatggcttgtattataacttgttcttc gatttcgaacaaacccaccacatcgtccgcggcagcgtcgaaagcgt cgatttggcgggcaccttcagcacctatccgaacccgcacatcaactt cgtccaagcgttcgcggtcgaaatggtcatcaccgcgatcttgatggg cttgatcttggcgttgaccgatgatggcaacggcgtcccgcgcggcc cgttggcgccgttgttgatcggcttgttgatcgcggtcatcggcgcga gcatgggcccgttgaccggcttcgcgatgaacccggcgcgcgatttc ggcccgaaagtcttcgcgtggttggcgggctggggcaacgtcgcgtt caccggcggccgcgatatcccgtatttcttggtcccgttgttcggcccg atcgtcggcgcgatcgtcggcgcgttcgcgtatcgcaaattgatcggc SEQ ID Gene Name Nucleotide Sequence

NO:#

cgccacttgccgtgcgatatctgcgtcgtcgaagaaaaagaaaccac caccccgagcgaacaaaaagcgagcttgtaa

The synthetic operon is then cloned and transformed into M. methanica as described above. Transformation is confirmed by resistance of the cells to antibiotic selection, and gene expression is confirmed by northern blot (to confirm RNA transcription), western blot, or ELISA methods (to confirm protein expression).

Growth on glycerol as a sole carbon source. Recombinant M.

methanica transformed with a vector containing the synthetic operon encoding genes for glycerol utilization are inoculated into lOOmL shake flasks containing 20-5 OmL NMS media, 1% glycerol and 10 ug/mL kanamycin. The flasks are then shaken continuously while being incubated at 30°C. Growth is confirmed by monitoring optical density of the culture over time. Note that because glycerol is the only carbon source provided to the cells, all cell mass produced must have been derived from glycerol.

EXAMPLE 4 : RECOMBINANT METHYLOSINUS TRICHOSPORIUM ENGINEERED TO GROW ON ACETATE

M. trichosporium cells are cultured and conjugated as described above.

Introduction of an Acetate Utilization Pathway. Nucleic acid sequences encoding AcsA (acetyl-CoA synthase) and ActP from E. coli were codon optimized for expression in M. trichosporium. The codon optimized nucleic acids encoding AcsA and ActP are synthesized as an operon (SEQ ID NO: l 14; see Table 17 for components of operon) under control of an mdh promoter with appropriate intergenic regions (CAPITALIZED sequence) incorporating ribosome binding sequences.

Table 17: Acetate Utilization Pathway Operon Codon Optimized for M. trichosporium

SEQ ID Gene Name Nucleotide Sequence

NO:#

SEQ ID AcsA atgtcgcagatccataagcataccatcccggcgaacatcgcggaccgctgc NO: l l l ctgatcaacccgcagcagtatgaggcgatgtatcagcagtcgatcaacgtcc cggacaccttctggggcgagcagggcaagatcctggactggatcaagccgt atcagaaggtcaagaacacctcgttcgcgccgggcaacgtctcgatcaagt ggtatgaggacggcaccctgaacctggcggcgaactgcctggaccgccat ctgcaggagaacggcgaccgcaccgcgatcatctgggagggcgacgacg cgtcgcagtcgaagcatatctcgtataaggagctgcatcgcgacgtctgccg cttcgcgaacaccctgctggagctgggcatcaagaagggcgacgtcgtcgc gatctatatgccgatggtcccggaggcggcggtcgcgatgctggcgtgcgc gcgcatcggcgcggtccattcggtcatcttcggcggcttctcgccggaggcg gtcgcgggccgcatcatcgactcgaactcgcgcctggtcatcacctcggac gagggcgtccgcgcgggccgctcgatcccgctgaagaagaacgtcgacg acgcgctgaagaacccgaacgtcacctcggtcgagcatgtcgtcgtcctga agcgcaccggcggcaagatcgactggcaggagggccgcgacctgtggtg gcatgacctggtcgagcaggcgtcggaccagcatcaggcggaggagatga acgcggaggacccgctgttcatcctgtatacctcgggctcgaccggcaagc cgaagggcgtcctgcataccaccggcggctatctggtctatgcggcgctga ccttcaagtatgtcttcgactatcatccgggcgacatctattggtgcaccgcgg acgtcggctgggtcaccggccattcgtatctgctgtatggcccgctggcgtg cggcgcgaccaccctgatgttcgagggcgtcccgaactggccgaccccgg cgcgcatggcgcaggtcgtcgacaagcatcaggtcaacatcctgtataccg cgccgaccgcgatccgcgcgctgatggcggagggcgacaaggcgatcga gggcaccgaccgctcgtcgctgcgcatcctgggctcggtcggcgagccga tcaacccggaggcgtgggagtggtattggaagaagatcggcaacgagaag tgcccggtcgtcgacacctggtggcagaccgagaccggcggcttcatgatc accccgctgccgggcgcgaccgagctgaaggcgggctcggcgacccgcc cgttcttcggcgtccagccggcgctggtcgacaacgagggcaacccgctgg agggcgcgaccgagggctcgctggtcatcaccgactcgtggccgggccag gcgcgcaccctgttcggcgaccatgagcgcttcgagcagacctatttctcga ccttcaagaacatgtatttctcgggcgacggcgcgcgccgcgacgaggacg gctattattggatcaccggccgcgtcgacgacgtcctgaacgtctcgggcca tcgcctgggcaccgcggagatcgagtcggcgctggtcgcgcatccgaaga tcgcggaggcggcggtcgtcggcatcccgcataacatcaagggccaggcg atctatgcgtatgtcaccctgaaccatggcgaggagccgtcgccggagctgt atgcggaggtccgcaactgggtccgcaaggagatcggcccgctggcgacc ccggacgtcctgcattggaccgactcgctgccgaagacccgctcgggcaa gatcatgcgccgcatcctgcgcaagatcgcggcgggcgacacctcgaacct gggcgacacctcgaccctggcggacccgggcgtcgtcgagaagctgct ggaggagaagcaggcgatcgcgatgccgtcgtga

SEQ ID Intergenic TCATTCTTGGAGGAGACAC

N0: 112 Region

SEQ ID ActP atgaagcgcgtcctgaccgcgctggcggcgaccctgccgttcgcggcgaa N0: 113 cgcggcggacgcgatctcgggcgcggtcgagcgccagccgaccaactgg caggcgatcatcatgttcctgatcttcgtcgtcttcaccctgggcatcacctatt gggcgtcgaagcgcgtccgctcgcgctcggactattataccgcgggcggc SEQ ID Gene Name Nucleotide Sequence

NO:#

aacatcaccggcttccagaacggcctggcgatcgcgggcgactatatgtcg gcggcgtcgttcctgggcatctcggcgctggtcttcacctcgggctatgacg gcctgatctattcgctgggcttcctggtcggctggccgatcatcctgttcctgat cgcggagcgcctgcgcaacctgggccgctataccttcgcggacgtcgcgtc gtatcgcctgaagcagggcccgatccgcatcctgtcggcgtgcggctcgct ggtcgtcgtcgcgctgtatctgatcgcgcagatggtcggcgcgggcaagct gatcgagctgctgttcggcctgaactatcatatcgcggtcgtcctggtcggcg tcctgatgatgatgtatgtcctgttcggcggcatgctggcgaccacctgggtc cagatcatcaaggcggtcctgctgctgttcggcgcgtcgttcatggcgttcat ggtcatgaagcatgtcggcttctcgttcaacaacctgttctcggaggcgatgg cggtccatccgaagggcgtcgacatcatgaagccgggcggcctggtcaag gacccgatctcggcgctgtcgctgggcctgggcctgatgttcggcaccgcg ggcctgccgcatatcctgatgcgcttcttcaccgtctcggacgcgcgcgagg cgcgcaagtcggtcttctatgcgaccggcttcatgggctatttctatatcctgac cttcatcatcggcttcggcgcgatcatgctggtcggcgcgaacccggagtat aaggacgcggcgggccatctgatcggcggcaacaacatggcggcggtcc atctggcgaacgcggtcggcggcaacctgttcctgggcttcatctcggcggt cgcgttcgcgaccatcctggcggtcgtcgcgggcctgaccctggcgggcg cgtcggcggtctcgcatgacctgtatgcgaacgtcttcaagaagggcgcga ccgagcgcgaggagctgcgcgtctcgaagatcaccgtcctgatcctgggcg tcatcgcgatcatcctgggcgtcctgttcgagaaccagaacatcgcgttcatg gtcggcctggcgttcgcgatcgcggcgtcgtgcaacttcccgatcatcctgct gtcgatgtattggtcgaagctgaccacccgcggcgcgatgatgggcggctg gctgggcctgatcaccgcggtcgtcctgatgatcctgggcccgaccatctgg gtccagatcctgggccatgagaaggcgatcttcccgtatgagtatccggcgc tgttctcgatcaccgtcgcgttcctgggcatctggttcttctcggcgaccgaca actcggcggagggcgcgcgcgagcgcgagctgttccgcgcgcagttcatc

CgctcgcagaccggCttcggcgtcgagcagggccgcgcgcattga

SEQ ID Acetate TTTGCCTCGATCGGCGGTCCTTGTGACAGGGA NO: 114 Utilization GATATTCCCGACGGATCCGGGGCATTCGAGCG

Pathway GAACCGCCCGCCGTGGGAGTTTTTCCAGCGAG Operon CATTCGAGAGTTTTTCAAGGCGGCTTCGAGGG

GTTATTCCGTAACGCCGCCGACATGATCTGTCC

CAGAATCTCCGCCGCTGTTCGTAGAGCGCCGA

TGCAGGGTCGGCATCAATCATTCTTGGAGGAG

ACACatgtcgcagatccataagcataccatcccggcgaacatcgcggac cgctgcctgatcaacccgcagcagtatgaggcgatgtatcagcagtcgatca acgtcccggacaccttctggggcgagcagggcaagatcctggactggatca agccgtatcagaaggtcaagaacacctcgttcgcgccgggcaacgtctcga tcaagtggtatgaggacggcaccctgaacctggcggcgaactgcctggacc gccatctgcaggagaacggcgaccgcaccgcgatcatctgggagggcga cgacgcgtcgcagtcgaagcatatctcgtataaggagctgcatcgcgacgtc tgccgcttcgcgaacaccctgctggagctgggcatcaagaagggcgacgtc gtcgcgatctatatgccgatggtcccggaggcggcggtcgcgatgctggcg tgcgcgcgcatcggcgcggtccattcggtcatcttcggcggcttctcgccgg SEQ ID Gene Name Nucleotide Sequence

NO:#

aggcggtcgcgggccgcatcatcgactcgaactcgcgcctggtcatcacct cggacgagggcgtccgcgcgggccgctcgatcccgctgaagaagaacgt cgacgacgcgctgaagaacccgaacgtcacctcggtcgagcatgtcgtcgt cctgaagcgcaccggcggcaagatcgactggcaggagggccgcgacctg tggtggcatgacctggtcgagcaggcgtcggaccagcatcaggcggagga gatgaacgcggaggacccgctgttcatcctgtatacctcgggctcgaccggc aagccgaagggcgtcctgcataccaccggcggctatctggtctatgcggcg ctgaccttcaagtatgtcttcgactatcatccgggcgacatctattggtgcacc gcggacgtcggctgggtcaccggccattcgtatctgctgtatggcccgctgg cgtgcggcgcgaccaccctgatgttcgagggcgtcccgaactggccgacc ccggcgcgcatggcgcaggtcgtcgacaagcatcaggtcaacatcctgtat accgcgccgaccgcgatccgcgcgctgatggcggagggcgacaaggcg atcgagggcaccgaccgctcgtcgctgcgcatcctgggctcggtcggcga gccgatcaacccggaggcgtgggagtggtattggaagaagatcggcaacg agaagtgcccggtcgtcgacacctggtggcagaccgagaccggcggcttc atgatcaccccgctgccgggcgcgaccgagctgaaggcgggctcggcga cccgcccgttcttcggcgtccagccggcgctggtcgacaacgagggcaac ccgctggagggcgcgaccgagggctcgctggtcatcaccgactcgtggcc gggccaggcgcgcaccctgttcggcgaccatgagcgcttcgagcagacct atttctcgaccttcaagaacatgtatttctcgggcgacggcgcgcgccgcgac gaggacggctattattggatcaccggccgcgtcgacgacgtcctgaacgtct cgggccatcgcctgggcaccgcggagatcgagtcggcgctggtcgcgcat ccgaagatcgcggaggcggcggtcgtcggcatcccgcataacatcaaggg ccaggcgatctatgcgtatgtcaccctgaaccatggcgaggagccgtcgcc ggagctgtatgcggaggtccgcaactgggtccgcaaggagatcggcccgc tggcgaccccggacgtcctgcattggaccgactcgctgccgaagacccgct cgggcaagatcatgcgccgcatcctgcgcaagatcgcggcgggcgacacc tcgaacctgggcgacacctcgaccctggcggacccgggcgtcgtcgagaa gctgctggaggagaagcaggcgatcgcgatgccgtcgtgaTCATTCT

TGGAGGAGACACatgaagcgcgtcctgaccgcgctggcggcga ccctgccgttcgcggcgaacgcggcggacgcgatctcgggcgcggtcga gcgccagccgaccaactggcaggcgatcatcatgttcctgatcttcgtcgtct tcaccctgggcatcacctattgggcgtcgaagcgcgtccgctcgcgctcgga ctattataccgcgggcggcaacatcaccggcttccagaacggcctggcgat cgcgggcgactatatgtcggcggcgtcgttcctgggcatctcggcgctggtc ttcacctcgggctatgacggcctgatctattcgctgggcttcctggtcggctgg ccgatcatcctgttcctgatcgcggagcgcctgcgcaacctgggccgctata ccttcgcggacgtcgcgtcgtatcgcctgaagcagggcccgatccgcatcct gtcggcgtgcggctcgctggtcgtcgtcgcgctgtatctgatcgcgcagatg gtcggcgcgggcaagctgatcgagctgctgttcggcctgaactatcatatcg cggtcgtcctggtcggcgtcctgatgatgatgtatgtcctgttcggcggcatg ctggcgaccacctgggtccagatcatcaaggcggtcctgctgctgttcggcg cgtcgttcatggcgttcatggtcatgaagcatgtcggcttctcgttcaacaacct gttctcggaggcgatggcggtccatccgaagggcgtcgacatcatgaagcc gggcggcctggtcaaggacccgatctcggcgctgtcgctgggcctgggcct SEQ ID Gene Name Nucleotide Sequence

NO:#

gatgttcggcaccgcgggcctgccgcatatcctgatgcgcttcttcaccgtct cggacgcgcgcgaggcgcgcaagtcggtcttctatgcgaccggcttcatgg gctatttctatatcctgaccttcatcatcggcttcggcgcgatcatgctggtcgg cgcgaacccggagtataaggacgcggcgggccatctgatcggcggcaac aacatggcggcggtccatctggcgaacgcggtcggcggcaacctgttcctg ggcttcatctcggcggtcgcgttcgcgaccatcctggcggtcgtcgcgggcc tgaccctggcgggcgcgtcggcggtctcgcatgacctgtatgcgaacgtctt caagaagggcgcgaccgagcgcgaggagctgcgcgtctcgaagatcacc gtcctgatcctgggcgtcatcgcgatcatcctgggcgtcctgttcgagaacca gaacatcgcgttcatggtcggcctggcgttcgcgatcgcggcgtcgtgcaac ttcccgatcatcctgctgtcgatgtattggtcgaagctgaccacccgcggcgc gatgatgggcggctggctgggcctgatcaccgcggtcgtcctgatgatcctg ggcccgaccatctgggtccagatcctgggccatgagaaggcgatcttcccgt atgagtatccggcgctgttctcgatcaccgtcgcgttcctgggcatctggttctt ctcggcgaccgacaactcggcggagggcgcgcgcgagcgcgagctgttc cgcgcgcagttcatccgctcgcagaccggcttcggcgtcgagcagggccg cgcgcattga

The synthetic operon is then cloned and transformed as described above. Transformation is confirmed by resistance of the cells to antibiotic selection, and gene expression is confirmed by northern blot (to confirm RNA transcription), western blot, or ELISA methods (to confirm protein expression).

Growth on acetate as a sole carbon source. Recombinant M.

trichosporium transformed with a vector containing the synthetic operon encoding genes for acetate utilization are inoculated into lOOmL shake flasks containing 20- 50mL NMS media, 1% sodium acetate and 10 ug/mL kanamycin. The flasks are then shaken continuously while being incubated at 30°C. Growth is confirmed by monitoring optical density of the culture over time. Note that because acetate is the only carbon source provided to the cells, all cell mass produced must have been derived from acetate.

EXAMPLE 5 : RECOMBINANT METHYLOCOCCUS CAPSULATUS BATH ENGINEERED TO GROW ON ACETATE.

M. capsulatus cells are cultured and conjugated as described above.

Introduction of an Acetate Utilization Pathway. Nucleic acid sequences encoding AcsA (acetyl-CoA synthase) and ActP from E. coli were codon optimized for expression in M. trichosporium. The codon optimized nucleic acids encoding AcsA and ActP are synthesized as an operon (SEQ ID NO: l 19; see Table 18 for components of operon) under control of an mdh promoter with appropriate intergenic regions (CAPITALIZED sequence) incorporating ribosome binding sequences.

Table 18: Acetate Utilization Pathway Operon Codon Optimized for M. capsulatus

SEQ ID Gene Name Nucleotide Sequence

NO:#

cgtcgacgacgtcctgaacgtcagcggccaccgcctgggcaccgccga gatcgagagcgccctggtcgcccacccgaagatcgccgaggccgccgt cgtcggcatcccgcacaacatcaagggccaggccatctacgcctacgtc accctgaaccacggcgaggagccgagcccggagctgtacgccgaggt ccgcaactgggtccgcaaggagatcggcccgctggccaccccggacgt cctgcactggaccgacagcctgccgaagacccgcagcggcaagatcat gcgccgcatcctgcgcaagatcgccgccggcgacaccagcaacctggg cgacaccagcaccctggccgacccgggcgtcgtcgagaagc t gc t gg aggagaagcaggccatcgccatgccgagctga

SEQ ID Intergenic TCATTCTTGGAGGAGACAC

NO: 117 Region

SEQ ID ActP atgaagcgcgtcctgaccgccctggccgccaccctgccgttcgccgcca N0: 118 acgccgccgacgccatcagcggcgccgtcgagcgccagccgaccaac tggcaggccatcatcatgttcctgatcttcgtcgtcttcaccctgggcatcac ctactgggccagcaagcgcgtccgcagccgcagcgactactacaccgc cggcggcaacatcaccggcttccagaacggcctggccatcgccggcga ctacatgagcgccgccagcttcctgggcatcagcgccctggtcttcacca gcggctacgacggcctgatctacagcctgggcttcctggtcggctggcc gatcatcctgttcctgatcgccgagcgcctgcgcaacctgggccgctaca ccttcgccgacgtcgccagctaccgcctgaagcagggcccgatccgcat cctgagcgcctgcggcagcctggtcgtcgtcgccctgtacctgatcgccc agatggtcggcgccggcaagctgatcgagctgctgttcggcctgaactac cacatcgccgtcgtcctggtcggcgtcctgatgatgatgtacgtcctgttcg gcggcatgctggccaccacctgggtccagatcatcaaggccgtcctgct gctgttcggcgccagcttcatggccttcatggtcatgaagcacgtcggctt cagcttcaacaacctgttcagcgaggccatggccgtccacccgaagggc gtcgacatcatgaagccgggcggcctggtcaaggacccgatcagcgcc ctgagcctgggcctgggcctgatgttcggcaccgccggcctgccgcaca tcctgatgcgcttcttcaccgtcagcgacgcccgcgaggcccgcaagag cgtcttctacgccaccggcttcatgggctacttctacatcctgaccttcatca tcggcttcggcgccatcatgctggtcggcgccaacccggagtacaagga cgccgccggccacctgatcggcggcaacaacatggccgccgtccacct ggccaacgccgtcggcggcaacctgttcctgggcttcatcagcgccgtc gccttcgccaccatcctggccgtcgtcgccggcctgaccctggccggcg ccagcgccgtcagccacgacctgtacgccaacgtcttcaagaagggcgc caccgagcgcgaggagctgcgcgtcagcaagatcaccgtcctgatcctg ggcgtcatcgccatcatcctgggcgtcctgttcgagaaccagaacatcgc cttcatggtcggcctggccttcgccatcgccgccagctgcaacttcccgat catcctgctgagcatgtactggagcaagctgaccacccgcggcgccatg atgggcggctggctgggcctgatcaccgccgtcgtcctgatgatcctggg cccgaccatctgggtccagatcctgggccacgagaaggccatcttcccgt acgagtacccggccctgttcagcatcaccgtcgccttcctgggcatctggt tcttcagcgccaccgacaacagcgccgagggcgcccgcgagcgcgag ctgttccgcgcccagttcatccgcagccagaccggcttcggcgtcgag cagggccgcgcccactga

SEQ ID Acetate TTTGCCTCGATCGGCGGTCCTTGTGACAGGG SEQ ID Gene Name Nucleotide Sequence

NO:#

NO: 119 Utilization AGATATTCCCGACGGATCCGGGGCATTCGAG

Pathway CGGAACCGCCCGCCGTGGGAGTTTTTCCAGC

Operon GAGCATTCGAGAGTTTTTCAAGGCGGCTTCG

AGGGGTTATTCCGTAACGCCGCCGACATGAT

CTGTCCCAGAATCTCCGCCGCTGTTCGTAGA

GCGCCGATGCAGGGTCGGCATCAATCATTCT

TGGAGGAGACACatgagccagatccacaagcacaccatccc ggccaacatcgccgaccgctgcctgatcaacccgcagcagtacgaggc catgtaccagcagagcatcaacgtcccggacaccttctggggcgagcag ggcaagatcctggactggatcaagccgtaccagaaggtcaagaacacca gcttcgccccgggcaacgtcagcatcaagtggtacgaggacggcaccct gaacctggccgccaactgcctggaccgccacctgcaggagaacggcga ccgcaccgccatcatctgggagggcgacgacgccagccagagcaagc acatcagctacaaggagctgcaccgcgacgtctgccgcttcgccaacac cctgctggagctgggcatcaagaagggcgacgtcgtcgccatctacatg ccgatggtcccggaggccgccgtcgccatgctggcctgcgcccgcatc ggcgccgtccacagcgtcatcttcggcggcttcagcccggaggccgtcg ccggccgcatcatcgacagcaacagccgcctggtcatcaccagcgacg agggcgtccgcgccggccgcagcatcccgctgaagaagaacgtcgac gacgccctgaagaacccgaacgtcaccagcgtcgagcacgtcgtcgtcc tgaagcgcaccggcggcaagatcgactggcaggagggccgcgacctg tggtggcacgacctggtcgagcaggccagcgaccagcaccaggccga ggagatgaacgccgaggacccgctgttcatcctgtacaccagcggcagc accggcaagccgaagggcgtcctgcacaccaccggcggctacctggtc tacgccgccctgaccttcaagtacgtcttcgactaccacccgggcgacat ctactggtgcaccgccgacgtcggctgggtcaccggccacagctacctg ctgtacggcccgctggcctgcggcgccaccaccctgatgttcgagggcg tcccgaactggccgaccccggcccgcatggcccaggtcgtcgacaagc accaggtcaacatcctgtacaccgccccgaccgccatccgcgccctgat ggccgagggcgacaaggccatcgagggcaccgaccgcagcagcctg cgcatcctgggcagcgtcggcgagccgatcaacccggaggcctggga gtggtactggaagaagatcggcaacgagaagtgcccggtcgtcgacac ctggtggcagaccgagaccggcggcttcatgatcaccccgctgccggg cgccaccgagctgaaggccggcagcgccacccgcccgttcttcggcgt ccagccggccctggtcgacaacgagggcaacccgctggagggcgcca ccgagggcagcctggtcatcaccgacagctggccgggccaggcccgc accctgttcggcgaccacgagcgcttcgagcagacctacttcagcacctt caagaacatgtacttcagcggcgacggcgcccgccgcgacgaggacg gctactactggatcaccggccgcgtcgacgacgtcctgaacgtcagcgg ccaccgcctgggcaccgccgagatcgagagcgccctggtcgcccaccc gaagatcgccgaggccgccgtcgtcggcatcccgcacaacatcaaggg ccaggccatctacgcctacgtcaccctgaaccacggcgaggagccgag cccggagctgtacgccgaggtccgcaactgggtccgcaaggagatcgg cccgctggccaccccggacgtcctgcactggaccgacagcctgccgaa gacccgcagcggcaagatcatgcgccgcatcctgcgcaagatcgccgc SEQ ID Gene Name Nucleotide Sequence

NO:#

cggcgacaccagcaacctgggcgacaccagcaccctggccgacccgg gcgtcgtcgagaagctgctggaggagaagcaggccatcgccatgccga gctgaTCATTCTTGGAGGAGACACatgaagcgcgtcct gaccgccctggccgccaccctgccgttcgccgccaacgccgccgacgc catcagcggcgccgtcgagcgccagccgaccaactggcaggccatcat catgttcctgatcttcgtcgtcttcaccctgggcatcacctactgggccagc aagcgcgtccgcagccgcagcgactactacaccgccggcggcaacatc accggcttccagaacggcctggccatcgccggcgactacatgagcgcc gccagcttcctgggcatcagcgccctggtcttcaccagcggctacgacg gcctgatctacagcctgggcttcctggtcggctggccgatcatcctgttcct gatcgccgagcgcctgcgcaacctgggccgctacaccttcgccgacgtc gccagctaccgcctgaagcagggcccgatccgcatcctgagcgcctgc ggcagcctggtcgtcgtcgccctgtacctgatcgcccagatggtcggcgc cggcaagctgatcgagctgctgttcggcctgaactaccacatcgccgtcg tcctggtcggcgtcctgatgatgatgtacgtcctgttcggcggcatgctgg ccaccacctgggtccagatcatcaaggccgtcctgctgctgttcggcgcc agcttcatggccttcatggtcatgaagcacgtcggcttcagcttcaacaac ctgttcagcgaggccatggccgtccacccgaagggcgtcgacatcatga agccgggcggcctggtcaaggacccgatcagcgccctgagcctgggcc tgggcctgatgttcggcaccgccggcctgccgcacatcctgatgcgcttct tcaccgtcagcgacgcccgcgaggcccgcaagagcgtcttctacgccac cggcttcatgggctacttctacatcctgaccttcatcatcggcttcggcgcc atcatgctggtcggcgccaacccggagtacaaggacgccgccggccac ctgatcggcggcaacaacatggccgccgtccacctggccaacgccgtc ggcggcaacctgttcctgggcttcatcagcgccgtcgccttcgccaccat cctggccgtcgtcgccggcctgaccctggccggcgccagcgccgtcag ccacgacctgtacgccaacgtcttcaagaagggcgccaccgagcgcga ggagctgcgcgtcagcaagatcaccgtcctgatcctgggcgtcatcgcc atcatcctgggcgtcctgttcgagaaccagaacatcgccttcatggtcggc ctggccttcgccatcgccgccagctgcaacttcccgatcatcctgctgagc atgtactggagcaagctgaccacccgcggcgccatgatgggcggctgg ctgggcctgatcaccgccgtcgtcctgatgatcctgggcccgaccatctg ggtccagatcctgggccacgagaaggccatcttcccgtacgagtacccg gccctgttcagcatcaccgtcgccttcctgggcatctggttcttcagcgcca ccgacaacagcgccgagggcgcccgcgagcgcgagctgttccgcgcc cagttcatccgcagccagaccggct t cggcgt cgagcagggccgcg cccactga

The synthetic operon is then cloned and transformed as described above. Transformation is confirmed by resistance of the cells to antibiotic selection, and gene expression is confirmed by northern blot (to confirm RNA transcription), western blot, or ELISA methods (to confirm protein expression).

Growth on acetate as a sole carbon source. Recombinant M.

capsulatus transformed with a vector containing the synthetic operon encoding genes for acetate utilization are inoculated into lOOmL shake flasks containing 20-50mL NMS media, 1% sodium acetate and 10 ug/mL kanamycin. The flasks are then shaken continuously while being incubated at 42°C. Growth is confirmed by monitoring optical density of the culture over time. Note that because acetate is the only carbon source provided to the cells, all cell mass produced must have been derived from acetate.

EXAMPLE 6: RECOMBINANT METHYLOMONAS METHANICA ENGINEERED TO GROW ON ACETATE

Growth and Conjugations. The procedure for growth and conjugation of Methylomonas methanica was performed essentially identically to the procedures described for M. capsulatus (above).

Introduction of Acetate Utilization Pathway. Nucleic acid sequences encoding AcsA (acetyl-CoA synthase) and ActP from E. coli were codon optimized for expression in M. methanica. The codon optimized nucleic acids encoding AcsA and ActP are synthesized as an operon (SEQ ID NO: 124; see Table 19 for components of operon) under control of an hps promoter with appropriate intergenic regions

(CAPITALIZED sequence) incorporating ribosome binding sequences.

Table 19: Acetate Utilization Pathway Operon Codon Optimized for M. methanica.

SEQ ID Gene Nucleotide Sequence

NO:# Name

atggcaccttgaacttggcggcgaactgcttggatcgccacttgcaagaaaacg gcgatcgcaccgcgatcatctgggaaggcgatgatgcgagccaaagcaaaca catcagctataaagaattgcaccgcgatgtctgccgcttcgcgaacaccttgttg gaattgggcatcaaaaaaggcgatgtcgtcgcgatctatatgccgatggtcccg gaagcggcggtcgcgatgttggcgtgcgcgcgcatcggcgcggtccacagc gtcatcttcggcggcttcagcccggaagcggtcgcgggccgcatcatcgatag caacagccgcttggtcatcaccagcgatgaaggcgtccgcgcgggccgcagc atcccgttgaaaaaaaacgtcgatgatgcgttgaaaaacccgaacgtcaccagc gtcgaacacgtcgtcgtcttgaaacgcaccggcggcaaaatcgattggcaaga aggccgcgatttgtggtggcacgatttggtcgaacaagcgagcgatcaacacc aagcggaagaaatgaacgcggaagatccgttgttcatcttgtataccagcggca gcaccggcaaaccgaaaggcgtcttgcacaccaccggcggctatttggtctat gcggcgttgaccttcaaatatgtcttcgattatcacccgggcgatatctattggtgc accgcggatgtcggctgggtcaccggccacagctatttgttgtatggcccgttg gcgtgcggcgcgaccaccttgatgttcgaaggcgtcccgaactggccgaccc cggcgcgcatggcgcaagtcgtcgataaacaccaagtcaacatcttgtataccg cgccgaccgcgatccgcgcgttgatggcggaaggcgataaagcgatcgaag gcaccgatcgcagcagcttgcgcatcttgggcagcgtcggcgaaccgatcaac ccggaagcgtgggaatggtattggaaaaaaatcggcaacgaaaaatgcccgg tcgtcgatacctggtggcaaaccgaaaccggcggcttcatgatcaccccgttgc cgggcgcgaccgaattgaaagcgggcagcgcgacccgcccgttcttcggcgt ccaaccggcgttggtcgataacgaaggcaacccgttggaaggcgcgaccgaa ggcagcttggtcatcaccgatagctggccgggccaagcgcgcaccttgttcgg cgatcacgaacgcttcgaacaaacctatttcagcaccttcaaaaacatgtatttca gcggcgatggcgcgcgccgcgatgaagatggctattattggatcaccggccg cgtcgatgatgtcttgaacgtcagcggccaccgcttgggcaccgcggaaatcg aaagcgcgttggtcgcgcacccgaaaatcgcggaagcggcggtcgtcggcat cccgcacaacatcaaaggccaagcgatctatgcgtatgtcaccttgaaccacgg cgaagaaccgagcccggaattgtatgcggaagtccgcaactgggtccgcaaa gaaatcggcccgttggcgaccccggatgtcttgcactggaccgatagcttgccg aaaacccgcagcggcaaaatcatgcgccgcatcttgcgcaaaatcgcggcgg gcgataccagcaacttgggcgataccagcaccttggcggatccgggcgtcgtc gaaaaattgttggaagaaaaacaagcgatcgcgatgccgagctaa

SEQ ID Intergenic TAATAATTAAAAACTTTCGGAGCACATCAC NO: 122 Region

SEQ ID ActP atgaaacgcgtcttgaccgcgttggcggcgaccttgccgttcgcggcgaacgc NO: 123 ggcggatgcgatcagcggcgcggtcgaacgccaaccgaccaactggcaagc gatcatcatgttcttgatcttcgtcgtcttcaccttgggcatcacctattgggcgag caaacgcgtccgcagccgcagcgattattataccgcgggcggcaacatcacc ggcttccaaaacggcttggcgatcgcgggcgattatatgagcgcggcgagctt cttgggcatcagcgcgttggtcttcaccagcggctatgatggcttgatctatagct tgggcttcttggtcggctggccgatcatcttgttcttgatcgcggaacgcttgcgc aacttgggccgctataccttcgcggatgtcgcgagctatcgcttgaaacaaggc ccgatccgcatcttgagcgcgtgcggcagcttggtcgtcgtcgcgttgtatttgat cgcgcaaatggtcggcgcgggcaaattgatcgaattgttgttcggcttgaactat SEQ ID Gene Nucleotide Sequence

NO:# Name

cacatcgcggtcgtcttggtcggcgtcttgatgatgatgtatgtcttgttcggcgg catgttggcgaccacctgggtccaaatcatcaaagcggtcttgttgttgttcggcg cgagcttcatggcgttcatggtcatgaaacacgtcggcttcagcttcaacaacttg ttcagcgaagcgatggcggtccacccgaaaggcgtcgatatcatgaaaccgg gcggcttggtcaaagatccgatcagcgcgttgagcttgggcttgggcttgatgtt cggcaccgcgggcttgccgcacatcttgatgcgcttcttcaccgtcagcgatgc gcgcgaagcgcgcaaaagcgtcttctatgcgaccggcttcatgggctatttctat atcttgaccttcatcatcggcttcggcgcgatcatgttggtcggcgcgaacccgg aatataaagatgcggcgggccacttgatcggcggcaacaacatggcggcggt ccacttggcgaacgcggtcggcggcaacttgttcttgggcttcatcagcgcggt cgcgttcgcgaccatcttggcggtcgtcgcgggcttgaccttggcgggcgcga gcgcggtcagccacgatttgtatgcgaacgtcttcaaaaaaggcgcgaccgaa cgcgaagaattgcgcgtcagcaaaatcaccgtcttgatcttgggcgtcatcgcg atcatcttgggcgtcttgttcgaaaaccaaaacatcgcgttcatggtcggcttggc gttcgcgatcgcggcgagctgcaacttcccgatcatcttgttgagcatgtattgga gcaaattgaccacccgcggcgcgatgatgggcggctggttgggcttgatcacc gcggtcgtcttgatgatcttgggcccgaccatctgggtccaaatcttgggccacg aaaaagcgatcttcccgtatgaatatccggcgttgttcagcatcaccgtcgcgttc ttgggcatctggttcttcagcgcgaccgataacagcgcggaaggcgcgcgcga acgcgaattgttccgcgcgcaattcatccgcagccaaaccggcttcggcgtcga acaaggccgcgcgcactaa

SEQ ID Acetate TTCGGAATCCCTGACGGGAATTGGCCCGAAGAA NO: 124 Utilization GGCAGATGCCATCGTTCAGTATCGAAAGGAACA

Pathway TGGGGATTTTCAGTCATTGAAGGATCTGGAGAA Operon TGTCAGCGGCATTGGCGAGAAAACCCTTCAGGC

CAATGAAAAAGACATTCGCTTCACGGATGATTT

GAGCGATAAGTCATCCGCGGAAAAAGGTGCGGT

AGCTGTGGATAAAAAAGGCGCCAGATAGTAAGC

GCTAAGGATTGGGGTGCGTCGCCGGTCGCGGCG

GCGCTCCTCGACGGCAGAGTTGGTGCCAGGTTG

GCGGATGATTGATGCCGAATATTACGCGACCAA

TTCTCGAGGCAAATGAACTGTGAGCTACTGAGT

TGCAGGCATTGACAGCCATCCCATTTCTATCATA

CAGTTACGGACGCATCACGAGTAGGTGATAAGC

CTAGCAGATTGCGGCAGTTGGCAAAATCAGCTA

TTACTAATAATTAAAAACTTTCGGAGCACATCAC

atgagccaaatccacaaacacaccatcccggcgaacatcgcggatcgctgctt gatcaacccgcaacaatatgaagcgatgtatcaacaaagcatcaacgtcccgg ataccttctggggcgaacaaggcaaaatcttggattggatcaaaccgtatcaaaa agtcaaaaacaccagcttcgcgccgggcaacgtcagcatcaaatggtatgaag atggcaccttgaacttggcggcgaactgcttggatcgccacttgcaagaaaacg gcgatcgcaccgcgatcatctgggaaggcgatgatgcgagccaaagcaaaca catcagctataaagaattgcaccgcgatgtctgccgcttcgcgaacaccttgttg gaattgggcatcaaaaaaggcgatgtcgtcgcgatctatatgccgatggtcccg gaagcggcggtcgcgatgttggcgtgcgcgcgcatcggcgcggtccacagc SEQ ID Gene Nucleotide Sequence

NO:# Name

gtcatcttcggcggcttcagcccggaagcggtcgcgggccgcatcatcgatag caacagccgcttggtcatcaccagcgatgaaggcgtccgcgcgggccgcagc atcccgttgaaaaaaaacgtcgatgatgcgttgaaaaacccgaacgtcaccagc gtcgaacacgtcgtcgtcttgaaacgcaccggcggcaaaatcgattggcaaga aggccgcgatttgtggtggcacgatttggtcgaacaagcgagcgatcaacacc aagcggaagaaatgaacgcggaagatccgttgttcatcttgtataccagcggca gcaccggcaaaccgaaaggcgtcttgcacaccaccggcggctatttggtctat gcggcgttgaccttcaaatatgtcttcgattatcacccgggcgatatctattggtgc accgcggatgtcggctgggtcaccggccacagctatttgttgtatggcccgttg gcgtgcggcgcgaccaccttgatgttcgaaggcgtcccgaactggccgaccc cggcgcgcatggcgcaagtcgtcgataaacaccaagtcaacatcttgtataccg cgccgaccgcgatccgcgcgttgatggcggaaggcgataaagcgatcgaag gcaccgatcgcagcagcttgcgcatcttgggcagcgtcggcgaaccgatcaac ccggaagcgtgggaatggtattggaaaaaaatcggcaacgaaaaatgcccgg tcgtcgatacctggtggcaaaccgaaaccggcggcttcatgatcaccccgttgc cgggcgcgaccgaattgaaagcgggcagcgcgacccgcccgttcttcggcgt ccaaccggcgttggtcgataacgaaggcaacccgttggaaggcgcgaccgaa ggcagcttggtcatcaccgatagctggccgggccaagcgcgcaccttgttcgg cgatcacgaacgcttcgaacaaacctatttcagcaccttcaaaaacatgtatttca gcggcgatggcgcgcgccgcgatgaagatggctattattggatcaccggccg cgtcgatgatgtcttgaacgtcagcggccaccgcttgggcaccgcggaaatcg aaagcgcgttggtcgcgcacccgaaaatcgcggaagcggcggtcgtcggcat cccgcacaacatcaaaggccaagcgatctatgcgtatgtcaccttgaaccacgg cgaagaaccgagcccggaattgtatgcggaagtccgcaactgggtccgcaaa gaaatcggcccgttggcgaccccggatgtcttgcactggaccgatagcttgccg aaaacccgcagcggcaaaatcatgcgccgcatcttgcgcaaaatcgcggcgg gcgataccagcaacttgggcgataccagcaccttggcggatccgggcgtcgtc gaaaaattgttggaagaaaaacaagcgatcgcgatgccgagctaaTAATA

ATTAAAAACTTTCGGAGCACATCACatgaaacgcgtctt gaccgcgttggcggcgaccttgccgttcgcggcgaacgcggcggatgcgatc agcggcgcggtcgaacgccaaccgaccaactggcaagcgatcatcatgttctt gatcttcgtcgtcttcaccttgggcatcacctattgggcgagcaaacgcgtccgc agccgcagcgattattataccgcgggcggcaacatcaccggcttccaaaacgg cttggcgatcgcgggcgattatatgagcgcggcgagcttcttgggcatcagcgc gttggtcttcaccagcggctatgatggcttgatctatagcttgggcttcttggtcgg ctggccgatcatcttgttcttgatcgcggaacgcttgcgcaacttgggccgctata ccttcgcggatgtcgcgagctatcgcttgaaacaaggcccgatccgcatcttga gcgcgtgcggcagcttggtcgtcgtcgcgttgtatttgatcgcgcaaatggtcg gcgcgggcaaattgatcgaattgttgttcggcttgaactatcacatcgcggtcgtc ttggtcggcgtcttgatgatgatgtatgtcttgttcggcggcatgttggcgaccac ctgggtccaaatcatcaaagcggtcttgttgttgttcggcgcgagcttcatggcgt tcatggtcatgaaacacgtcggcttcagcttcaacaacttgttcagcgaagcgat ggcggtccacccgaaaggcgtcgatatcatgaaaccgggcggcttggtcaaa gatccgatcagcgcgttgagcttgggcttgggcttgatgttcggcaccgcgggc ttgccgcacatcttgatgcgcttcttcaccgtcagcgatgcgcgcgaagcgcgc SEQ ID Gene Nucleotide Sequence

NO:# Name

aaaagcgtcttctatgcgaccggcttcatgggctatttctatatcttgaccttcatca tcggcttcggcgcgatcatgttggtcggcgcgaacccggaatataaagatgcg gcgggccacttgatcggcggcaacaacatggcggcggtccacttggcgaacg cggtcggcggcaacttgttcttgggcttcatcagcgcggtcgcgttcgcgaccat cttggcggtcgtcgcgggcttgaccttggcgggcgcgagcgcggtcagccac gatttgtatgcgaacgtcttcaaaaaaggcgcgaccgaacgcgaagaattgcg cgtcagcaaaatcaccgtcttgatcttgggcgtcatcgcgatcatcttgggcgtct tgttcgaaaaccaaaacatcgcgttcatggtcggcttggcgttcgcgatcgcgg cgagctgcaacttcccgatcatcttgttgagcatgtattggagcaaattgaccacc cgcggcgcgatgatgggcggctggttgggcttgatcaccgcggtcgtcttgat gatcttgggcccgaccatctgggtccaaatcttgggccacgaaaaagcgatctt cccgtatgaatatccggcgttgttcagcatcaccgtcgcgttcttgggcatctggt tcttcagcgcgaccgataacagcgcggaaggcgcgcgcgaacgcgaattgtt ccgcgcgcaattcatccgcagccaaaccggcttcggcgtcgaacaaggccgc gcgcactaa

The synthetic operon is then cloned and transformed into M. methanica as described above. Transformation is confirmed by resistance of the cells to antibiotic selection, and gene expression is confirmed by northern blot (to confirm RNA transcription), western blot, or ELISA methods (to confirm protein expression).

Growth on acetate as a sole carbon source. Recombinant M.

methanica transformed with a vector containing the synthetic operon encoding genes for acetate utilization are inoculated into lOOmL shake flasks containing 20-50mL NMS media, 1% sodium acetate and 10 ug/mL kanamycin. The flasks are then shaken continuously while being incubated at 30°C. Growth is confirmed by monitoring optical density of the culture over time. Note that because acetate is the only carbon source provided to the cells, all cell mass produced must have been derived from acetate.

EXAMPLE 7 : RECOMBINANT METHYLOSINUS TRICHOSPORIUM ENGINEERED TO GROW ON LACTATE

M. trichosporium cells are cultured and conjugated as described above.

Introduction of Lactate Utilization Pathway. Nucleic acid sequences encoding lactate dehydrogenase D (LdhD) and a lactate permease (LctP) from E.coli were codon optimized for expression in M. trichosporium. The codon optimized nucleic acids encoding LdhD and LctP are synthesized as an operon (SEQ ID NO: 129; see Table 20 for components of operon) under control of an mdh promoter with appropriate intergenic regions (CAPITALIZED sequence) incorporating ribosome binding sequences. Table 20: Lactate Utilization Pathway Operon Codon Optimized for trichosporium

SEQ ID Gene Name Nucleotide Sequence

NO:#

cccggagctgccggacatcatctcgtcgctggtctcgctgctg tgcctgaccctgttcctgaagcgctggcagccggtccgcgtcttccgcttcgg cgacctgggcgcgtcgcaggtcgacatgaccctggcgcataccggctatac cgcgggccaggtcctgcgcgcgtggaccccgttcctgttcctgaccgcgacc gtcaccctgtggtcgatcccgccgttcaaggcgctgttcgcgtcgggcggcg cgctgtatgagtgggtcatcaacatcccggtcccgtatctggacaagctggtc gcgcgcatgccgccggtcgtctcggaggcgaccgcgtatgcggcggtcttc aagttcgactggttctcggcgaccggcaccgcgatcctgttcgcggcgctgct gtcgatcgtctggctgaagatgaagccgtcggacgcgatctcgaccttcggct cgaccctgaaggagctggcgctgccgatctattcgatcggcatggtcctggc gttcgcgttcatctcgaactattcgggcctgtcgtcgaccctggcgctggcgct ggcgcataccggccatgcgttcaccttcttctcgccgttcctgggctggctggg cgtcttcctgaccggctcggacacctcgtcgaacgcgctgttcgcggcgctgc aggcgaccgcggcgcagcagatcggcgtctcggacctgctgctggtcgcg gcgaacaccaccggcggcgtcaccggcaagatgatctcgccgcagtcgatc gcgatcgcgtgcgcggcggtcggcctggtcggcaaggagtcggacctgttc cgcttcaccgtcaagcattcgctgatcttcacctgcatcgtcggcgtcatcacca

CCctgcaggcgtatgtcctgacctggatgatcccgtga

SEQ ID Lactate TTTGCCTCGATCGGCGGTCCTTGTGACAGGGAG NO: 129 Utilization ATATTCCCGACGGATCCGGGGCATTCGAGCGG

Pathway AACCGCCCGCCGTGGGAGTTTTTCCAGCGAGCA Operon TTCGAGAGTTTTTCAAGGCGGCTTCGAGGGGTT

ATTCCGTAACGCCGCCGACATGATCTGTCCCAG

AATCTCCGCCGCTGTTCGTAGAGCGCCGATGCA

GGGTCGGCATCAATCATTCTTGGAGGAGACACa tgaagctggcggtctattcgaccaagcagtatgacaagaagtatctgcagcag gtcaacgagtcgttcggcttcgagctggagttcttcgacttcctgctgaccgag aagaccgcgaagaccgcgaacggctgcgaggcggtctgcatcttcgtcaac gacgacggctcgcgcccggtcctggaggagctgaagaagcatggcgtcaa gtatatcgcgctgcgctgcgcgggcttcaacaacgtcgacctggacgcggcg aaggagctgggcctgaaggtcgtccgcgtcccggcgtatgacccggaggc ggtcgcggagcatgcgatcggcatgatgatgaccctgaaccgccgcatccat cgcgcgtatcagcgcacccgcgacgcgaacttctcgctggagggcctgacc ggcttcaccatgtatggcaagaccgcgggcgtcatcggcaccggcaagatc ggcgtcgcgatgctgcgcatcctgaagggcttcggcatgcgcctgctggcgt tcgacccgtatccgtcggcggcggcgctggagctgggcgtcgagtatgtcga cctgccgaccctgttctcggagtcggacgtcatctcgctgcattgcccgctgac cccggagaactatcatctgctgaacgaggcggcgttcgagcagatgaagaa cggcgtcatgatcgtcaacacctcgcgcggcgcgctgatcgactcgcaggc ggcgatcgaggcgctgaagaaccagaagatcggctcgctgggcatggacgt ctatgagaacgagcgcgacctgttcttcgaggacaagtcgaacgacgtcatcc aggacgacgtcttccgccgcctgtcggcgtgccataacgtcctgttcaccggc catcaggcgttcctgaccgcggaggcgctgacctcgatctcgcagaccaccc tgcagaacctgtcgaacctggagaagggcgagacctgcccgaacgagctg gtctgaTCATTCTTGGAGGAGACACatgaacctgtggcagc SEQ ID Gene Name Nucleotide Sequence

NO:#

agaactatgacccggcgggcaacatctggctgtcgtcgctgatcgcgtcgct gccgatcctgttcttcttcttcgcgctgatcaagctgaagctgaagggctatgtc gcggcgtcgtggaccgtcgcgatcgcgctggcggtcgcgctgctgttctata agatgccggtcgcgaacgcgctggcgtcggtcgtctatggcttcttctatggc ctgtggccgatcgcgtggatcatcatcgcggcggtcttcgtctataagatctcg gtcaagaccggccagttcgacatcatccgctcgtcgatcctgtcgatcacccc ggaccagcgcctgcagatgctgatcgtcggcttctgcttcggcgcgttcctgg agggcgcggcgggcttcggcgcgccggtcgcgatcaccgcggcgctgctg gtcggcctgggcttcaagccgctgtatgcggcgggcctgtgcctgatcgtcaa caccgcgccggtcgcgttcggcgcgatgggcatcccgatcctggtcgcggg ccaggtcaccggcatcgactcgttcgagatcggccagatggtcggccgcca gctgccgttcatgaccatcatcgtcctgttctggatcatggcgatcatggacgg ctggcgcggcatcaaggagacctggccggcggtcgtcgtcgcgggcggct cgttcgcgatcgcgcagtatctgtcgtcgaacttcatcggcccggagctgccg gacatcatctcgtcgctggtctcgctgctgtgcctgaccctgttcctgaagcgct ggcagccggtccgcgtcttccgcttcggcgacctgggcgcgtcgcaggtcg acatgaccctggcgcataccggctataccgcgggccaggtcctgcgcgcgt ggaccccgttcctgttcctgaccgcgaccgtcaccctgtggtcgatcccgccg ttcaaggcgctgttcgcgtcgggcggcgcgctgtatgagtgggtcatcaacat cccggtcccgtatctggacaagctggtcgcgcgcatgccgccggtcgtctcg gaggcgaccgcgtatgcggcggtcttcaagttcgactggttctcggcgaccg gcaccgcgatcctgttcgcggcgctgctgtcgatcgtctggctgaagatgaag ccgtcggacgcgatctcgaccttcggctcgaccctgaaggagctggcgctgc cgatctattcgatcggcatggtcctggcgttcgcgttcatctcgaactattcggg cctgtcgtcgaccctggcgctggcgctggcgcataccggccatgcgttcacct tcttctcgccgttcctgggctggctgggcgtcttcctgaccggctcggacacct cgtcgaacgcgctgttcgcggcgctgcaggcgaccgcggcgcagcagatc ggcgtctcggacctgctgctggtcgcggcgaacaccaccggcggcgtcacc ggcaagatgatctcgccgcagtcgatcgcgatcgcgtgcgcggcggtcggc ctggtcggcaaggagtcggacctgttccgcttcaccgtcaagcattcgctgat cttcacctgcatcgtcggcgtcatcaccaccc t gc aggcgt a t gt c c t ga cctggatgatcccgtga

The synthetic operon is then cloned and transformed as described above. Transformation is confirmed by resistance of the cells to antibiotic selection, and gene expression is confirmed by northern blot (to confirm RNA transcription), western blot, or ELISA methods (to confirm protein expression).

Growth on lactate as a sole carbon source. Recombinant M.

trichosporium transformed with a vector containing the synthetic operon encoding genes for lactate utilization are inoculated into lOOmL shake flasks containing 20-5 OmL NMS media, 1% sodium lactate and 10 ug/mL kanamycin. The flasks are then shaken continuously while being incubated at 30°C. Growth is confirmed by monitoring optical density of the culture over time. Note that because lactate is the only carbon source provided to the cells, all cell mass produced must have been derived from lactate.

EXAMPLE 8 : RECOMBINANT METHYLOCOCCUS CAPSULATUS BATH ENGINEERED TO GROW ON LACTATE.

M. capsulatus cells are cultured and conjugated as described above.

Introduction of Lactate Utilization Pathway. Nucleic acid sequences encoding lactate dehydrogenase D (LdhD) and a lactate permease (LctP) from E. coli were codon optimized for expression in M. trichosporium. The codon optimized nucleic acids encoding LdhD and LctP are synthesized as an operon (SEQ ID NO: 134; see Table 21 for components of operon) under control of an mdh promoter with appropriate intergenic regions (CAPITALIZED sequence) incorporating ribosome binding sequences.

Table 21 : Lactate Utilization Pathway Operon Codon Optimized for capsulatus

SEQ ID Gene Nucleotide Sequence

NO:# Name

agctggtctga

SEQ ID Intergenic TCATTCTTGGAGGAGACAC

NO: 132 Region

SEQ ID LctP atgaacctgtggcagcagaactacgacccggccggcaacatctggctgagc NO: 133 agcctgatcgccagcctgccgatcctgttcttcttcttcgccctgatcaagctga agctgaagggctacgtcgccgccagctggaccgtcgccatcgccctggccgt cgccctgctgttctacaagatgccggtcgccaacgccctggccagcgtcgtct acggcttcttctacggcctgtggccgatcgcctggatcatcatcgccgccgtctt cgtctacaagatcagcgtcaagaccggccagttcgacatcatccgcagcagc atcctgagcatcaccccggaccagcgcctgcagatgctgatcgtcggcttctg cttcggcgccttcctggagggcgccgccggcttcggcgccccggtcgccatc accgccgccctgctggtcggcctgggcttcaagccgctgtacgccgccggcc tgtgcctgatcgtcaacaccgccccggtcgccttcggcgccatgggcatcccg atcctggtcgccggccaggtcaccggcatcgacagcttcgagatcggccaga tggtcggccgccagctgccgttcatgaccatcatcgtcctgttctggatcatgg ccatcatggacggctggcgcggcatcaaggagacctggccggccgtcgtcg tcgccggcggcagcttcgccatcgcccagtacctgagcagcaacttcatcgg cccggagctgccggacatcatcagcagcctggtcagcctgctgtgcctgacc ctgttcctgaagcgctggcagccggtccgcgtcttccgcttcggcgacctggg cgccagccaggtcgacatgaccctggcccacaccggctacaccgccggcca ggtcctgcgcgcctggaccccgttcctgttcctgaccgccaccgtcaccctgt ggagcatcccgccgttcaaggccctgttcgccagcggcggcgccctgtacga gtgggtcatcaacatcccggtcccgtacctggacaagctggtcgcccgcatgc cgccggtcgtcagcgaggccaccgcctacgccgccgtcttcaagttcgactg gttcagcgccaccggcaccgccatcctgttcgccgccctgctgagcatcgtct ggctgaagatgaagccgagcgacgccatcagcaccttcggcagcaccctga aggagctggccctgccgatctacagcatcggcatggtcctggccttcgccttc atcagcaactacagcggcctgagcagcaccctggccctggccctggcccac accggccacgccttcaccttcttcagcccgttcctgggctggctgggcgtcttc ctgaccggcagcgacaccagcagcaacgccctgttcgccgccctgcaggcc accgccgcccagcagatcggcgtcagcgacctgctgctggtcgccgccaac accaccggcggcgtcaccggcaagatgatcagcccgcagagcatcgccatc gcctgcgccgccgtcggcctggtcggcaaggagagcgacctgttccgcttca ccgtcaagcacagcctgatcttcacctgcatcgtcggcgtcatcaccaccctgc aggcctacgtcctgacctggatgatcccgtga

SEQ ID Lactate TTTGCCTCGATCGGCGGTCCTTGTGACAGGGAG NO: 134 Utilization ATATTCCCGACGGATCCGGGGCATTCGAGCGGA

Pathway ACCGCCCGCCGTGGGAGTTTTTCCAGCGAGCAT Operon TCGAGAGTTTTTCAAGGCGGCTTCGAGGGGTTA

TTCCGTAACGCCGCCGACATGATCTGTCCCAGA

ATCTCCGCCGCTGTTCGTAGAGCGCCGATGCAG

GGTCGGCATCAATCATTCTTGGAGGAGACACatg aagctggccgtctacagcaccaagcagtacgacaagaagtacctgcagcag gtcaacgagagcttcggcttcgagctggagttcttcgacttcctgctgaccgag aagaccgccaagaccgccaacggctgcgaggccgtctgcatcttcgtcaacg SEQ ID Gene Nucleotide Sequence

NO:# Name

acgacggcagccgcccggtcctggaggagctgaagaagcacggcgtcaag tacatcgccctgcgctgcgccggcttcaacaacgtcgacctggacgccgcca aggagctgggcctgaaggtcgtccgcgtcccggcctacgacccggaggcc gtcgccgagcacgccatcggcatgatgatgaccctgaaccgccgcatccacc gcgcctaccagcgcacccgcgacgccaacttcagcctggagggcctgaccg gcttcaccatgtacggcaagaccgccggcgtcatcggcaccggcaagatcg gcgtcgccatgctgcgcatcctgaagggcttcggcatgcgcctgctggccttc gacccgtacccgagcgccgccgccctggagctgggcgtcgagtacgtcgac ctgccgaccctgttcagcgagagcgacgtcatcagcctgcactgcccgctga ccccggagaactaccacctgctgaacgaggccgccttcgagcagatgaaga acggcgtcatgatcgtcaacaccagccgcggcgccctgatcgacagccagg ccgccatcgaggccctgaagaaccagaagatcggcagcctgggcatggac gtctacgagaacgagcgcgacctgttcttcgaggacaagagcaacgacgtca tccaggacgacgtcttccgccgcctgagcgcctgccacaacgtcctgttcacc ggccaccaggccttcctgaccgccgaggccctgaccagcatcagccagacc accctgcagaacctgagcaacctggagaagggcgagacctgcccgaacga gctggtctgaTCATTCTTGGAGGAGACACatgaacctgtggc agcagaactacgacccggccggcaacatctggctgagcagcctgatcgcca gcctgccgatcctgttcttcttcttcgccctgatcaagctgaagctgaagggcta cgtcgccgccagctggaccgtcgccatcgccctggccgtcgccctgctgttct acaagatgccggtcgccaacgccctggccagcgtcgtctacggcttcttctac ggcctgtggccgatcgcctggatcatcatcgccgccgtcttcgtctacaagatc agcgtcaagaccggccagttcgacatcatccgcagcagcatcctgagcatca ccccggaccagcgcctgcagatgctgatcgtcggcttctgcttcggcgccttc ctggagggcgccgccggcttcggcgccccggtcgccatcaccgccgccctg ctggtcggcctgggcttcaagccgctgtacgccgccggcctgtgcctgatcgt caacaccgccccggtcgccttcggcgccatgggcatcccgatcctggtcgcc ggccaggtcaccggcatcgacagcttcgagatcggccagatggtcggccgc cagctgccgttcatgaccatcatcgtcctgttctggatcatggccatcatggacg gctggcgcggcatcaaggagacctggccggccgtcgtcgtcgccggcggc agcttcgccatcgcccagtacctgagcagcaacttcatcggcccggagctgc cggacatcatcagcagcctggtcagcctgctgtgcctgaccctgttcctgaag cgctggcagccggtccgcgtcttccgcttcggcgacctgggcgccagccag gtcgacatgaccctggcccacaccggctacaccgccggccaggtcctgcgc gcctggaccccgttcctgttcctgaccgccaccgtcaccctgtggagcatccc gccgttcaaggccctgttcgccagcggcggcgccctgtacgagtgggtcatc aacatcccggtcccgtacctggacaagctggtcgcccgcatgccgccggtcg tcagcgaggccaccgcctacgccgccgtcttcaagttcgactggttcagcgcc accggcaccgccatcctgttcgccgccctgctgagcatcgtctggctgaagat gaagccgagcgacgccatcagcaccttcggcagcaccctgaaggagctggc cctgccgatctacagcatcggcatggtcctggccttcgccttcatcagcaacta cagcggcctgagcagcaccctggccctggccctggcccacaccggccacg ccttcaccttcttcagcccgttcctgggctggctgggcgtcttcctgaccggca gcgacaccagcagcaacgccctgttcgccgccctgcaggccaccgccgccc agcagatcggcgtcagcgacctgctgctggtcgccgccaacaccaccggcg SEQ ID Gene Nucleotide Sequence

NO:# Name

gcgtcaccggcaagatgatcagcccgcagagcatcgccatcgcctgcgccg ccgtcggcctggtcggcaaggagagcgacctgttccgcttcaccgtcaagca cagcctgatcttcacctgcatcgtcggcgtcatcaccaccctgcaggcctac gtcctgacctggatgatcccgtga

The synthetic operon is then cloned and transformed as described above. Transformation is confirmed by resistance of the cells to antibiotic selection, and gene expression is confirmed by northern blot (to confirm RNA transcription), western blot, or ELISA methods (to confirm protein expression).

Growth on lactate as a sole carbon source. Recombinant M.

capsulatus transformed with a vector containing the synthetic operon encoding genes for lactate utilization are inoculated into lOOmL shake flasks containing 20-50mL NMS media, 1% sodium lactate and 10 ug/mL kanamycin. The flasks are then shaken continuously while being incubated at 42°C. Growth is confirmed by monitoring optical density of the culture over time. Note that because lactate is the only carbon source provided to the cells, all cell mass produced must have been derived from lactate.

EXAMPLE 9: METHYLOMONAS METHANICA ENGINEERED TO GROW ON LACTATE

Growth and Conjugations. The procedure for growth and conjugation of Methylomonas methanica was performed essentially identically to the procedures described above for M capsulatus.

Introduction of Lactate Utilization Pathway. Nucleic acid sequences encoding lactate dehydrogenase D (LdhD) and a lactate permease (LctP) from E. coli were codon optimized for expression in M. methanica. The codon optimized nucleic acids encoding LdhD and LctP are synthesized as an operon (SEQ ID NO: 139; see Table 22 for components of operon) under control of an hps promoter with appropriate intergenic regions (CAPITALIZED sequence) incorporating ribosome binding sequences.

Table 22: Lactate Utilization Pathway Operon Codon Optimized for methanica

SEQ ID Gene Name Nucleotide Sequence

NO:#

GATGATTTGAGCGATAAGTCATCCGCGGAAA

AAGGTGCGGTAGCTGTGGATAAAAAAGGCGC

CAGATAGTAAGCGCTAAGGATTGGGGTGCGT

CGCCGGTCGCGGCGGCGCTCCTCGACGGCAG

AGTTGGTGCCAGGTTGGCGGATGATTGATGCC

GAATATTACGCGACCAATTCTCGAGGCAAATG

AACTGTGAGCTACTGAGTTGCAGGCATTGACA

GCCATCCCATTTCTATCATACAGTTACGGACG

CATC AC G AGT AGGTG AT AAGC CT AGC AG ATT

GCGGCAGTTGGCAAAATCAGCTATTACTAATA

ATTAAAAACTTTCGGAGCACATCAC

SEQ ID LdhD atgaaattggcggtctatagcaccaaacaatatgataaaaaatatttgcaaca NO: 136 agtcaacgaaagcttcggcttcgaattggaattcttcgatttcttgttgaccga aaaaaccgcgaaaaccgcgaacggctgcgaagcggtctgcatcttcgtca acgatgatggcagccgcccggtcttggaagaattgaaaaaacacggcgtc aaatatatcgcgttgcgctgcgcgggcttcaacaacgtcgatttggatgcgg cgaaagaattgggcttgaaagtcgtccgcgtcccggcgtatgatccggaag cggtcgcggaacacgcgatcggcatgatgatgaccttgaaccgccgcatc caccgcgcgtatcaacgcacccgcgatgcgaacttcagcttggaaggcttg accggcttcaccatgtatggcaaaaccgcgggcgtcatcggcaccggcaa aatcggcgtcgcgatgttgcgcatcttgaaaggcttcggcatgcgcttgttg gcgttcgatccgtatccgagcgcggcggcgttggaattgggcgtcgaatat gtcgatttgccgaccttgttcagcgaaagcgatgtcatcagcttgcactgccc gttgaccccggaaaactatcacttgttgaacgaagcggcgttcgaacaaat gaaaaacggcgtcatgatcgtcaacaccagccgcggcgcgttgatcgata gccaagcggcgatcgaagcgttgaaaaaccaaaaaatcggcagcttggg catggatgtctatgaaaacgaacgcgatttgttcttcgaagataaaagcaac gatgtcatccaagatgatgtcttccgccgcttgagcgcgtgccacaacgtctt gttcaccggccaccaagcgttcttgaccgcggaagcgttgaccagcatcag ccaaaccaccttgcaaaacttgagcaacttggaaaaaggcgaaacctgcc cgaacgaattggtctaa

SEQ ID Intergenic TAATAATTAAAAACTTTCGGAGCACATCAC NO: 137 Region

SEQ ID LctP atgaacttgtggcaacaaaactatgatccggcgggcaacatctggttgagc NO: 138 agcttgatcgcgagcttgccgatcttgttcttcttcttcgcgttgatcaaattga aattgaaaggctatgtcgcggcgagctggaccgtcgcgatcgcgttggcg gtcgcgttgttgttctataaaatgccggtcgcgaacgcgttggcgagcgtcg tctatggcttcttctatggcttgtggccgatcgcgtggatcatcatcgcggcg gtcttcgtctataaaatcagcgtcaaaaccggccaattcgatatcatccgcag cagcatcttgagcatcaccccggatcaacgcttgcaaatgttgatcgtcggc ttctgcttcggcgcgttcttggaaggcgcggcgggcttcggcgcgccggtc gcgatcaccgcggcgttgttggtcggcttgggcttcaaaccgttgtatgcgg cgggcttgtgcttgatcgtcaacaccgcgccggtcgcgttcggcgcgatgg gcatcccgatcttggtcgcgggccaagtcaccggcatcgatagcttcgaaa tcggccaaatggtcggccgccaattgccgttcatgaccatcatcgtcttgttc SEQ ID Gene Name Nucleotide Sequence

NO:#

tggatcatggcgatcatggatggctggcgcggcatcaaagaaacctggcc ggcggtcgtcgtcgcgggcggcagcttcgcgatcgcgcaatatttgagca gcaacttcatcggcccggaattgccggatatcatcagcagcttggtcagctt gttgtgcttgaccttgttcttgaaacgctggcaaccggtccgcgtcttccgctt cggcgatttgggcgcgagccaagtcgatatgaccttggcgcacaccggct ataccgcgggccaagtcttgcgcgcgtggaccccgttcttgttcttgaccgc gaccgtcaccttgtggagcatcccgccgttcaaagcgttgttcgcgagcgg cggcgcgttgtatgaatgggtcatcaacatcccggtcccgtatttggataaat tggtcgcgcgcatgccgccggtcgtcagcgaagcgaccgcgtatgcggc ggtcttcaaattcgattggttcagcgcgaccggcaccgcgatcttgttcgcg gcgttgttgagcatcgtctggttgaaaatgaaaccgagcgatgcgatcagc accttcggcagcaccttgaaagaattggcgttgccgatctatagcatcggca tggtcttggcgttcgcgttcatcagcaactatagcggcttgagcagcaccttg gcgttggcgttggcgcacaccggccacgcgttcaccttcttcagcccgttct tgggctggttgggcgtcttcttgaccggcagcgataccagcagcaacgcgt tgttcgcggcgttgcaagcgaccgcggcgcaacaaatcggcgtcagcgat ttgttgttggtcgcggcgaacaccaccggcggcgtcaccggcaaaatgatc agcccgcaaagcatcgcgatcgcgtgcgcggcggtcggcttggtcggca aagaaagcgatttgttccgcttcaccgtcaaacacagcttgatcttcacctgc atcgtcggcgtcatcaccaccttgcaagcgtatgtcttgacctggat gatcccgtaa

SEQ ID Lactate TTCGGAATCCCTGACGGGAATTGGCCCGAAG NO: 139 Utilization AAGGCAGATGCCATCGTTCAGTATCGAAAGG

Pathway AACATGGGGATTTTCAGTCATTGAAGGATCTG Operon GAGAATGTCAGCGGCATTGGCGAGAAAACCC

TTCAGGCCAATGAAAAAGACATTCGCTTCACG

GATGATTTGAGCGATAAGTCATCCGCGGAAA

AAGGTGCGGTAGCTGTGGATAAAAAAGGCGC

CAGATAGTAAGCGCTAAGGATTGGGGTGCGT

CGCCGGTCGCGGCGGCGCTCCTCGACGGCAG

AGTTGGTGCCAGGTTGGCGGATGATTGATGCC

GAATATTACGCGACCAATTCTCGAGGCAAATG

AACTGTGAGCTACTGAGTTGCAGGCATTGACA

GCCATCCCATTTCTATCATACAGTTACGGACG

CATC AC G AGT AGGTG AT AAGC CT AGC AG ATT

GCGGCAGTTGGCAAAATCAGCTATTACTAATA

ATTAAAAACTTTCGGAGCACATCACatgaaattggc ggtctatagcaccaaacaatatgataaaaaatatttgcaacaagtcaacgaa agcttcggcttcgaattggaattcttcgatttcttgttgaccgaaaaaaccgcg aaaaccgcgaacggctgcgaagcggtctgcatcttcgtcaacgatgatgg cagccgcccggtcttggaagaattgaaaaaacacggcgtcaaatatatcgc gttgcgctgcgcgggcttcaacaacgtcgatttggatgcggcgaaagaatt gggcttgaaagtcgtccgcgtcccggcgtatgatccggaagcggtcgcgg aacacgcgatcggcatgatgatgaccttgaaccgccgcatccaccgcgcg tatcaacgcacccgcgatgcgaacttcagcttggaaggcttgaccggcttc SEQ ID Gene Name Nucleotide Sequence

NO:#

accatgtatggcaaaaccgcgggcgtcatcggcaccggcaaaatcggcgt cgcgatgttgcgcatcttgaaaggcttcggcatgcgcttgttggcgttcgatc cgtatccgagcgcggcggcgttggaattgggcgtcgaatatgtcgatttgc cgaccttgttcagcgaaagcgatgtcatcagcttgcactgcccgttgacccc ggaaaactatcacttgttgaacgaagcggcgttcgaacaaatgaaaaacgg cgtcatgatcgtcaacaccagccgcggcgcgttgatcgatagccaagcgg cgatcgaagcgttgaaaaaccaaaaaatcggcagcttgggcatggatgtct atgaaaacgaacgcgatttgttcttcgaagataaaagcaacgatgtcatcca agatgatgtcttccgccgcttgagcgcgtgccacaacgtcttgttcaccggc caccaagcgttcttgaccgcggaagcgttgaccagcatcagccaaaccac cttgcaaaacttgagcaacttggaaaaaggcgaaacctgcccgaacgaatt ggtctaaTAATAATTAAAAACTTTCGGAGCACATC

ACatgaacttgtggcaacaaaactatgatccggcgggcaacatctggttg agcagcttgatcgcgagcttgccgatcttgttcttcttcttcgcgttgatcaaat tgaaattgaaaggctatgtcgcggcgagctggaccgtcgcgatcgcgttgg cggtcgcgttgttgttctataaaatgccggtcgcgaacgcgttggcgagcgt cgtctatggcttcttctatggcttgtggccgatcgcgtggatcatcatcgcgg cggtcttcgtctataaaatcagcgtcaaaaccggccaattcgatatcatccgc agcagcatcttgagcatcaccccggatcaacgcttgcaaatgttgatcgtcg gcttctgcttcggcgcgttcttggaaggcgcggcgggcttcggcgcgccg gtcgcgatcaccgcggcgttgttggtcggcttgggcttcaaaccgttgtatg cggcgggcttgtgcttgatcgtcaacaccgcgccggtcgcgttcggcgcg atgggcatcccgatcttggtcgcgggccaagtcaccggcatcgatagcttc gaaatcggccaaatggtcggccgccaattgccgttcatgaccatcatcgtct tgttctggatcatggcgatcatggatggctggcgcggcatcaaagaaacct ggccggcggtcgtcgtcgcgggcggcagcttcgcgatcgcgcaatatttg agcagcaacttcatcggcccggaattgccggatatcatcagcagcttggtc agcttgttgtgcttgaccttgttcttgaaacgctggcaaccggtccgcgtcttc cgcttcggcgatttgggcgcgagccaagtcgatatgaccttggcgcacacc ggctataccgcgggccaagtcttgcgcgcgtggaccccgttcttgttcttga ccgcgaccgtcaccttgtggagcatcccgccgttcaaagcgttgttcgcga gcggcggcgcgttgtatgaatgggtcatcaacatcccggtcccgtatttgga taaattggtcgcgcgcatgccgccggtcgtcagcgaagcgaccgcgtatg cggcggtcttcaaattcgattggttcagcgcgaccggcaccgcgatcttgtt cgcggcgttgttgagcatcgtctggttgaaaatgaaaccgagcgatgcgat cagcaccttcggcagcaccttgaaagaattggcgttgccgatctatagcatc ggcatggtcttggcgttcgcgttcatcagcaactatagcggcttgagcagca ccttggcgttggcgttggcgcacaccggccacgcgttcaccttcttcagccc gttcttgggctggttgggcgtcttcttgaccggcagcgataccagcagcaac gcgttgttcgcggcgttgcaagcgaccgcggcgcaacaaatcggcgtcag cgatttgttgttggtcgcggcgaacaccaccggcggcgtcaccggcaaaat gatcagcccgcaaagcatcgcgatcgcgtgcgcggcggtcggcttggtcg gcaaagaaagcgatttgttccgcttcaccgtcaaacacagcttgatcttcacc tgcatcgtcggcgtcatcaccaccttgcaagcgtatgtcttgacctg gatgatcccgtaa The synthetic operon is then cloned and transformed into M. methanica as described above. Transformation is confirmed by resistance of the cells to antibiotic selection, and gene expression is confirmed by northern blot (to confirm RNA transcription), western blot, or ELISA methods (to confirm protein expression).

Growth on lactate as a sole carbon source. Recombinant M.

methanica transformed with a vector containing the synthetic operon encoding genes for lactate utilization are inoculated into lOOmL shake flasks containing 20-50mL NMS media, 1% sodium lactate and 10 ug/mL kanamycin. The flasks are then shaken continuously while being incubated at 30°C. Growth is confirmed by monitoring optical density of the culture over time. Note that because lactate is the only carbon source provided to the cells, all cell mass produced must have been derived from lactate.

The disclosure of U.S. provisional patent application Serial No.

61/718,024 filed Oct 24, 2012, is incorporated herein in its entirety.

The various embodiments described above can be combined to provide further embodiments. All of the U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications and non-patent publications referred to in this specification and/or listed in the Application Data Sheet are incorporated herein by reference, in their entirety. Aspects of the embodiments can be modified, if necessary to employ concepts of the various patents, applications and publications to provide yet further embodiments.

These and other changes can be made to the embodiments in light of the above-detailed description. In general, in the following claims, the terms used should not be construed to limit the claims to the specific embodiments disclosed in the specification and the claims, but should be construed to include all possible

embodiments along with the full scope of equivalents to which such claims are entitled. Accordingly, the claims are not limited by the disclosure.