
Title:
MODIFIED ORGANISMS FOR ETHYLENE, ETHANE, AND METHANE BIOGENESIS AND METHODS FOR USE THEREOF
Document Type and Number:
WIPO Patent Application WO/2022/204489
Kind Code:
A1
Abstract:
The present disclosure provides non-naturally occurring microbial organisms capable of producing ethylene, ethane, and/or methane, as well as methods for producing ethylene, ethane, and/or methane using the same.

Inventors:
NORTH JUSTIN (US)
MURALI SRIVIDYA (US)
YOUNG SARAH (US)
TABITA FRED
Application Number:
PCT/US2022/021905
Publication Date:
September 29, 2022
Filing Date:
March 25, 2022
Assignee:
OHIO STATE INNOVATION FOUNDATION (US)
International Classes:
C12N1/00; C12N1/20; C12N1/21; C12N15/09; C12N15/52; C12P5/00; C12P5/02
Foreign References:
US20190300870A12019-10-03
US20110296543A12011-12-01
Other References:
NORTH JUSTIN A, MURALI SRIVIDYA, NARROWE ADRIENNE B, XIONG WEILI, BYERLY KATHRYN M, YOUNG SARAH J, YOSHIKUNI YASUO, MCSWEENEY SEAN: "A nitrogenase-like enzyme system catalyzes methionine, ethylene, and methane biogenesis", SCIENCE, vol. 369, 28 August 2020 (2020-08-28), pages 1094 - 1098, XP055975532
LAMONTE BERNADETTE L., HUGHES JEFFREY A.: "In vivo hydrolysis of S-adenosylmethionine induces the met regulon of Escherichia coli", MICROBIOLOGY, vol. 152, no. 5, 1 May 2006 (2006-05-01), pages 1451 - 1459, XP055975516
CARRIÓN O., CURSON A. R. J., KUMARESAN D., FU Y., LANG A. S., MERCADÉ E., TODD J. D.: "A novel pathway producing dimethylsulphide in bacteria is widespread in soil environments", NAT COMMUN, vol. 6, no. 6579, 25 March 2015 (2015-03-25), pages 1 - 8, XP055975513
MUNK A. CHRISTINE, COPELAND ALEX, LUCAS SUSAN, LAPIDUS ALLA, DEL RIO TIJANA GLAVINA, BARRY KERRIE, DETTER JOHN C., HAMMON NANCY, I: "Complete genome sequence of Rhodospirillum rubrum type strain (S1)", STANDARDS IN GENOMIC SCIENCES, vol. 4, 1 July 2011 (2011-07-01), pages 293 - 302, XP055975492
NORTH JUSTIN A., MILLER ANTHONY R., WILDENTHAL JOHN A., YOUNG SARAH J., TABITA F. ROBERT: "Microbial pathway for anaerobic 5'-methylthioadenosine metabolism coupled to ethylene formation", PROC NATL ACAD SCI USA, vol. 114, no. 48, 13 November 2017 (2017-11-13), pages E10455 - E10464, XP055975485
Attorney, Agent or Firm:
ANDREANSKY, Eric, S. (US)
Claims:
WHAT IS CLAIMED IS:

1. A non-naturally occurring microbial organism comprising a nucleic acid encoding one or more genes of a methylthio-alkane reductase complex and one or more genes of a methionine salvage pathway.

2. A non-naturally occurring microbial organism of claim 1, wherein the organism produces ethylene, ethane, methane, or combinations thereof.

3. The non-naturally occurring microbial organism of claim 2, wherein the organism produces ethylene.

4. The non-naturally occurring microbial organism of claim 2, wherein the organism produces ethane.

5. The non-naturally occurring microbial organism of claim 2, wherein the organism produces methane.

6. The non-naturally occurring microbial organism of any one of claims 1-5, wherein the one or more genes of a methylthio-alkane reductase complex comprise marB, marH, marD, marK, or combinations thereof.

7. The non-naturally occurring microbial organism of any one of claims 1-6, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 1.

8. The non-naturally occurring microbial organism of claim 7, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 1.

9. The non-naturally occurring microbial organism of any one of claims 1-8, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 3.

10. The non-naturally occurring microbial organism of claim 9, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 3.

11. The non-naturally occurring microbial organism of any one of claims 1-10, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 5.

12. The non-naturally occurring microbial organism of claim 11, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 5.

13. The non-naturally occurring microbial organism of any one of claims 1-12, wherein the one or more genes of a methylthio-alkane reductase comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 7.

14. The non-naturally occurring microbial organism of claim 13, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 7.

15. The non-naturally occurring organism of any one of claims 1-14, wherein the one or more genes of a methionine salvage pathway comprise one or more genes of a dihydroxyacetone phosphate (DHAP) shunt pathway.

16. The non-naturally occurring organism of claim 15, wherein the one or more genes of a DHAP shunt pathway comprise 5’-methylthioadenosine phosphorylase (mtnP), methylthioadenosine nucleosidase (mtnN), 5-methylthioribose kinase (mtnK), 5-methylthioribose-1-phosphate isomerase (mtnA), 5-methylthioribulose-1-phosphate aldolase (ald2), or combinations thereof.

17. The non-naturally occurring organism of claim 16, wherein the one or more genes of a DHAP shunt pathway comprise mtnP.

18. The non-naturally occurring organism of claim 16, wherein the one or more genes of a DHAP shunt pathway comprise mtnN and mtnK.

19. The non-naturally occurring organism of any one of claims 16-18, wherein the one or more genes of a DHAP shunt pathway comprise mtnA.

20. The non-naturally occurring organism of any one of claims 16-19, wherein the one or more genes of a DHAP shunt pathway comprise ald2.

21. The non-naturally occurring microbial organism of any one of claims 1-20, wherein the nucleic acid further encodes one or more genes of a SAM hydrolase.

22. The non-naturally occurring microbial organism of any one of claims 1-20, wherein the nucleic acid further encodes one or more genes of a methanethiol methylase (mddA), a methionine gamma lyase (mgl), or combinations thereof.

23. The non-naturally occurring microbial organism of any one of claims 1-22, wherein the nucleic acid is codon optimized.

24. The non-naturally occurring microbial organism of any one of claims 1-23, wherein the nucleic acid is integrated into the genome of the organism.

25. The non-naturally occurring microbial organism of any one of claims 1-23, wherein the nucleic acid is episomally integrated into a plasmid.

26. A non-naturally occurring microbial organism, wherein the organism is an anaerobic organism which produces ethylene, ethane, and/or methane using a methylthio-alkane reductase complex and a methionine salvage pathway, and wherein the organism has been optimized for producing ethylene, ethane, and/or methane with one or more non-naturally occurring genes.

27. The non-naturally occurring microbial organism of claim 26, wherein the one or more non-naturally occurring genes comprise one or more genes of a SAM hydrolase.

28. The non-naturally occurring microbial organism of claim 26, wherein the one or more non-naturally occurring genes comprise one or more genes of a methanethiol methylase (mddA), a methionine gamma lyase (mgl), or combinations thereof.

29. The non-naturally occurring microbial organism of any one of claims 26-28, wherein the one or more non-naturally occurring genes are integrated into the genome of the organism.

30. The non-naturally occurring microbial organism of any one of claims 26-28, wherein the one or more non-naturally occurring genes are episomally expressed from a plasmid.

31. The non-naturally occurring microbial organism of any one of claims 26-30, wherein the one or more non-naturally occurring genes are codon optimized.

32. A method of producing ethylene, ethane, and/or methane comprising: culturing a population of the non-naturally occurring microbial organism of any one of claims 1-31 in a culture medium comprising one or more carbon sources; and recovering the ethylene, ethane, and/or methane.

33. The method of claim 32, wherein the one or more carbon sources comprise carbon dioxide, carbon monoxide, an organic acid, a volatile fatty acid, an alcohol, cellulosic plant mass, or combinations thereof.

34. The method of claim 32 or 33, wherein the one or more carbon sources comprise carbon dioxide, carbon monoxide, malate, succinate, pyruvate, fumarate, formate, acetate, propionate, butyrate, ethanol, glycerol, corn stover, miscanthus, or switchgrass.

35. The method of any one of claims 32-34, wherein the one or more carbon sources comprise corn stover.

36. The method of claim 32, wherein the one or more carbon sources comprise lignocellulosic biomass.

37. The method of any one of claims 32-36, wherein the population is cultured in the absence of sulfate.

38. A bioreactor comprising the non-naturally occurring microbial organism of any one of claims 1-31.

39. A vector comprising: one or more exogenous nucleic acid molecules encoding one or more genes of a methylthio-alkane reductase complex and one or more genes of a methionine salvage pathway.

40. The vector of claim 39, wherein the one or more genes of a methylthio-alkane reductase complex comprise marB, marH, marD, marK, or combinations thereof.

41. The vector of claim 39 or claim 40, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 1.

42. The vector of claim 41, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 1.

43. The vector of any one of claims 39-42, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 3.

44. The vector of claim 43, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 3.

45. The vector of any one of claims 39-44, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 5.

46. The vector of claim 43, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 5.

47. The vector of any one of claims 39-46, wherein the one or more genes of a methylthio-alkane reductase comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 7.

48. The vector of claim 47, wherein the one or more genes of a methylthio-alkane reductase comprise a nucleic acid sequence of SEQ ID NO: 7.

49. The vector of any one of claims 39-48, wherein the one or more genes of a methionine salvage pathway comprise one or more genes of a dihydroxyacetone phosphate (DHAP) shunt pathway.

50. The vector of claim 49, wherein the one or more genes of a DHAP shunt pathway comprise 5’-methylthioadenosine phosphorylase (mtnP), 5-methylthioribose kinase (mtnK), 5-methylthioribose-1-phosphate isomerase (mtnA), 5-methylthioribulose-1-phosphate aldolase (ald2), alcohol dehydrogenase (adh), or combinations thereof.

51. The vector of claim 50, wherein the one or more genes of a DHAP shunt pathway comprise mtnP.

52. The vector of claim 50, wherein the one or more genes of a DHAP shunt pathway comprise mtnN and mtnK.

53. The vector of any one of claims 50-52, wherein the one or more genes of a DHAP shunt pathway comprise mtnA.

54. The vector of any one of claims 50-53, wherein the one or more genes of a DHAP shunt pathway comprise ald2.

55. The vector of any one of claims 39-54, wherein the one or more exogenous nucleic acid molecules further encode one or more genes of a SAM hydrolase.

56. The vector of any one of claims 39-55, wherein the one or more exogenous nucleic acid molecules further encode one or more genes of a methanethiol methylase (mddA), a methionine gamma lyase (mgl), or combinations thereof.

57. The vector of any one of claims 39-56, wherein the one or more genes are integrated into a gene expression cassette.

58. The vector of claim 57, wherein the gene expression cassette comprises a promoter.

59. The vector of claim 58, wherein the promoter comprises a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 9.

60. The vector of claim 59, wherein the promoter comprises a nucleic acid sequence of SEQ ID NO: 9.

61. The vector of claim 58, wherein the promoter comprises a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 10.

62. The vector of claim 61, wherein the promoter comprises a nucleic acid sequence of SEQ ID NO: 10.

63. The vector of any one of claims 39-62, wherein the one or more genes have been codon optimized.

64. A non-naturally occurring organism comprising a vector of any one of claims 39-63.


Description:
MODIFIED ORGANISMS FOR ETHYLENE, ETHANE, AND METHANE BIOGENESIS AND METHODS FOR USE THEREOF

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of priority to U.S. Provisional Application No. 63/165,904, filed March 25, 2021, the disclosure of which is incorporated herein by reference in its entirety.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

This invention was made with government support under Grant No. DE-SC0019338 awarded by the U.S. Department of Energy. The Government has certain rights in the invention.

BACKGROUND

Nitrogenases are an ancient group of enzymes, existing approximately 3.2 billion years ago, before the evolution of oxygenic photosynthesis and subsequent widespread oxygenation (1, 2). Their essential function is reduction of dinitrogen gas into ammonia, contributing over half of the annual global nitrogen fixation required for the synthesis of nucleic and amino acids by all life on earth (3). Ancestors to nitrogenase in anaerobic prokaryotes also gave rise to distinct nitrogenase-like reductases for bacterial photosynthesis and archaeal methanogenesis cofactor metabolism (4, 5, 6, 7). These include the dark operative protochlorophyllide oxidoreductase (DPOR) and chlorophyllide a oxidoreductase (COR) of bacteriochlorophyll biosynthesis, and Ni2+-sirohydrochlorin a,c-diamide reductive cyclase for biosynthesis of the archaeal methyl coenzyme-M reductase cofactor F430 (4, 5, 6, 7). However, the evolutionary history of nitrogen fixation revealed overlooked nitrogen fixation-like (NFL) sequences in the genomes of anaerobic bacteria with entirely unknown function. Some were surprisingly associated with sulfur metabolism and transport genes (8, 9). This suggested that certain members of the nitrogenase family potentially have a role in sulfur metabolism.

Production of ethylene gas (> 1 μmol/h/g dry cell weight) was previously observed from photosynthetic Alphaproteobacteria such as Rhodospirillum rubrum and Rhodopseudomonas palustris when growing anaerobically under the low sulfate concentrations (< 200 μM) commonly encountered in their freshwater and soil habitats (see Figs. 5A-C) (10). The precursor of ethylene, (2-methylthio)ethanol (MT-EtOH), and the pathway for its production were documented (P2017-343 -096; Novel Microbial Process to Synthesize Ethylene) (10). This volatile organic sulfur compound (VOSC) was produced from byproducts of S-adenosyl-L-methionine (SAM) utilization to regenerate methionine (Fig. 1A; DHAP shunt) (10). SAM is a key cellular cofactor synthesized directly from methionine and is required by all organisms for diverse processes including DNA, RNA, and protein methylations, polyamine and neurotransmitter synthesis, quorum sensing, and 5'-deoxyadenosyl radical generation by radical SAM enzymes (11). However, the enzymes responsible for the liberation of sulfur from MT-EtOH for methionine regeneration and concomitant ethylene formation were unresolved (10). These enzymes are thus disclosed to be a reductase of the nitrogenase-like family of enzymes, specifically a methylthio-alkane reductase (Mar) composed of components MarB, MarH, MarD, and MarK (MarBHDK) (see Fig. 1A).

There is a clear need for methods of producing the industrial precursor compounds ethylene, ethane, and methane, and microorganisms for the same. In particular, known ethylene-producing enzyme systems require oxygen (aminocyclopropanecarboxylate oxidase and 2-oxoglutarate dioxygenase), forming a flammable ethylene-oxygen gas mixture. In addition, methane and ethane are also explosive and flammable when mixed with air. Therefore, a microorganism and enzyme system to produce significant levels of ethylene, ethane, or methane in the absence of oxygen would have great utility.

SUMMARY

The present disclosure provides non-naturally occurring microbial organisms which are capable of producing ethylene, ethane, methane, or combinations thereof.

In one aspect, a non-naturally occurring microbial organism is provided comprising a nucleic acid encoding one or more genes of a methylthio-alkane reductase complex and one or more genes of a methionine salvage pathway.

In another aspect, a non-naturally occurring microbial organism is provided, wherein the organism is an anaerobic organism which produces ethylene, ethane, and/or methane using a methylthio-alkane reductase complex and a methionine salvage pathway, and wherein the organism has been optimized for producing ethylene, ethane, and/or methane with one or more non-naturally occurring genes.

In another aspect, a method of producing ethylene, ethane, and/or methane is provided, the method comprising: culturing a population of the non-naturally occurring microbial organism described herein in a culture medium comprising one or more carbon sources; and recovering the ethylene, ethane, and/or methane.

A bioreactor is further provided comprising the non-naturally occurring microbial organism described herein.

A vector is also provided comprising: one or more exogenous nucleic acid molecules encoding one or more genes of a methylthio-alkane reductase complex and one or more genes of a methionine salvage pathway.

The details of one or more embodiments of the disclosure are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the disclosure will be apparent from the description and drawings, and from the claims.

DESCRIPTION OF DRAWINGS

FIGs. 1A-1C show that nitrogenase-like proteins are linked to VOSC utilization. (FIG. 1A) Methylthio-alkane reductase (1), the gene product of marBHDK (proposed), converts VOSCs to ethylene, methane, and methanethiol for methionine biosynthesis. MT-EtOH is produced by the widespread DHAP shunt (10, 25, 26). (FIG. 1B) R. rubrum proteins with increased abundance when methylthio-alkane reductase activity is induced (“MT-EtOH” or “Lo” 50 μM sulfate) versus repressed (“Hi” 1 mM sulfate). X; isolated Tn5 transposon mutants (Fig. 7), which could not utilize MT-EtOH for growth. (FIG. 1C) Changes in gene transcript abundance of R. rubrum parent strain (WRdht) and SalR deletion strain (0785::Tn5) under “Hi” and “Lo” sulfate. *; no significant change, p>0.25, two-tailed. Enzyme and Compound Key: 1) methylthio-alkane reductase; 2) serine/homoserine O-acetyltransferase (2.3.1.31); 3) O-acetylhomoserine sulfhydrylase (2.5.1.49); 4) S-adenosylmethionine synthetase (2.1.1.13, 2.1.1.14); 5) methionine synthase (2.5.1.6); 6) cystathionine beta-synthase (4.2.1.22); 7) cystathionine gamma-lyase (4.4.1.1); 8) cystathionine gamma-synthase (2.5.1.48); 9) cystathionine beta-lyase (4.4.1.8); 10) MTA nucleosidase (3.2.2.16); 11) 5-methylthioribose kinase (2.7.1.100); 12) MTA phosphorylase (2.4.2.28); 13) 5-methylthioribose-1-phosphate isomerase (5.3.1.33); 14) 5-methylthioribulose-1-phosphate aldolase (4.1.2.n); 15) alcohol dehydrogenase (1.1.1.1); DHAP) dihydroxyacetone phosphate; HSE) homoserine; OAHS) O-acetyl-L-homoserine; SRH) S-ribosyl-L-homocysteine; R-H) methyl acceptor; THF) tetrahydrofolate.

FIGs. 2A-2F show that genes for marBHDK are required for anaerobic methionine metabolism from VOSCs. (FIG. 2A) Growth and average total hydrocarbon production of strains utilizing sulfate or VOSCs (see Fig. 8 and Fig. 19). *; not applicable. (FIG. 2B) Total amount of hydrocarbons produced when cells were fed with the indicated VOSC. (FIG. 2C) Plasmid-based complementation studies of NFL genes for growth of R. rubrum NFL gene deletion strain. (A-C) Error bars are the standard deviation for N = 3 independent biological replicates. (FIGs. 2D and 2E) Identification of methionine (RT = 8.5 min) and methanethiol (RT = 28.3 min) upon feeding R. rubrum strains with (2-[methyl-C14]thio)ethanol (RT = 22.8 min). (FIG. 2F) Change in Gibbs free energy under standard conditions for the conversion of VOSCs to methanethiol and the corresponding hydrocarbon. H2 represents 2e− and 2H+ equivalents.

FIGs. 3A-3D show that methylthio-alkane reductase and nitrogenase are independent. (FIGs. 3A and 3B) Stoichiometric production of methane and ethane by cells fed with DMS and EMS. (FIG. 3C) Competition assays for methylthio-alkane reductase repression in cells grown with 1 mM MT-EtOH or DMS plus the indicated amount of sulfate. Non-linear fit to the Hill equation gives EC50 DMS/sulfate = 140 μM sulfate and EC50 MT-EtOH/sulfate = 110 μM sulfate for 50% activity with DMS and MT-EtOH as substrate, respectively. (FIG. 3D) Whole-cell methylthio-alkane reductase (Mar) and molybdenum nitrogenase (NifHDK) activities for wild type (WT) and Δ0772:3/Δ0793:6 deletion (ΔΔ) strains under methylthio-alkane reductase inducing (50 μM sulfate) or repressing (1 mM sulfate) and NifHDK inducing (Glu, glutamate) or repressing (NH4+, ammonium) conditions. Standard deviations are shown as error bars (A-C) or are < 10% (D) for N = 3 biological replicates.
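The Hill-equation repression described for FIG. 3C can be illustrated with a minimal sketch. Only the two EC50 values (140 μM and 110 μM sulfate) come from the text; the function name, the inhibitory form of the Hill equation, the Hill coefficient of 1, and the 100% maximum activity are assumptions chosen for illustration, not the fit actually used for the figure.

```python
# Illustrative sketch only: the inhibitory Hill form, the Hill coefficient (n = 1)
# and the 100 % maximum activity are assumptions; only the EC50 values are from the text.

EC50_DMS_UM = 140.0      # μM sulfate giving 50 % activity with DMS as substrate
EC50_MT_ETOH_UM = 110.0  # μM sulfate giving 50 % activity with MT-EtOH as substrate


def hill_repression(sulfate_um, ec50_um, hill_n=1.0, max_activity=100.0):
    """Return the activity (percent of maximum) remaining at a sulfate concentration in μM."""
    if sulfate_um < 0 or ec50_um <= 0 or hill_n <= 0:
        raise ValueError("sulfate must be >= 0; EC50 and Hill coefficient must be > 0")
    return max_activity / (1.0 + (sulfate_um / ec50_um) ** hill_n)


if __name__ == "__main__":
    # Compare predicted repression for the two substrates across a sulfate range.
    for sulfate in (0.0, 50.0, 110.0, 140.0, 1000.0):
        dms = hill_repression(sulfate, EC50_DMS_UM)
        mt_etoh = hill_repression(sulfate, EC50_MT_ETOH_UM)
        print(f"{sulfate:7.1f} uM sulfate: DMS {dms:5.1f} %, MT-EtOH {mt_etoh:5.1f} %")
```

By construction, the function returns half of the maximum activity when the sulfate concentration equals the EC50, which is the definition of EC50 used in the figure description.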

FIG. 4 shows that methylthio-alkane reductases are phylogenetically distinct. Phylogenetic tree of NifD superfamily homologues. The scale bar represents the number of substitutions per site. Nodes with UFBoot support values > 95% are indicated with black circles. Clade labeling: Group IV-A (NfaD; nitrogen fixation IV-A) (27), Group IV-B (CfbD; Ni2+-sirohydrochlorin a,c-diamide reductive cyclase) (4,5), Group IV-C (MarD; putative methylthio-alkane reductase), Group IV and Group VI (NflD; nitrogen fixation-like of unknown function), Group V (ChlN; DPOR, and BchY; COR). Clade labels and colors are per Raymond (9) and Meheust (28). Av, Azotobacter vinelandii; Bv, Blastochloris viridis; Ep, Endomicrobium proavitum; Rc, Rhodobacter capsulatus; Rp, Rhodopseudomonas palustris; Ru, Rhodospirillum rubrum.

FIGs. 5A-5C show the ethylene specific rate of production during growth. Bacterial growth measured via optical density at 660 nm (O.D. 660nm) and the corresponding specific rate of ethylene production in μmol ethylene per hour per gram dry cell weight for (FIGs. 5A and 5B) limiting sulfate concentrations and (FIG. 5C) MT-EtOH (200 μM). Error bars are the standard deviation for N = 3 independent growth experiments (biological replicates).
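The unit used in FIGs. 5A-5C (μmol ethylene per hour per gram dry cell weight) can be made concrete with a short sketch. The function name, its arguments, and the example numbers are hypothetical; the description does not state how headspace ethylene or dry cell weight were measured, so the inputs are assumed to be available already.

```python
# Illustrative sketch only: function name, arguments and example values are
# hypothetical; they simply spell out the unit "μmol per hour per gram dry cell weight".


def specific_ethylene_rate(ethylene_start_umol, ethylene_end_umol, hours, dry_cell_weight_g):
    """Return the specific production rate in μmol ethylene / h / g dry cell weight."""
    if hours <= 0 or dry_cell_weight_g <= 0:
        raise ValueError("elapsed time and dry cell weight must be positive")
    produced_umol = ethylene_end_umol - ethylene_start_umol
    return produced_umol / hours / dry_cell_weight_g


if __name__ == "__main__":
    # Hypothetical example: 2.0 μmol accumulated over 4 h by 0.25 g dry cell weight.
    rate = specific_ethylene_rate(0.0, 2.0, 4.0, 0.25)
    print(f"specific rate: {rate:.2f} umol/h/g dry cell weight")
```

Normalizing by dry cell weight is what allows rates from cultures at different optical densities to be compared on the same axis.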

FIG. 6 shows the Rhodospirillum rubrum thiol cluster. Known sulfur metabolism genes (yellow) and other genes of putative and unknown function, potentially involved in sulfur metabolism, are localized to a cluster of genes in the R. rubrum genome. This region contains the Group IV-C NFL genes marBHDK required for methylthio-alkane reductase activity and the Group IV NFL genes nflDK of unknown function (red).

FIGs. 7A-7D show the Rhodospirillum rubrum strain WRdht (ΔrlpA::GmR/Δald2) transposon mutagenesis screen. (FIGs. 7A, 7B, and 7C) Example of screen and identification of a random R. rubrum Tn5 transposon mutant (isolate 17E5) that is incapable of growing on MT-EtOH but retains the capability for growing on sulfate. These mutants are presumably defective in metabolism specific to MT-EtOH and are selected for further growth and sequencing analysis as summarized in panel (FIG. 7D).

FIGs. 8A-8F show the growth of R. rubrum wild type and deletion strains. Culture optical density measured at 660 nm (O.D. 660nm) for cells in the absence of a sulfur source (none) or 1 mM of the indicated sulfur source. Error bars are the standard deviation for N = 3 independent growth experiments (biological replicates).

FIGs. 9A-9C show the substrate variability and complementation growth studies of R. rubrum. (FIG. 9A) Screen for R. rubrum growth with additional VOSCs and sulfur-containing amino acids. Each sulfur source was supplied at 1 mM concentration. Error bars are the standard deviation for N = 3 independent growth experiments (biological replicates). (FIGs. 9B and 9C) Plasmid-based complementation of NFL genes for growth of R. rubrum NFL gene deletion strain (strain Δ0772:3/Δ0793:6) utilizing 1 mM sulfate or 1 mM DMS, respectively, as sole sulfur source. Error bars are the standard deviation for N = 4 independent growth experiments (biological replicates).

FIG. 10 shows the NifD superfamily amino acid alignment. Pairwise alignment of NifD superfamily sequences is shown in the region of active site residues responsible for coordination of the P-cluster and FeMo-cofactor within the molybdenum nitrogenase subunit NifD (*), and substrate bound to the FeMo-cofactor (▼). Numbering is based on Azotobacter vinelandii NifD (Av) (9).

FIG. 11 shows the NifK superfamily amino acid alignment. Pairwise alignment of NifK superfamily sequences is shown in the region of active site residues responsible for coordination of the P-cluster within the molybdenum nitrogenase subunit NifK (*). Numbering is based on Azotobacter vinelandii NifK (Av) (9). Note that the group IV-B nitrogenase-like Ni2+-sirohydrochlorin a,c-diamide reductive cyclase (CfbCD) does not contain a NifK counterpart.

FIG. 12 shows the NifH superfamily amino acid alignment. Pairwise alignment of NifH superfamily sequences is shown in the region of active site residues responsible for MgATP binding and hydrolysis and Fe4-S4 iron-sulfur cluster binding (*). The conserved arginine (▼) is the site of ADP-ribosylation post-translational modification for nitrogenase activity regulation in the bona fide nitrogenases. ADP-ribosylation performed by dinitrogenase reductase ADP-ribosyltransferase (DraT) in R. rubrum prevents association of NifH with NifDK. The modification is removed by dinitrogenase reductase activating glycohydrolase (DraG). Numbering is based on Azotobacter vinelandii NifH (Av) (9). For NflH, NfaH, and MarH, corresponding genes were located within 10 genes upstream or downstream from nflDK, nfaKD, and marDK, respectively, in each organism.

FIG. 13 shows the NifB superfamily amino acid alignment. Pairwise alignment of NifB superfamily sequences is shown in the regions of conservation for molybdenum nitrogenase NifB sequences. The cysteines of the Radical SAM motif CxxxCxxC (*) coordinate the Fe4-S4 cluster responsible for binding S-adenosyl-L-methionine (SAM). The SAM methyl group provides the carbide during formation of the NifB-cofactor precursor to FeMo-, FeV-, or FeFe-cofactor. Numbering is based on Azotobacter vinelandii NifB (Av) (12). Note that the group IV-B Ni2+-sirohydrochlorin a,c-diamide reductive cyclases (CfbCD) and group V bacteriochlorophyll reductases DPOR (ChlLNB) and COR (BchXYZ) do not require or possess a NifB counterpart for assembly. For NflB, NfaB, and MarB, corresponding genes, if present, were located within 10 genes upstream or downstream from nflDK, nfaDK, and marDK, respectively, in each organism.

FIGs. 14A-14B show total C14 incorporation from (2-[methyl-C14]thio)ethanol. The wild type strain (FIG. 14A) and Δ793:6 marBHDK deletion strain (FIG. 14B) were fed with (2-[methyl-C14]thio)ethanol for the indicated amount of time. The radioactivity present due to soluble metabolites in the extracellular media and extracted from the cells was measured by scintillation counting before resolving metabolites by HPLC (Fig. 2D-E). The remaining insoluble material present in the cells was coordinately measured by scintillation counting, which indicates the amount of C14 incorporation into cell material via methionine synthesis. The total radioactivity is the sum of the soluble and insoluble components. Data is a representative C14 incorporation series for N = 2 independent feeding experiments (biological replicates).

FIGs. 15A-15B show the thermodynamics of ethanol versus ethylene and water formation from MT-EtOH. (FIG. 15A) Thermodynamic cycle comparing the formation of ethanol to the formation of ethylene from MT-EtOH. The difference in the formation free energies can be understood in terms of ΔG3 = ΔG2 − ΔG1. (FIG. 15B) Detailed thermodynamics in which it can be seen that in reaction 3, the gas phase reaction energy ΔErxn(g) favors ethanol formation from ethylene and water by -52.3 kJ/mol, but even in the gas phase the reaction is entropically disfavored due to the loss of degrees of freedom in going from two molecules to one, resulting in ΔGrxn(g) = 7.1 kJ/mol. Then, the free energy of solvation of ethanol is less favorable than the solvation of the ethylene and water by 11.0 kJ/mol. Although the COSMO solvation model does not account explicitly for hydrogen bonding (which would likely favor the reactants, as well), solvating the water dipole and ethylene quadrupole pair is likely more favorable than solvating the single ethanol dipole. The combined effect of entropy loss and differences in solvation is to make ethanol formation unfavorable relative to ethylene and water by 18.1 kJ/mol.
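The arithmetic behind the 18.1 kJ/mol figure can be written out as a short worked sum. This is a sketch under stated assumptions: the symbol \Delta\Delta G_{solv} for the 11.0 kJ/mol solvation difference, the reading of \Delta G_3 as the solution-phase free energy of reaction 3 (ethylene + water to ethanol), and the grouping of thermal and entropic contributions into a single 59.4 kJ/mol term are labels introduced here for illustration and are not notation taken from the figure.

```latex
\begin{align*}
\Delta G_3 &= \Delta G_2 - \Delta G_1 \\
\Delta G_{\mathrm{rxn}}(g) - \Delta E_{\mathrm{rxn}}(g)
  &= 7.1 - (-52.3) = 59.4\ \mathrm{kJ/mol}
  && \text{(thermal and entropic contribution, gas phase)} \\
\Delta G_3 &\approx \Delta G_{\mathrm{rxn}}(g) + \Delta\Delta G_{\mathrm{solv}}
  = 7.1 + 11.0 = 18.1\ \mathrm{kJ/mol}
  && \text{(solution phase)}
\end{align*}
```

Read this way, the favorable gas-phase reaction energy is more than offset by the entropic penalty, and the solvation difference pushes the balance further toward ethylene and water.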

FIG. 16 shows NifH superfamily phylogenetic analysis. Phylogenetic tree of NifH superfamily homologues based on an LG+R10 evolution model. The scale bar represents the number of substitutions per site. UFBoot support values of 95% or greater are shown as black circles on branches (56). For disambiguation, enzymes of known function are labeled Group IV-A (NfaH; nitrogen fixation IV-A) (27), Group IV-B (CfbC; Ni2+-sirohydrochlorin a,c-diamide reductive cyclase) (4,5), and Group IV-C (MarH; putative methylthio-alkane reductase). Group IV and Group VI NifH homologues of unknown function are designated NflH. Group V is ChlL (DPOR) and BchX (COR). Clade coloring follows Raymond (9) and Meheust (28). Av, Azotobacter vinelandii; Bv, Blastochloris viridis; Ep, Endomicrobium proavitum; Rc, Rhodobacter capsulatus; Rp, Rhodopseudomonas palustris; Ru, Rhodospirillum rubrum.

FIG. 17 shows organisms with genes for 5’-methylthioadenosine salvage via DHAP Shunt and methylthio-alkane reductase pathways. The DHAP shunt for conversion of 5’-methylthioadenosine to MT-EtOH is composed of MTA phosphorylase (MtnP) or 5-methylthioribose kinase (MtnK) and 5-methylthioribose-1-phosphate isomerase (MtnA) and 5-methylthioribulose-1-phosphate aldolase (Ald2) (see Fig. 1A). Black circles represent UFBoot bootstrap values of 100. Nodes are labeled to indicate phylum membership.

FIG. 18 shows the genes and their putative functions surrounding Group IV and VI gene clusters. Homologous sulfur metabolism genes, which are enriched in Groups IV-A, IV-C, and IV of unknown function, are indicated as described in the key.

FIGs. 19A-19C show the identification of methylthio-alkane reductase capabilities in other alpha-proteobacteria. Culture optical density measured at 660 nm (O.D. 660nm) for cells in the absence of a sulfur source (none) or 1 mM of the indicated sulfur source. Error bars are the standard deviation for N = 3 independent growth experiments (biological replicates). Blastochloris viridis DSM 133 and Rhodopseudomonas palustris CGA010 possess marBHDK homologues, whereas Rhodobacter capsulatus SB 1003 does not.

FIGs. 20A-20D show the methionine salvage pathways for ethylene and methane, optimization, and bioreactor design. The bioreactor design employs cellulolytic bacteria to convert corn stover biomass into industrially tractable gases (ethylene and methane) utilizing a novel anaerobic methionine salvage pathway discovered in certain photosynthetic bacteria and Clostridia. (FIG. 20A) Bioreactor design for conversion of cellulosic biomass to ethylene and methane biogas. (FIG. 20B) Methionine salvage pathway and ethylene/ethane/methane producing enzyme system (MarBHDK) for biogas production. (FIG. 20C) Example of pathway construction in cellulose-degrading Bacilli and Clostridia for production of ethylene and methane. (FIG. 20D) Optimization of ethylene production using a non-naturally occurring gene from Coliphage, SAM hydrolase, for direct conversion of SAM to MTA.

Like reference symbols in the various drawings indicate like elements.

DETAILED DESCRIPTION

Methane is used for the production of energy, hydrogen gas, synthesis gas, and methanol used in the manufacturing of various organic chemicals. Methane is the second most used energy source next to electricity. Ethylene is used in a variety of industrial processes, including the production of polyethylene for plastic bags, polystyrene for packaging and insulation, and ethylene oxide for detergents. In addition, ethylene may be converted to C5-C10 gasoline-like molecules. Ethylene is thus thought to be the most widely used chemical on earth (over 175 million tons in 2018) and the demands and market for this feedstock are steadily increasing, with nearly a $300 billion annual market. Thus, there is considerable interest in developing new and innovative ways to produce these key industrial precursor compounds (ethylene, ethane, methane) with bio-based methods as a potential way to supplement chemical-based processes.

For anaerobic ethylene production by microorganisms, the novel and widespread bacterial carbon and sulfur salvage pathway, the DHAP Shunt (FIG. 1A), converts the ubiquitous S-adenosyl-L-methionine byproduct, MTA, into adenine, DHAP, and the volatile organic sulfur compound, (2-methylthio)ethanol (MT-EtOH). Bacteria carrying this pathway include freshwater and soil bacteria such as Rhodospirillum rubrum and Rhodopseudomonas palustris, extra-intestinal pathogenic Escherichia coli, and pathogenic Bacillus species (70, 25, 26, 67). It was demonstrated that the Alphaproteobacteria R. rubrum and R. palustris were able to further utilize MT-EtOH as a sole sulfur source for growth and synthesis of sulfur-containing amino acids (e.g. methionine), producing stoichiometric amounts of ethylene gas in the process (70). This process was strictly anaerobic and clearly enzymatic in nature (70). This was the first reported solely anaerobic route to ethylene, and it involves a novel cooperation of genes and enzymes (MarBHDK). It was subsequently found that the enzyme system producing ethylene from MT-EtOH (MarBHDK) was a member of the nitrogenase family of enzymes from a novel and distinct clade (FIG. 4 and FIG. 16). This strictly anaerobic methylthio-alkane reductase system not only could produce ethylene from MT-EtOH, but it could also produce ethane from ethylmethyl sulfide (CH3-S-CH2-CH3) and methane from dimethyl sulfide (CH3-S-CH3). This was verified in alphaproteobacteria, including Rhodopseudomonas palustris, Rhodospirillum rubrum, and Blastochloris viridis. A search of the available database for other organisms that possess the same set of discovered genes encoding nitrogenase-like methylthio-alkane reductase enzymes for ethylene, ethane, and methane formation indicated that this enzyme was prevalent in genomes from multiple phyla of industrially relevant Proteobacteria and Firmicutes. These genes were also detected in anoxic high carbon ecosystems including wetland soils and animal rumen. Notably, expressed proteins for methylthio-alkane reductase were recovered in situ, supporting the ability to use a functional screen to potentially recover catalytically active enzymes from the environment.

Disclosed herein is an exclusively anaerobic enzyme system and associated pathways that couple sulfur metabolism to ethylene and methane production in the purple non-sulfur alpha-proteobacteria Rhodospirillum rubrum, Rhodopseudomonas palustris, and Blastochloris viridis (FIGs. 1A-1C). Genes for this anaerobic enzyme system are widely distributed amongst bacteria (FIG. 17), and this pathway reveals a possible route by which ethylene and methane, both of which are frequently observed in anoxic environments, can be produced by indigenous microbes. Disclosed herein are methods for the development of a potential industrially compatible process to biologically produce ethylene and methane in high yields. Disclosed herein is a method to fully characterize the anaerobic ethylene/ethane/methane producing enzyme system and determine how the genes are regulated at the molecular level. Computational modeling of the chemical reactions performed by the relevant enzymes is initiated to learn the mechanisms by which these enzymes catalyze the reactions involved in ethylene biosynthesis. In addition, since ethylene/ethane/methane synthesis from the respective precursor compound is an inducible process, further studies probe the molecular regulation of the genes involved during photosynthetic metabolism using a variety of “omics” tools. These biochemical and molecular studies are invaluable for optimizing ethylene/ethane/methane production and creating bacterial strains that over-produce ethylene/ethane/methane under controlled conditions.

Also disclosed herein is a method to maximize ethylene and methane production with different feedstocks; e.g., lignocellulose digests as well as inorganic carbon sources (FIGs. 20A-20D). Rps. palustris, as well as cellulolytic and acetogenic bacteria such as Ruminiclostridium josui and Clostridium ljungdahlii species, all contain the genes for the ethylene/ethane/methane producing enzyme system MarBHDK (FIG. 17), and each of these organisms has the capacity to grow on cellulosic digests as well as inorganic carbon sources (CO2). Conditions are optimized for each of these growth conditions.

Further disclosed are metagenomics and bioinformatic/computational approaches to discover more effective enzymes of uncultured organisms from anaerobic environments. Analysis of existing genome and metagenome databases allows identification of potential gene sequences for ethylene/ethane/methane producing enzyme systems that have specific or enhanced catalytic properties. Such sequences, homologous to known genes, may then be screened for their effectiveness in catalyzing key reactions of ethylene/ethane/methane synthesis. This leverages over 4 billion years of evolution to obtain the most efficient enzymes. In addition, a functional genomics approach may be established to isolate relevant genes from the metagenome without previous knowledge of sequences; e.g., by complementing specific mutant host organisms with environmental DNA (68). These metagenomics approaches, plus a full battery of other synthetic biology and “omics” approaches, are utilized to optimize ethylene/ethane/methane formation.

The following description of the disclosure is provided as an enabling teaching of the disclosure in its best, currently known embodiments. Many modifications and other embodiments disclosed herein will come to mind to one skilled in the art to which the disclosed compositions and methods pertain having the benefit of the teachings presented in the foregoing descriptions and the associated drawings. Therefore, it is to be understood that the disclosures are not to be limited to the specific embodiments disclosed and that modifications and other embodiments are intended to be included within the scope of the appended claims. The skilled artisan will recognize many variants and adaptations of the aspects described herein. These variants and adaptations are intended to be included in the teachings of this disclosure and to be encompassed by the claims herein.

Any recited method can be carried out in the order of events recited or in any other order that is logically possible. That is, unless otherwise expressly stated, it is in no way intended that any method or aspect set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not specifically state in the claims or descriptions that the steps are to be limited to a specific order, it is in no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including matters of logic with respect to arrangement of steps or operational flow, plain meaning derived from grammatical organization or punctuation, or the number or type of aspects described in the specification.

All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited. The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided herein can be different from the actual publication dates, which can require independent confirmation.

It is also to be understood that the terminology used herein is for the purpose of describing particular aspects only and is not intended to be limiting. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the disclosed compositions and methods belong. It can be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the specification and relevant art and should not be interpreted in an idealized or overly formal sense unless expressly defined herein.

Prior to describing the various aspects of the present disclosure, the following definitions are provided and should be used unless otherwise indicated. Additional terms may be defined elsewhere in the present disclosure.

Definitions

As used herein, “comprising” is to be interpreted as specifying the presence of the stated features, integers, steps, or components as referred to, but does not preclude the presence or addition of one or more features, integers, steps, or components, or groups thereof. Moreover, each of the terms “by,” “comprising,” “comprises,” “comprised of,” “including,” “includes,” “included,” “involving,” “involves,” “involved,” and “such as” is used in its open, non-limiting sense and may be used interchangeably. Further, the term “comprising” is intended to include examples and aspects encompassed by the terms “consisting essentially of” and “consisting of.” Similarly, the term “consisting essentially of” is intended to include examples encompassed by the term “consisting of.”

As used in the specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise.

It should be noted that ratios, concentrations, amounts, and other numerical data can be expressed herein in a range format. It can be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint. It is also understood that there are a number of values disclosed herein, and that each value is also herein disclosed as “about” that particular value in addition to the value itself. For example, if the value “10” is disclosed, then “about 10” is also disclosed. Ranges can be expressed herein as from “about” one particular value, and/or to “about” another particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it can be understood that the particular value forms a further aspect. For example, if the value “about 10” is disclosed, then “10” is also disclosed.

When a range is expressed, a further aspect includes from the one particular value and/or to the other particular value. For example, where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the disclosure, e.g. the phrase “x to y” includes the range from ‘x’ to ‘y’ as well as the range greater than ‘x’ and less than ‘y’. The range can also be expressed as an upper limit, e.g. ‘about x, y, z, or less’ and should be interpreted to include the specific ranges of ‘about x’, ‘about y’, and ‘about z’ as well as the ranges of ‘less than x’, ‘less than y’, and ‘less than z’. Likewise, the phrase ‘about x, y, z, or greater’ should be interpreted to include the specific ranges of ‘about x’, ‘about y’, and ‘about z’ as well as the ranges of ‘greater than x’, ‘greater than y’, and ‘greater than z’. In addition, the phrase “about ‘x’ to ‘y’”, where ‘x’ and ‘y’ are numerical values, includes “about ‘x’ to about ‘y’”. It is to be understood that such a range format is used for convenience and brevity, and thus, should be interpreted in a flexible manner to include not only the numerical values explicitly recited as the limits of the range, but also to include all the individual numerical values or sub-ranges encompassed within that range as if each numerical value and sub-range is explicitly recited. To illustrate, a numerical range of “about 0.1% to 5%” should be interpreted to include not only the explicitly recited values of about 0.1% to about 5%, but also include individual values (e.g., about 1%, about 2%, about 3%, and about 4%) and the sub-ranges (e.g., about 0.5% to about 1.1%; about 0.5% to about 2.4%; about 0.5% to about 3.2%, and about 0.5% to about 4.4%, and other possible sub-ranges) within the indicated range.

As used herein, the terms “about,” “approximate,” “at or about,” and “substantially” mean that the amount or value in question can be the exact value or a value that provides equivalent results or effects as recited in the claims or taught herein. That is, it is understood that amounts, sizes, formulations, parameters, and other quantities and characteristics are not and need not be exact, but may be approximate and/or larger or smaller, as desired, reflecting tolerances, conversion factors, rounding off, measurement error and the like, and other factors known to those of skill in the art such that equivalent results or effects are obtained. In some circumstances, the value that provides equivalent results or effects cannot be reasonably determined. In such cases, it is generally understood, as used herein, that “about” and “at or about” mean the nominal value indicated ±10% variation unless otherwise indicated or inferred. In general, an amount, size, formulation, parameter or other quantity or characteristic is “about,” “approximate,” or “at or about” whether or not expressly stated to be such. It is understood that where “about,” “approximate,” or “at or about” is used before a quantitative value, the parameter also includes the specific quantitative value itself, unless specifically stated otherwise.
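The fallback reading of “about” as the nominal value ±10% can be illustrated with a short sketch. The function names and the inclusive treatment of the endpoints are assumptions made for illustration only; they are not part of the disclosure.

```python
def about_interval(nominal: float, tolerance: float = 0.10) -> tuple[float, float]:
    """Return (low, high) for 'about nominal' read as nominal +/- tolerance.

    The 10% default mirrors the fallback definition in the text; treating
    both endpoints as included is an assumption of this sketch.
    """
    delta = abs(nominal) * tolerance
    return (nominal - delta, nominal + delta)


def is_about(value: float, nominal: float, tolerance: float = 0.10) -> bool:
    """True if value lies within the 'about nominal' interval."""
    low, high = about_interval(nominal, tolerance)
    return low <= value <= high


if __name__ == "__main__":
    print(about_interval(10))   # bounds for "about 10"
    print(is_about(10.9, 10))   # inside the interval
    print(is_about(11.5, 10))   # outside the interval
```

Under this reading, “about 10” spans 9 to 11, and the nominal value 10 itself is always included.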

The terms “culture”, “cultivate”, and “ferment” are used interchangeably and refer to the intentional growth, propagation, proliferation, and/or enablement of metabolism, catabolism, and/or anabolism of one or more cells (e.g. a microbial organism). The combination of both growth and propagation may be termed proliferation. Examples include production by an organism of ethylene, ethane, or methane. Culture does not refer to the growth or propagation of microorganisms in nature or otherwise without human intervention.

The term “growth” means an increase in cell size, total cellular contents, and/or cell mass or weight of a cell (e.g. a microbial organism).

A “growth media” or “growth medium” as used herein can be a solid, powder, or liquid mixture which comprises all or substantially all of the nutrients necessary to support the growth of microbial organisms; various nutrient compositions are preferably prepared when particular microbial species are being assayed. Amino acids, carbohydrates, minerals, vitamins and other elements known to those skilled in the art to be necessary for the growth of microbial organisms are provided in the medium. In one embodiment, the growth medium is liquid.

The term “propagation” refers to an increase in cell number via cell division.

The term “promoter” or “regulatory element” refers to a region or sequence determinants located upstream or downstream from the start of transcription and which are involved in recognition and binding of RNA polymerase and other proteins to initiate transcription. Promoters need not originate in the microbial organism used; for example, promoters derived from viruses or from other organisms can be used in the compositions or methods described herein.

A polynucleotide sequence is “heterologous” to a second polynucleotide sequence if it originates from a foreign species, or, if from the same species, is modified by human action from its original form. For example, a promoter operably linked to a heterologous coding sequence refers to a coding sequence from a species different from that from which the promoter was derived, or, if from the same species, a coding sequence which is different from naturally occurring allelic variants.

The term “recombinant” refers to a human manipulated nucleic acid (e.g. polynucleotide) or a copy or complement of a human manipulated nucleic acid (e.g. polynucleotide), or if in reference to a protein (i.e., a “recombinant protein”), a protein encoded by a recombinant nucleic acid (e.g. polynucleotide). In embodiments, a recombinant expression cassette comprising a promoter operably linked to a second nucleic acid (e.g. polynucleotide) may include a promoter that is heterologous to the second nucleic acid (e.g. polynucleotide) as the result of human manipulation (e.g., by methods described in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., (1989) or Current Protocols in Molecular Biology Volumes 1-3, John Wiley & Sons, Inc. (1994-1998)). In another example, a recombinant expression cassette may comprise nucleic acids (e.g. polynucleotides) combined in such a way that the nucleic acids (e.g. polynucleotides) are extremely unlikely to be found in nature. For instance, human manipulated restriction sites or plasmid vector sequences may flank or separate the promoter from the second nucleic acid (e.g. polynucleotide).

“Nucleic acid” or “oligonucleotide” or “polynucleotide” or grammatical equivalents used herein means at least two nucleotides covalently linked together. The term “nucleic acid” includes single-, double-, or multiple-stranded DNA, RNA and analogs (derivatives) thereof. Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and polynucleotides are polymers of any length, including longer lengths, e.g., 200, 300, 500, 1000, 2000, 3000, 5000, 7000, 10,000, etc. In certain embodiments, the nucleic acids herein contain phosphodiester bonds. In other embodiments, nucleic acid analogs are included that may have alternate backbones. The term encompasses nucleic acids containing known analogues of natural nucleotides which have similar or improved binding properties, for the purposes desired, as the reference nucleic acid. A particular nucleic acid sequence also encompasses “splice variants.” Similarly, a particular protein encoded by a nucleic acid encompasses any protein encoded by a splice variant of that nucleic acid. “Splice variants,” as the name suggests, are products of alternative splicing of a gene. After transcription, an initial nucleic acid transcript may be spliced such that different (alternate) nucleic acid splice products encode different polypeptides. Mechanisms for the production of splice variants vary, but include alternate splicing of exons. Alternate polypeptides derived from the same nucleic acid by read-through transcription are also encompassed by this definition. Any products of a splicing reaction, including recombinant forms of the splice products, are included in this definition. An example of splice variants is discussed in Leicher, et al., J. Biol. Chem. 273(52):35095-35101 (1998).

The term “expression cassette” refers to a nucleic acid construct, which when introduced into a host cell, results in transcription and/or translation of a RNA or polypeptide, respectively. In some embodiments, an expression cassette comprising a promoter operably linked to a second nucleic acid (e.g. polynucleotide) may include a promoter that is heterologous to the second nucleic acid (e.g. polynucleotide) as the result of human manipulation (e.g., by methods described in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., (1989) or Current Protocols in Molecular Biology Volumes 1-3, John Wiley & Sons, Inc. (1994-1998)). In some embodiments, an expression cassette comprising a terminator (or termination sequence) operably linked to a second nucleic acid (e.g. polynucleotide) may include a terminator that is heterologous to the second nucleic acid (e.g. polynucleotide) as the result of human manipulation. In some embodiments, the expression cassette comprises a promoter operably linked to a second nucleic acid (e.g. polynucleotide) and a terminator operably linked to the second nucleic acid (e.g. polynucleotide) as the result of human manipulation. In some embodiments, the expression cassette comprises an endogenous promoter. In some embodiments, the expression cassette comprises an endogenous terminator. In some embodiments, the expression cassette comprises a synthetic (or non-natural) promoter. In some embodiments, the expression cassette comprises a synthetic (or non-natural) terminator.

The terms “identical” or percent “identity,” in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues or nucleotides that are the same (i.e., about 60% identity, preferably 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or higher identity over a specified region when compared and aligned for maximum correspondence over a comparison window or designated region) as measured using a BLAST or BLAST 2.0 sequence comparison algorithm with default parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI web site or the like). Such sequences are then said to be “substantially identical.” This definition also refers to, or may be applied to, the complement of a test sequence. The definition also includes sequences that have deletions and/or additions, as well as those that have substitutions. As described below, the preferred algorithms can account for gaps and the like. Preferably, identity exists over a region that is at least about 10 amino acids or 20 nucleotides in length, or more preferably over a region that is 10-50 amino acids or 20-50 nucleotides in length. As used herein, percent (%) amino acid sequence identity is defined as the percentage of amino acids in a candidate sequence that are identical to the amino acids in a reference sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN, ALIGN-2 or Megalign (DNASTAR) software. Appropriate parameters for measuring alignment, including any algorithms needed to achieve maximal alignment over the full-length of the sequences being compared, can be determined by known methods.
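As a minimal sketch of the percent identity definition above, the following computes the percentage of candidate residues that match the reference in an alignment that has already been produced by some other tool. The function name, the "-" gap character, and the choice of candidate residue count as the denominator are assumptions of this sketch; alignment programs differ in how they normalize.

```python
def percent_identity(candidate_aln: str, reference_aln: str, gap: str = "-") -> float:
    """Percent of candidate residues identical to the aligned reference residue.

    Both inputs are rows of one pairwise alignment (equal length, gaps as `gap`).
    Denominator: number of non-gap residues in the candidate (an assumption).
    """
    if len(candidate_aln) != len(reference_aln):
        raise ValueError("aligned sequences must have equal length")
    candidate_residues = sum(1 for c in candidate_aln if c != gap)
    if candidate_residues == 0:
        raise ValueError("candidate contains no residues")
    matches = sum(
        1
        for c, r in zip(candidate_aln.upper(), reference_aln.upper())
        if c != gap and c == r
    )
    return 100.0 * matches / candidate_residues


if __name__ == "__main__":
    # One substitution in six aligned residues.
    print(round(percent_identity("MKTALV", "MRTALV"), 1))
    # A gap in the candidate is not counted against it under this convention.
    print(percent_identity("MKT-LV", "MKTALV"))
```

A threshold such as the 60% figure in the text could then be checked by comparing the returned value against 60.0.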

For sequence comparisons, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Preferably, default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters.

Examples of algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al. (1990) J. Mol. Biol. 215:403-410, and Altschul et al. (1997) Nuc. Acids Res. 25:3389-3402, respectively. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al. (1990) J. Mol. Biol. 215:403-410). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction is halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1992) Proc. Natl. Acad. Sci. USA 89:10915) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
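The extension step described above can be sketched for nucleotide sequences as follows. This is a simplified, ungapped illustration and not the NCBI implementation: the function names are hypothetical, the default X of 20 is an assumed value, and each direction is extended independently starting from the seed score. Only M=5 and N=-4 are taken from the BLASTN defaults stated in the text.

```python
def _extend(query, subject, q, s, step, start_score, match, mismatch, x_drop):
    """Walk one direction from (q, s); return (score gain, residues kept) at the best point."""
    running = best = start_score
    length = best_len = 0
    while 0 <= q < len(query) and 0 <= s < len(subject):
        running += match if query[q] == subject[s] else mismatch
        length += 1
        if running > best:
            best, best_len = running, length
        # Halt on a drop of X from the maximum, or a cumulative score of zero or below.
        if best - running >= x_drop or running <= 0:
            break
        q += step
        s += step
    return best - start_score, best_len


def extend_hit(query, subject, q_pos, s_pos, word_len, match=5, mismatch=-4, x_drop=20):
    """Extend a word hit in both directions; return (score, q_start, q_end_exclusive)."""
    seed = sum(
        match if query[q_pos + i] == subject[s_pos + i] else mismatch
        for i in range(word_len)
    )
    right_gain, right_len = _extend(
        query, subject, q_pos + word_len, s_pos + word_len, 1, seed, match, mismatch, x_drop
    )
    left_gain, left_len = _extend(
        query, subject, q_pos - 1, s_pos - 1, -1, seed, match, mismatch, x_drop
    )
    return seed + left_gain + right_gain, q_pos - left_len, q_pos + word_len + right_len


if __name__ == "__main__":
    # A 4-letter seed inside two identical 8-letter sequences grows to the full length.
    print(extend_hit("ACGTACGT", "ACGTACGT", 2, 2, 4))
```

The extension stops at the first of the three conditions listed in the text, and the reported segment is trimmed back to the position where the cumulative score was highest.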

The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-5877). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than about 0.01.

The phrase “codon optimized” as it refers to genes or coding regions of nucleic acid molecules for the transformation of various hosts, refers to the alteration of codons in the gene or coding regions of polynucleic acid molecules to reflect the typical codon usage of a selected organism without altering the polypeptide encoded by the DNA. Such optimization includes replacing at least one, or more than one, or a significant number, of codons with one or more codons that are more frequently used in the genes of that selected organism.
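A minimal sketch of this idea, assuming the standard genetic code and a caller-supplied codon usage table, is shown below. The function names and the usage frequencies in the example are hypothetical placeholders, not measured values for any organism; a real workflow would load a usage table for the selected host and would typically weigh other factors as well.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built in TCAG order; '*' marks stop codons.
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str) -> str:
    """Translate a coding sequence whose length is a multiple of three."""
    return "".join(CODON_TABLE[cds[i:i + 3].upper()] for i in range(0, len(cds), 3))


def codon_optimize(cds: str, usage: dict[str, float]) -> str:
    """Swap each codon for a more frequently used synonymous codon, if one exists."""
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3].upper()
        amino_acid = CODON_TABLE[codon]
        synonyms = [c for c, a in CODON_TABLE.items() if a == amino_acid]
        best = max(synonyms, key=lambda c: usage.get(c, 0.0))
        # Keep the original codon unless a synonym is strictly more frequent.
        if usage.get(best, 0.0) > usage.get(codon, 0.0):
            codon = best
        out.append(codon)
    return "".join(out)


if __name__ == "__main__":
    hypothetical_usage = {"CTG": 0.50, "CTA": 0.04, "AAA": 0.70, "AAG": 0.30}
    original = "ATGCTAAAGTAA"
    optimized = codon_optimize(original, hypothetical_usage)
    print(optimized)
    # The encoded polypeptide must be unchanged.
    print(translate(original) == translate(optimized))
```

The final comparison reflects the requirement in the definition that the encoded polypeptide is not altered.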

The phrase “selectively (or specifically) hybridizes to” refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence with a higher affinity, e.g., under more stringent conditions, than to other nucleotide sequences (e.g., total cellular or library DNA or RNA).

The phrase “stringent hybridization conditions” refers to conditions under which a probe will hybridize to its target subsequence, typically in a complex mixture of nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and will be different in different circumstances. Longer sequences hybridize specifically at higher temperatures. An extensive guide to the hybridization of nucleic acids is found in Tijssen, Techniques in Biochemistry and Molecular Biology: Hybridization with Nucleic Probes, “Overview of principles of hybridization and the strategy of nucleic acid assays” (1993). Generally, stringent conditions are selected to be about 5-10° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% of the probes complementary to the target hybridize to the target sequence at equilibrium (as the target sequences are present in excess, at Tm, 50% of the probes are occupied at equilibrium). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. For selective or specific hybridization, a positive signal is at least two times background, preferably 10 times background hybridization. Exemplary stringent hybridization conditions can be as follows: 50% formamide, 5×SSC, and 1% SDS, incubating at 42° C., or, 5×SSC, 1% SDS, incubating at 65° C., with wash in 0.2×SSC, and 0.1% SDS at 65° C.

Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the polypeptides which they encode are substantially identical. This occurs, for example, when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. In such cases, the nucleic adds typically hybridize under moderately stringent hybridization conditions. Exemplary “moderately stringent hybridization conditions" include a hybridization in a buffer of 40% formamide, 1

5 M NaCl, 1% SDS at 37° C., and a wash in IxSSC at 45° C. A positive hybridization is at least twice background. Those of ordinary skill will readily recognize that alternative hybridization and wash conditions can be utilized to provide conditions of similar stringency. Additional guidelines for determining hybridization parameters are provided in numerous reference, e.g., and Current Protocols in Molecular Biology, ed. Ausubel, et al.

0 One of skill will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning and the like. Polypeptides which are “substantially similar” share sequences as noted above except that residue positions which are not identical may differ by conservative amino acid changes.

Conservative amino acid substitutions refer to the interchangeability of residues having similar side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulfur-containing side chains is cysteine and methionine. Exemplary conservative amino acid substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, aspartic acid-glutamic acid, and asparagine-glutamine.
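By way of non-limiting illustration, the exemplary conservative substitution groups listed above can be expressed as a simple lookup. The following Python sketch is an assumption made for illustration only: the one-letter residue codes, the constant `CONSERVATIVE_GROUPS`, and the function `is_conservative_substitution` are hypothetical names that do not appear in the disclosure, and the check covers only the six exemplary groups rather than any broader definition of similarity.

```python
# Exemplary conservative substitution groups from the paragraph above,
# written with one-letter amino acid codes. All names here are hypothetical
# and chosen for illustration; they are not part of the disclosure.
CONSERVATIVE_GROUPS = [
    frozenset("VLI"),  # valine-leucine-isoleucine
    frozenset("FY"),   # phenylalanine-tyrosine
    frozenset("KR"),   # lysine-arginine
    frozenset("AV"),   # alanine-valine
    frozenset("DE"),   # aspartic acid-glutamic acid
    frozenset("NQ"),   # asparagine-glutamine
]


def is_conservative_substitution(a: str, b: str) -> bool:
    """Return True if residues a and b differ and share an exemplary group."""
    a, b = a.upper(), b.upper()
    if a == b:
        # Identical residues are not a substitution at all.
        return False
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)
```

Under these assumptions, a valine-to-leucine change is reported as conservative, whereas an alanine-to-leucine change is not, because alanine and leucine do not share any one of the listed exemplary groups.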

The term “modulator” refers to a composition that increases or decreases the level of a target molecule or the level of activity or function of a target molecule or the physical state of the target of the molecule. In embodiments, a modulator is a recombinant nucleic acid that is capable of increasing or decreasing the amount of a protein in a cell or the level of activity of a protein in a cell or transcription of a second nucleic acid in a cell. In embodiments, a modulator increases or decreases the level of activity of a protein or the amount of the protein in a cell. The term “modulate” is used in accordance with its plain and ordinary meaning and refers to the act of changing or varying one or more properties. “Modulation” refers to the process of changing or varying one or more properties. For example, as applied to the effects of a modulator on a target protein, to modulate means to change by increasing or decreasing a property or function of the target molecule or the amount of the target molecule. In embodiments, a recombinant nucleic acid that modulates the level of activity of a protein may increase the activity or amount of the protein relative to the absence of the recombinant nucleic acid. In embodiments, an increase in the activity or amount of a protein may include overexpression of the protein. “Overexpression” is used in accordance with its plain and ordinary meaning and refers to an increased level of expression of a protein relative to a control (e.g. cell or expression system not including a recombinant nucleic acid that contributes to the overexpression of a protein). In embodiments, a decrease in the activity or amount of a protein may include a mutation (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid; all/any of which may be in the coding region for a protein or in an operably linked region (e.g. promoter)) of the protein. The term “increased” refers to a detectable increase compared to a control.

A nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, “operably linked” means that the DNA sequences being linked are near each other, and, in the case of a secretory leader, contiguous and in reading phase. However, operably linked nucleic acids (e.g. enhancers and coding sequences) do not have to be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice. In embodiments, a promoter is operably linked with a coding sequence when it is capable of affecting (e.g. modulating relative to the absence of the promoter) the expression of a protein from that coding sequence (i.e., the coding sequence is under the transcriptional control of the promoter).

“Transformation” refers to the transfer of a nucleic acid molecule into a host organism (e.g. a microbial organism). In embodiments, the nucleic acid molecule may be a plasmid that replicates autonomously or it may integrate into the genome of the host organism (e.g. a microbial organism). Host organisms containing the transformed nucleic acid molecule may be referred to as “transgenic” or “recombinant” or “transformed” organisms (e.g. microbial organisms). A “genetically modified” organism (e.g. genetically modified microbial organism) is an organism (e.g. microbial organism) that includes a nucleic acid that has been modified by human intervention. Examples of a nucleic acid that has been modified by human intervention include, but are not limited to, insertions, deletions, mutations, expression nucleic acid constructs (e.g. over-expression or expression from a non-natural promoter or control sequence or an operably linked promoter and gene nucleic acid distinct from a naturally occurring promoter and gene nucleic acid in an organism), extra-chromosomal nucleic acids, and genomically contained modified nucleic acids. Genetically modified organisms may be made by rational modification of a nucleic acid or may be made by use of a mutagen or mutagenesis protocol that results in a mutation that was not identified (e.g. intended or targeted) prior to the use of the mutagen or mutagenesis protocol (e.g. UV exposure, EMS exposure, mutagen exposure, random genomic mutagenesis, transformation of a library of different nucleic acid constructs). Genetically modified organisms that include a modification (e.g. modification, insertion, deletion, mutation) not previously known or intended prior to making of the genetically modified organism may be identified through screening a plurality of organisms including one or more genetically modified organisms by using selection criteria that identify the genetically modified organism of interest. In embodiments, a genetically modified organism includes a recombinant nucleic acid.

As used herein, the term “episome” or “episomally” is intended to refer to an extrachromosomal DNA moiety or plasmid that can replicate autonomously in a host cell when physically separated from the chromosomal DNA of the host cell.

Methods for synthesizing sequences and bringing sequences together are well established and known to those of skill in the art. For example, in vitro mutagenesis and selection, site-directed mutagenesis, error prone PCR (Melnikov et al., Nucleic Acids Research, 27(4):1056-1062 (Feb. 15, 1999)), “gene shuffling” or other means can be employed to obtain mutations of naturally occurring genes.

Compositions

Microbial Organisms

The present disclosure provides non-naturally occurring microbial organisms which are capable of producing ethylene, ethane, methane, or combinations thereof. In some aspects, the microbial organism has been genetically modified with one or more genes directed to the production of ethylene, ethane, methane, or combinations thereof. In other aspects, the microbial organism may naturally produce ethylene, ethane, methane, or combinations thereof, but has been optimized for said production by the introduction of one or more non-naturally occurring genes.

Thus, in one aspect, a non-naturally occurring microbial organism is provided comprising a nucleic acid encoding one or more genes of a methylthio-alkane reductase complex and one or more genes of a methionine salvage pathway.

In some embodiments, the organism can produce ethylene, ethane, methane, or combinations thereof. In some embodiments, the organism produces ethylene. In some embodiments, the organism produces ethane. In some embodiments, the organism produces methane.

In another aspect, a non-naturally occurring microbial organism is provided, wherein the organism is an anaerobic organism which produces ethylene, ethane, and/or methane using a methylthio-alkane reductase complex and a methionine salvage pathway, and wherein the organism has been optimized for producing ethylene, ethane, and/or methane with one or more non-naturally occurring genes. In some embodiments, the one or more non-naturally occurring genes comprise one or more genes of a SAM hydrolase. In some embodiments, the one or more non-naturally occurring genes comprise one or more genes of a methanethiol methylase (mddA), a methionine gamma lyase (mgl), or combinations thereof.

Methylthio-alkane Reductases

In some embodiments, the one or more genes of a methylthio-alkane reductase complex may comprise marB, marH, marD, marK, or combinations thereof.

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise marB. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the nucleic acid sequence of SEQ ID NO: 1 (marB).

SEQ ID NO: 1

ATGACGGTTCCTGCTTATCCTTCCCGCCAGCCTGCGGCCGGCGGAGTTTCATCTT

GCGGTGGCGCGGGGGGCGGCTGCGGGGACAGGACGGCGTGCGACGGCGGCGA

CGGCGGTCGCGCCACCGCCCCGGTGGTCGCCCTGCGCGGTCGCCATCCCTGCTT

CGACCCCGCCCCCCAGGCCCATGCCCGGGCCGGGCGGCTGCATCTGCCGGTCAG

CCCGGCCTGCAATATCACCTGCCAGTTCTGCGCCCGGGATTTCAACGCCTCCGA

CCGCCGCCCCGGCGTGGCGCGCCGGCTTCTCAAGCCCGAGCAAGCCCTTGACGT GGTGCGCCGGGCGCTGCGGCTCTGCCCGGAAATCTCGGTCGTCGGCATCGCCGG

CCCCGGTGACACTTTGGCGACCAATCACGCCATCGACACCTTCGCCCTGATCCA

TGCGGACTTTCCGACGCTGATCAACTGCCTGTCGACCAATGGCCTGCGCCTGCC

CGATCGCGCCAAGGAGCTGGCCGCCGTTGGTGTTCAGACCCTGACCGTCACCGT

CAATGCCGTCGCCCCGGAGATCCAGGCGGTGATTTCGCCGGTGATCGCCGATCG

CGGCAAGCGGCTGGAGGGTATCGAGGCGGCCCGCGTGCTGATCGCCAACCAGC

TTGAGGGCATCGCCAAGGCGGTGGCTCTCGGCATGGTGGTCAAGGTCAATTGCG

TGCTGATCCCCGGGGTCAACGACGATCACATCGGCGCCGTCGCCCAAAAAGTG

GCGGCCGCCGGCGCCTCGTTGTTCAACATCATCGCCTTGATCCCCACCCATAAC

CTCGCCCATCTCCCCGCCCCCAGCCCGGCCCTGCTGGCCCGGGCCCAGCGCGAG

GCCGGACGCCACATCAGCGTCTTTACCCATTGTCAGCGCTGCCGCGCCGATGCC

GCCGGCGTGCCCGGCGTCAGCGATATCGCCGACCTGCTTTACGACCGGCGTCTT

GACGCCACGACCTTTTCCCACGGCTAG

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 1 (marB). In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 1.
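By way of non-limiting illustration, a percent identity threshold of the kind recited above can be checked once two sequences have been aligned. The Python sketch below is an assumption made for illustration only: it takes two sequences that have already been aligned to equal length (with `-` marking gaps), counts matching non-gap positions, and divides by the alignment length. The choice of denominator and the function names `percent_identity` and `meets_identity_threshold` are hypothetical and are not taken from the disclosure; alignment programs may compute identity differently.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned, equal-length sequences.

    Gaps are written as '-'. Identity is counted over all alignment columns,
    which is one common convention and an assumption of this sketch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold_percent: float) -> bool:
    """Return True if the aligned pair has at least the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold_percent
```

For example, under these assumptions two aligned ten-nucleotide sequences that differ at a single position have 90% identity, which satisfies an "at least 90%" threshold but not a 95% threshold.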

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the peptide sequence of SEQ ID NO: 2 (MarB).

SEQ ID NO: 2

MTVPAYPSRQPAAGGVSSCGGAGGGCGDRTACDGGDGGRATAPWALRGRHPCFD

PAPQAHARAGRLHLPVSPACNITCQFCARDFNASDRRPGVARRLLKPEQALDWRR

ALRLCPEISWGIAGPGDTLATNHAIDTFALIHADFPTLINCLSTNGLRLPDRAKELA A

VGVQTLTVTVNAVAPEIQAVISPVIADRGKRLEGIEAARVLIANQLEGIAKAVALGMV

VKVNCVLIPGVNDDHIGAVAQKVAAAGASLFNIIALIPTHNLAHLPAPSPALLARAQR

EAGRHISVFTHCQRCRADAAGVPGVSDIADLLYDRRLDATTFSHG
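By way of non-limiting illustration, the correspondence between a coding nucleic acid sequence such as SEQ ID NO: 1 and the peptide it encodes such as SEQ ID NO: 2 can be sketched as a codon-by-codon translation. The Python sketch below is an assumption made for illustration only: it uses the standard genetic code, reads from the first nucleotide in frame, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical and do not appear in the disclosure.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
# '*' marks a stop codon. Use of the standard code is an assumption here.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(coding_sequence: str) -> str:
    """Translate a DNA coding sequence in frame, stopping at the first stop codon."""
    # Remove whitespace so that line-wrapped sequence text can be passed directly.
    seq = "".join(coding_sequence.split()).upper()
    peptide = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        peptide.append(residue)
    return "".join(peptide)
```

Under these assumptions, the first fifteen nucleotides of SEQ ID NO: 1 (ATGACGGTTCCTGCT) translate to MTVPA, which agrees with the first five residues of SEQ ID NO: 2.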

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the peptide sequence of SEQ ID NO: 2. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein of SEQ ID NO: 2. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise marH. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the nucleic acid sequence of SEQ ID NO: 3 (marH).

SEQ ID NO: 3

ATGGCCAAAAGTCCCAAACAAATCGCCATCTATGGCAAAGGTGGCATCGGCAA

ATCGACCACCACCTCGAATATCAGCGCCGCCCTGGCCGAGGCCGGCTACAAGG

TGATGCAGTTCGGCTGCGACCCCAAAAGCGATTCGACCAATACCCTGCGCGGCG

GCGATTACATCCCCTCGGTGCTCGACCTGCTGCGCGAGAACGCCCGCGTCGATG

CCCATGAGGCGATCTTCCAGGGCTTTGGCGGCATCTATTGCGTTGAAGCCGGTG

GTCCGGCGCCAGGCGTCGGCTGCGCCGGTCGCGGCATCATCACCGCCGTCGAAC

TGCTCAAGCAGCAGAACGTCTTCGAAGAGCTCGATCTTGATTACGTGATCTTCG

ACGTGCTGGGCGACGTGGTCTGCGGCGGCTTCGCCGTGCCGATCCGTGAAGGCA

TCGCCGAACATGTCTTCACCGTGTCGTCGTCGGATTTCATGGCGATCTATGCCGC

GAACAATCTGTTCAAGGGCATTCAGAAGTACTCCAACGCCGGGGGCGCCCTGCT

TGGCGGGGTGATCGCCAATTCGATCAACACCGATTTCCACCGGGACATCATCGA

CGATTTCGTCGCCCGCACCCAGACCCAGGTCGTCCAATACGTGCCGCGCTCGCT

GACCGTCACCCAGGCCGAACTGCAGGGCCGCACGACGATCGAGGCGGCGCCCG

AGTCCGCCCAGGCCGAGATCTATCGGACCCTGGCGCGCAGCATCGCCGACCAT

ACGGACTCGAAGGTGCCGACCCCGCTTAACGCCCAAGAGCTGCGCGACTGGTC

GGCATCCTGGGCCAACCAATTGATCGAGATCGAACGGGCGAGCCAGCCGATTC

CCGCCCTGGCCTCATAA

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 3. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 3.

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise one or more marH genes associated with an accession number found in Table 1 below:

Table 1. Representative MarH Genes

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the peptide sequence of SEQ ID NO: 4 (MarH).

SEQ ID NO: 4

MAKSPKQIAIYGKGGIGKSTTTSNISAALAEAGYKVMQFGCDPKSDSTNTLRGGDY

IPSVLDLLRENARVDAHEAIFQGFGGIYCVEAGGPAPGVGCAGRGIITAVELLKQQN

VFEELDLDYVIFDVLGDWCGGFAVPIREGIAEHVFTVSSSDFMATYAANNLFKGIQ

KYSNAGGALLGGVIANSINTDFHRDIIDDFVARTQTQWQYVPRSLTVTQAELQGRT TIEAAPESAQAEIYRTLARSIADHTDSKVPTPLNAQELRDWSASWANQLIEIERASQP

IPALAS

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the peptide sequence of SEQ ID NO: 4. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein of SEQ ID NO: 4. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise marD. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the nucleic acid sequence of SEQ ID NO: 5 (marD).

SEQ ID NO: 5

ATGCCCATCAATCTCAAGACATCGGTGGTCGAGAGCCGCGAACAGCGGCTGGG

CACCATCATCGCCTGGGACGGCAAGGCCTCTGACCTGTCCAAGGAATCGGCCTA

TGCGCGCAGCGAGGGCTGCGGCAGCGCCTGCGGCGCCAAGGCCCGCCGGGTCT

GCGAGATGCGCAGCCCGTTCAGCCAGGGCTCGGTCTGTAGCGAACAGATGGTC

GAATGCCAAGCCGGCAACGTGCGCGGCGCCGTGCTGGTCCAGCATTCGCCGAT

CGGCTGCGGCGCCGGTCAGGTGATCTATAATTCGATCTTCCGCAATGGTCTGGC

GATCCGCGGCCTGCCGGTGGAGAACCTCCATCTGATCAGCACCAACCTGCGCGA

ACGCGACATGGTCTATGGCGGGCTCGACAAGCTCGAACGCACCATCCGCGACG

CCTGGGAGCGCCATCACCCCCAGGCCATTTTCATCGCCACCTCCTGCCCGACGG

CGATCATTGGCGACGACATCGAAAGCGTCGCTTCGCAGCTTGAAGCCGAGTTCG

GCATACCGGTCATACCGCTGCACTGCGAGGGCTTCAAATCCAAGCATTGGAGCA

CCGGCTTCGACGCCACCCAGCACGGCATCTTGCGCCAGATCGTCCGCAAAAATC

CCGAGCGCAAGCAGGAAGACCTGGTCAACGTCATCAATCTGTGGGGATCGGAT

GTCTTTGGCCCGATGCTCGGCGAATTGGGTTTGCGGGTGAACTACGTCGTCGAT

CTCGCCACCGTCGAGGATCTGGCCCAGATGTCGGAGGCGGCGGCAACCGTCGG CTTCTGCTACACGCTGTCGACCTATATGGCCGCCGCCCTGGAACAGGAATTCGG

CGTTCCCGAGGTCAAGGCGCCCATGCCCTATGGCTTCGCCGGCACCGACGCCTG

GCTGCGCGAGATCGCCCGCGTCACCCACCGCGAGGAGCAGGCCGAGGCCTATA

TCGCCCGCGAGCACGCCCGGGTGAAGCCACAGCTTGAGGCCCTGCGCGAGAAG

CTCAAGGGCATCAAGGGCTTCGTCTCCACCGGCTCGGCCTATGCCCATGGCATG

ATCCAGGTGCTGCGCGAACTGGGCGTCACCGTCGACGGCTCGTTGGTCTTCCAC

CACGATCCGGTCTACGACAGCCAGGATCCGCGTCAGGATTCCCTTGCCCATCTG

GTCGACAACTATGGCGACGTCGGCCATTTCAGCGTCGGCAATCGCCAGCAGTTC

CAGTTCTACGGCCTGCTTCAGCGGGTGAAGCCCGATTTCATCATCATCCGCCAC

AACGGGTTGGCGCCGCTGGCCTCGCGCCTGGGCATCCCGGCCATTCCGCTGGGC

GATGAACATATCGCCGTGGGCTATCAGGGCATCTTGAACCTGGGTGAATCCATC

CTCGATGTGCTGGCCCACCGCAAGTTCCACGAAGACATCGCCGCCCATGTCCGC

CTGCCCTATCGCCAGGACTGGCTGGCCCGCGATCCCTTCGATCTGGCCCGGCAA

AGCGCCGGCCAGCCGCGCCGTCCCGCAGAGTGA


In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 5. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 5.

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise one or more marD genes associated with an accession number found in Table 2 below:

Table 2. Representative MarD Genes

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the peptide sequence of SEQ ID NO: 6 (MarD).

SEQ ID NO: 6

MPINLKTSWESREQRLGTIIAWDGKASDLSKESAYARSEGCGSACGAKARRVCEM

RSPFSQGSVCSEQMVECQAGNVRGAVLVQHSPIGCGAGQVIYNSIFRNGLAIRGLPV

ENLHLISTNLRERDMVYGGLDKLERTIRDAWERHHPQAIFIATSCPTAIIGDDIESVAS

QLEAEFGIPVIPLHCEGFKSKHWSTGFDATQHGILRQIVRKNPERKQEDLVNVINLW

GSDVFGPMLGELGLRVNYVVDLATVEDLAQMSEAAATVGFCYTLSTYMAAALEQ EFGVPEVKAPMPYGFAGTDAWLREIARVTHREEQAEAYIAREHARVKPQLEALREK

LKGIKGFVSTGSAYAHGMIQVLRELGVTVDGSLVFHHDPVYDSQDPRQDSLAHLVD

NYGDVGHFSVGNRQQFQFYGLLQRVKPDFIIIRHNGLAPLASRLGIPAIPLGDEHIAV GYQGILNLGESILDVLAHRKFHEDIAAHVRLPYRQDWLARDPFDLARQSAGQPRRP AE

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the peptide sequence of SEQ ID NO: 6. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein of SEQ ID NO: 6. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise marK. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the nucleic acid sequence of SEQ ID NO: 7 (marK).

SEQ ID NO: 7

ATGCCCGATGCAGAGTCCCGTTCCCAGGTCACGGCGAAGGCCGCGCCACCACC

CGCCCCCAAGACCAATTCGATCGAACAGGTGCGCTATATCTGTTCGATCGGCGC

CATGCACAGCGCCTCGGCTATCCCACGGGTGATCCCGATCACCCATTGCGGCCC

GGGCTGCGCCGACAAGCAGTTCATGAACGTCGCCTTCTATAATGGCTTCCAGGG

CGGCGGCTATGGCGGCGGAGCGGTGGTGCCGAGCACCAACGCCACCGAGCGCG

AGGTGGTCTTCGGCGGCGCCGAGCGCCTGGACGAATTGATCGGCGCCTCGCTGC

AGGTGCTTGACGCCGACCTGTTCGTGGTGCTGACCGGCTGTATTCCCGATCTGG

TCGGCGATGACATCGGCTCGGTGGTCGGCCCCTATCAGAAGCGCGGCGTGCCG

ATCGTCTATGCCGAGACTGGCGGCTTTCGCGGCAATAACTTCACCGGCCACGAA

CTGGTGACCAAGGCGATCATCGACCAGTTCGTTGGCGATTACGATGCGGAGCGC

GACGGGGCCCGCGAGCCCCATACGGTCAATGTCTGGTCACTGCTGCCCTACCAC

AACACCTTCTGGCGCGGTGATTTGACCGAGATCAAGCGGCTGCTCGAAGGCATC

GGCCTTAAGGTCAATATCCTGTTCGGCCCGCAATCGGCCGGGGTGGCGGAATGG

AAGGCCATCCCGCGCGCCGGCTTTAATCTGGTGCTCTCGCCCTGGCTGGGGCTG

GACACGGCGCGCCATTTGGACCGCAAATACGGCCAGCCGACCCTGCATCGACC

GATCATCCCGATCGGCGCCAAGGAAACCGGCGCCTTCCTGCGCGAGGTGGCGG

CTTTCGCCGGCCTCGACAGCGCGGTGGTCGAGGCCTTCATCACCGCCGAAGAAG

CCGTTTATTACCGCTATCTGGAGGACTTCACCGATTTCTACGCGGAGTACTGGT

GGGGTCTGCCGGCCAAATTCGCCGTCATCGGCGACAGCGCCTATAATCTGGCCT

TGACCAAATTCCTGGTAAACCAGTTGGGCCTGATACCGGGGCTGCAGATCATCA

CCGACAATCCGCCCGAGGAGGTGCGCGAGGATATCCGCGCCCATTACCACGCG

ATCGCCGATGACGTGGCCACCGATGTCTCTTTTGAAGAAGACAGCTACACCATC

CACCAAAAGATCCGCGCCACCGATTTCGGCCACAAGGCGCCGATCCTGTTTGGC

ACCACCTGGGAACGCGACCTTGCCAAGGAATTGAAGGGGGCGATCGTCGAGGT CGGCTTCCCGGCATCCTATGAAGTCGTGCTGTCGCGCAGCTATCTTGGCTACCG

GGGCGCCCTGACTTTGCTGGAAAAAATCTACACAACCACCGTCAGCGCAAGCG

CTTGA

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 7. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 7.

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise one or more marK genes associated with an accession number found in Table 3 below:

Table 3. Representative MarK Genes

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the peptide sequence of SEQ ID NO: 8 (MarK).


SEQ ID NO: 8

PDAESRSQVTAKAAPPPAPKTNSIEQVRYICSIGAMHSASAIPRVIPITHCGPGCAD KQ

FMNVAFYNGFQGGGYGGGAWPSTNATEREVVFGGAERLDELIGASLQVLDADLF

WLTGCIPDLVGDDIGSWGPYQKRGVPIVYAETGGFRGNNFTGHELVTKAIIDQFV

GDYDAERDGAREPHTVNVWSLLPYHNTFWRGDLTEIKRLLEGIGLKVNILFGPQSA GVAEWKAIPRAGFNLVLSPWLGLDTARHLDRKYGQPTLHRPIIPIGAKETGAFLREV AAFAGLDSAWEAFITAEEAVYYRYLEDFTDFYAEYWWGLPAKFAVIGDSAYNLALT

KFLVNQLGLIPGLQHTDNPPEEVREDIRAHYHAIADDVATDVSFEEDSYTIHQKIRA T

DFGHKAPILFGTTWERDLAKELKGAIVEVGFPASYEWLSRSYLGYRGALTLLEKIY

TTTVSASA

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the peptide sequence of SEQ ID NO: 8. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein of SEQ ID NO: 8. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

The art is familiar with the methods and techniques used to identify other methylthio-alkane reductase genes and nucleotide sequences.

Methionine Salvage Pathways

In some embodiments, the one or more genes of a methionine salvage pathway comprise one or more genes of a dihydroxyacetone phosphate (DHAP) shunt pathway. In some embodiments, the one or more genes of a DHAP shunt pathway comprise 5’-methylthioadenosine phosphorylase (mtnP), methylthioadenosine nucleosidase (mtnl), 5-methylthioribose kinase (mtnK), 5-methylthioribose-1-phosphate isomerase (mtnA), 5-methylthioribulose-1-phosphate aldolase (ald2), or combinations thereof.

In some embodiments, the one or more genes of a methionine salvage pathway comprise mtnP. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid). In some embodiments, the one or more genes of a methionine salvage pathway comprise an mtnP gene associated with an accession number found in Table 4 below:

Table 4. Representative MtnP Genes

The art is familiar with the methods and techniques used to identify other 5’-methylthioadenosine phosphorylase genes and nucleotide sequences.

In some embodiments, the one or more genes of a methionine salvage pathway comprise mtnK. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid). In some embodiments, the one or more genes of a methionine salvage pathway comprise an mtnK gene associated with an accession number found in Table 5 below:

Table 5. Representative MtnK Genes


The art is familiar with the methods and techniques used to identify other 5-methylthioribose kinase genes and nucleotide sequences.

In some embodiments, the one or more genes of a methionine salvage pathway comprise mtnA. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid). In some embodiments, the one or more genes of a methionine salvage pathway comprise an mtnA gene associated with an accession number found in Table 6 below:

Table 6. Representative MtnA Genes

The art is familiar with the methods and techniques used to identify other 5-methylthioribose-1-P isomerase genes and nucleotide sequences.

In some embodiments, the one or more genes of a methionine salvage pathway comprise ald2. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid). In some embodiments, the one or more genes of a methionine salvage pathway comprise an ald2 gene associated with an accession number found in Table 7 below:

Table 7. Representative Ald2 Genes

Source sequence | Protein accession
Blastochloris viridis strain ATCC 19567, complete genome | WP_055038972.1
Blastochloris viridis DNA, complete genome, strain: DSM 133 | WP_055038972.1
Rhodopseudomonas palustris strain R1 NODE_7_length_89266_cov_41.693230, whole genome shotgun sequence | WP_119019680.1
Rhodopseudomonas palustris strain DSM 126 scaffold0020, whole genome shotgun sequence | WP_012497786.1
Rhodopseudomonas palustris strain 2.1.37 scaffold_3, whole genome shotgun sequence | WP_011160187.1
Pleomorphomonas carboxyditropha strain SVCO-16 NODE_13_length_137005_cov_21.4606, whole genome shotgun sequence | WP_100082573.1
Pleomorphomonas sp. CF100 Ga0189743_114, whole genome shotgun sequence | WP_134187437.1
Pleomorphomonas koreensis DSM 23070 H512DRAFT_scaffold00010.10_C, whole genome shotgun sequence | WP_053239475.1
Roseiarcus fermentans strain DSM 24875 Ga0244512_102, whole genome shotgun sequence | WP_113887630.1
Siculibacillus lacustris strain SA-279 scaffold_6, whole genome shotgun sequence | WP_131310260.1
Pelosinus sp. UFO1, complete genome | WP_038670968.1
Dendrosporobacter quercicolus strain DSM 1736, whole genome shotgun sequence | WP_092067972.1
Rhodopseudomonas palustris strain YSC3 chromosome, complete genome | WP_107357124.1
Sporomusaceae bacterium strain FL31 scf_SPFL3102_001, whole genome shotgun sequence | WP_127035514.1
Sporomusaceae bacterium FL31 scf_SPFL3101_011, whole genome shotgun sequence | WP_127035514.1
Propionispora vibrioides strain DSM 13305, whole genome shotgun sequence | WP_091746076.1
Rhodopseudomonas palustris strain PS3 chromosome, complete genome | WP_107346191.1
Propionispora sp. 2/2-37, whole genome shotgun sequence | WP_054261599.1
Clostridium sp. BNL1100, complete genome | WP_014312609.1
Ruminiclostridium josui JCM 17888 K412DRAFT_scf7180000000007_quiver.2_C, whole genome shotgun sequence | WP_024831703.1
Rhodopseudomonas palustris strain ELI 1980 Contig20, whole genome shotgun sequence | WP_119019680.1
Rhodopseudomonas palustris CGA009 complete genome | WP_011160187.1
Rhodopseudomonas palustris TIE-1, complete genome | WP_012497786.1
Clostridium pasteurianum BC1, complete genome | WP_015616819.1

The art is familiar with the methods and techniques used to identify other 5-methylthioribulose-1-P aldolase genes and nucleotide sequences.

Additional Genes

In some embodiments, the nucleic acid may encode one or more genes of a SAM hydrolase. In some embodiments, the one or more genes of a SAM hydrolase may be a non-naturally occurring, or exogenous, gene. In some embodiments, the SAM hydrolase may be derived from a coliphage virus. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

The art is familiar with the methods and techniques used to identify other SAM hydrolase genes and nucleotide sequences.

In some embodiments, the nucleic acid may encode one or more genes of a methanethiol methylase (mddA), a methionine gamma lyase (mgl), or combinations thereof. In some embodiments, the one or more genes of mddA, mgl, or combinations thereof, may be a non-naturally occurring, or exogenous, gene. In some embodiments, the one or more genes of mddA and/or mgl are derived from Rhodopseudomonas palustris. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

The art is familiar with the methods and techniques used to identify other methanethiol methylase and/or methionine gamma lyase genes and nucleotide sequences.

In some embodiments, the nucleic acid may be codon optimized. In some embodiments, the one or more genes may be optionally and independently linked to a control element. In some embodiments, the control element comprises a promoter.

Vectors

In another aspect, vectors are provided comprising one or more exogenous nucleic acid molecules encoding one or more genes of a methylthio-alkane reductase complex and one or more genes of a methionine salvage pathway. Vectors are also provided for use in the methods disclosed herein. For example, one or more of the vectors disclosed herein can be used to transform a microbial organism. Also described are microbial organisms transformed with or comprising one or more of the vectors described herein.

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex may comprise marB, marH, marD, marK, or combinations thereof.

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise marB. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the nucleic acid sequence of SEQ ID NO: 1 (marB).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 1 (marB). In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 1.
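By way of illustration only, an identity threshold of the kind recited above can be checked computationally once two sequences have been aligned. The disclosure does not specify an alignment algorithm or an identity convention, so everything in the following sketch is an assumption: the function names `percent_identity` and `meets_identity_threshold` are hypothetical, the input is assumed to be a pre-computed pairwise alignment of equal length with `-` marking gaps, and gap-containing columns are assumed to count toward the denominator. Other conventions would give different percentages.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over a pre-computed pairwise alignment.

    Assumption: both strings come from the same alignment, have equal
    length, and use '-' for gaps. Columns with a gap in only one
    sequence count as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns in which both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no comparable columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold_percent: float = 85.0) -> bool:
    """True if the aligned pair reaches the given identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold_percent


if __name__ == "__main__":
    # Toy sequences for illustration; these are not SEQ ID NO: 1.
    print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
    print(meets_identity_threshold("ACGTACGTAC", "ACGTACGTAA", 85.0))
```

In practice the alignment itself would come from an established alignment tool, and the identity convention used should be stated alongside any reported percentage.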

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the peptide sequence of SEQ ID NO: 2 (MarB).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the peptide sequence of SEQ ID NO: 2. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein of SEQ ID NO: 2. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise marH. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the nucleic acid sequence of SEQ ID NO: 3 (marH).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 3. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 3.

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise one or more marH genes associated with an accession number found in Table 1.

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the peptide sequence of SEQ ID NO: 4 (MarH).

In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the peptide sequence of SEQ ID NO: 4. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein of SEQ ID NO: 4. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise marD. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the nucleic acid sequence of SEQ ID NO: 5 (marD).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 5. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 5.

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise one or more marD genes associated with an accession number found in Table 2.

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the peptide sequence of SEQ ID NO: 6 (MarD).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the peptide sequence of SEQ ID NO: 6. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein of SEQ ID NO: 6. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise marK. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the nucleic acid sequence of SEQ ID NO: 7 (marK).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 7. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 7.

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise one or more marK genes associated with an accession number found in Table 3.

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the peptide sequence of SEQ ID NO: 8 (MarK).

In some embodiments of the vectors described herein, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the peptide sequence of SEQ ID NO: 8. In some embodiments, the one or more genes of a methylthio-alkane reductase complex comprise a gene encoding a protein of SEQ ID NO: 8. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments of the vectors described herein, the one or more genes of a methionine salvage pathway comprise one or more genes of a dihydroxyacetone phosphate (DHAP) shunt pathway. In some embodiments, the one or more genes of a DHAP shunt pathway comprise 5'-methylthioadenosine phosphorylase (mtnP), 5-methylthioribose kinase (mtnK), 5-methylthioribose-1-phosphate isomerase (mtnA), 5-methylthioribulose-1-phosphate aldolase (ald2), or combinations thereof.

In some embodiments of the vectors described herein, the one or more genes of a methionine salvage pathway comprise mtnP. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid). In some embodiments, the one or more genes of a methionine salvage pathway comprise an mtnP gene associated with an accession number found in Table 4.

In some embodiments of the vectors described herein, the one or more genes of a methionine salvage pathway comprise mtnN. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments of the vectors described herein, the one or more genes of a methionine salvage pathway comprise mtnK. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid). In some embodiments, the one or more genes of a methionine salvage pathway comprise an mtnK gene associated with an accession number found in Table 5.

In some embodiments of the vectors described herein, the one or more genes of a methionine salvage pathway comprise mtnA. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid). In some embodiments, the one or more genes of a methionine salvage pathway comprise an mtnA gene associated with an accession number found in Table 6.

In some embodiments of the vectors described herein, the one or more genes of a methionine salvage pathway comprise ald2. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid). In some embodiments, the one or more genes of a methionine salvage pathway comprise an ald2 gene associated with an accession number found in Table 7.

In some embodiments of the vectors described herein, the exogenous nucleic acid molecules may further encode one or more genes of a SAM hydrolase. In some embodiments, the one or more genes of a SAM hydrolase may be a non-naturally occurring, or exogenous, gene. In some embodiments, the SAM hydrolase may be derived from a coliphage virus. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments of the vectors described herein, the exogenous nucleic acid molecules may encode one or more genes of a methanethiol methylase (mddA), a methionine gamma lyase (mgl), or combinations thereof. In some embodiments, the one or more genes of mddA, mgl, or combinations thereof, may be a non-naturally occurring, or exogenous, gene. In some embodiments, the one or more genes of mddA and/or mgl are derived from Rhodopseudomonas palustris. In some embodiments, the gene is a wildtype version of the gene or encodes a wildtype form of the associated protein. In some embodiments, the gene is a mutant form of the gene or may encode a mutant form of the associated protein (e.g. point mutant, loss of function mutation, missense mutation, deletion, or insertion of heterologous nucleic acid).

In some embodiments, the one or more exogenous nucleic acid molecules are integrated into a gene expression cassette. In some embodiments, the gene expression cassette comprises one or more control elements. In some embodiments, the one or more exogenous nucleic acid molecules disclosed herein are operably linked to a control element. In some embodiments, the control element is a promoter. In some embodiments, the promoter may be constitutively active or inducibly active. In some embodiments, the promoter is constitutively active regardless of sulfate concentration, i.e., sulfate limitation is not required in order to induce expression of the genes found in the one or more exogenous nucleic acid molecules.

In some embodiments, the promoter comprises a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the sequence of SEQ ID NO: 9:

SEQ ID NO: 9

AAACCGCTTTAACCGCCATCCTGCGCTAAACGGCCGCCGGCCCCCACCGGCGGC

CGTTTTTTATTCGCCGCCCCTCCCCGCGACGGGCTCCCTCGCCTTGGTGGCTTTT

CATCCGGGGGGGTGGCGCGCTAAGGTGCCCCACCCGCAAAAGGGTGAGCCAGC

CAGGAAGAGGGGAACAT

In some embodiments, the promoter comprises a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 9. In some embodiments, the promoter comprises a nucleic acid sequence of SEQ ID NO: 9.

In some embodiments, the promoter comprises a nucleic acid sequence having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or more identity to the sequence of SEQ ID NO: 10:

SEQ ID NO: 10

GGGCATGGCGCGGATGATCCGCCCGCTCTCGGGCTCGCCACACGAGGTTTTCCG

GGGTTTTCCGCTCCTTTCGGGGCAGAACACGCCGGATAACAAGGTCCGTCCCGA

CCTGGTCGGGTGGACTTCTTACCGCGGTTCTTCACCGCGGTAGAGCAGCCGTTC

CCTGCGCGGATGCAGTGGAATGGTTTTCTGGGCAAGAATTAGGAGGTAGCACA

T

In some embodiments, the promoter comprises a nucleic acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to the nucleic acid sequence of SEQ ID NO: 10. In some embodiments, the promoter comprises a nucleic acid sequence of SEQ ID NO: 10.

In another aspect, a non-naturally occurring organism is provided comprising a vector described herein.

Methods of Use

In another aspect, methods of producing ethylene, ethane, and/or methane are provided comprising: culturing a population of the non-naturally occurring microbial organism described herein in a culture medium comprising one or more carbon sources; and recovering the ethylene, ethane, and/or methane.

In some embodiments, the methods described herein may be used in the production of ethylene. In some embodiments, the methods described herein may be used in the production of ethane. In some embodiments, the methods described herein may be used in the production of methane.

The term “carbon source” means a carbon source that a microbial organism described herein will metabolize to derive energy (e.g. monosaccharides, oligosaccharides, polysaccharides, alkanes, fatty acids, esters of fatty acids, monoglycerides, acetate, carbon dioxide, methanol, formaldehyde, formate or carbon-containing amines). The term “carbon source” refers to a carbon containing composition (e.g. compound, mixture of compounds) that an organism may metabolize for use by the organism or that may be used for organism viability. A “majority carbon source” refers to a carbon containing composition that accounts for greater than 50% of the available carbon sources for an organism (e.g. in a media, in a growth media, in a defined media for the organism, or in a defined media for producing ethylene, ethane, and/or methane by an organism) at a specified time (e.g. media when starting a culture, media in a bioreactor when growing the organism, or media when producing ethylene, ethane, and/or methane from the organism). In embodiments, an organism may be cultured using a medium comprising a majority carbon source selected from the group consisting of glucose, glycerol, xylose, fructose, mannose, ribose, sucrose, and lignocellulosic biomass. In embodiments, an organism may be cultured using a medium comprising one or more carbon sources selected from the group consisting of glucose, fructose, sucrose, lactose, galactose, xylose, mannose, rhamnose, arabinose, glycerol, acetate, depolymerized sugar beet pulp, black liquor, corn starch, depolymerized cellulosic material, corn stover, sugar beet pulp, switchgrass, milk whey, molasses, potato, rice, sorghum, sugar cane, wheat, and mixtures thereof (e.g. mixtures of glycerol and glucose, mixtures of glucose and xylose, mixtures of fructose and glucose, mixtures of sucrose and depolymerized sugar beet pulp, black liquor, corn starch, depolymerized cellulosic material, corn stover, sugar beet pulp, switchgrass, milk whey, molasses, potato, rice, sorghum, sugar cane, and/or wheat). In some embodiments, an organism is cultured using a medium comprising one or more carbon sources selected from the group consisting of depolymerized sugar beet pulp, black liquor, corn starch, depolymerized cellulosic material, corn stover, sugar beet pulp, switchgrass, milk whey, molasses, potato, rice, sorghum, sugar cane, thick cane juice, sugar beet juice, and wheat. In some embodiments, an organism is cultured using a medium comprising lignocellulosic biomass. In some embodiments, carbon sources may be monosaccharides (e.g., glucose, fructose), disaccharides (e.g., lactose, sucrose), oligosaccharides, polysaccharides (e.g., starch, cellulose or mixtures thereof), sugar alcohols (e.g., glycerol) or mixtures from renewable feedstocks (e.g., cheese whey permeate, cornsteep liquor, sugar beet molasses, or barley malt). Additionally, carbon sources may include alkanes, fatty acids, esters of fatty acids, monoglycerides, diglycerides, triglycerides, phospholipids, various commercial sources of fatty acids including vegetable oils (e.g., soybean oil) or animal fats. In some embodiments, the culture medium may contain, in addition to the primary (or majority) carbon source, one or more secondary carbon sources. In some embodiments, the secondary carbon source comprises lignin or lignin derived aromatic compounds. In some embodiments, the secondary carbon source comprises lignin breakdown products.
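As a non-limiting illustration of the “majority carbon source” definition above (a composition accounting for greater than 50% of the available carbon sources at a specified time), the check can be sketched as follows. The disclosure does not state the basis on which the share is measured, so the function name `majority_carbon_source`, the per-source quantities, and the use of a single common unit (for example, moles of carbon per liter) are assumptions made only for this example.

```python
from typing import Dict, Optional


def majority_carbon_source(carbon_by_source: Dict[str, float]) -> Optional[str]:
    """Return the source supplying more than 50% of available carbon.

    Assumption: every value is expressed in the same unit and measured
    at the same time point. Returns None when no single source exceeds
    50% of the total.
    """
    if any(amount < 0 for amount in carbon_by_source.values()):
        raise ValueError("carbon amounts must be non-negative")
    total = sum(carbon_by_source.values())
    if total <= 0:
        return None
    for name, amount in carbon_by_source.items():
        # Strictly greater than 50%, matching the definition in the text.
        if amount / total > 0.5:
            return name
    return None


if __name__ == "__main__":
    # Hypothetical medium composition, for illustration only.
    medium = {"glucose": 6.0, "glycerol": 3.0, "acetate": 1.0}
    print(majority_carbon_source(medium))
```

Because the threshold is strict, a medium with two sources at exactly equal shares has no majority carbon source under this reading; the remaining sources would then simply be carbon sources, not secondary ones relative to a majority source.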

In some embodiments, the one or more carbon sources may comprise biomass, for example lignocellulosic biomass. The term “biomass” refers to material produced by growth and/or propagation of cells. “Lignocellulosic biomass” is used according to its plain and ordinary meaning and refers to plant dry matter comprising carbohydrate (e.g. cellulose or hemicellulose) and polymer (e.g. lignin). Lignocellulosic biomass may include agricultural residues (e.g. corn stover or sugarcane bagasse), energy crops (e.g. poplar trees, willow, Miscanthus purpureum, Pennisetum purpureum, elephant grass, maize, Sudan grass, millet, white sweet clover, rapeseed, giant miscanthus, switchgrass, jatropha, Miscanthus giganteus, or sugarcane), wood residues (e.g. sawmill or papermill discard), or municipal paper waste.

In some embodiments, the one or more carbon sources may be selected from one or more, alone or in combination, of: carbon dioxide and carbon monoxide, mono- and disaccharide sugars, organic acids (for example, malate, succinate, pyruvate, and fumarate), volatile fatty acids (for example, formate, acetate, propionate, and butyrate), alcohols (for example, ethanol and glycerol), and cellulosic plant biomass including but not limited to corn stover, miscanthus, and switchgrass.

A “growth media” or “growth medium” as used herein can be a solid, powder, or liquid mixture which comprises all or substantially all of the nutrients necessary to support the growth of an organism; various nutrient compositions are preferably prepared when particular species are being assayed. Amino acids, carbohydrates, minerals, vitamins and other elements known to those skilled in the art to be necessary for the growth of microbial organisms are provided in the medium. In one embodiment, the growth medium is liquid. In one embodiment, the growth medium is a production medium (for example, medium optionally containing higher concentrations of glucose and/or altered concentrations of nitrogen).

In some embodiments, the growth media is sufficiently deficient in or absent of sulfate.

In another aspect, a bioreactor is provided comprising a non-naturally occurring organism as described herein. Such bioreactors may be used in the methods described herein.

Embodiments

Further embodiments of the present disclosure are provided as follows:

Embodiment 1: a non-naturally occurring microbial organism comprising a nucleic acid encoding one or more genes of a methylthio-alkane reductase complex and one or more genes of a methionine salvage pathway.

Embodiment 2: the non-naturally occurring microbial organism of embodiment 1, wherein the organism produces ethylene, ethane, methane, or combinations thereof.

Embodiment 3: the non-naturally occurring microbial organism of embodiment 2, wherein the organism produces ethylene.

Embodiment 4: the non-naturally occurring microbial organism of embodiment 2, wherein the organism produces ethane.

Embodiment 5: the non-naturally occurring microbial organism of embodiment 2, wherein the organism produces methane.

Embodiment 6: the non-naturally occurring microbial organism of any one of embodiments 1-5, wherein the one or more genes of a methylthio-alkane reductase complex comprise marB, marH, marD, marK, or combinations thereof.

Embodiment 7: the non-naturally occurring microbial organism of any one of embodiments 1-6, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 1.

Embodiment 8: the non-naturally occurring microbial organism of embodiment 7, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 1.

Embodiment 9: the non-naturally occurring microbial organism of any one of embodiments 1-8, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 3.

Embodiment 10: the non-naturally occurring microbial organism of embodiment 9, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 3.

Embodiment 11: the non-naturally occurring microbial organism of any one of embodiments 1-10, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 5.

Embodiment 12: the non-naturally occurring microbial organism of embodiment 11, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 5.

Embodiment 13: the non-naturally occurring microbial organism of any one of embodiments 1-12, wherein the one or more genes of a methylthio-alkane reductase comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 7.

Embodiment 14: the non-naturally occurring microbial organism of embodiment 13, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 7.

Embodiment 15: the non-naturally occurring organism of any one of embodiments 1-14, wherein the one or more genes of a methionine salvage pathway comprise one or more genes of a dihydroxyacetone phosphate (DHAP) shunt pathway.

Embodiment 16: the non-naturally occurring organism of embodiment 15, wherein the one or more genes of a DHAP shunt pathway comprise 5'-methylthioadenosine phosphorylase (mtnP), methylthioadenosine nucleosidase (mtnN), 5-methylthioribose kinase (mtnK), 5-methylthioribose-1-phosphate isomerase (mtnA), 5-methylthioribulose-1-phosphate aldolase (ald2), or combinations thereof.

Embodiment 17: the non-naturally occurring organism of embodiment 16, wherein the one or more genes of a DHAP shunt pathway comprise mtnP.

Embodiment 18: the non-naturally occurring organism of embodiment 16, wherein the one or more genes of a DHAP shunt pathway comprise mtnN and mtnK.

Embodiment 19: the non-naturally occurring organism of any one of embodiments 16-18, wherein the one or more genes of a DHAP shunt pathway comprise mtnA.

Embodiment 20: the non-naturally occurring organism of any one of embodiments 16-19, wherein the one or more genes of a DHAP shunt pathway comprise ald2.

Embodiment 21: the non-naturally occurring microbial organism of any one of embodiments 1-20, wherein the nucleic acid further encodes one or more genes of a SAM hydrolase.

Embodiment 22: the non-naturally occurring microbial organism of any one of embodiments 1-10, wherein the nucleic acid further encodes one or more genes of a methanethiol methylase (mddA), a methionine gamma lyase (mgl), or combinations thereof.

Embodiment 23: the non-naturally occurring microbial organism of any one of embodiments 1-22, wherein the nucleic acid is codon optimized.

Embodiment 24: the non-naturally occurring microbial organism of any one of embodiments 1-23, wherein the nucleic acid is integrated into the genome of the organism.

Embodiment 25: the non-naturally occurring microbial organism of any one of embodiments 1-23, wherein the nucleic acid is episomally integrated into a plasmid.

Embodiment 26: a non-naturally occurring microbial organism, wherein the organism is an anaerobic organism which produces ethylene, ethane, and/or methane using a methylthio-alkane reductase complex and a methionine salvage pathway, and wherein the organism has been optimized for producing ethylene, ethane, and/or methane with one or more non-naturally occurring genes.

Embodiment 27: the non-naturally occurring microbial organism of embodiment 26, wherein the one or more non-naturally occurring genes comprise one or more genes of a SAM hydrolase.

Embodiment 28: the non-naturally occurring microbial organism of embodiment 26, wherein the one or more non-naturally occurring genes comprise one or more genes of a methanethiol methylase (mddA), a methionine gamma lyase (mgl), or combinations thereof.

Embodiment 29: the non-naturally occurring microbial organism of any one of embodiments 26-28, wherein the one or more non-naturally occurring genes are integrated into the genome of the organism.

Embodiment 30: the non-naturally occurring microbial organism of any one of embodiments 26-28, wherein the one or more non-naturally occurring genes are episomally expressed from a plasmid.

Embodiment 31: the non-naturally occurring microbial organism of any one of embodiments 26-30, wherein the one or more non-naturally occurring genes are codon optimized.

Embodiment 32: a method of producing ethylene, ethane, and/or methane comprising: culturing a population of the non-naturally occurring microbial organism of any one of embodiments 1-31 in a culture medium comprising one or more carbon sources; and recovering the ethylene, ethane, and/or methane.

Embodiment 33: the method of embodiment 32, wherein the one or more carbon sources comprise carbon dioxide, carbon monoxide, an organic acid, a volatile fatty acid, an alcohol, cellulosic plant mass, or combinations thereof.

Embodiment 34: the method of embodiment 32 or 33, wherein the one or more carbon sources comprise carbon dioxide, carbon monoxide, malate, succinate, pyruvate, fumarate, formate, acetate, propionate, butyrate, ethanol, glycerol, corn stover, miscanthus, or switchgrass.

Embodiment 35: the method of any one of embodiments 32-34, wherein the one or more carbon sources comprise corn stover.

Embodiment 36: the method of embodiment 32, wherein the one or more carbon sources comprise lignocellulosic biomass.

Embodiment 37: the method of any one of embodiments 32-36, wherein the population is cultured in the absence of sulfate.

Embodiment 38: a bioreactor comprising the non-naturally occurring microbial organism of any one of embodiments 1-31.

Embodiment 39: a vector comprising: one or more exogenous nucleic acid molecules encoding one or more genes of a methylthio-alkane reductase complex and one or more genes of a methionine salvage pathway.

Embodiment 40: the vector of embodiment 39, wherein the one or more genes of a methylthio-alkane reductase complex comprise marB, marH, marD, marK, or combinations thereof.

Embodiment 41: the vector of embodiment 39 or embodiment 40, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 1.

Embodiment 42: the vector of embodiment 41, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 1.

Embodiment 43: the vector of any one of embodiments 39-42, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 3.

Embodiment 44: the vector of embodiment 43, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 3.

Embodiment 45: the vector of any one of embodiments 39-44, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 5.

Embodiment 46: the vector of embodiment 43, wherein the one or more genes of a methylthio-alkane reductase complex comprise a nucleic acid sequence of SEQ ID NO: 5.

Embodiment 47: the vector of any one of embodiments 39-46, wherein the one or more genes of a methylthio-alkane reductase comprise a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 7.

Embodiment 48: the vector of embodiment 47, wherein the one or more genes of a methylthio-alkane reductase comprise a nucleic acid sequence of SEQ ID NO: 7.

Embodiment 49: the vector of any one of embodiments 39-48, wherein the one or more genes of a methionine salvage pathway comprise one or more genes of a dihydroxyacetone phosphate (DHAP) shunt pathway.

Embodiment 50: the vector of embodiment 49, wherein the one or more genes of a DHAP shunt pathway comprise 5'-methylthioadenosine phosphorylase (mtnP), 5-methylthioribose kinase (mtnK), 5-methylthioribose-1-phosphate isomerase (mtnA), 5-methylthioribulose-1-phosphate aldolase (ald2), alcohol dehydrogenase (adh), or combinations thereof.

Embodiment 51: the vector of embodiment 50, wherein the one or more genes of a DHAP shunt pathway comprise mtnP.

Embodiment 52: the vector of embodiment 50, wherein the one or more genes of a DHAP shunt pathway comprise mtnN and mtnK.

Embodiment 53: the vector of any one of embodiments 50-52, wherein the one or more genes of a DHAP shunt pathway comprise mtnA.

Embodiment 54: the vector of any one of embodiments 50-53, wherein the one or more genes of a DHAP shunt pathway comprise ald2.

Embodiment 55: the vector of any one of embodiments 39-54, wherein the one or more exogenous nucleic acid molecules further encode one or more genes of a SAM hydrolase.

Embodiment 56: the vector of any one of embodiments 39-55, wherein the one or more exogenous nucleic acid molecules further encode one or more genes of a methanethiol methylase (mddA), a methionine gamma lyase (mgl), or combinations thereof.

Embodiment 57: the vector of any one of embodiments 39-56, wherein the one or more genes are integrated into a gene expression cassette.

Embodiment 58: the vector of embodiment 57, wherein the gene expression cassette comprises a promoter.

Embodiment 59: the vector of embodiment 58, wherein the promoter comprises a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 9.

Embodiment 60: the vector of embodiment 59, wherein the promoter comprises a nucleic acid sequence of SEQ ID NO: 9.

Embodiment 61: the vector of embodiment 58, wherein the promoter comprises a nucleic acid sequence having at least 85% identity to the nucleic acid sequence of SEQ ID NO: 10.

Embodiment 62: the vector of embodiment 61, wherein the promoter comprises a nucleic acid sequence of SEQ ID NO: 10.

Embodiment 63: the vector of any one of embodiments 39-62, wherein the one or more genes have been codon optimized.

Embodiment 64: a non-naturally occurring organism comprising a vector of any one of embodiments 39-63.
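By way of non-limiting illustration, the "at least 85% identity" criterion recited in several of the embodiments above can be checked with a short sketch such as the following. It assumes that the two sequences have already been aligned by a separate tool (equal length, with `-` marking gaps) and that identity is counted over every column that is not a gap in both sequences. Other conventions for percent identity exist, and the disclosure does not prescribe one; the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the columns of a pairwise alignment.

    Assumption (not from the disclosure): inputs are pre-aligned, have equal
    length, use '-' for gaps, and columns that are gaps in both are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # no residue in either sequence: not a comparable column
        columns += 1
        if a == b:
            matches += 1  # a gap opposite a residue never counts as a match
    if columns == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / columns


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 85.0) -> bool:
    """True when the aligned pair reaches the given percent identity."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, a ten-column alignment with one mismatch gives 90% identity and would satisfy an 85% threshold, whereas a four-column alignment with one mismatch (75%) would not.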

A number of embodiments of the disclosure have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. Accordingly, other embodiments are within the scope of the following claims.

By way of non-limiting illustration, examples of certain embodiments of the present disclosure are given below.

EXAMPLES

The following examples are set forth below to illustrate the compositions, methods, and results according to the disclosed subject matter. These examples are not intended to be inclusive of all aspects of the subject matter disclosed herein, but rather to illustrate representative methods and results. These examples are not intended to exclude equivalents and variations of the present invention which are apparent to one skilled in the art.

Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in °C or is at ambient temperature, and pressure is at or near atmospheric.

Example 1. A nitrogenase-like enzyme system catalyzes methionine, ethylene, ethane, and methane biogenesis.

R. rubrum was grown under conditions for ethylene induction (50 μM limiting sulfate or 1 mM MT-EtOH as sole S-source) and ethylene repression (1 mM sulfate) (FIGs. 5A-5C) (10). Proteomics differential abundance analysis identified multiple proteins that increased over 20-fold in proteomes from induced versus repressed cells (FIG. 1B). Among these were enzymes involved in cysteine and methionine metabolism: homoserine/serine O-acetyltransferase (CysE), O-acetyl-L-homoserine sulfhydrylase, cystathionine beta-synthase, and cystathionine gamma-lyase (FIGs. 1A-1C, reactions 2, 3, 6, 7, respectively).

Several proteins previously identified as NFL sequences of unknown function (8, 9) showed some of the highest increases in abundance under ethylene-inducing conditions (FIG. 1B, Rru_A0772-Rru_A0773 and Rru_A0793-Rru_A0796; see FIG. 6 for gene organization). In addition, there was a large increase in abundance of proteins likely involved in iron-sulfur cluster metabolism: NifS cysteine desulfurase and a putative Fe4-S4 scaffold protein (FIG. 1B, Rru_A1068-Rru_A1069). This appears analogous to the Azotobacter vinelandii NifUS system for synthesis of nitrogenase-destined iron-sulfur clusters from cysteine (12). However, the precise iron-sulfur cluster assembly pathway in R. rubrum is unknown. The involvement of the nitrogenase-like system in ethylene production was further bolstered by the R. rubrum transposon mutant strain WRdht-66B3, possessing an inactivated gene encoding a putative nitrogenase reductase-like iron protein (Rru_A0795; FIG. 1B). This and other mutants identified in a random mutagenesis screen were unable to grow anaerobically in the presence of MT-EtOH as sole S-source but could still grow utilizing sulfate, indicating a defect in the ethylene-producing pathway (FIGs. 7A-7D). Consistent with the Tn5 mutagenesis results, specific deletion of NFL gene cluster Rru_A0793-Rru_A0796 rendered R. rubrum incapable of growth or production of ethylene above basal levels with MT-EtOH as sole S-source (FIGs. 2A-B and FIG. 8C). This result confirmed that the putative nitrogenase-like system encoded by NFL gene cluster Rru_A0793-Rru_A0796 was essential for assimilating sulfur from MT-EtOH to produce ethylene and methionine.

Other biologically relevant volatile organic sulfur compounds (VOSCs) were then tested for utilization by this putative nitrogenase-like enzyme system (FIG. 2A-B and FIG. 9A). In addition to MT-EtOH, VOSC utilization with concomitant hydrocarbon production was specific to dimethyl sulfide (DMS), the most abundant environmental VOSC, and ethyl methyl sulfide (EMS) (FIG. 2A-B). Analogous to MT-EtOH (10), use of DMS or EMS resulted in methane or ethane production, respectively, in a 1 to 1 stoichiometry (FIG. 3A-B). Specific deletion of the other two NFL genes, Rru_A0772-Rru_A0773, did not affect growth or hydrocarbon production (FIG. 2A-B and FIG. 8B). Thus, we designate R. rubrum genes Rru_A0793-Rru_A0796, previously identified as NFL genes nflBHDK of unknown function (8, 9), as methylthio-alkane reductase genes, marBHDK. This is based on corresponding amino acid similarity to R. rubrum molybdenum nitrogenase gene products NifB (synthesis of the NifB-cofactor precursor to the nitrogenase catalytic cofactor), NifH (nitrogenase-reductase iron protein), NifD (nitrogenase catalytic subunit α), and NifK (nitrogenase catalytic subunit β) (FIG. 10 - FIG. 13). NFL genes Rru_A0772-Rru_A0773 remain designated nflDK genes of unknown function (8, 9).

When all R. rubrum NFL genes were deleted (strain A0772:3/A0793:6) and specific gene combinations were re-introduced via expression from a plasmid, expression of marBHDK was necessary and sufficient to restore growth and hydrocarbon metabolism from VOSCs (FIG. 2B-C and FIG. 9B-C). The NFL genes of unknown function, nflDK, could not replace marDK in complementing for growth. Upon feeding cells expressing marBH and nflDK with VOSCs, ethylene and ethane production was poorly catalyzed at 3- to 4-fold above basal levels and no methane enhancement was observed (FIG. 2B-C and FIG. 9B-C). This revealed that R. rubrum NflDK could only weakly catalyze methylthio-alkane reduction, indicating a different primary function. Given that nflDK is expressed not just in the presence of MT-EtOH but also in response to general sulfate limitation (FIG. 1B-C), NflDK may catalyze sulfur liberation from alternate, albeit unknown, compounds. Alternately, given gene proximity and amino acid similarity (40%) to MarDK, NflDK may serve as accessory proteins for MarDK assembly analogous to NifEN (14). NifEN arose evolutionarily by gene duplication of NifDK and shares considerable sequence homology (~40%) with NifDK, including P-cluster and FeMo-cofactor coordination sites (8, 9, 12). While NifEN does not have nitrogenase and hydrogen formation activity, it still retains acetylene and azide reduction capabilities (66). The R. rubrum NflDK group IV nitrogenase-like proteins of unknown function (Rru_A0772-Rru_A0773 gene products) share 40% sequence identity with MarDK and are evolutionarily closer to MarDK than NifDK (FIG. 4). Coordinately, the nflDK genes are located near marBHDK analogous to the association of nifEN with nifBHDK (8, 9). However, unlike NifEN, NflDK is entirely dispensable, and homologous nflDK sequences are not observed to be present and associated with marBHDK gene clusters in several other organisms (FIG. 18).

These results demonstrated the requirement of the MarBHDK nitrogenase-like system for the anaerobic assimilation of sulfur from common environmental VOSCs such as DMS and MT-EtOH in order to support growth and methionine metabolism. Moreover, these observations revealed a previously unknown mechanism for the bacterial production of methane and ethylene.

Methylthio-alkane reductase releases methanethiol from VOSCs for methionine biosynthesis

The link between VOSC utilization and methionine synthesis via the marBHDK gene products was characterized by feeding experiments with (2-[methyl-C14]thio)ethanol. This enabled detection of the methylthio- moiety of MT-EtOH. Upon feeding the wild type strain, MT-EtOH was consumed. Labeled methanethiol (C14H3-SH) and methionine (methyl-C14) were concomitantly produced and observed at low levels (~2% of MT-EtOH concentration) until MT-EtOH was depleted (FIG. 2D). These low levels, like those previously observed for methanethiol metabolism from 5’-methylthioadenosine in R. rubrum (12), are likely due to the flux of methanethiol to methionine and subsequent utilization thereof for protein synthesis and SAM-dependent processes (11). This is substantiated by C14 incorporation from MT-EtOH into insoluble cell material (FIGs. 14A-14B). Conversely, in the marBHDK deletion strain there was no detectable metabolism of MT-EtOH, and hence, no methanethiol or methionine produced (FIG. 2E and FIGs. 14A-14B). Given that ethylene, ethane, and methane are produced from MT-EtOH, EMS, and DMS, respectively, the observed methanethiol is consistent with C-S single bond reduction and methylthio- release from these substrates by the methylthio-alkane reductase (FIG. 1A, reaction 1 and FIG. 2F). Each process is thermodynamically favored for the substrates and products observed (FIG. 2F and FIGs. 15A-15B). The methanethiol along with O-acetyl-homoserine then serve as substrates for O-acetylhomoserine sulfhydrylase, which catalyzes the synthesis of methionine (FIG. 1A, reaction 3) (13). This defines an anaerobic methylthio-alkane reductase methionine synthesis pathway and establishes the role of a nitrogenase-like enzyme system in sulfur metabolism (FIG. 1A).

Native expression of methylthio-alkane reductase is regulated by sulfur response SalR

Sulfur metabolism evidently is the primary function of these nitrogenase-like methylthio-alkane reductases, as opposed to nitrogen fixation by nitrogenase. R. rubrum possesses molybdenum nitrogenase (NifHDK), which is the default nitrogenase, and iron-only nitrogenase (AnfHDGK), which is synthesized in the absence of molybdenum (9). In in vivo activity assays, the R. rubrum molybdenum nitrogenase could not perform methylthio-alkane reduction, even under maximally inducing conditions, and vice versa (FIG. 3D; glutamate as N-source and 50 μM sulfate). Indeed, nitrogenase and methylthio-alkane reductase activities were independent, separately regulated, and both systems could be expressed simultaneously (FIG. 3D). R. rubrum nitrogenase gene expression (nifHDK) is regulated by the transcriptional regulator NifA in response to nitrogen availability (14). Methylthio-alkane reductase activity in the presence of 1 mM MT-EtOH or DMS was regulated by sulfate availability, with an EC50 of ~150 μM sulfate for 50% repression of activity (FIG. 3C). Our random mutagenesis screen identified the specific regulatory gene in the vicinity of marBHDK (Rru_A0785; FIG. 1B, FIG. 6, and FIGs. 7A-7D). We designate this LysR family regulator as SalR (sulfur salvage regulator). Inactivation of salR rendered strains incapable of growth or hydrocarbon production utilizing MT-EtOH, DMS and EMS as sole S-source (FIG. 2A-B and FIG. 8E; strain 0785::Tn5). Transcriptomics and differential expression analysis of the parent (WRdht) and salR deletion strain (0785::Tn5) growing under marBHDK inducing and repressing conditions revealed that marBHDK and the rest of the methylthio-alkane reductase methionine synthesis pathway are under transcriptional control of SalR (FIG. 1C). Thus, when sufficient sulfate is available (> 150 μM), expression appears repressed, but when sulfate becomes limiting, marBHDK and O-acetylhomoserine sulfhydrylase gene transcription is specifically upregulated via SalR to utilize VOSCs for methionine metabolism (FIG. 1A; reactions 1 and 3). Therefore, as shown in FIG. 2B, expression of marBHDK from a non-natural gene promoter DNA sequence enables synthesis of MarBHDK and concomitant ethylene/ethane/methane production without the native regulation imposed by sulfate-sensitive SalR.

Organisms with methylthio-alkane reductase are widespread in nature including industrially relevant acetogenic and lignocellulosic Clostridia

The nitrogenase superfamily is composed of the bona fide nitrogenase sequences (groups I-III) and nitrogen fixation-like sequences (NFL; groups IV-VI) (FIG. 4) (9). Phylogenetic analysis places methylthio-alkane reductase homologues in their own clade within group IV, which we denote as group IV-C (FIG. 4 and FIG. 16). In contrast, the R. rubrum NflD protein resides in a separate clade with other NflD sequences of unknown function (FIG. 4), consistent with the poor methylthio-alkane reductase activity exhibited by NflDK (FIG. 2B). Bacteria possessing MarBHDK sequence homologs of this previously uncharacterized group IV-C clade include members of the Fibrobacter and Bacteroidetes phyla, Rhodospirillales and Rhizobiales within the Proteobacteria phylum, and Selenomonadales and Clostridium species within the Firmicutes phylum (FIG. 17). To verify the phylogeny results for the Proteobacteria, Rhodopseudomonas palustris and Blastochloris viridis, which possess group IV-C marBHDK homologues, were tested. Also tested was the closely related species Rhodobacter capsulatus, which possesses nitrogenase and nflBHDK but no marBHDK (FIG. 4, FIG. 16, and FIG. 18; Rp, Bv, Rc). Both R. palustris and B. viridis were able to grow with MT-EtOH, EMS, or DMS as sole sulfur source and correspondingly produced ethylene, ethane, or methane (FIG. 2A and FIGs. 19A-19C), demonstrating that methylthio-alkane reductase homologues from these organisms catalyze the same process. Conversely, R. capsulatus could not utilize any of these VOSCs as sole sulfur source for growth (FIG. 2A and FIGs. 19A-19C), like R. rubrum expressing NflDK but not MarDK (FIGs. 2B-C), indicating that group IV NFL proteins of unknown function catalyze processes distinct from methylthio-alkane reductase.

Amino acid sequence comparison of nitrogenase and methylthio-alkane reductase proteins indicates a distinct function for each group

Nitrogenase functions via a coordinated transfer of electrons through a network of highly modified iron and sulfur metal clusters. The minimal molybdenum nitrogenase system requires gene products NifBHDKEN; the vanadium (Vnf) and iron (Anf) nitrogenases have similar requirements (8, 9). The NifH homodimer possesses a single Fe4-S4 cluster at the homodimer interface. The NifDK heterotetramer contains Fe8-S7 P-clusters coordinated at each of the two NifDK subunit interfaces, and each NifD subunit contains the characteristic catalytic FeMo-cofactor [Fe7-S9-C-Mo-homocitrate] (12). In the Vnf and Anf nitrogenase systems Mo is replaced with V or Fe, respectively. Initially, electrons are donated to the NifH Fe4-S4 cluster from a reducing agent such as a ferredoxin or flavodoxin (61). When NifH is in complex with NifDK, these electrons are transferred in an ATP binding and hydrolysis dependent manner to the P-cluster of NifDK. NifH also has roles in P-cluster assembly from two Fe4-S4 clusters on the apo-NifDK heterotetramer and synthesis of FeMo-cofactor when in complex with NifDK-like FeMo-cofactor assembly proteins, NifEN (12). P-cluster electrons are then passed to the FeMo-cofactor catalytic cluster and ultimately to FeMo-cofactor-bound dinitrogen for stepwise reduction to ammonia (17, 62).

MarH: MarH contains the same NifH conserved residues for MgATP hydrolysis and Fe4-S4 cluster coordination that enable transfer of electrons from the NifH Fe4-S4 cluster to the NifDK P-cluster (FIG. 12). The NifH conserved Arg-100 (A. vinelandii numbering) is also conserved in MarH. This residue is modifiable by ADP-ribosylation to prevent NifH from complexing with NifDK. As nitrogenase activity is an ATP-intensive process, this post-translational modification effectively inactivates nitrogenase to prevent unnecessary ATP consumption when energy supply is insufficient or diazotrophy is not required (e.g. ammonium available as N-source). For R. rubrum nitrogenase, ADP-ribosylation is catalyzed by dinitrogenase reductase ADP-ribosyltransferase (DRAT) and removed by dinitrogenase reductase activating glycohydrolase (DRAG). An analogous system appears to exist in A. vinelandii (63).

MarDK: MarD and MarK each possess the triad of cysteines conserved in the molybdenum nitrogenase subunits NifD and NifK for P-cluster coordination (FIG. 10 and FIG. 11). One or more of these conserved cysteines are absent in the bacteriochlorophyll oxidoreductase (ChlLNB and BchXYZ) and reductive cyclase F430 synthesis (CfbCD) systems, which complex a catalytic Fe4-S4 cluster instead (64, 65). MarD also has a conserved cysteine for coordinating a catalytic metallocofactor as in NifD for the FeMo-cofactor (Cys-275 in A. vinelandii). In contrast, however, the conserved NifD His-442 residue (A. vinelandii numbering) responsible for coordinating FeMo-cofactor homocitrate and molybdenum is replaced with a Gly-Asp-Glu motif in MarD, and there are no homocitrate synthase genes associated with marBHDK gene clusters (FIG. 10) (9, 15, 16). In addition, the conserved NifD Glu-191 and His-195 residues involved in coordinating nitrogen intermediates bound to the FeMo-cofactor are replaced in MarD with aromatic residues Trp and Phe (9, 17).

MarB: NifB is a radical SAM enzyme responsible for carbide insertion and formation of the 8Fe-9S-C NifB-cofactor, the precursor to FeMo-cofactor (12). MarB possesses all of the identified motifs conserved across NifB enzymes associated with bona fide nitrogenases (FIG. 13). For nitrogenase, NifB-cofactor maturation to FeMo-cofactor requires NifH and NifEN for addition of molybdenum and homocitrate (12).

Together, this indicates that methylthio-alkane reductase proceeds via a mechanism similar to, but distinct from, that of nitrogenase to convert MT-EtOH to ethylene, ethyl methyl sulfide to ethane, and dimethyl sulfide to methane (17). Methane release from DMS by the methylthio-alkane reductases is separate and distinct from the other known non-archaeal methanogenic processes, including photosynthesis-linked methane production by cyanobacteria (18), methane release from methylphosphonates by marine bacteria (19), and direct reduction of carbon dioxide to methane by iron-only nitrogenase (AnfDHGK) (20). In waterlogged soils, strictly anaerobic microbial processes produce ethylene that can accumulate to levels inhibitory to plant root growth, causing crop damage (21, 22). Early attempts at identifying ethylene-producing organisms surprisingly isolated oxygen-dependent soil bacteria and fungi (23, 24). The organisms and methylthio-alkane reductases identified here function anaerobically and could contribute to this soil-ethylene paradox (10). This anaerobic ethylene process is distinct from the oxygen-dependent reactions catalyzed by aminocyclopropanecarboxylate oxidase and 2-oxoglutarate dioxygenase in plants, fungi, and certain bacteria.

Non-natural pathways for optimized microbial ethylene and methane production

The ethylene precursor, 5’-methylthioadenosine (MTA), is a routine byproduct of highly regulated processes such as quorum sensing and polyamine production, making the native production of MTA for subsequent ethylene production rate limiting. The coliphage SAM hydrolase (MTA-forming) is a viral enzyme that directly converts SAM to MTA (FIG. 20D) (69, 70). When this non-naturally occurring gene element is synthesized in Rhodospirillum rubrum and Rhodopseudomonas palustris for ethylene biogas production via the DHAP shunt MarBHDK system (FIG. 20C), ethylene production is enhanced 20-50 fold above the native amount produced by the organism in the absence of SAM hydrolase (FIG. 20D).

The methane precursor, dimethylsulfide, is the most abundant organic sulfur compound in the environment. It is produced by marine bacteria from dimethylsulfoniopropionate and by terrestrial bacteria from methanethiol (71, 72). A non-natural methionine salvage pathway from Rhodopseudomonas palustris for the conversion of methionine to dimethylsulfide is constructed using methionine gamma lyase (mgl) and methanethiol methyltransferase (mddA) (FIG. 20B) (72). This directly converts methionine to dimethylsulfide for methane production by methylthio-alkane reductase (MarBHDK) (FIG. 20C) in photosynthetic bacteria (e.g. Rhodospirillum rubrum) or lignocellulose degrading bacteria (e.g. Clostridium cellulolyticum).

Materials and Methods

Fine chemicals: Dimethyl sulfide, methanethiol, L-methionine, 5’-methylthioadenosine, and S-methyl-L-cysteine were from Sigma; ethyl methyl sulfide, (2-methylthio)ethanol, (2-methylthio)acetate, and (3-methylthio)propanol were from Alfa Aesar. All media components were of ultrapure grade from Sigma or J.T. Baker. For targeted metabolite detection, (2-[methyl-C14]thio)ethanol was synthesized from [methyl-C14]-S-adenosylmethionine (Perkin Elmer). Labeled S-adenosylmethionine was acid hydrolyzed in 0.01 N H2SO4 under reflux at 100 °C for 30 min to form [methyl-C14]-5’-methylthioadenosine. (2-[methyl-C14]thio)ethanol was subsequently formed enzymatically in a reaction containing 50 mM potassium phosphate pH 7.8, 5 mM MgCl2, 0.2 mM NADH, 60 μM substrate, and 2 μM each of purified R. rubrum 5’-methylthioadenosine phosphorylase (10), Bacillus subtilis 5-methylthioribose-1-phosphate isomerase (29), E. coli 5-methylthioribulose-1-phosphate aldolase (25), and S. cerevisiae alcohol dehydrogenase (Sigma) at 30 °C for 2 h. Enzymes were synthesized and purified as previously described (10). Complete conversion was monitored by reverse phase HPLC with an inline scintillation detector as previously described (10), followed by enzyme removal via an Amicon (Millipore) centrifugal concentration device.

Bacterial strains and growth conditions: R. rubrum ATCC 11170 wild type strain (SmR; NC_007643.1; American Type Culture Collection), Rru_A1998 deletion strain WR (ΔrlpA::GmR) in which the MTA-isoprenoid shunt is inactivated, and Rru_A1998/Rru_A0359 deletion strain WRdht (ΔrlpA::GmR/Δald2) in which the MTA-isoprenoid and DHAP shunts are inactivated were as previously described (10, 30). Rhodobacter capsulatus SB1003 (NC_014034.1, American Type Culture Collection) (31), Rhodopseudomonas palustris CGA010 (32), and Blastochloris viridis DSM133 (NZ_AP014854.2, University of Leibnitz DSMZ) (33) wild type strains were also as previously described. Rhodopseudomonas palustris CGA010 (Caroline Harwood, University of Washington) is a derivative of CGA009 (SmR; NC_005296.1, American Type Culture Collection) in which a frame shift mutation is corrected. Anaerobic growth of R. rubrum and R. capsulatus was performed in static anaerobic culture tubes and serum bottles at 30 °C with 2000 lux incandescent illumination. Cultures were composed of sulfur-free Ormerod’s malate (30 mM) minimal medium supplemented with the indicated sulfur source under a 95:5 mixture of N2:H2 gaseous headspace as previously described (34, 35). Anaerobic growth of R. palustris was similarly performed by replacing malate with 0.5% (v/v) ethanol and 0.2% (w/v) sodium bicarbonate and adding 2 μg/ml para-aminobenzoic acid. All anaerobic manipulations were performed using an anaerobic chamber under 5% hydrogen and 95% nitrogen (Coy Laboratories).

Anaerobic growth of B. viridis was performed in anaerobic culture tubes continuously rotated on a rotisserie at 30 °C with 2000 lux incandescent illumination. Cultures were composed of a modified sulfur-free succinate medium 27 (N medium) (36) supplemented with the indicated sulfur source under an N2 gaseous headspace. Briefly, sulfur-free succinate medium 27 contained (per liter water) 0.3 g yeast extract, 1.0 g Na2-succinate, 0.5 g ammonium acetate, 5 mg Fe(III) citrate, 0.5 g KH2PO4, 0.33 g MgCl2·6H2O, 0.4 g NaCl, 0.4 g NH4Cl, 0.05 g CaCl2·2H2O, 0.4 ml of 0.1 g/L vitamin B12 solution, 0.5 ml of 1.0 g/L resazurin solution, and 1.0 ml of trace element solution [(per liter water) 0.075 g Zn-acetate, 0.03 g MnCl2·4H2O, 0.3 g H3BO3, 0.20 g CoCl2·6H2O, 0.01 g CuCl2·2H2O, 0.02 g NiCl2·6H2O, 0.03 g Na2MoO4·2H2O] at pH 6.8. Medium was brought to a boil, dispensed and sealed in anaerobic culture tubes, sparged with N2 until anaerobic, autoclaved, cooled, supplemented with the appropriate sulfur source, and reduced with Tris-buffered titanium citrate pH 8.0 (1 mM final concentration) before inoculating.
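By way of non-limiting illustration, the per-liter recipe for the sulfur-free succinate medium 27 given above can be scaled to other batch volumes with a short sketch such as the following. The quantities are those stated in the text; the salt names MgCl2·6H2O, NH4Cl, and CaCl2·2H2O are a reading of the source text, and the dictionary layout and function name are hypothetical conveniences rather than part of the disclosed method.

```python
# Per-liter amounts of the sulfur-free succinate medium 27 as stated in the text.
SOLIDS_G_PER_L = {
    "yeast extract": 0.3,
    "Na2-succinate": 1.0,
    "ammonium acetate": 0.5,
    "Fe(III) citrate": 0.005,   # 5 mg
    "KH2PO4": 0.5,
    "MgCl2·6H2O": 0.33,
    "NaCl": 0.4,
    "NH4Cl": 0.4,
    "CaCl2·2H2O": 0.05,
}
STOCKS_ML_PER_L = {
    "vitamin B12 solution (0.1 g/L)": 0.4,
    "resazurin solution (1.0 g/L)": 0.5,
    "trace element solution": 1.0,
}


def scale_recipe(volume_l: float) -> dict:
    """Return component amounts for a batch of the given volume in liters."""
    if volume_l <= 0:
        raise ValueError("volume must be positive")
    return {
        "solids_g": {k: v * volume_l for k, v in SOLIDS_G_PER_L.items()},
        "stocks_ml": {k: v * volume_l for k, v in STOCKS_ML_PER_L.items()},
    }


if __name__ == "__main__":
    batch = scale_recipe(0.5)  # a hypothetical 0.5 L batch
    for name, grams in batch["solids_g"].items():
        print(f"{name}: {grams:.4g} g")
    for name, ml in batch["stocks_ml"].items():
        print(f"{name}: {ml:.4g} ml")
```

The sulfur source and the titanium citrate reductant are deliberately left out of the table because the text adds them after autoclaving.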

Proteomics analysis: To optimize ethylene induction and, by inference, induction of the remaining steps of the pathway metabolizing MT-EtOH to methionine, the growth of R. rubrum strain WR (ΔrlpA::GmR) was measured spectrophotometrically by optical density at 660 nm (O.D. 660nm) and the specific rate of ethylene production (μmol/h/g dry cell weight) was independently measured by gas chromatography (see GC analysis below) at regular intervals for a given sulfate or MT-EtOH concentration (FIGs. 5A-5C). Cells were grown anaerobically, photoheterotrophically in anaerobic culture tubes containing 20 ml of sulfur-free malate minimal medium supplemented with 25, 50, 100, or 1000 μM ammonium sulfate or 200-1000 μM MT-EtOH. For limiting sulfate, maximum ethylene specific rate was observed under 50 μM sulfate at an O.D. 660nm of 0.6-0.75. For 200-1000 μM MT-EtOH, maximum ethylene specific rate was also observed in the same O.D. 660nm range. Subsequently, R. rubrum strain WR was grown in triplicate (biological replicates) anaerobically, photoheterotrophically in rectangular flasks containing 0.5 L sulfur-free malate minimal medium supplemented with 50 μM or 1000 μM ammonium sulfate or 1000 μM MT-EtOH to an O.D. 660nm of ~0.60. Cultures were harvested anaerobically by centrifugation at 3000 x g for 5 min and remaining media was thoroughly removed by decanting. Cell pellets were aliquoted in 0.4 - 0.6 g fractions and flash frozen in liquid N2.
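By way of non-limiting illustration, a specific production rate expressed as an amount per hour per gram dry cell weight can be computed from two successive headspace measurements as sketched below. The function name and arguments are hypothetical, the amount unit is whatever unit the measurements use, and the sketch assumes the dry cell weight is approximately constant over the sampling interval; the disclosure does not state how the rate was derived from the GC data.

```python
def specific_rate(amount_start: float, amount_end: float,
                  hours: float, dry_cell_weight_g: float) -> float:
    """Amount produced per hour per gram dry cell weight.

    Assumption: amounts are total headspace ethylene at the start and end of
    the interval, in the same unit, and biomass is constant over the interval.
    """
    if hours <= 0:
        raise ValueError("interval must be positive")
    if dry_cell_weight_g <= 0:
        raise ValueError("dry cell weight must be positive")
    return (amount_end - amount_start) / hours / dry_cell_weight_g
```

For instance, an increase from 2.0 to 6.0 amount units over 2 h in a culture containing 0.5 g dry cell weight corresponds to a specific rate of 4.0 units/h/g.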

Each cell pellet was lysed by 4% sodium deoxycholate in 100 mM ammonium bicarbonate with the application of sonication (20% amplitude, 10 s pulse, 10 s rest, 2 min total pulse time). Crude protein extract was precleared via centrifugation, reduced with 10 mM dithiothreitol, alkylated with 30 mM iodoacetamide, and then collected on top of a 10 kDa cutoff spin column filter (VIVASPIN 500, Sartorius). Collected proteins were digested to peptides with two sequential aliquots of sequencing-grade trypsin (Sigma) at a 1:75 enzyme:protein ratio (w/w), initially overnight at room temperature followed by an additional 3 h at room temperature. Peptides were collected by centrifugation and acidified to 1% formic acid followed by extraction with ethyl acetate to remove sodium deoxycholate. The peptide-containing aqueous phase was recovered and concentrated. Concentrated peptides were measured using the bicinchoninic acid assay (Pierce).

Each peptide mixture was analyzed on a two-dimensional liquid chromatography tandem mass spectrometry (2D-LC-MS/MS) platform using a Q Exactive Plus (QE+) mass spectrometer (Thermo Fisher Scientific) equipped with an Ultimate 3000 RS system (Thermo Fisher Scientific). 9 μg of each peptide sample was loaded via autosampler onto a triphasic pre-column (5 cm C18 reversed phase (RP), 5 cm strong cation exchange, and 5 cm C18 RP). Bound peptides were then washed and separated over three successive salt cuts of ammonium acetate (35 mM, 50 mM and 500 mM), each followed by an RP-LC elution via an in-house pulled nano-electrospray emitter (75 μm ID) packed with 30 cm of C18 RP. Mass spectra were acquired on QE+ in a data-dependent mode with full scan at 70K resolution, followed by HCD fragmentation of the top 15 most abundant ions at 15K resolution.

Acquired MS/MS spectra were matched with theoretical tryptic peptides generated from a concatenated Rhodospirillum rubrum proteome FASTA database with contaminants and decoy sequences using MyriMatch v. 2.2 (37). Peptide spectral matches were filtered to achieve peptide false-discovery rates (FDR) < 1% and assembled to their respective proteins using IDPicker v. 3.1 (38). Peptide abundance intensities were derived in IDPicker by extracting precursor intensities from chromatograms with lower and upper retention time of 90 s and mass tolerance of 5 ppm. Protein abundances were calculated by summing the intensities of all identified peptides and normalizing by the respective protein length. Protein intensities were further log2 transformed and median centered using InfernoRDN version 1.1 (39), to approximate a normal distribution and reduce technical variance for further pairwise comparison. Student’s t-test was then performed for every pair of conditions using the Perseus platform (40) for two different thresholds (Benjamini-Hochberg FDR adjusted p-value < 0.05 and fold change > 2, or Benjamini-Hochberg FDR adjusted p-value < 0.01 and fold change > 4; two-sided).

Transcriptomics analysis: R. rubrum strain WRdht (ΔrlpA/Δald2) and 0785::Tn5 (ΔrlpA/Δald2/0785::Tn5) were grown in triplicate (biological replicates) photoheterotrophically in anaerobic culture tubes containing 20 ml sulfur-free malate minimal medium supplemented with 50 μM (“Lo”) or 1000 μM (“Hi”) sulfate. When cells reached an O.D. 660nm of 0.65-0.8, cells were harvested and stabilized by RNAprotect reagent (Qiagen). RNA was isolated using the RNeasy protect kit (Qiagen) and quantified by UV absorbance. RNA-seq library construction and sequencing were performed at The Genomics and Microarray Shared Resource at University of Colorado Denver Cancer Center, Denver, CO, USA. Library preparation and rRNA depletion were performed using the Zymo-Seq Ribo Free Total RNA Library Kit Cat No. R3000 with input of 250 ng and libraries were sequenced on the Illumina NovaSeq 6000 using 2x150 paired end reads. Raw RNA-seq data were trimmed using sickle (github.com/najoshi/sickle) (41). Prior genomic sequencing of R. rubrum strain WRdht confirmed the rlpA and ald2 deletions and >99% nucleotide identity to the R. rubrum ATCC11170 genome. Mapping of transcriptomic reads to the reference was conducted using Bowtie2 (v2.3.5.1) with the options --very-sensitive and --score-min L,0,-0.1 (42). Differential expression analysis was performed using DESeq2 (v1.22.2) (fitType=local, test=Wald) (43). Comparison of transcriptomes from the parent strain (WRdht) grown under 50 μM versus 1000 μM sulfate indicated all genes that were transcriptionally regulated >1.5-fold in response to sulfate availability (two-sided Wald Chi-square test, BH-FDR adjusted p<0.002 as implemented by DESeq2 (43)). Corresponding comparison for the SalR deletion strain (0785::Tn5) indicated which of these genes were no longer regulated in response to sulfate availability. Comparison of the SalR deletion strain to the parent strain under 1000 μM sulfate indicated which of these genes were potentially transcriptionally activated or repressed by SalR.
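By way of non-limiting illustration, the Benjamini-Hochberg (BH) adjustment and the combined fold-change and adjusted p-value cutoff described above can be sketched as follows. This is a generic rendering of the BH procedure, not the code used by Perseus or DESeq2, which perform the adjustment internally; the function names are hypothetical, and the default thresholds are the >1.5-fold and adjusted p<0.002 values stated for the transcriptomic comparison.

```python
def benjamini_hochberg(p_values):
    """Return BH-adjusted p-values in the same order as the input list."""
    n = len(p_values)
    order = sorted(range(n), key=lambda i: p_values[i])  # indices by ascending p
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone adjusted values.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


def is_regulated(fold_change, adjusted_p, min_fold=1.5, max_adjusted_p=0.002):
    """True when a gene changes by more than min_fold in either direction
    and its adjusted p-value is below max_adjusted_p.

    Assumption: fold_change is a positive ratio (condition A / condition B).
    """
    if fold_change <= 0:
        raise ValueError("fold change must be a positive ratio")
    magnitude = fold_change if fold_change >= 1 else 1.0 / fold_change
    return magnitude > min_fold and adjusted_p < max_adjusted_p
```

The same helper covers the proteomic thresholds by passing, for example, `min_fold=2` with `max_adjusted_p=0.05`, or `min_fold=4` with `max_adjusted_p=0.01`.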

Transposon mutagenesis: R. rubrum strain WRdht (ΔrlpA::GmR/Δald2) was randomly mutagenized using the efficient mini-Tn5 transposable element (44). R. rubrum was initially grown aerobically at 30 °C to late log phase in PYE liquid medium (3 g/L peptone, 3 g/L yeast extract, 266 mg/L MgSO4·7H2O, 75 mg/L CaCl2·2H2O, 11.8 mg/L FeSO4·7H2O, 20 mg/L ethylenediaminetetraacetic acid, 1 ml/L Ormerod’s trace elements solution (31), 1 mg/L thiamine, 1 mg/L nicotinic acid, 15 μg/L biotin). Donor strain, E. coli BW20767/pRL27 (Coli Genetic Stock Center, Yale) (44), was grown in lysogeny broth at 37 °C to mid exponential phase. Strains were separately centrifuged and washed three times with PYE medium, combined in a 1:2 ratio of E. coli to R. rubrum, concentrated, and spotted onto a 16% PYE agar plate. Biparental conjugation was carried out aerobically at 30 °C in the dark for no more than 24 h to ensure R. rubrum cells received no more than one Tn5 insertion per genome. R. rubrum transconjugants were selected on 16% PYE agar plates with 25 μg/ml kanamycin and 30 μg/ml gentamycin under the same growth conditions.

Transposon-insertion isolates of R. rubrum were individually picked into 96-well flat-bottom tissue culture plates containing 200 μl of sulfur-free Ormerod’s malate minimal medium supplemented with 100 μM ammonium sulfate and 25 μg/ml kanamycin. Inoculated plates were incubated in an anaerobic chamber for 2 h, sealed with thermal adhesive film to prevent evaporation, and further sealed in thermal-seal bags (Kapak, ProAmpac) to maintain anaerobic conditions. Isolates were grown anaerobically at 30 °C under 2000 lux incandescent illumination to late log phase. Cultures were briefly exposed to air atmosphere, quickly transferred by 96-pin transfer device to new anaerobic 96-well plates containing 200 μl of anaerobic sulfur-free Ormerod’s malate minimal medium supplemented with 1 mM ammonium sulfate or 1 mM MT-EtOH, and then incubated and sealed in an anaerobic chamber as before. Isolates were again grown anaerobically under illumination to screen for mutants incapable of growth on MT-EtOH but still able to grow on sulfate as sole S-source. A total of 11,250 mutants were screened to ensure each gene received a transposon insertion at least once (FIGs. 7A-7D). Putative ethylene pathway mutants were verified by confirmatory growth experiments in anaerobic culture tubes. The false discovery rate was 80% due to the sensitive nature of growing R. rubrum in 96-well plates with MT-EtOH as sole S-source. Validated ethylene pathway mutants were sequenced to determine the location of the Tn5 insertion as previously described (44).
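The relationship between library size and per-gene coverage can be sketched with a simple model. The sketch below assumes that each insertion lands in any gene with equal probability, which ignores differences in gene length and insertion bias, and it uses a hypothetical gene count of 4,000 that is not taken from the disclosure. The function names are illustrative only.

```python
import math

def prob_gene_missed(n_mutants, n_genes):
    """Probability that a given gene receives no insertion (uniform model)."""
    return (1.0 - 1.0 / n_genes) ** n_mutants

def expected_genes_missed(n_mutants, n_genes):
    """Expected number of genes with no insertion in the library."""
    return n_genes * prob_gene_missed(n_mutants, n_genes)

def mutants_needed(n_genes, target_hit_prob):
    """Smallest library size giving each gene at least target_hit_prob of a hit."""
    return math.ceil(math.log(1.0 - target_hit_prob) / math.log(1.0 - 1.0 / n_genes))

# Hypothetical gene count; 11,250 is the number of mutants screened in the text.
ASSUMED_GENE_COUNT = 4000
p_missed = prob_gene_missed(11250, ASSUMED_GENE_COUNT)
```

Under these assumptions the per-gene miss probability falls roughly exponentially with library size, which is why screening a library several times larger than the gene count is a common design choice.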

Gene deletion and complementation studies: Nonpolar gene cluster deletions of Rru_A1066-Rru_A1069, Rru_A0772-Rru_A0773, and Rru_A0793-Rru_A0796 in the R. rubrum wild type strain were performed by homologous recombination using previously described methods (10). Briefly, DNA fragments were amplified by PCR using primers listed in Table A below, digested with the indicated restriction enzyme following the manufacturer’s protocols (New England Biolabs), and ligated into pK18mobSacBgm (10) using T4 DNA ligase (New England Biolabs). Sequence-verified plasmids were transformed into E. coli Stellar strain (TaKaRa Bio) and mobilized into R. rubrum wild type by triparental conjugation with helper strain E. coli JM109/pRK2013 (American Type Culture Collection) (45), similar to methods used for the transposon mutagenesis. Transconjugants were selected on 16% PYE agar plates with 25 μg/ml kanamycin and 50 μg/ml streptomycin under aerobic growth at 30 °C. First and second homologous recombination events were selected by 10% (w/v) sucrose sensitivity and kanamycin resistance of the isolates, and second recombinants possessing the proper gene deletion were sequence verified.

Table A. Primers and Plasmids Used

Gene complementation of the R. rubrum NFL gene deletion strain A0772:3/A0793:6 was performed in trans by NFL genes expressed from complementation plasmid pMTAP (10). Genes were amplified by PCR using primers listed in Table A, digested with the indicated restriction enzyme, and ligated into pMTAP. Sequence-verified plasmids were transformed into E. coli Stellar strain (TaKaRa Bio) and mobilized into R. rubrum by triparental conjugation with helper strain E. coli JM109/pRK2013. Transconjugants were selected on 16% PYE agar plates with 2 μg/ml tetracycline and 50 μg/ml streptomycin under aerobic growth at 30 °C. Isolates were then tested for their ability to grow anaerobically with sulfate, MT-EtOH, or DMS as sole sulfur source. R. rubrum A0772:3/A0793:6 transconjugants with plasmids that complemented for growth on MT-EtOH and DMS were also assayed for restoration of ethylene and methane production by GC as described below.

Whole-cell VOSC utilization and gas production assays: Cells were initially grown aerobically in 150 ml serum bottles containing sulfur-free Ormerod’s malate minimal medium supplemented with 50 μM ammonium sulfate (methylthio-alkane reductase inducing conditions) to mid log phase (O.D. 660nm of 0.7-0.8). Cultures were washed anaerobically three times by centrifugation and resuspension in sulfur-free Ormerod’s malate minimal medium. Cells were resuspended to a final O.D. 660nm of ~2.0 (higher cell densities suppressed methylthio-alkane reductase activity), dispensed in 20 ml aliquots in 60 ml serum vials, fed with 1 mM of DMS, EMS, or MT-EtOH, sealed, and incubated at 30 °C under 2000 lux incandescent illumination for 12 h. Methane, ethane, and ethylene gas produced were quantified by GC as described below.

Whole-cell nitrogenase and methylthio-alkane reductase specific rate assays: R. rubrum wild type and NFL gene deletion (A0772:3/A0793:6) strains were grown anaerobically under argon headspace to late log phase (O.D. 660nm 0.9-1.1) in Ormerod’s malate minimal medium with 15 mM ammonium chloride or sodium glutamate as sole N-source and 50 μM or 1 mM sodium sulfate as sole S-source. For whole-cell nitrogenase assays (46), 2 ml of culture was transferred via syringe to an anaerobic 7.5 ml serum vial flushed with argon. Assays were initiated by the addition of 0.06 atm acetylene and allowed to proceed for 10 min under 2000 lux illumination at 30 °C. Assays were quenched with 100% (w/v) trichloroacetic acid (TCA) to a final concentration of 10%, and ethylene was quantified by GC as described below. Similarly, for the whole-cell methylthio-alkane reductase assay, 4 ml of culture was transferred via syringe to an anaerobic 8 ml serum vial flushed with argon. Assays were initiated by the addition of EMS to 1 mM final concentration and allowed to proceed for 30 min under 2000 lux illumination at 30 °C. Assays were quenched with TCA and ethane was quantified by GC.

GC analysis of hydrocarbons: Quantification of methane, ethane, and ethylene was performed using a Shimadzu GC-14A with a Restek Rt-Alumina BOND/Na2SO4 column. Gaseous culture headspace after feeding or growth experiments was injected (250-500 μl) at 180 °C and separated isothermally at 30 °C. Eluted compounds were detected by flame ionization detector at 180 °C and identified based on retention times of methane, ethane, and ethylene standards (Praxair). The total amount of each hydrocarbon present was calculated from the peak area as compared to standard concentration curves of the corresponding reference standard.
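The peak-area-to-amount conversion described above can be sketched as a linear standard curve. This is a minimal illustration assuming a linear detector response; the standard amounts, peak areas, and function names are hypothetical and are not values from the disclosure.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def amount_from_area(area, slope, intercept):
    """Invert the standard curve to estimate the amount from a peak area."""
    return (area - intercept) / slope

# Hypothetical ethylene standards: injected amount (nmol) vs. FID peak area.
standard_nmol = [0.0, 10.0, 20.0, 40.0]
standard_area = [0.0, 500.0, 1000.0, 2000.0]

slope, intercept = fit_line(standard_nmol, standard_area)
sample_nmol = amount_from_area(750.0, slope, intercept)
```

A separate curve would be fitted for each hydrocarbon (methane, ethane, ethylene), since detector response differs between compounds.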

Targeted metabolomics: R. rubrum wild type and Rru_A0793-Rru_A0796 deletion strains were grown anaerobically to an O.D. 660nm of 0.8 (mid log phase) in Ormerod’s malate minimal medium supplemented with 50 μM ammonium sulfate to induce ethylene production. Cultures were washed anaerobically three times by centrifugation and resuspension in sulfur-free Ormerod’s malate minimal medium. Cells were resuspended to a final O.D. 660nm of ~2.0 (higher concentrations repressed methylthio-alkane reductase activity), supplemented with 100 μM 5,5’-dithiobis-(2-nitrobenzoic acid) (Ellman’s reagent for trapping free thiols), and sealed as 1 ml aliquots in 1.5 ml anaerobic serum vials. Cells were then fed with 10 μM MT-EtOH and 1 μM (2-[methyl-14C]thio)ethanol and incubated under 2000 lux incandescent light at 30 °C. Metabolism was stopped by flash freezing in liquid nitrogen; cells were pelleted, the media supernatant was reserved, and the cell pellet was extracted with 80% acetonitrile + 0.04 N ammonium hydroxide with vortexing for 5 min followed by 20 min incubation at -20 °C. Acetonitrile was removed by vacuum concentration, and the extracted metabolites were combined with the reserved media supernatant. Metabolites were separated by reverse phase HPLC and identified by inline scintillation detector based on retention time compared to reference standards as previously described for N = 2 biological replicates (10).

Free-energy calculations: Standard free energies of formation and reaction were determined using electronic structure calculations with continuum solvent models. Specifically, density functional theory with the B3LYP (47, 48) exchange correlation functional was used with the 6-311++G(2d,2p) basis set. The geometries were optimized and harmonic frequencies determined in a continuum model solvent using the COSMO self-consistent reaction field method (49). All calculations were performed with the NWChem computational chemistry package (50) using the EMSL Arrows interface (51). H2 was used as the electron donor in each redox reaction since the actual electron donor is not known. The relative difference in the reaction free energies will not change if, for example, ferredoxin or any other redox pair were used as the electron donor, since the electrochemical potential of the actual electron donor would be measured relative to the standard hydrogen electrode.
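The invariance of the relative free-energy differences can be written out explicitly. As a sketch, assume two reactions i and j that each consume the same number n of two-electron reducing equivalents, and let \Delta G_{\mathrm{D}} denote the free energy of the exchange reaction in which the reduced form of an alternative donor D regenerates one H2 (these symbols are introduced here for illustration and do not appear in the disclosure):

```latex
\begin{align}
\Delta G_i^{\mathrm{D}} &= \Delta G_i^{\mathrm{H_2}} + n\,\Delta G_{\mathrm{D}}, \\
\Delta G_j^{\mathrm{D}} &= \Delta G_j^{\mathrm{H_2}} + n\,\Delta G_{\mathrm{D}}, \\
\Delta G_i^{\mathrm{D}} - \Delta G_j^{\mathrm{D}} &= \Delta G_i^{\mathrm{H_2}} - \Delta G_j^{\mathrm{H_2}} .
\end{align}
```

The donor-dependent term n\,\Delta G_{\mathrm{D}} appears in both reactions and cancels in the difference, so the ranking of the reactions is unchanged by the choice of donor under this equal-stoichiometry assumption.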

Phylogenetics: The R. rubrum MarH, MarD, and MarK proteins were separately queried against the NCBI reference genome database using the translated nucleotide blast (tblastn) algorithm and filtered for protein subjects with e-value < 1e-50. Each identified MarH, MarD, and MarK candidate was correlated with its reference genome, and only genomes that contained all three homologues on the same contig, with MarD and MarK adjacent, were retained. These candidates, along with recently discovered Group VI representatives from metagenome assembled genomes (28), were then appended to a reference nitrogenase (Groups I, II, III) and NFL sequence (Groups IV and V) database (9) with additional sequences identified from genomes in the JGI IMG/M database. Amino acid sequences were aligned using MAFFT (52) (v7.394) (--auto). Alignments were trimmed using TrimAl (53) (v1.4.rev22) (-gappyout). Maximum likelihood trees were constructed using IQ-TREE (54) (v1.6.8) (-alrt 1000 -bb 1000) using best-fit models (NifH: LG+R10; NifD: LG+R6) identified by ModelFinder (55) as implemented in IQ-TREE with ultrafast bootstrap (UFBoot) (56). Pairwise alignment of NifB, NifH, NifD, and NifK superfamily sequences for conserved active site residue analysis (FIG. 10-FIG. 13) was performed using Clustal Omega (EMBL-EBI) (57) and visualized with Jalview (58). Gene synteny (FIG. 18) was visualized using the R package (R Foundation, Vienna, Austria) ‘gggenes’ (59) for an ~28 kbp neighborhood centered on the NifD homologs identified in selected genomes representing the Nif and NFL clades.
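The genome-retention rule described above (all three homologues on one contig, with MarD and MarK adjacent) can be sketched in Python. This is only an illustration: the record layout, the `gene_index` field (ordinal position of a gene on its contig), and the definition of adjacency as consecutive gene indices are assumptions made here, not details from the disclosure, and the example hits are hypothetical.

```python
REQUIRED = ("MarH", "MarD", "MarK")

def genomes_with_mar_cluster(hits, max_evalue=1e-50):
    """Return genomes with MarH, MarD, and MarK hits on one contig
    and with MarD and MarK at consecutive gene positions."""
    by_contig = {}
    for hit in hits:
        if hit["evalue"] >= max_evalue:
            continue  # discard weak hits before grouping
        key = (hit["genome"], hit["contig"])
        by_contig.setdefault(key, {}).setdefault(hit["protein"], []).append(hit["gene_index"])
    kept = set()
    for (genome, _contig), found in by_contig.items():
        if any(name not in found for name in REQUIRED):
            continue  # not all three homologues on this contig
        if any(abs(d - k) == 1 for d in found["MarD"] for k in found["MarK"]):
            kept.add(genome)
    return sorted(kept)

def _hit(genome, contig, protein, gene_index, evalue=1e-80):
    return {"genome": genome, "contig": contig, "protein": protein,
            "gene_index": gene_index, "evalue": evalue}

# Hypothetical hits: only g1 satisfies every criterion.
example_hits = [
    _hit("g1", "c1", "MarH", 10), _hit("g1", "c1", "MarD", 11), _hit("g1", "c1", "MarK", 12),
    _hit("g2", "c1", "MarH", 4), _hit("g2", "c2", "MarD", 5), _hit("g2", "c2", "MarK", 6),
    _hit("g3", "c1", "MarH", 1), _hit("g3", "c1", "MarD", 3), _hit("g3", "c1", "MarK", 9),
    _hit("g4", "c1", "MarH", 1), _hit("g4", "c1", "MarD", 2), _hit("g4", "c1", "MarK", 3, evalue=1e-10),
]
retained = genomes_with_mar_cluster(example_hits)
```

In practice the hit records would be derived from tblastn tabular output joined with genome annotation coordinates; that parsing step is omitted here.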

To identify organisms with native ethylene capacity (DHAP Shunt plus marBHDK genes, FIG. 17), organisms with a putative MarHDK complex, as indicated by the phylogenetic tree analysis (FIG. 4 and FIG. 16), were then analyzed for the presence of DHAP shunt homologues by querying each genome (tblastn) with the R. rubrum and E. coli DHAP Shunt genes (10, 25), MtnK, MtnP, MtnA, and Ald2, with a cutoff of e-value < 1e-20. For organism phylogenetic analysis (FIG. 17), 113 genomic sequences including R. rubrum, R. palustris, B. viridis, and additional randomly selected organisms with MarHDK genes were downloaded from NCBI (Genome or Assembly databases). This set of genomes was aligned to a set of reference bacteria using GTDB-Tk (de_novo_wf) (60). The non-redundant subset of organisms shown in FIG. 17, together with Chloroflexota sequences from the reference database as the outgroup, was extracted from the alignment, and a maximum likelihood tree was built using IQ-TREE (54) (-alrt 1000 -bb 1000) using the best-fit model LG+F+R6 identified by ModelFinder (55) as implemented in IQ-TREE with ultrafast bootstrap (UFBoot) (56).

References

1. E. E. Stueken, R. Buick, B. M. Guy, M. C. Koehler, Isotopic evidence for biological nitrogen fixation by molybdenum-nitrogenase from 3.2 Gyr. Nature. 520, 666-669 (2015).

2. M. C. Weiss, F. L. Sousa, N. Mrnjavac, S. Neukirchen, M. Roettger, S. Nelson-Sathi, W. F. Martin, The physiology and habitat of the last universal common ancestor. Nat. Microbiol. 1, 16116 (2016).

3. E. S. Boyd, J. W. Peters, New biological insights into the evolutionary history of biological nitrogen fixation. Front. Microbiol. 4, 201 (2013).

4. K. Zheng, P. D. Ngo, V. L. Owens, X. P. Yang, S. O. Mansoorabadi, The biosynthetic pathway of coenzyme F430 in methanogenic and methanotrophic archaea. Science. 354, 339-342 (2016).

5. S. J. Moore, S. T. Sowa, C. Schuchardt, E. Deery, A. D. Lawrence, J. V. Ramos, S. Billig, C. Birkemeyer, P. T. Chivers, M. J. Howard, S. E. Rigby, G. Layer, M. J. Warren, Elucidation of the biosynthesis of the methane catalyst coenzyme F430. Nature. 543, 78-82 (2017).

6. N. Muraki, J. Nomata, K. Ebata, T. Mizoguchi, T. Shiba, H. Tamiaki, G. Kurisu, Y. Fujita, X-ray crystal structure of the light-independent protochlorophyllide reductase. Nature. 465, 110-4 (2010).

7. J. Nomata, T. Mizoguchi, H. Tamiaki, Y. Fujita, A second nitrogenase-like enzyme for bacteriochlorophyll biosynthesis: reconstitution of chlorophyllide a reductase with purified X-protein (BchX) and YZ-protein (BchY-BchZ) from Rhodobacter capsulatus. J. Biol. Chem. 281, 15021-8 (2006).

8. P. C. Dos Santos, Z. Fang, S. W. Mason, J. C. Setubal, R. Dixon, Distribution of nitrogen fixation and nitrogenase-like sequences amongst microbial genomes. BMC Genomics. 13, 162 (2012).

9. J. Raymond, J. L. Siefert, C. R. Staples, R. E. Blankenship, The natural history of nitrogen fixation. Mol. Biol. Evol. 21, 541-54 (2004).

10. J. A. North, A. R. Miller, J. A. Wildenthal, S. J. Young, F. R. Tabita, Microbial pathway for anaerobic 5'-methylthioadenosine metabolism coupled to ethylene formation. Proc. Natl. Acad. Sci. U.S.A. 114, E10455-E10464 (2017).

11. N. Parveen, K. A. Cornell, Methylthioadenosine/S-adenosylhomocysteine nucleosidase, a critical enzyme for bacterial metabolism. Mol. Microbiol. 79, 7-20 (2011).

12. S. Buren, E. Jimenez-Vicente, C. Echavarri-Erasun, L. M. Rubio, Biosynthesis of nitrogenase cofactors. Chem. Rev., doi: 10.1021/acs.chemrev.9b00489 (2020).

13. T. J. Erb, B. S. Evans, K. Cho, B. P. Warlick, J. Sriram, B. M. Wood, H. J. Imker, J. V. Sweedler, F. R. Tabita, J. A. Gerlt, A RuBisCO-like protein links SAM metabolism with isoprenoid biosynthesis. Nat. Chem. Biol. 8, 926-932 (2012).

14. Y. Zhang, E. L. Pohlmann, P. W. Ludden, G. P. Roberts, Mutagenesis and functional characterization of the glnB, glnA, and nifA genes from the photosynthetic bacterium Rhodospirillum rubrum. J. Bacteriol. 182, 983-92 (2000).

15. D. Sippel, O. Einsle, The structure of vanadium nitrogenase reveals an unusual bridging ligand. Nat. Chem. Biol. 13, 956-960 (2017).

16. L. M. Zhang, C. M. Morrison, J. T. Kaiser, D. C. Rees, Nitrogenase MoFe protein from Clostridium pasteurianum at 1.08 Å resolution: comparison with the Azotobacter vinelandii MoFe protein. Acta Crystallogr. D Biol. Crystallogr. 71, 274-282 (2015).

17. D. Sippel, M. Rohde, J. Netzer, C. Trncik, J. Gies, K. Grunau, I. Djurdjevic, L. Decamps, S. L. A. Andrade, O. Einsle, A bound reaction intermediate sheds light on the mechanism of nitrogenase. Science. 359, 1484-1489 (2018).

18. M. Bizic, T. Klintzsch, D. Ionescu, M. Y. Hindiyeh, M. Gunthel, A. M. Muro-Pastor, W. Eckert, T. Urich, F. Keppler, H.-P. Grossart, Aquatic and terrestrial cyanobacteria produce methane. Sci. Adv. 6, eaax5343 (2020).

19. D. Repeta, S. Ferron, O. Sosa, C. G. Johnson, L. D. Repeta, M. Acker, E. F. DeLong, D. M. Karl, Marine methane paradox explained by bacterial degradation of dissolved organic matter. Nat. Geosci. 9, 884-887 (2016).

20. Y. Zheng, D. F. Harris, Z. Yu, Y. Fu, S. Poudel, R. N. Ledbetter, K. R. Fixen, Z. Y. Yang, E. S. Boyd, M. E. Lidstrom, L. C. Seefeldt, C. S. Harwood, A pathway for biological methane production using bacterial iron-only nitrogenase. Nat. Microbiol. 3, 281-286 (2018).

21. K. A. Smith, R. S. Russell, Occurrence of ethylene and its significance in anaerobic soils. Nature. 222, 769-771 (1969).

22. S. Manik, G. Pengilley, G. Dean, B. Field, S. Shabala, M. Zhou, Soil and crop management practices to minimize the impact of waterlogging on crop productivity. Front. Plant Sci. 10, 140 (2019).

23. J. M. Lynch, Identification of substrates and isolation of micro-organisms responsible for ethylene production in soil. Nature. 240, 45-46 (1972).

24. J. M. Lynch, Ethylene in soil. Nature. 156, 576-577 (1975).

25. J. A. North, J. A. Wildenthal, T. J. Erb, B. S. Evans, K. M. Byerly, J. A. Gerlt, F. R. Tabita, A bifunctional salvage pathway for two distinct S-adenosylmethionine byproducts that is widespread in bacteria, including pathogenic Escherichia coli. Mol. Microbiol. 10.1111/mmi.14459 (2020).

26. G. A. W. Beaudoin, Q. Li, J. Folz, O. Fiehn, J. L. Goodsell, A. Angerhofer, S. D. Bruner, A. D. Hanson, Salvage of the 5-deoxyribose byproduct of radical SAM enzymes. Nat. Commun. 9, 3105 (2018).

27. H. Zheng, C. Dietrich, R. Radek, A. Brune, Endomicrobium proavitum, the first isolate of Endomicrobia class. nov. (phylum Elusimicrobia) - an ultramicrobacterium with an unusual cell cycle that fixes nitrogen with a Group IV nitrogenase. Environ. Microbiol. 18, 191-204 (2016).

28. R. Meheust, C. J. Castelle, P. B. M. Carnevali, I. F. Farag, C. He, L. X. Chen, Y. Amano, L. A. Hug, J. F. Banfield, Aquatic Elusimicrobia are metabolically diverse compared to gut microbiome Elusimicrobia and some have novel nitrogenase-like gene clusters, https://www.biorxiv.org/content/10.1101/765248v2 (2019).

29. H. J. Imker, A. A. Fedorov, E. V. Fedorov, S. C. Almo, J. A. Gerlt, Mechanistic diversity in the RuBisCO superfamily: the "enolase" in the methionine salvage pathway in Geobacillus kaustophilus. Biochemistry. 46, 4077-89 (2007).

30. J. Singh, F. R. Tabita, Roles of RubisCO and the RubisCO-like protein in 5-methylthioadenosine metabolism in the nonsulfur purple bacterium Rhodospirillum rubrum. J. Bacteriol. 192, 1324-31 (2010).

31. H. Strnad, A. Lapidus, J. Paces, P. Ulbrich, C. Vlcek, V. Paces, R. Haselkorn, Complete genome sequence of the photosynthetic purple nonsulfur bacterium Rhodobacter capsulatus SB 1003. J. Bacteriol. 192, 3545-6 (2010).

32. F. E. Rey, Y. Oda, C. S. Harwood, Regulation of uptake hydrogenase and effects of hydrogen utilization on gene expression in Rhodopseudomonas palustris. J. Bacteriol. 188, 6143-6152 (2006).

33. G. Drews, P. Giesbrecht, Rhodopseudomonas viridis, nov. spec., ein neu isoliertes, obligat phototrophes Bakterium. Archiv fur Mikrobiol. 53, 255-262 (1966).

34. J. G. Ormerod, K. S. Ormerod, H. Gest, Light-dependent utilization of organic compounds and photoproduction of molecular hydrogen by photosynthetic bacteria; relationships with nitrogen metabolism. Arch. Biochem. Biophys. 94, 449-463 (1961).

35. S. Dey, J. A. North, J. Sriram, B. S. Evans, F. R. Tabita, In vivo studies in Rhodospirillum rubrum indicate that ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco) catalyzes two obligatorily required and physiologically significant reactions for distinct carbon and sulfur metabolic pathways. J. Biol. Chem. 290, 30658-68 (2015).

36. D. P. Canniffe, D. A. Bryant, Engineered biosynthesis of bacteriochlorophyll b in Rhodobacter sphaeroides. Biochim. Biophys. Acta. 1837, 1611-6 (2014).

37. D. L. Tabb, C. G. Fernando, M. C. Chambers, MyriMatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis. J. Proteome Res. 6, 654-661 (2007).

38. Z. Q. Ma, S. Dasari, M. C. Chambers, M. D. Litton, S. M. Sobecki, L. J. Zimmerman, P. J. Halvey, B. Schilling, P. M. Drake, B. W. Gibson, D. L. Tabb, IDPicker 2.0: Improved protein assembly with high discrimination peptide identification filtering. J. Proteome Res. 8, 3872-3881 (2009).

39. T. Taverner, Y. V. Karpievitch, A. D. Polpitiya, J. N. Brown, A. R. Dabney, G. A. Anderson, R. D. Smith, DanteR: an extensible R-based tool for quantitative analysis of -omics data. Bioinformatics. 28, 2404-2406 (2012).

40. S. Tyanova, T. Temu, P. Sinitcyn, A. Carlson, M. Y. Hein, T. Geiger, M. Mann, J. Cox, The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods. 9, 731-740 (2016).

41. N. A. Joshi, J. N. Fass, Sickle: A sliding-window, adaptive, quality-based trimming tool for FastQ files (Version 1.33), https://github.com/najoshi/sickle (2011).

42. B. Langmead, S. L. Salzberg, Fast gapped-read alignment with Bowtie 2. Nat. Methods. 9, 357-359 (2012).

43. M. I. Love, W. Huber, S. Anders, Moderated estimation of fold change and dispersion for RNA-Seq data with DESeq2. Genome Biol. 15, 550 (2014).

44. R. A. Larsen, M. M. Wilson, A. M. Guss, W. W. Metcalf, Genetic analysis of pigment biosynthesis in Xanthobacter autotrophicus Py2 using a new, highly efficient transposon mutagenesis system that is functional in a wide variety of bacteria. Arch. Microbiol. 178, 193-201 (2002).

45. D. H. Figurski, D. R. Helinski, Replication of an origin-containing derivative of plasmid RK2 dependent on a plasmid function provided in trans. Proc. Natl. Acad. Sci. U.S.A. 76, 1648-1652 (1979).

46. R. W. F. Hardy, R. D. Holsten, E. K. Jackson, R. C. Burns, The acetylene reduction assay for N2 fixation: laboratory and field evaluation. Plant Physiol. 43, 1185-1207 (1968).

47. C. T. Lee, W. T. Yang, R. G. Parr, Development of the Colle-Salvetti correlation-energy formula into a functional of the electron-density. Phys. Rev. B. 37, 785-789 (1988).

48. A. D. Becke, Density-functional thermochemistry. III. The role of exact exchange. J. Chem. Phys. 98, 5648-5652 (1993).

49. A. Klamt, G. Schuurmann, COSMO - a new approach to dielectric screening in solvents with explicit expressions for the screening energy and its gradient. J. Chem. Soc., Perkin Trans. 2. 1993, 799-805 (1993).

50. M. Valiev, E. J. Bylaska, N. Govind, K. Kowalski, T. P. Straatsma, H. J. J. Van Dam, D. Wang, J. Nieplocha, E. Apra, T. L. Windus, W. A. de Jong, NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations. Comput. Phys. Commun. 181, 1477-1489 (2010).

51. E. J. Bylaska, EMSL Arrows, https://arrows.emsl.pnnl.gov/api/ (2020).

52. K. Katoh, D. M. Standley, MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol. 30, 772-780 (2013).

53. S. Capella-Gutierrez, J. M. Silla-Martinez, T. Gabaldon, trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 25, 1972-1973 (2009).

54. L. T. Nguyen, H. A. Schmidt, A. von Haeseler, B. Q. Minh, IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268-274 (2015).

55. S. Kalyaanamoorthy, B. Q. Minh, T. K. F. Wong, A. von Haeseler, L. S. Jermiin, ModelFinder: fast model selection for accurate phylogenetic estimates. Nat. Methods. 14, 587-589 (2017).

56. D. T. Hoang, O. Chernomor, A. von Haeseler, B. Q. Minh, L. S. Vinh, UFBoot2: Improving the ultrafast bootstrap approximation. Mol. Biol. Evol. 35, 518-522 (2018).

57. F. Madeira, Y. M. Park, J. Lee, N. Buso, T. Gur, N. Madhusoodanan, P. Basutkar, A. R. N. Tivey, S. C. Potter, R. D. Finn, R. Lopez, The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res. 47, W636-W641 (2019).

58. A. M. Waterhouse, J. B. Procter, D. M. A. Martin, M. Clamp, G. J. Barton, Jalview Version 2 - a multiple sequence alignment editor and analysis workbench. Bioinformatics. 25, 1189-1191 (2009).

59. D. Wilkins, gggenes: Draw Gene Arrow Maps in 'ggplot2'. R package version 0.4.0. https://wilkox.org/gggenes (2019).

60. P.-A. Chaumeil, A. J. Mussig, P. Hugenholtz, D. H. Parks, GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 36, 1925-1927 (2019).

61. S. Poudel, D. R. Colman, K. R. Fixen, R. N. Ledbetter, Y. Zheng, N. Pence, L. C. Seefeldt, J. W. Peters, C. S. Harwood, E. S. Boyd, Electron Transfer to Nitrogenase in Different Genomic and Metabolic Backgrounds. J. Bacteriol. 200, e00757-17 (2018).

62. B. M. Hoffman, D. Lukoyanov, Z.-Y. Yang, D. R. Dean, L. C. Seefeldt, Mechanism of nitrogen fixation by nitrogenase: the next stage. Chem. Rev. 114, 4041-62 (2014).

63. J. Oetjen, B. Reinhold-Hurek, Characterization of the DraT/DraG system for posttranslational regulation of nitrogenase in the endophytic betaproteobacterium Azoarcus sp. Strain BH72. J. Bacteriol. 191, 3726-3735 (2009).

64. M. J. Brocker, S. Virus, S. Ganskow, P. Heathcote, D. W. Heinz, W. D. Schubert, D. Jahn, J. Moser, ATP-driven reduction by dark-operative protochlorophyllide oxidoreductase from Chlorobium tepidum mechanistically resembles nitrogenase catalysis. J. Biol. Chem. 283, 10559-67 (2008).

65. S. J. Moore, S. T. Sowa, C. Schuchardt, E. Deery, A. D. Lawrence, J. Vazquez Ramos, S. Billig, C. Birkemeyer, P. T. Chivers, M. J. Howard, S. E. J. Rigby, G. Layer, M. J. Warren, Elucidation of the biosynthesis of the methane catalyst coenzyme F430. Nature. 543, 78-82 (2017).

66. Y. Hu, J. M. Yoshizawa, A. W. Fay, C. Chung Lee, J. A. Wiig, M. W. Ribbe, Catalytic activities of NifEN: Implications for nitrogenase evolution and mechanism. Proc. Natl. Acad. Sci. U. S. A. 106, 16962-16966 (2009).

67. Miller, A. R., North, J. A., Wildenthal, J. A. & Tabita, F. R. Two distinct aerobic methionine salvage pathways generate volatile methanethiol in Rhodopseudomonas palustris. MBio 9, e00407-18 (2018).

68. Varaljay, V. A., Satagopan, S., North, J. A., Witte, B., Dourado, M. N., Anantharaman, K., Arbing, M. A., Hoeft McCann, S., Oremland, R. S., Banfield, J. F., Wrighton, K. C. and Tabita, F. R. Functional metagenomic selection of RubisCO from uncultivated bacteria. Environ. Microbiol. 18, 1187-1199 (2016).

69. J. J. Hultqvist, O. Warsi, A. Soderholm, M. Knopp, U. Eckhard, E. Vorontsov, M. Selmer, D. A. Andersson. A bacteriophage enzyme induces bacterial metabolic perturbation that confers a novel promiscuous function. Nat Ecol Evol. 2, 1321-1330 (2018).

70. J. A. Hughes. In vivo hydrolysis of S-adenosyl-L-methionine in Escherichia coli increases export of 5-methylthioribose. Can J Microbiol. 52, 599-602 (2006).

71. Curson, A. R. J., Todd, J. D., Sullivan, M. J. & Johnston, A. W. B. Catabolism of dimethylsulphoniopropionate: microorganisms, enzymes and genes. Nat. Rev. Microbiol. 9, 849-859 (2011).

72. Carrion, O., Curson, A., Kumaresan, D., Fu, Y., Lang, A. S., Mercade, E. & Todd, J. D. A novel pathway producing dimethylsulphide in bacteria is widespread in soil environments. Nat Commun 6, 6579 (2015).

It will be apparent to those skilled in the art that various modifications and variations can be made in the present disclosure without departing from the scope or spirit of the invention. Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the methods disclosed herein. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.