Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
MORPHINAN N-DEMETHYLASE ISOLATED FROM THE METHYLOBACTERIUM THEBAINFRESSER AND METHODS OF USE THEREOF
Document Type and Number:
WIPO Patent Application WO/2018/136654
Kind Code:
A1
Abstract:
Disclosed herein are methods for N-demethylating an N-methylated compound using an enzymatic reaction, rather than, e.g. a chemical modification. Also provided herein are enzymes for performing the reaction.

Inventors:
KUTCHAN TONI (US)
AUGUSTIN MEGAN (US)
Application Number:
PCT/US2018/014271
Publication Date:
July 26, 2018
Filing Date:
January 18, 2018
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
DONALD DANFORTH PLANT SCIENCE CENTER (US)
International Classes:
A61K31/49; A61K31/485; C07D221/28; C12N9/02; C12P17/18; C12Q1/26
Domestic Patent References:
WO2016039961A12016-03-17
Foreign References:
US20140227729A12014-08-14
US4115389A1978-09-19
US20140329846A12014-11-06
US20070020624A12007-01-25
US20130332133A12013-12-12
Attorney, Agent or Firm:
HOLTZ, William A. et al. (US)
Download PDF:
Claims:
CLAIMS

What is claimed is:

1. A method of N-demethylating a low molecular weight N-methylated compound, the method comprising incubating the N-methylated compound with an enzyme comprising N-demethylase activity, wherein the enzyme is a morphinan N-dem ethyl ase (MND) identified from the Methylobacterium Thebainfresser, or a fragment or variant thereof having N-demethylase activity, thereby converting the low molecular weight N-methylated compound into an N-demethylated compound.

2. The method of claim 1, wherein the N-methylated compound is of natural or synthetic origin.

3. The method of claim 1 or 2, wherein the MND enzyme or a fragment or variant thereof is isolated, purified, or isolated and purified.

4. The method of any one of claims 1 to 3, wherein the method comprises adding a cofactor to the MND enzyme or a fragment or variant thereof prior to and/or during incubation with the N-methylated compound.

5. The method of claim 4, wherein the cofactor is flavin adenine dinucleotide

(FAD).

6. The method of any one of claims 1 to 5, wherein the MND enzyme or a fragment or variant thereof further comprises one or more of benzylisoquinoline alkaloid N- demethylase activity, tropane alkaloid N-demethylase activity, pyrroloindole alkaloid N- demethylase activity, piperidine alkaloid N-demethylase activity, aporphine alkaloid N- demethylase activity, indole alkaloid N-demethylase activity, or Amaryllidaceae alkaloid N- demethylase activity.

7. The method of any one of claims 1 to 6, wherein the MND enzyme or a fragment or variant thereof has a Vmax of about 0.32 pmol s"1, Km of about 0.97 μΜ, and/or a Kcat of about 6.5 x 10"4 s"1, wherein thebaine is the substrate.

8. The method of any one of claims 1 to 7, wherein the N-methylated compound is a heterocyclic compound having between 3 and 7 atoms, having between 4 and 7 atoms, having between 5 and 7 atoms, having between 3 and 6 atoms, having between 4 and 6 atoms, having 5 or 6 atoms, or having 6 or 7 atoms, within the ring portion of the compound.

9. The method of any one of claims 1 to 8, wherein the MND enzyme or fragment or variant thereof comprises:

i) the amino acid sequence SEQ. ID NO: 2;

ii) a fragment of the amino acid sequence SEQ. ID NO: 2, wherein the fragment comprises an N- and/or C-terminal truncation of SEQ ID NO: 2; or iii) a variant of SEQ. ID NO: 2, wherein the variant has at least 85%, 90%, 95%, 98%, or 99% identity with the amino acid sequence SEQ ID NO: 2.

10. The method of any one of claims 1 to 8, wherein ii) the MND enzyme fragment is a fragment produced by partial Proteinase K digestion of a polypeptide comprising the amino acid sequence SEQ ID NO: 2.

11. The method of any one of claims 1 to 10, wherein the N-methylated compound is selected from the group consisting of thebaine, oripavine, (R)-reticuline, salutaridine, salutaridinol, heroin, morphinone, codeinone, codeine, morphine, hydromorphone, oxymorphone, galanthamine, laudanine (laudanidine), orientaline, protosinomenine, isoorientaline, laudanosine, (^-reticuline, scopolamine, hyoscyamine (atropine), noscapine (narcotine), gramine, (-)-lobeline, physostigmine, isothebaine, and tropinone.

12. The method of any one of claims 1 to 11, wherein the N-methylated compound is thebaine.

13. The method of claim 12, wherein thebaine is converted into the N- demethylated compound N-demethylthebaine (northebaine).

14. The method of any one of claims 1 to 13, wherein in the conversion rate of the N-methylated compound into an N-demethylated compound is at least about 20%, 30% 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, 99.9%, or 100%.

15. The method of claim 13, wherein in the conversion rate of thebaine into northebaine is at least about 90%, 95%, 98%, 99%, 99.9%, or 100%.

16. The method of any one of claims 1 to 15, wherein the N-methylated compound is incubated with the MND enzyme of a fragment or variant thereof for at least about 20 minutes, 30 minutes, 1 hour, 2 hours, 4 hours, 6 hours, 8 hours, 12 hours, 18 hours, 24 hours, 48 hours, or 72 hours.

17. The method of any one of claim 1 to 16, wherein the MND enzyme or a fragment or variant thereof is immobilized.

18. The method of any one of claims 1 to 17, wherein the N-methylated compound is incubated with the MND enzyme or a fragment or variant thereof at a temperature of from any of about 4 °C, 15 °C, 20 °C, 25 °C, 30 °C, 37 °C, 42 °C, 48 °C, 55 °C, 60 °C, 65 °C, or 72 °C to any of about 15 °C, 20 °C, 25 °C, 30 °C, 37 °C, 42 °C, 48 °C, 55 °C, 60 °C, 65 °C, 72 °C, or 80 °C.

19. The method of claim 18, wherein the N-methylated compound is incubated with the MND enzyme or a fragment or variant thereof at a temperature of about 4 °C, 15 °C,

25 °C, 30 °C, 37 °C, 42 °C, 48 °C, 55 °C, 60 °C, 72 °C, or 80 °C.

20. The method of any one of claims 1 to 19, wherein the N-methylated compound is incubated with the MND enzyme or a fragment or variant thereof in a buffered solution, wherein the buffered solution comprises a buffering capacity of from any of about pH 3.0, pH 4.0, pH 5.0, pH 6.0, pH 7.0, pH 8.0, or pH 9.0 to any of about pH 4.0, pH 5.0, pH 6.0, pH 7.0, pH 9.0, pH 10.0, or pH 10.5.

21. The method of any one of claims 1 to 20, wherein the N-methylated compound is incubated with the MND enzyme or a fragment or variant thereof at a pH of about 4.0 in a citrate buffer.

22. The method of any one of claims 1 to 21, wherein the N-demethylated compound produced is further modified by the chemical or enzymatic addition of a functional moiety to the demethylated nitrogen.

23. The method of any one of claims 1 to 22, wherein the N-methylated compound is incubated with the MND enzyme or a fragment or variant thereof in the presence of a solvent other than water.

24. The method of Claim 23, wherein the solvent is a non-polar solvent, a polar protic solvent, or a polar aprotic solvent.

25. The method of Claim 23 or 24, wherein the solvent is selected from the group consisting of hexane, heptane, toluene, chloroform, diethyl ether, ethanol, methanol, acetic acid, n-propanol, t-butanol, dimethyl sulfoxide (DMSO), tetrahydrofuran, dichloromethane, ethyl acetate, acetone, acetonitrile, and dimethylformamide.

26. The method of any one of claims 23 to 25, wherein the solvent is at a concentration of at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99%.

27. The method of any one of claims 1 to 26, wherein the method is used in drug discovery.

28. A method of producing an active N-substituted compound or precursor thereof, the method comprising using an N-demethylated compound produced by the method of any one of claims 1 to 25 as a precursor to produce the active N-substituted compound or precursor thereof.

29. The method of claim 28, wherein the active compound or precursor thereof is produced by the chemical or enzymatic addition of a functional moiety to the demethylated nitrogen.

30. The method of claim 28 or 29, wherein the method further comprises screening the activity of the active compound for drug discovery.

31. The method of any one of claims 28 to 30, wherein the active compound produced is a pharmaceutical compound.

32. The method of any one of claims 28 to 31, wherein the active compound produced is a synthetic or semisynthetic opiate.

33. The method of claim 32, wherein the synthetic or semisynthetic opiate is selected from the group consisting of oxycodone, oxymorphine, nalbuphine, naltrexone, and nalmefene.

34. The method of any one of claims 28 to 31, wherein the active compound produced is selected from the group consisting of tropane alkaloids, benzylisoquinoline alkaloids, pyrroloindole alkaloids, piperidine alkaloids, aporphine alkaloids, and

Amaryllidaceae alkaloids.

35. The method of any one of claims 28 to 31, wherein the active compound produced is selected from the group consisting of noratropine, oxitropium, and ipratropium bromide.

36. The method of claim 29, wherein an N-methyl group of the N-methylated compound is substituted in the active compound with at least one alkyl group.

37. The method of claim 29, wherein the demethylated nitrogen is substituted with:

(i) a neutral N-Rl group, wherein Rl is selected from the group consisting of a methyl group, an isopropyl group, an ethyl group, a propene group, a cyclopropylmethyl group, and a cyclobutylmethyl group, or

(ii) a cationic N+-R1 group, wherein Rl is selected from the group consisting of a methyl group and an isopropyl group, a methyl group and an ethyl group, a methyl group and a propene group, a methyl group and a cyclopropylmethyl group, and a methyl group and a cyclobutylmethyl group.

38. The method of claim 37, wherein the demethylated nitrogen is substituted with N-Rl or N+-R1 havin a structure selected from the group consisting of:

39. A non-naturally occurring nucleic acid comprising a nucleotide sequence that encodes:

i) the amino acid sequence SEQ. ID NO: 2;

ii) a fragment of the amino acid sequence SEQ. ID NO: 2, wherein the fragment comprises an N- and/or C-terminal truncation of SEQ ID NO: 2; or

iii) a variant of SEQ ID NO: 2, wherein the variant has at least about 85%, 90%, 95%, 98%, or 99% identity with the amino acid sequence SEQ ID NO: 2.

40. The nucleic acid of claim 39 comprising the nucleotide sequence having at least about 75%, 80%, 85%, 90%, 95%, 98%, or 99% identity to SEQ ID NO: 1.

41. The nucleic acid of claim 39 comprising the nucleotide sequence SEQ ID NO:

1.

42. The nucleic acid of claim 39, wherein the nucleotide sequence comprises one or more codons preferred for expression in a bacterial, yeast, insect, or plant cell.

43. The nucleic acid of claim 42, wherein the nucleotide sequence is codon optimized for expression in Arabidopsis thaliana.

44. The nucleic acid of claim 43 comprising the nucleotide sequence SEQ ID NO:

3.

45. A recombinant nucleic acid construct comprising the nucleotide sequence of any one of claims 39 to 44 operably linked to at least one heterologous transcriptional regulatory sequence.

46. The construct of claim 45, wherein the heterologous transcriptional regulatory sequence is a promotor sequence.

47. A vector comprising the nucleic acid of any one of claims 39 to 44 or the recombinant nucleic acid construct of claim 45 or 46.

48. A host cell comprising the nucleotide sequence of any one of claims 39 to 44.

49. The host cell of claim 48 comprising the recombinant nucleic acid construct of claim 45 or 46, or the vector of claim 47.

50. The host cell of claim 48 or 49, where the nucleotide sequence of any one of claims 38 to 44, the recombinant nucleic acid construct of claim 45 or 46, or the vector of claim 47 is integrated into the genome of the host cell.

51. The host cell of any one of claims 48 to 50, wherein said cell is a bacterial, yeast, insect, or plant cell.

52. The host cell of any one of claims 48 to 51, wherein said cell is E. coli or Agrobacterium.

53. The host cell of any one of claims 48 to 51, wherein said cell is a Camelina sativa, Nicotiana benthamiana, or Papaver sominferum cell.

54. A method for producing a protein having N-demethylase activity, the method comprising culturing the host cell of any of claims 49 to 53, wherein said vector of claim 47 is an expression vector, and recovering the thus-produced protein having N-demethylase activity.

55. An isolated or purified non-naturally occurring protein having N-demethylase activity, comprising:

i) the amino acid sequence SEQ. ID NO: 2;

ii) a fragment of the amino acid sequence SEQ. ID NO: 2, wherein the fragment comprises an N- and/or C-terminal truncation of SEQ ID NO: 2; or

iii) a variant of SEQ. ID NO: 2, wherein the variant has at least 85%, 90%, 95%, 98%, or 99% identity with the amino acid sequence SEQ ID NO: 2.

56. The protein of claim 55, wherein the protein is a fragment produced by partial Proteinase K digestion of a polypeptide comprising the amino acid sequence SEQ ID NO: 2.

57. The protein of claim 55 or 56, further comprising a heterologous peptide sequence.

58. The protein of any one of claims 55 to 57, wherein the heterologous peptide sequence is a sequence that targets the protein to a specific subcellular location or the heterologous peptide sequence is a histidine tag.

59. A transgenic plant comprising the nucleic acid of any one of claims 39 to 44, the recombinant nucleic acid construct of claim 45 or 46, or the vector of claim 47.

60. The transgenic plant of claim 59, wherein the nucleic acid of any one of claims 38 to 44, the recombinant nucleic acid construct of claim 45 or 46, or the vector of claim 47 is integrated into the genome of the plant.

61. The transgenic plant of claim 59 or 60, wherein the plant is selected from the group consisting of Camelina sativa, Nicotiana benthamiana, and Papaver sominferum.

Description:
MORPHINAN V-DEMETHYLASE ISOLATED FROM THE METHYLOBACTERIUM THEBAINFRESSER AND METHODS OF USE THEREOF

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of U.S. Provisional Patent Application Serial No. 62/448,164, filed January 19, 2017, which is incorporated herein by reference in its entirety.

INCORPORATION OF SEQUENCE LISTING

[0002] The sequence listing that is contained in the file named "DDSC_47004_158297_Sequence Listing_ST25.txt", which is 12,707 bytes (measured in operating system MS-Windows), created on January 18, 2017, is filed herewith by electronic submission and incorporated herein by reference in its entirety.

BACKGROUND

[0003] Naturally occurring opiates such as thebaine, morphine, and codeine are N- methylated. The N-demethylation of such opiates is a key chemical transformation in the synthesis of synthetic opiates. To date, commercial N-demethylation methods are chemical in nature and typically require toxic and expensive reagents while low in yield (WO 2011/032214A1). The naturally occurring alkaloids atropine, scopolamine, and cocaine, along with the opiate morphine all contain a tertiary N-methylamine group. Modification of the N-methylamine group alters the pharmacological properties of these alkaloids. This can lead to the production of many important pharmaceutical compounds. (Do Pham, DD et al. Green Chem., 2014, 16, 1399-1409).

[0004] For example, the opiate thebaine is a synthetic precursor for the analgesic oxycodone and the intermediate oxymorphone, which is used to prepare the N- methylcyclobutyl-based analgesic nalbuphine and the N-methylcyclopropyl-based opioid antagonists naltrexone, buprenorphine, nalmefene, and the allyl-based naloxone used in the treatment of alcohol or opiate dependence and for rapid opiate detoxification. (Do Pham, DD et al. Green Chem., 2014, 16, 1399-1409; National Academies of Sciences, Engineering, and Medicine. 2017. Pain management and the opioid epidemic: Balancing societal and individual benefits and risks of prescription opioid use. Washington, DC: The National Academies Press; Allen, B.E., et al., U.S. Patent Application Publication No. 2010/0087647 Al). [0005] While many methods of N-demethylating opiates have been described, including the use of chemical reagents, and procedures utilizing photochemistry, electrochemistry, and microorganisms, enzymatic N-demethylation has not been widely described (J. von Braun, Chem. Ber. 1909, 42, 2035; J. H. Cooley, E. J. Evain, Synthesis 1989, 1; K. C. Rice, J. Org. Chem. 1975, 40, 1850; (c) K. C. Rice, E. L. May, J. Heterocycl. Chem. 1977, 14, 665; L. S. Schwab, J. Med. Chem. 1980, 23, 698; H. Merz, K. H. Pook, Tetrahedron 1970, 26, 1727; J. A. Ripper, E. R. T. Tiekink, P. J. Scammells, Bioorg. Med. Chem. Lett. 2001, 77, 443; P. J. Smith, C. K. Mann, J. Org. Chem. 1969, 34, 1821; J. E. Barry, M. Finkelstein, E. A. Mayeda, S. D. Ross, J. Org. Chem. 1974, 39, 3488; K. M. Madyastha, Proc. Indian Acad. Sci. 1994, 106, 1203; K. M. Madyastha, G. V. B. Reddy, J. Chem. Soc, Perkin Trans. 1 1994, 91 1; K. McCamley, J. A. Ripper, R. D. Singer, P. J. Scammells, J. Org. Chem. 2003, 68, 9847-9850; S. Thavaneswaran, P. J. Scammells, Bioorg. Med. Chem. Lett. 2006, 76, 2868-2871; S. Thavaneswaran, K. McCamley, P. J. Scammells, Nat. Prod. Commun. 2006, 7(10), 885-897).

[0006] Thus, there is a need to develop clean and efficient methods of N- demethylating opiates-as well as other drug classes such as the tropane alkaloids-for use in among other things, synthetic opiate production and new drug discovery.

SUMMARY

[0007] Provided for herein are methods for N-demethylating a low molecular weight N-methylated compound. In certain aspects, the methods comprise incubating the TV- methylated compound with an enzyme comprising N-demethylase activity, wherein the enzyme is a morphinan N-demethylase (M D) identified from the Methylobacterium Thebainfresser having N-demethylase activity. In certain aspects, the enzyme is a fragment or variant (including derivatives) thereof having N-demethylase activity. In certain aspects, the enzyme is a fragment or variant (including derivatives) thereof having N- demethylase activity containing a cofactor, such as for example, flavin adenine dinucleotide (FAD). Said incubation converts the low molecular weight N-methylated compound into an N-demethylated compound. In certain aspects, the N-methylated compound is of natural or synthetic origin. In certain aspects, the MND enzyme or fragment or variant thereof is isolated, purified, or isolated and purified. In certain aspects, the MND enzyme or fragment or variant thereof further comprises one or more of benzylisoquinoline alkaloid N-demethylase activity, tropane alkaloid N-demethylase activity, pyrroloindole alkaloid N-demethylase activity, piperidine alkaloid N-demethylase activity, aporphine alkaloid N-dem ethyl ase activity, indole alkaloid N-demethylase activity, or Amaryllidaceae alkaloid N-demethylase activity. In certain aspects, the MND enzyme or fragment or variant (including derivatives) thereof performs the reaction in the presence of a solvent. In certain aspects, the MND enzyme or fragment or variant (including derivatives) thereof performs the reaction in the presence of a nonpolar solvent (e.g., chloroform), an aprotic polar solvent (e.g., tetrahydrofuran), and/or a polar protic solvent (e.g, methanol). In certain aspects, the MND enzyme or fragment or variant thereof has a Vmax of about 0.32 pmol s "1 , K m of about 0.97 μΜ, and/or a K cat of about 6.5 x 10 "4 s "1 , when thebaine is the substrate. In certain aspects, the N-methylated compound is a heterocyclic compound having between 3 and 7 atoms, having between 4 and 7 atoms, having between 5 and 7 atoms, having between 3 and 6 atoms, having between 4 and 6 atoms, having 5 or 6 atoms, or having 6 or 7 atoms, within the ring portion of the compound. In certain aspects, the MND enzyme or fragment or variant thereof comprises: i) the amino acid sequence SEQ. ID NO: 2; ii) a fragment of the amino acid sequence SEQ. ID NO: 2, wherein the fragment comprises an N- and/or C-terminal truncation of SEQ ID NO: 2; or iii) a variant of SEQ. ID NO: 2, wherein the variant has at least 85%, 90%, 95%, 98%, or 99% identity with the amino acid sequence SEQ ID NO: 2. In certain aspects, the MND enzyme fragment is a fragment produced by partial Proteinase K digestion of a polypeptide comprising the amino acid sequence SEQ ID NO: 2. In certain aspects, the N-methylated compound is selected from the group consisting of thebaine, oripavine, (R)-reticuline, salutaridine, salutaridinol, heroin, morphinone, codeinone, codeine, morphine, hydromorphone, oxymorphone, galanthamine, laudanine (laudanidine), orientaline, protosinomenine, isoorientaline, laudanosine, (S)-reticuline, scopolamine, hyoscyamine (atropine), noscapine (narcotine), gramine, (-)-lobeline, physostigmine, isothebaine, (R,S)-autumnaline, and tropinone. In certain aspects, the N- methylated compound is thebaine. In certain aspects, thebaine is converted into the N- demethylated compound N-demethylthebaine (northebaine). In certain aspects, the conversion rate of the N-methylated compound into an N-demethylated compound is at least about 10%, 20%, 30% 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, 99.9%, or 100%). In certain aspects, the conversion rate of thebaine into northebaine is at least about 90%, 95%, 98%, 99%, 99.9%, or 100%. In certain aspects, the N-methylated compound is incubated with the MND enzyme or fragment or variant thereof for at least about 20 minutes, 30 minutes, 1 hour, 2 hours, 4 hours, 6 hours, 8 hours, 12 hours, 18 hours, 24 hours, 48 hours, or 72 hours. In certain aspects, the MND enzyme or fragment or variant thereof is immobilized. In certain aspects, the N-methylated compound is incubated with the MND enzyme or fragment or variant thereof at a temperature of from any of about 4 °C, 15 °C, 20 °C, 25 °C, 30 °C, 37 °C, 42 °C, 48 °C, 55 °C, 60 °C, 65 °C, or 72 °C to any of about 15 °C, 20 °C, 25 °C, 30 °C, 37 °C, 42 °C, 48 °C, 55 °C, 60 °C, 65 °C, 72 °C, or 80 °C. In certain aspects, the N-methylated compound is incubated with the MND enzyme or fragment or variant thereof at a temperature of about 4 °C, 15 °C, 25 °C, 30 °C, 37 °C, 42 °C, 48 °C, 55 °C, 60 °C, 72 °C, or 80 °C. In certain aspects, the N-methylated compound is incubated with the MND enzyme or fragment or variant thereof in a buffered solution, wherein the buffered solution comprises a buffering capacity of from any of about pH 3.0, pH 4.0, pH 5.0, pH 6.0, pH 7.0, pH 8.0, or pH 9.0 to any of about pH 4.0, pH 5.0, pH 6.0, pH 7.0, pH 9.0, pH 10.0, or pH 10.5. In certain aspects, the N-methylated compound is incubated with the MND enzyme or fragment or variant thereof at a pH of about 4.0 in a citrate buffer. In certain aspects, the N-dem ethyl ated compound produced is further modified by the chemical or enzymatic addition of a functional moiety to the demethylated nitrogen. In certain aspects, the method is used in drug discovery.

[0008] Also provided for herein are methods of producing an active N-substituted compound or precursor thereof. In certain aspects, the method comprises using an N- demethylated compound produced by any of the methods disclosed herein as a precursor to produce the active N-substituted compound or precursor thereof. In certain aspects, the active compound or precursor thereof is produced by the chemical or enzymatic addition of a functional moiety to the demethylated nitrogen. In certain aspects, the method further comprises screening the activity of the active compound for drug discovery. In certain aspects, the active compound produced is a pharmaceutical compound. In certain aspects, the active compound produced is a synthetic or semisynthetic opiate. In certain aspects, the synthetic or semisynthetic opiate is selected from the group consisting of oxycodone, oxymorphine, nalbuphine, naltrexone, buprenorphine, naloxone, and nalmefene. In certain aspects, the active compound produced is selected from the group consisting of tropane alkaloids, benzylisoquinoline alkaloids, pyrroloindole alkaloids, piperidine alkaloids, aporphine alkaloids, and Amaryllidaceae alkaloids. In certain aspects, the active compound produced is selected from the group consisting of noratropine, oxitropium, and ipratropium bromide. In certain aspects, an N-methyl group of an N-methylated compound is substituted in the active compound with at least one alkyl group. In certain aspects, the dem ethyl ated nitrogen is substituted with: (i) a neutral N-Rl group, wherein Rl is selected from the group consisting of a methyl group, an allyle group, an isopropyl group, an ethyl group, a propene group, a cyclopropylmethyl group, and a cyclobutylmethyl group, or (ii) a cationic N + -R1 group, wherein Rl is selected from the group consisting of a methyl group and an isopropyl group, a methyl group and an ethyl group, a methyl group and a propene group, a methyl group and a cyclopropylmethyl group, and a methyl group and a cyclobutylmethyl group. In certain aspects, the demethylated nitrogen is substituted with N-Rl or N + -R1 havin a structure selected from the group consisting of:

[0009] Also provided herein are non-naturally occurring nucleic acids comprising a nucleotide sequence that encodes: i) the amino acid sequence SEQ. ID NO: 2; ii) a fragment of the amino acid sequence SEQ. ID NO: 2, wherein the fragment comprises an N- and/or C-terminal truncation of SEQ ID NO: 2; or iii) a variant of SEQ ID NO: 2, wherein the variant has at least about 85%, 90%, 95%, 98%, or 99% identity with the amino acid sequence SEQ ID NO: 2. In certain aspects, the nucleotide sequence has at least about 75%, 80%, 85%, 90%, 95%, 98%, or 99% identity to SEQ ID NO: 1. In certain aspects, the nucleotide sequence comprises SEQ ID NO: 1. In certain aspects, the nucleotide sequence comprises one or more codons preferred for expression in a bacterial, yeast, insect, or plant cell. In certain aspects, the nucleotide sequence is codon optimized for expression in Arabidopsis thaliana. In certain aspects, the nucleotide sequence comprises SEQ ID NO: 3. In certain aspects, any of the nucleotide sequences disclosed herein is operably linked to at least one heterologous transcriptional regulatory sequence. In certain aspects, the heterologous transcriptional regulatory sequence is a promotor sequence. Certain aspects provide for a vector comprising a nucleic acid or recombinant nucleic acid construct disclosed anywhere herein. Certain aspects provide for a host cell comprising a nucleotide sequence discloses anywhere herein. In certain aspects a host cell comprises a recombinant nucleic acid construct or vector discloses anywhere herein. In certain aspects, the nucleotide sequence, the recombinant nucleic acid construct, or the vector is integrated into the genome of the host cell. In certain aspects, the host cell is a bacterial, yeast, insect, or plant cell. In certain aspects, the host cell is E. coli or Agrobacterium. In certain aspects, the host cell is a Camelina sativa, Nicotiana benthamiana, or Papaver sominferum cell.

[0010] Also provided herein are methods for producing a protein having N- demethylase activity. In certain aspects, a method comprises culturing a host cell disclosed anywhere herein, wherein the host cell comprises an expression vector, and wherein the method comprises recovering the thus-produced protein having N-d em ethyl ase activity.

[0011] Also provided for herein are isolated or purified non-naturally occurring proteins having N-demethylase activity, comprising: i) the amino acid sequence SEQ. ID NO: 2; ii) a fragment of the amino acid sequence SEQ. ID NO: 2, wherein the fragment comprises an N- and/or C-terminal truncation of SEQ ID NO: 2; or iii) a variant of SEQ. ID NO: 2, wherein the variant has at least 85%, 90%, 95%, 98%, or 99% identity with the amino acid sequence SEQ ID NO: 2. In certain aspects, the protein is a fragment produced by partial Proteinase K digestion of a polypeptide comprising the amino acid sequence SEQ ID NO: 2. In certain aspects, the protein further comprising a heterologous peptide sequence. In certain aspects, the heterologous peptide sequence is a sequence that targets the protein to a specific subcellular location or the heterologous peptide sequence is a histidine tag.

[0012] Also provided for herein are transgenic plants that comprise a nucleic acid, recombinant nucleic acid construct, or vector disclosed anywhere herein. In certain aspects, the nucleic acid, the recombinant nucleic acid construct, or the vector is integrated into the genome of the plant. In certain aspects, the plant is selected from the group consisting of Camelina sativa, Nicotiana benthamiana, and Papaver sominferum.

BRIEF DESCRIPTION OF THE DRAWINGS

[0013] Figure 1A. Figure 1A shows Thebainfresser morphology: images of

Thebainfresser grown at 28 °C in MODLS + thebaine media using the Nikon Eclipse 800 Microscope with 60X phase oil immersion objective. [0014] Figure IB. Figure IB shows Thebainfresser morphology: images of Thebainfresser spread onto LB + Agar plate.

[0015] Figure 2A. Figure 2A shows un-inoculated thebaine crystals in MODLS

+ thebaine media: microscopic images using the Nikon Eclipse 800 Microscope with 60X phase oil immersion objective were taken of thebaine crystal in un-inoculated MODLS + thebaine media.

[0016] Figure 2B. Figure 2B shows Thebainfresser living on thebaine crystals in MODLS + thebaine media: microscopic images using the Nikon Eclipse 800 Microscope with 60X phase oil immersion objective were taken of thebaine crystal covered in Thebainfresser grown in MODLS + thebaine media.

[0017] Figure 3. Figure 3 shows data from a Thebainfresser temperature optimum assay. A single colony of Thebainfresser was suspended in LB media and evenly distributed into 6 flasks, where two of each were grown at 21 °C, 28 °C, and 37 °C. Growth of Thebainfresser was monitored by absorbance at OD600 for a period of 7 days.

[0018] Figure 4. Figure 4 show data from a Thebainfresser N-demethylase induction assay. Analysis of N-demethylase induction was conducted by detection of northebaine production. Thebainfresser cultured in LB and Thebainfresser cultured in MODLS + thebaine were both washed and then resuspended in MODLS + thebaine and incubated at 28 °C for three days. The experiment was performed in duplicate. Samples were taken prior to induction and twice a day post induction and analyzed for northebaine by FIPLC.

[0019] Figure 5. Figure 5 shows data from an in vivo enzyme assay using candidate N-demethylase enzymes from Thebainfresser expressed in E. coli using thebaine as substrate. The E.coli expression strain BL21 Star (DE3) was individually transformed with the empty vector pET28a and four unknown enzymes (A-D). Each was cultured in Terrific Broth overnight prior to IPTG induction, toluene treatment, and addition of thebaine. Extracts were analyzed by QTRAP 6500. An overlay of MRM transitions specific to northebaine is shown (298.1/251).

[0020] Figure 6. Figure 6 shows data from replicates of a toluene permeabilization assay using unknown enzyme D. E.coli transformed with pET28a empty vector and unknown enzyme D, the enzyme that exhibited prior N-demethylase activity, were cultured in Terrific Broth and induced with IPTG, in duplicate, and subject to toluene treatment and incubation with thebaine. Samples were extracted, analyzed, and quantitated by LC-MS/MS using the QTRAP 6500. [0021] Figure 7. Figure 7 shows SDS-PAGE of MND protein purification from expression in the E. coli strain 'BL21 star (DE3)'. Volumes used for cell suspensions were normalized based on OD600 then concentrated prior to running. Laemmli sample buffer supplemented with β-mercaptoethanol was added to each sample for a total volume of 20 μΐ prior to denaturation at 95 °C and analysis on 10 % SDS-PAGE. Lane 1 : 10 μΐ Low molecular weight ladder (BIO-RAD), Lane 2: 20 μΐ pET28a empty vector cell suspension (post IPTG induction), Lane 3 : 17.6 μΐ MND cell suspension (pre IPTG induction), Lane 4: 12 μΐ MND cell suspension (post IPTG induction), Lane 5: 1 μΐ MND supernatant after cell lysis and centrifugation, Lane 6: 1 μΐ MND pellet after cell lysis and centrifugation, Lane 7: 2 μΐ MND supernatant from TALON resin, Lane 8: 2 μΐ 5 mM imidazole wash, Lane 9: 5 μΐ 100 mM imidazole wash, Lane 10: 10 μΐ eluted protein post PD-10 desalting column. The MND protein is indicated by an arrow.

[0022] Figure 8. Figure 8 shows data from an enzyme assay with His-tag purified MND enzyme from BL21 Star (DE3) cells using thebaine as substrate. Enzyme assays contained 30 mM potassium phosphate buffer pH 8.0, 1500 pmol thebaine, and 100 μΐ enzyme or water in the case of the no enzyme control. The assay containing MND enzyme was performed in duplicate. An empty vector control, a no enzyme control, and a boiled MND enzyme control were also included. Enzyme assays were extracted with ethyl acetate prior to filtration, dilution and analysis by LC-MS/MS using the QTRAP 6500. The chromatograph shows the MRM transition 298.1/251.0, a transition specific to northebaine.

[0023] Figure 9. Figure 9 shows the amount of northebaine produced from assays containing thebaine and purified MND enzyme, pET28a, boiled MND enzyme, or no enzyme. Enzymes were incubated overnight at 30 °C with 1500 pmol of thebaine in 30 mM potassium phosphate buffer (pH 8.0), extracted twice with ethyl acetate, and dried to completion with N2. Dried samples were reconstituted with 150 μΐ of 50 % MeOH then diluted 100 fold prior to LC-MS/MS analysis using the QTRAP 6500.

[0024] Figure 10. Figure 10 shows a purified MND enzyme assay with thebaine as substrate overlain with negative control and standard. Enzyme assays included 36 μg MND enzyme plus 1500 pmol thebaine in 30 mM potassium phosphate buffer pH 8.0 in a total volume of 200 μΐ and were allowed to incubate at 30 °C overnight. Assays were extracted twice with ethyl acetate and diluted 50 fold prior to filtration and analysis by LC-MS/MS with the QTRAP 6500. An empty vector control (pET28a) was also included. Both panels are from the same enzyme assay where the top is an overlay of MRM transition 312.1/251.0 (thebaine specific) and the bottom is an overlay of MRM transition 298.1/251.0 (northebaine specific). The chemical structures of each compound are included. The fragmentation pattern obtained from each compound is also included in their respective panel where the major fragments are indicated.

[0025] Figure 11. Figure 11 shows the amount of northebaine produced by BL21 Star (DE3) and PLUSE cells transformed with pET28a and MND enzyme treated with toluene then incubated with thebaine overnight. PLUSE and BL21 star (DE3) cells transformed with an MND-encoding vector and pET28a were induced with IPTG then normalized using OD600 prior to toluene treatment and overnight incubation with 20 μΜ (200 nmol) thebaine. One half of each culture was extracted and concentrated to 200 μΐ prior to 100 fold dilution and analysis by LC-MS/MS using the QTRAP 6500. Each assay was performed in triplicate.

[0026] Figure 12. Figure 12 shows SDS-PAGE of an MND-encoding vector and pET28a transformed into both BL21 Star (DE3) cells and PLUSE cells used for permeabilization assay. Samples were analyzed by 7% SDS-PAGE pre-induction and post-induction with IPTG. Laemmli sample buffer supplemented with β-mercaptoethanol was added to each sample for a total volume of 20 μΐ prior to denaturation at 95 °C. Lane 1 : 10 μΐ low molecular weight ladder (BIO-RAD), Lane 2: 5.7 μΐ pET28a empty vector in BL21 Star (DE3) cell suspension (pre IPTG induction), Lane 3 : 9.4 μΐ pET28a empty vector in BL21 Star (DE3) cell suspension (post IPTG induction), Lane 4: 3.9 μΐ MND in BL21 Star (DE3) cell suspension (pre IPTG induction), Lane 5: 5.8 μΐ MND in BL21 Star (DE3) cell suspension (post IPTG induction), Lane 6: 5.5 μΐ pET28a empty vector in PLUSE cell suspension (pre IPTG induction), Lane 7: 9.1 μΐ pET28a empty vector in PLUSE cell suspension (post IPTG induction), Lane 8: 4.4 μΐ MND in PLUSE cell suspension (pre IPTG induction), Lane 9: 6.1 μΐ MND in PLUSE cell suspension (post IPTG induction). The GroEL and MND proteins are indicated by an asterisk and an arrow, respectively. Volumes used for cell suspensions were normalized based on OD600 and concentrated prior to running.

[0027] Figure 13. Figure 13 shows SDS-PAGE of MND enzyme purification from expression in the E. coli strain 'PLUSE'. Samples taken throughout protein purification were analyzed by 10% SDS-PAGE. Volumes used for cell suspensions were normalized based on OD600 and concentrated prior to running. Lane 1 : 10 μΐ low molecular weight ladder (BIO-RAD), Lane 2: 22 μΐ pET28a empty vector cell suspension (24 h post IPTG induction), Lane 3 : 25 μΐ MND cell suspension (4 h post IPTG induction), Lane 4: 13.8 μΐ MND cell suspension (24 h post IPTG induction), Lane 5: 1 μΐ MND supernatant after cell lysis and centrifugation, Lane 6: 1 μΐ MND pellet after cell lysis and centrifugation, Lane 7: 2 μΐ MND supernatant from TALON resin, Lane 8: 2 μΐ 5 mM imidazole wash, Lane 9: 5 μΐ 100 mM imidazole wash, Lane 10: 10 μΐ eluted protein post PD-10 desalting column. The MND protein is indicated by an arrow. Volumes used for cell suspensions were normalized based on OD600 and concentrated prior to running. Laemmli sample buffer supplemented with β-mercaptoethanol was added to each sample for a total volume of 20 μΐ prior to denaturation at 95 °C.

[0028] Figure 14. Figure 14 shows data from an enzyme assay with His-tagged MND purified from PLUSE cells using thebaine as substrate. Enzyme assays were performed in duplicate prior to ethyl acetate extraction, filtration, dilution, and analysis by LC-MS/MS using the QTRAP 6500. Only one of the MND replicate assays is shown for clarity.

[0029] Figure 15. Figure 15 shows the amount of northebaine produced from enzyme assays with purified MND enzyme using thebaine as substrate. The MND enzyme assay was performed in duplicate. All assays contained 100 μΐ of purified enzyme (or water), 30 mM potassium phosphate buffer (pH 8.0), and 1500 pmol thebaine and incubated overnight at 30 °C prior to ethyl acetate extraction, filtration, dilution, and analysis by LC- MS/MS with the QTRAP 6500. Quantitation of northebaine was normalized using codeine as an internal standard.

[0030] Figure 16. Figure 16 shows representative compound structures used for MND enzyme substrate testing.

[0031] Figure 17. Figure 17 shows data from a MND enzyme temperature optimum assay. MND enzyme activity for northebaine production was evaluated at various temperatures to identify the temperature optimum. Each assay was performed in duplicate and contained 30 mM potassium phosphate buffer pH 8.0, 100 μΐ purified enzyme, and 1500 pmol of thebaine. Assays were allowed to proceed for 20 min at 20 °C, 25 °C, 30 °C, 40 °C, 50 °C, 60 °C, 70 °C, and 80 °C prior to protein precipitation, dilution, filtration and introduction to the QTRAP 6500 for LC-MS/MS analysis and quantitation. A pET28a empty vector control was included at each temperature.

[0032] Figure 18. Figure 18 shows data from a MND enzyme pH optimum enzyme assay. MND enzyme activity for northebaine production included assays containing 100 μΐ (-36 μg) enzyme purified MND enzyme protein, 1500 pmol thebaine and 60 mM buffer (citrate buffer pH 4, and pH 5; potassium phosphate buffer pH 6, pH 7, pH 8; glycine- NaOH buffer and pH 9, pH 10, and pH 10.5). Samples were incubated at 30 °C for 20 min prior to protein precipitation, dilution, filtration, and introduction into the QTRAP 6500 for LC-MS/MS analysis and quantitation. MND enzyme samples were performed in duplicate. A pET28a empty vector control was included at each pH.

[0033] Figure 19. Figure 19 shows data from a supplemental pH optimum assay with MND enzyme focusing on lower pHs. Each experimental reaction was completed in duplicate in 60 mM buffer with 1500 pmol thebaine and 100 μΐ (-36 μg) enzyme. Error bars show standard deviation. A control reaction was included at each pH using purified empty vector as enzyme. Assays were allowed to incubate for 20 min at 30 °C prior to addition of 2 volumes 100% MeOH and rapid mixing (30 s). Samples were centrifuged then diluted with 50% MeOH and filtered prior to analysis by QTRAP 6500. pH 4 was repeated with glycine-HCl and citrate buffer.

[0034] Figure 20. Figure 20 shows protein quantification of MND enzyme from His- tag purification. The following samples were run on a 10% Mini -PROTEAN TGX precast gel: lane 1) Biorad low range molecular weight marker; lane 2) 3,000 ng BSA; lane 3) 2,000 ng BSA; lane 4) 1,500 ng BSA; lane 5) 1,000 ng BSA; lane 6) 800 ng BSA; lane 7) 5 μΐ MND enzyme sample A; lane 8) 5 μΐ MND enzyme sample B; lane 9) 5 μΐ MND enzyme sample C; lane 10) 5 μΐ pET28a empty vector control. Samples were analyzed using a GelDoc EZ Imager and quantitate using Image Lab version 5.2.1.

[0035] Figure 21. Figure 21 shows determination of MND enzyme kinetic parameters. Kinetic parameters of MND enzyme were determined by enzyme assay using increasing substrate concentrations and quantitation of northebaine production by LC-MS/MS using the QTRAP 6500. The calculated kinetic values were obtained using Prism 7.

[0036] Figure 22. Figure 22 shows an MND enzyme stability assay. MND enzyme (-36 μg) was incubated at 30°C for the times indicated (in triplicate) in 30 mM potassium phosphate pH 7.0 when 1500 pmol of thebaine was added. The reaction was allowed to proceed for 20 min and stopped by addition of 400 μΐ 100% methanol and rapid mixing for 30 seconds. Samples were centrifuged and diluted to a total of 50 fold, filtered, and analyzed by LC-MS/MS using a QTRAP 6500.

[0037] Figure 23. Figure 23 shows the effect of FAD addition and Proteinase K digestion on MND enzyme. Enzyme assays with MND enzyme were performed with and without a prior 1 hour Proteinase K digestion and in the presence and absence of 1500 pmol FAD. Assays contained 30 mM potassium phosphate buffer pH 7.0 and 1500 pmol thebaine and were allowed to proceed for 20 minutes. Assays were stopped by addition of MeOH and mixing. Samples were then centrifuged, diluted, filtered, and analyzed by LC- MS/MS with the QTRAP 6500.

[0038] Figure 24. Figure 24 shows the effects of cofactors and validation of the effect of Proteinase K digestion on MND enzymatic activity. Enzyme assays were completed in duplicate. Assays contained 30 mM potassium phosphate buffer pH 7.0 and 1500 pmol thebaine. Reactions were terminated after 20 minutes by MeOH addition and mixing. Samples were centrifuged, diluted, filtered, and analyzed by LC-MS/MS using the QTRAP6500.

[0039] Figure 25. Figure 25 is a Phylogenetic tree of MND enzyme and closely related protein sequences. Protein sequences for the tree were obtained from the NCBI NR database and Uniprot using the MND enzyme protein sequence and protein BLAST. Top hits were chosen in addition to hits further down the list having better annotation. Each branch is labeled using the database entry followed by the putative function and species for each sequence. Sequences were aligned using the MUSCLE algorithm and the maximum likelihood tree was constructed using MEGA 6.06 with 500 bootstraps. Placement of MND enzyme in the tree is indicated by a star. The closest hit from BLAST is indicated by an asterisk.

[0040] Figure 26. Figure 26 shows that MND contains the cofactor flavin adenine dinucleotide. Samples of MND and empty vector control were either boiled or untreated prior to addition of menthol for protein precipitation. After centrifugation, supematants were analyzed by LC-MS/MS. TIC of each sample, control, and the standards flavin adenine dinucleotide (FAD) and flavin mononucleotide (FMN) are shown.

[0041] Figure 27. Figure 27 shows MND activity testing in industrial solvents. Enzyme assays with 5% and 25% of DMSO, methanol, chloroform, tetrahydrofuran, toluene, heptane, and diethyl ether were performed in duplicate and contained 30 mM potassium phosphate buffer pH 7.0, 36 μg purified enzyme, and 1500 pmol of thebaine. Assays were allowed to proceed for 20 min at 30°C and analyzed by LC-MS/MS using the QTRAP 6500.

[0042] Figure 28. Figure 28 shows MND expression testing in different E.coli strains. His-tagged MND and pET28a empty vector control were transformed into E.coli expression strains PlusSa, BL21 (DE3) pLysS, BL21 Star (DE3), and PlusE. Each strain was cultured, in duplicate, for 24 hours, and induced for 24 hours. Cell density was normalized then subjected to toluene permeabilization and incubation with 20 μΜ thebaine for about 24 hours. Cultures were extracted with ethyl acetate and northebaine production was quantitated by LC-MS/MS using QTRAP 6500.

[0043] Figure 29. Figure 29 shows N-demethylation by MND attached to Sepharose beads. 1 mg of His-tagged purified MND was coupled to CNBr-activated Sepharose™ 4B (GE Healthcare). 1500 pmol of thebaine was added to a stopped up column containing 0.1 M potassium phosphate buffer pH 7.0 for 20 minutes. After sample collection by centrifugation, the column was washed with buffer and 200 μΐ of 0.1 M potassium phosphate buffer pH 7.0 was added in addition to another 1500 pmol of thebaine and incubated overnight. Controls included thebaine incubated with buffer, thebaine added to 200 μΐ of NaHC0 3 wash (after initial ligand binding) with and without residual beads, and to a mix of the acid/base wash as controls. Samples were precipitated with MeOH and analyzed by LC-MS/MS QTRAP 6500.

DETAILED DESCRIPTION

[0044] An enzyme in a pink-pigmented facultative methylotroph has been discovered that N-demethylates thebaine to N-demethylthebaine (northebaine) with 100% conversion in an aqueous solution at 30°C. It has subsequently been demonstrated that this enzyme N- demethylates numerous compounds in addition to thebaine. The methods disclosed herein can thus eliminate the need for toxic chemicals and avoid the concomitant toxic waste of the industrial chemical N-demethylation process currently utilized in the production of synthetic opiates.

Definitions

[0045] To the extent necessary to provide descriptive support, the subject matter and/or text of the appended claims is incorporated herein by reference in their entirety.

[0046] It will be understood by all readers of this written description that the exemplary embodiments described and claimed herein may be suitably practiced in the absence of any recited feature, element or step that is, or is not, specifically disclosed herein.

[0047] It is to be noted that the term "a" or "an" entity refers to one or more of that entity; for example, "a compound," is understood to represent one or more compounds. As such, the terms "a" (or "an"), "one or more," and "at least one" can be used interchangeably herein.

[0048] Furthermore, "and/or" where used herein is to be taken as specific disclosure of each of the specified features or components with or without the other. Thus, the term and/or" as used in a phrase such as "A and/or B" herein is intended to include "A and B," "A or B," "A" (alone), and "B" (alone). Likewise, the term "and/or" as used in a phrase such as "A, B, and/or C" is intended to encompass each of the following embodiments: A, B, and C; A, B, or C; A or C; A or B; B or C; A and C; A and B; B and C; A (alone); B (alone); and C (alone).

[0049] It is understood that wherever aspects are described herein with the language "comprising," otherwise analogous aspects described in terms of "consisting of and/or "consisting essentially of are also provided.

[0050] Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure is related. Numeric ranges are inclusive of the numbers defining the range. Even when not explicitly identified by "and any range in between," or the like, where a list of values is recited, e.g., 1, 2, 3, or 4, the disclosure specifically includes any range in between the values, e.g., 1 to 3, 1 to 4, 2 to 4, etc.

[0051] The headings provided herein are solely for ease of reference and are not limitations of the various aspects or aspects of the disclosure, which can be had by reference to the specification as a whole. Accordingly, the terms defined immediately below are more fully defined by reference to the specification in its entirety.

[0052] As used herein, the term "non-naturally occurring" condition, substance, polypeptide, polynucleotide, composition, entity, plant, organism, individual, and/or any combination thereof, or any grammatical variants thereof, is a conditional term that explicitly excludes, but only excludes, those forms that are well-understood by persons of ordinary skill in the art as being "naturally-occurring," or that are, or might be at any time, determined or interpreted by a judge or an administrative or judicial body to be, "naturally-occurring."

[0053] As used herein, the term "low molecular weight" in reference to an TV- methylated compound means less than about 1,500 Daltons.

[0054] As used herein, the term "identity," e.g., "percent identity" to an amino acid sequence or to a nucleotide sequence disclosed herein refers to a relationship between two or more nucleotide sequences or between two or more amino acid sequences. When a position in one sequence is occupied by the same nucleic acid base or amino acid in the corresponding position of the comparator sequence, the sequences are said to be "identical" at that position. The percentage "sequence identity" is calculated by determining the number of positions at which the identical nucleic acid base or amino acid occurs in both sequences to yield the number of "identical" positions. The number of "identical" positions is then divided by the total number of positions in the comparison window and multiplied by 100 to yield the percentage of "sequence identity." Percentage of "sequence identity" is determined by comparing two optimally aligned sequences over a comparison window. In order to optimally align sequences for comparison, the portion of a nucleotide or amino acid sequence in the comparison window can comprise additions or deletions termed gaps while the reference sequence is kept constant. An optimal alignment is that alignment which, even with gaps, produces the greatest possible number of "identical" positions between the reference and comparator sequences. Percentage "sequence identity" between two sequences can be determined using, e.g., the program "BLAST" which is available from the National Center for Biotechnology Information, and which program incorporates the programs BLASTN (for nucleotide sequence comparison) and BLASTP (for amino acid sequence comparison), which programs are based on the algorithm of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 90(12):5873-5877, 1993).

[0055] As used herein, the term "polypeptide" is intended to encompass a singular "polypeptide" as well as plural "polypeptides," and refers to a molecule composed of monomers (amino acids) linearly linked by amide bonds (also known as peptide bonds). The term "polypeptide" refers to any chain or chains of two or more amino acids, and does not refer to a specific length of the product. Thus, peptides, dipeptides, tripeptides, oligopeptides, "protein," "amino acid chain," or any other term used to refer to a chain or chains of two or more amino acids are included within the definition of "polypeptide," and the term "polypeptide" can be used instead of, or interchangeably with any of these terms. The term "polypeptide" is also intended to refer to the products of post-expression modifications of the polypeptide, including without limitation glycosylation, acetylation, phosphorylation, amidation, derivatization by known protecting/blocking groups, proteolytic cleavage, or modification by non-standard amino acids. A polypeptide can be derived from a natural biological source or produced by recombinant technology, but is not necessarily translated from a designated nucleic acid sequence. It can be generated in any manner, including by chemical synthesis.

[0056] A "protein" as used herein can refer to a single polypeptide, i.e., a single amino acid chain as defined above, but can also refer to two or more polypeptides that are associated, e.g., by disulfide bonds, hydrogen bonds, or hydrophobic interactions, to produce a multimeric protein.

[0057] By an "isolated" polypeptide or a fragment, variant, or derivative thereof or the like is intended a polypeptide that is not in its natural milieu. No particular level of purification is required. For example, an isolated polypeptide can be removed from its native or natural environment. Recombinantly produced polypeptides and proteins expressed in host cells are considered isolated as disclosed herein, as are recombinant polypeptides that have been separated, fractionated, or partially or substantially purified by any suitable technique. An isolated polypeptide or fragment, variant, or derivative thereof or the like can be associated, bound, etc., with a cofactor. Likewise, a purified or purified and isolated polypeptide or fragment, variant, or derivative thereof or the like can be associated, bound, etc., with a cofactor.

[0058] Other polypeptides disclosed herein are fragments, derivatives, analogs, or variants of the polypeptides disclosed herein, and any combination thereof. The terms "fragment," "variant," "derivative" and "analog" when referring to polypeptide subunit or multimeric protein as disclosed herein can include any polypeptide or protein that retain at least some of the activities of the complete polypeptide or protein (for example retain at least some of the enzymatic properties), but which is structurally different. Fragments of polypeptides include, for example, proteolytic fragments, as well as deletion fragments. Variants include fragments as described above, and also polypeptides with altered amino acid sequences due to amino acid substitutions, deletions, or insertions. Variants can occur spontaneously or be intentionally constructed. Intentionally constructed variants can be produced using art-known mutagenesis techniques. Variant polypeptides can comprise conservative or non-conservative amino acid substitutions, deletions or additions. Variant polypeptides can also be referred to herein as "polypeptide analogs." Derivatives are variants of polypeptides that have been altered so as to exhibit additional features not found on the native polypeptide. Examples include fusion proteins. As used herein a "derivative" also refers to a subject polypeptide having one or more amino acids chemically derivatized by reaction of a functional side group. Also included as "derivatives" are those peptides that contain one or more standard or synthetic amino acid derivatives of the twenty standard amino acids. For example, 4-hydroxyproline can be substituted for proline; 5-hydroxylysine can be substituted for lysine; 3-methylhistidine can be substituted for histidine; homoserine can be substituted for serine; and ornithine can be substituted for lysine.

[0059] As used herein, a "single amino acid substitution" means replacing an amino acid residue in a polypeptide sequence with a different amino acid residue (such as replacing the native residue in a wild-type sequence with a non-native amino acid), unless otherwise specified. Also encompassed by the disclosure are a "single amino acid deletion" and/or a "single amino acid insertion."

[0060] A "conservative amino acid substitution" is one in which one amino acid is replaced with another amino acid having a similar side chain. Families of amino acids having similar side chains have been defined in the art, including basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). For example, substitution of a phenylalanine for a tyrosine is a conservative substitution. Methods of identifying nucleotide and amino acid conservative substitutions which do not eliminate protein activity are well-known in the art (see, e.g., Brummell et al, Biochem. 32: 1180-1 187 (1993); Kobayashi et al, Protein Eng. 12(10):879-884 (1999); and Burks et al, Proc. Natl. Acad. Sci. USA 94:.412-417 (1997)).

[0061] The term "polynucleotide" is intended to encompass a singular nucleic acid as well as plural nucleic acids, and refers to an isolated nucleic acid molecule or construct, e.g., messenger RNA (mRNA) or plasmid DNA (pDNA). A polynucleotide can comprise a conventional phosphodiester bond or a non-conventional bond (e.g., an amide bond, such as found in peptide nucleic acids (PNA)). The term "nucleic acid" refers to any one or more nucleic acid segments, e.g., DNA or RNA fragments, present in a polynucleotide. By "isolated" nucleic acid or polynucleotide is intended a nucleic acid molecule, DNA or RNA, which has been removed from its native environment. For example, a recombinant polynucleotide encoding a polypeptide subunit contained in a vector is considered isolated as disclosed herein. Further examples of an isolated polynucleotide include recombinant polynucleotides maintained in heterologous host cells or purified (partially or substantially) polynucleotides in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts of polynucleotides. Isolated polynucleotides or nucleic acids further include such molecules produced synthetically. In addition, polynucleotide or a nucleic acid can be or can include a regulatory element such as a promoter, ribosome binding site, or a transcription terminator.

[0062] As used herein, a "coding region" is a portion of nucleic acid comprising codons translated into amino acids. Although a "stop codon" (TAG, TGA, or TAA) is not translated into an amino acid, it can be considered to be part of a coding region, but any flanking sequences, for example promoters, ribosome binding sites, transcriptional terminators, introns, and the like, are not part of a coding region. Two or more coding regions can be present in a single polynucleotide construct, e.g., on a single vector, or in separate polynucleotide constructs, e.g., on separate (different) vectors. Furthermore, any vector can contain a single coding region, or can comprise two or more coding regions, e.g., a single vector can separately encode a selection marker gene and a gene of interest. In addition, a vector, polynucleotide, or nucleic acid can encode heterologous coding regions, either fused or unfused to a nucleic acid encoding a polypeptide subunit or fusion protein as provided herein. Heterologous coding regions include without limitation specialized elements or motifs, such as a secretory signal peptide or a heterologous functional domain.

[0063] In certain aspects, the polynucleotide or nucleic acid is DNA. In the case of DNA, a polynucleotide comprising a nucleic acid that encodes a polypeptide normally can include a promoter and/or other transcription or translation regulatory elements operably associated with one or more coding regions. An operable association or linkage can be when a coding region for a gene product, e.g., a polypeptide, can be associated with one or more regulatory sequences in such a way as to place expression of the gene product under the influence or control of the regulatory sequence(s). Two DNA fragments (such as a polypeptide coding region and a promoter associated therewith) can be "operably associated" or "operably linked" if induction of promoter function results in the transcription of mRNA encoding the desired gene product and if the nature of the linkage between the two DNA fragments does not interfere with the ability of the expression regulatory sequences to direct the expression of the gene product or interfere with the ability of the DNA template to be transcribed. Thus, a promoter region would be operably associated with a nucleic acid encoding a polypeptide if the promoter was capable of effecting transcription of that nucleic acid. The promoter can be a cell-specific promoter that directs substantial transcription of the DNA only in predetermined cells. Other transcription regulatory elements, besides a promoter, for example enhancers, operators, repressors, and transcription termination signals, can be operably associated with the polynucleotide to direct cell-specific transcription.

[0064] A variety of transcription regulatory regions are known to those skilled in the art. These include, without limitation, transcription regulatory regions that function in vertebrate cells, such as, but not limited to, promoter and enhancer segments from cytomegaloviruses (the immediate early promoter, in conjunction with intron-A), simian virus 40 (the early promoter), and retroviruses (such as Rous sarcoma virus). Other transcription regulatory regions include those derived from vertebrate genes such as actin, heat shock protein, bovine growth hormone and rabbit β-globin, as well as other sequences capable of controlling gene expression in eukaryotic cells. Additional suitable transcription regulatory regions include tissue-specific promoters and enhancers.

[0065] Similarly, a variety of translation regulatory elements are known to those of ordinary skill in the art. These include, but are not limited to ribosome binding sites, translation initiation and termination codons, and elements derived from picornaviruses (particularly an internal ribosome entry site, or IRES, also referred to as a CITE sequence).

[0066] In other aspects, a polynucleotide can be RNA, for example, in the form of messenger RNA (mRNA).

[0067] Polynucleotide and nucleic acid coding regions can be associated with additional coding regions which encode secretory or signal peptides, which direct the secretion of a polypeptide encoded by a polynucleotide as disclosed herein. According to the signal hypothesis, proteins secreted by mammalian cells have a signal peptide or secretory leader sequence which is cleaved from the mature protein once export of the growing protein chain across the rough endoplasmic reticulum has been initiated. Those of ordinary skill in the art are aware that polypeptides secreted by vertebrate cells generally have a signal peptide fused to the N-terminus of the polypeptide, which is cleaved from the complete or "full length" polypeptide to produce a secreted or "mature" form of the polypeptide. In certain aspects, the native signal peptide is used, or a functional derivative of that sequence that retains the ability to direct the secretion of the polypeptide that is operably associated with it. Alternatively, a heterologous mammalian signal peptide, or a functional derivative thereof, can be used. For example, the wild-type leader sequence can be substituted with the leader sequence of human tissue plasminogen activator (TP A) or mouse B-glucuronidase.

[0068] A "vector" is nucleic acid molecule as introduced into a host cell, thereby producing a transformed host cell. A vector can include nucleic acid sequences that permit it to replicate in a host cell, such as an origin of replication. A vector can also include one or more selectable marker gene and other genetic elements known in the art. Illustrative types of vectors include plasmids, phages, viruses and retroviruses.

[0069] A "transformed" cell, or a "host" cell, is a cell into which a nucleic acid molecule has been introduced by molecular biology techniques. As used herein, the term transformation encompasses those techniques by which a nucleic acid molecule can be introduced into such a cell, including transfection with viral vectors, transformation with plasmid vectors, and introduction of naked DNA by electroporation, lipofection, and particle gun acceleration. A transformed cell or a host cell can be a bacterial cell or a eukaryotic cell.

[0070] The term "expression" as used herein refers to a process by which a gene produces a biochemical, for example, a polypeptide. The process includes any manifestation of the functional presence of the gene within the cell including, without limitation, gene knockdown as well as both transient expression and stable expression. It includes without limitation transcription of the gene into messenger RNA (mRNA), and the translation of such mRNA into polypeptide(s). If the final desired product is a biochemical, expression includes the creation of that biochemical and any precursors. Expression of a gene produces a "gene product. " As used herein, a gene product can be either a nucleic acid, e.g., a messenger RNA produced by transcription of a gene, or a polypeptide that is translated from a transcript. Gene products described herein further include nucleic acids with post transcriptional modifications, e.g., polyadenylation, or polypeptides with post translational modifications, e.g., methylation, glycosylation, the addition of lipids, association with other protein subunits, proteolytic cleavage, and the like.

[0071] The term "pharmaceutical composition" refers to a preparation that is in such form as to permit the biological activity of the active ingredient to be effective, and that contains no additional components that are unacceptably toxic to a subject to which the composition would be administered. Such composition can be sterile. [0072] As used herein the term "engineered" includes manipulation of nucleic acid or polypeptide molecules by synthetic means (e.g. by recombinant techniques, in vitro peptide synthesis, by enzymatic or chemical coupling of peptides or some combination of these techniques).

N-demethylation of an N-methylated compound using MND enzyme

[0073] Disclosed herein are methods for N-demethylating an N-methylated compound using an enzymatic reaction, rather than, e.g. a chemical modification. In certain aspects, the N-methylated compound is a low molecular weight N-methylated compound. The TV- methylated compound to be demethylated, unless otherwise indicated herein, is interchangeably referred to as the "substrate". In certain aspects, the substrate has the structure of a naturally-occurring opiate compound.

[0074] Exemplary low molecular weight N-methylated compounds include, but are not limited to, thebaine, oripavine, (R)-reticuline, salutaridine, salutaridinol, heroin, morphinone, codeinone, codeine, morphine, hydromorphone, oxymorphone, galanthamine, laudanine (laudanidine), orientaline, protosinomenine, isoorientaline, laudanosine, (S)-reticuline, scopolamine, hyoscyamine (atropine), noscapine (narcotine), tropinone, physostigmine, isothebaine, (-)-lobeline, gramine, and any of the compounds otherwise listed in Figure 16. In certain aspects, the N-methylated compound is a heterocyclic compound. In certain aspects, a heterocyclic compound has 3, 4, 5, 6, or 7 atoms in the ring portion of the compound. Thus, in certain aspects, a heterocyclic compound has 3 or 4 atoms, between 3 and 5 atoms, between 3 and 6 atoms, between 3 and 7 atoms, 4 or 5 atoms, between 4 and 6 atoms, between 4 and 7 atoms, 5 or 6 atoms, between 5 and 7 atoms, or 6 or 7 atoms within the ring portion of the compound. In certain aspects, the N-methylated compound is a five-ringed morphinan alkaloid. In certain aspects, the N-methylated compound is thebaine which can be converted into the N- demethylated compound N-demethyl thebaine (northebaine).

[0075] The methods disclosed herein of N-demethylating an N-methylated compound comprise incubating the N-methylated compound with an enzyme comprising N- demethylase activity. One of ordinary skill in the art will understand what it means to incubate a substrate with an enzyme, and exemplary parameters such as time, temperature, buffer conditions, and pH are discussed in more detail elsewhere herein. In certain aspects, the enzyme with N-demethylase activity is a morphinan N-demethylase (MND) identified from the Me thy lobacterium Thebainfresser, or a fragment or variant of this enzyme having N-dem ethyl ase activity (referred to collectively herein as the "MND enzyme"). In certain aspects where thebaine is the substrate, the MND enzyme has a V max of about 0.32 pmol s " 1 , K m of about 0.97 μΜ, and/or a K cat of about 6.5 x 10 "4 s "1 . Although the MND enzyme was identified for its morphinan activity, it has been demonstrated that the MND enzyme exhibits N-demethylase activity on a wide range of substrates, and thus the designation of the MND enzyme as a "morphinan" N-demethylase is not limiting on the substrates of the methods disclosed herein.

[0076] Upon incubation of the N-methylated compound (substrate) with the MND enzyme, the substrate is N-demethylated to produce an N-demethylated compound. In certain aspects, the N-methylated compound contains one methylated nitrogen and thus the N-dem ethyl ated compound will no longer contain any methylated nitrogens. In certain aspects, the N-methylated compound contains two or more methylated nitrogens and the N-dem ethyl ated compound contains at least one nitrogen that has been demethylated by the MND enzyme-and thus available for the addition of a new moiety-even if not all the methylated nitrogens on the compound are demethylated. In certain aspects, the TV- methylated compound contains two or more methylated nitrogens and all of the methylated nitrogens are demethylated by the MND enzyme. Unless otherwise specified, where the N-methylated compound contains two or more methylated nitrogens, the N- demethylated compound refers to a compound in which at least one of the methylated nitrogens is demethylated.

[0077] In certain aspects, the MND enzyme is isolated, purified, or isolated and purified. While the MND enzyme activity was identified in a naturally occurring bacterium, the MND enzyme can be recombinantly expressed in other organisms including bacteria, yeast, plants, fungi, insects, birds, mammals, and cells thereof, for example overexpressed in a host cell selected for protein expression, to produce more usable quantities of the MND enzyme. In certain aspects, the MND enzyme can then be isolated from the host cell. The MND enzyme can also be purified by any means of protein purification. In certain aspects, the isolation and/or purification need only result in a crude preparation. In certain aspects, the MND enzyme expressed may be utilized to convert its substrate without isolation and/or purification, such as via cell permeabilization.

[0078] The organism designated Thebainfresser is known to consume thebaine as its sole carbon source to produce northebaine. The term "fresser" is derived from the German infinitive "fressen" which means "to eat," which is used only in reference to animals. The MND enzyme isolated from the Methylobacterium Thebainfresser has the amino acid sequence:

MTEKTPKLGSEAANKLGLEADISRRDMVGGVLIGAGAALLASVAP GAINRALAEGPSRLPPVRGSGTGWRGIEIADDWQGPGGIGDYSKS NGNTGKVIRDAHAGIRNHEFEKRLATASDVNEKYDV11VGAGISG LHSAYDLLRQRPNIKILMLDNHAIFGGEAKQNKMEVDGQALYGGQ GPTLYSFVGDDLPSWKGNPALASIMELKTYPKEFGLPTETTWSDK KTDVKVPVDLWFSMASPSQTDIAYRWEGSGLVKNPLLNSFRDAPV SQKSKDAIALMLAVDNGAKRPVEPVGDVSTWVDNMTYAEFLKKVY GADDEAVQLVDQIDWGTAGLGGDVFNASLAALGLNQYGGIELWN GGLQGLSLPTGNGGVGRSILRKFMPGAIKGGTSLTDTLFGDVNWD VLDHANNNVRIRLNSTWGVQNNETPTGTKDATVFFLHDNRLYKA KGKAVIMGTPQQVNRNVCLNLPNHLSEAMGDFHHAPILWNVALR NWKSMEKAGVSGLRWFGEYPGIGQIVRSMVIDGKEIMPCDPSKPA VMTFYIPMNQATRGMPRGEQAMTARHMLFNLTFADIELLIRDQLT RAFGSYGFDAKRDIAAIVANRWGHALVCAGPGFYTGLNGKPPVSK VITAGWDRVAFGHSDLSGRQAWTVAVNYARTAVANVFPKI

(SEQ ID NO: 2) .

[0079] In certain aspects, the methods of this disclosure comprise incubating an TV- methylated compound with an enzyme comprising N-demethylase activity. One of ordinary skill in the art would recognize that for the purposes of this disclosure, "incubating" refers to an intentional, human executed experimental, commercial, industrial, etc. step that excludes any activity of the enzyme against the substrate that may occur in nature without human intervention. In certain aspects, the MND enzyme comprises the amino acid sequence SEQ ID NO: 2.

[0080] The enzymatic activity of the MND enzyme may not require the full sequence of SEQ ID NO: 2 as the activity or a useful portion thereof could reside in a fragment of the amino acid sequence SEQ ID NO: 2. In certain aspects, the N-methylated compound is incubated with an enzyme comprising a fragment of the amino acid sequence SEQ ID NO: 2. In certain aspects, the fragment can result from an N-terminal truncation of SEQ ID NO: 2, a C-terminal truncation of SEQ ID NO: 2, or a truncation at both ends of SEQ ID NO: 2. In certain aspects, a fragment of SEQ ID NO: 2 can be formed by the internal deletion of one or more regions of SEQ ID NO: 2. In certain aspects, the remaining regions can be linked directly together or linked via a linker sequence. An enzyme comprising a fragment of SEQ ID NO: 2 can be produced by various methods including protease digestion of a peptide having the amino acid sequence of SEQ ID NO: 2 or the expression of a truncated version of SEQ ID NO: 2. In certain aspects, a fragment of the amino acid sequence of SEQ ID NO: 2 is produced by partial or full Proteinase K digestion of a polypeptide comprising the amino acid sequence SEQ ID NO: 2 or a fragment or variant thereof.

[0081] It is well known that the activity of an enzyme can tolerate some variation in the amino acid sequence of the MND enzyme. Further, in some cases, certain changes in the amino acid sequence of an enzyme can increase activity. In certain aspects of the methods herein, the N-methylated compound is incubated with an enzyme having N- methylase activity that has an amino acid sequence that is related to, but varies from, the MND enzyme, i.e., a "variant of MND enzyme." In certain aspects, the variant is a variant of SEQ ID NO: 2. It is understood that variants include variants of any fragment of MND enzyme and/or SEQ ID NO: 2 disclosed elsewhere herein. In certain aspects, a variant has at least about 85%, 90%, 95%, 98%, or 99% identity with the amino acid sequence SEQ ID NO: 2 or a fragment thereof. In certain aspects, a variant has the amino acid sequence SEQ ID NO: 2, or a fragment thereof, with any one of 1 to 100 amino acid substitutions. In certain aspects, one or more of the, or all of the, changes from the amino acid sequence of SEQ ID NO: 2 in the variant are conserved amino acid substitutions. In certain aspects, none of the amino acid substitutions are within the active site(s) of the MND enzyme. In certain aspects, at least one of the amino acid substitutions is within the active site of the MND enzyme. In certain aspects, the fragment or variant of SEQ ID NO: 2 possesses about the same N-demethylase activity against one or more N-methylated substrates in comparison to the MND enzyme comprising the amino acid sequence SEQ ID NO: 2. In certain aspects, the fragment or variant of SEQ ID NO: 2 possess increased N-demethylase activity against one or more N-methylated substrates in comparison to the MND enzyme comprising the amino acid sequence SEQ ID NO: 2.

[0082] The MND enzyme of this disclosure exhibits activity against a broad range of N-methylated substrates. In certain aspects, in addition to morphinan N-demethylase activity, the MND enzyme further comprises one or more of for example benzylisoquinoline alkaloid N-demethylase activity, tropane alkaloid N-demethylase activity, pyrroloindole alkaloid N-dem ethyl ase activity, piperidine alkaloid N-dem ethyl ase activity, aporphine alkaloid N-dem ethyl ase activity, indole alkaloid N-demethylase activity, or Amaryllidaceae alkaloid N-demethylase activity.

[0083] The conversion of the N-methylated compounds into N-demethylated compounds can be very efficient. In certain aspects, at least about 20%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, 99.5%, 99.9%, or 100% of the substrate incubated with the MND enzyme is N-demethylated, i.e., converted into an N- demethylated compound. In certain aspects, the conversion rate of the substrate thebaine into northebaine is at least about 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, 99.5%, 99.9%, or 100%.

[0084] The activity of the MND enzyme is very stable and under certain conditions the MND enzyme can remain active for a long time. In certain aspects, the substrate or a source of substrate is incubated with the MND enzyme for at least about 5 minutes, 10 minutes, 15 minutes, 20 minutes, 30 minutes, 45 minutes, 1 hour, 2 hours, 3 hours, 4 hours, 6 hours, 8 hours, 12 hours, 15 hours, 18 hours, 24 hours, 30 hours, 36 hours, 48 hours, 60 hours, or 72 hours. In certain aspects, the MND enzyme is immobilized such as by attachment to a surface, for example the surface of a bead or the surface of a slide or dish. In certain aspects, the bead is a SEPHAROSE bead. Immobilization of the MND enzyme can allow for incubation of the substrate with the MND enzyme and then facilitate separation of the converted N-demethylated compound from the MND enzyme. This can allow the MND enzyme to be re-used in additional incubations and can avoid the need to "kill" the MND enzyme, such as by heat or pH change after the conversion reaction is completed. In certain aspects, the N-methylated compound is passed over the immobilized MND enzyme, such as in a column, thus converting N-methylated substrate into N- demethylated product in a continuous manner.

[0085] The MND enzyme is active over a wide range of temperatures. In certain aspects, the N-methylated compound is incubated with the MND enzyme at a temperature of about 4 °C, 15 °C, 20 °C, 25 °C, 30 °C, 37 °C, 42 °C, 48 °C, 55 °C, 60 °C, 65 °C, 72 °C, or 80 °C. Thus, in certain aspects, the N-methylated compound is incubated with the MND enzyme at a temperature of from any of about 4 °C, 15 °C, 20 °C, 25 °C, 30 °C, 37 °C, 42 °C, 48 °C, 55 °C, 60 °C, 65 °C, or 72 °C to any of about 15 °C, 20 °C, 25 °C, 30 °C, 37 °C, 42 °C, 48 °C, 55 °C, 60 °C, 65 °C, 72 °C, or 80 °C. One of ordinary skill in the art will understand that the temperature of the conversion reaction can influence factors such as the rate of the conversion, the efficiency of the conversion, non-specific activity, the stability of the MND enzyme, or simply the convenience of whether the incubation mixture needs to be cooled, heated, or at room temperature, e.g., between about 20 °C and 25 °C.

[0086] The MND enzyme is also active over a wide pH range. In certain aspects, the N-methylated compound is incubated with the MND enzyme at a pH of about pH 3.0, about pH 4.0, about pH 5.0, about pH 6.0, about pH 7.0, about pH 8.0, about pH 9.0, about pH 10.0, about pH 10.5, or any range in between. In certain aspects, the N-methylated compound is incubated with the MND enzyme in a buffered solution comprising a buffering capacity of from any of about pH 3.0, pH 4.0, pH 5.0, pH 6.0, pH 7.0, pH 8.0, or pH 9.0 to any of about pH 4.0, pH 5.0, pH 6.0, pH 7.0, pH 9.0, pH 10.0, or pH 10.5. In certain aspects, the N-methylated compound is incubated with the MND enzyme at a pH of about pH 4.0 in a citrate buffer. In certain aspects, the buffer is a phosphate buffer.

[0087] It has been determined that the MND enzyme contains the cofactor flavin adenine dinucleotide (FAD). Thus, in certain aspects, a cofactor, such as FAD, is added to the enzyme (or fragment, variant, and/or derivative thereof) prior to or during incubation with the N-methylated compound.

[0088] It has also been determined that the MND enzyme is active in the presence of a number of solvents, including commercially and/or industrially utilized solvents. One of ordinary skill in the art would recognize that while water is solvent, there are many commercial or industrial solvents used and thus in certain aspects, the solvent is a solvent other than water. As used herein, "in the presence of a solvent other than water" means that water may or may not also be present. In certain aspects, the MND enzyme performs the reaction in the presence of a nonpolar solvent (e.g., chloroform), an aprotic polar solvent (e.g., tetrahydrofuran), and/or a polar protic solvent (e.g, methanol). Thus, representative examples of solvents include, but are not limited to, dimethyl sulfoxide, methanol, chloroform, tetrahydrofuran, toluene, heptane, and diethyl ether. In certain aspects, the concentration of the solvent in the incubation mixture can be at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 99%, or in any range in between. [0089] N-demethylation of a compound according to the methods disclosed herein produces an N-demethylated compound that can be subjected to further chemical or enzymatic modification. In certain aspects, the N-demethylated compound is further modified by the addition of a functional moiety to the demethylated nitrogen as disclosed in greater detail elsewhere herein. In certain aspects, the modification of N-dem ethyl ated compounds produces compounds, including novel compounds that can be used for example in drug discovery, such as by screening the biological activity of the modified compounds.

Production of an active N-substituted compound

[0090] Disclosed herein are methods of producing a biologically active compound (an "active N-substituted compound"), or a precursor to a biologically active compound, from an N-demethylated compound produced by any of the methods for N-demethylating an TV- methylated compound disclosed elsewhere herein. In certain aspects, the N-methylated compound is a low molecular weight N-methylated compound. The active N-substituted compound or precursor can be produced by the chemical or enzymatic addition of a functional moiety at the position of the demethylated nitrogen. In certain aspects, an active compound produced is used in drug discovery such as to screen its biological activity and in certain aspects, the active compound produced is a pharmaceutical compound.

[0091] In certain aspects, the N-methyl group of the N-methylated compound (the MND enzyme substrate), having been removed by N-methylation by the MND enzyme, is substituted in the active N-substituted compound with at least on alkyl group. In certain aspects, the N-methylated nitrogen of the substrate is substituted, after N-demethylation, with a neutral N-Rl group. In certain aspects, Rl is selected from the group consisting of a methyl group, an isopropyl group, an ethyl group, a propene group, a cyclopropylmethyl group, and a cyclobutylmethyl group. In certain aspects, the N-methylated nitrogen of the substrate is substituted, after N-demethylation, with a cationic N + -R1 group. In certain aspects, Rl is selected from the group consisting of a methyl group and an isopropyl group, a methyl group and an ethyl group, a methyl group and a propene group, a methyl group and a cyclopropylmethyl group, and a methyl group and a cyclobutylmethyl group. For example, in certain aspects, the N-methylated nitrogen of the substrate is substituted as:

[0092] In certain aspects, the active N-substituted compound is an opiate. In certain aspects, the active N-substituted opiate is synthetic or semisynthetic. In certain aspects, a synthetic or semisynthetic opiate is selected from the group consisting of oxycodone, oxymorphine, nalbuphine, naltrexone, and nalmefene. In certain aspects, the active compound produced is selected from the group consisting of tropane alkaloids, benzylisoquinoline alkaloids, pyrroloindole alkaloids, piperidine alkaloids, aporphine alkaloids, indole alkaloids, and Amaryllidaceae alkaloids. In certain other aspects, the active compound produced is selected from the group consisting of noratropine, oxitropium, and ipratropium bromide.

[0093] For example, as described in Do Pham et al. Studies on the oxidative N- demethylation of atropine, thebaine and oxycodone using a Fe III-TAML catalyst. Green Chem., 2014,16, 1399-1409, N-demethylation of a precursor compound to form noratropine, followed by N-alkylation with isopropyl bromide and then with methyl bromide, produces the bronchodilator ipratropium bromide. By a similar process, the bronchodilator oxitropium bromide can be formed. As also described by Do Pham et al., thebaine serves as a synthetic precursor for the analgesic oxycodone and the intermediate oxymorphone. Oxymorphone is used to prepare the N-methylcyclobutyl -based analgesic nalbuphine, and the N-methylcyclopropyl-based opioid antagonists naltrexone and nalmefene, used in the treatment of alcohol or opiate dependence and for rapid opiate detoxification. Nucleotides

[0094] Provide herein are nucleic acids comprising a nucleotide sequence that encodes the MND enzyme and/or the amino acid sequence SEQ. ID NO: 2 or a fragment or variant thereof. In certain aspects, the nucleotide sequence encodes a fragment of the amino acid sequence SEQ. ID NO: 2, wherein the fragment comprises an N- and/or C-terminal truncation of SEQ ID NO: 2; or a variant of SEQ ID NO: 2, wherein the variant has at least about 85%, 90%, 95%, 98%, or 99% identity with the amino acid sequence SEQ ID NO: 2. In certain aspects, the nucleotide sequence has at least about 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% identity to SEQ ID NO: 1. In certain aspects, the nucleotide sequence comprises SEQ ID NO: 1.

AT GAC T GAAAAAACGCC TAAAC TAGGC T C T GAAGCCGCAAATAAA CTCGGATTAGAAGCTGACATTTCTCGGCGCGACATGGTGGGCGGC GTATTGATCGGAGCAGGGGCAGCTCTGCTCGCATCCGTTGCGCCA GGGGCGATTAATAGAGCGTTGGCCGAGGGGCCGAGCAGGCTACCG CCGGTGCGTGGTTCCGGTACCGGTTGGCGAGGAATAGAGATCGCC GATGACTGGCAAGGCCCAGGCGGCATCGGGGATTACTCTAAGTCA AACGGCAACACGGGCAAAGTTATCCGTGACGCGCATGCAGGCATT CGGAACCACGAGTTCGAGAAGCGGCTTGCTACGGCGAGTGACGTC AACGAGAAATATGATGTTATTATTGTAGGCGCGGGCATTTCGGGG CTTCACAGCGCCTACGATCTTCTTCGCCAGCGGCCAAATATAAAG ATCCTAATGCTTGACAACCATGCCATCTTCGGTGGGGAGGCAAAG CAAAATAAGATGGAGGTAGACGGTCAGGCGCTGTATGGCGGCCAG GGCCCAACGCTTTACTCCTTCGTTGGTGATGACCTCCCGAGCTGG AAAGGTAATCCGGCCCTCGCATCTATCATGGAACTCAAAACATAT C C CAAG GAG T T C G GAC T T C C GAC C GAAAC T AC AT G GAG C GAT AAG AAGACGGACGTAAAGGTGCCGGTCGACCTTTGGTTCTCCATGGCC AGCCCCTCGCAGACCGACATCGCCTATCGGTGGGAGGGAAGTGGG TTGGTGAAGAATCCTTTGCTGAACTCCTTTCGTGATGCCCCAGTG TCGCAGAAAAGCAAAGATGCCATTGCCCTTATGCTCGCTGTCGAC AACGGCGCGAAGAGGCCTGTTGAACCAGTGGGCGATGTATCGACC T G G G T C GAC AAT AT GAC C T AT G C G GAG T T T C T GAAGAAG G TAT AT GGTGCGGACGACGAAGCTGTTCAGCTTGTCGACCAGATTGATGTC GTTGGAACAGCGGGGCTTGGCGGTGATGTTTTTAATGCAAGCCTT GCAGCACTCGGACTAAATCAATACGGGGGCATCGAGCTCTGGAAC

GGCGGTTTGCAAGGGTTGAGTCTTCCGACCGGAAATGGCGGCGTG GGGCGGTCCATCCTTCGCAAGTTCATGCCAGGAGCTATCAAGGGC GGGACATCGCTCACCGACACGCTTTTCGGTGACGTGAACTGGGAC GTGCTTGACCACGCTAACAACAACGTTCGAATCCGGCTCAATTCA ACTGTCGTAGGTGTTCAGAATAATGAGACGCCGACTGGCACAAAA GATGCGACCGTTTTCTTCCTTCACGATAATCGCCTTTACAAGGCC AAAGGGAAGGCGGTGATCATGGGTACACCGCAGCAGGTCAATCGT AATGTTTGCCTCAATTTGCCAAACCATCTTAGCGAGGCAATGGGC GATTTCCATCATGCTCCGATCCTGGTTGTGAATGTGGCCCTCCGG AACTGGAAATCGATGGAAAAGGCTGGCGTTTCCGGCTTGCGGTGG TTCGGAGAATATCCGGGTATCGGTCAGATAGTTCGATCGATGGTC ATTGACGGCAAAGAGATCATGCCTTGCGATCCCTCGAAACCAGCG GTCATGACCTTCTATATCCCGATGAATCAAGCGACACGGGGCATG CCTCGCGGCGAGCAAGCGATGACCGCCCGCCACATGCTTTTCAAC TTGACGTTCGCAGATATCGAACTGCTCATTCGGGATCAGCTCACT CGTGCGTTCGGATCTTATGGATTTGATGCCAAGCGGGACATCGCT GCCATTGTTGCAAACAGGTGGGGACATGCGCTGGTCTGCGCCGGG CCAGGATTTTACACTGGGCTTAACGGCAAACCGCCCGTCAGTAAG GTGATCACCGCTGGATGGGACCGAGTGGCATTCGGGCATTCGGAC CTTTCCGGCAGACAAGCATGGACCGTGGCCGTAAATTATGCACGG ACAGCGGTTGCGAATGTCTTCCCTAAAATCTGA

(SEQ ID NO: 1) .

[0095] Because of redundancy in the genetic code, the nucleotide codons encoding specific amino acid residues can differ between organisms and there can be bias towards using one of the several codons that encode the same amino acid over the others. Thus, in certain aspects, a nucleic acid sequence comprises one or more codons preferred for expression in a bacterial, yeast, insect, or plant cell. In certain aspects, the nucleotide sequence is codon optimized for expression in Arabidopsis thaliana. In certain aspects, the nucleotide sequence comprises SEQ ID NO: 3. AT GAC T GAGAAGAC AC C T AAAC T TGGAT C T GAGGCAGC TAATAAG TTGGGTTTGGAGGCTGACATCTCTAGGAGAGATATGGTGGGAGGA GTGCTTATCGGTGCAGGTGCTGCTTTGCTTGCATCTGTTGCTCCT GGTGCTATCAACAGGGCTCTTGCAGAAGGACCATCAAGGCTTCCT C C AG T T AGAG G T T C AG GAAC T G GAT G GAG G G G T AT T GAGAT T G C A GAC GAC T G G C AAG GAC C T G GAG GAAT C G GAGAT T AT T C T AAG T C T AACGGTAACACTGGAAAGGTGATCAGGGACGCTCACGCTGGTATC AGGAACCATGAGTTCGAAAAGAGGTTGGCTACTGCTTCTGACGTG AACGAGAAGTACGACGTGATCATCGTGGGAGCTGGTATCTCTGGA C T T C AC TCTGCATACGATTTGTT GAGAC AAAGAC C T AAT AT T AAA ATTTTAATGTTGGACAACCACGCTATCTTCGGTGGTGAGGCTAAG C AGAAC AAGAT G GAG G T T GAC G G T C AG G CAT T G T AC G GAG GAC AG GGACCAACTCTTTACTCATTCGTTGGAGACGATTTGCCTTCATGG AAG G GAAAC CCTGCTTTGG CAT C T AT CAT G GAAC T T AAGAC AT AT C CAAAG GAG T T C G GAC T T C C AAC T GAGAC T AC T T G G T C AGAT AAG AAGACTGACGTTAAGGTTCCAGTGGACCTTTGGTTCTCAATGGCA T C AC C T T C AC AG AC AG AT AT TGCTTACAGGTGG GAG GGTTCTGGT TTGGTGAAAAATCCTTTGCTTAACTCTTTCAGGGATGCTCCAGTT TCTCAGAAGTCAAAGGATGCTATCGCTTTGATGCTTGCAGTGGAC AACGGTGCTAAGAGACCTGTGGAACCTGTTGGAGACGTGTCTACA T G G G T G GAC AAC AT GAC AT AC G C T GAG T T T T T GAAGAAG G T G T AC G GAG C AGAT GAT GAG G C AG T T C AG T T G G T G GAC C AGAT C GAC G T T GTGGGTACAGCTGGACTTGGTGGAGACGTGTTCAACGCATCACTT GCTGCTTTGGGTTTGAACCAGTACGGTGGAATCGAGTTGTGGAAC GGAGGATTGCAGGGTCTTTCTTTGCCAACAGGTAATGGAGGAGTG GGTAGGTCTATCCTTAGGAAGTTCATGCCTGGTGCTATTAAGGGT GGAACATCTTTGACAGATACTCTTTTCGGTGACGTGAACTGGGAC GTTTTGGAT C AC G C AAAC AAC AAT G T GAGAAT TAGAC T T AAT T C T AC AG T G G T T G GAG T G C AGAAC AAC GAGAC T C C T AC AG GAAC AAAA GACGCTACTGTGTTTTTTTTGCATGACAATAGACTTTATAAGGCT AAGGGAAAAGC T GT GAT CAT GGGAACACC T CAGCAGGT GAATAGA AACGTTTGTCTTAACCTTCCAAACCATCTTTCTGAGGCTATGGGT GACTTCCATCACGCTCCAATCTTGGTGGTGAACGTGGCTCTTAGG AACTGGAAATCTATGGAAAAGGCAGGAGTGTCAGGACTTAGGTGG

TTTGGAGAGTACCCTGGTATCGGACAGATCGTTAGGTCTATGGTG ATTGATGGAAAGGAGATTATGCCATGCGACCCTTCTAAGCCAGCT GTTATGACATTTTACATTCCTATGAATCAAGCTACTAGGGGAATG CCAAGAGGAGAGCAGGCTATGACTGCTAGGCACATGCTTTTCAAT CTTACTTTCGCTGATATTGAGTTGCTTATCAGGGACCAGTTGACT AGGGCTTTCGGTTCTTACGGTTTCGACGCTAAGAGGGACATTGCT GCTATCGTGGCTAACAGATGGGGTCATGCTTTGGTTTGCGCTGGT CCTGGATTCTACACTGGACTTAACGGAAAGCCTCCAGTGTCAAAG GTGATCACAGCTGGATGGGACAGAGTGGCTTTTGGACATTCAGAC TTGTCAGGAAGGCAGGCTTGGACAGTTGCTGTGAACTACGCTAGG ACAGCAGTGGCTAACGTTTTCCCTAAGATATGA

(SEQ ID NO: 3)

[0096] Certain aspects provide for a recombinant nucleic acid construct that comprises a nucleic acid sequence encoding the MND enzyme or a fragment or variant thereof as disclosed herein. In certain aspects, the nucleic acid sequence encoding the MND enzyme or a fragment or variant thereof is operably linked to at least one transcriptional regulatory sequence. For example, in certain aspects, the sequence is operably linked to transcription regulatory sequences such as promoters, transcription terminators, enhancers, etc. The operable link can be direct or indirect, i.e. with or without intervening sequences, such as internal ribosome entry sites (IRES). The regulatory sequences can be endogenous to the coding sequence, i.e. they are the regulatory sequences naturally associated with the MND enzyme-encoding sequence in the genome of Methylobacterium Thebainfresser. Alternatively, the regulatory sequences can be heterologous. In this latter case the resulting construct comprises a coding sequence derived from Methylobacterium Thebainfresser operably linked to at least one heterologous transcription regulatory sequence. In certain aspects, the heterologous transcription regulatory sequence is a promoter sequence. Promoter sequences can include constitutive promoters, tissue specific promoters, developmentally specific promoters, and/or inducible promoters, depending upon what type of expression is sought. Examples of such promoters include but are not limited to the tissue specific promoters glycinin promoter, napin promoter, and oleosin promoter, the constitutive promoters 35 S promoter and cauliflower mosaic virus promoter and inducible promoters such as T7 promoter and AOX1 promoter. In certain aspects, the heterologous transcription regulatory sequence is a terminator sequence or other 3' regulatory regions. Several such terminators are available and known in the art (e. g. tml from CaMV, E9 from rbcS). Examples of such terminators include but are not limited to prokaryotic terminators t7 and rrnB, and engineered terminators such as TO.

[0097] Certain aspects provide for a vector comprising a recombinant nucleic acid construct described herein. A vector can be used to transfer a nucleic acid sequence (e.g., a shuttle vector) or to express the gene product of a nucleic acid sequence (e.g., expression vector). The nucleic acid molecules of the disclosure can be used to transform or transfect eukaryotic and prokaryotic cells. Thus, in certain aspects, the vector is inserted into a host cell so that the host cell comprises the vector. In certain aspects, the host cell is a bacterial, yeast, insect, avian, plant, or mammalian cell. In certain aspects, the host cell is a bacterial cell such as E. coli or Agrobacterium. In certain aspects, the host cell is a plant cell such as from Camelina sativa, Nicotiana benthamiana, or Papaver sominferum. In certain aspects, a nucleic acid of the disclosure (e.g., a nucleic acid that encodes the MND enzyme or a fragment or variant thereof, a construct, a vector, etc.) is used to create a transgenic plant. Thus, in certain aspects, a transgenic plant comprises a nucleic acid that encodes the MND enzyme or a fragment or variant thereof or a recombinant construct comprising such nucleic acid. In certain aspects, the plant is Camelina sativa, Nicotiana benthamiana, or Papaver sominferum. In certain aspects, the nucleic acid is integrated into the genome of a host cell or transgenic organism such as a transgenic plant.

[0098] Host cells comprising a nucleic acid sequence encoding the MND enzyme and/or a construct and/or vector disclosed herein can be used to produce a protein having an N-demethylase activity as described herein. Such host cells are provided by this disclosure. Thus, certain aspects provide for a method of producing a protein having N- demethylase activity. Such method comprises culturing a host cell comprising an expression vector under conditions conventional in the art for protein expression and recovering the MND enzyme having N-demethylase activity produced by the host cell. Certain aspects provide for producing an MND enzyme under conditions conventional in the art for protein expression and recovering the MND enzyme having N-demethylase activity produced by the host cell. Morphinan N-Demethylase (the MND Enzyme)

[0099] This disclosure also provides for a protein having N-demethylase enzymatic activity referred to herein as the morphinan N-demethylase (MND) enzyme, although its N-demethylase activity is not limited to morphinan. In certain aspects, the protein is isolated, purified, or isolated and purified. In certain aspects, the protein is non-naturally occurring. In certain aspects, the protein comprises the amino acid sequence SEQ ID NO: 2. As discussed elsewhere herein, the protein can comprise a fragment or variant of SEQ ID NO: 2 having N-demethylase activity. Thus, in certain aspects, the protein comprises a fragment of the amino acid sequence SEQ. ID NO: 2, wherein the fragment comprises an N- and/or C-terminal truncation of SEQ ID NO: 2; a variant of SEQ. ID NO: 2, wherein the variant has at least 85%, 90%, 95%, 98%, or 99% identity with the amino acid sequence SEQ ID NO: 2; or any other fragment or variant disclosed anywhere herein. In certain aspects, the protein is a fragment produced by full or partial Proteinase K digestion. In certain aspects, the protein comprises an internal deletion within the MND enzyme and/or SEQ ID NO: 2 sequence. In certain aspects, the remaining sequence is directly linked or linked by one or more linker sequences.

[0100] In certain aspects, the amino acid sequence of a protein having N-demethylase activity comprises a heterologous peptide sequence such as a signal peptide that can target the protein to a specific subcellular location or a peptide that can aid in the isolation and/or purification of the peptide, such as a histidine tag.

[0101] The following examples are included to demonstrate certain embodiments of the disclosure. It should be appreciated by those of skill in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function well in the practice of the disclosure. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the disclosure. EXAMPLES

Materials and Methods

1.0. Culture medium and conditions.

[0102] Thebainfresser was routinely cultivated by addition of 10 ml of frozen stock in 90% culture medium/10% DMSO into 75 ml of MODLS + thebaine media. Cultures were maintained by diluting ½ with fresh media (usually 35 ml culture + 25 ml of fresh media). Cells were grown at 21 °C or 28 °C at 140 rpm. Cultures were streaked or serially diluted on Potato Dextrose Agar media, MODLS + thebaine, MODLS + thebaine + vitamins media, and LB media to obtain single colonies of different organisms. Once single colonies of each organism were obtained and cultured, enzyme activity was monitored using N-demethylation of thebaine by TLC and later confirmed by LC -MS/MS using the QTRAP 4000.

2.0. Temperature optimum for Thebainfresser bacterial growth.

[0103] A single colony of Thebainfresser was resuspended in 7 ml of LB media and 1 ml was aliquoted to each of 6 flasks containing 25 ml LB media. 500 μΐ were removed from each flask to serve as time point 1. Thebainfresser was cultivated at three temperatures; 21°C, 28°C, and 37°C, 200 rpm, two flasks in each. The OD600 was measured every day for 7 days.

3.0. Detection of thebaine conversion to northebaine by TLC.

[0104] A solvent system consisting of toluene, acetone, ethanol, and ammonia (45:45: 14:6) and TLC silica gel 60 F254 or Polygram SIL G/UV254 0.2 mm silica gel with florescent UV254 indicator was used for separation of thebaine and northebaine. 5 nmol of standard and 10-20 μΐ of culture were loaded onto the plate prior to placing in the solvent.

4.0. Detection of thebaine conversion to northebaine by LC-MS MS.

4.1 QTRAP 4000 conditions.

[0105] Samples were extracted with chloroform under basic conditions or ethyl acetate and analyzed on a QTRAP 4000 by 10 μΐ injections using a flow rate of 0.5 ml/min. Source parameters included detection in the positive mode, CUR 30, CAD High, IS 5500, TEM 500, GS1 50 and GS2 55. Compounds were separated using a Gemini C-18 column (150 x 2.00 mm, 5 micron 110 A; Phenomenex) and the following gradient using solvents A (5 % acetonitrile/ 5 % MeOH/ 10 mM ammonium acetate/ 15 mM L OH) and B (90 % acetonitrile/ 10 mM ammonium acetate/ 15 mM H40H): 0-2.0 min 0 % B, 2.0-9.0 min 0-44 % B, 9.0-11.0 min 44-100 % solvent B, 11-13 min 100 % B and 100-0 % solvent B from 13-14 min and held at 0 % B for an additional 4 min. MRM (multiple reaction monitoring) scans detecting ion fragments 312.1/251 and 312.1/221 for thebaine and 298.1/251.0 and 298.1/236.0 for northebaine and EPI (enhanced product ion) scans for masses 312 m/z for thebaine and 298 m/z for northebaine were used with a DP (declustering potential) of 40 and a CE (collision energy) of 35. Data were analyzed using Analyst 1.5.

4.2 QTRAP 6500.

4.2.1 Method 1.

[0106] Samples were injected with an Eksigent ekspert microLC 200 system using a 15 μΐ/ min flow rate and separated using a PLRP-S column (100 X 0.5 mm, 3 μ, 100 A, Higgins Analytical) prior to introduction to the QTRAP 6500 using the following solvents and binary gradient: solvent A (0.05 % formic acid/ 0.01 % H40H in H20), solvent B (0.05 % formic acid/ 0.01 % H40H in 90 % acetonitrile), and 0-2.0 min 5 % B, 2.0-7.0 min 0-45 % B, 7.0-7.5 min 45-100 % solvent B, 7.5-9.0 min 100 % B and 100-5 % solvent B from 9.0-9.5 min and held at 5 % B for an additional 5.5 min. Detection proceeded in the positive mode with source parameters including CUR 20, CAD - 3, IS 5500, TEM 200, GS1 20, and GS2 20. Compounds were detected by MRM using CE 35, DP 85, Dwell 75 msec, and CXP 14 and detecting ion fragments 312.1/251 and 312.1/221 for thebaine and 298.1/251.0 and 298.1/236.0 for northebaine. EPI scans for masses 312 m/z for thebaine and 298 m/z for northebaine were used to qualitatively identify peaks initially, but not included for quantitation. Data were analyzed using Analyst 1.6.2. Quantitation was achieved by standard curve using peak area.

4.2.2 Method 2.

[0107] Samples were injected by the UFLCXR (Shimadzu) autosampler/ pump system using a 0.5 ml/min flow rate prior to separation using a Gemini C-18 column (150 x 2.00 mm, 5 μ, 110 A; Phenomenex) and introduction into the QTRAP 6500. The program consisted of a binary gradient with solvent A (5 % acetonitrile/ 5 % MeOH/ 10 mM ammonium acetate/ 15 mM H40H) and solvent B (90 % acetonitrile/ 10 mM ammonium acetate/ 15 mM H40H) and the following gradient: 0-2.0 min 0 % B, 2.0-9.0 min 0-44 % B, 9.0-11.0 min 44-100 % solvent B, 11-13 min 100 % B and 100-0 % solvent B from 13-14 min and held at 0 % B for an additional 8 min. EPI scans for each substrate and predicted product mass were included for each enzyme assay and source parameters were identical to those used for the QTRAP 4000 above except CE and DP which were based upon the optimum for each individual compound. Masses used can be found in table 3. Data were analyzed using Analyst 1.6.2.

4.2.3 Method 3.

[0108] Parameters were the same as method 2 except for the column, solvent A, and the gradient. Samples were separated using a Luna C-8 (2) 5μ 240 x 4.61 mm (Phenomenex). Solvent A was 10 mM ammonium acetate/ 15 mM H40H in H20, and the following gradient was used: : 0-4.0 min 1.5 % B, 4.0-9.0 min 1.5 -44 % B, 9.0-11.0 min 44-100 % solvent B, 11-13 min 100 % B and 100-1.5 % solvent B from 13-14 min and held at 0 % B for an additional 8 min. Masses for EPI scans can be found in table 3.

4.2.4 Method 4.

[0109] Source parameters, hardware, and column were identical to Method 2. The program consisted of a 10 ul injection volume with a binary gradient including solvent A (0.1% formic acid in water) and solvent B (0.1 % formic acid in acetonitrile) with the following gradient: 0-2.0 min 5 % B, 2.0-32.0 min 5-95 % B, 32.0-35.0 min 95 % solvent B, 35.0-36.0 min 95 %-5 % B, and held at 5 % B for an additional 9 min. The method included EPI scans for 786 m/z (FAD) and 457 m/z (FMN) with a DP of 85 and a CE of 25, and an EMS scan (200-1000 m/z).

5.0 Microscopy images.

[0110] Microscopy images were obtained using the Nikon Eclipse 800 Microscope with a 60x phase oil immersion objective, 1.4 NA— pixel size 170 nm (used in FIJI/Image J to measure sizes or make scale bar).

6.0 Induction experiment.

[0111] Thebainfresser was grown both in MODLS + thebaine (2 x 70 ml cultures) for several weeks and started fresh from single colonies in LB media (2 x 70 ml cultures). The cultures grown in MODLS + thebaine were sub-cultured (20 ml of culture into 50 ml fresh MODLS + thebaine) 7 days prior to cell collection and washing. Cultures growing in LB media were inoculated 11 days prior to cell collection and washing. 50 ml of each culture were collected by centrifugation (3,000 x g, 20 min, RT) and washed three times with 10 ml of MODLS (no thebaine added). The final pellets were resuspended in 20 ml of MODLS + thebaine. 1 ml of each resuspended culture was used to measure the OD600. The OD600 was used to normalize the density of each culture in a final volume of 50 ml MODLS + thebaine. Cultures were incubated at 28°C, 200 rpm, and allowed to grow for 3 days. A 0.5 ml sample was taken prior to cell washing and twice each day post induction.

[0112] Samples were diluted 1 : 1 with 100 % methanol followed by rapid mixing and centrifugation (16,000 x g, 10 min, RT) prior to HPLC analysis using a Waters 1525 binary HPLC pump coupled with a Waters 2998 photo diode array detector. Samples were injected (50 μΐ) by a Waters 2707 autosampler prior to compound separation by a Gemini C-18 column (150 x 2.00 mm, 5 μ, 110 A, Phenomenex). The HPLC method utilized a flow rate of 0.5 ml/min and a binary gradient (solvent A: 5 % acetonitrile/ 5 % MeOH/ 10 mM ammonium acetate/ 15 mM H40H and solvent B: 90 % acetonitrile/ 10 mM ammonium acetate/ 15 mM H40H), 0-2.0 min 0 % B, 2.0-9.0 min 0-44 % B, 9.0-11.0 min 44-100 % B, 11-13 min 100 % B and 100-0 % B from 13-14 min and held at 0 % B for an additional 4 min. Samples were quantitated by standard curve using peak area with the software Breeze 2.

7.0 Induction experiment for RNA extraction.

[0113] A single colony resuspended in 5 ml of LB was used to inoculate three 1L flasks (A-C) containing 250 ml of LB media where 1 ml of the suspension was aliquoted to each flask. The cultures were incubated at 28°C for 7 days then moved to 21°C. Flask A, cultured only in LB media, was collected at day 8 by centrifugation (3000 x g, 15 min, 4°C) and the final pellet was resuspended in 4 ml of RNA-later (Ambion). The culture was then collected by centrifugation (3000 x g, 15 min, 4°C) and flash frozen in a dry ice/ ethanol bath. This culture was considered to not be induced for MND enzyme production.

[0114] Flasks B-C were collected at day 7 and washed (2000 X g, 15 min, RT) 3 times with 20 ml of MODLS + thebaine and resuspended in 10 ml of MODLS + thebaine and moved to a 1 L flask containing 250 ml of MODLS + thebaine immediately prior to incubation at 21°C. Suspension of cells previously grown in LB then cultured in MODLS + thebaine was performed to induce the production of MND enzyme. Cells were collected at 3 days (flask B) and 6 days (flask C) post-induction in the same manner as flask A.

[0115] Prior to collection, 1 ml of each culture was plated on LB media to check for contamination and 200 μΐ of each sample were extracted by addition of 400 μΐ sodium carbonate buffer (pH 9.5) and 400 μΐ chloroform by 1 min rapid mixing followed by centrifugation (16,000 x g 2 min RT). The extraction was repeated once and dried to completion under N2. Dried samples were resuspended in 200 μΐ 80% methanol and diluted 1/10 before LC-MS/ MS analysis. 8.0 RNA extraction.

[0116] Modified from Giannoukos et al. Genome Biology 2012 13 : r23. Cells were thawed on ice and aliquoted into 16 X 1.5 ml sterile, RNAse free microcentrifuge tubes and pelleted by centrifugation (4,000 x g, 10 min, RT). The supernatant was removed and the cells were resuspended in 200 μΐ of bacterial lysis buffer (30 mM tris HCl pH 8.0, 1 mM EDTA), 15 mg/ ml lysozyme, and 20 μΐ of proteinase K. Samples were incubated at room temp for 10 min with rapid mixing for 10 s every 2 min. RNA extraction proceeded using RLT buffer in the RNeasy Plant Mini Kit (QIAGEN). The remainder of the extraction followed manufacturer's instructions. Samples were combined and precipitated using LiCl precipitation solution (Ambion) and resuspended in 100 μΐ of RNase free H20. RNA was treated with DNase (Ambion) and followed by another LiCl precipitation. RNA concentration was determined using a nanodrop and RNA integrity was evaluated with an Agilent Bioanalyzer 2100.

9.0 Transcriptome sequencing.

[0117] Library preparation and sequencing using Illumina Miseq (2X300 run, 40 million reads) was performed by MOgene (www.mogene.com/).

10.0 Transcriptome assembly.

[0118] Raw reads were first normalized using Fulcrum (Burriesci et al. 2012) prior to trimming with Trimmomatic (Bolger et al. 2014) for quality processing. Trimmed reads were then assembled with Trinity (Grabherr et al. 2011) using default parameters. A homology search was performed on the resulting contigs using BLAST (http://blast.ncbi.nlm.nih.gov/Blast.cgi). Blast results were analyzed and filtered for quality in two steps. First, contigs were filtered based upon the meaningfulness of the alignment. Contigs having a 15% or lower alignment with an existing sequence in the BLAST NR database were discarded. However, contigs having a length over 500 base pairs were kept, regardless of alignment. Second, contigs were filtered based upon homology to known organisms by an additional BLAST search against a database consisting of human, zebrafish, Arabidopsis, C. elegans, D. melanogaster, and yeast. Contigs with significant alignment to these organisms were discarded. The remaining contigs were used for determining expression data with BWA (Burrows-Wheeler Aligner) (Li and Durbin 2009) in conjunction with the raw reads. Open reading frames in the filtered set of contigs were estimated using an in-house algorithm based upon BLAST. 11.0 HAYSTACK analysis and candidate gene selection.

[0119] Production of northebaine on induced cultures was quantitated and used as the model for HAYSTACK (http://haystack.mocklerlab.org/) (Mockler et al. 2007) to mine the expression data of Thebainfresser contigs (gene transcripts). The following parameters were used for HAYSTACK: correlation cutoff 0.8, fold cutoff 2, P-value 0.05, and background cutoff 1. Transcripts that demonstrated an induction pattern that mirrored the accumulation of northebaine in the Thebainfresser-inoculated culture medium were selected for further analysis.

12.0 Thebainfresser DNA extraction.

[0120] Thebainfresser was grown for 2 months in MODLS + thebaine media prior to DNA extraction. DNA extraction was as previously described (William and Copeland 2012).

13.0 Cloning of candidate genes.

[0121] Four genes belonging to two clusters were chosen for analysis based upon their expression pattern and putative annotation. The MND enzyme coding region was cloned using primers for nested PCR and addition of restriction sites for cloning (respectively). comp7134_5_UTR_f: CCGATCTATGACGGGATATCTGGGA (SEQ ID NO: 4) Tfcont_9516_3UTR_RC: TGACGCGACAATCCCTCTACC (SEQ ID NO: 5) comp7134_f_NdeI: C AC C AT ATGAC TGA A A A A AC GCC T A A AC TAG (SEQ ID NO: 6)

Tfcont_9516_EcoRI_RC :

GAGGAATTCTCAGATTTTAGGGAAGACATTCGCAACC (SEQ ID NO: 7).

[0122] PCR was performed using Phusion DNA polymerase (NEB) and the following cycle parameters: 98°C 30 sec (1 cycle), 98°C 10 sec 56°C 30 sec, 72°C 2 min (35 cycles), and 72°C 10 minutes (1 cycle). The MND enzyme-encoding fragment was digested and ligated into the pET28a expression vector and cloned into both BL21 Star (DE3) (Invitrogen) and PLUSE expression strains of E. coil. PLUSE cells also expression the GroES and GroEL chaperone proteins to assist in protein folding.

14.0 Toluene permeabilization assay.

14.1 Multiple enzyme testing.

[0123] Enzyme activity was first tested by permeabilization of BL21 Star (DE3) cells expressing each gene by toluene treatment based upon work by Paoni N. and Koshland D (Paoni and Koshland 1979). Cells growing at 30°C in terrific broth (TB) were first induced with 100 μΜ IPTG and transferred to 16°C for 24 hours. Cells were collected by centrifugation (3,000 X g, 10 minutes, RT) then reconstituted in TB to a final OD of 2. Toluene was added to 5 ml of each culture to 0.14% and subject to rapid mixing for 2 minutes. Cultures were incubated for 1 hr at 30°C. Cells were then collected by centrifugation (3,000 X g, 10 minutes, 4°C), washed with 5 ml TB, and brought up to a total volume of 10 ml in TB. Thebaine was added to a final concentration of 20 μΜ ( 200 nmol) and cultures were incubated overnight at 30°C, 200 rpm. The next day, 5 ml of each culture was extracted with equal volume ethyl acetate and dried to completion with N2. Dried samples were resuspended in 200 μΐ of 80% MeOH before diluting 1/100 for analysis on the QTRAP 6500 using method 1.

14.2 MND enzyme activity verification, performed in triplicate testing two individual colonies encoding the MND enzyme and pET28a in BL21 Star (DE3).

[0124] Two individual colonies of both pET28a empty vector and MND enzyme- encoding clones in pET28a (with N-terminal His-tag) transformed into BL21 Star (DE3) cells were cultured, in triplicate, in 2.5 ml of terrific broth (TB) (Tartoff and Hobbs 1987) media supplemented with 50 μg/ml kanamycin at 30°C overnight. The next morning, after the addition of 4.5 ml of TB media, cultures were induced with 100 μΜ IPTG and transferred to 16°C for 24 hours. Cells were collected by centrifugation (3,000 X g, 10 minutes, 4°C) then reconstituted in TB to a final OD of 2. Toluene was added to 5 ml of each culture to 0.14% and mixed gently for 2 minutes. Cultures were incubated for 1 hr at 30°C, 200 rpm. Cells were then collected by centrifugation (3,000 X g, 10 minutes, 4°C), washed with 5 ml TB, and brought up to a total volume of 10 ml in TB. Thebaine was added to a final concentration of 20 μΜ and cultures were incubated overnight at 30°C, 200 rpm. The next day, 5 ml of each culture was extracted with equal volume ethyl acetate and dried to completion with N2. Dried samples were resuspended in 200 μΐ of 80% MeOH before diluting 1/100 for analysis on the QTRAP 6500 using method 1.

15.0 Protein purification.

[0125] Protein purification was performed on pET28a empty vector and the MND enzyme-encoding clones in pET28a (with N-terminal His-tag) transformed into BL21 Star (DE3) and PLUSE expression strains by culturing cells in 300 ml of TB media supplemented with 50 μg/ml kanamycin (BL21 Star (DE3) cells) or 50 μg/ml kanamycin and 32 μg/ml of chloramphenicol (PLUSE cells) for 12 hours at 30°C, 180 rpm. The following day, 600 ml of TB media supplemented with appropriate antibiotics and IPTG (100 μΜ final concentration) were added to each culture. Cells were incubated at 15°C, 180 rpm for 24 hours. Cells were then collected (8,000 x g, 10 min, RT), resuspended in 15 ml EB (50 mM sodium phosphate pH 7.0, 300 mM NaCl, 10% glycerol), and lysed. Lysis was achieved by addition of 1.75 ml of 10 mg/ml lysozyme followed by incubation at RT for 20 min with gentle agitation and sonication (repeat 5x, 15 s each, pulse 5). Cell lysate was collected (20,000 x g, 20 min, 4°C) and added to 2 ml TALON metal affinity resin (Clontech) which was washed twice with EB (700 x g, 2 min, 4°C) and kept on ice. The resin/protein mixture was incubated for 1 h on ice with gentle shaking then washed twice with 20 ml EB (700 x g, 5 min 4°C). The final pellet was resuspended in 2.5 ml of EB and placed into a Talon 2 ml disposable gravity column (Clontech). The protein bound resin was washed with EB supplemented with 5 mM imidazole and the protein was eluted with EB supplemented with 100 mM imidazole. The protein was then desalted using 50 mM potassium phosphate buffer pH 8.0 and PD-10 Desalting Columns (GE Healthcare) according to manufacturer's instructions.

16.0 Purified MND Enzyme assays using thebaine as substrate.

16.1 Enzyme assay with MND enzyme purified from BL21 Star (DE3) cells.

[0126] Enzyme assays included 100 μΐ of MND enzyme purified protein preparations (-36 μg) incubated in 30 mM potassium phosphate buffer (pH 8.0) with 7.5 μΜ (1500 pmol) thebaine, in a total volume of 200 μΐ at 30°C overnight. No enzyme, boiled enzyme, and purified empty vector controls were included. MND enzyme was heated for 10 min at 95 °C for the boiled enzyme control. Assays were extracted twice with 400 μΐ ethyl acetate and dried to completion with N2. Dried samples were resuspended in 150 μΐ of 50% MeOH, filtered (low protein binding 0.2 μπι, 4 mm, PTFE; Millipore), and diluted 100 fold prior to LC -MS/MS analysis on the QTRAP 6500 using method 1 and a 5 μΐ injection volume.

16.2 Enzyme assay on MND enzyme purified from PLUSE cells.

[0127] Enzyme assays using the purified MND enzyme protein produced in PLUSE cells were done, in duplicate, as indicated above with the following exceptions: 10 μΐ of 100 μΜ codeine standard was added to each assay, prior to extraction, as an internal standard. Dried extracts were resuspended in 200 μΐ of 50 % MeOH, filtered (0.2 μπι, 4 mm, PTFE, Millipore). Samples were diluted 50 fold prior to a 5 μΐ injection onto the QTRAP 6500 using method 1. 17.0 SDS-PAGE for protein purification.

17.1 SDS-PAGE for MND enzyme from BL21 Star (DE3) cells.

[0128] Protein purification was analyzed by SDS-PAGE using the Mini-PROTEAN Tetra system and a 10% mini -PROTEAN TGX gel. Samples were denatured in Laemmli Sample Buffer supplemented with β-mercaptoethanol by heating at 95°C for 10 minutes prior to analysis.

17.2 SDS-PAGE on MND enzyme in PLUSE cells.

[0129] Samples for SDS-PAGE were processed as above and analyzed on a 10% gel.

17.3 SDS-PAGE for protein quantitation.

[0130] His-tag purified MND enzyme samples were quantitated with SDS-PAGE using the method in 17.1 by comparison to pure BSA standards. The following concentrations of BSA were run alongside MND enzyme samples: 3,000 ng, 2,000 ng, 1,500 ng, 1,000 ng, and 800 ng. MND enzyme samples were run in triplicate. Imaging was performed with a GelDoc EZ Imager (BIORAD) and the samples were quantitated using Image Lab version 5.2.1. The band running at the predicted size of 72 kD was used for subsequent calculations.

18.0 MND enzyme in B121 Star (DE3) cells vs. PLUSE cells: permeabilization assay.

18.1 Permeabilization of B121 Star (DE3) and PLUSE E. coli cells expressing MND enzyme with toluene incubated with thebaine and northebaine production analysis.

[0131] Permeabilization assays were carried out as above, in triplicate, for MND enzyme-encoding vectors transformed into both BL21 Star (DE3) and PLUSE cells. Cells transformed with the empty vector pET28a were included as a control. Samples were also extracted as above but the dried extracts were resuspended in 200 μΐ of 80% MeOH prior to filtration and 100 fold dilution. Samples were analyzed and quantitated by QTRAP 6500 using method 1 with a 2 μΐ injection volume.

18.2 SDS PAGE on MND enzyme-encoding vectors and pET28a transformed into BL21 Star (DE3) and PLUSE cells cultured for permeabilization assay.

[0132] The OD600 of each culture was measured and used to normalize each sample concentration prior to analysis by SDS-PAGE by using 6 μΐ of sample per an OD600 of 10. The varying amounts of cultures were then centrifuged (16,000 x g, 5 min, RT) and 20 μΐ of Laemmli Sample Buffer supplemented with β-mercaptoethanol was added to each pellet and heated at 95°C for 10 minutes prior to SDS-PAGE using a 7% gel. 19.0 Poppy seed inoculation experiment.

[0133] Thebainfresser grown in LB media were used to inoculate 10 seeds each of three Papaver sominferum cultivars, Munich, Nopa (Norman parent), and Norman by rotating the seeds at room temperature for both 1 hour and 24 hours. Seeds were also incubated in LB only as a control. Each experiment was performed in triplicate. Seeds were planted immediately post inoculation in Metromix 360 soil and cultivated on site in the greenhouse (Donald Danforth Plant Science Center, St. Louis MO). Leaf tissue was harvested from each plant 55 days after planting. The leaves chosen were newer and all from the same developmental stage (about the 10th leaf from the original true leaves), and frozen in liquid nitrogen immediately post-harvest. Capsules were harvested after petal fall and the latex from one of each experimental group and control were streaked out onto MODLS + thebaine agar plates to monitor for growth of Thebainfresser.

20.0 MND enzyme substrate testing.

[0134] Substrates were prepared in a 1 mM stock concentration then diluted to 100 μΜ working concentration for use in enzyme assays with the MND enzyme. Ethanol was added to the stock solutions to aid in solubility for compounds that did not dissolve easily in water. No more than 15% ethanol was added to the 1 mM concentrations. The following substrates were tested: oripavine, (R)-reticuline, salutaridine, salutaridinol, heroin, thebaine, morphinone, codeinone, codeine, morphine, hydromorphone, oxymorphone, galanthamine, laudanine, orientaline, protosinomenine, isoorientaline, laudanosine, (S)- reticuline, scopolamine, hyoscyamine, noscapine, tropinone, physostigmine, isothebaine, (-)-lobeline, gramine, and autumnaline. Several other substrates were also tested with no detectable activity, but reasons for this could be attributed to solubility and detection issues, therefore these results were considered inconclusive. All enzyme assays contained 30 mM potassium phosphate buffer pH 8.0, 7.5 μΜ substrate and 100 μΐ of purified MND enzyme (-36 μg) in a total volume of 200 μΐ. Duplicate assays of each substrate using purified pET28a empty vector were performed as negative control and an assay containing thebaine and MND enzyme was used as a positive control. Assays were incubated at 30°C overnight prior to ethyl acetate extraction. All samples were diluted 10 fold with 50% MeOH and analyzed on the QTRAP 6500 using method 2 and a 20 μΐ injection volume. The percentage of substrate demethylation was calculated based upon reduction of substrate in the enzyme assay as compared to the negative control. The calculations were based on overnight assays which are most likely out of the enzyme's linear range. Therefore, the percentages should be considered approximate.

21.0 Agrobacterium mediated transformation of MND enzyme-encoding vectors into Camelina sativa.

[0135] The PCR product containing the MND enzyme open reading frame cut with Ndel and BamHI was ligated into the vector pKLl 1-gly (digested with the same enzymes) then transformed into E. coli DH5 a. The vector pKLl l- gly is a small E. coli vector containing the Glycinin promoter and terminator from soybean and an ampicillin resistance gene. The entire expression cassette containing the Glycinin promoter, MND enzyme coding region, and Glycinin terminator was then amplified by PCR using the following primers to add the Avrll and RsrII restriction sites (respectively):

GLYP Avrll F (5' - ATGGTACCTAGGGTACGTAAGTACGTACTCAA - 3')

(SEQ ID NO: 8).

GLYT RsrII Rev (5' -

CACTCACGGACCGAAGTCATGAAGAACCTGATAAGAC - 3') (SEQ ID NO: 9).

[0136] The PCR product was then digested with Avrll and RsrII and ligated into pRSe3 vector (Augustin et al. 2015) cut with the same enzymes. The ligation reaction was first transformed into E. coli DH5a for sequence verification then transformed into the Agrobacterium strain GV3103 : : pMP90. Camelina sativa Suneson and Camelina sativa Licalla were both transformed with the MND gene via agrobacterium (Lu and Kang 2008). In addition, a synthetic version of an MND enzyme coding region, encoding the identical amino acid sequence as the MND enzyme but codon optimized for Arabidopsis thaliana, a relative of C sativa, was ordered (Genewiz) and cloned into pRSe3 and transformed into C sativa using the same procedure.

22.0 Phylogenetic analysis

[0137] The MND enzyme protein sequence was subject to a protein BLAST (Camacho et al. 2009) search using the NR database at NCBI to obtain the most closely related sequences known. Sequences were aligned with MUSCLE and a maximum likelihood tree (500 bootstraps) was created with MEGA v 6.06 (Tamura et al. 2013). 23.0 Temperature optimum assay

[0138] Enzyme assays with 100 μΐ purified MND enzyme (-36 μg), 7.5 μΜ thebaine, and 30 mM potassium phosphate pH 8.0 in a total assay volume of 200 μΐ were performed in duplicate and allowed to incubate at the following temperatures for 20 minutes: 20°C, 25°C, 30°C, 40°C, 50°C, 60°C, 70°C and 80°C. Assays with purified pET28a empty vector control were included at each temperature. Assays were quenched by the addition of 400 μΐ MeOH followed by rapid mixing for 30 s then subject to centrifugation (16,000 x g, 30 min, RT). Samples were diluted further (total of 50 fold dilution) to achieve an analyte concentration of 0.15 μΜ prior to filtration (low protein binding 0.2 μπι, 4 mm, PTFE; Millipore) and introduction into the QTRAP 6500 using Method 1.

24.0 pH optimum assay

[0139] Enzyme assays with 100 μΐ purified MND enzyme (-36 μg), 7.5 μΜ thebaine, and 60 mM buffer (citrate buffer pH 4 and pH 5, potassium phosphate buffer pH 6.0, pH 7.0, and pH 8.0, and glycine-NaOH buffer pH 9, pH 10, and pH 10.5) in a total assay volume of 200 μΐ were allowed to incubate at 30° for 20 minutes. Assays with no enzyme and a purified pET28a empty vector control in pH 8.0 were included. The remaining protocol was the same as for the temperature optimum above. A supplemental pH assay was run in the same manner as above to verify the low pH results using a glycine-HCL buffer for pH 2, 3, and 4 and citrate buffer for pH 4, 5, and 6.

25.0 Kinetic assay

[0140] Kinetic assays were performed in triplicate with 100 μΐ purified MND enzyme (-36 μg) using 30 mM potassium phosphate buffer pH 7.0 at 30°C in a total volume of 200 μΐ and allowed to proceed for 20 min. The following substrate concentrations were used 0 μΜ, 0.5 μΜ, 1 μΜ, 5 μΜ, 10 μΜ, 15 μΜ, 20 μΜ, 30 μΜ, 40 μΜ, and 50 μΜ. No enzyme and empty vector controls were also included. Assays were quenched, filtered, and diluted as in section 23.0 and analyzed using Method 1. Kinetic parameters were determined using Prism 7 version 7.01.

26.0 MND enzyme stability assay

[0141] 100 μΐ purified MND enzyme (-36 μg) was incubated in 30 mM potassium phosphate buffer pH 7.0 at 30°C for 72, 48, 24, 6, 3, and 1 hours prior to addition of 1500 pmol thebaine in a total volume of 200 μΐ. The assay was allowed to proceed for 20 minutes after substrate addition and was processed and analyzed as in section 23.0.

27.0 MND enzyme treatment with Proteinase K and addition of several cofactors to determine effects on enzymatic conversion of thebaine to northebaine

[0142] The first of two assays performed included adding 1500 pmol FAD to the 200 μΐ MND enzyme assay including 100 μΐ purified MND (-36 μg) and 1500 pmol thebaine in 30 mM potassium phosphate buffer pH 7.0 with and without a prior 1 hour digest (of MND enzyme) with Proteinase K. Controls included a no enzyme control, FAD addition to a no enzyme control, a pET28a empty vector control, a pET28a empty vector control with FAD, and a pET28a empty vector control digested with Proteinase K followed by FAD addition. Assays were allowed to incubate for 20 minutes and then processed as in section 23.0. A second set of assays was performed, in duplicate, to validate the results found in the previous assay in addition to testing more cofactors. This assay included the following conditions; MND enzyme, MND enzyme pre-digested with Proteinase K for 1 hour, MND enzyme digested with Proteinase K in the presence of 1% SDS and 10 mM EDTA, MND enzyme with 1% SDS and 10 mM EDTA (no Proteinase K pre-digestion), pET28a empty vector, pET28a empty vector pre-digested with Proteinase K for 1 hour, pET28a empty vector digested with Proteinase K in the presence of 1% SDS and 10 mM EDTA, MND enzyme with the addition of 1500 pmol FMN, pET28a empty vector with the addition of 1500 pmol FMN, MND enzyme with the addition of 1500 pmol PLP and pET28a empty vector with the addition of 1500 pmol PLP. Assays were allowed to proceed for 20 minutes prior to processing and quantitation as described in section 23.0.

[0143]

28.0 Attachment of MND to sepharose beads

[0144] 1 mg of His-tagged purified MND underwent buffer exchange into 0.1 M NaHC0 3 / 0.5 M NaCl buffer using a PD-10 column (GE Healthcare) and concentrated to 1 ml using 3K Amicon filter (Millipore). The sample was then coupled to CNBr-activated Sepharose™ 4B (GE Healthcare) following manufacturer's instruction with the following modifications. The recommended volumes were reduced by 1/14 and washes were performed using vacuum filtration in a DNA plasmid purification column (Qiagen). After attachment, the column was washed several times before the final wash with 0.1 M potassium phosphate buffer pH 7.0. Some buffer remained in the column and 1500 pmol of thebaine was added to the stopped up column. After 20 minutes, the column was subject to centrifugation and the sample was collected. The column was stopped up again and 200 μΐ of 0.1 M potassium phosphate buffer pH 7.0 was added in addition to 1500 pmol of thebaine. Thebaine was incubated overnight and collected the next morning. 1500 pmol of thebaine was also added to 200 μΐ of NaHC0 3 wash (after initial ligand binding) with and without residual beads, to phosphate buffer only, and to a mix of the acid/base wash as controls. 29.0 Testing of MND in different E. coli strains

[0145] His-tagged MND and empty vector pET28a was transformed into PlusSa E.coli cells (expressing skp chaperone protein) and BL21 (DE3) pLysS. The following cells lines were tested for active protein production by toluene permeabilization assay, in duplicate, that contained His-tagged MND and an empty vector control: BL21 Star (DE3), PlusE, pLysS, and PlusSa. Cell growth was normalized and subjected to toluene permeabilization and incubation with thebaine. Northebaine production was detected and quantified by LC- MS/MS using method 2.

30.0 AKTA protein purification for determination of native protein size and quaternary structure

[0146] Additional protein purification after cobalt column was performed using the AKTA purifier (GE Heathcare) fitted with P-900 pumps, UV-900 monitor, and the Frac- 920 fraction collector. Protein was separated on a HiLoad 16/600 Superdex 200 prep grade size exclusion column with a flow rate of 1 ml/min. For MND, 1 ml protein was loaded (350-850 μg/ ml) after filtration with 0.45 GHP (Pall) syringe filter. A gel filtration standard (Bio-Rad cat. # 151-1901) was reconstituted according to manufacturer's instructions and run prior to MND. Data was analyzed using Unicorn 5.10.

31.0 Protein sequencing of both bands detected by SDS-PAGE

[0147] About 8.5 μg of HIS-tagged MND purified by cobalt resin was run on SDS- PAGE as previously described. The two prominent bands that appear on the gel were excised separately and sequenced by the Proteomics and Mass Spectrometry Facility at the Donald Danforth Plant Science Center. The Results were analyzed by Scaffold (TM) version 4.7.2.

32.0 Morphinan N-Demethylase (the MND Enzyme)

32.1 Cofactor identification

[0148] Due to the reaction type and color of the purified protein, it was suspected that the enzyme contained a co-factor. After methanol precipitation of MND and an empty vector control, the supernatant was analyzed by LC-MS/MS for potential cofactors. A large peak for FAD, but not FMN or any other cofactor, was detected in the MND containing samples and not in the pET28a containing samples.

32.2 Protein analysis: sequence, native size, and tertiary structure

[0149] Purified His- tagged MND analyzed by SDS-PAGE revealed two prominent protein bands around 60 and 75 kD. In order to verify the bands, and determine if one was a processed peptide of MND, the bands were excised and sequenced. The larger band corresponded to a weight of about 74 kD and was indeed the His-tagged version of MND. Also of note, the first methionine appeared to be cleaved. The smaller band had a calculated size of about 57 kD and turned out to be the E. coli GroEL chaperone protein. MND protein purified by FPLC using a size exclusion column with the AKTA protein purification system successfully separated the GroEL chaperone protein from MND. The activity, however, was variable and with the amount of lost material and activity, this purified form was not chosen for further analysis. However, a gel filtration standard was run directly prior to a run with MND to verify native size. The MND protein band (verified after fraction collection by enzyme assay) eluted between the Gamma-globulin (158,000 kD) and the Ovalbumin (44,000) protein standard, indicating the protein, in its native form, is most likely the predicted size of 74 KD and therefore exists as a monomer. Results

33.0 Thebainfresser isolation, growth, and morphological characteristics

[0150] A sludge pond sample was added to minimal media containing thebaine as the sole carbon source (MODLS + thebaine). Incubated under non-sterile conditions at room temperature over many months, eventually microbial growth was observed. The culture was then subcultured under the same conditions until remnants of the original sludge sample were no longer observed, but microbial growth was consistently achieved. The bacterium producing the thebaine-demethylating enzyme that was eventually isolated was one in a complex mixture of soil microorganisms (both bacterial and fungal) from the sludge pond sample. After multiple rounds of serial dilutions on various media, the organism producing the desired N-demethylation reaction was isolated. The organism had pink pigment (Figure IB) and grew on LB media and MODLS + thebaine media but not on Potato Dextrose Agar media. Colonies could be seen developing on LB media in 5-7 days, but took about a month to grow on MODLS + thebaine. Microscopic images of Thebainfresser growing in MODLS + thebaine showed morphology of rod shaped bacteria (Figure 1A). Further microscopic analysis of MODLS + thebaine incubated alone (Figure 2A) and MODLS + thebaine cultured with Thebainfresser (Figure 2B) showed the Thebainfresser colonizing on the thebaine crystals.

[0151] In order to determine the best growth conditions for culturing Thebainfresser, cultures were incubated at different temperatures and density was observed. Thebainfresser was cultured at 21°C, 28°C, and 37°C, in duplicate, and the cell density was measured spectrophotometrically using OD600 (Figure 3). Thebainfresser grew to a higher culture density at 28°C.

34.0 N-Demethylase enzyme induction, RNA extraction, and sequencing

[0152] In order to determine if the N-demethylating enzyme of interest was induced upon growth in a media lacking a carbon source other than thebaine, an induction experiment was performed. Cultures growing in nutrient rich LB media and cultures growing in MODLS + thebaine were washed with and resuspended in MODLS + thebaine. The amount of northebaine produced was monitored over several days (Figure 4). The lag in northebaine accumulation in cultures growing in LB media indicated that production of the MND enzyme was induced under low carbon and nutrient poor conditions.

[0153] RNA was extracted from cultures grown under three conditions; 1) LB only (Day 0), 2) LB then washed and resuspended in MODLS + thebaine for 3 days (Day 3), and 3) LB then washed and resuspended in MODLS + thebaine for 6 days (Day 6). The RNA was then check for quality using the Bioanalyzer and sent to MoGene for library preparation and Illumina sequencing using MiSeq. MoGene ran the samples twice and provided us with both sets of raw reads. The number of reads, length of reads, and the GC content of each set are presented in Table 1. UND indicates undetermined reads that could not be positively associated with any of the three samples.

35.0 Thebainfresser transcriptome assembly

[0154] The raw reads for Day 0, Day 3, and Day 6 from both Illumina Miseq runs were used in the assembly. Raw reads were normalized using Fulcrum (Burriesci et al. 2012) and trimmed with the Trimmomatic (Bolger et al. 2014) program. The processed reads were then assembled with Trinity (Grabherr et al. 2011). The number of contigs produced by trinity was 829,495 (Table 2). These contigs were then submitted to BLAST, and the resulting annotations suggest that the organism is a Pink Pigmented Methylobacterium {Methylobacterium spp) as 37% of contigs significantly aligned to Methylobacterium while 53% had no significant hits. Many of the contigs had confident annotations for Homo sapiens (Human), Cricetulus griseus (Chinese Hamster), and others. These contigs that had no hits in the database or significantly aligned to eukaryotes were removed. Statistics for our trinity assembly throughout the filtering process, in addition to statistics for two known metholybacteria are presented in Table 2. The similar GC-content (65%) and number of contigs (6,792) found in our assembly compared to the known methylobacteria suggest our final transcriptome is of high quality. 36.0 Candidate gene selection

[0155] Gene expression was determined by aligning raw reads to the transcriptome using BWA (Li and Durbin 2009). The expression dataset was then mined using HAYSTACK (Mockler et al. 2007) to find contigs (gene transcripts) that had an expression pattern consistent with the production of northebaine in the induced cultures (Figure 4). The HAYSTACK analysis resulted in 34 contigs whose expression pattern fit the model. The list was curated to produce 4 candidates for cloning based upon annotation and length of open reading frame. Candidate A annotated as an ethanolamine utilization protein, candidate B annotated as a diguanylate cyclase, candidate C annotated as a glutamine synthase, and candidate D annotated as an FAD-dependent oxidoreductase. 37.0 Identification of Morphinan N-demethylase ("the MND enzyme")

[0156] The four candidate genes were cloned into the E. coli expression vector pET28a and initially transformed into the expression strain BL21 Star (DE3). For initial enzyme function testing, the genes were expressed and the cells were treated with toluene to permeabilize the membrane and allow for substrate uptake. The toluene was washed away and the cells were resuspended in media supplemented with thebaine and incubated overnight. Half of each culture was extracted and analyzed for northebaine production by LC-MS/MS. Candidate enzyme D showed morphinan N-demethylase activity and was therefore designated morphinan N-demethylase (MND enzyme) (Figure 5). All other enzymes showed a background accumulation of northebaine similar to the pET28a empty vector control. The experiment was repeated with two individually transformed BL21 Star (DE3) colonies of pET28a and MND enzyme, each performed in triplicate (Figure 6). Purification of MND enzyme showed a high protein accumulation, but much of that protein was insoluble and remained in the pellet (Figure 7). Additional verification of protein purification and function was performed by enzyme assay with the purified protein and thebaine as substrate. The assay was performed in duplicate (Figure 8) with empty vector, boiled enzyme, and no enzyme controls. Only MND enzyme showed any accumulation of northebaine. Quantitation of northebaine production is presented in Figure 9. Another assay was completed with purified MND enzyme in the same manner, to verify activity (Figure 10). As presented in Figure 10, the substrate thebaine was depleted completely in the overnight assay. Chromatograms for both the substrate and product are presented with an inset displaying the fragmentation pattern and structure of each. Both panels contain an overlay of the standard and the negative control. 38.0 Optimized expression of MND enzyme using PLUSE cells

[0157] Due to the large amount of insoluble protein, several tests were performed to optimize protein production. MND enzyme production occurred pre-induction (Figure 7, lane 3), so it was hypothesized that the protein was being produced too quickly and mis- folding. Protein production was analyzed under different concentrations of IPTG and different temperatures but the amount of insoluble protein was not improved. The expression vector was then transformed into the PLUSE E. coli expression strain that also express the GroEL and GroES chaperone proteins. The amount of functional protein expressed was significantly increased during the toluene permeabilization assay (Figures 11 and 12) when compared to the B121 Star (DE3) cells. Therefore, the protein was purified from PLUSE cells for further testing and enzyme characterization (Figure 13). The purified MND enzyme from PLUSE cells was functionally validated prior to further testing (Figure 14) and the amount of northebaine produced was quantified (Figure 15). Essentially all of the substrate was converted to product.

39.0 MND enzyme characterization and substrate testing

[0158] Purified MND enzyme from PLUSE cells was used to test various substrates containing a methylated nitrogen (Figure 16) in overnight assays. MND enzyme was able to demethylate morphinan compounds tested with high efficiency and was also able to demethylate benzylisoquinoline alkaloids with slightly less efficiency (Table 3). Some of the more surprising substrates accepted by the MND enzyme include galanthamine, scopolamine, hyoscyamine, gramine, (-)-lobeline, physostigmine, isothebaine, and tropinone, indicating that the MND enzyme is promiscuous.

[0159] The MND enzyme was also able to perform at a wide range of temperatures tested. Assays were allowed to incubate for only 20 min to remain in the linear range of product production. Northebaine was detected at temperatures of 20°C- 80°C, but the highest accumulation was found at 60°C (Figure 17). The MND enzyme was also tested in a range of pH solutions including pH 2, 3, 4, 5, 6, 7, 8, 9, 10 and 10.5. There was no significant difference in northebaine production when tested in the pH range of 5-10.5, but pH 4 produced nearly twice as much (Figure 18). This phenomena was validated with a supplemental assay (performed in duplicate) including pH 4 with two different buffers (Figure 19), but was only observed in the citrate buffer, not the glycine-HCL buffer. 40.0 MND enzyme protein quantitation

[0160] The concentration of MND in each protein purification prep was determined by SDS-PAGE using a BSA standard curve. The MND protein purification used for substrate testing and kinetic assays revealed a protein concentration of 360 ng/μΐ (Figure 20). The larger band only was used for quantitation as it was shown to be MND by protein sequencing. The average amount of soluble protein was 2 mg/1 (in PLUSE cells) and the amount remaining in the inclusion bodies was around 300 mg/1.

41.0 Determination of MND enzyme kinetic values

[0161] MND enzyme assays with increasing substrate concentrations were performed in order to determine kinetic parameters (Figure 21). MND enzyme was found to have a

Vmax of 0.32 pmol s "1 , K m of 0.97 μΜ, and K cat of 6.5 x 10 "4 s "1 .

42.0 MND enzyme maintains it ability to demethylate after days of incubation at

30°C

[0162] MND enzyme was incubated at 30°C in assay buffer for 1, 3, 6, 24, 48, and 72 hours prior to addition of substrate (thebaine) in order to determine the stability of the MND enzyme (Figure 22). Upon addition of substrate, assays were allowed to proceed for 20 minutes and product formation was quantitated as above. The MND enzyme maintained its ability to demethylate thebaine after 72 of incubation at 30°C suggesting the MND enzyme is relatively stable under these conditions.

43.0 Effects of cofactor addition and Proteinase K digestion on enzymatic activity of the MND enzyme

[0163] Several assays were performed in order to determine the effect additional cofactors would have on MND enzyme activity. Addition of FAD to no enzyme and pET28a empty vector controls showed a slight increase in the background demethylation of thebaine to northebaine but not close to levels detected in assays containing MND enzyme (Figure 23). Addition of FMN also showed an increase in background activity, but addition of PLP did not (Figure 24). MND enzyme was also digested with Proteinase K prior to enzyme assay (Figures 23 and 24) in the presence of SDS (increases Proteinase K activity) and additional FAD. Surprisingly the amount of northebaine produced increased after the MND enzyme had been treated with Proteinase K.

44.0 Phylogenetic analysis of the MND enzyme

[0164] The nucleotide sequence and protein sequence of the MND enzyme were subject to BLAST searches using the NCBI NR database and Uniprot. The MND enzyme gene contains 2,013 nucleotides and the resulting protein sequence is 670 amino acids long with an estimated molecular weight of 72 kD. No similar sequences were obtained using the nucleotide BLAST optimized for highly similar sequences. The nucleotide BLAST optimized for more dissimilar sequences resulted in only two hits with very low query coverage (3% and 1%) and high E values (0.005 and 9.2). The top 7 hits resulting from the protein BLAST only shared 39 - 42 % identity with the MND enzyme and were all classified as hypothetical proteins. The subsequent 5 protein BLAST hits annotated as FAD-dependent oxidoreductases. Interestingly, even though the majority of contigs in the Thebainfresser transcriptome annotated as genes from Methyl obacterium using the NCBI BLAST NR database, not one of the top 100 blast hits using the MND enzyme as query was a Methylobacterium gene.

[0165] The protein sequences of the closest hits obtained by protein BLAST and additional less similar proteins, but with better annotation, were aligned with the Muscle algorithm. A phylogenetic tree was then constructed with the alignment using Mega 6.06 (Figure 25). The placement of the MND enzyme in the tree indicates no known sequences are closely related. The top protein BLAST hit from the NCBI NR database did not cluster with the MND enzyme and the low bootstrap value clustering the MND enzyme with an opine oxidase subunit from Rhizobium and a the ethanolamine utilizing FAD-containing monooxygenase from Mycobacterium suggest that its placement in the tree is arbitrary and that it is too dissimilar from all sequences for proper phylogenetic placement, making true annotation difficult.

[0166] However, several conserved domains were detected with the protein BLAST. Most notable are a NAD_binding_8 (Rossmann-like) domain spanning amino acids 128 - 162, a L-aspartate oxidase domain spanning amino acids 121-154, a FAD-dependent oxidoreductase domain spanning amino acids 125-164, a hydroxyglutarate oxidase domain spanning amino acids 124-155, and a deaminating D-amino acid oxidase domain spanning amino acids 121-167. In addition, a thiazole biosynthesis enzyme conserved domain was detected spanning amino acids 125-162 and a phospholipase C domain spanning amino acids 23-67 were also detected. Only one domain was detected when the MND enzyme protein sequence was submitted to Pfam (Finn et al. 2016), and that was also the NAD_binding_8 domain spanning amino acids 128-173. With this in mind, the amino acids from 121-173, spanning the region covered by most conserved domains, was then subject to protein BLAST using both the traditional protein BLAST and the DELTA- BLAST. In addition to one oxidoreductase, the most abundant top BLAST hits were for proteins involved in the twin-arginine translocation pathway. The programs Interpro and Prosite also identified a twin-arginine translocation signal from amino acids 1-53. The motifs uncovered by BLAST and Pfam do suggest binding of nucleotide containing cofactors including FAD, NAD, and NADP, but the lack of homology to known proteins leaves functional identification using sequence alone highly improbable.

45.0 Testing different E. coli expression strains

[0167] Several E.coli expression strains were tested in parallel for N-demethylase activity of thebaine by toluene permeabilization assay (Figure 28). MND in PlusE cells showed the highest activity, with almost 40 % more northebaine production than the BL21 Star (DE3) cells, almost 20 % more than PluSa cells, and nearly 90% more than pLysS cells. Therefore, this strain was chosen for further experiments.

46.0 Solvent stability assays

[0168] Enzymes used in pharma manufacturing must withstand harsh conditions and tolerate solvents, especially if the substrate is insoluble in water (Figure 27). MND activity was tested in the following solvents at 5% and 25% concentration: DMSO, methanol, chloroform, tetrahydrofuran, toluene, heptane, and diethyl ether. MND showed particular resilience to DMSO, toluene, and methanol with little to no reduction in activity at 5% solvent and only a 21%, 44% and 43% reduction in 25% solvent (respectively). Activity was completely abolished at 25% chloroform and tetrahydrofuran and diethyl ether had a significant reduction of 66%. The heptane reaction was an outlier with no reduction in activity which is most likely due to its insolubility with the enzyme assay components and therefore sat on top of the reaction.

47.0 MND immobilization to CNBr-activated SEPH AROSE 4B

[0169] MND attached to CNBr-activated Sepharose beads exhibited N-demethylation activity on thebaine. After 20 min on the column, nearly 250 pmol of northebaine was detected while over 400 pmol of northebaine was detected after the overnight incubation (Figure 29). The only control that showed activity was the sodium carbonate that contained residual sepharose beads, which had MND attached, so this result was expected. An increase in N-demethylase activity could most likely be achieved if substrate was added slowly, allowing the substrate to access more immobilized enzyme. 48.0 Tables

Table 1. Thebainfresser Miseq statistics

Table 2. Thebainfresser transcnptome statistics through several filtering steps compared to two known Methyl obacterium

Table 3. List of compounds used for MND enzyme substrate testing, the masses detected for each, and the percent of substrate demethylated by MND enzyme in an overnight enzyme assay.

m/z of

m/z of Parent Daughter Percent ion ion de¬

Compound (substrate) (product) methylated

Oripavine 298 284 100%

(R)-Reticuline 330 316 25%

Salutaridine 328 314 100%

Salutaridinol 330 316 100%

Heroin 370 356 70%

Thebaine 312 298 100%

Morphinone 284 270 100%

Codeinone 298 284 100%

Codeine 300 286 100%

Morphine 286 272 100%

Hydromorphone 286 272 98%

Oxymorphone 302 288 73%

Galanthamine 288 274 27%

Laudanine (Laudanidine) 344 330 14%

Orientaline 330 316 100%

Protosinomenine 330 316 60%

Isoorientaline 330 316 86%

Laudanosine 358 344 31%

(^-Reticuline 330 316 55%

Scopolamine 304 290 65%

Hyoscyamine 290 276 18%

Noscapine (Narcotine) 414 400 20%

Tropinone 140 126 40%

Physostigmine 276 262 37%

Isothebaine 312 298 35%

(-)-Lobeline 338 324 20%

Gramine 175 161 51% Autumnaline 374 360 I 11%

49.0 Thebainfresser growth media

Table 4. Minimal media (MODLS + THEBAINE) References

Augustin, J.M., Higashi, Y., Feng, X. and Kutchan, T.M. (2015) Production of mono- and sesquiterpenes in Camelina sativa oilseed. Planta, 242, 693-708.

Bolger, A.M., Lohse, M. and Usadel, B. (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics, 30, 2114-2120.

Burriesci, M.S., Lehnert, E.M. and Pringle, J.R. (2012) Fulcrum: condensing redundant reads from high-throughput sequencing studies. Bioinformatics, 28, 1324-1327.

Camacho, C, Coulouris, G., Avagyan, V., Ma, N., Papadopoulos, J., Bealer, K. and Madden, T.L. (2009) BLAST+: architecture and applications. BMC bioinformatics, 10, 421.

Finn, R.D., Coggill, P., Eberhardt, R.Y., Eddy, S.R., Mistry, J., Mitchell, A.L., Potter, S.C., Punta, M., Qureshi, M., Sangrador- Vegas, A., Salazar, G.A., Tate, J. and Bateman, A. (2016) The Pfam protein families database: towards a more sustainable future. Nucleic acids research, 44, D279-285.

Grabherr, M.G., Haas, B.J., Yassour, M., Levin, J.Z., Thompson, D.A., Amit, I, Adiconis, X., Fan, L., Raychowdhury, R., Zeng, Q., Chen, Z., Mauceli, E., Hacohen, N., Gnirke, A., Rhind, N., di Palma, F., Birren, B.W., Nusbaum, C, Lindblad-Toh, K., Friedman, N. and Regev, A. (2011) Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature biotechnology, 29, 644-652.

Li, H. and Durbin, R. (2009) Fast and accurate short read alignment with Burrows- Wheeler transform. Bioinformatics, 25, 1754-1760.

Lu, C. and Kang, J. (2008) Generation of transgenic plants of a potential oilseed crop Camelina sativa by Agrobacterium-mediated transformation. Plant cell reports, 27, 273-278.

Mockler, T.C., Michael, T.P., Priest, H.D., Shen, R., Sullivan, CM., Givan, S.A., McEntee, C, Kay, S.A. and Chory, J. (2007) The DIURNAL project: DIURNAL and circadian expression profiling, model -based pattern matching, and promoter analysis. Cold Spring Harb Symp Quant Biol, 72, 353-363.

Paoni, N.F. and Koshland, D.E., Jr. (1979) Permeabilization of cells for studies on the biochemistry of bacterial chemotaxis. Proceedings of the National Academy of Sciences of the United States of America, 76, 3693-3697.

Tamura, K., Stecher, G., Peterson, D., Filipski, A. and Kumar, S. (2013) MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. Molecular biology and evolution, 30, 2725- 2729.

Tartoff, K.D. and Hobbs, C.A. (1987) Improved Media for Growing Plasmid and Cosmid Clones. Bethesda Res. Lab. Focus, 9.

William, S. and Copeland, H.F. (2012) Bacterial genomic DNA isolation using CTAB. World Wide Web at jgidoegov/collaborate-with-ghi/pmo-overview/protocols-sample -preparatioN- information.