Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
A GENE ENCODING AN ENZYME FOR CATALYZING BIOSYNTHESIS OF LIGNAN, AND USE THEREOF
Document Type and Number:
WIPO Patent Application WO/2005/030944
Kind Code:
A1
Abstract:
The present invention provides an enzyme that catalyzes the reaction producing piperitol from pinoresinol, and a reaction producing sesamin from piperitol. The invention also provides a gene that encodes such enzyme. Further, the invention provides a vector and transformant including a gene encoding the enzyme, and a producing method of the protein using the transformant.

Inventors:
ONO EIICHIRO
TANAKA YOSHIKAZU
Application Number:
PCT/JP2004/014696
Publication Date:
April 07, 2005
Filing Date:
September 29, 2004
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SUNTORY LTD (JP)
ONO EIICHIRO
TANAKA YOSHIKAZU
International Classes:
C12N9/02; C12N15/82; C12P17/18; H04N5/217; (IPC1-7): C12N9/04; C12N15/82; C12P7/22; A01H5/00
Other References:
XIA Z-Q ET AL: "Dirigent-mediated podophyllotoxin biosynthesis in Linum flavum and Podophyllum peltatum", PHYTOCHEMISTRY, PERGAMON PRESS, GB, vol. 55, no. 6, November 2000 (2000-11-01), pages 537 - 549, XP004291678, ISSN: 0031-9422
GANG D R ET AL: "REGIOCHEMICAL CONTROL OF MONOLIGNOL RADICAL COUPLING: A NEW PARADIGM FOR LIGNIN AND LIGNAN BIOSYNTHESIS", CHEMISTRY AND BIOLOGY, CURRENT BIOLOGY, LONDON, GB, vol. 6, no. 3, March 1999 (1999-03-01), pages 143 - 151, XP000995932, ISSN: 1074-5521
OVERKAMP STEFAN ET AL: "Cloning and characterization of eight cytochrome P450 cDNAs from chickpea (Cicer arietinum L.) cell suspension cultures", PLANT SCIENCE (SHANNON), vol. 155, no. 1, June 2000 (2000-06-01), pages 101 - 108, XP002314013, ISSN: 0168-9452
DATABASE EMBL [online] 12 December 2001 (2001-12-12), "Arabidopsis thaliana cytochrome P450-like protein (F6G17.20) mRNA, complete cds.", XP002314014, retrieved from EBI accession no. EM_PRO:AY065192 Database accession no. AY065192
DATABASE EMBL [online] 5 June 2002 (2002-06-05), "Arabidopsis thaliana putative cytochrome P450 protein (At3g28740) mRNA, complete cds.", XP002314015, retrieved from EBI accession no. EM_PRO:AY113869 Database accession no. AY113869
DATABASE EMBL [online] 27 August 2001 (2001-08-27), "Arabidopsis thaliana putative cytochrome P450 protein (At3g28740) mRNA, complete cds.", XP002314016, retrieved from EBI accession no. EM_PRO:AY050849 Database accession no. AY050849
DATABASE EMBL [online] 14 June 2002 (2002-06-14), "Arabidopsis thaliana clone 253698 mRNA, complete sequence.", XP002314017, retrieved from EBI accession no. EM_PRO:AY086486 Database accession no. AY086486
JIAO YING ET AL: "Furanofuran lignan metabolism as a function of seed maturation in Sesamum indicum: Methylenedioxy bridge formation", PHYTOCHEMISTRY (OXFORD), vol. 49, no. 2, September 1998 (1998-09-01), pages 387 - 394, XP004290112, ISSN: 0031-9422
KATO M J ET AL: "Biosynthesis of antioxidant lignans in Sesamum indicum seeds", PHYTOCHEMISTRY, PERGAMON PRESS, GB, vol. 47, no. 4, February 1998 (1998-02-01), pages 583 - 591, XP004293758, ISSN: 0031-9422
IKEZAWA NOBUHIRO ET AL: "Molecular cloning and characterization of CYP719, a methylenedioxy bridge-forming enzyme that belongs to a novel P450 family, from cultured Coptis japonica cells.", JOURNAL OF BIOLOGICAL CHEMISTRY, vol. 278, no. 40, 3 October 2003 (2003-10-03), pages 38557 - 38565, XP002313984, ISSN: 0021-9258
SUH MI CHUNG ET AL: "Comparative analysis of expressed sequence tags from Sesamum indicum and Arabidopsis thaliana developing seeds.", PLANT MOLECULAR BIOLOGY, vol. 52, no. 6, August 2003 (2003-08-01), pages 1107 - 1123, XP002313982, ISSN: 0167-4412
Attorney, Agent or Firm:
Hara, Kenzo (Daiwa Minamimorimachi Building 2-6, Tenjinbashi 2-chome Kita, Kita-k, Osaka-shi Osaka, JP)
Download PDF:
Claims:
CLAIMS
1. A gene encoding a protein that catalyzes biosynthesis of piperitol and/or sesamin.
2. A gene encoding a protein that catalyzes a reaction forming a methylene dioxybridge in pinoresinol and/or piperitol.
3. A gene encoding a protein that catalyzes biosynthesis of piperitol and/or sesamin, and that consists of (a) an amino acid sequence of SEQ ID NO: 1, 64 or 78, or (b) an amino acid sequence that has been modified by substitution, deletion, insertion, and/or addition of one or more amino acids of SEQ ID NO: 1, 64 or 78.
4. A gene encoding a protein that catalyzes biosynthesis of piperitol and/or sesamin, and that consists of an amino acid sequence which is at least 50% homologous to an amino acid sequence of SEQ ID NO: 1,64 or 78.
5. A gene including a base sequence of SEQ ID NO: 2,65 or 79 as an open reading frame region.
6. A gene encoding a protein that catalyzes biosynthesis of piperitol and/or sesamin, and hybridizing under stringent conditions with (a) a polynucleotide consisting of a base sequence of SEQ ID NO: 2,65 or 79, (b) a polynucleotide encoding a protein consisting of an amino acid sequence of SEQ ID NO: 1,64 or 78, or (c) a fragment of the polynucleotide (a) or (b).
7. A gene as set forth in any one of claims 1 through 6, which is derived from sesame.
8. A protein encoded by a gene as set forth in any one of claims 1 through 7.
9. A protein catalyzing biosynthesis of piperitol and/or sesamin, and consisting of (a) an amino acid sequence of SEQ ID NO: 1,64 or 78, or (b) an amino acid sequence that has been modified by substitution, deletion, insertion, and/or addition of one or more amino acids of SEQ ID NO: 1,64 or 78.
10. An antibody that recognizes a protein as set forth in claim 8 or 9.
11. A recombinant expression vector including a gene of any one of claims 1 through 7.
12. A transformant comprising a recombinant expression vector including a gene of any one of claims 1 through 7.
13. A producing method of a protein, comprising the steps of: incubating or growing a transformant of claim 12; and obtaining from the transformant a protein that catalyzes biosynthesis of piperitol and/or sesamin.
14. A plant, its offspring, and a tissue of the plant and its offspring, into which a gene of any one of claims 1 through 7 has been introduced.
15. A producing method of piperitol and/or sesamin, comprising the step of using a gene of any one of claims 1 through 7, or a protein of claim 8 or 9.
16. A producing method of a transformant containing a large amount of lignan, comprising the step of using a gene of any one of claims 1 through 7.
17. A producing method of a plant containing a large amount of piperitol and/or sesamin, comprising the step of using a gene of any one of claims 1 through 7.
18. A producing method of a transformant containing a small amount of lignan, comprising the step of using a gene of any one of claims 1 through 7.
19. A producing method of a plant containing a small amount of piperitol and/or lignan, comprising the step of using a gene of any one of claims 1 through 7.
20. A method of cultivating sesame, comprising the step of using a gene of any one of claims 1 through 7.
21. A gene detecting device comprising a polynucleotide probe whose base sequence is at least part of a base sequence of a gene set forth in any one of claims 1 through 7.
Description:
DESCRIPTION A GENE ENCODING AN ENZYME FOR CATALYZING BIOSYNTHESIS OF LIGNAN, AND USE THEREOF TECHNICAL FIELD The present invention relates to enzymes for catalyzing the biosynthesis of piperitol and sesamin in sesame, and to genes encoding the enzymes. The invention also relates to use of such enzymes and genes.

BACKGROUND ART Sesame plants, generally known as sesame (Sesamum indium), belong to family Pedaliaceae and genus Sesamum.

Sesame is native to central Africa and is the oldest cultivated oil plant with the history of over 6000 years. Sesame has been cultivated in various parts of the world.

Sesame seeds contain about 50% lipid and about 20% protein, in addition to various vitamins including vitamin B1, B2, and E. The main lipid component of sesame seed is triglyceride whose chief constituents are oleic acid and linoleic acid. Sesame also contains secondary metabolites generally known as sesamin, sesamolin, and lignan, which are characteristic components of the sesame plant.

Previous research has revealed various physiological activities of sesamin, which are found to be effective in improving cholesterol metabolism, and liver and immune functions (see Publication 1, for example). A separation and purification method of sesamin from the sesame seeds or wrung residues of sesame seeds has already been put to actual applications (see Patent Publications 1 and 2, for example). Sesamin is also commercially available as a medicament that enhances the liver function by promoting alcohol metabolism, etc. Other than sesamin, sesame lignans (sesaminol, sesamolin, etc. ) have also been reported to have various physiological activities (see Publication 2, for example).

There has been-some study on the biosynthesis of lignan (see Publication 3, for example). Fig. 1 is a schematic representation of a common biosynthesis pathway of lignan.

The lignan is synthesized from a phenyl propanoid compound used as a starting material. In plants, lignan is believed to play role in the defense mechanism. As illustrated in Fig. 1, polymerization of conipheryl alcohol yields pinoresinol as the "first"lignan in the biosynthesis pathway. From pinoresinol, a wide variety of lignans are synthesized through distinct biosynthesis pathways of different plant species.

In the biosynthesis of sesamin, as shown in Fig. 1, piperitol is synthesized by the enzymatic action of a piperitol

synthetase catalyzing (+)-pinoresinol, forming a methylene dioxybridge (circled in Fig. 1). Sesamin is synthesized by a sesamin synthetase forming another methylene dioxybridge in the piperitol. Experiments using membrane fractions of sesame seeds have shown that the enzymes catalyzing these reactions were two different kinds of cytochromes P450 (see Publication 4, for example).

The formation of methylene dioxybridge is often seen in the biosynthesis of alkaloid or flavonoid. For example, membrane fractions from cultured cells of Eschscholtzia calfornica are known to include cytochromes P450 that catalyze the biosynthesis of (S) -cheilanthifoline from<BR> (S) -scoulerine, and (S) -stylopine from (S) -cheilanthifoline, by forming a methylene dioxybridge in these compounds (see Publication 5, for example).

There have also been reported that membrane fractions from cultured cells of chickpea (Cicer arietinum) contain enzymes that catalyze the synthesis of Pseudobaptigenin and 5'-hydroxy Pseudobaptigenin produced by formation of a methylene dioxybridge in calycosin and pratensein, respectively. These enzymes have been identified as cytochromes P450 (see Publication 6, for example).

Others report the possibility that the deoxypodophyllotoxin 6-hydroxylase in cultured cells of Linum flavum may be cytochrome P450 (see Publication 7, for

example).

The cytochrome P450 has also been found as an enzyme involved in the biosynthesis of berberine, which is a benzylisoquinoline alkaloid, in which (S)-tetrahydroberberine is synthesized by forming a methylene dioxybridge in (S)-tetrahydrocolimbamine. A gene that encodes this enzyme has been cloned from Coptis japonica (see Publication 8, for example).

The cytochromes P450 that catalyze various types of reactions as above comprise a superfamily of diverse molecular species, which are categorized based on the homology of their amino acid sequences. Different molecular species of cytochrome P450 belong to the same family if their identity is 40% or greater, and to the same sub family if their identity is 55% or greater. In the notation used in this classification, numbers denote family, and alphabets denote sub family (see Publication 9, for example). The tree diagram shown in Fig. 3 represents families and their interrelations.

For example, the cytochrome P450 involved in the biosynthesis of berberine has been categorized as"CYP719." A single plant species include several hundred molecular species of cytochrome P450. However, as shown in Fig. 4, only a few of them have been identified based on their biochemical and physiological functions.

As a sesame-derived cytochrome P450, a gene

(AY065995) that encodes p-coumarate 3-hydroxylase has been cloned, though it does not directly relate to the synthesis of sesamin or piperitol concerning the present invention.

Cloning of cytochrome P450 genes and their functional analysis have also been reported for various species of other organisms, as noted below.

For example, some of the cloned genes include: A gene coding for petunia-derived flavonoid 3', 5' hydroxylase (F3', 5'H) (see Publication 10, for example); A gene coding for flavonoid 3'-hydroxylase (F3'H: CYP75B) (see Publication 11, for example); A gene coding for sweetroot-derived (2S) -flavanon 2-hydroxylase (F2H: CYP93B1) (see Publication 12, for example); A gene coding for 2-hydroxy-isoflavanon synthetase (IFS: CYP93C2) (see Publication 13, for example); and A gene coding for isoflavon 2'-hydroxylase (I2'H : CYP81E1) (see Publication 14, for example).

In addition, a gene coding for flavon synthetase II (FNSII: CYP93B3) has also been cloned from Antirrhinum majus, using I2'H (see Publication 15, for example).

Amino acid sequences of various cytochromes P450 that belong to the CYP81 family are found in Http://drnelson. utmem. edu/CytochromeP450. html. Among different functions of these cytochromes P450, Helianthus

tuberosus-derive d CYP81 B 1 is known to catalyze the hydrogenation of fatty acids (see Publication 16, for example).

It is also known that the enzyme I2'H belongs to CPY81E.

However, none of these cytochromes P450 is involved in the formation of methylene dioxybridge.

As a gene that encodes an enzyme involved in the biosynthesis of lignan, a gene encoding a dirigent protein involved in the synthesis of pinoresinol in Forsythia intermedia has been reported (see Publication 17, and Patent Publication 3, for example). There have also been reports on a gene encoding a Pinoresinol-Lariciresinol reductase in Forsythia intermedia (see Publication 18, and Patent Publication 3, for example), and a gene encoding a Pinoresinol-Lariciresinol reductase in Thuja plicata (see Publication 19, for example). In other reports, a recombinant secoiso lariciresinol dehydrogenase and its use are discussed (see Publication 20, for example).

[Patent Publication 1] Japanese laid-open publication No. 139579/2001 (published on May 22,2001) [Patent Publication 2] Japanese laid-open publication No. 7676/1998 (published on January 13,1998) [Patent Publication 3] Japanese PCT laid-open publication No. 507931/2001

(published on June 19,2001) [Patent Publication 4] Japanese PCT laid-open publication No. 512790/2002 (published on May 8,2002) [Publication 1] Mitsuo, Namiki, "Sesame, science and its functions," published by Maruzen Planet [Publication 2] Publication of Japan Society for Bioscience, Biotechnology, and Agrbchemistry, 76 805-813 2002 [Publication 3] Lignans: biosynthesis and function, Comprehensive natural products chemistry vol. 1.640-713, 1999 [Publication 4] Phytochemisity, 49,387, 1998 [Publication 5] Phytochemisity, 30,2953, 1991 [Publication 6] Phytochemisity, 41,457, 1996 [Publication 7] Planta, 214,288, 2001 [Publication 8] J Biol Chem. 2003, May 5 [Epub ahead of print] [Publication 9] Nelson et al. Pharmacogenetics 6,1-42, 1996 [Publication L 0]

Nature 366 276-279,1993 [Publication 11] Plant J. 19,441-451, 1999 [Publication 12] FEBS Letters, 431, 287, 1998 [Publication 13] Plant Physiology, 121, 821, 1999 [Publication 14] Biochemical and Biophysical Research Communications, 251,67, 1998 [Publication 15] Plant and Cell Physiology, 40,1182, 1999 [Publication 16] J. Biol. Chem. 273,7260, 1998 [Publication 17] Plant Physiol. 123,453, 2000 [Publication 18] J. Biol. Chem. 271,29473, 1996 [Publication 19] J. Biol. Chem. 271,618, 1999 [Publication 20] J. Biol. Chem. 2001 Apr 20; 276 (16): 12614-23, Epub 2001 Jan 18 As described above, sesamin has a variety of physiological activities, which are known to be effective in improving various deficiencies. However, conventional

sesamin production solely relied on a method using sesame seeds alone. In other words, sesamin production is completely dependent on sesame seeds. Consequently, it has been difficult to improve productivity or reduce the cost of sesamin production.

The problem can be solved effectively by genetic engineering techniques. However, to this date, no enzyme has been purified for the two kinds of cytochromes P450 that are known to be involved in the biosynthesis of sesamin in sesame seeds, and no gene for encoding these enzymes has been cloned. The situation is the same for the cytochromes P450 that form the methylene dioxybridge in different species of other organisms, or for the other kinds of cytochromes P450. There have been cloned genes encoding enzymes involved in the biosynthesis of lignan in different species of other organisms. However, none of them is involved in the synthesis of sesamin and/or piperitol.

Some of the genes derived from sesame seeds have been cloned, examples of which include AF240004, AF240005, and AF240006 encoding globulins as deposit proteins of seeds.

Other examples include genes or enzymes involved in the synthesis or storage of lipids, including oleosin (J. Biochem.

122: 819-24,1997), an acyl carrier protein desaturase (Plant Cell Physiol. 1996,37, 201-5), Steroleosin (Plant Physiol.

2002, 128: 1200-11), and a fatty acid unsaturase (Plant Sci. <BR> <BR> <P>161 935-941 (2001) ). However, none of these genes or

enzymes is involved in the synthesis of lignan.

In other words, there has not been found a gene that encodes an enzyme involved in the synthesis of lignan.

Accordingly, there is a strong need for identification of such enzymes, and genes encoding these enzymes.

The present invention was made in view of the foregoing problems, and an object of the invention is to provide a producing method of sesamin and/or piperitol with a gene encoding a sesame-derived enzyme, using, for example, recombinant organisms. The object is achieved by identifying a gene encoding an enzyme that catalyzes the formation of methylene dioxybridge between the hydroxyl group and methyl group of lignan, or more preferably a gene encoding an enzyme that catalyzes a reaction forming piperitol from pinoresinol, and/or a reaction forming sesamin from piperitol.

DISCLOSURE OF INVENTION The inventors of the present invention diligently worked to solve the foregoing problems. In achieving the object, a group of sesame-derived cytochrome P450 genes (hereinafter "SiP genes") was obtained from a cDNA library of sesame seed, and these genes were expressed in yeasts. Microsome fractions were collected from the recombinant yeasts, and were allowed to react with pinoresinol or piperitol. The presence or absence of piperitol from pinoresinol, or sesamin

from piperitol was examined by HPLC analysis after the reaction. From the result of HPLC analysis, a protein, or a gene encoding it, that catalyzes the reaction producing piperitol from pinoresinol, and sesamin from piperitol was identified.

Specifically, the present invention provides industrially useful substances and methods, as set forth in (1) through (21) below.

(1) A gene encoding a protein that catalyzes biosynthesis of piperitol and/or sesamin. (A gene encoding a protein that catalyzes the biosynthesis of piperitol from pinoresinol, and/or sesamin from piperitol.) (2) A gene encoding a protein that catalyzes a reaction forming a methylene dioxybridge in pinoresinol and/or piperitol.

(3) A gene encoding a protein that catalyzes biosynthesis of piperitol and/or sesamin, and that consists of (a) an amino acid sequence of SEQ ID NO: 1, 64 or 78, or (b) an amino acid sequence that has been modified by substitution, deletion, insertion, and/or addition of one or more amino acids of SEQ ID NO: 1, 64 or 78.

(4) A gene encoding a protein that catalyzes biosynthesis of piperitol and/or sesamin, and that consists of an amino acid sequence which is at least 50% homologous to an amino acid sequence of SEQ ID NO: 1,64 or 78.

(5) A gene including a base sequence of SEQ ID NO: 2, 65 or 79 as an open reading frame region.

(6) A gene encoding a protein that catalyzes biosynthesis of piperitol and/or sesamin, and hybridizing under stringent conditions with (a) a polynucleotide consisting of a base sequence of SEQ ID NO: 2,65 or 79, (b) a polynucleotide encoding a protein consisting of an amino acid sequence of SEQ ID NO: 1,64 or 78, or (c) a fragment of the polynucleotide (a) or (b).

(7) A gene as set forth in any one of (1) through (6), which is derived from sesame.

(8) A protein encoded by a gene as set forth in any one of (1) through (7).

(9) A protein catalyzing biosynthesis of piperitol and/or sesamin, and consisting of (a) an amino acid sequence of SEQ ID NO: 1, 64 or 78, or (b) an amino acid sequence that has been modified by substitution, deletion, insertion, and/or addition of one or more amino acids of SEQ ID NO: 1,64 or 78.

(10) An antibody that recognizes a protein as set forth in (8) or (9).

(11) A recombinant expression vector including a gene of any one of (1) through (7).

(12) A transformant comprising a recombinant expression vector including a gene of any one of (1) through

(7).

(13) A producing method of a protein, comprising the steps of: incubating or growing a transformant of (12); and obtaining from the transformant a protein that catalyzes biosynthesis of piperitol and/or sesamin.

(14) A plant, its offspring, and a tissue of the plant and its offspring, into which a gene of any one of (1) through (7) has been introduced.

(15) A producing method of piperitol and/or sesamin, comprising the step of using a gene of any one of (1) through (7), or a protein of (8) or (9).

(16) A producing method of a transformant containing a large amount of lignan, comprising the step of using a gene of any one of (1) through (7).

(17) A producing method of a plant containing a large amount of piperitol and/or sesamin, comprising the step of using a gene of any one of (1) through (7).

(18) A producing method of a transformant containing a small amount of lignan, comprising the step of using a gene of any one of (1) through (7).

(19) A producing method of a plant containing a small amount of piperitol and/or lignan, comprising the step of using a gene of any one of (1) through (7).

(20) A method of cultivating sesame, comprising the step

of using a gene of any one of (1) through (7).

(21) A gene detecting device comprising a polynucleotide probe whose base sequence is at least part of a base sequence of a gene set forth in any one of (1) through (7).

BRIEF DESCRIPTION OF DRAWINGS Fig. 1 is a schematic diagram showing a typical synthesis path of sesame lignan, wherein 1 is conipheryl alcohol, 2 is pinoresinol, 3 is piperitol synthetase, 4 is piperitol, 5 is sesamin synthetase, and 6 is sesamin.

Fig. 2a through Fig. 2f are diagrams showing results of HPLC measuring activity of an enzyme encoded by SiP189 gene.

Fig. 3 is a tree diagram obtained from the analysis of the primary structure of amino acids encoded by cytochrome P450.

Fig. 4 is a diagram showing a result of homology search with a protein encoded by SiP189 gene.

Fig. 5 is a diagram showing a result of Southern analysis on SiP189 gene.

Fig. 6a through Fig. 6d are diagrams showing a result of HPLC analysis measuring activity of a protein encoded by SrSiP189 gene.

Fig. 7 is a diagram showing a result of expression analysis on SiP gene in transformant tabaco.

Fig. 8a through Fig. 8d are diagrams showing a result of functional analysis on SiP protein in plant cells.

Fig. 9A through Fig. 9H are diagrams showing a result of analysis on a promoter region of SiP189 gene in search for an expression regulatory element.

Fig. 10 is a diagram showing expression regulatory elements present in the promoter region of SiP189 gene.

Fig. 1 1A through Fig. 11H are diagrams showing a result of analysis on a promoter region of SrSiP189 gene in search for an expression regulatory element.

Fig. 12A through Fig. 12C are diagrams representing homology between SiP189 gene and SrSiPgene.

BEST MODE FOR CARRYING OUT THE INVENTION The following will describe an embodiment of the present invention. It should be appreciated that the present invention is not limited in any way by the following descriptions.

(1) Genes according to the present invention, and structures of proteins encoded by the genes A gene according to the present invention encodes a protein for catalyzing the biosynthesis of piperitol and/or sesamin. As used herein, the"biosynthesis of piperitol and/or sesamin"means biosynthesis of piperitol from pinoresinol, and/or sesamin from piperitol. More specifically, it refers to a reaction forming a methylene dioxybridge in the pinoresinol

and/or piperitol. In the embodiment, the invention will be described through gene SiP189 (with a base sequence of SEQ ID NO: 2) that encodes a seed-derived protein for catalyzing the biosynthesis of piperitol and sesamin. In the present invention, the open reading frame is the region from the start codon to the end codon, excluding the end codon.

(1-1) Genes according to the present invention As used herein, the term"gene"is interchangeable with "polynucleotide","nucleic acids", or"nucleic acid molecule".

That is, the meaning of"gene"includes a polymer of nucleotides. Further, as used herein, the term"base sequence"is interchangeable with"nucleic acid sequence"or "nucleotide sequence", and it is represented by a sequence of deoxyribonucleotides (denoted by A, G, C, and T).

A gene of the present invention may code for the amino acid sequence of SEQ ID NO: 1,64 or 78, for example. It should be noted here that proteins are generally known to retain their enzyme activities even if their amino acid sequences were modified by addition, deletion, and/or substitution of several amino acids. In this connection, a gene of the present invention, which encodes a protein that catalyzes the biosynthesis of piperitol and sesamin, may encode (a) a protein consisting of the amino acid sequence of SEQ ID NO: 1,64 or 78, or (b) a protein consisting of the amino acid sequence modified by substitution, deletion,

insertion, and/or addition of one or more amino acids of SEQ ID NO: 1, 64 or 78 yet retains its catalytic action in the biosynthesis of piperitol and/or sesamin. For example, the present invention includes a gene having an open reading frame (ORF) with the base sequence of SEQ ID NO : 2,65 or 79.

As will be described later in Examples, a gene according to the present invention may also encode a cytochrome P450 protein.

As used herein, the"substitution, deletion, insertion, and/or addition of one or more amino acids"means the substitution, deletion, insertion, and/or addition of preferably no more than 10, more preferably no more than 7, and further preferably no more than 5 amino acids, as enabled by a conventional mutant protein producing method (e. g., site-directed mutagenesis inducing method, Hashimoto-Gotoh, Gene 152,271-275 (1995) ). Therefore, it can be said that protein (b) is encoded by a gene that encodes a mutant of protein (a) As used herein,"mutant protein"generally refers to mutant proteins that are artificially prepared by a conventional mutant protein producing method. However, the mutant protein may be isolated and purified from a natural source as well.

A gene of the present invention is not limited to double-stranded DNA, and may be the sense strand or anti-sense strand of double-stranded DNA or RNA. The

anti-sense strand may be used as a probe or anti-sense medicament. For DNA, cDNA or genomic DNA obtained by cloning techniques, chemical synthesis techniques, or a combination of these different techniques may be used.

Further, a gene according to the present invention may include a sequence of an untranslated region (UTR), or a vector sequence (including expression vector sequence), in addition to the sequences coding for the amino acids of proteins (a) and (b).

Further, a gene of the present invention may encode a protein that has no less than 20%, preferably no less than 50%, and more preferably no less than 60% or 70% homology to the amino acid sequence of SEQ ID NO: 1, 64 or 78, and that catalyzes the biosynthesis of piperitol and/or sesamin.

The gene may be used to control a lignan level or cultivate <BR> <BR> plants. As the term is used herein, "homology"refers to a proportion of matched amino acid sequences. A higher homology means a closer relation between the amino acid sequences compared.

Further, a gene of the present invention may include a polynucleotide consisting of a base sequence of SEQ ID NO: 2, 65 or 79, a polynucleotide coding for a protein consisting of an amino acid sequence of SEQ ID NO: 1,64 or 78, or fragments of these polynucleotides. For example, a gene of the present invention may hybridizes under stringent conditions

with a polynucleotide having a base sequence coding for six or more amino acids (i. e. , 18 bases), and may encode a protein that catalyzes the biosynthesis of piperitol and/or sesamin.

As used herein, "under stringent conditions"means that hybridization takes place only when there is at least 80% identity, preferably at least 95% identity, and more preferably at least 97% identity. Specifically, hybridization may take place under 5 x SSC at 50°C, for example.

Hybridization may be carried out by a conventional method, as described in J. Sambrook et al. Molecular Cloning, A Laboratory Manual, 2nd Ed. , Cold Spring Harbor Laboratory (1989), for example. Generally, the level of stringency increases with increase in temperature and/or decrease in salt concentration (more difficult to hybridize). A suitable hybridization temperature depends on the base sequence or the length of the base sequence. For example, when a DNA fragment consisting of 18 bases coding for 6 amino acids is used, temperatures of not more than 50°C are preferable.

The hybridization may select a gene derived from a natural source, for example, such as plants belonging to family Bryophyta. Genes derived from other natural sources may be used as well. Further, a gene selected by hybridization may be cDNA or genomic DNA.

(1-2) Proteins according to the present invention A protein according to the present invention is encoded

by a gene of the present invention described in (1-1) above.

Encoded by a gene of the present invention, the protein catalyzes the biosynthesis of piperitol from pinoresinol, or sesamin from piperitol. The present invention also includes a protein that catalyzes the reaction forming a methylene dioxybridge in the pinoresinol or piperitol. For example, a protein of the present invention may include (a) the amino acid sequence of SEQ ID NO: 1, 64 or 78, or (b) the amino acid sequence modified by substitution, deletion, insertion, and/or addition of one or more amino acids of SEQ ID NO: 1, 64 or 78 yet retains its catalytic action in the biosynthesis of piperitol and/or sesamin.

Using a gene of the present invention described in (1-1) above, a protein according to the present invention may be obtained by introducing the gene into a host cell and expressing the gene therein. Alternatively, a protein may be obtained by isolating and purifying from cells or tissues.

Further, a protein according to the present invention may be a fusion protein with other proteins. Further, a protein according to the present invention may be chemically synthesized.

It should be noted that a protein according to the present invention is not limited to polypeptides as chains of amino acids forming peptide bonds. The protein may be a complex protein of a structure other than the polypeptide, or

a protein with additional polypeptides. Examples of additional polypeptides include various epitopes such as His, Myc, and Flag, tagged to a protein of the present invention.

(2) Method of obtaining genes and proteins of the present invention A method of obtaining (producing) a gene and protein of the present invention is not particularly limited. The following describes some representative methods.

(2-1) Method of obtaining genes For genes with natural base sequences, a gene according to the present invention may be obtained by screening of cDNA libraries for example, as will be described in detail later in Examples.

Alternatively, a gene according to the present invention may be obtained with the use of amplification means such as PCR. For example, PCR may be carried out with primers designed according to the sequences at the 5'end and 3'end of a cDNA sequence (or the complementary sequences thereof) of a gene according to the present invention, using the genomic DNA (or cDNA) as a template, for example. The PCR amplifies a region of DNA flanked by the primers, thereby obtaining a large amount of DNA fragments containing a gene of the present invention.

In another method, a polynucleotide having a desired sequence is synthesized by conventional chemical synthesis

procedures, based on gene sequence information.

DNA encoding an enzyme having a modified sequence of amino acid may be synthesized by PCR or ordinary site-directed mutagenesis, using DNA with a natural base sequence. For example, in order to obtain modified DNA fragments of interest, natural cDNA or genomic DNA is treated with restriction enzymes to obtain DNA fragments.

Using the DNA fragment as a template, PCR or site-directed mutagenesis is carried out with primers that have been mutated to desired sequences. The mutated DNA fragments are then joined to polynucleotides (DNA) which encode an enzyme of interest and from which regions of the corresponding DNA fragments have been removed. For example, DNA that encodes an enzyme with a shorter amino acid sequence may be obtained with the use of certain restriction enzymes that cuts DNA encoding an amino acid sequence (e. g. , full length amino acid sequence) longer than an amino acid sequence of interest. If the resulting DNA fragments do not encode the entire amino acid sequence of the enzyme of interest, a DNA fragment that makes up the deficient sequence may be synthesized and joined.

The resulting gene is expressed in the gene expression system of Escherichia coli or yeasts to obtain a protein it encodes. The enzyme activity of the protein may be evaluated to confirm whether the gene has encoded a protein that

catalyzes the biosynthesis of piperitol from pinoresinol, or sesamin from piperitol. Further, with the gene introduced into a cell, the protein catalyzing the biosynthesis of piperitol from pinoresinol, or sesamin from piperitol can be obtained.

Further, with an antibody against a protein consisting of the amino acid sequence of SEQ ID NO: 1,64 or 78, the gene encoding the enzyme for catalyzing the synthesis of piperitol and sesamin can be also cloned from other organisms.

(2-2) Method of obtaining proteins A method of obtaining (producing) a protein of the present invention is not particularly limited. For example, simple purification from cells or tissues expressing the protein may be used. The cells or tissues expressing a protein of the present invention may be self-generating. For example, cells or tissues infected with a recombinant baculovirus may be used. Purification is not limited to a particular method, and may be carried out as follows. After a transformed host by an expression vector including a gene of the present invention is cultured, cultivated, or bred, a protein of interest may be collected and/or purified from the culture according to ordinary procedures, including filtration, centrifugation, cell-disruption (cell lysis), gel chromatography, and ion exchange chromatography, for example.

Another way to obtain a protein of the present invention is to use recombinant techniques. As one example, a gene

according to the present invention is inserted in a carrier such as a vector, introduced into host cells by a conventional method, and expressed therein. The translated protein is purified. Specific methods of gene transfer (transformation) and gene expression will be described later.

The protein for catalyzing the synthesis of piperitol and sesamin can also be obtained with the use of an antibody against a protein consisting of the amino acid sequence of SEQ ID NO: 1,64 or 78. Further, with the antibody, the enzyme catalyzing the biosynthesis of piperitol and sesamin, and the gene encoding the enzyme can also be cloned from other organisms.

Note that, in introducing the foreign gene into a host, various types of hosts and expression vectors can be used.

Here, the expression vectors have been integrated with a promoter that functions to express the gene in the host. A type of host or expression vector used is suitably selected according to intended use. Different purification methods are available for the product proteins, depending on the type of host used and the characteristics of the protein of interest.

Regardless of the type of purification method used, use of a tag or the like helps purification of the protein of interest with relative ease.

A method of preparing a mutant protein is not particularly limited either. For example, a mutant protein may

be generated by introducing a point mutation in the base sequence using conventional mutant protein inducing methods, including a site-directed mutagenesis (see Hashimoto-Gotoh, Gene 152,271-275 (1995), for example) or PCR. Alternatively, a mutant protein may be generated by a method in which mutant strains are produced by insertion of transposons. With any of these methods, the base sequence of cDNA encoding the protein (a) can be altered by substitution, deletion, insertion, and/or addition of one or more bases, so as to produce a mutant protein. For the preparation of mutant proteins, a commercially available kit may be used as well.

Other than the foregoing methods, a protein according to the present invention may be obtained by chemical synthesis using a commercially available peptide synthesizer, for example. Alternatively, a protein according to the present invention may be synthesized from a gene according to the present invention, using a cell-free protein synthesis solution as described in Proc. Natl. Acad. Sci. USA, 78,5598-5602 (1981), J. Biol. Chem., 253,3753-3756 (1978).

(3) Antibody according to the present invention An antibody according to the present invention (polyclonal antibody or monoclonal antibody) is obtained according to a conventional method, by using a protein of the present invention described in Section (1-2) above as an

antigen (for example, protein (a), (b), or fragments of these proteins). Examples of the conventional method include Harlow et al.; Antibodies: A laboratory manual (Cold Spring Harbor Laboratory, New York (1988), and Iwasaki et al.; Monoclonal antibody, hybridoma and ELIZA, Kodansha (1991).

The antibody may be used in the detection and/or measurement of a protein according to the present invention.

(4) Recombinant vectors according to the present invention A recombinant expression vector according to the present invention include a gene of the present invention described in Section (1-1) above (e. g. , gene encoding protein<BR> (a) or (b) ). A recombinant expression vector that has incorporated cDNA is one example of a recombinant expression vector of the present invention. The recombinant expression vector may be generated using a plasmid, phage, or cosmid as non-limiting examples. Further, the recombinant expression vector may be generated by a convention method.

The type of'vector is not particularly limited as long as it is expressed in a host cell. Specifically, such vectors are prepared by incorporating a gene of the present invention in a plasmid along with a promoter sequence that has been suitably selected to ensure gene expression. The promoter sequence depends on the type of host cell.

Various markers may be used to confirm whether a gene

of the present invention has been introduced in a host cell, or whether the protein encoded by the gene has been <BR> <BR> successfully expressed in the host cell. The marker (e. g. , a gene lacking in the host cell) is integrated with a carrier, such as a plasmid, together with a gene of the present invention, and is introduced into the host cell as an expression vector. Successful uptake of the gene of the present invention may be confirmed by checking the expression of the marker in the host cell that has incorporated the expression vector. Alternatively, a protein according to the present invention may be expressed in the form of a fusion protein in the host cell. For example, a protein according to the present invention may be expressed as a fusion protein with a green fluorescence protein (GFP) derived from Aequorea victoria. In this case, the GFP is used as a marker.

The host cell is not particularly limited, and various conventionally available cells may be used. Specifically, as a prokaryotic host, bacteria belonging to genus Escherichia (for example, Escherichia coli), or genus Bacillus (for example, Bacillus subtilis) may be used. As a eukaryotic host, lower eukaryotes (for example, eukaryotic bacteria including true fungi (e. g. , yeasts, filamentous fungi) ) may be used. Yeasts may be microorganisms that belong to genus Saccharomyces (for example, Saccharomyces cerevisiae). Examples of molds

(filamentous fungi) include microorganisms that belong to genus Aspergillus (for example, Aspergillus oryzae, Aspergillus niger), or genus Penicillum. Further, the host cell may be an animal cell or a plant cell. The plant cell may be obtained from various plants, including those belonging to family Pedaliaceae or Poaceae. As for animal cells, animal cell lines obtained from mice, hamsters, monkeys, or human may be used, for example. Insect cells (for example, silkworm cell or adult silkworm itself) may be used as well.

A recombinant expression vectors according to the present invention includes expression control regions (for example, promoter, terminator, and/or origin of replication), which vary depending on the type of target host cell. For bacteria expression vectors, ordinary promoters, for example, such as trc promoter, tac promoter, or lac promoter are used.

As yeast promoters, a glycelaldehyde triphosphate dehydrogenase (GAPDH) promoter, or PH05 promoter is used, for example. Amylase promoter or trpC promoter may be used as a filamentous fungi promoter, for example. For animal cell hosts, viral promoters (for example, SV40 early promoter, SV40 late promoter) are used. The expression vector may be prepared according to ordinary method, for example, using restriction enzymes and/or ligase. Transformation of hosts by the expression vector may also be carried out according to ordinary method.

A method of introducing the expression vector into the host cell (method of transformation) is not particularly limited and various conventional methods may be used, including an electroporation method, calcium phosphate method, liposome method, DEAE dextran method, and micro injection method, for example.

(5) Transformants according to the present invention A transformant according to the present invention has incorporated a gene of the present invention described in Section (1-1) above (for example, a gene encoding protein (a) or (b) ). As used herein, "incorporation of a gene"means transfer of a gene into a target cell (host cell) by conventional genetic engineering techniques. Further, as the term is used herein, "transformant"means not only cells, tissues, or organs, but also living organisms themselves.

A transformant according to the present invention may be prepared (produced) using a recombinant expression vector of the present invention described in Section (4) above. The organisms to be transformed are not particularly limited, and may be microbes or plants as exemplified above. With a suitable promoter or vector, animals and insects can be transformed as well.

In preferred embodyment, a sesame is used to generate a transformant according to the present invention. The method of generating a sesame transformant include a known method

described in, for example, T. Asamizu: Transformation of sesame plants using MAT vector system: introduction of fatty acid desaturase genes. Sesame Newsletter 16: 22-25 (2002).

Further, as the term is used herein,"transformant"also includes plants into which genes of the present invention have been transferred, and their offspring having the same characteristics. Tissues of these plants also fall within the meaning of"transformant".

(6) Gene detecting device according to the present invention A gene detecting device according to the present invention comprises a polynucleotide as a probe consisting of a base sequence, or its complementary sequence, which is at least a portion of a gene according to the present invention. A gene detecting device of the present invention can be used under a variety of conditions for the detection and/or measurement of expression pattern of a gene of the present invention.

One example of a gene detecting device according to the present invention is a DNA chip, in which a probe that specifically hybridizes with a gene of the present invention is immobilized on a substrate (carrier). As used herein"DNA chip"generally means a synthetic DNA chip that uses a synthetic oligonucleotide as a probe. However, it also includes an adhesion DNA microarray that uses cDNA as a probe

produced by PCR, for example.

As used herein, the term"oligonucleotide"means a chain of several to several ten nucleotides joined together, and it is used interchangeably with"polynucleotide". The oligonucleotide may be generated or chemically synthesized as a fragment of a longer polynucleotide.

The base sequence of the polynucleotide used as a probe may be determined by a conventional method of specifying a characteristic sequence of cDNA sequences. (For example, a SAGE (Serial Analysis of Gene Expression) method, as described in Science 276: 1268,1997 ; Cell 88: 243,1997 ; Science 270: 484,1995 ; Nature 389: 300,1997 ; US patent No.

5,695, 937 may be used.) The DNA chip may be made by a conventional method.

For example, when a synthetic oligonucleotide is used, it may be synthesized on a substrate by a combination of photolithography and solid phase DNA synthesis technique.

On the other hand, when the oligonucleotide is cDNA, it is stuck on a substrate using an array device.

Further, as in common DNA chips, the accuracy of gene detection can be improved by placing a perfect-match probe (oligonucleotide) with a mismatch probe that has been prepared by substituting a single base of the perfect-match probe. Further, in order to detect different genes simultaneously, a DNA chip may be prepared in which

different types of oligonucleotides are immobilized on a single substrate.

(7) Use (effectiveness) of genes and proteins according to the present invention The foregoing descriptions mainly discussed the enzyme that catalyzes the biosynthesis of piperitol and/or sesamin in sesame. However, the present invention is not just limited to sesame-derived genes, and it also relates to use of the enzyme that catalyzes the biosynthesis of piperitol and/or sesamin.

The enzyme for catalyzing the biosynthesis of piperitol and/or sesamin may be derived from plants, animals, or microorganisms. Regardless of where the enzyme derives from, the enzyme can be used to control lignan level as long as it has the enzyme activity in the biosynthesis of piperitol and/or sesamin. Further, the invention concerns plants with a controlled lignan level, and their offspring and tissues, obtained by transferring a gene that encodes an enzyme for catalyzing the biosynthesis of piperitol and/or sesamin. The plant may be in the form of cut flowers. A gene of the present invention that encodes an enzyme for catalyzing the biosynthesis of piperitol and/or sesamin can be used to generate piperitol or sesamin, or suppress formation of piperitol or sesamin. With the current technology, it is perfectly possible to transfer the gene in plants and express it constitutively or tissue-specifically. It is also possible to

suppress expression of a gene of interest using an anti-sense method, co-suppression method, RNAi method, or the like.

Non limiting examples of plants that can be transformed include: sesame, rice, Forsythia suspenta, tabaco, Arabidopsis thaliana, bird's-foot trefoil, barley, wheat, rapeseed, potatoes, tomatoes, poplar, bananas, eucalyptus, sweet potatoes, soy, alfalfa, lupin, corns, cauliflower, roses, chrysanthemum, carnation, Antirrhinum majus, cyclamen, orchid, Eustome russellianum, freesia, gerbera, gladiolus, Baby's Breath (Grypsophila elegans), kalanchoe, lily, pelargonium graveolen, geranium, petunia, torenia, tulips, and the like.

The present invention provides a protein producing method including the step of incubating or breeding a transformant described in Section (5) above, and the step of obtaining from the transformant a protein that catalyzes the biosynthesis of piperitol and/or sesamin. The method allows for easy and mass production of the protein that catalyzes the biosynthesis of piperitol and/or sesamin. The method only requires the foregoing steps, and other conditions are not particularly limited and may be suitably set (for example, type of host, materials used, and settings).

A gene and protein according to the present invention may be used in a producing method of piperitol and/or sesamin in which piperitol and/or sesamin are biosynthesized.

The method allows for easy and mass production of piperitol

and/or sesamin, and therefore may be used in manufacture of food (health food in particular) and pharmaceuticals containing piperitol and/or sesamin. Note that, as used herein"using a gene or protein"means use of a gene or protein under various conditions, including in vivo, in vitro, and ex vivo. In one example of a producing method of piperitol and/or sesamin, a gene is transferred into a plant cell, and piperitol and/or sesamin are biosynthesized in the transformant plant in vivo. In another example, a protein of the present invention is allowed to react with a precursor of piperitol or sesamin ex vivo for the biosynthesis of piperitol and/or sesamin. Further, piperitol and/or sesamin may be produced using a bioreactor, for example. It should be noted here that the gene itself does not possess the enzyme activity for the biosynthesis. It is therefore preferable that the gene be used in the form of a protein it encodes.

The present invention also provides a producing method of a transformant with a high lignan content or low lignan content, using genes of the present invention. The method can be used to produce a transformant with a controlled lignan level, both easily and conveniently. The lignan may be extracted, or the transformant itself may be used as a food or pharmaceutical, or as a precursor of these products.

As used herein, "plants with a high piperitol and/or sesamin content"means plants containing a large amount of

piperitol and/or sesamin. As used herein, "plants with a low piperitol and/or sesamin content"means plants containing a small amount of piperitol and/or sesamin.

A gene according to the present invention may be used in a method of cultivating sesame. For example, the method can be used to transfer a gene of the present invention into sesame and cultivate sesame containing a large amount of piperitol and/or sesamin. The transformant sesame produces seeds that contain lipids with a high piperitol and/or sesamin level, which can then be collected.

While the invention is susceptible to various modifications and alternative forms, a specific embodiment thereof will be described below in more detail by way of Examples with reference to the attached drawings. It should be understood, however, that it is not intended to limit the invention to the particular forms disclosed, but on the contrary, the invention is to cover all modifications, equivalents, and alternatives falling within the scope of the invention as defined in the appended claims.

[Examples] The invention is now described in detail based on the following Examples. Unless otherwise noted, the molecular biological techniques used in these Examples are based on Molecular Cloning (Sambrook et al.).

[Example 1 : Preparation of lignan from Forsythia

intermedia] Sixty grams of dried Forsythia intermedia leaves was boiled in 3 L water for 30 minutes, and the extract was filtered with a filter (circle; 300 mm; Tokyo roshi Kaisha).

After reducing the solvent volume of a filtrate dispensed with an evaporator, the sample was freeze-dried (30/N), and an extracted freeze-dried sample was obtained (21.6837 g).

Seven grams of the extracted freeze-dried sample was re-dissolved in 100 ml water and was homogenized by ultrasonic wave. Then, a column filled with 400 ml Diaion HP-20 resin (Mitsubishi Chemical Corporation) was washed with 800 ml of 50% acetone and equilibrated with 2 L water.

Using the column, the homogenized sample was crudely purified. Specifically, after loaded with the sample, the column was washed with 800 ml water, followed by first elution with 800 ml of 50% methanol, and second elution with 800 ml of 100% methanol.

Each sample obtained from the first elution and second elution was analyzed by HPLC under the following conditions to determine the presence or absence of lignan. For the mobile phase, solution A (0.1% trifluoroacetic acid (TFA) ) and solution B (0.1% TFA, 90% acetonitrile) were used. For the column, Develosil C30-UG-5 (Nomura Chemical Co., Ltd. , 4.6 mm x 150 mm) was used. After equilibrating the column with a mixture containing 60% solution A and 40% solution B (20

minutes), the sample was eluted for 15 minutes at a flow rate of 0.6 ml/minute through a linear gradient of 60% solution A and 40% solution B to 10% solution A and 90% solution B.

The sample was further eluted for another 10 minutes with 10% solution A and 90% solution B. Proteins were detected at an absorption wavelength of 287 nm and were fractionated every minute (crude fractionation analysis). Using the SPD-10AV (Shimazu Corporation), each fraction was measured in a spectral range of 220 nm to 400 nm to search for a substance with two absorption maxima, which are characteristic absorption patterns of lignan (230 nm and 280 nm). The result of measurement showed that a substance with the characteristic absorption maxima of lignan was contained mainly in the second elute.

To separate lignan, fractionation was carried out under the following conditions (main fractionation). The second elute was concentrated with an evaporator, and was freeze-dried (dry weight 1.0325 g). One gram of the extracted freeze-dried sample was dissolved first in 1 ml dimethyl sulfoxide (DMSO), and then in 6 ml of 30% solution B, using sonication both times. The elute was separated by centrifugation at 15000 rpm, and about 6 ml of the supernatant was loaded onto a column (Develosil ODS-UG-15/30; C-18 (Nomura Chemical Co. , Ltd. , 50 mm x<BR> 500 mm) ). For the mobile phase, solution A (0.05% TFA) and

solution B (0.05% TFA, 90% acetonitrile) were used. After equilibrating the column for 20 minutes at a flow rate of 32 ml/minute with 70% solution A and 30% solution B, the sample was loaded onto the column and was eluted for 60 minutes with a linear gradient of 70% solution A and 30% solution B to 10% solution A and 90% solution B. The sample was further eluted for 30 minutes with 10% solution A and 90% solution B. Detection was made at an absorption wavelength of 280 nm, and the elute was fractionated every minute (32 ml) (detector: 115UV detector, the Gilson product).

At the end of the procedure, eight fractions with an absorption peak at 280 nm were obtained. These fractions were concentrated with an evaporator, and were freeze-dried.

Each fraction was analyzed by HPLC under the same conditions as in the crude fractionation analysis. Among these fractions with the characteristic spectrum of lignan, fractions believed to be pure lignan were analyzed by lH NMR analysis so as to identify pinoresinol. As a result, 49.7 mg of pinoresinol was obtained.

[Example 2: lignan analysis in sesame seeds] Immature sesame seeds were collected out of the sheath of cultivated sesame, and were freeze-dried to extract lignans with acetone. The following describes a preparation method of acetone extract.

About 100 mg of freeze-dried sample of crushed sesame

seeds was dissolved in 1 ml of acetone to obtain an acetone extract. 100 pi of acetone extract was dried and re-dissolved in 20 pi DMSO, followed by addition of 80 pi of 50% acetonitrile containing 0.1% TFA. The solution was filtered with the Millex-LH filter (Millipore Corporation, 0. 45 pm/4 mm) to prepare a sample for HPLC analysis. The result of HPLC analysis confirmed the presence of lignans, with their retention times matching the retention times of piperitol (a standard sesame lignan (control) ) (retention time of about 12 minutes), sesamin (retention time of about 16 minutes), and sesamolin (retention time of about 18 minutes).

The sesame seeds were divided into four stages according to their growth stages.

Stage 1: Seed sheath no longer than 1.5 cm Stage 2: Seed sheath between 1.5 cm and 2 cm Stage 3: Seed sheath 2 cm or greater, yellowish green sheath Stage 4: Seed sheath 2 cm or greater, dark green sheath The result of HPLC analysis under these conditions confirmed accumulation of piperitol, sesamin, and sesamolin in Stage 3 and Stage 4.

That is, the result indicates the presence of an enzyme, and a gene encoding it, involved in the production of piperitol and sesamin in the sesame used in this Example.

[Example 3: preparation of sesame cDNA library] From the sesame seeds used in Example 2, total RNA was extracted with the RNeasy Plant Mini Kit (QIAGEN) according to the method recommended by the manufacturer.

From the extracted RNA, 5 ug of poly A (+) RNA was obtained using the oligotex-MAG mRNA purification kit (TaKaRa). Using the poly A (+) RNA as a template, cDNA library was constructed with the ZAP Express cDNA Synthesis Kit and ZAP Express cDNA Gigapack3 Gold Cloning Kit (Stratagene) according to the method recommended by the manufacturer. The cDNA library was obtained in 1 x 107 pfu/ml.

[Example 4: cloning of sesame-derived lignan biosynthetic gene] About 300,000 pfu of cDNA library were screened. As probes, the sequences of cytochrome P450 gene of Arabidopsis thaliana of a known genome sequence were used.

Specifically, a superfamily of cytochrome P450 genes of Arabidopsis thaliana was phylogenetically classified based on the primary sequences (see Fig. 3), and 13 kinds of Arabidopsis thaliana cytochrome P450 genes (CYP90A, CYP72B, CYP71B, CYP84A, CYP96A, CYP710A, CYP86A, CYP74, CYP75B, CYP79F, CYP81D, CYP705A, and CYP83A) were used as probes for screening the sesame seed library.

The DNA was amplified with primers of SEQ ID NOs: 5 to 30 to introduce DIG-label into the probes. The PCR was carried

out as follows. After extracting total RNA from Arabidopsis thaliana with the RNeasy Plant Mini Kit (QIAGEN), cDNA was obtained by reverse transcription using 1 ug of total RNA as a template. The synthesis of cDNA was carried out with the SuperscriptTM First-Strand Synthesis System for RT-PCR (Invitrogen) under the conditions recommended by the manufacturer. The reaction mixture of PCR (50 ul) contained 1 ul of the Arabidopsis genomic DNA, 1 x Taq buffer (TaKaRa), 0.2 mM dNTPs, primers (SEQ ID NOs: 5 to 30,0. 4 pmol/ul each), and 2.5 U rTaq polymerase. The reaction was carried out at 94°C for 5 minutes, and then in 28 cycles at 94°C for 1 minute, at 53°C for 1 minute, and at 72°C for 2 minutes. The probes were DIG-labeled under the same PCR condition.

Screening and the detection of positive clones were carried out with the DIG-DNA labeling 8s detection kit (Roche).

The detection of positive clones was carried out under reduced stringent hybridization conditions, based mainly on the method recommended by the manufacturer. Specifically, after 2-hour pre-hybridization of membrane at 37°C using a hybridization buffer containing 5 x SSC, 30% formaldehyde, 50 mM sodium phosphate buffer (pH 7.0), 1% SDS, 2% blocking reagent (Roche), 0.1% lauroyl sarcosine, and 80 ug/ml sperm DNA of salmon, DIG-labeled mix probes were added in the buffer and the membrane was incubated overnight. The membrane was washed at 58°C for 1 hour in a

5 x SSC washing solution containing 1% SDS.

After the second screening, 96 independent positive clones were obtained. The 96 clones were inserted in the pBK-CMV plasmid (Stratagene) according to the method recommended by the manufacturer, and the nucleotide sequences at the 5'end and 3'end of the insert region were determined with M13RV primer and M13M4 (-20) primer, respectively. As a result, 46 clones had sequences similar to the sequences of the cytochrome P450. The entire nucleotide sequence of the insert region was determined. The resulting clones were classified into 5 independent P450 homologs (Sesamum indicum P450; SiP) SiP168, SiP189, SiP236, SiP249, and SiP288. Thereafter, RT-PCR was carried out using primers (SEQ ID NOs: 31 to 40) specific to these five SiP genes. As a template, cDNA obtained by reverse transcription of RNA prepared from the leaves and seeds of sesame was used. The result showed that the five SiP genes were expressed in the seeds. The reaction mixture of PCR (25 ul) used in RT-PCR contained 1 ul of each cDNA, 1 x Ex-Taq buffer (TaKaRa), 0.2 mM dNTPs, primers (0.2 pmol/ul each), and 1.25 U Ex-Taq polymerase. The reaction was carried out at 94°C for 3 minutes, and then in 26 cycles at 94°C for 1 minute, at 53°C for 1 minute, and at 72°C for 2 minutes.

Ribosomal 18SRNA (AJ236041) was used as a sesame internal control gene to compare expression levels. Primers of SEQ ID

NOs: 3 and 4 were used for amplification.

[Example 5: generation of transformants including expression vector for the Sip gene] Among the 5 SiP genes, SiP249 (pSPB2031) and SiP288 (pSPB2034) were believed to encode the full open reading frame (SEQ ID NOs: 53 to 56). A 1.8 kb cDNA fragment containing the SiP249 ORF obtained by digesting pSPB2031 with BamHI and XhoI was inserted at the BamHI and SalI sites of the yeast expression vector pYE22m (Holton, T. A et al. , Nature 366,276-279, 1993). As a result, pSPB2046 was obtained. The multiple cloning site of the yeast expression vector pYE22m is flanked by the glyceraldehydes-3-phosphate dehydrogenase gene (GAPDH) promoter and the GAPDH terminator, and the insert inserted in the multiple cloning site is constitutively expressed in yeasts under the regulation of the GAPDH promoter. The selection marker for the vector is tryptophan. Meanwhile, a 1.8 kb cDNA fragment containing the SiP288 obtained by digesting pSPB2034 with BamHI and XhoI was inserted at the BamHI and SalI sites of the yeast expression vector pYE22m. As a result, pSPB2047 was obtained. The two kinds of yeast expression vector were used to transform the yeast INVsc strain (Invitrogen) according to ordinary method, so as to obtain INVsc/pYE22m/SiP249 and INVsc/pYE22m/SiP288.

The genes of SiP168, SiP189, and SiP236 had incomplete

open reading frames. In order to obtain sequences with complete open reading frames (SEQ ID NOs: 1 and 2, and 57 to 60), the 5'end of each gene was amplified with the GeneRacer kit (Invitrogen) according to the method recommended by the manufacturer. Primers of SEQ ID NOs: 41 to 46 were used for amplification. The sequences amplified the 5'end of each SiP gene were determined.

In order to amplify the full length open reading frames of these 3 kinds of SiP genes, the genes were amplified by PCR.

As a template, cDNA derived from sesame seeds was used. The primers used the PCR had restriction enzyme sites as noted below. The PCR was carried out as follows. The reaction mixture of PCR (50 ul) contained 1 ul of template cDNA derived from sesame seed, 1 x KOD plus buffer (TOYOBO), 0.2 mM dNTPs, primers (SEQ ID NOs: 47 to 52,0. 4 pmol/ul each), 1mM MgS04, and 1 U KOD plus DNA polymerase. The reaction was carried out at 94°C for 5 minutes, and then in 30 cycles at 94°C for 1 minute, at 55°C for 1 minute, and at 72°C for 2 minutes. The amplified fragment containing the full length SiP was inserted in the multiple cloning site of the pCR-blunt II TOPO vector (Invitrogen), so as to obtain TOPO-SiP168 (pSPB2064), TOPO-SiP189 (pSPB2055), and TOPO-SiP236 (pSPB2048).

A DNA fragment containing cDNA of about 1.5 kb obtained by digesting the restriction enzyme sites of the

primers used in the amplification of pSPB2064, pSPB2055, and pSPB2048 was inserted at the BamHI and SalI sites of the yeast expression vector pYE22m. As a result, pYE22m/SiP168 (pSPB2052), pYE22m/SiP189 (pSPB2053), and pYE22m/SiP236 (pSPB2049) were obtained.

The three kinds of yeast expression vectors were used to transform the yeast INVsc strain (Invitrogen), so as to obtain INVsc/pYE22m/SiP168, INVsc/pYE22m/SiP189, and INVsc/pYE22m/SiP236.

[Example 6: biosynthesis of sesame lignan in the transformants] In addition to the INVsc/pYE22m/SiP249 and INVsc/pYE22m/SiP288, the three transformants INVsc/pYE22m/SiP168, INVsc/pYE22m/SiP189, and INVsc/pYE22m/SiP236 were incubated at 30°C for 36 hours in a 400 mL YNBDglc medium (0.67% yeast nitrogen base, 2% glucose, and 20 mg/L of various amino acids excluding tryptophan). From the culture solutions of the yeast transformants, microsome fractions were collected by a known <BR> <BR> ultracentrifugation method (Holton, T. A et al. , Nature 366, 276-279,1993).

The microsome sediments were suspended in 1 ml of suspension buffer (0. 1M potassium phosphate buffer (pH 7.4), 20% glycerol, 0. 3 ul/ml mercaptoethanol) to obtain a microsome solution. To 240 ul of the microsome solution was

added 30 ul of 1M potassium phosphate buffer (pH 7.4), 6 ul of 50mM NADPH, and 24 ul of 267 uM pinoresinol or piperitol.

The mixture was reacted at 30°C for 1 hour. To the reaction mixture of enzyme, an equal amount of 100% acetonitrile containing 0.1% TFA was added (50% final concentration).

The mixture was centrifuged at 15000 rpm at 4°C for 3 minutes, and 150 ul of supernatant was purified with a Millex-LH filter (Millipore Corporation, 0. 45 um/4 mm). The purified sample was then analyzed by HPLC under the same conditions as in the crude fractionation analysis described in Example 1.

The result of HPLC analysis for INVsc/pYE22m/SiP189 is described below with reference to Fig. 2 (a) through Fig. 2 (e).

In the INVsc/pYE22m/SiP189, the pinoresinol (Fig. 2 (a), retention time of about 8 minutes) converted into two products with a lignan-like absorption spectrum with retention times of about 12 minutes and about 16 minutes, <BR> <BR> respectively (Fig. 2 (b) ). Further, in the INVsc/pYE22m/SiP189, the piperitol (Fig. 2 (d), retention time of about 12 minutes) was converted into a product with a lignan-like absorption spectrum with a retention time of about 16 minutes (Fig. 2 (e)).

From these retention times, they were considered to be piperitol (retention time of about 12 minutes) and sesamin (retention time of about 16 minutes), respectively.

By the LC-MS/MS analysis (LC: Waters 2790, the Waters

product; MS: QUATRO micro, the Micromass product), the two peaks (retention times of about 12 minutes and 16 minutes) were compared with a standard based on their molecular weights and fragment patterns. The result showed that their molecular weights matched that of the standard, thereby identifying the two SiP189 products as piperitol and sesamin.

The LC-MS/MS analysis was carried out under the following conditions. For LC, the Develosil C30-UG-5 (Nomura Chemical Co. , Ltd. , 2.0 x 50 mm) was used. As the mobile phase, solution A (H20), solution B (methanol), and solution C (10 mM CH3COONH4) were used. The flow rate was 0. 25ml/min with 10% solution C. Under these conditions, piperitol and sesamin were detected after 8.4 minutes and 10.1 minutes.

MS was carried out in a POSITIVE measurement mode. It was found as a result that SiP189 encodes an enzyme that synthesizes sesamin from pinoresinol via piperitol. Initially, it was believed that different enzymes were involved in the synthesis of piperitol from pinoresinol, and the synthesis of sesamin from piperitol. The present invention proved the contrary, showing that a single enzyme encoded by a gene of the present invention is involved in the both reactions.

From the result of analysis using the reaction system that did not contain NADPH in the reaction mixture of enzyme, it was found that the activity of INVsc/pYE22m/SiP189 to produce piperitol was dependent on NADPH (Figs. 2 (b) and

2 (C) ). The same was the case for sesamin (Figs. 2 (e) and 2 (f)).

In the absence of NADPH, the activities for producing piperitol and sesamin dropped to about 14% and about 18%, respectively, from the levels in the presence of NADPH.

Thereafter, the microsome fraction of the INVsc/pYE22m/SiP189 was reduced by CO, and its absorption spectrum was measured with a spectrophotometer (the Hitachi product U-3000P Spectrophotometer). The result showed that the microsome fraction had absorption at 450 nm, compared with the transformant yeast INVsc/pYE22m used as a control. The result therefore confirmed production of cytochrome P450 protein in the microsome fraction of the INVsc/pYE22m/SiPl89.

According to the method described in Example 2, RNA was extracted from the sesame seeds separated into 4 stages according to their growth in Example 2. RT-PCR was carried out using a primer set (SEQ ID NOs: 49 and 50) for amplifying SiP189, and a primer set (SEQ ID NOs: 3 and 4) for amplifying Sil8SrRNA. The reaction mixture of PCR (25 ul) contained 1 lul of each cDNA, 1 x Ex-Taq buffer (TaKaRa), 0.2 mM dNTPs, primers (0.2 pmol/ul each), and 1.25 U Ex-Taq polymerase.

The reaction was carried out at 94°C for 5 minutes, and then in 26 cycles at 94°C for 1 minute, at 53°C for 1 minute, and at 72°C for 2 minutes. The result confirmed strong expression of SiP189 in Stage 4, in which accumulation of sesamin in the

seed becomes notable. From the fact that the growth-stage dependent lignan accumulation coincides with the time of SiP189 gene expression, the SiP189 gene was found to encode the enzyme for producing piperitol and sesamin in the sesame seeds.

The foregoing results show that the SiP189 gene encodes cytochrome P450 that catalyzes the reaction producing piperitol from pinoresinol, and the reaction producing sesamin from piperitol. In Fig. 3, SiP189 is indicated by arrow.

It can be seen from Fig. 3 that SiP 189 belongs to the CYP81 family in the cytochrome P450 superfamily.

The gene enables synthesis of sesamin and piperitol using various organisms including sesame and other plants, or a system such as a bioreactor.

[Example 7: genomic analysis of SiP 189 gene in cultivated sesame Sesamum indicum In order to find the number of copies of SiP 189 gene in the S. indicum genome, a genomic Southern analysis was carried out.

Using the Nucleon Phytopure for Plant Extraction Kit (Amersham), genomic DNA was extracted from the leaves of S. indicum (cultivar Masekin) according to the method recommended by the manufacturer. Ten micrograms of the genomic DNA so extracted was completely digested by three kinds of restriction enzymes EcoRI, NcoI, and XbaI, and each

sample was separated by electrophoresis on an agarose gel.

The agarose gel was hydrolyzed for 15 minutes using 0.25 M HC1, denatured for 30 minutes using a solution containing 1.5 M NaCl and 0.5 M NaOH, and neutralized with a solution containing 1.5 M NaCl and Tris-HCl (pH 7.5). The genomic DNA was then transferred onto a membrane (Hybribond-N, Amersham) in 20 x SSC, and was bound to the membrane by irradiation of ultraviolet light. The membrane was subjected to pre-hybridization at 42°C for 1 hour in a hybridization buffer (high SDS buffer) containing 7% SDS, 50% formamide, 5 x SSC, 2% blocking reagent, 0.1% lauroyl sarcosine, and 50 mM sodium phosphate buffer (pH 7.0).

As a hybridization probe, an ORF region of about 900 bp starting from a start methionine of the cDNA of the SiP 189 was used. This region of the cDNA was DIG-labeled by PCR, using primers of SEQ ID NO: 61 (Bam-SST-FW2) and SEQ ID NO: 62 (SiP189-Nco-RV). The reaction mixture of PCR contained 1 ng of plasmid (pSPB2055) containing the cDNA of SiP189, 1 x PCR buffer, 1 x DIG-dNTP mixture (PCR DIG Labeling Mix, Roche), 0.2 pmol/ul of each primer, and 1U <BR> <BR> rTaq polymerase (TAKARA BIO INC. ). The PCR was carried out in 30 cycles at 95°C for 30 seconds, at 53°C for 30 seconds, and at 72°C for 1 minute.

The PCR product was purified with the Sephadex G-50 quick spin column (Boehringer), and was heat denatured and

immediately placed on ice. As a hybridization probe, 10 pi of the denatured product was added to a prehybridization solution, and the mixture was incubated overnight at 42°C.

The membrane was washed twice at 65°C for 30 minutes in a 0.2 x SSC solution containing 0.1% SDS (high stringent hybridization condition). Hybridization signals were obtained with the DIG-labeling 8n detection kit (Roche) according to the DIG application manual (Roche).

Fig. 5 shows the result of detection. As shown in Fig. 5, the SiP 189 gene was detected as a single band for all of the three restriction enzyme treatments. The result indicates that the SiP 189 gene exists as a single gene in the S. indicum genome, and that no other genes in the genome have strong homology to the SiP189 gene. Therefore, it can be said that the catalytic activity for synthesizing piperitol and sesamin in sesame plants is conferred by the SiP189 gene.

[Example 8: isolation of SiP189-like gene from Sesamum radiatum] Sesamum radiatum is a sesame plant found in Africa. A cytogenetic analysis has revealed that the number of chromosomes in this particular species of sesame plant was 2n = 64, indicating a cytogenetically different lineage from the S. indicum (2n = 26) (Mitsuo Namiki, Teisaku Kobayashi, Science of sesame, Asakura Shoten). A lignan content has also been analyzed in the seeds of S. radiatum, and

accumulation of sesamin has been reported (Bedigian, D. , et al. Biochemical Systematics and Ecology 13,133-139, 1985).

This suggests that S. radiatum should have a gene (SrSiP189) encoding an enzyme corresponding to the SiP189 of S. indicum In addition, it is expected that the sequence of the SrSiP189 gene of the S. radiatum is highly homologous to the sequence of SiP189.

According to the procedure of Example 4, cDNA was prepared from the seeds of S. radiatum. Using 1 ul of the cDNA as a template, RT-PCR was carried out according to the method of Example 4, using primers of SEQ ID No: 61 (Bam-SST-FW2) and SEQ ID NO: 63 (GR-SST-RV1). The primers were designed according to the sequence of SiP189, so as to amplify fragments probably containing a full length ORF. The RT-PCR produced a fragment of about 1.5 kb, which was believed to contain SrSiP189. The fragment was inserted in the pCR-blunt II TOPO vector (Invitrogen), and pSPB2068 was obtained. The entire nucleotide sequence of the inserted fragment was determined. The result showed 96% sequence homology on the DNA level, and 95% sequence homology on the amino acid level, as compared with SiP189 derived from S. indicum (SEQ ID NO: 64 shows the amino acid sequence of SrSiP189, and SEQ ID NO: 65 shows the nucleotide sequence of SrSiP189). According to the procedure of Example 4, RT-PCR was carried out using the primers (SEQ ID NOs: 61

and 63). As a template, cDNA prepared from the seeds and leaves of S. radiatum was used. The result found strong SrSiP189 expression in seeds but hardly any in leaves. The analysis of SrSiP189 expression by RT-PCR therefore indicated that SrSiP189 was functional in seeds.

[Example 9: functional analysis of SrSiP189 gene in Sesamum radiatum] In order to determine the biochemical function of SrSiP189, a recombinant SrSiP189 protein was expressed in yeasts, and the activity of the recombinant SrSiP189 protein for the biosynthesis of lignan was examined. First, pSPB2068 was digested with restriction enzymes BamHI and XhoI, and a resulting fragment of about 1.5 bp containing cDNA with a full length SrSiP189 was inserted at the BamHI site and SalI site of a yeast expression vector pYE22m. As a result, pSPB2069 was obtained. According to the procedure of Example 6, a microsome was prepared from the yeast transformants, and biosynthesis activities for lignan were measured. Fig. 6 shows a result of HPLC analysis. As shown in Fig. 6, The recombinant SrSiP189 protein had a NADPH dependent catalytic activity for converting pinoresinol into piperitol, and piperitol into sesamin, as with the SiP189 derived from S. indicum (Fig. 6 (a) and Fig. 6 (b) ). In an enzyme reaction mixture containing no NADPH, the catalytic activities for piperitol and sesamin dropped to 16.9% and 8.4%,

respectively (Fig. 6 (c) and Fig. 6 (d) ). The result therefore showed that the SrSiP189 was indeed a counterpart gene of the SiP189 in S. indium.

The foregoing result confirmed that a gene with an SiP189-like sequence is found across species, and that the gene encodes an enzyme that catalyzes the reaction converting pinoresinol into piperitol, and piperitol into sesamin.

The invention being thus described, it will be obvious that the same way may be varied in many ways. Such variations are not to be regarded as a departure from the spirit and scope of the invention, and all such modifications as would be obvious to one skilled in the art are intended to be included within the scope of the following claims.

[Example 10: functional analysis of SiP 189 protein in plant cells] In order to confirm the biological functions of SiP189 protein in plant cells, tabaco (N. tabaccum) was transformed with the SiP189 gene.

With the restriction enzymes BamHI and XhoI, pSPB2055 including SiP189 was digested. A resulting DNA fragment of about 1.5 kb including ORF of the SiP189 was ligated with the BamHI site and SalI site of a plant transforming binary vector pSPB176, so as to obtain a binary vector pSPB2057. The multiple cloning site of the pSPB176 is

flanked by the CaMV35S promoter and the NOS terminator.

The insert inserted in these sites is constitutively expressed in excess in the plant cells under the control of CaMV35S promoter.

The pSPB2057 was transformed into agrobacterium (strain: Aglo) according to a conventional method (Shimonishi et al., New Introductions to Biological and Chemical Experiments 3, Kagaku Doujin (pp. 122-124) ). The agrobacterium transformant was used to infect a tabaco leaf disk.

From the leaves of thirteen lines of the transformants, cDNA was prepared according to the method of Example 2.

Then, RT-PCR was carried out according to the method of Example 4, using primers of SEQ ID NOs: 49 and 50. A ubiquitin gene of tabaco (NtUBQ accession No: U66264) was amplified as an internal control gene, using primers consisting of the nucleotide sequences of SEQ ID NOs: 66 and 67 (NtUBQ-FW, NtUBQ-RW, respectively). The result confirmed the presence of highly-expressed SiP189 gene in lines 6,7, and 12 (Fig. 7).

The following procedure was carried out on ice or at 4°C.

About 15 g of each sample of transformant leaves (lines 6, 7, and 12) and non-transformant leaves was crushed with a pestle in liquid nitrogen, and was dissolved in 30 ml of homogenize buffer (0.1 M potassium phosphate buffer (pH

7.0), 0.5 M mannitol, 5 mM EDTA, 42 mM mercaptoethanol, 50 mM sodium ascorbate, 0.1% BSA, 1 mM PMSF, and 1% PVPP).

The mixture was centrifuged at 10000 x g for 20 minutes, and the supernatant was filtered with a Miracloth. The filtrate was ultracentrifuged at 100000 x g for 90 minutes to obtain a crudely extracted microsome fraction. The microsome fraction (240 p. l) was subjected to the enzymatic reaction with piperitol according to the method of Example 4, and the resulting product was analyzed by HPLC.

The result of HPLC showed that the tabaco-derived microsome expressing SiP189 in excess had a peak which was not observed in the non-transformant, and was dependent on the presence of NADPH in the reaction mixture of enzyme. The peak coincided with the retention time of a sesamin standard, indicating the function of the SiP 189 as a protein with the catalytic activity for the biosynthesis of sesamin in the plant cell (Fig. 8).

[Example 11 : identification of expression regulatory region in SiP189 gene] To get the insight of the transcriptional regulation of SiP 189 gene, a 5'non-coding region of the SiP 189 gene was isolated, sequenced and analyzed. From the genomic DNA of sesame (S. indium), a genome library was constructed using the ABlueSTARTM Vector system (NOVAGEN).

200 ug of sesame genomic DNA was partially digested with the restriction enzyme Sau3AI to obtain fragments of about 20 kb. The DNA fragments were subjected to a sucrose density gradient centrifugation (10% to 40%) at 25000 rpm, at 10°C for 24 hours (SW28 rotor, Beckman). The centrifuged samples were fractionated (1 ml each) using the AUTOMATIC LIQUID CHARGER (Advantec), and the Micro Tube Pump (EYELA). For each fraction, the size of fragments was confirmed by pulse-field gel electrophoresis. The pulse-field gel electrophoresis used a gel with 1% Agarose NA (Amersham biosceience) and 0.5 x TBE, and was carried out in a 0.5 x TBE buffer for 120°/1 sec. to 1 sec. at 6V/cm (CHEF MAPPER, Invitrogen). With the fractions containing fragments with an average fragment size of about 20 kb, a genome library was constructed according to the method recommended by the manufacturer. The library had a titer 1.5 x 106 pfu/500 ul.

The genome library (500000 clones) was screened with a probe, for which about 850 bp ORF region of the SiP189 gene amplified with primers consisting of base sequences of SEQ ID NOs: 68 and 69 (SiP189-bam-FW and SiP189-nco-RV, respectively) was used.

The probed was labeled and detected with the AlkaPhos Direct Labeling and Detection system (Amersham bioscience) according to the method recommended by the manufacturer.

Hybridization was carried out under the following conditions:

Probe: 5 ng/ml hybridization buffer Prehybridization: 55°C for 1 hour Hybridization: 55°C overnight Washing: twice at 55°C for 30 minutes After the third screening, nine kinds of positive clones were isolated, and a gSiP189-#6 with an insert size of about 12 kb was obtained therefrom.

Then, PCR was carried out using primers of nucleotide sequences of SEQ ID NOs : 68 and 69, and phage arm primers STAR-LF1 (SEQ ID NO: 70) and STAR-LR1 (SEQ ID NO: 71), so as to determine the direction and position of the probe inserted in the gSiP189-#6.

The PCR analysis indicated SiP89-#6 contains 5' non-coding region of SiP189 gene, no shorter than 5kb. Based of this result, the entire nucleotide sequence of the gSiP189-#6 was determined.

The inserts were amplified by LA-PCR, using the gSiP189-#6 as a template. The reaction mixture of LA-PCR contained 1 ul of positive clone SM buffer suspension liquid, 1 x LA buffer (TaKaRa), primers (lpmol/ul each), 0.4 mM dNTP, 2 mM MgCl2, and 2.5 U LA-Taq polymerase. The LA-PCR was carried out at 96°C for 5 minutes, and then in 30 cycles at 98°C for 10 seconds, at 55°C for 10 seconds, and at 68°C for 10 minutes. Finally, the product was maintained at 72°C for 15 minutes.

The resulting fragments were physically cut, and DNA fragments of about 1 to 2 kb were fractionated. The fragments were end-blunted and inserted in the HincII site of the pUC118 (TaKaRa) to construct a shot gun library. The library had a titer 2.8 x 106 cfu/ul.

With the shot gun library derived from gSiP189-#6, E. coli DH10B strain (Invitrogen) was transformed by an electroporation method. From randomly picked 192 colonies, DNA was prepared using the TempliPhi DNA Sequencing Template Amplification kit (Amersham bioscience). The DNA so prepared was amplified using M 13-47 (F) primer (SEQ ID NO: 72) and RV-M (R) primer (SEQ ID NO: 73).

The product of amplification was purified with the Clean SEQ (Agecourt), and sequenced using the MegaBASE4000 (Amersham Bioscience). The sequence data was assembled by PHRAP (CAP4), and a CONTIG sequence, including a 5'-sequence of about 13 kb starting from the start methionine site of the SiP189 gene, was obtained.

In order to identify regulatory cis-elements at the 5'-region of the SiP189 gene, a 5'-sequence of about 3 kb starting from the start methionine site of the SiP 189 gene (SEQ ID NO: 74) was analyzed by PLACE (http ://www. dna. affrc. go. jp/PLACE/). The PLACE analysis identified a large number of binding sites for a specific transcription factor family, along with a large number of

regulatory cis-elements that responded to a specific signal.

This indicates their involvement in the expression of the SiP189 gene (Fig. 9 and Fig. 10).

Thereafter, a regulatory non-coding region of the SrSiP189 gene encoding the S. radium-derived enzyme that catalyzes the synthesis of piperitol and sesamin was isolated.

Using the genomic DNA of S. radiatum as a template, PCR was carried out with primers SEQ ID NOs: 75 and 76 (gSST-FWl and gSST-RV2, respectively). The primers were designed based on the genomic sequence of SiP189 set forth in SEQ ID NO: 74.

The reaction mixture contained 1 ul of genomic DNA (50 ng), 1 x Ex-Taq buffer (TaKaRa), 0.2 mM dNTPs, primers (0.2 pmol/ul each), and 1.25 U Ex-Taq polymerase. The PCR was carried out at 94°C for 5 minutes, and then in 30 cycles at 94°C for 1 minutes, at 55°C for 1 minute, and at 72°C for 4 minutes. Finally, the product was maintained at 72°C for 4 minutes. The resulting fragments were subjected to electrophoresis, and a fragment of about 3 kb was obtained.

The fragment was inserted in the multiple cloning site of the pCR-TOPO-XL vector (Invitrogen) according to the method recommended by the manufacturer. As a result, pSPB2664 was obtained. The entire nucleotide sequence of the fragment inserted in the pSPB2664 was sequenced by a primer walking method. As a result, a fragment of about 2.8 kb at the

5'-region of the SrSiP189 gene (SEQ ID NO: 77), a probable sequence including the regulatory cis-elements of SrSiP189 gene, was obtained. The same PLACE analysis was performed as for the genomic SiP 189 gene. The result identified a large number of binding sites for a specific transcription factor family, along with a large number of elements that responded to a specific signal (Fig. 11).

The SiP189 gene derived from cultivated sesame (S. indicum) and the SrSiP189 gene derived from S. radiatum were tested for their sequence identity on the DNA level according to the Clustal-W analysis (MacVector ver. 7.2. 2, Symantech).

These genes only had 78% sequence identity in the 5'-non-coding region of about 3 kb, but had a notably high sequence identity (96%) in the ORF region of the genes (Fig.

12). These results support our findings that SiP 189 and SrSiP189 are highly conserved in terms of protein function but have different expression patterns. In addition to RT-PCR analyses, these cis-element analyses support that the two lignan biosynthetic genes, SiP 189 from S. indicum and SrSiP189 from S. radiatum are not under an identical transcriptional regulation.

[Example 12] S. alatum is an African wild sesame species that greatly differs (also in morphology) from cultivated S. indicum (Namiki et al. , Science of sesame, Asakura Shoten). The number of

chromosomes of S. alatum is the same as S. indium, 2n = 26, and the geographical distribution of S. alatum is in Nigeria, Sudan, and Mozambique in Africa.

With the same method of isolating the SrSiP189 gene from the African sesame S. radiatum in Example 8, a counterpart gene (SaSiP189) of SiP189 was isolated from S. alatum. cDNA of Stage 4 of S. alatum was used as a template for PCR. A fragment of about 1. 5kb, that had been amplified with primers of SEQ ID No: 61 (Bam-SST-FW2) and SEQ ID NO: 63 (GR-SST-RV1), was subcloned to the pCR-blunt II TOPO (Invitrogen). The nucleotide sequence of the inserted fragment was determined by a primer walking method. SEQ ID NO: 78 shows the amino acid sequence of SaSiP189, and SEQ ID NO: 79 shows the nucleotide sequence of SaSiP189.

The resultant SaSiP189 showed 90% sequence identity on the DNA level, and 86% sequence homology on the amino acid level, as compared with SiP189. These results show that the gene SiP 189 of lignan biosynthetic enzyme is a highly conserved gene, even in the presence of geographical isolation or morphological/cytogenetic difference.

The embodiments and concrete examples of implementation discussed in the foregoing detailed explanation serve solely to illustrate the technical details of the present invention, which should not be narrowly interpreted within the

limits of such embodiments and concrete examples, but rather may be applied in many variations within the spirit of the present invention, provided such variations do not exceed the scope of the patent claims set forth below.

INDUSTRIAL APPLICABILITY Previous research on sesamin has found various physiological activities of the compound, and sesamin is now known to possess a wide variety of recuperating effects. The present invention identified a gene that encodes an enzyme for catalyzing the biosynthesis of piperitol from pinoresinol, or of sesamin from piperitol. The sesame-derived cytochrome P450 gene (SiP gene) allows for production of sesamin and piperitol using a recombinant organism, and therefore increases sesamin yield and reduces production cost.

As described above, the sesame-derived SiP189 gene and SrSiP189 gene of the present invention encode cytochrome P450 that catalyzes the biosynthesis of piperitol from pinoresinol, and of sesamin from piperitol. Being an important food source since ancient times, sesame, including its seeds, seed oil, and seed extract, continues to hold its title as one of the healthiest foods available. Among many benefits offered by sesame, its physiological activities have caught the attention of many researchers. The SiP189 gene and SrSiP189 gene identified by the present invention can be used in the

production of sesamin, which conventionally relied solely on sesame seeds. The present invention therefore holds great promise in increasing sesamin yield.

With these and other advantages of the present invention, the invention is useful in agriculture, food industry, pharmaceutical industry, and all other industries related to these fields.