Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ENDOSPERM-PREFERENTIAL PROMOTERS AND USES THEREOF
Document Type and Number:
WIPO Patent Application WO/2015/067538
Kind Code:
A1
Abstract:
The present invention relates to Brassica sequences comprising endosperm-preferential promoter activity. Provided are recombinant genes comprising the endosperm-preferential promoter operably linked to a heterologous nucleic acid sequence, and cells and plants comprising the recombinant gene. The promoters can be used to alter gene expression specifically in the endosperm and to alter seed properties.

Inventors:
LAGA BENJAMIN (BE)
DENOLF PETER (BE)
DE BODT STEFANIE (BE)
CAESTECKER EVELYNE (BE)
Application Number:
PCT/EP2014/073467
Publication Date:
May 14, 2015
Filing Date:
October 31, 2014
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
BAYER CROPSCIENCE NV (BE)
International Classes:
C12N15/82
Domestic Patent References:
WO2009077478A22009-06-25
WO2007065878A22007-06-14
Foreign References:
EP2402446A12012-01-04
EP2370456B12013-04-24
EP1528104A12005-05-04
Other References:
KEYU GU ET AL: "Expression of fatty acid and lipid biosynthetic genes in developing endosperm of Jatropha curcas", BIOTECHNOLOGY FOR BIOFUELS, BIOMED CENTRAL LTD, GB, vol. 5, no. 1, 18 July 2012 (2012-07-18), pages 47, XP021117107, ISSN: 1754-6834, DOI: 10.1186/1754-6834-5-47
Download PDF:
Claims:
CLAIMS:

1. An isolated nucleic acid comprising endosperm-preferential promoter activity selected from the group consisting of:

a. a nucleic acid comprising a nucleotide sequence of any one of SEQ ID NO: 1 to SEQ ID NO: 20 or a functional fragment thereof;

b. a nucleic acid comprising a nucleotide sequence having at least 80% sequence identity to any one of SEQ ID NO: 1 to SEQ ID NO: 20, or a functional fragment thereof; and c. a nucleic acid capable of hybridizing under stringent conditions to the nucleotide sequence of any one of SEQ ID NO: 1 to SEQ ID NO: 20, or a functional fragment thereof.

2. A recombinant gene comprising the nucleic acid according to claim 1 operably linked to a

heterologous nucleic acid sequence encoding an expression product of interest, and optionally a transcription termination and polyadenylation sequence, preferably a transcription termination and polyadenylation region functional in plants.

3. The recombinant gene according to claim 2, wherein the expression product of interest is an RNA molecule capable of modulating the expression of a gene or is a protein.

4. A host cell comprising the isolated nucleic acid according to claim 1, or the recombinant gene according to claim 2 or 3.

5. The host cell of claim 4 which is an E. coli cell, an Agrobacterium cell, yeast cell, or a plant cell.

6. A plant comprising the recombinant gene of claim 2 or 3.

7. The plant according to claim 6, comprising at least two recombinant genes according to claim 2, wherein the nucleic acid according to claim 1, and, optionally, the heterologous nucleic acid sequence, is different in each recombinant gene.

8. Seeds obtainable from the plant according to claim 6 or 7.

9. The plant cell or plant or seeds according to any one of claims 5 to 8, which is a seed crop plant cell or plant or seed.

10. Method of producing a transgenic plant comprising the steps of:

a. introducing or providing the recombinant gene according to claim 2 or 3 to a plant cell to create transgenic cells; and

b. regenerating transgenic plants from said transgenic cell.

11. Method of effecting endosperm-preferential expression of a nucleic acid comprising introducing the recombinant gene according to claim 2 or 3 into the genome of a plant, or providing the plant according to claim 6 or 7.

12. Method for altering seed properties of a plant comprising introducing the recombinant gene

according to claim 2 or 3 into the genome of a plant, or providing the plant according to claim 6 or 7.

13. Use of the isolated nucleic acid according to claim 1 to regulate expression of an operably linked nucleic acid in a plant.

14. Use of the isolated nucleic acid according to claim 1, or the recombinant gene according to claim 2 or 3 to alter seed properties in a plant.

15. Use of the isolated nucleic acid according to claim 1 to identify other nucleic acids comprising

endosperm-preferential promoter activity.

16. The method according to any one of claims 10 to 12, or the use according to claim 13 or 14, wherein said plant is a seed crop plant.

17. A method of producing food, feed, or an industrial product comprising

a) obtaining the plant or a part thereof, of any one of claims 6 to 9; and

b) preparing the food, feed or industrial product from the plant or part thereof.

18. The method of claim 17 wherein

a) the food or feed is oil, meal, grain, starch, flour or protein; or

b) the industrial product is biofuel, fiber, industrial chemicals, a pharmaceutical or a nutraceutical.

Description:
ENDOSPERM-PREFERENTIAL PROMOTERS AND USES THEREOF

FIELD OF THE INVENTION

[ 1 ] The present invention relates to materials and methods for the expression of a gene of interest in the endosperm of plants, even more specifically in seed crop plants. In particular, the invention provides an expression cassette for regulating endosperm-preferential expression in plants.

BACKGROUND OF THE INVENTION

[2] Modification of plants to alter and/or improve phenotypic characteristics (such as productivity or quality) requires the overexpression or down-regulation of endogenous genes or the expression of heterologous genes in plant tissues. Such genetic modification relies on the availability of a means to drive and to control gene expression as required. Indeed, genetic modification relies on the availability and use of suitable promoters which are effective in plants and which regulate gene expression so as to give the desired effect(s) in the transgenic plant.

[3] For numerous applications in plant biotechnology a tissue-specific or a tissue-preferential expression profile is advantageous, since beneficial effects of expression in one tissue may have disadvantages in others.

[4] Seed-preferential or seed-specific promoters are useful for expressing or down-regulating genes to modify seed metabolic pathways or to produce proteins of interest in seeds. Seed- specific promoters are well known in the art and include, for example, the promoters described in WO 2009/077478 and references therein. [5] However, to enable specific modification or optimization of metabolic pathways in the endosperm, it is beneficial to express or down-regulate genes specifically in the endosperm. This allows a more specific modulation of pathways, and reduces the likelihood for non-targeted, pleiotropic effects.

[6] Several endosperm-specific promoters have been identified from cereals, such as rice (Wu et al., 1998, Plant Cell Physiol 39: 885; Russell and Fromm, 1997, Transgenic Res 6:157; Hwang et al., 2002, Plant Cell Rep 9:842; US 2011/0093984, US 8,552,256), maize (US 7071378, US 2007/0169226, US 2009/0227013, WO 2005/042745, WO 2010/147825; WO 2012/159891), wheat (Lamacchia et al., 2001, J Exp Bot 52:243; Song et al., 2012, Z Naturforsch C 67:611 ; WO 2010/129999, WO

2010/118477), and barley (US 2007/0199106, WO 1998/008961). Endosperm-specific promoters have also been identified in the model plant Arabidopsis (WO2007/110600; Tiwari et al., 2006, Plant Biotech J. 4: 393). Knowledge on endosperm-specific promoters from exalbuminous crop plants or from oil- - - accumulating crop plants is limited.

[7] Embryo-specific promoters can be identified based on embryo-specific transcripts. Huang et al., 2009, BMC Genomics 10:256 have identified genes expressed during embryo development in Brassica napus. However, these studies did not compare expression of the genes in the embryo to expression in other plant tissues. Further, Huang et al did not identify the sequences of the promoters of the genes accumulating in the embryo.

[8] It is thus an objective of the present invention to provide Brassica promoters for endosperm- preferential expression of genes of interest in plants. This objective is solved by the present invention as herein further explained. SUMMARY OF THE INVENTION

[9] In one aspect, the invention provides an isolated nucleic acid comprising endosperm-preferential promoter activity selected from the group consisting of (a) a nucleic acid comprising a nucleotide sequence of any one of SEQ ID NO: 1 to SEQ ID NO: 20 or a functional fragment thereof; (b) a nucleic acid comprising a nucleotide sequence having at least 80% sequence identity to any one of SEQ ID NO: 1 to SEQ ID NO: 20 or a functional fragment thereof; and (c) a nucleic acid hybridizing under stringent conditions to the nucleotide sequence any one of SEQ ID NO: 1 to SEQ ID NO: 20 or a functional fragment thereof.

[10] A further embodiment provides a recombinant gene comprising the nucleic acid according to the invention operably linked to a heterologous nucleic acid sequence encoding an expression product of interest, and optionally a transcription termination and polyadenylation sequence, preferably a transcription termination and polyadenylation region functional in plant cells. In a further embodiment, said expression product of interest is an RNA capable of modulating the expression of a gene or is a protein.

[1 1] Yet another embodiment provides a host cell, such as an E. coli cell, an Agrobacterium cell, a yeast cell, or a plant cell, comprising the isolated nucleic acid according to the invention, or the recombinant gene according to the invention.

[12] In a further embodiment, a plant is provided comprising the recombinant gene according to the invention. In again a further embodiment, a plant is provided comprising at least two recombinant genes according to the invention, wherein the nucleic acid comprising endosperm-preferential promoter activity according to the invention and, optionally, the heterologous nucleic acid sequence operably linked thereto are different in each recombinant gene. Yet a further embodiment provides seeds obtainable from the plant according to the invention. In another embodiment, the plants or seeds according to the invention are seed crop plants or seed crop seeds.

[13] Yet another embodiment provides a method of producing a transgenic plant comprising the steps of (a) introducing or providing the recombinant gene according to the invention to a plant cell to create transgenic cells; and (b) regenerating transgenic plants from said transgenic cell.

[14] Further provided is a method of effecting endosperm-preferential expression of a nucleic acid comprising introducing the recombinant gene according to the invention into the genome of a plant, or providing the plant according to the invention. Also provided is a method for altering seed properties of a plant comprising introducing the recombinant gene according to the invention into the genome of a plant, or providing the plant according to the invention. In another embodiment, said plant is a seed crop plant.

[15] Also provided is the use of the isolated nucleic acid according to the invention to regulate expression of an operably linked nucleic acid in a plant, and the use of the isolated nucleic acid according to the invention, or the recombinant gene according to the invention to alter seed properties in a plant. In a further embodiment, said plant is a seed crop plant.

[ 16] Yet another embodiment provides a method of producing food, feed, or an industrial product comprising (a) obtaining the plant or a part thereof, according to the invention; and (b) preparing the food, feed or industrial product from the plant or part thereof. In another embodiment, said food or feed is oil, meal, grain, starch, flour or protein, or said industrial product is biofuel, fiber, industrial chemicals, a pharmaceutical or a nutraceutical.

BRIEF DESCRIPTION OF THE DRAWINGS

[17] Figure 1 : Relative expression levels (tpm; transcript per million) of different transcripts in different tissues. A: Endosperm_2; B: Endosperm_4; C: Endosperm_7; D: Endosperm_8; E:

Endosperm_9; F: Endosperm lO; G: Endosperm_12; H: Endosperm_13; I: Endosperm_14; J:

Endosperm_15; K: Endosperm_16; L: Endosperm_17; M: Endosperm_18; N: Endosperm_19; O: Endosperm_20; P: Endosperm_21 ; Q: Endosperm_22; R: Endosperm_23; S: Endosperm_24; T:

Endosperm_25. Different tissues: AM33: Apical meristem 33 days after sowing (DAS); BFB42: Big flower buds 42 DAS; CTYL10: Cotyledons 10 DAS; OF52: Open flowers 52 DAS; Pod2: Pods 14-20 DAS; Pod3: Pods 21-25 DAS; Ro2w: Roots 14 DAS; SFB42: Small flower buds 42 DAS; Seed2: Seeds 14-20 days after flowering (DAF); Seed3: Seeds 21-25 DAF; Seed4: Seeds 26-30 DAF; Seed5: Seeds 31-35 DAF; Seed6: Seeds 42 DAF; Seed7: Seeds 49 DAF; St2w: Stem 14 DAS; St5w: Stem 33 DAS; YL33: Young leaf 33 DAS. - -

[18] Figure 2: Relative expression levels (tpm; transcript per million) of different transcripts in different tissues of the embryo. A: Endosperm_2; B: Endosperm_4; C: Endosperm_7; D: Endosperm_8; E: Endosperm_9; F: Endosperm lO; G: Endosperm_12; H: Endosperm_13; I: Endosperm_14; J:

Endosperm_15; K: Endosperm_16; L: Endosperm_17; M: Endosperm_18; N: Endosperm_19; O:

Endosperm_20; P: Endosperm_21 ; Q: Endosperm_22; R: Endosperm_23; S: Endosperm_24; T:

Endosperm_25. Different tissues: 1 : Embryo, hypocotyl, 18 Days after flowering (DAF); 10: Embryo, hypocotyl, 24 DAF; 11 : Embryo, inner cotyledon, 24 DAF; 12: Embryo, endosperm, 24 DAF; 13: Embryo, hypocotyl, 28 DAF; 14: Embryo, inner cotyledon, 28 DAF; 15: Embryo, hypocotyl, 28 DAF; 16: Embryo, inner cotyledon, 28 DAF; 17: Embryo, hypocotyl, 32 DAF; 18: Embryo, inner cotyledon, 32 DAF; 19: Embryo, hypocotyl, 32 DAF; 2: Embryo, inner cotyledon, 18 DAF; 20: Embryo, inner cotyledon, 32 DAF; 21 : Embryo, hypocotyl, 46 DAF; 22: Embryo, inner cotyledon, 46 DAF; 23:

Embryo, hypocotyl, 46 DAF; 24: Embryo, inner cotyledon, 46 DAF; 25: Embryo, outer cotyledon, 18 DAF; 26: Embryo, outer cotyledon, 18 DAF; 27: Embryo, outer cotyledon, inner part, 24 DAF; 28: Embryo, outer cotyledon, outer part, 24 DAF; 29: Embryo, outer cotyledon, inner part, 24 DAF; 3: Embryo, endosperm, 18 DAF; 30: Embryo, outer cotyledon, outer part, 24 DAF; 31 : Embryo, outer cotyledon, inner part, 28 DAF; 32: Embryo, outer cotyledon, outer part, 28 DAF; 33: Embryo, outer cotyledon, inner part, 28 DAF; 34: Embryo, outer cotyledon, outer part, 28 DAF; 35: Embryo, outer cotyledon, inner part, 32 DAF; 36: Embryo, outer cotyledon, outer part, 32 DAF; 37: Embryo, outer cotyledon, inner part, 32 DAF; 38: Embryo, outer cotyledon, outer part, 32 DAF; 39: Embryo, outer cotyledon, inner part, 46 DAF; 4: Embryo, hypocotyl, 18 DAF; 40: Embryo, outer cotyledon, outer part, 46 DAF; 41 : Embryo, outer cotyledon, inner part, 46 DAF; 42: Embryo, outer cotyledon, outer part, 46 DAF; 5: Embryo, inner cotyledon, 18 DAF; 6: Embryo, endosperm, 18 DAF; 8: Embryo, inner cotyledon, 24 DAF.

DETAILED DESCRIPTION [19] The present invention is based on the observation that SEQ ID NO: 1 to SEQ ID NO: 20 contain endosperm-preferential promoter activity.

[20] In one aspect, the invention provides an isolated nucleic acid comprising endosperm-preferential promoter activity selected from the group consisting of (a) a nucleic acid comprising a nucleotide sequence of any one of SEQ ID NO: 1 to SEQ ID NO: 20 or a functional fragment thereof; (b) a nucleic acid comprising a nucleotide sequence having at least 80% sequence identity to any one of SEQ ID NO: 1 to SEQ ID NO: 20 or a functional fragment thereof; and (c) a nucleic acid capable of hybridizing under stringent conditions to the nucleotide sequence of any one of SEQ ID NO: 1 to SEQ ID NO: 20 or a functional fragment thereof. [21] SEQ ID NOs: 1 to 20 depict the region upstream (i.e. located 5' upstream of) from the first ATG start codon of the Endosperm_2, Endosperm_4, Endosperm-7-10, and Endosperm 12-25 transcripts, respectively. Such a promoter region may be at least about 300 bp, at least about 500 bp, at least about 800 bp, at least about 1000 bp, at least about 1500 bp, at least about 2000 bp, at least about 2500 bp, or at least about 3000 bp upstream of the first start codon of the Endosperm_2, Endosperm_4, Endosperm- 7-10, or Endosperm 12-25 transcripts.

[22] The nucleic acid comprising the endosperm-preferential promoter activity according to the invention may also be comprised in a larger DNA molecule.

[23] "Endosperm-preferential promoter activity" in the context of this invention means the promoter activity is at least 6 times, or at least 10 times, or at least 20 times or even at least 100 times higher in endosperm than in any of the other tissues. In other words, in endosperm-preferential promoter activity, transcription of the nucleic acid operably linked to the promoter of the invention in the endosperm is at least 6 times, or at least 10 times, or at least 20 times or even at least 100 times higher than in any of the other tissues. In other words, the endosperm-preferential promoter drives endosperm-preferential expression of the nucleic acid operably linked to the endosperm-preferential promoter.

[24] The phrase "operably linked" refers to the functional spatial arrangement of two or more nucleic acid regions or nucleic acid sequences. For example, a promoter region may be positioned relative to a nucleic acid sequence such that transcription of a nucleic acid sequence is directed by the promoter region. Thus, a promoter region is "operably linked" to the nucleic acid sequence. "Functionally linked" is an equivalent term.

[25] The phrases "DNA", "DNA sequence", "nucleic acid sequence," "nucleic acid molecule" "nucleotide sequence" and "nucleic acid" refer to a physical structure comprising an orderly arrangement of nucleotides. The DNA sequence or nucleotide sequence may be contained within a larger nucleotide molecule, vector, or the like. In addition, the orderly arrangement of nucleic acids in these sequences may be depicted in the form of a sequence listing, figure, table, electronic medium, or the like.

[26] As used herein, "promoter" means a region of DNA sequence that is essential for the initiation of transcription of DNA, resulting in the generation of an RNA molecule that is complementary to the transcribed DNA; this region may also be referred to as a "5' regulatory region." Promoters are usually located upstream of the coding sequence to be transcribed and have regions that act as binding sites for RNA polymerase II and other proteins such as transcription factors (trans-acting protein factors that regulate transcription) to initiate transcription of an operably linked gene. Promoters may themselves contain sub-elements (i.e. promoter motifs) such as cis-elements or enhancer domains that regulate the transcription of operably linked genes. The promoters of this invention may be altered to contain "enhancer DNA" to assist in elevating gene expression. As is known in the art, certain DNA elements can be used to enhance the transcription of DNA. These enhancers often are found 5' to the start of transcription in a promoter that functions in eukaryotic cells, but can often be inserted upstream (5') or downstream (3') to the coding sequence. In some instances, these 5' enhancer DNA elements are introns. Among the introns that are useful as enhancer DNA are the 5' introns from the rice actin 1 gene (see US5641876), the rice actin 2 gene, the maize alcohol dehydrogenase gene, the maize heat shock protein 70 gene (see US5593874), the maize shrunken 1 gene, the light sensitive 1 gene of Solarium tuberosum, the Arabidopsis histon 4 intron and the heat shock protein 70 gene of Petunia hybrida (see US5659122). Thus, as contemplated herein, a promoter or promoter region includes variations of promoters derived by inserting or deleting regulatory regions, subjecting the promoter to random or site-directed mutagenesis, etc. The activity or strength of a promoter may be measured in terms of the amounts of RNA it produces, or the amount of protein accumulation in a cell or tissue, relative to a promoter whose transcriptional activity has been previously assessed.

[27] A promoter as used herein may thus include sequences downstream of the transcription start, such as sequences coding the 5' untranslated region (5' UTR) of the RNA, introns located downstream of the transcription start, or even sequences encoding the protein.

[28] Promoter activity for a functional promoter fragment in the endosperm may be determined by those skilled in the art, for example using analysis of RNA accumulation produced from the nucleic acid which is operably linked to the promoter as described herein, whereby the nucleic acid which is operably linked to the promoter can be the nucleic acid which is naturally linked to the promoter, i.e. the endogenous gene of which expression is driven by the promoter.

[29] The RNA accumulation, or levels of RNA, such as mRNA, can be measured either at a single time point or at multiple time points and as such the fold increase can be average fold increase or an extrapolated value derived from experimentally measured values. As it is a comparison of levels, any method that measures mRNA levels can be used. In a preferred aspect, the tissue or organs compared are the endosperm with other tissues of the organism. In another preferred aspect, multiple tissues or organs are compared. A preferred multiple comparison is endosperm compared with 2, 3, 4, or more tissues or organs selected from the group consisting of apical meristem, flower buds, cotyledons, flowers, pods, roots, seeds, leaves and different stages of the embryo, such as the tissues described herein in the Examples.

[30] The endosperm-preferential expression capacity of the identified or generated fragments of the promoters of the invention can be conveniently tested by determining levels of the transcript of which expression is naturally driven by the promoter of the invention, i.e. endogenous transcript levels, such as, for example, using the methods as described herein in the Examples. Further, endosperm-preferential expression capacity of the identified or generated fragments of the promoters of the invention can be conveniently tested by operably linking such DNA molecules to a nucleotide sequence encoding an easy scorable marker, e.g. a beta-glucuronidase gene, introducing such a chimeric gene into a plant and analyzing the expression pattern of the marker in the endosperm as compared with the expression pattern of the marker in other parts of the plant. Other candidates for a marker (or a reporter gene) are chloramphenicol acetyl transferase (CAT) and proteins with fluorescent properties, such as green fluorescent protein (GFP) from Aequora victoria. To define a minimal promoter region, a DNA segment representing the promoter region is removed from the 5' region of the gene of interest and operably linked to the coding sequence of a marker (reporter) gene by recombinant DNA techniques well known to the art. The reporter gene is operably linked downstream of the promoter, so that transcripts initiating at the promoter proceed through the reporter gene. Reporter genes generally encode proteins, which are easily measured, including, but not limited to, chloramphenicol acetyl transferase (CAT), beta- glucuronidase (GUS), green fluorescent protein (GFP), beta-galactosidase (beta-GAL), and luciferase. The expression cassette containing the reporter gene under the control of the promoter can be introduced into an appropriate cell type by transfection techniques well known to the art. To assay for the reporter protein, cell lysates are prepared and appropriate assays, which are well known in the art, for the reporter protein are performed. For example, if CAT were the reporter gene of choice, the lysates from cells transfected with constructs containing CAT under the control of a promoter under study are mixed with isotopically labeled chloramphenicol and acetyl-coenzyme A (acetyl-CoA). The CAT enzyme transfers the acetyl group from acetyl-CoA to the 2- or 3 -position of chloramphenicol. The reaction is monitored by thin-layer chromatography, which separates acetylated chloramphenicol from unreacted material. The reaction products are then visualized by autoradiography. The level of enzyme activity corresponds to the amount of enzyme that was made, which in turn reveals the level of expression and the endosperm- preferential functionality from the promoter or promoter fragment of interest. This level of expression can also be compared to other promoters to determine the relative strength of the promoter under study. Once activity and functionality is confirmed, additional mutational and/or deletion analyses may be employed to determine the minimal region and/or sequences required to initiate transcription. Thus, sequences can be deleted at the 5' end of the promoter region and/or at the 3' end of the promoter region, and nucleotide substitutions introduced. These constructs are then again introduced in cells and their activity and/or functionality determined. [31] The activity or strength of a promoter may be measured in terms of the amount of mRNA or protein accumulation it specifically produces, relative to the total amount of mRNA or protein.

Alternatively, the activity or strength of a promoter may be expressed relative to a well-characterized promoter (for which transcriptional activity was previously assessed).

[32] It will herein further be clear that equivalent endosperm-preferential promoters can be isolated from other plants. To this end, orthologous promoter fragments may be isolated from other plants using any one of SEQ ID NO: 1 to SEQ ID NO: 20 or a functional fragment having at least 300 consecutive nucleotides thereof as a probe and identifying nucleotide sequences from these other plants which hybridize under the herein described hybridization conditions. By way of example, a promoter of the invention may be used to screen a genomic library of a crop or plant of interest to isolate corresponding promoter sequences according to techniques well known in the art. Thus, a promoter sequence of the invention may be used as a probe for hybridization with a genomic library under medium to high stringency conditions. As an alternative equivalent promoters can be isolated using the coding sequences of the genes driven by the promoters of any one of SEQ ID NO: 1 to SEQ ID NO: 20 to screen a genomic library (e.g. by hybridization or in silico) of a crop of interest. When sufficient identity between the coding sequences is obtained (as a rule higher than 85% identity) then promoter regions can be isolated upstream of the orthologous genes.

[33] Hybridization occurs when the two nucleic acid molecules anneal to one another under appropriate conditions. Nucleic acid hybridization is a technique well known to those of skill in the art of DNA manipulation. The hybridization property of a given pair of nucleic acids is an indication of their similarity or identity. Another indication that two nucleic acid sequences are substantially identical is that the two molecules hybridize to each other under stringent conditions. The phrase "hybridizing specifically to" refers to the binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. "Bind(s) substantially" refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the target nucleic acid sequence. "Stringent hybridization conditions" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments such as Southern and Northern hybridization are sequence dependent, and are different under different environmental parameters. An example of highly stringent wash conditions is 0.15 M NaCI at 72°C for about 15 minutes. An example of stringent wash conditions is a 0.2 X SSC wash at 65°C for 15 minutes. Often, a high stringency wash is preceded by a low stringency wash to remove background probe signal. An example medium stringency wash for a duplex of, e.g., more than 100 nucleotides, is 1 X SSC at 45°C for 15 minutes. An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 4 to 6 X SSC at 40°C for 15 minutes. For short probes (e.g., about 10 to 50 nucleotides), stringent conditions typically involve salt concentrations of less than about 1.5 M, more preferably about 0.01 to 1.0 M, Na ion concentration (or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30°C and at least about 60°C for long probes (e.g., >50 nucleotides). Stringent conditions may also be achieved with the addition of destabilizing agents such as formamide. In general, a signal to noise ratio of 2 X (or higher) than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybridization. Nucleic acids that do not hybridize to each other under stringent conditions are still substantially identical if the proteins that they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon degeneracy permitted by the genetic code. Very stringent conditions are selected to be equal to the T m for a particular probe. An example of stringent conditions for hybridization of complementary nucleic acids which have more than 100 complementary residues on a filter in a Southern or Northern blot is 50% formamide, e.g., hybridization in 50% formamide, 1 M NaCI, 1% SDS at 37°C, and a wash in 0.1 x SSC at 60 to 65°C. Exemplary low stringency conditions include hybridization with a buffer solution of 30 to 35% formamide, 1 M NaCI, 1% SDS (sodium dodecyl sulphate) at 37°C, and a wash in 1 X to 2 X SSC (20 X SSC=3.0 M NaCI/0.3 M trisodium citrate) at 50 to 55°C. Exemplary moderate stringency conditions include hybridization in 40 to 45% formamide, 1.0 M NaCI, 1% SDS at 37°C, and a wash in 0.5 X to 1 X SSC at 55 to 60°C. The following are examples of sets of hybridization/wash conditions that may be used to clone orthologous nucleotide sequences that are substantially identical to reference nucleotide sequences of the present invention: a reference nucleotide sequence preferably hybridizes to the reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C with washing in 2 X SSC, 0. 1% SDS at 50°C, more desirably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C with washing in 1 X SSC, 0.1% SDS at 50°C, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C with washing in 0.5 X SSC, 0. 1% SDS at 50°C, even more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C with washing in 0.1 X SSC, 0.1% SDS at 50°C.

[34] Suitable to the invention are nucleic acids comprising endosperm-preferential promoter activity which comprise a nucleotide sequence having at least 40%, at least 50%, or at least 60%, or at least 70%, or at least 80%, or at least 85%, or at least 90%, or at least 95%, or at least 98% sequence identity to the herein described promoters and promoter regions or functional fragments thereof and are also referred to as variants. The term "variant" with respect to the transcription regulating nucleotide sequences of any one of SEQ ID NO: 1 to SEQ ID NO: 20 of the invention is intended to mean substantially similar sequences. Naturally occurring allelic variants such as these can be identified with the use of well-known molecular biology techniques, as, for example, with polymerase chain reaction

(PCR) and hybridization techniques as herein outlined before. Variant nucleotide sequences also include synthetically derived nucleotide sequences, such as those generated, for example, by using site-directed mutagenesis of any one of SEQ ID NO: 1 to SEQ ID NO: 20. Generally, nucleotide sequence variants of the invention will have at least 40%, 50%, 60%, to 70%, e.g., preferably 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, to 79%, generally at least 80%, e.g., 81% to 84%, at least 85%, e.g., 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, to 98% and 99% nucleotide sequence identity to the native (wild type or endogenous) nucleotide sequence or a functional fragment thereof. Derivatives of the DNA molecules disclosed herein may include, but are not limited to, deletions of sequence, single or multiple point mutations, alterations at a particular restriction enzyme site, addition of functional elements, or other means of molecular modification which may enhance, or otherwise alter promoter expression. Techniques for obtaining such derivatives are well-known in the art (see, for example, J. F. Sambrook, D. W. Russell, and N. Irwin (2000) Molecular Cloning: A Laboratory Manual, 3 rd edition Volumes 1, 2, and 3. Cold Spring Harbor Laboratory Press). For example, one of ordinary skill in the art may delimit the functional elements within the promoters disclosed herein and delete any non-essential elements. Functional elements may be modified or combined to increase the utility or expression of the sequences of the invention for any particular application. Those of skill in the art are familiar with the standard resource materials that describe specific conditions and procedures for the construction, manipulation, and isolation of macromolecules (e.g., DNA molecules, plasmids, etc.), as well as the generation of recombinant organisms and the screening and isolation of DNA molecules. As used herein, the term "percent sequence identity" refers to the percentage of identical nucleotides between two segments of a window of optimally aligned DNA. Optimal alignment of sequences for aligning a comparison window are well-known to those skilled in the art and may be conducted by tools such as the local homology algorithm of Smith and Waterman (Waterman, M. S. Introduction to Computational Biology: Maps, sequences and genomes. Chapman & Hall. London (1995), the homology alignment algorithm of Needleman and Wunsch (J. Mol. Biol., 48:443-453 (1970), the search for similarity method of Pearson and Lipman (Proc. Natl. Acad. Sci., 85:2444 (1988), and preferably by computerized implementations of these algorithms such as GAP, BESTFIT, FASTA, and TFASTA available as part of the GCG (Registered Trade Mark), Wisconsin Package (Registered Trade Mark from Accelrys Inc., San Diego, Calif). An "identity fraction" for aligned segments of a test sequence and a reference sequence is the number of identical components that are shared by the two aligned sequences divided by the total number of components in the reference sequence segment, i.e., the entire reference sequence or a smaller defined part of the reference sequence. Percent sequence identity is represented as the identity fraction times 100. The comparison of one or more DNA sequences may be to a full-length DNA sequence or a portion thereof, or to a longer DNA sequence.

[35] A nucleic acid comprising a nucleotide sequence having at least 80% sequence identity to any one of SEQ ID NO: 1 to SEQ ID NO: 20 can thus be a nucleic acid comprising a nucleotide sequence having at least 80%, or at least 85%, or at least 90%, or at least 95%, or at least 98%, or 100% sequence identity to any one of SEQ ID NO: 1 to SEQ ID NO: 20.

[36] A nucleic acid capable of hybridizing under stringent conditions to the nucleotide sequence of any one of SEQ ID NO: 1 to SEQ ID NO: 20 is a nucleic acid which is capable of hybridizing to the nucleotide sequence of any one of SEQ ID NO: 1 to SEQ ID NO: 20 under conditions with about 5°C lower than the thermal melting point (T m ) for the specific sequences at a defined ionic strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50%> of the target sequence hybridizes to a perfectly matched probe. Typically stringent conditions will be chosen in which the salt concentration is about 0.02 molar at pH 7 and the temperature is at least 60°C. Lowering the salt concentration and/or increasing the temperature increases stringency. Stringent conditions for RNA- DNA hybridizations (Northern blots using a probe of e.g. lOOnt) are for example those which include at least one wash in 0.2X SSC at 63°C for 20min, or equivalent conditions. [37] Said nucleic acid capable of hybridizing under stringent conditions to the nucleotide sequence of any one of SEQ ID NO: 1 to SEQ ID NO: 20 can also be a nucleic acid capable of hybridizing to said nucleotide sequence under high stringency conditions. "High stringency conditions" can be provided, for example, by hybridization at 65°C in an aqueous solution containing 6x SSC (20x SSC contains 3.0 M NaCl, 0.3 M Na-citrate, pH 7.0), 5x Denhardt's (100X Denhardt's contains 2% Ficoll, 2% Polyvinyl pyrollidone, 2% Bovine Serum Albumin), 0.5% sodium dodecyl sulphate (SDS), and 20 μg/ml denaturated carrier DNA (single-stranded fish sperm DNA, with an average length of 120 - 3000 nucleotides) as non-specific competitor. Following hybridization, high stringency washing may be done in several steps, with a final wash (about 30 min) at the hybridization temperature in 0.2-0. l x SSC, 0.1% SDS.

[38] A "functional fragment" of a nucleic acid comprising endosperm-preferential promoter denotes a nucleic acid comprising a stretch of the nucleic acid sequences of any one of SEQ ID NO: 1 to SEQ ID NO: 20 or of the nucleic acid having at least 80%> sequence identity to any one of SEQ ID NO: 1 to SEQ ID NO: 20 which still exerts the desired function, i.e. which has endosperm-preferential promoter activity. Assays for determining endosperm-preferential promoter activity are provided herein.

Preferably, the functional fragment of the endosperm-preferential promoter contains the conserved promoter motifs, such as, for example, conserved promoter motifs as described in DoOP

(http://doop.abc.liu, databases of Orthologous Promoters, Barta E. el al (2005) Nucleic Acids Research Vol. 33, D86-D90). A functional fragment may be a fragment of at least about 300 bp, at least about 500 bp, at least about 800 bp, at least about 1000 bp, at least about 1500 bp, at least about 2000 bp, at least about 2500 bp, or at least about 3000 bp.

[39] "Isolated nucleic acid", used interchangeably with "isolated DNA" as used herein refers to a nucleic acid not occurring in its natural genomic context, irrespective of its length and sequence.

Isolated DNA can, for example, refer to DNA which is physically separated from the genomic context, such as a fragment of genomic DNA. Isolated DNA can also be an artificially produced DNA, such as a chemically synthesized DNA, or such as DNA produced via amplification reactions, such as polymerase chain reaction (PCR) well-known in the art. Isolated DNA can further refer to DNA present in a context of DNA in which it does not occur naturally. For example, isolated DNA can refer to a piece of DNA present in a plasmid. Further, the isolated DNA can refer to a piece of DNA present in another chromosomal context than the context in which it occurs naturally, such as for example at another position in the genome than the natural position, in the genome of another species than the species in which it occurs naturally, or in an artificial chromosome.

[40] A further embodiment provides a recombinant gene comprising the nucleic acid according to the invention operably linked to a heterologous nucleic acid sequence encoding an expression product of interest, and optionally a transcription termination and polyadenylation sequence, preferably a transcription termination and polyadenylation region functional in plant cells. In a further embodiment, said expression product of interest is an RNA capable of modulating the expression of a gene or is a protein.

[41] The term "expression product" refers to a product of transcription. Said expression product can be the transcribed RNA. It is understood that the RNA which is produced is a biologically active RNA. Said expression product can also be a peptide, a polypeptide, or a protein, when said biologically active RNA is an mRNA and said protein is produced by translation of said mRNA.

[42] Alternatively, the heterologous nucleic acid, operably linked to the promoters of the invention, may also code for an RNA capable of modulating the expression of a gene. Said RNA capable of modulating the expression of a gene can be an RNA which reduces expression of a gene. Said RNA can reduce the expression of a gene for example through the mechanism of RNA-mediated gene silencing.

[43] Said RNA capable of modulating the expression of a gene can be a silencing RNA

downregulating expression of a target gene. As used herein, "silencing RNA" or "silencing RNA molecule" refers to any RNA molecule, which upon introduction into a plant cell, reduces the expression of a target gene. Such silencing RNA may e.g. be so-called "antisense RNA", whereby the RNA molecule comprises a sequence of at least 20 consecutive nucleotides having 95% sequence identity to the complement of the sequence of the target nucleic acid, preferably the coding sequence of the target gene. However, antisense RNA may also be directed to regulatory sequences of target genes, including the promoter sequences and transcription termination and polyadenylation signals. Silencing RNA further includes so-called "sense RNA" whereby the RNA molecule comprises a sequence of at least 20 consecutive nucleotides having 95% sequence identity to the sequence of the target nucleic acid. Other silencing RNA may be "unpolyadenylated RNA" comprising at least 20 consecutive nucleotides having 95% sequence identity to the complement of the sequence of the target nucleic acid, such as described in WO01/12824 or US6423885 (both documents herein incorporated by reference). Yet another type of silencing RNA is an RNA molecule as described in WO03/076619 (herein incorporated by reference) comprising at least 20 consecutive nucleotides having 95% sequence identity to the sequence of the target nucleic acid or the complement thereof, and further comprising a largely-double stranded region as described in WO03/076619 (including largely double stranded regions comprising a nuclear localization signal from a viroid of the Potato spindle tuber viroid-type or comprising CUG trinucleotide repeats). Silencing RNA may also be double stranded RNA comprising a sense and antisense strand as herein defined, wherein the sense and antisense strand are capable of base-pairing with each other to form a double stranded RNA region (preferably the said at least 20 consecutive nucleotides of the sense and antisense RNA are complementary to each other). The sense and antisense region may also be present within one RNA molecule such that a hairpin RNA (hpRNA) can be formed when the sense and antisense region form a double stranded RNA region. hpRNA is well-known within the art (see e.g WO99/53050, herein incorporated by reference). The hpRNA may be classified as long hpRNA, having long, sense and antisense regions which can be largely complementary, but need not be entirely complementary (typically larger than about 200 bp, ranging between 200-1000 bp). hpRNA can also be rather small ranging in size from about 30 to about 42 bp, but not much longer than 94 bp (see

WO04/073390, herein incorporated by reference). Silencing RNA may also be artificial micro-RNA molecules as described e.g. in WO2005/052170, WO2005/047505 or US 2005/0144667, or ta-siRNAs as described in WO2006/074400 (all documents incorporated herein by reference). Said RNA capable of modulating the expression of a gene can also be an RNA ribozyme.

[44] Said RNA capable of modulating the expression of a gene can modulate, preferably

downregulate, the expression of other genes (i.e. target genes) comprised within the endosperm or even of genes present within a pathogen or pest such as a virus, fungus, insect, nematode, bacteria. An example of pest control using gene silencing is described, for example, in WO2007/080127.

[45] The nucleic acid sequence heterologous to the promoters according to the invention may generally be any nucleic acid sequence for which an increased level or altered level (e.g. in a different organ) or reduced level of transcription is desired. The nucleic acid sequence can for example encode a protein of interest. Exemplary proteins of interest, or expression products of interest, are e.g.

polypeptides that can provide an agriculturally or industrially important feature in the endosperm.

[46] Suitable heterologous nucleic acid sequences include nucleic acids encoding for, or nucleic acids encoding an RNA molecule capable of modulating the expression of proteins such as: seed storage proteins, fatty acid pathway or synthesis enzymes, epoxidases, hydroxylases, cytochrome P450 mono- oxygenases, desaturases, tocopherol biosynthetic enzymes, carotenoid biosynthesis enzymes, amino acid biosynthetic enzymes, steroid pathway enzymes, starch branching enzymes, proteins involved in starch synthesis, glycolysis, carbon metabolism, oxidative pentose phosphate cycle, protein synthesis, organelle organization and biogenesis, DNA metabolism, DNA replication, cell cycle, cell organization and biogenesis, chromosome organization and biogenesis, microtubule-based processes, microtubule- based movement, cytoskeleton-dependent intracellular transport, cytoskeleton organization and biogenesis, chromatin assembly or disassembly, DNA-dependent DNA replication, chromosome organization and biogenesis, DNA packaging, establishment and/or maintenance of chromatin architecture, regulation of progression through the cell cycle, regulation of the cell cycle, nucleobase, nucleoside, nucleotide and nucleic acid metabolism, chromatin assembly, macromolecule biosynthesis, intracellular transport, establishment of cellular localization, cellular localization, nucleosome assembly, macromolecule metabolism, or M-phase; or of proteins involved in secondary metabolism. Also suitable heterologous nucleic acid sequences are sequences encoding a heterologous protein for

biomanufacturing of said heterologous protein in plant seeds, such as antibodies, pharmaceutical proteins, or vaccines and the like. [47] A "transcription termination and polyadenylation region" as used herein is a sequence that drives the cleavage of the nascent RNA, whereafter a poly(A) tail is added at the resulting RNA 3 ' end, functional in plant cells. Transcription termination and polyadenylation signals functional in plant cells include, but are not limited to, 3'nos, 3'35S, 3'his and 3'g7. [48] The term "protein" interchangeably used with the term "polypeptide" as used herein describes a group of molecules consisting of more than 30 amino acids, whereas the term "peptide" describes molecules consisting of up to 30 amino acids. Proteins and peptides may further form dimers, trimers and higher oligomers, i.e. consisting of more than one (poly)peptide molecule. Protein or peptide molecules forming such dimers, trimers etc. may be identical or non-identical. The corresponding higher order structures are, consequently, termed homo- or heterodimers, homo- or heterotrimers etc. The terms "protein" and "peptide" also refer to naturally modified proteins or peptides wherein the modification is effected e.g. by glycosylation, acetylation, phosphorylation and the like. Such modifications are well known in the art.

[49] The term "heterologous" refers to the relationship between two or more nucleic acid or protein sequences that are derived from different sources. For example, a promoter is heterologous with respect to an operably linked DNA region, such as a coding sequence if such a combination is not normally found in nature. In addition, a particular sequence may be "heterologous" with respect to a cell or organism into which it is inserted (i.e. does not naturally occur in that particular cell or organism).

[50] The term "recombinant gene" refers to any gene that contains: a) DNA sequences, including regulatory and coding sequences that are not found together in nature, or b) sequences encoding parts of proteins not naturally adjoined, or c) parts of promoters that are not naturally adjoined. Accordingly, a recombinant gene may comprise regulatory sequences and coding sequences that are derived from different sources, or comprise regulatory sequences, and coding sequences derived from the same source, but arranged in a manner different from that found in nature. [51] Any of the promoters and heterologous nucleic acid sequences described above may be provided in a recombinant vector. A recombinant vector typically comprises, in a 5' to 3' orientation: a promoter to direct the transcription of a nucleic acid sequence and a nucleic acid sequence. The recombinant vector may further comprise a 3' transcriptional terminator, a 3' polyadenylation signal, other untranslated nucleic acid sequences, transit and targeting nucleic acid sequences, selectable markers, enhancers, and operators, as desired. The wording "5' UTR" refers to the untranslated region of DNA upstream, or 5' of the coding region of a gene and "3' UTR" refers to the untranslated region of DNA downstream, or 3' of the coding region of a gene. Means for preparing recombinant vectors are well known in the art. Methods for making recombinant vectors particularly suited to plant

transformation are described in US4971908, US4940835, US4769061 and US4757011. Typical vectors useful for expression of nucleic acids in higher plants are well known in the art and include vectors 5 derived from the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens. One or more additional promoters may also be provided in the recombinant vector. These promoters may be operably linked, for example, without limitation, to any of the nucleic acid sequences described above. Alternatively, the promoters may be operably linked to other nucleic acid sequences, such as those encoding transit peptides, selectable marker proteins, or antisense sequences. These additional promoters may be selected on the basis of the cell type into which the vector will be inserted. Also, promoters which function in bacteria, yeast, and plants are all well taught in the art. The additional promoters may also be selected on the basis of their regulatory features. Examples of such features include enhancement of transcriptional activity, inducibility, tissue specificity, and developmental stage-specificity. [52] The recombinant vector may also contain one or more additional nucleic acid sequences. These additional nucleic acid sequences may generally be any sequences suitable for use in a recombinant vector. Such nucleic acid sequences include, without limitation, any of the nucleic acid sequences, and modified forms thereof, described above. The additional structural nucleic acid sequences may also be operably linked to any of the above described promoters. The one or more structural nucleic acid sequences may each be operably linked to separate promoters. Alternatively, the structural nucleic acid sequences may be operably linked to a single promoter (i.e., a single operon).

[53] Yet another embodiment provides a host cell, such as an E. coli cell, an Agrobacterium cell, a yeast cell, or a plant cell, comprising the isolated nucleic acid according to the invention, or the recombinant gene according to the invention. [54] Other nucleic acid sequences may also be introduced into the host cell along with the promoter and structural nucleic acid sequence, e. g. also in connection with the vector of the invention. These other sequences may include 3' transcriptional terminators, 3' polyadenylation signals, other untranslated nucleic acid sequences, transit or targeting sequences, selectable markers, enhancers, and operators. Preferred nucleic acid sequences of the present invention, including recombinant vectors, structural nucleic acid sequences, promoters, and other regulatory elements, are described above.

[55] In a further embodiment, a plant is provided comprising the recombinant gene according to the invention. In again a further embodiment, a plant is provided comprising at least two recombinant genes according to the invention, wherein the nucleic acid comprising endosperm-preferential promoter activity according to the invention and, optionally, the heterologous nucleic acid sequence operably linked thereto, are different in each recombinant gene. Yet a further embodiment provides seeds obtainable from the plant according to the invention. In another embodiment, the plants or seeds according to the invention are seed crop plants or seeds.

[56] The plant cell or plant comprising the recombinant gene according to the invention can be a plant cell or a plant comprising a recombinant gene of which either the promoter, or the heterologous nucleic acid sequence operably linked to said promoter, are heterologous with respect to the plant cell. Such plant cells or plants may be transgenic plant in which the recombinant gene is introduced via transformation. Alternatively, the plant cell of plant may comprise the promoter according to the invention derived from the same species operably linked to a nucleic acid which is also derived from the same species, i.e. neither the promoter nor the operably linked nucleic acid is heterologous with respect to the plant cell, but the promoter is operably linked to a nucleic acid to which it is not linked in nature. A recombinant gene can be introduced in the plant or plant cell via transformation, such that both the promoter and the operably linked nucleotide are at a position in the genome in which they do not occur naturally. Alternatively, the promoter according to the invention can be integrated in a targeted manner in the genome of the plant or plant cell upstream of an endogenous nucleic acid encoding an expression product of interest, i.e. to modulate the expression pattern of an endogenous gene. The promoter that is integrated in a targeted manner upstream of an endogenous nucleic acid can be integrated in cells of a plant species from which it is originally derived, or in cells of a heterologous plant species.

Alternatively, a heterologous nucleic acid can be integrated in a targeted manner in the genome of the plant or plant cell downstream of the promoter according to the invention, such that said heterologous nucleic acid is expressed endosperm-preferentially. Said heterologous nucleic acid is a nucleic acid which is heterologous with respect to the promoter, i.e. the combination of the promoter with said heterologous nucleic acid is not normally found in nature. Said heterologous nucleic acid may be a nucleic acid which is heterologous to said plant species in which it is inserted, but it may also naturally occur in said plant species at a different location in the plant genome. Said promoter or said

heterologous nucleic acid can be integrated in a targeted manner in the plant genome via targeted sequence insertion, using, for example, the methods as described in WO2005/049842.

[57] Plants comprising at least two recombinant genes according to the invention wherein the nucleic acid comprising endosperm-preferential promoter activity is different in each recombinant gene are, for example, plants comprising a first recombinant gene comprising a nucleotide sequence having at least 80% sequence identity to SEQ ID NO: 1 or a functional fragment thereof, and a second recombinant gene comprising a nucleotide sequence having at least 80%> sequence identity to any one of SEQ ID NO: 2 to SEQ ID NO: 20 or a functional fragment thereof. It will be clear that, when the first recombinant gene comprises a nucleotide sequence having at least 80%> sequence identity to SEQ ID NO: x or a functional fragment thereof, wherein SEQ ID NO: x is selected from any one of SEQ ID NO: 1 to SEQ ID NO: 20, the second recombinant gene may comprise a nucleotide sequence having at least 80%> sequence identity to any one of the sequences according to the invention or a functional fragment thereof, except to SEQ ID NO: x. Said plants are suitable to express different genes with the same tissue-specificity, however without the negative features associated with the repeated use of one promoter, such as gene silencing or recombination of a vector comprising the recombinant genes. The at least two recombinant genes according to the invention may be present at one locus in the genome of said plant, and may be derived from the same transforming DNA molecule. ?

[58] Plants according to the invention may comprise one or more recombinant genes according to the invention, but may in addition contain a recombinant gene comprising a nucleic acid comprising promoter activity which is preferential or specific to other plant tissues, such as apical meristem, flower buds, cotyledons, flowers, pods, roots, seeds, leaves and different stages of the embryo, operably linked to a nucleic acid sequence encoding an expression product of interest. The recombinant gene according to the invention and the recombinant gene comprising a nucleic acid comprising another promoter activity may be present at one locus and may be derived from the same transforming DNA molecule.

[59] Yet another embodiment provides a method of producing a transgenic plant comprising the steps of (a) introducing or providing the recombinant gene according to the invention to a plant cell to create transgenic cells; and (b) regenerating transgenic plants from said transgenic cell.

[60] "Introducing" in connection with the present application relates to the placing of genetic information in a plant cell or plant by artificial means. This can be effected by any method known in the art for introducing RNA or DNA into plant cells, protoplasts, calli, roots, tubers, seeds, stems, leaves, seedlings, embryos, pollen and microspores, other plant tissues, or whole plants. More particularly, "introducing" means stably integrating into the plant's genome. Introducing the recombinant gene can be performed by transformation.

[61] The term "transformation" herein refers to the introduction (or transfer) of nucleic acid into a recipient host such as a plant or any plant parts or tissues including plant cells, protoplasts, calli, roots, tubers, seeds, stems, leaves, seedlings, embryos and pollen. Plants containing the transformed nucleic acid sequence are referred to as "transgenic plants". Transformed, transgenic and recombinant refer to a host organism such as a plant into which a heterologous nucleic acid molecule (e.g. an expression cassette or a recombinant vector) has been introduced. The nucleic acid can be stably integrated into the genome of the plant.

[62] As used herein, the phrase "transgenic plant" refers to a plant having an introduced nucleic acid stably introduced into a genome of the plant, for example, the nuclear or plastid genomes. In other words, plants containing transformed nucleic acid sequence are referred to as "transgenic plants".

Transgenic and recombinant refer to a host organism such as a plant into which a heterologous nucleic acid molecule (e.g. the promoter, the chimeric gene or the vector as described herein) has been introduced. The nucleic acid can be stably integrated into the genome of the plant. [63] Transformation methods are well known in the art and include Agrobacterium-rnQdiatsd transformation. Agrobacterium-mediated transformation of cotton has been described e.g. in US patent 5,004,863, in US patent 6,483,013 and WO2000/71733. Plants may also be transformed by particle bombardment: Particles of gold or tungsten are coated with DNA and then shot into young plant cells or plant embryos. This method also allows transformation of plant plastids. Viral transformation (transduction) may be used for transient or stable expression of a gene, depending on the nature of the virus genome. The desired genetic material is packaged into a suitable plant virus and the modified virus is allowed to infect the plant. The progeny of the infected plants is virus free and also free of the inserted gene. Suitable methods for viral transformation are described or further detailed e. g. in WO 90/12107, WO 03/052108 or WO 2005/098004. Further suitable methods well-known in the art are microinjection, electroporation of intact cells, polyethyleneglycol-mediated protoplast transformation, electroporation of protoplasts, liposome-mediated transformation, silicon- whiskers mediated transformation etc. Said transgene may be stably integrated into the genome of said plant cell, resulting in a transformed plant cell. The transformed plant cells obtained in this way may then be regenerated into mature fertile transformed plants.

[64] Further provided is a method of effecting endosperm-preferential expression of a nucleic acid comprising introducing the recombinant gene according to the invention into the genome of a plant, or providing the plant according to the invention. Also provided is a method for altering seed properties of a plant comprising introducing the recombinant gene according to the invention into the genome of a plant, or providing the plant according to the invention. In another embodiment, said plant is a seed crop plant.

[65] "Seed properties" as used herein are properties of the seed. Seed properties can, for example, be seed yield, seed storage compound production, seed compound accumulation, seed nutrient

accumulation; seed micronutrient accumulation; seed storage compound quality, seed compound composition, seed quality, abiotic stress tolerance, seed dormancy, seed imbibition, seed germination, seed vigor. Seed storage compounds can, for example, be, seed oil, seed starch, or seed protein.

[66] Seed properties may be modulated by modulating metabolic pathways, such as starch metabolism, sugar metabolism, inositol phosphate metabolism, glycolysis, amino acid biosynthesis, carbon metabolism, nucleotide metabolism, oxidative pentose phosphate cycle, fatty acid biosynthesis, protein synthesis, or phytate metabolism, and modulating secondary metabolism pathways. Another example is the methyl recycling metabolic activity impacting chromatin remodeling, phospholipid biosynthesis and cell wall lignification. Such metabolic pathways can be modulated by, for example, overexpressing or downregulating a gene involved in one or more of the metabolic pathways using the endosperm-preferential promoter according to the invention. [67] Also provided is the use of the isolated nucleic acid according to the invention to regulate expression of an operably linked nucleic acid in a plant, and the use of the isolated nucleic acid according to the invention, or the recombinant gene according to the invention to alter seed properties in a plant. In a further embodiment, said plant is a seed crop plant. Also provided is the use of the isolated nucleic acid according to the invention to identify other nucleic acids comprising endosperm-preferential promoter activity. [68] Other nucleic acids comprising endosperm-preferential promoter activity can be identified using methods known in the art. Such nucleotide sequence may be identified and isolated by hybridization under stringent conditions using as probes a nucleic acid comprising the nucleotide sequences of any one of SEQ ID NO: 1 to SEQ ID NO: 20 or part thereof. Other nucleic acids comprising endosperm- preferential promoter activity may also be obtained by DNA amplification using oligonucleotides specific for the sequences according to the invention as primers, such as but not limited to

oligonucleotides comprising or consisting of about 20 to about 50 consecutive nucleotides from the nucleotide sequences of any one of any one of SEQ ID NO: 1 to SEQ ID NO: 20 or its complement. Other nucleic acids comprising endosperm-preferential promoter activity can be identified in silico using Basic Local Alignment Search Tool (BLAST) homology search with other nucleotide or amino acid sequences. Functionality of the identified nucleic acids comprising endosperm-preferential promoter activity can be validated using the methods described herein. Other nucleic acids comprising

endosperm-preferential promoter activity may also be identified by identification of gene sequences orthologous to the gene sequences of the endogenous coding sequences of the genes driven by the promoters of the invention, and isolating the promoter sequences upstream of these orthologous homologous coding sequences.

[69] The promoters according to the invention can further be used to create hybrid promoters, i.e. promoters containing (parts of) one or more of the promoters(s) of the current invention and (parts of) other promoter which can be newly identified or known in the art. Such hybrid promoters may have optimized tissue specificity or expression level.

[70] Yet another embodiment provides a method of producing food, feed, or an industrial product comprising (a) obtaining the plant or a part thereof, according to the invention; and (b) preparing the food, feed or industrial product from the plant or part thereof. In another embodiment, said food or feed is oil, meal, grain, starch, flour or protein, or said industrial product is biofuel, fiber, industrial chemicals, a pharmaceutical or a nutraceutical.

[71] A "seed crop" or "seed crop plant" as used herein is a crop grown for its seeds. Examples of seed crops are rice, maize, wheat, barley, millet, rye, oats, camelina, crambe, Linum, castor bean, calendula, safflower, sunflower, soybean, or Brassica species, such as Brassica napus, Brassica juncea, Brassica carinata, Brassica rapa, Brassica oleracea, and Brassica nigra. [72] A "Brassica plant" refers to allotetraploid or amphidiploid Brassica napus (AACC, 2n=38), Brassica juncea (AABB, 2n=36), Brassica carinata (BBCC, 2n=34), or to diploid Brassica rapa (syn. B. campestris) (AA, 2n=20), Brassica oleracea (CC, 2n=18) or Brassica nigra (BB, 2n=16). [73] A "Crop of oilseed rape" as used herein refers to oilseed rape cultivated as a crop, such as Brassica napus, Brassica juncea, Brassica carinata, Brassica rapa (syn. B. campestris), Brassica oleracea or Brassica nigra.

[74] The plants according to the invention may additionally contain an endogenous or a transgene, which confers herbicide resistance, such as the bar or pat gene, which confer resistance to glufosinate ammonium (Liberty®, Basta® or Ignite®) [EP 0 242 236 and EP 0 242 246 incorporated by reference]; or any modified EPSPS gene, such as the 2mEPSPS gene from maize [EPO 508 909 and EP 0 507 698 incorporated by reference], or glyphosate acetyltransferase, or glyphosate oxidoreductase, which confer resistance to glyphosate (RoundupReady®), or bromoxynitril nitrilase to confer bromoxynitril tolerance, or any modified AHAS gene, which confers tolerance to sulfonylureas, imidazolinones,

sulfonylaminocarbonyltriazolinones, triazolopyrimidines or pyrimidyl(oxy/thio)benzoates, such as oilseed rape imidazolinone-tolerant mutants PM1 and PM2, currently marketed as Clearfield® canola. Further, the plants according to the invention may additionally contain an endogenous or a transgene which confers increased oil content or improved oil composition, such as a 12:0 ACP

thioesteraseincrease to obtain high laureate, which confers pollination control, such as such as barnase under control of an anther-specific promoter to obtain male sterility, or barstar under control of an anther-specific promoter to confer restoration of male sterility, or such as the Ogura cytoplasmic male sterility and nuclear restorer of fertility.

[75] The plants and seeds according to the invention may be further treated with a chemical compound, such as a chemical compound selected from the following lists:

Herbicides: Clethodim, Clopyralid, Diclofop, Ethametsulfuron, Fluazifop, Glufosinate, Glyphosate, Metazachlor, Quinmerac, Quizalofop, Tepraloxydim, Trifluralin.

Fungicides / PGRs: Azoxystrobin, N-[9-(dichloromethylene)-l,2,3,4-tetrahydro-l,4-methanonapht halen- 5-yl]-3 -(difluoromethyl)- 1 -methyl- 1 H-pyrazole-4-carboxamide (Benzovindiflupyr, Benzodiflupyr), Bixafen, Boscalid, Carbendazim, Carboxin, Chlormequat-chloride, Coniothryrium minitans,

Cyproconazole, Cyprodinil, Difenoconazole, Dimethomorph, Dimoxystrobin, Epoxiconazole,

Famoxadone, Fluazinam, Fludioxonil, Fluopicolide, Fluopyram, Fluoxastrobin, Fluquinconazole, Flusilazole, Fluthianil, Flutriafol, Fluxapyroxad, Iprodione, Isopyrazam, Mefenoxam, Mepiquat- chloride, Metalaxyl, Metconazole, Metominostrobin, Paclobutrazole, Penflufen, Penthiopyrad,

Picoxystrobin, Prochloraz, Prothioconazole, Pyraclostrobin, Sedaxane, Tebuconazole, Tetraconazole, Thiophanate-methyl, Thiram, Triadimenol, Trifloxystrobin, Bacillus firmus, Bacillus firmus strain I- 1582, Bacillus subtilis, Bacillus subtilis strain GB03, Bacillus subtilis strain QST 713, Bacillus pumulis, Bacillus, pumulis strain GB34.

Insecticides: Acetamiprid, Aldicarb, Azadirachtin, Carbofuran, Chlorantraniliprole (Rynaxypyr), Clothianidin, Cyantraniliprole (Cyazypyr), (beta-)Cyfluthrin, gamma-Cyhalothrin, lambda-Cyhalothrin, Cypermethrin, Deltamethrin, Dimethoate, Dinetofuran, Ethiprole, Flonicamid, Flubendiamide, - -

Fluensulfone, Fluopyram,Flupyradifurone, tau-Fluvalinate, Imicyafos, Imidacloprid, Metaflumizone, Methiocarb, Pymetrozine, Pyrifluquinazon, Spinetoram, Spinosad, Spirotetramate, Sulfoxaflor, Thiacloprid, Thiamethoxam, 1 -(3-chloropyridin-2-yl)-N-[4-cyano-2-methyl-6- (methylcarbamoyl)phenyl]-3 - { [5-(trifluoromethyl)-2H-tetrazol-2-yl]methyl} - 1 H-pyrazole-5- carboxamide, 1 -(3 -chloropyridin-2-yl)-N- [4-cyano-2-methyl-6-(methylcarbamoyl)phenyl] -3 - { [5- (trifluoromethyl)- 1 H-tetrazol- 1 -yl]methyl} - 1 H-pyrazole-5-carboxamide, 1 - {2-fluoro-4-methyl-5- [(2,2,2-trifluorethyl)sulfinyl]phenyl} -3 -(trifluoromethyl)- 1 H- 1 ,2,4-triazol-5-amine, (1 E)-N-[(6- chloropyridin-3-yl)methyl]-N'-cyano-N-(2,2-difluoroethyl)eth animidamide, Bacillus firmus, Bacillus firmus strain 1-1582, Bacillus subtilis, Bacillus subtilis strain GB03, Bacillus subtilis strain QST 713, Metarhizium anisopliae F52.

[76] Whenever reference to a "plant" or "plants" according to the invention is made, it is understood that also plant parts (cells, tissues or organs, seed pods, seeds, severed parts such as roots, leaves, flowers, pollen, etc.), progeny of the plants which retain the distinguishing characteristics of the parents, such as seed obtained by selling or crossing, e.g. hybrid seed (obtained by crossing two inbred parental lines), hybrid plants and plant parts derived there from are encompassed herein, unless otherwise indicated.

[77] In some embodiments, the plant cells of the invention as well as plant cells generated according to the methods of the invention, may be non-propagating cells.

[78] The obtained plants according to the invention can be used in a conventional breeding scheme to produce more plants with the same characteristics or to introduce the same characteristic in other varieties of the same or related plant species, or in hybrid plants. The obtained plants can further be used for creating propagating material. Plants according to the invention can further be used to produce gametes, seeds (including crushed seeds and seed cakes), seed oil, embryos, either zygotic or somatic, progeny or hybrids of plants obtained by methods of the invention. Seeds obtained from the plants according to the invention are also encompassed by the invention.

[79] "Creating propagating material", as used herein, relates to any means know in the art to produce further plants, plant parts or seeds and includes inter alia vegetative reproduction methods (e.g. air or ground layering, division, (bud) grafting, micropropagation, stolons or runners, storage organs such as bulbs, corms, tubers and rhizomes, striking or cutting, twin-scaling), sexual reproduction (crossing with another plant) and asexual reproduction (e.g. apomixis, somatic hybridization).

[80] As used herein "comprising" is to be interpreted as specifying the presence of the stated features, integers, steps or components as referred to, but does not preclude the presence or addition of one or more features, integers, steps or components, or groups thereof. Thus, e.g., a nucleic acid or protein comprising a sequence of nucleotides or amino acids, may comprise more nucleotides or amino acids than the actually cited ones, i.e., be embedded in a larger nucleic acid or protein. A chimeric gene comprising a nucleic acid which is functionally or structurally defined, may comprise additional DNA regions etc.

[81] All patents, patent applications, and publications or public disclosures (including publications on internet) referred to or cited herein are incorporated by reference in their entirety.

[82] The sequence listing contained in the file named„BCS13-2023_ST25.txt", which is 63 kilobytes (size as measured in Microsoft Windows®), contains 20 sequences SEQ ID NO: 1 through SEQ ID NO: 20 is filed herewith by electronic submission and is incorporated by reference herein.

In the description and examples, reference is made to the following sequences SEQUENCES

SEQ ID NO 1 Endosperm-preferential promoter 2

SEQ ID NO 2 Endosperm-preferential promoter 4

SEQ ID NO 3 Endosperm-preferential promoter 7

SEQ ID NO 4 Endosperm-preferential promoter _8

SEQ ID NO 5 Endosperm-preferential promoter _9

SEQ ID NO 6 Endosperm-preferential promoter .10

SEQ ID NO 7 Endosperm-preferential promoter .12

SEQ ID NO 8 Endosperm-preferential promoter .13

SEQ ID NO 9 Endosperm-preferential promoter .14

SEQ ID NO 10 Endosperm-preferential promoter .15

SEQ ID NO 11 Endosperm-preferential promoter .16

SEQ ID NO 12 Endosperm-preferential promoter .17

SEQ ID NO 13 Endosperm-preferential promoter .18

SEQ ID NO 14 Endosperm-preferential promoter .19

SEQ ID NO 15 Endosperm-preferential promoter 20

SEQ ID NO 16 Endosperm-preferential promoter .21

SEQ ID NO 17 Endosperm-preferential promoter 22

SEQ ID NO 1 3 Endosperm-preferential promoter 23

SEQ ID NO 19 Endosperm-preferential promoter 24

SEQ ID NO 20 Endosperm-preferential promoter . 25 - -

EXAMPLES

[84] Unless stated otherwise in the Examples, all recombinant DNA techniques are carried out according to standard protocols as described in Sambrook and Russell (2001) Molecular Cloning: A Laboratory Manual, Third Edition, Cold Spring Harbor Laboratory Press, NY, in Volumes 1 and 2 of Ausubel et al. (1994) Current Protocols in Molecular Biology, Current Protocols, USA and in Volumes I and II of Brown (1998) Molecular Biology LabFax, Second Edition, Academic Press (UK). Standard materials and methods for plant molecular work are described in Plant Molecular Biology Labfax (1993) by R.D.D. Croy, jointly published by BIOS Scientific Publications Ltd (UK) and Blackwell Scientific Publications, UK. Standard materials and methods for polymerase chain reactions can be found in Dieffenbach and Dveksler (1995) PCR Primer: A Laboratory Manual, Cold Spring Harbor Laboratory Press, and in McPherson at al. (2000) PCR - Basics: From Background to Bench, First Edition, Springer Verlag, Germany.

Example 1 - RNA isolation from different tissues of Brassica napus

[85] The following tissues were isolated from Brassica napus:

- Apical meristem 33 days after sowing (DAS) (including smallest leaves) (AM33)

- Big flower buds (> 5 mm) 42 DAS (BFB42)

- Cotyledons (with hypocotyl) 10 DAS (CTYL 10)

- Open flowers 52 DAS (OF52)

- Pods 14-20 DAS (Pod2)

- Pods 21-25 DAS (Pod3)

- Roots 14 DAS (Ro2w)

- Small flower buds < 5 mm 42 DAS (SFB42)

- Seeds 14-20 days after flowering (DAF) (Seed2)

- Seeds 21-25 DAF (Seed3)

- Seeds 26-30 DAF (Seed4)

- Seeds 31-35 DAF (Seed5)

- Seeds 42 DAF (Seed6)

- Seeds 49 DAF (Seed7)

- Stem 14 DAS (St2w)

- Stem 33 DAS (St5w)

Young leaf 33 DAS (< 3 cm leaf next to apical meristem) (YL33)

The following embryonic tissues were isolated:

1 Embryo, hypocotyl, 18 DAF

4 Embryo, hypocotyl, 18 DAF 10 Embryo, hypocotyl, 24 DAF

13 Embryo, hypocotyl, 28 DAF

15 Embryo, hypocotyl, 28 DAF

17 Embryo, hypocotyl, 32 DAF

19 Embryo, hypocotyl, 32 DAF

21 Embryo, hypocotyl, 46 DAF

23 Embryo, hypocotyl, 46 DAF

2 Embryo, inner cotyledon, 18 DAF

5 Embryo, inner cotyledon, 18 DAF

8 Embryo, inner cotyledon, 24 DAF

11 Embryo, inner cotyledon, 24 DAF

14 Embryo, inner cotyledon, 28 DAF

16 Embryo, inner cotyledon, 28 DAF

18 Embryo, inner cotyledon, 32 DAF

20 Embryo, inner cotyledon, 32 DAF

22 Embryo, inner cotyledon, 46 DAF

24 Embryo, inner cotyledon, 46 DAF

3 Embryo, endosperm, 18 DAF

6 Embryo, endosperm, 18 DAF

12 Embryo, endosperm, 24 DAF

25 Embryo, outer cotyledon, 18 DAF

26 Embryo, outer cotyledon, 18 DAF

27 Embryo, outer cotyledon, inner part, 24 DAF

29 Embryo, outer cotyledon, inner part, 24 DAF

31 Embryo, outer cotyledon, inner part, 28 DAF

33 Embryo, outer cotyledon, inner part, 28 DAF

35 Embryo, outer cotyledon, inner part, 32 DAF

37 Embryo, outer cotyledon, inner part, 32 DAF

39 Embryo, outer cotyledon, inner part, 46 DAF

41 Embryo, outer cotyledon, inner part, 46 DAF

28 Embryo, outer cotyledon, outer part, 24 DAF

30 Embryo, outer cotyledon, outer part, 24 DAF

32 Embryo, outer cotyledon, outer part, 28 DAF

34 Embryo, outer cotyledon, outer part, 28 DAF

36 Embryo, outer cotyledon, outer part, 32 DAF

38 Embryo, outer cotyledon, outer part, 32 DAF

40 Embryo, outer cotyledon, outer part, 46 DAF

42 Embryo, outer cotyledon, outer part, 46 DAF - 5 -

[87] For the isolation of the embryo tissues, freshly harvested seeds were frozen at -80°C and cut into 20μιη sections. Sections were placed on PET -membranes, lyophilized at -20°C, and then used for laser- assisted microdissection (PALM Laser-Microbeam instrument; Bernried/Germany) (for details see Schiebold et al., 2011, Plant Methods 7:19). Up to 5 distinct embryonic tissues plus endosperm were targeted. Tissue dissection was applied to seeds at 18, 24, 28, 32 and 46 DAF, covering the

developmental period from onset of storage activity until late maturation. RNA was extracted

(purification of total RNA by RNeasy Micro kit; Qiagen) and amplified (C&E version ExpressArt mRNA amplification Nano kit; Amp-tec) as detailed in Schiebold et al. 2011 (supra).

[88] Total RNA from the non-embryonic tissues was isolated according to standard methods.

Example 2 - Identification of Brassica napus endosperm-preferential transcripts

[89] Transcript activity in the different B. napus tissues was measured using transcript profiling (RNA-seq) by Illumina HiSeq paired-end sequencing (2x100 bp sequences).

[90] The counts per transcript (transcript per million; tpm) was determined as described by Li and Dewey, 2011, BMC Bioinformatics 12: 323 (incorporated herein by reference). Sequence reads were mapped to transcripts using an in-house transcriptome dataset.

[91] The tpm values were normalized as described by Anders and Huber, 2010, Genome Biol 11 : R106 (incorporated herein by reference).

[92] endosperm-preferential transcript were selected using the following criteria:

- low transcript activity in AM33, BFB42, CtyllO, OF52, Pod2, Pod3, Ro2w, SFB42, St2w, St5w, YL33, and in the embryo tissues 1, 4, 10, 13, 15, 17, 19, 21, 23, 2, 5, 8, 11, 14, 16, 18, 20, 22, 24, 25, 26, 27, 29, 31, 33, 35, 37, 39, 41, 28, 30, 32, 34, 36, 38, 40 and 42;

- transcript should be detected in embryo tissues 3, 6 and 12 (Embryo, endosperm, 18, 18 and 24 DAF);

- whether or not expression was detected in seeds (seed2-7) was not taken into account.

[93] Transcripts were identified having a 6-fold higher expression in the endosperm in any one of tissues 3, 6, and 12 as compared to each of the the reference tissues, based on the normalized values.

[94] The 20 transcripts fulfilling the above criteria with the highest expression in the three endosperm tissues 3, 6, and 12, were selected. Figures 1 and 2 show the relative expression levels of the identified transcripts in different tissues. - -

Example 3 - Identification of Brassica napus endosperm-preferential promoters

[95] The sequences of the 20 transcripts as identified above were blasted against an in-house database of Brassica napus sequences. Sequences upstream of the predicted ATG translation start codon of the above-identified transcripts until the next upstream predicted gene, or until 3 kb if the next upstream predicted gene was located more than 3 kb further upstream, were obtained. The sequences obtained in this way, comprising the promoters conferring endosperm-preferential expression of the transcripts Endosperm_2, Endosperm_4 Endosperm_7- 10 and Endosperm_12-25 are given in SEQ ID NO: 1 to SEQ ID NO: 20, respectively.