Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
INHIBITION OF GENE EXPRESSION
Document Type and Number:
WIPO Patent Application WO/1997/004112
Kind Code:
A2
Abstract:
A method of inhibiting gene expression is described. The method, which affects enzymatic activity in a plant, comprises expressing in a plant (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; and wherein the nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron.

Inventors:
POULSEN PETER (DK)
Application Number:
PCT/EP1996/003052
Publication Date:
February 06, 1997
Filing Date:
July 12, 1996
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
DANISCO (DK)
POULSEN PETER (DK)
International Classes:
A01H5/00; C12N9/10; C12N9/44; C12N15/09; C12N15/113; C12N15/29; C12N15/54; C12N15/82; C12Q1/68; (IPC1-7): C12N15/82; A01H5/00; C12N15/11; C12N15/54; C12N15/79
Domestic Patent References:
WO1992014827A11992-09-03
WO1992011375A11992-07-09
WO1995026407A11995-10-05
WO1992015680A11992-09-17
WO1990008828A21990-08-09
WO1994009144A11994-04-28
WO1994011520A21994-05-26
Foreign References:
EP0240208B11994-11-17
US4740463A1988-04-26
Other References:
THEORETICAL AND APPLIED GENETICS, vol. 86, 1993, pages 665-672, XP002021375 SHIMADA, H., ET AL.: "Antisense regulation of the rice waxy gene expression using a PCR-amplified fragment of rice genome reduces the amylose content in grain starch"
PROCEEDINGS OF AN INTERNATIONAL SYMPOSIUM ON PLANT POLYMERIC CARBOHYDRATES, no. 134, 1 July 1992, pages 33-39, XP002014045 WILLMITZER L ET AL: "STARCH SYNTHESIS IN TRANSGENIC PLANTS"
JOURNAL OF BIOLOGICAL CHEMISTRY, vol. 268, no. 25, 5 September 1993, pages 19084-19091, XP002014043 MIZUNO K ET AL: "ALTERATION OF THE STRUCTURAL PROPERTIES OF STARCH COMPONENTS BY THE LACK OF AN ISOFORM OF STARCH BRANCHING ENZYME IN RICE SEEDS"
PLANT MOLECULAR BIOLOGY, vol. 26, 1994, pages 1759-1773, XP002021172 KUIPERS, A.G.J., ET AL.: "Field evaluation of transgenic potato plants expressing an antisense granule-bound starch synthase gene: increase of the antisense effect during tuber growth"
MOLECULAR AND GENERAL GENETICS, vol. 246, no. 6, 20 March 1995, pages 745-755, XP000571429 KUIPERS A G J ET AL: "FACTORS AFFECTING THE INHIBITION BY ANTISENSE RNA OF GRANULE-BOUND STARCH SYNTHASE GENE EXPRESSION IN POTATO"
CHEMICAL ABSTRACTS, vol. 121, no. 9, 29 August 1994 Columbus, Ohio, US; abstract no. 101242, SIHMADA, H., ET AL.: "Molecular cloning of cDNA and gene for rice starch branching enzyme and its use for breeding" XP002021261 & JP 06 098 656 A (MITSUI) 12 April 1994
Download PDF:
Claims:
CLAIMS
1. A method of affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) comprising expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; and wherein the nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron.
2. A method according to claim 1 wherein starch branching enzyme activity is affected and/or wherein the levels of amylopectin are affected and/or the composition of starch is changed.
3. A method of affecting enzymatic activity in a starch producing organism (or a cell, a tissue or an organ thereof) comprising expressing in the starch producing organism (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; and wherein starch branching enzyme activity is affected and/or the levels of amylopectin are affected and/or the composition of starch is changed.
4. A method according to claim 3 wherein the nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron.
5. A method according to any one of the preceding claims wherein the enzymatic activity is reduced or eliminated.
6. A method according to any one of the preceding claims wherein the nucleotide sequence codes for at least substantially all of at least one intron in an antisense orientation.
7. A method according to any one of the preceding claims wherein the nucleotide sequence codes for all of at least one intron in an antisense orientation.
8. A method according to any one of the preceding claims wherein the nucleotide sequence comprises the sequence shown as any one of SEQ.I.D. No. 15 to SEQ.I.D. No. 27 or a variant, derivative or homologue thereof, including combinations thereof.
9. A method according to any one of the preceding claims wherein the nucleotide sequence is expressed by a promoter having a sequence shown as SEQ.I.D. No. 14 or a variant, derivative or homologue thereof.
10. An antisense sequence comprising the nucleotide sequence as defined in claim 8 or a variant, derivative or homologue thereof.
11. A promoter having a sequence shown as SEQ.I.D. No. 14, or a variant, derivative or homologue thereof.
12. A promoter according to claim 11 in combination with a gene of interest ("GOI").
13. A construct capable of comprising or expressing the invention according to any one of claims 10 to 12.
14. A vector comprising or expressing the invention according to any one of claims 10 to 13.
15. A combination of nucleotide sequences comprising a first nucleotide sequence coding for a recombinant enzyme; and a second nucleotide sequence which corresponds to an intron in antisense orientation; wherein the intron is an intron that is associated with a genomic gene encoding an enzyme corresponding to the recombinant enzyme; and wherein the second nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron.
16. A cell, tissue or organ comprising or expressing the invention according to any one of claims 10 to 15.
17. A transgenic starch producing organism comprising or expressing the invention according to any one of claims 10 to 16.
18. A transgenic starch producing organism according to claim 17 wherein the organism is a plant.
19. A starch obtained from the invention according to any one of the preceding claims.
20. pBEA8 (NCIMB 40753) or pBEA9 (NCIMB 40815).
21. A nucleotide sequence that is antisense to any one or more of the intron sequences obtainable from λSBE 3.2 (NCIMB 40751) or λSBE 3.4 (NCIMB 40752) or a variant, derivative or homologue thereof.
22. A method of expressing a recombinant protein or enzyme in a host organism comprising expressing a nucleotide sequence coding for the recombinant protein or enzyme; and expressing a further nucleotide sequence wherein the further nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; wherein the intron is an intron normally associated with the genomic gene encoding a protein or an enzyme corresponding to the recombinant protein or enzyme; and wherein the further nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron.
Description:
INHIBITION OF GENE EXPRESSION

The present invention relates to a method of inhibiting gene expression, particularly inhibiting gene expression in a plant. The present invention also relates to a nucleotide sequence useful in the method. In addition, the present invention relates to a promoter that is useful for expressing the nucleotide sequence.

Starch is one of the main storage carbohydrates in plants, especially higher plants. The structure of starch consists of amylose and amylopectin. Amylose consists essentially of straight chains of α-l-4-linked glycosyl residues. Amylopectin comprises chains of α-l-4-linked glycosyl residues with some cϋ-1-6 branches. The branched nature of amylopectin is accomplished by the action of inter alia an enzyme commonly known as the starch branching enzyme ("SBE"). SBE catalyses the formation of branch points in the amylopectin molecule by adding α-1 ,4 glucans through α-l,6-glucosidic branching linkages. The biosynthesis of amylose and amylopectin is schematically shown in Figure 1 , whereas the α-l-4-links and the α-1-6 links are shown in Figure 2.

It is known that starch is an important raw material. Starch is widely used in the food, paper, and chemical industries. However, a large fraction of the starches used in these industrial applications are post- harvest modified by chemical, physical or enzymatic methods in order to obtain starches with certain required functional properties.

Within the past few years it has become desirable to make genetically modified plants which could be capable of producing modified starches which could be the same as the post-harvest modified starches. It is also known that it may be possible to prepare such genetically modified plants by expression of antisense nucleotide coding sequences. In this regard, June Bourque provides a detailed summary of antisense strategies for the genetic manipulations in plants (Bourque 1995 Plant Science 105 pp

125-149). At this stage, reference could be made to Figure 3 which is a schematic diagram of one of the proposed mechanisms of antisense-RNA inhibition.

In particular, WO 92/11375 reports on a method of genetically modifying potato so as to form amylose-type starch. The method involves the use of an anti-sense construct that can apparently inhibit, to a varying extent, the expression of the gene coding for formation of the branching enzyme in potato. The antisense construct of WO 92/11375 consists of a tuber specific promoter, a transcription start sequence and the first exon of the branching enzyme in antisense direction. However, WO 92/11375 does not provide any antisense sequence data. In addition, WO 92/11375 only discloses the use of the potato GBSS promoter.

WO 92/14827 reports on a plasmid that, after insertion into the genome of a plant, can apparently cause changes in the carbohydrate concentration and carbohydrate composition, such as the concentration and composition of amylose and amylopectin, in the regenerated plant. The plasmid contains part of the coding sequence of a branching enzyme in an antisense orientation.

EP-A-0647715 reports on the use of antisense endogenous mRNA coding DNA to alter the characteristics and the metabolic pathways of ornamental plants.

EP-A-0467349 reports on the expression of sequences that are antisense to sequences upstream of a promoter to control gene expression.

EP-A-0458367 and US-A-5107065 report on the expression of a nucleotide sequence to regulate gene expression in a plant. The nucleotide sequence is complementary to a mRNA sequence of a gene and may cover all or a portion of the non-coding region of the gene. In other words, the nucleotide sequences of EP-A-0458367 and US-A-

5107065 must at least comprise a sequence that is complementary to a coding region. EP-A-0458367 and US-A-5107065 contain minimal sequence information.

Kuipers ei al in Mol. Gen. Genet. [1995] 246 745-755 report on the expression of a series of nucleotides that are antisense to part of the genomic intron sequences of potato granule bound starch synthetase. Here the antisense intron sequences are attached to a part of the antisense exon sequences - wherein the intron sequences and

the exon sequences are naturally associated with each other. In addition, the expressed antisense intron sequences are at most 231 bp in length.

Likewise, Kull et al in J. Genet & Breed. [1995] 49 69-76 report on the expression of a series of nucleotides that are antisense to part of the genomic intron sequences of potato granule bound starch synthetase. Likewise, here the antisense intron sequences are attached to a pan of the antisense exon sequences - wherein the intron sequences and the exon sequences are naturally associated with each other. In addition, likewise, the expressed antisense intron sequences are at most 231 bp in length.

Shimada et al in Theor. Appl. Genet. [1993] 86 665-672 report on the expression of a series of nucleotides that are antisense to part of the genomic intron sequences of rice granule bound starch synthetase. Here the antisense intron sequences are attached to a part of the antisense exon sequences - wherein the intron sequences and the exon sequences are naturally associated with each other. In addition, the expressed antisense intron sequences are less than 350 bp in length.

Reviews on how enzymatic activity can be affected by expression of particular nucleotide sequences may be found in the teachings of Finnegan and McElroy [1994]

Biotechnology 12 883-888; and Matzke and Matzke [1995] TIG ϋ 1-3.

Whilst it is known that enzymatic activity can be affected by expression of particular nucleotide sequences there is still a need for a method that can more reliably and/or more efficiently and/or more specifically affect enzymatic activity.

According to a first aspect of the present invention there is provided a method of affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) comprising expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the nucleotide sequence partially or completely codes for

(is) an intron in an antisense orientation; and wherein the nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with

the intron.

According to a second aspect of the present invention there is provided a method of affecting enzymatic activity in a starch producing organism (or a cell, a tissue or an organ thereof) comprising expressing in the starch producing organism (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; and wherein starch branching enzyme activity is affected and/or the levels of amylopectin are affected and/or the composition of starch is changed.

According to a third aspect of the present invention there is provided an antisense sequence comprising the nucleotide sequence shown as any one of SEQ.I.D. No. 15 to SEQ.I.D. No. 27 or a variant, derivative or homologue thereof.

According to a fourth aspect of the present invention there is provided a promoter comprising the sequence shown as SEQ.I.D. No. 14 or a variant, derivative or homologue thereof.

According to a fifth aspect of the present invention there is provided a construct capable of comprising or expressing the present invention.

According to a sixth aspect of the present invention there is provided a vector comprising or expressing the present invention.

According to a seventh aspect of the present invention there is provided a cell, tissue or organ comprising or expressing the present invention.

According to an eighth aspect of the present invention there is provided a transgenic starch producing organism comprising or expressing the present invention.

According to a ninth aspect of the present invention there is provided a starch obtained from the present invention.

According to a tenth aspect of the present invention there is provided pBEA8 (NCIMB 40753) or pBEA9 (NCIMB 40815).

According to an eleventh aspect of the present invention there is provided a nucleotide sequence that is antisense to any one or more of the intron sequences obtainable from λ-SBE 3.2 (NCIMB 40751) or λ-SBE 3.4 (NCIMB 40752) or a variant, derivative or homologue thereof.

A key advantage of the present invention is that it provides a method for preparing modified starches that is not dependent on the need for post- harvest modification of starches. Thus the method of the present invention obviates the need for the use of hazardous chemicals that are normally used in the post-harvest modification of starches.

In addition, the present invention provides inter alia genetically modified plants which are capable of producing modified and/or novel and/or improved starches whose properties would satisfy various industrial requirements.

Thus, the present invention provides a method of preparing tailor-made starches in plants which could replace the post-harvest modified starches.

Also, the present invention provides a method that enables modified starches to be prepared by a method that can have a more beneficial effect on the environment than the known post-harvest modification methods which are dependent on the use of hazardous chemicals and large quantities of energy.

An other key advantage of the present invention is that it provides a method that may more reliably and/or more efficiently and/or more specifically affect enzymatic activity when compared to the known methods of affecting enzymatic activity. With regard to this advantage of the present invention it is to be noted that there is some degree of homology between coding regions of SBEs. However, there is little or no homology with the intron sequences of SBEs.

Thus, antisense intron expression provides a mechanism to affect selectively the expression of a particular SBE. This advantageous aspect could be used, for example, to reduce or eliminate a particular SBE enzyme and replace that enzyme with another enzyme which can be another branching enzyme or even a recombinant version of the affected enzyme or even a hybrid enzyme which could for example comprise part of a SBE enzyme from one source and at least a part of another SBE enzyme from another source. This particular feature of the present invention is covered by the combination aspect of the present invention which is discussed in more detail later.

Thus the present invention provides a mechanism for selectively affecting SBE activity. This is in contrast to the prior art methods which are dependent on the use of for example antisense exon expression whereby it would not be possible to introduce new SBE activity without affecting that activity as well.

Preferably with the first aspect of the present invention starch branching enzyme activity is affected and/or wherein the levels of amylopectin are affected and/or the composition of starch is changed.

Preferably with the second aspect of the present invention the nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron.

Preferably with the fourth aspect of the present invention the promoter is in combination with a gene of interest ("GOI").

Preferably the enzymatic activity is reduced or eliminated.

Preferably the nucleotide sequence codes for at least substantially all of at least one intron in an antisense orientation.

Preferably the nucleotide sequence codes, partially or completely, for two or more introns and wherein each intron is in an anti-sense orientation.

Preferably the nucleotide sequence comprises at least 350 nucleotides (e.g. at least 350 bp), more preferably at least 500 nucleotides (e.g. at least 500 bp).

Preferably the nucleotide sequence comprises the sequence shown as any one of SEQ. I.D. No. 15 to SEQ.I.D. No. 27 or a variant, derivative or homologue thereof, including combinations thereof.

Preferably the nucleotide sequence is expressed by a promoter having a sequence shown as SEQ. I.D. No 14 or a variant, derivative or homologue thereof.

Preferably the transgenic starch producing organism is a plant.

A preferred aspect of the present invention therefore relates to a method of affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) comprising expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; wherein the nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron; and wherein starch branching enzyme activity is affected and/or the levels of amylopectin are affected and/ or the composition of starch is changed.

A more preferred aspect of the present invention therefore relates to a method of affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) comprising expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; wherein the nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron; wherein starch branching enzyme activity is affected and/or the levels of amylopectin are affected and/or the composition of starch is changed; and wherein the

nucleotide sequence comprises the sequence shown as any one of SEQ.I.D. No. 15 to SEQ.I.D. No. 27 or a variant, derivative or homologue thereof, including combinations thereof.

The term "nucleotide" in relation to the present invention includes DNA and RNA.

Preferably it means DNA, more preferably DNA prepared by use of recombinant DNA techniques.

The term "intron" is used in its normal sense as meaning a segment of nucleotides, usually DNA, that does not encode part or all of an expressed protein or enzyme.

The term "exon" is used in its normal sense as meaning a segment of nucleotides, usually DNA, encoding part or all of an expressed protein or enzyme.

Thus, the term "intron" refers to gene regions that are transcribed into RNA molecules, but which are spliced out of the RNA before the RNA is translated into a protein. In contrast, the term "exon" refers to gene regions that are transcribed into RNA and subsequently translated into proteins.

The terms "variant" or "homologue" or "fragment" in relation to the nucleotide sequence of the present invention include any substitution of, variation of, modification of, replacement of, deletion of or addition of one (or more) nucleic acid from or to the respective nucleotide sequence providing the resultant nucleotide sequence can affect enzyme activity in a plant, or cell or tissue thereof, preferably wherein the resultant nucleotide sequence has at least the same effect as any one of the antisense sequences shown as SEQ.I.D. No.s 15-27. In particular, the term "homologue" covers homology with respect to similarity of structure and/or similarity of function providing the resultant nucleotide sequence has the ability to affect enzymatic activity in accordance with the present invention. With respect to sequence homology (i.e. similarity), preferably there is more than 80% homology, more preferably at least 85% homology, more preferably at least 90% homology, even more preferably at least 95% homology, more preferably at least 98% homology.

The above terms are also synonymous with allelic variations of the sequences.

Likewise, the terms "variant" or "homologue" or "fragment" in relation to the promoter of the present invention include any substitution of. variation of, modification of, replacement of, deletion of or addition of one (or more) nucleic acid from or to the respective promoter sequence providing the resultant promoter sequence allows expression of a GOI, preferably wherein the resultant promoter sequence has at least the same effect as SEQ.I.D. No. 14. In particular, the term "homologue" covers homology with respect to similarity of structure and/or similarity of function providing the resultant promoter sequence has the ability to allow for expression of a GOI, such as a nucleotide sequence according to the present invention. With respect to sequence homology (i.e. similarity), preferably there is more than 80% homology, more preferably at least 85% homology, more preferably at least 90% homology, even more preferably at least 95 % homology, more preferably at least 98% homology. The above terms are also synonymous with allelic variations of the sequences.

The term "antisense" means a nucleotide sequence that is complementary to, and can therefore hybridize with, any one or all of the intron sequences of the present invention, including partial sequences thereof.

With the present invention, the antisense intron can be complementary to an entire intron of the gene to be inhibited. However, in some circumstances, partial antisense sequences may be used (i.e. sequences that are not or do not comprise the full complementary sequence) providing the partial sequences affect enzymatic activity.

Suitable examples of partial sequences include sequences that are shorter than any one of the full antisense sequences shown as SEQ. I.D. No.s 15 to 27 but which comprise nucleotides that are at least antisense to the sense intron sequences adjacent the respective exon or exons.

With regard to the second aspect of the present invention (i.e. specifically affecting SBE activity), the nucleotide sequences of the present invention may comprise one or more sense or antisense exon sequences of the SBE gene, including complete or partial sequences thereof, providing the nucleotide sequences can affect SBE activity, preferably wherein the nucleotide sequences reduce or eliminate SBE activity.

Preferably, the nucleotide sequence of the second aspect of the present invention does not comprise an antisense exon sequence.

The term "vector" includes an expression vector and a transformation vector. The term "expression vector" means a construct capable of in vivo or in vitro expression.

The term "transformation vector" means a construct capable of being transferred from one species to another - such as from an E.Coli plasmid to a fungus or a plant cell, or from an Agrobacterium to a plant cell.

The term "construct" - which is synonymous with terms such as "conjugate",

"cassette" and "hybrid" - in relation to the antisense nucleotide sequence aspect of the present invention includes the nucleotide sequence according to the present invention directly or indirectly attached to a promoter. An example of an indirect attachment is the provision of a suitable spacer group such as an intron sequence, such as the STzi-intron or the ADH intron, intermediate the promoter and the nucleotide sequence of the present invention. The same is true for the term "fused" in relation to the present invention which includes direct or indirect attachment. The terms do not cover the natural combination of the wild type SBE gene when associated with the wild type SBE gene promoter in their natural environment.

The construct may even contain or express a marker which allows for the selection of the genetic construct in, for example, a plant cell into which it has been transferred. Various markers exist which may be used in, for example, plants - such as mannose. Other examples of markers include those that provide for antibiotic resistance - e.g. resistance to G418, hygromycin, bleomycin, kanamycin and gentamycin.

The construct of the present invention preferably comprises a promoter. The term "promoter" is used in the normal sense of the an, e.g. an RNA polymerase binding site in the Jacob-Monod theory of gene expression. Examples of suitable promoters are those that can direct efficient expression of the nucleotide sequence of the present invention and/or in a specific type of cell. Some examples of tissue specific promoters are disclosed in WO 92/11375.

The promoter could additionally include conserved regions such as a Pribnow Box or a TATA box. The promoters may even contain other sequences to affect (such as to maintain, enhance, decrease) the levels of expression of the nucleotide sequence of the present invention. Suitable examples of such sequences include the Shl-intτon or an ADH intron. Other sequences include inducible elements - such as temperature, chemical, light or stress inducible elements. Also, suitable elements to enhance transcription or translation may be present. An example of the latter element is the TMV 5' leader sequence (see Sleat Gene 217 [1987] 217-225; and Dawson Plant

Mol. Biol. 23 [1993] 97).

As mentioned, the construct and/or the vector of the present invention may include a transcriptional initiation region which may provide for regulated or constitutive expression. Any suitable promoter may be used for the transcriptional initiation region, such as a tissue specific promoter. In one aspect, preferably the promoter is the patatin promoter or the E35S promoter. In another aspect, preferably the promoter is the SBE promoter.

If, for example, the organism is a plant then the promoter can be one that affects expression of the nucleotide sequence in any one or more of seed, tuber, stem, sprout, root and leaf tissues, preferably tuber. By way of example, the promoter for the nucleotide sequence of the present invention can be the α-Amy 1 promoter (otherwise known as the Amy 1 promoter, the Amy 637 promoter or the α-Amy 637 promoter) as described in our co-pending UK patent application No. 9421292.5 filed

21 October 1994. Alternatively, the promoter for the nucleotide sequence of the present invention can be the α-Amy 3 promoter (otherwise known as the Amy 3

promoter, the Amy 351 promoter or the α-Amy 351 promoter) as described in our co-pending UK patent application No. 9421286.7 filed 21 October 1994.

The present invention also encompasses the use of a promoter to express a nucleotide sequence according to the present invention, wherein a pan of the promoter is inactivated but wherein the promoter can still function as a promoter. Partial inactivation of a promoter in some instances is advantageous. In particular, with the Amy 351 promoter mentioned earlier it is possible to inactivate a part of it so that the partially inactivated promoter expresses the nucleotide sequence of the present invention in a more specific manner such as in just one specific tissue type or organ.

The term "inactivated" means partial inactivation in the sense that the expression pattern of the promoter is modified but wherein the partially inactivated promoter still functions as a promoter. However, as mentioned above, the modified promoter is capable of expressing a gene coding for the enzyme of the present invention in at least one (but not all) specific tissue of the original promoter. Examples of partial inactivation include altering the folding pattern of the promoter sequence, or binding species to parts of the nucleotide sequence, so that a part of the nucleotide sequence is not recognised by, for example, RNA polymerase. Another, and preferable, way of partially inactivating the promoter is to truncate it to form fragments thereof. Another way would be to mutate at least a part of the sequence so that the RNA polymerase can not bind to that part or another part. Another modification is to mutate the binding sites for regulatory proteins for example the CreA protein known from filamentous fungi to exert carbon catabolite repression, and thus abolish the catabolite repression of the native promoter.

The construct and/or the vector of the present invention may include a transcriptional termination region.

The nucleotide according to the present invention can be expressed in combination (but not necessarily at the same time) with an additional construct. Thus the present invention also provides a combination of constructs comprising a first construct comprising the nucleotide sequence according to the present invention operatively

linked to a first promoter; and a second construct comprising a GOI operatively linked to a second promoter (which need not be the same as the first promoter). With this aspect of the present invention the combination of constructs may be present in the same vector, plasmid, cells, tissue, organ or organism. This aspect of the present invention also covers methods of expressing the same, preferably in specific cells or tissues, such as expression in just a specific cell or tissue, of an organism, typically a plant. With this aspect of the present invention the second construct does not cover the natural combination of the gene coding for an enzyme ordinarily associated with the wild type gene promoter when they are both in their natural environment.

An example of a suitable combination would be a first construct comprising the nucleotide sequence of the present invention and a promoter, such as the promoter of the present invention, and a second construct comprising a promoter, such as the promoter of the present invention, and a GOI wherein the GOI codes for another starch branching enzyme either in sense or antisense orientation.

The above comments relating to the term "construct" for the antisense nucleotide aspect of the present invention are equally applicable to the term "construct" for the promoter aspect of the present invention. In this regard, the term includes the promoter according to the present invention directly or indirectly attached to a GOI.

The term "GOI" with reference to the promoter aspect of the present invention or the combination aspect of the present invention means any gene of interest, which need not necessarily code for a protein or an enzyme - as is explained later. A GOI can be any nucleotide sequence that is either foreign or natural to the organism in question, for example a plant.

Typical examples of a GOI include genes encoding for other proteins or enzymes that modify metabolic and catabolic processes. The GOI may code for an agent for introducing or increasing pathogen resistance.

The GOI may even be an antisense construct for modifying the expression of natural transcripts present in the relevant tissues. An example of such a GOI is the nucleotide sequence according to the present invention.

The GOI may even code for a protein that is non-natural to the host organism - e.g. a plant. The GOI may code for a compound that is of benefit to animals or humans. For example, the GOI could code for a pharmaceutically active protein or enzyme such as any one of the therapeutic compounds insulin, interferon, human serum albumin, human growth factor and blood clotting factors. The GOI may even code for a protein giving additional nutritional value to a food or feed or crop. Typical examples include plant proteins that can inhibit the formation of anti-nutritive factors and plant proteins that have a more desirable amino acid composition (e.g. a higher lysine content than a non-transgenic plant). The GOI may even code for an enzyme that can be used in food processing such as xylanases and α-galactosidase. The GOI can be a gene encoding for any one of a pest toxin, an antisense transcript such as that for α-amylase, a protease or a glucanase. Alternatively, the GOI can be a nucleotide sequence according to the present invention.

The GOI can be the nucleotide sequence coding for the arabinofuranosidase enzyme which is the subject of our co-pending UK patent application 9505479.7. The GOI can be the nucleotide sequence coding for the glucanase enzyme which is the subject of our co-pending UK patent application 9505475.5. The GOI can be the nucleotide sequence coding for the α-amylase enzyme which is the subject of our co-pending UK patent application 9413439.2. The GOI can be the nucleotide sequence coding for the α-amylase enzyme which is the subject of our co-pending UK patent application

9421290.9. The GOI can be any of the nucleotide sequences coding for the α-glucan lyase enzyme which are described in our co-pending PCT patent application PCT/EP94/03397.

In one aspect the GOI can even be a nucleotide sequence according to the present invention but when operatively linked to a different promoter.

The GOI could include a sequence that codes for one or more of a xylanase, an arabinase, an acetyl esterase, a rhamnogalacturonase, a glucanase, a pectinase, a branching enzyme or another carbohydrate modifying enzyme or proteinase. Alternatively, the GOI may be a sequence that is antisense to any of those sequences.

As mentioned above, the present invention provides a mechanism for selectively affecting a particular enzymatic activity. In an important application of the present invention it is now possible to reduce or eliminate expression of a genomic nucleotide sequence coding for a genomic protein or enzyme by expressing an antisense intron construct for that particular genomic protein or enzyme and (e.g. at the same time) expressing a recombinant version of that enzyme or protein - in other words the GOI is a recombinant nucleotide sequence coding for the genomic enzyme or protein. This application allows expression of desired recombinant enzymes and proteins in the absence of (or reduced levels of) respective genomic enzymes and proteins. Thus the desired recombinant enzymes and proteins can be easily separated and purified from the host organism. This particular aspect of the present invention is very advantageous over the prior art methods which, for example, rely on the use of anti- sense exon expression which methods also affect expression of the recombinant enzyme.

Thus, a further aspect of the present invention relates to a method of expressing a recombinant protein or enzyme in a host organism comprising expressing a nucleotide sequence coding for the recombinant protein or enzyme; and expressing a further nucleotide sequence wherein the further nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; wherein the intron is an intron normally associated with the genomic gene encoding a protein or an enzyme corresponding to the recombinant protein or enzyme; and wherein the further nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron. Additional aspects cover the combination of those nucleotide sequences including their incorporation in constructs, vectors, cells, tissues and transgenic organisms.

Therefore the present invention also relates to a combination of nucleotide sequences comprising a first nucleotide sequence coding for a recombinant enzyme; and a second nucleotide sequence which corresponds to an intron in antisense orientation; wherein the intron is an intron that is associated with a genomic gene encoding an enzyme corresponding to the recombinant enzyme; and wherein the second nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron.

The GOI may even code for one or more introns, such as any one or more of the intron sequences presented in the attached sequence listings. For example, the present invention also covers the expression of for example an antisense intron (e.g. SEQ. I.D. No. 15) in combination with for example a sense intron which preferably is not complementary to the antisense intron sequence (e.g. SEQ. I.D. No. 2).

The terms "cell", "tissue" and "organ" include cell, tissue and organ perse and when within an organism.

The term "organism" in relation to the present invention includes any organism that could comprise the nucleotide sequence according to the present invention and/or wherein the nucleotide sequence according to the present invention can be expressed when present in the organism. Preferably the organism is a starch producing organism such as any one of a plant, algae, fungi, yeast and bacteria, as well as cell lines thereof. Preferably the organism is a plant.

The term "starch producing organism" includes any organism that can biosynthesise starch. Preferably, the starch producing organism is a plant.

The term "plant" as used herein includes any suitable angiosperm, gymnosperm, monocotyledon and dicotyledon. Typical examples of suitable plants include vegetables such as potatoes; cereals such as wheat, maize, and barley; fruit; trees; flowers; and other plant crops. Preferably, the term means "potato" .

The term "transgenic organism" in relation to the present invention includes any organism that comprises the nucleotide sequence according to the present invention and/or products obtained therefrom, and/or wherein the nucleotide sequence according to the present invention can be expressed within the organism. Preferably the nucleotide sequence of the present invention is incorporated in the genome of the organism. Preferably the transgenic organism is a plant, more preferably a potato.

To prepare the host organism one can use prokaryotic or eukaryotic organisms. Examples of suitable prokaryotic hosts include E. coli and Bacillus subtilis. Teachings on the transformation of prokaryotic hosts is well documented in the art, for example see Sambrook et al (Sambrook et al. in Molecular Cloning: A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory Press).

Even though the enzyme according to the present invention and the nucleotide sequence coding for same are not disclosed in EP-B-0470145 and CA-A-2006454, those two documents do provide some useful background commentary on the types of techniques that may be employed to prepare transgenic plants according to the present invention. Some of these background teachings are now included in the following commentary.

The basic principle in the construction of genetically modified plants is to insert genetic information in the plant genome so as to obtain a stable maintenance of the inserted genetic material.

Several techniques exist for inserting the genetic information, the two main principles being direct introduction of the genetic information and introduction of the genetic information by use of a vector system. A review of the general techniques may be found in articles by Potrykus (Annu Rev Plant Physiol Plant Mol Biol [1991] 42:205- 225) and Christou (Agro-Food-Industry Hi-Tech March/ April 1994 17-27).

Thus, in one aspect, the present invention relates to a vector system which carries a nucleotide sequence or construct according to the present invention and which is capable of introducing the nucleotide sequence or construct into the genome of an organism, such as a plant.

The vector system may comprise one vector, but it can comprise two vectors. In the case of two vectors, the vector system is normally referred to as a binary vector system. Binary vector systems are described in further detail in Gynheung An et al. (1980), Binary Vectors, Plant Molecular Biology Manual A3, 1-19.

One extensively employed system for transformation of plant cells with a given promoter or nucleotide sequence or construct is based on the use of a Ti plasmid from Agrobacterium tumefaciens or a Ri plasmid from Agrobacterium rhizogenes An et al. (1986), Plant Physiol. 81, 301-305 and Butcher D.N. et al. (1980), Tissue Culture Methods for Plant Pathologists, eds. : D.S. Ingrams and J.P. Helgeson, 203-208.

Several different Ti and Ri plasmids have been constructed which are suitable for the construction of the plant or plant cell constructs described above. A non- limiting example of such a Ti plasmid is pGV3850.

The nucleotide sequence or construct of the present invention should preferably be inserted into the Ti-plasmid between the terminal sequences of the T-DNA or adjacent a T-DNA sequence so as to avoid disruption of the sequences immediately surrounding the T-DNA borders, as at least one of these regions appears to be essential for insertion of modified T-DNA into the plant genome.

As will be understood from the above explanation, if the organism is a plant the vector system of the present invention is preferably one which contains the sequences necessary to infect the plant (e.g. the vt ' r region) and at least one border part of a T- DNA sequence, the border part being located on the same vector as the genetic construct.

Furthermore, the vector system is preferably an Agrobacterium tumefaciens Ti- plasmid or an Agrobacterium rhizogenes Ri-plasmid or a derivative thereof. As these plasmids are well-known and widely employed in the construction of transgenic plants, many vector systems exist which are based on these plasmids or derivatives thereof.

In the construction of a transgenic plant the nucleotide sequence or construct of the present invention may be first constructed in a microorganism in which the vector can replicate and which is easy to manipulate before insertion into the plant. An example of a useful microorganism is E. coli, but other microorganisms having the above properties may be used. When a vector of a vector system as defined above has been constructed in E. coli, it is transferred, if necessary, into a suitable Agrobacterium strain, e.g. Agrobacterium tumefaciens. The Ti-plasmid harbouring the nucleotide sequence or construct of the present invention is thus preferably transferred into a suitable Agrobacterium strain, e.g. A. tumefaciens, so as to obtain an Agrobacterium cell harbouring the promoter or nucleotide sequence or construct of the present invention, which DNA is subsequently transferred into the plant cell to be modified.

If, for example, for the transformation the Ti- or Ri-plasmid of the plant cells is used, at least the right boundary and often however the right and the left boundary of the

Ti- and Ri-plasmid T-DNA, as flanking areas of the introduced genes, can be connected. The use of T-DNA for the transformation of plant cells has been intensively studied and is described in EP-A-120516; Hoekema, in: The Binary Plant Vector System Offset-drukkerij Kanters B.B. , Alblasserdam, 1985, Chapter V; Fraley, et al. , Crit. Rev. Plant Sci. , 4: 1-46; and An et al., EMBO J. (1985) 4:277-

284.

Direct infection of plant tissues by Agrobacterium is a simple technique which has been widely employed and which is described in Butcher D.N. et al. (1980), Tissue Culture Methods for Plant Pathologists, eds.: D.S. Ingrams and J.P. Helgeson, 203-

208. For further teachings on this topic see Potrykus (Annu Rev Plant Physiol Plant Mol Biol [1991] 42:205-225) and Christou (Agro-Food-Industry Hi-Tech March/ April

1994 17-27). With this technique, infection of a plant may be performed in or on a certain part or tissue of the plant, i.e. on a part of a leaf, a root, a stem or another pan of the plant.

Typically, with direct infection of plant tissues by Agrobacterium carrying the GOI

(such as the nucleotide sequence according to the present invention) and, optionally, a promoter, a plant to be infected is wounded, e.g. by cutting the plant with a razor blade or puncturing the plant with a needle or rubbing the plant with an abrasive. The wound is then inoculated with the Agrobacterium. The inoculated plant or plant part is then grown on a suitable culture medium and allowed to develop into mature plants.

When plant cells are constructed, these cells may be grown and maintained in accordance with well-known tissue culturing methods such as by culturing the cells in a suitable culture medium supplied with the necessary growth factors such as amino acids, plant hormones, vitamins, etc.

Regeneration of the transformed cells into genetically modified plants may be accomplished using known methods for the regeneration of plants from cell or tissue cultures, for example by selecting transformed shoots using an antibiotic and by subculturing the shoots on a medium containing the appropriate nutrients, plant hormones, etc.

Further teachings on plant transformation may be found in EP-A-0449375.

As reported in CA-A-2006454, a large amount of cloning vectors are available which contain a replication system in E. coli and a marker which allows a selection of the transformed cells. The vectors contain for example pBR 322, pUC series, M13 mp series, pACYC 184 etc. In this way, the nucleotide or construct of the present invention can be introduced into a suitable restriction position in the vector. The contained plasmid is then used for the transformation in E.coli. The E.coli cells are cultivated in a suitable nutrient medium and then harvested and lysed. The plasmid

is then recovered. As a method of analysis there is generally used sequence analysis, restriction analysis, electrophoresis and further biochemical-molecular biological methods. After each manipulation, the used DNA sequence can be restricted and connected with the next DNA sequence. Each sequence can be cloned in the same or different plasmid.

After the introduction of the nucleotide sequence or construct according to the present invention in the plants the presence and/or insertion of further DNA sequences may be necessary - such as to create combination systems as outlined above (e.g. an organism comprising a combination of constructs).

The above commentary for the transformation of procaryotic organisms and plants with the nucleotide sequence of the present invention is equally applicable for the transformation of those organisms with the promoter of the present invention.

In summation, the present invention relates to affecting enzyme activity by expressing antisense intron sequences.

Also, the present invention relates to a promoter useful for the expression of those antisense intron sequences.

The following samples have been deposited in accordance with the Budapest Treaty at the recognised depositary The National Collections of Industrial and Marine Bacteria Limited (NCIMB) at 23 St Machar Drive, Aberdeen, Scotland, AB2 IRY, United Kingdom, on 13 July 1995:

NCIMB 40753 (which refers to pBEA 8 as described herein);

NCIMB 40751 (which refers to λ-SBE 3.2 as described herein), and

NCIMB 40752 (which refers to λ-SBE 3.4 as described herein).

The following sample has been deposited in accordance with the Budapest Treaty at the recognised depositary The National Collections of Industrial and Marine Bacteria Limited (NCIMB) at 23 St Machar Drive, Aberdeen, Scotland, AB2 IRY, United Kingdom, on 9 July 1996:

NCIMB 40815 (which refers to pBEA 9 as described herein).

A highly preferred embodiment of the present invention therefore relates to a method of affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) comprising expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; wherein the nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron; wherein starch branching enzyme activity is affected and/or the levels of amylopectin are affected and/or the composition of starch is changed; and wherein the nucleotide sequence is obtainable from NCIMB 40753 or NCIMB 40815, or is antisense to any one or more of the intron sequences obtainable from either λ-SBE

3.2 (NCIMB 40751) or λSBE 3.4 (NCIMB 40752) or a variant, derivative or homologue thereof.

A more highly preferred aspect of the present invention therefore relates to a method of affecting enzymatic activity in a plant (or a cell, a tissue or an organ thereof) comprising expressing in the plant (or a cell, a tissue or an organ thereof) a nucleotide sequence wherein the nucleotide sequence codes, partially or completely, for an intron in an antisense orientation; wherein the nucleotide sequence does not contain a sequence that is antisense to an exon sequence normally associated with the intron; wherein starch branching enzyme activity is affected and/or the levels of amylopectin are affected and/or the composition of starch is changed; wherein the nucleotide sequence comprises the sequence shown as any one of SEQ.I.D. No. 15 to SEQ.I.D. No. 27 or a variant, derivative or homologue thereof, including combinations thereof; and wherein the nucleotide sequence is obtainable from NCIMB 40753 or NCIMB 40815, or is antisense to any one or more of the intron sequences obtainable from either λSBE 3.2 (NCIMB 40751) or λSBE 3.4 (NCIMB 40752) or a variant, derivative or homologue thereof.

The present invention will now be described only by way of example, in which reference is made to the following attached Figures:

Figure 1, which is a schematic representation of the biosynthesis of amylose and amylopectin;

Figure 2, which is a diagrammatic representation of the α-l-4-links and the α-1-6 links of amylopectin;

Figure 3, which is a diagrammatic representation of a possible antisense-RNA inhibition mechanism;

Figure 4, which is a diagrammatic representation of the exon-intron structure of a genomic SBE clone;

Figure 5, which is a plasmid map of pPATAl, which is 3936 bp in size;

Figure 6, which is a plasmid map of pABE6, which is 5106 bp in size;

Figure 7, which is a plasmid map of pVictorlV Man, which is 7080 bp in size;

Figure 8, which is a plasmid map of pBEA8, which is 9.54 kb in size;

Figure 9, which is a plasmid map of pBEA9, which is 9.54 kb in size;

Figure 10, which is a plasmid map of pBEP2, which is 10.32 kb in size;

Figure 11, which is a plasmid map of pVictor5a, which is 9.12 kb in size; and

Figure 12, which shows the full genomic nucleotide sequence for SBE including the promoter, exons and introns.

Figures 1 and 2 were referred to above in the introductory description concerning starch in general. Figure 3 was referred to above in the introductory description concerning antisense expression.

As mentioned, Figure 4 is a diagrammatic representation of the exon-intron structure of a genomic SBE clone, the sequence of which is shown in Figure 12. This clone, which has about 11.5 k base pairs, comprises 14 exons and 13 introns. The introns are numbered in increasing order from the 5' end to the 3' end and correspond to SEQ. I.D. No.s 1-13, respectively. Their respective antisense intron sequences are shown as SEQ. I.D. No.s 15-27.

In more detail. Figures 4 and 12 present information on the 11478 base pairs of a potato SBE gene. The 5' region from nucleotides 1 to 2082 contain the promoter region of the SBE gene. A TATA box candidate at nucleotide 2048 to 2051 is boxed. The homology between a potato SBE cDNA clone (Poulsen & Kreiberg (1993) Plant

Physiol 102: 1053-1054) and the exon DNAs begin at 2083 bp and end at 9666 bp.

The homology between the cDNA and the exon DNA is indicated by nucleotides in upper case letters, while the translated amino acid sequences are shown in the single letter code below the exon DNA. Intron sequences are indicated by lower case letters.

Figures 5 to 7 are discussed below. As mentioned, Figure 8 is a plasmid map of pBEA8, which is 9.54 k base pairs in size; and Figure 9 is a plasmid map of pBEA9, which is 9.54 k base pairs in size. Each of pBEA 8 and pBEA 9 comprises an antisense sequence to the first intron sequence of the potato SBE gene. This first intron sequence, which has 1177 base pairs, is shown in Figure 4 and lies between the first exon and the second exon.

These experiments and aspects of the present invention are now discussed in more detail.

EXPERIMENTAL PROTOCOL

ISOLATION, SUBCLONING IN PLASMIDS, AND SEQUENCING OF GENOMIC SBE CLONES

Various clones containing the potato SBE gene were isolated from a Desiree potato genomic library (Clontech Laboratories Inc. , Palo Alto CA, USA) using radioactively labelled potato SBE cDNA (Poulsen & Kreiberg (1993) Plant Physiol. 102: 1053- 1054) as probe. The fragments of the isolated λ-phages containing SBE DNA (λSBE 3.2 - NCIMB 40751 - and λSBE-3.4 - NCIMB 40752) were identified by Southern analysis and then subcloned into pBluescript II vectors (Clontech Laboratories Inc. , Palo Alto CA, USA). λSBE 3.2 contains a 15 kb potato DNA insert and λSBE-3.4 contains a 13 kb potato DNA insert. The resultant plasmids were called pGB3, pGBll , pGB15, pGBlό and pGB25 (see discussion below). The respective inserts were then sequenced using the Pharmacia Autoread Sequencing Kit (Pharmacia,

Uppsala) and a A.L.F. DNA sequencer (Pharmacia, Uppsala).

/04112

26

In total, a stretch of 11.5 kb of the SBE gene was sequenced. The sequence was deduced from the above-mentioned plasmids, wherein: pGB25 contains the sequences from 1 bp to 836 bp, pGB15 contains the sequences from 735 bp to 2580 bp, pGB16 contains the sequences from 2580 bp to 5093 bp, pGBl l contains the sequences from 3348 bp to 7975 bp, and pGB3 contains the sequences from 7533 bp to 11468 bp.

In more detail, pGB3 was constructed by insertion of a 4 kb EcoRl fragment isolated from λSBE 3.2 into the EcoRl site of pBluescript II SK (+). pGBl l was constructed by insertion of a 4.7 kb Xhol fragment isolated from λSBE 3.4 into the Xhol site of pBluescript II SK (+). pGB15 was constructed by insertion of a 1.7 kb Spel fragment isolated from λSBE 3.4 into the Spel site of pBluescript II SK (+). pGB16 was constructed by inseπion of a 2.5 kb Spel fragment isolated from λSBE 3.4 into the Spel site of pBluescript II SK (+). For the construction of pGB25 a PCR fragment was produced with the primers

5' GGAATTCCAGTCGCAGTC TACATTAC 3'

and

5' CGG GAT CCA GAG GCA TTA AGA TTT CTG G 3'

and λSBE 3.4 as a template.

The PCR fragment was digested with BamHl and EcoRl, and inserted in pBluescript II SK (+) digested with the same restriction enzymes.

CONSTRUCTION OF SBE ANTISENSE INTRON PLASMIDS pBEA8 and pBEA9

The SBE intron 1 was amplified by PCR using the oligonucleotides:

5' CGGGATCCAAAGAAATTCTCGAGGTTACATGG3'

and

5 " CGGGATCCGGGGTAATTTTTACTAATTTCATG3'

and the λSBE 3.4 phage containing the SBE gene as template.

The PCR product was digested with BamHl and inserted in an antisense orientation in the BamHl site of plasmid pPATAl (described in WO 94/24292) between the patatin promoter and the 35S terminator. This construction, pABE6, was digested with Kpnl, and the 2.4 kb "patatin promoter-SBE intron 1- 35S terminator" Kpnl fragment was isolated and inserted in the Kpnl site of the plant transformation vector pVictorlV Man. The Kpnl fragment was inserted in two orientations yielding plasmids pBEA8 and pBEA9. pVictorlV Man is shown in Figure 7 and is formed by insertion of a filled in Xbal fragment containing a E35S promoter-manA-35S terminator cassette isolated from plasmid pVictorlV SGiN Man (WO 94/24292) into the filled in Xhol site of pVictor IV. The pVictor regions of pVictor IV Man contained between the coordinates 2.52 bp to 0.32 bp (see Figure 7).

PRODUCTION OF TRANSGENIC POTATO PLANTS

Axenic stock cultures

Shoot cultures of Solanum tuberosum 'Bintje' and 'Dianella' are maintained on a substrate (LS) of a formula according to Linsmaier, E.U. and Skoog, F. (1965), Physiol. Plant. 18: 100-127, in addition containing 2 μM silver thiosulphate at 25°C and 16 h light/8 h dark.

The cultures were subcultured after approximately 40 days. Leaves were then cut off the shoots and cut into nodal segments (approximately 0.8 cm) each containing one node.

Inoculation of potato tissues

Shoots from approximately 40 days old shoot cultures (height approximately 5-6 cms) were cut into internodal segments (approximately 0.8 cm). The segments were placed into liquid LS-substrate containing the transformed Agrobacterium tumefaciens containing the binary vector of interest. The Agrobacterium were grown overnight in YMB-substrate (di-potassium hydrogen phosphate, trihydrate (0.66 g/1); magnesium sulphate, heptahydrate (0.20 g/1); sodium chloride (0.10 g/1); mannitol (10.0 g/1); and yeast extract (0.40 g/1)) containing appropriate antibiotics (corresponding to the resistance gene of the Agrobacterium strain) to an optical density at 660 nm (OD-660) of approximately 0.8, centrifuged and resuspended in the LS-substrate to an OD-660 of 0.5.

The segments were left in the suspension of Agrobacterium for 30 minutes and then the excess of bacteria were removed by blotting the segments on sterile filter paper.

Co-cultivation

The shoot segments were co-cultured with bacteria for 48 hours directly on LS- substrate containing agar (8.0 g/1), 2,4-dichlorophenoxyacetic acid (2.0 mg/1) and trans-zeatin (0.5 mg/1). The substrate and also the explants were covered with sterile filter papers, and the petri dishes were placed at 25°C and 16 h light/ 8 dark.

"Washing" procedure

After the 48 h on the co-cultivation substrate the segments were transferred to containers containing liquid LS-substrate containing 800 mg/1 carbenicillin. The containers were gently shaken and by this procedure the major part of the Agrobacterium was either washed off the segments and/or killed.

Selection

After the washing procedure the segments were transferred to plates containing the LS-substrate, agar (8 g/1), trans-zeatin (1-5 mg/1), gibberellic acid (0.1 mg/1), carbenicillin (800 mg/1) , and kanamycin sulphate (50- 100 mg/1) or phosphinotricin ( 1 -

5 mg/1) or mannose (5 g/1) depending on the vector construction used.

The segments were sub-cultured to fresh substrate each 3-4 weeks.

In 3 to 4 weeks, shoots develop from the segments and the formation of new shoots continued for 3-4 months.

Rooting of regenerated shoots

The regenerated shoots were transferred to rooting substrate composed of LS- substrate, agar (8 g/1) and carbenicillin (800 mg/1).

The transgenic genotype of the regenerated shoot was verified by testing the rooting ability on the above mentioned substrates containing kanamycin sulphate (200 mg/1), by performing NPTII assays (Radke, S. E. et al, Theor. Appl. Genet. (1988), 75:

685-694) or by performing PCR analysis according to Wang et al (1993, NAR 21 pp 4153-4154). Plants which were not positive in any of these assays were discarded or used as controls. Alternatively, the transgenic plants could be verified by performing a GUS assay on the co-introduced β-glucuronidase gene according to Hodal, L. et al. (PI. Sci. (1992), 87: 115-122).

Transfer to soil

The newly rooted plants (height approx. 2-3 cms) were transplanted from rooting substrate to soil and placed in a growth chamber (21°C, 16 hour light 200-

400uE/m 2 /sec). When the plants were well established they were transferred to the greenhouse, where they were grown until tubers had developed and the upper part of

the plants were senescing.

Harvesting

The potatoes were harvested after about 3 months and then analysed.

BRANCHING ENZYME ANALYSIS

The SBE expression in the transgenic potato lines were measured using the SBE assays described by Blennow and Johansson (Phytochemistry (1991) 30:437-444) and by standard Western procedures using antibodies directed against potato SBE.

STARCH ANALYSIS

Starch was isolated from potato tubers and analysed for the amylose:amylopectin ratio

(Hovenkamp-Hermelink et al. (1988) Potato Research 31:241-246). In addition, the chain length distribution of amylopectin was determined by analysis of isoamylase digested starch on a Dionex HPAEC.

The number of reducing ends in isoamylase digested starch was determined by the method described by N. Nelson (1944) J. Biol.Chem. 153:375-380.

The results revealed that there was a reduction in the level of synthesis of SBE and/or the level of activity of SBE and/or the composition of starch SBE in the transgenic plants.

CONSTRUCTION OF SBE PROMOTER CONSTRUCT

An SBE promoter fragment was amplified from λ-SBE 3.4 using primers:

5 ' CCA TCG ATA CTT TAA GTG ATT TGA TGG C 3'

and

5' CGGGATCCTGTTCTGATTCTTGATTTC C 3'.

The PCR product was digested with Clal and BamHl. The resultant 1.2 kb fragment was then inserted in pVictor5a (see Figure 11) linearised with Clal and Bglll yielding pBEP2 (see Figure 10).

STARCHBRANCHINGENZYMEMEASUREMENTS OFPOTATOTUBERS

Potatoes from potato plants transformed with either pBEA8 or pBEA9 were cut in small pieces and homogenised in extraction buffer (50 mM Tris-HCl pH 7.5, Sodium-dithionit (0.1 g/1), and 2 mM DTT) using a Ultra-Turax homogenizer; 1 g of Dowex xl. was added pr. 10 g of tuber. The crude homogenate was filtered through a miracloth filter and centrifuged at 4°C for 10 minutes at 24.700 g. The supernatant was used for starch branching enzyme assays.

The starch branching enzyme assays were carried out at 25°C in a volume of 400 μl composed of 0.1 M Na citrate buffer pH 7.0, 0.75 mg/ml amylose, 5 mg/ml bovine serum albumin and the potato extract. At 0, 15, 30 and 60 minutes aliqouts of 50 μl were removed from the reaction into 20 μl 3 N HC1. 1 ml of iodine solution was added and the decrease in absorbance at 620 nm was measured with an ELISA spectrophotometer.

The starch branching enzyme (SBE) levels were measured in tuber extracts from 34 transgenic Dianella potato plants transformed with plasmid pBEA9.

The BEA9 transformed transgenic lines produced tubers which have SBE levels that are 10% to 15 % of the SBE levels found in non transformed Dianella plants.

7/04112

32

SUMMATION

The above-mentioned examples relate to the isolation and sequencing of a gene for potato SBE. The examples further demonstrate that it is possible to prepare SBE intron antisense constructs. These SBE intron antisense constructs can be introduced into plants, such as potato plants. After introduction, a reduction in the level of synthesis of SBE and/or the level of activity of SBE and/or the composition of starch in plants can be achieved.

Without wishing to be bound by theory it is believed that the expressed anti-sense nucleotide sequence of the present invention binds to sense introns on pre-mRNA and thereby prevents pre-mRNA splicing and/or subsequent translation of mRNA. This binding therefore is believed to reduce the level of plant enzyme activity (in particular SBE activity), which in turn for SBE activity is believed to influence the amylose: amylopectin ratio and thus the branching pattern of amylopectin.

Thus, the present invention provides a method wherein it is possible to manipulate the starch composition in plants, or tissues or cells thereof, such as potato tubers, by reducing the level of SBE activity by using an antisense-RNA technique using antisense intron sequences.

In summation the present invention therefore relates to the surprising use of antisense intron sequences in a method to affect enzymatic activity in plants.

Other modifications of the present invention will be apparent to those skilled in the art without departing from the scope of the present invention.

For example, it may be possible to use antisense promoter sequences to affect enzymatic activity, such as antisense SBE promoter - such as a nucleotide sequence comprising the nucleotide sequence shown as SEQ. I.D. No. 28 or a variant, derivative or homologue thereof.

04112

33

The following pages present a number of sequence listings which have been consecutively numbered from SEQ.I.D. No. 1 - SEQ.I.D. No. 29. In brief, SEQ.I.D. No. 1 - SEQ.I.D. No. 13 represent sense intron sequences (genomic DNA); SEQ.I.D. No. 14 represents the SBE promoter sequence (genomic sequence); SEQ.I.D. No. 15 - SEQ.I.D. No. 27 represent antisense intron sequences: and SEQ.

I.D. No. 28 represents is the sequence complementary to the SBE promoter sequence - i.e. the SBE promoter sequence in antisense orientation. The full genomic nucleotide sequence for SBE including the promoter, exons and introns is shown as SEQ. I.D. No. 29 and is explained by way of Figures 4 and 12 which highlight particular gene features.

/04112

34 SEQUENCE INFORMATION

SEQ.I.D. No. 1

Intron 1 sequence (1167 bp) .

GTAATTTTTACTAATTTCATGTTAATTTCAATTATTTTTAGCCTTTGCATTTCATTT TCCAATATATCT GGATCATCTCCTTAGTTTTTTATTTTATTTTTTATAATATCAAATATGGAAGAAAAATGA CACTTGTAG AGCCATATGTAAGTATCATGTGACAAATTTGCAAGGTGGTTGAGTGTATAAAATTCAAAA ATTGAGAGA TGGAGGGGGGGTGGGGGBARAGACAATATTTAGAAAGAGTGTTCTAGGAGGTTATGGAGG ACACGGATG AGGGGTAGAAGGTTAGTTAGGTATTTGAGTGTTGTCTGGCTTATCCTTTCATACTAGTAG TCGTGGAAT TATTTGGGTAGTTTCTTGTTTTGTTATTTGATCTTTGTTATTCTATTTTCTGTTTCTTGT ACTTCGATT ATTGTATTATATATCTTGTCGTAGTTATTGTTCCTCGGTAAGAATGCTCTAGCATGCTTC CTTTAGTGT TTTATCATGCCTTCTTTATATTCGCGTTGCTTTGAAATGCTTTTACTTTAGCCGAGGGTC TATTAGAAA CAATCTCTCTATCTCGTAAGGTAGGGGTAAAGTCCTCACCACACTCCACTTGTGGGATTA CATTGTGTT TGTTGTTGTAAATCAATTATGTATACATAATAAGTGGATTTTTTACAACACAAATACATG GTCAAGGGC AAAGTTCTGAACACATAAAGGGTTCATTATATGTCCAGGGATATGATAAAAATTGTTTCT TTGTGAAAG TTATATAAGATTTGTTATGGCTTTTGCTGGAAACATAATAAGTTATAATGCTGAGATAGC TACTGAAGT TTGTTTTTTCTAGCCTTTTAAATGTACCAATAATAGATTCCGTATCGAACGAGTATGTTT TGATTACCT GGTCATGATGTTTCTATTTTTTACATTTTTTTGGTGTTGAACTGCAATTGAAAATGTTGT ATCCTATGA GACGGATAGTTGAGAATGTGTTCTTTGTATGGACCTTGAGAAGCTCAAACGCTACTCCAA TAATTTCTA TGAATTCAAATTCAGTTTATGGCTACCAGTCAGTCCAGAAATTAGGATATGCTGCATATA CTTGTTCAA TTATACTGTAAAATTTCTTAAGTTCTCAAGATATCCATGTAACCTCGAGAATTTCTTTGA CAG

SEQ.I.D. No. 2

Intron 2 sequence (321 bp) .

GTATGTTTGATAATTTATATGGTTGCATGGATAGTATATAAATAGTTGGAAAACTTC TGGACTGGTGCT

CATGGCATATTTGATCTGTGCACCGTGTGGAGATGTCAAACATGTGTTACTTCGTTC CGCCAATTTATA

ATACCTTAACTTGGGAAAGACAGCTCTTTACTCCTGTGGGCATTTGTTATTTGAATT ACAATCTTTATG

AGCATGGTGTTTTCACATTATCAACTTCTTTCATGTGGTATATAACAGTTTTTAGCT CCGTTAATACCT TTCTTCTTTTTGATATAAACTAACTGTGGTGCATTGCTTGCBKKK

SEQ.I.D. No. 3

Intron 3 sequence (504 bp) .

GTAACAGCCAAAAGTTGTGCTTTAGGCAGTTTGACCTTATTTTGGAAGATGAATTGT TTATACCTACTT TGACTTTGCTAGAGAATTTTGCATACCGGGGAGTAAGTAGTGGCTCCATTTAGGTGGCAC CTGGCCATT TTTTTGATCTTTTAAAAAGCTGTTTGATTGGGTCTTCAAAAAAGTAGACAAGGTTTTTGG AGAAGTGAC ACACCCCCGGAGTGTCAGTGGCAAAGCAAAGATTTTCACTAAGGAGATTCAAAATATAAA AAAAGTATA GACATAAAGAAGCTGAGGGGATTCAACATGTACTATACAAGCATCAAATATAGTCTTAAA GCAATTTTG TAGAAATAAAGAAAGTCTTCCTTCTGTTGCTTCACAATTTCCTTCTATTATCATGAGTTA CTCTTTCTG TTCGAAATAGCTTCCTTAATATTAAATTCATGATACTTTTGTTGAGATTTAGCAGTTTTT TCTTGTGTA AACTGCTCTCTTTTTTTGCAG

SEQ . I . D . No . 4

Intron 4 sequence (146 bp) .

GTAGGTCCTCGTCTACTACAAAATAGTAGTTTCCATCATCATAACAGATTTTCCTAT TAAAGCATGATG TTGCAGCATCATTGGCTTTCTTACATGTTCTAATTGCTATTAAGGTTATGCTTCTAATTA ACTCATCCA CAATGCAG

SEQ.I.D. No. 5

Intron 5 sequence (218 bp) .

GTTTTGTTATTCATACCTTGAAGCTGAATTTTGAACACCATCATCACAGGCATTTCG ATTCATGTTCTT ACTAGTCTTGTTATGTAAGACATTTTGAAATGCAAAAGTTAAAATAATTGTGTCTTTACT AATTTGGAC TTGATCCCATACTCTTTCCCTTAACAAAATGAGTCAATTCTATAAGTGCTTGAGAACTTA CTACTTCAG CAATTAAACAG

SEQ.I.D. No. 6

Intron 6 sequence ( 198 bp) .

GTATTTTAAATTTATTTCTACAACTAAATAATTCTCAGAACAATTGTTAGATAGAAT CCAAATATATAC GTCCTGAAAGTATAAAAGTACTTATTTTCGCCATGGGCCTTCAGAATATTGGTAGCCGCT GAATATCAT GATAAGTTATTTATCCAGTGACATTTTTATGTTCACTCCTATTATGTCTGCTGGATACAG

SEQ . I . D . No . 7

Intron 7 sequence ( 208bp)

GTTTGTCTGTTTCTATTGCATTTTAAGGTTCATATAGGTTAGCCACGGAAAATCTCA CTCTTTGTGAGG TAACCAGGGTTCTGATGGATTATTCAATTTTCTCGTTTATCATTTGTTTATTCTTTTCAT GCATTGTGT TTCT TTTCAATATCCCTCTTATTTGGAGGTAATTTTTCTCATCTATTCACTTTTAGCTTCTAAC CACAG

SEQ . I . D . No . 8

Intron 8 sequence (293 bp) .

GTATGTCTTACATCTTTAGATATTTTGTGATAATTACAATTAGTTTGGCTTACTTGA ACAAGATTCATT CCTCAAAATGACCTGAACTGTTGAACATCAAAGGGGTTGAAACATAGAGGAAAACAACAT GATGAATGT TTCCATTGTCTAGGGATTTCTATTATGTTGCTGAGAACAAATGTCATCTTAAAAAAAACA TTGTTTACT TTTTTGTAGTATAGAAGATTACTGTATAGAGTTTGCAAGTGTGTCTGTTTTGGAGTAATT GTGAAATGT TTGATGAACTTGTACAG

SEQ.I.D. No. 9

Intron 9 sequence (376 bp) .

GTTCAAGTATTTTGAATCGCAGCTTGTTAAATAATCTAGTAATTTTTAGATTGCTTA CTTGGAAGTCTA CTTGGTTCTGGGGATGATAGCTCATTTCATCTTGTTCTACTTATTTTCCAACCGAATTTC TGATTTTTG TTTCGAGATCCAAGTATTAGATTCATTTACACTTATTACCGCCTCATTTCTACCACTAAG GCCTTGATG AGCAGCTTAAGTTGATTCTTTGAAGCTATAGTTTCAGGCTACCAATCCACAGCCTGCTAT ATTTGTTGG

/04112

36

ATACTTACCTTTTCTTTACAATGAAGTGATACTAATTGAAATGGTCTAAATCTGATA TCTATATTTCTC CGTCTTTCCTCCCCCTCATGATGAAATGCAG

SEQ.I.D. No. 10

Intron 10 sequence ( 172 bp) .

GTAAAATCATCTAAAGTTGAAAGTGTTGGGTTTATGAAGTGCTTTAATTCTATCCAA GGACAAGTAGAA ACCTTTTTACCTTCCATTTCTTGATGATGGATTTCATATTATTTAATCCAATAGCTGGTC AAATTCGGT AATAGCTGTACTGATTAGTTACTTCACTTTGCAG

SEQ . I . D . NO . 11

Intron 11 sequence (145 bp) .

GTATATATGTTTTACTTATCCATGAAATTATTGCTCTGCTTGTTTTTAATGTACTGA ACAAGTTTTATG GAGAAGTAACTGAAACAAATCATTTTCACATTGTCTAATTTAACTCTTTTTTCTGATCCT CGCATGACG AAAACAG

SEQ.I.D. No. 12

Intron 12 sequence (242 bp) .

GTAAGGATTTGCTTGAATAACTTTTGATAATAAGATAACAGATGTAGGGTACAGTTC TCTCACCAAAAA

GAACTGTAATTGTCTCATCCATCTTTAGTTGTATAAGATATCCGACTGTCTGAGTTC GGAAGTGTTTGA

GCCTCCTGCCCTCCCCCTGCGTTGTTTAGCTAATTCAAAAAGGAGAAAACTGTTTAT TGATGATCTTTG

TCTTCATGCTGACATACAATCTGTTCTCATGACAG

SEQ.I.D. No. 13

Intron 13 sequence (797 bp) .

GTACAGTTCTTGCCGTGTGACCTCCCTTTTTATTGTGGTTTTGTTCATAGTTATTTG AATGCGATAGAA

GTTAACTATTGATTACCGCCACAATCGCCAGTTAAGTCCTCTGAACTACTAATTTGA AAGGTAGGAATA

GCCGTAATAAGGTCTACTTTTGGCATCTTACTGTTACAAAACAAAAGGATGCCAAAA AAATTCTTCTCT

ATCCTCTTTTTCCCTAAACCAGTGCATGTAGCTTGCACCTGCATAAACTTAGGTAAA TGATCAAAAATG

AAGTTGATGGGAACTTAAAACCGCCCTGAAGTAAAGCTAGGAATAGTCATATAATGT CCACCTTTGGTG

TCTGCGCTAACATCAACAACAACATACCTCGTGTAGTCCCACAAAGTGGTTTCAGGG GGAGGGTAGAGT

GTATGCAAAACTTACTCCTATCTCAGAGGTAGAGAGGATTTTTTCAATAGACCCTTG GCTCAAGAAAAA

AAGTCCAAAAAGAAGTAACAGAAGTGAAAGCAACATGTGTAGCTAAAGCGACCCAAC TTGTTTGGGACT

GAAGTAGTTGTTGTTGTTGAAACAGTGCATGTAGATGAACACATGTCAGAAAATGGA CAACACAGTTAT

TTTGTGCAAGTCAAAAAAATGTACTACTATTTCTTTGTGCAGCTTTATGTATAGAAA AGTTAAATAACT

AATGAATTTTGCTAGCAGAAAAATAGCTTGGAGAGAAATTTTTTATATTGAACTAAG CTAACTATATTC

ATCTTTCTTTTTGCTTCTTCTTCTCCTTGTTTGTGAAG

/04112

37

SEQ . I . D . No . 14

DNA sequence of the SBE gene promoter region.

ATCATGGCCAATTACTGGTTCAAATGCATTACTTCCTTTCAGATTCTTTCGAGTTCT CAT 60

GACCGGTCCTACTACAGACGATACTAACCCGTGGAACTGTTGCATCTGCTTCTTAGA ACT 120

CTATGGCTATTTTCGTTAGCTTGGCGTCGGTTTGAACATAGTTTTTGTTTTCAAACT CTT 180

CATTTACAGTCAAAATGTTGTATGGTTTTTGTTTTCCTCAATGATGTTTACAGTGTT GTG 240

TTGTCATCTGTACTTTTGCCTATTACTTGTTTTGAGTTACATGTTAAAAAAGTGTTT ATT 300

TTGCCATATTTTGTTCTCTTATTATTATTATCATACATACATTATTACAAGGAAAAG ACA 360

AGTACACAGATCTTAACGTTTATGTTCAATCAACTTTTGGAGGCATTGACAGGTACC ACA 420

AATTTTGAGTTTATGATTAAGTTCAATCTTAGAATATGAATTTAACATCTATTATAG ATG 480

CATAAAAATAGCTAATGATAGAACATTGACATTTGGCAGAGCTTAGGGTATGGTATA TCC 540

AACGTTAATTTAGTAATTTTTGTTACGTACGTATATGAAATATTGAATTAATCACAT GAA 600

CGGTGGATATTATATTATGAGTTGGCATCAGCAAAATCATTGGTGTAGTTGACTGTA GTT 660

GCAGATTTAATAATAAAATGGTAATTAACGGTCGATATTAAAATAACTCTCATTTCA AGT 720

GGGATTAGAACTAGTTATTAAAAAAATGTATACTTTAAGTGATTTGATGGCATATAA TTT 780

AAAGTTTTTCATTTCATGCTAAAATTGTTAATTATTGTAATGTAGACTGCGACTGGA ATT 840

ATTATAGTGTAAATTTATGCATTCAGTGTAAAATTAAAGTATTGAACTTGTCTGTTT TAG 900

AAAATACTTTATACTTTAATATAGGATTTTGTCATGCGAATTTAAATTAATCGATAT TGA 960

ACACGGAATACCAAAATTAAAAAGGATACACATGGCCTTCATATGAACCGTGAACCT TTG 1020

ATAACGTGGAAGTTCAAAGAAGGTAAAGTTTAAGAATAAACTGACAAATTAATTTCT TTT 1080

ATTTGGCCCACTACTAAATTTGCTTTACTTTCTAACATGTCAAGTTGTGCCCTCTTA GTT 1140

GAATGATATTCATTTTTCATCCCATAAGTTCAATTTGATTGTCATACCACCCATGAT GTT 1200

CTGAAAAATGCTTGGCCATTCACAAAGTTTATCTTAGTTCCTATGAACTTTATAAGA AGC 1260

TTTAATTTGACATGTTATTTATATTAGATGATATAATCCATGACCCAATAGACAAGT GTA 1320

TTAATATTGTAACTTTGTAATTGAGTGTGTCTACATCTTATTCAATCATTTAAGGTC ATT 1380

AAAATAAATTATTTTTTGACATTCTAAAACTTTAAGCAGAATAAATAGTTTATCAAT TAT 1440

TAAAAACAAAAAACGACTTATTTATAAATCAACAAACAATTTTAGATTGCTCCAACA TAT 1500

TTTTCCAAATTAAATGCAGAAAATGCATAATTTTATACTTGATCTTTATAGCTTATT TTT 1560

TTTAGCCTAACCAACGAATATTTGTAAACTCACAACTTGATTAAAAGGGATTTACAA CAA 1620

GATATATATAAGTAGTGACAAATCTTGATTTTAAATATTTTAATTTGGAGGTCAAAA TTT 1680

TACCATAATCATTTGTATTTATAATTAAATTTTAAATATCTTATTTATACATATCTA GTA 1740

AACTTTTAAATATACGTATATACAAAATATAAAATTATTGGCGTTCATATTAGGTCA ATA 1800

AATCCTTAACTATATCTGCCTTACCACTAGGAGAAAGTAAAAAACTCTTTACCAAAA ATA 1860

C TGTATTATGTATACAAAAAGTCGATTAGATTACCTAAATAGAAATTGTATAACGAGTA 1920

AGTAAGTAGAAATATAAAAAAACTACAATACTAAAAAAAATATGTTTTACTTCAATT TCG 1980

AAACTAATGGGGTCTGAGTGAAATATTCAGAAAGGGGAGGACTAACAAAAGGGTCAT AAT 2040

GTTTTTTTATAAAAAGCCACTAAAATGAGGAAATCAAGAATCAGAACATACAAGAAG GCA 2100

GCAGCTGAAGCAAAGTACCATAATTTAATCAATGGAAATTAATTTCAAAGTTTTATC AAA 2160 ACCCATTCG

/04112

38

SEQ . I . D . No . 15

Intron 1 antisense sequence (1167 bp) .

CTGTCAAAGAAATTCTCGAGGTTACATGGATATCTTGAGAACTTAAGAAATTTTACA GTATAATTGAAC AAGTATATGCAGCATATCCTAATTTCTGGACTGACTGGTAGCCATAAACTGAATTTGAAT TCATAGAAA TTATTGGAGTAGCGTTTGAGCTTCTCAAGGTCCATACAAAGAACACATTCTCAACTATCC GTCTCATAG GATACAACATTTTCAATTGCAGTTCAACACCAAAAAAATGTAAAAAATAGAAACATCATG ACCAGGTAA TCAAAACATACTCGTTCGATACGGAATCTATTATTGGTACATTTAAAAGGCTAGAAAAAA CAAACTTCA GTAGCTATCTCAGCATTATAACTTATTATGTTTCCAGCAAAAGCCATAACAAATCTTATA TAACTTTCA CAAAGAAACAATTTTTATCATATCCCTGGACATATAATGAACCCTTTATGTGTTCAGAAC TTTGCCCTT GACCATGTATTTGTGTTGTAAAAAATCCACTTATTATGTATACATAATTGATTTACAACA ACAAACACA ATGTAATCCCACAAGTGGAGTGTGGTGAGGACTTTACCCCTACCTTACGAGATAGAGAGA TTGTTTCTA ATAGACCCTCGGCTAAAGTAAAAGCATTTCAAAGCAACGCGAATATAAAGAAGGCATGAT AAAACACTA AAGGAAGCATGCTAGAGCATTCTTACCGAGGAACAATAACTACGACAAGATATATAATAC AATAATCGA AGTACAAGAAACAGAAAATAGAATAACAAAGATCAAATAACAAAACAAGAAACTACCCAA ATAATTCCA CGACTACTAGTATGAAAGGATAAGCCAGACAACACTCAAATACCTAACTAACCTTCTACC CCTCATCCG TGTCCTCCATAACCTCCTAGAACACTCTTTCTAAATATTGTCTYTVCCCCCACCCCCCCT CCATCTCTC AATTTTTGAATTTTATACACTCAACCACCTTGCAAATTTGTCACATGATACTTACATATG GCTCTACAA GTGTCATTTTTCTTCCATATTTGATATTATAAAAAATAAAATAAAAAACTAAGGAGATGA TCCAGATAT ATTGGAAAATGAAATGCAAAGGCTAAAAATAATTGAAATTAACATGAAATTAGTAAAAAT TAC

SEQ.I.D. No. 16

Intron 2 antisense sequence (321 bp) .

MMMVGCAAGCAATGCACCACAGTTAGTTTATATCAAAAAGAAGAAAGGTATTAACGG AGCTAAAAACTG TTATATACCACATGAAAGAAGTTGATAATGTGAAAACACCATGCTCATAAAGATTGTAAT TCAAATAAC AAATGCCCACAGGAGTAAAGAGCTGTCTTTCCCAAGTTAAGGTATTATAAATTGGCGGAA CGAAGTAAC ACATGTTTGACATCTCCACACGGTGCACAGATCAAATATGCCATGAGCACCAGTCCAGAA GTTTTCCAA CTATTTATATACTATCCATGCAACCATATAAATTATCAAACATAC

SEQ.I.D. No. 17

Intron 3 antisense sequence (504 bp) .

CTGCAAAAAAAGAGAGCAGTTTACACAAGAAAAAACTGCTAAATCTCAACAAAAGTA TCATGAATTTAA TATTAAGGAAGCTATTTCGAACAGAAAGAGTAACTCATGATAATAGAAGGAAATTGTGAA GCAACAGAA GGAAGACTTTCTTTATTTCTACAAAATTGCTTTAAGACTATATTTGATGCTTGTATAGTA CATGTTGAA TCCCCTCAGCTTCTTTATGTCTATACTTTTTTTATATTTTGAATCTCCTTAGTGAAAATC TTTGCTTTG CCACTGACACTCCGGGGGTGTGTCACTTCTCCAAAAACCTTGTCTACTTTTTTGAAGACC CAATCAAAC AGCTTTTTAAAAGATCAAAAAAATGGCCAGGTGCCACCTAAATGGAGCCACTACTTACTC CCCGGTATG CAAAATTCTCTAGCAAAGTCAAAGTAGGTATAAACAATTCATCTTCCAAAATAAGGTCAA ACTGCCTAA AGCACAACTTTTGGCTGTTAC

SEQ . I . D . No . 18

Intron 4 antisense sequence (146 bp) .

CTGCATTGTGGATGAGTTAATTAGAAGCATAACCTTAATAGCAATTAGAACATGTAA GAAAGCCAATGA TGCTGCAACATCATGCTTTAATAGGAAAATCTGTTATGATGATGGAAACTACTATTTTGT AGTAGACGA GGACCTAC

SEQ.I.D. No. 19

Intron 5 antisense sequence (218 bp) .

CTGTTTAATTGCTGAAGTAGTAAGTTCTCAAGCACTTATAGAATTGACTCATTTTGT TAAGGGAAAGAG TATGGGATCAAGTCCAAATTAGTAAAGACACAATTATTTTAACTTTTGCATTTCAAAATG TCTTACATA ACAAGACTAGTAAGAACATGAATCGAAATGCCTGTGATGATGGTGTTCAAAATTCAGCTT CAAGGTATG AATAACAAAAC

SEQ.I.D. No. 20

Intron 6 antisense sequence ( 198 bp ) .

CTGTATCCAGCAGACATAATAGGAGTGAACATAAAAATGTCACTGGATAAATAACTT ATCATGATATTC AGCGGCTACCAATATTCTGAAGGCCCATGGCGAAAATAAGTACTTTTATACTTTCAGGAC GTATATATT TGGATTCTATCTAACAATTGTTCTGAGAATTATTTAGTTGTAGAAATAAATTTAAAATAC

SEQ . I . D . No . 21

Intron 7 antisense sequence ( 208 bp ) .

CTGTGGTTAGAAGCTAAAAGTGAATAGATGAGAAAAATTACCTCCAAATAAGAGGGA TATTGAAAAAGA AACACAATGCATGAAAAGAATAAACAAATGATAAACGAGAAAATTGAATAATCCATCAGA ACCCTGGTT ACCTCACAAAGAGTGAGATTTTCCGTGGCTAACCTATATGAACCTTAAAATGCAATAGAA ACAGACAAAC

SEQ . I . D . No . 22

Intron 8 antisense sequence (293 bp) .

CTGTACAAGTTCATCAAACATTTCACAATTACTCCAAAACAGACACACTTGCAAACT CTATACAGTAAT CTTCTATACTACAAAAAAGTAAACAATGTTTTTTTTAAGATGACATTTGTTCTCAGCAAC ATAATAGAA ATCCCTAGACAATGGAAACATTCATCATGTTGTTTTCCTCTATGTTTCAACCCCTTTGAT GTTCAACAG TTCAGGTCATTTTGAGGAATGAATCTTGTTCAAGTAAGCCAAACTAATTGTAATTATCAC AAAATATCT AAAGATGTAAGACATAC

SEQ.I.D. No. 23

Intron 9 antisense sequence (376 bp) .

CTGCATTTCATCATGAGGGGGAGGAAAGACGGAGAAATATAGATATCAGATTTAGAC CATTTCAATTAG TATCACTTCATTGTAAAGAAAAGGTAAGTATCCAACAAATATAGCAGGCTGTGGATTGGT AGCCTGAAA CTATAGCTTCAAAGAATCAACTTAAGCTGCTCATCAAGGCCTTAGTGGTAGAAATGAGGC GGTAATAAG TGTAAATGAATCTAATACTTGGATCTCGAAACAAAAATCAGAAATTCGGTTGGAAAATAA GTAGAACAA

7/04112

40

GATGAAATGAGCTATCATCCCCAGAACCAAGTAGACTTCCAAGTAAGCAATCTAAAA ATTACTAGATTA TTTAACAAGCTGCGATTCAAAATACTTGAAC

SEQ.I.D. No. 24

Intron 10 antisense sequence ( 172 bp ) .

CTGCAAAGTGAAGTAACTAATCAGTACAGCTATTACCGAATTTGACCAGCTATTGGA TTAAATAATATG AAATCCATCATCAAGAAATGGAAGGTAAAAAGGTTTCTACTTGTCCTTGGATAGAATTAA AGCACTTCA TAAACCCAACACTTTCAACTTTAGATGATTTTAC

SEQ . I . D . No . 25

Intron 11 antisense sequence (145 bp) .

CTGTTTTCGTCATGCGAGGATCAGAAAAAAGAGTTAAATTAGACAATGTGAAAATGA TTTGTTTCAGTT ACTTCTCCATAAAACTTGTTCAGTACATTAAAAACAAGCAGAGCAATAATTTCATGGATA AGTAAAACA TATATAC

SEQ.I.D. No. 26

Intron 12 antisense sequence (242 bp) .

CTGTCATGAGAACAGATTGTATGTCAGCATGAAGACAAAGATCATCAATAAACAGTT TTCTCCTTTTTG AATTAGCTAAACAACGCAGGGGGAGGGCAGGAGGCTCAAACACTTCCGAACTCAGACAGT CGGATATCT TATACAACTAAAGATGGATGAGACAATTACAGTTCTTTTTGGTGAGAGAACTGTACCCTA CATCTGTTA TCTTATTATCAAAAGTTATTCAAGCAAATCCTTAC

SEQ.I.D. No. 27

Intron 13 antisense sequence (797 bp) .

CTTCACAAACAAGGAGAAGAAGAAGCAAAAAGAAAGATGAATATAGTTAGCTTAGTT CAATATAAAAAA TTTCTCTCCAAGCTATTTTTCTGCTAGCAAAATTCATTAGTTATTTAACTTTTCTATACA TAAAGCTGC ACAAAGAAATAGTAGTACATTTTTTTGACTTGCACAAAATAACTGTGTTGTCCATTTTCT GACATGTGT TCATCTACATGCACTGTTTCAACAACAACAACTACTTCAGTCCCAAACAAGTTGGGTCGC TTTAGCTAC ACATGTTGCTTTCACTTCTGTTACTTCTTTTTGGACTTTTTTTCTTGAGCCAAGGGTCTA TTGAAAAAA TCCTCTCTACCTCTGAGATAGGAGTAAGTTTTGCATACACTCTACCCTCCCCCTGAAACC ACTTTGTGG GACTACACGAGGTATGTTGTTGTTGATGTTAGCGCAGACACCAAAGGTGGACATTATATG ACTATTCCT AGCTTTACTTCAGGGCGGTTTTAAGTTCCCATCAACTTCATTTTTGATCATTTACCTAAG TTTATGCAG GTGCAAGCTACATGCACTGGTTTAGGGAAAAAGAGGATAGAGAAGAATTTTTTTGGCATC CTTTTGTTT TGTAACAGTAAGATGCCAAAAGTAGACCTTATTACGGCTATTCCTACCTTTCAAATTAGT AGTTCAGAG GACTTAACTGGCGATTGTGGCGGTAATCAATAGTTAACTTCTATCGCATTCAAATAACTA TGAACAAAA CCACAATAAAAAGGGAGGTCACACGGCAAGAACTGTAC

41

SEQ . I . D . No . 28

Antisense DNA sequence of the SBE gene promoter region.

CGAATGGGTTTTGATAAAACTTTGAAATTAATTTCCATTGATTAAATTATGGTACTT TGC 60

TTCAGCTGCTGCCTTCTTGTATGTTCTGATTCTTGATTTCCTCATTTTAGTGGCTTT TTA 120

TAAAAAAACATTATGACCCTTTTGTTAGTCCTCCCCTTTCTGAATATTTCACTCAGA CCC 180

CATTAGTTTCGAAATTGAAGTAAAACATATTTTTTTTAGTATTGTAGTTTTTTTATA TTT 2 0

CTACTTACTTACTCGTTATACAATTTCTATTTAGGTAATCTAATCGACTTTTTGTAT ACA 300

TAATACATGTATTTTTGGTAAAGAGTTTTTTACTTTCTCCTAGTGGTAAGGCAGATA TAG 360

TTAAGGATTTATTGACCTAATATGAACGCCAATAATTTTATATTTTGTATATACGTA TAT 420

TTAAAAGTTTACTAGATATGTATAAATAAGATATTTAAAATTTAATTATAAATACAA ATG 480

ATTATGGTAAAATTTTGACCTCCAAATTAAAATATTTAAAATCAAGATTTGTCACTA CTT 540

ATATATATCTTGTTGTAAATCCCTTTTAATCAAGTTGTGAGTTTACAAATATTCGTT GGT 600

TAGGCTAAAAAAAATAAGCTATAAAGATCAAGTATAAAATTATGCATTTTCTGCATT TAA 660

TTTGGAAAAATATGTTGGAGCAATCTAAAATTGTTTGTTGATTTATAAATAAGTCGT TTT 720

TTGTTTTTAATAATTGATAAACTATTTATTCTGCTTAAAGTTTTAGAATGTCAAAAA ATA 780

ATTTATTTTAATGACCTTAAATGATTGAATAAGATGTAGACACACTCAATTACAAAG TTA g4 Q

CAATATTAATACACTTGTCTATTGGGTCATGGATTATATCATCTAATATAAATAACA TGT 900

CAAATTAAAGCTTCTTATAAAGTTCATAGGAACTAAGATAAACTTTGTGAATGGCCA AGC 960

ATTTTTCAGAACATCATGGGTGGTATGACAATCAAATTGAACTTATGGGATGAAAAA TGA 1020

ATATCATTCAACTAAGAGGGCACAACTTGACATGTTAGAAAGTAAAGCAAATTTAGT AGT 1080

GGGCCAAATAAAAGAAATTAATTTGTCAGTTTATTCTTAAACTTTACCTTCTTTGAA CTT 1140

CCACGTTATCAAAGGTTCACGGTTCATATGAAGGCCATGTGTATCCTTTTTAATTTT GGT 1200

ATTCCGTGTTCAATATCGATTAATTTAAATTCGCATGACAAAATCCTATATTAAAGT ATA 1260

AAGTATTTTCTAAAACAGACAAGTTCAATACTTTAATTTTACACTGAATGCATAAAT TTA 1320

CACTATAATAATTCCAGTCGCAGTCTACATTACAATAATTAACAATTTTAGCATGAA ATG 1380

AAAAACTTTAAATTATATGCCATCAAATCACTTAAAGTATACATTTTTTTAATAACT AGT 1440

TCTAATCCCACTTGAAATGAGAGTTATTTTAATATCGACCGTTAATTACCATTTTAT TAT 1500

TAAATCTGCAACTACAGTCAACTACACCAATGATTTTGCTGATGCCAACTCATAATA TAA 1560

TATCCACCGTTCATGTGATTAATTCAATATTTCATATACGTACGTAACAAAAATTAC TAA 1620

ATTAACGTTGGATATACCATACCCTAAGCTCTGCCAAATGTCAATGTTCTATCATTA GCT 1680

ATTTTTATGCATCTATAATAGATGTTAAATTCATATTCTAAGATTGAACTTAATCAT AAA 1740

CTCAAAATTTGTGGTACCTGTCAATGCCTCCAAAAGTTGATTGAACATAAACGTTAA GAT 1800

CTGTGTACTTGTCTTTTCCTTGTAATAATGTATGTATGATAATAATAATAAGAGAAC AAA 1860

ATATGGCAAAATAAACACTTTTTTAACATGTAACTCAAAACAAGTAATAGGCAAAAG TAC 1920

AGATGACAACACAACACTGTAAACATCATTGAGGAAAACAAAAACCATACAACATTT TGA 1980

CTGTAAATGAAGAGTTTGAAAACAAAAACTATGTTCAAACCGACGCCAAGCTAACGA AAA 2040

TAGCCATAGAGTTCTAAGAAGCAGATGCAACAGTTCCACGGGTTAGTATCGTCTGTA GTA 2100

GGACCGGTCATGAGAACTCGAAAGAATCTGAAAGGAAGTAATGCATTTGAACCAGTA ATT 2160 GGCCATGAT

SEQ.I.D. No. 29 Genomic SBE gene

ATCATGGCCA ATTACTGGTT CAAATGCATT ACTTCCTTTC AGATTCTTTC GAGTTCTCAT 60

GACCGGTCCT ACTACAGACG ATACTAACCC GTGGAACTGT TGCATCTGCT TCTTAGAACT 120

CTATGGCTAT TTTCGTTAGC TTGGCGTCGG TTTGAACATA GTTTTTGTTT TCAAACTCTT 180

CATTTACAGT CAAAATGTTG TATGGTTTTT GTTTTCCTCA ATGATGTTTA CAGTGTTGTG 240

TTGTCATCTG TACTTTTGCC TATTACTTGT TTTGAGTTAC ATGTTAAAAA AGTGTTTATT 300

TTGCCATATT TTGTTCTCTT ATTATTATTA TCATACATAC ATTATTACAA GGAAAAGACA 360

AGTACACAGA TCTTAACGTT TATGTTCAAT CAACTTTTGG AGGCATTGAC AGGTACCACA 420

AATTTTGAGT TTATGATTAA GTTCAATCTT AGAATATGAA TTTAACATCT ATTATAGATG 480

CATAAAAATA GCTAATGATA GAACATTGAC ATTTGGCAGA GCTTAGGGTA TGGTATATCC 540

AACGTTAATT TAGTAATTTT TGTTACGTAC GTATATGAAA TATTGAATTA ATCACATGAA 600

CGGTGGATAT TATATTATGA GTTGGCATCA GCAAAATCAT TGGTGTAGTT GACTGTAGTT 660

GCAGATTTAA TAATAAAATG GTAATTAACG GTCGATATTA AAATAACTCT CATTTCAAGT 720

GGGATTAGAA CTAGTTATTA AAAAAATGTA TACTTTAAGT GATTTGATGG CATATAATTT 780

AAAGTTTTTC ATTTCATGCT AAAATTGTTA ATTATTGTAA TGTAGACTGC GACTGGAATT 840

ATTATAGTGT AAATTTATGC ATTCAGTGTA AAATTAAAGT ATTGAACTTG TCTGTTTTAG 900

AAAATACTTT ATACTTTAAT ATAGGATTTT GTCATGCGAA TTTAAATTAA TCGATATTGA 960

ACACGGAATA CCAAAATTAA AAAGGATACA CATGGCCTTC ATATGAACCG TGAACCTTTG 1020

ATAACGTGGA AGTTCAAAGA AGGTAAAGTT TAAGAATAAA CTGACAAATT AATTTCTTTT 1080

ATTTGGCCCA CTACTAAATT TGCTTTACTT TCTAACATGT CAAGTTGTGC CCTCTTAGTT 1140

GAATGATATT CATTTTTCAT CCCATAAGTT CAATTTGATT GTCATACCAC CCATGATGTT 1200

CTGAAAAATG CTTGGCCATT CACAAAGTTT ATCTTAGTTC CTATGAACTT TATAAGAAGC 1260

TTTAATTTGA CATGTTATTT ATATTAGATG ATATAATCCA TGACCCAATA GACAAGTGTA 1320

TTAATATTGT AACTTTGTAA TTGAGTGTGT CTACATCTTA TTCAATCATT TAAGGTCATT 1380

AAAATAAATT ATTTTTTGAC ATTCTAAAAC TTTAAGCAGA ATAAATAGTT TATCAATTAT 1440

TAAAAACAAA AAACGACTTA TTTATAAATC AACAAACAAT TTTAGATTGC TCCAACATAT 1500

TTTTCCAAAT TAAATGCAGA AAATGCATAA TTTTATACTT GATCTTTATA GCTTATTTTT 1560

TTTAGCCTAA CCAACGAATA TTTGTAAACT CACAACTTGA TTAAAAGGGA TTTACAACAA 1620

GATATATATA AGTAGTGACA AATCTTGATT TTAAATATTT TAATTTGGAG GTCAAAATTT 1680

TACCATAATC ATTTGTATTT ATAATTAAAT TTTAAATATC TTATTTATAC ATATCTAGTA 1740

AACTTTTAAA TATACGTATA TACAAAATAT AAAATTATTG GCGTTCATAT TAGGTCAATA 1800

AATCCTTAAC TATATCTGCC TTACCACTAG GAGAAAGTAA AAAACTCTTT ACCAAAAATA 1860

CATGTATTAT GTATACAAAA AGTCGATTAG ATTACCTAAA TAGAAATTGT ATAACGAGTA 1920

AGTAAGTAGA AATATAAAAA AACTACAATA CTAAAAAAAA TATGTTTTAC TTCAATTTCG 1980

AAACTAATGG GGTCTGAGTG AAATATTCAG AAAGGGGAGG ACTAACAAAA GGGTCATAAT 2040

GTTTTTTTAT AAAAAGCCAC TAAAATGAGG AAATCAAGAA TCAGAACATA CAAGAAGGCA 2100

GCAGCTGAAG CAAAGTACCA TAATTTAATC AATGGAAATT AATTTCAAAG TTTTATCAAA 2160

ACCCATTCGA GGATCTTTTC CATCTTTCTC ACCTAAAGTT TCTTCAGGGG TAATTTTTAC 2220

TAATTTCATG TTAATTTCAA TTATTTTTAG CCTTTGCATT TCATTTTCCA ATATATCTGG 2280

ATCATCTCCT TAGTTTTTTA TTTTATTTTT TATAATATCA AATATGGAAG AAAAATGACA 2340

CTTGTAGAGC CATATGTAAG TATCATGTGA CAAATTTGCA AGGTGGTTGA GTGTATAAAA 2400

TTCAAAAATT GAGAGATGGA GGGGGGGTGG GGGBARAGAC AATATTTAGA AAGAGTGTTC 2460

TAGGAGGTTA TGGAGGACAC GGATGAGGGG TAGAAGGTTA GTTAGGTATT TGAGTGTTGT 2520

43

CTGGCTTATC CTTTCATACT AGTAGTCGTG GAATTATTTG GGTAGTTTCT TGTTTTGTTA 2580

TTTGATCTTT GTTATTCTAT TTTCTGTTTC TTGTACTTCG ATTATTGTAT TATATATCTT 2640

GTCGTAGTTA TTGTTCCTCG GTAAGAATGC TCTAGCATGC TTCCTTTAGT GTTTTATCAT 2700

GCCTTCTTTA TATTCGCGTT GCTTTGAAAT GCTTTTACTT TAGCCGAGGG TCTATTAGAA 2760

ACAATCTCTC TATCTCGTAA GGTAGGGGTA AAGTCCTCAC CACACTCCAC TTGTGGGATT 2820

ACATTGTGTT TGTTGTTGTA AATCAATTAT GTATACATAA TAAGTGGATT TTTTACAACA 2880

CAAATACATG GTCAAGGGCA AAGTTCTGAA CACATAAAGG GTTCATTATA TGTCCAGGGA 2940

TATGATAAAA ATTGTTTCTT TGTGAAAGTT ATATAAGATT TGTTATGGCT TTTGCTGGAA 3000

ACATAATAAG TTATAATGCT GAGATAGCTA CTGAAGTTTG TTTTTTCTAG CCTTTTAAAT 3060

GTACCAATAA TAGATTCCGT ATCGAACGAG TATGTTTTGA TTACCTGGTC ATGATGTTTC 3120

TATTTTTTAC ATTTTTTTGG TGTTGAACTG CAATTGAAAA TGTTGTATCC TATGAGACGG 3180

ATAGTTGAGA ATGTGTTCTT TGTATGGACC TTGAGAAGCT CAAACGCTAC TCCAATAATT 3240

TCTATGAATT CAAATTCAGT TTATGGCTAC CAGTCAGTCC AGAAATTAGG ATATGCTGCA 3300

TATACTTGTT CAATTATACT GTAAAATTTC TTAAGTTCTC AAGATATCCA TGTAACCTCG 3360

AGAATTTCTT TGACAGGCTT CTAGAAATAA GATATGTTTT CCTTCTCAAC ATAGTACTGG 3420

ACTGAAGTTT GGATCTCAGG AACGGTCTTG GGATATTTCT TCCACCCCAA AATCAAGAGT 3480

TAGAAAAGAT GAAAGGGTAT GTTTGATAAT TTATATGGTT GCATGGATAG TATATAAATA 3540

GTTGGAAAAC TTCTGGACTG GTGCTCATGG CATATTTGAT CTGTGCACCG TGTGGAGATG 3600

TCAAACATGT GTTACTTCGT TCCGCCAATT TATAATACCT TAACTTGGGA AAGACAGCTC 3660

TTTACTCCTG TGGGCATTTG TTATTTGAAT TACAATCTTT ATGAGCATGG TGTTTTCACA 3720

TTATCAACTT CTTTCATGTG GTATATAACA GTTTTTAGCT CCGTTAATAC CTTTCTTCTT 3780

TTTGATATAA ACTAACTGTG GTGCATTGCT TGCBKKKATG AAGCACAGTT CAGCTATTTC 3840

CGCTGTTTTG ACCGATGACG ACAATTCGAC AATGGCACCC CTAGAGGAAG ATGTCAAGAC 3900

TGAAAATATT GGCCTCCTAA ATTTGGATCC AACTTTGGAA CCTTATCTAG ATCACTTCAG 3960

ACACAGAATG AAGAGATATG TGGATCAGAA AATGCTCATT GAAAAATATG AGGGACCCCT 4020

TGAGGAATTT GCTCAAGGTA ACAGCCAAAA GTTGTGCTTT AGGCAGTTTG ACCTTATTTT 4080

GGAAGATGAA TTGTTTATAC CTACTTTGAC TTTGCTAGAG AATTTTGCAT ACCGGGGAGT 4140

AAGTAGTGGC TCCATTTAGG TGGCACCTGG CCATTTTTTT GATCTTTTAA AAAGCTGTTT 4200

GATTGGGTCT TCAAAAAAGT AGACAAGGTT TTTGGAGAAG TGACACACCC CCGGAGTGTC 4260

AGTGGCAAAG CAAAGATTTT CACTAAGGAG ATTCAAAATA TAAAAAAAGT ATAGACATAA 4320

AGAAGCTGAG GGGATTCAAC ATGTACTATA CAAGCATCAA ATATAGTCTT AAAGCAATTT 4380

TGTAGAAATA AAGAAAGTCT TCCTTCTGTT GCTTCACAAT TTCCTTCTAT TATCATGAGT 4440

TACTCTTTCT GTTCGAAATA GCTTCCTTAA TATTAAATTC ATGATACTTT TGTTGAGATT 4500

TAGCAGTTTT TTCTTGTGTA AACTGCTCTC TTTTTTTGCA GGTTATTTAA AATTTGGATT 4560

CAACAGGGAA GATGGTTGCA TAGTCTATCG TGAATGGGCT CCTGCTGCTC AGTAGGTCCT 4620

CGTCTACTAC AAAATAGTAG TTTCCATCAT CATAACAGAT TTTCCTATTA AAGCATGATG 4680

TTGCAGCATC ATTGGCTTTC TTACATGTTC TAATTGCTAT TAAGGTTATG CTTCTAATTA 4740

ACTCATCCAC AATGCAGGGA AGCAGAAGTT ATTGGCGATT TCAATGGATG GAACGGTTCT 4800

AACCACATGA TGGAGAAGGA CCAGTTTGGT GTTTGGAGTA TTAGAATTCC TGATGTTGAC 4860

AGTAAGCCAG TCATTCCACA CAACTCCAGA GTTAAGTTTC GTTTCAAACA TGGTAATGGA 4920

GTGTGGGTAG ATCGTATCCC TGCTTGGATA AAGTATGCCA CTGCAGACGC CACAAAGTTT 4980

GCAGCACCAT ATGATGGTGT CTACTGGGAC CCACCACCTT CAGAAAGGTT TTGTTATTCA 5040

TACCTTGAAG CTGAATTTTG AACACCATCA TCACAGGCAT TTCGATTCAT GTTCTTACTA 5100

GTCTTGTTAT GTAAGACATT TTGAAATGCA AAAGTTAAAA TAATTGTGTC TTTACTAATT 5160

TGGACTTGAT CCCATACTCT TTCCCTTAAC AAAATGAGTC AATTCTATAA GTGCTTGAGA 5220

44

ACTTACTACT TCAGCAATTA AACAGGTACC ACTTCAAATA CCCTCGCCCT CCCAAACCCC 5280

GAGCCCCACG AATCTATGAA GCACATGTCG GCATGAGCAG CTCTGAGCCA CGTGTAAATT 5340

CGTATCGTGA GTTTGCAGAT GATGTTTTAC CTCGGATTAA GGCAAATAAC TATAATACTG 5400

TCCAGTTGAT GGCCATAATG GAACATTCTT ACTATGGATC ATTTGGATAT CATGTTACAA 5460

ACTTTTTTGC TGTGAGCAGT AGATATGGAA ACCCGGAGGA CCTAAAGTAT CTGATAGATA 5520

AAGCACATAG CTTGGGTTTA CAGGTTCTGG TGGATGTAGT TCACAGTCAT GCAAGCAATA 5580

ATGTCACTGA TGGCCTCAAT GGCTTTGATA TTGGCCAAGG TTCTCAAGAA TCCTACTTTC 5640

ATGCTGGAGA GCGAGGGTAC CATAAGTTGT GGGATAGCAG GCTGTTCAAC TATGCCAATT 5700

GGGAGGTTCT TCGTTTCCTT CTTTCCAACT TGAGGTGGTG GCTAGAAGAG TATAACTTTG 5760

ACGGATTTCG ATTTGATGGA ATAACTTCTA TGCTGTATGT TCATCATGGA ATCAATATGG 5820

GATTTACAGG AAACTATAAT GAGTATTTCA GCGAGGCTAC AGATGTTGAT GCTGTGGTCT 5880

ATTTAATGTT GGCCAATAAT CTGATTCACA AGATTTTCCC AGATGCAACT GTTATTGCCG 5940

AAGATGTTTC TGGTATGCCG GGCCTTGGCC GGCCTGTTTC TGAGGGAGGA ATTGGTTTTG 6000

TTTACCGCCT GGCAATGGCA ATCCCAGATA AGTGGATAGA TTATTTAAAG AATAAGAATG 6060

ATGAAGATTG GTCCATGAAG GAAGTAACAT CGAGTTTGAC AAATAGGAGA TATACAGAGA 6120

AGTGTATAGC ATATGCGGAG ACCCATGATC AGGTATTTTA AATTTATTTC TACAACTAAA 6180

TAATTCTCAG AACAATTGTT AGATAGAATC CAAATATATA CGTCCTGAAA GTATAAAAGT 6240

ACTTATTTTC GCCATGGGCC TTCAGAATAT TGGTAGCCGC TGAATATCAT GATAAGTTAT 6300

TTATCCAGTG ACATTTTTAT GTTCACTCCT ATTATGTCTG CTGGATACAG TCTATTGTTG 6360

GTGACAAGAC CATTGCATTT CTCCTAATGG ACAAAGAGAT GTATTCTGGC ATGTCTTGCT 6420

TGACAGATGC TTCTCCTGTT GTTGATCGAG GAATTGCGCT TCACAAGGTT TGTCTGTTTC 6480

TATTGCATTT TAAGGTTCAT ATAGGTTAGC CACGGAAAAT CTCACTCTTT GTGAGGTAAC 6540

CAGGGTTCTG ATGGATTATT CAATTTTCTC GTTTATCATT TGTTTATTCT TTTCATGCAT 6600

TGTGTTTCTT TTTCAATATC CCTCTTATTT GGAGGTAATT TTTCTCATCT ATTCACTTTT 6660

AGCTTCTAAC CACAGATGAT CCATTTTTTC ACAATGGCCT TGGGAGGAGA GGGGTACCTC 6720

AATTTCATGG GTAACGAGGT ATGTCTTACA TCTTTAGATA TTTTGTGATA ATTACAATTA 6780

GTTTGGCTTA CTTGAACAAG ATTCATTCCT CAAAATGACC TGAACTGTTG AACATCAAAG 6840

GGGTTGAAAC ATAGAGGAAA ACAACATGAT GAATGTTTCC ATTGTCTAGG GATTTCTATT 6900

ATGTTGCTGA GAACAAATGT CATCTTAAAA AAAACATTGT TTACTTTTTT GTAGTATAGA 6960

AGATTACTGT ATAGAGTTTG CAAGTGTGTC TGTTTTGGAG TAATTGTGAA ATGTTTGATG 7020

AACTTGTACA GTTTGGCCAT CCTGAGTGGA TTGACTTCCC TAGAGAGGGC AATAATTGGA 7080

GTTATGACAA ATGTAGACGC CAGTGGAACC TCGCGGATAG CGAACACTTG AGATACAAGG 7140

TTCAAGTATT TTGAATCGCA GCTTGTTAAA TAATCTAGTA ATTTTTAGAT TGCTTACTTG 7200

GAAGTCTACT TGGTTCTGGG GATGATAGCT CATTTCATCT TGTTCTACTT ATTTTCCAAC 7260

CGAATTTCTG ATTTTTGTTT CGAGATCCAA GTATTAGATT CATTTACACT TATTACCGCC 7320

TCATTTCTAC CACTAAGGCC TTGATGAGCA GCTTAAGTTG ATTCTTTGAA GCTATAGTTT 7380

CAGGCTACCA ATCCACAGCC TGCTATATTT GTTGGATACT TACCTTTTCT TTACAATGAA 7440 GTGATACTAA TTGAAATGGT CTAAATCTGA TATCTATATT TCTCCGTCTT TCCTCCCCCT 7500 CATGATGAAA TGCAGTTTAT GAATGCATTT GATAGAGCTA TGAATTCGCT CGATGAAAAG 7560 TTCTCATTCC TCGCATCAGG AAAACAGATA GTAAGCAGCA TGGATGATGA TAATAAGGTA 7620 AAATCATCTA AAGTTGAAAG TGTTGGGTTT ATGAAGTGCT TTAATTCTAT CCAAGGACAA 7680 GTAGAAACCT TTTTACCTTC CATTTCTTGA TGATGGATTT CATATTATTT AATCCAATAG 7740 CTGGTCAAAT TCGGTAATAG CTGTACTGAT TAGTTACTTC ACTTTGCAGG TTGTTGTGTT 7800 TGAACGTGGT GACCTGGTAT TTGTATTCAA CTTCCACCCA AAGAACACAT ACGAAGGGTA 7860 TATATGTTTT ACTTATCCAT GAAATTATTG CTCTGCTTGT TTTTAATGTA CTGAACAAGT 7920

TTTATGGAGA AGTAACTGAA ACAAATCATT TTCACATTGT CTAATTTAAC TCTTTTTTCT 7980

GATCCTCGCA TGACGAAAAC AGGTATAAAG TTGGATGTGA CTTGCCAGGG AAGTACAGAG 8040

TTGCACTGGA CAGTGATGCT TGGGAATTTG GTGGCCATGG AAGAGTAAGG ATTTGCTTGA 8100

ATAACTTTTG ATAATAAGAT AACAGATGTA GGGTACAGTT CTCTCACCAA AAAGAACTGT 8160

AATTGTCTCA TCCATCTTTA GTTGTATAAG ATATCCGACT GTCTGAGTTC GGAAGTGTTT 8220

GAGCCTCCTG CCCTCCCCCT GCGTTGTTTA GCTAATTCAA AAAGGAGAAA ACTGTTTATT 8280

GATGATCTTT GTCTTCATGC TGACATACAA TCTGTTCTCA TGACAGACTG GTCATGATGT 8340

TGACCATTTC ACATCACCAG AAGGAATACC TGGAGTTCCA GAAACAAATT TCAATGGTCG 8400

TCCAAATTCC TTCAAAGTGC TGTCTCCTGC GCGAACATGT GTGGTACAGT TCTTGCCGTG 8460

TGACCTCCCT TTTTATTGTG GTTTTGTTCA TAGTTATTTG AATGCGATAG AAGTTAACTA 8520

TTGATTACCG CCACAATCGC CAGTTAAGTC CTCTGAACTA CTAATTTGAA AGGTAGGAAT 8580

AGCCGTAATA AGGTCTACTT TTGGCATCTT ACTGTTACAA AACAAAAGGA TGCCAAAAAA 8640

ATTCTTCTCT ATCCTCTTTT TCCCTAAACC AGTGCATGTA GCTTGCACCT GCATAAACTT 8700

AGGTAAATGA TCAAAAATGA AGTTGATGGG AACTTAAAAC CGCCCTGAAG TAAAGCTAGG 8760

AATAGTCATA TAATGTCCAC CTTTGGTGTC TGCGCTAACA TCAACAACAA CATACCTCGT 8820

GTAGTCCCAC AAAGTGGTTT CAGGGGGAGG GTAGAGTGTA TGCAAAACTT ACTCCTATCT 8880

CAGAGGTAGA GAGGATTTTT TCAATAGACC CTTGGCTCAA GAAAAAAAGT CCAAAAAGAA 8940

GTAACAGAAG TGAAAGCAAC ATGTGTAGCT AAAGCGACCC AACTTGTTTG GGACTGAAGT 9000

AGTTGTTGTT GTTGAAACAG TGCATGTAGA TGAACACATG TCAGAAAATG GACAACACAG 9060

TTATTTTGTG CAAGTCAAAA AAATGTACTA CTATTTCTTT GTGCAGCTTT ATGTATAGAA 9120

AAGTTAAATA ACTAATGAAT TTTGCTAGCA GAAAAATAGC TTGGAGAGAA ATTTTTTATA 9180

TTGAACTAAG CTAACTATAT TCATCTTTCT TTTTGCTTCT TCTTCTCCTT GTTTGTGAAG 9240

GCTTATTACA GAGTTGATGA ACGCATGTCA GAAACTGAAG ATTACCAGAC AGACATTTGT 9300

AGTGAGCTAC TACCAACAGC CAATATCGAG GAGAGTGACG AGAAACTTAA AGATTCGTTA 9360

TCTACAAATA TCAGTAACAT TGACGAACGC ATGTCAGAAA CTGAAGTTTA CCAGACAGAC 9420

ATTTCTAGTG AGCTACTACC AACAGCCAAT ATTGAGGAGA GTGACGAGAA ACTTAAAGAT 9480

TCGTTATCTA CAAATATCAG TAACATTGAT CAGACTGTTG TAGTTTCTGT TGAGGAGAGA 9540

GACAAGGAAC TTAAAGATTC ACCGTCTGTA AGCATCATTA GTGATGTTGT TCCAGCTGAA 9600

TGGGATGATT CAGATGCAAA CGTCTGGGGT GAGGACTAGT CAGATGATTG ATCGACCCTT 9660

CTACCGATTG GTGATCGCTA TCCTTGCTCT CTGAGAAATA GGTGAGGCGA AACAAAAAAT 9720

AATTTGCATG ATAAAAAGTC TGATTTTATG ATCGCTATCC TCGCTCTCTG AGAAAGAAGC 9780

GAAACAAAGG CGACTCCTGG ACTCGAATCT ATAAGATAAC AAAGGCGACT CCTGGGACTC 9840

GAATCTATAA GATAACAAAG GCAATTCCAA GACTTGAATC TATAAAAAAT TTAGTTAAGA 9900 ATGATTAACG TCCGATCCTA ATTCGAATCG AGGCATCTTA CCACTCCATT GATAATTATA 9960

TAAGTCAATA AGTCATATAA WAGTATTAAA AACTAAATTG ACTTGATCGG TCTATCAAAA 10020

ATMAGATMAA ATTGTGTTCA TATGTAACAT TTTTGTTGTC ACAATTAGCT TAATTACATC 10080

TTTCATGTGC AATAACAAAG AAATGATAGG AATTTAGAGA TTCCAATTTT TTTGTTGCCA 10140

CAATTAACTT AATTACATCT TTCATTTGCA ATAACAAAGA AATGATAGGA ATTTAGAGAT 10200

CCAGTGTCAA TACACAACCT AGGCCAACAT CGAAAGCATA ACTGTAAACT CATGCATGAA 10260

GAAATCAGTC GTAAAAATGA ATAAATGCGA CATAAAAACA AATTGCATGT ATCATTAATG 10320

TGACTTAACT ACAAGTAAAA ATAAATTTAA CAAATGTAAC TTAACTACAA GTAAAAATAA 10380

ATTGCTTCTA TCATTAACAA ACAAACAGAA TTAAAAAGAA AAAAACATAC TAAATCTTAC 10440

CGTCATTCGA TAAAAAAAAA TACCAAATTC ATAATGCAAG GAAAACGAAA CGCGTCCTGA 10500

TCGGGTATCA ACGATGAAAT GGACCAGTTG GATCGACTGC CTGCACAACG TTAGGTATGC 10560

CAAAAAAAAG AACACGATCC TTTGCACCCG TTCGATGATT ATCAGTATGT TCACAAAAAA 10620

AACTTAAGTT CATCCCAGTG TACAACAGCC CCAACATCTG CCCCAAGTAA CAAAAAACAA 10680

CCAATTTATC TTATTCTTAT CTGCCACAAA ATAATCGGTT TCACACTATT CTCTTGTTAT 10740

ACAAAATTGA CAAGTAGGAA GGAGAGGAGT CATCCAAATA AACGGTGCAC GTTCTTTGAG 10800

AAAAGTCTTA TTTTTCGTAA GATCCAATTT CAACAAACTT TTCTTCAAGT CAAAATTCCT 10860

GATAGTGTAT CTCCTCTCGA CGACCTCTTG CATTGAACGA TCTCCGCTTA TCATGAAAAG 10920

TTGCTTGGAT AACAAGTATT GCAAGGGGGG GACAGTAGCT ATTAAGTTAG TCGGCCCAAG 10980

GAAATGGAGG AGTGATAGTC TCGAATATTA TTCACCTCTT TAGCATTACC CGGTCTGGCT 11040

TTAAGGAGTT ACGTCTTTTA CGCTCGCCAA TTTCTTTTTT TAGAATGGTT GGTGTCAAAA 11100

TCGCGAGTTG TGGAAGGTTC AAGTTACTCG ATTCGTGATT TTCAAGTATG AGTGGTGAGA 11160

GAGATTCGAT ATTTTCACGA GGTGTATTCG AGGTCTAGTA GAACGAAGGG TGTCACTAAT 11220

GAAAGTTTCA AGAGTTCATC ATCATCTTCT TCTAGTAGAT TTTCGCTTTC AAATGAGTAT 11280

GAAAATTCTT CCTCTTTTCT ATTGATTTTC TTCATTGTTT TCTTCATTGT TGTGGTTGTT 11340

ATTGAAAAGA AAGAAAATTT ATAACAGAAA AAGATGTCAA AAAAAAGGTA AAATGAAAGA 11400

GTATCATATA CTTAAAGAGT TGCGTAGAGA TAAGTCAAAA GAAACAGAAT TATAGTAATT 11460

TCAGCTAAGT TAGAATTC 11478