Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ANTIMICROBIAL PROTEINS
Document Type and Number:
WIPO Patent Application WO/1997/023617
Kind Code:
A1
Abstract:
An antimicrobial protein comprising a peptide having the sequence -Gln/Cys-AA2-Pro/Ile-Asn/Thr/Leu-AA5-AA6-Cys-Cys-Ala/Asn-Gly/Lys-AA11-AA12-AA13-AA14-AA15-, with the proviso that AA2 and AA14 are not cysteine, and that AA4 is Leu when AA1 is Cys. An antimicrobial protein having the sequence depicted in any one of SEQ ID Nos 3, 5 or 6. Recombinant DNA encoding such proteins. A vector comprising such DNA which is expressible in plants and which is linked to a plant operable promoter and terminator. Plants transformed with such recombinant DNA; the progeny of such plants which contain the DNA stably incorporated and hereditable in a Mendelian manner, and/or the seeds of such plants or such progeny.

Inventors:
NIELSEN KLAUS KRISTIAN (DK)
KROLL KRISTENSEN ANNE (DK)
BRUNSTEDT JANNE (DK)
Application Number:
PCT/EP1996/005765
Publication Date:
July 03, 1997
Filing Date:
December 20, 1996
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SANDOZ LTD (CH)
SANDOZ AG (AT)
SANDOZ AG (AT)
NIELSEN KLAUS KRISTIAN (DK)
KROLL KRISTENSEN ANNE (DK)
BRUNSTEDT JANNE (DK)
International Classes:
A01H5/00; A01N65/00; C07K14/415; C12N15/09; C12N15/29; C12N15/82; C12P21/02; (IPC1-7): C12N15/29; C12N15/82; C07K14/415; A01H5/00; A01N65/00
Domestic Patent References:
WO1995011306A11995-04-27
WO1995004754A11995-02-16
WO1992020801A11992-11-26
WO1992017591A11992-10-15
Foreign References:
EP0612847A21994-08-31
Other References:
DIERYCK, W., ET AL.: "Nucleotide sequence of a cDNA encoding a lipid transfer protein from wheat (Triticum durum Desf.)", PLANT MOLECULAR BIOLOGY, vol. 19, 1992, pages 707 - 709, XP002029148
JAKOBSEN, K., ET AL.: "Barley aleurone cell development: molecular cloning of aleurone-specific cDNAs from immature grains", PLANT MOLECULAR BIOLOGY, vol. 12, 1989, pages 285 - 293, XP002029149
YE, Z.-H., ET AL.: "Gene expression patterns associated with in vitro tracheary element formation in isolated single mesophyll cells of Zinnia elegans", PLANT PHYSIOLOGY, vol. 103, 1993, pages 805 - 813, XP002029150
RAYNAL, M., ET AL.: "A. thaliana transcribed sequence; clone PAP058; 5' end; similar to lipid transfer protein; Hordeum vulgare.", EMBL SEQUENCE DATABASE, REL. 37, 1-NOV-1993, ACCESSION NO. Z27019, XP002029151
PYEE, J., ET AL.: "Broccoli lipid transfer protein mRNA, complete cds.", EMBL SEQUENCE DATABASE, REL. 39, 25-MAR-1994, ACCESSION NO. L29767, XP002029152
NIELSEN K K ET AL: "Extracellular antifungal proteins in sugar beet leaves.", KEYSTONE SYMPOSIUM ON HOST-FUNGUS PATHOGENIC INTERACTIONS, TAOS, NEW MEXICO, USA, FEBRUARY 25-MARCH 3, 1995. JOURNAL OF CELLULAR BIOCHEMISTRY SUPPLEMENT 0 (19B). 1995. 158., XP002029153
Download PDF:
Claims:
Claims
1. According to the present invention there is provided antimicrobial protein comprising a peptide having the sequence Gln/CysAA2Pro/IleAsn/Thr/LeuAA,AA6CysCysAla/Asn GJy/LysAAnAA^AA AAμAA,,, with the proviso that AA2 and AAI4 are not cysteine, and that AA4 is Leu when AA, is Cys .
2. An antimicrobial protein having the sequence depicted in any one of SEQ ID Nos 3, 5 or 6.
3. Pure protein at least 95% similar to the proteins according to claim 1.
4. Pure protein at least 65% similar, preferably 75% similar, more preferred 85% similar, particularly preferred 95% similar to the proteins according to claim 2.
5. Pure proteins according to any one of claims 1 to 4, in combination with at least one protein selected from the group consisting of those depicted in SEQ ID Nos 7 to 12 and 14.
6. Pure proteins according to any one of the preceding claims, in combination with al least one further protein having herbicide resistance, plant growthpromoting, antifungal, anti¬ bacterial, antiv iral and/or antinematode properties.
7. Recombinant DNA comprising a sequence encoding a protein as claimed in any one of claims 1 to 6.
8. Recombinant DNA according to claim 7 with the cDNA sequence depicted in SEQ ID No 1 , in SEQ ID No 2 or in SEQ ID No 4.
9. A recombinant DNA sequence according to claim 7 or claim 8, which further comprises a sequence encoding at least one of the proteins depicted in SEQ ID Nos 4 to 9 and 1 1.
10. Recombinant DNA according to any one of claims 7 to 9, which further encodes a protein having herbicide resistance, plant growthpromoting, antifungal, antibacterial, anti¬ viral and/or antinematode properties.
11. Recombinant DNA according to any one of claims 7 to 10, which is modified in that known mRNA instability motifs or polyadenylation signals are removed and/or codons which are preferred by the organism into which the recombinant DNA is to be inserted are used so that expression of the thus modified DNA in the said organism yields substantially similar protein to that obtained by expression of the unmodified recombinant DNA in the organism in which the protein is endogenous.
12. A DNA sequence which is complementary to one which hybridizes under stringent conditions with the DNA of any one of claims 7 to 11.
13. A vector containing a DNA sequence as claimed in any one of claims 7 to 12, which sequence is expressed in plants through linkage to a plant operable promoter and terminator.
14. A biological system which includes DNA as claimed in any one of claims 7 to 12, or the vector of claim 1 .
15. Plants transformed with recombinant DNA as claimed in any one of claims 7 to 12, the progeny of such plants which contain the DNA stably incoφorated and hereditable in a Mendelian manner, and/or the seeds of such plants or such progeny.
16. Protein derived from expression of the DNA as claimed in any one of claims 7 to 12, and antimicrobial protein produced by expression of the recombinant DNA within plants or progeny or seed thereof as claimed in claim 15.
17. An antimicrobial composition containing one or more of the proteins as claimed in any one of claims 1 to 6 and 16.
18. A process for combatting fungi which comprises exposing them to proteins or compositions as claimed in any one of claims 1 to 6, 16 and 17.
19. An extraction process for obtaining antimicrobial proteins, as claimed in any one of claims 1 to 6 or claim 16, from organic material containing them comprising submitting the material to maceration and solvent extraction, characterized in that the material is a micro¬ organism or a plant.
Description:
ANTI MICROBIAL PROTEINS

The present invention relates to anti-microbial proteins isolatable from sugar beet

According to the present invention there is provided anti-microbial protein comprising a peptide having the sequence -Gln7Cys-AA 2 -Pro/Ile-Asn/Thr/Leu-AA< i -AA 6 -Cys-Cys-Ala/Asn- Gly Lys-AA 11 -AA I2 -AA π -AA| 4 -AA„-, with the proviso that AA 2 and AA 14 are not cysteine, and that AA 4 is Leu when AA, is Cys The skilled man will recognize that the abbreviation AA X means one of the 20 commonly occurring amino acids

An anti-microbidl protein includes a protein (alone or in combination with another material) which is toxic or growth inhibitory under any circumstances to any micro-organism, including bacteria, (most particularly Gram positive bacteria), viruses and particularly fungi Such anti¬ microbial proteins include those that exhibit anti-microbial activity upon contact with a micro¬ organism and those that are anti-microbial as a consequence of assimilation or respiration thereof

According to the present invention there is provided an anti-microbial protein having the sequence depicted in any one of SEQ ID Nos 3, 5 or 6

Also included is an isoform of an anti-microbial protein having SEQ ID No 3 wherein Val

The invention still further includes pure protein which is substantially similar to any one of the above mentioned proteins

By "substantially similar" is meant pure proteins comprising an amino acid sequence which ist at least 95% similar to the peptide sequence given above (also as defined in Claim 1) and/or pure proteins having an amino acid sequence which is at least 65% similar, preferably 75% similar, more preferred 85% similar, particularly preferred 95% similar to the sequence of the proteins depicted in SEQ ID Nos 3, 5 or 6 below In the context of the present

invention, two amino acid sequences with a given percentage similarity to each other have at least that percentage of identical or conservatively replaced amino acid residues in a like position when aligned optimally allowing for up to 2 gaps with the proviso that in respect of each gap a total not more than 2 amino acid residues is affected.

For the purpose of the present invention conservative replacements may be made between amino acids within the following groups:

(i) Serine and Threonine;

(ii) Glutamic acid and Aspartic acid;

(iii) Arginine and Lysine;

(iv) Asparagine and Glutamine;

(v) Isoleucine, Leucine, Valine and Methionine;

(vi) Phenylalanine, Tyrosine and Tryptophan

(vii) Alanine and Glycine

The invention still further includes pure proteins which are at least 90% identical to the anti¬ microbial proteins according to the invention, as well as pure proteins which have at least 90% of the specific activity thereof. For the purposes of the present application, specific activity is a measurement of the amount of growth or replication inhibition produced by a specified quantity of the protein on a specified quantity of a specified micro-organism.

The invention still further includes said pure proteins in combination with at least one protein selected from the group consisting of those depicted in SEQ ID Nos. 7 to 12 and 14. Such combined proteins may be further combined with one or more of the known "pathogenesis- related proteins". Infection of plants with fungal or viral pathogens may induce a systemic synthesis of about 10 families of homologous pathogenesis-related proteins (PR proteins) in vegetative tissues. Such PR-proteins have been classified into 5 groups. The PR-2, PR-3 and PR-5 proteins are beta- l ,3-glucanase, chitinases and thaumatin-like proteins respectively. Specific functions have not been assigned to the PR- 1 and PR-4 groups of proteins. The PR-4 proteins are similar to C-terminal domains of prohevein and the putative wound-induced WIN proteins of potato, thus lacking the N-terminal hevein domain. It is particularly preferred that

the proteins according to the invention are combined with one or more proteins which are the basic counter parts of the P-R 4 group of proteins, meaning the basic counter part of proteins similar to the C-terminal domains of prohevein and the putative wound-induced WIN proteins of potato. It is particularly preferred that the basic counter-part of the said pathogenesis- related proteins is a chitin-binding WIN protein, in particular that produced by barley grain or stressed barley leaves.

The invention still further includes recombinant DNA comprising a sequence encoding a protein having the amino acid sequence of the above disclosed anti-microbial proteins. In particular the DNA may encode at least one of the proteins the sequences of which are depicted in SEQ ID Nos. 3, 5 and 6, optionally in addition to at least one of the proteins the sequences of which are depicted in SEQ ID Nos. 7 to 12 and 14. Preferably the sequence encoding a protein having the amino acid sequence depicted in SEQ ID No. 3, 5 or 6 has the cDNA sequence depicted in SEQ ID No. 1 (for the protein with SEQ ID No. 6, corresponding to IWF5), in SEQ ID No. 2 (for the protein with SEQ ID No. 3, corresponding to IWF6) or in SEQ ID No. 4 (for the protein with SEQ ID No. 5, corresponding to IWF7). The recombinant DNA may further encode a protein having herbicide resistance, plant growth- promoting, anti-fungal, anti bacterial, anti-viral and or anti-nematode properties. In the case that the DNA is to be introduced into a heterologous organism it may be modified to remove known mRNA instability motifs (such as AT-rich regions) and polyadenylation signals (if any arc present), and/or codons which are preferred by the organism into which the recombinant DNA is to be inserted may be used so that expression of the thus modified DNA in the said organism yields substantially similar protein to that obtained by expression of the unmodified recombinant DNA in the organism in which the anti-microbial protein according to the invention is endogenous.

The invention still further includes recombinant DNA which is "similar" to that mentioned above. By "similar DNA" is meant a sequence which is complementary to a test sequence which is capable of hybridizing to the inventive recombinant sequence. When the test and inventive sequences are double stranded the nucleic acid constituting the test sequence preferably has a TM within 20°C of that of the inventive sequence. In the case that the test

and inventive sequences are mixed together and denatured simultaneously, the TM values of the sequences are preferably within 10°C of each other. More preferably the hybridization is performed under stringent conditions, with either the test or inventive DNA preferably being supported. Thus either a denatured test or inventive sequence is preferably first bound to a support and hybridization is effected for a specified period of time at a temperature of between 50 and 70°C in double strength citrate buffered saline (SSC) containing 0.1 %SDS followed by rinsing of the support at the same temperature but with a buffer having a reduced SSC concentration Depending upon the degree of stringency required, and thus the degree of similarity of the sequences, at a particular temperature, - such as 60°C, for example - such reduced concentration buffers are typically single strength SSC containing 0 1%SDS, half strength SSC containing 0 1 %SDS and one tenth strength SSC containing 0.1%SDS Sequences having the greatest degree of similarity are those the hybridization of which is least affected by washing in buffers of reduced concentration. It is most preferred that the test and inventive sequences are so similar that the hybridization between them is substantially unaffected by washing or incubation in one tenth strength sodium citrate buffer containing 0 1 %SDS

The invention still further includes a DNA sequence which is complementary to one which hybridizes under stringent conditions with the recombinant DNA according to the invention

Also included in the present mvention is a vector which contains the above disclosed DNA which is expressible in plants and linked to a plant operable promoter and terminator, plants transformed with such DNA, the progeny of such plants which contain the DNA stably incoφorated and hereditable in a Mende an manner, and/or the seeds of such plants and such progeny The transformed plants are made by known methods and include regeneration of plant cells or protoplasts transformed with the DNA of the invention according to a variety of known methods (Agrobacteπum Ti and Ri plasmids, electroporation, micro-injection, micro-projectile gun etc) The transformed cell may in suitable cases be regenerated into whole plants in which the nuclear material is stably incoφorated into the genome. Both monocot and dicot plants may be obtained in this way Examples of transformed plants according to the present invention include: fruits, including tomatoes, mangoes, peaches,

apples, pears, strawberries, bananas, and melons; field cops such as canola, sunflower, tobacco, sugar beet, small grain cereals such as wheat, barley and rice, maize and cotton, and vegetables such as potato, carrot, lettuce, cabbage and onion. The preferred plants are sugar beet and maize.

The invention still further includes protein derived from expression of the said DNA, and anti-microbial protein produced by expression of the recombinant DNA within plants transformed therewith.

The invention still further includes an anti-microbial composition containing one or more of the proteins according to the invention; a process for combatting fungi which comprises exposing them to such proteins, and an extraction process for obtaining anti-microbial proteins from organic material containing them comprising submitting the material - preferably in the form of a micro-organism - to maceration and solvent extraction. It will be appreciated that the anti-microbial protein exhibits little, if any, anti-microbial effect on the micro-organism which is the source of the organic material referred to in the previous sentence

The invention will be further apparent from the following description and the associated drawings and sequence listings.

Figure 1 shows a typical elution profile of intercellular washing fluid from a CM- Scpharose column.

Figure 2 shows a typical elution profile from a Mono S FPLC column of the 0.3M NaCl fraction shown in Figure 1 ;

Figure 3 shows a typical elution profile from an RP-HPLC column of the proteins represented by peak 4 in Figure 2;

Figure 4 shows a typical elution profile from an RP-HPLC column of the protein represented by peak 4.4 in Figure 3;

Figure 5 shows a typical elution profile from an RP-HPLC column of the protein represented by peak 4.3 in Figure 3;

Figure 6 shows a typical elution profile from an RP-HPLC column of the protein

represented by peak 5 in Figure 2;

Figure 7 shows a typical elution profile from an RP-HPLC column of the protein represented by peak 5.4 in Figure 6;

Figures 8 and 9 show the anti-fungal activity of various amounts of the protein represented by peak 4.4 in Figure 4 (SEQ ID No. 6);

Figure 10 shows the anti-fungal activity of the protein represented by peak 4.3 (SEQ ID No. 3 wherein Val in pos. 80 is Ala) in Figure 5;

Figure 1 1 shows the anti-fungal activity of the protein represented by peak 5.4 (SEQ ID No. 5) in Figure 7.

SEQ ID No. 6 shows the amino acid sequence of protein represented by peak 4.4 in Figure 4; SEQ ID No. 3 wherein Val in pos. 80 is Ala shows the amino acid sequence of protein represented by peak 4.3 in Figure 5; and SEQ ID No. 5 shows the amino acid sequence of protein represented by peak 5.4 in Figure 7; SEQ ID Nos. 7 to 12 and 14 show the amino acid sequences of known anti-fungal proteins.

Induction of resistance

Six-week old plants of sugar beet (Beta vulgaris L., cv. Monova, Danisco Seed) are treated four times with 25 ppm 2,6-dichloroisonicotinic acid (INA) with a two-day interval. The INA, suspended in 0.05% Tween 20, is applied by spraying the adaxial leaf surface to the point of saturation. Two days after the final treatment, intercellular washing fluid (IWF) is isolated as described below. Intercellular washing fluid isolated from leaves not subjected to INA also contains proteins according to the present invention.

Isolation of Intercellular Washing Fluid

IWF is isolated from 500-700 gram sugar beet leaves by submerging them in 20mM HAc (pH 4.5). The thus submerged leaves are then placed in an exicator and vacuum infiltrated for 5 min at 4 ton * (max). Following air-drying of the leaf surface, the IWF is collected by centrifugation at 500g for 15 min in 500 ml centrifuge tubes.

Cation exchange chromatography

The thus obtained IWF is fractionated by cation exchange chromatography on a 10 ml CM- Sepharose column (Pharmacia LKB) pre-equilibrated in starting buffer (20mM HAc (pH 4.5)). The fractionation is performed at 4°C at a flow rate of 25 ml/h. Fractions of 3 ml are collected. Proteins not bound to the column are removed by extensive washing of the column with starting buffer. Bound proteins are eluted by applying to the column further starting buffer comprising stepwise increased salt concentrations: viz, 0.1 M NaCl, 0.3 M NAC1 and 0.5 M NaCl. The absorbance at 280nm of the eluate is measured, and fractions judged to comprise protein are tested for their anti-fungal activity against C. beticola using the microtiter plate bioassay described previously (PCT Patent Application No. PCT DK92/00108, Publication No. WO 92/17591 , now assigned to Sandoz LTD).

A typical elution profile is shown in Fig. 1. The eluates resulting from application to the column of starting buffer comprising the 0.3M NaCl is further purified as described below.

Purification of antifungal proteins in the 0.3 M NaCl eluate from the CM-Sepharose. FPLC Chromatography

The 0.3M NaCl protein fraction is desalted by overnight dialysis (MW cut off: 1 kDa) against 20mM HAc (pH 4 5) at 4°C. Betaine is added at a concentration of 5% (w/v) to the thus dialysed protein fraction. Four ml of the resulting solution is then fractionated by cation exchange fast protein liquid chromatography (FPLC) using a Mono S HR 5/5 column ( Pharmacia LKB ) equilibrated in 20 mM HAc (pH 4.5) containing 5% (w/v) betaine (A- buffer) Bound proteins are eluted with a linear salt gradient from 0 to 0.3 M NaCl in 30 l of the A-buffer followed by a step elution with 1.0 M NaCl in the same buffer. Flow rate is 1 ml/min.

Figure 2 shows that the 0.3M NaCl fraction contains a number of distinct proteins, the quantitatively most significant of which are designated as peaks 1-5. Strong anti-fungal activity is associated with the proteins represented by peaks 4 and 5. When separated by SDS- PAGE using the Phast System (Pharmacia LKB), silver stained 10-15% gradient Phast gels or High Density gels (Pharmacia LKB) reveal that each of the peaks 4 and 5 contains about

10 different proteins.

Reverse Phase HPLC

Protein peaks 4 and 5 (depicted in Figure 2) from the Mono S column is further purified by reverse phase (RP-) HPLC on a Vydac C silica column (The Separations Group. CA, USA). The solvent system is A: 0.1% TFA in water and B: 0.1% TFA in acetonitrile. Proteins are eluted with a linear gradient of 5 to 45% of the B-buffer applied in 18 min after sample loading followed by 60% B-buffer in 2 min. Flow rate is 0.7 ml/min. Protein is detected by monitoring the absorbance of the eluate at 214 and 280nm. Discrete protein peaks are collected and lyophilized. The thus lyophilized proteins are washed twice with water, re- lyophilized and subsequently resolved in lOmM Tris-HCl (pH 8.0), prior to analysis of purity and anti-fungal activity. The material represented by peak 4 is separated into 9- 10 distinct protein peaks (Fig. 3) on the RP-HPLC column. Peaks 4.3 and 4.4 comprise strong antifungal activity and are consequently re-chromatographed on the RP-C4 column (Figs. 4 and 5). A subsequent analysis by SDS-PAGE (silver staining) demonstrated that these two proteins had been purified to homogeneity (data not shown). Furthermore, the N-terminal amino acid sequencing confirmed the presence of only one protein in each peak. Peak 4.3 and 4.4 are designated IWF6 and IWF5, respectively. The material represented by peak 5 in Figure 2 is separated into 8- 10 different protein peaks following RP-HPLC purification (Fig. 6). Of these, peak 5.4 comprised strong antifungal activity. Following a 2nd round of RP-HPLC (Fig. 7), the protein was homogenous as confirmed by SDS-PAGE and amino acid sequencing. This protein is designated IWF7.

Antifungal activity

The proteins, either alone or in combination with WIN N (which is purified from barley grain or stressed barley leaf as described by Hejgaard et_al (FEBS Letters, 307, 389-392 (1992)), and or a protein having a sequence corresponding to at least one of those given in SEQ ID Nos. 7- 12 and 14, are incubated with spores of C. beticola. The assay mix (240ul) contains lOOul of potato dextrose broth (Difco), 40ul protein sample (or buffer control) in lOOmM Tris and 20mM NaCl (pH 8.0) as well as approximately 400 spores in lOOul water. The micro- titre plates are sealed with tape to avoid evaporation and contamination and subsequently

incubated at room temperature on an agitator operated at 200 φm The absorbance at 620nm is measured each day for 8 days and plotted for each concentration of protein vs time The protein designated as IWF 5 showed a strong growth inhibiting effect against C. beticola. At lOμg/well (40μg/ml) no fungal growth could be detected (Fig. 8) and at 4μg/well the growth was markedly delayed and strongly inhibited (Fig 9) The protein designated as IWF6 also shows a strong growth inhibiting effect against C beticola The level of activity is comparable to that of IWF5, giving almost complete growth inhibition at - 5μg/well (Fig 10) Moreover, the protein designated as IWF7 also shows a strong growth inhibiting effect against C beticola At lOμg/well no fungal growth was detected (Fig 1 1) Furthermore, microscopical analyses indicated that the protein inhibited the germination of conidia Thus, IWF7 may effect spore germination as well as hyphal growth

Amino acid sequencing

The purified anti-fungal proteins corresponding to peak 4 4 in Figure 4, peak 4 3 in Figure 5 and peak 4 4 in Figure 7 which originate from the 0.3 M NaCl eluate from the CM- Sepharose column, (Figure 1 ), are carboxymethylated and subjected to RP-HPLC on a Vydac Cj column The solvent system is A 0 1 % TFA in water and B 0 1 % TFA in acetonitrile The proteins elute as single peaks with slightly different retention times The C-termmal sequences of the proteins are obtained by cleavage thereof with em/ø-R-proteinase and subsequent purification by RP-HPLC on a Vydac C, g column

Cloning of IWF5, IWF6 and IWF/ cDNA

The cDNA sequence of IWF5, IWF6 and IWF7 was obtained by 3' and 5' RACE as described previously (Nielsen et al 1996, Plant Mol. Biol., 31 :539-552) using the following primers

A. 3' RACE Primers.

Q τ : 5' -CCAGTGAGCAGAGTGACGAGGACTCGAGCTCAAGC(T) 17 -3 ' Q 0 : 5'-CCAGTGAGCAGAGTGACG-3 '

Q λ : GAGGACTCGAGCTCAAGC-3 '

B. 5' RACE primers.

5 ' -ANKER: 5' -GGCCACGCGTCGACTAGTACGGGGGGGGGG-3 ' 5 ' -UNI : 5' -GGCCACGCGTCGACTAGTACG

IWF5 3" RACE.

The amino acid sequence of the IWF5 protein was used to construct two degenerated oligonucleotide primers for the isolation of a partial cDNA clone by 3'RACE Total RNA was purified from sugar beet (cv Monova) leaves 6 days after inoculation with C beticola according to Collinge et al 1987 (Plant Mol. Biol 8: 405-414). Reverse transcription followed by PCR was done with the RT-PCR kit from Perkin Elmer and according to their protocol Briefly, lμg of total RNA and 2.5 pmol Q τ -primer was incubated at 42°C for 45 mm with reverse transcriptase followed by incubations at 99°C for 5 min and 5°C for 5 mm. In the first PCR 40 pmol of the primer Q 0 was used as downstream primer and the upstream primer was 150 pmol of the degenerated primer (5'-GC(ACGT)TG(CT)(AC)G(ACGT)TG(CT)ATGAA position 315-331 in the IWF5 cDNA sequence). In the second nested PCR 50 pmol of the primer Q, was used as downstream primer and the upstream primer was 50 pmol of the degenerated primer (5'-GG(ACGT)AT(ACT)AA(CT)CA(CT)AA(GA)TA: position 354-370 in the IWF5 cDNA sequence) The PCR conditions were 1 min at 94°C, 2 nun at 42°C, 1 min at 50°C and 5 min at 72°C for 1 cycle followed by 1 mm at 94°C and 2 min at 42°C and 3 mm at 72°C for 35 cycles followed by 10 min at 72°C After the second PCR a single DNA product of 390 bp was obtained. The DNA product was cloned into the pT7Blue vector (Novagen) and sequenced using a Ter o Sequenase fluorescent cycle sequencing kit (Amersham) and an ALF DNA sequencer (Pharmacia).

IWF5 5' RACE.

The sequence of the 5' end of IWF5 cDNA was obtained by 5 'RACE using the 5 * RACE system from Gibco BRL with 3 gene specific primers constructed from the partial cDNA sequence obtained by 3'RACE. Briefly, lμg of the same total RNA as used for 3'RACE and

2.5 pmol of a gene specific primer GSP5-1 (5 * -TGGAATTGG-AGATTATGTAAG: position 619-643 in the IWF5 cDNA sequence) was incubated at 70°C for 10 min followed by the addition of reverse transcriptase and incubating at 42°C for 30 min, 70°C for 15 min and the addition of RNaseH and incubating further 10 min at 55°C. The cDNA was dC-tailed according to the protocol of Gibco BRL. The tailed cDNA was subjected to two rounds of PCR. In the first PCR 20 pmol of the 5'-ANKER primer was used as upstream primer and the downstream primer was 20 pmol of the gene specific primer GSP5-2 (5'- TCACTTTAGATGTAAGAAGCACA-CATG: position 596-622 in the IWF5 cDNA sequence). In the second PCR 50 pmol of the 5'-UNI primer was used as upstream primer and the downstream primer was 50 pmol of the gene specific primer GSP5-3 (5'- TAAGCAGAAAGTTCCAGAAAGCA-TG: position 548-572 in the IWF5 cDNA sequence). The condition for the first PCR was. 1 min at 94°C and 1 min at 51 °C and 2 min at 72°C for 35 cycles followed by 10 min at 72°C. The condition for the second PCR was: 1 min at 94°C and 1 min at 55°C and 2 min at 72°C for 35 cycles followed by 10 min at 72°C. The single 600 bp DNA product was cloned into the pT7Blue vector (Novagen) and sequenced using a Termo Sequenase fluorescent cycle sequencing kit (Amersham) and an ALF DNA sequencer (Pharmacia).

IWF6 3' RACE.

The amino acid sequence of the rWF6 protein was used to construct two degenerated oligonucleotide primers for the isolation of a partial cDNA clone by 3'RACE. Total RNA was purified from sugar beet (cv. Monova) leaves 6 days after inoculation with C. beticola according to Collinge et al. 1987 (Plant Mol. Biol 8: 405-414). Reverse transcription followed by PCR was done with the RT-PCR kit from Perkin Elmer and according to their protocol. Briefly, lμg of total RNA and 2.5 pmol Q τ -primer was incubated at 42°C for 45 min with reverse transcriptase followed by incubations at 99°C for 5 min and 5°C for 5 min. In the first PCR 40 pmol of the primer Q 0 was used as downstream primer and the upstream primer was 150 pmol of the degenerated primer (5'-GG(AGCT)TA(CT)TG(CT)AA(CT)AT(ACT)(TC)T: position 297-313 in the IWF6 cDNA sequence). In the second nested PCR 50 pmol of the primer Q, was used as downstream primer and the upstream primer was 50 pmol of the degenerated primer (5'-AA(CT)GT(ACGT)TG(CT)TG(CT)GC(ACGT)GG: position 314-332

in the IWF6 cDNA sequence). The PCR conditions were: 1 min at 94°C, 2 min at 42°C, 1 min at 50°C and 3 min at 72°C for 1 cycle followed by 1 min at 94°C and 2 nun at 42°C and 3 min at 72°C for 35 cycles followed by 10 min at 72°C. After the second PCR a single DNA product of 320 bp was obtained The DNA product was cloned into the pT7Blue vector (Novagen) and sequenced using a Termo Sequenase fluorescent cycle sequencing kit (Amersham) and an ALF DNA sequencer (Pharmacia)

IWF6 5' RACE.

The sequence of the 5' end of IWF6 cDNA was obtained by 5'RACE using the 5'RACE system from Gibco BRL with 3 gene specific primers constructed from the partial cDNA sequence obtained by 3'RACE Briefly, lμg of the same total RNA as used for 3'RACE and 2.5 pmol of a gene specific primer GSP6-1 (5'-CATCAAGA-AGTCCATAATTGTCTAG position 508-532 in the IWF6 cDNA sequence) was incubated at 70°C for 10 min followed by the addition of reverse transcriptase and incubating at 42°C for 30 min, 70°C for 15 min and the addition of RNaseH and incubating further 10 min at 55°C The cDNA was dC-tailed according to the protocol of Gibco BRL. The tailed cDNA was subjected to two rounds of PCR In the first PCR 20 pmol of the 5 '-ANKER primer was used as upstream primer and the downstrejin primer was 20 pmol of the gene specific primer GSP6-2 (5'- TGATCTTTATTGAC-AAACAGACGC position 473-498 in the IWF6 cDNA sequence) In the second PCR 50 pmol of the 5' -UNI primer was used as upstream primer and the dow nstream primer was 50 pmol of the gene specific primer GSP6-3 (5'- ACAGACACGCTAGTT-AGATGACTAAGC position 456-482 in the IWF6 cDNA sequence) The condition for the first PCR was 1 min at 94°C and 1 min at 51°C and 2 min at 72°C for 35 cycles followed by 10 min at 72°C. The condition for the second PCR was * 1 min at 94°C and 1 min at 55°C and 2 min at 72°C for 35 cycles followed by 10 min at 72°C The single 510 bp DNA product was cloned into the pT7Blue vector (Novagen) and sequenced using a Termo Sequenase fluorescent cycle sequencing kit (Amersham) and an ALF DNA sequencer (Pharmacia)

IWF7 3 * RACE.

The amino acid sequence of the IWF7 protein was used to construct two degenerated

oligonucleotide primers for the isolation of a partial cDNA clone by 3'RACE. Total RNA was purified from sugar beet (cv. Monova) leaves 6 days after inoculation with C. beticola according to Colhnge et al. 1987 (Plant Mol. Biol 8: 405-414). Reverse transcription followed by PCR was done with the RT-PCR kit from Perkin Elmer and according to their protocol. Briefly, lμg of total RNA and 2.5 pmol Q τ -primer was incubated at 42°C for 45 mm with reverse transcriptase followed by incubations at 99°C for 5 min and 5°C for 5 min. In the first PCR 40 pmol of the primer Q 0 was used as downstream primer and the upstream primer was 150 pmol ofthe degenerated primer (5'-GA(AG)CA(AG)AA(AG)CC(ACGT)TGA(CT)(CT)T. position 247-263 in the IWF7 cDNA sequence). In the second nested PCR 50 pmol of the primer Q, was used as downstream primer and the upstream primer was 50 pmol of the degenerated primer (5'-TG(CT)GG(ACGT)TA(CT)TA(CT)AA(AG)AA: position 265-286 in the IWF7 cDNA sequence) The PCR conditions were. 1 mm at 94°C, 2 min at 42°C, 1 min at 50°C and 3 min at 72°C for 1 cycle followed by 1 min at 94°C and 2 min at 42°C and 3 min at 72°C for 35 cycles followed by 10 mm at 72°C. After the second PCR a single DNA product of 270 bp was obtained The DNA product was cloned into the pT7Blue vector (Novagen) and sequenced using a Termo Sequenase fluorescent cycle sequencing kit (Amersham) and an ALF DNA sequencer (Pharmacia)

IWF7 5' RACK.

The sequence ot the 5" end of IWF7 cDNA was obtained by 5'RACE using the 5'RACE sy stem from Gibco BRL with 3 gene specific primers constructed from the partial cDNA sequence obtained by 3'RACE Briefly, lμg of the same total RNA as used for 3'RACE and 2 5 pmol of a gene specific primer GSP7- 1 (5'-CCTAATTTC-CCTCAAATCACG: position 443-463 in the IWF7 cDNA sequence) was incubated at 70°C for 10 min followed by the addition of reverse transcriptase and incubating at 42°C for 30 nun, 70°C for 15 min and the addition of RNaseH and incubating further 10 min at 55°C The cDNA was dC-tailed according to the protocol of Gibco BRL. The tailed cDNA was subjected to two rounds of PCR In the first PCR 20 pmol of the 5'-ANKER primer was used as upstream primer and the downstream primer was 20 pmol of the gene specific primer GSP7-2 (5'- AATTTCCCTCAAATCACGAATTGAG: position 436-460 in the IWF7 cDNA sequence). In the second PCR 50 pmol of the 5'-UNI primer was used as upstream primer and the

downstream primer was 50 pmol of the gene specific primer GSP7-3 (5'- TCGTCAGTTTTGGCTCATTTTGGG: position 400-423 in the IWF7 cDNA sequence). The condition for the first PCR was 1 min at 94°C and 1 min at 51°C and 2 nun at 72°C for 35 cycles followed by 10 min at 72°C The condition for the second PCR was: 1 min at 94°C and 1 nun at 55°C and 2 nun at 72°C for 35 cycles followed by 10 nun at 72°C. The single 450 bp DNA product was cloned into the pT7Blue vector (Novagen) and sequenced using a Termo Sequenase fluorescent cycle sequencing kit (Amersham) and an ALF DNA sequencer (Pharmacia).

Production of transformed plants

The genes encoding proteins according to the invention are introduced into plants Based on gene specific primers, the coding regions of the genes encoding the proteins are synthesized from corresponding mRNA using PCR After addition of promoter and terminator sequences, the genes encoding the said proteins are introduced into a plant transformation vector. The vector may optionally include a gene encoding a WIN protein, such as that obtained from stressed barley leaf or barley grain, and/or a gene encoding one or more of the proteins depicted in SEQ ID Nos 4 to 9, and/or a gene encoding a chitinase and or a glucanase. One possible chitinase is the chitinase 4 described in PCT Patent Application No PCT/DK92/00I08 (Publication No WO 92/17591 ) Agrobacterium tumefaciens, for example, may be transformed with these vectors Plant cells are then treated with such transformed Agrobacterium. and the thus transformed plant cells are regenerated into whole plants, in which the new nuclear mateπal is stably incoφorated into the genome It will be appreciated, however, that the DNA encoding a protein according to the present invention, (or combination of such proteins), optionally further encoding other proteins, may be introduced into plant cells by other known methods, including use of a micro-projectile gun, electroporation, electro-transformation, and micro-injection etc, and that regeneration of transformed plant cells is carried out according to methods known to the skilled man, including treatment of the cells with cytokines where this is necessary or desirable in order to improve the regeneration frequency

Moreover, suitable micro-organisms (1 e. those in which the production of the present inventive proteins is not substantially toxic) may be transformed with a vector comprising the gene (or genes) encoding the protein so that the transformed micro-organisms produce such protein The micro-organisms may further comprise the genes encoding other proteins, such as a WIN protein and or a protein the sequence of which is depicted in one or more of SEQ IDs No 4-9 Furthermore, such other proteins may further comprise vaπous chitmases and/or glucanases One possible such other protein is the chitinase 4 as described in PCT Patent Application No PCT/DK92/00108 (Publication No WO 92/17591)

These micro-organisms may then be used to combat plant pathogens For example the transformed micro-organisms may be dried and sprayed onto infected plants or plants at risk of infection

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(l) APPLICANT:

(A) NAME: Sandoz Ltd.

(B) STREET: Lichtstrasse 35

(C) CITY: Basel

(D) STATE* BS

(E) COUNTRY- Switzerland

(F) POSTAL CODE (ZIP) : CH-4002

(G) TELEPHONE* +41-61-324 64 28 (H) TELEFAX- +41-61-322 75 32

(A) NAME. Sandoz Patent GmbH

(B) STREET* Hu boldtstrasse 59

(C) CITY. Loerrach

(E) COUNTRY- Germany

(F) POSTAL CODE (ZIP) : D-7850

(A) NAME: Sandoz Erfindungen Ver altungsgesellschaf

MBH

(B) STREET Brunnerstrasse 59 IC) CITY Vienna

(E) COUNTRY. Austria

(F) POSTAL CODE (ZIP) : A-1230

(n) TITLE OF INVENTION: Anti-microbial proteins (m) NUMBER OF SEQUENCES: 34

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOΞ

(D) SOFTWARE Patentin Release #1.0, Version #1.25 (EPO)

(2) INFORMATION FOR SEQ ID NO:l:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH- 722 base pairs

(B) TYPE, nucleic acid

(C) STRANDEDNESS- single

(D) TOPOLOGY- unknown

(ll) MOLECULE TYPE. cDNA

(m) HYPOTHETICAL: NO

(ill) ANTI-SENSE: NO

(vi ) ORIGINAL SOURCE:

(A) ORGANISM: Beta vulgaris

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 :

ACCATCTAGT ACCAATTAAT TACAACTATC TCTCTCTCTC TTTCTCTCTA GGCCTCTTTC 60

TTAAAGTTTG TGATAGGATT AGCAGCAATC CCAACATGGC TAGGGCAGCT AATCTAAAGC 120

TGTTATGCGC ATTTATTCTG AGCATAGTGG TGTTCACACC ACATGCAGAG GCCGCCATTA 180

ACTGTGGTCT GGTCTCTCAG AGCCTTGCAG CATGCCTTGG ATTCCTAGAG AATGGTCAGG 240

GACCAAATGC AGCTTGCTGC AACGGTGTTA AGACACTCCG AAACTTGACT CCCACAACCC 300

AGGACAAAAG AACGGCTTGT AGGTGCATGA AATCAGCCGC TTCAGCCATT CCCGGCATCA 360

ACCATAAGTA CTCAGCTGCA CTTCCCGGCA AATGTGGGGT CAGCATTCCA GGGCCTGTTG 420

GCCCCCAGGC AGACTGCTCT CAGATCCACT AGACCTGAAG TTTCCAGGAG GGGAACACTA 480

GCAAAACAAA GAATAGTTGG GTTCTGACTT CATACAAGCA AAATCTATAG TAAATTCCCA 540

TGAAGTGCAT GCTTTCTGGA ACTTTCTGCT TATCAAGTTA TTATTACATA TATACCATGT 600

GTGCTTCTTA CATCTAAAGT GACTTACATA ATCTCCAATT CCATGTAAGA GATAGCAAGG 660

AAGATTAAAT ATTGAAATAA AATCYCTTAT TGGTTAAAHC CCAAAAAAAA AAAAAAAAAA 720

AA 722 (2) INFORMATION FOR SEQ ID NO:2 :

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 587 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(ill) ANTI-SENSE: NO

(Vi ) ORIGINAL SOURCE:

(A) ORGANISM: Beta vulgaris

(ix) FEATURE-

(A) NAME/KEY: CDS

<B> LOCATION: 129..374

(x:) SEQUENCE DESCRIPTION: SEQ ID NO: 2 :

ATAAGTGGCA AGCAAAAAGA AATCATTTCA CTAAAAACTG CAAAAAGAAA ATGGAGAAGT 60

CATCATGCTT TAAGCTGGTT TTCTTAGTCT TTCTGTTGCT CAACATTTCA GCTTCAACAC 120

TTGCAAGA ATG CAA CCA ACT ATA ATG GAA GAA CTT ACA AAG GAA CAA GTT 170 Met Gin Pro Thr lie Met Glu Glu Leu Thr Lys Glu Gin Val 1 5 10

CTT GAA GAA CTT GGT GCT TAC AAG CAG ATA ATA GCA TCA GCA GCA ACA 218 Leu Glu Glu Leu Gly Ala Tyr Lys Gin lie lie Ala Ser Ala Ala Thr 15 20 25 30

AGA GAC CGC GGC GAG TTA TTA AGA ACG GTG ATA GAA GCG GTT GCG CGA 266 Arg Asp Arg Gly Glu Leu Leu Arg Thr Val lie Glu Ala Val Ala Arg 35 40 45

CCG CGA CCG CGA CCG TGC ATA AGG GCT GGC GGC TAC TGT AAC ATT TTG 314 Pro Arg Pro Arg Pro Cys lie Arg Ala Gly Gly Tyr Cys Asn lie Leu 50 55 60

AAC GTA TGT TGT GCT GGG TTG ACT TGT GAG GAA CAT GAT ATA CAA GAC 362 Asn Val Cys Cys Ala Gly Leu Thr Cys Glu Glu His Asp lie Gin Asp 65 70 75

GCC GTC TGC GTC TAAATCTTGG ACTTGCTTAA AAAAATGTGT TATTATAAGT 414

Ala Val Cys Val 80

GTATTTGGCC TTTTAGGCTT GATCTTCAAT TTCGTCTTTA AGCTTAGTCA TCTAACTAGC 474

GTGTCTGTTT GTCAATAAAG ATCATTGTCT TCTCTAGACA ATTATGGACT TCTTGATGCT 534

TTTATTTTAA TAATATAAAA TATTTCCTCG CTTTCAAAAA AAAAAAAAAA AAA 587

(2 ) INFORMATION FOR SEQ ID NO: 3 :

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 82 ammo acids

(B) TYPE: ammo acid (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: :

Met Gin Pro Thr lie Met Glu Glu Leu Thr Lys Glu Gin Val Leu Glu 1 5 10 15

Glu Leu Gly Ala Tyr Lys Gin lie lie Ala Ser Ala Ala Thr Arg Asp 20 25 30

Arg Gly Glu Leu Leu Arg Thr Val lie Glu Ala Val Ala Arg Pro Arg 35 40 45

Pro Arg Pro Cys lie Arg Ala Gly Gly Tyr Cys Asn lie Leu Asn Val 50 55 60

Cys Cys Ala Gly Leu Thr Cys Glu Glu His Asp lie Gin Asp Ala Val 65 70 75 80

Cys Val

(2) INFORMATION FOR SEQ ID NO: 4:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 510 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ll) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(ill) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Beta vulgaris

(IX) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 76..357

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4 :

TTATAATAAC TCCCTCTCTT ACACAAACAC ATAACATAAA TCTCCTCTAT ACATTTTTCT 60

CTTCTAAGCA TCATT ATG GCC AAG GTG GCT ACT CTT ACC CTA CTT GCC GTG 111 Met Ala Lys Val Ala Thr Leu Thr Leu Leu Ala Val

1 5 10

GTT GTC GTG GCG GTG CTA CTA TTC GAG ACA CCA ACG ACC GAG GCG GTT 159 Val Val Val Ala Val Leu Leu Phe Glu Thr Pro Thr Thr Glu Ala Val 15 20 25

ACC TGC AGT GCA GTG CAG CTG AGC CCT TGC GCA CCA GCA ATT ATG TCC 207 Thr Cys Ser Ala Val Gin Leu Ser Pro Cys Ala Pro Ala lie Met Ser 30 35 40

AAC CAA ACA CCA ACA AGC GCA TGT TGT GCA AAA TTG AGG GAG CAA AAA 255 Asn Gin Thr Pro Thr Ser Ala Cys Cys Ala Lys Leu Arg Glu Gin Lys 45 50 55 60

CCT TGC CTT TGT GGA TAC TAC AAG AAC CCT ACT CTT AGG CCT TAC ATT 303 Pro Cys Leu Cys Gly Tyr Tyr Lys Asn Pro Thr Leu Arg Pro Tyr lie 65 70 75

AAT TCC CCT GGT GCT AAA CGT GTG GCT TCT ACT TGT AAA GTC AGC GTT 351 Asn Ser Pro Gly Ala Lys Arg Val Ala Ser Thr Cys Lys Val Ser Val 80 85 90

AGC TGC TAAACATATG TACCAGTGTT ACCTAATTTG TTATCTAATT CGCCCAAAAT 407 Ser Cys

GAGCCAAAAC TGACGATAAT TAGCTTTTCT CAATTCGTGA TTTGAGGGAA ATTAGGTAAC 467 TACTGATATA TATACCTTCC ACAAAACAAA AAAAAAAAAA AAA 510

(2) INFORMATION FOR SEQ ID NO:5 :

(:) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 94 amino acids

(B) TYPE: ammo acid (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

Met Ala Lys Val Ala Thr Leu Thr Leu Leu Ala Val Val Val Val Ala

1 10 15

Val Leu Leu Phe Glu Thr Pro Thr Thr Glu Ala Val Thr Cys Ser Ala 20 25 30

Val Gin Leu Ser Pro Cys Ala Pro Ala lie Met Ser Asn Gin Thr Pro 35 40 45

Thr Ser Ala Cys Cys Ala Lys Leu Arg Glu Gin Lys Pro Cys Leu Cys 50 55 60

Gly Tyr Tyr Lys Asn Pro Thr Leu Arg Pro Tyr lie Asn Ser Pro Gly 65 70 75 80

Ala Lys Arg Val Ala Ser Thr Cys Lys Val Ser Val Ser Cys 85 90

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 92 amino acids (D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(ill) HYPOTHETICAL: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM. Beta vulgaris

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 :

Ala lie Asn Cys Gly Leu Val Ser Gin Ser Leu Ala Ala Cys Leu Gly 1 5 10 15

Phe Leu Glu Asn Gly Gin Gly Pro Asn Ala Ala Cys Cys Asn Gly Val 20 25 30

Lys Thr Leu Arg Asn Leu Thr Pro Thr Thr Gin Asp Lys Arg Thr Ala 35 40 45

Cys Arg Cys Met Lys Ser Ala Ala Ser Ala lie Pro Gly lie Asn His 50 55 60

Lys Tyr Ser Ala Ala Leu Pro Gly Lys Cys Gly Val Ser lie Pro Gly 65 70 75 80

Pro Val Gly Pro Gin Ala Asp Cys Ser Gin lie His 85 90

(2) INFORMATION FOR SEQ ID NO.7

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH. 91 amino acids

(B) TYPE amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unKnown

(ll) MOLECULE TYPE: protein

(m) HYPOTHETICAL: NO

(ill) ANTI-SENSE. NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM- Beta vulgaris

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7 : lie Thr Cys Gly Leu Val Ala Ser Lys Leu Ala Pro Cys lie Gly Tyr

1 5 10 15

Leu Gin Gly Ala Pro Gly Pro Ser Ala Gly Cys Cys Gly Gly He Lys 20 25 30

Gly Leu Asn Ser Ala Ala Ala Ser Pro Ala Asp Arg Lys Thr Ala Cys 35 40 45

Thr Cys Leu Lys Ser Ala Ala Thr Ser Met Lys Gly He Asn Tyr Gly 50 55 60

Lys Ala Ala Ser Leu Pro Arg Gin Cys Gly Val Ser He Pro Tyr Ala

65 70 75 80

He Ser Pro Asn Thr Asn Cys Asn Ala He His 85 90

(2) INFORMATION FOR SEQ ID NO: 8 :

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 91 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(i ) MOLECULE TYPE, protein

(in) HYPOTHETICAL- NO

(m) ANTI-SENSE NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Beta vulgaris

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

He Thr Cys Gly Leu Val Ala Ser Lys Leu Ala Pro Cys He Gly Tyr

1 5 10 15

Leu Gin Gly Ala Pro Gly Pro Ser Ala Gly Cys Cys Gly Gly He Lys 20 25 30

Gly Leu Asn Ser Ala Ala Ala Ser Pro Ala Asp Arg Lys Thr Ala Cys 35 40 45

Tnr Cys Leu Lys Ser Ala Ala Thr Ser Met Lys Gly He Asn Tyr Gly 50 55 60

Lys Ala Ala Ser Leu Pro Arg Gin Cys Gly Val Ser He Pro Tyr Ala 65 70 75 80

He Ser Pro Asn Thr Asn Cys Asn Ala He His 85 90

!2) INFORMATION FOR SEQ ID NO: 9

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(n) MOLECULE TYPE: protein (m) HYPOTHETICAL: NO (ill) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Beta vulgaris

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Ser Gly Glu Cys Asn Met Tyr Gly Arg Cys Pro Pro Gly Tyr Cys Cys 1 5 10 15

Ser Lys Phe Gly Tyr Cys Gly Val Gly Arg Ala Tyr Cys Gly 20 25 30

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 46 ammo acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Beta vulgaris

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

Ala He Cys Lys Lys Pro Ser Lys Phe Phe Lys Gly Ala Cys Gly Arg 1 5 10 15

Asp Ala Asp Cys Glu Lys Ala Cys Asp Gin Glu Asn Trp Pro Gly Gly 20 25 30

Val Cys Val Pro Phe Leu Arg Cys Glu Cys Gin Arg Ser Cys 35 40 45

(2) INFORMATION FOR SEQ ID NO: 11:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 46 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(111) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Beta vulgaris

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Ala Thr Cys Arg Lys Pro Ser Met Tyr Phe Ser Gly Ala Cys Phe Ser

1 5 10 15

Asp Thr Asn Cys Gin Lys Ala Cys Asn Arg Glu Asp Trp Pro Asn Gly 20 25 30

Lys Cys Leu Val Gly Phe Lys Cys Glu Cys Gin Arg Pro Cys 35 40 45

(2) INFORMATION FOR SEQ ID NO:12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Beta vulgaris

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

Arg Cys He Pro Cys Gly Gin Asp Cys He Ser Ser Arg Asn Cys Cys 1 5 10 15

Ser Pro Cys Lys Cys Asn Phe Gly Pro Pro Val Pro Arg Cys Thr Asn 20 25 30

(2) INFORMATION FOR SEQ ID NO: 13:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(in) HYPOTHETICAL: NO

(in) ANTI -SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Beta vulgaris

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 66..293

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ACTCAACAAA TTCAGAAAAA AACAGAAGCA AAAAAAGTTT ATTGAAAGAG TAAvjTTGAGG 60

TGAAA ATG ATG AAA AGC TTT GTG ATA GTT ATG TTG GTC ATG TCC ATG 107

Met Met Lys Ser Phe Val He Val Met Leu Val Met Ser Met 1 5 10

ATG GTG GCT ACA TCT ATG GCA AGT GGT GAA TGC AAT ATG TAT GGT CGA 155 Met Val Ala Thr Ser Met Ala Ser Gly Glu Cys Asn Met Tyr Gly Arg 15 20 25 30

TGC CCC CCA GGG TAT TGT TGT AGC AAG TTT GGC TAC TGT GGT GTC GGA 203 Cys Pro Pro Gly Tyr Cys Cys Ser Lys Phe Gly Tyr Cys Gly Val Gly 35 40 45

CGC GCC TAT TGT GGC GAT GCT GAG CAG AAG GTT GAA GAT CAT CCA TCT 251 Arg Ala Tyr Cys Gly Asp Ala Glu Gin Lys Val Glu Asp His Pro Ser 50 55 60

AAT GAT GCT GAT GTT CCT GAG TTT GTT GGA GCT GGT GCC CCA 293

Asn Asp Ala Asp Val Pro Glu Phe Val Gly Ala Gly Ala Pro 65 70 75

TGATGCTCGA AGCCAGGTAA TCGTAATGGC ATGGGTTACC TAATAAGTAA ACTCATTGTG 353

CCTAGCTTGC TACATGCTTA TCCACTATAA ATAAGCTCCT ACAGGAGTTG TGTTTTTCTT 413

TTAATTTTGT AATCAAGGGT TTGACTTTAA TTAATGAGAC CAATGTATAC TTGCATGTCG 473

GATAAATATT AACTAAGCCA CTCGTATTGG TTTATTATAA AACTACTATA AAAAAAAAAA 533

AAAAAAAAAA AAAAAA 549

(2) INFORMATION FOR SEQ ID NO: 14:

(1) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 ammo acids

(B) TYPE: amino acid (D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Met Lys Ser Phe Val He Val Met Leu Val Met Ser Met Met Val 1 10 15

A.a T.-r St-r en Ala Ser Gly Glu Cys Asn Met Tyr Gly Arg Cys Pro

2C 25 30

Pro Gly Tyr Cys Cys Ser Lys Phe Gly Tyr Cys Gly Val Gly Arg Ala 35 40 45

Tyr Cys Gly Asp Ala Glu Gin Lys Val Glu Asp His Pro Ser Asn Asp 50 55 60

Ala Asp Val Pro Glu Phe Val Gly Ala Gly Ala Pro 65 70 75

(2) INFORMATION FOR SEQ ID NO: 15:

(1) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ill) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO

( i) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: CCAGTGAGCA GAGTGACGAG GACTCGAGCT CAAGCTTTTT TTTTTTTTTT TT 52

(2) INFORMATION FOR SEQ ID NO: 16:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(iii) HYPOTHETICAL: NO

(m) ANTI-SENSE- NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 16 : CCAGTGAGCA GAGTGACG 18

(2) INFORMATION FOR SEQ ID NO: 17:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ill) HYPOTHETICAL. NO

(ill) ANTI -SENSE: NO

(vi) OFIGINAL SOURCE:

(Ai ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: GAGGACTCGA GCTCAAGC 18

(2) INFORMATION FOR SEQ ID NO: 18:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ill) HYPOTHETICAL: NO

(m) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: GGCCACGCGT CGACTAGTAC GGGGGGGGGG 30

(2) INFORMATION FOR SEQ ID NO:19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(lli) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: GGCCACGCGT CGACTAGTAC G 21

(2) INFORMATION FOR SEQ ID NO: 20:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(m) HYPOTHETICAL: NO

(m) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20. GCACGTTGCT ACGACGTTGC TATGAA 26

(2) INFORMATION FOR SEQ ID NO:21:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(in) HYPOTHETICAL: NO

(ill) ANTI-SENSE: NO

(vi ) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: GGACGTATAC TAACTCACTA AGATA 25

(2) INFORMATION FOR SEQ ID NO:22:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY : unknown

(m) HYPOTHETICAL: NO

(111) ANTI-SENSE: NO

(vi ) ORIGINAL SOURCE.

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: TGGAATTGGA GATTATGTAA G 21

(2) INFORMATION FOR SEQ ID NO:23:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ill) HYPOTHETICAL: NO

(m) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO:23: TCACTTTAGA TGTAAGAAGC ACACATG 27

(2) INFORMATION FOR SEQ ID NO:24.

(-) SEQUENCE CHARACTERISTICS:

(A) LENGTH. 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: εmg1e (D) TOPOLOGY: unknown

(in) HYPOTHETICAL: NO

(m) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

TAAGCAGAAA GTTCCAGAAA GCATG 25

(2) INFORMATION FOR SEQ ID NO:25:

(l) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base pairs

(B) TYPE: nucleic ac d

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(lil) HYPOTHETICAL: NO

(m) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

GGAGCTTACT TGCTAACTAT ACTTCT 26

(2) INFORMATION FOR SEQ ID NO: 26-

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE- nucleic acid

(C) STRANDEDNESS. single

(D) TOPOLOGY unknown

(ill) HYPOTHETICAL: NO

(ill) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION. SEQ ID NO.26: AACTGTACGT TGCTTGCTGC ACGTGG 26

(2) INFORMATION FOR SEQ ID NO:27:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH. 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS single

(D) TOPOLOGY, unknown

(ill) HYPOTHETICAL: NO

(m) ANTI-SENSE. NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 27: CATCAAGAAG TCCATAATTG TCTAG 25

(2) INFORMATION FOR SEQ ID NO:28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(iii) HYPOTHETICAL: NO

(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: TGATCTTTAT TGACAAACAG ACGC 24

(2) INFORMATION FOR SEQ ID NO: 29:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: unknown

(m) HYPOTHETICAL: NO

(m) ANTI-SENSE NO

(vi) ORIGINAL SOURCE.

(A) ORGANISM- primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: ACAGACACGC TAGTTAGATG ACTAAGC 27

(2) INFORMATION FOR SEQ ID NO: 30:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single (D 1 TOPOLOGY- unknown

111. I HYPOTHETICAL NO

(iii) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(Al ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: GAAGCAAGAA AGCCACGTTG ACTCTT 26

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY, unknown

(m) HYPOTHETICAL: NO (m) ANTI-SENSE: NO

(Vl) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: TGCTGGACGT TACTTACTAA AGAA 24

(2) INFORMATION FOR SEQ ID NO: 32:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH. 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ill) HYPOTHETICAL: NO

(m) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO:32: CCTAATTTCC CTCAAATCAC G 21

(2) INFORMATION FOR SEQ ID NO: 33: d) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ill) HYPOTHETICAL: NO I ill) ANTI -SENSE: NO

(v i OF:C:?, * AL SOURCE:

IA ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 AATTTCCCTC AAATCACGAA TTGAG 25

(2) INFORMATION FOR SEQ ID NO: 34:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: unknown

(ill) HYPOTHETICAL: NO

(ill) ANTI-SENSE: NO

(vi ) ORIGINAL SOURCE:

(A) ORGANISM: primer

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: TCGTCAGTTT TGGCTCATTT TGGG 24