Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
A FUNGUNS WHEREIN THE areA, pepC AND/OR pepE GENES HAVE BEEN INACTIVATED
Document Type and Number:
WIPO Patent Application WO/1997/022705
Kind Code:
A1
Abstract:
The present invention relates to fungi, which do not produce proteases. The fungi of the invention are useful as hosts for the production of proteins susceptible of proteolytic degradation by the proteases usually produced, and the invention consequently encompasses processes for the production of proteins of interest in high yields by using the fungi of the invention. The invention also comprises methods for producing such fungi and DNA constructs to be used in these methods.

Inventors:
CHRISTENSEN TOVE (DK)
LEHMBECK JAN (DK)
Application Number:
PCT/DK1996/000528
Publication Date:
June 26, 1997
Filing Date:
December 16, 1996
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NOVO NORDISK AS (DK)
CHRISTENSEN TOVE (DK)
LEHMBECK JAN (DK)
International Classes:
C12N15/09; C07K14/38; C12N1/14; C12N1/15; C12N15/80; C12P21/02; C12R1/66; C12R1/69; C12R1/72; C12R1/77; C12R1/80; (IPC1-7): C12N15/80; C12N1/15
Domestic Patent References:
WO1995035385A11995-12-28
WO1992017595A11992-10-15
WO1990000192A11990-01-11
Foreign References:
EP0574347A21993-12-15
Other References:
MOLEC. GEN. GENET., Volume 126, 1973, HERBERT N. ARST, Jr. et al., "Nitrogen Metabolite Repression in Aspergillus Nidulans", pages 111-141.
Download PDF:
Claims:
PATENT CLAIMS
1. A fungus, wherein the areA gene by recombinant DNA technology has been modified in a way by which it cannot be expressed in a way providing for a functional AreA activator, and wherein the genes encoding for the extracellular proteases PepC and/or PepE have been inactivated in a manner whereby they are not expressed to produce functional proteases.
2. The fungus of claim 1, wherein said inactivation has been obtained by deletion of all or parts of the areA, pepC, and/or pepE genes .
3. The fungus of claim 1, wherein said inactivation has been obtained by deletion of all or parts of the areA, and pepE genes.
4. The fungus of claim 1, wherein said inactivation has been obtained by interfering with the regulation of the expression signals regulating the expression of the areA, pepC, and/or pepE genes themselves.
5. The fungus of claim 1, wherein said inactivation has been obtained by interfering with the regulation of the expression signals regulating the expression of the areA, and pepE genes themselves.
6. The fungus of claim 1, wherein said inactivation has been obtained by using antisense technology.
7. The fungus of claim 1, wherein said inactivation has been obtained by inserting extra DNA internally in the areA, pepC, and/or pepE genes.
8. The fungus of any of claims 1 to 7, being a filamentous fungus, preferably belonging to a genus selected from the group comprising Aspergillus , Trichoderma , Humi cola , Candida , Acremonium, Fusarium, and Penicill ium.
9. The fungus of claim 8, which belongs to a species selected from the group comprising A. oryzae, A . niger, A . awamori , A . phoenicis , A . japonicus, A . foetidus, A . nidulans, T. reesei , T. harzianum, H. insolens , H. lanuginosa , F. graminearum, F. solani , P . chrysogenu , and others.
10. A method for producing a fungus according to claim 1, wherein said inactivation has been obtained by deletion of the areA, pepC, and/or pepE genes, which method comprises i) cloning of the areA, pepC, and/or pepE genes from a fungus of interest, ii) producing DNA constructs each comprising one among the areA gene, the pepC gene, and/or the pepE gene, wherein an internal part has been substituted, deleted, or extra DNA has been inserted, iii) transforming said fungus with the constructs, and iv) isolating transformants which are areA' , pepC, and/or pepE' .
11. A method for producing a fungus according to claim 1, wherein said inactivation has been obtained by deletion of the areA and pepE genes, which method comprises i) cloning of the areA and pepE genes from a fungus of interest, ii) producing DNA constructs each comprising one among the areA gene and the pepE gene, wherein an internal part has been substituted, deleted, or extra DNA has been inserted, iii) transforming said fungus with the constructs, and iv) isolating transformants which are areA' , and pepE'.
12. A method for producing a fungus according to claim 1, wherein said inactivation has been obtained by deletion of the areA and pepC genes, which method comprises i) cloning of the areA and pepC genes from a fungus of interest, ii) producing DNA constructs each comprising one among the areA gene and the pepC gene, wherein an internal part has been substituted, deleted, or extra DNA has been inserted, iii) transforming said fungus with the constructs, and iv) isolating transformants which are areA' , and pepC .
13. A method for producing a fungus according to claim 1, wherein said inactivation has been obtained by deletion of the pepC and pepE genes, which method comprises i) cloning of the pepC and pepE genes from a fungus of interest, ii) producing DNA constructs each comprising one among the pepC gene and the pepE gene, wherein an internal part has been substituted, deleted, or extra DNA has been inserted, iii) transforming said fungus with the constructs, and iv) isolating transformants which are pepC , and pepE' .
14. A method for producing a fungus according to claim 1, wherein said inactivation has been obtained by using antisense technology, which method comprises i) construction of expression plasmids, each of which give rise to synthesis of an RNA molecule complementary to the mRNA transcribed from the areA gene, the pepC gene, and/or the pepE gene, ii) transformation of the host fungus with said expression plasmids and a suitable marker, either on separate plasmids or on the same plasmid, iii) selection of transformants using said marker, and iv) screening selected transformants for strains exhibiting a reduction in the synthesis of the AreA, PepC, and/or PepE products.
15. A method for producing a fungus according to claim 1, wherein said inactivation has been obtained by using antisense technology, which method comprises i) construction of expression plasmids, each of which give rise to synthesis of an RNA molecule complementary to the mRNA transcribed from the areA gene and the pepE gene, ii) transformation of the host fungus with said expression plasmids and a suitable marker, either on separate plasmids or on the same plasmid, iii) selection of transformants using said marker, and iv) screening selected transformants for strains exhibiting a reduction in the synthesis of the AreA, PepC, and PepE products.
16. A process for the production of a desired gene product, whereby a fungus according to any of the claims 1 to 9 is cultivated in a suitable growth medium at appropriate conditions and the desired gene product is recovered and purified.
17. A process for the production of a desired gene product, whereby a fungus according to any of the claims 1 to 9, which has been transformed to integrate a DNA sequence coding for the desired gene product into the genome of the fungus in a functional manner, is cultivated in a suitable growth medium at appropriate conditions and the desired gene product is recovered and purified.
18. A process for producing a desired polypeptide compri sing cultivating a fungus in an appropriate growth medium and recovering said polypeptide from said culture, said fungus carrying a recombinant DNA construct capable of causing expression of said polypeptide or a precursor thereof in said fungus, said fungus further being characterized by producing lower amounts of functional AreA, PepC, and/or PepE than the wildtype of said fungus.
19. A method according to claim 18, wherein said fungus has been modified to produce lower than wildtype amounts of AreA, PepC, and/or PepE by a process comprising transforming a parent of said fungus with DNA constructs capable of causing reduced 10 production of functional AreA, PepC, and/or PepE when integrated in the genome of said fungus .
20. A method according to claim 18, wherein said polypepti¬ de is secreted to the extracellular medium by said fungus.*& 15.
21. A method according to claim 18, wherein said fungus produces higher amounts of said polypeptide than a similar fungus where said similar fungus produces AreA, PepC, and/or PepE in amounts similar to those produced by the wildtype of 20 said fungus, said similar fungus being identical to said fungus in all other respects.
22. The process of claim 14 or 15 to 21, wherein said gene product is a secreted protein.*& 25.
23. The process of any of the claims 14 to 20, wherein said desired gene product is an industrial peptide or protein, preferably an enzyme.
24. 30 24.
25. The process of claim 23, wherein said enzyme is selec¬ ted from the group comprising a protease, lipase, cutinase, cellulase, chymosin.
26. The process of any of the claims 15 to 22, wherein said 35 desired gene product is a therapeutically active peptide or protein.
27. The process of claim 25, wherein said therapeutically active peptide or protein is selected from the group comprising insulin, growth hormone, glucagon, somatostatin, interferon, PDGF, factor VII, factor VIII, urokinase, tPA, EPO, or TPO.
28. A gene product produced in accordance with any of the processes 15 to 26.
29. A DNA sequence coding for the pepC gene from A . oryzae (SEQ ID No. 1) or functional alleles thereof.
30. A PepC protease from A . oryzae (SEQ ID No. 2) .
31. A process for the production of the PepC protease of claim 29 comprising transforming a suitable host with a DNA construct comprising a DNA sequence of claim 28, selecting a transformant capable of producing said PepC protease, cultivating said transformant in an appropriate growth medium and recovering said PepC protease from said culture .
32. A DNA sequence coding for the pepE gene from A . oryzae (SEQ ID No. 3) or functional alleles thereof.
33. A PepE protease from A . oryzae (SEQ ID No. 4) .
34. A process for the production of the PepE protease of claim 32 comprising transforming a suitable host with a DNA construct comprising a DNA sequence of claim 31, selecting a transformant capable of producing said PepE protease, cultivating said transformant in an appropriate growth medium and recovering said PepE protease from said culture.
35. The process of claim 30 or 33, wherein said host is a fungus, according to any of the claims 1 to 9.
36. The process of claim 30, 33 or 34, wherein said host is A. oryzae, and wherein said DNA construct comprising a DNA sequence of claim 28 or 31, provides for an extra copy of the gene encoding said PepC or PepE protease.
Description:
Title: A fungus wherein the are A, pep C and/or pep E genes have been inactivated

FIELD OF THE INVENTION

The present invention relates to fungi, which do not produce proteases. The fungi of the invention are useful as hosts for the production of proteins susceptible to proteolytic degradation by the proteases usually produced, and the invention consequently encompasses processes for the production of proteins of interest in high yields by using the fungi of the invention. The invention also comprises methods for producing such fungi and DNA constructs to be used in these methods .

BACKGROUND OF THE INVENTION

Fungi, and especially filamentous fungi, are widely used commercially because of their ability to secrete remarkably high levels of proteins

Among the filamentous fungi species belonging to the genus Aspergillus have a long history of commercial use for the production of endogenous and lately also heterologous proteins.

One disadvantage with most microorganisms used for the production of proteins is the inherent production of proteases which may subject a protein product of interest to degradation due to proteolysis.

Various ways of avoiding this have been envisaged. Among other solutions it has been suggested to delete or disrupt the genes encoding the various proteases. Unfortunately, the fungi produce a high number of proteases making such a solution more or less unrealistic.

A need is therefore persisting for strains of filamentous fungi exhibiting no or very low levels of protease production.

For a number of years it has been known that the regulatory gene areA which mediates nitrogen metabolite repression in A . nidulans influences the production of

extracellular proteases (Arst & Cove, molec . gen . Genet . 126, (1973) 111-141) .

The areA gene from A . nidulans has been cloned (Caddick et al . , EMBO Journal 5, (1986) 1087-1090) and various

5 modifications made to it to evaluate functions of different regions in the activator protein encoded by this gene

(Stankovitch et al . Mol . Microbiol . 1 , (1993) 81-87) . Furthermore the gene coding the corresponding function in A . fumiga tus apparently has been cloned recently (Hensel et al . 2nd European

10 Conference on Fungal Genetics, April 28 to May 1, 1994, Book of

Abstracts, Ell) .

From the literature a single use is also known of a strain of A . nidulans of genotype argrB areAl as a host for the production of t-PA (Upshall et al . Biotechnology 5, (1987) 1301- 15 1304) . In this example only the arcjB genotype is used as a selection marker through its arginine prototrophy, while the areA genotype is simply a coincidence .

International Patent Publication No. WO 95/35385 discloses the deletion of the areA gene as a means for reducing 20 the protease level in filamentous fungi.

Apart from the extracellular proteases, fungi also produce a number of intracellular proteases (also called endoplasmic) .

Among these a serine protease of the subtilisin type 25 produced by A . niger and designated PepC has been described, the gene expressing it cloned, and a deletion mutant described in EP 574 347 and in Frederick et al . , Gene, 125 57-64 (1993)

A further such protease of the aspartic type designated

PepE has been disclosed in Jarai et al . , Gene, 145 171-178

30 (1994) . the article discloses the cloning and characterisation of the pepE gene and speculates about the regulation of the pepE and pepC genes .

The present invention has as an object the alleviation of the need for protease free filamentous fungi.

35

SUMMARY OF THE INVENTION

The present invention consequently relates to fungi, wherein the areA, pepC, and/or pepE genes by recombinant DNA technology have been modified such that they cannot be expressed in a way providing for a functional AreA activator and functional

PepC and/or PepE proteases.

The invention furthermore relates to methods for producing such fungi, obtained by deletion of the areA, pepC, and/or pepE genes.

This may be obtained through a method comprising i) cloning of the areA, pepC, and/or pepE genes from a fungus of interest, ii) producing DNA constructs each comprising one among the areA gene, the pepC gene, and the pepE gene, wherein an internal part has been substituted, deleted, or extra

DNA has been inserted, iii) transforming said fungus with the constructs, and iv) isolating transformants which are areA ' , pepC , and/or pepE ~ .

The information obtained from the above mentioned cloning of the areA, pepC, and/or pepE genes may also be used in connection with the well-known anti-sense technology, to construct an expression plasmid giving rise to synthesis of a RNA molecules complementary to the mRNA transcribed from the areA, pepC, and/or pepE genes, and to transform the fungus of interest therewith. The invention furthermore relates to DNA constructs intended for use in the above mentioned methods.

Furthermore the invention relates to methods of producing a desired protein or gene product, especially secreted proteins, whereby a fungal host modified and optionally transformed with a DNA construct comprising at least a DNA sequence coding for the protein or gene product of interest, is

cultivated in a suitable growth medium at appropriate conditions and the desired gene product is recovered and purified.

When working with the invention it was surprisingly found that the fungi of the invention produces such secreted proteins in a much improved yield.

It was also surprisingly found that the only nitrogen source capable of providing good growth of the A . oryzae areA strains was glutamine.

The invention furthermore relates to protein products produced by the above methods.

Also the invention relates to a DNA sequence coding for the pepC gene from A . oryzae (SEQ ID No. 1) or functional alleles thereof.

The invention also covers a PepC protease from A . oryzae (SEQ ID No. 2) , and processes for the production of the PepC protease comprising transforming a suitable host with a DNA construct comprising a DNA sequence coding for the PepC protease, selecting a transformant capable of producing said PepC protease, cultivating said transformant in an appropriate growth medium and recovering said PepC protease from said culture .

Furthermore the invention relates to a DNA sequence coding for the pepE gene from A . oryzae (SEQ ID No. 3) or functional alleles thereof. Also, the invention relates to a PepE protease from A . oryzae (SEQ ID No. 4) , and processes for the production of the PepE protease comprising transforming a suitable host with a DNA construct comprising a DNA sequence coding therefore, sele -ing a transformant capable of producing said PepE protease, cultivating said transformant in an appropriate growth medium and recovering said PepE protease from said culture.

According to these aspects said host is preferably a fungus, according to the invention, especially A . oryzae, and wherein said DNA construct provides for an extra copy of the gene encoding either said PepC or PepE protease.

BRIEF DESCRIPTION OF THE DRAWING

The invention is described in further detail in the following parts of the specification with reference to the Examples and the drawing, wherein

Fig. 1 shows the steps involved in the construction of HowBlOl,

Fig. 2 shows the construction of pToC345, Fig. 3 shows the steps involved in the construction of pToC315.

Fig. 4 diagrammatically shows a two step gene deletion of the pyrG gene.

Fig. 5 shows the construction of pJaL235,

Fig. 6 shows the steps involved in the construction of pJaL335, Fig. 7 shows the steps involved in the construction of pJaL363,

Fig. 8 shows the steps involved in the construction of pJaLz,

Fig. 9 shows the construction of pSKS and pSK9,

Figs. 10a and 10b show the steps involved in the construction of pToC243 and pToC266, Fig. 11 shows the steps involved in the construction of pMT1606,

Fig. 12 shows the construction of pToC56,

Figs. 13a and 13b show the steps involved in the construction of pJaL368, and

Figs. 14a and 14b show the construction of pToC338.

DEFINITIONS

In the present specification the following definitions are used:

The expression areAD means a strain in which the areA gene is deleted. Similar notations are used for strains, wherein the pepC, and/or pepE genes are deleted.

The expression areA ' means a strain which does not produce a functional AreA activator. The term "loss of function"

is also often used for this. Similar notations used for strains, which do not produce functional PepC, and/or PepE protease (s) .

The expression "anti-sense technology" describes methods such as disclosed in US Patent No. 5,190,931.

DETAILED DESCRIPTION OF THE INVENTION

As indicated the present invention relates in its first aspect to fungi, wherein the areA gene by recombinant DNA technology has been modified in a way by which it cannot be expressed in a way providing for a functional AreA activator, and wherein the genes encoding for the extracellular proteases PepC and/or PepE has been inactivated in a manner whereby they are not expressed to produce functional proteases, This object may specifically be obtained by deletion or disruption of the areA, pepC, and/or pepE genes.

The cloning of the areA, pepC, and/or pepE genes are described in the Examples.

AreA homologs from other fungi could be cloned either by cross hybridization with one of the already known genes or by complementation of areA mutants; e.g. A . nidulans areA-18 or the A . oryzae areA deleted strain described in this application.

Methods for deleting or disrupting a gene are specifically described in WO 90/00192 (Genencor) . Methods for substituting DNA in a gene are also generally known, and can be accomplished by substituting one or more continuous parts of the gene, but it may also be obtained by site directed mutagenesis generating a DNA sequence encoding a AreA activator variant that is not functional. Another method by which such an object may be obtained is by using anti-sense technology.

The anti-sense technology and how to employ it is described in detail in the aforementioned US Patent No. 5,190,931 (University of New York) .

A further method of obtaining said inactivation is by inserting extra DNA internally in the areA gene, thereby giving rise to the expression of a dysfunctional activator protein.

In connection with this method information provided by the cloning can be used to make DNA constructs that can be in¬ tegrated into the areA gene, and even replace it with another gene, such as the pyrG gene.

A further method of avoiding the presence of the areA activator is by interfering with the regulation of the expression signals regulating the expression of the areA gene itself.

The principles described above apply equally to the pepC, and/or pepE genes .

According to the invention the fungus preferably belongs to a genus selected from the group comprising Aspergi ll us, Trichoderma , Humicola, Candida , Acremonium, Fusarium, and Peni cil li um .

Among these genera species selected from the group comprising A . oryzae, A . niger, A . awamori, A . phoeni cis, A . japonicus , A . foe tidus, A . nidulans , T. reesei , T. harzianum, H. insol ens , H. lanuginosa , F. graminearu , F. solani , P . chrysogenu , and others are preferred.

As indicated the invention also is meant to encompass the method for producing the fungi of the first aspect of the invention, and wherein said inactivation has been obtained by deletion of the areA, pepC, and/or pepE genes, which method comprises i) cloning of the areA, pepC, and/or pepE genes from a fungus of interest, ii) producing DNA constructs each comprising one among the areA gene, the pepC gene, and/or the pepE gene, wherein an internal part has been substituted, deleted, or extra

DNA has been inserted, iii) transforming said fungus with the constructs, and iv) isolating transformants which are areA ' , pepC , and/or pepE ' .

Since it is believed that the maturation of the PepC protease is controlled by the PepE protease the invention also comprises a method for producing a fungus of the invention, wherein said inactivation has been obtained by deletion of the areA and pepE genes, which method comprises i) cloning of the areA and pepE genes from a fungus of interest, ii) producing DNA constructs each comprising one among the areA gene and the pepE gene, wherein an internal part has been substituted, deleted, or extra DNA has been inserted, iii) transforming said fungus with the constructs, and iv) isolating transformants which are areA , and pepE ' .

Also included is the method for producing the fungi, wherein the inactivation has been obtained by using anti-sense technology. Such a method comprising i) construction of expression plasmids, each of which give rise to synthesis of an RNA molecule complementary to the mRNA transcribed from the areA gene, the pepC gene, and/or the pepE gene, ii) transformation of the host fungus with said expression plasmids and a suitable marker, either on separate plasmids or on the same plasmid, iii) selection of transformants using said marker, and iv) screening selected transformants for strains exhibiting a reduction in the synthesis of the AreA, PepC, and/or

PepE products.

A further aspect of the invention is meant to comprise DNA constructs for use in the above mentioned methods.

In respect of the former method said DNA constructs may comprise the areA, pepC, and/or pepE genes, wherein an internal part has been substituted, deleted, or extra DNA has been inserted.

At least one of the DNA constructs may furthermore also comprise DNA sequences encoding a protein product of interest, such as those mentioned later.

In respect of the latter anti-sense method the DNA constructs may comprise inverted DNA sequence of the areA, pepC, and/or pepE genes connected to a functional promoter, whereby the mRNAs are at least partially complementary to mRNAs produced from the areA, pepC, and/or pepE genes.

A further aspect of the invention relates to a process for the production of a desired gene product, preferably a secreted gene product, whereby a fungus according to the invention is cultivated in a suitable growth medium at appropriate conditions and the desired gene product is recovered and purified. In the case of a gene product expressed by a heterologous gene the DNA sequence coding for the desired gene product may be a part of the DNA construct used for producing said fungus.

Normally, however, a separate transformation of the fungus of the invention is performed in order to make the fungus capable of producing the desired product .

Methods for transforming fungi are well known in the art, cf. e.g. EP 0 184 438 A2 (Gist-Brocades N.V. ) and EP publication No. 0 98 993 (Novo Nordisk A/S) . For indigenous products this is of course not necessary, but in order to increase the production it may be an advantage to provide for multiple copies of the gene encoding the protein of interest to be incorporated into the host .

The desired gene product is generally a peptide or protein, preferably an enzyme.

Among enzymes it is preferably selected from the group comprising proteases, such as trypsin and chymosin; lipases, cutinases, cellulases, xylanases, laccases, pectinases, etc.

Another type of desired gene product is generally a therapeutically active peptide or protein.

Among the therapeutically active peptide or protein the protein preferably is selected from the group comprising insulin,

growth hormone, glucagon, somatostatin, interferons, PDGF, factor VII, factor VIII, urokinase, t-PA, CSF, lactoferrin, TPO etc.

A further aspect of the invention relates to the DNA sequences coding for the pepC gene from A . oryzae (SEQ ID No. 1) , the pepE gene from A . oryzae (SEQ ID No. 3)or functional alleles thereof. Also encompassed by the invention are the corresponding PepC and PepE proteases and their production, preferably by recombinant means. In this aspect the invention relates to processes for the production of the PepC protease or PepE protease from A . oryzae comprising transforming a suitable host with a DNA construct comprising a DNA sequence encoding the protease of interest, selecting a transformant capable of producing the protease, cultivating the transformant in an appropriate growth medium and recovering the PepC or PepE protease from the culture.

The host used in such a process is preferably a host according to the above mentioned aspects of the invention In certain embodiments of the process for producing the

PepC or PepE protease the host is A . oryzae . In that case it is preferred that the DNA construct comprising a DNA sequence coding for the protease, provides for an extra copy of the gene already present in the host . The DNA construct comprising the DNA sequence encoding the protease will normally also comprise regulatory elements in order to provide for proper expression and processing of the protease in the host .

The invention is explained in further detail in the Examples given below. These should, however, not in any way be construed as limiting the scope of the invention as defined in the appended claims.

EXAMPLES

Materials and Methods

Strains

A . oryzae , IF04177: available from Institute for Fermentation,

Osaka; 17-25 Juso Hammachi 2-Chome Yodogawa- Ku, Osaka, Japan. ToC913: The construction of this strain is described m the Examples.

Genes areA : This gene codes for a regulatory protein controlling nitrogen catabolism. pepC: This gene codes for a serine protease of the subtilism type pepE: This gene codes for an aspartic protease. pyrG: This gene codes for orotidme-S' -phosphate decarboxylase, an enzyme involved m the biosynthesis of uridine. bar: This gene was originally isolated from Streptomyces hygroscopicus and codes for phosphmothricin acetyltransferase . The enzyme modifies phosphmothricin

(=glufosmate) and thereby inactivates this compound which is toxic to bacteria, fungi and plants.

Plasmids pUC118- Viera and Mes q J. Meth . Enzγmol . 1987 153 3-11 pS02 : The construction of this plasmid is described in the Examples pJers4 : A 2.0 kb subclone of pS02 in pUCllβ . pJers4 contains a functional A. oryzae pyrG gene. pS05 : The construction of this plasmid from pS02 is described in the Examples . pToC56: The construction of this plasmid is described in EP publication No. 0 98 993.

pToC68 : The construction of this plasmid is described in WO

91/17243. pToC90: A subclone of p3SR2, harboring the amdS gene from

Aspergillus nidulans as a 2.7 kb Xbal fragment [Corrick et al . , GENE 1987 53 63-71] , on a pUC19 vector [Yannisch-Perron et al . , GENE 1985 33 103-119] , prepared as described in WO 91/17243. pToC266: The construction of this plasmid is described in the

Examples. pToC299: The construction of this plasmid is described in the

Examples . pToC338: The construction of this plasmid is described in the

Examples. pMT1606 : The construction of this plasmid from pBPIT (B. Straubinger et al . Fungal Genetics Newsletter

39 (1992) :82-83) and p775 (EP publication No. 0 98 993) is described in the Examples. p775: The construction of this plasmid is described in EP publication No. 0 98 993. pill : The construction of this plasmid is described in EP publication No. 0 98 993. pHW470: The construction of this plasmid is described in the

Examples .

Example 1

Cloning and deletion of the A . oryzae peyE gene.

The A . oryzae pepE gene was cloned by cross- hybridization with the A . niger gene. A partial A . niger gene was obtained as a 700 bp PCR fragment from a PCR reaction with A . niger chromosomal DNA and pepE specific primers made according to the pepE sequence published by G. Jarai et al, Gene 145 (1994) 171-178. The fragment was shown to contain pepE sequences by DNA sequencing. It hybridizes to A . oryzae

chromosomal DNA under stringent conditions and Southern analysis showed that A . oryzae contains a single pepE like gene.

The pepE gene was deleted both by the gene replacement method and the two step gene replacement method (G. May in "Applied Molecular Genetics of Filamentous Fungi" (1992) pp. 1- 25. Eds. J. R. Kinghorn and G. Turner; Blackie Academic and Professional) . As marker was used the A . oryzae pyrG gene, the A . oryzae strain was a pyrG- strain made by deletion of the pyrG gene .

Cloning of the A . oryzae pepE gene

A cosmid library of Aspergillus oryzae was constructed essentially according to the instruction from the supplier (Stratagene) of the "SuperCosl cosmid vector kit" . Genomic DNA of A . oryzae IF04177 was prepared from pro¬ toplasts made by standard procedures (Christensen, T., et . al. , Biotechnology 6 (1988) 1419-1422) . After isolation of the protoplasts they were pelleted by centrifugation at 2500 rpm for 5 minutes in a Labofuge T (Heto) , the pellet was suspended in 10 mM NaCI, 20 mM Tris-HCl (pH 8.0) , 1 mM EDTA, 100 μg/ml proteinase K and 0.5% SDS as stated in the manual from the Supercos 1 cosmid vector kit and the rest of the DNA preparation was done according to the kit's instructions. The size of the genomic DNA was analysed by electrophoresis using the CHEF-gel apparatus from Biorad. A 1% agarose gel was run for 20 hours at 200 volt with a 10-50 second pulse. The gel was stained by etidium bromide and photographed. The DNA was 50->100 kb in size. The DNA was partially restricted by Sau3A . The size of the restricted DNA was 20-50 kb determined by the same type of CHEF-gel analysis as above. The CsCl gradient banded SuperCosl vector was prepared according to the manual. Ligation and packaging was likewise done as described. After titration of the library all of the packaging mix from one ligation and packaging was transfected into the host cells XLl-Blue MR and plated on 50 μg/ml ampicillin LB plates. Approximately 3800 colonies were obtained. Cosmid preparation from 10 colonies showed that they all had inserts of the expected

size. The colonies were picked individually and inoculated in microtiter plate wells with 100 μl LB (100 μg/ml ampicillin) and incubated at 37°C over night. 100 μl of 50% glycerol was added to each well and the whole library was frozen at -80°C. A total of 5 3822 colonies were stored. This represents the A . oryzae genome apr. 4.4 times.

The individually frozen colonies in the library were inoculated onto LB-plates (lOOμg/ml ampicillin) by using a multipin device with 6 times 8 pins fitting into half a 0 microtiter dish. Plates were made containing colonies from all clones in the library. The plates were incubated at 37°C over night. Sterilized Whatman 540 filters cut to the size of a petri dish were placed upon the colonies which were incubated for two more hours at 37 ° C. The filters were transferred to LB plates 5 containing 200μg/ml of chloramphenicol and the plates were incubated over night at 37 ° C.The next day the filters were washed twice in 0.5 M NaOH for 5 minutes, then twice in 0.5 M Tris-HCl (pH=7.4) for 5 minutes and then twice in 2 x SSC for 5 minutes. The filters were wet with ethanol and air-dried. 0 The filters were hybridized with a 0.7 kb 32 P labelled

PCR fragment containing part of the pepE gene from A . niger. The PCR fragment was obtained by running PCR on chromosomal DNA from A . niger with two primer 700 bp apart in the DNA sequence. The hybridization was carried out for 16 hours at 65°C in 10 * 5 Denhart, 5 x SSC, 0.02 M EDTA, 1% SDS, 0.15 mg/ml polyA and 0.05 mg/ml yeast tRNA. After hybridization the filters were washed in 2 x SSC, 0.1% SDS at 45 ° C twice and placed on X-ray films. 5 colonies hybridized with the probe, 4 of them were subsequently shown to contain the A . oryzae pepE gene by Southern analysis of 0 the isolated cosmid DNA, using the same probe. Three of the cosmids were identical, thus two different cosmid clones containing pepE were isolated, they were called 7C7 and 33C1, names referring to their position in the stored library. Two overlapping fragments, a 4.3 kb EcoRI fragment (pToC299) and a 5 2.4 kb Hindlll (pToC301) fragment, were subcloned and partially sequenced. SEQ. ID No. 1 shows the DNA sequence and the deduced

aa sequence for the protease . The gene shows strong homology to the A. niger gene.

Cloning of the A . oryzae pyrG gene The A . oryzae pyrG gene was cloned by cross hybridization with the A . niger pyrG gene (W. van Hartingsveldt et al., Mol. Gen. Genet 206:71-75 (1987)) . A lambda library of partial SauIIIA digested A . oryzae IF04177 DNA was probed at low stringency with a 1 kb DNA fragment from the A . niger pyrG gene. A 3.8 kb Hindlll fragment from a positive clone was subcloned into a pUC118 vector. The resultant plasmid, pS02, was shown to contain the pyrG gene by complementation of an A . niger pyrG mutant .

Construction of an A . oryzae pyrG minus strain

A pyrG deletion plasmid, pS05, containing about 1 kb of pyrG flanking sequences on each end was constructed from the plasmid pS02. A . oryzae IF04177 was transformed with this construct and transformants were selected by resistance to 5- fluoro-orotic acid (FOA) , a phenotype characteristic of pyrG mutants. One transformant, HowBlOl, was shown by Southern analysis to have the expected deletion at the pyrG locus . Being a pyrG mutant HowBlOl requires uridine for growth. HowBlOl can be transformed with the wt pyrG gene by selection for ability to grow without uridine.

The steps involved in the construction of HowBlOl are illustrated in Fig. 1.

Deletion of the pevE gene in A . oryzae . By the gene replacement method

A plasmid, pToC345, designed to replace the pepE gene with the pyrG gene, was constructed.

Two PCR reactions were run with pToC299 as template; the first primer set was :

19819 GAAGATCTGCGCGGATGTACATTGTAG 19821 TTAGTCAGAAATTCGTCCCG

The second was :

19820 CCCAAGCTTCATGCTCGACCAGGGCCTCCT 5 19818 GGTCTGTGTTAACCAAAGAAC

The appr. 800 bp fragment obtained with 19819/19821 was cut with Bgl II /Hindlll and cloned together with the 1.1 kb fragment obtained with 19820/19818 and cut with Hindlll/Ps tI into Bglll/Ps tI cut pIC19R (J. L. Marsh et al, Gene 32 (1984)

10 481-484) . The resulting plasmid was cut at the unique Hindlll site, dephosphorylated and the 3.5 kb pyrG containing fragment from pJaL335 (described in Example 2) was inserted. The construction of pToC345 is illustrated in Figure 2.

HowBlOl was transformed with EcoRI cut pToC345 using

15 standard procedures and transformants were selected by their ability to grow without the addition of uridine. 100 transformants were reisolated once through conidiospores . Spores were picked from single colonies on the reisolation plates and suspended in 100 ml of water with 0.01% Trition X-100. 1 ml

20 spore suspension from each transformant and from IF04177, which was included as a control strain, were spotted on two Whatmann 540 filters placed on top of each their YPD plate. The plates were incubated at 30°C for 18 hours. The filters were removed from the plates and placed in 20% SDS for two hours at room

25 temperature. They were then bakes for 3 minutes in a 600W micro- wawe oven. The filters were then washed for 5 minutes in 10%SDS, 2 times 5 minutes in 0.5M NaOH, 1.5M NaCI, one time 5 minutes in 0.5MTris-HCl pH=7.5, 1.5M NaCI and one time 5 minutes in 20xSSC and air dried. The two sets of filters were hybridized by

30 standard procedures with each their 32p_labelled probe. One set was hybridized with a 600bp Bbul/ Hindlll fragment from pToC299 containing the part of the pepE gene that was attempted to be deleted. The other set of filters was hybridized with a DNA fragment from the A. oryzae tpi gene. Any gene present in one

35 copy, but pepE could be used since this is a control of the amount of DNA bound to the filters.

After hybridization the filters were washed with O.lxSSC, 0.1% SDS at 65°C and the radioactivity bound to the filters were visualized by a Phospolmager. 13 of the transformants were picked for further analysis because they showed little hybridization to the pepE probe compared to the hybridization to the control probe. Chromosomal DNA was prepared by standard procedures and a Southern blot of the EcoRI restricted DNA was hybridized with a 32 P-labelled 1. lkb Bbul fragment from pToC299 containing the 3' part of the pepE gene which was not to be deleted. In the wt strain a 4.3 kb fragment should hybridize to the probe, in a correct replacement strain the 4.3 kb fragment should be replaced by a 7.2kb fragment. Two of the transformants looked correct, one had no hybridizing bands at all and most had the wt band plus maybe one other band, indicating integration of the transforming DNA at a non- homologous locus .

In order to isolate a pyrG " derivative of the pepE deleted strain 10 7 conidiospores were spread on FOA containing plates and resistant colonies were selected. The FOA resistant colonies were reisolated, DNA was prepared and Southern analysis was performed to identify the strains in which the pyrG gene was lost via recombination between the repeat sequences flanking the gene in pToC345.

Deletion of the vepE gene in A . oryzae by the two step gene replacement method

A plasmid, pToC315, designed for a two step gene deletion of the pepE gene was constructed. A 1.6 kb EcoRI/Hindlll (the Hindlll site was blunt ended by treatment with the Klenow fragment of DNA polymerase) from pToC299 containing sequences upstream from the pepE gene was cloned together with a 1.4 kb Sai l /Bbul (the Bbul fragment was blunt ended) containing the 3 'end of the pepE gene into the EcoRI/Sall cut vector pUC19. The resulting plasmid was cut at the unique Hindlll site in the pUC19 linker, dephosphorylated and the 1.8 kb pyrG containing fragment

from pJers4 was inserted. The construction of pToC315 is illustrated in Fig. 3.

HowBlOl was transformed with pToC315 using standard procedures and transformants were selected by their ability to grow without the addition of uridine. After reisolation chromosomal DNA was prepared from 12 transformants, the DNA was cut with Asp718 and analysed by Southern analyses with a Bbul fragment from pToC301 containing part of the pepE gene as a radioactive labelled probe. One transformant had the plasmid integrated in the endogenous pepE gene as revealed by the disappearance of the pepE specific Asp718 fragment, which had been replaced by two new bands as predicted if pToC315 had integrated as a single copy by homologous recombination at the pepE locus. The transformant was named ToC1089. 5xl0 7 conidiae spores of ToC1089 were spread on plates containing 5-fluoro- orotic acid selecting for loss of the pyrG gene. This is the second step in a two step gene deletion, the pyrG gene can be lost by recombination with either of two pairs of identical sequences, one of which will result in the deletion of the pepE gene as well. The procedure is depicted in Fig. 4. The frequency of 5-fluoro-orotic acid resistance was approximately 10 '5 . The 5- fluoro-orotic acid resistant colonies were reisolated and a strain deleted for the pepE gene was identified by Southern analysis.

EXAMPLE 2

Cloning and disruption of the Aspergillus oryzae serine protease PepC The A . oryzae pepC gene was cloned by cross- hybridization with the A . niger gene. The A . niger gene was obtained as a 1.1 kb PCR fragment from a PCR reaction with A . niger chromosomal DNA and pepC specific primers made according to the pepC sequence published by Frederick G.D et al. Gene 125 (1993) 57-64. The fragment was shown to contain pepC sequences by

DNA sequencing. It hybridizes to A . oryzae chromosomal DNA under stringent conditions and Southern analysis showed that A . oryzae contains a single pepC like gene.

The pepC gene was deleted by a two step gene replacement method (G. May in "Applied Molecular Genetics of Filamentous Fungi" (1992) pp. 1-25. Eds. J. R. Kinghorn and G. Turner; Blackie Academic and Professional) . As marker was used the A . oryzae pyrG gene, the A. oryzae strain was a pyrG " strain made by deletion of the pyrG gene.

Cloning of the A. niger serine protease pepC

From the published cDNA nucleotide sequence encoding A . niger pepC (Frederick G.D et al . Gene 125 (1993) 57-64) two oligonucleotides were designed so that the encoding part of the pepC gene where amplified in a PCR reaction. The primer #5258 (5 ' -CTAGGATCCAAGGCATTTATGAAGGGCATCCTCGGCCTTTCC) where made so that the 3 ' end of the nucleotide sequence corresponds to the N- terminal part of the pepC gene (underline) and the 5 ' -end is for facilitating cloning (contains a BamHl restriction endonuclease site) . The primer #5259

(5 ' -CTACTCGAGTCAAAAAAAAACCAAGTCTTCCGATCTACG)

where made so that the 3 ' end of the nucleotide sequence corresponds to the C-terminal part of the pepC gene and the 5'- end is for facilitating cloning (contains a Xhol restriction endonuclease site) .

Genomic DNA from A . niger was used as template in the

PCR reaction. Amplification reaction were performed in 100 μl volumes containing 2.5 units Taq-polymerase, 100 ng of A . niger genomic DNA, 50 mM KCl, 10 mM Tris-HCl pH 8.0,1.5 mM MgCl 2 , 250 nM of each dNTP, and 100 pM of each of the two primers described above.

Amplification was carried out in a Perkin-Elmer Cetus DNA Termal 480, and consisted of one cycle of 3 minutes at 94°C,

followed by 25 cycles of 1 minutes at 94°C, 30 seconds at 55°C, and 1 minutes at 72°C. The PCR reaction produces one DNA fragment of ca. 1.1 kb in length. This fragment were isolated by gel electrophoresis, purified, cloned into the vector pCR°II 5 (Invitrogen Corporation) , and sequenced using standard methods known in the art of molecular biology. The resulting plasmid were called pJaL197.

Cloning of the A . oryzae serine protease pepC

10 Southern blot of genomic DNA from A . oryzae IF04177 where hybridized with the 1.1 kb 32 P labelled EcoRI DNA fragment from pJaL197 clone containing the A . niger pepC gene. Genomic DNA was cut with the following restriction enzymes: EcoRI, BamHl, Xhol, and Hindlll . Hybridization was carried out for 16 hours at

15 65 ° C in 10 x Denhart, 5 x SSC, 0.02 M EDTA, 1% SDS, 0.15 mg/ml polyA and 0.05 mg/ml yeast tRNA. After hybridization the filters were washed in 2 x SSC, 0.1% SDS at 65 ° C twice and placed on X- ray films. The probe hybridized to a single size of fragment in each of the four digest, indicating that the pepC gene is present

20 in a single copy in A . oryzae IF04177.

A partially library of A . oryzae genomic DNA was constructed containing BamHl fragments with a size of 4.5-5.5 kb and ligated into the vector pIC19H. The above A . niger pepC gene clone was radiolabelled and used to probe the partial A . oryzae

25 BamHl genomic library. Hybridization was carried out as described above. About 4000 E. coli colonies were screened and four positive colonies was obtained. The 4 c l ones was shown to be identical by restriction enzyme digestior One of these clones called pJaL235 (Fig. 5) , with an insert of 4.6 kb, was analyses

30 further by restriction mapping and Southern blotting. This shows that the pepC gene is located in a 2.9 kb BamHl/ Sail fragment. Sequencing of this 2.9 kb BamHl/ Sail fragment revealed the presence of a long open reading frame of 495 amino acids interrupted by two introns with consensus sequences indicative of

intron splicing. The sequence of the A . oryzae pepC gene is shown in SEQ ID No. 3.

Construction of A . oryzae pyrG gene flanked by a repeated sequence

By PCR, with the primer #7659

(5 ' -GGAGGAAGATCTCTCTGGTACTCTTCGATCTC) ,

where the 3 ' end of the nucleotide sequence corresponds to position 7-26 in pS02 (underline) and the 5 '-end is for facilitating cloning (contains a Bglll restriction endonuclease site) , and the primer #7656

(5 ' -GGAGGAGAATTCAAGCTTCTTCTACATCACAGTTTGAAAGC) ,

where the 3 ' end of the nucleotide sequence corresponds to position 385-407 in pS02 (underline) and the 5 '-end is for facilitating cloning (contains a EcoRI and Hindlll restriction endonuclease site) , on the plasmid pS02 a 432 bp fragment was amplified. The fragment was digested with Bglll and EcoRI and isolated by gel electrophoresis, purified, and cloned into the corresponding site in pS02, resulting in plasmid pJaL335 (The construction is outlined in Fig. 6.

Construction of A . oryzae pepC disruption plasmid

Plasmid pJaL235 was digested with Pvul and treated with Klenow polymerase to make the ends blunt and then digested with Hindlll . The 2.6 kb fragment were isolated by gel electrophore- sis, and purified. The 2.6 kb fragment was cloned into pUC12 digested with Smal and Hindlll giving plasmid pJaL308.

Plasmid pJaL308 was digested with S al and treated with bacterial alkaline phosphatase to remove the 5 ' phosphate groups according to the manufacturers instructions and phenol extracted and precipitated.

Plasmid pS02 was digested with Hindlll, and treated with Klenow polymerase to make the ends blunt . The 3.8 kb fragment encoding the A . oryzae pyrG gene were isolated by gel electrophoresis, and purified. The two fragments are mixed together and ligated. After transformations of E. coli , the colonies carrying the correct plasmids are identified by restriction enzyme digestion of mini- plasmid preparations. The construction of pJaL363 is illustrated in Fig. 7. Plasmid pJaL363 consist of pUC12 vector containing a fragment which carries the pepC gene flanked by an EcoRI site and an Hindlll and where the pepC is interrupted by an 3.8 kb DNA fragment encoding the A . oryzae pyrG gene.

Plasmid pJaL335 is digested with Hindlll, and treated with Klenow polymerase to make the ends blunt . The 3.5 kb fragment encoding the A. oryzae pyrG gene is isolated by gel electrophoresis, and purified. The fragment is cloned into pJaL308 Smal restriction site. The construction of pJaLz is outlined in Fig. 8. The plasmid consist of pUC12 vector containing a fragment which carries the pepC gene flanked by an EcoRI site and an Hindlll and where the pepC is interrupted by an 3.5 kb DNA fragment encoding the A. oryzae pyrG gene.

Transformation of A . oryzae strain HowBlOl 15 μg of either one of the disruption plasmids is digested to completion by Hindlll and EcoRI . The completeness of the digest is checked by running an aliquot on a gel and the remainder of the DNA is phenol extracted, precipitated and resuspended in 10 μl of sterile water. The transformation of A . oryzae HowBlOl host strain is preformed by the protoplast method (Christensen et al . Biotechno¬ logy (1988) 6:1419-1422) . Typically, A. oryzae mycelia is grown in a rich nutrient broth. The mycelia is separated from the broth by filtration. The enzyme preparation Novozyme ® (Novo Nordisk) is added to the mycelia in osmotically stabilizing buffer such as

1.2 M MgS0 4 buffered to pH 5.0 with sodium phosphate. The suspension was incubated for 60 minutes at 37°C with agitation. The protoplast is filtered through mira-cloth to remove mycelial debris. The protoplast is harvested and washed twice with STC (1.2 M sorbitol, 10 mM CaCl 2 , 10 mM Tris-HCl pH 7.5) . The protoplast is finally resuspended in 200-1000 μl STC.

For transformation 5 μg DNA is added to 100 μl protoplast suspension and then 200 μl PEG solution (60% PEG 4000, 10 mM CaCl 2 , 10 mM Tris-HCl pH 7.5) is added and the mixture is incubated for 20 minutes at room temperature. The protoplast is harvested and washed twice with 1.2 M sorbitol. The protoplast is finally resuspended 200 ml 1.2 M sorbitol, plated on selective plates (minimal medium + 10 g/l Bacto-Agar (Difco) , and incubated at 37°C. After 3-4 days of growth at 37°C, stable transformants appear as vigorously growing and sporulating colonies.

Identification of a pepC deletion strain

From the stable colonies individual spores is streaked on fresh minimal plates. Single colonies is selected and restreaked to give pure cultures. These are used to inoculate 10 ml of liquid YPM medium (1% yeast extract, 1% peptone, 2% maltose) . After 18 hours at 30°C shaking at 180 rpm, the mycelia are harvested on filter paper. Mycelia is then transfer to an 2 ml eppendorf tube and freeze dried. After freeze drying DNA is prepared from the individual mycelia by grinding the mycelia to a fine powder with a pestle in the tube. This powder are resuspended in 0.5 ml of 50 mM EDTA pH 8.0, 0.2% SDS, 1 μl DEP by vortexing. These are incubate at 65°C for 20 minutes. After this is added 0.1 ml 5 M KAc pH 6.5, the solution is mixed and incubated on ice for 5 minutes. The cell debris is separated from the DNA solution by centrifugation at 20.000 rpm for 5 minutes. 0.4 ml supernatant are precipitated with 0.3 ml isopropanol and centrifugated at 20.000 rpm for 10 minutes. The DNA pellet is redissolved in 100 μl of sterile TE buffer containing 0.1 mg/ml RNAaseA.

3 μg of each DNA is digested with EcoRI, fractionated by agarose gel electrophoresis, transferred to Immobilan-N membrane

filters, and probe with the 1.5 kb 32 P labelled Ncol DNA fragment from pJaL335 containing part of the pepC protease gene. Strains which carry a disruption of the pepC are easily recognized by that the wild type band on 3.6 kb is shifted to a 7.4 kb band in the transformant.

The disrupted A . oryzae pepC strain is made pyrG minus by selecting spontaneous mutant resistance to 5-fluoro-orotic acid, a phenotype characteristic of pyrG mutants. Being a pyrG mutant the strain requires uridine for growth. The strain can be transformed with the wt pyrG gene by selection for the ability to grow without uridine.

EXAMPLE 3

Construction of an Asper jillus oryzae areAA strain

The areAA strain was constructed as follows. The areA gene from A . oryzae was cloned. A pyrG ' strain also deficient for either pepC or pepE or pepC plus pepE was transformed with a plasmid carrying the pyrG gene inserted between DNA fragments upstream and downstream from the areA gene. The coding region for areA was not present on the plasmid. Transformants were selected for their ability to grow in the absence of uridine and in the presence of chlorate. This double selection selects both for a functional pyrG gene and for areA minus. Strains obtained by this selection procedure were finally screened by Southern analysis to identify those in which the chromosomal areA gene was substituted by the pyrG gene.

Cloning of the areA gene The A . oryzae areA gene was cloned by cross hybridization to the A . nidulans areA gene (B. Kudla et al . , EMBO J. 9:1355-1364 (1990)) . A genomic library of A . oryzae IF04177 was prepared by partial digestion of chromosomal DNA with SauIIIA and cloning of the obtained DNA fragments into the vector 1GEM-II

(obtained from Promega) . Cross hybridization of the library with the A. nidulans areA gene was performed in 40% formamide at 37°C. Hybridizing 1 clones were isolated and from these fragments were sub-cloned into the vector pBluescript SK+ (obtained from Stratagene) giving rise to the plasmids pSK5 and pSK9 illustrated in Fig. 9. The cloned gene was able to complement an A . nidulans areA mutant, proving that it is indeed the A. oryzae areA homolog. 5643 bp of the clone was sequenced, and comparison of the sequences of the A . oryzae and the A . nidulans areA genes shows that they are highly homologous. The sequence of the A. oryzae areA gene is shown in SEQ ID No. 5.

Construction of the areA deletion plasmid

In order to delete the areA gene from the A. oryzae chromosome the plasmid pToC266 was constructed. pToC266 contains a 2.1 kb DNA fragment originating upstream of the areA gene

(isolated from pSK5) and a 1.4 kb DNA fragment originating downstream from the areA gene (isolated from pSK9) . The two fragments are separated by approximately 3.2 kb in the genome, the coding region is situated in this part of the gene. The A. oryzae pyrG gene from pJers4 was inserted between the areA upstream and downstream DNA fragments. The construction of pToC266 is illustrated in Figs. 10a and 10b. pToC266 has a unique EcoRI site and was linearized by cutting with this restriction enzyme before used in transformations.

Selection of A . oryzae areA_ strains

A pyrG ' strain also deficient for either pepC or pepE or pepC plus pepE is transformed with linearized pToC266. Transfor¬ mants are selected on minimal plates (Cove Biochem . biophy. Acta (1966) 113 : 51-56) containing glutamine as the nitrogen source and glucose as the carbon source. Transformants are reisolated twice on the same type of plates, and then subjected to growth test on different nitrogen sources. Transformants growing well on glutamine but not on nitrate, ammonium or urea are expected to be deleted for areA . The deletion is confirmed by Southern analysis.

EXAMPLE 4

Construction of PMT1606 A plasmid containing the Jbar gene from Streptomyces hygroscopius (C. J. Thompson et . al, EMBO J. 6 : 2519-2523 (1987) ) inserted after the A . oryzae TAKA-amylase promoter and followed by a fragment containing the transcriptional terminator and polyadenylation signal from the A . niger gla gene was constructed.

The plasmid, pMT1606, can be used for selection of glufosinate resistant transformants of A . oryzae . pMT1606 was constructed by isolating the bar gene from the plasmid pBPIT (B.

Straubinger et . al, Fungal Genetics Newsletter 39 : 82-83 (1992)) and cloning it into the fungal expression plasmid p775 described in EP publication No. 0 098 993 Al . Fig. 11 illustrates the construction of pMT1606.

EXAMPLE 5

Production of chvmosin in A . oryzae (areAA ,pepEΔ,pepC " )

An A . oryzae areAA , pepEA , pepC ' strain is transformed with the plasmid pToC56 (Fig. 12) , which is a fungal expression plasmid for the mammalian enzyme chymosin, by co-transformation

with pMT1606. Construction of the plasmid pToC56 is described in EP publication No. 0 98 993.

Transformants are selected for growth on minimal medium containing 10 mM ammonium and 1 mg/ml glufosinate and screened for the presence of pToC56 by the ability to produce chymosin. The transformants are grown in shake flasks in minimal medium containing maltodextrin and glutamine for 4 days at 30°C.The content of chymosin in the supernatants were analysed by SDS-Page and Western blotting.

Example 6

Production of PepC in A. oryzae

Construction of an expression plasmid for pepC. Plasmid pJaL235 was digested with AatJJ and Nsil and treated with Klenow polymerase to make the ends blunt. The 1.7 kb fragment was isolated by gel electrophoresis, and purified. The

1.7 kb fragment was cloned into pIC19H digested with Smal giving pJaL365. Plasmid pJaL365 was digested with BamHl and Xhol and the 1.7 kb fragment was isolated by gel electrophoresis, and purified. The 1.7 kb fragment was cloned into pToC68 digested with BamHl and Xhol giving pJaL368 (Figs. 13a and 13 b) .

An A . oryzae strain is transformed with the plasmid pJaL368, which is a fungal expression plasmid for the protease PepC, by cotransformation with pToC90.

Transformants are selected for growth on minimal medium containing 10 mM acetamide and screened for the presence of pJaL368 by the ability to produce the protease PepC.

Example 7

Overexpression of pepg

A plasmid called pToC338 carrying the pepE gene fused to the TAKA-amylase promoter from A. oryzae was constructed. Figs. 14a and 14b depicts the construction.

An EcoRI /Sal I fragment from pToC299 containing most of the coding region and appr. 430 bp of the 3' untranslated region of pepE was cloned into EcoRI /BamHl cut pUC19 together with a synthetic DNA fragment of the following sequence :

8681 GATCCACCATGAAG 8747 GTGGTACTTCAGCT

The resulting plasmid called pToC334 was cut with BamHl/ EcoRI and a fragment containing the entire structural gene of pepE with a BamHl site fused immediately upstream of the start codon was isolated, approximately 430 bp of untranslated 3' sequence was also present in the fragment. The fragment was cloned into EcoRI/ Sai l cut pUC19 together with an approximately 1.1 kb Sail/ BamHl fragment from the plasmid p775 containing the TAKA-amylase promoter from A. oryzae . The resulting plasmid was named pToC338. pToC338 was co-transformed into A. oryzae JaL125 (an A. oryzae alp minus strain described in Danish Patent Application No. 0354/96) with pToC90, which contains the A . ni dulans acetamidase { amdS) gene, using standard procedures (e.g. as described in EP 0 098 993 Al) . Transformants were selected by their ability to use acetamide as the sole nitrogen source. 11 transformants were reisolated twice through conidiospores . The transformants were fermented for three days at 30°C inlO ml YPM (YP with 2% maltose) and the fermentation broth was analysed by SDS-page. One of the transformants produced a protein of the same size as the protein encoded by the pepE gene, protease activity measurements confirmed that the broth from that transformant contained a higher activity toward casein at pH=5.5 compared to the host strain JaL125. The protein was purified and N-terminal sequencing showed that it is indeed the protein encoded by the pepE gene. The N-terminal of the secreted protein was :

gly*-arg-his-asp-val-leu-val-asp-asn-phe-leu-asn-ala-gin-tyr - phe-ser-glu-ile-glu-ile-gly-thr-pro-pro-gln-lys-phe-lys *this residue could also be a lysine.

confirming the expression of the PepE protease.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:

(A) NAME: Novo Nordisk A/S

(B) STREET: Novo Alle

(C) CITY: Bagsvaerd

(E) COUNTRY: DENMARK

(F) POSTAL CODE (ZIP) : DK-2880

(G) TELEPHONE: +45 4444 8888 (H) TELEFAX: +45 4449 3256

(ii) TITLE OF INVENTION: Novel Microorganisms

(iii) NUMBER OF SEQUENCES: 12

(iv) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2454 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE:

(A) ORGANISM: Aspergillus oryzae (ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 603..701 (ix) FEATURE:

(A) NAME/KEY: intron

(B) LOCATION: 702..791 (ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 792..942

( ix) FEATURE :

(A) NAME/KEY: intron

(B) LOCATION: 943..1001 (ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 1002..1656 (ix) FEATURE:

(A) NAME/KEY: intron

(B) LOCATION:1657..1713 (ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 171 ..2001 (ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION:join (603-701 , 792-942, 1002-1656, 1714-2001) (ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION:603..2001

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GCGCGGATGT ACATTGTAGG TACATGGATA GTATGTACTG TATATACCGT CATTTAGTAA 60

GGCAACTAAC TAACTTATCA GCCTAGCTCC CGAGACGGCC TTACATCATC CGCAGCGGCA 120

ATCAGCTCCA CACCCTTGGA TAGTGATAAG AGACACAAGG AGTTCTGAGT ATGGTATTAT 180

AAGTGCAGTG AGTTGGGATG AAACAGAGAG ATAGAGGGAA TACTCCTATT TATCAATGAA 240

CGTATACAGA CATACCCCAG CAGCGTTCCT GGCGGTATTG TAAAAGGGCC GTACCTTGGA 300

GATCAAGTGA TGAGACACCC GTGATGCAGG AACTCCACTT CAATCCAATG ACGCATCGAG 360

TTGCTCCCTG ATTGGTTGAT ACGCAGGTCG CTCCGCAACC GGTCCGCATC ACCTCACTTC 420

CCTCCCCCAG ACCTGGAGGT ACCTCTCCCG TCCTTCTCTC CCTCTCCATC CCATCATCTA 480

TCCCTCTCCA GACCCTGATT GTATTTCATC ATTCCTATCG TCCCATATTA ATAGAGTATT 540

GCTAGTTTTC TTTTGATTTC GTCTGTTGAG GTGCTGCTTT TTTGTCGCCG TTGTCGCCCA 600

CC ATG AAG TCG ACC TTG GTT ACG GCC TCT GTG CTG TTG GGC TGT GCT 647

Met Lys Ser Thr Leu Val Thr Ala Ser Val Leu Leu Gly Cys Ala

1 5 10 15

TCC GCC GAG GTT CAC AAG CTG AAG CTC AAC AAG GTG CCC GTG TCC GAG 695 Ser Ala Glu Val His Lys Leu Lys Leu Asn Lys Val Pro Val Ser Glu 20 25 30

CAA TTT GTGAGTAGAC CTTACTATTC CGGCCATGAA AATATTCATC TACCCATCTG 751 Gin Phe

AAAGCTTGTC GGGACGAATT TCTGACTAAA TCGTATCCAG AAC TTG CAC AAC ATC 806

Asn Leu His Asn lie 35

GAC ACC CAT GTG CAG GCT CTC GGC CAG AAG TAC ATG GGA ATC CGT CCC 854 Asp Thr His Val Gin Ala Leu Gly Gin Lys Tyr Met Gly lie Arg Pro 40 45 50

AAC ATC AAG CAA GAT CTT CTC AAT GAG AAC CCG ATT AAC GAT ATG GGA 902 Asn lie Lys Gin Asp Leu Leu Asn Glu Asn Pro lie Asn Asp Met Gly 55 60 65 70

CGT CAT GAT GTC CTT GTT GAC AAC TTC CTG AAT GCA CAA T 942

Arg His Asp Val Leu Val Asp Asn Phe Leu Asn Ala Gin 75 80

GTACGAAACC CTAGTAATAC TTGAAGGGGG GCTCCAACTT ACGCGTAGAT TCTCTAAAG AC 1003

Tyr

TTC TCC GAA ATC GAG ATC GGT ACT CCT CCA CAG AAG TTC AAG GTG GTC 1051 Phe Ser Glu lie Glu lie Gly Thr Pro Pro Gin Lys Phe Lys Val Val 85 90 95 100

CTT GAC ACT GGC AGC TCA AAC CTA TGG GTG CCC TCT TCG GAG TGT GGT 1099 Leu Asp Thr Gly Ser Ser Asn Leu Trp Val Pro Ser Ser Glu Cys Gly 105 110 115

TCT ATC GCC TGC TAT TTG CAT AAC AAG TAC GAC TCA TCC TCG TCC TCC 1147 Ser lie Ala Cys Tyr Leu His Asn Lys Tyr Asp Ser Ser Ser Ser Ser 120 125 130

ACG TAC CAG AAG AAT GGC AGC GAA TTT GCC ATC AAG TAC GGC TCT GGT 1195 Thr Tyr Gin Lys Asn Gly Ser Glu Phe Ala lie Lys Tyr Gly Ser Gly 135 140 145

AGC CTG AGT GGT TTT GTT TCT CAG GAT ACT CTC AAG ATC GGT GAC CTG 1243 Ser Leu Ser Gly Phe Val Ser Gin Asp Thr Leu Lys lie Gly Asp Leu 150 155 160

AAG GTG AAG GAT CAG CTG TTC GCC GAG GCT ACT AGT GAG CCC GGC CTT 1291 Lys Val Lys Asp Gin Leu Phe Ala Glu Ala Thr Ser Glu Pro Gly Leu 165 170 175 180

GCT TTT GCC TTT GGC CGC TTT GAT GGT ATC CTT GGG TTG GGA TTT GAC 1339 Ala Phe Ala Phe Gly Arg Phe Asp Gly lie Leu Gly Leu Gly Phe Asp 185 190 195

ACA ATT TCC GTC AAC AAG ATT CCT CCA CCC TTC TAT AGC ATG CTC GAC 1387 Thr lie Ser Val Asn Lys lie Pro Pro Pro Phe Tyr Ser Met Leu Asp 200 205 210

CAG GGC CTC CTC GAC GAG CCA GTC TTT GCT TTC TAC CTT GGA GAC ACT 1435 Gin Gly Leu Leu Asp Glu Pro Val Phe Ala Phe Tyr Leu Gly Asp Thr 215 220 225

AAC AAG GAA GGT GAT GAC TCC GTA GCG ACA TTC GGC GGT GTT GAC AAG 1483 Asn Lys Glu Gly Asp Asp Ser Val Ala Thr Phe Gly Gly Val Asp Lys 230 235 240

GAT CAC TAC ACC GGC GAG TTG GTC AAG ATT CCC CTT CGC CGC AAG GCC 1531 Asp His Tyr Thr Gly Glu Leu Val Lys lie Pro Leu Arg Arg Lys Ala 245 250 255 260

TAC TGG GAG GTT GAC CTT GAT GCT ATC GCC CTT GGC GAT AGC GTT GCT 1579 Tyr Trp Glu Val Asp Leu Asp Ala lie Ala Leu Gly Asp Ser Val Ala 265 270 275

GA.-. CTC GAT AAC ACC GGT GTC ATT CTG GAT ACC GGC ACT TCC CTT ATC 1627 Glu Leu Asp Asn Thr Gly Val lie Leu Asp Thr Gly Thr Ser Leu He 280 285 290

GCC TTG GCC ACC ACC CTT GCC GAG CTT AT GTAAGTCAAG CCAGTGTACT 1676 Ala Leu Ala Thr Thr Leu Ala Glu Leu He 295 300

GTGCATGTCT GTCATACTCT TACTAACTAT TCTGAAG T AAC AAG GAA ATC GGT 1729

Asn Lys Glu He Gly 305

GCC AAG AAG GGC TTC ACC GGC CAA TAC TCG GTT GAC TGT GAC AAG CGC 1777 Ala Lys Lys Gly Phe Thr Gly Gin Tyr Ser Val Asp Cys Asp Lys Arg 310 315 320

GAT TCC TTG CCT GAC CTC ACC TTC ACC CTG AGC GGA TAC AAC TTC ACC 1825 Asp Ser Leu Pro Asp Leu Thr Phe Thr Leu Ser Gly Tyr Asn Phe Thr 325 330 335

ATT GGT CCC TAC GAC TAC ACT CTT GAA GTC CAG GGA TCT TGC ATC AGC 1873 He Gly Pro Tyr Asp Tyr Thr Leu Glu Val Gin Gly Ser Cys He Ser 340 345 350 355

GCC TTC ATG GGC ATG GAC TTC CCT GAA CCC GTT GGC CCC TTG GCC ATC 1921 Ala Phe Met Gly Met Asp Phe Pro Glu Pro Val Gly Pro Leu Ala He 360 365 370

CTG GGT GAC GCG TTC CTC AGG AAG TGG TAC AGT GTG TAC GAC CTC GCC 1969 Leu Gly Asp Ala Phe Leu Arg Lys Trp Tyr Ser Val Tyr Asp Leu Ala 375 380 385

AAC GGT GCT GTT GGC CTG GCC AAG GCT AAG TAACCAAGTA ATCTACCATG 2019 Asn Gly Ala Val Gly Leu Ala Lys Ala Lys 390 395

CTATGTTCTT ATTGGTTGCT TGTGTATGTG AGACAATGGT ACATGATAGC CTGCCTCGGT 2079

AGTTGGTTGG CCTTTTTCTG TTACGGGAAA TCGGCAAAGC CTTGTTTTCG CTTATGACCT 2139

CTATCCTGTT TGTTATTGAT ATTTTGTGTG ACTCAGTGAG CCACTGGCTA TGCTCTAATG 2199

ACATTCATTG GATGCCGATA GTTCTATATA CATTGCGATT TTAACGCGTA TCTTTGATCT 2259

ATCGGTACAA TGATTCCCTA CTAAAGGTAG CCCAACTAGA CAACTATGCC TACGACCTCT 2319

CTACATTCTT CATAGCTCCG TGTGGAGTCC GTCTCATACA ACCTCGAGCA ACCTGCAGTT 2379

CTTTGGTTAA CACAGACCAC ACCTTAAAAC GGCACGATCC ATTCGAATAG ACAAGCCCTC 2439

TTAATATTTG AATTC 2454

(2) INFORMATION FOR SEQ ID NO: 2-

(l) SEQUENCE CHARACTERISTICS

(A) LENGTH- 397 ammo acids (D) TOPOLOGY linear

(li) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION SEQ ID NO 2

Met Lys Ser Thr Leu Val Thr Ala Ser Val Leu Leu Gly Cys Ala Ser 1 5 10 15

Ala Glu Val His Lys Leu Lys Leu Asn Lys Val Pro Val Ser Glu Gin 20 25 30

Phe Asn Leu His Asn He Asp Thr His Val Gin Ala Leu Gly Gin Lys 35 40 45

Tyr Met Gly He Arg Pro Asn He Lys Gin Asp Leu Leu Asn Glu Asn 50 55 60

Pro He Asn Asp Met Gly Arg His Asp Val Leu Val Asp Asn Phe Leu 65 70 75 80

Asn Ala Gin Tyr Phe Ser Glu He Glu He Gly Thr Pro Pro Gin Lys 85 90 95

Phe Lys Val Val Leu Asp Thr Gly Ser Ser Asn Leu Trp Val Pro Ser 100 105 110

Ser Glu Cys Gly Ser He Ala Cys Tyr Leu His Asn Lys Tyr Asp Ser 115 120 125

Ser Ser Ser Ser Thr Tyr Gin Lys Asn Gly Ser Glu Phe Ala He Lys 130 135 140

Tyr Gly Ser Gly Ser Leu Ser Gly Phe Val Ser Gin Asp Thr Leu Lys 145 150 155 160

He Gly Asp Leu Lys Val Lys Asp Gin Leu Phe Ala Glu Ala Thr Ser 165 170 175

Glu Pro Gly Leu Ala Phe Ala Phe Gly Arg Phe Asp Gly He Leu Gly 180 185 190

Leu Gly Phe Asp Thr He Ser Val Asn Lys He Pro Pro Pro Phe Tyr 195 200 205

Ser Met Leu Asp Gin Gly Leu Leu Asp Glu Pro Val Phe Ala Phe Tyr 210 215 220

Leu Gly Asp Thr Asn Lys Glu Gly Asp Asp Ser Val Ala Thr Phe Gly 225 230 235 240

Gly Val Asp Lys Asp His Tyr Thr Gly Glu Leu Val Lys He Pro Leu 245 250 255

Arg Arg Lys Ala Tyr Trp Glu Val Asp Leu Asp Ala He Ala Leu Gly 260 265 270

Asp Ser Val Ala Glu Leu Asp Asn Thr Gly Val He Leu Asp Thr Gly 275 280 285

Thr Ser Leu He Ala Leu Ala Thr Thr Leu Ala Glu Leu He Asn Lys 290 295 300

Glu He Gly Ala Lys Lys Gly Phe Thr Gly Gin Tyr Ser Val Asp Cys 305 310 315 320

Asp Lys Arg Asp Ser Leu Pro Asp Leu Thr Phe Thr Leu Ser Gly Tyr 325 330 335

Asn Phe Thr He Gly Pro Tyr Asp Tyr Thr Leu Glu Val Gin Gly Ser 340 345 350

Cys He Ser Ala Phe Met Gly Met Asp Phe Pro Glu Pro Val Gly Pro 355 360 365

Leu Ala He Leu Gly Asp Ala Phe Leu Arg Lys Trp Tyr Ser Val Tyr 370 375 380

Asp Leu Ala Asn Gly Ala Val Gly Leu Ala Lys Ala Lys 385 390 395

(2) INFORMATION FOR SEQ ID NO: 3:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3224 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(n) MOLECULE TYPE: DNA (genomic) (lil) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE:

(A) ORGANISM: Aspergillus oryzae (ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION:388..756 (ix) FEATURE:

(A) NAME/KEY: intron

(B) LOCATION: 757..817 (ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 818..1753 (ix) FEATURE:

(A) NAME/KEY: mtron

(B) LOCATION:1754..1814 (ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 1815..1997 <1X) FEATURE:

(A) NAME/KEY: mat_peρtide

(B) LOCATION: 388..1997 ( x) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: join (388..756, 818..1753, 1815..1997)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GGATCCATTA CCCTCTTACC GCCATACCCC AGGTCTTGCG ACCGCGCTAA TCGGGAGCGA 60 TCGACGCGCG GCACCTCCTC AGTAAAGCTG TGTCATCATT GTAAATTACC GTATCCCGGT 120 TGCATCATCC TCCGCTGCCC TTGCCTGCTT GGGGGATCGA CCTATTAAGC CCAGCTATCT 180

TACACCTGCT CCCATCCTCC TCTTCTCCAA CTCCTCATCC ATCCTTCCTC CTCCTTCTTC 240

CTTTTAACCC CCCCAACTCA GCATCGTTCC ATCCTCCCAT CTTTCCTTTC TTTTTACCTC 300

AAATCTCCAT CTGTATTCTT TCCTCTTAGA ACTCTTCCTT TCCCCCCTTC TGTACCTTGT 360

GTTTAGACGT CACTCTTGTT GCCCATC ATG AGA GGC ATC CTC GGC CTT TCC 411

Met Arg Gly He Leu Gly Leu Ser 1 5

CTG CTG CCA CTA CTA GCA GCG GCC TCC CCC GTT GCT GTT GAC TCC ATC 459 Leu Leu Pro Leu Leu Ala Ala Ala Ser Pro Val Ala Val Asp Ser He 10 15 20

CAC AAC GGA GCG GCT CCC ATT CTT TCG GCC TCA AAT GCC AAA GAG GTT 507 His Asn Gly Ala Ala Pro He Leu Ser Ala Ser Asn Ala Lys Glu Val 25 30 35 40

CCA GAC TCT TAC ATT GTC GTC TTC AAG AAG CAT GTT TCC GCT GAA ACG 555 Pro Asp Ser Tyr He Val Val Phe Lys Lys His Val Ser Ala Glu Thr 45 50 55

GCT GCT GCT CAT CAC ACC TGG GTG CAG GAC ATC CAC GAT TCG ATG ACT 603 Ala Ala Ala His His Thr Trp Val Gin Asp He His Asp Ser Met Thr 60 65 70

GGA CGC ATC GAC CTG AAG AAG CGC TCT CTT TTT GGT TTC AGT GAT GAC 651 Gly Arg He Asp Leu Lys Lys Arg Ser Leu Phe Gly Phe Ser Asp Asp 75 80 85

CTT TAC CTC GGT CTC AAG AAC ACC TTC GAT ATC GCC GGG TCC CTA GCG 699 Leu Tyr Leu Gly Leu Lys Asn Thr Phe Asp He Ala Gly Ser Leu Ala 90 95 100

GGC TAC TCC GGA CAT TTC CAT GAG GAT GTG ATC GAG CAG GTC CGG AGA 747 Gly Tyr Ser Gly His Phe His Glu Asp Val He Glu Gin Val Arg Arg 105 110 115 120

CAT CCT GAT GTAGGTTCCC CCCTCGGCCC ACCCGTTTTT GTAGAGCCCT 796

TGGTCTAACT TGATTTTCAA G GTT GAA TAC ATC GAG AAA GAC ACC GAA GTC 847

Val Glu Tyr He Glu Lys Asp Thr Glu Val 125 130

CAC ACC ATG GAG GAG ACA ACC GAG AAG AAT GCT CCC TGG GGC TTG GCT 895 His Thr Met Glu Glu Thr Thr Glu Lys Asn Ala Pro Trp Gly Leu Ala 135 140 145

CGT ATC TCT CAC CGT GAC AGC CTC TCG TTC GGT ACC TTT AAC AAG TAC 943 Arg He Ser His Arg Asp Ser Leu Ser Phe Gly Thr Phe Asn Lys Tyr 150 155 160 165

CTG TAT GCT TCG GAA GGC GGT GAG GGT GTC GAT GCT TAT ACT ATT GAC 991 Leu Tyr Ala Ser Glu Gly Gly Glu Gly Val Asp Ala Tyr Thr He Asp 170 175 180

ACT GGT ATC AAC ATT GAG CAT GTC GAT TTC GAG GAT CGA GCA CAC TGG 1039 Thr Gly He Asn He Glu His Val Asp Phe Glu Asp Arg Ala His Trp 185 190 195

GGA AAG ACC ATC CCT AGC AAT GAT GAG GAT GCG GAT GGC AAC GGA CAC 1087 Gly Lys Thr He Pro Ser Asn Asp Glu Asp Ala Asp Gly Asn Gly His 200 205 210

GGA ACT CAC TGC TCC GGA ACC ATT GCT GGT AAG AAG TAC GGT GTT GCC 1135 Gly Thr His Cys Ser Gly Thr He Ala Gly Lys Lys Tyr Gly Val Ala 215 220 225

AAG AAG GCC AAC ATC TAT GCC GTC AAG GTC TTG AGG TCC AGC GGT TCT 1183 Lys Lys Ala Asn He Tyr Ala Val Lys Val Leu Arg Ser Ser Gly Ser 230 235 240 245

GGC ACT ATG TCC GAT GTC GTT CTG GGT GTC GAG TGG GCC GTC CAG TCC 1231 Gly Thr Met Ser Asp Val Val Leu Gly Val Glu Trp Ala Val Gin Ser 250 255 260

CAC CTC AAG AAG GCT AAG GAC GCC AAA GAT GCC AAG GTC AAG GGT TTC 1279 H s Leu Lys Lys Ala Lys Asp Ala Lys Asp Ala Lys Val Lys Gly Phe 265 270 275

AAG GGC AGC GTT GCC AAC ATG AGT CTT GGT GGT GCC AAG TCC AGG ACC 1327 Lys Gly Ser Val Ala Asn Met Ser Leu Gly Gly Ala Lys Ser Arg Thr 280 285 290

CTT GAG GCT GCT GTC AAT GCT GGT GTT GAG GCT GGT CTT CAC TTC GCC 1375 Leu Glu Ala Ala Val Asn Ala Gly Val Glu Ala Gly Leu His Phe Ala 295 300 305

GTT GCT GCT GGT AAC GAC AAT GCC GAT GCC TGC AAC TAC TCC CCT GCT 1423 Val Ala Ala Gly Asn Asp Asn Ala Asp Ala Cys Asn Tyr Ser Pro Ala 310 315 320 325

GCC GCT GAG AAT GCC ATC ACT GTC GGT GCC TCG ACC CTT CAG GAT GAG 1471 Ala Ala Glu Asn Ala He Thr Val Gly Ala Ser Thr Leu Gin Asp Glu 330 335 340

CGT GCT TAC TTC TCC AAC TAC GGA AAG TGC ACT GAC ATC TTT GCC CCG 1519 Arg Ala Tyr Phe Ser Asn Tyr Gly Lys Cys Thr Asp He Phe Ala Pro 345 350 355

GGT CCC AAC ATT CTT TCC ACC TGG ACT GGC AGC AAG CAC GCT GTC AAC 1567 Gly Pro Asn He Leu Ser Thr Trp Thr Gly Ser Lys H s Ala Val Asn 360 365 370

ACC ATC TCT GGA ACC TCT ATG GCT TCT CCT CAC ATT GCT GGT CTG CTG 1615 Thr He Ser Gly Thr Ser Met Ala Ser Pro His He Ala Gly Leu Leu 375 380 385

GCC TAC TTC GTT TCT CTG CAG CCT GCT CAG GAC TCT GCT TTC GCT GTC 1663 Ala Tyr Phe Val Ser Leu Gin Pro Ala Gin Asp Ser Ala Phe Ala Val 390 395 400 405

GAT GAG CTT ACT CCT GCC AAG CTC AAG AAG GAT ATC ATC TCC ATC GCC 1711 Asp Glu Leu Thr Pro Ala Lys Leu Lys Lys Asp He He Ser He Ala 410 415 420

ACC CAG GGT GCC CTT ACT GAT ATC CCA TCT GAC ACC CCC AAC 1753

Thr Gin Gly Ala Leu Thr Asp He Pro Ser Asp Thr Pro Asn 425 430 435

GTAAGTTATA TTATCCATTT TGGTATAATG AAACAGAAAG TGGCTAACTG TTTTATTCTA 1813

G CTT CTC GCC TGG AAC GGC GGT GGT GCC GAC AAC TAC ACC CAG ATT 1859 Leu Leu Ala Trp Asn Gly Gly Gly Ala Asp Asn Tyr Thr Gin He 440 445 450

GTC GCC AAG GGT GGA TAC AAG GCC GGC AGT GAC AAC CTT AAG GAC CGC 1907 Val Ala Lys Gly Gly Tyr Lys Ala Gly Ser Asp Asn Leu Lys Asp Arg 455 460 465

TTT GAC GGA CTA GTC AAC AAG GCC GAG AAG TTG CTC GCT GAG GAG CTT 1955 Phe Asp Gly Leu Val Asn Lys Ala Glu Lys Leu Leu Ala Glu Glu Leu 470 475 480

GGA GCT ATT TAC AGT GAG ATC CAG GGT GCT GTT GTT GCA TAG 1997 Gly Ala He Tyr Ser Glu He Gin Gly Ala Val Val Ala * 485 490 495

ATGCAAGACA AGACTTGATT TAGAGTGACG TAACTAGTTT CGTTTATGGC AGGGTATGGG 2057

AATTGGCTAA CCGAACACTG GCGCTGGTAT TTGTTTTTGC TGCTGCTTTT TGGTAACACG 2117

GAGAAGCCGA TGCATTGACT GCATTGGGTA CATTATCCTG ACATGGTTTA CCTGGTCTTT 2177

CATTATTATT ATAGCATACA TGTCCACAAC AATCTTTGAC ATCCTATCTA GAGATACATG 2237

TGCTTGCTTT TAACAGACTG CCAAATCAAT TATGCGACTG TTCTGCACAG ATAATCGTGG 2297

CTTGGTTTGA AGGCTGCCAT AAAGTCTAAC GCTGGCTACC AATTAGGTAG GAGTGTCCCC 2357

TTCCTGCCAG GTTGCTCCAG TCGTAGAAGT AGACTGATAT ATTGAAGATT GCCCATATAC 2417

CATGGACGCT CGTCTTATTC TACATCATAT ATGTCACTCC TAGTGACCAT ATAGACATGC 2477

TAACCATTGC ACAACCCCCC ACAGGTTCAA TCCAACCCAT GACCCCCTCT CATCTTCTGT 2537

TGTATTTTCA GGTTCTAGAT TTGCATACAT ACTACCCATC ATCGGAAGAC GGGTGAGGAG 2597

GCAGATGACC CGACATTATA TTTATTAATT GCTTAGGATG TTTCAACAAC ATTAAAAGTA 2657

TATCAATAAG CTTTTCCAGT TTATATTTAC TACCTAAGAT TACGGCATAT AGTGTATTCT 2717

GTGTGCGTAA GAGGTCGCCC TTAAATGGAA ACAGTTCGCG GTTGGAGATA TATATTTGTA 2777

GTGTTCAGGC GGAACGAGTA AAAAAAAAAA AAAATGAGAA GCTGGTGATA TTAACTCCGA 2837

TGTTTATCTT ACATATACCA ATGGATGTAG TCTCATTATA ACGCTTTCTC TGTAGTTTGG 2897

TTGTCATAGA ACTGAATGAC AGGTAAGTGT GTATGTATGT ACAGTACGCA CGGGGGGCCA 2957

TGTGGTCAAC CACACCCAAT GGGCGGTCTT GTCACTTTCC GGACTGGAAA TGAAACGTTC 3017

CATGGAAGAA ATCTGGATGA TTACCTTGAG TACGAGAGAA CTATGGTTGC CGGTAATGGG 3077

TGATTGCCAC AATCATCAGT TCGGTTGAGG CGTTCAACAT CTACGGTACG TTCAGTCACA 3137

TGAATCTGGG AATTCGGGCC TGGTATGCTG GTTTTCGCAA GAGATCCACC CGGCGTGTGC 3197

CAGGTATGCT ACATTTTCTC AGTCGAC 3224

(2) INFORMATION FOR SEQ ID NO 4

( ) SEQUENCE CHARACTERISTICS

(A) LENGTH 496 amino acids (D) TOPOLOGY linear

(ii) MOLECULE TYPE protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Arg Gly He Leu Gly Leu Ser Leu Leu Pro Leu Leu Ala Ala Ala 1 5 10 15

Ser Pro Val Ala Val Asp Ser He His Asn Gly Ala Ala Pro He Leu 20 25 30

Ser Ala Ser Asn Ala Lys Glu Val Pro Asp Ser Tyr He Val Val Phe 35 40 45

Lys Lys His Val Ser Ala Glu Thr Ala Ala Ala His His Thr Trp Val 50 55 60

Gin Asp He His Asp Ser Met Thr Gly Arg He Asp Leu Lys Lys Arg 65 70 75 80

Ser Leu Phe Gly Phe Ser Asp Asp Leu Tyr Leu Gly Leu Lys Asn Thr 85 90 95

Phe Asp He Ala Gly Ser Leu Ala Gly Tyr Ser Gly His Phe His Glu 100 105 110

Asp Val He Glu Gin Val Arg Arg His Pro Asp Val Glu Tyr He Glu 115 120 125

Lys Asp Thr Glu Val His Thr Met Glu Glu Thr Thr Glu Lys Asn Ala 130 135 140

Pro Trp Gly Leu Ala Arg He Ser His Arg Asp Ser Leu Ser Phe Gly 145 150 155 160

Thr Phe Asn Lys Tyr Leu Tyr Ala Ser Glu Gly Gly Glu Gly Val Asp 165 170 175

Ala Tyr Thr He Asp Thr Gly He Asn He Glu His Val Asp Phe Glu 180 185 190

Asp Arg Ala His Trp Gly Lys Thr He Pro Ser Asn Asp Glu Asp Ala 195 200 205

Asp Gly Asn Gly His Gly Thr His Cys Ser Gly Thr He Ala Gly Lys 210 215 220

Lys Tyr Gly Val Ala Lys Lys Ala Asn He Tyr Ala Val Lys Val Leu 225 230 235 240

Arg Ser Ser Gly Ser Gly Thr Met Ser Asp Val Val Leu Gly Val Glu 245 250 255

Trp Ala Val Gin Ser His Leu Lys Lys Ala Lys Asp Ala Lys Asp Ala 260 265 270

Lys Val Lys Gly Phe Lys Gly Ser Val Ala Asn Met Ser Leu Gly Gly 275 280 285

Ala Lys Ser Arg Thr Leu Glu Ala Ala Val Asn Ala Gly Val Glu Ala 290 295 300

Gly Leu His Phe Ala Val Ala Ala Gly Asn Asp Asn Ala Asp Ala Cys 305 310 315 320

Asn Tyr Ser Pro Ala Ala Ala Glu Asn Ala He Thr Val Gly Ala Ser 325 330 335

Thr Leu Gin Asp Glu Arg Ala Tyr Phe Ser Asn Tyr Gly Lys Cys Thr 340 345 350

Asp He Phe Ala Pro Gly Pro Asn He Leu Ser Thr Trp Thr Gly Ser 355 360 365

Lys His Ala Val Asn Thr He Ser Gly Thr Ser Met Ala Ser Pro His 370 375 380

He Ala Gly Leu Leu Ala Tyr Phe Val Ser Leu Gin Pro Ala Gin Asp 385 390 395 400

Ser Ala Phe Ala Val Asp Glu Leu Thr Pro Ala Lys Leu Lys Lys Asp 405 410 415

He He Ser He Ala Thr Gin Gly Ala Leu Thr Asp He Pro Ser Asp 420 425 430

Thr Pro Asn Leu Leu Ala Trp Asn Gly Gly Gly Ala Asp Asn Tyr Thr 435 440 445

Gin He Val Ala Lys Gly Gly Tyr Lys Ala Gly Ser Asp Asn Leu Lys 450 455 460

Asp Arg Phe Asp Gly Leu Val Asn Lys Ala Glu Lys Leu Leu Ala Glu 465 470 475 480

Glu Leu Gly Ala He Tyr Ser Glu He Gin Gly Ala Val Val Ala * 485 490 495

(2) INFORMATION FOR SEQ ID NO 5

(l) SEQUENCE CHARACTERISTICS-

(A) LENGTH 5643 base pairs

(B) TYPE- nucleic acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY linear

(ii) MOLECULE TYPE. DNA (genomic)

(ill) HYPOTHETICAL: NO

(ill) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE

(A) ORGANISM- Aspergillus oryzae

(B) STRAIN IF04177 (ix) FEATURE:

(A) NAME/KEY: intron

(B) LOCATION. 2701..2769 (ix) FEATURE

(A) NAME/KEY. CDS

(B) LOCATION- 3om(2282 2700, 2770 4949)

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 5.

AAGCTTCGTC CTCGCATCTC GGCCGGGTGA GTAAGGTATG GTATTATTCA TGAAGGGATC 60 TCGTTGGTTA CCGTTGTCTA TCCCTAAACA AAGGATTCAA GAGAACAACT CGGAATGCTC 120

CCTCCGCTTA AACCCCTTGA CTCACTGATG GTGTATGTAC TATGGGTACG ACGTTCGGGA 180

TGTGGACTAC CAACCAGAGA GTGATTAGAG AGTCCGGGTT CTCAGTCCAT GATTTTTGCA 240

TCTTTGAAAC AGACGATGCG GAGCGGTCAT TGGCGGAGTT TACTCCCAAA TACGGCCGAA 300

CGGGGTACTT TAAGTGGAAT CTCCGATTTT GGATCTAAGC TCATGAAGGA AAAGTACTAC 360

TAATGCGTAC CTGTGCCTAA TGTTAGTGCT AGTTCGTCTG TTGCATTTTA CCCGTCGGTT 420

AAGACGAATG GATCCGTTCA GGTTTTAAAA TAACTATCTA TGAAATATTT TAGATTTCCC 480

GACATAGTGG TTGGGATGTC TCGATTAACA CTAGGTACAT CAGGCTCAAT TGATTTTGGT 540

TTTAACGAAA CATGATATAG GTCAGGGTCG TGGACCACCC TCCGCCAGGG ATCAGGGGAC 600

GGTTACATGC GAAGGATTCT GATTATATTC ATGATTATGT CAAGCCTTTT CTCTCGTGTG 660

AAGAGGAGCA GAGAATCCGT ACGGGTTTAA TTTAATTTAG CGCCCTGCAG CTTCGAGAAC 720

ATCCCCAGCA ACGTTAAAAA CCACGAGCTA AAATGGGTCG CCACCGGAAG CACTCGAGTC 780

GAGAGATCGG TCGGCTCAGT ATTCGTAATA CCTGCGTTCC AGACGGTTTT GGTCGTTGGT 840

TTCACTCAGG GAACTTAATT CCAGCGGGAC CCAATATAAT TTGAATGATT CATGATACAT 900

CCATTCGTTT GAACCGATCC TGCAAGAGTT CTGTCTGATT TGGTCAACAT AGTTTTCCTC 960

TGGGGGAGAC TGGGGAAGAG TCAACACAAT GGTCAGGGAG AGAAGAATGA AAGCTCTCGC 1020

AAGTGGATGA TCATGCTACG TACTGTAGGA ATAAAATTAA TTAATGCGAG GCTGCAAGTA 1080

TCCCTGCGCC GATTTTCTCT TCTTACGGCG GGAACCAAAA AATGTGACGC TGTGATTTTC 1140

TGGAAAAGGT AAGGATGTTT AGTTTCCCAG GATTATTACT GGTTCCGTAT GTGTATGTGT 1200

ATGGATATCA TTCCGTATGG ATACGCCCGT TTCCTCCGCC CAGAACCAGT CCGTCATCCA 1260

TCCTCCACTC TTTCTTCTCT TAGAGCCTTT CCACCTCTCT TCACTTTCTT TTTCTTTCCC 1320

CCCTCCCTCT TTGCTTTCCC TCTCCCAGTA TTATTCTTAT ATTATCGGTT TGACCGTCGC 1380

CTCAGTATCG GCCCCCCGTG AATCACTTTT CGTTTCTCTT GTATTTTACT TTCCTATCTG 1440

GGATTGCTCC TCGATTAGCA GCTCTACTTC ATTCGGCCAT GTGCGTCTAG AGGGTCTAGC 1500

CCCTCTCTCT CTTTGCACTG ACTGTCAGCC ATACCATAGT ATCATCCCGG AATTAAGAAA 1560

AAAAAAGAAA TTATTCTACC TCCGATCTGG ACAAATTATA ACCAGGAGAA AATCAAGCGA 1620

AAGAGGGGCA AAGGAGGAGA CACCATTAAA ACTGGGTCTG GTTTGATTCA TGACATACAT 1680

TCGTCGTCTT GAATTTCAAT AGGTACGGAC TGATGCATTC CACTCGAGCC TTTTTAGCTG 1740

CGTGTCCGTC TCCAATCGCA CTTCTTTTCT TATTTCCTTG TGGGATAAAT TGATTATTTA 1800

CCGTTTCGTT TTCTCTATAT TGCGGTGGTG GTGCGACCCA TCCAACTATT ATTATTATAA 1860

TTGGAATTTG ATTTGGATTT TGATTCCTGT GACGGATCTC AGACCAAGTG CCTAAACTAT 1920

AACTGACTTG GACCCCCTTC AGATCCTAGC TTCCCGATTC TTTTCCACCA CTGCTGCATC 1980

CTCTTCCTGC ACGCAGCGTT CGTTTAGGGC GGGTAGACTG GAATTTATTC CTTGCGCCAC 2040

GGACCAATCG CTCCCTCGAC GCTCTCATTC CTGCGTCGAG CTCTTTTTCC CTCGACTCTC 2100

ATTGCTTGCT GGGCTGGTTC TTGAACCTCT TCAATCGTCC TTATCTCTTT CCCCCCATCC 2160 GGCCTGTGAT TCCTATCTTT CCTTTTTTTC TTCCCTTTCT TGTTTGATCC CCCCTCCTCC 2220 CCGTCTTATC GCCTACTATC GTGATCCCCG CCCTTCCCAA TAAAGAGTAG GGCGTGTGAA 2280

C ATG TCC GGG TTA ACC CTC GGG CGA GGC CCT GGG GGC GTG CGA CCG 2326 Met Ser Gly Leu Thr Leu Gly Arg Gly Pro Gly Gly Val Arg Pro 1 5 10 15

ACT CAA ACC GCA ACT TTT ACC ACC CAC CAC CCG TCC GCC GAT GCT GAC 2374 Thr Gin Thr Ala Thr Phe Thr Thr His His Pro Ser Ala Asp Ala Asp 20 25 30

CGC TCC TCC AAC AAC CTC CCC CCT ACC TCC TCG CAG CTG TCC GAT GAC 2422 Arg Ser Ser Asn Asn Leu Pro Pro Thr Ser Ser Gin Leu Ser Asp Asp 35 40 45

TTT TCT TTC GGT TCC CCT CTG AGC CCC GCC GAC TCA CAG GCC CAT GAC 2470 Phe Ser Phe Gly Ser Pro Leu Ser Pro Ala Asp Ser Gin Ala His Asp 50 55 60

GGC CTA CTT CAG GAC TCC CTC TTC CCT GAA TGG GGG TCT GGT GCG CCT 2518 Gly Leu Leu Gin Asp Ser Leu Phe Pro Glu Trp Gly Ser Gly Ala Pro 65 70 75

CGA CCC GGC ATT GAC AGT CCG GAT GAG ATG CAG AGG CAA GAT CCG CTA 2566 Arg Pro Gly He Asp Ser Pro Asp Glu Met Gin Arg Gin Asp Pro Leu 80 85 90 95

GCG ACT CAA ATA TGG AAG CTC TAT TCT AGG ACC AAG GCC CAG TTG CCC 2614 Ala Thr Gin He Trp Lys Leu Tyr Ser Arg Thr Lys Ala Gin Leu Pro 100 105 110

AAC CAG GAG CGT ATG GAA AAC CTG ACC TGG CGG ATG ATG GCG ATG AGT 2662 Asn Gin Glu Arg Met Glu Asn Leu Thr Trp Arg Met Met Ala Met Ser 115 120 125

TTG AAA CGT AAG GAG CGG GAA CGT GCT CAA CAG TCC AT GTAGGTGTTC 2710 Leu Lys Arg Lys Glu Arg Glu Arg Ala Gin Gin Ser Met 130 135 140

TCCCTCTGTA GAGGAACGGC TGGACCCGCT CATCATTAAT TTTTTTTTTG TCTGTGAAG G 2770

TTT CCT GCG AGA CGC GGT AGC GCT GGC CCC AGT GGT ATC GCT CAA CTG 2818 Phe Pro Ala Arg Arg Gly Ser Ala Gly Pro Ser Gly He Ala Gin Leu 145 150 155

CGC ATT TCC GAC CCG CCC GTT GCC ACC GGT AAC CCT CAG TCA ACC GAC 2866 Arg He Ser Asp Pro Pro Val Ala Thr Gly Asn Pro Gin Ser Thr Asp 160 165 170

CTG ACC GCC GAC CCT ATG AAC CTC GAC GAT TTC ATC GTG CCC TTC GAA 2914 Leu Thr Ala Asp Pro Met Asn Leu Asp Asp Phe He Val Pro Phe Glu 175 180 185

TCT CCT TCG GAC CAC CCC TCG CCC AGT GCC GTC AAG ATT TCC GAC TCC 2962 Ser Pro Ser Asp His Pro Ser Pro Ser Ala Val Lys He Ser Asp Ser 190 195 200

ACG GCG TCC GCG GCC ATT CCC ATC AAG TCC CGG AAA GAC CAG CTG AGA 3010 Thr Ala Ser Ala Ala He Pro He Lys Ser Arg Lys Asp Gin Leu Arg 205 210 215 220

GAT TCT ACC CCG GTG CCG GCC TCG TTC CAC CAT CCG GCT CAG GAT CAA 3058 Asp Ser Thr Pro Val Pro Ala Ser Phe His His Pro Ala Gin Asp Gin 225 230 235

CGG AAG AAC AGT GAA TTT GGC TAC GTC CCC CGT CGC GTG CGC AAG ACG 3106 Arg Lys Asn Ser Glu Phe Gly Tyr Val Pro Arg Arg Val Arg Lys Thr 240 245 250

AGT ATC GAC GAG CGT CAA TTT TTC TCA CTG CAG GTG CCG ACC CGA AAG 3154 Ser He Asp Glu Arg Gin Phe Phe Ser Leu Gin Val Pro Thr Arg Lys 255 260 265

CGA CCG GCC GAA TCC TCG CCC CAG GTA CCC CCC GTT TCC AAC TCG ATG 3202 Arg Pro Ala Glu Ser Ser Pro Gin Val Pro Pro Val Ser Asn Ser Met 270 275 280

TTG GCC CAC GAT CCG GAC CTC GCT TCC GGC GTG CCC GAT TAT GCC TTG 3250 Leu Ala His Asp Pro Asp Leu Ala Ser Gly Val Pro Asp Tyr Ala Leu 285 290 295 300

GAC GCC CCG TCC TCG GCC TTT GGC TTC CAT CAG GGT AAC CAC CAT CCG 3298 Asp Ala Pro Ser Ser Ala Phe Gly Phe His Gin Gly Asn His H s Pro 305 310 315

GTC AAT CAT CAC AAC CAC ACC TCC CCC GGG GCA CCG TTT GGC TTG GAT 3346 Val Asn H s His Asn His Thr Ser Pro Gly Ala Pro Phe Gly Leu Asp 320 325 330

ACG TTC GGC CTG GGA GAT GAT CCA ATC TTG CCC TCC GCG GGC CCC TAC 3394 Thr Phe Gly Leu Gly Asp Asp Pro He Leu Pro Ser Ala Gly Pro Tyr 335 340 345

CAG TCG CAA TTC ACC TTC TCA CCC AGC GAG TCT CCG ATG GCC TCC GGT 3442 Gin Ser Gin Phe Thr Phe Ser Pro Ser Glu Ser Pro Met Ala Ser Gly 350 355 360

CAT CCG TTT GCG AAC CTC TAT TCG CAT ACC CCG GTG GCT TCG TCC CTC 3490 His Pro Phe Ala Asn Leu Tyr Ser H s Thr Pro Val Ala Ser Ser Leu 365 370 375 380

AAC TCG ACG GAT TTC TTC TCT CCA CCG CCA TCA GGC TAC CAG TCC ACG 3538 Asn Ser Thr Asp Phe Phe Ser Pro Pro Pro Ser Gly Tyr Gin Ser Thr 385 390 395

GCA TCC ACG CCG CAG CCC ACC TAC GAC GGG GAC CAT TCC GTT TAT TTC 3586 Ala Ser Thr Pro Gin Pro Thr Tyr Asp Gly Asp His Ser Val Tyr Phe 400 405 410

GAT ATG CCG TCG GGC GAC GCG CGC ACC CAG CGC CGC ATT CCG AAC TAT 3634 Asp Met Pro Ser Gly Asp Ala Arg Thr Gin Arg Arg He Pro Asn Tyr 415 420 425

ATT TCG CAT CGG TCC AAC TTG TCT GCT TCG CTG CAG CCT CGG TAT ATG 3682 He Ser His Arg Ser Asn Leu Ser Ala Ser Leu Gin Pro Arg Tyr Met 430 435 440

TTC AAC CAG AAC AAC CAT GAA CAG GCC AGT TCG TCG ACG GTG CAT TCG 3730 Phe Asn Gin Asn Asn His Glu Gin Ala Ser Ser Ser Thr Val His Ser 445 450 455 460

CCG AGC TAC CCC ATT CCC CAG CCG CAA CAT GTG GAC CCC ACT CAG GTG 3778 Pro Ser Tyr Pro He Pro Gin Pro Gin His Val Asp Pro Thr Gin Val 465 470 475

TTG AAC GCC ACC AAT TAC TCG ACC GGC AAC TCC CAC CAT ACC GGC GCC 3826 Leu Asn Ala Thr Asn Tyr Ser Thr Gly Asn Ser His His Thr Gly Ala 480 485 490

ATG TTT TCA TTT GGA GCC GAT TCA GAT AAC GAG GAT GAC GAT GGT CAT 3874 Met Phe Ser Phe Gly Ala Asp Ser Asp Asn Glu Asp Asp Asp Gly His 495 500 505

CAG CTG TCC GAG CGG GCT GGT CTG GCG ATG CCG ACT GAA TAT GGG GAC 3922 Gin Leu Ser Glu Arg Ala Gly Leu Ala Met Pro Thr Glu Tyr Gly Asp 510 515 520

GAG GAC GGG TTC TCG TCG GGC ATG CAG TGG GAT GGG CAG TTC CCG GGC 3970 Glu Asp Gly Phe Ser Ser Gly Met Gin Trp Asp Gly Gin Phe Pro Gly 525 530 535 540

TCC TTC CAT TCG CTG CCG GGC TTT GGC CCT CAA CAT CGC AAG CAT GTT 4018 Ser Phe His Ser Leu Pro Gly Phe Gly Pro Gin His Arg Lys His Val 545 550 555

ACC ATC GGG TCC ACG GAC ATG ATG GAC ACC CCC GAG GAG TGG AAT CAC 4066 Thr He Gly Ser Thr Asp Met Met Asp Thr Pro Glu Glu Trp Asn His 560 565 570

GGT GGC AGT TTG GGT CGG ACT CAT GGG TCG GTG GCT TCG GTC AGT GAG 4114 Gly Gly Ser Leu Gly Arg Thr His Gly Ser Val Ala Ser Val Ser Glu 575 580 585

GTG CGC AAC CGA GAG CAG GAC CCT CGC CGG CAG AAG ATT GCC CGC ACC 4162 Val Arg Asn Arg Glu Gin Asp Pro Arg Arg Gin Lys He Ala Arg Thr 590 595 600

ACG TCC ACC CCC AAT ACG GCC CAG CTG TTG CGC CAA AGC ATG CAC TCT 4210 Thr Ser Thr Pro Asn Thr Ala Gin Leu Leu Arg Gin Ser Met His Ser 605 610 615 620

AAT AAC AAT ACG TCT CAT ACC TCC CCT AAT ACG CCG CCC GAG TCC GCC 4258 Asn Asn Asn Thr Ser His Thr Ser Pro Asn Thr Pro Pro Glu Ser Ala 625 630 635

CTG AGC AGC GCA GTT CCG TCC CGC CCG GCC AGT CCC GGG GGC AGC AAG 4306 Leu Ser Ser Ala Val Pro Ser Arg Pro Ala Ser Pro Gly Gly Ser Lys 640 645 650

AAC GGC GAC CAA GGC AGC AAC GGA CCG ACC ACC TGC ACG AAC TGC TTC 4354 Asn Gly Asp Gin Gly Ser Asn Gly Pro Thr Thr Cys Thr Asn Cys Phe 655 660 665

ACT CAA ACC ACT CCG CTG TGG CGT CGG AAC CCA GAG GGC CAG CCA CTG 4402 Thr Gin Thr Thr Pro Leu Trp Arg Arg Asn Pro Glu Gly Gin Pro Leu 670 675 680

TGC AAT GCC TGC GGG TTG TTT TTG AAA TTG CAC GGT GTC GTG CGC CCT 4450 Cys Asn Ala Cys Gly Leu Phe Leu Lys Leu H s Gly Val Val Arg Pro 685 690 695 700

CTG TCC CTG AAA ACG GAC GTT ATC AAA AAG CGC AAC CGT AGC AGT GCC 4498 Leu Ser Leu Lys Thr Asp Val He Lys Lys Arg Asn Arg Ser Ser Ala 705 710 715

AAC AGC TTG GCG GTT GGG ACC TCC CGT GCG TCG AAG AAG ACA GCC CGC 4546 Asn Ser Leu Ala Val Gly Thr Ser Arg Ala Ser Lys Lys Thr Ala Arg 720 725 730

AAG AAC TCG GTG CAG CAA GCA TCC GTC ACG ACT CCG ACA TCA AGC CGC 4594 Lys Asn Ser Val Gin Gin Ala Ser Val Thr Thr Pro Thr Ser Ser Arg 735 740 745

GCT CAG AAT GGG ACT TCC TTC GAA TCC CCG CCC GCC GGC TTT AGT GCT 4642 Ala Gin Asn Gly Thr Ser Phe Glu Ser Pro Pro Ala Gly Phe Ser Ala 750 755 760

GCC GCG GGA CGG TCG AAT GGG GTG GTA CCC ATT GCC GCC GCT CCT CCG 4690 Ala Ala Gly Arg Ser Asn Gly Val Val Pro He Ala Ala Ala Pro Pro 765 770 775 780

AAG GCA GCT CCC TCC GCA GCC GCC TCC CCT AGC ACG GGC CAG ACC CGC 4738 Lys Ala Ala Pro Ser Ala Ala Ala Ser Pro Ser Thr Gly Gin Thr Arg 785 790 795

AAC CCG ATC CAG GCT GCC CCG AAA CGT CAA CGA CGG CTG GAA AAG GCC 4786 Asn Pro He Gin Ala Ala Pro Lys Arg Gin Arg Arg Leu Glu Lys Ala 800 805 810

ACG GAG ATG GAA ACG GAC GAG GCT AAC AAG TCC GCG GGA GGC CGA TCC 4834 Thr Glu Met Glu Thr Asp Glu Ala Asn Lys Ser Ala Gly Gly Arg Ser 815 820 825

AAG GTG GTG CCT CTG GCA CCC GCC ATG CCA CCG GCA GCA GCC AAT CCG 4882 Lys Val Val Pro Leu Ala Pro Ala Met Pro Pro Ala Ala Ala Asn Pro 830 835 840

GCG AAC CAT AGT ATT GCC GGA GGC CAA GGG GCT AGT CAG GAA TGG GAG 4930 Ala Asn His Ser He Ala Gly Gly Gin Gly Ala Ser Gin Glu Trp Glu 845 850 855 860

TGG TTG ACG ATG AGT CTGTAATGGC CGCGCTTACC TCTCTACTTC TCTACACTCG 4985 Trp Leu Thr Met Ser Leu 865

TTTCTTAATA TCTTTCTTGA ACCCCCCCTT ATATTTTCCC ACCGTTGATG CTACGCCATG 5045

ACCGATAGAG ATGATGAATA CTGCAACCAA TGGAATCTCG CTAGACGAGA GGTGTTAGAT 5105

GACGTGGCCC GCGATGCACT TAATGAGATA CGAGGAGGTG CAATGCGTTG GTTACGCTAG 5165

TTTAATGGTA ACATGACGAG GGATATTCGC TCTGTTATTT CGGGCTTTGA TCTGTTTCAG 5225

TCTGCGATTT AACAGCGACT GATCCTCTGC TGTGACAATA CACAGCTTGT CTTGTGGTTC 5285

TGTTGTGGCT TTCTGTTTGT TTGGCTGATT TGATTTATGC TTGATACAAT CGCGTCTGTC 5345

CGGACCCCGG CCTTTGTTTT GTTTTCAGTT CTGATTCTTC ACTGTTTCTG ATTCTCTTGT 5405

TCATGTTTTT GATTTGTTCA AGGCTTGGGG CCGGGCAGAA GTGCGCATCT CTGCTTTGTG 5465

TTTTCCGTCA CCGTGCATAG ACGCTGTATG TATATGCTAC AGCAAGATTC TACTTATCCA 5525

GTCTGAGCCT GTATTCATTG AAGTGTAGCC AGCTGTCGAA TGAGCTTTTT AACGATATTG 5585

TTTTGTTGAG TAGTCAACAA GTAGTATCTG TATATTCCGG AGTCTAAGTA AGACACTT 5643

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 866 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6.

Met Ser Gly Leu Thr Leu Gly Arg Gly Pro Gly Gly Val Arg Pro Thr 1 5 10 15

Gin Thr Ala Thr Phe Thr Thr His His Pro Ser Ala Asp Ala Asp Arg 20 25 30

Ser Ser Asn Asn Leu Pro Pro Thr Ser Ser Gin Leu Ser Asp Asp Phe 35 40 45

Ser Phe Gly Ser Pro Leu Ser Pro Ala Asp Ser Gin Ala His Asp Gly 50 55 60

Leu Leu Gin Asp Ser Leu Phe Pro Glu Trp Gly Ser Gly Ala Pro Arg 65 70 75 80

Pro Gly He Asp Ser Pro Asp Glu Met Gin Arg Gin Asp Pro Leu Ala 85 90 95

Thr Gin He Trp Lys Leu Tyr Ser Arg Thr Lys Ala Gin Leu Pro Asn 100 105 110

Gin Glu Arg Met Glu Asn Leu Thr Trp Arg Met Met Ala Met Ser Leu 115 120 125

Lys Arg Lys Glu Arg Glu Arg Ala Gin Gin Ser Met Phe Pro Ala Arg 130 135 140

Arg Gly Ser Ala Gly Pro Ser Gly He Ala Gin Leu Arg He Ser Asp 145 150 155 160

Pro Pro Val Ala Thr Gly Asn Pro Gin Ser Thr Asp Leu Thr Ala Asp 165 170 175

Pro Met Asn Leu Asp Asp Phe He Val Pro Phe Glu Ser Pro Ser Asp 180 185 190

His Pro Ser Pro Ser Ala Val Lys He Ser Asp Ser Thr Ala Ser Ala 195 200 205

Ala He Pro He Lys Ser Arg Lys Asp Gin Leu Arg Asp Ser Thr Pro 210 215 220

Val Pro Ala Ser Phe His His Pro Ala Gin Asp Gin Arg Lys Asn Ser 225 230 235 240

Glu Phe Gly Tyr Val Pro Arg Arg Val Arg Lys Thr Ser He Asp Glu 245 250 255

Arg Gin Phe Phe Ser Leu Gin Val Pro Thr Arg Lys Arg Pro Ala Glu 260 265 270

Ser Ser Pro Gin Val Pro Pro Val Ser Asn Ser Met Leu Ala His Asp 275 280 285

Pro Asp Leu Ala Ser Gly Val Pro Asp Tyr Ala Leu Asp Ala Pro Ser 290 295 300

Ser Ala Phe Gly Phe His Gin Gly Asn His His Pro Val Asn His His 305 310 315 320

Asn His Thr Ser Pro Gly Ala Pro Phe Gly Leu Asp Thr Phe Gly Leu 325 330 335

Gly Asp Asp Pro He Leu Pro Ser Ala Gly Pro Tyr Gin Ser Gin Phe 340 345 350

Thr Phe Ser Pro Ser Glu Ser Pro Met Ala Ser Gly His Pro Phe Ala 355 360 365

Asn Leu Tyr Ser His Thr Pro Val Ala Ser Ser Leu Asn Ser Thr Asp 370 375 380

Phe Phe Ser Pro Pro Pro Ser Gly Tyr Gin Ser Thr Ala Ser Thr Pro 385 390 395 400

Gin Pro Thr Tyr Asp Gly Asp His Ser Val Tyr Phe Asp Met Pro Ser 405 410 415

Gly Asp Ala Arg Thr Gin Arg Arg He Pro Asn Tyr He Ser His Arg 420 425 430

Ser Asn Leu Ser Ala Ser Leu Gin Pro Arg Tyr Met Phe Asn Gin Asn 435 440 445

Asn His Glu Gin Ala Ser Ser Ser Thr Val His Ser Pro Ser Tyr Pro 450 455 460

He Pro Gin Pro Gin His Val Asp Pro Thr Gin Val Leu Asn Ala Thr 465 470 475 480

Asn Tyr Ser Thr Gly Asn Ser His His Thr Gly Ala Met Phe Ser Phe 485 490 495

Gly Ala Asp Ser Asp Asn Glu Asp Asp Asp Gly His Gin Leu Ser Glu 500 505 510

Arg Ala Gly Leu Ala Met Pro Thr Glu Tyr Gly Asp Glu Asp Gly Phe 515 520 525

Ser Ser Gly Met Gin Trp Asp Gly Gin Phe Pro Gly Ser Phe His Ser 530 535 540

Leu Pro Gly Phe Gly Pro Gin His Arg Lys His Val Thr He Gly Ser 545 550 555 560

Thr Asp Met Met Asp Thr Pro Glu Glu Trp Asn His Gly Gly Ser Leu 565 570 575

Gly Arg Thr His Gly Ser Val Ala Ser Val Ser Glu Val Arg Asn Arg 580 585 590

Glu Gin Asp Pro Arg Arg Gin Lys He Ala Arg Thr Thr Ser Thr Pro 595 600 605

Asn Thr Ala Gin Leu Leu Arg Gin Ser Met His Ser Asn Asn Asn Thr 610 615 620

Ser His Thr Ser Pro Asn Thr Pro Pro Glu Ser Ala Leu Ser Ser Ala 625 630 635 640

Val Pro Ser Arg Pro Ala Ser Pro Gly Gly Ser Lys Asn Gly Asp Gin 645 650 655

Gly Ser Asn Gly Pro Thr Thr Cys Thr Asn Cys Phe Thr Gin Thr Thr 660 665 670

Pro Leu Trp Arg Arg Asn Pro Glu Gly Gin Pro Leu Cys Asn Ala Cys 675 680 685

Gly Leu Phe Leu Lys Leu His Gly Val Val Arg Pro Leu Ser Leu Lys 690 695 700

Thr Asp Val He Lys Lys Arg Asn Arg Ser Ser Ala Asn Ser Leu Ala 705 710 715 720

Val Gly Thr Ser Arg Ala Ser Lys Lys Thr Ala Arg Lys Asn Ser Val 725 730 735

Gin Gin Ala Ser Val Thr Thr Pro Thr Ser Ser Arg Ala Gin Asn Gly 740 745 750

Thr Ser Phe Glu Ser Pro Pro Ala Gly Phe Ser Ala Ala Ala Gly Arg 755 760 765

Ser Asn Gly Val Val Pro He Ala Ala Ala Pro Pro Lys Ala Ala Pro 770 775 780

Ser Ala Ala Ala Ser Pro Ser Thr Gly Gin Thr Arg Asn Pro He Gin 785 790 795 800

Ala Ala Pro Lys Arg Gin Arg Arg Leu Glu Lys Ala Thr Glu Met Glu 805 810 815

Thr Asp Glu Ala Asn Lys Ser Ala Gly Gly Arg Ser Lys Val Val Pro 820 825 830

Leu Ala Pro Ala Met Pro Pro Ala Ala Ala Asn Pro Ala Asn His Ser 835 840 845

He Ala Gly Gly Gin Gly Ala Ser Gin Glu Trp Glu Trp Leu Thr Met 850 855 860

Ser Leu 865

(2) INFORMATION FOR SEQ ID NO: 7:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 nucleotides

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA Primer 19819

(lli) HYPOTHETICAL: YES

(iii) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7

GAAGATCTGC GCGGATGTAC ATTGTAG 27

(2) INFORMATION FOR SEQ ID NO: 8:

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 nucleotides

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(li) MOLECULE TYPE: DNA Primer 19821

(ill) HYPOTHETICAL: YES

(ill) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8

TTAGTCAGAA ATTCGTCCCG 20

(2) INFORMATION FOR SEQ ID NO: 9:

( ) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 nucleotides

(B) TYPE nucleic acid

(C) STRANDEDNESS. single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE DNA Primer 19820

(ill) HYPOTHETICAL- YES

(ill) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 9

CCCAAGCTTC ATGCTCGACC AGGGCCTCCT 30

(2) INFORMATION FOR SEQ ID NO 10

(l) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 nucleotides

(B) TYPE: nucleic acid

(C) STRANDEDNESS- smgle

(D) TOPOLOGY- linear

(ii) MOLECULE TYPE. DNA Primer 19818

(in) HYPOTHETICAL- YES

(ill) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10

GGTCTGTGTT AACCAAAGAA C 21

(2) INFORMATION FOR SEQ ID NO: 11.

(l) SEQUENCE CHARACTERISTICS.

(A) LENGTH: 14 nucleotides

(B) TYPE, nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY- linear

(n) MOLECULE TYPE. DNA Primer 8681

(ill) HYPOTHETICAL: YES

( ll) ANTI-SENSE. NO

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 11

GATCCACCAT GAAG 14

(2) INFORMATION FOR SEQ ID NO: 12:

(I) SEQUENCE CHARACTERISTICS.

(A) LENGTH. 14 nucleotides

(B) TYPE: nucleic acid

(C) STRANDEDNESS. single

(D) TOPOLOGY: linear

(II) MOLECULE TYPE: DNA Primer 8747 (ill) HYPOTHETICAL: YES

(ill) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12

GTGGTACTTC AGCT 14