Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
YEAST EXPRESSION SYSTEMS
Document Type and Number:
WIPO Patent Application WO/2008/128701
Kind Code:
A2
Abstract:
The present invention relates to methods for increasing the secretion of a protein of interest (POI) from a eukaryotic cell comprising co-expression of a POI and of at least one protein that enhances protein secretion, said enhancing protein being selected from the group consisting of BMH2, BFR2, C0G6, C0Y1, CUP5, IMH 1, KIN2, SEC31, SSA4 and SSE1. The invention further relates to a yeast promoter sequence, in particular to a promoter sequence of the PET9 gene of P. pastoris, having, under comparable conditions, an increased promoter activity relative to a promoter sequence of the GAP protein. The invention further relates to an expression vector comprising such a promoter sequence and to the use of such an expression vector for expression of a POI in a host cell. The invention further relates to new yeast promoter sequences of genes from P. pastoris, which are useful for expression of a POI in yeast.

Inventors:
GASSER BRIGITTE (AT)
MATTANOVICH DIETHARD (AT)
SAUER MICHAEL (AT)
STADLMAYR GERHARD (AT)
Application Number:
PCT/EP2008/003076
Publication Date:
October 30, 2008
Filing Date:
April 17, 2008
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
POLYMUN SCIENT IMMUNBIO FORSCH (AT)
GASSER BRIGITTE (AT)
MATTANOVICH DIETHARD (AT)
SAUER MICHAEL (AT)
STADLMAYR GERHARD (AT)
International Classes:
C12N15/81; C07K14/39; C07K16/10; C12P21/02
Other References:
VASARA TUIJA ET AL: "Characterisation of two 14-3-3 genes from Trichoderma reesei: Interactions with yeast secretory pathway components" BIOCHIMICA ET BIOPHYSICA ACTA, vol. 1590, no. 1-3, 12 June 2002 (2002-06-12), pages 27-40, XP002499992 ISSN: 0006-3002
GASSER BRIGITTE ET AL: "Transcriptomics-based identification of novel factors enhancing heterologous protein secretion in Yeasts" APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 73, no. 20, October 2007 (2007-10), pages 6499-6507, XP002499993 ISSN: 0099-2240
GASSER BRIGITTE ET AL: "Engineering of Pichia pastoris for improved production of antibody fragments" BIOTECHNOLOGY AND BIOENGINEERING, vol. 94, no. 2, June 2006 (2006-06), pages 353-361, XP002499994 ISSN: 0006-3592 cited in the application
TOIKKANEN JAANA H ET AL: "Kluyveromyces lactis SSOI and SEBI genes are functional in Saccharamyces cerevisiae and enhance production of secreted proteins when overexpressed" YEAST, vol. 21, no. 12, September 2004 (2004-09), pages 1045-1055, XP002499995 ISSN: 0749-503X
MATTANOVICH D ET AL: "Stress in recombinant protein producing yeasts" JOURNAL OF BIOTECHNOLOGY, ELSEVIER SCIENCE PUBLISHERS, AMSTERDAM, NL, vol. 113, no. 1-3, 30 September 2004 (2004-09-30), pages 121-135, XP004569604 ISSN: 0168-1656 cited in the application
LIU YUAN YI ET AL: "Overexpression of an anti-CD3 immunotoxin increases expression and secretion of molecular chaperone BiP/Kar2p by Pichia pastoris" APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 71, no. 9, September 2005 (2005-09), pages 5332-5340, XP002499996 ISSN: 0099-2240
ELBERT MAYA ET AL: "The yeast Par-1 homologs Kin1 and Kin2 show genetic and physical interactions with components of the exocytic machinery" MOLECULAR BIOLOGY OF THE CELL, vol. 16, no. 2, February 2005 (2005-02), pages 532-549, XP002509287 ISSN: 1059-1524
DATABASE EMBL [Online] 28 May 1996 (1996-05-28), "S.cerevisiae chromosome XII reading frame ORF YLR096w" XP002509290 retrieved from EBI accession no. EMBL:Z73268 Database accession no. Z73268
ZHANG WEI ET AL: "Enhanced secretion of heterologous proteins in Pichia pastoris following overexpression of Saccharomyces cerevisiae chaperone proteins" BIOTECHNOLOGY PROGRESS, vol. 22, no. 4, August 2006 (2006-08), pages 1090-1095, XP002509288 ISSN: 8756-7938
NOSAKA KAZUTO ET AL: "Genetic regulation mediated by thiamin pyrophosphate-binding motif in Saccharomyces cerevisiae" MOLECULAR MICROBIOLOGY, vol. 58, no. 2, October 2005 (2005-10), pages 467-479, XP002509289 ISSN: 0950-382X
Attorney, Agent or Firm:
BOGENSBERGER PATENT- UND MARKENBÜRO (Vaduz, LI)
Download PDF:
Claims:

CLAIMS

We claim:

1 . A method of increasing the secretion of a POI from a eukaryotic cell comprising:

- providing a host cell comprising a recombinant nucleotide sequence encoding a POI and at least one recombinant nucleotide sequence encoding a protein that increases protein secretion; and

- expressing in the host cell the recombinant nucleotide sequence encoding a POI and the at least one recombinant nucleotide sequence encoding a protein that increases protein secretion,

wherein said protein that increases protein secretion is selected from the group consisting of BMH2, BFR2, C0G6, C0Y1 , CUP5, IMH 1 , KIN2, SEC31 , SSA4, SSE 1 , and a biologically active fragment of any of the foregoing proteins.

2. The method according to claim 1 , wherein the POI is a eukaryotic protein or a biologically active fragment thereof, preferably a Fab fragment, most preferably a Fab fragment of the monoclonal anti-HIV1 antibody 2F5.

3. The method according to claim 1 or 2, wherein the host cell is a fungal cell, preferably a yeast cell, or a higher eukaryotic cell, preferably a mammalian or a plant cell.

4. The method according to claim 3, wherein the yeast cell is a cell of the Komagataella genus, in particular a cell of a strain of Komagataella pastoris,

Komagataella pseudopastoris or Komagataella phaffii.

5. The method according to any one of claims 1 to 4, wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from yeast, preferably from the species Saccharomyces cerevisiae or Pichia pastoris.

6. The method according to claim 5, wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Saccharomyces cerevisiae and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 32, SEQ ID NO 33, SEQ ID NO 34, SEQ ID NO 35, SEQ ID NO 36, SEQ ID NO 37, SEQ ID NO 38, SEQ ID NO 39, SEQ ID NO 40 and SEQ ID NO 41 .

7. The method according to claim 5, wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 42, SEQ ID NO 43, SEQ ID NO 44, SEQ ID NO 45, SEQ ID NO 46, SEQ ID NO 47, SEQ ID NO 48, SEQ ID NO 49, SEQ ID NO 50 and SEQ ID NO 51 .

8. The method according to any one of claims 1 to 7, wherein the recombinant nucleotide sequence encoding a POI is provided on a plasmid suitable for integration into the genome of the host cell or for autonomous replication in the host cell.

9. The method according to claim 8, wherein the plasmid is a eukaryotic expression vector, preferably a yeast expression vector.

10. The method according to claim 9, wherein the expression vector comprises a secretion leader sequence effective to cause secretion of the POI from the host cell.

1 1 . The method according to claim 9 or 10, wherein the expression vector comprises a promoter sequence effective to control expression of the POI in the host cell.

12. The method according to any one of claims 1 to 1 1 , wherein the nucleotide sequence encoding the POI is controlled by a promoter sequence which is a 1000 bp fragment from the 5'-non coding region of the PET9 gene of Pichia pastoris corresponding to SEQ ID NO 125, or a functionally equivalent variant thereof and the host cell is a cell of the genus

Komagataella, in particular a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.

13. Use of a nucleotide sequence isolated from Saccharomyces cerevisiae and encoding a protein that increases protein secretion and being selected from the group consisting of BMH2, BFR2, COG6, C0Y1 , CUP5, IMH 1 , KIN2, SEC31 , SSA4, SSE 1 , and a biologically active fragment of any of the foregoing proteins, as a secretion enhancer, particularly as an enhancer of the secretion of a POI from a eukaryotic cell, preferably in a yeast cell and most preferred in a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.

14. The use according to claim 13, wherein the nucleotide sequence encoding a protein that increases protein secretion is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 32, SEQ ID NO 33, SEQ ID NO 34, SEQ ID NO 35, SEQ ID NO 36, SEQ ID NO 37, SEQ ID NO 38, SEQ ID NO 39, SEQ ID NO 40 and SEQ ID NO 41 .

15. Use of a nucleotide sequence isolated from Pichia pastoris and encoding a protein that increases protein secretion and being selected from the group consisting of BMH2, BFR2, COG6, COY1 , CUP5, IMH 1 , KIN2, SEC31 , SSA4, SSE1 , and a biologically active fragment of any of the foregoing proteins, as a secretion enhancer, particularly as an enhancer of the secretion of a POI from a eukaryotic cell, preferably in a yeast cell and most preferred in a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.

1 6. The use according to claim 15, wherein the nucleotide sequence encoding a protein that increases protein secretion is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 42, SEQ ID NO 43, SEQ ID NO 44, SEQ ID NO 45, SEQ ID NO 46, SEQ ID NO 47, SEQ ID NO 48, SEQ ID NO 49, SEQ ID NO 50 and SEQ ID NO 51 .

17. The use according to any one of claims 13 to 16 in a method according to claim 1 .

18. A nucleotide sequence encoding a protein that increases protein secretion from a host cell, wherein the nucleotide sequence is isolated from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of a nucleotide sequence encoding the protein BMH2 (SEQ ID NO 42), a nucleotide sequence encoding the protein BFR2 (SEQ ID NO 43), a nucleotide sequence encoding the protein COG6 (SEQ ID NO 44), a nucleotide sequence encoding the protein C0Y1 (SEQ ID NO 45), a nucleotide sequence encoding the protein CUP5 (SEQ ID NO 46), a nucleotide sequence encoding the protein IMH 1 (SEQ ID NO 47), a nucleotide sequence encoding the protein KIN2 (SEQ ID NO 48), a nucleotide sequence encoding the protein SEC31 (SEQ ID NO 49), a nucleotide sequence encoding the protein SSA4 (SEQ ID NO 50) and a nucleotide sequence encoding the protein SSE1 (SEQ ID NO 51 ).

19. A yeast promoter sequence being a 1000 bp fragment from the 5'-non coding region of the PET9 gene corresponding to SEQ ID NO 125, or a functionally equivalent variant thereof and being isolated from Pichia pastoris.

20. The yeast promoter sequence of claim 19 which has, under comparable conditions, improved properties for expression of a POI in yeast, preferably in a strain of the genus Komagataella, in particular in a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii, relative to a yeast promoter known in the art, in particular relative to a GAP promoter isolated from Pichia pastoris.

21 . The yeast promoter sequence according to claim 20, having, under comparable conditions, at least the same, or at least about a 1 .5-fold, or at least about a 2-fold, or at least about a 4-fold, 7-fold, 10-fold, or at least up to about a 1 5-fold promoter activity relative to a GAP promoter isolated from Pichia pastoris.

22. A eukaryotic expression vector based on the pPuzzle backbone further comprising the following components operably linked to each other: a recombinant nucleotide sequence encoding a POI, optionally linked to a leader sequence effective to cause secretion of the POI from the host cell; a promoter effective to control protein expression in a host cell;

a transcription terminator; a selection marker; either homologous integration sequences or autonomous replication sequences,

wherein the promoter is a 1000 bp fragment from the 5'-non coding region of the PET9 gene of Pichia pastoris (SEQ ID NO 125), or a functionally equivalent variant thereof, the transcription terminator is the transcription terminator of the cytochrome c gene from S. cerevisiae, the selection marker is a zeocin resistance gene and the host cell is a yeast cell, preferably a cell of a strain of the genus Komagataella, in particular a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii.

23. Use of an expression vector as defined in claim 22 for recombinant expression of a POI in a host cell.

24. A yeast promoter sequence being isolated from Pichia pastoris and being identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of a 1000 bp fragment from the 5'-non coding region of the GND1 gene (SEQ ID NO 126), a 1000 bp fragment from the 5'-non coding region of the GPM1 gene (SEQ ID NO 127), a 1000 bp fragment from the 5'-non coding region of the HSP90 gene (SEQ ID NO 128), a 1000 bp fragment from the 5'-non coding region of the KAR2 gene (SEQ ID NO 129), a 1000 bp fragment from the 5'-non coding region of the MCM 1 gene (SEQ ID NO 130), a 1000 bp fragment from the 5'-non coding region of the RAD2 gene (SEQ ID NO 131 ), a 1000 bp fragment from the 5'- non coding region of the RPS2 gene (SEQ ID NO 132), a 1000 bp fragment from the 5'-non coding region of the RPS31 gene (SEQ ID NO 133), a 1000 bp fragment from the 5'-non coding region of the SSA1 gene (SEQ ID NO 134), a 1000 bp fragment from the 5'-non coding region of the THI3 gene (SEQ ID NO 135), a 1000 bp fragment from the 5'-non coding region of the TPN gene (SEQ ID NO 136), a 1000 bp fragment from the 5'-non coding region of the UBI4 gene (SEQ ID NO 137), a 1000 bp fragment from the 5'-non coding region of the ENO1 gene (SEQ ID NO 138), a 1000 bp fragment from the 5'- non coding region of the RPS7A gene (SEQ ID NO 139), a 1000 bp fragment from the 5'-non coding region of the RPL1 gene (SEQ ID NO 140), a 1000 bp fragment from the 5'-non coding region of the TKL1 gene (SEQ ID NO 141 ), a

1000 bp fragment from the 5'-non coding region of the PIS1 gene (SEQ ID NO 142), a 1000 bp fragment from the 5'-non coding region of the FET3 gene (SEQ ID NO 143), a 1000 bp fragment from the 5'-non coding region of the FTR1 gene (SEQ ID NO 144), a 1000 bp fragment from the 5' non coding region of the NMT1 gene (SEQ ID NO 145), a 1000 bp fragment from the 5'- non coding region of the PHO8 gene (SEQ ID NO 146), and a 1000 bp fragment from the 5'-non coding region of the FET3 precursor (FET3pre) gene (SEQ ID NO 147), or a functionally equivalent variant of any of the foregoing sequences.

25. A eukaryotic expression vector based on the pPuzzle backbone further comprising the following components operably linked to each other:

a recombinant nucleotide sequence encoding a POI, optionally linked to a leader sequence effective to cause secretion of the POI from the host cell; a promoter effective to control protein expression in a host cell; a transcription terminator; a selection marker; - either homologous integration sequences or autonomous replication sequences,

wherein the promoter is a yeast promoter sequence isolated from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 125, SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131 , SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO 134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO 140, SEQ ID NO 141 , SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ ID NO 145, SEQ ID NO 146 and SEQ ID NO 147, or a functionally equivalent variant of any of the foregoing sequences, and the host cell is a yeast cell, preferably a cell of a strain of the genus Komagataella, in particular a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii.

26. Use of an expression vector as defined in claim 25 for recombinant expression of a POI in a host cell.

27. Use of a yeast promoter sequence being isolated from Pichia pastoris and being identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 125, SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131 , SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO

134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO 140, SEQ ID NO 141 , SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ ID NO 145, SEQ ID NO 146 and SEQ ID NO 147, or a functionally equivalent variant of any of the foregoing sequences for modulation of the expression of a homologous POI in a host cell.

28. The use according to claim 27, wherein the yeast promoter sequence has an increased promoter activity relative to the native promoter sequence of the POI.

29. The use according to claim 27, wherein the yeast promoter sequence has a decreased promoter activity relative to the native promoter sequence of the POI.

Description:

EXPRESSION SYSTEM

TECHNICAL FIELD

The present invention is in the field of biotechnology, in particular in the field of gene expression and relates to a method for increasing the secretion of a protein of interest (POI) from a eukaryotic cell, comprising co-expression of a recombinant nucleotide sequence encoding a protein of interest and at least one recombinant nucleotide sequence encoding a protein that increases protein secretion. The invention further relates to a yeast promoter sequence, in particular to a promoter sequence of the PET9 gene of Pichia pastoris (P. pastoris), which is particularly useful for expression of a protein of interest in yeast, preferably in a strain of the genus Komagataella (Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii), and which has an increased promoter activity relative to the promoter sequence of the glycerol aldehyde phosphate dehydrogenase (GAP) gene of Pichia pastoris under comparable conditions. The invention further relates to an expression vector based on the pPuzzle backbone comprising a PET9 promoter sequence from P. pastoris, as well as to the use of such an expression vector for expression of a protein of interest in a host cell, in particular in a strain of the genus Komagataella (K. pastoris, K. pseudopastoris or K. phaffii).

The invention also relates to new yeast promoter sequences of genes from P. pastoris, which are useful for expression of a protein of interest in yeast, preferably in a strain of the genus Komagataella (K. pastoris, K. pseudopastoris or K. phaffii).

BACKGROUND OF THE INVENTION

Successful secretion of proteins has been accomplished both with prokaryotic and eukaryotic hosts. The most prominent examples are bacteria like Escherichia coli, yeasts like Saccharomyces cerevisiae, Pichia pastoris or Hansenula polymorpha, filamentous fungi like Aspergillus awamori or Trichoderma reesei, or mammalian cells like e.g. CHO cells. While the secretion of some proteins is readily achieved at high rates, many other proteins are only secreted at comparatively low levels (Punt et al., 2002; Macauley-Patrick et al., 2005; Porro et al., 2005).

The heterologous expression of a gene in a host organism requires a vector allowing stable transformation of the host organism. This vector has to provide the gene with a functional promoter adjacent to the 5' end of the coding sequence. The transcription is thereby regulated and initiated by this promoter sequence. Most promoters used up to date have been derived from genes that code for metabolic enzymes that are usually present at high concentrations in the cell.

EP 0103409 discloses the use of yeast promoters associated with expression of specific enzymes in the glycolytic pathway, i.e. promoters involved in expression of pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, phosphoglycerate mutase, hexokinase 1 and 2, glucokinase, phosphofructose kinase, aldolase and glycolytic regulation gene.

WO 97/44470 describes yeast promoters from Yarrowia lipolytica for the translation elongation factor 1 (TEF 1 ) protein and for the ribosomal protein S7 that are suitable for heterologous expression of proteins in yeast.

WO 2005/003310 provides methods for the expression of a coding sequence of interest in yeast using a promoter of the glyceraldehyde-3-phosphate dehydrogenase or phosphoglycerate mutase from oleaginous yeast Yarrowia lipolytica.

One approach for the improvement of the secretion of a recombinant protein was done by random mutagenesis (Archer et al., 1994; Lang and Looman,

1995). The major disadvantage of this method is that positive results usually cannot be transferred to other strains.

The secretory pathway - the folding and processing of proteins - of eukaryotic organisms, e.g. of yeast, is very complex with many interacting participants.

Some of these proteins have catalytic activity on the proteins like protein disulfide isomerase (PDI), others act by binding to the proteins and preventing them from aggregation (chaperones, e.g. BiP), or by stimulating release of the protein to the cell exterior at a later step in the secretory pathway (SSO proteins). Due to this interdependence, increasing the rate of one reaction step in the secretory pathway may not automatically augment secretion of a protein of interest, but instead may cause a rate-limitation at one or more of

the subsequent reaction steps and thus may not remove but only shift bottle- neck(s) of the expression system.

The secretory pathway typically starts by translocation of transmembrane polypeptides and polypeptides intended for secretion into the lumen of the endoplasmatic reticulum (ER). For that purpose, these proteins possess an amino-terminal signal sequence. This signal sequence - also called leader sequence - typically consists of 13 to 36 rather hydrophobic amino acids; no special consensus sequence has been identified yet. On the ER luminal side the signal sequence is removed by a signal peptidase, while the nascent polypeptide is bound to chaperones to prevent miscoiling until translation has finished. ER resident proteins are responsible for correct folding mechanisms. They include, for example, calnexin, calreticulin, Erp72, GRP94, and PDI which latter catalyses the formation of disulfide bonds, and the prolyl-isomerase. Besides, some of the post-translational modifications such as ISI-glycosylation are initiated in the ER lumen. Proteins are exported to the Golgi apparatus by vesicular transport only after the correct conformation of the proteins has been assured by the ER quality control mechanism. Unless there is a differing signal, proteins intended for secretion are directed from the Golgi apparatus to the outside of the plasma membrane by specific transport vesicles (Stryer and Lubert, 1995; Gething and Sambrook, 1992).

In most cases the rate limiting step in the eukaryotic secretion pathway has been identified to be the move of proteins from the ER to the Golgi apparatus (Shuster, 1991 ). A mechanism called ER-associated protein degradation (ERAD) is responsible for the retention of misfolded or unmodified nonfunctional proteins in the ER and their subsequent removal.

It has been shown in several cases that the secretion process of heterologous proteins can be enhanced by co-overexpression of certain proteins that are involved in the secretory pathway and which support the folding and/or processing of other proteins (Mattanovich et al., 2004).

Co-expression of the gene encoding PDI and a gene encoding a heterologous disulphide-bonded protein was first suggested in WO 93/25676 as a means of increasing the production of the heterologous protein. WO 93/25676 reports

that the recombinant expression of antistasin and tick anticoagulant protein can be increased by co-expression with PDI.

WO 94/08012 provides methods for increasing protein secretion in yeast by increasing expression of a Hsp70 chaperone protein, i.e. KAR2 and BiP or a PDI chaperone protein.

The yeast syntaxin homologs SS01 and SSO2 are necessary for the fusion of secretory vesicles to the plasma membrane by acting as t-SNAREs.

WO 94/08024 discloses a process for producing increased amounts of secreted foreign or endogenous proteins by co-expression of the genes SSO1 and SSO2.

WO 03/057897 provides methods for the recombinant expression of a protein of interest by co-expressing at least two genes encoding proteins selected from the group consisting of the chaperone proteins GroEL, GRoES, Dnak, DnaJ, GRpe, CIpB and homologs thereof.

WO 2005/061 7818 and WO 2006/06751 1 provide methods for producing a desired heterologous protein in yeast by using a 2μm-based expression plasmid. It was demonstrated that the production of a heterologous protein is substantially increased when the genes for one or more chaperone protein(s) and a heterologous protein are co-expressed on the same plasmid.

Another approach to stimulate the secretory pathway is to overexpress the unfolded protein response (UPR) activating transcription factor HAC 1 . Transcriptional analyses revealed that up to 330 genes are regulated by HAC 1 , most of them belonging to the functional groups of secretion or the biogenesis of secretory organelles (e.g. ER-resident chaperones, foldases, components of the Translocon).

WO 01 /72783 describes methods for increasing the amount of a heterologous protein secreted from a eukaryotic cell by inducing an elevated unfolded protein response (UPR) , wherein the UPR is modulated by co-expression of a protein selected from the group consisting of HAC1 , PTC2 and IREI .

The flavoenzyme ER01 is required for oxidation of protein dithiols in the ER. It is oxidized by molecular oxygen and acts as a specific oxidant of PDI. Disulfides generated de novo within ER01 are transferred to PDI and then to substrate proteins by dithiol-disulfide exchange reactions.

WO 99/07727 discloses the use of ER01 to enhance disulfide bond formation and thereby to increase the yield of properly folded recombinant proteins.

While these approaches, once established, can be transferred to other strains and used for other proteins as well, they are limited by the actual knowledge about the function of such proteins supporting the secretion of other proteins.

It can be anticipated that the successful high level secretion of a recombinant protein may be limited at a number of different steps, like folding, disulfide bridge formation, glycosylation, transport within the cell, or release from the cell. As many of these processes are still not fully understood, it can also be anticipated that there are many more proteins involved which support the secretion of a protein, than is currently known. However, such helper functions cannot be predicted with the current knowledge of the state-of-the- art, even when the DNA sequence of the entire genome of a host organism is available.

Proteins known to be involved in the yeast secretory pathway frequently influence the process of protein folding and subsequent secretion at different steps of the secretion process.

Accordingly, it is desirable to provide new methods to increase production of secreted proteins in eukaryotic cells which are simple and efficient. It is also desirable to provide new genes to be used in methods for the increased production of secreted proteins. It is also desirable to provide new yeast promoters, especially for use in the expression of heterologous or homologous genes in yeast, in particular in a yeast of the genus Komagataella, but also for expression of a desired gene in any other eukaryotic expression system.

SUMMARY OF THE INVENTION

It is an objective of the present invention to provide a method of increasing the secretion of a protein of interest (POI) from a eukaryotic cell, comprising co-expression of a recombinant nucleotide sequence encoding a POI and at least one recombinant nucleotide sequence encoding a protein that increases protein secretion from a host cell. An increase in secretion of the POI is determined on the basis of a comparison of its secretion yield in the presence or absence of co-expression of a said protein that increases protein secretion.

In one aspect the invention relates to such a method including the co- expression of a recombinant nucleotide sequence encoding a POI and of at least one other recombinant nucleotide sequence encoding a protein that increases protein secretion, wherein said protein that increases protein secretion is selected from the group consisting of BMH2, BFR2, COG6, C0Y1 , CUP5, IMH 1 , KIN2, SEC31 , SSA4, SSE1 , and a biologically active fragment of any of the foregoing proteins.

In another aspect the invention relates to such a method wherein at least one other recombinant nucleotide sequence is obtained from a yeast, preferably from Saccharomyces cerevisiae or from Pichia pastoris.

In another aspect the invention relates to such a method wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Saccharomyces cerevisiae and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 32, SEQ ID NO 33, SEQ ID NO 34, SEQ ID NO 35, SEQ ID NO 36, SEQ ID NO 37, SEQ ID NO 38, SEQ ID NO 39, SEQ ID NO 40 and SEQ ID NO 41 .

In another aspect the invention relates to such a method wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 42, SEQ ID NO 43, SEQ ID NO 44, SEQ ID NO 45, SEQ ID NO 46, SEQ ID NO 47, SEQ ID NO 48, SEQ ID NO 49, SEQ ID NO 50 and SEQ ID NO 51 .

In yet another aspect the invention relates to the use of such a nucleotide sequence encoding a protein that increases protein secretion as a protein secretion enhancer, particularly as an enhancer of the secretion of a POI from a eukaryotic cell.

It is another object of the invention to provide a nucleotide sequence encoding a protein that increases protein secretion from a host cell, wherein the nucleotide sequence is isolated from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of a nucleotide sequence encoding the protein

BMH2 (SEQ ID NO 42), a nucleotide sequence encoding the protein BFR2 (SEQ ID NO 43), a nucleotide sequence encoding the protein C0G6 (SEQ ID NO 44), a nucleotide sequence encoding the protein C0Y1 (SEQ ID NO 45), a nucleotide sequence encoding the protein CUP5 (SEQ ID NO 46), a nucleotide sequence encoding the protein IMH 1 (SEQ ID NO 47), a nucleotide sequence encoding the protein KIN2 (SEQ ID NO 48), a nucleotide sequence encoding the protein SEC31 (SEQ ID NO 49), a nucleotide sequence encoding the protein SSA4 (SEQ ID NO 50) and a nucleotide sequence encoding the protein SSE1 (SEQ ID NO 51 ).

It is another object of the invention to provide a yeast promoter sequence of the PET9 gene from Pichia pastoris, which is useful for expression of a POI in yeast, preferably in a strain of the genus Komagataella, in particular in a strain of K. pastoris, K. pseυdopastoris or K. phaffii, and which has, under comparable conditions, an increased promoter activity relative to the promoter sequence of the GAP protein of Pichia pastoris.

It is another object of the invention to provide such a yeast promoter sequence, particularly a yeast promoter sequence identical with or corresponding to and having the functional characteristics of SEQ ID NO 1 25, or a functionally equivalent variant thereof.

In another aspect the invention relates to an expression vector based on the pPuzzle backbone further comprising such a yeast promoter sequence of the PET9 gene from Pichia pastoris which is identical with or corresponding to and having the functional characteristics of SEQ ID NO 125, or a functionally equivalent variant thereof.

In yet another aspect the invention relates to the use of such a plasmid for the expression of a POI in a host cell, the host cell preferably being a cell of a strain of the genus Komagataella, in particular a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.

It is another object of the invention to provide a yeast promoter sequence from Pichia pastoris which is useful for the expression of a POI in yeast, preferably in a strain of the genus Komagataella, wherein the yeast promoter sequence is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of a 1000 bp fragment from the 5'-non coding region of the GND1 gene (SEQ ID NO 126), a 1000 bp fragment from the 5'-non coding region of the GPM 1 gene (SEQ ID NO 127), a 1000 bp fragment from the 5'-non coding region of the HSP90 gene (SEQ ID NO 128), a 1000 bp fragment from the 5'-non coding region of the KAR2 gene (SEQ ID NO 129), a 1000 bp fragment from the 5'-non coding region of the MCM 1 gene (SEQ ID NO 130), a 1000 bp fragment from the 5'-non coding region of the RAD2 gene (SEQ ID NO 131 ), a 1000 bp fragment from the 5'-non coding region of the RPS2 gene (SEQ ID NO 132), a 1000 bp fragment from the 5'- non coding region of the RPS31 gene (SEQ ID NO 133), a 1000 bp fragment from the 5'-non coding region of the SSA1 gene (SEQ ID NO 134), a 1000 bp fragment from the 5'-non coding region of the THI3 gene (SEQ ID NO 135), a 1000 bp fragment from the 5'-non coding region of the TPM gene (SEQ ID NO 136), a 1000 bp fragment from the 5'-non coding region of the UBI4 gene (SEQ ID NO 137), a 1000 bp fragment from the 5'-non coding region of the ENO1 gene (SEQ ID NO 138), a 1000 bp fragment from the 5'-non coding region of the RPS7A gene (SEQ ID NO 139), a 1000 bp fragment from the 5 1 - non coding region of the RPL 1 gene (SEQ ID NO 140), a 1000 bp fragment from the 5'-non coding region of the TKL1 gene (SEQ ID NO 141 ), a 1000 bp fragment from the 5'-non coding region of the PIS1 gene (SEQ ID NO 142), a 1000 bp fragment from the 5'-non coding region of the FET3 gene (SEQ ID NO 143), a 1000 bp fragment from the 5'-non coding region of the FTR1 gene (SEQ ID NO 144), a 1000 bp fragment from the 5'-non coding region of the NMT1 gene (SEQ ID NO 145), a 1000 bp fragment from the 5'-non coding region of the PHO8 gene (SEQ ID NO 146), and a 1000 bp fragment from the 5'-non coding region of the FET3 precursor (FET3pre) gene (SEQ ID NO 147), or a functionally equivalent variant of any of the foregoing sequences.

In another aspect the invention relates to an expression vector based on the pPuzzle backbone further comprising such a yeast promoter sequence identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131 , SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO 134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO 140, SEQ ID NO 141 , SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ ID NO 145, SEQ ID NO 146 and SEQ ID NO 147, or a functionally equivalent variant of any of the foregoing sequences.

In another aspect the invention relates to the use of such an expression vector for the expression of a POI in a host cell, the host cell being a cell of a strain of the genus Komagataella, in particular a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.

The principle of the invention is further described in the independent claims, while the various embodiments of the invention are the subject matter of dependent claims.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig.1 shows the structure and relevant restriction enzyme cleavage sites of the vector backbone of pPuzzle, comprising a AmpR selection marker for

E. coli amplified from the cloning vector pBR322 and an E. coli origin of replication (ORI) amplified from the cloning vector pUC19. A detailed description of the cloning procedure of the pPuzzle vector backbone is found in Example 3. Fig 2 shows the structure and relevant restriction enzyme cleavage sites of the vector pPuzzle_zeoR_P PET9 _eGFP_AOXTT, where the reporter gene GFP (green fluorescent protein) is under the control of a 1000 bp fragment from the 5'-non coding region of the PET9 gene of P. pastoris. The vector further comprises an E. coli ORI amplified from pUC19, the transcription terminator of the cytochrome c gene from 5. cerevisiae

(eye 1 TT), a zeocin selection marker and the promoter sequence of the AOX 1 gene of P. pastoris (AOXTT part 1 and 2).

DETAILED DESCRIPTION OF THE INVENTION

To understand more about the gene regulation of a host organism during protein production, DNA microarray hybridization experiments with P. pastoris clones expressing recombinant human (rh) trypsinogen in comparison to a non- producing strain (according to Sauer et al., 2004) were performed. A detailed description of the experimental procedure is found in Example 1 . These experiments allow for a determination of the transcription levels of approximately 1 /3 of all genes in P. pastoris, but they do not provide direct information on the potential of any hitherto unidentified protein to enhance secretion.

Additional analysis of the data derived from DNA microarray hybridization has allowed the identification of potential secretion supporting proteins, or their genes respectively. To achieve this, the relative expression levels of all measured genes of a P. pastoris strain being transformed with a plasmid carrying a gene for rh trypsinogen were compared to a wild type strain cultivated under the same conditions. Then the genes were ordered by the relative difference of their expression levels, and some 524 genes with the highest difference were considered for further analysis. As the DNA microarrays used for these experiments were derived from Saccharomyces cerevisiae gene sequences, only putative gene functions for P. pastoris can be assigned by the homology to S. cerevisiae. After ranking the 524 differentially regulated genes based on their putative intracellular localisation and function, and focusing on those being involved in secretion and/or general stress response, out of a number of 64 potentially interesting genes 15 were selected for further analysis. These genes were cloned from S. cerevisiae by PCR and subcloned into a P. pastoris expression vector, and subsequently transformed into a P. pastoris strain expressing the Fab fragment of a monoclonal antibody (2F5mAb) against HIV1 . By cultivating the clones producing both the Fab fragment and the different putative secretion helper proteins, compared to clones producing only the Fab fragment, a beneficial effect of the overexpression of the following genes encoding putative helper proteins on the secretion of the Fab fragment could be identified: PDM , CUP5, SSA4, BMH2, KIN2, KAR2, HAC1 , ERO1 , SSE1 , BFR2, COG6, SSO2, COY1 , IMH 1 and SEC31 .

The proteins PDH , KAR2, HAC1 , ERO1 and SSO2 are already known in the art as being successfully applicable folding/secretion helper factors when co- expressed during recombinant expression of heterologous proteins. The other proteins identified in the DNA microarray assay, i.e. CUP5, SSA4, BMH2, KIN2, SSE1 , BFR2, COG6, COY1 , IMH 1 and SEC31 have not yet been described as having a beneficial effect on the secretion of recombinantly produced POi.

Accordingly, the present invention in its first aspect relates to a method of increasing the secretion of a POI from a eukaryotic cell comprising:

- providing a host cell comprising a recombinant nucleotide sequence encoding a POI and at least one recombinant nucleotide sequence encoding a protein that increases protein secretion; and

- expressing in the host cell the recombinant nucleotide sequence encoding a POI and the at least one recombinant nucleotide sequence encoding a protein that increases protein secretion,

wherein said protein that increases protein secretion is selected from the group consisting of BMH2, BFR2, C0G6, C0Y1 , CUP5, IMH1 , KIN2, SEC31 , SSA4, SSE1 , and a biologically active fragment of any of the foregoing proteins.

The term "protein of interest (POI)" as used herein refers to a protein that is produced by means of recombinant technology in a host cell. More specifically, the protein may either be a polypeptide not naturally occurring in the host cell, i.e. a heterologous protein, or else may be native to the host cell, i.e. a homologous protein to the host cell, but is produced, for example, by transformation with a self replicating vector containing the nucleic acid sequence encoding the POI, or upon integration by recombinant techniques of one or more copies of the nucleic acid sequence encoding the POI into the genome of the host cell, or by recombinant modification of one or more regulatory sequences controlling the expression of the gene encoding the POI, e.g. of the promoter sequence.

The POI can be any eukaryotic or prokaryotic protein. The protein can be a naturally secreted protein or an intracellular protein, i.e. a protein which is not naturally secreted. The present invention also includes biologically active fragments of naturally secreted or not naturally secreted proteins.

A secreted POI referred to herein may be but is not limited to a protein suitable as a biopharmaceutical substance like an antibody or antibody fragment, growth factor, hormone, enzyme, vaccine, or a protein which can be used for industrial application like e.g. an enzyme.

A intracellular POI referred to herein may be but is not limited to a helper factor for protein secretion, or an enzyme used for metabolic engineering purposes.

In another embodiment, the POI is a eukaryotic protein or a biologically active fragment thereof, preferably an immunoglobulin or an immunoglobulin fragment such as a Fc fragment or a Fab fragment. Most preferably, the POI is a Fab fragment of the monoclonal anti-HIV1 antibody 2F5.

In general, the proteins of interest referred to herein may be produced by methods of recombinant expression well known to a person skilled in the art.

It is understood that the methods disclosed herein may further include cultivating said recombinant host cells under conditions permitting the expression of the POI. A secreted, recombinantly produced POI can then be isolated from the cell culture medium and further purified by techniques well known to a person skilled in the art.

As used herein, a "biologically active fragment" of a protein shall mean a fragment of a protein that exerts a biological effect similar or comparable to the full length protein. Such fragments can be produced e.g. by amino- and carboxy- terminal deletions as well as by internal deletions.

In general, the host cell from which the proteins are secreted can be any eukaryotic cell suitable for recombinant expression of a POI.

In a preferred embodiment, the invention relates to such a method, wherein the host cell is a fungal cell, e.g. a yeast cell, or a higher eukaryotic cell, e.g. a mammalian cell or a plant cell.

Examples of yeast cells include but are not limited to the Saccharomyces genus (e.g. Saccharomyces cerevisiae), the Komagataella genus (Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii), Pichia methanolica, Hansenula polymorpha or Kluyveromyces lactis.

In a preferred embodiment the invention relates to a method, wherein the yeast cell is a cell of the Komagataella genus, in particular a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii.

The former species Pichia pastoris has been divided and renamed to Komagataella pastoris and Komagataella phaffii (Kurtzman, 2005). Therefore Pichia pastoris is synonymous for both Komagataella pastoris and Komagataella phaffii.

The nucleotide sequences encoding the proteins that increase protein secretion can be obtained from a variety of sources. Said proteins may be involved in the eukaryotic protein secretory pathway.

In one aspect the invention relates to such a method, wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is a yeast nucleotide sequence, preferably but not limited to a nucleotide sequence of the yeast species Saccharomyces cerevisiae or Pichia pastoris. Also, homologous nucleotide sequences from other suitable yeasts or other fungi or from other organisms such as vertebrates can be used.

The term "homologous nucleotide sequences" as used herein refers to nucleotide sequences which are related but not identical in their nucleotide sequence with the contemplated nucleotide sequence, and perform essentially the same function.

In a further aspect the invention relates to such a method, wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Saccharomyces cerevisiae and is identical with or

corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 32, SEQ ID NO 33, SEQ ID NO 34, SEQ ID NO 35, SEQ ID NO 36, SEQ ID NO 37, SEQ ID NO 38, SEQ ID NO 39, SEQ ID NO 40 and SEQ ID NO 41 .

As used herein, the term "nucleotide sequence that corresponds to and has the functional characteristics of" is meant to encompass variations in its nucleotide composition including variations due to the degeneracy of the genetic code, whereby the nucleotide sequence performs essentially the same function.

By screening a P. pastoris genome database (ERGO™, IG-66, Integrated Genomics) with the nucleotide sequences of the secretion helper factors isolated from Saccharomyces cerevisiae homologous nucleotide sequences in Pichia pastoris have been identified. Preliminary experimental results indicate that these homologous nucleotide sequences isolated from Pichia pastoris show similar effects on protein secretion from a host cell when compared to the corresponding nucleotide sequences isolated from Saccharomyces cerevisiae.

In a further aspect the invention relates to such a method, wherein at least one recombinant nucleotide sequence encoding a protein that increases protein secretion is obtained from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 42, SEQ ID NO 43, SEQ ID NO 44, SEQ ID NO 45, SEQ ID NO 46, SEQ ID NO 47, SEQ ID NO 48, SEQ ID NO 49, SEQ ID NO 50 and SEQ ID NO 51 .

In a further aspect the invention relates to such a method, wherein the recombinant nucleotide sequence encoding the POI is provided on a plasmid suitable for integration into the genome of the host cell, in a single copy or in multiple copies per cell. The recombinant nucleotide sequence encoding the POI may also be provided on an autonomously replicating plasmid in a single copy or in multiple copies per cell.

Alternatively, the recombinant nucleotide sequence encoding the POI and the recombinant nucleotide sequence encoding a protein that increases protein

secretion are present on the same plasmid in single copy or multiple copies per cell.

The terms "plasmid" and "vector" as used herein include autonomously replicating nucleotide sequences as well as genome integrating nucleotide sequences.

In a further aspect, the invention relates to such a method, wherein the plasmid is a eukaryotic expression vector, preferably a yeast expression vector.

"Expression vectors" as used herein are defined as DNA sequences that are required for the transcription of cloned recombinant nucleotide sequences, i.e. of recombinant genes and the translation of their mRNA in a suitable host organism. Such expression vectors usually comprise an origin for autonomous replication in the host cells, selectable markers (e.g. an amino acid synthesis gene or a gene conferring resistance to antibiotics such as zeocin, kanamycin, G418 or hygromycin), a number of restriction enzyme cleavage sites, a suitable promoter sequence and a transcription terminator, which components are operably linked together.

The term "operably linked" as used herein refers to the association of nucleotide sequences on a single nucleic acid molecule, e.g. a vector, in a way such that the function of one or more nucleotide sequences is affected by at least one other nucleotide sequence present on said nucleic acid molecule. For example, a promoter is operably linked with a coding sequence of a recombinant gene when it is capable of effecting the expression of that coding sequence.

Expression vectors may include but are not limited to cloning vectors, modified cloning vectors and specifically designed plasmids. The expression vector of the invention may be any expression vector suitable for expression of a recombinant gene in a host cell and is selected depending on the host organism.

In another aspect the invention relates to such a method, wherein the expression vector comprises a secretion leader sequence effective to cause secretion of the POI from the host cell.

The presence of such a secretion leader sequence in the expression vector is required when the POI intended for recombinant expression and secretion is a protein which is not naturally secreted and therefore lacks a natural secretion leader sequence, or its nucleotide sequence has been cloned without its natural secretion leader sequence. In general, any secretion leader sequence effective to cause secretion of the POI from the host cell may be used in the present invention. The secretion leader sequence may originate from yeast source, e.g. from yeast α-factor such as MFa of Saccharomyces cerevisiae, or yeast phosphatase, from mammalian or plant source, or others. The selection of the appropriate secretion leader sequence is apparent to a skilled person.

Alternatively, the secretion leader sequence can be fused to the nucleotide sequence encoding a POI intended for recombinant expression by conventional cloning techniques known to a skilled person prior to cloning of the nucleotide sequence in the expression vector or the nucleotide sequence encoding a POI comprising a natural secretion leader sequence is cloned in the expression vector. In these cases the presence of a secretion leader sequence in the expression vector is not required.

To allow expression of a recombinant nucleotide sequence in a host cell the expression vector has to provide the recombinant nucleotide sequence with a functional promoter adjacent to the 5' end of the coding sequence. The transcription is thereby regulated and initiated by this promoter sequence.

In a further aspect the invention relates to such a method, wherein the expression vector comprises a promoter sequence effective to control expression of the POI in the host cell.

"Promoter sequence" as used herein refers to a DNA sequence capable of controlling the expression of a coding sequence or functional RNA.

Suitable promoter sequences for use with yeast host cells may include but are not limited to promoters obtained from genes that code for metabolic enzymes

which are known to be present at high concentration in the cell, e.g. glycolytic enzymes like triosephosphate isomerase (TPI), phosphoglycerate kinase (PGK), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), alcohol oxidase (AOX), lactase (LAC) and galactosidase (GAL).

Suitable promoter sequences for use with mammalian host cells may include but are not limited to promoters obtained from the genomes of viruses, heterologous mammalian promoters, e.g. the actin promoter or an immunoglobulin promoter, and heat shock protein promoters.

In order to identify novel promoter sequences for use in yeast host cells, preferably for use in a strain of the Komagataella genus, in particular for use in a strain of K. pastoris, K. pseudopastoris or K. phaffii for recombinant expression of a POI, the data derived from the DNA microarray hybridisation described in Example 1 were evaluated in a specific manner.

The promoter sequences of the 23 most interesting genes identified by this analysis (up to 1000 bp of the 5 '-region of the respective genes) were amplified from P. pastoris by PCR and cloned into a P. pastoris expression vector, which additionally carries an enhanced green fluorescent protein

(eGFP) as a reporter gene. To test the properties of the different promoters, i.e. the promoter activity, the 25 vectors (including two control vectors) were subsequently transformed into a P. pastoris strain. The clones were cultivated under different culturing conditions and the amount of recombinant eGFP was quantified using flow cytometer analysis. A comparative analysis of the well established yeast promoter of GAP and the 23 promoter sequences is provided in Example 5.

The term "promoter activity" as used herein refers to an assessment of the transcriptional efficiency of a promoter. This may be determined directly by measurement of the amount of mRNA transcription from the promoter, e.g. by Northern Blotting or indirectly by measurement of the amount of gene product expressed from the promoter.

It was surprisingly found that a 1000 bp fragment from the 5'-non coding region of the PET9 gene of P. pastoris results in real unexpected high expression levels of recombinant eGFP, ranging from about 700% to about

1600% of the promoter activity of the GAP promoter, depending on the carbon source during cultivation, under the experimental conditions as described in Example 5.

PET9 is known from S. cerevisiae as a major ADP/ATP carrier of the mitochondrial inner membrane, which exchanges cytosolic ADP for mitochondrial synthesized ATP.

In another aspect the invention relates to a method of increasing the secretion of a POI from a eukaryotic cell, wherein the nucleotide sequence encoding the POI is controlled by a promoter sequence which is a 1000 bp fragment from the 5'-non coding region of the PET9 gene of Pichia pastoris corresponding to SEQ ID NO 125, or a functionally equivalent variant thereof and the host cell is a cell of the genus Komagataella, in particular a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.

In another aspect the invention relates to the use of a nucleotide sequence isolated from Saccharomyces cerevisiae and encoding a protein that increases protein secretion and being selected from the group consisting of BMH2, BFR2, COG6, C0Y1 , CUP5, IMH 1 , KIN2, SEC31 , SSA4, SSE1 , and a biologically active fragment of any of the foregoing proteins, as a secretion enhancer, particularly as an enhancer of the secretion of a POI from a eukaryotic cell, preferably in a yeast cell and most preferred in a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.

In a further aspect the invention relates to such a use wherein the nucleotide sequence encoding a protein that increases protein secretion is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 32, SEQ ID NO 33, SEQ ID NO 34, SEQ ID NO 35, SEQ ID NO 36, SEQ ID NO 37, SEQ ID NO 38, SEQ ID NO 39, SEQ ID NO 40 and SEQ ID NO 41 .

In another aspect the invention relates to the use of a nucleotide sequence isolated from Pichia pastoris and encoding a protein that increases protein secretion and being selected from the group consisting of BMH2, BFR2, C0G6, C0Y1 , CUP5, IMH 1 , KIN2, SEC31 , SSA4, SSE1 , and a biologically active fragment of any of the foregoing proteins, as a secretion enhancer, particularly

as an enhancer of the secretion of a POI from a eukaryotic cell, preferably in a yeast cell and most preferred in a cell of a strain of K. pastoris, K. pseudopastoris or K. phaffii.

In a further aspect the invention relates to such a use, wherein the nucleotide sequence encoding a protein that increases protein secretion is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 42, SEQ ID NO 43, SEQ ID NO 44, SEQ ID NO 45, SEQ ID NO 46, SEQ ID NO 47, SEQ ID NO 48, SEQ ID NO 49, SEQ ID NO 50 and SEQ ID NO 51 .

SSA4 is a member of the HSP70 family of molecular chaperones. SSA4 is participating in the SRP-dependent targeting of protein to the ER membrane prior to the cotranslational translocation of the protein into the ER-lumen, and is induced upon stress response.

The chaperonines of the SSE/HSP1 10 subclass of the HSP70 family, that are encoded by SSE1 and SSE2, assist in folding by binding to nascent peptides and holding them in a folding-competent state, however, they can not actively promote folding reactions. On the basis of their "holdase" activity, interactions to chaperones such as Ssai p and Ssbi p of the HSP70 family as well as to the HSP90 complex seem plausible.

Sec31 p is an essential phosphoprotein component of the coat protein complex Il (COPII) of secretory pathway vesicles, in complex with Sec13p.

Growth defects due to mutations in either Sec13 or Sec23 (as well as Sec16 and Ypt1 ) can be overcome by overexpression of the essential S. cerevisiae gene BFR2. It has been isolated as a multi-copy suppressor of the drug Brefeldin A, a fungal metabolite that perturbs the protein flux into the Golgi and the structure of the Golgi apparatus itself.

14-3-3 proteins, encoded by BMH 1 and BMH2, were identified to participate in multiple steps of vesicular trafficking, especially in protein exit from the ER, forward trafficking of multimeric cell surface membrane proteins as well as in retrograde transportation within the Golgi apparatus.

COG6 belongs to one of eight genes coding for the Conserved Oligomeric Golgi (COG) complex, an eight-subunit peripheral Golgi protein, that is engaged in membrane trafficking and synthesis of glycoconjugates. Moreover, the COG complex is not only necessary for maintaining normal Golgi structure and function, but is also directly involved in retrograde vesicular transport within the Golgi apparatus.

The molecular function of Coy1 , a protein identified by similarity to mammalian CASP, is not established yet, but is seems to be playing a role in Golgi vesicle transport through interaction with Gos1 . Gos1 is a SNARE

(soluble N-ethylmaleimide-sensitive factor attachment protein receptor) protein commonly used as marker of later compartments of the Golgi in S. cerevisiae.

The product of the IMH 1 /SYS3 gene is a member of the peripheral membrane Golgins involved in vesicular transport between the late Golgi and a prevacuolar, endosome-like compartment. Imh1 is recruited by to the Golgi by the two ARF-like (ARL) GTPases, ArI I p and Arl3p.

Kin2, and the closely related Kin1 , are two serine/threonine protein kinases localized at the cytoplasmic side of the plasma membrane. The catalytic activity of Kin2 is essential for its function in regulation of exocytosis by phosphorylation of the plasma membrane t-SNARE Sec9, a protein acting at the final step of exocytosis. Genetic analysis indicates that the KIN kinases act downstream of the Exocyst, the vesicle tethering factor at the site of exocytosis, and its regulator Sec4 (GTP binding protein of the Ras family).

CUP5 encodes the c subunit of the yeast vacuolar (H)-ATPase (V-ATPase) V 0 domain, belonging to a family of ATP-dependent proton pumps that acidify the yeast central vacuole. The V 0 domain is an integral membrane structure of five subunits responsible for transporting protons across the membrane.

Assembling of the V 0 domain is not possible in the absence of Cup5. V-ATPase function is important for many processes including endocytosis, protein degradation and coupled transport across the vacuolar membrane. Additionally, a role for V-ATPase in detoxification of copper, iron metabolism and mitochondrial function was reported.

In another aspect the invention relates to a nucleotide sequence encoding a protein that increases protein secretion from a host cell, wherein the nucleotide sequence is isolated from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of a nucleotide sequence encoding the protein

BMH2 (SEQ ID NO 42), a nucleotide sequence encoding the protein BFR2 (SEQ ID NO 43), a nucleotide sequence encoding the protein COG6 (SEQ ID NO 44), a nucleotide sequence encoding the protein COY1 (SEQ ID NO 45), a nucleotide sequence encoding the protein CUP5 (SEQ ID NO 46), a nucleotide sequence encoding the protein IMH 1 (SEQ ID NO 47), a nucleotide sequence encoding the protein KIN2 (SEQ ID NO 48), a nucleotide sequence encoding the protein SEC31 (SEQ ID NO 49), a nucleotide sequence encoding the protein SSA4 (SEQ ID NO 50) and a nucleotide sequence encoding the protein SSE1 (SEQ ID NO 51 ).

In a further aspect the invention relates to a yeast promoter sequence being a 1000 bp fragment from the 5'-non coding region of the PET9 gene corresponding to SEQ ID NO 125, or a functionally equivalent variant thereof and being isolated from Pichia pastoris.

It should be recognized that promoter sequences of various diminishing length may have identical promoter activity and should be therefore also included in the present invention, since the exact boundaries of the regulatory sequence of the 5'-non coding region of the PET9 gene have not been defined.

Therefore the term "functionally equivalent variant" of a promoter sequence as used herein means a nucleotide sequence resulting from modification of this nucleotide sequence by insertion, deletion or substitution of one or more nucleotides within the sequence or at either or both of the distal ends of the sequence, and which modification does not affect (in particular impair) the promoter activity of this nucleotide sequence.

In a further aspect the invention relates to such a yeast promoter sequence which has, under comparable conditions, improved properties for expression of a POI in yeast, preferably in a strain of the genus Komagataella, in particular in a strain of Komagataella pastoris, Komagataella pseudopastoris or

Komagataella phaffii, relative to a yeast promoter known in the art, in particular relative to a GAP promoter isolated from Pichia pastoris.

In a further aspect the invention relates to such a yeast promoter sequence, having, under comparable conditions, at least the same, or at least about a 1 .5-fold, or at least about 2-fold, or at least about a 4-fold, 7-fold, 10-fold, or at least up to about a 15-fold promoter activity relative to a GAP promoter isolated from Pichia pastoris.

It is desirable to have an expression system for recombinant expression of a nucleotide sequence in a host organism, in particular in a yeast host, more particular in a strain of the genus Komagataella, which offers the opportunity to easily change the different parts of the vector, like the selection marker, e.g. a resistance for zeocin, kanamycin/geneticin, hygromycin and others, the promoter or the transcription terminator. It would be also advantageous if the vector could either be integrated into the genome of the host (using homologous integration sequences) or located episomally by exchanging a part of the vector which is not important for heterologous gene expression.

For construction of a novel vector system pPuzzle which provides the above mentioned advantages, in a first step a vector backbone of pPuzzle was generated carrying an origin of replication and a selection marker for Escherichia coli (E. co/i), which enables amplification of the vector backbone in E.coli. In addition, the vector backbone of pPuzzle comprises a multiple cloning site (see Figure 1 and Example 3).

In a second step the pPuzzle expression vector carrying a eukaryotic selection marker, a promoter for recombinant expression of a heterologous or homologous nucleotide sequence, a transcription terminator and optionally sequences for homologous integration of the vector in the host genome was constructed (see Example 4). The selection of the promoter sequence and the selection marker depends on the host organism which is used for recombinant expression of a nucleotide sequence. The transcription terminator can be, in principle, each functional transcription terminator and is in particular the transcription terminator of the cytochrome c gene from S. cerevisiae. Further, the presence of homologous integration sequences depends on whether the nucleotide sequence is intended to be integrated in the genome of the host

organism or not. Since the selection marker, the promoter sequence and the homologous integration sequences are flanked by unique restriction enzyme cleavage sites they can easily be exchanged, i.e. cut out and substituted, whereby the vector can be altered or adapted to a selected host organism in a simple and efficient way.

In detail, the selection marker is cloned in a unique Kpnl restriction site, the homologous integration sequences are cloned in a unique Notl restriction site, the promoter is cloned by using the Apal and the Sbfl/Aarl restriction site and the nucleotide sequence encoding a POI is cloned in the MCS (multiple cloning site) using the restriction sites Sbfl and SfII.

In a further aspect the invention relates to an eukaryotic expression vector based on the pPuzzle backbone further comprising the following components operably linked to each other:

a recombinant nucleotide sequence encoding a POI, optionally linked to a leader sequence effective to cause secretion of the POI from the host cell; - a promoter effective to control protein expression in a host cell; a transcription terminator; a selection marker; either homologous integration sequences or autonomous replication sequences,

wherein the promoter is a 1000 bp fragment from the 5'-non coding region of the PET9 gene of Pichia pastoris (SEQ ID NO 125), or a functionally equivalent variant thereof, the transcription terminator is the transcription terminator of the cytochrome c gene from S. cerevisiae, the selection marker is a zeocin resistance gene and the host cell is a yeast cell, preferably a cell of a strain of the genus Komagataella, in particular a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii.

A detailed description of the procedure for the construction of such a vector, which additionally contains an enhanced green fluorescent protein eGFP as a reporter gene (pPuzzle_zeoR_Ppet9_eGFP_AOXTT) is found in Examples 3 to 5 and in Figure 2.

It is understood that any heterologous or homologous nucleotide sequence intended for recombinant expression in a host cell can be used in the position of eGFP.

In another aspect the invention relates to the use of such a eukaryotic expression vector for recombinant expression of a POI in a host cell.

Depending on the problem to be solved it can be desirable to either have a strong expression of a protein of interest in a host cell (e.g. for recombinant production of a POI in a host cell) or to have a weak or reduced expression of a protein of interest in a host cell (e.g. when analysing the molecular function of a POI in a host cell).

Particularly, in case of the analysis of the molecular function of a cellular POI or in case of a POI intended for metabolic engineering applications, which protein shall not be secreted, but develop its activity within a desired compartment of the cell, it would be attractive being able to regulate the expression level of this protein of interest via the promoter activity. It can be desirable to either have a strong expression of the POI (comparable to or stronger as from the GAP promoter) or to have a weak or reduced expression of the POI (less than from the GAP promoter). It is therefore useful to have a selection of different promoter sequences suitable for recombinant expression of a heterologous or homologous nucleotide sequence in a host organism, in particular in a yeast host, more particular in a strain of the genus

Komagataella, in particular in a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii, having different promoter activities under comparable cell culture conditions, varying from strong promoter activity to weak or reduced promoter activity as compared to the GAP promoter. This allows to regulate the expression level of a protein of interest by selection of a suitable promoter sequence according to the experimental situation.

From the comparative analysis of promoter sequences as described in Example 5, i.e. from the analysis of the promoter activity, several promoter sequences with different promoter activities, ranging from 0 % to about 135 % of the

promoter activity of a GAP promoter isolated from Pichia pastoris, under the experimental conditions as described in Example 5, have been found.

A summary of the promoter activities of the yeast promoter sequences tested in Example 5 (determined by measurement of the relative expression level in % of the reporter gene product eGFP and standardisation on eGFP expression under the GAP promoter) is found in Table 8.

In detail, a 1000 bp fragment from the 5'-non coding region of the GND1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 67% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the GPM 1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from about 19% to about 41 % of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the HSP90 gene had, under the experimental conditions of Example 5, a promoter activity ranging from about 6% to about 81 % of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the KAR2 gene had, under the experimental conditions of Example 5, a promoter activity ranging from about 1 1 % to about 135% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the MCM 1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 6% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the RAD2 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 5% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the RPS2 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 12% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5 '-non coding region of the RPS31 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 8% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the SSA1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 30% of the promoter activity of the GAP promoter.

A 1000 bp fragment from the 5'-non coding region of the THI3 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 42% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the TPH gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 92% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the UBI4 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 4% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the ENO1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 17% to about 47% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the RPS7A gene had, under the experimental conditions of Example 5, a promoter activity ranging from 1 % to about 18% of the promoter activity of the GAP promoter.

A 1000 bp fragment from the 5'-non coding region of the RPL1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 1 1 % of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5 -non coding region of the TKL1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 9% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the PIS1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 7% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the FET3 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 7% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the FTR1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 6% of the promoter activity of the GAP promoter.

A 1000 bp fragment from the 5'-non coding region of the NMT1 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 5% of the promoter activity of the GAP promoter. A 1000 bp fragment from the 5'-non coding region of the PHO8 gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 6% of the promoter activity of the GAP promoter.

A 1000 bp fragment from the 5'-non coding region of the FET3 precursor (FET3pre) gene had, under the experimental conditions of Example 5, a promoter activity ranging from 0% to about 7% of the promoter activity of the GAP promoter.

In another aspect the invention relates to a yeast promoter sequence being isolated from Pichia pastoris and being identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of a 1000 bp fragment from the 5'-non coding region of the GND1 gene (SEQ ID NO 126), a 1000 bp fragment from the 5'-non coding region of the GPM1 gene (SEQ ID NO 127), a 1000 bp fragment from the 5'-non coding region of the HSP90 gene (SEQ ID NO 128), a 1000 bp fragment from the 5'- non coding region of the KAR2 gene (SEQ ID NO 129), a 1000 bp fragment from the 5'-non coding region of the MCM 1 gene (SEQ ID NO 130), a 1000 bp fragment from the 5'-non coding region of the RAD2 gene (SEQ ID NO 131 ), a 1000 bp fragment from the 5'-non coding region of the RPS2 gene (SEQ ID NO 132), a 1000 bp fragment from the 5'-non coding region of the RPS31 gene (SEQ ID NO 133), a 1000 bp fragment from the 5'-non coding region of the SSA1 gene (SEQ ID NO 134), a 1000 bp fragment from the 5'-non coding region of the THI3 gene (SEQ ID NO 135), a 1000 bp fragment from the 5'- non coding region of the TPM gene (SEQ ID NO 136), a 1000 bp fragment from the 5'-non coding region of the UBI4 gene (SEQ ID NO 137), a 1000 bp fragment from the 5'-non coding region of the ENO1 gene (SEQ ID NO 138), a 1000 bp fragment from the 5'-non coding region of the RPS7A gene (SEQ ID NO 139), a 1000 bp fragment from the 5'-non coding region of the RPL1 gene (SEQ ID NO 140), a 1000 bp fragment from the 5'-non coding region of the TKL1 gene (SEQ ID NO 141 ), a 1000 bp fragment from the 5'-non coding region of the PIS1 gene (SEQ ID NO 142), a 1000 bp fragment from the 5'- non coding region of the FET3 gene (SEQ ID NO 143), a 1000 bp fragment from the 5'-non coding region of the FTR1 gene (SEQ ID NO 144), a 1000 bp fragment from the 5'-non coding region of the NMT1 gene (SEQ ID NO 145), a 1000 bp fragment from the 5'-non coding region of the PHO8 gene (SEQ ID NO 146), and a 1000 bp fragment from the 5'-non coding region of the FET3 precursor (FET3pre) gene (SEQ ID NO 147), or a functionally equivalent variant of any of the foregoing sequences.

Enolase 1 (ENO1 ) is a phosphopyruvate hydratase that catalyzes the conversion of 2-phosphoglycerate to phosphoenolpyruvate during glycolysis and the reverse reaction during gluconeogenesis.

Triose phosphate isomerase (TPH ) is an abundant glycolytic enzyme. It catalyzes the interconversion of glyceraldehyde-3-phosphate and dihydroxyacetone phosphate during glycolysis.

THI3 is a probable decarboxylase, required for expression of enzymes involved in thiamine biosynthesis and may have a role in catabolism of amino acids to long- chain and complex alcohols.

SSA1 is an ATPase involved in protein folding and nuclear localization signal (NLS)-directed nuclear transport. SSA1 is member of heat shock protein 70 (HSP70) family.

RPS7A is a protein component of the small (40S) ribosomal subunit.

6-Phosphogluconate dehydrogenase (GND 1 ) catalyzes an NADPH regenerating reaction in the pentose phosphate pathway and is required for growth on D- glucono-delta-lactone and adaptation to oxidative stress.

GPM 1 encodes the phosphoglycerate mutase, which is a tetrameric enzyme responsible for the conversion of 3-phospholycerate to 2-phosphoglycerate during glycolysis (, and the reverse reaction during gluconeogenesis.

Transketolase (TKL 1 ) catalyzes conversion of xylulose-5-phosphate and ribose-5- phosphate to sedoheptulose-7-phosphate and glyceraldehyde-3-phosphate in the pentose phosphate pathway and is needed for synthesis of aromatic amino acids.

Heat Shock Protein 90 (HSP90) is a cytoplasmic chaperone (Hsp90 family).

RPS2 is a protein component of small ribosomal(40S) subunit.

RPS31 is a fusion protein that is cleaved to yield a ribosomal protein of the small (40S) subunit and ubiquitin.

RPL1 A is a protein component of the large ribosomal (60S) subunit.

The phosphatidylinositol synthase PIS1 is required for biosynthesis of phosphatidylinositol, which is a precursor for polyphosphoinositides, sphingolipids, and glycolipid anchors for some of the plasma membrane proteins.

Ferro-0 2 -oxidoreductase (FET3) belongs to class of integral membrane multicopper oxidases and is required for high-affinity iron uptake and involved in mediating resistance to copper ion toxicity, FET3pre its precursor.

The high affinity iron permease (FTR 1 ) is involved in the transport of iron across the plasma membrane and forms complex with Fet3p.

PHO8 is a repressible alkaline phosphatase.

N-myristoyl transferase NMT1 catalyzes the cotranslational, covalent attachment of myristic acid to the N-terminal glycine residue of several proteins involved in cellular growth and signal transduction.

The transcription factor MCM 1 is involved in cell-type-specific transcription and pheromone response.

Ubiquitin (UBI4) becomes conjugated to proteins, marking them for selective degradation via the ubiquitin-26S proteasome system.

RAD2, a single-stranded DNA endonuclease, cleaves single-stranded DNA during nucleotide excision repair to excise damaged DNA.

In a further aspect the invention relates to a eukaryotic expression vector based on the pPuzzle backbone further comprising the following components operably linked to each other:

a recombinant nucleotide sequence encoding a POI, optionally linked to a leader sequence effective to cause secretion of the POI from the host cell; a promoter effective to control protein expression in a host cell; a transcription terminator;

a selection marker; either homologous integration sequences or autonomous replication sequences,

wherein the promoter is a yeast promoter sequence isolated from Pichia pastoris and is identical with or corresponds to and has the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 125, SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131 , SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO 134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO 140, SEQ ID NO 141 , SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ ID NO 145, SEQ ID NO 146 and SEQ ID NO 147, or a functionally equivalent variant of any of the foregoing sequences, and the host cell is a yeast cell, preferably a cell of a strain of the genus Komagataella, in particular a cell of a strain of Komagataella pastoris, Komagataella pseudopastoris or Komagataella phaffii.

In another aspect the invention relates to the use of such a eukaryotic expression vector for recombinant expression of a POI in a host cell.

In case, that the POI is a cellular protein intended for metabolic engineering applications, i.e. for expression and developing its activity within a desired compartment of a host cell the POI may be expressed from a eukaryotic expression vector based on the pPuzzle backbone without a leader sequence effective to cause secretion of the POI from the host cell.

If the cellular POI is a homologous protein to the host cell, i.e. a protein which is naturally occurring in the host cell, the expression of the POI in the host cell may be modulated by the exchange of its native promoter sequence with a yeast promoter sequence isolated from Pichia pastoris and being identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 125, SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131 , SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO 134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO

140, SEQ ID NO 141 , SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ

ID NO 145, SEQ ID NO 146 and SEQ ID NO 147 or a functionally equivalent variant of any of the foregoing sequences.

This purpose may be achieved e.g. by transformation of a host cell with a recombinant DNA molecule comprising homologous sequences of the target gene to allow site specific recombination, the desired yeast promoter sequence and a selective marker suitable for the host cell. The site specific recombination shall take place in order to operably link the yeast promoter sequence with the nucleotide sequence encoding the POI. This results in the expression of the POI from the yeast promoter sequence instead of from the native promoter sequence.

Depending on the problem to be solved the selected yeast promoter may have either an increased promoter activity relative to the native promoter sequence leading to an increased expression of a POI in the host cell or may have a decreased promoter activity relative to the native promoter sequence leading to a reduced expression of a POI in the host cell.

In another aspect the invention relates to the use of a yeast promoter sequence being isolated from Pichia pastoris and being identical with or corresponding to and having the functional characteristics of a sequence selected from the group consisting of SEQ ID NO 125, SEQ ID NO 126, SEQ ID NO 127, SEQ ID NO 128, SEQ ID NO 129, SEQ ID NO 130, SEQ ID NO 131 , SEQ ID NO 132, SEQ ID NO 133, SEQ ID NO 134, SEQ ID NO 135, SEQ ID NO 136, SEQ ID NO 137, SEQ ID NO 138, SEQ ID NO 139, SEQ ID NO

140, SEQ ID NO 141 , SEQ ID NO 142, SEQ ID NO 143, SEQ ID NO 144, SEQ ID NO 145, SEQ ID NO 146 and SEQ ID NO 147 or a functionally equivalent variant of any of the foregoing sequences for modulation of the expression of a homologous POI in a host cell.

In another aspect the invention relates to such a use, wherein the yeast promoter sequence has an increased promoter activity relative to the native promoter sequence of the POI.

In another aspect the invention relates to such a use, wherein the yeast promoter sequence has a decreased promoter activity relative to the native promoter sequence of the POI.

In order that the invention described herein may be more fully understood, the following examples are set forth. The examples are for illustrative purposes only and are not to be construed as limiting this invention in any respect. It is further understood that the present invention shall also comprise variations of the expressly disclosed embodiments to an extent as would be contemplated by a person of ordinary skill in the art.

Examples

Examples 1 and 2 below illustrate the materials and methods used to investigate the effect of co-expression of different proteins involved in the eukaryotic secretion pathway (secretion helper factors) on the yield of a secreted heterologous protein of interest, i.e. on the secretion of the Fab fragment of the monoclonal anti-HIV1 antibody 2F5 in P. pastoris.

Example 1 : Identification and cloning of several secretion helper factors from Saccharomyces cerevisiae

In order to identify genes and their respective proteins which play a potential role during protein production, e.g. in the protein secretory pathway of P. pastoris the gene expression pattern of a P. pastoris strain containing the gene for human trypsinogen 1 was compared before and after induction of heterologous protein production (induction was done by a switch from glycerol to methanol as the sole carbon source), i.e. of trypsinogen production by microarray analysis.

As the genome sequence of P. pastoris has not been published and not many genes are characterized for P. pastoris DNA microarrays of S. cerevisiae were used for heterologous hybridization with P. pastoris cDNA. The experimental procedure of the microarray hybridisation and the evaluation of the obtained data was carried out as described in Sauer et al. (2004). Further details are found below.

a) Strain:

The expression strain was P. pastoris strain X33 (Invitrogen), a wild type strain which can grow on minimal media without supplements. The selection mechanism was based on the Zeocin™ resistance of the transformation vector. Transformation of the strain was carried out with a plasmid derived from

pPICZαB (Invitrogen), containing the gene for human trypsinogen 1 (Hohenblum et al., 2003). pPICZαB utilises the AOX 1 promoter of P. pastoris, which promoter is repressed by many carbon sources such as glucose, glycerol or ethanol but induced by the carbon source methanol, and the α-factor leader sequence of 5. cerevisiae for product secretion. The selected strain was of the methanol utilisation positive (mut + ) phenotype, which means that it is fully capable to metabolise methanol as the sole carbon source.

b) Cell culture:

Fermentation of P. pastoris

Fed batch fermentations were performed with a MBR mini bioreactor with a final working volume of 2 I, essentially as described by Hohenblum et al.

(2003). The media were as follows:

PTM 1 trace salts stock solution contained per litre

6.0 g CuSO 4 . 5H 2 O, 0.08 g NaI, 3.0 g MnSO 4 - H 2 O, 0.2 g Na 2 MoO 4 . 2H 2 O, 0.02 g H 3 BO 3 , 0.5 g CoCI 2 , 20.0 g ZnCI 2 , 65.0 g FeSO 4 . 7H 2 O, 0.2 g biotin and 5.0 ml H 2 SO 4 (95 %-98 %). All chemicals for PTM 1 trace salts stock solution were from Riedel-de Haen, except for biotin (Sigma), and H 2 SO 4 (Merck Eurolab).

Batch medium contained per litre

23.7 ml H 3 PO 4 (85 %), 0.6 g CaSO 4 .2H 2 O, 9.5 g K 2 SO 4 , 7.8 g MgSO 4 . 7H 2 O, 2.6 g KOH, 40 g glycerol, 4.4 ml PTM 1 trace salts stock solution.

Glycerol fed-batch solution contained per litre

632 g glycerol ( 100 %) and 12 ml PTM 1 trace salts stock solution.

Methanol fed- batch solution contained per litre 988 ml methanol (100 %) and 12 ml PTM 1 trace salts stock solution.

The dissolved oxygen was controlled at DO = 30 % with the stirrer speed (600 - 1200 rpm). Aeration rate was 100 I h 1 air, which was supplemented with oxygen (up to 25 %) after the begin of the fed batch. The temperature was 25 0 C, and the pH was controlled with NH 3 (25 %).

Before starting the fermentation, the pH of 1 .2 I batch medium was set to 5.0 with NH 3 (25 %). The batch phase of approximately 32 h was followed by a 4 h fed batch with glycerol medium (feed rate 15.6 ml Ir 1 ), leading to a dry biomass concentration of approximately 40 g I 1 . Then, the feed with methanol medium was started with a feed rate of 6.4 ml h "1 . Methanol induces the production of the heterologous protein trypsinogen and serves as a carbon source at the same time. The fermentation was terminated 14 h after the methanol feed start. The pH was 5.0 during batch, and kept at 5.0 throughout the fermentation.

Samples were taken at the end of the glycerol fed batch phase (trypsinogen non-expressing cells) and at the end of the methanol fed batch phase

(trypsinogen expressing cells), respectively. Cells were centrifuged to separate the cell culture supernatant, then the cell pellets were resuspended in 10 x the volume of TRI-reagent (Sigma) and frozen.

c) RNA Isolation:

The samples were thawed on ice and after addition of acid washed glass beads the cells were homogenised in a Ribolyser (Hybaid Ltd.) for 2 x 20 sec, in between cooling on ice. After addition of chloroform, the samples were centrifuged and the total RNA was precipitated from the aqueous phase adding isopropanol. The pellet was washed 2 x with 70% ethanol, dried and re- suspended in RNAse free water. mRNA was isolated using the MicroPoly( A) Purist mRNA purification Kit (Ambion) according to the manufacturers protocol.

d) Synthesis and labelling of cDNA: 5 μg of mRNA and 0.5 μg of oligodT primer were mixed in 7 μl of water, incubated for 5 min at 70 0 C and subsequently at 42 0 C for about 3 min. The following components were added to 5 μl of said reaction mixture: 4 μl reaction buffer (5 x) for Superscript Il reverse transcriptase (Invitrogen), 2 μl dTTP (2 mM), 2 μl dATP, dGTP, dCTP (5 mM), 2 μl DTT ( 100 mM), 2.5 μl RNasin (40 U, Promega) and either 2 μl FluoriLink Cy3-dUTP (1 mM) or 2 μl FluoriLink Cy5-dUTP ( 1 mM, Amersham Biosciences) respectively , and 1 μl Superscript Il reverse transcriptase (200 U, Invitrogen) to result in a total of 19.5 μl. The mixture was incubated for 1 h at 42°C. After addition of further 200 U Superscript Il reverse transcriptase the mixture was incubated for another 1 h at 42°C. 7 μl of 0.5 M NaOH/50 mM EDTA were added and the mixture was incubated at 7O 0 C for 15 min. The reaction mixture was

neutralised by addition of 10 μl Tris-HCI pH 7.5 (1 M). The labelled cDNA was purified with Qiaquick purification columns (Qiagen) according to the manufacturer's protocol.

e) Chip hybridisation and set-up of microarrays:

The S. cerevisiae cDNA microarrays used for this study were Hyper Gene Yeast Chips from Hitachi Software Engineering Europe AG. According to the manufacturer, about 0.1 to 0.3 ng of PCR amplified cDNA (approximately 200 bp to 8000 bp) were spotted onto a poly-L-lysine coated glass slide and fixed by baking, succinic anhydride blocking and heat denaturation.

Labelled cDNA was resuspended in about 70 μl of 5 x SSC/0.05% SDS, heat denatured at 95° C for 3 min and cooled on ice. SDS crystals appearing were dissolved by short and slight warming and the mixture was gently applied to a Yeast Chip. The spotted area was covered with a cover glass and the chips were placed in an airtight container with a humidified atmosphere at 60 0 C for 16 h.

The cover glasses were removed in 2 x SSC/0.1 % SDS and the chips were washed consecutively for 5-10 min each in 2 x SSC/0.1 % SDS, 0.5 x SSC/0.1 % SDS, and 0.2 x SSC/0.1 % SDS at RT. The chips were centrifuged at 600 rpm for 3 min in order to dry them. The washing conditions were chosen according to the manufacturer's manual.

Each sample (labelled cDNA from trypsinogen non-expressing cells and from trypsinogen expressing cells) was used for hybridisation of two parallel cDNA mircoarrays to test the reproducibility of the signals.

f) Data acquisition and statistical evaluation of microarray data:

Images were scanned at a resolution of 50 μm with a G2565AA Microarray scanner (Agilent) and were imported into the GenePix Pro 4.1 (Axon Instruments) microarray analysis software. GenePix Pro 4.1 was used for the quantification of the spot intensities. Each appearing gene spot was averaged. The data set was then imported into GeneSpring 6.1 (Silicon Genetics) for further normalisation and data analysis.

All of the values of each channel on each chip were divided by their respective median for normalisation. Subsequently, the median intensity of all TE spots (spotted with buffer, no DNA) deduced from each value, and all spot values less than the standard deviation of said threshold values were considered to be not

significant and were set to the value of the standard deviation. To determine induction or repression of gene activity, the normalised signals on each spot were compared, and all genes showing a signal difference exceeding the threshold (1 .5 fold) on both parallel independent microarrays were judged as significantly regulated.

After determination of the the relative expression levels of all measured genes, the genes were ordered by the relative difference of their expression levels, and the 524 with the highest difference were considered for further analysis. As the DNA microarrays used for these experiments were derived from Saccharomyces cerevisiae gene sequences, only putative gene functions for P. pastoris can be assigned by the homology to S. cerevisiae. After ranking the 524 differentially regulated genes based on their putative intracellular localisation and function, and focusing on those being involved in secretion and/or general stress response, out of the 64 potentially interesting genes 15 were selected for further analysis: PDH , CUP5, SSA4, BMH2, KIN2, KAR2, HAC1 , ERO1 , SSE1 , BFR2, COG6, SSO2, COY1 , IMH 1 and SEC31 .

g) Construction of an expression vector for cloning of the identified secretion helper factors: To generate a vector containing the GAP promoter and the his4 gene as selection marker, the A0X1 promoter of the vector pPIC9 (Invitrogen) was exchanged to the GAP promoter of pGAPZ B (Invitrogen) by restriction digest of both vectors with Notl and Mph1 1031 and subsequent ligation following a standard protocol. The newly constructed vector is referred to as pGAPHis.

h) Isolation of the helper factor genes from Saccharomyces cerevisiae and cloning into pGAPHis:

All the genes apart from Had were amplified directly from Saccharomyces cerevisiae genomic DIMA by PCR with specific oligonucleotide primers depicted in Table 1 . The P. pastoris Kozac sequence (ACG) was inserted directly before the start codon ATG. The non-template coded restrictions sites Sacll (Xhol for the gene PDN ) and either PmII or Sfil (EcoR1 for the gene PDH ) were added by using the respective forward and backward primer (see Table 1 ). After restriction digest of the PCR fragments of correct length (checked by agarose gel separation) with Sacll (Xhol for the gene PD11 ) and either PmII or Sfil

(EcoR1 for the gene PDH ) as shown in Table 1 , these fragments were cloned into the pGAPHis vector (also digested with the respective restriction enzymes

and treated with alkaline phosphatase). To construct the induced variant of the HAC 1 gene of S. cerevisiae, the DNA fragment coding for the first 220 amino acids was combined with the fragment coding for the 18 amino acid exon of the induced Haci p (Mori et al., 2000) in a two step PCR reaction, and the resulting fragment was ligated into pGAPHis.

All ligated plasmids were transformed into E. coli Top 10F ' (Invitrogen) and plated on Ampicillin containing LB-agar. Restriction enzyme analysis was performed to verify the correct identity of the respective plasmids.

Table 1 : PCR primers for amplification of the secretion helper factors from Saccharomyces cerevisiae (SEQ ID NO 1 to SEQ ID NO 31 )

Forward primer BFR2 FORW PmII (SEQ ID NO 1 ), 54 0 C: 5' _ TAAACACGTGAGCATGGAAAAATCACTAGCGG - 3'

Backward primer BFR2 BACK Sacll (SEQ ID NO 2), 56 ° C: 5' - TACACCGCGGTCAACCAAAGATTTGGATATC - 3'

Forward primer BMH2 FORW PmII (SEQ ID NO 3), 56 °C: 5' - TAACCACGTGAGCATGTCCCAAACTCGTGAAG - 3'

Backward primer BMH2 BACK Sacll (SEQ ID NO 4), 58 °C: 5' - TATGCCGCGGTTATTTGGTTGGTTCACCTTG - 3'

Forward primer COG6 FORW PmII (SEQ ID NO 5), 56 °C: 5' - TAAGCACGTGAGCATGGATTTCGTTGTAGACTAT - 3'

Backward primer COG6 BACK Sacll (SEQ ID NO 6), 60 °C: 5' - TAAGCCGCGGTCAGTGATCAATACCTATCAAC - 3'

Forward primer COY1 FORW PmII (SEQ ID NO 7), 54 °C: 5' - TAGTCACGTGAGCATGGATACGTCAGTATATTC - 3'

Backward primer C0Y1 BACK Sacll (SEQ ID NO 8), 58 °C: 5' - TACACCGCGGCTATCGATTTATGCCATGAAC - 3'

Forward primer CUP5 FORW PmII (SEQ ID NO 9), 54 °C: 5' - TATCCACGTGAGCATGACTGAATTGTGTCCTG - 3'

Backward primer CUP5 BACK Sacll (SEQ ID NO 10), 54 0 C : 5' - TACACCGCGGTTAACAGACAACATCTTGAG - 3'

Forward primer ER01 FORW Sfil (SEQ ID NO 1 1 ), 62 0 C:

5' - TATAGGCCCAGCCGGCCACGATGAGATTAAGAACCGCCATTG - 3'

Backward primer ER01 BACK Sacll (SEQ ID NO 12), 58 0 C: 5' - TGTCCCGCGGTTATTGTATATCTAGCTTATAGG - 3'

Forward primer IMH 1 FORW Sfil (SEQ ID NO 13), 54 °C:

5' - TATAGGCCCAGCCGGCCACGATGTTCAAACAGCTGTCAC - 3'

Backward primer IMH1 BACK Sacll (SEQ ID NO 14), 58 °C: 5' - TAGACCGCGGTTACTTCAGAGACATAACCAG - 3'

Forward primer KIN2 FORW Sfil (SEQ ID NO 15), 64 0 C:

5' - TCAAGGCCCAGCCGGCCACGATGCCTAATCCGAATACAGCAG - 3'

Backward primer KIN2 BACK Sacll (SEQ ID NO 16), 66 °C:

5' - TCTGCCGCGGCTATAGGTTTAATTCTTTTAAAATATAC - 3'

Forward primer KAR2 FORW PmII (SEQ ID NO 17), 56 °C: 5' - TAAGCACGTGACGATGTTTTTCAACAGACTAAGC - 3'

Backward primer KAR2 BACK Sacll (SEQ ID NO 18), 56 °C: 5' - TATGCCGCGGCTACAATTCGTCGTGTTCG - 3'

Forward primer PDH FORW EcoRI (SEQ ID NO 19), 58 °C: 5' - CGCCGAATTCACGATGAAGTTTTCTGCTGGTGC - 3'

Backward primer PDH BACK Xhol (SEQ ID NO 20), 58 °C: 5' - CCTCCTCGAGTTACAATTCATCGTGAATGGC - 3'

Forward primer SEC31 FORW Sfil (SEQ ID NO 21 ), 56 0 C:

5' - TATAGGCCCAGCCGGCCACGATGGTCAAACTTGCTGAGTT - 3'

Backward primer SEC31 BACK Sacll (SEQ ID NO 22), 58 0 C: 5' - TATGCCGCGGTTAATTCAAAGTCGCTTCAGC - 3'

Forward primer SSA4 FORW PmII (SEQ ID NO 23), 60 0 C: 5' - TATGCACGTGACGATGTCAAAAGCTGTTGGTATTG - 3'

Backward primer SSA4 BACK Sacll (SEQ ID NO 24), 58 °C: 5' - TATCCCGCGGCTAATCAACCTCTTCAACCG - 3'

Forward primer SS02 FORW PmII (SEQ ID NO 25), 62 °C: 5' - TACACACGTGACGATGAGCAACGCTAATCCTTATG - 3'

Backward primer SS02 BACK Sacll (SEQ ID NO 26), 60 °C: 5' - TATGCCGCGGTTACTTTCTTGTTTCCACAACG - 3'

Forward primer SSE1 FORW PmII (SEQ ID NO 27), 60 0 C: 5' - TAGACACGTGACGATGAGTACTCCATTTGGTTTAG - 3'

Backward primer SSE1 BACK Sacll (SEQ ID NO 28), 60 °C:

5' - TATCCCGCGGTTAGTCCATGTCAACATCACC - 3'

Forward primer HAC 1 FORW Sfil (SEQ ID NO 29), 58 ° C:

5' - GCAAGGCCCAGCCGGCCACGATGGAAATGACTGATTTTGAAC - 3'

Backward primer HAC BACK1 (SEQ ID NO 30), containing 3 ' -end of inactive had pu (5 ' -splicing site), 58 °C:

5' - TGGTCATCGTAATCACGGC - 3'

Backward primer HAC BACK2 Sacll (SEQ ID NO 31 ), containing the sequence encoding the last 18 aa of active had p, 58 °C:

5' - CCTCCCGCGGTCATGAAGTGATGAAGAAATCATTCAATTCAAATGAA TTCAAACCTGACTGCGCTTCTGGATTACGCCAATTGTCAAG - 3'

Example 2: Investigation of the effect of the secretion helper factors on heterologous protein production of recombinant 2F5 Fab in P. pastoris

The plasmid DNA from E. coli from Example 1 was used to transform P. pastoris strain SMD1 168 already containing the expression cassettes for 2F5 Fab under control of the GAP promoter, which strain was pre-selected for a high Fab secretion level. The strain SMD1 168 is a P. pastoris his4-defective strain (a pep4 mutant). Selection was based on zeocin resistance for the antibody genes, and histidin auxotrophy for the other genes.

a) Construction of the P. pastoris strain SMD 1 168 secreting the Fab fragment of the monoclonal anti-HIV1 antibody 2F5:

2F5 antibody fragment sequences for the Fab light and heavy chain were amplified by PCR from pRC/RSV containing the humanized IgG I mAb as disclosed in Gasser et al., 2006. The restriction sites EcoRI and Sacll were used for cloning.

In detail, for the generation of Fab, the entire light chain genes (vL and cL) and the vH and cH1 region of the heavy chain genes were amplified by PCR. The light chain fragment was ligated into a modified version of pGAPZαA, where the Avrll restriction site was changed into Ndel by site directed mutagenesis to allow subsequent linearization of the plasmids containing two cassettes.

The heavy chain fragment was inserted into the original version of pGAPZαA, which contains the constitutive P. pastoris glycerol aldehyde phosphate dehydrogenase (GAP) promoter followed by the MFa leader sequence of S. cerevisiae (Invitrogen, Carlsbad, CA, USA).

Plasmids combining the expression cassettes for both Fab chains on one vector were produced by double digestion of the light chain vector with BgI Il and Bam\λ\, and subsequent insertion into the unique BamH\ site of the vector pGAPZαA already containing a single copy of the expression cassette of the

heavy chain fragment. Plasmids were then linearized with Avrll prior to electrotransformation into P. pastoris.

All constructed expression cassettes were checked by DNA sequencing with the GAP forw/AOX3 ' back primers (Invitrogen).

b) Construction of P. pastoris strains co-expressing 2F5 Fab and a secretion helper factor:

Transformation of P. pastoris strains obtained in step a) was carried out with the plasmids of Example 1 , which are linearized in the HIS4 locus. The plasmids were introduced into the cells by electrotransformation. The transformed cells were cultivated on RDB-agar (lacking histidine) for selection of His-prototrophic clones, which contain the expression cassettes for the secretion helper factors.

c) Culturing transformed P. pastoris strains in shake flask cultures:

5 ml YP-medium (10 g/l yeast extract, 20 g/l peptone) containing 20 g/l glycerol were inoculated with a single colony of P. pastoris selected from the RDB plates and grown overnight at 28°C. Aliquots of these cultures corresponding to a final OD 600 of 0.1 were transferred to 10 ml of main culture medium (per liter: 10 g yeast extract, 10 g peptone, 100 mM potassium phosphate buffer pH 6.0, 13.4 g yeast nitrogen base with ammonium sulfate, 0.4 mg biotin) and incubated for 48 h at 28° C at vigorous shaking in 100 ml Erlenmeyer flasks. To induce recombinant protein expression, cultures with the GAP promoter were supplemented with 10 g/l glucose. The same amounts of substrate were added repeatedly 4 times every 12 h, before cells were harvested by centrif ugation at 2500 x g for 5 min at room temperature and prepared for analysis (biomass determination by measuring optical density at 600 nm, ELISA for Fab quantification in the culture supernatant).

d) Evaluation of the effect of co-overexpression of single folding helper factors by quantification of 2F5 Fab:

To determine the amount of secreted recombinantly expressed 2F5 Fab, 96 well microtiter plates (MaxiSorb, Nunc, Denmark) were coated with anti-hlgG (Fab specific) overnight at RT (Sigma l-5260; 1 : 1000 in PBS, pH 7.4), before serially diluted supernatants of P. pastoris cultures secreting 2F5 Fab from step c) (starting with a 1 :200 dilution in PBS/Tween20 (0.1 %) + 1 % BSA) were applied and incubated for 2 h at RT. A human Fab of normal IgG

(Rockland) was used as a standard protein at a starting concentration of 200 ng/ml. After each incubation step the plates were washed four times with PBS containing 0.1 % Tween 20 adjusted to pH 7.4. 100 μl of anti-kappa light chain - AP conjugate as secondary antibody ( 1 : 1000 in PBS/Tween + 1 % BSA) were added to each well, and incubated for 1 h at RT. After washing, the plates were stained with pNPP (1 mg/ml p-nitrophenyl phosphate in coating buffer, 0.1 N Na 2 C(VNaHCO 3 ; pH 9.6) and read at 405 nm (reference wavelength 620 nm).

Of each of the 15 different secretion helper factor constructs, 16 individual clones were cultivated in shake flask cultures as described in step c) and compared to 1 6 individual clones of the control strain, that was transformed with the pGAPHis vector lacking a gene. The 2F5 Fab productivity (μg Fab/biomass) was determined for all the analyzed cultures (first screening round). The 6 best clones of each of the constructs were then re-analyzed using the same system in a second screening round (for results see Table 2).

Table 2 shows the mean relative productivity of the 6 best clones of each tested secretion helper factor construct including the control construct (empty pGAPHis vector). The table shows the mean improvement factor of 2F5 Fab secretion of two screening rounds obtained by co-overexpression of the secretion helper factors relative to the control cultures. The secretion helper factors which are known in the art improving the secretion of heterologous proteins when co-overexpressed (PDM , KAR2, HAC1 , ERO 1 and SSO2) are included in Table 2 for comparative reasons.

Table 2: Mean relative productivity of the tested secretion helper factors

As can be seen from Table 2, that the secretion of the heterologous protein, i.e. the secretion of 2F5 Fab was increased for most of the analyzed secretion helper factors, in a range between 1 .2 and 1 .7-fold. Apart from the secretion helper factors already known in the art having a positive effect on the secretion of a heterologous protein co-overexpression of the secretion helper factors CUP5, SSA4, BMH2, KIN2, SSE 1 and BFR2 showed a highly significant increase in the amount of secreted heterologous protein and co-overexpression of C0G6, COY 1 and IMH 1 showed a significant increase in the amount of secreted heterologous protein.

Sequence information for the secretion helper factors PDN , KAR2, HAC 1 , ERO1 and SSO2 is disclosed in the prior art.

The nucleotide sequences of the secretion helper factors which are not yet known in the art improving the secretion of heterologous proteins when co- overexpressed are shown in Table 3 below.

Table 3: Nucleotide sequences of the isolated secretion helper factors (SEQ ID NO 32 to SEQ ID NO 41 )

S. cerevisiae BMH2 (SEQ ID NO 32)

ATGTCCCAAACTCGTGAAGATTCTGTTTACCTAGCTAAATTAGCTGAACAAG

CCGAACGTTATGAAGAAATGGTCGAAAACATGAAGGCCGTTGCTTCATCAGG

TCAAGAGTTATCTGTCGAAGAACGGAATCTATTGTCGGTTGCTTACAAGAAC

GTCATCGGTGCTCGCCGTGCTTCATGGAGAATAGTTTCTTCGATCGAACAAA

AAGAAGAATCAAAGGAGAAATCTGAACATCAAGTTGAATTAATCCGTTCTTA

CCGTTCTAAAATTGAAACTGAATTGACCAAAATCTCTGACGACATTTTATCTG

TGTTAGATTCTCATTTAATCCCTTCTGCTACTACTGGTGAGTCTAAAGTATTTT

ACTATAAGATGAAGGGTGACTACCACCGTTATTTAGCTGAATTTTCCAGCGG

AGATGCAAGAGAAAAGGCAACCAACTCCTCTTTGGAGGCTTATAAAACCGCT

TCCGAAATCGCCACAACTGAATTGCCTCCAACTCACCCAATTCGTTTAGGTCT

AGCTTTGAATTTCTCCGTCTTCTATTACGAAATTCAAAACTCTCCTGATAAGG

CTTGCCACTTGGCCAAACAAGCCTTTGATGATGCTATTGCTGAGTTAGATACT

TTATCTGAAGAATCATACAAGGATAGCACTTTGATCATGCAATTATTAAGGG

ACAACTTGACCTTATGGACCTCTGATATTTCTGAATCTGGTCAAGAAGATCAA

CAACAACAACAACAACAGCAACAGCAACAGCAACAACAGCAACAACAAGCT

CCAGCTGAACAAACTCAAGGTGAACCAACCAAATAA

S. cerevisiae BFR2 (SEQ ID NO 33)

ATGGAAAAATCACTAGCGGATCAAATTTCCGATATCGCCATTAAACCGGTC

AATAAAGACTTCGATATTGAAGATGAGGAAAATGCATCTTTATTTCAACAC

AATGAAAAAAATGGAGAAAGTGATTTAAGCGACTATGGAAATAGCAACAC

AGAAGAAACCAAGAAGGCGCACTATTTGGAGGTGGAAAAGTCTAAGTTAA

GAGCAGAAAAAGGTTTAGAACTAAACGATCCAAAATATACAGGTGTTAAAG

GTTCAAGACAAGCATTATATGAAGAAGTTTCCGAGAATGAGGACGAAGAAG

AAGAAGAAGAAGAGGAAGAAGAAAAAGAGGAAGATGCTCTTTCATTCAGG

ACAGATTCTGAAGATGAAGAAGTAGAGATTGATGAAGAAGAATCAGACGC

GGACGGCGGTGAAACGGAGGAGGCTCAACAGAAAAGGCATGCACTATCGA

AACTAATTCAACAAGAGACTAAACAAGCTATTAACAAACTGTCTCAATCAG

TTCAAAGAGATGCTTCGAAGGGTTATTCCATTTTACAACAGACAAAATTATT

TGACAACATCATTGATTTGAGAATAAAACTACAAAAAGCTGTAATTGCAGC

AAATAAGCTCCCATTAACTACAGAGTCCTGGGAAGAGGCTAAAATGGATGA

TTCAGAGGAAACAAAGCGTTTGCTGAAGGAAAACGAAAAACTGTTCAATAA

TTTATTCAATCGGTTGATAAATTTCAGAATAAAATTCCAACTTGGCGATCAT

ATCACTCAAAATGAAGAGGTGGCGAAGCATAAATTGTCCAAAAAAAGATCT

CTCAAAGAGCTTTACCAAGAAACTAATAGCTTAGACTCAGAACTAAAAGAG

TACAGGACTGCCGTATTAAACAAGTGGTCTACCAAAGTTTCTTCTGCATCAG

GTAACGCTGCTTTATCATCTAACAAATTCAAAGCTATCAACTTACCTGCAGA

TGTACAAGTCGAAAACCAATTATCCGATATGTCCCGTTTGATGAAAAGAAC

AAAGTTGAACAGGAGAAACATAACGCCTTTGTATTTCCAAAAAGACTGTGC

TAATGGCAGGCTACCAGAATTGATTTCTCCCGTTGTCAAAGATAGTGTTGAT

GACAATGAGAATTCGGATGATGGGCTTGATATCCCGAAAAACTATGACCCA

AGAAGAAAGGATAACAATGCCATTGACATTACCGAAAACCCATATGTTTTT

GATGACGAAGATTTTTACCGTGTTTTACTAAACGATTTAATTGACAAAAAGA

TTTCCAACGCTCACAATTCTGAAAGTGCAGCAATTACAATCACCTCAACTA

ATGCTCGTTCGAACAACAAGCTAAAGAAGAATATCGATACTAAGGCTTCCA

AGGGTAGGAAATTGAACTACTCAGTTCAAGATCCAATTGCGAATTATGAAG

CCCCCATCACATCCGGATACAAATGGTCAGACGACCAAATCGATGAATTCT TTGCGGGATTGTTAGGTCAACGAGTGAACTTTAATGAAAATGAGGATGAGG AACAACATGCCAGAATAGAAAATGACGAAGAATTAGAGGCTGTTAAAAAC GATGATATCCAAATCTTTGGTTGA

S. cerevisiae COG6 (SEQ ID NO 34)

ATGGATTTCGTTGTAGACTATCAGACCTACGCAATGGCGGATACTGCCACG

CCAGAATTACCAGAACCTGAGCCAAGACTAAACTTAACCTCAGATGCACAG

TCACAGCCCACCGGTAAACTAGATCTACAGTTTAAGTTGCCCGACCTTCAA

CGTTATTCCAATAATAATGCAACTTTGCCAGTAGATAATGATGGTGCTGGTT

CGAAAGACCTACATAAGAAAATGACACATTACGCAATGTCTTCCATTGATA

AAATACAGCTTTCAAATCCAAGCAAACAATTAGGGCAAAATTCCCAGGATG

AAAAACTATCGCAGCAAGAATCTCAAAATTTCACGAATTACGAGCCAAAAA

ACCTTGATTTATCAAAATTAGTATCCCCGTCAAGTGGTTCCAACAAAAATAC

CACAAATTTGGTTCTTTCGAATAAACTATCCAAGATATTGAACAATTACACA

TTGATTAACTATCAGGCCACAGTCCAACTAAGAAAATCCCTAAAGGTTCTA

GAAGAGAATAAAGAGAGATTGTCCCTTGATGAACAAAAGCTCATGAATCCT

GAATATGTAGGTACTTTGGCAAGAAGAGCATTGAGGACTGATTTGGAATCT

CAACTGCTAAAGGAACATATTACGGTACTTGAGGAATTCAAACCTATCATT

AGAAGGATTAAACGATTATCTTCTTCCGTCGAAAAAATACAAAGAACGAGC

GAAAAATTACTAAGTAATGAGACAAATGAGGTTCCAACAAATAACGTGGTA

CTTCAGGAAATAGATCAATACCGTTTAAAGGCAGAGCAGTTGAAGCTGAAA

AAAAAAATACTGTTATCTATAAGGGATAGGTTTACTTTGAATCAGGTAGAG

GACGATGTAATCACCAATGGTACTATAGACAACATCTTTTTCGAGGTAGTAA

AGAAAGTAATCAATATTAAAGATGAATCAAGTTTCTTGCTGACGCTTCCTAA

TTTGAATGCTGGAAATGCTTTGATAATGGGAGTTAATGAAATTTTAGAAAAG

ACAAACAAAAAAATCTTCAATTATTTGATCGATTTTTTATATAGTTTTGAAT

CCTCTTCAAATTTATTAAATGACCATGGTACTACTGAACAAGAAAGCTTAAA

CATTTTTCGGAAGAGTCTGGTCTTCCTGTCAAGTGATCTAGAATTATTTAAT

GAGTTGTTGAAAAGAGTGACCACACTGAGATCCAAGAGTATTCTGGATGAG

TTTTTGTCTCAATTCGATATGAATTCAACTACCTCTAAACCCATCATATTATC

GGCACACGATCCAATTAGGTATATTGGTGACGTACTAGCGTCCGTTCATTCC

ATCATCGCAAATGAAGCTGATTTCGTGAAGTCACTATTTGACTTTCAGGATG

AAGACTTAAAAGATACCCCAATTTCTATACTTCAACAAAACAAGACATTCT

TGAAAGGCATCGACAACAAATTGCTGAACGATATCATCCAGTCGCTATCCA

ATTCGTGTCGTATTCGTATCGAGCAAATCGTGAGGTTTGAAGAAAATCCGAT

CATCAATTTCGAGATTGTGAGGCTGCTGAAACTTTACAGAGTTATGTTCGAG

AGAAAGGGAATTCAGGACGATAGTTCTATTATTAACAATTTAAAGTCGTTG

GAAGACATTTCCAAAAACAGAATTATTGGATACTATGAAGACTATATGAAG

CAAACAGTCATGGCGGAAACAAAAAATTCTTCAGATGATTTACTGCCACCA

GAGTGGCTATCAGAGTATATGAATAAATTGGTAGAGTTATTTGAAATTTATG

AAAAGACACATGCTGCCGAAGATGAGGAATCAGAAGATAATAAATTGCTCT

CATCTAAGAATTTACAAACAATTGTAGAACAACCAATAAAAGATGTTCTGT

TAAAACAATTGCAAACATCTTTTCCTTTGGCGAAAAAAAATGAAAAAGAAA

AGGCATCATTGCTAACTATAGAGATAAACTGTTTCGATTTAATTAAATCTAG

ACTTCAACCTTTTGAGGGATTGTTTGCACAAGATGATGACAGCCGGAAAAT

CACCATCTGGGTTTGTGATAAACTGAAGGAATATACTAAGCAAATGCTAAC

TTTACAAATAAAATTCCTATTTGAGAATACAGGTTTAGACCTTTACAGCAAT

TTGGTCAATATGATTTTTCCTGTGGACTCAGTAAAGGATGAATTGGATTATG

ATATGTACTTAGCCCTGAGGGATAATTCATTGATGGAATTAGACATGGTCAG

AAAAAATGTGCATGATAAGTTGAACTATTATCTACCTCAGGCGTTAACAGA

TGTTCAAGGTAATTTACTATTTAAATTAACGTCACCAATGATAGCTGATGAA

ATATGCGATGAATGTTTCAAGAAGTTGTCGCTATTTTATAATATCTTCAGGA

AACTGTTGATTCATTTGTATCCGAACAAGAAGGATCAGGTATTCGAAATTTT

AAATTTTTCCACTGATGAATTTGACATGTTGATAGGTATTGATCACTGA

S. cerevisiae COYI (SEQ ID NO 35)

ATGGATACGTCAGTATATTCTCATGCATTGGATATTTGGGCCAAGGCAGATT

TAACGAATCTTCAAAGAGAATTGGATGCTGATGTTATAGAGATTAAGGATA

AAGAAACCCTGTCCTTGAATTCAAGAAAGTCATTAGCCACTGAGACTAAAA

AATTTAAAAAACTCGAACCTGAGGAAAAATTGAACAATGTGAATAAAATAA

TTAAGCAGTACCAACGTGAAATTGATAATTTGACACAGAGATCAAAATTCT

CTGAAAAGGTTCTTTTTGACGTATACGAAAAGCTTTCAGAGGCTCCTGATCC

ACAGCCGCTACTACAAAGTTCGTTGGAAAAATTGGGCAAAATTGATGACTC

GAAGGAACTTAAGGAAAAAATAAGCTACCTAGAAGATAAGCTAGCCAAAT

ATGCAGATTATGAGACTTTGAAATCAAGGTTACTGGACCTAGAGCAAAGCT

CTGCAAAAACATTGGCAAAAAGACTGACTGCGAAAACTCAAGAAATCAATT

CTACCTGGGAGGAAAAAGGAAGAAATTGGAAAGAGAGAGAAGCAGATCTA

TTGAAACAATTAACAAATGTACAGGAGCAAAACAAGGCACTAGAGGCCAA

AATATCTAAAAATATAGATATAGAAGGTAATGGAAACGAAGATGGTGACCA

AGAAAACAATCAAAAAGAAGTATCTACAAGGATTGCTGAATATAATCTAGT

AACACAGGAGTTGGAAACTACGCAGGCTAGAATATATCAGTTAGAGAAAAG

AAATGAGGAACTAAGTGGTGCTCTTGCAAAGGCAACTAGTGAAGCAGAAAA

AGAAACTGAGTTACATGCAAAGGAACTAAAACTTAACCAGCTGGAAAGCG

AAAATGCATTGTTGAGTGCATCCTATGAGCAGGAACGGAAATCAACATCAC

ATGCAATAAATGAGTTAAAAGAACAATTAAATAGCGTTGTGGCGGAATCGG

AATCTTACAAGTCGGAGCTAGAAACTGTTAGAAGAAAACTAAACAATTATT

CTGATTACAATAAGATAAAAGAAGAACTTTCTGCATTGAAAAAAATTGAGT

TTGGGGTAAACGAAGATGATTCTGATAATGACATTCGCTCTGAAGACAAGA

ATGATAATACTTTCGAAAGTTCCTTACTATCTGCAAATAAGAAGCTCCAGGC

TACTTTGGCGGAATACCGCTCAAAAAGTACGGCTCAAGAGGAAGAACGAA

ACGAATTGAAAAAATCTGTGGACCAATTGAAGCAGCAAATAGCTACTCTCA

AAGAAGCAAATGAAAAATTAGAGACGGACCTAGAAAAAGTAGAGAACGTC

AGTCCTCACTTCAACGAGACTGCAAGTATGATGTCTGGTGTAACAAGACAA

ATGAACAATCGTACGTCCCATAAAATGTCCCCAACGAGTTCTATTATTGGTA

TTCCAGAAGATGGGGAACTTTCTGGAAACCAATCAACCATTTTACCAATAG

TTACTAAACAAAGAGACAGATTTCGTTCGAGAAATATGGATCTGGAAAAGC

AACTAAGACAAGGAAACTCAGAAAAGGGTAAGCTTAAACTAGAAATTTCGA

AGCTAAAAGGCGACAATACGAAGCTTTATGAACGGATTAGGTATCTGCAAT

CCTATAATAATAACAACGCTCCCGTTAATCAAAGTACAGAGCGTATTGACG

TGGAATCCCAATACTCAAGGGTGTATGATGAATCGTTGCATCCAATGGCAA

ATTTTAGACAGAACGAATTAAACCACTACAAAAACAAGAAATTATCAGCTT

TAGAGAAGTTATTTTCCAGTTTTGCAAAAGTCATTTTACAAAATAAAATGAC

AAGGATGGTATTCCTCTTTTACTGTATCGGTTTACACGGACTCGTATTCATG

ATGAGCATGTATGTGATTAATATTAGCGGCTACATGACACCTGAGGTTGGTA

TAGTACAATCGGCAAAGTCTTCTTCAAATCTCAACGGAGGACTTGGGGGAG

CAGAAAAAGTAGCTGCAGGCGTTGGTTCAGTTCATGGCATAAATCGATA

S. cerevisiae CUP5 (SEQ ID NO 36)

ATGACTGAATTGTGTCCTGTCTACGCCCCTTTCTTTGGTGCCATTGGTTGTGC

CTCTGCAATTATCTTCACCTCATTAGGTGCTGCTTACGGTACTGCTAAGTCT

GGTGTTGGTATCTGTGCCACTTGTGTGTTGAGACCAGACCTATTATTCAAGA

ACATTGTTCCTGTTATTATGGCTGGTATCATTGCCATTTACGGTTTAGTTGTT

TCCGTTTTGGTTTGTTATTCGTTGGGTCAAAAGCAAGCTTTGTACACCGGTTT

CATCCAATTGGGTGCCGGTCTATCAGTCGGTTTGAGTGGTCTAGCTGCTGGT

TTCGCTATTGGTATTGTCGGTGATGCAGGTGTTAGAGGTTCCTCTCAACAAC

CAAGATTATTCGTCGGTATGATTTTGATTTTGATTTTTGCTGAAGTTTTGGGT

CTATACGGTTTGATTGTTGCTTTGTTGTTGAACTCCAGGGCTACTCAAGATG

TTGTCTGTTAA

S. cerevisiae IMH 1 (SEQ ID NO 37)

ATGTTCAAACAGCTGTCACAAATTGGTAAGAATCTTACCGATGAATTAGCG AAGGGCTTAGCCGATGATATGAGCCCTACCCCGTCAGAACAACAAATCGAA GATGATAAGAGTGGCTTGCCAAAAGAAATACAAGCTAAATTAAGAAAATTT GAGAAATATGAACAAAAATACCCTTTGCTACTCTCCGCATACAAAAATGAA

AAATTAAAGTCAGAGAAGTTAGAGGCTGTTGAAAAGATTTTAGCGGAAAAT

ACACCCATATCTAATATTGACGACGCAGTGGATACGTTGCCAGCTTTTTTCC

AGGATTTAAACAACAAAAATAACCTATTGAATGATGAGATCAAGAGATTAA

CTAAGCAGAACTCGGAAATTCCAGAAAGCGCCTCTAGTGAAACTCTGAAGG

ATAAAGAAGAGGAATTTTTGAAAAAAGAGCAAAATTATAAAAATGACATAG

ACGATCTAAAAAAAAAAATGGAAGCTTTAAACATAGAATTGGATACTGTAC

AAAAAGAAAAAAATGATACTGTTTCAGGTTTGAGAGAAAAAATAGTTGCAC

TGGAAAATATACTAAAGGAAGAAAGGGAGGCCAAAAAACAGAAAGAAGAA

GTATCTATATCCGAACTGAAGGAAGAATTGGCTATAAAGAACCATTCTCTC

GAGGACAGTCGAATGAAGATAACCGAATTGGAGCAAAATTTGTCTTCGAAA

AGTACTATAATGGAGGAAAAGTCCTCAGAGTTGGCAGAACTAAATATTACT

TTAAAAGAGAAAGAGCGCAAGCTGAGTGAATTGGAAAAAAAAATGAAGGA

GTTACCGAAGGCGATATCTCATCAAAATGTAGGAAACAATAACAGAAGGAA

AAAGAATAGAAACAAGGGAAAGAAAAATAAGGGAGGCATAACTACGGGTG

ATATCAGTGAAGAGGAAACGGTCGATAACTCAATCAATACTGAAGAATATG

ATAAGCTTAAAGAAAATTTGCAAGAATTACAAGAAAAATATAAAGATTGTG

AAGATTGGAAGCAAAAGTATGAAGATATAGAAGCAGAACTAAAAGATGCT

AAAGAATTGGAAAACTCACAGCTCGAAAAATCAGCAAAGGAGCTGGAAAC

CCTTAACACCGAGTTGATCGATACCAAGAAGTCATTGAAAGAAAAAAATTC

GGAGCTAGAGGAGGTGAGAGATATGCTGAGGACTGTAGGCAATGAGCTTGT

GGACGCAAAAGATGAGATTAAAGAGTCTTCGAGTAAACAAAATGAAGAAG

TGAAAACCGTTAAGCTGGAGCTCGATGATTTACGCCATAAAAATGCAACGA

TGATCGAGGCCTACGAAGCTAAAAATACTGAGTTGAGAAGTAAGATAGAGT

TATTGAGCAAGAAAGTAGAGCATCTGAAGAATTTATGTACAGAAAAGGAGA

AAGAGCAGACTACATCGCAGAACAAGGTAGCCAAATTAAATGAGGAGATA

TCTCAACTTACCTACGAAAAATCAAACATAACAAAGGAGCTTACTTCTTTA

AGAACCTCTTATAAACAAAAGGAGAAAACTGTGAGTTACTTGGAGGAACAA

GTTAAACAATTTAGTGAGCAAAAGGACGTGGCTGAAAAATCCACAGAACAG

CTGAGAAAAGATCATGCTAAAATTTCTAACAGATTAGACTTATTAAAAAAG

GAAAATGAGACACTGCATAATGATATCGCAAAGAATTCTAATTCCTACGAG

GAGTATTTGAAAGAAAATGGTAAATTATCGGAAAGATTGAATATTTTGCAA

GAAAAATACAATACCTTGCAAAATGTAAAAAGTAATTCGAATGAACACATA

GATTCTATCAAAAGACAATGTGAGGAACTAAATGTCAAGTTGAAGGAATCT

ACAAAAAAAATTTTATCTTTAGAAGATGAACTAAATGAATATGCTAATATTG

TTCAAGACAAAACCAGAGAAGCTAACACATTGAGAAGGTTAGTTTCGGACA

GTCAGACAGATGATTCGAGCAAACAAAAAGAGTTGGAGAATAAATTGGCCT

ATTTAACGGATGAAAAGAATAAATTGGAAGCAGAATTAGACTTACAAACAT

CCAGAAAGGCCACTGAATTACAAGAGTGGAAGCATACAGTAACTGAGCTG

AAATCGGAAATACACGCTTTAAAGCTTCGTGAAGAGGGACTAAAATCAGAG

GTTGACGCATTGAAACATGTTAACAATGACATCAAAAGGAAGACTCAAGCC

ACTTCAGATGATTCCGATCAGTTGGAACAGATCACATCTAATTTAAAACTCT

CATTGTCTAAGGCTGATGAAAAGAATTTTGAGCTACAGTCTGCCAATGAGA

AACTTCTGAATTTAAATAACGAACTTAACAAGAAATTTGATCGATTACTAAA

AAATTATCGTTCATTGTCCTCTCAATTGAATGCTTTAAAGGAAAGACAATAC

AGTGACAAGTCAGGAAGAGTTAGTAGGTCTGGTTCTATCGGTACTCTAGCT

AACGCGAATATTGATTCCTCACCAGCGAATAACTCTAATCCAACTAAATTA

GAGAAGATACGATCATCAAGTTCATTGGAGTTAGACTCTGAGAAAAATGAA

AAAATTGCATATATAAAAAATGTTTTGTTGGGATTTTTGGAGCACAAGGAAC

AACGGAACCAATTACTTCCTGTAATTTCTATGTTGTTACAACTGGACAGTAC

TGATGAAAAAAGACTGGTTATGTCTCTGAAGTAA

S. cerevisiae KIN2 (SEQ ID NO 38)

ATGCCTAATCCGAATACAGCAGATTACTTGGTGAATCCAAATTTCAGGACC

AGTAAGGGCGGATCTTTATCGCCGACGCCAGAAGCTTTCAACGACACGCGA

GTTGCTGCACCAGCCACTCTTCGCATGATGGGCAAGCAATCTGGACCAAGA

AATGACCAGCAACAAGCACCACTGATGCCTCCTGCAGATATCAAACAGGGC

AAGGAACAGGCAGCTCAGAGACAAAATGATGCATCGAGGCCTAATGGCGC

CGTGGAATTAAGGCAATTTCATAGAAGATCTTTGGGAGATTGGGAGTTCCTT

GAAACGGTTGGCGCAGGCTCTATGGGTAAAGTTAAATTGGTCAAGCATCGT

CAAACAAAGGAAATTTGTGTAATAAAGATTGTTAATAGGGCTTCCAAGGCT

TATCTCCATAAACAGCACTCTTTACCTTCCCCAAAGAATGAGAGTGAGATAT

TAGAAAGACAAAAGCGGTTAGAAAAAGAAATTGCGAGGGATAAAAGGACT

GTTAGGGAAGCCTCTTTGGGCCAAATCCTTTACCATCCTCATATCTGTCGTT

TATTTGAAATGTGCACTATGTCAAACCATTTTTATATGCTTTTTGAATACGTT

TCCGGTGGACAGCTGTTAGATTATATTATTCAGCATGGCTCATTAAAGGAAC

ACCATGCGAGGAAATTTGCCAGAGGTATAGCTAGTGCGCTGCAATACTTAC

ATGCCAATAATATTGTTCATCGAGATCTGAAAATTGAGAATATAATGATATC

TAGTTCAGGTGAAATTAAGATCATTGATTTTGGTCTTTCCAACATTTTTGATT

ATAGGAAACAATTACATACGTTTTGTGGTTCCTTGTACTTTGCAGCACCAGA

ACTATTAAAAGCGCAGCCATACACAGGACCTGAGGTAGATATTTGGTCGTT

TGGTATTGTTCTTTATGTCTTGGTCTGCGGTAAAGTACCATTTGATGATGAG

AACTCAAGCATTTTACATGAAAAAATAAAAAAAGGTAAAGTAGACTATCCT

TCACACTTATCCATTGAAGTTATATCTTTATTAACCAGGATGATTGTTGTCG

ACCCATTAAGAAGAGCAACATTAAAGAATGTCGTTGAGCATCCATGGATGA

ACAGAGGATACGATTTTAAGGCTCCATCATATGTTCCTAATCGTGTTCCATT

AACCCCTGAAATGATAGATAGCCAAGTTCTGAAGGAAATGTATCGCCTAGA

ATTTATTGACGATATTGAAGATACAAGAAGATCATTGATCCGATTAGTAACT

GAAAAGGAATACATCCAACTTTCCCAAGAATACTGGGACAAATTATCCAAC

GCCAAGGGGTTGAGTTCAAGTTTAAATAATAACTACCTAAATTCAACGGCA

CAACAAACCTTAATACAAAATCATATTACAAGTAATCCATCGCAAAGTGGT

TATAATGAACCAGATAGTAATTTTGAAGATCCTACTTTAGCATATCATCCAT

TACTATCAATATATCACTTGGTTTCAGAAATGGTTGCACGGAAATTAGCGAA

GTTGCAAAGAAGGCAAGCATTGGCCCTGCAAGCGCAAGCTCAGCAAAGGC

AACAACAGCAACAAGTAGCACTTGGCACTAAGGTCGCCTTAAATAATAACT

CCCCGGATATTATGACCAAAATGAGGAGCCCTCAGAAAGAAGTAGTACCTA

ATCCTGGTATTTTTCAAGTGCCGGCAATTGGAACATCGGGAACCTCAAACA

ACACTAATACCTCAAACAAACCTCCACTGCATGTAATGGTTCCTCCTAAACT

AACAATACCGGAACAAGCGCATACTTCTCCAACATCTAGGAAGAGTTCCGA

CATTCATACGGAATTAAATGGTGTTTTGAAATCAACACCAGTCCCCGTGTCT

GGCGAATATCAGCAACGTTCTGCTTCACCCGTAGTAGGTGAACATCAGGAA

AAGAATACAATAGGCGGCATATTCAGAAGAATATCACAAAGTGGACAATCT

CAGCATCCCACACGGCAACAGGAACCTCTTCCAGAAAGAGAACCTCCAAC

ATATATGTCAAAATCAAATGAAATTTCCATCAAAGTACCGAAAAGCCATAG

TCGTACTATATCAGATTATATTCCTAGCGCTAGAAGATATCCATCTTACGTG

CCAAATTCTGTTGATGTAAAACAGAAACCCGCTAAAAACACTACCATAGCA

CCTCCTATAAGGTCAGTATCACAAAAGCAAAACAGTGATCTTCCAGCTTTA

CCTCAGAACGCCGAACTAATTGTTCAAAAACAACGGCAAAAACTATTACAG

GAAAATCTCGACAAATTACAAATTAATGATAATGATAACAACAATGTGAAC

GCTGTAGTCGATGGTATCAATAATGATAATAGTGACCATTATCTCTCCGTTC

CGAAGGGTCGTAAGTTACATCCTAGTGCAAGGGCTAAATCGGTGGGGCATG

CTCGTCGTGAATCTTTGAAATTTACTAGGCCGCCTATACCAGCAGCCCTTCC

GCCATCAGATATGACAAACGATAACGGCTTTTTGGGAGAGGCAAACAAGGA

GAGATACAATCCTGTTAGCAGTAACTTTTCGACCGTTCCTGAAGATTCTACC

ACATACAGTAACGATACTAACAATAGACTGACTTCGGTGTATTCTCAGGAG

CTTACTGAGAAGCAAATTTTGGAGGAAGCTTCAAAGGCACCCCCCGGGTCT

ATGCCATCAATTGATTATCCAAAGTCAATGTTTTTGAAGGGTTTTTTCTCTGT

ACAAACAACCTCCTCTAAACCATTGCCTATTGTTCGTCACAATATCATATCT

GTTTTAACAAGAATGAATATTGATTTCAAAGAAGTGAAAGGCGGGTTCATA

TGTGTCCAACAAAGGCCATCTATTGAGACTGCAGCTGTCCCTGTTATAACCA

CTACTGGCGTGGGTTTGGATTCCGGAAAGGCGATGGATCTGCAAAATAGTT

TAGACAGTCAATTATCATCCAGTTACCATAGTACAGCGTCCTCAGCATCAA

GAAATAGTTCGATAAAACGCCAAGGTTCTTATAAGAGGGGCCAGAATAATA

TACCACTAACACCTTTAGCGACCAATACACATCAAAGAAATTCATCTATCC

CAATGTCTCCAAACTACGGAAACCAAAGTAATGGTACATCAGGGGAACTAT

CTTCCATGTCATTAGATTATGTTCAACAACAGGATGATATTTTAACAACATC

AAGAGCCCAAAATATAAATAACGTAAATGGTCAAACAGAGCAAACCAATA

CTTCTGGTATAAAAGAAAGGCCTCCTATTAAATTTGAGATTCACATTGTAAA

GGTTCGTATCGTCGGCCTAGCAGGTGTACATTTCAAAAAGGTTTCTGGTAAT

ACGTGGCTATATAAAGAATTGGCATCGTATATTTTAAAAGAATTAAACCTAT

AG

S. cerevisiae SEC31 (SEQ ID NO 39)

ATGGTCAAACTTGCTGAGTTTTCTCGAACAGCCACGTTTGCGTGGTCACATG

ATAAAATTCCATTATTGGTCTCTGGTACCGTATCTGGTACGGTGGATGCTAA

TTTCTCCACTGATTCATCTCTAGAATTGTGGTCATTGTTGGCTGCTGATTCGG

AGAAGCCTATTGCTTCCTTGCAAGTGGATTCCAAATTCAATGATTTGGATTG

GTCTCATAATAACAAGATTATTGCTGGTGCTCTGGATAACGGTAGTTTGGAA

TTGTACTCCACCAATGAAGCAAACAACGCTATCAACTCCATGGCCAGATTT

AGCAACCATTCTTCCTCTGTGAAGACGGTAAAGTTTAACGCAAAGCAAGAC

AACGTTCTTGCTTCGGGTGGTAACAACGGTGAAATTTTTATTTGGGACATGA

ATAAATGCACTGAATCGCCCTCCAATTATACTCCATTGACACCGGGTCAAT

CGATGTCGTCCGTTGACGAGGTCATTTCCCTAGCATGGAACCAATCTTTGGC

CCATGTTTTTGCATCTGCCGGGTCGTCTAATTTCGCATCTATTTGGGATTTGA

AGGCTAAGAAGGAAGTCATTCATCTAAGTTACACTTCACCTAATTCAGGTAT

CAAGCAACAGCTGTCCGTTGTTGAATGGCACCCAAAAAACTCCACAAGAGT

GGCAACGGCTACTGGTAGCGATAATGATCCATCTATCCTGATCTGGGATTTA

AGAAACGCCAACACACCATTGCAGACTTTAAATCAAGGCCATCAAAAGGGT

ATTTTGTCATTAGATTGGTGTCATCAGGACGAACATCTATTATTGTCCAGTG

GTAGAGATAATACCGTTCTTCTATGGAACCCTGAGTCAGCCGAACAACTGT

CCCAATTCCCAGCTCGTGGAAACTGGTGTTTTAAGACCAAATTTGCACCAG

AGGCTCCAGACCTATTTGCTTGTGCCTCCTTTGATAACAAAATTGAGGTACA

GACTTTGCAAAATCTCACAAACACTTTGGATGAGCAAGAAACCGAAACTAA

GCAGCAAGAATCTGAAACAGATTTTTGGAATAATGTTTCCCGAGAGGAATC

AAAAGAGAAGCCATCTGTTTTCCATTTACAAGCCCCAACTTGGTATGGGGA

ACCATCTCCCGCAGCTCATTGGGCTTTCGGTGGTAAATTGGTTCAAATTACT

CCAGATGGTAAAGGTGTATCTATAACAAACCCAAAAATTTCAGGCTTAGAA

TCAAACACTACTTTGAGTGAAGCGTTGAAAACTAAGGATTTCAAACCATTA

ATAAATCAAAGACTGGTCAAAGTTATTGATGACGTTAATGAAGAAGATTGG

AATTTATTGGAAAAGTTATCAATGGACGGTACTGAGGAGTTCTTGAAAGAG

GCTCTTGCATTCGACAACGATGAATCAGATGCACAAGACGATGCCAACAAT

GAGAAAGAAGACGATGGGGAAGAATTCTTTCAACAAATTGAAACCAATTTC

CAACCCGAGGGCGATTTCTCCTTGTCTGGTAATATCGAACAAACTATTTCCA

AGAACTTGGTTTCTGGCAACATTAAGAGCGCTGTGAAAAATTCTCTAGAGA

ATGACTTACTAATGGAGGCCATGGTGATCGCATTAGATTCAAATAACGAAA

GATTAAAGGAAAGTGTCAAGAATGCCTATTTTGCGAAGTATGGATCTAAAT

CATCGCTCTCGAGGATACTATACTCCATTTCTAAGAGGGAAGTAGATGATTT

GGTTGAAAATTTGGATGTCTCTCAGTGGAAGTTTATCTCTAAAGCAATTCAA

AACTTATATCCAAATGATATCGCCCAGAGGAATGAAATGTTGATTAAATTG

GGAGACAGGTTAAAGGAAAATGGTCATAGACAAGATTCTTTGACTTTGTAC

TTGGCTGCCGGATCATTAGATAAGGTGGCTTCAATTTGGTTATCAGAATTTC

CAGATTTGGAGGATAAATTGAAGAAAGATAATAAGACAATTTATGAAGCTC

ATTCCGAATGTCTAACTGAGTTCATTGAAAGATTCACCGTATTTTCCAACTT

CATTAATGGAAGCTCTACCATTAATAATGAGCAATTAATTGCCAAATTTTTG

GAATTTATCAACTTAACTACTTCCACAGGAAATTTCGAACTAGCCACTGAAT

TCTTAAATAGTTTACCAAGTGACAATGAAGAGGTTAAAACAGAAAAGGCAC

GTGTCTTGATTGCTTCCGGCAAATCATTACCGGCACAAAATCCTGCGACAG

CGACGACCAGCAAAGCCAAGTATACAAACGCCAAGACAAATAAGAACGTT

CCTGTACTACCAACTCCTGGAATGCCTTCTACTACTTCTATTCCTAGTATGC

AGGCACCATTTTATGGTATGACACCAGGCGCCTCTGCAAATGCTCTACCTCC

AAAGCCGTACGTTCCAGCAACCACCACTAGTGCTCCTGTTCATACAGAAGG

TAAATATGCGCCACCAAGCCAACCTTCGATGGCGTCACCTTTTGTTAACAA

AACAAATAGCTCGACCAGATTGAATTCTTTTGCTCCTCCGCCTAACCCATAT

GCCACTGCAACAGTTCCTGCAACGAACGTATCTACAACGTCGATTCCGCAA

AACACTTTTGCTCCTATACAACCTGGTATGCCTATTATGGGCGACTATAATG

CTCAATCTAGCTCTATTCCTTCACAACCTCCAATTAATGCTGTATCGGGTCA

AACGCCACATCTCAACCGTAAAGCCAATGATGGTTGGAATGATTTGCCTTT

GAAGGTCAAAGAAAAACCATCTCGTGCCAAGGCTGTATCTGTTGCCCCTCC

AAATATCCTATCGACACCAACTCCATTAAATGGTATCCCTGCAAATGCTGCT

AGTACCATGCCTCCGCCACCTCTTTCCAGAGCTCCCTCTTCTGTGTCAATGG

TATCACCACCTCCTCTACACAAAAATTCTAGAGTCCCATCCTTGGTTGCAAC

TTCTGAGTCACCAAGGGCATCCATATCAAATCCATACGCTCCTCCTCAATCA

TCACAACAATTCCCAATAGGTACTATTTCTACAGCAAACCAAACGTCAAAC

ACCGCTCAGGTAGCTTCATCGAACCCCTATGCTCCACCACCACAACAAAGA

GTAGCAACCCCATTATCTGGAGGCGTGCCTCCAGCTCCGTTGCCAAAGGCC

TCTAATCCATATGCTCCAACTGCAACCACTCAACCCAACGGTTCCTCCTATC

CTCCAACCGGTCCGTATACTAATAACCATACCATGACCTCTCCTCCTCCCGT

TTTTAACAAACCTCCCACTGGCCCCCCTCCGATTAGCATGAAGAAGAGAAG

CAACAAGTTAGCTAGTATAGAACAAAACCCATCTCAAGGTGCTACTTATCC

TCCAACCCTTTCCAGCTCGGCCTCTCCATTGCAGCCTTCTCAACCGCCAACT

TTGGCTTCTCAGGTTAATACCTCCGCTGAGAATGTCAGTCATGAAATTCCAG

CTGATCAACAACCCATTGTCGACTTCTTGAAAGAAGAACTGGCTCGCGTAA

CACCATTGACCCCAAAGGAGTACTCCAAACAATTAAAGGATTGTGATAAAC

GATTAAAGATTCTTTTCTACCATTTGGAAAAGCAGGATTTATTAACCCAACC

AACAATCGATTGTTTACATGACCTCGTCGCATTAATGAAGGAAAAGAAATA

CAAAGAAGCTATGGTCATCCATGCTAATATCGCTACAAACCATGCTCAAGA

GGGTGGTAACTGGCTGACAGGAGTGAAGAGGTTGATTGGCATAGCTGAAGC

GACTTTGAATTAA

S. cerevisiae SSA4 (SEQ ID NO 40)

ATGTCAAAAGCTGTTGGTATTGATTTAGGTACAACCTATTCATGTGTTGCTC

ATTTTGCAAACGATAGGGTTGAAATTATCGCTAACGATCAAGGTAATAGAA

CGACGCCTTCTTATGTGGCTTTTACTGACACAGAAAGGCTAATTGGTGACGC

TGCGAAGAATCAAGCTGCGATGAACCCACATAATACAGTATTCGATGCTAA

GCGTCTGATCGGACGTAAATTCGATGATCCAGAAGTGACGAACGATGCTAA

GCATTACCCATTCAAAGTGATTGACAAGGGAGGTAAACCGGTAGTGCAAGT

GGAATATAAAGGCGAGACAAAGACATTTACTCCAGAAGAAATTTCCTCAAT

GATCTTGACAAAGATGAAGGAGACTGCTGAGAACTTTTTAGGAACAGAAGT

GAAAGATGCTGTAGTAACGGTTCCAGCCTATTTCAACGATTCACAAAGGCA

AGCAACAAAAGATGCCGGTACAATCGCGGGCTTGAACGTTCTTCGTATCAT

TAATGAACCTACAGCTGCCGCTATTGCGTATGGGCTGGACAAGAAATCGCA

GAAGGAGCACAACGTCTTGATCTTTGATTTAGGTGGTGGTACTTTTGATGTC

TCTCTGCTATCCATAGATGAAGGTGTCTTTGAGGTTAAGGCTACTGCTGGTG

ACACTCACTTGGGTGGTGAAGATTTCGATAGTAGGCTGGTTAACTTTCTAGC

CGAGGAGTTCAAAAGAAAAAATAAAAAGGATCTAACAACTAACCAAAGGT

CCCTAAGGAGGTTAAGGACCGCCGCTGAAAGGGCCAAGAGAACTCTGTCTT

CGTCTGCTCAGACATCTATAGAAATAGATTCATTATTTGAGGGTATCGATTT

CTATACTTCCATTACAAGGGCAAGATTTGAAGAATTATGTGCTGATTTGTTT

AGATCTACATTGGAGCCAGTGGAAAAAGTTTTGGCTGATTCAAAATTAGAT

AAGTCACAAATTGATGAAATTGTACTTGTTGGTGGTTCAACAAGAATTCCAA

AAGTACAAAAACTGGTTTCTGATTTTTTCAATGGTAAAGAACCAAACCGTTC

GATTAACCCTGATGAGGCCGTCGCTTATGGTGCTGCCGTACAGGCTGCCAT

CTTAACGGGTGACCAGTCGTCGACGACCCAAGATTTACTGTTGCTGGATGTT

GCACCATTATCTCTAGGTATTGAAACTGCAGGTGGTATTATGACAAAGTTGA

TCCCAAGAAATTCGACTATCCCAACAAAAAAATCGGAAGTGTTTTCCACCT

ACGCTGACAACCAACCTGGTGTGTTGATACAAGTTTTTGAGGGTGAAAGGA

CAAGGACAAAAGACAACAATCTACTGGGTAAATTTGAGTTGAGCGGTATTC

CACCCGCTCCAAGAGGCGTACCACAAATTGAAGTTACATTTGATATCGATG

CAAATGGTATTCTGAACGTATCTGCCGTTGAAAAAGGTACTGGTAAATCTA

ACAAGATTACAATTACTAACGATAAGGGAAGATTATCGAAGGAAGATATCG

ATAAAATGGTTGCTGAGGCAGAAAAGTTCAAGGCCGAAGATGAACAAGAA

GCTCAACGTGTTCAAGCTAAGAATCAGCTAGAATCGTACGCGTTTACTTTGA

AAAATTCTGTGAGCGAAAATAACTTCAAGGAGAAGGTGGGTGAAGAGGATG

CCAGGAAATTGGAAGCCGCCGCCCAAGATGCTATAAATTGGTTAGATGCTT

CGCAAGCGGCCTCCACCGAGGAATACAAGGAAAGGCAAAAGGAACTAGAA

GGTGTTGCAAACCCCATTATGAGTAAATTTTACGGAGCTGCAGGTGGTGCC

CCAGGAGCAGGCCCAGTTCCGGGTGCTGGAGCAGGCCCCACTGGAGCACC

AGACAACGGCCCAACGGTTGAAGAGGTTGATTAG

S. cerevisiae SSEI (SEQ ID NO 41 )

ATGAGTACTCCATTTGGTTTAGATTTAGGTAACAATAACTCTGTCCTTGCCG

TTGCTAGAAACAGAGGTATCGACATTGTCGTTAATGAAGTCTCTAACCGTTC

CACCCCATCTGTTGTTGGTTTTGGTCCAAAGAACAGATACTTGGGTGAAACT

GGTAAGAACAAGCAGACTTCCAACATCAAGAACACTGTCGCCAACTTGAAA

AGAATTATTGGTTTGGATTACCACCATCCAGATTTCGAGCAAGAATCTAAGC

ACTTCACCTCTAAGTTGGTTGAATTGGATGACAAGAAGACTGGTGCCGAAG

TTAGATTCGCTGGTGAGAAACATGTTTTTTCAGCTACTCAACTAGCTGCCAT

GTTCATCGACAAAGTCAAGGACACCGTCAAGCAGGACACAAAGGCAAATA

TTACCGATGTTTGTATTGCTGTCCCACCTTGGTACACCGAAGAACAACGTTA

CAACATTGCTGATGCTGCTAGAATTGCTGGTTTGAACCCTGTTAGAATTGTC

AACGACGTTACTGCTGCCGGTGTTTCTTACGGTATCTTCAAGACTGATTTGC

CTGAAGGCGAAGAAAAGCCAAGAATTGTTGCCTTTGTTGATATTGGTCACTC

TTCCTACACCTGTTCTATCATGGCCTTCAAGAAGGGTCAATTGAAAGTCTTA

GGAACTGCCTGCGACAAGCATTTTGGTGGTAGGGACTTCGATTTGGCTATAA

CAGAACATTTCGCCGATGAGTTCAAAACTAAATACAAGATTGACATCAGAG

AAAATCCAAAGGCTTACAACAGAATTCTAACTGCTGCTGAAAAGTTGAAGA

AAGTTTTGTCTGCTAATACTAATGCCCCATTCTCTGTTGAATCCGTCATGAA

CGACGTTGATGTTTCCTCTCAATTATCTCGTGAAGAATTAGAAGAATTGGTC

AAGCCATTGTTGGAACGTGTTACTGAACCAGTTACCAAAGCTTTAGCTCAA

GCCAAATTATCTGCTGAAGAAGTTGATTTTGTTGAAATTATTGGTGGTACTA

CTCGTATCCCAACATTGAAACAATCCATTTCTGAAGCCTTCGGCAAGCCATT

GTCCACCACTTTGAACCAAGATGAAGCCATCGCCAAGGGTGCCGCCTTTAT

TTGCGCCATTCACTCTCCAACTCTAAGAGTTAGACCATTCAAGTTTGAGGAT

ATCCATCCTTACTCTGTCTCTTACTCTTGGGACAAGCAAGTTGAGGACGAAG

ACCACATGGAAGTTTTCCCAGCTGGTTCATCCTTCCCATCTACTAAATTGAT

CACTTTGAACCGTACGGGTGACTTTTCAATGGCTGCTAGCTACACTGACATC

ACACAGTTACCACCAAACACTCCAGAACAAATCGCTAACTGGGAGATCACT

GGTGTTCAATTACCAGAAGGTCAAGACTCTGTTCCTGTTAAGTTAAAGTTGA

GATGCGACCCCTCTGGTTTACACACAATTGAAGAGGCTTACACTATTGAAG

ATATTGAAGTTGAAGAACCTATTCCATTACCAGAAGATGCTCCAGAAGATG

CTGAGCAAGAATTTAAGAAGGTTACTAAAACTGTAAAGAAGGATGACTTAA

CCATCGTTGCACACACCTTTGGCCTAGACGCTAAAAAGTTGAATGAATTAA

TTGAAAAAGAAAATGAAATGCTTGCTCAAGATAAGCTAGTTGCTGAGACAG

AAGACCGTAAGAACACTCTTGAAGAGTACATCTACACATTGCGTGGTAAGT

TGGAAGAAGAGTATGCTCCATTTGCTTCCGATGCTGAAAAGACGAAGTTAC

AAGGTATGTTAAACAAGGCCGAAGAGTGGTTATACGATGAAGGTTTCGATT

CCATCAAAGCTAAGTACATTGCCAAATACGAAGAATTGGCTTCTCTAGGTA

ACATTATTAGAGGTAGATACTTGGCTAAAGAAGAAGAAAAGAAGCAAGCTA

TAAGATCTAAGCAAGAAGCATCCCAAATGGCTGCTATGGCTGAAAAGTTGG

CTGCTCAAAGAAAGGCAGAAGCTGAAAAGAAGGAAGAAAAGAAGGACACT

GAAGGTGATGTTGACATGGACTAA

By screening a P. pastoris genome database (ERGO™, IG-66, Integrated Genomics) with the nucleotide sequences of the secretion helper factors isolated from Saccharomyces cerevisiae (SEQ ID NO 32 to SEQ ID NO 41 ) homologous nucleotide sequences in Pichia pastoris have been identified and are shown in Table 4 below.

Table 4: Homologous Pichia pastoris nucleotide sequences (SEQ ID NO 42 to SEQ ID NO 51 ) and respective ERGO™ database information

BMH2 (SEQ ID NO 42); RPPA07190 - Pichia pastoris (IG-66)

ATGTCAAGAGAAGATTCTGTTTATTTAGCAAAACTAGCTGAGCAAGCTGAGC

GTTATGAGGAGATGGTCGAGAACATGAAGACCGTCGCCTCTTCCGGCTTAGA

GTTGTCTGTCGAAGAGAGAAACTTGCTTTCTGTTGCATACAAAAACGTAATTG

GAGCTAGAAGAGCTTCTTGGAGAATCGTCTCCTCAATTGAACAGAAAGAGGA

AGCCAAGGGTAACCAATCACAAGTGTCTTTGATCAGAGAATACCGCTCCAAG

ATTGAGACCGAATTGGCCAACATTTGTGAGGATATTTTGTCTGTTTTGAGTGA

GCACCTTATTCCTTCTGCCAGAACTGGCGAATCCAAGGTCTTCTACTTTAAGA

TGAAGGGTGATTACCACCGTTATTTGGCCGAATTCGCTGTTGGTGACAAGCG

AAAGGAAGCTGCTAATTTGTCATTGGAGGCTTACAAGTCTGCCTCTGACGTT

GCTGTTACGGAGCTACCTCCAACTCATCCAATTAGATTGGGTCTGGCTCTGAAl

TTTCTCAGTCTTCTACTACGAGATTCTAAACTCTCCTGACCGCGCCTGTCATT

TAGCCAAGCAAGCTTTCGACGATGCTATTGCTGAGTTAGAAACCCTATCTGA

AGAATCTTACAAAGACTCCACTTTGATTATGCAACTGCTGCGTGACAACTTG

ACTTTGTGGACCTCAGACATGTCTGAAACTGGACAAGAAGAGTCATCCAATA

GCCAAGATAAGACAGAAGCTGCTCCCAAAGATGAAGAGTGAATA

BFR2 (SEQ IN NO 43); RPPA04523- Pichia pastoris (IG-66)

ATGGCTAGAAAGACATTGGCTGAAACATTGGCAGAATTGTCTCAACCAGCG

TCTGGAGATTTTGATATAGAAGACCAAGAAGGAGGAGCAGTACTTGACTAT

GGAGATAATAGTTCTTTTGGCTCCGAGAGTGAAGAGGATAAAAGTAACCAC

TATGTTAAAGTTGGCAAGTCAAGGATAAGAGAGAACGCAGTTAAATTGGGA

GGACAATACGAGGGAAAAAAGAGTAGTAGAGCCGATGTTTTTGGAGACGA

GGACGATGAGGAGGAGGACGATGAGGATGTTGAACATTCGGAAACTGAAG

ATGCACTTTCGGTTTCAGGATCAGAGTCCGAATCGGATGAAAAAAATAGTG

ATCAAAGCCAAGGTGATTCTGAGAGTGAAGAAGAATCTAACTCAGGTGAAG

ATCTAGACTACAAGAGATCAAAACTACAGCAACTTATAAGCTCCGAAAGGA

AAACCATTGTAAACCAATTATCAACTTCCAATAAACAAGATGCACTGAAAG

GGTTTGCAGTGTTGAATCAGCAGATACAGTATGATCAATTGGTTGACCTCAG

AATAAAATTACAGAAAGGATTAGTAGCATCGAATGGTCTACCCATTAACAA

AGAATATTACGAACAGAATAAAGCACCAAAGTCTTCCAAACACCTGGATAA

GCTACAAGATAAACTATACAATTTATTGGATGTCACTTTAGAACTGAGAGG

CAAGCTATTAAACAAAAGCAAGATTGTGAGCCAAGAGTTTCCCCCTATTCC

AAGTAAGAAACGTAGTTTACAGCATTATTTGGAGGAATCTTCCAAGTTGGAT

AACATAGTTAATGAATATAGAAGGAACGTCCTCGTTAAATGGTCTCAAAAA

GTCCAAAATGCTTCCGGAGCAACTGCTTTGAGCTCATCCAAATTCAAGGCT

ATTAACCAAGATAGTTCGACTCAAGTGGACAACTATTTGGCAGACATGGAT

AGATTAATCAAAAGAACCAGACTCAACAGAAGAAGCGTAGTGCCATTAGG

ATACACCGAGACAGAAGAAGTAGTAGATGATGATGAATTGATCGACAACGA

TAAAGATAACAATGAGACCAAATACTTCAGCAACATTGACCGATCTTTGAA

GGAAAACAAATATATCTATGATGATGACGATTTCTATAGAGTTCTTCTGAAC

GATCTAGTCGATAAGAAAGTTTCTGATACACAGAAGCTGACATCTACATCA

ACTGTTATTACATTTTCGAAATCCAAATTGCATAAAAGTTATGAAAGAAAAG

CGACTAAGGGTCGTAAGCTGAGGTATACAGTTCAAGATCCATTATTGAATTT

TGAAGCCTCCAACCCACATGCCTACAAGTGGAACGACTACCAAATTGACGA

GTTTTTTGCGTCATTATTTGGGCAAAAGGTCAACATGAACGAGGATGAGCAT

AACGAAGAGGTACAAGGTGAATCAGAAGGAGAGGACATTTTGAAGGATGA

TATCAAACTGTTTGGATAA

COG6 (SEQ ID NO 44); RPPA07651 - Pichia pastoris (IG-66)

ATGGACTTTGTATATGAGTACTCAGATGCTACCCCTAGTGGCACATTTGATG

ACCCATTGCCTGCAGAGCCCGAACCACCATTCAATTTGTCAAACTTAAACT

CGTACAAAGATGATTTGACTAAAAAATTCTCCAAAATGAGCATTCTGAAAA

GTCTGAAAAATGACACCAATTCAGTTGACGATGTCGACGACTCACAATCGA

TCTCCAATGACGGGCAGAGGGCTTATAAATACGCCAATCAGTCTCTGGATC

TGGTTAACCAGCACACCACTAATAAATCAATCAGAACCACCAGCGATGAAC

AACCTTCGGTGTCCACTGTTTTGAGCAACAGACTGAGCAGAGTGCTCAATA

ATACTAATTACGACCCTTCAACCAAGGAACTACTCTCCATTGTGGAGAAGA

AAATAAAAGAAGATACGGCGCATGAATACGACAAAGTTACTGACCCAAGTT

TTGTTGGAAACCTTGCTAGAAGAAAGTTGCGTAACGACATTGAACATGATG

TTGTAGATGCCAACTTCAATTTCTTGAAACAATTGCAACCCTTAAGAAAGAC

CTTGGGCCAGATTGAAAGTGACTTGAATGAAATGAACGAGCTCAACAATCA

AATCACTGAAAAGTTGTCCTCTAGAGTTGAAGATACCACTAGGTTGGATAA

TTCCATACACGAGTTGCATGCAACTTCCAAGATTATTTCCATCAAAAAGAA

GCTTTTGCAGAATTTCCAGAACCGCTATACTCTCTCCCATTTCGAAGCACAT

CAATTAGAGTTTGGTGAAATTGACGATTCCTTTCTAGAAATACTGAAAAAAG

CTGAGTCAATTCATGATGATTGTTCAATTTTGTTAACCATGGAGAATGCTAC

TGTGGGTATTAATATTATGAACGACATGAAAAAGCTTTCCAATAATGCCATC

GACAGATTGTCGACATTTGTCACCAAACATTTTTCTAGGTTAAGTTCGTCCA

ACAATACCTCCGCCTCCATAGAGGATAAGGCATTCCTGAAAAGATCTATAC

TCTTCATTTCCGAAAGATACCCGGAGCAGCTCTCTGGAATCACCAACCAAA

TAGTCGAATCAAGGTCAAAGTCTTTGCTTGACGAGTTCCAAATACAATTGAA

TGGTTATGCAGATTCAGCATCAAGAAACGAAAGGGATGTTAATAAACCATT

GTTCCTTTCCGCATATGATTCAGAAAGGTTTCTCGGAGATTTACTTGCTTAT

ATTCATGGCACAATTGTTAATGAAAGAGAAACCGTCGAAAGCTTGTTCAGT

TTGCAAGATGAGGAGAAAGATAATATCGTTTTGACAACACTCGTAGAGTCA

ATTGTTTCAAAGAACATCGAATCTCTAGCTACCCCCCTGAATTTAAAGATTG

AACAGATCATCAGAAATGAGTCTAAGCTGACAGCGATCCAAGCTTTTTATG

ACCTGCTCTCACTTTATTCCATGATGCTTGAAAAAACTTTAGGTTCTAAGAA

TGCCCTTTTGAGTACAATCAACTCTTTAAAAGTTTCGGCTTTGGGTAAGATT

CAAAGTTCAATCAACATTAAACTTAAAAACATAGAGCGGACTGCCAATGAG

AGCATGTCATATTATAATGAAGATGAACAACTAGATGGTACAAACCACAAC

TTTGTTTCAGAAACTCATTACATTGAAGAAATCACACCTGAGCTAGCTGTGC

CTGATTGGTTGATCAATTTCTATGGTGACGTACTTCCCATCTTTGATAATGA

AAAGGTGACAAATGCCAAAGAACTGTATGAGGATTTACTCAAATACTGTTT

TGAACAAATCATTCAACTTATCGAGAAACAAATAGCTCAGAATAAATTGAA

TGATGCTAGAGAGATATTGATTTTCAAATCAAACTGTTACGATTTTGTTTATT

CCAAAATTGTGACCCTGAACATCTTTAAGGAGAAACTGGATCGATTAGAGG

TAATGATAAAGGAATGCGAATCAAAATTGACCGAAATTCAGTACACTTATC

TTCTCAAACAATCAGGGTTATATGATATTCACAACCTTGTCAACATGATATC

CTCAACTAGGGAAGATTTCTTTGACGTCTCCGTTTATGAACCAATTACGGAG

AACTCACTATTCAATGGTGACAAATTCAAAGAAATATCAGATCGCCTTCAA

GATTTTCTTCCAATTGCATTAATTGATTACCAAGAGGAGCGATTGTTGTATC

TATTACCTCCCACGCTTGTTAACTCTATCATTCAAAACTCCTCTGTGGATTTT

GTCAACTTTTATTTCAAATTATCGTTGATCGTGAAGGAATATTTGAAAGCCA

GTGAAGGATGTCTCAGATGGGATGACATGGAGGT

COY1 partial (SEQ ID NO 45); RPPA05747 - Pichia pastoris (IG-66)

AAAAGTTAAGTAATGAGTTGGTTAGCTACAGATCGATAACAAGAGGACATGG

TTAATTCAATTCAGGAACTGGAAGAGAAGCTTGCGTTTTCTCAAAAGCAAGTA

GAGCAGTTACAGCAGTTAAACCAGGATTTGGAGAAGGAGACTAGTGTGGAAA

AATGGGATGCAATTTCAATGATTTCTGCCAGGCCGGACACTTCAATACAGGAC

AATTCGTTGATTACAATGGTATCACAACAAAGAGATCGATTTAAGCAAAGGAA

CAAAGATCTTGAAAAAGACGTTAGATTACAATTGAACAAAATTTCTGAGCTTC

AAAGAAAGGTCCAATCACTTTCTTCAGACAATAATCAATTATACGAGAGAATC

AGGTTTTTGTCATCCTATGACAGTAATAAGAATCAGTCCAAGGAGAGCCAGTC

GGAAGAGTACTACAAGAGAAGTTACGAAGACAAATTGCATCCGATAGAACAA

TTTAGCATATTGCATCAGCCTGCACTTAATTGTGATGTCAATGACGATGTATGT

GATGAATATCCATAATGA

RPPA04443 - Pichia pastoris (IG-66)

AATTCATACCGTTGTCACGTAATCGCGGGGGTAGTGTGCATCGCATCGTATT

GGAGACATCTGTCTGTTTTCTTCCCTCACATCGAAATACAACTTCACCATGA

CTGACGCTGACTTCCAAGTAGTATTTGAAGCGTGGCAAGCTGTCGATCTAC

AGGGTGTTAAGAAGCTTGTAGATGATGAGGCAAAAGAGATTGAAAGTTCGA

AGTCTTCAAGTTTGGATCAAAGAAAGCAGTTGAGTTTTAAGACGAAGGAGT

TCAAGAAATTGGACGATGAGCGTAAATTGACACAATGGAGGTCGTTGTTGA

AGGAGTATCAAAACTACATTGATGATTTGACCAAGGGAAATAATCGTGTTG

TACAGACATTCTTGGAATTGCATAAAGTAGTGGTGGATTTGAAGGATCCTAC

AAGTACTTTGAGCAAGGAGCAAGAGACGAATACCGAATTACAGAAAGCTGT

GAAAAAACTTTCCACAGAACTGAGGCATTCAGAACAACACTGGGCTTCCGA

GAAGAAAGGATTGGAAGAAAAATTTAACGTACGTAAAAGGGAAACGGAAG

AGAAAAGTCTTGATCAGATTAAGACAGCCCAAACTGAAATAGTCCAATTGA

GGGATGAGCTGAAGCAAAAATCTTCAGAAAATGAAGAGCTTCAAGTGGTGA

TTGAAACCCTTGATGCCAAGCTAAAAAAGAACAGCCAAGGACAAAATAAT

GATGATACATACTCCAATTATGACATGTTAAATAGAGATTTGGAGTCCAATA

AACTAAAGATCCTTGAATTGGAAAGGTTGAACAACTCTCTAAAGGAGGAAT

TAGCAAAGAAAGATGACAAAGCCTACCAGGAGAGGGTCACCGAACTCGAA

AAGGAGAGTGTGGAGTATCTCTCTTAAAG

CUP5 partial (SEQ ID NO 46); RPPA09067 - Pichia pastoris (IG-66)

GTCCTGTTTATGCTCCATTCTTTGGATCCATTGGTTGTGCTGCGGCCATCATC

TTTACCTGTTTTGGTGCCGCCTATGGTACTGCTAAGTCGGGTGTAGGTATTT

GTGCCACCTGTGTCTTGCGTCCAGACTTACTGATCAAGAATACAGTGCCTGT

TATTATGGCTGGTATCATTGCTATTTATGGGTTGGTGGTGTCTGTGTTGATCT

CTTCATCGTTGCAACAGAAGCAGGCTTTGTATACTGGCTTTATCCAATTGGG

TGCCGGTTTATCAGTTGGTCTGTCAGGTCTGGCTGCTGGTTTTGCCATCGGA

ATTGTTGGTGATGCTGGTGTCAGAGGTACTGCTCAACAGCCAAGACTTTTCG

TCGGTATGATTCTGATTTTGATTTTTGCTGAAGTTTTGGGTCTTTACGGTCTG

ATTGTTGCTCTTCTACTGAACTCTAGAGCTTCCCAAGATGTCACTTGTTAAA

GC

IMHI (SEQ ID NO 47); RPPA04985 - Pichia pastoris (IG-66)

ATGTTCTCAAAACTTTCCCAGTTATCCCAGAATTTAGGCGAAGAGCTCTCTA

GGATTAATGAGGAGGTTGCTGCCTCTAGAAGGAACCAACTAAAGAAAAGCA

GGGATTCGGAGAGGGATACAAAGTTCCTTAACATCAAAACTCCCGATCCTG

AAGCTCTACAACAACCGGGTCATGAGGTGAATGAAGGCGCTGAAACCGAA

ACAGATGCTACTGAGTCAAAGGGCCAAGTGGTTCCAAACACAAATATACAC

TTCAATGATCTGCCTATGGAGATTAGGGCCCGCTTGAAAAAGTTTGCGAAAT

ATGAGCAGAAATATCCGTTGTTGTTGGACGCTTACAAAACTGAGAAGGCCA

AATCTGAAATAGTTCATGCTTTTGAATCAACTTTACAAGAAGTCACTCCTTT

GCAGACAATTGGAGAAATTGAACAATTCAAAGACTTTATCAGCAATATGAC

CCAAAAGGCTAAATTAATGGATGAAGAATTGAGAGCCAAAACTGGCGAGTT

GAATGGCCTAAAAAACGAAGTGACAGAAATGAAAGAGAAATTGAAGGCTG

TTCAAGGTGAGATGAAAGCCAAGTCTGCTTTAGCAGAAGAATCTGCGATGA

AAGCCGATCAACTTAGTGTGGATCTTGAACGAGTTGTGAGTGAGCTAGAAA

ATTTGAAAAAGGAAAGGGAAGAGATTGTTACGGAGCGTGATGAGGCAACC

AAGGAACGTGACGAGTCAACAAGAGAAAGAGATATCATTCTAGAAGAAGT

CAAATCTAATAAAAATCAAGAACTTTTGGAGGAATACAAGTCTGAGTTAGA

AGAAGCAAAGAACGCTCTTGCATTGAGAACTGAGGAGATTGAAAATCTAAA

CTTGAAGTTGGAGTCTGAAAAGTCGGCAAAGTTATCATTAGAGGGTGTAGC

AGATGAGCGAGATGGCCTTAAAGCAAAGTTAGAAGCGCAAACAACTTCCTT

CCAGGAAGAATTAGACCAACTTTCTCAAGAACGGGATCGTTTGAATTCTCA

ACTAACAATAGGAGAAAAGTCACAGATAGAAATTGAGCAAGAAAAAAATG

AGCTCAAGAGTCAGTACAATTCTGAGATTAAGTCATCGCTTAGTAAACTGG

AATCCGTAATTAAAGAAAGAAATGAGCTACAACAGCAATTGGAATCTCAAG

AGTCTTTAACTTTCGAAGTGGATAAGCTCTCTAAAGAGAGAGATGAGCTAA

GGATGCAGTTGGATAGAGAAAAAGAAAATTCTGCAAAGGCTAGCATCACGC

CCCAGAACTTTGAGGTTAAAACTAAAGTTGAATCCAACAAGAACATTGAAG

CACCTTTATCTGAAGAACTCAGGCAAGTCACCAGAGAAAGAGATGAGCTAA

AAGCTCAGTTGTTGCTCATTCAAAAGAACCCAGGACCAAGTAAGAAGACCA

ATGAAGGAAACCGTAACTTAGAGCGGAACGGTGAAAAAAAGTCCCATGAC

CAAAATGGTGCTGACGATGATCTTATCGAACGGAAGGGTGAAAGCGGTACA

GATGATTCAAAGGATAATCAGCTGAAACTAATAGAGCAGTCAACCACTATT

ACGATGTTGAATGAAGAGATTGAAAACTTGAAAGATATGCTGCGTGACGTC

GGAGACGATCTTGTAGTGGCTAAAGACAAGCTTTCACAGGTCTCTGCTGTTG

ATGAAAAAAAACAGCACGCTCTTGAAAGGGAACTGGAATCCTCAAAGTTGA

AGCTTGCTGAGATTGAAAAGGATTACAACGACGATAGAGTTGACCTCAAGA

ATGAATTGAAACTCGTTACCGAGGAAAAGGAAAATCTTGAACATGAGAATG

AAACATTGAGTCAAAGCTTGACCGAGCTTGAAAAGCTGAAACAAGAAGTCA

AGGAGAAAGCTCAAGCCGTTCAGAATTATGAATCTAAGTATTCGACACTAT

CGGATGAATTATCTTTAGTCCTCTCCAAACGAGATGAATTGGAAAAAGACA

AGGAAAGTTTCAGGCTGAAATTGAAAGATTTGGAACAGAAAAATTCCGAGA

CTGAACAACAAGAGGAATCGCAGAGAACTGGAACTGCAGAAATGGAAGAG

GAGTTGCAGACTTTGAAAAGGGAGCTAGAAGCAAGCTCCCAGCGCATTGAG

GATTATAAGCAAAAGCAACTAGAACTAGATGATGAAATATCTCTTGTACGT

TCTCAAAAAGATGAATTACAGAAAACGATAACTAGCCTGCAAGAAGACCTG

GGGAAAGAGAAAGAGAATGTTAAGTTACTCCGTGAAAATATCATTGCCGAA

GAAAAAGCCAAAAATTCCCAAAAACTGGCAGAAAATGTGGCGCAGCTAGA

CAAATTCAAAAAGCAGGAGATATCACTCAAGCTTGAGATTGCAAACCTTGA

AAATCTAAATGCTGAAAAAGGCTCAAAGATCAAAAGTTTGGAAGAGCATAT

CACTTATCTTAATACAGAGAGGCAATCTAATTATGATGAGAATCAGAAACT

GATCTCCCATACTAATGAGAAATGGAAGCAGGATTACACGGAGTTAGTCAT

AAAATTGAACAAGTGTCAGTCAGAGAACAATAGACTTACAAAAGAATTGAA

CGAATACAAAGATAAACTAAAAGATATGAACACCTCAAAGCTGAACAGTA

GCGAGACAATCGAGTCAATCAGAAGACAATGCGAAGAACTTAAAATGATG

AATAATGAATATTCTTTGAAGATTGAAAGTCTACATGAAGAACTAAGTTCTT

CGAGTTCAATTTTACAGGAACGTTCCAGAGAAATGAATACTATACGTAAAC

TGCTAGCTGATACTGAGTCCAAATGTGACGAAAGAATCAAACAGTTAAAAG

CAAGAATTGATAGGTTAGAAGAAGAAAAGGAGACGACTAGCCATGAAAGC

TCTGTCCAGGCAAGAAAGCTGAGTAAAACAATCGACCAGTTAAAGAAAGG

CAAGAATGAATTGTCAGTGCAGCTAGAACAATGTAAGCTAGAGCTGGAACA

TCTGAAATCCGTCCCATCTAGAGTGGATGTTGACAATAAAAACGGTGCTTC

AAATGAAAACAGTGATGAAAACCAATCTGATATTGAATCTGGAATTATCGA

ACAGCTCAGAAACTCGCTAAAGGGATATGAAGAACAACTAAAACAATACC

AAGATTCCAACGTCTTACTCAAGAAGGTTAACGAAGAGCAGTTGCTGAAGT

TCGAGAGACTGCATTCAAATTTCAAGATTGTATCTAAACAATATAGAATGCT

GAAAGATCAAAAGGACGAAGTCAATACGAGAAGTAGAAACAATTCAGTTA

TAAGTTCAACGAGCGCGGGGAGTGATGAAAATGAGAGAGATAAAGTTGCCT

ATATTAAGAACGTCCTTCTAGGATTTTTGGAACACAAAGATCAACGAGCTAT

GCTTTTTCCTGTAGTGAAGATGCTACTTATGCTGGACGATGATGAAGAGAGA

AGGT

KIN2 (SEQ ID NO 48); RPPA04639 - Pichia pastoris (IG-66)

ATGGATAGAGAACAGGGTATTCTGCCACAGGATCCCTTCTCCAACTCGGTG

CATGTACCAAAGTTGAGAGCTTCTTCTGGTGGCCAGCCACAGAAGCCTGTA

ATACAAAATTCTGCTCCTGCTACTGCTAGGATGCTTCGCAATGCAAGTTCAA

GTACGTCAGCAGCTTTGTTGAAAGAATTAAACACACATGAACACTCTCAAC

GTCAACATACTCCACAGAAACAACCATCATTGGATGCCCCGGCAGCATTGG

TTCCAGTTGAATCTGCCACAAAACAATTCCACCGAACCTCCATTGGAGACT

GGGAATTTAGTAATACAATTGGGGCAGGCTCGATGGGTAAGGTCAAAGTCG

CCAAACATAGAGTCACTCACGAGGTATGTGCCATCAAAATAGTCATTAGGT

CAGCCAAAATCTGGCAGAGAAATCACCAAAACGATCCAGAACCTGAAACT

GAAGAAAAAAGAAAGAAGCTGCGTGATGAATACAAGAAGGAATTGGAACG

CGATGAACGTACTGTCAGAGAGGCAGCACTAGGAAAAATAATGTACCACCC

AAATATTTGTCGGTTGTTCGAATGCTATACAATGTCTAATCACTACTACATG

CTTTTTGAAATAGTCCAGGGGGTACAGTTACTGGATTATATTGTTTCTCATG

GCAAATTGAAGGAAACACGCGTTCGCCAGTTTGCCAGAAGCATTGCTTCTG

CTTTAGATTACTGCCATTCTAATAACATCGTTCACAGAGATCTGAAAATTGA

AAACATAATGATTAACAAACAGGGTGAAATCAAGTTGATTGACTTTGGCCT

TTCCAACATGTATGATAGAAGAAATCTCCTGAAAACCTTTTGCGGCTCCCTA

TACTTTGCAGCACCGGAGCTTTTGTCTTGCCGTCCTTACATTGGTCCTGAAA

TTGATGTCTGGTCTTTTGGGGTTGTATTATTTGTCCTTGTTTCCGGTAAGGTT

CCCTTTGATGACGACAGCGTGCCAAAGCTTCATGCTAAAATCAAAAGAGGA

AAAGTTGAGTATCCTGAGTTTATTTCACCTTTATGTCATTCATTGCTATCTCA

GATGTTAGTCGTTAATCCAGATCATAGAGTCACTTTGAAAGCTGCAATGGA

GCACCCTTGGATGACCTTAGGATTTGCAGGGCCTCCATCAAACTATCTCCCT

CAGCGGTCACCTATTGTATTACCGTTGGATTTAAGTGTAGTAAGAGAGATTG

CAAATCTGGGTTTAGGAAATGAAGAACAAATTGCTCGAGATATCACAAACC

TGATCTCGAGCAGAGAATATGAAGCGTGTGTTGAGAGGTGGAAACTTGATC

AACAGAAAGCTAATATCAAGGGCTATTCCGCGCGTGACGATTCTGCTATCA

TCGCCTTCCACCCGTTACTTTCAACGTACTACCTCGTGGATGAAATGAGGAA

GAGGAAGCTAGCAAAAGGTGCTCTCAAGGGACAGACCTCGGTATTAGACAC

TGTCAAGGTGTCTCCAGACATTCCAAAGACACCAGCTATTCCCCAGAAACT

AGAAACTACGGATGTGGAACAGCCATTGCTTGCCACTGTCCCACCTGCTTA

TACATCTCCGCATGGACAGCCAGCTGAACTGGAAGCGATGATTGAACCGGC

ACAGCCATTATCTAGTGCTCATCCTTTCGAGATGGATATGACGCAGCAACA

ACATGCTAGCAGAAAGACCCATATCAAGCATGCTCCAGAACGACAAGATCG

TGGCGGCTATAATGTACACAAGAATAACTCTGGTGGTCTTAACTCTTTATTC

CGAAGACTCAGTGGAAAACGACCCCATAAGAATGAGGCTGAATGGGAGCC

TTCATCTCCCCCACCTCAAGTTCATCCATTTTCAGTTAATGATGCGGACAGG

ACTTCAGTACGTGGCGTTTCACCAATTACTCAACCAGCTGCTGTGAAGAATG

TGACCTCCAATAACTCCAAAAACTACCTGGACCCTGTTGATGATAGTAAATT

AGTTCGTCGTGTAGGAAGTTTGAGAATTACCAACAAAGAAAAGCAACAAGT

GACATCTGACTTTCCCCGACTGCCCAATTTTACGATTCCAGAGCAACCGCCT

AAGAATGCTCCCATACCGATACATGCCCAACCTACCACTACAGGTACAACC

TTTCAATCCAATGATCATGAAATCAAAAAGAAGTTACAGGCTTCGACTAGT

CCAAACGAACAACGTGGGCCTCCAACATTGGCTCCTAGTCAACAGAGACGG

CTACATCCCACTGCGAGAGCCAAGTCACTTGGCCATTCTCGCAAGCAATCG

CTTAATTTCAAATTCGGAGGACCAGCAAACAATCAATTACCTGCGTTGCCTA

CTAAAGAAAATTATGATGTGTTTGAAGATGCCCAAATTACCGATAACAATTT

ATTAAACCCAGAAGGGAAATACTCTGCTAATACTAACGTGCATATCAAACC

AATGACAGAATCCCAAATTTTATTTGAGGCAGAACATGCTCCACCTGGAAC

TATGCCCTCAGTTGAGTACCCCAGGACCTTGTTTCTCAAAGGATTTTTCTCT

GTTCAGACTACATCCTCGAAGCCGTTACCTGTTATTCGATACAACATTATAG

CAGCTCTCTGCAAACTTAACATTCAATTCACTGAAGTTAACGGTGGGTTTGT

TTGCGTTTACAGAAAAACTGAAAATTTACAAATTGGGGATATCAGATCTCC

AGTTATAGAGTCAAGAGTGACCGATGACACTGACTCCGATGTTGCAAACTC

TTCCAAATTGTCATCTTCGTCAACAGCCAATACCAGAGTCAATGTTATTGAG

GATGATAGTTCATCGCCGTCCTCAGCAAGATTGAAACATCGCCGAAAGTTT

TCTCTTGGAAACGGAATCCTTAACCATATAAGGAAACCCACGCTTGACGGG

ACAGAATTTGATGACTACGATGCAACCGTAAATACCCCTGTTACTCCTGCA

CCTGCAAATGTTCATTCTCGTTCATCGTCTTATCATACCGAGAGTGATAATG

AGTCCATGGAGTCGCTGCATGATATAAGAGGTGGCAGTGATATGATCTTGA

AAAATGTTCCAGAAAGAAATGCTAGACAGATAGACACAGTCAAGGAAGAG

GAAACAGATGATGATGATCTTGGTAGTATCAACGAAGGATCAACACACCGT

ACACCTTTGAAATTTGAAATTCATATTGTCAAAGTCCCTCTGGTTGGACTAT

ATGGTGTGAGGTTCAAGAAAATTCTGGGAAATGCTTGGATTTACAAAAGGT

TGGCGTCAAAGCTGCTACAAGAATTGAATTTATAGTTC

SEC31 partial (SEQ ID NO 49); RPPA06211 - Pichia pastoris (IG-66)

ATGGTGAAAATAAGTGAAATAAAAAGTACTTCAACATTTGCATGGTCGTCTGT

AGACTCTAATGTCTTGGCTACAGGGACCTTGGCTGGGGCTGTTGACGACTCAT

TCTCTACCACTTCGTCATTGGAACTTTGGGATGTCCTGAACACCTCAGCTCCCA

TATTCCGAACCAATGTTGGTGCAAGATTTCATGATCTTGCGTGGAGTAATCCA

ATCTCTAAGTACCAGAGAGGACTACTTGCAGGTGCTTTTGATAATGGAACAAT

TCAATTGTGGGATTCCTCATCATTGCTGAATGGATCATCTGACAGTTTAATAGA

GCTAAAGAAACACACTGCGCCTGTTAAAACAATATCTTTCAATCCTACAGAGT

CACAGATATTTGCATCTGGTGCTTCCAATGGCCAATTATTCATTTGGGATATAA

ATCATCTTTCAGAGCCTATTTCACCGGGTGCTTCTACTACCCCTATTAATGACA

TAAACTCCATTGCTTGGAACTCCAAGATACGTCATATTTTGGCCTCTGCTGGA

ACCTCAGGCTACGCATCCATTTGGGATTTAAAGACCAAGAAAGAACTATTGAA

CTTGAGTTACACTGCTCCATCAGGTCAAAGAGCTAACTTAAGCACCGTTGCAT

GGCATCCTACTAATTCGACAAGTGTAATAACAGCTTCTGATTCGGACGCTGTA

CCATTGATAATGACTTGGGATTTAAGGAATACTAATGTACCTGTAGCTACTCTT

GAAGGTCATCAAAAGGGTGTATTGTCCCTGGATTGGTGTTCGTGGGACTCAGA

ACTATTACTTTCTTCTGGAAAGGATAACTCTACCCTATTGTGGAATCCCATCAG

AGGCTCTTTGTTAGCGGAATACCCAACCACCACTAATTGGGCCTTCAAGACCC

GCTTTTCTTCCAAGCTTCCTGACATTTTTGCAACCAGTTCATTTGATGGTAAGA

TAACGGTGCAGACCTTACAGGATACTACACCTGCAGAGGCTCAACAAGCAAA

AGCTATCAACGATGACGAATTCTGGGCAGACCTGTCCAACAGCGATAAGAAA

CATCCTAATTTTTTACAACGTCAAACTCCGGCCTGGCTTAAAGTGCCTTCCAG

CGTTTCATTTGGATTTGGTGGAAAGATTGTAAAAGTTTCCAAGGCCTCTGATAA

CCAGTCAATTGTTGTAATTGATAATTTCACAACTAACGATACGCTGGCCAAGT

CCACTTCCCTTCTTGCAAGCACCATCAGCACAAACGATTACCAAACTCTTGTC

GACGAAAAACTTCGTACCGAAGCAAATAACCACGACTGGCAATTATTGAACG

ATCTATTAAAAGCGGATGATGTGAAAGATTATTTCAGATTTCAGATTGTGGAT

CCTTCCGTATTGAAACATGACAAGTCTGAACAAAAGGTTGAAAACGGACAAG

ACATATTTGAAAACATTGAGCAAACTGATGAAGACTTTTTCAACAATCTTGAA

AGGGAAAAGAATTCAGTTTCTGTCAACATTCCATCATACTCTCCAACTGCTCT

CAGCCAGGGACTAATCCAGGAGGCTCTAGTATTGGCTTTAGGTGCATCTGAAT

CATTACAGGCTAAAGTTAGGAATGCCTATTTTAACCAAACGCAAAAGTCCTCT

CTACCAAGATTGATTTACAGTGCTACTGCTAACGATGTTAATGACCTCGTTGCT

AATGGTACTATCTCTGGTTGGAGGGATATAGCAGCTGCTATTTTTGCTTACTCT

ACGGAGAAAGAAGAGTTTTCAAAGTTCATTGTGGAACTAGGTGATAGGCTATT

AGCCAGTTCCCTTTCAGATAGACGCTCTGATGCTCTGCTTTGCTTCCTTGCTGG

TGGTGCGCTCAACAAGGCGTCTACAATTTGGAATGCCGAGTTGAGTTCTCGTG

AAGAGGTTCTCAAATCTGAAAACCCTCAGCTCTCATCTTATGAAGCTCATAAT

ATTGTGTTGACTGAGTTTGTTGAAAAAATTGCCGCATTCAAGTATGCATTAAG

GATCAGCAATAAGTTCAGTGGGCAGGGCGTTAATACGTTGAACAATTCATTCC

TAGAGTTTGCTTCTTTGGTGTCATCTCAAGGGCAATTTGATTTGGCCTTGAACT

TATTGGAGAACTTATCTACCGAGGATGAAGACATTAAACTTGAGATAAAGCGA

ATCTCAACAGCATCAGGAAAAACTCTTTCCAGTA

RPPA07281 - Pichia pastoris (IG-66)

TCCTCCCNCGCTCACCCCTCCCGNCCTTCCCCTATCGCATTTCCAAAGGTCT

TCTCGTGGAGGCTCATTCTCCGTTCCACCTCCTAATCCATACGTAGGTAGCT

CAGTGAATGGGAATGGAGGAGTGCACGGAGGCGCACCAGCTATCCCTGTTG

CCAACAACCCTTATGCTAACAACAATCAAAATGCATCATATGGCCAAGCAA

ACGGTCCACTAAATGGATTTGTACCTCCGCCACCAATGCCTGAGAAAATGG

GAGGACTTTCCTCACAGAATTACCCCAAGAGAGCAGCAAGTAGAGCAAATA

GCACTGCTGGATATGCGCCATCACTAAGATCGCCCAGTGTGCAACAATTTC

AACCACCACCACCTCCGGCACTAGCTCAACATGTGCAACCGCCACCACCTC

CTGAACTAGTTCAGCAGGTACCTCCACCCGCGCCGTCTGTACAACACCAAG

TATCACAAGGATCTCAAGGATCTCAAGGTGGGCCCCCTGCACAACAACAGA

CCAGATTTCCCAGTGGAGATAGATCACATATAAGTGATGAGGCTTTCCCTAT

TTATGAGTACCTGAGTAAGGAGTTGGAAAATGTTAAGCCTAAGATTCCAGA

AAGATTTACCAAACAACTCGTAGACGCTGAGAAGAGATTGAATATCTTGTT

TGATCATTTGAATAACAATGAGCTGTTAACCGCTCCTACGATTACACTGTTG

TCTAATCTTTCAAAGTCTCTAGCTGACCATGACTTTAAGACTGCTGAATCGT

TACTGATTCAAATTACTACCATTCATAACAACGAGGCAGGAAACTGGAGCG

TTGGTGTGAAACGTCTTATCCAGATGTCCTCGGCTCTGAGTAGCTAAGAA

SSA4 (SEQ ID NO 50); RPPAI 0651 - Pichia pastoris (IG-66)

ATGGGTAAATCAATTGGAATTGATTTGGGTACCACATACTCTTGTGTGGCAC ATTTTGCTAATGATCGTGTTGAGATCATAGCTAACGACCAAGGTAACAGGA CGACTCCATCGTTCGTCGCCTTTACCGACACTGAAAGATTGATTGGTGATGC TGCAAAGAACCAAGCTGCCATGAATCCAGCTAACACTGTTTTCGATGCCAA

ACGTTTAATCGGTAGAAAATTCGACGACCCGGAAACTCAGGCCGATATTAA

GCACTTCCCTTTCAAAGTTATCAACAAGGGGGGAAAGCCTAATATCCAAGT

CGAATTTAAGGGTGAGACTAAGGTTTTCAGCCCCGAAGAGATTTCCTCCAT

GGTTCTAACAAAAATGAAGGATACTGCTGAGCAGTATTTGGGTGAGAAAAT

CAACGATGCAGTTGTCACTGTTCCTGCTTACTTCAATGACTCTCAAAGACAA

GCCACCAAGGATGCTGGTTTGATTGCTGGTTTGAACGTTCAAAGAATCATTA

ATGAGCCCACCGCTGCCGCAATTGCTTACGGGTTGGACAAGAAGGATGCAG

GCCACGGTGAGCACAACATTCTAATCTTCGATCTAGGTGGAGGAACTTTCG

ATGTTTCTCTACTATCTATTGATGAGGGTATTTTCGAAGTCAAGGCCACCGC

AGGTGACACCCACTTGGGTGGTGAGGACTTCGATAACAGATTAGTCAACCA

CTTTATCGCCGAGTTCAAGAGAAAGACCAAGAAAGATCTTTCTACAAACCA

GAGATCCCTTAGAAGACTAAGAACCGCTTGTGAGCGTGCAAAGAGAACTTT

GTCTTCTTCTGCTCAGACCTCCATCGAGATTGATTCTTTGTTCGAGGGTATC

GACTTCTACACCTCGATCACTAGAGCTAGATTCGAGGAGCTCTGTGCCGAC

TTGTTCAGATCCACCATCGAGCCTGTTGAGAGAGTCTTGAAAGACTCCAAG

TTGGACAAATCTCAAGTTCATGAGATTGTTTTGGTTGGTGGTTCTACCAGAA

TTCCAAAGGTTCAGAAATTAGTTTCTGACTTTTTCAATGGTAAGGAGCCAAA

CAAGTCCATCAACCCAGACGAAGCCGTTGCATATGGTGCTGCTGTCCAAGC

AGCTATTTTGTCTGGAGATACTTCTTCCAAGACACAAGACTTGTTATTGCTG

GATGTTGCTCCTCTATCTTTGGGTATTGAAACCGCTGGTGGTATCATGACCA

AGCTGATCCCAAGAAACTCCACAATCCCAGCCAAAAAGTCAGAAATCTTTT

CGACATATGCTGACAACCAACCAGGTGTTTTGATTCAAGTCTTTGAAGGTGA

GAGAACTAGAACCAAGGACAACAACCTGTTGGGTAAGTTTGAACTTTCTGG

TATTCCTCCTGCTCCAAGAGGTGTTCCTCAAATTGAGGTCACCTTCGATATG

GATGCCAACGGTATTTTGAATGTATCTGCTGTTGAGAAGGGTACCGGTAAG

ACTCAAAAGATTACTATTACCAACGATAAGGGAAGATTGTCCAAGGAAGAC

ATCGAGAGAATGGTTTCTGAAGCTGAAAAATTCAAGGATGAAGACGAGAAG

GAAGCCGAGAGAGTTGCTGCCAAGAATGGCTTGGAATCATATGCTTACTCT

CTGAAGAACTCTGCAGCTGAATCTGGATTCAAGGACAAGGTTGGAGAGGAT

GATCTTGCCAAGTTGAACAAGTCAGTTGAAGAGACAATATCTTGGTTAGAT

GAGTCACAATCTGCTTCCACAGACGAGTACAAGGACAGGCAAAAGGAATTG

GAAGAAGTTGCTAACCCAATAATGAGCAAGTTCTATGGAGCTGCTGGTGGA

GCTCCTGGTGGAGCTCCTGGTGGCTTCCCTGGAGGTTTCCCTGGCGGAGCTG

GCGCAGCTGGCGGTGCCCCAGGTGGTGCTGCCCCAGGCGGAGACAGCGGA

CCAACCGTGGAAGAAGTCGATTAA

SSE1 (SEQ ID NO 51); RPPAI 0049 - Pichia pastoris (IG-66)

ATGAGTGTTCCATTTGGAGTAGATCTAGGTAACAACAACACTGTGATCGGT

GTTGCCCGTAACAGAGGTATTGATATTCTTGTCAATGAAGTCTCTAATCGTC

AGACCCCCAGCATTGTCGGATTTGGCGCTAAGTCTAGAGCCATCGGGGAAT

CAGGAAAGACCCAACAGAACTCTAACTTGAAGAATACCGTTGAACATTTGG

TCCGTATTCTCGGGCTTCCTGCAGACTCTCCTGACTATGAAATTGAGAAGAA

GTTCTTCACTTCGCCCCTGATTGAGAAGGACAATGAGATCCTGTCTGAAGTT

AACTTCCAAGGTAAGAAGACTACCTTCACACCCATTCAGCTGGTTGCCATG

TACCTGAACAAGATTAAGAACACTGCCATAAAGGAAACAAAGGGAAAGTT

CACTGATATCTGTCTTGCTGTCCCTGTTTGGTTCACCGAGAAACAGAGAAGT

GCTGCTTCCGATGCTTGTAAGGTTGCTGGTCTGAACCCAGTTAGAATTGTCA

ACGACATCACAGCTGCTGCAGTTGGATATGGTGTCTTCAAGACTGACCTAC

CAGAGGATGAACCCAAGAAGGTTGCAATCGTTGATATAGGCCACTCTACCT

ATTCTGTTTTGATTGCTGCTTTCAAGAAAGGTGAGCTGAAAGTGTTAGGATC

TGCTTCTGACAAGCATTTCGGTGGTCGTGATTTCGACTATGCCATCACCAAG

CACTTTGCAGAGGAGTTCAAGAGCAAATACAAGATTGATATCACTCAAAAT

CCTAAGGCTTGGTCTCGTGTTTACACTGCTGCCGAAAGGTTGAAGAAGGTTT

TGTCCGCTAACACTACAGCTCCATTCAATGTTGAATCTGTTATGAACGACGT

TGATGTTTCTTCTTCGCTGACTAGAGAGGAGTTAGAAAAGCTGGTGCAACCA

TTATTAGACCGTGCTCATATTCCCGTTGAGCGTGCTCTGGCCATGGCAGGTC

TCAAGGCTGAAGATGTGGACACTGTTGAGGTTGTCGGAGGTTGTACTCGTGT

TCCAACCTTGAAAGCTACTCTATCTGAAGTCTTTGGAAAGCCCTTATCTTTC

ACTTTAAACCAAGATGAGGCAATTGCTCGTGGTGCAGCTTTCATCTGTGCAA

TGCACTCCCCTACACTTAGAGTTCGTCCATTCAAGTTTGAGGACGTTAACCC

TTACTCTGTGTCATATTATTGGGACAAAGATCCTGCCGCTGAGGACGATGAC

CACTTAGAGGTCTTCCCAGTGGGTGGTTCTTTCCCATCAACTAAGGTGATCA

CACTTTACCGTTCACAAGATTTCAACATTGAAGCCCGCTACACGGACAAGA

ATGCACTTCCAGCTGGCACTCAGGAGTTCATTGGCAGGTGGAGCATCAAGG

GTGTTGTTGTCAATGAAGGTGAAGATACTATCCAGACTAAGATTAAGCTGA

GAAATGATCCATCTGGTTTCCATATCGTCGAATCTGCTTACACAGTCGAGAA

GAAGACTATTCAAGAGCCAATCGAGGATCCAGAAGCTGATGAAGATGCAG

AACCTCAGTACAGGACAGTTGAGAAGCTCGTCAAAAAGAACGACTTGGAG

ATTACTGGACAGACACTCCACCTACCAGATGAGCTATTAAACTCTTATCTTG

AGACAGAGGCTGCCTTAGAGGTCCAAGACAAACTTGTTGCAGACACCGAGG

AGCGCAAGAACGCTCTGGAGGAGTACATTTACGAGCTTAGAGGTAAGTTGG

AAGACCAGTACAAGGAGTTTGCTAGCGAACAGGAAAAAACCAAGCTTACA

GCTAAGCTAGAGAAAGCTGAGGAATGGCTTTACGACGAAGGTTATGATTCT

ACTAAAGCTAAGTACATTGCTAAATACGAAGAGCTTGCCTCCATTGGAAAT

GTTATCCGAGGTCGTTATCTTGCCAAAGAGGAGGAGAAGAAACAAGCTATC

CGTGAAAAGGAAGAATCTAAGAAGGCTTCTGCTATCGCTGAAAAGATGGCT GCCGAGCGTGCTTCTCGTGAAGCTGCTGGTTCTACAAATGAACAAGCCCAG AAGAATGAAGAAAACACCAAAGATGCCGACGGTGATGTTTCTATGAACCAA GATGAGCTAGATTAAACT

Example 3: Cloning of the vector backbone of pPuzzle

For construction of the novel vector system pPuzzle a 2884bp fragment carrying an origin of replication and a selection marker for E. coli (AmpR cassette) was amplified from a common used cloning vector pBR322 (Fermentas Life Science, Germany, #SD0041 pBR322 DNA) by PCR. Two non- template coded Notl restrictions sites were added by using the forward primer pBR322_FOR_Notl and the backward primer pBR322_BACK_Notl. This PCR fragment was used as a shuttle supplying a temporary origin of replication and a selection marker for amplifying an artificial multiple cloning site in E. coli. A 244bp synthetic DNA fragment (synthesised and subcloned in the EcoRV site of the pUC57 plasmid by GeneScript Corp. Piscataway, NJ 08854 USA) was cut with Notl and ligated with the Notl and alkaline phosphatase treated shuttle fragment and amplified in E. coli. The resulting product was called pBR3227 2 artMCS. To generate pBR3227 2 artMCS_ORI, a 670bp fragment carrying the origin of replication from a commercial available cloning vector pUC19 (Fermentas Life Science Germany; #SD0061 pUC19 DNA; bases 812- 1481 ) was amplified by PCR using the forward primer pUC19ORI #1 -Sacl and the backward primer pUC19ORI #2-Sacl and cloned in the unique Sacl site of pBR322V 2 artMCS.

To generate the vector backbone of pPuzzle (see Fig. 1 ), the ampicillin resistance gene (PCR amplified from pUC 19 with primers ampR#1 Hindlll and ampR#2Hindlll) is cloned into the Hindlll restriction site of pBR3227 2 artMCS_ORI, the resulting plasmid is cut Notl and religated.

In a further cloning step the transcription terminator of the cytochrome c gene from S. cerevisiae (a 276 bp fragment of the 3 ' region of the Cytochrome c, isoform 1 CYC 1 gene from S. cerevisiae chromosome X bases 526663-

526937) was amplified by PCR (forward primer cyc1 TT_new_F0R_BamH1 and reverse primer cyd TT #2-Agel) for genomic DNA and inserted into the BamHI

and Agel (alkaline phosphatase treated) site of pBR3227 2 artMCS_ORI resulting in a vector called pBR3227 2 artMCS_ORI_cyc1 TT.

Example 4: Construction of a pPuzzle zeoR eGFP expression vector

The zeocin selection marker for E. coli and P. pastoris consists of the ORF of the Sh ble gene from Streptoalloteichus hindustanus under the control of the TEF1 (translational elongation factor 1 ) promoter from 5. cerevisiae and an artificial E. coli promoter sequence EM7. The Sh ble gene is flanked by a transcription terminator of the cytochrome c (CYC1 ) gene from S. cerevisiae. The TEF1 promoter (5 ' promoter region of TEF1 alpha of S. cerevisiae chromosome XVI bases 700170-700578) was amplified by PCR from S. cerevisiae genomic DNA using the forward primer zeoR_neu_#1_kpn1 (adding a non-template coded Kpn I site) and the reverse primer TEF1_back:_#1 . An artificial E. coli EM7 promoter sequence and an Ncol restriction site were added to the 3 ' end of the TEF 1 promoter by primer extension using the forward primer zeoR_neu_#1_kpn 1 and the reverse primer TEF1 _back:_#2Ncol. The resulting PCR fragment was treated with Ncol and fused to the Ncol site of the 5 ' end of the Sh ble ORF. The Sh ble ORF was amplified by PCR using the forward primer Sh ble_FOR_#1_Ncol (adding a non-template coded Ncol site) and the reverse primer Sh ble_back_#2_Aatl (adding a non-template coded Aatl site) from a pUT737 plasmid (Cayla Toulouse, France pUT737 catalog # VECT 7371 ). The product of this fusion was used as a template for PCR (forward primer zeoR_neu_#1_kpn1 and reverse primer Sh ble_back_#2_Aatl) resulting in a 893 bp fragment.

The transcription terminator of the cytochrome c (CYC 1 ) gene from S. cerevisiae (Cytochrome c, isoform 1 gene from S. cerevisiae chromosome X bases 526663-526937) was amplified by PCR from genomic DNA using the forward primer cyc1 TT_FOR_#1_aat1 (adding a non-template coded Aatl site) and the reverse primer cyc1 TT_neu_back_Kpn 1 (adding a non-template coded Kpn I site), treated by Aatl and fused to the Aatl treated 893 bp hybrid of TEF1 promoter and Sh ble ORF. The zeocin cassette of the final size of 1 1 70 bp was amplified by PCR using the forward primer zeoR_neu_#1_kpn1 and the reverse primer eye 1 TT_neu_back_Kpn 1 . The PCR product was purified by agarose gel electrophoresis and the fragment of the correct size was used as a template for a second PCR. The second PCR fragment was treaded by Kpnl

cloned in the Kpnl sites of pBR322y 2 artMCS_C)RI_cyc1 TT vector resulting in a vector called pBR322 1 / 2 artMCS_ORI_cyc1 TT_zeoR.

For integration of the pPuzzle vector system in the genome of P. pastoris it was decided to use a target sequence in the 3 ' area of the A0X1 gene of P. pastoris. Two 400bp fragment called AOXTTparti and AOXTTpart2 (sequences from Integrated-Genomics, Chicago USA, ERGO database, P. pastoris IG66 Contig 1471 bases 52189-52588 and 52589-52979) were amplified by PCR from genomic DNA of P. pastoris. By using the forward primer 5 A0X TT #1 Hindlll/Notl and the reverse primer 5 A0X TT #2

Ascl/BamHI non-template coded Hindlll and Notl restriction sites were added to the 5 ' side and Ascl and BamHI restriction sites to the 3 ' side of the fragment AOXTTparti . For adding a 5 ' BamHI site, a 3 ' Notl site and a 5 ' EcoRI site to the fragment A0XTTpart2 the forward primer 3 A0X TT #3 BamHI and the reverse primer 3 A0X TT #4aNotl/EcoRI were used. For assembling AOXTTparti and AOXTTpart2 according to their orientation in the genome the fragment AOXTTparti was subcloned in the EcoRV site of pSTBIue-1 using the Novogen Perfectly Blunt ® Cloning Kits, pSTBIue-1 (Merck Biosciences, Germany). A 500bp fragment was amplified by PCR using the forward primer T7 and the reverse primer 5 A0X TT #2 Ascl/BamHI. This fragment was cut by BamHI and ligated with the BamHI treated AOXTTpart2 fragment. The ligation mixture was used directly as a template for PCR with T7 as forward primer and 3 A0X TT #4Notl/EcoRI as reverse primer. The fragment of the correct size ( ~ 900bp) was purified by agarose gel electrophoresis and used as a template for a second PCR with 5 A0X TT #1 Hindlll/Notl and 3 A0X TT #4Notl/EcoRI. The presents of the Ascl restriction site in the middle of the PCR fragment was checked by Ascl endonuclease digest of the resulting 800bp fragment called AOXTTparti + 2 To get ride of the pBR322 shuttle in the pBR3227 2 artMCS_ORI_cyc1 TT_zeoR vector it was cut by Notl and the 2270 bp vector backbone of the pPuzzle zeoR was separated from the 2884 bp pBR322 shuttle fragment by agarose gel electrophoresis treated with alkaline phospatase and ligated with the Notl treated PCR fragment AOXTTparti + 2. The resulting vector was called pPuzzle zeoR AOXTT.

Starting from the pPuzzle zeoR AOXTT vector backbone an enhanced green fluorescent protein (eGFP) gene was inserted into the MCS using the

restriction sites Sbfl and SfII. The eGFP gene (718 bp) was amplified by PCR e.g. from the vector pcDNA™6.2n-EmGFP-DEST (Invitrogen Austria). Two non- template coded restriction sites Sbfl and Sfil were attached by primer extension using the forward primer eGFP#1 Aarl/Sbfl and the reverse primer eGFP#2Sfil. The Sbfl and Sfil treated PCR product of eGFP was inserted into the alkaline phosphatase treated Sbfl and Sfil sites of pPuzzle zeoR AOXTT. The resulting vector was called pPuzzle zeoR eGFP.

In Table 5 the PCR primer sequences used in the cloning procedures of Example 3 and 4 are summarized.

Table 5: PCR primers for cloning of pPuzzle zeoR eGFP (SEQ ID NO 52 to SEQ ID NO 74) pBR322_FOR_Notl (SEQ ID NO 52):

5' . AATAGCGGCCGCGCATCTCGGGCAGCGTTGGGTCCTG - 3' pBR322_BACK_Notl (SEQ ID NO 53):

5' - GATTGCGGCCGCGACGTCAGGTGGCACTTTTCGGGGAAAT - 3' puc19ORI #1 -Sacl (SEQ ID NO 54):

5' - GATCGAGCTCTGAGCAAAAGGCCAGCAAAG - 3' pud 9ORI #2-Sacl (SEQ ID NO 55):

5' - GAAAGAGCTCCCGTAGAAAAGATCAAAGG - 3' ampR #1 Hind III (SEQ ID NO 56):

5' - GCCGAAGCTTACAATAACCCTGATAAATGC - 3' ampR #2 Hind III (SEQ ID NO 57):

5' - GCCGAAGCTTAAATCAATCTAAAGTATAT - 3' cyc1 TT_neu_F0R_BamH1 (SEQ ID NO 58):

5' - CAATGGATCCCCTTTTCCTTTGTCGATATCATGTAATTAGTT - 3' cyd TT #2-Age I (SEQ ID NO 59):

5' - GTGGACCGGTAGCTTGCAAATTAAAGCCTTCGAG - 3' zeoR_neu_#1_kpn 1 (SEQ ID NO 60):

5' - GATCGGTACCCACACACCATAGCTTCAAAATGTTTCTACTCCT - 3'

TEF1_back:_#1 (SEQ ID NO 61 ):

5 1 - TACTATGCCGATGATTAATTGTCAACACCGCCCTTAGATTAGATTGCTAT

GCTTTCTTTCTA - 3'

TEF1_back:_#2_Nco1 (SEQ ID NO 62):

5' - TTGGCCATGGTTTAGTTCCTCACCTTGTCGTATTATACTATGCCGATATA

CTATGCCGATGATTAATTGTCAACACCGCCC - 3'

Example 5: Comparative yeast promoter activity studies in P. pastoris

a) Amplification and cloning strategy of promoter sequences from P. pastoris:

To identify novel promoter sequences for use in a strain of the genus Komagataella for recombinant expression of a heterologous protein the normalized signals of all measured genes of trypsinogen producing and non- producing cells, respectively, obtained from the DNA microarray hybridisation

described in Example 1 were ordered by their relative expression levels.

Further the relative expression level of each measured gene was compared between trypsinogen producing and non-producing cells. From these data the 23 genes with the highest expression level in trypsinogen producing and non- producing cells were considered for further analysis. A listing of the genes selected for further analysis is found in Table 6. Further, only such genes for which genomic sequence data were available have been included in the selection. The promoter sequences of these 23 potential interesting genes (up to I OOObp of the 5'-non coding region of the respective genes) were identified using a P. pastoris genome database (ERGO™, IG-66, Integrated Genomics) and amplified from P. pastoris by PCR. Additionally, the well known promoter sequences of AOX and of GAP were amplified via PCR from P. pastoris for comparative reasons (primer and primer sequences see Tables 6 and 7). In 25 final cloning steps the 25 promoters (including the two control promoter sequences) obtained from P. pastoris were inserted upstream of the start codon of the eGFP gene using the Apal and the Sbfl restriction site of the multiple cloning site of the vector pPuzzle ZeoR eGFP or in case of the promoter of FET3pre using the Apal and the Aarl restriction site (see Table 6).

Table 6: Overview of the genes, the PCR primers used for amplification of the promoter sequences, the restriction enzymes used for cloning of the promoter sequences and the fragment length of the promoter sequences from P. pastoris

Table 7: PCR primers used for amplification of the promoter sequences from P. pastoris (SEQ ID NO 75 to SEQ ID NO 1 24)

Paox # 1 Apa I (SEQ ID NO 75):

5' - AACCGGGCCCTCTAACATCCAAAGACGAAAGG - 3'

Paox #2 Sbf I (SEQ ID NO 76):

5' - CATGGCCTGCAGGTGTCGTTTCGAATAATTAGTTGT - 3'

Pgap # 1 Apa I (SEQ ID NO 77):

5 ' - AACCGGGCCCAGATCTTTTTTGTAGAAATGT - 3'

Pgap #2 Sbf I (SEQ ID NO 78):

5' - CATGGCCTGCAGGTGATAGTTGTTCAATTGATTGAAATAGGGAC

AAAT - 3'

Pgnd i # 1 Apa I (SEQ ID NO 79):

5 ' - TATCGGGCCCTATGGTAGAATCATCAATTGGAAT - 3'

Pgnd i #2 Sbf I (SEQ ID NO 80):

5 1 - CATGGCCTGCAGGTGATTTGTATCAGTCTTGTTTCTTTTCTTT - 3'

Pgpm i # 1 Apa I (SEQ ID NO 81 ):

5' - TATTGGGCCCGAAAGAAGGTTTATCTGACTGTTGCGCAC - 3'

Pgpm i #2 Sbf I (SEQ ID NO 82):

5 1 - CATGGCCTGCAGGTGTGTTTGTTTGTGTAATTGAAAGTT - 3'

PHSP90 # 1 Apa I (SEQ ID NO 83):

5' - GACTGGGCCCTTCAAGATCTTTTGAGGACTAGAGA - 3'

PHSP90 #2 Sbf I (SEQ ID NO 84):

5 1 - CATGGCCTGCAGGTGATTGATATTTTTCCAAAATTAAAAAGTTAA - 3 1

Pkar2 #1 Apa I (SEQ ID NO 85):

5 1 - ATCAGGGCCCACTATCAAAGCTATCAATTGTGGAAATGGACAGCA - 3'

Pkar2 #2 Apa I (SEQ ID NO 86):

5 1 - CATGGCCTGCAGGTGTCTTGAGTGTTGGAATTGAAATTAAGGAAG

AAG - 3'

Pmcml #1 Apa I (SEQ ID NO 87):

5 1 - GTACGGGCCCACAGCTTTGGCTTGAACAAT - 3'

Pmcm l #2 Sbf I (SEQ ID NO 88):

5 1 - CATGGCCTGCAGGTGGCTAAATGAATGCGGGTTAGTGTTTGA - 3'

Ppet9 #1 Apa I (SEQ ID NO 89):

5' - AGTACGGGCCCTAGAAAATTCACCACTGTCGGAAAGT - 3'

Ppet9 #2 Sfi I (SEQ ID NO 90):

5' - CATGGCCTGCAGGTGGAAGTCGACGAAGAAGTTAGACTTGTTGTT - 3'

Prad2 #1 Apa I (SEQ ID NO 91 ):

5' - GTAAGGGCCCGTATAGTTTGCAGACATAGTAGGAGAGTTT - 3'

Prad2 #2 Sbf I (SEQ ID NO 92):

5' - CATTGCCTGCAGGTGATCCTTAGCCCAACCTGATGGAAAAACGG - 3'

Prps2 #1 Apa I (SEQ ID NO 93):

5' - GTACGGGCCCTCCTGAGAACGGACAGCAGC - 3'

Prps2 #2 Sbf I (SEQ ID NO 94):

5' - CATGGCCTGCAGGTGATTAACTACACTGAAAAAGTCGGAATGTAC - 3'

Prps31 #1 Apa I (SEQ ID NO 95):

5' - GTACGGGCCCTTGTTTATAGCCTATAATCGCAGA - 3'

Prps31 #2 Sbf I (SEQ ID NO 96):

5 1 - CATGGCCTGCAGGTGTTTGGCTTCGTCGGCAATACGTGAATGCTT - 3'

Pssa1_2 # 1 Apa I (SEQ ID NO 97):

5' - GTAAGGGCCCGTTGTATCCATTCACTATTT - 3'

Pssa1_2 #2 Sbf I (SEQ ID NO 98):

5' - CATGGCCTGCAGGTGAATGTTTAACTTTGTTTAATTTCTATGC - 3'

Pthi i #1 Apa I (SEQ ID NO 99):

5' - GTAAGGGCCCATCTTTTCAGCTTCATCGTCAG - 3'

Pthil #2 Sbf I (SEQ ID NO 100):

5 1 - CATGGCCTGCAGGTGGATGATTTATTGAAGTTTCCAAAGTTG - 3'

Ptpi #2 Apa I (SEQ ID NO 101 ):

5' - GTAAGGGCCCTTCAACGAGACACTCTTCCGTCA - 3 1

Ptpi #2 Sbf I (SEQ ID NO 102):

5' - CATGGCCTGCAGGTGTGTGTTTGTGATAGATCTTGTATAT - 3 1

Pubi4 #1 Apa I (SEQ ID NO 103):

5' - AGAAGGGCCCAGAAGATTACCATAAATTGAGA - 3'

Pubi4 #2 Sbf I (SEQ ID NO 104):

5 1 - CATGGCCTGCAGGTGAAAGCGACAAACGTCACGTGAACAAAAG - 3'

Peno # 1 Apa I (SEQ ID NO 105):

5' - TATCGGGCCCAAAGAGTGAGAGGAAAGTACCT - 3'

Peno #2 Sbf I (SEQ ID NO 106):

5' - CATGGCCTGCAGGTGTTTTAGATGTAGATTGTTATAATTGTGT - 3 1

Prsp7 #1 Apa I (SEQ ID NO 107):

5 1 - TATCGGGCCCTTTCATCCAGCTCTTTAACCTTAT - 3'

Prsp7 #2 Sbf I (SEQ ID NO 108):

5' - CATGGCCTGCAGGTGCTTGTGATACTGCTGTTACCGTGTGAGTTT - 3'

PrpH #1 Apa I (SEQ ID NO 109):

5 1 - TATCGGGCCCATAAGTCCTAGAACACCACTTGTTAGTAAAACCGGT - 3'

PrpH #2 Sbf I (SEQ ID NO 1 10):

5' - CATGGCCTGCAGGTGTTTCTATTAATTCGTCTCCCTAGCAAAAAG - 3'

Ptkl #1 Apa I (SEQ ID NO 1 1 1 ):

5' - TTTAGGGCCCGATATCGATTCCACTGCTCAGAGTCTTTTC - 3'

Ptkl #2 Sbf I (SEQ ID NO 1 12):

5' - CATGGCCTGCAGGTGTGTGTAGAGTGGATGTAGAATACAAGTC - 3 1

Ppis #1 Apa I (SEQ ID NO 1 13):

5' - AACCGGGCCCTTTTTCCTCTTCGTTGTGTGGTAAACTCGG - 3'

Ppis #2 Sbf I (SEQ ID NO 1 14):

5' - TGATGCCTGCAGGTGGACTATCTAGAGACAAGTAAATTTCCATGTT - 3'

Pfet3 #1 Apa I (SEQ ID NO 1 15):

5 1 - AACCGGGCCCTTTCGTACCAAATGGAAAAATCACGTACAA - 3'

Pfet3 #2 Sbf I (SEQ ID NO 1 1 6):

5' - TAATGCCTGCAGGTGAAAACTAGATCCTCTTTGGAACAGGCCGT - 3 1

Pftri # 1 Apa (SEQ ID NO 1 1 7):

5' - AACCGGGCCCTCGAGTAACACACTACTAACTTTTTA - 3'

Pftri # 2 Sbf I (SEQ ID NO 1 18):

5' - TAATGCCTGCAGGTGTTTGAAAAGAACTACAACGACCACTGA - 3 1

Pnmti #1 Apa I (SEQ ID NO 1 19):

5' - AACCGGGCCCTAACATGATATCATGATGTACGTACAAACTAGGA

TCT - 3'

Pnmti #2 Sbf I (SEQ ID NO 120):

5' - TAATGCCTGCAGGTGGATTGGTGATTTTGATGGTCA - 3'

Ppho8 #1 Apa I (SEQ ID NO 121 ):

5' - ATTAGGGCCCGGTATAAGTATAGCACATGTTGACG - 3 1

Ppho8 #2 Sbf I (SEQ ID NO 122):

5" - TAATGCCTGCAGGTGTGCTTTGAAATTGAAGGGGAGAGGACGCTA - 3'

Pfet3pre #1 Apa I (SEQ ID NO 123):

5' - AGCAGGGCCCTTGTGGTCCTATGAATTAACCATTTAA - 3'

Pfet3pre #2 Aar I (SEQ ID NO 1 24):

5' - CTAGTCATGGCCTGCAGGTGTCGATGGAGTGTTGGCGGCAGTGGT

TAC - 3'

b) Analysis of promoter activity in P. pastoris:

To test the properties and the activities of the different promoters, the 25 vectors prepared in step a) were digested with Ascl and used for transforming P. pastoris via electroporation (using a standard transformation protocol for P. pastoris). Transformed P. pastoris cells were grown on YPD-medium ( 1 % yeast extract, 2% peptone, 2% glucose) containing 100 mg/l zeocin. From each transformation 10 single colonies were picked on a YPD-zeocin agar plate and used to inoculate a 10 ml liquid culture. The eGFP expression was measured either when the cells were cultured on glucose as the single carbon source or on glycerol/methanol as the single carbon source. The amount of recombinant eGFP was quantified using flow cytometer analysis and the relative eGFP expression levels were calculated as shown below.

A untransformed P. pastoris wild type strain and P. pastoris transformed with a pPuzzle_zeoR_P AO χJacZ_AOXTT vector were used as negative controls for eGFP expression.

Calculation of relative eGFP expression levels: FL 1 (fluorescence channel 1 ): GeoMean of 10000 events FSC (forward scatter): GeoMean of 10000 events

rfu: relative fluorescent units rel.Exp[%]: relative eGFP expression normalized on GAP promoter

eGFP expression on glucose as single carbon source:

Shake flask cultures in 100 ml Erlenmeyer flasks on 10 ml medium (containing 1 % yeast extract, 2% peptone, 100 mM potassium phosphate buffer pH 6.0, 1 .34% yeast nitrogen base with ammonium sulfate, 4x10 5 % biotin, 2% glucose) were inoculated with a single colony from a YPD-zeocin agar master- plate and cultivated at 28°C and 180 rpm. Glucose was added to a final concentration of 0.5% every 12h. Samples were taken 16h, 40h and 67h after inoculation, diluted with sterile PBS to OD 600 of approximately 0.1 -0.2 and analysed on GFP expression by flow cytometer analysis (BD Facs Calibur). The results are shown in Table 8.

eGFP expression on glycerol/methanol as single carbon source:

Shake flask cultures in 100 ml Erlenmeyer flasks on 10 ml YPG-medium

(containing 1 % yeast extract, 2% peptone, 1 % glycerol) were inoculated with a single colony from a YPD-zeocin agar master-plate and cultivated at 28°C and 180 rpm. After 22h cells were harvested by centrifugation ( 1 500xg 5min.) and resuspended in 10 ml MM-medium ( 100 mM potassium phosphate buffer pH 6.0, 1 .34% yeast nitrogen base with ammonium sulfate, 4x10 5 % biotin, 0.5% methanol). Every 12h methanol was added to a final concentration of 0.5% . Samples were taken 22h, 42h, 64h and 90h after inoculation, diluted with sterile PBS to OD 600 Of approximately 0.1 -0.2 and analysed on GFP expression by flow cytometer analysis. The results are shown in Table 8.

Table 8: Relative eGFP expression levels in % (standardized on eGFP expression under the GAP promoter) in P pa s ton ' s.

16h 4Oh 67h 22h 2Oh 44h 7Oh glucose glucose glucose glycerol => methanol methanol methanol

AOX 3.6 1.0 119.0 162.9 184.9

GAP 100.0 100.0 100.0 100.0 100.0 100.0 100.0

GND1 0.0 0.0 0.0 0.0 4.3 8.4 66.7

GPM1 41.0 25.2 24.8 23.5 19.2 22.1 25.2

KAR2 134.3 12.3 10.9 71.7 37.6 35.6 42.9

TKL1 1.4 0.0 0.0 5.4 4.3 7.1 9.3

PET9 1698.7 483.7 490.3 1203.9 740.6 750.6 926.2

HSP90 81.1 6.4 5.4 39.5 21.2 21.6 32.9

RPS2 0.0 0.0 0.0 9.4 5.7 7.9 12.4

SSA1 0.0 17.3 21.3 11.4 9.1 9.2 30.1

PIS1 0.0 0.0 0.0 3.0 0.0 2.8 6.5

FET3 0.0 0.0 0.0 3.9 0.0 3.0 7.0

FET3pre 0.0 0.0 0.0 3.9 2.2 2.3 7.1

RPS31 6.3 1.0 0.0 2.9 2.9 4.0 7.7

ENO1 22.3 46.6 45.8 30.6 28.9 17.3 26.1

PHO8 0.0 0.0 0.0 0.0 2.3 1.9 6.1

FTR 1 0.0 0.0 0.0 0.0 0.0 1.9 5.6

NMT1 0.0 0.0 0.0 0.0 0.0 2.3 5.0

RAD2 0.0 0.0 0.0 0.0 1.4 0.0 5.4

RPS7A 17.7 2.7 1.4 13.7 10.0 6.6 12.4

MCM1 0.0 0.0 0.0 1.4 2.9 0.0 6.0

UBI4 0.0 0.0 0.0 0.0 1.2 0.0 4.2

RPL1A 0.0 0.0 8.7 9.4 4.9 2.1 10.9

THI3 0.0 0.0 1.2 1.2 13.7 13.4 41.5

TPM 0.0 12.1 13.9 4.8 57.3 38.0 91.7

From Table 8 can be seen that there are promoters with different transcription levels on different carbon sources in a range from 0% to 1600% available (relative to the eGFP expression under the well known GAP promoter, which was set as 100%). Real unexpected were the high eGFP expression levels obtained from the vector pPuzzle_zeoR_P PET9 _eGFP_AOXTT (see Fig. 2), wherein the eGFP is under the control of the PET9 promoter, i.e. a 1000 bp fragment from the 5'-non coding region of the PET9 gene. The eGFP

expression obtained from the vector pPuzzle_zeoR_Pp ET9 _eGFP_AOXTT under the experimental conditions as described above ranges from about 480% to about 1700 % when the P. pastoris cells were cultured on glucose as the single carbon source and ranges from about 740 % to about 1200 % when the P. pastoris cells were cultured on glycerol/methanol as the single carbon source (standardized on eGFP expression under the GAP promoter). The nucleotide sequence of the 1000 bp fragment from the 5'-non coding region of the PET9 gene of P. pastoris is shown in Table 9 below.

Table 9: Nucleotide sequence of the PET9 promoter (a 1000 bp fragment from the 5 '-non coding region of the PET9 gene) of P. pastoris (SEQ ID NO 1 25)

CAGGTAGAAAATTCACCACTGTCGGAAAGTTGTCTACTTCCGTCGGTTGGAA

ATACGAGTCTGTTGTTGAGAAGTTGGAGGAGAAGAGAAAGGCTGAGGAAGCT

GAGTACCAGGAGAAGAAGAGAGCTTACACCCAGAGATTAGACGCAGCTAGT

GCCGAGTTTGCCCAAACCGAGGAGGGAAAGCAGTTGGCTGCCTTTGGTTACT

AAATAGTAAAGTAGGGTATCTTCAAGTAATAGTATACTAACCATCTGAAATA

ACCACCGTCCTGTAGTTTTTTTTCGATATCGAAGAGCCTATGCTAGTACTGTG

GATTTGCGCTCCATCCAACATCTGTGCGCAAACTAAAACTTCCGAGACTGAC

ATCTACCATCGCTAGACCCTAAGTAAAACCAATCTCGCGTCCGAACTTTTAA

ATTTCAGTCCTTAAAACTTCAGAGCATTGGTTGTAGTTTCCGGATCTGAGGGG

TCGTATTGGAGTCAAGAGACGGAGCTGCCTCCACAGCGCGAAACGTCAACCC

CAACACCAACCTGAATTTGCAATCACCATGGGGACAAGTTTCAGCAGTCAAT

GGGCAATTCAGACGTTGATACGGTACCCATTTGCTAAGCTCAATGACGATCC

ATCCAACTTCAGAGAAAGGCCTTTCTCTGGTATGCTCTGGTATTCATTCGTCT

TTTATCACTCTCGTTGCACAATGCCCGGGTACTCCCGGAACAAGGGAGTCTT

CCAGCCAAGCTGTACAGAGTGAAAAATAGAAATACACCTTTGCAATCAAGAC

GCGCGTTGGCCAATCACAAGACTTAATCGGTGCAAAGAAGGATTACCAAATT

TTTTTTTCCCAAAATCGCTATATAGAAATAATGGAGGAAAAAGGGTTAATAT

AAAGGAGAATTCCCCCGTTTTTCTCCCCTTGTCTTTTCTTCTTCAGGCTTTCTT

ACAAATCTATAATATTCCAAAATGGCTGACAACAACAAGTCTAACTTCTTCG

TCGACTTCATG

Additional interesting promoter sequences with different transcription levels on different carbon sources in a range from 0% to 135% (relative to the eGFP expression under the GAP promoter) are shown in Table 10.

Table 10: Nucleotide sequences of promoter sequences (1000 bp fragments from the 5 '-non coding region of the respective genes) of several P. pastoris genes (SEQ ID NO 126 to SEQ ID NO 147) and respective ERGO™ database information

GND1 (SEQ ID NO 126); RPPA07075 - Pichia pastoris (IG-66), Contig1891 9813 1 1291

TATGGTAGAATCATCAATTGGAATGACCCTATCGTTGCTATACAGACTCGTAG

CCCCATTTCTTGTTTGTTGGTTTTCGTCGTCAATTGCCTCCAAAATTGAAGAG

ATGGCACTTCGTCGCGTGGGTTCCTGGGAAATATTTTGAAGTGGAACATCGTT

GAAATAAGGCAGTATATCATGCTGATTCTCAGCATTGGAGTCAGTAGCTTCA

AGTGAATCTATACTGTATGAGTCGGAAGAGGATTTCCGGCTCGTTTTTGTCTT

CATTTTGTTATTAGAAGGAATGATAACGTTGGGAAACCGGAGGTTGGAGATT

TTGTATATATAAAACTTTCTTGGAGCTTATTAATAAATGCGGGATGCAGTAAA

CTTGCATATATCTATTGTAACACTTTTGCAATAGCTGCATGCCTTGACTCATC

ATTCAGTATCGTGTGAAAACCAATGATACATCCGTACATTCAAACTACAACC

TTCCTCATTAGTAATTCTTTTTGAATTTTTCGGAACCCGAAGCTCCGCCTATC

CCCCCAACTAACACATCTTCCAATTTGGGTGGGAGAACACCTAGCAACATCA

CGATCATTGCGCGAACGTTCGCACTGTATTTTTTTCTCCCAAACACCCAACTT

CTAGGCCAAATATCCACTTCTCGGGGTTCTATTCACCCATTTAATTGTTGGCC

TTAAAAGTCAATTGAGTTCCAATCATAGTCCCTAGTTGATTGCTTGTAGCAAA

TGCCACAACAGTAGGCATTTACGTCCTCACAGTCTCTTCCCTTGTCCCTCATT

GATACCTCTTTATTCTCCCCCACCACCATACACTACCTTCCTCGCACCCCTGT

CATCACAACCGCAATATAATCGATGCGCGGTTTCTTGCCTAATCCATCGTCC

AACAGAGAGGTCGCTCTCCTTATATATATAGTTGATCCCCCTTTTTTTCTACC

CTTGCAATTTTTTTTTTGGGACCAAAGAAAAGAAACAAGACTGATACAAATA

TG

GPM1 (SEQ ID NO 127); RPPA05288 - Pichia pastoris (IG-66), Contig 1212 2407 3064

GAAAGAAGGTTTATCTGACTGTTGCGCACCACCGAAACCAAAAGCGGGTTTA

GCTGCAGAACTTTCAGTGGCCGGCTTGTTGCCCAAAGAGACTGTTACTGCCT

CCATCGGTCTTGGTAGCGCCTGAACCAAACGAAAAAGCACCACTGGGATTAT

TGTTTCCTCCAAAAGACGCGGTCGTGTTTGATGCTGTTGAGGCCGCAGGCGC

TTGACCAAAACTAAACGCTTTTGATGTATTGTTTGTAGGGGCTGTTGCGTTGT

TAGAAGCTGGCTGGTTGTTCCCAAAGGAAAAAGCACCTGTTGATCCAGTGTT

AGATCCACCTGCGGCATTCCCTTGAGCTGCGGAGCCAAAAATTGATCCTTGG

TTTGAGCTCTGTGTACCAAATAAAGATGAATTGGCCTTAGCTCCGTCAGCAG

GAGCGTTTGCTTGTCCAAAGGGGCCTCCAGACGACGTTTGCGTGTTGTTGGC

ATTACTAGTATTGGCAGGCTGATTTCCAAACAATGAGGGCTTGGAGGCGTTC

TGGGAGTTACCAAATGAGAACCCTGTATTTCCTTGGCCAGAGCCTGAGAAAC

TGAACCCTGATCCAGACATTATCAATACCTTGGGTTATTAGTAGTGTCCGTTA

TTTTTCTGTTTAGGTTACGATTTTGCCAGATTTTTTGGGAGGAGGGAAACAAA

AGAACCAGTGCTACACGACCTTTAAGTGCCATCAGGCATCCTGTTTTCTCGA

CCTCATCTCATCACATCCGTCAGTCTGAGCTTTCAGTTCTCAGTTTTCGATTG

ACTCTTGCCCTGCTGCGCGCACACCATACCCTGGCTCCCTCTCATGCTTCTGG

CGTTACCCCGGGAATCGTACATCCATGCCGCGAATCCCGGACAGGACTCAGA

CGGATTTCACTATTCCTAGTGGGCACAACCTCCATATATAAGGATACTTCTGC

CCCCTAGCTTAAGTGCCTCTATTTTGTCAGTAACAACTTTCAATTACACAAAC

AAACAATG

HSP90 (SEQ ID NO 128); RPPA05876 - Pichia pastoris (IG-66), Contig1951 9775 1 1895

TTCAAGATCTTTTGAGGACTAGAGAGCAAGACCTGCATTTGATAGTCGAAGA

GGAGAGGCAATTTAAAGATCTAGTATTGGAGAAGGACAGACTGCTCGATGTT

CAAGATAAGAATGCTTTGAATCTAAGGCTACAACTAGGTTTTGCTTCCCACA

AGGTAATTGATCTAAACCAATTACTTCCTGTGCAAAAAACACTTGGAGCCAC

TACTAGTCTGGAGTTAATCAACTTTCAAACAGACATCAAAACTTTACACGTC

GTAAACAATGTCATGGACAAAATCCGGGTTAAGGTCACTGGGGAGAAGCGA

CTCGTGACACTGACTCCTGACAGCACCAACAAGGCTCTAGAATTTCCCACTG

AGAATGATAGGCCACTGCAAGAACAACAAAATACAGAGCATGAAAACGAAA

ACGGTGCTCAAGATCCCATAGACTCAAACCAAACGAATGATTCTCAGACATA

CTCCACACCTAGCCGTATCCATTATACGACTAACGAGGAGAGTTCCTCTAAG

AAGGTTAAAATATCAAATTGAGCATGATACGATAGCCCGCTGGCAAGGAATT

AAGTGACACTAGATCGCAAATAATTCGGAATCACTTTATGTGGAATCATCAT

TCTTGATAAATTGTGATAAAGTATCGTTCGCGAGATGGTTTTTCCAGAAATTT

CTTGTGGATCAATATCTTTAAACGATAGTAAACGATGTTGACAGCTCCAACA

ATGTCGGAACTGCTCATCAAAGACGGACAATTTCCCTATTAGCAGAAAAATA

GCCCATTTATCATCACCGCAGCGCTATTACTGAATGATTTTTTTGAATTTTTCC

TTGTCTGTGAAGCCGCAAACCACATTTTCTGGCGCGGTTACCCGGTCATAGA

AGCTGGTGGAATATTCCATGCTACCTCGACGATGAGCTTGCGAAATATATTTA

AGTGTAAGAACTCCGCGTCATTAAGACTCTGGTTAACTTTTTAATTTTGGAAA

AATATCAATATG

KAR2 (SEQ ID NO 129); RPPA06939 - Pichia pastoris (IG-66), Contigi 900 3677 1641

CCACTATCAAAGCTATCAATTGTGGAAATGGACAGCAAAACGGAAACGCAG TCAATGACGATGATTTCAGAGCTAAGCGATTGTGCGAAAGGTGGTTGGACAA

TCTGTTTATGCTTCTTTATGAAGATCTGAGGGTGTATACGATGTATCGTGCCG

AACACATGCACTTGACGGCACAGCAAATGGAATTCAAGAAGACCACTTTAGA

ATGGGAGTTAATAGGGATGGTTTCATGGAGGTTAAAACACTTCAAGGAGGCA

TCTGAAGCATTCAAGTATGCACTAGGTCTGAGGTTTTCGGTCAAGGCATGCA

AGAAATTAATTGAATTCTATCTGAACGAACGCTCCAGAATGAACCAGCCAGA

AACCTCAATTGCCCTCAACAACTTAAATCAATCCACATTATCCATCCAAGAG

ATTCTCAAGTATCGTTCGTTCCTCGATATCAACCTAATTTCAAACTTGGTCAA

ACTAGGAGTTTGGAATCACCGCTGGTATGCTGAGTTTTCTCCAAAACTCATAG

AAAGCCTTGCGGTTGTTGTGGAGAACGGAGGGCTTATCAAGGTAGAAAACGA

GGTTAAGGCTACCTATTTCGATTCACAAGATGGAGTTTACGACTTGATGAAC

GAGGTATTCAAGTTCATGAAGCATTACGATTATCCTGGGACTGACAACTAAG

AGCTCCTAGTGAAGACTTGAGATGGACATGATAAACAATTATAGTGAAAATA

GAAACCATAATACAATATTCTAATAGAGGAACCGTTTACCTGTGGTTCCTATT

GTGGCCTACTGTTACTAGCTAGTGTAATACACCCTTGCCTCAGCTTTGCAAGT

TGACAACTCAGCCAAATGATCTTTGAATGCGCGAAACCTCAAGGTCCATCGA

ATTTTCTCGAATTTTCAGTGTTTTCATACAGCGTGTCATCTTCTTTCGCGTACT

TATTTAAATCGTACCCAGATCCCTTCTTCTTCCTTAATTTCAATTCCAACACTC

AAGAATG

MCM1 (SEQ ID NO 130); RPPA08348 - Pichia pastoris (IG-66), Contig 1815_6838_7665

ACAGCTTTGGCTTGAACAATAGTGGTTGGATGTTACCCGGTGCGACAAGTGC

TGAGTTGCGGTAATTTACGATTTCAGCGTCCACCAGAATGGGAATTCCGGGT

AACACGCATACCAGGGGAAGAATATCACAAAGATCACAGAGATTGGATAAA

ACTGGACCAGAAACTACCAAATAAGTAGACCGTCATACTCAGCTCCTACACC

AAGAGGTACTTCAGCCTTTTACCGGTTTAAAAAGCCCCCCGAAATCGACTAA

TTACCGAGGCATTATGTTTACTACTGATGGAGATTTCGAAATATCCTTCCCGA

TTAGTCCAACATCTCGAAATTAAACTGCGCAGCACTATGCCGAAAGCTATAT

AAACAATCATTTCCCCAACTGGAACATCTTTTTTCTCTTTTCTTCGTGTATCAT

CCTTTGGTCTTTTAATCTTTCAGAAAAGTTTTCATTAAAATGCTTCCAGCTGGT

GTTATTTTAGTCTTTTGTCTACTTTTCATTATCGGGGTAATTATCGCAGTTATC

CTGGGCATGAAGTGGTACAAGAAGAGAAAGAACTGAGCATGGACGAAGAAA

ATTACTAGACCAAAGATGTATCACCAAACAAACCCCCCCAAGAATCTGTCAT

AAACGAAGTTGTTATAGGATGGTTATCACTCTCACATTTTAATACAGGAATAC

CCTAATTTTTCCTCAGTTCGGTGACGAATGGAGTTGTAGTTATCTTTCTTTCCC

TCGTTTCTGCTCTTATTTCCCCCTCTCGTCGCATCATTTGCATCAAATTGGTGC

AACGTGCGCGCACGCTCGGCTTTGGCTGCAACGACTTTCTGTCGTGTACCGA

CCCATATTCTATCGCTTCGGGTAGTCCGCAACACCCTCAACTTCCTAATTTTG

TGTTCTTACAAGCTGCAAAAAAGTAGGACTCACTCAATAAGGTAAGTCCCCA

AACATCAATCCCAATTGGGTAATCCATCAAACACTAACCCGCATTCATTTAG

CATG

RAD2 (SEQ ID NO 131); RPPA09286 - Pichia pastoris (IG-66), Contig 1185 5060 8191

ATGGTATAGTTTGCAGACATAGTAGGAGAGTTTGCGGGCTTGCAACTCTAAC

AAGTCTCTGTAGTCGCTGAGACTAATGCAATCGTCAAGACTACTGGGCTCGT

TATCCGAAAAAGTCCCGGAGTAACTAGTCGGTAACTCTCTGTACCGAATCAT

ATTCATCCTGGGGTTGATCTGATCTTGGTCCATGTCATCATTCTCTCCATTTCC

CGTAGGGTAGGGCAATCCCACCAGCCACTTGTTAACCTTTTCTTTATGCGGCA

TATTGGGAGTCTGCTTGGGGTTGAAGCTTTCTGGAAACTCCTTTTTCTTGATC

AGATCGCTGTCACTACAGAACCCAGGATGCTGCTGATAAGGGGAAGAAACA

TAGCCCATGCCCATTGAGTTGAAATTGAAGGGTTCCTGTGCTGCACCTGCATT

CAACGGATATACATGTATAGGATGATGGATCATGTGGCTGTAAGAGGGGCCA

GACGCTGAGTTATCACCATTAACATTGTTAGGAGATCCTGAGCCTTGAAAGG

GCACATTGCTAGTCTTTCCCGTAGTCATTACAAACAGCGGGCCATAGTTCCC

GTTGTCACAAGGATATGCAATTCCTGTAGGGGTATAGATGTGTCTGTTGAATG

GGGTCATGGCGGGATTCTAGAAAAGAGACGGCCCGTTAGTTAAATCCACCCA

CCGATTATTGAGGTCCAACTTACAATTTGGTTTTGACGGTCTTCCATTGGAGG

ATATGTTGTTCTGAGTTGGTGGTAGTTACGTTTATTTCTCCTTCCTTTTTGGTT

TGTGTTAGAGCAAGGACACGGAGCACTGACTACCTTCAAGGCCATTTGTTAA

GAATCCACAGATAGACACTACCACATGGGAAATGTTGTTAGGGAAGAGCTGA

TGCTGAGGCTCAACATCGCGCGACTCCTTTTAGTCAGTTTCCAAGTACGGTTC

ACTGTAGACATTTCTCTTTTTAGTTTCCGTTTTTCCATCAGGTTGGGCTAAGGA

TATG

RPS2 (SEQ ID NO 132); RPPA04904 - Pichia pastoris (IG-66), Contig1847 1328 537

CTCCTGAGAACGGACAGCAGCGCTGGAGGCGGCCTCTTTAACGGTGGCGGCG

AAGTCAACAAGGGTAGTTTTGATTTTGCCTAACAACTGTTGCAGGTCCTTCTC

AGTCTTTGGGTGGCTCTTCTTGAACTCGTCCGATGCATCCTTGACGACCTTCT

CAGCGTCGTCGGCAATCTCGTCGACAACTTGCACTGATCCATGAGGGTCTTG

GGTTTGTTCGACTTCTGGAACTGGTTTTGGCTTGCCAGCGGAAACTCCTTCTG

GGGGTGTGGCAGCTTCGGCGTAGGTACTATCAGTGTTAATATCGGGGATTGG

TTGATGTGCGAGACGAGGCAGGCGGGAGATCCGTCTGCCTCGTCCAACTGCG

ATAGCTCTTACTTACCTCATGATGCTTGTAAAAAAGTTAACTGAAAGATGGA

AATGGGAGGGGGAAAAGAATTGTGGTCAAATCCACGTCTTGCGATAACCTCA

TACATTTGCTGCATGATTGGGGGATCATCGCAATTTCGGTCCTTGTGACACAC

GCGAGCTTTCCGCCTGGGGTTAGAGCGCGGAGCAAGCATACTATCAAGCAAG

AAAAAATGGTATGGAGAAACCTAATTGGTTAAGATATATGAAAACTACGACG

GCTTCATACGTGGTATCTCTGGTTGAGCTACTACGAACCATTTTCCCCCTTCA

AACCCTTTGGCCCATGCCATTCATGCCTTGCCTCTCTCTCAAGCAACTAAGCA

ATCAAGCAATTTCCCGCCTTGCTGCACATGACTGTTCGGAAATCGGAGACCC

AAACACCACTTGTTATCTATGCACGTGATTTTTATCCAGGGCAATAAATACTC

ACTTTTGCTTCAAAACGCTTGGGGCGCGCGAGCGGCAGGCTGGGAAAAAAAT

AATCTCAGACTTTTCAAAAGACTCTCCTCTTTAATCATTGAATACCGTCGATC

GAAAACCACCACCATCGGCTTTCCACGTACATTCCGACTTTTTCAGTGTAGTT

AATATG

RPS31 (SEQ ID IMO 133); RPPA07467 - Pichia pastoris (IG-66), Contigi 605 1365 1817

TTGTTTATAGCCTATAATCGCAGACTTTGTTAACCCGTAGGAATAATCCTACC

CACTAGAATTAGATCTTCCAACCATACATAGCGTGTACTACAACTTAAGCTG

GTCCTCTTCTTTTATCATTCGCGCGGCTCTATCTCATCTACACCTCCATTGAA

CTAATGGAAAGCAAATACGCCGAAAAATATCAAGATCGATTGATTCTTGGAG

ATGATGGCGATGAGGACGAATTGTTTGACGAGCTGGAAAAAGATATTGAGGA

TCAATTCTTGGCCAAATACAGAGCAGAGAGAATCCAACAGTTAAAACAGGA

GATTACCAAGATCAAGGACCATAGTTCAAACATCAACCTCAATGACCACGGT

AACATGAAAACAATAGATACTGATGACGAACTATTGAAAGAAACTGTTGATA

GCGAACGTGTTGTGATCCATTTCTTTAACCCATCGTTTAGCACTTGCCGTATC

ATGGATGAGAAGCTGTCTATAATCAGCACCAAACATATTGGAACGCGTTTTT

TCAGAATTGAAGCACATAGAGCTCCATTTTTAGTTGCAAAGCTTGGTATCAA

AGTGCTTCCATGTGTTGTATTGTACTACAAAGGATTAGAAAGGGATAGAATT

GTCGGATTTGACAGATTAAGTAATTCTCAGACCAATTTTGAGCTAGAAGCTTT

AGAGGAGTTACTCTTAGATAGTGGAATTGTGGAACGAAGAACTGTCGATTTT

AGCAACCTGAGAAACAAGGTCCAAAACAAGGTGGATCAGTCAAAATCAGAC

TCAGAAAGTGATCTAGATATGTGATAGATGGCGGATGGCAGGTTCATTCTAG

TGTTTCACGTGACACACGTGAGCGTTTAAGGGCACACACCCTGACTGACGCG

CGAACATCTAATCTGTTCCGCATGAAAAAAAAAAACTACCTCGACGAAATTC

TCTTCTAGACAGTTTTTACATTGGTAAGAAAGAAGCATTCACGTATTGCCGAC

GAAGCCAAAATG

SSA1 (SEQ ID NO 134); RPPA10651 - Pichia pastoris (IG-66), Contig2026 5140 3168

GGGTTGTATCCATTCACTATTTACTCTTTGTTTCATTTCTTGAATTATTTGGAT ACTACTCTGCTGGCAACTCTACCAGTCTCAAACGCAGACCAGGTTCGCAATT TGATTAGAATGTTCGTGAGCTCTTACAATGAAAAGTCCATGTACCTTGCGGCT

AGTTGTGAATTATTTTTAGTTCCTTCTTTGTTGCTATCCTCTTTGAAGTCGATT

ATATTGCTGGAATGGTATAGGGCTCCCTTTTCATTTATCAGGCAATTAATCGT

GGTATTCTCCGTGATCTCGTTTCTGAGATTAAGATATCAACAGAATGTTTACA

TGAAACAATTAGTTGATAGTTATGATTTGAAGATCAGTCAACTCTTATACCAT

CCCCAACTTCCTCAAGGATTCAGGTTGGGATATTTACGATTTAAGAGTCTATT

AACAAGCACGCTAGGATACTTAGAATTGGAAAAAAAGACCAGATAATGAGA

TTGAACTCGAAATTTAGGATCACCCATATGACGAAGAATTCATTTAGATTATT

GAAGGTGTTTTCATGTTTACCTCCATGAGACCATTTCTGTCACAGCAAATACA

GGCAACGCTTTTCACCAGAGCTTGTTGGTACAACTTTTCAGATGACGCCAAA

TTCTCACGCGCCTCACTTTGTGCGGCGCTAACAATAGGCCATTTTTTTGTACC

TCCCGGATGGTTCAGCTCAATCACTCGATTGAGAGGTTTTTGTTCCGCGATTT

TTGTTCACCCCACACTTTTCTCGAAGGTTCTAGCAATCAAGATAAACACCGC

AAAGAGAGCCGCAGGAACCATATGTGGTACCACAAGTGGTCTTAAACAACTC

TGGTAGAATTCGATGGAATTCGATGGAAGCCGATCGACTCCGATCGAATTGA

AGCAATTCGTATATATAAGGAGAACCTAGTTCCACCCCTTACTCGACCATTA

GTTTACAAGACTAACTTCACAGAAGCATAGAAATTAAACAAAGTTAAACATT

ATG

THI3 (SEQ ID NO 135); RPPA07820 - Pichia pastoris (IG-66), Contig21 16 14564 15586

ATCTTTTCAGCTTCATCGTCAGTGATATTTCTCAGCCCACAGACCAAGTCAAC

TTTGGAATCTAACAACCTTGTTCTTACAATGTTAGAACTCTTAAGTCGCATGC

CATGATCTTCAAGCTGAATTTTGTGAAGGAGGTCAAACCCCACAATGGCATC

TAGTTGTTTAGAATACATGCCTTCGACAAGTGTTTGAGTGTCCAAAATCAAGA

GCTCAAAATTATTGAATTTGTCTGCCAATAACGCCGTAAATTGATTAGTGTCC

AGCCCACCAACAATAGGAGCACCTATAGTTAATTTTTCAGATAAATTTAAGTT

ATCAAGGTAAAGGAGCTCTAAGTTTACCCCTTCCAACAGGGTTATTTGAGAA

CTCAATAAATTGTTGAATTCAAAACCAATTGTCTTTGAATTCTCCACTGGAGC

TTCCTTGCTGAAATTGATTTTGATACCATTGGCATCAAAGAGACCCGTATGAT

AACTCCATAAAAAGGGGAGATGATAGGCCTTAAATTCATCGTTAATCTGCAA

ATTTATTCCTGACATGTCTTTGTAAATAGTTATAGTTCAGAAACTGGAATTGA

GCTCAAAAAACTGGAATCGAGCGGATATTTGAAGATTGATGCCTTACTCATG

AATTGATTGATAAGAGCTCCGTGATTCACTCTGTCAATGATTACCCCTCTCCT

ACCCGATTTGGGACTTTTTCTTCAGTCTTGGGGACTTTTTTTCATATGACTTGA

CCTTGCTTTCCCAATAGGGAAGGACTCACCCATGGATGATTAAGTTTGGATTA

CTCGTTTAGGAAATAGTAGCCATGAATCAATTTGAATCATACCATCATGAAA

TAGGGTTAGGCTGTAAATGCCTCAAAAATGGCTCTTGAGGCTGGATTTTTGGG

TATTGGAATGTTGGTAGCAATTGGTATAAAAGGCCATTTGTATTTCACTTTTTT

GTCCTTCATACTTTACTCTTCTCAACTTTGGAAACTTCAATAAATCATCATG

TPH (SEQ ID NO 136); RPPA06839 - Pichia pastoris (IG-66), Contigi 564 2883 2026

TTCAACGAGACACTCTTCCGTCAGTTCCAAAACCATAAGTTTGCCGATGTGTT

GGTCCTTGTAACGCATGGAATTTGGGCCAGGGTATTTTTGATGAAATGGTTCA

GATGGTCTGTGGAGGAGTTTGAAGGCTTACGAAATATACCACATTGCCAGTT

TATACAGATGGTTAAGGGTGAAAATCAACGTTACACCTTGACGACCCCATTA

TTACGATGGCGTGAAGGAGATGAAGACCGGGTAGAAGAAATAAGAAAAGCG

GTACAGTTTAGGTCCGGAGATCTAGGGAAGGAGGCCTTAGCTTATATTGTAG

CTGCTGAGAGAGAGGCAGCTGCTGGAAGATCTGAAGGCCCTATCACGTATGA

TGATGGTGATGACCATTAGAGAACGCCCAGAGATTGATAGCCAGTTCTTGGA

CAACAATTCGGAACTTTATTCACGGTGCAAACATGATTTGTGTGGATAGCTTC

AAGTCAGACATTTCATCTCATCCCCCCTTTTACTGCTGCTAATCACCGTTAGT

CCGACAGTTACTCTAATCAATATTTATTAGTGTTTTAGTTGCGCAAAACTCGA

GCCTCTTTTCCTTATCTCTTGACACTTCCTGGAGTCGAAGTTTTTCAGCGCAA

ATTCACTCTACAATGTCTACCGATACTAGACCGCCTATCTTCCCCCTCTAAAT

AGCCTATTGGAAGGGTGCAATAAGGTATATAAATCTGGCGCGATTCCCCCGG

ACTTTTATGATCCACATCACCTCATCTTACTGCCCTCACTCTCTTTCCTGATCC

TCCCAGGTCCACCGATTTCCTCACTATCGTCGGATTTCTCCTTCCAGCGCCCT

AGAGAATTCCGTAACCACCGCAAAAATAGCAGCCCCCCCCTCACCCATTTTT

TTATTTAAAAGAACACCTTACTGGCCCGTTTTCGTTTCTCCTTTACTACAATTG

ATTTTTAATTTTCAGTTTTTTTTCATTGATATACAAGATCTATCACAAACACAA

TG

UBI4 (SEQ ID NO 137); RPPA07243 - Pichia pastoris (IG-66), Contig 1945_6169_6092, Contig 1945_6013 4858

AGAAGATTACCATAAATTGAGAACTCAAATGACCAAGCCAGGAGTCAAGAA

GAATTCTCAAAAGGGATACGCAAATATTGATTCGGATGACGAAGAGCTGACT

GAGGCTCTCGATTTCAAATTGGCACCAAGGTCAACAAAAAGTCTAGACATAA

CACGGTCTGGACTACCGCAACAATATAATTTTGAAAATGTTCTTAGTAGCGA

CGAAGAAGATGGAGGCGGAGAGTTTCAGGGTGATTCTGTATCCCCAATTAAA

AACAGTTTTCATTCAAGTCCAAGGAAGCAGCGATCTCTACTCCACGTCATAG

ACGAAGGAAGTCCATCTATTGGCCCAATTTCTGTTCAACTGAAAAGAAGTTT

GACTTCTCCCACAAAGAGACACAAGAATGCTCAGGAAGTGATAAGCCAACC

AGAAAGCATGACATCCTCAAACTTGTTTTCGCCTACACATTCCTCCGCCTCAA

CTACATCAATAATATCTCCCAAACCCCAAGCGGCAATGATCCCAAGTCTTAA

CCCAAATAACCCTTTTTATACTGATACTAATAGCTAATATGTAATAATGATTA

ATAAATGCACCACTTTAATTTCTTTATCAAGAATAGTTTTACTATTTCTCTTTC

GGGCATATACTTGACCCGCTCCATATATTTCACCTTTATCGTGACATTATCTG

CTGCACGACCCGGAATTACTTTAAACTTCTAAGAATTTCCAAAATAGGAATC

GGGATGCCGTATTTTCCAGTCTGCTGGACAGAACTTGTGCTCTCTAAAATGAA

TTGTAGTGGTCCTCAAAGGCTACTGCTACTCACAGTCTTTACTACTCCAGATT

GATCCATTCCTTGGCCCATGCACAGTAAATTAGACTACTGATCTGATGTTGTA

TACTTTTGAGCTGATTGATTACAGTTAACCTGGTCTTATCTAATCTCTCGTTTC

TTTTTATCTCTATTCCCATTTCTGTACTCTTTTGTTCACGTGACGTTTGTCGCTT

TATG

ENOI (SEQ ID NO 138); RPPA05175 - Pichia pastoris (IG-66), Contig 1903 1777_3087

ATGAAAGAGTGAGAGGAAAGTACCTGGGCAAAATCACACAATTCCAAACCA

TGCTAAATGAGATTTAAAGAACAAACGATGGCAAAAGGCAACCGTTATAAAT

GTGATCTTTCTTGGCAGTTATCTGTCAATTTTTCTAAGGAACAGTGAATTCAT

CATAGGAGAGATGTTATACGTTACATAATCATACATACTGCATGTATCTCACC

TACTTTACCTCATCAACTCTAAAACAGTTCTAGTCCCAACCCCAGATTCCTAG

TCATGACACAAGTCCGCACCGGACAGGACTCACAACCAGCAAGAGAAGCTA

ACAAATTTACGCCCCGGTAAAACATTCTTTAGGGGCCGTTCAATGGTAATTTT

CCTCTCACCCGTTTAAACTTACCTCCGGGCGGTATCTTCAATAACCTCTGTTG

TCCCCGGGTATCATTGGAAACAGTGAGGGACGTTGAACAGAAGAGAGGATC

ACCGTAAATTTGCCTTGCAATTGGCCCTAACCACGGATGGTTAACTTCAAGC

CATCACGACAGCAATTGAGTCGGCGCATAGCTACCCTCCTCTTCTTGACCCC

ATGCATAGGACCAACCTTAACCGATGGAACAGGTTCCTCCGCTCCGTCCCCT

GGTAGTGTCTCTGCGCAAGAAATAGTTAAGGTATGAAGACTGATCTCTCGCA

CCCCCCTCACAGTACTGTTATGGTGAATTGACAAAGCCATTGGCTAGATTGA

AACATGTAATTCATATGTAATCTTGTTCAATTAACGAGCTTCGTACAGTCTCA

ATCTAGACGTCTGATAATGGCGTTTGTGCTCCTAATCGATGAGCCATCTCATG

TGACGTCTATACGCTTCGATGGCTTCCGTCGCGAATATAGAACCACTTGAAA

TATGCTGCAAACCACGATCCACCCTGGTCCTGAAAAGATATAAATACAGCAC

ATCTAGCAGGCTTTTGTCTTCTTGGTTGAAACACACAATTATAACAATCTACA

TCTAAAAATG

RPS7A (SEQ ID NO 139); RPPA04215 - Pichia pastoris (IG-66), Contig1695 3185_3751

CCTATGTAAGCCTTCTCAGCATAGTCAACAAAGAAACCTGTTTCGACGACTC CTACCAGTCGTATCAGCTTTTCATCCAGCTCTTTAACCTTATTTAACGGAATC TCTCCAAAATCAGCATCAATGACGAAGTTACCATTATCTGTGACCACTGGAC CAGCTTTCGCCTTTCCAGCATCTCTCAGAGTGGCGCTGTTGGCTCCTAGTTTG AAAAGATCTTTCAATACTTTAACATAACTGTTGGGTACAACCTCAATGGGAA

CGCCTTTGCGCCAGAAATGACCCAGTCTTTCTGGAGACTGTTTACGAAAATCT

GCCACCACAACGAATTTACGAGACAATGATGCCACCAACTTTTCTTGATAAA

GCGCTGCCCCTCCTCCCTTTATCAAATTCAAGTTGGTGTCAATTTCATCAGCT

CCATCAAAGGCTACGTCTATCTCATCGAACTCCTCCACATTTCCTAAACGAA

GGCCATTGTCCAAGATAAGTTGCTTTGATTGAAAGCTGGTAGGGATACAAAT

AAACTTTTCTTTATTTTCCAACTGCCCCAAGCGTTCGGCGACATAGACAACGG

TGGATCCAGACCCAACTCCTATAATTCTGTCAGACTTCGAAACGTTTTCATTG

ACTGCTGCGAAAGCAGCCAATTTTTTTGACTTCTCAATTAGTTCCTCAGACAT

TAGTTGTCTAAGATGCTAGAGGCAAGACGTGAATAAGTACCCACAAAGGACG

GTCTAATTTCCCTTACATTCATCAATAGGCTATTTCAAGCATGAACTTACATA

AGCGTATTGTGCATTAGATATTAACCTCTTAGCATCCCAACACCGTACTGGGC

TTCGACCCTAACCACATCCCCTACATCACATGACTGTAATTTTTTTTGTTACT

ACTCTTCTCTTGGGACTGTTGTGCGCGAATCCAGTTTTTCCTCCAGAGCCTTA

ACAATCTTTTTTCAATTCTGTCGAAACTCACACGGTAACAGCAGTATCACAA

GATG

RPL1 (SEQ ID NO 140); RPPA10390 - Pichia pastoris (IG-66), Contigi 965 1563_2216

CATAAGTCCTAGAACACCACTTGTTAGTAAAACCGGTAAAATTCCCCATTTTC

TCATATTCAACTTACAATGCTCAAAAGATGATCTTTGAAGGCCATCCCAGTG

GTGTTAGCCATCCTATGGAAGGGTTTGTTAGTAATTGCTGATCCAAAACCTGA

AGACTGAAAAAAAGGCTCGCACATGTCACTCATTACCACATCCACAGGGAAT

TGCTGCTTCTGCTTGTTACTTATCTCGAGGTTATCGTCTACTTCAGCCTGAAA

GTAGGAAGGTCCATCCGCTATTGGTTCAATTTCTTGAAGCTTTCTAAGGTCTT

TGTCCATGTTCAACTGGTAAAAATGAGTCTTTATCATCTCATGCGTGGTCTTA

CTCAAGATGTTGGCCTGCATTGAAGACACCCCTTTTGGAGGCTCACAGGCTA

GGATATCTATCCCAAGAATATTGCCTCTGGAAGTCACTTTATCGGCTGCAACT

TGGGACCATGCACCAGGAGCAAACCCCAAATCCACGACATTTTGCCCTGGAC

GGAAGATATTGAACTTCTCATGGATTTGGAGCAGCTTAAACGCTGCCCTTGAT

TTCAATGCCTTTGCGTGAGCTTCCTTTGTGAAGGGGTCAGATGATTGACGTTC

CAGCCATTTCCTAGAGGATGAACTCTTTCCTTTTACTTTCAGGGAAACAAACC

TAGCAAACTGCCTCCGTAGCAGCGACACAGAAGGATGCTGTATCATGTGATG

AACGTAGCTAAGCACAACCTGTGTGACAATTCCGTGTCGTTTATCTTATCTTG

CTAGTCTAGATATAAAGTATCACGTGACAAATATTATTTTTGCCCCAACACCG

GCTTGAAACTTAGGGCTCGCGCCCTCGTCGCGAGAATTCTTACCTTTCGTTAA

ATTTTTTTTCAGTGTTCCAACTTTTCCCTTCTCTATCCATCGTCGAATCAAACC

TTTACGGGATTACCTACAACCTTTTTGCTAGGGAGACGAATTAATAGAAAAT

G

TKL1 (SEQ ID NO 141 ); RPPA06932 - Pichia pastoris (IG-66), Contig1351 2056 46

GATATCGATTCCACTGCTCAGAGTCTTTTCATCAAGCTCTAAGAAAAGTCCG

GGGATGGAGGAGCCAGCGCAACAGAAAACTATGTCGGGACTAAAACCAACT

TCCTCTAAAACACGCTCACACTGGTCATATTTGGAGATATCTGCTGCCACGTA

AGAAATTGTGTTGCCCTTCTCGGTCCCGTGAATATCAATAGCGTCCTTTACTA

CCTGCTTTAGTAAGGATTCTGTTCTAGCCACAATCACAACGGAACATCCTTTT

CCATACAAAAGTTTAGCAAACTCGGCTCCAACACCCTGGGATGCACCCGTAA

TAATTGCCTTTTTGTTGGTGACATCAAAATTATCAAACATTCCAGTTCCCTCT

ACACTTTGTATGAGAATGATAGCTGAAATTGTGCACCAGATGTTAGAAGATA

AGGTCGTGTCATGAACTAATATCATGAATTCCGAGGGTGGCTCAACAACTAT

TCACGTGACTTGGACGTTGGAAGTTGAGGTGGTTGGTGGATGTTGCACGGAG

TATCATTTGTAAGCATGAAATCAGTCTAAAAAACTTGCAGAATAGCAGAGCG

GTTCGGAAATTCATTCAAAACCACCTCCTCAGATTGGATCTGCCCTACTCTGT

TTAGCTCTGGGAGATTTTCTCGGTCGTGTTCTTTCGCTGGTCTACCCACGCTA

TAGGAATCGCTGTGAACGCTACCTTCTTCCCAACTTCTCGGTGACTATTATAA

GCCATTCCCACTTTGTTTTCAAGCACCAACAACCCACCCCCACCTTATCTACT

CCATCTTGGGTGTCCCCGCGCCTGTTGCAAAGTCCGAACCATAGAACCCCCG

ACCTTTGTCCCACTAACCCTCAGACACCCCTCGGAAGTCAGGGAGAAACCAC

TCCGAAGTACATTAATCATCCCTCGTATTCTCGACGGTGCCCATTTTCTTTAT

AAAAAGGGAGACACAGGTTGCTTCACTAACTCTAGACTTGTATTCTACATCC

ACTCTACACAATG

PIS1 (SEQ ID NO 142); RPPA06550 - Pichia pastoris (IG-66), Contig2109_25041 25727

TTTTTCCTCTTCGTTGTGTGGTAAACTCGGTAACGAGGCTAAAAGTTTTTGCA

AATTTGAGTCTTCATTCAAATCTAAGAACTCTGAATAAATCTCAAACACTTCT

TTCATTTCGATCAATAAGTGTAGTAGTTTCTGTCTATAAATTTCTTCGGTGTTT

GTTCTTTCTAGAGCGTATTGTATACTAACAAAGTTGGATACGACCAAAGTTTG

GAGCTCATTGTCCATTTTGTGTAATCCTTCATCGTCATCAAGCAGAATTCTTA

GCTCCTTTTGGATCTTGGTTGTAAAAGGTCTCAACGTTGTAGTATTGTTCATA

AGAAGCTTCTGTAACACAAAATTAGCTGGCTTGGGGCAAAGATACATCACGC

TAACAAGCGATTGAATAATTGACGGAAGCCTTGGTGTCAGTATTTCTCTGGTA

AGTGCAACCTTATGCCTTATTTTGTCCATAATAAAATCTAGAGTGGATATAGC

TGATTCCAAACAGGTCATCTGTCGTACATCAATGGTTGATGTGATGTAACACT

TCGACTCAATAATCTTGGTCAAAGCACTAATATAGTTACCAGCGTCCGATGC

AATAATCCTTGGGTGAAGGCAAAGGACGTGGACAAGTTTTGTGCCAAACCAA

CGGACTTCAGATTGATTTGATCTAAGATAATTTGCAGCACGAGAGGTAACAT

GCGTCAAGTCGGCTTTTGAAGCTTCATTAACGGCGTTCTCATCAGCAAGGAT

GGGGAGTATTTCCATCAGATCCGCCTCAACTGAATTATCCTTCAAATGCGGA

ATGACAACGTCGAGTCCAACAAATCCTTGATATCCCATCCTGGAAGTCCAAT

TGTATGATCTAAAGAGATTACTTCATTCTATTTTTTTTTTTTTTTTATTGGAATT

TTTCAGCCCAAGGCTCTCTCTATCAGTGTACCGACCAATACGTATCAGATAC

AGATAATTTTGAGCTCAGGCAACATGGAAATTTACTTGTCTCTAGATAGTCAT

G

FET3 (SEQ ID NO 143); RPPA06678 - Pichia pastoris (IG-66), Contig1928 10057 1 1658

TGGGCTTTCGTACCAAATGGAAAAATCACGTACAAGTATGCCCAGAGCTAAG

CTAATCGGATGGCAAGTAGAAAGTGGATGGGTTTCACAGAACAATAAAGAAT

GAGTGACGGATATTATTGGCTGGCAGGCATTAAAGATGCATAATTTAAGCTT

TCTGTTTTCTACTTTTGGAATTGTCACAAATTTGAACTGTGGATGTTATTGAAA

CACAGACCCGTATAAATACCTCTTGAGAGAAATTTGAAAGTGAAGCTATTTC

AGTGAATTAATCACTCGCCATACACGAGGTAGATAATTCACGTAGACGAATT

TCTTTTGATCCATTTTATTCAGGGTGGACAGTCAGAAGTGTTCGTTCACCTGA

TATGTTCTAGATGCAGCTCGAAACGCTGTAAAAAAAAAAAGTCCCAAAAGTC

ACGTGCATAAAGGTGTAGTTCAATTTAATGGAGATAACATAACATCTATGAC

TCCTTTCATGAATCCATCTCAAAAACACAAACTTTGCTAGAATATCTGGTGGC

ACCGATTTTTCATCATTTCACGAGTTTATATAGCATATGCGCCAACAGAACGT

TGCCTGACACAATGTTAAGGCTTTTAAATTTTGCTTGTGTAGTAAAAAGTTAG

TAGTGTGTTACTCGATATCATATTTCTATCAGAAGTGGAATATTCTAATCTCT

CCTCTACCTTTGTTACAATCCGTTTCGAACAGAAAAAAGAATTTATGATGATT

TTATGGAGAATCATTCCAATAATAGCATTGCTAATTAGATTGACGGTAGCAA

AGACTCACAAGTTCAATTTGACAGCTTCTTGGGTGAAAGCAAATCCAGATGG

TGTTTTTGAGAGAGATGTCATTGGACTTAACGGACAGTGGCCTCTACCAGTTC

TAAGGGTGAATCAAGGAGACAGGATCGAACTGTTGTTGACGAATGGTCTTGG

CAATGCGAATACATCTTTGCATTTCCACGGCCTGTTCCAAAGAGGATCTAGTT

TTATG

FTR1 (SEQ ID NO 144); RPPA04329 - Pichia pastoris (IG-66), Contigi 928_8701 7604

TCGAGTAACACACTACTAACTTTTTACTACACAAGCAAAATTTAAAAGCCTT

AACATTGTGTCAGGCAACGTTCTGTTGGCGCATATGCTATATAAACTCGTGAA

ATGATGAAAAATCGGTGCCACCAGATATTCTAGCAAAGTTTGTGTTTTTGAGA

TGGATTCATGAAAGGAGTCATAGATGTTATGTTATCTCCATTAAATTGAACTA

CACCTTTATGCACGTGACTTTTGGGACTTTTTTTTTTTACAGCGTTTCGAGCTG

CATCTAGAACATATCAGGTGAACGAACACTTCTGACTGTCCACCCTGAATAA

AATGGATCAAAAGAAATTCGTCTACGTGAATTATCTACCTCGTGTATGGCGA

GTGATTAATTCACTGAAATAGCTTCACTTTCAAATTTCTCTCAAGAGGTATTT

ATACGGGTCTGTGTTTCAATAACATCCACAGTTCAAATTTGTGACAATTCCAA

AAGTAGAAAACAGAAAGCTTAAATTATGCATCTTTAATGCCTGCCAGCCAAT

AATATCCGTCACTCATTCTTTATTGTTCTGTGAAACCCATCCACTTTCTACTTG

CCATCCGATTAGCTTAGCTCTGGGCATACTTGTACGTGATTTTTCCATTTGGT

ACGAAAGCCCACGAACCACACGACTGTACTTGACTTTTGGGGTCGTTAATGT

GCACAGCCCAGAGATGATCTGATAATTAATATCATTTCGCACCACTGTTTAA

AAAATTCGATAATTTGTTACTAAAATGCTATTTTTGATGCAATGCGTGCATGT

TCATGCACCAGTAGATAATAATCTTAAATTTACAATATAGAAGCTAGTTTCTT

AAAGTTTTATTGGCTTATGTGTTTTAGGGAGAAAAGTTTCAGAGCTATATAAA

CTCGATCCTTCCTTCCACAAATCTCGCCTTCAAGTCATTTCTCGAACTTCCTT

CCAATAGCAGTGCCACTCAACGTTCAGTGGTCGTTGTAGTTCTTTTCAAAATG

NMT1 (SEQ ID NO 145); RPPA07685 - Pichia pastoris (IG-66), Contig2096 16323 14941

TCGACACACAGAGGAAACGAGGGTAACATGATATCATGATGTACGTACAAA

CTAGGATCTCTGGTCATTCCACGGGCCACATCAGGATCGAATTTGTCGCCGT

CCTCTTCTTCAGCGCCATCAGGGGCTCCAGCACCGTCGTCGTATACGTAAAC

ATCTAAAAAGGAGATATCGTCTTCAGTTCTGGTGGAAAGGATCATATTGTCTG

TGGGGTATACTTGCAACTCTTGTTTCTCTTCTAAAATTTCACGTTCCGTTGGA

AGAGTGATGTATGCGTCACCATCTGCGTTGTTCTCATCTTCGTGGTAACGGGC

ACCCGTACTTGATAACCCAGGTAGCATCGAGACTTTGAGGCCATTGGCAACA

GAATCCTCATACGTTTCGTCATCATCGTAATGTTCTAAATCATATTCCTTTAG

ATCATCATCAATGTCATCTTGAGACTGTAGTTTGCTAGCTATAGATTTGTTGA

TGTTGGCTTTCATTTCTGAAATATCAGAATCATCTTTCTGTGTTTCGGCCAGCT

CATCCTGGGCATCTTTCAGTTTCAAATTGGCCATGGCTTCAATTCTCTCCATC

TCAACATCGTTCATTTCATATTTCTCGGGAAACTCCGAGGGGAAGCCTCTGG

GAACCCATGTTGTAGACGAAATCATAAGTGTACAATACTGTCTCTTTAAACTG

AAAAAAAAAATTTGAGAAGAAAAAAAAAATTGCTGGTCACATGATAAGGTG

GATCCCGTAATCAAGCATTAACCTGGTTCAGGTTCGTGGATTGTTTAGGCTTT

ATAAAATTGACTGGTAGGTCCCCAGTTTCAAGTTTTCTTTAGTAATCTACTCG

CACAAATTATTGTTGGGCATCTCGTCCTATCAGATTCATATAGATCTATAATT

GATTCCCCTCCTGAAACACTTTCAACCAACATCTGAGGGTTCTCGAAAATTC

AGTAAGATTGACTCCCTTCTCATCCAGGCTTGGTGACCATCAAAATCACCAA

TCATG

PHO8 (SEQ ID NO 146); RPPA04253 - Pichia pastoris (IG-66), Contig 1417 1294 2871

GGTATAAGTATAGCACATGTTGACGCTGATATAATCATCCCAAAAAAGTGTG

CGGTTTATTTAGAAAGGAGAAGAGGAACTCATGAAATTTTTGAAGGAACGCA

CCCATTCTAGGTGATGTACGGGTGTCTTTTGATAAGTAGTGGATAGGTTTTAT

TCACTGGACTATATGGTACTGTACAAAAGAAGTTCCTCTTCAGATGGTCTGTA

GCTTGCATGCAATGTTCTCTCTAAGGAATGTGTATTTGAAAGGTCATGACTTA

CCATAACCAGTCGCAAAATGCTTTTGTTCCAGGTTCTCTTCCTGAGGAACGTT

TTGACGGGGTATACATACCTTATCGAGATCAATTTTAGGAATAGATACTTTCT

CTCCAACCGTTAAAAATTTGGAAATCTTCACTTTGGCAGGTTTTAACTCGTCA

AGAGACTTGACGTCTAGGATCTTATACTTGCTTATGTCGTTCTTGGTGTCCTG

ATCGAGAGAGTCTTCCCTGACTTGGTATTTCTTACCTTCGTGCTTAAAACTAT

GAGCCGTTTTGTCCTTAATGGGCATGGGAAGAGTTTTCAATGTGCTTAAATCC

AAATCTTTGGGAGTTTTCAACAGCCATACTTCTTTCTTTCCCAAATCCTTGAA

CGGAGATATTTCTTTGAGGTCTGTGCGCTGTTTGAAATTAGCAGGTGGTTCAA

AAGTGAAATCCTCTGAATCACTGCTTTCAGAGTTGTCGGAGCTAATATCGGA

CTCTGATATAAACTCCCTTGGATATTCTTTTCATTTCAGAGACACCCTAATAG

TTGTAGGAAAGATTAAACCAAGTGAAAAAAAAAATTCAATCGCGATGGTAGA

GTCTGGAAGGAGATGTTCTCCTTCTGGCTCACATCTCGCTAAGTGGACATTCT

TCACGGAACTACGCACAAATCTATGCCCCTTTCAGGATCTCGCGCAATCTGC

ATTGATATGTGATATTTGTTTTAGCGTCCTCTCCCCTTCAATTTCAAAGCAAT

G

FET3pre (SEQ ID NO 147); RPPA06003 - Pichia pastoris (IG-66), Contig2108 1359 14

TTGTGGTCCTATGAATTAACCATTTAAAAATATTGCTATTAATGATTATGTTA

GGTGGATTAATGTTGTCATTAGTACATATATTCGTATTTTAACTTAATTACTCA

TTAATTAAGTGTAGAATGTCATTTCACTGGATTTAACTTGCTTTGTAAGTACG

AAACAATGCTCGATCAATAATCAAACAGTTTAGTTCAGAGATTAGATTCAAA

TATAATTCTAAATTTATAGTATCAGAAATATATATTAAGGACAGTGTTAATTT

TACCTCATTTAAACTGCTGCTAGCCATTAAACGATTGTGCTTTTTTAGTCCAT

GTTTAAGAACAAATGACCATTTGCTAAACCTACTGTAATCTTATTATAAAAAT

GAAGCCTACAAATCATTTCTTACAAGCAAATGATTTTACAGCCCCTTGTGCTG

AATGTATAGTACACATTGGACGAGGAGAAACGATCTAAGTGGCCACTCGTAG

GTGTTAATGATACTATTCGCTCGAACTTAACGGTGCTTAGGATTCTACTTAAT

CCTTGCACGTGACAAGGCTTAAATTAAGCTTGTTTTCTGATGTGTGTCAAAAA

AATTTTAGCTTGACATATATCGAACGATACGGCGTATTCTTAGGCAACAATTT

CAACTTCTTCGCTTATTTGCAATACTGGTATCCACGGGGTAATGCCAAACTTA

ACAAACAGTAATGGATTTGAAGCTAGCATGCTGTATTTCAAGGAGCGGAGAA

GCAAAGATAGCCCCCCTTAGATTAAATGCATGTTTAAGCTTTCGAGGCACTTT

TCTGATTAATGCATAAGCCTGCAGGAAAGGATTCCACTTATTTAGTAAAATTA

TCAGTTCAATCGCCAAATAAAGGCACAACAAGAAAAAGAAACAAAACAATA

AAATTTCAACCTTCTTTCAGCGTATATAAAAGAAATTTACTGCCCACTTCCTC

GAAGTTTTCCTTTCCCCTAGCTGTAACCACTGCCGCCAACACTCCATCGAAG

AATG

References

Archer, D., Jeenes, D. and Mackenzie, D. ( 1994). Strategies for improving heterologous protein production from filamentous fungi. Antonie Van Leeuwenhoek 65, 245-50.

Gasser, B., Maurer, M., Gach, J., Kunert, R. and Mattanovich, D. (2006).

Engineering of Pichia pastoris for improved production of antibody fragments. Biotechnol Bioeng 94, 353-61 .

Gething, M. and Sambrook, J. (1992). Protein folding in the cell. Nature 355, 33-45.

Hohenblum, H., et al. (2003) Assessing viability and cell-associated product of recombinant protein producing Pichia pastoris with flow cytometry. J

Biotechnol 102, 281 -290

Kurtzman CP. (2005). Description of Komagataella phaffii sp. nov. and the transfer of Pichia pseudopastoris to the methylotrophic yeast genus

Komagataella. lnt J Syst Evol Microbiol 55, 973-976. Lang, C. and Looman, A. ( 1995). Efficient expression and secretion of

Aspergillus niger RH5344 polygalacturonase in Saccharomyces cerevisiae. Appl Microbiol Biotechnol 44, 147-56. Macauley-Patrick, S., Fazenda, M. L., McNeil, B. and Harvey, L. M. (2005).

Heterologous protein production using the Pichia pastoris expression system. Yeast 22, 249-70. Mattanovich, D., Gasser, B., Hohenblum, H. and Sauer, M. (2004). Stress in recombinant protein producing yeasts. J Biotechnol 1 13, 121 -35. Mori, K., Ogawa, N., Kawahara, T., Yanagi, H. and Yura, T. (2000). mRNA splicing-mediated C-terminal replacement of transcription factor Had p is required for efficient activation of the unfolded protein response. Proc

Natl Acad Sci U S A 97, 4660-5.

Porro, D., Sauer, M., Branduardi, P. and Mattanovich, D. (2005). Recombinant protein production in yeasts. MoI Biotechnol 31 , 245-59.

Punt, P. J., van Biezen, N., Conesa, A., Albers, A., Mangnus, J. and van den

Hondel, C. (2002). Filamentous fungi as cell factories for heterologous protein production. Trends Biotechnol 20, 200-6.

Sauer, M., Branduardi, P., Gasser, B., VaIIi, M., Maurer, M., Porro, D. and Mattanovich, D. (2004). Differential gene expression in recombinant

Pichia pastoris analysed by heterologous DNA microarray hybridisation.

Microb Cell Fact 3, 1 7.

Shuster, J. (1991 ). Gene expression in yeast: protein secretion. Curr Opin

Biotechnol 2, 685-90. Stryer, L. (1995). Biochemie. Spektrum der Wissenschaft Verlags GmbH.