RECOMBINANT MICRO-ORGANISM FOR USE IN METHOD WITH INCREASED PRODUCT YIELD

Title:

RECOMBINANT MICRO-ORGANISM FOR USE IN METHOD WITH INCREASED PRODUCT YIELD

Document Type and Number:

WIPO Patent Application WO/2014/129898

Kind Code:

Abstract:

The invention relates to a recombinant yeast cell, in particular a transgenic yeast cell, functionally expressing one or more recombinant, in particular heterologous, nucleic acid sequences encoding ribulose-1,5-biphosphate carboxylase oxygenase (Rubisco) and phosphoribulokinase (PRK). The invention further relates to the use of carbon dioxide as an electron acceptor in a recombinant chemotrophic micro-organism, in particular a eukaryotic micro-organism.

Inventors:

VAN MARIS ANTONIUS JEROEN ADRIAAN (NL)
PRONK JACOBUS THOMAS (NL)
GUADALUPE MEDINA VICTOR GABRIEL (NL)
WISSELINK HENDRIK WOUTER (NL)

Application Number:

PCT/NL2014/050106

Publication Date:

August 28, 2014

Filing Date:

February 21, 2014

Export Citation:

Click for automatic bibliography generation Help

Assignee:

UNIV DELFT TECH (NL)

International Classes:

C12N1/16

Domestic Patent References:

WO2008028019A1	2008-03-06
WO2011010923A1	2011-01-27
WO2009013159A2	2009-01-29

Foreign References:

US20120064622A1

2012-03-15

Other References:

NISSEN ET AL., YEAST, vol. 16, 2000, pages 463 - 474
SAMBROOK ET AL.: "Molecular Cloning-A Laboratory Manual", vol. 1-3, 1989
KRUSKAL, J. B.: "Time warps, string edits and macromolecules: the theory and practice of sequence comparison", 1983, ADDISON WESLEY, article "An overview of sequence comparison", pages: 1 - 44
NEEDLEMAN, S. B.; WUNSCH, C. D., J. MOL. BIOL., vol. 48, 1970, pages 443 - 453
RICE,P.; LONGDEN,!.; BLEASBY,A.: "EMBOSS: The European Molecular Biology Open Software Suite", TRENDS IN GENETICS, vol. 16, no. 6, 2000, pages 276 - 277, XP004200114, Retrieved from the Internet DOI: doi:10.1016/S0168-9525(00)02024-2
SHARP; LI, NUCLEIC ACIDS RESEARCH, vol. 15, 1987, pages 1281 - 1295
JANSEN ET AL., NUCLEIC ACIDS RES., vol. 31, no. 8, 2003, pages 2242 - 51
HUGO YEBENES ET AL.: "Chaperonins: two rings for folding", TRENDS IN BIOCHEMICAL SCIENCES, vol. 36, no. 8, August 2011 (2011-08-01), XP028258831, DOI: doi:10.1016/j.tibs.2011.05.003
ZEILSTRA-RYALLS J; FAYET O; GEORGOPOULOS C: "The universally conserved GroE (Hsp60) chaperonins", ANNU REV MICROBIOL., vol. 45, 1991, pages 301 - 25
HORWICH AL; FENTON WA; CHAPMAN E; FARR GW: "Two Families of Chaperonin: Physiology and Mechanism", ANNU REV CELL DEV BIOL., vol. 23, 2007, pages 115 - 45
GIETZ ET AL.: "Method in Yeast Protocol", 2006, HUMANA PRESS, article "Yeast Transformation by the LiAc/SS Carrier DNA/PEG"
ZERBINO ET AL.: "Velvet: Algorithms for De Novo Short Read Assembly Using De Bruijn Graphs", GENOME RESEARCH, 2008
GÜLDENER ET AL.: "A second set of loxP marker cassettes for Cre-mediated multiple gene knockouts in budding yeast", NUCLEIC ACIDS RESEARCH, 2002
BASSO ET AL.: "Engineering topology and kinetics of sucrose metabolism in Saccharomyces cerevisiae for improved ethanol yield", METABOLIC ENGINEERING, vol. 13, 2011, pages 694 - 703
MASHEGO ET AL.: "Critical evaluation of sampling techniques for residual glucose determination in carbon-limited chemostat culture of Saccharomyces cerevisiae", BIOTECHNOLOGY AND BIOENGINEERING, vol. 83, 2003, pages 395 - 399, XP055211574, DOI: doi:10.1002/bit.10683
GUADALUPE MEDINA ET AL.: "Elimination of glycerol production in anaerobic cultures of a Saccharomyces cerevisiae strain engineered to use acetic acid as an electron acceptor", APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 76, 2010, pages 190 - 195, XP002603125, DOI: doi:10.1128/AEM.01772-09
GUADALUPE MEDINA ET AL.: "Elimination of glycerol production in anaerobic cultures of a Saccharomyces cerevisiae strain engineered to use acetic acid as an electron acceptor", APPL. ENVIRON. MICROBIOL., vol. 76, 2010, pages 190 - 195, XP002603125, DOI: doi:10.1128/AEM.01772-09
ABBOTT ET AL.: "Catalase Overexpression reduces lactic acid-induced oxidative stress in Saccharomyces cerevisiae", APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 75, 2009, pages 2320 - 2325
MACELROY ET AL.: "Properties of Phosphoribulokinase from Thiobacillus neapolitanus", JOURNAL OF BACTERIOLOGY, vol. 112, 1972, pages 532 - 538
"Protein measurement with the Folin phenol reagent", THE JOURNAL OF BIOLOGICAL CHEMISTRY, vol. 193, 1951, pages 265 - 275
ABBOTT, D. A. ET AL.: "Catalase overexpression reduces lactic acid-induced oxidative stress in Saccharomyces cerevisiae", APPL. ENVIRON. MICROBIOL., vol. 75, 2009, pages 2320 - 2325
BEUDEKER ET AL.: "Relations between d-ribulose-1,5- biphosphate carboxylase, carboxysomes and C02 fixing capacity in the obligate chemolithotroph Thiobacillus neapolitanus grown under different limitations in the chemostat", ARCHIVES OF MICROBIOLOGY, vol. 124, 1980, pages 185 - 189
LOWRY, O. H.; ROSEBROUGH, N. J.; FARR, A. L.; RANDALL, R. J.: "Protein measurement with the Folin phenol reagent", J. BIOL. CHEM., vol. 193, 1951, pages 265 - 275, XP000196391

Attorney, Agent or Firm:

JANSEN, C.M. (Johan de Wittlaan 7, JR Den Haag, NL)

Download PDF:

View/Download PDF PDF Help

Claims:

Claims

1. A recombinant yeast cell, in particular a transgenic yeast cell, functionally expressing one or more recombinant, in particular heterologous, nucleic acid sequences encoding ribulose-l,5-biphosphate carboxylase oxygenase (Rubisco) and phosphoribulokinase (PRK).

2. A recombinant yeast cell according to claim 1, wherein said yeast cell further comprises one or more prokaryotic molecular chaperones, said chaperones preferably originating from a bacterium, more preferably from Escherichia coli (E. coli .

3. A recombinant yeast cell according to claim 1 or 2, wherein said chaperones are selected from the group of GroEL, GroES, functional homologues of GroEL and functional homologues of GroES.

4. A recombinant yeast cell according to any one of the preceding claims, wherein said Rubisco is a single subunit Rubisco.

5. A recombinant yeast cell according to any one of the preceding claims, wherein said Rubisco is a prokaryotic form-II Rubisco.

6. A recombinant yeast cell according to any one of the preceding claims, wherein said yeast cell is selected from the group of Saccharomyceraceae, such as Saccharomyces cerevisiae, Saccharomyces pastorianus, Saccharomyces beticus, Saccharomyces fermentati, Saccharomyces paradoxus, Saccharomyces uvarum and Saccharomyces bay anus; Schizosaccharomyces such as Schizosaccharomyces pombe, Schizosaccharomyces japonicus, Schizosaccharomyces octosporus and Schizosaccharomyces cryophilus; Torulaspora such as Torulaspora delbrueckii; Kluyveromyces such as Kluyveromyces marxianus; Pichia such as Pichia stipitis, Pichia pastoris or Pichia angusta, Zygosaccharomyces such as Zygosaccharomyces bailii; Brettanomyces such as Brettanomyces inter medius, Brettanomyces

bruxellensis, Brettanomyces anomalus, Brettanomyces custersianus, Brettanomyces naardenensis, Brettanomyces nanus, Dekkera bruxellensis and Dekkera anomala; Metschnikowia, Issatchenkia, such as Issatchenkia orientalis, Kloeckera such as Kloeckera apiculata; Aureobasidium such as Aureobasidium pullulans, preferably a Saccharomyces cerevisiae cell.

7. A recombinant yeast cell according to claim 6, wherein the yeast cell is selected from the group of Saccharomyceraceae.

8. A recombinant yeast cell according to claim 7, wherein the yeast cell is selected from the group of Saccharomyces cerevisiae, Saccharomyces pastorianus, Saccharomyces beticus, Saccharomyces fermentati, Saccharomyces paradoxus, Saccharomyces uvarum and Saccharomyces bayanus

9. A yeast cell according to any of the preceding claims, wherein the PRK is a PRK originating from a eukaryote.

10 A yeast cell according to claim 9, wherein the PRK is originating from a plant selected from Caryophyllales , in particular from Amaranthaceae, in particular from Spinacia.

11 A yeast cell according to any of the preceding claims, wherein the Rubisco has an activity, defined by the rate of ribulose-l,5-bisphosphate- dependent ¹⁴C-bicarbonate incorporation by cell extracts, of at least 1 nmol.min- ^x.(mg protein) ( at 30 °C, as determinable by the method referred to in Example 4). 12. One or more vectors for the functional expression of a heterologous polypeptide in a yeast cell, wherein said vector or vectors comprise one or more heterologous nucleic acid sequence encoding Rubisco and PRK, wherein said Rubisco exhibits activity of carbon fixation.

13. A method for preparing an alcohol, organic acid or amino acid, comprising fermenting a carbon source, in particular a carbohydrate with a yeast cell according to any one of claims 1- 11, thereby forming the alcohol, organic acid or amino acid, wherein the yeast cell is present in a reaction medium.

14. A method according to claim 13, wherein the reaction medium comprises carbon dioxide wherein the carbon dioxide concentration in the reaction medium is at least 5 % of the carbon dioxide saturation concentration, in particular at least 10 %, more in particular at least 20 %.

15. Method according to claim 13 or 14, wherein ethanol is formed.

16. Use of carbon dioxide as an electron acceptor in a recombinant chemotrophic micro-organism, in particular a chemotrophic eukaryotic microorganism.

17. Use according to claim 16, wherein the micro-organism comprises - a heterologous nucleic acid sequence encoding a polypeptide from a (naturally) autotrophic organism, which polypeptide is selected from the group consisting of carbonic anhydrases, carboxylases, oxygenases, hydrogenases, dehydrogenases, isomerases, aldolases, transketolases, transaldolases,

phosphatases, epimerases, kinases, carboxykinases, oxidoreductases, aconitases, fumarases, reductases, lactonases, phosphoenolpyruvate (PEP) carboxylases, phosphoglycerate kinases, glyceraldehyde 3-phosphate dehydrogenases, triose phosphate isomerases, fructose- 1,6-bisphosphatases, sedoheptulose-1,7- bisphosphatases, phosphopentose isomerases, phosphopentose epimerase, phosphoribulokinases (PRK), glucose 6-phosphate dehydrogenases, 6- phosphogluconolactonases, 6-phosphogluconate dehydrogenases, ribulose 5- phosphate isomerases, ribulose 5-phosphate 3-epimerases, Ribulose- 1,5- bisphosphate carboxylase oxygenases, lactate dehydrogenases, malate synthases, isocitrate lyases, pyruvate carboxylases, phosphoenolpyruvate carboxykinases, fructose- 1,6-bisphosphatases, phosphoglucoisomerases, glucose-6-phosphatases, hexokinases, glucokinases, phosphofructokinases, pyruvate kinases, succinate dehydrogenases, citrate synthases, isocitrate dehydrogenases, a-ketoglutarate dehydrogenases, succinyl-CoA synthetases, malate dehydrogenases, nucleoside- diphosphate kinases, xylose reductases, xylitol dehydrogenases, xylose isomerases, isoprenoid synthases, and xylonate dehydratases; and

- a heterologous nucleic acid sequence encoding a first prokaryotic chaperone for said polypeptide and preferably a nucleic acid sequence encoding a second prokaryotic chaperone - different from the first - for said polypeptide.

18. Use according to claim 16 or 17, wherein the micro-organism produces an organic compound under anaerobic conditions, and/or wherein the carbon dioxide serves as an electron acceptor in a process with NADH as an electron donor.

19. Recombinant micro-organism, in particular a eukaryotic microorganism, having an enzymatic system comprising one or more recombinant enzymes allowing the micro-organism to use carbon dioxide as an electron acceptor under chemotrophic (non-phototrophic) conditions, in particular a recombinant micro-organism as defined in any of the claims 16- 18.

Description:

Title: RECOMBINANT MICRO-ORGANISM FOR USE IN METHOD WITH INCREASED PRODUCT YIELD

The invention relates to a recombinant micro-organism having the ability to produce a desired fermentation product, to the functional expression of

heterologous peptides in a micro-organism, and to a method for producing a fermentation product wherein said microorganism is used. In a preferred embodiment the micro-organism is a yeast. The invention is further related to a use of CO2 in micro-organisms.

Microbial fermentation processes are applied for industrial production of a broad and rapidly expanding range of chemical compounds from renewable carbohydrate feedstocks.

Especially in anaerobic fermentation processes, redox balancing of the cofactor couple NADH NAD ⁺ can cause important constraints on product yields. This challenge is exemplified by the formation of glycerol as major by-product in the industrial production of - for instance - fuel ethanol by Saccharomyces cerevisiae, a direct consequence of the need to reoxidize NADH formed in biosynthetic reactions.

Ethanol production by Saccharomyces cerevisiae is currently, by volume, the single largest fermentation process in industrial biotechnology, but various other compounds, including other alcohols, carboxylic acids, isoprenoids, amino acids etc, are currently produced in industrial biotechnological processes.

Various approaches have been proposed to improve the fermentative properties of organisms used in industrial biotechnology by genetic modification.

WO 2008/028019 relates to a method for forming fermentation products utilizing a microorganism having at least one heterologous gene sequence, the method comprising the steps of converting at least one carbohydrate to 3 -phosphoglycerate and fixing carbon dioxide, wherein at least one of said steps is catalyzed by at least one exogenous enzyme. Further, it relates to a microorganism for forming

fermentation products through fermentation of at least one sugar, the microorganism comprising at least one heterologous gene sequence encoding at least one enzyme selected from the group consisting of phosphopentose epimerase,

phosphoribulokinase, and ribulose bisphosphate carboxylase. In an example, a yeast is mentioned wherein a heterologous PRK and a heterologous Rubisco gene are incorporated. In an embodiment the yeast is used for ethanol production. The results (Figure 24) show concentrations for transgenic controls and the modified strains. Little difference is noticeable between modified yeast and its corresponding control. No information is apparent regarding product yield, sugar conversion, yeast growth, evaporation rates of ethanol. Thus, it is apparent that results are not conclusive with respect to an improvement in ethanol yield.

Further, WO 2008/028019 is silent on the problem of glycerol side-product formation.

A major challenge relating to the stoichiometry of yeast-based production of ethanol, but also of other compounds, is that substantial amounts of NADH- dependent side-products (in particular glycerol) are generally formed as a by-product, especially under anaerobic and oxygen-limited conditions or under conditions where respiration is otherwise constrained or absent. It has been estimated that, in typical industrial ethanol processes, up to about 4 wt.% of the sugar feedstock is converted into glycerol (Nissen et al. Yeast 16 (2000) 463-474). Under conditions that are ideal for anaerobic growth, the conversion into glycerol may even be higher, up to about 10 %.

Glycerol production under anaerobic conditions is primarily linked to redox metabolism. During anaerobic growth of S. cerevisiae, sugar dissimilation occurs via alcoholic fermentation. In this process, the NADH formed in the glycolytic

glyceraldehyde-3-phosphate dehydrogenase reaction is reoxidized by converting acetaldehyde, formed by decarboxylation of pyruvate to ethanol via NAD ⁺-dependent alcohol dehydrogenase. The fixed stoichiometry of this redox-neutral dissimilatory pathway causes problems when a net reduction of NAD ⁺ to NADH occurs elsewhere in metabolism. Under anaerobic conditions, NADH reoxidation in S. cerevisiae is strictly dependent on reduction of sugar to glycerol. Glycerol formation is initiated by reduction of the glycolytic intermediate dihydroxyacetone phosphate (DHAP) to glycerol 3-phosphate (glycerol- 3P), a reaction catalyzed by NAD ⁺-dependent glycerol 3-phosphate dehydrogenase. Subsequently, the glycerol 3-phosphate formed in this reaction is hydrolysed by glycerol-3-phosphatase to yield glycerol and inorganic phosphate. Consequently, glycerol is a major by-product during anaerobic production of ethanol by S. cerevisiae, which is undesired as it reduces overall conversion of sugar to ethanol. Further, the presence of glycerol in effluents of ethanol production plants may impose costs for waste-water treatment.

In WO 2011/010923, the NADH-related side-product (glycerol) formation in a process for the production of ethanol from a carbohydrate containing feedstock - in particular a carbohydrate feedstock derived from lignocellulosic biomass - glycerol side-production problem is addressed by providing a recombinant yeast cell comprising one or more recombinant nucleic acid sequences encoding an NAD ⁺- dependent acetylating acetaldehyde dehydrogenase (EC 1.2.1.10) activity, said cell either lacking enzymatic activity needed for the NADH-dependent glycerol synthesis or the cell having a reduced enzymatic activity with respect to the NADH-dependent glycerol synthesis compared to its corresponding wild-type yeast cell. A cell is described that is effective in essentially eliminating glycerol production. Also, the cell uses acetate to reoxidise NADH, whereby ethanol yield can be increased if an acetate- containing feedstock is used.

Although the described process in WO 2011/010923 is advantageous, there is a continuing need for alternatives, in particular alternatives that also allow the production of a useful organic compound, such as ethanol, without needing acetate or other organic electron acceptor molecules in order to eliminate or at least reduce NADH-dependent side-product synthesis. It would in particular be desirable to provide a microorganism wherein NADH-dependent side-product synthesis is reduced and which allows increased product yield, also in the absence of acetate.

The inventors realised that it may be possible to reduce or even eliminate NADH-dependent side-product synthesis by functionally expressing a recombinant enzyme in a heterotrophic, chemotrophic microorganism cell, in particular a yeast cell, using carbon dioxide as a substrate.

Accordingly, the present invention relates to the use of carbon dioxide as an electron acceptor in a recombinant chemoheterotrophic micro-organism, in particular a eukaryotic micro-organism. Chemotrophic, (chemo) heterotrophic and autotrophic and other classifications of a microorganism are herein related to the micro-organism before recombination, this organism is herein also referred to as the host. For instance, through recombination as disclosed herein a host micro-organism that is originally (chemo)heterotroph and not autotrophic may become autotrophic after recombination, since applying what is disclosed herein causes that the recombined organism may assimilate carbon dioxide, thus resulting in (partial) (chemo)autotrophy.

Advantageously, the inventors have found a way to incorporate the carbon dioxide as a co-substrate in metabolic engineering of heterotrophic industrial microorganisms that can be used to improve product yields and/or to reduce side- product formation.

In particular, the inventors found it to be possible to reduce or even eliminate NADH-dependent side-product synthesis by functionally expressing at least two recombinant enzyme from two specific groups in a eukaryotic microorganism, in particular a yeast cell, wherein one of the enzymes catalysis a reaction wherein carbon dioxide is used and the other uses ATP as a cofactor.

Accordingly, the invention further relates to a recombinant, in a particular transgenic, eukaryotic microorganism, in particular a yeast cell, said microorganism functionally expressing one or more recombinant, in particular heterologous, nucleic acid sequences encoding a ribulose-l,5-biphosphate carboxylase oxygenase (Rubisco) and a phosphoribulokinase (PRK).

A microorganism according to the invention has in particular been found advantageous in that in the presence of Rubisco and the PRK NADH-dependent side- product formation (glycerol) is reduced considerably or essentially completely eliminated and production of the desired product can be increased. It is thought that the carbon dioxide acts as an electron acceptor for NADH whereby less NADH is available for the reaction towards the side-product (such as glycerol).

The invention further relates to a method for preparing an organic compound, in particular an alcohol, organic acid or amino acid, comprising converting a carbon source, in particular a carbohydrate or another organic carbon source using a microorganism, thereby forming the organic compound, wherein the microorganism is a microorganism according to the invention or wherein carbon dioxide is used as an electron acceptor in a recombinant chemotrophic or chemoheterotrophic micro- organism.

The invention further relates to a vector for the functional expression of a heterologous polypeptide in a yeast cell, wherein said vector comprises a heterologous nucleic acid sequence encoding Rubisco and PRK, wherein said Rubisco exhibits activity of carbon fixation. The term "a" or "an" as used herein is defined as "at least one" unless specified otherwise.

When referring to a noun (e.g. a compound, an additive, etc.) in the singular, the plural is meant to be included. Thus, when referring to a specific moiety, e.g. "compound", this means "at least one" of that moiety, e.g. "at least one compound", unless specified otherwise.

The term 'or' as used herein is to be understood as 'and/or'.

When referring to a compound of which several isomers exist (e.g. a D and an L enantiomer), the compound in principle includes all enantiomers, diastereomers and cis/trans isomers of that compound that may be used in the particular method of the invention; in particular when referring to such as compound, it includes the natural isomer (s).

For the purpose of clarity and a concise description features are described herein as part of the same or separate embodiments, however, it will be appreciated that the scope of the invention may include embodiments having combinations of all or some of the features described". In view of this passage it is evident to the skilled reader that the variants of claim 1 as filed may be combined with other features described in the application as filed, in particular with features disclosed in the dependent claims, such claims usually relating to the most preferred embodiments of an invention.

The term 'fermentation', 'fermentative' and the like is used herein in a classical sense, i.e. to indicate that a process is or has been carried out under anaerobic conditions. Anaerobic conditions are herein defined as conditions without any oxygen or in which essentially no oxygen is consumed by the yeast cell, in particular a yeast cell, and usually corresponds to an oxygen consumption of less than 5 mmol/l.h, in particular to an oxygen consumption of less than 2.5 mmol/l.h, or less than 1 mmol/l.h. More preferably 0 mmol/L/h is consumed (i.e. oxygen consumption is not detectable. This usually corresponds to a dissolved oxygen concentration in the culture broth of less than 5 % of air saturation, in particular to a dissolved oxygen concentration of less than 1 % of air saturation, or less than 0.2 % of air saturation.

The term "yeast" or "yeast cell" refers to a phylogenetically diverse group of single-celled fungi, most of which are in the division of Ascomycota and Basidiomycota. The budding yeasts ("true yeasts") are classified in the order

Saccharomycetales, with Saccharomyces cerevisiae as the most well known species.

The term "recombinant (cell)" or "recombinant micro-organism" as used herein, refers to a strain (cell) containing nucleic acid which is the result of one or more genetic modifications using recombinant DNA technique(s) and/or another mutagenic technique(s). In particular a recombinant cell may comprise nucleic acid not present in a corresponding wild-type cell, which nucleic acid has been introduced into that strain (cell) using recombinant DNA techniques (a transgenic cell), or which nucleic acid not present in said wild-type is the result of one or more mutations - for example using recombinant DNA techniques or another mutagenesis technique such as UV-irradiation - in a nucleic acid sequence present in said wild-type (such as a gene encoding a wild-type polypeptide) or wherein the nucleic acid sequence of a gene has been modified to target the polypeptide product (encoding it) towards another cellular compartment. Further, the term "recombinant (cell)" in particular relates to a strain (cell) from which DNA sequences have been removed using recombinant DNA techniques.

The term "transgenic (yeast) cell" as used herein, refers to a strain (cell) containing nucleic acid not naturally occurring in that strain (cell) and which has been introduced into that strain (cell) using recombinant DNA techniques, i.e. a recombinant cell).

The term "mutated" as used herein regarding proteins or polypeptides means that at least one amino acid in the wild-type or naturally occurring protein or polypeptide sequence has been replaced with a different amino acid, inserted or deleted from the sequence via mutagenesis of nucleic acids encoding these amino acids. Mutagenesis is a well-known method in the art, and includes, for example, site- directed mutagenesis by means of PCR or via oligonucleotide-mediated mutagenesis as described in Sambrook et al., Molecular Cloning-A Laboratory Manual, 2nd ed., Vol. 1-3 (1989). The term "mutated" as used herein regarding genes means that at least one nucleotide in the nucleic acid sequence of that gene or a regulatory sequence thereof, has been replaced with a different nucleotide, or has been deleted from the sequence via mutagenesis, resulting in the transcription of a protein sequence with a qualitatively of quantitatively altered function or the knock-out of that gene. The term "gene", as used herein, refers to a nucleic acid sequence containing a template for a nucleic acid polymerase, in eukaryotes, RNA polymerase II. Genes are transcribed into mRNAs that are then translated into protein.

The term "nucleic acid" as used herein, includes reference to a deoxyribonucleotide or ribonucleotide polymer, i.e. a polynucleotide, in either single or double-stranded form, and unless otherwise limited, encompasses known analogues having the essential nature of natural nucleotides in that they hybridize to single- stranded nucleic acids in a manner similar to naturally occurring nucleotides (e. g., peptide nucleic acids). A polynucleotide can be full-length or a subsequence of a native or heterologous structural or regulatory gene. Unless otherwise indicated, the term includes reference to the specified sequence as well as the complementary sequence thereof. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are "polynucleotides" as that term is intended herein. Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as tritylated bases, to name just two examples, are polynucleotides as the term is used herein. It will be appreciated that a great variety of modifications have been made to DNA and RNA that serve many useful purposes known to those of skill in the art. The term polynucleotide as it is employed herein embraces such chemically, enzymatically or metabolically modified forms of polynucleotides, as well as the chemical forms of DNA and RNA characteristic of viruses and cells, including among other things, simple and complex cells.

The terms "polypeptide", "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. The essential nature of such analogues of naturally occurring amino acids is that, when incorporated into a protein, that protein is specifically reactive to antibodies elicited to the same protein but consisting entirely of naturally occurring amino acids. The terms "polypeptide", "peptide" and "protein" are also inclusive of modifications including, but not limited to, glycosylation, lipid

attachment, sulphation, gamma-carboxylation of glutamic acid residues,

hydroxylation and ADP-ribosylation. When an enzyme is mentioned with reference to an enzyme class (EC), the enzyme class is a class wherein the enzyme is classified or may be classified, on the basis of the Enzyme Nomenclature provided by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB), which nomenclature may be found at http://www.chem.qmul.ac.uk/iubmb/enzyme/. Other suitable enzymes that have not (yet) been classified in a specified class but may be classified as such, are meant to be included.

If referred herein to a protein or a nucleic acid sequence, such as a gene, by reference to a accession number, this number in particular is used to refer to a protein or nucleic acid sequence (gene) having a sequence as can be found via

www . ncbi . nlm . nih . gov/, (as available on 13 July 2009) unless specified otherwise.

Every nucleic acid sequence herein that encodes a polypeptide also, by reference to the genetic code, describes every possible silent variation of the nucleic acid. The term "conservatively modified variants" applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences,

conservatively modified variants refers to those nucleic acids which encode identical or conservatively modified variants of the amino acid sequences due to the degeneracy of the genetic code. The term "degeneracy of the genetic code" refers to the fact that a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are "silent variations" and represent one species of conservatively modified variation.

The term "functional homologue" (or in short "homologue") of a polypeptide having a specific sequence (e.g. SEQ ID NO: X), as used herein, refers to a polypeptide comprising said specific sequence with the proviso that one or more amino acids are substituted, deleted, added, and/or inserted, and which polypeptide has (qualitatively) the same enzymatic functionality for substrate conversion. This functionality may be tested by use of an assay system comprising a recombinant yeast cell comprising an expression vector for the expression of the homologue in yeast, said expression vector comprising a heterologous nucleic acid sequence operably linked to a promoter functional in the yeast and said heterologous nucleic acid sequence encoding the homologous polypeptide of which enzymatic activity for converting acetyl- Coenzyme A to acetaldehyde in the yeast cell is to be tested, and assessing whether said conversion occurs in said cells. Candidate homologues may be identified by using in silico similarity analyses. A detailed example of such an analysis is described in Example 2 of WO2009/013159. The skilled person will be able to derive there from how suitable candidate homologues may be found and, optionally upon codon(pair) optimization, will be able to test the required functionality of such candidate homologues using a suitable assay system as described above. A suitable homologue represents a polypeptide having an amino acid sequence similar to a specific polypeptide of more than 50%, preferably of 60 % or more, in particular of at least 70 %, more in particular of at least 80 %, at least 90 %, at least 95 %, at least 97 %, at least 98 % or at least 99 % and having the required enzymatic functionality. With respect to nucleic acid sequences, the term functional homologue is meant to include nucleic acid sequences which differ from another nucleic acid sequence due to the degeneracy of the genetic code and encode the same polypeptide sequence.

Sequence identity is herein defined as a relationship between two or more amino acid (polypeptide or protein) sequences or two or more nucleic acid

(polynucleotide) sequences, as determined by comparing the sequences. Usually, sequence identities or similarities are compared over the whole length of the sequences compared. In the art, "identity" also means the degree of sequence relatedness between amino acid or nucleic acid sequences, as the case may be, as determined by the match between strings of such sequences.

Amino acid or nucleotide sequences are said to be homologous when exhibiting a certain level of similarity. Two sequences being homologous indicate a common evolutionary origin. Whether two homologous sequences are closely related or more distantly related is indicated by "percent identity" or "percent similarity", which is high or low respectively. Although disputed, to indicate "percent identity" or "percent similarity", "level of homology" or "percent homology" are frequently used interchangeably. A comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm. The skilled person will be aware of the fact that several different computer programs are available to align two sequences and determine the homology between two sequences (Kruskal, J. B. (1983) An overview of sequence comparison In D. Sankoff and J. B. Kruskal, (ed.), Time warps, string edits and macromolecules: the theory and practice of sequence comparison, pp. 1-44 Addison Wesley). The percent identity between two amino acid sequences can be determined using the Needleman and Wunsch algorithm for the alignment of two sequences. (Needleman, S. B. and Wunsch, C. D. (1970) J. Mol. Biol. 48, 443-453). The algorithm aligns amino acid sequences as well as nucleotide sequences. The Needleman- Wunsch algorithm has been implemented in the computer program NEEDLE. For the purpose of this invention the NEEDLE program from the EMBOSS package was used (version 2.8.0 or higher, EMBOSS: The European Molecular Biology Open Software Suite (2000) Rice, P. Longden,I. and Bleasby,A. Trends in Genetics 16, (6) pp276— 277, http://emboss.bioinformatics.nl/). For protein sequences, EBLOSUM62 is used for the substitution matrix. For nucleotide sequences, EDNAFULL is used. Other matrices can be specified. The optional parameters used for alignment of amino acid sequences are a gap-open penalty of 10 and a gap extension penalty of 0.5. The skilled person will appreciate that all these different parameters will yield slightly different results but that the overall percentage identity of two sequences is not significantly altered when using different algorithms.

Global Homology Definition

The homology or identity is the percentage of identical matches between the two full sequences over the total aligned region including any gaps or extensions. The homology or identity between the two aligned sequences is calculated as follows: Number of corresponding positions in the alignment showing an identical amino acid in both sequences divided by the total length of the alignment including the gaps. The identity defined as herein can be obtained from NEEDLE and is labelled in the output of the program as "IDENTITY".

Longest Identity Definition

The homology or identity between the two aligned sequences is calculated as follows: Number of corresponding positions in the alignment showing an identical amino acid in both sequences divided by the total length of the alignment after subtraction of the total number of gaps in the alignment. The identity defined as herein can be obtained from NEEDLE by using the NOBRIEF option and is labeled in the output of the program as "longest-identity".

A variant of a nucleotide or amino acid sequence disclosed herein may also be defined as a nucleotide or amino acid sequence having one or several substitutions, insertions and/or deletions as compared to the nucleotide or amino acid sequence specifically disclosed herein (e.g. in de the sequence listing).

Optionally, in determining the degree of amino acid similarity, the skilled person may also take into account so-called "conservative" amino acid substitutions, as will be clear to the skilled person. Conservative amino acid substitutions refer to the interchange ability of residues having similar side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulphur-containing side chains is cysteine and methionine. Preferred conservative amino acids substitution groups are: valine-leucine-isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine -valine, and asparagine -glutamine. Substitutional variants of the amino acid sequence disclosed herein are those in which at least one residue in the disclosed sequences has been removed and a different residue inserted in its place. Preferably, the amino acid change is conservative. Preferred conservative substitutions for each of the naturally occurring amino acids are as follows: Ala to ser; Arg to lys; Asn to gin or his; Asp to glu; Cys to ser or ala; Gin to asn; Glu to asp; Gly to pro; His to asn or gin; He to leu or val; Leu to ile or val; Lys to arg; gin or glu; Met to leu or ile; Phe to met, leu or tyr; Ser to thr; Thr to ser; Trp to tyr; Tyr to trp or phe; and, Val to ile or leu.

Nucleotide sequences of the invention may also be defined by their capability to hybridise with parts of specific nucleotide sequences disclosed herein, respectively, under moderate, or preferably under stringent hybridisation conditions. Stringent hybridisation conditions are herein defined as conditions that allow a nucleic acid sequence of at least about 25, preferably about 50 nucleotides, 75 or 100 and most preferably of about 200 or more nucleotides, to hybridise at a temperature of about 65°C in a solution comprising about 1 M salt, preferably 6 x SSC or any other solution having a comparable ionic strength, and washing at 65°C in a solution comprising about 0.1 M salt, or less, preferably 0.2 x SSC or any other solution having a comparable ionic strength. Preferably, the hybridisation is performed overnight, i.e. at least for 10 hours and preferably washing is performed for at least one hour with at least two changes of the washing solution. These conditions will usually allow the specific hybridisation of sequences having about 90% or more sequence identity.

Moderate conditions are herein defined as conditions that allow a nucleic acid sequences of at least 50 nucleotides, preferably of about 200 or more nucleotides, to hybridise at a temperature of about 45°C in a solution comprising about 1 M salt, preferably 6 x SSC or any other solution having a comparable ionic strength, and washing at room temperature in a solution comprising about 1 M salt, preferably 6 x SSC or any other solution having a comparable ionic strength. Preferably, the hybridisation is performed overnight, i.e. at least for 10 hours, and preferably washing is performed for at least one hour with at least two changes of the washing solution. These conditions will usually allow the specific hybridisation of sequences having up to 50% sequence identity. The person skilled in the art will be able to modify these hybridisation conditions in order to specifically identify sequences varying in identity between 50% and 90%.

"Expression" refers to the transcription of a gene into structural RNA (rRNA, tRNA) or messenger RNA (mRNA) with subsequent translation into a protein.

As used herein, "heterologous" in reference to a nucleic acid or protein is a nucleic acid or protein that originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. For example, a promoter operably linked to a heterologous structural gene is from a species different from that from which the structural gene was derived, or, if from the same species, one or both are substantially modified from their original form. A heterologous protein may originate from a foreign species or, if from the same species, is substantially modified from its original form by deliberate human intervention.

The term "heterologous expression" refers to the expression of heterologous nucleic acids in a host cell. The expression of heterologous proteins in eukaryotic host cell systems such as yeast are well known to those of skill in the art. A polynucleotide comprising a nucleic acid sequence of a gene encoding an enzyme with a specific activity can be expressed in such a eukaryotic system. In some embodiments, transformed/transfected yeast cells may be employed as expression systems for the expression of the enzymes. Expression of heterologous proteins in yeast is well known. Sherman, F., et al., Methods in Yeast Genetics, Cold Spring Harbor Laboratory (1982) is a well recognized work describing the various methods available to express proteins in yeast. Two widely utilized yeasts are Saccharomyces cerevisiae and Pichia pastoris. Vectors, strains, and protocols for expression in Saccharomyces and Pichia are known in the art and available from commercial suppliers (e.g., Invitrogen). Suitable vectors usually have expression control sequences, such as promoters, including 3- phosphoglycerate kinase or alcohol oxidase, and an origin of replication, termination sequences and the like as desired.

As used herein "promoter" is a DNA sequence that directs the

transcription of a (structural) gene. Typically, a promoter is located in the 5'-region of a gene, proximal to the transcriptional start site of a (structural) gene. Promoter sequences may be constitutive, inducible or repressible. If a promoter is an inducible promoter, then the rate of transcription increases in response to an inducing agent.

The term "vector" as used herein, includes reference to an autosomal expression vector and to an integration vector used for integration into the

chromosome.

The term "expression vector" refers to a DNA molecule, linear or circular, that comprises a segment encoding a polypeptide of interest under the control of (i.e. operably linked to) additional nucleic acid segments that provide for its transcription. Such additional segments may include promoter and terminator sequences, and may optionally include one or more origins of replication, one or more selectable markers, an enhancer, a polyadenylation signal, and the like. Expression vectors are generally derived from plasmid or viral DNA, or may contain elements of both. In particular an expression vector comprises a nucleic acid sequence that comprises in the 5' to 3' direction and operably linked: (a) a yeast-recognized transcription and translation initiation region, (b) a coding sequence for a polypeptide of interest, and (c) a yeast- recognized transcription and translation termination region. "Plasmid" refers to autonomously replicating extrachromosomal DNA which is not integrated into a microorganism's genome and is usually circular in nature. An "integration vector" refers to a DNA molecule, linear or circular, that can be incorporated in a microorganism's genome and provides for stable inheritance of a gene encoding a polypeptide of interest. The integration vector generally comprises one or more segments comprising a gene sequence encoding a polypeptide of interest under the control of (i.e. operably linked to) additional nucleic acid segments that provide for its transcription. Such additional segments may include promoter and terminator sequences, and one or more segments that drive the incorporation of the gene of interest into the genome of the target cell, usually by the process of homologous recombination. Typically, the integration vector will be one which can be transferred into the target cell, but which has a replicon which is nonfunctional in that organism. Integration of the segment comprising the gene of interest may be selected if an appropriate marker is included within that segment.

By "host cell" is meant a cell which contains a vector and supports the replication and/or expression of the vector. Host cells may be prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or mammalian cells. Preferably, host cells are eukaryotic cells of the order of Actinomycetales .

"Transformation" and "transforming", as used herein, refers to the insertion of an exogenous polynucleotide into a host cell, irrespective of the method used for the insertion, for example, direct uptake, transduction, f-mating or electroporation. The exogenous polynucleotide may be maintained as a non-integrated vector, for example, a plasmid, or alternatively, may be integrated into the host cell genome.

The microorganism, preferably is selected from the group of

Saccharomyceraceae, such as Saccharomyces cerevisiae, Saccharomyces pastorianus, Saccharomyces beticus, Saccharomyces fermentati, Saccharomyces paradoxus,

Saccharomyces uvarum and Saccharomyces bay anus; Schizosaccharomyces such as Schizosaccharomyces pombe, Schizosaccharomyces japonicus, Schizosaccharomyces octosporus and Schizosaccharomyces cryophilus; Torulaspora such as Torulaspora delbrueckii; Kluyveromyces such as Kluyveromyces marxianus; Pichia such as Pichia stipitis, Pichia pastoris or pichia angusta, Zygosaccharomyces such as

Zygosaccharomyces bailii; Brettanomyces such as Brettanomyces

inter medius, .Brettanomyces bruxellensis, Brettanomyces anomalus, Brettanomyces custersianus, Brettanomyces naardenensis, Brettanomyces nanus, Dekkera bruxellensis and Dekkera anomala; Metschmkowia, Issatchenkia, such as Issatchenkia orientalis, Kloeckera such as Kloeckera apiculata; Aureobasidium such as

Aureobasidium pullulans.

In a highly preferred embodiment, the microorganism is a yeast cell is selected from the group of Saccharomyceraceae. In particular, good results have been achieved with a Saccharomyces cerevisiae cell. It has been found possible to use such a cell according to the invention in a method for preparing an alcohol (ethanol) wherein the NADH- dependent side-product formation (glycerol) was reduced by about 90 %, and wherein the yield of the desired product (ethanol) was increase by about 10 %, compared to a similar cell without Rubisco and PRK.

The Rubisco may in principle be selected from eukaryotic and prokaryotic

Rubisco's.

The Rubisco is preferably from a non-photo trophic organism. In particular, the Rubisco may be from a chemolithoautotrophic microorganism.

Good results have been achieved with a bacterial Rubisco. Preferably, the bacterial Rubisco originates from a Thiobacillus, in particular, Thiobacillus denitrif icons, which is chemolithoautotrophic.

The Rubisco may be a single-subunit Rubisco or a Rubisco having more than one subunit. In particular, good results have been achieved with a single-subunit Rubisco.

In particular, good results have been achieved with a form-II Rubisco, more in particular CbbM.

SEQUENCE ID NO: 2 shows the sequence of a particularly preferred Rubisco in accordance with the invention. It is encoded by the cbbM gene from

Thiobacillus denitrif icans. A preferred alternative to this Rubisco, is a functional homologue of this Rubisco, in particular such functional homologue comprising a sequence having at least 80% , 85%, 90 % or 95% sequence identity with SEQUENCE ID NO: 2. Suitable natural Rubisco polypeptides are given in Table 1. Table 1: Rubisco polypeptides

Source Accession no. MAX ID (%)

Thiobacillus denitrificans AAA99178.2 100

Sideroxydans lithotrophicus ES-1 YP_003522651.1 94 Thiothrix nivea DSM 5205 ZP_10101642.1 91

Halothiobacillus neapolitanus c2 YP_003262978.1 90

Acidithiobacillus ferrooxidans ATCC YP_002220242.1 88

53993

Rhodoferax ferrireducens T118 YP_522655.1 86

Thiorhodococcus drewsii AZ1 ZP_08824342.1 85 uncultured prokaryote AGE 14067.1 82

In accordance with the invention, the Rubisco is functionally expressed in the microorganism, at least during use in an industrial process for preparing a compound of interest.

To increase the likelihood that herein enzyme activity is expressed at sufficient levels and in active form in the transformed (recombinant) host cells of the invention, the nucleotide sequence encoding these enzymes, as well as the Rubisco enzyme and other enzymes of the invention (see below), are preferably adapted to optimise their codon usage to that of the host cell in question. The adaptiveness of a nucleotide sequence encoding an enzyme to the codon usage of a host cell may be expressed as codon adaptation index (CAI). The codon adaptation index is herein defined as a measurement of the relative adaptiveness of the codon usage of a gene towards the codon usage of highly expressed genes in a particular host cell or organism. The relative adaptiveness (w) of each codon is the ratio of the usage of each codon, to that of the most abundant codon for the same amino acid. The CAI index is defined as the geometric mean of these relative adaptiveness values. Non-synonymous codons and termination codons (dependent on genetic code) are excluded. CAI values range from 0 to 1, with higher values indicating a higher proportion of the most abundant codons (see Sharp and Li , 1987, Nucleic Acids Research 15: 1281-1295; also see: Jansen et al., 2003, Nucleic Acids Res. 31(8):2242-51). An adapted nucleotide sequence preferably has a CAI of at least 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8 or 0.9. Most preferred are the sequences which have been codon optimised for expression in the fungal host cell in question such as e.g. S. cerevisiae cells.

Preferably, the functionally expressed Rubisco has an activity, defined by the rate of ribulose- l,5-bisphosphate- dependent ¹⁴C-bicarbonate incorporation by cell extracts of at least 1 nmol.min ^mg protein) ¹, in particular an activity of at least 2 nmol.min ^mg protein) ¹ , more in particular an activity of at least 4 nmol.min ^mg protein) ¹. The upper limit for the activity is not critical. In practice, the activity may be about 200 nmol.min ^mg protein) ¹ or less, in particular 25 nmol.min ^mg protein) ¹ , more in particular 15 nmol.min ^mg protein) ¹ or less, e.g. about 10 nmol.min-^mg protein) ¹ or less. When referred herein to the activity of Rubisco, in particular the activity at 30 °C is meant. The conditions for an assay for determining this Rubisco activity are as found in the Examples, below (Example 4).

A functionally expressed phosphoribulokinase (PRK, (EC 2.7.1.19)) according to the invention is capable of catalysing the chemical reaction:

ATP + D-ribulose 5-phosphate Ψ^ΑΌΡ + D-ribulose 1,5-bisphosphate (1)

Thus, the two substrates of this enzyme are ATP and D-ribulose 5- phosphate, whereas its two products are ADP and D-ribulose 1,5-bisphosphate.

PRK belongs to the family of transferases, specifically those transferring phosphorus-containing groups (phosphotransferases) with an alcohol group as acceptor. The systematic name of this enzyme class is ATP:D-ribulose-5-phosphate 1- phosp ho transferase. Other names in common use include phosphopentokinase, rib ulose- 5-phosphate kinase, phosphopentokinase, phosphoribulokinase

(phosphorylating), 5-phosphoribulose kinase, ribulose phosphate kinase, PKK, PRuK, and PRK. This enzyme participates in carbon fixation.

The PRK can be from a prokaryote or a eukaryote. Good results have been achieved with a PRK originating from a eukaryote. Preferably the eukaryotic PRK originates from a plant selected from Caryophyllales , in particular from

Amaranthaceae, more in particular from Spinacia.

As a preferred alternative to PRK from Spinacia a functional homologue of PRK from Spinacia may be present, in particular a functional homologue comprising a sequence having at least 70%, 75%, 80%. 85%, 90 % or 95% sequence identity with SEQUENCE ID NO 4.

Suitable natural PRK polypeptides are given in Table 2.

Table 2: Natural PRK polypeptides suitable for expression

Source Accession no. MAX ID (%)

Spinacia oleracea P09559.1 100

Medicago truncatula XP_003612664.1 88 Arabidopsis thaliana NP_174486.1 87

Vitis vinifera XP_002263724.1 86

Closterium peracerosum BAL03266.1 82

Zea mays NP_001148258.1 78

In an advantageous embodiment, the recombinant microorganism further comprises a nucleic acid sequence encoding one or more heterologous prokaryotic or eukaryotic molecular chaperones, which - when expressed - are capable of functionally interacting with an enzyme in the microorganism, in particular with at least one of Rubisco and PRK.

Chaperonins are proteins that provide favourable conditions for the correct folding of other proteins, thus preventing aggregation. Newly made proteins usually must fold from a linear chain of amino acids into a three-dimensional form.

Chaperonins belong to a large class of molecules that assist protein folding, called molecular chaperones. The energy to fold proteins is supplied by adenosine

triphosphate (ATP). A review article about chaperones that is useful herein is written by Yebenes (2001); "Chaperonins: two rings for folding"; Hugo Yebenes et al. Trends in Biochemical Sciences, August 2011, Vol. 36, No. 8.

In a preferred embodiment, the chaperone or chaperones are from a bacterium, more preferably from Escherichia, in particular E. coli GroEL and GroEs from E. coli may in particular encoded in a microorganism according to the invention. Other preferred chaperones are chaperones from Saccharomyces, in particular Saccharomyces cerevisiae HsplO and Hsp60. If the chaperones are naturally expressed in an organelle such as a mitochondrion (examples are Hsp60 and HsplO of Saccharomyces cerevisiae) relocation to the cytosol can be achieved e.g. by modifying the native signal sequence of the chaperonins.

In eukaryotes the proteins Hsp60 and HsplO are structurally and functionally nearly identical to GroEL and GroES, respectively. Thus, it is

contemplated that Hsp60 and HsplO from any eukaryotic cell may serve as a chaperone for the Rubisco. See Zeilstra-Ryalls J, Fayet O, Georgopoulos C (1991). "The universally conserved GroE (Hsp60) chaperonins". Annu Rev Microbiol. 45: 301- 25. doi: 10.1146/annurev.mi.45.100191.001505. PMID 1683763 and Horwich AL, Fenton WA, Chapman E, Farr GW (2007). "Two Families of Chaperonin: Physiology and Mechanism". Annu Rev Cell Dev Biol. 23: 115-45.

doi: 10.1146/annurev.cellbio.23.090506.123555. PMID 17489689.

Particularly good results have been achieved with a recombinant yeast cell comprising both the heterologous chaperones GroEL and GroES.

As a preferred alternative to GroEL a functional homologue of GroEL may be present, in particular a functional homologue comprising a sequence having at least 70%, 75%, 80%, 85%, 90 % or 95% sequence identity with SEQUENCE ID NO: 10.

Suitable natural chaperones polypeptide homologous to SEQUENCE ID NO: 10 are given in Table 3.

Table 3: Natural chaperones homologous to SEQUENCE ID NO: 10 polypeptides suitable for expression

>gi 1 115388105 I ref I XP_001211558.1 1 :2-101 10 kDa heat shock protein,

mitochondrial [Aspergillus terreus NIH2624]

>gi 1 116196854 I ref I XP_001224239.1 1 : 1-102 conserved hypothetical protein

[Chaetomium globosum CBS 148.51]

>gi | 119175741 1 ref I XP_001240050.1 1 :3-102 hypothetical protein CIMG_09671

[Coccidioides immitis RS]

>gi 1 119471607 I ref I XP_001258195.1 1 : 12- 111 chaperonin, putative [Neosartorya fischeri NRRL181]

>gi 1 121699818 I ref I XP_001268174.1 1 :8-106 chaperonin, putative [Aspergillus clavatus NRRL 1]

>gi 1 126274604 | ref I XP_001387607.1 1 :2-102 predicted protein [Scheffersomyces stipitis CBS 6054]

>gi | 146417701 1 ref I XP_001484818.1 1 :5-106 conserved hypothetical protein

[Meyerozyma guilliermondii ATCC 6260]

>gi 1 154303611 1 ref I XP_001552212.1 1 : 1-102 10 kDa heat shock protein,

mitochondrial [Botryotinia fuckeliana B05.10]

>gi | 156049571 1 ref I XP_001590752.1 1 : 1-102 hypothetical protein SS1G_08492

[Sclerotinia sclerotiorum 1980]

>gi 1 156840987 I ref I XP_001643870.1 1 : 1-103 hypothetical protein Kpol_495pl0 [Vanderwaltozyma polyspora DSM 70924]

>gi | 169608295 I ref I XP_001797567.1 1 : 1-101 hypothetical protein SNOG_07218 [Phaeosphaeria nodorum SN15]

>gi 1 171688384 | ref I XP_001909132.1 1 : 1-102 hypothetical protein [Podospora anserina S mat+]

>gi 1 189189366 I ref I XP_001931022.1 1 :71- 168 10 kDa chaperonin [Pyrenophora tritici-repentis Pt-lC-BFP]

>gi 1 19075598 I ref I NP_588098.1 1 : 1-102 mitochondrial heat shock protein Hsp lO (predicted) [Schizosaccharomyces pombe 972h-]

>gi I 212530240 I ref I XP_002145277.1 1 :3-100 chaperonin, putative [Talaromyces marneffei ATCC 18224]

>gi I 212530242 | ref I XP_002145278.1 1 :3-95 chaperonin, putative [Talaromyces marneffei ATCC 18224]

>gi I 213404320 I ref I XP_002172932.1 1 : 1-102 mitochondrial heat shock protein HsplO [Schizosaccharomyces japonicus yFS275]

>gi I 225557301 1 gb I EEH05587.1 1 :381-478 pre-mRNA polyadenylation factor fipl [Ajellomyces capsulatus G186AR]

>gi I 225684092 | gb I EEH22376.1 1 :3-100 heat shock protein [Paracoccidioides brasiliensis Pb03

>gi I 238490530 I ref I XP_002376502.1 1 :2-104 chaperonin, putative [Aspergillus flavus NRRL3357

>gi I 238878220 I gb I EEQ41858.1 1 : 1- 106 10 kDa heat shock protein,

mitochondrial [Candida albicans WO- 1]

>gi I 240280207 I gb I EER43711.1 1 :426-523 pre-mRNA polyadenylation factor fipl [Ajellomyces capsulatus H143]

>gi I 241950445 I ref I XP_002417945.1 1 : 1-103 10 kda chaperonin, putative; 10 kda heat shock protein mitochondrial (hsplO), putative [Candida dubliniensis CD36]

>gi I 242819222 | ref I XP_002487273.1 1 :90- 182 chaperonin, putative

[Talaromyces stipitatus ATC

>gi I 254566327 I ref I XP_002490274.1 1 : 1-102 Putative protein of unknown function [Komagataella pastoris GS115] >gi I 254577241 1 ref I XP_002494607.1 1 : 1-103 ZYRO0A05434p

[Zygosaccharomyces rouxii]

>gi I 255717999 | ref I XP_002555280.1 1 : 1-103 KLTH0G05588p [Lachancea thermotolerans]

>gi I 255956581 1 ref I XP_002569043.1 1 :2-101 Pc21g20560 [Penicillium chrysogenum Wisconsin 54- 1255]

>gi I 258572664 | ref I XP_002545094.1 1 : 16- 108 chaperonin GroS [Uncinocarpus reesii 1704]

>gi I 261190594 I ref I XP_002621706.1 1 : 3- 100 chaperonin [Ajellomyces dermatitidis SLH14081]

>gi I 295664909 I ref I XP_002793006.1 1 :3-100 10 kDa heat shock protein, mitochondrial [Paracoccidioides sp. 'lutzii'PbOl]

>gi I 296412657 I ref I XP_002836039.1 1 :76- 177 hypothetical protein [Tuber melanosporum Mel28]

>gi I 302307854 | ref I NP_984626.2 | :2- 102 AEL235Wp [Ashbya gossypii ATCC 10895]

>gi I 302894117 I ref I XP_003045939.1 1 : 1-102 predicted protein [Nectria haematococca mpVI 77-13-4]

>gi I 303318351 1 ref I XP_003069175.1 1 :3-100 10 kDa heat shock protein, mitochondrial , putative [Coccidioides posadasii C735 delta SOWgp]

>gi I 310795300 I gb I EFQ30761.1 1 : 1-102 chaperonin 10 kDa subunit [Glomerella graminicola Ml.001]

>gi I 315053085 I ref I XP_003175916.1 1 : 12- 109 chaperonin GroS [Arthroderma gypseum CBS 118893]

>gi I 317032114 | ref I XP_001394060.2 | :334-433 heat shock protein [Aspergillus niger CBS 513.88]

>gi I 317032116 I ref I XP_001394059.2 | :2-101 heat shock protein [Aspergillus niger CBS 513.88]

>gi I 320583288 I gb I EFW97503.1 1 :6- 106 chaperonin, putative heat shock protein, putative [Ogataea parapolymorpha DL- 1]

>gi I 320591507 I gb I EFX03946.1 1 : 1- 102 heat shock protein [Grosmannia clavigera kwl407] >gi I 322700925 I gb I EFY92677.1 1 : 1- 102 chaperonin [Metarhizium acridum CQMa 102]

>gi I 325096696 I gb I EGC50006.1 1 :409-506 pre-mRNA polyadenylation factor fipl [Ajellomyces capsulatus H88]

>gi I 326471604 | gb I EGD95613.1 1 : 14- 111 chaperonin 10 Kd subunit

[Trichophyton tonsurans CBS 112818]

>gi I 327293056 I ref I XP_003231225.1 1 :3-100 chaperonin [Trichophyton rubrum CBS 118892]

>gi I 330942654 I ref I XP_003306155.1 1 :37- 136 hypothetical protein PTT_19211 [Pyrenophora teres f. teres 0- 1]

>gi I 336268042 | ref I XP_003348786.1 1 :47- 147 hypothetical protein SMAC_01809 [Sordaria macrospora khell]

>gi I 340519582 | gb I EGR49820.1 1 : 1- 109 predicted protein [Trichoderma reesei QM6a]

>gi I 340960105 I gb I EGS21286.1 1 :3- 103 putative mitochondrial 10 kDa heat shock protein [Chaetomium thermophilum var. thermophilum DSM 1495] >gi I 342883802 | gb I EGU84224.1 1 : 1-102 hypothetical protein FOXB_05181 [Fusarium oxysporum Fo5176]

>gi I 344302342 | gb I EGW32647.1 1 :2- 102 hypothetical protein

SPAPADRAFT_61712 [Spathaspora passalidamm NRRL Y-27907]

>gi I 345570750 I gb I EGX53571.1 1 : 1- 102 hypothetical protein AOL_s00006g437 [Arthrobotrys oligospora ATCC 24927]

>gi I 346321154 | gb I EGX90754.1 1 : 1- 102 chaperonin [Cordyceps militaris CMOl] >gi I 346970393 I gb I EGY13845.1 1 : 1- 102 heat shock protein [Verticillium dahliae VdLs.17]

>gi I 354548296 I emb I CCE45032.1 1 : 1-106 hypothetical protein CPAR2_700360 [Candida parapsilosis]

>gi I 358385052 | gb I EHK22649.1 1 : 1- 102 hypothetical protein

TRIVIDRAFT_230640 [Trichoderma virens Gv 29-8]

>gi I 358393422 | gb I EHK42823.1 1 : 1- 101 hypothetical protein

TRIATDRAFT_258186 [Trichoderma atroviride IMI 206040]

>gi I 361126733 I gb I EHK98722.1 1 : l-97 putative 10 kDa heat shock protein, mitochondrial [Glare lozoyensis 74030]

>gi I 363753862 | ref I XP_003647147.1 1 :2-102 hypothetical protein Ecym_5593 [Eremothecium cymbalariae DBVPG#7215]

>gi I 365758401 1 gb I EHN00244.1 1 : 1- 106 Hsp lOp [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]

>gi I 365987664 I ref I XP_003670663.1 1 : 1-103 hypothetical protein

NDAI_0F01010 [Naumovozyma dairenensis CBS 421]

>gi I 366995125 I ref I XP_003677326.1 1 : 1-103 hypothetical protein

NCAS_0G00860 [Naumovozyma castellii CBS 4309]

>gi I 366999797 I ref I XP_003684634.1 1 : 1-103 hypothetical protein

TPHA_0C00430 [Tetrapisispora phaffii CBS 4417]

>gi I 367009030 I ref I XP_003679016.1 1 : 1-103 hypothetical protein

TDEL_0A04730 [Torulaspora delbruekii]

>gi I 367023138 I ref I XP_003660854.1 1 : 1-104 hypothetical protein

MYCTH_59302 [Myceliophthora thermophila ATCC 42464]

>gi I 367046344 I ref I XP_003653552.1 1 : 1-102 hypothetical protein

THITE_2116070 [Thielavia terrestris NRRL8126]

>gi I 378726440 I gb I EHY52899.1 1 :9-109 chaperonin GroES [Exophiala dermatitidis NIH/UT8656]

>gi I 380493977 I emb I CCF33483.1 1 : 1- 102 chaperonin 10 kDa subunit

[Colletotrichum higginsianu

>gi I 385305728 I gb I EIF49680.1 1 : 1-102 10 kda heat shock mitochondrial

[Dekkera bruxellensis AWRI1499]

>gi I 389628546 I ref I XP_003711926.1 1 : 1-102 hsplO-like protein [Magnaporthe oryzae 70-15]

>gi I 396462608 I ref I XP_003835915.1 1 : 1-101 similar to 10 kDa heat shock protein [Leptosphaeria maculans JN3]

>gi I 398392541 1 ref I XP_003849730.1 1 : 1-102 hypothetical protein

MYC GRDRAFT_ 105721 [Zymoseptoria tritici IP0323]

>gi I 400597723 I gb I EJP65453.1 1 :24- 124 chaperonin 10 kDa subunit [Beauveria bassiana ARSEF 2860]

>gi I 401623646 I gb I EJS41738.1 1 : 1- 106 hsp lOp [Saccharomyces arboricola H-6] >gi I 401842164 | gb I EJT44422.1 1 : l-92 HSPlO-like protein [Saccharomyces kudriavzevii IFO 1802]

>gi I 402084027 I gb I EJT79045.1 1 : 1-102 hsp lO-like protein [Gaeumannomyces graminis var. triti

>gi I 403215209 I emb I CCK69709.1 1 : 1- 104 hypothetical protein KNAG_0C06130 [Kazachstania naganishii CBS 8797]

>gi I 406604629 I emb I CCH43969.1 1 :4-100 hypothetical protein BN7_3524

[Wickerhamomyces ciferrii]

>gi I 406867021 1 gb I EKD20060.1 1 :56- 156 hypothetical protein MBM_02012 [Marssonina brunnea f.sp. 'multigermtubi' MB_ml]

>gi I 407926227 I gb I EKG19196.1 1 :74- 174 GroES-like protein [Macrophomina phaseolina MS6]

>gi I 408398157 I gb I EKJ77291.1 1 : 11- 111 hypothetical protein FPSE_02566 [Fusarium pseudograminearum CS3096]

>gi I 410082063 I ref I XP_003958610.1 1 : 1-103 hypothetical protein

KAFR_0H00660 [Kazachstania africana CBS2517]

>gi I 425777664 | gb I EKV15823.1 1 :58-157 Chaperonin, putative [Penicillium digitatum Pdl]

>gi I 440639680 I gb I ELR09599.1 1 : 1- 102 chaperonin GroES [Geomyces destructans 20631-21]

>gi I 444323906 | ref I XP_004182593.1 1 : 1-105 hypothetical protein

TBLA_0J00760 [Tetrapisisporablattae CBS 6284]

>gi I 448083208 I ref I XP_004195335.1 1 :2-101 Piso0_005888 [Millerozyma farinosa CBS 7064]

>gi I 448087837 I ref I XP_004196425.1 1 :2-102 Piso0_005888 [Millerozyma farinosa CBS 7064]

>gi I 448534948 I ref I XP_003870866.1 1 : 1-106 HsplO protein [Candida orthopsilosis Co 90- 125]

>gi I 449295977 I gb I EMC91998.1 1 : 1-102 hypothetical protein

BAUCODRAFT_39148 [Baudoinia compn

>gi I 46123659 I ref I XP_386383.1 1 :3-103 hypothetical protein FG06207.1

[Gibberella zeae PH-1] >gi I 50289455 I ref I XP_447159.1 1 : 1-103 hypothetical protein [Candida glabrata CBS 138]

>gi I 50308731 1 ref I XP_454370.1 1 : 1-103 hypothetical protein [Kluyveromyces lactis NRRL Y- 1140]

>gi | 50411066 I ref | XP_457014.1 1 : 1-106 DEHA2B01122p [Debaryomyces

hansenii CBS 767]

>gi I 50545998 I ref I XP_500536.1 1 : 1-102 YALI0B05610p [Yarrowia lipolytica] >gi I 51013895 I gb I AAT93241.1 1 : 1-106 YOR020C [Saccharomyces cerevisiae] >gi I 6324594 | ref I NP_014663.1 1 : 1- 106 HsplOp [Saccharomyces cerevisiae

S288c]

>gi I 67523953 I ref I XP_660036.1 1 :2-101 hypothetical protein AN2432.2

[Aspergillus nidulans FGSC A4]

>gi I 70992219 I ref I XP_750958.1 1 : 12- 106 chaperonin [Aspergillus fumigatus

Af293]

>gi I 85079266 I ref I XP_956315.1 1 : 1-104 hypothetical protein NCU04334

[Neurospora crassa OR74A]

As a preferred alternative to GroES a functional homologue of GroES may be present, in particular a functional homologue comprising a sequence having at least 70%, 75%, 80%, 85%, 90 % or 95% sequence identity with SEQUENCE ID NO: 12.

Suitable natural chaperones polypeptides homologous to SEQUENCE ID NO: 12 are given in Table 4.

Table 4: Natural chaperones homologous to SEQUENCE ID NO: 12 polypeptides suitable for expression

>gi 1 115443330 I ref I XP_001218472.1 1 heat shock protein 60, mitochondrial precursor [Aspergillus terreus NIH2624]

>gi 1 114188341 1 gb I EAU30041.1 1 heat shock protein 60, mitochondrial precursor [Aspergillus terreus NIH2624]

>gi 1 119480793 I ref I XP_001260425.1 1 antigenic mitochondrial protein HSP60, putative [Neosartorya fischeri NRRL 181] >gi 1 119408579 I gb I EAW18528.1 1 antigenic mitochondrial protein HSP60, putative [Neosartorya fischeri NRRL 181] >gi 1 126138730 I ref I XP_001385888.1 1 hypothetical protein PICST_90190

[Scheffersomyces stipitis CBS 6054] >gi 1 126093166 I gb I ABN67859.1 1

mitochondrial groEL-type heat shock protein [Scheffersomyces stipitis CBS 6054] >gi 1 145246630 I ref I XP_001395564.1 1 heat shock protein 60 [Aspergillus niger CBS 513.88] >gi 1 134080285 I emb I CAK46207.1 1 unnamed protein product [Aspergillus niger] >gi | 350636909 I gb I EHA25267.1 1 hypothetical protein ASPNIDRAFT_54001 [Aspergillus niger ATCC 1015]

>gi 1 146413148 I ref I XP_001482545.1 1 heat shock protein 60, mitochondrial precursor [Meyerozyma guilliermondii ATCC 6260]

>gi 1 154277022 | ref I XP_001539356.1 1 heat shock protein 60, mitochondrial precursor [Ajellomyces capsulatus NAml] >gi 1 150414429 I gb I EDN09794.1 1 heat shock protein 60, mitochondrial precursor [Ajellomyces capsulatus NAml]

>gi 1 154303540 I ref I XP_001552177.1 1 heat shock protein 60 [Botryotinia fuckeliana B05.10] >gi I 347840915 I emb I CCD55487.1 1 similar to heat shock protein 60

[Botryotinia fuckeliana]

>gi 1 156063938 I ref I XP_001597891.1 1 heat shock protein 60, mitochondrial precursor [Sclerotinia sclerotiorum 1980] >gi 1 154697421 1 gb I EDN97159.1 1 heat shock protein 60, mitochondrial precursor [Sclerotinia sclerotiorum 1980 UF-70] >gi 1 156844469 I ref I XP_001645297.1 1 hypothetical protein Kpol_1037p35

[Vanderwaltozyma polyspora DSM 70294] >gi 1 156115957 I gb I ED017439.1 1 hypothetical protein Kpol_1037p35 [Vanderwaltozyma polyspora DSM 70294] >gi 1 16416029 I emb I CAB91379.2 | probable heat-shock protein hsp60 [Neurospora crassa] >gi I 350289516 I gb I EGZ70741.1 1 putative heat-shock protein hsp60

[Neurospora tetrasperma FGSC 2509]

>gi 1 169626377 I ref I XP_001806589.1 1 hypothetical protein SNOG_16475

[Phaeosphaeria nodorum SN15] >gi 1 111055053 I gb I EAT76173.1 1 hypothetical protein SNOG_16475 [Phaeosphaeria nodorum SN15]

>gi 1 169783766 I ref I XP_001826345.1 1 heat shock protein 60 [Aspergillus oryzae RIB40] >gi I 238493601 1 ref I XP_002378037.1 1 antigenic mitochondrial protein HSP60, putative [Aspergillus flavus NRRL3357] >gi I 83775089 I dbj I BAE65212.1 1 unnamed protein product [Aspergillus oryzae RIB40] >gi I 220696531 1 gb I EED52873.1 1 antigenic mitochondrial protein HSP60, putative [Aspergillus flavus NRRL3357] >gi I 391869413 I gb I EIT78611.1 1 chaperonin, Cpn60/Hsp60p [Aspergillus oryzae 3.042]

>gi 1 189190432 | ref I XP_001931555.1 1 heat shock protein 60, mitochondrial precursor [Pyrenophora tritici-repentis Pt- lC-BFP]

>gi 1 187973161 1 gb I EDU40660.1 1 heat shock protein 60, mitochondrial precursor [Pyrenophora tritici-repentis Pt- lC-BFP]

>gi 1 190348913 I gb I EDK41467.2 | heat shock protein 60, mitochondrial precursor [Meyerozyma guilliermondii ATCC 6260]

>gi I 225554633 I gb I EEH02929.1 1 hsp60-like protein [Ajellomyces capsulatus G186AR]

>gi I 238880068 I gb I EEQ43706.1 1 heat shock protein 60, mitochondrial precursor [Candida albicans WO-1]

>gi I 239613490 I gb I EEQ90477.1 1 chaperonin GroL [Ajellomyces dermatitidis ER-3] >gi I 240276977 I gb I EER40487.1 1 hsp60-like protein [Ajellomyces capsulatus H143] >gi I 241958890 I ref I XP_002422164.1 1 heat shock protein 60, mitochondrial precursor, putative [Candida dubliniensis CD36] >gi 1223645509 I emb I CAX40168.1 1 heat shock protein 60, mitochondrial precursor, putative [Candida dubliniensis CD36]

>gi I 254572906 I ref I XP_002493562.1 1 Tetradecameric mitochondrial chaperonin [Komagataella pastoris GS115] >gi I 238033361 1 emb I CAY71383.1 1 Tetradecameric mitochondrial chaperonin [Komagataella pastoris GS115]

>gi I 254579947 I ref I XP_002495959.1 1 ZYRO0C07106p [Zygosaccharomyces rouxii] >gi I 238938850 I emb I CAR27026.1 1 ZYRO0C07106p [Zygosaccharomyces rouxii] >gi I 255712781 1 ref I XP_002552673.1 1 KLTH0C10428p [Lachancea thermotolerans] >gi I 238934052 | emb I CAR22235.1 1 KLTH0C10428p [Lachancea thermotolerans CBS 6340]

>gi I 255721795 I ref I XP_002545832.1 1 heat shock protein 60, mitochondrial precursor [Candida tropicalis MYA-3404] >gi | 240136321 1 gb I EER35874.1 1 heat shock protein 60, mitochondrial precursor [Candida tropicalis MYA-3404]

>gi I 255941288 I ref I XP_002561413.1 1 Pcl6gl l070 [Penicillium chrysogenum Wisconsin 54-1255] >gi I 211586036 I emb I CAP93777.1 1 Pcl6gl l070 [Penicillium chrysogenum Wisconsin 54-1255]

>gi I 259148241 1 emb I CAY81488.1 1 Hsp60p [Saccharomyces cerevisiae EC1118] >gi I 260950325 I ref I XP_002619459.1 1 heat shock protein 60, mitochondrial precursor [Clavispora lusitaniae ATCC 42720] >gi I 238847031 1 gb I EEQ36495.1 1 heat shock protein 60, mitochondrial precursor [Clavispora lusitaniae ATCC 42720] >gi I 261194577 I ref I XP_002623693.1 1 chaperonin GroL [Ajellomyces dermatitidis SLH14081] >gi I 239588231 1 gb I EEQ70874.1 1 chaperonin GroL [Ajellomyces dermatitidis SLH14081] >gi I 327355067 I gb I EGE83924.1 1 chaperonin GroL

[Ajellomyces dermatitidis ATCC 18188]

>gi I 296422271 1 ref I XP_002840685.1 1 hypothetical protein [Tuber melanosporum Mel28] >gi I 295636906 I emb I CAZ84876.1 1 unnamed protein product [Tuber melanosporum]

>gi I 296809035 I ref I XP_002844856.1 1 heat shock protein 60 [Arthroderma otae CBS 113480] >gi I 238844339 I gb I EEQ34001.1 1 heat shock protein 60 [Arthroderma otae CBS 113480]

>gi I 302308696 I ref I NP_985702.2 | AFR155Wp [Ashbya gossypii ATCC 10895] >gi I 299790751 1 gb I AAS53526.2 | AFR155Wp [Ashbya gossypii ATCC 10895] >gi I 374108933 I gb I AEY97839.1 1 FAFR155Wp [Ashbya gossypii FDAG1]

>gi I 302412525 I ref I XP_003004095.1 1 heat shock protein [Verticillium albo-atrum VaMs.102] >gi | 261356671 1 gb I EEY19099.1 1 heat shock protein [Verticillium albo- atrum VaMs.102]

>gi I 302505585 I ref I XP_003014499.1 1 hypothetical protein ARB_07061

[Arthroderma benhamiae CBS 112371] >gi I 291178320 I gb I EFE34110.1 1

hypothetical protein ARB_07061 [Arthroderma benhamiae CBS 112371]

>gi I 302656385 I ref I XP_003019946.1 1 hypothetical protein TRV_05992

[Trichophyton verrucosum HKI 0517] >gi I 291183723 I gb I EFE39322.1 1 hypothetical protein TRV_05992 [Trichophyton verrucosum HKI 0517]

>gi I 302915513 I ref I XP_003051567.1 1 predicted protein [Nectria haematococca mpVI 77- 13-4] >gi | 256732506 I gb I EEU45854.1 1 predicted protein [Nectria haematococca mpVI 77-13-4]

>gi I 310794550 I gb I EFQ30011.1 1 chaperonin GroL [Glomerella graminicola M1.001] >gi I 315048491 1 ref I XP_003173620.1 1 chaperonin GroL [Arthroderma gypseum CBS 118893] >gi I 311341587 I gb I EFR00790.1 1 chaperonin GroL [Arthroderma gypseum CBS 118893]

>gi I 320580028 I gb I EFW94251.1 1 Tetradecameric mitochondrial chaperonin

[Ogataea parapolymorpha DL-1]

>gi I 320586014 | gb I EFW98693.1 1 heat shock protein mitochondrial precursor

[Grosmannia clavigera kwl407]

>gi I 322692465 I gb I EFY84374.1 1 Heat shock protein 60 precursor (Antigen HIS-62) [Metarhizium acridum CQMa 102]

>gi I 322705285 I gb I EFY96872.1 1 Heat shock protein 60 (Antigen HIS-62)

[Metarhizium anisopliae ARSEF 23]

>gi I 323303806 I gb I EGA57589.1 1 Hsp60p [Saccharomyces cerevisiae FostersB] >gi I 323307999 I gb I EGA61254.1 1 Hsp60p [Saccharomyces cerevisiae FostersO] >gi I 323332364 | gb I EGA73773.1 1 Hsp60p [Saccharomyces cerevisiae AWRI796] >gi I 326468648 I gb I EGD92657.1 1 heat shock protein 60 [Trichophyton tonsurans CBS 112818] >gi I 326479866 I gb I EGE03876.1 1 chaperonin GroL [Trichophyton equinum CBS 127.97]

>gi I 330915493 I ref I XP_003297052.1 1 hypothetical protein PTT_07333

[Pyrenophora teres f. teres 0- 1] >gi I 311330479 I gb I EFQ94847.1 1 hypothetical protein PTT_07333 [Pyrenophora teres f. teres 0- 1]

>gi I 336271815 I ref I XP_003350665.1 1 hypothetical protein SMAC_02337 [Sordaria macrospora k-hell] >gi I 380094827 I emb I CCC07329.1 1 unnamed protein product [Sordaria macrospora k-hell]

>gi I 336468236 I gb I EG056399.1 1 hypothetical protein NEUTE1DRAFT_122948 [Neurospora tetrasperma FGSC 2508]

>gi I 340522598 I gb I EGR52831.1 1 hsp60 mitochondrial precursor-like protein

[Trichoderma reesei QM6a]

>gi I 341038907 I gb I EGS23899.1 1 mitochondrial heat shock protein 60-like protein [Chaetomium thermophilum var. thermophilum DSM 1495]

>gi I 342886297 I gb I EGU86166.1 1 hypothetical protein FOXB_03302 [Fusarium oxysporum Fo5176]

>gi I 344230084 | gb I EGV61969.1 1 chaperonin GroL [Candida tenuis ATCC 10573] >gi I 344303739 I gb I EGW33988.1 1 hypothetical protein SPAPADRAFT_59397 [Spathaspora passalidarum NRRL Y-27907]

>gi I 345560428 I gb I EGX43553.1 1 hypothetical protein AOL_s00215g289

[Arthrobotrys oligospora ATCC 24927]

>gi I 346323592 | gb I EGX93190.1 1 heat shock protein 60 (Antigen HIS-62)

[Cordyceps militaris CMOl]

>gi I 346975286 I gb I EGY18738.1 1 heat shock protein [Verticillium dahliae VdLs.17] >gi I 354545932 | emb I CCE42661.1 1 hypothetical protein CPAR2_203040 [Candida parapsilosis]

>gi I 358369894 | dbj I GAA86507.1 1 heat shock protein 60, mitochondrial precursor [Aspergillus kawachii IFO 4308]

>gi I 358386867 I gb I EHK24462.1 1 hypothetical protein TRIVIDRAFT_79041

[Trichoderma virens Gv29-8]

>gi I 358399658 I gb I EHK48995.1 1 hypothetical protein TRIATDRAFT_297734

[Trichoderma atroviride IMI 206040]

>gi I 363750488 I ref I XP_003645461.1 1 hypothetical protein Ecym_3140

[Eremothecium cymbalariae DBVPG#7215]

>gi I 356889095 I gb I AET38644.1 1 Hypothetical protein Ecym_3140 [Eremothecium cymbalariae DBVPG#7215]

>gi I 365759369 I gb I EHN01160.1 1 Hsp60p [Saccharomyces cerevisiae x

Saccharomyces kudriavzevii VIN7]

>gi I 365764091 1 gb I EHN05616.1 1 Hsp60p [Saccharomyces cerevisiae x

Saccharomyces kudriavzevii VIN7]

>gi I 365985626 I ref I XP_003669645.1 1 hypothetical protein NDAI_0D00880

[Naumovozyma dairenensis CBS 421]

>gi I 343768414 | emb I CCD24402.1 1 hypothetical protein NDAI_0D00880

[Naumovozyma dairenensis CBS 421]

>gi I 366995970 I ref I XP_003677748.1 1 hypothetical protein NCAS_0H00890

[Naumovozyma castellii CBS 4309]

>gi I 342303618 I emb I CCC71399.1 1 hypothetical protein NCAS_0H00890

[Naumovozyma castellii CBS 4309]

>gi I 367005154 | ref I XP_003687309.1 1 hypothetical protein TPHA_0J00520

[Tetrapisispora phaffii CBS 4417] >gi I 357525613 I emb I CCE64875.1 1 hypothetical protein TPHA_0J00520 [Tetrapisispora phaffii CBS 4417]

>gi I 367017005 I ref I XP 003683001.1 1 hypothetical protein TDEL_0G04230

[Torulaspora delbrueckii] >gi | 359750664 | emb I CCE93790.1 1 hypothetical protein TDEL_0G04230 [Torulaspora delbrueckii]

>gi I 367035486 I ref I XP_003667025.1 1 hypothetical protein MYCTH_2097570

[Myceliophthora thermophila ATCC 42464]

>gi I 347014298 I gb I AE061780.1 1 hypothetical protein MYCTH_2097570

[Myceliophthora thermophila ATCC 42464]

>gi I 367055018 I ref I XP_003657887.1 1 hypothetical protein THITE_127923

[Thielavia terrestris NRRL 8126] >gi I 347005153 I gb I AE071551.1 1 hypothetical protein THITE_127923 [Thielavia terrestris NRRL 8126]

>gi I 378728414 | gb I EHY54873.1 1 heat shock protein 60 [Exophiala dermatitidis NIH/UT8656]

>gi I 380494593 I emb I CCF33032.1 1 heat shock protein 60 [Colletotrichum

higginsianum]

>gi I 385305893 I gb I EIF49836.1 1 heat shock protein 60 [Dekkera bruxellensis AWRI1499]

>gi I 389638386 I ref I XP_003716826.1 1 heat shock protein 60 [Magnaporthe oryzae 70- 15] >gi I 351642645 I gb I EHA50507.1 1 heat shock protein 60 [Magnaporthe oryzae 70- 15] >gi I 440474658 I gb I ELQ43388.1 1 heat shock protein 60 [Magnaporthe oryzae Y34] >gi I 440480475 I gb I ELQ61135.1 1 heat shock protein 60 [Magnaporthe oryzae P131]

>gi I 393243142 | gb I EJD50658.1 1 chaperonin GroL [Auricularia delicata TFB- 10046 SS5]

>gi I 396494741 1 ref I XP_003844378.1 1 similar to heat shock protein 60

[Leptosphaeria maculans JN3] >gi I 312220958 I emb I CBY00899.1 1 similar to heat shock protein 60 [Leptosphaeria maculans JN3]

>gi I 398393428 I ref I XP_003850173.1 1 chaperone ATPase HSP60 [Zymoseptoria tritici IP0323] >gi I 339470051 1 gb I EGP85149.1 1 hypothetical protein

MYCGRDRAFT_75170 [Zymoseptoria tritici IP0323]

>gi I 401624479 I gb I EJS42535.1 1 hsp60p [Saccharomyces arboricola H-6]

>gi I 401842294 | gb I EJT44530.1 1 HSP60-like protein [Saccharomyces kudriavzevii IFO 1802]

>gi I 402076594 | gb I EJT72017.1 1 heat shock protein 60 [Gaeumannomyces graminis var. tritici R3- ll la-l]

>gi I 403213867 I emb I CCK68369.1 1 hypothetical protein KNAG_0A07160

[Kazachstania naganishii CBS 8797]

>gi I 406606041 1 emb I CCH42514.1 1 Heat shock protein 60, mitochondrial

[Wickerhamomyces ciferrii]

>gi I 406863285 I gb I EKD 16333.1 1 heat shock protein 60 [Marssonina brunnea f. sp. 'multigermtubi' MB_ml]

>gi I 407922985 I gb I EKG16075.1 1 Chaperonin Cpn60 [Macrophomina phaseolina MS 6]

>gi I 408399723 I gb I EKJ78816.1 1 hypothetical protein FPSE_00959 [Fusarium pseudograminearum CS3096]

>gi I 410083028 I ref I XP_003959092.1 1 hypothetical protein KAFR_0I01760

[Kazachstania africana CBS 2517] >gi I 372465682 | emb I CCF59957.1 1 hypothetical protein KAFR_0I01760 [Kazachstania africana CBS 2517]

>gi I 444315528 I ref I XP_004178421.1 1 hypothetical protein TBLA_0B00580

[Tetrapisispora blattae CBS 6284] >gi | 387511461 1 emb I CCH58902.1 1 hypothetical protein TBLA_0B00580 [Tetrapisispora blattae CBS 6284]

>gi I 448090588 I ref I XP_004197110.1 1 Piso0_004347 [MiUerozyma farinosa CBS 7064] >gi I 448095015 I ref I XP_004198141.1 1 Piso0_004347 [MiUerozyma farinosa CBS 7064] >gi I 359378532 | emb I CCE84791.1 1 Piso0_004347 [MiUerozyma farinosa CBS 7064] >gi I 359379563 I emb I CCE83760.1 1 Piso0_004347 [MiUerozyma farinosa CBS 7064]

>gi I 448526196 I ref I XP_003869293.1 1 Hsp60 heat shock protein [Candida orthopsilosis Co 90- 125] >gi I 380353646 I emb I CCG23157.1 1 Hsp60 heat shock protein [Candida orthopsilosis]

>gi I 46123737 I ref I XP_386422.1 1 HS60_AJECA Heat shock protein 60,

mitochondrial precursor (Antigen HIS-62) [Gibberella zeae PH- 1]

>gi I 50292099 I ref I XP_448482.1 1 hypothetical protein [Candida glabrata CBS 138] >gi I 49527794 | emb I CAG61443.1 1 unnamed protein product [Candida glabrata] >gi I 50310975 I ref I XP_455510.1 1 hypothetical protein [Kluyveromyces lactis NRRL Y- 1140] >gi I 49644646 I emb I CAG98218.1 1 KLLA0F09449p [Kluyveromyces lactis] >gi I 50422027 I ref I XP_459575.1 1 DEHA2E05808p [Debaryomyces hansenii CBS767] >gi I 49655243 I emb I CAG87802.1 1 DEHA2E05808p [Debaryomyces hansenii CBS 767]

>gi I 50555023 I ref I XP_504920.1 1 YALI0F02805p [Yarrowia lipolytica]

>gi I 49650790 I emb I CAG77725.1 1 YALI0F02805p [Yarrowia lipolytica CLIB122]

>gi I 6323288 I ref I NP_013360.1 1 Hsp60p [Saccharomyces cerevisiae S288c]

>gi 1 123579 I sp I P19882.1 1 HSP60_YEAST RecName: Full=Heat shock protein 60, mitochondrial; AltName: Full=CPN60; AltName: Full=P66; AltName:

Full=Stimulator factor I 66 kDa component; Flags: Precursor

>gi 1 171720 I gb I AAA34690.1 1 heat shock protein 60 (HSP60) [Saccharomyces cerevisiae] >gi I 577181 1 gb I AAB67380.1 1 Hsp60p: Heat shock protein 60

[Saccharomyces cerevisiae] >gi 1 151941093 I gb I EDN59473.1 1 chaperonin

[Saccharomyces cerevisiae YJM789] >gi 1 190405319 I gb I EDV08586.1 1 chaperonin

[Saccharomyces cerevisiae RMl l- la] >gi I 207342889 I gb I EDZ70518.1 1 YLR259Cp- like protein [Saccharomyces cerevisiae AWRI1631]

>gi I 256271752 | gb I EEU06789.1 1 Hsp60p [Saccharomyces cerevisiae JAY291] >gi I 285813676 I tpg I DAA09572.1 1 TPA: chaperone ATPase HSP60 [Saccharomyces cerevisiae S288c] >gi I 323353818 I gb I EGA85673.1 1 Hsp60p [Saccharomyces cerevisiae VL3] >gi I 349579966 I dbj I GAA25127.1 1 K7_Hsp60p [Saccharomyces cerevisiae Kyokai no. 7] >gi I 392297765 I gb I EIW08864. i l Hsp60p [Saccharomyces cerevisiae CEN.PK113-7D] >gi | 226279 I prf | 1 1504305A mitochondrial assembly factor

>gi I 68485963 I ref I XP_713100.1 1 heat shock protein 60 [Candida albicans SC5314] >gi I 68486010 I ref I XP_713077.1 1 heat shock protein 60 [Candida albicans SC5314] >gi I 6016258 I sp I 074261.1 1 HSP60_CANAL RecName: Full=Heat shock protein 60, mitochondrial; AltName: Full=60 kDa chaperonin; AltName: Full=Protein Cpn60; Flags: Precursor >gi | 3552009 I gb I AAC34885.1 1 heat shock protein 60 [Candida albicans] >gi I 46434552 | gb I EAK93958.1 1 heat shock protein 60 [Candida albicans SC5314] >gi I 46434577 I gb I EAK93982.1 1 heat shock protein 60 [Candida albicans SC5314]

>gi I 71001164 | ref I XP_755263.1 1 antigenic mitochondrial protein HSP60

[Aspergillus fumigatus Af293] >gi I 66852901 1 gb I EAL93225.1 1 antigenic mitochondrial protein HSP60, putative [Aspergillus fumigatus Af293]

>gi 1 159129345 I gb I EDP54459.1 1 antigenic mitochondrial protein HSP60, putative

[Aspergillus fumigatus A1163]

>gi I 90970323 I gb I ABE02805.1 1 heat shock protein 60 [Rhizophagus intraradices]

In an embodiment, a 10 kDa chaperone from Table 3 is combined with a matching 60kDa chaperone from table 4 of the same organism genus or species for expression in the host.

For instance: >gi 1 189189366 I ref I XP_001931022.1 1 : 71-168 10 kDa chaperonin [Pyrenophora tritici-repentis] expressed together with matching

>gi 1 189190432 | ref I XP_001931555.1 1 heat shock protein 60, mitochondrial precursor [Pyrenophora tritici-repentis Pt-lC-BFP].

All other combinations from Table 3 and 4 similarly made with same organism source are also available to the skilled person for expression.

Further, one may combine a chaperone from Table 3 from one organism with a chaperone from Table 4 from another organism, or one may combine GroES with a chaperone from Table 3, or one may combine GroEL with a chaperone from Table 4.

As follows from the above, the invention further relates to a method for preparing an organic compound comprising converting a carbon source, using a microorganism, thereby forming the organic compound. The method may be carried out under aerobic, oxygen-limited or anaerobic conditions.

The invention allows in particular a reduction in formation of an NADH dependent side-product, especially glycerol, by up to 100 %, up to 99 %, or up to 90 %, compared to said production in a corresponding reference strain. The NADH dependent side-product formation is preferably reduced by more than 10 % compared to the corresponding reference strain, in particular by at least 20 %, more in particular by at least 50 %. NADH dependent side-product production is preferably reduced by 10-100 %, in particular by 20-95 %, more in particular by 50-90 %.

In preferred method wherein Rubisco, or another enzyme capable of catalysing the formation of an organic compound from CO2 (and another substrate) or another enzyme that catalyses the function of CO2 as an electron acceptor, is used, the carbon dioxide concentration in the reaction medium is at least 5 % of the CO2 saturation concentration under the reaction conditions, in particular at least 10 % of said CO2 saturation concentration, more in particular at least 20 % of said CO2 saturation concentration. This is in particular advantageous with respect to product yield. The reaction medium may be oversaturated in CO2 concentration, saturated in CO2 concentration or may have a concentration below saturation concentration. In a specific embodiment, the CO2 concentration is 75 % of the saturation concentration or less, in particular 50 % of said saturation concentration or less, more in particular is 25 % of the CO2 saturation concentration or less.

In a specific embodiment, the carbon dioxide or part thereof is formed in situ by the microorganism. If desired, the method further comprises the step of adding external C02 to the reaction system, usually by aeration with CO2 or a gas mixture containing CO2, for instance a CO2 /nitrogen mixture. Adding external CO2 in particular is used to (increase or) maintain the CO2 within a desired concentration range, if no or insufficient CO2 is formed in situ.

Determination of the CO2 concentration in a fluid is within the routine skills of the person skilled in the art. In practice, one may routinely determine the CO2 concentration in the gas phase above a culture of the yeast (practically the off-gas if the medium is purged with a gas). This can routinely be measured using a commercial gas analyser, such as a RosemountNGA200000 gas analyser (Rosemount Analytical,Orrvile,USA). The concentration in the liquid phase (relative to the saturation concentration), can then be calculated from the measured value in the gas, from the CO2 saturation concentration and Henri coefficients of under the existing conditions in the method. These parameters are available from handbooks or can be routinely determined.

As a carbon source, in principle any carbon source that the microorganism can use as a substrate can be used. In particular an organic carbon source may be used, selected from the group of carbohydrates and lipids (including fatty acids). Suitable carbohydrates include monosaccharides, disaccharides, and hydrolysed polysaccharides (e.g. hydrolysed starches, lignocellulosic hydrolysates). Although a carboxylic acid may be present, it is not necessary to include a carboxylic acid such as acetic acid, as a carbon source.

It is in particular an advantage of the present invention that an improved ethanol yield and a reduced glycerol production is feasible compared to, e.g., a wild type yeast cell, without needing to intervene in the genome of the cell by inhibition of a glycerol 3-phosphate phosphohydrolase and/or encoding a glycerol 3-phosphate dehydrogenase gene.

Still, in a specific embodiment, a yeast cell according to the invention may comprise a deletion or disruption of one or more endogenous nucleotide sequence encoding a glycerol 3-phosphate phosphohydrolase and/or encoding a glycerol 3- phosphate dehydrogenase gene:

Herein in the cell, enzymatic activity needed for the NADH- dependent glycerol synthesis is reduced or deleted. The reduction or deleted of this enzymatic activity can be achieved by modifying one or more genes encoding a NAD-dependent glycerol 3-phosphate dehydrogenase activity (GPD) or one or more genes encoding a glycerol phosphate phosphatase activity (GPP), such that the enzyme is expressed considerably less than in the wild-type or such that the gene encoded a polypeptide with reduced activity.

Such modifications can be carried out using commonly known

biotechnological techniques, and may in particular include one or more knock-out mutations or site-directed mutagenesis of promoter regions or coding regions of the structural genes encoding GPD and/or GPP. Alternatively, yeast strains that are defective in glycerol production may be obtained by random mutagenesis followed by selection of strains with reduced or absent activity of GPD and/or GPP. S. cerevisiae GPD1, GPD2, GPP1 and GPP2 genes are shown in WO 2011/010923, and are disclosed in SEQ ID NO: 24-27 of that application. The contents of this application are incorporated by reference, in particular the contents relating to GPD and/or GPP.

As shown in the Examples below, the invention is in particular found to be advantageous in a process for the production of an alcohol, notably ethanol. However, it is contemplated that the insight that CO2 can be used as an electron acceptor in microorganisms that do not naturally allow this, has an industrial benefit for other biotechnological processes for the production of organic molecules, in particular organic molecules of a relatively low molecular weight, particularly organic molecules with a molecular weight below 1000 g/mol. The following items are mentioned herein as preferred embodiments of the use of carbon dioxide as an electron acceptor in accordance with the invention. 1. Use of carbon dioxide as an electron acceptor in a recombinant chemotrophic micro-organism is a non-phototrophic eukaryotic micro-organism.

2. Use of carbon dioxide as an electron acceptor in a recombinant chemotrophic micro-organism , wherein the micro-organism produces an organic compound under anaerobic conditions.

3. Use according to item 1 or 2, wherein the carbon dioxide serves as an electron acceptor in a process with NADH as an electron donor.

5. Use according to any of the preceding items, wherein the microorganism produces an organic compound in a process with an excess production of ATP and/or NADH.

6. Use according to any of the preceding items, wherein the microorganism comprises a heterologous nucleic acid sequence encoding a polypeptide from a (naturally) autotrophic organism.

7. Use according to item 6, wherein the micro-organism comprises a heterologous nucleic acid sequence encoding a first prokaryotic chaperone for said polypeptide and preferably a nucleic acid sequence encoding a second prokaryotic chaperone - different from the first - for said polypeptide.

8. Use according to item 7, wherein the chaperones are GroEL and

GroES.

9. Use according to any of the preceding items, wherein the microorganism produces an organic compound selected from the group consisting of alcohols (such as methanol, ethanol, propanol, butanol, phenol, polyphenol), ribosomal peptides, antibiotics (such as penicillin), bio-diesel, alkynes, alkenes, isoprenoids, esters, carboxylic acids (such as succinic acid, citric acid, adipic acid, lactic acid ), amino acids, polyketides, lipids, and carbohydrates.

10. Use according to any of the preceding items, wherein the microorganism comprises a heterologous nucleic acid sequence functionally expressing a polypeptide selected from the group consisting of carbonic anhydrases, carboxylases, oxygenases, hydrogenases, dehydrogenases, isomerases, aldolases, transketolases, transaldolases, phosphatases, epimerases, kinases, carboxykinases, oxidoreductases, aconitases, fumarases, reductases, lactonases, phosphoenolpyruvate (PEP)

carboxylases, phosphoglycerate kinases, glyceraldehyde 3-phosphate dehydrogenases, triose phosphate isomerases, fructose- 1,6-bisphosphatases, sedoheptulose-1,7- bisphosphatases, phosphopentose isomerases, phosphopentose epimerase,

phosphoribulokinases (PRK), glucose 6-phosphate dehydrogenases, 6- phosphogluconolactonases, 6-phosphogluconate dehydrogenases, ribulose 5-phosphate isomerases, ribulose 5-phosphate 3-epimerases, Ribulose- 1,5-bisphosphate

carboxylase oxygenases, lactate dehydrogenases, malate synthases, isocitrate lyases, pyruvate carboxylases, phosphoenolpyruvate carboxykinases, fructose- 1,6- bisphosphatases, phosphoglucoisomerases, glucose-6-phosphatases, hexokinases, glucokinases, phosphofructokinases, pyruvate kinases, succinate dehydrogenases, citrate synthases, isocitrate dehydrogenases, a-ketoglutarate dehydrogenases, succinyl-CoA synthetases, malate dehydrogenases, nucleoside-diphosphate kinases, xylose reductases, xylitol dehydrogenases, xylose isomerases, isoprenoid synthases, and xylonate dehydratases.

11. Use according to item 10, wherein the microorganism comprises a heterologous nucleic acid sequence functionally expressing Ribulose- 1,5-bisphosphate carboxylase oxygenase (Rubisco) and/or a heterologous nucleic acid sequence functionally expressing a phosphoribulokinase (PRK).

12. Use according to any of the preceding items, wherein the

microorganism is selected from the group of is selected from the group consisting of Saccharomyceraceae, Penicillium, Yarrowia and Aspergillus.

13. Use according to any of the preceding items, wherein the carbon dioxide is used as an electron acceptor to reduce production of an NAD+- dependent side-product or NADH-dependent side-product, such as glycerol, in a process for preparing another organic compound, such as another alcohol or a carboxylic acid.

14. Recombinant micro-organism, in particular a eukaryotic micro- organism, having an enzymatic system allowing the micro-organism to use carbon dioxide as an electron acceptor under chemotrophic (non-phototrophic) conditions., wherein the microorganism is preferably as defined in the prevision items.

15. Recombinant micro-organism according to item 14, wherein the microorganism has an enzymatic system for producing an organic compound in a process with an excess production of ATP and/or NADH.

The production of the organic compound of interest may take place in a organism known for it usefulness in the production of the organic compound of interest, with the proviso that the organism has been genetically modified to enable the use of carbon dioxide as an electron acceptor in the organism.

Although it is contemplated that the invention is interesting for the production of a variety of industrially relevant organic compounds, a method or use according the invention is in particular considered advantageous for the production of an alcohol, in particular an alcohol selected from the group of ethanol, n-butanol and 2,3-butanediol; or in the production of an organic acid/carboxylate, in particular a carboxylate selected from the group of L-lactate, 3-hydroxypropionate, D-malate, L- malate, succinate, citrate, pyruvate and itaconate.

Regarding the production of ethanol, details are found herein above, when describing the yeast cell comprising PRK and Rubisco and in the examples. The ethanol or another alcohol is preferably produced in a fermentative process.

For the production of several organic acids (carboxylates), e.g. citric acid, an aerobic process is useful. For citric acid production for instance Aspergillus niger, Yarrowia lipolytica, or another known citrate producing organism may be used.

An example of an organic acid that is preferably produced anaerobically is lactic acid. Various lactic acid producing bacterial strains and yeast strains that have been engineered for lactate production are generally known in the art.

EXAMPLES

Example 1. Construction of the expression vector

Phosphoribulokinase (PRK) cDNA from Spinacia oleracea (spinach) (EMBL accession number: X07654.1) was PCR- amplified using Phusion Hot-start polymerase (Finnzymes, Landsmeer, the Netherlands) and the oligonucleotides XbaI_prk-FW2 and RVl_XhoI_prk (Table 5), and was ligated in pCR®-Blunt II- TOPO® (Life Technologies Europe BV, Bleiswijk, the Netherlands).

Table 5 Oligonucleotides

Number Name Sequence (5' to 3') Purpose

Cloning

1 XbaI_prk_FW2 TGACATCTAGATGTCACAACAACAAACAATTG cloning of PRK into pUDE046.

2 RV 1 Xhol prk TGACATCTAGATGTCACAACAACAAACAATTG cloning of PRK into pUDE046.

Primers used for in vivo plasmid assembly

TTGTAAAACGACGGCCAGTGAGCGCGCGTAATACGAC Rubisco cbbM cassette for plasmids

HR-cbbM-FW-65 TCACTATAGGGCGAATTGGGTACAGCTGGAGCTCAGT pUDC075, pUDC099, and pUDClOO.

TTATCATTATC

GGAATCTGTGTAGTATGCCTGGAATGTCTGCCGTGCCA Rubisco cbbM cassette for plasmids

HR-cbbM-RV-65 TAGCCATGTATGCTGATATGTCGGTACCGGCCGCAAA pUDC075, pUDC099, and pUDClOO

TTAAAG

ATCACTCTTACCAGGCTAGGACGACCCTACTCATGTAT Linker fragment for assembly of plasmid TGAGATCGACGAGATTTCTAGGCCAGCTTTTGTTCCCT pUDC099.

linker-cbb02-pRS416

TTAGTGAGGGTTAATTGCGCGCTTGGCGTAATCATGGT CATAGC

GACATATCAGCATACATGGCTATGGCACGGCAGACAT TCCAGGCATACTACACAGATTCCATCACTCTTACCAGG Linker fragment for assembly of plasmid linker-cbbM-GroEL

CTAGGACGACCCTACTCATGTATTGAGATCGACGAGA pUDClOO.

TTTCTAGG

Primers used for in vivo integration assembly

GTTGGATC I ^s cloning expression cassette linker

TATTTCAGAGTTCTTCAGACTTCTTAACTCCTGTAAAA fragment between CAN1 upstream and FW pTDH3- HR-CANlup ACAAAAAAAAAAAAAGGCATAGCAAGCTGGAGCTCA PRK expression cassette (IMI229), and

GTTTATC CANlup-linker and KILEU2 expression cassette (IMI232).

AGATATACTGCAAAGTCCGGAGCAACAGTCGTATAAC 1 ^st cloning fragment: linker fragment

RV linker-iHR2B TCGAGCAGCCCTCTACTTTGTTGTTGCGCTAAGAGAAT between CANlup-linker and PRK

GGACC expression cassette (IMI229).

GCTATGACCATGATTACGCCAAGCGCGCAATTAACCC 1 ^st cloning fragment: linker fragment

RV linker-iHR6 TCACTAAAGGGAACAAAAGCTGGTTGCGCTAAGAGAA between CANlup-linker and KILEU2

TGGACC expression cassette (IMI232).

CAACAAAGTAGAGGGCTGCTCGAGTTATACGACTGTT 2 ^nd cloning fragment: GALl _v-PRK-CYCl _t

FW pGALl-prk HR2B GCTCCGGACTTTGCAGTATATCTGCTGGAGCTCTAGTA expression cassette (IMI229) from

CGGATT pUDE046.

GGAATCTGTGTAGTATGCCTGGAATGTCTGCCGTGCCA 2 ^nd cloning fragment: GALlp-PRK-CYCl _t

11 RV CYClt-prk HR2 TAGCCATGTATGCTGATATGTCGTACCGGCCGCAAATT expression cassette (IMI229) from

AAAG pUDE046.

GACATATCAGCATACATGGCTATGG 3 ^rdI cloning fragment: PGIl _v-cbbQ2-

12 FW HR2-cbbQ2-HR3

TEF2t cassette (IMI229).

GGACACGCTTGACAGAATGTCAAAGG 3 ^r cloning fragment: PGIl _v-cbbQ2-

13 RV HR2-cbbQ2-HR3

TEF2t cassette (IMI229).

CGTCCGATATGATCTGATTGG 4*TARI cloning fragment: PGK1 _V-

14 FW HR3-cbb02-HR4

cbb02-ADHl _t cassette (IMI229).

CCTAGAAATCTCGTCGATCTC 4 ^th cloning fragment: PGKl _v-cbb02-

15 RV HR3-cbb02-HR4

ADHl _t cassette (IMI229).

ATCACTCTTACCAGGCTAGG 5 ^th cloning f gmenV.TEFl _v-groEL-ACTl _t

16 FW HR4-GroEL-HR5

cassette (IMI229).

CTGGACCTTAATCGTGTGCGCATCCTC 5 ^th cloning fragment: TEFl _v-groEL-

17 RV HR4-GroEL-HR5

ACTl _t cassette (IMI229).

CCGTATAGCTTAATAGCCAGCTTTATC 6 ^th cloning fragment: TPIl _v-groES-PGIl _t

18 FW HR5-GroES-HR6

cassette (IMI229).

GCTATGACCATGATTACGCCAAGC 6 ^th cloning fragment: TPIl _v-groES-PGIl _t

19 RV HR5 -GroES -HR6

cassette (IMI229).

CCAGCTTTTGTTCCCTTTAGTGAGGGTTAATTGCGCGC 7* (ΙΜΙ229) or 2 ^nd (IMI232) cloning

20 FW HR6-LEU2-CANldwn TTGGCGTAATCATGGTCATAGCCTGTGAAGATCCCAG fragment: KILEU2 cassette from pUG73.

CAAAG

AGCTCATTGATCCCTTAAACTTTCTTTTCGGTGTATGA 7* (ΙΜΙ229) or 2 ^nd (IMI232) cloning CTTATGAGGGTGAGAATGCGAAATGGCGTGGAAATGT fragment: KILEU2 cassette from pUG73.

21 RV LEU2 HR-CAN1

GATCAAAGGTAATAAAACGTCATATATCCGCAGGCTA ACCGGAAC

Primers used for verification the in vivo assembled constructs

Diagnostic for assembly of plasmids

22 m-PCR-HRl-FW GGCGATTAAGTTGGGTAACG

pUDC075, pUDC099, and pUDClOO,. Diagnostic for assembly of plasmids

23 m-PCR-HRl-RV AACTGAGCTCCAGCTGTACC pUDC075, pUDC099, pUDClOO, and integration in strain IMI229.