Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
RECOMBINANT YEAST STRAINS FOR PENTOSE FERMENTATION
Document Type and Number:
WIPO Patent Application WO/2018/114973
Kind Code:
A1
Abstract:
Described herein are recombinant yeast cells expressing a xylulose kinase (XK) whihch are suitable for fermentation of pentoses. Also described are recombinant yeast cells with higher toleranance to formic and/or acetic acid and suitable for fermentation of pentoses. Also described are recombinant yeast cells expressing an enolase, a phophofructokinase beta subunit, a 6-phosphofructo-2-kinase, a glucose-6-phosphate isomerase, a phosphoglycerate mutase and/or a triose-phosphate isomerase, and suitable for fermentation of pentoses. Also described are recombinant yeast cells expressing a a phosphoglucomutase and/or phosphoribomutase which are suitable for fermentation of pentoses. Further described are are methods of using or producing such recombinant yeast cells and related materials.

Inventors:
KARHUMAA KAISA (SE)
SENDELIUS MALIN (SE)
SÁNCHEZ I NÓGUE VIOLETA (SE)
Application Number:
PCT/EP2017/083586
Publication Date:
June 28, 2018
Filing Date:
December 19, 2017
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NOVOZYMES AS (DK)
International Classes:
C12N9/12; C12N15/52; C12P7/10; C12R1/865
Domestic Patent References:
WO2010039692A22010-04-08
WO2012045088A22012-04-05
WO2009017441A12009-02-05
WO2010059095A12010-05-27
WO2012135110A12012-10-04
WO1995013362A11995-05-18
WO1995017413A11995-06-29
WO1995022625A11995-08-24
WO1992006204A11992-04-16
WO2011078262A12011-06-30
WO2012009272A22012-01-19
WO2010059095A12010-05-27
Foreign References:
US20120309093A12012-12-06
JP2015177760A2015-10-08
US20090246857A12009-10-01
US20120270289A12012-10-25
US5223409A1993-06-29
US20120184020A12012-07-19
EP0073657A11983-03-09
US4931373A1990-06-05
Other References:
C. GUO ET AL: "Cloning and molecular characterization of a gene coding d-xylulokinase (CmXYL3) from Candida maltosa", JOURNAL OF APPLIED MICROBIOLOGY., vol. 101, no. 1, 1 July 2006 (2006-07-01), GB, pages 139 - 150, XP055459154, ISSN: 1364-5072, DOI: 10.1111/j.1365-2672.2006.02915.x
D. J. WOHLBACH ET AL: "Comparative genomics of xylose-fermenting fungi for enhanced biofuel production", PROCEEDINGS NATIONAL ACADEMY OF SCIENCES PNAS, vol. 108, no. 32, 25 July 2011 (2011-07-25), US, pages 13212 - 13217, XP055459033, ISSN: 0027-8424, DOI: 10.1073/pnas.1103039108
KUYPER M ET AL: "Metabolic engineering of a xylose-isomerase-expressing Saccharomyces cerevisiae strain for rapid anaerobic xylose fermentation", FEMS YEAST RESEARCH, WILEY-BLACKWELL PUBLISHING LTD, GB, NL, vol. 5, no. 4-5, 26 November 2004 (2004-11-26), pages 399 - 409, XP027680444, ISSN: 1567-1356, [retrieved on 20050201]
MOLLAPOUR; PIPER, MOLECULAR MICROBIOLOGY, vol. 42, no. 4, 2001, pages 919 - 930
CUNNINGHAM; WELLS, SCIENCE, vol. 244, 1989, pages 1081 - 1085
HILTON ET AL., J. BIOL. CHEM., vol. 271, 1996, pages 4699 - 4708
DE VOS ET AL., SCIENCE, vol. 255, 1992, pages 306 - 312
SMITH ET AL., J. MOL. BIOL., vol. 224, 1992, pages 899 - 904
WLODAVER ET AL., FEBS LETT, vol. 309, 1992, pages 59 - 64
BOWIE ET AL.: "Residues that are directly involved in protein functions such as binding or catalysis will certainly be among the most conserved", SCIENCE, vol. 247, 1990, pages 1306 - 1310
REIDHAAR-OLSON; SAUER, SCIENCE, vol. 241, 1988, pages 53 - 57
BOWIE; SAUER, PROC. NATL. ACAD. SCI., vol. 86, 1989, pages 2152 - 2156
LOWMAN ET AL., BIOCHEMISTRY, vol. 30, 1991, pages 10832 - 10837
DERBYSHIRE ET AL., GENE, vol. 46, 1986, pages 145
NER ET AL., DNA, vol. 7, 1988, pages 127
NESS ET AL., NATURE BIOTECHNOLOGY, vol. 17, 1999, pages 893 - 896
VAN WINDEN ET AL., FEMS YEAST RESEARCH, vol. 5, 2005, pages 559 - 568
SAMBROOK, J.; FRITSCH, E. F.; MANIATIS, T.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LABORATORY PRESS
SILHAVY, T. J.; BENNAN, M. L.; ENQUIST, L. W.: "Experiments with Gene Fusions", 1984, COLD SPRING HARBOR LABORATORY PRESS
AUSUBEL ET AL.: "Short Protocols in Molecular Biology", 1995, WILEY & SONS
AUSUBEL, F. M. ET AL.: "Current Protocols in Molecular Biology", 1987, GREENE PUBLISHING ASSOC. AND WILEY-INTERSCIENCE
"Guide to Yeast Genetics and Molecular and Cell Biology", vol. 194, 2004, ELSEVIER ACADEMIC PRESS, article "Methods in Enzymology"
SHARP; LI, NUCLEIC ACIDS RESEARCH, vol. 15, 1987, pages 1281 - 1295
JANSEN ET AL., NUCLEIC ACIDS RES., vol. 31, no. 8, 2003, pages 2242 - 51
HAUF ET AL., ENZYM MICROB TECHNOL, vol. 26, 2000, pages 688 - 698
GIETZ; SUGINO: "New yeast vectors constructed with in vitro mutagenized yeast genes lacking six-base pair restriction sites", GENE, vol. 74, 1988, pages 527 - 534, XP000006074, DOI: doi:10.1016/0378-1119(88)90185-0
OKKELS, ANN. NEW YORK ACAD. SCI., vol. 782, 1996, pages 202 - 207
MNAIMNEH ET AL., CELL, vol. 118, no. 1, 2004, pages 31 - 44
WACH, YEAST, vol. 10, 1994, pages 1793 - 1808
METHODS IN ENZYMOLOGY, vol. 194, 1991, pages 281 - 301
CHANG; HO, APPL BIOCHEM BIOTECHNOL., vol. 17, 1988, pages 313 - 318
ELIASSON ET AL., ENZYME AND MICROBIAL TECHNOLOGY, vol. 29, 2001, pages 288 - 297
GIETZ; SCHIESTL, NATURE PROTOCOLS, vol. 2, 2007, pages 31 - 34
HOU, APPL MICROBIOL BIOTECHNOL, vol. 94, 2012, pages 205 - 214
INOUE ET AL., GENE, vol. 96, no. 1, 1990, pages 23 - 28
JIN ET AL., APP ENVIRONMENTAL MICROBIOL, vol. 69, no. 1, 2003, pages 495 - 503
JOHANSSON; HAHN-HAGERDAL, YEAST, vol. 19, no. 3, 2002, pages 225 - 231
JOHANSSON ET AL., APPL ENVIRONMENTAL MICROBIOL, vol. 67, no. 9, 2001, pages 4249 - 4255
KARHUMAA ET AL., MICROB CELL FACT, vol. 5, no. 6, 2007, pages 5
KARHUMAA ET AL., YEAST, vol. 22, no. 5, 2005, pages 359 - 68
KARHUMAA ET AL., YEAST, vol. 26, no. 7, 2009, pages 371 - 82
KIM ET AL., BIOTECHNOL ADV., vol. 31, no. 6, 2013, pages 851 - 61
KUYPER ET AL., FEMS YEAST RES., vol. 5, no. 4-5, 2005, pages 399 - 409
LINDEN T; PEETRE J; HAHN-HAGERDAL B, APPL ENVIRON MICROBIOL, vol. 58, no. 5, 1992, pages 1661 - 9
LONN ET AL., ENZYME MICROB. TECHNOL., vol. 32, no. 5, 2003, pages 567 - 573
MADHAVAN ET AL., APPL MICROBIOL BIOTECHNOL, vol. 82, no. 6, 2009, pages 1067 - 1078
MATSUSHIKA ET AL., ENZYME AND MICROBIAL TECHNOLOGY, vol. 48, 2011, pages 466 - 471
MOLLAPOUR M; PIPER PW, MOL MICROBIOL., vol. 42, no. 4, 2001, pages 919 - 30
MUMBERG D; MULLER R; FUNK M, GENE, vol. 156, no. 1, 14 April 1995 (1995-04-14), pages 119 - 22
NEEDLEMAN; WUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443 - 453
NGUYEN ET AL., MYCOLOGICAL RES, vol. 110, 2006, pages 1232 - 1241
PARACHIN ET AL., METABOLIC ENGINEERING, vol. 13, 2011, pages 508 - 517
RICE ET AL., TRENDS GENET, vol. 16, 2000, pages 276 - 277
RUNQUIST ET AL., APPL. ENVIRON. MICROBIOL., vol. 76, no. 23, 2010, pages 7796 - 7802
SHAMANNA; SANDERSON, J. BACTERIOL., vol. 139, 1979, pages 64 - 70
TIWARI ET AL., BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, vol. 366, 2008, pages 340 - 345
WALTHER ET AL., FEBS LETT, vol. 586, no. 23, 30 November 2012 (2012-11-30), pages 4114 - 8
WOHLBACH ET AL., PROC NATL ACAD SCI, vol. 108, no. 32, 2011, pages 13212 - 7
XU ET AL., MOL SYST BIOL, vol. 9, 2013, pages 665
"NCBI", Database accession no. ADW23548
Download PDF:
Claims:
What is claimed is:

1 . A recombinant Saccharomyces cell which is capable of fermenting xylose and which comprises a heterologous gene encoding a xylulokinase (XK), wherein the XK:

- provides an enzymatic activity for converting D-xylulose to xylulose 5-phosphate at least twice that provided by S. cerevisiae XK (SEQ ID NO: 32), and

- provides for an anaerobic growth rate of the recombinant cell on xylose which is higher than that provided by a S. cerevisiae XK.

2. The recombinant cell of claim 1 , wherein the XK further provides for an aerobic growth rate of the cell on xylose which is higher than that provided by S. cerevisiae XK.

3. The recombinant cell of claim 1 or claim 2, wherein the XK comprises the amino acid sequence of SEQ ID NO: 6, the amino acid sequence of SEQ ID NO: 22 or a catalytically active variant or fragment of any thereof.

4. The recombinant cell of any one claims 1 -3, wherein the XK has a sequence identity of atleast 80% with the amino acid sequence of SEQ ID NO: 6.

5. A vector comprising genes encoding

- an XK comprising the amino acid sequence of SEQ ID NO: 6, the amino acid sequence of SEQ ID NO: 22 or a catalytically active variant or fragment of any thereof,

- an XR, and

- an XDH; and optionally, regulatory sequences for expressing the genes in a 6. A process for producing a recombinant Saccharomyces cell, comprising transforming a Saccharomyces cell with one or more vectors comprising genes encoding

- an XK comprising the amino acid sequence of SEQ ID NO: 6, the amino acid sequence of SEQ ID NO: 22 or a catalytically active variant or fragment of any thereof,

- an XR, and

- an XDH, and optionally, regulatory sequences for expressing the genes in the host cell.

7. A recombinant yeast cell which is capable of fermenting xylose and which comprises a heterologous gene encoding an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant, fragment or yeast ortholog thereof, wherein the Yme2p polypeptide provides for an increased tolerance of the recombinant cell to formic acid, acetic acid, or both.

8. The recombinant yeast cell of claim 7, wherein the Yme2p polypeptide provides for

(a) an increased anaerobic growth on xylose,

(b) an increased xylose consumption rate,

(c) an increased ethanol production rate, or

(d) a combination of two or all of (a) to (c),

of the recombinant cell in the presence of formic acid. 9. The recombinant yeast cell of claim 7 or claim 8, wherein the Yme2p polypeptide has a sequence identity of at least 70% with the amino acid sequence of SEQ ID NO: 50.

10. The recombinant yeast cell of any one of claims 7-9, wherein the Yme2p polypeptide comprises the amino acid sequence of SEQ ID NO: 50.

1 1 . A vector comprising genes encoding

- an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant or fragment thereof,

- an XR,

- an XDH, and

- an XK; and

- optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell. 12. A process for producing a recombinant Saccharomyces cell, comprising transforming a Saccharomyces cell with one or more vectors comprising genes encoding

- an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant or fragment thereof,

- an XR,

- an XDH,

- an XK, and

optionally, regulatory sequences for expressing the genes in the host cell.

13. A method for increasing the tolerance of a Saccharomyces cell to formic acid, comprising transforming the cell with a gene encoding a Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant, fragment or yeast ortholog thereof, and expressing the gene.

14. A recombinant yeast cell which is capable of fermenting xylose and which comprises a heterologous gene encoding

(a) an enolase comprising the amino acid sequence of SEQ ID NO: 132,

(b) a phosphofructokinase beta subunit polypeptide comprising the amino acid sequence of SEQ ID NO: 134,

(c) a 6-phosphofructo-2-kinase comprising the amino acid sequence of SEQ ID NO:

136,

(d) a glucose-6-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 138,

(e) a phosphoglycerate mutase comprising the amino acid sequence of SEQ ID NO:

140,

(f) a triose-phosphate isomerase comprising the amino acid sequence of SEQ ID NO:

142,

(g) a catalytically active variant, fragment or yeast ortholog of any one of (a) to (f), or

(h) a combination of any two or more of (a) to (g).

15. The recombinant yeast cell of claim 14, wherein the heterologous gene or combination provides for an increased anaerobic growth rate on xylose, an increased aerobic growth rate on xylose, an increased ethanol production from xylose, or a combination of any two or all thereof.

16. A vector comprising genes encoding

- a polypeptide comprising the amino acid sequence of SEQ ID NO: 132, 134, 136, 138 140 or 142 or a catalytically active variant, fragment or yeast ortholog of any thereof,

- an XR,

- an XDH, and

- an XK; and

optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell.

17. A process for producing a recombinant Saccharomyces cell, comprising transforming a Saccharomyces cell with one or more vectors comprising genes encoding

- a polypeptide comprising the amino acid sequence of SEQ ID NO: 132, 134, 136, 138 140 or 142 or a catalytically active variant or fragment of any thereof,

- an XR,

- an XDH,

- an XK, and, optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell.

18. A recombinant yeast cell which is capable of fermenting xylose and which comprises a heterologous gene encoding a phosphoglucomutase and/or phosphoribomutase comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically active variant or fragment thereof.

19. The recombinant yeast cell of claim 18, wherein the heterologous gene provides for an increased anaerobic growth rate on xylose, an increased xylose consumption, an increased ethanol production from xylose, or a combination of any two or all thereof.

20. The recombinant yeast cell of claim 18 or claim 19, wherein the catalytically active variant has a sequence identity of at least 30%, at least 80%, at least 90% or at least 95% with the amino acid sequence of SEQ ID NO: 150.

21 . The recombinant yeast cell of any one of claims 18-20, wherein the phosphoglucomutase and/or phosphoribomutase comprises the amino acid sequence of SEQ ID NO: 150.

22. A vector comprising genes encoding

- a polypeptide comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically active variant, fragment or yeast ortholog thereof,

- an XR,

- an XDH, and

- an XK; and

optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell.

23. A process for producing a recombinant Saccharomyces cell, comprising transforming a Saccharomyces cell with one or more vectors comprising genes encoding

- a polypeptide comprising the amino acid sequence ofSEQ ID NO: 150 or a catalytically active variant or fragment thereof,

- an XR,

- an XDH,

- an XK, and,

optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell.

24. The recombinant yeast cell of any one of claims 1 -4, 7-10, 14, 15, or 18-21 , which is derived from a Saccharomyces, Rhodotorula Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodospohdium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell. 25. The recombinant cell of claim 24, which is derived from a Saccharomyces cerevisiae cell.

26. The recombinant cell of any one of claims 1 -4, 7-10, 14, 15, 18-21 , or 24, which comprises a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI), and

(a) a heterologous gene encoding a xylose reductase (XR) and an heterologous gene encoding a xylitol dehydrogenase (XDH), and/or

(b) a heterologous gene encoding a xylose isomerase (XI). 27. A method for producing a fermentation product, comprising

(a) contacting the recombinant cell of any one of claims 1 -4, 7-10, 14, 15, 18-21 , or 24 with a medium comprising a carbon source comprising xylose or arabinose under anaerobic conditions, and

(b) isolating the fermentation product from the medium.

Description:
RECOMBINANT YEAST STRAINS FOR PENTOSE FERMENTATION

Field of the Invention

Described herein are genetically modified recombinant yeast cells and strains capable of fermenting pentoses, as well as to the preparation and use of such cells and strains. Reference to a Sequence Listing

This application contains a Sequence Listing in computer readable form, which is incorporated herein by reference.

Background

Bioethanol production from renewable feedstock by baker's yeast Saccharomyces cerevisiae has become an attractive alternative to fossil fuels, but the use of lignocellulosic feedstocks for such purposes poses challenges. For example, a substantial fraction of lignocellulosic material consists of pentoses such as xylose and arabinose. Native Saccharomyces species cannot ferment these pentoses, but genetic engineering techniques to provide Saccharomyces with this ability are now well-established (Kim et al., 2013). These include heterologous expression of xylose reductase (XR) and xylitol dehydrogenase (XDH) from naturally xylose fermenting yeasts such as Scheffersomyces (Pichia) stipitis and various Candida species, as well as the overexpression of xylulokinase (XK) and the four enzymes in the non-oxidative pentose phosphate pathway (PPP), namely transketolase (TKL), transaldolase (TAL), ribose-5-phosphate ketol-isomerase (RKI) and D-ribulose-5-phosphate 3- epimerase (RPE). Modifying the co-factor preference of S. stipitis XR towards NADH in such systems has been found to provide metabolic advantages as well as improving anaerobic growth, and replacing the XR/XDH with heterologous XI has been reported to reduce unwanted xylitol by-product. These and other modifications have been described in, e.g., WO2009/017441 , WO2010/059095, WO2012/1351 10, Karhumaa et al., 2005; Karhumaa et al. , 2007; Kuyper et al. , 2005).

Some degree of overexpression of XK is generally considered necessary for ethanol formation, directing the xylose metabolism towards central metabolism (Chang and Ho, 1988; Eliasson et al., 2001 ), and XK from various species, e.g., S. cerevisiae, E. coli, Pichia stipitis and P. tannophilus, have been used or proposed (WO 95/13362; US 2009/0246857). It has also been reported that XK reduces the production of unwanted xylitol and acetate byproducts (Johansson et al., 2001 ; Parachin et al., 201 1 ; Matsushika et al., 201 1 ). Several investigators have concluded, however, that it is necessary to limit the levels of XK since XK overexpression inhibited growth of S. cerevisiae on xylose (Jin et al., 2003; Matsushika et al., 201 1 ), reduced xylose consumption (Johansson et al., 2001 ), or drained ATP (Eliasson et al., 2001 ), suggesting that moderate or low XK levels are needed for optimal xylose fermentation. A lower XK activity could, however, limit the metabolic flux.

Some yeast species are naturally capable of fermenting xylose, e.g., Pichia stipites, Spathaspora passalidarum, Candida jeffriesii and Candida tenuis (Nguyen et al., 2006; US 2012/270289 A1 ; Wohlbach et al., 201 1 ). Wohlbach and co-workers applied a comparative genomic approach to identify genes involved in xylose metabolism in Spathaspora passalidarum and Candida tenuis, and found that a Cten aldo/keto reductase, CtAKR, significantly improved xylose consumption in engineered S. cerevisiae strains during both aerobic and anaerobic growth, although this did not result in improved ethanol production. Hou (2012) then found that the ability of Spathaspora passalidarum to utilize xylose under anaerobic conditions was possibly due to the balance of cofactors in the XR-XDH pathway.

Lignocellulosic hydrolysates contain complex mixtures of other compounds, many of which are inhibitory to microbial fermentation, growth or viability. When the lignocellulosic material is heated during pretreatment, some of the sugars are dehydrated to furans such as furfural (from pentoses) and HMF (from hexoses), which are toxic to most micro-organisms. Further, when the hemicellulose is hydrolysed to release the monomeric sugars, acetic acid is formed by the deacetylation of this fraction. More acids are formed if the lignocellulosic hydrolysate containing furfural and HMF is further heated, since these compounds can degrade into formic and levulinic acids, which are even more potent inhibitors of microorganisms than acetic acid. The toxicity and acidity of the pretreated and hydrolysed lignocellulosic material presents a strong limitation on the fermenting micro-organism.

WO 2009/017441 describes a mutant of alcohol dehydrogenase (ADH1 ) from S. cerevisiae, which was capable of reducing HMF.

Mollapour and Piper (Molecular Microbiology (2001 ) 42(4):919-930) describes that the

YME2p gene from the food spoilage yeast Zygosaccharomyces bailii, heterologously expressed in S. cerevisiae cells, could enable growth of the latter on benzoate, sorbate and phenylalanine.

For the purpose of further examining the factors improving xylose utilization, Karhumaa et a/.(2009) compared the proteome of mutant S. cerevisiae strain TMB 3400, which has good xylose fermentation properties, with that of its parental strain. Although the level of acetaldehyde dehydrogenase (Ald6) and some other proteins were found to be increased, the most significant changes detected by proteome analysis were 6-10-fold increased levels of XR, XDH and TKL1 in the mutant, which was in accordance with previous knowledge from rational engineering of xylose metabolism in yeast.

WO 2010/059095 describes that increased levels of phosphoglucomutase obtained by constitutive overexpression of the PGM2 gene with a constitutive promoter improved ethanol production from xylose.

Tiwari et al. (2008) describes that PGM1 , PGM2 encode minor and major isozymes of phosphoglucomutase, and further notes that the protein product of YMR278w exhibits phosphoglucomutase activity, designating YMR278w as PGM3. PGM3 has now, however, also been named PRM15 (Xu et al., 2013, Walther et al., 2012), since the enzyme, apart from phosphoglucomutase activity, also has a significant phosphoribomutase activity (Xu et al., 2013).

Despite these and other advances in the art, there is still a need for improved yeast strains providing for cost-effective production of ethanol and other fermentation products from pentoses such as xylose.

Summary

Described herein are improved recombinant yeast cells for production of ethanol and other fermentation products. This is based, at least in part, on the discovery of xylulokinases (XKs) which, even if present or expressed at high XK activities, have one or more improved properties such as high aerobic or anaerobic growth rates, increased xylose consumption and/or improved ethanol production.

Accordingly, in one aspect is a recombinant yeast cell, such as a recombinant Saccharomyces cell, which is capable of fermenting xylose and which comprises a heterologous gene encoding a xylulokinase (XK), wherein the XK can have an enzymatic activity for converting D-xylulose to xylulose 5-phosphate at least twice that of S. cerevisiae XK (SEQ ID NO: 32), and can provide for an anaerobic growth rate of the recombinant cell on xylose which is higher than that provided by a S. cerevisiae XK, or both. In one embodiment, the XK can further provide for an aerobic growth rate of the cell on xylose which is higher than that provided by S. cerevisiae XK. Representative assays for measuring these activities are provided by the Examples. For example, the enzymatic activity for converting D-xylulose to xylulose 5-phosphate can be measured according to Example 7, and the anaerobic growth rate can be measured according to Example 9.

In one aspect is a recombinant yeast cell capable of fermenting xylose and which comprises an heterologous gene encoding a xylulokinase (XK) comprising the amino acid sequence of SEQ ID NO: 6, SEQ I D NO: 22, or a catalytically active variant or fragment of any thereof.

In one embodiment of any preceding aspect or embodiment, the recombinant cell is derived from a Saccharomyces, Rhodotorula, Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell. In one embodiment, the recombinant cell is derived from a Saccharymyces cell, such as a Saccharomyces cerevisiae, bayanus or carlsbergensis cell. In a particular embodiment, the recombinant cell is derived from a Saccharomyces cerevisiae cell.

In one embodiment, the recombinant yeast cell comprises a heterologous gene encoding a xylose reductase (XR), a heterologous gene encoding a xylitol dehydrogenase (XDH), and/or a heterologous gene encoding a xylose isomerase (XI). In one embodiment, the XR is Pichia stipitis XR or an NADH-preferring variant thereof. In one embodiment, the XDH is Pichia stipitis XDH or a catalytically active variant thereof. In a particular embodiment, the XR is Pichia stipitis XR comprising one or more amino acid substitutions selected from N272D, K270R and P275Q. The cell may also comprise a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and/or a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI). In some embodiments, the recombinant yeast cell is isolated.

In one aspect is a vector comprising a gene encoding a xylulokinase (XK) comprising the amino acid sequence of SEQ ID NO: 6, SEQ ID NO: 22 or a catalytically active variant or fragment of any thereof.ln some embodiments, the vector further comprises a gene encoding a xylose reductase (XR), a gene encoding a xylitol dehydrogenase (XDH), and/or a gene encoding a xylose isomerase (XI). The vector may further comprise regulatory sequences for expressing the genes in a yeast host cell such as a Saccharomyces or Saccaromyces cerevisiae host cell.

In one aspect, is a process for producing a recombinant cell described herein (e.g., a Saccharomyces cell), comprising transforming the cell with a heterologous gene (e.g., a vector) that encodes a xylulokinase (XK) comprising the amino acid sequence of SEQ ID NO: 6, the amino acid sequence of SEQ ID NO: 22 or a catalytically active variant or fragment of any thereof.

In one embodiment of any preceding aspect or embodiment, the XK comprises the amino acid sequence of SEQ ID NO: 6, or a catalytically active variant or fragment thereof. For example, the XK may have a sequence identity of at least 80% with the amino acid sequence of SEQ ID NO: 6, such as at least 90%, 95%, 97%, 98%, or 99% with the amino acid sequence of SEQ ID NO: 6. In a particular embodiment, the XK comprises the amino acid sequence of SEQ ID NO: 6.

In an alternative embodiment of any preceding aspect or embodiment, the XK the amino acid sequence comprises SEQ ID NO: 22, or a catalytically active variant or fragment thereof. For example, the XK may have a sequence identity of at least 80% with SEQ ID NO: 22, such as at least 90%, 95%, 97%, 98%, or 99% with the amino acid sequence of SEQ ID NO: 22. In a particular embodiment, the XK comprises the amino acid sequence of SEQ ID NO: 22.

Also described herein are improved recombinant yeast cells more tolerant to e.g. formic acid and acetic acid, useful for production of ethanol and other fermentation products from fermentation media derived from lignocellulosic hydrolysates. The invention is based, at least in part, on the discovery that expression of the YME2p gene product in a recombinant yeast cell increases the tolerance of the cell to inhibitors such as formic acid, providing for improved anaerobic growth rates and more cost-efficient production of fermentation products such as ethanol.

Accordingly, in one aspect is a recombinant yeast cell which is capable of fermenting xylose and which comprises a heterologous gene encoding an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant, fragment or yeast ortholog thereof, wherein the Yme2p polypeptide can provide for an increased tolerance of the recombinant cell to formic acid, acetic acid, or both. In separate and specific embodiments, the Yme2p polypeptide can provide for an increased anaerobic growth rate on xylose, an increased xylose consumption rate, an increased ethanol production rate, or a combination of two or more of any thereof, of the recombinant cell in the presence of formic acid. In other separate and specific embodiments, the Yme2p polypeptide can provide for an increased xylose consumption rate, an increased ethanol production rate, or a combination thereof, of the recombinant cell in the presence of acetic acid. Representative assays for measuring these activities are provided by the Examples. For example, the anaerobic growth, xylose consumption, ethanol production rate, or combination in the presence of formic acid can measured according to Example 14, and the xylose consumption rate, ethanol production rate or combination in the presence of acetic acid can be measured according to Example 13.

In one embodiment, the recombinant cell is derived from a Saccharomyces, Rhodotorula, Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell. In one embodiment, the recombinant cell is derived from a Saccharymyces cell, such as a Saccharomyces cerevisiae, bayanus or carlsbergensis cell. In a particular embodiment, the recombinant cell is derived from a Saccharomyces cerevisiae cell.

In one embodiment, the recombinant yeast cell comprises a heterologous gene encoding a xylose reductase (XR), a heterologous gene encoding a xylitol dehydrogenase (XDH), and/or a heterologous gene encoding a xylose isomerase (XI). In one embodiment, the XR is Pichia stipitis XR or an NADH-preferring variant thereof. In one embodiment, the XDH is Pichia stipitis XDH or a catalytically active variant thereof. In a particular embodiment, the XR is Pichia stipitis XR comprising one or more amino acid substitutions selected from N272D, K270R and P275Q. The cell may also comprise a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and/or a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI). In some embodiments, the recombinant yeast cell is isolated.

In one aspect is a vector comprising a gene encoding an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant or fragment thereof. In some embodiments, the vector further comprises a gene encoding a xylose reductase (XR), a gene encoding a xylitol dehydrogenase (XDH), a gene encoding a xylulose kinas (XK), and/or a gene encoding a xylose isomerase (XI). The vector may also comprise regulatory sequences for expressing the genes in a Saccharomyces host cell.

In one aspect is a process for producing a recombinant cell described herein (e.g., a

Saccharomyces cell), comprising transforming the cell with a heterologous gene (e.g., a vector) that encodes an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant or fragment thereof.

In one aspectis a method for increasing the tolerance of a cell descrdibed herein (e.g., a yeast cell such as a Saccharomyces cell) to formic acid, comprising transforming the cell with a gene (e.g., a vector) encoding a Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant, fragment or yeast ortholog thereof, and expressing the gene. The gene can, for example, be operably linked to an inducible, a regulated or a constitutive promoter.

In one embodiment of any preceding aspect or embodiment, the Yme2p polypeptide has a sequence identity of at least 70% with the amino acid sequence of SEQ ID NO: 50, such as at least 80%, at least 90%, at least 95%, or at least 97%, or at least 98%, or at least 99% identity. In one embodiment, the Yme2p polypeptide comprises the amino acid sequence of SEQ ID NO: 50.

Also described herein are improved recombinant yeast cells for production of ethanol and other fermentation products. This is based, at least in part, on the discovery of genes providing one or more improved properties such as a higher aerobic or anaerobic growth rates, increased xylose consumption and/or improved ethanol production.

Accordingly, in one aspect is a recombinant yeast cell, such as a recombinant Saccharomyces cell, which is capable of fermenting xylose and which comprises a heterologous gene encoding an enolase comprising the amino acid sequence of SEQ ID NO: 132, a phosphofructokinase beta subunit polypeptide comprising the amino acid sequence of SEQ ID NO: 134, a 6-phosphofructo-2-kinase comprising the amino acid sequence of SEQ ID NO: 136, a glucose-6-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 138, a phosphoglycerate mutase comprising the amino acid sequence of SEQ ID NO: 140, a triose-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 142, a catalytically active variant, fragment or yeast ortholog of any one of the aforementioned enzymes, or a combination of any two or more of the aforementioned enzymes and catalytically active variants, fragments or yeast orthologs thereof. In one embodiment, the heterologous gene or combination can provide for an increased anaerobic growth rate on xylose, an increased aerobic growth rate on xylose, an increased ethanol production from xylose, or a combination of any two or all thereof. Representative assays for measuring these activities are provided by the Examples. The anaerobic growth rate can, for example, be measured according to Example 23, the aerobic growth according to Example 17, and/or the ethanol production according to Example 24.

In one embodiment, the recombinant yeast cell is derived from a Saccharomyces, Rhodotorula, Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell. In one embodiment, the recombinant cell is derived from a Saccharymyces cell, such as a Saccharomyces cerevisiae, bayanus or carlsbergensis cell. In a particular embodiment, the recombinant cell is derived from a Saccharomyces cerevisiae cell.

In one embodiment, the recombinant yeast cell comprises a heterologous gene encoding a xylose reductase (XR), a heterologous gene encoding a xylitol dehydrogenase (XDH), a heterologous gene encoding a xylulose kinase (XK) and/or a heterologous gene encoding a xylose isomerase (XI). In one embodiment, the XR is Pichia stipitis XR or an NADH-preferring variant thereof. In one embodiment, the XDH is Pichia stipitis XDH or a catalytically active variant thereof. In a particular embodiment, the XR is Pichia stipitis XR comprising one or more amino acid substitutions selected from N272D, K270R and P275Q. The cell may also comprise a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and/or a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI). In some embodiments, the recombinant yeast cell is isolated.

In one aspect is a vector comprising a gene encoding a polypeptide comprising the amino acid sequence of SEQ ID NO: 132, 84, 86, 88, 90 or 92 or a catalytically active variant, fragment or yeast ortholog of any thereof. In some embodiments, the vector further comprises a gene encoding a xylose reductase (XR), a gene encoding a xylitol dehydrogenase (XDH), a gene encoding a xylulose kinase (XK), and/or a gene encoding a xylose isomerase (XI). The vector may further comprise regulatory sequences for expressing the genes in a Saccharomyces host cell.

In one aspect is a process for producing a recombinant cell described herein (e.g., a yeast cell such as a Saccharomyces cell), comprising transforming the cell with a heterologous gene (e.g., a vector) that encodes an polypeptide comprising the amino acid sequence of SEQ ID NO: 132, 134, 136, 138, 140 or 142 or a catalytically active variant or fragment of any thereof.

In one embodiment of any preceding aspect or embodiment, the gene encodes an enolase comprising the amino acid sequence of SEQ ID NO: 132 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the gene encodes a phosphofructokinase beta subunit polypeptide comprising the amino acid sequence of SEQ ID NO: 134 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the gene encodes a 6-phosphofructo-2-kinase comprising the amino acid sequence of SEQ ID NO: 136 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the gene encoding a glucose-6-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 138 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the gene encodes a phosphoglycerate mutase comprising the amino acid sequence of SEQ ID NO: 140 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the gene encodes a triose-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 142 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the catalytically active variant or yeast ortholog of has a least 50% sequence identity to the amino acid sequence of SEQ I D NO: 132, 134, 136, 138, 140 or 142, such as at least 70%, at least 80%, at least 90%, at least 95%, at least 97%, at least 98%, or at least 99% sequence identity to SEQ ID NO: 132, 84, 86, 88, 90 or 92.

Also described herein are improved recombinant yeast cells for production of ethanol and other fermentation products. This is based, at least in part, on the discovery that overexpression of PGM3 (SEQ ID NO: 150) provides one or more improved properties such as a higher aerobic or anaerobic growth rates, increased xylose consumption and/or improved ethanol production.

Accordingly, in one aspect, is a recombinant yeast cell which is capable of fermenting xylose and which comprises a heterologous gene encoding a phosphoglucomutase and/or phosphoribomutase comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically active variant or fragment thereof. In separate and specific embodiments, the heterologous gene can provide for an increased anaerobic growth rate on xylose, an increased xylose consumption, an increased ethanol production from xylose, or a combination of any two or all thereof. Representative assays for measuring these activities are provided by the Examples.

For example, the anaerobic growth rate can be measured according to Example 26, the anaerobic growth can be measured according to Example 27, and/or the ethanol production can be measured according to Example 27.

In one embodiment, the recombinant cell is derived from a Saccharomyces,

Rhodotorula, Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell. In one embodiment, the recombinant cell is derived from a Saccharymyces cell, such as a Saccharomyces cerevisiae, bayanus or carlsbergensis cell. In a particular embodiment, the recombinant cell is derived from a Saccharomyces cerevisiae cell.

In one embodiment, the recombinant yeast cell comprises a heterologous gene encoding a xylose reductase (XR), a heterologous gene encoding a xylitol dehydrogenase (XDH), a heterologous gene encoding a xylulose kinase (XK) and/or a heterologous gene encoding a xylose isomerase (XI). In one embodiment, the XR is Pichia stipitis XR or an NADH-preferring variant thereof. In one embodiment, the XDH is Pichia stipitis XDH or a catalytically active variant thereof. In a particular embodiment, the XR is Pichia stipitis XR comprising one or more amino acid substitutions selected from N272D, K270R and P275Q. The cell may also comprise a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and/or a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI). In some embodiments, the recombinant yeast cell is isolated.

In one aspectis a vector comprising a gene encoding a polypeptide comprising a phosphoglucomutase and/or phosphoribomutase comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically active variant, fragment or yeast ortholog thereof. In some embodiments, the vector further comprises a gene encoding a xylose reductase (XR), a gene encoding a xylitol dehydrogenase (XDH), a gene encoding a xylulose kinase (XK), and/or a gene encoding a xylose isomerase (XI). The vector can also comprise regulatory sequences for expressing the genes in a Saccharomyces host cell.

In one aspectis a process for producing a recombinant cell described herein (e.g., a yeast cell such as a Saccharomyces cell), comprising transforming the cell with a heterologous gene (e.g., a vector) that encodes a phosphoglucomutase and/or phosphoribomutase comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically active variant or fragment thereof.

In one embodiment of any preceding aspect or embodiment, the phosphoglucomutase and/or phosphoribomutase has a sequence identity of at least 30%, at least 50%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 97%, or at least 98%, or at least 99% with the amino acid sequence of SEQ ID NO: 150. In one embodiment, the phosphoglucomutase and/or phosphoribomutase comprises the amino acid sequence of SEQ ID NO: 150.

In any preceding aspect or embodiment, each gene can be operably linked to an inducible, a regulated or a constitutive promoter, and/or can optionally be integrated into the genome of the cell.

In one aspect is a strain or clone comprising the recombinant cell or vector(s) of any of the aspects or embodiments.

In one aspect is a method for producing a fermentation product, comprising contacting any recombinant yeast cell, strain or clone described herein with a medium comprising a carbon source comprising a pentose such as xylose or arabinose under anaerobic conditions, and isolating the fermentation product from the medium. This method is suitable for producing, for example, a fermentation product comprising at least one of ethanol, butanol, isobutanol, isopentanol, lactate, isoamylacetate, glycerol, sorbitol, mannitol, xylitol, arabinitol; 3- hydroxybutyrolactone; hydrogen gas; L-ascorbic acid, 2,5-furan dicarboxylic acid, 3-hydroxy propionic acid, aspartic acid, glutaric acid, glutamic acid, itaconic acid, levulinic acid; succinic acid, fumaric acid, malic acid or other 1 ,4-diacid; a fatty acid, a fatty acid derived molecule; an isoprenoid, an isoprenoid-derived molecule; and an alkane. In particular, the method is suitable for producing a fermentation product comprising ethanol from a carbon source comprising xylose.

These and other aspects and embodiments are described in more detail below.

Brief Description of the Figures

Fig. 1 : Curves for the aerobic growth on xylose for C5LTe1035 (diamonds) and C5LTe1042 (triangles).

Fig. 2: Curves over anaerobic growth of strain C5LTe1042 ("+") compared with control strain C5LTe1035 ("-").

Fig. 3: Curves over anaerobic growth of strain C5LTe1043 ("+") compared with control strain C5LTe1035 ("-").

Fig. 4: Curves over anaerobic growth of strain C5LTe1040 ("+") compared with control strain C5LTe1035 ("-").

Fig. 5: Graph showing xylose fermentation by strains C5LTe1035 (dashed line) and

C5LTe1042 (solid line). Glucose: circles, Xylose: squares, Xylitol: stars, Biomass: line.

Fig. 6: Graph showing xylose fermentation by strains C5LTe1036 (dashed line) and C5LTe1048 (solid line). Glucose: circles, Xylose: squares, Xylitol: stars, Biomass: line.

Fig. 7: Graph showing xylose fermentation by 5 strains C5LTe1204 (dashed line) and C5LTe1208 (solid line). Glucose: circles, Xylose: squares, Xylitol: stars, Biomass: line.

Fig. 8: Graph showing a summary of xylose consumption and ethanol production in anaerobic xylose fermentation of 50 g/l xylose and 20 g/l glucose in mineral medium within 72 hours of fermentation.

Fig. 9: Graph showing change in OD (620nm) in microplate experiments of cells growing on xylose in the presence of various concentrations of acetic (A) and formic (B) acids (g/g, vertical axis).

Fig. 10: Graph showing fermentation results of fermentation of xylose in the presence of acetic acid with strains C5LTe1202 (A) and C5LTe1212 (B).

Fig. 1 1 : Graph showing fermentation profiles of strains C5LTe1202 and C5LTe1212 in mineral medium in the presence of acetic acid.

Fig. 12: Graph showing overlay of xylose consumption (A) and ethanol production (B) 5 profiles of strains C5LTe1202 and C5LTe1212.

Fig. 13: Graph showing fermentation profiles of strains C5Lte1202 (A) and C5Lte1212

(B) in mineral medium in the presence of formic acid.

Fig. 14: Graph showing fermentation results of fermentation of xylose in the presence of formics acid with strains C5LTe1202 and C5LTe1212.

i o Fig. 15: Graph showing overlay of xylose consumption (A) and ethanol production (B) profiles of strains C5LTe1202 and C5LTe1212.

Fig. 16: Graphic representation of the growth characteristics (measured as change in OD at 620 nm) of strains C5LTe1202 (upper curve) and C5LTe1212 (lower curve).

Fig. 17: Graph showing aerobic growth on xylose of clones expressing various 15 glycolytic genes. Normalized OD (620 nm) for clones with the expressed genes is shown.

Fig. 18: Graph showing anaerobic growth on xylose of clones expressing various glycolytic genes. Normalized OD (620 nm) for clones with the expressed genes is shown, the black bars and the striped bars representing results from two independent experiments.

Fig. 19: Graph showing anaerobic growth of C5LTe1051 ("x") compared with control 20 strain C5LTe1035 ("-").

Fig. 20: Graph showing anaerobic growth of C5LTe1052 compared with control strain C5LTe1035.

Fig. 21 : Graph showing anaerobic growth of C5LTe1054 compared with control strain C5LTe1035.

25 Fig. 22: Graph showing anaerobic growth of C5LTe1055 compared with control strain

C5LTe1035.

Fig. 23: Fermentation graphs of xylose fermentation by strains MC2 (A), MC3 (B), MC4

(C) , MC1 1 (D), MC14 (E) and MC22 (F) compared with control strain (G). Symbols: diamond - xylose, square - xylitol, triangle - glycerol, star - ethanol.

30 Fig. 24: Graphic representation of anaerobic growth on xylose of control strain and strains carrying PGM1 and PGM3.

Fig. 25: Graphic representation of consumed xylose after fermentation in mineral medium for 120h with control strain and strains carrying PGM1 and PGM3.

Fig. 26: Graphic representation of produced ethanol after fermentation in mineral 35 medium for 120h with control strain and strains carrying PGM1 and PGM3.

Definitions The term "pathway", "biometabolic pathway" and the like herein refers to an enzymatic pathway present in a cell, typically a yeast cell, which converts or processes an initial substrate to an intermediate or a final product in a series of enzyme-catalyzed reactions.

The term "gene" refers to a nucleic acid sequence that is capable of being expressed as a specific protein, such as an enzyme, optionally including regulatory sequences preceding (5' non-coding sequences) and following (3' non-coding sequences) the coding sequence.

The term "transformation" refers to the transfer of a nucleic acid sequence such as, e.g., a gene, into a host cell, typically a yeast host cell, resulting in genetically stable inheritance.

As used herein, "recombinant" refers to a host cell into which a nucleic acid sequence, such as a gene, has been transferred, typically by transformation.

A "yeast" is any of various small, single-celled eukaryotic fungi of the phylum

Ascomycota that reproduce by fission or budding, and that are capable of fermenting carbohydrates into alcohol and carbon dioxide. Preferably, a yeast cell as used herein refers to a cell of a genus selected form the group consisting of Saccharomyces, Rhodotorula,

Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida,

Yarrowia, Lipomyces, Cryptococcus, and Dekkera sp.

A metabolic pathway, protein, polypeptide, enzyme, nucleic acid sequence or gene may be "heterologous" or "foreign" to a host cell, meaning that the pathway, enzyme, nucleic acid sequence or gene is not normally found in the host cell, typically a yeast host cell of a specific taxonomic classification. "Endogenous" refers to a pathway, protein, polypeptide, enzyme, nucleic acid or gene normally present in the host cell, typically a yeast host cell of a specific taxonomic classification.

The term "heterologous gene" is defined herein as a gene that is not native to the host cell; a native gene in which structural modifications have been made to the coding region; a native gene whose expression is quantitatively altered (e.g., "overexpressed") as a result of a manipulation of the DNA by recombinant DNA techniques, e.g., a different (foreign) promoter; or a gene in a host cell having one or more extra copies of the coding sequence to quantitatively alter expression. For avoidance of doubt, as used herein, a described Saccharomyces gene shall be considered a "heterologous gene" when expressed in a

Saccharomyces host so long as the gene is not in its native form and is altered by any means as described above (e.g., transformed into the host).

As used herein, an "overexpressed" gene encoding an enzyme means that the enzyme is produced at a higher level of specific enzymatic activity as compared to the unmodified host cell under identical conditions. Usually this means that the enzymatically active protein (or proteins in case of multi-subunit enzymes) is produced in greater amounts, or rather at a higher steady state level as compared to the unmodified host cell under identical conditions. Typically, this is the result of the mRNA coding for the enzymatically active protein being produced in greater amounts, or again rather at a higher steady state level as compared to the unmodified host cell under identical conditions. Overexpression of an enzyme is thus preferably determined by measuring the level of the enzyme specific activity in the host cell using appropriate enzyme assays as described herein. Alternatively, overexpression of the enzyme may be determined indirectly by quantifying the specific steady state level of enzyme protein, e.g. using antibodies specific for the enzyme, or by quantifying the specific steady level of the mRNA coding for the enzyme. The latter may particularly be suitable for enzymes of the pentose phosphate pathway for which enzymatic assays are not easily feasible as substrates for the enzymes are not commercially available. Preferably, in the host cells described herein, a heterologous gene is overexpressed by at least a factor 1 .1 , 1 .2, 1 .5, 2, 5, 10 or 20 as compared to a host cell which is genetically identical except for the genetic modification causing the overexpression. It is to be understood that these levels of overexpression may apply to the steady state level of the enzyme's activity, the steady state level of the enzyme's protein as well as to the steady state level of the transcript coding for the enzyme.

The term "isolated" means a substance in a form or environment that does not occur in nature. Non-limiting examples of isolated substances include (1 ) any non-naturally occurring substance, (2) any substance including, but not limited to, any host cell, enzyme, variant, nucleic acid, protein, peptide or cofactor, that is at least partially removed from one or more or all of the naturally occurring constituents with which it is associated in nature; (3) any substance modified by the hand of man relative to that substance found in nature; or (4) any substance modified by increasing the amount of the substance relative to other components with which it is naturally associated.

The term "operably linked" means a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of a polynucleotide such that the control sequence directs expression of the coding sequence.

A "variant" of a reference enzyme as used herein is similar in its amino acid sequence to the reference enzyme, such as a "parent" or wild-type enzyme, having an amino acid sequence identity of at least 30%, such as at least 40%, such as at least 50%, such as at least 60%, such as at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% to the amino acid sequence of the reference. Enzyme variants can be made by a wide variety of different mutagenesis techniques well known to those skilled in the art. In addition, mutagenesis kits are also available from many commercial molecular biology suppliers. Methods are available to make specific substitutions at defined amino acids (site-directed), specific or random mutations in a localized region of the gene (regio-specific) or random mutagenesis over the entire gene (e.g., saturation mutagenesis). Numerous suitable methods are known to those in the art to generate enzyme variants, including but not limited to site- directed mutagenesis of single-stranded DNA or double-stranded DNA using PCR, cassette mutagenesis, gene synthesis, error-prone PCR, shuffling, and chemical saturation.

As used herein, a "fragment" of a reference enzyme such as a parent or wild-type enzyme comprises a segment of the reference enzyme amino acid sequence. The amino acid sequence of the fragment typically comprises at least 50%, such as at least 60%, such as at least 70%, such as at least 80%, such as at least 90%, such as at least 95% of the amino acid sequence of the reference enzyme, lacking either an N-terminal portion, a C-terminal portion, or both N-terminal and C-terminal portions of the reference. Typically, a fragment is a catalytically active, at least retaining the enzyme activity of the reference enzyme, though fragments having improved enzyme activity, improved thermostability, altered co-factor dependency, or the like, are also encompassed. The fragment can be designed and expressed using recombinant methods, simply omitting the coding sequences for the relevant N-terminal and/or C-terminal portions.

An "ortholog" of a wild-type reference enzyme from a particular organism can readily be identified as being similar in its amino acid sequence to the reference enzyme though being encoded by a gene from another organism. Preferred orthologs are yeast orthologs, which can be readily identified by, e.g., searching public genomic sequence databases or screening EST libraries for nucleic acid sequences which hybridize to the wild-type nucleic acid sequence encoding the reference enzyme under moderate or stringent hybridization conditions. Typically, the ortholog has an amino acid sequence identity of at least 30%, such as at least 40%, such as at least 50%, such as at least 60%, such as at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% to the amino acid sequence of the reference enzyme, and is catalytically active such that it at least retains the enzyme activity of the reference enzyme, though orthologs having improved enzyme activity, improved thermostability, altered co-factor dependency, or the like, are also encompassed.

The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter "sequence identity".

For purposes described herein, the degree of sequence identity between two amino acid sequences is determined using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970) as implemented in the Needle program of the EMBOSS package (Rice et al., 2000), preferably version 3.0.0 or later. The optional parameters used are gap open penalty of 10, gap extension penalty of 0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix. The output of Needle labeled "longest identity" (obtained using the -nobrief option) is used as the percent identity and is calculated as follows:

(Identical Residues x 100)/(Length of the Referenced Sequence - Total Number of Gaps in Alignment)

For purposes described herein, the degree of sequence identity between two 5 deoxyribonucleotide sequences is determined using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, supra), preferably version 3.0.0 or later. The optional parameters used are gap open penalty of 10, gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBI NUC4.4) i o substitution matrix. The output of Needle labeled "longest identity" (obtained using the -nobrief option) is used as the percent identity and is calculated as follows:

(Identical Deoxyribonucleotides x 100)/(Length of Referenced Sequence - Total Number of Gaps in Alignment)

The variants, fragments, and orthologs of reference enzymes as described herein are

15 typically "catalytically active", meaning that at least retain the enzyme activity of the reference enzyme, though variants, fragments and orthologs having improved enzyme activity, improved thermostability, altered co-factor dependency, improved affinity, improved catalytic rate, or the like, are also encompassed. For example, a catalytically active variant, fragment or ortholog of Sp. passalidarum XK (SEQ ID NO: 6) or K. marxianus XK (SEQ ID NO: 22) thus has, e.g., at

20 least twice the enzymatic activity of S. cerevisiae XK (SEQ ID NO: 32) for converting D- xylulose to xylulose 5-phosphate, preferably when measured according to Example 7, and provides for an anaerobic growth rate of a recombinant S. cerevisiae or other yeast cell on xylose which is higher than that provided by a S. cerevisiae XK, preferably when measured according to Example 8. In another example, a functional variant, fragment or ortholog of

25 YME2p from Zygosaccharomyces bailii (SEQ ID NO:50) thus provides for one or more, preferably all, of an increased anaerobic growth, an increased xylose consumption rate, and an increased ethanol production rate of a recombinant yeast cell in the presence of formic acid and/or acetic acid, preferably when measured according to Example 14 or 13, respectively. In another example, a catalytically active variant, fragment or yeast ortholog of S. cerevisiae

30 EN01 , PFK2, PFK26, PGM , GPM1 or TPI 1 thus provides for one or more, preferably all, of an increased aerobic growth on xylose, an increased anaerobic growth rate on xylose, an increased xylose consumption rate, and an increased ethanol production rate from xylose, preferably when measured according to Example 17, 18, 23 and/or 24. In another example, a catalytically active variant, fragment or ortholog of S. cerevisiae PGM3/PRM15 thus provides

35 for one or more, preferably all, of an increased anaerobic growth on xylose, an increased xylose consumption rate, and an increased ethanol production rate from xylose, preferably when measured according to Example 26 and/or 27 and, typically, substantially retained or improved phosphoglucomutase activity, phosphoribomutase activity, or both. Phosphoglucomutase activity and phosphoribomutase activity can be determined as known in the art (e.g., See Xu et al., 2013).

Reference to "about" a value or parameter herein includes aspects that are directed to that value or parameter per se. For example, description referring to "about X" includes the aspect "X". When used in combination with measured values, "about" includes a range that encompasses at least the uncertainty associated with the method of measuring the particular value, and can include a range of plus or minus two standard deviations around the stated value.

As used herein and in the appended claims, the singular forms "a," "or," and "the" include plural referents unless the context clearly dictates otherwise. It is understood that the aspects described herein include "consisting" and/or "consisting essentially of" aspects.

Unless defined otherwise or clearly indicated by context, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art.

Detailed Description

Recombinant cells

Table 1 below summarizes some key results from the Examples. When tested in cell extracts of recombinant S. cerevisiae cells capable of fermenting xylose, the extracts from those cells overexpressing Sp. passalidarium XK (SEQ ID NO: 6) had a consistently higher XK activity than the control strains overexpressing S. cerevisiae XK (SEQ ID NO: 32). Surprisingly, the anaerobic growth rate on xylose of the cells overexpressing XK from either Sp. passalidarium or K. marxianus (SEQ ID NO: 22) was nonetheless higher than those of the control strains overexpressing S. cerevisiae XK or £. co// XK (SEQ ID NO: 18). In addition, the cell overexpressing Sp. passalidarium XK also provided for higher xylose consumption and ethanol production in fermentation on xylose than the corresponding S. cerevisiae XK control strains.

Table 1. Overview of strains constructed and tested as described in the Examples.

mXR(N272D)

P. stipitis XDH

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

C5LTe1036 P. stipitis 0.84 ± 0.20 21 15 mXR(N272D)

P. stipitis XDH

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

C5LTe1040 P. stipitis Slower than

mXR(N272D) e1035

P. stipitis XDH

E. coli XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

C5LTe1042 P. stipitis 5.21 ± 0.37 0.19 Faster than 46 23

mXR(N272D) e1035

P. si/ /i/s XDH

S. passalidarum XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RKI 1

C5LTe1043 P. stipitis Faster than

mXR(N272D) e1035

P. si/p/i/s XDH

K. marxianus XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RKI 1

C5LTe1048 4.31 ± 0.06 35 19

S. passalidarum XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

C5LTe1204 P. stipitis 0.92 ± 0.27 25 15

mXR(N272D)

P. si/p/i/s XDH

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

C5LTe1208 P. stipitis 3.67 ± 0.19 37 19

mXR(N272D)

P. si/p/i/s XDH

S. passalidarum XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RKI 1

Accordingly, in one aspect, is a recombinant yeast cell such as a Saccharomyces cell, optionally a S. cerevisiae cell, which is capable of fermenting xylose and which comprises a heterologous gene encoding a polypeptide having enzymatic xylulokinase (XK) activity.

In one aspect, is a recombinant yeast cell such as a Saccharomyces cell, optionally a S. cerevisiae cell, which is capable of fermenting xylose and which comprises a heterologous gene encoding a xylulokinase (XK) providing an enzymatic activity for converting D-xylulose to xylulose 5-phosphate which is higher than that provided by S. cerevisiae XK (SEQ ID NO: 32) and yet provides for an anaerobic growth rate which is higher than that provided by S. cerevisiae XK, £. co// XK, or both.

In one aspect, is a recombinant Saccharomyces cell which is capable of fermenting xylose and which comprises a heterologous gene encoding a xylulokinase (XK), wherein the XK has an enzymatic XK activity, which is higher than that of S. cerevisiae XK, and provides for an anaerobic growth rate of the recombinant cell on xylose which is higher than that provided by a S. cerevisiae XK.

In one aspect, is a recombinant yeast cell such as a Saccharomyces cell, optionally a S. cerevisiae cell, capable of fermenting xylose and which comprises a heterologous gene encoding a xylulokinase (XK) comprising SEQ ID NO: 6 or a catalytically active variant thereof.

In separate and specific embodiments of any aforementioned aspect, the XK provides for an enzymatic XK activity which is at least 1 .1 , e.g., at least 1 .2, at least 1 .5, at least 1 .7, at least twice (2), at least 2.5 or at least 3 times that provided by S. cerevisiae XK. In one embodiment, the XK has an enzymatic activity for converting D-xylulose to xylulose 5- phosphate at least twice that of S. cerevisiae XK. In one embodiment, the XK further provides for an aerobic growth rate of the cell on xylose which is higher than that provided by S. cerevisiae XK, E. coli XK, or both. For example, the XK may provide for an aerobic growth rate which is 1 .1 , 1 .2, 1 .3, 1 .4 or 1 .5 times that provided by S. cerevisiae XK. Additionally, the XK may provide for a higher xylose consumption, a higher ethanol production, or both, of the recombinant cell than that provided by S. cerevisiae XK, £. co// XK or both.

Advantageously, the XK activity, anaerobic and aerobic growth rates, and xylose consumption and ethanol production, can be tested in the assays and strain constructs according to the Examples. For example, in one embodiment, the enzymatic activity for converting D-xylulose to xylulose 5-phosphate is measured according to Example 7. In an additional or alternative embodiment, the anaerobic growth rate can be measured according to Example 9. Likewise, the aerobic growth rate can be measured according to Example 8, and/or the xylose consumption and ethanol production can be measured according to Example 10. In these tests, the recombinant strains can, for example, be prepared from C5LTe1000 or an equivalent or similar laboratory or commercially available S. cerevisiae strain, for example CEN.PK or s288c, and then tested in the form of live cells or cell extracts as described. Typically, for conducting such test assays, S. cerevisiae cells are transformed with genes encoding P. stipitis mXR with an N272D mutation, P. stipitis XDH, the XK to be examined or control S. cerevisiae or E. coli XK, and S. cerevisiae TAL1 , TKL1 , and RKI 1 , so that each gene is chromosomally integrated and operably linked to a constitutive promoter.

In one embodiment of any preceding aspect or embodiment, the XK comprises the amino acid sequence of SEQ ID NO: 6, the amino acid sequence of SEQ ID NO: 22 or a catalytically active variant or fragment of any thereof. In one embodiment, the XK is a catalytically active variant of SEQ ID NO: 6, comprising an amino acid sequence which is at least 50%, such as at least 60%, such as at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% identical to SEQ ID NO: 6. In one embodiment, the XK is a variant of SEQ ID NO: 22, comprising an amino acid sequence which is at least 50%, such as at least 60%, such as at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 98%, such as at least 99% identical to SEQ ID NO: 6. In one embodiment, the XK is a fragment (e.g., a catalytically active fragment) of SEQ ID NO: 6 corresponding to at least 50%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% of the full-length enzyme amino acid sequence. In one embodiment, the XK is a fragment (e.g., a catalytically active fragment) of SEQ ID NO: 22 corresponding to at least 50%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% of the full-length enzyme amino acid sequence.

Table 2 below additionally summarizes additional key results from the Examples. Surprisingly, expressing the heterologous YME2 gene (i.e., overexpressing the YME2 gene) in recombinant S. cerevisiae cells capable of fermenting xylose resulted in a higher anaerobic growth rate, more efficient xylose consumption and increased ethanol production in fermentation on xylose.

Table 2. Overview of strains constructed and tested in the presence of formic acid as described in the Examples. The growth and fermentation experiments reported here were conducted on xylose and under anaerobic conditions.

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

Z. bailii YME2

Accordingly, in one aspect is a recombinant yeast cell such as a Saccharomyces cell, optionally a S. cerevisiae cell, which is capable of fermenting xylose and which comprises a heterologous gene encoding an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant, fragment or yeast ortholog thereof, wherein the Yme2p polypeptide provides for an increased tolerance of the recombinant cell to formic acid, acetic acid, or both. In one embodiment, the Yme2p polypeptide further provides for (a) an increased anaerobic growth, (b) an increased xylose consumption rate, (c) an increased ethanol production rate, or (d) a combination of two or all of (a) to (c), of the recombinant cell in the presence of formic acid. In one embodiment, the Yme2p polypeptide further provides for (a) an increased xylose consumption rate, (b) an increased ethanol production rate, or (c) a combination of (a) and (b), of the recombinant cell in the presence of acetic acid.

Advantageously, the anaerobic growth, xylose consumption, ethanol production rate, or combination can be measured according to Examples 13 or 14. Notably, these Example show that xylose fermentation in presence of formic or acetic acid was improved in yeast overexpressing YME2. Specifically, xylose consumption and ethanol production increased by 13% and 12%, respectively, in the presence of acetic acid, and by 7% and 12%, respectively, in the presence of formic acid. Formic acid is common in lignocellulosic hydrolysates, and strongly contributes to the toxicity of such hydrolysates.

In separate and specific embodiments of any aforementioned aspect or embodiment, the Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a functional variant, fragment or yeast ortholog thereof, provides for a xylose consumption which is at least 5%, e.g., at least 10% higher, an ethanol production which is at least 5%, e.g., at least 10% higher, or both, of the recombinant cell when tested according to Example 14. In these tests, the recombinant strains can, for example, be prepared from TMB 3000 or an equivalent or similar laboratory or commercially available S. cerevisiae strain, and then tested in as described. Typically, for conducting such test assays, S. cerevisiae cells are transformed with genes encoding P. stipitis mXR with an N272D mutation, P. stipitis XDH, the Yme2p polypeptide to be examined, and S. cerevisiae XK, TAL1 , TKL1 , and RKI 1 , so that each gene is chromosomally integrated and operably linked to a constitutive promoter.

In one embodiment of any preceding aspect or embodiment, the Yme2p polypeptide comprises the amino acid sequence of SEQ ID NO: 50 or a functional variant, fragment or yeast ortholog thereof. In one embodiment, the Yme2p polypeptide is a functional variant or yeast ortholog of the amino acid sequence of SEQ ID NO: 50, comprising an amino acid sequence which is at least 50%, such as at least 60%, such as at least 70%, such as at least 80%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99% identical to the amino acid sequence of SEQ ID NO: 50. In one embodiment, the Yme2p polypeptide is a functional fragment of the amino acid sequence of SEQ ID NO: 50 corresponding to at least 50%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% of the full-length enzyme amino acid sequence.

Tables 3 and 8 below additionally summarize some key results from the Examples, showing that overexpression of genes encoding EN01 , PFK2, PFK26, PGM , GPM1 or TPI 1 in recombinant S. cerevisiae cells capable of fermenting xylose resulted in a higher anaerobic growth rate, more efficient xylose consumption and/or increased ethanol production in fermentation on xylose.

Table 3. Overview of strains constructed and tested as described in the Examples. See also Table 8, for results on xylose consumption and ethanol production.

Strain Heterologous Aerobic Anaerobic Xylose Ethanol genes growth on growth on consumed produced in xylose xylose (h 1 ) in 140h, 140h (g/L)

(g/L)

C5LTe1035 P. stipitis mXR(N272D) 0.058

P. stipitis 3 XDH

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

C5LTe1051 P. stipitis mXR(N272D) 0.066

P. stipitis 3 XDH

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae EN01

C5LTe1052 P. stipitis mXR(N272D) 0.063

P. stipitis 3 XDH

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae PFK2

C5LTe1054 P. stipitis mXR(N272D) 0.071

P. stipitis 3 XDH

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae PGM

C5LTe1055 P. stipitis mXR(N272D) 0.075 P. stipitis 3 XDH

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae PFK26

MC2 P. stipitis mXR(N272D) Faster than Faster than 51 18

P. stipitis 3 XDH average average

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae RPE1

S. cerevisiae PGM

MC3 P. si/ / ' i/ ' s mXR(N272D) Slower than Faster than 53 21

P. si/p/ ' i/ ' s a XDH average average

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae RPE1

S. cerevisiae PFK26

MC4 P. sf/p/tfs mXR(N272D) Faster than Faster than 50 22

P. si/p/ ' i/ ' s a XDH average average

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae RPE1

S. cerevisiae PFK2

MC11 P. si/p i s mXR(N272D) Slower than Faster than 50 18

P. si/p/ ' i/ ' s a XDH average average

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae RPE1

S. cerevisiae PFK2

MC14 P. si/p/i/s mXR(N272D) Faster than Faster than 54 20

P. si/p/ ' i/ ' s a XDH average average

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae RPE1

S. cerevisiae EN01

MC22 P. si/p/i/s mXR(N272D) Slower than Faster than 51 18

P. si/p i s a XDH average average

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RK11

S. cerevisiae RPE1

S. cerevisiae EN01

Control P. si/p/i/s mXR(N272D) 36 13

P. stipitis 3 XDH

S. cerevisiae XK

S. cerevisiae TAL1

S. cerevisiae TKL1

S. cerevisiae RKI1 S. cerevisiae RPE1

URA3+

Accordingly, in one aspect is a recombinant yeast cell such as a Saccharomyces cell, optionally a S. cerevisiae cell, which is capable of fermenting xylose and which comprises a heterologous gene encoding an enolase comprising the amino acid sequence of SEQ ID NO: 5 132 (EN01 ), a phosphofructokinase beta subunit polypeptide comprising the amino acid sequence of SEQ ID NO: 134 (PFK2), a 6-phosphofructo-2-kinase comprising the amino acid sequence of SEQ ID NO: 136 (PFK26), a glucose-6-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 138 (PGI1 ), a phosphoglycerate mutase comprising the amino acid sequence of SEQ ID NO: 140 (GPM1 ), a triose-phosphate isomerase comprising i o the amino acid sequence of SEQ ID NO: 142 (TPI 1 ), or a catalytically active variant, fragment or yeast ortholog of any one of EN01 , PFK2, PFK26, PGM , GPM1 and TPI 1 , or a combination of any two or more of EN01 , PFK2, PFK26, PGM , GPM1 and TPI 1 thereof,such as three, four, five or all of EN01 , PFK2, PFK26, PGI1 , GPM1 and TPI 1. In one embodiment, the heterologous gene or combination provides for an increased anaerobic growth rate on xylose,

15 an increased aerobic growth rate on xylose, an increased ethanol production from xylose, or a combination of any two or all thereof. Advantageously, the anaerobic growth rate can be measured according to Example 23, the aerobic growth can be measured according to Example 5, and/or the ethanol production can be measured according to Example 24.

In separate and specific embodiments of any aforementioned aspect or embodiment,

20 the EN01 , PFK2, PFK26, PGI1 , GPM1 , TPI1 or catalytically active variant, fragment or yeast ortholog of any thereof, provides for an ethanol yield from xylose which is at least 5%, e.g., at least 10% higher, at least 15% higher, or at least 20% higher, of the recombinant cell when tested according to Example 12. In these tests, the recombinant strains can, for example, be prepared from CEN.PK or an equivalent or similar laboratory or commercially available S.

25 cerevisiae strain, and then tested in as described. Typically, for conducting such test assays, S. cerevisiae cells are transformed with genes encoding P. stipitis mXR with an N272D mutation, P. stipitis XDH, the EN01 , PFK2, PFK26, PGM , GPM1 , or TPI 1 to be examined, and S. cerevisiae XK, TAL1 , TKL1 , and RKI 1 , so that each gene is chromosomally integrated and operably linked to a constitutive promoter.

30 In one embodiment or any preceding aspect or embodiment, the cell comprises a heterologous gene encoding an enolase comprising the amino acid sequence of SEQ ID NO: 132 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the catalytically active variant or yeast ortholog has a sequence identity of at least 80%, e.g., at least 90%, at least 95%, at least 97%, at least 98%, or at least 99% to the amino acid

35 sequence of SEQ ID NO: 132. In one embodiment, the heterologous gene encodes the amino acid sequence of SEQ ID NO: 132.

In one embodiment or any preceding aspect or embodiment, the cell comprises a heterologous gene encoding a phosphofructokinase beta subunit polypeptide comprising the amino acid sequence of SEQ ID NO: 134 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the catalytically active variant or yeast ortholog has a sequence identity of at least 80%, e.g., at least 90%, at least 95%, at least 97%, at least 98%, or at least 99% to the amino acid sequence of SEQ ID NO: 134. In one embodiment, the heterologous gene encodes the amino acid sequence of SEQ ID NO: 134.

In one embodiment or any preceding aspect or embodiment, the cell comprises a heterologous gene encoding a 6-phosphofructo-2-kinase comprising the amino acid sequence of SEQ ID NO: 136 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the catalytically active variant or yeast ortholog has a sequence identity of at least 80%, e.g., at least 90%, at least 95%, at least 97%, at least 98%, or at least 99% to the amino acid sequence of SEQ ID NO: 136. In one embodiment, the heterologous gene encodes the amino acid sequence of SEQ ID NO: 136.

In one embodiment or any preceding aspect or embodiment, the cell comprises a heterologous gene encoding a glucose-6-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 138 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the catalytically active variant or yeast ortholog has a sequence identity of at least 80%, e.g., at least 90%, at least 95%, at least 97%, at least 98%, or at least 99% to the amino acid sequence of SEQ ID NO: 138. In one embodiment, the heterologous gene encodes the amino acid sequence of SEQ ID NO: 138.

In one embodiment or any preceding aspect or embodiment, the cell comprises a heterologous gene encoding a phosphoglycerate mutase comprising the amino acid sequence of SEQ ID NO: 140 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the catalytically active variant or yeast ortholog has a sequence identity of at least 80%, e.g., at least 90%, at least 95%, at least 97%, at least 98%, or at least 99% to the amino acid sequence of SEQ ID NO: 140. In one embodiment, the heterologous gene encodes the amino acid sequence of SEQ ID NO: 140.

In one embodiment or any preceding aspect or embodiment, the cell comprises a heterologous gene encoding a triose-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 142 or a catalytically active variant, fragment or yeast ortholog thereof. In one embodiment, the catalytically active variant or yeast ortholog has a sequence identity of at least 80%, e.g., at least 90%, at least 95%, at least 97%, at least 98%, or at least 99% to the amino acid sequence of SEQ ID NO: 142. In one embodiment, the heterologous gene encodes the amino acid sequence of SEQ ID NO: 142.

In one embodiment, the heterologous gene encodes a fragment of the amino acid sequence of SEQ ID NO: 132, 134, 136, 138, 140 or 142 corresponding to at least 50%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% of the full-length amino acid sequence.

Table 4 below additionally summarizes some key results from the Examples, showing that overexpression of PGM3 in recombinant S. cerevisiae cells capable of fermenting xylose resulted in a higher anaerobic growth rate, and more efficient xylose consumption and increased ethanol production in anaerobic fermentation of xylose.

Table 4. Overview of strains constructed and tested as described in the Examples.

Accordingly, in one aspect is a recombinant yeast cell such as a Saccharomyces cell, optionally a S. cerevisiae cell, which is capable of fermenting xylose and which comprises an heterologous gene encoding a phosphoglucomutase and/or phosphoribomutase comprising the amino acid sequence of SEQ ID NO: 150 (PGM3) or a catalytically active variant or fragment thereof. In one embodiment, the heterologous gene or combination provides for an increased anaerobic growth rate on xylose, an increased xylose consumption, an increased ethanol production from xylose, or a combination of any two or all thereof.

In separate and specific embodiments of any aforementioned aspect or embodiment, the PGM3 or catalytically active fragment or variant provides for an anaerobic growth rate of the recombinant cell which is at least 10%, e.g., at least 20%, at least 50%, at least 80%, or at least 100% higher than a relevant control not comprising the heterologous PGM3 gene (in which the PGM3 is not overexpressed), (e.g., when tested according to Example 26). In separate and specific embodiments of any aforementioned aspect or embodiment, the PGM3 or catalytically active fragment or variant also or alternatively provides for a xylose consumption rate in anaerobic fermentation which is at least 10%, e.g., at least 20%, at least 50%, at least 80%, or at least 100% higher than a relevant control not comprising the heterologous PGM3 gene (in which the PGM3 is not overexpressed), (e.g., when tested according to Example 27). In separate and specific embodiments of any aforementioned aspect or embodiment, the PGM3 or catalytically active fragment or variant also or alternatively provides for an ethanol yield in anaerobic fermentation which is at least 10%, e.g., at least 20%, at least 50%, at least 80%, or at least 100% higher than a relevant control not comprising the heterologous PGM3 gene (in which the PGM3 is not overexpressed), (e.g., when tested according to Example 27). In these tests, the recombinant strains can, for example, be prepared from CEN.PK or an equivalent or similar laboratory or commercially available S. cerevisiae strain, and then tested in as described. Typically, for conducting such test assays, S. cerevisiae cells are transformed with genes encoding P. stipitis mXR with an N272D mutation, P. stipitis XDH, the PGM3 or catalytically active fragment or variant to be examined, and S. cerevisiae XK, TAL1 , TKL1 , and RKI 1 , so that each gene is chromosomally integrated and operably linked to a constitutive promoter.

In one embodiment or any preceding aspect or embodiment, the cell comprises a heterologous gene encoding a PGM3 comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically active variant or fragment thereof. In one embodiment, the catalytically active variant has a sequence identity of at least 30%, such as at least 50%, such as at least 80%, such as at least 90%, such as least 95%, such as at least 97%, such as at least 98%, such as at least 99% to the amino acid sequence of SEQ ID NO: 150. In one embodiment, the heterologous gene encodes a PGM3 comprising the amino acid sequence of SEQ ID NO: 150.

In one embodiment, the heterologous gene encoding PGM3 or catalytically active fragment or variant encodes a fragment of SEQ I D NO: 150 corresponding to at least 50%, at least 80%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% of the full-length amino acid sequence.

The skilled artisan is well aware that variants and fragments of an enzyme sequence can be modified by replacing, inserting, or deleting amino acids using standard recombinant techniques while still retaining, or even improving, the enzyme activity of interest. Although such variants include those having amino acid sequences with one or more conservative or non-conservative substitutions relative to the amino acid sequence of SEQ ID NO: 6, 22, 50, 132, 134, 136, 138, 140, 142 and/or 150, conservative substitutions are typically of most interest. As used herein, the term "conservative substitution" refers to the substitution of a residue for another residue that does not generally alter the specific activity of the encoded polypeptide. An exemplary conservative substitution is a substitution that is within the same group of basic amino acids (arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), polar amino acids (glutamine and asparagine), hydrophobic amino acids (leucine, isoleucine and valine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), and small amino acids (glycine, alanine, serine, threonine, proline, cysteine and methionine). Amino acid substitutions that do not generally alter the specific activity are well- known in the art. The most commonly occurring exchanges are Ala/Ser, Val/lle, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr. Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu lie, LeuA al, Ala/Glu, and Asp/Gly, as well as these in reverse. In some embodiments, the substitutions are of a low percentage, typically less than about 10%, more typically less than 5%, and often less than about 2% of the amino acids of the polypeptide sequence, with a conservatively selected amino acid of the same conservative substitution group. In one preferred embodiment of any preceding aspect or embodiment, the XK comprises SEQ ID NO: 6. In another preferred embodiment of any preceding aspect or embodiment, the XK comprises SEQ ID NO: 22. In another preferred embodiment of any preceding aspect or embodiment, the Yme2p comprises the amino acid sequence of SEQ ID NO: 50. In another preferred embodiment of any preceding aspect, the enzyme comprises the amino acid sequence of SEQ ID NO: 134, 136, 138, 140, 142 or 150.

Essential amino acids of enzymes can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, 1989, Science 244: 1081 -1085). In the latter technique, single alanine mutations are introduced at every residue in the molecule, and the resultant mutant molecules are tested for enzymatic activity to identify amino acid residues that are critical to the activity of the molecule. See also, Hilton et al., 1996, J. Biol. Chem. 271 : 4699-4708. The active site of the enzyme or other biological interaction can also be determined by physical analysis of structure, as determined by such techniques as nuclear magnetic resonance, crystallography, electron diffraction, or photoaffinity labeling, in conjunction with mutation of putative contact site amino acids. See, for example, de Vos et al., 1992, Science 255: 306-312; Smith et al., 1992, J. Mol. Biol. 224: 899-904; Wlodaver et al., 1992, FEBS Lett. 309: 59-64. The identities of essential amino acids can also be inferred from analysis of identities with other enzymes that are related to the referenced enzymes.

Additional guidance on the structure-activity relationship of the enzymes herein can be determined using multiple sequence alignment (MSA) techniques well-known in the art. Based on the teachings herein, the skilled artisan could make similar alignments with any number of enzymes described herein or known in the art. Such alignments aid the skilled artisan to determine potentially relevant domains (e.g., binding domains or catalytic domains), as well as which amino acid residues are conserved and not conserved among the different enzyme sequences. It is appreciated in the art that changing an amino acid that is conserved at a particular position between the disclosed enzyme will more likely result in a change in biological activity (Bowie et al., 1990, Science 247: 1306-1310: "Residues that are directly involved in protein functions such as binding or catalysis will certainly be among the most conserved"). In contrast, substituting an amino acid that is not highly conserved among the enzymes will not likely or significantly alter the biological activity.

Single or multiple amino acid substitutions, deletions, and/or insertions can be made and tested using known methods of mutagenesis, recombination, and/or shuffling, followed by a relevant screening procedure, such as those disclosed by Reidhaar-Olson and Sauer, 1988, Science 241 : 53-57; Bowie and Sauer, 1989, Proc. Natl. Acad. Sci. USA 86: 2152-2156; WO 95/17413; or WO 95/22625. Other methods that can be used include error-prone PCR, phage display (e.g., Lowman et al., 1991 , Biochemistry 30: 10832-10837; U.S. Patent No. 5,223,409; WO 92/06204), and region-directed mutagenesis (Derbyshire et al., 1986, Gene 46: 145; Ner et al., 1988, DNA 7: 127).

Mutagenesis/shuffling methods can be combined with high-throughput, automated screening methods to detect activity of cloned, mutagenized polypeptides expressed by host cells (Ness et al., 1999, Nature Biotechnology 17: 893-896). Mutagenized DNA molecules that encode active enzymes (e.g., xylulokinases) can be recovered from the host cells and rapidly sequenced using standard methods in the art. These methods allow the rapid determination of the importance of individual amino acid residues in a polypeptide.

The host cells for preparing the recombinant cells of the invention can be from any suitable yeast strain, including, but not limited to, a Saccharomyces, Rhodotorula, Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell. In particular, Saccharomyces host cells are contemplated, such as Saccharomyces cerevisiae, bayanus or carlsbergensis cells. Preferably, the yeast cell is a Saccharomyces cerevisiae cell. Suitable cells can, for example, be derived from commercially available strains and polyploid or aneuploid industrial strains, including but not limited to those from Superstart™, THERMOSACC®, etc. (Lallemand); RED STAR and ETHANOL RED® (Fermentis/Lesaffre); FALI (AB Mauri); Baker's Best Yeast, Baker's Compressed Yeast, etc. (Fleishmann's Yeast); BIOFERM AFT, XP, CF, and XR (North American Bioproducts Corp.); Turbo Yeast (Gert Strand AB); and FERMIOL® (DSM Specialties). Other useful yeast strains are available from biological depositories such as the American Type Culture Collection (ATCC) or the Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH (DSMZ), such as, e.g., BY4741 (e.g., ATCC 201388); Y108-1 (ATCC PTA.10567) and NRRL YB-1952 (ARS Culture Collection). Still other S. cerevisiae strains suitable as host cells DBY746, [Alpha][Eta]22, S150-2B, GPY55-15Ba, CEN.PK, USM21 , TMB3500, TMB3400, VTT-A-63015, VTT-A-85068, VTT-c-79093 and their derivatives as well as Saccharomyces sp. 1400, 424A (LNH-ST), 259A (LNH-ST) and derivatives thereof.

As previously mentioned, some wild-type yeast cells, e.g., Saccharomyces cells, cannot naturally ferment xylose. However, it is now well within the level of skill in the art to apply genetic engineering technology to prepare recombinant Saccharomyces cells which are capable of fermenting xylose. For example, XR and XDH enzymes from naturally xylose fermenting yeasts such as Scheffersomyces (Pichia) stipitis and various Candida species can be expressed in Saccharomyces cells to provide this ability and/or an XI enzyme suitable for expression in a yeast host cell. Additionally, it is contemplated to use catalytically active variants of XR and/or XDH. For example, variants of P. stipitis XR exist which change the cofactor preference of the XR from NADPH to NADH. These variants are referred to herein as "NADH-preferring" XR variants, and include, but are not limited to, N272D, K270R and P275Q. Still other variants of XR and XK are described in WO 2012/1351 10. As for XI, XI from the fungus Piromyces sp. (Kuyper et al., 2005) or other sources (Madhavan et al., 2009) have been expressed in S. cerevisiae host cells. Still other Xls suitable for expression in yeast have been described in US 2012/0184020 (an XI from Ruminococcus flavefaciens), WO 201 1/078262 (several Xls from Reticulitermes speratus and Mastotermes darwiniensis) and WO 2012/009272 (constructs and fungal cells containing an XI from Abiotrophia defectiva). Optionally, additional improvements in xylose fermentation can also be achieved by strain adaptation to selective conditions, according to techniques known in the art.Additionally, the xylose fermentative capability of a yeast cell can also be increased by increasing the flux of the pentose-phosphate pathway (PPP) by overexpressing one or more genes encoding enzymes of the non-oxidative pathway, which includes TAL (EC 2.2.1 .2), TKL (EC 2.2.1 .1 ), RKI (EC 5.3.1 .6) and RPE (EC 5.1 .3.4) (Karhumaa et al., 2005). Preferably, in the yeast cells of the invention, the genes encoding TAL, TKL and RKI are overexpressed. Typically, although not necessarily, the endogenous genes of the PPP are overexpressed. An increased flux in the PPP can be measured by metabolic flux analysis with 13C-labeled glucose as described in, e.g., van Winden et al. (2005, FEMS Yeast Research 5:559-568).

Accordingly, in one embodiment of any preceding aspect or embodiment, the recombinant cell comprises a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and/or a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI). In some embodiments, the recombinant cell comprises a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI).

In one embodiment of any preceding aspect or embodiment, the recombinant cell comprises a heterologous gene encoding a xylose reductase (XR), a heterologous gene encoding a xylitol dehydrogenase (XDH), and/or a heterologous gene encoding a xylose isomerase (XI). Preferred XRs are Pichia stipitis XR and NADH-preferring variants thereof, such as Pichia stipites XR comprising one or more amino acid substitutions selected from N272D, K270R and P275Q. Most preferred is Pichia stipitis XR(N272D). Preferred XDHs are Pichia stipitis XDH and catalytically active variants thereof. Preferred TALs, TKLs and RKIs are those that are endogenous to the cell.

The specific coding or amino acid sequences for the various S. cerevisiae or other yeast or fungal enzymes referred to above can be identified in the literature and in bioinformatics databases well known to the skilled person, such as the BRENDA comprehensive enzyme information system, available at www.brenda-enzymes.org, KEGG (www.genome.jp/kegg/kegg2.html/), UniProt (http://ca.expasy.org/sprot/), Metacyc (www.metacyc.com/), and the Saccharomyces genome database (www.yeastgenome.org). Particular XR, XDH, XK, TAL, TKL and RKI nucleic acid and amino acid sequences can also be prepared as described in the Examples, or are provided in the accompanying sequence listing.

In another aspect is a method for increasing the tolerance of a yeast cell such as a

Saccharomyces cell to formic acid, comprising transforming the cell with a gene encoding a Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant, fragment or yeast ortholog thereof, and expressing the gene.

In one embodiment, each of the heterologous gene or genes in the recombinant cell according to the invention is operably linked to an inducible, a regulated or a constitutive promoter. Optionally, one or more, optionally all, of the genes are integrated into the genome of the cell. In a specific embodiment, the gene encoding an XK is operably linked to a constitutive promoter endogenous to the cell. Related recombinant techniques are described in more detail below.

Strains or clones of the recombinant cells of any of the preceding aspects or embodiments are also provided by the invention. A "clone" in this context refers to a number of cells which all are derived from the same parent cell by cell division.

Recombinant methods

The invention also relates to vectors comprising genes encoding an XK as described in the preceding aspects and embodiments which can be used for transforming a yeast host cell, optionally a Saccharomyces cell, such as an S. cerevisiae cell. The yeast host cell can either be naturally capable of fermenting xylose, or may be transformed with genes providing this capability, as described above.

Accordingly, in one aspect is a process for producing a recombinant cell described herein (e.g., a Saccharomyces cell), comprising transforming the cell with a heterologous gene (e.g., a vector) that encodes a XK comprising the amino acid sequence of SEQ I D NO: 6, the amino acid sequence of SEQ ID NO: 22 or a catalytically variant or fragment of any thereof.

In one embodiment, the process comprises transforming the cell with a heterologous gene (e.g., a vector) that encodes a XK comprising the amino acid sequence of SEQ ID NO: 6, the amino acid sequence of SEQ ID NO: 22 or a catalytically variant or fragment of any thereof; a heterologous gene (e.g., a vector) that encodes a XR, and a heterologous gene (e.g., a vector) that encodes a XDH. The heterologous genes may be in the form of one or more vectors.

In one embodiment, the process comprises transforming the cell with a heterologous gene (e.g., a vector) that encodes a XK comprising the amino acid sequence of SEQ ID NO: 6, the amino acid sequence of SEQ ID NO: 22 or a catalytically variant or fragment of any thereof; and a heterologous gene (e.g., a vector) that encodes an XI. The heterologous genes may be in the form of one or more vectors.

In one aspect is a vector comprising a gene encoding a xylulokinase (XK) comprising the amino acid sequence of SEQ ID NO: 6, SEQ ID NO: 22 or a catalytically active variant or fragment of any thereof.ln some embodiments, the vector further comprises a gene encoding a xylose reductase (XR), a gene encoding a xylitol dehydrogenase (XDH), and/or a gene encoding a xylose isomerase (XI). The vector may further comprise regulatory sequences for expressing the genes in a yeast host cell such as a Saccharomyces or Saccaromyces cerevisiae host cell.

The invention also relates to vectors comprising genes encoding an Yme2p as described in the preceding aspects and embodiments which can be used for transforming a yeast host cell, optionally a Saccharomyces cell, such as an S. cerevisiae cell. The yeast host cell can either be naturally capable of fermenting xylose, or may be transformed with genes providing this capability, as described above.

Accordingly, in one aspect is a process for producing a recombinant cell described herein (e.g., a Saccharomyces cell), comprising transforming the cell with a heterologous gene (e.g., a vector) that encodes a Yme2p comprising the amino acid sequence of SEQ ID NO: 50, or a catalytically variant or fragment of any thereof.

In one embodiment, the process comprises transforming the cell with a heterologous gene (e.g., a vector) that encodes a Yme2p comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically variant or fragment of any thereof; a heterologous gene (e.g., a vector) that encodes a XR, and a heterologous gene (e.g., a vector) that encodes a XDH. The heterologous genes may be in the form of one or more vectors.

In one embodiment, the process comprises transforming the cell with a heterologous 5 gene (e.g., a vector) that encodes a Yme2p comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically variant or fragment of any thereof; and a heterologous gene (e.g., a vector) that encodes an XI. The heterologous genes may be in the form of one or more vectors.

In one aspect is a vector comprising a gene encoding a Yme2p comprising the amino i o acid sequence of SEQ ID NO: 50 or a catalytically active variant or fragment of any thereof.ln some embodiments, the vector further comprises a gene encoding a xylose reductase (XR), a gene encoding a xylitol dehydrogenase (XDH), and/or a gene encoding a xylose isomerase (XI). The vector may further comprise regulatory sequences for expressing the genes in a yeast host cell such as a Saccharomyces or Saccaromyces cerevisiae host cell.

15 The invention also relates to vectors comprising genes encoding an EN01 , PFK2,

PFK26, PGI1 , GPM1 , TPI1 polypeptide or catalytically active variant, fragment or yeast ortholog thereof as described in the preceding aspects and embodiments which can be used for transforming a yeast host cell, optionally a Saccharomyces cell, such as an S. cerevisiae cell. The yeast host cell can either be naturally capable of fermenting xylose, or may be

20 transformed with genes providing this capability, as described above.

Accordingly, in one aspectis a process for producing a recombinant cell described herein (e.g., a Saccharomyces cell), comprising transforming thecell with one or more heterologous genes (e.g., vectors) that encode EN01 , PFK2, PFK26, PGI1 , GPM1 , TPI1 or a catalytically active variant, fragment or yeast ortholog of any thereof.

25 In one embodiment, the process comprises transforming the cell with one or more heterologous genes (e.g., vectors) that encodes EN01 , PFK2, PFK26, PGM , GPM1 , TPI 1 or a catalytically active variant, fragment or yeast ortholog of any thereof; a heterologous gene (e.g., a vector) that encodes a XR, and a heterologous gene (e.g., a vector) that encodes a XDH. In some embodiments, the process further comprises transforming the cell with a

30 heterologous gene encoding a xylulose kinase (XK). The heterologous genes may be in the form of one or more vectors.

In one embodiment, the process comprises transforming the cell with one or more heterologous genes (e.g., vectors) that encodes EN01 , PFK2, PFK26, PGM , GPM1 , TPI 1 or a catalytically active variant, fragment or yeast ortholog of any thereof; and a heterologous gene

35 (e.g., a vector) that encodes an XI. In some embodiments, the process further comprises transforming the cell with a heterologous gene encoding a xylulose kinase (XK). The heterologous genes may be in the form of one or more vectors. In another aspect is a vector comprising one or more heterologous genes encoding EN01 , PFK2, PFK26, PGM , GPM1 , TPI 1 or a functional variant, fragment or yeast ortholog of any thereof In some embodiments, the vector further comprises a gene encoding a xylose reductase (XR), a gene encoding a xylitol dehydrogenase (XDH), In some embodiments, the process further comprises transforming the cell with a heterologous gene encoding a xylulose kinase (XK), and/or a gene encoding a xylose isomerase (XI). The vector may further comprise regulatory sequences for expressing the genes in a yeast host cell such as a Saccharomyces or Saccaromyces cerevisiae host cell.

The invention also relates to vectors comprising genes encoding an PGM3 polypeptide or a catalytically active variant or fragment thereof as described in the preceding aspects and embodiments which can be used for transforming a yeast host cell, optionally a

Saccharomyces cell, such as an S. cerevisiae cell. The yeast host cell can either be naturally capable of fermenting xylose, or may be transformed with genes providing this capability, as described above.

Accordingly, in one aspect is a process for producing a recombinant cell described herein (e.g., a Saccharomyces cell), comprising transforming the cell with a heterologous gene (e.g., a vector) that encodes the PGM3 comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically active variant or fragment thereof.

In one embodiment, the process comprises transforming the cell with a heterologous gene (e.g., a vector) that encodes a PGM3 comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically variant or fragment of any thereof; a heterologous gene (e.g., a vector) that encodes a XR, and a heterologous gene (e.g., a vector) that encodes a XDH. The heterologous genes may be in the form of one or more vectors.

In one embodiment, the process comprises transforming the cell with a heterologous gene (e.g., a vector) that encodes a PGM3 comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically variant or fragment of any thereof; and a heterologous gene (e.g., a vector) that encodes an XI. The heterologous genes may be in the form of one or more vectors.

In another aspect is a vector comprising a gene encoding the PGM3 comprising the amino acid sequence of SEQ ID NO: 150 or a functional variant or fragment thereof In some embodiments, the vector further comprises a gene encoding a xylose reductase (XR), a gene encoding a xylitol dehydrogenase (XDH), and/or a gene encoding a xylose isomerase (XI). The vector may further comprise regulatory sequences for expressing the genes in a yeast host cell such as a Saccharomyces or Saccaromyces cerevisiae host cell.

Many methods for genetic modification, including transformation of yeast host cells are known to one skilled in the art and may be used to create the present recombinant cells, some of which are exemplified below. Standard recombinant DNA and molecular cloning techniques useful for transforming microbial cells with a desired nucleic acid sequence or gene, or otherwise manipulate the microbial cell, are described, e.g., in Sambrook, J., Fritsch, E. F. and Maniatis, T., Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989); and by Silhavy, T. J., Bennan, M. L. and Enquist, L. W., Experiments with Gene Fusions, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1984); Ausubel et al., Short Protocols in Molecular Biology, 3rd ed., Wiley & Sons, 1995; and by Ausubel, F. M. et al., Current Protocols in Molecular Biology, published by Greene Publishing Assoc. and Wiley-lnterscience (1987). Additional methods used here are in Methods in Enzymology, Volume 194, Guide to Yeast Genetics and Molecular and Cell Biology (Part A, 2004, Christine Guthrie and Gerald R. Fink (Eds.), Elsevier Academic Press, San Diego, Calif.).

Overexpressing a gene can be achieved by increasing the copy number of the gene coding for the enzyme in the cell, e.g. by integrating additional copies of the gene in the cell genome, expressing the gene from an episomal multicopy expression vector or introducing an episomal expression vector that comprises multiple copies of the gene; upregulating the endogenous gene, and the like. In a preferred embodiment, the gene is introduced into the microbial cell via, e.g., transformation with one or more expression vectors. For example, for a yeast host cell, the level of a recombinantly expressed enzyme in the cell can be increased by clone one or more recombinant genes in a multicopy plasmid in the manner described by Mumberg et al. (1995).

The gene can either be synthesized or cloned from a host organism in which the corresponding DNA sequence is endogenous (see, e.g., Examples 1 -5). Standard cloning procedures used in genetic engineering can be used to relocate a nucleic acid sequence from its natural location to a different site where it will be reproduced. The cloning procedures may involve excision and isolation of a desired DNA fragment comprising the DNA sequence encoding the polypeptide of interest, insertion of the fragment into a vector molecule, and incorporation of the recombinant vector into a host cell where multiple copies or clones of the DNA sequence will be replicated. An isolated DNA sequence may be manipulated in a variety of ways to provide for expression of the polypeptide of interest. Manipulation of the DNA sequence prior to its insertion into a vector may be desirable or necessary depending on the expression vector and the host cell.

For example, to increase the likelihood that a, e.g., a bacterial enzyme gene is expressed in a yeast cell, the nucleotide sequence encoding the heterologous sequence may be adapted to optimize its codon usage to that of the yeast cell. The adaptiveness of a nucleotide sequence encoding enzyme to the codon usage of the host cell may be expressed as codon adaptation index (CAI). The CAI is herein defined as a measurement of the relative adaptiveness of the codon usage of a gene towards the codon usage of highly expressed genes. The relative adaptiveness (w) of each codon is the ratio of the usage of each codon, to that of the most abundant codon for the same amino acid. The CAI is defined as the geometric mean of these relative adaptiveness values. Non-synonymous codons and termination codons (dependent on genetic code) are excluded. CAI values range from 0 to 1 , with higher values indicating a higher proportion of the most abundant codons (see Sharp and Li, 1987, Nucleic Acids Research 15: 1281 -1295; also see: Jansen et al., 2003, Nucleic Acids Res. 31 (8):2242- 51 ). An adapted or "optimized" nucleotide sequence preferably has a CAI of at least 0.2, 0.3, 0.4, 0.5, 0.6 or 0.7.

The nucleotide sequence to be introduced into the DNA of the host cell may be integrated in vectors comprising the nucleotide sequence operably linked to one or more control sequences that direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences. The control sequence may be an appropriate promoter sequence, a nucleotide sequence which is recognized by a host cell for expression of the nucleotide sequence. The promoter sequence contains transcriptional control sequences, which mediate the expression of the coding sequence. The promoter may be any nucleotide sequence which shows transcriptional activity in the host cell of choice including native, mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell. The promoter may be a weak or a strong promoter that is constitutive or regulated in the host to be used. Strong, constitutive promoters are generally preferred for overexpression of the genes. Examples of suitable promoters for directing the transcription of the genes and vector constructs of the present invention in a yeast host cell are promoters obtained for example from the genes for Saccharomyces cerevisiae enolase (EN01 ), S. cerevisiae galactokinase (GAL1 ), S. cerevisiae alcohol dehydrogenase 2 (ADH2), S. cerevisiae glyceraldehyde-3-phosphate dehydrogenase (TDH 1 ), S. cerevisiae glyceraldehyde-3 - phosphate dehydrogenase (TDH3), S. cerevisiae alcohol dehydrogenase 1 (ADH1 ), S. cerevisiae 3-phosphoglycerate kinase (PGK1 ) or S. cerevisiae cytochrome C (CYC1 ) (Karhumaa et al, 2005), translation elongation factor 1 alpha (TEF1/TEF2) (Mumberg et al., 1995), PDC1 pyruvate decarboxylase (PDC1 ), pyruvate kinase (PYK1 ), and the constitutive truncated HXT7 promoter (Hauf et al. Enzym Microb Technol (2000) 26:688-698.) Other suitable vectors and promoters for use in yeast expression are further described in EP A- 73,657 to Hitzeman, which is hereby incorporated by reference. Preferred promoters for overexpressing a gene encoding an XK in a recombinant yeast cell according to the invention include, but are not limited to, PGK1 , TPI 1 , HXT7, TDH3, ADH2 and TEF2 promoters. Preferably, the promoter for overexpressing a gene encoding an XK according to the invention is TPI 1. Promoters for overexpressing the genes encoding an XR, an XDH, an XI, a TAL, a TKL and an RPI in a recombinant yeast cell according to the invention are preferably separately selected from the following: PGK1 , TDH3, TEF2, PDC1 , HXT truncated, TPI1 and PYK1 .

The above disclosed vectors may comprise a gene encoding the enzyme polypeptide, a promoter, and transcriptional and translational stop signals as well as other regulatory or structural DNA sequences known to a person of skill in the art. The vector may be any vector or nucleic acid (e.g., a plasmid, virus, integration vector or integration fragment), which can be conveniently subjected to recombinant procedures and can bring about the expression of the gene in the yeast host cell. The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. The vectors may be linear or closed circular plasmids, and may contain any means for assuring self-replication.

Alternatively, the vector may be one which, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. The vector system may be a single vector or plasmid or two or more vectors or plasmids, which together contain the total DNA to be introduced into the genome of the host cell, or a transposon. The vector may also be an integration vector comprising solely the gene or part of the gene to be integrated. For integration, the vector may rely on the DNA sequence encoding the polypeptide of interest or any other element of the vector for stable integration of the vector into the genome by homologous or non homologous recombination. More than one copy of a DNA sequence encoding a polypeptide of interest may be inserted into the host cell to amplify expression of the DNA sequence.

Optionally, the vectors of the present invention may contain one or more selectable markers, which permit easy selection of transformed cells. A selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs and the like. Useful expression vectors for yeast cells include, for example, the 2 [mu] (micron) plasmid and derivatives thereof, the Yip, YEp and YCp vectors described by Gietz and Sugino (1988. "New yeast vectors constructed with in vitro mutagenized yeast genes lacking six-base pair restriction sites", Gene 74:527-534), the vectors described in Mumberg et al. (1995, supra), YEplac-HXT vector (Karhumaa et al., 2005), the POT1 vector (U.S. Pat. No. 4,931 ,373), the pJS037 vector described in Okkels, Ann. New York Acad. Sci. 782, 202-207, 1996, the pPICZ A, B or C vectors (Invitrogen).

To achieve overexpression of an endogenous gene, promoter replacement methods may also be used to exchange the endogenous transcriptional control elements of the gene for another promoter (see, e.g., Mnaimneh et al. (2004) Cell 1 18(1 ):31 -44). Deletions of DNA control elements preventing a high expression of an endogenous target gene may be made using mitotic recombination as described in Wach et al. ((1994) Yeast 10:1793-1808). This method involves preparing a DNA fragment that contains a selectable marker between genomic regions that may be as short as 20 bp, and which bound a target DNA sequence. This DNA fragment can be prepared by PCR amplification of the selectable marker gene using as primers oligonucleotides that hybridize to the ends of the marker gene and that include the genomic regions that can recombine with the yeast genome. The linear DNA fragment can be efficiently transformed into a cell and recombined into the genome resulting in gene replacement including with deletion of the target DNA sequence (as described in Methods in Enzymology, v194, pp 281 -301 (1991 )).

For yeasts such as for Saccharomyces cerevisiae, DNA sequences surrounding a target gene coding sequence can be identified, e.g., in the complete genome sequence coordinated by Genome Project ID9518 of Genome Projects coordinated by NCBI (National Center for Biotechnology Information) with identifying GOPID #13838. Additional examples of yeast genomic sequences include that of Yarrowia lipolytica, GOPIC #13837, and of Candida albicans, which is included in GPID #10771 , #10701 and #16373. Additional genomes have been completely sequenced and annotated and are publicly available for the following yeast strains Candida glabrata CBS 138, Kluyveromyces lactis NRRL Y-1 140, Pichia stipitis CBS 6054, and Schizosaccharomyces pombe 972h-, available at http://biocyc.org/. Fermentation methods

In one aspectis a method for producing a fermentation product, comprising contacting the recombinant cell of any one of the preceding aspects and embodiments with a medium comprising a carbon source comprising xylose and/or arabinose under anaerobic conditions, and recovering or isolating the fermentation product from the medium.

The fermentation product may be or comprise, for example, at least one of ethanol, butanol, isobutanol, isopentanol, lactate, isoamylacetate, glycerol, sorbitol, mannitol, xylitol, arabinitol; 3-hydroxybutyrolactone; hydrogen gas; L-ascorbic acid, 2,5-furan dicarboxylic acid, 3-hydroxy propionic acid, aspartic acid, glutaric acid, glutamic acid, itaconic acid, levulinic acid; succinic acid, fumaric acid, malic acid or other 1 ,4-diacid; a fatty acid, a fatty acid derived molecule; an isoprenoid, an isoprenoid-derived molecule; and an alkane.

However, it is contemplated that other fermentation products can also be produced using the methods of the present invention. Preferably, the carbon source comprises xylose and the fermentation product comprises ethanol.

In some embodiments, the medium, i.e., the fermentation medium, is feedstock from a cellulosic saccharification process and/or feedstock from a hemicellulose pre-treatment process. Such feedstocks include, but are not limited to carbohydrates (e.g., lignocellulose, xylans, cellulose, starch, etc.), other sugars (e.g., glucose, xylose, arabinose, etc.), and other compositions. Compositions of fermentation media suitable for the growth of yeast are well known in the art and there are various reference texts that provide recipes for these media.

Typically, the fermentation takes place under conditions known to be suitable for generating the fermentation product. Fermentation conditions suitable for generating desired fermentation products are well known in the art and any suitable method finds use in the present invention. In some embodiments, the fermentation process is carried out under aerobic or microaerophilic (i.e., where the concentration of oxygen is less than that in air), or anaerobic conditions. In some embodiments, fermentation is conducted under anaerobic conditions (i.e., no detectable oxygen), or less than about 5, about 2.5, or about 1 mmol/L/h oxygen. In the absence of oxygen, the NADH produced in glycolysis cannot be oxidized by oxidative phosphorylation. Under anaerobic conditions, pyruvate or a derivative thereof may be utilized by the host cell as an electron and hydrogen acceptor in order to generate NAD+.

The fermentation process is typically run at a temperature that is optimal for the recombinant fungal cell. For example, in some embodiments, the fermentation process is performed at a temperature in the range of from about 25°C to about 42°C. Typically the process is carried out a temperature that is less than about 38°C, less than about 35°C, less than about 33°C, or less than about 38°C, but at least about 20°C, 22°C, or 25°C. Example 10 describes an exemplary assay for evaluating xylose consumption and/or ethanol production during fermentation of a xylose-containing fermentation medium.

In some embodiments, the recombinant cells of the present invention are grown under batch or continuous fermentation conditions. Classical batch fermentation is a closed system, wherein the composition of the medium is set at the beginning of the fermentation and is not subject to artificial alterations during the fermentation. A variation of the batch system is a fed- batch fermentation, which also finds use in the present invention. In this variation, the substrate is added in increments as the fermentation progresses. Fed-batch systems are useful when catabolite repression is likely to inhibit the metabolism of the cells and/or where it is desirable to have limited amounts of substrate in the medium. Batch and fed-batch fermentations are common and well known in the art. Continuous fermentation is an open system where a defined fermentation generally maintains the culture at a constant high density where cells are primarily in log phase growth. Continuous fermentation systems strive to maintain steady state growth conditions. Methods for modulating nutrients and growth factors for continuous fermentation processes, as well as techniques for modulating nutrients and growth factors for continuous fermentation processes as well as techniques for maximizing the rate of product formation are well known in the art of industrial microbiology.

The invention may further be described in the following numbered paragraphs: Paragraph [1 ]. A recombinant Saccharomyces cell which is capable of fermenting xylose and which comprises a heterologous gene encoding a xylulokinase (XK), wherein the XK:

- provides an enzymatic activity for converting D-xylulose to xylulose 5-phosphate at 5 least twice that provided by S. cerevisiae XK (SEQ ID NO: 32), and

- provides for an anaerobic growth rate of the recombinant cell on xylose which is higher than that provided by a S. cerevisiae XK.

Paragraph [2]. The recombinant cell of paragraph [1 ], wherein the XK further provides for an i o aerobic growth rate of the cell on xylose which is higher than that provided by S. cerevisiae XK.

Paragraph [3]. The recombinant cell of paragraph [1 ] or [2], wherein

(a) the enzymatic activity for converting D-xylulose to xylulose 5-phosphate is 15 measured according to Example 7,

(b) the anaerobic growth rate is measured according to Example 9, or

(c) both (a) and (b).

Paragraph [4]. The recombinant cell any one of the preceding paragraphs, wherein the XK 20 comprises the amino acid sequence of SEQ ID NO: 6, the amino acid sequence of SEQ ID NO: 22 or a catalytically active variant or fragment of any thereof.

Paragraph [5]. A recombinant yeast cell capable of fermenting xylose and which comprises a heterologous gene encoding a xylulokinase (XK) comprising the amino acid sequence of SEQ 25 ID NO: 6 or a catalytically active variant or fragment thereof.

Paragraph [6]. The recombinant yeast cell of paragraph [5], which is derived from a Saccharomyces, Rhodotorula Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell.

30

Paragraph [7]. The recombinant cell of any one of the preceding paragraphs, wherein the XK has a sequence identity of atleast 80% with the amino acid sequence of SEQ ID NO: 6.

Paragraph [8]. The recombinant cell of any one of the preceding paragraphs, wherein the XK 35 has a sequence identity of at

least 90% with the amino acid sequence of SEQ ID NO: 6. Paragraph [9]. The recombinant cell of any one of the preceding paragraphs, which comprises a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI), and

(a) a heterologous gene encoding a xylose reductase (XR) and a heterologous gene encoding a xylitol dehydrogenase (XDH), and/or

(b) a heterologous gene encoding a xylose isomerase (XI). Paragraph [10]. The recombinant cell of paragraph [9], wherein the XR is Pichia stipitis XR or an NADH-preferring variant thereof, and the XDH is Pichia stipitis XDH or a catalytically active variant thereof.

Paragraph [1 1 ]. The recombinant cell of any one of paragraphs [9] and [10], wherein the XR is Pichia stipitis XR comprising one or more amino acid substitutions selected from N272D, K270R and P275Q.

Paragraph [12]. The recombinant cell of any one of paragraphs [9] to [1 1 ], wherein the TAL, TKL and RKI coding sequences are endogenous to the cell.

Paragraph [13]. The recombinant cell of any one of the preceding paragraphs, which is derived from a Saccharomyces cerevisiae, bayanus or carlsbergensis cell.

Paragraph [14]. The recombinant cell of any one of the preceding paragraphs, which is derived from a Saccharomyces cerevisiae cell.

Paragraph [15]. The recombinant cell of any one of the preceding paragraphs, wherein each of said gene or genes is operably linked to an inducible, a regulated or a constitutive promoter, and is optionally integrated into the genome of the cell.

Paragraph [16]. The recombinant cell of any one of the preceding paragraphs, wherein the gene encoding an XK is operably linked to a strong constitutive promoter endogenous to the cell. Paragraph [17]. A strain or clone comprising the recombinant cell of any one of the preceding paragraphs. Paragraph [18]. A method for producing a fermentation product, comprising

(a) contacting the recombinant cell of any one of paragraphs [1 ] to [16] or the strain or clone of paragraph [17] with a medium comprising a carbon source comprising xylose or arabinose under anaerobic conditions, and

(b) isolating the fermentation product from the medium.

Paragraph [19]. The method of paragraph [18], wherein the fermentation product comprises at least one of ethanol, butanol, isobutanol, isopentanol, lactate, isoamylacetate, glycerol, sorbitol, mannitol, xylitol, arabinitol; 3-hydroxybutyrolactone; hydrogen gas; L-ascorbic acid, 2,5-furan dicarboxylic acid, 3-hydroxy propionic acid, aspartic acid, glutaric acid, glutamic acid, itaconic acid, levulinic acid; succinic acid, fumaric acid, malic acid or other 1 ,4-diacid; a fatty acid, a fatty-acid derived molecule; an isoprenoid, an isoprenoid-derived molecule; and an alkane.

Paragraph [20]. The method of any one paragraph [18] or [19], wherein the carbon source comprises xylose

and the fermentation product comprises ethanol.

Paragraph [21 ]. A vector comprising genes encoding

- an XK comprising the amino acid sequence of SEQ ID NO:6, the amino acid sequence of SEQ ID NO:22 or a catalytically active variant or fragment of any thereof,

- an XR, and

- an XDH; and optionally, regulatory sequences for expressing the genes in a

Saccharomyces host cell.

Paragraph [22]. A process for producing a recombinant Saccharomyces cell, comprising transforming a

Saccharomyces cell with one or more vectors comprising genes encoding

- an XK comprising the amino acid sequence of SEQ ID NO:6, the amino acid sequence of SEQ ID NO:22 or a catalytically active variant or fragment of any thereof,

- an XR, and

- an XDH, and optionally, regulatory sequences for expressing the genes in the host cell.

Paragraph [23]. A recombinant yeast cell which is capable of fermenting xylose and which comprises a heterologous gene encoding an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant, fragment or yeast ortholog thereof, wherein the Yme2p polypeptide provides for an increased tolerance of the recombinant cell to formic acid, acetic acid, or both. Paragraph [24]. The recombinant yeast cell of paragraph [23], wherein the Yme2p polypeptide provides for

(a) an increased anaerobic growth on xylose,

(b) an increased xylose consumption rate,

(c) an increased ethanol production rate, or

(d) a combination of two or all of (a) to (c),

of the recombinant cell in the presence of formic acid.

Paragraph [25]. The recombinant yeast cell of paragraph [24], wherein the anaerobic growth, xylose consumption, ethanol production rate, or combination is measured according to Example 14.

Paragraph [26]. The recombinant yeast cell of any one of paragraphs [23] to [25], wherein the Yme2p polypeptide provides for

(a) an increased xylose consumption rate,

(b) an increased ethanol production rate, or

(c) a combination of (a) and (b),

of the recombinant cell in the presence of acetic acid.

Paragraph [27]. The recombinant yeast cell of paragraph [26], wherein the xylose consumption rate, ethanol production rate or combination is measured according to Example 13.

Paragraph [28]. The recombinant yeast cell of any one of paragraphs [23] to [27], wherein the Yme2p polypeptide has a sequence identity of at least 70% with the amino acid sequence of SEQ ID NO: 50.

Paragraph [29]. The recombinant yeast cell of any one of paragraphs [23] to [28], wherein the Yme2p polypeptide comprises the amino acid sequence of SEQ ID NO: 50.

Paragraph [30]. The recombinant yeast cell of any one of paragraphs [23] to [29], which comprises a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI), a heterologous gene encoding a xylulokinase (XK), and (a) a heterologous gene encoding a xylose reductase (XR) and a heterologous gene encoding a xylitol dehydrogenase (XDH), and/or

(b) a heterologous gene encoding a xylose isomerase (XI). Paragraph [31 ]. The recombinant yeast cell of paragraph [30], wherein the XR is Pichia stipitis XR or an NADH-preferring variant thereof, and the XDH is Pichia stipitis XDH or a catalytically active variant thereof.

Paragraph [32]. The recombinant yeast cell of paragraph [30] or [31 ], wherein the XR is Pichia stipitis XR comprising one or more amino acid substitutions selected from N272D, K270R and P275Q.

Paragraph [33]. The recombinant yeast cell of any one of paragraphs [30] to [32], wherein the TAL, TKL, RKI and XK coding sequences are endogenous to the cell.

Paragraph [34]. The recombinant yeast cell of any one of paragraphs [23] to [33], wherein the yeast cell is

derived from a Saccharomyces, Rhodotorula Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell.

Paragraph [35]. The recombinant yeast cell of any one of paragraphs [23] to [34], which is derived from a Saccharomyces cerevisiae, bayanus or carlsbergensis cell. Paragraph [36]. The recombinant yeast cell of any one of paragraphs [23] to [35], which is derived from a Saccharomyces cerevisiae cell.

Paragraph [37]. The recombinant yeast cell of any one of paragraphs [23] to [36], wherein each of said gene or genes is operably linked to an inducible, a regulated or a constitutive promoter, and is optionally integrated into the genome of the cell.

Paragraph [38]. The recombinant yeast cell of any one of paragraphs [23] to [38], wherein the gene encoding an Yme2p polypeptide is operably linked to a strong constitutive promoter endogenous to the cell.

Paragraph [39]. A strain or clone comprising the recombinant yeast cell of any one of paragraphs [23] to [38].

Paragraph [40]. A method for producing a fermentation product, comprising

(a) contacting the recombinant yeast cell of any one of paragraphs [23] to [38] or the strain or clone of paragraph [39] with a medium comprising a carbon source comprising xylose or arabinose under anaerobic conditions, and

(b) isolating the fermentation product from the medium.

Paragraph [41 ]. The method of paragraph [40], wherein the fermentation product comprises at least one of ethanol, butanol, isobutanol, isopentanol, lactate, isoamylacetate, glycerol, sorbitol, mannitol, xylitol, arabinitol; 3-hydroxybutyrolactone; hydrogen gas; L-ascorbic acid, 2,5- furan dicarboxylic acid, 3-hydroxy propionic acid, aspartic acid, glutaric acid, glutamic acid, itaconic acid, levulinic acid; succinic acid, fumaric acid, malic acid or other 1 ,4-diacid; a fatty acid, a fatty-acid derived molecule; an isoprenoid, an isoprenoid-derived molecule; and an alkane.

Paragraph [42]. The method of paragraph [40] or [41 ], wherein the carbon source comprises xylose and the fermentation product comprises ethanol.

Paragraph [43]. A vector comprising genes encoding

- an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant or fragment thereof,

- an XR,

- an XDH, and

- an XK; and

- optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell.

Paragraph [44]. A process for producing a recombinant Saccharomyces cell, comprising transforming a Saccharomyces cell with one or more vectors comprising genes encoding

- an Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant or fragment thereof,

- an XR,

- an XDH,

- an XK, and

optionally, regulatory sequences for expressing the genes in the host cell. Paragraph [45]. A method for increasing the tolerance of a Saccharomyces cell to formic acid, comprising transforming the cell with a gene encoding a Yme2p polypeptide comprising the amino acid sequence of SEQ ID NO: 50 or a catalytically active variant, fragment or yeast ortholog thereof, and expressing the gene.

5

Paragraph [46]. The method of paragraph 23, wherein the gene is operably linked to an inducible, a regulated or a constitutive promoter.

Paragraph [47]. A recombinant yeast cell which is capable of fermenting xylose and which i o comprises a heterologous gene encoding

(a) an enolase comprising the amino acid sequence of SEQ ID NO: 132,

(b) a phosphofructokinase beta subunit polypeptide comprising the amino acid sequence of SEQ ID NO: 134,

(c) a 6-phosphofructo-2-kinase comprising the amino acid sequence of SEQ ID NO:

15 136,

(d) a glucose-6-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 138,

(e) a phosphoglycerate mutase comprising the amino acid sequence of SEQ ID NO:

140,

20 (f) a triose-phosphate isomerase comprising the amino acid sequence of SEQ ID NO:

142,

(g) a catalytically active variant, fragment or yeast ortholog of any one of (a) to (f), or

(h) a combination of any two or more of (a) to (g).

25 Paragraph [48]. The recombinant yeast cell of paragraph [47], wherein the heterologous gene or combination provides for an increased anaerobic growth rate on xylose, an increased aerobic growth rate on xylose, an increased ethanol production from xylose, or a combination of any two or all thereof.

30 Paragraph [49]. The recombinant yeast cell of paragraph [48], wherein the anaerobic growth rate is measured according to Example 23, the aerobic growth is measured according to Example 17, and/or the ethanol production is measured according to Example 24.

Paragraph [50]. The recombinant yeast cell of any one of paragraphs [47] to [49], comprising a 35 heterologous gene encoding an enolase comprising the amino acid sequence of SEQ ID NO:

132 or a catalytically active variant, fragment or yeast ortholog thereof. Paragraph [51 ]. The recombinant yeast cell of any one of paragraphs [47] to [49], comprising a heterologous gene encoding an a phosphofructokinase beta subunit polypeptide comprising the amino acid sequence of SEQ ID NO: 134 or a catalytically active variant, fragment or yeast ortholog thereof.

Paragraph [52]. The recombinant yeast cell of any one of paragraphs [47] to [49], comprising a heterologous gene encoding a 6- phosphofructo-2-kinase comprising the amino acid sequence of SEQ ID NO: 136 or a catalytically active variant, fragment or yeast ortholog thereof. Paragraph [53]. The recombinant yeast cell of any one of paragraphs [47] to [49], comprising a heterologous gene encoding a glucose-6-phosphate isomerase comprising the amino acid sequence of SEQ ID NO: 138 or a catalytically active variant, fragment or yeast ortholog thereof. Paragraph [54]. The recombinant yeast cell of any one of paragraphs [47] to [49], comprising a heterologous gene encoding a phosphoglycerate mutase comprising the amino acid sequence of SEQ ID NO: 140 or a catalytically active variant, fragment or yeast ortholog thereof.

Paragraph [55]. The recombinant yeast cell of any one of paragraphs [47] to [49], comprising a heterologous gene encoding a triosephosphate isomerase comprising the amino acid sequence of SEQ ID NO: 142 or a catalytically active variant, fragment or yeast ortholog thereof.

Paragraph [56]. The recombinant yeast cell of any one of paragraphs [47] to [55], wherein the catalytically active variant or yeast ortholog of has a least 90% sequence identity to the amino acid sequence of SEQ ID NO: 132, 134, 136, 138 140 or 142.

Paragraph [57]. The recombinant yeast cell of any one of paragraphs [47] to [56], wherein the catalytically active variant or yeast ortholog of has a least 95% sequence identity to the amino acid sequence of SEQ ID NO: 132, 134, 136, 138 140 or 142.

Paragraph [58]. The recombinant yeast cell of any one of paragraphs [47] to [57], which comprises a heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), and a heterologous gene encoding a ribose 5-phosphate ketol-isomerase (RKI), a heterologous gene encoding an XK, and at least one of

(a) a heterologous gene encoding a xylose reductase (XR) and an overexpressed gene encoding a xylitol dehydrogenase (XDH), and/or (b) a heterologous gene encoding a xylose isomerase (XI).

Paragraph [59]. The recombinant yeast cell of paragraph [58], wherein the XR is Pichia stipitis XR or an NADH-preferring variant thereof, and the XDH is Pichia stipitis XDH or a catalytically active variant thereof.

Paragraph [60]. The recombinant yeast cell of paragraph [58] or [59], wherein the XR is Pichia stipitis XR comprising one or more amino acid substitutions selected from N272D, K270R and P275Q.

Paragraph [61 ]. The recombinant yeast cell of any one of paragraphs [58] to [60], wherein the TAL, TKL, RKI and XK are endogenous to the cell.

Paragraph [62]. The recombinant yeast cell of any one of paragraphs [47] to [61 ], wherein the yeast cell is derived from a Saccharomyces, Rhodotorula, Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell.

Paragraph [63]. The recombinant yeast cell of any one of paragraphs [47] to [62], which is derived from a Saccharomyces cerevisiae, bayanus or carlsbergensis cell.

Paragraph [64]. The recombinant yeast cell of any one of paragraphs [47] to [63],, which is derived from a Saccharomyces cerevisiae cell. Paragraph [65]. The recombinant yeast cell of any one of paragraphs [47] to [64], wherein each of said gene or genes is operably linked to an inducible, a regulated or a constitutive promoter, and is optionally integrated into the genome of the cell.

Paragraph [66]. The recombinant yeast cell of any one of paragraphs [47] to [65], wherein the gene encoding an enolase, phosphofructokinase beta subunit polypeptide, 6-phosphofructo-2- kinase, glucose-6-phosphate isomerase, phosphoglycerate mutase or triose-phosphate isomerase is operably linked to a strong constitutive promoter endogenous to the cell.

Paragraph [67]. A strain or clone comprising the recombinant yeast cell of any one of paragraphs [47] to [66], Paragraph [68]. A method for producing a fermentation product, comprising

(a) contacting the recombinant cell of any one of paragraphs [47] to [66], 0 or the strain or clone of paragraph [67] with a medium comprising a carbon source comprising xylose or arabinose under anaerobic conditions, and

(b) isolating the fermentation product from the medium.

Paragraph [69]. The method of paragraph [68], wherein the fermentation product comprises at least one of ethanol, butanol, isobutanol, isopentanol, lactate, isoamylacetate, glycerol, sorbitol, mannitol, xylitol, arabinitol; 3-hydroxybutyrolactone; hydrogen gas; L-ascorbic acid, 2,5-furan dicarboxylic acid, 3-hydroxy propionic acid, aspartic acid, glutaric acid, glutamic acid, itaconic acid, levulinic acid; succinic acid, fumaric acid, malic acid or other 1 ,4-diacid; a fatty acid, a fatty-acid derived molecule; an isoprenoid, an isoprenoid-derived molecule; and an alkane. Paragraph [70]. The method of paragraph [68] or [69], wherein the carbon source comprises xylose and the fermentation product comprises ethanol.

Paragraph [71 ]. A vector comprising genes encoding

- a polypeptide comprising the amino acid sequence of SEQ ID NO: 132, 134, 136, 138 140 or 142 or a catalytically active variant, fragment or yeast ortholog of any thereof,

- a XR,

- a XDH, and

- a XK; and

optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell.

Paragraph [72]. A process for producing a recombinant Saccharomyces cell, comprising transforming a Saccharomyces cell with one or more vectors comprising genes encoding

- a polypeptide comprising the amino acid sequence of SEQ ID NO: 132, 134, 136, 138 140 or 142 or a catalytically active variant or fragment of any thereof,

- an XR,

- an XDH,

- an XK, and,

optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell. Paragraph [73]. A recombinant yeast cell which is capable of fermenting xylose and which comprises a heterologous gene encoding a phosphoglucomutase and/or phosphoribomutase comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically active variant or fragment thereof.

Paragraph [74]. The recombinant yeast cell of paragraph [73], wherein the heterologous gene provides for an increased anaerobic growth rate on xylose, an increased xylose consumption, an increased ethanol production from xylose, or a combination of any two or all thereof.

Paragraph [75]. The recombinant yeast cell of paragraph [74], wherein the anaerobic growth rate is measured according to Example 26, the anaerobic growth is measured according to Example 27, and/or the ethanol production is measured according to Example 27.

Paragraph [76] The recombinant yeast cell of any one of paragraphs [73] to [75], wherein the catalytically active variant has a sequence identity of at least 30%, at least 50%, at least 70%, at least 80%, at least 90% or at least 95% with the amino acid sequence of SEQ ID NO: 150. Paragraph [77]. The recombinant yeast cell of any one of paragraphs [73] to [76], wherein the phosphoglucomutase and/or phosphoribomutase comprises the amino acid sequence of SEQ ID NO: 150.

Paragraph [78]. The recombinant yeast cell of any one of paragraphs [73] to [77], which comprises heterologous gene encoding a transaldolase (TAL), a heterologous gene encoding a transketolase (TKL), a heterologous gene encoding a ribose 5-phosphate ketol-isomerase

(RKI), a heterologous gene encoding a xylulokinase (XK), and at least one of:

(a) a heterologous gene encoding a xylose reductase (XR) and a heterologous gene encoding a xylitol dehydrogenase (XDH), and/or

(b) a heterologous gene encoding a xylose isomerase (XI).

Paragraph [79]. The recombinant yeast cell of paragraph [78], wherein the XR is Pichia stipitis XR or an NADH-preferring variant thereof, and the XDH is Pichia stipitis XDH or a catalytically active variant thereof.

Paragraph [80]. The recombinant yeast cell of paragraph [78] or [79], wherein the XR is Pichia stipitis XR comprising one or more amino acid substitutions selected from N272D, K270R and P275Q. Paragraph [81 ]. The recombinant yeast cell of any one of paragraphs [78] to [80], wherein the TAL, TKL, RKI and XK coding sequences are endogenous to the cell. Paragraph [82]. The recombinant yeast cell of any one of paragraphs [73] to [81 ], wherein the yeast cell is derived from a Saccharomyces, Rhodotorula, Schizosaccharomyces, Kluyveromyces, Pichia, Hansenula, Rhodosporidium, Candida, Yarrowia, Lipomyces, Cryptococcus, or Dekkera sp. cell.

Paragraph [83]. The recombinant yeast cell of any one of paragraphs [73] to [82], which is derived from a Saccharomyces cerevisiae, bayanus or carlsbergensis cell.

Paragraph [84]. The recombinant yeast cell of any one of paragraphs [73] to [83], which is derived from a Saccharomyces cerevisiae cell.

Paragraph [85]. The recombinant yeast cell of any one of paragraphs [73] to [84], wherein each of said gene or genes is operably linked to an inducible, a regulated or a constitutive promoter, and is optionally integrated into the genome of the cell.

Paragraph [86]. The recombinant yeast cell of any one of paragraphs [73] to [85], wherein the overexpressed gene encoding a phosphoglucomutase and/or phosphoribomutase is operably linked to a strong constitutive promoter endogenous to the cell.

Paragraph [87]. A strain or clone comprising the recombinant yeast cell of any one of paragraphs [73] to [86].

Paragraph [88]. A method for producing a fermentation product, comprising

(c) contacting the recombinant cell of any one of paragraphs [73] to [86] or the strain or clone of paragraph [87] with a medium comprising a carbon source comprising xylose or arabinose under anaerobic conditions, and

(d) isolating the fermentation product from the medium.

Paragraph [89]. The method of paragraph [88], wherein the fermentation product comprises at least one of ethanol, butanol, isobutanol, isopentanol, lactate, isoamylacetate, glycerol, sorbitol, mannitol, xylitol, arabinitol; 3-hydroxybutyrolactone; hydrogen gas; L-ascorbic acid, 2,5- furan dicarboxylic acid, 3-hydroxy propionic acid, aspartic acid, glutaric acid, glutamic acid, itaconic acid, levulinic acid; succinic acid, fumaric acid, malic acid or other 1 ,4-diacid; a fatty acid, a fatty-acid derived molecule; an isoprenoid, an isoprenoid-derived molecule; and an alkane. Paragraph [90]. The method of paragraph [88] or [89], wherein the carbon source comprises xylose and the fermentation product comprises ethanol.

Paragraph [91 ]. A vector comprising genes encoding

5 - a polypeptide comprising the amino acid sequence of SEQ ID NO: 150 or a catalytically active variant, fragment or yeast ortholog thereof,

- an XR,

- an XDH, and

- an XK; and

i o optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell.

Paragraph [92]. A process for producing a recombinant Saccharomyces cell, comprising transforming a Saccharomyces cell with one or more vectors comprising genes encoding

- a polypeptide comprising the amino acid sequence ofSEQ ID NO: 150 or a 15 catalytically active variant or fragment thereof,

- an XR,

- an XDH,

- an XK, and,

optionally, regulatory sequences for expressing the genes in a Saccharomyces host cell.

20

The following examples are provided by way of illustration and are not intended to be limiting of the invention.

Examples

25 Example 1 : Construction of a genetically modified S. cerevisiae strain overexpressing three of the genes in the pentose phosphate pathway (TAL1 (transaldolase), TKL1 (transketolase) and RKI1 (ribose 5-phosphate ketol isomerase)

Strains, media and genetic techniques

Escherichia coli strain DH5a (Life Technologies, Rockville, MD, USA) was used for

30 subcloning. E. coli was grown in LB-medium (Ausubel et al., 1995). Yeast cells from freshly streaked YPD plates (Ausubel et al., 1995) were used for inoculation. Plasmid DNA was prepared with the GeneJETTM Plasmid Miniprep Kit (Fermentas UAB, Vilnius, Lithuania). Agarose gel DNA extraction was made with QIAquick® Gel Extraction Kit (Qiagen GmbH, Hilden, Germany). Primers from MWG-Biotech AG (Ebersberg, Germany) and Pfu DNA

35 Polymerase and dNTP from Fermentas (Vilnius, Lithuania) were used for polymerase chain reactions (PCR). PCR product purification was made with the E.Z.N.A.® Cycle Sequencing Kit (Omega Bio-tek Inc., Doraville, GA, USA). Sequencing was performed by MWG-Biotech AG (Ebersberg, Germany). Restriction endonucleases, FastAP Thermosensitive Alkaline Phosphatase and T4 DNA Ligase from Fermentas (Vilnius, Lithuania) were used for DNA manipulation. Competent E. coli DH5a cells were transformed as described elsewhere (Inoue et al., 1990) and transformed E. coli cells were selected on LB plates (Ausubel et al., 1995) containing 100 mg/l ampicillin (IBI Shelton Scientific Inc., Shelton, CT, USA). E. coli strains were grown in LB medium containing 100 mg/l ampicillin for plasmid amplifications. Yeast strains were transformed with the lithium acetate method (Gietz and Schiestl, 2007) and transformed yeast strains were selected on YPD plates with a pH set to 7.5 containing 50 μg/ml zeocin (Invitrogen, Groningen, The Netherlands).

Construction of YlpTAL containing the S. cerevisiae transaldolase (TAL1 ) gene

Plasmid pB3 PGK TAL1 (Johansson and Hahn-Hagerdal, 2002) containing the S. cerevisiae transaldolase (TAL1 ) gene under control of the PGK1 promoter and the GCY1 terminator from S. cerevisiae was digested with restriction enzymes Xcml and Ehel, and ends of the resulting fragment were made blunt by the use of T4 DNA polymerase (Boehringer Mannheim, Indianapolis, IN) and the vector was re-ligated resulting in YlpTAL. Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing. Construction of YIpTALTKL containing the S. cerevisiae transaldolase (TAL1 ) gene and transketolase (TKL1 ) gene

The DNA cassette PGKp-TKL1 -GCYt containing the S. cerevisiae transketolase (TKL1 ) gene under control of the PGK1 promoter and the GCY1 terminator from S. cerevisiae was PCR amplified having as template plasmid pB3 PGK TKL1 (Johansson and Hahn-Hagerdal, 2002) and using primers FwdTKL and RevTKL identified by

GGTACCGAGCTCTAACTGATCTATCCAAAACTG (SEQ ID NO: 1 ) and

GGTACCGATCAGCATGCGATCGCTCGACATTTGATATAC (SEQ ID NO: 2),

which were including the restriction site Kpnl at the ends of the amplified DNA cassette. The PCR product PGKp-TKL1 -GCYt was then digested with Kpnl restriction enzyme. The resulting purified DNA fragment was inserted into the plasmid YlpTAL, which had also been cleaved with the restriction enzyme Kpnl and which had been dephosphorylated. The resulting plasmid was named YIpTALTKL. Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing.

Construction of YlpTTR containing the S. cerevisiae transaldolase (TAL1 ) gene, transketolase (TKL1 ) gene and ribose 5-phosphate ketol isomerase (RKI 1 ) gene

The DNA cassette PGKp-RKI 1 -GCYt containing the S. cerevisiae ribose 5-phosphate ketol isomerase (RKI1 ) gene under control of the PGK1 promoter and the GCY1 terminator from S. cerevisiae was PCR amplified having as template plasmid pB3 PGK RKI 1 (Johansson and Hahn-Hagerdal, 2002) and using primers FwdRKI and RevRKI identified by 5'-CCGCGGGAGCTCTAACTGATCTATCCAAAACTG-3' (SEQ ID NO: 3) and

5'-CCGCGGGATCAGCATGCGATCGCTCGACATTTGATATAC-3' (SEQ ID NO: 4) which were including the restriction site Sacl l at the ends of the amplified DNA cassette. The PCR product PGKp-RKI 1 -GCYt was then digested with Sacll restriction enzyme. The resulting purified DNA fragment was inserted into the plasmid YIpTALTKL, which also had been cleaved with the restriction enzyme Kpnl and which had been dephosphorylated. The resulting plasmid was named YlpTTR. Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing.

Construction of C5LTe1001 containing YlpTTR

YlpTTR was cleaved with Spel within the RKI 1 gene and transformed into strain C5LTe1000. This resulted in strain C5LTe1001 . Strain C5LTe1000 was deposited in accordance with the terms of the Budapest Treaty on 4 November 2014 with DSMZ - Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH, Inhoffenstr. 7B, 38124 Braunschweig, Germany - under DSMZ accession number DSM 29597.

Example 2: Strains, Media and Genetic Techniques

Escherichia coli strain NEB 5-alpha (New England BioLabs, Ipswich, MA, USA) was used for subcloning. £ coli was grown in LB-medium (Ausubel et al., 1995). Yeast cells from freshly streaked YPD plates (Ausubel et al., 1995) were used for inoculation. Plasmid DNA was prepared with the GeneJETTM Plasmid Miniprep Kit (Fermentas UAB, Vilnius, Lithuania). Agarose gel DNA extraction was made with QIAquick® Gel Extraction Kit (Qiagen GmbH, Hilden, Germany). Primers from Eurofins MWG Operon (Ebersberg, Germany) and Phusion Hot Start II High-Fidelity DNA Polymerase and dNTP from Fermentas (Vilnius, Lithuania) were used for polymerase chain reactions (PCR). PCR product purification was made with the E.Z.N.A.® Cycle Sequencing Kit (Omega Bio-tek Inc., Doraville, GA, USA). Sequencing was performed by Eurofins MWG Operon (Ebersberg, Germany). Restriction endonucleases from Fermentas (Vilnius, Lithuania) were used for DNA manipulation. Competent £ coli cells were transformed as described elsewhere (Inoue et al., 1990) and transformed £ coli cells were selected on LB plates (Ausubel et al., 1995) containing 100 mg/l ampicillin (IBI Shelton Scientific Inc., Shelton, CT, USA). £. coli strains were grown in LB medium containing 100 mg/l ampicillin for plasmid amplifications. Yeast strains were transformed with the lithium acetate method (Gietz and SchiestI, 2007) and transformed yeast strains were selected on Yeast Nitrogen Base plates (YNB) (6.7 g/l Difco Yeast Nitrogen Base without amino acids; Becton Dickinson and Company, Sparks, MD, USA) supplemented with 40 g/l xylose and buffered at pH 5.5 with 10.21 g/l potassium hydrogen phthalate. Example 3: Construction of an S. cerevisiae strain overexpressing three of the genes in the pentose phosphate pathway and expressing a Spathaspora passalidarum xylulose kinase gene, a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a 5 mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

Strains, media and genetic techniques

The strains, media and genetic techniques described in Example 2 were used.

Construction of synthetic xylulose kinase gene encoding Spathaspora passalidarum xylulose kinase based on NCBI Accession Code XP 0073731 12 under control of the TPI1 Promoter i o and the PGK1 terminator from S. cerevisiae

The entire Spathaspora passalidarum xylulose kinase gene (XKsp) was synthesized and assembled by Eurofins MWG Operon (Ebersberg, Germany). Codon usage in the sequence was optimised based on the yeast codon usage table from the Kazusa codon usage database. The TPI1 promoter from S. cerevisiae and the PGK1 terminator from S. cerevisiae

15 were also included in the synthetic construct; the TPI1 promoter before the ATG-start codon and the PGK1 terminator after the stop-codon. The nucleotide sequence of TPI1 promoter, XKsp and PGK1 terminator is identified as SEQ ID NO: 5, showing the synthesized DNA sequence. The corresponding amino acid sequence of the coding region of XKsp is identified in SEQ ID NO: 6. The harboring plasmid was named pC5e0022.

20 Construction of pC5e0049 containing a Spathaspora passalidarum xylulose kinase gene, a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

The DNA cassette TPI1 p-XKsp-PGK1 t was PCR amplified having as template plasmid pC5e0022 and using the following primers.

25 TPI 1 p_fwd (SEQ ID NO: 7):

5'-TCTTC CACAC CTGCA GTATA TCTAG GAACC CATCA G-3'

reverse-PGK1 t (SEQ ID NO: 8):

5'-ATCAG TTAGA CTGCA GGAAC ATAGA AATAT CGAAT GGGAA-3'

The resulting purified DNA fragment was inserted into plasmid YlpDR7 (Runquist et al. 30 2010), containing a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene under control of the PGK1 promoter and the PGK1 terminator from S. cerevisiae and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene under control of the TDH3 promoter and the ADH1 terminator from S. cerevisiae. YlpDR7 which had been cleaved with restriction enzyme Pstl and the DNA fragment was inserted by In-Fusion cloning (Clontech, 35 California, CA, USA). The resulting plasmid was named pC5e0049. Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing.

Construction of C5LTe1042 containing YlpTTR and pC5e0049

The Plasmid pC5e0049 was digested with restriction enzyme EcoRV within the URA3 gene and it was thereafter transformed into strain C5LTe1001 . This resulted in strain C5LTe1042.

Example 4: Construction of an S. cerevisiae strain overexpressing three of the genes in the pentose phosphate pathway and a xylulose kinase (XK) gene and expressing a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

Strains, media and genetic techniques

The strains, media and genetic techniques described in Example 2 were used.

Construction of pC5e0024 containing a S. cerevisiae xylulose kinase gene, a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

The TPI1 promoter from S. cerevisiae (TPM p) was PCR amplified having as template plasmid pC5e0022 and using the following primers.

TPI 1 p_fwd (SEQ ID NO: 9):

5'-TCTTC CACAC CTGCA GTATA TCTAG GAACC CATCA G-3'

R_TPI1 p (SEQ ID NO: 10):

5 -CTGTC TCTGA ATTAC TGAAC ACAAC ATTTT TAGTT TATGT ATGTG TTT-3

The PGK1 terminator from S. cerevisiae (PGK1 t) was PCR amplified having as template plasmid pC5e0022 and using the following primers.

fwdS_PGK1 t (SEQ ID NO: 1 1 ):

5'-GCGAACTGGAAAAGACTCTCATCTAAAGATCTCCCATGTCTCTACTGG-3' reverse_PGK1 t (SEQ ID NO: 12):

5'-ATCAGTTAGACTGCAGGAACATAGAAATATCGAATGGGAA-3'

The S. cerevisiae xylulose kinase gene (Xkcere; SEQ ID NO: 31 ), encoding S. cerevisiae XK (SEQ ID NO: 32) was PCR amplified having as template plasmid YIpXK (Lonn et a/., 2003) and using the following primers

XKcere_fwd (SEQ ID NO: 13):

5'-AAACACATACATAAACTAAAAATGTTGTGTTCAGTAATTCAGAGACAG-3'

XKcere_rev (SEQ ID NO: 14):

5 -CCAGTAGAGACATGGGAGATCTTTAGATGAGAGTCTTTTCCAGTTCGC-3' The DNA cassette TP11 p-XKcere-PGK11 was PCR amplified by overlap extension PCR having as template the three purified DNA fragments TPU p, PGK1 t and XKcere using the following primers

TPI 1 p_fwd (SEQ ID NO: 15):

5'-TCTTCCACACCTGCAGTATATCTAGGAACCCATCAG-3'

reverse_PGK1 t (SEQ ID NO: 16):

5'-ATCAGTTAGACTGCAGGAACATAGAAATATCGAATGGGAA-3'

The resulting purified DNA fragment was inserted into the plasmid YlpDR7 (Runquist et al. 2010), which had been cleaved with restriction enzyme Pstl. The DNA fragment was inserted by In-Fusion cloning (Clontech, California, CA, USA). The resulting plasmid was named pC5e0024. Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing.

Construction of of C5LTe1035 containing YlpTTR and pC5e0024

Plasmid pC5e0024 was cleaved with restriction enzyme EcoRV within the URA3 gene and it was thereafter transformed into strain C5LTe1001. This resulted in strain C5LTe1035.

Example 5: Construction of an S. cerevisiae strain overexpressing three of the genes in the pentose phosphate pathway and expressing an Escherichia coli xylulose kinase (XK) gene, a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

Strains, media and genetic techniques

The strains, media and genetic techniques described in Example 2 were used.

Construction of synthetic xylulose kinase gene encoding Escherichia coli xylulose kinase based on NCBI Accession Code YP 001460359 under control of the TPI1 Promoter and the PGK1 terminator from S. cerevisiae

The entire E. coli xylulose kinase gene (XKcoli) was synthesized and assembled by Eurofins MWG Operon (Ebersberg, Germany). Codon usage in the sequence was optimised based on the yeast codon usage table from the Kazusa codon usage database. The TPI1 promoter from S. cerevisiae and the PGK1 terminator from S. cerevisiae were also included in the synthetic construct; the TPI1 promoter before the ATG-start codon and the PGK1 terminator after the stop-codon. The nucleotide sequence of TPI 1 promoter, XKcoli and PGK1 terminator is identified as SEQ ID NO: 17, showing the synthesized DNA sequence. The corresponding amino acid sequence of the coding region of XKcoli is identified in SEQ ID NO: 18. The harboring plasmid was named pC5e0012. Construction of pC5e0046 containing a E. co// xylulose kinase gene, a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

The DNA cassette TPI 1 p-XKcoli-PGK1 t was PCR amplified having as template plasmid 5 pC5e0012 and using the following primers.

TPI 1 p_fwd (SEQ ID NO: 19):

5'-TCTTCCACACCTGCAGTATATCTAGGAACCCATCAG-3'

reverse-PGK1 t (SEQ ID NO: 20):

5'-ATCAGTTAGACTGCAGGAACATAGAAATATCGAATGGGAA-3'. i o The resulting purified DNA fragment was inserted into plasmid YlpDR7 (Runquist et al.

2010), containing a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene under control of the PGK1 promoter and the PGK1 terminator from S. cerevisiae and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene under control of the TDH3 promoter and the ADH1 terminator from S. cerevisiae. YlpDR7 which had been cleaved with

15 restriction enzyme Pstl and the DNA fragment was inserted by In-Fusion cloning (Clontech, California, CA, USA). The resulting plasmid was named pC5e0046. Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing.

Construction of C5LTe1040 containing YlpTTR and pC5e0046

Plasmid pC5e0046 was digested with restriction enzyme EcoRV within the URA3 gene 20 and it was thereafter transformed into strain C5LTe1001. This resulted in strain C5LTe1040.

Example 6: Construction of an S. cerevisiae strain overexpressing three of the genes in the pentose phosphate pathway and expressing a Kluyveromyces marxianus xylulose kinase (XK) gene, a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a 25 mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

Strains, media and genetic techniques

The strains, media and genetic techniques described in Example 2 were used.

Construction of synthetic xylulose kinase gene encoding Kluyveromyces marxianus xylulose kinase based on NCBI Accession Code ADW23548

The entire Kluyveromyces marxianus xylulose kinase gene (XKmarx) was synthesized and assembled by Eurofins MWG Operon (Ebersberg, Germany). Codon usage in the sequence was optimised based on the yeast codon usage table from the Kazusa codon usage database. The nucleotide sequence of XKmarx is identified as SEQ ID NO:21 , showing the synthesized DNA sequence. The corresponding amino acid sequence of the coding region of

XKmarx is identified in SEQ ID NO: 22. The harboring plasmid was named pC5e0043. Construction of pC5e0051 containing a Kluyveromyces marxianus xylulose kinase gene, a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

The TPI1 promoter from S. cerevisiae (TPM p) was PCR amplified having as template 5 plasmid pC5e0022 and using the following primers.

TPI 1 p_fwd (SEQ ID NO: 23):

5'-TCTTCCACACCTGCAGTATATCTAGGAACCCATCAG-3'

Rev_TPI 1 p (SEQ ID NO: 24):

5 -GCCTAAGTAATATGGAGTCGACATTTTTAGTTTATGTATGTGTTT-3 . i o The PGK1 terminator from S. cerevisiae (PGK1 t) was PCR amplified having as template plasmid pC5e0022 and using the following primers.

PGK1 t_forS (SEQ ID NO: 25):

5'-CTTTAGCACAATCTCAGGGTCAATAAAGATCTCCCATGTCTCTACTGG-3' reverse_PGK1 t (SEQ ID NO: 26):

15 5'-ATCAGTTAGACTGCAGGAACATAGAAATATCGAATGGGAA-3'

The Kluyveromyces marxianus kinase gene (XKmarx) was PCR amplified having as template plasmid pC5e0043 and using the following primers

XKmarx_fwd (SEQ ID NO: 27):

5'-AAACACATACATAAACTAAAAATGTCGACTCCATATTACTTAGGC-3' 20 XKmarx_rev (SEQ ID NO: 28):

5 -CCAGTAGAGACATGGGAGATCTTTATTGACCCTGAGATTGTGCTAAAG-3'

The DNA cassette TPI 1 p-XKmarx-PGK1t was PCR amplified by overlap extension PCR having as template the three purified DNA fragments TPM p, PGK1 t and XKmarx using the following primers:

25 TPI 1 p_fwd (SEQ ID NO: 29):

5'-TCTTCCACACCTGCAGTATATCTAGGAACCCATCAG-3'

reverse_PGK1 t (SEQ ID NO: 30):

5'-ATCAGTTAGACTGCAGGAACATAGAAATATCGAATGGGAA-3'

The resulting purified DNA fragment was inserted into the plasmid YlpDR7 (Runquist et 30 al. 2010), which had been cleaved with restriction enzyme Pstl. The DNA fragment was inserted by In-Fusion cloning (Clontech, California, CA, USA). The resulting plasmid was named pC5e0051 . Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing. Construction of C5LTe1043 containing YlpTTR and pC5e0051

Plasmid pC5e0051 was digested with restriction enzyme EcoRV within the URA3 gene and it was thereafter transformed into strain C5LTe1001 . This resulted in strain C5LTe1043.

Example 7: Enzyme activity assays

Cell extracts for activity assays were prepared from exponentially growing aerobic batch cultures. Cells were collected by centrifugation, washed with sterile water, resuspended in an appropriate amount of Y-PER reagent (Pierce; Rockford, III., USA), and processed according to the instructions. Protein concentrations were determined with the Bradford Protein Assay (Pierce, Rockford, IL, USA) against a bovine serum albumin standard. Xylulose kinase (XK) and xylitol dehydrogenase (XDH) were measured as described by Shamanna and Sanderson (1979). XK activity was determined in two steps. First, the XDH activity was determined in the absence of ATP, and then the sum of the XK and XDH activities in the presence of ATP was determined, the XK activity being the difference. All enzyme activity measurements were performed at 30°C. Specific activities are expressed as units per milligram of protein. One unit of enzyme activity is defined as 1 μηηοΙ of substrate converted per minute.

As shown in Table 5, strains C5LTe1042, C5LTe1048 and C5LT1208 had higher XK activity than their corresponding control strains. All strains with a heterologous XK gene had higher XK activity when compared to the wild-type strain C5LTe1000.

Table 5. XK activities in constructed strains.

Example 8: Aerobic growth on xylose

Cells were grown on YPD medium (Yeast extract, Peptone and 20 g/l glucose) overnight, and inoculated into mineral medium with xylose as the sole carbon source (13.4 g/l Yeast Nitrogen Base and 50 g/l xylose, buffered with 10.2 g/l potassium hydrogen phthalate to pH 5.5). Starting OD at 620 nm was around 0.1 . Growth was measured by increase in OD 620nm. Maximum specific growth rate was calculated from time points 8 to 30 hours, and they were 0.16 h-1 for C5LTe1035 and 0.19 h-1 for C5LTe1042. See Fig. 1 , which shows curves for the aerobic growth on xylose for C5LTe1035 and C5LTe1042.

In conclusion, strains C5LTe1042, C5LTe1048 and C5LT1208 grew more than their corresponding control strains. Example 9: Anaerobic growth on xylose

Yeast strains were analysed for anaerobic growth on xylose in 96-well microplate yeast cultures. Prior to experiments, yeast was grown in semi-aerobic precultures in YNB medium with 50 g/l xylose, in microplates, overnight. For measurement of anaerobic growth, cells were inoculated in 200 μΙ of the same medium where solution of ergosterol and Tween80 had been added at final concentration of 0.03 and 1 .2 g/l, respectively. 50 μΙ mineral oil was added on top of each well to keep culture anaerobic. 20 μΙ of pre-cultured cells were added. The growth was followed in Multiskan FC (Thermo Scientific) at 30°C and growth was measured as increase of OD (620nm).

In conclusion, strains with K. marxianus XK or S. passalidarium XK grow faster and more under anaerobic conditions than the control strain with S. cerevisiae XK. See Fig. 2, which shows curves over anaerobic growth of C5LTe1042 compared with control strain C5LTe1035, Fig. 3, which shows curves over anaerobic growth of C5LTe1043 compared with control strain C5LTe1035, and Fig. 4, which shows curves over anaerobic growth of C5LTe1040 compared with control strain C5LTe1035.

Example 10: Anaerobic fermentation on xylose

Cells were pre-grown on mineral medium with glucose as a carbon source (YNB 6.7 g/l and 20 g/l glucose). Cells were then inoculated in fermenters (Applikon) at starting biomass concentration of about 0.15 g/l cell dry weight. Mixing was set at 200 rpm and the gas outlet was closed by a waterlock. Fermentation medium consisted of YNB (6.7 g/l) with 50 g/l xylose and 20 g/l glucose, supplemented with Tween80 (1 .2 g/l) and ergosterol (0.03 g/l). The pH was controlled at 5.5 with KOH.

Concentrations of glucose, xylose, ethanol, glycerol, and xylitol were determined by high performance liquid chromatography (Waters, Milford, MA, USA). The compounds were separated with a Shodex SUGAR SP0810 Pb2+ copolymer-based column (Showa Denko America, NY, USA) preceded by a Micro-Guard Carbo-C guard column (Bio-Rad, Hercules, CA, USA). Separation was performed at 80°C, with H20 at a flow rate of 0.6 ml min-1 as mobile phase. Compounds were quantified by refractive index detection (Waters). A seven- point calibration curve was made for each compound to calculate concentrations. See Fig. 5, which shows xylose fermentation by strains C5LTe1035 (dashed line) and C5LTe1042 (solid line). Glucose: circles, Xylose: squares, Xylitol: stars, Biomass: line. See also Fig. 6, which shows xylose fermentation by strains C5LTe1036 (dashed line) and C5LTe1048 (solid line). Glucose: circles, Xylose: squares, Xylitol: stars, Biomass: line. And, see also Fig. 7, which shows xylose fermentation by strains C5LTe1204 (dashed line) and C5LTe1208 (solid line). Glucose: circles, Xylose: squares, Xylitol: stars, Biomass: line.

Table 6. Summary of xylose consumption and ethanol production in anaerobic xylose fermentation of 50 g/l xylose and 20 g/l glucose in mineral medium within 72 hours of fermentation. See also a graphic depiction of this data in Fig. 8.

5

In conclusion, these data show that strains C5LTe1042, C5LTe1048 and C5LT1208 carrying S. passalidarium XK consumed more xylose and produced more ethanol than their corresponding control strains C5LTe1035, C5LTe1036 and C5LT1204, respectively. i o Example 11 : Construction of an S. cerevisiae strain overexpressing three of the genes in the pentose phosphate pathway and a xylulose kinase (XK) gene and expressing a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

Strains, media and genetic techniques

15 Yeast cells from freshly streaked YPD plates (Ausubel et al., 1995) were used for inoculation. Plasmid DNA was prepared with the GeneJETTM Plasmid Miniprep Kit (Fermentas UAB, Vilnius, Lithuania). Agarose gel DNA extraction was made with QIAquick® Gel Extraction Kit (Qiagen GmbH, Hilden, Germany). Primers from Eurofins MWG Operon (Ebersberg, Germany) and Phusion Hot Start II High-Fidelity DNA Polymerase and dNTP from Fermentas

20 (Vilnius, Lithuania) were used for polymerase chain reactions (PCR). PCR product purification was made with the E.Z.N.A.® Cycle Sequencing Kit (Omega Bio-tek Inc., Doraville, GA, USA). Sequencing was performed by Eurofins MWG Operon (Ebersberg, Germany). Restriction endonucleases from Fermentas (Vilnius, Lithuania) were used for DNA manipulation. Competent E. coli cells were transformed as described elsewhere (Inoue et al., 1990) and

25 transformed E. coli cells were selected on LB plates (Ausubel et al., 1995) containing 100 mg/l ampicillin (IBI Shelton Scientific Inc., Shelton, CT, USA). E. coli strains were grown in LB medium containing 100 mg/l ampicillin for plasmid amplifications. Yeast strains were transformed with the lithium acetate method (Gietz et al., 2007) and transformed yeast strains were selected on Yeast Nitrogen Base plates (YNB) (6.7 g/l Difco Yeast Nitrogen Base without

30 amino acids; Becton Dickinson and Company, Sparks, MD, USA) supplemented with 40 g/l xylose and buffered at pH 5.5 with 10.21 g/l potassium hydrogen phthalate.

Construction of C5LTe1201 containing YlpTTR

YlpTTR was cleaved with Spel within the RKI 1 gene and transformed into strain TMB 3000, a robust strain of Saccharomyces cerevisiae (Linden et al., 1992). This resulted in strain\ C5LTe1201 .

Construction of C5LTe1202 containing YlpTTR and pC5e0024

Plasmid pC5e0024 was cleaved with restriction enzyme EcoRV within the URA3 gene and it was thereafter transformed into strain C5LTe1201. This resulted in strain C5LTe1202. Example 12: Preparation of plasmid pC50042 and transformant C5LTe1212

Generation of fragments

YME2-PGK1 t was synthesized and assembled by Eurofins MWG Operon (Ebersberg, Germany) and that the harboring plasmid was named pEX-A1 -YME2. The fragment YME2- PGK1 t was generated by PCR using the plasmid pEX-A1 -YME2 as template and the following primers (Eurofins MWG Operon):

5'-GATCCCCGGGCTGCAATGTTGCCCATTTCTGGACCTT-3' (SEQ ID NO: 45)

5'-CGCTGCAGGTCGACGTGTTACATGCGTACACGCGTCT-3' (SEQ ID NO: 46)

The vector pUG6-HXT7'p was generated by PCR using the plasmid pUG6-HXT-PGM2 (WO2010/059095 A1 ) as template and the following primers (Eurofins MWG Operon):

5'-CGTCGACCTGCAGCGTAC -3' (SEQ ID NO: 47)

5'-TGCAGCCCGGGGATCCTTTTT-3' (SEQ ID NO: 48)

The three PCR products were purified using QIAquick Gel Extraction Kit (Qiagen, Venlo, Netherlands) and DNA concentration was determined.

SEQ ID NO: 49 shows the nucleic acid sequence for codon-optimized YME2 tolerance gene, whereas SEQ ID NO: 50 shows the encoded amino acid sequence.

In-Fusion cloning and E.coli transformation

Fragment YME2-PGK1 t and vector pUG6-HXT7'p were introduced into NEB5a E. coli competent cells (New England Biolabs, Ipswich MA, USA) using In-Fusion HD Cloning Kit (Clontech, Mountain View CA, USA) following the manufacturer instructions procedure. Transformants were selected on LB agar plates containing 100 μg/mL ampicillin and incubated overnight.

Four colonies were randomly selected and growth overnight on LB containing 100 μg/mL ampicillin. Plasmid DNA was prepared using the GeneJET Plasmid MiniPrep Kit (Thermo Scientific, Walthman MA, USA) by following the manufacturer instructions and concentration was determined.

Plasmid DNA was digested with restriction enzymes Xhol and Smal (Thermo Scientific) by following the manufacturer instructions. All the evaluated clones containing the YME2- plasmid displayed the expected size fragments (1904 and 3105 bp) and one of them was chosen to be sequenced (Eurofins MWG Operon) using the following primers:

5'-CCTGCGTGTTCTTCTGAGGTTC-3' (SEQ ID NO: 21 )

5'-ATATTGTCGTTAGAACGCGG-3' (SEQ ID NO: 22)

The obtained sequence was as predicted when using in silico cloning tools and the plasmid was named pC5e0042.

Genomic integration of linearized plasmid pC50042 into C5LTe1202

5 μg of plasmid was linearized with Bmrl (New England Biolabs) following the manufacturer instructions procedure and kept on ice until used in the transformation system.

Strain C5LTe1202 was transformed using a lithium acetate-based method (Gietz and Schiestl, 2007) and 20 μΙ_ of linearized pC50042 were used in the transformation system. YPD agar plates supplemented with 20g/L glucose and 500 mg/L geneticin were incubated aerobically at 30°C and transformant C5LTe1212 was selected after 2 days.

Example 13: Toxicity of formic acid on yeast growing on xylose

The inhibitory effect of acetic and formic acids on cells growing on xylose was initially demonstrated with strain C5LTe1202.

Mineral medium supplemented with 1 10 g/L of xylose, 50 mM potassium phthalate buffer and containing the concentrations (in g/L) of single inhibitors indicated in Fig. 9. Anaerobic growth was followed in Multiskan FC (Thermo Scientific) at 30°C and growth was measured as increase of OD (620nm). The increment of OD after 60 hours of cultivation was chosen to estimate the tolerance to single lignocellulosic inhibitors, since no exponential growth can be observed when using initial high concentration of xylose.

Example 14: Fermentation in the presence of acetic acid

Cells aerobically grown overnight on YNB medium supplemented with 20 g/L glucose were used to inoculate a serum flask containing 30 mL of YNB medium supplemented with glucose (20 g/L) and xylose (50 g/L) as carbon source and acetic acid (8 g/L) at initial concentration of 1 g cell dry weight/L.

Concentrations of glucose, xylose, ethanol, glycerol, xylitol and acetic acid were determined by high performance liquid chromatography (Waters, Milford, MA, USA). The compounds were separated with a Shodex SUGAR SP0810 Pb2+ copolymer-based column (Showa Denko America, NY, USA), or a Rezex H+ column, preceded by a Micro-Guard Carbo- C guard column (Bio-Rad, Hercules, CA, USA). Separation was performed at 80°C, with H20 at a flow rate of 0.6 ml min-1 as mobile phase. Compounds were quantified by refractive index detection (Waters). A seven-point calibration curve was made for each compound to calculate 5 concentrations.

Results are presented graphically in Fig. 10, Fig. 1 1 , and Fig. 12. In conclusion, more xylose was consumed, and more ethanol was produced, in strain C5LTe1212 expressing the YME2 gene. i o Example 15: Fermentation and growth characteristics in the presence of formic acid

Cells aerobically grown overnight on YNB medium supplemented with 20 g/L glucose were used to inoculate a serum flask containing 30 mL of YNB medium supplemented with glucose (20 g/L) and xylose (50 g/L) as carbon source and formic acid (4.5 g/L) at initial concentration of 1 g CDW/L. Glucose, xylose, ethanol, xylitol, glycerol and acetate were 15 analysed using HPLC (Waters) using the same procedure as in Example 7.

Results are shown graphically in Fig. 13, Fig. 14, and Fig. 15. In conclusion, the more xylose was consumed, and more ethanol was produced, in strain C5LTe1212 expressing the YME2 gene.

In another tolerance experiment, cells grown overnight on YNB medium supplemented 20 with 50 g/L xylose were used to inoculate a microtiter plate containing YNB medium supplemented with 1 10 g/L xylose as carbon source and 4.5 g/L formic acid. Fig. 16 shows a graphic representation of the anaerobic growth characteristics (measured as change in OD at 620 nm) of strains C5LTe1202 (upper curve) and C5LTe1212 (lower curve).

To conclude, overexpression of the YME2 gene clearly improves tolerance towards 25 both formic acid and acetic acid. Xylose fermentation in presence of formic or acetic acid was improved in yeast overexpressing YME2. Specifically, xylose consumption and ethanol production increased by 13% and 12%, respectively, in the presence of acetic acid, and by 7% and 12%, respectively, in the presence of formic acid. Formic acid is common in lignocellulosic hydrolysates, and strongly contributes to the toxicity of such hydrolysates.

30

Example 16: Construction and selection of MC strains

For this Example, certain S. cerevisiae genes coding for enzymes in the main pathways of the central metabolism were tested.

Yeast strain C5LTe1 101 was constructed by transforming yeast strain TMB 3043 35 (Karhumaa et al. 2005) in its ura3 locus with a DNA fragment containing URA3 gene,TDH3p- XYL1 (N272D)-ADH1 t, and PGK1 p-XYL2-PGK1 t. Selection was made on YNB (Yeast nitrogen base 6,7 g/l and 20 g/l glucose, 20 g/l agar) plates supplemented with 200 mg/L leucine. To C5LTe1 101 were transformed DNA fragments obtained with PCR with primers shown in Table 7 together with the plasmid p245GPD (Mumberg et al., 1995) linearized with Smal. A control strain was constructed by transforming with unlinearized plasmid p245GPD. Selection was made on YNB agar plates with 20 g/l glucose as a carbon source. Selected colonies were tested by colony PCR and clones with PCR-products indicating right size of plasmid insert were chosen.

Table 7. Genes Included:

Strain Gene Forward primer (5' -> 3') (SEQ ID NO:) Reverse primer (5' -> 3') (SEQ ID NO:)

MC1 GLK1 CTAGAACTAGTGGATCCCCCATGTCA ATATCG AATTCCTG CAG CCCTCATG CT

TTCGACGACTTACACAAAG (63) ACAAGCGCACACAA (64)

MC2 PGM CTAGAACTAGTGGATCCCCCATGTCC ATATCGAATTCCTGCAGCCCTCACATC

AATAACTCATTCACTAACTTCA (65) CATTCCTTGAATTG (66)

MC3 PFK2 CTAGAACTAGTGGATCCCCCATGTTC ATATCGAATTCCTGCAGCCCTTAAACG 6 AAACCAGTAGACTTCTCTGA (67) TG ACTTTG G CTG C (68)

MC4 PFK2 CTAGAACTAGTGGATCCCCCATGACT ATATCGAATTCCTGCAGCCCTTAATCAA

GTTACTACTCCTTTTGTGAATG (69) CTCTCTTTCTTCCAACC (70)

MC5 PFK2 CTAGAACTAGTGGATCCCCCATGGG ATATCG AATTCCTG CAG CCCTCAAG CA 7 TG GTTCTTCCG ATTCA (71 ) AATCCGTTGCTTTC (72)

MC6 FBA1 CTAGAACTAGTGGATCCCCCATGGG ATATCGAATTCCTGCAGCCCTTAATCAA

TGTTGAACAAATCTTAAAGAG (73) CTCTCTTTCTTCCAACC (74)

MC7 TDH1 CTAGAACTAGTGGATCCCCCATGATC ATATCG AATTCCTG CAG CCCTTAAG CC

AGAATTGCTATTAACGGTTTC (75) TTGGCAACATATTCG (76)

MC8 TDH2 CTAGAACTAGTGGATCCCCCATGGTT ATATCG AATTCCTG CAG CCCTTAAG CC

13 AGAGTTGCTATTAACGGTTTC (77) TTGGCAACGTGTT (78)

MC1 1 GPM CTAGAACTAGTGGATCCCCCATGCCA ATATCGAATTCCTGCAGCCCTTATTTCT 1 AAGTTAGTTTTAGTTAGACACG (79) TACCTTGGTTGGCAAC (80)

MC12 GPM CTAGAACTAGTGGATCCCCCATGACT ATATCG AATTCCTG CAG CCCTTAAG G A 2 GCAAGCACACCATCCAAT (81 ) TTTTTTATGAAACCCTCA (82)

MC13 GPM CTAGAACTAGTGGATCCCCCATGACT ATATCGAATTCCTGCAGCCCTCATGGA

3 GTTACTGACACTTTTAAACTG (83) TTCTTTTCGAAACCC (84)

MC14 EN01 CTAGAACTAGTGGATCCCCCATGGG ATATCGAATTCCTGCAGCCCTTATAATT

TGTCTCTAAAGTTTACGC (85) TGTCACCGTGGTGG (86)

MC15 PYK1 CTAGAACTAGTGGATCCCCCATGTCT ATATCGAATTCCTGCAGCCCTTAAACG

AGATTAGAAAGATTGACCTCA (87) GTAGAGACTTGCAAAGTG (88)

MC16 PYK2 CTAGAACTAGTGGATCCCCCATGCCA ATATCGAATTCCTGCAGCCCCTAGAAT

GAGTCCAGATTGCA (89) TCTTGACCAACAGTAGAAATG (90)

MC17 PDC1 CTAGAACTAGTGGATCCCCCATGTCT ATATCG AATTCCTG CAG CCCTTATTG C

GAAATTACTTTGGGTAAA (91 ) TTAGCGTTGGTAGCAG (92)

MC18 PDC6 CTAGAACTAGTGGATCCCCCATGTCT ATATCGAATTCCTGCAGCCCTTATTGTT

GAAATTACTCTTGGAAAATACT (93) TGGCATTTGTAGCGG (94)

MC19 ALD6 CTAGAACTAGTGGATCCCCCATGACT ATATCGAATTCCTGCAGCCCTTACAAC

AAG CTACACTTTG ACACTG C (95) TTAATTCTGACAGCTTTTAC (96)

MC20 ADH5 CTAGAACTAGTGGATCCCCCATGCCT ATATCGAATTCCTGCAGCCCTCATTTA

TCGCAAGTCATTCCT (97) GAAGTCTCAACAACATATC (98)

MC21 ADH6 CTAGAACTAGTGGATCCCCCATGTCT ATATCGAATTCCTGCAGCCCCTAGTCT

TATCCTG AG AAATTTG AAG GT (99) GAAAATTCTTTGTCGTAGCC (100)

MC22 TPI 1 CTAGAACTAGTGGATCCCCCATGGG ATATCGAATTCCTGCAGCCCTTAGTTT

TAGAACTTTCTTTGTCGG (101 ) CTAGAGTTGATGATATCAACA (102)

MC23 GPD1 CTAGAACTAGTGGATCCCCCATGTCT ATATCGAATTCCTGCAGCCCCTAATCTT

G CTG CTG CTG ATAG ATT (103) CATGTAGATCTAATTCTTCA (104)

MC24 GPD2 CTAGAACTAGTGGATCCCCCATGCTT ATATCGAATTCCTGCAGCCCCTATTCG

G CTGTC AG AAG ATTAACA (105) TCATCGATGTCTAGCTCT (106)

MC25 HOR CTAGAACTAGTGGATCCCCCATGGG ATATCGAATTCCTGCAGCCCTTACCATT 2 ATTGACTACTAAACCTCTATCT (107) TCAACAGATCGTCC (108) MC26 SNF3 CTAGAACTAGTGGATCCCCCATGGAT ATATCGAATTCCTGCAGCCCTTATTTCA

CCTAATAGTAACAGTTCTAGCG (109) AATC ATTATTTTCATTTAC AG GTTG

(1 10)

MC27 RGT2 CTAGAACTAGTGGATCCCCCATGAAC ATATCG AATTCCTG CAG CCCTTATTG G

G ATAG CC AAAACTG C (1 1 1 ) GGGGAAGTGTATTG (1 12)

MC28 MIG1 CTAGAACTAGTGGATCCCCCATGCAA ATATCGAATTCCTGCAGCCCTCAGTCC

AG CCCATATCCAATG (1 13) ATGTGTGGGAAGG (1 14)

MC29 STD1 CTAGAACTAGTGGATCCCCCATGTTT ATATCG AATTCCTG CAG CCCCTAG G AC

GTTTCACCACCTCCA (1 15) ATTCCATCAGGCTT (1 16)

MC30 ADH1 CTAGAACTAGTGGATCCCCCATGTCT ATATCGAATTCCTGCAGCCCTTATTTAG

ATCCCAG AAACTCAAAAAG G (1 17) AAGTGTCACAACGTATCTACC (1 18)

MC31 PGM CTAGAACTAGTGGATCCCCCATGAAC ATATCG AATTCCTG CAG CCCTTATTG G 1 G ATAG CC AAAACTG C (1 19) GGGGAAGTGTATTG (120)

MC32 PGM CTAGAACTAGTGGATCCCCCATGTTG ATATCGAATTCCTGCAGCCCTCAAAATT

3 CAAG G AATTTTAG AAACCG (121 ) TTGTAACTATATTCATTTCATCTG ( 122 )

MC33 GAL3 CTAGAACTAGTGGATCCCCCATGAAT ATATCGAATTCCTGCAGCCCTTATTGTT

ACAAACGTTCCAATATTCAG (123) CGTACAAACAAGTACCC (124)

MC34 GAL1 CTAGAACTAGTGGATCCCCCATGACT ATATCGAATTCCTGCAGCCCTTATAATT

AAATCTCATTCAGAAGAAGTGA (125) CATATAG ACAG CTG CCCA (126)

MC35 GAL4 CTAGAACTAGTGGATCCCCCATGAA ATATCGAATTCCTGCAGCCCTTACTCTT

GCTACTGTCTTCTATCGAACAAG TTTTTGGGTTTGGTGG (128)

(127)

MC36 GAL1 CTAGAACTAGTGGATCCCCCATGACT ATATCGAATTCCTGCAGCCCTTACAGT

G CTG AAG AATTTG ATTTTTC (129) CTTTGTAGATAATGAATCTGACC (130)

The nucleic acid and amino acid sequences of EN01 are shown in SEQ ID NOs: 131 and 132, respectively. The nucleic acid and amino acid sequences of PFK2 are shown in SEQ ID NOs: 133 and 134, respectively. The nucleic acid and amino acid sequences of PFK26 are shown in SEQ ID NOs: 135 and 136, respectively. The nucleic acid and amino acid sequences 5 of PGI1 are shown in SEQ ID NOs: 137 and 138, respectively. The nucleic acid and amino acid sequences of GMP1 are shown in SEQ ID NOS: 139 and 140, respectively. The nucleic acid and amino acid sequences of TPI 1 are shown in SEQ ID NOs: 141 and 142, respectively.

Example 17: Aerobic growth on xylose

i o Aerobic growth on xylose was measured in mineral medium (Yeast nitrogen base, 13.4 g/l, xylose 50 g/l) in 5 ml cultures in 50 ml falcon tubes. Samples were taken and OD at 620 nm was measured. See Fig. 17: For clarity result is presented as normalized OD, against average of all results, for OD at time point of 48h. Values exceeding 1 indicate that the strain is better than the average of the strains in the experiment.

15

Example 18: Anaerobic growth on xylose

Yeast strains were analysed for anaerobic growth on xylose in 96-well microplate yeast cultures. Prior to experiments, yeast was grown in semi-aerobic precultures in YNB medium with 50 g/l xylose, in microplates, overnight. For measurement of anaerobic growth, cells were

20 inoculated in 200 μΙ of the same medium where solution of ergosterol and Tween80 had been added at final concentration of 0.03 and 1 .2 g/l, respectively. 50 μΙ mineral oil was added on top of each well to keep culture anaerobic. 20 μΙ of precultured cells were added. The growth was followed in Multiskan FC (Thermo Scientific) at 30°C and growth was measured as increase of OD (620nm). See Fig. 18: For clarity result is presented as normalized OD, against average of all results, for OD at time point of 80h. Values exceeding 1 indicate that the strain is better than the average of the strains in the experiment.

5

Example 19: Construction of an S. cerevisiae strain overexpressing an enolase, three of the genes in the pentose phosphate pathway and a xylulose kinase (XK) gene and expressing a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

i o In this example, the strains, media and genetic techniques described in above were used, except that yeast strains were transformed with the lithium acetate method (Gietz and Schiestl, 2007) and transformed yeast strains were selected on YPD plates containing 500 μg mL-1 geneticin (Gibco Invitrogen, Paisley, UK).

Construction of pC5e0057 containing a S. cerevisiae enolase (ΕΝΟ ) gene

15 Plasmid pUG6 HXT-PGM2 (WO2010 059095 (A1 )) contains the truncated HXT7' promoter and the PGK1 terminator from S. cerevisiae in pUG6. The DNA cassette HXT7' p- pUG6-PGK1 t was PCR amplified having as template pUG6 HXT-PGM2 and using the following primers

2_fwd:

20 5'-TGCAGCCCGGGGATCCTTTTT-3' (SEQ ID NO: 93)

2_rev:

5'-AGGAATTCTAGATCTCCCATGTCTCT-3' (SEQ ID NO: 94).

The entire S. cerevisiae enolase (EN01) gene was PCR amplified having as template genomic DNA from CEN.PK and using the following primers.

25 EN 1_fwd

5'-GATCCCCGGGCTGCAATGGCTGTCTCTAAAGTTTACGC-3' (SEQ ID NO: 95)

EN1_rev

5'-AGATCTAGAATTCCTTTATAATTTGTCACCGTGGTGG-3' (SEQ ID NO: 96)

The two DNA fragments were fused together by In-Fusion cloning (Clontech, California, 30 CA, USA). The resulting plasmid was named pC5e0057. Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing.

Construction of C5LTe1051 overexpressing the gene coding for ENQ1 and capable of growing on solely xylose

Plasmid pC5e0057 was digested with restriction enzyme Kpnl within the EN01 gene and it was thereafter transformed into strain C5LTe1035. This resulted in strain C5LTe1051 .

Example 20: Construction of an S. cerevisiae strain overexpressing a 6- phosphofructokinase, three of the genes in the pentose phosphate pathway and a xylulose kinase (XK) gene and expressing a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

In this example, the strains, media and genetic techniques described above were used, except that yeast strains were transformed with the lithium acetate method (Gietz and Schiestl, 2007) and transformed yeast strains were selected on YPD plates containing 500 μg mL-1 geneticin (Gibco Invitrogen, Paisley, UK).

Construction of pC5e0058 containing a S. cerevisiae 6-phosphofructokinase subunit beta (PFK2) gene

Plasmid pUG6 HXT-PGM2 (WO 2010/059095 (A1 )) contains the truncated HXT7' promoter and the PGK1 terminator from S. cerevisiae in pUG6. The DNA cassette HXT7'p- pUG6-PGK1 t was PCR amplified having as template pUG6 HXT-PGM2 and using the following primers.

2_fwd:

5'-TGCAGCCCGGGGATCCTTTTT-3' (SEQ ID NO: 97)

2_rev:

5'-AGGAATTCTAGATCTCCCATGTCTCT-3' (SEQ ID NO: 98)

The entire S. cerevisiae 6-phosphofructokinase subunit beta (PFK2) gene was PCR amplified having as template genomic DNA from CEN.PK and using the following primers. PF2_fwd

5'-GATCCCCGGGCTGCAATGACTGTTACTACTCCTTTTGTGAATG-3' (SEQ ID NO: 99) PF2_rev:

5'-AGATCTAGAATTCCTTTAATCAACTCTCTTTCTTCCAACC-3' (SEQ ID NO: 100)

The two DNA fragments were fused together by In-Fusion cloning (Clontech, California, CA, USA). The resulting plasmid was named pC5e0058. Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing.

Construction of C5LTe1052 overexpressing the gene coding for PFK2 and capable of growing on solely xylose

Plasmid pC5e0058 was digested with restriction enzyme Kpnl within the PFK2 gene and it was thereafter transformed into strain C5LTe1035, resulting in strain C5LTe1052. Example 21 : Construction of an S. cerevisiae strain overexpressing a glucose-6- phosphate isomerase, three of the genes in the pentose phosphate pathway and a xylulose kinase (XK) gene and expressing a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

In this example, the strains, media and genetic techniques described above were used, except that yeast strains were transformed with the lithium acetate method (Gietz and Schiestl, 2007) and transformed yeast strains were selected on YPD plates containing 500 μg mL-1 geneticin (Gibco Invitrogen, Paisley, UK). Construction of pC5e0060 containing a S. cerevisiae glucose-6-phosphate isomerase (PGM ) gene

Plasmid pUG6 HXT-PGM2 (WO2010/059095 (A1 )) contains the truncated HXT7' promoter and the PGK1 terminator from S. cerevisiae in pUG6. The DNA cassette HXT7'p-pUG6-PGK1t was PCR amplified having as template pUG6 HXT-PGM2 and using the following primers.

2_fwd:

5 -TGCAGCCCGGGGATCCTTTTT-3' (SEQ ID NO: 101 )

2_rev:

5 -AGGAATTCTAGATCTCCCATGTCTCT-3' (SEQ ID NO: 102)

The entire S. cerevisiae glucose-6-phosphate isomerase (PGM ) gene was PCR amplified having as template genomic DNA from CEN PK and using the following primers.

PG1_fwd:

5 -GATCCCCGGGCTG C AATGTCC AATAACTC ATTCACTAACTTCA-3 ' (SEQ ID NO: 103)

PG1_rev:

5 -AGATCTAGAATTCCTTCACATCCATTCCTTGAATTG-3' (SEQ ID NO: 104)

The two DNA fragments were fused together by In-Fusion cloning (Clontech, California, CA, USA). The resulting plasmid was named pC5e0060. Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing.

Construction of C5LTe1054 overexpressing the gene coding for PGI 1 and capable of growing on solely xylose

Plasmid pC5e0060 was digested with restriction enzyme Kpnl within the PGI 1 gene and it was thereafter transformed into strain C5LTe1035. This resulted in strain C5LTe1054.

Example 22: Construction of an S. cerevisiae strain overexpressing a 6-phosphofructo-2- kinase three of the genes in the pentose phosphate pathway and a xylulose kinase (XK) gene and expressing a Scheffersomyces stipitis xylitol dehydrogenase (XDH) gene and a mutated Scheffersomyces stipitis xylose reductase (XR(N272D)) gene

In this example, the strains, media and genetic techniques described above were used, except that yeast strains were transformed with the lithium acetate method (Gietz and Schiestl, 2007) and transformed yeast strains were selected on YPD plates containing 500 μg mL-1 geneticin (Gibco Invitrogen, Paisley, UK).

Construction of pC5e0060 containing a S. cerevisiae 6-phosphofructo-2-kinase (PFK26) gene Plasmid pUG6 HXT-PGM2 (WO2010/059095 A1 ) contains the truncated HXT7' promoter and the PGK1 terminator from S. cerevisiae in pUG6. The DNA cassette HXT7'p- pUG6-PGK1 t was PCR amplified having as template pUG6 HXT-PGM2 and using the following primers.

2_fwd:

5'-TGCAGCCCGGGGATCCTTTTT-3' (SEQ ID NO: 105)

2_rev:

5'-AGGAATTCTAGATCTCCCATGTCTCT-3' (SEQ ID NO: 106)

The entire S. cerevisiae 6-phosphofructo-2-kinase (PFK26) gene was PCR amplified having as template genomic DNA from CEN PK and using the following primers.

PF26_fwd:

5'-GATCCCCGGGCTGCAATGTTCAAACCAGTAGACTTCTCTGA-3' (SEQ ID NO: 107) PF26_rev:

5'-AGATCTAGAATTCCTTTAAACGTGACTTTGGCTGC-3' (SEQ ID NO: 108)

The two DNA fragments were fused together by In-Fusion cloning (Clontech, California, CA, USA). The resulting plasmid was named pC5e0061 . Correct orientations and sequences of the inserts were verified by restriction analysis and sequencing.

Construction of C5LTe1055 overexpressing the gene coding for PFK26 and capable of growing on solely xylose

Plasmid pC5e0061 was digested with restriction enzyme Kpnl within the PFK26 gene and it was thereafter transformed into strain C5LTe1035. This resulted in strain C5LTe1055.

Example 23: Anaerobic growth on xylose

Yeast strains were analysed for anaerobic growth on xylose in 96-well microplate yeast cultures. Prior to experiments, yeast was grown in semi-aerobic pre-cultures in YNB medium with 50 g/l xylose, in microplates, overnight. For measurement of anaerobic growth, cells were inoculated in 250 μΙ of the same medium where solution of ergosterol and Tween80® had been added at final concentration of 0.03 and 1 .2 g/l, respectively. 50 μΙ mineral oil was added on top of each well to keep culture anaerobic. 10 μ I of pre-cultured cells were added. The growth was followed in Multiskan FC (Thermo Scientific) at 30°C and growth was measured as increase of OD (620nm). Data is shown graphically in Fig. 19, Fig. 20, Fig. 21 , and Fig. 22. In conclusion, strains with the gene coding for EN01 or the gene coding for PFK2 or the gene coding for PGM or the gene coding for PFK26 grow faster and more under anaerobic conditions than the control strain without overexpression of any of these four genes.

Example 24: Serum flask fermentation

Fermentation procedure:

Cells aerobically grown overnight on YNB medium supplemented with 20 g/L glucose were used to inoculate a shake flask containing 50 mL of YNB medium supplemented with xylose (70 g/L) as carbon source. When the cells were growing exponentially they were used to inoculate a serum flask containing 30 mL of YNB medium supplemented with xylose (55 g/L) and a solution of ergosterol and Tween80 with final concentration of 0.03 and 1 .2 g/l, respectively. 7 ml mineral oil was added on top of each serum flask to keep culture anaerobic. Concentrations of xylose, ethanol, glycerol, and xylitol were determined by high performance liquid chromatography (Waters, Milford, MA, USA).

The results are shown in Table 8 and Fig. 23. Notably, all chosen strains carrying overexpressed genes consumed more xylose and produced more ethanol than the control strain without overexpressed genes.

Table 8.

Example 25: Preparation of strains carrying PGM1 and PGM3

Yeast strain C5LTe1 101 was constructed by transforming yeast strain TMB 3043 (Karhumaa et al. 2005) in its ura3 locus with a DNA fragment containing URA3 gene,TDH3p- XYL1 (N272D)-ADH1 t, and PGK1 p-XYL2-PGK1 t. Selection was made on YNB (Yeast nitrogen base 6,7 g/l and 20 g/l glucose, 20 g/l agar) plates supplemented with 200 mg/L leucine.

C5LTe1 101 was transformed with DNA fragments obtained with the following PGM1 - specific PCR primers

5'-CTAGAACTAGTGGATCCCCCATGAACGATAGCCAAAACTGC-3' (SEQ ID NO: 143) 5'-ATATCGAATTCCTGCAGCCCTTATTGGGGGGAAGTGTATTG-3' (SEQ ID NO: 144) and with DNA fragments obtained with the following PGM3-specific PCR primers

5'-CTAGAACTAGTGGATCCCCCATGTTGCAAGGAATTTTAGAAACCG-3' (SEQ ID NO: 145)

5'-ATATCGAATTCCTGCAGCCCTCAAAATTTTGTAACTATATTCATTTCATCTG-3 ' (SEQ ID NO: 146)

together with the plasmid p245GPD (Mumberg et al) linearized with Smal. Selection was made on YNB agar plates 5 with 20 g/l glucose as a carbon source. Selected colonies were tested by colony PCR and clones with PCR-products indicating right size of plasmid insert were chosen.

The nucleic acid and amino acid sequences of PGM1 are shown as SEQ I D NOs: 147 and 148, respectively. The nucleic acid and amino acid sequences of PGM3 are shown as SEQ ID NOs: 149 and 150, respectively.

Example 26: Anaerobic growth of strains carrying PGM1 and PGM3

Yeast strains obtained in Example 25 were analysed for anaerobic growth on xylose in 96-well microplate yeast cultures. Prior to experiments, yeast was grown in semi-aerobic pre- cultures in YNB medium with 50 g/l xylose, in microplates, overnight. For measurement of anaerobic growth, cells were inoculated in 250 μΙ of the same medium where a solution of ergosterol and Tween80 had been added at final concentration of 0.03 and 1 .2 g/l, respectively. 50 μΙ mineral oil was added on top of each well to keep the cultures anaerobic. 10 μΙ of pre-cultured cells were added. The growth was followed in Multiskan FC (Thermo Scientific) at 30°C and growth was measured as increase of OD (620nm). The growth curves are presented in Fig. 24.

Example 27: Small scale fermentation of strains carrying PGM1 and PGM3

Cells aerobically grown overnight on YNB medium supplemented with 20 g/L glucose were used to inoculate a serum flask containing 30 mL of YNB medium supplemented with glucose (20 g/L) and xylose (50 g/L) as carbon source and formic acid (4.5 g/L) at initial concentration of 1 g CDW/L.

Concentrations of glucose, xylose, ethanol, glycerol, and xylitol were determined by high performance liquid chromatography (Waters, Milford, MA, USA). The compounds were separated with a Shodex SUGAR SP0810 Pb2+ copolymer-based column (Showa Denko America, NY, USA) preceded by a Micro-Guard Carbo-C guard column (Bio-Rad, Hercules, CA, USA). Separation was performed at 80°C, with H20 at a flow rate of 0.6 ml min-1 as mobile phase. Compounds were quantified by refractive index detection (Waters). A seven- point calibration curve was made for each compound to calculate concentrations. Results are presented graphically in Fig. 25 (xylose consumption) and Fig. 26 (ethanol production) as well as in Table 4.

In conclusion, these data show that strains carrying PGM1 or PGM3 consumed more xylose and produced more ethanol than the control strain. REFERENCES

All references cited herein are incorporated by reference in their entirety. To the extent publications and patents or patent applications incorporated by reference contradict the invention contained in the description, the latter supersedes any contradictory material. Ausubel et al., Short Protocols in Molecular Biology, 3rd ed., Wiley & Sons, 1995.

Chang and Ho, Appl Biochem Biotechnol. 1988, 17, 313-318.

Eliasson et ai, Enzyme and Microbial Technology (2001 ) 29:288-297.

Gietz and Schiestl, Nature protocols (2) (2007) 31 -34.

Hou, Appl Microbiol Biotechnol (2012) 94:205-214.

Inoue et ai, Gene 96 (1 ) (1990) 23-28.

Jin et al., App Environmental Microbiol (2003) 69(1 ):495-503.

Johansson and Hahn-Hagerdal, Yeast 19 (3) (2002) 225-231 .

Johansson et ai, Appl Environmental Microbiol (2001 ) 67(9):4249-4255.

Karhumaa et ai, Microb Cell Fact. (2007) 5;6:5.

Karhumaa et al., Yeast (2005) 22(5):359-68.

Karhumaa et al., Yeast 2009; 26(7):371 -82.

Kim et ai., Biotechnol Adv. (2013) 31 (6):851-61.

Kuyper et ai. FEMS Yeast Res. (2005) 5(4-5):399-409.

Linden T, Peetre J, Hahn-Hagerdal B. (1992) Appl Environ Microbiol.58(5):1661 -9.

Lonn et al., Enzyme Microb. Technol. 32 (5) (2003) 567-573.

Madhavan et ai, Appl Microbiol Biotechnol. 2009, 82(6), 1067-1078.

Matsushika et al, Enzyme and Microbial Technology (201 1 ) 48:466-471 .

Mollapour M, Piper PW. Mol Microbiol. (2001 ) 42(4):919-30.

Mumberg D, Miiller R, Funk M. Gene. 1995 Apr 14;156(1 ):1 19-22.

Needleman and Wunsch, J. Mol. Biol. (1970), 48:443-453.

Nguyen et ai, Mycological Res (2006) 1 10:1232-1241 .

Parachin et ai, Metabolic Engineering (201 1 ) 13:508-517.

Rice et al., Trends Genet 2000, 16:276-277.

Runquist et ai, Appl. Environ. Microbiol. 76 (23) (2010) 7796-7802.

Shamanna and Sanderson, J. Bacteriol. 139 (1979) 64-70.

Tiwari et ai, Biochemical and Biophysical Research Communications 366 (2008) 340-345. Walther et al. FEBS Lett. 2012 Nov 30;586(23):41 14-8.

Wohlbach et al., Proc Natl Acad Sci U S A (201 1 ) 108(32):13212-7).

Xu et al., Mol Syst Biol 9 (2013) 665.

US 2009/0246857

US 2012/270289 A1

WO 2009/017441

WO 2010/059095

WO 2012/1351 10

WO 95/13362

SEQUENCES

SEQ ID NO: 5 - Artificial Spathaspora passalidarum XK gene

Artificial Spathaspora passalidarum XK gene with TPI1 promoter before the ATG-start codon and PGK1 terminator after the stop-codon, and coding region based on the amino acid sequence from NCBI Accession Code XP_0073731 12:

TATATCTAGGAACCCATCAGGTTGGTGGAAGATTACCCGTTCTAAGACTTTTCAGCT TCCTCTATTGA TGTTACACCTGGACACCCCTTTTCTGGCATCCAGTTTTTAATCTTCAGTGGCATGTGAGA TTCTCCGA AATTAATTAAAGCAATCACACAATTCTCTCGGATACCACCTCGGTTGAAACTGACAGGTG GTTTGTTA CGCATGCTAATGCAAAGGAGCCTATATACCTTTGGCTCGGCTGCTGTAACAGGGAATATA AAGGGCAG CATAATTTAGGAGTTTAGTGAACTTGCAACATTTACTATTTTCCCTTCTTACGTAAATAT TTTTCTTT TTAATTCTAAATCAATCTTTTTCAATTTTTTGTTTGTATTCTTTTCTTGCTTAAATCTAT AACTACAA AAAACACATACATAAACTAAAAATGACAGTAGAACTACCCGCTTCAGAACCTTTGTTTCT TGGGTTTG ATCTTAGCACTCAACAGTTGAAAATCATAGTGACAAACCAGAAATTAGCTGCACTAAAAT CTTACAAC GTTGAATTCGATGTGGCATTTAAGGAGAAATATGGGATAACCAAGGGAGTCCTAACGAAC AAAGAGGA CGGAGAAGTGGTTTCTCCAGTTGGTATGTGGTTAGATTCCATAAACCATGTATTCGACCA AATGAAAC AAGATGACTTTCCGTTCAATCAAGTTGCAGGCATTTCAGGCTCTTGTCAACAACATGGTT CTGTGTTT TGGTCACATGAAGCTGAGAAGCTTTTATCAGGTTTACAGAAGGATCAAGATCTGTCGACT CAACTAAA GGACGCTTTATCTTGGGACAAAAGTCCCAATTGGCAAGATCATTCGACTTTAGAGGAAAG TAAGGCTT TCGTAGATGCTGTAGGGAGGGAAGAGTTAGCCGATATTACTGGTAGTAGAGATCACTTAA GATTCACT GGATTGCAAATTAGGAAGTTTGCCACTAGATCACATCCCGATAAGTATGCGAATACTAGT AGAATCTC ACTGGTTAGCTCCTTCATAACAAGTGTTCTTCTGGGTGAGATTACCGAATTGGAAGAATC TGATGCTT GTGGCATGAACTTGTATGACATCAAAGCCGGTGATTTCAATGAAGAATTGTTGGCTCTAG CAGCCGGT GTTCATCCTAAAGTTGACAACATAACGAAAGATGATCCGAAATATAAAGCCGGAATTGAG GACATCAA AGCGAAACTTGGGAAGATCTCCCCAATTACATATAAAAGCTCCGGATCCATTGCTTCATA TTACGTTG AAAAGTACGGTTTGAATCCTAAGTGCCAGATTTACAGCTTTACCGGTGACAATTTGGCTA CAATCTTG AGTTTGCCATTACAGCCTAACGATTGCTTGATTTCGTTAGGTACTTCGACTACGGTCTTG CTAATCAC TAAGAATTACCAACCTTCTTCTCAATATCACTTGTTTAAGCATCCAACCATACCAGATGG ATATATGG GCATGATCTGCTATTGCAATGGCTCTTTGGCCAGAGAGAAGATAAGAGATGAAGTTAATG AATACTAT AAGGTGGAAGATAAGAAGAGTTGGGATAAATTTAGTGAAATTCTGGATAAGTCGACCAAA TTCGATAA TAAGCTGGGTATTTTCTTTCCGTTAGGTGAAATCGTTCCACAGGCAAAGGCACAAACTGT CAGAGCAG TATTGGAGAATGACAAAGTCATAGAAGTAGGTTTGGATACACACGGATTTGATATTGATC ACGACGCA AGAGCTATTGTCGAAAGCCAAGCCTTATCTTGTAGACTTAGAGCTGGCCCTATGTTATCC AAATCATC ACGTGCTTCCGTCACATCACCAACGGAGTTAAAAGGCGTATACCATGACATAGTGGCCAA ATATGGTG ACCTGTACACAGATGGTAAACAACAAACCTATGAGTCACTTACATCTAGGCCAAATCGTT GTTTCTAT GTTGGCGGCGGGAGCAATAACATTTCCATCATTAGTAAAATGGGTTCTATTCTAGGTCCT GTTCACGG TAACTTTAAAGTCGATATTCCAAACGCGTGTTCTCTAGGTGGAGCATACAAAGCATCCTG GTCTTATG AATGTGAACAGAAAGGTGAATGGATTAATTATGACCAATACATAAATCAGTTATTGAAAG AATTGAAG TCATTTAACGTGGAGGACAAATGGTTAGAATACTTTGATGGAGTTGCGCTTTTGGCTAAG ATGGAAGA AACCCTGTTGAAATAAAGATCTCCCATGTCTCTACTGGTGGTGGTGCTTCTTTGGAATTA TTGGAAGG TAAGGAATTGCCAGGTGTTGCTTTCTTATCCGAAAAGAAATAAATTGAATTGAATTGAAA TCGATAGA TCAATTTTTTTCTTTTCTCTTTCCCCATCCTTTACGCTAAAATAATAGTTTATTTTATTT TTTGAATA TTTTTTATTTATATACGTATATATAGACTATTATTTATCTTTTAATGATTATTAAGATTT TTATTAAA AAAAAATTCGCTCCTCTTTTAATGCCTTTATGCAGTTTTTTTTTCCCATTCGATATTTCT ATGTTC

SEQ ID NO: 6 - Spathaspora passalidarum XK

MTVELPASEPLFLGFDLSTQQLKI IVTNQKLAALKSYNVEFDVAFKEKYGITKGVLTNKEDGEVVSPV GMWLDSINHVFDQMKQDDFPFNQVAGISGSCQQHGSVFWSHEAEKLLSGLQKDQDLSTQL KDALSWDK SPNWQDHSTLEESKAFVDAVGREELADITGSRDHLRFTGLQIRKFATRSHPDKYANTSRI SLVSSFIT SVLLGEITELEESDACGMNLYDIKAGDFNEELLALAAGVHPKVDNITKDDPKYKAGIEDI KAKLGKIS PITYKSSGSIASYYVEKYGLNPKCQIYSFTGDNLATILSLPLQPNDCLISLGTSTTVLLI TKNYQPSS QYHLFKHPTI PDGYMGMICYCNGSLAREKIRDEVNEYYKVEDKKSWDKFSEILDKSTKFDNKLGI FFP LGEIVPQAKAQTVRAVLENDKVIEVGLDTHGFDI DHDARAIVESQALSCRLRAGPMLSKSSRASVTSP TELKGVYHDIVAKYGDLYTDGKQQTYESLTSRPNRCFYVGGGSNNI SI I SKMGSILGPVHGNFKVDI P NACSLGGAYKASWSYECEQKGEWINYDQYINQLLKELKSFNVEDKWLEYFDGVALLAKME ETLLK SEQ ID NO: 17 - Artificial Escherichia coli xylulose kinase gene

Artificial Escherichia coli xylulose kinase gene with TPI 1 promoter before the ATG-start codon and the PGK1 terminator after the stop-codon, and coding region based on the amino acid sequence from NCBI Accession Code YP_001460359:

TATATCTAGGAACCCATCAGGTTGGTGGAAGATTACCCGTTCTAAGACTTTTCAGCT TCCTCTATTGA TGTTACACCTGGACACCCCTTTTCTGGCATCCAGTTTTTAATCTTCAGTGGCATGTGAGA TTCTCCGA AATTAATTAAAGCAATCACACAATTCTCTCGGATACCACCTCGGTTGAAACTGACAGGTG GTTTGTTA CGCATGCTAATGCAAAGGAGCCTATATACCTTTGGCTCGGCTGCTGTAACAGGGAATATA AAGGGCAG CATAATTTAGGAGTTTAGTGAACTTGCAACATTTACTATTTTCCCTTCTTACGTAAATAT TTTTCTTT TTAATTCTAAATCAATCTTTTTCAATTTTTTGTTTGTATTCTTTTCTTGCTTAAATCTAT AACTACAA AAAACACATACATAAACTAAAAATGTATATCGGCATTGATTTGGGTACTTCTGGCGTAAA GGTTATCC TGCTGAATGAACAGGGTGAAGTGGTTGCCTCACAAACGGAAAAGTTGACTGTATCTAGGC CACATCCT TTGTGGAGCGAACAAGATCCAGAACAGTGGTGGCAAGCTACAGATAGAGCAATGAAAGCG TTAGGTGA CCAGCATTCCTTACAGGACGTTAAAGCCTTGGGGATTGCTGGCCAAATGCATGGTGCGAC ACTGCTTG ATGCCCAACAAAGGGTCTTAAGGCCTGCAATACTGTGGAATGATGGACGTTGTGCTCAGG AGTGTACC TTATTGGAAGCAAGAGTGCCTCAATCCAGGGTGATAACCGGTAACTTGATGATGCCTGGA TTTACAGC CCCAAAATTGTTATGGGTTCAAAGACACGAACCAGAGATCTTCCGTCAAATCGACAAGGT CTTATTAC CGAAGGACTACTTGAGACTACGTATGACTGGTGAATTCGCTTCAGACATGAGTGACGCAG CAGGAACC ATGTGGTTGGATGTCGCGAAAAGAGATTGGAGTGACGTTATGTTACAAGCTTGCGATCTA TCTAGAGA TCAAATGCCAGCTCTGTATGAGGGCTCAGAAATTACCGGTGCATTATTACCTGAAGTCGC TAAAGCAT GGGGTATGGCTACTGTCCCAGTTGTTGCCGGTGGTGGTGACAATGCCGCAGGAGCTGTTG GAGTTGGT ATGGTGGATGCAAATCAAGCGATGTTGTCTCTTGGCACATCAGGCGTCTATTTTGCCGTA TCGGAAGG GTTTCTGTCGAAACCAGAATCAGCCGTACATTCCTTTTGTCACGCTCTTCCACAAAGATG GCATCTAA TGAGCGTGATGCTTTCTGCAGCATCATGCTTGGATTGGGCCGCTAAATTGACGGGTTTGA GTAATGTT CCGGCACTTATAGCAGCTGCACAACAAGCAGATGAAAGTGCTGAACCCGTTTGGTTCTTG CCCTATCT TTCCGGAGAGAGAACACCACACAACAATCCTCAAGCCAAAGGTGTGTTCTTTGGGTTAAC TCACCAAC ATGGTCCAAACGAATTGGCGAGAGCAGTATTGGAAGGAGTAGGGTATGCTCTTGCTGATG GTATGGAT GTTGTCCATGCATGTGGCATAAAGCCGCAATCTGTTACGCTTATTGGAGGTGGTGCCAGA AGCGAATA CTGGAGACAAATGTTAGCCGATATTTCCGGTCAACAACTAGACTACAGAACAGGAGGCGA TGTAGGGC CAGCTTTGGGTGCTGCTAGATTGGCTCAGATTGCTGCTAACCCTGAGAAGTCGTTGATTG AGCTACTA CCTCAGTTACCCTTAGAACAGTCTCATCTACCAGATGCCCAGAGATATGCTGCGTACCAA CCTAGAAG AGAGACTTTTCGTAGGTTATACCAGCAATTACTACCCTTGATGGCGTAAAGATCTCCCAT GTCTCTAC TGGTGGTGGTGCTTCTTTGGAATTATTGGAAGGTAAGGAATTGCCAGGTGTTGCTTTCTT ATCCGAAA AGAAATAAATTGAATTGAATTGAAATCGATAGATCAATTTTTTTCTTTTCTCTTTCCCCA TCCTTTAC GCTAAAATAATAGTTTATTTTATTTTTTGAATATTTTTTATTTATATACGTATATATAGA CTATTATT TATCTTTTAATGATTATTAAGATTTTTATTAAAAAAAAATTCGCTCCTCTTTTAATGCCT TTATGCAG TTTTTTTTTCCCATTCGATATTTCTATGTTC SEQ ID NO: 18 - Escherichia co// xylulose kinase

MYIGIDLGTSGVKVILLNEQGEVVASQTEKLTVSRPHPLWSEQDPEQWWQATDRAMKALG DQHSLQDV KALGIAGQMHGATLLDAQQRVLRPAILWNDGRCAQECTLLEARVPQSRVITGNLMMPGFT APKLLWVQ RHEPEI FRQI DKVLLPKDYLRLRMTGEFASDMSDAAGTMWLDVAKRDWSDVMLQACDLSRDQMPALYE GSEITGALLPEVAKAWGMATVPVVAGGGDNAAGAVGVGMVDANQAMLSLGTSGVYFAVSE GFLSKPES AVHSFCHALPQRWHLMSVMLSAASCLDWAAKLTGLSNVPALIAAAQQADESAEPVWFLPY LSGERTPH NNPQAKGVFFGLTHQHGPNELARAVLEGVGYALADGMDVVHACGIKPQSVTLIGGGARSE YWRQMLAD I SGQQLDYRTGGDVGPALGAARLAQIAANPEKSLIELLPQLPLEQSHLPDAQRYAAYQPRR ETFRRLY QQLLPLMA SEQ ID NO: 21 - Kluyveromyces marxianus xylulose kinase gene Artificial Kluyveromyces marxianus xylulose kinase gene, with coding region based on the amino acid sequence from NCBI Accession Code ADW23548:

ATGTCGACTCCATATTACTTAGGCTTTGATTTGTCAACTCAGCAGTTGAAATGTCTA GCAATAGATGA TCAGTTAAACATCGTGACTTCAGTTAGCATTGAATTTGACCGTAATTTCCCAGCTTACAA CACAAAGA AAGGGGTATACATCAAGAATGGTGGTGTGATAGACGCACCAGTTGCTATGTGGTTAGAAG CTGTTGAT TTATGCTTTAGTCAGCTTGCTGAAAGAATCGACTTGAAGAGAGTTCAATCAATGTCAGGT TCTTGCCA ACAACATGGCACCGTCTACTGGAACTGTGAGCATCTACCAAGTAACCTTGATCCTGCTTC AACCTTGA GAGAGCAACTTCAAGGCAGTTTATCAAGACCAGTTGCACCCAATTGGCAAGATCATTCCA CCAAGAAA CAATGTGATGAATTGGCAGAATCGGTAGGAGGACCTGAAGAACTAGCAAGGATTACAGGT TCTGGAGC ACACTATAGATTTTCCGGTTCCCAAATTGCCAAAATCCATGAAACTGAACCTGAAGTCTA TGAGGCTA CTAAACGTATTTCGTTGGTAAGTAGCTTTCTAGCGTCTGTTTTAGTAGGTGACATTGTGC CCTTGGAA GAAGCGGATGCTTGTGGCATGAACTTATACGATCTATCCAAACACGACTTTGACGAAACA TTACTGGC TGTCGTTGACCATGATACAGCGAGATTGAGAAGGAAACTATCAGATCCACCCGTTGGAGC TCCTACCA GAGAAAGTCCTCTGACCTCCTTGGGTAAAGTCTCTAAGTACTTTCAGGACAAATATGGGG TTAACTGT GAATGTGAGATCTTCCCGTTTACTGGCGATAACCTGGCAACGATTTGTTCCCTACCTTTG CAAAAGAA TGATGTCTTGATTAGTCTAGGTACTTCGACCACGATTTTGTTGGTAACTGACCAATATCA CTCTTCTC CCAATTATCACTTGTTTATACATCCGACAGTGCCAGGTTATTACATGGGTATGATTTGCT ATTGCAAT GGGTCTTTGGCTCGTGAGAGAGTAAGAGATGATCTGGCTGGACCACAAGCCTCTCAAGCT CCTGGGGA GCAAGTTCCATGGACTCAATTCAATGACGCATTACTGGATGACTCATTGAGCAATGACAA TGAGATAG GCCTTTACTTCCCTCTTGGTGAGATTGTCCCAAATGTTGATGCCGTCACCAAAAGATGGA CATTCGAA AGAAAAGAGAACCATTCGAACAAAAGTATCGTTCTTCACGAGTTGGATCAATTCACGCCA AAGAGGAA AGATGCAAAGAACATAGTGGAAAGTCAGGCCTTAAGCTGTAGAGTGCGTATCTCTCCATT GCTGTCTG ATGAAACAGATGCCTTAAGCGAAACTCAAGTGTTGTCAAAGAAGGAGAATACCCAAGTTA CGTTTGAC TACGATGCATTTCCGTTGTGGACGTATGCCAAAAGACCGAATAGAGCGTTCTTTGTTGGT GGTGCCTC CAAGAATGATGCCATAGTCAGGACAATGGCAAATGTAATAGGTGCTAGGAATGGAAATTA TAGACTTG AAACTCCCAATTCCTGTGCTTTAGGAGGCTGTTATAAAGCGATGTGGTCATGGTTAAAGG TACATGAA CCTACAACTACACCATCTTTCGATGTTTGGTTAAACGCAAGCTTTAACTGGCAGAGAGAT TGCGAATT CGTGTGCCAGTCTGACGCCGCTAAGTGGGAACAATCTAATGGTAAAATTCAAGCTTTATC AGAAGCTG AAGCCTATGTTAAAGCTTTAGCACAATCTCAGGGTCAA

SEQ ID NO: 22 - Kluyveromyces marxianus xylulose kinase

MSTPYYLGFDLSTQQLKCLAI DDQL IVTSVSIEFDRNFPAYNTKKGVYIKNGGVI DAPVAMWLEAVD LCFSQLAERIDLKRVQSMSGSCQQHGTVYWNCEHLPSNLDPASTLREQLQGSLSRPVAPN WQDHSTKK QCDELAESVGGPEELARITGSGAHYRFSGSQIAKIHETEPEVYEATKRI SLVSSFLASVLVGDIVPLE EADACGMNLYDLSKHDFDETLLAVVDHDTARLRRKLSDPPVGAPTRESPLTSLGKVSKYF QDKYGVNC ECEIFPFTGDNLATICSLPLQKNDVLISLGTSTTILLVTDQYHSSPNYHLFIHPTVPGYY MGMICYCN GSLARERVRDDLAGPQASQAPGEQVPWTQFNDALLDDSLSNDNEIGLYFPLGEIVPNVDA VTKRWTFE RKENHSNKSIVLHELDQFTPKRKDAK IVESQALSCRVRI SPLLSDETDALSETQVLSKKENTQVTFD YDAFPLWTYAKRPNRAFFVGGASKNDAIVRTMANVIGARNGNYRLETPNSCALGGCYKAM WSWLKVHE PTTTPSFDVWLNASFNWQRDCEFVCQSDAAKWEQSNGKIQALSEAEAYVKALAQSQGQ

SEQ ID NO: 31 - Saccharomyces cerevisiae xylulokinase gene (XKS1 )

ATGTTGTGTTCAGTAATTCAGAGACAGACAAGAGAGGTTTCCAACACAATGTCTTTAGAC TCATACTA TCTTGGGTTTGATCTTTCGACCCAACAACTGAAATGTCTCGCCATTAACCAGGACCTAAA AATTGTCC ATTCAGAAACAGTGGAATTTGAAAAGGATCTTCCGCATTATCACACAAAGAAGGGTGTCT ATATACAC GGCGACACTATCGAATGTCCCGTAGCCATGTGGTTAGAGGCTCTAGATCTGGTTCTCTCG AAATATCG CGAGGCTAAATTTCCATTGAACAAAGTTATGGCCGTCTCAGGGTCCTGCCAGCAGCACGG GTCTGTCT ACTGGTCCTCCCAAGCCGAATCTCTGTTAGAGCAATTGAATAAGAAACCGGAAAAAGATT TATTGCAC TACGTGAGCTCTGTAGCATTTGCAAGGCAAACCGCCCCCAATTGGCAAGACCACAGTACT GCAAAGCA ATGTCAAGAGTTTGAAGAGTGCATAGGTGGGCCTGAAAAAATGGCTCAATTAACAGGGTC CAGAGCCC ATTTTAGATTTACTGGTCCTCAAATTCTGAAAATTGCACAATTAGAACCAGAAGCTTACG AAAAAACA AAGACCATTTCTTTAGTGTCTAATTTTTTGACTTCTATCTTAGTGGGCCATCTTGTTGAA TTAGAGGA GGCAGATGCCTGTGGTATGAACCTTTATGATATACGTGAAAGAAAATTCAGTGATGAGCT ACTACATC TAATTGATAGTTCTTCTAAGGATAAAACTATCAGACAAAAATTAATGAGAGCACCCATGA AAAATTTG ATAGCGGGTACCATCTGTAAATATTTTATTGAGAAGTACGGTTTCAATACAAACTGCAAG GTCTCTCC CATGACTGGGGATAATTTAGCCACTATATGTTCTTTACCCCTGCGGAAGAATGACGTTCT CGTTTCCC TAGGAACAAGTACTACAGTTCTTCTGGTCACCGATAAGTATCACCCCTCTCCGAACTATC ATCTTTTC ATTCATCCAACTCTGCCAAACCATTATATGGGTATGATTTGTTATTGTAATGGTTCTTTG GCAAGGGA GAGGATAAGAGACGAGTTAAACAAAGAACGGGAAAATAATTATGAGAAGACTAACGATTG GACTCTTT TTAATCAAGCTGTGCTAGATGACTCAGAAAGTAGTGAAAATGAATTAGGTGTATATTTTC CTCTGGGG GAGATCGTTCCTAGCGTAAAAGCCATAAACAAAAGGGTTATCTTCAATCCAAAAACGGGT ATGATTGA AAGAGAGGTGGCCAAGTTCAAAGACAAGAGGCACGATGCCAAAAATATTGTAGAATCACA GGCTTTAA GTTGCAGGGTAAGAATATCTCCCCTGCTTTCGGATTCAAACGCAAGCTCACAACAGAGAC TGAACGAA GATACAATCGTGAAGTTTGATTACGATGAATCTCCGCTGCGGGACTACCTAAATAAAAGG CCAGAAAG GACTTTTTTTGTAGGTGGGGCTTCTAAAAACGATGCTATTGTGAAGAAGTTTGCTCAAGT CATTGGTG CTACAAAGGGTAATTTTAGGCTAGAAACACCAAACTCATGTGCCCTTGGTGGTTGTTATA AGGCCATG TGGTCATTGTTATATGACTCTAATAAAATTGCAGTTCCTTTTGATAAATTTCTGAATGAC AATTTTCC ATGGCATGTAATGGAAAGCATATCCGATGTGGATAATGAAAATTGGGATCGCTATAATTC CAAGATTG TCCCCTTAAGCGAACTGGAAAAGACTCTCATCTAA

SEQ ID NO: 32 - Saccharomyces cerevisiae xylulokinase

MLCSVIQRQTREVSNTMSLDSYYLGFDLSTQQLKCLAINQDLKIVHSETVEFEKDLPHYH TKKGVYIH GDTIECPVAMWLEALDLVLSKYREAKFPLNKVMAVSGSCQQHGSVYWSSQAESLLEQLNK KPEKDLLH YVSSVAFARQTAPNWQDHSTAKQCQEFEECIGGPEKMAQLTGSRAHFRFTGPQILKIAQL EPEAYEKT KTI SLVSNFLTSILVGHLVELEEADACGMNLYDIRERKFSDELLHLI DSSSKDKTIRQKLMRAPMKNL IAGTICKYFIEKYGFNTNCKVSPMTGDNLATICSLPLRKNDVLVSLGTSTTVLLVTDKYH PSPNYHLF IHPTLPNHYMGMICYCNGSLARERIRDELNKERENNYEKTNDWTLFNQAVLDDSESSENE LGVYFPLG EIVPSVKAI KRVI FNPKTGMIEREVAKFKDKRHDAK IVESQALSCRVRI SPLLSDSNASSQQRLNE DTIVKFDYDESPLRDYLNKRPERTFFVGGASKNDAIVKKFAQVIGATKGNFRLETPNSCA LGGCYKAM WSLLYDSNKIAVPFDKFLNDNFPWHVMESI SDVDNENWDRYNSKIVPLSELEKTLI *

SEQ ID NO: 49 - Codon optimized Zyqosaccharomyces bailii YME2 gene

ATGTTGCCCATTTCTGGACCTTCCAACATGCTGCATGGCCTCGTTTCAGCCCGTTGTGCA GGGGGTTG GAGGCCACTTATCTCGCATTTGCGTAGGGGAGTTTTTCCTAAGATGCTTACCATGACAGG TATTGGGG CCAAGAGATTTGTCTCCAGCGAAATACAGGAGAAAGACGAACAAGCTGGTGAGTCTACTA CTGCTACA GATACTGGTATCATTCATAAAACGGAGCAGGAGACCCTAGTATATTTCGACAACGTCTAT CCACGGAC CGCATCTCTATGGAGCCCTGCGCAATGGTACAATCTACTTCTAACTAATCAATCGAGGGA GGCTGTTA GGCAAAAGATCAGCGGTTCGATCCCGCTAGAGACCATTTTTTGGCTTCATTGA

SEQ ID NO: 50- Zygosaccharomyces bailii YME2

MLPI SGPSNMLHGLVSARCAGGWRPLI SHLRRGVFPKMLTMTGIGAKRFVSSEIQEKDEQAGESTTATD TGI IHKTEQETLVYFDNVYPRTASLWSPAQWYNLLLTNQSREAVRQKI SGSI PLETI FWLH SEQ ID NO: 131 - Saccharomyces cerevisiae ENQ1 DNA

ATGGCTGTCTCTAAAGTTTACGCTAGATCCGTCTACGACTCCCGTGGTAACCCAACCGTC GAAGTCGAA TTAACCACCGAAAAGGGTGTTTTCAGATCCATTGTCCCATCTGGTGCTTCTACCGGTGTC CACGAAGCT TTGGAAATGAGAGATGGTGACAAATCCAAGTGGATGGGTAAGGGTGTTTTGCACGCTGTT AAGAACGTC AACGATGTCATTGCTCCAGCTTTCGTTAAGGCTAACATTGATGTTAAGGACCAAAAGGCC GTCGATGAC TTCTTGATTTCTTTGGACGGTACTGCCAACAAATCCAAGTTGGGTGCTAACGCTATCTTG GGTGTTTCT TTGGCTGCTTCCAGAGCTGCCGCTGCTGAAAAGAATGTCCCATTATACAAGCACTTGGCT GACTTGTCT AAGTCCAAGACCTCTCCATACGTTTTGCCAGTTCCATTCTTGAACGTTTTGAACGGTGGT TCCCACGCT GGTGGTGCTTTGGCTTTGCAAGAATTTATGATTGCTCCAACTGGTGCTAAGACCTTCGCT GAAGCTTTG AGAATTGGTTCCGAAGTTTACCACAACTTGAAGTCTTTGACCAAGAAGAGATACGGTGCT TCTGCCGGT AACGTCGGTGACGAAGGTGGTGTTGCTCCAAACATTCAAACTGCTGAAGAAGCTTTGGAC TTGATTGTT GACGCTATCAAGGCTGCTGGTCACGACGGTAAGATCAAGATCGGTTTGGACTGTGCTTCC TCTGAATTC TTCAAGGACGGTAAGTACGACTTGGACTTCAAGAATCCAAACTCTGACAAATCCAAGTGG TTGACTGGT CCTCAATTGGCTGACTTGTACCACTCCTTGATGAAGAGATACCCAATTGTCTCCATCGAA GATCCATTT GCTGAAGATGACTGGGAAGCTTGGTCTCACTTCTTCAAGACCGCTGGTATTCAAATTGTT GCTGATGAC TTGACTGTCACCAACCCAAAGAGAATTGCTACCGCTATCGAAAAGAAGGCTGCCGACGCT TTGTTGTTG AAGGTCAACCAAATC

GGTACCTTGTCTGAATCCATCAAGGCTGCTCAAGACTCTTTCGCTGCCGGTTGGGGT GTTATGGTTTCC CACAGATCTGGTGAAACTGAAGACACTTTCATTGCTGACTTGGTCGTCGGTTTGAGAACT GGTCAAATC AAGACTGGTGCTCCAGCTAGATCCGAAAGATTGGCTAAATTGAACCAATTGTTGAGAATC GAAGAAGAA TTAGGTGACAACGCTGTTTTCGCTGGTGAAAACTTCCACCACGGTGACAAATTATAA

SEQ ID NO: 132 - Saccharomyces cerevisiae ENQ1

MAVSKVYARSVYDSRGNPTVEVELTTEKGVFRSIVPSGASTGVHEALEMRDGDKSKWMGK GVLHAVKNV NDVIAPAFVKANI DVKDQKAVDDFLI SLDGTANKSKLGANAILGVSLAASRAAAAEKNVPLYKHLADLS KSKTSPYVLPVPFLNVLNGGSHAGGALALQEFMIAPTGAKTFAEALRIGSEVYHNLKSLT KKRYGASAG NVGDEGGVAPNIQTAEEALDLIVDAIKAAGHDGKIKIGLDCASSEFFKDGKYDLDFKNPN SDKSKWLTG PQLADLYHSLMKRYPIVSIEDPFAEDDWEAWSHFFKTAGIQIVADDLTVTNPKRIATAIE KKAADALLL KVNQIGTLSESIKAAQDSFAAGWGVMVSHRSGETEDTFIADLWGLRTGQIKTGAPARSER LAKLNQLL RIEEELGDNAVFAGENFHHGDKL*

SEQ ID NO: 133 - Saccharomyces cerevisiae PFK2 DNA

ATGACTGTTACTACTCCTTTTGTGAATGGTACTTCTTATTGTACCGTCACTGCATATTCC GTTCAATCT TATAAAGCTGCCATAGATTTTTACACCAAGTTTTTGTCATTAGAAAACCGCTCTTCTCCA GATGAAAAC TCCACTTTATTGTCTAACGATTCCATCTCTTTGAAGATCCTTCTACGTCCTGATGAAAAA ATCAATAAA AATGTTGAGGCTCATTTGAAGGAATTGAACAGTATTACCAAGACTCAAGACTGGAGATCA CATGCCACC CAATCCTTGGTATTTAACACTTCCGACATCTTGGCAGTCAAGGACACTCTAAATGCTATG AACGCTCCT CTTCAAGGCTACCCAACAGAACTATTTCCAATGCAGTTGTACACTTTGGACCCATTAGGT AACGTTGTT GGTGTTACTTCTACTAAGAACGCAGTTTCAACCAAGCCAACTCCACCACCAGCACCAGAA GCTTCTGCT GAGTCTGGTCTTTCCTCTAAAGTTCACTCTTACACTGATTTGGCTTACCGTATGAAAACC ACCGACACC TATCCATCTCTGCCAAAGCCATTGAACAGGCCTCAAAAGGCAATTGCCGTCATGACTTCC GGTGGTGAT GCTCCAGGTATGAACTCTAACGTTAGAGCCATCGTGCGTTCCGCTATCTTCAAAGGTTGT CGTGCCTTT GTTGTCATGGAAGGTTATGAAGGTTTGGTTCGTGGTGGTCCAGAATACATCAAGGAATTC CACTGGGAA GACGTCCGTGGTTGGTCTGCTGAAGGTGGTACCAACATTGGTACTGCCCGTTGTATGGAA TTCAAGAAG CGCGAAGGTAGATTATTGGGTGCCCAACATTTGATTGAGGCCGGTGTCGATGCTTTGATC GTTTGTGGT GGTGACGGTTCTTTGACTGGTGCTGATCTGTTTAGATCAGAATGGCCTTCTTTGATCGAG GAATTGTTG AAAACAAACAGAATTTCCAACGAACAATACGAAAGAATGAAGCATTTGAATATTTGCGGT ACTGTCGGT TCTATTGATAACGATATGTCCACCACGGATGCTACTATTGGTGCTTACTCTGCCTTGGAC AGAATCTGT AAGGCCATCGATTACGTTGAAGCCACTGCCAACTCTCACTCAAGAGCTTTCGTTGTTGAA GTTATGGGT AGAAACTGTGGTTGGTTAGCTTTATTAGCTGGTATCGCCACTTCCGCTGACTATATCTTT ATTCCAGAG AAGCCAGCCACTTCCAGCGAATGGCAAGATCAAATGTGTGACATTGTCTCCAAGCACAGA TCAAGGGGT AAGAGAACCACCATTGTTGTTGTTGCAGAAGGTGCTATCGCTGCTGACTTGACCCCAATT TCTCCAAGC GACGTCCACAAAGTTCTAGTTGACAGATTAGGTTTGGATACAAGAATTACTACCTTAGGT CACGTTCAA AGAGGTGGTACTGCTGTTGCTTACGACCGTATCTTGGCTACTTTACAAGGTCTTGAGGCC GTTAATGCC GTTTTGGAATCCACTCCAGACACCCCATCACCATTGATTGCTGTTAACGAAAACAAAATT GTTCGTAAA CCATTAATGGAATCCGTCAAGTTGACCAAAGCAGTTGCAGAAGCCATTCAAGCTAAGGAT TTCAAGAGA GCTATGTCTTTAAGAGACACTGAGTTCATTGAACATTTAAACAATTTCATGGCTATCAAC TCTGCTGAC CACAACGAACCAAAGCTACCAAAGGACAAGAGACTGAAGATTGCCATTGTTAATGTCGGT GCTCCAGCT GGTGGTATCAACTCTGCCGTCTACTCGATGGCTACTTACTGTATGTCCCAAGGTCACAGA CCATACGCT ATCTACAATGGTTGGTCTGGTTTGGCAAGACATGAAAGTGTTCGTTCTTTGAACTGGAAG GATATGTTG GGTTGGCAATCCCGTGGTGGTTCTGAAATCGGTACTAACAGAGTCACTCCAGAAGAAGCA GATCTAGGT ATGATTGCTTACTATTTCCAAAAGTACGAATTTGATGGTTTGATCATCGTTGGTGGTTTC GAAGCTTTT GAATCTTTACATCAATTAGAGAGAGCAAGAGAAAGTTATCCAGCTTTCAGAATCCCAATG GTCTTGATA CCAGCTACTTTGTCTAACAATGTTCCAGGTACTGAATACTCTTTGGGTTCTGATACCGCT TTGAATGCT CTAATGGAATACTGTGATGTTGTTAAACAATCCGCTTCTTCAACCAGAGGTAGAGCCTTC GTTGTCGAT TGTCAAGGTGGTAACTCAGGCTATTTGGCCACTTACGCTTCTTTGGCTGTTGGTGCTCAA GTCTCTTAT GTCCCAGAAGAAGGTATTTCTTTGGAGCAATTGTCCGAGGATATTGAATACTTAGCTCAA TCTTTTGAA AAGGCAGAAGGTAGAGGTAGATTTGGTAAATTGATTTTGAAGAGTACAAACGCTTCTAAG GCTTTATCA GCCACTAAATTGGCTGAAGTTATTACTGCTGAAGCCGATGGCAGATTTGACGCTAAGCCA GCTTATCCA GGTCATGTACAACAAGGTGGTTTGCCATCTCCAATTGATAGAACAAGAGCCACTAGAATG GCCATTAAA GCTGTCGGCTTCATCAAAGACAACCAAGCTGCCATTGCTGAAGCTCGTGCTGCCGAAGAA AACTTCAAC GCTGATGACAAGACCATTTCTGACACTGCTGCTGTCGTTGGTGTTAAGGGTTCACATGTC GTTTACAAC TCCATTAGACAATTGTATGACTATGAAACTGAAGTTTCCATGAGAATGCCAAAGGTCATT CACTGGCAA GCTACCAGACTCATTGCTGACCATTTGGTTGGAAGAAAGAGAGTTGATTAA

SEQ ID NO: 134 - Saccharomyces cerevisiae PFK2

MTVTTPFVNGTSYCTVTAYSVQSYKAAI DFYTKFLSLENRSSPDE STLLSNDSI SLKILLRPDEKI K NVEAHLKELNSITKTQDWRSHATQSLVFNTSDILAVKDTLNAMNAPLQGYPTELFPMQLY TLDPLGNVV GVTSTKNAVSTKPTPPPAPEASAESGLSSKVHSYTDLAYRMKTTDTYPSLPKPLNRPQKA IAVMTSGGD APGMNSNVRAIVRSAI FKGCRAFVVMEGYEGLVRGGPEYIKEFHWEDVRGWSAEGGT IGTARCMEFKK REGRLLGAQHLIEAGVDALIVCGGDGSLTGADLFRSEWPSLIEELLKTNRI SNEQYERMKHL ICGTVG SI DNDMSTTDATIGAYSALDRICKAI DYVEATANSHSRAFVVEVMGRNCGWLALLAGIATSADYI FI PE KPATSSEWQDQMCDIVSKHRSRGKRTTIVVVAEGAIAADLTPISPSDVHKVLVDRLGLDT RITTLGHVQ RGGTAVAYDRILATLQGLEAVNAVLESTPDTPSPLIAVNENKIVRKPLMESVKLTKAVAE AIQAKDFKR AMSLRDTEFIEHLNNFMAINSADHNEPKLPKDKRLKIAIVNVGAPAGGINSAVYSMATYC MSQGHRPYA IYNGWSGLARHESVRSLNWKDMLGWQSRGGSEIGTNRVTPEEADLGMIAYYFQKYEFDGL I IVGGFEAF ESLHQLERARESYPAFRI PMVLI PATLSNNVPGTEYSLGSDTALNALMEYCDVVKQSASSTRGRAFVVD CQGGNSGYLATYASLAVGAQVSYVPEEGI SLEQLSEDIEYLAQSFEKAEGRGRFGKLILKSTNASKALS ATKLAEVITAEADGRFDAKPAYPGHVQQGGLPSPI DRTRATRMAIKAVGFIKDNQAAIAEARAAEENFN ADDKTI SDTAAVVGVKGSHVVYNSIRQLYDYETEVSMRMPKVIHWQATRLIADHLVGRKRVD*

SEQ ID NO: 135 - Saccharomyces cerevisiae PFK26 DNA

ATGTTCAAACCAGTAGACTTCTCTGAAACATCTCCTGTGCCGCCTGATATTGATCTTGCT CCTACACAA TCTCCACACCATGTGGCACCTAGTCAAGACTCCAGTTATGATCTTTTATCCCGGAGTTCC GATGATAAA ATTGATGCTGAAAAGGGTCCGCATGATGAATTATCTAAGCACTTACCACTTTTTCAGAAA AGACCTTTG AGCGATACTCCTATATCGAGCAATTGGAACTCTCCTGGAATCACTGAAGAAAATACACCT TCTGACTCT CCTGAAAATAGCGCTACTAATTTGAAATCGCTACATCGATTGCATATTAACGACGAAACG CAACTAAAA AATGCTAAAATTCCCACAAACGATACTACTGACTACATGCCTCCTTCAGATGGAGCAAAT GAGGTAACT CGGATTGATTTGAAAGACATTAAATCACCTACGAGACACCATAAAAGAAGACCTACCACC ATCGATGTT CCTGGTTTAACAAAGTCTAAAACATCTCCAGATGGTCTCATATCAAAGGAAGATAGTGGA TCAAAGTTA GTGATTGTCATGGTCGGACTGCCAGCTACGGGAAAGTCATTTATTACAAATAAATTATCC AGATTTTTA AATTATTCTTTATACTATTGTAAAGTGTTTAATGTCGGTAACACTAGAAGGAAGTTTGCT AAGGAGCAT GGCCTAAAGGACCAGGATTCAAAGTTTTTCGAGCCGAAAAACGCCGACTCTACTAGGTTG AGAGACAAA TGGGCCATGGATACTCTGGATGAATTGCTAGATTATTTATTAGAAGGTTCAGGATCTGTG GGAATTTTT GATGCTACAAATACCTCTCGTGAAAGAAGAAAAAACGTTCTGGCTAGAATCAGAAAGAGA AGTCCTCAT TTGAAGGTTTTATTTTTAGAATCTGTTTGTTCGGATCATGCACTGGTACAGAAAAATATT AGACTCAAA TTATTTGGTCCAGATTACAAAGGTAAAGATCCTGAAAGCTCTTTAAAAGATTTTAAAAGT CGCCTGGCA AACTACTTGAAAGCCTATGAACCAATTGAGGATGACGAAAATTTGCAGTACATCAAAATG ATAGATGTG GGAAAGAAAGTCATCGCATACAATATTCAAGGGTTTTTAGCTTCGCAGACGGTATATTAT TTGTTAAAT TTCAATTTGGCTGACAGACAAATTTGGATAACGAGAAGTGGCGAGAGCGAAGATAATGTT AGTGGCCGT ATAGGCGGAAATTCCCATTTGACTCCTCGTGGTCTAAGATTTGCTAAAAGTCTACCAAAA TTCATTGCC AGACAGAGAGAAATATTTTATCAAAATCTCATGCAACAAAAAAAGAATAATGAAAATACA GATGGGAAC ATTTATAATGACTTTTTCGTTTGGACCAGCATGCGTGCTAGGACTATAGGGACTGCTCAA TATTTCAAC GAAGATGATTATCCTATCAAACAAATGAAAATGTTAGATGAGTTAAGTGCAGGTGATTAT GATGGTATG ACATATCCAGAAATTAAAAACAACTTTCCTGAAGAATTCGAAAAAAGACAGAAAGATAAG TTGAGATAC AGATACCCTGGTATTGGCGGTGAATCGTATATGGACGTTATTAATAGACTCAGACCTGTT ATCACAGAA CTAGAAAGAATCGAGGATAACGTTCTTATTATTACACACCGGGTGGTGGCAAGAGCCTTA TTGGGTTAT TTTATGAACTTGAGTATGGGTATTATTGCCAATTTGGATGTCCCATTACATTGTGTATAT TGCCTAGAA CCAAAACCATATGGAATCACTTGGTCATTATGGGAGTATGATGAAGCATCGGATTCATTT TCTAAGGTC CCACAAACGGACTTGAATACCACCAGAGTAAAGGAGGTTGGCCTTGTTTATAATGAAAGA AGATATTCT GTTATACCAACAGCTCCGCCAAGTGCAAGAAGCAGCTTTGCAAGTGACTTTTTGTCAAGA AAAAGATCT AATCCTACTTCTGCATCTTCATCCCAGAGTGAATTATCAGAACAACCCAAGAATAGCGTT AGTGCTCAA ACTGGCAGCAATAATACCACTCTCATTGGGAGCAACTTTAACATCAAGAATGAAAATGGT GATTCGAGA ATACCATTATCTGCACCACTTATGGCCACTAATACTTCTAATAACATCTTAGATGGTGGA GGTACCTCA ATTTCGATACATCGTCCCAGGGTTGTTCCAAATCAAAACAACGTGAATCCTCTTTTGGCT AACAACAAT AAAGCGGCTTCTAATGTACCTAATGTAAAGAAGTCAGCGGCTACACCAAGGCAAATTTTT GAAATAGAT AAAGTGGACGAAAAGTTATCCATGTTGAAAAATAAAAGTTTTCTATTACATGGAAAGGAT TATCCTAAT AATGCTGATAATAATGACAACGAAGATATAAGGGCAAAAACCATGAATCGCAGCCAAAGT CACGTTTAA SEQ ID NO: 136 - Saccharomyces cerevisiae PFK26

MFKPVDFSETSPVPPDI DLAPTQSPHHVAPSQDSSYDLLSRSSDDKI DAEKGPHDELSKHLPLFQKRPL SDTPI SSNWNSPGITEENTPSDSPENSATNLKSLHRLHI DETQLKNAKI PTNDTTDYMPPSDGANEVT RI DLKDIKSPTRHHKRRPTTI DVPGLTKSKTSPDGLI SKEDSGSKLVIVMVGLPATGKSFITNKLSRFL NYSLYYCKVFNVGNTRRKFAKEHGLKDQDSKFFEPKNADSTRLRDKWAMDTLDELLDYLL EGSGSVGIF DATNTSRERRKNVLARIRKRSPHLKVLFLESVCSDHALVQKNIRLKLFGPDYKGKDPESS LKDFKSRLA NYLKAYEPIEDDENLQYIKMI DVGKKVIAY IQGFLASQTVYYLLNFNLADRQIWITRSGESEDNVSGR IGGNSHLTPRGLRFAKSLPKFIARQREI FYQNLMQQKKNNENTDG IYNDFFVWTSMRARTIGTAQYFN EDDYPIKQMKMLDELSAGDYDGMTYPEIKNNFPEEFEKRQKDKLRYRYPGIGGESYMDVI NRLRPVITE LERIEDNVLI ITHRVVARALLGYFMNLSMGI IANLDVPLHCVYCLEPKPYGITWSLWEYDEASDSFSKV PQTDLNTTRVKEVGLVYNERRYSVI PTAPPSARSSFASDFLSRKRSNPTSASSSQSELSEQPKNSVSAQ TGSNNTTLIGSNFNIKNENGDSRIPLSAPLMATNTSNNILDGGGTSISIHRPRVVPNQNN VNPLLANNN KAASNVPNVKKSAATPRQI FEI DKVDEKLSMLKNKSFLLHGKDYPNNADNNDNEDIRAKTMNRSQSHV*

SEQ ID NO: 137 - Saccharomyces cerevisiae PGM DNA

ATGTCCAATAACTCATTCACTAACTTCAAACTGGCCACTGAATTGCCAGCCTGGTCTAAG TTGCAAAAA ATTTATGAATCTCAAGGTAAGACTTTGTCTGTCAAGCAAGAATTCCAAAAAGATGCCAAG CGTTTTGAA AAATTGAACAAGACTTTCACCAACTATGATGGTTCCAAAATCTTGTTCGACTACTCAAAG AACTTGGTC AACGATGAAATCATTGCTGCATTGATTGAACTGGCCAAGGAGGCTAACGTCACCGGTTTG AGAGATGCT ATGTTCAAAGGTGAACACATCAACTCCACTGAAGATCGTGCTGTCTACCACGTCGCATTG AGAAACAGA GCTAACAAGCCAATGTACGTTGATGGTGTCAACGTTGCTCCAGAAGTCGACTCTGTCTTG AAGCACATG AAGGAGTTCTCTGAACAAGTTCGTTCTGGTGAATGGAAGGGTTATACCGGTAAGAAGATC ACCGATGTT GTTAACATCGGTATTGGTGGTTCCGATTTGGGTCCAGTCATGGTCACTGAGGCTTTGAAG CACTACGCT GGTGTCTTGGATGTCCACTTCGTTTCCAACATTGACGGTACTCACATTGCTGAAACCTTG AAGGTTGTT GACCCAGAAACTACTTTGTTTTTGATTGCTTCCAAGACTTTCACTACCGCTGAAACTATC ACTAACGCT AACACTGCCAAGAACTGGTTCTTGTCGAAGACAGGTAATGATCCATCTCACATTGCTAAG CATTTCGCT GCTTTGTCCACTAACGAAACCGAAGTTGCCAAGTTCGGTATTGACACCAAAAACATGTTT GGTTTCGAA AGTTGGGTCGGTGGTCGTTACTCTGTCTGGTCGGCTATTGGTTTGTCTGTTGCCTTGTAC ATTGGCTAT GACAACTTTGAGGCTTTCTTGAAGGGTGCTGAAGCCGTCGACAACCACTTCACCCAAACC CCATTGGAA GACAACATTCCATTGTTGGGTGGTTTGTTGTCTGTCTGGTACAACAACTTCTTTGGTGCT CAAACCCAT TTGGTTGCTCCATTCGACCAATACTTGCACAGATTCCCAGCCTACTTGCAACAATTGTCA ATGGAATCT AACGGTAAGTCTGTTACCAGAGGTAACGTGTTTACTGACTACTCTACTGGTTCTATCTTG TTTGGTGAA CCAGCTACCAACGCTCAACACTCTTTCTTCCAATTGGTTCACCAAGGTACCAAGTTGATT CCATCTGAT TTCATCTTAGCTGCTCAATCTCATAACCCAATTGAGAACAAATTACATCAAAAGATGTTG GCTTCAAAC TTCTTTGCTCAAGCTGAAGCTTTAATGGTTGGTAAGGATGAAGAACAAGTTAAGGCTGAA GGTGCCACT GGTGGTTTGGTCCCACACAAGGTCTTCTCAGGTAACAGACCAACTACCTCTATCTTGGCT CAAAAGATT ACTCCAGCTACTTTGGGTGCTTTGATTGCCTACTACGAACATGTTACTTTCACTGAAGGT GCCATTTGG AATATCAACTCTTTCGACCAATGGGGTGTTGAATTGGGTAAAGTCTTGGCTAAAGTCATC GGCAAGGAA TTGGACAACTCCTCCACCATTTCTACCCACGATGCTTCTACCAACGGTTTAATCAATCAA TTCAAGGAA TGGATGTGA

SEQ ID NO: 138 - Sacc aromyces cerevisiae PGM

MSNNSFTNFKLATELPAWSKLQKIYESQGKTLSVKQEFQKDAKRFEKLNKTFTNYDGSKI LFDYSKNLV NDEIIAALIELAKEANVTGLRDAMFKGEHINSTEDRAVYHVALRNRANKPMYVDGVNVAP EVDSVLKHM KEFSEQVRSGEWKGYTGKKITDVVNIGIGGSDLGPVMVTEALKHYAGVLDVHFVSNI DGTHIAETLKVV DPETTLFLIASKTFTTAETITNANTAKNWFLSKTGNDPSHIAKHFAALSTNETEVAKFGI DTKNMFGFE SWVGGRYSVWSAIGLSVALYIGYDNFEAFLKGAEAVDNHFTQTPLED I PLLGGLLSVWYNNFFGAQTH LVAPFDQYLHRFPAYLQQLSMESNGKSVTRGNVFTDYSTGSILFGEPATNAQHSFFQLVH QGTKLIPSD FILAAQSHNPIENKLHQKMLASNFFAQAEALMVGKDEEQVKAEGATGGLVPHKVFSGNRP TTSILAQKI TPATLGALIAYYEHVTFTEGAIWNINSFDQWGVELGKVLAKVIGKELDNSSTI STHDASTNGLINQFKE WM*

SEQ ID NO: 139 - Saccharomyces cerevisiae GPM1 DNA

ATGCCAAAGTTAGTTTTAGTTAGACACGGTCAATCCGAATGGAACGAAAAGAACTTATTC ACCGGTTGG GTTGATGTTAAATTGTCTGCCAAGGGTCAACAAGAAGCCGCTAGAGCCGGTGAATTGTTG AAGGAAAAG AAGGTCTACCCAGACGTCTTGTACACTTCCAAGTTGTCCAGAGCTATCCAAACTGCTAAC ATTGCTTTG GAAAAGGCTGACAGATTATGGATTCCAGTCAACAGATCCTGGAGATTGAACGAAAGACAT TACGGTGAC TTACAAGGTAAGGACAAGGCTGAAACTTTGAAGAAGTTCGGTGAAGAAAAATTCAACACC TACAGAAGA TCCTTCGATGTTCCACCTCCCCCAATCGACGCTTCTTCTCCATTCTCTCAAAAGGGTGAT GAAAGATAC AAGTACGTTGACCCAAATGTCTTGCCAGAAACTGAATCTTTGGCTTTGGTCATTGACAGA TTGTTGCCA TACTGGCAAGATGTCATTGCCAAGGACTTGTTGAGTGGTAAGACCGTCATGATCGCCGCT CACGGTAAC TCCTTGAGAGGTTTGGTTAAGCACTTGGAAGGTATCTCTGATGCTGACATTGCTAAGTTG AACATCCCA ACTGGTATTCCATTGGTCTTCGAATTGGACGAAAACTTGAAGCCATCTAAGCCATCTTAC TACTTGGAC CCAGAAGCTGCCGCTGCTGGTGCCGCTGCTGTTGCCAACCAAGGTAAGAAATAA SEQ ID NO: 140 - Saccharomyces cerevisiae GPM1

MPKLVLVRHGQSEWNEKNLFTGWVDVKLSAKGQQEAARAGELLKEKKVYPDVLYTSKLSR AIQTANIAL EKADRLWI PVNRSWRLNERHYGDLQGKDKAETLKKFGEEKFNTYRRSFDVPPPPI DASSPFSQKGDERY KYVDPNVLPETESLALVI DRLLPYWQDVIAKDLLSGKTVMIAAHGNSLRGLVKHLEGI SDADIAKLNI P TGI PLVFELDENLKPSKPSYYLDPEAAAAGAAAVANQGKK

SEQ ID NO: 141 - Saccharomyces cerevisiae TPI1 DNA

ATGGCTAGAACTTTCTTTGTCGGTGGTAACTTTAAATTAAACGGTTCCAAACAATCCATT AAGGAAATT GTTGAAAGATTGAACACTGCTTCTATCCCAGAAAATGTCGAAGTTGTTATCTGTCCTCCA GCTACCTAC TTAGACTACTCTGTCTCTTTGGTTAAGAAGCCACAAGTCACTGTCGGTGCTCAAAACGCC TACTTGAAG GCTTCTGGTGCTTTCACCGGTGAAAACTCCGTTGACCAAATCAAGGATGTTGGTGCTAAG TGGGTTATT TTGGGTCACTCCGAAAGAAGATCTTACTTCCACGAAGATGACAAGTTCATTGCTGACAAG ACCAAGTTC GCTTTAGGTCAAGGTGTCGGTGTCATCTTGTGTATCGGTGAAACTTTGGAAGAAAAGAAG GCCGGTAAG ACTTTGGATGTTGTTGAAAGACAATTGAACGCTGTCTTGGAAGAAGTTAAGGACTGGACT AACGTCGTT GTCGCTTACGAACCAGTCTGGGCCATTGGTACCGGTTTGGCTGCTACTCCAGAAGATGCT CAAGATATT CACGCTTCCATCAGAAAGTTCTTGGCTTCCAAGTTGGGTGACAAGGCTGCCAGCGAATTG AGAATCTTA TACGGTGGTTCCGCTAACGGTAGCAACGCCGTTACCTTCAAGGACAAGGCTGATGTCGAT GGTTTCTTG GTCGGTGGTGCTTCTTTGAAGCCAGAATTTGTTGATATCATCAACTCTAGAAACTAA

SEQ ID NO: 142 - Saccharomyces cerevisiae TPI 1

MARTFFVGGNFKLNGSKQSIKEIVERLNTASI PENVEVVICPPATYLDYSVSLVKKPQVTVGAQNAYLK ASGAFTGENSVDQIKDVGAKWVILGHSERRSYFHEDDKFIADKTKFALGQGVGVILCIGE TLEEKKAGK TLDVVERQLNAVLEEVKDWTNVVVAYEPVWAIGTGLAATPEDAQDIHASIRKFLASKLGD KAASELRIL YGGSANGSNAVTFKDKADVDGFLVGGASLKPEFVDI I SRN SEQ ID NO: 147 - Saccharomyces cerevisiae PGM1 DNA

ATGTCACTTCTAATAGATTCTGTACCAACAGTTGCTTATAAGGACCAAAAACCGGGTACT TCAGGTTTA CGTAAGAAGACCAAGGTTTTCATGGATGAGCCTCATTATACTGAGAACTTCATTCAAGCA ACAATGCAA TCTATCCCTAATGGCTCAGAGGGAACCACTTTAGTTGTTGGAGGAGATGGTCGTTTCTAC AACGATGTT ATCATGAACAAGATTGCCGCAGTAGGTGCTGCAAACGGTGTCAGAAAGTTAGTCATTGGT CAAGGCGGT TTACTTTCAACACCAGCTGCTTCTCATATAATTAGAACATACGAGGAAAAGTGTACCGGT GGTGGTATC ATATTAACTGCCTCACACAACCCAGGCGGTCCAGAGAATGATTTAGGTATCAAGTATAAT TTACCTAAT GGTGGGCCAGCTCCAGAGAGTGTCACTAACGCTATCTGGGAAGCGTCTAAAAAATTAACT CACTATAAA ATTATAAAGAACTTCCCCAAGTTGAATTTGAACAAGCTTGGTAAAAACCAAAAATATGGC CCATTGTTA GTGGACATAATTGATCCTGCCAAAGCATACGTTCAATTTCTGAAGGAAATTTTTGATTTT GACTTAATT AAAAGCTTCTTAGCGAAACAGCGCAAAGACAAAGGGTGGAAGTTGTTGTTTGACTCCTTA AATGGTATT ACAGGACCATATGGTAAGGCTATATTTGTTGATGAATTTGGTTTACCGGCAGAGGAAGTT CTTCAAAAT TGGCACCCTTTACCTGATTTCGGCGGTTTACATCCCGATCCGAATCTAACCTATGCACGA ACTCTTGTT GACAGGGTTGACCGCGAAAAAATTGCCTTTGGAGCAGCCTCCGATGGTGATGGTGATAGG AATATGATT TACGGTTATGGCCCTGCTTTCGTTTCGCCAGGTGATTCTGTTGCCATTATTGCCGAATAT GCACCCGAA ATTCCATACTTCGCCAAACAAGGTATTTATGGCTTGGCACGTTCATTTCCTACATCCTCA GCCATTGAT CGTGTTGCAGCAAAAAAGGGATTAAGATGTTACGAAGTTCCAACCGGCTGGAAATTCTTC TGTGCCTTA TTTGATGCTAAAAAGCTATCAATCTGTGGTGAAGAATCCTTCGGTACAGGTTCCAATCAT ATCAGAGAA AAGGACGGTCTATGGGCCATTATTGCTTGGTTAAATATCTTGGCTATCTACCATAGGCGT AACCCTGAA AAGGAAGCTTCGATCAAAACTATTCAGGACGAATTTTGGAACGAGTATGGCCGTACTTTC TTCACAAGA TACGATTACGAACATATCGAATGCGAGCAGGCCGAAAAAGTTGTAGCTCTTTTGAGTGAA TTTGTATCA AGGCCAAACGTTTGTGGCTCCCACTTCCCAGCTGATGAGTCTTTAACCGTTATCGATTGT GGTGATTTT TCGTATAGAGATCTAGATGGCTCCATCTCTGAAAATCAAGGCCTTTTCGTAAAGTTTTCG AATGGGACT AAATTTGTTTTGAGGTTATCCGGCACAGGCAGTTCTGGTGCAACAATAAGATTATACGTA GAAAAGTAT ACTGATAAAAAGGAGAACTATGGCCAAACAGCTGACGTCTTCTTGAAACCCGTCATCAAC TCCATTGTA AAATTCTTAAGATTTAAAGAAATTTTAGGAACAGACGAACCAACAGTCCGCACATAG

SEQ ID NO: 148 - Saccharomyces cerevisiae PGM1

MSLLI DSVPTVAYKDQKPGTSGLRKKTKVFMDEPHYTENFIQATMQSI PNGSEGTTLVVGGDGRFYNDV IMNKIAAVGAANGVRKLVIGQGGLLSTPAASHI IRTYEEKCTGGGI ILTASHNPGGPENDLGIKYNLPN GGPAPESVTNAIWEASKKLTHYKI IKNFPKLNLNKLGKNQKYGPLLVDI I DPAKAYVQFLKEI FDFDLI KSFLAKQRKDKGWKLLFDSLNGITGPYGKAI FVDEFGLPAEEVLQNWHPLPDFGGLHPDPNLTYARTLV DRVDREKIAFGAASDGDGDRNMIYGYGPAFVSPGDSVAI IAEYAPEI PYFAKQGIYGLARSFPTSSAI D RVAAKKGLRCYEVPTGWKFFCALFDAKKLSICGEESFGTGSNHIREKDGLWAI IAWL ILAIYHRRNPE KEASIKTIQDEFWNEYGRTFFTRYDYEHIECEQAEKVVALLSEFVSRPNVCGSHFPADES LTVI DCGDF SYRDLDGSI SENQGLFVKFSNGTKFVLRLSGTGSSGATIRLYVEKYTDKKENYGQTADVFLKPVI SIV KFLRFKEILGTDEPTVRT

SEQ ID NO: 149 - Saccharomyces cerevisiae PGM3 DNA

ATGTTGCAAGGAATTTTAGAAACCGTACCATCTGACTTGAAAGATCCGATATCATTA TGGTTTAAGCAA GACCGCAACCCAAAAACTATAGAAGAGGTCACCGCTCTCTGCAAAAAATCCGACTGGAAT GAGTTACAC AAAAGATTTGATTCTAGAATTCAGTTTGGCACTGCTGGTTTAAGATCGCAAATGCAAGCT GGCTTTAGC AGGATGAATACTTTAGTAGTCATACAAGCGTCTCAGGGATTGGCAACTTATGTAAGACAA CAGTTTCCA GACAATTTGGTAGCTGTTGTGGGACACGATCATAGATTCCATTCTAAGGAGTTCGCTAGA GCTACTGCT GCTGCATTTCTTTTAAAAGGATTTAAGGTACATTATTTGAATCCTGACCACGAATTTGTT CATACCCCT TTAGTTCCCTTTGCAGTGGATAAGCTAAAGGCCTCCGTTGGCGTAATGATAACAGCAAGT CACAACCCA AAAATGGATAATGGATATAAAGTATACTATTCCAATGGATGCCAAATCATTCCACCTCAC GATCATGCC ATCTCTGATTCCATTGACGCAAATTTAGAACCATGGGCCAATGTGTGGGATTTCGACGAT GTTCTAAAT AAGGCTCTCAAACAAGGGAAATTGATGTATTCCAGAGAAGAAATGCTGAAGTTATATTTA GAGGAGGTT TCTAAAAATCTGGTAGAAATCAACCCATTAAAGCTTGAAGTAAAAGCCAAACCTTGGTTC GTTTACACT CCAATGCATGGGGTTGGATTTGACATTTTCAGCACCATCGTAAAAAAAACACTGTGCCTG GTAGAAGGT AAGGATTACCTATGTGTTCCTGAACAACAAAATCCAGATCCTTCTTTCCCAACTGTTGGA TTTCCTAAC CCTGAAGAAAAAGGTGCTTTAGACATTGGTATAAACTTGGCTGAAAAACATGACATTGAC TTACTTGTT GCCAACGACCCTGACGCTGATAGATTCTCTGTTGCTGTTAAAGATATGCAGTCAGGCGAA TGGCGACAA CTAACAGGTAACGAAATCGGTTTTCTTTTTGCATTTTATGAATATCAGAAATATAAAAGT ATGGACAAA GAATTTCAGCACGTTCATCCGTTGGCTATGTTAAATTCAACAGTGTCTTCACAAATGATA AAAAAAATG GCAGAAATAGAAGGGTTCCATTATGAGGATACATTAACAGGATTTAAGTGGATCGGAAAT CGTGCCATA CTCTTGGAAAAGAAAGGCTATTACGTTCCTTTTGGATTCGAGGAAGCAATAGGCTACATG TTTCCAGCA ATGGAGCATGATAAGGATGGTATCAGTGCATCCATTGTCTTCTTGCAAGCCTACTGTAAG TGGAAAATA GACCACAATTTGGACCCGCTAAATGTCTTAGAAAATGGCTTCAAAAAATATGGCGTGTTC AAAGAGTAC AATGGCTATTATGTCGTTCCAAATCCAACTGTTACAAAAGATATATTTGACTACATCAGG AATGTCTAC ACTCCTGAGGGCGCGTCATATCCTTCATCTATTGGTGAAGAAATCGAAGTACTTTACTAT CGAGATTTA ACCACTGGTTACCAATCGGATACCATAAATCATAAACCTACTCTACCCGTCGATCCTACA TCACAAATG ATAACAGTATCTGCTAGACCAAGTAACGGTAGTGAGAATGAGCATATCCGCTTCACTATT CGCGGGTCC GGAACAGAACCAAAACTTAAAGTATATATTGAAGCTTGCGCAAATGAAGAACAAAGAGCC TCTTTCTTG GCGAAATTGACTTGGAATGTGCTGAGACGTGAATGGTTTAGACCAGATGAAATGAATATA GTTACAAAA TTTTGA

SEQ ID NO: 150 - Saccharomyces cerevisiae PGM3

MLQGILETVPSDLKDPI SLWFKQDRNPKTIEEVTALCKKSDWNELHKRFDSRIQFGTAGLRSQMQAGFS RMNTLVVIQASQGLATYVRQQFPDNLVAVVGHDHRFHSKEFARATAAAFLLKGFKVHYLN PDHEFVHTP LVPFAVDKLKASVGVMITASHNPKMDNGYKVYYSNGCQI I PPHDHAI SDSI DANLEPWANVWDFDDVLN KALKQGKLMYSREEMLKLYLEEVSKNLVEI PLKLEVKAKPWFVYTPMHGVGFDI FSTIVKKTLCLVEG KDYLCVPEQQNPDPSFPTVGFPNPEEKGALDIGINLAEKHDI DLLVANDPDADRFSVAVKDMQSGEWRQ LTGNEIGFLFAFYEYQKYKSMDKEFQHVHPLAMLNSTVSSQMIKKMAEIEGFHYEDTLTG FKWIGNRAI LLEKKGYYVPFGFEEAIGYMFPAMEHDKDGI SASIVFLQAYCKWKI DHNLDPLNVLENGFKKYGVFKEY NGYYVVPNPTVTKDI FDYIRNVYTPEGASYPSSIGEEIEVLYYRDLTTGYQSDTINHKPTLPVDPTSQM ITVSARPSNGSENEHIRFTIRGSGTEPKLKVYIEACANEEQRASFLAKLTWNVLRREWFR PDEMNIVTK F