Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
IMPROVED ACETATE RESISTANCE IN YEAST BASED ON INTRODUCTION OF A MUTANT HAA1 ALLELE
Document Type and Number:
WIPO Patent Application WO/2014/018450
Kind Code:
A1
Abstract:
Improved haa1 transcriptional regulatory proteins, polynucleotides encoding improved haa1 transcriptional regulatory proteins and vectors and cells thereof are provided, as well as methods for converting a cellulose-containing biomass feedstock to ethanol using improved haa1 transcriptional regulatory proteins and cells expressing heterologous haa1 transcriptional regulatory proteins as disclosed herein.

Inventors:
ZAHN KENNETH (US)
JACOBSON SANDRA (US)
Application Number:
PCT/US2013/051500
Publication Date:
January 30, 2014
Filing Date:
July 22, 2013
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
EDENIQ INC (US)
International Classes:
C07H21/04; C07H21/02; C12N1/14; C12N1/15
Domestic Patent References:
WO2009126795A22009-10-15
WO2011031319A22011-03-17
Foreign References:
US20120009640A12012-01-12
US20110159560A12011-06-30
US20120184020A12012-07-19
US20100311137A12010-12-09
Other References:
DATABASE UNIPROTKB 22 February 2012 (2012-02-22), HOH2A2, retrieved from http://www.uniprot.org/uniprot/HOH2A2.txt?version=4 accession no. OH2A2_9SACH
MIRA ET AL.: "Identification of a DNA-binding site for the transcription factor Haa1, required for Saccharomyces cerevisiae response to acetic acid stress.", NUCLEIC ACIDS RESEARCH, vol. 39, no. 16, 1 September 2011 (2011-09-01), pages 6896 - 6907
CLIFTEN ET AL.: "Finding functional features in Saccharomyces genomes by phylogenetic footprinting", SCIENCE, vol. 301, no. 5629, 4 July 2003 (2003-07-04), pages 71 - 76
FERNANDES ET AL.: "Saccharomyces cerevisiae adaptation to weak acids involves the transcription factor Haa1p and Haalp-regulated genes", BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, vol. 337, no. 1, 11 November 2005 (2005-11-11), pages 95 - 103
MIRA ET AL.: "Genomic Expression Program Involving the Haalp-Regulon in Saccharomyces cerevisiae Response to Acetic Acid.", OMICS A JOURNAL OF INTEGRATIVE BIOLOGY, vol. 14, no. 5, October 2010 (2010-10-01), pages 587 - 601
Attorney, Agent or Firm:
BORING, Landin F. et al. (Two Embarcadero Center 8th Floo, San Francisco California, US)
Download PDF:
Claims:
WHAT IS CLAIMED IS:

1. An isolated polynucleotide encoding a transcription factor polypeptide comprising an amino acid sequence that:

a. is at least 80% identical to amino acids 1 -554 of SEQ ID NO:2; and b. comprises at least one amino acid difference compared to SEQ ID O:2. selected from the group consisting of F440Y, P518S, D508Y, N510K, A527V, 1591V, H605Y, S622F, S639F, and S673L.

2. The isolated polynucleotide of claim 1 , wherein the transcription factor polypeptide binds to SEQ ID NO: 1 6.

3. The isolated polynucleotide of claim 1 , wherein the amino acid sequence is at least 80% identical to SEQ ID NO:2.

4. The isolated polynucleotide of claim 3, wherein the amino acid sequence comprises the following amino acid differences compared to SEQ ID NO:2:

F440Y, P5 I 8S, 159 IV, H605Y, S622F, S639F, and S673L.

5. The isolated polynucleotide of claim 1 , wherein the polypeptide has a C-termina! deletion resulting in a polypeptide having fewer than 600 (e.g., fewer than 590, 580, 570, 560, 550) amino acids.

6. The isolated polynucleotide of claim 1 or 5, wherein the amino acid sequence comprises the following amino acid differences compared to SEQ ID NO:2:

D508Y, N510K, and A527V.

7. The isolated polynucleotide of claim I , wherein the amino acid sequence comprises at least two (e.g., 2, 3, 4, 5, 6, 7, or more) amino acid differences compared to SEQ ID NO:2 selected from the group consisting of F440Y, P518S, D508Y, N510K, A527V, 1591 V, H605Y, S622F, S639F, and S673L.

8. An expression cassette comprising a heterologous promoter operably linked to the polynucleotide of any of claims 1 -7.

9. The expression cassette of claim 8, wherein the promo ter is heterologous to the polynucleotide.

10. A yeast cell comprising the expression ca ssette of claim 8 or 9, wherein the yeast cell ferments sugar in the presence of acetate with increased kinetics or mcreased fermentation-product titer than a control yeast cell lacking the expression cassette.

1 1. The yeast cell of claim 10, wherein the yeast cell is a Saccharomyces cervisiae or Pichia stipitis cell.

12. The yeast cell of claim 1 1 , wherein the yeast cell lacks a wild type allele of ΙΙΛΛ1.

13. The yeast cell of claim 1 1 comprising a genomically- integrated mutant haul allele replacing the wild type HAA1 allele,

14. The yeast ceil of claim 1 1 comprising a mutant haul allele on a heterologous plasmid, wherein the cell also comprises a genomic wild type allele οΐΗΑΑΙ,

15. The yeast ceil of any of claims 10- 14 exhibiting fermentation with increased kinetics or increased fermentation-product titer compared to a control yeast strain in the presence of at least 0.5% w/v acetate.

16. The yeast cell of any of claims 10- 14 exhibiting fermentation with increased kinetics or mcreased fermentation-product titer compared to a control yeast strain in the presence of at least 0.8% w/v acetate.

17. The yeast ceil of any of claims 10-14 exhibiting fermentation with increased kinetics or increased fermentation-product titer compared to a control strain in the presence of at least 0.5% acetic acid.

18. The yeast ceil of any of claims 10-14 exhibiting fermentation with mcreased kinetics or increased fennentation-product titer compared to a control yeast strain in the presence of at least 0.8% w/v acetic acid.

19. A method of making ethanoi from sugar, the method comprising contacting the yeast of any of claims 10- 18 to a solution comprising sugar under conditions to allow for fermentation of the sugar into ethanoi; and recovering the ethanoi.

2.0. The method of claim 19, wherein the solution comprises sufficient furfural to inhibit fermentation of a control yeast lacking the expression cassette.

23. The meihod of claim 39, wherein the sugars are generated by cellulosic enzymes.

22. An aqueous mixture com rising sugars, acetic acid and/or acetate and the yeast cell of any of claims 10-18.

23. The mixture of claim 22, wherein the mixture comprises at least 0.5% or 0.8% w/v acetic acid or acetate.

24. The mixture of claim 22 or 23, wherein the mixture comprises ceiluiose-containina biomass.

Description:
IMPROVED ACETATE RESISTANCE IN YEAST BASED ON INTRODUCTION

OF A MUTANT HAA1 ALLELE

CROSS-REFERENCE TO RELATED PATE T APPLICATIONS

[0001] The present patent application claims benefit of priority to US Patent Application No. 61/674,676, filed July 23, 2012, which is incorporated by reference herein in its entirety.

BACKGROUND OF THE INVENTION

[0002] Acetate inhibition is a well-recognized impediment to the efficient fermentation of most types of biomass, particularly those with high bemicellulose contents. The release of acetic acid during the acid, heat and pressure-induced breakdown of hemicellulose is known to produce levels of acetic acid which can exceed 1%, a level which is highly inhibitory (in terms of growth, viability and/or performance of desired metabolic function e.g. fermentation of sugars) to most microorganisms. Solutions to the problem include efforts to block the initial release by altering the pretreatment conditions, removal of acetate by chemical or physical methods and genetic improvement of the fermentation organisms to utilize or better tolerate acetate,

BRIEF SUMMARY OF THE INVENTION

[0003] In some embodiments, isolated polynucleotides are provided. In some

embodiments, the isolated polynucleotide encodes a transcription factor polypeptide comprising an amino acid sequence that: a. is substantially identical (e.g., at least 60, 70, 80, 90, 95, 97, 98, or 99%) to amino acids 1-554 of SEQ ID NO:2; and b. comprises at least one amino acid difference compared to SEQ ID NO:2 selected from the group consisting of F44QY, P518S, D508Y, N51QK, A527V, 1591V, H605Y, S622F, S639F, and S673L.

[0004] In some embodiments, the transcription factor polypeptide binds to SEQ ID NO: 16.

[0005] In some embodiments, the amino acid sequence is at least 80% identical to SEQ ID

NO:2. In some embodiments, the amino acid sequence comprises the following amino acid differences compared to SEQ ID NO:2: F440Y, P518S, T591V, Η605Ύ, S622F, S639F, and S673L.

[0006] Tn some embodiments, the polypeptide has fewer than 600 (e.g., fewer than 590, 580, 570, 560, 550) amino acids.

[0007] In some embodiments, the amino acid sequence comprises the following amino acid differences compared to SEQ ID NO:2: D508Y, N51 OK, and A527V.

[0008] In some embodiments, the amino acid sequence comprises at least two (e.g., 2, 3, 4, 5, 6, 7, or more) amino acid differences compared to SEQ ID NO:2 selected from the group consisting of F440Y, P518S, D508Y, N510K, A527V, 159 IV, H605Y, S622F, S639F, and S673L.

[0009] In some embodiments, an expression cassette is provided. In some embodiments, the expression cassette comprises a heterologous promoter operably linked to the polynucleotide as described above or elsewhere herein.

[0010] In some embodiments, the promoter is heterologous to the polynucleotide.

[0011] In some embodiments, a yeast cell, or a culture comprising the yeast cell, is provided. In some embodiments, the yeast cell comprises an expression cassette as described above or elsewhere herein, wherein the yeast cell ferments sugar in the presence of acetate better than a control yeast cell lacking the expression cassette.

[0012] In some embodiments, the yeast cell is a Saccharom ces cervisiae or Pichia stipitis cell. In some embodiments, the yeast cell lacks a wild type allele oiHAAl. In some embodiments, the yeast cell comprises a genomically-integrated mutant haul allele replacing the wild type HAA1 allele. In some embodiments, the yeast cell comprises a mutant haal allele on a heterologous plasmid, wherein the cell also comprises a genomic wild type allele of HAA L

[0013] In some embodiments, the yeast cell exhibits better fermentation compared to a control yeast strain in the presence of greater than 0.5% w/v acetate. In some embodiments, the yeast cell exhibits better fermentation compared to a control yeast strain in the presence of greater than 0.8% w/v acetate. In some embodiments, the yeast cell exhibits better fermentation compared to a control strain in the presence of greater than 0.5% w/v acetic acid. In some embodiments, the yeast cell exhibits better fermentation compared to a control y east strain in the presence of greater than 0.8% w/v acetic acid. In some embodiments, the y easi cell exhibits better iermentaiion compared to a control y easi strain in the presence of at least 0.5% w/v acetate or at least 0.8% w/v acetate. In some embodiments, the yeast cell exhibits better fermentation compared to a control strain in the presence of at least 0.5% w/v acetic acid or at least 0.8% w/v acetic acid. In some embodiments, the yeast cell exhibits better fermentation compared to a control yeast strain in the presence of about 0.5% w/v to about 2.0% w/v acetate, or about 0.5% w/v to about 1.0% w/v acetate, or about 0.5% to about 0.8% acetate. In some embodiments, the yeast ceil exhibits better fermentation compared to a control yeast strain in the presence of at least 0.5%, 0,6%, 0.7%, 0.8%, 0.9%, 1.0%, or 1.5% acetate, or about 2,0% w/v acetate. Thus, in some embodiments, the yeast cell exhibits better fermentation compared to a control yeast strain in the presence of at least 0.5% to at least 1.0% w/v acetic acid, at least 0.8% to at least 1 ,5% w/v acetic acid, or at least 1.0% to about 2.0% w/v acetate. In some embodiments, the yeast cell exhibits better fermentation compared to a control yeast strain in the presence of about 0.5% w/v to about 2.0% w/v acetic acid, or about 0.5% w/v to about 1.0% w/v acetic acid, or about 0.5% to about 0.8% acetic acid. In some embodiments, the yeast cell exhibits better fermentation compared to a control yeast strain in the presence of at least 0.5%, 0.6%, 0,7%, 0.8%, 0.9%, 1.0%, or 1.5% acetic acid, or about 2.0% w/v acetic acid. Thus, in some embodiments, the yeast cell exhibits better fermentation compared to a control yeast strain in the presence of at least 0.5% to at least 1.0% w/v acetic acid, at least 0.8% to at least 1 ,5% w/v acetic acid, or at least 1 ,0% to about 2.0%> w/v acetic acid.

[0014] In some embodiments, better fermentation is caused by expression of the improved Haal proteins resulting in increased growth of the microorganism relative to a control or reference microorganism in a specified time under specified conditions.

[0015] In some embodiments, better fermentation is caused by expression of the improved Haal proteins resultmg in increased rate (e.g., kinetics) of formation of fermentation product or increased titer of fermentation product by a microorganism relative to a control or reference microorganism in a specified time under specified conditions.

[0016] In some embodiments, better fermen tation is caused by expression of the improved Haal proteins resulting in increased tolerance of a microorganism to higher acetic acid levels relative to a control or reference microorganism in a specified time under specified conditions.

[0017] In some embodiments, better fermentation is caused by expression of the improved Haal proteins resulting in increased tolerance of a microorganism to higher acetate levels relative to a control or reference microorganism in a specified time under specified conditions.

18] Tn some embodiments, better fermentation is caused by expression of the improved Haal proteins resulting in decreased concentration of fermentable carbohydrates during fermentation relative to a control or reference microorganism in a specified time under specified conditions.

In some embodiments, methods of making ethanol from sugar are provided. In some embodiments, the method comprises contacting the yeast as described above or elsewhere herein to a solution comprising sugar under conditions to allow for fermentation of the sugar into ethanol; and recovering the ethanol.

In some embodiments, the solution comprises sufficient furfural to inhibit fermentation of a control yeast lacking the expression cassette. In some embodiments, the sugars are generated by cellulosic enzymes.

I] Also provided are aqueous mixtures comprising sugars, acetic acid and/or acetate and the yeast cell as described above or elsewhere herein. In some embodiments, the mixture comprises at least 0,5% or 0.8% w/v acetic acid or acetate. In some embodiments, the mixture comprises cellulose-containing biomass.

[0022] Also provided herein are engineered microorganisms with enhanced tolerance to acetic acid or its cognate base acetate, containing an improved Haal protein. The improved Haal protein is encoded by an HAA1 open reading frame containing one or more mutations conferring its improved characteristic. In some embodiments, microorganisms containing the improved haal gene either alone or in combination with the wild type copy of HAA1 have an increased fermentation of sugars to a desired product relative to the natural strain, in some embodiments, microorganisms containing the improved haal gene have increased fermentation in cellulosic sugar material derived from pretreaied and saccharified biomass. In some embodiments, the desired fermentation product is ethanol.

Provided herein, inter alia, are improved Haa l proteins substantially identical (e.g., at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%) to SEQ ID NO:2 and having improved acetate resistance compared to a control Haal protein comprising SEQ ID NO:2. In some embodiments, the improved Haal proteins are substantially identical (e.g., at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%) to SEQ ID NO:3 and have Haal protein activity conferring increased acetate tolerance compared to a control Haal protein comprising SEQ ID NO:2. In some embodiments, the improved Haal proteins are substantially identical (e.g., at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%) to SEQ ID NO:4 and have Haal protein activity conferring increased acetate tolerance compared to a control haal protein comprising SEQ ID NO:2. In some embodiments, the improved Haal proteins are substantially identical (e.g., at feast 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%) to any of SEQ ID NO:5 through SEQ ID NO: 15 and have improved Haal protein activity conferring increased acetate tolerance compared to a control Haal protein comprising SEQ ID NO:2.

[0024] In some embodiments, the improved Haal protein comprises one or more mutations at a position corresponding to a position selected from the group consisting of (a) position 440 of SEQ ID NO:2, having an amino acid other than F; (b) position 518 of SEQ ID NO:2 having an amino acid other than P; (c) position 508 of SEQ ID NO:2 having an amino acid other than D; (d) position 510 of SEQ ID NO:2 having an amino acid other than N; (e) position 527 of SEQ ID NO:2 having an amino acid other than A; (f) position 591 of SEQ ID NO:2 having an amino acid other than I; (g) position 605 of SEQ ID NO:2 having an amino acid other than H; (h) position 622 of SEQ ID NO:2 having an amino acid other than S; (i) position 639 of SEQ ID MO:2 having an amino acid other than S; j) position 673 of SEQ ID NO:2 having an amino acid other than S and (k) position 554 of SEQ ID NO:2 having an amino acid other than N.

[0025] In some embodiments, the improved Haal protein is encoded by a mutant haal gene designated haal mut2 and contains seven mutations in the HAAl coding region as put forth in SEQ ID NO:3. In some embodiments, haal mut2 gene is expressed from a plasmid in a yeast cell containing a wild type HAAl allele. In some embodiments, the haal mut.2 gene is integrated into the HAAl genomic focus and replaces the wild type gene function.

[0026] In some embodiments, the improved Haal protein is expressed in the presence of a genomic wild type copy of HAA 1 and is encoded by a mutant haal gene designated haal mut40 expressed from a heterologous plasmid construct that contains four mutations plus a C-ierminal haal truncation. This truncated protein is caused by a nucleotide deletion and projected frameshift in the mutant haal allele, causing the translation product to go out of frame and become truncated after amino acid 554.

[0027] In another aspect, provided herein are polynucleotides comprising a nucleic acid encoding the improved Haal protein provided herein. In some embodiments, the polynucleotide comprises an expression cassette comprising a heterologous promoter Gperably linked to the nucleic acid. Also provided herein are vectors comprising the polynucleotides provided herein, and isolated cells or culture of ceils comprising the heterologous polynucleotides provided herein. In some embodiments, the cell is a yeast cell. In some embodiments, the cell is a Saccharomyces cerevisiae.

[0028] In another aspect, provided herein are methods for growing cells in the presence of medium containing increasing amounts of acetate and monitoring cell growth. In some embodiments, the cells are yeast cells. In some embodiments, the methods comprise growing the ceils expressing an improved Haal protein as described herein (e.g., comprising one or more mutaiions at a position corresponding io a position selected from the group consisting of (a) position 440 of SEQ ID Q:2, having an amino acid other than F; (b) position 518 of SEQ ID NO:2 having an amino acid other than P; (c) position 508 of SEQ ID NO:2 having an amino acid other than D; (d) position 510 of SEQ ID NO:2 having an amino acid other than N; (e) position 527 of SEQ ID NO:2 having an amino acid other than A; (f) position 591 of SEQ ID NO:2 having an amino acid other than I; (g) position 605 of SEQ ID NO:2 having an amino acid other than H; (h) position 622 of SEQ ID NO:2 having an amino acid other than S; (i) position 639 of SEQ ID NQ:2 having an amino acid other than S; (j) position 673 of SEQ ID O:2 having an amino acid other than S and (k) position 554 of SEQ ID NO:2 having an amino acid other than N).

[0029] In some embodiments, the ceils ferment carbohydrates in the presence of acetic acid levels higher than tolerable for the natural strain. In some embodiments, the cells ferment carbohydrates in the presence of acetate levels toxic for the natural parental yeast strain. In some embodiments the cells are yeast cells.

[0030] In some embodiments, the cells ferment carbohydrates in a cellulosic sugar solution derived from pretreated bioniass that has been saccharified. In some embodiments, this saccharification has been performed by cellulosic enzyme mixtures. In some embodiments, this saccharification has been performed by cells expressing cellulosic enzymes. In some embodiments, this saccharification has been performed by cells expressing cellulosic enzymes and the improved haal protein.

[0031] In some embodiments, the method comprises treating the pretreated biomass with a ceil or culture of cells that express the improved Haal protein. In some embodiments, the cell is a Saccharomyces cerevisiae.

[0032] In some embodiments, the cellulose-containing biomass feedstock is a woody material. In some embodiments, the woody material is cellulosic or lignoceUulosic plant material selected from the group consisting of orchard primings, chaparral, mill waste, urban wood waste, municipal waste, logging waste, forest thinnings, short-rotation woody crops, and industrial waste. In some embodiments, the cellulose-containing biomass feedstock is a non-woody material. In some embodiments, the non-woody material is selected from the group consisting of gramineous agricultural residue, wheat straw, oat straw, rice straw, barley straw, rye straw, flax straw, sugar cane, bagasse, com stover, corn stalks, com cobs, com husks, prairie grass, switchgrass, gamagrass, foxtail, sugar beet pulp, citrus fruit pulp, seed hulls, cellulosic animal wastes, lawn clippings, seaweed, bagasse, energy cane, and giant reed. In some embodiments, the cellulose-containing biomass feedstock is corn grain, barley grain, milo grain, wheat grain or rice grain.

BRIEF DESCRIPTION OF THE DRAWINGS

[0033] Figure 1. Schematic representation of the experimental process leading to the isolation of novel haal alleles

[0034] The upper left shows PCR synthesis of the HAAlgeae from yeast DNA and cloning in the shuttle plasmid vector p416 Tef. The upper middle shows mutagenic PCR of the HAA1 gene generating many allelic variants to create a library of mutants and their transformation into yeast. The lower right portion shows the process of screening these mutants, picking out the most acetate resistant variants and determining their D A sequence. Other relevant characteristics of the mutants are also studied such as growth rate, fermentation ability temperature resistance and dependence of the acetate resistant phenotype on the resident plasmid.

[0035] Figure 2. Acetate gradient plates for screening haal mutant libraries for acetate resistant variants

[0036] Three acetate gradient plates are shown. From left to right, a control plating of the unmutagemzed HAA1 gene cloned in the p416 Tef vector, high density plating of the mutagenized haal gene "library" cloned in the p416 Tef vector and lower density plating of the same mutagenized library as shown in the middle panel. The arrow indicates the direction of increasing concentration of acetate in the plate. Colonies with higher tolerance to increased le vels of acetate were isolated from the leading edge of the ceil plating area by manually picking individual colonies and re-streaking for single clonal isolates on a fresh agar-substrate Petri dish.

[0037] Figure 3. Conversion of Acetate Concentration from mM to % w/ ' V [0038] The range shown is 0 - 300 mM or 0 - 1.5% w/v.

[0039] Figure 4. Screening for acetate resistance among candidates containing plasmid- borne haal alleles.

[0040] Colony growth plating assay of candidates containing plasmids bearing different haal alleles is shown. From left to right, an increasing amount of acetate is applied in the plates which all contain a synthetic defined media of CSM (complete supplement mixture) with glucose and lacking uracil. Medium lacking uracil is used to provide selection for the URA3 marked plasmid. Acetate concentrations are shown at the bottom of the panel; 0 mM, 100 mM, 120 mM, 130 mM, 140 mM. Candidate strains are spotted in ten-fold dilutions from top-to-bottom in each set of plates. Two sets of equivalent dilutions are spotted in each plate as duplicate platings to evaluate phenotypic reproducibility. Strains plated in the experiment are indicated a number at the top of each dilution series and by the legend on the bottom; 1 and 8 are plasmid vector and wild type HA A 1 plasmid clone controls, respectively, 2-7 are independent candidates containing different cloned haal alleles. Plates were incubated for eight days to produce the result shown.

[0041] Figure 5. Sequence illustration of six site-directed acetate resistant mutants in the carboxyl terminus of the HAA1 gene (HAAl gene sequence from Wild type (WT) S288C = SEQ ID NO:l ; WT TR3 = SEQ ID NO:2; mut2 = SEQ ID NO:3; mut24 = SEQ ID NO: 17; mut33 = SEQ ID NO: 18; mut36 = SEQ ID NO: 19; mut40 = SEQ ID NO:4; mut5 = SEQ ID NO:20; consensus = SEQ ID NO:21). Figure 5A: alignment of Haal partial polypeptide comprising amino acids 1 -240; Figure 5B: alignment of Haal partial polypeptide comprising amino acids 241-480; Figure 5C: alignment of Haal partial polypeptide comprising amino acids 481 -694.

[0042] Protein sequences of six acetate resistant mutants culled from the original set of 40 isolates recovered from an acid gradient plate are shown. The DNA sequence of each mutant was determined and conceptual translation of the DNA sequence was used to obtain the amino acid sequence. Alignment of all of the amino acid sequences was carried out using the Clustal program. The entire amino acid sequence of the Haal protein is shown. The top line displays the sequence of the non-mutant Haal protein determined from yeast strain S288C, in which the genome sequence has been entirely determined. Belo this the sequence of the wild type HAA l gene from strain TR3, the strain employed in these studies, is presented. Numbering indicates the amino acid position and blocks of black lettering on white background indicate entirely conserved sequences. In addition beyond position 553, blocks of white lettering on black background show entirely conserved amino acids in all sequences. Dashed lines indicate the extent of deletions in two of the sequences. Grey shading of the amino acid coding letter indicates amino acid positions that have amino acid substitutions which are not conserved. Black shading of the amino acid coding indicates amino acid positions that have amino acid substitutions which are conserved.

[0043] Figure 6. Summary of amino acid changes in the two of the most highly acetate resistant haal mutants

[0044] Sequence differences of haal mutants 2 and 40 from the wild type sequence are indicated. Amino acid abbreviations are shown below:

F- phenylalanine Y- tyrosine P- proline S- serine 1- isoleucine

V- valine H- histidine L- leucine D- glutamic acid

K- lysine A- alanine

FS- indicates a frameshift mutation

Trunc- indicates the point of premature protein termination

[0045] Figure 7. Schematic representation of the procedure used to integrate haal alleles into the chromosome of Saccharomyces cerevisiae strain TR3

[0046] In the top portion (PCR 1), three PGR fragments were synthesized independently; from left, a -4000 bp piece (I) to provide homology to the region upstream of HAAI, an ~3 kB piece containing the haal gene including heterologous Tef promoter and eye I terminator

(II) , a 1.5 kB piece encompassing the URA3 gene and two identical flanking swo segments

(III) . In PCR 2, fragments UTI are recombined in vitro using overlap extension PCR (SOE) based on homology between the primers at the 3' end of the haal upstream and the 5" end of the p ' T ' ef promoter piece, in PCR. 3, the recombined haal upsrream/pTef-haal - eye! fragment is recombined in vitro with the downstream ura3 piece to produce a 5.5 kB fragment bearing a selectable marker. The 5.5 kB fragment was then purified and used for transformation of S. cerevisiae strain TR3. Two possible crossover sites are shown in the lower portion; the left side crossover results in insertion of the Tef promoter into the chromosome whereas the right side crossover residts in recombination within the HAAI open reading frame and retains the endogenous HAAI promoter.

[0047] Figure 8. Screening for acetate resistance among candidates with chromosomally inserted haal alleles. [0048] Colony growth plating assay of candidates containing chromosomal insertions of haal mut2 or mut40 alleles is shown. From left to right, an increasing amount of acetate is applied in the plates which all contain a base media of CSM glucose. Acetate concentrations are shown at the top of the panel; 100 mM, 120 mM, 130 mM, 140 mM. Candidate strains are spotted in ten-fold dilutions from top-to-bottom in each set of plates. Two sets of equivalent dilutions are spotted in each plate. Strains plated in the experiment are indicated with a number at the top of each dilution series and by the legend on the right side; 1 -5 are independent candidates containing haal mull allele insertions, 6 and Γ-ό' are independent clones containing haal mut40 allele insertions. Plates were incubated for four days to produce the result shown.

[0049] Figure 9. Screening for resistance to biomass among candidates containing plasmid-borae haal alleles.

[0050] Colony growth plating assay of candidates containing plasmids bearing different Aaai alleles is shown. Every plate has one half volume substituted by either H 2 0 (the leftmost plate) or a. saccharified pretreaied biomass sample prepared by a different pretreatment (the right five plates). All plates were prepared with CSM glucose media lacking uracil. From left to right, 50% of a 0%, 40%, 60% or 80% "recycle HPHT" mixture respectively was added to the plates. "Recycle HPHT" is defined as biomass that has been pretreated by EdeniQ proprietary thermomechanical means, saccharified by cellulosic enzymes, glucose fermented into ethanol by EdeniQ proprietary yeast, centrifuged to recover the supernatant used for diluting fresh biomass to a particular % solids and subjecting it to the same form of pretreatment and saccharification. Candidate strains are spotted in ten- fold dilutions from top-to-bottom in each set of plates. Two sets of equivalent dilutions are spotted in each plate. Strains plated in the experiment are indicated a number at the top of each dilution series and by the legend on the bottom. 1 and 2 are plasmid vector and wild type HAAI plasmid clone controls, 3-8 are independent candidates containing different cloned haal alleles. Plates were incubated for eight days to produce the result shown.

[0051] Figure 10. Screening for furfural resistance and/or furfural plus acetate resistance among candidates containing plasmid-borne haal alleles.

[0052] Spot testing of candidates containing plasmids bearing different Aaaialieles is shown.. From left to right, an increasing amount of furfural was added into the solid medium plates. All plates contain a base media of CSM glucose lacking uracil or the same media plus 100 mM acetate. The plate on the lower left has only 100 mM acetate added. In all other plates furfural concentrations are shown at the top and bottom of the panel; 0.4 g L, 0.6 g/L, 0.8 g/L and 1.0 g/L. In addition to varying furfural, the upper set of plates contains acetaie at 100 mM. Candidaie strains are spotted in ten- fold dilutions from top to boitom in each set of plates. Strains plated in the experiment are indicated as a number at the top of each dilution series and by the legend on the right. 1 and 2 are plasmid vector and wild type Haal protein plasmid clone controls, 3-8 are independent candidates containing different cloned haal alleles. Plates were incubated for eight lays to produce the result shown.

[0053] Figure 1 1. Fermentation of Sugar Cane Bagasse Biomass by the wild type strain TR3 or the TR3 haal mut2 mutant strain improved for acetate resistance.

[0054] Graphical representation of the levels of key metabolites present in pretreated saccharified sugarcane bagasse during biomass fermentation by wild type control and the improved haalmui2 ' yeast strains. Metabolite levels of glucose, glycerol, ethanol and acetate are shown (% w/v) on the Y-axis at T=0 (control, control (0.8)) or after 24 hours of fermentation by strains TR3, TR3 haal, TR3(0.8) , TR3 haal(().8) to the right along the X- axis where (0.8) indicates addition of acetate to greater than 0.8% w/v and haal indicates the chromosomaliy integrated copy of the haal mut2 allele. Note that two control conditions are used, differing only by the addition of acetate to greater than 0.8% w v. All determinations were by HPLC measurement. Control samples were run in duplicate; experimental samples were ran in triplicate.

DEFINITIONS

[0055] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. Although essentially any methods and materials similar to those described herein can be used in the practice or testing of the present invention, only exemplary methods and materials are described. For purposes of the present invention, the following terms are defined below.

[0056] The terms "a," "an," and "the" include plural referents, unless the context clearly indicates otherwise.

[0057] The terms "polypeptide," "peptide" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical mimetic of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers and non- naturally occurring amino acid polymers.

[0058] The term "isolated," when applied to a protein or nucleic acid, denotes that the protein or nucleic acid, respectively, is essentially free of other cellular components with which it is associated in the natural state. It is preferably in a substantially homogeneous state, and for example, can be in either a dry or aqueous solution. Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacryfamide gel electrophoresis or high performance liquid chromatography .

[0059] The term "amino acid" refers to naturally occurring and synthetic amino acids, as well as amino acid analogs and amino acid mimetics that function in a manner similar to the naturally occurring amino acids. Naturally occurring amino acids are those encoded by the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, γ- carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical structure as a naturally occurring amino acid. Naturally encoded amino acids are the 20 common amino acids (alanine, arginine, asparagine, aspartic acid, cysteine, glutamine, glutamic acid, glycine, hisiidine, isoleucine, leucine, lysine, methionine, phenylalanine, proline, serine, threonine, tryptophan, tyrosine, and valine) and pyrrolysine and

selenocysteine.

[0060] "Conservatively modified variants" applies to both amino acid and nucleic acid sequences. With respect to particular nucleic acid sequences, conservatively modified variants refers to those nucleic acids which encode identical or essentially identical amino acid sequences, or where the nucleic acid does not encode an amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large number of functionally identical nucleic acids encode any given protein. For instance, the codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded polypeptide. Such nucleic acid variations are "silent variations," which are one species of conservatively modified variations. Every nucleic acid sequence herein which encodes a polypeptide also describes every possible silent variation of the nucleic acid. One of skill will recognize that each codon in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, which is ordinarily the only codon for tiyptophan) can be modified to yield a functionally identical molecule. Accordingly, each silent variation of a nucleic acid that encodes a polypeptide is implicit in each described sequence.

[0061] As to amino acid sequences, one of skill will recognize that indi vidual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which afters, adds or deletes a single amino acid or a small percentage of amino acids in the encoded sequence is a "conservatively modified variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles of the invention.

[0062] The following eight groups each contain amino acids that are conservative substitutions for one another:

1) Alanine (A), Glycine (G);

2) Aspartic acid (D), Glutamic acid (E);

3) Asparagine (N), Glutamine (Q);

4) Afginine (R), Lysine (K);

5) Tsoleucine (I), Leucine (L), Methionine (M), Valine (V);

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W);

7) Serine (S), Threonine (T); and

8) Cysteine (C), Methionine (M) (see, e.g., Creighton, Proteins ( 1984)).

[0063] "Percentage of sequence identity" is determined by comparing two optimally aligned sequences over a comparison window, wherein the portion of the poly nucleotide sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (e.g., a polypeptide of the invention), which does not comprise additions or deletions, for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleic acid ba se or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity .

[0064] The terms "identical" or percent "identity," in the context of two or more nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that are the same. The term "substantially identical" refers to two or more sequences or subsequences that have a specified percentage of amino acid residues or nucleotides that are the same (i.e., at least about 40% identity, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and aligned for maximum correspondence over a comparison window or designated region) as measured using a BLAST or BLAST 2.0 sequence comparison algorithms with default parameters described below, or by manual alignment and visual inspection (see e.g., NCBl web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). The definition includes sequences that have deletions and/or additions, as well as those that have substitutions. As described below, algorithms can account for gaps and the like. When not specified, identity or substantial identity is determined over the entire length of the reference sequence. When specified, identity can be determined over a region that is at least about 10 amino acids or nucleotides in length, at least about 25 amino acids or nucleotides in length, or over a region that is 50- 100 amino acids or nucleotides in length.

[0065] For sequence comparison, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters.

[0066] A "comparison window", as used herein, includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a sequence may ¬ be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith and Waterman (1970) Adv. Appi. Math. 2:482c, by the homology alignment algorithm of Needleman and Wunscb. (1970) J. Mol. Biol. 48:443, by the search for similarity method of Pearson and Lipman (1988) Proc. Nat l. Acad. Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wl), or by manual alignment and visual inspection (see, e.g., Ausubei et al., Current Protocols in Molecular Biology (1995 supplement)).

[0067] An exemplary algorithm suitable for determining percent sequence identity and sequence similarity is BLAST 2.0 algorithm, which is described in Altschul et al. (1990) J. Mot Biol. 215:403-410, respectively. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information. This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned w ith a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them. The word hits are extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid sequences, a scoring matrix is used to calculate the cumulati v e score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantify X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLA8TN program (for nucleotide sequences) uses as defaults a wordlength (W) of 1 1, an expectation (E) or 10, M=5, N=-4 and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. Acad. Set USA 89: 10915) alignments (B) of 50, expectation (E) of 10, M-5, N=-4, and a comparison of both strands.

[0068] The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin and Altschul (1993) Proc. Nail. Acad. Sci. USA 90:5873- 5787). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more preferably less than about 0.01 , and most preferably Jess than about 0.001. [0069] To determine which amino acid of a first protein "corresponds" to the position of an amino acid in a second protein, the amino acid sequences of the two proteins are optimally aligned (e.g., using a BLAST algorithm). This is particularly useful, for example, where two proteins have high homology but where one protein contains one or more insertions or deletions relative to the second protein. In such cases, for example, position 57 of a first protein may align with position 51 in a second protein when the two proteins are optimally aligned. Thus position 51 of the second protein "corresponds" to position 57 of the first protein.

[0070] A "heterologous sequence," "heterologous polypeptide," or a "heterologous nucleic acid", as used herein, is one that originates from a source foreign to the particular host cell, or, if from the same source, is modified from its original form. Thus, a heterologous expression cassette in a ceil is an expression cassette that is not endogenous to the particular host cell, for example by being linked to nucleotide sequences from an expression vector rather than chromosomal DNA or by being linlced to a heterologous promoter or by being linlced to a reporter gene, etc.

[0071] "Expression cassette" refers to a polynucleotide comprising a promoter or other regulatory sequence operably linked to a sequence encoding a protein.

[0052] "Acetic Acid" also known as ethanoic acid is an organic compound with the chemical formula C¾COOH. The hydrogen center in the carboxyi group (-COOH) in carboxylic acids such as acetic acid can separate from the molecule by ionizatiomCiliCCbH → CH 3 CO 2 " + H + Because of this release of the proton (ΤΓ), acetic acid has acidic character. Acetic acid is a weak monoprotic acid. In aqueous solution, it has a pKa value of 4.75, Its conjugate base is acetate (CH 3 COO ).

[0072] "Acetate" is a derivative of acetic acid which can occur in the form of salts or esters. The acetate anion [CH 3 COO] , is one of the carboxylate family. It is the conjugate base of acetic acid. Above a pH of 5.5, acetic acid converts to acetate: CH3COOH ^ CH3COO + H . Many acetate salts are ionic consistent with their tendency to dissolve in water. Acetate is a common anion in biology. It is mainly utilized by organisms in the form of acetyl coenzyme A.

[0073] "Furfural" is defined as a heterocyclic aldehyde with a chemical formula of OC 4 H 3 CHO. It is formed from plant material containing the polysaccharide hemicellulose when reacted with dilute acid and/or heat whereby hemicellulose undergoes hydrolysis to yield xylose which can be subjected to a dehydration reaction forming furfural. [0074] "Pretreatrnent" is defined as a physical means of reducing the recalcitrance of lignocellulosic biomass such that the plant ceil walls are less resistant to decoristruction.

[0075] "Biomass" is defined as biological material from living or recently living organisms.

[0076] "Lignocellulosic biomass" is defined as biological material of plants that is comprised in part of lignin, hemiceiluiose and cellulose.

[0077] The term "cellulose-containing biomass feedstock" is defined herein to mean any ceflufosic or lignocellulosic plant material, waste material, including but not limited to, leaves and stalks of both woody and non-woody plants. The term "w oody" is used herein both in the botanical sense to mean "comprising wood"; thai is, composed of extensive xylem tissue as found in trees and shrubs, and also in the sense of "being woodlike". Accordingly, "nonwoody" refers to materials lacking these characteristics. Cellulose-containing biomass feedstock includes, but is not limited to, crops such as starch crops (e.g., corn, wheat, rice or barley), sugar crops (e.g., sugarcane, energy cane or sugarbeet), forage crops (e.g., grasses, alfalfa, or clover), and oilseed crops (e.g., soybean, sunflower, or safflower); wood products such as trees, shrubs, and wood residues (e.g., sawdust, bark or the like from forest clearings and mills); waste products such as municipal solid waste (MSW; e.g., paper, food and yard wastes or wood), and process waste; and aquatic plants such as algae, water weed, water hyacinth, or reed and rushes.

[0078] In some embodiments, cellulose-containing biomass feedstock from woody plants can include orchard prunings, chaparral, mill waste (such as bark, chips, shavings, sawdust, and the like), urban wood waste (such as discarded lumber, wood pallets, crates, tree and brush trimmings, etc.), municipal waste (such as newspaper and discarded grocery produce), logging waste and forest thinnings (tree tops, limbs and cull material), short-rotation woody crops such as poplar and Cottonwood, and industrial waste (such as wood pulp sludge).

[0079] The preponderance of biomass from non-woody plants in agriculture is derived from monocotyledonous plants, and especially grassy species belonging to the family Gramineae. Of primary interest are gramineous agricultural residues; that is, the portion of grain-bearing plants that remain after harvesting the seed, illustrative of such residues, without limitation thereto, are wheat straw, oat straw, rice straw, barley straw, rye straw, flax straw, sugar cane, corn stover, corn stalks, corn cobs, corn husks, and the like. Also included within this definition are grasses not conventionally cultivated for agricultural purposes, such as prairie grasses (e.g. big bluestem, little bluestem, Indian grass), switcbgrass, gamagrass, and foxtail. In some embodiments, the agricultural biomass comprises com kernel, barley kernel, milo kernel, wheat kernel or rice kernel.

)80] Byproducts of agriculture industrial process can have high amounts of acetic acid, furfural and 5-HMF that can inhibit growth and/or fermentation of microorganisms.

Other agricultural byproducts in the category of biomass include waste streams components from commercial processing of crop materials (such as sugar beet pulp, citrus fruit pulp, sugarcane bagasse, seed hulls, and the like), cellulosic animal wastes, lawn clippings, seaweed, etc. In some embodiments, the biomass is distillers grains.

[0082] Any of the aforementioned biomass materials would be utilized as substrates for fermentative conversion to ethanol.

[0083] "Biomass derived intermediate" refers to a carbohydrate or non-sugar intermediate derived from biomass deconstruction.

[0084] "Hemicellulose" refers to any of several branched heteropolymers including arabinoxylans present along with cellulose in most plant cell wails, Hemicellulose has a random amorphous structure that is easily hydrolyzed by dilute acid or base or hemicellulose enzymes to liberate xylose and other carbohydrates.

[0085] "Xylan" refers to a variety of complex polysaccharides found in plant cell walls consisting of xylose.

[0086] "Cellulose" refers to a polysaccharide consisting of a linear chain of beta 1 -4 linked D glucose units.

[0087] "Cellulosic enzymes" or "ceiluiase" refers to proteins that catalyze the hydrolysis of cellulose.

[0088] "Saccharified" means release of carbohydrates such as glucose and xylose from pretreated biomass using chemical or biological methods, including enzymatic digestion. Cellulosic enzymes commonly used for saccharification of pretreated biomass include, but are not limited to lignin peroxidases, cellobiohydrolases, endoglucanases, beta-glucosidases and xylanases. Cellulosic enzyme mixtures may be a mixture of any of the abo ve enzymes and can be derived from organisms naturally expressing the enzymes or from organisms expressing these enzymes as heterologous proteins.

"Control cell" refers to a cell that expresses an unmodified form of the Haal In some embodiments, the control cell expresses an unmodified form of the Ha l protein from the endogenous chromosomal gene locus. In some embodiments, the control ceil contains an expression cassette or vector that does not encode an Haal protein.

DETAILED DESCRIPTION OF THE INVENTION

Improved Haal trimsription factor proteins

[0090] As shown in the Examples, a series of amino acid changes have been introduced in the yeast gene HAA1 and have been shown to improve tolerance to acetic acid of yeast strains carrying the mutations. Thus, the improved Haal proteins described herein, wherein expressed in a microorganism (e.g., yeast), increase the amount of microorganism growth relative to a control or reference microorganism in a specified time under specified conditions. In some embodiments, the improved Haal proteins described herein increase the rate (e.g., kinetics) of fermentation or increase the titer of fermentation product performed by a microorganism relative to a control or reference Haal protein in a specified time under specified conditions. In some embodiments, the improved Haal proteins enable the microorganisms to tolerate higher acetic acid levels in the medium relative to a control or reference microorganism. In some embodiments, the improved Haal proteins enable the microorganisms to tolerate higher acetate levels in the medium relative to a control or reference microorganism. In some embodiments, the improved Haal proteins enable the microorganisms to decrease the concentration of fermentable carbohydrates during fermentation. In some embodiments, the carbohydrates fermented are polysaccharides from cellulose and/or hemicellulose. In some embodiments, the carbohydrates measured are fermentable sugars. In some embodiments, the conditions are those specified in the examples.

[0091] In some embodiments, there is greater than 50% increase in titer of fermentation product performed by the haal mutant strain compared to a control strain under conditions of greater than 0.5% acetate under specified time and conditions.

[0092] In some embodiments, there is greater than 50% increase in titer of fermentation product performed by the haal mutant strain compared to a control strain under conditions of greater than 0,8% acetic acid under specified time and conditions.

[0093] In some embodiments, the cell comprising a mutant haal allele described herein has improved acetate resistance compared to a control yeast cell. For example, in some embodiments, the cell comprising a mutant haal allele described herein has increased resistance to solutions or biomass containing 0.01%, 0.05%, 0, 1%, 0.2%, 0.3%, 0.4%, 0.5%, 0.6%, 0.7%), or 0.8%) more w/v acetate as compared to a control cell. In some embodiments, the cell comprising a mutant haal allele described herein has improved acetic acid resistance compared to a control yeast cell. For example, in some embodiments, the cell comprising a mutant haal allele described herein has increased resistance to solutions or biomass containing 0.01%, 0.05%, 0.1 %, 0.2%, 0.3%, 0.4%, or 0.5%, 0.6%, 0.7%, or 0.8% more w v acetic acid as compared to a control cell. In some embodiments, the increased resistance to acetate and/or acetic acid is determined by measuring the growth of the cell in solutions or biomass containing acetate and/or acetic acid. In some embodiments, the increased resistance to acetate and/or acetic acid is determined by measuring the rate of formation of a fermentation product or titer of a fermentation product in solutions or biomass containing acetate and/or acetic acid. In some embodiments, the cell is a yeast cell.

[0094] In some embodiments, an increase in the amount of fermented carbohydrate can be determined by measuring the amount of desired product produced from fermenting the biomass hydrolysate under specified conditions. For example, in some embodiments, the carbohydrate is derived from glucan, xylan, or another fermentable sugar polysaccharide. In some embodiments, the carbohydrate is a sugar biomass- derived intermediate. In some embodiments, the downstream product is a non-sugar biomass-derived intermediate. In some embodiments, the conditions are those specified in the examples.

[0095] The improved or unmodified Haal protein control preparations provided herein can be of any origin, e.g., they can be from bacteria, yeast, fungus or other organisms, in some embodiments, the Haal proteins are yeast proteins. In some embodiments, the Haal proteins are Haal protein variants having at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or higher sequence identity to a yeast Haal protein, e.g., a Haal protein of SEQ ID NOS: l-15 (e.g., SEQ ID NOS: l , 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15) and in some embodiments contain one or more of the mutations described herein.

[0096] Accordingly, in some embodiments, the unmodified form of the Haal protein is a wild-type or a naturally occurring Haal protein, such as, for example, a yeast Haal protein. In some embodiments, a control yeast is a yeast that expresses an unmodified form of the Haal protein. In some embodiments, the control yeast expresses an unmodified form of the Haal protein from the endogenous chromosomal gene locus. In some embodiments, the control yeast contains an expression cassette or vector thai does not encode an Ha l protein,

[0097] The improved Haal proteins provided herein comprise one or more amino acid substitutions relative to the unmodified Haal protein. In some embodiments, the amino acid substitution^) comprise at least an amino acid substitution at a position corresponding to a position selected from the group consisting of (a) position 440 of SEQ ID NO:2, having an amino acid other than F: (b) position 518 of SEQ ID NO:2 having an amino acid other than P; (c) position 508 of SEQ ID NO:2 having an amino acid other than D; (d) position 510 of SEQ ID NO:2 having an amino acid other than N; (e) position 527 of SEQ ID NO:2 having an amino acid other than A; (f) position 591 of SEQ ID NO:2 having an amino acid other than I; (g) position 605 of SEQ ID NO:2 having an amino acid other than H; (h) position 622 of SEQ ID NO:2 having an amino acid other than S; (i) position 639 of SEQ ID NO: 2 having an amino acid other than S; (j) position 673 of SEQ ID NO:2 having an amino acid other than S and (k) position 554 of SEQ ID NO:2 having an amino acid other than N.

[0098] In some embodiments, the improved Haal protein is a mutant Haal protein comprising a sequence substantially identical (e.g., at least 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity) to SEQ ID NO: l or 2 and having improved Haal protein activity compared with a control Haal protein of SEQ ID NO: 2 wherein the amino acid of the mutant Haal protein corresponding to position 440 is Phenylalanine (F) substituted with Tyrosine (Y); position 518 is Proline (P) substituted with Serine (S); position 591 is Isoleucine (I) substituted with Valine (V); position 605 is Histidine (H) substituted with Tyrosine (Y); position 622 is Serine (S) substituted with Phenylalanine (F); position 639 is Serine (8) substituted with

Phenylalanine (F); and position 673 is Serine (S) substituted with Leucine (L).

[0099] In some embodiments, the improved Haal protein is a mutant Haal protein comprising a sequence substantially identical (e.g., at least 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity) to any of SEQ ID NO:4 and having improved Haal protein activity compared with a control Haal protein of SEQ ID NO: 2 wherein the amino acid of the mutant Ha l protein corresponding to position 508 is Aspartie Acid (D) substituted with tyrosine (Y); position 510 is Asparagine (N) substituted with Lysine (K); position 527 is alanine (A) substituted with Valine (V); position 553 has a single base pair insertion in the codon resulting in frameshift here and asparagine (N) substitution with isoleucine (I); additionally resulting in premature translational termination after position 554 and truncation of the haal protein.

[018(5] In some embodiments, the improved Haal protein is a mutant Haal protein comprising a sequence substantially identical (e.g., at least 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% sequence identity) to any of SEQ ID NOS: 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15 and having improved Haal protein activity compared with a control Haal protein of SEQ ID NO: 2.

[0101] Further provided herein are improved Haal proteins having improved Haal activity compared to a control Haal protein (e.g., a Haal protein comprising SEQ ID NO: i). In some embodiments, the improved Haal protein is a mutant Ha l protein substantially identical to SEQ ID NO:2 and improved acetic acid tolerance compared to a strain containing a control Haal protein (e.g., a Haal protein comprising SEQ ID NO: l or SEQ ID NO:2).

[0102] I some embodiments, the amino acid of the mutant Haal protein corresponding to position 440 of SEQ ID NO:2 is any amino acid other than F, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Haal protein corresponding to position 440 of SEQ ID NO:2 is F. In some embodiments, the amino acid of the mutant Haal protein corresponding to position 440 of SEQ ID NO:2 is Tyrosine (Y), for example, as set forth in SEQ ID NO:5.

[0103] In some embodiments, the amino acid of the mutant Haal protein corresponding to position 508 of SEQ ID NO:2 is any amino acid other than D, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Haal protein corresponding to position 508 of SEQ ID NO:2 is D. In some embodiments, the amino acid of the mutant Haal protein corresponding to position 508 of SEQ ID NO:2 is Tyrosine (Y) , for example, as set forth in SEQ ID NO:6. In some embodiments, the mutant Haal protein having a substitution at the position corresponding to position 508 of SEQ ID NO:2 further comprises a C-terminal deletion. In one embodiment, the C-terminal deletion begins at the position corresponding to amino acid 555 of SEQ ID NO:2. In some embodiments, the mutant Haal protein having a substitution at the position corresponding to position 508 of SEQ ID NO:2 comprises a deletion of amino acids corresponding to positions 555-694 of SEQ ID NO:2.

[0104] In some embodiments, the amino acid of the mutant Haal protein corresponding to position 510 of SEQ ID NO:2 is any amino acid other than N, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Ha l protein corresponding to position 510 of SEQ ID NO: 2 is N. In some embodiments, the amino acid of the mutant Haa l protein corresponding to position 510 of SEQ ID NO:2 is Lysine (K) , for example, as set forth in SEQ ID NO:7. In some embodiments, the mutant Haal protein having a substitution at the position corresponding to position 510 of SEQ ID NO:2 further comprises a C-terminal deletion. In one embodiment, the C-terminal deletion begins at the position corresponding to amino acid 555 of SEQ ID NO:2. In some embodiments ; the mutant Haal protein having a substitution at the position correspondmg to position 510 of SEQ ID NQ:2 comprises a deletion of amino acids correspondmg to positions 555-694 of SEQ ID NO:2.

[010S] In some embodiments, the amino acid of the mutant Haal protein corresponding to position 51 8 of SEQ ID NO:2 is any amino acid other than P, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Haal protein corresponding to position 510 of SEQ ID NO:2 is P. In some embodiments, the amino acid of the mutant Haal protein corresponding to position 518 of SEQ ID NO:2 is Serine (S) , for example, as set forth in SEQ ID M):8.

[01Θ6] In some embodiments, the amino acid of ihe mutant Haal protein corresponding to position 527 of SEQ ID NO:2 is any amino acid other than A, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Haal protein correspondmg to position 527 of SEQ ID NO:2 is A. In some embodiments, the amino acid of the mutant Haal protein corresponding to position 527 of SEQ ID NO:2 is Valine (V) , for example, as set forth in SEQ ID NO:9. In some embodiments, the mutant Haal protein having a substitution at the position corresponding to position 527 of SEQ ID NO:2 further comprises a C-terminal deieiion. In one embodimeni, the C-terminal deletion begins at the position corresponding to amino acid 555 of SEQ ID NO:2. In some embodiments, the mutant Haal protein having a substitution at the position corresponding to position 527 of SEQ ID NO:2 comprises a deletion of amino acids corresponding to positions 555-694 of SEQ ID NO:2.

[0107] In some embodiments, the amino acid of the mutant Haal protein corresponding to position 554 of SEQ ID O:2 is any amino acid other than N, and the control Haal protein has the same amino acid sequence as the mutant Ha l protein except that the amino acid of the control Haa l protein corresponding to position 554 of SEQ ID NO:2 is . In some embodiments, the amino acid of the mutant Haal protem corresponding to position 554 of SEQ ID NO:2 is lsoleucine (I) , for example, as set forth in SEQ ID NO: 10. In some embodiments, the mutant Haal protein having a substitution at the position corresponding to position 554 of SEQ ID NO:2 further comprises a C-terminal deletion. In one embodiment, the C-terminal deieiion begins at the position corresponding to amino acid 555 of SEQ ID NO:2. In some embodiments, the mutant Haal protein having a substitution at the position corresponding to position 554 of SEQ ID NO:2 comprises a deletion of amino acids corresponding to positions 555-694 of SEQ ID NO:2. [1)108] In some embodiments, the amino acid of the mutant Haal protein coiTesponding to position 591 of SEQ ID NO:2 is any amino acid other than I, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Haa l protein corresponding to position 510 of SEQ ID NO:2 is I. in some embodiments, the amino acid of the mutant Haal protein corresponding to position 591 of SEQ ID O:2 is Valine (V) , for example, as set forth in SEQ ID NO: l 1.

[0109] In some embodiments, the amino acid of the mutant Haal protein coiTesponding to position 605 of SEQ ID NO:2 is any amino acid other than H, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Haal protein corresponding to position 605 of SEQ ID NO:2 is H. In some embodiments, the amino acid of the mutant Haal protein coiTesponding to position 605 of SEQ ID NO:2 is Tyrosine (Y) , for example, as set forth in SEQ ID NO: 12.

[0110] In some embodiments, the amino acid of the mutant Haal protein coiTesponding to position 622 of SEQ ID NO:2 is any amino acid other than S, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Haal protein corresponding to position 5.10 of SEQ ID NO: 2 is S, in some embodiments, the amino acid of the mutant Haal protein coiTesponding to position 622 of SEQ ID NO:2 is Phenylalanine (F) , for example, as set forth in SEQ ID NO: 13.

[0111] In some embodiments, the amino acid of the mutant Haal protein coiTesponding to position 639 of SEQ ID NO:2 is any amino acid other than S, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Haal protein corresponding to position 639 of SEQ ID NO: 2 is S, in some embodiments, the amino acid of the mutant Haal protein coiTesponding to position 639 of SEQ ID NO:2 is Phenylalanine (F) , for example, as set forth in SEQ ID NO: 14.

[0112] In some embodiments, the amino acid of the mutant Haal protein coiTesponding to position 673 of SEQ ID NO:2 is any amino acid other than S, and the control Haal protein has the same amino acid sequence as the mutant Haal protein except that the amino acid of the control Haal protein corresponding to position 673 of SEQ ID NO:2 is S, in some embodiments, the amino acid of the mutant Haal protein coiTesponding to position 673 of SEQ ID NO:2 is Leucine (L) , for example, as set forth in SEQ ID NO: 15.

[0113] In some embodiments, the mutant Haal protein comprises one, two, three, four, five, or more mutations as described herein. For example, in some embodiments, the mutant Haal protein is substantially identical to SEQ ID NO:2 and comprises one, two, three, four, five, six, or seven, or more mutations selected from the group consisting of mutations at a position corresponding to a position selected from the group consisting of (a) position 440 of SEQ ID NO:2, having an amino acid other than F: (b) position 518 of SEQ ID NO:2 having an amino acid other than P; (c) position 591 of SEQ ID O:2 having an amino acid other than I; (d) position 605 of SEQ ID NO:2 having an amino acid other than H; (e) position 622 of SEQ ID NO:2 having an amino acid other than S; (f) position 639 of SEQ ID NO:2 having an amino acid other than S; and (g) position 673 of SEQ ID O:2 having an amino acid other than S.

[0114] In some embodiments, the mutant Haal protein comprises one, two, three, four, five, or more mutations as described herein. In some embodiments, the mutant Haal protein has a C-terminal Intncation starting at the amino acid corresponding to position 554 of SEQ ID NO:2. For example, in some embodiments, the mutant Haal protein is substantially identical to amino acids 1-554 of SEQ ID NO: 2 and comprises one, two, three, four, or more mutations selected from the group consisting of mutations at a position corresponding to a position selected from the group consisting of (a) position 508 of SEQ ID NO:2, having an amino acid other than D; (b) position 510 of SEQ ID NO:2 having an amino acid oiher than N; (c) position 527 of SEQ ID NO:2 having an amino acid other than A; (d) position 554 of SEQ ID NO:2 having an amino acid other than N; (e) position 555 containing a stop mutation due to translational frameshifi due to upstream nucleotide insertion.

[0115] Wifdtype Haal protein has been reported to be a transcription factor that binds to SEQ ID NO: 16 in promoter sequences (e.g., the ACRE region of the TP03 promoter). See, e.g., Mira, N.P., Nuc. Acids Res. 1-12 (201 1). Without intending to limit the scope of the invention, it is believed that improved haal mutant proteins described herein also bind to SEQ ID NO: 16 in promoter sequences, thereby regulating transcription. Promoter binding assays are described in, e.g., Mira, N.P., Nuc. Acids Res. 1-12 (201 1 ), and can be used to measure protein binding to DNA sequences.

[0116] The improved Haal protein discussed herein may be recombinantly expressed by molecular cloning into an expression vector containing a suitable promoter and other appropriate transcription regulatory elements, and transferred into prokaryotic or eukaryotic host ceils to produce recombinant enzymes. Techniques for such manipulations are fully described by Sambrook et ai. (Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., (1989); Current Protocols in Molecular Biology, Ausubel et al., Green Pub. Associates and Wiley-Tnterscience, New York ( 1988); Yeast Genetics: A Laboratory Course Manual, Rose et al, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., (1990)).

[0117] A variety of techniques are available and known to those skilled in the art for introduction of nucleic acid constructs into a cellular host. Transformation of microbial cells may be accomplished through, e.g., use of polyethylene glycol, calcium chloride, viral infection, DEAE dextran, phage infection, electroporation and other methods known in the art. Transformation of fungus, in particular Pichia, may be accomplished, for example, according to "Pichia Protocols", in Methods Mol. Biol, Higgi s, David R. and Cregg, James M.; Eds. (Humana, Totowa, N.J.) (1998). Introduction of the recombinant vector into yeasts can be accomplished by methods including electroporation, use of spheroplasts, lithium acetate, and the like,

[0118] Polynucleotides comprising a nucleic acid encoding the improved Haal protein are also provided. In some embodiments, the polynucleotide comprises an expression cassette comprising a heterologous promoter operably linked to the nucleic acid. Also provided herein are vectors comprising the polynucleotides provided herein, and isolated cells or culture of cells comprising the polynucleotides that is heterologous to the cell. In some embodiments, the cell is a bacteria or yeast cell. In some embodiments, the cell is a Saccharomyces cerevisiae.

8. Yeasts

[0119] In some embodiments, the improved Haal proteins or other enzymes discussed herein are heterologous!}' expressed in one or more yeast strain. Any yeast strain can be used according to the present invention. Yeast are unicellular microorganisms thai belong to one of three classes: Ascomycetes , Basidiomycetes and Fungi Imperfecti. While pathogenic yeast strains, or nonpathogenic mutants thereof, can be used in accordance with the present invention, nonpathogenic y east strains will generally be used. Exemplary genera of yeast strains include Saccharomyces, Candida, Cryptococcus, Hansenula, Kluyveromyces, Pichia, Rhodotorula, Schizosaccharomyc.es and Yarrowia. Exemplary species of yeast strains include Saccharomyces cerevisiae, Saccharomyces carlsbergensis, Candida albicans, Candida kefyr, Candida tropicalis, Candida intermedia, Cryptococcus laurentii,

Cryptococcus neoformans, Hansenula anomala, Hansenula polymorpha, Kluyveromyces fragilis, Kluyveromyces laciis, Kluyveromyces marxianus var. lactis, Pichia pasioris, Rhodotorula rubra, Schizosaccharomyces pombe, and Yarrowia lipolylica. It is to be appreciated that a number of these species include a variety of subspecies, types, subtypes, etc. that are meant to be included within the aforementioned species. In some embodiments, a yeast strain capable of replicating plasmids to a particularly high copy number is used. In some embodiments, a temperature -tolerant yeast strain is used. In some embodiments, an inhibitor-tolerant yeast strain is used.

[0120] The present invention provides for yeast strains that express the improved Haal protein discussed herein. Yeast expressing the improved Haal protein discussed herein can be generated as is known in the art. For example, expression cassettes comprising a promoter operably linked to a coding sequence for the improved Haal protein discussed herein can be (optionally inserted into a nucleic acid vector and) introduced into the yeast. A number of expression vectors for various yeast species are known in the art and some can be obtained commercially. Vectors can optionally include an origin of replication and/or a marker gene for identifying cells transformed with the vector. In some embodiments, the expression cassettes are stably introduced into a yeast chromosome or extrachromosomal DNA.

[0121] Any number of promoters can be used to drive expression from the expression cassettes of the invention. Exemplary promoters include, e.g., constitutive or inducible promoters. Recombinant gene expression can be driven by promoters including, but not limited to, the yeast GAL 10 gene promoter, the phosphogiycerate kinase (PGK) promoter (see, e.g., Tuite, M. F. el al. ( 1982) EMBO Journal 1, 603-608.: WO 84/04757),

GAL10/PGK promoter chimeras (see, e.g., US Patent No. 5,739,007) or other yeast promoters such as alcohol dehydrogenase (see, e.g., Bennetzen, J.L. and Hall, B.D. J. Biol. Chem. 257:3018 (1982); Ammerer, G. in Methods in Enzymology Vol 101, p. 192 ( 1983)) phosphogiycerate kinase (see, e.g., Derynek, R., Hitzemann, R.A., Gray, P.W., Goeddei, D.V., in Experimental Manipulation of Gene Expression, 1983, p. 247, ed. M. Inouye, Academic Press), triose phosphate isomerase (see, e.g., Alber, T. and Kawasaki, G., ./. Molec and Applied Genet. 1 : 419 - 434 (1982)), or enolase (see, e.g., In es, M.A. et al. Science 226:21 ( 1985)) can be used in a similar manner.

[0122] Expression vectors used in yeast cells can also contain sequences necessary for the termination of transcription and for stabilizing the lnRNA. Such sequences are commonly available from 3' to the translation termination codon, in untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA. 9. Method for converting a cellulose-containing biomass feedstock to ethanol

[0123] In some embodiments, the pretreated and saccharified cellulosic biomass feedstock is added directly to cells containing the mutant Haal protein provided herein to form an aqueous mixture and incubated under conditions to allow for efficient fermentation of cellulosic sugars to ethanol or other products. Accordingly, provided herein are methods for converting cellulosic sugars derived from biomass feedstocks into ethanol. in some embodiments, the cellulose-containing biomass feedstock is corn grain or corn stover. In some embodiments, the cellulose-containing biomass feedstock is sugarcane bagasse.

[0124] In some embodiments, the cellulose-containing biomass feedstock is contacted with a ceil expressing an improved Haal protein. The heterologous Haa l protein-expressing ceil can be any cell known in the art, including bacteria, yeast, or other cells.

[0125] In some embodiments, the cellulose-containing biomass feedstock is first pretreated to render the cellulose more available to the enzymes. In some embodiments, the feedstock is ground into finer pieces or otherwise treated to increase surface area of the material. In some embodiments, the pre-treatment comprises at least one of the following: acid hydrolysis (see, e.g., US Patent Nos . 4,174,976 and 5,597,714; and PCT Publication WO/2006/086861), steam explosion (see, e.g., US Patent No. 6,506,282 and PCT Publication WO/2000/039387), autohydrolysis, ionic liquids (see, e.g., US Patent No. 6,824,599), hot water, ammonia explosion (see, e.g., US Patent No. 5,037,663), extrusion (see, e.g., US Patent No. 7,037,096: ), or microwave treatment (see, e.g., US Patent No. 5,196,069),

[0126] In some embodiments, the cellulose-containing biomass feedstock is first pretreated to render biomass particles having small sizes (e.g., milled). It has been noted that yield of biofuel (e.g., ethanol) can be improved by using biomass particles having small sizes, e.g., biomass particles having a relatively uniform particle size of less than 1600 microns. For example, at least 75%, 85%, or 95% of the pretreated biomass particles have a particle size from about 100 microns to about 800 microns, or a particle size from about 100 microns to about 500 microns, Pretreated biomass particles can be generated by, e.g., a hammer mill or a colloid mill or a shear mill or a cavitation mill; serial combinations of any two or more of these can also be employed. For example, the colloidal mill can be used to select the resulting particle size distribution through the use of gap rotational controls. A relatively precise particle size distribution can be obtained from much larger biomass material using a colloid mill in contrast to alternative pretreatment techniques such as comminution with a hammer mill. An appropriate gap size on the colloid mill can produce a highly uniform suspension of biomass, where the maximum particle size of the biomass is greatly reduced and significantly more uniform compared to using only the comminution device. The radial gap size for a colloidal mill used in a corn ethanol plant can range from 0.104 ■■ 0.728 millimeters, e.g., from 0.104-0.520 millimeters, e.g., from 0.208 - 0.52.0 millimeters, such that the resulting particle sizes are in the range of 100 - 800 microns. For example, in some embodiments, a gap setting of 0.1-0.15 is used for corn stover or other cellulosic biomass and a gap setting of 0.2-0.3 mm is used for grains including but not limited to corn kernels. As a second example, a shear mill can be used to reduce particle size of cellulose-containing materials under high shear action, especially for fibrous woody material. In shear milling, the material is processed through several generator stages (typically three) of a dispersing device which produces very fine suspensions. The stages consist of rotor-stator combinations to reduce particle size and create a very narrow size distribution from larger-sized woody feedstock material. Various combinations of generators can be used to achieve desired particle size reductions, such as suspensions containing particles in the range of 100 to 300 microns. Techniques for generating biomass particles having small sizes are fully described by, e.g., U.S. Patent Application Publication No. 20100055741 , the content of which is incorporated by reference in its entirety herein.

[0127] Tn some embodiments, fermentation temperatures will be controlled between 28 - 35°C and pH 4.0-5.5. In some embodiments, a temperature-tolerant yeast cell strain can be used, and accordingly a higher fermentation temperatures can be used (e.g., at or above 35°C, 36°C, 37°C, 38°C, 39°C, 40°C, 41°C, or 42°Q.

[0128] In some embodiments, the cellulose- containing biomass feedstock can be used as an inexpensive form of sugar (i.e., for value added products). In some of these embodiments, excess sugar is bled from the saccharification tank(s) (i.e., where the enzymes are converting plant material to sugar), for example using a sequential membrane, filtrate wash, or other sugar removal system. This reduces the sugar concentration in the saccharification tank(s) and allows for hydrolysis to continue without being inhibited by excess sugar. Residual non- sugar producing solids can be optionally purged forward for further processing or for other uses (such as for fuel value in a cogeneration system).

[0129] Acetic acid or its cognate base acetate can accumulate resulting from some biomass pretreatment methods. The methods, polynucleotide sequences and cells provided herein provide for cells with increased resistance to high levels of acetic acid (e.g., >0.8%), which are toxic to most yeast cells. Thus, in some embodiments, the biomass contacted with the cell expressing the Haal mutant protein has at least 0.5%, 0.8%, or 1 .0% acetic acid and/or acetate.

[0130] Furfurals can accumulate in some pretreatment methods and/or saccharification reactions. Furfurals can in some embodiments act as yeast growth inhibitors. Thus, bacteria or yeast that consume furfurals can also be added to the fermentation to selectively reduce or eliminate the furfurals. The methods, polynucleotide sequences and cells provided herein provide for cells with increased resistance to high levels of furfural (>1000 ppm) which are toxic to most yeast. The methods, polynucleotide sequences and cells provided herein also pro vide for ceils with increased resistance to elevated levels of furfural (>600 ppm) in combination with 100 mM acetate.

[0131] In some embodiments, the mixture of improved y east cells and the cellulose- containing biomass feedstock are incubated to result in production of sugars from cellulose or other plant material and subsequent fermentation of the sugars into alcohols. Industrial fermentation conditions are known in the art. In some embodiments, a modified form of Simultaneous Saccharification and Fermentation (SSF) can be accomplished by using a small saccharification step in order to produce a small amount of sugar to promote yeast growth. This partially converted media is then sent to the fermenter. After the ferm enter vol me is approximately 10-20% of the total fermenter volume the yeast inoculum is added. The tank is then continuously filled in a fed batch mode over a period of 25-35 hours and then held at 35°C until the fermentation is complete (~72hrs). This allows sufficient use of the sugars to prevent inhibition of the process. To improve alcohol production, yeast strains with a high ethanol tolerance can be selected. In some embodiments, yeast growth stimulants can also be added to the mixture. For example, sterols can be added to stimulate yeast growth and enzyme production.

[0132] In some embodiments, the yeasts provided herein are exceptionally efficient for the production of ethanol. However, some of the same yeasts can be used for saccharification without subsequent fermentation. This can be accomplished, for example, by, e.g., allowing the yeasts to generate biomass hydroiysate, limiting ethanol production, followed by deactivation of the yeast so the fluid contains free enzymes and proteins. In the case of yeasts that have the expressed enzymes attached to the surface, the yeasts can be cultivated, deactivated with ultrasound and then used as immobilized enzymes within the

saccharification vessel. The yeast can be filtered at the end of the saccharification process along with the other solids in this manner. [0133] In some embodiments, the mixture of yeast cells and the cellulose-containing biomass feedstock are incubated to result in production of biomass-derived intermediates from cellulose or other plant material. As defined herein, the term "biomass- derived intermediate" refers to a carbohydrate intermediate derived from biomass deconstraction. In some embodiments, the biomass-derived intermediates are simple sugars, e.g.,

monosaccharides and disaecharides such as glucose, fructose, mannose, and galactose, sucrose, maltose, lactose, cellobiose, and derivatives thereof. In some embodiments, the biomass-derived intermediates are partial hydrolysis or partial depolymerization intermediates, e.g., cellobiose. In some embodiments, the biomass-derived intermediates are non- sugar biomass-derived intermediates. In some embodiments, the non-sugar biomass- derived intermediates are poJyols, e.g., sorbitol, anhydrosorbitoL glycerol, and propanediol. In some embodiments, the non-sugar biomass-derived intermediates are isomerization and dehydration products derived from biomass hydrolysis and fermentation process, e.g., "reversion products," "acyclic intermediates," and "fruciofuranosyl intermediates" as described in Chheda et al., Angew. Chem. Int. Ed. 46:7164-71 83, 2007. In some embodiments, the non-sugar biomass-derived intermediates are additional dehydration and fragmentation products of acyclic intermediates and fruciofuranosyl intermediates as described in Chheda et al., Angew. Chem, Int.. Ed. 46:7164-7183, 2007. In some embodiments, the non-sugar biomass-derived intermediates include furans, e.g., furfural, 5- hydroxymethylfu fural, di-formylfuran, and derivatives thereof (e.g., 2,5-funandicarbox lic acid, di(hydroxymetliyl)tetrahydi'ofuran, methyl tetrahydrofuran). Additional examples of biomass-derived intermediates are known in the art and disclosed in Chheda et al, Angew. Chem. Int. Ed. 46:7164-7183, 2007. In some embodiments, the non-sugar biomass-derived intermediates are amino acids and organic acids, such as ievulinic acid, formic acid, fumaric acid, aspartic acid, succinic acid, malic acid, 3-hydroxypropionic acid, aspartic acid, itaconic acid, glutamic acid, glucaric acid, gluconic acid. If desired, any of the above products (i.e., sugar biomass-derived intermediates or non-sugar biomass-derived intermediates) can be further purified from the remainder of the reaction mixtures and/or chemically or enzymatic-ally converted to yet another desired product,

[0134] The activity or amount of fermentation of the strain expressing the improved Haal protein described herein can be determined directly by HPLC assay for ethanol. The improved activity can also be determined by measuring the amount of carbohydrates converted to the desired produci by HPLC. The amount of acetic acid can be determined directly by HPLC assay. [0135] The applications of increased acetic acid or acetate tolerance of yeast strains include commercial processes as those employed in corn ethanol plants and cellulosic ethanoi biorefmeries or a merging of fermentation of the two corn grains; in operations for production of bio-based chemicals; and as a selective mechanism against growth of native contaminating yeast strains in fermentations.

Examples

1. Background.

[0136] Biological conversion of sugars present in lignocellulosic biomass to a desired end product is confined to microorganisms that typically use glucose and/or xylose as carbon sources. The efficient use of sugars in biofuels applications is particularly important due to the high fractional percentage of hemicellulose containing xylan, the branched polymeric precursor to xylose. Upon biomass pretreatment, the hemicellulose and therefore xylan is disrupted to liberaie inhibitors of microbiological fermentation. One of the pretreatment products derived from xylan is acetic acid which can form from hydrolysis of acetyl groups attached to xylan polymers during the pretreatment processes.

[0137] The yeast Saccharomyces cerevisiae is a preferred organism for biological conversion of sugars into desired metabolic products. Its long history of industrial applications is evidence of its robust metabolic pathways, high tolerance to ethanol, rapid gro wth rate, and efficient conversion of glucose. However, it is sensiti ve to acetic acid in concentrations starting above 50 m ' M (about 0.3% w/v; see Fig. 3 for conversion graph). This poses a problem for biofuels application in which pretreated biomass can liberate acetic acid at higher concentrations. In the case of thermo- mechanical pretreatment, acetic acid le vels associated with hemicellulose deconstruction can reach >0.5% w/v using greater than 10% biomass solids.

[0138] Acetic acid can exist as the protonated acid, or as its conjugate base acetate. The pKa of acetic acid is 4.75, near the pH at which fermentations are performed with 5'.

cerevisiae. At pH <5.5, acetic acid present in the cellulosic sugars material will be at equilibrium favoring acetic acid over acetate. The metabolic inhibitory effect mediated by this organic acid is more pronounced in its undissociated state at lower pH.

[0139] The mechanism of acetic acid toxicity has been investigated, but commercially relevant strains of engineered S, cerevisiae strains with higher tolerance to acetic acid have not been reported. Naturally arising yeast variants have been used commercially in producing sake and sourdough, but the mixture and concentrations of inhibitors generated during biomass pretreatment for cellulosic ethanol production present a novel acetic acid mileu for organisms to handle.

[0140] Further, yeast strains have differing sensitivities to acetic acid; Pichia stipitis, which ferments xylose, is highly sensitive at 0.5% typically, whereas Zygosaccharomyces bailii is highly resistant at this concentration. This reflects differing ability to utilize and/or evade the toxic effects of acetate in these species. Acetic acid toxicity derives from alteration of the proton gradient across the cell membrane leading to cell wall defects, disruption of mitochondrial function, inhibition of growth rate and extension of lag phase, decreased life span, down regulation of genes invol ved in mitochondrial protein synthesis and carbohydrate metabolism, and/or up regulation of genes involved in amino acid metabolism. Metabolomie analysis of bacteria showed that acetic acid stress impacted numerous functions including lactate, formate and ethanol fermentation pathways, electron transport, and fatty acid biosynthesis.

[0141] Some of the pleiotropic effects on metabolic function derive from altered gene regulation as shown by transcriptomic studies. This pointed to transcriptional regulation of acetic acid stress response genes via transcription factors and associated protein complexes. Therefore a protein that can regulate acetic acid response that is naturally present in 51 cerevisiae was targeted for PGR mutagenesis. Adaptive selection has traditionally been used for commercial yeast strains; however the generation of yeast with enhanced functional capabilities has been greatly accelerated due to 51 cerevisiae genomic sequence availability and the requisite molecular biological tools.

[0142] This approach to generate commercially useful strains that have higher tolerance to acetic acid through genetic engineering builds on the cell's natural defense mechanisms of resistance or adaptation, Sacch romyces can respond to high concentration of acid by actively extruding it or neutralizing it. The vacuole is a cellular organelle which plays a major role in intracellular pH regulation. Cellular metabolism depends on maintenance of a pH difference of -1.7 pH units between the vacuole and cytoplasm such that the vacuole is usually at a pH of 6 or lower while the cytoplasm is closer to neutrality. The vacuolar H ATPase is the pump which drives this reaction. Another H + ATPase resides in the plasma membrane and pumps protons out. Other transporters contribute to maintenance of the pH gradient by moving amino acids, polyarnines and metal ions in or out of the vacuole or by- pumping monocarboxylic acid anions out through the plasma membrane. Movement of imidazole-contain ng amino acids such as histidine appears to function prominently in the vacuole-dependent "buffering" process. [0143] At lower pH, acetate exists substantially in the undissociated state (CH3COOH), a form which potently inhibits growth. The undissociated acid, being uncharged, readily diffuses across the cell membrane only to dissociate in the higher pH environment of the cytosol. Such dissociation generates protons and the acid anion (CH3COO-). The acid anion will tend to accumulate intracellularly to very high levels as, being charged, it cannot very readily diffuse from the cell.

[0144] This high anion accumulation may generate an abnormally high turgor pressure. It can also influence free radical production, leading to the severe oxidative stress that is a major component of weak organic acid stress in aerobic S. cerevisiae. The proton release can potentially acidify the cytosol. This acidifscation, if it occurs, will inhibit many metabolic functions. Reductions in S. cerevisiae intracellular pH have been demonstrated following the addition of acetate although a reduction in intracellular pH is not always a feature of organic acid stress.

[0145] The HAAl gene is located centromere-proximal on chromosome XVI and encodes a DN A- binding transcription factor. Null mutants in HAAl are more sensitive to bui ric acid, propionic acid and acetic acid (Fernandes 2005). The pleiotropic response of acetic acid stress is further explained by the fact that the HAAl gene has an Adrl protein regulatory- binding site upstream, a carbon source-responsive zinc-finger transcription factor, required for transcription of the glucose-repressed gene ADH2, of peroxisomal protein genes, and of genes required for ethanoi, glycerol, and fatty acid utilization.

[0146] Provided herein are engineered microorganisms with enhanced tolerance to acetic acid or its cognate base acetate.

2. Cloning and Expression of HAAl wild type S. cerevisiae gene.

[0147] The HAAl gene was cloned in the plasmid shuttle vector p416 Tef (Mumberg et al., 1995) by PGR amplification of a DNA fragment from Sacc aromyces cerevisiae strain TR3, cleavage with restriction enzymes Xba I and Sal I and ligation into the appropriate sites of the vector for expression from the plasmid-borne Tef promoter.

[0148] Mutagenesis of the carboxy-terminal -350 amino acids of the HAA l gene was carried out by PGR synthesis of a smaller Xba I-Sal I DNA suhfragment of the HAAl gene using the Mutazyme protocol (GeneMorph II Kit). This fragment was then recloned into the same vector using the same restriction enzyme cleavage and ligation protocol, and the mixture of mutated and non-mutated DNAs was transformed into strain TR3. [0149] After appropriate recover '- of trans formants in rich media, a set of 40 independent acetate resistant mutants was isolated from the leading edge of an acetate gradient plate, containing the higher acetate concentration, made with media selective for the vector. The mutants were colony purified and plasmid DN A was isolated and retransformed into bacteria in order to amplify it for DNA sequence determinat on. A simplified scheme of the protocol used to mutagenize, clone and screen for mutants is shown in Figure 1. Acetate gradient plate screening is illustrated in Figure 2.

3. haul Mutant Screening.

[0150] Each of the mutants was grown in liquid selective media and the cell densities were normalized to enable spot testing on a series of plates containing increasing acetate concentrations. A subset of six of the more highly acetate resistant strains was chosen for further characterization (Figure 4). Sequences of these mutants were aligned to determine if any consistent pattern of mutated residues was present (Figure 5A-C). This was further refined to two distinct mutant strains which showed consistent growth on 140 mM acetate (0.82% w/v). The sequence differences from the wild type sequence are shown in Figure 6.

4. Integration of haul mutant alleles into S. cerevistae genome,

[0151] Both haul mutant alleles were PGR amplified along with the flanking DNA containing the heterologous constitutive promoter and downstream terminator. In addition, a URA3 marker was incorporated into the fragment to provide a selectable marker for the transformation and homology to the HAA1 focus was incorporated in the fragment on both sides to promote integration into this region of the chromosome. The DNA fragments were used to transform a yeast strain that was deficient for uridine biosynthesis and 10-20 URA3 recombinants were isolated for each mutant. The strategy of PGR s nthesis by the overlapping PGR method and integration of the PGR fragment into the genomic HAA1 focus is shown in Figure 7.

[0 52] PGR analysis of the chromosomal DNA was carried out and those candidates demonstrated to contain full-length insertions of both alleles along with the flanking DNA were analyzed for acetate resistance. Clonal isolates containing genomic insertion of only one of the alleles conferred acetate resistance (haal mut2). Both alleles were subjected to counterselection on plates containing 5-fluoroorotic acid to identify recombinants which had lost the URA3 marker. Again, se veral ura- candidates of each of the alleles were characterized by DNA sequencing to demonstrate that the amino acid changes were incorporated into the yeast genome. [0153] Acetate resistance of the recombinants was again assayed. It was clear from the DNA sequencing experiment that the mutational changes of both alleles were incorporated into the genomic HAA1 locus as predicted. However, one integrated allele {haal mat 2) could provide acetate resistance as the sole source of Haa l function, whereas the other could not (haal mut 40), This result is depicted in Figure 8.

Example 1 : selection of haal viable cells from colonies at leading edge of acet te gradient plates.

[0154] Gradient plates containing acetate (pH 4.5) were poured using a modification of the method (BRYSON V, SZYBALSKT W. Microbial selection. Science. 1952

Jul 18 ; 1 16(3003):45-51 ). Plate pouring was carried out in a sterile biological safety cabinet. Omni trays (NUNC, 86 x 128 mm) were used and inclined on the short edge by leaning the plate bottom on the plate lid. Molten CSM agar (- uracil) was prepared from the standard CSM premix (MP Biochemicals) and 0.7 ml 5M sodium acetate (pH 4.5) was added to 30 ml of agar which was poured into the plate and allowed to solidify as the lower "wedge". The plate was laid horizontally in the sterile hood and the upper "wedge" of 30 ml of CSM agar ( ■■ uracil) was poured on top of the lower layer. The pl te was allowed to solidify and mixtures of freshly transformed yeast cells were spread on the plates using sterile glass beads.

"Libraries" of transformed yeast cells were plated which contained a mixture of the mutagenized gene encoding the Haal protein inserted within the p416 tef shuttle vector. These were incubated at 30° C for 5-7 days in order to produce distinct large colonies which could be subsequently re-isolated using sterile toothpicks and streaked on CSM (-uracil) agar without acetate for colony purification. An example of such a gradient plating is shown in Figure 2. Forty distinct clearly separated colonies were isolated from the leading edge of the gradient in this way. These represented a random sampling of the most putatively highly acetate-resistant haal mutants.

Example 2: comparative growth of haul mutants on single concentration acetate medium in Petri dishes.

[0155] Putative haal mutants colony purified from the gradient plates were subjected to a detailed analysis of acetate resistance by replating on a series of CSM (-uracil) plates containing fixed concentrations of acetate, prepared as described above, but without introduction of a gradient. The plates contained 0 niM, 100 mM, 1 10 mM, 12.0 mM, 130 mM or 140 mM acetate and were prepared in standard petri dishes (BD Falcon, 100 x 15 mm). Cells were grown overnight in culture tubes containing CSM (-uracil) liquid media and normalization to similar OD600 values was carried by measurement of QD600 in a spectrophotometer (Genesys 10UV) and dilution with sterile media. Appropriately diluted cultures were then sterilely transferred to 96 well plates (Greiner Bio One, PS-microplate, flat bottom) and serially diluted with sterile water in 10-fold steps down adjacent columns of the plate. Approximately equal volumes (3 ul) were transferred to each of the series of fixed concentration acetate plates using a 48 pin replicator (V&P Scientific, VP 407 AH multi-blot replicator). In this way, acetate resistance of up to six strains could be compared on a single petri plate. Duplicate plates at each concentration of acetate were used for the analysis.

Plates were allowed to incubate approximately 5 days at 30° C and then growth was compared. A sample of this result is shown in Figure 4.

Example 3: Colony growth plating assay to assess acetate resistance of haul plasmid bearing strains in recycle medium.

[0156] To investigate the growth properties of haal mutant strains in a more relevant biomass-based growth material, cells were plated on solid medium containing varying percentages of pretreated, saccharified and fermented cellulosic sugars mixtures.

[0157] A series of plates was prepared by mixing one volume of 2x concentrated C SM (- uracil) agar and one volume of sterile high pressure/high temperature pretreated (HPHT) recycle mixture or sterile water as a control. HPHT mixtures were prepared by mixing mechanically treated biomass samples with water and treating in a sealed pressure cell at high temperature, allowing the mixture to cool and then saccharifying with a mixture of celluloiytic enzymes and then subjecting this to fermentation with Ede iQ proprietary Saccharomyces cerevisiae strains. The saccharified and fermented HPHT mixture was then centrifuged to remove solids and the supernatant was treated in a rotary evaporator to remove ethanol (Buchi otavapor R-2.15). This mixture was diluted with water appropriately (0%, 40%, 60%, 80%, 100%) and mixed with biomass and another round of HPHT treatment followed by saccharification was performed to generate the "HPHT recycle" mixture. This material was subjected to centrifugation to remove solids and filter sterilized to generate the 0%, 40%, 60%, 80%, 100% HPHT which were then mixed 1 : 1 with the molten 2x CSM ( ■■ uracil) agar components. CSM agar-HPHT recycle mixtures were poured into Omni trays (NUNC) as described above to create a series of plates containing recycle mixtures at five different concentrations. Six plasmid-borne versions of the haul gene isolated from a mutagenic "library" and containing changes at various positions in the haul gene were compared to a naive strain containing the p416 Tef vector or a strain bearing the unmutated haul gene. Each isolate was grown in selective media and cell concentrations were normalized as described above. Spot testing was arned out as described above using the 48 pin replicator. Plates were incubated at 30°C for eight days and growth was compared.

[0158] Higher levels of acetate resistance were observed when strains bearing certain plasmids are plated in a 50% mixture of CSM -ura media and "recycle HPHT" extracts under some conditions. For example, the haal mut40 mutant gave >10Q fold greater colony forming units when normalized to the reference strain in plating controls when grown on biomass "recycle HPHT' medium at the 80% recycle level. This result is shown in Figure 9.

Example 4: colony growth assay on acetate-furfural containing plates.

[0159] To demonstrate that synergistic inhibitor effects are contributing to the inhibition phenotypes observed in more complex mixtures, an experiment testing the effect of a series of increasing furfural concentrations mixed either with no acetate or 100 mM acetate with was perfonned

[0160] A set of plates was prepared by mixing CSM (-uracil) agar and a series of increasing amounts of furfural dissolved in ethanol (0.4 g/ml, 0.6 g ml, 0.8 g/ml and 1.0 g/ml). Another set of plates was prepared by adding acetate to 100 mM as described above and then combining this with a series of increasing amounts of furfural dissolved in ethanol as described above. Control plates were prepared containing CSM (-uracil) agar and 100 mM acetate. CSM (-uracil) agar-furfural-acetate mixtures were poured into Omni trays (NUNC) as described above to create a series of plates containing furfural-acetate mixtures at the four different concentrations. Six plasmid-borne versions of the haal gene isolated from a mutagenic "librar ''" and containing changes at various positions in the haal gene were compared to a na ' ive strain containing the p416 Tef vector or a strain bearing the un-mutated haal gene. Each isolate was grown in selective media and cell concentrations were normalized as described above. Spot testing was carried out as described above using the 48 pin replicator as shown in Figure 10. Plates were incubated at 30°C for eight to ten days and growth was compared.

[01 1] Tt can be seen that tolerance to higher furfural levels occurred in the absence of acetate (lower panel). Presence of 100 mM acetate in the mix reduced furfural tolerance of the TR3 Saccharomyces cerevisiae yeast strain by more than 0.2 grams/liter. Cells expressing the improved Haal mut2 and Haal mut40 proteins produced > 100 fold increase in colony forming units when normalized to a reference strain in 600 ppm furfural and 100 mM acetate. This result can be seen in Figure 10. Example 5: Fermentation with WT an Haal mntl in pretreated/saccharified biomass cellulosic sugars extract.

[0162] Single colonies of either the TR3 wild type or TR3 haal mull strain were isolated from cultures streaked onto YPDC plates and inoculated into 2.5 ml volumes of YPDC liquid media and incubated overnight at 30°C in an orbital shaker at 130 rpm. Each cell type was subcuitured overnight in 200 ml YPDC liquid media in a 1 L baffled Erleiimeyer flask and incubated as described above. Ceils were pelleted by centrifugation at 5000 rpm for 10 minutes and then resuspended and pre-adapted overnight in a mixture of 70% of 4% molasses medium/30% pretreated saccharified sugar cane bagasse at 34°C in a Thermo MaxQ rotary- shaker at 130 rpm. A portion of the culture was diluted 1 : 100 in distilled water and stained with methylene blue and then counted on a Petroff-Hausser brightline hemocytometer to determine viable cell number.

[0163] Saccharified bagasse that had been subjected to thermo-mechanical pretreatment was adjusted to 18% solids for fermentation with the wild type and improved haal mut2 yeast strains. Fermentation was carried out at 34°C, in 500 ml Erlenmeyer flasks containing 100 g of biomass sealed with a rubber stopper and aspirator needle in a Thermo MaxQ rotary shaker at 130 rpm. 40 x 10 6 cells of either the wild type or the haal miii2 allele were added per gram of biomass to the flasks. In half of the flasks, acetate in the form of 5M sodium acetate pH 4.8, was supplemented to a final concentration greater than 0,8% w/v. The control (TO) was 0.85%) w/v; wild type TR3 (T24 hr fermentation) was 0.81% w/v; improved TR3 haal mut2 (T2.4 hr fermentation) was 0.82% w/v. Flasks were sampled at 0 hours and 2.4 hours after addition of the fermentation organisms.

[0164] Samples were analyzed for sugars, acetic acid, glycerol and ethanoi on a Bio-Rad Labs Aminex HPX87H 300 x 7.8 mm, analytical HPLC column employing a mobile phase of 0.005N H2SO at a flow rate of 0.6 ml/minute.

[0165] As shown in control experiments in Figure 1 1 , the parental wild ty e strain TR3 or the mutant, TR3 haal mut2, in the absence of exogenous acetate can completely ferment glucose present in saccharified sugarcane bagasse within 24 hours to quantitatively yield ethanoi.

[0166] As also shown in the control experiments, the level of acetate present as a result of pretreatment of the biomass, approximately 0.2% w/v, does not significantly inhibit the fermentation. When fermentation flasks were supplemented with acetate to greater than 0.8% w/v, strong inhibitory effects were observed with the parental strain TR3, with only about 20% of the glucose utilized at 24 hrs. Irs contrast, the TR3 haal mut2 strain utilized 80% of the available glucose under the same conditions at 24 hrs and produced almost twice as much ethanol. Specifically, there is greater than 50% increase in titer of fermentation product performed by the haal mutant strain compared to control strain under conditions of greater than 0.8%) w/v acetate under at 24 hrs under these fermentation conditions using

pretreated-'saccharified bagasse biomass.

[0167] Similar results were obtained with pretreated saccharified corn stover at the same solids concentration as sugar cane bagasse. The faster fermentation kinetics of the improved haal nxut2 engineered yeast strain is useful in commercial operations where higher and faster throughput can lower operational expense.

[0168] It is understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. All publications, sequence accession numbers, patents, and patent applications cited herein are hereby incorporated by reference in their entirety for all purposes.