Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
THERMOSTABLE C. BESCII ENZYMES
Document Type and Number:
WIPO Patent Application WO/2012/088165
Kind Code:
A1
Abstract:
The disclosure provides thermostable enzymes isolated from Caldicellulosiruptor bescii and fragments thereof useful for the degradation of cellulose and/or hemicellulose, including thermostable cellulases and hemicellulases. The disclosure further provides nucleic acids encoding the thermostable enzymes of the disclosure. The disclosure also provides methods for the conversion of cellulose and hemicellulose into fermentable sugars using thermostable enzymes of the disclosure. The disclosure also provides enzyme cocktails containing multiple enzymes disclosed herein. The enzymes can be used to release sugars present in cellulose or hemicellulose for subsequent fermentation to produce value-added products.

Inventors:
HAN YEJUN (US)
MOON YOUNG HWAN (US)
SU XAIOYUN (US)
YOSHIDA SHOSUKE (US)
MIYAGI ATSUSHI (US)
DODD DYLAN (US)
MACKIE RODERICK I (US)
CANN ISAAC K O (US)
Application Number:
PCT/US2011/066272
Publication Date:
June 28, 2012
Filing Date:
December 20, 2011
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV ILLINOIS (US)
HAN YEJUN (US)
MOON YOUNG HWAN (US)
SU XAIOYUN (US)
YOSHIDA SHOSUKE (US)
MIYAGI ATSUSHI (US)
DODD DYLAN (US)
MACKIE RODERICK I (US)
CANN ISAAC K O (US)
International Classes:
C12P1/04; C07K14/195
Domestic Patent References:
WO2010075529A22010-07-01
WO2010027857A22010-03-11
WO2006056819A12006-06-01
Other References:
YANG SUNG-JAE ET AL: "Efficient Degradation of Lignocellulosic Plant Biomass, without Pretreatment, by the Thermophilic Anaerobe Anaerocellum thermophilum DSM 6725", APPLIED AND ENVIRONMENTAL MICROBIOLOGY, AMERICAN SOCIETY FOR MICROBIOLOGY, US, vol. 75, no. 14, 1 July 2009 (2009-07-01), pages 4762 - 4769, XP002557402, ISSN: 0099-2240, [retrieved on 20090522], DOI: 10.1128/AEM.00236-09
I. A. KATAEVA ET AL: "Genome Sequence of the Anaerobic, Thermophilic, and Cellulolytic Bacterium "Anaerocellum thermophilum" DSM 6725", JOURNAL OF BACTERIOLOGY, vol. 191, no. 11, 1 June 2009 (2009-06-01), pages 3760 - 3761, XP055022135, ISSN: 0021-9193, DOI: 10.1128/JB.00256-09
DATABASE UniProt [online] 24 March 2009 (2009-03-24), "SubName: Full=Glycoside hydrolase family 9; Flags: Precursor;", XP002671899, retrieved from EBI accession no. UNIPROT:B9MKU5 Database accession no. B9MKU5
DATABASE UniProt [online] 24 March 2009 (2009-03-24), "SubName: Full=Mannan endo-1,4-beta-mannosidase., Cellulase; EC=3.2.1.4; EC=3.2.1.78; Flags: Precursor;", XP002671900, retrieved from EBI accession no. UNIPROT:B9MKU6 Database accession no. B9MKU6
DATABASE UniProt [online] 24 March 2009 (2009-03-24), "SubName: Full=Glycoside hydrolase family 48; Flags: Precursor;", XP002671901, retrieved from EBI accession no. UNIPROT:B9MKU7 Database accession no. B9MKU7
DATABASE UniProt [online] 24 March 2009 (2009-03-24), "SubName: Full=Glycoside hydrolase family 5; Flags: Precursor;", XP002671902, retrieved from EBI accession no. UNIPROT:B9MKT9 Database accession no. B9MKT9
DATABASE UniProt [online] 24 March 2009 (2009-03-24), "SubName: Full=Cellulase; EC=3.2.1.4; Flags: Precursor;", XP002671903, retrieved from EBI accession no. UNIPROT:B9MPF9 Database accession no. B9MPF9
DATABASE UniProt [online] 24 March 2009 (2009-03-24), "SubName: Full=Beta-galactosidase; EC=3.2.1.21;", XP002671904, retrieved from EBI accession no. UNIPROT:B9MNR1 Database accession no. B9MNR1
DATABASE UniProt [online] 24 March 2009 (2009-03-24), "SubName: Full=Heat shock protein Hsp20;", XP002671905, retrieved from EBI accession no. UNIPROT:B9MJU9 Database accession no. B9MJU9
BLUMER-SCHUETTE SARA E ET AL: "Phylogenetic, Microbiological, and Glycoside Hydrolase Diversities within the Extremely Thermophilic, Plant Biomass-Degrading Genus Caldicellulosiruptor", APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 76, no. 24, 22 October 2010 (2010-10-22), pages 8084 - 8092, XP002671943
LAWRENCE P WACKETT: "Engineering microbes to produce biofuels", CURRENT OPINION IN BIOTECHNOLOGY, vol. 22, no. 3, 9 November 2010 (2010-11-09), pages 388 - 393, XP055022488, ISSN: 0958-1669, DOI: 10.1016/j.copbio.2010.10.010
BLUMER-SCHUETTE S E ET AL: "Extremely thermophilic microorganisms for biomass conversion: status and prospects", CURRENT OPINION IN BIOTECHNOLOGY, LONDON, GB, vol. 19, no. 3, 1 June 2008 (2008-06-01), pages 210 - 217, XP022734223, ISSN: 0958-1669, [retrieved on 20080602], DOI: 10.1016/J.COPBIO.2008.04.007
S.-J. YANG ET AL: "Classification of 'Anaerocellum thermophilum' strain DSM 6725 as Caldicellulosiruptor bescii sp. nov.", INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, vol. 60, no. 9, 1 September 2010 (2010-09-01), pages 2011 - 2015, XP055022136, ISSN: 1466-5026, DOI: 10.1099/ijs.0.017731-0
P. DAM ET AL: "Insights into plant biomass conversion from the genome of the anaerobic thermophilic bacterium Caldicellulosiruptor bescii DSM 6725", NUCLEIC ACIDS RESEARCH, vol. 39, no. 8, 1 April 2011 (2011-04-01), pages 3240 - 3254, XP055002293, ISSN: 0305-1048, DOI: 10.1093/nar/gkq1281
AMY L. VANFOSSEN ET AL: "Glycoside hydrolase inventory drives plant polysaccharide deconstruction by the extremely thermophilic bacterium Caldicellulosiruptor saccharolyticus", BIOTECHNOLOGY AND BIOENGINEERING, vol. 108, no. 7, 1 July 2011 (2011-07-01), pages 1559 - 1569, XP055022471, ISSN: 0006-3592, DOI: 10.1002/bit.23093
CANN ET AL., J. BACTERIAL., vol. 181, 1999, pages 1643 - 1651
LEVER, M.: "A new reaction for colorimetric determination carbohydrates", ANAL. BIOCHEM., vol. 47, 1972, pages 273 - 279, XP024820395, DOI: doi:10.1016/0003-2697(72)90301-6
HIGGINS ET AL., GENE, vol. 73, 1988, pages 237 244
HIGGINS ET AL., CABIOS, vol. 5, 1989, pages 151 153
CORPET ET AL., NUCLEIC ACIDS RES., vol. 16, 1988, pages 10881 90
HUANG ET AL., CABIOS, vol. 8, 1992, pages 155 65
PEARSON ET AL., METH. MOL. BIOL., vol. 24, 1994, pages 307 331
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403
ALTSCHUL ET AL., NUCLEIC ACIDS RES., vol. 25, 1997, pages 3389
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 2001, COLD SPRING HARBOR LABORATORY PRESS
KIM JS; LEE YY; TORGET, RW.: "Cellulose hydrolysis under extremely low sulfuric acid and high-temperature conditions", APPL. BIOCHEM. BIOTECHNOL., vol. 91-93, 2001, pages 331 - 340
ZVERLOV V; MAHR S; RIEDEL K; BRONNENMEIER K: "Properties and gene structure of a bifunctional cellulolytic enzyme (CelA) from the extreme thermophile 'Anaerocellum thermophilum' with separate glycosyl hydrolase family 9 and 48 catalytic domains", MICROBIOLOGY, vol. 144, 1998, pages 457 - 465, XP002195181
TE'O VS; SAUL DJ; BERGQUIST PL: "celA, another gene coding for a multidomain cellulase from the extreme thermophile Caldocellum saccharolyticum", APPL MICROBIOL BIOTECHNOL, vol. 43, 1995, pages 291 - 296, XP001064651, DOI: doi:10.1007/s002530050405
HALLD6RSD6TTIR S; THÓRÓLFSDÓTTIR ET; SPILLIAERT R; JOHANSSON M; THORBJAMARDOTTIR SH; PALSDOTTIR A; HREGGVIDSSON GO; KRISTJANSSO: "Cloning, sequencing and overexpression of a Rhodothermus marinus gene encoding a thermostable cellulase of glycosyl hydrolase family 12", APPL MICROBIOL BIOTECHNOL, vol. 49, 1998, pages 277 - 284, XP002902257, DOI: doi:10.1007/s002530051169
KIM D; PARK BH; JUNG B-W; KIM M-K; HONG SI; LEE, DS: "Identification and molecular modeling of a family 5 endocellulase from Thermus caldophilus GK24, a cellulolytic strain of Thermus thermophilus", INT J MOL SCI, vol. 7, 2006, pages 571 - 589
MAWADZA, C; HATTI-KAUL, R.; ZVAUYA, R.; MATTIASSON, B.: "Purification and characterization of cellulases produced by two Bacillus strains", J. BIOTECHNOL., vol. 83, 2000, pages 177 - 187, XP027424100, DOI: doi:10.1016/S0168-1656(00)00305-9
KENGEN, S.; LUESINK, E.; STAMS, A.; ZEHNDER, A.: "Purification and characterization of an extremely thermostable p-glucosidase from the hyperthermophilic archaeon Pyrococccusfuriosus", EUR. J. BIOCHEM., vol. 213, 1993, pages 305 - 312, XP001312978
ANDO, S.; ISHIDA, H.; KOSUGI, Y.; ISHIKAWA, K.: "Hyperthermostable endoglucanase from Pyrococcus horikoshi", APPL. ENVIRON. MICROBIOL., vol. 68, 2002, pages 430 - 433, XP002976319, DOI: doi:10.1128/AEM.68.1.430-433.2002
HREGGVIDSSON, GO.; KAISTE, E.; HOIST, O.; EGGERTSSON, G.; PALSDOTTIR, A.; KRISTJANSSON, J.K.: "An extremely thermostable cellulase from the thermophilic eubacterium Rhodothermus marinus", APPL. ENVIRON. MICROBIOL., vol. 62, 1996, pages 3047 - 3049
BRONNENMEIER, K.; KEM, A.; LIBEL, W.; STAUDENBAUER, W.: "Purification of Thermatoga maritema enzymes for the degradation of cellulose materials", APPL. ENVIRON. MICROBIOL., vol. 61, 1995, pages 1399 - 1407
BOK, J.; GOERS, S.; EVELEIGH, D.: "Cellulase and xylanase systems of Thermatoga neapolitana", ACS SYMP. SER., vol. 566, 1994, pages 54 - 65
BOK, J.; DIENESH, A.; YEMOOL, D.; EVELEIGH, D.: "Purification, characterization and molecular analysis of thermostable cellulases CeIA and CeIB from Thermatoga neapolitana", APPL. ENVIRON. MICROBIOL., vol. 64, 1998, pages 4774 - 4781
MORAG, E.; BAYER, E.A.; LAMED, R.: "Relationship of cellulosomal and non-cellulosomal xylanases of Clostridium thermocellum to cellulose degrading enzymes", J. BACTERIOL., vol. 172, 1990, pages 6098 - 6105
ANDO, S.; H. ISHIDA; Y. KOSUGI; K. ISHIKAWA: "Hyperthermostable endoglucanase from Pyrococcus horikoshii", APPL ENVIRON MICROBIOL, vol. 68, 2002, pages 430 - 433, XP002976319, DOI: doi:10.1128/AEM.68.1.430-433.2002
ARAI, T. ET AL.: "Sequence of celQ and properties of celQ, a component of the Clostridium thermocellum cellulosome", APPL MICROBIOL BIOTECHNOL, vol. 57, 2001, pages 660 - 666
BAUER, M. W. ET AL.: "An endoglucanase, EglA, from the hyperthermophilic archaeon Pyrococcusfuriosus hydrolyzes b-1,4 bonds in mixed-linkage (1-->3),(1-->4)-b-D-glucans and cellulose", J BACTERIOL, vol. 181, 1999, pages 284 - 290
BLUMER-SCHUETTE, S. E.; D. L. LEWIS; R. M. KELLY: "Phylogenetic, microbiological, and glycoside hydrolase diversities within the extremely thermophilic, plant biomass-degrading genus Caldicellulosiruptor", APPL ENVIRON MICROBIOL, vol. 76, 2010, pages 8084 - 8092, XP002671943, DOI: doi:10.1128/aem.01400-10
BOK, J. D.; D. A. YEMOOL; D. E. EVELEIGH: "Purification, characterization, and molecular analysis of thermostable cellulases CelA and CelB from Thermotoga neapolitana", APPL ENVIRON MICROBIOL, vol. 64, 1998, pages 4774 - 4781, XP002330149
BRONNENMEIER, K.; W. L. STAUDENBAUER: "Cellulose hydrolysis by a highly thermostable endo-1,4-b-glucanase (Avicelase I) from Clostridium stercorarlum.", ENZYME MICROB TECHNOL, vol. 12, 1990, pages 431 - 436, XP023854226, DOI: doi:10.1016/0141-0229(90)90053-S
CHIRIAC, A. I. ET AL.: "Engineering a family 9 processive endoglucanase from Paenibacillus barcinonensis displaying a novel architecture", APPL MICROBIOL BIOTECHNOL, vol. 86, 2010, pages 1125 - 1134, XP019799897
DAM, P. ET AL.: "Insights into plant biomass conversion from the genome of the anaerobic thermophilic bacterium Caldicellulosiruptor bescii DSM 6725", NUCLEIC ACIDS RES, vol. 39, 2011, pages 3240 - 3254, XP055002293, DOI: doi:10.1093/nar/gkq1281
DODD, D. ET AL.: "Biochemical analysis of a b-D-xylosidase and a bifunctional xylanase-ferulic acid esterase from a xylanolytic gene cluster in Prevotella ruminicola 23", J BACTERIOL, vol. 191, 2009, pages 3328 - 3338, XP055111901, DOI: doi:10.1128/JB.01628-08
GAL, L. ET AL.: "CeIG from Clostridium cellulolyticum: a multidomain endoglucanase acting efficiently on crystalline cellulose", J BACTERIOL, vol. 179, 1997, pages 6595 - 6601
GIBBS, M. D. ET AL.: "Multidomain and multifunctional glycosyl hydrolases from the extreme thermophile Caldicellulosiruptor isolate Tok7B.1", CURR MICROBIOL, vol. 40, 2000, pages 333 - 340, XP002218962, DOI: doi:10.1007/s002849910066
GILAD, R. ET AL.: "Cell, a noncellulosomal family 9 enzyme from Clostridium thermocellum, is a processive endoglucanase that degrades crystalline cellulose", J BACTERIOL, vol. 185, 2003, pages 391 - 398
HOFFMAN, G. G.; O. DAVULCU; S. SONA; W. R. ELLINGTON: "Contributions to catalysis and potential interactions of the three catalytic domains in a contiguous trimeric creatine kinase", FEBS J, vol. 275, 2008, pages 646 - 654
HONDA, Y.; N. SHIMAYA; K. ISHISAKI; M. EBIHARA; H. TANIGUCHI.: "Elucidation of exo-b-D-glucosaminidase activity of a family 9 glycoside hydrolase (PBPRA0520) from Photobacterium profundum SS9", GLYCOBIOLOGY, vol. 21, 2011, pages 503 - 511
HORN, S. J. ET AL.: "Costs and benefits of processivity in enzymatic degradation of recalcitrant polysaccharides", PROC NATL ACAD SCI USA, vol. 103, 2006, pages 18089 - 18094
IRWIN, D. ET AL.: "Roles of the catalytic domain and two cellulose binding domains of Thermomonospora fusca E4 in cellulose hydrolysis", J BACTERIOL, vol. 180, 1998, pages 1709 - 1714
IRWIN, D. C.; M. SPEZIO; L. P. WALKER; D. B. WILSON: "Activity studies of eight purified cellulases: Specificity, synergism, and binding domain effects", BIOTECHNOL BIOENG, vol. 42, 1993, pages 1002 - 1013, XP002523929, DOI: doi:10.1002/bit.260420811
JAURIS, S. ET AL.: "Sequence analysis of the Clostridium stercorarium celZ gene encoding a thermoactive cellulase (Avicelase I): identification of catalytic and cellulose-binding domains", MOL GEN GENET, vol. 223, 1990, pages 258 - 267, XP008179742, DOI: doi:10.1007/BF00265062
JINDOU, S. ET AL.: "Novel architecture of family-9 glycoside hydrolases identified in cellulosomal enzymes of Acetivibrio cellulolyticus and Clostridium thermocellum", FEMS MICROBIOL LETT, vol. 254, 2006, pages 308 - 316
KATAEVA, I. A. ET AL.: "Genome sequence of the anaerobic, thermophilic, and cellulolytic bacterium ''Anaerocellum thermophilum'' DSM 6725", J BACTERIOL, vol. 191, 2009, pages 3760 - 3761, XP055022135, DOI: doi:10.1128/JB.00256-09
LEVER, M.: "A new reaction for colorimetric determination of carbohydrates", ANAL BIOCHEM, vol. 47, 1972, pages 273 - 279, XP024820395, DOI: doi:10.1016/0003-2697(72)90301-6
LI, Y.; D. C. IRWIN; D. B. WILSON: "Increased crystalline cellulose activity via combinations of amino acid changes in the family 9 catalytic domain and family 3c cellulose binding module of Thermobifidafusca Cel9A", APPL ENVIRON MICROBIOL, vol. 76, 2010, pages 2582 - 2588
LI, Y.; D. C. IRWIN; D. B. WILSON: "Processivity, substrate binding, and mechanism of cellulose hydrolysis by Thermobifidafusca Ce19A", APPL ENVIRON MICROBIOL, vol. 73, 2007, pages 3165 - 3172
LIANG, C. ET AL.: "Cloning and characterization of a thermostable and halo-tolerant endoglucanase from Thermoanaerobacter tengcongensis MB4", APPL MICROBIOL BIOTECHNOL, vol. 89, 2011, pages 315 - 326
LOCHNER, A. ET AL.: "Use of label-free quantitative proteomics to distinguish the secreted cellulolytic systems of Caldicellulosiruptor bescii and Caldicellulosiruptor obsidiansis", APPL ENVIRON MICROBIOL, vol. 77, 2011, pages 4042 - 4054
MANDELMAN, D. ET AL.: "X-Ray crystal structure of the multidomain endoglucanase Cel9G from Clostridium cellulolyticum complexed with natural and synthetic cello- oligosaccharides", J BACTERIOL, vol. 185, 2003, pages 4127 - 4135
MEINKE, A. ET AL.: "Unusual sequence organization in CenB, an inverting endoglucanase from Cellulomonas fimi.", J BACTERIOL, vol. 173, 1991, pages 308 - 314
MEJIA-CASTILLO, T.; M. E. HIDALGO-LARA; L. G. BRIEBA; J. ORTEGA-LOPEZ: "Purification, characterization and modular organization of a cellulose-binding protein, CBP105, a processive b-1,4-endoglucanase from Cellulomonas flavigena", BIOTECHNOL LETT, vol. 30, 2008, pages 681 - 687, XP019570037
MOON, Y. H.; M. LAKIVIAK; S. BAUER; R. I. MACKIE; I. K. CANN: "Biochemical analyses of multiple endoxylanases from the rumen bacterium Ruminococcus albus 8 and their synergistic activities with accessory hemicellulose degrading enzymes", APPL ENVIRON MICROBIOL, 2011
PARK, J. K.; L. X. WANG; H. V. PATEL; S. ROSEMAN: "Molecular cloning and characterization of a unique b-glucosidase from Vibrio cholerae", J BIOL CHEM, vol. 277, 2002, pages 29555 - 29560
PARSIEGLA, G.; A. BELAICH; J. P. BELAICH; R. HASER: "Crystal structure of the cellulase Cel9M enlightens structure/function relationships of the variable catalytic modules in glycoside hydrolases", BIOCHEMISTRY, vol. 41, 2002, pages 11134 - 11142
PASTOR, F. I. ET AL.: "Molecular cloning and characterization of a multidomain endoglucanase from Paenibacillus sp BP-23: evaluation of its performance in pulp refining", APPL MICROBIOL BIOTECHNOL, vol. 55, 2001, pages 61 - 68
QI, M.; H. S. JUN; C. W. FORSBERG: "Cel9D, an atypical 1,4-b-D-glucan glucohydrolase from Fibrobacter succinogenes: characteristics, catalytic residues, and synergistic interactions with other cellulases", J BACTERIOL, vol. 190, 2008, pages 1976 - 1984
SAKON, J.; D. IRWIN; D. B. WILSON; P. A. KARPLUS: "Structure and mechanism of endo/exocellulase E4 from ThermomonosporaJusca", NAT STRUCT BIOL, vol. 4, 1997, pages 810 - 818
SAUL, D. J. ET AL.: "celB, a gene coding for a bifunctional cellulase from the extreme thermophile ''Caldocellum saccharolyticum''.", APPL ENVIRON MICROBIOL, vol. 56, 1990, pages 3117 - 3124
SCHUBOT, F. D. ET AL.: "Structural basis for the exocellulase activity of the cellobiohydrolase CbhA from Clostridium thermocellum.", BIOCHEMISTRY, vol. 43, 2004, pages 1163 - 1170
SU, X. ET AL.: "Mutational insights into the roles of amino acid residues in ligand binding for two closely related family 16 carbohydrate binding modules", J BIOL CHEM, vol. 285, 2010, pages 34665 - 34676
TAMARU, Y.; S. KARITA; A. IBRAHIM; H. CHAN; R. H. DOI.: "A large gene cluster for the Clostridium cellulovorans cellulosome", J BACTERIOL, vol. 1 82, 2000, pages 5906 - 59 1 0
TOLONEN, A. C.; A. C. CHILAKA; G. M. CHURCH: "Targeted gene inactivation in Clostridium phytofermentans shows that cellulose degradation requires the family 9 hydrolase Cphy3367", MOL MICROBIOL, vol. 74, 2009, pages 1300 - 1313
UDA, K. ET AL.: "Two-domain arginine kinase from the deep-sea clam Calyptogena kaikoi--evidence of two active domains", COMP BIOCHEM PHYSIOL B BIOCHEM MOL BIOL, vol. 151, 2008, pages 176 - 182, XP024341113, DOI: doi:10.1016/j.cbpb.2008.06.012
VANFOSSEN, A. L.; I. OZDEMIR; S. L. ZELIN; R. M. KELLY: "Glycoside hydrolase inventory drives plant polysaccharide deconstruction by the extremely thermophilic bacterium Caldicellulosiruptor saccharolyticus", BIOTECHNOL BIOENG, vol. 108, 2011, pages 1559 - 1569, XP055022471, DOI: doi:10.1002/bit.23093
VLASENKO, E.; M. SCHULEIN; J. CHERRY; F. XU.: "Substrate specificity of family 5, 6, 7, 9, 12, and 45 endoglucanases", BIORESOUR TECHNOL, vol. 101, pages 2405 - 2411, XP026822534
YANG, S. J. ET AL.: "Classification of 'Anaerocellum thermophilum' strain DSM 6725 as Caldicellulosiruptor bescii sp. nov", INT J SYST EVOL MICROBIOL, vol. 60, 2010, pages 2011 - 2015, XP055022136, DOI: doi:10.1099/ijs.0.017731-0
YANG, S. J. ET AL.: "Efficient degradation of lignocellulosic plant biomass, without pretreatment, by the thermophilic anaerobe ''Anaerocellum thermophilum'' DSM 6725", APPL ENVIRON MICROBIOL, vol. 75, 2009, pages 4762 - 4769, XP002557402, DOI: doi:10.1128/AEM.00236-09
YEOMAN, C. J. ET AL.: "Thermostable enzymes as biocatalysts in the biofuel industry", ADV APPL MICROBIOL, vol. 70, 2010, pages 1 - 55, XP009143248, DOI: doi:10.1016/S0065-2164(10)70001-0
YOSHIDA, S.; R. I. MACKIE; I. K. CANN: "Biochemical and domain analyses of FSUAxe6B, a modular acetyl xylan esterase, identify a unique carbohydrate binding module in Fibrobacter succinogenes S85", J BACTERIOL, vol. 192, 2010, pages 483 - 493
ZAKARIASSEN, H. ET AL.: "Aromatic residues in the catalytic center of chitinase A from Serratia marcescens affect processivity, enzyme activity, and biomass converting efficiency", J BIOLCHEM, vol. 284, 2009, pages 10610 - 10617
ZHANG, X. Z.; N. SATHITSUKSANOH; Y. H. ZHANG: "Glycoside hydrolase family 9 processive endoglucanase from Clostridium phytofermentans: heterologous expression, characterization, and synergy with family 48 cellobiohydrolase", BIORESOUR TECHNOL, vol. 101, 2010, pages 5534 - 5538
ZVERLOV, V.; S. MAHR; K. RIEDEL; K. BRONNENMEIER: "Properties and gene structure of a bifunctional cellulolytic enzyme (CelA) from the extreme thermophile 'Anaerocellum thermophilu»i with separate glycosyl hydrolase family 9 and 48 catalytic domains", MICROBIOLOGY, vol. 144, 1998, pages 457 - 465, XP002195181
ZVERLOV, V. V.; G. A. VELIKODVORSKAYA; W. H. SCHWARZ: "Two new cellulosome components encoded downstream of cell in the genome of Clostridium thermocellum: the non-processive endoglucanase CeIN and the possibly structural protein CseP", MICROBIOLOGY, vol. 149, 2003, pages 515 - 524
Attorney, Agent or Firm:
WARD, Michael R. et al. (425 Market StreetSan Francisco, CA, US)
Download PDF:
Claims:
CLAIMS

What is claimed:

1. A host cell, comprising two or more recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

2. The host cell of claim 1, comprising three recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

3. The host cell of claim 1, comprising four recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

4. The host cell of claim 1, comprising five recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

5. The host cell of claim 1, comprising six recombinant nucleic acids selected from the

group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

6. A host cell, comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3

[Caldicellulosiruptor bescii endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc- arabinofuranosidase (CM 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], and g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37 [Caldicellulosiruptor bescii endoxylanase (Cbl93) lacking signal peptide].

7. A method for producing at least two of the enzymes selected from the group consisting of endoxylanase, cc-arabinofuranosidase, cc-glucuronidase, β-xylosidase, and acetyl xylan esterase, comprising: culturing the host cell of any one of claims 1-6 in a culture medium, under suitable conditions to produce the endoxylanase, cc-arabinofuranosidase, cc- glucuronidase, β-xylosidase, and acetyl xylan esterase.

8. The host cell of any one of claims 1-6, the host cell further comprises one or more

recombinant nucleic acids, wherein the recombinant nucleic acids encode one or more polypeptides selected from the group consisting of: CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides.

9. The host cell of claim 8, comprising two or more recombinant nucleic acids, wherein the recombinant nucleic acids encode two or more of said polypeptides.

10. The host cell of claim 8, comprising three or more recombinant nucleic acids, wherein the recombinant nucleic acids encode three or more of said polypeptides.

11. The host cell of claim 8, comprising four or more recombinant nucleic acids, wherein the recombinant nucleic acids encode four or more of said polypeptides.

12. The host cell of claim 8, comprising five or more recombinant nucleic acids, wherein the recombinant nucleic acids encode five or more of said polypeptides.

13. The host cell of claim 8, comprising six or more recombinant nucleic acids, wherein the recombinant nucleic acids encode six of said polypeptides.

14. The host cell of any one of claims 8-13,

wherein the CM952 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 44, 114, 124, 126, 128, and 46;

wherein the CM953 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 60, 61, and 111; wherein the CM954 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 74, 121, and 76;

wherein the CM946 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 86, 87, and 113;

wherein the Cb629 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 98, 119, and 100; and

wherein the Cb486 polypeptide has a sequence of SEQ ID NO: 106.

15. The host cell of claim 14,

wherein the CM952 polypeptide has the sequence of SEQ ID NO: 46;

wherein the CM953 polypeptide has the sequence of SEQ ID NO: 111;

wherein the CM954 polypeptide has the sequence of SEQ ID NO: 76;

wherein the CM946 polypeptide has the sequence of SEQ ID NO: 113;

wherein the Cb629 polypeptide has the sequence of SEQ ID NO: 100; and wherein the Cb486 polypeptide has the sequence of SEQ ID NO: 106.

16. The host cell of any one of claims 8-13,

wherein the recombinant nucleic acid encoding a CM952 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 45, 115, 125, 127, 129, and 47; wherein the recombinant nucleic acid encoding a CM953 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 62, 63, and 110;

wherein the recombinant nucleic acid encoding a CM954 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 116, 75, and 77;

wherein the recombinant nucleic acid encoding a CM946 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 88, 89, and 112;

wherein the recombinant nucleic acid encoding a Cb629 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 99, 120, and 101; and, wherein the recombinant nucleic acid encoding a Cb486 polypeptide has the sequence of SEQ ID NO: 107.

17. The host cell of claim 16,

wherein the recombinant nucleic acid encoding a CM952 polypeptide has the sequence of SEQ ID NO: 47;

wherein the recombinant nucleic acid encoding a CM953 polypeptide has the sequence of SEQ ID NO: 110;

wherein the recombinant nucleic acid encoding a CM954 polypeptide has the sequence of SEQ ID NO: 77;

wherein the recombinant nucleic acid encoding a CM946 polypeptide has the sequence of SEQ ID NO: 112;

wherein the recombinant nucleic acid encoding a Cb629 polypeptide has the sequence of SEQ ID NO: 101; and,

wherein the recombinant nucleic acid encoding a Cb486 polypeptide has the sequence of SEQ ID NO: 107.

18. The host cell of any one of claims 1-6, wherein the host cell further comprises six

recombinant nucleic acids having the sequences of SEQ ID NOs: 47, 110, 77, 112, 101, and 107.

19. The host cell of any one of claims 1-6, wherein the host cell further comprises one or more recombinant nucleic acids encoding one or more polypeptides that having a sequence selected from the group consisting of SEQ ID NOs: 46, 111, 76, 113, 124, 126, 128, and 100.

20. A host cell comprising one or more recombinant nucleic acids, wherein the recombinant nucleic acids encode one or more polypeptides selected from the group consisting of: CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides.

21. The host cell of claim 20, comprising two or more recombinant nucleic acids, wherein the recombinant nucleic acids encode two or more of said polypeptides.

22. The host cell of claim 20, comprising three or more recombinant nucleic acids, wherein the recombinant nucleic acids encode three or more of said polypeptides.

23. The host cell of claim 20, comprising four or more recombinant nucleic acids, wherein the recombinant nucleic acids encode four or more of said polypeptides.

24. The host cell of claim 20, comprising five or more recombinant nucleic acids, wherein the recombinant nucleic acids encode five or more of said polypeptides.

25. The host cell of claim 20, comprising six or more recombinant nucleic acids, wherein the recombinant nucleic acids encode six of said polypeptides.

26. The host cell of any one of claims 20-25,

wherein the CM952 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 44, 114, 124, 126, 128, and 46;

wherein the CM953 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 60, 61, and 111;

wherein the CM954 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 74, 121, and 76;

wherein the CM946 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 86, 87, and 113;

wherein the Cb629 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 98, 119, and 100; and

wherein the Cb486 polypeptide has a sequence of SEQ ID NO: 106.

27. The host cell of claim 26,

wherein the CM952 polypeptide has the sequence of SEQ ID NO: 46; wherein the CM953 polypeptide has the sequence of SEQ ID NO: 111; wherein the CM954 polypeptide has the sequence of SEQ ID NO: 76;

wherein the CM946 polypeptide has the sequence of SEQ ID NO: 113;

wherein the Cb629 polypeptide has the sequence of SEQ ID NO: 100; and

wherein the Cb486 polypeptide has the sequence of SEQ ID NO: 106.

28. The host cell of any one of claims 20-25,

wherein the recombinant nucleic acid encoding a CM952 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 45, 115, 125, 127, 129, and 47;

wherein the recombinant nucleic acid encoding a CM953 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 62, 63, and 110;

wherein the recombinant nucleic acid encoding a CM954 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 116, 75, and 77;

wherein the recombinant nucleic acid encoding a CM946 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 88, 89, and 112;

wherein the recombinant nucleic acid encoding a Cb629 polypeptide has a sequence selected from the group consisting of SEQ ID NOs: 99, 120, and 101; and,

wherein the recombinant nucleic acid encoding a Cb486 polypeptide has the sequence of SEQ ID NO: 107.

29. The host cell of claim 28,

wherein the recombinant nucleic acid encoding a CM952 polypeptide has the sequence of SEQ ID NO: 47;

wherein the recombinant nucleic acid encoding a CM953 polypeptide has the sequence of SEQ ID NO: 110;

wherein the recombinant nucleic acid encoding a CM954 polypeptide has the sequence of SEQ ID NO: 77; wherein the recombinant nucleic acid encoding a CM946 polypeptide has the sequence of SEQ ID NO: 112;

wherein the recombinant nucleic acid encoding a Cb629 polypeptide has the sequence of SEQ ID NO: 101; and,

wherein the recombinant nucleic acid encoding a Cb486 polypeptide has the sequence of SEQ ID NO: 107.

30. A host cell comprising six recombinant nucleic acids, wherein the nucleic acids have the sequences of SEQ ID NOs: 47, 110, 77, 112, 101, and 107.

31. A host cell comprising one or more recombinant nucleic acids, wherein the nucleic acids encode one or more polypeptides that has a sequence selected from the group consisting of SEQ ID NOs: 46, 111, 76, 113, 124, 126, 128, and 100.

32. The host cell of any one of claims 20-31, wherein the host cell further comprises two or more recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

33. The host cell of claim 32, comprising three recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

34. The host cell of claim 32, comprising four recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

35. The host cell of claim 32, comprising five recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

36. The host cell of claim 32, comprising six recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

37. The host cell of any one of claims 20-31, wherein, the host cell further comprises two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii endoxylanase (CM93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7

[Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (Cbl 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19

[Caldicellulosiruptor bescii cc-glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl 62)], and g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37 [Caldicellulosiruptor bescii endoxylanase (Cbl 93) lacking signal peptide].

38. A method for producing one or more cellulases, the method comprising:

a) culturing the host cell of any one of claims 20-37 in culture media under conditions sufficient to support the expression of said recombinant nucleic acid(s); and

b) collecting one or more cellulases from said media and/or said host cell.

39. The host cell of any one of claims 1-6 and 8-37, wherein two or more of said recombinant nucleic acids are present in a contiguous polydeoxyribonucleotide chain.

40. The host cell of any one of claims 1-6, 8-37, and 39, wherein the host cell is selected from the group consisting of Escherichia, Pseudomonas, Proteus, Ralstonia,

Streptomyces, Staphylococcus, Lactococcus, Bacillus, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Yarrowia lipolytica, Hansenula polymorpha,

Kluyveromyces lactis, Pichia pastoris, Aspergillus, Chrysosporium lucknowense, or Trichoderma reesei.

41. A composition comprising the host cell of any one of claims 1-19 and culture medium.

42. A composition comprising two or more recombinant proteins, wherein the recombinant proteins are selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37.

43. The composition of claim 42, comprising six recombinant proteins, wherein the recombinant proteins are selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37.

44. The composition of claim 42 or claim 43, further comprising one or more recombinant polypeptides selected from the group consisting of CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides.

45. The composition of claim 44,

wherein the recombinant CM952 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46;

wherein the recombinant CM953 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 60, 61, and 111;

wherein the recombinant CM954 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 74, 121, and 76;

wherein the recombinant CM946 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 86, 87, and 113;

wherein the recombinant Cb629 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 98, 119, and 100; and,

wherein the recombinant Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

46. The composition of claim 42 or claim 43, further comprising six recombinant

polypeptides, wherein the recombinant polypeptides are the polypeptides of SEQ ID NOs: 46, 111, 76, 113, 100, and 106.

47. The composition of claim 42 or claim 43, further comprising one or more recombinant polypeptides, wherein the one or more recombinant polypeptides are selected from the group consisting of the polypeptides of SEQ ID NOs: 46, 111, 76, 113, 124, 126, 128, and 100.

48. A composition comprising one or more recombinant polypeptides, wherein the one or more recombinant polypeptides are selected from the group consisting of: CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides.

49. The composition of claim 48, comprising two or more of said polypeptides.

50. The composition of claim 48, comprising three or more of said polypeptides.

51. The composition of claim 48, comprising four or more of said polypeptides.

52. The composition of claim 48, comprising five or more of said polypeptides.

53. The composition of claim 48, comprising six or more of said polypeptides.

54. The composition of any one of claims 48-53,

wherein the recombinant CM952 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46;

wherein the recombinant CM953 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 60, 61, and 111;

wherein the recombinant CM954 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 74, 121, and 76;

wherein the recombinant CM946 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 86, 87, and 113;

wherein the recombinant Cb629 polypeptide is selected from the group consisting of the polypeptides of SEQ ID NOs: 98, 119, and 100; and,

wherein the recombinant Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

55. The composition of claim 53,

wherein the recombinant CM952 polypeptide is the polypeptide of SEQ ID NO: 46; wherein the recombinant CM953 polypeptide is the polypeptide of SEQ ID NO: 111; wherein the recombinant CM954 polypeptide is the polypeptide of SEQ ID NO: 76; wherein the recombinant CM946 polypeptide is the polypeptide of SEQ ID NO: 113; wherein the recombinant Cb629 polypeptide is the polypeptide of SEQ ID NO: 100; and, wherein the recombinant Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

56. A composition comprising six recombinant polypeptides, wherein the recombinant

polypeptides are the polypeptides of SEQ ID NOs: 46, 111, 76, 113, 100, and 106.

57. A composition comprising one or more recombinant polypeptides, wherein the one or more recombinant polypeptides are selected from the group consisting of the

polypeptides of SEQ ID NOs: 46, 111, 76, 113, 124, 126, 128, and 100.

58. The composition of any one of claims 48-57, further comprising two or more

recombinant proteins, wherein the recombinant proteins are selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37.

59. The composition of claim 58, comprising six recombinant proteins, wherein the

recombinant proteins are selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37.

60. A method for degrading a lignocellulose-containing material, the method comprising: a) contacting the lignocellulose-containing material with a host cell or composition of any one of claims 1-6, 8-37, 39, or 41-59; and

b) incubating the host cell or composition and lignocellulose-containing material under conditions that support lignocellulose degradation.

61. The method of claim 60, wherein the host cell further comprises the enzymes necessary to convert the degraded lignocellulose to a fermentation product.

62. A method of converting lignocellulose-containing material to fermentation product, comprising contacting the lignocellulose-containing material with the host cell of any one of claims 1-6 or 8-37, or 39; and culturing the host cell under conditions sufficient to produce a fermentation product from the lignocellulose-containing material.

63. A method of reducing the viscosity of a pretreated lignocellulose-containing material, the method comprising contacting pretreated lignocellulose-containing material with the host cell or composition of any of claims 1-6, 8-37, 39, or 41-59.

64. The method of any one of claims 60-63, wherein the lignocellulose comprises

hemicellulose.

65. The method of any one of claims 60-64, wherein the lignocellulose comprises cellulose.

66. A method of converting biomass to fermentation product, comprising contacting the

biomass with the host cell or composition of any one of claims 1-6, 8-37, 39, or 41-59, to yield a sugar solution; and culturing the sugar solution with a fermentative

microorganism under conditions sufficient to produce a fermentation product.

67. A method of converting biomass to fuel comprising contacting the biomass with the host cell or composition of any one of claims 1-6, 8-37, 39, or 41-59, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fuel.

68. A method of degrading biomass comprising contacting the biomass with the host cell or composition of any one of claims 1-6, 8-37, 39, or 41-59, to yield a sugar solution.

69. The method of any one of claims 60-68, wherein the lignocellulose-containing material or biomass is subjected to pretreatment prior to being contacted with the host cell of composition of any one of claims 1-6, 8-37, or 40-58, wherein the pretreatment comprises one or more of the treatments selected from the group consisting of: ammonia fiber expansion (AFEX), steam explosion, treatment with alkaline aqueous solutions, treatment with acidic solutions, treatment with organic solvents, treatment with ionic liquids (IL), treatment with electrolyzed water, and treatment with phosphoric acid.

70. The method of any one of claims 60-69, wherein the lignocellulose-containing material or biomass comprises a plant material.

71. The method of claim 70, wherein the plant material is selected from the group consisting of Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, sugarcane bagasse, corn fiber, Distillers Dried Grains with Solubles (DDGS), Blue Stem, corncobs, pine, birch, willow, aspen, poplar wood, and energy cane.

72. The method of any of claims 60-71, wherein at least a portion of the method is conducted at a temperature between 40° and 80° C.

73. The method claim 72, wherein at least a portion of the method is conducted at a

temperature between 60° and 80° C.

74. A method for degrading hemicellulose, comprising the steps of:

a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising -l,4-linkages and one or more functional groups; and

b) treating said hemicellulose with two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose.

75. A method for degrading hemicellulose, said method comprising the steps of:

a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising -l,4-linkages and one or more functional groups; and

b) treating said hemicellulose with a transgenic host cell that secretes two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose.

76. The method of claim 74 or claim 75, wherein said one or more functional groups are selected from the group consisting of arabinose, glucuronyl, and acetyl.

77. The method of any one of claims 74-76, wherein said treating is conducted at a

temperature between 40° and 80°C.

78. The method of any one of claims 74-76, wherein said treating is conducted at a

temperature between 60° and 80°C.

79. The host cell of any one of claims 1-6, 8-37, and 39-40, wherein the host cell further comprises a recombinant nucleic acid encoding a CM581 polypeptide.

80. The host cell of claim 79, wherein the CM581 polypeptide has a sequence of SEQ ID NO: 146.

81. The composition of any one of claims 41-49, wherein the composition further contains a CM581 polypeptide.

82. The composition of claim 81, wherein the CM581 polypeptide has a sequence of SEQ ID NO: 146.

83. The method of any one of claims 60-78, wherein the method further includes contacting said lignocellulose-containing material, biomass, plant material, or hemicellulose with a CM581 polypeptide.

84. The method of claim 83, wherein the CM581 polypeptide has a sequence of SEQ ID NO:

146.

Description:
THERMOSTABLE C. BESCII ENZYMES

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of U.S. Provisional Application No. 61/425,623, filed December 21, 2010, and U.S. Provisional Application No. 61/532,060, filed September 7, 2011, both of which are hereby incorporated by reference in their entirety.

FIELD

[0002] The present disclosure relates to compositions and methods for the degradation of cellulose, hemicellulose, and cellulose and/or hemicellulose-containing materials. In particular, the disclosure provides thermostable enzymes for the degradation of cellulose, nucleic acids encoding the enzymes, and methods of use thereof. The disclosure also provides thermostable enzymes for the degradation of hemicellulose, nucleic acids encoding the enzymes, and methods of use thereof. The disclosure further provides thermostable enzymes that enhance the activity of thermostable cellulase and/or hemicellulases, nucleic acids encoding the enzymes, and methods of use thereof.

SUBMISSION OF SEQUENCE LISTING ON ASCII TEXT FILE

[0003] The content of the following submission on ASCII text file is incorporated herein by reference in its entirety: a computer readable form (CRF) of the Sequence Listing (file name: 658012000940SeqListing.txt, date recorded: December 7, 2011, size: 513 KB).

BACKGROUND

[0004] Microorganisms that are currently being used to ferment sugars to biofuels such as ethanol usually cannot utilize complex polysaccharides such as cellulose and hemicellulose. As a result, a significant bottleneck occurs in the conversion of lignocellulosic materials to biofuels.

[0005] Cellulose, a major component of plants and one of the most abundant organic compounds on earth, is a polysaccharide composed on long chains of β(1-4) linked D-glucose molecules. Due to its sugar-based composition, cellulose is a rich potential source material for the production of biofuels. For example, sugars from cellulose may be fermented into biofuels such as ethanol. In order for the sugars within cellulose to be used for the production of biofuels or other commodity chemicals, the cellulose must be broken down into smaller molecules.

[0006] Cellulose may be enzymatically hydrolyzed by the action of cellulases. Cellulases include endoglucanases, exoglucanases, and beta-glucosidases. The actions of cellulases cleave the 1-4 β-D-glycosidic linkages in cellulose, and result in the ultimate release of β-D-glucose molecules. During the breakdown of cellulose into individual sugar molecules, glucose polymers of various lengths may be formed as intermediate breakdown products. Glucose polymers of approximately 2-6 molecules in length derived from the hydrolysis of cellulose are referred to as "cellodextrins" or "cellooligosaccharides."

[0007] Hemicellulose constitutes the second largest component of polysaccharides in many plants, such as the perennial grasses switchgrass and Miscanthus. Hemicellulose is a complex polysaccharide that has a xylose-linked backbone, with side chains of arabinose, glucuronyl, and acetyl groups. A structural model of a hemicellulose illustrates the xylose backbone residues joined together in beta-l,4-linkages (Figure 1). Several functional groups decorate the backbone, including esters of acetyl (Ac) groups, arabinose, glucuronic acids, and esters of feroryl groups. The feroryl groups link the entire structure to lignin. Enzyme cocktails that hydrolyze hemicellulose into its major component sugars such as xylose (a 5-carbon sugar) and arabinose (a 5-carbon sugar) will significantly increase the fermentable sugars for biofuel production from lignocellulose-based feedstock. Enzymatic removal of hemicellulose by hemicellulases will also increase accessibility of cellulases to the cellulose component of plant cell walls or

lignocellulosic feedstocks. Thus, the degradation of hemicellulose is a critical step in the utilization of lignocellulose feedstock for biofuel production.

[0008] Thermostable enzymes are particularly desirable for the efficient degradation of cellulose and hemicellulose, because thermostable enzymes are more compatible than non- thermostable enzymes with other processes involved in converting lignocellulose-based materials into biofuels. For example, treatments of lignocellulose-based materials to decrease the crystallinity of cellulose may require high temperatures that inactivate non-thermostable enzymes. [0009] In addition, thermostable enzymes are desirable for the degradation of cellulose and/or hemicellulose because they may have a higher specific activity as compared to their mesophilic counterparts, and because they can operate at high temperatures that reduce or eliminate the risk of microbial contamination.

[0010] Accordingly, there is a need for thermostable enzymes and enzyme cocktails capable of degrading cellulose and/or hemicellulose.

BRIEF SUMMARY

[0011] This disclosure provides enzymes and enzyme cocktails which satisfy the need for thermostable enzymes capable of degrading cellulose and/or hemicellulose. In some aspects, the disclosure provides enzymes having cellulase activity. In some aspects, the disclosure provides truncated enzymes having cellulase activity. In some aspects, the disclosure provides improved enzyme mixtures for the degradation of cellulose-containing materials. In some aspects, the disclosure provides enzymes having hemicellulase activity. In some aspects, the disclosure provides improved enzyme mixtures for the degradation of hemicellulose-containing materials. The disclosure further provides enzyme cocktails containing one or more cellulases and one or more hemicellulases with improved activity on materials containing both cellulose and hemicellulose, wherein cellulase and hemicellulase mixtures have synergistic activity. The disclosure further provides polypeptides that enhance the activity of enzymes having cellulase or hemicellulase activity, and/or mixtures thereof. The disclosure further provides nucleotide sequences encoding the polypeptides disclosed herein. The polypeptides disclosed herein can be utilized alone, in combination, or with other enzymes.

[0012] In one embodiment, the disclosure provides a host cell, comprising two or more recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38. In another embodiment, a host cell comprising three recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38 is provided. In another embodiment, a host cell comprising four recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38 is provided. In another embodiment, a host cell comprising five recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38. In another embodiment, a host cell comprising six recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38 is provided.

[0013] In another embodiment, the disclosure provides a host cell, comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37

[Caldicellulosiruptor bescii endoxylanase (Cbl93) lacking signal peptide].

[0014] In another embodiment, the disclosure provides a host cell, comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37

[Caldicellulosiruptor bescii endoxylanase (Cbl93) lacking signal peptide], wherein the host cell further comprises one or more recombinant nucleic acids encoding one or more cellulases.

[0015] In another embodiment, the disclosure provides a host cell comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 {Caldicellulosiruptor bescii endoxylanase (CM93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37

[Caldicellulosiruptor bescii endoxylanase (Cbl93) lacking signal peptide], wherein the host cell is selected from the group consisting of Escherichia spp., Pseudomonas spp., Proteus spp., Ralstonia spp., Streptomyces spp., Staphylococcus spp., Lactococcus spp., Bacillus spp., Saccharomyces cerevisiae, Schizosaccharomyces pombe, Yarrowia lipolytica, Hansenula polymorpha, Kluyveromyces lactis, Pichia pastoris, Aspergillus spp., Chrysosporium

lucknowense, or Trichoderma reesei.

[0016] In another embodiment, the disclosure provides a method for producing at least two of the enzymes selected from the group consisting of endoxylanase, cc-arabinofuranosidase, cc- glucuronidase, β-xylosidase, and acetyl xylan esterase, comprising: culturing a host cell comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii

endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7

[Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (Cbl 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl 62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37 [Caldicellulosiruptor bescii endoxylanase (Cbl 93) lacking signal peptide] in a culture medium, under suitable conditions to produce the endoxylanase, cc- arabinofuranosidase, cc-glucuronidase, β-xylosidase, and acetyl xylan esterase. [0017] In another embodiment, the disclosure provides a host cell, comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37

[Caldicellulosiruptor bescii endoxylanase (Cbl93) lacking signal peptide] and culture medium.

[0018] In another embodiment, the disclosure provides a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37.

[0019] In another embodiment, the disclosure provides a composition comprising six recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37.

[0020] In another embodiment, the disclosure provides a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein the composition further comprises one or more recombinant cellulases.

[0021] In yet another embodiment, the disclosure provides a method of converting biomass to fermentation product comprising contacting the biomass with a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fermentation product. [0022] In another embodiment, the disclosure provides a method of converting biomass to fermentation product comprising contacting the biomass with a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fermentation product, and wherein the biomass is subjected to pretreatment prior to being contacted with the composition comprising two or more recombinant proteins, wherein the pretreatment comprises one or more of the treatments selected from the group consisting of: ammonia fiber expansion (AFEX), steam explosion, treatment with alkaline aqueous solutions, treatment with acidic solutions, treatment with organic solvents, treatment with ionic liquids (IL), treatment with electrolyzed water, and treatment with phosphoric acid.

[0023] In another embodiment, the disclosure provides a method of converting biomass to fermentation product comprising contacting the biomass with a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fermentation product, wherein the biomass comprises a plant material.

[0024] In another embodiment, the disclosure provides a method of converting biomass to fermentation product comprising contacting the biomass with a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fermentation product, wherein the biomass comprises a plant material selected from the group consisting of Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, sugarcane bagasse, corn fiber, Distillers Dried Grains with Solubles (DDGS), Blue Stem, corncobs, pine, birch, willow, aspen, poplar wood, and energy cane.

[0025] In another embodiment, the disclosure provides a method of converting biomass to fuel comprising contacting the biomass with the composition a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fuel.

[0026] In another embodiment, the disclosure provides a method of converting biomass to fuel comprising contacting the biomass with the composition a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fuel, and wherein the biomass is subjected to pretreatment prior to being contacted with the composition comprising two or more recombinant proteins, wherein the pretreatment comprises one or more of the treatments selected from the group consisting of: ammonia fiber expansion (AFEX), steam explosion, treatment with alkaline aqueous solutions, treatment with acidic solutions, treatment with organic solvents, treatment with ionic liquids (IL), treatment with electrolyzed water, and treatment with phosphoric acid.

[0027] In another embodiment, the disclosure provides a method of converting biomass to fuel comprising contacting the biomass with the composition a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fuel, wherein the biomass comprises a plant material. [0028] In another embodiment, the disclosure provides a method of converting biomass to fuel comprising contacting the biomass with the composition a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fuel, wherein the biomass comprises a plant material selected from the group consisting of Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, sugarcane bagasse, corn fiber, Distillers Dried Grains with

Solubles (DDGS), Blue Stem, corncobs, pine, birch, willow, aspen, poplar wood, and energy cane.

[0029] In another embodiment, the disclosure provides a method of degrading biomass comprising contacting the biomass with the composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution.

[0030] In another embodiment, the disclosure provides a method of degrading biomass comprising contacting the biomass with the composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution, and wherein the biomass is subjected to pretreatment prior to being contacted with the composition comprising two or more recombinant proteins, wherein the pretreatment comprises one or more of the treatments selected from the group consisting of: ammonia fiber expansion (AFEX), steam explosion, treatment with alkaline aqueous solutions, treatment with acidic solutions, treatment with organic solvents, treatment with ionic liquids (IL), treatment with electrolyzed water, and treatment with phosphoric acid. [0031] In another embodiment, the disclosure provides a method of degrading biomass comprising contacting the biomass with the composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution, wherein the biomass comprises a plant material.

[0032] In another embodiment, the disclosure provides a method of degrading biomass comprising contacting the biomass with the composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution, wherein the biomass comprises a plant material selected from the group consisting of Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, sugarcane bagasse, corn fiber, Distillers Dried Grains with Solubles (DDGS), Blue Stem, corncobs, pine, birch, willow, aspen, poplar wood, and energy cane.

[0033] In yet another embodiment, the disclosure provides a method for degrading hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose.

[0034] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said one or more functional groups are selected from the group consisting of arabinose, glucuronyl, and acetyl.

[0035] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said treating is conducted at a temperature between 40 and 80°C.

[0036] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said treating is conducted at a temperature between 60 and 80°C.

[0037] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with a transgenic host cell that secretes two or more enzymes selected from the group consisting of the

polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose.

[0038] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with a transgenic host cell that secretes two or more enzymes selected from the group consisting of the

polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said one or more functional groups are selected from the group consisting of arabinose, glucuronyl, and acetyl.

[0039] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with a transgenic host cell that secretes two or more enzymes selected from the group consisting of the

polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said treating is conducted at a temperature between 40 and 80°C.

[0040] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with a transgenic host cell that secretes two or more enzymes selected from the group consisting of the

polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said treating is conducted at a temperature between 60 and 80°C.

[0041] In another embodiment, the disclosure provides a host cell, comprising two or more recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38, wherein at least one of the two or more recombinant nucleic acids is selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 8, 14, 20, 28, and 34. In another embodiment, a host cell comprising three recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38, wherein at least two of the three recombinant nucleic acids are selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 8, 14, 20, 28, and 34, is provided. In another embodiment, a host cell comprising four recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38, wherein at least three of the four recombinant nucleic acids are selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 8, 14, 20, 28, and 34, is provided. In another embodiment, a host cell comprising five recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38, wherein at least four of the five recombinant nucleic acids are selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 8, 14, 20, 28, and 34, is provided In another embodiment, a host cell comprising six recombinant nucleic acids selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38, wherein at least five of the six recombinant nucleic acids are selected from the group consisting of the nucleotide sequences of SEQ ID NOs: 8, 14, 20, 28, and 34, is provided.

[0042] In another embodiment, the disclosure provides a host cell, comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (CM95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37

[Caldicellulosiruptor bescii endoxylanase (Cbl93) lacking signal peptide], wherein at least one of the two or more recombinant nucleic acids is selected from the group consisting of: a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii cc-glucuronidase (Cb909)], a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], and a nucleic acid encoding the polypeptide of SEQ ID NO: 33 {Caldicellulosiruptor bescii acetyl xylan esterase (CM62)].

[0043] In another embodiment, the disclosure provides a host cell, comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37

[Caldicellulosiruptor bescii endoxylanase (Cbl93) lacking signal peptide], wherein at least one of the two or more recombinant nucleic acids is selected from the group consisting of: a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii cc-glucuronidase (Cb909)], a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], and a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], and wherein the host cell further comprises one or more recombinant nucleic acids encoding one or more cellulases.

[0044] In another embodiment, the disclosure provides a host cell comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37

[Caldicellulosiruptor bescii endoxylanase (Cbl93) lacking signal peptide], wherein at least one of the two or more recombinant nucleic acids is selected from the group consisting of: a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii cc-glucuronidase (Cb909)], a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], and a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], and wherein the host cell is selected from the group consisting of Escherichia spp., Pseudomonas spp., Proteus spp., Ralstonia spp., Streptomyces spp.,

Staphylococcus spp., Lactococcus spp., Bacillus spp., Saccharomyces cerevisiae,

Schizosaccharomyces pombe, Yarrowia lipolytica, Hansenula polymorpha, Kluyveromyces lactis, Pichia pastoris, Aspergillus spp., Chrysosporium lucknowense, or Trichoderma reesei.

[0045] In another embodiment, the disclosure provides a method for producing at least two of the enzymes selected from the group consisting of endoxylanase, cc-arabinofuranosidase, cc- glucuronidase, β-xylosidase, and acetyl xylan esterase, comprising: culturing a host cell comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii

endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7

[Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (Cbl 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl 62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37 [Caldicellulosiruptor bescii endoxylanase (Cbl 93) lacking signal peptide], wherein at least one of the two or more recombinant nucleic acids is selected from the group consisting of: a nucleic acid encoding the polypeptide of SEQ ID NO: 7

[Caldicellulosiruptor bescii endoxylanase (CM95)], a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii cc-glucuronidase (Cb909)], a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], and a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)] in a culture medium, under suitable conditions to produce the endoxylanase, cc-arabinofuranosidase, cc-glucuronidase, β-xylosidase, and acetyl xylan esterase.

[0046] In another embodiment, the disclosure provides a host cell, comprising two or more recombinant nucleic acids selected from the group consisting of: a) a nucleic acid encoding the polypeptide of SEQ ID NO: 3 [Caldicellulosiruptor bescii endoxylanase (Cbl93)], b) a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], c) a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], d) a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii a- glucuronidase (Cb909)], e) a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], f) a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], g) a nucleic acid encoding the polypeptide of SEQ ID NO: 37

[Caldicellulosiruptor bescii endoxylanase (Cbl93) lacking signal peptide], wherein at least one of the two or more recombinant nucleic acids is selected from the group consisting of: a nucleic acid encoding the polypeptide of SEQ ID NO: 7 [Caldicellulosiruptor bescii endoxylanase (Cbl95)], a nucleic acid encoding the polypeptide of SEQ ID NO: 13 [Caldicellulosiruptor bescii cc-arabinofuranosidase (CM 172)], a nucleic acid encoding the polypeptide of SEQ ID NO: 19 [Caldicellulosiruptor bescii cc-glucuronidase (Cb909)], a nucleic acid encoding the polypeptide of SEQ ID NO: 27 [Caldicellulosiruptor bescii β-xylosidase (Cb2487)], and a nucleic acid encoding the polypeptide of SEQ ID NO: 33 [Caldicellulosiruptor bescii acetyl xylan esterase (Cbl62)], and culture medium. [0047] In another embodiment, the disclosure provides a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33.

[0048] In another embodiment, the disclosure provides a composition comprising six recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein five of the six recombinant proteins are selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33.

[0049] In another embodiment, the disclosure provides a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition further comprises one or more recombinant cellulases.

[0050] In yet another embodiment, the disclosure provides a method of converting biomass to fermentation product comprising contacting the biomass with a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a

fermentation product.

[0051] In another embodiment, the disclosure provides a method of converting biomass to fermentation product comprising contacting the biomass with a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fermentation product, and wherein the biomass is subjected to pretreatment prior to being contacted with the composition comprising two or more recombinant proteins, wherein the pretreatment comprises one or more of the treatments selected from the group consisting of: ammonia fiber expansion (AFEX), steam explosion, treatment with alkaline aqueous solutions, treatment with acidic solutions, treatment with organic solvents, treatment with ionic liquids (IL), treatment with electrolyzed water, and treatment with phosphoric acid.

[0052] In another embodiment, the disclosure provides a method of converting biomass to fermentation product comprising contacting the biomass with a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fermentation product, wherein the biomass comprises a plant material.

[0053] In another embodiment, the disclosure provides a method of converting biomass to fermentation product comprising contacting the biomass with a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fermentation product, wherein the biomass comprises a plant material selected from the group consisting of Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, sugarcane bagasse, corn fiber, Distillers Dried Grains with

Solubles (DDGS), Blue Stem, corncobs, pine, birch, willow, aspen, poplar wood, and energy cane.

[0054] In another embodiment, the disclosure provides a method of converting biomass to fuel comprising contacting the biomass with the composition a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fuel.

[0055] In another embodiment, the disclosure provides a method of converting biomass to fuel comprising contacting the biomass with the composition a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fuel, and wherein the biomass is subjected to pretreatment prior to being contacted with the composition comprising two or more recombinant proteins, wherein the pretreatment comprises one or more of the treatments selected from the group consisting of: ammonia fiber expansion (AFEX), steam explosion, treatment with alkaline aqueous solutions, treatment with acidic solutions, treatment with organic solvents, treatment with ionic liquids (IL), treatment with electrolyzed water, and treatment with phosphoric acid.

[0056] In another embodiment, the disclosure provides a method of converting biomass to fuel comprising contacting the biomass with the composition a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fuel, wherein the biomass comprises a plant material.

[0057] In another embodiment, the disclosure provides a method of converting biomass to fuel comprising contacting the biomass with the composition a composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution; and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fuel, wherein the biomass comprises a plant material selected from the group consisting of

Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, sugarcane bagasse, corn fiber, Distillers Dried Grains with Solubles

(DDGS), Blue Stem, corncobs, pine, birch, willow, aspen, poplar wood, and energy cane.

[0058] In another embodiment, the disclosure provides a method of degrading biomass comprising contacting the biomass with the composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution.

[0059] In another embodiment, the disclosure provides a method of degrading biomass comprising contacting the biomass with the composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution, and wherein the biomass is subjected to pretreatment prior to being contacted with the composition comprising two or more recombinant proteins, wherein the pretreatment comprises one or more of the treatments selected from the group consisting of: ammonia fiber expansion (AFEX), steam explosion, treatment with alkaline aqueous solutions, treatment with acidic solutions, treatment with organic solvents, treatment with ionic liquids (IL), treatment with electrolyzed water, and treatment with phosphoric acid.

[0060] In another embodiment, the disclosure provides a method of degrading biomass comprising contacting the biomass with the composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution, wherein the biomass comprises a plant material.

[0061] In another embodiment, the disclosure provides a method of degrading biomass comprising contacting the biomass with the composition comprising two or more recombinant proteins, the recombinant proteins selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 3, 7, 13, 19, 27, 33 and 37, wherein at least one of the two or more recombinant proteins is selected from the group consisting of the polypeptide sequences of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein the composition may further comprise one or more recombinant cellulases, to yield a sugar solution, wherein the biomass comprises a plant material selected from the group consisting of Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, sugarcane bagasse, corn fiber, Distillers Dried Grains with Solubles (DDGS), Blue Stem, corncobs, pine, birch, willow, aspen, poplar wood, and energy cane. [0062] In yet another embodiment, the disclosure provides a method for degrading hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein at least one of the two or more enzymes is selected from the group consisting of the polypeptide of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose.

[0063] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein at least one of the two or more enzymes is selected from the group consisting of the polypeptide of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said one or more functional groups are selected from the group consisting of arabinose, glucuronyl, and acetyl.

[0064] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein at least one of the two or more enzymes is selected from the group consisting of the polypeptide of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said treating is conducted at a temperature between 40 and 80°C. [0065] In another embodiment, the disclosure provides a method for degrading hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with two or more enzymes selected from the group consisting of the polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein at least one of the two or more enzymes is selected from the group consisting of the polypeptide of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said treating is conducted at a temperature between 60 and 80°C.

[0066] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with a transgenic host cell that secretes two or more enzymes selected from the group consisting of the

polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein at least one of the two or more enzymes is selected from the group consisting of the polypeptide of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose.

[0067] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with a transgenic host cell that secretes two or more enzymes selected from the group consisting of the

polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein at least one of the two or more enzymes is selected from the group consisting of the polypeptide of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said one or more functional groups are selected from the group consisting of arabinose, glucuronyl, and acetyl. [0068] In another embodiment, the disclosure provides a method for degrading hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with a transgenic host cell that secretes two or more enzymes selected from the group consisting of the

polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein at least one of the two or more enzymes is selected from the group consisting of the polypeptide of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said treating is conducted at a temperature between 40 and 80°C.

[0069] In another embodiment, the disclosure provides a method for degrading

hemicellulose, said method comprising the steps of: a) providing plant material comprising hemicellulose, wherein said hemicellulose comprises a xylose backbone comprising β-1,4- linkages and one or more functional groups; and b) treating said hemicellulose with a transgenic host cell that secretes two or more enzymes selected from the group consisting of the

polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, wherein at least one of the two or more enzymes is selected from the group consisting of the polypeptide of SEQ ID NOs: 7, 13, 19, 27, and 33, and wherein said treating cleaves said one or more functional groups from said xylose backbone to form cleaved hemicellulose, wherein said treating is conducted at a temperature between 60 and 80°C.

[0070] In one aspect, provided herein is a host cell containing one, two, three, four, five, six or more recombinant nucleic acids, wherein the recombinant nucleic acids encode one, two, three, four, five, or six polypeptides selected from: CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides.

[0071] In another aspect, provided herein is a host cell containing one, two, three, four, five, six or more recombinant nucleic acids, wherein the recombinant nucleic acids encode one, two, three, four, five, or six polypeptides selected from: CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides, and wherein the CM952 polypeptide has a sequence selected from SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide has a sequence selected from SEQ ID NOs: 60, 61, and 111. wherein the CM954 polypeptide has a sequence selected from SEQ ID NOs: 74, 121, and 76; wherein the CM946 polypeptide has a sequence selected from SEQ ID NOs: 86, 87, and 113; wherein the Cb629 polypeptide has a sequence selected from SEQ ID NOs: 98, 119, and 100; and wherein the Cb486 polypeptide has a sequence of SEQ ID NO: 106.

[0072] Also provided herein is a host cell containing one, two, three, four, five, six or more recombinant nucleic acids, wherein the recombinant nucleic acids encode one, two, three, four, five, or six polypeptides selected from polypeptides having the sequence of: SEQ ID NO: 46, 111, 76, 113, 100, and 106.

[0073] Also provided herein is a host cell containing one, two, three, four, five, six or more recombinant nucleic acids, wherein the recombinant nucleic acids encode one, two, three, four, five, or six polypeptides selected from: CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides, and wherein the recombinant nucleic acid encoding a CM952 polypeptide has a sequence selected from SEQ ID NOs: 45, 115, 125, 127, 129, and 47; wherein the recombinant nucleic acid encoding a CM953 polypeptide has a sequence selected from SEQ ID NOs: 62, 63, and 110; wherein the recombinant nucleic acid encoding a CM954 polypeptide has a sequence selected from SEQ ID NOs: 116, 75, and 77; wherein the recombinant nucleic acid encoding a CM946 polypeptide has a sequence selected from SEQ ID NOs: 88, 89, and 112; wherein the recombinant nucleic acid encoding a Cb629 polypeptide has a sequence selected from SEQ ID NOs: 99, 120, and 101; and, wherein the recombinant nucleic acid encoding a Cb486 polypeptide has the sequence of SEQ ID NO: 107.

[0074] Also provided herein is a host cell containing one, two, three, four, five, six, or more recombinant nucleic acids, wherein the recombinant nucleic acids encode one, two, three, four, five, or six polypeptides selected from: CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides, and wherein the recombinant nucleic acids have a sequence selected from SEQ ID NOs: 47, 110, 77, 112, 101, and 107.

[0075] Also provided herein host cell containing six recombinant nucleic acids, wherein the nucleic acids have the sequences of SEQ ID NOs: 47, 110, 77, 112, 101, and 107. [0076] Any of the host cells provided herein may also contain one or more recombinant nucleic acids encoding a hemicellulase, wherein the hemicellulase has a sequence selected from SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37. In some aspects, a nucleic acid encoding a

hemicellulase has a sequence selected from SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38. In some aspects, host cells provided herein may contain recombinant nucleic acids having the sequences of SEQ ID NOs: 8, 14, 20, 28, 34, and 38, or recombinant nucleic acids having the sequences of SEQ ID NOs: 8, 14, 20, 28, and 38.

[0077] Further provided herein is a composition containing one, two, three, four, five, six, or more recombinant polypeptides, wherein the recombinant polypeptides are selected from

CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides.

[0078] In another aspect, provided herein is a composition containing one, two, three, four, five, six, or more recombinant polypeptides, wherein the recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides, and wherein the CM952 polypeptide has a sequence selected from SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide has a sequence selected from SEQ ID NOs: 60, 61, and 111. wherein the CM954 polypeptide has a sequence selected from SEQ ID NOs: 74, 121, and 76; wherein the CM946 polypeptide has a sequence selected from SEQ ID NOs: 86, 87, and 113; wherein the Cb629 polypeptide has a sequence selected from SEQ ID NOs: 98, 119, and 100; and wherein the Cb486 polypeptide has a sequence of SEQ ID NO: 106.

[0079] Also provided herein is a composition containing one, two, three, four, five, six, or more recombinant polypeptides, wherein the recombinant polypeptides have a sequence selected from SEQ ID NOs: 46, 111, 76, 113, 100, and 106.

[0080] Also provided herein is a composition containing six recombinant polypeptides, wherein the recombinant polypeptides have the sequences of SEQ ID NOs: 46, 111, 76, 113, 100, and 106.

[0081] Also provided herein is a composition containing one or more recombinant polypeptides, wherein the one or more recombinant polypeptides are selected from the group consisting of the polypeptides of SEQ ID NOs: 46, 111, 76, 113, 124, 126, 128, and 100. [0082] Any of the compositions provided herein may also contain one or more hemicellulase polypeptides, wherein the hemicellulase has a sequence selected from SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37. In some aspects, compositions provided herein contain polypeptides having the sequences of SEQ ID NOs: 7, 13, 19, 27, 33, and 37 or polypeptides having the sequences of SEQ ID NOs: 7, 13, 19, 27, and 37.

[0083] In another aspect, provided herein is a method for producing one or more cellulases, the method including: a) culturing any of the host cells disclosed herein which contain one or more recombinant nucleic acids encoding one or more CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides, in culture media under conditions sufficient to support the expression of the recombinant nucleic acid(s), and collecting one or more cellulases from said media and/or said host cell.

[0084] In another aspect, provided herein is a method for degrading a cellulose-containing material, the method including: a) contacting the cellulose-containing material with any host cell or composition disclosed herein, and, b) incubating the host cell or composition and cellulose- containing material under conditions that support cellulose degradation.

[0085] Cellulose-containing material may be pretreated prior to being contacted with a composition or host cell disclosed herein. Pre-treatment steps may include one or more of the treatments of: ammonia fiber expansion (AFEX), steam explosion, treatment with alkaline aqueous solutions, treatment with acidic solutions, treatment with organic solvents, treatment with high pressure, treatment with high temperature, treatment with ionic liquids (IL), treatment with electrolyzed water, and treatment with phosphoric acid.

[0086] Also provided herein is a method of reducing the viscosity of a pretreated cellulose- containing material, the method including contacting pretreated cellulose-containing material with any host cell or composition provided herein.

[0087] Also provided herein is a method of converting a cellulose-containing material to fermentation product, the method including: a) contacting the cellulose-containing material with any host cell or composition provided herein, to yield a sugar solution, and culturing the sugar solution with a fermentative microorganism under conditions sufficient to produce a fermentation product.

[0088] Also provided herein is a method for degrading a cellulose-containing material, the method including: a) contacting the cellulose-containing material with one or more polypeptides selected from SEQ ID NOs: 46, 111, 76, 113, 124, 126, 128, and 100, and b) incubating the one or more polypeptides and cellulose-containing material under conditions that support cellulose degradation.

[0089] In some aspects, cellulose-containing material provided herein is a plant material. Plant material may include, without limitation, Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, rye hulls, wheat hulls, sugarcane bagasse, copra meal, copra pellets, palm kernel meal, corn fiber, Distillers Dried Grains with Solubles (DDGS), Blue Stem, corncobs, pine wood, birch wood, willow wood, aspen wood, poplar wood, energy cane, waste paper, sawdust, forestry wastes, waste paper, and crop residues.

[0090] In some aspects, at least a portion of any of the methods provided herein may be conducted at a temperature above 50° C. In some aspects, at least a portion of any of the methods provided herein may be conducted at a temperature between 40° and 80°, 50° and 80°, 60° and 80°, 70° and 80°, 45° and 55°, 50° and 60°, 55° and 65°, 60° and 70°, 65° and 75°, 75° and 85°, or 80° and 90° C.

[0091] In some aspects, in any host cells disclosed herein that contain two or more recombinant nucleic acids, two or more of the recombinant nucleic acids may be present in a contiguous polydeoxyribonucleotide chain.

[0092] In any of the compositions or methods above, a CM581 polypeptide may be provided in the composition or the method with the cellulases and/or hemicellulases. In some aspects, the CM581 polypeptide is a polypeptide containing the sequence of SEQ ID NO: 146. Also provided herein are any of the above host cells that further contain a nucleic acid encoding a CM581 polypeptide, In some aspects, a nucleic acid encoding a CM581 polypeptide is a nucleic acid containing the sequence of SEQ ID NO: 147.

[0093]

BRIEF DESCRIPTION OF THE DRAWINGS

[0094] Figure 1 shows a model of a typical hemicellulose such as xylan.

[0095] Figure 2, part (A), shows the putative domain architecture of the Cbl93 and Cbl95 proteins. Part (B) shows an SDS-PAGE of purified Cbl93 and Cbl95; the molecular markers are in the lane marked M. The proteins were purified by metal affinity chromatography, followed by ion exchange chromatography and then gel filtration. The predicted molecular masses of Cbl93 and Cbl95 were 77.7 kDa and 42.0 kDa, respectively. Part (C) shows the enzymatic activity of Cbl93 on natural substrates using TLC analysis. Various substrates were tested: soluble wheat arabinoxylan (SWAX), oat-spelt xylan (OSX), birchwood xylan (BWX), carboxymethyl cellulose (CMC), lichenan, glucomannan, 1,4 β-mannan, and arabinan. In the case of SWAX, OSX, and BWX, in the presence of Cbl93 (+), short xylose chains were released. In the minus (-) lanes, no enzyme was added and therefore no products of hydrolysis were released. XI (xylose monomer), X2 (xylose dimer or a disaccharide), X3 (trisaccharide), X4 (tetrasaccharide), and pentasaccharide (X5) were loaded in the first lane (M) as markers. The results showed that this enzyme releases shorter chains or oligosaccharides from the complex substrates (SWAX, OSX, and BWX). Part (D) shows the enzymatic activity of Cbl95 on natural substrates using TLC analysis. Various substrates were tested: SWAX, OSX, BWX, CMC, lichenan, glucomannan, 1,4 β-mannan, and arabinan. In the case of SWAX, OSX, and BWX, in the presence of Cbl95 (+), short xylose chains were released. In the minus (-) lanes, no enzyme was added and therefore no products of hydrolysis were released. XI (xylose monomer), X2 (xylose dimer or a disaccharide), X3 (trisaccharide), X4 (tetrasaccharide), and pentasaccharide (X5) were loaded in the first lane (M) as markers. The results showed that this enzyme releases shorter chains or oligosaccharides from the complex substrates (SWAX, OSX, and BWX). Part (E) shows the enzymatic activity of Cbl93 and Cbl95 on natural substrates from a reducing sugar assay. In this experiment, a different assay for reducing sugars was used to determine the release of products from the substrates. A standard was made based on known glucose concentrations and their absorbance (color development) in the presence of para-hydroxy- benzoic acid hydrazide (Cann et al. 1999. J. Bacterial. 181: 1643-1651 and other reference above- Laver, M. 1972.). Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of glucose equivalents.

[0096] Figure 3, part (A) shows the thermostability of Cbl93, and part (B) shows the thermostability of Cbl95. 5 nM of Cbl93 and Cbl95 were incubated at different temperatures ranging from 65-90 °C. For Cbl93, the enzymes were incubated at 70 °C, 75 °C, 80 °C, 85 °C, and 90 °C; for Cbl95, the enzymes were incubated at 65 °C, 70 °C, 75 °C, and 80 °C. The incubated enzymes were taken out at certain time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7 h, 11 h, 16 h, and 24 h) as indicated, and immediately incubated with wheat arabinoxylan (final 1%, w/v) to measure the enzyme activity. The initial velocity of reaction was calculated. The residue activity ( ) was calculated by dividing the activity of each samples by the initial activity at zero time. Bars are shown with standard errors for three independent experiments.

[0097] Figure 4 shows the kinetic data of Cbl93 on hydrolysis of wheat arabinoxylan, oat spelt xylan, and birchwood xylan. The K m , fc cat , and k c K m are indicated as well. In part (A), the experiment was conducted at 75 °C with 50 mM citrate buffer (pH 6.0). In part (B), the experiment was conducted at 85°C with 50 mM citrate buffer (pH 6.0). Xylan substrates (final 2.5 -50 mg/mL) were incubated with Cbl93 (final 5 nM for wheat arabinoxylan and final 50 nM for oat spelt xylan and birchwood xylan). The initial velocity of reaction was calculated. The initial velocities were then plotted against the concentrations of xylan substrates. The K m and fc cat were calculated by non-linear fit using the Graphpad software. Bars are shown with standard errors for three independent experiments.

[0098] Figure 5 shows the kinetic data of Cbl95 on hydrolysis of wheat arabinoxylan, oat spelt xylan, and birchwood xylan. The K m , fc cat , and k c K m are indicated as well. In part (A), the experiment was conducted at 75 °C with 50 mM citrate buffer (pH 6.0). In part (B), the experiment was conducted at 75°C with 50 mM sodium phosphate buffer (pH 6.5). Xylan substrates (final 2.5 -50 mg/mL) were incubated with Cbl95 (final 5 nM for wheat arabinoxylan and final 50 nM for oat spelt xylan and birchwood xylan). The initial velocity of reaction was calculated. The initial velocities were then plotted against the concentrations of xylan substrates. The K m and fc cat were calculated by non-linear fit using the Graphpad software. Bars are shown with standard errors for three independent experiments.

[0099] Figure 6, part (A) shows an SDS-PAGE of purified Cbl 172. Part (B) shows the enzymatic activity of Cbl 172 on natural substrates from a reducing sugar assay. Five different hemicellulosic substrates were tested: arabinan (sugar beet), SWAX, rye arabinoxylan (RAX), OSX and debranched arabinan. Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of arabinose equivalents. Hydrolysis of arabinan was higher than hydrolysis of other natural substrates. Part (C) shows the enzymatic activity of Cbl 172 on natural substrates using HPLC analysis. Five different hemicellulosic substrates were tested: arabinan (sugar beet), SWAX, RAX, OSX and debranched arabinan. In each case, in the presence of Cbl 172, arabinose was released. In the absence of Cbl 172, only minor amount of arabinose was observed for debranched arabinan; no products of hydrolysis were released for other natural polysaccharides. The results showed that this enzyme releases arabinose from complex substrates (arabinan, SWAX, RAX, OSX and debranched arabinan). Part (D) shows the domain architecture of the Cbl 172 protein; it has a glycoside hydrolase (GH) family 51 catalytic domain. Part (E) shows the thermostability of Cbl 172. Cbl 172 has 57%, 45%, 35% and 22% activity after incubation at 70 °C, 75 °C, 80 °C and 85 °C for 24 h, respectively. Fifty nM Cbl 172 was kept at different temperatures (70 °C, 75 °C, 80 °C, 85 °C and 90 °C). The samples were taken out at the following time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately applied to enzyme activity measurement.

[00100] Figure 7 shows the kinetic data of Cbl 172 on hydrolysis of pNP-CC-L- arabinofuranoside. The K m , fc cat , and k c K m are indicated as well. In part (A), the experiment was conducted at 90°C; in part (B), the experiment was conducted at 75°C. One hundred μΐ pNP-CC-L- arabinofuranoside substrate of different concentrations was kept at 85 °C for three minutes to equilibrate. Then twenty five μΐ of the protein sample (fifty nM) was added to the substrate and mixed by pipetting up and down for several times. The optical density at 400 nm was recorded by a Cary 300 UV- Visible spectrophotometer for 2.5 minutes. The initial velocity of reaction in the first minute was calculated. The initial velocities were then plotted against the concentrations of pNP-CC-L-arabinofuranoside. The K m and k cat were calculated by non-linear fit using the Graphpad software.

[00101] Figure 8, part (A) shows putative domain architecture of Cb909. Part (B) shows SDS-PAGE of purified Cb909. Part (C) shows the activity of Cb909. The substrate is aldouronic acids, that is a mixture of xylo-oligosaccharides decorated with MeGlcA. After incubation with Cb909 at 75 °C for 60 minutes, MeGlcA group was cleaved by Cb909 from aldouronic acids to release undecorated xylose, xylobiose, xylotriose and xylotetraose as products. The condition of the reaction was as follows: 6nM Cb909, 50mM Phosphate buffer pH 6.0, 150mM NaCl, lmg/ml aldouronic acids. Part (D) shows the results of a pH optimization assay. The maximum activity was detected at pH 5.5. This assay was carried out as follows: lmg/ml aldouronic acids solution was incubated with 6nM Cb909 for 10 minutes at 75 °C at each pH. 50mM citrate buffer containing 150mM NaCl was used in the range from pH 5 to pH 6. 50mM phosphate buffer containing 150mM NaCl was used in the range of pH 6 to pH 7. After the reaction, the temperature was quickly increased to 100 °C to terminate the reaction. The amounts of products were detected by HPLC. Part (E) shows the results of optimum temperature assay. The maximum activity of Cb909 was detected at 75 °C (xylobiose and xylotriose). Xylose was produced most efficiently at 70 °C but the amounts of produced xylose at 70 °C and 75 °C were almost the same. This assay was carried out as follows: lmg/ml aldouronic acids solution was incubated with 6nM Cb909 for 10 minutes in 50mM citrate buffer pH 5.5 that contained 150 mM NaCl. After the reaction the temperature was quickly increased to 100 °C to terminate the reaction. The amounts of products were detected by HPLC.

[00102] Figure 9, part (A) shows the putative domain architecture of Cb2487. The putative conserved domains of Cb2487 were analyzed through the NCBI Conserved Domains Database search tool. Part (B) shows SDS-PAGE of purified Cb2487. Part (C) shows a biochemical assay to determine the optimum pH of Cb2487. For pH optimum assay, para-nitrophenyl-beta-D- xylopyranoside (pNP-X, 0.8 mM) was incubated with Cb2487 (concentration 10 nM) at 75°C in different buffers: pH 4.0-6.0 (citrate buffer, 50 mM, 150 mM NaCl), pH 6.0-8.0 (phosphate buffer, 50 mM, 150 mM NaCl), pH 8.5-9.0 (Tris-HCl, 50 mM, 150 mM NaCl). Part (D) shows a biochemical assay to determine the optimum temperature of Cb2487. For temperature optimum assay, pNP-X (0.8 mM) was incubated with Cb2487 (10 nM) in citrate buffer (50 mM, pH 6.0, 150 mM NaCl) at different temperatures (40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 °C). Part (E) shows the kinetic parameters of Cb2487 with /?NP-P-D-xylopyranoside as substrate. For the left side panel, the kinetic parameters were determined at 90 °C, pH 6.0. For the right side panel, the kinetic parameters were determined at 75 °C, pH 6.0. For these assays, different concentrations of pNP-X (0.08-24 mM) were incubated with Cb2487 (10 nM) in citrate buffer (50 mM, pH 6.0, 150 mM NaCl) at 75 and 90°C. Part (F) shows hydrolytic activity of Cb2487 on xylo-oligosaccharides. Cb2487 (0.5 μΜ) was incubated with different xylo-oligosaccharides (X 2-6 ) at 75 °C for 15 hr and then the products were separated by TLC. Part (G) shows thermostability assay for Cb2487. Cb2487 was incubated in citrate buffer (pH 6.0, 50 mM) at different temperatures (70, 75, 80, 85, 90, 95°C) without substrate addition, the protein was taken at different times (0, 10 min, 30 min, 1 h, 3 h, 4, 8 h, 12 h, 24 h) and the residual activity was assayed with /?NP-X as substrate. Part (H) shows synergism of β-xylosidase (Cb2487) and a-glucuronidase (Cb909). Aldouronic acids were incubated with Cb2487 (0.5 μΜ) and Cb909 (0.5 μΜ) in citrate buffer (pH 6.0) at 75 °C overnight, then assayed with HPLC. Adding Cb909 cleaved off the methylglucuronic acid decorations in aldouronic acids to release xylose and xylo- oligosaccharides. Adding Cb2487 cleaved available beta-l,4-xylosidic linkages to release more xylose. Mixing the two enzymes led to the conversion of the xylo-oligosaccharides released by Cb909 to xylose by Cb2487.

[00103] Figure 10, part (A) shows the domain structure of Cbl62; the protein has a single domain of acetyl xylan esterase. Part (B) shows an SDS-PAGE of purified Cbl62. Part (C) shows the pH profile of Cbl62 on /?NP-acetate using /?ara-nitrophenol adducted acetate (pNP- acetate) as a substrate. The released /?NP was monitored continuously at an absorbance of 400 nm using Synergy 2 Microplate reader (BioTek). The initial rate of hydrolysis was adopted as an enzyme activity. The pH effect on the Cbl62 was examined at 50°C in the presence of 50 mM citrate-NaOH (pH 4.0 to 6.0) or 50 mM Na 2 HP0 4 -HCl (pH 6.0 to 8.0), with 150 mM NaCl. 0.1 μΜ of purified Cbl62 and 2 mM /?NP-acetate were used for this assay. Part (D) shows the temperature profile of Cbl62 on /?NP-acetate. The temperature profile was performed in 50 mM Na 2 HP0 4 -HCl, pH 7.0 and 150 mM NaCl, at temperatures between 40 °C and 75 °C with 5 °C increments. 0.04 μΜ of purified Cbl62 and 2 mM /?NP-acetate were used for this assay. Part (E) shows the thermostability profile of Cbl62 on /?NP-acetate; 0.02 μΜ of purified Cbl62 in 50 mM Na 2 HP0 4 -HCl, pH 7.0 and 150 mM NaCl was incubated for 0 to 24 hours at temperatures between 60 °C and 80 °C with 5 °C intervals, and the residual activities were measured. Part (F) shows a kinetic study of Cbl62. 0.04 μΜ of purified Cbl62 in 50 mM Na 2 HP0 4 -HCl, pH 6.0, and 150 mM NaCl was incubated with various concentrations of /?NP-acetate, and the initial rate of hydrolysis was plotted on the graph. The kinetic parameters were determined by Michaelis- Menten equation utilizing Graph Pad Prism v5.01 (GraphPad Software).

[00104] Figure 11 shows synergy of C. bescii hemicellulolytic enzymes on soluble wheat arabinoxylan (SWAX) hydrolysis. SWAX (8.0%, w/v) was incubated with different

hemicellulase mixes at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar [part (A)] and HPLC [part (B)] analysis. The hemicellulases applied include Cbl93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ).

[00105] Figure 12 shows synergy of C. bescii hemicellulolytic enzymes on oatspelt xylan (OSX) hydrolysis. OSX (8.0%, w/v) was incubated with different hemicellulase mixes at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar [part (A)] and HPLC [part (B)] analysis. The hemicellulases applied include Cbl93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ).

[00106] Figure 13, part (A) shows soluble wheat arabinoxylan hydrolysis with hemicellulase cocktail at different temperatures. SWAX (8.0%, w/v) was incubated with CM93 (0.5 μΜ), Cb2487 (4 μΜ), CM 172 (0.5 μΜ), CM62 (0.5 μΜ), and Cb909 (0.5 μΜ) at 65 °C, 70 °C, 75 °C, 80 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay. Part (B) shows birch wood xylan hydrolysis with hemicellulase cocktail at different temperatures. BWX (8.0%, w/v) was incubated with CM93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ) at 65 °C, 70 °C, 75 °C, 80 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay. Part (C) shows oat spelt xylan hydrolysis with hemicellulase cocktail at different temperatures. OSX (8.0%, w/v) was incubated with CM93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ) at 65 °C, 70 °C, 75 °C, 80 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay.

[00107] Figure 14, part (A) shows SWAX hydrolysis was improved by adding two xylanases (Cbl95 and Cbl93) in the hemicellulase mixture. SWAX (8.0%, w/v) was incubated with different hemicellulase mixes at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar analysis. Different hemicellulase mixtures were applied in the hydrolysis: Mix I) Cbl95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ); Mix II) Cbl93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ); or Mix III) CM95 (0.25 μΜ), CM93 (0.25 μΜ), CM 172 (0. 5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ). Part (B) shows BWX hydrolysis was improved by adding two xylanases (CM 95 and CM 93) in the hemicellulase mixture. BWX (8.0%, w/v) was incubated with different hemicellulase mixes at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar analysis. Different hemicellulase mixtures were applied in the hydrolysis: Mix I) CM95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ); Mix II) CM93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ); or Mix III) CM95 (0.25 μΜ), CM93 (0.25 μΜ), CM 172 (0. 5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ). Part (C) shows OSX hydrolysis was improved by adding two xylanases (CM 95 and CM 93) in the hemicellulase mixture. OSX (8.0%, w/v) was incubated with different hemicellulase mixes at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar analysis. Different hemicellulase mixtures were applied in the hydrolysis: Mix I) CM 95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ); Mix II) CM93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ); or Mix III) CM95 (0.25 μΜ), CM93 (0.25 μΜ), CM 172 (0. 5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ).

[00108] Figure 15 shows soluble wheat arabinoxylan hydrolysis with hemicellulase cocktail of Caldicellulosiruptor bescii. Different concentrations of SWAX (1.0, 2.0, 4.0, 6.0, 8.0%, w/v) were incubated with CM93 (0.5 μΜ), CM95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), CM62 (0.5 μΜ), and Cb909 (0.5 μΜ) for 15 hr at 75 °C in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay. Part (A) shows reducing sugar in the control and hydrolysis mixtures, and part (B) shows comparison of calculated and average of actual reducing sugar in hydrolysis mixtures with different substrate concentrations.

[00109] Figure 16 shows birch wood xylan hydrolysis with hemicellulase cocktails of Caldicellulosiruptor bescii. Different concentrations of BWX (1.0, 2.0, 4.0, 6.0, 8.0%, w/v) were incubated with Cbl93 (0.5 μΜ), Cbl95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cbl62 (0.5 μΜ), and Cb909 (0.5 μΜ) at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay. Part (A) shows reducing sugar in the control and hydrolysis mixtures, and part (B) shows comparison of calculated and average of actual reducing sugar in hydrolysis mixtures with different substrate concentrations.

[00110] Figure 17 shows oat spelt xylan hydrolysis with hemicellulase cocktail of

Caldicellulosiruptor bescii. Different concentrations of OSX (1.0, 2.0, 4.0, 6.0, 8.0%, w/v) were incubated with Cbl93 (0.5 μΜ), Cbl95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), CM62 (0.5 μΜ), and Cb909 (0.5 μΜ) at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay. Part (A) shows reducing sugar in the control and hydrolysis mixtures, and part (B) shows comparison of calculated and average of actual reducing sugar in hydrolysis mixtures with different substrate concentrations.

[00111] Figure 18: Schematic structures of wild-type CM952 and its truncation mutants. The signal peptide is shown in filled rectangle. GH9: family 9 glycoside hydrolase domain; GH5: family 5 glycoside hydrolase domain; CBM3c: family 3 type C carbohydrate binding module; CBM3b: family 3 type B carbohydrate binding module.

[00112] Figure 19: SDS-PAGE of CM952 wild-type and its truncation mutants. Lane 1: protein molecular mass marker; lane 2: CM 952 wild-type; lane 3: CM952TM1; lane 4:

CM952TM2; lane 5: CM952TM3; lane 6: CM952TM4; lane 7: CM952TM5; lane 8:

CM952TM6; lane 9: CM952TM7. Two μg of each enzyme was resolved on a 12% SDS polyacrylamide gel.

[00113] Figure 20: Enzymatic activity of CM952WT on natural substrates from a reducing sugar assay. Twelve different substrates were tested: Avicel, phosphoric acid swollen cellulose (PASC), sodium carboxymethyl cellulose (CMC-Na), lichenin, mannan, locust bean gum (LBG), guar gum, konjac glucomannan (KGM), wheat arabinoxylan (WAX), birchwood xylan (BWX), oat-spelt xylan (OSX) and xyloglucan. Incubation of enzymes with Avicel, PASC, CMC-Na, lichenin, mannan, LBG, guar gum, KGM, WAX and OSX substrates led to release of products that were quantified as a concentration of glucose equivalents. The CM952WT mainly hydrolyzes glucose- and mannose-configured substrates, but not xylose-configured substrates.

[00114] Figure 21: Enzymatic activity of CM952TM1 on natural substrates from a reducing sugar assay. Twelve different substrates were tested: Avicel, phosphoric acid swollen cellulose (PASC), sodium carboxymethyl cellulose (CMC-Na), lichenin, mannan, locust bean gum (LBG), guar gum, konjac glucomannan (KGM), wheat arabinoxylan (WAX), birchwood xylan (BWX), oat-spelt xylan (OSX) and xyloglucan. Incubation of enzymes with Avicel, PASC, CMC-Na, lichenin, mannan, LBG, guar gum, KGM, WAX, BWX, OSX and xyloglucan substrates led to release of products that were quantified as a concentration of glucose equivalents. The results show that CM952TM1 mainly hydrolyzes glucose-configured substrates. It also has some activities on mannose-configured substrates. It has low activities on xylose-configured substrates.

[00115] Figure 22: Enzymatic activity of CM952TM5 on natural substrates from a reducing sugar assay. Twelve different substrates were tested: Avicel, phosphoric acid swollen cellulose (PASC), sodium carboxymethyl cellulose (CMC-Na), lichenin, mannan, locust bean gum (LBG), guar gum, konjac glucomannan (KGM), wheat arabinoxylan (WAX), birchwood xylan (BWX), oat-spelt xylan (OSX) and xyloglucan. Incubation of enzymes with CMC-Na, lichenin, mannan, LBG, guar gum and KGM substrates led to release of products that were quantified as a concentration of mannose equivalents. The CM952TM5 mainly hydrolyzes mannose-configured substrates, but does not have obvious activity on glucose- or xylose-configured substrates.

[00116] Figure 23: Thin Layer Chromatography (TLC) analysis of enzymatic activity of CM952WT, CM952TM1 and CM952TM5 on glucose and cellooligosaccharides. Gl, G2, G3, G4, G5, and G6 refer to glucose, cellobiose, cellotriose, cellotetraose, cellopentaose, and cellohexaose, respectively. CM952WT and CM952TM1 hydrolyze cellotriose, cellotetraose, cellopentaose and cellohexaose into glucose and cellobiose, but have no activity on cellobiose. CM952TM5 has no activity on glucose and any of the cellooligosaccharides tested. None of the enzyme has transglycosylation activity on glucose and cellooligosaccharides.

[00117] Figure 24: Thin Layer Chromatography (TLC) analysis of enzymatic activity of CM952WT , CM952TM1 and CM952TM5 on mannose and mannooligosaccharides. Ml, M2, M3, M4, M5, and M6 refer to mannose, mannobiose, mannotriose, mannotetraose,

mannopentaose and mannohexaose, respectively. CM952WT and CM952TM5 hydrolyze mannotriose, mannootetraose, mannopentaose and mannohexaose into mannose and smaller mannooligosaccharides, but have no hydrolyzing activity on mannobiose. CM952TM1 hydrolyzes mannopentaose and mannohexaose into smaller oligosaccharides but has no hydrolyzing activity on mannobiose, mannotriose, mannotriose and mannotetraose. None of the enzyme has transglycosylation activity on mannose and mannooligosaccharides.

[00118] Figure 25: HPLC analysis of enzymatic activity of CM952TM1 on cellulose substrates. Three different cellulosic substrates were tested: Avicel, CMC-Na and PASC. In each case, in the presence of CM952TM1, glucose and cellobiose were released. In the absence of CM952TM1, neither glucose nor cellobiose was observed for all the substrates. The results showed that this part of the enzyme or polypeptide (CM952) cleaves glucose and cellobiose as end products from cellulosic substrates (Avicel, CMC-Na and PASC).

[00119] Figure 26: HPLC analysis of time-course hydrolysis of PASC by CM952TM1. 100 nanomolar of CM952TM1 was incubated with 2.5 mg/ml PASC at 75 °C. At different time intervals (0, 0.5 min, 2 min, 10 min, 1 h, 4 h and 24 h), samples were taken out and immediately boiled for 10 min to inactivate the enzyme. After centrifugation, the supernatants of the samples were appropriately diluted with water and applied to HPLC analysis. The results show that CM952TM1 initially releases glucose, cellobiose, cellotriose and cellotetraose. With increasing time, only glucose and cellobiose were left in the reaction mixture.

[00120] Figure 27: Thermostability of CM952WT using PASC as substrate for activity measurement. CM952WT has 75%, 43%, 17% and 12% activity after incubation at 70 °C, 75 °C, 80 °C and 85 °C for 24 h, respectively. 500 nM CM952WT was kept at different

temperatures (70 °C, 75 °C, 80 °C and 85 °C). The samples were taken out at different time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately used for enzyme activity measurement. The enzyme activity was measured at pH 5.5 and at 85 °C on a thermomixer. 2.5 mg/ml final concentration of PASC was used for measurement, and 8.31 μΐ of the protein sample was added to the substrate and mixed by pipetting up and down for several times. The total volume was 100 μΐ. The reducing ends corresponding to glucose equivalents were measured according to the methods of Lever, M. (A new reaction for colorimetric determination carbohydrates. Anal. Biochem. 1972: 47; 273-279). The velocity of reaction in 10 minutes was calculated. The velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the velocity of reaction at time 0, then multiplied by 100, respectively.

[00121] Figure 28: Thermostability of CM952TM1 using PASC as substrate for activity measurement. CM952TM1 has 94%, 76%, 18% and 13% activity after incubation at 70 °C, 75 °C, 80 °C and 85 °C for 24 h, respectively. 500 nM CM952TM1 was kept at different

temperatures (70 °C, 75 °C, 80 °C and 85 °C). The samples were taken out at different time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately used for enzyme activity

measurement. The enzyme activity was measured at pH 5.5 and at 85 °C on a thermomixer. 2.5 mg/ml final concentration of PASC was used for measurement, and 8.31 μΐ of the protein sample was added to the substrate and mixed by pipetting up and down for several times. The total volume was 100 μΐ. The reducing ends corresponding to glucose equivalents were measured according to the methods of Lever, M. (supra). The velocity of reaction in 10 minutes was calculated. The velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the velocity of reaction at time 0, then multiplied by 100, respectively.

[00122] Figure 29: Domain architecture of wild-type (WT) CM953, CM953TM1 and CM953TM2.

[00123] Figure 30: SDS-polyacrylamide gel with purified wild-type CM953, CM953TM1 and CM953TM2 proteins. [00124] Figure 31: A zymogram of CM953WT, CM953TM1, CM953TM2 on carboxylmethyl cellulose (CMC). The gel was prepared as in standard dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) with CMC substrate (final 0.1%, w/v). After electrophoretic fractionation of the proteins, gels were washed twice in distilled water and incubated in 30 mL of refolding buffer (20 mM citrate buffer, pH 6.0, 2.5% Triton X-100, 2 mM dithiothreitol, 2.5 mM CaCl 2 ) for 1 hour at 25 °C and then held overnight in fresh buffer at 37°C. The gel was washed twice in 50 mM Citrate buffer (pH 6.0) and then the results were visualized by staining with 0.1% Congo red and destaining with 1M NaCl. As shown in Figure 31, CM953WT and CM953TM2 showed significant white bands at the positions of their expected sizes indicating cellulase activity, but not CM953TM1 protein.

[00125] Figures 32 and 33: Enzymatic activity of CM953WT, CM953TM1, and

CM953TM2 on natural substrates from a reducing sugar assay. Seven different substrates were tested: Avicel, Phosphoric acid swollen cellulose (PASC), carboxylmethyl cellulose (CMC), wheat arabinoxylan (WAX), lichenin, konjac glucomannan, and mannan. Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of glucose equivalents. The tubes were incubated with constant mixing in a Thermomixer R (Eppendorf) at 75°C for 16 h. The tubes were centrifuged at 10,000 rpm for 5 min at 4°C. 50 of sample supernatant was transferred to a clean 1.5 mL centrifuge tube for the /?HBAH assay. The wavelength at 410 nm was measured for the standards and samples. The A 41 o n m and glucose concentrations were plotted against each other, and linear regression was used to fit a line to the data. The reactions were resolved by thin layer chromatography (TLC), The mobile phase consisted of n-butanol: acetic acid:H20, 10:5: 1 (vol/vol/vol) and 10cm x 20cm plates were used. The reducing sugar assay (Figure 32) and TLC (Figure 33) results show that CM953WT and CM953TM2 have cellulase activity whereas CM953TM1 has mannanase activity.

[00126] Figure 34: HPLC analysis of time course of enzymatic activity of CM953TM2 on PASC. For analysis of the products of hydrolysis, the samples were analyzed by high performance anion-exchange chromatography (HPAEC). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected onto a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA Biosciences (Chelmsford, MA). For the analysis, glucose and five different cellooligosaccharides (cellobiose, cellotriose, cellotetraose, cellopentaose, and cellohexaose) were used as standards. In the reaction, CM953TM2 started to release cellooligosaccharides (C2- C4) and then glucose was released later. The results showed that this enzyme releases mainly cellobiose from PASC.

[00127] Figures 35 and 36: Thermostability of CM953WT (Figure 35) and CM953TM2 (Figure 36) on PASC. Fifty nM CM953WT and CM953TM2 were kept at different

temperatures (70 °C, 75 °C, 80 °C, 85 °C and 90 °C). The samples were taken out at different time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately used in enzyme activity measurement. The enzyme activity was measured at 85 °C using Cary 300 UV-Vis

spectrophotometer (Varian). The initial velocity of reaction in the first minute was calculated. The initial velocity of reaction for time 0 was set as 100; then the remaining activities

(percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the initial velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the initial velocity of reaction at time 0, then multiplied by 100. From the results, CM953WT (Figure 35) and

CM953TM2 (Figure 36) were quite stable at 70 °C and 75 °C, maintaining activity of 75-90% of heat non-treated proteins.

[00128] Figure 37: Kinetic studies of CM953WT (Figure 37A) and CM953TM2 (Figure 37B) on PASC. 0.05 μΜ of purified CM953WT or CM953TM2 in 50 mM Na 2 HP0 4 -HCl, pH 6.0, and 150 mM NaCl was incubated with various concentrations of phosphoric acid swollen cellulose (PASC), and the initial rate of hydrolysis was plotted against substrate concentration. The kinetic parameters ( K m : 7.603 mg/mL, k cai : 7.513 s "1 and k c K m : 0.988 s "1 mL/mg for CM953WT and K m : 3.032 mg/mL, £ cat : 5.411 s "1 and k c K m : 1.785 s "1 mL/mg for

CM953TM2 ) were determined by fitting the data to the Michaelis-Menten equation (Graph Pad Prism v5.01).

[00129] Figure 38: Domain architecture of wild-type (WT) CM954, CM954TM3 and CM954TM5 polypeptides. [00130] Figure 39: Panel (A): SDS-polyacrylamide gel with purified CM954TM3 protein. Panel (B): Enzymatic activity of CM954TM3 on natural substrates from a reducing sugar assay. Three different cellulose substrates were tested: Avicel, sodium carboxymethyl cellulose (CMC- Na) and phosphoric acid swollen cellulose (PASC). Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of glucose equivalents.

Hydrolysis of PASC was higher than hydrolysis of other substrates.

[00131] Figure 40: HPLC analysis of enzymatic activity of CM954TM3 on cellulosic substrates. Three different cellulosic substrates were tested: Avicel, CMC-Na and PASC. In each case, in the presence of CM954TM3, glucose and cellobiose were released. In the absence of CM954TM3, neither glucose nor cellobiose was observed for all the substrates. The results showed that this enzyme releases glucose and cellobiose, and also longer chain oligosaccharides as end products from cellulosic substrates (CMC-Na and PASC).

[00132] Figure 41: Thermostability of CM954TM3. CM954TM3 has 75%, 87%, 64% and 7% activity after incubation at 70 °C, 75 °C, 80 °C and 85 °C for 24 h, respectively. 500 nM CM954TM3 was kept at different temperatures (70 °C, 75 °C, 80 °C and 85 °C). The enzyme activity was measured at pH 5.5 and at 95 °C on a thermomixer. 2.5 mg/ml final concentration of PASC was used for measurement, and 10 μΐ of the protein sample was added to the substrate and mixed by pipetting up and down for several times. The total volume was 100 μΐ. The reducing ends corresponding to glucose equivalents were measured according to the methods of Lever, M. (supra). The velocity of reaction in 10 minutes was calculated. The velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the velocity of reaction at time 0, then multiplied by 100, respectively.

[00133] Figure 42: Domain architecture of wild-type (WT) CM946, CM946TM1 and CM946TM2 polypeptides.

[00134] Figure 43: SDS-polyacrylamide gel with purified wild-type CM946, CM946TM1 and CM946TM2 proteins. [00135] Figure 44: Zymogram of CM946WT, CM946TM1, and CM946TM2 on carboxylmethyl cellulose (CMC) agar plate. The agar plate was prepared with CMC substrate (final 0.25%, w/v). After spotting 1 μg of each protein on agar-CMC plates, the plate was incubated at 37°C overnight and then the gel was visualized by staining with 0.1% Congo red and destaining with 1M NaCl. As shown in Figure 44, CM946WT and CM946TM2 showed significant halos on the agar plate indicating cellulase activity, but not CM953TM1 proteins.

[00136] Figures 45 and 46: Thin Layer Chromatography (TLC) (Figure 45) and High Performance Liquid Chromatography (HPLC) (Figure 46) analysis of enzymatic activity of CM946WT, CM946TM1, CM946TM2 on phosphoric acid swollen cellulose (PASC). Each enzyme (final 0.5 μΜ) was reacted with phosphoric acid swollen cellulose (PASC) at 1% final concentration in 50 mM citrate-150 mM NaCl, pH 6.0 at 75°C for 16 hours. The reactions were resolved by thin layer chromatography (TLC) (Figure 45). The mobile phase consisted of n- butanol: acetic acid:H20, 10:5: 1 (vol/vol/vol) and 10cm x 20cm plates were used. In Figure 45, CI, C2, C3, C4, and C5 refer to glucose, cellobiose, cellotriose, cellotetraose and cellopentaose, respectively. For more quantitative analysis of the products of hydrolysis, the samples were analyzed by high performance anion-exchange chromatography (HPAEC) (Figure 46). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected into a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA Biosciences (Chelmsford, MA). For the TLC and HPLC analysis, glucose and five different cellooligosaccharides were used: cellobiose, cellotriose, cellotetraose, cellopentaose, and cellohexaose as standards. Based on the results of TLC and HPLC, CM953WT and CM953TM2 showed significant release of products such as glucose, cellobiose, cellotriose, and cellotetraose from PASC substrate, indicating that

CM946WT and CM953TM2 have cellulase activities, but not CM953TM1.

[00137] Figure 47: Domain architecture of wild-type Cb629 and Cb629TMl polypeptides.

[00138] Figure 48: SDS-polyacrylamide gel with purified Cb629TMl protein. [00139] Figure 49: Enzymatic activity of Cb629TMl on substrates with products determined through a reducing sugar assay. Three different cellulose substrates were tested: Avicel, sodium carboxymethyl cellulose (CMC-Na) and phosphoric acid swollen cellulose (PASC). Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of glucose equivalents. Hydrolysis of PASC was higher than hydrolysis of the other substrates.

[00140] Figure 50: HPLC analysis of enzymatic activity of Cb629TMl on substrates. Three different cellulosic substrates were tested: Avicel, CMC-Na and PASC. In each case, in the presence of Cb629TMl, glucose and cellobiose were released. In the absence of Cb629TMl, neither glucose nor cellobiose was observed from all the substrates. The results showed that this enzyme releases glucose and cellobiose as end products from cellulosic substrates (Avicel, CMC-Na and PASC).

[00141] Figure 51: TLC analysis of enzymatic activity of Cb629TMl on cello- oligosaccharides. Gl, G2, G3, G4, G5, and G6 refer to glucose, cellobiose, cellotriose, cellotetraose, cellopentaose, and cellohexaose respectively.

[00142] Figure 52: Thermostability of Cb629TMl. Cb629TMl has 109%, 99%, 96%, 83% and 34% activity after incubation at 60 °C, 65 °C, 70 °C, 75 °C and 80 °C for 24 h, respectively. 500 nM Cb629TMl was kept at different temperatures (60 °C, 65 °C, 70 °C, 75 °C and 80 °C). The samples were taken out at different time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately used for enzyme activity measurement. The enzyme activity was measured at pH 5.5 and at 70 °C on a thermomixer. 2.5 mg/ml final concentration of PASC was used for measurement, and 8.31 μΐ of the protein sample was added to the substrate and mixed by pipetting up and down for several times. The total volume was 100 μΐ. The reducing ends corresponding to glucose equivalents were measured according to the methods of Lever, M. (supra). The velocity of reaction in 10 minutes was calculated. The velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the velocity of reaction at time 0, then multiplied by 100, respectively. [00143] Figure 53: Panel (A): Domain architecture of wild-type Cb486. Panel (B): SDS- polyacrylamide gel with purified wild-type Cb486 protein.

[00144] Figure 54: TLC analysis of enzymatic activity of Cb486 on xylo-oligosaccharides (X 2 -X 6 ). The following xylo-oligosaccharides (X 2 -X 6 ) were tested: xylobiose, xylotriose, xylotetraose, xylopentaose and xylohexaose. This was done by an overnight hydrolysis of the xylo-oligosaccharides followed by resolving of the products with TLC. In each case, in the presence of Cb486, xylose and xylobiose were released. In the absence of Cb486, only minor amount of xylose was observed for xylobiose; no products of hydrolysis were released for other xylo-oligosaccharides. The results showed that this enzyme releases xylose and xylobiose from xylo-oligosaccharides (xylobiose, xylotriose, xylotetraose, xylopentaose and xylohexaose). XI, X2, X3, X4, X5, and X6 refer to xylose, xylobiose, xylotriose, xylotetraose, xylopentaose, and xylohexaose, respectively.

[00145] Figure 55: TLC analysis of enzymatic activity of Cb486 on glucose and

cellooligosaccharides. Glucose and five different cellooligosaccharides were used for the assay: cellobiose, cellotriose, cellotetraose, cellopentaose and cellohexaose. C2, C3, C4, and C5 refer to cellobiose, cellotriose, cellotetraose and cellopentaose, respectively.

[00146] Figure 56: Panels A and B show the pH and temperature profiles, respectively of the activity of Cb486. For these assays, the enzyme concentration of Cb486 was 10 nM. For pH profiling, the reactions were carried out in two buffers: 50 mM sodium citrate, 150mM NaCl (pH 4.0-pH 6.0) and 50 mM Na 2 HP0 4 -NaH 2 P0 4 , 150 mM NaCl (pH 6.5-pH 8.0). The enzyme was incubated with 1 mM /?NP- -D-galactopyranoside in each buffer at a given pH at 75°C, and the activities in a 30 min assay were determined. For determination of optimal temperature, 10 nM of Cb486 was incubated with 1 mM /?NP- -D-galactopyranoside at pH 5.5 at different temperatures ranging from 40 °C to 95 °C with a 5°C interval. The releasing of /?NP was recorded by monitoring the increase of optical density at 410 nM with a Cary 300 UV- Visible spectrophotometer (Agilent, Santa Clara CA). [00147] Figure 57: Domain architecture (top) and SDS-polyacrylamide gels containing purified proteins (bottom) of a cellulase mixture composed of Cb629TMl, Cb486, CM946TM2, CM952TM1, CM953TM2, and Cbl954TM3 cellulases.

[00148] Figure 58: SDS-polyacrylamide gels containing purified proteins of the

hemicellulases Cbl93, Cbl95, Cbl l72, Cb909, Cb2487, and Cbl62.

[00149] Figures 59 and 60: TLC (Figure 59) and HPLC (Figure 60) analysis of samples of microwave -pretreated Miscanthus that were treated with a cellulase mixture containing

Cb629TMl, Cb486, CM946TM2, CM952TM1, CM953TM2, and CM954TM3 cellulases and/or a hemicellulase mixture containing Cbl93, Cbl95, Cbl 172, Cb909, and Cb2487 hemicellulases. Figure 59 shows analysis of assays with samples containing 2%, 5%, or 8% Miscanthus, and Figure 60 shows analysis of an assay with a sample containing 8% Miscanthus. In Figure 59, CI, C2, C3, C4, and C5 refer to glucose, cellobiose, cellotriose, cellotetraose and cellopentaose, respectively. XI, X2, X3, X4, and X5 refer to xylose, xylobiose, xylotriose, xylotetraose and xylopentaose, respectively. Al refers to arabinose. For Figure 60, the 8% substrate reaction samples were analyzed by high performance anion-exchange chromatography (HPAEC). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected onto a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA Biosciences (Chelmsford, MA).

[00150] Figures 61 and 62: TLC (Figure 61) and HPLC (Figure 62) analysis of samples of autoclave -pretreated Miscanthus that were treated with a cellulase mixture containing

Cb629TMl, Cb486, CM946TM2, CM952TM1, CM953TM2, and CM954TM3 cellulases and/or a hemicellulase mixture containing Cbl93, Cbl95, Cbl 172, Cb909, and Cb2487 hemicellulases. Figure 61 shows analysis of assays with samples containing 2%, 5%, or 8% Miscanthus, and Figure 62 shows analysis of an assay with a sample containing 8% Miscanthus. In Figure 61, CI, C2, C3, C4, and C5 refer to glucose, cellobiose, cellotriose, cellotetraose and cellopentaose, respectively. XI, X2, X3, X4, and X5 refer to xylose, xylobiose, xylotriose, xylotetraose and xylopentaose, respectively. For Figure 62, the 8% substrate reaction samples were analyzed by high performance anion-exchange chromatography (HPAEC). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected onto a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA Biosciences (Chelmsford, MA).

[00151] Figures 63 and 64: TLC (Figure 63) and HPLC (Figure 64) analysis of samples of microwave -pretreated 8% Miscanthus samples that were treated with a cellulase mixture containing Cb629TMl, CM946TM2, CM952TM1, CM953TM2, and CM954TM3 cellulases (the mixture lacks the β-glucosidase Cb486), and/or a hemicellulase mixture containing Cbl93, Cbl95, Cbl 172, Cb909, and Cb2487 hemicellulases. In Figure 63, CI, C2, C3, C4, and C5 refer to glucose, cellobiose, cellotriose, cellotetraose and cellopentaose, respectively. XI, X2, X3, X4, and X5 refer to xylose, xylobiose, xylotriose, xylotetraose and xylopentaose, respectively. For Figure 64, the reaction samples were analyzed by high performance anion-exchange

chromatography (HPAEC). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected onto a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex

Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA

Biosciences (Chelmsford, MA).

[00152] Figures 65 and 66: TLC (Figure 65) and HPLC (Figure 66) analysis of samples of autoclave -pretreated 8% Miscanthus samples that were treated with a cellulase mixture containing Cb629TMl, CM946TM2, CM952TM1, CM953TM2, and CM954TM3 cellulases (the mixture lacks the β-glucosidase Cb486), and/or a hemicellulase mixture containing Cbl93, Cbl95, Cbl 172, Cb909, and Cb2487 hemicellulases. In Figure 65, CI, C2, C3, C4, and C5 refer to glucose, cellobiose, cellotriose, cellotetraose and cellopentaose, respectively. XI, X2, X3, X4, and X5 refer to xylose, xylobiose, xylotriose, xylotetraose and xylopentaose, respectively. For Figure 66, the reaction samples were analyzed by high performance anion-exchange

chromatography (HPAEC). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected onto a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex

Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA

Biosciences (Chelmsford, MA). [00153] Figure 67: Panels A and B show the pH and temperature profiles, respectively of the activity of CM952TM1. For pH profiling, the reactions were carried out in two buffers: 50 mM sodium citrate, 150mM NaCl (pH 4.0-pH 6.0) and 50 mM Na 2 HP0 4 -NaH 2 P0 4 , 150 mM NaCl (pH 6.5-pH 8.0). The enzyme concentration of CM952TM1 was 0.5 μΜ. The enzyme was incubated with 2.5 mg/ml PASC in each buffer at a given pH at 75°C, and the activities in a 10 min assay were determined. The reducing sugars released were measured using the /?HBAH assay. For determination of optimal temperature, 0.5 μΜ of CM952TM1 enzyme was incubated with 2.5 mg/ml PASC at pH 5.5 at different temperatures ranging from 40 °C to 95 °C with a 5°C interval.

[00154] Figure 68: Cleavage products resulting from the incubation of CM953TM2 with cellohexaose. Panel A: TLC analysis of reaction products; Panel B: HPLC analysis of reaction products. The data indicates that CM953TM2 hydrolyzes cellohexaose randomly.

[00155] Figure 69: Panels A and B show the pH and temperature profiles, respectively of the activity of CM954TM3.

[00156] Figure 70: Panels A and B show the pH and temperature profiles, respectively of the activity of Cb629TMl.

[00157] Figure 71: HPLC analysis of time course of enzymatic activity of Cb629TMl on PASC. For analysis of the products of hydrolysis, the samples were analyzed by high performance anion-exchange chromatography (HPAEC). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected onto a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA Biosciences (Chelmsford, MA). For the analysis, glucose, cellobiose, and cellotriose were used as standards.

[00158] Figure 72: Substrate specificity analysis of Cb486. 50 nM of Cb486 was incubated at 75°C in its optimal buffer (50 mM sodium citrate, 150 mM NaCl, pH5.5) with 1 mM each of /?NP-CC-L-arabinopyranoside, /?NP- -D-fucopyranoside, /?NP- -D-galactopyranoside, /?ΝΡ-β-ϋ- glucopyranoside, /?NP- -D-xylopyranoside, and /?NP- -D-cellobiose, respectively, for 30 min. The release of /?NP was recorded by monitoring the increase in optical density at 410 nM with a Cary 300 UV- Visible spectrophotometer (Agilent, Santa Clara CA).

[00159] Figure 73: Hydrolysis of treated miscanthus with cellulase and/or hemicellulase mixtures. Panel A: 0.5 μΜ enzyme; 2% substrate; Panel B: 0.5 μΜ enzyme; 5% substrate; Panel C: 0.5 μΜ enzyme; 8% substrate; Panel D: 1.0 μΜ enzyme; 10% substrate. Different

concentrations (2%, 5%, and 8%) of Miscanthus pre-treated using two different methods (autoclaving or micro waving) were incubated with either the cellulase mix (containing 0.5 μΜ each of CM946TM2, CM952TM1, CM953TM2, CM954TM3, Cb629TMl, and Cb486) or the hemicellulase mix (containing 0.5 μΜ each of Cbl93, Cbl95, CM 172, Cb2487, and Cb909), or both enzyme mixtures in a total volume of 500 μΐ at 75°C with an end-over-end shaking manner for 15 hours. Further, increased concentration (10%) of pretreated Miscanthus of both

pretreatment types was incubated with an increased enzyme concentration of 1.0 μΜ at 75°C with an end-over-end shaking manner for 15 hours. The reducing ends were measured using the pUBAU method.

[00160] Figure 74: Hydrolysis of treated miscanthus with enzyme mixtures lacking Cb486. For these assays, reaction mixtures with 0.5 μΜ enzyme and 8% substrate were used. Pretreated Miscanthus (8%) using two different methods (autoclaving or microwaving) was incubated with either the cellulase mix (containing 0.5 μΜ each of CM946TM2, CM952TM1, CM953TM2, CM954TM3, and Cb629TMl, but without Cb486) or the hemicellulase mix (containing 0.5 μΜ each of Cbl93, Cbl95, CM 172, Cb2487, and Cb909), or both enzyme mixtures in a total volume of 500 μΐ at 75°C with an end-over-end shaking manner for 15 hours. The reducing ends were measured using /?HBAH method.

[00161] Figure 75: Panel A: Domain architecture of CM952TM2, CM954TM3, Cb629TMl, and Cb486 polypeptides. Panel B: Analysis of samples of pretreated Miscanthus (AC = autoclaved; MW = microwaved) that were treated with CM952TM2, CM954TM3, Cb629TMl, Cb486, or a mixture containing CM952TM2, CM954TM3, Cb629TMl, and Cb486 cellulases. Pre-treated Miscanthus, using two different methods (autoclaving or microwaving), at a final concentration of 2% was incubated with an individual cellulase (CM952TM2, CM954TM3, Cb629TMl, or Cb486, 0.5 μΜ each) or a mixture containing all four cellulases in a total volume of 500 μΐ at 75°C with an end-over-end shaking manner for 15 hours. The reducing ends were measured using /?HBAH method.

[00162] Figure 76: Reducing sugar assay with Cbl946WT, Cb486, or a mixture containing CM946WT and Cb486 cellulases. The reactions were carried out using 0.5 μΜ of CM946WT, Cb486 or both enzymes in a phosphate buffer (50 mM sodium phosphate, 150 mM NaCl, pH 6.5) at a total volume of 500 μΐ in a 16-hours incubation with an end-over-end shaking manner at 75°C.

[00163] Figure 77: Analysis of PASC (Panel A) or Avicel (Panel B) samples treated with CM946WT, Cb486, or a mixture containing CM946WT and Cb486 cellulases. The reactions were carried out using 0.5 μΜ of either CM946WT or Cb486 or both enzymes in a phosphate buffer (50 mM sodium phosphate, 150 mM NaCl, pH 6.5) in a total volume of 500 μΐ in a 16- hours incubation with an end-over-end shaking manner at 75°C. Seven μΐ of the hydrolysis products were applied to TLC analysis.

[00164] Figure 78: Time course hydrolysis of PASC by CM952 WT (A), TM1 (B), and TM5 (C). Two point five mg/ml PASC was incubated with 0.5 μΜ CM 952 WT, TM1, and TM5 at 75°C. At different time intervals (0 min, 2 min, 10 min, 60 min, 4 h, and 24 h), samples were taken out and applied to HPAEC-PAD analysis.

[00165] Figure 79: Amino acid sequence alignment of the GH9 domain of CM952 with those of CloceCel9G {Clostridium cellulolyticum Cel9G, GenBank accession number:

AAA73868) (26) and ThefuCel9A (Thermobifida fusca Cel9A, GenBank accession number: AAB42155) (34). CloceCel9G (non-processive) and ThefuCel9A (processive) represent the two types of family 9 theme Bl endoglucanases whose enzyme-cello-oligosaccharide complex structures have been resolved. The asterisks indicate the identical or similar amino acid residues within the three sequences. The filled triangles indicate non-conserved residues. The numbers under a specific amino acid residue indicate the subsites of the cello-oligosaccharides interacting with this amino acid residue based on the CloceCel9G and ThefuCel9A enzyme- substrate complex structures.

[00166] Figure 80: Qualitative binding of CM952 wild- type and its truncation mutants with Avicel (A) and phosphoric acid swollen cellulose (PASC) (B). Thirty micrograms of each protein were incubated with 40 mg/ml Avicel cellulose or 2.5 mg/ml PASC in 50 mM Tris buffer, 150 mM NaCl (pH 7.5). The mixture was shaken end-over-end at 4°C for 1 h. Then the bound and unbound proteins were separated by centrifugation of the mixture at 16,400 rpm for 3 min. The cellulose pellet was washed with 1 ml buffer (50 mM Tris buffer, 150 mM NaCl, pH 7.5) for 4 times. Then the pellet was added with 70 μΐ of lx SDS-PAGE loading buffer and boiled for 5 min. The protein corresponding to one tenth volume of each fraction was applied to a 12% SDS-PAGE.

[00167] Figure 81: Thermostability of CM952 and its truncation mutants harboring cellulase activities. A: CM 952 TM2; B: CM 952 TM3; C: CM 952 TM4. The enzymes were incubated at 75°C, 80°C, and 85°C (WT, TMl, TM2, and TM3) or at 45°C, 50°C, and 55°C (TM4) on a Veriti 96-well thermal cycle. At different time points, samples were taken out and measured for their remaining activity using PASC as the substrate.

[00168] Figure 82: Amino acid sequence alignment of the CBM3c of CM952 with those from other family 9 glycoside hydrolases. The amino acid residues proposed to be involved in cellulose ligand binding based on the works of Jindou et al. (2006) and Li et al. (2010) are indicated with a filled triangle. The sources of the enzymes used for comparison are as follows. CM952: bifunctional cellulase/mannanase of Caldicellulosiruptor bescii (this study);

ADQ45731: putative cellulase of Caldicellulosiruptor kronotskyensis; ABP66693: putative cellulase of Caldicellulosiruptor saccharolyticus; ADL42950: putative Caldicellulosiruptor obsidiansis cellulase/mannan endo-l,4-beta-mannosidase; AAK06394: CelE of

Caldicellulosiruptor sp. Tok7B.l (11); AAA73868: Cel9G of Clostridium cellulolyticum (26); AAC38572: EngH of Clostridium cellulovorans (38); CAA39010: Cel9Z of Clostridium stercorarium (18); ABX43720: Cel9 of Clostridium phytofermentans (39, 48); ABN51860: Cel9I of Clostridium thermocellum DSM 1313 (50); CAB38941: Cel9B of Paenibacillus barcinonensis (32); BAB33148: CelQ of Clostridium thermocellum Fl (2); AAA23086: CenB of Cellulomonas fimi (27); AAW62376: CBP105 of Cellulomonas flavigena (28); AAB42155: Cel9A of Thermobifida fusca (16, 34).

[00169] Figure 83: SDS-PAGE of purified CM581.

[00170] Figure 84: Shows the enhancing effect of Cb 1581 on enzymatic hydrolysis of microwave pretreated miscanthus at 70°C (A) or 80°C (B). Enzymatic hydrolysis of pretreated miscanthus was carried out at pH 6.0 using 0.5 μΜ each of the cellulase/hemicellulase enzyme mixture in a total volume of 500 μΐ with 10% miscanthus as the substrate. The enzymes in the mixture include CM946TM2, CM952TM1, CM953TM2, CM954TM3, Cb629TMl, Cb486, Cbl93, Cbl95, Cb2487, CM 172, Cb909, and Cbl62, and variable amounts of recombinant CM581, as indicated. The concentration of glucose equivalents was determined following enzymatic hydrolysis of microwave pretreated miscanthus, according to the methods of Lever, M. The releasing of sugars is enhanced with the increasing amount of CM581 in the reaction mixture at both 70°C and 80°C.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

[00171] The present disclosure relates to thermostable cellulose and hemicellulose-degrading enzymes and to methods of using these enzymes for the degradation of cellulose, hemicellulose, and cellulose and hemicellulose-containing materials. The present disclosure also relates to nucleic acids encoding the enzymes disclosed herein, and enzyme cocktails containing various enzymes disclosed herein.

[00172] In one aspect, the disclosure provides enzymes having cellulase activity. Provided herein are truncated enzymes that have improved cellulase activity over wild-type cellulase proteins. Also provided herein are truncated enzymes that have similar cellulase activity to wild- type cellulase proteins. Truncated proteins may be advantageous over wild- type proteins, for example, due to lower cost or improved ease of production of truncated proteins.

[00173] In another aspect, the disclosure provides enzymes having hemicellulase activity. The hemicellulose-degrading enzymes of the present disclosure can be used alone, or in combination to degrade hemicellulose, i.e., convert hemicellulose into its structural components by cleavage of bonds, or linkages, between the component subunits present in hemicellulose. Bonds or linkages may include bonds between xylose subunits, or bonds between xylose and functional groups, or bonds between functional groups.

[00174] In another aspect, the disclosure provides enzymes that enhance the activity of enzymes having cellulase or hemicellulase activity, and/or mixtures thereof. Enzymes that enhance the activity of cellulases and/or hemicellulases may be provided alone, with cellulases, with hemicellulases, or with mixtures of cellulases and hemicellulases.

[00175] Cellulose or hemicellulose treated with the methods of the present disclosure may be at least 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% degraded. Degradation products may include glucose, cellobiose, cellodextrins, xylose, arabinose, glucuronyl groups, and acetyl groups, in addition to other functional groups and hydrocarbons. The degradation products may find use as biofuels or other value-added compounds. For example, sugars released from the cellulose or hemicellulose may be fermented for the production of ethanol.

[00176] The cellulose and hemicellulose-degrading enzymes of the present disclosure are thermostable, and are optimally able to degrade cellulose and/or hemicellulose into sugars such as glucose, xylose, or arabinose at temperatures above 50 °C. In addition, the enzymes retain substantial activity when maintained at various temperatures above 50 °C.

[00177] Without wishing to be bound by theory, another important feature of the enzyme cocktails described herein are that they are derived from the same organism, ensuring that the enzymes will function together to degrade cellulose and/or hemicellulose. Caldicellulosiruptor bescii contains a complete set of enzymes for degrading cellulose, and hemicelluloses such as xylan. Xylan is the main hemicellulose in perennial grasses, such as switchgrass, and is most likely the main hemicellulose in the giant grass Miscanthus.

[00178] In one aspect, the present disclosure provides nucleotide and amino acid sequences for thermostable enzymes that degrade hemicellulose, including Cbl93, Cbl95, CM 172, Cb909, Cb2487, and Cbl62. Cbl93 and Cbl95 function as endoxylanases. CM 172 functions as an a- arabinofuranosidase. Cb909 functions as a glucuronidase. Cb2487 functions as a β-xylosidase. Cbl62 functions as an acetyl xylan esterase. Variants of the enzymes that retain partial or complete functional activity are also encompassed by the present disclosure. The enzymes disclosed herein can be used in various combinations.

[00179] In one aspect, the disclosure provides improved enzyme mixtures for the

degradation of cellulose-containing materials. Improved enzyme mixtures for the degradation of cellulose-containing materials may contain, for example, improved mixtures of cellulases and/or truncated cellulase enzymes.

[00180] In another aspect, the disclosure provides improved enzyme mixtures for the degradation of materials containing both cellulose and hemicellulose. Enzyme mixtures disclosed herein containing both cellulases and hemicellulases provide the surprising result of synergistic activity on plant material containing both cellulose and hemicellulose. For example, as shown in Example 15 below, an enzyme cocktail provided herein containing a mixture of cellulases and a mixture of hemicellulases has greater cellulase activity on plant material than the same mixture of cellulases alone. Additionally, the enzyme cocktail containing a mixture of cellulases and a mixture of hemicellulases has greater hemicellulase activity on plant material than the same mixture of hemicellulases alone.

[00181] Combinations of enzymes, i.e., an enzyme cocktail, can be tailored to the cellulose and/or hemicellulose structure of a specific feedstock to increase the level of degradation. Initial analysis of the enzyme cocktails described herein suggests that the components have a long shelf life, an important characteristic in an industrial enzyme mix.

Abbreviations / Definitions

[00182] The following abbreviations are used in the present disclosure: TLC (thin layer chromatography); SWAX (soluble wheat arabinoxylan); OSX (oat-spelt xylan); BWX

(birchwood xylan); CMC (carboxymethyl cellulose); RAX (rye arabinoxylan); MeGlcA (4-0- methyl-D-glucuronosyl); /?NP-X (para-nitrophenyl-beta-D-xylopyranoside);GH (glycoside hydrolase); CBM (carbohydrate binding module); SDS-PAGE (sodium dodecyl sulfate polyacrylamide gel electrophoresis); PASC (phosphoric acid swollen cellulose); CMC-Na (sodium carboxymethyl cellulose); LBG (locust bean gum); KGM (konjac glucomannan); WAX (wheat arabinoxylan); HPAEC (high performance anion-exchange chromatography); HPLC (high performance liquid chromatography)

[00183] As used herein, a "polypeptide" is a chain of consecutive polymerized amino acid residues (e.g., at least about 5 consecutive polymerized amino acid residues). As used herein, the terms "polypeptide", "protein", and "amino acid sequence" are used interchangeably.

[00184] As used herein, "cellulase" activity refers to enzymatic activity which cleaves 1-4 β- D-glycosidic linkages between glucose molecules in cellulose and/or cellooligosaccharides. Cellulase activity includes endoglucanase, exoglucanase, and beta-glucosidase activity.

[00185] As used herein, "hemicellulase" activity refers to enzymatic activity which cleaves a bond in a molecule that is a component of hemicellulose, including endoxylanase, cc- arabinofuranosidase, glucuronidase, β-xylosidase, and acetyl xylan esterase activity.

[00186] Polypeptides of the Disclosure

[00187] In some aspects, polypeptides of the disclosure relate to recombinant polypeptides of the thermophilic bacterium Caldicellulosiruptor bescii (formerly Anaerocellum thermophilum DSMZ 6725), truncations, and variations thereof.

[00188] In one aspect, the present disclosure provides recombinant polypeptides related to the degradation of cellulose. In some aspects, the disclosure provides recombinant CM952, CM953, CM954, CM946, Cb629, and Cb486 polypeptides which have cellulase activity.

[00189] In one aspect, the present disclosure provides recombinant polypeptides related to the degradation of hemicellulose. In some aspects, the disclosure provides recombinant Cbl93, Cbl95, CM 172, Cb909, Cb2487, and Cbl62 polypeptides which have hemicellulase activity.

[00190] In one aspect, the present disclosure provides recombinant polypeptides that enhance the hydrolysis of cellulose and/or hemicellulose during treatment of cellulose and/or

hemicellulose with cellulase and/or hemicellulases. In one aspect, the disclosure provides recombinant CM581 polypeptide, which is a heat shock protein that enhances the hydrolysis of cellulose and/or hemicellulose during treatment of cellulose and/or hemicellulose with cellulase and/or hemicellulases.

[00191] Cellulases

[00192] CM952 polypeptides

[00193] In some aspects, the present disclosure relates to recombinant CM952 polypeptides. As used herein, a "CM952 polypeptide" refers to the polypeptide of SEQ ID NO: 44, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have cellulase activity. "CM952 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46. As used herein, "CM952 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46.

[00194] The polypeptide of SEQ ID NO: 44 is the product of the CM 952 gene in

Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The CM952 polypeptide of SEQ ID NO: 44 is an endocellulase that has a glycoside hydrolase (GH) family 9 catalytic domain (cellulase domain), three family 3 carbohydrate binding modules (CBMs) and one GH5 catalytic domain (mannanase domain) (Figure 18)

[00195] The present disclosure also includes the CM952 polypeptide of SEQ ID NO: 114, which is the CM952 polypeptide of SEQ ID NO: 44 without the signal peptide sequence. The signal peptide is produced as part of the initially translated CM952 protein to target the protein for secretion from the cell, and it may be cleaved from the protein during the secretion process. The disclosure also includes the CM952 polypeptide of SEQ ID NO: 114 with a methionine residue at the start of the polypeptide chain. [00196] The disclosure further includes the Cb 1952 polypeptide of SEQ ID NO: 46 ("CM952TM1"), which is a truncational mutant ("TM") of wild-type CM952. The CM952TM1 polypeptide includes the cellulase domain and CBMs of wt CM952, but does not include the mannase domain (Figure 18).

[00197] The disclosure also includes the CM952 polypeptide of SEQ ID NO: 124

("CM952TM2"), which is a truncational mutant of wild-type CM952 that does not include the mannase domain or the C-terminal CBM (Figure 18).

[00198] The disclosure also includes the CM952 polypeptide of SEQ ID NO: 126

("CM952TM3"), which is a truncational mutant of wild-type CM952 that does not include the mannase domain or the 2 most C-terminal CBMs (Figure 18).

[00199] The disclosure also includes the CM952 polypeptide of SEQ ID NO: 128

("CM952TM4"), which is a truncational mutant of wild-type CM952 that includes the GH9 cellulase domain, but that does not contain any of the CBMs or the mannose domain (Figure 18).

[00200] CM952 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a CM952 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a CM952 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00201] CM953 polypeptides

[00202] In some aspects, the present disclosure relates to recombinant CM953 polypeptides. As used herein, a "CM953 polypeptide" refers to the polypeptide of SEQ ID NO: 60, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have cellulase activity. "CM953 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 60, 61, and 111. As used herein, "CM953 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 60, 61, and 111.

[00203] The CM 953 polypeptide of SEQ ID NO: 60 is the product of the CM 953 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The CM 953 polypeptide of SEQ ID NO: 60 is an endoglucanase that cleaves mostly cellobiose from cellulose, and it has two glycoside hydrolase (GH) family 5 catalytic domains and 3

carbohydrate binding modules (CBM) (Figure 29).

[00204] The present disclosure also includes the Cb 1953 polypeptide of SEQ ID NO: 61 , which is the CM 953 polypeptide of SEQ ID NO: 60 without the signal peptide sequence. The signal peptide is produced as part of the initially translated CM 953 protein to target the protein for secretion from the cell, and it may be cleaved from the protein during the secretion process. The disclosure also includes the CM 953 polypeptide of SEQ ID NO: 61 with a methionine residue at the start of the polypeptide chain.

[00205] The disclosure further includes the Cb 1953 polypeptide of SEQ ID NO: 111

("CM953TM2"), which is a truncational mutant ("TM") of wild-type CM 953. The CM953TM2 polypeptide includes the C-terminal GH5 domain and the 3 CBMs of wt CM953, but does not include the N-terminal GH5 domain. (Figure 29).

[00206] CM953 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a CM 953 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a CM953 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C. [00207] CM954 polypeptides

[00208] In some aspects, the present disclosure relates to recombinant CM954 polypeptides. As used herein, a "CM954 polypeptide" refers to the polypeptide of SEQ ID NO: 74, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have cellulase activity. "CM954 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 74, 121, and 76. As used herein, "CM954 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 74, 121, and 76.

[00209] The CM 954 polypeptide of SEQ ID NO: 74 is the product of the CM 954 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The CM 954 polypeptide of SEQ ID NO: 74 is an endoglucanase that has a glycoside hydrolase (GH) family 9 catalytic domain (a cellulase domain), 3 carbohydrate binding modules (CBM), and one GH48 catalytic domain (Figure 38).

[00210] The present disclosure also includes the CM954 polypeptide of SEQ ID NO: 121, which is the CM 954 polypeptide of SEQ ID NO: 74 without the signal peptide sequence. The signal peptide is produced as part of the initially translated CM 954 protein to target the protein for secretion from the cell, and it may be cleaved from the protein during the secretion process. The disclosure also includes the CM954 polypeptide of SEQ ID NO: 121 with a methionine residue at the start of the polypeptide chain.

[00211] The disclosure further includes the CM954 polypeptide of SEQ ID NO: 76

("CM954TM3"), which is a truncational mutant ("TM") of wild-type CM954. The CM954TM3 polypeptide includes the GH9 domain and the N-terminal-most CBM of wt CM 954, but does not include the middle or C-terminal CBM, or the GH48 domain. (Figure 38). [00212] CM954 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a CM954 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a CM954 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00213] CM946 polypeptides

[00214] In some aspects, the present disclosure relates to recombinant CM946 polypeptides. As used herein, a "CM946 polypeptide" refers to the polypeptide of SEQ ID NO: 86, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have cellulase activity. "CM946 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 86, 87, and 113. As used herein, "CM946 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 86, 87, and 113.

[00215] The Cb 1946 polypeptide of SEQ ID NO: 86 is the product of the Cb 1946 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The CM946 polypeptide of SEQ ID NO: 86 is an endoglucanase that has a glycoside hydrolase (GH) family 5 catalytic domain at the N-terminal region, a GH family 44 catalytic domain at the C-terminal region and 2 carbohydrate binding modules (CBMs) between the two GH catalytic domains (Figure 42).

[00216] The present disclosure also includes the CM946 polypeptide of SEQ ID NO: 87, which is the CM946 polypeptide of SEQ ID NO: 86 without the signal peptide sequence. The signal peptide is produced as part of the initially translated CM946 protein to target the protein for secretion from the cell, and it may be cleaved from the protein during the secretion process. The disclosure also includes the CM946 polypeptide of SEQ ID NO: 87 with a methionine residue at the start of the polypeptide chain.

[00217] The disclosure further includes the Cb 1946 polypeptide of SEQ ID NO: 113

("CM946TM2"), which is a truncational mutant ("TM") of wild-type CM946. The CM946TM2 polypeptide includes the C-terminal GH44 domain and the 2 CBMs of wt CM946, but does not include the N-terminal GH5 domain. (Figure 42).

[00218] CM946 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a CM946 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a CM946 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00219] Cb629 polypeptides

[00220] In some aspects, the present disclosure relates to recombinant Cb629 polypeptides. As used herein, a "Cb629 polypeptide" refers to the polypeptide of SEQ ID NO: 98, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have cellulase activity. "Cb629 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 98, 119, and 100. As used herein, "Cb629 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 98, 119, and 100. [00221] The Cb629 polypeptide of SEQ ID NO: 98 is the product of the Cb629 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The Cb629 polypeptide of SEQ ID NO: 98 is an endocellulase that initially cleaves glucose, cellobiose, and cellotriose from cellulose, and it has a glycoside hydrolase (GH) family 5 catalytic domain, a Carbohydrate Binding Module (CBM) family 17_28 domain, and three surface layer homology (SLH) modules likely used in anchoring the enzyme to the cell surface (Figure 47).

[00222] The present disclosure also includes the Cb629 polypeptide of SEQ ID NO: 119, which is the Cb629 polypeptide of SEQ ID NO: 98 without the signal peptide sequence. The signal peptide is produced as part of the initially translated Cb629 protein to target the protein for secretion from the cell, and it may be cleaved from the protein during the secretion process. The disclosure also includes the Cb629 polypeptide of SEQ ID NO: 119 with a methionine residue at the start of the polypeptide chain.

[00223] The disclosure further includes the Cb629 polypeptide of SEQ ID NO: 100

("Cb629TMl"), which is a truncational mutant ("TM") of wild-type Cb629. The Cb629TMl polypeptide includes the N-terminal GH5 domain and the CBM17_28 domain of wt Cb629, but does not include the C-terminal SLH modules (Figure 47).

[00224] Cb629 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a Cb629 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a Cb629 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00225] CM86 polypeptides

[00226] In some aspects, the present disclosure relates to recombinant Cb486 polypeptides.

[00227] As used herein, a "Cb486 polypeptide" refers to the polypeptide of SEQ ID NO: 106, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have cellulase activity. "Cb486 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 106. As used herein, "Cb486 polypeptide" also refers to a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 106.

[00228] The Cb486 polypeptide of SEQ ID NO: 106 is the product of the Cb486 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The Cb486 polypeptide of SEQ ID NO: 106 is a β-glucosidase that catalyzes the hydrolysis of cellobiose (a disaccharide of glucose) into two units of glucose, and it has a glycoside hydrolase (GH) family 1 catalytic domain (Figure 53 A).

[00229] Cb486 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a Cb486 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a Cb486 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00230] Hemicellulases

[00231] The disclosure also provides for polypeptides of thermostable hemicellulose- degrading enzymes Cbl93 (SEQ ID NO: 3), Cbl95 (SEQ ID NO: 7), CM 172 (SEQ ID NO: 13), Cb909 (SEQ ID NO: 19), Cb2487 (SEQ ID NO: 27), and Cbl62 (SEQ ID NO: 33), or subsequences thereof. The disclosure further provides for an isolated or recombinant polypeptide comprising an amino acid sequence having at least about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more, or complete (100%) sequence identity to Cbl93, Cbl95, CM 172, Cb2487, Cb909 or Cbl62.

[00232] Hemicellulases of the present disclosure may contain one or more glycoside hydrolase (GH) domains. Hemicellulases may also contain one or more carbohydrate binding modules (CBM). The CBM modules may interrupt a GH domain or be located in between two GH domains. Hemicellulases may also contain an acetyl xylan esterase domain. In certain embodiments, the GH, CBM and/or acetyl xylan esterase domain sequence is conserved in polypeptide variants.

[00233] CM93 polypeptides

[00234] In some aspects, the present disclosure relates to recombinant Cbl93 polypeptides. As used herein, a "Cbl93 polypeptide" refers to the polypeptide of SEQ ID NO: 3, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have hemicellulase activity. "Cbl93 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 3 and/or 37. As used herein, "Cbl93 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 3 and/or 37.

[00235] The Cb 193 polypeptide of SEQ ID NO: 3 is the product of the Cb 193 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The Cbl93 polypeptide of SEQ ID NO: 3 or 37 is an endoxylanase cleaves the xylose backbone of hemicellulose at random to generate shorter chains of xylose in P-l,4-linkages. These xylo- oligosaccharides can range from two or more sugar subunits. Cbl93 has a signal peptide (corresponding to amino acids 1-41 of SEQ ID NO: 3), which may be removed. The amino acid sequence of the Cbl93 protein without the signal peptide is disclosed in SEQ ID NO: 37. The protein has two putative carbohydrate binding modules (CBM) inserted within the glycoside hydrolase (GH) family 10 catalytic domain (Figure 2A).

[00236] The present disclosure also includes the Cbl93 polypeptide of SEQ ID NO: 37, which is the Cbl93 polypeptide of SEQ ID NO: 3 without the signal peptide sequence. The signal peptide is produced as part of the initially translated Cbl93 protein to target the protein for secretion from the cell, and it may be cleaved from the protein during the secretion process. The disclosure also includes the Cbl93 polypeptide of SEQ ID NO: 37 with a methionine residue at the start of the polypeptide chain.

[00237] Cbl93 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a Cbl93 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a Cbl93 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00238] CM95 polypeptides

[00239] In some aspects, the present disclosure relates to recombinant Cbl95 polypeptides. As used herein, a "Cbl95 polypeptide" refers to the polypeptide of SEQ ID NO: 7, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have hemicellulase activity. "Cbl95 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 7. As used herein, "Cbl95 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 7. [00240] The Cb 195 polypeptide of SEQ ID NO: 7 is the product of the Cb 195 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The Cbl95 polypeptide of SEQ ID NO: 7 is an endoxylanase that cleaves the xylose backbone of hemicellulose at random to generate shorter chains of xylose in P-l,4-linkages. These xylo- oligosaccharides can range from containing two or more sugar subunits.

[00241] Cbl95 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a Cbl95 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a Cbl95 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00242] Cbl 172 polypeptides

[00243] In some aspects, the present disclosure relates to recombinant Cbl 172 polypeptides. As used herein, a "Cbl 172 polypeptide" refers to the polypeptide of SEQ ID NO: 13, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have hemicellulase activity. "Cbl 172 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 13. As used herein, "Cbl 172 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 13.

[00244] The Cbl 172 polypeptide of SEQ ID NO: 13 is the product of the Cbl 172 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The Cbl 172 polypeptide of SEQ ID NO: 13 is an cc-L-arabinofuranosidase that cleaves arabinose moiety from the xylose backbone or from branched or debranched arabinan of hemicellulose to generate exclusively arabinose. The protein has a glycoside hydrolase (GH) family 51 catalytic domain (Figure 6D).

[00245] Cbl 172 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a Cbl 172 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a Cbl 172 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00246] Cb909 polypeptides

[00247] In some aspects, the present disclosure relates to recombinant Cb909 polypeptides. As used herein, a "Cb909 polypeptide" refers to the polypeptide of SEQ ID NO: 19, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have hemicellulase activity. "Cb909 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 19. As used herein, "Cb909 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 19.

[00248] The Cb909 polypeptide of SEQ ID NO: 19 is the product of the Cb909 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The Cb909 polypeptide of SEQ ID NO: 19 is an a- glucuronidase that cleaves the cc-l,2-glycosidic bond between 4-O-methyl-D-glucuronic acid and the β-1,4- xylosidic linkage backbone of xylan.

[00249] Cb909 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a Cb909 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73,

74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a Cb909 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00250] Cb2487 polypeptides

[00251] In some aspects, the present disclosure relates to recombinant Cb2487 polypeptides. As used herein, a "Cb2487 polypeptide" refers to the polypeptide of SEQ ID NO: 27, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have hemicellulase activity. "Cb2487 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 27. As used herein, "Cb2487 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 27.

[00252] The Cb2487 polypeptide of SEQ ID NO: 27 is the product of the Cb2487 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The Cb2487 polypeptide of SEQ ID NO: 27 is a β-xylosidase.

[00253] Cb2487 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a Cb2487 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a Cb2487 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70,

75, 80, 85, or 90 °C.

[00254] CM 62 polypeptides [00255] In some aspects, the present disclosure relates to recombinant Cbl62 polypeptides. As used herein, a "Cbl62 polypeptide" refers to the polypeptide of SEQ ID NO: 33, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have hemicellulase activity. "Cbl62 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 33. As used herein, "Cbl62 polypeptide" also refers to a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 33.

[00256] The Cb 162 polypeptide of SEQ ID NO: 33 is the product of the Cb 162 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The Cbl62 polypeptide of SEQ ID NO: 33 is an acetyl xylan esterase that cleaves the linkages between xylose and the side chain of acetyl groups in hemicellulose to provide more accessibility to other hemicellulases such as xylanase and beta-xylosidase to the backbone of xylan. The protein has a single domain of acetyl xylan esterase (Figure 10A).

[00257] Cbl62 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a Cbl62 polypeptide of the present disclosure has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a Cbl62 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00258] Polypeptides that Enhance Enzymatic Hydrolysis of Cellulose and/or Hemicellulose

[00259] In some aspects, the disclosure provides for recombinant polypeptides that enhance the enzymatic hydrolysis of cellulose and/or hemicellulose. [00260] In one aspect, a recombinant polypeptide that enhances the enzymatic hydrolysis of cellulose and/or hemicellulose is a recombinant CM581 polypeptide.

[00261] As used herein, a "CM581 polypeptide" refers to the polypeptide of SEQ ID NO: 146, and truncational mutants thereof, homologs thereof, and truncational mutants of homologs thereof, which have enzymatic hydrolysis of cellulose and/or hemicellulose-enhancing activity. "CM581 polypeptide" also refers to a polypeptide that has enzymatic hydrolysis of cellulose and/or hemicellulose-enhancing activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 146. As used herein, "CM581 polypeptide" also refers to a polypeptide that has enzymatic hydrolysis of cellulose and/or hemicellulose- enhancing activity, and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 146.

[00262] The CM581 polypeptide of SEQ ID NO: 146 is the product of the CM581 gene in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The CM581 polypeptide is a small heat shock protein.

[00263] CM581 polypeptides of the present disclosure are thermophilic and thermostable. In some aspects, a CM581 polypeptide of the present disclosure has peak enzymatic hydrolysis of cellulose and/or hemicellulose-enhancing activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, a CM581 polypeptide of the present disclosure retains at least 60% of its initial rate of enzymatic hydrolysis of cellulose and/or hemicellulose- enhancing activity for a period of at least 24 hours when incubated at a temperature of about 55, 60, 65, 70, 75, 80, 85, or 90 °C.

[00264] Polypeptides with protein "tags "

[00265] Polypeptides of the disclosure further include any of the recombinant polypeptides disclosed herein with a polypeptide "tag." Polypeptide tags are polypeptides that may be attached to a protein of interest through gene cloning, and may be used to facilitate the purification, increase the solubility, and /or increase the stability of the "tagged" protein. Protein tags are well known in the art and include, without limitation, poly-histidine (e.g. 6 consecutive His-residues), glutathione S-transferase (GST), T7, FLAG, hemagglutinin (HA), MYC and maltose-binding protein (MBP) tags.

[00266] Production of Polypeptides

[00267] The polypeptides can be expressed in and purified from their native host,

Caldicellulosiruptor bescii. Polypeptides may also be expressed in and purified from transgenic expression systems. Transgenic expression systems can be prokaryotic or eukaryotic.

Transgenic host cells may include yeast and E. coli. Transgenic host cells may secrete the polypeptide out of the host cell. In certain embodiments, the isolated or recombinant polypeptide lacks a signal sequence. Methods for the production of recombinant polypeptides are further discussed infra.

[00268] Nucleic Acids of the Disclosure

[00269] The present disclosure further provides recombinant nucleic acids that encode any of the polypeptides disclosed herein. Nucleic acids that encode a polypeptide are also referred to herein as "genes". Methods for determining the relationship between a polypeptide and a nucleic acid that encodes the polypeptide are well known to one of skill in the art. Similarly, methods of determining the polypeptide sequence encoded by a polynucleotide sequence are well known to one of skill in the art. Due to codon degeneracy, multiple different nucleic acid sequences may encode the same polypeptide sequence.

[00270] As used herein, the terms, "nucleic acid" "polynucleotide", and variations thereof are generic to polydeoxyribonucleotides (containing 2-deoxy-D-ribose), to polyribonucleotides (containing D-ribose), to any other type of polynucleotide that is an N-glycoside of a purine or pyrimidine base, and to other polymers containing non-nucleotidic backbones, provided that the polymers contain nucleobases in a configuration that allows for base pairing and base stacking, as found in DNA and RNA. Thus, these terms include known types of nucleic acid sequence modifications, for example, substitution of one or more of the naturally occurring nucleotides with an analog, and inter-nucleotide modifications. As used herein, the symbols for nucleotides and polynucleotides are those recommended by the IUPAC-IUB Commission of Biochemical Nomenclature.

[00271] As used herein, more than one "nucleic acid" or "polynucleotide" may be present in a single contiguous polydeoxyribonucleotide chain / strand of DNA. Thus, a single strand of DNA (such as in a plasmid) may contain more than one "nucleic acid" or "polynucleotide", and thus, may contain sequences encoding more than one different polypeptide.

[00272] The nucleic acids may be synthesized, isolated, or manipulated using standard molecular biology techniques such as those described in Sambrook, J. et al. 2000. Molecular Cloning: A Laboratory Manual (Third Edition). Techniques may include cloning, expression of cDNA libraries, and amplification of mRNA or genomic DNA.

[00273] The nucleic acids of the present disclosure, or subsequences thereof, may be incorporated into a cloning vehicle comprising an expression cassette or vector. The cloning vehicle can be a viral vector, a plasmid, a phage, a phagemid, a cosmid, a fosmid, a

bacteriophage, or an artificial chromosome. The viral vector can comprise an adenovirus vector, a retroviral vector, or an adeno-associated viral vector. The cloning vehicle can comprise a bacterial artificial chromosome (BAC), a plasmid, a bacteriophage PI -derived vector (PAC), a yeast artificial chromosome (YAC), or a mammalian artificial chromosome (MAC).

[00274] The nucleic acids may be operably linked to a promoter. The promoter can be a viral, bacterial, mammalian or plant promoter. The promoter can be a constitutive promoter, an inducible promoter, a tissue- specific promoter, or an environmentally regulated or a

developmentally regulated promoter.

[00275] Nucleic Acids That Encode Cellulases

[00276] Cbl952 polynucleotides [00277] The present disclosure includes recombinant polynucleotides that encode a CM952 polypeptide of the disclosure. In some aspects, the disclosure includes recombinant

polynucleotides that encode a polypeptide of SEQ ID NOs: 44, 114, 124, 126, 128, or 46.

[00278] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46.

Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46.

[00279] In some aspects, the disclosure includes the recombinant polynucleotides of SEQ ID NOs: 45, 115, 125, 127, 129, and 47. The polynucleotide of SEQ ID NO: 45 encodes the polypeptide of SEQ ID NO: 44. The polynucleotide of SEQ ID NO: 115 encodes the

polypeptide of SEQ ID NO: 45. The polynucleotide of SEQ ID NO: 47 encodes the polypeptide of SEQ ID NO: 46. The polynucleotide of SEQ ID NO: 125 encodes the polypeptide of SEQ ID NO: 124. The polynucleotide of SEQ ID NO: 127 encodes the polypeptide of SEQ ID NO: 126. The polynucleotide of SEQ ID NO: 129 encodes the polypeptide of SEQ ID NO: 128.

[00280] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to any of the sequences of SEQ ID NOs: 45, 115, 125, 127, 129, and 47, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of any of the sequences of SEQ ID NOs: 45, 115, 125, 127, 129, and 47, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46.

[00281] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode CM952 polypeptides disclosed herein.

[00282] Cbl953 polynucleotides

[00283] The present disclosure includes recombinant polynucleotides that encode a CM953 polypeptide of the disclosure. In some aspects, the disclosure includes recombinant

polynucleotides that encode a polypeptide of SEQ ID NOs: 60, 61, or 111.

[00284] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 60, 61, and 111. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 60, 61, and 111.

[00285] In some aspects, the disclosure includes the recombinant polynucleotides of SEQ ID NOs: 62, 63, or 110. The polynucleotide of SEQ ID NO: 62 encodes the polypeptide of SEQ ID NO: 60. The polynucleotide of SEQ ID NO: 63 encodes the polypeptide of SEQ ID NO: 61. The polynucleotide of SEQ ID NO: 110 encodes the polypeptide of SEQ ID NO: 111. [00286] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to any of the sequences of SEQ ID NOs: 62, 63, and 110, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 60, 61, and 111. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of any of the sequences of SEQ ID NOs: 62, 63, or 110, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 60, 61, and 111.

[00287] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode CM953 polypeptides disclosed herein.

[00288] CM954 polynucleotides

[00289] The present disclosure includes recombinant polynucleotides that encode a CM954 polypeptide of the disclosure. In some aspects, the disclosure includes recombinant

polynucleotides that encode a polypeptide of SEQ ID NOs: 74, 121, or 76.

[00290] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 74, 121, and 76. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 74, 121, and 76.

[00291] In some aspects, the disclosure includes the recombinant polynucleotides of SEQ ID NOs: 116, 75, or 77. The polynucleotide of SEQ ID NO: 116 encodes the polypeptide of SEQ ID NO: 74. The polynucleotide of SEQ ID NO: 75 encodes the polypeptide of SEQ ID NO: 121. The polynucleotide of SEQ ID NO: 77 encodes the polypeptide of SEQ ID NO: 76.

[00292] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to any of the sequences of SEQ ID NOs: 116, 75, and 77, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 74, 121, and 76. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of any of the sequences of SEQ ID NOs: 116, 75, and 77, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 74, 121, and 76.

[00293] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode CM954 polypeptides disclosed herein.

[00294] CM946 polynucleotides

[00295] The present disclosure includes recombinant polynucleotides that encode a CM946 polypeptide of the disclosure. In some aspects, the disclosure includes recombinant

polynucleotides that encode a polypeptide of SEQ ID NOs: 86, 87, or 113. [00296] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 86, 87, and 113. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 86, 87, and 113.

[00297] In some aspects, the disclosure includes the recombinant polynucleotides of SEQ ID NOs: 88, 89, or 112. The polynucleotide of SEQ ID NO: 88 encodes the polypeptide of SEQ ID NO: 86. The polynucleotide of SEQ ID NO: 89 encodes the polypeptide of SEQ ID NO: 87. The polynucleotide of SEQ ID NO: 112 encodes the polypeptide of SEQ ID NO: 113.

[00298] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to any of the sequences of SEQ ID NOs: 88, 89, and 112, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 86, 87, and 113. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of any of the sequences of SEQ ID NOs: 88, 89, or 112, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 86, 87, and 113. [00299] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode CM946 polypeptides disclosed herein.

[00300] Cb629 polynucleotides

[00301] The present disclosure includes recombinant polynucleotides that encode a Cb629 polypeptide of the disclosure. In some aspects, the disclosure includes recombinant

polynucleotides that encode a polypeptide of SEQ ID NOs: 98, 119, or 100.

[00302] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 98, 119, and 100. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 98, 119, and 100.

[00303] In some aspects, the disclosure includes the recombinant polynucleotides of SEQ ID NOs: 99, 120, or 101. The polynucleotide of SEQ ID NO: 99 encodes the polypeptide of SEQ ID NO: 98. The polynucleotide of SEQ ID NO: 120 encodes the polypeptide of SEQ ID NO: 119. The polynucleotide of SEQ ID NO: 101 encodes the polypeptide of SEQ ID NO: 100.

[00304] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to any of the sequences of SEQ ID NOs: 99, 120, or 101, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 98, 119, and 100. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of any of the sequences of SEQ ID NOs: 99, 120, or 101, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 98, 119, and 100.

[00305] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode Cb629 polypeptides disclosed herein.

[00306] Cb486 polynucleotides

[00307] The present disclosure includes recombinant polynucleotides that encode a Cb486 polypeptide of the disclosure. In some aspects, the disclosure includes recombinant

polynucleotides that encode the polypeptide of SEQ ID NO: 106.

[00308] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of SEQ ID NO: 106. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has cellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 106.

[00309] In some aspects, the disclosure includes the recombinant polynucleotide of SEQ ID NO: 107. The polynucleotide of SEQ ID NO: 107 encodes the polypeptide of SEQ ID NO: 106.

[00310] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of SEQ ID NO: 107, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 106. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of the sequence of SEQ ID NO: 107, and that encode a polypeptide that has cellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 106.

[00311] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode Cb486 polypeptides disclosed herein.

[00312] Nucleic Acids that Encode Hemicellulases

[00313] The present disclosure provides nucleotide sequences encoding the hemicellulose- degrading enzymes Cbl93 (SEQ ID NO: 4), Cbl95 (SEQ ID NO: 8), CM 172 (SEQ ID NO: 14), Cb909 (SEQ ID NO: 20), Cb2487 (SEQ ID NO: 28), and Cbl62 (SEQ ID NO: 34), or subsequences thereof. The disclosure also provides for nucleotide sequences having at least about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more, or complete (100%) sequence identity to the nucleic acid sequences encoding Cbl93, Cbl95, CM 172, Cb909, Cb2487, and Cbl62.

[00314] Nucleotide sequences of the present disclosure may encode polypeptides with one or more glycoside hydrolase (GH) domains. Nucleotide sequences may also encode polypeptides with one or more carbohydrate binding modules (CBM). The CBM modules may interrupt a GH domain or be located in between two GH domains. Nucleotide sequences may also encode polypeptides with an acetyl xylan esterase domain. In certain embodiments, the GH, CBM and/or acetyl xylan esterase domain sequence is conserved in nucleotide variants.

[00315] Cbl93 polynucleotides

[00316] The present disclosure includes recombinant polynucleotides that encode a Cbl93 polypeptide of the disclosure. In some aspects, the disclosure includes recombinant

polynucleotides that encode a polypeptide of SEQ ID NOs: 3 or 37.

[00317] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 3 and/or 37. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of any of the polypeptides of SEQ ID NOs: 3 and/or 37.

[00318] In some aspects, the disclosure includes the recombinant polynucleotides of SEQ ID NOs: 4 or 38. The polynucleotide of SEQ ID NO: 4 encodes the polypeptide of SEQ ID NO: 3. The polynucleotide of SEQ ID NO: 38 encodes the polypeptide of SEQ ID NO: 37.

[00319] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to any of the sequences of SEQ ID NOs: 4 and/or 38, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 3 and/or 37. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of any of the sequences of SEQ ID NOs: 4 or 38, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of any of the polypeptides of SEQ ID NOs: 3 and/or 37.

[00320] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode Cbl93 polypeptides disclosed herein.

[00321] Cbl95 polynucleotides

[00322] The present disclosure includes recombinant polynucleotides that encode a Cbl95 polypeptide of the disclosure. In some aspects, the disclosure includes a recombinant polynucleotide that encodes a polypeptide of SEQ ID NO: 7.

[00323] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 7. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 7.

[00324] In some aspects, the disclosure includes the recombinant polynucleotide of SEQ ID NO: 8. The polynucleotide of SEQ ID NO: 8 encodes the polypeptide of SEQ ID NO: 7.

[00325] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of SEQ ID NO: 8, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 7. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of the sequence of SEQ ID NO: 8, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence the polypeptide of SEQ ID NO: 7.

[00326] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode Cbl95 polypeptides disclosed herein.

[00327] Cbll72 polynucleotides

[00328] The present disclosure includes recombinant polynucleotides that encode a Cbl 172 polypeptide of the disclosure. In some aspects, the disclosure includes a recombinant polynucleotide that encodes a polypeptide of SEQ ID NO: 13.

[00329] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 13. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 13.

[00330] In some aspects, the disclosure includes the recombinant polynucleotide of SEQ ID NO: 14. The polynucleotide of SEQ ID NO: 14 encodes the polypeptide of SEQ ID NO: 13. [00331] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of SEQ ID NO: 14, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 13. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of the sequence of SEQ ID NO: 14, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence the polypeptide of SEQ ID NO: 13.

[00332] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode Cbl 172 polypeptides disclosed herein.

[00333] Cb909 polynucleotides

[00334] The present disclosure includes recombinant polynucleotides that encode a Cb909 polypeptide of the disclosure. In some aspects, the disclosure includes a recombinant polynucleotide that encodes a polypeptide of SEQ ID NO: 19.

[00335] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 19. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 19.

[00336] In some aspects, the disclosure includes the recombinant polynucleotide of SEQ ID NO: 20. The polynucleotide of SEQ ID NO: 20 encodes the polypeptide of SEQ ID NO: 19.

[00337] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of SEQ ID NO: 20, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 19. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of the sequence of SEQ ID NO: 20, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence the polypeptide of SEQ ID NO: 19.

[00338] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode Cb909 polypeptides disclosed herein.

[00339] Cb2487 polynucleotides

[00340] The present disclosure includes recombinant polynucleotides that encode a Cb2487 polypeptide of the disclosure. In some aspects, the disclosure includes a recombinant polynucleotide that encodes a polypeptide of SEQ ID NO: 27.

[00341] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 27. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 27.

[00342] In some aspects, the disclosure includes the recombinant polynucleotide of SEQ ID NO: 28. The polynucleotide of SEQ ID NO: 28 encodes the polypeptide of SEQ ID NO: 27.

[00343] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of SEQ ID NO: 28, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 27. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of the sequence of SEQ ID NO: 28, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence the polypeptide of SEQ ID NO: 27.

[00344] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode Cb2487 polypeptides disclosed herein.

[00345] Cbl62 polynucleotides

[00346] The present disclosure includes recombinant polynucleotides that encode a Cbl62 polypeptide of the disclosure. In some aspects, the disclosure includes a recombinant polynucleotide that encodes a polypeptide of SEQ ID NO: 33. [00347] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 33. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that has hemicellulase activity and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 33.

[00348] In some aspects, the disclosure includes the recombinant polynucleotide of SEQ ID NO: 34. The polynucleotide of SEQ ID NO: 34 encodes the polypeptide of SEQ ID NO: 33.

[00349] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of SEQ ID NO: 34, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 33. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of the sequence of SEQ ID NO: 34, and that encode a polypeptide that has hemicellulase activity and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence the polypeptide of SEQ ID NO: 33.

[00350] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode Cbl62 polypeptides disclosed herein. [00351] Nucleic Acids that Encode Polypeptides that Enhance Enzymatic Hydrolysis of Cellulose and/or Hemicellulose

[00352] The present disclosure includes recombinant polynucleotides that encode a CM581 polypeptide of the disclosure. In some aspects, the disclosure includes recombinant

polynucleotides that encode the polypeptide of SEQ ID NO: 146.

[00353] Polynucleotides of the disclosure include recombinant polynucleotides that encode a polypeptide that enhances enzymatic hydrolysis of cellulose and/or hemicellulose and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of SEQ ID NO: 146. Polynucleotides of the disclosure also include recombinant polynucleotides that encode a polypeptide that enhances enzymatic hydrolysis of cellulose and/or hemicellulose and that has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 80, at least 100, at least 120, at least 140, at least 160, at least 180, or at least 200 consecutive amino acids of the polypeptide of SEQ ID NO: 146.

[00354] In some aspects, the disclosure includes the recombinant polynucleotide of SEQ ID NO: 147. The polynucleotide of SEQ ID NO: 147 encodes the polypeptide of SEQ ID NO: 146.

[00355] Polynucleotides of the disclosure also include recombinant polynucleotides having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of SEQ ID NO: 147, and that encode a polypeptide that enhances enzymatic hydrolysis of cellulose and/or hemicellulose and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 146. Polynucleotides of the disclosure also include recombinant polynucleotides that have at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, at least 26, at least 28, or at least 30 consecutive nucleotides of the sequence of SEQ ID NO: 147, and that encode a polypeptide that enhances enzymatic hydrolysis of cellulose and/or hemicellulose and that has at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94% at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identity to the sequence of the polypeptide of SEQ ID NO: 146.

[00356] Polynucleotides of the disclosure further include recombinant polynucleotides that are complementary to polynucleotides that encode CM581 polypeptides disclosed herein.

[00357] Recombinant Polynucleotides Encoding Polypeptides with Protein "tags "

[00358] Further disclosed herein are recombinant polynucleotides that encode polypeptides of the disclosure with a polypeptide "tag." Polynucleotides that encode a polypeptide "tag" may be added to a polynucleotide encoding a polypeptide of the disclosure by standard molecular biology cloning techniques. (See, for example, Sambrook et al., Molecular Cloning: A

Laboratory Manual, 3rd Ed., Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (2001)).

Variants, Sequence Identity, and Sequence Similarity

[00359] Methods of alignment of sequences for comparison are well-known in the art. For example, the determination of percent sequence identity between any two sequences can be accomplished using a mathematical algorithm. Non-limiting examples of such mathematical algorithms are the algorithm of Myers and Miller (1988) CABIOS 4: 11 17; the local homology algorithm of Smith et al. (1981) Adv. Appl. Math. 2:482; the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:443 453; the search-for- similarity- method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. 85:2444 2448; the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci. USA 872264, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873 5877.

[00360] Computer implementations of these mathematical algorithms can be utilized for comparison of sequences to determine sequence identity. Such implementations include, but are not limited to: CLUSTAL in the PC/Gene program (available from Intelligenetics, Mountain View, Calif.); the ALIGN program (Version 2.0) and GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Version 8 (available from Genetics Computer Group (GCG), 575 Science Drive, Madison, Wis., USA). Alignments using these programs can be performed using the default parameters. The CLUSTAL program is well described by Higgins et al. (1988) Gene 73:237 244 (1988); Higgins et al. (1989) CABIOS 5: 151 153; Corpet et al. (1988) Nucleic Acids Res. 16: 10881 90; Huang et al. (1992) CABIOS 8: 155 65; and Pearson et al. (1994) Meth. Mol. Biol. 24:307 331. The ALIGN program is based on the algorithm of Myers and Miller (1988) supra. A PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used with the ALIGN program when comparing amino acid sequences. The BLAST programs of Altschul et al. (1990) J. Mol. Biol. 215:403 are based on the algorithm of Karlin and Altschul (1990) supra. BLAST nucleotide searches can be performed with the BLASTN program, score=100, wordlength=12, to obtain nucleotide sequences homologous to a nucleotide sequence encoding a protein of the invention. BLAST protein searches can be performed with the BLASTX program, score=50, wordlength=3, to obtain amino acid sequences homologous to a protein or polypeptide of the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST (in BLAST 2.0) can be utilized as described in Altschul et al. (1997) Nucleic Acids Res. 25:3389. Alternatively, PSI-BLAST (in BLAST 2.0) can be used to perform an iterated search that detects distant relationships between molecules. See Altschul et al. (1997) supra. When utilizing BLAST, Gapped BLAST, or PSI- BLAST, the default parameters of the respective programs (e.g., BLASTN for nucleotide sequences, BLASTX for proteins) can be used. BLAST is available, for example, from the National Center for Biotechnology Information (NCBI). Alignment may also be performed manually by inspection.

[00361] As used herein "sequence identity" refers to the percentage of residues that are identical in the same positions in the sequences being analyzed. As used herein "sequence similarity" refers to the percentage of residues that have similar biophysical / biochemical characteristics in the same positions (e.g. charge, size, hydrophobicity) in the sequences being analyzed.

[00362] The functional activity of enzyme variants can be evaluated using standard molecular biology techniques including thin layer chromatography or a reducing sugar assay. Enzymatic activity can be determined using cellulose, hemicellulose or an artificial substrate. [00363] Compositions

[00364] The present disclosure further includes compositions containing one or more recombinant polypeptides disclosed herein. In some aspects, provided herein are compositions containing two or more recombinant polypeptides disclosed herein. Compositions containing two or more recombinant polypeptides may be referred to as a "cocktail" of polypeptides and/or enzymes.

[00365] In some aspects, disclosed herein are compositions that contain one or more recombinant polypeptides disclosed herein, wherein the one or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides.

[00366] In some aspects, disclosed herein are compositions that contain two or more recombinant polypeptides disclosed herein, wherein the two or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides.

[00367] In some aspects, disclosed herein are compositions that contain three or more recombinant polypeptides disclosed herein, wherein the three or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides.

[00368] In some aspects, disclosed herein are compositions that contain four or more recombinant polypeptides disclosed herein, wherein the four or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides.

[00369] In some aspects, disclosed herein are compositions that contain five or more recombinant polypeptides disclosed herein, wherein the five or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides.

[00370] In some aspects, disclosed herein are compositions that contain six or more recombinant polypeptides disclosed herein, wherein the six or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides. [00371] In some aspects, disclosed herein are compositions that contain one or more recombinant polypeptides disclosed herein, wherein the one or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00372] In some aspects, disclosed herein are compositions that contain two or more recombinant polypeptides disclosed herein, wherein the two or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00373] In some aspects, disclosed herein are compositions that contain three or more recombinant polypeptides disclosed herein, wherein the three or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106. [00374] In some aspects, disclosed herein are compositions that contain four or more recombinant polypeptides disclosed herein, wherein the four or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00375] In some aspects, disclosed herein are compositions that contain five or more recombinant polypeptides disclosed herein, wherein the five or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00376] In some aspects, disclosed herein are compositions that contain six or more recombinant polypeptides disclosed herein, wherein the six or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106. [00377] In some aspects, disclosed herein are compositions that contain two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00378] In some aspects, disclosed herein are compositions that contain three or more recombinant polypeptides disclosed herein, wherein the three or more polypeptides are selected from the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00379] In some aspects, disclosed herein are compositions that contain four or more recombinant polypeptides disclosed herein, wherein the four or more polypeptides are selected from the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00380] In some aspects, disclosed herein are compositions that contain five or more recombinant polypeptides disclosed herein, wherein the five or more polypeptides are selected from the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00381] In some aspects, disclosed herein are compositions that contain six recombinant polypeptides disclosed herein, wherein the six polypeptides are the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00382] Any of the compositions disclosed herein containing one or more recombinant cellulases disclosed herein may further contain one or more recombinant hemicellulases.

Hemicellulases include, without limitation, endoxylanases, exoxylanases, cc- arabinofuranosidases, glucuronidases, β-xylosidases, and acetyl xylan esterases. In some aspects, hemicellulases include the polypeptides that contain the amino acid sequence of any of SEQ ID NOs: 3, 7, 13, 19, 27, 33, or 37. Any of the compositions disclosed herein containing one or more recombinant cellulases disclosed herein may further contain an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose. In one aspect, an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose contains the amino acid sequence of SEQ ID NO: 146.

[00383] The present disclosure provides for compositions including the recombinant amino acid sequence of Cbl93 alone or in combination with one or more of the recombinant amino acid sequences of Cbl95, Cbl 172, Cb909, Cb2487 and Cbl62. The present disclosure also provides for compositions including the recombinant amino acid sequence of Cbl 95 alone or in combination with one or more of the recombinant amino acid sequences of Cbl 93, Cbl 172, Cb909, Cb2487 and Cbl 62. The present disclosure also provides for compositions including the recombinant amino acid sequence of Cbl 172 alone or in combination with one or more of the recombinant amino acid sequences of Cbl93, Cbl95, Cb909, Cb2487 and Cbl62. The present disclosure also provides for compositions including the recombinant amino acid sequence of Cb909 alone or in combination with one or more of the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, Cb2487 and Cbl62. The present disclosure also provides for compositions including the recombinant amino acid sequence of Cb2487 alone or in combination with one or more of the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, Cb909 and Cbl 62. The present disclosure also provides for compositions including the recombinant amino acid sequence of Cbl 62 alone or in combination with one or more of the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, Cb2487 and Cb909.

[00384] The present disclosure also provides for compositions including two or more of the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, Cb909, Cb2487, and Cbl62. One composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, Cb909, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl95, Cbl 172, Cb909, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl 172, Cb909, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cb909, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, Cb909, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, Cb909, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl 172, Cb909, Cb2487, and Cbl62.

Another composition includes the recombinant amino acid sequences of Cbl95, Cb909, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl95, Cbl 172, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl95, Cbl 172, Cb909, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl95, Cbl 172, Cb909, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl93, Cb909, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl 172, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl 172, Cb909, and Cbl62. Another composition includes the

recombinant amino acid sequences of Cbl93, Cbl 172, Cb909, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cb909, and Cbl 62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cb909, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, and Cbl62. Another composition includes the

recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, Cbl 172, and Cb909. Another composition includes the recombinant amino acid sequences of Cb909, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl 172, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl 172, Cb909, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl 172, Cb909, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl95, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl95, Cb909, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl95, Cb909, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl95, Cbl 172, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl95, Cbl 172, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl95, Cbl 172, and Cb909. Another composition includes the recombinant amino acid sequences of Cbl93, Cb2487, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cb909, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cb909, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl 172, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl 172, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl 172, and Cb909. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, and Cb909. Another composition includes the recombinant amino acid sequences of Cbl93, Cbl95, and Cbl 172. Another composition includes the recombinant amino acid sequences of Cbl95 and Cb909. Another composition includes the recombinant amino acid sequences of Cbl93 and Cbl 95. Another composition includes the recombinant amino acid sequences of Cbl 93 and Cbl 172. Another composition includes the recombinant amino acid sequences of Cbl93 and Cb909. Another composition includes the recombinant amino acid sequences of Cbl93 and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl 93 and Cbl 62. Another composition includes the recombinant amino acid sequences of Cbl 95 and Cbl 172. Another composition includes the recombinant amino acid sequences of Cbl95 and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl 95 and Cbl62. Another composition includes the recombinant amino acid sequences of Cbl 172 and Cb909. Another composition includes the recombinant amino acid sequences of Cbl 172 and Cb2487. Another composition includes the recombinant amino acid sequences of Cbl 172 and Cbl 62. Another composition includes the recombinant amino acid sequences of Cb909 and Cb2487. Another composition includes the recombinant amino acid sequences of Cb909 and Cbl 62. Another composition includes the recombinant amino acid sequences of Cb2487 and Cbl 62.

[00385] Compositions may include a transgenic host cell comprising one or more of the amino acid sequences encoding Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62. The one or more polypeptides may be secreted from the transgenic host cell.

[00386] The present disclosure provides for compositions including the recombinant nucleotide sequence encoding Cbl 93 alone or in combination with one or more of the recombinant nucleotide sequences encoding Cbl95, Cbl 172, Cb909, Cb2487 and Cbl62. The present disclosure also provides for compositions including the recombinant nucleotide sequence encoding Cbl 95 alone or in combination with one or more of the recombinant nucleotide sequences encoding Cbl93, Cbl 172, Cb909, Cb2487 and Cbl62. The present disclosure also provides for compositions including the recombinant nucleotide sequence encoding Cbl 172 alone or in combination with one or more of the recombinant nucleotide sequences encoding Cbl93, Cbl95, Cb909, Cb2487 and Cbl62. The present disclosure also provides for

compositions including the recombinant nucleotide sequence encoding Cb909 alone or in combination with one or more of the recombinant nucleotide sequences encoding Cbl93, Cbl95, Cbl 172, Cb2487 and Cbl62. The present disclosure also provides for compositions including the recombinant nucleotide sequence encoding Cb2487 alone or in combination with one or more of the recombinant nucleotide sequences encoding Cbl93, Cbl95, Cbl 172, Cb909 and Cbl 62. The present disclosure also provides for compositions including the recombinant nucleotide sequence encoding Cbl 62 alone or in combination with one or more of the recombinant nucleotide sequences encoding Cbl93, Cbl95, Cbl 172, Cb2487 and Cb909.

[00387] The present disclosure also provides for compositions including two or more of the recombinant nucleotide sequences encoding Cbl62, Cbl93, Cbl95, Cbl 172, Cb2487 and Cb909. Compositions may include vectors comprising the nucleotide sequence encoding one or more of Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62.

[00388] Any of the compositions disclosed herein containing one or more recombinant hemicellulases disclosed herein may further contain one or more recombinant cellulases. In some aspects, cellulases include the polypeptides that contain the amino acid sequence of any of SEQ ID NOs: 44, 114, 124, 126, 128, 46, 60, 61, 111, 74, 121, 76, 86, 87, 113, 98, 119, 100, and 106. Any of the compositions disclosed herein containing one or more recombinant

hemicellulases disclosed herein may further contain an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose. In one aspect, an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose contains the amino acid sequence of SEQ ID NO: 146.

[00389] Compositions disclosed herein containing one or more recombinant polypeptides disclosed herein may contain the proteins in any form. In some aspects, the polypeptides are in a liquid solution. In some aspects, the polypeptides are lyophilized. In some aspects, additional material is included in compositions containing one or more recombinant polypeptides disclosed herein to help preserve the stability of the polypeptides. In some aspects, additional material is included in compositions containing one or more recombinant polypeptides disclosed herein to help preserve the stability of the polypeptides, wherein the additional material is additional polypeptides. In some aspects, the compositions are stable for at least six months. In some aspects, the compositions are stable for at least one year.

[00390] Host Cells

[00391] The present disclosure further provides host cells that contain a recombinant nucleic acid encoding a recombinant polypeptide of the disclosure. In some aspects, the disclosure provides host cells containing two or more recombinant nucleic acids encoding two or more recombinant polypeptides of the disclosure.

[00392] "Host cell" and "host microorganism" are used interchangeably herein to refer to a living biological cell that can be transformed via insertion of recombinant DNA or RNA. Such recombinant DNA or RNA can be in an expression vector. A host organism or cell as described herein may be a prokaryotic organism or a eukaryotic cell.

[00393] Any prokaryotic or eukaryotic host cell may be used in the present disclosure so long as it remains viable after being transformed with a sequence of nucleic acids. Preferably, the host cell is not adversely affected by the transduction of the necessary nucleic acid sequences, the subsequent expression of the proteins (e.g., transporters), or the resulting intermediates.

[00394] In some aspects, the host cell is a prokaryotic cell. Any prokaryotic cell suitable for expression of a recombinant polypeptide may be used to produce recombinant polypeptides of the present disclosure. Prokaryotic host cells of the disclosure include, without limitation, Escherichia coli, Bacillus subtilis, Corynebacterium spp., Pseudomonas spp., Proteus spp., Ralstonia spp., Streptomyces spp., Staphylococcus spp., Lactococcus spp., Zymomonas mobilis, Clostridium spp., Thermoanaerobacterium spp., Caldicellulosiruptor spp. and Klebsiella spp..

[00395] In some aspects, the host cell is a eukaryotic cell. Any eukaryotic cell suitable for expression of a recombinant polypeptide may be used to produce recombinant polypeptides of the present disclosure. Suitable eukaryotic cells include, but are not limited to, fungal, plant, insect or mammalian cells. [00396] In certain aspects, the host cell is a fungal strain. "Fungi" as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota as well as the Oomycota and all mitosporic fungi.

[00397] In certain embodiments, the fungal host is a yeast strain. "Yeast" as used herein refers to any single cell fungus that reproduces asexually by budding or division, and it includes fungi of both Ascomycota and Basidiomycota.

[00398] In certain embodiments, the fungal host is of the genus Saccharomyces,

Schizosaccharomyces, Leucosporidium, Dekkera/Brettanomyces, Zygosaccharomyces, Yarrowia, Hansenula, Kluyveromyces, Scheffersomyces (Pichia), Neurospora or Candida.

[00399] In some aspects, the host cell is a thermophilic microorganism.

[00400] The host cells of the present disclosure may be genetically modified in that recombinant nucleic acids have been introduced into the host cells, and as such the genetically modified host cells do not occur in nature. The suitable host cell is one capable of expressing one or more nucleic acid constructs encoding one or more proteins for different functions.

[00401] "Recombinant nucleic acid" or "heterologous nucleic acid" or "recombinant polynucleotide", "recombinant nucleotide" or "recombinant DNA" as used herein refers to a polymer of nucleic acids wherein at least one of the following is true: (a) the sequence of nucleic acids is foreign to (i.e., not naturally found in) a given host cell; (b) the sequence may be naturally found in a given host cell, but in an unnatural (e.g., greater than expected) amount; (c) the sequence of nucleic acids contains two or more subsequences that are not found in the same relationship to each other in nature; (d) the polynucleotide is isolated from an organism in which the polynucleotide naturally occurs; or (e) the polynucleotide is synthetically prepared. For example, regarding instance (c), a recombinant nucleic acid sequence will have two or more sequences from unrelated genes arranged to make a new functional nucleic acid. Specifically, the present disclosure describes the introduction of an expression vector into a host cell, wherein the expression vector contains a nucleic acid sequence coding for a protein that is not normally found in a host cell or contains a nucleic acid coding for a protein that is normally found in a cell but is under the control of different regulatory sequences. With reference to the host cell's genome, then, the nucleic acid sequence that codes for the protein is recombinant. As used herein, the term "recombinant polypeptide" refers to a polypeptide generated from a

"recombinant nucleic acid" or "heterologous nucleic acid" or "recombinant polynucleotide", "recombinant nucleotide" or "recombinant DNA" as described above.

[00402] In some aspects, the host cell naturally produces a protein encoded by a

polynucleotide of the disclosure. A gene encoding the desired protein may be heterologous to the host cell or the gene may be endogenous to the host cell but is operatively linked to a heterologous promoters and/or control region which results in the higher expression of the gene in the host cell.

[00403] Host cell components

[00404] In some aspects, host cells disclosed herein contain one or more recombinant nucleic acids, wherein the recombinant nucleic acids encode one or more polypeptides selected from CM952, CM953, CM954, CM946, Cb629, or Cb486 polypeptides.

[00405] In some aspects, host cells disclosed herein contain two or more recombinant nucleic acids, wherein the recombinant nucleic acids encode two or more polypeptides selected from CM952, CM953, CM954, CM946, Cb629, or Cb486 polypeptides.

[00406] In some aspects, host cells disclosed herein contain three or more recombinant nucleic acids, wherein the recombinant nucleic acids encode three or more polypeptides selected from CM952, CM953, CM954, CM946, Cb629, or Cb486 polypeptides.

[00407] In some aspects, host cells disclosed herein contain four or more recombinant nucleic acids, wherein the recombinant nucleic acids encode four or more polypeptides selected from CM952, CM953, CM954, CM946, Cb629, or Cb486 polypeptides.

[00408] In some aspects, host cells disclosed herein contain five or more recombinant nucleic acids, wherein the recombinant nucleic acids encode five or more polypeptides selected from CM952, CM953, CM954, CM946, Cb629, or Cb486 polypeptides. [00409] In some aspects, host cells disclosed herein contain six or more recombinant nucleic acids, wherein the recombinant nucleic acids encode six or more polypeptides selected from CM952, CM953, CM954, CM946, Cb629, and Cb486 polypeptides.

[00410] In some aspects, host cells disclosed herein contain one or more recombinant nucleic acids, wherein the recombinant nucleic acids encode one or more recombinant polypeptides disclosed herein, wherein the one or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00411] In some aspects, host cells disclosed herein contain two or more recombinant nucleic acids, wherein the recombinant nucleic acids encode two or more recombinant polypeptides disclosed herein, wherein the two or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00412] In some aspects, host cells disclosed herein contain three or more recombinant nucleic acids, wherein the recombinant nucleic acids encode three or more recombinant polypeptides disclosed herein, wherein the three or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00413] In some aspects, host cells disclosed herein contain four or more recombinant nucleic acids, wherein the recombinant nucleic acids encode four or more recombinant polypeptides disclosed herein, wherein the four or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00414] In some aspects, host cells disclosed herein contain five or more recombinant nucleic acids, wherein the recombinant nucleic acids encode five or more recombinant polypeptides disclosed herein, wherein the five or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00415] In some aspects, host cells disclosed herein contain six or more recombinant nucleic acids, wherein the recombinant nucleic acids encode six or more recombinant polypeptides disclosed herein, wherein the six or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106.

[00416] In some aspects, host cells disclosed herein contain two or more recombinant nucleic acids encoding two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00417] In some aspects, host cells disclosed herein contain three or more recombinant nucleic acids encoding three or more recombinant polypeptides disclosed herein, wherein the three or more polypeptides are selected from the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00418] In some aspects, host cells disclosed herein contain four or more recombinant nucleic acids encoding four or more recombinant polypeptides disclosed herein, wherein the four or more polypeptides are selected from the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00419] In some aspects, host cells disclosed herein contain five or more recombinant nucleic acids encoding five or more recombinant polypeptides disclosed herein, wherein the five or more polypeptides are selected from the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00420] In some aspects, host cells disclosed herein contain six recombinant nucleic acids encoding six recombinant polypeptides disclosed herein, wherein the six polypeptides are the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113.

[00421] In some aspects, host cells disclosed herein contain one or more recombinant nucleic acids, wherein the recombinant nucleic acids encode one or more recombinant polypeptides disclosed herein, wherein the one or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the recombinant nucleic acid encoding a CM952 polypeptide is selected from the polynucleotides of SEQ ID NOs: 45, 115, 125, 127, 129, and 47, wherein the recombinant nucleic acid encoding a CM953 polypeptide is selected from the polynucleotides of SEQ ID NOs: 62, 63, and 110, wherein the recombinant nucleic acid encoding a CM954 polypeptide is selected from the polynucleotides of SEQ ID NOs: 116, 75, and 77, wherein the recombinant nucleic acid encoding a CM946 polypeptide is selected from the polynucleotides of SEQ ID NOs: 88, 89, and 112, wherein the recombinant nucleic acid encoding a Cb629 polypeptide is selected from the polynucleotides of SEQ ID NOs: 99, 120, and 101, and wherein the recombinant nucleic acid encoding a Cb486 polypeptide is the polynucleotide of SEQ ID NO: 107.

[00422] In some aspects, host cells disclosed herein contain two or more recombinant nucleic acids, wherein the recombinant nucleic acids encode two or more recombinant polypeptides disclosed herein, wherein the two or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the recombinant nucleic acid encoding a CM952 polypeptide is selected from the polynucleotides of SEQ ID NOs: 45, 115, 125, 127, 129, and 47, wherein the recombinant nucleic acid encoding a CM953 polypeptide is selected from the polynucleotides of SEQ ID NOs: 62, 63, and 110, wherein the recombinant nucleic acid encoding a CM954 polypeptide is selected from the polynucleotides of SEQ ID NOs: 116, 75, and 77, wherein the recombinant nucleic acid encoding a CM946 polypeptide is selected from the polynucleotides of SEQ ID NOs: 88, 89, and 112, wherein the recombinant nucleic acid encoding a Cb629 polypeptide is selected from the polynucleotides of SEQ ID NOs: 99, 120, and 101, and wherein the recombinant nucleic acid encoding a Cb486 polypeptide is the polynucleotide of SEQ ID NO: 107.

[00423] In some aspects, host cells disclosed herein contain three or more recombinant nucleic acids, wherein the recombinant nucleic acids encode three or more recombinant polypeptides disclosed herein, wherein the three or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the recombinant nucleic acid encoding a CM952 polypeptide is selected from the polynucleotides of SEQ ID NOs: 45, 115, 125, 127, 129, and 47, wherein the recombinant nucleic acid encoding a CM953 polypeptide is selected from the polynucleotides of SEQ ID NOs: 62, 63, and 110, wherein the recombinant nucleic acid encoding a CM954 polypeptide is selected from the polynucleotides of SEQ ID NOs: 116, 75, and 77, wherein the recombinant nucleic acid encoding a CM946 polypeptide is selected from the polynucleotides of SEQ ID NOs: 88, 89, and 112, wherein the recombinant nucleic acid encoding a Cb629 polypeptide is selected from the polynucleotides of SEQ ID NOs: 99, 120, and 101, and wherein the recombinant nucleic acid encoding a Cb486 polypeptide is the polynucleotide of SEQ ID NO: 107.

[00424] In some aspects, host cells disclosed herein contain four or more recombinant nucleic acids, wherein the recombinant nucleic acids encode four or more recombinant polypeptides disclosed herein, wherein the four or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the recombinant nucleic acid encoding a CM952 polypeptide is selected from the polynucleotides of SEQ ID NOs: 45, 115, 125, 127, 129, and 47, wherein the recombinant nucleic acid encoding a CM953 polypeptide is selected from the polynucleotides of SEQ ID NOs: 62, 63, and 110, wherein the recombinant nucleic acid encoding a CM954 polypeptide is selected from the polynucleotides of SEQ ID NOs: 116, 75, and 77, wherein the recombinant nucleic acid encoding a CM946 polypeptide is selected from the polynucleotides of SEQ ID NOs: 88, 89, and 112, wherein the recombinant nucleic acid encoding a Cb629 polypeptide is selected from the polynucleotides of SEQ ID NOs: 99, 120, and 101, and wherein the recombinant nucleic acid encoding a Cb486 polypeptide is the polynucleotide of SEQ ID NO: 107.

[00425] In some aspects, host cells disclosed herein contain five or more recombinant nucleic acids, wherein the recombinant nucleic acids encode five or more recombinant polypeptides disclosed herein, wherein the five or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 or Cb486 polypeptides, and wherein the recombinant nucleic acid encoding a CM952 polypeptide is selected from the polynucleotides of SEQ ID NOs: 45, 115, 125, 127, 129, and 47, wherein the recombinant nucleic acid encoding a CM953 polypeptide is selected from the polynucleotides of SEQ ID NOs: 62, 63, and 110, wherein the recombinant nucleic acid encoding a CM954 polypeptide is selected from the polynucleotides of SEQ ID NOs: 116, 75, and 77, wherein the recombinant nucleic acid encoding a CM946 polypeptide is selected from the polynucleotides of SEQ ID NOs: 88, 89, and 112, wherein the recombinant nucleic acid encoding a Cb629 polypeptide is selected from the polynucleotides of SEQ ID NOs: 99, 120, and 101, and wherein the recombinant nucleic acid encoding a Cb486 polypeptide is the polynucleotide of SEQ ID NO: 107.

[00426] In some aspects, host cells disclosed herein contain six or more recombinant nucleic acids, wherein the recombinant nucleic acids encode six or more recombinant polypeptides disclosed herein, wherein the six or more recombinant polypeptides are selected from CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides, and wherein the recombinant nucleic acid encoding a CM952 polypeptide is selected from the polynucleotides of SEQ ID NOs: 45, 115, 125, 127, 129, and 47, wherein the recombinant nucleic acid encoding a CM953 polypeptide is selected from the polynucleotides of SEQ ID NOs: 62, 63, and 110, wherein the recombinant nucleic acid encoding a CM954 polypeptide is selected from the polynucleotides of SEQ ID NOs: 116, 75, and 77, wherein the recombinant nucleic acid encoding a CM946 polypeptide is selected from the polynucleotides of SEQ ID NOs: 88, 89, and 112, wherein the recombinant nucleic acid encoding a Cb629 polypeptide is selected from the polynucleotides of SEQ ID NOs: 99, 120, and 101, and wherein the recombinant nucleic acid encoding a Cb486 polypeptide is the polynucleotide of SEQ ID NO: 107.

[00427] In some aspects, host cells disclosed herein contain two or more recombinant nucleic acids encoding two or more recombinant polypeptides disclosed herein, wherein the two or more recombinant nucleic acids are selected from the polynucleotides of SEQ ID NOs: 47, 110, 77, 112, 101, and 107.

[00428] In some aspects, host cells disclosed herein contain three or more recombinant nucleic acids encoding three or more recombinant polypeptides disclosed herein, wherein the three or more recombinant nucleic acids are selected from the polynucleotides of SEQ ID NOs: 47, 110, 77, 112, 101, and 107.

[00429] In some aspects, host cells disclosed herein contain four or more recombinant nucleic acids encoding four or more recombinant polypeptides disclosed herein, wherein the four or more recombinant nucleic acids are selected from the polynucleotides of SEQ ID NOs: 47, 110, 77, 112, 101, and 107. [00430] In some aspects, host cells disclosed herein contain five or more recombinant nucleic acids encoding five or more recombinant polypeptides disclosed herein, wherein the five or more recombinant nucleic acids are selected from the polynucleotides of SEQ ID NOs: 47, 110, 77, 112, 101, and 107.

[00431] In some aspects, host cells disclosed herein contain six recombinant nucleic acids encoding six recombinant polypeptides disclosed herein, wherein the six recombinant nucleic acids are the polynucleotides of SEQ ID NOs: 47, 110, 77, 112, 101, and 107.

[00432] Any of the host cells disclosed herein containing one or more recombinant nucleic acids encoding one or more recombinant cellulases disclosed herein may further contain one or more recombinant nucleic acids encoding one or more recombinant hemicellulases. In some aspects, polynucleotides that encode hemicellulases include nucleic acids that contain the polynucleotide sequence of any of SEQ ID NOs: 4, 8, 14, 20, 28, 34, or 38. Any of the host cells disclosed herein containing one or more recombinant cellulases disclosed herein may further contain an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose. In one aspect, an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose contains the amino acid sequence of SEQ ID NO: 146.

[00433] The disclosure further provides for a transformed transgenic host cell comprising one or more of the nucleic acids encoding Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62. The transformed cell can be, without limitation, a bacterial cell, a mammalian cell, a fungal cell, a yeast cell, an insect cell, or a plant cell. In certain embodiments, the transformed cell is E. coli. In certain embodiments, the transformed cell is a thermophilic microorganism.

[00434] Any of the host cells disclosed herein containing one or more recombinant nucleic acids encoding one or more recombinant hemicellulases disclosed herein may further contain one or more recombinant nucleic acids encoding one or more recombinant cellulases. In some aspects, polynucleotides that encode cellulases include nucleic acids that contain the

polynucleotide sequence of any of SEQ ID NOs: 44, 114, 124, 126, 128, 46, 60, 61, 111, 74, 121, 76, 86, 87, 113, 98, 119, 100, and 106. Any of the host cells disclosed herein containing one or more recombinant hemicellulases disclosed herein may further contain an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose. In one aspect, an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose contains the amino acid sequence of SEQ ID NO: 146.

[00435] Methods of Producing and Culturing Host Cells of the Disclosure

[00436] Methods of producing and culturing host cells of the disclosure may include the introduction or transfer of expression vectors containing the recombinant nucleic acids of the disclosure into the host cell. Such methods for transferring expression vectors into host cells are well known to those of ordinary skill in the art. For example, one method for transforming cells with an expression vector involves a calcium chloride treatment wherein the expression vector is introduced via a calcium precipitate. Other salts, e.g., calcium phosphate, may also be used following a similar procedure. In addition, electroporation (i.e., the application of current to increase the permeability of cells to nucleic acid sequences) may be used to transfect the host cell. Cells also may be transformed through the use of spheroplasts (Schweizer, M, Proc. Natl. Acad. Sci., 78: 5086-5090 (1981)). Also, microinjection of the nucleic acid sequences provides the ability to transfect host cells. Other means, such as lipid complexes, liposomes, and dendrimers, may also be employed. Those of ordinary skill in the art can transfect a host cell with a desired sequence using these or other methods.

[00437] In some cases, cells are prepared as protoplasts or spheroplasts prior to

transformation. Protoplasts or spheroplasts may be prepared, for example, by treating a cell having a cell wall with enzymes to degrade the cell wall. Fungal cells may be treated, for example, with zymolyase or chitinase.

[00438] The vector may be an autonomously replicating vector, i.e., a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome. The vector may contain any means for assuring self -replication. Alternatively, the vector may be one which, when introduced into the host, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. Furthermore, a single vector or plasmid or two or more vectors or plasmids which together contain the total DNA to be introduced into the genome of the host, or a transposon may be used.

[00439] The vectors preferably contain one or more selectable markers which permit easy selection of transformed hosts. A selectable marker is a gene the product of which provides, for example, biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like. Selection of bacterial cells may be based upon antimicrobial resistance that has been conferred by genes such as the amp, gpt, neo, tet, camR and hyg genes.

[00440] Selectable markers for use in fungal host cells include, but are not limited to, amdS (acetamidase), argB (ornithine carbamoyltransferase), bar (phosphinothricin acetyltransferase), hph (hygromycin phosphotransferase), niaD (nitrate reductase), pyrG (orotidine-5'-phosphate decarboxylase), sC (sulfate adenyltransferase), and trpC (anthranilate synthase), as well as equivalents thereof. Suitable markers for S. cerevisiae hosts include, for example, ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3.

[00441] The vectors may contain an element(s) that permits integration of the vector into the host's genome or autonomous replication of the vector in the cell independent of the genome.

[00442] For integration into the host genome, the vector may rely on the gene's sequence or any other element of the vector for integration of the vector into the genome by homologous or nonhomologous recombination. Alternatively, the vector may contain additional nucleotide sequences for directing integration by homologous recombination into the genome of the host. The additional nucleotide sequences enable the vector to be integrated into the host genome at a precise location(s) in the chromosome(s). To increase the likelihood of integration at a precise location, the integrational elements should contain a sufficient number of nucleic acids, such as 100 to 10,000 base pairs, 400 to 10,000 base pairs, or 800 to 10,000 base pairs, which are highly homologous with the corresponding target sequence to enhance the probability of homologous recombination. The integrational elements may be any sequence that is homologous with the target sequence in the genome of the host. Furthermore, the integrational elements may be non- encoding or encoding nucleotide sequences. On the other hand, the vector may be integrated into the genome of the host by non-homologous recombination. [00443] For autonomous replication, the vector may further contain an origin of replication enabling the vector to replicate autonomously in the host in question. The origin of replication may be any plasmid replicator mediating autonomous replication which functions in a cell. The term "origin of replication" or "plasmid replicator" is defined herein as a sequence that enables a plasmid or vector to replicate in vivo.

[00444] The vector may further contain a promoter for regulation of expression of a recombinant nucleic acid in the vector. Promoters for the regulation of expression of a gene are well-known in the art, and include constitutive promoters, and inducible promoters. Promoters are described, for example, in Sambrook, et al. Molecular Cloning: A Laboratory Manual, 3 rd edition, Cold Spring Harbor Laboratory Press, (2001). In some aspects, vectors for use in Saccharomyces spp. may include the TDH1 or PGK1 promoter, which are strong and

constitutive promoters.

[00445] More than one copy of a gene may be inserted into the host to increase production of the gene product. An increase in the copy number of the gene can be obtained by integrating at least one additional copy of the gene into the host genome or by including an amplifiable selectable marker gene with the nucleotide sequence where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the gene, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.

[00446] The procedures used to ligate the elements described above to construct the recombinant expression vectors of the present invention are well known to one skilled in the art (see, e.g., Sambrook et al., 2001, supra).

[00447] Once the host cell has been transformed with the expression vector, the host cell is allowed to grow. Growth of a host cell in a medium may involve the process of fermentation. Methods of the disclosure may include culturing the host cell such that recombinant nucleic acids in the cell are expressed. Media, temperature ranges and other conditions suitable for growth are known in the art.

[00448] Expression of Recombinant Polypeptides of the Disclosure [00449] The disclosure further provides for the expression of polypeptides of the disclosure. Polypeptides of the disclosure may be prepared by standard molecular biology techniques such as those described herein and in Sambrook, et al. Molecular Cloning: A Laboratory Manual, 3 rd edition, Cold Spring Harbor Laboratory Press, (2001). Recombinant polypeptides may be expressed in and purified from transgenic expression systems. Transgenic expression systems can be prokaryotic or eukaryotic. In some aspects, transgenic host cells may secrete the polypeptide out of the host cell. In some aspects, transgenic host cells may retain the expressed polypeptide in the host cell.

[00450] In certain aspects, recombinant polypeptides of the disclosure are partially or substantially isolated from a host cell, or from the growth media of the host cell. In certain aspects, a recombinant polypeptide of the disclosure is prepared with a protein "tag" to facilitate protein purification, such as a GST-tag or poly-His tag. In some aspects, a recombinant polypeptide of the disclosure is prepared with a signal sequence to direct the export of the polypeptide out of the cell. In some aspects, recombinant polypeptides may be only partially purified (e.g. <80 pure, <70 pure, <60 pure, <50 pure, <40 pure, <30 pure, <20 pure, <10 pure, <5 pure). In some aspects, recombinant polypeptides of the present disclosure may be purified to a high degree of purity (e.g. >99 pure, >98 pure, >95 pure, >90 pure, etc.). Recombinant polypeptides may be purified through a variety of techniques known to those of skill in the art, including for example, ion-exchange chromatography, size exclusion chromatography, and affinity chromatography.

[00451] In one aspect, a method for producing any of the recombinant polypeptides disclosed herein (including cellulases, hemicellulases, and enzymes that enhances enzymatic hydrolysis of cellulose and/or hemicellulose) includes the steps of: A) culturing a host cell containing one or more recombinant nucleic acids encoding the one or more recombinant polypeptides disclosed herein in media under conditions necessary to support the expression of the recombinant nucleic acid(s), and collecting the one or more polypeptides from the media and/or host cell.

[00452] In one aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing one or more recombinant nucleic acids encoding one or more recombinant polypeptides disclosed herein, wherein the one or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, Cbl946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acid(s), and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing two or more recombinant nucleic acids encoding two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing three or more recombinant nucleic acids encoding three or more recombinant polypeptides disclosed herein, wherein the three or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, C 946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing four or more recombinant nucleic acids encoding four or more recombinant polypeptides disclosed herein, wherein the four or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing five or more recombinant nucleic acids encoding five or more recombinant polypeptides disclosed herein, wherein the five or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing six or more recombinant nucleic acids encoding six or more recombinant polypeptides disclosed herein, wherein the six or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, Cbl946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell.

[00453] In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing one or more recombinant nucleic acids encoding one or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acid(s), and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing two or more recombinant nucleic acids encoding two or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing three or more recombinant nucleic acids encoding three or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing four or more recombinant nucleic acids encoding four or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing five or more recombinant nucleic acids encoding five or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell. In another aspect, a method for producing cellulases includes the steps of: A) culturing a host cell containing six or more recombinant nucleic acids encoding six of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and collecting the one or more polypeptides from the media and/or host cell.

Thermostability of Enzymes

[00454] The enzymes of the present disclosure are thermophilic and thermostable. As used herein, "thermophilic" refers to the characteristic of an enzyme to have peak activity at a high temperature (e.g. above 50° C). As used herein, "thermostable" refers to the characteristic of an enzyme to retain activity at high temperatures (e.g. above 50° C) for a significant period of time. For CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides of the disclosure, "enzymatic" activity refers to cellulase activity (including β-glucosidase activity). For Cbl93, Cbl95, CM 172, Cb2487, Cb909, and Cbl62 polypeptides of the disclosure, "enzymatic" activity refers to hemicellulase activity. For CM581 polypeptides of the disclosure, "enzymatic" activity refers to activity that enhances enzymatic hydrolysis of cellulose and/or hemicellulose.

[00455] Cellulases

[00456] In certain aspects, one or more of the CM952, CM953, CM954, CM946, Cb629 and Cb486 polypeptides of the disclosure has a peak rate of enzymatic activity on cellulose or cellulose-containing material at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail has a peak rate of enzymatic activity on cellulose or a cellulose-containing material at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 has a peak rate of enzymatic activity on cellulose or a cellulose-containing material at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. [00457] In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00458] In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00459] In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00460] In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the cocktail retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00461] In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C. [00462] In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00463] In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00464] In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In another aspect, an enzyme cocktail is provided herein, wherein the cocktail contains the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00465] Hemicellulases [00466] In certain embodiments, one or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 has peak rate of enzymatic activity at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In other embodiments, an enzyme 'cocktail' that contains two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 has peak rate of enzymatic activity on hemicellulose or a hemicellulose-derived substrate at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In one embodiment, an enzyme 'cocktail' that contains all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 has peak rate of enzymatic activity on hemicellulose or a hemicellulose-derived substrate at a temperature of about 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C.

[00467] Enzymes of the present disclosure retain significant enzymatic activity for significant periods of time at high temperatures. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00468] In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00469] In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00470] In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In one embodiment, an enzyme cocktail containing two or more of the enzymes Cbl 93, Cbl 95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00471] In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, CM 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, CM 172, Cb2487, Cb909, and Cbl62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, CM 172, Cb2487, Cb909, and CM62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes CM93, CM95, Cbl 172, Cb2487, Cb909, and CM62 retains at least 90% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00472] In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl 93, CM95, Cbl 172, Cb2487, Cb909, and CM62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes CM93, CM95, Cbl 172, Cb2487, Cb909, and CM62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes CM93, CM95, Cbl 172, Cb2487, Cb909, and CM62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes CM93, CM95, Cbl 172, Cb2487, Cb909, and CM62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes CM93, CM95, Cbl 172, Cb2487, Cb909, and CM62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes CM93, CM95, Cbl 172, Cb2487, Cb909, and CM62 retains at least 75% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C. [00473] In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl 62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 50% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00474] In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl 93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 55 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 60 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl 62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 65 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 70 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 75 °C. In one embodiment, an enzyme cocktail containing all six of the enzymes Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62 retains at least 25% of its initial rate of enzymatic activity for a period of at least 24 hours when incubated at a temperature of about 80 °C.

[00475] Any of the cellulase cocktails disclosed herein having the thermophilic and thermostable characteristics disclosed herein may further include any of the hemicellulases disclosed herein and / or an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose disclosed herein. Additionally, any of the hemicellulase cocktails disclosed herein having the thermophilic and thermostable characteristics disclosed herein may further include any of the cellulases disclosed herein and / or an enzyme that enhances enzymatic hydrolysis of cellulose and/or hemicellulose disclosed herein.

[00476] Applications

[00477] Methods of Degrading Cellulose-Containing Material

[00478] In one aspect, provided herein are methods for degrading cellulose-containing material.

[00479] In one aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing one or more recombinant polypeptides disclosed herein, wherein the one or more polypeptides are selected from: Cbl 952 polypeptides, Cbl 953 polypeptides, Cbl 954 polypeptides, Cbl 946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the polypeptide(s) and cellulose- containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: Cbl 952 polypeptides, Cbl 953 polypeptides, Cbl 954 polypeptides, Cbl 946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose- containing material includes contacting a cellulose-containing material with a composition containing three or more recombinant polypeptides disclosed herein, wherein the three or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing four or more recombinant polypeptides disclosed herein, wherein the four or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the polypeptides and cellulose- containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing five or more recombinant polypeptides disclosed herein, wherein the five or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose- containing material includes contacting a cellulose-containing material with a composition containing six or more recombinant polypeptides disclosed herein, wherein the six or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation.

[00480] In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing one or more recombinant polypeptides disclosed herein, wherein the one or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106, and incubating the polypeptide(s) and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing three or more recombinant polypeptides disclosed herein, wherein the three or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing four or more recombinant polypeptides disclosed herein, wherein the four or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, Cbl954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing five or more recombinant polypeptides disclosed herein, wherein the five or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing six or more recombinant polypeptides disclosed herein, wherein the six or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and wherein the CM952 polypeptide is selected from the polypeptides of SEQ ID NOs: 44, 114, 124, 126, 128, and 46, wherein the CM953 polypeptide is selected from the polypeptides of SEQ ID NOs: 60, 61, and 111, wherein the CM954 polypeptide is selected from the polypeptides of SEQ ID NOs: 74, 121, and 76, wherein the CM946 polypeptide is selected from the polypeptides of SEQ ID NOs: 86, 87, and 113, wherein the Cb629 polypeptide is selected from the polypeptides of SEQ ID NOs: 98, 119, and 100, and wherein the Cb486 polypeptide is the polypeptide of SEQ ID NO: 106, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation.

[00481] In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing one or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the polypeptide(s) and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing two or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing three or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing four or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing five or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing six of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation.

[00482] In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing one or more recombinant nucleic acids encoding one or more recombinant polypeptides disclosed herein, wherein the one or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acid(s), and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing two or more recombinant nucleic acids encoding two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing three or more recombinant nucleic acids encoding three or more recombinant polypeptides disclosed herein, wherein the three or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing four or more recombinant nucleic acids encoding four or more recombinant polypeptides disclosed herein, wherein the four or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing five or more recombinant nucleic acids encoding five or more recombinant polypeptides disclosed herein, wherein the five or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing six or more recombinant nucleic acids encoding six or more recombinant polypeptides disclosed herein, wherein the six or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation.

[00483] In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing one or more recombinant nucleic acids encoding one or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acid(s), and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing two or more recombinant nucleic acids encoding two or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose- containing material with a host cell containing three or more recombinant nucleic acids encoding three or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing four or more recombinant nucleic acids encoding four or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing five or more recombinant nucleic acids encoding five or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose- containing material with a host cell containing six or more recombinant nucleic acids encoding six of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, in media under conditions necessary to support the expression of the recombinant nucleic acids, and incubating the cell and cellulose-containing material under conditions that support cellulose degradation.

[00484] In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a composition containing one or more recombinant polypeptides disclosed herein, wherein the one or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, or Cb629 polypeptides, and wherein the composition does not contain a Cb486 polypeptide, and incubating the polypeptides and cellulose-containing material under conditions that support cellulose degradation. In another aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing one or more recombinant nucleic acids encoding one or more recombinant polypeptides disclosed herein, wherein the one or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, or Cb629 polypeptides, and wherein the host cell does not contain a recombinant nucleic acid encoding a Cb486 polypeptide, in media under conditions necessary to support the expression of the recombinant nucleic acid(s), and incubating the cell and cellulose-containing material under conditions that support cellulose degradation. Contacting a cellulose-containing material with one or more cellulases disclosed herein, but not Cb486, may lead to the accumulation of cellobiose and/or other oligosaccharides during the degradation of the cellulose-containing material. Products containing cellobiose and/or oligosaccharides may be useful as feedstocks for organisms or processes that effectively utilize cellobiose and/or oligosaccharides to generate desired end products, such as biofuels. [00485] As used herein, a "cellulose-containing material" is any material that contains cellulose, including biomass. Biomass suitable for use with the currently disclosed methods include any cellulose-containing material, and includes, without limitation, Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, rye hulls, wheat hulls, sugarcane bagasse, copra meal, copra pellets, palm kernel meal, corn fiber, Distillers Dried Grains with Solubles (DDGS), Blue Stem, corncobs, pine wood, birch wood, willow wood, aspen wood, poplar wood, energy cane, waste paper, sawdust, forestry wastes, municipal solid waste, waste paper, crop residues, other grasses, and other woods. In some aspects, biomass is lignocellulosic material.

[00486] Commonly, cellulose-containing materials also contain hemicellulose. For example, unprocessed or partially processed plant materials generally contain hemicellulose. In some aspects, any of the methods for degrading a cellulose-containing material disclosed herein may further include contacting a cellulose-containing material with one or more hemicellulases.

[00487] Any of the methods disclosed herein for degrading a cellulose-containing material that includes contacting a cellulose-containing material with a composition containing one, two, three, four, five, six or more polypeptides selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, may further include contacting the cellulose-containing material with one or more, two or more, three or more, four or more, five or more, or the six recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37. In one aspect, a method for degrading a cellulose- containing material includes contacting a cellulose-containing material with a composition containing the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 and the recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37. In one aspect, provided herein is a method for degrading a biomass-containing material, including contacting a cellulose- containing material with a composition containing the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 and the recombinant polypeptides of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37, and incubating the polypeptides and biomass-containing material under conditions that support cellulose degradation. [00488] Any of the methods disclosed herein for degrading a cellulose-containing material that includes contacting a cellulose-containing material with a host cell containing one, two, three, four, five, or six recombinant nucleic acids encoding one, two, three, four, five, or six recombinant polypeptides disclosed herein, wherein the one, two, three, four, five, six or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954

polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, in media under conditions necessary to support the expression of the recombinant nucleic acids, may further include contacting the cellulose-containing material with one or more, two or more, three or more, four or more, five or more, or six or more recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37. In one aspect, a method for degrading a cellulose-containing material includes contacting a cellulose-containing material with a host cell containing recombinant nucleic acids encoding the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 and the recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37. In one aspect, provided herein is a method for degrading a biomass-containing material, including contacting a cellulose-containing material with a host cell containing recombinant nucleic acids encoding the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 and the recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37, and incubating the host cell and biomass- containing material under conditions that support cellulose degradation.

[00489] In some aspects, any of the methods disclosed herein for degrading a cellulose- containing material may be carried out at a high temperature. In some aspects, any of the methods disclosed herein for degrading a cellulose-containing material may be carried out at about 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, any of the methods disclosed herein for degrading a cellulose-containing material may be carried out for at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 18, 20, 22, or 24 hours at about 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. If a method disclosed herein for degrading a cellulose containing material is carried out at a high temperature and it uses host cells expressing recombinant polypeptides disclosed herein, in some aspects, the host cell is a thermophilic organism. [00490] Methods of Reducing Viscosity of Pretreated Biomass Mixtures

[00491] Further provided herein are methods for reducing the viscosity of pre-treated biomass.

[00492] Biomass that that is used for degradation into component sugars or oligosaccharides may contains high levels of lignin, which can block hydrolysis of the cellulosic component of the biomass. Typically, biomass is pretreated with, for example, high temperature and/or high pressure to increase the accessibility of the cellulosic component to hydrolysis. Other pretreatments include, without limitation, ammonia fiber expansion (AFEX), steam explosion, and treatment with alkaline aqueous solutions, acidic solutions, organic solvents, ionic liquids (IL), electrolyzed water, phosphoric acid, or combinations thereof. However, pretreatment generally results in a biomass mixture that is highly viscous. The high viscosity of the pretreated biomass mixture can increase the difficulty in handling the pretreated biomass, and it can also interfere with effective hydrolysis of the pretreated biomass. Advantageously, recombinant polypeptides disclosed herein can be used to reduce the viscosity of pretreated biomass mixtures prior to further degradation of the biomass.

[00493] Accordingly, certain aspects of the present disclosure relate to methods of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a composition containing one or more recombinant polypeptides disclosed herein, wherein the one or more polypeptides are selected from one or more of the cellulases, hemicellulases, and polypeptides that enhance enzymatic hydrolysis of cellulose and/or hemicellulose, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture.

[00494] Certain aspects of the present disclosure relate to methods of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a composition containing one or more recombinant polypeptides disclosed herein, wherein the one or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a composition containing two or more recombinant polypeptides disclosed herein, wherein the two or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a

composition containing three or more recombinant polypeptides disclosed herein, wherein the three or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, C 946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a composition containing four or more recombinant polypeptides disclosed herein, wherein the four or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a

composition containing five or more recombinant polypeptides disclosed herein, wherein the five or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a composition containing six or more recombinant polypeptides disclosed herein, wherein the six or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture.

[00495] In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a composition containing one or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a

composition containing two or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a composition containing three or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a composition containing four or more of the

recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a composition containing five or more of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. In another aspect, the disclosure includes a method of reducing the viscosity of a pretreated biomass mixture, by contacting a pretreated biomass mixture having an initial viscosity with a

composition containing six of the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and incubating the contacted biomass mixture under conditions sufficient to reduce the initial viscosity of the pretreated biomass mixture. [00496] In some aspects, the disclosed methods are carried out as part of a pretreatment process. The pretreatment process may include the additional step of adding a composition containing one, two, three, four, five, six or more recombinant polypeptides disclosed herein, wherein the one, two, three, four, five, six or more polypeptides are selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, to pretreated biomass mixtures after a step of pretreating the biomass, and incubating the pretreated biomass with the polypeptides under conditions sufficient to reduce the viscosity of the mixture. The polypeptides or compositions may be added to pretreated biomass mixture while the temperature of the mixture is high, or after the temperature of the mixture has decreased. In some aspects, the methods are carried out in the same vessel or container where the pretreatment was performed. In other aspects, the methods are carried out in a separate vessel or container where the pretreatment was performed.

[00497] In some aspects, the methods are carried out in the presence of high salt, such as solutions containing saturating concentrations of salts, solutions containing sodium chloride (NaCl) at a concentration of at least at or about 0.1 M, 0.2 M, 0.3 M, 0.4 M, 0.5 M, 1 M, 1.5 M, 2 M, 2.5 M, 3 M, 3.5 M, or 4 M sodium chloride, or potassium chloride (KCl), at a concentration at or about 0.1 M, 0.2 M, 0.3 M, 0.4 M, 0.5 M, 1 M, 1.5 M, 2 M, 2.5 M 3.0 M or 3.2 M KCl and/or ionic liquids, such as 1,3-dimethylimidazolium dimethyl phosphate ([DMIM]DMP) or [EMIM]OAc, or in the presence of one or more detergents, such as ionic detergents (e.g. , SDS, CHAPS), sulfydryl reagents, such as in saturating ammonium sulfate or ammonium sulfate between at or about 0 and 1 M. In some aspects, the methods are carried out at a temperature of about 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, or 90 °C. In some aspects, the methods are carried out over a broad temperature range, such as between at or about 20°C and 50°C, 25°C and 55°C, 30°C and 60°C, 40°C and 80°C, 60°C and 80°C, or 60°C andlOO°C. In some aspects, the methods may be performed over a broad pH range, for example, at a pH of between about 4.5 and 8.75, at a pH of greater than 7 or at a pH of 8.5, or at a pH of at least 5.0, 5.5, 6.0, 6.5, 7.0, 7.5, 8.0, or 8.5.

[00498] Any of the methods disclosed herein for reducing the viscosity of a pretreated biomass mixture that includes contacting a pretreated biomass mixture with a composition containing one, two, three, four, five, six or more polypeptides selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, may further include contacting the pretreated biomass mixture with one or more, two or more, three or more, four or more, five or more, or six recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37. In one aspect, a method for reducing the viscosity of a pretreated biomass mixture includes contacting a pre-treated biomass mixture with a composition containing the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 and the recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37. In one aspect, provided herein is a method for reducing the viscosity of a pretreated biomass mixture, including contacting a pre-treated biomass mixture with a composition containing the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 and the recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37, and incubating the polypeptides and reducing the viscosity of a pretreated biomass mixture to reduce the viscosity of the pretreated biomass mixture.

[00499] Methods of Converting Cellulose-Containing Materials to Fermentation Product

[00500] Further provided herein are methods for converting cellulose-containing materials to a fermentation production. In one aspect, a method for converting a cellulose-containing material into a fermentation product includes the steps of: A) contacting a cellulose-containing material with a composition containing one, two, three, four, five, six or more polypeptides selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides; B) incubating the cellulose- containing material with the composition containing one, two, three, four, five, six or more polypeptides under conditions that support cellulose degradation, in order to obtain sugars; and C) culturing the sugars with a fermentative microorganism under conditions sufficient to produce a fermentation product.

[00501] In another aspect, a method for converting a cellulose-containing material into a fermentation product includes the steps of: A) contacting a cellulose-containing material with a composition containing the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 ; B) incubating the cellulose-containing material with the composition containing the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, under conditions that support cellulose degradation, in order to obtain sugars; and C) culturing the sugars with a fermentative microorganism under conditions sufficient to produce a fermentation product.

[00502] Any of the methods disclosed herein for converting a cellulose-containing material into a fermentation product may further include contacting the pretreated biomass mixture with one or more, two or more, three or more, four or more, five or more, or six recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37. In one aspect, a method for converting a cellulose-containing material into a fermentation product includes contacting a cellulose- containing material with a composition containing the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and the recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37. In one aspect, provided herein is a method for converting a cellulose- containing material into a fermentation product including the steps of: A) contacting a cellulose- containing material with a composition containing the recombinant polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113 and the recombinant polypeptides of SEQ ID NOs: 7, 13, 19, 27, 33, and 37; B) incubating the cellulose-containing material with the composition containing the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, 113, 7, 13, 19, 27, 33, and 37 under conditions that support cellulose degradation, in order to obtain sugars; and C) culturing the sugars with a fermentative microorganism under conditions sufficient to produce a fermentation product.

[00503] Sugars that may be obtained from the degradation of cellulose-containing materials include, without limitation, glucose, cellobiose, xylose, arabinose, galactose, glucuronic acid, and mannose.

[00504] Fermentation products that may be produced from sugars obtained from the degradation of cellulose-containing materials include, without limitation, ethanol, n-propanol, n- butanol, iso-butanol, 3-methyl-l-butanol, 2-methyl-l-butanol, 3-methyl-l-pentanol, and octanol.

[00505] Fermentative organisms include, without limitation, Saccharomyces spp.

[00506] Methods of Consolidated Bioprocessing [00507] Further provided herein are methods for converting cellulose-containing materials to a fermentation production, by consolidated bioprocessing. Consolidated bioprocessing combines enzyme generation, biomass hydrolysis, and biofuel production into a single stage. In one aspect, a method for converting a cellulose-containing material into a fermentation product by consolidated bioprocessing includes the steps of: A) contacting a cellulose-containing material with a cell having recombinant nucleic acids encoding one, two, three, four, five, six or more polypeptides selected from: CM952 polypeptides, CM953 polypeptides, CM954 polypeptides, CM946 polypeptides, Cb629 polypeptides, or Cb486 polypeptides, and one or more recombinant nucleic acids encoding one or more polypeptides involved in a biochemical pathway for the production of a biofuel, under conditions sufficient to support expression of the nucleic acids; B) incubating the cellulose-containing material with the cell expressing recombinant nucleic acids under conditions that support cellulose degradation and fermentation, in order to produce a fermentation product.

[00508] In another aspect, a method for converting a cellulose-containing material into a fermentation product by consolidated bioprocessing includes the steps of: A) contacting a cellulose-containing material with a cell having recombinant nucleic acids encoding the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, and 113, and one or more recombinant nucleic acids encoding one or more polypeptides involved in a biochemical pathway for the production of a biofuel, under conditions sufficient to support expression of the nucleic acids; B) incubating the cellulose-containing material with the cell expressing recombinant nucleic acids under conditions that support cellulose degradation and fermentation, in order to produce a fermentation product.

[00509] In another aspect, a method for converting a cellulose-containing material into a fermentation product by consolidated bioprocessing includes the steps of: A) contacting a cellulose-containing material with a cell having recombinant nucleic acids encoding the polypeptides of SEQ ID NOs: 46, 76, 100, 106, 111, 113, 7, 13, 19, 27, 33, and 37, and one or more recombinant nucleic acids encoding a polypeptide involved in a biochemical pathway for the production of a biofuel under conditions sufficient to support expression of the nucleic acids; B) incubating the cellulose-containing material with the cell expressing recombinant nucleic acids under conditions that support cellulose degradation and fermentation, in order to produce a fermentation product.

[00510] Fermentation products that may be produced from sugars obtained from the degradation of cellulose-containing materials include, without limitation, ethanol, n-propanol, n- butanol, iso-butanol, 3-methyl-l-butanol, 2-methyl-l-butanol, 3-methyl-l-pentanol, and octanol.

[00511] Concentration of Polypeptides

[00512] In certain aspects, polypeptides of the disclosure are provided with a substrate at a concentration of at least 0.01 nM of each polypeptide. In certain aspects, the polypeptides are provided with a substrate at a concentration of at least 0.1 nM of each polypeptide. In certain aspects, the polypeptides are provided with a substrate at a concentration of at least 1 nM of each polypeptide. In certain aspects, the polypeptides are provided with a substrate at a concentration of at least 10 nM of each polypeptide. In certain aspects, the polypeptides are provided with a substrate at a concentration of at least 0.1 μΜ of each polypeptide. In certain aspects, the polypeptides are provided with a substrate at a concentration of at least 10 μΜ of each polypeptide. In certain aspects, the polypeptides are provided with a substrate at a concentration of at least 100 μΜ of each polypeptide.

[00513] Combination of Thermostable Cellulases with Thermostable Hemicellulose- Degrading Enzymes

[00514] In some aspects, thermostable cellulose-degrading enzymes of the present disclosure are provided with thermostable hemicellulases. Thermostable hemicellulases may be provided with the thermostable cellulose-degrading enzymes of the present disclosure in order to increase the degradation of materials containing both cellulose and hemicellulose, such as biomass from terrestrial plants.

[00515] In some aspects disclosed herein, mixtures of cellulases of the present disclosure exhibit surprising synergistic effects when combined with mixtures of hemicellulases. In such examples, mixtures containing multiple cellulases have greater cellulase activity when they are combined in a cocktail with a mixture containing multiple hemicellulases, as compared to when the mixture of cellulases is not combined with a mixture containing multiple hemicellulases. Also, in some examples, mixtures containing multiple hemicellulases have greater hemicellulase activity when they are combined in a cocktail with a mixture containing multiple cellulases, as compared to the activity of the mixture of hemicellulases when it is not combined with a mixture containing multiple cellulases. Thus, cellulase and hemicellulase mixtures provided herein may have surprising synergistic effects together, wherein each enzyme mixture has greater activity when combined with the other than when either enzyme mixture is provided with a substrate alone.

[00516] Thermostable hemicellulases may be obtained from organisms capable of degrading hemicellulose. In one aspect, thermostable hemicellulases may be isolated directly from organisms capable of degrading cellulose. In another aspect, thermostable hemicellulases are produced recombinantly, through the use of host cells and expression vectors containing genes encoding thermostable hemicellulases. Thermostable hemicellulases and/or genes encoding thermostable hemicellulases may be isolated from various organisms capable of degrading hemicellulose including, for example and without limitation, archaeal, bacterial, fungal, and protozoan organisms.

[00517] In some aspects, thermostable hemicellulases are recombinant polypeptides related to thermostable hemicellulases of C. bescii. In some aspects, thermostable hemicellulases contain the amino acid sequence of any of SEQ ID NOs: 3, 7, 13, 19, 27, 33, and 37. In some aspects, polynucleotides encoding thermostable hemicellulases contain the nucleic acid sequence of any of SEQ ID NOs: 4, 8, 14, 20, 28, 34, and 38.

Synergy of Hemicellulase Enzymatic Activity

[00518] In certain embodiments, the enzymes of the present disclosure are provided as an enzyme 'cocktail' wherein two or more of the enzymes Cbl93, Cbl95, CM 172, Cb2487, Cb909, and Cbl62 are provided together to degrade hemicellulose or a hemicellulose-derived substrate. In certain embodiments, the enzymes function synergistically and the combination of two or more of the enzymes Cbl93, Cbl95, CM 172, Cb2487, Cb909, and CM62 is more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than the activity of a single enzyme. Similarly, in certain embodiments, enzyme cocktails with three or more of the enzymes Cbl93, Cbl95, CM 172, Cb2487, Cb909, and Cbl62 are more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than enzyme cocktails with one or two of the enzymes. In certain embodiments, enzyme cocktails with four or more of the enzymes Cbl93, Cbl95, CM 172, Cb2487, Cb909, and CM62 are more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than enzyme cocktails with one, two, or three of the enzymes. In certain embodiments, enzyme cocktails with five or more of the enzymes CM93, CM95, CM 172, Cb2487, Cb909, and CM62 are more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than enzyme cocktails with one, two, three, or four of the enzymes. In certain embodiments, enzyme cocktails with all six of the enzymes CM93, CM95, CM 172, Cb2487, Cb909, and CM62 are more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than enzyme cocktails with one, two, three, four, or five of the enzymes.

[00519] In other embodiments, enzyme cocktails with two or more of the enzymes CM93, CM95, CM 172, Cb2487, Cb909, and CM62 are more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than enzyme cocktails with the same total amount of enzyme units but with only one of the species of enzymes. In other embodiments, enzyme cocktails with three or more of the enzymes CM93, CM95, CM 172, Cb2487, Cb909, and CM62 are more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than enzyme cocktails with the same total amount of enzyme units but with only one or two of the species of enzymes. In other embodiments, enzyme cocktails with four or more of the enzymes CM93, CM95, CM 172, Cb2487, Cb909, and CM62 are more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than enzyme cocktails with the same total amount of enzyme units but with only one, two, or three of the species of enzymes. In other embodiments, enzyme cocktails with five or more of the enzymes CM 93, CM 95, Cbl 172, Cb2487, Cb909, and CM62 are more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than enzyme cocktails with the same total amount of enzyme units but with only one, two, three, or four of the species of enzymes. In other embodiments, enzyme cocktails with all six of the enzymes CM93, CM95, Cbl 172, Cb2487, Cb909, and Cbl 62 are more effective at degrading hemicellulose and releasing monosaccharides from hemicellulose than enzyme cocktails with the same total amount of enzyme units but with only one, two, three, four, or five of the species of enzymes.

Treatment Methods of Hemicellulose and Hemicellulose-Containing Materials

[00520] The above-described hemicellulase enzymes and variants can be used alone or in combination to degrade hemicellulose by cleaving one or more functional groups from the xylose backbone to form cleaved hemicellulose.

[00521] Hemicellulose treated with the methods of the present disclosure may be at least 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% degraded. The hemicellulose substrate is degraded when the enzymes cleave the bonds or linkages present between the subunits present in the hemicellulose. Degradation products may comprise xylose, arabinose, glucuronyl groups, acetyl groups, in addition to other functional groups and hydrocarbons.

[00522] In one aspect, plant material containing hemicellulose, or isolated hemicellulose, is treated with one or more of the above-described enzymes, such as Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62. In one embodiment, hemicellulose is treated with Cbl93 in combination with one or more enzymes including Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62. In one embodiment, hemicellulose is treated with Cbl 95 in combination with one or more enzymes including Cbl93, Cbl 172, Cb909, Cb2487, and Cbl62.

[00523] Without wishing to be bound by theory, Applicants believe that the methods of the present disclosure degrade hemicellulose via the following mechanisms. Treatment of hemicellulose with endoxylanases Cbl93, Cbl95, or a variant cleaves P-l,4-xylose linkages in the xylose backbone to generate shorter chains of xylose in P-l,4-linkages. Treatment of hemicellulose with the cc-L-arabinofuranosidase Cbl 172 or a variant cleaves arabinose moiety from the xylose backbone or from branched or debranched arabinan of hemicelluloses to generate exclusively arabinose. Treatment of hemicellulose with the a-glucuronidase Cb909 or a variant cleaves the alpha- 1,2,-glycosidic bond between 4-O-methyl-D-glucuronic acid and the beta-l,4-xylosidic linkage backbone of xylan. Treatment of hemicellulose with the β-xylosidase Cb2487 or a variant cleaves beta-l,4-xylosidic linkages in the xylose backbone. Treatment of hemicellulose with Cbl62 or a variant cleaves the linkages between xylose and the side chain of acetyl groups in hemicellulose to provide more accessibility to other hemicellulases such as xylanase and β-xylosidase to the backbone of xylan. Using a combination or two or more enzymes is believed to provide synergistic hemicellulose degradation activity.

[00524] In certain embodiments, plant material containing hemicellulose, or isolated hemicellulose, may be treated with one or more isolated or recombinant polypeptides comprising an amino acid sequence having at least about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or more, or complete (100%) sequence identity/sequence similarity to Cbl93, Cbl95, CM 172, Cb2487, Cb909, and Cbl62.

[00525] The polypeptides may be administered directly, either alone, or as a composition.

[00526] In other methods of the present disclosure, hemicellulose is degraded by contact with a transgenic host cell secreting one or more polypeptides including Cbl93, Cbl95, Cbl 172, Cb2487, Cb909, and Cbl62. In some embodiments, the transgenic host cell may be Escherichia, Pseudomonas, Proteus, Ralstonia, Streptomyces, Staphylococcus, Lactococcus, Bacillus, Saccharomyces cerevisiae, Schizosaccharomyces pombe, Yarrowia lipolytica, Hansenula polymorpha, Kluyveromyces lactis, Pichia pastoris, Aspergillus, Chrysosporium lucknowense, or Trichoderma reesei. In some embodiments, the transgenic host cell may be a thermophilic microorganism. In one embodiment, the thermophilic host cell is Caldicellulosiruptor bescii.

[00527] The transgenic host cell may contain a vector encoding Cbl93, Cbl95, Cbl 172, Cb909, Cb2487, Cbl62 or variants thereof. In some embodiments, the hemicellulose is degraded by treating with Cbl 93 or a variant alone, or in combination with one or more of Cbl95, Cbl 172, Cb909, Cb2487, Cbl62, and variants thereof. In some embodiments, the hemicellulose is degraded by treating with Cbl 95 or a variant alone, or in combination with one or more of Cbl93, Cbl 172, Cb909, Cb2487, Cbl62, and variants thereof. [00528] The methods of the present disclosure can be practiced with any plant material that contains hemicellulose. Plant material suitable for use with the currently disclosed methods include Miscanthus, switchgrass, cord grass, rye grass, reed canary grass, elephant grass, common reed, wheat straw, barley straw, canola straw, oat straw, corn stover, soybean stover, oat hulls, sorghum, rice hulls, rye hulls, wheat hulls, sugarcane bagasse, corn fiber, Distillers Dried Grains with Solubles (DDGS), Blue Stem, corncobs, pine, birch, willow, aspen, poplar wood, and energy cane. The methods may also be practiced on isolated hemicellulose.

[00529] In certain embodiments, thermophilic enzymes of the present disclosure are provided with a substrate at a concentration of at least 0.01 nM enzyme of each enzyme. In certain embodiments, the enzymes are provided with a substrate at a concentration of at least 0.1 nM enzyme of each enzyme. In certain embodiments, the enzymes are provided with a substrate at a concentration of at least 1 nM enzyme of each enzyme. In certain embodiments, the enzymes are provided with a substrate at a concentration of at least 10 nM enzyme of each enzyme. In certain embodiments, the enzymes are provided with a substrate at a concentration of at least 0.1 μΜ enzyme of each enzyme. In certain embodiments, the enzymes are provided with a substrate at a concentration of at least 10 μΜ enzyme of each enzyme. In certain embodiments, the enzymes are provided with a substrate at a concentration of at least 100 μΜ enzyme of each enzyme.

[00530] The methods of the present disclosure can be practiced at any pH and temperature at which hemicellulose can be degraded; however, in certain embodiments, the methods of the present disclosure are practiced in a pH range of about 5 to about 7 and at or between a temperature between about 60 and about 80° C.

[00531] Combination of Thermostable Hemicellulose-Degrading Enzymes with

Thermostable Cellulases

[00532] In some embodiments, thermostable hemicellulose-degrading enzymes of the present disclosure are provided with thermostable cellulases. Cellulases are enzymes that can hydrolyze cellulose, and they include, but are not limited to, exoglucanses, endoglucanases, and β- glucosidases. In some aspects, thermostable cellulases have optimal enzymatic activity at temperatures above 55 °C. Thermostable cellulases may be provided with the thermostable hemicellulose-degrading enzymes of the present disclosure in order to increase the degradation of materials containing both cellulose and hemicellulose, such as biomass from terrestrial plants. For example and without limitation, in one aspect, microorganisms can be provided that express hemicellulose-degrading enzymes of the present disclosure and thermostable cellulases. In one aspect, compositions containing hemicellulose-degrading enzymes of the present disclosure may also contain thermostable cellulases. In other aspects, methods of degrading biomass, of converting biomass into fermentation product, and of converting biomass to fuel are provided, in which biomass is contacted with hemicellulose-degrading enzymes of the present disclosure and with thermostable cellulases

[00533] Thermostable cellulases may be obtained from organisms capable of degrading cellulose. In one aspect, thermostable cellulases are obtained directly from organisms capable of degrading cellulose. In another aspect, thermostable cellulases are produced recombinantly, through the use of host cells and expression vectors containing genes encoding thermostable cellulases. Thermostable cellulases and/or genes encoding thermostable cellulases may be isolated from various organisms capable of degrading cellulose including, for example and without limitation, archaeal, bacterial, fungal, and protozoan organisms.

[00534] Organisms capable of degrading cellulose include for example and without limitation, those belonging to the genera Aquifex, Bacillus, Rhodothermus, Thermobifida, Thermotoga, Anaerocellum, Sulfolobus, Pyrococcus and Caldicellulosiruptor. A recombinant thermostable endoglucanase of Aquifex aeolicus produced in E. coli showed maximal activity at 80°C and pH 7.0 with a half-life of 2 h at 100°C (Kim JS, Lee YY, Torget, RW. (2001).

Cellulose hydrolysis under extremely low sulfuric acid and high-temperature conditions. Appl. Biochem. Biotechnol. 91-93. 331-340)). The endoglucanases produced by Anaerocellum thermophilum and Caldicellulosiruptor saccharolyticus are multidomain enzymes composed of two catalytic domains, linked to carbohydrate binding domains by proline-threonine-rich regions (Zverlov V, Mahr S, Riedel K, Bronnenmeier K (1998a), "Properties and gene structure of a bifunctional cellulolytic enzyme (CelA) from the extreme thermophile Anaerocellum

thermophilum' with separate glycosyl hydrolase family 9 and 48 catalytic domains,"

Microbiology 144 ( Pt 2): 457-465; Te'o VS, Saul DJ, Bergquist PL (1995), "celA, another gene coding for a multidomain cellulase from the extreme thermophile Caldocellum saccharolyticum " Appl Microbiol Biotechnol 43: 291-296; Saul et al. 1990. The recombinant endoglucanase of Rhodothermus marinus has a pH optimum of 6.0-7.0 and a temperature optimum at 100°C (Halldorsdottir S, Thorolfsdottir ET, Spilliaert R, Johansson M,

Thorbjarnardottir SH, Palsdottir A, Hreggvidsson GO, Kristjansson JK, Hoist O, Eggertsson G. (1998), "Cloning, sequencing and overexpression of a Rhodothermus marinus gene encoding a thermostable cellulase of glycosyl hydrolase family 12," Appl Microbiol Biotechnol 49: 277- 284). The aerobic thermophilic bacterium Thermus caldophilus also produces an endoglucanase which exhibits high activity on CMC with cellobiose and cellotriose as products (Kim D, Park BH, Jung B-W, Kim M-K, Hong SI, Lee, DS (2006) Identification and molecular modeling of a family 5 endocellulase from Thermus caldophilus GK24, a cellulolytic strain of Thermus theraiophilus. Int J Mol Sci 7: 571-589). Thermostable cellulases have also been described from Bacillus subtilis (Mawadza, C, Hatti-Kaul, R., Zvauya, R. and Mattiasson, B., 2000. Purification and characterization of cellulases produced by two Bacillus strains. J. Biotechnol. 83, pp. 177- 187), from Pyrococcus furiosus (Kengen, S., Luesink, E., Stams, A. and Zehnder, A., 1993. Purification and characterization of an extremely thermostable β-glucosidase from the hyperthermophilic archaeon Pyrococccus furiosus . Eur. J. Biochem. 213, pp. 305-312.), from Pyrococcus horikoshi (Ando, S., Ishida, H., Kosugi, Y. and Ishikawa, K., 2002.

Hyperthermostable endoglucanase from Pyrococcus horikoshi. Appl. Environ. Microbiol. 68, pp. 430-433.), from Rhodothermus marinus (Hreggvidsson, GO., Kaiste, E., Hoist, O., Eggertsson, G., Palsdottir, A. and Kristjansson, J.K., 1996. An extremely thermostable cellulase from the thermophilic eubacterium Rhodothermus marinus. Appl. Environ. Microbiol. 62, pp. 3047- 3049.), from Thermatoga maritema (Bronnenmeier, K., Kern, A., Libel, W. and Staudenbauer, W., 1995. Purification of Thermatoga maritema enzymes for the degradation of cellulose materials. Appl. Environ. Microbiol. 61, pp. 1399-1407.), and from Thermatoga neapolitana (Bok, J., Goers, S. and Eveleigh, D., 1994. Cellulase and xylanase systems of Thermatoga neapolitana. ACS Symp. Ser. 566, pp. 54-65; Bok, J., Dienesh, A., Yernool, D. and Eveleigh, D., 1998. Purification, characterization and molecular analysis of thermostable cellulases CelA and CelB from Thermatoga neapolitana. Appl. Environ. Microbiol. 64, pp. 4774-4781.).

[00535] In some aspects, the thermostable cellulases are any of CM952, CM953, CM954, CM 946, Cb629, or Cb486 polypeptides. [00536] In some aspects, any mixture of hemicellulases or hemicellulase with cellulases provided herein may further be provided with CM581 polypeptides.

Additional Applications

[00537] The methods described herein can be practiced in combination with other methods useful for converting lignocellulosic materials into biofuels.

[00538] For example, plant material may be subjected to pretreatment including ammonia fiber expansion (AFEX), steam explosion, treatment with alkaline aqueous solutions, acidic solutions, organic solvents, ionic liquids (IL), electrolyzed water, phosphoric acid, and combinations thereof. Pretreatments that remove lignin from the plant material may increase the overall amount of sugar released from the hemicellulose.

[00539] In certain embodiments, where a cellulase mixture is being used to release glucose from plant cell walls, a hemicellulase enzyme cocktail of the present disclosure may be used to hydrolyze the hemicellulosic component of the plant material and increase accessibility of the cellulase cocktail to the cellulose fraction of the plant material.

[00540] Typically, the compositions and methods of the present disclosure are used to generate biofuels or specialty chemicals. In one aspect, the compositions and methods of the present disclosure are used to degrade hemicellulose into fermentable sugars. The fermentable sugars are then converted into biofuel components, such as ethanol, propanol, and butanol, or specialty chemicals, such as ketones and aldehydes. The fermentable sugars may be converted by a microorganism, such as yeast, or by isolated enzymes.

[00541] The hemicellulose-related methods described herein can be practiced in combination with cellulases. Additional methods are provided for the use of the polypeptides and

compositions as feed additives for monogastric animal agriculture, including pigs and poultry production.

EXAMPLES [00542] The following Examples are merely illustrative and are not meant to limit any aspects of the present disclosure in any way.

Example 1: Endoxylanase Cbl93 (SEQ ID NOs: 3 and 4)

[00543] An endoxylanase, Cbl93, was identified in Caldicellulosiruptor bescii. The enzyme is the gene product of Cb l93, where Cb stands for C. bescii. The endoxylanase cleaves the xylose backbone of hemicellulose at random to generate shorter chains of xylose in β-1,4- linkages. These xylo-oligosaccharides can range from two or more sugar subunits. The Cbl93 protein is 671 amino acids long and has a molecular mass of 77.7 kDa (His-tag + truncated Cbl93 protein). The protein has two putative carbohydrate binding modules (CBM) inserted within the glycoside hydrolase (GH) family 10 catalytic domain (Figure 2A).

Cloning of Cb!93

[00544] The gene for Cbl93 was amplified from Caldicellulosiruptor bescii genomic DNA by PCR using iProof HF DNA polymerase (BIO-RAD). The Cbl93 gene was amplified using the following primer set:

Cbl93For

5'- GACGACGACAAGATGAACTTTGAAGGAAGAGAC-3' (SEQ ID NO: 134) Cbl93Rev

5'- GAGGAGAAGCCCGGTTATTTT TTAGCCTTTAC-3 ' (SEQ ID NO: 135) [00545] The polymerase chain reaction mixture contained the following:

PCR reaction

2 υ/μί iProof HF DNA polymerase 0.5

13.7 ng^L C. bescii gDNA 1

50 μΜ Fw Primer 0.5

50 μΜ Rv Primer 0.5

10 mM dNTP Mixture 1

5x iProof HF Buffer 10

dH 2 0 36.5

Total 50 iL [00546] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 98°C 10 sec

Annealing 62°C 30 sec 35 cycles

Elongation 72°C 120 sec

Last 4°C oo

[00547] After the PCR amplification described above, the amplification of Cbl93 gene was confirmed by 1% agarose gel electrophoresis. T4 DNA polymerase (Novagen) was then added to the purified PCR product to generate compatible overhangs.

T4 DNA polymerase treatment Incubation

2.5 U/mL T4 DNA Polymerase 0.2 22°C 30 min

Purified PCR Product 2.1 75°C 20 min

25 mM dATP 1 4°C oo

100 mM DTT 0.5

lOx T4 DNA Polymerase Buffer 1

dH 2 0 5.2

Total

After the reaction, the following annealing reaction was prepared with pET46

Annealing Incubation pET46 Ek/LIC vector 0.5 22°C 5 min

Reaction Mixture 1

Total 1.5

[00549] After the incubation, EDTA was added to the reaction.

Annealing Incubation 25 mM EDTA 0.5 22°C 5 min pET46 Ek/LIC vector 0.5

Reaction Mixture 1

Total 2 μΙ.

[00550] The annealing mixture for Cb 193-pET46 Ek/LIC was introduced into E. coli JM 109 by electroporation and the cells were plated on LB-ampicillin. After overnight incubation at 37° C, three colonies were selected and used to inoculate 10 mL cultures of LB-ampicillin. The cultures were grown at 37 °C with vigorous aeration for 16 hours and minipreps were made of the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to confirm the size of plasmid/insert DNA. Next, the integrity of the gene was confirming by nucleotide sequencing.

[00551] For gene expression, one of the plasmids was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with

chloramphenicol and ampicillin at 100 μg/ml and 50 μg/ml and incubated at 37°C overnight. Five to six colonies were inoculated into 3 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 4 hours. One mL of the culture was added to 500 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37°C until the absorbance at 600 nm reached -0.25. The inducer, IPTG, was then added at 0.5 mM final concentration, and the culturing continued at 16° C overnight.

Protein purification

[00552] Cultures were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (50 mM Tris-HCL pH 7.5, 300 mM of NaCl). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed several times to remove the unbound proteins. The bound protein (6-Histidine-tagged Cbl93) was then eluted from the resin with an elution buffer composed of the lysis buffer supplemented with 150 mM imidazole.

[00553] The gene product of Cbl93 was expressed in a truncated form. The first 41 amino acids, which represent a signal peptide, were removed. In the native organism, C. bescii, the signal peptide facilitates transport of the Cbl93 out of the cell so that it can act on its target substrate (xylan or plant cell wall) in the medium. Usually after transportation outside the cell, the signal peptide is processed (cleaved) off the protein. Signal peptides can often become a problem during production of recombinant proteins. To circumvent this potential problem, i.e., to prevent secretion of the protein into the periplasm, the PCR primers were designed to remove the signal peptide. The signal peptide does not influence catalytic activity. The design of the PCR primers also ensured that the protein was fused to 6-histidines encoded in the plasmid. The six histidines will bind to either a nickel-charged resin or a cobalt-charged resin. The bound protein can be displaced from the resin with a buffer containing imidazole. This method facilitates quick purification of the protein of interest.

Cbl93 (amino acid sequence)

[00554] The Cbl93 [ENDO- 1 ,4-BETA-XYLANASE A PRECURSOR (EC 3.2.1.8)] amino acid sequence is disclosed in SEQ ID NO: 3. The signal peptide of Cbl93, corresponding to amino acid numbers 1-41 of SEQ ID NO: 3 was removed to create the Cbl93 protein expression vector. Thus, the expressed Cbl93 protein did not contain amino acids 1-41 of SEQ ID NO: 3. The amino acid sequence of the Cbl93 protein without the signal peptide is disclosed in SEQ ID NO: 37.

Cbl93 (nucleotide sequence)

[00555] The Cbl93 nucleotide sequence is disclosed in SEQ ID NO: 4. Nucleotide numbers 1-123 of SEQ ID NO: 4 correspond to the signal peptide of Cbl93, and were not present in the gene cloned to make Cbl93. The Cbl93 gene without the first 123 nucleotides is disclosed in SEQ ID NO: 38, which encodes the amino acid sequence of SEQ ID NO: 37.

[00556] The procedure of cloning the gene for Cbl93 into the plasmid pET46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The short peptide comprises the first 15 amino acids of SEQ ID NO: 6. The nucleotide sequence encoding SEQ ID NO: 6 is SEQ ID NO: 5. [00557] The Cbl93 gene was expressed in E. coli cells, and the protein was purified in three steps (TALON affinity chromatography, ion exchange chromatography, and gel filtration). Figure 2B shows an SDS-PAGE of purified Cbl93. The molecular markers are in the lane marked M.

Enzyme Activity

[00558] The enzymatic activity of Cbl93 was measured according to the methods of Morag, E., Bayer, E.A., and Lamed, R. (Relationship of cellulosomal and non-cellulosomal xylanases of Clostridium thermocellum to cellulose degrading enzymes. J. Bacteriol. 1990: 172; 6098-6105). 1 \L of sample supernatant (substrate reacted with enzyme) was spotted on TLC plate. A marker mixture was made by combining each 0.2 of 1%

xylose/xylobiose/xylotriose/xylotetraose/xylopentaose. All sugars were purchased from

Megazyme. The spots were dried and the TLC plate was developed in a developing tank for 1 hour. The plate was dried in a chamber for 30 min. The plate was sprayed with visualizing reagent and incubated for 5 to 10 min at 75°C to visualize the results.

[00559] Figure 2C shows the enzymatic activity of Cbl93 on natural substrates using TLC analysis. Various substrates were tested: soluble wheat arabinoxylan (SWAX), oat-spelt xylan (OSX), birchwood xylan (BWX), carboxymethyl cellulose (CMC), lichenan, glucomannan, 1,4 β-mannan, arabinan. In the case of SWAX, OSX, and BWX, in the presence of Cbl93 (+), short xylose chains were released. In the minus (-) lanes, no enzyme was added and therefore no products of hydrolysis were released. XI (xylose monomer), X2 (xylose dimer or a

disaccharide), X3 (trisaccharide), X4 (tetrasaccharide), and pentasaccharide (X5) were loaded in the first lane (M) as markers. The results showed that this enzyme releases shorter chains or oligosaccharides from the complex substrates (SWAX, OSX, and BWX).

[00560] The concentration of glucose equivalents was determined following enzymatic hydrolysis of SWAX and OSX according to the methods of Lever, M. (A new reaction for colorimetric determination carbohydrates. Anal. Biochem. 1972: 47; 273-279). 1.5 mL microcentrifuge tubes were "zeroed" in an analytical balance. Next, 5 ± 0.1 mg SWAX or OSX were added to each tube, and the mass measured and recorded. The volumes needed to be added to each tube were calculated based on the mass. Sodium phosphate reaction buffer and enzymes were added to each tube beginning with the reaction buffer. The tubes were incubated with constant mixing in a Rotisserie- style tube mixer at 37°C for 15 h. The tubes were centrifuged at 10,000 rpm for 5 min at 4°C. 100 μΙ_, of sample supernatant was transferred to a clean 1.5 mL centrifuge tube for the /?HBAH assay, and 150 μΙ_, of sodium citrate reaction buffer was added for a final volume of 250 μί. 1 mL of a stock solution of glucose was made at a concentration of 20 mM in sodium citrate buffer, and then serial dilutions were made in sodium citrate buffer to the following concentrations (20 mM, 10 mM, 5 mM, 2.5 mM, 1.25 mM, 0.625 mM, 0.3125 mM). 50 mg of /?HBAH was dissolved in 50 mL of ice-cold citrate/NaOH solution for a final concentration of 0.1% (w/v), and the solution kept on ice. 112.5 μΙ_, of /?HBAH solution was added to 37.5 μΙ_, of the sample and glucose standard solutions, and the tubes were incubated at 100°C for 10 min. The tubes were incubated at room temperature for 5 min. The wavelength at 410 nm was measured for the standards and samples. The A 41 o n m and glucose concentrations were plotted against each other, and linear regression was used to fit a line to the data. The correlation coefficient (R ) value was between 0.98 and 1.0. The equation from the standard curve was used to calculate the concentrations of reducing ends in the samples based upon their absorbances.

[00561] Figure 2E shows the enzymatic activity of Cbl93 on natural substrates from a reducing sugar assay. In this experiment, a different assay for reducing sugars was used to determine the release of products from the substrates. A standard was made based on known glucose concentrations and their absorbance (color development) in the presence of para- hydroxy-benzoic acid hydrazide (Cann et al. 1999. J. Bacterial. 181: 1643-1651 and other reference above-Laver, M. 1972.). Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of glucose equivalents.

[00562] Figure 3 A shows the thermostability of Cbl93. Final 5 nM of Cbl93 was incubated at different temperatures from 70-90 °C. The Cbl93 enzymes were incubated at 70 °C, 75 °C, 80 °C, 85 °C, 90 °C. The incubated enzymes were taken out at certain time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7 h, 11 h, 16 h, and 24 h) and immediately incubated with wheat arabinoxylan (final 1%, w/v) to measure the enzyme activity. The initial velocity of reaction was calculated. The residue activity (%) was calculated by dividing the activity of each sample by the initial activity at zero time. Bars are shown with standard errors for three independent experiments.

[00563] Figure 4 shows the kinetic data of Cbl93 on hydrolysis of wheat arabinoxylan, oat spelt xylan, and birchwood xylan. The K m , k and k c K m are indicated as well. In part (A), the experiment was conducted at 75 °C with 50 mM citrate buffer (pH 6.0). In part (B), the experiment was conducted at 85°C with 50 mM citrate buffer (pH 6.0). Xylan substrates (final 2.5 -50 mg/mL) were incubated with CB193 (final 5 nM for wheat arabinoxylan and final 50 nM for oat spelt xylan and birchwood xylan). The initial velocity of reaction was calculated. The initial velocities were then plotted against the concentrations of xylan substrates. The K m and k cat were calculated by non-linear fit using the Graphpad software. Bars are shown with standard errors for three independent experiments.

Example 2: Endoxylanase Cbl95 (SEQ ID NOs: 7 and 8)

[00564] An endoxylanase, Cbl95, was identified in Caldicellulosiruptor bescii. The enzyme is the gene product of Cbl95, where Cb stands for C. bescii. The endoxylanase cleaves the xylose backbone of hemicellulose at random to generate shorter chains of xylose in β-1,4- linkages. These xylo-oligosaccharides can range from containing two or more sugar subunits. The Cbl95 protein is 351 amino acids long and has a molecular weight of 41.9 kDa (His-tag + CM 95 protein) (Figure 2A).

Cloning of Cb!95

[00565] The gene for Cbl95 was amplified from Caldicellulosiruptor bescii genomic DNA by PCR using iProof HF DNA polymerase (BIO-RAD).

[00566] The polymerase chain reaction mixture contained the following:

PCR reaction

2 υ/μί iProof HF DNA polymerase 0.5

13.7 ngl L C. bescii gDNA 1

50 μΜ Fw Primer 0.5 50 μΜ Rv Primer 0.5

10 mM dNTP Mixture 1

5x iProof HF Buffer 10

dH 2 Q 36.5

Total 50 μΐ.

[00567] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 98°C 10 sec

Annealing 62°C 30 sec 35 cycles

Elongation 72°C 120 sec

Last 4°C oo

[00568] After the PCR amplification described above, the amplification of Cbl95 gene was confirmed by 1% agarose gel electrophoresis. T4 DNA polymerase (Novagen) was then added to the purified PCR product to generate compatible overhangs.

T4 DNA polymerase treatment Incubation

2.5 U/mL T4 DNA Polymerase 0.2 22°C 30 min

Purified PCR Product 2.1 75°C 20 min

25 mM dATP 1 4°C oo

100 mM DTT 0.5

lOx T4 DNA Polymerase Buffer 1

dH 2 0 5.2

Total

After the reaction, the following annealing reaction was prepared with pET46

Annealing Incubation pET46 Ek/LIC vector 0.5 22°C 5 min

Reaction Mixture 1_

Total 1.5 μΐ ^ [00570] After the incubation, EDTA was added to the reaction.

Annealing Incubation

25 niM EDTA 0.5 22°C 5 min

pET46 Ek/LIC vector 0.5

Reaction Mixture 1

Total 2 L

[00571] The annealing mixtures for Cbl95-pET46 Ek/LIC was introduced into E. coli JM109 by electroporation and the cells were plated on LB-ampicillin. After overnight incubation at 37° C, three colonies were selected and used to inoculate 10 mL cultures of LB-ampicillin. The cultures were grown at 37 °C with vigorous aeration for 16 hours and minipreps were made of the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to confirm the size of plasmid/insert DNA. Next, the integrity of the gene was confirming by nucleotide sequencing.

[00572] For gene expression, one of the plasmids was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with

chloramphenicol and ampicillin at 100 μg/ml and 50 μg/ml and incubated at 37°C overnight. Five to six colonies were inoculated into 3 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 4 hours. One mL of the culture was added to 500 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37°C until the absorbance at 600 nm reached -0.25. The inducer, IPTG, was then added at 0.5 mM final concentration, and the culturing continued at 16° C overnight.

Protein purification

[00573] Cultures were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (50 mM Tris-HCL pH 7.5, 300 mM of NaCl). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed several times to remove the unbound proteins. The bound protein (6-Histidine-tagged Cbl95) was then eluted from the resin with an elution buffer composed of the lysis buffer supplemented with 150 mM imidazole. The protein was purified in three steps (TALON affinity chromatography, ion exchange chromatography, and gel filtration). Figure 2B shows an SDS-PAGE of purified Cbl95. The molecular mass markers are in the lane marked M.

[00574] The Cbl95 [ENDO- 1 ,4-BETA-XYLANASE A PRECURSOR (EC 3.2.1.8)] amino acid sequence is disclosed in SEQ ID NO: 7. The nucleotide sequence encoding Cbl95 is disclosed in SEQ ID NO: 8.

[00575] For protein expression, Cbl95 was cloned into the plasmid pET46 Ek/LIC. The amino acid sequence of Cbl95-pET46 Ek/LIC is SEQ ID NO: 10. Amino acid numbers 1-15 of SEQ ID NO: 10 are from the pET46 Ek/LIC plasmid, and include a sequence of six histidines to facilitate protein purification. The nucleotide sequence encoding SEQ ID NO: 10 is disclosed in SEQ ID NO: 9. Nucleotide numbers 1-45 of SEQ ID NO: 9 are from the pET46 Ek/LIC plasmid.

Enzyme Activity

[00576] The enzymatic activity of Cbl95 was measured according to the methods of Morag, E., Bayer, E.A., and Lamed, R. (Relationship of cellulosomal and non-cellulosomal xylanases of Clostridium thermocellum to cellulose degrading enzymes. J. Bacteriol. 1990: 172; 6098-6105). 1 \L of sample supernatant (substrate reacted with enzyme) was spotted on TLC plate. A marker mixture was made by combining each 0.2 μL· of 1%

xylose/xylobiose/xylotriose/xylotetraose/xylopentaose. All sugars were purchased from

Megazyme. The spots were dried and the TLC plate was developed in a developing tank for 1 hour. The plate was dried in a chamber for 30 min. The plate was sprayed with visualizing reagent and incubated for 5 to 10 min at 75°C to visualize the results.

[00577] Figure 2D shows the enzymatic activity of Cbl95 on natural substrates using TLC analysis. Various substrates were tested: soluble wheat arabinoxylan (SWAX), oat-spelt xylan (OSX), birchwood xylan (BWX), carboxymethyl cellulose (CMC), lichenan, glucomannan, 1,4 β-mannan, arabinan. In the case of SWAX, OSX, and BWX, in the presence of Cbl95 (+), short xylose chains were released. In the minus (-) lanes, no enzyme was added and therefore no products of hydrolysis were released. XI (xylose monomer), X2 (xylose dimer or a disaccharide), X3 (trisaccharide), X4 (tetrasaccharide), and pentasaccharide (X5) were loaded in the first lane (M) as markers. The results showed that this enzyme releases shorter chains or oligosaccharides from the complex substrates (SWAX, OSX, and BWX).

[00578] The concentration of glucose equivalents was determined following enzymatic hydrolysis of soluble wheat arabinoxylan (SWAX) and oat- spelt xylan (OSX) according to the methods of Lever, M. (A new reaction for colorimetric determination carbohydrates. Anal. Biochem. 1972: 47; 273-279). 1.5 mL microcentrifuge tubes were "zeroed" in an analytical balance. Next, 5 ± 0.1 mg SWAX or OSX were added to each tube, and the mass measured and recorded. The volumes needed to be added to each tube were calculated based on the mass. Sodium phosphate reaction buffer and enzymes were added to each tube beginning with the reaction buffer. The tubes were incubated with constant mixing in a Rotisserie- style tube mixer at 37°C for 15 h. The tubes were centrifuged at 10,000 rpm for 5 min at 4°C. 100 μΐ. of sample supernatant was transferred to a clean 1.5 mL centrifuge tube for the /?HBAH assay, and 150 of sodium citrate reaction buffer was added for a final volume of 250 ^L. 1 mL of a stock solution of glucose was made at a concentration of 20 mM in sodium citrate buffer, and then serial dilutions were made in sodium citrate buffer to the following concentrations (20 mM, 10 mM, 5 mM, 2.5 mM, 1.25 mM, 0.625 mM, 0.3125 mM). 50 mg of pHBAH was dissolved in 50 mL of ice-cold citrate/NaOH solution for a final concentration of 0.1% (w/v), and the solution kept on ice. 112.5 of /?HBAH solution was added to 37.5 of the sample and glucose standard solutions, and the tubes were incubated at 100°C for 10 min. The tubes were incubated at room temperature for 5 min. The wavelength at 410 nm was measured for the standards and samples. The A 41 o n m and glucose concentrations were plotted against each other, and linear regression was used to fit a line to the data. The correlation coefficient (R ) value was between 0.98 and 1.0. The equation from the standard curve was used to calculate the concentrations of reducing ends in the samples based upon their absorbances.

[00579] Figure 2E shows the enzymatic activity of Cbl95 on natural substrates from a reducing sugar assay. In this experiment, a different assay for reducing sugars was used to determine the release of products from the substrates. A standard was made based on known glucose concentrations and their absorbance (color development) in the presence of para- hydroxy-benzoic acid hydrazide (Cann et al. 1999. J. Bacterial. 181: 1643-1651 and other reference above-Laver, M. 1972.). Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of glucose equivalents.

[00580] Figure 3B shows the thermostability of Cbl 95. Final 5 nM of Cbl 95 were incubated at different temperatures ranging from 65-80 °C. The Cbl 95 enzymes were incubated at 65 °C, 70 °C, 75 °C, and 80 °C. The incubated enzymes were taken out at certain time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7 h, 11 h, 16 h, and 24 h) and immediately incubated with wheat arabinoxylan (final 1%, w/v) to measure the enzyme activity. The initial velocity of reaction was calculated. The residue activity ( ) was calculated by dividing the activity of each sample by the initial activity at zero time. Bars are shown with standard errors for three independent experiments.

[00581] Figure 5 shows the kinetic data of Cbl 95 on hydrolysis of wheat arabinoxylan, oat spelt xylan, and birchwood xylan. The K m , fc cat , and k c K m are indicated as well. In part (A), the experiment was conducted at 75 °C with 50 mM citrate buffer (pH 6.0). In part (B), the experiment was conducted at 75°C with 50 mM sodium phosphate buffer (pH 6.5). Xylan substrates (final 2.5 -50 mg/mL) were incubated with Cbl 95 (final 5 nM for wheat arabinoxylan and final 50 nM for oat spelt xylan and birchwood xylan). The initial velocity of reaction was calculated. The initial velocities were then plotted against the concentrations of xylan substrates. The K m and k cat were calculated by non-linear fit using the Graphpad software. Bars are shown with standard errors for three independent experiments.

Example 3: a-L-arabinofuranosidase Cbll72 (SEQ ID NOs: 13 and 14)

[00582] An a-L-arabinofuranosidase, Cbl 172, was identified in Caldicellulosiruptor bescii. The enzyme is the gene product of Cbl 172. The α-L-arabinofuranosidase cleaves arabinose moiety from the xylose backbone or from branched or debranched arabinan of hemicellulose to generate exclusively arabinose. The Cbl 172 protein is 505 amino acids long and has a molecular mass of 59.6 kDa (His-tag + Cbl 172 protein). The protein has a glycoside hydrolase (GH) family 51 catalytic domain (Figure 6D). Cloning of Cb 1172

[00583] The gene for Cbl 172 was amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using iProof™ High-Fidelity DNA Polymerase (BIO-RAD) . The Cbl 172 gene was amplified using the following primer set:

Cbl l72Forward

5'-GAC GAC GAC AAG ATG AAA AAA GCA AAA GTC ATC TAC-3' (SEQ ID NO: 136 )

Cbl l72Reverse

5 '-GAG GAG AAG CCC GGT TAA TTT TCT TTC TTC TTT AAC CTG-3' (SEQ ID NO: 137)

[00584] The polymerase chain reaction mixture contained the following:

PCR reaction

2 \5l xL iProoF M High-Fidelity DNA Polymerase 0.5

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 0.5

50 μΜ Rv Primer 0.5

10 mM dNTP Mixture 1

5 X iProof HF Buffer 10

dH 2 Q 36,5

50

Total μΐ,

[00585] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 98°C 30 sec 1 cycle

Denaturing 98°C 10 sec

Annealing 62°C 30 sec 35 cycles

Elongation 72°C 2 min

Elongation 72°C 10 min 1 cycle

Last 4°C oo [00586] After the PCR reaction described above, the amplification of Cbl 172 gene was confirmed by 1% agarose gel electrophoresis. The DNA corresponding to the expected band on the gel was cut out and applied to a Qiagen Gel Extraction kit to extract the DNA out of the gel.

[00587] The Novagen pET-46 Ek/LIC kit was used to treat the purified DNA and ligate it into the pET-46 Ek/LIC vector. The treatment of the purified DNA was as follows:

Unit

Reaction (μΐ) Incubation

0.1 pmol purified PCR product X

10X T4 DNA Polymerase buffer 1

25 mM dATP 1

100 mM DTT 0.5

Nuclease-free water 7.3-X

2.5 U/μΙ T4 DNA Polymerase 0.2

Total 10 22°C 30 min

[00588] After the reaction, the enzyme was deactivated by incubating at 75 °C for 20 min. [00589] The following protocol was used to anneal the insert into the pET-46 Ek/LIC vector.

Unit

Reaction (μΐ) Incubation pET-46 Ek/LIC vector 0.5

T4 DNA Polymerase treated EK/LIC insert 1

Total 1.5 22°C 5 min

Then add 0.5 μΐ 25 mM EDTA. Mix by stirring with pipet tip. Incubate at 22°C for 5 min.

[00590] The ligation mixture for Cb 1172-pET-46 Ek/LIC was introduced into E. coli JM 109 by electroporation method, and the cells were plated on LB-ampicillin. After overnight incubation at 37°C, four colonies were selected and used individually to inoculate 6 mL cultures of LB-ampicillin. The cultures were grown at 37°C with vigorous aeration for 16 hours, and plasmid minipreps (QIAGEN) were made from the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to confirm the size of the plasmid DNA. The plasmid inserts (genes) were sequenced to confirm their identity.

[00591] For gene expression, one of the correct plasmids was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with chloramphenicol (100 μg/ml) and ampicillin (50 μg/ml) and incubated at 37°C overnight. Five to six colonies were inoculated into 3 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 4 hours. One mL of the culture was added to 500 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37 °C until the absorbance at 600 nm reached -0.25. The inducer, IPTG, was then added at 0.01 mM final concentration, and the culturing continued at 16° C overnight.

Protein purification

[00592] Cultures were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (25 mM Tris-HCL pH 7.8, 750 mM of NaCl, 5% glycerol, 20 mM imidazole, 1.25% Tween-20). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed three times to remove the unbound proteins. The bound protein (6-Histidine-tagged Cbl 172) was then eluted from the resin with an elution buffer (50 mM Tris- HCL, pH7.5, 250 mM imidazole).

[00593] The gene product of Cbl 172 was expressed in its full length form. The design of the PCR primers ensured that the protein was fused to 6-histidines encoded in the plasmid. The six histidines will bind to either a nickel-charged resin or a cobalt-charged resin. The bound protein can then be displaced from the resin with a buffer containing imidazole. This method facilitated quick purification of the protein.

[00594] The CM 172 [cc-L-arabinofuranosidase (EC 3.2.1.55)] amino acid sequence is disclosed in SEQ ID NO: 13. The nucleotide sequence encoding Cbl 172 is disclosed in SEQ ID NO: 14. [00595] For protein expression, Cbl 172 was cloned into the plasmid pET46 Ek/LIC. The amino acid sequence of Cbl95-pET46 Ek/LIC is SEQ ID NO: 16. Amino acid numbers 1-15 of SEQ ID NO: 16 are from the pET46 Ek/LIC plasmid, and include a sequence of six histidines to facilitate protein purification. The nucleotide sequence encoding SEQ ID NO: 16 is disclosed in SEQ ID NO: 15. Nucleotide numbers 1-45 of SEQ ID NO: 15 are from the pET46 Ek/LIC plasmid.

[00596] The Cbl 172 gene was expressed in E. coli cells, and the protein was purified in two steps, including a talon resin purification (immobilized metal affinity chromatography) step making use of the 6-histidines encoded by the plasmid and an anion exchange step using Hitrap Q column. Figure 6A shows an SDS-PAGE of purified Cbl 172.

Enzyme Activity

[00597] Figure 6B shows the enzymatic activity of Cbl 172 on natural substrates from a reducing sugar assay. Five different hemicellulosic substrates were tested: arabinan (sugar beet), soluble wheat arabinoxylan (SWAX), rye arabinoxylan (RAX), oat spelt xylan (OSX) and debranched arabinan. Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of arabinose equivalents. Hydrolysis of arabinan (from sugar beet) was higher than hydrolysis of other natural substrates.

[00598] The concentration of arabinose equivalents was determined following enzymatic hydrolysis of arabinan (sugar beet), soluble wheat arabinoxylan (SWAX), rye arabinoxylan (RAX), oat spelt xylan (OSX) and debranched arabinan, according to the methods of Lever, M. (A new reaction for colorimetric determination carbohydrates. Anal. Biochem. 1972: 47; 273- 279). 1.5 mL microcentrifuge tubes were "zeroed" in an analytical balance. Next, 2 ± 0.1 mg arabinan (sugar beet), SWAX, RAX, OSX and debranched arabinan were added to each tube, and the mass measured and recorded. The volumes needed to be added to each tube were calculated based on the mass. Sodium citrate reaction buffer and enzymes were added to each tube beginning with the reaction buffer. The tubes were incubated with constant mixing in a Thermomixer R (Eppendorf) at 75°C for 16 h. The tubes were centrifuged at 10,000 rpm for 5 min at 4°C. 50 \L of sample supernatant was transferred to a clean 1.5 mL centrifuge tube for the /?HBAH assay. 1 mL of a stock solution of arabinose was made at a concentration of 100 mM in sodium citrate buffer, and then serial dilutions were made in sodium citrate buffer to the following concentrations (50 mM, 25 mM, 12.5 mM and 6.25 mM). 50 mg of /?HBAH was dissolved in 50 mL of ice-cold citrate/NaOH solution for a final concentration of 0.1% (w/v), and the solution was kept on ice. 150 of /?HBAH solution was added to 50 μΙ_, of the sample and arabinose standard solutions, and the tubes were incubated at 100°C for 10 min. The tubes were incubated at room temperature for 5 min. The wavelength at 410 nm was measured for the standards and samples. The A4 1 o nm and arabinose concentrations were plotted against each other, and linear regression was used to fit a line to the data. The correlation coefficient (R ) value was between 0.98 and 1.0. The equation from the standard curve was used to calculate the concentrations of reducing ends in the samples based upon their absorbances.

[00599] Figure 6C shows the enzymatic activity of Cbl 172 on natural substrates using HPLC analysis. Five different hemicellulosic substrates were tested: arabinan (sugar beet), soluble wheat arabinoxylan (SWAX), rye arabinoxylan (RAX), oat spelt xylan (OSX) and debranched arabinan. In each case, in the presence of Cbl 172, arabinose was released. In the absence of Cbl 172, only minor amount of arabinose was observed for debranched arabinan; no products of hydrolysis were released for other natural polysaccharides. The results showed that this enzyme releases arabinose from complex substrates (arabinan, SWAX, RAX, OSX and debranched arabinan).

[00600] Figure 6E shows the thermostability of Cbl 172. Cbl 172 has 57%, 45%, 35% and 22% activity after incubation at 70 °C, 75 °C, 80 °C and 85 °C for 24 h, respectively. Fifty nM Cbl 172 was kept at different temperatures (70 °C, 75 °C, 80 °C, 85 °C and 90 °C). The samples were taken out at the following time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately applied to enzyme activity measurement. The enzyme activity was measured at 85 °C using Cary 300 UV-Vis spectrophotometer (Varian). One hundred μΐ 1.25 mM pNP-CC-L- arabinofuranoside substrate was kept at 85 °C for three minutes to equilibrate. Then twenty five μΐ of the protein sample was added to the substrate and mixed by pipetting up and down for several times. The optical density at 400 nm was recorded by the spectrophotometer for 2.5 minutes. And the initial velocity of reaction in the first minute was calculated. The initial velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the initial velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the initial velocity of reaction at time 0, then multiplied by 100, respectively.

[00601] Figure 7 shows the kinetic data of Cbl 172 on hydrolysis of pNP-CC-L- arabinofuranoside. The K m , k ca t, an d k c K m are indicated as well. In part (A), the experiment was conducted at 90°C; in part (B), the experiment was conducted at 75°C. One hundred μΐ pNP-CC-L- arabinofuranoside substrate of different concentrations was kept at 85 °C for three minutes to equilibrate. Then twenty five μΐ of the protein sample (fifty nM) was added to the substrate and mixed by pipetting up and down for several times. The optical density at 400 nm was recorded by a Cary 300 UV- Visible spectrophotometer for 2.5 minutes. The initial velocity of reaction in the first minute was calculated. The initial velocities were then plotted against the concentrations of pNP-CC-L-arabinofuranoside. The K m and fc cat were calculated by non-linear fit using the Graphpad software.

Example 4: a-glucuronidase Cb909 (SEQ ID NOs: 19 and 20)

[00602] An a-glucuronidase, Cb909, was identified in Caldicellulosiruptor bescii. The cc- glucuronidase cleaves the cc-l,2-glycosidic bond between 4-O-methyl-D-glucuronic acid and the β-1,4- xylosidic linkage backbone of xylan.

[00603] The Cb909 gene was amplified by PCR using iProoF M High-Fidelity DNA

Polymerase (Bio-Rad) and subcloned into pET46 Ek/LIC vector using Ek/LIC Cloning Kits (Novagen). The forward (For) and reverse (Rev) primer sequences are below:

CB909For

5'- GAC GAC GAC AAG ATG ATT TTA TCA AGG AGC AGT AAC -3' (SEQ ID NO: 138 ) CB909Rev

5'- GAG GAG AAG CCC GGT TAC GGA TAT ATT AGT CTT C -3' (SEQ ID NO: 139 )

[00604] The PCR mixture and the amplification procedure appear below: PCR mixture

2 υ/μί iProoF M High-Fidelity DNA

Polymerase 0.5

Genomic DNA 1

50 μΜ Fw Primer 0.5

50 μΜ Rv Primer 0.5

10 mM dNTP Mixture 1

5 x iProof HF Buffer 10

dH 2 Q 3^5

Total 50

PCR Protocol

Denature 98°C 30 sec

Denature 98°C 10 sec

Anneal 62°C 30 sec 35 Cycles

Elongate 72°C 2 min

Elongate 72°C 10 min

Final 4°C oo

[00605] After the PCR amplification described above, the amplification of Cb909 gene was confirmed by 1% agarose gel electrophoresis. T4 DNA polymerase (Novagen) was then added to the purified PCR product to generate compatible overhangs.

T4 DNA polymerase treatment Incubation

2.5 υ/μί T4 DNA Polymerase 0.2 22°C 30 min

Purified PCR Product 0.5 75°C 20 min

25 mM dATP 1 4°C oo

100 mM DTT 0.5

lOx T4 DNA Polymerase Buffer 1

dH 2 0 6.8

Total

After the reaction, the following annealing reaction was prepared with pET46 Annealing Incubation pET46 Ek/LIC vector 0.5 22°C 5 min

Reaction Mixture 1

Total 1.5

[00607] After the incubation, EDTA was added to terminate the annealing reaction.

Termination reaction Incubation

25 mM EDTA 0.5 22°C 5 min

pET46 Ek/LIC vector 0.5

Reaction Mixture 1

Total 2 μL·

[00608] The annealing mixture for Cb909-pET46 Ek/LIC was used to transform E. coli JM109 by electroporation and the cells were plated on LB-ampicillin plates. After overnight incubation at 37° C, three colonies were selected and each was used to inoculate 10 mL cultures of LB-ampicillin. The cultures were grown at 37 °C with vigorous aeration for 16 hours and plasmid minipreps were made of each cell culture. The individual plasmid preparations were then electrophoresed on a 1% agarose gel to confirm the size of plasmid/insert DNA. Next, the integrity of the gene was confirmed by nucleotide sequencing.

[00609] The Cb909 (cc-glucuronidase) amino acid sequence is disclosed in SEQ ID NO: 19. The nucleotide sequence encoding Cb909 is disclosed in SEQ ID NO: 20.

[00610] For protein expression, Cb909 was cloned into the plasmid pET46Ek/LIC. The amino acid sequence of Cb909-pET46 Ek/LIC is SEQ ID NO: 24. Amino acid numbers 1-15 of SEQ ID NO: 24 are from the pET46 Ek/LIC plasmid, and include a sequence of six histidines to facilitate protein purification. The nucleotide sequence encoding SEQ ID NO: 24 is disclosed in SEQ ID NO: 23. Nucleotide numbers 1-45 of SEQ ID NO: 23 are from the pET46 Ek/LIC plasmid. [00611] Figure 8A shows putative domain architecture of Cb909. Figure 8B show SDS- PAGE of purified Cb909.

[00612] Figure 8C shows the activity of Cb909. The substrate is aldouronie acids, that is a mixture of xylo-oligosaccharides decorated with 4-O-methyl-D-glucuronosyl (MeGlcA). After incubation with Cb909 at 75 °C for 60 minutes, MeGlcA group was cleaved by Cb909 from aldouronie acids to release undecorated xylose, xylobiose, xylotriose and xylotetraose as products. The condition of the reaction was as follows: 6nM Cb909, 50mM Phosphate buffer pH 6.0, 150mM NaCl, lmg/ml aldouronie acids.

[00613] Figure 8D shows the results of pH optimization assay for Cb909. The maximum activity was detected at pH 5.5. This assay was carried out as follows: lmg/ml aldouronie acids solution was incubated with 6nM Cb909 for 10 minutes at 75 °C at each pH. 50mM citrate buffer containing 150mM NaCl was used in the range from pH 5 to pH 6. 50mM phosphate buffer containing 150mM NaCl was used in the range of pH 6 to pH 7. After the reaction, the temperature was quickly increased to 100 °C to terminate the reaction. The amounts of products were detected by HPLC.

[00614] Figure 8E shows the results of optimum temperature assay. The maximum activity of Cb909 was detected at 75 °C (xylobiose and xylotriose). Xylose was produced most efficiently at 70 °C but the amounts of produced xylose at 70 °C and 75 °C were almost the same. This assay was carried out as follows: lmg/ml aldouronie acids solution was incubated with 6nM Cb909 for 10 minutes in 50mM citrate buffer pH 5.5 that contained 150mM NaCl. After the reaction the temperature was quickly increased to 100 °C to terminate the reaction. The amounts of products were detected by HPLC.

Example 5: β-xylosidase Cb2487 (SEQ ID NOs: 27 and 28)

[00615] Another enzyme in the enzyme cocktail is a β-xylosidase that was amplified from a Caldicellulosiruptor bescii, Cb2487. [00616] The Cb2487 gene was amplified by PCR using iProof™ High-Fidelity DNA

Polymerase (Bio-Rad) and subcloned into pET46 Ek/LIC vector using Ek/LIC Cloning Kits (Novagen). The forward (For) and reverse (Rev) primer sequences are below:

CB2487For

5 ' -GACGACGACAAGATGTCAATTGAAAAAAGGGTAAAC-3 ' (SEQ ID NO: 140 ) CB2487Rev

5 ' -GAGGAGAAGCCCGGTTATTC ACACCATGC A-3 ' (SEQ ID NO: 141 )

[00617] The PCR mixture and the amplification procedure appear below:

PCR mixture

2 υ/μί iProoF M High-Fidelity DNA

Polymerase 0.5

Genomic DNA 1

50 μΜ Fw Primer 0.5

50 μΜ Rv Primer 0.5

10 mM dNTP Mixture 1

5 x iProof HF Buffer 10

dH 2 Q 36^

Total 50

PCR Protocol

Denature 98°C 30 sec

Denature 98°C 10 sec

Anneal 62°C 30 sec 35 Cycles

Elongate 72°C 2 min

Elongate 72°C 10 min

Final 4°C oo

[00618] After the PCR amplification described above, the amplification of Cb2487 gene was confirmed by 1% agarose gel electrophoresis. T4 DNA polymerase (Novagen) was then added to the purified PCR product to generate compatible overhangs.

T4 DNA polymerase treatment Incubation 2.5 υ/μί T4 DNA Polymerase 0.2 22°C 30 min

Purified PCR Product 0.5 75°C 20 min

25 mM dATP 1 4°C oo

100 mM DTT 0.5

lOx T4 DNA Polymerase Buffer 1

dH 2 0 6.8

Total

After the reaction, the following annealing reaction was prepared with pET46

Annealing Incubation pET46 Ek/LIC vector 0.5 22°C 5 min

Reaction Mixture 1

Total 1.5

[00620] After the incubation, EDTA was added to terminate the reaction.

Termination reaction Incubation

25 mM EDTA 0.5 22°C 5 min

pET46 Ek/LIC vector 0.5

Reaction Mixture 1

Total 2 μL·

[00621] The annealing mixtures for Cb2487-pET46 Ek/LIC was transformed into E. coli JM109 by electroporation and the cells were plated on LB-ampicillin plates. After overnight incubation at 37° C, three colonies were selected and each was used to inoculate 10 mL cultures of LB-ampicillin. The cultures were grown at 37 °C with vigorous aeration for 16 hours and plasmid minipreps were made from each cell culture. The individual plasmid preparations were then electrophoresed on a 1% agarose gel to confirm the size of plasmid/insert DNA. Next, the integrity of the gene was confirmed by nucleotide sequencing. [00622] The Cb2487 (β-xylosidase) amino acid sequence is disclosed in SEQ ID NO: 27. The nucleotide sequence encoding Cb2487 is disclosed in SEQ ID NO: 28.

[00623] For protein expression, Cb2487 was cloned into the plasmid pET46 Ek/LIC. The amino acid sequence of Cb2487-pET46 Ek/LIC is SEQ ID NO: 30. Amino acid numbers 1-15 of SEQ ID NO: 30 are from the pET46 Ek/LIC plasmid, and include a sequence of six histidines to facilitate protein purification. The nucleotide sequence encoding SEQ ID NO: 30 is disclosed in SEQ ID NO: 29. Nucleotide numbers 1-45 of SEQ ID NO: 29 are from the pET46 Ek/LIC plasmid.

[00624] Figure 9A shows putative domain architecture of Cb2487. Figure 9B shows SDS- PAGE of purified Cb2487. Figure 9C shows biochemical assay to determine the optimum pH of Cb2487. Figure 9D shows biochemical assay to determine the optimum temperature of Cb2487. Figure 9E shows the kinetic parameter of Cb2487 with /?NP-P-D-xylopyranoside as substrate. Figure 9F shows xylo-oligosaccharides hydrolysis products analysis through thin layer chromatography (TLC). Figure 9G shows thermostability assay for Cb2487. Figure 9H shows synergism of β-xylosidase (Cb2487) and a-glucuronidase (Cb909).

[00625] Figure 9 A shows putative domain architecture of Cb2487. The putative conserved domains of Cb2487 were analyzed through the NCBI Conserved Domains Database search tool.

[00626] Figure 9B shows SDS-PAGE of purified Cb2487. The lane next to MW shows the protein molecular mass marker. The lane Cb2487 shows the purified protein.

Purification of Cb2487

[00627] For Cb2487 purification, the cell pellet was re-suspended in binding buffer (50 mM Tris-HCl, 300 mM NaCl, pH 7.5), then lysed by passing through an EmulsiFlex C-3 cell homogenizer. The lysate was centrifuged at 20,000 x g for 20 min at 4°C to remove cell debris. The supernatant was incubated at 75°C for 30 min and centrifuged at 20,000 x g for 15 min at 4°C to remove heat labile proteins. The supernatant after heating was purified by Talon Metal Affinity Resin pre-equilibrated with binding buffer and incubated for 1 h at 4°C. The resin was washed with 50 column volumes of binding buffer, then eluted with 10 column volumes of elution buffer (50 mM Tris-HCl, 300 mM NaCl, 250 mM Imidazole, pH 7.5). The elution fractions were pooled and concentrated with Amicon Ultra- 15 centrifugal filter units (50,000 MMCO), and exchanged into Tris-HCl buffer (20 mM, pH 7.5) by three successive

concentration and dilution cycles, then purified with Hitrap Q HP column. The elution fractions were pooled and concentrated with Amicon Ultra- 15 centrifugal filter units (50,000 MMCO), and exchanged into Tris-HCl buffer (50 mM, pH 7.5, 300 mM NaCl). The proteins were then purified with a Superdex™ 200 Hiload™ 16/60 size exclusion column using an AKTAxpress system equipped with a UV detector.

[00628] Figure 9C shows a biochemical assay to determine the optimum pH of Cb2487. For the pH optimum assay, para-nitrophenyl-beta-D-xylopyranoside (pNP-X, 0.8 mM) was incubated with Cb2487 concentration (10 nM) at 75°C in different buffer: pH 4.0-6.0 (citrate buffer, 50 mM, 150 mM NaCl), pH6.0-8.0 (phosphate buffer, 50 mM, 150 mM NaCl), pH 8.5- 9.0 (Tris-HCl, 50 mM, 150 mM NaCl).

[00629] Figure 9D shows a biochemical assay to determine the optimum temperature of Cb2487. For temperature optimum assay, pNP-X (0.8 mM) was incubated with Cb2487 (10 nM) in citrate buffer (50 mM, pH 6.0, 150 mM NaCl) at different temperatures (40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 °C).

[00630] Figure 9E shows a determination of kinetic parameters for Cb2487 with /?ΝΡ-β-ϋ- xylopyranose as substrate. For the left side panel, the kinetic parameters were determined at 90 °C, pH 6.0. For the right side panel, the kinetic parameters were determined at 75 °C, pH 6.0. For these assays, different concentrations of /?NP-X (0.08-24 mM) were incubated with Cb2487 (10 nM) in citrate buffer (50 mM, pH 6.0, 150 mM NaCl) at 75 and 90°C, respectively.

[00631] Figure 9F shows hydrolytic activity of Cb2487 on xylo-oligosaccharides. Cb2487 (0.5 μΜ) was incubated with different xylo-oligosaccharides (X 2 -6 ) at 75 °C for 15 hr and then the products were separated by TLC.

[00632] Figure 9G shows a thermostability assay for Cb2487. Cb2487 was incubated in citrate buffer (pH 6.0, 50 mM) at different temperatures (70, 75, 80, 85, 90, and 95°C) without substrate addition, the protein was taken at different times (0, 10 min, 30 min, 1 h, 3 h, 4, 8 h, 12 h, 24 h) and the residual activity was assayed with /?NP-X as substrate.

[00633] Figure 9H shows synergism of β-xylosidase (Cb2487) & a-glucuronidase (Cb909). Aldouronic acids were incubated with Cb2487 (0.5 μΜ), Cb909 (0.5 μΜ) in citrate buffer (pH 6.0), 75 °C overnight, then assayed with HPLC. Adding Cb909 cleaved off the methylglucuronic acid decorations in aldouronic acids to release xylose and xylo-oligosaccharides. Adding Cb2487 cleaved available beta-l,4-xylosidic linkages to release more xylose. Mixing the two enzymes led to the conversion of the xylo-oligosaccharides released by Cb909 to xylose by Cb2487.

Example 6: Acetyl Xylan Esterase Cbl62 (SEQ ID NOs: 33 and 34)

[00634] An acetyl xylan esterase, Cbl62, was identified in Caldicellulosiruptor bescii. The enzyme is the gene product of Cbl62, where Cb stands for C. bescii. The acetyl xylan esterase cleaves the linkages between xylose and the side chain of acetyl groups in hemicellulose to provide more accessibility to other hemicellulases such as xylanase and beta-xylosidase to the backbone of xylan. The Cbl62 protein is 321 amino acids long and has a predicted molecular mass of 38.7 kDa (His-tag + Cbl62 protein). The protein has a single domain of acetyl xylan esterase (Figure 10A).

Cloning of Cb!62

[00635] The gene for Cbl62 was amplified from Caldicellulosiruptor bescii genomic DNA by PCR using PrimeSTAR HS DNA polymerase (TaKaRa). The Cbl62 gene was amplified using the following primer set:

Cbl62-Fw

5'- GACGACGACAAGATGGTTTTTGAAATGCCACTTGAAAAG -3' (SEQ ID NO: 142 )

Cbl62-Rv

5'-

GAGGAGAAGCCCGGTTATTTTATCATCTCCATAAGATACATAAATATCTTGTC

-3' (SEQ ID NO: 143 ) [00636] The polymerase chain reaction mixture contained the following: PCR reaction

2.5 U/mL PrimeSTAR DNA polymerase 0.5

19 ng/mL C. bescii gDNA 1

10 mM Fw Primer 1

10 mM Rv Primer 1

2.5 mM dNTP Mixture 4

5x PrimeSTAR Buffer 10

dH 2 Q 32.5

Total 50 μΐ.

[00637] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 98°C 10 sec

Annealing 55°C 5 sec 30 cycles

Elongation 72°C 60 sec

Last 4°C oo

[00638] The Ek/LIC cloning kit was utilized (Novagen). Both ends of the amplified gene fragment were digested, in the presence of dATP, with the 3' to 5' exonuclease activity of T4 DNA polymerase. The resultant fragment was annealed to the pET-46 Ek/LIC vector.

[00639] The ligation mixtures for Cbl62-pET46 were introduced into E. coli JM109 by heat shock method and the cells were plated on LB-ampicillin. After overnight incubation at 37°C, four colonies were selected and used to inoculate, individually, 10 mL of LB-ampicillin. The cultures were grown at 37°C with vigorous aeration for 16 hours, and minipreps were made of the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to check the size of the plasmid DNA. For gene expression, one of the plasmids was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with chloramphenicol and ampicillin at 100 μg/ml and 50 μg/ml and incubated at 37°C overnight. Five to six colonies were inoculated into 3 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 4 hours. One mL of the culture was added to 500 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37°C until the absorbance at 600 nm reached -0.25. The inducer, IPTG, was then added at 0.1 mM final concentration, and the culturing continued at 16° C overnight.

Protein purification

[00640] Cultures were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (50 mM Tris-HCL pH 7.5, 20 mM imidazole and 300 mM of NaCl). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a nickel-charged resin (GE Healthcare) and washed several times to remove the unbound proteins. The bound protein (6-Histidine-tagged Cbl62) was then eluted from the resin with an elution buffer composed of the lysis buffer supplemented with 250 mM imidazole. The eluted protein was further purified by passing through Hiload 16/20 prepgrade gel-filtration column (GE Healthcare) under the 50 mM Na 2 HP0 4 -HCl pH 6.5 and lOO mM NaCl buffer.

[00641] The Cbl62 (acetyl xylan esterase) amino acid sequence is disclosed in SEQ ID NO: 33. The nucleotide sequence encoding Cbl62 is disclosed in SEQ ID NO: 34.

[00642] For protein expression, Cbl62 was cloned into the plasmid pET46 Ek/LIC. The amino acid sequence of Cbl62-pET46 Ek/LIC is SEQ ID NO: 36. Amino acid numbers 1-15 of SEQ ID NO: 36 are from the pET46 Ek/LIC plasmid, and include a sequence of six histidines to facilitate protein purification. The nucleotide sequence encoding SEQ ID NO: 36 is disclosed in SEQ ID NO: 35. Nucleotide numbers 1-45 of SEQ ID NO: 35 are from the pET46 Ek/LIC plasmid.

[00643] Figure 10A shows the domain structure of Cbl62; the protein has an acetyl xylan esterase domain.

[00644] The Cbl62 gene was expressed in E. coli cells, and the protein was purified in two steps, making use of the 6-histidines encoded by the plasmid. Figure 10B shows an SDS-PAGE of purified Cbl62. The molecular markers are in the lane next to the purified Cbl62. [00645] Figure IOC shows the enzymatic activity of Cbl62 at different pHs using para- nitrophenol adducted acetate (pNP-acetate) as a substrate. The released /?NP was monitored continuously at an absorbance of 400 nm using Synergy 2 Microplate reader (BioTek). The initial rate of hydrolysis was adopted as an enzyme activity. The figure shows the pH profile of Cbl62 on /?NP-acetate. The pH effect on the Cbl62 was examined at 50°C in the presence of 50 mM citrate-NaOH (pH 4.0 to 6.0), 50 mM Na 2 HP0 4 -HCl (pH 6.0 to 8.0), with 150 mM NaCl, respectively. 0.1 μΜ of purified Cbl62 and 2 mM /?NP-acetate were used for this assay.

[00646] Figure 10D shows the temperature profile of Cbl62 on /?NP-acetate. The temperature profile was performed in 50 mM Na 2 HP0 4 -HCl, pH 7.0, and 150 mM NaCl, at temperatures between 40 °C and 75 °C with 5 °C increments. 0.04 μΜ of purified Cbl62 and 2 mM /?NP- acetate were used for this assay.

[00647] Figure 10E shows the thermostability profile of Cbl62 on /?NP-acetate. 0.02 μΜ of purified CM 62 in 50 mM Na 2 HP0 4 -HCl, pH 7.0, and 150 mM NaCl was incubated for 0 to 24 hours at temperatures between 60 °C and 80 °C with 5 °C intervals, and the residual activities were measured.

[00648] Figure 10F shows the kinetic study of Cbl62. 0.04 μΜ of purified Cbl62 in 50 mM Na 2 HP0 4 -HCl, pH 6.0, and 150 mM NaCl was incubated with a various concentration of /?NP- acetate, and the initial rate of hydrolysis was plotted on the graph. The kinetic parameters were determined by Michaelis-Menten equation utilizing Graph Pad Prism v5.01 (GraphPad

Software).

Example 7: Hydrolysis of polysaccharides with enzyme cocktails of Caldicellulosiruptor bescii hemicellulases containing a single type of endoxylanase

[00649] Mixtures of one or more of the enzymes endoxylanase (Cbl93), cc- arabinofuranosidase (CM 172), β-xylosidase (Cb2487), cc-glucuronidase (Cb909), and acetyl xylan esterase (Cbl62) were incubated with the polysaccharides soluble wheat arabinoxylan, birch wood xylan, and oat spelt xylan. For each substrate, incubation of the substrate with a cocktail containing all of the enzymes endoxylanase (Cbl93), cc-arabinofuranosidase (CM 172), β-xylosidase (Cb2487), α-glucuronidase (Cb909), and acetyl xylan esterase (Cbl62) yielded a greater release of monosaccharides from xylan than incubating the substrate with an enzyme cocktail containing less than all of the enzymes.

[00650] Figure 11 shows synergy of C. bescii hemicellulolytic enzymes on soluble wheat arabinoxylan (SWAX) hydrolysis. SWAX (8.0%, w/v) was incubated with different

hemicellulase mixes at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar [part (A)] and HPLC [part (B)] analysis. The hemicellulases applied include Cbl93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ).

[00651] Figure 12 shows synergy of C. bescii hemicellulolytic enzymes on oatspelt xylan (OSX) hydrolysis. OSX (8.0%, w/v) was incubated with different hemicellulase at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar [part (A)] and HPLC [part (B)] analysis. The hemicellulases applied include Cbl93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ).

[00652] Figure 13A shows SWAX hydrolysis with a hemicellulase cocktail at different temperatures. SWAX (8.0%, w/v) was incubated with CM93 (0.5 μΜ), Cb2487 (4 μΜ), CM 172 (0.5 μΜ), CM62 (0.5 μΜ), and Cb909 (0.5 μΜ) at 65 °C, 70 °C, 75 °C, 80 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay.

[00653] Figure 13B shows BWX hydrolysis with a hemicellulase cocktail at different temperatures. BWX (8.0%, w/v) was incubated with CM93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ) at 65 °C, 70 °C, 75 °C, 80 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay.

[00654] Figure 13C shows OSX hydrolysis with a hemicellulase cocktail at different temperatures. OSX (8.0%, w/v) was incubated with CM93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ) at 65 °C, 70 °C, 75 °C, 80 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay. Example 8: Hydrolysis of polysaccharides with enzyme cocktails of Caldicellulosiruptor bescii hemicellulases containing two types of endoxylanase

[00655] Mixtures containing the enzymes a-arabinofuranosidase (Cbl 172), β-xylosidase (Cb2487), cc-glucuronidase (Cb909), acetyl xylan esterase (Cbl62), and one or both of the endoxylanases (Cbl 93 and Cbl 95) were incubated with the polysaccharides soluble wheat arabinoxylan, birch wood xylan, and oat spelt xylan. For each substrate, incubation of the substrate with a cocktail containing both of the endoxylanases (Cbl 93 and Cbl 95) yielded a greater release of monosaccharides from xylan than incubating the substrate with an enzyme cocktail containing only one of the endoxylanases.

[00656] Figure 14A shows SWAX hydrolysis was improved by adding two xylanases (Cbl 95 and Cbl93) in the hemicellulase mixture. SWAX (8.0%, w/v) was incubated with different hemicellulase mixes at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar analysis. Different hemicellulase mixtures were applied in the hydrolysis: Mix I) Cbl95 (0.5 μΜ), Cbl 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ); Mix II) Cbl93 (0.5 μΜ), Cbl 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ); Mix III) Cbl95 (0.25 μΜ), Cbl93 (0.25 μΜ), Cbl 172 (0. 5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ).

[00657] Figure 14B shows BWX hydrolysis was improved by adding two xylanases (Cbl 95 and Cbl 93) in the hemicellulase mixture. BWX (8.0%, w/v) was incubated with different hemicellulase mixes at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar analysis. Different hemicellulase mixtures were applied in the hydrolysis: Mix I) Cbl95 (0.5 μΜ), Cbl 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ); Mix II) Cbl93 (0.5 μΜ), Cbl 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ); Mix III) Cbl95 (0.25 μΜ), Cbl93 (0.25 μΜ), Cbl 172 (0. 5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ).

[00658] Figure 14C shows OSX hydrolysis was improved by adding two xylanases (Cbl 95 and Cbl93) in the hemicellulase mixture. OSX (8.0%, w/v) was incubated with different hemicellulase mixes at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar analysis. Different hemicellulase mixtures were applied in the hydrolysis: Mix I) Cbl95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ); Mix II) Cbl93 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and CM62 (0.5 μΜ); Mix III) CM95 (0.25 μΜ), Cbl93 (0.25 μΜ), Cbl l72 (0. 5 μΜ), Cb2487 (4 μΜ), Cb909 (0.5 μΜ), and Cbl62 (0.5 μΜ).

[00659] Figure 15 shows soluble wheat arabinoxylan hydrolysis with hemicellulase cocktail of Caldicellulosiruptor bescii. Different concentrations of SWAX (1.0, 2.0, 4.0, 6.0, 8.0%, w/v) were incubated with CM93 (0.5 μΜ), CM95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), CM62 (0.5 μΜ), and Cb909 (0.5 μΜ) for 15 hr at 75 °C in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay. Part (A) shows reducing sugar in the control and hydrolysis mixtures, and part (B) shows comparison of calculated and average of actual reducing sugar in hydrolysis mixtures with different substrate concentrations.

[00660] Figure 16 shows birch wood xylan hydrolysis with hemicellulase cocktails of Caldicellulosiruptor bescii. Different concentrations of BWX (1.0, 2.0, 4.0, 6.0, 8.0%, w/v) were incubated with CM93 (0.5 μΜ), CM95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), CM62 (0.5 μΜ), and Cb909 (0.5 μΜ) at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay. Part (A) shows reducing sugar in the control and hydrolysis mixtures, and part (B) shows comparison of calculated and average of actual reducing sugar in hydrolysis mixtures with different substrate concentrations.

[00661] Figure 17 shows oat spelt xylan hydrolysis with hemicellulase cocktail of

Caldicellulosiruptor bescii. Different concentrations of OSX (1.0, 2.0, 4.0, 6.0, 8.0%, w/v) were incubated with CM93 (0.5 μΜ), CM95 (0.5 μΜ), CM 172 (0.5 μΜ), Cb2487 (4 μΜ), CM62 (0.5 μΜ), and Cb909 (0.5 μΜ) at 75 °C for 15 hr in citrate buffer (50 mM, pH 6.0, 150 mM NaCl), and subjected to reducing sugar assay. Part (A) shows reducing sugar in the control and hydrolysis mixtures, and part (B) shows comparison of calculated and average of actual reducing sugar in hydrolysis mixtures with different substrate concentrations.

[00662] Example 9: Endocellulase/mannanase Cbl952 [00663] An endocellulase/mannanase, CM952, was identified in Caldicellulosiruptor bescii. The enzyme is the gene product of CM952, wherein Cb stands for Caldicellulosiruptor bescii. The protein has a Glycoside Hydrolase (GH) family 9 catalytic domain (cellulase domain), three family 3 carbohydrate binding modules (CBMs) (one CBM3c and two CBM3b modules) and one GH5 catalytic domain (mannanase domain) (Figure 18).

[00664] A wild- type CM952 protein, lacking the signal peptide, and several truncational mutations (TMl, TM2, TM3, TM4, TM5, TM6, and TM7) were systematically constructed for functional analysis (Figure 18).

[00665] As shown in Figure 18, TMl contained the GH9 module and the three CBMs, TM2 contained the GH9 module and two CBMs, TM3 contained the GH9 module and one CBM (CBM3c), and TM4 was made up of only the GH9 module. The truncated mutant TM5 was composed of the three CBMs linked to the GH5 module, whereas TM6 and TM7 were composed of the CBM3c and CBM3b, respectively. The SDS-PAGE results in Figure 19 show that all protein constructs were successfully expressed as soluble proteins and highly purified.

Cloning of Cb 1952 wild-type

[00666] The gene for CM952 wild- type was amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using PrimeSTAR DNA Polymerase (TAKARA). The CM 952 wild-type gene was amplified using the following primer set:

[00667] Cb 1952 wild-type Forward: 5 '- GAC GAC GAC AAG ATG GCA ACA ACC TTT AACTAT GGT GAA GCT C -3' (SEQ ID NO: 39)

[00668] Cb 1952 wild-type Reverse: 5 '- GA GGA GAA GCC CGG TTA TTC AGC ACC AAT CGC ATT AGT TTT ATA CC -3' (SEQ ID NO: 40)

[00669] The polymerase chain reaction mixture contained the following:

PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4 17 ng/μΐ. Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1

50 μΜ Rv Primer 1

10 mM dNTP Mixture 1

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ.

[00670] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 5 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

Cloning of CM952TMl

[00671] The gene for CM952TM1 was amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using PrimeSTAR DNA Polymerase (TAKARA). The

CM952TM1 gene was amplified using the following primer set:

[00672] Cbl952TMlForward: 5'- GAC GAC GAC AAG ATG GCA ACA ACC TTT AAC TAT GGT GAA GCT C -3' (SEQ ID NO: 41)

[00673] Cbl952TMlReverse: 5'- GAG GAG AAG CCC GGT TAG CTA GTA TCT ATC TTC ACT ATT CCA CTG -3' (SEQ ID NO: 42)

[00674] The polymerase chain reaction mixture contained the following:

PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1 50 μΜ Rv Primer 1

10 mM dNTP Mixture 1

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ.

[00675] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 4 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

Cloning of CM952TM5

[00676] The gene for CM952TM5 was amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using PrimeSTAR DNA Polymerase (TAKARA). The

CM952TM5 gene was amplified using the following primer set:

[00677] Cb 1952TM5Forward: 5 '- GAC GAC GAC AAG ATG A AT TTC AAA GCT ATC GAA AAG CCA AC -3' (SEQ ID NO: 43)

[00678] Cb 1952TM5Reverse: 5 '- GA GGA GAA GCC CGG TTA TTC AGC ACC AAT CGC ATT AGT TTT ATA CC -3' (SEQ ID NO: 40)

[00679] The polymerase chain reaction mixture contained the following:

PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1

50 μΜ Rv Primer 1

10 mM dNTP Mixture 1 5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ.

[00680] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 4 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

[00681] After the PCR reactions described above, the amplification of CM952 wild- type, CM952TM1 and CM952TM5 gene was confirmed by 1% agarose gel electrophoresis. The DNA corresponding to the expected band on the gel was cut out and applied to a Qiagen Gel

Extraction kit to extract the DNA out of the gel.

[00682] A Novagen pET-46 Ek/LIC kit was used to treat the purified DNA and ligate it into the pET-46 Ek/LIC vector. The treatment of the purified DNA was as follows:

Unit

Reaction (μΐ) Incubation

0.1 pmol purified PCR product X

10X T4 DNA Polymerase buffer 1

25 mM dATP 1

100 mM DTT 0.5

Nuclease-free water 7.3-X

2.5 U/μΙ T4 DNA Polymerase 0.2

Total 10 22°C 30 min

[00683] After the reaction, the enzyme was inactivated by incubating at 75 °C for 20 min.

[00684] The following protocol was used to anneal the insert into the pET-46 Ek/LIC vector. Unit

Reaction (μΐ) Incubation pET-46 Ek/LIC vector 0.5

T4 DNA Polymerase treated EK/LIC insert 1

Total 1.5 22°C 5 min

Then add 0.5 μΐ 25 mM EDTA. Mix by stirring with pipet tip. Incubate at 22°C for 5 min.

[00685] The ligation mixtures for Cb 1952 wild-type, Cb 1952TM 1 - or Cb 1952TM5-pET-46 Ek/LIC were introduced into E. coli NovaBlue competent cells by chemical transformation method, and the cells were plated on LB-ampicillin. After overnight incubation at 37°C, four colonies were selected and each was used to inoculate 6 mL cultures of LB-ampicillin. The cultures were grown at 37°C with vigorous aeration for 16 hours, and minipreps (QIAGEN) were made of the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to check the size of the plasmid DNA. After confirmation that the gene has been inserted into plasmids, the genes were sequenced to confirm their identity. The plasmids with the right insertion sequences were selected for recombinant protein production.

[00686] CM952TM2, CM952TM3, CM952TM4, CM952TM6, and CM9527 were prepared through similar steps as above, with different steps as appropriate (e.g. primer sequences).

[00687] For expression of each enzyme, plasmid containing the wild type, TM1, TM2, TM3, TM4, TM5, TM6, or TM7 was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with chloramphenicol (50 μg/ml) and ampicillin (100 μg/ml) and incubated at 37 °C overnight. Five to six colonies were inoculated into 10 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 6 hours. Ten mL of the culture was added to 1000 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37°C until the absorbance at 600 nm reached -0.3. The inducer, IPTG, was then added at 0.1 mM final concentration, and the culturing continued at 16° C overnight.

Protein purification [00688] Cultures were centrifuged to collect the cell pellet. For CM952 wild- type, the pellet was then suspended in a lysis buffer (25 mM Tris-HCL pH 7.8, 750 mM of NaCl, 5% glycerol, 20 mM imidazole, 1.25% Tween-20). For CM952TM1, the pellet was then suspended in a lysis buffer (25 mM Tris-HCL pH 7.8, 100 mM of NaCl, 10% glycerol, 10 mM imidazole, 1.25% Tween-20). For the other CM952 TM mutants, the pellet was then suspended in a lysis buffer without imidazole (50 mM Tris-HCL pH 7.5, 300 mM of NaCl). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed three times to remove the unbound proteins. The bound protein was then eluted from the resin with an elution buffer (50 mM Tris-HCL, pH7.5, 250 mM imidazole).

[00689] The design of the PCR primers ensured that each of the proteins was fused to 6- histidines (N-terminal tag) encoded in the plasmid. The six histidines will bind to either a nickel- charged resin or a cobalt-charged resin. The bound protein can be displaced from the resin with a buffer containing imidazole. This method facilitates quick purification of the protein of interest. All recombinant proteins were purified by immobilized metal ion affinity

chromatography (IMAC) using talon resin (Clontech, Mountain View, CA) according to the manufacturer's instructions. For CM952 wild-type, the eluted protein was dialyzed against a protein storage buffer (50 mM Tris-HCl, 150 mM NaCl, pH7.5). The protein was heated at 75°C for 10 min and centrifuged at 16,400 rpm for 20 min to precipitate any co-eluting thermo-labile host proteins. The recombinant protein was further purified by gel filtration using an

AKTAxpress TWIN fast protein liquid chromatograph (FPLC) system equipped with a Hiload 16/60 Superdex 200 column (GE Healthcare, Piscataway, NJ). For CM952TM1, the eluted protein was dialyzed against the protein storage buffer. The protein was then heated at 75°C for 20 min and centrifuged at 16,400 rpm for 20 min. The supernatant was further purified by gel filtration as described above. For the other mutants, the recombinant proteins eluted from Talon resin were directly applied to gel filtration for purification close to homogeneity. Figure 19 shows an SDS-PAGE of purified CM 952 proteins.

Gene and protein sequences of Cbl952WT. CM952TML and CM952TM5

CM952 full-length amino acid sequence [00690] The full-length CM952 endocellulase/mannanase (EC 3.2.1.4/EC 3.2.1.78) amino acid sequence is disclosed in SEQ ID NO: 44. The signal peptide of CM952, corresponding to amino acid numbers 1-28 of SEQ ID NO: 44 was removed during all PCR amplifications. Thus, the expressed wild-type CM952 protein did not contain amino acid numbers 1-28 of SEQ ID NO: 44. The amino acid sequence of the wild- type CM952 protein without the signal peptide is disclosed in SEQ ID NO: 114.

[00691] The procedure of cloning the gene for wild- type CM952 (without the signal peptide) into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The wild- type CM952 amino acid sequence (without the signal peptide) with the short peptide is disclosed in SEQ ID NO: 51. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 51.

CM952 full-length nucleotide sequence

[00692] The full-length CM 952 nucleotide sequence is disclosed in SEQ ID NO: 45. The signal peptide of CM952, corresponding to nucleotide numbers 1-84 of SEQ ID NO: 45 was removed during all PCR amplifications. Thus, the nucleotide sequence used to express wild- type CM952 protein did not contain nucleotide numbers 1-84 of SEQ ID NO: 45. The nucleotide sequence encoding the wild-type CM952 protein without the signal peptide is disclosed in SEQ ID NO: 115.

[00693] The wild- type CM952 nucleotide sequence (without the signal peptide) with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 50. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 50.

CM952TM1 amino acid sequence

[00694] The CM952TM1 endocellulase (EC 3.2.1.4) amino acid sequence is disclosed in SEQ ID NO: 46. The procedure of cloning the gene for CM952TM1 into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The CM952TM1 amino acid sequence with the short peptide is disclosed in SEQ ID NO: 53. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 53.

CM952TM1 nucleotide sequence

[00695] The CM952TM1 nucleotide sequence is disclosed in SEQ ID NO: 47. The

CM952TM1 nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 52. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 52.

CM952TM5 amino acid sequence

[00696] The CM952TM5 amino acid sequence is disclosed in SEQ ID NO: 48. The procedure of cloning the gene for CM952TM5 into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The CM952TM5 amino acid sequence with the short peptide is disclosed in SEQ ID NO: 55. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 55.

CM952TM5 nucleotide sequence

[00697] The CM952TM5 nucleotide sequence is disclosed in SEQ ID NO: 49. The

CM952TM5 nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 54. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 54.

Enzyme Activity

Determination of optimal pH, optimal temperature, and thermostability

[00698] The optimal pH for Cb 1952 WT, TM 1 , TM2, and TM3 with PASC, as substrate, were in the range of pH5.0-5.5 and the optimal temperature for each of these proteins was 85°C. In the case of TM4, the optimal pH and temperature with PASC were 6.5 and 55°C, respectively. The thermostability assays were carried out on the wild type and truncation mutants harboring cellulase activities. At 80°C and 85°C, the residual activities of WT, TM1, and TM2 after 24 h incubation were less than 20% except TM3, which retained 61.8% activity. At 75°C, the residual activities of WT, TM1, TM2, and TM3 after 24 h incubation were 43.1%, 75.7%, 53.6%, and 101.7%, respectively. Deletion of CBM3c dramatically reduced the thermostability of the enzyme. The truncated mutant TM4 remained stable at 45°C and 50°C, but the enzyme rapidly lost its activity at temperatures above 55°C (Figure 81C). The pH and temperature optima were also determined for hydrolysis of mannan substrates. For the wild-type enzyme the optimal pH and temperature for mannan hydrolysis were 5.5-6.5 and 90°C, respectively, and for TM5 the values were 6.5 and 90°C, respectively (data not shown).

Hydrolysis of phosphoric acid swollen cellulose, cello- and manno-oligosaccharides by CM952 and its mutants

[00699] The capacity of the wild- type CM952 and its TM1 and TM5 mutants, representing the mutants that harbored the GH9 module with the 3 CBMs and the GH5 module together with 3 CBMs (Figure 18) were investigated in a time course approach for hydrolysis of PASC. As shown in the chromatograph in Figure 78, release of products, mostly cellobiose and glucose, was observed for the wild- type (A) and the TM1 (B) mutant which contains the GH9 module. Very little to no hydrolysis of PASC was detected from TM5 (C) (the construct with the GH5 module). By further testing hydrolysis of cello-oligosaccharides, it was confirmed that the β-1,4- glucose cleaving activity was present in the GH9 domain (Figure 23). On manno- oligosaccharides hydrolysis, the wild-type and TM5 showed cleavage activity of

oligosaccharides with degree of polymerization (DP) of 3 and above (Figure 24). Interestingly, TM1 also showed activity on substrates of DP of 5 or higher, albeit the activity was lower than the wild- type enzyme and the TM5 mutant (Figure 24). No transglycosylation activities were found for the wild-type, TM1, and TM5 on glucose, cello-oligosaccharides, mannose, and manno-oligosaccharides.

Activities and kinetic parameters of CM952 and its mutants on cellulosic substrates

Specific activities were determined for the wild-type protein and each of the mutants with Avicel, a model crystalline cellulose, and filter paper, as substrates. On Avicel, deletion of the individual CBMs led to a decrease in specific activity of the truncated mutant (TM1, TM2, and TM3) (Table 1). The truncated mutant with either two or one of the CBM3b (TM1 and TM2, respectively) only showed a slight decrease in specific activity compared with the WT enzyme. In contrast, deleting the two CBM3b's led to a protein with less than half the specific activity of the WT protein on Avicel. A similar trend was observed for specific activity on filter paper as substrate, although the decreases in activity were less pronounced (Table 1). On both substrates, a construct made up of the GH9 catalytic module alone had only 3.8% and 16.2% of the activities observed for the WT protein on Avicel and filter paper, respectively.

TABLE 1. Specific activities and kinetic parameters of Cb 1952 wild-type, its truncation mutants, and the mutants of TM3 on cellulose substrates "

Avicel Filter paper PASC

(μιηοΐ (μιηοΐ

Protein sugar/min/μ sugar/min/μ ^cat ^cat IK m

mol mol (s- 1 ) (mg/ml) (s _1 ml/mg) protein) protein)

WT 10.15+0.51 16.12+2.86 2. .58+0. ,15 0.36+0.10 7.16

TM1 8.53+1.47 17.27+2.06 2. .12+0. ,13 0.14+0.07 15.14

TM2 8.94+0.89 14.31+3.13 2. ,16+0. ,18 0.19+0.10 11.37

TM3 4.47+0.81 12.87+1.44 3. ,09+0. ,30 0.65+0.24 4.75

TM3G208

3.68+0.69 13.74+1.80 7, .92+0 .78 1.71+0.45 4.63 WG

TM3G208

4.86+0.49 14.61+3.41 6 .36+0 .74 1.35+0.46 4.71

W

TM3T298F 5.53+0.53 15.14+1.71 8 .53+0 .67 2.17+0.42 3.93

TM4 0.39+0.02 2.62+0.56 0. ,08+0. ,01 3.73+0.81 0.02

a: The reactions were carried out at 75°C except that for TM4, which was done at 45°C.

b: PASC: phosphoric acid swollen cellulose.

[00700] The phosphoric acid swollen cellulose, derived from Avicel, was used to examine the kinetic parameters of the WT protein and its mutants (Table 1). The estimated fc cat for the WT (2.58 s "1 ) and its truncated mutants (2.12 - 3.09 s "1 ) was very modest. Interestingly TM1 exhibited a catalytic efficiency twice higher than that of the wild type, suggesting that the catalytic activities of the GH9 and GH5 modules are functionally coupled. Similar functional coupling of different catalytic modules within a single polypeptide was proposed for another plant cell wall degrading enzyme Prevotella ruminicola XynlOD-FaelA (9), a two-domain arginine kinase from the deep-sea clam Calyptogena kaikoi (40), and a flagellar creatine kinase from Chaetopterus variopedatus (13). The kinetic parameters of TM4, the protein with only the GH9 catalytic module were very poor compared to the proteins linked to the CBMs, alluding to the importance of these auxiliary modules to the function of CM952.

Activities and kinetic parameters of CM952 and its mutants on mannan-like substrates

[00701] The enzymatic activities of CM952 and its mutants on mannan-like substrates were also investigated. The substrates tested were locust bean gum, guar gum, and konjac

glucomannan. The wild type enzyme exhibited very high k C3t on all tested mannose based substrates. On locust bean gum, konjac glucomannan, and guar gum, the k C3t values were 1420 s " \ 1068 s "1 , and 696 s "1 , respectively (Table 2). Based on the data in Table 2, the catalytic activity for degradation of mannan and mannose-configured substrates is located in the GH5 module. It was observed that cleaving the GH9 module from the polypeptide to create the TM5 mutant increased the k C3t of this mutant, compared to the wild-type, by 2.4-, 2.8-, and 1.6-fold for locust bean gum, guar gum, and konjac glucomannan, respectively. Note that the standard error was quite high for the fc cat for guar gum. A corresponding increase in the K m of TM5 on each mannose-configured substrate led to catalytic efficiencies that were lower than those determined for the wild- type protein (Table 2). The truncated mutants containing the GH9 catalytic module in addition to either all three CBMs (TM1) or only the CBM3c (TM3) were almost devoid of activity on both locust bean gum and guar gum. These mutants, however, exhibited very high activity on konjac glucomannan.

Docket No. 658012000940

TABLE 2. Kinetic parameters of Cbl952 wild-type, its truncation mutants, and the mutants of TM3 on mannan substrates and konjac glucomannan a

Locust bean gum Guar gum Konjac glucomannan

1420+15 0.62+0.27 2290 696+56.7 2.26+0.42 1.84+1.0

WT 308 1068+271 581

8 3

0.23+0.0 1.85+0.3

TM1 3.89+0.41 5.9xl0 "2 n.d n.d n.d 907+50.7 490

1 0

0.15+0.0 (1.03+0.17) 1.30+0.4

TM3 4.36+2.82 3.5xl0 "2 0.94+0.50 1.10 xlO "2 611+68.9 470

6 xlO "2 3

TM3G208W 2.31+0.1 2.37+0.4

1.93+0.31 1.2 1.03+0.35 9.28+4.36 1.11 xlO "1 1614+143 681 G 5 9

0.12+0.0 (1.01+0.01) 1.80+0.6

TM3G208W 3.33+1.49 3.7xl0 "2 0.50+0.20 2.01 xlO "2 1119+160 621

3 xlO "2 8

1.12+0.5 12.58+7.9 (8.92+1.98) 2.61+0.4

TM3T298F 8.9xl0 "2 3.62+1.53 2.47 xlO "2 1102+77.4 422

5 4 xlO "2 3

3446+36 3.72+0.4

TM5 1.82+0.48 1893 1940+570 11.98+4.69 162 1710+119 460

7 8

: Konjac glucomannan is a polysaccharide with mixed linkage of glucose and mannose.

Site-directed mutagenesis

[00702] The architectural diversity of GH9 modules have been assigned to four different groups known as theme A, B, C, and D (19). In CM952, the GH9 catalytic module is linked to an accessory CBM3c at its C-terminus, and this is the architecture of the members of theme Bl. In theme Bl, there are both processive endoglucanases (7, 12, 34) and non-processive endoglucanases (2, 10). The distribution of reducing ends in the soluble and insoluble fractions of cellulase-hydrolyzed filter paper is commonly used to estimate the processivity of a cellulase (17). Our results, based on such an experiment, determined that CM952 and its truncation mutants (TM1, TM2, TM3, and TM4) do not harbor a processive GH9 catalytic module since their end products contained 40 -50 insoluble reducing ends (Table 3).

[00703] Table 3: Distribution of reducing sugars in soluble and insoluble fractions of filter paper hydrolyzed by Cbl952 wild-type, its truncation mutants, and the mutants of TM3 a

Reducin g sugar ( ) Ratio

Soluble Insoluble (Sol./Insol. b

Protein

(mM) (mM) Soluble Insoluble Reducing sugar)

WT C 1.32+0.07 1.32+0.03 50.0 50.0 1.00

TM1 C 2.02+0.04 1.38+0.08 59.4 40.6 1.46

TM2 C 2.13+0.11 1.42+0.06 60.0 40.0 1.50

TM3 C 1.98+0.08 1.56+0.12 55.9 44.1 1.27

TM4 d 3.18+0.15 2.72+0.40 53.9 46.1 1.17

TM3G208WG C 1.80+0.12 1.44+0.13 55.6 44.4 1.25

TM3G208W C 1.74+0.08 1.57+0.11 52.6 47.4 1.11

TM3T298F C 1.87+0.10 1.52+0.12 55.2 44.8 1.23

The reactions were carried out at 75°C for 16 h for all enzymes except TM4, which was carried out at 45°C.

b: Sol./Insol.: soluble versus insoluble.

c: Enzyme concentration was 0.5 μΜ.

d: Enzyme concentration was 10 μΜ.

[00704] An amino acid sequence alignment of the GH9 domain of CM952 with those of Clostridium cellulolyticum Cel9G (a non-processive endoglucanase) and Thermobifida fusca Cel9A (a processive endoglucanase) was examined. The C. cellulolyticum and T. fusca proteins represent two types of family 9 theme Bl endoglucanases with enzyme-cello-oligosaccharides

- 198 - co-crystal structures solved (26, 34). The amino acid sequence alignment showed that most of the residues involved in cellulose substrate binding are well conserved in the GH9 module of CM952 (Figure 79). However, neither of two aromatic residues (Trp-209 in T. fusca and Phe- 308 in C. cellulolyticum) responsible for hydrophobic stacking at subsite -3, is present in CM952 (Figure 79). As aromatic residues involved in hydrophobic stacking interactions with the substrates contribute to the processivity of the enzyme during hydrolysis of crystalline substrate (15, 47), we mutated the corresponding amino acid residue in CM952TM3 to an aromatic residue by changing Gly-208 to Trp-208 or by inserting a tryptophan before Gly-208 to obtain a TM3G208W and a TM3G208WG mutant, respectively. These mutants mimicked the T. fusca enzyme. In addition, T-298 was also changed to Phe-298 to obtain TM3T298F mutant, which mimicked the C. cellulolyticum enzyme.

[00705] The secondary structures of the three mutants did not show any gross differences compared to CM952TM3 as revealed by circular dichroism (CD) scans (Table 4), suggesting that the mutations did not result in gross changes in the secondary structural elements of the proteins compared to C 952TM3. Compared to parental protein (TM3), the specific activities of the three mutants on Avicel and filter paper were not different (Table 1). The mutations also did not aid us in modifying TM3 into a processive endoglucanase, as the ratio of soluble versus insoluble reducing ends remained unchanged (Table 3). The k C3t values of the mutants with PASC as substrate increased by about 2-fold. However, the K m values also increased leading to catalytic efficiencies (k ca t/K m ) that were similar to that of CM952TM3 (Table 1).

[00706] Table 4: CD spectroscopy analysis of CbCelB/Man5ATM3 and its mutants

Unordered

Protein cc-helix (%) -sheet (%) Turn (%)

( )

TM3 35.0+1.7 24.0+1.0 16.0+1.0 25.0+1.0

TM3G208WG 36.0+1.0 22.7+2.0 16.3+1.1 25.3+0.6

TM3G208W 35.7+1.5 23.3+1.1 16.3+1.1 25.3+0.6

TM3T298F 32.7+0.6 23.7+0.6 17.3+0.6 26.7+0.6

[00707] Unexpectedly, the k cat values of TM3G208WG with locust bean gum and guar gum, as substrates, were increased 15- and 100-fold compared with the values determined for TM3 (Table 2). Moreover, the catalytic efficiencies of this mutant for locust bean gum and guar gum also increased by 34-fold and 10-fold, respectively, (Table 2). The site-directed mutagenesis of the TM3 truncated mutant also increased its k cat on konjac glucomannan by two-fold or higher (Table 2).

Binding of CM952 to insoluble cellulose substrates

[00708] The CM952 wild-type, TM1, and TM5, which harbored all three CBMs (one CBM3c and two CBM3b) bound tightly to Avicel (Figure 80A) and PASC (Figure 80B). The truncated mutant TM2, which harbored the CBM3c and one CBM3b, also bound tightly to the two cellulosic substrates. The binding of TM3, which was composed of the GH9 module and the CBM3c, to the insoluble cellulose was weaker than those for wild- type, TM1, TM2, and TM5 (Figure 80A, B). Depletion binding isotherms were used to estimate the dissociation constant and maximal binding capacity of TM3 to Avicel as 0.52+0.20 M "1 and 423.9+50.7 nmol protein/g Avicel, respectively. The two components of TM3, i.e., the GH9 module and CBM3c, were observed to weakly bind to insoluble cellulose (Figure 80A, B). The binding of the CBM3c of CbCel9AMan5B (TM6) to insoluble cellulose was unexpected since this binding was not observed for other CBM3c characterized by this method (7, 10, 12, 16). Note, however, that the bindings were weak and thus preventing us from obtaining the binding constants of the GH9 and CBM3c modules for Avicel. The CBM3b (TM7) also bound to Avicel and PASC (Figure 80A, B), although in this case also the binding constants could not be determined.

Methods used with CM952 Polypeptides

[00709] Methods used with the experiments above for CM952 polypeptides include the following:

[00710] Determination of optimal pH and temperature: Two buffers were used for pH profiling of CM952: 50 mM sodium citrate, 150mM NaCl (pH 4.0-pH 6.0) and 50 mM

Na 2 HP0 4 -NaH 2 P0 4 , 150 mM NaCl (pH 6.5-pH 8.0). To measure the optimal pH of the enzymes on cellulose substrate, 0.5 μΜ CM952 wild-type or one of its truncation mutants was incubated with 2.5 mg/ml PASC in each buffer at a given pH at 75°C, and the activities in a 10 min assay were determined. The reducing sugars released were measured using the /?HBAH assay. For determination of optimal temperature, 0.5 μΜ of each enzyme was incubated with 2.5 mg/ml PASC at pH 5.5 at different temperatures ranging from 40 °C to 95 °C with a 5°C interval. The optimal pH and temperature for mannanase activity were determined as described above, except for the replacement of PASC with mannan as the substrate and change of the enzyme

concentration to 12.5 nM.

[00711] Enzymatic assays: The specific activities of CM952 wild-type and its mutants on Avicel and filter paper were determined at 75°C in the optimal buffer for the enzymes. The enzyme concentrations were 0.3 μΜ for each protein except for TM4 (5 μΜ). At different time intervals in a 90 min assay, samples were taken out and the products released determined as the amount of reducing ends present in the reaction mixture. The specific activities were determined in the region where the relation of reducing sugar versus time was linear.

[00712] The kinetics of CM952 wild- type and its mutants on PASC, locust bean gum, guar gum, and konjac glucomannan were determined in a 30 min assay. Different concentrations of the enzymes were incubated with a range of concentrations of substrates at 75°C. The velocities of release of reducing ends were determined and plotted against the concentrations of the substrates to estimate the kinetic parameters using the software GraphPad Prism 5.01 (GraphPad, San Diego, CA).

[00713] Time course hydrolysis of phosphoric acid swollen cellulose (PASC): Two point five mg/ml PASC was incubated with 0.5 μΜ CM952 WT, TMl, and TM5 at 75°C. At different time intervals (0 min, 2 min, 10 min, 60 min, 4 h, and 24 h), samples were taken out and applied to HPAEC-PAD analysis as described earlier (29).

[00714] Analyses of oligosaccharides hydrolysis and transglycosylation activity: Glucose, cello-oligosaccharides (cellobiose, cellotriose, cellotetraose, cellopentaose, and cellohexaose), mannose, and manno-oligosaccharides (mannobiose, mannotriose, mannotetraose,

mannopentaose, and mannohexaose), each at a final concentration of 1 mg/ml were incubated with 0.1 μΜ CM952 wild-type, CM952TM1, and CM952TM5 in a citrate buffer (10 mM sodium citrate, 150mM NaCl, pH5.5) at 75 °C for 14 h. The total reaction volume was 40 μΐ. The reaction products were dried using a SpeedVac concentrator (Thermo Fisher Scientific,

Pittsburgh, PA) and dissolved in 3.5 μΐ of H 2 0, and 1 μΐ of the products were analyzed by thin- layer chromatography (TLC) using a 250 μιη thick Whatman silica gel 60A (Maidstone, England). The TLC method was the same as described in our earlier report (29).

[00715] Thermostability assay: The thermostability of C 952 and its truncation mutants harboring cellulase activity were determined by incubating the enzymes at 75 °C, 80 °C, and 85 °C (WT, TM1, TM2, and TM3) or at 45 °C, 50°C, and 55°C (TM4) on a Veriti 96-well thermal cycler (Applied Biosystems, Carlsbad, CA). At different time points, aliquots were taken from the reaction mixture and residual enzymatic activity was determined with PASC as the substrate.

[00716] Site-directed mutagenesis and circular dichroism'. For site-directed mutagenesis, the QuikChange Multi Site-Directed Mutagenesis Kit (Stratagene, La Jolla, CA) was used according to the manufacturer's instructions. One hundred nanograms of the plasmid encoding CM952TM3 were used as the template in the PCR amplification. The reaction mixture contained 100 ng of the mutagenic primer, 1 μΐ dNTP mix, 0.75 μΐ QuikSolution and 1 μΐ QuikChange Multi enzyme blend. The nucleotide sequences of the mutagenic primers used for mutagenesis are shown in Supplemental Table 1. The PCR amplification steps were carried out as follows: an initial denaturation at 95°C for 1 min, followed by 30 cycles of 95°C for 1 min, 55 °C for 1 min, and 65 °C for 15 min. The PCR product was digested with Dpnl (New England Biolabs) at 37°C for 4 hours to degrade the parental plasmid DNA. The product from the Dpnl digestion was used in electro transforming JM109 competent cells using a Gene Pulser Xcell electroporation system (BioRAD, Hercules, CA). The E. coli cells were spread on LB plates containing 100 μg/ml ampicillin and incubated at 37 °C overnight. Single colonies were inoculated in 7 ml LB medium supplemented with 100 μg/ml ampicillin and cultured for 10 h. The plasmids were extracted from the recombinant E. coli cells and the inserts were sequenced (W. M. Keck Center for Comparative and Functional Genomics, UIUC) to confirm the presence of the desired mutation. Circular dichroism scans of mutated proteins were carried out as described in our previous report (37).

[00717] Measurement of reducing sugar in the soluble and insoluble fraction of hydrolyzed filter paper: The reducing sugars in the soluble and insoluble fractions of filter paper hydrolysis products were determined as described by Irwin et al. (17). The CM952 wild-type and its mutants (0.5 μΜ each except TM4, which was 10 μΜ) were incubated with five plates of Whatman No. 1 filter paper (0.6 cm in diameter) in a citrate buffer (pH5.5) at 75°C (for TM4, the temperature was 45°C, since this construct has lower thermostability) in 200 μΐ. The mixtures were shaken end-over-end for 16 h. The reaction products were centrifuged, and the supernatants (soluble fractions) were analyzed for the amounts of reducing ends. For reducing sugar determination in the insoluble fraction, the filter papers were initially washed four times each with 1 ml of the citrate buffer. Two hundred microliters of the citrate buffer was then added to the insoluble fraction (precipitated filter paper) followed by assaying for reducing ends through the pH AH method.

[00718] Binding of Cbl952 wild-type and its truncated mutants to cellulose: For qualitative measurements of the capacity of the individual polypeptides to bind to cellulose, thirty micrograms of CM952 wild- type and its mutants were incubated with 40 mg/ml Avicel cellulose or 2.5 mg/ml PASC in 50 mM Tris-HCl, 150 mM NaCl (pH 7.5). The mixture was shaken end- over-end at 4°C for 1 h. Then the bound and unbound proteins were separated by centrifugation of the mixture at 16,400 rpm for 3 min. The cellulose pellet was washed four times with 1 ml buffer (50 mM Tris buffer, 150 mM NaCl, pH 7.5). Seventy microliters of lx SDS-PAGE loading buffer was added to the pellet and boiled for 5 min to release bound proteins. The protein present in one tenth of the volume of the supernatant (unbound protein) and the cellulose pellet (bound protein) was examined by a 12% SDS-PAGE.

[00719] For quantitative binding assay, different concentrations of proteins were mixed with 2 mg/ml Avicel in 50 mM Tris-HCl, 150 mM NaCl, pH7.5 buffer in a 2-ml tube. As a control, proteins with the same concentrations were incubated without Avicel in the tube. After 1.5 h end-over-end incubation at 4°C, the mixtures were centrifuged at 16,400 rpm for 3 min. The protein concentrations in the supernatant were determined using a bicinchoninic acid (BCA) Protein Assay Reagent Kit (Thermo Scientific, Rockford, IL). Taking the protein concentration from the tube without cellulose as the total protein, the concentrations of bound protein were obtained by subtracting the protein concentration of the sample with cellulose from the total protein concentration. For determination of the binding parameters, the Michaelis/Langmuir equation (q a Jq=K p xq milx /(l+K p xq)) as described in our previous report (46) was used. The q ad in the equation represents the amount of bound protein (nmol of protein per gram of Avicel), q is the free protein (μΜ), and max is the maximal amount of bound protein to Avicel. The calculation of the binding parameters was carried out with GraphPad Prism 5.01.

[00720] Amino acid sequence alignment: The amino acid sequences of the family 9 glycoside hydrolase catalytic module of the Clostridium cellulolyticum Cel9G (GenBank accession number AAA73868)(26) and that of the Thermobifida fusca Cel9A (GenBank accession number:

AAB42155)(34) were retrieved from Carbohydrate Active enZYme database and the Genbank database and aligned with the GH9 sequence of CM952 by using ClustalX. Similarly, the amino acid sequences of the CBM3c modules from the characterized cellulases of different bacterial sources in the published literatures were aligned. These include: ADQ45731 : putative cellulase of Caldicellulosiruptor kronotskyensis; ABP66693: putative cellulase of Caldicellulosiruptor saccharolyticus; ADL42950: putative Caldicellulosiruptor obsidiansis cellulase/mannan endo- 1,4-beta-mannosidase; AAK06394: CelE of Caldicellulosiruptor sp. Tok7B. l (11); AAA73868: Cel9G of Clostridium cellulolyticum (26); AAC38572: EngH of Clostridium cellulovorans (38); CAA39010: Cel9Z of Clostridium stercorarium (18); ABX43720: Cel9 of Clostridium phytofermentans (39, 48); ABN51860: Cel9I of Clostridium thermocellum DSM 1313 (50); CAB38941: Cel9B of Paenibacillus barcinonensis (32); BAB33148: CelQ of Clostridium thermocellum Fl (2); AAA23086: CenB of Cellulomonas fimi (27); AAW62376: CBP105 of Cellulomonas flavigena (28); AAB42155: Cel9A of Thermobifida fusca (16, 34). The aligned sequences were analyzed using the BOXSHADE 3.21 with a default setting of the fraction of sequences parameter as 0.5.

Additional Assays

[00721] Figure 20 shows the enzymatic activity of CM952 wild-type on natural substrates from a reducing sugar assay. Twelve different substrates were tested: Avicel, phosphoric acid swollen cellulose (PASC), sodium carboxymethyl cellulose (CMC-Na), lichenin, mannan, locust bean gum (LBG), guar gum, konjac glucomannan (KGM), wheat arabinoxylan (WAX), birchwood xylan (BWX), oat-spelt xylan (OSX) and xyloglucan were used. Incubation of enzymes with Avicel, PASC, CMC-Na, lichenin, mannan, LBG, guar gum, KGM, WAX and OSX substrates led to release of products that were quantified as a concentration of glucose equivalents. The CM952 wild-type mainly hydrolyzes glucose- and mannose-configured substrates, but not xylose-configured substrates.

[00722] Figure 21 shows the enzymatic activity of CM952TM1 on natural substrates from a reducing sugar assay. Twelve different substrates were tested: Avicel, phosphoric acid swollen cellulose (PASC), sodium carboxymethyl cellulose (CMC-Na), lichenin, mannan, locust bean gum (LBG), guar gum, konjac glucomannan (KGM), wheat arabinoxylan (WAX), birchwood xylan (BWX), oat- spelt xylan (OSX) and xyloglucan were used. Incubation of enzymes with Avicel, PASC, CMC-Na, lichenin, mannan, LBG, guar gum, KGM, WAX, BWX, OSX and xyloglucan substrates led to release of products that were quantified as a concentration of glucose equivalents. The results show that CM952TM1 mainly hydrolyzes glucose-configured substrates. It also has some activities on mannose-configured substrates. Its activities on xylose- configured substrates are low.

[00723] Figure 22 shows the enzymatic activity of CM952TM5 on natural substrates from a reducing sugar assay. Twelve different substrates were tested: Avicel, phosphoric acid swollen cellulose (PASC), sodium carboxymethyl cellulose (CMC-Na), lichenin, mannan, locust bean gum (LBG), guar gum, konjac glucomannan (KGM), wheat arabinoxylan (WAX), birchwood xylan (BWX), oat- spelt xylan (OSX) and xyloglucan were used. Incubation of enzymes with CMC-Na, lichenin, mannan, LBG, guar gum and KGM substrates led to release of products that were quantified as a concentration of mannose equivalents. The CM952TM5 mainly hydrolyzes mannose-configured substrates, but does not have obvious activity on glucose- or xylose- configured substrates.

[00724] Figure 23 shows the enzymatic activity of Cb 1952 wild-type, Cb 1952TM 1 and CM952TM5 on glucose and cellooligosaccharides from a thin-layer chromatography (TLC) assay. Glucose and five different cellooligosaccharides were used: cellobiose, cellotriose, cellotetraose, cellopentaose and cellohexaose. CM952 wild-type and CM952TM1 hydrolyze cellotriose, cellotetraose, cellopentaose and cellohexaose into glucose and cellobiose, but have no activity on cellobiose. CM952TM5 has no activity on glucose and any of the

cellooligosaccharides tested. None of the enzyme has transglycosylation activity on glucose and cellooligosaccharides. [00725] Figure 24 shows the enzymatic activity of Cb 1952 wild-type, Cb 1952TM 1 and CM952TM5 on mannose and mannooligosaccharides from a thin-layer chromatography (TLC) assay. Mannose and five different mannooligosaccharides were used: mannobiose, mannotriose, mannotetraose, mannopentaose and mannohexaose. CM952 wild-type and CM952TM5 hydrolyze mannotriose, mannootetraose, mannopentaose and mannohexaose into mannose and smaller mannooligosaccharides, but have no hydrolyzing activity on mannobiose. CM952TM1 hydrolyzes mannopentaose and mannohexaose into smaller oligosaccharides but has no hydrolyzing activity on mannobiose, mannotriose, mannotriose and mannotetraose. None of the enzyme has transglycosylation activity on mannose and mannooligosaccharides.

[00726] The concentration of glucose or mannose equivalents was determined following enzymatic hydrolysis of the natural polysaccharides, according to the methods of Lever, M. (A new reaction for colorimetric determination carbohydrates. Anal. Biochem. 1972: 47; 273-279). 1.5 mL microcentrifuge tubes were "zeroed" in an analytical balance. Next, 2 ± 0.1 mg Avicel or mannan were added to each tube, and the mass measured and recorded. The volumes needed to be added to each tube were calculated based on the mass. For CMC-Na and PASC, a stock substrate solution of CMC-Na (2%) and PASC (6.11 mg/ml) were used. For lichenin, KGM, WAX, BWX, OSX and xyloglucan, 2% stock solution was used. For LBG and guar gum, 0.5% stock solution was used. Sodium citrate reaction buffer and enzymes were added to each tube beginning with the reaction buffer. The tubes were incubated with constant mixing in a

Thermomixer R (Eppendorf) at 75°C for 16 h. The tubes were centrifuged at 10,000 rpm for 5 min at 4°C. 50 of sample supernatant was transferred to a clean 1.5 mL centrifuge tube for the /?HBAH assay. 1 mL of a stock solution of glucose was made at a concentration of 100 mM in sodium citrate buffer, and then serial dilutions were made in sodium citrate buffer to the following concentrations (20 mM, 10 mM and 5 mM). 50 mg of /?HBAH was dissolved in 50 mL of ice-cold citrate/NaOH solution for a final concentration of 0.1% (w/v), and the solution was kept on ice. 150 of /?HBAH solution was added to 50 of the sample and glucose standard solutions, and the tubes were incubated at 100°C for 10 min. The tubes were incubated at room temperature for 5 min. The wavelength at 410 nm was measured for the standards and samples. The A 41 o n m and glucose concentrations were plotted against each other, and linear regression was used to fit a line to the data. The correlation coefficient (R ) value was between 0.98 and 1.0. The equation from the standard curve was used to calculate the concentrations of reducing ends in the samples based upon their absorbances.

[00727] Figure 25 shows the enzymatic activity of CM952TM1 on cellulose substrates using HPLC analysis. Three different cellulosic substrates were tested: Avicel, CMC-Na and PASC. In each case, in the presence of CM952TM1, glucose and cellobiose were released. In the absence of CM952TM1, neither glucose nor cellobiose was observed for all the substrates. The results showed that this part of the enzyme or polypeptide (CM952) cleaves glucose and cellobiose as end products from cellulosic substrates (Avicel, CMC-Na and PASC).

[00728] Figure 26 shows a time-course hydrolysis of PASC by CM952TM1. 100 nanomolar of CM952TM1 was incubated with 2.5 mg/ml PASC at 75 °C. At different time interval (0, 0.5 min, 2 min, 10 min, 1 h, 4 h and 24 h), samples were taken out and immediately boiled for 10 min to inactivate the enzyme. After centrifugation, the supernatants of the samples were appropriately diluted with water and applied to HPLC analysis. The results show that

CM952TM1 initially releases glucose, cellobiose, cellotriose and cellotetraose. With increasing time, only glucose and cellobiose were left in the reaction mixture.

[00729] Figure 27 shows the thermostability of CM952 wild-type using PASC as substrate for activity measurement. CM952 wild-type has 75%, 43%, 17% and 12% activity after incubation at 70 °C, 75 °C, 80 °C and 85 °C for 24 h, respectively. 500 nM CM 952 wild- type was kept at different temperatures (70 °C, 75 °C, 80 °C and 85 °C). The samples were taken out at different time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately used for enzyme activity measurement. The enzyme activity was measured at pH 5.5 and at 85 °C on a thermomixer. 2.5 mg/ml final concentration of PASC was used for measurement, and 8.31 μΐ of the protein sample was added to the substrate and mixed by pipetting up and down for several times. The total volume was 100 μΐ. The reducing ends corresponding to glucose equivalents were measured according to the methods of Lever, M. (supra). The velocity of reaction in 10 minutes was calculated. The velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the velocity of reaction at time 0, then multiplied by 100, respectively. [00730] Figure 28 shows the thermostability of CM952TM1 using PASC as substrate for activity measurement. CM952TM1 has 94%, 76%, 18% and 13% activity after incubation at 70 °C, 75 °C, 80 °C and 85 °C for 24 h, respectively. 500 nM CM952TM1 was kept at different temperatures (70 °C, 75 °C, 80 °C and 85 °C). The samples were taken out at different time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately used for enzyme activity

measurement. The enzyme activity was measured at pH 5.5 and at 85 °C on a thermomixer. 2.5 mg/ml final concentration of PASC was used for measurement, and 8.31 μΐ of the protein sample was added to the substrate and mixed by pipetting up and down for several times. The total volume was 100 μΐ. The reducing ends corresponding to glucose equivalents were measured according to the methods of Lever, M. (supra). The velocity of reaction in 10 minutes was calculated. The velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the velocity of reaction at time 0, then multiplied by 100, respectively.

Discussion of Results with CM952 Polypeptides

[00731] CM952 is the first GH9 cellulase characterized with two tandem CBM3b modules linked to the GH9-CBM3c domains. The CBM3b module (TM7) binds to insoluble cellulose (Figure 80A and B). Deletion of one CBM3b (TM2) from TM1 did not significantly affect the binding to these substrates (Figure 80 A and B). Correspondingly, the specific activities and kinetic parameters of TM1 and TM2 are similar for cellulose substrates (Table 1). Further deletion of another CBM3b (TM3) reduced both the binding to the insoluble substrates and the specific activities for Avicel and filter paper (Table 1). Therefore, the CBM3b modules facilitate the deconstruction of crystalline cellulose by CM952.

[00732] CM952 and its truncation mutants, especially TM3, retained considerable activities after incubation at 75°C for 24 h. For other hyperthermophilic endoglucanases, Cel5A of Thermoanaerobacter tengcongensis has above 80% residual activity after incubation at 60°C for 24 h (24), the Avicelase I of Clostridium stercorarium has above 60% residual activity after incubation at 80°C for 12 h (6), and the CelB of Caldicellulosiruptor saccharolyticus has a half- life of 29 h at 70°C (35). The thermostability property of the multi-functional enzyme can facilitate recycling during its use in releasing fermentable sugars from cellulosic substrates. Introduction of an enzyme recycling step in cellulosic ethanol production can significantly reduce the cost of production of the value added product.

[00733] The C. bescii CM954 (CelA) (ORF1954, GenBank accession number ACM60955) is the first cellulase characterized from this bacterium (49). It is the most highly secreted cellulase when C. bescii is grown on Avicel medium (25). Similar but not identical to CM952, CM954 is composed of an N-terminally located GH9 module, a C-terminally located GH48 module, and three CBM3 modules between the two catalytic domains. The specific activity of CM952 on Avicel (10.15 μιηοΐ sugar/min/μιηοΐ enzyme) was much lower than that of the full-length CM954 / CelA (55.0 μιηοΐ sugar/min/μηιοΐ enzyme), but only slightly lower than that of its truncation mutant CelA' containing the GH9 catalytic module and CBMs (18.0 μιηοΐ

sugar/min/μιηοΐ enzyme) (49). In a comparison with other hyperthermostable endoglucanases, this specific activity of CM952 was lower than those of Cel5A of Thermoanaerobacter tengcongensis (60.0 μιηοΐ sugar/min/μηιοΐ enzyme) (24) and Avicelase I of Clostridium stercorarium (30.2 μιηοΐ sugar/min/μηιοΐ enzyme) (6), comparable to that of EGPh of

Pyrococcus horikoshii (12.7 μιηοΐ sugar/min/μηιοΐ enzyme) (1), but much higher than that of the C. saccharolyticus CelB (0.4 μιηοΐ sugar/min/μηιοΐ enzyme) (41). The specific activity of CM952 on filter paper was comparable to those of CelB of Thermotoga neapolitana (20.8 μιηοΐ sugar/min/μηιοΐ enzyme) (5), Cel5A of Thermoanaerobacter tengcongensis (18.5 μιηοΐ sugar/min/μηιοΐ enzyme) (24), and EglA of Pyrococcus furious (18.7 μιηοΐ sugar/min/μηιοΐ enzyme) (3), but much higher than those of CelA of Thermotoga neapolitana (3.2 μιηοΐ sugar/min/μηιοΐ enzyme) (5) and CelB of C. saccharolyticus (1.8 μιηοΐ sugar/min/μηιοΐ enzyme) (41). Note that the assay conditions (reaction temperature, buffer, reaction period, method for measuring reducing sugar, and lab equipment) for these specific activities may vary among the enzymes described above. Nevertheless, CM952 is an effective enzyme for releasing

fermentable sugars from cellulosic substrates at high temperatures.

[00734] Interestingly, seven out of the nine genes in the gene cluster in which the CM952 encoding gene is located also contain CBM3b modules identical to or highly similar to (identity > 98%) the CBM3b of CM952. Six of the polypeptides in the gene cluster have either two or three tandem CBM3b repeats. It is reasonable to postulate that these CBM3b modules aid in plant cell wall hydrolysis.

[00735] The mannanase activity of CM952 was mainly located in the GH5 module; however, limited mannanase activity was also observed with the construct containing the GH9 domain as the catalytic module. In most cases, family 9 glycoside hydrolases are described as

endoglucanase (10, 16), cellobiohydrolase (36), l,4- -D-glucan glucohydrolase (33), β- glucosidase (30), and exo- -glucosaminidase (14). The Bacillus licheniformis Cel9 is the only member of this family reported to have mannanase activity (42); however, its kinetic data and hydrolysis pattern on mannose-configured substrates are unknown. The TM1 mutant of CM952 showed different hydrolysis patterns compared with the TM5 mutant, in that the GH9 needed a minimal chain length of five and released mannobiose as the shortest end-product, while the GH5 needed a minimal chain length of three and released mannose as the shortest end-product. The ability of the GH9 module of CM952 to hydrolyze mannose-configured substrates suggests that the catalytic module can both accommodate the equatorial C-2 hydroxyl of glucose and also tolerate the axial C-2 hydroxyl of mannose.

[00736] The absence of a tryptophan for -3 subsite hydrophobic interaction was proposed to destabilize the non-productive binding which might impair the processivity of a GH9 cellulase (31). The mutations of G208 and T298 into aromatic residues (TM3G208WG, TM3G208W, and TM3T298F), however, did not change the processivity of TM3 as reflected by the unaltered ratios of soluble versus insoluble reducing ends. The specific activities of the mutants on crystalline cellulose (Avicel and filter paper) were also comparable to that of the parental TM3. However, for non-crystalline PASC, all turnover numbers of the mutants were increased by roughly 2 folds while the catalytic efficiencies remained unchanged. The different structures of crystalline and non-crystalline cellulose might affect the performance of these enzymes. One of the mutants, TM3G208WG, increased its substrate specifity for locust bean gum by 35 folds

suggesting that residues for subsite -3 interaction might be involved in substrate selection.

[00737] CBM3c has been proposed to bind loosely to the cellulose ligand and feed a cellulose chain into the GH9 catalytic module (34). However, no biochemical data was provided for this binding. In the co-crystal structure of family 9 cellulase in complex with cello-oligosaccharides, the binding of the cello-oligosaccharide to CBM3c has not been observed so far (26, 34). Our results suggest that a CBM3c can indeed bind to insoluble cellulose although the binding appeared weak. A sequence alignment of CM952 with its homologs revealed that considerable differences exist in the amino acid residues proposed to interact with the ligand (19, 22, 23) between CM952 CBM3c with its homologues. The conserved Q553, R557, E559, and R563 residues in ThefuCel9A proposed to interact with the ligand are correspondingly replaced by E545, K549, Q561, and K565, respectively, in the CBM3c of CM952 (Figure 82). This observation may be akin to the fine-tuning demonstrated in a Caldanaerobius

polysaccharolyticus family 16 CBM by mutating one polar residue (Q121 to E121) involved in hydrogen bonding with the ligand (37). The E545, K549, Q561, and K565 residues can also be found in four CBM3 modules from the related organisms C. kronotskyensis, C. saccharolyticus, C. obsidiansis, and Caldicellulosiruptor sp. Tok7B. l (Figure 82). A three-dimensional structure of a CBM3c in complex with a ligand is still lacking, which hinders accurate designation of residues important for ligand binding. Due to the diversity of CBM3c modules (19), one may postulate that other variants of CBM3c might exist which could hold a cello-oligosaccharide tightly enough to capture this complex in a crystal.

Example 10: Endo-glucanase/mannanase Cbl953

[00738] A putative endoglucanase, CM953WT, was identified in Caldicellulosiruptor bescii. The enzyme is the gene product of cbl953, where Cb stands for Caldicellulosiruptor bescii. The endoglucanase cleaves mostly cellobiose from cellulose. The CM953WT protein is 1391 amino acids long and has a molecular weight of 153.6 kDa (His-tag + CM953 protein). The

CM953WT has two Glycoside Hydrolase (GH) family 5 catalytic domain and 3 carbohydrate binding proteins (Figure 29). Two truncated mutants were made, as shown in Figure 29, to determine the activity in each GH5 module. For the truncated mutants, CM953TM1 (961 amino acids, 103.9 kDa) has N-terminal GH 5 catalytic domains with 3 carbohydrate binding modules, whereas CM953TM2 (1108 amino acids, 121.7 kDa) has C-terminal GH5 catalytic domains with 3 carbohydrate binding modules as like shown in Figure 29.

PGR amplification of Cb!953WT [00739] The genes were amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using PrimeSTAR DNA Polymerase (TAKARA). The CM953WT, CM953TM1, and CM953TM2 coding sequences were amplified using the following respective primer set:

[00740] Cbl953WTForward: 5'-GAC GAC GAC AAG ATG GCT ACA TCT AAT

GATGGA GTA GTG AAG -3' (SEQ ID NO: 56)

[00741] Cbl953WTReverse: 5 '-GAG GAG AAG CCC GGT TAA TTT TGC GGC TGG AAC TGG CGC TGG TTC -3' (SEQ ID NO: 57)

[00742] The polymerase chain reaction mixture contained the following:

PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1

50 μΜ Rv Primer 1

10 mM dNTP Mixture 1

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ,

[00743] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 5 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

Clonins of CM953TMl [00744] Cbl953TMlForward: 5'-GAC GAC GAC A AG ATG GCT AC A TCT A AT GATGGA GTA GTG AAG -3' (SEQ ID NO: 56)

[00745] Cbl953TMlReverse: 5'-GAG GAG AAG CCC GGT TAT GGC ATT GGT ATT ACT GTC TGC ACC GG -3' (SEQ ID NO: 58)

[00746] The polymerase chain reaction mixture contained the following: PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1

50 μΜ Rv Primer 1

10 mM dNTP Mixture 1

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ,

[00747] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 4 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

Clonins of Cb!953TM2

[00748] Cbl953TM2Forward: 5' -GAC GAC GAC AAG ATG

GGTGCCTCTTCAGTACCTACTTCAACACC -3' (SEQ ID NO: 59)

[00749] Cbl953TM2Reverse: 5'- GAG GAG AAG CCC GGT TAA TTT TGC GGC TGG AAC TGG CGC TGG TTC -3' (SEQ ID NO: 57) [00750] The polymerase chain reaction mixture contained the following:

PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μL Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1

50 μΜ Rv Primer 1

10 mM dNTP Mixture 1

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ.

[00751] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 4 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

[00752] After the PCR amplification described above, the products of Cb 1953WT, CM953TM1, and CM953TM2 were confirmed by 1% agarose gel electrophoresis. The DNA corresponding to the expected band on the gel was cut out and applied to a Qiagen Gel Extraction kit to extract the DNA out of the gel.

[00753] The Novagen pET-46 Ek/LIC kit was used to treat each purified DNA and ligate it into the pET-46 Ek/LIC vector. The treatment of the purified DNA was as follows:

Unit

Reaction (μΐ) Incubation

0.1 pmol purified PCR product X 1 OX T4 DNA Polymerase buffer 1

25 mM dATP 1

100 mM DTT 0.5

Nuclease-free water 7.3-X

2.5 U/μΙ T4 DNA Polymerase 02

Total 10 22°C 30 min

After the reaction, the enzyme was inactivated by incubation at 75 °C for 20 min.

[00754] The following protocol was used to anneal the insert into the pET-46 Ek/LIC vector:

Unit

Reaction (μΐ) Incubation pET-46 Ek/LIC vector 0.5

T4 DNA Polymerase treated EK/LIC insert 1

Total 1.5 22°C 5 min

Then add 0.5 μΐ 25 mM EDTA. Mix by stirring with pipet tip. Incubate at 22°C for 5 min.

[00755] The ligation mixture for Cb 1953-pET-46 Ek/LIC was introduced into E. coli JM 109 by electroporation, and the cells were plated on LB-ampicillin. After overnight incubation at 37°C, four colonies were selected and each was used to inoculate 6 mL cultures of LB- ampicillin. The cultures were grown at 37°C with vigorous aeration for 16 hours, and minipreps (QIAGEN) were made from the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to check the size of the plasmid DNA. After confirmation of insertion of the gene into the plasmid, the inserts were sequenced to confirm the integrity of their sequences.

[00756] For gene expression, one of the plasmids was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with

chloramphenicol (100 μg/ml) and ampiciUin (50 μg/ml) and incubated at 37°C overnight. Five to six colonies were inoculated into 3 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 4 hours. One mL of the culture was added to 500 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37 °C until the absorbance at 600 nm reached -0.25. The inducer, IPTG, was then added at 0.1 mM final concentration, and the culturing continued at 16° C overnight.

Gene and protein sequences of Cbl953WT. CM953TML and CM953TM2 Wild type CM953 amino acid sequence

[00757] The wild-type CM953 amino acid sequence is disclosed in SEQ ID NO: 60. The signal peptide of CM953, corresponding to amino acid numbers 1-38 of SEQ ID NO: 60 was removed during all PCR amplifications. Thus, the expressed wild-type CM953 protein did not contain amino acid numbers 1-38 of SEQ ID NO: 60. The amino acid sequence of the wild-type CM953 protein without the signal peptide is disclosed in SEQ ID NO: 61.

[00758] The procedure of cloning the gene for wild- type CM953 (without the signal peptide) into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The wild-type CM953 amino acid sequence (without the signal peptide) with the short peptide is disclosed in SEQ ID NO: 65. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 65.

Wild type CM953 nucleotide sequence

[00759] The wild-type CM 953 nucleotide sequence is disclosed in SEQ ID NO: 62. The signal peptide of CM953, corresponding to nucleotide numbers 1-114 of SEQ ID NO: 62 was removed during all PCR amplifications. Thus, the nucleotide sequence used to express wild- type CM953 protein did not contain nucleotide numbers 1-114 of SEQ ID NO: 62. The nucleotide sequence encoding the wild-type CM953 protein without the signal peptide is disclosed in SEQ ID NO: 63.

[00760] The wild-type CM953 nucleotide sequence (without the signal peptide) with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 64. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 64.

CM953TM1 amino acid sequence

[00761] The CM953TM1 amino acid sequence is disclosed in SEQ ID NO: 122. The procedure of cloning the gene for CM953TM1 into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The CM953TM1 amino acid sequence with the short peptide from pET-46 Ek/LIC is disclosed in SEQ ID NO: 67. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 67.

CM953TM1 nucleotide sequence

[00762] The CM953TM1 nucleotide sequence is disclosed in SEQ ID NO: 123. The CM953TM1 nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 66. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 66.

CM953TM2 amino acid sequence

[00763] The CM953TM2 amino acid sequence is disclosed in SEQ ID NO: 111. The procedure of cloning the gene for CM953TM2 into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The CM953TM2 amino acid sequence with the short peptide is disclosed in SEQ ID NO: 69. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 69.

CM953TM2 nucleotide sequence

[00764] The CM953TM2 nucleotide sequence is disclosed in SEQ ID NO: 110. The CM953TM2 nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 68. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 68.

Purification of Cb!953WT, Cbl953TML and CM953TM2 proteins

[00765] The CM953WT, CM953TM1, CM953TM2 were expressed in E. coli BL-21 CodonPlus (DE3) RIL competent cells by heat shock. The recombinant cells were then grown overnight in LB agar plates supplemented with ampicillin (100 μg/mL) and chloramphenicol (50 μg/ml) at 37 °C. After 8h, the starter cultures were diluted into fresh LB supplemented with ampicillin (100 μg/mL) and chloramphenicol (50 μg/ml) at 37 °C with aeration until the absorbance at 600 nm reached 0.5. Gene expression was then induced by addition of IPTG at a final concentration of 0.1 mM and the temperature for culturing was lowered to 16 °C. After 16 hours, the cells were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (25 mM Tris-HCL pH 7.8, 750 mM of NaCl, 5% glycerol, 20 mM imidazole, 1.25% Tween-20). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed three times to remove the unbound proteins. The bound protein (6-Histidine-tagged target proteins) was then eluted from the resin with an elution buffer (50 mM Tris-HCL, pH7.5, 250 mM imidazole). The eluted fractions were then heat-treated at 65 °C for 30 minutes and then centrifuged to remove the precipitated proteins. The proteins were then purified by gel filtration chromatography (HiLoad 16/20 Superdex 200, GE Healthcare) with a Tris-HCl elution buffer (50 mM Tris-HCl, 150 mM NaCl, pH 7.5). Aliquots of eluted fractions were analyzed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and proteins bands were visualized by staining with Coomassie brilliant blue G-250 (Figure 30).

Enzyme Activity

[00766] Figure 31 shows the zymogram of CM953WT, CM953TM1, CM953TM2 on carboxylmethyl cellulose (CMC). The gel was prepared as in standard dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) with CMC substrate (final 0.1%, w/v). After electrophoretic fractionation of the proteins, gels were washed twice in distilled water and incubated in 30 mL of refolding buffer (20 mM citrate buffer, pH 6.0, 2.5% Triton X-100, 2 mM dithiothreitol, 2.5 mM CaCl 2 ) for 1 hour at 25 °C and then held overnight in fresh buffer at 37°C. The gel was washed twice in 50 mM Citrate buffer (pH 6.0) and then the results were visualized by staining with 0.1% Congo red and destaining with 1M NaCl. As shown in Figure 31, CM953WT and CM953TM2 showed significant white bands at the positions of their expected sizes indicating cellulase activity, but not CM953TM1 protein.

[00767] Figures 32 and 33 show the enzymatic activity of Cb 1953WT, Cb 1953TM 1 , and CM953TM2 on natural substrates from a reducing sugar assay. Seven different substrates were tested: Avicel, Phosphoric acid swollen cellulose (PASC), carboxylmethyl cellulose (CMC), wheat arabinoxylan (WAX), lichenin, konjac glucomannan, and mannan. Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of glucose equivalents. The tubes were incubated with constant mixing in a Thermomixer R (Eppendorf) at 75°C for 16 h. The tubes were centrifuged at 10,000 rpm for 5 min at 4°C. 50 of sample supernatant was transferred to a clean 1.5 mL centrifuge tube for the /?HBAH assay. The wavelength at 410 nm was measured for the standards and samples. The A 41 o n m and glucose concentrations were plotted against each other, and linear regression was used to fit a line to the data. The reactions were resolved by thin layer chromatography (TLC), The mobile phase consisted of n-butanol: acetic acid:H20, 10:5: 1 (vol/vol/vol) and 10cm x 20cm plates were used. The reducing sugar assay (Figure 32) and TLC (Figure 33) results show that CM953WT and CM953TM2 have cellulase activity whereas CM953TM1 has mannanase activity. Through the zymogram, reducing assay, and TLC analysis on various substrates, we concluded that the C- terminal GH5 in CM953WT functions as a cellulase whereas the N-terminal GH5 functions as mannase.

[00768] Figure 34 shows the time course of enzymatic activity of CM953TM2 on PASC using HPLC analysis. For analysis of the products of hydrolysis, the samples were analyzed by high performance anion-exchange chromatography (HPAEC). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected onto a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA Biosciences (Chelmsford, MA). For the TLC and HPLC analysis, glucose and five different cellooligosaccharides (cellobiose, cellotriose, cellotetraose, cellopentaose, and cellohexaose) were used as standards. In the reaction, CM953 started to release

cellooligosaccharides (C2-C4) and then glucose was released later. The results showed that this enzyme releases mainly cellobiose from PASC.

[00769] Figures 35 and 36 shows the thermostability of CM953WT and CM953TM2 on PASC. 50 nM CM953WT and CM953TM2 were kept at different temperatures (70 °C, 75 °C, 80 °C, 85 °C and 90 °C). The samples were taken out at different time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately used in enzyme activity measurement. The enzyme activity was measured at 85 °C using Cary 300 UV-Vis spectrophotometer (Varian). The initial velocity of reaction in the first minute was calculated. The initial velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the initial velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the initial velocity of reaction at time 0, then multiplied by 100. From the results, CM953WT (Figure 35) and CM953TM2 (Figure 36) were quite stable at 70 °C and 75 °C, maintaining activity of 75-90% of heat non-treated proteins.

[00770] Figure 37 shows the kinetic studies of CM953WT and CM953TM2 on PASC. 0.05 μΜ of purified CM953WT or CM953TM2 in 50 mM Na 2 HP0 4 -HCl, pH 6.0, and 150 mM NaCl was incubated with various concentrations of phosphoric acid swollen cellulose (PASC), and the initial rate of hydrolysis was plotted against substrate concentration. The kinetic parameters ( K m : 7.603 mg/mL, £ cat : 7.513 s "1 and k c K m : 0.988 s "1 mL/mg for CM953WT and K m : 3.032 mg/mL, k cai : 5.411 s "1 and k c K m : 1.785 s "1 mL/mg for CM953TM2 ) were determined by fitting the data to the Michaelis-Menten equation (Graph Pad Prism v5.01).

Example 11: Endocellulase Cbl954

[00771] A putative endoglucanase, CM954, was identified in Caldicellulosiruptor bescii, where Cb stands for Caldicellulosiruptor bescii. The protein has a Glycoside Hydrolase (GH) family 9 catalytic domain (putative cellulase domain), three family 3 carbohydrate binding modules (CBMs) and one GH48 catalytic domain (Figure 38). The CM954 wild-type is 1746 amino acids long and has a predicted molecular mass of 193.6 kDa (His-tag + CM954 wild- type protein). The enzyme C 954TM3 and CM954TM5 are the truncational mutants of the gene product of CM954, where Cb stands for Caldicellulosiruptor bescii. The endocellulase cleaves glucose and cellobiose from cellulose as end products. The CM954TM3 protein is 709 amino acids long and has a molecular weight of 78.57 kDa (His-tag + CM954TM3 protein). The protein has a Glycoside Hydrolase (GH) family 9 catalytic domain and one family 3

carbohydrate binding module (CBM3). The CM954TM5 protein is 1294 amino acids long and has a molecular weight of 142.82 kDa (His-tag + CM954TM5 protein). The protein has a Glycoside Hydrolase (GH) family 48 catalytic domain and three family 3 Carbohydrate binding modules (CBM3).

Cloning of Cbl954 wild-type [00772] The gene for CM954 wild- type was amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using PrimeSTAR DNA Polymerase (TAKARA). The CM 954 wild-type gene was amplified using the following primer set:

[00773] The CM954 wild- type gene was amplified using the following primer set:

[00774] Cb 1954 wild-type Forward: 5 '- GAC GAC GAC AAG ATG CAA GAG GTT AGG GCT GGT TCG TTT AAC -3' (SEQ ID NO: 70)

[00775] Cb 1954 wild-type Reverse: 5 '- GA GGA GAA GCC CGG TTA TTG ATT GCC AAA CAG TAT TTC ATA TG -3' (SEQ ID NO: 71)

[00776] The polymerase chain reaction mixture contained the following: PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1

50 μΜ Rv Primer 1

10 mM dNTP Mixture 1

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ,

[00777] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 6 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

Clonins of CM954TM3 [00778] The gene for CM954 was amplified from Caldicellulosiruptor bescii DSM 6725T

®

genomic DNA by PCR using Pfu Turbo DNA Polymerase. The CM954TM3 gene was amplified using the following primer set:

[00779] Cbl954TM3Forward: 5'- GAC GAC GAC AAG ATG CAA GAG GTT AGG GCTGGT TCG TTT AAC -3' (SEQ ID NO: 70)

[00780] Cb 1954TM3Reverse: 5 '- GA GGA GAA GCC CGG TTA TAC CTT TAT CTG TCC ACC TGC TAC-3' (SEQ ID NO: 72)

[00781] The polymerase chain reaction mixture contained the following:

PCR reaction

2 .5υ/μΕ Pfu Turbo DNA Polymerase 0.5

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

20 μΜ Fw Primer 1

20 μΜ Rv Primer 1

10 mM dNTP Mixture 1

10 X Cloned Pfu Turbo DNA Polymerase Buffer 5

dH 2 Q 40.5

Total 50 μΐ,

[00782] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 2 .5min

Elongation 72°C 10 min 1 cycle

Last 4°C oo

Clonins of CM954TM5 [00783] The gene for CM954 was amplified from Caldicellulosiruptor bescii DSM 6725T

®

genomic DNA by PCR using Pfu Turbo DNA Polymerase. The CM954TM3 gene was amplified using the following primer set:

[00784] Cbl954TM5Forward: 5'- GAC GAC GAC AAG ATG TTC AAA GCT ATT GAA ACT CCA ACA AAC -3' (SEQ ID NO: 73)

[00785] Cb 1954TM5Reverse: 5 '- GA GGA GAA GCC CGG TTA TTG ATT GCC AAA CAG TAT TTC ATA TG -3' (SEQ ID NO: 71)

[00786] The polymerase chain reaction mixture contained the following:

PCR reaction

2 .5υ/μΕ Pfu Turbo DNA Polymerase 0.5

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

20 μΜ Fw Primer 1

20 μΜ Rv Primer 1

10 mM dNTP Mixture 1

10 X Cloned Pfu Turbo DNA Polymerase Buffer 5

dH 2 Q 40.5

Total 50 μΐ,

[00787] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 4min

Elongation 72°C 10 min 1 cycle

Last 4°C oo [00788] After the PCR described above, the amplification of Cb 1954wild-type, Cb 1954TM3 and CM954TM5 gene was confirmed by 1% agarose gel electrophoresis. The DNA

corresponding to the expected band on the gel was cut out and the amplified fragment was extracted using the Qiagen Gel Extraction kit.

[00789] A Novagen pET-46 Ek/LIC kit was used to treat the purified DNA and ligate it into the pET-46 Ek/LIC vector. The treatment of the purified DNA was as follows:

Unit

Reaction (μΐ) Incubation

0.1 pmol purified PCR product X

10X T4 DNA Polymerase buffer 1

25 mM dATP 1

100 mM DTT 0.5

Nuclease-free water 7.3-X

2.5 U/μΙ T4 DNA Polymerase 0.2

Total 10 22°C 30 min

After the reaction, the enzyme was inactivated by incubation at 75 °C for 20 min.

[00790] The following protocol was used to anneal the insert into the pET-46 Ek/LIC vector.

Unit

Reaction (μΐ) Incubation pET-46 Ek/LIC vector 0.5

T4 DNA Polymerase treated EK/LIC insert 1

Total 1.5 22°C 5 min

Then add 0.5 μΐ 25 mM EDTA. Mix by stirring with pipet tip. Incubate at 22°C for 5 min.

[00791] Each of the ligation mixture for Cbl954wild-type-, CM954TM3- or CM954TM5- pET-46 Ek/LIC was introduced into E. coli NovaBlue competent cells by chemical

transformation method, and the cells were plated on LB-ampicillin. After overnight incubation at 37°C, four colonies were selected and used to inoculate 6 mL cultures of LB-ampicillin. The cultures were grown at 37°C with vigorous aeration for 16 hours, and minipreps (QIAGEN) were made of the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to check the size of the plasmid DNA. After confirmation that the gene had been inserted into the plasmid, the genes were sequenced to confirm their identities.

[00792] For all the constructs of CM 954, only CM954TM3 could be cloned. Thus for the expression of this protein, one of the recombinant plasmids was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with chloramphenicol (50 μg/ml) and ampicillin (100 μg/ml) and incubated at 37°C overnight. Five colonies were inoculated into 3 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 4 hours. One mL of the culture was added to 500 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37 °C until the absorbance at 600 nm reached -0.25. The inducer, IPTG, was then added at 0.1 mM final concentration, and the culturing continued at 16° C overnight.

Protein purification

[00793] Cultures were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (50 mM Tris-HCL pH 7.5, 300 mM of NaCl). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed three times to remove the unbound proteins. The bound protein (6-Histidine-tagged CM954TM3) was then eluted from the resin with an elution buffer (50 mM Tris-HCL, pH7.5, 250 mM imidazole).

[00794] The design of the PCR primers ensured that the protein was fused to 6-histidines encoded in the plasmid. The six histidines will bind to either a nickel-charged resin or a cobalt- charged resin. The bound protein can be displaced from the resin with a buffer containing imidazole. This method facilitates quick purification of the protein of interest.

[00795] The CM954TM3 gene was expressed in E. coli cells, and the protein was purified in three steps, including a talon resin purification step making use of the 6-histidines encoded by the plasmid, an anion exchange step using Hitrap Q column and a gel filtration step using Hiload 16/60 Superdex 200 column. Figure 39A shows an SDS-PAGE of purified CM954TM3.

Gene and protein sequences of Cbl954WT. CM954TM3, and CM954TM5 Wild type CM954 amino acid sequence

[00796] The wild-type CM954 endocellulase (EC 3.2.1.4) amino acid sequence is disclosed in SEQ ID NO: 74. The signal peptide of CM954, corresponds to amino acid numbers 1-27 of SEQ ID NO: 74. The amino acid sequence of the wild-type CM954 protein without the signal peptide is disclosed in SEQ ID NO: 121.

Wild type CM954 nucleotide sequence

[00797] The wild-type CM954 nucleotide sequence is disclosed in SEQ ID NO: 116. The signal peptide of CM954 corresponds to nucleotide numbers 1-81 of SEQ ID NO: 116. The nucleotide sequence encoding the wild- type CM954 protein without the signal peptide is disclosed in SEQ ID NO: 75.

CM954TM3 amino acid sequence

[00798] The CM954TM3 amino acid sequence is disclosed in SEQ ID NO: 76. The procedure of cloning the gene for CM954TM3 into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The CM954TM3 amino acid sequence with the short peptide from pET-46 Ek/LIC is disclosed in SEQ ID NO: 81. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 81.

CM954TM3 nucleotide sequence

[00799] The CM954TM3 nucleotide sequence is disclosed in SEQ ID NO: 77. The

CM954TM3 nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 80. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 80.

CM954TM5 amino acid sequence

[00800] The CM954TM5 amino acid sequence is disclosed in SEQ ID NO: 78.

CM954TM5 nucleotide sequence [00801] The CM954TM5 nucleotide sequence is disclosed in SEQ ID NO: 79.

Enzyme Activity

[00802] Figure 39B shows the enzymatic activity of CM954TM3 on natural substrates from a reducing sugar assay. Three different cellulose substrates were tested: Avicel, sodium

carboxymethyl cellulose (CMC-Na) and phosphoric acid swollen cellulose (PASC). Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of glucose equivalents. Hydrolysis of PASC was higher than hydrolysis of other substrates.

[00803] The concentration of glucose equivalents was determined following enzymatic hydrolysis of Avicel, CMC-Na and PASC, according to the methods of Lever, M. (supra). 1.5 mL microcentrifuge tubes were "zeroed" in an analytical balance. Next, 2 ± 0.1 mg Avicel were added to each tube, and the mass measured and recorded. For CMC-Na and PASC, a stock substrate solution of CMC-Na (2%) and PASC (6.11 mg/ml) were used. Sodium citrate reaction buffer and enzymes were added to each tube beginning with the reaction buffer. The tubes were incubated with constant mixing in a Thermomixer R (Eppendorf) at 75°C for 16 h. The tubes were centrifuged at 10,000 rpm for 5 min at 4°C. 50 of sample supernatant was transferred to a clean 1.5 mL centrifuge tube for the /?HBAH assay to determine the reducing ends released by the enzyme. 1 mL of a stock solution of glucose was made at a concentration of 100 mM in sodium citrate buffer, and then serial dilutions were made in sodium citrate buffer to the following concentrations (20 mM, 10 mM and 5 mM). 50 mg of /?HBAH was dissolved in 50 mL of ice-cold citrate/NaOH solution for a final concentration of 0.1% (w/v), and the solution was kept on ice. 150 of /?HBAH solution was added to 50 of the sample and glucose standard solutions, and the tubes were incubated at 100°C for 10 min. The tubes were incubated at room temperature for 5 min. The wavelength at 410 nm was measured for the standards and samples. The A 41 o n m and glucose concentrations were plotted against each other, and linear regression was used to fit a prediction equation to the data. The coefficient of determination (R ) value was between 0.98 and 1.0. The equation from the standard curve was used to calculate the concentrations of reducing ends in the samples based upon their absorbances. [00804] Figure 40 shows the enzymatic activity of CM954TM3 on cellulosic substrates using HPLC analysis. Three different cellulosic substrates were tested: Avicel, CMC-Na and PASC. In each case, in the presence of CM954TM3, glucose and cellobiose were released. In the absence of CM954TM3, neither glucose nor cellobiose was observed for all the substrates. The results showed that this enzyme releases glucose and cellobiose, and also longer chain oligosaccharides as end products from cellulosic substrates (CMC-Na and PASC).

[00805] Figure 41 shows the thermostability of CM954TM3. CM954TM3 has 75%, 87%, 64% and 7% activity after incubation at 70 °C, 75 °C, 80 °C and 85 °C for 24 h, respectively. 500 nM CM954TM3 was kept at different temperatures (70 °C, 75 °C, 80 °C and 85 °C). The enzyme activity was measured at pH 5.5 and at 95 °C on a thermomixer. 2.5 mg/ml final concentration of PASC was used for measurement, and 10 μΐ of the protein sample was added to the substrate and mixed by pipetting up and down for several times. The total volume was 100 μΐ. The reducing ends corresponding to glucose equivalents were measured according to the methods of Lever, M. (supra). The velocity of reaction in 10 minutes was calculated. The velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the velocity of reaction at time 0, then multiplied by 100, respectively.

Example 12: Endo-glucanase Cbl946

[00806] A putative endoglucanase, CM946WT, was identified in Caldicellulosiruptor bescii. The enzyme is the gene product of CM946WT, where Cb stands for Caldicellulosiruptor bescii. The CM946WT protein is 1271 amino acids long and has a molecular mass of 139.8 kDa (His- tag + CM 946 protein). The CM946WT has a Glycoside Hydrolase (GH) family 5 catalytic domain at the N-terminal region and Glycoside Hydrolase (GH) family 44 catalytic domain at the C-terminal region and 2 carbohydrate binding modules are positioned between the two catalytic domains (Figure 42). For the truncated mutants, CM946TM1 (653 amino acids, 71.3 kDa) has N-terminal GH5 catalytic domain with 2 carbohydrate binding modules, whereas CM946TM2 (1015 amino acids, 111.0 kDa) has C-terminal GH44 catalytic domains with 2 carbohydrate binding modules as shown in Figure 42. Clonins of Cb!946WT

[00807] The wild type gene and its two truncated mutants (Figure 42) were amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using PrimeSTAR DNA Polymerase (TAKARA). The nucleotide sequences encoding CM946WT, CM946TM1, and CM946TM2 were amplified using the following primer set and procedures:

[00808] Cb 1946WT Forward: 5 -GAC GAC GAC AAG ATG GCT ACA TCT AAT GAT GGA GTA GTG AAG -3' (SEQ ID NO: 82)

[00809] Cb 1946WT Reverse: 5 '-GAG GAG AAG CCC GGT TAA TTT AGT TTG TAC TGA GGT TGA ATA TAA AAC GAT ATG G -3' (SEQ ID NO: 83)

[00810] The polymerase chain reaction mixture contained the following:

PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA

50 μΜ Fw Primer

50 μΜ Rv Primer

10 mM dNTP Mixture

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ,

[00811] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 5 min

Elongation 72°C 7 min 1 cycle Last 4°C oo

Clonins of Cb!946TMl

[00812] Cb 1946TM 1 Forward: 5' -GAC GAC GAC AAG ATG GCT ACA TCT AAT GAT GGA GTA GTG AAG -3' (SEQ ID NO: 82)

[00813] Cb 1946TM 1 Reverse: 5 '-GAG GAG AAG CCC GGT TAG TTA AAC CTT ATC TGT ATC TCC CCT GTG TC -3' (SEQ ID NO: 84)

[00814] The polymerase chain reaction mixture contained the following: PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1

50 μΜ Rv Primer 1

10 mM dNTP Mixture 1

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ,

[00815] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 4 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

Clonins of CM946TM2 [00816] Cb 1946TM2 Forward: 5 '-GAC GAC GAC AAG ATG GTA GGG TAC TTG GAC ATG GTA AAC AAT TGG GA -3' (SEQ ID NO: 85)

[00817] CM946TM2 Reverse: 5'-GAG GAG AAG CCC GGT TAA TTT AGT TTG TAC TGA GGT TGA ATA TAA AAC GAT ATG G -3' (SEQ ID NO: 83)

[00818] The polymerase chain reaction mixture contained the following:

PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1

50 μΜ Rv Primer 1

10 mM dNTP Mixture 1

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ,

[00819] To amplify the coding sequence from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 4 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

[00820] After the PCR described above, the amplification of CM946 gene was confirmed by 1% agarose gel electrophoresis. The DNA corresponding to the expected band on the gel was cut out and applied to a Qiagen Gel Extraction kit to extract the DNA out of the gel.

[00821] A Novagen pET-46 Ek/LIC kit was used to treat the purified DNA and ligate it into the pET-46 Ek/LIC vector. The treatment of the purified DNA was as follows: Unit

Reaction (μΐ) Incubation

0.1 pmol purified PCR product X

1 OX T4 DNA Polymerase buffer 1

25 mM dATP 1

100 mM DTT 0.5

Nuclease-free water 7.3-X

2.5 U/μΙ T4 DNA Polymerase 02

Total 10 22°C 30 min

After the reaction, the enzyme was inactivated by incubation at 75 °C for 20 min.

[00822] The following protocol was used to anneal the insert into the pET-46 Ek/LIC vector.

Unit

Reaction (μΐ) Incubation pET-46 Ek/LIC vector 0.5

T4 DNA Polymerase treated EK/LIC insert 1

Total 1.5 22°C 5 min

Then add 0.5 μΐ 25 mM EDTA. Mix by stirring with pipet tip. Incubate at 22°C for 5 min.

[00823] The ligation mixture for Cb 1946-pET-46 Ek/LIC was introduced into E. coli JM 109 by electroporation, and the cells were plated on LB-ampicillin. After overnight incubation at 37°C, four colonies were selected and used to inoculate 6 mL cultures of LB-ampicillin. The cultures were grown at 37°C with vigorous aeration for 16 hours, and minipreps (QIAGEN) were made from the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to confirm the size of the plasmid DNA. After confirmation of the insert in the plasmid, the gene or coding sequences were sequenced to confirm their identity and integrity.

[00824] For gene expression, one of the plasmids was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with

chloramphenicol (50 μg/ml) and ampicillin (100 μg/ml) and incubated at 37°C overnight. Five colonies were inoculated into 3 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 4 hours. One mL of the culture was added to 500 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37 °C until the absorbance at 600 nm reached -0.25. The inducer, IPTG, was then added at 0.1 mM final concentration, and the culturing continued at 16°C overnight.

Gene and protein sequences of Cbl946WT. CM946TML and CM946TM2

CM946 wild- type amino acid sequence

[00825] The wild-type CM946 amino acid sequence is disclosed in SEQ ID NO: 86. The signal peptide of CM946, corresponding to amino acid numbers 1-38 of SEQ ID NO: 86 was removed during all PCR amplifications. Thus, the expressed wild- type CM946 protein did not contain amino acid numbers 1-38 of SEQ ID NO: 86. The amino acid sequence of the wild-type CM946 protein without the signal peptide is disclosed in SEQ ID NO: 87.

[00826] The procedure of cloning the gene for wild- type CM946 (without the signal peptide) into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The wild- type CM946 amino acid sequence (without the signal peptide) with the short peptide is disclosed in SEQ ID NO: 91. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 91.

CM946 wild- type nucleotide sequence

[00827] The wild-type CM 946 nucleotide sequence is disclosed in SEQ ID NO: 88. The signal peptide of CM946, corresponding to nucleotide numbers 1-114 of SEQ ID NO: 88 was removed during all PCR amplifications. Thus, the nucleotide sequence used to express wild- type CM946 protein did not contain nucleotide numbers 1-114 of SEQ ID NO: 88. The nucleotide sequence encoding the wild-type CM946 protein without the signal peptide is disclosed in SEQ ID NO: 89.

[00828] The wild- type CM946 nucleotide sequence (without the signal peptide) with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 90. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 90. CM946TM1 amino acid sequence

[00829] The CM946TM1 amino acid sequence is disclosed in SEQ ID NO: 117. The procedure of cloning the gene for CM946TM1 into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The CM946TM1 amino acid sequence with the short peptide is disclosed in SEQ ID NO: 93. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 93.

CM946TM1 nucleotide sequence

[00830] The CM946TM1 nucleotide sequence is disclosed in SEQ ID NO: 118. The CM946TM1 nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 92. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 92.

CM946TM2 amino acid sequence

[00831] The CM946TM2 amino acid sequence is disclosed in SEQ ID NO: 113. The procedure of cloning the gene for CM946TM2 into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The CM946TM2 amino acid sequence with the short peptide is disclosed in SEQ ID NO: 95. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 95.

CM946TM2 nucleotide sequence

[00832] The CM946TM2 nucleotide sequence is disclosed in SEQ ID NO: 112. The CM946TM2 nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 94. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 94.

Purification of Cbl946WT. CM946TML and CM946TM2 proteins

[00833] The Cb 1946WT and its truncated mutants Cb 1946TM 1 and Cb 1946TM2 were expressed in E. coli BL-21 CodonPlus (DE3) RIL competent cells by heat shock. The recombinant cells were then grown overnight in LB agars supplemented with ampicillin (100 μg/mL) and chloramphenicol (50 μg/ml) at 37 °C. After 8h, the starter cultures were diluted into fresh LB supplemented with ampicillin (100 μg/mL) and chloramphenicol (50 μg/ml) at 37 °C with aeration until the absorbance at 600 nm reached 0.5. Gene expression was then induced by addition of IPTG at a final concentration of 0.1 mM and the temperature for culturing was lowered to 16 °C. After 16 hours, the cells were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (25 mM Tris-HCL pH 7.8, 750 mM of NaCl, 5% glycerol, 20 mM imidazole, 1.25% Tween-20). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed three times to remove the unbound proteins. The bound protein (6-Histidine-tagged target proteins) was then eluted from the resin with an elution buffer (50 mM Tris-HCL, pH7.5, 250 mM imidazole). The eluted fractions was then heat-treated at 65 °C for 30 minutes and then centrifuged to remove the precipitated proteins. The proteins were then purified by gel filtration chromatography (HiLoad 16/20 Superdex 200, GE Healthcare) with a Tris-HCl elution buffer (50 mM Tris-HCl, 150 mM NaCl, pH 7.5). Aliquots of eluted fractions were analyzed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and proteins bands were visualized by staining with Coomassie brilliant blue G-250 (Figure 43).

Enzyme Activity

[00834] Figure 44 shows the zymogram of Cb 1946WT, Cb 1946TM 1 , and Cb 1946TM2 on carboxylmethyl cellulose (CMC) agar plate. The agar plate was prepared with CMC substrate (final 0.25%, w/v). After spotting 1 μg of each protein on agar-CMC plates, the plate was incubated at 37°C overnight and then the gel was visualized by staining with 0.1% Congo red and destaining with 1M NaCl. As shown in Figure 44, CM946WT and CM946TM2 showed significant halos on the agar plate indicating cellulase activity, but not CM953TM1 proteins.

[00835] Figure 45 shows the enzymatic activity of Cb 1946WT, Cb 1946TM 1 , Cb 1946TM2 on phosphoric acid swollen cellulose (PASC). Each enzyme (final 0.5 μΜ) was reacted with phosphoric acid swollen cellulose (PASC) at 1% final concentration in 50 mM citrate- 150 mM NaCl, pH 6.0 at 75°C for 16 hours. The reactions were resolved by thin layer chromatography (TLC). The mobile phase consisted of n-butanol: acetic acid:H20, 10:5: 1 (vol/vol/vol) and 10cm x 20cm plates were used. For more quantitative analysis of the products of hydrolysis, the samples were analyzed by high performance anion-exchange chromatography (HPAEC) (Figure 46). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected into a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA Biosciences (Chelmsford, MA). For the TLC and HPLC analysis, glucose and five different cellooligosaccharides were used: cellobiose, cellotriose, cellotetraose, cellopentaose, and cellohexaose as standards. Based on the results of TLC and HPLC, CM953WT and CM953TM2 showed significant release of products such as glucose, cellobiose, cellotriose, and cellotetraose from PASC substrate, indicating that

CM946WT and CM953TM2 have cellulase activities, but not CM953TM1.

Example 13: Endocellulase Cb629

[00836] An endocellulase, Cb629, was identified in Caldicellulosiruptor bescii. The enzyme Cb629TMl is the truncational mutant of the gene product of cb629, where Cb stands for Caldicellulosiruptor bescii. The endocellulase initially cleaves glucose, cellobiose and cellotriose from cellulose. The Cb629TMl protein is 562 amino acids long and has a molecular weight of 63.7 kDa (His-tag + Cb629TMl protein). The protein has a Glycoside Hydrolase (GH) family 5 catalytic domain and a Carbohydrate Binding Module (CBM) family 17_28 domain (Figure 47). In addition there is a N-terminal signal peptide (SP) for secretion and three surface layer homology (SLH) modules likely used in anchoring the enzyme to the cell surface. Since the SP and SLH are non-catalytic, they were cleaved from the polypeptide through the PCR amplification described below and the gene product was named Cb629TMl.

Clonins of Cb629TMl

[00837] The gene for Cb629TMl was amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using PrimeSTAR DNA Polymerase (TAKARA). The

Cb629TMl gene was amplified using the following primer set: [00838] Cb629TMlForward: 5'- GAC GAC GAC AAG ATG CAG AGC ATA CTG TAT GAA AAG G -3' (SEQ ID NO: 96)

[00839] Cb629TMlReverse: 5'- GAG GAG AAG CCC GGT TAC TCA AAA AGG ATA TTG GTA AAT C -3' (SEQ ID NO: 97)

[00840] The polymerase chain reaction mixture contained the following:

PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μΕ Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 1

50 μΜ Rv Primer 1

10 mM dNTP Mixture 1

5 X PrimeSTAR Buffer 10

dH 2 Q 35.6

Total 50 μΐ,

[00841] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 2 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

[00842] After the PCR amplification described above, the amplification of Cb629TMl was confirmed by 1% agarose gel electrophoresis. The DNA corresponding to the expected band on the gel was cut out and applied to a Qiagen Gel Extraction kit to extract the DNA out of the gel.

[00843] A Novagen pET-46 Ek/LIC kit was used to treat the purified DNA and ligate it into the pET-46 Ek/LIC vector. The treatment of the purified DNA was as follows: Unit

Reaction (μΐ) Incubation

0.1 pmol purified PCR product X

1 OX T4 DNA Polymerase buffer 1

25 mM dATP 1

100 mM DTT 0.5

Nuclease-free water 7.3-X

2.5 U/μΙ T4 DNA Polymerase 02

Total 10 22°C 30 min

After the reaction, inactivate the enzyme by incubating at 75 °C for 20 min.

[00844] The following protocol was used to anneal the insert into the pET-46 Ek/LIC vector.

Unit

Reaction (μΐ) Incubation pET-46 Ek/LIC vector 0.5

T4 DNA Polymerase treated EK/LIC insert 1

Total 1.5 22°C 5 min

Then add 0.5 μΐ 25 mM EDTA. Mix by stirring with pipet tip. Incubate at 22°C for 5 min.

[00845] The ligation mixture for Cb629TMl-pET-46 Ek/LIC were introduced into E. coli NovaBlue competent cells by chemical transformation method, and the cells were plated on LB- ampicillin. After overnight incubation at 37°C, four colonies were selected and each was used to inoculate 6 mL cultures of LB-ampicillin. The cultures were grown at 37°C with vigorous aeration for 16 hours, and minipreps (QIAGEN) were made of the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to check the size of the plasmid DNA. After confirmation that the gene had been inserted into the plasmid, the genes were sequenced to confirm the integrity of the coding sequence.

[00846] For gene expression, one of the plasmids was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with

chloramphenicol (50 μg/ml) and ampicillin (100 μg/ml) and incubated at 37°C overnight. Five to six colonies were inoculated into 10 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 6 hours. Ten mL of the culture was added to 1000 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37 °C until the absorbance at 600 nm reached -0.3. The inducer, IPTG, was then added at 0.1 mM final concentration, and the culturing continued at 16°C overnight.

Protein purification

[00847] Cultures were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (50 mM Tris-HCL pH 7.5, 300 mM of NaCl). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed three times to remove the unbound proteins. The bound protein (6-Histidine-tagged Cb629TMl) was then eluted from the resin with an elution buffer (50 mM Tris-HCL, pH7.5, 250 mM imidazole).

[00848] The design of the PCR primers ensured that the protein was fused to 6-histidines encoded in the plasmid. The six histidines will bind to either a nickel-charged resin or a cobalt- charged resin. The bound protein can be displaced from the resin with a buffer containing imidazole. This method facilitates quick purification of the protein of interest.

Gene and protein sequences of Cb629WT and Cb629TMl

Cb629 wild-type amino acid sequence

[00849] The wild-type Cb629 endocellulase (EC 3.2.1.4) amino acid sequence is disclosed in SEQ ID NO: 98. The signal peptide of Cb629 corresponds to amino acid numbers 1-29 of SEQ ID NO: 98. The amino acid sequence of the wild- type Cb629 protein without the signal peptide is disclosed in SEQ ID NO: 119.

Cb629 wild-type nucleotide sequence

[00850] The wild-type Cb629 nucleotide sequence is disclosed in SEQ ID NO: 99. The signal peptide of Cb629 corresponds to nucleotide numbers 1-87 of SEQ ID NO: 99. The nucleotide sequence encoding the wild-type Cb629 protein without the signal peptide is disclosed in SEQ ID NO: 120. Cb629TMl amino acid sequence

[00851] The Cb629TMl endocellulase (EC 3.2.1.4) amino acid sequence is disclosed in SEQ ID NO: 100. The procedure of cloning the gene for Cb629TMl into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The Cb629TMl amino acid sequence with the short peptide is disclosed in SEQ ID NO: 103. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 103.

Cb629TMl nucleotide sequence

[00852] The Cb629TMl nucleotide sequence is disclosed in SEQ ID NO: 101. The

Cb629TMl nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 102. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 102.

[00853] The Cb629TMl coding sequence was expressed in E. coli cells, and the protein was purified in one step, i.e. the talon resin purification step making use of the 6-histidines encoded by the plasmid. Figure 48 shows an SDS-PAGE of purified Cb629TMl.

Enzyme Activity

[00854] Figure 49 shows the enzymatic activity of Cb629TMl on substrates with products determined through a reducing sugar assay. Three different cellulose substrates were tested: Avicel, sodium carboxymethyl cellulose (CMC-Na) and phosphoric acid swollen cellulose (PASC). Incubation of enzymes with the substrates led to release of products that were quantified as a concentration of glucose equivalents. Hydrolysis of PASC was higher than hydrolysis of the other substrates.

[00855] The concentration of glucose equivalents was determined following enzymatic hydrolysis of Avicel, CMC-Na and PASC, according to the methods of Lever, M. (supra). 1.5 mL microcentrifuge tubes were "zeroed" in an analytical balance. Next, 2 ± 0.1 mg Avicel were added to each tube, and the mass measured and recorded. The volumes that should be added to each tube were calculated based on the mass. For CMC-Na and PASC, a stock substrate solution of CMC-Na (2%) and PASC (6.11 mg/ml) were used. Sodium citrate reaction buffer and enzymes were added to each tube beginning with the reaction buffer. The tubes were incubated with constant mixing in a Thermomixer R (Eppendorf) at 75°C for 16 h. The tubes were centrifuged at 10,000 rpm for 5 min at 4°C. 50 of sample supernatant was transferred to a clean 1.5 mL centrifuge tube for the /?HBAH assay. 1 mL of a stock solution of glucose was made at a concentration of 100 mM in sodium citrate buffer, and then serial dilutions were made in sodium citrate buffer to the following concentrations (20 mM, 10 mM and 5 mM). 50 mg of /?HBAH was dissolved in 50 mL of ice-cold citrate/NaOH solution for a final concentration of 0.1% (w/v), and the solution was kept on ice. 150 μΙ_, of /?HBAH solution was added to 50 μΙ_, of the sample and glucose standard solutions, and the tubes were incubated at 100°C for 10 min. The tubes were incubated at room temperature for 5 min. The wavelength at 410 nm was measured for the standards and samples. The A4 1 o nm and glucose concentrations were plotted against each other, and linear regression was used to fit a line to the data. The coefficient of determination (R ) value was between 0.98 and 1.0. The equation from the standard curve was used to calculate the concentrations of reducing ends in the samples based on absorbance data.

[00856] Figure 50 shows the enzymatic activity of Cb629TMl on substrates using HPLC analysis. Three different cellulosic substrates were tested: Avicel, CMC-Na and PASC. In each case, in the presence of Cb629TMl, glucose and cellobiose were released. In the absence of Cb629TMl, neither glucose nor cellobiose was observed from all the substrates. The results showed that this enzyme releases glucose and cellobiose as end products from cellulosic substrates (Avicel, CMC-Na and PASC).

[00857] Figure 51 shows that this enzyme is also able to release mostly disaccharides (cellobiose) and glucose from cello-oligosaccharide. The enzyme does not cleave hydrolyze cellobiose (G2 in the figure).

[00858] Figure 52 shows the thermostability of Cb629TMl. Cb629TMl has 109%, 99%, 96%, 83% and 34% activity after incubation at 60 °C, 65 °C, 70 °C, 75 °C and 80 °C for 24 h, respectively. 500 nM Cb629TMl was kept at different temperatures (60 °C, 65 °C, 70 °C, 75 °C and 80 °C). The samples were taken out at different time points (0 h, 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h) and immediately used for enzyme activity measurement. The enzyme activity was measured at pH 5.5 and at 70 °C on a thermomixer. 2.5 mg/ml final concentration of PASC was used for measurement, and 8.31 μΐ of the protein sample was added to the substrate and mixed by pipetting up and down for several times. The total volume was 100 μΐ. The reducing ends corresponding to glucose equivalents were measured according to the methods of Lever, M. (supra). The velocity of reaction in 10 minutes was calculated. The velocity of reaction for time 0 was set as 100; then the remaining activities (percentage) for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h were calculated by dividing the velocities of reaction for time 0.5 h, 1 h, 2 h, 4 h, 7h, 11 h and 24 h by the velocity of reaction at time 0, then multiplied by 100, respectively.

Example 14: β-glucosidase Cb486

[00859] A putative β-glucosidase Cb486, was identified in Caldicellulosiruptor bescii. The enzyme is the gene product of Cb486, where Cb stands for Caldicellulosiruptor bescii. β- glucosidases catalyze the hydrolysis of cellobiose (a disaccharide of glucose) into two units of glucose. The Cb486 protein is 466 amino acids long and has a predicted molecular weight of 54.9 kDa (His-tag + Cb486 protein). The protein has a Glycoside Hydrolase (GH) family 1 catalytic domain (Figure 53A).

Clonins of Cb486

[00860] The gene for Cb486 was amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using iProof™ High-Fidelity DNA Polymerase (BIO-RAD). The Cb486 gene was amplified using the following primer set:

[00861] Cb486Forward: 5'-GAC GAC GAC AAG ATG AGT TTA CCA AAA GGA TTT CTG TGG GGT GC-3' (SEQ ID NO: 104)

[00862] Cb 1172Reverse: 5 '-GAG GAG AAG CCC GGT TAT GAG TTT TCC TTT ATA TAC TGC TG-3' (SEQ ID NO: 105)

[00863] The polymerase chain reaction mixture contained the following:

PCR reaction 2 υ/μί iProoF M High-Fidelity DNA Polymerase 0.5

17 ng/μL Caldicellulosiruptor bescii genomic DNA 1

50 μΜ Fw Primer 0.5

50 μΜ Rv Primer 0.5

10 mM dNTP Mixture 1

5 X iProof HF Buffer 10

dH 2 Q 3^5

50

Total μΐ.

[00864] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 98°C 30 sec 1 cycle

Denaturing 98°C 10 sec

Annealing 62°C 30 sec 35 cycles

Elongation 72°C 2 min

Elongation 72°C 10 min 1 cycle

Last 4°C oo

[00865] After the PCR described above, the amplification of the gene for Cb486 was confirmed by 1% agarose gel electrophoresis. The DNA corresponding to the expected band on the gel was cut out and applied to a Qiagen Gel Extraction kit to extract the DNA out of the gel.

[00866] A Novagen pET-46 Ek/LIC kit was used to treat the purified DNA and ligate it into the pET-46 Ek/LIC vector. The treatment of the purified DNA was as follows:

Unit

Reaction (μΐ) Incubation

0.1 pmol purified PCR product X

10X T4 DNA Polymerase buffer 1

25 mM dATP 1

100 mM DTT 0.5

Nuclease-free water 7.3-X

2.5 U/μΙ T4 DNA Polymerase 0.2

Total 10 22°C 30 min

After the reaction, the enzyme was inactivated by incubation at 75 °C for 20 min. [00867] The following protocol was used to anneal the insert into the pET-46 Ek/LIC vector.

Unit

Reaction (μΐ) Incubation pET-46 Ek/LIC vector 0.5

T4 DNA Polymerase treated EK/LIC insert 1

Total 1.5 22°C 5 min

Then add 0.5 μΐ 25 mM EDTA. Mix by stirring with pipet tip. Incubate at 22°C for 5 min.

[00868] The ligation mixture for Cb486-pET-46 Ek/LIC was introduced into E. coli JM109 by electroporation, and the cells were plated on LB-ampicillin. After overnight incubation at 37°C, four colonies were selected and each was used to inoculate 6 mL cultures of LB- ampicillin. The cultures were grown at 37°C with vigorous aeration for 16 hours, and minipreps (QIAGEN) were made of the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to check the size of the plasmid DNA. After confirmation that the gene had been inserted into the plasmid, the inserts were sequenced to confirm their identity and integrity of the sequence.

[00869] For gene expression, one of the plasmids was transformed into E. coli BL21 codon plus DE3 RIL by the heat shock method and plated on LB plates supplemented with

chloramphenicol (100 μg/ml) and ampicillin (50 μg/ml) and incubated at 37°C overnight. Five to six colonies were inoculated into 3 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 4 hours. One mL of the culture was added to 500 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37 °C until the absorbance at 600 nm reached -0.25. The inducer, IPTG, was then added at 0.1 mM final concentration, and the culturing continued at 16° C overnight.

Protein purification

[00870] Cultures were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (25 mM Tris-HCL pH 7.8, 750 mM of NaCl, 5% glycerol, 20 mM imidazole, 1.25% Tween-20). The proteins in the cells were released through a French pressure cell. After centrifugation to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed three times to remove the unbound proteins. The bound protein (6-Histidine-tagged Cb486) was then eluted from the resin with an elution buffer (50 mM Tris- HCL, pH7.5, 250 mM imidazole).

[00871] The gene product of Cb486 was expressed in its full-length form. The design of the PCR primers ensured that the protein was fused to 6-histidines encoded in the plasmid. The six histidines will bind to either a nickel-charged resin or a cobalt-charged resin. The bound protein can be displaced from the resin with a buffer containing imidazole. This method facilitates quick purification of the protein of interest.

Gene and protein sequences of Cb486WT

Cb486 wild-type amino acid sequence

[00872] The wild-type Cb486 β-glucosidase (EC 3.2.1.21) amino acid sequence is disclosed in SEQ ID NO: 106. The procedure of cloning the gene for wild-type Cb486 into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The wild-type Cb486 amino acid sequence with the short peptide is disclosed in SEQ ID NO: 109. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 109.

Cb486 wild-type nucleotide sequence

[00873] The wild-type Cb486 nucleotide sequence is disclosed in SEQ ID NO: 107. The wild-type Cb486 nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 108. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 108.

[00874] The Cb486 gene was expressed in E. coli cells, and the protein was purified in one step, using the talon resin purification step making use of the 6-histidines encoded by the plasmid. Figure 53B shows an SDS-PAGE of purified Cb486.

Enzyme Activity [00875] Figure 54 shows the enzymatic activity of Cb486 on xylo-oligosaccharides (X 2 -X 6 ) through Thin Layer Chromatography (TLC) analysis. The following xylo-oligosaccharides (X 2 - X 6 ) were tested: xylobiose, xylotriose, xylotetraose, xylopentaose and xylohexaose. This was done by an overnight hydrolysis of the xylo-oligosaccharides followed by resolving of the products with TLC. In each case, in the presence of Cb486, xylose and xylobiose were released. In the absence of Cb486, only minor amount of xylose was observed for xylobiose; no products of hydrolysis were released for other xylo-oligosaccharides. The results showed that this enzyme releases xylose and xylobiose from xylo-oligosaccharides (xylobiose, xylotriose, xylotetraose, xylopentaose and xylohexaose).

[00876] Figure 55 shows that this enzyme is also capable of cleaving cello-oligosaccharides from cellobiose (2 glucose units joined by beta 1,4-linkage) to cellohexaose (six glucose units linked together by beta 1,4-linkages) to glucose. Thus this enzyme when coupled with an endoglucanase that release short chains of glucose should be able to convert the short chains to the monosaccharides glucose. The multi-functional activity (cleavage of different linkages) should make this enzyme an important enzyme in enzyme mixes used in hydrolyzing complex polysaccharides.

[00877] Figures 56A and 56B show the pH and temperature profiles, respectively of the activity of Cb486.

Example 15: Cellulase mixture from Caldicellulosiruptor bescii for the hydrolysis of Miscanthus

[00878] Based on the analyses above, a cellulase mixture containing Cb629TMl, Cb486, CM946TM2, CM952TM1, CM953TM2, and CM954TM3 was reconstituted to represent Caldicellulosiruptor bescii cellulases (Figure 57). A previously reconstituted hemicellulase of Caldicellulosiruptor bescii (Figure 58) was also prepared to test synergistic effects with the cellulase mixture. All enzyme mixtures (each 0.5 μΜ) were reacted with 2%, 5%, and 8% pretreated (autoclaved Miscanthus & 1% NaOH treated+microwaved Miscanthus) samples in 50 mM citrate- 150 mM NaCl buffer (pH 6.5) at 75 Q C overnight with shaking. [00879] The reactions were resolved by thin layer chromatography (TLC). The mobile phase consisted of n-butanol: acetic acid:H 2 0, 10:5: 1 (vol/vol/vol), and 10cm x 20cm plates were used. (Figures 59, 61, 63, and 65).

[00880] For further analysis of the products of hydrolysis, the 8% substrate reaction samples were analyzed by high performance anion-exchange chromatography (HPAEC) (Figures 60, 62, 64, and 66; i.e. Figure 60 is HPAEC data of samples from Figure 59, Figure 62 is HPAEC data of samples from Figure 61, etc.). For HPAEC analyses, 100 μΐ ^ of each diluted sample was injected onto a System Gold HPLC instrument from Beckman Coulter (Fullerton, CA) equipped with CarboPac™ PA1 guard (4 x 50 mm) and analytical (4 x 250 mm) columns from Dionex Corporation (Sunnyvale, CA) and a Coulochem III electrochemical detector from ESA

Biosciences (Chelmsford, MA).

[00881] For the TLC (Figures 59, 61, 63, and 65) and HPLC (Figures 60, 62, 64, and 66) analysis, glucose (CI) and five different cellooligosaccharides were used: cellobiose (C2), cellotriose (C3), cellotetraose (C4), cellopentaose (C5), and cellohexaose (C6) as standards. For the separation of xylose and glucose, Aminex HPX-87H column (300x7.8 mm, BioRad) was used with LC-20AT HPLC (SHIMADZU) with 5 mM sulfuric acid as mobile phase and 0.6 mL/mL flow rate at 65°C.

[00882] Based on TLC and HPLC data in Figures 59-62, in the presence of both cellulases and hemicellulases, the cellulase and hemicellulase mixtures released more glucose and xylose synergistically on pretreated Miscanthus samples than the amount of glucose released by the same cellulase mixture alone or the amount of xylose released by the same hemicellulase mixture alone. For example, as shown in Figure 60, more glucose was released from the microwave pretreated Miscanthus by the cellulase mixture while in the presence of the hemicellulase mixture (lane 4, CI peak; -11 mM) than when the cellulase mixture acted on Miscanthus alone (lane 2, CI peak; ~ 7 mM). Also, as shown in Figure 60, more xylose was released from the pretreated Miscanthus by the hemicellulase mixture while in the presence of the cellulase mixture (lane 4, XI peak; ~6 mM) than when the hemicellulase mixture acted on Miscanthus alone (lane 2, XI peak; ~ 3 mM). As shown in Figures 61 and 62, synergistic effects between the cellulase and hemicellulase mixtures were also obtained with the autoclave pretreated Miscanthus. Thus, the results provided herein show the surprising result that an enzyme cocktail containing a cellulase mixture disclosed herein and a hemicellulase mixture disclosed herein shows synergistic activity between the cellulase and hemicellulase mixtures.

[00883] The results in Figures 59-62 also show that more products were released from the microwave pretreated Miscanthus (Figures 59 and 60) than the autoclave pretreated Miscanthus samples (Figures 61 and 62).

[00884] In Figures 63-66, the enzyme mixture without Cb486 (β-glucosidase) was tested on both pretreated samples. The results show that the enzyme mixtures released mainly cellobiose in the mix without β-glucosidase (Cb486). The results in lane 4 of Figure 63 and Figure 64 shows that the mixture of hemicellulase and cellulose without the beta-glucosidase will lead to xylose and mostly cellobiose from the microwaved sample. Similar data is obtained for the same experiment but with autoclaved Miscanthus as the substrate (Figures 65 and 66).

[00885] Example 16: Heat shock protein Cbl581

[00886] A small heat shock protein, CM581, was identified in Caldicellulosiruptor bescii. The protein is the gene product of CM581, where Cb stands for Caldicellulosiruptor bescii. The protein is 162 amino acids long and has a molecular weight of 19.68 kDa (His-tag + CM581 protein).

Clonins of Cb!581

[00887] The gene for Cb 1581 was amplified from Caldicellulosiruptor bescii DSM 6725T genomic DNA by PCR using PrimeSTAR DNA Polymerase (TAKARA) . The cbl581 gene was amplified using the following primer set:

[00888] Cbl581Forward: 5'-

GACGACGACAAGATGCTCAGAGACATAGTTCCATTTGGC -3' (SEQ ID NO: 144) [00889] Cbl581Reverse: 5'-

GAGGAGAAGCCCGGTTATTCTATATCAATTGTTCTTACATC -3' (SEQ ID NO: 145) [00890] The polymerase chain reaction mixture contained the following:

PCR reaction

2.5 υ/μί PrimeSTAR DNA Polymerase 0.4

17 ng/μL Caldicellulosiruptor bescii genomic DNA 1

20 μΜ Fw Primer 1

20 μΜ Rv Primer 1

2.5 mM dNTP Mixture 4

5 X PrimeSTAR Buffer 10

dH 2 Q 32.6

Total 50 μΐ.

[00891] To amplify the gene from the genomic DNA, the following PCR cycling was used:

PCR protocol

Denaturing 95°C 5 min 1 cycle

Denaturing 94°C 30 sec

Annealing 50°C 30 sec 35 cycles

Elongation 72°C 1 min

Elongation 72°C 7 min 1 cycle

Last 4°C oo

[00892] After the PCR reaction described above, the amplification of cbl581 gene was confirmed by 1% agarose gel electrophoresis. The DNA corresponding to the expected band on the gel was cut out and applied to a Qiagen Gel Extraction kit to extract the DNA out of the gel.

[00893] A Novagen pET-46 Ek/LIC kit was used to treat the purified DNA and ligate it into the pET-46 Ek/LIC vector. The treatment of the purifed DNA was as follows:

Unit

Reaction (μΐ) Incubation 0.1 pmol purified PCR product X

1 OX T4 DNA Polymerase buffer 1

25 mM dATP 1

100 mM DTT 0.5

Nuclease-free water 7.3-X

2.5 U/μΙ T4 DNA Polymerase 0.2

Total 10 22°C 30 min

After the reaction, inactivate the enzyme by incubating at 75 °C for 20 min.

[00894] The following protocol was used to anneal the insert into the pET-46 Ek/LIC vector.

Unit

Reaction (μΐ) Incubation pET-46 Ek/LIC vector 0.5

T4 DNA Polymerase treated EK/LIC insert 1

Total 1.5 22°C 5 min

Then add 0.5 μΐ 25 mM EDTA. Mix by stirring with pipet tip. Incubate at 22°C for 5 min.

[00895] The ligation mixture for cbl581 -pET-46 Ek/LIC were introduced into E. coli XL10- Gold by electroporation method, and the cells were plated on LB-ampicillin. After overnight incubation at 37°C, four colonies were selected and used to inoculate 6 mL cultures of LB- ampicillin. The cultures were grown at 37°C with vigorous aeration for 16 hours, and minipreps (QIAGEN) were made of the cell cultures. The plasmids were then electrophoresed on a 1% agarose gel to check the size of the plasmid DNA. After confirmation that the gene had been inserted into the plasmid, the genes were sequenced to confirm their identity.

[00896] For gene expression, one of the plasmids was transformed into E. coli BL21- CodonPlus (DE3)-RIPL by the heat shock method and plated on LB plates supplemented with chloramphenicol (50 μg/ml) and ampicillin (100 μg/ml) and incubated at 37°C overnight. Five to six colonies were inoculated into 10 ml of LB broth supplemented with the two antibiotics at the same concentration and cultured for 6 hours. Ten mL of the culture was added to 1000 mL of LB broth supplemented with the two antibiotics at the same concentration and cultured at 37 °C until the absorbance at 600 nm reached -0.3. The inducer, IPTG, was then added at 0.1 mM final concentration, and the culturing continued at 16° C overnight. Protein purification

[00897] Cultures were centrifuged to collect the cell pellet. The pellet was then suspended in a lysis buffer (50 mM Tris-HCl, 300 mM NaCl, pH 7.5). The proteins in the cells were released through a French pressure cell. After centrifugation at 10000 rpm for 30 minutes to pellet the cell debris, the supernatant was applied to a cobalt-charged resin (TALON, Clontech) and washed three times to remove the unbound proteins. The bound protein (6-Histidine-tagged CM581) was then eluted from the resin with an elution buffer (50 mM Tris-HCl, 300 mM NaCl, 250 mM imidazole, pH7.5).

[00898] The design of the PCR primers ensured that the protein was fused to 6-histidines encoded in the plasmid. The six histidines will bind to either a nickel-charged resin or a cobalt- charged resin. The bound protein can be displaced from the resin with a buffer containing imidazole. This method facilitates quick purification of the protein of interest.

Gene and protein sequences of Cbl581

[00899] CM581 wild- type amino acid sequence

[00900] The wild-type CM581 amino acid sequence is disclosed in SEQ ID NO: 146. The procedure of cloning the gene for wild- type CM581 into the plasmid pET-46 Ek/LIC led to fusion of the gene to a short nucleotide sequence encoding a peptide that contains six histidines. The wild-type Cb486 amino acid sequence with the short peptide is disclosed in SEQ ID NO: 149. The amino acids of the short peptide are amino acids 1-14 of SEQ ID NO: 149.

CM581 wild- type nucleotide sequence

[00901] The wild-type CM581 nucleotide sequence is disclosed in SEQ ID NO: 147. The wild- type CM581 nucleotide sequence with the coding sequence for the short peptide from the plasmid pET-46 Ek/LIC is disclosed in SEQ ID NO: 148. The nucleotides coding for the short peptide nucleotides are nucleotides 1-42 of SEQ ID NO: 148. [00902] The cbl581 gene was expressed in E. coli cells, and the protein was purified in one step, that is, a talon resin purification step making use of the 6-histidines encoded by the plasmid. Figure 83 shows an SDS-PAGE of purified CM581.

Enhancing enzymatic hydrolysis of microwave pretreated miscanthus

[00903] Figure 84 shows the enhancing effect of Cb 1581 on enzymatic hydrolysis of microwave pretreated miscanthus at 70°C (A) or 80°C (B). The hydrolysis was carried out at pH 6.0 using 0.5 μΜ each of the cellulase/hemicellulase enzyme mixture in a total volume of 500 μΐ with 10% miscanthus as the substrate. The enzymes in the mixture include CM946TM2, CM952TM1, CM953TM2, CM954TM3, Cb629TMl, Cb486, Cbl93, Cbl95, Cb2487, CM 172, Cb909, and Cbl62. Two mL microcentrifuge tubes were "zeroed" in an analytical balance.

Next, 50 ± 0.2 mg microwave pretreated miscanthus were added to each tube. The tubes were incubated with constant rotation in a EchoTherm™RTl 1 Variable Speed Rotating Mixers (Torrey Pines Scientific) at 70°C or 80°C for 24 h.

[00904] The concentration of glucose equivalents was determined following enzymatic hydrolysis of microwave pretreated miscanthus, according to the methods of Lever, M. (A new reaction for colorimetric determination carbohydrates. Anal. Biochem. 1972: 47; 273-279). After the reaction, the tubes were centrifuged at 10,000 rpm for 5 min at 4°C. 45 of water and 5 of sample supernatant were transferred to a clean 1.5 mL centrifuge tube for the /?HBAH assay. 1 mL of a stock solution of glucose was made at a concentration of 100 mM in sodium citrate buffer, and then serial dilutions were made in sodium citrate buffer to the following

concentrations (20 mM, 10 mM and 5 mM). 50 mg of /?HBAH was dissolved in 50 mL of ice- cold citrate/NaOH solution for a final concentration of 0.1% (w/v), and the solution was kept on ice. 150 of /?HBAH solution was added to 50 μί of the sample and glucose standard solutions, and the tubes were incubated at 100°C for 10 min. The tubes were incubated at room temperature for 5 min. The wavelength at 410 nm was measured for the standards and samples. The A4 1 onm and glucose concentrations were plotted against each other, and linear regression was used to fit a line to the data. The correlation coefficient (R ) value was between 0.98 and 1.0. The equation from the standard curve was used to calculate the concentrations of reducing ends in the samples based upon their absorbance. The releasing of sugars is enhanced with the increasing amount of C 581 in the reaction mixture at both 70°C and 80°C.

References

1. Ando, S., H. Ishida, Y. Kosugi, and K. Ishikawa. 2002. Hyperthermo stable

endoglucanase from Pyrococcus horikoshii. Appl Environ Microbiol 68:430-433.

2. Arai, T., et al. 2001. Sequence of celQ and properties of celQ, a component of the Clostridium thermocellum cellulosome. Appl Microbiol Biotechnol 57:660-666.

3. Bauer, M. W., et al. 1999. An endoglucanase, EglA, from the hyperthermophilic archaeon Pyrococcus furiosus hydrolyzes b-1,4 bonds in mixed-linkage (1— >3),(1— >4)-b-D- glucans and cellulose. J Bacteriol 181:284-290.

4. Blumer-Schuette, S. E., D. L. Lewis, and R. M. Kelly. 2010. Phylogenetic,

microbiological, and glycoside hydrolase diversities within the extremely thermophilic, plant biomass-degrading genus Caldicellulosiruptor. Appl Environ Microbiol 76:8084-8092.

5. Bok, J. D., D. A. Yernool, and D. E. Eveleigh. 1998. Purification, characterization, and molecular analysis of thermostable cellulases CelA and CelB from Thermotoga neapolitana. Appl Environ Microbiol 64:4774-4781.

6. Bronnenmeier, K., and W. L. Staudenbauer. 1990. Cellulose hydrolysis by a highly thermostable endo-l,4-b-glucanase (Avicelase I) from Clostridium stercorarlum. Enzyme Microb Technol 12:431-436.

7. Chiriac, A. I., et al. 2010. Engineering a family 9 processive endoglucanase from

Paenibacillus barcinonensis displaying a novel architecture. Appl Microbiol Biotechnol 86: 1125-1134.

8. Dam, P., et al. 2011. Insights into plant biomass conversion from the genome of the anaerobic thermophilic bacterium Caldicellulosiruptor bescii DSM 6725. Nucleic Acids Res 39:3240-3254. 9. Dodd, D., et al. 2009. Biochemical analysis of a b-D-xylosidase and a bifunctional xylanase-ferulic acid esterase from a xylanolytic gene cluster in Prevotella ruminicola 23. J Bacteriol 191:3328-3338.

10. Gal, L., et al. 1997. CelG from Clostridium cellulolyticum: a multidomain endoglucanase acting efficiently on crystalline cellulose. J Bacteriol 179:6595-6601.

11. Gibbs, M. D., et al. 2000. Multidomain and multifunctional glycosyl hydrolases from the extreme thermophile Caldicellulosiruptor isolate Tok7B.l. Curr Microbiol 40:333-340.

12. Gilad, R., et al. 2003. Cell, a noncellulosomal family 9 enzyme from Clostridium thermocellum, is a processive endoglucanase that degrades crystalline cellulose. J Bacteriol 185:391-398.

13. Hoffman, G. G., O. Davulcu, S. Sona, and W. R. Ellington. 2008. Contributions to catalysis and potential interactions of the three catalytic domains in a contiguous trimeric creatine kinase. FEBS J 275:646-654.

14. Honda, Y., N. Shimaya, K. Ishisaki, M. Ebihara, and H. Taniguchi. 2011. Elucidation of exo-b-D-glucosaminidase activity of a family 9 glycoside hydrolase (PBPRA0520) from

Photobacterium profundum SS9. Glycobiology 21:503-511.

15. Horn, S. J., et al. 2006. Costs and benefits of processivity in enzymatic degradation of recalcitrant polysaccharides. Proc Natl Acad Sci U S A 103: 18089-18094.

16. Irwin, D., et al. 1998. Roles of the catalytic domain and two cellulose binding domains of Thermomonospora fusca E4 in cellulose hydrolysis. J Bacteriol 180: 1709-1714.

17. Irwin, D. C, M. Spezio, L. P. Walker, and D. B. Wilson. 1993. Activity studies of eight purified cellulases: Specificity, synergism, and binding domain effects. Biotechnol Bioeng 42: 1002-1013. 18. Jauris, S., et al. 1990. Sequence analysis of the Clostridium stercorarium celZ gene encoding a thermo active cellulase (Avicelase I): identification of catalytic and cellulose-binding domains. Mol Gen Genet 223:258-267.

19. Jindou, S., et al. 2006. Novel architecture of family-9 glycoside hydrolases identified in cellulosomal enzymes of Acetivibrio cellulolyticus and Clostridium thermocellum. FEMS Microbiol Lett 254:308-316.

20. Kataeva, I. A., et al. 2009. Genome sequence of the anaerobic, thermophilic, and cellulolytic bacterium "Anaerocellum thermophilum" DSM 6725. J Bacteriol 191:3760-3761.

21. Lever, M. 1972. A new reaction for colorimetric determination of carbohydrates. Anal Biochem 47:273-279.

22. Li, Y., D. C. Irwin, and D. B. Wilson. 2010. Increased crystalline cellulose activity via combinations of amino acid changes in the family 9 catalytic domain and family 3c cellulose binding module of Thermobifida fusca Cel9A. Appl Environ Microbiol 76:2582-2588.

23. Li, Y., D. C. Irwin, and D. B. Wilson. 2007. Processivity, substrate binding, and mechanism of cellulose hydrolysis by Thermobifida fusca Cel9A. Appl Environ Microbiol 73:3165-3172.

24. Liang, C, et al. 2011. Cloning and characterization of a thermostable and halo-tolerant endoglucanase from Thermoanaerobacter tengcongensis MB4. Appl Microbiol Biotechnol 89:315-326.

25. Lochner, A., et al. 2011. Use of label-free quantitative proteomics to distinguish the secreted cellulolytic systems of Caldicellulosiruptor bescii and Caldicellulosiruptor obsidiansis. Appl Environ Microbiol 77:4042-4054.

26. Mandelman, D., et al. 2003. X-Ray crystal structure of the multidomain endoglucanase Cel9G from Clostridium cellulolyticum complexed with natural and synthetic cello- oligosaccharides. J Bacteriol 185:4127-4135. 27. Meinke, A., et al. 1991. Unusual sequence organization in CenB, an inverting endoglucanase from Cellulomonas fimi. J Bacteriol 173:308-314.

28. Mejia-Castillo, T., M. E. Hidalgo-Lara, L. G. Brieba, and J. Ortega-Lopez. 2008.

Purification, characterization and modular organization of a cellulose-binding protein, CBP105, a processive b-l,4-endoglucanase from Cellulomonas flavigena. Biotechnol Lett 30:681-687.

29. Moon, Y. H., M. Iakiviak, S. Bauer, R. I. Mackie, and I. K. Cann. 2011. Biochemical analyses of multiple endoxylanases from the rumen bacterium Ruminococcus albus 8 and their synergistic activities with accessory hemicellulose degrading enzymes. Appl Environ Microbiol doi: 10.1128/AEM.00353- 11.

30. Park, J. K., L. X. Wang, H. V. Patel, and S. Roseman. 2002. Molecular cloning and characterization of a unique b-glucosidase from Vibrio choleme. J Biol Chem 277:29555-29560.

31. Parsiegla, G., A. Belaich, J. P. Belaich, and R. Haser. 2002. Crystal structure of the cellulase Cel9M enlightens structure/function relationships of the variable catalytic modules in glycoside hydrolases. Biochemistry 41: 11134-11142.

32. Pastor, F. I., et al. 2001. Molecular cloning and characterization of a multidomain endoglucanase from Paenibacillus sp BP-23: evaluation of its performance in pulp refining. Appl Microbiol Biotechnol 55:61-68.

33. Qi, M., H. S. Jun, and C. W. Forsberg. 2008. Cel9D, an atypical 1,4-b-D-glucan glucohydrolase from Fibrobacter succinogenes: characteristics, catalytic residues, and synergistic interactions with other cellulases. J Bacteriol 190: 1976-1984.

34. Sakon, J., D. Irwin, D. B. Wilson, and P. A. Karplus. 1997. Structure and mechanism of endo/exocellulase E4 from Thermomonospora fusca. Nat Struct Biol 4:810-818.

35. Saul, D. J., et al. 1990. celB, a gene coding for a bifunctional cellulase from the extreme thermophile " Caldocellum saccharolyticum" . Appl Environ Microbiol 56:3117-3124. 36. Schubot, F. D., et al. 2004. Structural basis for the exocellulase activity of the cellobiohydrolase CbhA from Clostridium thermocellum. Biochemistry 43: 1163-1170.

37. Su, X., et al. 2010. Mutational insights into the roles of amino acid residues in ligand binding for two closely related family 16 carbohydrate binding modules. J Biol Chem

285:34665-34676.

38. Tamaru, Y., S. Karita, A. Ibrahim, H. Chan, and R. H. Doi. 2000. A large gene cluster for the Clostridium cellulovorans cellulosome. J Bacteriol 182:5906-5910.

39. Tolonen, A. C, A. C. Chilaka, and G. M. Church. 2009. Targeted gene inactivation in Clostridium phytofermentans shows that cellulose degradation requires the family 9 hydrolase Cphy3367. Mol Microbiol 74: 1300-1313.

40. Uda, K., et al. 2008. Two-domain arginine kinase from the deep-sea clam Calyptogena kaikoi— evidence of two active domains. Comp Biochem Physiol B Biochem Mol Biol 151: 176- 182.

41. VanFossen, A. L., I. Ozdemir, S. L. Zelin, and R. M. Kelly. 2011. Glycoside hydrolase inventory drives plant polysaccharide deconstruction by the extremely thermophilic bacterium Caldicellulosiruptor saccharolyticus. Biotechnol Bioeng 108: 1559-1569.

42. Vlasenko, E., M. Schulein, J. Cherry, and F. Xu. Substrate specificity of family 5, 6, 7, 9, 12, and 45 endoglucanases. Bioresour Technol 101:2405-2411.

43. Yang, S. J., et al. 2010. Classification of 'Anaerocellum thermophilum' strain DSM 6725 as Caldicellulosiruptor bescii sp. nov. Int J Syst Evol Microbiol 60:2011-2015.

44. Yang, S. J., et al. 2009. Efficient degradation of lignocellulosic plant biomass, without pretreatment, by the thermophilic anaerobe "Anaerocellum thermophilum" DSM 6725. Appl Environ Microbiol 75:4762-4769.

45. Yeoman, C. J., et al. 2010. Thermostable enzymes as biocatalysts in the biofuel industry. Adv Appl Microbiol 70: 1-55. 46. Yoshida, S., R. I. Mackie, and I. K. Cann. 2010. Biochemical and domain analyses of FSUAxe6B, a modular acetyl xylan esterase, identify a unique carbohydrate binding module in Fibrobacter succinogenes S85. J Bacteriol 192:483-493.

47. Zakariassen, H., et al. 2009. Aromatic residues in the catalytic center of chitinase A from Serratia marcescens affect processivity, enzyme activity, and biomass converting efficiency. J Biol Chem 284: 10610-10617.

48. Zhang, X. Z., N. Sathitsuksanoh, and Y. H. Zhang. 2010. Glycoside hydrolase family 9 processive endoglucanase from Clostridium phytofermentans: heterologous expression, characterization, and synergy with family 48 cellobiohydrolase. Bioresour Technol 101:5534- 5538.

49. Zverlov, V., S. Mahr, K. Riedel, and K. Bronnenmeier. 1998. Properties and gene structure of a bifunctional cellulolytic enzyme (CelA) from the extreme thermophile

'Anaerocellum thermophilum' with separate glycosyl hydrolase family 9 and 48 catalytic domains. Microbiology 144 ( Pt 2):457-465.

50. Zverlov, V. V., G. A. Velikodvorskaya, and W. H. Schwarz. 2003. Two new cellulosome components encoded downstream of cell in the genome of Clostridium thermocellum: the non- processive endoglucanase CelN and the possibly structural protein CseP. Microbiology 149:515- 524.