Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
PECTINASES WITH IMPROVED THERMOSTABILITY
Document Type and Number:
WIPO Patent Application WO/2017/102544
Kind Code:
A1
Abstract:
The invention is in the field of protein chemistry, in particular in the field of enzymology. It provides pectinases, i.e. polypeptides with pectin-degrading properties. In particular the invention provides polypeptides with pectate lyase activity (EC 4.2.2.2). Enzymes according to the invention have improved properties, such as improved thermostability and decreased calcium dependence.

Inventors:
BIRIKH KLARA (FI)
SUONPÄÄ ANU MINNA MAARET (FI)
Application Number:
PCT/EP2016/080265
Publication Date:
June 22, 2017
Filing Date:
December 08, 2016
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
METGEN OY (FI)
International Classes:
C12N9/88
Domestic Patent References:
WO2008138109A12008-11-20
Other References:
DATABASE UniProt [online] 1 April 2015 (2015-04-01), "SubName: Full=Ptate lyase {ECO:0000313|EMBL:KIL52125.1}; EC=4.2.2.2 {ECO:0000313|EMBL:KIL52125.1};", XP002757741, retrieved from EBI accession no. UNIPROT:A0A0C2RPE1 Database accession no. A0A0C2RPE1
Z. XIAO ET AL: "Improvement of the Thermostability and Activity of a Pectate Lyase by Single Amino Acid Substitutions, Using a Strategy Based on Melting-Temperature-Guided Sequence Alignment", APPLIED AND ENVIRONMENTAL MICROBIOLOGY, vol. 74, no. 4, 15 February 2008 (2008-02-15), US, pages 1183 - 1189, XP055272786, ISSN: 0099-2240, DOI: 10.1128/AEM.02220-07
CHILIVERI SWARUPA RANI ET AL: "A novel thermostable, alkaline pectate lyase fromBacillus tequilensisSV11 with potential in textile industry", CARBOHYDRATE POLYMERS, APPLIED SCIENCE PUBLISHERS, LTD. BARKING, GB, vol. 111, 26 April 2014 (2014-04-26), pages 264 - 272, XP029035851, ISSN: 0144-8617, DOI: 10.1016/J.CARBPOL.2014.04.065
TETSUKO NAKANIWA ET AL: "An in vitro evaluation of a thermostable pectate lyase by using error-prone PCR", JOURNAL OF MOLECULAR CATALYSIS. B, ENZYMATIC, vol. 27, no. 2-3, 1 February 2004 (2004-02-01), NL, pages 127 - 131, XP055272910, ISSN: 1381-1177, DOI: 10.1016/j.molcatb.2003.10.005
LIANG CHAONING ET AL: "Improving the thermoactivity and thermostability of pectate lyase fromBacillus pumilusfor ramie degumming", APPLIED MICROBIOLOGY AND BIOTECHNOLOGY, SPRINGER, DE, vol. 99, no. 6, 8 October 2014 (2014-10-08), pages 2673 - 2682, XP035459129, ISSN: 0175-7598, [retrieved on 20141008], DOI: 10.1007/S00253-014-6091-Y
Attorney, Agent or Firm:
HABETS, Winand Johannes Antonius (NL)
Download PDF:
Claims:
CLAIMS

1 . A polypeptide with pectate lyase activity comprising an amino acid sequence that is at least 70% identical to the amino acid according to SEQ ID NO: 1 , wherein the polypeptide comprises a small, polar, non-charged amino acid residue at an amino acid position corresponding to position 235 in SEQ ID NO: 1 and wherein the polypeptide has an improved thermostability.

2. Polypeptide according to claim 1 wherein the small, polar, non-charged amino acid residue is selected from the group consisting of amino acids Threonine, Serine, Asparagine and Cysteine.

3. Polypeptide according to claim 2 wherein the small, polar, non-charged amino acid residue is Threonine.

4. Polypeptide according to any one of claim 1 - 3 comprising an amino acid

sequence that is at least 75% identical to the amino acid according to SEQ ID NO: 1 , such as 80%, 85%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or even 100%

5. Polypeptide according to any one of claims 1 - 4 additionally comprising a

leucine residue at an amino acid position corresponding to position 231 in SEQ ID NO: 1 .

6. Polypeptide according to any one of claims 1 - 5 wherein the polypeptide is an isolated polypeptide.

7. Composition comprising a polypeptide according to any one of claims 1 - 6.

8. Nucleic acid encoding a polypeptide according to any one of claims 1 - 6.

9. Vector comprising a nucleic acid according to claim 8.

10. Composition comprising a nucleic acid or a vector according to claims 8 or 9.

1 1 . Recombinant host cell comprising a nucleic acid according to claim 8, a vector according to claim 9 or a composition according to claim 10.

12. Recombinant host cell according to claim 1 1 selected from the group consisting of Escherichia coli, Bacillus, Corynebacterium, Pseudomonas, Pichia pastoris, Saccharomyces cerevisiae, Yarrowia lipolytica, filamentous fungi, yeast and insect cells.

13. Method for producing a polypeptide according to any one of claims 1 - 6,

comprising the steps of:

a. culturing a recombinant host cell according to claim 10 or 1 1 under

conditions suitable for the production of the polypeptide, and

b. recovering the polypeptide obtained, and

c. optionally purifying said polypeptide.

14. Use of a polypeptide according to any one of claims 1 - 6 in an application

selected from the group consisting of pulp delignification, degrading or decreasing the structural integrity of lignocellulosic material, textile dye bleaching, wastewater detoxifixation, xenobiotic detoxification, production of a sugar from a lignocellulosic material and recovering cellulose from a biomass.

15. Method for improving the thermostability of a polypeptide with pectate lyase

activity comprising an amino acid sequence that is at least 70% identical to the amino acid according to SEQ ID NO: 1 , the method comprising the step of altering the amino acid at a position corresponding to position 235 in SEQ ID NO: 1 to a a small, polar, non-charged amino acid residue, optionally additionally comprising the step of altering the amino acid at a position corresponding to position 231 in SEQ ID NO: 1 to a leucine residue.

Description:
PECTINASES WITH IMPROVED THERMOSTABILITY.

Field of the invention

The invention is in the field of protein chemistry, in particular in the field of enzymology. It provides pectinases, i.e. polypeptides with pectin-degrading properties. In particular the invention provides polypeptides with pectate lyase activity (EC 4.2.2.2). Enzymes according to the invention have improved properties, such as improved thermostability. Background of the invention

Plant cell wall degrading enzymes are carbohydrate-active enzymes that have been classified in different families based on homology criteria [http:// www.cazy.org/, Cantarel et al., 2009, Nucleic Acids Res 37: D233-D238].

Pectate lyases (EC 4.2.2.2), are an important group of plant cell wall degrading enzymes. They cleave pectin using an eliminative cleavage of (1 ->4)-alpha-D- galacturonan yielding oligosaccharides with 4-deoxy-alpha-D-galact-4-enuronosyl groups at their non-reducing ends. They are mainly produced by plant pathogens and plant- associated organisms, and only rarely by animals. Pectate lyases are also commonly produced in bacteria, either by bacteria living in close proximity with plants or by gut bacteria that find plant material in the digestive tract of their hosts. [Hugouvieux-Cotte-

Pattat et al., Environmental Microbiology reports (2014) doi 10, 1 1 1 1/1758-2229, 12166].

Pectate lyases favor pectate, the anion, over pectin, the methylated ester, which is the preferred substrate of pectin lyase EC 4.2.2.10. Pectate lyases are also known under different names, such as alpha-1 ,4-D-endopolygalacturonic acid lyase, endo-alpha-1 ,4-polygalacturonic acid lyase, endogalacturonate transeliminase, endopectin methyltranseliminase, pectate transeliminase, pectic acid lyase, pectic acid transeliminase, pectic lyase, pectin trans-eliminase, PGA lyase, polygalacturonate lyase, polygalacturonic acid lyase, polygalacturonic acid trans-eliminase, polygalacturonic transeliminase and PPase-N.

When pectate lyases are used in industrial processes, it is often advantageous that they are stable at higher temperatures (thermostable) and resistant to alkaline conditions. Thermostable alkaline pectate lyases for instance have potential applications in the textile industry as an alternative to chemical-based ramie degumming processes. Such enzymes have been described, and have been isolated and characterized from bacterial sources, mainly Bacillus [Swarupa Rani Chiliveri et al., Carbohydrate Polymers (2014), 1 1 1 : 264-272, Zhou et al., Appl Environ Microbiol (2015) 81 : 5714-5723]. Cleavage by pectate lyases requires the presence of cations, such as manganese, nickel, iron, cobalt or calcium ions [Celia Marin-Rodriguez et al., J. Exp. Bot. (2002) 53: 21 15-21 19, Hugouvieux-Cotte-Pattat et al., Environmental Microbiology reports (2014) doi 10, 1 1 1 1/1758-2229, 12166], with only rare exceptions [Kazemi-Pour et al., Proteomics (2004) 10: 3177-3186].

Recently, a thermostable pectate lyase was isolated from Bacillus, cloned, sequenced and characterized [Takao et al, Biosci. Biotechnol. Biochem. (2000) 64: 2360- 2367, Takao et al., Biosci. Biotechnol. Biochem. (2001 ) 65: 322-329].

This enzyme was described to be thermostable when produced in a homologous expression system in Bacillus subtilis, and capable of resisting pre-incubation for 30 minutes at 70 degrees Celsius [Takao et al., Biosci. Biotechnol. Biochem. (2001 ) 65: 322-329]. However, pre-incubation at 80 degrees Celsius completely abolished the enzymatic activity. Because many industrial processes are preferably performed at temperatures above 70 degrees Celsius, there is a need in the art for even more thermostable or thermoresistant polypeptides with pectate lyase activity.

Summary of the invention

The present invention addresses this need in that it provides a pectate lyase with improved thermostable properties. More in particular, the invention provides a polypeptide with pectate lyase activity comprising an amino acid sequence that is at least

70% identical to the amino acid according to SEQ ID NO: 1 , wherein the polypeptide comprises a small, polar, non-charged amino acid residue at an amino acid position corresponding to position 235 in SEQ ID NO: 1 .

The invention also relates to a composition comprising a polypeptide as described above, a nucleic acid encoding a polypeptide as described above, a vector comprising such a nucleic acid and a composition comprising such a nucleic acid or a vector.

The invention also provides a recombinant host cell comprising a nucleic acid, a vector or a composition as described above.

Moreover, the invention relates to a method for producing a polypeptide as described above, comprising the steps of: culturing a recombinant host cell as described above, under conditions suitable for the production of the polypeptide, and recovering the polypeptide obtained, and optionally purifying the polypeptide.

In addition, the invention relates to a polypeptide as described above in an application selected from the group consisting of pulp delignification, degrading or decreasing the structural integrity of lignocellulosic material, textile dye bleaching, wastewater detoxifixation, xenobiotic detoxification, production of a sugar from a lignocellulosic material and recovering cellulose from a biomass.

The invention also relates to a method for improving the thermostability of a polypeptide with pectate lyase activity comprising an amino acid sequence that is at least 70% identical to the amino acid according to SEQ ID NO: 1 , the method comprising the step of altering the amino acid at a position corresponding to position 235 in SEQ ID NO: 1 to a a small, polar, non-charged amino acid residue.

Legend to the figures

Figure 1 : Diagram showing the relative pectate lyase activity of polypeptides according to SEQ ID NO: 1 (WT) and its K235 variants SEQ ID NO: 2 (K235T), SEQ ID NO: 3 (K235S), SEQ ID NO: 4 (K235N), and SEQ ID NO: 5 (K235C). Pectate lyase activity was determined after a pre-incubation of 10 minutes at elevated temperatures. RT = Room Temperature, 70 C is 70 degrees Celsius.

Figure 2: Diagram showing the relative pectate lyase activity of polypeptides according to SEQ ID NO: 6 (93% identity with SEQ ID NO: 1 ), and its K235 variants SEQ ID NO: 7 (K235T), SEQ ID NO: 8 (K235S), SEQ ID NO: 9 (K235N), and SEQ ID NO: 10 (K235C). Pectate lyase activity was determined after a pre-incubation of 10 minutes at elevated temperatures. RT = Room Temperature, 70 C is 70 degrees Celsius.

Figure 3: Diagram showing the relative pectate lyase activity of polypeptides according to SEQ ID NO: 1 1 (89% identity with SEQ ID NO: 1 ), and its K235 variants SEQ ID NO: 12 (K235T), SEQ ID NO: 13 (K235S), SEQ ID NO: 14 (K235N), and SEQ ID NO: 15 (K235C). Pectate lyase activity was determined after a pre-incubation of 10 minutes at elevated temperatures. RT = Room Temperature, 70 C is 70 degrees Celsius.

Figure 4: Diagram showing the relative pectate lyase activity of polypeptides according to SEQ ID NO: 1 (WT) and its variants A231 L, K235S, K235T, A231 L+K235S and A231 L+K235T. Pectate lyase activity was determined after a pre-incubation of 10 minutes at elevated temperatures. RT = Room Temperature, 70 C is 70 degrees Celsius.

Figure 5: Diagram showing the relative pectate lyase activity of polypeptides according to SEQ ID NO: 6 (93% identical with SEQ ID NO: 1 ) and its variants A231 L,

K235N and A231 L+K235N. Pectate lyase activity was determined after a pre-incubation of 10 minutes at elevated temperatures. RT = Room Temperature, 70 C is 70 degrees Celsius.

Figure 6: Diagram showing the relative pectate lyase activity of polypeptides according to SEQ ID NO: 1 1 (89% identical with SEQ ID NO: 1 ) and its variants A231 L,

K235C and A231 L+K235C. Pectate lyase activity was determined after a pre-incubation of 10 minutes at elevated temperatures. RT = Room Temperature, 70 C is 70 degrees Celsius.

Detailed description of the invention

The present invention is based on our observation that a single amino acid substitution (K235 variant) in different pectate lyases improves their thermostability.

The term "amino acid substitution" is used herein the same way as it is commonly used, i.e. the term refers to a replacement of one or more amino acids in a protein with one or more other amino acids. Such an amino acid substitution may also be referred to as a mutation, a variation or a variant.

We observed the same phenomenon in pectate lyases that were homologous to the polypeptide with an amino acid sequence according to SEQ ID NO: 1. When the same amino acid variations were introduced at position 235 in polypeptides that were 93% and 89% identical to the polypeptide according to SEQ ID NO: 1 , this also improved the thermostability of the homologous enzymes.

The invention thus relates to a polypeptide with pectate lyase activity comprising an amino acid sequence that is at least 70% identical to the amino acid according to SEQ ID NO: 1 , wherein the polypeptide comprises a small, polar, non- charged amino acid residue at an amino acid position corresponding to position 235 in SEQ ID NO: 1.

Polypeptides with pectate lyase activity are also referred herein as pectate lyases, or pectate lyase enzymes.

The term "pectate lyase activity" is used herein to indicate the ability of a polypeptide to cleave pectin using an eliminative cleavage of (1 ->4)-alpha-D-galacturonan yielding oligosaccharides with 4-deoxy-alpha-D-galact-4-enuronosyl groups at their non- reducing ends.

The term "at least 70%" is used herein to include at least 71 %, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81 %, 82%, 83%, 84%, 85%, 88%, 87%, 88%, 89%, 90% or more, such as 91 %, 92%, 93%, 94%, 95%, 99%, 97%, 98%, 99%, or even 100%.

As used herein, the degree of identity between two or more amino acid sequences is equivalent to a function of the number of identical positions shared by the sequences; i.e., % identity = number of identical positions divided by the total number of aligned positions x 100, excluding gaps, which need to be introduced for optimal alignment of the two sequences, and overhangs. The alignment of two sequences is to be performed over the full length of the polypeptides. The comparison (aligning) of sequences is a routine task for the skilled person and can be accomplished using standard methods known in the art. For example, a freeware conventionally used for this purpose is "Align" tool at NCBI recourse http://blast.ncbi. nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastSearch&BLAST_SPEC=b last2s eq&LINK_LOC=align2seq, Other commercial and open software such as Vector NTI are also suitable for this purpose,

Introduction of a specific mutation in a recombinant gene is also among the routine skills of a molecular biologist. Specific guidance may be obtained from

Methods in Molecular Biology Vol 182, "In vitro mutagenesis protocols", Eds Jeff Braman, Humana Press 2002. There are commercially available kits for performing site-directed mutagenesis (for example, QuikChange II XL Site-Directed Mutagenesis kit Agilent Technologies cat No 200521 ).

SEQ ID NO: 1 provides the amino acid sequence of a known polypeptide [Takao et al, Biosci. Biotechnol. Biochem. (2000) 64: 2360-2367, Takao et al., Biosci. Biotechnol. Biochem. (2001 ) 65: 322-329] with pectate lyase activity. We replaced the lysine residue at position 235 of SEQ ID NO: 1 with a small, polar, non-charged amino acid residue in order to obtain the K235 variants described herein. Exemplified herein are the variants K235T, K235C, K235S and K235N. This annotation is used herein to indicate a replacement of the amino acid residue corresponding to position 235 of SEQ ID NO: 1 with either one of the residues T (threonine), C (cysteine), S (serine) or N (asparagine), thereby obtaining the polypeptides according to SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 and SEQ ID NO: 5, respectively. We found that the thermostability of the enzyme was thereby remarkably improved.

The term "mutant protein" or "mutation" is also used herein to refer to a polypeptide with pectate lyase activity as described herein, wherein the polypeptide comprises a small, polar, non-charged amino acid residue at an amino acid position corresponding to position 235 in SEQ ID NO: 1 .

The term "wild type protein" is also used herein to indicate a polypeptide identical to the mutant protein, with the exception that it does not comprise a small, polar, non-charged amino acid residue at an amino acid position corresponding to position 235 in SEQ ID NO: 1 .

The term "improved thermostability" in reference to a mutant polypeptide, as used herein, means that the mutant polypeptide has a higher residual pectate lyase activity than the corresponding wild type protein, after incubation for 10 minutes in 50 mM Tris-HCI pH 8.0 at a suitable temperature.

The term "suitable temperature" as used in this context refers to a temperature at which the wild type protein loses part of its pectate lyase activity after 10 minutes of incubation in 50 mM Tris-HCI pH 8.0. In other words, the term "suitable temperature" refers to a temperature chosen from a temperature range between temperatures X and Y, wherein X is the lowest temperature at which a wild type polypeptide shows a detectable loss of activity after 10 minutes of incubation in 50 mM Tris-HCI pH 8.0 and wherein temperature Y is the lowest temperature at which a wild type polypeptide loses all activity after 10 minutes of incubation in 50 mM Tris-HCI pH 8.0.

More specifically, in the thermostability assay, the polypeptides were heated to 70, 75 or 80 degrees Celsius for 10 minutes in 50 mM Tris-HCI at pH 8.0. The residual activity was measured at 60 degrees Celsius at pH 8.0 as described in example 5 and compared to the residual activity of the same polypeptides after preincubation at room temperature for 10 minutes. The results are shown in figures 1 - 3.

In more detail, we measured the residual relative pectate lyase activity after heat treatment of polypeptides with an amino acid sequence according to SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 and SEQ ID NO: 5 and compared this activity to that of a polypeptide with an amino acid sequence of SEQ ID NO: 1 after the same pre-treatment. We found that the introduction of the K235 mutation improved the thermostability of the pectate lyase enzyme.

This phenomenon appeared not to be restricted to the polypeptide with an amino acid sequence according to SEQ ID NO: 1 , but was also observed in polypeptides homologous to the polypeptide according to SEQ ID NO: 1 . Accordingly, we found that this amino acid position 235 could be changed in polypeptides with an amino acid sequence homologous to the sequence according to SEQ ID NO: 1 with the same effect. We constructed two pectate lyases that were 93% (SEQ ID NO: 6) and 89% (SEQ ID NO: 1 1 ) identical with the amino acid sequence according to SEQ ID NO: 1 and found that these two homologous enzymes also had an improved thermostability when the amino acid corresponding to position 235 in SEQ ID NO: 1 was changed to small, polar, non- charged amino acid residue.

Whereas the wild type sequence (SEQ ID NO: 1 ) only displayed 60% of its activity when pre-incubated at 70 degrees Celsius for 10 minutes, the variant polypeptides with the 235 mutation (K235T, K235S, K235N and K235C) were all at least as active as when pre-incubated at room temperature (table 1 and figure 1 ).

Table 1 ; Thermostability of SEQ ID NO: 1 and its K235 variants (Residual relative pectate lyase activity after pre-incubation at elevated temperatures). Temp SEQ ID NO: 1 K235T K235S K235N K235C

RT 100% 100% 100% 100% 100%

70 C 60% 1 10% 1 10% 105% 100%

75 C 1 % 50% 50% 30% 30%

80 C 1 % 35% 35% 10% 10%

Moreover, whereas the wild-type enzyme (WT, SEQ ID NO: 1 ) was not active anymore after pre-incubation at 75 degrees Celsius for 10 minutes, the K235 variants where active, even up to a level of 50% of the activity of the same enzyme, pre-incubated at room temperature for 10 minutes (figure 1 ).

Even more surprisingly, we found that the K235 variants were all able to resist pre-incubation at 80 degrees Celsius for 10 minutes, even up to a level of 35% of the activity of the same enzyme, pre-incubated at room temperature for 10 minutes (figure 1 ).

The expression "the amino acid corresponding to position 235 in SEQ ID NO: 1 " is to be understood as follows. If such a position is to be determined in a given amino acid sequence that is at least 70% identical with the amino acid sequence according to SEQ ID NO: 1 , then the two sequences are first to be aligned. That may be done by routine methods and software available in the art. The amino acid in the given amino acid sequence corresponding to amino acid 235 in SEQ ID NO: 1 is then the amino acid aligning with the lysine residue at position 235 in SEQ ID NO: 1.

We performed a homology search for proteins homologous to SEQ ID NO: 1 using SEQ ID NO: 1 as the query sequence in the "Standard protein BLAST" software, available at

http://blast.ncbi. nlm.nih.gov/Blast.cgi?PROGRAM=blastp&PAGE_TYPE=BlastSear ch&LI NK_LOC=blasthome. More information on the software and database versions is available at the National Center for Biotechnology Information at National library of Medicine at National institute of Health internet site www.ncbi.nlm.nih.gov. Therein, a number of molecular biology tools including BLAST (Basic Logical Alignment Search Tool) is to be found. BLAST makes use of the following databases: All non-redundant GenBank CDS translations + PDB + SwissProt + PIR + PRF excluding environmental samples from WGS projects.

There were no polypeptides found comprising an amino acid sequence that is at least 70% identical to the amino acid according to SEQ ID NO: 1 .

The term "amino acid variant", "variant", "mutant" or "sequence variant" or equivalent has a meaning well recognized in the art and is accordingly used herein to indicate an amino acid sequence that has at least one amino acid difference as compared to another amino acid sequence, such as the amino acid sequence from which it was derived.

We also constructed homologous polypeptides, having 93% and 89% sequence identity to the wild type sequence according to SEQ ID NO: 1. These homologous polypeptides are referred to herein as polypeptides according to SEQ ID NO: 6 (93% identical) and SEQ ID NO: 1 1 (89% identical).

We found that K235 variants of these polypeptides also had an improved thermostability (tables 2 and 3 and figures 2 and 3).

Table 2; Thermostability of SEQ ID NO: 6 and its K235 variants (Residual relative pectate lyase activity after pre-incubation at elevated temperatures)

Table 3; Thermostability of SEQ ID NO: 1 1 and its K235 variants (Residual relative pectate lyase activity after pre-incubation at elevated temperatures).

The term "improved thermostability" as used herein means that the K235 variant polypeptides exhibited more pectate lysase activity after preincubation at elevated temperatures as compared to the activity of the same polypeptides without the mutation at position 235, such as the wild type sequence (SEQ ID NO: 1 ) or the two homologous polypeptides according to SEQ ID NO: 6 and SEQ ID NO: 1 1 as described herein.

Thermostable pectate lyases have been described to be produced by bacteria of the genus Bacillus [Takao et al, Biosci. Biotechnol. Biochem. (2000) 64: 2360-2367, Takao et al., Biosci. Biotechnol. Biochem. (2001 ) 65: 322-329, Swarupa Rani Chiliveri et al., Carbohydrate Polymers (2014), 1 1 1 : 264-272, Zhou et al., Appl Environ Microbiol (2015) 81 : 5714-5723], hence in a preferred embodiment the invention relates to a polypeptide as described herein wherein the polypeptide is capable of being expressed in a bacterium, such as a Bacillus species, more preferably Bacillus subtilis.

We have shown that several polypeptides may be produced that are homologous to the wild-type sequence and still retain their pectate lyase activity. A BLAST search revealed that pectate lyases are available from bacterial origin, in particular from Bacillus species, with an identity as low as 52% or less as compared to SEQ ID NO: 1. The skilled person will therefore have no difficulty in constructing a polypeptide with pectate lyase activity that is at least 70% identical to the sequence of SEQ ID NO: 1 following the procedures and guidance provided herein. He will also be able to make the K235 variants as described herein, thereby obtaining a pectate lyase with an improved thermostability.

In a preferred embodiment, the invention relates to a polypeptide as described herein comprising an amino acid sequence that is at least 75% identical to the amino acid according to SEQ ID NO: 1 , such as 80%, 85%, 89%, 90%, 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or even 100%.

Recovery of a polypeptide according to the invention as produced by a host cell may be performed by any technique known to those skilled in the art. Possible techniques include, but are not limited to secretion of the protein into the expression medium, and purification of the protein from cellular biomass. The production method may further comprise a step of purifying the polypeptide obtained. For thermostable polypeptides, non-limiting examples of such methods include heating of the disintegrated cells and removing coagulated thermo labile proteins from the solution. For secreted proteins, non-limiting examples of such methods include ion exchange chromatography, and ultra-filtration of the expression medium. It is preferred that the purification method of choice is such that the purified protein retains its activity.

Accordingly, in a further preferred embodiment, the invention relates to a polypeptide as described herein wherein the polypeptide is an isolated polypeptide.

We have shown herein that the K235 variants as described herein have an improved thermostability.

The polypeptides as described herein may be used in compositions containing several additional components, such as stabilizers, fillers, cell debris, culture medium etcetera. Hence, the invention provides a composition comprising a polypeptide as described herein.

Polypeptides as described herein may be obtained by expressing a recombinant DNA in a heterologous expression system. The term "heterologous expression system" or equivalent means a system for expressing a DNA sequence from one host organism in a recipient organism from a different species or genus than the host organism. The most prevalent recipients, known as heterologous expression systems, are chosen usually because they are easy to transfer DNA into or because they allow for a simpler assessment of the protein's function. Heterologous expression systems are also preferably used because they allow the upscaling of the production of a protein encoded by the DNA sequence in an industrial process. Preferred recipient organisms for use as heterologous expression systems include bacterial, fungal and yeast organisms, such as for example Escherichia coli, Bacillus, Corynebacterium, Pseudomonas, Pichia pastoris, Saccharomyces cerevisiae, Yarrowia lipolytica, filamentous fungi and many more systems well known in the art.

The presently disclosed polypeptides or proteins may be fused to additional sequences, by attaching or inserting, including , but not limited to, affinity tags, facilitating protein purification (S-tag, maltose binding domain, chitin binding domain), domains or sequences assisting folding (such as thioredoxin domain, SUMO protein), sequences affecting protein localization (periplasmic localization signals etc), proteins bearing additional function, such as green fluorescent protein (GFP), or sequences representing another enzymatic activity. Other suitable fusion partners for the presently disclosed polypeptides are known to those skilled in the art.

The present invention also relates to polynucleotides encoding any of the pectate lyase variants disclosed herein. Means and methods for cloning and isolating such polynucleotides are well known in the art.

Furthermore, the present invention relates to a vector comprising a polynucleotide according to the invention, optionally operably linked to one or more control sequences. Suitable control sequences are readily available in the art and include, but are not limited to, promoter, leader, polyadenylation, and signal sequences.

Pectate lyase variants according to various embodiments of the present invention may be obtained by standard recombinant methods known in the art. Briefly, such a method may comprise the steps of: culturing a recombinant host cell as described above under conditions suitable for the production of the polypeptide, and recovering the polypeptide obtained. The polypeptide may then optionally be further purified.

A large number of vector-host systems known in the art may be used for recombinant production of the pectate lyases as described herein. Possible vectors include, but are not limited to, plasmids or modified viruses which are maintained in the host cell as autonomous DNA molecule or integrated in genomic DNA. The vector system must be compatible with the host cell used as is well known in the art. Non-limiting examples of suitable host cells include bacteria (e.g. E.coli, bacilli), yeast (e.g. Pichia Pastoris, Saccharomyces Cerevisiae), fungi (e.g. filamentous fungi) insect cells (e.g. Sf9).

In yet other terms, the invention relates to a method for improving the thermostability of a polypeptide with pectate lyase activity comprising an amino acid sequence that is at least 70% identical to the amino acid according to SEQ ID NO: 1 , the method comprising the step of altering the amino acid at a position corresponding to position 235 in SEQ ID NO: 1 to a small, polar, non-charged amino acid residue.

Surprisingly, we found that the thermostability of the above described K235 variant polypeptides could be even further improved if another variation was introduced in addition to the K235 variation. We introduced an A231 L variant into SEQ ID NO: 1 and found that this variation on its own improved the thermostability of the polypeptide (table 4, figure 4).

As used herein, the term "A231 L variant" indicates that the amino acid corresponding to the residue at position 231 of SEQ ID NO: 1 (alanine) is replaced by a leucine residue.

Table 4: Thermostability of a polypeptide according to SEQ ID NO 1 and its variants (Residual relative pectate lyase activity after pre-incubation at elevated temperatures).

Surprisingly, the polypeptides carrying both the A231 L variation and the K235 variations were more thermostable than each of the variant polypeptides alone. The effect was even found to be synergistic. Whereas the polypeptide according to SEQ ID NO: 1 did not have any significant residual activity after pre-incubation at 75 degrees C, the A231 L variant as well as the K235T and K235S variants retained 20%, 50% and 50% of their activity relative to the activity after pre-incubation at room temperature (relative activity, table 4 and figure 4). When both the A231 L and K235T mutations were introduced into SEQ ID NO: 1 , the relative activity of this double mutant (after pre-incubation at 75 C) improved to 90%. The same effect was found for the A231 L and K235S double mutant of the polypeptide according to SEQ ID NO: 1. Therein the activity of the double mutant remained at 100% after pre-incubation at 75 degrees Celsius for 10 minutes.

Most remarkably, the double mutants were exceptionally active after pre- incubation at 80 degrees Celsius. Whereas the A231 L variant of SEQ ID NO: 1 was no longer active after pre-incubation at 80 degrees Celsius, in combination with the K235 variants, it improved the thermostability from K235T variant from 35% to 80% and the thermostability of the K235S variant from 35 to 70% (table 4, figure 4)

We further investigated if this phenomenon also occurred with polypeptides that were homologous to the polypeptide according to SEQ ID NO: 1 . For that purpose we introduced the A231 L single and double mutations in polypeptides according to SEQ ID NO: 6 and 1 1 (93% and 89% identical with SEQ ID NO: 1 respectively).

We observed that the thermostability of these polypeptides could also be improved in the same manner as described above for the polypeptide according to SEQ D NO: 1.

The single mutation A231 L in SEQ ID NO: 6 improved the thermostability as shown in table 5. The double mutant A231 L plus K235N improved the thermostability synergistically, i.e. more than the sum of the contributions of each of the mutations separately. After pre-incubation at 75 degrees Celsius, the relative activity of the A231 L and K235N variants still was 25% and 30% respectively, whereas the combination of both mutations resulted in 80% relative activity.

Most remarkably, the double mutants were exceptionally active after preincubation at 80 degrees Celsius. The A231 L variant of SEQ ID NO: 1 was not significantly active after pre-incubation at 80 degrees Celsius. However, in combination with the K235N variant, it improved the thermostability of the double mutant from 20% to 70% (table 5, figure 5).

Table 5: Thermostability of a polypeptide according to SEQ ID NO 6 and its variants (Residual relative pectate lyase activity after pre-incubation at elevated temperatures).

The single mutation A231 L in SEQ ID NO: 1 1 improved the thermostability as shown in table 6. The double mutant A231 L plus K235C improved the thermostability synergistically, i.e. more than the sum of the contributions of each of the mutations separately. After pre-incubation at 75 degrees Celsius, the A231 L and K235C variants still had 18% and 35% relative activity respectively, whereas the combination of both mutations resulted in 90% relative activity. Most remarkably, the double mutants were exceptionally active after preincubation at 80 degrees Celsius. The A231 L variant of SEQ ID NO: 1 1 was not significantly active after pre-incubation at 80 degrees Celsius. However, in combination with the K235C variant, it improved the thermostability of the double mutant from 10% to 80% (table 6, figure 6).

Table 6: Thermostability of a polypeptide according to SEQ ID NO 1 1 and its variants

(Residual relative pectate lyase activity after pre-incubation at elevated temperatures).

Hence, the invention also relates to a K235 variant polypeptide as described above additionally comprising a leucine residue at an amino acid position corresponding to position 231 in SEQ ID NO: 1 .

The invention also relates to a method for improving the thermostability of a polypeptide with pectate lyase activity comprising an amino acid sequence that is at least 70% identical to the amino acid according to SEQ ID NO: 1 , the method comprising the step of altering the amino acid at a position corresponding to position 235 in SEQ ID NO: 1 to a small, polar, non-charged amino acid residue and altering the amino acid at a position corresponding to position 231 in SEQ ID NO: 1 to a leucine residue.

In a further preferred embodiment, the invention relates to any of the methods as described above, wherein the polypeptide with pectate lyase activity is capable of being expressed in a bacterium, such as a Bacillus species, more preferably Bacillus subtilis.

The polypeptides with pectate lyase activity according to the present invention may be used in a wide range of different industrial processes and applications, such as cellulose recovery from lignocellulosic biomass, decreasing the energy required for the refining of wood and production of a sugar from a lignocellulosic material. They may also be used in wood pulp preparation, in pulp delignification, textile dye bleaching, wastewater detoxifixation, xenobiotic detoxification, degrading or decreasing the structural integrity of lignocellulosic material and detergent manufacturing.

Examples

Example 1 : Preparation of a polypeptide according to SEQ ID NO: 1. The DNA construct disclosed in Takao et al., Biosci. Biotechnol. Biochem. (2001 ) 65: 322-329 encoding the polypeptide according to SEQ ID NO: 1 was optimized for expression in E. coli and commercially synthesized and cloned into a standard plasmid vector pET28a+ under the control of T7-RNA-polymerase promoter for expression in Escherichia coli BL21 (DE3). The nucleotide sequence of the construct is provided herein as SEQ ID NO: 16.

Example 2: Preparation of variants of a polypeptide according to SEQ ID NO: 1 with pectate lyase activity.

Homologous protein sequences (according to SEQ ID NO: 6 and SEQ ID NO:

1 1 ) were generated by random mutagenesis of SEQ ID NO:s 16 and SEQ ID NO: 21 using error-prone PCR essentially as described (Curr Protoc Mol Biol. 2001 May; Chapter 8: Unit 8.3. doi: 10.1002/0471 142727. mb0803s51 , Random mutagenesis by PCR. Wilson DS1 , Keefe AD) using a commercial random PCR mutagenesis kit (QuikChange® II XL Site-Directed Mutagenesis kit by Agilent Technologies). More in particular, the DNA sequence of SEQ ID NO: 21 was obtained from SEQ ID NO: 16 encoding the polypeptide according to SEQ ID NO: 1 . The DNA sequence of SEQ ID NO: 26 was obtained by random mutagenesis of SEQ ID NO: 21 encoding the polypeptide according to SEQ ID NO: 6. SEQ ID NO: 26 is the DNA sequence encoding the polypeptide according to SEQ ID NO: 1 1.

PCR fragments resulting from error-prone PCR were cloned to the plasmid vector pET28a+ under the control of T7-RNA-polymerase promoter for expression in Escherichia coli BL21 (DE3), and screened for pectate lyase activity of the recombinant proteins.

Active clones were subjected to further rounds of randomization using the same protocol. The polypeptide according to SEQ ID NO: 6 exhibited pectate lyase activity and was found to be 93% identical with SEQ ID NO: 1 . The polypeptide according to SEQ ID NO: 1 1 also exhibited pectate lyase activity and was found to be 89% identical with SEQ ID NO: 1 .

Example 3: Preparation of A231 L and K235 variant polypeptides.

In order to prepare polypeptide according to SEQ ID NO:s 2 - 5, mutations were inserted into the DNA coding for polypeptide according to SEQ ID NO: 1 at position 235. As a result, the lysine residue from that position in SEQ ID NO: 1 was replaced with either one of the residues T (threonine), C (cysteine), S (serine) or N (asparagine), thereby resulting in the polypeptides according to SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4 and SEQ ID NO: 5, respectively. These variants are referred to herein as K235T, K235S, K235N and K235C respectively.

This was achieved by standard site-directed mutagenesis essentially as described in WO 2013/038062. In more detail: To introduce mutation A231 L into the genes encoding SEQ ID NO: 1 , SEQ ID NO: 6 and SEQ ID NO: 1 1 , we carried out two separate PCR reactions:

(1 ) with primers Primer 1 gaaattaatacgactcactatagg (SEQ ID NO: 31 ) and

Primer 2(A231 L) GCCATCATGCTGCTGAAACGGACGACCAAAATAGGTG (SEQ ID NO: 38),

(2) with Primer3(A231 L) GGTCGTCCGTTTCAGCAGCATGATGGCctgCTGGATATC (SEQ ID NO: 39) and Primer 4 ggttatgctagttattgctcagcggtg (SEQ ID NO: 32).

In both reactions, recombinant gene without the mutation was used as the template. Primers 1 and 4 bind inside the vector sequence and are not specific to the recombinant gene. Primers 2 and 3 bind inside the recombinant gene and their binding sites overlap. Primer 3 binding site contains the mutation site. Primer 3 represents the mutated (desired) sequence, which is not 100% matching the template (lower case type font in the primer sequence indicates the mismatched nucleotides). However, the primer has enough affinity and specificity to the binding site to produce the desired PCR product. Purified PCR products from reactions (1 ) and (2) were combined and used as template for PCR reaction with Primer 1 and Primer 4. The products of this reaction, containing the A231 L variant sequence of the genes encoding the polypeptides according to SEQ ID NO:s 48 - 50 was cloned in a plasmid vector for expression in E. coli.

The same protocol and the same primers 1 and 4 were used for introducing the K235 mutations into the genes encoding the polypeptide according to SEQ ID NO: 1 , SEQ ID NO: 6 and SEQ ID NO: 1 1 . Primer 3 used for introducing the K235 mutations was AATAGCAGCGATTTTATCACCATCAGCTACAACGTGTTTA (SEQ ID NO: 37). The specific Primers 2 used for the mutations K235T, K235S, K235N and K235C are listed in table 7.

Table 7; primers used for introducing the 231 and 235 mutations.

Seq ID Primer name Sequence (5'-3')

NO:

31 Primer 1 GAAATTAATACGACTCACTATAGG

32 Primer 4 GGTTATGCTAGTTATTGCTCAGCGGTG

33 Primer 2 GCTGATGGTGATAAAATCGCTGCTATTggTGATATCCAG Seq ID Primer name Sequence (5'-3')

NO:

K235T

34 Primer 2 GCTGATGGTGATAAAATCGCTGCTATTgcTGATATCCAG

K235S

35 Primer 2 GCTGATGGTGATAAAATCGCTGCTATTgTTGATATCCAG

K235N

36 Primer 2 GCTGATGGTGATAAAATCGCTGCTATTgcaGATATCCAG

K235C

37 Primer 3 K235 AATAGCAGCGATTTTATCACCATCAGCTACAACGTGTTTA

38 Primer 2 A231 GCCATCATGCTGCTGAAACGGACGACCAAAATAGGTG

39 Primer 3 GGTCGTCCGTTTCAGCAGCATGATGGCctgCTGGATATCA

A231 L

Double mutants were prepared by introducing the mutation into the DNA encoding a polypeptide carrying a single mutation.

Example 4: Heterologous expression of polypeptides with pectate lyase activity.

For recombinant expression in E. coli, recombinant genes were cloned into pET-28 commercial expression vector under the control of T7 bacteriophage promoter.

Protein production was carried out in E.coli BL21 (DE3) strain according to the plasmid manufacturer protocol available at

http://richsingiser.com/4402/Novagen%20pET%20system%20man ual.pdf . The incubation temperature for protein production was 30 degrees C, which was found optimal for maximum yield of the active protein. Cells were lysed using lysis buffer (50 mM Tris-HCI pH7.4, 1 % Triton X100, 0.5 mM CaCI) and heated at 60 degrees C for 20 min. Coagulated cell debris was removed by centrifugation. The thermostable recombinant pectate lyases were detected in the soluble fraction only, consistent with the notion that they were thermostable enzymes.

Example 5: Pectate lyase activity assay

Pectate lyase activity assay was carried out essentially as described in Takao M, Nakaniwa T, Yoshikawa K, Terashita T, Sakai T., "Purification and characterization of thermostable pectate lyase with protopectinase activity from thermophilic Bacillus sp. TS 47". Biosci Biotechnol Biochem. 2000 64:2360-7. In more detail, pectate lyase activity was assayed by measuring the increase in absorbance at 235 nm of the reaction mixture. Polygalacturonic acid (PGA) sodium salt from de-methylated citrus pectin (purchased from MegaZyme) was used as substrate. A reaction mixture containing 1 ml of 0.1 % PGA in 10 mM Tris-HCI buffer, pH 8.0 and 0.5 mM CaCI2, and an appropriate amount of enzyme solution was incubated for 30 min at 60 C.

The reaction was stopped by placing the mixture in 100 degrees C (boiling water bath) for 5 min. Relative pectate lyase activity was was calculated from the difference in absorption of the reaction mixture at 235 nm at the start and at the end of the reaction.

Example 6: Thermostability of polypeptides with pectate lyase activity.

Thermostability of the polypeptides with pectate lyase activity was determined by pre-incubation for 10 minutes in 50 mM Tris-HCI pH 8.0, either at room temperature (control) or at 70 degrees C, 75 degrees C and 80 degrees C before measuring their activity according to example 5.

After pre-incubation, the samples were brought to 60 degrees C, substrate (PGA) was added and samples were assayed for activity as described in Example 5 at 60 degrees C pH 8.0. Residual activities for each sample were calculated as % of the activity of the corresponding sample pre-incubated at room temperature (control sample).

Example 7: Sequences provided herein

Amino acid sequence and nucleotide sequences are provided herewith in the WIPO ST_25 standard. For convenience the sequences are also provided in table 8.

SEQ ID NO: 1 is derived from the prior art and has been disclosed in Takao et al, Biosci. Biotechnol. Biochem. (2000) 64: 2360-2367 and in Takao et al., Biosci.

Biotechnol. Biochem. (2001 ) 65: 322-329.

Amino acids corresponding to positions 231 and 235 in SEQ ID NO: 1 are shown in bold and underlined type face.

SEQ ID NO: 6 was obtained by random mutagenesis of the DNA encoding SEQ ID NO: 1 (shown herein as SEQ ID NO: 16) as described in example 2.

SEQ ID NO: 1 1 was obtained by random mutagenesis of the DNA encoding SEQ ID NO: 6 (shown herein as SEQ ID NO: 21 ).

The DNA encoding the polypeptide according to SEQ ID NO: 1 1 is shown herein as SEQ ID NO: 26.

The amino acids deviating from the wild type sequence of SEQ ID NO: 1 are shown in capital letters. The polypeptide according to SEQ ID NO: 6 is a homologue of the polypeptide according to SEQ ID NO: 1. These two polypeptides have 385 of the 416 amino acids in common, in other words they are 93% identical.

The polypeptide according to SEQ ID NO: 1 1 is also a homologue of the polypeptide according to SEQ ID NO: 1. These two polypeptides have 369 of the 416 amino acids in common, in other words they are 89% identical.

The polypeptides according to SEQ ID NO:s 2 - 5 correspond to the polypeptide according to SEQ ID NO: 1 with variations K235T, K235S, K235N and K235C respectively.

The polypeptides according to SEQ ID NO:s 7 - 10 correspond to the polypeptide according to SEQ ID NO: 6 with variations K235T, K235S, K235N and K235C respectively.

The polypeptides according to SEQ ID NO:s 12 - 15 correspond to the polypeptide according to SEQ ID NO: 1 1 with variations K235T, K235S, K235N and K235C respectively.

The nucleotide sequences according to SEQ ID NO:s 16 - 30 encode the polypeptides with amino acid sequences according to SEQ ID NO:s 1 - 15 respectively.

SEQ ID NO:s 31 - 39 correspond to the primers used for producing the variant polypeptides as detailed in example 3.

Polypeptides carrying the double mutations A231 L with K235T, K235S, K235N and K235C double mutations are shown in SEQ ID NO: 40 - 43 respectively.

DNA encoding the polypeptides according to SEQ ID NO: 40 - 43 are shown in SEQ ID NO: 44 - 47.

SEQ ID NO:s 48, 49 and 50 correspond to variants A231 L in polypeptides according to SEQ ID NO: 1 , 6 and 1 1 respectively. SEQ ID NO:s 51 - 53 are the DNA sequences encoding the polypeptides according to SEQ ID NO:s 48 - 50

Table 8: Amino acid and nucleotide sequences disclosed herein.

SEQ ID NO: Sequence

1 1 kelghevlkp ydgwaaygeg ttggamaspq nvfvvtnrte liqalggnnh tnqynsvpki

61 iyvkgtidln vddnnqpvgp dfykdphfdf eaylreydpa twgkkevegp leearvrsqk

121 kqkdrimvyv gsntsiigvg kdakikgggf liknvdnvii rniefeapld yfpewdptdg

181 tlgewnseyd sisiegsshi widhntftdg dhpdrslgty fgrpfqqhdg aldiknssdf

241 itisynvftn hdkvtligas dsrmadsghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvf sqiyaqnnyf sfdwdidpsl iikvwsknee smyetgtivd

361 Ipngrryidl vasynesntl qlkkevtwkp mfyhvihptp svpalvkaka gagnlh

2 1 kelghevlkp ydgwaaygeg ttggamaspq nvfvvtnrte liqalggnnh tnqynsvpki

61 iyvkgtidln vddnnqpvgp dfykdphfdf eaylreydpa twgkkevegp leearvrsqk

121 kqkdrimvyv gsntsiigvg kdakikgggf liknvdnvii rniefeapld yfpewdptdg

181 tlgewnseyd sisiegsshi widhntftdg dhpdrslgty fgrpfqqhdg alditnssdf

241 itisynvftn hdkvtligas dsrmadsghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvf sqiyaqnnyf sfdwdidpsl iikvwsknee smyetgtivd

361 Ipngrryidl vasynesntl qlkkevtwkp mfyhvihptp svpalvkaka gagnlh

3 1 kelghevlkp ydgwaaygeg ttggamaspq nvfvvtnrte liqalggnnh tnqynsvpki

61 iyvkgtidln vddnnqpvgp dfykdphfdf eaylreydpa twgkkevegp leearvrsqk

121 kqkdrimvyv gsntsiigvg kdakikgggf liknvdnvii rniefeapld yfpewdptdg

181 tlgewnseyd sisiegsshi widhntftdg dhpdrslgty fgrpfqqhdg aldisnssdf

241 itisynvftn hdkvtligas dsrmadsghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvf sqiyaqnnyf sfdwdidpsl iikvwsknee smyetgtivd

361 Ipngrryidl vasynesntl qlkkevtwkp mfyhvihptp svpalvkaka gagnlh

SEQ ID NO: Sequence

4 1 kelghevlkp ydgwaaygeg ttggamaspq nvfvvtnrte liqalggnnh tnqynsvpki

61 iyvkgtidln vddnnqpvgp dfykdphfdf eaylreydpa twgkkevegp leearvrsqk

121 kqkdrimvyv gsntsiigvg kdakikgggf liknvdnvii rniefeapld yfpewdptdg

181 tlgewnseyd sisiegsshi widhntftdg dhpdrslgty fgrpfqqhdg aldinnssdf

241 itisynvftn hdkvtligas dsrmadsghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvf sqiyaqnnyf sfdwdidpsl iikvwsknee smyetgtivd

361 lpngrryidl vasynesntl qlkkevtwkp mfyhvihptp svpalvkaka gagnlh

5 1 kelghevlkp ydgwaaygeg ttggamaspq nvfvvtnrte liqalggnnh tnqynsvpki

61 iyvkgtidln vddnnqpvgp dfykdphfdf eaylreydpa twgkkevegp leearvrsqk

121 kqkdrimvyv gsntsiigvg kdakikgggf liknvdnvii rniefeapld yfpewdptdg

181 tlgewnseyd sisiegsshi widhntftdg dhpdrslgty fgrpfqqhdg aldicnssdf

241 itisynvftn hdkvtligas dsrmadsghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvf sqiyaqnnyf sfdwdidpsl iikvwsknee smyetgtivd

361 lpngrryidl vasynesntl qlkkevtwkp mfyhvihptp svpalvkaka gagnlh

6 1 kelghDvlkp ydgwaSygeg ttggSmaspq nvYTvtnKte IVqalggnnh tnqynsvpki

61 iyvkgtiEln vddnnqpvgp EfykdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd yfpewdptdg

181 tlgewnseyd siTiegsHhi widhntftdg dhpdKslgty fgrpfqqhdg aldiknssdf

241 itisynvfKD hdkvtligas dsrmadEghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 lpngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

7 1 kelghDvlkp ydgwaSygeg ttggSmaspq nvYTvtnKte IVqalggnnh tnqynsvpki

SEQ ID NO: Sequence

61 iyvkgtiEln vddnnqpvgp EfykdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd yfpewdptdg

181 tlgewnseyd siTiegsHhi widhntftdg dhpdKslgty fgrpfqqhdg alditnssdf

241 itisynvfKD hdkvtligas dsrmadEghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 Ipngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

8 1 kelghDvlkp ydgwaSygeg ttggSmaspq nvYTvtnKte IVqalggnnh tnqynsvpki

61 iyvkgtiEln vddnnqpvgp EfykdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd yfpewdptdg

181 tlgewnseyd siTiegsHhi widhntftdg dhpdKslgty fgrpfqqhdg aldisnssdf

241 itisynvfKD hdkvtligas dsrmadEghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 Ipngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

9 1 kelghDvlkp ydgwaSygeg ttggSmaspq nvYTvtnKte IVqalggnnh tnqynsvpki

61 iyvkgtiEln vddnnqpvgp EfykdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd yfpewdptdg

181 tlgewnseyd siTiegsHhi widhntftdg dhpdKslgty fgrpfqqhdg aldinnssdf

241 itisynvfKD hdkvtligas dsrmadEghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 Ipngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

10 1 kelghDvlkp ydgwaSygeg ttggSmaspq nvYTvtnKte IVqalggnnh tnqynsvpki

61 iyvkgtiEln vddnnqpvgp EfykdphYdf eaylKeydpK KwgkkevSgp leearArsqk

SEQ ID NO: Sequence

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd yfpewdptdg

181 tlgewnseyd siTiegsHhi widhntftdg dhpdKslgty fgrpfqqhdg aldicnssdf

241 itisynvfKD hdkvtligas dsrmadEghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 Ipngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

11 1 kelghDvlkp NdgwaSygeg ttggSEaspD nvYTvtnKSe IVqalggnnh tnqynsTpki

61 iyvkgtiEln vddnnqpvgp EYyDdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd Ffpewdptdg

181 EYgewnseyd siTieSsHhi widhntftdg dhpdKslgty fgrpfqqhdg aldiknssdf

241 itisynvfKD hdkvSligSs dsrKTdEghl Kvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 Ipngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

12 1 kelghDvlkp NdgwaSygeg ttggSEaspD nvYTvtnKSe IVqalggnnh tnqynsTpki

61 iyvkgtiEln vddnnqpvgp EYyDdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd Ffpewdptdg

181 EYgewnseyd siTieSsHhi widhntftdg dhpdKslgty fgrpfqqhdg alditnssdf

241 itisynvfKD hdkvSligSs dsrKTdEghl Kvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 Ipngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

13 1 kelghDvlkp NdgwaSygeg ttggSEaspD nvYTvtnKSe IVqalggnnh tnqynsTpki

61 iyvkgtiEln vddnnqpvgp EYyDdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd Ffpewdptdg

SEQ ID NO: Sequence

181 EYgewnseyd siTieSsHhi widhntftdg dhpdKslgty fgrpfqqhdg aldisnssdf

241 itisynvfKD hdkvSligSs dsrKTdEghl Kvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 lpngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

14 1 kelghDvlkp NdgwaSygeg ttggSEaspD nvYTvtnKSe IVqalggnnh tnqynsTpki

61 iyvkgtiEln vddnnqpvgp EYyDdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd Ffpewdptdg

181 EYgewnseyd siTieSsHhi widhntftdg dhpdKslgty fgrpfqqhdg aldinnssdf

241 itisynvfKD hdkvSligSs dsrKTdEghl Kvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 lpngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

15 1 kelghDvlkp NdgwaSygeg ttggSEaspD nvYTvtnKSe IVqalggnnh tnqynsTpki

61 iyvkgtiEln vddnnqpvgp EYyDdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd Ffpewdptdg

181 EYgewnseyd siTieSsHhi widhntftdg dhpdKslgty fgrpfqqhdg aldicnssdf

241 itisynvfKD hdkvSligSs dsrKTdEghl Kvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 lpngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

16 aaagaactgg gtcatgaagt tctgaaaccg tatgatggtt gggcagcgta tggtgaaggt 60 acaaccggtg gtgcaatggc aagtccgcag aatgtttttg ttgttaccaa tcgtaccgaa 120 ctgattcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgatctgaac gtggatgata ataatcagcc ggttggtccg 240 gatttctata aagatccgca ttttgatttt gaggcctatc tgcgtgaata tgatccggca 300

SEQ ID NO: Sequence

acctggggta aaaaagaagt tgaaggtccg ctggaagaag cacgcgttcg tagccagaaa 360 aaacagaaag atcgtatcat ggtttatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagacgcga aaatcaaagg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accgctggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattagca ttgaaggcag cagccatatt 600 tggattgatc acaatacctt taccgatggc gatcatccgg atcgtagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaaaaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaccaac cacgataaag ttaccctgat tggtgcaagc 780 gatagccgta tggcagatag cggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgat tttcagtatg catggggtgt tggtgtgttt 960 agccagattt atgcacagaa caactatttc agcttcgatt gggatattga tccgagcctg 1020 attatcaaag tttggagcaa aaatgaagaa agcatgtatg aaaccggcac catcgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttac ctggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

17 aaagaactgg gtcatgaagt tctgaaaccg tatgatggtt gggcagcgta tggtgaaggt 60 acaaccggtg gtgcaatggc aagtccgcag aatgtttttg ttgttaccaa tcgtaccgaa 120 ctgattcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgatctgaac gtggatgata ataatcagcc ggttggtccg 240 gatttctata aagatccgca ttttgatttt gaggcctatc tgcgtgaata tgatccggca 300 acctggggta aaaaagaagt tgaaggtccg ctggaagaag cacgcgttcg tagccagaaa 360 aaacagaaag atcgtatcat ggtttatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagacgcga aaatcaaagg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accgctggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattagca ttgaaggcag cagccatatt 600 tggattgatc acaatacctt taccgatggc gatcatccgg atcgtagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaccaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaccaac cacgataaag ttaccctgat tggtgcaagc 780 gatagccgta tggcagatag cggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgat tttcagtatg catggggtgt tggtgtgttt 960

SEQ ID NO: Sequence

agccagattt atgcacagaa caactatttc agcttcgatt gggatattga tccgagcctg 1020 attatcaaag tttggagcaa aaatgaagaa agcatgtatg aaaccggcac catcgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttac ctggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248 aaagaactgg gtcatgaagt tctgaaaccg tatgatggtt gggcagcgta tggtgaaggt 60 acaaccggtg gtgcaatggc aagtccgcag aatgtttttg ttgttaccaa tcgtaccgaa 120 ctgattcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgatctgaac gtggatgata ataatcagcc ggttggtccg 240 gatttctata aagatccgca ttttgatttt gaggcctatc tgcgtgaata tgatccggca 300 acctggggta aaaaagaagt tgaaggtccg ctggaagaag cacgcgttcg tagccagaaa 360 aaacagaaag atcgtatcat ggtttatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagacgcga aaatcaaagg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accgctggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattagca ttgaaggcag cagccatatt 600 tggattgatc acaatacctt taccgatggc gatcatccgg atcgtagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcagcaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaccaac cacgataaag ttaccctgat tggtgcaagc 780 gatagccgta tggcagatag cggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgat tttcagtatg catggggtgt tggtgtgttt 960 agccagattt atgcacagaa caactatttc agcttcgatt gggatattga tccgagcctg 1020 attatcaaag tttggagcaa aaatgaagaa agcatgtatg aaaccggcac catcgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttac ctggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

19 aaagaactgg gtcatgaagt tctgaaaccg tatgatggtt gggcagcgta tggtgaaggt 60 acaaccggtg gtgcaatggc aagtccgcag aatgtttttg ttgttaccaa tcgtaccgaa 120 ctgattcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgatctgaac gtggatgata ataatcagcc ggttggtccg 240

SEQ ID NO: Sequence

gatttctata aagatccgca ttttgatttt gaggcctatc tgcgtgaata tgatccggca 300 acctggggta aaaaagaagt tgaaggtccg ctggaagaag cacgcgttcg tagccagaaa 360 aaacagaaag atcgtatcat ggtttatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagacgcga aaatcaaagg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accgctggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattagca ttgaaggcag cagccatatt 600 tggattgatc acaatacctt taccgatggc gatcatccgg atcgtagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaacaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaccaac cacgataaag ttaccctgat tggtgcaagc 780 gatagccgta tggcagatag cggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgat tttcagtatg catggggtgt tggtgtgttt 960 agccagattt atgcacagaa caactatttc agcttcgatt gggatattga tccgagcctg 1020 attatcaaag tttggagcaa aaatgaagaa agcatgtatg aaaccggcac catcgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttac ctggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

20 aaagaactgg gtcatgaagt tctgaaaccg tatgatggtt gggcagcgta tggtgaaggt 60 acaaccggtg gtgcaatggc aagtccgcag aatgtttttg ttgttaccaa tcgtaccgaa 120 ctgattcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgatctgaac gtggatgata ataatcagcc ggttggtccg 240 gatttctata aagatccgca ttttgatttt gaggcctatc tgcgtgaata tgatccggca 300 acctggggta aaaaagaagt tgaaggtccg ctggaagaag cacgcgttcg tagccagaaa 360 aaacagaaag atcgtatcat ggtttatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagacgcga aaatcaaagg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accgctggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattagca ttgaaggcag cagccatatt 600 tggattgatc acaatacctt taccgatggc gatcatccgg atcgtagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tctgcaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaccaac cacgataaag ttaccctgat tggtgcaagc 780 gatagccgta tggcagatag cggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900

SEQ ID NO: Sequence

tatgagttta gcaacctggc cgattatgat tttcagtatg catggggtgt tggtgtgttt 960 agccagattt atgcacagaa caactatttc agcttcgatt gggatattga tccgagcctg 1020 attatcaaag tttggagcaa aaatgaagaa agcatgtatg aaaccggcac catcgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttac ctggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

21 aaagaactgg gtcatgatgt gctgaaaccg tatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcatggc aagtccgcag aatgtttata ccgttaccaa taaaaccgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaattctata aagatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggtggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattacca ttgaaggcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaaaaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgaccctgat tggtgcaagc 780 gatagccgta tggcagatga aggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

22 aaagaactgg gtcatgatgt gctgaaaccg tatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcatggc aagtccgcag aatgtttata ccgttaccaa taaaaccgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180

SEQ ID NO: Sequence

atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaattctata aagatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggtggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattacca ttgaaggcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaccaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgaccctgat tggtgcaagc 780 gatagccgta tggcagatga aggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

23 aaagaactgg gtcatgatgt gctgaaaccg tatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcatggc aagtccgcag aatgtttata ccgttaccaa taaaaccgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaattctata aagatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggtggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattacca ttgaaggcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcagcaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgaccctgat tggtgcaagc 780 gatagccgta tggcagatga aggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840

SEQ ID NO: Sequence

aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

24 aaagaactgg gtcatgatgt gctgaaaccg tatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcatggc aagtccgcag aatgtttata ccgttaccaa taaaaccgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaattctata aagatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggtggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattacca ttgaaggcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaacaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgaccctgat tggtgcaagc 780 gatagccgta tggcagatga aggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

25 aaagaactgg gtcatgatgt gctgaaaccg tatgatggtt gggcaagcta tggtgaaggt

acaaccggtg gtagcatggc aagtccgcag aatgtttata ccgttaccaa taaaaccgaa

SEQ ID NO: Sequence

ctggttcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaattctata aagatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggtggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattacca ttgaaggcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tctgcaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgaccctgat tggtgcaagc 780 gatagccgta tggcagatga aggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

26 aaagaactgg gtcatgatgt gctgaaaccg aatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcgaagc aagtccggat aatgtttata ccgttaccaa taaaagcgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccac cccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaatattatg atgatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggttgat ttttttccgg aatgggatcc gaccgatggt 540 gaatatggcg aatggaatag cgaatatgat agcattacca tcgaaagcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaaaaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgagcctgat tggttcaagc 780

SEQ ID NO: Sequence

gatagccgta aaaccgatga aggtcatctg aaagttaccc tgcatcacaa ctattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

27 aaagaactgg gtcatgatgt gctgaaaccg aatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcgaagc aagtccggat aatgtttata ccgttaccaa taaaagcgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccac cccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaatattatg atgatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggttgat ttttttccgg aatgggatcc gaccgatggt 540 gaatatggcg aatggaatag cgaatatgat agcattacca tcgaaagcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaccaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgagcctgat tggttcaagc 780 gatagccgta aaaccgatga aggtcatctg aaagttaccc tgcatcacaa ctattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

28 aaagaactgg gtcatgatgt gctgaaaccg aatgatggtt gggcaagcta tggtgaaggt 60

SEQ ID NO: Sequence

acaaccggtg gtagcgaagc aagtccggat aatgtttata ccgttaccaa taaaagcgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccac cccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaatattatg atgatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggttgat ttttttccgg aatgggatcc gaccgatggt 540 gaatatggcg aatggaatag cgaatatgat agcattacca tcgaaagcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcagcaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgagcctgat tggttcaagc 780 gatagccgta aaaccgatga aggtcatctg aaagttaccc tgcatcacaa ctattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

29 aaagaactgg gtcatgatgt gctgaaaccg aatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcgaagc aagtccggat aatgtttata ccgttaccaa taaaagcgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccac cccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaatattatg atgatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggttgat ttttttccgg aatgggatcc gaccgatggt 540 gaatatggcg aatggaatag cgaatatgat agcattacca tcgaaagcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaacaatag cagcgatttt 720

SEQ ID NO: Sequence

atcaccatca gctacaacgt gtttaaagac catgataaag tgagcctgat tggttcaagc 780 gatagccgta aaaccgatga aggtcatctg aaagttaccc tgcatcacaa ctattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

30 aaagaactgg gtcatgatgt gctgaaaccg aatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcgaagc aagtccggat aatgtttata ccgttaccaa taaaagcgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccac cccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaatattatg atgatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggttgat ttttttccgg aatgggatcc gaccgatggt 540 gaatatggcg aatggaatag cgaatatgat agcattacca tcgaaagcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tctgcaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgagcctgat tggttcaagc 780 gatagccgta aaaccgatga aggtcatctg aaagttaccc tgcatcacaa ctattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

SEQ ID NO: Sequence

31 GAAATTAATACGACTCACTATAGG

32 GGTTATGCTAGTTATTGCTCAGCGGTG

33 GCTGATGGTGATAAAATCGCTGCTATTggTGATATCCAG

34 GCTGATGGTGATAAAATCGCTGCTATTgcTGATATCCAG

35 GCTGATGGTGATAAAATCGCTGCTATTgTTGATATCCAG

36 GCTGATGGTGATAAAATCGCTGCTATTgcaGATATCCAG

37 AATAGCAGCGATTTTATCACCATCAGCTACAACGTGTTTA

GCCATCATGCTGCTGAAACGGACGACCAAAATAGGTG

39 GGTCGTCCGTTTCAGCAGCATGATGGCctgCTGGATATCA

40 1 kelghevlkp ydgwaaygeg ttggamaspq nvfvvtnrte liqalggnnh tnqynsvpki

61 iyvkgtidln vddnnqpvgp dfykdphfdf eaylreydpa twgkkevegp leearvrsqk

121 kqkdrimvyv gsntsiigvg kdakikgggf liknvdnvii rniefeapld yfpewdptdg

181 tlgewnseyd sisiegsshi widhntftdg dhpdrslgty fgrpfqqhdg lldi nssdf

241 itisynvftn hdkvtligas dsrmadsghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvf sqiyaqnnyf sfdwdidpsl iikvwsknee smyetgtivd

361 Ipngrryidl vasynesntl qlkkevtwkp mfyhvihptp svpalvkaka gagnlh

41 1 kelghevlkp ydgwaaygeg ttggamaspq nvfvvtnrte liqalggnnh tnqynsvpki

61 iyvkgtidln vddnnqpvgp dfykdphfdf eaylreydpa twgkkevegp leearvrsqk

121 kqkdrimvyv gsntsiigvg kdakikgggf liknvdnvii rniefeapld yfpewdptdg

181 tlgewnseyd sisiegsshi widhntftdg dhpdrslgty fgrpfqqhdg lldisnssdf

241 itisynvftn hdkvtligas dsrmadsghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvf sqiyaqnnyf sfdwdidpsl iikvwsknee smyetgtivd

SEQ ID NO: Sequence

361 lpngrryidl vasynesntl qlkkevt kp mfyhvihptp svpalvkaka gagnlh

42 1 kelghDvlkp ydgwaSygeg ttggSmaspq nvYTvtnKte IVqalggnnh tnqynsvpki

61 iyvkgtiEln vddnnqpvgp EfykdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd yfpewdptdg

181 tlgewnseyd siTiegsHhi widhntftdg dhpdKslgty fgrpfqqhdg lldinnssdf

241 itisynvfKD hdkvtligas dsrmadEghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 lpngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

43 1 kelghDvlkp NdgwaSygeg ttggSEaspD nvYTvtnKSe IVqalggnnh tnqynsTpki

61 iyvkgtiEln vddnnqpvgp EYyDdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd Ffpewdptdg

181 EYgewnseyd siTieSsHhi widhntftdg dhpdKslgty fgrpfqqhdg lldicnssdf

241 itisynvfKD hdkvSligSs dsrKTdEghl Kvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 lpngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

44 aaagaactgg gtcatgaagt tctgaaaccg tatgatggtt gggcagcgta tggtgaaggt 60

acaaccggtg gtgcaatggc aagtccgcag aatgtttttg ttgttaccaa tcgtaccgaa 120 ctgattcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgatctgaac gtggatgata ataatcagcc ggttggtccg 240 gatttctata aagatccgca ttttgatttt gaggcctatc tgcgtgaata tgatccggca 300 acctggggta aaaaagaagt tgaaggtccg ctggaagaag cacgcgttcg tagccagaaa 360 aaacagaaag atcgtatcat ggtttatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagacgcga aaatcaaagg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accgctggat tattttccgg aatgggatcc gaccgatggc 540

SEQ ID NO: Sequence

accctgggtg aatggaatag cgaatatgat agcattagca ttgaaggcag cagccatatt 600 tggattgatc acaatacctt taccgatggc gatcatccgg atcgtagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc ctgctggata tcaccaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaccaac cacgataaag ttaccctgat tggtgcaagc 780 gatagccgta tggcagatag cggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgat tttcagtatg catggggtgt tggtgtgttt 960 agccagattt atgcacagaa caactatttc agcttcgatt gggatattga tccgagcctg 1020 attatcaaag tttggagcaa aaatgaagaa agcatgtatg aaaccggcac catcgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttac ctggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

45 aaagaactgg gtcatgaagt tctgaaaccg tatgatggtt gggcagcgta tggtgaaggt 60 acaaccggtg gtgcaatggc aagtccgcag aatgtttttg ttgttaccaa tcgtaccgaa 120 ctgattcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgatctgaac gtggatgata ataatcagcc ggttggtccg 240 gatttctata aagatccgca ttttgatttt gaggcctatc tgcgtgaata tgatccggca 300 acctggggta aaaaagaagt tgaaggtccg ctggaagaag cacgcgttcg tagccagaaa 360 aaacagaaag atcgtatcat ggtttatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagacgcga aaatcaaagg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accgctggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattagca ttgaaggcag cagccatatt 600 tggattgatc acaatacctt taccgatggc gatcatccgg atcgtagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc ctgctggata tcagcaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaccaac cacgataaag ttaccctgat tggtgcaagc 780 gatagccgta tggcagatag cggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgat tttcagtatg catggggtgt tggtgtgttt 960 agccagattt atgcacagaa caactatttc agcttcgatt gggatattga tccgagcctg 1020 attatcaaag tttggagcaa aaatgaagaa agcatgtatg aaaccggcac catcgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttac ctggaaaccg atgttctatc atgttattca tccgaccccg 1200

SEQ ID NO: Sequence

agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

46 aaagaactgg gtcatgatgt gctgaaaccg tatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcatggc aagtccgcag aatgtttata ccgttaccaa taaaaccgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaattctata aagatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggtggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattacca ttgaaggcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc ctgctggata tcaacaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgaccctgat tggtgcaagc 780 gatagccgta tggcagatga aggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

47 aaagaactgg gtcatgatgt gctgaaaccg aatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcgaagc aagtccggat aatgtttata ccgttaccaa taaaagcgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccac cccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaatattatg atgatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480

SEQ ID NO: Sequence

cgcaacatcg aatttgaagc accggttgat ttttttccgg aatgggatcc gaccgatggt 540 gaatatggcg aatggaatag cgaatatgat agcattacca tcgaaagcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc ctgctggata tctgcaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgagcctgat tggttcaagc 780 gatagccgta aaaccgatga aggtcatctg aaagttaccc tgcatcacaa ctattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

1 kelghevlkp ydgwaaygeg ttggamaspq nvfvvtnrte liqalggnnh tnqynsvpki

61 iyvkgtidln vddnnqpvgp dfykdphfdf eaylreydpa twgkkevegp leearvrsqk

121 kqkdrimvyv gsntsiigvg kdakikgggf liknvdnvii rniefeapld yfpewdptdg

181 tlgewnseyd sisiegsshi widhntftdg dhpdrslgty fgrpfqqhdg lldiknssdf

241 itisynvftn hdkvtligas dsrmadsghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvf sqiyaqnnyf sfdwdidpsl iikvwsknee smyetgtivd

361 lpngrryidl vasynesntl qlkkevtwkp mfyhvihptp svpalvkaka gagnlh

49 1 kelghDvlkp ydgwaSygeg ttggSmaspq nvYTvtnKte IVqalggnnh tnqynsvpki

61 iyvkgtiEln vddnnqpvgp EfykdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd yfpewdptdg

181 tlgewnseyd siTiegsHhi widhntftdg dhpdKslgty fgrpfqqhdg lldiknssdf

241 itisynvfKD hdkvtligas dsrmadEghl rvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

SEQ ID NO: Sequence

361 lpngrryidl vasynesntl qlkkevG kp mfyhvihptp svpalvkaka gagnlh

50 1 kelghDvlkp NdgwaSygeg ttggSEaspD nvYTvtnKSe IVqalggnnh tnqynsTpki

61 iyvkgtiEln vddnnqpvgp EYyDdphYdf eaylKeydpK KwgkkevSgp leearArsqk

121 kqkEriVvNv gsntsiigvg kdakiVgggf liknvdnvii rniefeapVd Ffpewdptdg

181 EYgewnseyd siTieSsHhi widhntftdg dhpdKslgty fgrpfqqhdg lldiknssdf

241 itisynvfKD hdkvSligSs dsrKTdEghl Kvtlhhnyyk nvtqrlprvr fgqvhiynny

301 yefsnladyd fqyawgvgvE sKiyaqnnyf sfdwdidpsK iikvwsknee smyeSgtivd

361 lpngrryidl vasynesntl qlkkevGwkp mfyhvihptp svpalvkaka gagnlh

51 aaagaactgg gtcatgaagt tctgaaaccg tatgatggtt gggcagcgta tggtgaaggt 60

acaaccggtg gtgcaatggc aagtccgcag aatgtttttg ttgttaccaa tcgtaccgaa 120 ctgattcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgatctgaac gtggatgata ataatcagcc ggttggtccg 240 gatttctata aagatccgca ttttgatttt gaggcctatc tgcgtgaata tgatccggca 300 acctggggta aaaaagaagt tgaaggtccg ctggaagaag cacgcgttcg tagccagaaa 360 aaacagaaag atcgtatcat ggtttatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagacgcga aaatcaaagg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accgctggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattagca ttgaaggcag cagccatatt 600 tggattgatc acaatacctt taccgatggc gatcatccgg atcgtagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaaaaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaccaac cacgataaag ttaccctgat tggtgcaagc 780 gatagccgta tggcagatag cggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900

SEQ ID NO: Sequence

tatgagttta gcaacctggc cgattatgat tttcagtatg catggggtgt tggtgtgttt 960 agccagattt atgcacagaa caactatttc agcttcgatt gggatattga tccgagcctg 1020 attatcaaag tttggagcaa aaatgaagaa agcatgtatg aaaccggcac catcgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttac ctggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

52 aaagaactgg gtcatgatgt gctgaaaccg tatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcatggc aagtccgcag aatgtttata ccgttaccaa taaaaccgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccgt gccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaattctata aagatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggtggat tattttccgg aatgggatcc gaccgatggc 540 accctgggtg aatggaatag cgaatatgat agcattacca ttgaaggcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaaaaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgaccctgat tggtgcaagc 780 gatagccgta tggcagatga aggtcatctg cgtgttaccc tgcatcacaa ttattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080

SEQ ID NO: Sequence

ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248

53 aaagaactgg gtcatgatgt gctgaaaccg aatgatggtt gggcaagcta tggtgaaggt 60 acaaccggtg gtagcgaagc aagtccggat aatgtttata ccgttaccaa taaaagcgaa 120 ctggttcagg cactgggtgg taataatcat accaatcagt ataattccac cccgaaaatc 180 atctatgtga aaggcaccat tgaactgaac gtggatgata ataatcagcc ggttggtccg 240 gaatattatg atgatccgca ttatgatttt gaagcctatc tgaaagagta tgatccgaaa 300 aaatggggca aaaaagaagt tagcggtccg ctggaagaag cacgcgcacg tagccagaaa 360 aaacagaaag aacgtattgt tgtgaatgtg ggtagcaaca ccagcattat tggtgttggt 420 aaagatgcca aaattgtggg tggtggtttc ctgattaaaa acgtggataa tgtgatcatc 480 cgcaacatcg aatttgaagc accggttgat ttttttccgg aatgggatcc gaccgatggt 540 gaatatggcg aatggaatag cgaatatgat agcattacca tcgaaagcag ccatcatatt 600 tggatcgatc acaatacctt taccgatggc gatcatccgg ataaaagcct gggcacctat 660 tttggtcgtc cgtttcagca gcatgatggc gcactggata tcaaaaatag cagcgatttt 720 atcaccatca gctacaacgt gtttaaagac catgataaag tgagcctgat tggttcaagc 780 gatagccgta aaaccgatga aggtcatctg aaagttaccc tgcatcacaa ctattacaaa 840 aatgttaccc agcgtctgcc tcgtgttcgt tttggtcagg ttcatatcta taacaactac 900 tatgagttta gcaacctggc cgattatgac tttcagtatg catggggtgt tggtgttgaa 960 agcaaaatct atgcccagaa caactatttc agcttcgatt gggatattga cccgagcaaa 1020 attatcaaag tttggagcaa aaacgaagaa agcatgtatg aaagcggtac gattgttgat 1080 ctgccgaatg gtcgtcgtta tattgatctg gttgcaagct ataatgaaag caataccctg 1140 cagctgaaaa aagaggttgg ttggaaaccg atgttctatc atgttattca tccgaccccg 1200 agcgttccgg cactggttaa agcaaaagcc ggtgcaggta atctgcat 1248