
Title:
TOBACCO PROTEASE GENES
Document Type and Number:
WIPO Patent Application WO/2016/009006
Kind Code:
A1
Abstract:
The invention provides protease genes which are regulated in a specific manner during curing of tobacco plants material and which affect the flavour of cured tobacco.

Inventors:
BOVET LUCIEN (CH)
FLORACK DION (CH)
BATTEY JAMES (CH)
Application Number:
PCT/EP2015/066341
Publication Date:
January 21, 2016
Filing Date:
July 16, 2015
Assignee:
PHILIP MORRIS PRODUCTS SA (CH)
International Classes:
A01H5/12; C12N9/50; C12N15/82
Foreign References:
US20040106198A12004-06-03
Other References:
CATHERINE NAVARRE ET AL: "Identification, gene cloning and expression of serine proteases in the extracellular medium ofcells", PLANT CELL REPORTS, SPRINGER, BERLIN, DE, vol. 31, no. 10, 17 July 2012 (2012-07-17), pages 1959 - 1968, XP035112133, ISSN: 1432-203X, DOI: 10.1007/S00299-012-1308-Y
G. BEYENE: "Two new cysteine proteinases with specific expression patterns in mature and senescent tobacco (Nicotiana tabacum L.) leaves", JOURNAL OF EXPERIMENTAL BOTANY, vol. 57, no. 6, 1 March 2006 (2006-03-01), GB, pages 1431 - 1443, XP055220617, ISSN: 0022-0957, DOI: 10.1093/jxb/erj123
BOVET, L. ET AL.: "Gene expression changes during tobacco curing", 2013, XP002746297, Retrieved from the Internet [retrieved on 20151015]
KAWASHIMA N ET AL: "STUDIES ON PROTEIN METABOLISM IN HIGHER PLANTS V SOME PROPERTIES OF A TOBACCO-D LEAF ENZ PROTEASE INCREASED DURING CURING INST COLUMN CHROMATOGRAPHY", AGRICULTURAL AND BIOLOGICAL CHEMISTRY, vol. 32, no. 9, 1968, pages 1141 - 1145, XP002746298, ISSN: 0002-1369
FLORIAN MARTIN ET AL: "Design of a tobacco exon array with application to investigate the differential cadmium accumulation property in two tobacco varieties", BMC GENOMICS, BIOMED CENTRAL LTD, LONDON, UK, vol. 13, no. 1, 28 November 2012 (2012-11-28), pages 674, XP021140791, ISSN: 1471-2164, DOI: 10.1186/1471-2164-13-674
Attorney, Agent or Firm:
MASCHIO, Antonio (Southampton Hampshire SO15 2ET, GB)
Claims:

1. A mutant, non-naturally occurring or transgenic tobacco plant cell comprising:

(i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to any one of SEQ ID NO:1 to SEQ ID No: 80;

(ii) a polypeptide encoded by the polynucleotide set forth in (i);

(iii) a polypeptide comprising, consisting or consisting essentially of a sequence encoding a protease and having at least 95% sequence identity to SEQ ID NO:81 to SEQ ID No: 160; or

(iv) a construct, vector or expression vector comprising the isolated polynucleotide set forth in (i),

and wherein the expression or activity of said protease is modulated as compared to a control tobacco plant cell in which the expression or activity of said protease has not been altered.

2. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 1, wherein the expression or activity of said protease is upregulated compared to the control tobacco plant cell.

3. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 1, wherein the expression or activity of said protease is downregulated compared to the control tobacco plant cell.

4. A mutant, non-naturally occurring or transgenic tobacco plant cell according to any preceding claim, wherein the expression or activity is modulated of a protease selected from:

at least one of SEQ ID NO: 1 to 16; or

at least one of SEQ ID NO: 30 to 41; or

at least one of SEQ ID NO: 17 to 22; or

at least one of SEQ ID NO: 42 to 44; or

at least one of SEQ ID NO: 45 to 61; or

at least one of SEQ ID NO: 62 to 80; or

at least one of SEQ ID NO: 23 to 29.

5. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 30 to 41 is modulated in an Oriental type tobacco.

6. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from SEQ ID NO: 17 to 22 is modulated in a Virginia type tobacco.

7. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 42 to 44 is modulated in a Burley type tobacco.

8. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 45 to 61 is modulated in a Virginia or Oriental type tobacco.

9. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 62 to 80 is modulated in a Burley or Oriental type tobacco.

10. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 23 to 29 is modulated in a Burley or Virginia type tobacco.

11. The mutant, non-naturally occurring or transgenic tobacco plant cell according to any preceding claim, wherein said mutation(s) is a heterozygous or homozygous mutation.

12. The mutant, non-naturally occurring or transgenic tobacco plant cell according to any preceding claim, wherein the expression of one or more proteases is increased by about 10% to about 1000%.

13. The mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 12, wherein the expression of one or more proteases is increased by at least 10%, at least 20%, at least 25%, at least 50%, at least 100%, at least 200%, at least 500%, at least 750% or up to 1000%.

14. A mutant, non-naturally occurring or transgenic plant or component or part thereof comprising the plant cell according to any preceding claim.

15. Plant material including biomass, seed, stem, flowers or leaves from the plant of claim 14.

16. A tobacco product comprising the plant cell of any of claims 1 to 13, at least a part of the plant of claim 14 or the plant material according to claim 15.

17. A method for preparing a tobacco plant with modulated levels of protease, said method comprising the steps of:

(a) providing a plant comprising (i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to at least one of SEQ ID NO:1 to SEQ ID No: 80;

(b) inserting one or more mutations into said polynucleotide of said tobacco plant to create a mutant tobacco plant; and

(c) curing the tobacco plant material.

18. The method according to claim 17, wherein the tobacco plant in step (b) is a mutant tobacco plant, preferably, wherein said mutant tobacco plant comprises one or more mutations in one or more further sequence encoding a functional protease and having at least 95% sequence identity to at least one of SEQ ID NO:1 to SEQ ID No: 80.

19. A method according to claim 17 or claim 18, wherein the genome of a cell of a tobacco plant is modified by a genome editing technology or by genome engineering techniques selected from CRISPR/Cas technology, zinc finger nuclease-mediated mutagenesis, chemical or radiation mutagenesis, homologous recombination, oligonucleotide-directed mutagenesis and meganuclease-mediated mutagenesis.

20. A method for producing cured plant material, preferably cured leaves, or flowers with an altered flavour profile as compared to control plant material comprising the steps of:

(a) providing a plant according to claim 14 or the plant material according to claim 15;

(b) optionally harvesting the plant material therefrom; and

(c) curing the plant material for a period of time such that the levels of at least one protease are modulated compared to control cured plant material.

21. The use of

(i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to any one of SEQ ID NO:1 to SEQ ID No: 80;

(ii) a polypeptide encoded by the polynucleotide set forth in (i);

(iii) a polypeptide comprising, consisting or consisting essentially of a sequence encoding a protease and having at least 95% sequence identity to SEQ ID NO:81 to SEQ ID No: 160; or

(iv) a construct, vector or expression vector comprising the isolated polynucleotide set forth in (i),

for the modulation of the expression or activity of one or more proteases in tobacco during a tobacco curing procedure.

22. Use according to claim 21, wherein the curing procedure is selected from the group consisting of air curing, fire curing, smoke curing and flue curing.

Description:
TOBACCO PROTEASE GENES

FIELD OF THE INVENTION

The present invention concerns the use of proteases expressed in tobacco to alter the characteristics of cured tobacco products. In particular, the invention provides processes for altering the curing of tobacco leaf and modulating tobacco leaf composition by modulating the expression of one or more tobacco protease genes.

BACKGROUND OF THE INVENTION

Tobacco curing is a process of physical and biochemical changes that bring out the aroma and flavour of each variety of tobacco. After tobacco has been harvested, it is necessary to cure it and then age it before consumption, to improve its flavour. There are four common methods of curing, and the method used depends on the type of tobacco and its intended use.

Air-cured tobacco is sheltered from wind and sun in a well-ventilated chamber, where it air-dries for six to eight weeks. Air-cured tobacco is low in sugar, which gives the tobacco smoke a light, sweet flavour, and high in nicotine. Cigar and burley tobaccos are air cured. In fire curing, smoke from a low-burning fire permeates the leaves. This gives the leaves a distinctive smoky aroma and flavour. Fire curing takes three to ten weeks and produces a tobacco low in sugar and high in nicotine. Pipe tobacco, chewing tobacco, and snuff are fire cured.

Flue-cured tobacco is kept in an enclosed heated area, but it is not directly exposed to smoke. This method produces cigarette tobacco that is high in sugar and has medium to high levels of nicotine. It is the fastest method of curing, requiring about a week. Virginia tobacco that has been flue cured is also called bright tobacco, because flue curing turns its leaves gold, orange, or yellow.

Sun-cured tobacco dries uncovered in the sun. This method is used in Turkey, Greece and other Mediterranean countries to produce oriental tobacco. Sun-cured tobacco is low in sugar and nicotine and is used in cigarettes.

Curing produces various compounds in the tobacco leaves that give cured tobacco its specific flavour and taste, such as for example a sweet hay, tea, rose oil, or fruity aromatic flavor.

During the first phase of curing, corresponding to the so-called yellowing phase and also known as color curing, the chlorophyll content is reduced. This phase takes between 2 and 8 days depending on the tobacco type. During this phase leaf metabolic activities are drastically changed. Not only is chlorophyll degraded but also, for example, starch and proteins. To date, the only methods for altering the curing process which have been proposed are based on altering the actual conditions to which the tobacco is exposed in the chosen curing procedure. Very little is known about gene expression in tobacco during curing, and moreover few data have been reported on the activities of proteases in tobacco leaf and their resulting products.

We have identified 80 protease genes that are activated during leaf curing in the three main tobacco types, Burley, Virginia and Oriental. We have found that specific protease expression is associated with particular flavour profiles in tobacco.

SUMMARY OF THE INVENTION

80 protease genes (SEQ ID NO: 1-80) were identified that are up-regulated in Burley tobacco upon air curing, Virginia tobacco upon flue-curing and Oriental tobacco upon sun curing. Details on such up-regulation in one or more of the different tobacco types are summarised in Figure 2 and Tables 1 and 2.

Such gene sequences and their regulatory sequences can be used to modulate or modify protease activity during curing. The polynucleotide sequences SEQ ID NO: 1-80 include exon and intron sequences. The protein sequences relating to the coding sequence part of the polynucleotide sequences SEQ ID NO: 1-80 are depicted in SEQ ID NO: 81-160.

Accordingly, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell comprising:

(i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to any one of SEQ ID NO:1 to SEQ ID No: 80;

(ii) a polypeptide encoded by the polynucleotide set forth in (i);

(iii) a polypeptide comprising, consisting or consisting essentially of a sequence encoding a protease and having at least 95% sequence identity to SEQ ID NO:81 to SEQ ID No: 160; or

(iv) a construct, vector or expression vector comprising the isolated polynucleotide set forth in (i),

and wherein the expression or activity of said protease is modulated as compared to a control tobacco plant cell in which the expression or activity of said protease has not been altered.

Alteration of protease expression in tobacco cells during the curing process imparts different flavours to the cured tobacco and products manufactured therefrom. The effects of different genes on different tobacco flavour profiles are further discussed below.

In embodiments, the expression or activity of said protease is upregulated compared to the control tobacco plant cell. However, in certain embodiments, the expression or activity of said protease is downregulated compared to the control tobacco plant cell. In still further embodiments, at least one protease can be upregulated at the same time as at least one protease is downregulated in the same cell.

In an exemplary embodiment, therefore, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to any preceding claim, wherein the expression or activity is modulated of a protease selected from:

at least one of SEQ ID NO: 1 to 16; or

at least one of SEQ ID NO: 30 to 41; or

at least one of SEQ ID NO: 17 to 22; or

at least one of SEQ ID NO: 42 to 44; or

at least one of SEQ ID NO: 45 to 61; or

at least one of SEQ ID NO: 62 to 80; or

at least one of SEQ ID NO: 23 to 29.

In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 30 to 41 is modulated in an Oriental type tobacco. In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from SEQ ID NO: 17 to 22 is modulated in a Virginia type tobacco.

In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 42 to 44 is modulated in a Burley type tobacco. In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 45 to 61 is modulated in a Virginia or Oriental type tobacco.

In a specific embodiment, there is provided a non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 62 to 80 is modulated in a Burley or Oriental type tobacco.

In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 23 to 29 is modulated in a Burley or Virginia type tobacco.

The mutant, non-naturally occurring or transgenic tobacco plant cell can be a tobacco plant cell wherein said mutation(s) is a heterozygous or homozygous mutation. In embodiments of the invention, the expression of the one or more proteases is increased by about 10% to about 1000%, for example by at least 10%, at least 20%, at least 25%, at least 50%, at least 100%, at least 200%, at least 500%, at least 750% or up to 1000%.

In a second aspect, there is provided a mutant, non-naturally occurring or transgenic plant or component or part thereof comprising the plant cell according to the preceding aspect of the invention.

In a third aspect, there is provided plant material including biomass, seed, stem, flowers or leaves from the plant of the second aspect of the invention.

In a fourth aspect, there is provided a method for preparing a tobacco plant with modulated levels of protease, said method comprising the steps of:

(a) providing a plant comprising (i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to at least one of SEQ ID NO:1 to SEQ ID No: 80;

(b) inserting one or more mutations into said polynucleotide of said tobacco plant to create a mutant tobacco plant; and

(c) curing the tobacco plant material.

In some embodiments, the tobacco plant in step (b) is a mutant tobacco plant, preferably, wherein said mutant tobacco plant comprises one or more mutations in one or more further sequence encoding a functional protease and having at least 95% sequence identity to at least one of SEQ ID NO:1 to SEQ ID No: 80. Thus, a plant can be constructed in which one or more cells comprise multiple mutated proteases.

The mutated cells comprising modulated protease expression or activity impart a different flavour profile to tobacco leaf during the curing process. By replicating a leaf chemistry of one tobacco type in another, it is possible to transfer flavour characteristics to a tobacco type which does not normally possess those characteristics.

In embodiments, the genome of a cell of a tobacco plant is modified by a genome editing technology or by genome engineering techniques selected from CRISPR/Cas technology, zinc finger nuclease-mediated mutagenesis, chemical or radiation mutagenesis, homologous recombination, oligonucleotide-directed mutagenesis and meganuclease-mediated mutagenesis.

In a further aspect, therefore, there is provided a method for producing cured plant material, preferably cured leaves, or flowers with an altered flavour profile as compared to control plant material comprising the steps of:

(a) providing a plant or the plant material according to the foregoing aspects of the invention;

(b) optionally harvesting the plant material therefrom; and

(c) curing the plant material for a period of time such that the levels of at least one protease are modulated compared to control cured plant material.

In a still further aspect, there is provided use of

(i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to any one of SEQ ID NO:1 to SEQ ID No: 80;

(ii) a polypeptide encoded by the polynucleotide set forth in (i);

(iii) a polypeptide comprising, consisting or consisting essentially of a sequence encoding a protease and having at least 95% sequence identity to SEQ ID NO:81 to SEQ ID No: 160; or

(iv) a construct, vector or expression vector comprising the isolated polynucleotide set forth in (i),

for the modulation of the expression or activity of one or more proteases in tobacco during a tobacco curing procedure.

The curing procedure according to this aspect of the invention can be selected from the group consisting of air curing, fire curing, smoke curing and flue curing.

Modification or modulation of protease activity during curing can be through (further) up-regulation or down-regulation. Modification or modulation can be through genetic engineering using, for example, certain promoter sequences that are (at least) active during such curing. Modulation can also be through, for example, mutagenesis as claimed above of such sequences and/or their regulatory region, resulting in either up- or down-regulation, or complete knock-out, of the protease activity encoded thereby under the respective curing conditions.

In another embodiment there is provided the use of at least one of the 16 gene sequences SEQ ID NO: 1 to 16 (see Table 2), and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 16 sequences that are up-regulated in all three types of tobacco during curing for modifying the flavour of cured tobacco.

In another embodiment there is provided the use of at least one of the 12 gene sequences SEQ ID NO: 30 to 41, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 12 sequences that are up-regulated both in air-cured Burley and flue-cured Virginia, in an Oriental type tobacco to modify the flavour of said tobacco during curing.

In another embodiment there is provided the use of at least one of the 6 gene sequences SEQ ID NO: 17 to 22, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 6 sequences, that are up-regulated both in air-cured Burley and sun-cured Oriental, in a Virginia type tobacco to modify the flavour of said tobacco during curing.

In another embodiment there is provided the use of at least one of the 3 gene sequences SEQ ID NO: 42 to 44, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 3 sequences that are up-regulated in both flue-cured Virginia and sun-cured Oriental tobacco, to modify the flavour of a Burley type tobacco during curing.

In another embodiment there is provided the use of at least one of the 17 gene sequences SEQ ID NO: 45 to 61, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 17 sequences, that are uniquely up-regulated in air-cured Burley, to modify the flavour in a Virginia or Oriental type tobacco during curing.

In another embodiment there is provided the use of at least one of the 19 gene sequences SEQ ID NO: 62 to 80, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 19 sequences, that are uniquely up-regulated in flue-cured Virginia, to modify the flavour in a Burley or Oriental type tobacco during curing.

In another embodiment there is provided the use of at least one of the 7 gene sequences SEQ ID NO: 23 to 29, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 7 sequences, that are uniquely up-regulated in sun-cured Oriental, to modify the flavour of a Burley or Virginia type tobacco during curing.

As certain gene sequences are only up-regulated in one or two of the three tobacco types (as defined according to tobacco type and curing method), certain gene sequences can potentially be used to modify or modulate protease activity during curing such that the outcome with respect to leaf chemistry (for example the metabolite content of the cell) and properties of the obtained tobacco leaf cell are changed such that, for example, an air-cured Burley tobacco acquires certain characteristics of a flue-cured Virginia-type tobacco or sun-cured Oriental tobacco upon curing. This can be done, for example, by modulating the expression of one or more of the gene sequences that are up-regulated in one or two of the tobacco types and not in the other tobacco. For example, 17 gene sequences are uniquely up-regulated in air-cured Burley, 19 in flue-cured Virginia, and 12 in both types of tobacco during curing. By selectively modulating one or more of the 17 gene sequences that are only up-regulated in air-cured Burley now in flue-cured Virginia, the leaf cell composition of the flue-cured Virginia tobacco upon curing can be altered towards a more Burley type. Using a genetic engineering approach this can be achieved using, for example, a promoter sequence that is active under the curing conditions of the targeted tobacco type. Promoter sequences of use therefor are, for example, the regulatory sequences driving the expression of the gene sequences listed here. Using a mutagenesis, genome editing or engineering approach, the mutated gene sequence can be active under the curing conditions of the targeted tobacco type.

In one example, a regulatory sequence is mutated such that the gene sequence downstream is active under the desired curing conditions. For example, by selectively modifying or modulating the expression of one or more of the 19 sequences that are uniquely up-regulated in flue-cured Virginia in an air-cured Burley type of tobacco, the leaf cell composition of the Burley type tobacco upon curing can be altered towards a more Virginia type. Also, by selectively modulating the expression of one or more of the 12 sequences that are up-regulated both in air-cured Burley and flue-cured Virginia, in a sun-cured Oriental tobacco, the leaf cell composition of the sun-cured Oriental tobacco upon curing can be altered such that it acquires Burley and Virginia characteristics.
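The grouping of the 80 sequences by tobacco type, as recited in the embodiments above, can be sketched as a small lookup table. The following Python sketch is illustrative only: the names GROUPS, group_size and candidate_ranges are hypothetical and not part of the application, and the table simply restates the SEQ ID NO ranges and tobacco types given in the text. It checks that the seven groups account for all 80 sequences and lists which ranges are up-regulated in one tobacco type but not in another.

```python
# SEQ ID NO range -> tobacco types in which the text reports up-regulation
# during curing (air-cured Burley, flue-cured Virginia, sun-cured Oriental).
GROUPS = {
    (1, 16): {"Burley", "Virginia", "Oriental"},
    (17, 22): {"Burley", "Oriental"},
    (23, 29): {"Oriental"},
    (30, 41): {"Burley", "Virginia"},
    (42, 44): {"Virginia", "Oriental"},
    (45, 61): {"Burley"},
    (62, 80): {"Virginia"},
}


def group_size(seq_range):
    """Number of sequences in an inclusive SEQ ID NO range."""
    first, last = seq_range
    return last - first + 1


def candidate_ranges(source, target):
    """Hypothetical helper: ranges up-regulated in `source` but not in `target`."""
    return sorted(
        seq_range
        for seq_range, types in GROUPS.items()
        if source in types and target not in types
    )


total = sum(group_size(seq_range) for seq_range in GROUPS)
print(total)
print(candidate_ranges("Virginia", "Burley"))
print(candidate_ranges("Burley", "Virginia"))
```

Under these assumptions, the ranges returned for a Virginia source and a Burley target are the ones that the embodiments above describe for modifying a Burley type tobacco.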

In one embodiment, one of the gene sequences listed in SEQ ID Nos: 1-80 or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of the listed sequences, is up-regulated. In another embodiment more than one of the gene sequences listed in SEQ ID Nos: 1-80 or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to such sequences, are up-regulated. In another embodiment, one or more of the gene sequences listed in SEQ ID Nos: 1-80 or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to such listed sequences, are down-regulated. In another embodiment one or more of sequences listed in SEQ ID Nos: 1-80 or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to such listed sequences are up-regulated, and one or more sequences listed in SEQ ID Nos: 1-80 or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to such listed sequences are down-regulated.

As curing conditions determine the ultimate leaf cell chemistry, such modification or modulation affects the way a consumer experiences a product made from such leaf material. Hence, the invention also provides tobacco leaves and products comprising such leaves, obtained according to the methods claimed above. Such products include but are not limited to chewing tobacco, tobacco sticks, extracts obtained therefrom and other smoking articles comprising such leaf material or a material derived therefrom.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1. Expression of CYP82E4 (AGD93125.1 / GI:444237502) increased after 48h curing in the three main tobacco types. SC, sun-cured; FC, flue-cured; AC, air-cured.

Figure 2. The expression of 80 senescence-activated protease genes increased in the three main tobacco types.

Figure 3. One APA 1 tobacco gene (SEQ 68) is only expressed during Virginia Curing.

DEFINITIONS

The technical terms and expressions used within the scope of this application are generally to be given the meaning commonly applied to them in the pertinent art of plant and molecular biology. All of the following term definitions apply to the complete content of this application. The word "comprising" does not exclude other elements or steps, and the indefinite article "a" or "an" does not exclude a plurality. A single step may fulfil the functions of several features recited in the claims. The terms "about", "essentially" and "approximately" in the context of a given numerical value or range refer to a value or range that is within 20%, within 10%, or within 5%, 4%, 3%, 2% or 1% of the given value or range.
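As a purely illustrative reading of the tolerance attached to "about", the check can be written as a relative comparison. The function name is_about and the choice of the widest band (20%) as the default are assumptions made for this sketch and are not taken from the application.

```python
def is_about(value, given, tolerance=0.20):
    """Return True if `value` lies within +/- `tolerance` (a fraction) of `given`.

    The default of 0.20 corresponds to the widest band named in the
    definition; narrower bands (0.10, 0.05, ..., 0.01) can be passed in.
    """
    return abs(value - given) <= tolerance * abs(given)


print(is_about(108, 100, tolerance=0.10))
print(is_about(125, 100))
```

Here 108 is within 10% of 100, whereas 125 is outside even the 20% band.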

The term "isolated" refers to any entity that is taken from its natural milieu, but the term does not connote any degree of purification.

An "expression vector" is a nucleic acid vehicle that comprises a combination of nucleic acid components for enabling the expression of nucleic acid. Suitable expression vectors include episomes capable of extra-chromosomal replication such as circular, double-stranded nucleic acid plasmids; linearized double-stranded nucleic acid plasmids; and other functionally equivalent expression vectors of any origin. An expression vector comprises at least a promoter positioned upstream and operably-linked to a nucleic acid, nucleic acid constructs or nucleic acid conjugate, as defined below.

The term "construct" refers to a double-stranded, recombinant nucleic acid fragment comprising one or more polynucleotides. The construct comprises a "template strand" base-paired with a complementary "sense or coding strand." A given construct can be inserted into a vector in two possible orientations, either in the same (or sense) orientation or in the reverse (or anti-sense) orientation with respect to the orientation of a promoter positioned within a vector - such as an expression vector.

A "vector" refers to a nucleic acid vehicle that comprises a combination of nucleic acid components for enabling the transport of nucleic acid, nucleic acid constructs and nucleic acid conjugates and the like. Suitable vectors include episomes capable of extra-chromosomal replication such as circular, double-stranded nucleic acid plasmids; linearized double-stranded nucleic acid plasmids; and other vectors of any origin.

A "promoter" refers to a nucleic acid element/sequence, typically positioned upstream and operably-linked to a double-stranded DNA fragment. Promoters can be derived entirely from regions proximate to a native gene of interest, or can be composed of different elements derived from different native promoters or synthetic DNA segments.

The terms "homology, identity or similarity" refer to the degree of sequence similarity between two polypeptides or between two nucleic acid molecules compared by sequence alignment. The degree of homology between two discrete nucleic acid sequences being compared is a function of the number of identical, or matching, nucleotides at comparable positions. The percent identity may be determined by visual inspection and mathematical calculation. Alternatively, the percent identity of two nucleic acid sequences may be determined by comparing sequence information using a computer program such as ClustalW, BLAST, FASTA or Smith-Waterman.
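By way of illustration of the "mathematical calculation" mentioned above, percent identity over two already-aligned sequences can be sketched as follows. This is a minimal sketch under stated assumptions: the sequences are assumed to be pre-aligned to equal length with "-" as the gap character, and identity is taken over all alignment columns. Alignment programs differ in the denominator they use, so this is one convention and not necessarily the one applied in the application. The function name percent_identity is hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue.

    Assumes the two strings are already aligned (equal length, gaps as `gap`).
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no alignment columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# A 20-column toy alignment with a single mismatch.
identity = percent_identity("A" * 19 + "C", "A" * 20)
print(identity)
print(identity >= 95.0)
```

With one mismatch in 20 columns, the toy pair sits exactly at the 95% threshold recited in the claims.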

A "variant" means a substantially similar sequence. A variant can have a similar function or substantially similar function as a wild-type sequence. For a protease, a similar function is at least about 50%, 60%, 70%, 80% or 90% of wild-type enzyme function under the same conditions. For a protease, a substantially similar function is at least about 90%, 95%, 96%, 97%, 98% or 99% of wild-type enzyme function under the same conditions. For example, wild-type protease sequences are set forth in SEQ ID Nos: 81-160. The variants can have one or more mutations that result in the enzyme having a reduced level of protease activity as compared to the wild-type protease. The variants can have one or more mutations that result in their protease activity being knocked out (i.e. a 100% inhibition, and thus a nonfunctional polypeptide). Variants can also have increased activity, leading to a more active protease enzyme function.

The term "plant" refers to any plant or part of a plant at any stage of its life cycle or development, and its progenies. In one embodiment, the plant is a "tobacco plant", which refers to a plant belonging to the genus Nicotiana. Preferred species of tobacco plant are described herein.

"Plant parts" include plant cells, plant protoplasts, plant cell tissue cultures from which a whole plant can be regenerated, plant calli, plant clumps and plant cells that are intact in plants or parts of plants such as embryos, pollen, anthers, ovules, seeds, leaves, flowers, stems, branches, fruit, roots, root tips and the like. Progeny, variants and mutants of regenerated plants are also included within the scope of the disclosure, provided that they comprise the introduced polynucleotides described herein.

A "plant cell" refers to a structural and physiological unit of a plant. The plant cell may be in the form of a protoplast without a cell wall, an isolated single cell or a cultured cell, or may be part of a more highly organized unit such as, but not limited to, plant tissue, a plant organ, or a whole plant. The term "plant material" refers to any solid, liquid or gaseous composition, or a combination thereof, obtainable from a plant, including biomass, leaves, stems, roots, flowers or flower parts, fruits, pollen, egg cells, zygotes, seeds, cuttings, secretions, extracts, cell or tissue cultures, or any other parts or products of a plant. In one embodiment, the plant material comprises or consists of biomass, stem, seed or leaves. In another embodiment, the plant material comprises or consists of leaves.

The term "variety" refers to a population of plants that share constant characteristics which separate them from other plants of the same species. While possessing one or more distinctive traits, a variety is further characterized by a very small overall variation between individuals within that variety. A variety is often sold commercially.

A "type" of tobacco is defined by origin and curing method. Flue-cured tobacco, which accounts for 40% of global production, is also known as "Bright" and "Virginia" tobacco. It is used almost entirely in cigarette blends. Some of the heavier leaves may be used in mixtures for pipe smoking. Some English cigarettes are 100% flue-cured. Flue-cured leaf is characterized by a high sugar to nitrogen ratio. This ratio is enhanced by the picking of the leaf in an advanced stage of ripeness, and by the unique curing process which allows certain chemical changes to occur in the leaf. Cured leaves vary from lemon to orange to mahogany in colour.

Burley is a light air-cured type derived from the White Burley, which arose as a mutant on a farm in Ohio in 1864. Burley is used primarily in cigarette blends. Some of the heavier leaf is used in pipe blends and also for chewing.

Cured Burley leaf is characterized by low sugar content and a very low sugar to nitrogen ratio (high nicotine). This is enhanced by high nitrogen fertilizer, harvesting at an early stage of senescence, and the air curing process, which allows oxidation of any sugars which may have occurred.

Maryland is another light air-cured type. It is used to some extent in American blended cigarettes and to a greater extent in certain Swiss cigarette blends.

Maryland tobacco is extremely fluffy, has good burning properties, low nicotine, and neutral aroma.

Dark air-cured tobacco encompasses a number of types used mainly for chewing, snuff, cigar, and pipe blends. Most of the world production is confined to the tropics.

Oriental tobacco gives a mild smoke with a very characteristic aroma. Resins, waxes and gum exuded by glandular hairs (trichomes) furnish the aroma. Nicotine is low, averaging around 1.0%.

Dark-fired tobacco is used in the production of snuff, chewing tobacco, and pipe blends. Dark-fired leaves are subjected to smoke from smoldering wood during the early stage of curing. The type of wood used is very important in determining taste and grown. Cured leaves are very dark in color and are long and heavy bodied.

The term "modulating" may refer to reducing, inhibiting, increasing or otherwise affecting the expression or activity of a polypeptide. The term may also refer to reducing, inhibiting, increasing or otherwise affecting the activity of a gene encoding a polypeptide which can include, but is not limited to, modulating transcriptional activity.

The term "reduce" or "reduced" as used herein, refers to a reduction of from about 10% to about 99%, or a reduction of at least 10%, at least 20%, at least 25%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or at least 100% or more of a quantity or an activity, such as but not limited to polypeptide activity, transcriptional activity and protein expression. The term "inhibit" or "inhibited" as used herein, refers to a reduction of from about 98% to about 100%, or a reduction of at least 98%, at least 99%, but particularly of 100%, of a quantity or an activity, such as but not limited to polypeptide activity, transcriptional activity and protein expression.

The term "increase" or "increased" as used herein, refers to an increase of from about 5% to about 99%, or an increase of at least 5%, at least 10%, at least 20%, at least 25%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, at least 100%, at least 500% or at least 1000% or more of a quantity or an activity, such as but not limited to polypeptide activity, transcriptional activity and protein expression.

The term "control" in the context of a control plant means a plant or plant cell in which the expression or activity of an enzyme has not been modified (for example, increased or reduced), so that it can provide a comparison with a plant in which the expression or activity of the enzyme has been modified. The control plant may comprise an empty vector. The control plant or plant cell may correspond to a wild-type plant or wild-type plant cell. For example, the control plant or plant cell can be the same genotype as the starting material for the genetic alteration that resulted in the subject plant. In all such cases, the subject plant and the control plant are cultured and harvested using the same protocols for comparative purposes. Changes in levels, ratios, activity, or distribution of the genes or polypeptides described herein, or changes in tobacco plant phenotype, particularly reduced production of proteases, can be measured by comparing a subject plant to the control plant, where the subject plant and the control plant have been cultured, harvested and cured using the same protocols. The control plant can provide a reference point for measuring changes in phenotype of the subject plant. Changes in phenotype can be measured at any time in a plant, including during plant development, senescence, or preferably after curing. Changes in phenotype can be measured in plants grown under any conditions, including plants grown in a growth chamber, a greenhouse, or a field. Changes in phenotype can be measured by determining the expression or activity of the proteases identified herein in SEQ ID NOs: 81-160.

DETAILED DESCRIPTION

In one embodiment, there is provided an isolated polynucleotide comprising, consisting or consisting essentially of a polynucleotide sequence having at least 95% sequence identity to any of the sequences described herein, including any of the polynucleotides shown in the sequence listing. Suitably, the isolated polynucleotide comprises, consists or consists essentially of a sequence having at least 95%, 96%, 97%, 98%, 99% or 100% sequence identity thereto.

Suitably, the polynucleotide(s) described herein encode a protein with protease activity that is at least about 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, 100% or more of the activity of the protein set forth in SEQ ID NOs: 81-160.

A polynucleotide as described herein can include a polymer of nucleotides, which may be unmodified or modified deoxyribonucleic acid (DNA) or ribonucleic acid (RNA). Accordingly, a polynucleotide can be, without limitation, a genomic DNA, complementary DNA (cDNA), mRNA, or antisense RNA or a fragment(s) thereof. Moreover, a polynucleotide can be single-stranded or double-stranded DNA, DNA that is a mixture of single-stranded and double-stranded regions, a hybrid molecule comprising DNA and RNA, or a hybrid molecule with a mixture of single-stranded and double-stranded regions or a fragment(s) thereof. In addition, the polynucleotide can be composed of triple-stranded regions comprising DNA, RNA, or both or a fragment(s) thereof. A polynucleotide can contain one or more modified bases, such as phosphorothioates, and can be a peptide nucleic acid. Generally, polynucleotides can be assembled from isolated or cloned fragments of cDNA, genomic DNA, oligonucleotides, or individual nucleotides, or a combination of the foregoing. Although the polynucleotide sequences described herein are shown as DNA sequences, the sequences include their corresponding RNA sequences, and their complementary (for example, completely complementary) DNA or RNA sequences, including the reverse complements thereof.

A polynucleotide as described herein will generally contain phosphodiester bonds, although in some cases, polynucleotide analogues are included that may have alternate backbones, comprising, for example, phosphoramidate, phosphorothioate, phosphorodithioate, or O-methylphosphoroamidite linkages; and peptide polynucleotide backbones and linkages. Other analogue polynucleotides include those with positive backbones, non-ionic backbones, and non-ribose backbones. Modifications of the ribose-phosphate backbone may be done for a variety of reasons, for example, to increase the stability and half-life of such molecules in physiological environments or as probes on a biochip. Mixtures of naturally occurring polynucleotides and analogues can be made; alternatively, mixtures of different polynucleotide analogues may be made.

A variety of polynucleotide analogues are known, including, for example, phosphoramidate, phosphorothioate, phosphorodithioate, O-methylphosphoroamidite linkages and peptide polynucleotide backbones and linkages. Other analogue polynucleotides include those with positive backbones, non-ionic backbones and non-ribose backbones. Polynucleotides containing one or more carbocyclic sugars are also included.

Other analogues include peptide polynucleotides (peptide nucleic acid analogues). These backbones are substantially non-ionic under neutral conditions, in contrast to the highly charged phosphodiester backbone of naturally occurring polynucleotides. This may result in several advantages. First, the peptide polynucleotide backbone may exhibit improved hybridization kinetics. Second, peptide polynucleotides have larger changes in the melting temperature for mismatched versus perfectly matched base pairs: DNA and RNA typically exhibit a 2-4 °C drop in melting temperature for an internal mismatch, whereas with the non-ionic peptide polynucleotide backbone the drop is closer to 7-9 °C. Similarly, due to their non-ionic nature, hybridization of the bases attached to these backbones is relatively insensitive to salt concentration. In addition, peptide polynucleotides may not be degraded, or may be degraded to a lesser extent, by cellular enzymes, and thus may be more stable.

Among the uses of the disclosed polynucleotides, and fragments thereof, is the use of fragments as probes in nucleic acid hybridisation assays or primers for use in nucleic acid amplification assays. Such fragments generally comprise at least about 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more contiguous nucleotides of a DNA sequence. In other embodiments, a DNA fragment comprises at least about 10, 15, 20, 30, 40, 50 or 60 or more contiguous nucleotides of a DNA sequence. Thus, in one aspect, there is also provided a method for detecting a polynucleotide encoding a protein with nicotine N-demethylase activity member or encoding a nicotine N-demethylase enzyme comprising the use of the probes or primers or both.

The basic parameters affecting the choice of hybridization conditions and guidance for devising suitable conditions are described by Sambrook, J., E. F. Fritsch, and T. Maniatis (1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.). Using knowledge of the genetic code in combination with the amino acid sequences described herein, sets of degenerate oligonucleotides can be prepared. Such oligonucleotides are useful as primers, for example, in polymerase chain reactions (PCR), whereby DNA fragments are isolated and amplified. In certain embodiments, degenerate primers can be used as probes for genetic libraries. Such libraries would include but are not limited to cDNA libraries, genomic libraries, and even electronic expressed sequence tag or DNA libraries. Homologous sequences identified by this method would then be used as probes to identify homologues of the sequences identified herein.
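As a non-limiting illustration of how the genetic code constrains a set of degenerate oligonucleotides, the following sketch counts how many distinct DNA sequences could encode a given peptide under the standard genetic code, and finds the stretch of a protein with the fewest such sequences. The function names and the example peptide are hypothetical and are not taken from the sequence listing; selection of primer sites in practice would involve further criteria not modelled here.

```python
# Number of codons per amino acid in the standard genetic code (61 sense codons).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def primer_degeneracy(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the peptide."""
    total = 1
    for residue in peptide.upper():
        total *= CODON_COUNTS[residue]  # KeyError for non-standard residues
    return total


def least_degenerate_window(protein: str, length: int) -> tuple[int, str, int]:
    """Return (start index, peptide, degeneracy) of the least degenerate window.

    Ties are resolved in favour of the first window encountered.
    """
    if length <= 0 or length > len(protein):
        raise ValueError("window length must be between 1 and the protein length")
    best = None
    for start in range(len(protein) - length + 1):
        window = protein[start:start + length]
        score = primer_degeneracy(window)
        if best is None or score < best[2]:
            best = (start, window, score)
    return best


if __name__ == "__main__":
    # Hypothetical peptide used only to exercise the functions.
    print(least_degenerate_window("LSRMWKDL", 4))
```

Windows rich in methionine and tryptophan, which each have a single codon, give the smallest oligonucleotide sets.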

Also of potential use are polynucleotides and oligonucleotides (for example, primers or probes) that hybridize under reduced stringency conditions, typically moderately stringent conditions, and commonly highly stringent conditions to the polynucleotide(s) as described herein. The basic parameters affecting the choice of hybridization conditions and guidance for devising suitable conditions are set forth by Sambrook, J., E. F. Fritsch, and T. Maniatis (1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.) and can be readily determined by those having ordinary skill in the art based on, for example, the length or base composition of the polynucleotide. One way of achieving moderately stringent conditions involves the use of a prewashing solution containing 5x Standard Sodium Citrate, 0.5% Sodium Dodecyl Sulphate, 1.0 mM Ethylenediaminetetraacetic acid (pH 8.0), hybridization buffer of about 50% formamide, 6x Standard Sodium Citrate, and a hybridization temperature of about 55 °C (or other similar hybridization solutions, such as one containing about 50% formamide, with a hybridization temperature of about 42 °C), and washing conditions of about 60 °C, in 0.5x Standard Sodium Citrate, 0.1% Sodium Dodecyl Sulphate. Generally, highly stringent conditions are defined as hybridization conditions as above, but with washing at approximately 68 °C, 0.2x Standard Sodium Citrate, 0.1% Sodium Dodecyl Sulphate. SSPE (1x SSPE is 0.15 M sodium chloride, 10 mM sodium phosphate, and 1.25 mM Ethylenediaminetetraacetic acid, pH 7.4) can be substituted for Standard Sodium Citrate (1x Standard Sodium Citrate is 0.15 M sodium chloride and 15 mM sodium citrate) in the hybridization and wash buffers; washes are performed for 15 minutes after hybridization is complete.
It should be understood that the wash temperature and wash salt concentration can be adjusted as necessary to achieve a desired degree of stringency by applying the basic principles that govern hybridization reactions and duplex stability, as known to those skilled in the art and described further below (see, for example, Sambrook, J., E. F. Fritsch, and T. Maniatis (1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.)). When hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed to be that of the hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, the hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region or regions of optimal sequence complementarity. The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should be 5 to 10 °C less than the melting temperature of the hybrid, where melting temperature is determined according to the following equations. For hybrids less than 18 base pairs in length, melting temperature (°C) = 2(number of A+T bases) + 4(number of G+C bases). For hybrids above 18 base pairs in length, melting temperature (°C) = 81.5 + 16.6(log10 [Na+]) + 0.41(% G+C) - (600/N), where N is the number of bases in the hybrid, and [Na+] is the concentration of sodium ions in the hybridization buffer ([Na+] for 1x Standard Sodium Citrate = 0.165 M). Typically, each such hybridizing polynucleotide has a length that is at least 25% (commonly at least 50%, 60%, or 70%, and most commonly at least 80%) of the length of a polynucleotide to which it hybridizes, and has at least 60% sequence identity (for example, at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100%) with a polynucleotide to which it hybridizes.
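For illustration, the two melting temperature equations given above, and the rule that hybridization is carried out 5 to 10 °C below the melting temperature for hybrids shorter than 50 base pairs, can be sketched as follows. The function names are hypothetical. Because the text addresses hybrids "less than 18" and "above 18" base pairs, the treatment of a hybrid of exactly 18 base pairs with the second equation is an assumption of this sketch, as is the default sodium concentration of 0.165 M (the value stated for 1x Standard Sodium Citrate).

```python
import math


def melting_temperature(seq: str, na_molar: float = 0.165) -> float:
    """Melting temperature (°C) of a hybrid, using the two equations in the text.

    Assumption: a hybrid of exactly 18 bases uses the long-hybrid equation.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0 or set(seq) - set("ACGT"):
        raise ValueError("sequence must be a non-empty string of A, C, G, T")
    if na_molar <= 0:
        raise ValueError("sodium concentration must be positive")

    gc = seq.count("G") + seq.count("C")
    at = n - gc
    if n < 18:
        # Tm = 2(A+T) + 4(G+C)
        return 2.0 * at + 4.0 * gc

    # Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%G+C) - 600/N
    percent_gc = 100.0 * gc / n
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * percent_gc - 600.0 / n


def hybridization_range(seq: str, na_molar: float = 0.165) -> tuple[float, float]:
    """Hybridization temperature range, 10 to 5 °C below Tm, for hybrids < 50 bp."""
    if len(seq) >= 50:
        raise ValueError("the 5-10 °C rule in the text applies to hybrids under 50 bp")
    tm = melting_temperature(seq, na_molar)
    return (tm - 10.0, tm - 5.0)


if __name__ == "__main__":
    print(melting_temperature("ACGTACGTAC"))   # short-hybrid equation
    print(melting_temperature("ACGT" * 5))     # long-hybrid equation
    print(hybridization_range("ACGTACGTAC"))
```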

As will be understood by the person skilled in the art, a linear DNA has two possible orientations: the 5'-to-3' direction and the 3'-to-5' direction. For example, if a reference sequence is positioned in the 5'-to-3' direction, and if a second sequence is positioned in the 5'-to-3' direction within the same polynucleotide molecule/strand, then the reference sequence and the second sequence are orientated in the same direction, or have the same orientation. Typically, a promoter sequence and a gene of interest under the regulation of the given promoter are positioned in the same orientation. However, with respect to the reference sequence positioned in the 5'-to-3' direction, if a second sequence is positioned in the 3'-to-5' direction within the same polynucleotide molecule/strand, then the reference sequence and the second sequence are orientated in anti-sense direction, or have anti-sense orientation. Two sequences having anti-sense orientations with respect to each other can be alternatively described as having the same orientation, if the reference sequence (5'-to-3' direction) and the reverse complementary sequence of the reference sequence (reference sequence positioned in the 5'-to-3' direction) are positioned within the same polynucleotide molecule/strand. The sequences set forth herein are shown in the 5'-to-3' direction.
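The relationship between orientation and the reverse complement can be illustrated with a short sketch. It assumes unambiguous DNA letters (A, C, G, T) and a strand written in the 5'-to-3' direction; the function names and example sequences are hypothetical. A query found as written is reported as being in the sense orientation, and a query found only as its reverse complement is reported as anti-sense.

```python
# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5'-to-3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def orientation_in_strand(strand: str, query: str) -> str:
    """Classify a query relative to a strand written 5'-to-3'.

    Returns "sense" if the query occurs as written, "antisense" if only its
    reverse complement occurs, and "absent" otherwise. A query that occurs
    in both forms is reported as "sense".
    """
    strand = strand.upper()
    query = query.upper()
    if query in strand:
        return "sense"
    if reverse_complement(query) in strand:
        return "antisense"
    return "absent"


if __name__ == "__main__":
    strand = "TTATGCAAGG"  # hypothetical strand, 5'-to-3'
    print(orientation_in_strand(strand, "ATGCAA"))
    print(orientation_in_strand(strand, "TTGCAT"))
```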

Recombinant constructs provided herein can be used to transform plants or plant cells in order to modulate protein expression and/or activity levels. A recombinant polynucleotide construct can comprise a polynucleotide encoding one or more polypeptides as described herein, operably linked to a regulatory region suitable for expressing the polypeptide. Thus, a polynucleotide can comprise a coding sequence that encodes the polypeptide as described herein. Plants or plant cells in which protein expression and/or activity levels are modulated can include mutant, non-naturally occurring, transgenic, man-made or genetically engineered plants or plant cells. Suitably, the transgenic plant or plant cell comprises a genome that has been altered by the stable integration of recombinant DNA. Recombinant DNA includes DNA which has been genetically engineered and constructed outside of a cell and includes DNA containing naturally occurring DNA or cDNA or synthetic DNA. A transgenic plant can include a plant regenerated from an originally-transformed plant cell and progeny transgenic plants from later generations or crosses of a transformed plant. Suitably, the transgenic modification alters the expression or activity of the polynucleotide or the polypeptide described herein as compared to a control plant.

The polypeptide encoded by a recombinant polynucleotide can be a native polypeptide, or can be heterologous to the cell. In some cases, the recombinant construct contains a polynucleotide that modulates expression, operably linked to a regulatory region. Examples of suitable regulatory regions are described herein.

Vectors containing recombinant polynucleotide constructs such as those described herein are also provided. Suitable vector backbones include, for example, those routinely used in the art such as plasmids, viruses, artificial chromosomes, bacterial artificial chromosomes, yeast artificial chromosomes, or bacteriophage artificial chromosomes. Suitable expression vectors include, without limitation, plasmids and viral vectors derived from, for example, bacteriophage, baculoviruses, and retroviruses. Numerous vectors and expression systems are commercially available. The vectors can include, for example, origins of replication, scaffold attachment regions or markers. A marker gene can confer a selectable phenotype on a plant cell. For example, a marker can confer biocide resistance, such as resistance to an antibiotic (for example, kanamycin, G418, bleomycin, or hygromycin), or an herbicide (for example, glyphosate, chlorsulfuron or phosphinothricin). In addition, an expression vector can include a tag sequence designed to facilitate manipulation or detection (for example, purification or localization) of the expressed polypeptide. Tag sequences, such as luciferase, beta-glucuronidase, green fluorescent protein, glutathione S-transferase, polyhistidine, c-myc or hemagglutinin sequences typically are expressed as a fusion with the encoded polypeptide. Such tags can be inserted anywhere within the polypeptide, including at either the carboxyl or amino terminus.

A plant or plant cell can be transformed by having the recombinant polynucleotide integrated into its genome to become stably transformed. The plant or plant cell described herein can be stably transformed. Stably transformed cells typically retain the introduced polynucleotide with each cell division. A plant or plant cell can be transiently transformed such that the recombinant polynucleotide is not integrated into its genome. Transiently transformed cells typically lose all or some portion of the introduced recombinant polynucleotide with each cell division such that the introduced recombinant polynucleotide cannot be detected in daughter cells after a sufficient number of cell divisions.

A number of methods are available in the art for transforming a plant cell which are all encompassed herein, including biolistics, gene gun techniques, Agrobacterium-mediated transformation, viral vector-mediated transformation and electroporation. The Agrobacterium system for integration of foreign DNA into plant chromosomes has been extensively studied, modified, and exploited for plant genetic engineering. Naked recombinant DNA molecules comprising DNA sequences corresponding to the subject purified tobacco protein operably linked, in the sense or antisense orientation, to regulatory sequences are joined to appropriate T-DNA sequences by conventional methods. These are introduced into tobacco protoplasts by polyethylene glycol techniques or by electroporation techniques, both of which are standard. Alternatively, such vectors comprising recombinant DNA molecules encoding the subject purified tobacco protein are introduced into live Agrobacterium cells, which then transfer the DNA into the tobacco plant cells. Transformation by naked DNA without accompanying T-DNA vector sequences can be accomplished via fusion of tobacco protoplasts with DNA-containing liposomes or via electroporation. Naked DNA unaccompanied by T-DNA vector sequences can also be used to transform tobacco cells via inert, high velocity microprojectiles.

If a cell or cultured tissue is used as the recipient tissue for transformation, plants can be regenerated from transformed cultures if desired, by techniques known to those skilled in the art.

The choice of regulatory regions to be included in a recombinant construct depends upon several factors, including, but not limited to, efficiency, selectability, inducibility, desired expression level, and cell- or tissue-preferential expression. It is a routine matter for one of skill in the art to modulate the expression of a coding sequence by appropriately selecting and positioning regulatory regions relative to the coding sequence. Transcription of a polynucleotide can be modulated in a similar manner. Some suitable regulatory regions initiate transcription only, or predominantly, in certain cell types. Methods for identifying and characterizing regulatory regions in plant genomic DNA are known in the art.

Suitable promoters include tissue-specific promoters recognized by tissue-specific factors present in different tissues or cell types (for example, root-specific promoters, shoot-specific promoters, xylem-specific promoters), or present during different developmental stages, or present in response to different environmental conditions. Suitable promoters include constitutive promoters that can be activated in most cell types without requiring specific inducers. Examples of suitable promoters for controlling RNAi polypeptide production include the cauliflower mosaic virus 35S (CaMV/35S), SSU, OCS, Iib4, usp, STLS1, B33, nos or ubiquitin- or phaseolin-promoters. Persons skilled in the art are capable of generating multiple variations of recombinant promoters.

Tissue-specific promoters are transcriptional control elements that are only active in particular cells or tissues at specific times during plant development, such as in vegetative tissues or reproductive tissues. Tissue-specific expression can be advantageous, for example, when the expression of polynucleotides in certain tissues is preferred. Examples of tissue-specific promoters under developmental control include promoters that can initiate transcription only (or primarily only) in certain tissues, such as vegetative tissues, for example, roots or leaves, or reproductive tissues, such as fruit, ovules, seeds, pollen, pistils, flowers, or any embryonic tissue. Reproductive tissue-specific promoters may be, for example, anther-specific, ovule-specific, embryo-specific, endosperm-specific, integument-specific, seed and seed coat-specific, pollen-specific, petal-specific, sepal-specific, or combinations thereof.

Suitable leaf-specific promoters include the pyruvate, orthophosphate dikinase (PPDK) promoter from a C4 plant (maize), the cab-m1 Ca+2 promoter from maize, the Arabidopsis thaliana myb-related gene promoter (Atmyb5), and the ribulose bisphosphate carboxylase (RBCS) promoters (for example, the tomato RBCS1, RBCS2 and RBCS3A genes expressed in leaves and light-grown seedlings, RBCS1 and RBCS2 expressed in developing tomato fruits, or a ribulose bisphosphate carboxylase promoter expressed almost exclusively in mesophyll cells in leaf blades and leaf sheaths at high levels).

Suitable senescence-specific promoters include a tomato promoter active during fruit ripening, senescence and abscission of leaves, a maize promoter of a gene encoding a cysteine protease, the promoter of 82E4 and the promoter of SAG genes. Suitable anther-specific promoters can be used. Suitable root-preferred promoters known to persons skilled in the art may be selected. Suitable seed-preferred promoters include both seed-specific promoters (those promoters active during seed development such as promoters of seed storage proteins) and seed-germinating promoters (those promoters active during seed germination). Such seed-preferred promoters include, but are not limited to, Cim1 (cytokinin-induced message); cZ19B1 (maize 19 kDa zein); milps (myo-inositol-1-phosphate synthase); mZE40-2, also known as Zm-40; nuclc; and celA (cellulose synthase). Gamma-zein is an endosperm-specific promoter. Glob-1 is an embryo-specific promoter. For dicots, seed-specific promoters include, but are not limited to, bean beta-phaseolin, napin, β-conglycinin, soybean lectin, cruciferin, and the like. For monocots, seed-specific promoters include, but are not limited to, a maize 15 kDa zein promoter, a 22 kDa zein promoter, a 27 kDa zein promoter, a g-zein promoter, a 27 kDa gamma-zein promoter (such as gzw64A promoter, see Genbank Accession number S78780), a waxy promoter, a shrunken 1 promoter, a shrunken 2 promoter, a globulin 1 promoter (see Genbank Accession number L22344), an Itp2 promoter, cim1 promoter, maize end1 and end2 promoters, nuc1 promoter, Zm40 promoter, eep1 and eep2; lec1, thioredoxin H promoter; mlip15 promoter, PCNA2 promoter; and the shrunken-2 promoter.

Examples of inducible promoters include promoters responsive to pathogen attack, anaerobic conditions, elevated temperature, light, drought, cold temperature, or high salt concentration. Pathogen-inducible promoters include those from pathogenesis-related proteins (PR proteins), which are induced following infection by a pathogen (for example, PR proteins, SAR proteins, beta-1,3-glucanase, chitinase). In addition to plant promoters, other suitable promoters may be of bacterial origin, for example, the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from Ti plasmids, or may be derived from viral promoters (for example, the 35S and 19S RNA promoters of cauliflower mosaic virus (CaMV), constitutive promoters of tobacco mosaic virus, or the figwort mosaic virus 35S promoter).

Preferred promoters include the control elements provided herein, as part of SEQ ID NOs: 1-80, which demonstrate desirable expression during curing procedures in tobacco leaf.

In another aspect, there is provided an isolated polypeptide comprising, consisting or consisting essentially of a polypeptide sequence having at least 95% sequence identity to any of the polypeptide sequences described herein, including any of the polypeptides shown in the sequence listing. Suitably, the isolated polypeptide comprises, consists or consists essentially of a sequence having at least 95%, 96%, 97%, 98%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8%, 99.9% or 100% sequence identity thereto.

The polypeptide can include sequences comprising a sufficient or substantial degree of identity or similarity to SEQ ID NOs: 81-160 to function as proteases. Fragments of the polypeptide(s) typically retain some or all of the activity of the full length sequence.

As discussed herein, the polypeptides also include mutants produced by introducing any type of alterations (for example, insertions, deletions, or substitutions of amino acids; changes in glycosylation states; changes that affect refolding or isomerizations, three-dimensional structures, or self-association states), which can be deliberately engineered or isolated naturally provided that they still have some or all of their function or activity as a protease. Suitably, the function or activity as a protease is modulated, increased or reduced. Polypeptides include variants produced by introducing any type of alterations (for example, insertions, deletions, or substitutions of amino acids; changes in glycosylation states; changes that affect refolding or isomerizations, three-dimensional structures, or self-association states), which can be deliberately engineered or isolated naturally. The variant may have alterations which produce a silent change and result in a functionally equivalent protein. Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and the amphipathic nature of the residues as long as the secondary binding activity of the substance is retained. For example, negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, serine, threonine, phenylalanine, and tyrosine. Conservative substitutions may be made, for example according to the Table below. Amino acids in the same block in the second column and preferably in the same line in the third column may be substituted for each other:

The polypeptide may be a mature protein or an immature protein or a protein derived from an immature protein. Polypeptides may be in linear form or cyclized using known methods. Polypeptides typically comprise at least 10, at least 20, at least 30, or at least 40 contiguous amino acids.

A tobacco plant or plant cell comprising a mutation in a gene encoding a protease as described herein is disclosed, wherein said mutation results in modulated expression or modulated function of said protease. The expression or function of the protease(s) may be enhanced. Aside from one or more mutations in said protease, the mutant plants or plant cells can have one or more further mutations in one or more other genes or polypeptides. In certain embodiments, aside from the one or more mutations in a protease gene, the mutants can have one or more further mutations in one or more other genes or polypeptides - such as one or more other protease genes or polypeptides as described in the Sequence Listing. Suitably, a protease is expressed in the leaves of the mutant plant during the curing procedure.

There is also provided a method for modulating the level of a protease in a (cured) tobacco plant or in (cured) tobacco plant material, said method comprising introducing into the genome of said plant one or more mutations that modulate expression of at least one protease gene, wherein said at least one protease gene is selected from SEQ ID NOs: 1-80. There is also provided a method for identifying a tobacco plant with increased levels of protease, said method comprising screening a nucleic acid sample from a tobacco plant of interest for the presence of one or more mutations in SEQ ID NOs: 1-80, and optionally correlating the identified mutation(s) with mutation(s) that are known to modulate levels of protease. There is also disclosed a tobacco plant or plant cell that is heterozygous or homozygous for mutations in a gene encoding a protease, wherein said mutation results in modulated (enhanced or reduced) expression or function of said protease.

A number of approaches can be used to combine mutations in one plant including sexual crossing. A plant having one or more favourable heterozygous or homozygous mutations in a protease gene that enhances or reduces protease expression or activity can be crossed with a plant having one or more favourable heterozygous or homozygous mutations in one or more other protease genes that enhance or reduce protease activity. In one embodiment, crosses are made in order to introduce one or more favourable heterozygous or homozygous mutations within a protease gene within the same plant.

The activity of one or more protease polypeptides in a tobacco plant is reduced or enhanced according to the present disclosure if the protease activity is statistically lower or higher than the protease activity of the same protease(s) in a tobacco plant that has not been modified to inhibit the activity of that protease polypeptide and which has been cultured, harvested and cured using the same protocols.
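
As one illustration of how "statistically lower or higher" might be assessed, the following Python sketch runs an exact two-sided permutation test on protease activity measurements from modified and unmodified plants. The choice of test, the function name and the activity values are assumptions made purely for illustration; the disclosure does not prescribe a particular statistical method.

```python
from itertools import combinations
from statistics import mean


def exact_permutation_p(modified, control):
    """Two-sided exact permutation test on the difference of group means.

    Every way of splitting the pooled measurements into two groups of the
    original sizes is enumerated; the p-value is the share of splits whose
    absolute mean difference is at least as large as the observed one.
    """
    pooled = list(modified) + list(control)
    n_modified = len(modified)
    observed = abs(mean(modified) - mean(control))
    indices = range(len(pooled))
    total = 0
    at_least_as_extreme = 0
    for chosen in combinations(indices, n_modified):
        chosen_set = set(chosen)
        group_a = [pooled[i] for i in chosen]
        group_b = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        # Small tolerance guards against floating-point rounding.
        if abs(mean(group_a) - mean(group_b)) >= observed - 1e-12:
            at_least_as_extreme += 1
    return at_least_as_extreme / total


# Hypothetical activity values (arbitrary units), not measured data.
modified_plants = [1.0, 1.2, 1.1, 0.9]
control_plants = [2.0, 2.2, 2.1, 1.9]
p_value = exact_permutation_p(modified_plants, control_plants)
```

Exhaustive enumeration is only practical for small sample sizes; for larger experiments a sampled permutation test or a parametric test could be substituted.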

In some embodiments, the mutation(s) is introduced into a tobacco plant or plant cell using a mutagenesis approach, and the introduced mutation is identified or selected using methods known to those of skill in the art - such as Southern blot analysis, DNA sequencing, PCR analysis, or phenotypic analysis. Mutations that impact gene expression or that interfere with the function of the encoded protein can be determined using methods that are well known in the art. Insertional mutations in gene exons usually result in null-mutants. Mutations in conserved residues can be particularly effective in inhibiting the metabolic function of the encoded protein.

Methods for obtaining mutant polynucleotides and polypeptides are also disclosed. Any plant of interest, including a plant cell or plant material can be genetically modified by various methods known to induce mutagenesis, including site-directed mutagenesis, oligonucleotide-directed mutagenesis, chemically-induced mutagenesis, irradiation-induced mutagenesis, mutagenesis utilizing modified bases, mutagenesis utilizing gapped duplex DNA, double-strand break mutagenesis, mutagenesis utilizing repair-deficient host strains, mutagenesis by total gene synthesis, DNA shuffling and other equivalent methods.

Fragments of protease polynucleotides and polypeptides encoded thereby are also disclosed. Fragments of a polynucleotide may encode protein fragments that retain the biological activity of the native protein and hence retain protease activity. Alternatively, fragments of a polynucleotide that are useful as hybridization probes or PCR primers generally do not encode fragment proteins retaining biological activity. Furthermore, fragments of the disclosed nucleotide sequences include those that can be assembled within recombinant constructs as discussed herein. Fragments of a polynucleotide sequence may range from at least about 25 nucleotides, about 50 nucleotides, about 75 nucleotides, about 100 nucleotides, about 150 nucleotides, about 200 nucleotides, about 250 nucleotides, about 300 nucleotides, about 400 nucleotides, about 500 nucleotides, about 600 nucleotides, about 700 nucleotides, about 800 nucleotides, about 900 nucleotides, about 1000 nucleotides, about 1100 nucleotides, about 1200 nucleotides, about 1300 nucleotides or about 1400 nucleotides and up to the full-length polynucleotide encoding the polypeptides described herein. Fragments of a polypeptide sequence may range from at least about 25 amino acids, about 50 amino acids, about 75 amino acids, about 100 amino acids, about 150 amino acids, about 200 amino acids, about 250 amino acids, about 300 amino acids, about 400 amino acids, about 500 amino acids, and up to the full-length polypeptide described herein.

Mutant polypeptide variants can be used to create mutant, non-naturally occurring or transgenic plants (for example, mutant, non-naturally occurring, transgenic, man-made or genetically engineered plants) or plant cells comprising one or more mutant polypeptide variants. Suitably, mutant polypeptide variants retain the activity of the unmutated polypeptide. The activity of the mutant polypeptide variant may be higher, lower or about the same as the unmutated polypeptide.

Mutations in the nucleotide sequences and polypeptides described herein can include man-made mutations or synthetic mutations or genetically engineered mutations. Mutations in the nucleotide sequences and polypeptides described herein can be mutations that are obtained or obtainable via a process which includes an in vitro or an in vivo manipulation step. Mutations in the nucleotide sequences and polypeptides described herein can be mutations that are obtained or obtainable via a process which includes intervention by man. By way of example, the process may include mutagenesis using exogenously added chemicals - such as mutagenic, teratogenic, or carcinogenic organic compounds, for example ethyl methanesulfonate (EMS), that produce random mutations in genetic material. By way of further example, the process may include one or more genetic engineering steps - such as one or more of the genetic engineering steps that are described herein or combinations thereof. By way of further example, the process may include one or more plant crossing steps.

A polypeptide may be prepared by culturing transformed or recombinant host cells under culture conditions suitable to express a polypeptide. The resulting expressed polypeptide may then be purified from such culture using known purification processes. The purification of the polypeptide may include an affinity column containing agents which will bind to the polypeptide; one or more column steps over such affinity resins; one or more steps involving hydrophobic interaction chromatography; or immunoaffinity chromatography. Alternatively, the polypeptide may also be expressed in a form that will facilitate purification. For example, it may be expressed as a fusion polypeptide, such as those of maltose binding polypeptide, glutathione-S-transferase, his-tag or thioredoxin. Kits for expression and purification of fusion polypeptides are commercially available. The polypeptide may be tagged with an epitope and subsequently purified by using a specific antibody directed to such epitope. One or more liquid chromatography steps - such as reverse-phase high performance liquid chromatography - can be employed to further purify the polypeptide. Some or all of the foregoing purification steps, in various combinations, can be employed to provide a substantially homogeneous recombinant polypeptide. The polypeptide thus purified may be substantially free of other polypeptides and is defined herein as a "substantially purified polypeptide"; such purified polypeptides include polypeptides, fragments, variants, and the like. Expression, isolation, and purification of the polypeptides and fragments can be accomplished by any suitable technique, including but not limited to the methods described herein.

It is also possible to utilise an affinity column such as a monoclonal antibody generated against polypeptides, to affinity-purify expressed polypeptides. These polypeptides can be removed from an affinity column using conventional techniques, for example, in a high salt elution buffer and then dialyzed into a lower salt buffer for use or by changing pH or other components depending on the affinity matrix utilized, or be competitively removed using the naturally occurring substrate of the affinity moiety.

Isolated or substantially purified polynucleotides or protein compositions are disclosed. An "isolated" or "purified" polynucleotide or protein, or biologically active portion thereof, is substantially or essentially free from components that normally accompany or interact with the polynucleotide or protein as found in its naturally occurring environment. Thus, an isolated or purified polynucleotide or protein is substantially free of other cellular material, or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. Optimally, an "isolated" polynucleotide is free of sequences (optimally protein encoding sequences) that naturally flank the polynucleotide (for example, sequences located at the 5' and 3' ends of the polynucleotide) in the genomic DNA of the organism from which the polynucleotide is derived. For example, in various embodiments, the isolated polynucleotide can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotide sequence that naturally flanks the polynucleotide in genomic DNA of the cell from which the polynucleotide is derived. A protein that is substantially free of cellular material includes preparations of protein having less than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of contaminating protein.

A polypeptide may also be produced by known conventional chemical synthesis. Methods for constructing the polypeptides or fragments thereof by synthetic means are known to those skilled in the art. The synthetically-constructed polypeptide sequences, by virtue of sharing primary, secondary or tertiary structural or conformational characteristics with native polypeptides may possess biological properties in common therewith, including biological activity.

The term 'non-naturally occurring' as used herein describes an entity (for example, a polynucleotide, a genetic mutation, a polypeptide, a plant, a plant cell and plant material) that is not formed by nature or that does not exist in nature. Such non-naturally occurring entities or artificial entities may be made, synthesized, initiated, modified, intervened, or manipulated by methods described herein or that are known in the art. Such non-naturally occurring entities or artificial entities may be made, synthesized, initiated, modified, intervened, or manipulated by man. Thus, by way of example, a non-naturally occurring plant, a non-naturally occurring plant cell or non-naturally occurring plant material may be made using traditional plant breeding techniques - such as backcrossing - or by genetic manipulation technologies - such as antisense RNA, interfering RNA, meganuclease and the like. By way of further example, a non-naturally occurring plant, a non-naturally occurring plant cell or non-naturally occurring plant material may be made by introgression of or by transferring one or more genetic mutations (for example one or more polymorphisms) from a first plant or plant cell into a second plant or plant cell (which may itself be naturally occurring), such that the resulting plant, plant cell or plant material or the progeny thereof comprises a genetic constitution (for example, a genome, a chromosome or a segment thereof) that is not formed by nature or that does not exist in nature. The resulting plant, plant cell or plant material is thus artificial or non-naturally occurring. Accordingly, an artificial or non-naturally occurring plant or plant cell may be made by modifying a genetic sequence in a first naturally occurring plant or plant cell, even if the resulting genetic sequence occurs naturally in a second plant or plant cell that comprises a different genetic background from the first plant or plant cell.

In certain embodiments, a mutation is not a naturally occurring mutation that exists naturally in a nucleotide sequence or a polypeptide - such as a gene or a protein.

Differences in genetic background can be detected by phenotypic differences or by molecular biology techniques known in the art - such as nucleic acid sequencing or detection of the presence or absence of genetic markers (for example, microsatellite DNA markers).

Antibodies that are immunoreactive with the polypeptides described herein are also provided. The polypeptides, fragments, variants, fusion polypeptides, and the like, as set forth herein, can be employed as "immunogens" in producing antibodies immunoreactive therewith. Such antibodies may specifically bind to the polypeptide via the antigen-binding sites of the antibody. Specifically binding antibodies are those that will specifically recognize and bind with a polypeptide, homologues, and variants, but not with other molecules. In one embodiment, the antibodies are specific for polypeptides having an amino acid sequence as set forth herein and do not cross-react with other polypeptides. More specifically, the polypeptides, fragments, variants, fusion polypeptides, and the like contain antigenic determinants or epitopes that elicit the formation of antibodies. These antigenic determinants or epitopes can be either linear or conformational (discontinuous). Linear epitopes are composed of a single section of amino acids of the polypeptide, while conformational or discontinuous epitopes are composed of amino acid sections from different regions of the polypeptide chain that are brought into close proximity upon polypeptide folding. Epitopes can be identified by any of the methods known in the art. Additionally, epitopes from the polypeptides can be used as research reagents, in assays, and to purify specific binding antibodies from substances such as polyclonal sera or supernatants from cultured hybridomas. Such epitopes or variants thereof can be produced using techniques known in the art such as solid-phase synthesis, chemical or enzymatic cleavage of a polypeptide, or using recombinant DNA technology.

Both polyclonal and monoclonal antibodies to the polypeptides can be prepared by conventional techniques. Hybridoma cell lines that produce monoclonal antibodies specific for the polypeptides are also contemplated herein. Such hybridomas can be produced and identified by conventional techniques. For the production of antibodies, various host animals may be immunized by injection with a polypeptide, fragment, variant, or mutants thereof. Such host animals may include, but are not limited to, rabbits, mice, and rats, to name a few. Various adjuvants may be used to increase the immunological response. Depending on the host species, such adjuvants include, but are not limited to, Freund's (complete and incomplete), mineral gels such as aluminium hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol, and potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum. The monoclonal antibodies can be recovered by conventional techniques. Such monoclonal antibodies may be of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD, and any subclass thereof.

The antibodies can also be used in assays to detect the presence of the polypeptides or fragments, either in vitro or in vivo. The antibodies also can be employed in purifying polypeptides or fragments by immunoaffinity chromatography.

Other than mutagenesis, compositions that can modulate the expression or the activity of one or more of the proteases described herein include, but are not limited to, sequence-specific polynucleotides that can interfere with the transcription of one or more endogenous gene(s); sequence-specific polynucleotides that can interfere with the translation of RNA transcripts (for example, double-stranded RNAs, siRNAs, ribozymes); sequence-specific polypeptides that can interfere with the stability of one or more proteins; sequence-specific polynucleotides that can interfere with the enzymatic activity of one or more proteins or the binding activity of one or more proteins with respect to substrates or regulatory proteins; antibodies that exhibit specificity for one or more proteins; small molecule compounds that can interfere with the stability of one or more proteins or the enzymatic activity of one or more proteins or the binding activity of one or more proteins; zinc finger proteins that bind one or more polynucleotides; and meganucleases that have activity towards one or more polynucleotides. Gene editing technologies, genetic editing technologies and genome editing technologies are well known in the art.

One method of gene editing involves the use of transcription activator-like effector nucleases (TALENs) which induce double-strand breaks which cells can respond to with repair mechanisms. Non-homologous end joining reconnects DNA from either side of a double-strand break where there is very little or no sequence overlap for annealing. This repair mechanism induces errors in the genome via insertion or deletion, or chromosomal rearrangement. Any such errors may render the gene products coded at that location nonfunctional. Another method of gene editing involves the use of the bacterial CRISPR/Cas system. Bacteria and archaea exhibit chromosomal elements called clustered regularly interspaced short palindromic repeats (CRISPR) that are part of an adaptive immune system that protects against invading viral and plasmid DNA. In Type II CRISPR systems, CRISPR RNAs (crRNAs) function with trans-activating crRNA (tracrRNA) and CRISPR-associated (Cas) proteins to introduce double-stranded breaks in target DNA. Target cleavage by Cas9 requires base-pairing between the crRNA and tracrRNA as well as base pairing between the crRNA and the target DNA. Target recognition is facilitated by the presence of a short motif called a protospacer-adjacent motif (PAM) that conforms to the sequence NGG. This system can be harnessed for genome editing. Cas9 is normally programmed by a dual RNA consisting of the crRNA and tracrRNA. However, the core components of these RNAs can be combined into a single hybrid 'guide RNA' for Cas9 targeting. The use of a noncoding RNA guide to target DNA for site-specific cleavage promises to be significantly more straightforward than existing technologies - such as TALENs. Using the CRISPR/Cas strategy, retargeting the nuclease complex only requires introduction of a new RNA sequence and there is no need to reengineer the specificity of protein transcription factors.
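
The PAM requirement described above can be illustrated with a short Python sketch that scans both strands of a DNA sequence for candidate Cas9 target sites, that is, a protospacer immediately followed by an NGG motif. The 20-nucleotide protospacer length, the function names and the example sequence are assumptions for illustration only and are not taken from the Sequence Listing.

```python
import re

DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(DNA_COMPLEMENT)[::-1]


def find_cas9_sites(seq, protospacer_len=20):
    """List candidate sites as (strand, start, protospacer, pam) tuples.

    'start' is 0-based and counted along the strand being scanned, so for
    the '-' strand it refers to the reverse-complemented sequence.
    """
    seq = seq.upper()
    # The lookahead lets overlapping candidate sites be reported.
    pattern = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % protospacer_len)
    sites = []
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for match in pattern.finditer(strand_seq):
            sites.append((strand, match.start(), match.group(1), match.group(2)))
    return sites


# Hypothetical sequence used only to exercise the function.
example_sites = find_cas9_sites("A" * 20 + "TGG")
```

A real guide design workflow would also screen candidates for off-target matches elsewhere in the genome, which this sketch does not attempt.
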
Antisense technology is another well-known method that can be used to modulate the expression of a polypeptide. A polynucleotide of the gene to be repressed is cloned and operably linked to a regulatory region and a transcription termination sequence so that the antisense strand of RNA is transcribed. The recombinant construct is then transformed into a plant cell and the antisense strand of RNA is produced. The polynucleotide need not be the entire sequence of the gene to be repressed, but typically will be substantially complementary to at least a portion of the sense strand of the gene to be repressed.

A polynucleotide may be transcribed into a ribozyme, or catalytic RNA, that affects expression of an mRNA. Ribozymes can be designed to specifically pair with virtually any target RNA and cleave the phosphodiester backbone at a specific location, thereby functionally inactivating the target RNA. Heterologous polynucleotides can encode ribozymes designed to cleave particular mRNA transcripts, thus preventing expression of a polypeptide. Hammerhead ribozymes are useful for destroying particular mRNAs, although various ribozymes that cleave mRNA at site-specific recognition sequences can be used. Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form complementary base pairs with the target mRNA. The sole requirement is that the target RNA contains a 5'-UG-3' nucleotide sequence. The construction and production of hammerhead ribozymes is known in the art. Hammerhead ribozyme sequences can be embedded in a stable RNA such as a transfer RNA (tRNA) to increase cleavage efficiency in vivo.

In one embodiment, the sequence-specific polynucleotide that can interfere with the translation of RNA transcript(s) is interfering RNA. RNA interference or RNA silencing is an evolutionarily conserved process by which specific mRNAs can be targeted for enzymatic degradation. A double-stranded RNA (dsRNA) is introduced or produced by a cell (for example, double-stranded RNA virus, or interfering RNA polynucleotides) to initiate the interfering RNA pathway. The double-stranded RNA can be converted into multiple small interfering RNA duplexes of 21-24 bp length by RNases III, which are double-stranded RNA-specific endonucleases. The small interfering RNAs can be subsequently recognized by RNA-induced silencing complexes that promote the unwinding of small interfering RNA through an ATP-dependent process. The unwound antisense strand of the small interfering RNA guides the activated RNA-induced silencing complexes to the targeted mRNA comprising a sequence complementary to the small interfering RNA anti-sense strand. The targeted mRNA and the anti-sense strand can form an A-form helix, and the major groove of the A-form helix can be recognized by the activated RNA-induced silencing complexes. The target mRNA can be cleaved by activated RNA-induced silencing complexes at a single site defined by the binding site of the 5'-end of the small interfering RNA strand. The activated RNA-induced silencing complexes can be recycled to catalyze another cleavage event. Interfering RNA expression vectors may comprise interfering RNA constructs encoding interfering RNA polynucleotides that exhibit RNA interference activity by reducing the expression level of mRNAs, pre-mRNAs, or related RNA variants. The expression vectors may comprise a promoter positioned upstream and operably-linked to an interfering RNA construct, as further described herein.
Interfering RNA expression vectors may comprise a suitable minimal core promoter, an interfering RNA construct of interest, an upstream (5') regulatory region, a downstream (3') regulatory region, including transcription termination and polyadenylation signals, and other sequences known to persons skilled in the art, such as various selection markers. Various embodiments are directed to methods for modulating the expression level of one or more of the polynucleotide(s) described herein (or any combination thereof as described herein) by integrating multiple copies of the polynucleotide(s) into a (tobacco) plant genome, comprising: transforming a plant cell host with an expression vector that comprises a promoter operably-linked to a polynucleotide.
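
To make the relationship between a target mRNA and the antisense strand of a small interfering RNA concrete, the following Python sketch enumerates every window of a chosen length (21 to 24 nucleotides, as stated above) along an mRNA and derives the complementary guide strand for each window. The function names and the example sequence are hypothetical, and no attempt is made to rank candidates by silencing efficiency.

```python
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def guide_strand(target_site):
    """Antisense (guide) strand, written 5'->3', for an mRNA target site."""
    return target_site.translate(RNA_COMPLEMENT)[::-1]


def sirna_candidates(mrna, length=21):
    """Return (start, target_site, guide) for each window along the mRNA.

    'start' is a 0-based offset into the mRNA. DNA input is accepted and
    converted to RNA by replacing T with U.
    """
    if not 21 <= length <= 24:
        raise ValueError("siRNA length is expected to be 21-24 nucleotides")
    mrna = mrna.upper().replace("T", "U")
    candidates = []
    for start in range(len(mrna) - length + 1):
        site = mrna[start:start + length]
        candidates.append((start, site, guide_strand(site)))
    return candidates


# Hypothetical 23-nucleotide transcript fragment, for illustration only.
example = sirna_candidates("AUGGCUUCAGGAUCCAAGCUUGG")
```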

Various compositions and methods are provided for modulating the endogenous gene expression level by modulating the translation of mRNA. A host (tobacco) plant cell can be transformed with an expression vector comprising: a promoter operably-linked to a polynucleotide, positioned in anti-sense orientation with respect to the promoter to enable the expression of RNA polynucleotides having a sequence complementary to a portion of mRNA.
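
The effect of placing a polynucleotide in anti-sense orientation with respect to the promoter can be sketched in Python as follows. The example treats the cloned fragment as the strand read 5' to 3' downstream of the promoter, transcribes it, and checks that the resulting RNA is complementary to the corresponding portion of the mRNA. All names and the nine-nucleotide fragment are illustrative assumptions.

```python
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")
RNA_PAIRS = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense_insert(sense_fragment):
    """Fragment as read 5'->3' after the promoter when cloned in antisense orientation."""
    return sense_fragment.upper().translate(DNA_COMPLEMENT)[::-1]


def transcribe(coding_strand_dna):
    """RNA with the same sequence as the given DNA strand (T replaced by U)."""
    return coding_strand_dna.upper().replace("T", "U")


def is_complementary(rna_a, rna_b):
    """True if the two RNAs can pair antiparallel using Watson-Crick pairs only."""
    if len(rna_a) != len(rna_b):
        return False
    return all(RNA_PAIRS[a] == b for a, b in zip(rna_a, reversed(rna_b)))


# Hypothetical fragment of a gene's sense strand.
sense = "ATGGCTTCA"
mrna_portion = transcribe(sense)
antisense_rna = transcribe(antisense_insert(sense))
pairs_with_mrna = is_complementary(antisense_rna, mrna_portion)
```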

Various expression vectors for modulating the translation of mRNA may comprise: a promoter operably-linked to a polynucleotide in which the sequence is positioned in anti-sense orientation with respect to the promoter. The lengths of anti-sense RNA polynucleotides can vary, and may be from about 15-20 nucleotides, about 20-30 nucleotides, about 30-50 nucleotides, about 50-75 nucleotides, about 75-100 nucleotides, about 100-150 nucleotides, about 150-200 nucleotides, and about 200-300 nucleotides.

As discussed herein, the expression of one or more polypeptides can be modulated by non-transgenic means - such as creating one or more mutations in one or more genes, as discussed herein. Methods that introduce a mutation randomly in a gene sequence can include chemical mutagenesis, EMS mutagenesis and radiation mutagenesis. Methods that introduce one or more targeted mutations into a cell include but are not limited to genome editing technology, particularly zinc finger nuclease-mediated mutagenesis and targeting induced local lesions in genomes (TILLING), homologous recombination, oligonucleotide-directed mutagenesis, and meganuclease-mediated mutagenesis. In one embodiment, TILLING is used. This is a mutagenesis technology that can be used to generate and/or identify polynucleotides encoding polypeptides with modified expression and/or activity. TILLING also allows selection of plants carrying such mutants. TILLING combines high-density mutagenesis with high-throughput screening methods. Methods for TILLING are well known in the art (see McCallum et al., (2000) Nat Biotechnol 18: 455-457 and Stemple (2004) Nat Rev Genet 5(2): 145-50).

Specific mutations in polynucleotides can be created that can result in modulated gene expression, modulated stability of mRNA, or modulated stability of protein. Such plants are referred to herein as "non-naturally occurring" or "mutant" plants. Typically, the mutant or non-naturally occurring plants will include at least a portion of foreign or synthetic or man-made nucleic acid (for example, DNA or RNA) that was not present in the plant before it was manipulated. The foreign nucleic acid may be a single nucleotide, two or more nucleotides, two or more contiguous nucleotides or two or more non-contiguous nucleotides - such as at least 10, 20, 30, 40, 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400 or 1500 or more contiguous or non-contiguous nucleotides.

The mutant or non-naturally occurring plants or plant cells can have any combination of one or more mutations in one or more genes which results in modulated protein levels. For example, the mutant or non-naturally occurring plants or plant cells may have a single mutation in a single gene; multiple mutations in a single gene; a single mutation in two or more or three or more or four or more genes; or multiple mutations in two or more or three or more or four or more genes. Examples of such mutations are described herein. By way of further example, the mutant or non-naturally occurring plants or plant cells may have one or more mutations in a specific portion of the gene(s) - such as in a region of the gene that encodes an active site of the protein or a portion thereof. By way of further example, the mutant or non-naturally occurring plants or plant cells may have one or more mutations in a region outside of one or more gene(s) - such as in a region upstream or downstream of the gene it regulates - provided that they modulate the activity or expression of the gene(s). Upstream elements can include promoters, enhancers or transcription factors. Some elements - such as enhancers - can be positioned upstream or downstream of the gene they regulate. An element need not be located near to the gene that it regulates since some elements have been found located several hundred thousand base pairs upstream or downstream of the gene that they regulate.
The mutant or non-naturally occurring plants or plant cells may have one or more mutations located within the first 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400 or 1500 nucleotides of the gene(s). The mutant or non-naturally occurring plants or plant cells may have one or more mutations located within the first, second, third, fourth, fifth, sixth, seventh, eighth, ninth, tenth, eleventh, twelfth, thirteenth, fourteenth or fifteenth set of 100 nucleotides of the gene(s) or combinations thereof. Mutant or non-naturally occurring plants or plant cells (for example, mutant, non-naturally occurring or transgenic plants or plant cells and the like, as described herein) comprising the mutant polypeptide variants are disclosed.
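
The "set of 100 nucleotides" wording amounts to simple integer arithmetic on the position of a mutation within the gene. A minimal Python sketch, assuming 1-based positions counted from the first nucleotide of the gene and a hypothetical function name, is:

```python
def hundred_nucleotide_set(position, set_size=100, max_sets=15):
    """Return which set of 'set_size' nucleotides a 1-based position falls in.

    Positions 1-100 map to set 1, positions 101-200 to set 2, and so on.
    None is returned for positions beyond the last set considered.
    """
    if position < 1:
        raise ValueError("position must be 1 or greater")
    index = (position - 1) // set_size + 1
    return index if index <= max_sets else None
```

For example, a mutation at position 101 falls in the second set of 100 nucleotides, while a mutation at position 1500 falls in the fifteenth.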

In one embodiment, seeds from plants are mutagenised and then grown into first generation mutant plants. The first generation plants are then allowed to self-pollinate and seeds from the first generation plant are grown into second generation plants, which are then screened for mutations in their loci. Though the mutagenized plant material can be screened for mutations, an advantage of screening the second generation plants is that all somatic mutations correspond to germline mutations. One of skill in the art would understand that a variety of plant materials, including but not limited to, seeds, pollen, plant tissue or plant cells, may be mutagenised in order to create the mutant plants. However, the type of plant material mutagenised may affect when the plant nucleic acid is screened for mutations. For example, when pollen is subjected to mutagenesis prior to pollination of a non-mutagenized plant the seeds resulting from that pollination are grown into first generation plants. Every cell of the first generation plants will contain mutations created in the pollen; thus these first generation plants may then be screened for mutations instead of waiting until the second generation.

Mutagens that create primarily point mutations and short deletions, insertions, transversions, and/or transitions, including chemical mutagens or radiation, may be used to create the mutations. Mutagens include, but are not limited to, ethyl methanesulfonate, methyl methanesulfonate, N-ethyl-N-nitrosourea, triethylmelamine, N-methyl-N-nitrosourea, procarbazine, chlorambucil, cyclophosphamide, diethyl sulfate, acrylamide monomer, melphalan, nitrogen mustard, vincristine, dimethylnitrosamine, N-methyl-N'-nitro-N-nitrosoguanidine, nitrosoguanidine, 2-aminopurine, 7,12-dimethylbenz(a)anthracene, ethylene oxide, hexamethylphosphoramide, busulfan, diepoxyalkanes (diepoxyoctane, diepoxybutane, and the like), 2-methoxy-6-chloro-9[3-(ethyl-2-chloro-ethyl)aminopropylamino]acridine dihydrochloride and formaldehyde.

Spontaneous mutations in the locus that may not have been directly caused by the mutagen are also contemplated provided that they result in the desired phenotype. Suitable mutagenic agents can also include, for example, ionising radiation - such as X-rays, gamma rays, fast neutron irradiation and UV radiation. Any method of plant nucleic acid preparation known to those of skill in the art may be used to prepare the plant nucleic acid for mutation screening.

Prepared nucleic acid from individual plants, plant cells, or plant material can optionally be pooled in order to expedite screening for mutations in the population of plants originating from the mutagenized plant tissue, cells or material. One or more subsequent generations of plants, plant cells or plant material can be screened. The size of the optionally pooled group is dependent upon the sensitivity of the screening method used.
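As a rough illustration of why the pool size depends on the sensitivity of the screening method, the following sketch assumes that each plant contributes equal amounts of nucleic acid, that each plant carries two alleles at the locus, and that a pool contains a single heterozygous mutant plant, so that the mutant allele makes up 1/(2n) of a pool of n plants. The function names and the detection limit used in the example are hypothetical and are not taken from this disclosure.

```python
import math


def mutant_allele_fraction(pool_size: int) -> float:
    """Fraction of mutant alleles in a pool holding one heterozygous mutant.

    Assumes equal template contribution per plant and two alleles per plant.
    """
    if pool_size < 1:
        raise ValueError("pool_size must be at least 1")
    return 1.0 / (2 * pool_size)


def max_pool_size(detection_limit: float) -> int:
    """Largest pool in which a single heterozygous mutant stays detectable.

    detection_limit is the smallest mutant allele fraction the (hypothetical)
    screening method can reliably detect, between 0 (exclusive) and 0.5.
    """
    if not 0 < detection_limit <= 0.5:
        raise ValueError("detection_limit must be in the interval (0, 0.5]")
    return math.floor(1.0 / (2 * detection_limit))


def make_pools(sample_ids, pool_size):
    """Split a list of sample identifiers into consecutive pools."""
    if pool_size < 1:
        raise ValueError("pool_size must be at least 1")
    return [sample_ids[i:i + pool_size]
            for i in range(0, len(sample_ids), pool_size)]
```

Under these assumptions, a method able to detect a mutant allele fraction of 0.0625 would tolerate pools of up to eight plants; a less sensitive method would require smaller pools.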

After the nucleic acid samples are optionally pooled, they can be subjected to polynucleotide-specific amplification techniques, such as Polymerase Chain Reaction. Any one or more primers or probes specific to the gene or the sequences immediately adjacent to the gene may be utilized to amplify the sequences within the optionally pooled nucleic acid sample. Suitably, the one or more primers or probes are designed to amplify the regions of the locus where useful mutations are most likely to arise. Most preferably, the primer is designed to detect mutations within regions of the polynucleotide. Additionally, it is preferable for the primer(s) and probe(s) to avoid known polymorphic sites in order to ease screening for point mutations. To facilitate detection of amplification products, the one or more primers or probes may be labelled using any conventional labelling method. Primer(s) or probe(s) can be designed based upon the sequences described herein using methods that are well understood in the art.

Polymorphisms may be identified by means known in the art and some have been described in the literature.

In a further aspect there is provided a method of preparing a mutant plant. The method involves providing at least one cell of a plant comprising a gene encoding a functional polynucleotide described herein (or any combination thereof as described herein). Next, the at least one cell of the plant is treated under conditions effective to modulate the activity of the polynucleotide(s) described herein. The at least one mutant plant cell is then propagated into a mutant plant, where the mutant plant has a modulated level of polypeptide(s) described (or any combination thereof as described herein) as compared to that of a control plant. In one embodiment of this method of making a mutant plant, the treating step involves subjecting the at least one cell to a chemical mutagenising agent as described above and under conditions effective to yield at least one mutant plant cell. In another embodiment of this method, the treating step involves subjecting the at least one cell to a radiation source under conditions effective to yield at least one mutant plant cell. The term "mutant plant" includes mutant plants in which the genotype is modified as compared to a control plant, suitably by means other than genetic engineering or genetic modification.

In certain embodiments, the mutant plant, mutant plant cell or mutant plant material may comprise one or more mutations that have occurred naturally in another plant, plant cell or plant material and confer a desired trait. This mutation can be incorporated (for example, introgressed) into another plant, plant cell or plant material (for example, a plant, plant cell or plant material with a different genetic background to the plant from which the mutation was derived) to confer the trait thereto. Thus by way of example, a mutation that occurred naturally in a first plant may be introduced into a second plant - such as a second plant with a different genetic background to the first plant. The skilled person is therefore able to search for and identify a plant carrying naturally in its genome one or more mutant alleles of the genes described herein which confer a desired trait. The mutant allele(s) that occurs naturally can be transferred to the second plant by various methods including breeding, backcrossing and introgression to produce lines, varieties or hybrids that have one or more mutations in the genes described herein. Plants showing a desired trait may be screened out of a pool of mutant plants. Suitably, the selection is carried out utilising the knowledge of the nucleotide sequences as described herein. Consequently, it is possible to screen for a genetic trait as compared to a control. Such a screening approach may involve the application of conventional nucleic acid amplification and/or hybridization techniques as discussed herein. Thus, a further aspect of the present invention relates to a method for identifying a mutant plant comprising the steps of: (a) providing a sample comprising nucleic acid from a plant; and (b) determining the nucleic acid sequence of the polynucleotide, wherein a difference in the sequence of the polynucleotide as compared to the polynucleotide sequence of a control plant is indicative that said plant is a mutant plant.
In another aspect there is provided a method for identifying a mutant plant which accumulates increased or reduced levels of protease as compared to a control plant comprising the steps of: (a) providing a sample from a plant to be screened; (b) determining if said sample comprises one or more mutations in one or more of the polynucleotides described herein; and (c) determining at least the protease content of said plant during or after a curing procedure.

In another aspect there is provided a method for preparing a mutant plant which has increased or reduced levels of protease as compared to a control plant comprising the steps of: (a) providing a sample from a first plant; (b) determining if said sample comprises one or more mutations in one or more of the polynucleotides described herein that result in modulated levels of a protease; and (c) transferring the one or more mutations into a second plant. Suitably at least the protease content is determined in cured leaf material. The mutation(s) can be transferred into the second plant using various methods that are known in the art - such as by genetic engineering, genetic manipulation, introgression, plant breeding, backcrossing and the like. In one embodiment, the first plant is a naturally occurring plant. In one embodiment, the second plant has a different genetic background to the first plant. In another aspect there is provided a method for preparing a mutant plant which has increased or reduced levels of a protease as compared to a control plant comprising the steps of: (a) providing a sample from a first plant; (b) determining if said sample comprises one or more mutations in one or more of the polynucleotides described herein that result in modulated levels of the protease; and (c) introgressing the one or more mutations from the first plant into a second plant. Suitably at least the protease content is determined in cured leaf material. In one embodiment, the step of introgressing comprises plant breeding, optionally including backcrossing and the like. In one embodiment, the first plant is a naturally occurring plant. In one embodiment, the second plant has a different genetic background to the first plant. In one embodiment, the first plant is not a cultivar or an elite cultivar. In one embodiment, the second plant is a cultivar or an elite cultivar.
A further aspect relates to a mutant plant (including a cultivar or elite cultivar mutant plant) obtained or obtainable by the methods described herein. In certain embodiments, the "mutant plants" may have one or more mutations localised only to a specific region of the plant - such as within the sequence of the one or more polynucleotide(s) described herein. According to this embodiment, the remaining genomic sequence of the mutant plant will be the same or substantially the same as the plant prior to the mutagenesis.

In certain embodiments, the mutant plants may have one or more mutations localised in more than one region of the plant - such as within the sequence of one or more of the polynucleotides described herein and in one or more further regions of the genome. According to this embodiment, the remaining genomic sequence of the mutant plant will not be the same or will not be substantially the same as the plant prior to the mutagenesis. In certain embodiments, the mutant plants may not have one or more mutations in one or more, two or more, three or more, four or more or five or more exons of the polynucleotide(s) described herein; or may not have one or more mutations in one or more, two or more, three or more, four or more or five or more introns of the polynucleotide(s) described herein; or may not have one or more mutations in a promoter of the polynucleotide(s) described herein; or may not have one or more mutations in the 3' untranslated region of the polynucleotide(s) described herein; or may not have one or more mutations in the 5' untranslated region of the polynucleotide(s) described herein; or may not have one or more mutations in the coding region of the polynucleotide(s) described herein; or may not have one or more mutations in the non-coding region of the polynucleotide(s) described herein; or any combination of two or more, three or more, four or more, five or more, or six or more parts thereof.

In a further aspect there is provided a method of identifying a plant, a plant cell or plant material comprising a mutation in a gene encoding a polynucleotide described herein comprising: (a) subjecting a plant, a plant cell or plant material to mutagenesis; (b) obtaining a nucleic acid sample from said plant, plant cell or plant material or descendants thereof; and (c) determining the nucleic acid sequence of the gene encoding a polynucleotide described herein or a variant or a fragment thereof, wherein a difference in said sequence is indicative of one or more mutations therein.
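By way of illustration only, the sequence comparison of step (c) can be sketched as a position-by-position comparison of a sample sequence against a control sequence. The sketch assumes that the two sequences have already been aligned and are of equal length, so it reports substitutions only; insertions and deletions would require a proper alignment step. The function names and the short example sequences are hypothetical.

```python
def find_differences(control: str, sample: str):
    """List substitutions between two pre-aligned sequences of equal length.

    Returns tuples of (1-based position, control base, sample base).
    """
    if len(control) != len(sample):
        raise ValueError("sequences must be pre-aligned and of equal length")
    control, sample = control.upper(), sample.upper()
    return [(index + 1, c_base, s_base)
            for index, (c_base, s_base) in enumerate(zip(control, sample))
            if c_base != s_base]


def is_mutant(control: str, sample: str) -> bool:
    """True if the sample differs from the control at any aligned position."""
    return bool(find_differences(control, sample))
```

For example, `find_differences("ATGGCC", "ATGACC")` reports a single G to A substitution at position 4, which under the method above would indicate a mutation in the sampled plant.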

Zinc finger proteins can be used to modulate the expression or the activity of one or more of the polynucleotides described herein. In various embodiments, a genomic DNA sequence comprising a part of or all of the coding sequence of the polynucleotide is modified by zinc finger nuclease-mediated mutagenesis. The genomic DNA sequence is searched for a unique site for zinc finger protein binding. Alternatively, the genomic DNA sequence is searched for two unique sites for zinc finger protein binding wherein both sites are on opposite strands and close together, for example, 1, 2, 3, 4, 5, 6 or more basepairs apart. Accordingly, zinc finger proteins that bind to polynucleotides are provided.
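The search for two binding sites on opposite strands that lie close together can be sketched as follows. This is a minimal illustration that assumes exact string matching of two known recognition sequences, each written 5' to 3' on the strand it binds; the function names, the default gap range of 1 to 6 basepairs and the example sequences are hypothetical. The sketch does not check that each site is unique in the genome, which would be an additional filter on the number of hits.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence made up of A, C, G and T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_all(seq: str, motif: str):
    """0-based start positions of every (possibly overlapping) motif hit."""
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


def paired_sites(genome: str, left_site: str, right_site: str,
                 min_gap: int = 1, max_gap: int = 6):
    """Find left_site on the top strand followed by right_site on the bottom
    strand, separated by min_gap to max_gap basepairs.

    Returns tuples of (left start, right start, gap) in top-strand coordinates.
    """
    genome = genome.upper()
    left_site = left_site.upper()
    left_hits = find_all(genome, left_site)
    # A site bound on the bottom strand appears on the top strand as its
    # reverse complement.
    right_hits = find_all(genome, reverse_complement(right_site))
    pairs = []
    for left_start in left_hits:
        left_end = left_start + len(left_site)
        for right_start in right_hits:
            gap = right_start - left_end
            if min_gap <= gap <= max_gap:
                pairs.append((left_start, right_start, gap))
    return pairs
```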

A zinc finger protein may be engineered to recognize a selected target site in a gene. A zinc finger protein can comprise any combination of motifs derived from natural zinc finger DNA-binding domains and non-natural zinc finger DNA-binding domains by truncation or expansion or a process of site-directed mutagenesis coupled to a selection method such as, but not limited to, phage display selection, bacterial two-hybrid selection or bacterial one-hybrid selection. The term "non-natural zinc finger DNA-binding domain" refers to a zinc finger DNA-binding domain that binds a three-base pair sequence within the target nucleic acid and that does not occur in the cell or organism comprising the nucleic acid which is to be modified. Methods for the design of zinc finger proteins which bind specific nucleotide sequences which are unique to a target gene are known in the art.

In other embodiments, a zinc finger protein may be selected to bind to a regulatory sequence of a polynucleotide. More specifically, the regulatory sequence may comprise a transcription initiation site, a start codon, a region of an exon, a boundary of an exon-intron, a terminator, or a stop codon. Accordingly, the invention provides a mutant, non-naturally occurring or transgenic plant or plant cells, produced by zinc finger nuclease-mediated mutagenesis in the vicinity of or within one or more polynucleotides described herein, and methods for making such a plant or plant cell by zinc finger nuclease-mediated mutagenesis. Methods for delivering zinc finger protein and zinc finger nuclease to a tobacco plant are similar to those described below for delivery of meganuclease.

Plants suitable for use in genetic modification include, but are not limited to, monocotyledonous and dicotyledonous plants and plant cell systems, including species from one of the following families: Acanthaceae, Alliaceae, Alstroemeriaceae, Amaryllidaceae, Apocynaceae, Arecaceae, Asteraceae, Berberidaceae, Bixaceae, Brassicaceae, Bromeliaceae, Cannabaceae, Caryophyllaceae, Cephalotaxaceae, Chenopodiaceae, Colchicaceae, Cucurbitaceae, Dioscoreaceae, Ephedraceae, Erythroxylaceae, Euphorbiaceae, Fabaceae, Lamiaceae, Linaceae, Lycopodiaceae, Malvaceae, Melanthiaceae, Musaceae, Myrtaceae, Nyssaceae, Papaveraceae, Pinaceae, Plantaginaceae, Poaceae, Rosaceae, Rubiaceae, Salicaceae, Sapindaceae, Solanaceae, Taxaceae, Theaceae, or Vitaceae.

Suitable species may include members of the genera Abelmoschus, Abies, Acer, Agrostis, Allium, Alstroemeria, Ananas, Andrographis, Andropogon, Artemisia, Arundo, Atropa, Berberis, Beta, Bixa, Brassica, Calendula, Camellia, Camptotheca, Cannabis, Capsicum, Carthamus, Catharanthus, Cephalotaxus, Chrysanthemum, Cinchona, Citrullus, Coffea, Colchicum, Coleus, Cucumis, Cucurbita, Cynodon, Datura, Dianthus, Digitalis, Dioscorea, Elaeis, Ephedra, Erianthus, Erythroxylum, Eucalyptus, Festuca, Fragaria, Galanthus, Glycine, Gossypium, Helianthus, Hevea, Hordeum, Hyoscyamus, Jatropha, Lactuca, Linum, Lolium, Lupinus, Lycopersicon, Lycopodium, Manihot, Medicago, Mentha, Miscanthus, Musa, Nicotiana, Oryza, Panicum, Papaver, Parthenium, Pennisetum, Petunia, Phalaris, Phleum, Pinus, Poa, Poinsettia, Populus, Rauwolfia, Ricinus, Rosa, Saccharum, Salix, Sanguinaria, Scopolia, Secale, Solanum, Sorghum, Spartina, Spinacea, Tanacetum, Taxus, Theobroma, Triticosecale, Triticum, Uniola, Veratrum, Vinca, Vitis, and Zea.

Suitable species may include Panicum spp., Sorghum spp., Miscanthus spp., Saccharum spp., Erianthus spp., Populus spp., Andropogon gerardii (big bluestem), Pennisetum purpureum (elephant grass), Phalaris arundinacea (reed canarygrass), Cynodon dactylon (bermudagrass), Festuca arundinacea (tall fescue), Spartina pectinata (prairie cord-grass), Medicago sativa (alfalfa), Arundo donax (giant reed), Secale cereale (rye), Salix spp. (willow), Eucalyptus spp. (eucalyptus), Triticosecale (triticum - wheat X rye), bamboo, Helianthus annuus (sunflower), Carthamus tinctorius (safflower), Jatropha curcas (jatropha), Ricinus communis (castor), Elaeis guineensis (palm), Linum usitatissimum (flax), Brassica juncea, Beta vulgaris (sugarbeet), Manihot esculenta (cassava), Lycopersicon esculentum (tomato), Lactuca sativa (lettuce), Musa paradisiaca (banana), Solanum tuberosum (potato), Brassica oleracea (broccoli, cauliflower, Brussels sprouts), Camellia sinensis (tea), Fragaria ananassa (strawberry), Theobroma cacao (cocoa), Coffea arabica (coffee), Vitis vinifera (grape), Ananas comosus (pineapple), Capsicum annuum (hot & sweet pepper), Allium cepa (onion), Cucumis melo (melon), Cucumis sativus (cucumber), Cucurbita maxima (squash), Cucurbita moschata (squash), Spinacea oleracea (spinach), Citrullus lanatus (watermelon), Abelmoschus esculentus (okra), Solanum melongena (eggplant), Rosa spp. (rose), Dianthus caryophyllus (carnation), Petunia spp. (petunia), Poinsettia pulcherrima (poinsettia), Lupinus albus (lupin), Uniola paniculata (oats), bentgrass (Agrostis spp.), Populus tremuloides (aspen), Pinus spp. (pine), Abies spp. (fir), Acer spp. (maple), Hordeum vulgare (barley), Poa pratensis (bluegrass), Lolium spp. (ryegrass) and Phleum pratense (timothy), Panicum virgatum (switchgrass), Sorghum bicolor (sorghum, sudangrass), Miscanthus giganteus (miscanthus), Saccharum sp. (energycane), Populus balsamifera (poplar), Zea mays (corn), Glycine max (soybean), Brassica napus (canola), Triticum aestivum (wheat), Gossypium hirsutum (cotton), Oryza sativa (rice), Helianthus annuus (sunflower), Medicago sativa (alfalfa), Beta vulgaris (sugarbeet), or Pennisetum glaucum (pearl millet).

Various embodiments are directed to mutant tobacco, non-naturally occurring tobacco or transgenic tobacco plants or plant cells modified to modulate gene expression levels thereby producing a plant or plant cell - such as a tobacco plant or plant cell - in which the expression level of a polypeptide is modulated within tissues of interest as compared to a control. The disclosed compositions and methods can be applied to any species of the genus Nicotiana, including N. rustica and N. tabacum (for example, LA B21, LN KY171, TI 1406, Basma, Galpao, Perique, Beinhart 1000-1, and Petico). Other species include N. acaulis, N. acuminata, N. africana, N. alata, N. ameghinoi, N. amplexicaulis, N. arentsii, N. attenuata, N. azambujae, N. benavidesii, N. benthamiana, N. bigelovii, N. bonariensis, N. cavicola, N. clevelandii, N. cordifolia, N. corymbosa, N. debneyi, N. excelsior, N. forgetiana, N. fragrans, N. glauca, N. glutinosa, N. goodspeedii, N. gossei, N. hybrid, N. ingulba, N. kawakamii, N. knightiana, N. langsdorffii, N. linearis, N. longiflora, N. maritima, N. megalosiphon, N. miersii, N. noctiflora, N. nudicaulis, N. obtusifolia, N. occidentalis, N. occidentalis subsp. hesperis, N. otophora, N. paniculata, N. pauciflora, N. petunioides, N. plumbaginifolia, N. quadrivalvis, N. raimondii, N. repanda, N. rosulata, N. rosulata subsp. ingulba, N. rotundifolia, N. setchellii, N. simulans, N. solanifolia, N. spegazzinii, N. stocktonii, N. suaveolens, N. sylvestris, N. thyrsiflora, N. tomentosa, N. tomentosiformis, N. trigonophylla, N. umbratica, N. undulata, N. velutina, N. wigandioides, and N. x sanderae. The use of tobacco cultivars and elite tobacco cultivars is also contemplated herein. The transgenic, non-naturally occurring or mutant plant may therefore be a tobacco variety or elite tobacco cultivar that comprises one or more transgenes, or one or more genetic mutations or a combination thereof.
The genetic mutation(s) (for example, one or more polymorphisms) can be mutations that do not exist naturally in the individual tobacco variety or tobacco cultivar (for example, elite tobacco cultivar) or can be genetic mutation(s) that do occur naturally provided that the mutation does not occur naturally in the individual tobacco variety or tobacco cultivar (for example, elite tobacco cultivar).

Particularly useful Nicotiana tabacum varieties include Burley type, dark type, flue-cured type, and Oriental type tobaccos. Non-limiting examples of varieties or cultivars are: BD 64, CC 101, CC 200, CC 27, CC 301, CC 400, CC 500, CC 600, CC 700, CC 800, CC 900, Coker 176, Coker 319, Coker 371 Gold, Coker 48, CD 263, DF911, DT 538 LC, Galpao tobacco, GL 26H, GL 350, GL 600, GL 737, GL 939, GL 973, HB 04P, HB 04P LC, HB3307PLC, Hybrid 403LC, Hybrid 404LC, Hybrid 501 LC, K 149, K 326, K 346, K 358, K394, K 399, K 730, KDH 959, KT 200, KT204LC, KY10, KY14, KY 160, KY 17, KY 171, KY 907, KY907LC, KY14xL8 LC, Little Crittenden, McNair 373, McNair 944, msKY 14xL8, Narrow Leaf Madole, Narrow Leaf Madole LC, NBH 98, N-126, N-777LC, N-7371 LC, NC 100, NC 102, NC 2000, NC 291, NC 297, NC 299, NC 3, NC 4, NC 5, NC 6, NC7, NC 606, NC 71, NC 72, NC 810, NC BH 129, NC 2002, Neal Smith Madole, OXFORD 207, PD 7302 LC, PD 7309 LC, PD 7312 LC, 'Perique' tobacco, PVH03, PVH09, PVH19, PVH50, PVH51, R 610, R 630, R 7-11, R 7-12, RG 17, RG 81, RG H51, RGH 4, RGH 51, RS 1410, Speight 168, Speight 172, Speight 179, Speight 210, Speight 220, Speight 225, Speight 227, Speight 234, Speight G-28, Speight G-70, Speight H-6, Speight H20, Speight NF3, TI 1406, TI 1269, TN 86, TN86LC, TN 90, TN 97, TN97LC, TN D94, TN D950, TR (Tom Rosson) Madole, VA 309, VA359, AA 37-1, B13P, Xanthi (Mitchell-Mor), Bel-W3, 79-615, Samsun Holmes NN, KTRDC number 2 Hybrid 49, Burley 21, KY8959, KY9, MD 609, PG01, PG04, P01, P02, P03, RG11, RG 8, VA509, AS44, Banket A1, Basma Drama B84/31, Basma I Zichna ZP4/B, Basma Xanthi BX 2A, Batek, Besuki Jember, C104, Coker 347, Criollo Misionero, Delcrest, Djebel 81, DVH 405, Galpao Comum, HB04P, Hicks Broadleaf, Kabakulak Elassona, Kutsage E1, LA BU 21, NC 2326, NC 297, PVH 2110, Red Russian, Samsun, Saplak, Simmaba, Talgar 28, Wislica, Yayaldag, Prilep HC-72, Prilep P23, Prilep PB 156/1, Prilep P12-2/1, Yaka JK-48, Yaka JB 125/3, TI-1068, KDH-960, TI-1070, TW136, Basma, TKF 4028, L8, TKF 2002, GR141, Basma xanthi, GR149, GR153, Petit Havana. Low converter subvarieties of the above, even if not specifically identified herein, are also contemplated.

Embodiments are also directed to compositions and methods for producing mutant plants, non-naturally occurring plants, hybrid plants, or transgenic plants that have been modified to modulate the expression or activity of a polynucleotide(s) described herein (or any combination thereof as described herein). Advantageously, the mutant plants, non-naturally occurring plants, hybrid plants, or transgenic plants that are obtained may be similar or substantially the same in overall appearance to control plants. Various phenotypic characteristics such as degree of maturity, number of leaves per plant, stalk height, leaf insertion angle, leaf size (width and length), internode distance, and lamina-midrib ratio can be assessed by field observations.

One aspect relates to a seed of a mutant plant, a non-naturally occurring plant, a hybrid plant or a transgenic plant described herein. Preferably, the seed is a tobacco seed. A further aspect relates to pollen or an ovule of a mutant plant, a non-naturally occurring plant, a hybrid plant or a transgenic plant that is described herein. In addition, there is provided a mutant plant, a non-naturally occurring plant, a hybrid plant or a transgenic plant as described herein which further comprises a nucleic acid conferring male sterility.

Also provided is a tissue culture of regenerable cells of the mutant plant, non-naturally occurring plant, hybrid plant, or transgenic plant or a part thereof as described herein, which culture regenerates plants capable of expressing all the morphological and physiological characteristics of the parent. The regenerable cells include but are not limited to cells from leaves, pollen, embryos, cotyledons, hypocotyls, roots, root tips, anthers, flowers and a part thereof, ovules, shoots, stems, stalks, pith and capsules or callus or protoplasts derived therefrom.

A still further aspect relates to a cured plant material - such as cured leaf or cured tobacco - derived or derivable from a mutant, non-naturally occurring or transgenic plant or cell, wherein expression of one or more of the polynucleotides described herein or the activity of the protein encoded thereby is modulated. Suitably the visual appearance of said plant (for example, leaf) is substantially the same as the control plant. Suitably, the plant is a tobacco plant.

Embodiments are also directed to compositions and methods for producing mutant, non-naturally occurring or transgenic plants or plant cells that have been modified to modulate the expression or activity of the one or more of the polynucleotides or polypeptides described herein which can result in plants or plant components (for example, leaves - such as green leaves or cured leaves - or tobacco) or plant cells with modulated levels of proteases.

In another aspect, there is provided a method for modulating (e.g. increasing) the amount of protease in at least a part of a plant (for example, the leaves - such as cured leaves - or in tobacco), comprising the steps of: (i) modulating (e.g. increasing) the expression or activity of one or more of the polypeptides described herein (or any combination thereof as described herein), suitably, wherein the polypeptide(s) is encoded by the corresponding polynucleotide sequence described herein; (ii) measuring the protease content in at least a part (for example, the leaves - such as cured leaves - or tobacco or in smoke) of the mutant, non-naturally occurring or transgenic plant obtained in step (i); and (iii) identifying a mutant, non-naturally occurring or transgenic plant in which the protease content therein has been modulated (e.g. increased) in comparison to a control plant. Suitably, the visual appearance of said mutant, non-naturally occurring or transgenic plant is substantially the same as the control plant. Suitably, the plant is a tobacco plant.

In another aspect, there is provided a method for modulating (e.g. increasing) the amount of protease in at least a part of cured plant material - such as cured leaf - comprising the steps of: (i) modulating (e.g. increasing) the expression or activity of one or more of the polypeptides (or any combination thereof as described herein), suitably, wherein the polypeptide(s) is encoded by the corresponding polynucleotide sequence described herein; (ii) harvesting plant material - such as one or more of the leaves - and curing for a period of time; (iii) measuring the protease content in at least a part of the cured plant material obtained in step (ii) or during step (ii); and (iv) identifying cured plant material in which the protease content therein has been modulated (e.g. increased) in comparison to a control plant.

An increase in expression as compared to the control may be from about 5 % to about 100 %, or an increase of at least 10 %, at least 20 %, at least 25 %, at least 30 %, at least 40 %, at least 50 %, at least 60 %, at least 70 %, at least 75 %, at least 80 %, at least 90 %, at least 95 %, at least 98 %, or 100 % or more - such as 200%, 300%, 500%, 1000% or more, which includes an increase in transcriptional activity or polynucleotide expression or polypeptide expression or a combination thereof. An increase in activity as compared to a control may be from about 5 % to about 100 %, or an increase of at least 10 %, at least 20 %, at least 25 %, at least 30 %, at least 40 %, at least 50 %, at least 60 %, at least 70 %, at least 75 %, at least 80 %, at least 90 %, at least 95 %, at least 98 %, or 100 % or more - such as 200%, 300%, 500%, 1000% or more.

A reduction in expression as compared to a control may be from about 5 % to about 100 %, or a reduction of at least 10 %, at least 20 %, at least 25 %, at least 30 %, at least 40 %, at least 50 %, at least 60 %, at least 70 %, at least 75 %, at least 80 %, at least 90 %, at least 95 %, at least 98 %, or 100 %, which includes a reduction in transcriptional activity or polynucleotide expression or polypeptide expression or a combination thereof.

A reduction in activity as compared to a control may be from about 5 % to about 100 %, or a reduction of at least 10 %, at least 20 %, at least 25 %, at least 30 %, at least 40 %, at least 50 %, at least 60 %, at least 70 %, at least 75 %, at least 80 %, at least 90 %, at least 95 %, at least 98 %, or 100 %.
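The percentage increases and reductions recited above are relative to the control. As an illustrative sketch only, the calculation can be written as the difference between the test and control measurements divided by the control measurement; the function name and the example values are hypothetical, and any consistent unit of expression or activity may be used.

```python
def percent_change(test_value: float, control_value: float) -> float:
    """Change of a test measurement relative to a control, in percent.

    Positive results are increases and negative results are reductions.
    A result of -100.0 corresponds to complete loss of expression or activity.
    """
    if control_value <= 0:
        raise ValueError("control_value must be positive")
    if test_value < 0:
        raise ValueError("test_value must not be negative")
    return (test_value - control_value) / control_value * 100.0
```

For example, a test value of 150 against a control value of 100 is an increase of 50 %, a test value of 25 is a reduction of 75 %, and a test value of 1100 is an increase of 1000 %.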

Polynucleotides and recombinant constructs described herein can be used to modulate the expression of the proteases described herein in a plant species of interest, suitably tobacco. A number of polynucleotide based methods can be used to increase gene expression in plants and plant cells. By way of example, a construct, vector or expression vector that is compatible with the plant to be transformed can be prepared which comprises the gene of interest together with an upstream promoter that is capable of overexpressing the gene in the plant or plant cell. Exemplary promoters are described herein. Following transformation and when grown under suitable conditions, the promoter can drive expression in order to modulate (for example, reduce) the levels of this enzyme in the plant, or in a specific tissue thereof. In one exemplary embodiment, a vector carrying one or more polynucleotides described herein (or any combination thereof as described herein) is generated to overexpress the gene in a plant or plant cell. The vector carries a suitable promoter - such as the cauliflower mosaic virus CaMV 35S promoter - upstream of the transgene driving its constitutive expression in all tissues of the plant. The vector also carries an antibiotic resistance gene in order to confer selection of the transformed calli and cell lines.

In a preferred embodiment, a promoter and regulatory sequences are derived from one or more of SEQ ID Nos: 1-80. These regulatory sequences can be used in conjunction with cognate or non-cognate expression sequences to increase expression of said sequences in a tobacco plant during the curing procedure.

The expression of sequences from promoters can be enhanced by including expression control sequences, including enhancers, chromatin activating elements, transcription factor responsive elements and the like. Such control sequences may be constitutive, and upregulate transcription in a universal manner; or they may be facultative, and upregulate transcription in response to specific signals. Signals associated with senescence and signals which are active during the curing procedure are specifically indicated.

Various embodiments are therefore directed to methods for modulating (for example, increasing) the expression level of one or more polynucleotides described herein (or any combination thereof as described herein) by integrating multiple copies of the polynucleotide into a plant genome, comprising: transforming a plant cell host with an expression vector that comprises a promoter operably-linked to one or more polynucleotides described herein. The polypeptide encoded by a recombinant polynucleotide can be a native polypeptide, or can be heterologous to the cell.

A tobacco plant carrying a mutant allele of one or more polynucleotides described herein (or any combination thereof as described herein) can be used in a plant breeding program to create useful lines, varieties and hybrids. In particular, the mutant allele is introgressed into the commercially important varieties described above. Thus, methods for breeding plants are provided, that comprise crossing a mutant plant, a non-naturally occurring plant or a transgenic plant as described herein with a plant comprising a different genetic identity. The method may further comprise crossing the progeny plant with another plant, and optionally repeating the crossing until a progeny with the desirable genetic traits or genetic background is obtained. One purpose served by such breeding methods is to introduce a desirable genetic trait into other varieties, breeding lines, hybrids or cultivars, particularly those that are of commercial interest. Another purpose is to facilitate stacking of genetic modifications of different genes in a single plant variety, lines, hybrids or cultivars. Intraspecific as well as interspecific matings are contemplated. The progeny plants that arise from such crosses, also referred to as breeding lines, are examples of non-naturally occurring plants of the invention.

In one embodiment, a method is provided for producing a non-naturally occurring tobacco plant comprising: (a) crossing a mutant or transgenic tobacco plant with a second tobacco plant to yield progeny tobacco seed; (b) growing the progeny tobacco seed, under plant growth conditions, to yield the non-naturally occurring tobacco plant. The method may further comprise: (c) crossing the previous generation of non-naturally occurring tobacco plant with itself or another tobacco plant to yield progeny tobacco seed; (d) growing the progeny tobacco seed of step (c) under plant growth conditions, to yield additional non-naturally occurring tobacco plants; and (e) repeating the crossing and growing steps of (c) and (d) multiple times to generate further generations of non-naturally occurring tobacco plants. The method may optionally comprise, prior to step (a), a step of providing a parent plant which comprises a genetic identity that is characterized and that is not identical to the mutant or transgenic plant. In some embodiments, depending on the breeding program, the crossing and growing steps are repeated from 0 to 2 times, from 0 to 3 times, from 0 to 4 times, 0 to 5 times, from 0 to 6 times, from 0 to 7 times, from 0 to 8 times, from 0 to 9 times or from 0 to 10 times, in order to generate generations of non-naturally occurring tobacco plants. Backcrossing is an example of such a method wherein a progeny is crossed with one of its parents or another plant genetically similar to its parent, in order to obtain a progeny plant in the next generation that has a genetic identity which is closer to that of one of the parents. Techniques for plant breeding, particularly tobacco plant breeding, are well known and can be used in the methods of the invention. The invention further provides non-naturally occurring tobacco plants produced by these methods. Certain embodiments exclude the step of selecting a plant.

In some embodiments of the methods described herein, lines resulting from breeding and screening for variant genes are evaluated in the field using standard field procedures. Control genotypes including the original unmutagenized parent are included and entries are arranged in the field in a randomized complete block design or other appropriate field design. For tobacco, standard agronomic practices are used, for example, the tobacco is harvested, weighed, and sampled for chemical and other common testing before and during curing. Statistical analyses of the data are performed to confirm the similarity of the selected lines to the parental line. Cytogenetic analyses of the selected plants are optionally performed to confirm the chromosome complement and chromosome pairing relationships. DNA fingerprinting, single nucleotide polymorphism, microsatellite markers, or similar technologies may be used in a marker-assisted selection (MAS) breeding program to transfer or breed mutant alleles of a gene into other tobaccos, as described herein. For example, a breeder can create segregating populations from hybridizations of a genotype containing a mutant allele with an agronomically desirable genotype. Plants in the F2 or backcross generations can be screened using a marker developed from a genomic sequence or a fragment thereof, using one of the techniques listed herein. Plants identified as possessing the mutant allele can be backcrossed or self-pollinated to create a second population to be screened. Depending on the expected inheritance pattern or the MAS technology used, it may be necessary to self-pollinate the selected plants before each cycle of backcrossing to aid identification of the desired individual plants. Backcrossing or other breeding procedure can be repeated until the desired phenotype of the recurrent parent is recovered.
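By way of illustration only, the randomized complete block design mentioned above can be sketched in Python as follows. Each entry (selected lines plus the unmutagenized parent as control) is assigned exactly once to every block, in an independently shuffled order. The entry names, the number of blocks and the random seed are hypothetical and are not taken from the present disclosure.

```python
import random


def randomized_complete_blocks(entries, n_blocks, seed=0):
    """Return one independently shuffled plot order per block.

    Every entry appears exactly once in every block, which is the
    defining property of a randomized complete block design.
    """
    rng = random.Random(seed)  # fixed seed so the layout is reproducible
    layout = []
    for _ in range(n_blocks):
        block = list(entries)
        rng.shuffle(block)
        layout.append(block)
    return layout


# Hypothetical entries: three selected lines plus the parental control.
entries = ["line_A", "line_B", "line_C", "parent_control"]
layout = randomized_complete_blocks(entries, n_blocks=4)
for number, block in enumerate(layout, start=1):
    print(f"block {number}: {block}")
```

Other appropriate field designs, as contemplated above, would require a different assignment scheme.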

In a breeding program, successful crosses yield F1 plants that are fertile. Selected F1 plants can be crossed with one of the parents, and the first backcross generation plants are self-pollinated to produce a population that is again screened for variant gene expression (for example, the null version of the gene). The process of backcrossing, self-pollination, and screening is repeated, for example, at least 4 times until the final screening produces a plant that is fertile and reasonably similar to the recurrent parent. This plant, if desired, is self-pollinated and the progeny are subsequently screened again to confirm that the plant exhibits variant gene expression. In some embodiments, a plant population in the F2 generation is screened for variant gene expression, for example, a plant is identified that fails to express a polypeptide due to the absence of the gene according to standard methods, for example, by using a PCR method with primers based upon the nucleotide sequence information for the polynucleotide(s) described herein (or any combination thereof as described herein).
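As a rough guide to the size of an F2 population to be screened, the following Python sketch computes the smallest number of plants that gives a chosen probability of recovering at least one plant of the desired genotype. It assumes, for illustration only, a single locus with Mendelian segregation, so that one quarter of the F2 plants are homozygous for the null allele; this assumption may not hold where homeologous gene copies are present, and the function name is hypothetical.

```python
import math


def min_plants_to_screen(fraction_desired, confidence):
    """Smallest n such that P(at least one desired plant among n) >= confidence.

    Solves 1 - (1 - fraction_desired) ** n >= confidence for n.
    """
    if not 0 < fraction_desired < 1 or not 0 < confidence < 1:
        raise ValueError("both arguments must lie strictly between 0 and 1")
    return math.ceil(math.log(1 - confidence) / math.log(1 - fraction_desired))


# Assumption: single-locus segregation, 1/4 of F2 plants homozygous null.
f2_homozygous = 0.25
n95 = min_plants_to_screen(f2_homozygous, 0.95)
n99 = min_plants_to_screen(f2_homozygous, 0.99)
print(n95, n99)
```

Under these assumptions, screening 11 plants gives at least a 95% chance, and 17 plants at least a 99% chance, of finding one homozygous plant.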

Hybrid tobacco varieties can be produced by preventing self-pollination of female parent plants (that is, seed parents) of a first variety, permitting pollen from male parent plants of a second variety to fertilize the female parent plants, and allowing F1 hybrid seeds to form on the female plants. Self-pollination of female plants can be prevented by emasculating the flowers at an early stage of flower development. Alternatively, pollen formation can be prevented on the female parent plants using a form of male sterility. For example, male sterility can be produced by cytoplasmic male sterility (CMS), or transgenic male sterility wherein a transgene inhibits microsporogenesis and/or pollen formation, or self-incompatibility. Female parent plants containing CMS are particularly useful. In embodiments in which the female parent plants are CMS, pollen is harvested from male fertile plants and applied manually to the stigmas of CMS female parent plants, and the resulting F1 seed is harvested.

Varieties and lines described herein can be used to form single-cross tobacco F1 hybrids. In such embodiments, the plants of the parent varieties can be grown as substantially homogeneous adjoining populations to facilitate natural cross-pollination from the male parent plants to the female parent plants. The F1 seed formed on the female parent plants is selectively harvested by conventional means. One also can grow the two parent plant varieties in bulk and harvest a blend of F1 hybrid seed formed on the female parent and seed formed upon the male parent as the result of self-pollination. Alternatively, three-way crosses can be carried out wherein a single-cross F1 hybrid is used as a female parent and is crossed with a different male parent. As another alternative, double-cross hybrids can be created wherein the F1 progeny of two different single-crosses are themselves crossed. A population of mutant, non-naturally occurring or transgenic plants can be screened or selected for those members of the population that have a desired trait or phenotype. For example, a population of progeny of a single transformation event can be screened for those plants having a desired level of expression or activity of the polypeptide(s) encoded thereby. Physical and biochemical methods can be used to identify expression or activity levels. These include Southern analysis or PCR amplification for detection of a polynucleotide; Northern blots, S1 RNase protection, primer-extension, or RT-PCR amplification for detecting RNA transcripts; enzymatic assays for detecting enzyme or ribozyme activity of polypeptides and polynucleotides; and protein gel electrophoresis, Western blots, immunoprecipitation, and enzyme-linked immunoassays to detect polypeptides. Other techniques such as in situ hybridization, enzyme staining, and immunostaining and enzyme assays also can be used to detect the presence or expression or activity of polypeptides or polynucleotides.

Mutant, non-naturally occurring or transgenic plant cells and plants are described herein comprising one or more recombinant polynucleotides, one or more polynucleotide constructs, one or more double-stranded RNAs, one or more conjugates or one or more vectors/expression vectors.

Without limitation, the plants described herein may be modified for other purposes either before or after the expression or activity has been modulated according to the present invention. One or more of the following genetic modifications can be present in the mutant, non-naturally occurring or transgenic plants. In one embodiment, one or more genes that are involved in the conversion of nitrogenous metabolic intermediates are modified resulting in plants or parts of plants (such as leaves) that, when cured, produce lower levels of at least one tobacco-specific nitrosamine than control plants. Non-limiting examples of genes that can be modified include genes encoding a nicotine demethylase, such as CYP82E4, CYP82E5 and CYP82E10, which participate in the conversion of nicotine to nornicotine and are described in WO2006091194, WO2008070274, WO2009064771 and PCT/US2011/021088 and as described in detail herein. In another embodiment, one or more genes that are involved in heavy metal uptake or heavy metal transport are modified resulting in plants or parts of plants (such as leaves) having a lower heavy metal content than control plants or parts thereof without the modification(s). Non-limiting examples include genes in the family of multidrug resistance associated proteins, the family of cation diffusion facilitators (CDF), the family of Zrt-, Irt-like proteins (ZIP), the family of cation exchangers (CAX), the family of copper transporters (COPT), the family of heavy-metal P-type ATPases (for example, HMAs, as described in WO2009074325), the family of homologs of natural resistance-associated macrophage proteins (NRAMP), and the family of ATP-binding cassette (ABC) transporters (for example, MRPs, as described in WO2012/028309), which participate in transport of heavy metals, such as cadmium. The term heavy metal as used herein includes transition metals.
Examples of other modifications include herbicide tolerance; for example, glyphosate is an active ingredient of many broad spectrum herbicides. Glyphosate resistant transgenic plants have been developed by transferring the aroA gene (a glyphosate EPSP synthetase from Salmonella typhimurium and E. coli). Sulphonylurea resistant plants have been produced by transforming the mutant ALS (acetolactate synthetase) gene from Arabidopsis. OB protein of photosystem II from mutant Amaranthus hybridus has been transferred into plants to produce atrazine resistant transgenic plants; and bromoxynil resistant transgenic plants have been produced by incorporating the bxn gene from the bacterium Klebsiella pneumoniae. Another exemplary modification results in plants that are resistant to insects. Bacillus thuringiensis (Bt) toxins can provide an effective way of delaying the emergence of Bt-resistant pests, as recently illustrated in broccoli where pyramided cry1Ac and cry1C Bt genes controlled diamondback moths resistant to either single protein and significantly delayed the evolution of resistant insects. Another exemplary modification results in plants that are resistant to diseases caused by pathogens (for example, viruses, bacteria, fungi). Plants expressing the Xa21 gene (resistance to bacterial blight) with plants expressing both a Bt fusion gene and a chitinase gene (resistance to yellow stem borer and tolerance to sheath) have been engineered. Another exemplary modification results in altered reproductive capability, such as male sterility. Another exemplary modification results in plants that are tolerant to abiotic stress (for example, drought, temperature, salinity), and tolerant transgenic plants have been produced by transferring acyl glycerol phosphate enzyme from Arabidopsis; genes coding mannitol dehydrogenase and sorbitol dehydrogenase, which are involved in synthesis of mannitol and sorbitol, improve drought resistance.
Other exemplary modifications can result in plants with improved storage proteins and oils, plants with enhanced photosynthetic efficiency, plants with prolonged shelf life, plants with enhanced carbohydrate content, plants resistant to fungi, and plants encoding an enzyme involved in the biosynthesis of alkaloids. Transgenic plants in which the expression of S-adenosyl-L-methionine (SAM) and/or cystathionine gamma-synthase (CGS) has been modulated are also contemplated.

One or more such traits may be introgressed into the mutant, non-naturally occurring or transgenic tobacco plants from another tobacco cultivar or may be directly transformed into it. The introgression of the trait(s) into the mutant, non-naturally occurring or transgenic tobacco plants of the invention may be achieved by any method of plant breeding known in the art, for example, pedigree breeding, backcrossing, doubled-haploid breeding, and the like (see, Wernsman, E. A., and Rufty, R. C. 1987. Chapter Seventeen. Tobacco. Pages 669-698 In: Cultivar Development. Crop Species. W. H. Fehr (ed.), MacMillan Publishing Co., Inc., New York, N.Y., 761 pp.). Molecular biology-based techniques described above, in particular RFLP and microsatellite markers, can be used in such backcrosses to identify the progenies having the highest degree of genetic identity with the recurrent parent. This permits one to accelerate the production of tobacco varieties having at least 90%, preferably at least 95%, more preferably at least 99% genetic identity with the recurrent parent, yet more preferably genetically identical to the recurrent parent, and further comprising the trait(s) introgressed from the donor parent. Such determination of genetic identity can be based on molecular markers known in the art. The last backcross generation can be selfed to give pure breeding progeny for the nucleic acid(s) being transferred. The resulting plants generally have essentially all of the morphological and physiological characteristics of the mutant, non-naturally occurring or transgenic tobacco plants of the invention, in addition to the transferred trait(s) (for example, one or more single gene traits). The exact backcrossing protocol will depend on the trait being altered to determine an appropriate testing protocol. Although backcrossing methods are simplified when the trait being transferred is a dominant allele, a recessive allele may also be transferred. In this instance, it may be necessary to introduce a test of the progeny to determine if the desired trait has been successfully transferred.
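To illustrate the genetic identity thresholds recited above, the following Python sketch computes the expected average fraction of the genome derived from the recurrent parent after a given number of backcrosses, using the standard expectation 1 - (1/2)^(n+1) for unlinked regions in the absence of selection. This is an average expectation only; linkage to the introgressed trait is ignored, and marker-assisted selection as described above is intended to identify individuals exceeding this average. The function names are hypothetical.

```python
def expected_recurrent_fraction(n_backcrosses):
    """Expected genome fraction from the recurrent parent after n backcrosses.

    The F1 (n = 0) carries 1/2; each backcross halves the donor share.
    """
    if n_backcrosses < 0:
        raise ValueError("n_backcrosses must be non-negative")
    return 1 - 0.5 ** (n_backcrosses + 1)


def backcrosses_needed(target_fraction):
    """Fewest backcrosses whose expected recurrent fraction reaches the target."""
    if not 0 < target_fraction < 1:
        raise ValueError("target_fraction must lie strictly between 0 and 1")
    n = 0
    while expected_recurrent_fraction(n) < target_fraction:
        n += 1
    return n


for target in (0.90, 0.95, 0.99):
    n = backcrosses_needed(target)
    print(f"target {target:.0%}: BC{n} "
          f"(expected {expected_recurrent_fraction(n):.4%})")
```

Without marker-assisted selection, the expected identity passes 90% at the third backcross, 95% at the fourth and 99% at the sixth.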

Various embodiments provide mutant plants, non-naturally occurring plants or transgenic plants, as well as biomass in which the expression level of a polynucleotide (or any combination thereof as described herein) is modulated to modulate the protease activity therein.

Parts of such plants, particularly tobacco plants, and more particularly the leaf lamina and midrib of tobacco plants, can be incorporated into or used in making various consumable products including but not limited to aerosol forming materials, aerosol forming devices, smoking articles, smokable articles, smokeless products, and tobacco products. Examples of aerosol forming materials include but are not limited to tobacco compositions, tobaccos, tobacco extract, cut tobacco, cut filler, cured tobacco, expanded tobacco, homogenized tobacco, reconstituted tobacco, and pipe tobaccos. Smoking articles and smokable articles are types of aerosol forming devices. Examples of smoking articles or smokable articles include but are not limited to cigarettes, cigarillos, and cigars. Examples of smokeless products comprise chewing tobaccos, and snuffs. In certain aerosol forming devices, rather than combustion, a tobacco composition or another aerosol forming material is heated by one or more electrical heating elements to produce an aerosol. In another type of heated aerosol forming device, an aerosol is produced by the transfer of heat from a combustible fuel element or heat source to a physically separate aerosol forming material, which may be located within, around or downstream of the heat source. Smokeless tobacco products and various tobacco-containing aerosol forming materials may contain tobacco in any form, including as dried particles, shreds, granules, powders, or a slurry, deposited on, mixed in, surrounded by, or otherwise combined with other ingredients in any format, such as flakes, films, tabs, foams, or beads. As used herein, the term 'smoke' is used to describe a type of aerosol that is produced by smoking articles, such as cigarettes, or by combusting an aerosol forming material.

In one embodiment, there is also provided cured plant material from the mutant, transgenic and non-naturally occurring tobacco plants described herein. Processes of curing green tobacco leaves are known by those having skills in the art and include without limitation air-curing, fire-curing, flue-curing and sun-curing as described herein.

In another embodiment, there is described tobacco products including tobacco-containing aerosol forming materials comprising plant material - such as leaves, preferably cured leaves - from the mutant tobacco plants, transgenic tobacco plants or non-naturally occurring tobacco plants described herein. The tobacco products described herein can be a blended tobacco product which may further comprise unmodified tobacco.

The mutant, non-naturally occurring or transgenic plants may have other uses in, for example, agriculture. For example, mutant, non-naturally occurring or transgenic plants described herein can be used to make animal feed and human food products.

The invention also provides methods for producing seeds comprising cultivating the mutant plant, non-naturally occurring plant, or transgenic plant described herein, and collecting seeds from the cultivated plants. Seeds from plants described herein can be conditioned and bagged in packaging material by means known in the art to form an article of manufacture. Packaging material such as paper and cloth are well known in the art. A package of seed can have a label, for example, a tag or label secured to the packaging material, or a label printed on the package, that describes the nature of the seeds therein.

Compositions, methods and kits for genotyping plants for identification, selection, or breeding can comprise a means of detecting the presence of a polynucleotide (or any combination thereof as described herein) in a sample of polynucleotide. Accordingly, a composition is described comprising one or more primers for specifically amplifying at least a portion of one or more of the polynucleotides and optionally one or more probes and optionally one or more reagents for conducting the amplification or detection.

Accordingly, gene specific oligonucleotide primers or probes comprising about 10 or more contiguous nucleotides corresponding to the polynucleotide(s) described herein are disclosed. Said primers or probes may comprise or consist of about 15, 20, 25, 30, 40, 45 or 50 or more contiguous nucleotides that hybridise (for example, specifically hybridise) to the polynucleotide(s) described herein. In some embodiments, the primers or probes may comprise or consist of about 10 to 50 contiguous nucleotides, about 10 to 40 contiguous nucleotides, about 10 to 30 contiguous nucleotides or about 15 to 30 contiguous nucleotides that may be used in sequence-dependent methods of gene identification (for example, Southern hybridization) or isolation (for example, in situ hybridization of bacterial colonies or bacteriophage plaques) or gene detection (for example, as one or more amplification primers in nucleic acid amplification or detection). The one or more specific primers or probes can be designed and used to amplify or detect a part or all of the polynucleotide(s). By way of specific example, two primers may be used in a polymerase chain reaction protocol to amplify a nucleic acid fragment encoding a nucleic acid - such as DNA or RNA. The polymerase chain reaction may also be performed using one primer that is derived from a nucleic acid sequence and a second primer that hybridises to the sequence upstream or downstream of the nucleic acid sequence - such as a promoter sequence, the 3' end of the mRNA precursor or a sequence derived from a vector. Examples of thermal and isothermal techniques useful for in vitro amplification of polynucleotides are well known in the art. The sample may be or may be derived from a plant, a plant cell or plant material or a tobacco product made or derived from the plant, the plant cell or the plant material as described herein.
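By way of a minimal sketch, a pair of primers of the lengths recited above can be derived from the two ends of a target region as follows. The target used here is the first 60 nucleotides of SEQ ID NO: 1; the primer length of 20, the function names and the use of the Wallace rule (2 °C per A/T, 4 °C per G/C) as a rough melting temperature estimate are illustrative assumptions and do not represent primers actually used. Specificity, for example against related gene copies, is not checked.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def wallace_tm(primer):
    """Rough melting temperature in deg C: 2*(A+T) + 4*(G+C)."""
    p = primer.upper()
    at = p.count("A") + p.count("T")
    gc = p.count("G") + p.count("C")
    return 2 * at + 4 * gc


def end_primers(target, length=20):
    """Forward primer from the 5' end, reverse primer from the 3' end."""
    if not 10 <= length <= 50:
        raise ValueError("length should be about 10 to 50 nucleotides")
    if len(target) < 2 * length:
        raise ValueError("target too short for two non-overlapping primers")
    target = target.upper()
    return target[:length], reverse_complement(target[-length:])


# First 60 nucleotides of SEQ ID NO: 1.
target = "ATGGCTCTTCGTTTCTCTTTAATTTTCCTATTTTCTCTTTTCTTAACGACGTCGTTATTG"
forward, reverse = end_primers(target, length=20)
print(forward, wallace_tm(forward))
print(reverse, wallace_tm(reverse))
```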

In a further aspect, there is also provided a method of detecting a polynucleotide(s) described herein (or any combination thereof as described herein) in a sample comprising the steps of: (a) providing a sample comprising, or suspected of comprising, a polynucleotide; (b) contacting said sample with one or more primers or one or more probes for specifically detecting at least a portion of the polynucleotide(s); and (c) detecting the presence of an amplification product, wherein the presence of an amplification product is indicative of the presence of the polynucleotide(s) in the sample. In a further aspect, there is also provided the use of one or more primers or probes for specifically detecting at least a portion of the polynucleotide(s). Kits for detecting at least a portion of the polynucleotide(s) are also provided which comprise one or more primers or probes for specifically detecting at least a portion of the polynucleotide(s). The kit may comprise reagents for polynucleotide amplification - such as PCR - or reagents for probe hybridization-detection technology - such as Southern Blots, Northern Blots, in-situ hybridization, or microarray. The kit may comprise reagents for antibody binding-detection technology such as Western Blots, ELISAs, SELDI mass spectrometry or test strips. The kit may comprise reagents for DNA sequencing. The kit may comprise reagents and instructions for determining at least the protease content. Suitably, the kit comprises reagents and instructions for determining at least protease content in plant material, cured plant material or cured leaves.
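The logic of steps (a) to (c) can be illustrated in silico with the following Python sketch, which reports the amplification product predicted for a primer pair on a template, or no product where either primer site is absent. Only exact matches on the given strand are considered, and the template, primers and function names are hypothetical; the sketch is not a substitute for the laboratory detection described above.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def in_silico_amplicon(template, forward, reverse):
    """Predicted amplicon, or None if either primer site is absent.

    The forward primer must match the template as given; the reverse
    primer must match the opposite strand downstream of the forward site.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


# Hypothetical template and primer pair.
template = "GGG" + "ATGCATGCAA" + "TTTTTTTT" + "CCGGAATTCA" + "AAA"
forward = "ATGCATGCAA"
reverse = reverse_complement("CCGGAATTCA")

product = in_silico_amplicon(template, forward, reverse)
print("product length:", len(product) if product else "no product")
```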

In some embodiments, a kit may comprise instructions for one or more of the methods described. The kits described may be useful for genetic identity determination, phylogenetic studies, genotyping, haplotyping, pedigree analysis or plant breeding particularly with co-dominant scoring.

The present invention also provides a method of genotyping a plant, a plant cell or plant material comprising a polynucleotide as described herein. Genotyping provides a means of distinguishing homologs of a chromosome pair and can be used to differentiate segregants in a plant population. Molecular marker methods can be used for phylogenetic studies, characterizing genetic relationships among crop varieties, identifying crosses or somatic hybrids, localizing chromosomal segments affecting monogenic traits, map based cloning, and the study of quantitative inheritance. The specific method of genotyping may employ any number of molecular marker analytic techniques including amplification fragment length polymorphisms (AFLPs). AFLPs are the product of allelic differences between amplification fragments caused by nucleotide sequence variability. Thus, the present invention further provides a means to follow segregation of one or more genes or nucleic acids as well as chromosomal sequences genetically linked to these genes or nucleic acids using such techniques as AFLP analysis.
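Where a co-dominant marker is scored in a segregating F2 population, the fit of the observed genotype classes to the expected 1:2:1 ratio can be checked with a chi-square goodness-of-fit test, as in the following Python sketch. The genotype counts and function name are hypothetical; the critical value 5.991 is the standard chi-square value for 2 degrees of freedom at a significance level of 0.05.

```python
def chi_square_1_2_1(n_hom_a, n_het, n_hom_b):
    """Chi-square statistic against a 1:2:1 segregation ratio."""
    observed = (n_hom_a, n_het, n_hom_b)
    total = sum(observed)
    if total == 0:
        raise ValueError("at least one plant must be scored")
    expected = (total * 0.25, total * 0.5, total * 0.25)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


CRITICAL_DF2_ALPHA_05 = 5.991  # chi-square, 2 degrees of freedom, alpha 0.05

# Hypothetical F2 marker scores: homozygous A, heterozygous, homozygous B.
counts = (22, 53, 25)
statistic = chi_square_1_2_1(*counts)
consistent = statistic < CRITICAL_DF2_ALPHA_05
print(f"chi-square = {statistic:.2f}, consistent with 1:2:1: {consistent}")
```

A statistic above the critical value would suggest segregation distortion or scoring errors, and the marker or population would merit re-examination.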

In one embodiment, there is also provided cured plant material from the mutant, transgenic and non-naturally occurring plants described herein. For example, processes of curing tobacco leaves are known by those having skills in the field and include without limitation air-curing, fire-curing, flue-curing and sun-curing.

In another embodiment, there is described tobacco products including tobacco products comprising plant material - such as leaves, suitably cured plant material - such as cured leaves - from the mutant, transgenic and non-naturally occurring plants described herein or which are produced by the methods described herein. The tobacco products described herein may further comprise unmodified tobacco.

In another embodiment, there is described tobacco products comprising plant material, preferably leaves - such as cured leaves, from the mutant, transgenic and non-naturally occurring plants described herein. For example, the plant material may be added to the inside or outside of the tobacco product and so upon burning a desirable aroma is released. The tobacco product according to this embodiment may even be an unmodified tobacco or a modified tobacco. The tobacco product according to this embodiment may even be derived from a mutant, transgenic or non-naturally occurring plant which has modifications in one or more genes other than the genes disclosed herein.

The invention is further described in the Examples below, which are provided to describe the invention in further detail. These examples, which set forth a preferred mode presently contemplated for carrying out the invention, are intended to illustrate and not to limit the invention.

EXAMPLES

The following examples are provided as an illustration and not as a limitation. Unless otherwise indicated, the present invention employs conventional techniques and methods of molecular biology, plant biology, bioinformatics, and plant breeding.

Example 1

A 48h time-point following the curing start was selected to screen for curing-activated genes based on Affymetrix data essentially as described by Martin et al. ((2012) BMC Genomics, 13:674). In brief, exon candidates from genomic DNA and from EST contigs were joined and the genomic candidates were cleaned for redundancies (98% threshold). This resulted in a set of 312,053 exon candidates, 12,925 of which were represented by ESTs, but were not included in the genome assembly. Data sets were verified as described by the manufacturer (Affymetrix). In addition, quality checks included probe-level models, Normalized Unscaled Standard Error (NUSE) and Relative Log Expression (RLE) plots, and the analysis of detection above background (DABG) results as described by the manufacturer.

As the exon array design had no mismatch probes, summarization was performed using the Robust Multi-array Average (RMA) method. A total of 272,342 probeset expression values were generated, and DABG P-values were computed to assess the significance of the signal obtained for each probeset. This involved the background probes that are spread over the chip. These random probes have a varying GC content. Quality checks involved a combination of Affymetrix Power Tools (APT) and Bioconductor packages, for which the Tobacco Exon Array (TobArray520623F) cdf environment was created. Once the expression values were available, differential gene expression analysis was performed using the moderated t-statistics of the linear models implemented in LIMMA.
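As an illustration of the Relative Log Expression (RLE) quality check referred to above, the following Python sketch subtracts, for each probeset, the median log2 expression across arrays from each value, and then reports the median deviation per array. An array whose median RLE lies far from zero would be flagged for inspection. The expression values and function names are hypothetical; the analysis described in this Example was performed with Affymetrix Power Tools and Bioconductor packages, not with this code.

```python
from statistics import median


def relative_log_expression(log2_expr):
    """Map each probeset to its per-array deviation from the probeset median.

    log2_expr: dict of probeset name -> list of log2 values, one per array.
    """
    rle = {}
    for probeset, values in log2_expr.items():
        centre = median(values)
        rle[probeset] = [value - centre for value in values]
    return rle


def per_array_median(rle):
    """Median RLE of each array across all probesets."""
    n_arrays = len(next(iter(rle.values())))
    return [median(row[i] for row in rle.values()) for i in range(n_arrays)]


# Hypothetical log2 expression values for three probesets on three arrays;
# the third array is deliberately shifted upwards.
log2_expr = {
    "probeset_1": [8.0, 8.2, 9.5],
    "probeset_2": [5.0, 5.1, 6.4],
    "probeset_3": [10.0, 9.9, 11.2],
}
rle = relative_log_expression(log2_expr)
array_medians = per_array_median(rle)
print([round(m, 2) for m in array_medians])
```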

Example 2

Differential expression. The tissue samples were sequenced using RNA-seq; reads were mapped to the genomes of the 3 varieties using Tophat2. Previously published gene models were used as the basis for the differential gene expression analysis. Expression changes during curing were calculated using the Cuffdiff2 software based on the mapped reads. Genes were considered up-regulated if their expression levels increased significantly during the first 48h of curing; genes whose expression did not change significantly, or decreased, were not considered up-regulated. Tobacco proteins were identified by a BLAST search against a database of transcripts for the 3 varieties and equivalent genes in the 3 varieties were identified by a mutual best BLAST hit search of the transcripts of the 3 varieties Burley, Virginia and Oriental (e-value cutoff 1e-80).
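The mutual best BLAST hit criterion can be sketched in Python as follows. For each query, the hit with the lowest e-value at or below the cutoff of 1e-80 is retained, and two transcripts are treated as equivalent only where each is the best hit of the other. The transcript identifiers, e-values and function names are hypothetical, and treating a hit exactly at the cutoff as passing is an assumption.

```python
def best_hits(hits, max_evalue=1e-80):
    """Best subject per query among hits at or below the e-value cutoff.

    hits: iterable of (query, subject, evalue) tuples.
    """
    best = {}
    for query, subject, evalue in hits:
        if evalue > max_evalue:
            continue  # hit too weak to be considered
        if query not in best or evalue < best[query][1]:
            best[query] = (subject, evalue)
    return {query: subject for query, (subject, _) in best.items()}


def mutual_best_hits(a_vs_b, b_vs_a, max_evalue=1e-80):
    """Pairs (a, b) where a's best hit is b and b's best hit is a."""
    forward = best_hits(a_vs_b, max_evalue)
    backward = best_hits(b_vs_a, max_evalue)
    return sorted((a, b) for a, b in forward.items() if backward.get(b) == a)


# Hypothetical hit tables between two varieties.
burley_vs_virginia = [
    ("B1", "V1", 1e-120), ("B1", "V2", 1e-90),
    ("B2", "V2", 1e-100), ("B3", "V3", 1e-50),
]
virginia_vs_burley = [
    ("V1", "B1", 1e-118), ("V2", "B1", 1e-95),
    ("V2", "B2", 1e-85), ("V3", "B3", 1e-50),
]
pairs = mutual_best_hits(burley_vs_virginia, virginia_vs_burley)
print(pairs)
```

In this hypothetical data only B1 and V1 are paired: B2 selects V2 but V2 selects B1, and the B3/V3 hits fail the cutoff.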

The data (Figure 2) show the number of senescence-activated genes in the 3 cured varieties.

Example 3

The protease genes identified in Example 2 were analysed for membership of known protease families. The results are set forth in Table 1.

The 80 curing-activated protease genes were found to belong to 21 different protease families. In the table: AC, air-cured; FC, flue-cured; SC, sun-cured; AC+FC+SC, up-regulated in all three types of tobacco; AC+FC, up-regulated in air-cured and flue-cured tobacco; AC+SC, up-regulated in air-cured and sun-cured tobacco; FC+SC, up-regulated in flue-cured and sun-cured tobacco; AC, FC and SC, up-regulated only in the respective tobacco type.

Protease coding genes: AC+FC+SC AC+FC AC+SC FC+SC AC FC SC
Alpha/beta-Hydrolases superfamily protein: 1
Aspartic proteinase A1 (APA1): 2 1 2 1 *
CLP protease/crotonase family protein: 1 1
Cysteine proteinases superfamily protein: 3 2 3 1 2 1 2
DegP protease 3: - - - - - 1 -
Eukaryotic aspartyl protease family protein: 4 1 1 1 3 2 2
FTSH protease 8: - - - - - - 1
Gamma-glutamyl transpeptidase 4: 1
Heat shock protein 101: 1 1 - - 1 1 -
Lon protease 1 & 3: - - - - - 3 -
Metallopeptidase M24 family protein: 1
Papain family cysteine protease: 1
Peptidase M20/M25/M40 family protein: 1 1
Protease-related: - - - - 1 2 -
SAG12: - 1 - - - - -
Serine carboxypeptidase-like: 1 1 1 1 1 1
SERPIN: - 1 1 - - 1 -
Signal peptide peptidase: - - - - 1 - -
SITE-1 protease: - 1 - - - -
Subtilisin-like ser endopeptidase fam prot.: 3 1 - 1 3 -
Ubiquitin-specific proteases: 1 3 2
Total: 16 12 6 3 17 19 7

Table 1.
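The column headings of Table 1 correspond to the set of tobacco types in which a gene is up-regulated. The following Python sketch shows how genes might be assigned to these categories and counted; the gene names and their assignments are hypothetical, whereas the totals are those given in the final row of Table 1 and are checked against the 80 curing-activated protease genes.

```python
from collections import Counter

ORDER = ("AC", "FC", "SC")  # air-cured, flue-cured, sun-cured


def category(up_in):
    """Label such as 'AC+FC' for the tobacco types showing up-regulation."""
    present = [tobacco_type for tobacco_type in ORDER if tobacco_type in up_in]
    if not present:
        raise ValueError("gene is not up-regulated in any tobacco type")
    return "+".join(present)


# Hypothetical genes and the tobacco types in which each is up-regulated.
genes = {
    "gene_1": {"AC", "FC", "SC"},
    "gene_2": {"FC"},
    "gene_3": {"SC", "AC"},
    "gene_4": {"FC"},
}
counts = Counter(category(up_in) for up_in in genes.values())
print(dict(counts))

# Totals row of Table 1; the seven categories account for all 80 genes.
table1_totals = {"AC+FC+SC": 16, "AC+FC": 12, "AC+SC": 6, "FC+SC": 3,
                 "AC": 17, "FC": 19, "SC": 7}
print(sum(table1_totals.values()))
```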

Example 4

APA1 is encoded by a single gene in Arabidopsis thaliana and by 4 genes in tomato. The gene activated in flue-cured Virginia tobacco (see Table 1) is close to APA1-Tomato-1. Two gene copies from both ancestors N. sylvestris (S) and N. tomentosiformis (T) exist in N. tabacum. Affymetrix data confirmed the activation of the S form (upper panel) and apparently not the T form during Virginia flue-curing (lower panel).

Example 5

Table 2 illustrates the differential up-regulation of SEQ ID NO:1 to 80 in the three tobacco types air-cured Burley (AC), flue-cured Virginia (FC) and sun-cured Oriental (SC).

Table 2

SEQ ID NO: AC FC SC AC-FC AC-SC FC-SC AC-FC-SC

[Table 2 body: each of SEQ ID NO: 1 to 80 is marked with an X in the column or columns in which it is up-regulated; the column positions of the marks are not recoverable.]

SEQUENCE LISTING

SEQ ID NO: 1

ATGGCTCTTCGTTTCTCTTTAATTTTCCTATTTTCTCTTTTCTTAACGACGTCGTTATTG TT

GTCCGTTAACGGCAACATTAACGGCGGTGAAGATGACGATATTTTGATCCGTCAAGT CG

TAGGCGACGACGACGATCACTTGTTAAACGCCGATCATCACTTCACGATTTTTAAGA GG

AGGTTCGGCAAAACCTACGCGTCCGATGAGGAGCATCATTACAGATTCTCGGTGTTC AA

GGCTAACTTGCGCCGTGCAATGCGCCACCAGAAGCTTGATCCCTCCGCCGTTCACGG T

GTGACTCAGTTTTCCGATTTGACTCCGGCCGAGTTCCGCCGGAATTTTCTAGGAGTT AA

CCGTCGGCTCCGGCTTCCTTCTGATGCCAATAAAGCTCCTATTCTTCCTACTGAGGA TC

TCCCTTCAGGTTTCGATTGGAGAGATCACGGTGCCGTCACGTCAGTAAAGAATCAGG TA

CTAGTATATATCAATGTTTGTGTAAAGTTTATCTTTTTTTGGATAGGCGAAGTGTTC GTCA

TTAATGAATAATTACATAATTTCTATTTGTATCGATTGAAAAACTAGGGTTCATGTG GCTC

GTGCTGGTCATTTAGTACCACTGGTGCGTTAGAAGGTGCCACCTATCTTTCTACAGG GA

AGCTTGTAAGCCTCAGCGAGCAACAACTTGTGGACTGTGATCACGAGGTTTGACGTT CT

TCCTCTTTATCTTAGCTTAAAATCATGAATATATTGTCAATAGAGTTACTGTTTTTC TTTTT

TCTTTTTTTCTGGGACGTTTGAATGTGTAAAATAATTTTCGCTGTGGTGTGTCACAG GAT

TTG GTC CATAG CTGTC ATCTTTTTCTAGTTAAAGAAAATTGATAGCGTG AAG GACACTAA

CCGCATAAATTAAAGTGCTTTCTCTGATTCCGTCTCACTTTAAAGTTTAAGAACCCG TTT

GGCCATGAAATTTCTTTTTTTTTTCCGTAAAATTTAACTTTTCTTCTAAATCAATGT TTGGC

CATCAAATTTTTTATTTTCACTTGAAGATAATTTTACAATTTTTCAAAAATTTGAAA AACTT

CAAAAACTGTTTTTCAAAATTTTGAATATTGTTGTTGATGTAAAAAACAGACACTAA TTTA

TAAGAGTAATCTCCTCTTCTTTGTTTGGTGGATGGCCAGGGGTGGGGACTGGGGACC C

ATCTTAAG GG AGCG G AG G AAAAGTTGTTTTATTATTAGTTTATG G CTG GTTATG AAATTC

AACTAATTGATACTCTGAGGATAACCACGGACAAAATTGTTTGGATGATGAGGAAAT CG

CATCCAAAAATTGTCTGCATCTGAATATACTTTTAACATTACTTGAAGTTTCAAGTT TAAG

CTCGTGTATGCAACGTGGTGGGAGATGTACAAGGATAAATAGAAAGGCGTTGAGTTA TT

GAGATAGGTTTGTAAAACTCTTCTTAAATTTTCCATTGTTTGATTGCCATTATATAA TCAT

TTGTATAATTTCCAACTTGGAAAAAGCTGTTCAAACTCAAAATAAGGTTTAGGCTTG AAC

TTATTGCTATTTACGGTGTCTGCCATTTTATAATCAGAAATGGGATTGAATACAGAG TTA

ATAAGACCACTGACTCGCCTTATTTACCTCACTCGTCTCAGATGAATTTTATACTTC CAA

ATTTCAGTGTTCCCCATCTCCCTGAAAAATGTATAATTTGGCCTTGCATTTATCTGC AGT

GTGATCCAGAAGAAAAAGATTCATGTGACGCAGGGTGCAATGGTGGCCTAATGAATA G

TGCCTTTGAATACACTCTGAAAGCTGGTGGACTTATGCGAGAAGAAGATTATCCATA CA

CTGGCACCGATCGTGGAACCTGCAAATTTGACAACACCAAGGTTGCTGCTAAAGTTG CT

AACTTTAGCGTTGTCTCCCTTGACGAAGAACAAATCGCTGCTAATCTTGTCAAGAAT GG

TCCTCTCGCTGGTAAATAGTCTCTCAAAACACTTTTCAATTTGCCTATCATTATGCT TCTT

CTTTGTCCTTACTTGATATTGTCAAAGTATATACTTGGATTGTCATATTTATGCACT GGAA

TGTAAAAGGTATTTACACAATTAAGTCACTTATTAGGTAATTACAAGTAACTATTTT GATA

AGTTTTAATTAGTAATGTGTTAAAATGATAATTAACTTGCTATTTAAATTCACTGAT AGCC

GTAACAAAATCTTTTAACTATTAATATATATAATATAAATATTTGTTTTTTAATAAA CAACA

AATATTATTTGTGAAAGATCCAGTTATGTAGCTTGAAACTACATTTTGGGATTTTGA ATTA

TGTACTACTCTTCTTATGCTAATGGTTTTCAATTTTTCACTGATGTAAACTTCTGAA AGCA

TTTTTGTTGCTTGGCTTGCAGTGGCGATCAATGCAGTGTTCATGCAGACATACGTTG GC

GGAGTTTCCTGCCCATATATATGCTCTAAGAAGTTGGATCATGGTGTCTTATTAGTT GGT

TATGGTACTGGCTTTTCTCCCATTAGAATGAAAGAGAAACCATACTGGATCATCAAG AAC

TCATGGGGAGAGAAATGGGGTGAAAACGGATACTACAAAATCTGTAGAGGCCGCAAT G

TTTGCGGAGTGGATTCAATGGTTTCAACAGTTTCAGCTGTTAGTACCAGCTCACAC

SEQ ID NO: 2

TTAAGCTGCTTCAGCAAATCCAACTCTGAGTTTGCCATAATCGAAGACTGTGTGATATC

GACCCATGAAAACATCACCCAAGATCCTGTAACCAAAGGAATACCATAGAGAACTCA GT

G AACAAAAG AACTG CAG G CTCAG GTTTAATTGTGCTGTAGCTCTATAGTTCG G ATTAAA

CTTATATTTGGATTAACTGCATTGCTGATATTTATCTCTAAAACATAATTATAAACT AAAAT

AGAGAGAACATATAAAGATAACTTTACCAGAGTGGTCCGCGGGGAGGAGGAATGTCC A

AGCCAGTGAAACCACTAATACACTGTGCCTTAGCACCCTCGCCCACCTTGAGTATGT AC

TGATCACGGGAAAAGAATCCAAAATTAGAACATAGATCAATCTAGGTCAGCAATCTA AA

CAGACAACTGAAACAAGTAAAAGGGAACCACACATCGGATTACAACTTCACTCTTTC AA

TCTTG AAAAAATTTGTTG AAAG G GTG G G AAAAGACTAGAGTGATAGTCTAGTAG AG AAA

AGTTTTG CATATG GTCAAG GG GTTTG GTGTATCACTTG G ATTTTTTC CTTTGTAAGATGT

GGTCTATCCTGAATTATTCAAAGCTCAACCTCTTTATGTTACTAAACCACAAAACAA ACA

AATTCAGAAAAAATGCAAATGATCAAATTGATTTGGTGTACACTTGATGAATTTCTT CTTT

GTAAGATTTGTTCTATACTGAATCCTTCAGACATAAAAAAAAATATTTTTTTTTTGG GGGG

N CG G CCTGAATTAG CAAG GTCAG CAAGTAATACACTTCCATAAAAATAG CAAAGG GTAA

CTTTTTCACGGCACAAAGATCTTATGCAGGTTTTCTTAGATTACTTAGCTGGAAAAT GAG

ACATCTAAATTTAAGTAAAGTCGAAATACTCACAACCTCAAAATATAGAAGTACTTC TTGA

TGACAACAAACATCTACTTCTCTGTAGAAACTGAAAACCTTAAACACTAGAATCGGT TTT

GTAATATGACAATTAGTTGTAATGCCACAAAAGGACTCTATGATGAGCCACTTAATT TTT

TCTCTCTTTGACAATGTTGAATTAGAAGAGGAATAGCAATGTTTATTACTGTCAAAG ACC

ATTATAAAGCATACCTCCTTCGGGACGAGGTCAAAAACTTTGCCACCAATTGTGAAA GA

GACTGTAGGCATTGAAGAAAGCTTTCCACAGTCAACAGCTGATTCCCCCAATGGGCT TG

G GAG ACG CTCG CAAAG CTGTC AGTCAGTCAG CAATCTTTTCAAG AAAAG AAG AAAATTG

CAGTGACAGATGTTTTACCTCATTCACATAGTTTAATATGCGATCTTGAGTCTGGTT TTG

TCTCAGTTGATTCTCCATCCATATGACCGCCATTTCACAAGCAGAGCACATACCATC CT

G CAGTCCTGTG GATCTG CCAGCTTTCTC GTCTACAACACTCTCAATTCC CATACTG CAG

AAAAAAGGCCACAGAATTATTCAGTATTTATATCAACATTATGGATTAACCAAATGC ACT

ATATGTTCACATGAAGGGGCAAAGAGAGCCTGAAGACTAACCTAACTCCGCGGTTTC CA

TCGAAAGTGCATACTCCAACCTGTGAGCAAATCTTCTTTGGATGTGCCTGTTTAAAT ACT

GACCAATTAGATAACCGGGAAAGGCAACTAGATTGCCAAGTGTCATTTTGCTGTAAC TG

CACAGGAAACTGCATATCAAACAAATGAAAATGCAGTTACACAGTTGAATGCTCACC TC

TGCTAACAGCAAATCCATGATTGTCTGCCCGTACTGCTCCACTACAGATTTGCATTG TT

GGCTAGCAACTCCAGAGGCTCCAATGGCTTGATTAATCATAGTGATTATGGTCTGTA CG

GAAAGGGGTGGGTTTAAGATTGCTCAACCTTGGAAGTGTTTTAATCGTACAATTGTA GA

G AC AAAAG G CAG C AG ATTTTTACTTAATTTATATTGTC AAC ATTTC C AAG C C AAC AG GAT

AAAACTTGG CTACAGTTTTCG G GTTG G ATAATTTTCTTTTCAAATAG AAG AGG G GTAAAT

AAATAAGTCGACAGAAGACCAGGACTACAGCAGAAGTAAAAGCATCATCCTCATTGA AA

C GTAAT AAAAG C AAGTAAC AC AAAAC AAC AAGTAC CTACTG AG G CAG CTTTAAAC ATATT

AAACTG AAAG AC AG G G AAG AAAAAG C AG ATTTAC AG ACTTC G G C C C AGTG ATAAG CTAA

GATGGTATATCCAAAGGTAACCTCAGAAATGAAAAACCAATTTCACTACCATCCTCT CTG

TGATGAAAAATTAAAACACAACACAGATCAGATGATGGATTCGTGCTATTTAACTCA TGA

ATCTTAG G AAAATGTTACTTTTCTTGCTG AGCTGTTG AAG GTT CAAAG GAACAAG GAAAT

CAATAATCGAATTGCGGTTGACTTTGATGATGCAAACAAATAACAAAAACATAACAA ACA

AGCGATATGTCCCAAATCAAGGCTATAGATAATACCGTTGGACCAGCCAAGAGAGAA GT

CCCTGAATCCGCTATTGCAGAGCACCCACTTTCACAGTAACCTACACCCGATATGTT CA

TGTTAAAAACTCAAAGGAAGGGAAAATTCTATATCCAGGCACATAGCCTTCATCTAT ATT

CCCGAAATTCGGCAATCCAATTCAAAAGGTACATAGCAAAACATACCAGTAGCTTTA CC

CTCGATAAGAACATCACCCATATCAAACTGCCAATCATATTACTAAGATCAAGATAG CAT

TTGTACAAAAAATGAACATACATAGTATCGAATTGACCGAATGACAAACCTGCCAAT AAC

CTTTGTGTGTGACTGGGACATAAGTGATTTCTCCCTTATAGTGATTAGGATCAACCC CA CCAAACACGATTTCTCCGCCTTGTTCTTCCTCTGTATTTCGGTTGAGCCAAAATGAGAA

GACAGGATCCTTGATAAGACCCTGTTGGACCATGTTGTACCTGGAAAAGACAGGAGA T

GCTGCCCAGATGAATGTCAAATCAAATTTAAACAGAAAGAGACATCCAGCCTATCCT GC

ATTTATGGAAATCTAATCCTTCAATGTGTTAAACCTCTTCTGGAAAGGAAATTGTCT AGA

GCTTTAATTTGGTTTGTGGGAAAGAAATAGAGCAAACTAAATACCGCCCACGTACCA AA

CTGGAACAGCATTGCCAACTGAAATCTCCTGGAATCCAAGACCCAATATACCGTCAA AC

TTGGCTACCAAAAATGTCACGCTGGGTTCTCTGGTTGCCTCAATAAATTCCTAGTAC AT

GAACACCTTGAGATATAAGATTTCCACTTTCAAGAGATTTAAAACAAGTGAGGAGCC TCA

CTAACCTGATCTGTTACAACAAGGTCACCAACTTTGACGTTGTCTTGACTGAAGAAT CCA

GAAATAGCTCCACTACCATACTGAATTGCAGCAGACTTCCCTGTATGGCAAATCAAA AA

TTTATCACGAACTAAATCACATTAAATTACAATGCCAAATACGATCTCAGTCTTGTG GAA

ACATTCGATAAGATCTTAATGTTGTTCATTAAGGTAGGAGTGACCTAGTGTTCTTAA AAG

CAAAACGTGCAAAAAAATAAAACAAGGTCCACGGACTTGCATTGAATTGCGAGAAGT GA

AG C G C AAAATTAAC AG G AAAC AAG AAAATATC G AAC ATTTATG AATTTACTCTAC C ATAA

AAATTAAACTGCAGGTTAAATAACTAATTTCGGCATCCAATATACAATAATCCCAGT ATTA

ATTCAACTCCTCAAAATTGAGATTCAAAGAAGCAACCAATTCTAGTTGGAATCACTT TGT

GCACCATTATTTGAAGCGCAACTTCTCTAAAGCGCATGGCTTCAGCAATGAAGTGAT AG

CCCTTGCTGCATCGCTTCATAACTTTAAACGACCAAGCAATGGCTTTCAATAACACT GG

AGTGAACTCACCTGGCAAACCCATTCAACTGCTATAGACTGTTCAATCCATTTTCTT TGA

GCAACATATATAATTGTAATAGAACAAAAAATAAAGAATAACTAGTTCTCTGCGGAA AAT

TTCTTATGTCACAGACCTACATGGATACAAACCGAGTTAATAGGGAAGAAGAAAGAC CA

TCTAAAAAG G CATTG CATAG GTTAAG ACTTAAGACTATAC AAG GTG CAACG AAAAGG CA

CTAATCGCAGAGAGATATAAGGATATTGATGTTTCTTTTCCAAAACCTCACTAGTTA CAG

TAATATACTAAGAAACACAACATAAACATTAAACAGCCTCGTTTTATGTCTTAACAG TCAA

CTACATGTACTCGTCAATTAACCTTTCCAAGGGAATCCCTTGATGCACCGTGAGAAA CA

CATAAGGACAATACAAAAGATGTTCCATAATGAACAAGATGGCACGTATTCTAAACA ATA

ACAG G CATTAG AAGG AAG CATATGTTTCATGC AG CAATAAACAAG CAAATGGTAGAG AG

AAACAATTTGCATCAACATACAGAAATGGAAACATAACATACCATTCTTCTTATAAG TACT

TG ATTCG CTTG ATTTG AACTTG G AATGAAAG AAACAG G GAACCTACACCAAG ATAG GCA

GTCATCAAATTTTACATCACTCAAGATGGATGTACAATGCTATGCTTTGTATCATTT GCAT

GTATAG AAG CTTAC AG AG AAATAG C ACTTC G AC G AC G G C AC C C AC AAATTC G AG CTAC C

AGTGTCAAAGATTACAGTGAACTTCTGAGGTGGAGTGCCTACACCAATCTCCCCAAA AT

ATTGAGCATCCATATAGTTCTTCAGTGCTACAATGTCTGTATCCTCAGAGTCCCCGA GTT

TACCACGGAAGTTATACTTCCTAATAGACGCCCTCAAAACGTCCCCTTCCTTTGACT CAA

TGCGTGCAGCAAGCCGGTTATTTTGATCAAATTTCATTTTTTTCAAGCCAATTCTCA TCA

AGCCATCATTGGATGAGGAGGCCAAAGGAAAGAGCAGTGCTGAGAGAAACAGGGCAA

CAAGAAATACTTTTGCTCCCAT

SEQ 3

ATGGGTTCTTTCCTCTGTTTCTCCGTCATTGTTGTTCTCCTTGTTCTTCAGCCATGTTTA

GCCAAGAAAGTTTACATTGTTCACATGAAAAATCACCAAATACCTTCTTCTTTTGCT ACC

CATCACGATTGGTACAATGCTCAGCTCCAATCTTTGTCCTCTTCTTCTACCTCTGAT GAA

TCATCCCTTCTTTACTCTTACGACACTGCTTATTCTGGCTTTGCTGCTTCTCTTGAC CCA

CATGAAGCTGAACTACTCCGTCAATCTGATGATGTTGTTGGAGTTTACGAGGATACT GT

TTATACACTCCATACAACAAGGACTCCTGAGTTTCTGGGGTTGAATAATGAGCTCGG CC

TTTGGGCTGGTCACAGTCCACAGGAACTCAACAACGCTGCTCAGGATGTTGTTATCG G

AGTTCTTGACACCGGCGTTTGGCCGGAGTCGAAGAGCTATAACGATTTCGGTATGCC C

GATGTGCCGTCGAGGTGGAAGGGTGAATGTGAATCGGGTTCCGATTTCGATCCGAAA G

TACATTG CAACAAAAAG CTG ATAG GTG CTCGTTTTTTCTCCAAAG GTTATCAAATGTC GG CCTCTGGCTCGTTCACGAACCAACCTAGACAGCCGGAGTCACCTCGTGACCAAGACGG

TCATGGCACCCACACATCCAGCACCGCCGCTGGTGCACCTGTGGCGAACGCTAGCCT T

CTCGGGTACGCTAGTGGGGTCGCGCGTGGTATGGCACCTCGAGCGCGTGTAGCTACG

TACAAGGTATGCTGGCCTACTGGTTGTTTTGGTTCTGATATTCTAGCTGGTATGGAA CG

TGCTATTTTAGATGGAGTTGATGTACTTTCATTATCTTTGGGTGGTGGATCGGGTCC TTA

TTATCGTGATACAATTGCTATTGGTGCTTTCTCTGCTATGGAAAAAGGAATTGTTGT TTC

CTGTTCAGCTGGAAATAGCGGTCCAGCTAAAGGCTCACTTGCAAATACAGCTCCTTG GA

TCATGACCGTTGGTGCTGGTACCATAGATCGTGATTTCCCTGCATTTGCTACTTTAG GT

AACGGGAAAAAAATTACCGGAGTTTCGTTATACAGTGGAAAAGGAATGGGTAAAAAG GT

AGTTC CATTAGTTTACAG CACAG ACAGTAGTG CAAGTCTTTGTTTG CCG GGTTCACTTG

ACCCGAAAATGGTCCGAGGGAAAATAGTGTTATGTGATAGAGGGACAAATGCGAGAG T

AGAAAAGG GTTTAGTAGTG AAG GAAGCTG GTG GAGTTG G G ATG ATATTG GCTAATACG

GCGGAGAGCGGCGAGGAATTGGTGGCGGATAGTCATTTGTTGCCGGCGGTAGCTGTA

GGTAGGAAATTGGGAGATTTTATAAGGCAGTATGTAAAGAGTGAAAAGAATCCGGCC G

CCGTGCTCAGCTTTGGTGGGACGGTGGTGAATGTGAAACCGTCGCCGGTGGTGGCTG

CGTTTAGTTCAAGAGGGCCCAATACTGTAACTCCACAGATTTTGAAGCCCGATGTTA TT

GGGCCTGGAGTTAATATTTTGGCTGCTTGGTCTGAGGCTATTGGGCCCACTGGGCTT G

AAAAGGATACCAGAAGGACCAAGTTCAACATCATGTCTGGTAAGTATTACCAACAAC GG

CTAGTTTCTTAATTTAATCTTTTTCATGCTTAGCTTAATTATGGCCTTAATTATATT TTTAT

TAG ATCTC G C AATTATTAATACTAAC C GTAC AC ACTTAAAAAG G AAAAG AG G AAC G C GTA

GAATAAAGACACCTGTGGGTGATCTGGAATTATGTACTATGCACATTCCTAAACTTT AGA

GGGGTTCACATGTGTAGCATTGATAAGTTAATCCTAAATTACATTAGTTATAATTAA ATAT

TAATGCAGTTTCCAAGAAAATAGATGGACTAAAATTTAGACTTATTTGTATGATGTG ACG

TGTGGAATTAAATTTAAAAACTGCCCAAGCCTATATCAAATTTATGGCTAAAATAGC AAG

AAACGTCCCTTTAATAGGCACAGAAGAAATCCAAGAGGGGCTCGCTGTAGGAGTGTT A

AGAGTTTCGATATGAACAAGGTCTAGAGAAGAATTTATTAATTAATTTCAATAATAT ACGC

TAATGGTATTTGAAAACAATATATTGTAATTTATCGTAACAAGTTACTAATTTCGCT TATTA

TAGACCATTATTGTGAAGTTATTTCTATAGATAAGCCAATAGCATAAAATTCATCCG TCG

GAATGTGCAAGGTGTAGTGGTAGGAGTGCTACTCATGATGTGACAAGTGCATGTCAC G

GGGTTTGAATCATAATGCAAACAAAAGCCTGATATGTTAGTGAAAAATGATAGAGGG AC

G GGTTCATTATTC ACACAAAAG CTTG ATATTTAAGTGAAAAATG ATAG G GGAACG AGTTC

ATTATCCAAAGAGTTTCGAACCTAACCTTACACCATGGCCCTTCTTGGTATAATTTA CAC

TAGTTTTATGAGGCTCCTTTTTGTCTCACAAATTTGTGGATCCCAATCTTACACTTC TGG

GTCCACAAAATTGTGAGACAAAAAAGTTACCTCATACAACTAGTCTCGGTTGTGACT AGT

TGTATGAGACACAAAATAAAATTTTCCGAAAAAGTAGTATGGTCTGTCTTCCGCTAT GAG

ACTAGTTGTTTGAAACACAAAATAAAATTTTCCGAAAAAGTAGTATCGTCTGTCTTC CAC

TAGTGGGTCCTGGTCCCCTTGGAATCCCAGATTATTGGTCCCTACATAACTATAAAG GT

CATAACCTTATCATGGATTTAACATCAACCCTTTGCCCCATCTGAGCACTCTGGACC TAC

CTTAATCACTTTATTGGCTGGAAATAAGTTGATGAACTTTTTGAATTTTTCTTGAAA AAAC

AACAACAAAAAACCACTTGTGATCCCACAAGTGGGTCCGGGGATTAGTGTGTTATAA AG

AGGATGTTTCGGATAGACTTTCGGCTTAGGAAAGATCAATAAAGTAGTAGAAACAAG CA

ATAACAATAGCAAAATACTGAATTTTTCTTGAAAATCCTACACAAATCTCATACTTT GAAA

ATTGTATTTTGTTACATAATTTGATCATTTTTCACTTCGAACTCTTGTAGGCACATC CATG

TCCTGTCCTCATATCAGTGGCCTAGCTGCACTGCTGAAAGCAGCACATCCTGAATGG A

GTCCAAGCGCGATCAAATCTGCACTTATGACGACTGCCTATGTTCGCGACACCACCA AC

TCTCCTCTCCGCGACGCTGAAGGTGGCCAACTCTCCACTCCTTGGGCTCATGGATCA G

GTCATGTTGATCCCCATAAGGCACTTTCCCCCGGTCTAATCTATGATATTACCCCAG AG

GACTACATCAAATTCTTATGCTCCTTGGACTATGAGTTGAACCACATACAAGCCATT GTC

AAGCGCCCGAATGTCACTTGTACTAAGAAATTTGCAGATCCTGGGCAGATTAACTAC CC

TTCATTCTCAGTTTTGTTCGGGAAATCAAGGGTTGTTCGTTACACCCGTGCAGTCAT CAA

TGTAGGAGCTGCAGGATCCGTCTATGAGGTGACCGTTGATGCTCCCCCGTCTGTTAC T GTAACCGTGAAGCCATCAAAACTTGTATTCAAAAGGGTAGGAGAGAGGCTGCGTTACA CCGTTAC ATTCGTGTCAAAGAAG G GTGTTAACATG ATG AG AAAG AGTG CATTTG G CTCC ATTTCTTGGAATAATGCTCAAAACCAAGTTAGGAGTCCAGTTTCATATTCCTGGTCACAA CTATTAGAC

SEQ 4

TCAAGCATCAGCACATCTTGTTGGTGCATATCCCAGTCTGGACCTTTTGGTGTCATATAA

GATATGAAAATTCTGCTGCTGATAGTTTCCAATTATCGACAAAGCAGATCGAGGAGT CC

CTAAAACTGCCAAACAAACGATATCCTCTGGTTCGAGTTTGATAAAGTAGTTCTCTA CTG

GAAAATTCCATACAGCTCCATCACCAAACACGATCCCAAACGAGGGAAATTCCAAGT TC

TTCACACCAGACACATTGTAACACGGATTCAAAATAGGAAAGTCTTGTACAATGGGA TAT

CCCTTAACCTTATTGACAAATGCCTCTTTTATAATCTCATAAGCAGGATCCGCGAAA TAA

CTCAATGTGGTACCTGAATCAATGATTGCACCACCAAGACCTTCTAGCGATAAATTC CA

CGTCTCCTCGGGTATATTCAGTACCTCTCCTCCAACTATGACAGACTTTATCTGCAC ATA

GTAGAATGTTTCCACTTCTTTGCCTCCAACCAATGAAGTAAAATTCAACTGTGGATG TTT

CAAAAGTTCCTTATCTTCACCAAAAATCAACTTACTACTAACACTAGAATTGCTATT CCTA

TCAACAAGACAATACGAAAACGAATGACCATATAAAGATTGAAGCTGAGAAGCAAAC GA

AAGCGGCCCTCTCCCTAATCCTAACAAACCAGCAGCACCATGAAATAATCCTCTATT CC

AATGACCACAACCAAACATCACATTTTCCACCTTCCTAAATTCACTCCCACTCGTCG TCG

TGAGGTTAACAGTAAATGTCTCTAGCGCGAAATCGCCAGTAGTATTAGAACTATCAC CA

TACCAATAGTAATAAGGACAAGTTTGATTCTCGGATTTACAAAGCTGAGGAGGATCA GG

GGATGTAACAAATTTACACCTAGGATCATGACAACTTATATTTCTAAATGAAGTAGA GTC

TTGAGGATTATAATGAGGTCCATTTTGTTCAAAACAATCAAAACAAGGAACACATTG AAT

CCAATTAAGATCACTACCAGTATCAAGAATTAAAGAAAAATGCTTAGGTGGTGTACC AAC

AAACACATCCATAAAATACTCACCAGAGCCAAGGCTTACACCTGACTCCAAAGTCGC CA

TTAGTTTGCCGGAAAGTTCATAAGATTCCAGCGAAACTGCTGCCGGAGCAATCACAG G

CTTATGTTTGTCCACATGTTTTTCATTACTTTTTGCAAGTCTTGAATTGTAATTCTG ATTTT

TCTTCTCAACAATTCTTGTATGGAGTGTCTGAATTCTGCTTAAATCCCTTGCTCTTG ACT

CAAAGACTGAATCCTTAGCCTCAATTTTTTTACCAGCTGATCTGTGCCTTAACTGAA ACT

TTACAGCTTCTTTTTTCTGGTTTCCAAAAATGGAAACTTCTTCATTTTCTCCATTTT TTACA

TCAACACCATCTACTTCTTGAGCTATTGAATGGGTTTTTGATTTTTGAGAAACTCCA TAG

TTG CAATCTG AGTCAGCTGAAG AAG AAACAG CATTAAAG CTTG G ATG GTTAG GG AATTC

AATACCCGAAACACTAGAATTTAGATTTCTGAAGCTGTAAAATCCTCCACAGGCAAC AAA

ACCAGAGGAAAACAAGAATATAAACAACAAAATGAAAAGAATGAACTTTGTCCCCAT

SEQ 5

TTACATTGAGGCATCAAGGAAAGCATTAGAGACATCATCTAATTCCACATTCAGATTTTT

GGCTGAAGGCAATCCAGCCACCACATTATGTTCAATGCCACATTCATTTGTTCCTCT TCT

GATCTTGAAGTAACCATCCTGTTATCATAGAGATAGACATACATTTAGACAAGAAGC TTA

TACAAATAGAGTTTAACTTTTGTGTATTGATAGTTTTCAGTTTGTTTAATCATTCAG GCTA

AGATGACTTATGCAACTGTCTAAAGTAATTCTAATTTAGTAACTTTAAGAGTGCAGT AAAT

AACTTGCTTTAACTTTTAAGATACACAGATAGTGTAAAAATTATTTACGCTGTCAGT ATAG

TTAGCG CATCTACAACAAGG AGTTAG GATGTC GATGGTTTAAG G AAATTTGTG CTCATA

TATGAAAGCATAAGGAGAAAATAGAGTACATACATCACCCCAGCCTCTGTTCCAAGA AT

TAGCAATAAGCTGCAGAAGATAATTAAAAAAAAAGGTGATGATTAGATTTTGATAAA ATG

ATATGATTTTAGATAGAAAATAAGACACCACAACTTGAACACATCAACATACCCAAT AGT

CCTCTCCCTGCTCACTGGTTCCCCATCCGATAAGCTTAACAGCATGGCCTCCCATAC TT TGCCCTGTTACATGCTTGTAAACTCCAGACTTGTAGTGAGCAAAATCCTGTGATTGACA

AAAAAGTTTTAAGTCATTGGGTTAGCGGAAAGCATTGCTAAGAAAAAGAGAAAATAA CAA

TTATAATCAACGAGAACAATCAAATTCAAACAAGCATTAGTTTATACTAGAGAAACT GAG

AAACTAACAATTATAACCAGATAACTTCTCATATTGTGCTAAGCATTAAATTATCTC ATTT

ACAAGTATTAATTTGCAAGTAAATCTCGGACAACATAAAATGAGAAGGTACATCGGT AGT

TAAAGTTTCTCAATTATCATAATAGTTACTATCAGGTCACTTAAAAGATCATTACAT GTAT

CACATTATG GG ATTGG CAGGTAATTGTTTCTATGTACACATAC CACAG GACCAG CAT AT

AAGTTTACCAGCTCAGACAATGTCATGTGTGAAAACTATATCAAATAATTTTAGTAT CAG

AACTTAACCCTCCCCCTTATCCTCATTACCTCGTAGACGGTAAAAGAGACCTCGACT GG

TCCATTTTTGTAAATTTCTGTCATGATACTGTTGGGATCATGGTGGATCCTGTATGC ATT

GACACCATAATGCTTTGATTTCCCCCATAGTAGAATCTCCTTCACACACTTCCTCTG ACA

CTTTGGGGTGGGATATCCTGGTTCACAACCAGGGTGGGAACATCCCTCATTATCAAA GT

AAG G GTCACACTGC CAG AAAAAAC AACTTTATTAGTGATTG ATCATAAAGATCCACG GT

AGCTAATGGTTTTAGAGGAAGCTGTAATCTCTTTTGGTGAAAATAAGTCACCATTTA CCT

CTTCTGTGACCACACCCCTACGGATAAAGTATCGCCATGCTGTTATTGGATATCCAC CA

TCACAACCACTCCCACATAAAAAGCCACAGCATGCTAACAGATCATTTACAGACAGA GA

GATATTCTGCATTACACATTAGAAGTTTAACATCAGTGACCATAACTACAGAAATAG ATT

CACATACGTTTTGTGCTAGGTGAGATGGTTTCCACATGCTTAGACCATGAAAAAGAA TC

ATGAGGCTGGCACGTGAGAGCACTTGCTGAAGATATATAATTAAGTACATAAAAATG TG

TCATCTAAAGCTTTTAAATGAGGCGGTCACACAGTTCAACAAAATTTACCAAGTTAT GAT

G G ATAC AG AAAC G AT CAG AC AG C G ATTC AAC AG C AC C AAAAG C C C AAC AAG AAC C G C A

ATGTCCCTGATCTGTGACGCAAATTTGTCCCGGTGATTGATGTGCAAAGACGGAAAG CA

TTAGGATCACTATCATAGAATTATAATTCAATAGTAGTAAGCAAGAACAAGAAGAGA CTG

ACCCAGAATTCTTCCGATAGTACTACATTGAGGCCAAGCTTTTCGTGCATCAAACTC TTT

TGGTAGCTCCAAAAGTTTTGGATGAGTTAGAATAGGAATTCCCTCCAAATCACCTTC TCT

TGCGGGCTTAACTCCAAGAAGGCGCTTAAATTGCGAAACCTGTGAAAACCAAAGAAA AA

CATAATTATAAATCGACGTTTTCGGTGTATGGCTGCAAAATAAGCAACTGTTATTGA TGG

TTAGAGAAAGCGCTAAGCCATATGATGATCTGACCGTGAAATTCGAGAATCGAGGGT TG

AATGCAGCTTTCCACCCAGCTTTGGCATTTTCATTAACCTCTTTAATGATTGATTCC TAC

ATGATTATCCAAAAAGCTCTCTTAGTTTTAAATTTGAAGCAACAAGGGCAATAACAT CTT

CTCTAAACATGAAAAGAAGAATATACCTGAAGGATTGCAGATTCAACTTTAGCTTCA GAT

ATTGGCTGCTCTGCAACAACCTGTTTTCCATAAGAACAAAGAGATTCTACTCATCAA AAG

ACATCCTTAAAGCTTTTAGGAAACAGAGCTGCAACTCCAGGAACAAAAGCATGACAC AA

TGAGTGACAAACGAAGAACTTCGGGCTCGTTTGGTACGAGGGATAAGGGATAATTAA TC

TCGGGATTAAATTTGAGATGAGTTTATCCCATGTTTGATTGTAGTGTTATTTTAATA ATTA

TGGGAGGGTGGGATAAACAATCGCGGGATAACTAATTTCGGGATAATTAATCCTGCG AA

CCAAACAATCCCTAAAGGTTTCACTTTAATCAAGATGAAACTCTTCCACAACTTTTA TTTT

CAACATTATAATACTATTAGCCTGGAAAATTAATCAAAAGTTTGTAGGAAATTCATC ATAT

GTCTAAAGCACTATAACGTAGAGGAAAAAGAATCATAGAACAAGCAGAAATTGTAAT TA

GTCCATTATTTCTCCTCCTTCTTCTCCCCTTTCTGTATTTCTTTGTGAAGCAATACT TCCT

CTCATGTTATTATATTTCGACAAGTAAGTTAGCTAACACATAACATAAGTAATTTGC ATCA

AACCATATAATTAACTTCAGAAACATGTGTATACTTCTCTTTTCTCATTCTCACTAG GTAA

TGAGAAAATCATTAAAATTTGCTTCTACTCATGATTTCTAGTCAACGCTTAACTAAA GCAT

AAGAAGTCCAAAATACCCAACAATATTTGATCTTTCTGAAGAAACACAAAAAGGCTA ATC

CTTGTGTTCATCAAAAGCTATACAAATCAAATCAATACGCTAAATCCACCTAAAACA AAA

TCATCAATTCAATAGGCAAGAACTACCCATAAGACATACTCCTACTGTGAAAGGTTC AAA

GAATG AAG AAAC AAAC CTG C AATATAAG GAT AAAC AAAG C AC C AAAAAG C AAAG G AGTT

GCTAAAGACTTCAGGGTCAAGGCCAT

SEQ 6

ACATTAGTCCTCCATACTTCTTTCTATCTTCTTCTGTCAGTCGCATCTCCCGGCGACTGT

CTCCTCCTCTCCATTTTTCCTTTCTCTTTTTCCTCACCGAGATATTTTCCCTATAAA CAAA

ACACCGTAAAAATCATCTCCTCTAATTTCCTATTTTCCCCATTTTTCCAAATGGGTT CTTT

CCTCTGTTTCTCTGTCATTGTTCTTTTCCTTGTTTTTCAGCCATGTTTTTCCAAGAA AGTT

TACATTGTTCACATGAAAAACCACCAAATACCTTCTTCTTTTGCTACACACCATGAT TGG

TACAATGCTCAGCTCCAATCCTTGTCCTCTTCTTCAACCTCTGACGAATCATCACTT CTT

TACTCTTACGACACTGCTTATTCTGGCTTTGCTGCTTCTCTTGACCCACATGAAGCT GAA

CTACTCCGTCAATCTGATGATGTTGTTGGAGTTTACGAGGATACTGTTTATACACTT CAT

ACAACAAGGACTCCTGAGTTTCTGGGGCTGAATAATGAGCTCGGTCTTTGGGCTGGT C

ACAGTCCGCAGGAACTCAACAACGCTGCTCAGGATGTTGTTATCGGAGTTCTCGACA C

CGGTGTTTGGCCGGAGTCGAAGAGCTTTAACGATTTCGGTATGCCCAATGTGCCGTC G

AGGTGGAAAGGTGAATGTGAATCGGGTCCTGATTTCGATCCGAAAGTACATTGCAAC AA

AAAGTTAATCGGTGCTCGATTTTTCTCCAAAGGTTACCAAATGTCGGCTTCTGGTTC ATT

TACGAACCAACCTAGACAGCCGGAGTCACCTCGGGACCAGGACGGTCATGGGACTCA

CACATCCAGTACCGCCGCTGGTGCACCGGTGGCGAACGCTAGCCTTCTCGGTTACGC T

AGCGGGGTCGCGCGTGGTATGGCACCGCGAGCGCGTGTAGCTACGTACAAGGTGTGC

TGGCCTACTGGTTGTTTTGGTTCTGATATTCTAGCTGGTATGGAACGTGCTATTTTA GAT

GGCGTTGATGTACTTTCTTTATCTTTGGGTGGTGGATCGGGTCCTTATTATCATGAT ACA

ATTGCTATTG GTG CTTTCTCTG CTATGG AAAAAG GAATTGTTGTTTCCTGTTCAG CTG GA

AATAGCGGTCCAGCCAAAGCTTCACTTGCAAATACAGCTCCTTGGATTATGACCGTT GG

TGCTGGTACCATAGATCGTGATTTCCCTGCTTTTGCTACTTTAGGTAACGGGAAAAA GA

TTACCGGAGTTTCGTTGTACAGTGGAAAAGGAATGGGTAAAAAGGTAGTTCCCTTAG TT

TACAGCACAGATAGTAGTGCAAGTCTTTGTTTGCCGGGTTCACTTGACCCGAAAATA GT

CCGTGGGAAAATAGTGTTATGTGATAGAGGGACAAATGCGAGAGTAGAAAAGGGTTT A

GTAGTGAAAGAAGCTGGTGGGGTTGGGATGATATTGGCGAACACGGCGGAGAGCGGC

GAGGAATTGGTGGCGGATAGTCATTTGTTACCGGCGGTAGCTGTAGGGAGGAAATTG G

GTGATTTTATAAGGCAGTATGTGAAGAGTGAGAAGAATCCGGCCGCCGTGCTCAGCT TT

GGTGGGACGGTGGTGAATGTGAAACCGTCGCCGGTGGTGGCTGCGTTTAGTTCAAGA

GGGCCCAATACTGTAACTCCACAGATTTTGAAGCCCGATGTTATTGGGCCTGGAGTT AA

TATTTTGGCTGCTTGGTCTGAAGCTATTGGGCCCACTGGGCTTGAAAAGGATACTAG AA

GAACTAAGTTCAACATCATGTCTGGTAAGTATTACCAACAACGGCTAGTTTTTGTCA TAA

TCTTTTTATTTATGCTTAGATTAATTATGGCCTTAATTATATTTTTATTAGATCTTG CAATT

ATTAATACTAATCGTACACACTTGAAAGGAAAAAGAGGAACATGTTTAATTAGTGCG TAG

TG ATCTG G AG CACATG CCTAAAGTTTAG AG G GGTTCACATGTGTTG CATTG ATAAGTTA

ATCCTAAATTACATTAGTTATAATTAAATATTAATGCGCTTCCAAGAAAAAGTTGAC TAAA

TTTATCATATATTTCCAAATTTGTTTTGAAAAATATGATTTTGGTGAAGTTTGGCTT GAAG

ATGAAAATGTGTTTGGACATCAATTTTCAAAACATATTTCCCAAATTTATTTTGGAA AAAC

ATGAAACATTTCTTATACCCACAAGTTTAAAAAACTATCACAAATATCCAACGGTAC CATT

ATCAATAACATTCATTATATTATCTCAAACCATAATCCTGAATATAAATAAATTTGG CACA

ATATTATCATTTTTATAATTAACTATATGATACACTATTAGATGATCGAGAATACGA AGCA

ACATCGTTTCAAAATAATAAATGAAAAATGGTGGACTCTTTTATATAATACAAAAGT TTGG

AATAATTTTTAAAAAATATAATAATGATATTTTGACCCAAAACCAACATGTAGTCAA AATC

TATGACCAAACATGTGTTTGCCAAATAAAACCCAAATTTATTTTGACAAAATATATG GCC

AAACGGGGCTTAGTTGTATGATGTGTCGTGTGATATTAATAAAAGAACTGCCGAAGC CT

AT AC C AAATTTATG G CTAAAATAG C AAG AAAC GTC C CTTTAAC AG G G AC AG AAG AAATC

CAAG AGG G GG CTCG CTG GTCTAG AG AAAAATTTATTAATTAATTTCAATAATACG CTG AT

GGTGTAAAAAATATTGACACCATCAATATATTGTAATATATCGTAAAAGTTTATTAA TTTC

ACTTATTATAGACAATTATTTGAAGTTATCTTTATAGCTAACCCAATAGTGTAAAAT TCAT

CCGTTGGAATGTGCAATATGATGTTTGTTTTCAGCTTTTGTGCAGTAGTGATTTTAA ATA

GGTATTACTTGGAGCTTTTGTGCGATGTGACAAAGTGCATGTCACAAGGTTAGAGTC AT

AATGAAGGCAAAAACATGATATTTAAGTGAAAAGTGATAGAGGGACGAGTTCATTGT CC G CAC AAAAG CCTGATATTTAAGTTAAAAAAAATAG GTG ACCAG CTTG ATATTTAAGTG AA

AAAGGATAGAAAGACGGGTTCATTATCCACCGAAAGTCGAACCTAACCTTTTGCCAT GG

CCTTTCTTGGTCATCAAAATAATTTAGGAGACTACCTAGGAAAAGTAGTATGGTCTG TCT

TCCACTAGTGGGTCATAACCTTAATATCATCCCCCTTGCCCCGTTGAGTACTCTGGA CC

TATCTTAATCACTTCATTAGCTGGAAATAAGTTGATGAACTTTTTGAATCATTCTTG AAAA

TTCACAAATTC GAACC GTG G AAACAATCTATTACAG G AATGC AGTCTAAGTCTTCGTACA

ATAGACCCTGTGGTCCGGCCCTTATAGCAGGAGCCTACTGCACTGGGCTGACCTTTT T

CTTTAAAATCTTACAGAGCTCAAAATTTGGACTTTGTACTGTTTCGTTACATTATTT GATC

CTTTTTGTACGTCAAACTCTTTCAGGCACATCCATGTCCTGTCCTCATATCAGTGGC CTA

GCTGCACTTCTGAAAGCAGCGCATCCCGAGTGGAGTCCAAGCGCGATCAAATCTGCA C

TTATGACGACTGCCTATGTTCACGACACCACCAACTCTCCTCTCCGTGACGCTGAAG GT

GGCCAACTCTCCACTCCTTTCGCTCATGGATCAGGTCATGTTGATCCCCACAAGGCA CT

TTCCCCGGGTCTCATCTATGATATTACTCCAGAGGACTACATCAAATTCTTATGCTC CTT

GGACTATGAGTTGAACCACATACAAGCCATTGTCAAGCGCCCGAATGTCACTTGTGC TA

AGAAATTTGCAGATCCCGGGCAGATTAACTACCCTTCGTTCTCAGTTTTGTTTGGGA AAT

CAAGGGTTGTTCGTTACACCCGTGCAGTGACCAATGTAGCAGCTGCAGGATCCGTTT AT

GAGGTAGTCGTTGATGCTCCCCCATCCGTTCTGGTGACCGTGAAGCCATCAAAGCTT G

TGTTCAAAAGGGTAGGAGAGAGGCTGCGCTACACCGTTACATTTGTGTCCAACAAGG G

TGTTAACATGATGAGAAAGAGTGCATTTGGTTCCATTTCTTGGAATAATGCTCAAAA CCA

AGTTAGGAGTCCAGTCTCATATTCCTGGTCACAACTATTAGAC

SEQ 8

ATGAATCCTGAAAAATTCACCCACAAGACTAACGAGGCCCTTGCTGGGGCACACGAGC

TAGCACTATCCGCAGGGCATGCTCAATTTACGCCTCTGCATATGGCTGTGGCCTTAA TA

TCTGATCACAATGGTATTTTTCGACAAGCGATTGTCAATGCTGGTGGGAATGAAGAA GT

AGCTAATTCAGTGGAGCGGGTATTGAATCAAGCGATGAAGAAGCTACCTTCTCAGAC AC

CGGCTCCAGACGAAATCCCACCTAGCACTTCCCTTATCAAGGTGTTACGACGAGCAC A

ATCGTCG CAGAAGTCTTGTG GTG ACAG CCATTTAG CAGTG G ATCAGTTGATTTTAG G AC

TGCTAGAAGACTCTCAAATCGGAGATCTTTTGAAAGAAGCTGGGGTGAGTGCATCAA GA

GTGAAATCAG AG GTAG AG AAACTTAG AGG AAAG GAAG G AAG AAAAGTGG AAAGTG CTT

CAGGGGATACCACATTCCAAGCACTCAAGACTTATGGCCGTGATCTTGTGGAACAAG C

AGGAAAGCTTGATCCCGTGATTGGTAGGGATGAAGAAATTAGAAGAGTCGTTCGGAT TT

TATCAAGGAGGACGAAGAACAACCCGGTTCTTATTGGAGAGCCTGGTGTGGGTAAAA C

AGCAGTTGTTG AAG G GCTAG CACAG AG G ATTGTAC GTG GTGATGTTCCAAGTAATTTAG

CTGATGTTAGGCTTATAGCATTGGATATGGGAGCGCTAGTTGCTGGAGCTAAGTACA GA

G GTG AATTTG AAG AG AG G CTG AAGG CTGTG CTGAAAG AAGTTG AAG AAG CAG AAG G GA

AAGTGATACTTTTCATTGACGAGATACATTTAGTCCTTGGTGCTGGTCGGACAGAAG GG

TCTATGGATGCTGCTAATCTGTTTAAGCCAATGCTAGCCAGAGGTCAATTACGGTGT AT

TG GTG CAACTACACTTGAAGAGTACAGG AAGTATGTTG AG AAG G ATG CTG CATTCG AGA

GGCGTTTCCAGCAGGTGTATGTGGCTGAGCCTAGTGTTACTGACACTATTAGTATTC TC

CG CG GGTTG AAG GAGAG GTATG AAG GG CATCATG GTGTTAAAATTCAG G AC AG AG CTC

TTGTGGTGGCTGCCCAGCTCTCATCTCGGTACATTACAGGTATCTATACTTTTGCTA TTT

TTACATAGCACCTTGTTTTGATGTCTTTTCTCCGTCAATAACTAAGCATGTATATGC ACTA

CTTTTTCCTCGTGCATTTCATTAACTCTATAAATCAGAATGGGACTTAGATTCGGTT AAG

CGAATGAAGGTGAATTTTAACCTAAAATGTTATGGTGTCGGAGCTATAGATGTATAT TTG

TCTG GTACTAAAATGACTTCTTG AAG CAGTAG CCAG AATTTTG ATTCATTTAAGCAG GTA

GGGCATGAGACTTAATTAGCATATCATTGTCTGCACTTCCTTCTGGACCTTTACCAG TGT

ATGAGTTGTTTTTGTGTTACAAGCTGCTCCCCATCTGGATAATGGTGGATTAAGACT TAT

ATGATTGTCAGAAGTGTACTAAAACTTCTTGAGGATAATTAAAAATTGCTCAAATCA AAT CCGTAGCTCGTTTTCCACTGTCAGTTTTTGCAAAATGCTTTTTATGTCTGTGTCGTGACA

AATTAAG CAGTCAG CCAGTTAAATTTTG GCAGTTTGG CATGC AAATTGTCTTTG CTG CAC

ATTTCAGGTGCAAAAATCACTAACCTCTTTGTATTTTCAGGTCGACATCTGCCAGAT AAG

GCTATTGACCTAGTTGATGAAGCTTGTGCAAATGTTAGAGTTCAGCTTGATAGTCAA CCT

GAGGAAATTGACAATCTCGAAAGGAAGAGAATTCAGCTAGAAGTTGAACTTCACGCT CT

CGAGAAGGAAAAAGACAAAGCTAGCAAAGCACGTCTAGTAGAAGTAAGTATTATATA CT

ACCAATGCTTTTACTGGTAATTGCTCTATTTTCTAAAAGATATGTTAAGAATTATAC TGAC

TCGAATTATACTGACACTGGTCCAGGTGAGGAAAGAACTTGATGATTTGAGAGACAA AC

TCCAGC CCTTGATG ATG AG GTACAAAAAAG AG AAG GAAAG GATAGATGAG CTTCG CAG

G CTCAAG CAAAAG CG CGATG AG CTCATTTATG CTTTAC AAGAAGCTG AAAGG AGATATG

ATCTGG CTAGG G CAG CAG ATCTG AGATATG G GG CAATTCAAGAAGTG G AAACTG CAAT

AGCAAATCTTGAGAGTACCTCAGCTGAGAGTACAATGCTAACAGAGACTGTGGGTCC T

GATCAGATTGCCGAAGTTGTGAGTCGCTGGACTGGTATTCCGGTCTCAAGGCTTGGG C

AGAATGAGAAAGAGAAACTGATTGGTCTTGGCGATAGGTTGCACCAAAGAGTGGTCG G

G CAAG ATCATG CAGTTAGAGCTGTTG CTG AAG CCGTGTTAAG ATCTAG AGCTG GTTTAG

GAAGGCCACAGCAACCAACTGGTTCATTCCTTTTCTTGGGGCCAACTGGTGTTGGAA A

G AC GG AG CTCG CTAAAGCTCTTG CAGAG CAG CTCTTTG ATG ATGATAAACTGATG ATCA

GAATAGACATGTCCGAGTACATGGAACAACACTCTGTTGCCCGGCTGATTGGTGCTC C

ACCAGGGTAAGTTTGAATCTAATTCTTTTCTTTTAATGTCATGTCATATTATTACAG TATT

CAATCACAGATTCTCATGTGTTCCACATCTGCAGTTATGTTG GG CACG ATG AG G GAG GA

CAACTTACTG AAG CTGTTAG GAG G CGG CCTTACAGTGTTGTG CTCTTTG ATG AAGTTGA

GAAAGCCCATCCTACGGTGTTTAATACATTGCTTCAAGTTTTGGATGATGGAAGGTT AA

CAGATGGTCAAGGCCGCACAGTTGATTTCACCAACACCGTGATTATTATGACTTCAA AC

TTGGGAGCAGAGTATCTGTTGTCTGGATTAATGGGAAAATGTACGATGGAGACGGCT C

GTGAAATGGTAATGCAGGAGGTAAATAGTCTCAAACTAGTAACTTCCCCTTTGCTGA TA

AAACTGGAAGAATACAGTGAAATAGTTTACCTTATTAGCTAGAATGACAACTGTTTA CAT

GTGTGTATGCTTTGTGATAGGTGCGAAAGCAGTTTAAGCCCGAGCTCCTGAATCGGC T

GGATGAGATTGTTGTGTTTGATCCCCTGTCCCACGAGCAGTTGAGGCAAGTATGCCG C

TACCAGATGAAAGACGTTGCACTACGGCTGGCTGAGAGGGGTATTGCATTGGGTGTT A

CTGAGGCAGCTCTAGATGTCATACTCTCAGAGAGTTATGACCCGGTAAGTGTTATAT CT

TGTAATCTAGTCCAATATTTTAGGATTATTTTGCGAACTTGTACTTATTGTGGTGAT CATG

G CATTCAG GTTT ATG GTG CAAG ACCTATTAG GAG ATG GTTGG AGAGG AAG GTGGTG AC

CGAGCTATCCAAGATGCTTGTGAAGGAGGAGATTGATGAGAACTCAACGGTTTACAT AG

ATGCTGGTGTCGGCAGGAAAGATCTAACCTACAGGGTGGAGAAGAATGGAGGTCTTG T

GAATGCTGCCACCGGGCAAAAATCTGATATATTGATTCAGCTTCCTAATGGTCCCAG GA

GTGATGCTGTCCAAGCAGTCAAGAAAATGAGGATTGAAGAAATTGAAGAAGACGAAA TG

GAAGAT

SEQ 9

TTATGTAAATGCTTCACGTTGCTGTGTAGGTAGCTCCAGTTCAGGCTCAAATGCTGTTA

GCCGAGAAAAATACTGATCTACCATGTCATCTGTTACTTTATCCAAGCTTGGAGGAT CC

CACTG CACGTTTTAACATAAATCAAG AAACTCTCTCTAAG GTTAAAGTTG ACTCTTAG GA

AAATTCCTCCAGGAAGGGGCTCATAATTCATAAAAATAGCATATTAGTTCGCAATAA TAT

TGATTTACCTTAGGTGCAAAATCTCTATCCACAAGCCGTGCACGAACCCCCTGATAG GA

AGATATTAGTACACTGTGATGATGATCTGTGGAATTTAGTACTGAGACTTTTTTTAG CAG

CATATATTACCTCACAGAAGTCATTAGTTATTTGTCCAGAGAAAGCTTGTACTGACA TTC

GATACTCACGAATTAGACACTGGTCCAGAGTCTGATGTCTGCCTTCGCGTATCTGAT CA

GAAGCATTATTGGAAGGTCATCTCACAATGCCCTGGTGGCAGAAGGGAAAAGGAGAA C

ATTCTATCTAGAGAAACTTACAGATCTCAGTGAAACCTTCAAGCTCAGTGGGGCTGT TT CTTGTAGTTTTCGCAATGTTGAAACACACCATGCATCTTGCTTCTTGGCTGCCTCACTTT

CCTGGCAACATTTAAATTTCTTTTTAAAAATGTAATCACACCTGTCCTTAAAAACTT CCAC

ACATGTAGCAGACTTTATATGTTTACTTAAATACTTTACTGTCAGTATTTTAAAAAC AAGA

GAAAATCCATCTCGCTCGAAAGAGCACCCAGAGGTCGTAGATGCAAACGCAATGACT C

AGCTTGTACTATGGTGCAACAAAGTGCAGCACTTAAAGGCTTCTCCGTGCAATTTAT AC

GTGATGTTAGAATCTCCACAATATGCTTTGCCCAAAGAATGATAGATATAAGTTAGT GTG

ACAACAGCCCCAGGACCTATAGGTACTTTTTGAACCTCAAGGAGCAAACAAAGAGTA TG

CTGAGCAATGATCATTGCATTGTTTCAGATTTTTCAAATAGTGGCAACTTGTGATGG CCG

AGGTATTAAAAGCAAAGCAATCCGATGCTGAACATTATTGCATTGATCAACTAAAAT ATG

TTTCCACTGGCACCTAAGGGGGTAGACTGTAGCCAAATAGTCAATGAAGTCGGTGAA AA

ACACGGGAGACTAGGTTTCAAATCTAGCAAAGAGTCGGAGAGAGTCAGAGACAGAAA G

GCTAGGTGATTTCTTCCTATCTGCCTAAAATTTGGTAGATAGCAAGTACCTGTTGGA ATA

GTCGAGGTGCGAGCAAATTGGCCCACACCACCGTTATAGAAAAGAATAATGTTTCTA CT

GGCCACACTTTGATTGTATGCCCATCCGATCGAGTTTCTTACCAGGTAATAGCCTTC AC

ACCCCAGGCTAGTGAATTAAGATGTGCTATCCTATAAAGACTTACTTCGTCCTCCAC TAA

TATGATCGTATCTTGTTACTGGAGTCGAAGATGGTATGTTGGCGAGGCTTCCACCTT CT

GAATACCACATGTTTAATCCCTGTTCTTTCTTTTCTCACTAGAGATATTAAGCAAAT TAAA

ATCAATTAACATCAGAATTTTGTATCTTTCAGAGACCTTTTCAGTATATTTGGGAAA CAAG

TTACACACTCAGGAACTATATGCTCACCAAAGCATCAATAATTTCTTCGACTGTGTC ATG

GCTGAAACATTTATTGAGAGTTTCAATCCTAAAGAAAGAGAAAAGATGGATTAGATT AAG

CTATCAAATACTCATTGATCTGCAAGCAAGATCACACAAACTTGATAAGAATTTATT TTTA

TAAAAACAAAAGGTTAAAGAACAAGACTAAATACAAAAACCAAAAGATCCATGTGAA ACT

ATGACAAAAATTATCATGAACCACAAAGTTTTAAGAATGTAGTGTGAAGTTTTAGAG AGC

TGAAGTTTGTTCCACTAGTTGATGTATTTTATATAACTCATGGAAAAGATGGTGCCA CTA

CCAGTTTCTCTATAATGGAAAAAACAATTTCGCCACTTACAAATCTAAGTGAAGCTA AAT

ATGAAGAGCTAATTGCAGGATTTTGTGTTTACTGAATTAAACAATTTACCAAAAATT ACCA

ATTTGATTCCTCGAAGAAGAAAGTAAGAGATTGATTTGCAATTCTTTCTTTTTGAGT AGTA

TGAAAAATGTTATGAAATGAGGGCAAGTGTCTCATCCAATTAAACCTGTAACTTGCT TTC

CAACAG CAG GTAAAATATTGAAGCAAG G CTG CAG CGTGTCCACTTATG ATTTCAATTAG

CCAAGGGATTCTTTCGATGTTCTATAAGAAACGTGAACGGGATTCCCCGAAGATGTA GG

TAGGGAATCCGCTGACTGGTGTGGTATATATGTTTGGTTAACAACTAGAAAAGGTGT TT

CAAATCCAAAAGCAGCCCTTAACATTAAACGGTAATATGTATCAGTCCACCCCCTTT CAA

ACTGTAG CAG G AACTAATATATTTATG GACATTCCAATTTCCATATTTAG CAAG GATG AC

AGGTACCTGTGAAGTACACTTGTTGGATCTGGATGGACAATCTCTCCACAATTTTCA AG

AGATCTTTCAATCACTGAAGGATCATCAGTCATCAATTTACCAAGTTGTTCCTCAAT TAA

GGGAAGCTTCTGAGAGGAGTGGAGAAGGCAGAGAAGAAGTTTTCAGAGCAATAGTCA A

ATAACCTAAGTAGGCTATTGGTACCTAGGTGCAAAGCAAATTAAGCAAAGGTAGTAG AC

TTACTGCACTGTGTAAGTAGTGAGTAGCAAGCCCACAGGATATCATTTCTGCTCCAT TG

ATCTTGTCTCCAGTTAGGGCCAGGTACTCTCCTGCAAATTGAAAACATTTAAATAGC TTT

CTTCCATGCTGATTTTTTCTTTCATTTACGCAGTAGCAAATGTCCAAACAGATGAGA GAC

ACAGAGAGAGGCAGCATACTAAAAACATATTCCACTTGAACCACAAACTAGTATAGC AT

AAGATAGATAGGGTTTACATGAGCTGACATGCGTGCATCAGGTTGCTTAACATTTAT AT

GCCATAGAATATGAAGTCATCAAAGAGTCAGCCATGTAAATGCAGTGCTACATGATC TA

GCATGAGTTATCAGTTATCTACAAAAAAAAGGCTTGTAACCCACTAATTTTAGTCCG CCA

CCATTTAAAGCTAAATATTAAATAGATAGAACCACATATTAACATCACCGTATGAAG TTAA

ATTTTTAAAAAAAAAGGCAGTCCAGACAAATTCCGAACATAGGTAGAATTCAAAGTC ATA

ATCTCCAAAATACATGAAACAGGAGGAGGAACACATGCTTACTTCCCAATGAGTTAG GG

AAAAATAAGATGAAGTAAAAAGAACATCACGTTAGTACAAATATTTTGATAACAAGC ATA

TAAGAAGGGAAACGTCCAGATATGACATGCTTATTTTTTCCCCATAGAAGATGCGCA TC

TCCCCAGAGAAGAGGAGATCGGGGAAGGGGGTGTGTGTGTGTGTGTGTAGAGAGAGA

GAG AG AG AG AG GGGGCGCGGG G ATC AG AAAAGTAGTTTC C AACTAG ATAATTC ATC AT CAAAGACGATCAGCTACATGACATGATAGAAGTTACTGATGGATCCAAACCACAGCATG

AGATCAGCAAAAGAATATACCCAAATAACCAGGGAGGTGTGAAAGGTAAAATGACGC C

CCAGCATCGGGATGGTAACCAATCAATGTTTCTGGTGTGGCAAAAACCTACATTGAT AC

AAGACCCAATACCCTTGTAAGAACGGTGAAAAATCGATCTAACAGCACTTAAAACAC CT

AAACCATCACCTCGTGCACCGGCCAAATTTGATAGACCTCTTTGACAATTAAGTTCG ATA

AATTTTTTATTTCCTGCACATTCCTTTAGTTCTTGAGTAATTTAAAATCAGGTTATA GCATT

TCAGTAAGACCAAGATATCTTAAATTTTTCAGCACTTTAAACGCTAAGCCTGTGGTT ATA

AGTCATATATAACCGAAATAAACAAGCTTCATCACGAAAAGTCAAAAGAATTGATAA TAT

ACAATTGAAATTACTATTTAACTTATTGGTACAAAGAAACTTCATCAATATAGAGCA TTCA

ATTTAAGAAGATTAAAAGATGAATTTTATCAAATTAACTCGGTACGAACTCATAAAT AGAT

AATAATAACTG ATAAG G C C AAAACTATC AATAAAG G AAAAG C AAG AG AAG G G G C AG AG G

AAAAATTTGCTGAATAAGTTTGTAGCACTAGGATTTGAACTTCTCTCTGAAGAGAAA TAA

TTAAGATTTAAAGAACAGAAAATTCATGATATGTGAAGTCACTAAGCTGTATATAAG AAT

GAGCACAAATGGAACTTCACATTACTTAGCAATGTAGTCTAGCAGTTCTTGAAGTAG GA

GAATTTATTCTGAACCAACAAATGAAAAGCTTAAACAAATAAAGCATGACTAATCTT TTCC

ATACAGTTTTCTCAGTTGCAACACGGAAAGTTCCAGGAATTGAGATGCCAGCCCCAC CA

CCCATGGTAATTCCATTCAAAAGAGCAACCTGCATATGGCATCTGAGGATTAATATC TAT

ACTTGGGAACCGCATCCAGGGACCAGCTAATGTCAACAAAAGTATGACAAGTACTGA AT

ATGCAATGACTGAATTCCTTCAACATGGAATTGAGGAGGTGAGAGCGAGAAGTAACT CA

AATCCGAAAGTGCGCAAAAAGCTCTAATCGATAAAAAGATACAAAGAAATATGGATG AT

AACTACTGATAAATGCACACCGGGTTTCTGAACTAAAAAAAGACGTATATTATGATT AAT

AAAAGAATGAAAAGACCATTATGGCTGCATCTTGTCAAAGCAAAAATGATCACATGA TTT

TAACAAAATAAACAACTTCCAAAATTGAGGAGATATTATTTCATGGGCAAAAAGGAA CAG

ACAAACTATAGGTCTAAACAGGTGACCACTTTGTTGAGCATCAACAACAGCTTATCC CA

AATCTTTCATTTCAGAAGTCAAAGAACACAGCAGCAAGAGAACTATGGATGAACTAA GA

AAATGGAGTTGAATATCTATGAATAATTGGATCAAATTTGCATTTGTCCTAAGGAAG TTC

TTTAAGCTTATGAATCATAATGTTAATGTGACTGATTATGTTTTCCTCAACAGGCAT CGTT

ATGAAGATTGTATAGCAGCCAGCATGATAAAGTAGTGTTGCTACCATTTCAAGTTTA TGA

ATAAGATCCAATCAAACTTTGGCAAGACACATCTAACCTCTCTGAATACCATTTTAA TTC

AGAAAGAAGTGATTGGTGTAATTAACTGTCGAGATGCTGTCTTTGCAGCAGCTTAGT CA

CAGTAAGATGAGAGAGAATCCAATTAACAGAACAAGCGATCTTCTAATAAATCCAGA TAT

TTCTATAAGAGTATCTTTAAAACTGCCAGCATAAAGTACAAAGGTGTTGAAATTTCA ATA

AGCAATGGCGAATATAGGATGAAAATGTTGATTCAACATCTAATAGAACTTACATAT ATT

TTTTAGAAATTCTTAGGTTCCAACTACAGAAGCATATTTTATTATGCGGAAGTCTAG ACG

CACAGTTAGATCACAACATAAAAGACATGTAACTACAAAATTTATAAGACGCTGGCC TCT

ATCAAGGTTTAGTAAATACAACAAAAGTTCCTGATACATATATAAGAGGCAGAAAAC AGA

AGAAAATTCAAGTCAACAGTTCTTTTAGTATCTGATGTTTGAACAATAATGATACTT ACAT

GTGGCTTCAAGAGTGTGCCGACAACGTATACTAAGTTATTTATTGTCCAACAAAAAT CTT

TACAGTCTTGAAGATTCCCTGCATCTATCCAATACAAAAAATATTACACAAAGAATG GAG

AAGATCAAATTAAAATTGAGGTAATAGTAAATTGGTAGCATGAAGATGCCGGCTGCA GT

TTTTAAAACGATGTAGAAACGTTTTAATTGTTCCCAAAAATAAATAGATATCAATAC CCTA

CTCAATAAAAACAACAGAACATCCAGCTATCAGCACAAATAGCTTAATTAAGATAAT TCA

CATCTAAACAACTTCTTCACACCCCAATAATACAATTATCGACAATGTTTAAATATT TAAA

ATTCCTTTAACCATGTATCACTTGCAACATAGCAGAATATGAACAAGTTCGCCGTGA AGT

ACAACAGTCAAAATGACACAACATACCTTGTTTTAACAAATTATAAATAGTGACAAT GTC

ACCCCCAGCAGAAAATGCCTTGCCACTTCCCTTCAAAGTTAAAGAATAACTGAGTTG CT

ACTCAGTAGATTAACATATGTAACTTCACCTTAAGATAATCCAGAAGAACAACAATA GGA

AAAG C C ATAAAATAATTC AAAC AATAATTTACTTAAC AAATTAC CTTC AATAC C AC G AATC

CAATATCAGGATCATCTTCCCAATTTTTGTACAGCTTTAGCAACCTATCCACCTAAA CAA

CAGTATCATGCAGAGTTTTTATTTATTATACAAGGTGGAAATTAAGATTGCCAACCA ACT

GGAATATCAAGATTCCCCAGTGTCATGTAGGGTTCATTCCCAAAGCCAAAACAAAAT CA ATTCAG CATAAAAACACG ATTTTGATG GTTG G ATATTAATTAAAACACG ATTTTACTTG GT GGGTATCAATAAAAAAAATCCAAAAAGAGAGCAATAGATTTCAAAAATGATCTTCTTGTG CGTACACACTGTAGATTTCAAAAATGGCAAAAACAGAAGCAAAAAGTAAAGGTCTTTAAA AGG CAGG AAACTAACAACTG AAAAATTG AG AG CATTTAACG CATG G GGTCTGTTAAG GA TTG CTGTTCTCGAAGAAG CTTTTCC CTCCACTAACAC CTGAAAAATG CAAACTCATATTT TATAACAAAATG G AAATTTTGATAATG AATCATAAATTG G ATTG ACAAATTTTCTTTTAAA AAAAATTCAGAACTCACAGTGCTTTGGGATTCATCAACAAGGGCATTGGTAGAGACACT GCAAAAGCTTCTGGAGTGAGAAACCAAGCGCGAATTCTGCAGTAAGCGCCTCAAAATA CTTGCTGATTTGAAGCTCTGCAT

SEQ 10

ATGG CCTTG ACTCTGAAGTCTTTAG CAACTCCTTTG CTTTTG G GTG CTTTCTTTATCCTT

GTATTGCAGGTTTGCTTCTTCATTCTTTGAACCTTTCTACAGTAGGAGTATTTCTAA TTAT

GGGTAGTCCTTGCCTATTGAATTGATGATTTTGTTTAGGTGGATTTAGCGTATTGGT TTA

ATAGCTTTTGATGATTTTACAATTTATATGGATTACCCTTTTCGTATTTCTTCAGAA AGAT

CAAAGATTATTGGGTATTTAGGACATCCTATGCTTTAGTTAAGCGTTGACTTGAAAT CAT

GAGTAGAAGTAAATTTTAATAATTTTCTCATTACCTAGTGAGAATGAGAAAAGAGAA GTA

TAGACATGTTCCTGCAGTTAAATATATGGTTTGATGCAAATTACTTATGTGTTAGTT AACT

TACATGTTTCTATATATAACATGAGAGGAAGTATTGCTTCACAAAGAAATACAGAAA GGG

CAGAAAATGGACGAAAAACAATGGACTAGTTACAGTTTCTGCTTTGCTCTATGATTC TTC

CTCTACGTTATAGTGCTTTAGACATATGATGCATTTGCTACAAATTTTTAATTAATT TTCC

AGGTTAATAGTATTATAATGTTGAAAATAAAAATTGTGGAAGTGTTTCATCTTGATT AAAG

TGAAACCTTTAGTTCTGCGTTTGTGACCCACTGTGTCATACTTTTGTTCCTGGAGTT TCA

ACTCTGTTTCCTAAAAGCTTTTAGCTTGTCTATTGATGAATAGAATTGATGTGTTCT TATG

GAAAGCAGGTTGTTGCAGAGAAGCCAATATCTGAAGCTAAAGTTGAGTCTGCAATCC TT

AAGGTATATTCTTCTGTTCATGATTAAAGAAGATGTTAGTGCCCTTGTTGCTTCAAA TTTA

AAACTTAAGAGCGCTTTTTGGATGATCGTGTAGGAATCTATCATCAAAGAGGTTAAT GAA

AATGCCAAAGCTGGATGGAAAGCTGCATTCAACCCTCAATTCTCGAATTTCACGGTC AG

ATCATCATATATCTTAGCGCTTTCTCTAACCATCAACAACAGTTGCTTATTTTGTTG CTAT

ACACTGAAAACATGCATTTATAATTATGTCCATCTTTGGTATTCACAGGTTTCACAA TTTA

AGCGCCTTCTTGGAGTTAAGCCCGCACGAGAAGGTGATTTGGAGGGAATTCCACTTC T

AACTCATCCTAAACTTTCGGAGCTACCAAAAGAGTTTGATGCACGAAAAGCTTGGCC TC

AATGTAGCACTATCGGAAGAATTCTGGGTCAGTTTCTTCTTGTTCTTGCTTACTACT ATT

GAATTATAATTCTATGATAGTGATCCTAATGCTTTCCGTCTTTGCACATCAATCACT GGG

ACAAATTTG CATCAC AG ATC AG G GACATTG CG GTTCTTGTTG G GCTTTTGGTG CTGTTG

AATCGTTGTCTGATCGTTTCTGTATCCATCACAACTTGGTAAATTCTGTTGAACTGT GTG

ACCACCTCATTTAAAAGCTTTAGATGACGCATTTTTATTTACTTATTTATATATCTT CAGC

ATACTCTCTCATGTGCGAGCCCTGATTCTTTCTCATGGGCCAAGCACGTGGAAACTA TC

TTATATTAGCACAAAATGCTTGTGAAGTTTTCACTATAGTTAATGTCACTAATGTTA ACTT

TTAATGTGTAATGCAGAATATCTCTCTGTCTGTAAATGATCTGCTAGCATGCTGTGG CTT

TTTATGTGGATCCGGTTGTGATGGTGGATATCCTATATCAGCATGGCGATACTTTAT CC

GTAGGGGTGTGGTCACAGAAGAGGTAAATGTTGTCTTATTTTCACCTCAAAAGAGAT TA

CAGCTTTCAGTAAAACCATTAGTTACCGTGGATCTTTATGATCAATCACTAATAAAG TTG

TTTTTATTCTTGCAGTGTGACCCTTACTTTGATAATGAGGGATGTTCGCACCCGGGT TGT

GAACCAGGATATCCCACCCCAAAGTGCCAGAGGAAGTGTGTGAAGGAGAACCTACTA T

GGGGGAAATCAAAGCATTATGGTGTCAATGCATACAGAATCCACCGTGATCCCTACA GT

ATCATGACAGAAATTTACAAAAATGGACCAGTTGAGGTCTCGTTTACAGTGTACGAG GT

AATGACGATAAGGAAGAATGTTAAGTTCTGATCCTAAAACTATTTGATACAGCTTTC CGT

ACATGACATTATCTGAGCTGGTAACCTTATATGTGGTTGCCTACCTATCCCAAAATG AGA TACATGTAATTATTTTTAGGTGACCTATAGTGTAACTGTTATGATAATTGAGAAACTTTA A

CTACCGATGTACCTTCCCAATTTATGTTTGCCCGAGATTTACTTGCAAACTAATATC TGT

AAATGAGATATTTAATGCTAACCACAAGACAATATCAGAAGTTACCTGTTGTCGTAA AAC

TGCATCATCTCTTTCTCGGTGCAAGTAGATTTGTTTAGATTTTGTTTGTTGTCTTTG ATCA

TAACTGTTATCATCTCTTTTTCTCAGCAATGCTTTCCTCTAACCAATGAGTCAATTT TTTT

TATTTTTTTTTTGTCAATCACAGGATTTTGCTCACTACAAGTCAGGAGTTTACAAGC ACG

TAACAG GTCAAAGTATG G GAG GC CATG CTGTTAAG CTTATCG G ATG G GG AACTAGTGA

ACAGGGAGAGGACTATTGGGTATGTAGATGTGTTCAAGTTCTGGTGTCCTGTTTTCT AT

TTAAAAGCATATCTTTTTGTCAAAATCTAATCACCTTATATATCATCTGCAGCTTAT CGCA

AATTCTTGGAACAGAGGCTGGGGTGATGTATGTCCTTAAATTCATCCCTATGTTTTC ATA

TATGAGCAAAAAGTCCTTAGACATAGGCATGCTAGCTTCTTGTTGTTGATGCACTAA CTG

G CAC ATCAATAAATG G ATTTCAACTTATATAAACTAACAACGTAAACAATTTTTG CACTAT

ATTTCAACTGGTAAAGTTATCTCTGTGTGACCTATTGGTCACGGGTTCGAGCCGTGG AA

GCAGCCACTAATGCTTGCATTTGGGTAGGCTGTCTACATCATACCCCTTGGGGCTAC G

GCCCTTCCCAGGACCCTGCGTGAACGCGGGATGCCTTGTGCACCAGACTGCCCCTTT T

ATATTTTAAC CAGTTAAG G CAAGTTATTTACTG CATTTTTTGAAGTTACTCATTTAG G ATT

ACTATAGAGAGTTACATGCCGTCGTATGTCATTTAACCTAATGATGCAAATAAATTG TAT

ACTATTTTAATGCACAGAAGTTAAAGTAGCTTCTTCTCTAAATGAATGTATATCTCC AATA

TGACAGGATGGTTACTTCAAGATCAGAAGAGGAACAAATGAGTGTGGCATTGAACAT AA

TGTGGTGGCTGGATTGCCTTCTGCAAAAAATCTGAATGTGGAACTTGATGATGTATC TG

ATGCTTTCCTTGATGCCTCAATG

SEQ 11

CTATACCATCATACCCATGTTGGAATGTGCCACTCTGACAACAAGTGGAGATGTTACCG

ATGTTCTTTTGTTCCTCCATGTCAAAGAACCAAATACATATCCCTGTGTTGGTGCAG CCA

CCTTGAAGTTCACTGTAAAATTCATCTTCTGGTAATATCTAGTGAAGGCTAATCTTC GGG

GAACCACAGTGACATTGACACCCGTAGGTGCATAGACAACTGCCTTGTAAATGCTTC TT

GCTTTTCCCACGTTAGTAACAGTTCGAGTTACTGAATATGTGCTTCTGAGGTTTGGT ATT

GTGATGGAGGGATAATTTAGTCCATTTGGTGATGCAAAGGTTTGATCACAGGTGCTA TT

GTCCCTTGTAATCAGATGCAGAGATTTCTCATCATAACCAATTGAACAAAGAAATGC TCT

GTAATCTGCTGGCTGTGCATCGTATATAAGACCAGGATCCAGGACATTCGTAGGGTT AA

CAAAGCCAGAACCAAAATCAAATGGAGTAGCTCTCTTCCCTTCAGGATCTACTATTA TG

GGTTTGTGATGCTTATCTGACAGTTTAGCTGCATTAATATTGATGAGATTAGTGCAT TGA

AGGCTTGAATGAAAGAGTTAGATTATGTAAAAGCTTTTATTCTACCTGTCGTCATGA TCG

C G G ATTTAATTG C AG AG G GAG AC C AAG ATG G ATG C AC AG CTTTTAAC AAG G C AAC AACT

CCTGTTATGTGGGGGCAAGCCATAGAAGTTCCGGATAGTACATTGAAGTTCAACTTA GT

AGAAGCTGCTGGAGACCATGCTGCCAGGATATTTAATCCAGGAGCTGCAATATCAGG C

TACAAAG G CAAATCAG CCAG GAAATTACTTG CAAG AAAAAG CCAAATCCTC AAATAAG G

TAAGAACCAAAGAAAATGCAAAGTAACAAAGAACTAATCCACAACCACATTGACGAT CC

AGGAAGTAAATATCAGGATGCAACATAAAACTTTGTTGGTCTACATGACAAAGGCAG AG

AGAGATCATTGTTGAAAACAAGTGGCAGTTGAAATTAAGTCCCTATATTACTATTTT TAG

CGCACAAATTACCTTCAAAATTTCTGGTGTTACAGAATTAGGACCTCTTGAAGAAAA TGC

TGCTACTCGAGGAGCAGGTTGAGCTCCCAAAACGGTTCTAGCAGAGAGAATCCTTGC C

ATGGGGAGGCTGTTAATATAATAAGATGTGAGATTGGGAAGAGATGATAGTTAACAA CA

TCAAAAAGTCAAGAAGAAGGAAGTGAACCGTGTATTGTTAATGTAAGCTAGGATCTT GT

TTCCAATCTTTTTCCCAACAGTTGCTGCAGGAATGACAAAAGGGATGGCCACACCCT TG

TCTGCGTCATCTATAAGGATCATCCCAACTCCACCGGCTTCTTTAACTATAATGCTT TTC

TCCATCTTTGACTCACTTGAGCTTCCAGCATGTAGGCACACAAGCACCTTCCCTTTG GC

CTTAGTTCTATTCAAAGAACTATCTAAGCAATAACTAGTGGAAAAGAGAGGGAAGGA AA AAAGTAAATTAAGATAATTGTCAATGCATACATATCTACATTTAACAAACAGCGAAAGTA

CCTTATGATTCACCTGGATTGATAGGGAGTGAAGTATCCAGCATAAGCTTCAGAAGC AG

GTATGATTCTTGTAGATGTATTCATTTGAGATAAGCTAAGACTTTCACCCTTCCAAA TATC

CACAACTTAATTAGAAAATAGAAATTGAATAATAATAACATGTAAACTTGTCGAAAA CTG

GGATTACCTTGAGCCGAACTCCATTTCCTAGTAAAATATCAGAAGTAAAATCTCTAT CAG

TTGAACTGGCTGCAACTGTGATCATCCAAGGAGCTAAATTTGTGGCTGAACCAGTGC TG

CCTTCATTTCCAACTGAAGCCACCACAAGTATTCCGCGGCTAACAGCATGATATGAC CC

CACAGAAATGGCATCATTGAAATAATCTCCTTGGGGAGCATCAGGGCCCAAAGATAG A

GAAATGACATGAACCCCATCTCTAATTGCATCATCAAATGCAGCCAATAAATCAACA TCA

TAGCAACCAGAACTCCAGCAGGTTTTATACACTGCTATCCTGGCCATTGGGGCACCA CC

TCTGGCTCCTCCATTTGCCAAACCTTTGTAATTCATATTAGCTACGTAACGCCCCGC TG

CTGTTGAAGCTGTGTGACTCCCATGACCAGAACTGTCCCTAGCAGACTTGTAAAACA TG

GTCTTCCCATTTTCTTCTTCAGCTTCATAGCCACTCATATAATATCTTGCCCCAATT ATTT

TCCTACATAAGACAAATCATATTGCACTTATCATCTAAATAACAAAAGAAGAGATGG TTG

CCAATCAAAAGAAAACCTGTTGCATATAGAGGCATTGAATGCTTCTCCTGATTGGCA TT

GTCCTTTCC ATC CAG CTG GCACTG GAGG CATGTTG GTATCACTAAAACTTG GAGACTCA

GGCCAAATTCCTGTTTCATTAAATATCTTAAGAGCTTAACCTCAAGTTCTAATTAGC TCG

AAAAACAAG G GAAACAGTG G AG CTG G GG AC CAG GTTGAG GATAAACTGATAAG GTTGT

GAACAGAGATAATATTTGCATTGAAACATGTCACTCTAATGTATAGTGGCTCTTCCC ATA

AAGTAACATTTACAAGTAGTTCAAG CACACTGTTAG GC ATAAATG CAAATGG CAAAATAT

G GG AAG AG GTGAG AAAG ATGAACTG G GAAGCTAAGAAATTG CATAAACTTG AGTTTTAA

AAAAATCTAAGCAAACATATTTCATCATTCGAAAATGATTGAAACAAGAATTGATTG ATGA

AAGGAACTACTTTCCTCAGGTTCAGCCATATGTACCCAAATGACAAACTTAGCACTT TTG

CAAAGTCATGTTATTGTACTCTTCTTAAGAAAACTAACAGAGACAAGAGCCCTTTTA AGT

GACAATACATTAAATGAAGGGACCAACACTAACTTGGTTGGTGCATTCCATCATTAA AC

GATCATATCTTTCACCTAACTCGGAAAAGATTGCTAGAATTTAAGATAATTAAAGCA AAA

GGAACAGAGAAACCACCTGTATCAATGAAACCAATGATTACATTAATTTGGTTCTTG GTA

GAAAAACCTGGAATTTCCATTGTTTCATCATCACTGAGCCCCATAAAATCCCATGAA TGA

GTTGTGTGTAGGCTCCTCTTAGTATTTGGAAACACGGATACCACTCCAGGCATTTCT GT

TTTATTTTAACATTAAAGACAAATTTCTCAGTACTTATTCATATCATTACCTTAAAG AAAAA

CATTTGGCATGGGCTTACTGGATATTTCAGAAGCCTGTGCCTCAGTCAACTTGGCTG CA

AAGCCTTTAAAACCATGCCTATAACTATATACATGTGAAGTCTTGGCTTGTTCAATG CTG

TAAAGTTACAACTTCAGTTTTTTTG CTAAAAG CAACCAGTGTAAAACC CAATGAACCAG C

TCAAAAAAGGGAAAAACCCTCATAGCTAAAGGTAACAAGAACTGACCTTCCTTTATG AAT

AGCAGTCAG CATTTGATG GTTTTG CCTCAAAATCTCATCTG G GTGTTCATCACTATCTTT

G CTTC C C ATGTAC AC C AC ATATAACTG G AAAAAAAAAAAAAC C AAG AAC AC AACTTTACT

AACTATTCATCATTTCAAAACATCATTCAAATATACACCCTCAATAACAGCACATAA AAAC

CCATATCAACATACAGACTACAGAGCCAAAGTTTATTTACCTTGGAAGAAAAGCAGA GG

CTAATATCTCCAAGAAAAACACAAAGAAAGAGTAAAAGAAGAGTCTTTTTTAGAACA CCC

AT

SEQ 12

ATGGGAGCAAAAGCATTTCTTGTTGCTATGTTTCTCTCAGCACTGTTATTTCCTTTTGCC

TCCTCATCCAATGATGGCTTGATGAGAATTGGCTTGAAAAAAATGAAATTTGATCAA AAT

AATCGGCTTGCTGCACGCATTGAGTCAAAGGAAGGGGATGTTTTGAGGGGGTCGATT A

G GAAGTATAACTTCCGTG GTAAACTG G GG G ACTTTG AGG ATACAGACATTGTAG CATTG

AAG AACTATATG GATG CTCAATACTTTG GG G AG ATTG GTGTAG GC ACTCCACCTCAG AA

GTTCACTGTAATCTTTGACACAGGTAGCTCGAATTTGTGGGTGCCGTCGTCGAAGTG CT

ATTTCTCTGTAAGCTTCTATACATGCAAATGATACAAAGGATAGCATTGAACATCCA TCT TGAGTGATGTAAAATTTGATGACTGCCTATCTTGGTGTAGGTTCCCTGTTTCTTTCATTC

CAAGTACAAATCAAGTGAATCAAGTACTTATAAGAAGAATGGTATGTTATGTTTCCA TTTT

TGTATATTGCTTCTCTCTACCATCTGGTTGTTTATTGCTGCATGAAACATATATATG CTTC

CTTCTAGTGCCGGTTATTGTTTAGAATATGTGCCATCTTGTTCATTTTAGAACACTT TTTG

TATTGTCCTTATGTGTTTCTCACGGTGCATCAAGGGATTACATTGGAAAAGTTAAAT GAT

GAGTACATGTAGTTGACTGTTGAGACATAAAAAGAGGCTGTTTATGTTATGTTTCTT AGT

ATATTACTGTAACTAGTGAGGTTCCAGAAAAGAAACACCAATATCCTTATCTCTCTC TGC

GATTAGTGCTTTTTGGTTGCGAGTTGTATAGTTTTAACCTCTGCAATGCCTCTTTAG GTG

GCCTTTCTTCTTCCCTATTAACTAGGTTTTGTATCCATGTCTTGGCATGTCTGTGAC ATA

AGAAATTTTCCGCAGAAAACTAGTTATTCTTGATTTTTTTGTTCTATTACAATTATA TATGT

TGCTCAAAGAAAATGAATTGAACAGTCTGTAGCAATTGAATGGGTTTGCCAGATGAG TT

CACTCCAGTGTTATTGAAGGCCATTGCTTGGTCGTTTAAAGTTATGAAGCGATACAG CA

AGGGCTATCGCTTCATCGCTGAAGCCATGCACTTTAGAGAAGTTTGCACTTCAAATA AT

GGTGCACAAAGTGATTCCAACGAGAATTTATTGTTTCATTGAACATCAACTTTTAGA AGT

TGGATTAATGTTGGCATTTTCACAAAGTGATTCCTGTTGGGTATAAAAATAATATTC ACG

GTATTAGTGATAAACGCGGAACACTAAGTTATGCTTAAATCAGTAAGAATAAAAATG CAG

C AAAAATG AC AC C AAG ATTTTAC CTAG AAAC C CTTCTG AATAAG G G AAAAAC C AC G G C C

AAG AAG AG CAACTG ATATCACTATAG CG AG G ATTTTACACTGTGTAGTAACG AGTAC GA

ATACTCCTAAGACCACTACACCCTCAAAAGAAATAAACACTCTTTTGCTTTTTCACC TCA

CTACAATATCTCTCACACTCTATTTTCTTTACAAACTATTTTCTTATAGTTTATGGA ATACC

TTGCTCTCTCTTTTTTCTCTCTTTGTTGGTGTGTAGAAATGAGAGTTAAAGCTCTCC TTTT

ATAGCCAAAACCTCACTCTCTAAAGCCTACAATATTTGACAATTTGCACTCCCTTTT CAC

AATTTCAACAAGGTTGGCTACCAAACCAAACCAAATCAATAAAATTGTCTACCAAAT CAT

ACCAATTCAACAAGGTTGGCTACCAAACCAAACCAAGTCAACAAAATTGGCTACCAA AT

CATTTGAATGAGATGGAAACCATATCAATCTCCCCCTCCAGTCTCATTCATCTAGAG GA

GGTAGCACTGTCTTCTAGTCTGAGTGCATGCCGACAAGTTCTTTGCATAGCTCGAAC TT

GTCTCTTGGTACCACCTTGGTCAACATATCAGCAGGATTTTCACTTGTGTGAATCTT TTT

GACCTGAAGTGATTCGTTCTCCACTTGCTCACGAATCCAATGATATCTGACGTCGAT GT

GTTTTGTTCTCGCATGGTACATGGAGTTTTTGCTTAGGTCTATTGCACTCTGACTGT CGC

AATAGACAACATACTCCATCTGTTGCAATCCAAGTTCTTGAAGAAATCTCTTGAGCC ATA

TCATCTCTTTGCCAGCTTCAGTAGCCGCAATATACTCTGCTTCAGTTGTAGATAGTG CG

CACTTCTGCAACTTCGACTGCTATGATATAGCTCCCCTTGAAAAAGTAAACAAATAT CCG

GTAGTGGATTTTCTGTTATCAAGGTCACCTGCCATATCAAAATCTGTATAACCTTTC AAA

ATTGGATTTGATCCTCCAAAACATAAGCATTCACGAGAGCTTCCTCTTAGATACCTG AGT

ATCCACTTTACAACTTCCCAATGCTCTTTTCCTGGATTTTCGAGAAATTTGCTAATA ACAC

CGACTACATGAGCAATATCTGGTCGATTGCATACCATTGCATACATCAAACTTCCGA CA

G CAG AAG AATAAG G AATCTTG GC CATTCTCTCTTTTTCCTCCGTTGTTGTAG GACACATC

TTCTTACTCAACTTCAAATG AC CAG G AAGG G GTGTG CG AACCAACTTAG CACTTTTC AT

ATTGAAGCGCTCCACTACACGTTCTATGTACTTCTCCGGTGACAAGTAAAGCTTTCT TTC

GTTTCTCGAACGAGTAATTCTCATGCCCAAAATCTGCTTAGCATGACCCAAGTCTTT TAT

TGCAAAAGACTTATTCAACTGTTTCTTCAACTCGTCAATCTTGGATGCATTCCTGCC CAC

AATCAACATACCATCCACATATAGCAAGAGGATGATAAAATCATCATCAGAAAATCT TTG

TACAAATACACAGTAATCTGAAGAAGTCTTCTTGTAGCCTTGCTCCCCCATAACAGA CTC

AAACTTCTTGTACCACTGTCTGGGAGCTTGCTTCAATCCATATAGACTCTTCTTAAG TTT

GCATACAAGATTTTCTTTACCTTTTGCATTGAAGCCTTCAGGTTGTTCCATATAAAT CTCC

TCTTCTAAGTCACCGTGAAGAAAAACAGTCTTCACATCCATCTGCTCAATCTCCAAA TCA

AGATTGGCAGTTAAACCAAGAACTGTCGGAATGGAGGACATTTTCACGATAGGAGAA AA

TATTTCGTCAAAGTCAATACCTTTCCTTTGACCAAATCCCTTGACAACCAATCTAGC TTT

GTATCTGGGCTTCAAACTATGTTCTTCAGCTTTAACTTTGAACACCCACTTGTTCTT CAA

AGCTCTCATGCCCTTAGGCAATTTCACCAACTCATAAGTATGGTTCTCATGCAGAGA TTT

CATCTCATCTTGCATGGCTTCAATCCATTGATCCTTGTGCTCATCTTCTATGGCCTC CGC ATAACATTCAGGTTCTCCCCCATCAGTGAGTAATACATATTCATTGGGTGAATAACGGG

AGGAAGGAGTACGAGGTCTAGAAGACCTCCTGAGTGGAATATCTAACTCGTCCACAG C

TTCGTGAGTAGGAGCATCTACCTCATCAACATTAGCATTGTTATCACCATCACCATC AAT

ATGCTGATCCAGAATATGGTTCTGGGCATCACCATCATCATTGAGCCCACCAACGTC AT

CCACATTTGTATGAGGAACTTGATCAAGATTAACTAAACCTTCAGAACTTGAAGATT TTA

GTTTCTCCGCTTTGTTAATATCTTCAATGGTTTGATCCTCCACGAAGATAACATCAC GGC

TTCTCACGACCTTCTTCTCAATTGGATCATATAACTTGTAACCAAACTCATCAAGGT CAT

AACCAATG AAG ATG CATTGC CTTGTCTTG G CAGTTAATTTTG AC CTCTCATCTTTAG GCA

CATGTACAAAAGCTTTGCAACCAAACACTTTCAAGTGGTCATAGGAAATATCCTTGC CAT

ACCAAACTTTGTTTG G AACATCACTTTG CAAAG CAACCACAG G G GAAAG ATTAATAACAT

GTGCGGCGGTCAACAAAGCCTCACCCTAAGAGGAATTCGGCAACTTTGCTTCAGAAA A

CAAACATCTGACTCTTTCCATCAAGGTCCTATTCATCTTTTCTGCTAAACTATTAAG CTGA

GGAGTCTTAGGAGGAGTCTTCTGGTGTCTGATACCCTGTTGTTTGCAGTATTCGTCA AA

CAGTCCACAATATTCACCACCGTTATCAGTACGAATACACTTCAGCTTCTTTCCAGT TTC

TCTTTTAGCTGAAGCCTAGAACTGCTTAAAGACACCCAACACTTGGTCTTTAGTCTT CAA

GATGTAGACCCAAAGTTTCCTTGAGCAATCATCAATAAAGGTAGCAAAATAAAGTGC AC

CACCCAAAGTC CTTGTCTTCATTG G ACCACATACGTCTG AATG CAC CAACTCAAG CAAC

TCGTCTTTCTTGAAAGAAGATGAGACTGGAAAGAAACTCTTTTTTGTTTTCCAGCCA AGA

AGTG CTCACATTTTTCTAATTTTGCACTTTCAAAATTTG ACAACAATTTCTTCTTG G CTAG

AACATTTAGTCCTTTCTCGCTAATGTGGCTAAACCTCTTATGCCATAACGTTGAAGA GTT

ATGGCTCTCAACGGCATTCACCATATCAACACAGGTAGAGGTCGTAGTCCAATATAG AC

CACGACGCTTTTCCCCACGAGCCATAATCATGGAGCCCTTAGTGAGCTTCCACTTTC TA

GCACCATTGGTACTGACATATCCCTCATCATCCAAAACACCAACAGAGATCAAGTGC AA

ACGAACATCAGGTGCGTGCTTTACATTGTTTAAAACTAGTTTAGTTCCAATACTAGT TTC

CAAACAAATCATTCCAACACCAGTCACCCTAGATAAGTTCCAAAGTCACCCTGAGTA TA

GGATGAGAAAATCCTTCCTTGATGTCACATGAGATGCGGCACCACTATCCACAACCC AG

CTTGACTCATCACAAGCAATATTTATCAAATCCGCATCAAGGACAATAACAAGATCT TCT

GTAGTGACGGTGGCCATACGATTGCCATCTTCTTTCTGTTCTTCCTTGTCTCTATTC TCC

TTTTTCAAAATCCGCAGAACTTCTTTGTGTGCCCTTTCTTCCCGCAATGATAACACT TAA

TATCTTTAAGTCTGCTTCTGGATTTGCTTCTATTATGTTCTCTATTTTGAGAACCCC GATT

CTTGCTTCTCCCCCTAGAGTCAGTCACCAAGACATCTGATGGGGAGGAACCTTGAGA TT

TTCTTCTCATCTCTTCATTTAAAAGACTGCTTTTGGCAAGATCCATAGAGATCACAC CAT

CCGGAGCAGAATTTGATAATGAAGTTCTAAGAATTTCCCAAGAACTTGGTAGGGAAC CA

AGTAG AAACAG GC CTTGAATTTCTTCATCAAATTTAATGCTC ATAG CAG ATAACTG GTTC

ATGATCCCCTGAAAATTATTCAGATGATCTGTCATCGCAGAACCATCATGGTATTTT AAA

CCCAACATCTGCTTTATCAGAAACATCTTGTTGTTTCCAGTTTTCCGAGCATACAAA CTT

TCAAGGTGCTCCCATAGGGTCCGAGCATGTGTCTCCCCAGAAATATGGTTCAAAACA TT

ATCGTCAACTCACTGTCTAATAAAGCCGCAAACCTGCCTGTGTAACAGATTCCACTC TT

CATCTGATTTATTATCAGGCTTTACAGTGGCGAAGACAGGTTGATGAAAATTCTTGA CAT

AGAGCAAATCTTCCATTTTGCCCTTCCAAATGGCATAATTTGTGCCATTCAAAGTAA CCA

TTCTACTAGTGTTGGCTTTCATCGTTTATCACAAATACAAATACTATTTATTATGAG ACCA

AAGTAATTCTTTTCTGATGTGGAAGTTCAGACTGTGCTGCAACCACAGAGCATACTC AN

N N N NTATTTATTATGAGACCAAAGTAATTCTTTTCTGATGTGGAAGTTCAGACTGTGCTG

CAACCACAAAG CATACTCAAACAG AACCTTG G CTCTG ATACCACTTGTTG G GAATAAAC

CCCGTAAAAATAATATTCACGATATTAGTGATAAACGCGGAACACTAAGTTATGGTT AAA

TCAGTAAGAATAAAAATGCAGCAAAAATGACACCAAGATTTTACGTGGAAACCCTTC TGA

AT AAG G G AAAAACTAC G G C C AAG AAG AG C AACTG ATATC ACTATAG C AAG G ATTTTAC A

CTGTGTAGTAACGAGTACGAATACTCCTAAGACCACTACACTCTCAAAAGAAATAAA CA

CTCTTTTGCTTTTTCACCTCACTACAATATCTCTCACACTCTATTTTTCTTCACAAA CTATT

TTCTTATAGTTTATGGAATATCTTGCTCTTTTTTCTCTCTTTGTTGGTGTGTAGAAA TGAG

AGTTAAAGCTCTCCTTTTATAGCCAAAACCTCACTCTCTAAAGCCTACAATATTTGA CAA TTTG CAC ATCCTTTTCACAATTTCAACAAG GTTG G CTAC CAAACC AAACC AAGTC AATAA

AATTGGCTACCAAATCATACCAATTAGTAATGGTGCACAAAGTGATTCCAACGAGAA TTT

ATTGTTTCATTGAACCTCAACTTTTAGAAGTTGGATTAATGTTGGCATTTTCACAAA GTGA

TTCCAACTAGAATTGGTTGCTTCTTTGAACCTCAATTTTGAGGAGTTGCATTAATAC GGG

GATTATTGTATATTGGATGCTGAAATTAGTTATTTCAACTGCAATTTGATTTTTATT GTAG

AGTAAATTAATAAATGTTTGATATTTTCTTGTTTTCTGTTAATTGTGCGCCTCACTT CTCG

CTATTCACTGCAAGTCTGTGGACCTTGTTTTATTTTGTTGCACGTTTTGGTTTTAAG AAC

ACTAGGTCACTCCTACCTAGGGGTGTCAATGGATATTAGAAAACCGACTTAACCGAC CG

AACCGTACCGTACCGAACCGATTTTTAGGTTTCTTTTAAAGAAACCGTAGGTTTTTA TAT

AAATCTATAATCGTACCGATAATTAGGGTAGGTTTTTTATTTTATAAAAATAAACCG AAAA

AATACCGAACCGTACCGAATAAGTTTTACATATGAAAAATATATTCATATAGTAAGT TTAA

AACTAGTAAAGTATTAAATTTTTCATTGGGTCTTGGAATTATGAAAACTGTTACAAG CCAA

TAAGTAATTAAACTCAAAATACTAATTCCTAAAACN N NNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNN CTTTTATATAATTTAGATTTATCTTTTTAAATATTTAATATAGACTTTA

TTCTTGAGTCCCAACTTGGTTAATATCTTTCCACTCGTGTGATTTATATCTTCTTTG CTTT

TACTTGGTTTCTTTTCGTTGGTGTCGAATAGTTGTGTATTTATACTCTAGCCATCTT TCAT

GTTTTTTAATTCATTATCCTTTAAACAGTAAAAATGTCTAGAGAGTTTCGCTAAGTC CTAT

AAAAGAACGTACGTTATTGCATTCTATTTTTACTGGTGAATTTTATATGACATTTAA AAAA

TACCGAAAATTAACCGAACCGTACCGATACCGAAGTGAAACCGACATGATTGGGACG G

TTTCGAAAAGTCTAGTTTTGGTTATACATAATAAAATAACCGAAAAATTGGTATGGT ATAA

ATTTTATAAAATAAG C C G AAC C G AAC C ATTG AC AC C C CTACTC CTAC CTTAATG AAC AAC

ATTTATAACTTATCGAATGTTTCCACAAGACAGAGATCGTATTTGGCATAGTAATTT TAAT

GTGATTTCATTCGTGATAAATTTTTGATTTTCCATATAGGGAAGTCTGCTGCAATTC AGT

ATGGTAGTGGAGCTATTTCTGGATTCTTCAGTCAAGATAACGTCAAAGTTGGTGACC TT

GTTGTAACAGATCAGGTGAGTGAGGCTCCTCACTTGTTTTAAGTGTCTTGAAGGTAG AA

ATCTTACATCTCAAGGTGTTCATGTACTAGGAATTTATTGAGGCAACCAGAGAACCC AG

CGTGACATTTTTGGTAGCCAAGTTTGACGGTATATTGGGTCTTGGATTCCAGGAGAT TT

CAGTTG G AAATG CTGTTCCAGTGTG GTACGTG GACAGCATTTAGTTTG CTCTCTTTCTTT

CCCACAAACCAAATTAAAGATCTAGACAATTCTTTTTTCCAGATGAGGTTTAACACA TTG

AAGGATTAGATTTCCATAAATGCAGGATAGGCTGGATGTCTCTTTCTGTTTAAATTT GAT

TTGGCATTCATCTGGGCAGCATCTCCTGTCTTTTCCAGGTACAACATGGTCAAACAG GG

TCTTATCAAGGATCCTGTCTTCTCGTTTTGGCTCAACCGAAATACAGAGGAAGAACA AG

GCGGAGAAATCGTATTTGGTGGGGTTGATCCTAATCACTATAAGGGAGAAATAACTT AT

GTTCCAGTCACACAGAAAGGTTATTGGCAGGTTTGTCATTCCGTCAATTCGTTACTA TGT

ATGTTCATGTTTTGTACAAATGCTATCTTAATCTTAGTAATATGATTGGCAGTTTGA TATG

GGTGATGTTCTTATCGATGGTAAAGCTACTGGTATGTTTTGCTCTGTACCTTTTGAA TTG

GATTGCTGAATTTTGCGAATATAGATGAGGGCTATGTGCCTGGATATAGTCTTCCTT TGA

GTTTTTAACATGAACATATCGGGTGTAGGTTACTGTGAAAGTGGGTGCTCTGCAATA GC

GGATTCAGGGACTTCTCTCTTGGCTGGTCCAACGGTATTATCTATAACCTTGATTTT GGA

CATATCGCTTTTTTGTTATGTTTTTGTTATTTGTTTCCACCATCAAAGTCAACCGCA ATTC

GATTATTGATTTCCTTGTTCCTTTGAACCTTCAACAGGTTGCTCAGCAAGAAAAGTA ACA

TTTTCCTAAGATTCATGAGTTAAATAGAACGAATCCAGTGTCTGATCTGTGTTGTGT TTT

AATTTTTCATCACAAAGAGGATGTTAGTGAAATTGGTTTTTCATTTCTGAGGTTACC TTTG

GATATACCATCTTAGCTTATCACTGGGTCGATGTCTGTAAATCTGCTTTCTCTTTCC TGT

CTTTCAGCTTAATATGTGTAAACCTGCCTCAGTAGGTACTTGTTGTTTTGTGTTACT TGC

TTTTATTACG CTTCAATGAG GATG ATG CTTTTACTTCTG CTTTAGTCCTAGTGTTCTGTC

GACTTATTTATTTACCCCTCTTCTATTTGAAAGAAAATTATCCAACCCGAAAACTGT AGC

CAGGTTTTATCCTGTTG ACTTG G AAATG CTGACAATTAAAT GAAATAAAAATCTG CTG CC

TTTTGTCTCTACAAATTCAGGGGCGAACGTTTACCCAAACGGTGTCATCCGACACCG CT TGGTCGAAATTTTTTACTGTATAGACATATATATGTGGGAAAAAACAGTACGCAATATAA

ATTATAAATGACCATTTATGCGTGTAGCTGTCTCTGGTATAATGGTCAAGTGCTGTT TTT

TCCCTACTTGACTTGATGTCGTGGGATTGACCTCAACGGATGGCATTTTTTTATTTC AAA

TTTTTTAGCATGGTCCTTTTAAAAACTAGAGTTTATATAGAGTTGAACCTTCAATAT CATC

TTAAAAAAACAACTAAATCTGGGGGACATGACGGCTTCCACGTTATAAGTGATAAAA AAT

TTATTTGAATG CATAAAGG AAG CTGTTTAAG CATACATAATATAATTTAG AATAATAATTT

TTAAAAAATTATTCGACACCGCTTACTAAAAGTTCTGCGTACGCCCTTGTCTACAAT TGT

ACGATTAAAACACTTCCAAGGTTGAGCAATCTTAAGCCCACCCTTTTCCGTACAGGC CA

TAATCACTATG ATTAATCAAG CCATTG G AG CCTCTG G AGTTG CTAG CCAACAATG CAAA

TCTGTAGTGGAGCAGTACGGGCAGACGATCATGGATTTGCTGTTAGCAGAGGTGAGC A

TTCAACTGTGTAACAGCATTTTCATTTGTTTGATATGCAGTTTCCTGTGCAGTTACA GCA

AAATGACACTTGGCAATCTAGTTGCCTTTCCCGGTTATCTAATTGGTCCGCATTTAA ACA

GGCACATCCAAAGAAGATTTGTTCACAGGTTGGAGTATGCACCTTTGATGGAAACCG CG

GAGTTAGGTTAGTCTTCAGGCTCTCTCTGCCCCTTCACGTGAACATATTGTGCATTT TGT

TAATCCATAATGTTGATATAAATACTGAATAATTCTGTGGCCTTTTTTCTGCAGTAT GGGA

ATTGACAGTGTTGTAGACGAGAAAGCTGGCAGATCCACAGGACTGCAGGATGGTATG T

GCTCTGCTTGTGAAATGGCGGTCATATGGATGGCGAATCAACTGAGACAAAACCAGA C

TCAAGATCGCATATTAAACTATGTGAATGAGGTAAAACATCTGTCACTGCAATTTTC TCC

TTTTCTTTGAAAAGAATGCTGACTGACTGACAGCTTTGCGAGCGTCTCCCAAGCCCA TT

G GG G GAATCAG CTGTTGACTGTG GAAAG CTTTCTTCAATG CCTAAAGTCTCTTTCAC AA

TTGGTGGCAAAGTGTTTGATCTCTCCCCAAATGAGGTATGCTTTATAATGGTGTTTG CG

AGTAATAAACTTTGCTATGCCTCTTGTAATTCAACATTGTCAGAGAGAAAAATTAAG TGG

CTCATCATAGAGTCTTTTTGTGGCATTACAACTAATTGTCATATTACAAAACCGATT CTAG

TGTTTAAGGTTTTCAGTTTCTACAGAGAAGTAGATGTTTGTTGTCATCAAGAAGTAC TTC

TATATTTTGAGGTTGTGAGTATTTCGATTTTACATCAATTTAGATGTCTCATTTTCC AACT

AAGTAATCTAAGAGAACCTCCGTAAGATCTTTGTGCCGTGAAAAAGTTACTCTTTGC TAT

TTGTATGGAAGTGTATTACATGCTGACCTTGCTAATGCAAGCCGCTTGAGAAGGCCC AA

TGAATCGAGATTTAGATGTCGTCCCCCCCACCCCCCGGGCCCCAAAAATAATAATAA TT

TATATCTGGATGATTCAGCATGGAACAATTCTCTACCTTCATAGACAGGGGTAAGGT CT

GCGTACACACTACCCTCCCACACCCCACTTGTGGATTCCACTGGGTTGTTGTTGTTG TT

GTTGATTCAGCATAGAACAAATCTTACAAAGAAGAAAATTCATCAAGTGTACACCAA ATC

AATTTGATCATTCACGTTTTTTCTAAATTTGTTTGTTTTGTGGTTTAGTAAACATAA AGAG

ATTGAGCTTTGAATAATTCAGGATAGATCACATCTTATAAAGGAAAAAATTCAAGCG AGA

CACCAAATCACTTGACCATACGCAAAACTTTTCTTTACCAGACCATCACTCTAGTCT TTT

CCTGCCCTTTCAACAAATTTTATTCAAGAATGAGAGTAAAGTTGTGATCCGATATGT GGT

TCTGTTTTACTTGTTTCAGTTGTTTGTTTAACTTGCTGACCTAGATTGATCTATATT CTAA

TTTTGGATTCTTTTCCCGTGATCAGTACATACTCAAGGTGGGCGAGGGTGCTAAGGC AC

AATGTATTAGTGGTTTCACTGGCTTGGACATTCCTCCTCCCCGCGGACCTCTCTGGT AA

AGTTATCTTTATATGTTCTCTCTATTTTAGTTTATAATTATGTTTTTTTGAGATAAC TATCA

GCAATGTAGTTAATCCAACAATAAGCTTAATCCGAATTATAGAGCTACGGCACAATT AAA

CCCGAGCCAGCAATTCTTTTGTTCACTGAGTTCTCTATGGTATTCCTTTGGTTACAG GAT

CTTGGGTGATATTTTCATGGGTCGATATCACACAGTTTTCGATTATGGCAAACTCAG AGT

TGGATTTGCTGAAGCAGCT

SEQ 13

ATGACTTTTTTCAGGTCGTTCTTATTCTTTCTTCTCACCTTATTTGTTATTTCATCTGCA CT CGACATGTCCATCATTAGTTACGACGAACAGCACGGCCAGATGGGGACAACACATCAT CGTACTG ACGATG AAGTCAG AG AATTGTAC GAATC GTG G CTTGTTAAG CACG GAAAG AA TTACAATGCCATCGGAGAGAAAGAGAGAAGATTTGAGATTTTTAACGATAATTTAAGATT CATCGACGAGCACAACGCTGAGAACCGCTCATATAAACTTGGGTTGAATCGATTCTCTG

ATCTTACCAACGAGGAATACCGTGCCATGTTCGTAGGTGGACGGTTGGATAGAAAGA C

GAGGTTGATGAAGAGCCCTAAAAGTAACCGTTACGCTTTTCAGGCCGGCGAAAAGTT G

CCGGAATCCGTTGATTGGAGAGAGAAAGGCGCCGTTGCCCCTGTTAAAGATCAAGGC C

AATG CG GTG AGTTTTTTTCTTCTTCAAAACTTTCCTACTATAAAG GAAAG CTCTG CTCTTT

ATCGTAAACATGTACTTTTGTTTTGTCTGCTTACGGAGTGAGACCAAGAGGAAGAGT TT

GGATAGATTGTTGAAAGGAGTCATATGTAGGTCAAAAGTTTTTGATTTTTAGGTTGT TTT

TTGACCTATGTTGTCGTCTTATACGGTCAATGATCTGTTATTGGGTAACTAATGATT CTG

TTTTCATGTTTATTTCAGTCAACAAATTGGAGAATAAATTAATTGCTGCTCTGTCTG GTAG

TTAATCTTCATGATATACACCTAAAGCTTACATCCTGATTTAGTATTTGGTGTCTCC AATT

GGAATGTTTATTTGCTTTGCTAGTGTTTCCTCTCTCTCTCTCTAGGGTAAATATAAA AAG

ATCTAAAATTTAGAGGTACCTGGTGTATATCTTAATATATTCCATGTACAAACTTTA AAAA

ATTATTTAAGCTTCCCCTAATTTGTTTAATACGCTGATAAGGGGTAATCAAAAAGCA TAA

AGATTAGATTGAACGGACACAGTATATATTTTGCTTTTGCAAGTTGATCAGTTTCTT TCTC

CATTCTAAATCGGAATCGACCAGAATTTAAAGCGGTATAACTTAAGATTAAGCCATG AAG

ACATATTTGGCTATTCTAGGTGTTATAAATTTTAACCCAAGTGTCCTAGGGAATTGA TGT

TTAATCTTGCTTTGATTATGACGAAACCCATATCTCGATTGGTTAGATATCAGTATA TCTA

TGTTATGTATAG AATCCTC GTTTG AAATTTG AG ATTTTCTTATG AAG G GAGTTGTTG G GC

ATTCTCAACGGTTGGCGCTGTTGAAGGAATAAATAAAATTGTAACGGGTGAATTAAT TAG

TCTGTCAGAGCAAGAGCTTGTTGATTGTGATAGGAGTTATAACCAGGGATGTAATGG CG

GTCTCATGGATTACGCCTTTGATTTCATCAAAAATAACGGTGGCATTGACACTGAAG AT

GACTACCCTTACCATGCTCAAGATGGCACTTGTGATCCATACAGGGTAAGTAATTAA CC

ATACTATCAAGAAAACATCCAAATATTAATTATGTACTATTTCAGAATGTAAGTCTA TATA

GCAAGTAATTAATAGTATTTGCTGACAAAATTTGGTCATTCAGAAAAATGCCCGTGT TGT

CTCCATTGAAGGGTATGAAGATGTTCCAGAAAACGATGAGAAGTCGTTGATGAAGGC A

GTGGCAAATCAACCAGTTAGTGTTGCTATTGAAGGTGGTGGCAGAGCTTTCCAGCAC TA

CTCTTCGGTATGGTGGGCGGATCTTGACTAATATATCCTTCTGAATATATATGTTAT TTG

TGTCTGAACTCACTGGCCCTAAATTCTGGATTCGTTATTGCATTTTAGTATGCCTGT GTC

CCTAATCTGCAAACACGGCTGCATTGTGCCTTGTTTTACTACTTAAAGCTAGTATAC TCA

TTTACCCTTCCAATTTTTATCAAATCATGCAGGGTGTTTTCACTGGATATTGTGGAA CGC

AACTAG ACCATG GTGTAGTTGTAGTTG G CTATG G AACAG AAAATG G CG AAGATTACTGG

ATTGTGAGGAATTCATGGGGTGCTAACTGGGGAGAAAGTGGTTACATCAAGCTTCAG C

GCAATTTCGCTAATTCTACAACTGGAAAGTGTGGAATTGCAATGCAGGCATCTTATC CT

CTTAAGTCTGGCGCAAATCCTCCTAATCCTGGTCCATCTCCTCCTACTCCTGTAACA CC

ATCAACTGTTTGCGATGAGTACTATAGCTGCCCACAGGGCACTACTTGCTGCTGCAT TT

ATCAATATGGCGAATACTGTTTTGGCTGGGGATGCTGTCCTTATGAGTCTGCTACCT GT

TGTGATGATAACTACAGCTGCTGTCCCCATGATTATCCTGTATGTGATGTTGATGCT GG

CACTTGCCTTATGGTAAATATTTTTTCCCTCCCATTCTGCTTTTTTCTCCTTTATAA TAAT

GATCGTCAATTTCACTTATTACGTGTAATATTCTACCAGCACAGGATTAATTAGATA ACT

CTGTCTACCAAAACTTTGGCAGATATTTAAACCTTCGTCTTCACTCGTTTATTGACC GCT

AGACCCACGTACAGATTCAACCTTTTATAGGTTTAATCATCAATGCAAGACTACTTA TCA

CAATCTTTTTTCTTTTTATGTGACAGAGCAAGGACAATCCATTAAAAGTAAAAGCAT TGA

AGAGAGGTCCAGCTAGAGTAAACTGGTCAGGGATGAAATCTAACAGGAAAGTGAGTT A

CGTT

SEQ 14

TCATGAAGAAACAATGATCAAATAATAGCTAAAAAGGGAAAACAGAGCCATCATAAGTT

GGCAAATGTAGGAATTGAAATGTGCTGGTGCATGGTTTATTGCAGGTCTAGATGACG GA

ACAGATGGAAACGAAGTAGCGGGTTCATTTCCACTTCCATTTCCCTTGGTGGCCTCT GG CACCACACTGGAGGGCGACGGCGCTTCAGTAGAATTACGTTTGTTCACTGGCAGAGTT

GTCGATTTGTCGTTGGATTCTCTAGAATCATAACCTGACAAGAATGGTTTCTAGTTA AAA

TATGG ACAG GTGTG CACACTAAAAAG GTCATACTCATGAATGCAAACTCACAATCG G AT

GGTTTCCAACCCAAAACCATCTTCTCTCGATCAAAAACCACGCGATAGCCTGTCATA AA

ATTTTCTGCAAAAAGATCCATGTATTAGTTTTCTGTCATAAATTCCAAATACAGAGA CAAA

ATCGAAGTAAATCAACAAATAGCTCAGATTTTTGACTATGTAACCAGTTTTACTAGT TGG

TTATGGACTTGATTCTTACTAATTTACTTTGTCACATTCACATTTCCAATAATATAT AAGCT

AGAAGTATGTAAAAAACTTGATAGGAACCAAAACTTCTAAAAGTTGGTAATTGTGAG ATC

ATTAGCTGGCATGGTGCAAGTTATATTGCAAAATTCCATGGATAGAAACAAAATCTA CGA

GCAAGCAACTGATAGATACTTACGTCCAATGATGTTGACATCCCCACTTTTCACAAC AG

CTAAGCAAAATGCGCGAGAACCATCCTGATGATCATGACTGGGTATTAGAATATTCT AA

AAGAGAACTTCTGTAATAAAGAAGGAGCAGAAACCATCTTACCTGGAGCGAGAGCAT AA

TTATCGGATCGAAAAGAAAAAACTGGTTGCCGCCTTTCATTGTCAAATTTAAATCAG GAA

CTTCGAATGTAGTTTGATTTGCACTGCGTATGTAGCATGTTAATCCTGAGAGTTCAA AGG

ACATCAAGAAAGTAAAAGTGATGAAGATTATAAAAGATGGTTCACCTTAGCCCGTAG CA

GTATTCAAAAGGAATTTCGCCATCAGGTTGAATACGTAGCTGTTTTGCTTGAGAATC AAA

CTGAAAAAAGAAACAAAATATCTTCAGTTTCACAATACAAAGAACAAACTCCCAACT AAC

AAATCATACAGTCAGCCTGTCACTCACGTTCTCTGTAATGACTTTGTAAGCTGGGTC GTT

CAAGTATGTGAATGAGGTGCCAGAGTCAAAAATGGCTGTGAAATCAACATCAGTGAT CT

TGTTTCCCACTGTTATTCCTGTCAAGCTGATGTTATAGGTTGGGCTGCAATTCAAGG AA

G GATAAG AGTG AATACATATTTG GTCG AATTCTCCTGAATC AAGC CAG G AACAG AGG CA

AACGCTAAACCTAGAATATCAAATTGACTGCTTACTGTAGTTGATCAAGATTGAGTG GTG

TTTCTCCTTGGTCTGGACTCCCTTTATCTCCAAACACTATTCTTCCAATACCATCAG GGC

CAAAGC ACATG GAGAAAG AATTTG CAG CAAG AC CTTTACTTG CTAACATG CTCGG AACA

GATATACTTTCCAAGCCAAGTCCAAATAGACCATTAGGAGCAGCGCCACTTAAAAAT GC

ACCGGTTTGTCTTATCCCACACCTGAAATTGAACAAGTAGAAAATTAACATCATGGT GGA

TAAAGATTG CATC CAAATTACACTG CATTTCTCTCAAAC CATACCCACCCTAGAG CAATT

GGAGCCTCAACACTTTTTTGTTGAGCATTATCTGTCTCTAAGTGCAAGATGTCTTCC ACC

AGTACCCCTGATGATGAGGTATTATTGGAGAGATATGCAACTCCATAAGCACATGCG TT

TTGTGAAGATAAGCATCGCCTCCTTTGTCCACACAGAGTGCCGTTGCAAGGAACAAT CT

GACCCGTTGACGACGTATTAGGGCTGTAAATATTGAGATTTATTCGCTGTTTCAAAG CA

AACAAAG GATTATAAGTGTCAAATAATACATAAAG AAACATTAAACG GAAAAGG CAAAAG

AGCAAAGAAAATTTTGTTCCTGAAGGTGAAATCCTGTGGAGTATGAAGTAATTGAGC TA

GTAAGAATCTCAAAATTCTTCAATGACAAACAAGCCACGGAACAATATGGATCAGAT ATT

TCATTTCAGAGTCAGTATATTACATGCATCAAACTCACTTTGCAAATATTGATTATG TTGA

ATTTTGCGCTAGTTATGTATCTTCCTTAACAGAAAAACAATTTTCTCAATACATTCT CCAC

CCCATATCTTGTTAACTAAGAGAATATATTATTGTCATAATGACGGAAAAAGACAGT TGA

AGAGAAATCCACTGTGTTACAGATTCAGCCATTGTGTTATCAGCAGGTTCCTTTCTG AAA

AAGGTTATCAGACGAGGATGATCATCTTCCTTGTAGCAGAAGGCAACATACAGACGA G

GATGATCATCGAAAAAGAAAAACTCTTTAAACATTTGAAAGTAGAAAGAAAAAGGTA CTA

GAATAAAGCAAACATACTCGTCCAGAGCGTGTCTCGAGGGCGCGCACACAATTGCTG C

AATCACAGGGTAGCCAAAACAAGTCACTGCCAGTGTCAAGTGCCACCAGAAATGATA G

CCCAG GAGTG CCCACTGTCACATTTG CATAATG CAAACTG CAAATTG G CAAAG AGTATT

AGTCACACACCTTAAGAAGAAAAAATCACAACTACAGATACTACATATTTTGCATTC AAC

TCTATCTTTATAACATATAAATTACAACATGCTACTGCAAGGAATTTTCAGAACAAT TCCT

TGTCTAG AGAG GAGATAAGTG GCAG CAG AGG AACG G AAAATCAG AAAAAAAAAAATG G

AATTTATTTC C G GAG CAT G G AATTC G AACTAG AAG AAG ACTATAATTAAATTTAG AGTC A

GTACTTTTAATATAGGAGTGAAATCGCCAAAATTCCAGTCCGATGAAACACACAAAC AG

AATTAAAGAACAGAAACAGGCCTAAATCTTTCTTTTTTTTGTTTTATCATATTTTCC TCCA

CATGAATCTCGTAAGAACTATTAATGGTACATGGAATTTATTTAAGTTAGGTAACCT ATTT

TTCCTGAACTGACACATCCAACTAAACAGACAAAAACAAACGCAAAGCTCAGTCAAC TC TAACATCACACTAAACGGACAAAAGTAAAAGACAGTAAACAAGAATTTCCAAAAACGTAC

TAGTAGTTCACAATCAGGAATAACAACAAAAATATTTTAAAAAAAAAATAGAGCAGC AAA

TAAACAACTGCAAAAGCAATCAGAAAAGAAAAATAGAGTGAGCTTACAATCCCAAAG AA

CTGAGGCGGAAAGTTTCATTTCCTCCGGAAAAAGAGAGAGGAGTGGGATTAGTTGTA T

CAGCAAGGCGGCGACCTTTGATAAAGCGATCACGCTGAGTCCAAGCTGAATAATACT C

AACACTTCCCTTCTCAGGCAATCCATGAAGGTCCAAAATACCCTTCACCGGATCCGA AT

ACCGGTGATGGATATCAAACCCGAACGTCCCAAACCCATCGCTGCTCTGCAATTGCA AT

C C C AG AATC G C C AAG AAAATAATAG G G G C AAG G AAAAAATT AAAACTTGTATAAG AATT

AGCCAT

SEQ 15

ATGGTGACAAAGTTTAGTATTTTTATTTTGGTGGTGTTGTTGAGGTTATTTTCATTTGGT T

CTGTAG CCTCAAGG G AAATTCACAATTCTG GTCTTAATCTG AATTCTAGTG CTTCTG GTA

TTGAATTCCCTCAACATCCAAGTTTCAACTCAGTTACTGCTTCTGGAAATTCAGATT GCA

GTTATGGAACATCCAAGAAATCAACAACCACCCATGTAATAACTCAAGAAGAAAATA GAT

CTGATGAAAAAGAAGATGAAGATTTAATGGTATCTAAAAACCAGCCAAGAGAAGCAG TC

AAGTTTCACCTAAGGCACAGATCAGCTGGTCAAAATATAGAGGCCAAAGACTCAATA TT

TG AGTCCACAACAAG G GACTTAGGTAG AATTCAG ACATTG CATACAAG GATTGTAG AGA

AAAAGAATCAGAACTCTATTTCAAGGCAAACAAAAAATAGTGAAAAACCTACACAAT CTT

CTTCATTTG AATTCTCAGG CAAG CTCATGG CAACATTAGAGTCAG GTGTAAGTCATG GT

TCAGGGGAGTATTTCATGGATGTTTTTGTCGGTACACCTCCTAAGCACTTCTCTTTG ATT

CTTGATACTGGTAGTGATCTTAATTGGATTCAGTCTGTTCCTTGTTATGATTGTTTT GAAC

AAAATGGTCCTCATTATGATCCTAAGGATTCTATCTCTTTCAAAAATATAAGCTGCC ATG

ATCCTAGGTGTCACCTTGTTTCATCTCCTGACCCTCCACAGCCTTGCAAGTCTGAAA AC

CAGACTTGCCCTTATTACTATTGGTACGGAGACAGCTCGAACACGACTGGTGATTTC GC

GCTTGAGACGTTTACGGTTAATCTCACAACCCCTAGTGGGGATTCAGAGATCAAGAA GG

TGGAAAATGTGATGTTTGGTTGTGGACATTGGAATAGAGGCTTGTTTCATGGTGCTG CT

GGTTTGTTAGGACTTGGTAGAGGACCGCTTTCGTTTTCGTCTCAGCTTCAATCTTTA TAT

GGCCATTCTTTTTCGTATTGTTTGGTTAATAGGAACAGCAATTCTAGCGTAAGCAGC AAA

TTGATTTTTGGTGAAGATAAGGAACTCTTGAAACACGCGAATTTGAACTTCACTTCA CTG

GTTGGTGGGAAAGAAAATCATTTGGAAACATTCTACTATGTGCAGATAAAATCAGTT ATA

GCTGGAGGTGAAGTGCTGAATATACCTGAGGAGACATGGAATTTGTCTACAGAAGGT G

TTG GTG G AACAATCATTG ATTCAG GAACTACTTTG AGCTATTTTG CAGAAC CAG CATATG

AGATTATAAAACAGGCATTTGTTAACAAGGTGAAGCACTATCCTGTTTTAGAAGATT TTC

CAATTTTGAAACCATGTTACAATGTTTCTGGAGTGGAGAAACTTGAATTGCCTTCAT TTG

G GATAGTTTTTG GTGATG GAG CTATATG GAATTTTCCAGTAGAG AACTACTTCATC AAAC

TTGAACCAGAGGATATTGTTTGTTTGGCAATGTTAGGAACTCCTCATTCGGCCATGT CG

ATAATTGGCAACTACCAACAGCAGAATTTTCATATCTTATATGACACCAAAAGGTCA AGG

CTGGGATTTGCACCAACAAGATGTGCTGATGCC

SEQ 16

TCACATAGGAGCAAGATGACCTTCTTTAGACAATTTATCTTGCATCCACCTCTGAAGCAT

TTCCATTGCTGCCTTAGGCTGATCCATTGGAACCATGTGACCTGCATCATGGACCTT AA

GGAAAGTTAAAGGTCCATAGTTTTTTTGAACTCCTTTCTCTACACCATCTACTGCAA AAG

AAACTTGTGTGGCTTTTCCAAAGGCTTTTTGCCCTGTCCATTTCATTGCATGCACCC ATC

TCGAATTCCCTGTCCATATAACAAGAAATAGAGTTTTATAATATTATTGTTAGTTGG TAGC

TTTGAATTGACTTATAAACAATATGATAGAGTTCAACTTCTATATTTTGACAGACGT AAAA G ATAACTTGAAGTAACTTTTGTTAAAAGTAG AATTACTAG CATG AAAAAATAAG GTAG GT

TAAAATACACTAATATAGTATGAAAATCCTCTTTTGTGTATATAAGTTAAATCCATA TGAT

AATATAAAG CTTAC C AAG C C AATTG C AG ATAAG GTC ATATTC C C C AG C ATAC ACTAGTAG

CTTG ATACCATCCTCAAG G AGTG AAG GAATTCC CAATTCAAG ATTC CTCATC CAGTCCA

ACTGCATTGCCTGGTAAACTTCAGAGCTACATGAAACAAACTCAATATCCCCAACAC CA

AGAGCCTTTTTAACTTGTTGATCATTGAGGAAAGTTTCCATTTTGGAGAAATCATAG CAT

AGATCGCCCTCACATCTCTTCCGCACATCATAGTACTGCAACTCGGAAATTACAAAA TTC

ATATGACTTTAACTTTTGTATACTGACAGTGTAAAAAAAACTTTATTCTATCACGTC ACTT

AAAGGATTGTAACTATAGATACCCGTTCATTATAAGTGAGATCAGTAATTTGGAAAA TAA

GACAAGTAATATGTTGTACATCATTAGCTCAGAGAAAATGAGATTGGTCTTTCTTAC GTT

TTTGTCACCAGCAATGTCCATAATCTTGTTGAAGATGCTTGTACAAACAAGATATGC AGC

CATGCAAGCAGTTCCGCCATCTTTTCCTAATATTATTTGCATAGAAGAAAGCTAATG TAA

AGACTAG CTG CTG CTATATGAAGAAG G AAAAG GACATTG AGG ATAAACAAAATGAATTA

CCACAAAGCTTAATTGCTAGTTGACATTTTGGATATGACTTCTCTATGGCATTGTAA TCA

GATTTTTTGATCAATTTCATATCCAGAGCATAGTCAGTGTAGGCTTTGTATTGAATT TCTG

GATCAGTGAGTCCATTACCAATAGCAAATCCCTAAAAAAAATTGTACTTTGTTAAGT CAT

TGGCATGACGACAAATTCAAATTAAACCTAACTAAAGGTAATTACTGGATAAAGAAA AAG

GGATGATATATGGTAAGAGTTAGAAATACCTTGAGATTTACGTAGATTCCTTCTTTA TTTT

TGTTTCCTTGGTGAACCCGAGAAGCAAATGCAGGAATGTAATGCCCAGCATATGATT CT

CCAGTAATATAGAAATCATTTTTTGCATACTGTGGATGTGCCTTGAAGAAGGCCTAT CAT

CAAAAGAATTTGAAAAAGTTTGAATTAAATTTTATTAATTATATCAGTTAAACTTTA GAGAT

TTATCACGAGCTAAAAAAAAGGAATGAAAGAATAGGATCAACCTGCAAGAAGTCATA GA

GATCATTGCTTACGCCCCTTTCATCGTGACGAATATCATCATCGTTTGAACTATAAC TGA

AACCAGTTCCAGTTGGCTGATCGACGTATATAAGATTTGAGACCTGTAAAATTGCAA TTT

ATCATATGTTATCATTCTTCAACTAACAAAGGAAAGTTGCATGTTTGATTATAGGAT TTAA

CCGGTGTAAACGATTTTTACACTATTGTTATATTTTAACATGTTGTAACATGTTGTA TTCG

TCCCACTTAAATAAAGTGAAGAGAAGCGTAGTAGTCATTGATGTCAATAAACGTTGA ACT

ACTTTCGAATTTTTGAAATTCTACAAGTCACAGCTAATGAACAACAAGTGTTAAAGA AAA

AAATGCTAGTAGGTAAAAAGGTATTTTGCATGATGGAGAAAGGTTGAATAACAAATA AAA

ACATGGAGGGAATTCTTTTAGATTTTTACCATATTCAAAAGATCTAACTGACGTTTC TTGA

G AAATTAATTGG GTAAAATAAAAAGAATAAACTG AAAAAAAG AG AG G AAAAAACAAAAGA

AAAAG C AAAAG G AAG AAAAC AAG AAC CTTGTC C C AG C C G AAATC ATTC C AG AC AAG AG A

CATGTTATCTGCAATTTTGAATGGTCCATTTTCATAAAACACAGCCAATTCACTGCT ACA

TCCTGGCCCTCCAGTTAGCCATATAACTACTGGATCATTCTTCCTGCTCCTCGATTC AAA

G AAAAAGTAAAAC ATC CTG C C AAAAAC AG ATAATTTAG C ATTAATTAATAATAC C CAT AAA

TTCATTTTTTTACCAAAATGAAGCAAGAAGAACATTTAATCCAATTCAAACCTTGCA TCTT

TAGTATGTGGAAGACGATAATAACCAGCGTGATGACCCAAGTCTTGAACTGTAGACC CA

GAATTACCAACATAAGATAAATTCAATTTCTTTTCAAAAAGTCTCTGTTCAGTAACT GCTG

CAGAATCCCCTGTTGCTGCAGCCTTGTTGATATCATGCTTAGGGAATAAATTAAGCT GT

CTG ATTAG CTTTTCTG CCATTGTTAATG G GAATTTTG G AGTAGAAG ATAG G AAAAAATCA

TCATCATTAGAATTTAAAGTTGATGAGAAAGATAAGGAAATAGAAGCAAGAAGCAGA GT

AAGAAAGAGAAGAGAGAAAGATGAAGGCAT

SEQ 17

TCACACGACACTTTGGGGTGGGATATCCTGGTTCACAACCCGGGTGTGAAATCCCTCA

TTATCAAAGTAAGGGTCACACTGCCAGAAAAAACAACTTTATTAGTGGTTGATCAAA AAG

ATCCACAGTAGCTAATGGTTTTAGTGGAAGCTGTAACCTCTGTGAGGTGAAAATAAG TC

AGCATTTACCTCTTCTGTGACCACACCCCTACGGATAAAGTATCGCCATGCTGATAT AG

GATATCCACCATCACAACCACTCCCACGTAAAAAGCCACAGCATGCTAACAGATCAT TT ACGGACAGAGAGATATTCTGCAATACACATTAAAAGTTTAGCATCAGTGACCATAACTAC

AGAAATACTTCACAAACATTTTGTGCTAATTAAGATAAGATGGTTTCCATGTGCTTG GCC

CATGAAAAAG AATCAGG G CTCG CACGTG AG AGAG CATG CTG AAG ATATATAAATAACTA

AATAAAAATGTGTCATCTAAAGCTTTTAAATGAGGCGGTCACACAGTGCAACAGAAT TTA

CCAAGTTATGATGGATACAGAAACGATCAGACAGAGATTCAACAGCACCAAAAGCCC AA

CAAGAACCGCAATGTCCCTGATCTGTGATGCAAATTTGTCCCGGTGATTGATGTGCA AA

GATGGAAAGCATTAGGATCACTAAAATAGAATTATAATTCAGTAGTAGTAAGCAAGA ACA

AGAAGAAACTGACCTAGAATTCTTCCGATAGTGCTACATTGAGGCCAAGCTTTTCGT GC

ATCAAACTCTTTTGGTAGCTCCAAAAGCTTTGGATGAGTTAGAATCGGAATTCCCTC CAA

ATCACCTTCTCTTGCGGGCTTAACTCCAAGAAGGCGCTTAAATTGTGAAACCTGTGA AT

ACCAAAGATGG AGATAATTATAAAAG CATATTTTCATTGTATAG CTG CAAAATAAG CAAC

TGTTATTGATGGTTCGAGAAAGCGCTAAACTATATGATGATATGACGACAAGAGGGG GT

TGCTCCGATGGTCAGCATCCTCCACCTACGACCCCAGGATTGTGGGTTCGAGTCACC A

AAG GAG C AATAG CTC C AAC AAAG AG G AT C AC AG G G G AATC AAAAG G G GAG G G G AATTT

TAAAAAAATGATGATCTGACCGTGAAATTCGAGAATCGAGGGTTGAATGCAGCTTTC CA

TCCAGCTTTGGCATTTTCATTAACCTCTTTGATGATTGATTCCTGCATGATCATCCA AAA

AAG CTCTCTCAGTTTTCGAATTG AAG G ACAAG G GCTATAACATCTTG AATCATGAAAAGA

AGAATAATGTACCTGAAGGATTGCAGATTCAACTTTAGCTTCAGATATTGGCTTCTC TGC

AACAACCTGCTTTCCATAAGAACACATCAATTCTATTCATCAATAGACAAGCTAAAA GCT

TTTAG GAAACAGAGTTGCAATTC CAG G AACAAAAGTATG AC AGTACTGTG ACAAACAAA

GAACTAAAGGTTTCACTTTAATCAAGATGAAACGCTTCCAACATTTCTTATTTTCGA CATT

ATAATCCTCTTAACTTAGGAAAATTAATAAAAAATTTGTAGCAAATGCATCATATGT CTAA

AGCACTATAACATAGAGGAAGAACCATAGAACAAGCATAAATTGTAACTATTCCATT ATT

TGTCCTCCTTTTTCTCCCCTTTCTGTATTTCTTTGTGAAGCAATACTTCCTCTCATG TTAT

ATATAGAAACATGTAAGTTAGCTAACACATAAGTAATTTGCATCAAACCATATATTT AACT

TCAGAAACATGTCTATACTTCTGTTTTCTCATTCTCACTAGGTAATAAGAAAATCAT TAAA

ATTTATTTCTACTCATGATTTCAAGTCAACGCTTAACTAAAGCATAAAAAGTCCAAA ATAC

CCAACAATATTTGATCTTTCTGAAGAAATACAAAAAGGGTAATCCATGTAATCATCA AAA

CCTATATAAATTAAACCAATAATCTAAATCCATCTAAACAAAGAAATACTCTTACTG TAGA

AAG GTTC AAC G AATG AAG AAAC AAAC CTG C AATATAAG G ATAC AAAAAG C AC C C AAAAA

CAAAGGAGCTGCTAAAGACTTCAGAGTCAAGGTCAT

SEQ 18

ATGTTCCGACTAGTAATGGTGACAAAGTTTAGTATTTTTATTTTGGTGGTGTTGTTGAGG

TTATTTTCATTTGGTTTTGTAGCCTCAAGAGAAATTCACAATTTTGGTATTAATCTG AATT

TTAGTGCTTCTGGTATTGAATTCCCTCAACATCCAAGCTTCAACTCTGTTACTGCTT CTG

GAAATTCAGATTGCAGTTATGGAACATCCAAGAAATCAACAACCACCCATGTAATAA CTC

AAGAAGAAAATAATTCTGATGAAAAAGAAGATGAAGATTTAATGGTATCTGAAAACC AGC

CAAGAGAAGCAGTCAAGTTTCACTTAAGGCACAGATCAGCTGGTCAAAATATAGAGG CC

AAAGACTCAATATTTG AGTCCACAACAAG G GACTTG G GTAGAATTCAG ACATTG CATAC

AAGGATTGTAGAGAAAAAGAATCAGAACTTTATTTCAAGGCAAACAAAAAATAGTGA AAA

AACTACACAATCTTCTTCATTTGAATTCTCAGGTAAGCTCATGGCAACATTAGAGTC AGG

TGTGAGTCATGGTTCAGGGGAGTATTTCATGGATGTTTTTGTTGGTACACCTCCTAA AC

ACTTCTCTTTGATTCTTGATACTGGTAGTGATCTTAATTGGATTCAATCTGTTCCTT GTTA

TGATTGTTTTGAACAAAATGGTCCTCATTATGATCCTAAGGATTCTATCTCTTTCAA GAAT

ATAAGTTGCGATGATCCGAGGTGTCACCTTGTTTCATCTCCTGACCCTCCACAGCCT TG

CAAGTCTGAAAACCAGACTTGCCCTTATTACTATTGGTATGGAGACAGCTCGAACAC GA

CTGGTGATTTCGCGCTTGAGACGTTCACGGTTAATCTCACAACCCCTAATGGGGATT CA

GAGATCAAGAAAGTGGAAAATGTGATGTTTGGTTGTGGACATTGGAATAGAGGCTTATT

TCATGGTGCTGCTGGTTTGTTAGGACTTGGTAGAGGACCTCTTTCGTTTTCGTCTCAGC

TTCAATCTTTATATGGCCATTCCTTTTCGTATTGTTTGGTTAATAGGAACAGCAATT CTAG

TGTAAGCAGCAAGTTGATTTTTGGTGAAGATAAGGAACTCTTGAAACACCTGAATTT GAA

TTTCACTTCATTGGTTGGTGGGAAAGAAAATCATTTGGAAACATTCTATTATGTGCA GAT

AAAATCAGTTATAGTTGGAGGTGAAGTGCTGAATATACCTGAGGAGACATGGAATTT GT

CTACAGAAGGTGTTGGTGGAACGATCATCGATTCAGGAACCACTTTGAGCTATTTTG CA

GAACCAGCATATGAGATTATAAAACAGGCATTTGTTAACAAGGTGAAGCGCTATCCT ATT

TTAGATGATTTTCCAATTTTGAAACCATGTTACAATGTTTCTGGAGTGGAGAAACTT GAA

TTGCCTTCATTTGGGATAGTTTTTGGTGATGGAGCTATATGGACTTTTCCAGTAGAG AAC

TACTTCATCAAACTTGAACCAGAGGACATTGTTTGTTTGGCAATTTTAGGAACTCCT CAT

TCGGCCATGTCGATAATTGGCAACTACCAACAGCAGAATTTTCATATCTTATATGAC ACC

AAAAGGTCAAGGCTGGGATTTGCACCAAGAAGATGTGCTGATGCC

SEQ 19

TTACAAAGGTTGCTGAGCTATCCATCTTTTAAACATATGAAAGCTCTCTTCACGTTTGTA

CTCAGGAGCTGTGTGCCCTCCTCCCTAAAGAGAAAAAAAGACAAGAAGGAAGAACAA A

AACATCTGAGAACTGTGAAAATGTGAGCACAAGACAATTTTACTATCGAGTGTGAGT AC

AAAATTCTAACCTTTACTGTTGCATATGTCATATGATTAGAGAAAGATCTTGTGTAA CTG

CACATCAAAGCTATAAATTTAGAATTTCATAACAATTATATCGTCTATCCAACTGGA ATAG

AGATCCAGAGAAATTAAAATGGAGAATCATACCCTGCAACTTGACCATCAATTGTCC AA

GGGCGCCAATCATCAATGATAGAATAATTTAGATACTTTATCCATGCTTGCGTCGAT TGG

AAAGGAACAACCATGTCATGATCACCACTGGGCAAAAAAAAAAAAAAAAAAAAAAAA CTT

TGTTATGTATATAATAACTTCAGCTAAATCTTTTTTGCATAAAGCACAAATTAGAGA ACTA

CAAACATTGGTTTAGGTTTAAAGTATCAAACCTGTATATGAGTGATCGATAACCTTT GGA

ACTAAGGTTAACATGGTAAGGTATACTATTCATGAAAGTAACTCTATAAGTTGTACC CAT

AATACTTTGCCTACATCTCGCCCACGCTCTTCTTATAGTTCCCTAATCAAAGAAGAA CCA

AAAAG AAAAG AAAAAAAG C CTTTTAATATCTTG C AC CTTAATG C C AAG C AAC CTTC ACTA

TGTACAAGTACAAACAATTGAAATACCTTTCTAACATGGAGAGCCTCTTGAACACTA GG

GTCATTTGCCCAATGATTGGAGAGCTTGCGTGTCGCAACCTAAAATCAGAAAGCATA TG

TATATCTAAGTATCCTACATTTATCTGTTTATCCCGAGAAATCATGAAAAAGAACCG GGC

ATATATGCCTCTTTTAACATAAATAGGTATATATCTTAAAATTATTAAACAATATTC AAACT

TTATTCGGTGTAAAGGCATTTAGTTGCTGCAATATTCAAACTTTATATAGTAATATT TGAG

GGACTTACACGACTTTCTCGACAAATGAAGTCATCATGTTTTAGAAAGATAAAATCC TCT

TCAAGAGATCTCCTTTCACCAGACAATTGGCGTGGGTTTGGCGATTCTGAGTCCGTT CC

ACAAAAAGG CTCTAGTATTTG CTG ATCATTTATACTG CTTACAAG CTACAAAGTTG CATT

ACATGAACATTGTGGTATTACCTAATTACAAACTTAGTACCAATTCAAAACATTAAA ACAA

GAATATGTAAGAAGGAATATAACCTTACCTTCTTGAACATTTTAAAGTTTTCTAAAC ATAG

TTTATTGGTAGGATCAATGTTTCGGCAATCGCCTTTGCAAGTCTCCTTCAGTGACTG ATA

CAACAAGATCCAAATTATCATTTTTGTGCCAAACTGTGTTGGTATGATACAAAGTTA ATAT

ATGAATTGAAGTAAGAGATGAACCAACCTCATAAAGTTCATTAGATATTAGTCCCAT ACC

ATGACAGAAAG GAATTTG GTAATTG CTTTCTTCAG G AAATGTTAG CGG ATTGCCAAGTG

AATAACCCTGAAAGGATATTAATTACCATTAAAAATAATAATTTTTCATGTATGAGA TATA

TTTTAAAT AAAAG G AAAG G AATAG AAAAC CTTAAG GTTG ATTAGTG GCTTTTTG CCTG CT

TCAATTCCTGCATTTTCATATCACTCTTGAAGTGTCAACAGAGAACACTTTGGCTAG AAA

TGTTTTGAGAATCTTCTCTTCAACCAAGTGAGGCTACACATTCAGTTTTGAAGAGAC AAC

AAAAGAATGTAGTGTTCTATATATCTTATCTAATATCTGTGGAAGTAATGATCAATA ATAA

CAAAGCATAAACTTGGCTTAGAGCCCTAGGAATGGAGAACCTTTGATCAATAGAGTC AC

TTTACTCCCCCTAGTGGACCAATTCAGCATTAATTGCCAGCGGGCTTCAAATAACGA AT

GGCTAAACCGGAAAAATAATAATAACAAAGCATACACTAAAGAGTGATGGGGTGATGAG

AACGATCACAACCGCACATTATATGAAAATACTAAATGACACTTACCATCTGATATTAGT

TGAACAATAACTGGAACTGTAATGCCTGAATATGAGTCCCCAGAAACATAGAAAGGG TT

G GAAATG AATTCTG GATG ATTATTGAACCACTG CAAAATAAATCACAACTTATTTCGC GA

TACATTTTTGTTATATTTGGTTGCTATTTAATTATTACTGTCTTCATTTCTTCACGG TTCAT

TTG ATCAATCCAC CAAGATACACATATATTAGG ATTTAG GTGG GTG GAGG G AATTCAAA

TACGTATATATTGAGCCTAAAAAATTTATATCCTAGATCCTCTACCAATGAACAATG CAAA

ATGAGAAGTATATTGAGGAATACATGAAAATTCAGAGTTAACTCTGCTTACTAATTC AAT

CTAAAAAATACTCAGTTTACTAATTCATATGTATCTAACTAAATGAACTTGAAATGA TTTT

CAAATACCTTTAGTAGAAATTCATAGACCTGGTCGCACGCTTGTAGATCAGTACACT TG

GATGCCGCTGAAGTTGTTGCATATGAGAACCCAGTATTTACAGGCTGTTCCAAGAAA AG

TATGCTCGCAAACTACCAAGTTTAAGGAACACATTAATTTGATGATCTAATGTTAAC CAT

AAGAGGATAAAAGAGAACATGAAGTGTGGATTAGAATATATATGCAGTTTGTTGACA GC

AAAATGAAGTGTGCAATAGAAAAATAACATGCTCTTCAGTGTTACCTTTGTCCAGGA ATA

TGGAGTTGAAACAAGAATTGGTAGGCTCCCATTGTATGCCTTCTGACCAAAAGCCAA TG

GCCCTACAAAAGAGAATTCAGAAGTTAATTTTCTCCAACTATGAGTTACACGTAATA CCA

AGCTTTTCCACACCTATATATTTACTACTAATTGACTAATTAGAAACATCGACTAAA AGTA

AATTGCTTTTGTGATACTAACAAAAGCATTTCTCAAATGAGTTAAGTTAATCACAAA AGGT

TAAACTCTATTTGACAAAAATTACATTTGAACACAAACAATAATGGTAACTGTTGGG ATAA

AGATAACCTCCAGACTATGATTACATATTTAAGGTGAGTTACTGATAAACATACTTG ATA

ATACAATAGTAGTATAACTAACATACTATCATAGGTTAAATAATATTTTATAAAAAA TATTT

ACATTGTCAGTATATACAATATAAATATTTACCTACTTCATACGCCACACCCGTGAA GGA

TGAGCAACCAGGCCCTCCCGTTAGCCATAGCAAGAGTGGATCTTTTTTAGGGTTGGA TT

CTGATTTGACAAAGTAATAGAATAGTTGCACTTCCTCGGATTTGCCAACTCCAATAT ATC

TAACAATTGTATTCAAAATACATCACTTCAACAAACTTGTTTTACTACTCCACTATA TATG

TAGCCAGTATGTTCTGAATGAAGTAAATTACCTAAGAAAAAGTTTAGATTCTTTTTA TACT

AATTGATCTTTTGATCAAATACAAGTTAAAATTCAAAGGGTGGTAAAATTAACGTAC CCA

GTCTCAAGATAAAAAGGAAGAGGGCCATCAAAACCAGGAAGAAACTCAACAGTTGAGC

TATTCTGAGGAAGACTTTGTACATATTGTAGAAAGAGAGTAAGAGGAAGAAGAAGATGA

AACAATAGTGGCAGGCGAAAACCAGACAT

SEQ 20

TTAACCAGCTAGAGGATTCATCACACTGCCAATGAATAACACCGCACCAGTAGATTCGT

CTCTTATAAGGAATAGGAATGGATGGTCCGCAACAAAATCCATTTCCTTCTCAATAA TCA

AGGACATGGTCATTATTACAGTAGCGGTAACAGCTGCAGCTTCGGTTCCTTCCTCAT TT

ACCTCAATGAAAGACTTGTGAAAAACCTGTGAAACAGACAGGTTCTGAGGCATAGGA GA

ATCAACCATCTCAGTGAGGCTACCACCACAAAAAGGCAACGTGAGGCCGAGTCCCTT T

AGAATGTTGGAAGCTTCAAATCCAAAAGTTATTTTAAATTTAGGGATAAGAAACTTG CGC

GCTCTAACTTTTCCATATGGAACATGGTTATTTAAAAATCCTGGTTCTAAGCTGATT TTTT

CCAGTAAAGCAGGTAATCCATCATGGGCATCTGGGAGAATGAAATACATACAGAAGC G

ACGCGTATCCGTGCCTTGTTTATAAGGAAGCCTCAATATTTTAAAGCAATCAAACGC TG

CTATGTACTGCTTCTTCTTGCTAGTCATAAATGGTGCTTGAATAGACCCTCCATTGA GGA

GATGGAAGTCATGATCTTTCGTTTCTGACACATCGAACTTCTCATTCCATTCTCCTT TGA

AATATAGTG CATTG G AC AAGATCAGC CTTGTC ATGTTGTTCACTG CATCG CGAG GAAGA

ATCTCTTTGATAAGACCATTTGTCTCCAT

SEQ 21

TTAGAAAAAAAGCCAATGCTTCTTTCTGCGCACTCTAGCTGGAACCTCTGTGGCACATT

ATGTTCAAAGTAAAGCAAAAATTAGTATTCAAGAATAGCTATGACAAAAAATTCTGA ACT

CAGAAATAGTTAAAGCAAGAAGACACTTACCTCCAATTTCGTATAACGAATTGTAGT GAA

CTTCGCTCCAGAAACTCAGCCACAATTCTGAAAATCAAGAATAACAAGTCAAAAGTT TTG

TCTTCAAGAGATTAAACATATCGAGAAGAAATGTTCGTCAGCGCCACACCAACACAT TT

CCGAACAAGTGATTTGGAATTTATTTACTTGTTAAGATAAGAGTTAATCTGATCAGG TTT

ACTCAGTCTTTCCTAAGATCACCATGGATGTGTTTGTTTATGAAAATATAATCAGCA GAA

CAT CATC AAACATGTTAAAG G G GAG CTTTGG AGCAACGGTAAAGTTGTCTC CATGTG AC

CTATAGGTCACGGGTTCGAGCCGTGAAAGCGGCCACTAATGTTCGCATTAGGATAGA C

TGGCTACATCACACTCCTTGGGATACATCCCTTCCTCGGACCCTGCATGAACACGGG AT

GCCTTATGCACCGGGCTGCCTTTTTTTAATCATTAAACATGTTCAGCATATTCAATT TCTT

GAGAAAACAATTTTAGCATAACAAAAGAGAACTATTAAAACAGGAGCGACGATCTGA TG

TTACTTTATTG G CAG CG GATCAAGTTTATCGTCTGTAACAAATAACG CTTAATAAGTG CT

TGCATCAGTAAATTGTGTGTTTCACGTGAAGCAAGACTCTGGAGATAAATTCTCCTG ATA

GCAACTAATACTGCTCTTAGTAAGAGCATCAGGAAACTCTGTACAAAGCTTACATTC ACT

AATTGTAAACATACAAATGATCCACAGTTCACAAATGACAGCGAAGGATTCTTGTCA TTA

GAATCAGCCTCGTTCAATGATGCAACAATGTCATTACAAATTAAAGGTGTTTTAGTT TTA

CTATTTAAAAATTAGTTAGTACCAATTCACTTCGACATACTTCCACTCACAATCTAT TGTG

TGTGGTACCAAGCAATACGATGATACTTTCTCCTAAATGAGAAAAGCACATCCATTT GTC

AAAATAAACAAAATGATTCATTTTTGTTTTCCTTTTTTTTCATTTTTGTTTTCCTTT TTTTTT

CATTTTTGTTTTATATATGTATTCTTATAATAAAGTGGGAGCATGCTAGAGAGTTCA AAGT

AGGACCATGCTACAAAGTTCAGAAGAATACTTTTGGTTTAGTGTCTCAAACAAAACC AG

CAAGTACTATTATTTAAATTCTGGAATTTATATCATAATATCATTATTTTAGAGTTA TTTGT

AAATTTCAAGTATTTATTTTATTTTTTGAGTTTAAAAACTCAAG CAG G AAGTTAAAG AACT

AATAATCAGAAGTATTGATTGTGCCATGATCATTAGATACAAATAACATAAAATGTA TGTA

CCCCTAGAAGGTTGTATGTCCTTGGGAAGAATATCAATGTAGCCGTTGTCTCGGAAA GA

C GTG AC C AAAC ATATTTTTAC C C C AAACTG C AAAC AAAATAC AAG C AAAC AC AAGTAC C A

TC AGTC AC C C AAAAC G G G AAAG C AAAAAAATAAAAATG C AG AG G G C ATAAG G G C G AAA

G GCTG ATTAACAACACAAATATCTAAAG CAGTTCC ACAAAG GTAAAAGAG CAAG AG AAT

GGTCATATTGCGGGAAATTCTTTAGGTAGTAGATCTCCTGCTTTTAAGTTGCTTTCG CGT

TTGATGCTATGTTGAACTTCAAATATATTCAAGTTTGAACCCATAATTTCTAAAGTG TAGC

AAATTTAGTGGTAAGAACCTAAAAGCTGAACCCACCAAACTTAAATCCTGAATCCGC CTT

CGTATTTGATGTCTTAGACATTTGTATGCCTTCTAAGCGTAAATGTAGGTTGACAAG ATG

AAACAGGGCAAATTTAACTTTTTTTCTCATTTCTGCTCTAATATCTACAGTTAAATT TCAG

TAACCTACATGCAACACAGGCATACTGCACAAAAAGTCAGGCTAAAAAATAAGAAGA TG

CCGGCGTTTGTAAGAAAGTAAGAGGATTATTAGAAATGTTCTTGGTTATTTTCTCCG ACA

GCCTTGGGCAAGAATAAAGCTCTAAATTTCACCGTCAAAGAGTTCTAATCAGGAACC AT

TTTAATACATTG AGG G GTTTTCTCTTAAG GAAAAATAATTTTCATAAAAG CCAAATG GTTA

AAGGTTAACAGCTGAGCCGAATTGCCTTAATACTCATAACTCTTAATGCTACCATAT TAT

ATACCAACACCAACTCAAAAATTCACTTCCTAAAATTTTTCAACTTTTTCACAGGAA ACTA

TTCCACTATGTGTATTATGCAATGTTAGATACTATTAACAAAGTTGCTTCATTGTTT TTTTT

CTCATTTAAAAAGCTTAAATTTGCAATGCAACAGTTCCACTATCCAGGCAATACACC TTT

ATTGACAGTATAATTTGTTGGCTTTTCTTGTTCGTAGTGTTCAAATGCTGTCTTACA TACT

AACAAGTGTACCGTATATATGCCTCAAAAGCTAGTTGAGGACAAAAGAGAAACTTTC AA

CTTACTCTATCGGCAGCGGCTTGTAGAGTGACATGATCTCCCCACTCTCCCAACCTG GT

TAGAGCAAGAATATGTTGGTAATATATTGAGATTTCCCGCAATAAGAAGCACTGAAA ATT

GACCAGACAGCTTACCTCTTCATTTTCCTCAAGTAGCTTTTGTATCTCATGGGCACA TAA

CCTTCATATAACTTTCTAAAGCGCTTTAGCTGAAATGGATTGCCAACTAGGATAAGA AAC

GTGTAGGACATAAACTAGACATAGGCGTCAAACTCAAGCAAGTATAGGACTCTGTTG AT

GTTTAATAGATCCAGAAATTAGGTTAAGCAGTTGAGCTGATGAGTCTCAAGCAAACA CC

GGGATATTTTATTTTTAGTACTCTATTATTATGCACAAGTTACTTTAGTTCTACTAAAACT

CACGAAAACGAATATAAACTGCAGGATTTGTTAATCTAGGATGAACACAATCCTCAAACA

TAATCATTATGTAATCTCAACTTTTCACCCATCCTGTTTCACAACCAAAGGCAAGAG ATG

AGAAGCAAATATTGAAAAAATGCAAACAGACCTGTTTAACGACCTCCTTCCTTACAT GCT

TATGATACTCTGGATTATGATACAACTGATCCGAAAGGGCCCGAAACTACAATAAGC CA

AGTAGACAGTCATATAGCAGGATTAATCACCTGTGAATTAATAAGTAGCAGGTGGGT CT

CAAGTTCAATGAGAGTTTGTCGACTAGTCAAAAGAGACCGACCTGGCAATTTCCATC TC

CTTCAATTTGCATTTCAGCAAGACCATATGTCGCTAACCTGAAACAAAGAAGTTACA AAT

AG G ATC C G AGTTG C C AAAAG G C G AAAAAC ATAG AAAAG C AC C AAG AAATTTC AC C CTTT

AAGTGCTAAGTGCAATTAAAGCATGAATTAAGCTATATAGAAAGACACTAATGAAGA AAA

AGATAACCTATGAGCAGAAACTATAATAATTTGGACAAAGAATTGAAAAACTAAAAT TAA

TCAATAAATATGAAGTAAAAATCATGTCAATTAAGTTCCATAATACTGGATTCACAA AATA

AGTCAACCAATGACCTCAAGCTGAACAACTATAGCATGCTCAGTAGTCAGTACAATG AT

GGATAACCTGCTAGAGAGCCTCCCATGGTCTAGTGTGGCATCATTTGGGTCAGGTAT CT

CCCCAATTACCCGTGGAGTGTGCTGCAAGGATAATAAGACTATAAGTAGCATGTAGA GA

TAGTGCAACATGGGTTTTCATGGGTTAAAGCTCAAAAAGGAATATATAGAGAAGCAA CC

TTTGGATACATACATGTGGTCACATTATACTCATTTAATTTGAAAGAGTTGCCCCCA AAA

ATGTAAATTTAGATAATTGTTCATAAAAAGGTCAAAAGTTCCCCGGATTTCTAAACT TCTT

GAACATTCTCCTTCTTCCTCTATGGATAGGGTGACGTGACGCGCAATGGATCAGTTA GA

AAGGAGAAGAAACTGATCAAGTGTCCCCTTCGCAACAACCTTTTTCCCTTTCTCGCA TC

CTTGGTCAAGCTTGAGATTTTCTTCTATCTGGATTTAGTTACCCGAACCTATGCAAA GCA

AGCCTACACTCTAGCTGGGGCCAAGCCATCTTTGCCTAAGTTCCATTACCTCATTAA CT

CG ATAAAG CTG GC ATTAAG AGAAG CTTG GTAG CATAAAGTGTCAAG GAGGTG GTTTTCG

CCTATCTATAAGGACAGTTGGAGTTGATGGTCCTGGTTTTGTTACAGTAAGGGATGT TG

CAGGAATCACTTGCATTGATGGAACAGAAGCTCTTCCTGCTCTTGTATTCGACGAAG AA

TAGGCCCAAGGGACATCAATGGACAATCGCCAAGCACATGGTTTAGCTAATTCAATA GC

CTTCGGAGAAAAGTGATAAGTACCATCGAACAAGCAAATCACTTCTGGATTATTAAC CTT

TAAGGATCCACTTATGGTGAAATTTTGGAATGATGCGGGGATAACCCAAGGGAGTCA G

CTAAACAGAGACGGAAAATATCAGTACCATTCTGGTCGGCGTATGCTTGTTGAAGAG AA

CAAGCTCACTTCTAGCTAGCTAAAGGGAAAGAGAGATACTATAGAATACCTAAGTCA AT

TTATCATTGAGCTGCAAAGATCATAGCAGACACTACGAATTAATAGCAAATCTCGGC AC

AAACCTCTACATGTCAAGATAGTGAGTCCATCAGATCACAAAAGACTTAATACCAGA TAC

CTTTCTAGATAAAGGGGTAAAATCACTTTCTTTCTTCCACAAGATCTCCATTTAAAG AGA

CTATCAACTGTACTTAAATGTAATCATATCAACTGTACTTAAATGTAATCATACCGG GATT

GAGTCCAAATGAGAAAGTCTCCTCCCAAGTTTATTGCCACCATATTTGAGAGCATTT TCT

TCTTCTTCAGCTAAAATTCTTGCAATGGTATGATCATCCTCAGTACCGTGTGAACTA CTA

TTCAAACTAGATGTAGTAGAACTCGAGCTTGCTCTTGAATTTCCATAGGATTCATTC AT

SEQ 22

CTAGAAAGGGTAAATGAGACCTCCGAACTTCCCAGAAAATGCTTCCTTCTGAGGCTGTT

G CACTATAGTTG CATTGCTCATTCG CTG CATTTAAAACAAATTAACTGTGAAAACTACAG

TAGCAAAAGGTTAAAGAAAACGAACATGAATAGCACGTCAAGAGAAATTGGCTTTGC TT

TAACGGTTATTTCATCTCTGTCAACAATGAAATGGCAATGAGTGAACCTTTTCAGAA TAA

GTTGGCTTATCTCATTAATGAGAGACAGATAACAAGAGATGTCTCCTCTAATCTCTA AAT

TGATATTTCATGTTGTATGGATCCTAATAGGATGAGAAATGCATCAAATACAGAAAG GAA

TGGCAGAAGTGGAGATATACCTTGAGGCAGAGGTTCCGACTTGTGTCACAAATAGGA T

AATCATGTGGGCAACAGTGGCGTCCATCTTTGCAACACACTGCAGAATCCAACCCAC AA

CATTTCCAAGAAACGCAAACTCCAAGAAGCCTCCACCCACAGCAGCAGGTTTCACCT TG

ACCACATGAGGTAAACATACTGCATTTGCTTGGACCAGGAGATGGAGGAGATGGTGG A

TTTGGGCTACTCTTAGTTGGATATGAAGCTAGCTTATTGATCCCACATATCCCTTCTTGA

TTCCCACTATTACGCTGCATGTGCATATAACCATTTATTCCCCAGCTTGTTCCCCATGAA

TTTTTTATAATCCAGTAATCAACTCCATTTTCAGAACCATAGCCCACAATCAGTACC GCA

TGATCAAGTACTGTAGAACACGGTCCAGTGAATATCCCCTGCAACACAAAGATAGCA CC

TTTATATTTCTCTC CGACAAACAATTTAACTG ATTAG GAG ATTG GTAATTTG GAG ATG GA

AGATACCTTTGAATATGATTGAAATGCTCTCTCACTGCCGCATATCCCAACACTCAC GG

GTTGATTTGCCACCGCCTTTAGAAGCTTGTCCTCATCATATTGGGGAACATCAGTAT ATC

CATCAATGGTTACAACACGTCTTTGTAGCTGCAAAGTCGACAAGTTAAGCCAAGCAA TC

ATATGTTAACGTTGTTCATTATGTTTTATCTGGAATAAACTTGTCCTAGGTTCTCTA TATT

AATTATG AAATC C AG AAG C AG GAG G G CAT AT AT AG G AC AAAAAAAG ATTTAGTAC AAG A

ATAGGAGCAAGAAGATGAGAGAACAATTATGTACCTTGTTTTTGTTGCATGTTCCTT CTC

TTTCATTAAAGGGGTAATCCTCTTCAGTGTCAATACCACCATTCTTTTTGACAAATT CAAA

AGCATAGTCCATCAATCCACCTCCACAGCCGTCATTGTAACTTTTGTCGCAATCAAT TAA

CTCCTGCTCAGAGAGACTTACAAGAGATCCAGTGACAATCTTATTGATACCTTCGAT TG

CTCCAGTGGCTGAGAATGACCAGCAAGCACCTGGGAAAATGAAACAGAAGTAACTGG T

TTTAGTTACAGAAGCTAGTTGCTGAGATTAAGTATATGGAATGACAATAGAATGACA GTG

TTGTGGACAAGGGCAAATTTGATTCATATTATCTCAGAACAAATTCACAAAAAGGCT AGA

TCTTCACTTCCGTCCTATATTCAGGCTGACATTACCAGACATATCTACAGAAAATAA TTA

CTTGAGAAACATAAAGGCAGTGATAAATTTTAAGAATAAACTATACTTAGTGAGAAT TGT

GTGCAGTCATAAAAGTAAC AAGTCTAAGTCCCTGAAG CAAATTCTG CATTG G GAG GAAA

GTATATTTCCGCGTATATGACAGCCAAATTAGTTGCTATAAAACATCACACTAGTAT GTG

ACTCAATATTGACAGTAAAATTATAAAACATGTTCCTTCGATTGCACAAGCAAGTAG AGA

ATCATAGGGTACAATTGTGTACACAGTTCCAAAAACAAGAAAGAAGAGCTAAAACAA TG

AATTGTGAGTCAATGATTCAATGGTCGAAAACAGGACCAAGAAATGGATCAGTGGAT AT

TTATATTTATTTCTATTTTTAAAACTTAAAGGGCATGATGTGAAGACTGAAGCTGGT AATC

CAGTTTTGATTGTGATGCACATGAATGGATGTGGAAAGTAATATTCTCTGGAAGACA GA

AGACCACTAACCACCTCAGTTGCTCAGACCAAGATAGTGAGTAGATCTCCCCTAATC TA

TTCAACAAAGTTCATTGGAAAGAAACAAACAATGAAATGGCGGATCTCCGAGCAGTC TG

GTGAAGTTTATGGTCCAGTGGTTAAAAAGAATGACAACTCAACCATATTTTACTCCT CCG

ATGTGCTCCGTTCATGATCTTTTTAATATTTCTCGCTAATCCGCTAATCAATAAAGA ATGA

GATACTGTATCAGTATGTCCTATTATTGTTGTCTTCCAGTCACTCTGAAGAAATGAT TTTC

ACATACATAGAGACAAAAATTGAAAGTAAGAAACAACAACAACAAACCAGTGGAATC AC

ATAAGTGGGGTCCGGGAGTATAATGTGTACGCAGACCTTAGAGGTTGTTTCTGATAG AC

CCTCGGCTCAAGAACAGTGAGAAAATTGAAAGTAAGAAACAAACAGTATATTCATTC CTA

ATCAACTCATGAAAGGACGAGCTCATGAGACTAAGTTTCAACAACAACAACCATATG ATT

GTTTATTCCACTTCATCTTGATTCCAATACCTAATAATTTGTCTTTTGGGGCAACTC AAG

GGTTCCTAAAGCTAAGAATTCTCTAAATCTCACACTTCTCCTTATACAAACATTCAA ATCC

TAACCAAACTGAAAGTGCTCCTGTCTAATACTGATGAACTAAAGTAAGTGCTGAGGC TA

GGTTTCAATGAAGTAAATTAGTCCTGAACTTCAACCTGTCAAATAATACAAGGAAAA GCA

AAAAAGGGTAGCTCCCAGACAAGAGAAAAAGGCAAAACTAACATCACAAGTTTCCAT TG

TCATTTGAGAAAAAAATCATCAAAATCCAAACTTTGTAAAAATTTCTAATGTTGGCT CTAC

TATGCACAAGTTATATATCCTCCACATAAATGAAATCACTATAAAGATACAACTAAA AGAT

AACGCAATAAACTGAGCATACCACAACTGCCTTGATTCTTGACTTTAGTAACAGCTC CTT

TCTCTCTCCAATCCAAAGAAGAAGGAATATCAACAACACCAACATCATTAAAAACTC CAG

CAGAAGACGACCCAGTTTTCAATCTAATAAAATCATTAGCAGAAGAGGACAAACCCA AA

AAAGAGTTCTTGAATTCATGGTGAGTGAGATCAGAAAAGGCATTGAGATTAAGGGTA TA

AGTGGAATTCCCCTTACTATTATGCTCTATAATATAAGCATAATTTTCTTCAAACAC CTCG

AGTCTGTACACCCTTTCTTGTTCAGAAGAATATGTCTTTCCATTTTGCTGACACCAA CTT

TCAAAAAGATCAGAAATTGATGAACAAGTGCAAATTGGTCCTTGAAAAATTAGAAGT ACA

AGAACCAAAGATGGACATAACCAACTCAT

SEQ 23

TTAATG CTTATTCCAGAAACTCCACTTCTTCTTCTTCTTCTTAAAGTCAAATG G CAG G AA

GTCTGGAATGGAGCATCAAACTACAGTATTAGAATAATATGATAAGGGTAGTGTGTA CG

CCGGTGGCGGACCCAGGATTTTGTGCAAGCGGGTTCAATCTTAGAAGTATATAACTT TA

GTTGTAAAATAGTAGTTGTCAAGTGGGTTCAAATAAAATATTTAAACAAAATTTACG CAG

CTTTAATCCTAATTTATACATATATACAGTATTAGTTTTTGATGCTTGCCACCACGT GCGT

CCACCACTGTGTACACATACCCCTTACCCCCTACCTTGTGAGGATAGAAATCTAAAT GA

AGCAAGAGCAGCAAACTTCCGGTCAACTCTCAATGTTCATCGCTTTGTCCATAAGCA TG

TGATAAACAAAAAGTGTTATTCCGTAATGCCCATGGTAACCCCCCCCCCCCGGGGGG G

G N G GTTTAAAC ATGTAATTAATC AG ATATAG G C C AATTAATAATAGTTG AG C G AC C ATG C

TAAAACCACGGAACTCCGGAGTACCTAACCCCCCCCCCCCGGGGGGGGGATGTCCAT

G CTAAG ACAAACTAAACAGAAACG G GACATAAAAGTACAAG CAACTACCTCCTTGAG GA

TAGATTGAGTTGTAGTGCACCTCTGCCCAGAAACTCAAGTATATGACTGCAAGAAGA GA

AAAATAAGAGAAACAATTAAATTGGTGTGAGCGAATGAATATCAAGACTTCAAAACA CCG

CTCTTACAAAGTGCACATGCACAAAGAAATTGCATTCATACTTTAATTTCTCTTCCA AGC

AATCTCAAGATTTGCTTGCCCACACTTGGCTTCATAGTATAGGTATGATACAATGGC ATA

G AATAAATG AC ATG C ATAC ATAAC C AATATAAAG CTTG C C C C C ATTC ATTAAACTTAC AA

CCGTCTCATAATCATTCAATATGTTAAAACAGACAAATTCCGGTCTCTAAAAGGAGA ATG

TGAATGTCAAAGCATCCATGTTATGAGATGGAATTTAGATTTCAAAAGAGCTAAAAC GGA

CGACTCTTCAAAAATCAAAATCTCCTTCTCATGAAACGCAAAATCGAATTTGCTTAA GAT

TGTCCTTAAGGGTTCATAGTCATCCATTCATCCCTCCTTCCCCTTGCGCAATTTTTT GGT

CAAG G CAG G CCG AG GTACTAACTTTACAGTCCAAAGATCAAAAGTACTATTTGC ATTCT

TCACGACTCATGTAATAAACTTATGTTGTCTCTTTAACTCCAGTGGTCCTACTTTAT CAG

AGTCGTTATCTGATTTTGGACTTCTGAAAAAGTTTGATACAAAGATCGTACTAACTT TTC

CATAGTTGGCACAAATTTCAAGAATCAAATTCATCCAGTAAATCAGGTTGCTCTGGT ACC

AGCTACTTCTATAATTTAATTTACTACTATTACTACAATATGCATAATCAAATTATC TGCTT

CATCTCCATGTGTAGCCTGTGTCGTCTGAAACGCCAATGGGGGAGTACTAATTTGGT G

GTTACGAATTATCATACTATCTCCCCTTTTTGAATTGTTGAATTTGGTCCTGAAAAA TGTT

G CGGTTTTG G CTAAAAGTCTAAAACTG CATTG CAGAAAGTATC AATAG AACG AC ATAAG

AACTCGATAATGGTTTCTCAGTTAGTGGAATTACAGCTGAGGAAAGCATCTTTAACC GC

AAACTGGAAATACGTACAGCATTAAGCGACTCGTGACTTTTTATTTGAGACACATGG AAA

TTGAGAAATAGGATCTATGTCACTCCCACTTCCAAATATTTTTGTAATAAAAACTTG TTCA

ATCGCATTTTGTGAAGTAGAGGATATTACAGAAAGGTAAAAGCAATTCAAAGTTTGA GAA

CTAGCCTACCTCTGTTTGACTTCTGATTCTTCGGAAGAATCTCGATGTAACATGTAT CCT

TGAATGACGTTATAACAAGAATTTTCACACCATACTGCCCACAAGACAAAAAGACAA AAT

CAATGTGCAACACAAGTAGGTTCATTCACAAACCAATGCATCCTAGTCTAGCATCAT CA

CAAATAAAATCTTCATAAAAG GAG CTG CG CAT ATAC AATAAATAAAAAATG CATC ATAAC

CACTCAAAATG GAG AGTG AAAG AAAGAAATAGCAAAATAG AG G CAC ATG AATTAACAAA

AGCTAGTAAAGCACCCAATGGAGGCACTATACCAGGACATCCAAATTATGGTCTGGC CA

ACAATAGCTTAAGCTTCTTATATTCCAAGGTTAAAAAGTAAACCAAAGTAATCAAAT GGA

G AG AAAAAAC CAAG G AAG C AAATAAG G G G AATAATC AATAC C G AGTC AG CAG CAG C CT

GCAACGTAACATGATCGCCCCATTCCCCACTCCTGGTTCAACCAATATATAAGCAGA TG

TTGATTTAAAGGAGAAAAGATAATGCAGAATTTCAAAATGGAGAAACAAAGGTCTGA GA

ATTACTTGGACATCCTCGTCAAGTACTCTCCATACTCCATTGGGACATATCCCTCAT ACA

TCTCCGGATGATGTTGAAACTGACAAGACAAAATGATTTTAAAAAACATACTCAGCT AAA

ATGTATGTGAAATAATTCAAAAATGAAATCGAAATATCATAAAGAAGATTATCTTAT TCAC

TTAGACAAG CAC CTGG CTG ACTACTTG CTGTCTGACAAATTTGTG GTG CTCTGGTGTAC

G ATAG AATTG ATCTG ATAAAG CAC GG AACTG GAACAAAAGAG GACATGTG AATAGATGT

G C ATTAG AAAG AAATAG G ATG G AAC CTTATTTC AAC AAACTTAG AATC G GAG G C G AAG C

CTAATCCACTTGGGAAAAATAGGCAAAGTTCCCATACCCCATAAAACTAAAGAGTTGAG

AAAAAAATTGAATTATCCTTTTGTCAAACTACTAAAAATCCACAAATATTTTTAATAAAGTT

GGAATGGCAAGCAGCCTGACAATACGTCACTTGAGTTTCATATTCCAATTTTTTTAA AAA

TCCTATTATGAACCACTTCATTCACCATTTCACTGTCACAAACACCAACAACAAGTT TCT

ATCAGAGCCAAAGAGTTAATTCCAAAGTAAGGAATACTGATAAACCGTCATCAAACA TTA

TGTTTTTTGTTTTTCATTCCCTTTCTTCTTTGAACCAGAGAGTAAGACCTCCATACC ACCT

AGCACCTTGATATACTGAGATTTTTCATGAGACAAACTTAAAGAATTGGGGATCCTT CTT

TTGTTTATGGTTAAAAATTATGCTACAAGTAGTTTAAGGAAAGGGAAAACAATTTTT TTCT

TTCACTAGAAATCAAATCATGAGCGTCTCTAGACAGTTTGATATCATTACTGCAAGA CAA

ATCAACCAAGTAATACAGTAACCTGGCAGTTGCCATCTCCTTGCACTTTGTGCTCCA CC

AAGTCAAATAATTGCAATCTGAAGCAAGAAAAACCATAGCATGGGATCCTTAGCAAA GA

ATAACTTGCAGCAATAATATTACCATTCAATTGCATTGGCAAAAATATCAACTTCAC AGG

AAACTTAGGTGGCACAAAACCTAATAAAAAACACAACAATATCAACTTCATAGGAAG CTT

AGGTGGCACAAAATCTAATAAAGAAACACAACAAAAGAAAATAAAATGAAAAACCAC ATG

CATTGTCCTGTGACTTTAAGAAAACAAAGGGTTAATAGTATCTATTGGAATGTGCTT CAT

AAGTGTTTATTTACAAAGCACAAAATACTAGTTGGCTCATTCCACACCAATTATTTT CTGT

GCTTGTCTCGGCTCCCATCTCTCCCATTTGATTCTCGTTTTCTCAAAATGCTTGAGG GGT

CAGATGTTACTTTTCGAATAGCAGGCAGTAAGAACCACCAGTAAAAGAATGAGTTAA TA

AAGAAAGAGAAAACCTAACAAAATAAAAATAAAGAAATGTTTCTGGAGACACAGGAC CC

CTATCACAATAGAGGTATGCATTTCTCACAGTGAAGCTATATTTCATTACCTTGAAA AGT

GCTCAGAATTTGGAGAGAATTAAGCAAAGCACTATCATAAGATCATATGTCATTAAC TTG

TTCATACACAATTTTTATCTACTTGAAAACCTAAGTCGGAGGGATCAGGAGATACCT ATT

TAGCAGCCTTTGATGATCAGAAGTTGCTTCATCGACTGAAGGTATGTCCCCATTTAT TCT

AGG AACATG CTG CATG GTTAATAACACAAATTGTTATAAGAAACTAAAGCAG CCTCAAG

AAAATG G CCATAG GTGCAAAG CACCACACATGTC CTCGTACACAAAGTG AATTAGTGTT

CTCAATATACTAACAG ACACTACACTTAC AG GAACAG CACTCAG CTG GTTTATTCTCTTC

CCTACTTCTCCATCAAGCTCAAATTCATCTTGTATTTCCAATGTGTAGGTATACTCT TCTC

CATCGTATGATCTGTCTCCAGGACTCGAACAAGAACTTGAAGGCCCTACATCATCAG CT

TCTAGGCTGGTGTCATGCCCTGTGAATCATATATAATTAGCAAATGTTTAAACTCAA GGA

ACATCACAGAATTGAAAACAAGAAATGTACCAGCATAATACTCTCTTGGAGGAGTAT GC

CAATGTTGTACACCAGTGGAGGCTTGCAAATACTGCTCGTCTGCATGTGAAGATTCA GC

ATCTTCTGCGATGGACAACTCTGACAAATCTTCTTGTAGAACATGAGCAATAGCCTC ATC

ATTGTCAACATTGCAATATGATGTGTGATAGTGGTTTTCTCTGGCATATTGTTCATG ACA

TATCTCAACGTCATGCTTTCTACCATCACCGTAATAGTTGGAACTAAAAAGTTGGTC CAC

ATCGAGAAAACTAAGAACGCCACGAGCAGCTTCAGATTCCGGCTCACACAT

SEQ 24

ATGCCTTCACTTCTTCAAATTTTCCTTCCTTTGTTTCCATTCTTTTTCTTGGTTTCTTTC TC

AGTTTCTCACGGACCCTTTTTGCCAAAGGCCATTATTCTTCCTGTAAACAAAGATCT GTC

AACTTTTCAGTATGTTACTCAAGTTTACATGGGTGCTCATCTTGTTCCTACCAATTT AGTT

GTAGATCTTGGAGGTTCATTTCTCTGGACTAATTGTGGCTTAACTTCTGTATCTTCA AGT

CAGAAACTTGTCCCCTGTAATTCACTCAAATGCTCAATGGCTAAACCTAATGGTTGC ACT

AACAAGATTTGTGGTGTACAATCAGAAAATCCTTTTACAAAAGTGGCTGCAACAGGG GA

ATTAGCAGAGGACATGTTTGCTGTGGAATTCATAGATGAGTTAAAAACAGGTTCAAT TG

CTTCAATACATGAATTCTTGTTTTCTTGTG CATCAACTACTTTGTTG CAAG GTCTTG CTAG

AGGTGCCAAAGGAATGTTAGGACTTGGAAATTCAAGAATTGCATTGCCATCTCAGTT GT

CTGATACATTTGGTTTCCAGAGGAAATTTGCTCTCTGTTTGTCTTCTTCAAATGGTG CTA

TAATATCTGGTGAAAGTCCTTACTTGTCACTTTTGGGTCATGATGTTTCAAGATCTA TGC

TTTATACACCTTTGATTTCATCTAAAGATGGTGTTTCAGAAGAGTATTATATCAACG TTAA

ATCCATCAAAATTAATGGCAAGAAACTGTCGTTAAACACATCTTTGTTTGCAATGGATGA

AGGTGTTGGAGGGACAAAGATTAGTACAATTCCCCCTTTTACCACCATGAAAAGCTCAA

TTTATAAGTCATTTATTGAAGCTTATGAGAAATTTGCTATTTCCATGGAATTGAATA AAGT

G GAAGCTATAG CACCATTTGAG CTTTGCTTTAG CAC AAAG G GG ATAG ATGTCACAAAAG

TGGGGCCAAATGTGCCAACTACGGATCTTGTGTTGCAAAGTGAAATGGTTAAGTGGA G

GATTTATGGGAGAAATTCAATGGTGAAAGTAAGTGATGAAGTGATGTGTTTGGGATT CT

TGAATGGAGGGGTGAATCAAAAGGCTTCAATTGTTATAGGGGGTTACCAGTTGGAGG A

TAATCTTTTG G AGTTTAACTTG G GAACTTCTATG CTTG G ATTTACTTCTTCACTTTCAATG

GCAGAAACAAGCTGTTCTGACTTTATGTTCCATTCTGTATCAAAAGATTCAGCTTTT GAT

TCT

SEQ 25

TTAAGAAGAATGAGAAGTAAACTTATTTGTTGAATTTAAGAGGTAAGAATATGCAAAAGT

AGCATGAATTGCAGCACCAATTGGAAGGACATCCTCATCAATGATGAAATGTGGATT GT

GTGG AGG GTAAATAG CAC CAATCTTTTCATTTTTTGTTC CCAAAAG GAAGAAG G AACCA

G GAACTTTCTCTAAAAACACTG CAAAATCTTCACTTC CCATG AAG CTAGGTG CTATTTTG

AAACTCTCTTCCCCAACAATCATTTTTGAAACTTTTCGGGCATGTTCGTATATTCTC TCAT

CGTTTATTGTTGGAGGAAGTGTTGGATTTTCTCGACCATCAAAGTCAATCTCGACCG TA

CATCGATGTACTGCTGCTTGTGCTCGTATCACCTGAAAATTTTACCAATAAAAAGTT TAA

TTACCAAATATTGAATATAATAATATGTTCTAAAAATAACATGGATGTCTATTCCTA ATTAT

TAGCAAGTTATTTCATTCTCCCTAGTTGATTAGTGAATTACTGAAAGGTTATGGCGT TCT

GATATTGTTAAATGTACCACTTATTTTATTGAAAAGTTATATTGCATCTCTAAAGAA CTGA

AAAGTCATTTGACCTCTTGGCTCGTGACCCTTCTCAAAAAACAGTTCTTTGGCTATA AAT

AAGATTATTTTGTGTTGAATGAATATATCAAGCAACTTGAAAATATTAAAACCTCTT TCCC

GAAACAAATCCTACAATTTCCTCAAGTACTCGTCTTTTCAAACAAAGTATTAATGAA AAA

GAAACGTAACTTGTTTGAATAAATAAAATTTGCACATATAAACTTTGTAAAGAGACG TGT

GGGACTTTGAATATTGGTCAAAGTATCCAAGATTTTTGTTCTAAATTACAAGTAGTT TAC

CTCTTCAATTCTTTTCCTCAAACCGTAGAAACTCTTCTTACTGAATGCTCTATAGGT CCC

GGAAATTGTAGCTAATTCTGGTATGATATTAAATGCATGCCCCCCTTCAATCATGGC AAC

AGAAACTACCTGAAATTTCAAAAACTAAATATATAAGAATTGATAAAATAAAAATTT AAAA

TTTGTTTGAATAAGTTTGGAAAAGAAAATTGTTAATCTCTATGTTTCAAAAAGATTG TTCT

AGTTTGACTTGACACAAATTTTAATAAGGAAGAAAAGACTTTGAGATATGTGGTCCT AAA

TAAACCATATCATTTGTGTGACTGTAAAACTTTTGAAACTTGTGATCTTAAACTTAC TATA

ACATTTGTGTAACTATAAATGCTTCTAATAAAAAAAATATTAAAATTTGTCAATTTT TTTGA

AACAG ACCAATAAATAAATAGTGTCAATG CTTTTG AAACG G AG GTAGTAC CTGG G ATTC

AAGAGGATCAGTCTCTCTAGAGACAATACTTTGCAAACTAATAACAGAAGTAGAAGC AG

CCAAAATTGGATCAACAGAATCGTGTGGAACAGCAGCATGACCTCCTTTTCCTCTAA TT

GTAGCTTTAAAGCTTCCACATCCAGCCAAGAATTCACCAGGCCTAGATGCAACTACT CC

ACTTTCATACTTATGAACTAAGTGCATTCCAAAAATGGCTTCCACATTTTCAAGAAC TCC

TTCTTCTATCATATCTTTAGCCCCATGCCCTCGTTCTTCAGCTGGTTGAAAAATTAA CAC

CACTGTTCCCTGCAATATTATTATACACGGTAATTAAATTCATTACTTCAACTAATC CATT

AGCTTAGAAGTATGTATTTAGAGCTTAATTAAGGGTTTTATTTAACCTGTAAATTGT GTCG

GAGTTGTTGTAATATCTTGGCAGCACCAAGAAGCATGGCAGTATGGGCATCATGAGC A

CAAGCATGCATTTTTCCATCAACTTTGCTCTTGTGCTCCCATTTCGCCAATTCCTAC ATT

AGAAGAATTCAACTTTGACTCACGACTCTTTTATTGATCAAATTATTCACTTTATAG ATTT

TTGAAGAATTGATTAATCGAGAATAAATATAGAGTCCTACTGTAGAGGCATATTATA TGA

TATTGACCTCTACAACTTATAAAACCCGACTTATGATCTTTATTTTCTTTTTCTGTT GTTTA

GATCACAATTGATATTTGATGTTCAAATTAAATGTTTTAGCGGTGTAATATTATTAC TTAT

GGTACTTTCGGCCATCCTATCCAATTTTACTACTAGGAAAATAAAAAACGTGTTGAC CCT

TTATTCCACAACATATGAAACTAAAAGTAAAAAGAGATGGTCACCATAGAAGAAAACTAG

CTAAAGTATATACCTACGAATTGAAGTGTTTTCTCTTTCCCAATGAAGTTCCAAAATTCAA

GAATCTCTTTGTTTTAGGTATAATTAAGCTGTTTCGAACTCTATACTTAATTCAATA TTAA

GAGAGATCTGATTTATTACTTTCCTTTCATGGCTTAAATATTACCGCCGCCGCCGCC ATT

TCTGACAAAAACGGAAAGTAAACTGCCGCAAGTAATTTCTTCTTCTGCCATAGTTAA TTT

AGTCGCCCACAAAATTAATAAAATGACTCAAATTTACTGCCTACACCCTAGTTCCGA CCG

AATACAACATATAAATGATCCCCGTGCTGTTGTCATCTCGAACATCCTTAATAACAA TCT

CCAAAACCATTAATGAAATACAGACAAAGGTAAGAAGTAAATTTGAAGATATATAGT ACT

ATATTAGGCCTATAGATCTACCTCCTAAACTCCACAAACTGTTTAAAGTGAATAAAA CAT

TTAAAGAGTTCATATCAATTTTTTTTGATATGAAGAGTTATCCGTGGTTATAAATGA ACTA

AACGTGATACTAGTATAAATATTCTTACCGTTTTTTGTTTTGAATATAATTGCAGGG TTGA

GAAAATTTCCAAGCAGAGACTACTAACCTGAATAGGCAAAGCATCCATGTCTGCTCT GA

GAGCCACAAATGGCGGCTTACCGGAGCCGATGGTGGCAACAACTCCGGTCTTAGCCA

CCGGCCACCGGTACTTTACTCCCATCCGATCAAGCTCCTCTCTGATCAAACCACTCG TC

TTAAATTCTTCATAAGCAAGTTCTGGGTTCTCGTGAATTTGTCTCCTTATTTTCATC ATCC

ACTTCACTGTCTCCGTAGCATTTGCTAATTTTGTAATATAATCTTTCACGTAACAGT TTTG

ATCTACCAAAAACGGATTCAAGCACTCATCATCGCCGTGACACGAAGGAAAAACAAT GA

ACATACATACAAGCACCAAAATTAGAACTTCCTTAGCACCCAT

SEQ 26

ATGAAACTGAATCCTTACTCATGGACAAAGGTAAGTACTTGATTGTGAATTATAACTGTA

TTATGTACATAAGGTCGCTGCACAACACAAAATGTTGAAAATAAGATGGAATTATTA GGT

GGCAAGCATTATTTTCTTAGACTTACCAGTAGGCACTGGATTTTCCTATGCAAGAAC TCC

AACAGCTTTACAGTCATCTGATTTACAAGCAAGTGATCAAGCATATGAGTTCCTTTA CAA

GGTAATTAGATTCTTCACGAAATTATTAGTTAAATGTATTTTCTCCTTTGCCCCTCA ATGT

TGTTCAATATGTAGTAGAACAGTCAATAATTTTATGTTGTTTGCAGTGGTTCCTTGA TCA

CCCAGAATTCTTAAAGAATCCATTGTATGTTGGCGGCGACTCATATTCAGGGATGGT TG

TTCCCATCATTACTCAAATTATAGCAACTAGTAAGACTATATTTTCCCTCAAATAGT TGTG

AAACAAGTAATGGCAGCCTAAGGTAGTAAGGTGTTCTGTTCTTGTACTATAACATTT TGT

GGCCTTGTGATAATGCAGAAAATGAGATGGGAATAAAACCTTTTGTGGATCTTCAGG TT

TGTCATTTTTCTTGTATATATTCTCTTTTCCCTACGGATAAGCAGACGGATTACATA CCAA

CTCAGAATTTGTAACGAAATTGTTATGAGAATGTCACGACCCAAGCCCATAGCATGT ATT

GTCTGCTTTGGGCCTAGGCTCGCACGGATTTGTCTTTCGGGCTACGCCACCTCGAGC C

CCAAAAGCGCGTGCACCATGTGAACTTGTGTCATACCTTATAAAGTTCATCACTTTC CTC

TATTATTCCGATATGGGGATTCGTCTAAGGTGACATGTGCACCGCTTATTCAGAAGT TT

GGCAGCCTAGAAGCTAGTCAGTCCTACTTAACTTGCCCTCATCAGCCCCCTCCTTCA TG

GGCATCACACAGAATCAAAAGTCACTGTAGAATGTGAGTTGATTTGCAAAATGTATG AC

CTGATATCTCTCGTCAAGTGGTTTCAGGGATATTTACTCGGAAATCCATCGACTTTT AAA

G GTG AAAAG AATTATG AG ATTCCATTTG CTTATG GAATG G G ACTTATTTCTG ATG AACTC

TATGAGGTTGGTTTTCCTTTGGTGTTATATAGTACAGTCAAACCTTTCTATAATAGC TACA

TTTGTTCCGATATTTTTTGGATGCTATAATGAAGTGTTGTTATAGAGGATATATATT AGTA

TAACATAACATACAAAATCGGCTCCGAGAAAAACTTGGCTTTATAGTAAATGACTAT TAT

ATATGGATGCTGTTATACAGAGGTTTGACCGTAAGATCTTAAATATCCTCCAGTTAT GCG

CTTTAATTTAGTTTGCTTACATTGTCCTTAGAACTAATTGATTTCCCTTTCTCAAAT AGTC

CTTGACGAGAAATTGTAAAGGAGAGTATCAAAACACTGATCCAAGCAATACACAATG TTT

GCAAGATGTTCATACTTTTCAAGAGGTTGGATCCTATTTTGAGGAAAATCAAATATC ATC

TGTTTGTTTTATGATAGGTTCATTAACATACTGACCTTATGCAGCTTCTGAAAAGAA TTAA

TAATCCCCATATTCTGGAGCCCAAATGTCAGTTTGCTTCACCAAAGCCACACCTATT GTT

TGGCCAAAGAAGATCTCTTAATGTGAAGTTTCATCAACTTAACAATCCTCAACAACT CCC

TGCGCTAAAGTGTCGCGTGGGTACTCATCAACAAACTCTAGCATTCTTTATGCTATTGAT

TTTTTGTTTCACTGAGATACTTACGAGAATTTACAACTTGCAATTGATTTAGAATGATTGG

TACAAACTTTCTTCTCATTGGGCTGATGATGGCCAAGTTAGAGAGGCCCTCCATATC CG

AAAGGTACGTTAGTTCTTGTTGGAAGGGGAACCTTGGAGCAACGGTAAAAATATCTC TG

TGTGATCTATAGAGCACGGATTTGAGCCATGAAAGCAGTAATGCTTGCATTATGATA GG

CTGTCTATATCACACCCTTGAGATGCGGCCACCTTGCATGAATGCGTGATACTTTGT GC

ATCATGCTGCCTTTTTTTTTGAAGAACAACAAAATTTAACAAAGTGTGCTACACAAA ACTA

AAAATATGATCAATTTGATTACAGGGAACTATTGGAAAATGGGTGAGATGTGCAAGT TTG

CAATACCAAAAGACAATCATGAGTAGCATACCATATCATGCAAACCTCAGTGCTAAA GG

TTACAGATCTCTTATATACAGGTTGAGTAAGATTGTTGTGTTTGCAAGATTGGAATA ACT

ACATAAATAGTTGAAGATTATTATCTCTGTGAAACTATTTACTTAGTTTTCTATGTT TTTTG

AATTAAGCAGTGGAGATCATGACAAGGTTGTTACCTTCCTATCAACTCAAGCATGGA TA

AAATCTCTTAACTACTCCATTGTTGATGATTGGCGACCGTGGATCGTTGACAATCAA GTT

GCCGGGTTAGTTTATGATGAAAACATTGTACGCTAGTCATAAGCTCTGTCAAGGTAT AG

AAGTTAAACTCATTTTTTGTCTTTTGCATGATTGTAGTTACACGAGAAGTTACTCAA ATCG

GATGACATTTGCCACAGTAAAGGCAAGATATCTCTTTCACTTGCTTTTCTCAGTTAA GTT

TGAAGATAAAAAATTTTGTTAAATAGTTGGTGTTTAAATTGCACTATTTTGTTACAG GGAG

CAGGGCATACTGCACCAGAGTATAAGCCTCGTGAATGTCTGGCCATGCTCAAAAGGT T

GATGTCTTACAAGCCTTTG

SEQ 27

ATGTGTGAACCGGAGTCTGAAGCAACTCGTGGGGTTCTTAGTTTTCTCGATGTGGACCA

ACTTTTCAGTTCCAACTATTACGGCGATGGTAGAAAGCATGACGTTGAGATATGTCA TG

AACAATATGCCAGAGAAAACCAGTATCACACATCATATTGCAATGTTGACAGTGATG AG

G CTATTG CTCATCTTTTACAAG AAGAATTGTCAG AGTTGTCCATCG CAG AAGATG CTGA

ATCTTCACATGCAGATGAGCAGTATTTTCAAGCCTCCACTGGTGTACAACATTGGCA TA

CTCCTCCAAGGGAGTACTATGCCGGTACATTTCTTGTTTTCAGTTTTGTGATTTTTC CTC

GAGTTTAAACATTTGCTAATTTATATATGATTCACAGGGCATGACACTGGTCTAGAA GCT

GATGATGTGGGGCCTTCAAGTTCTTGTTCTAGTCCTGGCGACAGATCATACGATGGA GA

AGAGTATACCTAC ACATTG GAAATACAAG ATG AATTTG AG CTTG ATG G AG AAGTAG G GA

AGAGAATAAACCAGCTGAGTGCTGTTCCTGTAAGTGTAGTGTCTGTTAGTATATCAA GA

ACACTAATTCACTTTGTGTACGAGGACATGTGCGGCGCTCTGCAACTTTGGCCATTT TC

TTGTCACTGCTTTAGTTTCTTATAACAATTTGTGTTATTAACCGTGCAGCATGTTCC TAGA

ATAAATGGAGACATACCTTCAGTCGATGAAGCAACTTCTGATCATCAAAGGCTGCTA GA

TAGGTATCTCCTGATCCCTCCGACTTAGGTTTTCAAGTTGACAGAAATTTTGTGTAT GAA

CAAGTTAATGACATATGATCTTATGGTAGTGCTTTGCTTAATTCTCTCTCAGATTAG CAC

TTTCCAAGGTAATGAAATATAAGTTCACTGCGAGAAATGCATACCTCTATTGTGATT AGG

TGTCCTGTGTCTCCAGAATCATTTCTGTATTTTTTTTAGGTTTTCTCTTTCTTTATT AATTC

ATTCTTTTCCCGGTGGTTCTTACTGCCTGCTATTTGAAAAGTAACATCAAACCCCTC ATG

C ATTTTG AG AAAAG AG AATC AAATG G GAG AG ATG G G AC C C G G G AC AAG C AC AG AAAAT

AATTGTTGTGGAATGAGCCAACTAGTATGTTGTGCTTTCTAAATAAACACTTACGAA GCA

CATTCCAGTAGATACTGTTAACCCTTTGTTTGCTTAAAGTCACAGGACAATGCATGC GGT

TTTTCATTTTGTTCTGTTTTTTTATTAGGTTTTGTGCCACCTAAGTTTCCTAGGAAG TTGA

TATTGTTGTGTTTTTTCATTAGGTTTTGTGCCACCTAGTTTCCTATGAAGTTGATAT TTTT

GCTAATTCATTTGAATGGTAATACTATTGCTATAATAACTTATTTTCTGCTAAGCAT CCCA

TG CTGTGATTTTTCTTG CTTCAG ATTGC AATTATTTG ACTTGGTG GAG CAC AAAGTG CAA

GGAGATGGCAACTGTCAGGTTATCATATTACCTGGTTGATTTATCTTGCAGTAATGA TAT

CAAACTGTCTAGATGCGCTCATGATTTGATTTCTAGTGGAAGAAAAAAACTGTATTC CCT

TTCCTTAAACTACTTACAGCATAAGTATTAATCTTAAACATAATGTTTATCAGTATT CCTT

CCTTTTGGAATTGTTCTGGTAGAAACTTGTTCTTGGTGTTTGTGACAATGTCTTAGC TTT CTTTATTACTTTTTAGTTATGCTTGAAAACAGTGGAAACAGTAAAGTTATCTCCATATAA A

GTTGTCTCTGTGTG ACATATAG GTC ATG AGTTTG AG CCGTG GAAG CAG CCATTAATGCT

TGCATTAGGTTAGGCTATCTATATCACACCCCTTGGGTGAGGCTCTTCTCGGGACCC TG

CGTGAATGTGGTCGGGACCCTGCGTGAATGTGGGATGCTTTGTGCACTGGGCTGCCA T

TTTAGTTATGCTTGAAATTCTCAACTTTTTAATTTTCATATTTGGTTTTTACTTGTC TATTC

TTTCCATTAGCCTTAAGCAGTTGCTCACTGTTCATCATATTTCATTTAAGTTTGTGA AGTG

TGTGAGACCATATACAATATTGCTGAATTATGATATACATTGGGGATTGGCAATTTC ATT

TAAATTGAATTCTTTAGTGATTAGTTCAATAAAGTCACAAAAAGAAAATCGGACTTG AATT

ATTGATTTGGGAGTTATTTAATTATGAAATGAATACTAGTAAGAAGCGAGTCAAGAA ATT

TGAGACTGAATGTGAAAATTGGATGGAAGATGTTCACGGAGAAAAGCTGATTAATAG TA

ATGTTGGTAAAATAGGAAGGGATTAGAACTCGGATAATGAATGTAGAGCGAACTACA AA

ATATAAGAAGTTGAGAGTTCGGATGGAGTTGGGGGGATGGGTGGTGAATGGAAGTGG T

TCATAATAGGATTTTGGAGAAAAACTGGAATATGAAAACTCAAGTGATATATCGGCA GGT

TG CTTGC CATG CCAAGTG CCAACTTTATGAAAATTATTTGTG GATTTTCAGTTAGTTTGA

CAAAAG G ACAATTCAAATTTTTTCTTAACTCTTTAGTATTATG G G GGTATG G G AACTTAG

CCCGTTTTTCCTTTCTGTGAATTAGGTTTCACCTCCGATTCTAAGTTTGTTGAGATA AGG

TTCCATCCTATCTCTTTCTAATGCACATCTATTCACCTGTCTTCTTTTGTTCCAGTT CCGT

GCTTTATCAGATCAATTCTATCGTACACCGGAGCACCACAAATTTGTCAGACAGCAA GT

AGTCAGTCAGGTGCTTGTCTAAGTGAATAAGATTATCTTCTCTATGATATTTCGGTT TTC

ATTTTTGAATTATTTCACATACATTTTAGCTGAGTAGGTTTTTTAAAATCATTTTGT TTTGT

CAGCTTAAACATCATCCAGAGATGTATGAGGGATATGTCCCAATGGAATATGGAGAG TA

CTTGAAGAGGATGTCCAAGTAATTCTCAGACCTTTGTTTTTCCATTTTGAAATCGTG CAT

TACCTTCTCTCCTTTAATTTACATCTGACTTTTATATTGGTTGAACCAGGAGTGGGG AAT

GGGGCGATCATGTTACGTTGCAGGCTGCTGCTGACTCGGTACTGATTATTGCCCTTA CT

TTGGTTCCTTGGTTTTTCTCTCCATTTGATTACTGCCTTTTGGTTTGTTTCTTAACC TTGG

AAATAAGAACCGTAAGCTATTGTTGGCCAAACCATAACTTGGATGTCCTCATATGGT GC

CTTCTTTGGATATTGTAGTAGCTTGTTAATTGCAAGTTGATGGTATGTAGGAAGTAA CAT

GCTTCTATAGAATTTGTGATCCTGTAGTTTTTCATGAGTATGTGTTAATCCTTTATT TTGT

AGTGTG G AAGAAAATGTGTGTTTATGTG CCTCTCTTG CTATTTTTTTCTTTCAGTCTC CAT

TGTGTGGTCGTGTTGCATTTTTTATTGTATATGCTTAGCTCCTTTACGAAGATTTTG CTTG

TGATAATATTAGATGAGGATGGATTGGTTTGTGAATGACCTATTTGTGCTGCACATT GAA

TTGTCTTTTTGTCTTGTGAGCAGTATGGTGTGAAAATTCTCGTTATAACGTCATTCA AGG

ATACATGTTACATCGAGATTCTTCCGAAGAATCAAAAGTCAAACAGAGGTAACTAGT TCT

CAAATTTTGAATTGCTTTTACCTTTCTGTAATATCCTCTACTTCATAGAATGTGATT GAAC

AAGTTTTCATTAACAAAATATTTGGAAGTGGTAGTGGCATGAATTCTCAATTTCCAT GTAT

CTCATATACAAATACATGTGTCACCGAATTGTGTACGTACTTCCAGTTCGCAGTTAA GGA

TTTTCCCCAACTTTAATTCCACTATGTGAGCAACCCTTACAAAGTTCTTCAAACATT CTTT

ATTGGTTATTTCTGAAATGTGGTTTTAGACTTATAGATAATACCAGAATATTATCCA GGG

CCAAATTTCAACAATTCAAACAAGGAAAGATAGTATGATCATTCTTAACACCACTGT AGT

TACCCCCCATTGAATTTTCCGATAACGAGGGCTACACATGGAGATGAAGGAGATGAT CT

G ATTATG CATATTGTAGTAATAGTAGTAG ATAAATTTATAAAAGTAG CTAATACTAG ATAA

CCCGATTTATTATCTGAGTTTGATTCTTGAAACTTGTTCGAGTTATGGAGAAACTAG TAT

GGCCTTTGTATAGGACCTTTCCGAAAGTCCCAGATCCGATAACGATCTGATAAAGTA GG

ACCACTGGAGTTATGAGACAACATTAGTCTATAATATAAATGATCAGAATATTGCAA CAG

AAAACATCAGTTGTCTCTTTCCTCTCTTCGATAGAGGCAAAGGAGATTGAATCTAAT TGA

TTCCGAAATGCTTCATTGGATATTCAATAGTTAAATCGAACTATTCATCTTGCAACT CTGA

AAACAG ACGTG CTATACATG CAGTATAAGAG CAACATAATTAACATACACTAGGTTG GA

GGTTTACTTATCTATGTTTAGGTGGTCGGTTTATGTGCAAGTTTTCCATTTTTCAAC AATT

TAGAGTTATCAGAG CTACTTATAAG ACATG ATACTTTTG CTGTATTTAACTTTTTTTGTAA

AGTTCAGCAAGAGTTTTTGCTAGTCCGAGTGGAAATAATTATTTGTTGACTACCCAT TTT

CCCTTTTTACTTGAGAAAAAGATTGAGAGGGGGGAGGCAGAGCAGCATCATGAGTCA T CTGGAGAATGCAAAGAGTACTTTTGATCTTTGGACTGAAGAGTTAGAACGTTGGCCTTC

CTTGATAACAAAATTTTCAAGGGGAGGGGGGATTAATGGATGACTATGAACCGTTAT GG

ACAATCTTAAGCAAATCCGATCTTGGGTTTCATGAAAAGGAGATTCCCAAGGGTTGA CC

AGTTTTG GTTGTTTTG AAATCTAAAAG ATG G ACTG ATGAG CATCCTTATTG CCTTTTTAG

AGACCTAAAGTTGTCTACTTTAACATATTGAATGATTATGAGACAGTTCTAAGTTTA ATGA

ATGGGGACAGCTTTATATCGGTCATGTATGCATGACATTTATTCCATGCCATTATAT GAT

ACCTCTACTGTG AAG CCAAG G GTG G GC ACAACTTAG CAATG GATACTTAAATCTGG AGA

TTGCTTGGAACATCTTTGTGCATATGCACTTTGTAAGAGCGATGTTTGAAGTCTTGA TAT

TG ATTCG CTTACAC CAATTTAATTGTTTTGTTTCTCTC GTTTTTCTCTTG GTG CAGTCATA

TACTTAAGTTTCTGGGCAGAGGTGCACTACAACTCAATCTATCCTCAAGGAGGTAGT TG

CTTGTACATTTATCTCCTGTTCCTGTTTATTTTGTCTTAGCATTGAGAGTTGAGTGG GAG

TTTGCTGATCTTGCTTAGTTTAGAATTCCATTCTATTATCATATTATTCTAATACTG TAGTT

TGATGCTCCATTCTAGACTTCCTGCCATTTGATCTTAAGAAGAAGAAGAAGAAGTGG AG

TTTCTGGAACAAGCAT

SEQ 28

TCAAGAATCAAAAGCTGAATCTTTCGATACAGAATGGATCATGAAGTCAGAACAGCTTG

TTTCTGCTGTAGACAGTGAAGAAGTAAATCCAAGCATAGAAGTTCCCAAGTTAAACT CC

AAAAGATTATTCTCCAACTGGTAACCCCCTATAACAATTGAAGCCTTTTGATTCACC CCT

CCATCCAAAAATCCCCAACACATCACTTCATCACTTACTTTCACCATTGAATTTCTC CCA

TAAATCCTCCACTTAACCATTTCACTTTGCAACACAAGATCCATAGTTGGAACATTT GGC

CCCACTTTTGTGACATCTATCCCCTCTGTGCTAAAGCAAAGCTCAAATGGTGCTATG GA

TTCCACTTTAGTCAAATTCACGGAAATAGCAATTTTTTCATAAGCTTCCATAAATGT CCTA

TAAATTGAGCTTTTCATGCTAGTAAAAGGGGAAATTGTACTAATCTTTGTCCCGCCA ACA

CCTTCTTCATCCATTGTAAACAAAGATATGTTTAAAGACAGTTTATTGCCATTAATT TTTA

TGGATTTGACATTGATGTAATACTCTTCTGAAACACCATTTTTAGATGAAATCAAAG GTG

TGTAAAGCATAGATCTTGAAACATCATGACCCAAAAGTGACAAGTAAGGACTTTCAC CA

GATATTATAGCACCATTTGAAGAAGACAAACAGAGAGCAAATTTCCTCTGGAAACCA AAT

GTATCAGACAACTGAGATGGCAATGCAATTCTTGAATTTCCAAGTCCTAACATTCCT TTG

GCACCTCTAGCAAGACCTTGTAACAAAGTAGTTGATGCACAAGAAAACAAGAATTCA TG

TATTGAAGCAATTGAACCTGTTTTTAACTCATCTATGAATTCCACAGCAAACATGTC CTC

TGCTAATTCCCCTGTTGCAGCCACTTTTGTGAAAGGATTTTCTGATTGTACACCACA AAT

CTTGTTAGTG CAACCATTAG GTTTAG CCATTGAG CACTTG AGTG AATTACAG G G GACAA

GTTTCTGACTTGAAGATACAGAAGTTAAGCCACAATTAGTCCAGAGAAACGAACCTC CA

AGATCTACAACTAAATTGGTAGGAACAAGATGAGCACCCATGTAAACTTGAGTAACA TA

CTG AAAAGTG GACAGATCTTTGTTTACAG GAAG AATAATG G CCTTAGG CAAAAAG GGTC

CATGAG AAACTG AGAAAG AAACAAAG AAAAAG AACG GAAACAAAG GAAG G AATATTTGA

AGAAGTGAAGGCAT

SEQ 29

TTAGGCCTCAATCAGTTCTCTAATTGGTTTGCTGTTTATGTTAGCTGATGGGATATTAGT

GAATTCAGAGAGTATAGCTCTGAATTCATCTCCTGTAAGAGTCTCCTTTTCTAGCAA CAC

ATCCACTAATTTGTCGATTGCCTCCCTGTTGTTCCTTATGTGGTTCTTTGCAATTTC ATAT

GCTCTCTCAATTATGTGCCTTACCGATGCATCAATGTCTTCTGCTAGTTTCTCTGAC ATT

TGATTCCTCGCCAGCATTCTCAGCACCACATCACCACTCTGTGTTGCTGGATCTGTT AA

CGCCCATGGTCCTATCTCAGACATCCCGAACATTGTCACCATCTGCTCATGATATAA AC

ATTGGCAAGTTAATACTTGTGTGTATTCGAATATGTTGTTCTCTTTTAATGTGGTGC AAC AAGATGATGTGTTAAGTAAATACCTGTCTTGCTATTTGAGTTATTTGTTGCAAGTCTCCG

GCTGCACCAGTAGTGATTTCTGCTTCACCAAAAATTATTTCCTCTGCTGCTCTACCT CCT

AAGCTTCCAACTATTCTAGCAAAAAGTTGCTGCTTAGATATCAAGGTTGGATCTTCA CCA

GGAATAAACCATGTAAGACCGCGAGCTTGCCCTCTTGGGATCAATGTAACTTTCTGT AC

TGCATCATGGCCAGGGGTCAATGTCCTATAAGCACAAGGACACATTCTTTAGTACTG TG

TCTTTTGATTACAAATAACAAACTGAAAAGATTGAATACTTAGACAGCTTCTTAAAT TTGT

CCGGTTTTTTCATCTAAACACCTTGTCTAAGGGCCTGATATATTGAACACTTGATGT TAG

TTGAAAATTCAATAAGGAGCAAATTACTCCCTTTTTTTGCTTCATGTATAATCTAGT ATAA

ATGAAAATAATGAGAGGAAAGAAATGATTGTTAACTTACGCGCAGACACCATGTCCA AC

TTCATGATATGCTACCAAAATCTTGTTTTTGCCATCTGTCATCTTGGTTCCTTCCAT TCCA

GCAACAATTCTATCGATGGAATCATCAATCTCTTTCGAGGTAATCTTATCTTTTCCT CTTC

TTCCAGCTAGAATAGCAGCTTCATTCATGAGGTTTGCAAGATCTGCACCACTGAATC CT

GGAGTTCTCATTGCAATAACACTTAGAGACACATCTTTATCAAGCTTCTTGTTGTTA CTA

TGAACCTTCAATATTTCTTCCCTTCCTCTTATATCAGGCAGTCCAACACTTACCTAA TAAA

ATGAAATATCAATATAAGTGAAGTGTATTCTGGAAACTGTATAATACACCTCATTTT ATTG

GAATTTTACAATCAAAATCTCATTTTATACCTGTCTATCAAATCTTCCAGGTCGAAG CAAA

G CTTGATCAAG AATTTCAG GCCTATTAGTG G CAG CAATG ACAATG ACTCCAGTGTTTCC

AGTGAAACCATCCATTTCAGTGAGAAGTTGGTTAAGTGTCTGCTCTCTTTCATCATT TCC

ACCGCCAATACCAGTTCCTCTTTGCCTCCCAACAGCATCAATCTCATCAATAAAGAC TAA

ACAAGGTGAATTTTCCTTTGCCTTGTTGAATAAGTCCCTAACTCTAGAAGCTCCCAC ACC

AACAAACATCTCAACAAACTCTGAACCAGAGAGAGATAAGAATGGAACCTCTGCTTC TC

CGGCAATCGCCTTAGCTAGCAATGTCTTCCCTGTCCCTGGTGGCCCTACTAAGAGAA CT

CCCTTTGGTATCTTTGCCCCAACTGCTGCAAACTTTTCTGGGGTTTTCAAGAACTCA ACA

ATCTCTTGAAAATCTTGCTTTGCATCATCTACCCCAGCCACATCATCAAATGTTACT CCT

GTATTTGGTTCCATCTGGAATTTTGCTTTGCTCCTGCATTATTCACAAACAAATACT AGTT

ATTAGTAGTTGTTGAAGATTACATCACTAGACATAATGTTCAATCTTGATCATGTTT ATGG

AATTTCTATTATAGCATACTGTTGGGTTTCTTAAAGAGATGGAAATGATTGAAATTG TCTC

TCCTAAGTTTTATTAACTATAGAGCGATTTAAATAGCCAACTTGAAAATAAAATACACAAA

TTTATAAAATATTGAAAAACCTAAAATATCTCAACAACCTAAAATATCTAACCGAAA TTTA

AATTCAAACAAAGTAGACTACTTTTACCACTAAAAATTACTCCTTCTATTTCAATTT AGAT

GATACAATTTCCTATTAGTACGTTCCAAAAAGAATTATACATTTCTATAATTGAAAA TAAT

TCAACTTTAAACTCTTTATTTTATCTATTTTAACCTTAATAAAAAACTTTTATAACT ACACA

AATATCATGCCCCCCACAAAGCTTTTACCTCTTAAACTTTTTCAAAAGTCTTCTGTT TTTT

TTTTTTAAACTACGTGCCGAGTCAAACTAACTAATTTAAATTTAAACCGAGGAAGTA TTAT

TCTAGTAAATTAACAGTAACAGAAGCTATATACAAGACATACCTTCCTAATCCAAAA GGC

AGGTTTGGCCCTCCAGGAGTATTTGAAGAAGAGGTTCTCAACAGCAAAGAGCCAAGC A

ATATCAATGGAAAAGCTAAATTCCCAAGTAAATCAAGAAGTGGCCCTATGACATTCA TTT

CAGGGAGATGAGCAGCAAAATCTACATCCTTCTCTCTAAGTTTTCTCACCAATTCTG GT

GGCAATCCTGGCAACTGAACTTTAACTCTCTGGACTTTGTTAAGAGCAGGATTGAAT AT

CTCAG CAACAG CACTACTCTCAAAAAAATCAACTTTTTTC ACAG CACCTTC ATTCAAGTA

TTCCAAGAATCTTGAATATGACATTCTACTTGAAGTTGCTTCAATTGGTGCTTCAGT TTCT

GCTCTTGCTGGTTTAGCCAAAGTCCCTGCTACAAGGCTCAAACCACTACCACTCAAC AG

CTTCCTCCTATTTATTCTGGTGTCTGAATATGATTTTTGACATGGGGTTTCTTTACT AAAG

ATTTTAG G ATTGTTAGTATCCTTAGAAAG ATCTTG G GATTTG CATAG GG G AAATTGAATG

ACAGACAAAGAAAGGGCAGGGGACATTTTCAT

SEQ 30

TTAGGCAGTGGGATAAGAAGCGTCCATAGCAAGTCCACAAAGGCCTTCTTTCTCATGAA CATCCCTTTTGATGCGCATATATCCACTGTCACCCCATTTACTGCCCCATGAATTCTTTA TAATCCAATATTTTGTACCGTCAGTTGTTGCACCATATCCCACTGCTGTAACAGCGTGGT

TAAGCCAAGTGCTGCATGATCCACTGAATACACCACTTGAATAGAACTGGAAATCGA AG

CTACTCCCGTCTATTGCCACCGAAACAGGTTGATTAGCCACTGCCTGCAATAGAGCC TT

CTCACTGTTCGCTGGCACATCTTCATATCCTGTAATAAGAGGCGTAAGTCATAATTT CAA

GCTTATGGATTCGGAATATTTATCGTTTGAAGTTGCTGGGCTAGATCATAATTAAAC CAA

CTCACCCAATTGAAGATTTGTTCTACCCCTTATATTTTTATGGGCTTACCTGTAATT TTGG

CTGCTGAAAGAGCTGACTTTTTCTTGTTGCAGACACCATCTTCTCCTTTGTATGGAT AGT

TTACTTCTGTTGTGAGGCCCTTGTTTTTCAGGATGAAATCAAAGGCAGTGTCCAAGA GT

CCACCGCTGCAACCTTCGTCCTCGCCTTCGACATCACAGTCTACAAGCTCTTGCTCT GA

TAAAGGGATCAACTCTCCTGTTTTCAGTTGGTGTAGCCCTTCCAT

SEQ 31

ATGGGATGCCGCATGAAATTCTTGAATGTGGTTTTGGTGGTGGCGGCGGTGATGGCTG

CTGCCGCCGCCGTGGCCTTCGGAGCTGAGAAATTGCCGGCGGGAGTGCTTAGTTTGG

AAAGG ATTTTTCCTTTG AATGG G AAG ATG G AG CTG GAG GAG GTTAG AG CAAG GG ACAG

AGCTAGGCATGCTCGAATGTTGCAGAGTTTTGCTGGTGGTATTGTTAATTTTCCTGT TGT

CGGTTCATCTGACCCTTATCTTGTCGGGTAATTACTTTGTTACGACCAATTTGATAA GAT

TATATTTGTGATGTTTTTAGTGTTTTCTTCCTTTTTCTAATGTGGAGTTATATTGCT ATATT

TGCTATATTTTATTTGGTATGATGACGATGATATGGCTTGAGCTTAAATGGAGAAGT GAT

GATTGGTATAGCGGACTCCAACTTGTTTGGGACCGAGGCGTTGTTGTTGTTGAGTTT TG

TTTGGTTAATTTAGTCATTTTTTGGAAAGTTTGATTCTTTATGATGTTAAAACTTGG AACT

TTTGGTGAATGTATGGAAGCTATGGACTATTTGATGTGTTATTAACGTCTTATGAAT TTG

ATCTCATGAATTCTGATGTAAATTTTGTTTTAGTTTGAGGGTAATTGATTTTAAGTG TATT

AAAGTACTTGTAACACAATG AATTTTG GTGTG CTGTTTTTCTTTTCTAG GTGCTTTCTTGT

TAATTATCGGTTGGATGGTGTTGTTGATGGTAGTGATAGTTCTGTTTGCTTTATCTT GTA

TCGTTTCTTGTTCAGGTGCAAAATATTTAGGTACAATTGAATGATGAGTTTCGTGTT GCT

CTGAATATGAACATTAGCTATATCAGTTTGGCATTTGCTTTCTGTTATTTGTGGATG AGG

GATATTCTTATTGATTTGACTATCAATTTTGTTGACCATCGTCTCTTTCTCTCTCTT CTAT

GCTTTTGGTGTATTTGTAGCCTTTATTTTACAAAAGTAAGACTGGGAACTCCACCAA GAG

AATACAATGTGCAGATCGACACTGGCAGTGATATCCTATGGGTCACATGTAGTTCCT GC

GATGATTGTCCTCGGACAAGTGGACTTGGGGTAACTCATCTTCCCTTCATCTTGTTA TTA

CTTTTTTAGTTTCTTGTTTAAAGTGTGGTGAAGGAATAAACTGTTACGTGGGTGCAG GTT

GAGCTCAACTTCTATGATGCTACCATCTCGTCAACTGCTTCTCCCATTTCTTGTGCA GAC

CAAGTGTGCGCCTCTATAGTTCAAACTGCCTCCGCTGAGTGCTCTACGGAAACCAAT CA

GTGTGGTTACTCCTTTCAATATGGAGATGGGAGTGGCACAACTGGCCATTACGTAGC C

GATTTACTATATTTTGACACAGTCCTGGGAACTTCTTTGATTGCCAACTCTTCAGCA CCG

ATTATTTTTGGGTGAGTTCTTATTTTTTAAATACCCCTATATCTATACTTAAAATTT CATTA

GAAATAGTTGTGGGTCATTTGAACCAGAAATATCTTTGGCCCAATTTACAAAAAAAC CAT

GTTTGTTTACTCAAAG CTTATACTTG GATATG ATTTAAAACAGGTG CAG CACCTCTCAGT

CTGGGGACTTGACCAAGACGGACAGAGCAATTGATGGGATATTTGGGTTTGGTCAAC A

GGGTCTTTCAGTAATATCTCAACTGTCTTCTCATCGGATTACTCCTAAAGTATTTTC ACAT

TG CTTG AAAG G AG AGG G AAATGGTG GAG GTATACTAGTCCTTG GTGAG ATTTTG G ATC

CGAGAATCGTATATAGTCCCCTTGTTCCGTCACAGTACGTATTGTTACAGTACAATG AA

GTTTCTTTTCTTGCTTATGACGAATATAGAGATTTAATTGTTTTCATCTTTAGTGTG CCTT

GTGCTACATGATATAAAACAGTTGTGTTCTTTATAGTTTGTGATCCAGCTTGAGCAT GTG

AAATATACCTCTCATGCGCTACATCCTGATTTTATTGAAATTTCGTCACTATATTAT TGGT

TTTGCATCTACAGATATATAGTAGTTGGGTCTTGGGAAGATGACATCAATGAAACTT TAC

TTTGTACATATAAAAAAGGGCAGCCCGGTGCACAATTTTGAGTGTTATATATATATA TAT

ATATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNN NTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG

TGAACATGTCGATATATGTGTTTATTTTTTATGTTTTTCTACATTATTTGTTTTCTA ACTAT

AAGCACGGGGATTGGCTGGGCACTAGGAAAGAAAATGGTTTGTAGCAAGCTTGATTG T

ATCCGCTTTCCACTTTTGCAGGGCGCATTACAATGTATATCTGCAGAGCATTGCTGT TAA

TGGACAGTTGGTGCCTGTTGATCCATCAGTGTTTGCGACATCTGGCAATCGAGGAAC TA

TTGTGGATTCTGGTACAACTTTGGCTTATATTGCCACAGAAGCTTATGATCCCTTTG TCA

ATGCTGTAAGTTCCTACATTTTGCCAATTTATTTACTCCCTCCGTCTCAACTTGAAT GTC

CAATTGCTCTCTTTGTCTATACCAAAATTATATCCACATCTCCTAAATATAAAAACT GACC

ACATAAC AACAAG AACAACTACG CCTGTTG G G GTTG G CGAAAAAG G GCAG ATAACTGTT

GATCAAAACCCCTGAAAAATGTTTTATCATTCATCTGCAGATAACTGCTGCTGTTTC ACC

ATCAGTTAGGCCAATCATCTCACGAGGAAAACCGTGCTTTCTAGTGTCCTCGAGGTT CA

TCCTTGTATAATTCAATAGATATTTACTTTGAGCTTTTAATGACAAAAATGTCTTTA CCCA

ACTGCTTTGGTGACTCGAATATGACACCCTGTTTTGTTCTGAAAACAGCATAGCAGA GA

TATTTCCCCCAGTTTCTCTAAACTTTGATGGTGGTGCATCGATGGCTTTAAGACCAT CAG

ACTACCTTGTGCATATGGGCTTTGTCGTGAGTACCAGAATCTGTATTGTGTTTGAAT GTC

TTCTTGAAGTCTCATCTGAGACTATATCAACAATGCTATATGCAGCACTATTGTCTT TTTG

ATGAACTACAAAG CTAAGTG ACTCAATTGAATGTTTCTAACAG G AAG GTG CTG CTATGT

G GTG CATCG GCTTTG AAAAACAGG ATCAAG GTGTAACAATTTTAG GAG GTTG GTTCCTT

GTTTACTACAATATTTATGCCTCAACCCCAAGTGGGTTCGCTGGTGTATTTATCAAC ATA

TCTTCTAGGACTATGGTTACATCTAATATCTGCTGCATCATTACGTGTGCTAGCGTC CTT

AATGGGTGCTTAAGCACTCCATCACTCAAACCATCTAAAAAGGACCTAACTATACCT TGT

CCTGTATTAATTTAAAAAAAAATATAACTTGGCGAGTGTCTGCCTAGATGGAATGTA CCT

TAGTGATTGCATGCTTAGATAAAGTCATCATTCCTGCAGATATCGATGTTGATGAAA CAC

GGCCATCCATTTTCTATTAGAATATTAGGTCATTGATTTGCAATTTGAAAATTGTAG CTTG

AAGATTGCAGTCCCTTTGTCTTTTCTAATTTTGCATCCGTTCTTTTATTGTGATTGA CCAG

AGGATAGGTATACTTGTAGTACCAGTAATTCGGTTAGATATGTTCCGTAGCGTGCCA AG

GAATTTATATTTCCTGTTTTGCCTAGTTTTTCGAAGTTTATGTCAAAGTTACTTGCT TTGG

ATTCCGTGCGTGGGACAACTATATATCTTGTAGAGGCAGTGATTGTTACTTTGATGT TGA

AAG ACAG CAG AGG GTGG AATTCATAG AAAGTTAATACGTG GAAAG GG GTGTTAG ACTG

GATACATGACTGCAGTCCTAAATCAAGGTTGGAGTTGCATTTAATATGGCTATTCCT AAA

GGCTAAAAGACACAGTCTCATCAAGATTGTATGTTGAGAACAAGTCTACGAGAGTGC GC

TCTGACTGTTCTCAGAGATAGGTGCCTCAGACTCTAGGGATTAAGCTAACTACGTGT TT

TG AGTG GTGTTTTGTTCTTTCTTTTCCTTTTTCACTG CTG G CTAACACAACTTG AAG ACTT

GAATTCCAAACTGCTTCAAGTTTTAGGTCTATTATTCATGCCTTCTAACTTCTTAGA GTGT

TGGTTCCCTAGTCCTCTTTTTTTTCCAACTCGTGCATGCACGCTCATGCCCGTATAC GTA

CATGCAAGACATCTACAGTTGTAAACACAATTTATGACCAAATATTAACGGAAAGGT ATT

AATCCCTTTCTATTTCTTGTTTGTGTGTGTATGTGTAAACCAAGCACAATGTTATGA CCA

ACATTAAGTGAAGAGAATTAATCTTCTTTTCTTTTGGCCCATGTCCGTGTATTGTAT GCG

CATTGAATGTGTGAGCCTTTCCTTGTTTATCCTGGTTATGAGTTAATCGGTAGCATA CAT

GGTGAGGTTTCAGGATGCATGATAGTTGCAGATGATAGTGTGAATTAACAAAATTAG TC

AAAAG CAGCTC CATG GTGAAACACTTAG CAAG GTTTTGG AATAAGTG G AAACAAGATAC

TATCCAGGCAATGCAAGTTTACTCCGCCTAAAAAGTATTAAGGTGAAATGATATTAG ATT

ATTATGGTTCCTAAATGCTGGCAGTCAAGTTATTTAAGCTCAAATCTTCCAGGAGAA TAG

AGAAATGTCGTG GTAG ATGAG GAAG G AACACATAG AACTAAAATATG CTG GTTG AC ATG

AAAGAGTGTTGCATGAGTGTTATTAGACAGAAAGATTCCTAGAAAATTGAAATATAA GTT

AGTATAACGGTTGTGAGATCAACAATAATATGTGAGAGTGAACATTACATGTCCGCT AAT

TGAATATCATAAAAATGCGGATGTAAAGATGGTTGTGCAATCATATAAGATTGAACA TGA

TTAGAAACAATCACTTTTGTTGAAGG GTGCAAGTAGG G CAC ATATAG AG CATGAAAAG G

ACGTCACATGAGATGGTTTGGCCATATCCTATTAGTCCGCCATATGAACCGGTTATT AA GTGTG G CAATATTGTGTTTAAAGTGCTG AAAG GAAC GAG GTAGACTG ATG ACTAC ATTC

AAAAATTTGTCTCAAATGACTTAGAATCTCATGGAGTCAATACGGCTTGAACTATAA ACA

AAACCTTATGGAAGAAATGGATCCATATAGGCAATACTAACTAGTTGAAATAAGGCT TAC

TTGATTTTACTCTACTGGGTGCAGTATTTGTCAGGAGTCTTTATAATTTGGTTAGAG ACA

TGTTTGTAAGTGTTGGTATAAGTTAGAGATTTAGAGAACCTCTAGATTTAAGAGAAC CCC

TTGTCTTAAAAAGATTATGTACTATCGGAGTGAATTATTTCAAAAAAAGAAGATTAT TTAC

TATCAGAGTGAATTGGGTTCAAATAGAGCAGAATGGCCCCAAATGATATTAGACCTC AA

CTAGTTTGGGACAGAAGTGTAGTTGATTTATTGATATGCATCTTATGTGCAAATTTT ATTT

TAGTCATGCGTGTGCCCTTGAGCTCCTTCCTCTTCCCTCTTCCCTCCTAGTCATTCT ACT

AAATTTGGCTATTCAATTAGTTCTGGCTATGCTTTGGCATTTGAACCATGTATTCCT TGT

CAGTTGACTTATTGCATTGCTCTGCATCTTCATTTGTCGAGTATTCTGGATCCATAG ATA

AACAAGAATCTAGGTTGGTCTAACTGTATTTTTCTTGATTTTCAGATCTTGTTCTGA AAGA

TAAGATCTTTGTGTATGACTTAGCTCGGCAAAGGATTGGATGGGCAGATTATGATTG TA

AGTACCTCTTTTCAAAAATGAAGTCAACATTTGCTTTTGTCTCTATTTTGCCCCCTT TTCT

TTTGGGGTGGGGAGGGGGTTGTTTGTTGTGACATGACTTACTTCTTCCCTCTGATTT TC

TCTCTATTTTTAATGGTTTTCATGATTAATGTTTTTATTATAGGTTCATCATCTGTG AATGT

GTCTATAACCTCTGGCAAGGATGAATTTATCAATGCTGGACAGTTAAGCGTGAACCG TG

CATCAGGCAGTTTGCTGTTCAATCCGCGGCACACTAGAACTATATTTCATCTGCTAT CG

TTGGTTCTGATGATTGGTTCCCCATTTTTAACT

SEQ 32

TCAAG CAACAATAG GGTATGATG CG CAAGTTGCAATACCTACAAACATATCATTAG GAT

CAGTTATTTACCTCATCAAAATCATGACTACATTGACTAGAAAGTTTTTATTATTAT TTGT

GTCGATCTAAGATGAACTCAACTTGAACTCGTATATAGAATAGCATTATAGTTCAGA TTT

TGTGGCCTCTTTTTTTATGGTCCAAGTTGAAAGTTATTTTTCATTAATTTATCCTAA TAGA

ATAAGCTTTCACAATCCGAAAGCAGTGTGCATCCAGTATGCAGAAAGGGGGAAAGGA G

ACTTGATGATGTGGGAGAAACTTACCACACATGTTCTTTCCCATCTCCATTTTGAAG TAT

CCATTGTCACCCCATTCAGCTCCCCACGAGTTCTTGATCAGCCAATATGGAGTACCA TT

ATCAACACCATATCCCACAGCAAGAACAGCATGGTTCACATCCTGTCAATGCAATAG AA

AATGTTAATGTAGTATTATGTCATTGTCTCGCATATTTAGTGAAGGTCGGACCATAT ACT

TTGTTGATGTCATTGATATTTAACAAAAGAAGAAAAAGAATATGTGCCTCTTCATTG CAG

CAAAATCAATCATGTCTCAGTGCCAGATTAGTTGGGTTGGGTGGGGGTATGAATCCT CT

GTGTACACTGCTCATCAGACCTTTTTATTCCAATACTACATAATCTGTATATTAAGG CAG

ATTAGGAGTTTCTAAAAGTAGAGACTATTTCAATCTTTCCTATCATCCAGCATCTTT TTTT

GGTTGACAAAGCATGGTATCTTTCCTATCAGTATTATGACACTCGAATTGTTAAAAA TCA

ATAAACACAGAGAGAGAGAGACACACACACACATTACGCATAACTTAAAGATAATTG CA

GACACCCTTCTCGACGAGTAATCAAGCACTGGGTTATAACTAACTACTGGGTACAAT TC

TTTCCCCTACGGTGAGTTACATGGAAAATGTGCTTCTTCATCGACAAAGAATAGCAG ATT

AAAAGGGCAATATCTCAAATTCATCGGAACTAAAAACTCACCTGGGGAGTGTTGCCA CA

TACGGTGCTGCTGTAAATTCCACCCTTGTACTGTTTGAAACCTTTTACCACCTGATA AGC

TACACTAACCGGTCTAATAAATGCAATCGCGTATTTTAGTTCATCTTCAGCACCCTG TAA

TTATAAATGTTTAATTGTAAGGAATCCAACCAATGCCCATTGACCTTTTCAGGTCAA ATA

ACGCATATTAAAAGATCCACTCAACTGAAAATCATCAACAATGAAACACTAGAACTA TGA

AAGGTTTTTCCCGTCGGAAAACTACAATAATCTTCCTTTAGTGTGTCAAATACAAGA AAT

G ATAAG CACTG ATGTAAG CAGTTAAAAG CTG ATC CAAG ACATTGCTGTTCCAG AGTAAA

GAGGTATCCTACCTTGGTAATATTAACAGAATCAACGACTTTAACAGCAACATTTTC TGA

CGAGAATTTGCATACACCAGCCTTTCCAGCATAAGGATATTCTTCTTCAGTGTCAAG AC

CACCACTGTATTTAATGTACTCAAAAGCCTGTGATGGAAGCCCGCCATGGCATCCAA AG

TTATTAAAAGCTCCAGCACAGTCCAAGAGCTGCTGTTCAGACAGAGAGATGTTCTTC CC AAATGCCTGGGCATATGCTGCCTCCAGAGCACCAGTAGTGCTGCCAAAACATCATGCG

CCGACTCCGTCATTTTTCAACTTCAGAAACCCGAATAGAATGGAAAGGATATAGAAT TTA

CATGTTGTTAAGAATAAGTAGTTTAGGCTTAACGCATAGTTGATTAACGATTGATAG GTA

GAATGCCATTCCAAACTGAAAATTTAGGAACCTAGTGACATGATCCTAACTACTCCT TAC

CTGAATGTCCAGCAAGACCCGCACTTGCCCTGCTTCTTCACTGGGCTTACTATCCCT GC

TTCCCTCCAGTCTTTCTGAAATGACACATTCGGAGATCAAAATTAGGAACAAGAGCT ACT

TTTGGTAACCAACAATTCGTTCACTTAGCACATGCTCTTCTAGGCAGCGGCTGAATG GA

AGGCAACTGTTCCTGAACTTACTACTACTTTTAGCTTGGAAAAGAAGAAAAAGTAAA TAT

ATATAGATAGACAATACACACACACGTTTTTCTAAACACACATATATACCAAATTTT TATT

TCATTCAGATACAGTGGATGCAGCAGTACTCTGCACCTCTTGACTTAAAATCCTACG TC

CAGCGTTGCTCCTAGATGCAACAGCTATCTATGTTGTCATCTCAAGAAATTCCATCA TCT

TGAAAGTAATAAAAGTTAAAACATAACAAAGTGATAAAAATCCAATGAAAGAAGAAT GAC

AGCAATATTAGGCTGCAAAATGGACAAGACTAAGCATTTATGAATTTTCCTTTTAGG AGA

ACATACAAGATAGAGGCACAAAGAAAGCACTGAAAGCTAGATTTCCAATAACAGGATTT

CCGGTTGGAGTGAATGGATGCAAATGGTAATGATTTTTGAGAAAGTATCATAAACAA CT

GCAGAAAAAGATATAATGGAGTTCTGATTGCAAATACCGTCTCTGGTAGGTTGACAT TA

GTGAGCTGAAGATCGCTCTTTGTGGTAGCGGAACAGTTTTGAGGAGCTCCTAGCCTT TC

TCTCCTAAACTCATCCCATGTTAGGTCAGTAAACTCTGCCAAGAAGCATAGTCATTG TCA

AGAG CAAAATACAG CAAG CAAAATATG CAGTG CCTTCTTGTTTCTTTATTTTGTTTGTTCT

TCTCCTTTACCTTGACCTTGGTTTTCATATTTTTATTTTCCCTTTCTTCACTCACGT ACAA

GAAATCTGCTAGGAGTTCTAATTAGAAGTAACAAAAACATTAATCTACTACATTTTG AAC

CACAAATCAAGTTTTTGGCGTCGTTGAAGTCTTAAGAAGGCATCCACCTTAGCATCC GC

ACCTGGGCACAACAAGGTTGAGGGAACAGAGGGTGGACGCGCAGCATCCACCCCAGC

ATCCGATGCTGAGAAGATCAAGATGAGGCGGACGCGAAGCATCCAACTCAACATTAA T

CCCTGAAGCTGATTCGGGAAAGCAAAAGGAAAACTTTGGCCCATAACTTTTGGACGC AA

TATATAAGCCAAAAACGGCTCTTTTAGGTCATCGAACACACTTTTTGAAGGGGATTC GA

CCTAGGGAGAGCAAGGAGCCGCCGTGGAGGCCGAATTTCATCTTCTTCCGCCAAACT T

AGTAATTTTTATGTTTCTTTGTATGATTTGTTGTTTGGCTACCATGTCTATGTGGAG CTAA

ACTTCAC GTTCTAATGTTCTG GTTCTTTCATGACTATTGTTATTCG AGTTG ATTTTC GTTT

CTTGATTTATCATATTAGTTTATTTATTCAATCCTGCGCTTAATTATTTGATTGCTT GATCA

CCAATTAAAACTATCTACGAATCTAGAATTGAACTCGAAAGTGTGAATTCTAGATTG CAT

ATAGGATTAAATAGAGCAAGTTCTTGAACCTGGGTATCGGGGAACGGATTTGCGGTT AG

GATAAACATATATACCCGATTGCCTTGCTTGGTTGATTTACACGAATTTCAAATGCG TTC

TTGTTAGTTCTAATTC CATAG ACATATTG G CGTTAG GTTAG CTTG AATAG AC GAGTAAGA

ACTCGAGAGATTCTTATGAGCAATATTAACACTGTCAACCAATAAACTAGATAAATT AGT

TAGTCAATTCAATTGAAGAATACAATAGGAATGTTAGATAACTCATAACCCTAGATC GTT

TTCATTACACTGATAATATAAAAATCAGCTCTTCCTTTGTTCAGAGTTCATTATTTATTTTC

TTTTTAGTTTAGTTACTTTTGCATCACTACTTTTGGGTTTAATCCTTGTTTAGATAA TTAAC

AAGTCCTCATGGGTTCGACACTCTATCTTATCACTTTATTACTTGACGACCGCATAT ACT

ATACAAGTCAACTTTATGTATCCACTCTATTCAGATCATTTCATGAACTATAAGTAA GAAG

AAAAACCAAAACGAAAAAGGGCAAATTGTCCATAAAGGCATATATTGTCCAGCCTAT AG

CTAATAGGAAACCATATAGTATACACAAAGGCAATTACCATTGACACCAAGTTTGTAGGA

AAGTCGTTGCTTGTTGTGAGACCTAATCATCTTCAGATTGTCCAAGTATATCTCAAA CCT

TTGCTTGATCTCCTCAACTGAGTCGTATCTCTTCCCATACCTTTTAAAGTTAACAGA TAT

CCCAAAAATAAATAATTTTTAAGTAAAAAGGAAAACAAAGCTTATTCATTCAAATAG GAG

GAAATTAGATGAATCGAACTGACCTGCGAACAAAGCGAACGAAGGAGAGAGCACGGC G

CGTTTGGCCGATGAGTTGGAGAATTCCATTCTCCAGCTCCTGCAAACCGTCGGATAC TA

CTACTTGCCTGATCGGATTATCATCGTCAAACGTCAACGCTCCGCCTTGTGCGGCGG C

GATTGAGGTCGCGATGAGTAATAGTAATAGTATAATCGAGGCGCGAGTCAT

SEQ 33

ATGAATC CTGAAAAGTTTACTCACAAG AC CAATG AG G CACTTG CTG AG G CACATG AACT

AGCTATATCAGCAGGGCATGCTCAATTTACCCCCTTACATATGGCACTGGCCTTAAT AT

CCGATCACAACGGTATTTTCCGGCAAGCTATTGTGAATGCTGCTGGTAGTGAAGAAA CA

GCTAATTCAGTTGAAAGGGTATTCAAACAAGCCATGAAGAAAATCCCTTCTCAAACA CC

AGCACCTGATCAAATCCCACCTAGCACATCACTGATTAAGGTGCTCCGACGAGCTCA GT

CGTTGCAAAAGTCTCGCAGAGACACCCATTTGGCAGTTGATCAGTTGATTTTAGGCC TT

CTAGAAGATTCCCAAATTGGTGATCTTTTAAAAGAAGCTGGGATTGGTGCAGCAAGA GT

G AAATCAG AAGTAG AG AAACTTAG G GG AAAAG ATGG CAAAAAG GTTG AAAGTG CTTCA

G GG G ACACTAATTTC CAAG CACTTAAGACTTATGGTCGTG ATCTTGTTG AACAAG CAGG

AAAACTTGATCCTGTGATCGGTAGGGATGAAGAAATTCGAAGAGTAATTCGGATTTT GT

CGAGGAGGACGAAGAATAATCCGGTGCTTATTGGTGAGCCTGGTGTTGGTAAAACAG C

AGTAGTTGAAGGGCTAGCACAAAGGATTGTTCGAGGCGATGTCCCGAGTAATTTGTC T

G ATGTTAGACTTATAG CATTGG ATATG GG G G CATTAATTG CTG G AG CAAAATATAG AGG

TG AATTTG AAGAG AG GTTGAAG G CAGTGTTAAAG G AAGTGG AAG AAG CAG AAG G GAAA

GTGATCCTTTTTATTGATGAGATTCACTTGGTTTTAGGTGCTGGTAGGACTGAAGGG TC

TATGGATGCTGCCAATTTGTTTAAGCCAATGCTTGCTAGGGGCCAATTAAGGTGCAT TG

GTGCAACAACTCTCGAGGAGTATAGGAAGTATGTCGAAAAGGATGCTGCGTTCGAAA G

GCGTTTCCAGCAGGTATACGTGGCTGAGCCTAGTGTTCCTGACACTATTAGTATCCT TC

GTGGGTTGAAGGAGAAGTATGAAGGGCATCATGGTGTCAAAATTCAAGATAGAGCTC TT

GTGGTGGCAGCCCAGCTTTCGGCTCGATACATTACAGGTATGTCCTTTTTTGGATTG TC

ATTGTATTTTATGAATTTTACCTTTGATCTTTAATCGAGTAAAGATGCCACTACAGG AATA

TAGCAATGTATGTAATGTTGAAATGTGATGTGTCACACGTTTGTATTGTGGTTGTCA AAA

CATTTCCTAAAATTTTGAGGAGATAGTCCCTTTCCTTTATGTCTATGCAGGATGGAT GTG

AATCTAGTTTTATACTTAATTTAGCTGAATCACGTCCCATTTGAATGATAAAGTTAT TTTC

TG CTTCATTGTG CTTTTCAAG GTG ATAAC CTCTAACCTTTG GTTTGTAGTTTCAG ACTTAT

AAAAGTATGATTGGTGCGTGCTCACCTTAATTGATTGGATGGGATTATGTGTTTGCT CTC

TATTAATAC GAATTTTCTTTAAAG CTTTTTCTCTC CCTTG CTATG G AG AATTG CTACTGTT

GTTTTGCGTATCATTTGCCAGTTTGCCATAATTTTGTGCATATAGGGATTACTAATC TGT

GAATTTACGTTCAGGTCGTCATTTGCCAGATAAGGCTATTGACCTAGTTGATGAGGC TT

GTGCAAATGTAAGAGTTCAACTTGATAGTCAACCTGAAGAAATTGATAATCTTGAAA GAA

AGAG GATTCAG CTAGAAGTTGAACTTCATG CACTTG AGAAG G AAAAG GACAAG G CTAG

CAAAGCTCGACTCGTTGAAGTAAGTATACATCCCGGAAATGCTTTGACCTATAATTC TAG

AACCTGTGTAGGAAATGTGGACAAATAACGTAATTACTATTTCAGGTGAGAAAAGAA CTT

G ATG ATTTGAG GG AC AAACTC CAG CCTTTG AC GATG AG GTATAAG AAAGAG AAG GAAA

GAATTGACGAGCTTCGCAGGCTCAAACAAAAGCGTGATGAACTCACGTATGCTTTAC AA

G AAG CTGAAAG GAG ATATG ATCTTG CTAGAG CAG CAGATCTTAGGTATGG G GCTATCC

AAGAAGTGGAAGCTGCTATAGCAAATCTCGAGAGTAGCACAGATGAGAGTACAATGT TA

ACTGAGACTGTTGGACCTGATCAAATCGCGGAAGTAGTCAGTCGGTGGACTGGTATT C

CTGTGTCAAGGCTTGGTCAGAATGAGAAAGACAAATTGATTGGTCTTGCTAATAGAT TG

CACCAAAGAGTGGTTGGGCAGGATGATGCAGTTAGAGCTGTTGCTGAGGCTGTATTA A

G GTCTAG AG CTG GGTTG G GAAG G CCACAACAACCAACTG GTTCATTC CTTTTCTTGG G

ACCAACTGGTGTTGGAAAAACTGAACTTGCTAAGGCTCTCGCTGAGCAGCTCTTTGA TG

ACGACAAGTTGATGGTCAGAATTGACATGTCCGAATACATGGAACAGCATTCTGTTG CC

AGGTTGATTGGTGCTCCACCAGGGTAAGGACCCTTTAACTATTGATAGGATAAAAGA AC

AAATCATACTTTTACGAGTAAACTGTATCTGCCATAATGAGATTGTGGATTGCACCT TTT

GTAGAACTCTGTAGCCTCATATTTGTCTAGGTACTTAATAGTTTTACGTCTGAAGTG ATG

AATG CTG AACATGTTATGTGTGTG CAGTTATGTTG G AC ATG AG GAAG GAG GACAACTCA

CTGAAGCTGTGAGGAGGCGCCCTTACAGTGTAGTGCTTTTTGACGAAGTGGAAAAAG C

TCATCCCACTGTATTCAATACCTTGCTCCAAGTGCTGGACGATGGACGATTAACAGA TG GCCAGGGTCGTACCGTTGATTTTACTAATACAGTCATCATTATGACCTCAAATCTAGGA

GCAGAGTATCTCTTGTCAGGATTAATGGGCAAGTGCACCATGGAGAAGGCCCGCGAT A

TG GTC ATG CAGG AG GTAAG CTAG AACAG CCTATTTTCTGCTAATTTTCTG AG CATTGTTT

CCTAGTTTACATCTTTATTTGAGGAAGGATTGTTCACATATATCTTTTTGTGACAGG TGA

GGAAGCAGTTTAAGCCTGAGTTATTGAACCGGCTAGATGAGATTGTAGTGTTTGATC CT

TTATCACACGAGCAGTTGAGGCAAGTATGCCGTCACCAACTGAAAGATGTAGCAAGC C

GTTTAGCTGAGAGGGGTATCGCCTTGGGCGTTACCGAGGCCGCGTTAGATGTCATAC T

TGCTCAGAGTTATGACCCTGTAAGTATCACCATCTGGTATTTCAACCTGACATTTCA TGG

TGATTAGACTAGGGTCTGAGTTGAGATACCAACTATGCAGATTTTTGCATTTATCTT GCT

GTGGCGGGTTACACTTGTTTTTTCAGTTGCTAATTTCACTTATTATGGAAAATTATT TGTA

GTTACATTTTAGGTGATCTAACATTCTAAAAATTATCTTAGAACCGTTGGCGTATAG AAG

CGAAATACTTTTGACAATTGATTGTGCTAACTTTTGTTACAATTACATCACAGGTTT ATGG

TGCAAGACCTATTAGAAGGTGGTTGGAGAAAAAAGTGGTAACTGAGTTATCCAAGAT GC

TCGTGAAAGAGGAGATTGATGAGAATTCTACCGTCTACGTCGATGCTGCATCCAGTG G

GAAAGATCTAAGCTACCGAGTGGAGAAAAATGGAGGGCTTGTCAATGCTGCCACTGG G

AAAAAATCTGATATATTGATTCAGCTCCCTAATGGAGTGAGGAGTGATGCTGCTCAA GC

AGTGAAAAAGATGAAGATTGAAGAAATAGTAGACGAA

SEQ 34

TCACTTG CTTTCAG GTATG ATACTAACAAG GAG ACATACTATG CCAGTAACAACAG GG C

TCGCCACACTTGTGCCAGATAATCGTTTACAACGTGTGCTGATTTTGGATCCCATAA TCT

CACGCCCATATGCAACAATGTCTGGCTTTACACGACCATAACTGTCAAGGCACAGTA CA

GTTAAGAATCTATGGTTTGATAGACGAGTGTAATTATTTAACTGTTATTAGAAAAGC AAG

ATCCTAAAATATTCAAAGAAAAAGAAAGAAAAAAAAATGAAGCAAAGAACCATAATT TTG

AACTTAATTCTTTTTCAAGAAAAAAAAGAAGCAACAATAACAACACCAATAACAAGCCCA

GTATTTTTCCACAAGTGGGGTCTGGAGAGGGTGGGAAGTACGTACCCTTACCCCTAC C

CTAGAAGGACAGAGAGCTTGTTTCCGATAGACCCTCGGCTGGAGAATGGATGACAAA A

ATAATGGCAACAATAAGGAATAACAACAAGATAAAAATACTGAAGCCAAGAAAGCAGCT

AAACTCTAGGTAATAATAGCAATCTATGAATAAAAGGATATCATACTAACACTGATG CTA

GCGAACTGGGAAAGACAAAGAGATACGTTCGACTACCTACTAGCCTTCTACCCTAAT TC

TCGACCTCCACACCCTCCTATCTAGGGTCATGTCCTCAGTCAACTCCAGTTGCGCCA TG

TGTTGTAACCTCGCCCCAAGACTTCTTAGGCCTGCCTCTACCCCTCCCGATACCCAT TG

TGGCTAACCTCTCGCACCTTCTAACTGGGGTTTCTATACTTCTCCTCTTAACATGCC CGA

ACCATCTCAACCTCGTCTCCCGCATCTTTTCCTCCACCGAAGCCACTCCCACCTTAT CC

CGAATGATTTCATTCTTAATACCTAGTATGCCCACACATCCATCTTAACATCCTCAT CTC

AACTACTTTTATCTTCTGGACATGAGTGTTCTTGACCGGCCGACACTCTGTGTCATA CAA

CATAGTTGGTCTAACTACAGCCTTGTAGAACTTACCTTTAAGTCTCATCGGCACATT TTT

ATTACACAAGACACCGGTAGCGAGCCTCCACTTCATCCATCCAGCTCTGATACTGTG TG

TGACATCCTCATCAATCTCCACATTACCTTGTATTATAGATCTAAGGTACTTAAAAC TACC

TCTCTTAGGGATAACTTGTGTATCAAGCTTCACCTCCATGTCCGCTTCCCGGGTAAC GT

TGTTGAACTTGCACTCCAAGTATTCCGTCTTGATCTTGCTCAACTTGAAACCCTTAG ACT

CTAGAGTCTGCCTCCAAACCTCCAGCCTCTCATTAACACCACCCCGACTCTCGTCAA TC

AGAACTATGTCATCAGCAAATAACATACACCATGGCACCTCTCCTTGAATGTGTTGC GT

CAATG CGTCCAAAAAAAAAAAG AAG CAAAGAG CTTAATTGTGACTTTTTTCTATTTCATG

TTTACGGTTCATCTTTCTTCCTTTCCTTTTTTCCTTAGAAGCTGAGTGGATTGTACA AGA

GGCATTCAACAGATGTCATGCTCCTATTCATCCATAAAGTTTTTGCCATTTTCACCC ATC

ATTTTCCACTCAGCAGAATTTTACTCGAAGCATCACAACCATGGATAGAATAAAGCT CAT

AGAATGCTCGTTTGTTTCACAAGAGCTGATTTTTAAAGGCATCTTTTTTAAATGAAG TTG

GTACATCCAACACACTCATTGACTTCCTATGTGGTCATATAGTAGAACACAACTTTA ATA C AG AG AAG GAG AG G G C AG AAAAATAAAG AATAC AG ACTATACTC C ATTG C AAAG AGTAA

TACATAGCAAAGAAAGGAGAAAGAGATACCCATGAGGAATCTCCCAGGTACTCATTC CA

CGTGAAGAAAATGAGGCTAAATGATTACTTTGATCAATGGCACCAACACCAATAACA TC

ACTTTGATCAGCAGGATTGTTAAGAGTACCATAAAGTGGTCCATCATTTCCAATAGC AGA

AACCATGATAATATTGTTGGCAGTAAGCTCCCAAACCTAGCGGAAAATATTGATTAC GTC

CATCAATATAAAGCAACTATGAGGAGAGACTCCAGAGAGTAAAGGTGTTAAGACAAG TA

GAAATAAGACTGAATATATGCATGCTAAGTTTAGCAAGCATGAGAAGAGTGAAGTTG AG

GTAAGATTAGATAAGATTCCCATACCTAAAATGCACACAATTCAGATATCTAGACTC ATT

TTCCATGAAATTAGTATGATCCGTGAAGATATCACATCAAATTTGAATAGAATTGGT TGT

AATGGAGGTGCGCTACAGAGGGTTATGTGATAAGAGGATACCTATCAAAGTTAAAGA CA

AGTTTATATTGGGGTAAATGTTTGGCTTTGTATTAAAGATATGAGCATCACCAAAAT GCA

TATGCAAGATGGGTATTATAAAAGTTTCACAAGATAAGAAATGATCAAATTTGACAG AAC

ATAAAGGATAAAATAAGAATGTCGTTTGAGATGGTCTCATCATGTCCTAAATAAATC TCC

AAAGGCACTGGTCCTTAGATGGAAACCATGATGATTGGAGGTGCTAAAAGAGATGTA TA

CCTAAAATCACATGGAAGGAAGTTGTCCCAAAAGACTTATAATCTCGTTGAATTCAT ACA

GACTCAAAACAGAACACAACAGAAGCAAAAGTCTGTTATACGCGATATCGACTATTA AG

AATCAAGGTGTAGTCATGCTAGTGCACTTACTTTAGGTCCAATGTCTACTAGGAATC TTT

TTAGTCTGTCTGCACTTCTGAAGTTTGTATGTCAGTAGAAAAAGAACCTCCAGATTT ATA

GATATCCAAACTACTGAATCTTTGATGACTCCAAATGGAGCAGAATGGATGGTGAGG AT

TCATTAGCCAACCCAACTAGCTTGGAATTAAGGAGTAATTATTCGTGTTGTTGTACA TCT

CATCAATATAAAGGTGAAAAGTTCTGCTAATGTTGTTTCAGGTCCTGCTGAAAGTAA TGT

TAATTTCAGTGAAATGACAGGCTTAAACAACTCCGAATCTCTTTTACAAAATTGAGT ACT

AGATATAATATACAACCCTTGTGTTTAAGACATCCATGACATAGTTCAGCTTGCAAA ATT

AATAGATCTCATGAAACAAACGCCCCATTAAGCTCAAGAAAGCCAAGTAAAATCCAT GC

ACTGCATAAGAAATATAAGATTACATGCTGCCAATTATAATCAAACTTCTAATACTT CCGA

G ACCACATATTATACAG AAACTTAG ACAATAAGG G GTTATG GAAC AACAG CAAG ATCAT

TTCAATGCCTATGCTGTAGACAAAATGCAATCCAGTATCATACCACATAAAAACAAT AAA

GATATAAACCAATAGATAAGTGACCTCACCTTTTCCACAAAAGGGAGATCCAAATAA TCA

GGTCCACCTATGCTCAAATTCAGAACATCCATGTTGGTTGCAATTGCGTAATTAAAT GCA

TCGAGAAACCACGATGTGTAAGAGACCTGCAATAAACTGAAGAGCCACTTCTTATAA TG

CTAAATTGGTCATTACAAGATTGATCTTTTTATTTCTAACTTTTTTATAGGTCGCCT AGCG

TTGTCCTTGTCTGTAACAGTAGCTTTAGTACATGAGTTAGTGTTATTTATGTATTTT CGTA

TTCCTTGACTTATGTGATTACTTGTCGTTGCTTTCGTTCCGGCCTTCTAATTGCAAT ACT

CAGTTTTAGTTTTGTTCCTTTGTATTTTTTGCTTCGGTTTTCTAATTGGTGTGCTTG TTGC

TGCTCTTCCTTTTATCTTTCCTAAACCAAGGGTCTTCCGGAAATAACCTCTGCCTTC TTG

AAGGTAGGGGTAAGGTCTGTGTATGTACTACCCTCCCTAGACCCCACTTGTGGGATT AC

ACTGGGTCTGTTGTTGTTGTTGTTGATAATGATGGTGTCAAAGCAAAACTTGTCTCG ACT

ATTCCAAGGATACCTGCAACCTCCCACTAGCACAGGTACCGGGTATCTCAACCCACC AA

GGCTTAGGCAGATGGGTAGATATCACCTAGCATTTTTTATCTAGGCAAGGATTTGAACC

ATAGTCTCCAAAATTTTAACCCACTTCATTGAACGCTACCCAACACCCTTGGGTGCT ACA

AGATTGTTCCTTTTTGTGTGAATAGACTCTCTTTCAAACCCCAACATCAAGGATTCA AAC

CCATCGAACCCATGATGTGCGTCTAACTCACACATCACTTGTTGCGCTCTTACCACT AC

ACCAAAGCCCTGGGGGTGAATACTCCATATCATCTTGATTGTCCTTACTTGCGGATA TG

GTTTGTAGCTACTGAAATCAATAGATTGCACAAAGCCAATGATAGGTAGCTTAATTG GAA

AAACTGAGGTCCAAGAACGATAAGGTCTTGGCCTCAAAATCTCCATGAAAGGAAAAT AG

TATTTAATGTGTGCCTTATGTAAGTAATTTTGTTTTCCAATTTACTATTAGCAACAC TGTT

ACTTGTTATAGTATCTCAAACAGCGCATACAGTTTTTATAATATTTCAAACTGCTTA TACC

TCCCCAATAGTGGGCTAATACTAAGTATGGGCTTCCTTGAAAACAAAATAGGAATTA TAG

ATTGCACATAATTCGCAGACAAGTTCCTGGCTTTTCTAAAACATAAAAGTAAACAAT GTC

CTCCCTCCCCAATCCCCTGAAAAATAGTTGCAATCTTATCACTAAAGTCATAATAAG ATG

GCAGAAGAAATATTATATGTTCAATAACATAGCATGTAACATGGACTCCACCACTAA ATC CAATCAGTGGGTTTGGCCGCTGGAAAGGGTATGAGAAAACTGTGTATATTTAGGTGCAT

ATTCTTCC AAG ATATTGTAG CTTAATG ATG AGAAGTTAAAG CTAG CACAAAATAAG GTGC

AGAAG CAGAACTTGTCATTTACAG AGACTAG G CAGTCTAAAGTATTTTTTCTTC CATTCC

AGAGGACTTTTCACTAAAAACTATGACTGCAAGAATTTGCTATATTAGGTTCACCAC TCA

TGAGGTGGATGTGGCACACTCTACTAGCAGAAAACTGGAAGGGAACGGGGGAAGGAT

CTTAACACATCAAGTATTTGCTTTGCTGCAATTAACAACGAAAGGACCGTTTGATCA TAG

GAATCATCATTAGCACTCAGCAAGAAGCAGACTTGTATAAAACATCAGTACAATAAA TTA

GAGGCAATAATCCAAGACATCAGATTGTTGAAGATCTTCAAGTCTCAGCTTACTTAA ACA

GTTTAAGAAAATAAAGCCCCGTCACCCCCCAAAGAAAAGGAATTGGAATACTCGTTC AA

AACAATCCATTACCTGTGCATCTGTAAATACATGGAAAGCATAGATTTCCGCATCTG GA

GCAAAACCGAGGCATTCTTCATCCTGACCAGCAATAACACCAGCTACAAATGTCCCG TG

TCCAACATTGTCATTCAATGTATCTTCGTTGGTCCAATTTGTGCGTTCCTGAAATTT AAG

GTACCAAGCCCACTGTCTGTTATTAATGACGTAAAGATGGTACGAAGATATATACAT TG

GCATATATGTAAATGTCAAGCCTCAAACCACATTGGATTCAACTAAAGCATTTCTTT GTT

ACAGAAAATATATTTTTAAACGACAAAAGACAAGATACCTTGATATTACGAAAATGT GGG

TGATCTGCACGGATGCCTGTATCAAAAATTGCCATTTTGACCTTAGCACCAGTATGC CC

TTTTGACCAAAGCTCATGTGCCCCAAAGAGGGATGTGACTCGAGATTTCTGCAAAAT AA

TCATACAGGACAGACTGTATATCAAGAAAAATACAGCGAACAACAACAATATGTTCA ATC

TAAAAAAGAAATAGAAAATAGAGCAGCAACATGAGACCCAGACACTCAAAGAATGCA GA

CCATGTTTTTCAAAAAGGAAGTATGCCCTGACAACCTTGAGGAAATAAGAATGACAA ATT

ATAAACCTATTAACACTCTCCGATCTGAATGTTACGCTCCAACGGCCAACCAAGTTA AA

GTGAGCCACTTAATGATCTAAGGATATATAATCCATTCTCGTTAGAGGTACTTTAGT GTC

TTG AAAG ACTAAAAAAACAAAAC ATG GTCATATATC CAGTG CAAAAGAGAATATTG GG G

CATAGCAGAGACAACTTGTGAAATTATATGGATCACAGGGTCGCACAAAGATTAAAC TT

TATAGTGATCAGTAAGGTGCAGCTTCTGTGTAATAATCAAGTTTCCCTTCATATTGC GTT

GAATTCAGTGTGTCATGAAAGATATAAAAGAATATTATCAATTGTAACTTCGTCACA GAG

AAGATAATCTCAGAAGTCATTTTCACAAGTTCGTGAAGTCGAATGCTTAGATTGTAG ATA

TATCCACGAGTTCTTCACTAGTCCCTGAATTAGTCACATATGTAACAAGCACTAGAA AGG

GACTGTTAGATAGTTACGGGAAAATAGCTAAATGTAAATACTTATATTTATTATAAG TGTC

CCACTTCGGGAAAACACAGGTATAGATATTATTACATTGTCAAGTGTGCCCATAAAA GG

AATCAGTTGTAAGATATTAGTCTTCAAGCATTCTCTAATCTTTCTCTTATTTTTCTC ACTAT

GGAGTCTCAGCATCATGACATATTAATGGAATAACAGACACTTGATTAGACCAAAAT AG

G AAAAG GACAAAGACAAAAG G GAACTGAAAG AG ATTAATTTCCTTTGAACATATAC CAT

GCAATAAAGTTGCAACTATCATATGTCATGAATGCAAAGAGAAGAGTTGCATACATT CCA

ATGATGTAAATTATCAGTAAACGCATACAAAATAAAACACAAATAATCAGTGGTCTT GCC

TGCATCAATAGATGTCTGCTCCAGCTAATTCTCATTATACTAGTGTTGGCCACCGCA TAG

TTTTG ACCTTCACTG AAG G ACATAG CAGTAAAAATCTTTC CTGG CCTCTTCTTC CCATTG

GCAAAAGCCCCATTCTTCTCACTCTTCTCTTCAAGAACTATCCTTTGATAGCTCAAA TCC

AACGAAACGTCTTTTACAAGATTCATTTTTCTGAACTTTTCTAGCAAGAGTTCTTTC ATTG

ACTCGTCGATTTCCACCAATCCAAAGTCAGTAGGAAATCTCGCAGCCGGATTTTTCC GC

TCAATCCATTGCCAACCCTTAAATTTCAAGTTGTTTTGAAGATAATTCCAGTGATCC TCA

GGTTCCTTATAATGATAGAATCGAACAATATAATTTCTGCTATCAGATTGTTGCTTC TGG

TCATGTTGGCACTCATCGGAGCTACTAGAAATTAATGGCTCAGACTCTATTGGTGGG TT

GAAGCGGATGAGTGTATATACCGGAAGGAAGGGGACAAGTGAGAGGGTGAAGAATGA

TTTCTTAG GAG CTTCAG GCAT

SEQ 35

TCATATTGAAGCGACCAAGTCTTCAGTCTCAGTCGTCTGCTGAGAAAGGGTGCCTCCAA TCCACCTCTTGAGCATCTCTAAGGCAACTTTAGGTTGATCCATTGGAACCATGTGTCCA GCATCGTGAACCTGTTATAAGACACCAAAACAGTTAGCTCAAACATCCATCAGTAGAATT

TGAACAATAACATCGACAAAGAAAGGCACCTTCAGGAAACTCAGAGGCCCATGGCTT TT

CAACAATCCAGCTTCAGAACTGTCAACTTCAAAAGGAACATCGGGAGATGCTACAAA CT

CTTTCTGACCACTCCATTCCATAGCCTGAACCCATCTTGAGTTACCTGGTAAAAGAG AG

GCGTTATAATATCCGAAATATTTATGTGAAAAGTTTCCATCATTAGGCTTAGAGTTG AGT

CAAAGCTTACCAAGCCAGTTGCAAATAAGATCATATTCTCCAGCATAAACAAGCAAC TTT

ATTCCATCCTCGAGCAAGGTTGGAATGCCAGCCTCAAGATTCCTCATCCAATCAACA AG

CATGGCCTGGTACACAGTAGTGCTGCATGAGACAAACTCTATATCCTCAACTCCAAG AG

CCTGCTTAACAGAGTGCATATTCAGCAATTTCTCCATGTTTGAGAAGTCATAGCAGA GT

GCTCCAACGCATTTCTTTCTGATGTCGTAATGCTGCAGGTGAAAGCTCAATGGATCA GA

AATATGGTTAATCAGTCATTTGTTCCAAACTTTGGAAGGCATGCCAATGACAATGTG ACC

TCTTGAGTAATTTAACATATTCAACATGAAATGATATGGAGTAGCGATTTAGAAGAA ATA

G ATTTCTG G G ATCATTTCTACTCTTTCTG AG G CTAGTAACACCTATTTCTCTACGAAGTA

C AG AATAAGTGTAATAG G C ACTAG AATTAGTAG AAAAC G G G AAAC AG AC AG AAAG G G CT

GATAACTTACATTGATGTCAGCCCCAGCACGTGCACGAACAGCAGAGAATATAGAAT TG

C AAAC AAAATAG G C AG C C AAG C AAG AG ATTTTC C C ATC AGTAC CTAAAAG AATAAAAAA

GACAGAGAAACTGAGAAGCAAAACAAAGTACAGAGAATTGATTTGCTGTGGTCAAAG AA

CATCCATTTACTTCATCAGCTCCTTCCCCTTTTCTTTTTCACCCAGGGAAAGCCCGA ATG

AGTCAAATGATATGGAGGAAAGAAAGATAGTAAACAGTAAATTAAATAATGTACCAC AAA

GGTTTATTGCAACTTCACAAACTGGAAGTATTTTGTTGATACGATCATGATCAGACT TTG

AAATTAATCCCATGTCCAATGCATAGTCAGTATACGCAGCGTATTGTATTTTGGGAT CTG

TAAGCCCATTCCCAATGGCAAATCCCTGTTCAAATACTTTCAATGAAGTAAAGACAC ATG

ATTAAGGAAATAAGAATTCAATAACTGGGAAAATGAGGTACCTTTAAGTTTATATGT ATT

CCTTCTTTAGCCTTGTTTCCCTTGTGTACTCTAGCAGCAAAAGCAGGAATATAGTGC CC

AGCATATGATTCTCCAGTTATGTAGAAGTCATTCTTTACAAGCTCAGGATGCTCTTC AAA

GAAAGCCTACACCATTATTAATGATCATACCAACACAAACAAGTCAGGATCATATTA TCT

CTGTTACGCATCAATTTAGAAAAATGCTAACTGTGTCACATAAAAGAACGAATCTGA AAT

AGCCAATGTGTCACAAATGCTCAAGAGAATTCATACCACATTCGGCCTTCAGAATTT GC

AAAGCAGAAAAATACAACAATAAAAGCAACATATAACAATATTTCCAACTAGAAATC TGT

TGAAAAATTCACGTTCCGAATAGGTAATGTATAGTCTTAAGGCGGCTAAGCCAAGTT CT

GCTAAATATTCTGGTTTAAAAGCTGTTATACATGCTAACAAAATGCATCATGAGGGA AAC

AACTG AC CAACAAGTTACC CAG CCAAAATTCAG GG ATCAG CTG CAGTTTGTAGTAAAAA

ACAG G AAAACCAG CCCATGAAG AAG G GTATTGAATACTG CAAAAAG GTTGAG GG AC AG

GGTTTTCTAGCAATGTGATCACATCTTTTGCCCCTAATGCATTTGGCAGATAAATGG AGT

CAAAATATTTTAACGCCTCCATTTTTGTTGGGATCAGAACCCAGGCATCAACCAACT ATC

ATTTCATAAGCACAATATAAGACTCAAGTCCTAGTATATGACATCTCTCCAATTATC TATA

TG GTAAAAGTATTAAGTG ACCATGTTTCTTTTG ACAAG AGTG G GTTG CTCTAGTG GTGA

GCACCCTCCACTTCCAACCAAGAGGTTGTGAGTTCGAGTCACCCCAAGAGCAAGGTG G

GGAGTTCTTGGAGGGAGGGAGCCTAGGGTCTATCGGAAACAGCCTCTCTACCCCAGG

GTAGGGGTAAGGTCTGCGTACACACTACCCTCTCCTGACCCCACTAGTGGGATTATA CT

GGGTTGTTGTTGTTGTTCTTGTTGTTTCTGTAAGGCTCGAGTTCTAGCATTAGAAAT CAG

CCTTTTGAGCTCCTGTAGACCTATTGTACTGTACCCCGTCTTTGTATCACATGTACA CAG

GTGATCAACACACAGACAAGAGAACTGACAAGCAAACCGCACTCTGAGAGTCCGAGC A

ATTACTTGAAAGGGATCCCAGACAACCCTTACTGGCACCAGTACTTTAATTTGTCAC CA

CTCACCAGGCGGCCAATTCTAGACAGGTCAGCAAGGCGATAAACAGAGGTGTCTTAA T

CATCACAATATCATCCCG CACACAG GAGTGG G GAT CAG CAAACACTTCTTTT AT AACTG

AGAAACTATTGTTTCCAGCATTGAAACTGTGGATAATGAGCCTGCCCCTGTACTTTC TTC

AATTTTTTTCAGAACAGGATTCAAACTACTGACATGCGCCTACCACACATCCCATGT TCG

AAATTGAAACCAAAGCTCTGGGGCACTGGAGATTAGCAAATAGGCTAGGTGATTATA AA

GATATCATTCAAGAGTTCTCCTACTAATTCACGGTACTTTCTACAAACCCCTCCCTC CTT

CCACAGTTGATCACAATAAGCTTGACTACTGACGTATATGTCAATACCACAGCCTCT GT G AG ATAG AAAAG CTTCCATAATG ACTACTTG AAAG G AG ACCAAG GG G GTTTAG AAAGTA

TTTATCATTCTGTAAG CTACTGCAACAATAATG ATTTTACTTAACG G AAAG G AATGC CAT

AAATGAATTTGTATTCTTGAGGATGTTCACAACCAGGACTGAAGTTGCTTCCACCCC TA

GCTACATTCTTTATCCGTATTAAGGAAAAGTTTACCATCCTTTTTTTCCAGGTGAAA TGTT

TTATTGGCCTTTAGAAGCAGGACAAATTGTCCAGTGCAAGCCTCATGAATAACATAC AT

GAAACTAGGAATTGATAGGTGAAGAAAATATAGGAAACCATCCATATCATTAAGTTA AAC

AATTGACTCTCTGTCATCATACGACAATGATTACAACGGTTGATTCCAGAGGAATAA ATA

TGGTTCCAAGTTGCTTTAGGGGTTTAATTTCACACAGAACAAACCTGTAGGAAGTCA TA

CAAGTCGTCGCTAACACCTGCTTCACTGTGACGGATGTCATGTCTGTCAGAACTGTA AC

TAAAGCCAGTACCTGTAGGTTGGTCCACATAGATAAGGTTTGATACCTGCAGAAATA AC

ACGTCAACATCATTTCTTCAGGNCGTCAACATCATTTCTCTAGGTTTGGAAAATCTA TTA

AGGTATTCTAGTGTCCTTGTAGGAGAAAAAGGAAAATCGATAATAAAAATGAAACAT CTA

CTTTACAAG GAACAAATGTGG AACACAAG G CAAACTTG ACACTCTAG GAGTCAG CAATA

AAAGACCCAAACCACAAAAACCAAAACTCAAGATCTTATGGAACATAAAGCACTTTC CTC

TCTG CATTCTTGAATTG CCGTG CAG GTAATTTTTTTCAGTAATGAGAAAAAAG AACATTA

AAACAG G CAG AAG CATGACATG G AATTAGG G AAGTG CAGTATCAGAG GTCTAATG AAA

AAAATATG GCTGACATGTTTCCTATG CAAAG CATTAAG ATTTCAGTAAAACACAAAG CTC

CTTCCG G GAAAAAAAGTTTTCTTG CAGTCTG GTT GAT GAT AACTACATAAATG CTG AAAG

TGTAACTATCTAAAGGCTAAAAAGGACATTTCTCCTGACAGATGTATCATGTGCAAG AAA

GAAATGGTCCCCACAGATTTTCTTCACATTGATGAAGTATCAATTTGTGCCTGTGAA TTG

GCATTGCATTCATTTCAGGTTGGTTACTGGTACATCTCGAAGAAGGATAATGGTGAA GC

TTGGATTATCAAAGCTTAATCACAAACCAGTATCCTTATACTTATGATTCTTCGATT TTAC

ACAACATTGAACTAGATATAGACATGTTATTAACTGTTTTATCTTGATTTCCTTTTC AAGT

AATCTTTTCAAAACTAAAAATGATAATAACAAAGAGGAACATCCACAACGAACAAGC TTT

TTCG CTAATTGG CACTTTACAAATAAATTAG CAGG CTG CAAAACTACTTATCATG AAAAC

GACTGGATAAAGGACATGCAAAAGATGTTGCCAATGCAAAATCTAATAATCTATACG CC

CATGCACCATTATACTCTTAACTTTCATGTACCATTGAGTAAAAGCAAGAGAAAGTC TTA

TACTATCGAGCTGAATCATCTACTAAAGAGCAAAGAAGGTAAATTGCTACTATTGCT TAA

GCAAAGCTGTGTTTACATACCACAATAAACTGCTGTTAAATAGCATAAACCATTAAG ACT

ATAAGGTGGTCTTAAAGATAGTCAATTACCTTGTCCCACCCATATTCATTCCGCACA AGT

GACAAATTATTTGAAATAGAG AAAG GTCCATTTTCATAG AAAAG GG CCAACTCACTG CT

GCAACCAGGCCCTCCACTCAACCAGATGACAACAGGATCGTCCTTACTACCGCGTGA T

TCAAAGAAGAAATAGAACAACCTACAAAATGAAAGATAACACAGTTTCAAGCTAAAT AAG

AG C C C AAC C AAC AATAAG AAG AAG CTAGTTG ATAC GTTAG AC AG G AAG AC G AAG AT ATA

AAGCCTTATTACTCCGGAGATGTATATTTTGGTAAATGCTCACGAAAGACAAATATT TAA

ATGAAACAATGTTGCAGCATTCCTTTCAATTCTTACAACTGATTTGGAAACTTCTTA AGC

GCACTGATTGAGTTAAACAGATAAGTGGTCAAACTGTAACAATCTGATGGTTCCCCA CA

AAAGCTACATGTAATGACATTTGAAGGTAAATAAATCAAGCAGACAGAATATTTCTC CAT

TAATCAACCAATCAAATAATTCCAAATTAATTGGGTTGACTAAATGAATACTCTTTA ACAA

TTCTGCTATATTTAAATCCAATTTGTTGCAATATTAAATAAATATCCTTTGAATTCA TTTCC

ACACCAGTTACAACCGTTTTATTCGAATTTCCGAAGTGGAAAGTGAATTTGTAGGAG GA

TAGAG GG G AATG CAG G GGTG CACTG CACG ATAGAAG ACTACTG AGAACTTGG CTACAT

TTTGTAGACAACAAAAACCTATTACCCCAGTACACCACAATGCAATGCTGAATCCGA CA

GTATTTTAACATAAAAGAGAAAAGAAAAAATTGAAACTATAATTTGGTTGGATGCCT ATG

ATAATGCAAATCGGAACAAGAATCACCAAAAACCAGGAAGTGTTTGAAGCAACAAAT AT

GGTACTAGTCTGTTCCAATTTGATAAACTTTCTATTTTTGGGACGTCCGGGAACATT TGA

CACCAGTTTGCTAGTATCTTAACAATTTTCAAGATACTCTAATTCATTTCTTTGAAA AGAA

TAGACATTAGGCCTAACTCAACATCAAAAGCTAGCTCATGAGGTGATTGATTGTTCA TTC

AATATATAAGGAGACAACAGTCCACTCACTCCACCAATTTGGGACACTTTAATATTT CCA

CACGACGAGGCCGTGGACAACTGGAGTGTGGACAGCATAACTTGCGACCCCAACATG

GAGAAACACAATGTCGACCTTACCCTACCACAACCTCAAAAACTAGCTCACGAGGTG AG GAACGGAGGATTGCTTGATACCATAGGAGACAACACTCCATTCCCTCAACCAATGTGAC

ACTTCAACATTATGTGTTATTTTTATCGAGTTACATTTTCTACAGTTATCGAAATAT GTTTC

TTACAATACTACATCGTACATACTCTAATAATTAAATCAAATCAACATCTTAAATTC CGTG

TTC AACTAAAC ACTAC C ATAAAAATC G AG AC AG AG ATTG AAAAAAAAACTTAAC CTAG G G

TTTGAAAAGGTACCTAGCAGCATGAGAATGCTTAATCTTATAATAACCGGCATGATG CC

CCAAATCTTCAAAGGATATAACACTCGAATTCGTCAAATTAGCGAAATTAAATCGCT TCT

CGACGATTCTAGAAGCAGCGGTGGGAAATGGATCCCGATCGACAATGTTATCGGATT C

TTTCGGGAACAAATTTAGCTCGTGTATCAACTTCTCAGCTTGCTTCGATGCTAACTT CGA

AG ATATTG AAAC CTTC G C G AATG AAG AAG G AG AAAAAG C AAG G AG AAG AAC AAG AG AG

AGAAAAAGGGAAAGTGAAAGCTTCATTTGCGCCAT

SEQ 36

TCAAGAAGAAGGGGTCTTCTTTCCCTCATTTCCGAAGGTTCCAATAGGTTTGGCAAGAA

TGTGCTGCATTGCTTTGACTCCTAGTGGATTGTTCTTGCTCTGTACCAACAACATTT AAA

ACACAATCACGTAAGTAAATGAAACAACCATATCCTTCTGAACAAAACTGTTAAAAG ATA

AACCTTGGCCTAACTAAAGGGCAGCACGGGGCACTAAGCTCCCGCTATGCGCGGAGT

C C G GAG AAG G G C C G G ACTAC AAG G AC CTATTGTAC G C AAACTTAC C CTG C ATTTTTG C

AAGAGAATGTTTCTAAAGCTCGAACATATGATAGCTACTTTACCAGTTAAACCTTGG CCT

AACTCAAACAAAAATCTAG CTCATGAGATGAG AATTG CCAAAGAAC CCATTC CCTTAATC

AATGTGCGACAATCTAACAATAACAAGCAGAGATCACTTACAATTGAGCAGGTGCCT GC

TTTAACATTG CAG ACAG G ATAATCATGTGG G CAACAACTGTTATG GTCTTTACAGCAAGT

AGCTCCTTCCATGGGACAACAACCCCAAGCAAAACAGTAGTTATAATACTTGTAGAC AC

AGCAGCACGTTGTTCCGGCTGGGCATTCGTTATAATCATCACATTGAGTGGGTGGCT TG

ACTGGTGATGGAGGAGATGGAGCTGGTTTTGGGGGGTTTTGGCCTGTCTTTACAGGG T

AAG AAG CAATTGTAG CAATACCACACAAACCTTTG G G GTTG CCAATGTTTCG CTG CATC

CTGAGGTAACCATTTTCTCCCCACGAAGCACCCCATGAGTTCCTCACGATCCAATAA TC

CATGCCATTTTCACTACCATATCCTACTGCAACCACACCATGGTCCACTGCTGCACC AC

ATTTTC C G GTAAAG ATAC C CTG GAG CAT AG G C G G ATC C AAG ATTTAAACTTAATAG GAT

CAACATTTAAATTTTTTAATACTGAACTCATTGTGACTTTGAAAATACAGAAATATT TGTT

GAATCCGTGTAAATACTGGCTAATTCGATCAAAAAGATAACAACTTTATTGACAATA GGC

AAATCTGAAGTAATACCGATTTATAGTGCTGGAAGTCTTTGCCGCCAGCTTCGATAG CA

ACGCTGACGGGTTGACCCGCGACGGCCTTTTTCAGTGCCTTTTCATCATTAGCAGGA A

CATCTTCATACCCGTCGATGGTGACAACCTTGGCATTTTTCTATAATTGTTTCAAAC AAA

ATATTAATAGTCATAACTTGTATTTAGTTCATTTCCAATAATCCCTTTTAAGACAGA GAAC

ACCATACTGACCCTTGCTTGATCGCATTTTCCATCTTTGGCTTTGTAGGGGTAGTCT TCT

TCAGTGTCTATTCCTCCATTTTGAATGACGAATTTAAAGGCATCGTCCATTAGACCC CCT

TGGCAGCCTTGGTTATCGGCAGTATCACAATCTACCAGCTCTTGTTCAGATAACGAG AT

CAGATTACCTGTCATTATCTTGTTTACTGCTTCAATTGAAGCAACTGCTGAGAAAGC CCA

GCAACTCCCTATTTCATTTCCCAATTGTAACACAATTAAGACGAAAATCATTAGTTG CTTT

ACGTAATATAAAAATATCATTATTTCTTCAGCATTTATTTATGCATAAAATGGAAGA TAAT

CCAATCAATCAAG GCTG CATAATTTG CCAAAACAATTG CACAAAAATACATACTGATG AA

CATAGTGTTTTGTTCAAGGGAAAAAACATTACTTATATTGCCATAATTAGATGCGGA AAT

TG GAG CAAAAGTCCTG CTAACCTAATTTATCATACTTAACAACAACAATAACAAATC CAC

TGAAATCCCATCGTGTGGAATCTCCTTACCCCATCTAGATAAAGTATAGAGACACTT TCT

G AC AG AC C C CAAAAG ATAC AAAG GTAAAG AG G AAAAAAG G C C CAC G AAAG GTAAAAAG

CAGGGAATTAAATAACAATAAACAGCAATACCAACAAAGCAATCAGAAAATTTAAAA ATT

AAACATG CTTAACCTTAG CTAAAATTAACAATAATGAAGG GTCAACTG AACTTG CCATTG

ATCTTAGGTTAAAACTTATGACACTGTGATCATTTGAATCCCTTACCCTAGCAAACT ACT

AAATTTTCCGAATTTCCATCAAAGCCATCAGTAAAGAAAGCATTGACAAATGTTAAA CAG AAAATAAGTATTAACAAAAAGGAAAACAAAAACAAGTAAGAGGTGCAGAGATGAAGAAG

GCAGAGTACGAAAGGAAAATACCACATTGTCCTTGATTTTTGACGTCAACAAGAACA CC

TTTCTTCCTCCAGTCAACGGAATCCGGCAAACTATCTCCGACCTTGGGGGCATAACG GT

CACTTTGGGTATACGACAACCTACTACGACCATCGGGCTTAGTACCCAAGTAGATGG AC

TTGTACTC CTCGTTG GTCAAATCTG CAAACTGAGTC AAACC CAG CTTGTAACTTTTTTCA

GGCGCAGAGTTCTGTTCATCGATGTATCTAAGGTTGTCCTTAAAGATCTGGAACCGC TT

GTCCTTTTCTCCTAAGGCGTTATACACTTTTTTATGTTCAACTAGCCAAGATTCATA CAAA

GACACGATTTCATCGTCTGTTCGCCAGACCGTTGACTCGCCGTTCGTGTGATGTTTT TC

GTTGTAGCTTATAATGGACATGTCCTCCGCCGATGATGTTACGGCGGAGAACATGAG C

ATTACGAGTATGGAGATGGAGAGGGTGGAAGTATGAATCGCCAT

SEQ 37

TC AG G AAG G C G AAAC AG C AAG AG G ATTC AAC AAAGTG C C C AC G AAC AAAAC AAC AC C A

GCAGTCTCATCTTTCACAAGGAACATGAAGGGATGATCAGCAACAAAGTCTATCTCC TC

TTCAACCTTCATCATCGAGCACCCGAACATCATTGTAGCAACAGTAACAGCTGGAGC TA

CTGGAGCCTCCTCATTTACTTCGATAAAGGCCTTGTGAAAAGCTTTTGCAGCTGGAG CT

TCTGCACCTTCCTCATTTACTTCAATAAAGGCCTTGTGAAAAACGTTTGCAACTGCC AGA

GGATAATTCTCGCCCACCATCTCAGTGAGACCACCTTTAAAAGGTAATGTGAGCTCG AG

TCCTTTTAGAACTTCTAAAGCTTCAATCCCCAAAGATATTTTGAACTTAGGGATAAG AAA

CTCGTGCACTTTAACTTTTTCATATGGAACATGGCGATCCAAAAATCCAGGTTCCGA ACT

AATTTTCTCCAGTAAAGTTGGTAATCCATCACGGGCATTTGGGAGATACACATACAT GTT

GAGAAATCGCTTGTCCTCGCCCTGTTTATAACGAAGCCTTAACACTTGGAAACCATC AA

AGGCCTTCACGTATTGCCTTTTTTTGCTGGTCATTAAGGGTGCTTGAACAGATCCTC CAT

TAAG G AG ATG G AACTCATG GTCTTTTGTATCTGAAGCATTCAACTTTTCAGTCCATG CTC

CTTTG AAATATAGTG CATTCG CTAAG ATCAG ACTTGTACCG CTATTG ACTG CAACAG GA

GGAAGAATTTGTTTGATAAGACCATTCGTTTTCTCTTCAGCCCACTTATTGACTTCA CCA

GTAACCTCATCACCCTACATAAGAAAAATTAAAACAAACAGAGAACATAGAATCAGC AG

GCTAGTAACAGGATAGAGTTGAAAGTAAAAGAAACAAAAAAGATGTCCTATATGACA TTA

GATTCGTTTGTTTGTAATCTTATTCGAAAAGCATAATAAACGTTTCTAATGTGCTTA CTAA

TTTGAAAATATTCTGTTTACCAACATGCCCATAATATTTTGTTATACATTATAATAC TCCCT

TTGTTTCGTACTTCGAGGGTCAAACTTTTCAATTTTAACCGTGAATTCGAACATGAA ATTT

TAATTTTTGACATAAAAGTCACATATTTAGAGACTGTAAAAGTAATATAAGTCATTT ATAA

GGAATATAAGAAAAATCGCAGTCAAGAAAAACTCGACTCTCGAAATCCAAAAGGTGT CA

CATAAATTGGGATGGAGGGAGTATCATGTAATTTCAAACATCAATTTCTTTTCTAGT GAG

TG AAG G ATTAAG ACTAAC CTTGTTC C G AAAAT CAACAG AAG C C G AAG CAG C CTTATAAA

CATTGTCCATAACCTGTTTGAAAGAATGCTTAAAAGACAAAGATTGGTCAACCCAGG CC

CAATTAGTGACAGACAAACGAGGACCTCCCATGGGGCTGCCATCGGCTAAGACGTCG G

TGATGACCCGAGAATAAACAGAGTTAAGTTCTTCAACAGAGTTGAATTTGAGAAAAG CC

AACAGTTGATCCAATGTGGAACCACTGGAGCCTGCTGCAATAAGAGCAAAAATTATT TG

AATGGAGACCGGGGAAAACACCATGTTTGCGTTGTTGGACTCGTCCTCGTCGGCTTT A

AACTTGCTGAAGAATACATGCTTTGAAAGAATCAATGGAACATCCATCATTTCTGAT GAG

GGAAACTAAAATTGAAACAGAGAACGTAAATCCATAAGATGAGGGAGACTGAAACTG AA

ACGGAGAACAGAAATCCATAATTTCTGATGAGGGGAACTGAAACTGAAACTGAAACG GA

GAACGGAAATCCATAATTTCTGACGAGGGAAACTAAGGTACCTTCACAGTTGAACTT GA

CAGCATGGAGAGAATGCTGATGCATATAGAACTAACAGTCATGGTAGGCGACCACGA G

TCATACAGAATATCTGCAAGACAGAAAATAGTTGTCAACTGCTGCTTTTATTAGATG CAT

TCTAACTTTTTTCATGAGTCCGGTAGAAAATCTTAATTATTAAAAAAAAAAACTTAT TCTG

GAAGATGAGAGGAATAACTGTGAATGAAAGGCGTTTTGCTTTCACTTTATTAGGCAG GT

AACTATAG AAAGTTTAAG AG ATTAATAAG C AAG C CAT AG G C ATAG AG CAG AAAC C AAAA TTATGCAGGAAAAGAAACTAGTATGATCTCTGCAATTCTTAGATGAAATGACAAATTTGA

C GAG AC C C AC AG AC AAAC AAAAAAC AG ATC C AAAGTTTATTCTACTAG G G CTAAG AGTT

CTGAATCAGTAAACAACTGTGTTTTTTAATCATCCGTCAACCATCATTGACTATTAC TCCA

TAATTAGATACGAATCAACAAATATTATCGGGACATGAAAGGGGAAAAGAATCAATA ACA

AAAAC AC C AAG AC C AAGTAAAAAG CTC AC G G ATATAG C G AG AC AATCTAC C ATC ATAC C

CATCATCAACTAAGTTGGCATAGAGCATTTCCAAAACTATAATACCTGTCATCTAAT CTTT

ATTAAGAGTTCATATCACAAATCATCCATAATTGCCAAGGATTTTCGTTAAATGCTT CATC

TAATGCATTCTAAATCAAGCCATTAAGCTCGAAGAGTAACCAAATAAATAAATTTCT GTT

GGTGATAAGAAATAGAGGAGACTTGAGCTGTTGAATAAATACCTAAACAAATGTGCC CG

TCCCTATAGATATCAGGGTGCAAAGGAGCTGGAGGCAAAAATATCACCTGCATCAAT GT

AGTCAGCCCTAATTAGTATGATAGGAATTTCAATCAAGAACTTTAACAAGAAACCAC ATC

TTATATCCACCTGGGGAGCTTTAATAGGATAATGTTCAGGGAATTCGGCTTGAAGCT GA

TAAGTTTCATTAGCATACAGCGTCCCAGGAGCACCATTCACTTCAATTAACCACCTT TTC

AAAAATCAAAAGTTCAGTAATAATAGCATGGGACTTCGAATTCAATATTAGAAATTT ATTG

GAAAACGACAAAAAACTTAACAACAAACAGGAAAAATGAAGACCTTTGAAGATAATC GG

AGGGTTCAAGATTGAAGCCAGACGGGGGATTGACCTGCCAGTTCCCTAGCTCTGTAT G

GAGTCGATTGCATGCCAT

SEQ 38

CTGAAAGTTGGTTCCTTTTTTTCTTCTCTTATTTATTCATGCAATAAAGCATCTCCAAAC T

TCTATTCTTATTCATTCTCTCTGCTTTCTTGCTTCATCGAACTGGTGAGTAGTTGTT TTGT

TTCCTTTTTCTTCTATTTAAGAAAAAATTAACTCTCTTTTTGTCGATATATTTTATC CTTTTT

TTTTCTTTTTTCTTTTTTTGTTTCTGTGGGTATTAGAGGTTTTGTGCTTCATTTCAT ATATT

TGTCTCATGATTTTACTACTTTCAAGGTTGGGCTTTTTCCTTAGCAAGAAAGAACTA TTTC

TGTTTATGTTTCATTTTTCTTTTGGACCTTGGTTTTCTGTTCTCGAGGATTGTATCT GTTA

AAAATTGAAGTACTTTTTTTTCCCTTCATCTTTTTAATTGATGTTCTGTTTAGTGTT ATTTT

CACCTTTTATGGCATTTAGCAATGTTTGTGCTTTGACGGGTTGCTGTTATAAACATA AAT

TTTGGGAAAATAATTACCAGGTAAACTTGTTATTATGCAAGTGCAATTTGTGTGCGT GTG

GTG GTTTTGTTG CTAG G GAG CAAGG CATGTG ATTAGTG ATAAG AG G GTTAAAAGG G GA

GTAG ATAAACAAAG CTCCACTTTTTAGG CTATTGTTTTTACTTG G GTTCTTC CATTTTTTA

TTATAGCTTGATGAAGTAATATGTAGCTTATAAATTTCCCAGAATAAGAATCATCTC TTGC

CTTAGAAAAAATAATTTACCAGTAAGAGCAGAATATATGGTAGGATTCATCCACTCA ACT

CCAATTAGTTTGTGACTGAGGCAAAGTTGATTGAGTGATCGATTGAGTTTAGTCTCA TTA

GATTGTCATTTATCCATTAAAAACATGCAGCAGGCATAACATGAGTGATTTGATCTT CTG

AGCATTTTCTCTTGTTTGTTGAATTTAATATATCTTCACTAATTGCTTGGCCTAAAT TTTAT

TAACTCAAAGTGATGATTTGCCTAGGTCAATATGGGAGCAAAAGCTTTTCTTGTCAC CAT

TTTACTCTCATCGCTGTTATTTCCTTTGGCCTTGTCTACGTCAAATGATGGCTTGGT TAG

AATTGGACTGAAAAAGATAAAATTTGATCAAAACAATCGACTTGCTGCACGCGTCGA GT

CCAAGGAGGGCGAGGCTGTGAGAGCCTCTATTAGGAAGTATAATAACTTCCATGGTA AT

CTTG GG G CCTCTG AG G ATACAGACATTGTAGC ACTG AAG AACTATATG G ATG CTCAGTA

CTTTGGGGAGATTGGTATAGGCTCTCCCCCTCAGAAGTTCACAGTCATCTTTGATAC TG

GTAGCTCTAATTTGTGGGTGCCTTCATCAAAGTGCTACTTCTCAGTAAGTTATTTTT TTC

CTTAAAAGAATGCATAATAGAGAAAGCTAGTATTGGCTACATAATTTGATGATCATC AAT

ATTTATGTTTCTCTATGTTTGTGCAGGTTCCCTGTTTTTTCCATTCCAAGTATAAGT CAAG

CCAATCAAGCACTTATAAGAAAAATGGTCTGTTTCTGACCTTTGTCTATATTTGATA ATTG

CAACACGACACGTGCTTTTCTCTTATACTTGTTATTTATGCTCAATGCTTGCTTGTA AGA

GAAAGCGTTCCATTATTGGCATTATACATGACATGTCTTAGGTTTTGAGATCAAAAC TAT

TAACTCTG CTACCAACTTAG G ATTTTTTTAAAAAG AAAATAAAG G AAACC CTCAC CATTTT

TATTGTTGTCATCCAATTATGTGCCTTGTATCAAAGTTTTTTGTTGAAAAATATAAT TTGG CAAGTTTATGTTGTTGGCTTTCCCTGCCAAAAATGTGCTAATGTTATCTCTCTGATTTTT T

TTACTCATGATTTGCAATAAAAGCTTGTGCCTTTTAAACTGTTTTGTCTATCAAGGA ATCT

GTTATGCTGGAGTTCCTTTATTGAGTTTTGATATCTATCATAATTTACTTTCCTGGA AAAT

TGATGTCTGCTGTGTGTTTGATATGACCTTTGAATATTCTTCTCTGTCGTTGAGTTG GTC

AACGTGTTCAATTGGTTGTTGACCTAAGAACCTGTTCATCCAAACCTTTTTCTGTTT AATA

TGCCATACAGGGAAGTCTGCTGCAATTCGTTATGGTACTGGAGCAATATCTGGATTT TT

CAGTCAAGATAG CGTTAAAGTCG GTGACCTTATTGTG CAAAATC AG GTG AATGTG GCTT

CTCACTTCCTTTTTTTTAATTTTTTTTTATGTTTCTTGAATATATGGTCTCTCATCT GTCGA

GATTGTTAATGACATCAGGAGTTCATTGAGGCAACAAGAGAACCCAGTGTGACTTTT TT

G GTAG CCAAGTTTGATG GTATATTGG GTCTTG GTTTCC AG G AG ATTTCTGTTG G AAATG

CTGTTCCAGTATGGTATGTGGGTTTATTTTGTTTGCGTTCTCTTCTTTCCAAATGTT TCTT

CAATTTCCTATTAACCAAGTGCGTGCCTTGTGAATTTCATTATTATTGAAATGATTT TATC

TTCTGGATTGCAGAATTTCATGAACATTTTCTTCTATATAAAGTTTTAAGTGATACC GGTC

TTG ACG GTTTCTTCTGTGTTTTATAG GTACAACATG GTCAAACAG GGTCTTGTCAAG GA

GCCTGTCTTCTCATTTTGGCTCAACCGAAATACAAAGGAAGACGAAGGGGGCGAAAT T

GTGTTTGGTGGGGTTGATCCTAACCACTATAAGGGAAAGCACACCTATGTCCCAGTC AC

ACGGAAAGGTTATTGGCAGGTAAATATCCCTATATCTTCGGAAGATTGATGTTTTGC TTT

CTGCAACTGTTTTCTTACTCTTCAGAATATAATATGCAGTTTGACATGGGTGATGTT CTG

ATTGATGGTCAAGCTACTGGTATGTTACGTTACTTCCTTTTCTATTTTTTTGTGTGT GGA

GATTTCGAGGATATTGATGAGAGCACTTTCCCATGATTTCCCTGCTTTTTCGTTGTA TTG

ACATACTGAATAATGTAGGTTACTGTGACAATGGATGTTCTGCAATAGCGGATTCTG GG

ACTTCTCTCTTGGCTGGTCCAACGGTATTCTCCAAAGCATATTCCACTTTTTGTCCC TAT

TATTCAGCTATTTTCAATAGTGAACTAGCTCAGAATATTTTTTGTACCTTCTTGTTC ATGT

GTAGCTTCAACAATCTTCGAGCGATGAATAGGTTTAGTTTTTGGTTGGAATATCAGT TAA

ATAATAATCAGCCATTCCTTTGAACTTTTCTCGTTTTTTCCTTTTCCTATTCAAAAA AAGG

ACGACGGGAAGTGCAGTGGAATTGATGTTCATCCCAGTATCAGGACAAACTACCTTG TT

GATTGTCATACCTAAGAAATGTTTTTTTTTAACTTTTGCCTGTTGTTTCTGTCTTAT TAAAT

TAATGCAACTTGAGAACTGCTTCTTTCTTCTCATCTTTAAGGCATGGTTGACAAATA TGA

TACAAGG AAAAAG CTGC AG CTTTATTTGTCTAGACAATTG CAGTAGTGAAATG CTTTACT

ACTACATTTTCTAGTTCTCATCACTGTATCCTTCCTCCTCTATCTTGCAGACTGTTA TCAC

TATGATTAATCATGCCATTGGCGCCTCGGGGGTTGTAAGCCAACAATGCAAAGCTGT TG

TTG AACAGTATG GACAAACAATAATG GATATGCTTTTAG CG GAG GTG AGCAATTAATTAT

TTTAGTTGATAGTTTGTTTTTGTTTTTACCAATAGTTTTCCGTGGTATCTGCAAAGA GGGT

GGTTTCGTGCTACTAGTTGCCTTCCCAATATTCTGATGGATTGGCGTCTTAACAGGC AC

ATCCAAAGAAGATCTGCTCGCAGGTTGGGTTATGCACCTTTGATGGAACTCGTGGCA TT

AGGTTAGGCTAATCATTTCTTTCCTAACCTTGGCCAATCATTTGATATGTTAAATCC TATT

ATAAAATGTGTGCTGAGTGGATTTATGTCCTCCACGTGTAGTATGGGCATTGAGAGT GT

TGTAGATGAGAATGCTGGCAAATCTTCAGGACTGCATGATGCTATGTGCTCCGCTTG TG

AAATGGCGGTTGTCTGGATGCAGAACCAACTTAGACAGAACCAGACCCAAGAACGCA T

CTTGAACTATGTGAATGAGGTAAATAGCATCAGTCACATGCTTTCTCTTCTCATCTT AGG

TTAG ATTACTG AC C ATCTTTAAC AG CTTTG C GAG C G ACTAC C AAG C C C AATG G G AC AAT

CTGCTGTTGATTGTGGAAAACTTTCTGGCATGCCTAGTGTTTCCTTCACAATTGGTG GC

AGAACATTTGACCTCTCTCCTGAGGAGGTATGTCTGATATCAATCTTCTGTAGTATA CAT

GGTGTCTTCTCAACTTGTAAATGGCTTTTGATTCTTCTGAACGACGTGGTTGGTTGT AGA

ATCCTTTTGTCATGTTTCAGTTTGGCAGTTCAATTCTTTTTGGTTTTCACTAGATTA GCTA

GCAAGGTGTTACGCTGCTTTCAAGAGAAGTACACTTGTCTTGTAGAAAATTTCAACC AT

GACAGCTAAGTGTAGTTTGGATAATTAATGATATTGAATGTGTCGAGCTTCAATATC AGT

TTCTTTGCTTGATAAGTTAACTTATGATTGGATAATTAATGTCATTGAAGTGTGTCG AGCT

TTGATATCAGTTTCTTTGCTTGGTAAGTTCATATGATTGTACTAAGCTTGCATGCTT GTCT

TGTCACCAGTACATACTCAAGGTGGGCGAGGGTCCTGCTGCACAATGTATTAGTGGC T

TCATTGCCTTGGATGTTCCTCCACCCCGTGGACCTCTCTGGTATGTTTTCTTTTCGT CTT AACACACGTGCAGATTCTGTTATTCTAGAAAAGTTATACCAGCTCCCTTTTGATAATGCT GTTTGCTTATGGCTTTGGTGGTGCAGGATCTTGGGGGATGTTTTCATGGGTCGATATCA CACCGTCTTTGATTTTGGCAAACTTAGAGTTGGATTTGCAGAAGCAGCT

SEQ 39

TCATATGGCTGCAGGTCTTCCATCTTTCCTAGGATCACTTACCGCCACAAGCATCCCAT

GAAAAACCCCGTTTTTATATTCTTTCCCGCTCCTTCTTCCCAACTTTAAATGAGAAT TTG

G AAG GTTTTG AACAATTAG CTGACAGATGG CTCCTC CGTTATGTGCTTC GAGTTG ATGA

CCCCTCTCTTCCAAGAAATGCTTTTTCTCATCCGATAGCTCAATGTGATCGCCATCG ATG

CATGTCCAGTTCTCATACAGAACTACATTCGGAATTAGCTGCAAAAAGAGGAACACG AA

TCAATACTCGGGGTTTCAGAATTCGTCCGTATCAAGAAATACGTGAATTGCTCAGTA GT

GCTTCGCCACTACGCTAGCTGGCGGAAATAGCTTAATGGTAGAGCATAGCCTTTCCA A

GGCTGAGGTTGAGGGTTCAAGTCCCTCCGCTCCTGGCTTCGTCGTTTAGTGGTAACA A

GTTCCGTGCATAAGCCACTTTAGAGATAGGTGATCCTTAAAAATACTCCCTCTGTTC CAC

TTTATGTGAGCCTTTTCGGAGCACGAGGTTCAAATTGACCAATTTTCCTTGTGGATT GAG

ACATAGAATTTTCAAAAATTACTACATAAAAAGTACTATAAGTCATACTAATAGTTA ACAA

TTCAAAATAAGAAAACTTTGTCTGACTCCCTAAATAGTAATAGATTCACATAAAGTG GAA

CAGAAGGAGTAATATACATTGCTGATCAGGCAAAGGGACCGACCTCGTGGTAGACTC T

TGGACTCTGCACTGCTGCTAAAGGATCCATTCCCAAGATGAAATGGTTGATGAAAAC CT

G G AC C AC C G C G G G G ATTATTTTC ATG C C AC C ACTAC C AC C AATTAC AC C AG C C AACTG A

TTATCCTGTTAACAAGAAGAAGAAAGAATCAATAAAAGATATACTAAACAACAACTA CTG

AGGAAGTTCGGATTGTCCATATACTAAGCAGTTAAGAAAGATGACGCATTGCTCAAA GG

ATCGTTTCCTGGATTGGATACCTTGAGAACAATGATTGGAGCCATGGACGACAACGG TC

TCTTTTTTGGTTGAATAAAATTAGCCGGGGCAGGAGGGAGTTCATCAGGGGATATCT CA

CTAGGTGTTGAGAAATCTCCCATTTCGTCGTTGAGTACAATACCAGTTGATGGAGAG AG

CACACCGGCTCCAAATGGATAGTTTACTGTGGTAGTTACTGATACAGCATTTCGATC AG

AATCTACAATACAAAAGTGACTTGTTCCGTGATCTCTTAGCTGACTCCACCTACAAA TCA

AGGACCAAAACCGCTTTAGAGCCATTGGTGTGCAATCAATAGGCTTGTTAGTTCTGG GC

TCAGTCTGAAATACTCCTTCTGTCCCAATTTACGTGGCGGTGTTGGATTTCGAGAAT CA

AAAAAGTTTTTCTTTGACTGCGATTTTTTCATAAGCCTTTTAAATATTTTGATTTAA TTATT

ATTGTGACTTATAGTACTTTTTGCGTAGTTTCCAAAGATTTAAATTTTATTTCAAGA CTAA

AAGATTCTATGTCCAAATTCATGGTCAAAGTTAACTTATTTGACTCTCGAAATTCAC AAAC

CGCCACATAATTGGGACGGAGGAAGTAGAATTTATCTAACCTGGGCATATAGTATTC AG

GAGGAAAGGTGGTATTGTCGAAAATCTTCTGTCGAATTGCTTTGGCAAAAGATGGGG AA

AGCATGTCTGATACAGTTTTGCTGATATTTACAAAGTCGGGATCACCGAGGTCCATC CG

AAATGCAAACATGTGTTTCATCGCCTCAATTAGTCGATGCAGACCTAAAGAACCTTC TG

CAGCATTATAGCTTTCAAGGATTTTAAGAATCTGCATTTCCCCAATCGCAATGGTTT AGT

CAG ACAATTG CGTTGTATTAAGTTTAG ACATAATTG AG G AAGATTG GAACCAATAC GTCT

ACGTCTGTGTTATTAGAAGGAAAATAGCAGTACAAGCTAAGTTGGTTTATATACATA CCA

GAGAAATCCCCAGTGTTCCACTGGACGGAGGTGGCATTCCAACGATGGTGTAGCCCA T

AGCATTAACGGTAACTGCTTCTGGAGTTTCCACTTTGTAATTCCTCAAATCGTCCAT TGT

CAAAATTCCACCCGCTTTTTTCACATCTTCGACAAGCTTTTCACCAACCTCTCCATT ATA

GAATGCTTCAGGCCCTTGTTCAGCAATAAGCTCTAAGCTGTGGCTAAGTTTTACATT ATG

GCAAATATCACCTGCCCGTAACAATTTCCCCTCTGGTGCAATTACTTGTCGTAAACC AG

G ATCTTTAAGTATCAACTTTG CTTTTG AC GCAATATG ATGTG CAAG ATATG GAG CAACCA

CGAATCCATCTCTAGCAAGTTTAATCGCTGGTTGAAATAGGGTCTTCCACGGCAACC TG

CCATGTTTTGACCAAGCGGCGTGAAGACCAGCTAACTCACCGGGAACTCCCATGGAC A

ATGCTCCCTCTAACTTGGATTTTCCATTATTATCATACATGTTCTGCTTGAGGACAA ACA

AAACAAATTCAGAACCTGCAACTTCGTGTTGTCTATGTTTATATACTAAATACTAAT TCCA ATTCCCAATAAGAGCAGAACTGCAGTTTTCTCATGTAGACAGCTACAACAGCTATTACTA

CTATGCCTCAATTCCAAGCAAGTTGGATCAGCTACATGAATCCTCACTGTCCATTTC GCT

TCATTAAGCCACAGTTTACTTATGCCGATACAAATTAAAGAATTTAACTTCTATACA CTAA

TAACCTAATTGTATTTTACAATATCAGTACTTCAATCCAAAGCAAGTTGGGATCGAT TATA

CGAATCCTTAGTGTTCATGTCTCTCCATTTGAGCCAAAGTTTACTTATGTCGATACA AAT

TAAAGAATATAAACGTAATACACTAACAACCTAATTGTCTTTTTACAATATCAGTAC TTCA

ATCCAGAGCAAGTTGAGATCCGCAATATGAATCCTCAATGTTCATGTCGCTCCATTT AA

GCCACAATTTACTCATGTAGGTACAAGTTAAAGAATTTTAACTTATATACACTGATA ATAT

AATTCTTTTATACAATATCGGTGTATTTAACTTGGTGTAATAGGCAGCGTGTCTTAT TTTC

CACATTAGTATTCCCACTATTTATGGACGATTGCATGTAATTTTCGGTTAGGTGACC TGA

TACTGTAAAAATTCATTTCTCCAAATTAAATAGTTATTTGATGATGTTTGGATGAGT TGAA

ACTTGAATGAGAAAAAG CAATG CTAAACG AGTGAATTACG AACCTGTG AAG CAG CTAAA

GGAGCAGTTTCCCTCATATCAATAGCTTGAACTTCTGATGTTGATGAAGATCTAACA ACC

ATAAAACCTCCACCGCCAAGTCCGCTGGCCATTGGATTGACAACTCCAAGGCAAAGT G

CTGTGGCAACTGCAGCATCAACAGCATGTCCACCAATTTTAAGCATGGATATACCAA TT

TCCGAGCATCGACCATCATCAGCAGCAACAACTGCTTGCTCCGATTCAACAACGTCA GC

ATTTTGCTGCAGTTTTCCATTATATCTCTCAACATCTCCGATTAGCCAAATACCAAC GTG

TCCATGGTGTCTAAGGCCTATAACTAAATTAAGATGGAGGAAAAAAAGGAACTTAAA AAT

CAAATG ACAAAG AAGTGCAAATTG CAGGTG CTAAAATAATTGTGTAAG CTCG AG CAACT

TCTTGATTCTTATTTCAGAAGATAATTAGACAATCGATGTTAAAGGTAGGAATATAC ATG

CCAATTCTTTATATTTTTTATAATGGAAATATGCCCGAGAGCTAATGGCGCATTGTT CGA

AACTCAATGGATAGTGGGCCCGCTCCTCTAATTCTCACTTAAAGTAGGATTTTTGTC TAT

GACAATGTTCCACTGCTAATTTTGGTTTAAAAAAGTAGGAATGTACATGCCAAATTT TATT

TTTTGATAACCGACAAGCGATTATCAGAAAAGTGCAACCCGGTGCACTAAGCTTCCG CT

ATGCGCGGGGTCCGCAAAAAGGCCCTTGTGGTCTGGCCCTTTCCTGGACCCGTTGCA T

AGCGGGAGCTTAGTGCACTGAGTTGCCTTTTTTGGTAACCGACAAATCCCAGGGTCA TT

AGCGTATTGTTCGAAACTCAACGGATAATGGGTCCGCTCCTCTAAATTCTCACTTAA ATA

CTAGGATTTTTGTGTATGACAAGGTTTGAACCTGTGAAATGCGCACTCACACATCAC AA

GTTGTGCTTTTACCACTAGACCAAAACCCCACAAGCTTGTACATGCCAATTTGCCAG GT

CAAATATTATCCAATTCAAGAAAGCCTGTAAATTTGACAATTTTTAGGCAAAAACCC AGA

ATTTGATTTTTTTTTAAAATATGTCCTTACGATAAAGGGTCAGCAAAATTTTACGCT CTAC

GGTAATTACTCAACAAATTCTTTCACAATTTCAAGATTTATAACATAACACTTCACT TCTC

TTAAGTAAAGCTAAAGAGAGAAATTCCTGTTAGGATCAAAAAGTCACGTGTCATGCG GA

AGCTAGTAACACAAATCTTGAACGACGATAAATCAAACAACAAAAGAGAAATATACC AAA

AGAGACACAAACATTTAACGTGGTTCGGTCAACTGACATACGTCCACGGCGGAGATG A

GCAATCCACTATATATAAAAGAGAGTTCAAAATATCGAGATAACAACCTCACGAAGA GG

CAAACACAAGTGATACACTAACATTTGTCCCGTAAAATTCTCCCCCTAAACACGACT CTC

AAACCTCATATGGCTACATCGTGGATGTTAGAGATAAAGTTCAATCTCTATAAGTTA GGA

TAGAAATCTCTATTAGTTAGGATAGCTATGTTCTGTTAGCTATATTTTAGGATATAT GATT

GTTCTATTAGTTACCTTATCTCCCTAGTCTTCTATAGTGTGTTGTAGACTGTTGTAT ATAT

ATTCAACTATGTACTCAATAGAAAATCATCGAATTCTCTCAACATCATCTCTCATAA TGCT

ACTGAATGGGAAAGAAAGATCTCAATTTATAGAAGTTCAAACATTTTTCTACCAGAA AAG

G GACTAGCC AACTATG G AAG CATTATATTTTC CTTCTAG G AAAAG AAAAACTGAATTATG

GTAAATATGTTGTTCTTTCCTCCGTGAAATAGGAAAATCAATTATAGTAAAAAAATC TAGA

CAAACACGTAACAATTCCATAATCATGGTGTTAATTAACTTCATTTTTCATAGCTTT TTAA

AGCCCAATTAACGAAATTCCTACAGAATTCAACTGAATATTCTGTTAACAGAATTGC AAA

TACTAAGAAAACAAAGAAGAAGACAAAAAGTCAAAGGTGAAAAACTCACATGAAATG GC

AGTGAGTGCAAATAAAAAGCAAAGAGCAAAACTCCATTTCTTCCTTCTATTGAAAGT AGC

AGG AGAAG G GTCCAACAATG G AG CTTCTAAATTCTGTTTACTC AT

SEQ 40

TCAGTTGTGTCCTGTCAAAG G ATCTACTTTTATGCTTGTG G CAACAATTG GACTTCTAAC

CACATATTTACCTTCAGTTTCTACCCAGCTCAAAGAACCATAAACCACAATATCATC CAT

AACTAATG G AC CTTCTATTCTTAG CTTGTAG CTCAACTTTTC ATACTTCTCTTTG AAAACC

AACTTTTCAGGTACAAGATTAACTTTAAATTTACCCATTGTGGTCAATTTTGCTGTG TATA

CTGACATACCATCTCCAATATTAGTCACGGTCCTCTGGAATTCTTGTATCCTTCTAG GAT

CCGACTCGCTGCTGTTCCCATTGAAAAATCCAATGAAAGATGGATAGTTTAAGTCCA AT

GATGGGTTGGAGCAAGTATAAGATGAGGATCTTGTGATGGTTTTTATTTGTTTGGAT GT

GAAGTTCAGAGCACAGAGAAGATTGACATAATCTTGTGGTGTCGCATCATAGATAAG TC

CAGGATCTAGTGCCTTGTTTGGATCGATATGGCCAGCTCCCATGGCTAGAGGAGTAG C

AGCAGCATTCTTACTACCTGTTGAGATATAATATAATTAAATGATTAAATATATGCT CTCT

CTAACATATAAGCTTACTTGATTCAACATAGTATCAGAGCATGCAAGAGGTCCTAGG TTC

AAATCTCACCGCCACCAAAAAAGTCATAAAATAATTCCAAGTGTTTGGTCCATGAAA AAA

AATCAAACTTTTAGATGAGATGGTCACACAATTCAATATTACCTATGTCTCGGATGG GAC

TTTGTGTGTTGTCCATCGCATTGGAAGTGGTCATCATGGCAGATCGGATGGCTGCAG G

GCTCCATTCAGGGTGTGCGGCTTTTAGAAGTGCTGCTACACCAGAAGCATGTGGACA T

GACATTGATGTACCAGATATAATATTGAAGTTACTAAAAAGTTTTCCTGAGGTAACA TCA

GTCACTGGTGATTGTTGTGGCCATGAAGCTAGTATTAAGGCACCAGGAGCCATGAGA T

CAGGCTTGAGGATACTTGGACAGCTCGGTGACGGTCCTCTTGAGCTATAGGTAGCAA C

TTTTG GTG CTG GTTTAG CACCAATATGTGTCACTCG GAATTC AAGTTTTC CTTTAG GTGC

AGAGTTGCTCTTAATGTACTCTAGAACTTTATCACCCTCTTGTAAGTTCAAGAACAC AGC

CGGGAATTCGCTTTGGAGGTAGAATTCCAAATCAGTTATATTAGTTATGAAGACAGC CC

CAGCAACTTTTGAATTTCTCACATTGTACACATGCTCACTGACCGAATCATTCTTGT CAA

GGCAGACAACAATATTGTGTGCACTTTTTTGCAGTTCCTTGTCATCTTGGCATTCAA CAT

AGACAATG G AG CTTTCACTTG AACTAGAATTC CCAG G GTAG AG CG ATAAGCC AGTG ACT

GAAACTCCATTTCCAAGAGTTAATGCGCCAATAAATTCGCGGTCAACTGTGCCAGCT GC

AACAGTTAGCACCCAAGGTGTTCCATTGTGCAAAGTCTCATAATAAGGCCCTTCATT TC

CTGCAGAGGTGGAAACAAATATACCTTTCTCCAATGCAGCAAATGCGGCAATTGCCA CA

GGATCTTCGTGTAGTGGAATCGCGTCTATGCCTAATGACAAGGATAAAACATCTACA CC

ATCTGTAATTGCTTGATCAATTGCAGCAAGAACATCAGACAAGTATACACCTTCTTC CCA

TAGAGCCTTGTACATAGCCACATGAGCCTTTGGTGCTATGCCAATAGCAGTGCCGGT G

GCATAGCCAAAATAAGATGCACCCTCGACATAACTTCCCGCAGCTGTGGAAGAAGTG T

GAGTTCCATGTCCATCTGTATCTCTAGCAGAATTCATTGAAATGTTAAGATTTGGAT TGT

TGGCAAGTAGGCCTTTATTGAAGTAACGAGCGCCAATGATTTTCTTGTTACACAAAG AG

GAATTGAACTCAATGCCACTTTCACATTCTCCTTTCCATCTTGATGGTACTTCACTA ATC

CCATAATCACTATAGCTTTTACTCTCTGGCCATATTCCAGTATCAACTAAGCCAATT ATG

ATATCTTTACCATAGTCGGACGTTGGCCATACACCAGACTCAGAGTTTAGGCCAAGG AA

TTGGGATGTGTGAGTTGTGTCAATTTTAACTGACATATCCTTAATTGAAGAAACATA ACC

TG GAG AATTTTTTATG G CTTCAAATTCAG AAG GAG AAAG ACTTGCACTAAAACC ATTG AT

GGCATTAGTATAAGCATAGACTAGTTTTGAGGACAAGAATTCTTTGTGATTTGTACT ACT

GTCTGATAAAGAAGCAAGTGTTGTCAAGTACCAATTATGATGGCTAGCAAAAGCTTT TG

GCATGGCTGACAAATCCATATGAATGATATATGTTTCTGGCTTTGCTAGTGAAATTA TAG

AAATAAAGAAGAAAAGCAACCAAATACACAAGGTAATATGACTTGCCATGTTGAGTA ATA

TATTGAAGGAGGATATTTTTTTTAACAT

SEQ 41

ATGGAATTTTACCAAAAACTGGCAACATGTTCTCATTTGTCGCTTTTGTGCTTCATCCTC TTACATTCCATTCAAGTTCAAGGTAGCTACTTTGATCAAGAATATGGTAAGCAGGTACTG AGCTCAGCAATACAAGATAAAGATTGGTTAGTATCCATAAGAAGGATAATTCATGAATAC

CCAGAACTCAGATTCCAAGAATATAACACCAGTGCTCTCATTCGTACTGAACTTGAT AAA

CTTGGCATTTATTATGAATACCCTTTTGCCAAAACTGGTCTTGTTGCTCTAATTGGC AGC

AGTTCTCCTCCTGTTGTTGCTTTACGAGCTGATATGGATGCCCTTCCTCTCCAGGTT CAT

ACACAATTTTTTTACTATCAATCAATTATACCTCAATCGTCAATTAGTTGGGCAGTT ATAT

G CAGTTCG GAG CTAG GTTGTTCC CTAAGG G GAATCAACATATAAAGAAGTAAAG ACG AA

AAAGCCACGGAGATTCAATATATAGTGTATATACAAAAAAAAAATAAAAAAATTGAC CTA

TTTACCCTGTGTAATTTTCGACCCAAAGGGTATCAGTTAACTCCCCTTGGATAAGGT TGC

TCTGCCCCTAGTTATATGAATCTTCTTGTATCTAATTGAGAGGATTCAATATAGTTA AATT

ATTTATGCACCGGTCGTCAACCTAGCACAATCCTCCAACTTTATTTGAATCTGCAAC TGG

CTATGCTTTGTGAAGCTTAAATAGGTGTAGTTAGAAAGAAATATTCTTAATAGTGTG CAT

ATTTAGTTATGGAATGTCTCTAACATTATTCTCGAGTGAATATAACCATAGGAGCTT GTT

GAATGGGAGCATAAGAGCAAAGTTACTGGCAAAATGCATGGATGTGGACATGATGCC C

ACACGGCGATGCTTCTTGGCGCTGCTAAGCTGCTGAATGAGCGAAAGGACAAACTTA A

TGTAAGTTTGTTAACCTTACCCACTTCACTAATGCTGATTCATTTGGAATGTAATTT GTGC

TTGTGTGATTCTTTAACAAAAGATTTTTTGCACAATGTTGACCAATGACCAGATTGT CTT

GTTCTCAGAAGTAATAATATTAGGTTTGCGCTATAGTGATTATGCTGATCATTTTAT CCG

TTGTGCTTTGACTTCTTATCTAGGTTTGCATGTACACTAGGCCTTTGGAGCTTATTC TAA

AAGGGGGTATTTCTTAAACATAGAGGACTGTAAGAAGATAGATGAAAACATTCTTTA ATA

GAGGGGGGTATTAAGTGTACTTTGTCGAGATAATGAAAGAACAGACTCAAAAGGAAT AG

ACCAAAAAGGATATCTTTTTGCTTTGTTATCAGATTTAGTTCACTTATTCACATGTC TCCC

TCGGAACAGTCCAAATTTCATAGCAGTGTCGCAAAAAAGGAATAGTTGTGCTGTTTG TT

ATCGATGATGCTTCTTAACTTGGATATGACCATGTTATTCTTTGATTCTTTAAATCT GAAA

CTTGGATCGTCCTTCTGTGGGTGACTAGCAGTGCCTGTGGGTAATCATTTTTGCCTT TT

CCTTAGATGAACATATAAAGTGATTTTGCCCATTGAACATAGTTGTGACCATTCATG ATT

CATCAATTGTCTCGATGTGGAGAACCTAGCCCTCTGATCCTCCATGGCTTGCGAGTT CA

CATCCAGATGAAGCAACCAGAGAAACTAATTCAGGCATGACGAGAAATTTTCCGGTC AA

GAGAGAGGATCGATCAGAACCTGTTGAAGGAAATGGTAGATGACGGAGCATTGGCCC A

AATCAATTTCTCTCTGGAACCACGAAAAAGAAGCTGAGAAGACCGATAACTTCTATC TAC

ATTACAATAACAATACATGGCTGCATGTATAGGAAACGAGGAAACCATGAATGTTTT TTT

GAATTCTTTTTTGCTTGACCAATAAAAAGGAATTCAAGACTGAACCACACTTTCTAA TTAC

TTGTTAGTCTGTAATTGTCTGACTGATACTATTAGATATTTCTTTTCAACTTTATAA GAATA

CATTTGTCACATGACACTCGTAAAGCACTGTTCGAATTGACTTAATCTGTTTTTGCC CTT

TGTGTGGCATCATTCATTATCTATCCATTCTTGGGGTAGTCTACAATAGAAAGTTGA TTT

GTTGCTTGTCTCTATTTTTATTTTTTGAACCCGAAAAGGGAACGGTAAGACTTGTTT TCC

AACCTGCGGAGGAGGGAGGAGCTGGTGCATATCATATGATCAACGAAGGGGCTCTAG

GTGATG CAG AAG CTATATTTG GAATGCATGTTGATTTTAAAAG AC CTACAG GG AG CATC

GGTACTAGTCCTGGGCCGATTTTAGCTGCTGTTTCCTTCTTTGAGGCAAAAATAGAA GG

AAAAGGTGGGCATGCTGCAGAACCCCATGCTACTGTGGATCCAATACTTGCTGCATC AT

TTG CAGTTGTGG CATTGCAG CAG CTCATCTCAAG AG AAGTAG ATCC CCTTCATAGTCAA

GTATGTAGCCTAATCTCAATTAGAAGTATAAATCTTTGGTTTACACACACACAGAGA CAC

ACAGACACATAATTATGTAGGTACATATATTCCCTTCAGGAACATTTCTTGTTTTAG AAA

GCAGTATAGCATTTGAGACCTGAAGCCTCATTGACAGTTAAGCTGACTGAGATTGAAAT

TCTCATTTCTGCCTGAAGGTTCTTTCTGTTACTTATGTCAGAGGTGGATCAGCATCAAAC

GTAATTCCGCCTTATGTTGAATTTGGGGGAACTCTGAGGAGTCTTACAACTGAAGGC TT

GCTTCAACTTCAAAAGAGGGTGAAAGAGGTAGGTTGCTTACATGAACCTTTGACTGT TG

TTGACTATCAACATCTGCACACTAGATTGTCTGCCAGATGTCTTCAACATGTAGTTT TCT

GTTAAAAAATTTAGTGATTTTTTTGAGTGATGTTTAATAGCCTTAAACTGAGCCTTC TTAG

GTACTGAGAGCTACGTAATCAAATTAATAAGATTAAGGGTGAATAATTCTCGAACAC GTG

TTCACATGAATATAGAAGTCTCAGCTGAATGAATGATATAACTTGTGGTCTGCTTGC AAT

TTTCCCATGAAAATGCCATGTAACTCTAGCATTCATAACTGATCATCTTTCCCTGCTTTG

CTTCTCTTTCTTTGTCAAAATCAATTTTATGCCTGTCCTCAACATAGAAGCTTATCATTTT

TATTATTGAATCCTCTATTTCTATTTCGCATTGTTGAATTAGATGCTAATCGTCTTC AATG

TCAAGTATTGCGGCAAGATCTTACTAATTAATGTGAACAGAACCTAGATTTCTTGTG GCA

ATTTTGTGCATTTGTAACAC ATATTTACATG GAGC CTGCAG GTAATTG AAG G ACAGG CT

GCTGTGCATAGGTGTAAGGCGTACATTGACATGAAAGAGGAGGATTTCCCAGCATAT C

CAGCTTGCATAAATGATGAGCGCTTACATCAACATGTAGGGAGGGTTGGCAAACTCC TG

CTTG GTTC CGAG AACATC AAGG AAACTG AAAAG GTTATGG CAG GTG AG G ACTTTG CCTT

CTATCAAGAATTGATCCCTGGAGTTATGTTTCAAATTGGAATCAGAAATGAAAAACT GGG

CTCTACCCACGCTCCACACTCCCCTCACTTCTTTCTCGATGAGGATGTCCTGCCAAT TG

GAGCAGCGTTGCACACAGCCATAGCAGAGATGTATCTGAATGATTACCAACATCCCA TT

GCGGTT

SEQ 42

TTAGATTTCCTCAACTCGTCTATAAAATAGGACATAGGCGGCCGAGGTTTTGAGCTTGT

CCTGGCTGATGGGATACACATGGCTGTCATCGAAGTCATACCACCGATCAGCACCTT G

CTAGATTATTAGAAGAAAAAAACACAAAGTTAGAATATCTGGATTAAACTGGGAAGA CTG

TAAAAGTCTGAATATTTGACCTTCTTGCATATTCTTCATGATGAAGAAATAAAAACA ATGA

AGATGCATGACCAAAGTTAAATATATAATAGATGCACATATGTGCAATATGTGATTA ATTT

AGATG CAG ATGATG CATTG G AACTAAAAAATACATCAAAG GAACACTCACTACG AAATA

GGAAGTTTCTATATTTGCCCTTGGAGGTAGGTTTCAGTGACCGCCCACCACCCACCC TT

CTCCCCAAGTACTGATCTTTAGATGCCAATTTTATCAGGTATAAAAGAATCCTATTT ACC

ATAG AATAG AAAC AAAAC C AAG AAAAAG AAG C AAG GTAATC C AAGTC G CAG C AC C ACTT

ACATGAACAAACGCAGTGTAGTGACCCCCTCCCATGCTTCCATAATGGTTGCTAATT GC

ATAAAGCATATACCGGTAGGAAGATTTGCCATCTTTGTAGGCCAAATATGAGGATAA AT

CAAGATCATGAGTTGGGAAGTCAACATACGTCTCCAACTTGTTCTTCAGAAACCGGT TG

TACGAGAACCTCTTCAGGTGGATGACCAGAATCTCCGGCAGTCTCCAAAGATCCAAC TT

TTTAGTAGCTTGGCGATGCTGCTTGCATGCAGGGCAGTACCTAATGTTGGATATAAG AC

AAAGAAAGTTAGGCAGATCATATTCATACTTCCCAAGCGAATGACACAAAATTAGAT GAT

AAGATAGAATACTAGAAAGATTTCATGAAAGAAGTCTTTTCCCCCATGAGTGGCCCC AG

GTAAATAGCTGAATTTTATTATCTCAATTACTGTCTGTAGTTGCTATGAATATACAA GAAA

CAAAGAGCAACACAAAAACTATTTCAGAAGCACAATGTGCAGAAAATCAATAGGTGT TA

CATAAGATCATCAGATGCTGTCATTGTTTCTTTAACATAACTGAACTAGAAGTTGGA AGG

GTACTACTCATGCCATGTTGCAAAGTCAAGGTCCAATTTGGTTGAGGGGAGGCTAAT AA

GAATGGTTTATCCACAATAAGCAATCACGACGTGGTGGAAAATATTGACAGAAATGA AA

TAAAATG GTATAATTG G AAAAATACAATTATGTAACTACAAAAGTTGAG GGTTTATAATAT

TAGCATAAACCAAAGAAGAAGCAAAAAGGGAAGCATTACGAGAAATCTCTGAACATT CA

TATGTTATGAATAGTGAATACATAGCTTGCATATGAATGGTGTACAATCACAGAAGT GAG

GGAGTTGCATGCTTACCACATATCTTCTGGCCCTAGAGGCTCTTCCTTCAGAAATGC CT

CAAGACATTTATACAGAGAGACAGATTCTTGTGGTCTTTTGGCAAAAAACCCAGATT TAA

AAACTTCTGGCAGTGAGCTGAAAAGGCCTGTATTGTACTGTTCAAGCATTTTAGGTG AC

CAACTTACAAGTACATTTAACCGTCCAGAGATATCTGTGGACTGTAATGGCTCATTC ATT

ACAATCTCGGAGCCTTTAAAGGTTGCCTTATCATCTGATAGGTAAAATTCAAAGTCC ATG

TCTAAAGGTTCGGCAGTATCTTCTTCAGCAATGCTTTCTGGAACCCCGTTAACTATT GAG

TTGCCAGGTTCCATGTCTGTGCTGACTTCTGAATCTGTACATACTTCAGTAGCACTT CTA

TCACAGTTAAGAGTAGCACTTCTATCACAGTTAAGATTATCTGCTTGGGCTGTAGTG TG

GACTAAGAATGGTGTAAGTATCTGTAGATAAAGACTACGGATATAAGATCCTGTAAG AA

CTCTACTATGCGCGGCAAGCGGAATTCCAAATGTCTTCATATTTGAGGTCAGCTTTC CG

TATATGTAATGCCT

SEQ 43

CTAGCTAACCTGGTTTTCACTAGCCTTGTAATACAGCATTTTCTGCACATAAAACATCAT

GTATCCTTGCGCAGCTCTCACAATACTCTCGCTCACCTGAGTTATCCAGGCATCGTC AC

ATTTGTACCATTGATTGCTTAACCTCAGATATGTTACGTAATGACCAGCATCAAGTT TAC

CAGTATGGGTGATGACAGCAAACAACTCAAATTCCGAGGACGATTCACAGGACGCAT C

TTGCTCGTCCCCGTCAAAGGAGAAGATTCTATTTCCAAATCGACTCCTCAAGATAGA TG

AAG AG AG ATAAG G CG ACATGTCCAAG GAAAAAG G AAACTGTAG GTAGTG ATCAACCTT

CCTTGACATTTTCTTAATCACAGAATGCTCAAACCTTTTGATATGGAAGCAAGAAAC TAA

AGGCAGTTTTCTTATGGACATCTGTTTAAGAGATTCCTGTCTCACTTGACAATGTTG GCA

GAAGAACTTCTGATCAGAACCCAATTTCTCAGGTCTTGTGAAATGATCTAAGCATCC CAT

CAACGTAGAAATTCGACCGTTTTGGCTAAACTTTCCAGATTCTGCCTCCTTTTTGTG AGT

ATTATGAGACTTCTTTGATGTCATCTTGGCGGAACTCCCCTGGCTCAGTTCCAAGTC CA

AGGAGATGTCTATACATGGATCATATGTAGTAGATGTGAAGCCACAAGCTGTACACA TG

ACATCAGACCGCAAGATCCCAGAAAATACTCTATGAGCAATGCAACAGTCTCCGCTG CC

TGCACAAAAGAAAAAAATTAGTCAAGATTAGATTACAGATGACAAAGTGCAATGTGT ACT

GCTCCTAATAAGTTACATACAGTAACATAGTATGCACAGCTTAAAGTGCTCCTCAAC ACA

TTTCATAAAAAGCAAAAGTCCAGTTATGCCATAAAGACAAAACAACAATGGTCCTAT GAG

GAATGGCAACCAACAGAATGAGTAGCTACAAGCAATGGAGTACAATTGACTTTTGTA AA

AAAAACCTCAGTAAGAAAAGTATCCGCAAGACACTTTATTGCCTCAAAAGGTCTATT ATT

CCATTTACACAAAACCATTCAAAATCATAGATCCTTGTATTCATTATTTAACTGCAA CAAC

CAGGTGTTAACTAAATTTTGCAAACTCAGGGTAGCCCTGTACTAATCCTCTAATTAA GAG

GAATAAGAATAGGTGTTAACTCTCTCAATTAAAGTACGTGTGACTTAAGCGGGAATC AAA

TCAGCATATTTACAGTGGCAGAAATGAATTGTTTGCTAAGGAGTTACCCTACTAATG ATT

GCTCTGCTAAAGAAATTTGAGGTACCGAGGAGGTAGACTCACAAGAATAATAAGACA CA

CAGGGGCTTATAGAAAAAAGTAAGCAAAAAGTTGTCAACCTAGTAAAGACTTCAAGC TT

TCTTGAAGGTTGCGATCAGCTCACTCGAGAGTAATACCCTAAAACATGGTAAAAGTG CG

AGTTAATATGAGAACTATTGTACCAGAAGAAAACTCCTATGCTGAAATCAATAAGAC TAA

CCAAACTG AG ACTTACTG G G AAGAGTAG GAG G AAATCTTTTTC CTTTTGAG AAAC ATCT

CTAAGCCTGACAGCGAATATCAACTGGTAAATCGCCTATGGCAAAGAAAAGACCTAA AC

CATAACCTGCATTCAAATATTTCTATCTTTTCTCAGTGACAACGGAAGTTGGGATTG CCA

TG AG AGATG AG ATG CTAGAACAACAAAGATAAG CCTATCAAGTAG G CCCTG AGTG CATT

TTAGTTAGACGTTACTTGACATCACAAAGATGCTTGAACACACTATTCTACTTCCTA CAG

AAATTACTTTTTCCACCCCCTCTCCACCAACAAAAAAGTTCAAAAAATTCACCTACT GAA

GTACTTGACCTTTGCAAGTAACTATACAAATTTCAGTAATCCACTTATTTGAGGTTT TTAC

TGCACTAGCCTCCCTTGACTATGTTAGATTTATGTTGCTTTATTAAAATATGCATAA ACAA

TGTTGCCAATAATTTTCACAGCACAATAAAAAATATTTAATCTCAATGATTGGTCTA ATTC

GGAAGAAAAGGAAAAAAGAAAGAACAAGAAACTAACTATGCAAGGTGATGGGGGGAG A

AAAGATGGGTAACTAATAGATACTCTAACGCAACAAACAAATATACCTGGACTCAAC GC

CTTTCCTTTATCGTTCTGCATCCTTTCATGAATCCCATCAAGCACGGAAATGAAAAA CTC

ATGAGCATCCTGCTGTTCATAACTTGCAAGATTTGATGCATGCTTCCACCAACTGTT CCA

AAGAAAGGTTTACATCCATCAGAATCAGATACAGAACAGACTGCAATTAGATACTCA AAA

AAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN N

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNN CAATATAACAAATTACAAATGTATTATATCACAACCAAGTTACACACC

ACTAGCTAATTTTACTTCAATGAAGGGATAAGGGTGGTACATATAACCCCAACAATA AAA

AATTAATCTTGTCCCTACATCATTTGCAAATAAGGGCACAACCAAGCTAAATAAGGA ATC

TTTTACCAATATATCATTATCCTTAAAAAAGAATTATAAGTATACAACAACTATGCC TCAA

TTCCAAGCAAATCTGGATCACCTATACGAACTTCTCACCCAAAAAAAAAAAAGAACG AGT

GCACATTATAAGAAAGCCAAACATGATCTCAACAATCCAAGAAAACATGATCAAATG CA GACCTGTAGAGGAACTTTGCAGGACTAATAGGGGTCCAATCGCCGGAGAAAACAGCAG

AAAACATTGCATCCAAATCACAAGCTAAACACAGCATTGTTGAGTTCTTATTCCCAT TAT

CACTACTACTCCTTGTTATAACACTGTTATTCTTTCGCTGGCAAAAATATCTGTTAT GCTT

GTCACTCAGAAAGTAATTCCTCAATGGTGGTGTATGAAGCAATGCTTGAAGCACTGA AT

TCATAAAACACGTGTTTCCAAGATTGTTAAGGCCCCTCAAACCCCATTGTACTTCTG GG

GTTGTTGAGTCATTGCCGAGCTGACTCGGCAACGGACTCGAATTCCCAACGATCAAG A

CCTGCTCTTTCACATCAGGCGTCCACGGCTTATACTCCACGCGCCTCCTCTTGCGCG T

GTTCTCCGGCTGCGGCGGAGGGTCTTGTATTGATCCGATCACGGTAGCCTCCGTCTG C

GCTAGTACTACGGCGGCGTCGAAGTCTCGATCGTATACCTGGTCCCTACACCCGCAG C

AGAACAGCTCGGCCCTGTCGATGTCCACGGCGATGCTGTGCAGCGAGGGATCGGAAG

CGTTCCCCACCGGATGCGACGGCGCGTGCACACGGCAGAATACCGCGGCGCACGTGA

CGCAGGCGTACAACCGCGGCGGCGCCTGTCCGCACGCACCACATCTCACCAGCTCAT

TCGGCGGCTCACGGCGGATGGACGCACGCCCCAATGGCCGGACCTTCACGCAGCCCC

GGAAGTTGAAAAATGGGTTCGACCCAACTCTGGATCGGAGCTCCGACAAATGCTCGC A

TGTACCACTATCGGTATAAACAGGCGACCCGTTTTTATTTTCGGATCCGGGTTGGAT CT

TACCGATTGACAAATGGATCTTAAGACTGTTCCTAGACAT

SEQ 44

TCAAGAACTGCTCTTCCTTCCTCCATTTATGAAGGCCTCAATTGGTTCGGCATGGATGT

GCTTCATTGCCTTCACTCCCAGCGGGTTGTCCTTGCTCTGTATCAACAAATGCAATC AC

GAG CATTGTTTAG AAAG AATG G CGAG CAACATG CTACAACCAGTTAAATTACACTG ATA

ATATCAAAATTCTTTGGACGATCAATGTATAGAAATCAAACTTCTCTTTAGAAATAA TAAG

TGCGAGTCACTTACTATTGAGCAGGTGCCTGCACGAATATTGCAGACAGGATAGTCG T

GTGGGCAACAACTGTAGTGATCTTCACAGCAAGTGGCTCCTTCAAGAGGGCAACATC C

C C AAG AAAAG C AG G AGTTATAGTATTC AAAG AC AC AAC AG C AAGTG GTAC CCTCGGGG

CATTGAG CATAATC ATCACACTGAGTAG GTG G CTTG ATTG G AG ATG G AG G AG AAG GTC

CAGGTTTAGGGGGATTTACGCCTGTCTTGACTGGGTAAGAAGGCTCTATGGCTAATC CA

CACAAACCTTTAGAACTGGCAACGTTACGTTGAACCCGAAGGTAACCTTTCTCTCCC CA

GGAAGCACCCCAAGAGTTCCTGATAATCCAATAATCCATACCATTCTCACTACCATA TCC

AACAGCAACCACGCCATGGTCCACTGCAGTTCCACATTTTCCAGTAAAGATACCCTG CA

AAATATATGTATTAGAACCATATAGCTACTAGCCAATAAGAACTTCATTATAATAAG ATTT

AAATATAGAATTGTAAGTAGTGATAATTGGTAAATCATAGTTTTTGAATTATAACAG GAAT

CTATTAACTCATGCAGCCAGCGTTATCTTTTGCTACATTCTTTGGTCGAAACAAAAC TAT

TATATCTGATCGAAAGGTAACGTATTACTGGAACGACGATGTCTTTCTTTGAATTCT GTG

ACTAACTAAAAGACTTCAGATTTTGCATGTTCACTTGTATTTATTACTTTCAGAAGA AAAA

GATAGGTCAATGCGAATAAAAACATTACCGATACATAGTGTTGAAAGTCGTGGCCAC CA

G CTTCAATG G CAATG CTCACAG GTTG ATTTGC AACAG CCTTTTG CAGTGCCTTTTC ATTA

TTTGCAGGAACATCTTCATACCCATCTATGGTAACAACCTTGGCATTTTTCTGCATG ACA

CAAATGTTTCAATAAATAAATATTCAACTCACTACATAAATCAATATATAAAGTGAA TTTA

GTCATTTCAAGAATGCAAAATTTACCCTTGACTGGTCACATCTGCCATCACGGCCTG TG

TAAGGGTAGTCTTCCTCAGTGTCAATTCCTCCATTTTTGATGATGAAATCAAAGGCA TAG

TCCATAAGACCGCCATTGCAGCCGTCATTGTAGGAAGTATCACAATCCACCAGTTCT TG

CTCCGATAGTGAAATCACATCTCCGGTGACTATCGAGTTCACTGCTTCAACGGAAGC AA

TTGCAGAGAATGCCCAACAGCTCCCTGGTTCATTAACCGATAGATTAATACACAAAT ATT

GAGCCATACAACATAAATAATGAATTGAGAAATTTGAACTCAGTGATTTATAAAAGA AAC

TGATATGCAGGCATTAATAAAGAATACACGTCAGTGTCACATGGTATTTAGAAGGCC CG

TCGGGTCACTTCGTGTAACATTTGAGCACATTAAGAGGTCGTTTGGTAGAATGTGTT AG

AGAAAATAATGCATGCATTAGCTTTGTGTATTAGTAATGCTTTGTTTGATACACTTT TTCA

ACCTATGTATAACGGATACAAGCATTAGTTATACAGTCTATTTGGTATTATCCTATG TATA G CTAATG CATAG AAAACCATGACATTAG CTATATCG AGG CTATTAATACTTG CATTAG CA

TGGTTAAAGACAAAATTATCCTTAAAGTCCCTTAAGTAAAGAATATGGAGGGCATTT TTG

TAAACAATTAAATATCTAAAAAATTATGCAATGCATTTTAATTTTTAATACACCACA CCAAA

CAATGCATAAAAAATAATCTCTGTATAACTAATGCTTGCATTACAAACCCCTGCATT ACTA

ATGCACCTTATTTAGCATTATTCTTATACGCCCTACCAAACGACCCCTAAATGTTGT AGC

AACTAGCAAATATTCTAGTGCCAAAACTCAGCATCAATTGGTCAGCATTTTCTATCA TTA

TCAAGTTGTGCTGTTAAGCTTTTCACCCTATAATAATTGCTCATGCGTATAGAAAGA TGG

GGTCGGGGAATAATGACTAGGAAAAAAGAAAAGATGGGGTCCTGGAATAATATGATT AA

AAGGGAAATTCGGTGCCCTAAGCTCAGGGTATGTGCGAGGTGCGAAGAAGAAGTGGA T

CATAAGGATTTATTGTACGCGACCTTACCCTGCATCTTTCTATACACCAAAAAATGT AAA

GAATTGTTACATATTACTAGGTAAAATTTGATCTATAACAAATAACTCTTCATATAT ATAAT

AC C G G CATAG G ACTATTTAAAC AATAAAATAAC AAACTGTTAC AC C C AATTAAATTAC AC

TAATACTATAAAAAATTATACTATCAAAGTATAGAATTTAAACTCGAAGGAAATTGT AAAA

ATAGCACGGTATAGCCAGTTTTCGGATTGGTCATTCAAAAATAGTCAGCGTTTATCA AGT

CAATGAAAAATAGCCACTATTTTGCTGCAACAAAGACCGATCCAACATAATATACTG GAG

TTCGGTGCAACTGTGTATGAACTACAACATATTATGCTGGACCGATATACTTTGTTA GCT

CCAGTATATTACTGTAGCACCGGTGCTCCAAACTCCAGTATATTATGTTGGACCGGT AT

ACTTGATGGAACTCCAGTATATTATGCTGGAGTTCTAGTGCGCTTATGCAGATAGAG TT

CCAGCATACTTATCCTGGAACTCCAGTATAATATGTTGGAGTTCAAGTATACTTATG CTG

GAACTCCAGCATAATATACTGGCGTATTTTCTGAGTTTTAAACAGTGTTTTCGCTCA AAT

TTATCTTTACATAAAAAGTGGCTAAATTTCGATTACTTTTAAAATTTGGCTATTTTT GAAC

GACCAGCTATTTTTTATTTTCACACAAGGAAATCATAAAGGTATCCTACGTTTATGC GTA

GACACACAAATTGGCTTTCTACTCATAGAGGACTAGAACTGAACGCCACCATACTAT TG

ACATGAGAGTTTATTTAAGTGCAATGAAAGTGTAAAGAACAGAAAAAATATTTAAAT CTG

ATATAATAACATAGAAAATAATTCTTGATATCGATTCTAATCTAATAAAATAGGAAA TAAT

AAATAACTAGTTATATTTAGAAACAATTTAACTTTAACGTTTTCATTTTATCTATTT TATCA

TTAGAGAGAAACTTTTATAATCACACGAATGTTACCCACAAATCTTTTGCCCTTGAC CTT

TTAGGACCACATGATCAAAAGTTTTCTTTTCTTCTTTTTTTTTAAAAACTTTATATC AAGTC

AAATTATATCATTTAAATTG AAACG GGTAGAGTATTTATATATTTTAG CACAATAAG G CAC

GTATGATTTCCTTTGTTTGTCAAGTTCGTAGAAATACTTTCTACATATAAATTAATT ATGG

TAGTG G AATACC AATAG CCTATCTCTATATG CTTTAATAACAAATTAAATCAG G AAAATAT

C ATCTAAAAC G C C GTCTAATTAATTATAG ATC C ATAAC C C AAAAAC C G G AG ATAAAC G AA

AAAGACAAAGCCATTGCTAATTAACTTACCACAGCTTCCTTGATCCTTAACTCCAAC AAG

AACACCTTTCTCTCTCCAATCAACTGAGTCCGGCAAGCTATCCCCAACTTTAGGAAG AT

ACCGATCGCTTTTGTTTTTCAACAACCTGCGACGATCACTGGTCTTAGTACCTAAGT ACA

TG GACCTGTACTC CTCGTTG GTCAG ATCAG CAAATTTG GTTAAACCAAG CTTGTAACTC

TTGTTTGGAACGGAGTTTTGTTCATCGATGTATCTTAAGTTATCTTTAAAGATCTGA AAC

CGCTTGTCTTTTTCGTCTAAGGCGTTGTACGATTTTCCATGTTCGAGTAGCCATGAC TC

GTACAAGGACATGACTTCATCGTCCGTTCGAAAGTGTTGGTTTTCGTCGTAGGTTAA GA

TGGACATGTCGGAAGCGGAAGATAAGGTGGAGAAGAAGAAGAAGAAGAGGAGGAGAA

GTAGGGATATGGATATGGTGAGAGTGGAGCTATGAGTTGCCAT

SEQ 45

TTAGAGTTCATCCTTAGGTGCTGTTGCAGGAGACCCTGTGGGGTTGCTTGATGAAGTCT

TGATGGGGTAGGATGGTTGCATTGCTATACCACACAATCCCTCTTCAGCATCAATCT CG

CGTTGCATCCTAATGTATCCTTTTTCTCCCCATTCAGGTCCCCACGAGTTCCTCACA ATC

CAGTATTTGGTTCCATCAAGGGTTGTGCCATAGCCCACAATTGCCACACCATGGTCC AA

CTCAGTACCACAGTCTCCGGTGAATACACCCTGCCATATTTACAGTTCGTAAATGTT TAT

ACCTAGTAAAAAACTTTTTAACTTGAGATAAATGGTCTATCATCATTTACCTCAGAG TAGA ACTGG AAGTCAGAAC CTGAAGCTTGTATAG CTACAG AAACAG G CTG GTTG G CTACTGCT

TTAAGTAGGGAATCCTCATCATTAGGAGGAACATCCTCATATCCGTCAATTGATACC ACA

G GAG AATTCCTCTG CCAATTCC ATAAAATTC ATG CACGTG G ATTAG AAACAAGACTG GT

TCGATCTGACAGACTGACACCCTACAGATGTAACAGAATCTTACCTTTTGAATATCA CAC

TCGCCACCTTCAGCCATGTATGGATAGTTCTCTTCAGTATTGATGCCTCCCTTCTTC TTG

ATGAATTCAAATGCCATGTCCATCAACCCTCCATTGCATCCTTGGTTTTGACTAGTG TCA

CAGTCAACAAGTTCTTGTTCTGATAAAGATACTAACTCATTTGTTTTGATTTGGTTT ATCC

CCTCTACTGCAACGACAGTTGAAAATGCCCAGCAACTTCCTATAACAGGCAAAAGGT CA

GTTTCCATCAGCTATAATATTTTGAAAGAACATATCATATGGTTTACCCTTATGTTA TTAT

G CTAG AGGTGTAAAG CG G CATAG AATTAATG ACATG CTACTCTTTTCTTACCACATTTGC

CTTGGTCTTTGACAGGAGTAACAGCACCCTTCTTCCTCCAGTCAACAGAGGGAGGGA C

ATCTTCCACATTGGCGTACATGAAAGTTCCATTTGCTCGTGAAGCTCCAAGAAAAGA AC

GATGATGCTTAATCTTGGAACCAGCATAATGGTGTCTGAATTCATGGTTAGTCATGT CTG

CAAACTTGTTCAATTTCAACTTATAAGGCTTATCCTTCTTGTTGAAGTTGTGAACAT AGTG

TACATTAGCCTTGAACACATTGAACCTCTTGTCTTTCTCATCAAGGCTCCTCGATAC AGT

GTGATGGCTTCTCCATCTCTCATACAACTCCCACAATTTTTCCTCAGTTTCCAACTC CTT

CTCGTGGAAATCGAAACTCTCCCCAAGCCTAAGTACCAAAGCCAAAGAGAAAAGAAC CA

GAAATAACTTCTTCAT

SEQ 46

AAAACCAACCTGTGAGACATTAACATCCAACTCTTGGGCAATGAAATGGGCAAGTTCTG

GAATGCGAGGCTTCAACTCTGAGCAGTTGACACTTAGTGACAAGTAAAATGTAATGA GC

CCAACTTTAATTTCCACTGCAAGTCGACATAAAAGATGAATGTGATTACAACCATAA GTC

TTTGTAATGGAATTATCTAATTTCAATAGCCATCATATCTGCACCGAAGCCTAGCTC AAG

TTTGTGAGAATAGATGTAGTGAAGTAGAAAAAGGGGACTAATACTTGCAAAACTAAA TTG

AAATCTTGAAAAGTTTTACAGCAGATAAAAGTCAAAGCATTTGAGATTATGCAAACC ATT

GAAGAGGTACATCAAATTGAAATAATACAAAACAGGGCTATGTTTCAACAATGCAAA CA

GGAAATATTAGGCAGGAAAAATTTTGCGATTCTGTCATTACTTTAAGGTCTTGCCAC AAA

TTTCTCATGCTTGTCGTTGTCCAGTCACAAAATTCACTAGAAATTTGACAATTGATT ACTA

TAACTTAGTGGATGGATTTTCAGATAGTCGGTATATGGTCAATGCATGTTCACTTGG TAT

CAGTTGTCGTAGTCCTTAGAAATAACTTTTTGGTCCCTTGATTATACCATATTTGTA CTTT

AGATCCCTCAACTATTCAGCTTTACACATTAAGCCTACAATTTAACGAACTTTACAG ATG

TAGTCCAATAATAAACAAAACTAACTAACCCACGACATATTCATATTTCAAGTCTCC TTTT

TTAATAATACAAATTTCAAAAGGAGCATTAATGTTGTAAAAGTACCGTTGCCTACTA AAAT

ATCCCAAAAAGATGAACACGCTGCTTTGGAAATGGAACGCACCAATCACACGAGTTG C

GGATTAACAAAATCTAAAAATTCTAGTTCTAATTAAGATCTGACTAAATCTGCAAAC TCGA

CAAATTAAAAG GCAGATGTCAAAGTC GG AG AGTTG GACTAAAAGTG G AAACAG GG G GA

TAACAGGGGACCAAAGATTTTCTCAGTATTCCTTGAAGTTATTATAATAAATTTCCA GTTT

AAGTAAATCTTTCTCAAACTACAAGAAGGCTGAGAATGCTGTGACATCAGTACAATT TTA

TG CAG G CAGTTG CTTCTTAGAAATTTTAAAATACCAGG G AAACAAGATG ATTACAAG ACT

AATTTCAG AG AAAG GTCAGATGTC AACTTGAATAG G ATT AT AAACAG GGATCTTT AC ACA

AATAGCCGGCTATATTCATGTTTACTTTTTCTAGCCATATACACAGATTATACATTG ATGA

TACACAATTATGCACATATAATACATAAATTATGCATTCACACAAATACCAGCATTC TGGA

CATAAGAGACAGAATGTTGATTGCCCAAAAATGATCTAATCGAAGGCAACACATCAA AA

TCAGCATGATGCAATTCTAATTTTGATTCTCATTTTCATAAAAGAAACCACAAACCA TTAT

AGTTCAAAATTGAAG G G AAAATTG AAAAGG G AATTGTATTAATCTATTAGAAAACAG AGC

TAGTAGAGATGCGAAAAATGAAACCAGTTCAGGTTATGACAGCATTCTAATGGGCAG AG

TAACCTTACAGTAAAAATACTATGTGAAGAAAAGCTGTCCCTTCATACCAGGTGTGT TGT

ACCCAGGGGGTCCACTAGGAGCTGATGAAGGAGACAGGTGCGCACTGGAGTTTGTGT TATCCAAGCTTGAGACCGATGGTGAAGGTGGAGATGGAGGGGATAAATTGAGTCTGTC

CCATAACTCAGAACAATTAGTTTTCCAAAAACCAATTCTTTTATTTTCACGATCGTA AGTA

ACAAGAGTGTTGCGAACAACGATTCCTGTTGCAATTAAAAGCACGCATGTTACAGCA CC

GAAGACCTAGGACGATGAACTTTGTCAAGTAGTTAGAAATACCTCCAAGAAGACTAG CT

G GATTCTTTCCATTCG GG AAAATTC CTAG G CAATAAG CAC CACGTACTTTG AAGTG CTTT

ATAGCATACAAGTTACAATGAAGAAGGGGGAAATTAATCAATACTAAATCTTATAAA GAA

AAAGAAGCCAGTAGATGTGACGCCAAAATAGTGTCGGCAGATAGTTAAGAAAGAACT AA

AACAGAATAAAATGCCTACCTGAAACAAGTAATTTTCAGGAGAGAGAGTTAGTTTCT TTC

CATCGCTGAATACCATATCGACACGCGGAAAGTTCTTTGAGAGTTCTGATATGTTGC TG

G AATATATAAG AC ATTAAATAATAAG G GTAAC AC C AAC AATAG G AAAAAAAAAC AAATTT

AGACAAGAAAATCATAGACGTCTAATTAATTTTGGAACTCTCCTTATAAATACCTTC CAG

CACCAGAAAAGCAGATATCTTTAAAACTAGGATCTGGCCCTTCAATCTGTTTTAAAG AAT

GAAGCTCTTTCACTACCTGAAAAGAAAAGTTGAAGAGGTTAATCCCAAAAAGGAAAA TA

CTTAAGTTTATCTAAAGATAATCGCCACAAGTTACATACTGTATAAATTCACAACCA GTG

ACTCCACTTTTCCTTCGCTAAATGTAACATAGCAAAAGCAATGAAAGACAACATCAA AGA

TATGCAGAAACTTGCAAGGAACTCTGTATATGTAGATAATTACTACCTGCTAACATC ATT

GGAAGAAAATGTTGTGTGGTGTATTACCAAATTCAGAAGTTTAAAACAGGAAAAATC AAC

TATTATAAGTGGTTAAGCAATCTAATTAGGTCCTTTAATAGCAGAGAAACTAACAAC AAA

AGAAGGGATAGAGGCCATAATATAGCCTTCCTTTCTTTTGATAAAGTACATAGCCTC CAA

CTCATCAC AG G AATAAGTTAG GTG G CAGAGTTATAGG ATCACAAGATAC CTGGTTCTTA

CTTTTCTTCAAAAACATGTTCTAGATATCAAAGTGTTAGCTCTCAAACTATTATTGA CCAA

GATGTTTTAGTTTCAGTCCCACTCATGTACAGGAATCTCCAACTATATATTTGATCC ACG

TCACAATAGGACATGATGGTTTGTTTCACCAATCAGTGGTGTAAATAGAACTTTGAA TGG

CAAGAAAAATTTAACAGATTGAGTTACATATAGAAAACACATTATTCAAATGAATAT CTTT

TCACATCTAATTTCTTCTAGGACATTCTGCTCATCAATAGCGTGCTATTCGCCACTA GTC

ACTTTCTCAGACAGAACGAATCAAACAAACAGACAGAAACATTGGCAGATATGTAGA TA

AACTATATCTTACAGCATTCTTGAAAGCTGCAAATGCTGCTTCTGGAAGGTACGCAT AG

GTGGTACCACTATCAAGTATAGTCCCATGTTTTCCACCAAAAACCCGTGGATTTAGG TTT

AGCGGCTTCCCAGCGACATGTATCTCCTTCAGGTCAATATTGTAGTACGGGCTGTTG CC

ATCCGAGACTAAAGTAAATGCGTAAGATTTATCAAGTGTATTTGCTAGGAGGAGACC TA

ACATCACTGTGTCTCTAAACAATTGTTTCCAATACATACCTGTGACCAAAATCTGAT TTG

GTAAAGGCCATGTCAGCAGGGGGTTTTACTCCACCAAGAACCATTGCCCCGCCACCA A

AATCCATCCCTCCATAGCACAAGGAGAAAGAATCACTAATTACATGTTTTTCAACAA GTT

GATCAACTATACTAAGATCACCTCGGCCCAAACCCATTATACCATCAGCACGTTGGC TG

TAAAGATCACCAGTTTCCGCAATTTCACATCCAAAAACAGCTCGTTGTGGTGCAAGC TC

ACTTAGATTTCCAAAAGATATGATGTCCTCTCCAAGCAACCCATAACTTGCACTCAT CTC

AGCGTACCGTCTCTCATAAATACATTGCTGCCTCTTATGGTCGCAGGGACAAGCCTT AT

TGCATTTCACAGATTGATAAGTGCTTGACATTTCCGGCTGAAACTTAGGATCCTAAA TAA

GACAACATACAGAGTACCACCGATCAAAAGACAAAAATCAAATGCCAAGCAATTCTA AG

TGCTATATTAAGCATTTCTCACAAAATTAATAGGCTGACTCAAAGTAAAATTACTAG TCAT

GAAGTTTCTTAAGTGCTGATATTTTCTCCATTAGGAGATCTTTTATGATTACAAGGC AAAT

TGAGCAAGAATCACATTTAAAACATCATGAAACTATACAATGGATTTGTACAGCTTA TCA

ACAAAGAGAGGCTTAGAACTATTTGTATCCTAGTTAGATTGGTTTGTTCTTGTTTAC CCT

CTTTCCCTTAATACTTACAAATAACTGCATCCTACTAAGCGATTTCCTCACAAACAA AAA

GTATATTAAGTAATGTTTATGAAATAGCACCTATACATAAACAGTTTCAAATTTTAA TTTC

CATATCAAGCTATCAAAACACACTAACTGCAAAATTAAGATAAATATGTAACCTTTA ATGT

TTATCCAGAAAAAGAAAAGAGGGAAAAAACCTGATGGTTGCCACACTTTTTACACTC AG

AGCAAGGGACATAGGTAACTGTACTCCCTGTATCAACAATAAGAGCGAACTTCTGCG GT

GGTGTTCCAATCCAAATATGAGTTGTATAGTATCTGCATCACAAGTTACTTGGAATC CAT

TCAACAAAATAAAAATTACTAATAAAGAAATTAGTCTGAACTAGAGGTATATAGAAA TATA

TTCCACAGTGAAAACGCAAAATACGCATGTGAATCAGCAGCCAAAAGAGTTAGTAAC AG TGAATTTAAATTTTCTGAGCAAAAGCTACGAATTTGAACCCGTTGAGGAGGAGATCATCA

TGGAGAGACATGCGAGCGCTGGCAGGACTTTTCTGGAGGTGGCGACGGGAGATTTCC

GCACGGCGTGAAGTGTCTTTCGGAGGAAAGAGCGGCAGCAGCATGGTTGTGTGACGG

CTGCCGTCGGCCGGCGAAGGGAGGAAAACGGAGCTGCCGTTAGTAACATCAGATAAT

CGGAAGCCGGAAACGACACCGTAATGGATCAACAGAGAGATGATCGCGAGAATAACG G

TGAACTGTGGCCGTGCCAT

SEQ 46

TCAATATTGTAGTACGGGCTGTTGCCATCCGAGACTAAAGTAAATGCGTAAGATTTATCA

AGTGTATTTGCTAGGAGGAGACCTAACATCACTGTGTCTCTAAACAATTGTTTCCAA TAC

ATACCTGTG ACC AAAATCTG ATTTG GTAAAG G CCATGTCAG CAG G GG GTTTTACTC CAC

CAAGAACCATTGCCCCGCCACCAAAATCCATCCCTCCATAGCACAAGGAGAAAGAAT CA

CTAATTACATGTTTTTCAACAAGTTGATCAACTATACTAAGATCACCTCGGCCCAAA CCC

ATTATACCATCAGCACGTTGGCTGTAAAGATCACCAGTTTCCGCAATTTCACATCCA AAA

ACAGCTCGTTGTGGTGCAAGCTCACTTAGATTTCCAAAAGATATGATGTCCTCTCCA AG

CAACCCATAACTTGCACTCATCTCAGCGTACCGTCTCTCATAAATACATTGCTGCCT CTT

ATGGTCGCAGGGACAAGCCTTATTGCATTTCACAGATTGATAAGTGCTTGACATTTC CG

G CTG AAACTTAG G ATC CTAAATAAG AC AAC ATAC AG AGTAC CAC C G ATC AAAAG AC AAA

AATCAAATG CCAAG CAATTCTAAGTG CTATATTAAG CATTTCTCACAAAATTAATAGG CT

GACTCAAAGTAAAATTACTAGTCATGAAGTTTCTTAAGTGCTGATATTTTCTCCATT AGG

AGATCTTTTATGATTACAAGGCAAATTGAGCAAGAATCACATTTAAAACATCATGAA ACT

ATACAATGGATTTGTACAGCTTATCAACAAAGAGAGGCTTAGAACTATTTGTATCCT AGT

TAGATTGGTTTGTTCTTGTTTACCCTCTTTCCCTTAATACTTACAAATAACTGCATC CTAC

TAAGCGATTTCCTCACAAACAAAAAGTATATTAAGTAATGTTTATGAAATAGCACCT ATAC

ATAAACAGTTTCAAATTTTAATTTCCATATCAAGCTATCAAAACACACTAACTGCAA AATT

AAGATAAATATGTAACCTTTAATGTTTATCCAGAAAAAGAAAAGAGGGAAAAAACCT GAT

GGTTGCCACACTTTTTACACTCAGAGCAAGGGACATAGGTAACTGTACTCCCTGTAT CA

ACAATAAGAGCGAACTTCTGCGGTGGTGTTCCAATCCAAATATGAGTTGTATAGTAT CT

GCATCACAAGTTACTTGGAATCCATTCAACAAAATAAAAATTACTAATAAAGAAATT AGTC

TGAACTAGAGGTATATAGAAATATATTCCACAGTGAAAACGCAAAATACGCATGTGA ATC

AGCAGCCAAAAGAGTTAGTAACAGTGAATTTAAATTTTCTGAGCAAAAGCTACGAAT TTG

AACCCGTTGAGGAGGAGATCATCATGGAGAGACATGCGAGCGCTGGCAGGACTTTTC T

GGAGGTGGCGACGGGAGATTTCCGCACGGCGTGAAGTGTCTTTCGGAGGAAAGAGCG

GCAGCAGCATGGTTGTGTGACGGCTGCCGTCGGCCGGCGAAGGGAGGAAAACGGAG

CTGCCGTTAGTAACATCAGATAATCGGAAGCCGGAAACGACACCGTAATGGATCAAC A

GAGAGATGATCGCGAGAATAACGGTGAACTGTGGCCGTGCCAT

SEQ 47

ATGGGAGCAAAATCTTTTCTTGTCGCCTTTTTCCTTTCATTGCTGTTATTTCCTTTGGCC T

TCTGTACATCAAATGATGGCTTGGTTAGAATTGGTTTAAAAAAGATAAAATTCGATC AAA

ACAACCGACTTGCTGCACGCGTCGAGTCCAAGGAGGGGGAGGCTTTGAGGGCCTCTT

TTAGGAAGTATAATAATCTCCGTGGTAATCTTGGGGCCTCTGAGGATACAGACATTG TA

GCACTGAAGAATTATATGGATGCTCAGTACTTTGGGGAGATTGGTATAGGCAGTCCC CC

TCAGAAGTTCACTGTCATCTTTGATACTGGTAGCTCTAATTTGTGGGTGCCTTCATC AAA

GTGCTACTTCTCAGTAAGCTTTCTATTACATTTTTACTGTCATAAAACATAACAGAG AAAG

CTAATGTTGGCGTATGCATAATTGACGAGCATCCATATTTATGCGTCTCTGTATTTA TGC

AGGTTCCATGCCTTTTCCATTCTAAGTACAAGTCAAGCCAATCAAGCACTTATAAGA AAA ATGGTTTGTGTCTTGACCTTTGTCTATAGCTGAAATTGCTGCATGAAAACATGCTTTTCT

CTTAAACTTGTTATTACGCTCAATGCTTGCTTGTAAGAGAAAGTGTTCAATTATTGC GTTT

TGAGATCAAAACTGTTAACCCTGCTCCCAACTTAGGAGATTTAAAAAAAAAAAAGAA AAT

AAAGAAGACCCTTACCATTCTTATTGTTGTCATCCAATTATGTGCCTTGCACCAAAG ATT

TCTGTTGAAAAATATAACATGCGAGATTATGTTGTTGGCTTTCCCTCCCAAAAGATG TGC

TAATGTTATATCTCTGATTTTTTTCTTTCAATTATTGGCAATAAAAGCTTGTGCCTT TTGAA

CCGTTTTGTCTATCGAGGAACCTGTTATGGTGGAGTTCCTTTATTGAGTTTTGGTAT CCA

TCATAATTTACTTTCCGGGAAAATTGGAGTCTGCTGTGTGATTGACATGACATGATT TTT

GATTATTCTTCTCTGTCTGCTTTCTAAGTTTCTACATTCTCGGTAGAGGTAAGATAT GCG

TACTATCTACCCTCCCCGGACCCCACTTATGGGACTAGGTTTTTTTTGTTGTTGTTG TCG

TCATCTACTTTCTAAGTTG GTCAACGTGTTCACTTG GTTGTTG ACATAAGAAC CTGTTCA

TTCAAACTTTTTTCCTGTTTAATATGCCATACAGGGAAGTCTGCTGCCATACGTTAT GGT

ACTGGAGCAATATCTGGATTTTTCAGTCAAGATAGCGTTAAAGTTGGTGATCTGGTT GT

GAAAAATCAGGTGAATGTGGCTTCCCACTTTGTGTGTGTGTGTGTGTGTGTGTTTTA AA

ATGTTTCTCGAGCATATAGTCTCTCATCTTGTTAATGACATCAGGAGTTCATCGAGG CAA

CCAGAGAACCCAGTGTAACTTTTTTGGTAGCCAAGTTTGATGGTATATTGGGTCTTG GT

TTCCAGGAGATTTCTGTTGGAAATGCTGTACCAGTATGGTATGTGGGTTTATTTTGT TTG

TGTTCTCTTCTTTCCAAATGTTTCTTCAATTTCCTATTATCCAAGTGCGTGCCTTGT GAAT

TTCATTATTACATTGAAATGATTTTATCTTCTGGACAGAATTTCATTAACATCTCCT TCTG

TATAAAGGTTTAAGTGATACTGGTCTTGACAGTTTCTTCTGTGTTTTATAGGTACAA CAT

G GTC AAACAG GGTCTTGTCAAG G AG CCTGTCTTCTCATTTTGG CTCAACC GAAATAC AG

AGGAAGATGAAGGGGGCGAAATTGTGTTTGGTGGGGTTGATCCTAACCACTATAAGG G

AAAGCACACTTATGTCCCAGTCACACGGAAAGGTTATTGGCAGGTAGATATCCCTAT AT

CTTTGGGAGATTGATGTTTGGCTTTTGCAACCGTTTTCTTACTCTCAGAATATAATT TGC

AGTTTGACATGGGTGATGTTCTGATTGATGGTCAAGCTACTGGTATGTTATGTTACT TCC

TTTTCTATTTTTTTGTGTG G AG ATTTCGAG GATAAG ATGAGAG CACTTTCACATG ATTTC

CATGCTTTTTCGTTGTATTGACATACTGAATACTGTAGGTTACTGTGACAATGGATG TTC

TGCAATAGCGGATTCTGGGACTTCTCTCTTGGCTGGTCCAACGGTATTCTCAAAAAC AT

GTTCCATTTTTTGTTCCTCTTATTCAGCTATTATCAATAATGAACTGTCTCATAATT TTTTT

TGTACCGTCCTGTTCATGTGTAGGTTTAATTTTTTCGCTGGAATATGAGTTGAATAA TAA

TCAGCCATTCATTTGAAGTATTCTCATTTTTTCCGTTTCTATTCAAAAAAAAAGGAG GATG

GCAAGTGCAGTGATATTGATATTCATTCCAGTATCTGGACATACTTCCTTGTTGATT TTC

ATACCTAAGAAATGTTTCTTTTTACTTTTGATCTGTTGTTTCTGTCTTCTTTGTGTG CTCTT

CTTCTTTATTAGGAAAAAAATTGTGCATCTTGAGAACTGCTTCTTAATTGTTTTCTT TTAT

G GCATG GTTG ACAATATG ATACAAG G AAAAACTGC AG CTTCTTTTGTCTAG ACAATTGTA

GTAGTGAAATGCTTTACTACTACATTTCTAGTTCTCATCATTCTTCCCTGTATCCTT CCTC

CTCTATCTTGCAGACTGTAGTCACTATGATTAATCATGCCATTGGCGCCTCGGGGGT TG

TAAGCCAACAATGTAAAGCTGTTGTTGAACAGTATGGACAAACAATAATGGATATGC TTT

TAGCAGAGGTGAGCAATTATTTGTTTTAGTTGATAGTTTTTTGTTGTTTTTACCAAT AGTT

TTCCGTGGTATCTGCAAAGAGGATGGTTTCATGCTACTAGTTGCCTTCCCAATATTC TGA

TGCATTGGCGTCTTAACAGGCACATCCAAAGAAGATCTGCTCACAGGTTGGGTTATG CA

CCTTTGATGGAACTCGTGGCGTTAGGTTAGGCTTCAGACCCTTTCTTTCCTCGCCTT GG

CCAATCATTTGATATGGTAAATCCTATTATAAAATGTGTGCTGAGTGGATTTATGTC CTC

CACGTGTAGTATGGGCATTGAGAGTGTTGTGGATGAGAATGCTGGCAAATCTTCAGG A

CTGCATGATGCTATGTGCTCCGCTTGTGAAATGGCGGTTGTCTGGATGCAGAACCAA CT

TAGACAGAACCAGACCCAAGAACGCATCTTGAACTATGTGAATGAGGTAAATAGCAT CA

GTCACATGCTTTCTCTTCTCATCTTAAGTTAGATTACTGACCATCTTTAACAGCTTT GCG

AGCGACTACCAAGCCCAATGGGACAATCAGCTGTTGATTGTGGAAAGCTTTCTGGCA T

G CCTAGTGTTTCCTTCACAATTG GTGG CAGAAC ATTTG AC CTCTCTCCTG AG G AG GTAT

GTCTGATATCAATCTTGCGTAGTGTACATGGCGTCTTCTCATTTTGTAAATGGCTTT GAT

TTTTCTGAACAAAGTGATTGGTTGTAGAATCCTTTTGTCATGTTTCAGTTAGGCAGT TCA TTTCTTTGTGGTTTTCACTAGATTAGCTAGCAAGGTGTTACTCTGCTTTCAAGAGAAGTA

CACTTGTCTTTTAGAAAATTTCAACCATGACAGCTAAGTGTCGTTTGGATAATTAAT GAT

ATTGAAGCGTGTCGAGCTTTAATATCAGTTTCTTTGCTTGATAAGTTAACTTGTGAT CGG

ATAATTAATGTTATTGAAGTGTGTCGAGCTTTGATATCAGTTTCTTTGCTTGATAAG TTCA

TATGATTGTACTAAGCTTGCATGCTTTTCTTGTCACCAGTACATACTCAAGGTGGGC GA

GGGTCCTGCTGCACAATGTATTAGTGGCTTCATTGCCTTGGATGTTCCTCCACCCCG TG

GACCTCTCTGGTATGTTTTCTTTTCGTCTTAACGCACAAATGCGTGGATTCTGTTAT TAC

CAGCTCCCTTTTGATAATGTTGTTTGCTTATGGCTTTGGTGGTGCAGGATCTTGGGG GA

TGTTTTCATGGGTCGATATCACACCGTCTTTGATTCTGGAAAACTTAGAGTTGGATT TGC

AGAAGCAGCT

SEQ 48

ATGGTTGTTGCATTTGTGGGCATAGCCAAGTCTATCGGGCAACAATGCTTGAGGCGATC

AAAACCCTACTCTTACTCTTACTTCTCCAGCTATGTTCGTTCCTCAAATTCTAAGTA TGG

ACTCCAAAATTGGCAATTTCAGAGTCATAGAACTCTAATTTTACAATCGGCTTCTGA ATC

CGTCAAATTAGAAAGACTCTCCGATTCCGATTCCGGTAATTCCACTCACTTTGTCCT ATT

TTACG G CGTACTGTTACTATTTG G GG AATCAAACTTCTTTTAATTTTGG GTACAATTG CT

TTCTGGGGTTAATTAACAGGGATTTTGGAGGTTAAATTGGATAGGCCCGAAGCGAGA AA

TGCAATAGGGAAGGATATGTTAAGAGGATTACAGCAAGCGTTTGAAGCCGTGAGTAA TG

AACGTTCAGCAAATGTTTTGATGATCTGCAGTTCGGTTCCCAAAGTGTTTTGTGCTG GA

G CTGATTTG AAG GTATAACAGCTTCTCTTCATGTTGTGTTTTTAG G AAAAAATG AG G CAA

AAAAAAAACTTTTGAAATCTTGTGCAGAGTATAGGACACACTATTTGGTTAACAAAA AAT

GTGTAATCAAAGGGTTCGAAGTCAGATTAAATAACAAAAATTATGGGTAAATATTAA AAA

TTTTAAAACTTTTAAAAAATCATACTCGTTCAAATTATATTAAATTTTTTCATTTCT CCTCTT

TGCTTCTTCTTCTCCTTCTTTCTTCGTTCTCCTCCTTAATTTCTTATTTCTCTTCAA CTTTC

GTTGTTGTTGCTGCTGGTTCGTCCATGTCATCTTCTTCTTCTTCATTTTATTTTTCC ATCT

TCGTCTCATTTTCATCTTTCTTCTTTTTTTCATTTTTTCTTTTCTTTCTAGTGGTTT AAAATA

TACGAGAAAAAGAGAAAAATATGAAATTTTACAAAGTAAGTATTTTGCAAAATACCC TCG

AGATATTTTGAGACATACCCATAAATGAGTATTCTACTGAAACATCCCCGGGTTGTT GTT

TGAAACATCCTACGGGATATTTCTTCTACTTCTTTCTTTTTTTCTAGTGTTGTTCTC ATTG

ATCTTTTTTTCACAGATGTTTCAAACGTTCCCACAGATGTGTGCAGATGTTTGAAAC ATT

TCAATAAATGTTTGAAACATTTCTTTAGATATTTGAAACATCTTTCCTAAATGTTTG AGGA

TTAGGAGTGGCGGGGATAGAGAGAGGAGCGTTGCGAGATGGAGTAACAAGGGGAGGA

G GAG GAGTG GCG G GAG GAG GAG GAG GGTTTTTTTTTTTGTAAATAAG AAAACTTTG GG

GGTTTTAAAAATAGTGTCATTAACCCTAATCACAGAAGTGTCCCTTTTACCCCATTC TTA

ACACTTTTGTCTTAAAAAGTGATATAAGTTTTGAAGTGTCTTAAAAATTTAAATGCC CCGT

GTTTTTAGTTGTAATATGTTTTTTCTTGCAAGCAGTATCACAACTTGTAGATATTAC ATCT

TCCTTTTCGTTATGTATCCTCTTCTCTTCATTGTGCCAACCATTGTATTTGTGTTTT CAAG

CAAGAGACAGATCCAAGATATGAACTTTATGTGTTCGAGTTCTAAATTCTGCCTAGT CCA

TTTGATTTACGGGGTTTGAAATCTATATTTGTACAAGTTTTGGTGAGTTTTTTAACA CATA

TATGTTTCGTTGATTCTCTCTGTAATGTCTTGTTGTCTTCTAGTATATTTTTCTTTT GTATC

GCGTGACATGTTGACAAGAAAAACAATAAAATTTCACAAGCTAGGAGCATCTGCCAA AC

ATACAACTTTCAATGTTGAAAAGTTTCTCTGTGGTTTGATCTCTTCGCTTGTGCAGC CTC

TGTGG G ACTC AAACATCTTCACAG ATTCAATTTTCTTTTGTG GG G ACTG CATTTATG ATT

GGCATGGCATATGATTGAATAGCTGATGTTTTTGGTGTCAGATGTTGATTTACGCAC TTT

AGTAGTCTTTCTACTGTTTGTTATCTGTGCTGTATTTTGTACCTACTGATAGCTAAT TCAA

TTG ATTG G CATTTG CTACATAG G AACG AAAAACTATG ATTCTTTCTG AAGTCCAG G ATTT

TGTAAGCACTTTGAGATCAACTTTTTCCTTTTTGGAGGTACGTGATTTTTATTGATG TTTT

GTTTAATATATTAAAGATCATAGTGTCTTAAAGCTCAAGAAGAAGTTTTTTTTATCA ATTT ATGAAAAGCAGAAGTTATCAGTTTATAATCTTCTGAATTCTTCCTTCAAAATGATTGGTT C

ATGAGATCATACTATGTCTCGTTTCTTCTTCCTCACTTATTACTTTATCAACCATAA AGTG

GTCCAGTTGTACATATCCTCTTCTTTTCTTTCACTCTATTTGAGTAAATATTTTCTT GCTC

AGGGTCTTCATATTCCTACAATTGCTGCCATTGAAGGTATAGCATTGGGTGGGGGGC TT

GAAATGGCGATGTCTTGTGATATCCGTATATGTGGTACGTGCTTTCTTGCACTTCTG GG

TGTACCATATTTTCTCCTTCTTGCTCTCTAGTTTGATAATGTGTTAACAGGTGAAGA TGC

AGTGCTGGGCTTGCCAGAAACAGGACTTGCTGTAATACCAGGGTAGGTATGCCTTAA TT

ACGCATTATATGTTTGCTTATGCAAAATCCCAAAATTCTTTGAAGGATGTGTTAGCT ATG

TGTGGTTTATTTCTAAATTTATCAGATCAGTGGGACGCATTTCCACTATCTTTTTGT CACT

TTGTAATCTTTTACTATTCAAAGGTTTCCAACTTCAGAAGTTGCTATAAATACTCTG TATG

CATAGATGATCTTCTTAATGGTATCCTCTTATTTCATACTGGCATTGTGCAGAGCAG GAG

GAACACAACGGCTTCCTAGATTGGTTGGAAAATCAATTGCAAAAGATATAATATTTA CTG

GCCGAAAGATAAGTGGGAAAGATGCTGTATCAATAGGTACGTGTATGACTTGTCAGA GC

TCATTTGTCAAGAGACAGGACTCCTTTGTCTTTCCAAGTTCTCTCTTGTTAATATAA AGAT

AGCAGTGATGTCAGCACTTCATTACAAATTATGGGTTAACAGTGTCCTCCAAGGTTT AG

GCAGATAGAAAGAAATCATCTAATTTTTGCTTCTGCTGTAATTTTGGACCTTGATCT CCT

ATGGTTTTCTTTTCCAATTTCTTAGTGAATAATACATTGTATGCAGGGCTTGTCAAT TACT

GTGTTCCTGCTGGTGAGGCTCGCCTCAAGACACTTGAACTTGCTAGGGATATTAATC AG

AAGGTTAGACTTTAGTTATTGAGATAAAGAGGATGTGATGTATTTATCCAGTGTGCC ACC

CATATGACTTCCAATTGCAATTTAGTCACGAACAAAGAAGAAACATAAAAGAAGTCC AAC

TCTTCCTATAAACAAAATGATTTCAAACTGTACTGTACATAGATAATTGTAAAGATT CGTT

AGCAGTAACGTGTACTCTTTTGTACCCTTTTCACCTTTTATGAGTTATGCACCCTCT TTT

GTACCCTTTTCACCTTTTATGAGTTATGCACCCTCAAGGCCATGAAATATGCTTGTC ACT

GGATTTTCTTTTCTTTTGTGTGTGTTGAATGAAGTTGAGGCTCTTGTTTGAACTTTT TATT

GTCATCCATGGACCTTAATTTAATGGCATTTACTAATCCTATGCTTGTTTGTTTTCC ACTT

TGTGCACTGCACCTTCATTTTTTGTGACAAGCTTTGTTTCTTGCTTTTGGTCTTTTT CTGT

CTTGTTTTTTCTTAGGTGGAGGATCATTGACTATTGCATAGTTCCTTGCTTTGGTTT CTTT

GTTTTCCTTTTCCCCTTCTTTTTCGATTTTTAGCTATTTTATGGCAGGTTCACATAG AAAA

CAAGTGTTACTCTATTGTTTTTCTTTCCCTTTTTTCTCCATGCATCTTATAGAAACC GAAG

CTTAAGGTTTCTCCACGCAAGCTGCAACGTTCTGTTTTATAAGCTTCATATATTTCT GGT

TTTCATGTAGTATGAAATGATATTGAGTGGGATTATTAGGAAGCTGAGACAGATTGA AAA

GAACAAGTAAAAGCCACATTGGTGATTCCCTCATGCTTTCAACCTTAAAAGGTCATT TCA

ATGTCCAGGGTCCAGAAAAGGGACTCACTCTATTCATGTTCTATAAGAATGGAGTAA TC

CACTTGACAAGTTCTGGTGGTACTACTTCTTCATAAGGTTTTATTATACTAGTCAGG GCT

GTGTTGAAAGGATATGGTAGCATCAACTAAGTCCAATTGTTTCGTATTGTATAGAGC CTT

CCTTTTTCCTTTTCCTTTTCCCTTGTTGACTCTGCTTTTATACCTCCAAATGGAATG GTCG

TAAAGCTTTTGCTCTTTATTCAATAAGCTAAAACTTCTGATGAATAAATTTGGTTTA AGGA

TGCACGAGGAGTGAAAGTAAAATAAATATTGATGAAGGTTTTGCTAAAGATGCTCTT TTT

TATGCTCGGGTTTTGCATGTCAACTGACATATACCTTATCAATGTCGACTGACATAT ATT

CTCTGACAGGGTCCGGTGGCGTTAAGGATGGCAAAATGTGCTATTGACAAAGGAGTG G

AGCTAAATATGGAGTCAGCCTTAGCTTTAGAGTGGGATTGCTATGAACAACTGTTAG AC

ACAAAAGATCGGTTAGAAGGCCTTGCTGCATTTGCCGAGAGGAGAAAACCTAGGTAT AA

GGGTGAA

SEQ 49

CTAGC AG CAACCAG CTATAG GAACAAATGTGTCAG CTCG AAGTGACATTGGTTG G CAG CTAACATCCTCACAACTCTTGTGGTAAAGCATCTTTTGTACATAGTACATCAGATAGCAT TGTGAGGCCCTGACAACTTCTTCATCGACTTCAGTAATCCATGCATCGTCGCATTTGTA CCATTGGTTTCTCAAACGCAAATAAGTCACATAGTGACCTGATTCTAACATCCCTGAATG TGTGACCACAGCGAAAATTTCAAATTCCGTAGAAATATCTGATTCATCACCGTCGAATGA

AAAGATTCTGTTCCCGTATCTCTTTCGTACAATTGAAGATGATAAATATGGTTTCAT GTCT

AAAGAAAAAGGAAATTGCAGGTGGCGGTCAATCTTTCTGGACATTTTTCGGGTGGGA GA

ATGTTCAAAGCGTTTTATATGAAAAGATAGCACCAGCGGAAGCTTCTTGATGGACAT TT

GTTTCAATGCATCTTGCTTTTCCTGACAATTTTCACAATACAGTTTCTGATCAGATC CCAA

CTTTTCTG GTCGTGTG AAG AG GTC CAAG CAACCTACAAGAG ATTCATTTG GCTTACTCG

ACTTATTAGCAAAATCCTTTGGGCTGGAGTTGCAGCTATTCAAGTCAAGAGAAATGT CC

ATACAAGGATCATGAGTTGTTGAAGTGAATCCGCACGATGTACATGTGACATCAGAT CT

CAAGAGCCCATAGAAAGTCCTATGAGCAATACACTGGCAATCTCCATTATCTGTTTA AG

AAATGTTATTTTTATCAGAGAAAAAGAACTAGAGGAAGCCACAAATCTTCAGTTCTC AAG

GTTATATATGAATCATCCCTCTTCTGATCAATGCATTGACAGCAAATGGAAGTACAG GCC

AAAATTGATATCCCAGAAAGGAAATTGCGCATGTACATAGACAAGATCTGTACTATA TTT

ACTCACTGCGTAGCATAATGTCCTACATAAACAAGAAGAATACCAGCCATTCTCAGC AA

AAGATACCTTTGGTTGCCAAACTAGCTTTCCCCTCTTTATCATGGATCCTGTCCATA ACT

GAAATGAAGAACTCATGAGCATCCTGCTGCTCGTAGGTAGCAAGATTTTCTGAATGC TG

CCACCAACTGCAAAAATAAACAACATTTCAATTAAAAGAATCACTACATAAGTTTTC AACA

TGTCAACGTCCAATAAAAGTTAACATTTTCCACGTTTAAGCATCCAATCTAAATTAA ACAA

ATGATGATATCTTAGGTGGACATAGCAGCATAGAGAAAATTTCAACAGATTTATTTT ATC

ACGATAAGAGAATAGCAATCTTGCTTATTGTCTGATTTTAGCGGTAATGCACCAAAT CTT

GTTTATCCAAAAACTTCAAAGTGAAAACCTATGCAGTTAGCGCTAAAAATTGTAAAC ATT

ATTTACAATTTTGCAGGCCATGTTATAATCAAATATCCAGTATCAATACATTAAATG GTGA

G CACAG ATAAAAAG CAATTAAG AGATAATG ACAG GAGATAAATCCTTTTAACACTTAC AG

ACCTGTAAAGAAACCGAGCTGGACTATAAGGGGTCCGATCACCAGAAAAGACAGCTG A

GAAGATAAGGTCAATATCACAAGGCAGGCACAACCGATCCGACGACATCTTTCTGCA AA

TATCTCGGTTATGCCTATCGCTAAGGAAGTAATTTCTCAAAGGGGGTGCATGAAGTA AC

ACTTGCAACACAGAGTTCATGAAACAAGTATTCCCCAAATTGTTCAAACCCCTTAAT ACT

AAAGGAAAACATGATTTCGACTTCTGATCCCTCCTTAAAAACAACGTCTTCATATTC TTT

GAATCTAAATCCATCCCAAAACTCAACCTTCTCCTCTTGCTCAACCTCAACTCACTC TCC

ACAACCCCAATTTCTGTTCTAGGAAACCCCATTATGTGTTTACACATCACAACCTTA TCA

AAATCAGGATCATACACCTGATCACAACACACTGAGCAATAAAGCTCAGCCCTTTCC AT

GTCTACTGAAATCTCATGCCCAGCTTTACACTGACTATGCAAAAGGGCATGGTTTGA TT

CAGGTGACAAACAACATAACACTGATGAACAGATCAAACACATGTAAAATCTACCCT CAT

GTCCACTACAAATACTACATCTAGGTAGCTCTGATTTGGATATTTCTAATGTGGTCC TAC

CATATGGGGTTGTCTTAAAACATTCTTGGATCAAACTATACCCACTCATACCATTTT TCA

CCTTGTAATCTGCAAGATGCTTACAGGGCTTTGGATTTATATATAAAGAGTTACTTG AGC

ACAT

SEQ 50

ATGAAAGAACTTCATTCTCTAAGAGAGATCGAAGGGCCTGACCCGAATTATAAAGATAT

ATGCTTTTCTGGTGCTGGAAGGTAATAAATTAACTATAGTAATGTTAGATCATTAAC TTTT

TCTTTCCTTTATTTTTGGTGTTGTTTCTTGTATTGAATGTCTTGTATATTGCAGTGA CATC

TCAGAGCTCTCAAAATCATTTCCTCCTATCGACATGGTATTTAGCAATGGAAAGAAA CTA

TCTCTCACTCCTGAAAACTACTTATTCAGGGTAGGCATCTTAATCCACATGGTTTTT ATC

TTACTACCTTGCCTTTAGAATCTGTATCTCCTTTTGGCTTCATCTCTCCTAGTGGCT ACA

TTTTTTGTCTTTGTTTTGATAAGTGGCTCCATTTTCTCTCTGTGTTTCTATATTGAC TAATT

CTGCCCTTTTGCTTACTGTAACTGATTGCTATAAAGCACTCAAAGGTGCGTGGGGCT TA

CTGCTTGGGAATTTTTCAGAATGGGAAGGATCCAACTACTCTTCTTGGAGGTATTTG TC

ATATATATCTTTTAGAATCTTGGGAAAGTTCATCTGCCAAATTCTTCAGTTGTATAA GCTG

TAACATGCGTGCCTTTGCTTTTAATTGCAACAGGTATTGTTGTCCGCAACACTCTTG TAA CCTACGATCGTGAAAATGAAAGGATTGGTTTTTGGAAAACCAATTGTTCTGAGTTATGG

GACCGACTAAATTTATCTCCTTCACCTCCACCTCCACCATTGCCCTCAGGCTTGGAC AA

CACAAACTCCAGTGCAAATTTGACTCCAGCACTGGCACCTAGTTTACCTCTGGAGCA TG

CACCTGGTACGAAGAAACTGTTCTCCTATCTTTTTGTCACCATTAGTATGCCTTTCA GTC

ATGCTTTTATCCAGTTTTGTAGTGGAACTGGTTTTATTTCAATTATTCTACCGGAAG GGG

GGAGCCTTAGAGCAACGGTAATGTTGTCTCCGTCTGGCCTATATGTCATGGGTTCGA G

AAGTGGAAGCAGCCACTATTGCTTGCATTAGGGTAGGCTGTCTACATCATACTCACA CC

CCTTAGGGTACGGCCCTTCCCGGAACCACATCAATCCGAGATGCTTTGTGCATCGGG C

TGTCCATTATAATTCCGCCAGGCTGTTTTGCATCATTTCCCCCTAATATTTTTAATC CATT

TTGGTTTCTGATTTGCTATGCTGGTTTTTTGCTATATCGCCTAAGATTAGGTTAGCT TTG

ATGATTTCACATCCTTTCTTTGATTAAGGTCCATGAATGTTCCTGTGTCTCCAAATG TCA

GCTTTCAAAATGACATTTGAGCTTGCGTTTTGTATTGTTTCATCAAGTTTTGTATTC ATCT

ATCTCCTTAGCATTCCAGAGTTCCTGAGAAGCACTCGCTAGTAAAGACGTATTCTGA TG

TCATGACATTTTTAACCTTGTTGGAGTTTGGACCCAAACAAACTTTTTTTACAGAAG GAA

ACTATAATTTTAAGGAGTACAACAGTTGCTGTATATAGAACATGGTGAGTAACTCCA CTC

TTGAGATGCCTTCTCTTCACTGAAGTCAATTTCTAAAAACCTCCGTGCTTGACACAG ATT

TGTTGGTATAGATTTGTGCTCCAGAGATGCAGATGGGTCCAGCGAATTTTTCACTAG AA

TTTTTCTTTTTTTTCTCACCACCTGTCCAGCATAGTGCTGTCAAAAGTGACAAGCTT AGA

AAAAAACCATGTGCTTGGTGGGGCTTTAACTGCAACATGCTATAAAAACGTTCACTA TAA

ATGTAATGTAAATAAATAACCATAAACACAAAATAGCAATTGTTGGAAAAATTGCAATTTA

GTGAAATACACGAGGTGTCAATCAAGTTCAGATCATATTGTAAGTCTTGATTCAAGG TG

CATTTTTAAATTAGTAATTTGGATCAATGATTAGTTTTCTTAGACTATAACTTTCAA TTTTC

ATACTCGTAAGAACTGATATACTCATATATAATATAACGTTTTTCTTAATAACTAAT AAAT

GCTTTCCTAACTATATTTATTTTTGTGCTATCCTATTAACAACAGAGCCTGTGGATG TTAG

GCACCCACTTTAAGGCCTTTTTCCTCGCACTGAAGCCCTACTTTTAAGGTTTACTGT CAC

GACCCAAAATTCCACCTTAAGGATCGTGATTTCACCTAGTCTCTAAAACTAGGTAAG TC

GATCACTTACAACAGTTAAACCATTAAAACATGATATTATGAAGCGGAGTTTAATAT AAAT

GCGAAAATAAAGGTGATACAAGCCAACACGGCGTTAATCACAACAAATCCCCAAGAC TA

GGTAATACAGAGTCACGAACTCTAACTGAATACATAGAAATATTTCAAAACAAAGAT ACA

ATACTGTTCTGGCAGATAATTGACAGTATAAAGATAAGGAAAGACTACAAGGGACTTCG

ACGATCAAGCAGCTCTACCTTGAATCCTCGTGATCAAAAAGCTAACTCTGCCTAGGT CC

TATGCCTCCAACACCTTGATTTGCACAAAATGTGCAGAAGTGTAGTTTGAATACACC ATG

GTTGGTACCCAGTAAGTATCAAGGCTAACCTCGATGGAATATTGGCGAGGTTCAAGT AA

AGACACTCACTAGTCAAATAACCTGTGAAAAATATCAAAAATGGGCAAATGGAATAA TAA

CATAAAGTCATAACTGTAATCTCTTCAAATTAAACGATACCTATTTAGAATAATTAA AGGT

CCCGTTCTGACAATAAGCCATCAAATAGAATCACGCACACCCGGCACCTCGTACCCA CA

TTAACAATCACCCTCGCACGGCAAAGGCCTCGTGCCACAACATAAGATATACCTCGC AC

GACGAAGAGCTTGTGCCACAATATAAGTCACAACCGCATGGATAACTCATATGCCAA TA

TCACAATCCGCCTGGCGTGGTCACATGCTCAATATCACAATTCGCCCGGCGTGGTCA C

ATGCTCAATATCACAATCCACCCGGTGTGGTCACATGCTCAATATCCCAATCCGGCT GG

CATGGTCACCGGCTACCTGTCCAAATGTACATGATCAATGGACATCAAGTTTCATAC TC

CTGGACTGATATTAATGACATGTTATGGTATATGCATGTGCAAGTGTATTATCACAG CTT

AAATCATCTAAGTAATATCAGAGACACCAAGTGGCACATTAGGAACAACACAACAAA TC

ACGTAATATGTATGACACACACAAGGAAGTCAAAAGCAACAACCAGAATACTCCTCT TT

CATCAACAACATGCCCCCAGGCCATCACATAACATCCCCTTATTGCCACCCTTATGT CA

CCACGTTGACAATATCATAATAGCCACCCGTATCGCTCCGCCTAGGCAGTATATCAA TA

GCCACCCGTGTCACTGCGCGCAGACAATATATCAATAGCCATCCGTGTCACTCCGCA C

AATCAACAACAGTGAATTGTCATCCTTGTGCTCCGGATAACAACAATCGATCCACAC AT

GTCCACATATGCCACAATATCACAGGATAGTAGTATTAGAGATTTATCACGATACAA GCT

CACCACTCATCAACAAAGTGCACAAGGACATATCATTAATATAGAATTGCTGAGGGG TA

TTCAACATTTAAGCATGAAAGCTACTCAAATTAACAAGAGTCTCACAAGCGCCCAAC TTG GCCAAATAAGGAATTAAGATCCTAAAACATGATTTGTACATGGAATATAAATAACTTAAT

GTCAAAAATAACTTGATGTCATAAATAAAAGCCATAGGAAACGATTCTGAATAATAA AGC

TTCTATCTTGAACAAGAATAAAAAGTAATCCCAAAAAGTCAACCCCGGGCCCACACC GT

GGAATCCGACAAAACTCACAAATTCCGAACACCCGTTCAAATACGAGTCCAACCATA CC

AAAATCATCCAATTCCGGCCTCAAATCGGCCTTCAAATCATCAATTTATGTTTTAAA AAA

GTTTTTACTATGATCTCCAATTTCTCCCATTCAAATCATCAATCAAACACTAAAATT GAGA

TTGGAATCATGAGAATAAACAAATCCGAGTAAAAAATACTTACCCCAATCCAAATCG TGG

AAATTCCCCCAAAATCGCCCAAATCCGAGCTCTATAACTCAAAATGTGATAAAATAA CCA

AAACCTTTGAAATAGAGTACTTATAGATCTGCTCCAGGTAAACCCTTCTCAATTGCA GGA

CCAGCTTCGCAATCGCAAAGCACAAACTTAACTGACCACAGAAATACCCTTCGCGTT CG

CGGTACATACCTCGCGAACGCGATGCATGGCTGAGCCAGACCTACGCGAACGCGGCG

TAGACCACGTGACCGCGAAGACAATACCACCAGCTCCCAGTTCTTCATCGCGAACGC G

TCATTGCCATCGCGAACGCATTGACCAAGCCCCACAAAGCTACGGGAACGCGACCCT C

CAGTTGTGAATGCGAAGAGGAAAAACACTCAGCTCCAATCATACACTGCGCGATCGC G

GTTAGCCCCTTGCGATCGCTAAGAACGTCAGCAACAACAGAAAACCAGCAACACAAC AT

GAAGGAAAATGGTCCGAAATCACCCCGAAACTCACCCGAGCCCCTCGGGGCCCCGTT

CGAACATACCAACAAGTCCCAAAACATAGACAAACCTACTCGAGGTCCCAAATGACA CC

AAACAACATCAAAACTACGAATCACACCGCAAATTCAAGCCTAATGAACTAATGAAC TTT

CAATTTCCAAAACTCATGCCGAACCATACCAAATCAACTCAGAATGATCTCAAATTT TGC

ATGCAAGTCCCAGATGACATAAACGGACCTATACCAACTCTCAGAACCGCAATCCGA AC

CCGATATCAACAAAGTCAACTCTCGGTCAAATCTATCAACCTTCCAAACCTTCAACT ATC

CAACTTTTGCCGGTTCAAGCCAAAACAACCTAGGAGACTCCAAATCCACATCCGGAC AC

ACGCTAAATCCAAAATCACCATCCAGACCTAACAGAACCATCCAAACTCTGATCCGA GA

TCAAATACGCAAAAGTCAAACTTGGTCAACTCTTCCAATTTAAAGCTTCTAAAATGA GAA

TTATTCTTCCAAATCAATCCCGAAATGCTCGAAAACCGAAACCGACCATACACGCAA GT

TGTAATACATCATATGAAGCTACTCACGACCTCGAACCACCGAACAGAAATGCAAAT GA

TCAAAACGACCGATCGGGTCGTTACATTTATGTATGCTTCAAATGAGCATTCAGTGACA

CTGTTCAGCAAAAGGAGAAACTCTACTAGCCACTTGTAGCCACCTCCAGGGACCCTC TC

TGTCTCGGCCATGGATTACTTTGAGGAGTAATAGGGCTTCTCCAAGCGGAAACATTC CA

CGCATGCTGTGATCCTCCATGTTTTCTCTGCTAATCTTTGCTACTTTTTCTGGCGGT CCA

ACTGGTTGATCAATTCTACTAACACCATCCGAATGAGCCCTCAGGAATTCATCTCCC TA

CTTCTTTGCCAGATACAAGGAAGGCGTTGTCCTCATAGGGTTGGCCTTGGCATAACT CT

CTGATAATTTGACACGTTGCTTTAATTTGGTAACAATTTACAATTTTGGGGGTGCTGCAC

TTGATTCCATTGAGTTACACCATTTCTCATATTTAGGAATGGTCCTGTGCAAGATTG AAA

TGTACTGAACTCAGTTTTTCCCTGTGCAGATATTTATGATTGTTATTATTATTTCAT CTTT

GACCTGAATTGGCAGGGAAAATCAAAATTGGACTCGTATCATTTGATATGTCACTGAGT

GTTGATTACTCAGCATTGAAGCCTCGTGTTCCAGAGCTTGCCCATTTTATTGCGCAA GA

GTTGGAGGTTAACGTCTCACAGGTAGTTTTTGCATGACCCAAAGTTGTGTCAGTCTG AT

GTAATCTAAAACTGTATATCCCATTTTCTTTAAGTTACTTAACTGTATTTTAATTTT GTTCA

ATATGATATGTCACTTATTGGAAGATACCTTGCAGGTTCACTTAATGAACTTTTCGA CAG

AAGGAAATGATTCCCTCATTAGATGGGCCATCTTTCCTGCAGGATCTGCAAACTACA TG

CCAAATGCCACTGCAACAGTAACTCTAAACATCTAGAATATGTGAGGACTATTTCTT GAT

TGAAGAACCCTTTATTCATCATTTACCTATTTGCAGGAAATAATAAACCGGTTGGCT GAG

AATCGTTTTCATCTTCCTGATACATTTGGAAGTTATAAATTAGTCAAATGGGACATT GAA

CCCCCACCAAAGAGGTATAAAAGCTATCTCCATTCTTTGCATGTTCATAAAATATTG AGT

TCTGCTGTACAAACTTTTAGCATCATAGCATTACTTATAAAATTATTCTGAATTGTC AAAA

CAAATGTGCCTTTTCTTTTCAAAATGCAAAATAAATCTCCGCATTGCATTTCAGATGGGA

AAAACATGACACGCATCTTTTCATCTTGCCTTAAACACATGTTTGTAAGTTACATTC TAAA

TTAGGAAACGTGAATGAGTCTACATTGCATCGCACCAGTTCGACTGCATATTCCAAG GA

TAATGATGAATAGGTGATGACTTTCGTCTCCATTTTTCATTGTTTCAATTTTTCTCA AAGT

TTCTTACTTG GATTGGTG G ATAAAGTGG CAAAGC CAG AATTTTTTTATAAG G GATTCG AA AATACTAGAATGTCATAATTGAGATCTGAACTTGTGACTTGAAAGCAACTTTTGAATCCT

CTTTG CTACTAAACTAAAAAATTTCCCCTATG G CAAG GAG ATTCAATAG CTTATATATAA

CCAAAAAACTTCATTTTTACCCTATTCGCATACTATAATTTGAAATGTTTTTGGTCA AAGT

TTAATTTG CTG CATCTCAAAATCTTAATAG CAAAATATTAC CTTAATTAACTCTAATGTAA

AGAGATTGGATAACACACCACAACAATATTTTTGGTAGGTGAATATTACTTTAATTT TTTT

GACAATTAATTGAGATAGGAGTTCTTGATATTTTTTTTTTGGTTTTGGTAACTATCA AGTT

GTTGGTTTGATATGGTTTCATTGCAACGATTTAGGATACGATGGCAGCAAAATTACC TTG

TTGTAGTGTTTGCGCTACTAGTTGTCCTGATAATTGGATTATCAGCTTCTCTGGGAT GGT

TAATTTGGAGACGAAGGCAAGAAATCCCATATAATCCTGTTGGAAGCGCTGAAACAC AT

GAAAAAGAACTCCAGCCGCTAAAT

SEQ 51

ATGGTCACAGGTCTGAACTTCCGCCATAACTTTCTCGTGTCAGCTTTACTATACTCTAAA

TTTGAACTATAATACCACATTGATGTGAAAATTCACACTTAGGTATCGATTTTTTAA CACA

GAGATTTATTTTGTGTTCATGCTTTGGTTTCAAGTATTGGAGAACCTCGTAATCGTT CTC

TATAAGCTTCTGGTTTAACAGATCCTAATTTTTCTTAGAAGCTCGAATTATTTTGTA TTGG

AATGAAATGAACCTGAATATTGTGGACGATACAGAGGAATTATTGTGGTATAGTTGA TTG

ATTGATTGATAGTCTTAAGTAAGAAAAAGACCTATTGGAGATTATGGTGAAGCTTAT ACA

G GAG CAG CTG G ACTTG GTTTTCACAATATCTTTTTTGTTAAGGTTAGAATAAACCTG CTA

AAATTTTTTACTTATCAAAATAAATAAATAAACTTGCTAGAATTTTTTTCAAGTTGG TGATT

GTTTAAGTTTTTTCGATTGTTTTTTCCTTTGGTAAAAACGTTTTTGGCAAGAACTAT ATTTT

GAAGTTGTGGTTTGAGAGTGTTTGTCAAATAATCTTTTCAAACAAACTCTCTTTTCA AATA

TCCGAACATCTTCAACTTCCACGAAATAGGTGACAACTGGATTAAATTTGGGGGGGG G

GGGGAGTGGTTGATGGTGTATTAAGTTCAAACATCATTTATCTTTTTCTCTAGGAGC AGA

TTTTTAAATCATTATAGATACATTGCTTGATGTATGTTTGAGAAATACCATTGGTGT TTCA

TTTAGGCATCATCACTGTAATAGTTTGGTTAATGTTTTGTTAATTCATCATGGTGGT TCAT

TCAGACAGCATCATTCGGCTATGATGTTGATGTTGATTTGGTTACACAAGCAACTCA TTG

GTGGAAACAGTTTTCTAGGATACCCCTTTTATCATTTTCATTTGATTGTACCTCCTG TTTA

TTTTTG CACTTG G AC AATTACG G GCTACAATTCTCTCCTTGC AAATCTG GTGATTG GTTG

CAGTAAGTGTAAAGTG G CAAAAAGAAGTCTATCCTG CTGTG G AAATTG ACACTAG CCAG

CCTCCATATGTTTTCAAAGCCCAGCTGTATGATCTAACAGGGGTACCACCTGAAAGG CA

AAAGATAATGGTCAAAGGTGGTTTGCTTAAGGTATAAAATTTCGTTTCACTTAGCTT GTT

ATGCCATTTTTCACTTTGCAAATAAAGCACAAAACTCATTTGTGTTTTGAGGAACGC TGA

AATTCTCAGCATCTGATGCTTTGCTTGTTAATTTTGTTGTTAACTCTTCGGTTATTT CTAT

GGTTGTTTAGGACGATGCCGACTGGTCGAAAGTAGGAGTAAAAGAGGTACACGGCTA C

TCATTGAATTACTCTCTATTTTTATGCAATGAAGTGCCAATTATCTAGAAGCATCTG TTAT

TTATTATTTCCAGGGTCAAAGGCTGATGATGATGGGAACTGCAGATGAGATTGTGAA GG

CCCCCGAGAAGGGTCCTGTTTTTGCTGAAGATTTACCTGAAGAAGAGCAAGTGGTTA AT

GTAGTAAGTTTTTTGACACTGATGTTGTTGCATCAAATCGAATGATCCGGAGATGTG TGA

TTCCTTATGTTTAACTGCTTACATAGTTAGTCTTGTCTCATATGCTGTACTTATACC AGCA

CTG G ATCC CTAGTAGATTTATTG GTATAACTTTACCG CAATTG CTTTGTTCATTTTTTTTC

AAAAGCAGTTGCCTTTTCCAACTTCTACATGCAAATAAGCTTTAATATATAATTCTC TATT

CTTTTTCCGCTGGCACAAGTGATTTTGTGGATGCCAAGCGCTTGTCGAATGCGTTTC TT

GTTCCGCTGGCACAAGTGATTTTGTGGATTCCAAGCGCTTGTTGAATGCGTAATTTC TT

CATTTACACATTATGAATCGGCCCTTCCCAAGACCCCGCACATAGCAGGAGCTTAGTGC

ACTGGGCTGCCTTTTTACACATTGTGAATCAGATTACTATGTTGTTTTAGAGTCCTG TCT

AAAAG AACTG CTAACTTTTATAATG G CAAG GCTTAGTTTTGTACTTTTAATCAGTAAATG

GGTGATGAGAATTTTTATAATTTTGTTTCCTCCAGGGTCATTCTGCTGGATTATTTA ATCT

CG GAAATACATG CTACATG AACTC CACAGTACAGTG CTTG CATT CAG TTCCAG AACTG A AGTCTGCTCTAACAGAGTGAGCATTTGCTTCTTTCATCCTTTCCTTCATTTTTGGGAGTC

TTTTG GTTTAG GCTTTTTTTTG GTCCTTTTG CTTTAG CCTTG ATTTC CCAAAACTTG ATC A

AATTCAATATGGTTGCTTTTAAGTCTAGTTCTGAAAATATTTAGGTCTATTTTGATT GCGT

TGCACTTTTTTGGTTAGGCAATTCGATCTATTTGCACCTAATCCGTAATTCCTGTTT TTGC

TAAAATTGATAATTCTGATTTTACTGTTTATATTTGTCAAATCTATTAATCATAATT TAACT

TATATAATGTGTCGCGTTGTATACACCTAGATAGTATGTATTTACGGAGACAAAGCG GA

GAAAACAGTAATTAGTAGAGGAGACATAAATTATCCTGTTTTAATTCCTATATATCC TCC

CTTATATAAATATGGACTCGTTTCTCGGCATGTTCTCCTTTGGATGAAATCAATCCA AAA

TGTAATCCACTTTGAATCAATTTGGACTCCGAAACTGTGGATCTTTCCCGAACATTA TCA

GAAAAAAGATCAAAATGGCTCCTGTTAAATACCAAGTGTAGGAGTTCCAAAAACAAC TC

CGTTAGGTACATTTCTTTTTGTGTTCCTGAGATTCTGAGTTTATTTATTCTTCCTGT TAGG

TATAACCAGCTTGGTAGAAGCAATGATTTGGATCACTCATCTCATCTCTTGACAGTT GCA

ACAAGAGATCTGTTTAATGACCTGGATAAAAATGTCAAACCAGTGGCACCAATGCAA TT

CTGGACGGTACTTTATTTGTCTTTATTCTACTCCTAATATTTTTGGTTACGACTTAG TATT

CCTGACTTTGTATTCTTAGAAAATGTGTTTGGATTTCGAACAAAGTTACCATACCTT TGA

AGGAGAATACGTATGCTGAGTAGGAGATAGTGTTTGCCAATAATTTCCTATTGGCAG AC

TTCAAAATATACTCGTTTAGCGTTGAACACTGAACTCGATATATTTTGTCATGACTT TTGT

GTGCAAATGGATTTGCTCTTAGAGAGCAAGGATGAAGTACTTTGTATAGGATGCGAA TA

GAATGACTTAAGCTTGGGCCTGTTGTCTACATGGTCAAAACTTTGTGCATTATCATT CTT

GCACAAGGTTACTTGGATTTATATGAGAACTATAAATGTAGCCTATGTTGATATGTT TGT

CTTTTTAGTGTTTTCGTCACATGAGCTACTCGGGCATCAACATTTGATTAGGTTTAT GTT

CACAGGTTTTGCGGAAGAAATATCCTCAATTTGGCCAGCAGAGCAATGGAGCTTTCA TG

CAACAGGTTCCAAGCTTACCTAAGCTACCACAATGCTTCCTTGTTATTAAAAAAAAA AAA

GTGTACCACTATTGCAATTGCTATATAGAGGTCCTACTGACATGTCCTGGATAATAA CAG

GATGCTGAAGAATGTTGGACGCAACTACTTTACACCCTTTCTCAGTCTCTTAAATCA CCG

AACTCTAGGTACTACATCTCCTCTCGGGATATTTCTTGCAGATGAAAGTCCCTTTTC TAA

ATAATTTCCATGTTTTGTTTCGCTAGTTGTTTTCTTTGTTTCAGTTTGGACATATGG TCCC

TATTTTGTAAAAATGTGAAGGAAAACTCTCCTATATATACATTGTGCTTCTTTATGT CATG

ATTGTGACTGCTCTTTATCTCGGTACTTGCAGTGGAAGTCCGGATATTGTGAAGGCT CT

CTTCG GTATTG AGTTTG ACAACAG GTATTTCTG CAGTCAAATGTTGTTTACCTTC CAGTT

ATTCTGTTACCTTATCCCCTTTGCATAGAGTTGTTCTGCACCTAAATATTATAAGAG GCA

TGTGAACTTACTGCTGTATATGTATTGAGATAGGAAGGAATGCAGCTAGTGGTCCTA GG

ATGTAGGATGTTCCCTGTTCTGACTTTGAGTATCTTCTGGGCAACCTGATGAGAATC AA

CATCCTCAACTTTTACTCTGTCATATTGTGAATCATGTAGTTGACAATAAGAGATGA ATTA

CTGAAGTTGTTTTGAAAGTTGAAGCTAAAAATCATGTTTATGTTGACTTCTTTTAGT TTCT

CCTACTGTTAGTTAAGTGTACTATAGTGCTACTAGTGTGTATTTGTATTACTGCTAA TGA

AAGGTCTGGCTGGTTTATGCCGTTCTAGGGTATTTTCAACAGGCTGTGCTTTTCTGC TG

TAGCCTAGACCTCTGGGCTAATATTTCTTGTCTAGGACCTGACCTACTGCAATGAGG TT

GGGAAGATCCCAATGCCCATCCCAGAGTTCTCATGTGCTCAAATTCATCTACTGATA TA

CAAAATTTAATTTTCGATAGTGTGGGAAGCTGTTTATCATTCATGTCTGACTTGATC TATA

CTGTTCTGACTGGATTATGGTGGTTGTGCTAGAGTCCACTCTGTGACTATGTTCCAT AT

CTGATATTAACTGCTAGATACTGAAGAAAAATGACTTAACTCTGCCCCTTATTCTCA TGG

TACTGATATGGACCAGGATTCATTGTGCTGAAAGTGGTGAAGAAAGCACAGAAACAG AA

ACTGTATATTC CCTTAAATG CCACATTTCACAG GAAGTG AACCATTTGCATG AG G GTTTG

AAACGTGTAAGTTCGGTTCTTTTCCTCCTTGTATGTCCCAACTTCTAACTTTAGTCT TGTT

TCCTCCCAAATGTTTCATATTACTGCTAAGTTCTGTCTCAATTTTTTCTGTTGTGCC AATC

CAGTAATCATCCAATTTGATTAAGAGGACAGTCCCAAAGTGAAAAATGACGTATCTA ATT

CATAGAAATTCCTTTGGTAGCTGTAACCTTTAAGGATAACTAACAGTTAGTCCTGAA ATG

GTTGGTTGGATGGAGAAACTATTATATAAGATGCCTCACGGGCGGCACATTGGGGGG G

GGGGGGGCTTTTNGGGGGGGGTCTTTTGCTTTAGAATTTTTTCATGCCATGATTGGA CC

AAATGTG G GG CCTG CATGTTG AG GTTCATG CAATAGTTTCACATCAG AG GTAG G CTGA GCCATTGGAGCCGACTCATAATGCTTTTGGTGGGAAAAGATGCTTTGGAGATTTTGTTT

AACTTGGTGCAAAATCGTTGATGATGTTTATACTGGAAATCTGGTATTTATCCCCTC AAA

ATAATTTAAATG CACTCATATG G CCATTG CTTTTTCTG AC AG GGTCTG AAATCAG AACTG

GAGAAGGCGTCTCCGTCACTTGGACGGAGTGCAGTTTATGTGAAAGACTCCCGAATC A

ATGGCTTGCCAAGGTATTAACTGGCTCGATTAAATTCCATGGCGATGTAGCGACATA TG

TATGATCCGTAGCTTCTGCATATAGACTATCTTAATCCACGCCTTTCACATAACAAA AAT

GCCTTTTGATGTGTTGAAGTAGTTCACCTCATTTTTGGCATCACTTCTTTCTCATTC TCCT

TTCTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN N

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNN N N G GG G CTTTTAATTCTCACACTTTTCTGG ATAGTTT

CTTTACTATTTGGTGCTGATTATTGACTGGTCTATGTCAAAATCTTCTTCTATTTTT ACTA

TTTCTTTTTCATTTGATGAATAGATACTTGACCATTCAGTTCGTCCGGTTTTTCTGG AAGA

GGGAATCAAATCAAAAGGCAAAGATTTTGCGGGTATGTTGACTTACTCATCTTCCTT TTC

ACATTGATCAAATAGTTGGTCCTCCTTGAAAATGTGTCGGAAGAGGGAAAAAGGATC AG

GTTACCATGTCCTGAACCGAGAAAGGGGGATGGAGAAAGGGAGTTCACTGTTTTACT TT

GTTTTGTGAGGATGAGCGGTTCTGTCATGTTTAGGTGCTCTAAAGTCTCGCCTATTC TC

TCGATTGTCAAATTCTGAACTGTATATTTTAACTTTAGGCTGGTTTGTGATGGTGTC GGT

GTAGGAGTCTTGGTCTATAATCCCTTCCCAATAAAAAATTAACTGAAACTTCTCTTA AGT

TTCTAATTG ATCTTGG AGTAG GTGTAACTTG G G CACAATTACAAAG AG G AGTTCTTATGC

ACCAATGACTAGATTTCAGCTACTAGAAGTAAGAAGTAAGAGAAGGATGTGATATTA ATA

CTTTGCTATTCTAGTGAGATAGTATGTACATAATATTTTTTTGAAAGAGAGTTCTGG TGA

GAAGCTGATATGTTTTTTTTTCTTTTTTCCTTTGTGTTGATACAAGATCTTCTTAAC AATCA

AAAATATCTGAAAAGCTTTTTCCTGGATTCGGCCATTCGGGATAATAACCCACCCAT TGC

CATAAGTTCTGGGATCTATAAAGTAGTTGTACTGGGTGTTTCAGATATCTTTGTGTT TGT

GAGAATGCACAGGAGAAATCTATTCTATTGATTAAAGTGTTGTACAACCCTATTTAT ATA

CAGTAATTACATAATAATAGGTATCTACTTCCCGATGTGGGACACTATAATACGAGA ACC

AGTAAGAGACTTAGTGAAAATATCTGCTATGCTAGCTAATCATTCTACTTTACAAAC TTT

GTAACAATATCTCCTGAGAGTATCTTTTCTCTGCCAAAGTGACAGTTGATCTCAATG TGT

TTAGTCCTCTCATGGAACACCAGATATGACGCAATATGAAGAGCAACTTGATTATCA CA

CGCCAGTTCCATCGTGCTGATTTCTCCGAACTTCAACTCCTTAAGCAACCGCTTGAT CC

AAACTAGCTCACACGTTGCCACAGCCATGGCCCGATATTCGGCTTCGGCGCTAGATC A

AGCAACTACATTTTGTTTCTTGCTCTTCCAAGACACCAAATTACTTCCTACTAGAAC GCA

ATATCCAGACGTAGAACGTCTATCAGAAGGTGATCCTGCCCAATCAGCATCTGTGTA CC

CAACAATCTGCTCGTGGCCTTGATCCTCGAATAGTAACCCTTTGCCTAGAGCTGACT TT

ATATACCGAAGAATGCGAACAGCTGCAATCCATAACTGACTTACAACACTCACCGGA AA

AGAAATGTCAGGTCTACTCACAGTGAGGTAATTCAATTTGCCAACCAACCTCCTATA TCT

TGTAGGATCTCTAAGAGGCTCCCCCTGTCCAGGCAGAAGCTTAGCATTCGGATCCAT A

GGAGTGTCAACCGGTCTACAGCCCATCATTCCCGTCTCCTCAAGAATGTCTTAAGGC AT

ACTTTCGTTGTGAAATAACAATACCTGAGCTAGACTGAGCGACAGTCTCGAAGTGCT GA

AAGAGTTGCTGCTTCAGATTAGTAATACCATCCTGATCATTGCCAGTAATAACAATA TCA

TCAACATAAACCACCAGATAATACACAGATTAGGAGCAGAATGCCGATAAAACACAAAG

TGATCAGGTTCACTACGAGTCATGCCGAACTCCTGAATAATTGTGCTGAACTTACCA AA

CCAAGTTC GAG GTG ATTGTTTCAAAC CATATAGTG ACCTG CG CAATCG G CATACAAG AT

CACTAGACTCCCCCTAGCAATAAAACCAGGTGGTTGCTCCATATAAACTTCTTCCTC AA

GATCACCATGGAGGAAAGCATTCTTAATGTCTAACTGATAAAGAGGCCAATGACGTA CA

ACAACCATGGACAAGAAGAGACGAGCAGACTTTAGCCGCGGGAGAGAAAGTGTCACT A

TAATCAAGCCCAAAAATCTGAGTGTATCCTTTTGCAACAAGACGAGCCTTAAGACGA TC

AACTTGGCCATCCGGGCCGACTTTGACTGCATAAACTCAACGACAACCAACAATAGA CT

TACCTGAAGGAAGAGGAACAAGCTCCCAAGTGCCACTCGCATGTAAAGCAGACATCT C

CTCAATCATAGCATGTCGCCATCCTGGATGAGATAGTGCCTCACCTGTAGACTTGGG AT AGAGACAGTTGACAAAGATGATATAAAAGCATAATGAGGTGATGACAGACGATGATAAC

TTAAACCGACATAATGAGGATTAGGATTAAGAGTGGATCGCGCACCTTTCCAAAGTG CA

ATCGGTTTACTAGGAAGAGGCAAGTCCGCAGTAGGAGCAGGATTAGGTGCAAAACGT G

AATCAGCTGGGCCTGATGCTGGCTGCGGACGACGATGATATGTCAAGAGTGGTGTTC C

TGTGGCGGGGAATCTAGGAGGAGTCTGTGGCGAAAGGTGAAGGAGGAGCTATAGTAA

G CTCCTTAAAG GTC GATACAG GTAAG ACCTC AG ATATATC AAGGTG GTCAG AAGAG GTA

AAGAAAGGTTTAGACTCAAAAAATATGACGTCAGATGATATAAAGTACTTATGAAGA TCA

GGTGAGTAACAACGATATTCCTTCTGAACACGAGAATAACCAAGGAAGACACACTTG AG

AGCACGAGGAGCTAACTTATCTTTCCCAGGGGCTAAGTTATGAACGAAGCAAATGCT CC

CTAAAACACGAGGAGGAACAGAGTATAAGGGTGATTGGGGAACAATACTGCATACGGA

ATCTGATTCTGGATGGGAGATGAAGGCATCCGTTTAACCAAATAACAAGCTGTGAGA AC

TG CATCAG GTG GTCAG AAGAGGTAAAGAAAG GTTTAG ACTAAAAAAATGTG ATGTCAGA

TGGCATAAAGTACTTATGAAGATCAGGTGAGTAACAACGATATCCCTTCTGAACACG AG

AATAACCAAGGAAGACACACTTGAGAGCACGAGGAGCTAATTTATCTTTTCCAAGGGCT

AAGTTATG AACG AAG CAAGTG CTACCAAAAACACG AG AG G AACAG AGTAGAAG G GTG A

CTGGGAAATAGTACTGCATACGGAATGTGATTCTGGATGGGAGATGAAGGCATCCGA T

TAACCAAATAACAATCTGTGAGAACTGCATCGCCCCAAAAACGCAACAGAACATGAG AT

TCAATGAGAAGTGTGCGAGCAATCTCAATGATGTGCCTATTCTTTCTCTCTGCAACC CTA

TTTTACTGAGGGGTATAAGAACAAGAGGTCTGATGAATAATTCCTTGAGAAGTCATA AAC

TGCTGAAATTGAGAGGATAAATATTCTAAGGCATTATCACTGCGAAAAGTGCGAATG GA

AACACCAAATTGATTTTTAATTTCAGCACAAAAATTCTGGAATATAGAAAACAACTC AGAA

CGATCTTTCATTAAGAAAATCCAAGTAATCTTGAATGATCATCAATGAGACTAACAA AAT

AACGAAATCCCAAGGTTGAACTGGCTCTACTAGGACCCCATATATCAGAATGAACTA AA

GAAAAAACAGACTCTGCATGACTCTCAATACTACGAGAAAAGGTTTGGGAATGTTTT CC

GAGCTGACATGACTCACACTATAATCTAGATAAACTAGACAAACTAGGCACCATCCT CT

GAAGCTTGGATAAGCTTGGATGTCCTAAACTTATGTGAATTAGGTCCGGAGGATCTG TA

G CTAG ACATGTCTTG G AG GAATTGAGTGAGTTAAGGTAGTAAAG GC CTTCTGATTCAAG

TCATGTTCCAATCGTCTGTCCCGTACTACGGTCCTGCATAATAAAAGAATCATCAAT AAA

ATATATACCACAATG G AG GG CACGAGTCAAATG ACTAG CAGATG CAAAG G ACAG CCAG

G GACATAAAG AACAGAATCTAG AGTG AC AG AG G GTG G GG G ATTTG CTTGTCCAACTCC

TTTTGCTTTAGTTTGAGACCCATTGGCTAAAATAATAGTGGGAAGAGACTGTGAATA TGC

AATATTTGACAAAAGTGATTTATTACCAGAGATATGATCAGAAGCTGCTGAGTCCAC AAC

CCATTATCCAAGAGTACTAGACTGGGAAACACAAGCAAAAGAATTATCAACAACAGA AG

TATCAGTCTGAGCAATAGAGGCTACTTGTGGAGATGTCTGCTTGCTCGATACGGAGG G

AACTCATTATATTCCCCTTCAAATAAAGAAAATCCCTGGTTACCTGTAGTCTCGGTC TGA

GCAACATAAGCATTTTTGAGTGGACGACCTTGTAAAGAATAGCACACGTCACGAGTG TG

TCCAAGTTTATGACAATAAGAGCAATTGGGCCTAGATCTTCCAAAACGACCACCTCC TC

GTCTATTCTCCATAGTTTGAGATACCTGATTGTCCACTGACTGGGATACGAGAACAG AT

GAGTCAAGTGTCTGTGATGAGCTTACTAGGTGACTTGGTATTGCAGCAAGGCGAAGT AA

TTGAGAGAATAATTCATCAACATTGGGGACAGTCGGACTAGCCAAAATCTGGTCACA TA

CTGAATCAAGGTCATTAGGGAGTTCAGCGAGTGTAAGAACTAGAAACATCTTCTGTC GT

TGCTCTTGTTGCTTTTCAATACTAGCAGAAACTGACATCAATTGCTCAAATTCTTCC ATG

ACTGCCTGTACTTGTCCTAAGTGTGTAGACATATCCAATTCCTGTTTCTTCAAGCTT GTC

ATTCGCGATATTACATCATAGAAACGAGATAGTCATTAGTGTATAAAGTACGAGCCT TCT

CCCAAACTAAATAACATGTCTGGAATGGATGGAACAAAGGCATCAACTTGGAATCAA TA

GATCGCCACAAGATACTACATAACTGAGCATCGACCTTCTCCCAAAGTGTTTTGGCC TT

TTCATCACCGTCGCTAGCCCTTTTTGTTAAATGATCTTGAACTCCTTGACCTTTACA CCA

CAACTCGACAGACGAAGCCCAAGCTAAGTAGTTTGAACTTCCCATTAAAGGTTCTGA GG

CAATCATAATACCATAACTTCCAGAACCCGTGTTTTTAGACCCAAATACATCCACTC CCA

AAGACATTATTGGATTGAAAAGAGATCTAGCAAATTAGCACCAAATAAAACAAAGAA TCA

ACTGTGGTTGCCGAAAAACTGCCGGAAAAAATACTGTAGTTGCAGGAAAATTTTCAA AG TGCTCGGAATCAAAAAATAAAAATATGGGAAGGCTCGGAATTGCAGGGCGATCAGACT

GTTCTAAAGAAGTTTTCTGAAAAAATGGACGGAACGGGCTCCACGCGCCGGCGCGTG G

AGTAGATCTTGCCGGCGAAAATTGTCTTCGGGCGGCGCGTGAGGCGGAGTCTGACGG

AGTTGTTTGCTGGGGTTTGGTCGCCGGAGGTTGGGGACCTTATGGTGGTGTTGGTTT T

TGCACAACACCGATGGAATTGGTTTTGACGAAAAAATAGCCCTAAAAGGTCACCGGG AT

GAAGCACGTCGACGACTGGGTTTTCATTCCCGGATGTTTTCTCACTGCCGCTCTGAT AC

CATGTGAGAATGCACGGGAGAAAAATCTATTCTATTGATTAAAGTGTTGTACAACCC TAT

TTATATACAGTAATTACATAATAATAGGTATCTACTTCCCGATGTGGGACACTAAAC ATG

ACTAACTACTTAACAGTGTTGAACATG G GTAAC CCAG G GG G GTCTTCACCTAATTCCAA

AACTGAAGTGAAAAGAGGAAAAAGAAAGCAGGCTTGTGAGGTTCCGCGTTCCTCTCC A

TCTGATATGACCTTTCTATATATATATATTCTTTTAAATGATGCTTCCCGGCTAGCT TATG

CGCACCTCGATTATTCTATTTAGTACATGCTACCTCCCATCAGAACATGCACAAGGT AAC

TCTGTCCATCAAGGCTTAGGAAAATAGAAGAAATCACCTACTCTCTCCGTTCCAATT TAT

GTGATCCTGTTTGACTGGGCACAGAGTTTAAGAAAAAATGAAGACTTTTAGAATTTG TG

GTCCTAAACAAGTCAAAAAGGGGCCTAGATTATTTGTGTGGTTATAAAAGCTTCTCA TTA

ATAGTAGAATTGTAAGTTTAAGCTAAATTGTTACCAAATTTAGAAATGGGTCATTCT TTTT

G AAAC GG AC CAAAAAGG AAATAG GTTCACATAAACTGG AACAG AG G GAGTAGTATTTTT

TGTTTCCATTGGGATTTGTGATTGAGATCTCATGGTTATGTATGCATATGTTGTGAT GTA

GTTCCATATCTTTACTGTTTAGATGGTTGACAGGGAAATAGTAAGTTCTTTTTTAAC TTAA

TTACGAAAATAAATTGTCTTCTTTTATTTAAGCTTATGTGACATTATTTCCTTTTTA GTTTG

CTTATAAAAGAATTAACCCTTTCTAAGTTTGGAGAACTAAATGTTCTCATTTTACGC TTAA

TGGCAAGCATTTATAGCCACACATTCGTTCTGCATATTTAAAACCTGAAGTTTCAAA AGT

CTTATTTAACCATAGCATGTTTAAGACTACAAGTTTCAAAAGTCATTCTTTTTTCCT TATTA

AACTTC CTGTGTAGTTAAACAAG GTCG CAATAAAATGAAATAGAG GG AGTACTATATTTA

AGAATG GTATC ATACTTG GTAGTTTTTCTCTTCTCGTCTCTCCTTTTTTG G GTAG GG G AT

GCATCAAGCTGTAGGTTCAATTGTTTATAACTTTTAAATAGCTAAAGAACTTGCTCT AGT

AGTGATTTCGGGGGTAAGATAATCTTTGTGGTTAAGAATCCATTGAATATGGGAATA ATA

AAAAAAAAAAAAG AAAAAG AATG C ATTG G ATTAG AG ATC AC AAC C ATTTTTAG C C GAT AT

TGGCTGCGATGTGTTCTGGCTAATTTTTTCTTGAAAGGTAACAGAGGATCGGTTCTG GC

GAATTAACTAAGGTTCATTTTAAATATCACCCATATCCAACCTAGCCACCCATCTGT CAC

GTATAAAACTTATTGTGGACAAAAATAGAATGGTCACTTCCTTTTCCACCCACCGCT AGT

TGCGACGTGACAACAAGGAGTTGTAAATGTGATGTGATGGATTTTAGTTATAGCAGG AA

CTTAAATGATAATCTAGGAAGCTACAATTTTGCTATGCTTCAAACAACAGTCATATC TCG

TATTAAAACTCAGCAAGTCTTGCTTTTTTAGTAATATAGGTTTGCTAATATTTCAAA ATCC

TATTTTATATTTTCTGCATGTATTGGATCTCTCATTGCTTTAAGATATAAGAAATGG TAAT

CTTAAACTATGTCTATGCCATCAGAAAGTGGATTACCCGCTGTCGTTGGATGTATAT GAT

TTTTGTTCGGAAGACCTTCGCAAGAAACTGGAAGGTCCTCGCCAGGTACTGTCTTTT TC

CCATTGATCAATGTCTTTTAAGAAATGAGGAAAGACCAGACCCTCTTTGGCCCCTCT TTC

TCTTTCTTGTTCTGTTATTACATGACTCTAAATTTGCTG CTAG GTTTTG AG G GATG CTG A

AGGTAAGAAGGCCGGTTTAAAAACCAGTGAGAAAACTTCAAGTTCAACTGACGGCGA C

GTTAAAATGACTGAGGCTGAGGTATGAATTAATCTTTGTAATGTAGGAGTGACTTAA GG

GGATAAAGAGGGACCTTTCGGGCCCACCTATGGCGGATTGTACAAGGTGGTTTAAAG T

GGGAAATTAAGAATGTCTATAAGTAACTTCTGCCTTTTCTCCTTTTTTTTCCTATTG TTTA

TG CAG GAATCATCTAGTG GAAGTG GAG AAG CGTCTAAAACAACCCAAGAAG GTAG AGA

AACACCTCCTTTTCTTGATAACTTGATGACTTGATAAACATATGCTGCTGCTGTATT TTAA

TTGGTAACAATGTCTGGCATTAAAATTGTAATATTTGGGAGAGAAGTTATTGTCATG AAA

TTACCTTCCAACTCACATATCCTTTTCAATGCTTTAAATGAAACTCTGTTAGTTAAT TGTC

AGATATAATTCTGCAGTAATATTGCGGTTCCATGAGGTTTTACAGTTCTTATGACAA CAA

TGTTCTGCCCTGGGTATTCAAACTTCTTTTCACAAAGTCACTGTTAGTATCTTTGAT TACA

AG CG ATTG AC CTTCTATTAACAATTTTG G GATCCCATAAAG ATTATTAACTTG GATCAGA

TTTATTCCTTTTTAAATTACTATATGTCCCTAGACACCGGCGGACGTTTGCCAAGTT TCT TTTGAAGGGGGCGCCTTTTAATTTTTAGAACTATGGAAGATCCTTAAGGTTTAGCTCTGC

TGATACAAGTACATTTTAATTTGTTTGGACATTTGTTTGTATGTAGAAAATGTAGAG AACC

TCTAGCATAGATAACCCCGTACTTGCCTTTGAATTTATATAACTACGAAAGATTCTA AAG

GTTTAG CTCTG CTG ATACACTTAC ATTAAGTTG GTGTTTTCTC GG AGTCCTTTAATTTGTT

TAGACATTTATTTGTATGTGGAAAATGTAGAGAACCTCTAGCATTGGATAACCCCAT AAA

TTGTCTTAAAAGAAAATTTTCTGAGTATTGGAATGAACTAGGGCCCAGATGGAGCAG AA

TG AATGTCG G GG ATTAAATAAG G GACTCTAACTG GTTCTGG ATTG G GAG CAGTAGTTGT

GAATGATCAATTTTATCAACAGTTTAGTGTTTCTGATATATAAGGGGAAGTTAGTCT AAG

AGCTCAATTTTGGAATTCTGTTATGAATGAGGAGTCGCAATGATTAGTGCTTTTTTT TTTT

TAATGATGCCCGATTAGTAAACTCCATTTCAGGTGTTCTGCCTGAGAAGGAACACCA CT

TG ACTG G AATATATG ATTTG GTG GCC GTG CTG ACTC ACAAG G G AAGAAGTG CTGACTCT

GGGCATTATGTTGCCTGGGTCAAGCAAGAAAACGGTCAGTTTAACTGGGAAGAGATT TT

GTTCTAGTAATCGTTGCTCTTGGACTACCATCTGATACAATATATTGAAAATCTCTT TGTA

AACCACAGGAAAGTGGGTTCAATTTGATGATGACAATCCAATTCCGCAGAGAGAAGA G

GACATCCCTAAACTTTCAGGAGGTGGTAAGTGAATCACTTGTGTATTACGTCTTCGG CA

AATTTTCAAAGTCTGG CAAG CATATCCTTTCTTATAACAACAAG ATGTAAAG CAGATG GA

ATATTTTGTTGCTTGTGTGCCTGAATGTGTTTTTCGTTCTGTCAGTTTATAGAAGTG CTTT

ATTTTTGGTTTCAGGTGATTGGCATATGGCTTATATTTGCATGTACAAGGCCCGTGT TGT

TCCCATG

SEQ 52

CTATTTCACTTGATGCAAGGAAAATTGATTACTCCTGGCACTACGAAGAAATGTTTGGTT

AAAAGG ACAAAG GTCAGTG CAAAAACCAGAATAGTTCATTG CTACACTAG GTCCAACTA

GCCAGATAGATGCACACAATTTGCGGCAGAGATAACTATAAGACACCACAGTGCTTAAC

TGCCTTTTAACCAAAGACCAAAAACACTCATGAAGAAGAAAAATGACAACCTTTTTA TAG

CTATGGACTGTCCATTTCATACGATTCCAGTTGTTAATACTTTATGCATGAGTCAAA TAG

AATTTGCATACAAAAAAATAGGTCCATTTGCAACAAATCCGAGTATAACTGAATGAA CAG

ATGAGAGCCATTAAAACCTTAATGTCAAATCCTACAAAACAATTGGATCATCTCCAC AAT

GCATGACGTAATTCATCTCATGCTGTAAATATATCAGTCTGTGGTTAATATGAAAGT TAT

AGATTAAAGATTCACAAGACACAAATGTGCCTCCTGAACTTCCTAGAGCAGCATATC AG

CTAAAGCAAAGAATAGAAATACACTAATACAGAACAGCAAAGAGAGACTTAATAGAC CT

GAGTTCCATATTTTCAATGCGGATACGTTCAACATATAAGTCATGATCTTCTGCTTC ACT

GTTGATCTCATCGTCTTTTTTGGGATCCTTTTTGCGCCGTGAACTTGATCTCCTCGA AAA

TAGTCTATCAATCCTG G CAG CATAAG AACCTAAG AGTG CTAAG ACACAATG CATTCACT

AAAACATTGAAAAAAGGCCCAAAACAGAGGCAAAGTGGAGATGAACTTTAGAATTCT AT

CTTCACATTTTCACCAGAAGGGTTCAGAAAGTTATAGGAACTTCATTTCAAGTACAA TGC

AAACAGTAAACAGGTTTTTACTATACTAAAGTATCTAAAAGTCACTTACCTTGTTTT TCTC

AAAATCCATGCCCAAGTGCGTCCAATAATCATGTCCACTTTTTCCATGAAAAAGTAA TCA

GTAGTCCTATCAATTCCAATGCCGCGTCGAAAGATTACATACTGGAAAGAAAAGGTA CG

TATATAGAGATTCAGTGGCATGATCAAACTCTTGACTCTCATCCTTTTCCCTCGAGA AAT

ATATGTTCATATTACACATTGATATACTTAATAAATTTCGAGGTAAAACATTGATAT ACCA

TATAAGTTGAACAGCTCAAATACAATATAATACATCCATGATACTATTTGCAGGTTC AAAA

AGATATAAGTG ACAACACGTG CACAAGTCACTCTTTAC CTTTGTTG AAG CTTGTGTG CA

CTATTACAG GTTTG AG GAG CAGACCTTG AACAATG ATTAATCTTTTGAAAACTAG CAAG A

AGATGAGAGGATAAAGATTCCAGTTTGTGCTCAGTCATAACACACTTTTTTGAATCC GTG

GTGAAAATAAAAATCTTTCATCAAAGAAAAAGAAGAAAGGGCCAGTAAGCACAATAT AAC

TCTAAAGTTCTTCGCAATAAATGATCGCATACTTTTGCTTGTGTTTTATTGGAGTGC AAAT

TCCTTCTTGCCAAAATCATGGGCAGTAATGGACATCCTGACACCTAAGAGCTAATGT TG

TATAGGGACAGTGGTTACTTCAAGATTCACTTTAAACAGCCAATCATACTTGCTGCA AAG CAACAATGAGATAGGGAAATAAATGGACACCATATGTGGATATCAAGGATTCAAGTCGC

ATTCTTGAACCCCAAACAATGCAAGGCACCAACAAAAAGCAGATTGGGAAATATTCA AT

TTGGTGAAAATTTCATAAATACAACCAGAAGCTACACTTGTCGACCTAGCATCTGGG AG

CCATTAACTTCATACCAAACGTTCCAGCAAAAGTAAAGGCCTCATTTCAAATTGTAA CTT

G AAG CTACACTAAG AG ATATACCTTGTCAGCAAACTCTG G AAG GTCCTCATG AG G ATG C

TCTG CAAAATACTTCTCTAAAAGTTTTTTGTCAAG CTG CATCC AG GAAAAGGTAGAGTGA

ATCGCATAGCGAACATGACAAAATGGAACTATATCAAGAGAGCCAATAATTGTCGGA AC

CAAATTATATGCGGAAGCAGCTGGTTAAGAACCTTAGATTCATCAACAGTGATTGGA AG

ATTTAGAAGATACTGTCCTGAATGTGCAACATCAATCTCTTCATCACTAGCTATTTT AAAA

TTGCTTTTGTGCATAATCTGCATCAGCATAAGAGAAATGATGATAGAAAGTGGAGAA TG

AAATAAGGGTGCCAGAACACAAATGTCATACCCATTTCCCTTAATCCTCCAGCTAGA TG

TGGAAAAGCATTCAGAATTCACTGCACTTAGAGTTTGATGCAAGTTTAAAATCCATT ACA

GGCTTGGCCTCAATAGCATGATAATGCAGTGCAAAATAACGGAAAACGACTTTAGCG AT

AAATGAATAATATATAGCCGTTCCCCATGCTTGAAAATAAAGAAATATCCCCTTCCA TTT

CGATTTTGCAACAAAACAGCATAAATATGCGAAACTTTCTTATTGTAAGGCTACTCC TTA

TG ATTAAAAATG G GAAG CATGTGTTAGAAAAAACAAAGTAAAAGAG AAAAAAAACAG GA

AAATGAAGGACGAAGCAACTCCCACTCCCTGAAGACAAAAACCTCGACCTCTTACCT TT

CACCTCACTGGTTGTCTCCCTCCCCTGAGAAAAAACAATTATTCGCTATGTCCCAAT TTA

TGTGATGCACTTTCCTTTTTAGTATGTCCCAAAAAGAATTATACCTTACTATATTTA AAAA

AAAATTAAAACTTTCCATTTTACCCTTAATGAGATAATCTATAGCCACAAAAATATC TATG

ACTTGTTTAGACCACAAGTTCCAAAAGCTTTCTTTCTTTCTTAAATTTTGTGCCCAG CTAA

ACAACATCACATAAAATGGGACGGAAGGAGTAGTTTTTCCTCTCAATTTAATCCAAT AGA

ATTTTCTTCCATTTCAGTGGAGATCTTACACTTAATAACTGATGCAGGATTATATCT TTTG

TTATATTTTTATTTCCTAGGCTCGGGAAGGAGAGGATAGCATGTTTTCACCTCCGCT GG

GGATTTTCTTTTGAAGGTACATAAATGTCAGGGCCCTCATACAAGGGTTGTAGATCG AA

AGTATTCTAACTATCTTCTTTTTTTCGGGTAAATTGTTGAGCCATGGGTACACACAT ACA

CAAATATATAGAGAGAGATATCTTCATATAAGCAACTGCAGACTATAAATATGCACA CAC

AGAAGACAGATATGAGATTGGATTTTTTATCAAATTTAAACTGGAACACCAATATCC CGA

CTAAAGATACGCGGATACAATAGAATATAGCAGCTGATGGTGTTACTGAAAGGTCCA AC

ACGGGTGCTCAAAGTCCACTTGGGCAAATGCGGCTAGGGAGTCAGAAAGAATGAGAT G

GGCGACGGACTTGCCCTACTGTCAGAGCTAGTGGTGTGTTCTTGAGATGAAGGCTTA G

CATTCACCACCTTCCACGTTTCCGACTTTCTCTTATTTGAACTTTTTAGATCAGTCA ACTT

G AGTAAG CAACTTG GG CCAAG G GATG AAACTCAAGTCTCTAG CTCTCAAGTATACTG AG

TAGGCAACTTGGGCTTGCATGTTATACATATTTTTTTAACTAGTATACTGAGTAGGC AAC

TGGGGCTTGCATGTTATATAATGAATCCAAACAGCCCATATAGCTGGCTCAGGTACC AG

AGGAAAACAAACTTTAGATGAAGCTGCAGATTTTTTACCTGAAACAAATATGTCAAG AAA

TTCTGTTCAAGAATGTCAATCTCTTCTGGAGATAACTTCTGTTGTTCCAATTTCTTA GCC

CCGTTCACAGGATCAAACAAAGAGTATAATTGCTGCAATCAGACAAATATTATTGAT CAA

AACACAATATCAACATCAGCATATTATATTCATTTGCAGCAGACTGATGAGAAAAGA TGT

CTGATCAAATAAAAAGTAGCATAAGATTATATTACCATGAGATCCTCAAATTGTAGA AGA

TACCAAGCATGAATTGTGTACTCAACCCTCTTGCAAAGCTTCAGAAATTCAGCCCGG TC

AGAACTATGTTCTG GG CAAG GAG GAAGGTCG GATCATTAG AACCTCAAAAAAGACTTTT

ACTTAAATAACCAATCATAAAAAAGCACTAAAGGGACTAGAAGCACAGATATTCATG CAG

CATTTTACAGTTTCAAGAAAAGTAGATTTTCTGAAGAAGAAAAAAAGAGTGGAAATG AGC

TGCCAAGACGCAGACATGATTTACGTATATCCAGAAGGAAAACTTGTTTCTGTTAAA AAT

TCTAAACATATAACACGTCAAAGCTTATCAGTGAAGGAAGGGTTTTGAAACCTTTTG GG

G ATTTTG ATCCATG ACC ATCAAGTGTCTCCGTAAAATTCACATG G AG G GTGTACAG G GT

CCCCCCTTCTTTTGAACTGGGGTCTCCTCCTTTTGAAATGTCCAGCACTAATTTACT GCG

TTGTTTTTACTGTGAGTAATACTGTTACTTATCTATCCAAAAAATTACATAGAGGGA AGTC

GTTTCAAAAG CTTAC AATTTCTTG AAATCC ATG AG G ACTTG ATGAAAAGATG GAAG CCTT

TCTATTTCGGTTAGAGTTTATATGCTCGAGTGACCGTGTAAGGAGAAGTGTAAGATT TA GACAATTCACGGTTATAGAGAAACCCTTATTTGTCCTAAAAGACAATTTATTGAGTAGTG

GAATGAAACAAGCCAATGGAGCAGAATGATAATGAGAAAATCACATAACCAACTCCA AC

TAGTTATGAATCAGGGTGGAGTTGCAAAAGTAAAATCTCTATTAAGCATCTAATTTC TTA

AATTCAAGAGATTGTGAACTTCTGAGTTGAGTGAAAGTGACTTACTCTAAGAAAGTA TTT

AATGATAAAACTCGCAATTGAGATAAGTCAAATGGTAAACCTTGGAAAATAATCTAA TTT

AATCTGGGAAAATATCTAATTCACACAACAGTAAATCTAATTAATCTGGGAAAATAA TCA

AGGTCAAATTGTCTGCTTTGTCACAATACAATTAGAGTGCCTCAGAAAACAACATTA AAT

TTTG CAAAATTAAACCATTAAGG G GGTTG CTGTTTACACGAAAAAAG CTAAAACTCAGAC

GTCACCAAAACTATGAGATCGTGTAAAGTTAGTAACATTTTAGGTTTAAATTCATGG CAC

GCAATAATTGCTTGACATTCAGGAGAAAGACAAAAAAGTGACTGCCGTTCCCTATAG AG

TCTAGCGTAGAAAGTTGTATTATTTGCTTCAAAGATCTCCTCTTGGAGAAAGAAGCA TTG

TTTCCTTACACTGCCATTTTCTATGGTCTAGTATAGGAAGTTCTATTGTTTGCTTTA AAAG

TCTCCTCGGATTATAGCAGAGTTGTACTTGTGCTAGTCAAATTTGGTAGTTTTTTCC ACA

GAAGTAACAGAAAGTGAAAATATCAGAGATATTATCCAATTAAATTAGAGGAGAAGA ATT

GTTTCAAAATTCAAACAACTGATCGACCACAAAATAGAGAGGGAAGAAAGAGTAGCC GG

TTGTTTGGGAGCATCCAATTTTTACCTATAAGGTCGGCGAGGGCCATGATGAGCCTG G

GTTTGAGGACTGGAATAACAGACTCGCGCTCCAAACGGATCACCTCTTTCTTCTTCT CC

AT

SEQ 53

TCACCGGTTTGTGACAACTGGATTTCCGTTCGCATCAGACAAATGGATACTCTGGGTTG

ACTTGTTAGCGACAACAGGATTTCCAGACTCATCTAAATTAATTATAGCAACATCAC CAG

GCTTGAGATCCCCAGAAAGGAAAGATTCACTCAGGAGATCTTCAACCATTTGAGTAA CA

GCCCTCCTAAGAGGGCGTGCACCGTAGTTTCTGTCAAATCCTTGTTGGCATATAAGC TC

CATTACTGCTTCTGACACCTCCAAGCTTATTTCCAATGAAACAAGCCTAGCCCTCAC CTC

CTG CAGCATCAGGTCTAGTATCTG GAG CATCTGAAAG GG AAAAAACAAAG CAGTTACTC

GCAGTAGCCGACTGCTTTCTAAAGAGATGTGCCAGAATAAAAGATCACCCCAATGTA CA

TTCAGTTATCAGATCAATGCAACTTTCCAAATCAAACAAGAGGTATATTATACGAAC CTG

GGGCTTCTCTAGAGGACGGAATACTACTACTTCGTCTAGCCTATTCATCAACTCAGG GC

GGAAATATGTCTTGAGCTCTTCCATCACTATTGCTTTCATACCAGCATAGGAGGCTG CT

GATTCATCATCAGCAAGCAAGAAGCCAATAGTATTCTGTCTACCCTTTACTATGGCT GTA

GAACCCACATTAGAAGTCATCACTATCAGGGCATTCTTAAATGACACTCTTCTTCCC TGT

TGAATCAAGTTTCATAATTATCAACGCCAATCTTCCAAAACAAGGTTTACACCCCAT ATT

GTGGAACATTCTAGAACAATAAGCCAGATGTAAAAACGGATCTAACCTGAGAGTCTG TT

AGGTGACCATCTTCAAACAACTGAAGGAGAATATTGAATATGTCAGGATGAGCCTTT TC

AATCTCATCTAGCAGCACTACAGTGAAGGGCTTTCTTCTGATAGCTTCAGTAAGTGT TC

CTCCTTCTCCATAGCCTACATAACCAGGAGGCGATCCAATTAACTTGCTCACAGTAT GC

CGCTCCATGTATTCACTCATATCCAATCTTAGCATGGCAGATTCCTGTGCAACACAA AAA

GATACTTCACTGAGTACATAGAAATTCACAAAACCAAGTGATTGTTAAACTGAAACA GAA

CCGAAGATAGCAAATATAACTCTAGCAATTTGATGATGAACTTCTAAGAAAAGTAGA TGC

ATAACTCTTGATGATGAAATTCTTTTTGATAAAATCCTAGCCACAATTTTCATCTAG AAGG

TTGAACAGAGAATGTCGCTCAGATGAATTTGATTTGATTCAATCTCTGCACTGAAGT CAC

AAACCGTCTACGACCAAGAGATCCAAATTCAGTGTGATGACAAGATTAAAAGCAGCG CT

AACTATATGACTATGATCCATTATCAAAGCTCACTTAGTACTTTTCATTTATTTTGC CAAA

TTACGTGCCTCCACCATAGGTGATTAGAGACTCCTTCTGTTCTTTCTTTTGGGGCGG GC

TGAGAAGGGGTGGAAGGTGCAAGGATCTAGTAAATGTAACAGACTTATCAGATATAT AT

ACAATGGTTGACTTCATTTCCCATTGAAATGGATGAAGGAATAATCTGATCCTGGCA ACA

GGGAAAGAGATTTGAAATAAGCCAGTAATAGAGCACTACCTAGTACCTAGTATGTCT AA

ACTTGAAGTACTTATTAGTGCCCACCAATAATCAGAAGTCGTGCTCCAACAAGATTA ATT GGTGGTATTTCCAACGCTACATTTGACCATGCCGCAATAGCCATAGAAACCTAGAATGG

ACACACAGCACAGCCTCCGGTACCCTTTCCTTCTCTCCTTTTGTTTTTATTATGTGT GTC

CGACCACCTAAAACACAGTCACCACATCTTACTATCATAGTATAATACTTTCTTTAC CCG

AATCACCACTACCAACAACATAGAAAATTCCCAAGAACACCAGAACCAAAAAAAGTC CT

GAGAACACCAGTACTATCACCAAAACCTATCCATTGTCACTGAAGTGGCTGGATTGC AT

AACTTCAGTGG CAAAACATG GTG ACTGTCTG GTGAAACAATTAAGG CATCTAG AATG AA

AAGATGAAGCACATTTCTTATCTTACATAATAATTCTTCTCAAATTTAGACAAACTA AAAA

GAGCAAGATTGTGTTTGTGCAAATTATGCTGCCAGAACTCTTGGTCAACACGATTCA ATT

TCAGAGTTCCAC AATTTCTACTCAATTG CTTAATCTG GAG ACG CATCTTTG GAG GAATAA

TGCAAAACAGCTCATTTTATTAATACTTACAGAACCAAAATAAGACGCTGCCAAAGCTTT

AGCTAGTTCAGATTTTCCAACTCCAGTAGGACCACAGAAGAGCATTGCCGAAATTGG TC

TATTTGGGTCCTTAAGACCAGTTCTAGATCTCTTAACAGCCCGACAAATGGCTGCAA CA

GCCTCATCCTGACCAACAACCCTTTTTTTAAGCTGCTCATCAAGACCAACCAGAAGC AT

TCTTTCATCAACAGTAAG CTGCTTAAGG G GAATG CCTGTCCAGAGTGAAG CAACTG CTG

CTATTTCCTCAGGTCCAACTACCGGAGGTCTAGAGTAAAAATCAAAGGATTTAAGTT AAT

TTACTTCTGATTTAAATTTATGACTTCAGACAATCTTTATTCTACTATAATGCCCTT GTAG

GAACATGGAGGCAGATACGTACTCATCTTCATCAGATGTAGAAGGTGATGCTGGCTG TA

AATGAAGTTCACTGCCATCATTCAAACGAGATGCATCATCATTTTCTGTCAGCTTGC TTG

CCAAG ATCTGTTTATG GAG GG G GC ACAGGTAATTGTATTTAG AC CACATAAG GAAGTTC

AAATTTTTCAAG AC GTG CAAAATCTAG AG AGTTTGTATAACATTGAGTACTATTAC CACTT

CATGCATGGCTTGAACAGCTCTAATCTCCTGCCAATAATCACTTGGTGATTGTGAGA GT

ACAGATATCTGCTGTTCCTTTCTTCTTTTGTGAGCTTGCATACGAGATTTACTACCA GCC

TCATCAATAAGATCAATAGCTTTGTCAGGAAGATACCTATCCGGTATATATCTTGCT GAC

AGTTGCACAGCAGCATTTATGGCTTCCAAACTGTATATACACTTATGATGTGACTCA TAT

TTCTCACGCAATCCCAACAGTATCTGGACAGCATCCGCCTATATAATTTAGGAAACA AG

AATATCTGTTAG G GC CTTAG ACACC CACTTG AGC ATACAC ATG CATAG AGTATTATTCAC

CTGACTTGGTTCATTAATCAAGACAGGCTGGAATCTTCGGGCAAAGGCCTTGTCCTT CT

CAATATGCAATCTGAACTCATCCATGGTGGTAGATGCAATACACTGTTACAAGAACA AAA

TTTAAG CG CAG AGTCC ATAAAAG CCAACATTATG AATGC AACTAC ATG GAGTAATG CAT

GGAACCAGAATAGGAAATTACCTGCAGTTCGCCCCGCCCAAGTGCTGGCTTTAGCAAA

TTAGCAATGTCAAGACCAGAACCCTTATTTCCCCTTCCAACTGTACCAGCACCAACA AG

GATGTGGACCTCATCTATGAATAGAATGATATTGCCTGCAAATTATGTCACAAGCTA AGC

CAATGATCTAAGAATTTGACCAAATTTTACTTCCATTCTATGCTCACCTGACTTTTT GACC

TCCTTAATTAATGTAGTCACACGCCCCTCTAGTTCGCCCCTCTCCTTTGCACCTGAA ATG

AGTAGGCCAATGTCTAAAGACATTACCCGCTTTTTCTGTGTATATAGCAAATAACCA CTA

GTTAAGTAG GTG CAG CAG CTAGTG G AACAAG AGTATAG G AG ATG CATATTAAATTACAA

AATAGTTCAAAGAATCCCTGGAAAAAAAATGAATTCATTGAAAAGCCCACCATTAAA AAT

GCAGGAATATTTCCCTCAGCAATGTTTATCGCCAGCCCTTCGGCTATCGCTGTTTTC CC

AACCCCAGCTTGACCAAGCAGAATAGGATTGTTTTTGGTTCGACGGCAGAGAATCTC GA

TAATTCGCTGAACTTCAATCTCTCTGCCAATTACTGGGTCTATAAGGCCCTCACTCA CAC

G GG CAGTAAG ATCTACACAGAATTG CTCCAG CGC ATTTTTCTCTG CTG GAAATG AAATG

CATTTGAGACAGCGTGCCAAACCAAACACTAACAAGAGAGAGGAACTGGCCATACCTTT

TGCTTTCTCAGCGGATCTGTCGATAGTTATTTTTCCAGGAAAGGATTTCTCACGCGA CC

TTTTG AATG AAATTG G CTCTCTACCATCTTTAG CAAG CTCTCCTTG AAG CCTG GAAACTG

CCTCAGCTGCCAAACGATTTACATTTACTCCTAACCTGAAATTAGGTCAGTCGACCA TG

CAAATCTTAATTTCTTATATAAATAGCGTGTAATAGATGACAGAAAGATAATCATAT TGCA

AGAGAGAAGGCATATTAAACTCAATCAGGAAGGTGAAGAAGGATCCTATTCCCAAGC CA

GTAGCATTGTAAGATAAATTATGCAGCAAAAAGGAAATCCGTAAGCATTTGTTTATT TAT

TAACAAGTACATCAGTATGCTTGGTTTAGAGCACAACATGTGCTCATGCATAGCGTT CC

ATCAGAAACAG AAATTG CTT AT CAACAG G AG GTCAAAATATTAAGCTAG G GTTGTCAGG

CGTACAACTTGAAAAAGAATATGTACACATACATGTACACAACACAAGAAGACACCA AAA AAAAAAGAGAGGCAGATACCATTCATAAGTTAAGAACAGAATACTAAGGTAACAACAAC

AATCCAGTGTATTCCCACAGAGGATAAGATGTACGTAGCCTTACCCCTACCCCGGAA AG

GCTGAGAGACTGTTTCCGGTAGACCCTTGGCAAACCAAAATATCTCTTGGTATTTTG TAT

ACGCTTGGGAGGGAAAACAACACCAAATGGAATGTAAAATATGGCAAGCAAGGATAA AT

GAGTGAAGGATAAATTCTACCAATAAGTTCATTGCTTCCTAGCAACAGTGGGAGACA AC

ATCTCATTCCTCCTCAATTGTAGAAGCTGTTTTGGACTCAAAATAAGCAACAATAGC TAA

TCTATTCATGATCCCAAGATTGTTTCAGCTTACCAAAAGCCTATTACTTTTGTTCTT AATA

ACCTCATCATCCACTACTAGAGGCAGATGTAGTGCTTGAGTTAAAGGTTCATCTAAA CC

CATTAACTTTGGTTCAAACTTGTATGTATGTTAAAATATGCACCAAATAAGTACAAA TAAT

ATATTTCAAACCAAGAATGGGCTGCGGAACGCATATTCAAATCGTTGATCTGCCTCT AA

CTACCTACTTGCTCTACCAAAAAAACTAGCAGTTCCTATACTGAAAACAAAACTACT TGC

CAAAAAAAGTTAAGAAAAAAAAAAGCATTCTTGACTGACTATCTGTAATACCTCTTG AGC

ACACGAGTGGCGTTACCATCATCAACAGTAAACAAACCAAAGGCCATATGCTCGGGA G

CAATAAAATTATGCCCCATGGTCCTTGAATACTCAACCGCAGCCTCAAAAACGCGCT TC

GTACTTGAAGAAAACGCCACATCAGTAGCCGACGTAGCAGAACCGGAGTCCTGAGAA G

CCAATTTTTCTTTATCATCCTCCACGTCATCATGCCATATGCTCCGAACAGCTTCGC GG

GCTTTATCAATTGTTATTCGAGAACCAAGGAATCCACCAGGGCTACGATCCTCTGCG AT

CAGACCCAGCAAAAGATGCTGTGTATACACCATATCTTTGCCCAAAGCCTTTGCTTC TTT

TTGAGAAAACATCACAGCTTTGATTGATCTCTCAGTAAATCTCTCGAACACTCCAGA GAC

AATATACAAAGAGCGCTTGATTTTACGAGGAATTGAGCTGCAGGGCCTATGAGAAAGG

GAAATTCCAAAAAGGGACGAAGTAGAACTGCTAGTACTACAAGCAGCGGTAGTGGCG G

TAGTAATAGTAATATGAGAGGAGGAAGAAGGGCAATATGGGAAAAGCGAAAACACGGT

TTG ACATCTCTTGTGAG GGTACACAG AG CCATAG CG ACG AAG CTGAG GATTGAAG CTG

ATTGTTGAGTTCACAGAAAGTGGAGAAGAACACGTTAATTCCAT

SEQ 54

ATGAAGAATATCGAGCGTCTCGCAAATGTTGCTTTATTAGGTATGGTTTCTTTTGATTTT

GATTCATACATTATATCTTTTGATGATAGCTGAATTGCATAGTATACTTGTTTGATT TGTG

CTTGTGAAGTTCAGAAAGTAAAGTAAATCCTGTTTGATTTATAGCTTCGTTTTTTGC CCC

TTAGTTTGTGTTGGTTACTCATTCAGATCATTTTTCCGCTAGATAGATTGCTAAAGC TTTT

GATG CTAATTCTTTGTTGTT AATTG GAAG GAG G CTAC CCTTCCAGG GTAG G GGTAAG AC

TACGTACATCTTATCCTCCCCAGACCCCACTCGTGGGAATTCACTGGGTTTATTGTT GTT

GTTGTTGTTAATTGGAAGGACTCAATGTAGGGAAAGGTGCTAATTATTGTGTAGTTG GA

ATTTGAGGTGTGGTTGATGGTTACCCTAAATATATCTATCCAGCATGTTGGAGTAGA TTC

TATAGCGGTTGGAATGAATATTCAAATCCCCTTGGGCAGTCACATTACTACTGTTAC CC

GCTTTCCTTTATGTCACAGTAGGTTCCACTTCCACAGTTCCAGTTCAATCGGTAGAC AAA

GATGGTCATGTGGGTTCTTTATATCAGTTTTAGCATTTTCTTATATGTTGGATGTTT GTTT

CATCATATTGCCTTTTTGAGGACATTTCACTACGTAATAGCAGCTATGCCGTTCTTG GAA

ATTTACAATGTACGATTATTTGGTCATGGCAATTTCACATCACTTTCCAAATTTTAT GTTG

ACGCAATTACCTTGAAACTCTTGCTTTTTTGGTGGATTTCAGGTTTGAGTCTGGCAC CAC

TGGTGGTGAATGTGGATCCAAATGTAAATGTCATAGTAACAGCTTGCCTTACTGTCT TTG

TGGGATGCTACCGTTCTGTCAAGCCTACTCCACCTTCAGTATATCTTCTGTACTCCA AGT

TGCAGCTTCCCTTTTTCTTAGATCTGTTTTGATGTCACTTAAACATATTCTACTGCT GTTT

TCCAGGAAACAATGTCTAATGAACACGCAATGAGGTTCCCCTTGGTTGGAAGTGCAA TG

CTCTTGTCATTGTTCTTGCTTTTTAAGTTCCTGTCAAAAGACCTGGTTAATGCCGTA TTG

ACATGCTACTTCTTCGTTCTTGGCATTGCTGCACTTTCGTATGTTCTCTCCGTATGG ATC

ATTCTGTGATGCTTAATATTTTCTATAACAAGTTCTTGAATAGTAGTTTTTCTGTGG GTGT

ATTGGATGTCATCTCTTTCTTTGTGTCTTTGCAGGGCGACATTGTTACCTGCTATCA GAC

GATTCTTGCCCAAAAAGTGGAATGATGATCTCATAATATGGCACTTCCCATATTTCC GCT GTAGGCACCACCTTTCTTGTCTCTTTTGAAATGCCAATTGATCCTTTAGAATCCTTGGGC

ATACAGATCTCATCTTAGTTATTTTGTTTCGTCTTTTTTCAGCTTTGGAGATTGAGT TCAC

AAGATCTCAGATTGTTGCCGCAATTCCTGGAACCATCTTCTGTGTTTGGTATGCTAA ACA

GAAGCATTGGCTAGCTAACAACGTTTTGGGCCTTGCCTTTTGCATTCAGGTTTGTCG GC

ATATCCATCCAAGTTACATTCTCATTCTTCAGGATATCTCAAAATGAAAAGTTGTGT AAAA

TAGTATTATTAGTACAATG GTAATATACAATTTTG GATATTTCAAAGTGAAAAGAGTATCA

TATAAATTGGGATAGAGGAAGTACTAAGACACTTGAATGAAGAGATCATATTTCATC ACT

AAAAAAGTTGCACTTATCTGTCCATACATGTTCTCGTAACCAAGCATGGTTGCTCTT TAA

TACCAGATGCAAAGGCTACCCGCCTTTATATCTAGCATTTAAATCCACGATAGCACT TGA

TGGCTTCCTCTTTAATTTGTTTGATAACTAGAATTCTCCCAGGAGTTGGCCTACATT TATT

AAACTATGGGAGTATAATAGGCCTCCTCTATCATGCTCCCACTAATATAGCGGCTCC TT

GTTAGTGATAGGGTTCTAACTCATGACGTGGACCCATATTCTGACATTGCGTCATTA CAT

TGAACCGGAGCCCCAGGGGCTTACTATTTGTGATTTTCTATTTATATATACATTGAG TTA

ATGGAGATTTTTGCAAGGGAGAAAAGGTTTGATCCTCTCTTATGTCATGTCTACATC AAT

GATTGATATTGATTTTCCCATTGCGATTTTGATTTTCAGGGTATTGAAATGCTTTCA CTTG

GATCATTTAAGACTGGCGCCATACTATTGGTAAGAAAGAAAATTTGTTTTCTAATTT CTAT

CTGTAATTATACATGGCTGACAGCTGTATTCTGTTTATGTGTTTCGCCTTAAACTAC ATAT

TGCTTGTCTTTTTGAATTTGATGCTAACCACATATCTCTTTATTCAAGCGGGAAGAG AAT

TTCATGAAATGAGCTATTAATGATTGTTATTGTTGAGCTATTAATGATTTTACATAC AAAA

ATACATAACATTTGCATGGATTATCCCTAATTGCAGAGTTTTTAGACATTTTTGAGG TATT

CTTTTATGTTGGCATTTTTGCTTGTTTATGCAACTATTTATATCCATTAACTTGTAG CTGA

TGTTGAATG CACATG GTTTTCG AG AATG CAG G CAG G ACTTTTTGTGTATG ACATCTTCT

GGGTCTTTTTTACCCCAGTGATGGTCAGTGTTGCCAAATCTTTTGATGCTCCTATCA AG

GTGTGCATACTGATTTTCTCATATAGCTATTTCTTTTGAATTTTCATTTCATGCCTT TATTA

GTTACAGAGTCCTGATTATAACTTCGCTTTCTCTGCAGCTTTTGTTCCCCACAGCAG ATG

CTAAACGCCCCTTCTCAATGTTGGGTCTTGGAGACATAGTTATCCCCGGTATAACCT CC

ATTTGCGTGAAAACTCCATTCACTTTATGTGGTTAGAACAGAGAGGTTTAGCATTTT GCC

TAGCGGAGGGATCCTCCACCTCAAACCATGTGGTTTGGGGTTTGAGGCAGTAGGGAA A

CGGTGGGAAAAGCCACTGTTGGTCCCTGGAGGGGAAAAAATGGGGGTGGGTGGGGG

GATGAGGTTTACCATATTAAAAATGAAAAGCTTACTTTTTGTGAGTAGCTACTTGAA TGT

ATTTTTCTGTTTCTTACACATGCTTATTATTAGCTTTTGCCATGATGCTGTATTTGT TTTCA

TTTTTCAACTTGTTTTTTGCTTGAATAGATACACTTGGTAACATTTGATCACTTCAA TCAT

GCAGGTATTTTTGTTGCATTGGCCCTCCGCTTTGACGTTTCCAGAGGGAAGGGGCCC C

AATACTTTAAG AGTG CATTTTTAG GATACACATTTG GTTTG G CTCTTACCATATTTGTTAT

G AACTG GTTTCAAGCTG CACAGGTTG GTG AATCAAAATAAAG CTTTTACACTTTATTTCT

CTTGCTAGAATTGCAGCGCCCTTATGTTTGACTTGGCCTTTGTTTTTTCCAGCCTGC TCT

GCTATATATTGTTCCAGCAGTGATTGGATTCTTAGCCGTACACTGCATATGGAACGG GG

ACGTGAAGCCTGTACGTTTTTTCTTTTGACAATCTGTTTCAACTTCATCCACTTGCT AACT

TTACTGTTTATGTTCTTTATATGCTGTTTACTACCATTTTAGCTACACACTATTTGT AGATT

ATATTTTCTAGGAGTTAATAGATATGAGAAAATGCATCTTATGGTTTACCTTTAATT CATC

CAAAGAAAACATGCATGTCATGATTTGTTTGGAAACTATGGACAATAAGTTAAAGGT AGG

GAAGGGAAGTTTTCTCGTTTCTTCGAGTTGGAAGCAAAAAAGGACGATCAAGAACTT CT

TTTTCGCCTAACCTTTGATAGAGAGAGTACAAACCCAAACCTTCATTGCCTTTTCTA GTT

TATACGGATACAGAGTTAACGAAATTTTCGTTTATGGAAGTAAGATTGGGGTCTATT CTC

AAGTCGTAGAAAATTACATTTTCGTTTTGATGCTGAGTTTGTATTCTTGATTTTCTG TTGA

GCAGTTGTTGGAGTTCGACGAGGGAAAGACGAAAGGCGCTGAAGAAGCCGATGCCAA

AGAAAGCAAGAAGGTAGAA

SEQ 55

CTAATTTTGTTTGAGATCTCTATAGTACTCTTCTAACCACTTTTGAATAATAGCCACTTC T

TGCTTCCTTTGCATGATCAACCAGCCAGGGTCATTCTTTGTCTCAGAACGAAAGTCA AC

ATGGTGTGCACCTAATTAAATAAGAAAAATTAAAACATGAAACTAGTAATAAAATAA ATCA

ATGTCTCTAATCAACAGCAATCGTTTTTCATTATGCTTTAATTCAATGAATATCATG ATAT

ACAAAAGCATATATTACACAAATAGCCGGTCATATAGCCGATGTACATAGATTATAC AGT

AATTATATATAGTTATACACATTTTATATATGAATTATACATAAATTGTACATGCGC TAGTT

ATTTTTAATTTAAGAAATCAGATCAGTGGCTATTTGGGTTAACTCTTCGTACCTTTT TGAG

TTACGAGTG CCACAATG CTAG CTG ATATATTTTTCAG CACACTGTCACAAAAAG AG AAA

GAAAAAAATCAATTTAATAACAATAAACAAAGACTTATTCAAGATTTAGAGTCTATA TGAG

TTTATAATATAAGTCGAAAATAATAGATTCAATTAAAACAACAGGACTTTACCATCA CAGA

TAGATATAGGACCACATTTAGTGGTTTTAATTGTATCTATCTATTGTTTTCCACTCC AATT

ATAAACTCATATGGACTCTAAATCTTGAATTTACGAGTCCTCACATTATTATATTTT TATTT

TTAAATTTTGAACTCACCCTCCTCTGCTCCATGGATCTTGCATTCCGTTAGAGAATA TCA

TATTACTGCCAAATCTCTTAAGAACTTGCTCAATTCTCTGAAATATAAACATATTTT TTTTA

ATTTCTTATTTTCGTGAATAAAAAAATATGTAAAGAGAATTTTCATTTGTTAACAAA AAAA

GAATGTAAATTAGAAAACTTACATAGCCACCAAATTCAGTAGTGATCCAATGTGGTC GA

GGCTCTACTCCATATTTCTTTTTGCAATCTTCTTTGAATTCCTTGTAACTATAGGAA GATG

GAGGAAACATGCTTTCATTTGAACAAGTCATTGGCATAACCATCTCTGTACATGCCT TAC

TCAAAAACCATGGAAAGAAACACACTAAAATTAGGCATACATTAGACCAAGATTTCA AGA

ATCATTTAATTTTAATCACACCATAACCAAACTAAAGTTTAATATAAGTACCAGTGG CGAA

TCCAACAATGCATTTACGGATTCGATCGAACTTAGTATTTACAGTATAGAAAAATTT GTAT

ATACGACAACAACATATCAAGCCCACCAAGCAGGGGCGTACGCATGAATTTTTGTAAGT

GATGTCAAAATTTATAGAAGAACGGGAATTATAACTTTAATTATCATACTTCTAGAC AAGA

AATAACTTTAATCTATTGGTTTTGTTGAGTCTTAAGTTAGAAAAAGTCCAGAATTAA ATTC

TACTTCATGACAGTTTTAACCACATAGATTCTCTAAAATATTTGCAAAAAAAAAAAA AAAA

AAAAAAAAAAAAAAAAAAAAAAAAG GAAAG CAAAAG AAG CCTGTCAGTTCTC AAATTCAC

AACTTACAATTGAACAGAAACAACCTTCTTTTTTCGTNAGCAAAATCAAGCACATAA CCA

ATGCATCAAGAAATAATTTATGTCAAGTGGTGTCATTTTTTTCTACTTATCTGCTTC TTCA

CAGTATTAACACATATATTTAGTAAAAAAATTCGACGAAGCGGTGTCGCGTGACACC GC

TTCGATACATCTGCATCCGCCCCTACCACCAAGTGGGATTTGGGGGGTGGGTAGGAC A

TATGCAAACATTATCCCTACTTTTGTGAAGGAAAAAATATTATTTCCGGTAGACCCC CAA

CTCAAAGAAAGATAAAAAGAGATAGAAACAACAAATAGTAACAATCACAAGATAAGG AG

ATAAGGTGATAGAATAATGAGAGATAGAATAACATTTAATAGAAAAATCTGCGAATA AAA

AGCTACAAAAATACCATACTAAAAAAATACGTATAGATAGAGTTAGATATATTTTTCAAAA

TCTACACGTTTAAATTCTGAATCTGCCACTAATAAGTACAAAGAATAATAAATGGAA TAG

AATAAAAGGTAATTAAGGTAAAGAATTCTAAACCTGCCAATCCCAACCACGAAGACC AT

GAGCATCATCACCACCTTCTAAATTGAAACATTTTTCTCTTTTTGTATAATTGTAAT ATAA

ACTTGCCGCAGCAAATGCCCGGCTGATTTTGGAAGCTCCTTTTGGTAATCCATCAAT TA

TCTTGCACATCTTAATTTAAAAAAAAAAAAAAGTTTAAGAATTTTTTCACTCGACAA TATA

ATTTTTTTACACAATTAGTCATTTATAAGATAATTACAAATAAATTTTTATGATAAG TATTA

ATTGATGAACTGATAAAAATGATAAATAACCTGTTATAATATGTTAAACTACACTAA TCAT

GTACAAAATCTCCACATTGTCATTGCATTAAGTCTTATGTACAAATGGTATAAAAAA TATT

TATGCCAATCAGATAAATTATAAAGTACCTACAAGTAACCTTTGTGATAAACATTAA TTGG

TAAATGGCTAAAAATTGTATCTAGCATGTCATATCAGATTAAACTATACCCACTTTT GTAA

AAATATTTATCCTGTCAATTAGGGGCGGACCTACGTGGGGAGGGGTCACTAGACCCC G

TCAGTCTCGACAAAAAAACTGTATATAAATTTCATATATATCTATATATACAGTAAA GACG

CCTTAAATACTTTATGCGCCCCCCTAAAAGCACAAAAACTGGACAGAGGCACTGGTT TG

TAGGAGTGCTTAATCCAAGTTCGAATCTATGCTCCTACAACTTATATTTTTTATTTT ATTT

TTAAAGTGGTGTCGCCGTAATACTCAAATCCCAGGTCCGCATCTGTTATCAATGTAT ATC

AAGTTCAACAAAAGTTGATCAGGGGCGAAGCTATATATCCCAAAGGGTAGTCAACTG AC

CACCCTTCCTC GAAAAATTTACTTTG CGTATATAG GTAACATATTAG GTTTTAG AG GTAT ATAACATATATGAATACTCTTTATTAGAGAATTTTTTCCACTTCTTTAAGTTTGAACACC C

TTGAGCCTTGAGAAAATTACTGATTTCGCCACTGATGCTGATAGAAAAAAAGGAAAT GTA

ATTAAATGCAATGATTTTGAAACCAATTTAGTGCTAATTAAGCAGAATTGTCTACCT CTTG

TACTGGATATGCTGGCAATGGCATCATAAAGTTGGCTTTAGTAGGATAATTCACCAT TG

CTGTATATACAAAAGCTTCCCATAGCCAATCTCTAGCTGAATAAACTGAATGTAAAC CCC

TGAAATATAATATAAAACAAATCATTTTATTGAATCAAACTTGACCTTGTAATTTGC TACA

ATAAAAAATTATCATGTTTCTATTTTTATATTCAAATATACTTACTTGCAAGTTCTG AAAAG

TTTACTAACTTCAGTCAAGCCTTCTTCATGTTTTGATAAAGCATCCAACTCTGTCCA ACTT

CCCTTTATCACTCTATAACAATTCAAGCTTACCTCCTATTCGTGAGGGCAAATAAAT AAA

TAATAATAAGGTCTCTTATCCAAGAAAGATTTTTATGAATATATCCTATTCGAGATA AAAG

ATAATTACAGTAATCGCTCATAATAAAGTGAGATTAGTAATCTGCAAATAAGTCAAG TTA

CTTGTTACAACATGTTGAAAATATTGAGTGTAAAAATTTATTTACATTGTCAATAAC GGTG

GATCCTGATGGGTTCATCCTCTAGAGTTCAAATAATTTAAGGGTTTGTTTGGCCATG ATT

TTTTTTTTTACTTTTTTTTTGGAATCAGTGTTTGGCGATGAAAAATTCTAACATTTG AATTT

CTAAATTTTTTCGAATTTGAAAAACTTCAAAAAACTATTTTTCAAGATTTTCACTTC AAAAC

ACTTAAAAAAATTTAAAAACAACCCCAAATTATATTCATGTCCAAACACAATTCTAA TTTT

AAAATACCATTTTCAACTTGAAAACAAAAATTACTTGTTTAAGGAATTTCACAATTC TTAT

GTCCAAACACCCACAAATTTGTTGCGCTTTTAGAAGTAGAGTTTCAAAATTTAATAT TTAT

TAAAATTTACAATCTTTTTGCATTTCTTTATAAACCTAGTAAATAAGGCGATATTCG TCCC

AGTGTATATAAATTAAATCCTATTATAAAAGGAGCATCTAGGTGCATGAATGTGACC ATG

TAGGATCAGAACATTTTAAACAAAGGGGCTTAATAGATTGTAAAGGCTTTACTATTA CTA

ACGACAAGAGCGGGTTGCTCCAGTGGTGAGCACCCTCCACATTCAATCAAGAGGTTG T

GAATTCGAGTCACCCCAAAAGCAAGGTTAGGAGTTCTTCGAGGAAGGGAGCCGAGGG T

CTATCGGAAACAACCTCTCTATCCCAAGATAAAAGTAAAGTTTACGTACACACTACC CTC

CTCAGACCCTACTAGTAAAATTTTACCTGGTTATTATTGTTGTTGCTGCTTTACTAT TACT

AACCTTG AAATCTTG GG AAACAG CATCATAAAAG CTTGACCATGG G GTGATTTTGTCAA

ACTGC AAGATTG GTG CTG AAG ATG CCACTG CACCTATTG CTATATGTG G GTACTTCAAT

CTAAACCAAGAAGCCAACACTGGTTGACCAGAAAAAACAAAGTTAAAAGAGGAAAGG AA

AACAAGTATTTAAGAGTGAGAGCAATTTATTAATTATTGGATACTACTCGAGAAGAA TGA

ACCTAAATAGCCGTTCACTCAACTGCTTAAACTAAAATAGCTGACGGATGTATAAAA TAG

GAAAATTTTATTTTGTGTCTCAGACAACTTACACTAGTTGTATGAAGCATTTTTTTT ATCT

CACAGTTTTGTGGACCCGAATTTTTGTGGACCTAGAAGTGTAAGATTGGGGTCCACA AA

TTTATGAGACAACAAAGAGCCTCGCACAACTAATGTCAGTTGTGTGAGGCACACAAT AA

AACTTCTCTATAAAATATATAATTCATACTCTTATATAATACACCATCCGGTCCACT TTCA

TTGATTTTTTGACTCTTTTCACATATATTAGAAAATCACATTTTAGCATTAATTCAC AATGA

AATTGACCATATTAACCTTATTTTGTTCCTTGAAAATATAACAAATGCTCCTATGCT CTTT

ACTTCAAATGCAACTTTAAAAAAAAAATTAACTTATTCTTAATATCTGGAAAAAATC AAAT

ATTGTGGACCACAAAAAAAATTAAAAATTCAATTAAAATTGACCGGAGAGAGTATAT GCA

TAACTATGTATAATCTATATATATCGGCTAGAAACAAACAGTAAATTGAACTGGCTA TTTG

TGTAAAGATTCCTACTTAACAAATGCAAAAGTGGAAGAAAAGTTCTGTTTATTTGAA TAAT

TGAATGCATCTAATGCTAATGCTAAATTCATACAAAAGAGAACTTTCCATGAACATT TAG

CAACCATAGAATGTAATTATCATTGATTCACATGGATTGGACACTCAATAAGTCAAT ATG

TCCACACATGTAATGTCATGTCATTTCCATCTATCATTATGTCAAGGCAAAAAATTA GCT

AAAAGTTAAAACTTTTTCACTTATATTATTACTTTTCTTTCATTACTTTTTTTTTGT TTGTTT

GTGTGGTGTTCTACTATATTAGTGGCAGTTTGGACATAAGAATTGTAAAATTTCAAA AAA

AAAAAAATTAACAAAATTTAAGTAAAAATAATATTTGAAAATTAGAGTTGTATTAGA ATAT

GAACATAATTTAAAGCTGCTTTTGATTTTTTTTATGAATGATTTGAAATGAAAATTT TGAAA

AACAGCTTTTTGAAGTTTTTAAAATTTTCGAAAAATTCCAAAATTCAACTTCAAGTG AAAT

TTAAAATTTGCATGGCCAAACACTGATTTCGGGAAAAGTGAATGTTTTTTATGGCCA AAC

AGTTCCTTACTTACTTCCTCCATAAGAGCCACCAAAAACCACAACCGGTGATGATTC AG

AAG AAAGATTCTGCTTTAAACTCCTTATTAGAAC AG CATAATCAG CCAATG CTTG CTGTG AATTCAAGTATCCCAAAGTCTTTGGTGACTTGTAAGATTTCTTTCCAAATGGCATTGAAT

CCCCATAAAACCTATGCTAAATTATTACAATACAAAAAACCATTATCAATTTCATTCCCAA

CAAAGATAAATAATAATAATAATAATAAAATATAAAAAAGGTTCAATTTTACCTTTC TATAT

GAAGTATCTTAATATTACAATTCATTATACTTTGGGCCACTAATATCTTATTTTTGG AAAA

AATTCTTGTATTTGTCTTGATTCTAACGAAGTTCCAACTTGAAGTATAATAGATGGT AATT

TTAAATCATAGTGAATAGCTGGATAAATTTGGATTTTTTCTAGTAGTATTTTGATAC GTAG

AATCTACCAAATCAATATTGGAGTTTCATTAAACGTAGTATAAATACGATTCGATTT AATA

ACGGCAAGAATATAAATAATCCCTTAAATAAAACGAAGTGTAAAACTAAAATACTTC GTA

TAGTACAACAACAATAAATTCAGTGTAATTTCACATGTGAAGTTTGGGGAGGATAGT GTG

TACGTAGATCTTACACATATCTTGGGAAGATAAAGAAGTTGTTTTCGATAGACCCTC GG

CTCAACGAATAGTGAAAACAAAGTAACAAACAGTAGCAACAACAACATAATATGAAC AAA

AGGCAAAATACTTCGTATAGTATAGGAGTAAATTTAAATATTTTTCTCAAAAATAAA TACT

TCAAATAAAAAAACATTTCAAGATTATATACATACTTCAATGAAGACTAGAAGAGCA TGA

AACTTAGGAGCAATATCAAGCATAAATCCAGTATTTGCAGCAAACCAATCAATATTT CCT

TCATTTCCAGTGTAGACAAAGATAGGGCCTCCTTGTTTCCAATAATTATCATTTATG AGA

TATTTCTGTTTAAAAACTTTAGAACTCTTTGGTAGAAAAGTGAAATGGTCAAGAATT TGA

GGAAAGTAATGGACTTTAAATGGTATTTTTGACTTGACATGTTGTTTTTCTAATGAA GATT

GATAAGTTCCAGGTAGATAAATTGGCTTAATTTCTCCAACTACAAAAGAGATAATAA ACA

GTAAAATCAAGAAAATGAAAGAAAAATAAGAAGAAGAAAAAGCCAT

SEQ 56

ATGTCTCGTTTCTCACTCCTATTGGCTCTCGTCGTCGCCGGTGGCCTTTTCGCCTCCGC

ACTCGCCGGACCGGCGACCTTTGCCGATGAGAATCCGATCAGACAAGTCGTTTCTGA C

GGTTTACATGAGCTGGAGAACGCAATTCTCCAAGTCGTCGGCAAGACCCGCCATGCT C

TCTCCTTCGCTCGCTTTGCTCACAGGTACGATGATCTCTACATGGAAATGAGATTTT TTT

TTGTTATTTGCTTATTAATAGTAATTGTTTTATTTTGAGTTTAAGTTCTATATATGC AGTGA

GCATATTTTTTTTTTTACATAAATAAGATAATAACAAATAAATCACTTAATATGTAT TAGTT

GGTAATGATAGTGTAAAAAAATATTATACTGTAATGTGTATATAACTTAAATCTTTT TATTT

TTGGGACGATATTTAAGGTATGGGAAGAGGTACGAGTCAGTTGAGGAGATAAAGCAA A

G GTTCGAG GTATTTTTG GACAATTTG AAG ATG ATTC GATCG CACAACAAG AAAG G ACTA

TCATACAAACTCGGTGTCAATGGTATAATTAATATTATGGCATAACGCTAAGGCCCT GCT

CTTTTCCTTTTTTCTCTTTTGCTTAAGTGGAGTCTTAATTTGTTGATTTGGAGGTAA CAAG

TTATAGTTTTGTGGTTCCTTTACCGGAATACTCTTTGTTTTTATCTTCAGCTAAGGT AACA

GATTAAGGCGTAATTATAGTTATTATTGTAAAATAAGGTAATTTTTATTTAGAAGCT TCAA

AATTAAGTACAAGCAATTGAATACTACTCTTTGTAAGTAGACTTTGTATATATGTTT TTAT

TTCATTCTCTTTTTTCATTTGGAGAGATGTGGACAAATTAAAATTATAATATAATGC GATA

AAACATATGTTCCACTACAGTATCAGTATGGTATTTATAGTTTGCATATTTTATTAG TAAT

TAATTGGTCTAGTGCCTTATCATGTGTAGATATTTCATTCATATTGTGTGGCTAGTG GGT

ACCCTTTCTCTCTCCAATCAAAAAACTTTTTTTAAAAGCTCAATTCAAAAGCTTTTC TTCA

TTACAACTGATCCTGCTTAAAGACTAAAAACAATCTAAATTGAATTCTTAATTCTTC TCTA

TTCATTCATATATGGACATAAAAACAAAATCACAGTACATGGAAAGAATATAAGCAC CTA

AGCATTGGACTGCCCAAATGAAAAGTTTTTGCAACTTAATCTAGTTGTGCATAGATT CAA

CAACAAAAAGTAAAGAAATAAGTATGCATTTTATGCTTCTAAGTTCTAGTATATATG GCC

CTTATTGTTTATCGATTATTATGTTTCATGACAGAGTTTACCGACCTAACATGGGAC GAG

TTCCGGAGAGACAGGTTGGGGGCAGCTCAAAACTGTTCAGCCACCACAAAGGGCAAT C

TCAAAGTCACTAACGTTGTTCTGCCGGAGACGGTATATGCACTCAGAACTCCTCTGT AT

CTATTTCTGGAGTTAGTGATCATTAGAGTTAAACTACTTTCTGATGATTTATTATTT CCAG

AATTGTGGAGTGCTCTGAGTTTAATTATGCTGTAACTATAGAAACACTAACTAAAAA GAT

CTTGAATAGGTATCCTACAACAATAAATAGAATCCTCATAAGAAATACCACTAGATC GAG CACCAGTCATGATTTCATATCTGGTAAAAATCTTGGCTAATTGATCGAAGTGGAGTAGAC

TAGC GAG CATGTACTGAGCTAATG CACAATTG GTTG CAAAAAGAAGTTTTTTCTTTCCTA

ACCGAAATTTCCAATTTCGTAATTATAGAAAGACTGGCGGGAAGCTGGGATTGTCAG CC

CAGTCAAGAAC CAG G GCAAGTG CG GATCTTG CTGG ACATTC AG GTAAGAATTAGTTAG

AATCTCACATCATTGGACTCTTAAATTGTAAGTCTTGAAATTGCACTCTTAAGCTGA AATA

TAACGGAGAAGGCACTTGGCAGCACTACTGGTGCACTAGAAGCAGCATATAGCCAAGC

ATTTGGGAAGGGAATCTCTCTATCTGAGCAGCAGCTTGTGGACTGTGCTGGAGCTTT TA

ATAACTTTGGCTGCAATGGTGGGCTCCCATCACAAGCCTTTGAGTATATTAAATCCA AT

G GTG GTCTTGACACTGAAGAAG CATATCC ATACACTG G CAAG AATG GCTTATGTAAATT

CTCATCAGAAAATGTTGGTGTCAAAGTCATCGATTCCGTCAATATTACCCTGGTATG ATA

TCTCTTTCCTCCAGTATGCAACCAATCTTTGCCAGTGTTAATATCCAACCTTAATGG TCA

ATAAGGATTGGTTAAGTTCCTTACATACGTGTCATTACAGGGTGCTGAAGATGAACT AAA

ATACGCGGTTGCATTGGTTAGGCCCGTTAGTATAGCTTTTGAGGTGATAAAAGGTTT CA

AACAATACAAGAGTGGTGTTTACACCAGCACCGAATGCGGCAACACTCCCATGGTAA GT

CATCTGTCCCTAGGAACGTGATATGCAAATATATTGACATAGTTACCTAAATACAGG GG

AAAGCTACAGCCGACCAAGGGTCGTCAGTTGAACACCCTTCACTTCACTGTCGTGCA TA

TATTAAATCTTGAACACCCTAAGTGAAATTTATAACTTCGCTAAATAGGCATATACA CAAT

ATTACAAACATTGTGTGTTGCATTGGCAGGATGTAAACCATGCTGTTCTTGCTGTGG GT

TACGGTGTTGAAAATGGTGTTCCCTATTGGCTCATCAAGAATTCATGGGGAGCAGAT TG

GGGTGACAATGGATACTTCAAAATGGAGATGGGAAAGAACATGTGTGGTATTGCCAC TT

GCGCATCCTACCCTGTCGTTGCC

SEQ 57

ATGGAGAAGGAACACAAATACTCTTTGTTTCTCACAAAGTTGAAGTTGTTTTTTCTTGTT A

CATTAAGTACTTTCCATGGCCTTAGCCATGGCTTCCAAATGGATCAGGCACGTACAT TA

ATGTCTTGGCGTCGTTCTAAAATGCATGCTCAGACAACTACTTATGCTACTAATGAG GAT

GAGACAGAAAACTTAGTATTTTCCGATGAAAAACATGTCGGAAATATGGAGGATGAT CT

TATTAAAGATGGTCTTCCAGCGCAGCCTTCAAATGTGATGTTTAAGCAATATGCAGG ATA

TGTTAATGTTGATGTAAAGAATGGAAGAAGCCTTTTCTATTACTTTGCTGAAGCTTC TTC

TGGAAATGCTTCTTCAAAACCTCTTGTTCTTTGGCTAAATGGAGGTAAATTATATGT GTT

GATGATTCTTTCTCAACTTAATTTTGTCTTACTAATTACTCATCTTCTCTTAATTCT TTTGT

CATGCACCTAATTTGATTAAGTACTCATTTGTTTTGTTTCGATTTAATCTAACTTAC CCTT

TATGCACATATATTCTGCATCAAATTAAGTTGAATATTACTCCACGTGTTCCACATT ATAT

ACTTAACATTTTTTTTTTCCAATCTATTTTACACATTTTATATATTTGAATACTTTT TTAACT

TTAGACATTTCAATTTACCCTTAATAGTATATTCTTGTAGCCGATCAAATATCTATG AGAT

ATTTTTGAAGTCTTTTTTCTTCCTAAATCAAGTCAAATGTTAATGTATAAAATAAAG CAGA

GGGAGCAATAACTTTCGTTTTATGTTTGTAATTTTTCTTAATAGTGATACTCATTAC TCTC

CCCGGTCCACAATAAGTGACTATTTTACTTTTTTATTTTGGTCAAAAATAAGTATCC ATTT

ACCTAATCAATAAGGAATTAATTTTATTTTTCTAAAATTTACCCTTATTTACATATT CCAAC

GTGTCAAGGAAATAATTAATTAAGGTTAATTTAGTGAATATATTTTTTTTCTCTAAG AGTT

CGTATTTCTTTAATGGATGTGCCAACTATAAAATGGTCACTTATTAGGGACCAGAAG AGT

AACTCTTTATTATTTGAAATTTTG ATTTC CCAAAAG GTATAAATG GTCC GAG GAAC ATCTA

ATTTTCGCTCATTTG CAAG AAAGTG GTTCATAGACAAATG GAGTT ATT AAGTG GG GACG

CGCAACAGAGAATTATTGGTCACTTTATTCGTTTTGTTCGCTCTTTTTCTTTCTTTA CTTC

TTTACTAAAGTAGAAGAGAAAAAAAGGAAGTTTAAAAAATTGTTAGTGTATATGAAG ATA

AGAG CTGTCATTTTCTTCG G CTATTG AATG ACGAATAAAAG CACAATTGG GTACAG GTC

CAGGATGTTCATCATTAGGATTCGGGGCCATGCTAGAGCTTGGGCCTTTTGGTGTAA AC

CCTGATGGTAAAACCCTTTATTCCAGAAGATTTGCATGGAACAAAGGTACATTTCAT TTG

CTAAACTAATATAGACCTACTTATAATTAATGAAACTAATTTCTCAAGAAATAAGAC AACT ATTTTTTGTAATAACGTATACTCTATCTTTTTCAATTTATGTGACAACATTATGTTAAAG GT

GTCACGTGAGTAGGAAGCCAGCTGACACTTAGTAGGCAAAGAGTCTGTTAGATTAGT TG

TTAATTATACAATTAGAAATTAGTTAGAATCAGTTGGATTACATTGTATATGTATGT GTAT

AGACGGTTATTCAATACAACAGTAATTTTCTCATCTTCTCTTTTCCTCTCTAAGCTG CGAT

CTCTCTTAG CTCAATCTAG AAGC ATCCACG AC AG ATGTTTG G CATG GTATCAG AG CTTT

GTGCGATCATTGCTCTCGTCTAATTCTCCTCTGAGTTCATGTGAACGAAACTTCAAC TCG

TTCGTCTTCATCTCCTTCCCTCGAGAACATCGAATCGACGATGACGACGGAAAAAAT TG

ACCACATTCATCCTCTGTTTGTGCATCCCTCAGATACTCCAAGTTTCATGTTGATTC CAG

TCCAACTCACTG G ATCTG AGAATTACG AATTATGG CG G AG ATC GATG AAAATTG CACTT

AGGCAAAACGAAAGTTAGGGTTCGTCAATGGCACACCCACTAAGGATCAGTTTAGGT CA

GAG CTACATG AAG ACTG G G AG ACATGTAATG CGATTGTGCTCTC GTG G ATTATGAACAC

AGTATCTCCAAATTTACGTACTTAGTGGAATTGTGTATGCTTTTAATGCTCACCTAG TAT

GAGAAGATCTAAAGGAGATGTTTGATAAGGTGAATACGATGAGGATCTTTCAATTTC ATA

GAGAAATTGCTACAATTTTCCAACGAACAGATTCAGTGTCCATGTATTTTATAAAAT TGA

AGGAGCTCTGGCTGAGTATGATGCAATGGTACCCTCAACAAATTCGAAGTAGTATGC TG

ATCATCTTCAGCGGCAGAGGCTATTACAATTTCTAAGTGGACTGAATGATTCCTATG CTC

AAG CTAGAAG ACAG ATTCTAATG AAATC AGTAGAAC CTACTTTG AATCG G CCTTATG CTC

CAATTGTTG AAGACG GAAGTCAAATG AGTACATCG G GAACTTTATCACAC ATTG GG CTG

AACTCAATAGCCGAGGAAAATGACATTACAACATTGTGGAGCTCAGCAGTAAAATGA GG

TTCAATCAAGAAGAACAAAAGGAATTACAGTATATTTTGTGAACATTGCAAGATGAA AGG

ACATAGTAAAGAAAATTAGTACCAGCTCATTGGTTATCCGACAGACTTTAAAGATAG AAG

AAAACAAG GAG CACCTACTGGTTAC CAATGAG CACCTATTG GTC ACCAAG G AACAATTG

AGG AAG ATG CAATG CAAG G GAAACAG GTATG ACTGTAGATTTTG GG AATCTTTATG CAG

GCATATCATATGGGGACAACAACATATGCAGGTGCAAAGGCAAGGGACTCATAATCC T

GTACACATGGAAGATGCTCAATCTCAGGGACAATCTTAGGGATATACAGGTGGTGTC AC

TGTTATATTTACTCCGGAACAGTATAGTCAAATCTTACAAATGCTCAACAAAGATTA TGTT

CCAGAAACATCAGCTAATATGGCAGGTACTATTTGTTCTTTTCTGGCTAGTAAAACC GG

GCACAATTGGATAATGGACATAGGAGCAACAGATCATATGGTATCTACTCCTTAAAT GTT

ATTTGATTTGAATGACTATG CTAAG CAAG GCTCACTGTTG CATTTACCTG ATG GAAAAAG

TTGCCTATTAGTTATGTTGGTAAATGTAGATTGGCACAAGGGGACATCAGGGATGTG TT

GTGTGTACCAGACTTCAAGTTTAACTTGTTGTCAGTGGCTAAACTAACTAGAGAATG CA

GTGTTTCATGTCTTTCTATCTTGATTTTTTTCTGATGCAGGACCTTCACATTGGGAA GGT

GAAAGGGACTGATAGAATGCACAATGACTTGTACTATTGGAGAAATAATATAGAGAA TA

AGATAC CACAATCATTG G CTACTACTTTG ACTCAATCTG CAG CATTGTG G CAT AAG AG G

TTG GG G CATGTTCATCATAG AATACTACAACAAATGAACTTTTTTAAAGATATC AAGACA

AATACTGGCAGAACTTGTTCTATATGTCCTTTAGCTAAGCAAACTAGGCTTTCTTTT CCT

CAAAGTACTAGTGGAACTACTACACTGTTTGAGCTAGTTCATGGTGATGTATGGGGT CC

ATACAATGTACCTACATATGGTGGTCATAGATTCTTTCTTACACTTGTAGACGATTG TAG

CAG GATG GTCTG G GTTTTCTTGTTAAG GTTGAAGAGTG ATGTCTCATTTGTATTAAAAGA

TTTTATGTCATTAATAAAGACACAGTTTTATAGTTCAATCAAGGTTTTCAGAAGTGA TAAT

GGTACAGAGTTATTTAACTCACATTGTATAGATTTGTTCAGTGGTGCATGAATTGTA CAT

CAAAACTCATGTGTTCATACTCCACAGCAGAATGAAGTTGTTGAACGAAAGCACATA TAA

TTTTTTG AG GTAGG AAG AGGTTTCAG G GTTG CATTCCTCTAACTTTCTGG G GATTATGT

GTTCAGAATGCTGCGTATCTGATTAACAGGATTCCATCCACTACTGTGGCAAGAAAG TC

ACCATTTGAGGCATTCTATAGGAGGAGTCCTAACCTACAACACCTAAGGGGTGCTTA TG

TTATGCCATAAGTGTGGGTGCCAAAAGTGACAAATTTGGAGCAAAAGCAATCCCAAC AG

TGCATATGGGATACTCTACCACTCAGAAAGGCTATAGGTTGTATAACACAGCCAATA AA

CTGATCTTTGTCAGCAGGGATGTTTCATTTAGAGAAGATATATTTCCCTTCAAGTCC TCC

TACTATCAACTTAGACCACCTAATCTTGTGGAGTATTGGAATGGTCGCCATGATCCC TTT

GTTCTTGAAACTACTATTGATGCAGCTCCATTGGAGACTTCATCTATAGTTGAGCCA GTC

TTTGTCCCCTCTAGTCCTTCTATTCCTACTTCTTTGAATTTAGGAGACTCTACAGCT GGT GTCTCTGAGAATGCTACTACTGTATCAGTCCCTGCTGCTAGTACTGATTCTCTCATTCTT

AGTAAGGCTCCTTATGATAATGTAGCAGATATTACTGTTGCTCCAGATTCTTGAGAG CTT

ACAGTCACAAGAAAGTCATGCAGAACCTCCAAGACTCCTAGTTGGCTTAGTGACTAT GT

TCATAAGGGGTCCAAGCCTCTATCACATGCTGTAATGGGCACAAGTTATCCTTTATC AG

TATATATGTCATATCCTTCACTTTCAGACCCCTATTACAAGGTCATTTATAGCATCT CATC

TGTGAGGGAGCCTGATACTCATGAAGAAGCTCTTTATGATCCACAGTGGGTAGTAGC TA

TG CAACAAG AACTG CAAG CCTTTCAAG ACAATCACACTTG ACAG CTG GTTAATATACCT

CCTGAAAAGAGAGTCATTGGTTGTAAATGAGTATTCAAAGTCAAATACAATGCTAAA GGT

GAGGTGGATAGATACAAAGTTCGTTTGGTAGCCAAGGGATATACTCAGCAGGAGGGG T

TG GATTACTAAGAG ACTTTTTCTCCTGTG G ATAAGATG GTCACTGTGAG GACTATCTTAT

CCTTGGCTGCAATGCATGGTTGAAGGTTGCATCAAATGGATATATTCAATGCATTCC TC

CAGGGTGATCTTGTAGAGGATGTTTACATGGTTCTACCTCCTGGTCTTCTAGGACAT GG

GGGGGAGGGNNNNNAGGGGGGGGGGATGTAGGAGAGTATGCAAGCTACATAAGTCTA

TGTGTGGCTTGAAACAAGCCTCTCGACAGTGAAATCTTAAGCTTTGTGAGGCACTTC TC

TCCTCAGGCTTTATTCTAAGTCATCATGACTAGTCCCTCTTCACTCAAAGATCAGGG AAT

GAGCTGTTCCTCATCCTAGTTTATGTGGATAACCTCCTCATCACATGTTCTTCTCCT TCT

CTCATTCATGCAG CTAACTCATGCTC CATCAG CATTTCAAGATCAAG GATCTG G GG G AG

ATGAGATACTTTCTTGGTCTTGAAATTGCAAAGAGCACAATGGAATACTAGTATGTT AAA

G AAAGTTTG CACTAG CTCCTTATTGCAG ACTTATG AGTGG CTG CTTCTAAG CCTACTAG

CATACCTACGGAGGTCAATCAAAGGTTCACTAGTGAATAATTTGATCACAACTATAA GAC

TGAGGGCAATACTGATGAGTTGTTGTCCGATCCTACTGGCTATCAGAAACTAGTAGG GA

AGCTGCTATACCTAACAATGACTCGACCAGATATAAGATACACAGTGCAGAACCTGA GT

CAATTTATG CATAAACCAAAG AG ATC ACACGTGG AAG G GG CTCTAAG G GTG GTGAAGT

ACTTAAAGAATGCACCTGGTTTGGGCATCTTGTTACCTTCTAAGCCATCCTCACAAC TTA

CAGTCTACCGTGATGCAGACTAGGCCAATTGTCCCATGACAAGAAGGTTAGTT AGTGG

CTTCATAGTCAAGCTGGGAGACTCCTTGATTTCTTGAAAATCAAAGAAGCAAAGTAC AG

TGTCAAGAAGTTCAGCAGAGGCATAATACAGAAGTATGGCCAATGCAATTGCAGAAA TA

GTTTGGCTCATTAGACTGTGTGAGGAACTGAAGGTGAAGCTGGAGTTGCCTGTTAAA CT

ATATTGTGATAGCAAGGGAGCACTTCAAATTGCTGCTAATCCTATCTATCATGAACG AAC

GAAGCACATAGAAATCGACCGTCACTTCATTAGGGAAAAGATACATGAGGGCATTAT AC

ACACAGAACATGTGTCCACAAGTTTGCAGCTGGCAGATATTCTAACTAAAGGTTTAG GA

AAGGCGCAATATGACTTCCTATTATCCAAGCTAGGAATGTTCAATTTGTTCATATTA CAT

AGCTTGAGGGGGAGTGTTAAAGGTGTCACATGAGTAGGAAGCCAGATGACACTTAGC T

GGTAAAGAGTGTTAGATTAGTTGCTAATTATACAATTAGAAGTTAGTTAGAGTCAGT TGG

ATTACACTTTATATGTATGTGTATAGACGGTTATTCAATACAATAGTGAAAATAATT TTCT

CATCTTCTCTTTTCCTCTCTAAGTTGCGATCTCTCTTAGCTTAATCTAGAAGCATCC ATG

ACAG ATGTTTG ACACATTACTATTTG G G GAG CCAAAGAG GTTCTTCTTTATCATGTGTTT

TCTTAAATGTTTTATAAATATTTTGAATTATAATTTTTTTTATGAATTATAGTACTT TTTATG

TAAAAAAAATGAATTTGTATCTAAATTTACGGTGTAAAGTAAGCTAGCGTTTGGCCA TAG

ATTCCCAAATTTGTTCTG AAAAATCTG ATTTG G GTG AAGTTTG GTTTG G AG ATG AAAATG

CGTTTGGACATCAGTTTTCAAAACATATTTCCCAAATTTATTTTGGAAAAACATGAA ATAT

GATTTATACCCACAAGTTCTAAAAACTATCACAAATACCCAACAGTACCATTATCAA TAA

CATTCATTAAAAAACTTTGATTCTCGTAAAAACTTTGATTATCAATCACAAATATCC AAAT

TTATTTTGGCAAAATCTATGGTCAAACGGGTATTAAGAACTTTGATTCTCGTGCTAT GTA

CCTTGCCCGGAATGGAATTGTACGGGTATTAGAAAAATACATTGTGCTGCCAAATGC AT

TGTACAATAACTATAACTACTTATAGTTATTGTATGTTTTTTCTTTCTTTTATTTAT TTACAT

ATGTAATGATGGTATACAGTTGCGAATGTGATGTTTCTGGAGTCGCCGGCAGGGGTT G

GGTTCTCTTATTCCAACACTACCTCGGACTATTCAAAGTCAGGCGATAAGAGGACTG GT

ACACACCGAAAAATCTCGTTAATACAATAGTAATAATTGTCAGTTTCATTATTATTT TTTTA

AAACAATTTTAACAGTCAAAATGATGAAATTTTACTCTTTCATTTAACTCCTCAACT TCAA

TTTCAACTTCACATGCTCTATTCGTCAACACTCAACTCCAATCAAACATTGTGCAAA CAG TTATATTATTATCGTTTGTAGTCTGTAACTATTTTTTAATTTTTTTTAAAGACTACACTT GA

GTATCGTTAAAAACATGGTCAAATCTTTTGGTCACTTAAAGTGAGGCAGAGGTGGTG TC

TTTTCTTATCAAACGAGATTTTTCATTTTTTTATTTATCATTAATTCAGTTATATAT TTATTT

CTTTTCCTTAAACTATATTCTTTTTTATTGGGTGACAGCTGAAGATGCATATAGGTT TCTA

GTGAATTGGTTCAAGAGGTTTCCACATTACAAAGGCAGGGATTTCTACATCATGGGA GA

AAGCTATGCAGGTATCTAGTACAGTATCAGTAATTAACAAACGAAAAAATACAAAAC AAA

AACTTTTGATATTCTTGACTTATCCTTCTAGTAGTGAGAGCCCTCATTATTGAGTTC TGTC

CAATAAAATTTGTAAGAATTAAGGACCACTGTATCAGAGACAGCTTGTGCATATTTC AGA

CCATTCACAGAAATGTCCTCCTGTACAGTCTCAGCTCAAAGCCGAAATGAGCATTCT GA

TTGAAAATTTTGGCTATATTTGTAATACAATCTTATATAAGTAGTCTATATCTAAAG TCTAA

TATTGACATGACAGCTAATATTGCTGTGACTGCTCATCACAGGATTCTACGTACCAG AG

CTAGCAGATATCATTGTCAAGAGGAACATGTTGCCTACCACAAACTTCTACATCCAA TTC

AAAGGAATCATGGTATTATATCATTTAATTTGTTGACCTTTTAATTTGTTTGATCTC TCTG

TTATCAAATCTTACTTGTATACCTAGTGATGAGGGGCGGATTTAGGGGTGCAAGGGT GT

TCACCCGAATCCCTTCGCCGAAAAATTACACGGTATATATAAGAAAAAGTCTGATAT TTA

CCTTTATATATTATGTTTTGAATTTCCTTTACACAGCCCAAAAGTCTACTCTATGTC ATGA

CATAAATTATTTCTTTATATTG CAG ATAG G GAATG GTATAATG AATG ATG AAACAG ACGA

GAAAGGGACATTGGATTATTTATGGAGTCATGCACTAATCTCAGACGAGACTCATCG AG

GTCTCCTACAACACTGCAAAACGGAGACCGAAACATGCCAACATTTTCAGAACATAG CA

GAGGCTGAGTTGGGAAACGTCGATCCTTACAACATCTATGGTCCCCAATGTTCCATT AA

TTCAAAGAGCAGATCTTCTTCTCCGAAACTGAAGAATGGATATGATCCTTGCGAACA AC

AATACGTTCAGAATTATCTCAATCTTCCTCATGTGCAGAAGGCCTTGCATGCTAACC TCA

CTAACCTTCCTTATCTTTGGAACCCATGCAGGTAATCCAACTAAGTAAATATTATGT ATA

GCATATCGATTTAACTTATATATACCGATAGTATAAACAATTTTTACACTGTCGTTG TATA

TGTATTGATTTATATATATACACTGTTAGTGTAAAATGTGTTGTAACAAATAATCTA TGTT

ATTTTTCTATTTATAAATTAAATTCTACTTTTTATAAATAATAACTTGTACTTATCT TTTTGG

TCACCTGATAGAAACTCTTTTATATCATCCAATGTGTATTTAAATCTGTTGCGGCAA TGAT

ATTTCTTATTTTCAAGATTACAAAATCTCACTCTTTATGTTTAGTTTATGTCACTTT TAATA

TGTAGAAGGTAATTGAACTCATATAAAAAATAGTGTATATGATATGATATGATGATT TTTT

TTCTTTTTTTTTTTTCATTTGGTATGGTAGCAATTTGGATTGGAAGGATACTCCAGC AACC

ATGTTTCCGATATACAAGAGACTTATTGCATCTGGTCTACGTATACTTCTTTACAGG TAA

CTTTATTATGGGCTTATCTTAGACTTTGGTTTATGTTCATGATACAATATTTTTAAT TGTTC

GAATAAAGAACAAGTGGATTTGTATTGTTTGGAAACAGTGGAGATGTTGATGCAGTA GT

TTCAGTTACTTCAACTCGCTATAGCCTTAGTGCTATGAACCTTAAGGTGATCAAACC TTG

GCGTCCTTGGCTTGATGACACACAAGAAGTACGTTCTTCGAATATATTTTTTAATGA TAA

TTTTATATATTTGTGGTGAGAAATAAATCTTATTGTTTCGTTCTTTGTTTTTTTTTT ATAATT

TAAAGGTAGTTTGTATAATTTCTGCAGGTAGCTGGATATATGGTGGTTTATGATGGA TTA

GCTTTCGCAACAGTTAGGGGAGCAGGGCACCAAGTTCCACAATTTCAACCACGTCGA G

CTTTTG CTTTGTTG AATATGTTCTTTG CCAATCATTCT

SEQ 58

ATGGCTAATTCTTATACAAGTATTAATTTTTTCCTTGCCCCTATTATTTTCTTGGCGATT C

TGGGATTGCAGTTGCAGAGCAGCGATGGTTTTGGGACATTCGGGTTTGATATCCATC AC

CGGTATTCGGATCCGGTGAAGGGTATTTTGGACCTTCATGGATTGCCTGAGAAGGGC A

GTGTTGAGTATTATTCAGCTTGGACTCAGCGTGATCGCTTTATCAAGGGTCGCCGCC TT

GCTGAAGCTGATACAGCTAATTCCACTCCCCTCTCTTTTTCAGGAGGGAATGAAACT TT

CCGCCTCAGTTCTTTGGGATTGTAAGCTTCCCTCTATGCATTTTTCTGATTGCTTTT TGC

ACTTGTCTATATCTTTATTGTTTACTTTTTCTAGTCATATACATAGATTATATACTA ATTAT

ACATAATTATACATATATAATACAAAAATTATACCTTTTAAGTGGTTGGGTGGGCGG CTA TTTGGGTTAATTCTTCTTCTTTTTTTGTATGTGTGTTTTGTATCTGTGTTATTATTCCTG AT

TGTGAACTAGTACGTCTTTGGAAATTCTTGTTTACTGTCTTTTCCTTTTGTCTGTTT AGTG

TGATGTTAGAGTTGACTGAGCTTTACGTTTGTTTTTGTCTGTTTAGTTGGATGGCAG TTC

AGGAAAAATAGGTTACCTAACTTAAATAAGTTCCATGTGCCATTTTTAACGAGATTC AAG

TGGAGAAAATATGAAGAAGAAAAAGAATGATTTAGGCCTGTTCTGTTCTATAATTCT GTT

TGTGTGTTTGATTGGACTGGAATTTTGTCGATTTAACTACTACATAAAATACTGACT CTTA

ATTTGATTTTACTTTTCTTCTATTTCGAATTCCAAGCTCCGGAAATAAATTCCGTTC TTTTT

TCTGATTTTCCTCTCCTCTGCCGCCACTTAACTCCTCTCCAGACAAGGAATTGTTCT GAA

GTTTCTGGCAGTAGCATGTTGTAATTTATGTGTTATAAAGATAGAGTTGCAAAATCT GTA

GTATCTGTAGTTGTGATTTTTTCTTCTTAAGGTGTGTGACTAAATATTCTTTGGCAA TTTG

CAGTTTG CATTATG CAAATGTGACAGTG G GCACTCCTG G ACTATCATTTCTAGTG G CAC

TTGACACTGGCAGTGACTTGTTTTGGCTACCCTGTGATTGCAGCAATTGTGTGCGTG CC

CTCGAGACACGCTCTGGACGAGTATGTTTGCTTCATTCTAGTACCTTTTTCTTTCTA CTT

TCAAATGTTTAAAGAGTTTTTCTTTTTTTTGATCGTCATCCTCGTCTGTATATTGCC TTCT

GCTACAAGGAAGTTGTGCATACTTCTCTTCCTTTTGTAATTATGAGACTTTCTGATA ACC

TTTTTCAGAAAG G AACCTG CTGATAACACAATG G CTGAATCTGAAACACAGTG G ATTTCT

CTTCAACTGTCTTTTTCGGTCATTATGACAATAATATATTCTCTTAGTTAACAAGAT ATGG

G GTAG AG AATGTATTGAG G AAATTGTTTTTCTGTTAAG GAAG ATACATAACTAGCG CAAA

AAAGAAGATTTAAACATAATCAATATTTGCAAAGTGAGTCTGATGCATGTAATATAC TGA

CTCTGAAATGAAATTTCTGATCCATATTGTTCCGTGGCTTGTTTGTCCTTGAAGAAT TTT

GAGATTCTTACTAGCTCAAGTACTTCAACTTGTCACGACCCAAAAATCCCACCACAG GC

GTCGTGATGGCACCTAGTCTCTAAAACTAGGTAAGCCGATTTCAATTACATTTTTGG AG

CCATTTTTTTTTTAATTAAATAAGTAACCAAAACTAACAGCGGAACAAATATGAATG TACA

ATCTCCCAAGACTGGTAGTACTAAGTCACGAACTCTAACTGAATACATGGAATGATC AC

GAGGACCGAATATACAATACTGTTTGATTAAAAACTCCACAGGAGTTCACCTTGAAG AA

CAAAATTTTCTTTGCTCTTTTGCCTTTTCCTTTTAATGTTTCTGCATGTATTATTTG ACACT

TGTAATCTTTTGTTTGCTTTTGAAACAGCGAATAAATCTCAATATTTACAGCCCTAA TACG

TCGTCAACGGGTCAGATTGTTCCTTGCAACAGCACTCTGTGTGGACAAAGGAGACGA T

GCTTATCTTCACAAAATGCATGTGCTTATGGAGTTGCATATCTCTCCAATAACACCT CAT

CATCAG G GGTACTGGTG GAAG ACATCTTG CACTTAG AGACAGATAATG CTCAACAAAAA

AGTGTTGAGGCTCCAATTGCTCTGGGGTGGGTATGCTTTAGTTTTTTCTCTTTATCT TTG

GAAGAGATTATCTTTGGATCTTCTGATGCATTTCTTTATCCGCCATGATTTTTTATA TTCT

ACTTGTTCAATTTCAG GTGTG G GATAAG ACAAACTG GTGCATTTTTAAGTG GCG CAGCT

CCTAATGGTCTATTCGGACTTGGCTTGGAAAATATATCTGTTCCGAGCATGTTAGCA AG

TAAAGGTCTTGCTGCAAATTCTTTCTCCATGTGCTTTGGGCCTGATGGTATTGGAAG AAT

AGTCTTTG G AG ATAAAG G GAGTCCAG CCCAAG G AG AAACACCACTC AATCTTGATCAAC

TACAGTAAG CAAGTCACTTTG ATATTCTG G GTTTATCG GTTG CTTCTGTTTCTG G CTTGA

TTTAGGAGAATGCGACTGAATATTTATTAACTCTTACCCTTTCCTGAATTGCAGCCC AAC

TTATAACATCAGCTTGACAGGAATAACAGTGGGAAACAAGATCACTGATGTTGATTT CAC

AGCCATTTTTGACTCTGGCACTTCATTCACATACTTGAATGACCCAGCTTACAAAGT CAT

TACAGAGAACGTGAGCGACAAGCTGACTGTATGATTTTAAGTTGGAGTTTGTAACTT TG

TATTGTAAAACTGAAGATATTTTTTTTCTTTTTTCAGTTTGATTCTCAAGCAAAACA GCCA

CGTATTCAACCTGATGGCGAAATTCCTTTTGAATACTGCTACGGGCTAAGGTGAACC AT

CTTTTATAATCTTCATCATTTATTACTTTCTTGACGTCCTTTGAACTCTCAGGATTA ACAT

GCTACATACGCAGTGCAAATCAAACTACCTTCGAAGTTCCTGATGTAAATTTGACAA TGA

AAGGCGGCAACCAATTATTTCTTTTTGATCCGATAATAATGCTCTCGCTCCAGGTAA GAT

GGTTTCTGCTCCTTTTATATTACAAAAGTTCTCTTTTAGAATATCCTAATATCCAGT GATG

ATCATCAGGATCGTTCTGGCGCATATTGCTTAGCTGTTGTGAAAAGTGGGGATGTCA AC

ATCATTGGACGTAAGTATCTATCAGTTGCTTGCTCGTAAGATTTTGTTTCTATCCAT GGA

ATTCTGCAATATAACTTGCACCATGCCAGCTAATGATCTCACAATTACCAACTTTTA GAA

GTTTTGGTTCCTATCGAGTTTTTTACATACTTCTAGCTTATGTATAATTGGAAATGT GAAT GTGACAAAGTAAATTAGTAAAAACCAACTAGTAAAACTGGTTCCATTGTCAAAAGTCTGA

GCTATTTGTTGATTTACTTGGATTTTGTCTCTCTATTTGGAATTCATGACAGAAAAC TAAT

ACACGGATGTTTTTGCAGAAAATTTTATGACAGGCTATCGCGTGGTTTTCGATCGGG AG

AAGATGGTTTTGGGTTGGAAACCATCGGATTGTGAGTTCGCATTCCTGAGTATGACC TC

TTTAGTGTGCACACCTGCTCATATAATTTAACTATAAACCTTTCTTGGCAGGTTATG ATTC

TAGAGGATCCAACGACAAATCGACAACTCTGCCAGTGAACAAGCGTAATTCTACTGA AG

CGCCTTCGCCCTCCAGTGTGGTGCCAGAGGCCACCAAGGGAAATGGAAGTGGAAATG

AACCCGCTACTTCGTTTCCATCTGTTCAATCATCTAAACCTGCAGCAAACCAAGCAC CA

GCACATTTCATTTGCCAACTTATGATGGCTCTGTTTTCCCTTTTTAGCTATTATTTG ATCA

TTATTTCTTCA

SEQ 59

ATGGCGATTCATACTTCCACTCTCTCCATCTCCATACTTGTAATGCTCATGTTCTCCGTC

GTATCATCATCGGCGGCGGAGGACATGTCCATTATAAGCTACAACGAAAAACATCAC AC

GAACGGCGAGTCAACGGTCTGGCGAACAGACGATGAAGTCATGTCTTTATATGAATC TT

GGCTAGTTGAACATAAGAAAGTGTACAACGCCTTAGGAGAAAAGGACAAACGGTTTC AG

ATCTTTAAAGATAACCTTAGATACATCGATGAACATAACTCTGTGCCCGATAAAAGT TAC

AAGCTGGGTTTGACCCAGTTTGCAGATTTGACCAACGAGGAGTACAAGTCCATCTAC TT

GGGTACTAAGCCCGATGGTCGTAGCAGGTTGTTAAATACCCAAAGTGACCGTTATGC C

CCTAAGGTCGGAGATAGTTTGCCGGATTCCGTTGACTGGAGGAAGAAAGGTGTTCTT G

TTGACGTCAAAAATCAAGGGCAATGTGGTATTTTCCTTTTACCCTCTGCCTTGACTC TGC

ACCTGTTGTTTTTGTTTTCCTTTTTGTTCGTACTTATTTTCTGTTTAAAGTTTGTCC ATGCT

TTCTTTACTGATGGCTTTGATGGAAATTTGGAAACTTTAGTAGTTTGATAAGGTAAG ATA

TTAAAATAATCACAGAGTCATGAGTTTTAATCTAAGATCAATTTTAATGGCAAGTTC AGTT

GACCCTGCATTATTGTAAATTTTAGCTTAACATTAAGTATGATTAATTAGGTCAGCA CGA

TG AAGTTG ACAACTTTTG CTCCAATTTCCG CATCTAATTGTG G CAATATAAGTAATG CTT

TTTTCCCTTGGACAAAACACTAGTTTCCGGAATTGAGCTATTTTATTCAATTTAAAA TGAA

AATTTTCTGTTTTAATGTATTAGAACTATAAAGAAACCGAAACATTAAGTAAACTTC GGAT

TGATCTGTGTTTTTCGGGAATTTAGTTGTTAGTGGTCTAATTTTCGGTTTAAATGCA GTT

CTTAATATTGGATAGGCATTTTGGCACTTTTCTTGGCTGTCGCTTCTCTTACCTTAA AATT

AAAATTATGGAGTACCTACCAAGTTCAAGATCTTATGGTTGTAAATTGAATTTGTAA AAG

GGGTTCTTCTTCGTTTGCTCTGAGATCCTTCTTTTAGCTCGCTCCTTAAATATTTAC TAAT

CAGTGGTTTGTAGCTCCAACCGAGTGTCTATCGGAAACAAACTCTTTACCCTTCTAG GG

TAGGGGTAAGGCTGCGTCACTTGTGTGAACTCACTGGGTTTGTTGTTGGTCTGTAGT CC

GATATACCCCCATCAAACACCCTTGGAGTTGTTTCACTATGTCTAGTTGTGTCAATT GTT

TTGGCAAATTATGCAGCCTTGATTGATTGGATTATCTTCCATTTTATGCATAAGTAA ATG

CTGAGGAAAAAATGATATGTTTATATCACATAAAGCAACTAATAATTTTCTTCGTAA TTGG

TGTTGC AATTGG G AAATGAAACAG GG AGTTGTTG GG CTTTCTCAG CAGTTG CTTCAATT

G AAG CAGTAAACAAG ATAGTG ACAG GTAATCTG ATCTCGTTATCTGAACAAGAG CTG GT

AGATTGTGATACGTCCGATAACCAAGGCTGTCAAGGGGGTCTAATGGACGATGCCTT TA

AATTCGTCATTCAAAATG GAG GAATAG ACACTG AG G AAG ATTATCCTTAC AAAG CCAAA

GATGGAAAATGCGACCAAGCAAGGGTCAGTATGGTGTTCTCTGTCTTAAAGGGATTA TA

G GAAATG AACTAAATACAAGTTGTG ACTATTAATATTTTGTTTG CAGAAAAATG CCAG GG

TTGTCACCATCGACGGGTATGAAGATGTTCCTGATAATGATGAAAAGGCACTGAAAA AG

GCCGTTGCTGGTCAACCCGTCAGCGTTGCTATCGAAGCTGGTGGCAAAGACTTCCAG C

ACTATAAATCGGTATTACTTCAGATTTGCCTATTGTCAGTAAAGTTGTTTTCTTTTA ATCG

AATTAGCTAGTGTTTACACAGGCTCAACAAATATTTCTGTATTTTCAAAGTTACAGT GAG

TTCAGTATTAAAATTTTTAAATGTTGATCCTATTAAGTTTAAATGTTGGATCCGCCT ATGC

CCCAGGGTATCTTTACCGGAAAATGTGGTGCAGCAGTGGACCATGGTGTGGTTGCAG T AGGGTATGGTAGTGAAAATGGCATGGATTATTGGATTGTGAGGAACTCGTGGGGTGCT

TCGTGGGGTGAAAAGGGCTACCTCAGGATGCAGCGAAACATTGGCAACCCCAAGGGT T

TGTGTGGTATTGCTACGATTGCTTCTTACCCTGTAAAGACAGGCCAAAACCCTCCAA AA

CCAGCTCCATCTCCTCCACCAGTCAAGCCGCCCACTCAATGTGATGATTATAACGAA TG

CCCAGCTGGAACGACGTGCTGCTGTGTCTACGAGTACTATAAATACTGCTTTGCTTG GG

GTTGTTGTCCCATGGAAGGAGCTACTTGCTGTAAAGACCATAACAGTTGCTGCCCAC AT

GATTATCCTGTCTGCAATGTTAAAGCAGGCACCTGCTCAATTGTAAGTGATCTCTGC TT

GTTATTGTTAGATTGTCCCGCATTGGTTGAGGGGAAGTGTTGTTGTCTCCTTATATA GTC

TTCGGCAAGTCTTTTTAACAGTTAAGGTTGTTTCCTTTACTTATGGAATCATGTTTT TGTT

G ATACAGAGC AAGAACAACCCACTAG G AGTCAAAG CAATG CAG CACATTCTG G CCAAA

CCTATTGGTACCTTCGGAAATGAGGGAAAGAAGAGCCCTTCTTCT

SEQ 60

CTAAGCACTTTCTGCAAATCCAATTTGTGAACTGCCAAAGTCGAAAACTGTGTGATATGC

TCTCAAGAATGCATCTCCAAGAACCCTGCAAAAGTAGAGTTGAATATATCATACAAC TG

GATCTTCATGAATATATAATATATTATAACTTATGGCAGTGAAAATAGTCTTACCAG AGG

GGACGTCGCGGATGCGCGTTTAAAGTTGTAAATCCACTAATACAGTGGACACCTTGG CT

GTCATCAACTCTGATAACATACTATTACACAGAAAGGAAGATAGTAAGTGGAAGAAG AG

AAAG C ACTG CTATTTAAAACTATAATATGTTACTAATATAC G G C CTAAAAAC AAG AC G CT

AACGGCTTTTCCAATGTACCTTACTGGAAAACCAGTTCAAGTTGTCGAAGCAATTTT AAT

CTACATTGCTCCGATCCTCCAAAAATGCTAACCGCACTCATGTTGGATCCTCCAAAA AG

TGCAATGAGATTTTTGCGGGATCCCAGCAATCAGTGGCGAATCCAGGATTTGAATTT TA

TGGGTTCAATCTTTAAGATTTTTAGTATTGAACTCATTGTATTTTGAAGTTATTGCT TCAG

TACTACTATTTATTAGATTTGACTGAACCCGGTACTAATATGATGCATCTGCCTCTG CCA

GCAACATAAATTTTAATGGGAAGATAAGAACTTTTCTCCGATATTATGTCATCTCCG ATA

AGAACAGATGTAATTCTAGCACCAGTTGCTACCTTTAACACATATTGATAATGGAGG AAT

AGTATCAG ATAACAACAAACCTG ATCTG G AG AAAGG G GAAAAGATTTGTCTCCAATG GT

AAATGTTATATGTGGCAGGGCAAAGACATCACAGTTGATAAATGATTTTCCCCCGGG AT

TCGGAAGCTTCTCACACAGCTTCAGCACAAATTACATGTATTAGTAGTAAGTAACTA ATG

AGAACTCGAAAAACAAAAAGACACAACTGTGACATGAATACCTGATTGGCATATTGA AA

CGCTTTTTCTTTTGATCTCTCTTTTCTGATCTCTACTTGTATCCAGAACACTATCAT CTCA

CAAGAAGAACATAACGACCCATTATTTGTACAGAGTCCAATTCTGTTGCATACGTTC TCC

G GTTGTAACTG CATGCAG CCAAACAAAAG GATG ACATCTAAATAAGAG AACAC CTAAAA

TACCCGCACACAACAATTAGAAGTTAAAAAGCAGCTACCCCTGCTATCAAGCGTTCC CA

GATCGAATCCCCATAACTCGAGACAACTTTTTTGCATTCCAAACTAATAATTCCTTC CGC

TCCAATGGCATGATTTATTTGAGTTAAAATAGTCTGAAATGGAGATGGCTGAGAAGA AAT

TGATCTTGCAAAAAAGAATTAAAATCAACAATGGTTTAAATACAGACAGTTGGACCA GCG

ATAAATGATGTCCCTGTATCCACAATAGCTGGACATCCATCCTTACAAAGGCCTGAA AG

G GAAAG AG CAG AATTACACTAG AAATG CATTGTTTTTATATAGTAAATCACTTATGTATAT

GAAGATATTACCTGTTGAATTGCTTCCTATAAAAAGATCCCCTATCTCAATCTGGAA TTTT

TGAAAAAAAACTCAGTTTTTTCAGGTTACCAAGTAAACAATACAAATAAAGTTCCAT TAAA

CGGAGGCGCAGGTTACCTCCCAATAACCATTTTGAGCGACTGGTACGTATGTATGCT G

ACCCCTGAAGTGAGTCCAATCCATGCCTCCAAAGATAATTTCACCCGCTATCTTAGA CG

TAGGATCTCGATTTAGCCAGAATGAGAAGATTGACTTGGTAACCATATGCTGAAGCA AC

ATGTTATACCTGAAATTAACTCACACAAAAGAATGTGGATTTCAATTTCAATGACAC AGG

TAAGAAATGAATGAGAGAACAAGCTCATTAAAACTATGTCAGAAAGCATAAATTACT ATG

TCAAAAGTTATTATAATATGAAGAGATAATAATTTACCATACTGGTGTGACATTCCT TGAT

GTCGTGCTCTGATCAAATCCAAGTCCTAGTACTCCATCAAATCGTGCACGCAACAAT GT

CAAGTATCCCTCCCGTGTTACCTCAGTGAAAACCTGTTTAATAAATTTTATTCAACA TGT AACTTG AAAACATATATATCTAC AATTTCAG CTG CAAAGACCG G CACCTG CTG CTTTAAG

ACAGCACCTCCAACTTTCACATTGTCTTGGCTGAAGAATCCATGAACTGAACCAGTG CC

AAAAGGGATTTTGCTAGACTTTCCTATCATAAAAGCCCATGTATAGAAGTGATAAGT ATT

TCGATTAAGCTATTCTACGCTTAAATATCAACAATGCTCTAACCAAAATAAAGGTTT GGA

GGTCCATTGGGTAACTACCAATTTTTGTATACGTATTTGATAGTCTTGATTTGTACC TGG

AACGAAGATAACATGCAATCTGCATTGGATAAATCCATTCAAAAGTTAGTTAGTTTC ATG

TAGTGAAAATTGTTAATCCACAACTTGACAAAGACTAACCGAGAAGAAACATCTGGA AG

AAGGGACCCAAAGATTGGAACTTCCAGTATCAAACACAACAATGAAGCGTTGGGGCG G

TGAACCAATACCAATCTCCGCGAAGTACTGAACATCATGATAATTTTTGAGGTAAAC TAT

CTGGTCATTCGGAGCAGCCAAATTTCTATTGCGACCCCTGAGATCTTTAGCGTAGAT TC

TTGCATCGCTTATGCTAGAAAGGTCCAACGATTGCCTTTTTAGCTCAATCCTAACCA TAT

CATCAG CATATACGTTGATG CAG GTTATATAC CATATTAC AAGTG ATGCAAGAAG G ATTT

TGATCTCCAT

SEQ 61

ATGGCGTCAATTTTCGCTCTTTCATTATTTTTCATTATTATCTCTTTCTGCATCACTTCG A

TCACCATTCCCGTTCAATCCGACGGTCACGAAACTTTCATCATTCACGTTTCTAAAT CCG

ATAAGCCCCGTGTGTTCGCCACCCACCACCATTGGTACTCCTCCATCATCCGATCCG TT

TCTCAACACCCTTCTAAAATCCTCTACACCTACTCACGCGCTGCCGTGGGCTTCTCC GC

CCGCCTCACCGCCGCGCAGGCCGATCAGCTCCGCCGTATTCCCGGCGTAATCTCCGT

CCTTCCCGACGAAGTACGCCACCTCCACACCACCCATACCCCTACCTTCTTAGGCCT TG

CTGACTCCTTCGGCCTTTGGCCCAACTCCGATTACGCCGATGACGTCATCATCGGAG TT

CTGGACACAGGTATATGGCCGGAAAGACCGAGTTTTTCCGATGAGGGTCTCTCTCCT G

TTCCTTCAAGTTGGAAAGGGAAGTGCGCTACTGGACCGGATTTTCCTGAAACCTCAT GT

AATAAAAAAATCATAGGTGCCCAAATGTTTTACAAAGGCTATGAAGCTTCACATGGC CCA

ATGGATGAATCAAAAGAATCGAAATCGCCAAGAGATACTGAAGGACATGGAACACAC AC

AGCATCAACTGCAGCTGGTTCTGTAGTGGCAAATGCTAGCTTTTATCAATATGCCAA AG

GTGAAG CTAG AG GTATG GCTATAAAAG CAAG AATAGCTG CTTACAAG ATTTG CTG GAAA

AATGGTTGTTTTAATTCTGATATATTGGCTGCCATGGATCAAGCTGTTAACGATGGT GTG

CATGTGATTTCACTTTCCGTTGGGGCTAACGGTTATGCTCCACATTATCTCCTTGAT TCT

ATTGCAATTGGAGCTTTTGGTGCATCTGAACATGGCGTCCTCGTCTCATGTTCAGCT GG

AAATTCTGGTCCCGGCGCTTATACGGCAGTGAACATTGCCCCCTGGATTCTCACCGT TG

GTGCTTCAACTATAGATCGTGAGTTCCCTGCAGATGTTATTCTAGGAGATAATAGAA TAT

TTGGTGGCGTATCATTGTACTCCGGCGATCCATTGACCGATGCCAAATTGCCGGTGG TT

TATTCCGGCGACTGTGGTAGCAAATACTGTTATCCAGGAAAGCTAGACCATAAAAAA GT

CGCTGGAAAAATTGTTTTGTGCGATAGGGGAGGCAACGCTAGGGTTGAAAAAGGGAG T

GCAGTGAAGCAGGCAGGCGGAGTAGGGATGATACTCCTTAATTTGGCCGACTCCGGT G

AAGAGCTCGTCGCCGATTCACATCTTCTCCCCGCGACGATGGTAGGTCAAAAAGCAG G

AGACAAAATAAGACACTACGTAAAGTCTGATCCTTCACCGACGGCGACGATCGTGTT CA

GAGGAACCGTGATCGGAAAATCACCGGCGGCGCCACGTGTAGCGGCGTTCTCGAGCA

GGGGACCGAATCATTTGACGCCGGAGATTCTCAAACCGGATGTTATTGCACCTGGAG T

TAACATTTTGGCCGGTTGGACCGGATCTGTTGGACCGACCGATTTGGATATTGACAC GA

GAAGAGTGGAATTTAATATTATTTCTGGAACTTCCATGTCGTGCCCTCACGCTAGTG GA

TTGGCTGCGTTACTTAAAAGGGCCCACCCTAAATGGACCCCAGCAGCGGTAAAGTCA G

CACTCATGACAACAGCTTACAATTTGGACAATTCTGGTAAAGTATTTACAGATCTTG CCA

CTGGCCAAGAATCTACTCCTTTCGTTCATGGATCAGGTCATGTAGACCCGAACCGAG CA

TTGGATCCGGGTTTGGTTTACGATATCGAAACTAGCGATTACGTGAATTTCCTATGC TC

CATTGGCTATGACGGCGACGATGTCGCCGTGTTCGTGAGAGATTCTTCTCGAGTGAA TT

GCAGTGAACAGAATTTGGCTACTCCAGGAGACCTGAATTACCCGTCGTTCTCTGTTG TT TTTACCGGTGAGAGTAACGGTGTGGTTAAATACAAGCGGGTGATGAAAAATGTAGGGA

AAAATACAGATGCTGTTTATGAAGTGAAGGTGAACGCGCCGTCGTCTGTGGAGGTGA G

TGTGTCGCCGGCGAAGCTTGTATTCAGTGAGGAAAAGAAAAGCTTGTCGTATGAGAT TA

G CTTTAAG AGTAAAAGCAGTG GTG ATTTGG AG ATGGTG AAGG G G ATTG AATCTG CATTT

GGGTCGATTGAGTGGAGTGATGGAATTCACAATGTGAGAAGCCCAATTGCAGTGCGT T

GGCGTCACTATTCTGCGGCATCCATT

SEQ 62

TCACATAGGAGCAAGATGACCTTGTTTGGACAATTTATCTTGCATCCACCTGTGAAGCA

TTTCCAGTGCTGCCTTAGGTTGATCCATTGGAACCATGTGACCAGCATCATGGACCT TT

AAGAAAGTTAAAGGCCCATAGTTCTTTTGAACACCTTTCTCTACACCATCTACTGCA AAA

GAAACTTGTGTGGCTTTTCCAAAGGCTTTTTGCCCTGACCATTTCATTGCATGCACC CAT

CTCGAGTTTCCTGCCCATTATAGCAAGAAATTGAGTTTAGTTGTCAATTACTAGGTT GTT

TTCATCTTTCAGTTTATTGTAACAAAATTATGTTTATATATCACATATAAAATAAAA ATAGT

TACCGGAAATATATACTAAATCCGGTCAAAGAGAGATCATATCCATGTAACACAATG TAT

AGGAAGGCCCATTTTTTTTTTGGTTGGTCGGGTAATGTTTTGTTTTGACAATAAGTA GTG

TCACATGGATATTCTGAATGTAAGGTCAGTTGCCGATATGACTATATTATTACATGT AAA

TGTTATACATTTGGCAGCCTACTTGCACTTTCTTAGAACCGGCTTTAACATTTTTTG ACTT

TGTATTTTAATTTTCTGTGATACATAGGTATAATTAAGTTATTTTGCTTTACAGAAA CATG

CTTAATTTTGTCCATATATGAAATTAGATGTTGGGTCAGAGATGAATGAAGCTTACC AAG

CCAATTGCAGATAAGGTCATATTCCCCAGCATACACAAGTAGCTTGATACCATCCTC AA

GGAGTGAAGGAATTCCCTCTTCAAGATTCCTCATCCAGTCCAACTGCATTGCCTGGT AA

ACCTCAGAGCTACATGAAACAAACTCAATATCCCCAACACCAAGTGCCTTTTTAACT TGT

TGGTCATTGAGGAAAGTTTCCATTTTTGAGAAGTCATAGCAGAGATCACCCTCACAT CT

CTTCCGCACGTCATAGTACTGCAACTCGGAAATTACATGACTTTAACTTCTCTACAC TAA

CTAACAATGTAAAGAAATTTTTATTGTTACTTAAAAGATAACTACAGATAATAAGTG AGAT

CAGTAATTTGGAAAATGAGACTGCAAGTAGCGTGTTGTACATCATTAGCTCAAAGAA AAT

GAGATTGCGCTTTCTTACATTTTTGTCACCAGCAATGTCCATAATCTTGTTGAAGAT GCT

TGTACAAACAAGATATGCAGCCATGCAAGCAGTTCCGCCATCTTTTCCTAATATTAT TTG

CATAG AAATAAAG CTAATTAATGTCAAG ATTATATTAG CTG CTGCTATATGAAG GAG AAA

GAACTTACAGGATAAACAAAAAATTAAGAATTACCACAAAGCTTAATTGCCAACTGA CAT

TTTGGATATGATTTCTCTATGGCATTGTAATCAGATTTTTTTATCAATTTCATATCC AGAG

CATAGTCAGTGTAGGCTTTGTATTGAATTTCTGGATCAGTGAGTCCATTACCAATAG CAA

ATCCCTATACAAATTAAATACACTTGGTTAAGTTATCGGCATGATGACAAATTTAAA TTAA

ACCTACCTAAAAGTTTAACTGAAAAAAAAAAAGAATGGTGGAGGAGCTAATGAGTTA GA

AATACCTTGAGATTTACGTAGATGCCTTCTTTATTTTTGTTTCCTTGGTGGACCCGA GAA

GCAAATGCAGGAATGTAATGCCCAGCATATGATTCTCCAGTAATATAGAAATCATTC TTT

G CATACTG AG G GTGTG CCTTGAAG AAGG CCTAT CAT CAAAAGAATTTGAATTAAATTTTA

TTAATTATATCAGTTAAACTTTAGAGACTTATCACGAGCTAAAAAAAGAAGAATGAA AGA

ATAAGATCAACCTGCAAGAAGTCATAGAGATCATTGCTTACGCCCCTTTCATCGTGA CG

AATATCATCATCGTTTGAACTATAACTGAAACCAGTTCCAGTTGGCTGATCGACGTA TAT

AAGGTTTGAGACCTGTCAAATTGCAATTTATCTTATGTTATCATCATTCTTCAACTA ACAA

ATGAAAGTTGCATGTTTGATTATAGGATTTAACCAATGTAAACGACTTTTACAGTAT TGTT

ATATATATATATATATATATATATATATTAACATGTTGTATTAGTCCGGTTACTAAT CCCAC

TTAAATAAAGAGAAGCGTAGTAGTCATTGCTGTCAATAAGCGATGAACTACTTTTAA ACT

TTTG AATTCTACAAGTCACAACTAATG AACAAGTGATAAAGAAAG GAAATG CTAGTAG GT

AAAAAG GTACTTTG CATGATG GAG CAAG GTTG AGTAAC AAATAAAAACATG GAG GG AAT

TCTTTTAGACTTTTACCATATTCAAAAGATCTAACCGACGTTTCTTGAAAAATTAAT TGGG

TAAAATAAAAAAAATAAAAAAT AAAAAG C AAAAG G AAG AAG AC AAG AAC CTTGTC C C AG C CGAAATCATTCCAGACAAGAGACATATTATCTGCAATTTTGAATGGTCCGTTTTCGTAAA

ACACAGCCAATTCACTGCTACATCCTGGCCCTCCTGTTAGCCATATAACTACTGGAT CA

TTCTTCCTGCTTCTCGATTCAAAGAAAAAGTAAAACATCCTGCGAAAAACATATTAA AAA

AAC AC AC AG ATAATTTAG C ATTAACTAATAATAC C C ATAAATG AAG C AAAAAG AG C ATTT

AATCCAATTCAAACCTTGCATCTTTAGTATGTGGAAGACGATAATAACCAGCGTGAT GAC

CCAAGTCTTGAACTGTAGACCCAGAATTACCAACATAAGATAAATTCAATTTCCTCT CAA

AAAGCCTCTGTTCAGAATCCCCTGTTGCTGCAGCCTTGTTAATATCATGTTTAGGGA ATA

AATTAAGCTGTCTGATTAGCTTTTCTGCCATTGTTAATGGGAATTTAGGAGTAGAAG ATA

GGAAAAACTCATCGTCATTAGAATTTAAAGTTGATGAGAAAGATAAGGAAATAGAAG CAA

GAAGCAGAGTAAGAAAGAGGGATGAAGGCAT

SEQ 63

ATGTTAGTTATCAGTGATTGTTATATAAATTCTTGCAAAGCTTTCAACTTTGTGATCAAT T

TGCCCGTCATGGGACACTCTCACTCTCATTCTTCTCATTCTCACTCTCACTTTCACT CAT

CTAAATCTTCCGATGATCAAAATATGGATATGGGGGAATCGATCACCACCCAAACAG AC

GTTTCTTTCATGCTCGCTAAGCATGTTTTCTCCAAAGAAGTTAAGGGCGATTCCAAC CTG

GTGTTTTCTCCTCTCTCAATTCAAATAGTACTTGGCCTGATTGCGGCCGGTTCTAAG GG

GCCAACTAAGGATCAGCTGCTCTGCTTCCTCAAGTCCAAATCCATTGATGAACTCAA CT

CTCTTTATTCTCATTTTGTCAGCGTCGTCTTTGTTGATGGCAGCCCCAATGGAGGTC CT

CGTTTGTCTGTTGTTAATGGTGTTTGGATCGACCAAACACTGCCTTTTAAGCCTTCT TAC

AAAAAG GTTGTG GAT AAAG TTTACAAAGCAG CTTCCAATTCTGTTGATTTTCAGTG CAAG

GTTAGGCCTTTATTCGTTTGTTTCATTCAAATCTTGTTTCTTTTGTGCTGGGGTTTA ATAT

TCTTTGTTCATGCTGACTGCTGAAAATTGGTTCTTTAACTAGTATAATTGACCCTGC ATAT

TACTCTCATCATAAGCCCTCCAAATATATCATATAAAATGGATATACATATAGTAAA CTGC

AACTAATTAACTTGGGATTGAGGTATAAATGATTGATTGACCAATCTGACTTTAAAA TAAT

GAAAAGTGTTAAACAATTAGGACAGAAGCTATATTGCTTAGCCTCAAGTAGTAACAA AAC

TAAATAATGTCAACGGTTGATACTCGTTTCACAGAATTGAGGCAGTTTAAAGAGTAA AAA

GTATTGGTTGTTAATTTGAAAAGTAAGATGAAAGAGTCACAATTCATCTTCATCAAT GCT

TATTGTTTAGCAGGTTAGTTGACTAGTTCGACATTTTACTGAGTGGTAATAAGCTTC TTTT

TTGTAGGTAGCTAAAGAAGCCTAAGTAGTTCTAAGCTCAACTGGATATGTGGCCGTG CT

TAATTTTGTAAAACTTCAGTTTTTGGCCTAAATCTACACCCAATCAGTGCTTAAAAT ATAC

CATGTAAAGCATCCAAATTCTCACTTACCCCTTGCAAGTACTGTAATCAATCTTCTT ACT

GCAAACTCCCTTTGTTGGCTAAAGCATATACGTGTTAATTCTGTCGTATACTCTGTT TGT

CTTGCTAATTGAATAAGGCTGCTGAGGTTGCCAATCAAGTCAATCAGTGGGCTAAAA TG

AAGACAAATAATCTCATTAAAGAGATTCTTCCTCATGGAACAGTAAACAATATGACA AGG

CTCATCTTTGCAAATGCATTATATTTTAAAGGAGTATGGAATGACAAGTTCAATGCT TCA

GAAACAAAAGACCATAAATTCCATCTCCTCAGTGGAGGGTCTATTAAAGCGCCGTTC AT

G ACTAG CAAG AACAAG CAATATG CAGTAG CCTTTG ATG GCTTCAAAGTGTTGG G ACTTC

ATTACAAGCAAGGCAAAGATATGCGTCGTTTCTGCATGTATTTAATTTTGCCAGATG CTC

GTGATGAATTACCAGCTCTATTGGACAAGATTAGTTCAGAACCTGGTTTTATAGATC ATC

ACATTCCGTTTGAAAAAGCTAAAATGCGCAAGTTTCTTATCCCTAAATTCAAAACAA CTTT

TGGTTTTGAAGCTTCCAAGGTTCTAAAGGGACTTGGCCTCACATTGCCTTTCTCCAG TG

GTGGCCTCACTGAGATGGTGGATTCCCCGTTAGCTGGGAGGTTGTTTGTTTCGCAGA T

TTTTCACAAGTCCTTCATTGAGGTAAATGAGGAAGGAACAGAAGCTGCAGCTGTTAC AG

CTAGTGTAATAATGACCAAGTCCTTGATAATTGAGAAGGAAATGGAGTTTGTTGCTG AC

CATCCATTTCTATTCCTTATAAGAGACGAATCTACCGGTGCTGTGTTTTTCATAGGG AGC

GTGCTGAATCCTCTAGCTGGT

SEQ 64

TTAGCTTGAGCAAGCTGACTGAAGTTCAACTGCATTTTCATCATCAGTAAGGTCACTAGA

CATGGCATGAGGTATTCTATGGCGTTTCAATATACGCGATGTGGCTATTCTTGCCGA TT

CATAGTTCAAAACAATCACCTTCTCATCATCCAAATCAAATCTTACATTCTTTTGGT TACC

ATCTTCCACAAGCTGACGTAAATGTTTCAAATTCAGAACTTCTACGCCATTAACCTT CTT

CACCTGTACAAAACGGTTCAAAATTTCCCCGAAAACAAAGAGGAAAAATAATTATAA GG

CGCCCTTATGTTGTCTTTTAAGGGAGAATGAAGTAATAGAAGCAACAAGTGTTTTAC CTG

CAACTC GG CAAG G CGCTC ATAACC AG CATTAATATCATC CATCAACAC CTAATTAATTG C

AAACG GTCATATTAG CAG CATATAG CTCATTG ATCTGCG AGTACAATCCTAAAG G AAAA

ACAGACCTCCGTCTCTATGTATAGGTTAGGAATCATGTTGCGAAGCAAAAGTTAAGG CA

CAAAAAATTGAAGTAAACAAATACAAAGTTTATTATTTATGAAGTTAGCCAAATGAC ATAG

ATTGTCAAGTAAAATAAGACATGCCTTCCTCCCATGTCAGAAATCTTACTGGAAACA TTT

ACAAAAGGTTATCTAGCAGCAAATATCTTAATACGTTAACAATGTCTGCCCATTTGG ATA

GACCCGTCCACCATCAGTTTTGTGTGTCTCAGTATGTATTTAGCATGTGAAATTATG GTG

GAACTTCATGGCGTCTTTATGCTTTTTTGTTTTTCTATTTTTATAAGAGATCAAATT ATTTG

AGTTTTCTATTTGACAAAAGGATAAGAAACTCTAGATACCTGAGAAAGGATGATGAA TTG

TTCACCAGGTTTCTTAGGTAGTTCCCGAAGGGCTCGTTCACACAACCGACGGGGTGA G

GCATTATACCAGTCTTCTCCATACTCGTGAAGGAATGGTTGAGTTAATGGAATAAAG AC

GAG AC CAG CAAATATG AAATAACTCG GAAGCTTGTCAAATTGATG AACTG GAAC AAGTG

GCTGCAACTGCATAATGTTAACTAGTTGTTAGATACAGTAGTCAGTTACACAGCATC AAA

GAAACCTTTAGATTACCAATCATACAAGTTGATAAAGTGTTAAATCTAAGGCTTAGC AAA

AAGTGCAGTCACCTTCCCAACACCTTACTCCTATGCATCAGATTCCCAAGAGCGACA AA

GCAAAAAATAACCTTGGTCCACTCGCACACCAAAATACTTCTAACAAAACAAAAATT TAC

CTTGCTCCAATACCGCACCAAAATACTTCTAACTACATAAATTATGTGACACCCCCA AAT

ACTTCAAGGCTTCTTTCATCTTTTCTATTTTTCTATTTTTGTAAAAGATCAGATTGT TTAAG

TTTTCCATTTAACAAAAGGATACTTGATGAGAGCAACTATAGATACCTGAAAGAGGA TGA

TGCATTGTTCAACCTTCATATTATAACTTGTGCAAATGATGAATTTACTTCTACCGG AGC

GATAATGTCAATAAGTTATTTTACCAGAGTAATTCTAATTGAACAATGTAGATGCAG TGA

TTTCGATCTATTCGTCTGATTACATGCACCCAGGATATGAGCAGAAGTTTGAAAAAT CAC

TGGGAGATACATTCGAAACAACAATTAATTATGAAGGGTTAATAGAAGGGTAATTAT GAA

GGGTAAATAAAAGTGACTTACAGGATGAAGCGTGATTTTGAAGTCATGCACTTTGCC AT

TTCTCAAGACTTTAAGTTCAGCAGTTTCATTAGGTTTCTTCATAGATACCAGATGGT CAA

ATGTGATCCTCTCTCTGTTTCGGAAAGGAACTGCAGATCATAAGCAATCTCAAAACT TTA

GTCCAGG G AAATAAG GTTG CTAATTTTG ATATG CATTTTAGTCAAAATG CAG AAG GG CA

AACTTTTCTAACAGAAAATAAGTTATTCTAGCCTTACAATTTTCATATCCTGCACAC CCAC

TTATCTGGTTTCTGGTATAATTATCAGCTATTCACATGGCAAGAGAAGAAACTCAGA ATT

AAAAACGACTAGACTCTAGGCTTTTCCATCTCTCAAAAGGAGGTCATTAATTTGTTA CTC

ATAGTTGCTCGTTGAACATGAATACTTTTTAGTTGGTGCTGGCCTGCCTAAAAGAGC CT

CTACAGCAAGAACCACATCCATGTTTCATCCAAATCCGTCATTTTACGTCACGAAAC ATT

TAGAGAAAGAAAAAGGCATGTGCCATAAAATGTAAGAAAAAGCTGTGAAGAAGAATG TT

GTACTAGTTTAACTGTC GTACATAAGAATTACAAG AAATAG AG AGAAG CAAG G CAATAA

GAGAAAGCCTTACCTGTTCCATCATTTGCTATGGGTACGCCATCAAATGAGAGGATT AT

GTCGTCTTTCTTTAATACTCTAGAAGCATCAGAAAGTGGGTTGATTCGGCTAACAAG CA

CACCTGTCAATTTGGACTGCATTTGGAAGTACTCTCGAATTTGTGCATTTTCAGTAG GTT

GGCATGACAAGCCCAGAGAGCAAAACCCAATGTATTCACCCCGTTCTTCTACTCCAG CT

ATAAAATGCTTTATCACAGGAACAGGAATAATGTAGCTGCAAAATCCAAATAGAACT GAA

ACTTTTAAGGCCAACTATGCACACTGTAAATTTCCTTTCCAATTACAACAGTTTTTC TCCA

AGGTAGATTGTTTAATCAAGGGTATCAGTTTAGATTTGATTGTTTGCAGCACTGAGG AAG

AGTCGAAATAGAATGAACTTGAATTCGAGACCGCTATTTGAAATCATAGTGAAATAC TGG

AATTTTTATCTCATGTCTAAGAGCTACTAAATGTTCCCACAAGCTAAGCAAATGTTG ATTA AAACTAGTAAATGTCATCAACCAAATCTCCTTATCAACTGCATGTCATCACAACTAAAAG

CTTTCAGGCATTCCCACATATGCCATCCTTTGTAACCCCTCTGAGATGAAAAGAATA ATA

TTATGAAG CTAGAG CCAAAGG G CTACAACTCAAG CTTCAAATTTGTG AATG CATG AACA

AGGACTGCGTGAAGGGAAAAATCTGAATATTATGACGAAGAAAAATGGAAGAGAAGA TT

TG CAG G AG AATACATGTGTGAAG G G AAGAATAACATG CAGTCAG ATTCAGGTAAAGG A

GAGAAAAATCTGAATTTTGTGATGATGCAAGTGGATATTAGAGTATATACCCCATCA ATA

TTTAACTAAATAATATATG GTAG AC CCAG CAATAAATG ATG CAATG G AACTAAATCTGTA

TAGTAGTTCTATCCCTCCGGGGTAGGGGTAAGGTCTGCGTACACTCTACCCTCCCCA G

ACCCCACTTGTGGGATCCTACTGGGTTGTTGTTGTTGTTGTAGTTACAGAGACACAA AG

TACCAAAATAAAATTCTAATTTACCCAATATTCTCTGCACCAGAGAGGTTTTGGAAA GCA

ACTCCAGCAACTTTGTCACCCATAATTGCTGGTCCTCCACTATTCCCTGGATTTATA GCC

GCATCAATTTGTATTGCCAATAGTTGACTAGCGCCGTGTACATATTGCGTAGGTTCT AC

CCTTGAGACAACACCTTTTGTCACGGATATATTATCTCCCCCTAGAAAAAAGCAAAA ATT

ATTAGAGAACTCCTCAGGTGAAGGTATTTGTGGCTTACAATTAAGTAAAGAAAAAAA AAA

AGAGAACATAGTATAGTGAAGAAAACAAAAATATAACTAGACGTCAACAAAAGATTA AGA

AGGATCTGCACTATTGAAGACAAGAATCTAGTATATGCAAGCTACAAATATCCAGCC TT

GCACCTAGTTGACACCAGAGAGAAACAAAATACATCATGAAAGTTTCCTTTTCACTC TTC

TGGATCTTATTTGTTCGCTGCTCGTATGAGCCCTCGAAAAGGGTACACCGAATCCGA GA

AAATAGTGCACTCTCTCGCGGAGTATAGCTCACATCCAAACATCTGATTAGGGAATG GG

GCAATGCCCATGAAGCTCTGGCGGAAAGGGAAGGCATGCCAGGCCGTATGCCTATGG

GTGCACAATTCTTCGAAAAAGCGCATGCTACCTCGGAGACCTGGGACCTTGGCTTAG T

AATGAATGAAGGGAAGCTCTTCGAGCTTTCTCCGCCAGCGGCTTATGTAGTGGTCGG C

CTTATAAAGCTCGCTAAGCCTCGCTTCCCTCTCCCCTTCACTTATTAATTAAGTGGA AAA

TAGTCGTCGGCATTCTATAAGCGACTTGACCGAGTCTACGAAGCTTTGCTTTTCTTG TA

GTCGGCCCGTAATGCCTCCCTTCATTTGCTTGCCTCCTTTCAGCTCAGAATGCCTAA TG

CCTATTATTTAGTGCTAGAAGCTAACCGCCATAAGCTCACCCTTCGGGTCTCGCTTC CA

GCGCAGGAGGCCAAGCATTCTGCCAGACGTCCCGGCCTGGGAGCCCCGTTCGCCTTT

GGTNGCTTATTAATTAAGTGAAAAATAGTCGTCGGCATTCTATAAGCGACTTGAGCA GA

G GAG ACAAG AAAAG G CTCAGTG GTAACTGTTATG AATGTGG AGG AGTG G GG AATTTTT

GTCGTTACTGTTACAATGTTCAGACGGACGAATGCTGCTAGGGGAGTGCAGAATAGA C

CAGCTGAAC GTAGTC GG AGAAG CTCTAAAG GTAAGTTC AAGTCG GAAACATTGTGTTTC

CACAAGAAAAGAAGCACATATTTTTTATAGGATTTGATTTGATAAATGGAAGGAAAT TGT

CAATTGAATTGGCAGAAAACTATGGTAAAGGGGAGCCCAACGTCCTCAGCAGAGAGG A

ATGGCAGATGCATTCCCCCCNAATGACATTAGCCTAATTTATTTGAACTATAGACCA AAA

AGAGCCCCATTGGAGTGAAATATATATAGTACTCATACTGCTGTTCCCAATTGATTT GGA

TTGAGGTGTAGCTGATTGGTTGATTTCGTGAGAAAACATGAATAAAAGGAGCTCACT TG

TGAATTGTGATTCATCTGCCGTTGTATTTTAAGACACGAAAAGAAGAATGTGAATTT CAT

ATTAACTCGACACCGTATCAACAACTCAGTTATTTTTATTTTTCTGAAAATTATATC AATA

ACTCAGTTATTTCGAATAGGTATAGCATCCCATTGCTAAACATGAATGTTGTATATC ACA

CAAACAATAGGGGGGAAAGGGAGGTTCAGAGTTCGATTCAAGTGTATGGGGGGTTGG

GTGTTTACATTACATCTTGAATCAGTTACAAGAAAAATAATTATCTTGAGGATGACC GCT

G ATTAAAAAAAAAATTAC CTTG AG G ATAAC C AAC AAC AG C C AC AG CTTCTTG G AG AAAT

GGAACATCACCAAGCTCCAAAGAGTTCATGCCCTCCCAGAATTCTTCACTTTCTACC AC

CAGAATAGCCAAGTCACATTCATGACCAACAGCTTGCACTGTTGCTCTATACTTGGT AG

GAGAACCATGCTTTCTTACAAGTACAAACGTATGATCAGCCACAACATGAGCATTTG TTA

GGATCCTCTTTCCCCGAATAACAAAACCTATAACATTAAGAATGCAAAATCCGAAAG TAA

GCTTTATGTTCTCTTAAATTATTAGTAGGTCAGATACTTGATAAGTTGATATGAGCT CTGT

ATTCTCTTCTTTTAAG AAAAAGAAATTATTACACATAG AG G GTG G GG AAG GG G AAATG G

G G G AG G G G ATTAC AAGTG G G G AATC G AAAC C CAAACTTGTAAC AAAAG C AG ATAAG AG

AATTAACCTTCCAAAATCGAGTCCCCATAACAAATCAATCAACAATCCTCAATCCTA AAC

TAATTGTAGTTACAACATCAATAATTTCATGCTTTAATCCTAAACTAGTTGGCGTTG GAGT TG GG ACAACAAATCTCTAATACTTG CAG CCCTTG G GACAACAACAAACTATTTATCTCAA

TCAGTGGCGGAGTCACCTTATACCAAGGGGTGTCAATTTGACACCTGACACCCCTTC AC

GGGAAAAAAATATACTACATAGGTAGGTAAAAAAAATTATATATATGTTGACTCCCC TTA

ATTTTTTCGTCTATTTACTTATATATATTTTGACACCCCAATGAAAAGCATGCCTCC GCCA

CTGATCTCAATGAACAACAACTTCATAAAAAGTAGCCTTTTGACAAGGCTTCCTTAG TAA

ATGAAGTGCCAATGTAAGATTTTCACAATAACCAAATGGCTAGTAAAAAAACGGAAG CAT

TACACTGATAGAGAATAACATATTTAGAAAGTAAATGAAAGAGAATAAAAATACCAG AGC

CCGTAGTTTCACGCTGGGACTTGTTCTGCCATGGAAGGAAGTAATTAGGACTACTGG AA

ACAGTGAATATTTTAACTACAGAATCCAATGCTAGCTCTATTGCTAAATAAGCATCC ACC

ATTCCACTACTTAATCGCTGCTCCACCGCCGCCACCGGTTCTACTTCCGATGCATTG GA

TTGAAGACTATCATTTTCCTCAGCTCTCTCGACATGAGGTGTCGTAGAAGTTGTTGA GC

TATTGCTGTTATTCAAGGTGGAAAAAAATGAGGCGGAAGCAGAAGTATTACCAACAG TG

CTGCTATAATTGCAGCGCCGAACAAATCGATGAAGCTCTTGACGTCGGTGATGTACC G

GAGCTACATCTCCGGCGATAATAGGGGATTGAAAGTGGAGATTTCTGTTTAAGAGCT TT

CGTGCTGTACGGAGACTTGGACCTATTCGTAACAT

SEQ 65

CTAGCTAACTTGCTTTTCACTAGCCTTGTAATACAGCATTTTCTGCACATAGAACATCAT

GTATCCTTGCGCAGCTCTCACAATGTTTTCGCTCACTTGAGTGATCCAAGCATCATC AC

ATTTGTACCATTGATTACTTAACCTAAGATATGTTACGTAATGACCAGCATCAAGTT TAC

CGGTATGGGTGATGACAGCAAACAGCTCAAACTCTGAGGACGATTCACAGGACGCAT C

TTGCTCGTCCCCATCAAAGGAGAAGATTCTATTTCCAAATCGACTCCTCAAGATAGA TG

AAGAAAGGTAAGGCGACATGTCCAAGGAAAAAGGAAACTGTAGGTAGTGATCAACCT T

CCTTGACATTTTCTTAATCACAGAATGCTCAAACCTTTTGATATGGAAGCAAGAAAC CAA

AGGCAATTTTCTTATGGACATCTGTTTAAGAGATTCCTGTCTCACTTGACAATGTTG GCA

GAAGAACTTCTGATCAGAACCCAATTTCTCAGGTCTTGTGAAATGATCTAAACATCC CAT

CAACGAAGAAATTCGACCGTTTTGGCTAAACTTTCCAGATTCTGCTTCCTTCTTGTG AGT

ATTATGAGACTTCTTTGATGTCATCTTTGAGGAACTCCCCTGGCTCAGTTCCAAGTC CAA

GGAGATGTCTATACATGGATCATATGTAGTAGATGTGAAGCCACAAGCTGTACACAT GA

CATCAGACCGCAAGATTCCAGAAAATACTCTATGAGCAATGCAACAGTCTCCACTAC CT

GTGCCAAAGAAAAAGATTTGATAGAATGAGATCACAGCTGAAAAAACACAATGTGTA CT

CCTAATAAGTTACACAGAGTAACATACTTAATATGCACAGCTTAACGTGCTCCTCGA CAC

ACTTCATAACAAGTAAAAGTCCAGTTATGCCATAAAGTTCTATCAGCTATTGCATTA CTA

GACAAAACAAAAATCGTCCTATCAGGAATGGCAACCAACAGAATGAGTAGCTAAAAG CA

ACGGAGTACTATTGAAATAAAAGTAAAAAACTCAGTAAGAAATGTATCCGCAAGACA CAT

TTATTGCCTCAAAAGCTCTATCTATTCCATTTACACAAAACCATTCAGAATCATAGA TAAT

GTGTATTCATTATTTAAATGCAATACCCAGATGTTAACTAAATTTTGCAAACTCAGG GCA

GATAGCCCTGCACCAATCCTCTAATTAAGAGGAATGAGAATAGGTGTGAACTCTCAC AA

TCAAAGTACGTGTCACTTAAGTGAGAACCAAATCAGCATATTTACAATGGTAGAAAT GAA

TTGTTGCTAAGGAGTTACCCCTCTAATGATTGCTCTGCTAAAGAAATGCGAGGTACC GA

G GAG CTAG ACTC AC AAG AATAATAAG G C AC ATAAG G G CTTATAG AAAG GAG G AAG C AA

AAG G CTGTCAACCTAGTAAAG ATTTCAAG CTTTCTTG AAG GTTGCG ATCAG CTCACTCG

AGAGTAATACCCTAAAACATGGTAAAATTGCGAGTTAATACGAGAACTATTGTACCA GAA

G AAAACTCCTATG CTGAAATCAATAAGACTAACCAAACTG AG ACTTACTG G G GAGAG AG

TAGGAGGAAATCTTTTTCCTTTTGAGAAACATCTCTAAGCCTGACAGCGAATATCAA CTT

GTAAATCGCCTATGGCAAAGAAAAGACCTAAACCATAACCTGCATTCAAATATTACT ATT

TTTTCTCAGTGACAATGGAAGTTGGGATTGCCATGAGAGATGAGATGCTAGAACAAA GA

TAAGCCCATCAAGCAGGCCCTGAGTGCCTTTTAGTCAGACGTTACTAGACATCACAA AG

ATGCTTGAACACACTATTCTGCTTCTGACAGAAATTGCTTCTTCCACCCCCTCCCCA CCA ACAAAAGAAATTCAAAAAATTCACCTACTGAAGGACTTGACCTTTGCAAGTACCTGTACA

AGTTTCAGTAATCCACTTGTTTGAGGTTTTTACAATACTAGCCTCCCTTGGCTATGT TAC

ATTTATGTTACTTTAAAGTTGCTGCCATGTGACCTGGAGGTCACGGGTTCGAGCCGT GG

AAACAACCTCTGCAGAAATGCATGGTAAGGCTGCGTCCGATAGACGCCTGTGGTCCA G

CCCTTCCCCGGACCCCGCGCATAGCGGGAGCTTAGTGCACCGGGCTTCCTTTTTTTT T

TATTAAAATATATATAAACAATGTTGTCAATAATTTTCCCAGTACAACAAAAAAAAG AAAT

CTCAATGATTGGTCTAATTCGGAAGAAAAGGGAAAAAGGAAGTATAAGAAACTAATA TA

GGCAAGGTGATGGGCGGAGAAACGATGGGCAACTAATAGGTACTCTAATGCAACAAA C

AAATTTACCTGGACTCAACGCCTTTCCCTTATCGTTCTGCATCCTTTCATGAATCCC GTC

AAG CAC GG AAATGAAAAACTCATG AG CATC CTGCTGTT CAT AACTTG CAAGATTTGATG

CATGCTTCCACCAGCTGTTCCAAAGAAAGGTTTACATCCATCTGAATGAGACACAGC AC

AGACTGCAACTAGATACTCAAAAAGTCGAAATCCACATCTAATAAAACAAATTACAA ATG

TATGTATATCACAACCAAGTTACACACCACCAGTGGCGGAGCCAGGATCTCCGCGAA G

GGGGTTCAAGAAAAAAAAAATCGTAGCTAGTGGGAATTGAACCTATGACCTTTCAAA GA

TTTTGAACCCCCTTGACCACTAAGCTACACTTATGGTTGTGTCAAGGGGGTTCAAAA CT

TAATATATAG AG GTAAAAAACAGATTTTG CCTTATATATACAGTGTAATTTTTC GG CG AA

GGGGGTTCGGGCGAACCCCCTTTCGCCCCCCTAAATCCGCCCCAGCACACCACTGTC T

AATTTCACCTCTATGAAGGGAAAAGCGTGGTACATACAATCCCAATAATAAAAAACT AAT

CTTGTCCCTACATCATTTTCAAAGAAGTGCACAACCCAAGCTAAATTAAGGAATCTT TTA

CCAATGTATTGTTATCCTTATAAAAAAGAATTATATACAACTATACCTCAATCCCAA GCAA

ATCGGGATCAGCTATATGAACTTCACAACACACACACACACACACACAAAAAAAAAA AA

CGTGCACATTATAACAAAGCCAAACATTATCTCAACAAACCAAGAAAACATGATCAA ATG

CAGACCTGTAAAGGAACTTTGCAGGACTAATAGGGGTCCGATCGCCAGAGAAAACAG C

AGAAAACATTGCATCCAAATCACAAGCCAAACACAGCATTGTTGAGTTCTTATTCCC ATT

ATCACTACTACTCCTTGTTATAACACTGCTATTCTTTCGCTGGCAAAAATATCTGTT ATGC

TTGTCACTCAGAAAGTAATTCCTCAATGGTGGTGTATGAAGCAATGCTTGAAGCACT GA

ATTCATAAAACACGTGTTTCCAAGATTGTTAAGGCCCCTCAAACCCCATTGTACTTC TGG

GGTTGTTGAGTCATTGCCGAGCTGACTCGGCAACGGACTCGAGTTCCCAACGATCAA G

ACTTGCTCTTTCACATCAGGCGTCCACGGTTTGTACTCCACGCGCCTCCTCTTGCGC GT

GCTCTCCGGATGCGGCGGAGGGTCTTGTATTGATCCGATCACGGTGGCCTCCGTCTG C

GCTAGTGCCACGGCGGCGTCGAAGTCGCTATTGTACACCTGGTCCCTACACCCGCAG C

AGAACAGCTCGGCCCGGTCGATGTCCACCGCGATGCAGTGCAGCGAGGGGTCCGCAG

CGTTTCCCGCCGGATGTGACGGCGCGTGCACGCGGCAGAATACCTCGGCGCACGTGA

CGCAGGCGTACAACCGCGGCGGCGCGTGTCCGCACGCACCACATCTCACCAGCTCAT

TCGGCGGCTCGCGGCAGAT

SEQ 66

TCATAACTTACTGTGCACGAGCTTATTTGAAAGACGTTCAACCTTGCCAAGTTGAGTTGC

GCTCTTCAGCATTGCCTCGCTGTCAGCTATAAACTTTGCAAACTTAGAACCCTTTGT ATC

ACCATCACTATCTCCTCGAGGAAGCATTGGCAGATGAAGCATAGAGGGACTTTGCTT G

G AAGATAAAGG CG G AGTTAATG CCCACACAGAAGAAAG CTGTTCATCTG GTTTGTCAAG

ATATTCCAATGATAGATTCTGCATGTCTGTNTAACGGTGATAGATCCTGCATGTCTG TCA

GGAAGAACAGAACAGAAATATCAGACGTTGGTATCGTGCCTACCAAGTGTTATTCGG AA

CTG ATTTCTTG AG AGTTG CAAAACTTATTAAG AG CTTG CAG ATC AAATGTTACTGCTTTT

ATTCTTCATACGGGAAAAGACCACCCATTATCTGAAAATGGAAGTATCAGGAACAGT CG

AATAAAGTACCTTCAACAAACTTG AAG ATG G GCTCTAAAG CTG CACATG GAATG CTG AA

GTTCAAATGTGGAATGACCGTCCCTCCACCATGTCTAGCATTACTGCATCAAAAATA AAA

CTACAAGACATGAAGTTTCAACAAGAAAGATATTAATAGATGGAAAAACTATAGAAA GAC

CATGATATGTGAAGATTAACATAAATGAGATGACTAAAAGATCTTCTGTATGAGATC ATT CAAATTGGACACCATTATTTTTCCTGTATGAAAAGCGTATAACCAATATACATTTTTGGT A

AGGACAATTAATATACATTCAAACAGGAATATCTTTCTTCAAGGACTTCTCAAAGTA CTC

CAGGACGCAGTGTACCACAATATCATTAGATTGAACTTCAAGAACAAACACATGTAA ATA

TACATAAGCTGAAAAAGAAATATCCTCAATTATAAGCATCCCCAGTTGTCATCCAAA AGT

TAATTACCTTGTTACAAGGGCAATCATGTGTCCTTCTGAATTAACAACAGCTCCACC ACT

ACCACCAGGGTGTACAGCAGCCGTTGTTTCAAGCATTGCCGGAAAATGTTCTCCTAA AC

TTGATTGGTTGAGCAGAGACCGCTTGGCTTCAACTACCTTAGCTATTGCACCCACAC AA

G CAGATG GAAG GAAGTCTATATTATAAG AAAAAGTAAG ACAAATTACAAATACAACTAAA

GAAGTCTAATATAGGTATGAAACATATGAAATAAATATACGTAGTATATCATGTTCA AAAT

GAAAGAACTTAACAAAATTATTCACATGAAAAGCAATTTAACCTCCAGAACAACATG ATT

TAGTACTATTGGGCGCACAAAGATAGTCAGTTCCAGAAAATTATGTTCAGCAAAGGT TAT

GGAACAGACAAGTTATCTGTATCAACGAAAAAAGATGGAACAGACAAGTTAAGATTG CA

TCAATAAACAATAGTAGCACTTGCAACAACCTAGCTACTATTAAAATATCCTTGAGA TAC

AGCCCGACTCGAATAAGTGAGTTACCAGGAATTTCCTATTCAAAAATCCCATTCTTT AAA

GCTGATCATTTGTACTTGCTTTCACAATAGAAAACATCAATTTAATGCTCCAGAAAT TTAC

CTTTTTTCGTGATAACTGATAAGTGACTTCAAAACTCTAGATTTGATTCCCCAATTC CACT

TTGTTAGCATAGGTATTAGGTATATATCATTCTTATGGATGAAGATCTGAATTAGTG CCT

ATGGCTTTTATTAGCCCACGAAAGGAAAACGCTTCTTTTTAATTTCGTCTACCTTTC TCC

TTGTTCTGCTAGCCTTGTTTGAGCCCTACAACAACCTCGCTATTCTTAATCTGACGT GCA

ATTTTTTTTAACCAGAAGATCAAAACACTGACTTGGACTACAAATCAAATTCAGTAT CAGT

AAACAATGTCTTCACCTAAAAGATTACCCAGTTTTGAGCCCTCCCGACCAATCTGGT TAT

TATTTCTCCATTGGAAGACACCTCAAGTTCCCTTGCGGAGGCGTCGACATCACCTCA TA

ACTACTCAATCAGTCATCAAATGATCATCATTTGGTGAAGGAAGAAACATCAAGTAT TCC

AGCAGTAACAAGGAGATGAAAATGATATAATACGACCCAATCCTGCAATTGATTATA ATG

ACACTTCAACAAATTCTTAACACAAGAGAAGCAAGGTGGAAGAGAGAGAAATTCAAG AT

AAACAAAGTTTTTGTAGAATATTCTAAAATTTCAGATTTACTGTGATGCGTGTGTCA AAAT

AAAAGTAAAGGCAAAAATATTTTATTTAGACAACAAATCTAAAGCAAGATTTACCAC ATC

GTGGTCCAAATAGCCCATGTCCGAGAATGTATGCTTTTGATCCAGGGGACGGGCACA T

G AAGTCAG CAGTAATG G G ACAG AG CTG ATCAGG AACTAGCTC AAGTTGTAGTAATG CA

ACATCCAGAGGTCCTCTGGAGACATGAACTACCTTTGCATTTGTCCATACCCAGGGA TC

CATAAAATCCAAGCGAACACGAATGATCCTACTGCCTGTCTTTGCCAGGTTAACTCT AA

AGCTACCTTGCTCATTGTCAACCAAGAAATGCGGAGTTTTTAGTTCCTTTTGAATCA AAT

GTTTATTCCTACGCTGAATGTCAAATTTCTCAACCCCTGGATGCTCAGATTGATCAG AAG

GGATGAGAACTACATCAGATTTGGTGTTATATCCTGAACCGTTTACAGATGTTTTTC CAA

ATCTCCATGGCTCTAGAAGATGAGCATTTGTAAGAAGAAGACCCTGCTTGTTGAGCA AA

ACTCCAGAAGCCCATGCTCCATCATCAACAGTGATAAGGCAGATAGATGTCATTGCC TT

CTCAATCAAG G ATG G GG G AACAG G ATCTATTAG G AG ATGCTCTCG AGTATCATTG G AA

GGTCCATTCCGAATATTATTGGAGGGTGATTCATTTTTAACACTGATTAGGTTTCCA TTA

TCAAAATG G ATCTTTCTCCTAGTTTGTAGCTCTTCTTTAAG CAG G CTACC ACAAG CAG AT

GTAATAGCTTCCCATGGAATCACCATCTGCCACCCAATGCTTCAATTTCAAATTAAA GCA

ATTTATGTGAAGATAGATCTCAGCAAGAACATAGAGCACATTATGAAATGTTTACCT GAA

TTTCAGCAGCAGTAGCCCTTTGTCTGAGTGGCCGAGACAGAACACCAATAAGCTCTG C

ATGTTCCCCTAACACTGGGCTACCTTCCAT

SEQ 67

CTGGATATAAGTTTGAACATGATCCTTCCAGAGAGAACTTGTCTTCAAAACATCCCTTAG GGTTGATAGAACTTCAAAAGATGTCGCCTTTATAACATCGTCATCCTTGTTGTACGGCTG TTCCTGCAACATAATTAAGAAAGCAAATTTTAGAAACAATTTCAAAACTGAACATCAAGC TTTATGAGTTTATCATTGCTGGGTAGAAAGAATGCACAAAAAACTGAGGCATGCCTTGA GATGATCAACTTTCACTGTAAGGGGTTCTTCACTGACCTGCAAATGAATGCCACTGTTT

CAGTCCTCAAAGTGTTACTCTTGTACATATCACAAAATGGGCTATCGAGACGGGAAA AA

ACAACATGGTGAATAAAGTAAGGAAATCTGACAGTGAACATTCAAGTGTGACAAATC TA

CTGCATAGTCATCAAATTGTTACACTAGTATATTGAAGAAAATTTTTATTGAAACAA AAGA

AATAACTACATAAGGGAATCTGACTGCAAACATCCAACAATAAATAAACCCAGTGTA ATC

CCACATGTGGGGGTCCAAGGAGGGTAGTGTGTGCGCAAACCTTACCCTACCTTGTGA A

GGTAGAGAGGTAAACATCCAACAGTATGTATAATCTAATAGGAGAATTATTAATAAG GAG

CTCAGG G GTTG AGAG ATTTAAGTCCAATAGAAGAG GATAAG AG GTC CTCAATCACAAAA

TAAAAC G AAAAAATAAG AAG AC ATAAG AC C CTC C AATG G CTAAGTAAG C C C ATAAATG G

CTGACCATGCTTGAGTACTCCAGCTTTACAGCTAAAGCCAAGAATACGACTCTATAC CA

AATTAGAGCTCAAACAAATAGGATTCTATTCAATCTTTTATGTGTTTTATATATGTT TAAG

CCCCAGGGCTTTGGTCTAGTGGTAAGAGTACAGCCCGTGATATGTAGGTTGGGTGCA C

ATCACAGTTCGTGCTCTGGCGCAAACAAAAGCCTAGTATTTAGGTAGACAATGGTAG AA

TGGCGAGCCCATTATCCACCGAGTTTGAAACCATGAGCCACTTACCCTCAGATTTCT CG

GTTATCAAAAGTCTAGGATTTCATTCAAACCAAATGCATCGGCAATCAAACAGATGC TAA

ACTAGCACATCATAGAAGAAACTCAATAGCTTTTCTCTTCTACAACCCCGAAGTGCA ATA

AGAAAGACATTCCTATGACTTTGCAGGAAACCAGTTGTCCAAAATTATTACAAAGTG CTA

CTTTCTGTTGCAAACAAGCAATCAAGAATAACTAAAAGAGTGTTTCATTCTATTGGT AAC

ACATCATACACAACTGG G AAAAACAG CTCATTTAG CCTCAAATTG CAACAG ACCTCTATC

GCTTGACAACGCAGCTGCTCAGTTATATGAAGCCAAAAGGAGAAAGACTGGGTGTTG G

GACACAAAAGGTAGGAAAGAGAGAAAAGTATACAAGTGCAATGAACCCAAGATGTCA AA

TCGGCAGTCTCAAAATCAGGAGATGATGGATATGTAGATGAAAAATACAACTAAATA GG

AC AC AAAG AAAAC ATATG G AG CTACTTATAAC C AAAAAATAAC G C AG AC ATTTACTTAC C

ACCTCTGTCATACGTATCCGCCTGTGACCAATAAGAATAACCTGGTCGTCTTTAATA CTT

GTTATCTGGCTCATCATAAGTTGCATCAACCTTGATCAGAAACTTCTTCAATAACTT AAA

GCAACAGAATAACACAAGGAACAACAAGATGTACCTGAGCAAGTGTACCAACTTCAT GA

AGACGGTTCAACATGTCTTTTCCTTTAAGCTCATAGATGTTTTTTTCTGTATCTGAG GCA

GACACAACATTAGGATCAGTCCCTTGCTCGTCTTTCATAAGGAAAGCGCCAGCATAA GG

TGCTTGCCTTTTTCGACTTTCCAGCAAGGCTGCTAATACCTTGGGATCCTGCAGTTA AAT

TTACGGATTATGTGGAGCCTCAGATCAATTTGTTGCATGCCATTACAGTATAATTTA TCT

TTTTCTGGAAACTAACAAATGAGGCTAAAGAATATTACAATTGAGCAAAGACGCTCT CCG

AATTG AAG GTTTTAG CTTTTCAAG CTTACAACTTCAG CCAAG AACTTTCCATCAATTAAA

GTAGAGCATTCTTAAAGCTGAATGCTGGTTACAGACTTAAGAGGAAATAGGGTCCAA AA

CACTAGCTGAAACAGTACAACTCTCAGAAAGTTGACAGTAATTATACACTTAATGTT TAA

AGCTTATATCTATTTTATTTTGAAAATTACGAAGACTAAGACATGGTTTAACCCATC CTAG

ACAAGGTACCATACCTTCACATAAATATGCATATAAAACCCTGGAAATAACGGTCTG TGT

G GAAGTG G CAGTG CTAG AACCTG CACCAAG AAAACAACATGC ATCATTATAAATTATAA

AGAATTAAAAAAGTACAGCCCACACAGACTTTATTTATTTTTAGTATAATAAATGAC ATAA

AGCCCTTAATTGCAAGTCTTACAGGTAAATCTTATCCAAAGATAAAACCATAACCAA CCA

AGAAGGCAAGAAATGGGTCGCATCCATTAAATTTACAAACAGATGAATTATGAGCAA AC

AAGAAGCCTCCAAGAAACAGTGTTTTCTTTGCATTCGTTCAGTATCTCTCTTTTTTA GGG

AAGGTTGATAAATACCATTTTTACACTGATCTTGCAACAACTAGCAAAAACTGAAAT ACA

CATATGAATCTTCCTTTCAAAGAAAAACGAGATAGTAGGGCAACCTATTACTTTATC CTC

ACATGTACTTTCCAATGCTTTACTTCTTAAATAACCTAGAAGCTGTTGATTAAATTG AAAC

TAAAGCATGAAGTAACCCTCTCTAGGCCTCCAAATATAGGATGAAATGTTAAAAATT AAC

ATCCATCAAGGTGCCTTCTGTACGGCCAGAAGTCAGAGGCATGAGGCATAACTGGTC G

AATGTTGACTCTGAAAAGAGCATATGGGAAGGAGAGTATTCTAATTACAATGGACAA GC

AAACAAAAAAAAATTCTTTTGCTGAACAATGTGAATCACCAGCTTCTTGCAGGTGAG AAA

TCATTCAAAATCATTAGGAACTTTCTGAAGCTTCAGTCTTGACATAGAGCTGATCAA TTT

G CCTCTACAAGGTG GG AGTTACACATG GTCCAG AGAGAG ACAAG G CAG G AAGTACTCA

AGGATCGACGGGTTTCTAATATCAGGAGATTTCAAAGAAACAATCTCAACCAATACG GA TG GTTC AACTG ACAATGTG CTCAAAACAAATCAATCTTCTTG ATTATG G GATTCTCTGTA

AAAAAAATAAGCCATTTAAATCTGAAAATTGTGAGACAATAGAAGAAGCAGATGAAA AGA

TAAGTTGATGAATACTTATTACATAATACATGACACTTAATTTTCAGGTGTGAAAAG AACA

ATAATCTTTGATGTTGGACTAACTTAACATTAACATTCTACTAACTAACATTGCCAT TGTA

TTATGTGTCCTCGAACTCAAGGTAAGAAGAAGTGAAGAACACCTTGAATTTGCTTTT CTG

GAAAACACGGAATAGTGAACAATAGTCGTAATAGACTAATAGTAAAGCATCACCTAG AA

AGTGAACAGTTCAAATCGTGATCAGTCACCAGAACACCATAATGGATTTTTTCGCTG AAA

GTCACGAAAATAGCCCAGCTAAAGAAATCATTAATATTCAGCATAGACTAAGGGAAC CG

ACTCTCAATGACCAGAGAGAACGCGATACCAGGAATCTCAAAACCAACCTTGGTGTG G

CGTAAGGTGGCAGCTAATGTGGCTCGAATTGGCGGCTAAACAGTGACTAGAATAGCA A

TGAAATAGCGTCCAAAATAGATATCTGGAACAATAATCATAATACTGGCCTGAAAGA TG

GTAG AAAACTCATTG GAAG AATG G CCAAAATAGTTCTTTTACACTGTAAATG GG ACATG

GTTGGGATGGTGGCAGAAACTCAATGGAAGAATGGTCAAAAAGGAGGAATTTTACAG C

TAAAAAGATGACTGAGAAAGAGGCATGATACAGTCAAATGGTCTACAGAAATAGGTA TG

AGGTTAACGATCACCAAAAGTGGAAAGAATAGTAAAAAAGTTGACCAATCGGGTAGC AT

AGGCTGCCCCTTATTAAGATCACCAATGGTATGTGATACGATCTGAGAAAATAAGAA AG

AGGATGAATTGGTAACTTGATGAAAATTGGAATACAAGACACATCTATTTGTACCTA ACA

ATCAAATAGAAAAGACACATCTATGTGTAGGTGAGAAATACATTAACTTTATGATAT CTTT

CCAATGTGAGACTAATTTAACACTACTAAGATAACACACATTAACATTTGAACAAGA ACA

AGTG G GTG CTACAAG ATG GTCAAG GG ATGACAAG AATAATATCAC ATTG CCG AAAATCC

AAGCTGTGTGATTGGAAAAAAACTTGAAATTACTTACAGATGATCTTAGAAGATGAA AAT

AGAGAGGTATATGCTTGAGCGGAGATGGAGATGAAAGAATTGTTGGAAAAATTGAAA CC

TTTAGAAATCATTAG GTGAAG CAAAAGAG ATG ATTG CAAATGAATTACCC CTAATGGAAG

ACTTGATTAAG GAGTTGG CAGACTGTG CCTT AGTG GAG GAG GTATCAAG G AAGCAAAA

ATCAATAACATTGTGGTTGAAGGAGGGGTACAGAAATACCACATTCTTTCTTCCTCC TCT

TCTTTTTTTTTTTTG G AAGAAAGTG GAATACCACATTCTTTCATTG AAAG GGTAATG CTAA

TTCAAG CTACAATAGC ATTAAAAAAGTTG CTCACTG ATG AG AG G CTCTCTG ACG ATCCT

AAGAAGATTCAGGACAAGAAATCTTTAAAAAGACGACTAGATTTTAAAAATTTGTCA ATT

GTCATTTCATTAAAATACTAATGCTAATTGAAGCTACAATAACATTAAAAGTGTCGG TCAT

CGATGTGAGGTTGTCTGAGGATCTTAAGAAGATTGATATCAAGATTCTTTACCACTA CAG

CCTG ATTTAAAAAAGTCTACACTACTATTCTTTC GTCTCAAACG G AATGG CTG G GTTCAA

AATATGAGAATCAAAGCATCTGATAATAAAACCGTTCTTTTTGAGACAAAGAGGGTA TCA

AAAGCTGTTCTTTGGAAAATAAAATGTTGAAGACCTTCACAGCATTATATCACGTCA AGG

TTG GG AAG G CCTTTTACTAAAG AATAATTG AGTGTACAAGTCTAC GAG AAATATAATCTT

GGCGATCCACCTTTTATATCCTAGAAAGTTAACTTCTTAGCCAACCATCCCTTTTGA TTG

CCGGCAGAGTGTTGCAACCAAAGTTAAAAATATTTTCTATATCAAATTGATCCCAAT TCT

GCGTCAATGCCAACCACAAAGATTAGAAATTTCATCATCAGCGTCCATGTTTGCATC CAT

ATGTACAAATTTGTATAATAATGAATAAAATCATATCACAACTACCCAAAGAAGACA TAAA

ACCAGCACTAACAACAGAAGGAACTCACTTAATATCTTCTCTTTTGGTGCACAATCT GAC

ATGGCCACGCCGTTATAACAAAAAGTTGACATCTTTTCTTAAGGAGTCATTTTACCA GCC

ATCAAATATTG GAATAAAATGTACTTATATAATGATG GAG CAG G AAAATACTTAATTATGC

GCCAAATTATATTAGTTAATAACAATGAAGTGATAAATTGAACTGAGAATGTACATT AGTT

AACAAAAGTCATAAAATTTATTTTAAAGAGACGTGTAATTCTATTGGTGCGGGTGCA AGA

AATTATTATG AACTAACC GTAAG G CAATCTTCAGG CTTAAAAACAGTAG GAACAATAG CA

GCGGAAGCCTTAGAATCAGCATCTCCGCCTTTCTCGGCCGGCTTGGCTTCGGATGCA G

CAGCTTCGGAATTCGATTCGGACCCATCAGTAGAATCCGAACAAAAGAATCGTCGAG AC

AAATAAG GGCCCCTGC G ATTC G AACTTCTTAG C G AAC CTAAAAC C C GAAG C AAG G G C G

TATTCGAGTCAGTGCCACGGCGAACTTGAGGGGTAAATGCCGTTGTGACGGCGTGGA A

ACG ATTCTG CAG ACATG AG G ATGTG AG AG CCTTCAACAT

SEQ 68

TCACGCAGCTTCAGCAAAACCCAGTTGAAGATTACCATAGTCAAATACAGTATGATACA

CCCCCATGAATACATTACCAAGAATCCTGCATGTACATGAATCTTTTCGTATGAAAC AGA

TCGAGAAGCTTTTGAATAAATTGAGGTATTCTTGTCTATGCATGCAATTTCTATGAC CTT

ATTAAAGCGAAATGGTTGTAAGAATAATAAAAACGTACCAAAGAGGACCACGAGGTG GT

GGCACATCCAAAGCAACAAACCCACTGAGGCAAATGGTAGCAATCCCCTCTCCAGTT TT

CAAAATATACTGGAAACAGAAACAAAGCCACAGTTGAAACAAGGCTGAGTATGCAAC AT

AACAATGTAACATATATAAAATTAGGAAGAGAGGCATAACCTGATCTGGAGTCAGGA CA

AAATCCTTATCACCAATATTGAATGTAACATTTGGCATGGATGATATGCTATTGCAG TCG

ATTACGGATTGCCCCATAGGACTTGGTAATTTCTCACAAAGCTGTGAAATACCAAAC ATT

AGGCATTAAGACAATGCTCTGCACTACTTAAAGAGAAAATTCACTCACCTGATTCAC ATA

TTCTAGCACACTCTCCTTTGTTGTCTTCTGTTTCAGCTGGTTCTGCATCCAAATTAC AGC

CATCTCACAGGCAGTGCACAATGGGGCCTCTCCTATGGAACTTCCTTCATTTTCCTT CT

CAACCACACTTCTGATATTCGAGCTGCAAAATTGGTTGTAGATTGTCACATGGTTAC ACA

ATTCAACATGTATTGTAACAGACATAGATCCTTGGTTCAAATCTCATTACCAACCTT CAT

CATAATGAATTTCCAAGTACTAGGCCCATGAAAAGATCATACTTACACGTAAGAGAG CAT

GTCCAAGATATAATAATAATTGAATAAGATTGTGACATCTTTGACAGCTTAAGTTTT TAAA

CGAGATGGTTACACAATTCAACGTAAACAACATAGCTCTAGAGCCTTGACATATGTT CC

AAAGACAGATTGTCCAGTTCAATTTAGTAACTTCACCAAAACCACCTTAATTCACAT ATG

CTCCAAAGAACAGGAGAAAAGCATCCATACCTCAAATGCTGAGCTCCATTAAGATAA CA

TAAACCTACTTGTAAACAAATTTGATCTGGTGTGACCTGCAGACAATTACAAGTAGC ATT

ATATCTACAATGTCACATGCAGTAGTGGCCGGCTCAAATAAATAAAAGAAGAGAAAT AG

AACAAACCCCTGATACTAGTAAATCCCAAATCATTTCCCCATATTGAGAAATGGTTT CTT

TACATTCCATGCTCAATACTCCTTCTGCTCCAATGGCATGGTTGACTTGTGTCACAA CAG

CCTGTTATGTACTTGCTATAATTACTTTCAAGAATTTGTTCAAGATAACTAGATAGT CGAA

AGGTCTACAACTTCTGCAACAACCCAACTAAAGTAAATCATACAGTTGGACCAGCAA GC

AATGATGTTCCAGAATCAACTATAGCAGCACAACCGCCTTCACAGAAGCCTGTAAGG GA

AAAGTATTAAG AC AAAAAG AAATTATTCTTATCTTAG GCTC GTAATC AACAG CAAC GCTC

ACCTGTTGATTGGTTCCCAATAGAGAAATCTCCCATTTTAAACTGAAACAGAAAAAG GAA

AAG AACAGATGG G GG G ACC GTG AAAAAAG G CAAAAAG G AATG GAAACCATAGTG GAAA

AAACAATTCAAGTACCTGCCAGTAACCTTTCTGAGTCAAAGGAACATAAGTATGTTT ATC

CTTGAAGTGTTTTGGATCAACACCACCAAAAACAAGTTCACCTCCCTCTTTTGCATT TAT

ATCGCGATTAAGCCAGAAAGAGAACACAGGCTCCTTTACGAGATCTTGCTTCACCAT AT

TGTACCTGCACATATATCATGAATCAAACAGACAAGATTCCAAAAACTGAGAAGAAA GG

AAAACAAGATAGAATTGGTTAAAAACTGAACTGAACAAATCAATTGCAGGCCTTATT ACC

AGACAGGTGTAGTGTTTCCAACAGCAATTTCCTTGAAACCAAGCCCAAGTATTCCAT CA

AACTTTGCAACTATAAATGTAACACTTGATTCCCGTGTCGCCTCAATAAAGACCTGA AGA

AATTGATGTAAAAAATTCTCATCCATTGTGTTTTCAGAAGAGCAGAAAGGACCATAA TAT

GAG G CAGTG ATG ACTTATTG CCAAG CAAG ATTTCAC CTGATCCGTG ACTACAAGATCG C

CAACTTGAACATTATCTTGACTGAGAAATCCTGAAATTGATCCAGATCCATAGTGGA TTG

AACAAGATTCTCCTGGATTAGAATAAACATCAAATATAATCAGAAGCCATCAATAAA TAA

CTTCTTAGTCTTTCAATTAATGTGAAAGAAATATAACATTAAACTATGATATGAACA TCAC

CTTTTTTTGTGTATGTACTAGACTTCCTTGCCTTGTATTTGGAATGGATCCAGCATG CAA

TCTGGAAATAATTCAAGTTTAAGGAAAAATTCTGTATAAACCGGTAATTCAACAAAG GAA

CAAATAACTAAAAGAATTCAAGTTATATACACTAAAAGTGTAAGGATTTTTGTTACT ATCA

GTATAGTTTAACTTCTGATAGAAATAATTAAGTACCAATTATTGTTACCAATTAATG TTTT

CTTATAGAGATTTACATGTAATTACCTTATAAGTGACCTGATTATATAACTAACCTT TGCA

C C GTTAG C AC ATATAG AAC GTAAAC ATTAACTAAAAAG AAG C AC AACTTAC AG AG AAATA

ACATCTTGATGATGGAACCCAGAGATTAGAACTTCCTGTATCAAATATGACAGTGAA ATT

TTGAGGGGGTGAACCAATACTAATATCTCCATAATATTGAGCATCCAAGTAGTTCTT TAA GGACACTATATCTGAATTTGTGTCAGATTTCTTCTTCTTCTTCTTCTTCTCTATGTCCTT C ATCACATGCTTTCCATATCTGTCTTCAAGTCTTGCTACATTGGCTACATTTAAGCTACTG ATATCTAATTGTCGCTTCTTCAGACTAATTCTTAGCAAACTATCAGAGGAAGCAGGAAAT AC AAAG C AG G C AATG G C C AATAAAAG GAG AG C AG C C C AAAG ATG CTTC CTTTC C AT

SEQ 69

TCAAGATCTTATTACAACATACTTCTATTACAATATCTTTTTCTTTTTGTAATGGCTTTG AT

CCTTGGATGGAAAATACTATTTATCCTTCTTTTTGTGATAATTGGGATGTGTACATC TCAA

GTCACTTCTCGTAATATTCAAGCTTTATCCATGTTAGAAAAGCACGAGTTATGGATG TCA

AGTCATGGACGTACTTACAAAAATGAAGCAGAGAAGGAAAAGAGATTGAATATATTT AAA

GAGAATGTGAAATTTATTGAGTCTTTCAACAATAATGGGACTAAAAAGCCATACAAA TTA

G GC ATCAATG CATTTG CTGATCTTACTG CAG AGG AATTCTTG AGTTATTATACTACTG GA

CTTAAGTTGTCTAATTCCTACTCTCAAATTCAATCATCATTTAAGTATGAAAACTTG AGTG

ATGTTCCATCTGTTATGGACTGGAGAAAGAGTGGTGCTGTCACTAGAATCAAACATC AA

GGTCAATGTGGTAAGGCACAGTTTCCTATTCAAGAAAAGTTTCATATTCTCTTCTTA TTA

AGTG CTG AC GTAACTAGTAAAGTTG ATGATATGTG ACCAG CAG GTCACG GGTTCAAGTC

ATAGAAACATTCTCTTGCAGAAATGTAGGGTAAGGCTGCGTACAATAGACCCTTGTG GT

CCGGCCCTTCCCCAGACTCCGTACATAGCGAGAGTTTAGTGCACTGAGCTGCCCTTT TT

ATTAAGTATTGAGAAAGGATTTAAGTAAAATACTACATACTCCTTTCAAATTTGTGA TCTT

AAACATGTTTTATCATTGTATTATAACGGAGTATCACTAAGGTTAAAATGAGAATAT TAGA

AGCAAGCATACTAAATATAAAAATACATTCTTTCTGTAATAGACTAAAATGGAAAAT AAGA

TATGCATAGAGTACTCTCTTCTTGTCCAATAATGTTGACAAGGCACTTAAATTATGA GTG

TGTGAAGTCTCACATTGGTAACTGAAAAAATTAGGAGTCTACATATAAGCCTACATA TAA

GGTTTAGAGTTTTTTTATGGTGTGAGGTCTTTTGAAAAAAATCGTGCGGACTTAATC CAA

AGTGGATAATATCACACTATTCTAAGAGTATCTTTGAGCTGTTTTAGCTCAACAACT CGT

ATCAGATCCCAGGTTCTGCGGACGAGCATAGCGATGGCGACCTGTGGATCGTGGTAA T

AGCCACATGAAACTGGTTCGACGGGGAGACCCGTGGATCATGATCATGGTAGTGAGC C

ACATAAAACTTAGTTCGAGAGAAGGATTATTGGGTATGCAAACAAAGTCTCACATTA ATA

G CTAAAAAGTTTG G GAG CCTG CATATAAG GCGTAGAG AACTTTTAATATTGTG AGTCCT

TTTGGGGAAACCGTACAGTTTGGCCAAAGCGGACTATATCATACTAAGTTAAGAGTA TC

TTTGAGCCATTTTAGCCCAACAAATCATATATGATAATTTAAATTTGTTTTACACTA CCAA

TAATGTATTTGACCTACTTTGCAGTATAGTTACTATTTTTGTATGTTTATCATAAAA GTTAA

CCTTTAAAACAATACAAGTGATATGATTTGTATAAATATGTGCATAGAACTTCCAAC TCAT

TAATAAATTGCATGAAATATAGGATGTTGCTGGGCATTTTCAGCAGTTGCAGCCTTA GAA

GGAGCAAACAAACTCTCAACGAACAACTTGATTTCACTCTCCGAACAACAACTGTTA GA

TTGCACCACCGAAAATAACGGTTGCAACGGCGGTTTAATGACCACAGCCTACGATTT CA

TCATTCAAAATGGCGGCATTGCCACAGAATCCAACTACCCTTACGAGGAATATCAAG AT

TCATGCAAAAGCCAAGAGATGAACTCTGCAGTGAAAATCAATCGTTACGAAACTCTG CC

CTCGACTGAATCAGCATTGTTAAAAGCCGTAGCTAAACAACCGGTCTCTATCGGTAT TG

CAGTG AATG AAG ATTTTC ATCTGTACCAAAATG GTGTTTACAATG GAAATTG CG AG G GT

CAAGAACTAAATCATGCAGTTACTGTAATTGGTTATGGGACAGAAAATGATGGTACA AAA

TATTGGTTGATCAAGAATTCTTGGGGGACAAGTTGGGGTGAAAATGGTTACATGAAA AT

TGCTAGAGATACTGGAATTGAAGGAGGTCTTTGTGGGATCACCACTTTAGCTTCCTA TC

CTGTTCTT

SEQ 70

TCATAACTTACTGTGCACGAGTTTATTTGAAAGACGTTCAACCTTGCCAAGTTGAGTTGC

ACTCTTCAGCATCGCCTCGCTGTCAGCTATAAACTTTGCAAACTTAGAACCCTTTGT ATC

ACCATCACTATCTCCTCGAGGAAGCATTGGCAGATGAAGCATAGAGGGACTTTGCTT G

G AAG ATAAAG G CGG AGTTAATG CCCACACAG AAGAAAG CTGTTCATCTG GTTTGTCAAG

ATATTCCAATGATAGATTCTGCATGTCTGTCAGGAAAAACAAAACAGAAACATCAGA CGT

TGGTATCGTGCCTACCAAGTGTTATTCGGAACAGATTTCGTGAGAGTTGCGAAACTT AT

TAAG AG CTTAC G G ATC AAAG ATTACTG CTTTTATTCTTC ATAC G G G AAAAAAC C AC C CAT

TATCTGAAAATGGAAGTATCAGGAATAGTCGAATAAAGTACCTTCAGCAAACTTGAA GAT

GGGCTCTAAAGCTGCACATGGAATGCTGAAGTTCAAATGTGGAATGACCGTCCCTCC A

CCATGTCTAG CATTACTG CAT CAAAAAT AAAG CTACAAG AC ATG AAGTTTCAACAAG AAA

GACATTAATCGATGGAAAAACTATAGAAAGACCATGATATGTGAAGATTAGCATAAA TGA

GATGACTGAAAGATCTTCCATATGAGATCATTCAAATTGGACACCATTATTTTTTTC CTGT

ATGCAAAGCGTATAATTAATATACATTTTTGGTAAGGACAATTAATATACATTCAAA CAGG

AATATCTTTCTTCAAGGACTTCTCAAAGTACTCCAGGACGCAGTGTACCACAATATC ATT

AGATTGAACTTCAAGAACAAACACACGTAAACATACATAAGCTGAAAAAGAAATATC CTC

AATTATAAGCATCCCCAGTTGTCATCCAAAAGTTAATTACCTTGTTACAAGGGCAAT CAT

GTGTCCTTCTGAATTAACAACAGCTCCACCACTACCACCAGGGTGTACAGCAGCCGT TG

TTTCAAGCATTGCCGGAAAATGTCCTCCTAAACTTGATTGGTTGAGCAGAGGCCGCT TT

G CTTC AACTAC CTTAG CTATTG C AC C C AC AC AAG C AG ATG G AAG G AAGTCTATATTATA

AGAAAAAGTAAGACAAATTACAAATATAACTAAAGATGTCTAAATAGGTATGAAACA TAT

GAAATAAATATACGGTATTATATCATGTTCAAAATGAAAGAACTTAACAAAATTATT TACA

TGAAAAGCTATTTAACCTCCAGAGCAACATGATTTAGTACTATTGGGCGCACAAAGA TA

GTCAGTTCCAGAAAATTATGTTCAGCAAAGGTTATGGAACAGACAAGTTAACTTTAT CAA

CGAAAAAAGATGGAACAGACAAGTTAAGATTGCATCAATAAACAATAGTAGCACTTC CA

ACAACCAAGCTACTATTAAAATATCCTTGAGATACAGCCGACTCGATTAAGTGAGTT ACC

AGGAATTTCCTATTTTAAAACCCCATTCTTTAAAGCTGATCATTTGTACTTGCTTTC ACCA

TAGAAAATATCAATTTAATGCTCCAGAAATTTACCTCTTTTCGTGATAAGTGACTTC AAAA

CTCTAGATTTGATTCCCCAATTCCGCTTTGTTAGCATAGGTATTAGGTATATGATCA TTC

TTATGGATGAAGATCTGAATTAGTGCCTATGGCTTTTATTAGCCCACGAAAAGAAAA CG

CTTTTTTGTTTTTTAATTTGGTCTACCTTTCTCCTTGTTCTACTAGCCTTGTTTGAG CCCA

ACAACAACCTCGCTATTCTTAATCTGACAAGTGCAATTTTTTTTAACCGGAAGATCA AAA

CGTTAACCTGGACTACAAATCAAATTCAGTATCAATAAACAATGTCTTCACCTAAAA GAT

TACCCAGTTTTGAGCCCTCCCGACCAATCTGGTTACTATTTCTCCACTGGAAGACAC CT

CAAGTTCCCTTGCGGAGGCATCGACATCACCTCATAACTACTCAATCAGTCATCAAA TG

GTCATCATTTG GTG AAG G AAGAAACATCAAGTATTCCAG CAGTAACAAG G ACATG AAAA

TGATATAATACGACCCAATCCTGCAATTGATTATAATGACACTTCAACAAATTCTTA ACAC

G AG AGAAG CAAG GTG GAAG AGAG AG AAATTCAAG ATAAACAAAG ATTTTGTAGAATATT

CTAAAATTTCAGATTTACTGTGATGCGTGTGTCCAAATAAAAGTAAAGGCACAAATT TTT

TATTTAGACAAGAACATATCTAAAGCAAGATTTACCACATCGTGGTCCAAATAGCCC ATG

TCCGAGAATGTATGCTTTTGATCCGGGGGATGGGCACATGAAGTCAACAATAATGGG A

CAGAG CTGATCTG GAACTAG CTCAAGTTGTAGTAATGC AACATCC AG AG GTCCTCTG GA

GACATGAACTACCTTTGCATTTGTCCATACCCAGGGATCCATAAAATCCAAGCGAAC AC

G AATG GTCCTACTG CCTGTGTTTG CCAG GTTAACTCTAAAG CTACATTG CTCATTGTCAA

CCAAGAAATGCGGAGTTTTTAGTTCCTTTTGAATCAAATGTTTATTCCTACGCTGAA TGT

CAAATTTCTCAACCCCTGGATGCTCAGATTGATCAGAAGGGATGAGAACTACATCAG AT

TTGGTGTTATATCCTGAACCGTTTACAGATGTTTTTCCAAATCTCCATGGCTCTAGA AGA

TGAGCATTTGTAAGAAGAAGACCCTGCTTGTTGAGCAAAACTCCAGAAGCCCATGCT CC

ATCATCAACAGCGATAAGACAGATAGATGTCATTGCTTTCTCAATCAAGGATGGGGG AA

CAGGATCTATTTGGAGATGCTCTTGAGTATCATTGGCATGTCCATCCTGAATATTAT TGG

AGAATGATTCTTTTTTAACGCTGATTAGGTTTCCATTACCAAAATGGATCTTTCTCC TAGT

TTGTAGCTCTTCTTTAAGCAGGCTACCACAAGCAGATGTAATAGCTTCCCATGGAAT CA CCATCTGCCACCCAATGCTTCAATTTCAAATTAAAGCAATACATGTGAATGTAGATCTCA

GCAAGAACATAGAGCACATTATGAAATGTTTACCTGAATTTCAGCAGCAGTAGCCCT TT

GTCTGAGTGGCCGAGACAGAACACCAATAAGCTCTGCATGTTCCCCTAACACTGGGC T

ACCTTCCATTCCTGAAACATAAGGTACCATACGTTATCAGGTACCCAAAGATGGAAA GG

ATTAATTAACTAAAGTTGCAAG G CAAAAG CTCAAACCAGG G AG ACAACG GATGTCAG CA

ATCAACAGTG CTTTATTCTGTG G ACTAG GTG G ATAGCTGTTTG CAATG G ACCCAACTGA

TATGCTGTATAGCAACAAAAGATCAAATTGAGCTTTCACAAGGAGATGTAACAAGAA TCA

AAGTGAATATTAATCCTATGTATGATCACAACTTGGCAGACAAATAATCATCGAACA ACA

TCTGCATACACCAGATACAGCTTATATAGCTTACTCAAGTAAAGATGAGCTAAAACA TCT

TTAGCACTAGCAACAAGAATTACACCAGTGCTTTCATATTCAGATAGTCTATTAACA CAG

CACAAATGTTCAGTCATCATACATTTTGATGCTGACAAGCAAGGATGGAAAGATTAC AA

GAAAGTTGTAATTCCAACATATGATAGAAACAATACCAAAGCAACTACCTGTTGAAA AAG

TGACTGGGAGACAAGATACCGAAAGGAGAACCCATACCCAGAAGAAGATCACCTCTC C

TACTCCAG GTAG CCACTTTTAATG CAG G CAG GTC CTGAAAG GAAATACCTATG CAGTCA

TTATCATCAAAAACCTACTAAATG GAG CTTTCAAAAATTACCAG CAAAAAAG CAAG CCCA

TTCAGCAAAATGCAATGCCATGCTTCATTTCACAAGTTTGATGCCACTTAGTGCAGT AAA

AGCCTGAAACCTCATATGGATTTGAAGAAACTCTGAGAAGAGCAATTCTTGTAGTTG AT

GTTCCTATCACACTTGGCAGGCTGGACTGCGCCTCCATCATTGGAGTTTGACTTGGG AA

AGAAATTTTCTCAACCTGAAATTATCAGTTGACAGTTCCTGTTAAAAAATAGTCAAT CTAC

AAAATCAATCTCCGTGGTCCTTTGAGACTAGCATCTCATAGACCTTTGGCTATGAAA ACT

AGAATG CTTTGTG CAAAAATTGTTTG G GC CATCTTG AACATAGTACTAGG G ACAATTG A

GATGCCACCGTTGAAGAACACCATGGTAAGAGTTCCTTATGTCATGCCATTTTGCAC TT

GACAGCGTGAGACTCAAATGTGCTCCTTTCCAATAACACTGCCGAATGCTTATACCA TT

ACAAATATGTAACCAGAGTAGCTTACTGAGTTGGAGTTCTTAATATGAATTATTTTA AGA

AATGCCTCCAAGTTTTACGGGTGGTAGACTACCTTGTATTCAGACACGGGAAGGTGA TA

TTCAATCATTAATCTTTGAATTGGAATTAGGTGTTGAGACATTCTATGATGATAAAG AGG

CAACTATTTTGGCATGAGTAGAGACATGAAACAATGTGATGTCACTCTATTGTAAAT ATA

GGGAGCATGAAATAAGGAACTCATTACATGAATTTCCATTATTCTCCTATAAAAAGA GGT

TACAGAACATATAAAAGGTTCTGCTTGGAAACAACTCAACTGATAATGCAATTAATG CTA

AATATATGGAGGAAACTTGCATGTTCCACAACTCGAAGAAGTGGTGTGCAACATATA TT

AGTTCAAATCCATACTGCTCTTATGCCAGAAAAGAGAAGAAAAAACAGAACCCTGAA AG

TG CAAAGTG GTCATG ATTAG AC GTAAG ACAG G AATAAAATG G CATTTCTG CTCAAAAGA

AATAGGTGCTTGATTTATTTAGTTATTTAAGAATGATAAATGATATGCCTTTCAGTT GATA

GAACTTTAAAGTGTTAGCCTGATAAAATTATAGTTATTTGATGAAGTCTTTTGAAAT TTGA

ACCACAAAGTGGGATCATAGCAGAAGTTAGCTCATGAAAAATGACCAAGGATCTACA GC

ATCCAATCAAAATATGCATGCAAGAGAATTTGGCTTATCTTGGTCTCGGCGACATTT AAT

TATCTTTGAATGGAAGTATTGTCATGTTAATCCTATTTATGACTATGTTATGCATTA ATGA

AACAATCACCTAATTCAATAATAACACAGCTAGATAGTGTCAGGAGTATTTAGAAAG CGA

GTGAGAGGCTCGGACACATCTATCAATAGGTTGTACCAATCATATATAGATGAAGTA CA

GAGACTATGGTTACATATTTACTCATTAATACATAAAGGTACAGAGATTGTTATTGG GTA

CACATTTACTGCATGTAAGTGCCAAACAGAGGAAACATCATACCTGTGTGCGCTTAG TG

TTAGTAAATG ATTG AC G G G AATTTC C ATAG G CAG CTAG G G AC C AAC C AAC CTC C C AC C C

ATGTTCAATTGAACTAGACGAACCTTCAACTAGGGACTGGACAGCAGCAGATGACAC AG

GGATGTCAACCTTCAGCAAGAACACCAAATAGACGGTGTCAAGTGCAACTTTTACGT TA

GATCAGTAACAAAATAGCGACAAAGAAGAACACCAGCCATAAAAGTGATCAGTTAAC CC

ATGCTAAAAACTAGGATAAAGTTCAAAACTAGGTATTCCTTCTCCATTTTATACGGC ACA

CTTTCCTTTTTAGTATGTTCCAAAAAGAATAGAACCCTTCTCTATTTGGAATATCTT AAAA

CTTTAAACTTCCCACTTTACCTTAATGACATGCTCTTATAGCCATAGAAGTGTTATG AAAT

GTTTAAAACCACAACTTCTAAAG GTAATTTG GTATGTGTCAAAATCTTTTGTG CACG G GC

ACAAAACACCAGATGACATCAAACTAGAATTTATATGCATGCATCAAAATGAGAGCA CTT

ATCAATTCAATATTGCAAATAAAAAACATATATAAGATAAAGTAACAGGTTTTATGA TAAT CAGCATTCAGATTACAAAATCCTTTCAGTCTCCTACTAAATCCTACCACTCTTAGGAGTT

CTGCAGGTAGCCAGTTCAAACCCTCTTTGTTGGTCACTTTGATATCATTTTGCAATG TAT

TTCCTCCCTGTGAACCACGAGCAGGACATCCAATGTCAATTTAATATATCAATGAAC CTA

AAAAACGTATTTTCTTCCGTAGGCTCAAATAAATACCTCCCACAGTATATCAATTTG AGC

ACCAGGAATCAGCTCCGGCTTATCCTGCAAGATGGTACATTGTAAAAGAATTATGAA AA

AGCATAACGACAAAGAAATAATCATGTTGTTCCATTTTATACTACATCCATTCAAAA ACCT

ACCTTTCTATACATAGAATTTTTTTAATCTTGTAATTCTCATTTTGCACTTAATGGC ATGCT

CTTATAGGAATTGACATGGCATGCTAAAGACAACTAGATAACTTCTTACACATGTAA TTA

AATATGTGACAAAAGTGGTTCTTTCTTTATTAAACTCCTTCTCCAATCAAACACCAT CATA

TAAAGTGAAACCAAACAGAGGGAGTAATTATCAACTGAAAGAAGACTAAAGATCCAA AC

CTTTGATATGTCCCCTCTATCCTGTTGTACAACAAAAGGCTCAATAACAGAAGCAAC TGT

TAAAACC AAGAAGTG AC CTCCAAAAG AGTG CAACTTG CTTTCACCTTG AATCTG CTTAG

ACACTGAAGCATTAACAAAGGAACTGGGCAAAAGCATCCCAGATGCTGATAGTGTCG TC

TTC C C AG AACTAC AC C AC AAAATTAC AAC C C AAATTTAAAACTTTC AGTTC AAAAC AC AT

AACATAAACAACTATAATATAGAGACAGAGAGAGCTATGTAAAATCACTTACTTGTA CAG

GTGG AAAG CATGTTTTCG CATTTTTAG G CCTTTAG G GTC CTTCAAG AAG AAGAATTTATT

TGTATTTTTTCAAAAATTAATTAGTAGAATAAGCAAAGTGATTGAAAATTACAGTAT CTTG

GCTTAAGAAAAGGGACTCACTGGGCCTTGAATTCTGACCATGACGGCATAATTGCGG G

CAACATCAACCACTTCAGGAAGACCCAT

SEQ 71

ATGGATAACCCATCGGAGGATTCCTCGGATTCTCCTCAACAGCAGCCCGAATCTCCTGT

AAACGATGACCAACGTGTTTATTTAGTTCCTTACAGGTAAAATCTCCCTTCCCCGTT TTG

ACCCATTCCTCATGCAACTGTTTGTTTATGTATATCAACATAAAAGTAAAAATAAAT AAAA

ATAAAGAATTGAATTCTCGGATTTTGCTTTCCCAATTGATTTTATGATTTGGTTTGA TCCA

ATTCAGCTAAACCCGAATCTGAACCCATGAGATAACGAGAAAGTCGAAACAAGTTCT AG

TTTTTTTTTTCTTTTTTCTTTTTGTTTAAATTACTTATATTTTTATTTGTATTACTT GTCATTT

AGATTGGTAATTGTATTAGCTTCCCTACATTGGAATGTTGTAGTTTTTTTAATCAAG TCTT

ATTATCTGGATCAAATCGTGTTGTGAGTTTTTTTATTTTTTTTATTAGTTGCCATTT GGATT

GGTAATTGTATTAGCTTTTGTACATTGAACTAGTGTTGGTTTTTAATCAATGTTGTT GGTT

TTTGTTATCTGTTAACCGGTGGATCAAATCATGTTGTGGGTTGTATATTTTTGTTTT GTGA

G CTTAAG CATAAG AAAGTATCG G CCTTG GATTTTCAGTTGTGTTTTTTTGATG AAGTAAA

TAGTTTCACCAATGTCATCAAGAAGATGCAAGTATTACGAAAGATTAGGCCAGAGAG TA

TCAGCTTC AATTACATTG GTCTAG ATTG CTAAG GAG CTGATAAAGTCCAG AAAGTTAACA

GGGTAAGTTACAATAGATAGTTTTGCCAACTAAATAAAAGTAAAAGACACCTAGCTA TCA

GTTGTTAACAATGGAGAAGTAGTATAGCAAAGTGCCGGCAAGATCTGAAAGTGGTGG TT

ATAGGGACCTGTTTAATAACTTAGTAAACCTTAGAAGAAGCTGACAAATTGTTCCAT CTA

CAATTTGTCAACCTTAATAGAGGTGCACACAAGCTGGTCGGACACCACGGTTATCAA AT

TTTTTGTTTAAAAAATGTTCCATCTCGATAAAATATCAATTGATTATGCATTATGTT GTCA

GTTCAAATATTGTTTCTCGCAATTATTATAAAAAGTGCATATCTGTGGAGAAGTGCT CCG

CGGGCTAGTGCGGTGGTAGGGGAGAGTGGTAAAAATGACACAAATGATGCTTTCCAC T

TGCTAGTGGTTGTTAAGAAGAGAGAGAATGTTTGAGCGGGAAGGACGGGGTAAATAG C

ATGGAAATGTTAATTGAAAGAAGTTAAAAGTTACCCTTTGCAGCATCTTCTCTAGGT AAG

AATTTTTTGTCTGTGTTTTCCCGAGTAGAGGGTTAAAGTGTTGCACACACATATATT ACA

GGTGCCACAGACACGTATATGTTTAGAGTACTATATAAGAAAGCGTGTTTGTGTTCT AG

GTGGTGGAAAGAAGCACAGGAGTCATCACCATCAGATGGGAAGTCAGTGACTTTGTA C

G C AG C G G C AC C AG CTC C ATCTTAT G GAG G G C C AATG AAAATC ATTAAC AAC ATATTTAG CCCAGACGTCGCATTTAACTTGAGGAGAGAGGAGGAATCTTTATCACAGAGTCAGGAG

AATGGTGAAGTTGGGGTATCTGGTCGGGACTATGCTTTGGTCCCTGGCGACATTTGG C

TGCAGGCACTCAAATGGTCAGTATTTTAGAGCAGTTTCCAATTTGTATTCCTTGAAG TGT

GTTAGATAAAGCCTCTTCTGACGGAGATTTACGCCATAGTTGTTGAGCATTCTGAGG AT

ACCATTTGCATATGTGTTTTTCTCGACTTCAAATAAAACATTGATTTTTCACTTCTG GTTA

CAACAACCACTTGCAATTTGTTGTTTGGTTTCTTCTGCTTTTCAGACCATTCACATT TTCA

TTTCACATGAAAGAGGCCTCAAGCCTTTCGAGGCTTCATTGTTGTTGCTAGTCCGAT GG

CAATTCCCAGTTATAAATATATATTGTTAAATGCCTTGTGAATGCATATGGAAGCTC GTTT

TTTAAAGCATTTTGAGATTTCATTCTAAAAAGACCACTGTTTATTCTTTCAGCTTTA AAGT

GCTAAGCTCAATCTATTAATTCGCTTCCTTATTTTCTTTGTCTCTTTCATATATTTT TTTTG

TGTGTGTGGGGGGTGGGGATTGGTGTTAACTTATAACTGATTATTTCACTTTCCTTT TTG

GTGTTTTTGCACATCTAAGAAAGGGAATTTGTCTTTTGATCCTAGTAACATGTTATT TAG

CACGTTAATTTCATACATCTGGCACTATGTAAAAGTTGATCTTTTGATTATAGAGTT CTGA

TTAGTTTGATTGGAATTGCTCCTTTCCATCCAGGCACAGTAACTCTAAAGCTGCGGC TA

AGAATGGAAAAAGCTTTTNCAAAATTGCTCCTTTCCATCCAGGCACAGTAACTCTAA AGC

TGCGGCTAAGAATGGAAAAAGCTTTTCAGCTACAGATGAGGATATTGCAGATGTCTA TC

CTTTACAGCTGAGGCTTTCTGTTTTGCGGGAAACCAGTTCCTTGGGAGTCAGGATAA GC

AAAAAGGTTAATAATAACTTTGGAATTTCTAGTTTCATCTACAATTCCCATGAGATT TGTA

CTGTCATAATATCCATAGAGTGCATAACCACATGTGATCTTTTTGGTACAGACCTTG CCA

CAATTTGTGAAGCTGACTTTCTTTTTCATCAGCTGCTTTGCCATCTTTCATCCTTCA CTTT

TGTTGTTGCTGTTTATTGTTGAAACTGAAGGATCTAAGATGGACAAGTCCAAAATAT CAA

CTTTAAGAACAAGGGTTATGCATGAAGCCCTCTATTCTCATCCTCATATTAATTACT CGG

AATGGGCATAGCTGTTAAGTGCTTCCATTTTTGTTTGATATTTAAATATTAACAGTA AGAG

CTTTTTATG GTTCTG G GTTGTG CAAAAAG AGG ACAAGTATGTTAG CTGG ATAC GTATCTT

TTTGCTGACAAAGTGGGATAAATTTTGCATGGTAATTTTTGGCTTTACATGAATTTG TTG

ACAAGGATATCAATCCTATTGATATTTATGTAATTCACTTGAAGCATTTAATACTTC TATT

TCCCAATTACAGAATTGCTCATCAGCAATTTTTGACCACAGTGAGTTCTAGAGAAGC GA

GTCTTTTGAAAATGATAGTAGAAAGAGCGCTTCCTTTCTCTAGATTGTCTTTCTAGC AAA

GTAAATTAATCTAGAGTTAAATTGTATTGACAGGAATAAACAATATAAGCGGTATTT TCAT

CCAGACACCCCTCCCCTGTTTGTGGAGCAGAAAGAAACTATGTAATTGGGAATCACT TA

GTTTTGAGATAACATAGTAGACTATGTGACAGTTTTATTCTTTTAATTTAAAACTAA CAAG

TTGTCTTATATCTAAAGTTTTAGCAGAATCATTTATTCTGCCTCTAATAGCTTGGAA GAAC

TATATATCATTTGAGGTCTTTACTTGCCGAAAGACATCGGAGATGAAGTTAGTTTTT TATT

AGATCAGATATGAAATTAGTTCTAGCTTTTTTTTATATACTCAAGGATGCCCTGACT TCCT

CCATCTTTATCTATTTTTGAGAAATTCTCTTTCTTGACTGCCAAATGCTAAGAGGAA ATG

GTACCAAGCGGTTACATGCAAACCCTGGTCTGAAGAAGTAAAATAGCTAGAGTTCTA AT

TTTCATAAAGCTAATAGGAAATAATTTGATCACTTGTGAAAATAAGCCAAAAGAATG ATG

CTCATTCGAACAAAGTTCTCTTAGAGTTACTACATATTTCGTTGTGTATACATGATC CTTC

AATGCTACTTCATATTTATTATTTACCCGAAAGTTGATGTTAATTTGAGCTCTTTTT TTCTT

AACAATGTTATTGCTGACTTGTCCGTTTACTGCCTCAGCTTGTGTACATAAAAATGT AGT

TTCCAAAATGTTGTTTGTATTTTGTAACTGTTGCATGTTAAACATTTGCAATATTGC GGTG

GACAACGTTTCTTTTTTTTTTTCTTGCAATAATATCCTCACCGATGTTCTTTTTTTT TTTTT

AACTATCCTCACCGATGCTTTGGAGTCAGTTAAGCTGCTTTATTTCCTTTAGAGTCT AGT

TTTACATTTGTCTTCTCACATTTGATTCAGGACAATACAGTTGAATGCTTTAAAAGA GCCT

GCAGAATTTTTAGTGTCGATACAGAACCCGTAAGTTTCAATACTGTTGTTAATCAAT TGC

AATGGTATCTCTTTCAGGAGATTAATGGTATTTTGGTTCTCTGCAGTTACGGATTTG GGA

TTTATCTGGGCAGACGGCATTGTTTTTTTCAGATGAAAACAATAAGATCCTCAAAGA CTC

TCAGAAACAGTCAGAGCAAGATGTATGTACTTTCAACTGTGTCATACTTCATGACTA ACC

AATAAACAAGTCGACCAATGCTTCTGCGGCATTCACTATTTTTCCTGTCTTTACTAA GGA

AATAATTTAGTTATGCTTTTTTCTAATTGTTTTCTAATTAAGTGTTTTTAGCAGATT TTTCC

ATTTCTTTATCTAGTGTTTGTGTCGTAAAAAGATATATAATGATTGAGGTGATGAAT ATGC TTACTTAACACTTCATCTAGGAATGAAGTGAGACAATGATTTTTCTCCATTTTCTATATA A

GTGTTGTTTTTTCTTGAGCATGGACAATGCTAAGCCCACCAAAATTCAGTTTTATGC GAC

TCTCTTTCATTTTAGGGTTCGTTCTGGAGTTATTCATTTATAAGCAGTAGATGTGCT CTTT

CCTTGTACTTTCAATGATGTACACTCTAAGAAACTTTAGCTCTTTTATTACCCTGGG ACA

AAAGAAACACATAATAAACGGGACTGTCATGTCTAACGACCAGCTTATACACCCATT CG

TCTTG GAG AACAGG CG GGTAAATTCAGCTTAGTTATG CTGTTTTTCAG CTG G ATCTAAG

ATTACAAAAAGAGCAACTGTTTTGTTTTTTGTTTTTTTCCATTTGTGGCAGTTATTA CCGG

TGTGGTTCATCATTGATTGTTTTTGTATTTTCTTAGGCTGTTTATCTGTGAAACAAA TTGA

AAGAGATGCCAAAGTTTTATCTGTTTTATGTTTCTTTTTTCCTTGTGGGTTACCATT TAAC

TGAGAGCAAGGTAAACCTTTACTGTTGAAGGCATTTTGCTGGTTATGGGTTGCCTTA TA

GTCTTATTACTGACTCTTGAATTAACTCTAGAATTTAGTGTTTAATGGTTCGCACCG CTT

GTAAG CAAGAAATG ATTTG G AC AAACTTCTTATTTTGTCCTCTTATGTTTTTG CTTG CAGA

TGCTCTTGGAGTTGCAGGTCTATGGGTTATCAGATTCTGTTAAAAATAAAGTGAAAA AAG

ATGAGATGTCAATGCAATACCCTAATGGTTCTTCTTTTCTGATGAATGGTACTGGCA GTG

GTATAAC CTCTAATCTCACTAG GAG CAGTTCTTCATCATTTTCTG G AG GTCCATGTG AAG

CTGGTACCTTGGGCTTGACTGGATTGCAAAACCTAGGGAACACCTGTTTCATGAACA GT

GCTCTTCAGTGCCTTGCACATACGCCAAAGCTTGTTGATTACTTTCTCGGGGACTAC AA

GAGAGAAATAAATCATGATAACCCTTTGGGAATGAATGTAAGCAATCTTGAATATTT CAA

GATCATTACGTGCTGCTTTAGATGTTTTCTTCAGTTCTCTCTGAATAAGTCAATGTT GAC

ATCCCTTAACCTATTCTACATTATATGTGGTTGGAAAAGTAAAAGAAAAAGAGAAAT TCA

TTTGATTACTCTCCAGGTGAGGAATTCTTTATTTACCTCCAATTGTTTTGTTAGCCC GGA

CAAAAGAAAACGATATGCTTATCCGTTCCATTCAATTTAGTAGGGGTTGAGAAAATT GAC

TCGGAGGGTATTCAATATCTCCACTTTTTGTTTCGTACCAAACAAGGGGAATAAACT TTA

CCTCTTTTACTTTTCCTCCTCCTTCCACCTCATCTCATCCCAATCAAACATTGTGTT CTAA

TCTGTCTCCTTACATATTTTATTGTCTAAGTTCCTCTCTTTAAATTCTTTCAGGGTG AAAT

TGCATCTGCTTTTGGTGACCTTTTGAAGAAATTATGGGCTCCTGGAGCGACTCCTGT GG

CACCTAGAACATTCAAATTAAAGCTTGCTCATTTTGCTCCTCAATTCAGCGGCTTTA ATC

AGCATGATTCTCAGGTCCTTTCAGTCCTTCCTGTTGGATTTAGTTTCCCAGTTTTAG GTC

ACTTATTAACGCTCTCTTTTCTGTCCTCTCATTTTGTGGGCATCTTTTGACATCTAA TTCT

CCTATTTATATCTGCAGGAGCTCCTAGCTTTTCTATTGGATGGACTCCACGAAGATT TGA

ACCGTGTCAAGAATAAACCTTATGTTGAAGCTAAGGATGGAGATGATCGTCCAGATG AA

GAAATTGCTGATGAATACTGGAATAATCATCTGGCTCGTAATGATTCCATCATAGTG GAC

GTTTGC CAG GTAAGTAACATCC GATG GTCTCTTGTATCTCACTAG AAGTAG G AAACATTT

GATATCACCGGCACTCAGTGGTCTCTCGTCTCTCACAGGCAAATGTGAATTATTGAT CT

CATTTCAACATTGACTTGAAAAAGCAAGAAGAATGAAGTGGCATATTTTTTAAAAAT ATCT

GAACTCTACTGTATTTGTGCTGCGAGAGTTGTTCTAGGATGAGAGAGTAATTATACC CC

AACTGTTTG G G GAAGTTTAACC AGTGTTC CTAAAG CTTG CTTCAAATTTCTCAG ATATTT

TTGTCTAGATTCTCTGCCTTTTTCATCAATAAGATTTCTTACCTTACTCAAAAGAAA TTGT

AAGTAATGGAAATTGAATTCAACTCTTACCATAAGTAATGGAAATTGAATTAGTCGT CCC

TTTCAAATCAATCCGATAACCAACTTGGTTCAATAATTCGGAATAGTGGGAGTACTA TTT

GTTAACAACTGACATACTATTTTCCTAGAATGCAGTCCTGAACTAAGAGCTGAATTT TGG

ATTCAGGTTTATTTTAAAACAAATTAGTTTTATTTGTAGTGCTCCCTGTTTTCTCTA TATG

GTCTCCGCATTTTATCTTTGATGTCTTTTTGAGTTTTACTGAAATTTCCTAAAAGAA GAAG

AATATTCCAGCCTTTAATCTCCATAAGAAAGTTAAATTTTTGTTCTTCCATAGTTTA CAAG

TTTAATTATATAAAAACTTGAACCTACCTTATCAAAAAAGAAAAAGAATTATGTGAA AACC

TTGAACATTGTGAATGTTTCTGACATTTGTGCACTCTATAGGTGTGTTTGGAAGTAA TTG

GTCTTTTTATGTGTTATGTAGTCTATGGCGGTATGTAATTTCTAAATTCCTCTTTGG CCAA

TCATCTGTGAGATAAAGCTCTGCAATTCTAAGATAATTTCGAATACCACCAATTGAT GAT

AGCTTGTCATTTTTTTTAGTTTTTTTATTCTGTAATTTTTTGCATTAATAAAGTTGA CTCTT

AGCCATGTCTATCATTAGATGTCGTAAATTTTGAAATTTCACTTTAGAAACCATGTA CATA

GTACATGTTTCCTAGCAGGTCGTGATTTGTCTATGACTTTTAGGAGGATTTAAAAGT TTA CTTCATCTG ACTC CCTTTTTCTG CAC ATTTATACAAGTTTCTTTTCTAGTCTTG CTTCTCA

GAGCTTATCTCGATTGTTGCAGGGTCAATATCGTTCCACATTGGTCTGTCCTGTTTG CA

AAAAGGTCTCCATCATGTTTGATCCTTTCATGTATTTGTCACTGCCTCTTCCATCTA CAT

CTATGAGGTCAATGACTGTCACAGTTATAAAAAATGGCAGTGATATTCAGATATCTG CCT

TTACAATCACTGTTTCCAAGGATGGAAGACTTGAAGATCTTATTCGTGCTTTAAGCA CTG

CATGCTCTTTGGACGCTGATGAGACCCTTTTGGTGGCTGAGGTAAAGTGCAGAATTT CC

AGTGATGAGAAATGGTTATGGATTTCAAGTTGTTGCTTTATTGTTTCCTAAATAGAA CTTA

TTACATACTGTGTATTGGATAGTCAAGTAGAGTCCTTTTTCCTATTTCCAAAATTTT ATTT

CCAGCTCTTGCTGGGTTGTTGTTGTTGTATTTCCAGCTCTTACTCCATTTAATGTTA CAG

ATATACAACAACCGCATTATACGTTATCTTGAGGAGCCAGCCGATTCATTATCCTTA ATA

AGAGATGGTGACCGACTTGTTGCTTATCGGTTGCACAAGGGTACTGAAGAAGCCCCC T

TGGTTGTGTTTACGCATCAACAGATTGATGAGTATGTCTTGACTTCATAATTTGGGC ATT

ATCTTTTTTTGCTTTAAAGTTCATCAAACATTACTAGCCATTACTCAGATGTGTCTT GCAT

GCACAGCTATGTTTCATAAGTAATAAGTTGGGGGAAAAAGTACTCCAAGGGTGGTGC TT

CCACATCATCACTCTTAATC ATG G CAG G GTTTG G ATGTG GG CGTACTTGATG ACTACAT

TGCTTAAAAGAATTGACAAAATATTTTCGCAGATGACATATGTAGTAATATCTCAGT CTAT

TAGTTTGCTTTATGGAGATCGGGTGATTAATTCATGATCGACACAACTCCAGTTAGT TAA

TAGAGTAGGCTGTTAGTTGTCATATACTTCTATCTTGTATAAAGTAAAAATGTGAGG TGG

TTTATTTAGTACTGTTGAGCATCCTCAGTCTCAATTCGCTTCACTTGAATACATTAC AAAT

CATTGTTATGCATGGTTCGTCGAGCAACATGTAGTTCAGATGATGTGTGTGATCCTT CTA

GATTATTTGGACAATCATGAAACTATTGCTTCTTCCATGCATCTTACTGCTGAAGCT GTA

TATGATATGGAATTTCATGCTGTTTTGTTTGCTGATTATGTTTAGTTTAACTTTTGA TCCA

TTGAAAGATTCTCATAGTGGTCCTTGACTCATTAAATGAGATGGCTGATATTTATTT TGG

CAAAATATCATTTCCTTCTTGATTTCCTCTTCCATTCTAGCAATCTTATGAAGACTG CATG

TGCAGGCATTATATATACGGAAAGCTGACCTCAAACATGAAGACATTTGGCATTCCG CT

TGCCGCGCATAGTAGAGTTCTTACAGGATCTGATATCCGTAGTCTTTATCTACAGAT ACT

TACACCATTCTTAGTCCACAATACAGCCCAAGCAGATAATCTTAACTGTGATAGAAG TGC

TACTGAAGCATGTACAGATTCAGAAGTCATCACAGACATGGAACCTGGCAACTCAAT AG

TAAACGGGGTTCCAGAAAGCATTGCTGAAGAAGATACTGCCGAACCTTTAGACATGG AA

TTTCAATTTTACCTATCAGATGATAAGGCAACCTTTAAAGGCTCCGAGATTGTAATG AAT

GAGCCATTACAGTCCACAGATATCTCTGGACGGTTAAATGTACTTGTAAGTTGGTCA CC

TAAAATTCTTGAACAGTACAATACAGGCCTTTTCAGCTCACTGCCAGAAGTTTTTAA ATC

TGGTTTTTTTGCCAAAAGACCACAAGAATCTGTCTCTCTGTATAAATGTCTTGAGGC ATT

TCTGAAGGAAGAGCCTCTAGGGCCAGAAGATATGTGGTAAGTATGCAACTCCCTCAC TT

CTGTGATTGTACACCATTCATATGCAAGCTATGTATTCATAACATATGAAATTTCTC GTAA

TGCTTCCCTTTTTGCTTCTTCTTTGGTTTGTGCTAATATTATAAACCCTCAACTTTT GTAA

TTACATAATTGTATTTTTCCAATTATACCATTTTATTTCATTTCTGTCAATATTTTC CACCG

CGTCGTGATTGCTTATTGTGGATAAACCATTCTTATTAGCCTCCCCTCAACCAAATT GGA

CCTTGACTTTGCAACATGGCATGAGTAGTATCCTTCCAACTTCTAGTTCAGTTATGT TAA

AGAAACAATGACAGCATCTGATGATCTTATGTAGCACCTATTGATTTTCTGCACATT GTG

CTTCTGAAATGCTTTATGTGTTGCTCTTTGTTTCTTGTATATTCATAGCAACTACAG ACAG

TAATTGAGATAATAATATTCAGCTATTTACCTGGGGCCACTCATGGGAGGAAAGACT TCT

TTCATGAAATGTTTCTAGTATTCTATCTTATCATCTAATCATTTATTTGTGTCATTC GCTTG

GGAAGTATGAATATAATCTGCCTAACTTTCTTTGTCTTATATCCAACATTAGGTACT GCC

CTGCATGCAAGCAGCATCGCCAAGCTACTAAAAAGTTGGATCTTTGGAGACTGCCGG A

GATTCTGGTCATCCACCTGAAGAGGTTCTCGTACAACCGGTTTCTGAAGAACAAGTT GG

AGACGTATGTTGACTTCCCAACTCATGATCTTGATTTATCCTCATATTTGGCCTACA AGG

ATGGCAAATCTTCCTATCGGTATATGCTTTATGCAATTAGCAACCATTATGGAAGCA TGG

GAGGGGGTCACTACACTGCGTTTGTTCATGTAAGTGGTGCTGCGACTTGGATTACCT TG

CTTCTTTTTCTTGGTTTTGTTTCTATTCTATGGTAAATAGGATTCTTTTATACCTGA TAAAA

ATGGCATCTTAAGATCAGTACTTGGGGAGAAGGGTGGGTGGTGGGCGGTCACTGAAA C CTACCTCCAAGGGCAAATATAGAAATTTCCTCTATTGGTCTTATTCTTATTGTTCGTAGT

GAGTGTTCCTTTGATGTATTTTTTAGTTCCAATGCATCATCTGCATCTAAATTAATC ACAT

ATTGCACACATGTGCATCTATTATATATTTAACTTTGGTCATGCGTCTTCATTTTTT TTATT

TCTTCATCATGAAGAATATGCAAGAAGGTCAAATATTCAGACTTTTACAGTCTTCCT AGT

TTAATCCAGATATTCTAACTTTGTGTTTTTCTTCTTCTAATAATCTAGCAAGGTGCT GATC

GGTGGTATGACTTCGATGACAGCCATGTGTATTCCATCAGCCAGGACAAGCTCAAAA CC

TCGGCCGCCTATGTTCTATTTTATAGACGAGTTGAAGAAATC

SEQ 72

TTACACTTGCCTACTACACTCTCCTTTGCCAAAACCTACTCGTCGATTTCTCATATCAAA

TTCTACCCATAAATTTTGCTGGTGAAAATTACCAATAATATTGCTTGCTATTCCAAG TGAT

TCTGACCGTCCGATTCCAACACAATGGATCCCACCTTCTACTTCATCCAACATCCTT TCC

TTATTGATCAAAATATCAACCCCGTTTTCAAATTGCAATGTCATATCACCTATCAAC CGTC

CGATTTCGATCGGACGGTTATCGAAGCACATGTCGAGTGCACCACCATAAACGTAAC CT

TTTTTCAATCTTGGACCTACTAACCTAACAATTTCTTCTCTGACCTTATTGTACGCT TCTT

CCACTAAGAAAGTGTACTCCGTGCCGGAATCAATGATCGTCTGGCCGGAACCACCAG C

GTTTGGCCGGAAAACCCTCCCGGAGATGTTTAATTTTTTGCCGCCAATTTTTATCCC CA

CCATGCCAACAGTAAAAGCTAGTGGATCCAAATTTGGCATGCGTTGACTTTGAGGAA AA

GTCAAAAGATTTATGTATTGAAATGTATGGGAATTAGGGTTTTGGCCTAGGTAAAAT GTT

CCACTAGGTTTAACTGCATGGCTACCTTGTCTAATTGGCACGCAATATGAGAATTTT TGT

ACCTTAGCTTGGGAGGCAAAAGAAAACCGTCCAAGATTCATTCCCAAAATACCCTCA GC

ATCTTCGGACTCGGTCGCACAACCAAGAATCAAAGGAGGGGTACTTTGGGAACGTGA A

AATGTAATTTTTTCACGGACAAGATTACCCTCAGCTAAAGTACCATCAGCATAAAAG TAG

G AATAGTG G CACAAACG ATTTTG GTCACAAGTAGTTG GAAG G GTAAAATCG GG AATTCT

TGGCTTACATAAAGGATGAGTACAAGGAAGAACAGAGAAAGTAGAAGACAAAGAAGG A

TCAAACGACGTCGTTGGTGGGGGTCTTTTGGGAATTTTCTTATGACATTGAATCCAA GA

AAGTTGGCTACCAGTGTCCAAAACCATTTGTTGATTTTGTGGTGGTGTTCCTATTGG TAG

TGTAACAATTAAAGCCATTGAATATTTAAAAGTTGATTTATAGTTCAAAGATGGAAT TCTA

GACATAGTTTTTGTATTTTGAGTTTGTCTTCTATTATTAGAAGCCATAAAAGAAGAA AGAA

AAAGAGCTTTAGAAGAAGAGTTATGTGATAAAGATGTTGAAATAAGAGGAAATGACA TA

GAAAAAGGCTTATGTTTAATGGTTTTTTGTGCTGAGATGTAGAGAAAATTGAAGATT ATG

AG AAG AAG AAG AAC AAAAACTCTAG AAG AAG AAG C CAT

SEQ 73

TACACTATAATTATATTTTCGTTAAATATGAAGATTTTTTCCATATTCTCTTTGCTTCTT CT

CCTTCTCCTTCCCATCTTGGCTTCATGTCATGAAAAACAGGTACAAGCATATACAAT TCT

AGTTTCTCATTGATTCTTTAATCGCAGTTCTACTTCTGTTTATTCTTTGTTTTAATT ATGGG

GTTTTGTTTTG CAG GTTTATATAGTGTATTTTG GAG GACATAAAGG G GAG AAAG CATTGC

ATGAGATTGAAGAAAACCATCACTCATATCTCATGTCAGTGAAGGAAAGTGAAGAAG AA

GCCAGATATTCTCTTATTTACAGTTACAAACATAGCATCAATGGCTTTGCTGCACTT CTC

ACCCCACATGAAGCCTCCAAGTTATCTGGTATAATAACCACGAAAAAAGTTCACTCT TTC

AAAGAAAGAGTTTAAGTTACATATAGTAAAATTTAATTGGTTATAGCAGGTTATTGC TCTA

TTTTCTAGGTCAGAGTAACTTGTTTTCATATGTCAAATTAATCTGATAGTGTAAAAA ATCC

TGTATAAGAAACACAAGGTTCTTGTATGTAGAAGAACTTACCTTATGTATTATTTGA ACA

CAGAATTGGAAGAAGTGGTATCGGTGTATAAAAGTGAGCCAAGGAAATACAGATTGC AA

ACAACAAGGTCATGGGAATTTTCTGGAGTGGAAGAGTCAGTGCAACCAAATTCCTTG AA

CAAGGATAACTTGCTACTGAAAGCCAGATATGGCAAAGATGTCATTATTGGCGTTCT TG ACAGCGGTACATACATATATATTTGCTTACCATTATTTCCAATATGGCATTATTTTCCCT T

TGTTTTAAATTTTAAATGTATTTCCACAAAGGGCTACATAATCTAGCATGTGATTAT CGTT

TCTCCAATAGTGATACAGACAATCTTATTAGTAAGACTAATGCCTTGTATGTATAAT AGTA

GAAAGGGATAACACGTGAGGAATCAACCTATATATATATATATATATATATATATAA ATGT

ATTTCAAAAAATACTACTTATAG ACATATAAG GAAAATTGTG AGAAG CCTTGTAC CAAAG

GGAGTCTAAAGTTAAAATAAAAATTCAACATGTTTAAGGATTATGGTTATATAGGAT GGA

CGTGTAACTGTGTCTATCCTCCGGCTTATCACTGGCAACTGAACACGAGGGTTGCGC T

CGTTGCGGGACTCATTAATTATGAGATTATCAACTGTAACTAGTGTTAATTGACTAG TCT

GATACTTAAAAAAAAATTGGAGTATGATATTATGTGATGAATGTTGTTGGATGATTT ACC

AGG G CTATG G CCAGAATCTAAGAG CTTTAGTG ATG AAG G GTTG G GACCGATTC CAAAG

TCATGGAAAGGAATCTGCCAATCTGGAGATGCTTTCAACTCTTCAAACTGTAATAAG TGA

GTGTAATTCCTCTTCCATATGTTTTATATCTTTCCTTTAACTTTTTCTTTCTTTCTT TATCTT

ATCCCTTTTTATTATCTCGATGATCTGATGTCTACCTGTTTTACAATGATTTAATGT GGAT

TTTAGCCATTCTTGGGTTAGAAAATGTTCAGCTGCTCTACAACCCTAGACCACATTC TTT

TTGTTTTGGGAATTCCTGCTAAAATAAGCTGATTTACTACCTTAGACGTTTGGTTTA TCAA

ATATACCAACCTATACGTATTTCTTTATTTTTCTTTTTTTAATAAACTTTATTAAAT TTTATA

AGGCTGAGATGACTTTGAACGAAAAATATGATTCATTTAGTTTAAATCCAACTTATT TGG

AACTGGCATAATAGTTGTTGTTGCTATTAAATTTCATAAGTAGGCTTAATAAACATG TCAT

CAAGTTTTGTGCGCACCTATCATATGATGCCTTGTTTATCCAATTATGGATTTCAGG ATT

TGCTTGGAAATGAAGTGTTTGGCACTTATCTCTTGTCTCTTATACATTGATCTAACT TCG

TAAGATTAATTGTATTTAAATGGCTTGTAATAGAAAAGGCCAAAGGTCAATTTCAAG GCC

GATTTTTGGAAGTTTTCCTTTGTCTTCTTTATCAGTTGACCCTAAACCATTCTCATA ATTT

AGCTTAATTAAAATCAATTAAAAGAAAGCAGATACATGTTTAGTTTTTTAATCTTGT ACCT

CTCTAAAGAGTGAAAGAGAGTTTTTTTGAGAGGACAGGACCCATTGGGTGTCCATGC CT

GTCCTTTGGTGGCCTTAGGATATCAGTGTAATAATTTCAATATTGTCCATTTCAATC AAA

CCAAG AGAG GTTATG CTGACAAGTTG CTAATTGTTTTTTG GATTCTTG CTTTG CCATCTT

GTGAACTTTGTATCCTTCCAATGCTTTGTTGTGCAGTAATTTGTTTTTTGCATGTGT GTTG

TCATTATGGTTATTGTGAAGTCTATAGTGAAATTTTGTGAGGCCCTTACTTCCAGTT TTG

CACGGATATTCTCAGTAGTAGCCAGTAATATTATCCATTTTGACTATCTCATGACTT CCA

TGCAGCAGCTTTTTGACCTTTAGAAAGTTGATGATGAAATTCTACCATTTTAGAATG ATA

AGTCATTTTCTAGCTGTTAAGTCACAAAAAGAGCACTAGAGCAGTAAAACTTTTGAA GTT

TCATTGTGAGGTTGGGAGGAGTGGGCCTGATTATCACATTCTTGTCCTAATTTGTTA CT

GCTACTATCCTTTTTTTTTTTTCTTATTAAGAAGAAGAAAGCCTTTTCTTCCCTTCT TTTCA

AAGGGTAGGGGGTGGGTGGAATATATTAGCCTAATTTGTCATATTTTCCTTCTCGTA TAT

AACCATGCTACTATATATGTTGTACTCAAAATATAAGATTTTGTATACCTTTTCCTC TATA

TACTAGATAGTGTGATCCCCTCATGCATCATCTTCTTTTCTCTAGAAGAAAATGTTT TATT

CATGGTGACAGGGGAGGGAGAGGGTGGGAATGTTGGGATCATATCTTGATATCTTGT C

TAATTGATCATCTCAGGCAAATTTAGGGTGGTCATGTGAGTTAAACTAAATAATTTT ATTT

CATAGGATCAGCCCGCCCTGATCAAGAATTACTTACTAGCTAGCCAGACTAGTGGAG C

CCTAGCCGGAGACATTCTCTAAATCATGCCTTAACGCGCCCATCTTCCAAATAAAAA AG

GGCTAGTTAGTAAGAAAGATGGAAAGACCTTTATCCATAATTCTTTCCCAGTCTACC TCC

TTCCTTAATTGTGACATGTCCCGTTGATCCCACCTACGAGCTATCTGTCTTTGCCTA GCA

AGATAATTTTTGGTCTCCTATTCTTGCCTATTTTTATAGCCTGTCTTTATCAAGCGA GATA

ATTCTAGTTCTTTTATTTTTGCCTATCATGGTAGGAAATTGGTTCGGCTTGATTGAA ATTT

TTTAAAATGTTTACATATAAAAAGAGTACACGCATTCTGAACCCACCAACTCTAAAT CCT

GAACTTGCTTCTCCTAATTATGTAAGATAACTTTAATATTTATTCTCCTATGCTACT TTGG

G ACTTCTATTG CAGG AAAATAATTG G AG CTAGGTACTAC AT CAAAG GTTACG AG CAATA

TTATGGCCCTCTAAACCGAACTCTAGATTATCTATCTCCACGAGACAAGGATGGACA TG

G AACTCATACATCATCAACAG CAG GAGG CAGAAAG GTTCCAAATGTCTCTG CCATTG GT

GGCTTTGCATCTGGCACCGCCTCGGGTGGCGCGCCACTCGCACGGCTAGCAATGTAC

AAAGTCTGCTGGGCTATTCCGAAGGAGGGCAAAGAAGATGGAAACACTTGCTTTGAC G AAGATATGTTAGCAGCAATGGATGATGCTATTGCAGATGGTGTTGATGTTATTAGCATTT

CTATTGGAACAAAAGAACCTCAGCCTTTTGATCAAGATAGCATTGCTATTGGAGCAC TTT

ATGCTGTGAAGAAAAACATTGTTGTGTCTTGTAGTGCAGGGAATTCAGGACCTGCAC CT

TCTACATTGTCTAACACAGCTCCCTGGATTATCACTGTTGGTGCTAGCAGTGTTGAC AG

AGCATTCTTGTCACCTGTTATCCTAGGAAATGGCAAGAAATTTACGGTAACACGATA ATC

TATTCATTTTCTGTACACTATTTCATCTAAAATGTTGTAACACTAGGATCATAACGT TTTC

CTTTATCTATTTAATTACATTCATATTGGAATGAAATTGAATCCATTTTTCGTTTGC TTAAT

ATCAGGGACAAACAGTTACACCTTACAAGCTCGAGAAGGAGATGTACCCTCTAGTTT AT

GCAGGACAAGTAATCAACTCTAACGTAACCAAAGATGTAGCAGGGTACTCTCCTTGC CT

CAAAGTTTCAATATTTTTAATTAATAATCATAATTTTCTTTTGGTTGATTATGTTAA ACACT

ATCTGAAACTTTTTCAAAAAAAAAATTCAGGCAATGTTTACCAGGTTCCCTTTCGCC GAA

AAAGGCCAAGGGGAAGATAGTAATATGCTTGAGAGGGAACGGGACAAGAGTAGGAAA A

G GTG G AG AG GTG AAAAG G G C AG GAG G AATTG GTTAC ATACTAG G AAATAATAAAG C AA

ATGGAGCTGAATTAGTAGCTGATCCTCACTTTCTTCCAGCCACTGCAGTGGACTATA AA

AGTGCAATGCAGATTCTCAACTACATCAATTCTACAAAGTCCCCAGTGGCATATATT GTC

CCAGCTAAAACAGTTTTGCATTCTAAACCAGCACCTTACATGGCTTCCTTCACTAGT AGA

GGTCCAAGTGCAGTTGCACCTGATATCCTCAAGGTCAGAATTTACATAACAAACTTA AG

ATATTTACCTGACTTATGATTTATGCTTCCTCATCTAAATTAAATTCTGATTTTCGC TACTT

CCACAGCCTGATATCACCGCACCAGGGCTGAATATATTGGCAGCATGGAGTGGCGGA T

CTTCCCCAACGAAACTAGATATCGATGATCGTGTGGTTGAGTATAACATAATCTCAG GT

ACTTCCATGTCTTGCCCACATGTCGGTGGCGCCGCTGCACTTTTGAAGGCTATACAT CC

CACTTGGAGCAGTGCTGCAATAAGATCTGCTCTTATAACCTCAGGTACCTCTCAACT AC

TTTTGAACTTAACTTATATACACTAACTACAGTATTTTAACCTGTTATAACATATAT AGTTA

TTTTGCTGCAGGTGACCTGATAGTGTGTAAACATTATTTTACATTGTCGGTGTATAG AAT

TTAAACTCCTTTTTCGTCCAAAATTTTGTATTTTGAACTGATCAATCGTTATATTTT CAGCT

GGATTACGAAATAATGTTGGTGAGCAAATAACGGATGCATCAGGGAAGCCAGCAGAT C

CATTCCAATTCGGAGGAGGGCATTTCAGGCCATCAAAGGCAGCAGATCCTGGACTTG T

CTACGATGCTTCCTACCAAGACTATCTTCTCTTCCTTTGCGCTTCTGGTATTAAGGA TCT

TGACAAATCCTTCAAGTGTCCCAAGAAATCACATTTACCTAACAACCTAAATTATCC ATC

TCTG G CTATTCCC AATCTCAATG GTACTGTTACTGTTAG CAG AAG GTTGACAAATGTTG

GTGCACCAAAGAGTGTTTACTTTGCCAGTGCTAAACCTCCATTGGGATTCTCTGTTG AG

ATTTCTCCTCCCGTCTTGTCTTTTAAGCACGTTGGTTCGAAGAGGACGTTCACTATT ACA

GTGAAAGTTCGAAGTGATATGATTGACAGTATTCCGAAAGATCAGTATGTGTTTGGA TG

GTATTCCTGGAATGATGGAATCCATAATGTTAGGAGTCCAATTGCAGTCAAATTGGC A

SEQ 74

ATGGCAACACGTAGAAGCTCTAGCTCTGCTCTCACGGCCCTTGCGGCGTCTCGTTCCC

GCCTACTCTCGCGGTTTCGTCCTGCAGTTTCTCGTCTCTCTCAGAATACTTTACTCG GC

ACCGGCAGGTGTCCACCTCCCAATAGTGGATTTTTTGTTGCAGAAACAACTGCTGCA CT

TTGGCCGAATTATAACGTGTTGTCCAAAAGTTTCGTGCACTCTTACTCTACTACTGC TGC

TAGCTCCGGACAGGCACGACTTTCTTCTTCCTAATTGCATTCTTCTCTGTTCAACGA CTT

TTCTTCTTCCTAATTGCATTATTCTGTCCGTTCAATTGGAAGTGCTAATAGAATTAA CTCT

AATTG AC GTTTAG ATTAAACTTG AATG AATG CTGTTG GTTCTTTTATTTAG CTTTTG ATG C

GAAGTGAAGTAATCTCTATTTAGATATTGTCAGTTAGAGAACTATTTTCTCAACGTT AAG

G AACATCATTTC CAG CCTTTTTTTTTTTTG CAG AGTG GAAG CCTTAAATTGTGTATTTTTG

GACGAGAAATAACAAAAATGGTCCCTTATATGTGGGGTAGAATAAAATAGTCCCTTA ATA

TACTCCTGAGCAGTTTTGGTTCTTCAAGTTTGCAAAAAAGTGAGCAGTTTTAGTAGT CGT

CAATTATTTTAACAAACTCTGGTTGTTTAATTTGACGAATACGAAGTCGCATTTGGA GGT

GCATTTTTTGCCGTTTATGATATATTTGGTCTATTTCTGGTGTCTGATGGAGTTTCT GGG ATTTAATGGGTCTTTCCTAGTGGTTCTAGTTAAATCCTTTTCTTTTTCATAGTTTCGCTA A

ATCTAG CAGC CAAATTTTGATAAACAGTTG ACAAGATAAAAAATG CTCATG CGTGG CAAA

CATAG ATCCTTCTG ATAAGC GTCAAGCAGTG GAAAACACTTTTAG GTG CTG AAGTG G AT

TTTTATAAATTGGCAGTTACGTGTTAAGTGAGAAGTGGAACTGATAATCAATTAGTA TGG

TTGGTAAAAAAACTGTTGATAAACACTTTTTTTGCTAAAATAACTGTAATGACCTTA AAGT

TATTTACAAATTCTATAATTTTAAAGTATTTATTACATAAAAAGACGAAAAATAGAG GTAAT

TAAAAGTTATGTTAGAAGAATATATTGGAGATTACAAAAGATCATAGGGATAAAATC GTA

AAAGG CTTG GTCAAACAAAAAATGTTTATAAG GTATAACTTTTG ACTGATTTTG G CTTAC

AAGTTCTTCTCGTACGAGCACTTTTGATGTTTATCAAACGTGTAGATAAGCCAAAAT GTG

CTTACAAGCTAGTAGGACCCTCTTATAGCTTAGACAAATACATGTATTTAAGAGTCT ATT

TTATACCTACCTGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNN N AAAAAAATAAAAGTTACTACTTTTGTCATTTTCTAAT

TGTTGGACAGGCGGTTTCATTCTACTATTTTTTGACAATTTCGTGGGTCATCCTTGG GGT

AGGGTAGGCTGCCTTCGTCACACACCCTTGGGGTGCGACCCTTTCTCGGGCCCTACA T

GAATGCACGATGTTTCGTGCACTGGACTTCCCTTTGTGGAGTACACTTTATCAAAGT TA

CGTATAAACACATGCACTAAACTAGACTTTTGTGATAGTTGTTTCACTTGAGGCTGT CAG

TTTACTTCTTTTTTTGTTTGGCTGCTATTGGTGTTGCCATAAAAACTACTCTAGAGT TTAC

TATTGATGCTTAGAACTTATGGGGTGAATGAAAGGACGGGTGACTGAAAGGACATCT CC

TTCCTCAAACGTGTTATATCTTCTTAAACAAGAAATTTTGATGTACTCAAACAATAC TAAG

GGGGGTTGTGCAGATAAAAACAGCTATTTATAAAGGTCCATTGGTCTCCCTGGCCTT CT

CTAGAATGAGGAGTGCCATGATTGTAGATATTTACCTATACCATATCGCCATTTGTC AAT

TCTCTTGGATTTGTCCTGATGATTACTTATCTTTAATTTTGGGTAATTTTAAATTGT TTTG

G CATATTATTTGTTGTTATGTTAATTAGTATG AAG ATTTAAC AG ATTAATAATATG G ACTA

CACCGAGATGGCTCTTGAGGGTATTGTTGGTGCTGTAGAGGCGGCACGGACTAGCAA G

CAACAAGTAGTTG AGACTG AG CACTTAATG AAAG CTCTTTTG G AG CAGAAG G ATG G GTT

G GCTC GAAG AATATTCACTAAGG CTG GGTTG GACAACTCATCAGTTCTG CAAG AAACAG

ATCAATTTATATCTCAGCAGCCAAAGGTATGAAAAATGGAGACTGATTGTGGATTCT GAT

GAGTTCTTGGACTAGAGAATCAGATATTTTTTCTTGCAGGTAGTAGGTGATACTAGT GG

CCCCATATTGGGGTCACATCTTAGTTCTCTCCTAGAGAATGCGAAGAAGCACAAGAA AG

AAATGGGAGATTCCTTTGTGTCTGTGGAGCATATGTTGTTATCTTTTTTGTCAGACA CAA

G ATTTG GTCAAAAGTTATTC AG GAATCTCCAGCTTACG GAG AAG GCTTT GAAG G ATG CT

GTCAATGCTGTTCGTGGAAGTCAGAGAGTAACTGATCCAAGTATGTATATATTTATA TAG

CTTCATGTTTCGTGGCCATGTCTCTTATGATTTCATTCGTTCTGGTTGAATGGTAAA TCC

CAGTTGGCGAAAGGATCTTACTTTATCACAGCATGAGATGATCCTTACTTTGGTTGA TTG

GGTGGATCGTGCAATATTGTCTCTCTGGTTTGTCATGGACATTAAATTTTTCCTATT GAT

GGGGTGGGGGGGGGAGTCTTGTAAGGTGGTTAGAGGCTTAGAGCTGATAATATTTCA A

AAAGTACTTGGCAAAAAAGGCTAATTCCAAGTAGATGGTATAATACTCCATTCACAT GAA

CTTAGATTT AAATCTGAATTT GAAG AATGACTTTT GAT ATCAAAAGGTAGAAT GAG GAAT

TAGCAAAATTGCATTGAGGGAGTATGGATGTCCTTGATATATATGACAATGCATATT ACT

GTCTCTTAGTGACTACATCATGAAGTTTCAAAATGATTGACCATGCTCAGTGCAAAT AAG

ATATCTAATTGTGTCAAAATAGATTTTAGTATGGCTCCGATATTTATCAAGATCTTC CAGC

GGCTCTTCCTGTCTAGGAATGAGTTTGAAATCTCGATCCTAAGGGTGTGAACATGTT TG

CAATCTGACACGTTTGTTTACTATAGTCGTCAAAAGCATGCTTTGTGAGATAGCAAT GCC

TATCTTTGATTGTGCTACCTCAATCTCCTGTAATTTTCTCAAATCCTGAGCCTTACA GCA

ATTGAACAAGTGTTGCTAGAGTTAAGAAACTATTTGGTCATTGCCTTCTATAATAAA ATG

TCATCAACATAGTCCACTAGTAAATACATTAACCACATTGTTGTGAAGAGCGTGAAG TGA GAAAAAGCGACAACCCCCATTTCGCTTAAAGCGAGAAGCGTAGCACTCACTTTTTTGAA

GTGAAGCCGAATTTTCAAAAAAAAATTAAAATAAATACTGCATAGACAACACATGTA ATT

GTAAGCAAATGTTCAATACTTCAATGTAAAAACTAAATAGTAGCATCAATTAAAGCA CAA

AATGAGCATCCTATTCTTCTACAAGATTGTGAAATTCTTGTATTCCACTATCATTAT TATA

TTGCTCGTCATCTTCTTCAACTTCTTCTTCTTCATCAACTAGGGACAAAGTAGCCGC CTC

TTTTCCCTTCCTCTGTGAACTTGAAGTTGAGGTACTCCCCTTCAAACCATAAATCCT CTC

CCCAATTTCACACGCCTCCGCAACATCACCCCAAGTGAAATCAGAAGTTTCCTCAAA TA

CTTCTTCATCTGCATGATCTTCCGGGACTCCAATTAGCCATTCATTAGCATCATTGA TGT

TGTCCAAACTAATTGGATCAATTACATTGCGAGCATTGTAACGACGCCTCAATGTTC TAT

TGTACTTAATGAAGACTAGATCATTGAGACGCTTCAAGGTTACTTTGTTCCTCTTTT TGG

TGTGGATCTGCAAATAATAAGTAATTAGTAAGATTAGGACAATTGAGATATAATTTA ATTA

TCTCAATATACTAAATCTTTCTTCTTAGTTCTTACATGTTCAAACACGCTCCAATTC CTTT

CACACCCGGATGAACTACATATTAGACTTAGAACCTTAATGGCAAACTTTTGTAAAT CTG

GGGTGAAATGGCCATATTGCTTTCACCATTCAACTGTAGAAGTAGAACAAATTTTCA AAA

CATAATATTGATAAATTAATAACTTATTAACTATAAAACAACATATAAGCACTCGTT CACC

TGGTGACTTCGTCTTTTTTTGTCTAATCGCCATGTTTTTTCCAAAAGTTGCTCAGCA TTC

CTATAAATACTAAATTGCTCTGTTATTTTATCTTGCACGGATTCTTTGGGTATCAAC TTCT

CAGTACATTCATAGTATCCATTCCACAAATGTTCATCTCCTAGAATCCTCTCTTCAT TGTC

ATAAAACAGTTCCGGGTTCAAAATAAGTCCAGCTACATGCAAAGAGCTATGAAGCTT AC

TATCCCACCTTTTATTTATGAAATACACTTTTGTATTTTCTTTGATCACTAAAAGAG ACTT

GAATAGCCTTCTTTGCCCTATCCATTGCTTCGTACATGTAGCCCATTGGTGGCCTTT GC

TCCCCATCCACCAAACAAAGCACTTTAGCTAAAGGACCACCAATCTTCAATGCATGA AC

CACATTGTTCAAGAATGAAGGAGAAAGTATAATATCTGCAAATTCTCTCCCTCGAGC TTC

CCTTCCATAGGCACCGTTAGTGTACTCATCTGAAACAAACAACTTCTTCAAATTGCT TTT

TTGCTCATACATCCTATGCAAAGTCAAGAAAGCGGTAGTAAATTTTGTCTTTGCAGT TTC

ACCAAGCTTCTTTGTTTAGTGAATCTCTTCATCATATTCAATAACAAAGGCCTTTGA ACAA

TAGAG GAATG CACTCTAATTG CCTG ATTAAAG ACTG AATTG ATGG GTCTTTCCTTG AAAA

TGTCACCGAAAATCAAATTAATGCAATGTGCTGCACACAGAGTCCAATAAATATGCG CG

TACCCAACAGACATCAAATCACCAGCTTTAACATTTTCACTGGCGTTGTCCATGACA ACA

TGAACAACATTTTCTGCTCCAATAGAGTCTATTGTACTCTTGAACAAGGAGTATATT TTG

GTTGAATCAGTCAAAGAGTCGCTTGCATTAACGGACTCAAGAAACATACTTCTTTTA GGA

GAATTCACCAAGATATTGATGATCATTTTTCCATTTCTCGCCGTCCACTTATCCATC ATAA

TGGAACAACCAAACTTGTTCCATTCTACTTTGTGATCCTCCCCGATTTTGTTCAACT CTG

CCGCCTCTTTTTTTTAGATATAGACCTCTAACTTCATGATAAGTTGGAGGCTTCATT CCT

G GACCATATTG G CCTAC GACATC AATAAAAG CAG AAAAAGTGTCAGTATAATTAACACA

ATTAAAAGGAAGACCTGCATCATACATCCACCACGCAAACATTGTGACTGCACGAAC CC

TCAAAATCGCTTTGGCATCAATTTGAGGATTACCACTTTTTCCTCCCTTATCCCCAG ATT

TTTGCGGGAAGTAACAATCCATAGGACCTTTGGTCTTGCCAGTAGATCCACAGCTAG AA

GACATTGGTTGCATCCTTCCCTGCTTTTGTGATTTTGGTGGAAGCGACGAAGCATCG TC

ACCTTCTTCTGTTTCATCATCGTCATCATCATGATTATACAGTTCTTGTTCATGAAT CATT

TG AGTCTTTAACTCTTTTTTTTTTTG AAG GAATG CTTTCAATTCTG CCTTCACATG CGATG

GAACTTTAGGACAATATGCGACATTTGGATCACCACCGATTAGATGCGCTATTTGAC CG

ATAGATTCCCCCATTTGAAATCTTGTCACAAAAAAGACATCTAATTGCCATCTTGTT TTGT

TGGTCTCGCTAACTCTTTCAGAGTAAGTCCAAGCCGGATCTTTCCTATCTTCTTTTG GTG

CCATTAAAAGAACAACTACATAACACATAAGAAAGACAATAAGAACTAAGAATAAAG GAT

ATAAAAATTAG AG G AAG GG CAGTAATTCTCTTTTAAAAATTGG G CAG AGTTAAAAAAAAA

AAAACTAGGCAGTAATTCTCTGTTAAAAATTGGCATCAATTCTGAGAAAGAACCGAA GGT

TAAAAAAAAAAATTTGCCAGCATTTCGCTGCTTTTTGGACGTTTAAACAGTGAAAGA AAA

AGAAAAGAAGAAGAAGAAGGAGGAAAATCAGAACATACCTGTTGCTGTTGAAGACTT GA

ACTTGAAGTTGAAGACTTGAACTTGAAGTTGAAGTTGAACTCTTGAAGAAGAAGAAG AA

CCCCAGTCGATGCTTGTCGATGCTTTCAGAAGTTGATGAAGAAGAAGAAGTCGACTA GT GTCTCTGCTTTAAAACCCTAGTCGCATTTGTTCGTTTAATGAAAGACCAGACATCTTGTT

TTTAATAAAATAGGGTTGAGTCTGATTTAAAACACAGAAGCGATCGCTTCTCTCGCA TCG

CATCGCTTTCCTGCTTCTCGTTTTTTAGTGGGAAGCGGTCGCTTTTCTACACCTAAG TC

GCTTCACCCTGTTGAAGCGTGCACTTTCTTGCTTCGCTTCGCTTCTCGCTTAAAGCG AG

GAAGCGGACGCTTTTTTAAACACTGATTAACCATGACCATTATAATTGTATACGGGT AAA

ACCGAGCCCATGATGCACCTCGATTTCCGACAAGAGAAGCCAGGCTCGAGATGTGAT G

GCAAGGGACAAATATCAAGCCGAAAGTCCCATTGAGCCAGAGCCCTGGGACACGATG C

CTGCCCTCGGGAATATCGAGGTCATAATTACAGAATCGGTCCTAACCTCGAACAACT TC

GAG G AAC ATTATC G G AC G ATC AAG C GTAG C C AAC AG AAAG C C G AAATATC C ATG AC C G

GCCGAGTATCACGACGGGGATCTCGGCACGTATCGATAAGGAACCTTCAACCAGTTA A

TCAGAAGACCTTTTACCTTTTACAGAGTTGTACCTAAAGTAGGACTCCCCTACTATA TAA

AGGGGGTTTGATAATTCATGTAACACATTGAAAACACGCGTTCCAAGGAAATATATT ATC

ATTTTCTCTTTTATCTAGCTTTTTTCACTTGTTCATCAGTGTTGACTATAGCAAGCC CGGG

ATCGAGGGTGAACAATTTTACTAAGGTTGAATCTGTCTTATTCGCATGGTTTGAATT CAT

TTTATCTTTACTAGTTCAATCTAATCCAATTTATAGCTTTGTGTCAAATTAATCCGC GTAT

CCTTAAAACCACTTATAAATTCAATTGTTATCCGATTTTGAGGGTAAACAATAATGA AAG

GAAATTGAACGATCAGCTTCACTTTGGACACTTCCCATGATATTGGACCACGAACTG AA

TTAGAGAAATTCCAAGCCACACTCCCTATAAGTGTTTTAGACCATATAAGTGACTGT CAA

CAGCATACTAAGCCTGAATCCTTTGAGCAACAAGCTAGAGTGGTTGCTTCATGTACA CC

TCCATGGCTAATTCTCCATGCAAGAAGGCATTTCAGATATCCTGTTTATAAAGTGTA CAT

TAAAACATTGACGCTAAAAAGTTAAACAGTTGTGTGGGTGGATACCATATAAGTAGC GA

CGTCCAAAAAACACTTACCAGAATTAGCATGAGTTTCAGATTCAAATTCCTGCGGAG GC

AAAAGACACTAGGTGATTTCTTTTTGTATGTCCAAGTCTTGGTGAACATATGTGAGA AAG

AAAGAGTTTGAG AG AGAACTCAG CTTG ATTATTAC CTTAAG CAGTG GG CTTTAAATAGCT

AAGTGCACATATTCTAAGTATTAGATCAAGTTTGAGACCTAGTAAACTTCCACAGGA AAA

GGGATAGAGTCAGATACCTAAACCTAGTTATACTCTAAAGTTTTTATGTAATTAGAC CTC

CTTAGTCTCTATCCTAATGGTATAGTCTCATCTGTGCTGTCCCAAATTAATATATTG AAG

AGAAATTCAATCCCAAAGTTGTGTGTGAATTTATATGGTATCAGAAGCCATGTCGAT ATC

TTCCTCCTCCTCCCCTCCATCACCTACACTCGTTAACCCTCTTTCTTCGTCATCTTC CTC

GCATGCACCCCTTGACCATGCTCATCACTTCATTTCAGTTAAATTAACTTCTACGAA ATT

TTTCTTTTTTGGAAGACGCAACTATTACATTTTCTTCGAGGACAAAATCTCCACAAA AACT

CCACGGCTATATTGATGGAACTAATCCTTGCCCACCATCACACACTACGGTTGAAGG CA

AAGAAATACCAAATACAACCTATGTATAATGGATCTAACAGGACCAATTGATTCTTA GCC

TGTTGATTTCATCACTTTCCAAAGAAATGTTGCCCATGAAAATTGGTTTAAATACCT CCA

AAGCAGTTTGTGATGCACTTGAGGCAGCCCTATCCGAACCTTCAAATGCACGAATCC TC

AATCTTCATATGCAACTTTAAAACTTGAAGCAAGAAGATCTTTCGGTTACTCAATAC TTG

CACAAGGCCAAACTCATCTCCGACGAGTTGGCAGTTGCTGCAAGGCCCCTTCGTCTT G

CCGATCAAAATGTGTACATCTTTAAGGGACTGAGATCTGATTTCAAGGACATTGTTA CAA

CTCTCTCAGAACGACATGAACCAATCACATTCTCAGAACTTCACAGCCTCTTGCTTA ACC

ATCAATTTAGACATGGTTCCTCTATCTCCTCACTTTCCTTAACCACCCCAAAACCAC CTG

CTCTTCAACAATACCCACAGCTAACTTCAATCAACGAACTACAAATCTGATCGTAAT AAT

G GTTTCAATTCAAATAG GG G ACG AG G CAG ATCTTCGTGTG AAAGAG GG G GTAGAG GTG

GTTGTTCATCCTCAAGGAATTTCTCTAACAATGGACAATCTTGGTCTCAATATGATC AGC

GAACCCGGTGTCAAATATGCAATGGTACCAACCATCTTGCATCAACTTGCTTCCAGA GG

TACAATCACTTGATTAACCCTATGGCTTATTTGTCTAACCAAGCTCCTTTACCCTCA ACTT

TGCAATGGTTTGCGGACATTGGAGCCACTCGCTACATCACTTTGGATCTCACAAATA TT

CATCAAGTTGAAGATTATAGGGGTTCAGATCAGGTCCAAATTGGCTATAGACATGGC CT

TTCTATCCATCGCACTGGTAACTCCTCTCTCTGATCACCCTCTTGGTCTCTCTATCT TAA

GAATATCCTTCATGTTCCTTCAATTACCAAACGTTTACTCTTTGTTCAACTTTTGCT CGTC

ACAATAATGTCTTCGAACTTCATCCCTTTCATTTTGTTGTCAAGGATCTACAATCCA GGA

CACCTCTTTTTACAGGGCAGAGTGATGGCGATTTATACACACTTCCATCCAAGTCTT CTT CTTCTTCCATCTCCCAGCCAGTTCCAGCCTCTCCAACAGCTTCTCTATCCATCAACACAT

CACCTTCATGCTGACATCTTCATCTTGGTCACCCCCATCAACTAGTACTTACGCAGA TTC

TTAGGACCTACTACAATCTGAAATGAATGCTTTGCTACGAAATAATACCTGGTCTTT GGT

TCCTCATAATCTTTCAATGAATGTTTTAGGATGCAAATGGGTGTTTCTCATTTAAAA AAAT

TTCTATTGGGGCAATTAAGAGATGAAAAGCCCATCTTGTGGCTAAAGGTTTTCATCA ACT

TGAAGGCCAGGACTACTCTAAGACTTTCAGTCCAGTTGTAAGGCTGCAACCATTTGC AT

TGTTCTATTTTTAGCAGCTTCACATGGGCGGTCTCTCCAACAATTTGATGTGCAAAA TGC

AGTTTTACATGGTGAGCTTCAAGACCATGTGTTCATGAGCCAGCCTTCAGGTTTCAT CC

ATCCTCTTTTTCCTCATCATGTTTGTCAACGTAAGAAGTCACTATACGAGCTCAAAT GGC

TCCCAAGGCATGGTATATGCGTCTCCATAAGTTCTTGCTCAGCGTAGGCTTCATCAC CT

CTAGATCGGACACTTCCCTGTTTGTCTGCAACTCAAATGGTGTTGTCGCCTACCTCT TA

GTATACGTTGATGATACATAGTCACTGGCAGTGGTACCTCCTTTTTAGAATCCATTT TCC

TCAAACTTGGAGATGTCTTTTCCATATGTAATCTTGGTCCTCTCAGTTTCTTTCTTG GTCT

TCAGGTTTCACGTGATCACCATGGCATCTCTATGTCCCAAGCTGAACACATTAAGAC TA

TTCTTGCAAGAGCACGTATGTAGCACTGCAAACCTTTAATTACTCCCATGGAAGTGA AT

GTCAAACTTCACAATGGAGAAAGTCTTAGCTTTCATGATCCTACCTTGTACTGTCAT ATT

GTGGGCCTTACAGTATGTTACTCTCACTTGGCCGGACTTAGCTTTTGTGGTGAATAA AG

CTTGTCAATTCATGC ACAATC CTACTATG AGTCAGTG G GCAG CAGTCAAG CG CATACTC

TGCTATTTGATGCATACCCAACGTATGTGTTTTCACATTCCTAGGTCTCTTACACTC ACT

TTTCAAGCCTTCACACACTCAGATTGGGCAGGTTCACTCGATGATCGTAAGTCCACT AC

GGTTATGCCATTATCTTGGGTGAAGCTATTCTCATGGTCGTTCAAAAAGCGGCGCAT TG

TAGTAGATCTTCCACAGGTTCAGAGTATAAAGCTTTAGTAGATGCAGCTGCCGAGCT GA

CTTG GATTCTGTCTCTCTTGTTTG AG CTTG GTGTTTAACTTCC CAAAG CTCCAATTCTAT

GGTTTGACTACCTATCTTTCGGTAATCCTGTGTTTCATGCACGAACCAAGCATGCGG AA

ATTAATTTTCACCTTGTTAGACAAAGTAGCTCGAAAGGATCTCACAGTTCAATTTTT ATCC

TCCAAAGATCAGCTTGGTGATGTCTTCACAAAGCCACTAGCTTCCTCTAGATTTGAG TTC

CTTTG GTC GAAG CTCAATGTG GTTTATCCACCTCAGCTTGCAAG G GAGTATTGTATCAA

CTTTGAGTCCTGGTAAACTTAGGATATAGTCGGGTACCTAAACCTAGTTATACTTTG AAG

TTCTTATGTAGTTAGATCTCCTTAGTCCCTATCCTATAGTATATACTCATCTCTATA AATG

TACGACCGCTGTACCAAATTAATACATTGAAGAGAAATTCAATCACAAAGTTGTGTG TGA

ATTTATACTAAAAGGAAAGAGAATTAATCACAATGAAAATACAATCAAGCTATACTC TATT

TACAAGATCCTAGAATATTCTAGAATATTGACAAGATTCTAATAAGAGTTAGCCCGT ATC

TGTGCTGGTGGGAGGTAGCAGGTATCCTTTAGAATTAGTGGAGGTGTGCGCAAGCGC C

AGAACACCGTGGTTATTAAAAAAAATCCCTACGAATTATTGGTATGAATAACAACAA TCT

CCTCTATCATGGAATGACTCCATATTAGATGTATCAATAATTTTCCAACGTCTTTTA AATG

TGGAAACACTAGAAAAAGTTGACACATTGATTAAGAAAGTAACAAAATGATACGGGA AG

TACGAATATCTTTGCGTGAGGTAATGAATGGAGTTGTTATGTCTGTTGAAGGCATGA TTA

G GATTAGTG ATG G AG GAG GAG GAG AAAGTCATTGAATG CTG GC GATGTACTTGTATCA

TGAGAAAAATCTCATCTGTATTAGTAGGAGGATGTTTATGAACTGATTAGACTCGGA CA

G GACAAACTTGTGTTTTAAGAAGTTAGG G ACTAGTAG ATGTCAG GAG GG AAATTTTATTT

CATAG AATTTG GTG AAAAGTATAG AGATGAGTGG G AG GTTACATTTG CTGG G ACG AACT

TTACTTTTTGGGGGATTATAAACATTTGCAACTTTTCTTGAAAATGGGTGAGTAAAA AAA

GACACAATTCTGATTAGGATTGAGGTCTGTCTTTTCCAGTAGTGTGATCATGAAGGA AG

CAAGTACATCCAAGCACTTCCAATTGTAGATGGTGGAGATTTTGATCTGGAAATTTT AGT

GTGGACAGTAAAGTTGCAACATAGAAAAGATTGATTTGTACATTTGCCATCGATAAC AAT

GCTTCTAGCAACATGATTCTAGGGTGAATGGGTTTGTTGGGGCTCTATGAGTCTTCC TC

GAGTGGCGGTGAGGTCTAGAAGTCGATGCAAAGAGAATTGAGTGGTACAGTCAGTGA C

TATGACTCATTCTGTCAGCCTCTGTAATTTTTTGCCGCTATAGCTTTTCAGTAAGTA GTTT

CTTCCCTGGTTTTAATGTCATGAAAACTAGAAATAAAAGAATGAAAAATGATGGAAA AAC

TTGATTATTTCCTCAAGTTTGATTAAGAACTTAAACTAGTTACAATGTTGGTAAAAT ACAG

AATATGGTGAAGTAATTCTCCTAATGTGACTCATTCATAGCAACTGAGTTAACCCAA TAA ATGCAAATGACTGGACGATCTCGACAAATCCAAGTCTATCAATTTCAACAAGTTTCGCTG

CCTTAATCATGACAATATATTTAAGTGATGGTCATTAAATTGGAAAGAGTTGCTGTT GCT

CTTGTTTTTGCCATCATTCAGCTGTTCACTGTGGTAGATTATGGTTTCCTACCAAGT CCA

ATGAAACTGAGCAGTCTTGACAATGCCTGATGTTCAGTTTCTAAAGTCTGTTCTCTC TCC

AAAAGTAGAGGAACATAATGTTATCTGATTGGCTCGGGAAAAAGTTGTATGTAGGGT GA

AACCTTAATGACATAATGAAACATGTAATGGTCTTTGTGCCTTTGGTTCATTATGTC TGC

TACTAGATACTGAAATTGCTGCTGAAAGTGCTTTTTGAGGTGTCACTCATTTTTTCT TGC

TG CTATTAATACATAG CGTTCTG ATTCTTTTCAG AC CCAG AGG G AAAGTATGAG GC ACTT

GAGAAATATGGAAATGACTTAACTGAACTTGCCAGACGTGGAAAACTTGACCCGGTG AT

AGGAAGAGATGATGAAATACGGCGCTGCATCCAAATATTAAGTCGGAGGACAAAGAA TA

ATCCTGTTATTATTGGTGAGCCTGGAGTGGGGAAAACTGCAATTGCCGAAGGGTATG AT

CTCTAGCCTTTTTTGGTCTCACGGGGTGATGTATGAACATGTTTTTCCTTATATTTA TTTG

TCTGGATCCTGGTTCAGTTGAAAACAATCCAAAAACAGTAATGGAATAGCAGATCTG TG

GAGGACATTTTTATTATTACTTGTCTACAATGATATTCTTTTGGTTGGATATGCTGT TTTA

ATTAGTTTTGTTGAATAATGCTGCCTGGCAAACCAAACATTGAACTTTAAAAGAGTT ATG

TTTCTAGAAAAGATATGCTCGGAATAGCATGATTATCCACTCAGAGAGGTTTGTGAT ATT

TAGCAGACTGTATGTGGGGTTTACAGGAAAAAATCATTGCTAATACATATTGTTTCT GGA

CAATACGTTATTCCTGTTAACTATTTGAATTGTGAGATGGGTCAGGGTGTGTTATGT GGA

GTTGTTGAAAATTAGTAATTGTAGGATGGAATGAAACAATTATAATCTTTTTTGTTT ACCA

TGGTTCCTTGTGTTTATTTTTAATGAGCAGTGGCGTGGTAATGAAATGTAGTTATAG AAC

TTCTTACTAAG GG GTCTGC CTATATGTTG CAG AATCAG G ATCG CTG AC AAGGTTCAGTG

AAGAGTTCTGATGGATGCATTCAACATTTCTTGAAAATATATATCAAAATAATGCTG TTGT

TAGTGAATCACTGAATTTGTGTTAACTTCCCTTTGATTTGAAACTACCTTACCCGTA AGC

CTGTAAAATTGAGGTGGCTTGCAGAGCCTTTGAATGATCAAATTTCTATTTAATAAT TAAT

ATAATGCCAAAATTGTGGCAGCTTGTAGAGCCTCTGTGTGACCTCGTTTTAACTTTT TAC

ATAGGTCCAAGGTAGTGTTAGCTGTAATATTTGCGTGTACTTAAGAGTCCTGTTCAA GT

GGTGGCATTTTTCACGTCATCTACTTTATGCAATATGTTATGTTTCCAGCCTTCGTG AAA

TGGGGATGTGTTTTTGACAGATTAACTGATAAAAGTCAATCAGTTCTGTCTCTTGTA ATG

ATCTTTTTCCAG CAAAG GAG GCTTTTTATTTATTACTTGTTAC CAGTAC GTTATG AG ATTT

AGAGCCTTTGTGGTTTGGATATTTAAAAAGTTAATCAATACTTACTTTTTATTAGAT CAGT

TGTGCTAGTTGTAACTTACTTATCTTACATATCGGAATAATTAGTTTGGTTTAACTG CCAG

ACGAATGTGTTTCTGCACCGGAAGTGAATAATTCTAAATTGATATTGGATAGTGACT TAT

TTGAGTGTATCTGCAATATAGTTTTTTTTTGCTTAGACGTAGAGAAAGCATATGATG AGT

TGGGAAAAAATGTCGTTTAGTGGGTGCTTGAGAACAAAGAAATTTCTACTCAAAATA TAT

AAGTACCATAAAAGATATATATGAAGGAGTGGTCACAAGTGAGGGACCAGTGGGCGG A

GATATTGAGGAGTTCCTTGTAACCGTGAGTCCATAATAGGGAGCTGCCCCATAGTTG CT

TACCCTGGCTATGGAATAAGTTATTCAGTAGCAAATAGGACGGCATTATGGGATATG CT

ATTTGTCGATGATATTTTGTTATTTGACGAGGGGCAGGGCAAGAAAGTAACAATCAT AC

CCGAGAGAGTCAATTGTTAATTGACGAAATTAGTTAGGAGCCGATCAAAAGTTGGAA TT

ATGGAGAAGCACGCTAGAGAATAAGGATTTTACAATAGGTAGAAGTTAAACAGAAGA TA

TG CAATG CAAGTTTAG CTTTTG AAGTG AG GTTAG ATAG GATACTAGTGTCG AG AC AAAA

AATTCAGATATCTAGACTCGATCTTTCAATAGAATGACATGATAGATGAATATGTAA CAC

AATAATATGATGGCCGAATTGAAGGAAGCCCATGGAAATGCTTTGCGATAATGATAT AG

CTACTAAAGTAGAACATAAGGTCTATGGAAACTGGTGATATAAACAGGGTTATATAG GA

GTAAAGGCTGGAATTTTAAGACCCACAATATCGGCAAGATTAGCATCGTAAATATGT GG

TGTCAGATGGATGTACTATACGACCCAAAAATGGTTGAGTCCAAGTCACTACATTGA CA

AGATGAGCATCGCAAATATGTTGCGTCAGATGGATGTATACACACCAAAAATGGTTG TG

TCCATGTGAAGGTGCATGTAGCACACAGTGATAGTAAATTGAGAGATGGCCACATTT CC

ATCATGTCTTTCCTAGGCCTTCCAAGTGCATTGGTTTATTGGTGAGACTATGATGAC TAA

AGCTGTTGAAAAGGTATGAACTAGATCTAAAATTACATGAAGAGAAGTCATCTAGAA TCA

CCTACAATCTCACAGAATCTGCGTGGATTCATTATGAACATAGCACAATGAAAGCAA ATT ATCAAAAG CATG CAATAGTAATTAG CTAG AGTTG AAG CCTAGTCG CTATTG CTCTTAC AT

TAGGTTGTGTGTTTTCCAGGAGCTTTTAATTTGGTTAGAGATTTGTATATCAAATGT GAG

ATATAGAGACCCTCTAGTTTTAACAAACACCCTTAAGTTATCTCAAAAGAGAGCAAA ATA

GATAGATGATTCATATAACGATCCCAACTAGGTTGGATTTGAGGTATTGATTAGAAT GAT

TGATATAGTTATCCAAATTTTAAAAATCAACCTTATTAGTAAGGCAAAGATGCTCTT AACA

TGTTAAAAAGAAGTCGAACAAGAAATTGTTCTTCCTTTCCTTTGATATAAGATATTT CTCT

CCCACCATCCTGGAAGGAGAAGGTAAGGATTGTGAGAATGCATGGGAGAAAAATCTA T

CTTTTTAAATATATGATACAACGAGCCCTATATATAATATATATTCTACTCCTACTA CATAT

AGGACTAGGACATATTCTACTCCCACTGACTGGAATGATGCACACAACGGCTAGAAT AG

CCTCCAGAGAGGGTGGAGCAGCAACCGAAGAAAATGTTGGCGGGTTGGGCTGCCGGT

AGACAGAAGTTATTGAGTCTCTACTAAGAGAATAGAGGTTTTGGTATCATGAGAAAA GA

AGAGAGTACTTATTGACTTGATTATTGACACAATGAGAGTGTTTTTTATAAAGGATT CTTA

TTCTAGTGATATAAGCTTAAGTATTTATATTATGCTAATGATATGAATAGTGATTTT TCTCT

TGTAGTTAAGTAAAGAATACTCCAAAATATCATATAAAATATTTTACATATTCTCCT ATTCA

G ATACAATG AG ACTAG G CAATATTAACCTATAATTACTTTGATTCTTTTG GTATTTTAACA

CG GTTTTTAGTCTTTAAAAG AAATCAG G AAAAAGAG AG GTG AAAG ATG GTATCAGC CAA

ATCTATATGACAATTAACCTCGGAATGATTATGCTTTCTGCTCCTCATCCATGCAGG TTA

GCTCAAAGGATAGTCCGCGGTGATGTTCCTGAACCTTTGATGAATCGGAAGGTTATA AC

TTCTCTTCCTTGGTCTTAATTTGATTGACTTTATTTTATTGTGAAAAGGTCAAGTGA TTGT

GCATATGCAAGATCTTGGAAGATGGCCTTTGCGTAGTGTGCATTGGATTGCTTTCTC TT

CAATTTGAAAAAGATAGTATCTGGGAAAGGCATGTCTTGAAGAAATAAATCAATGGT TGA

AAACCTGCTGCCTTTCTCTTGAGAACCATGAGCCGTAACTTGTTGCTTACTGCAAAT AGT

TCTGTTTCTCGTGTTGCAGTGATTCATTTGTATGACGCCAGTATTATTGGATCATAA TTTT

CTTTTCAAGATATGAGTCTGCTAAAGACAGTTGATGCAGTCTTTACTCAAATGAACG ACC

GTGGTTAAGTAACATATTGTTTTAATCATTTCTGGGAGCAATTTCAGTCGAGTTTGA TAC

TAAGTGAGATATGCATAACATTGCACTCAAATGTCAAACTAAATTTTGATCATCTAA GCT

AGTCCGATGAGTTAGTCTCTTTTCTGAAACCCCCAACCCCTCACTCAGCAAAGGGGT AG

GAGAAAAGGAAACGAACCACCTCATTCTCTAAAAAGAGTGGAACTAAATTATGATGC CT

CCTAAAGTAAAGTAGTTGGACATAAGTTTCTGAAGTTTCAAATTTGAGAGTGGGATT CCG

AGGAAAGCCTTCTGTAGCTTGAACCTTAATTGTCTACTAGGATTTGTTAATTTCGAC TTT

TAAGTGCTAATACTGTAATTTGATTTCCTTTTTCCTTTTCATTTTATTTAGTTGATG TCTCT

CGATATGGGTGCCTTGCTTGCTGGTGCAAAGTACCGTGGAGATTTTGAGGAAAGGCT G

AAAGCTGTTTTAAAGGAAGTCTCTTCATCCAATGGGCAGATAATATTGTTTATTGAT GAG

ATACACACTGTAGTTGGTGCAGGTCTGGTACTTTTTTTTTAATATCCATTTCTCCAT GAA

GGAAGAAGTTTATTTCTACCGACTGGTTAGAAAATTTGCCAAATGTATTCTTTCTCT CTA

AGATCAAATCTCTATTATTTATAGAGTTCGATATTAAAGAAAAGTGCTGACTCAAAC TGC

TTGTCTGCTTTCAAATCTCATGGCGTGTGGACAGCAAGTAGTGACTGCTCTTTTTGT TG

GATGATTTTTATCCTTGCTTAATTCAATCTTAAACAATCCAAAGTTCATGATTTAAT TCATT

TGATGTCACTGGGAAACACTTGTTCTTCATTCTAGTGAGGTTAAAAAGCACTATTGC TTC

GTGATTGTGTTTTAGGTGATCCATTTTTAAATTTGAGCCTGGAGCTGGGAAATGCTT GAA

CTAAAGTTTACTTTTCTGTTAATTTGAGCCTGGACCTTGGAAATGATTGGAATAGGT TTA

TTTTTCTGTTCCATAAAGCTGATAAAACATTACTGATGATGTCCTTTATGTATGCAA CCGA

GTAAAG GAG AAG GTACCTTTACAC CCAAG ATATAGTTTCTGTTTGTAG CTGG CTTATATT

AGTAACTAATTCAGCGAATGTTACTTGGTCAAATTATGTTGATACGTTATATTCTAA AACT

TTTGTTTGTTTATACCTTCCTAATCATTGTGTTTGCTTATGCTTTGTGTCTAGGAGC TACT

AGTGGGGCCATGGATGCAGGGAATTTGTTGAAACCCATGCTTGGTCGGGGTGAACTT A

GATGTATCGGAGCAACCACTTTGAATGAATATAGGAAGTACATTGAGAAGGACCCTG CT

CTGGAGCGCAGATTTCAACAAGTATATTGTGGCCAACCATCTGTGGAAGATGCAATT TC

CATCCTCCGTGGATTGCGTGAACGATATGAGCTGCATCATGGTGTTAAAATATCAGA CA

GCGCTCTTGTATCAGCTGCAGTTCTTGCAGATCGATATATCACTGAGCGATTTTTGC CG

GACAAGGGTAGGCTAATGTATCCTTAGAACTGCAAGTTGTCTGAAATACTTGCTTTT CAT TCCTATAAAATTCTTGTGAACGTTTTTCATGATATCTTCAAATAATACAGCAGCCTAATG T

TACTTTTACATAATAAG AAAGTTACAG GGTTACAAGTAG CTTATTTTTATG G CTTCTTTAC

ATGTTTTATTG CATTG AGTG G ATCAATGG GTCCAG ATTTTCAAG CTTCTTCTAAATGTTTT

TAGCTGTGCGTGATCTGACATACGTTACTTGGGGCTTTTCACTTATGCTCAGTTCTT TCT

TTTCAGCC ATTG ATCTTGTTG ATG AAG CTG CTG CAAAACTAAAAATG GAAATTACTTCAA

AG CCAACTG AATTG GATGAG ATAGATAGG G CAGTGCTAAAGTTG GAAATG GAGAAACT

CTCCCTGAAAAATGACACGGATAAAGCATCTAAAGAAAGACTTAACAAGCTAGAAAG TG

ATTTGAAGTCCCTTAAGGCAAAGCAGAAAGAGTTAAACGAACAGTGGGAACGCGAGA A

AGATCTGATGACACGTATACGTTCTATAAAGGAGGAGGTAAATTGCATCTTTCATTG ATG

AGGTCAAATCAAAGTTGCAGTTTTTCTTTGTTTTCTCATGATTACTGTTCAATTTTT TCCG

TTGCGTAGATTGACAGGGTGAACTTAGAGATGGAAGCTGCTGAACGTGAGTATGACT T

GAATCGTGCTGCTGAACTCAAGTATGGCACCCTAATCTCCCTTCAACGGCAGCTAGG A

GAAGCAGAGAAAAACCTGGCAGACTACCGGAAGTCTGGGAGTTCGTTGCTTCGTGAA G

AAGTAACAGATCTTGATATTACTGAAATTGTTAGCAAGTGGACGGGTATACCACTAT CAA

ACCTTCAG CAGTCTGAG AG G GACAAG CTTGTCTTTCTAG AGAATG AACTTC ACAAAAGA

GTTGTTGGTCAGGATATGGCAGTAAAATCTGTGGCTGATGCAATCAGGCGATCTCGG G

CAGGCCTGTCCGATCCAAATCGGCCCATTGCAAGCTTCATGTTCATGGGTCCCACTG G

AGTTG G CAAAACTG AACTTG G AAAAG CTCTTG CTGC GTACCTTTTCAATACTG AAAATG

CTCTGGTGCGTATTGACATGAGTGAATACATGGAAAAACATGCTGTTTCACGGTTGG TT

GGTGCACCACCAGGTTATGTTGGATATGAAGAGGGTGGGCAACTCACTGAAGTGGTC C

GTCGGAGGCCTTACTCTGTGGTCCTTTTTGATGAAATTGAGAAAGCGCATCATGATG TT

TTTAACATTCTCTTACAGTTGTTGGATGATGGAAGAATAACTGATTCTCAAGGGAGG ACT

GTTAGTTTCACAAACACTGTTGTAATAATGACATCAAACATCGGGTCACATTACATT CTT

GAGACGCTGCAAAACACTCGAGATAGCCAGGAGGCAGTTTATGATGCGATGAAAAAG C

AGGTTATTGAATTGGCAAGACGGACTTTCCGGCCTGAGTTCATGAATCGGATTGATG AA

TACATTGTTTTCCAACCTCTGGACCTTAAGCAAGTTAGCAGAATTGTTGAGCTCCAG GTA

ATACAGATCTGTAATCTGTTGAATTCTGATTCTCCTGACTTCATACGTTTTTCTTCT GTGT

TGTTTTCTGTTTGCTGCGGTGTCATCTGCTTTCTGATTACTTTGACTTTAAGAGTTT TATA

AGCACTACAGCAGATTACTGTTTGTGCGTTATCTCTGTAAATTTCAGTTTTTCTGTG TGA

GAACAAAAAAATGTTTTAGTGTGCATTAGATCTCAAAATTACACATAAGTACATCTC ATTT

GCTTGGTGGTCGTCGTCCTAGTTTGTCCTCCTTGCTGCTTTCTGATGAGTGCATGGT TG

AGTATGTCAAGCTTGAGAACTGCAGCGCACTGCGCATCCTGTCTAATGTCTGCTCTT GC

AGTAGTTTTCTAACAGAGTATAATGTAAAATATATCATTTCATCTGGTGGTTAAGCT TTCT

CCAAGATGAAACATAATTTGATATCTGTTCTTTGTGGTTCTTAATTTGGGGAAAGTG TTT

GGCTATTTCTTATTTTAACCTTATCATCGCATCTGCAGCCATAGCATAACCTTGGTG AGG

TTCATGGGAAAGAAAGTTACCGAGGCTACCTGACAATATCGTTAACTGATGAAAATA TTT

GTAGAACAAAACTTTGTGTTTTCATTATCATTACTATATTAGTTGCTCTCTTTTATC TTTTT

TTCGTCCATCTTTTCTGTTTGAAGAGATTTTTCTTCTGTTTCATGAGATAATTCGAG GTG

GAACTGCTGAGTGCTATGTATTACATGCGGCTGATTATCTATTTCATTTTCTATAAA ACC

TTCTCTCTATCGTGAGAAAGCGAAGGTCTCCCTCTGAAGTTATACTGCTGATATTAG AGT

TTCTTAGAACCTGACGACTAGTTCTCTTTTTCTCTTCGCAGATGAGAAGGGTGAAAG AC

AGACTCAAACAG AAG AAAATTG ATCTTCATTACAC GCAG GAAG CTATCAGTCTACTGG C

AAATATGGGCTTCGACCCTAACTATGGAGCTCGACCCGTTAAACGAGTGATTCAGCA GA

TGGTTGAGAACGAAGTAGCAATGGGTGTTTTAAGAGGAGATTTTTCGGAGGAAGACA TG

ATTATCGTTGATGCTGATGCTTCTCCTCAGGGGAAGGACCTTCTTCCCGAGAAGAGA CT

GTTGATACGAAGAATTGAAAATGGTTCCAACATGGATGCCATGGTTGCCAACGAT

SEQ 75

GTGAATGTGAAATGTTTCTTTGTTTCTTTCTTTTTTTCTTTTTCTTGTATGTCACTTTTT TTT

TTG CAAG GCTG G AACTTTG AAACTTTTTGTTTG AAAAC ACAATC ATTCG CAGTAACAAAC

AAGAACCACCGTCCCCATCTTCACTCCCATCACTCTTCTTTTCTTTGTTTTCACACT TCAT

ATTTACTCTTCTTTCTCATCCTTTATATTTACATAGCAAAAACAACGTCAAGATTTG CAAA

AACACAGCAACCCCCCCAAAAAATGTCAAGATTTACAATGCTAGTAGTTCTTGTTCT TCT

TCTTCTATGTCTATGCCATTTATCAGTAGCAACAATAGGAAGTAGTAGTAATAAGAA GAG

TACTTACATAGTACACGTGGCAAAATCCCAAATGCCGGAGAGTTTTGAAAACCATAAAC

ACTGGTATGATTCATCACTAAAATCAGTTTCTGATTCAGCAGAAATGTTGTATGTTT ACA

ACAACGTTGTACATGGTTTCTCAGCAAGACTGACTGTTCAAGAAGCAGAATCACTTGAG

AGACAAAGTGGGATTCTGTCTGTTTTGCCGGAGATGAAATATGAACTTCACACGACA AG

AACACCATCTTTTCTGGGTCTTGATCGAAGTGCTGATTTTTTCCCAGAATCAAATGC TAT

GAGTGATGTGATTGTTGGGGTTCTTGATACTGGAGTTTGGCCAGAAAGTAAGAGTTT TG

ATGATACTGGACTTGGACCTGTTCCTGATTCTTGGAAAGGAGAGTGTGAATCTGGTACC

AATTTCAGTTCTTCAAATTGCAATAGGAAATTAATTGGTGCAAGGTAAAACTTTTCT AAAA

GTTTATGCGGTTAGAGACAAGACATTTTTAAGTTAGTTAATTATATTATATCTCAAA TTGT

GGTCGCGAGGATTCATATTGCTTACTTCAACTTTTTTGGGACTGGGACGTAGCAGTT GT

TATTATATGTTAAACTCGTCCTCTCGCATGTTGGTCTGATTAATTTTATGATTTCTC TAGT

TGGCAGTGAAATTTGAATCTGGGATTTTTTGCTTGGTTTGATACCATGTTGAGTTGTCTG

ATTAGTTGCATAACTAAGTGGTAACTGGTAAAGCTGCTCCCATATGATCGGAAGGTC AC

GGGTTCGAATCGTGAAACCAGCCTCTTGCTGAAATGCACGGTAAGGCTGTATACAAT AA

ACCTTTTGTGGTCCGGTCCTTACCCGGATACTGCTATGGTATAGCGGGAGCTTAGTG CA

CGGGGCTCCCTTTTGGCATAGCTAAGTGTGTTGAAGAGGGTTTAGTTAAATCCCATT CA

TCAGAGTTGTACTGTACAAATAAGCTAAAAATGAATTATTTTTGTGTATGTAAATTG GTGT

ATCCCATTGATAATACAGGTTTGTACTTTTTTGAATTTCCTTGTTAGAATTATTTTA AAAAA

AAATAAAAAATATCATGGCTCTGCCACTGTTGTGCTCAACTTATCTAAAAGCTAAAA CTA

TTAGAGATAAGATATACTTTTAATTACTTAATCATATTATGTCTGTTGATAGGTACT TCTC

GAAAGGTTATGAGACCACTTTGGGTCCAGTTGATGTATCCAAAGAGTCGAAATCTGC GA

GGGACGATGACGGACATGGAACACACACTGCTACTACTGCAGCTGGTTCAATTGTTC A

GGGCGCTAGTCTCTTTGGTTATGCTTCTGGAACTGCTCGTGGAATGGCAACACGCGC T

AGAGTTGCTGTGTACAAAGTTTGCTGGATTGGTGGTTGTTTTAGTTCTGATATATTA GCA

GCTATGGACAAAGCAATTGATGATAATGTGAATGTGCTTTCTTTGTCACTTGGTGGT GG

CAATTCAGATTATTATAGAGATAGCGTCGCAATTGGAGCATTTGCTGCTATGGAGAA AG

GGATTCTAGTCTCTTGCTCTGCAGGTAATTATGCTAGTCGGAAAATATGAAGAACTT CTA

GTACTTCTTAATTATTACATTTTATTTTATACTAGACCAGACTAGTTTAAAACTGAG CGAC

ATTAACAATGAAGATTCATTCATATTGCCGATTCTAACTTGCTTGGGATTGAGACGT AAT

TGTTGTTGTTGCTCTGCAGGTAACGCTGGTCCTGGTCCCTATAGTTTGTCCAATGTA GC

GCCGTGGATAACTACTGTGGGTGCAGGAACATTGGACCGTGATTTTCCTGCATATGT AA

GCCTTGGCAATGGTAAGAATTTCTCTGGTGTTTCACTTTACAAAGGGGATTTGTCGCTG

AGTAAAATGCTTCCGTTTGTGTACGCTGGTAATGCTAGTAATACTACAAATGGAAAT CTT

TGCATGACGGGTACCTTGATTCCTGAGAAGGTTAAAGGGAAAATTGTTCTATGTGAC CG

CGGGATAAATCCCAGGGTCCAAAAAGGTTCTGTGGTAAAAGAAGCTGGTGGGGTCGG T

ATGGTTTTGGCTAACACTGCCGCCAACGGGGATGAGCTGGTGGCTGATGCCCATTTG C

TTCCAGCAACGACAGTTGGTCAGACGACAGGGGAAGCAATCAAGAAATACTTAACCT C

GGATCCTAATCCAACCGCTACAATTCTTTTCGAGGGAACTAAGGTGGGGATCAAACC AT

CACCAGTGGTTGCTGCATTTAGCTCCAGAGGACCAAACTCAATCACGCAGGAAATTCTC

AAACCGGACATCATAGCACCAGGTGTTAACATTCTCGCAGGGTGGACAGGTGGTGTT G

GACCAACAGGGTTGGCCGAGGACACGAGACGTGTCGGGTTCAACATTATCTCGGGCA

CGTCTATGTCTTGCCCGCACGTGAGTGGTTTGGCTGCTTTGCTTAAAGGAGCGCACC C

CGATTGGAGTCCAGCGGCTATTCGCTCGGCTCTTATGACCACGGCTTATACAGTGTA CA

AGAACGGCGGTGCACTCCAAGATGTCTCGACGGGAAAGCCATCCACACCATTTGATC A

TGGTGCAGGACATGTAGACCCTGTTGCAGCACTAAACCCCGGACTTGTTTACGACTTGA

GGGCTGATGATTATCTGAATTTCCTCTGTGCCTTGAACTACACATCAATCCAGATTAATA

GCATTGCTAGAAGAAACTACAACTGTGAAACAAGTAAGAAATACAGTGTCACTGATT TG

AATTACCCTTCATTTGCTGTTGTTTTTCTAGAACAAATGACTGCAGGCAGTGGAAGC AGT

TCTAGCTCCGTTAAATATACACGAACGCTTACTAATGTTGGACCAGCAGGAACATAC AA

AGTTAGTACTGTTTTTTCATCAAGCAACTCAGTAAAAGTCTCGGTTGAGCCTGAAAC ATT

GGTTTTTACTCGTGTGAACGAGCAGAAGTCATATACTGTGACTTTCACTGCTCCTTC AAC

TCCATCAACTACGAATGTGTTTGGTAGAATCGAGTGGTCAGATGGCAAGCATGTAGTTG

GTAGTCCAGTGGCCATTAGTTGGATA

SEQ 76

ATGTTGAAGGCTCTTACATCCTCATGTCTGCAGAATCGTTTCCACGCCGTCACAACGGC

ATTTACCCCTCAAGTTCGCCGTGGCACTGACTCGAATACGCCCTTGCTTCGGGTTTT AG

GTTCGCTAAGAAGTTCGAATCGCAGGGTCCCTTATTTGTCTCGACGATTCTTTTGTT CG

GATTCTACTGATGGGTCCGAATCGAATTCCGAGGCTGCTGCATCCGAAGCCAAGCCG G

CCGAGGAAGGTGGAGATGCTGATTCTAAGGCTTCGGCTGCTATGGTTCCCACTGTTT TT

AAGCCTGAAGATTGCCTTACGGTTAGTTCAAAATAATTCTTTGCACCCGCACCGATA GA

TTTAGACGTGTCTTTAAAATAAATTGTATGACTTTTGTTAACTAATGTACATTCTCA GTTC

AATTTATCACTTCATCATTATTAACTAACATAATTTGGTGCATAATTATGTATTTTC CTGCT

CCATCATTATATAAGTACATTTTATGCTAATATTTGATAACTGCTAAATGACTCCTT AAGA

AAAGATGTTAACTTTTTGTTATAACGGTGTGGCCATGTCTGCTTGTGCACCAAAAAAGAA

GATATTAAGTGAGTTCCTTCTGTTGTTAGTGCTGGTTTTATGTCTTCTTTGGGTAGT TTT

GATATGATTTTATTCATTATTATTCAAATTTGTACATATGGATGCAAACATGACGCAGAAT

TGGGAACAATTTGATATAGAAATTATTTTTAACTTTGGTTGCAACACTCTGTTAAGA TTTC

GGCAATCGAAAGGGATGGTTGTTTAAGAAGTTAACTTTCTAGGATATAAAAAAGGTG GA

TCACCAAGATTATATTTCTCGTAGATTCGTACACTCAATCATTCTTTACTAAAAGAC CTTC

CCGACCTTGACTTGATATAATGCTGTGAAGGTCAACATTTTATTTTTCCAAAGAGAG GCT

TTTGATACCCTCTTTGTCTCTAAAAGAATGGTTTTATTATTAGATGCTTCGATCCTC ATAT

TTTGAACCCAGCCATTCCGTTTGAGATGAAAGAATAGTATTGTAGACTTTTTTAAAA TCA

GGCCGTAGTATTAAAGAATCTCGATATCAATCTTCTTAGGATCCTCAGACACCTCAC ATC

GATGACCAACACTTGTAATGTTATTGTAGCTTCAATTAGCATTAGTATTTTAATGAA ATGA

GAATTGACAAATTTTTATAATCTAGTCGTCTTTTTAAAGATTTCTTGTCCTGAATCT TCTTA

GGATCATCAGAGAGCCTCACATCAGTGAGCAACTCTTTTAATGCTATTGTAGCAATG TA

GCTTCAATTAGCATTACCCTTTCAATGAAAAAATGTGGTATTTCAGTACCCCTCCTT CAA

CCACAATGTTATTGATTTTTGCTTCCTTGATATCTCCTCCACTATTTATTCTGCTTT TATAT

GGCTTTTTGGTACTATCCCTTCTTGTCTATATTTTCATTAATGTGGTGCTTATGCTT TCCT

GAGCCGAGGGTCTATTGGAAACAACCTCTCTTTCATCACAAGGTAGGGGTAAGGTCTG

CGTACACACTACCCTCCCCAGACTCCACGGGGTGGGATAAGACTGGGTATGTTGTTG T

TGTTGATACTTCCTCCACTAAGGCACAATCCGCCAACTCCTTAATCAAGTCTTCCAT TAG

GGGTAATTCATTTGCAATCATCTCTTTTGCTTCACCTAATGATTTCTAAAGGTTTCG ATTT

TTCCAACAATTCTTTCATCTCCATCTCCACTCAAGCATATACCTCTCCATTTTCATC ATCT

ATGATCAGCTCTAAGTAATTTCAAGTTTTTTCCCAATCACACAGCTTGGATTTTCGG CAA

TGCACTTGTTCTTGTTCAAATGTTAATGTGTGTTATCTTAGTAGTGTTAAATTAGTC TCAT

TGGAAAGATATCATAAAATTTATGTATTTCTCACCTACACATAGATGTGTCTTTTCT ATTT

GATTGTTAGATTTTCTCAGATTGTATCACATACCCTTGGTGATCTTAATAAGGGGCA GCC

TATGCTACCCGATTGGTCAACTTTTTCACTATTCTTTCCGCTTTTGGTGATCTTTAA CCTC

ATCCCTATTTCCGTAGACCGTTTGACAGTATCATGCCTCTTTCTCAGTCATCTTTTT AGC

TATAAAATTCCTCCTTTTTGACCATTCTTCCAGTGAGTTTCTGCCACCATTCCAACC ATG

TCCCATTTACAGTGTAAAATTACTATTTTGGCCATTCTTCCAATGAGTTTTCTACCA TCTT

TCAGGCCAGTATTATGATTATTGTTCCAGATACTATTTTGGACGCTATTCCATTGCTACT

CCAGTCACTGTTTAGCCGCCAATTTGAGCCACATTAGCTGCCACCTTACGCCACACCAA

TGATGGTCTTGATATTCGTGGTATCGCGTTCTCTCTGGTCATTGAGAATCGGTTCCC TTT

TTTTTTTGATAAGGTAAATTGTATTAATCAAAAGGGAGAAAAAAAAAACTCCCGCAT ACA

AGAAGTATACAAAAAGTAGAGAATTTACATCAGAACATGATTCTCTACAAATGACGC CCA

ATCTTCTACGCAAGTAGGGGCTACATGTGTGCACCATTAAGCGATCAAAGATAAAAG AC

TATTCTTAAGATGTGAAAACGGAGACTCAATCCCCTCAAAAACTCTCTTATTTCTCT CTC

CCCAGACTAACCACATGAAAGCTAACGGGACGAACTTCCACGCCTTTTGCTTGCTTC TT

CTACGGAAATTGGCCCAGCTATATAGCATTTCCTTTACAGTGTTTGGCATCACCCAT TGA

ACACCAAAAAGATTCAGGATTGCCCTCCATAAACCCCAGGACACAAGGCAACGCATCAT

AAGATGATCAACTTCTTCCCCTGAGCTTTTACACATGTAGCACCAACTAACATGTGT AAT

TCCTCTCTTTCACAGATTTTCAGATGTCAAGATCACTTCCCTCGCTGCTAGCCATGC AAA

GAAGCACACCTTCGTGGGCGCCTTAGGAATCCAGATCGATGAGTATGGGAAAGTAGC T

TCCTGTCTCNTGCCGCATCCCAGAGAAAATCCCTTTGAAGCCGCTCCAGTTTTTCTG TG

ATGCTCACAGGTGCTTGCAACAGGGATAAGTAATAGGTAGGGATACTTGACAAAGTG CT

TTTAATAAGCACTTCCTTACCGCCTTTTGACAAATACCGTTTCTGCCAGCCTGCTAA TCG

TTTTTCAACCCTTTCAATGACTGGATTCCAAACNGAATTTGTTGGACCATCAATTCC TTC

TGGTTTGAAACTCTGAAGACGTGGGGGAAAGAGACCCTCAGAGTAGCCTCTCCACAC C

ACCTGTGATTCCAAAAGCTGATTCTCCTTCCATCACCCACCTTGTAAGTGATGTTGC CAT

AGAAAGCTTCCCAGTTCTTCATGATGTTCCTCCACATGCCACACCCGAACGGTGTTG TG

ATTGCCTTGGTCCTCCAACCCCCTCCCGTGGAATCATACTTTTCTGCTATGACCTCC CT

CCATAGAGCATGCTCTTCTACCCCGAATCTCCACAGCCACTTCCCCAGCAAAGCTCT GT

TGAATACCCTGAGATCTTTTACTCCAAGTCCACCCCACTTCTTTGGGGAAGTGACTG TC

TGCCAATTCACTAGATGAAACTTTCTAGTTCCATCTGCCGCATCCCAGAGAAAATCC CTT

TGAAGTCGCTCCAGTTTTTCTGTGATGCTCACAGGTGCTTGCAACAGGGATAAGTAATA

GGTAGGGATACTTGACAAAGTGCTTTTAATAAGCACTTCCTTACCGTCTTTTGACAA ATA

CCGTTTCTGCCAGCCTGCTAATCGTTTTTCAACCCTTTCAATGACTGGATTCCAAAC AGT

AGTATCCTTTTGCAAAGCACCCAATGGTAGACCCAGGTAGGTAGTGGGGAGAGAGCC C

ATCTTGCATCCGAGAACATGAGACAAAGCATCAATGTTAGCAACCTCATCCACCGGG AA

AATCTCACACTTGCTGAGGTTGATTTTGAGTCCTGATACTATCTGAAATCACTGCAG TAG

CTGCTTCAGGCAGGTCAACTGATCCATATCGGCATCACAGAAAACTAGGGTGTCATC C

GCAAAAAGCAAATGAGAGACCCTTCGGGCACTGAGCACCTCGATCGGAGCTGAGAAA C

CTCTCAAGAAGCCTCCACTCGCTGCACGATCCATCATTTTACTCAGAGCATCCATCA CT

AAAATGAATAGCATGGGGATAAGGGGTCACCTTTCCTAAGCCCCCTGGAGCTGCCAA A

GAAACCACACGAGCTACCATTAACCAGGACAGAGAATCTGACTGATGAAATGCAAAACT

TGATCCATCCCCTCCATCTTTCCCCAAACCCCATCCGTTTCATAATGAAGTCCAGGA AC

TCCCAATTGACATGATCAAAGCCTTCTCAAGGTCCAACTTGCACAGTAATCCGGATT CT

CTATTTTTCCTTTTGGAGTCTACAAGTTCATTTGCCACCAGAGCAGCATCCAGGATC TGC

CTACCTTCCACAAACGCATTCTGGGAGGACGAAACAGACACGTCAAGAACCTTCTTT AG

TCTGTTAGAGAGCACTTTAGAAATAATCTTGTAAATGCTCCCCACTAGACTGATAGG CCT

ATAGTCTCTGATACAAGATGCACCTTCTTTCTTAGGCACAATGGTAATAAAAGAAGC ATT

GATGCTTCTCTCGAAAGCACCATTCACGTGGAAGTATTCGATGGCTTCCATCAACTN NN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGCATC

GTGCACCTCCTCCTCCTCGAAAGCTCTTTCTAGCCACTCATTGTCTCCCTCCCCTAT GT

GGTTGAACTCTATTCCCCCCAAAGTAGGCCTCCAACTAACTTCCTCTTTATATAAGT TCT

CATAAAACCCCACTATCGTCCCGTTTACCTCCTCCTCACCCTCGATCCTCACCCCATCT

ACAACTAAGGACTCTATGAAGTTTCTCCTTCGATTTGCAATCGCAACCCTGTGAAAAAAC

TTAGTATTCGAATCCCCCTCCTTTAACCAGAGCGCCCTTGATTTTTGTCGCCAACTA GTC

TCCTGAGCTATTGCTAGCTCGACTATTTCACTCTTAACCTCCCTCAATCTCACTTTC TCA

GATTCATCTAGCTCTCTCGCCCCTTCCCCGCTCTCTAATTCCCCCAATTCATGCATA AG

CTCCCTCATTTTGACCTTCACCCTCCCGAAGACCTCCTTGTTCCACCTAATAATATC TCC

CTTGAGCAATTTGAGTTTCTTCAACAGACGGAATGAGGGTGACCCTGATACCCCGTA GT

TGGTCCACCATTCTTTCACTTTGTCACAAAATCCTGGCACTTTCAACCACATGTTCT CGA

ACCGGAAGGGAGCACGGACCCCCCTCCCTCTGCATCCATCTAGCAAGATAGGCAAGT G

ATCCGAAGTCAGCCTAGGTAAAGGGATCTGCAACACATNTTGCAACACATTCGGTAC CA

GCTCATCCCAGGAGGACGATATCAAAAATCTATCGAGTCTGGACCTTGAGTTTGAAT CC

TCCGCCCTGGTCCAAGTAAACCTTTCCCCTGACAGAGGAAGATCAATGAAAAAGTGA TC

ATTGATGAAGTCAGAAAACTCCTCCATGGCCGTCGAAATCAAATTGCACCCTAATCT CT

CCTCCGGGAATCTAATTGTGTTAAAGTCTCCCCCTAGAACCCATGGAATGTCCCATT CC

CCCATAATATTAGCAAGTTCCTCCCAGAAAGACACTTTGGACTCTTCCCCTACCGGC CC

ATACACTCCCCCAAATCTCCACTCCACCCCACTCACTCTATCTTTGACGAGAGCTGC CA

AAGAGAAAGCTCCCTTCCTAATCTCCTTTACCTCCAAACTTCTATCATCCCACATAA GTA

GAATACCCCTGGCACTTCCTGCTGCAGGGACCCAGTCATATTTCACCCAACAACCCC C

CAGACACTTCTCACGATCGCATCCGACAAAACTTCCATTTTGGTCTCTTGAAAACAA ACT

AGGTTGGCACCCCACTCCTTAACTCCCACTTTGATGATGGCCCTTTTGTTTGGGTCA TT

CATCCCCCTGACATTCCATGAAAGGATCTTAACTTCCATAAGCAACAATAGCCCCCT TTT

CCCCACCCCCCTAGACGAGATCCCTTTCTCCATGTCACACACTCGACCCCTTCTCTG AT

ACACCACTACATCCCCTATTTTATTTTGATTAACACCTTTACCCAACTCCCTCCCCG CGC

CTTCGTAACAAGGCTGTTGCTCAATCTCCCTCAACAGTTCTAGCACCCTATCTTCCT TCC

CTTCAAAAGAAACACCTAAGAACTTCCCAAAGTTTAATAGTTTATCCTCCATCCATA AGG

ACATGTCCACTTCTCCAATCCGTGACGAAATTGGACATGCGTCTTCTGTAATTACCT TTT

CCTTGCCCCTACCGGAGGAATGCAAGACCATAGCATTGCTAGTAGTAGGAACTCTGG G

TGTCGGGTGGATCACGTCGTTTCGTTGAGAGGCACCAACCTCAGGAATTAATGCCTC T

CTTGGAGGAGATTGTATTGTAACCTCAGCACATCTAAGCACAGTGGGAGATGAGGGGG

CGATGGAGCTGGGTCCATCAGAGGCGGGGGGCCGTGGAATAACACCGCTAGCTTTTG

CCGGTGCTGCCCCTGGAACAGTGGCTGAAGAGGAGCACAAGTCTCTAGTGTTTGGTA T

AGCACTGTTTGAGGCATCATTGGCAGAAGGAGGAGGAGTGACTTGCTCCCATATGTT TA

GGCGCCTTATCGGACGTCGCTTGATATAATTTGGGCCCTTTTGGATCGATTCTAATA GA

ATTGGGAAAAGTTGCTGGGCCCATGTGTAGAGGCCCAGACCTATGCTTGCTGAAATC TT

GCCCGACCCTATGATAGTTAAGAGAGGACGCATCACTATCCCATTCACGTCTTTTCC TC

CTGTTAAAATTTTGTCTTCTTCTCCCAAGTTCAAACCTCCCGTCACTTCTCGATCTA CCG

TCTGATTCCAGCTTAGTCCACTGGTCCGGTGAGCCCTCCCCTCCTCTCAGAAACTGG TT

TCCCCTTCCCTCCCTTCTATTTGACCTTTCCTTTTCTTCCGCCAAAATCGCAGGCAT AAA

GACAGGGCGAAATTCAGGGCTCAACCAAACCCTGTAGCTCAAGTAGCCATCTTCCAC A

ACAGTGGAGACAGGGATTCGGCCCCCTTTCCTCACCACAATCCTTACTCGCGAGAGA T

CGTGCAAGCTGCAAGTGACATCAATGAAACCCCCACAGATTTCCCCAACCTTCTTGA AT

AGGTCAAGACACCAGAGATGCACCGGGAGACCCACCATGTTGACCACCAATGTTTCG G

CAGGGAAACGAGGGTCGAGACACCCATCGTGCTCCACCCATCTGTCTAACTTTAAGA A

ATTCCCATCAAACCACCTGTTTCCTCTTACGAGAATTTCTGAAGCTTGAGACTCGGT GA

CAAATCGAAAGAGATATTGCGTGTCACCCAGTTTCGATATTTTCAGCCCGTCCTTGA CG

TTCCATGCCTCTGCTGTCCAGTTCCTAACAGTTTCTGGAGAACCAACCCATTTGGAG AA

AGACCCAATCAAACAAGAGTTCAAGAAACCTTTACGCCTGAGAGTGTGCTCCTCGGA CA

GCGAGAAGTGTGGCTCCTGCCCTACGTCAAGCTTTCGCTGGCATCTTCCCCCAAGTT C

GCGGGCTGGCCAAGATGCAGCTTTGCTAAAAGACATGCTTTCTAATTTTCTAGATTCTA

GGGAGAAGTAGGAGCTTCTCTCTGTAGAGTCCGTCATAGCTAAACCAAGAGCCTCGC A

AGGAGACTGACTGAAGAAGGGAAAGTGAGAGGGAGTGAAATCTGGCCGGAAAAAGAC

ACCTCCGTCGCCGAAAAATAGCAAAAGTGGTCGGAAGTTGGAGGTTGATGGATGGGTG

AGGTCGGAATAGCCTCGCGAAGAGGCTGTCTGAAGGAAAAAAGTTGGGATCTGGCCTG

AAAAGGCGTCACGTGCCGGCGCGTGGGGTGTCAGATCCCGGCGATTCTTTCTGGGGT

AGTGTCGCGACAGGGTTGGCTCTCGGATGGTGGTGGTGGAATCCTAGGATTCATGGT T

CATGGTGGTGGTGAAGGGGTGGGATGGGGTGTCTGTGTGCTACAGTGGTTTTTTTCC G

CTTCATGCACATTGAGTTCTGGTGACTGATCTCGTTTTGAACTGTTCACTTTCTAGG TGA

CGCTTTACTATTACGACTATTGTTCACTATTCCGTGTTTTCCAGAAAAAGCAAACTC AAG

GTGTTCTTCACTTCTTCTTACCTTGAGTTCAAGGACACATAATACAATGGTAATGTT AGT

TAGTAGAATGTTAGTGTTAAGTTAGTCCAACATCAAAGATTATTGTGCTTTTCACAC CTG

AAAATTAAGTGTCATGTATTATGTAATAAGTATTCATCAACTTATCTTTTCATCTGC TTCTT

CTATTGTCTCACAATTTTCAGATTAAAATGGAGTATTTTTTTTACAAAGAATCCCAT TAGC

AAGAAGATTGTGATTGTGTCTTGGGCACATTGTCAGTTGAACCATCCGTATTGGTTG AG

ATTGTTTCTTTGAAATCTCCTGATATTAGAAACCCGTCGATCCTTGAGTACTTCCTG CCT

TGTTTCTCTCTGGACCATATGTAACTCCTACCTTGCAGAGGCAAATTGATCAACTCT ATG

TCAAGACTGAAGCTTCAGAAAGTTCCTAATGATTTTGAATGATTTCTCACCTGCAAG AAG

CTGGTGATTCACATTGTTCAGCAAAAGATTTTTTTTTTGTTTGCTTGTCAAATTAACATTG

TAATTAGAATACTCTCCTTCCCATATGCTCTTCTCAAAGTCAGCATTCAACCAGTTA TGC

CTCTTACTTCTGGCCGTACAGAAGGCACCTTGATGGATGTTAACTTTTAACACTTCA TCC

TATATTTGGAGGCCTAGAGAGGGTTACTTCATGCTTTAGTTTCAATTTAATCAACAGCTT

CTAGGTTATTTAAGAAGTAAAGCATTGGAAAATACATGCGCGGTTTAAGTAATAGGA TTG

CCCTAGTATCTCATTTTTCTTTGAAAAGAAGTTTGATTCATATGTGTATTTCAGTTT TTGC

TAGTTGTTGCAGGATCAGTGTAAAAATGGTATTTATCAGCCTTCCCTAAAAAAAGAG AGA

TACTGAACGAATGCAAAGAAAACACTGTTTCTTGGAGGCTTCTTATTTGCTCATAATTCA

TCTGTTTGTAAGATTTATGGGTGCGACCCATTTCTTGCCTTCTTGGTTGGTTATAGT TTT

ATCTTTGGATAAGATTTACCTGTAAGACTTGCAATTAAGGGCTTTATGTCATTTATC ATG

CTAAAAATAACATAAAGTCTGTGTGGGCTGTACTTTTTTAATTCTATATAATTTATA ATGA

TGCATGTTGTTTTCTTGGTGCAGGTTCTAGCACTGCCACTTCCACACAGACCGTTATTTC

CAGGGTTTTATATGCATATCTATGTGAAGGTATGGTACCTTGTCTAGAATGGGTTAA ACT

ATGTCTTAGTTAGTCTTCGTGATTTTTGAAATAGAATAGATATAAGCTTTAAACATT AAGA

GTAAAATTACTGTCAACTTTCTGAGAGTTGATACTGTTTCAGCTAGTGTTTTGGAGC CTA

TTTCCTCTTAAGTCTATAAAACCAGCATTCAGCTTTAACAATGCTACTTTAATTGAT GGAA

AGTTCTTGGCTGAAGTTGTAAGCTTGAAAAGCTAAAACCTTCAATTAGGAGAGCGTC TG

TTGCTCAATTGTAATATTCTTTAGCCTCATTTGGTAGTTTCCAGAAAAATATAAATT CTGC

TGTAATGGCATGCAACAAATTGATCTGAGGCTCCACATAATCCATAAATTTCACTGC AG

GATCCCAAGGTATTAGCAGCCTTGCTGGAAAGTCGAAAAAGGCAAGCACCTTATGCT G

GCGCTTTCCTTATGAAAGATGAGCAAGGGACTGATCCTAATGTTGTGTCTGCCTCAG AT

ACAGAAAAAAACATCTATGAGCTTAAAGGAAAAGACATGTTGAACCGTCTTCATGAA GTT

GGTACACTTGCTCAGGTACATCTTGTTGTTCCTTGTGTTATTCTGTTGCTTTAACTT TATT

GAAGAAGTTTCTGATCAAGGTTGATGCAACTTATGATGAGCCAGATAACAAGTATTA AA

GACGACCAGGTTATTCTTATTGGTCACAGGCGGATACGTATGGCAGAGGTGGTAAGT A

GTTGTCTGCTTTATTTTTTGGTTATAAGCAGCTCCATATGTTTTCTTTTTGTCCTAT TTAG

TTCTATTTTTCATCTACATATCCATCATCTCCTGATTTTGAGACTGCCCAATTGACA TCTT

GGGTTCATTGCACTTGTATATTTTTCTCTCTTTCCTACCTTTTGTGTCCCAACACCG AGT

CCTTCCCCTTTGGCTTCATATAACTGAGCAGCTGCATTGTCAAGCGATAGAGGTCTA TT

GCAATTTGAGGCTAAATGAGCTGTTTTCCCCAGTTGGTTATGACGTGTTATCAATAG AAT

GAAACCCTCTTTTAGTTATTCTTGATTGCTTGTTTGCAAGAGAAAGTAGCACTGATT TTG

TAATAATTTTTGGACAACTGGTTTCCTGCAAAGTCATAGGAATGTCTTTCTTATTGC ACTT

CGGGGTTGTAGAAGAGAAAAGCAATTGAGTTTCTTCTATGATGTGCTAGTTTTGCATCT

GTTTGATTGCTGATGCATTTGGTTTGAATGAAATCCTAGACTTTTGATAACCTAGAA ATC

TGAGGGTAAGTGGCTCACGGTTTGAAACTCAGTGGATAATGGGCTCACCATTCTACC AT

TGTCCACCTAAATACTAGGCTTTTGTTTGCGCCACGGCACGAACTGTGATGTGCACC CA

ACCTACATATCACGGGCTGTACTCTTACCACTAGACAAAAGCCCCGGAGGCTTAAACAT

ATATTAAAACACATAAAAGTTTGAATAGAATCCTATATATTTGGGCTGTAATTTGGTATAG

AGTCGTATTCTTGGCTTTAGCTGTAAAGCTGGAGTACTCATGCATGGTCAGCCATTA AT

GGGCTTACTTAGCCACTGGAGGGTCTTATGTCTTCTTATTTTTTCGTTTTATTTTGT GATT

GAGGACCTCTTATCCTCTTCTATTGGACTTAAATCTCTCATCCCCTGAGCTCCTCAG TAA

TAATTCTCCTATTATACTATACTTACTGTTGGATGTTTACCTCTCTACCTTCACATG GTAG

GGTAAGGTCTGCGTACACAGTACCTTTCTTGGACCCCACCTGCACGATTACACTGGG TT

TGTTTGTTGTTGGATGTTTGCAGTCAGATTTCCTTATGTAGTCATTTCTTTTGTTTC AATA

AACAATTTCTTCAATATACTAGTGTAACAATTTGATGACTATGCAGTAGATTTGTCA CACT

TGAATGTTCACTATCAGATTTCCTTACTTTATTCACCATGTTGTTCTTTTTCCGTCT CAAT

AACCCATTTTGTGAAATGTACAAGAGTAACACCTTGAGGACTGAAACAGTGACATTC ATT

TGCAGGTCAGTGAGGAACCCCTTACAGTGAAAGTTGATCATCTCAAGGCATGCCTCA GT

TTCTTGTGCATTCTTTCTACCCAGCAATGATAAACTCATAAAGCTTGATGTTCAGTT TTGA

AATTGTTTCTAAAATTTGCTTTCTTAATTATGTTGCAGGAACAGCCGTACAACAAGG ATG

ACGATGTTATAAAGGCGACATCTTTTGAAGTTCTATCAACCCTAAGGGATGTTTTGA AGA

CAAGTTCTCTCTGGAAGGATCACGTTCAAACTTATATCCAGGTGTTAGTCATTTCTT TCT

AAATGTTAAGTCCTAATTGTTTGATTTGGTGATAACTCCAAAAAAAAAAAATTTACT CTCT

CAAAATGCATTGCCTTTTGATATTCTAGCCCACACTATTGTGTGAAGCTCCACAATG CCG

GTGGATTGTGTTAAACTTAGATCATGCCCTAAGTTAGGCATTTCATGCCATTCTTTT TAA

TGGAAAATTATGTTCGTGTTGGTGGTAATCAAATCCATGACTAGATTCTCTGCCTTA TAG

TTGATGCCCTGCTGAAAGAGGAGACTAATTAAAGATGTCAAGACTGCTAGGAAATGGAA

GGTCATAAATGTTCTCTGAACCAACTGTATACATATTGTTATACATATTTCTAAAGA GAC

GCTAGGAAGTTTATACAACTGAACAACTGTATAAGATGTATTTGGAGCTAGGAAGTT TAT

ACAATACCTAATTCTCTCTCTGCTGAATTTCTAAAAAGATGCTACTGATTGTTATGA ATAG

TGTCAAAATCAAAAGATAATATTGTGGCATGTTTTGCTGAAGAAGTGGGAGTGGAGGGA

TAAAGGGATAGGGACTTACTGAATACTTCTAAGGGAAGCATTTGTGTCCCTCCAAGT TA

TTGGTACACTAGTAATTGGAGTAAGAGACATTACTCCTTTCGACCAATTTCAGAGGGGT

TATCAAGTCTTCAAAGCGTGGCGATATTTGTGGGGGTTGGCTGGAAATTGAAAAAAA GA

CAACTCTAAGAAATCATCCTAGATGGGCTTCGATCAAGGAGAGGAGGCAGATGGAGAA

GATTTCGAAAACCCTAGAAGTGGAGAAGAAGGGTTTTCTTTATAAATTACATGTTTG GAG

TGTAGCGTTGGTAACAATTATCTCCAATCGACGAAATGGAGAAGAAGATGGACCTGC TA

GGTACTATGATGAAGTTTGAGATAGGCCTGCTACCGTAGAAAAGATCGGTCATGTGGG

GGTTGGGAACAATATTTCAAATTCTAAAAAAGTGTGGACAGATGTGGGAGTGGCACGTG

GAAAGGTCTTAAGGTAGAATGGGTCAGTAGTAAAGGAAAGGGCTCCAGAATTGGGCC A

CTTATGTCATGATGTTTGGAAGCTTAACATAACTTTTGGGTCATAAAAAATAATAGA TATA

GGCTCAGATGAGTCAACCTTTATTTTCAATAAGGGGCTTGGTCTTTTAGCCATGAAC CTT

TAGGGTCTCAGTTTTGATACTCTTTTAGTGGAGATTAATGTCATTAATTTGTCTGAA ACAT

GATACATTTTAGACTCCTTTCACAACACCATGATCGGCAAACCGATGAAAGAGCAGGAA

AGCCCTAGTCTGCCACAAATTTCACAGAACTCACATGCAGCTGATTACACAAAAATT GT

CATTCAAGATTAAGAAAACTCTTTCTTCGTACAATCGAAAGAGGCTGACCAAAATCA GAT

AGTTTTTTGAATTTGAAAGAACTAAACCATGGCTTATCGAAGAGGCTACTCAAGCTT TCT

TATGGCTGCATAATAATGTTACAAAGATGAGCAAACAATATGGATTGAATTTTTGTG AAT

GTGAGATTGAGGGTCTAGCTCTGTTCACGAAGCTTGACAGGAGAAGGCATAAGAGAAA

TGAAGCTACCATGTCCAGATTCACAATTCCAAAAGTGATAGGTATAAAGGAGCTCCAAA

AACTGTTTTTTAATGTGAATTATGGGGAGCTCGGATCAATGATGGGAAGGGGGATCA CA

AACACTAGGTACCCATGAAGCTGAATATTCTCACTTGGAATATTAGGGGGTTGAATGAC

AGGGAAAAGATAAAGGTGATAAAAAGTTAATCCATAAATTGAAGGCAAATATTTATT GCT

TTCAAAGACGAACTTAGAAGGGGGTGTGGAAATACTAGTTAAACAAATGTGGTCAGA CC

CATCTCAAGTGTTGTTTGGAGTCCAACGACAGGAAAGGGAAAATATTGGTGATGTGGGA

AAAGAATGTTTGGACAAGAGAAACTATCAACAAGGGATGTATACTATCACTCGCAAAATT

TCCTCATTATCACAGAATTTCTCCTGGCACCTCACAGGGGTGTATGAATCACACTGC AA

GTTGGAGAAACAAGAACGCTGGTGGGAGATAGTAGCATCTAAGGCATTTGTGCAGGGC

CTTGGGTGGTGTATAGAGATTTTAACACTAATAGATTCATAGCAAAAAGAAAGAACAACA

ATAAACTCACTAGGGCTATGATGGACTTCTCTAATTTTATAGATCATTAGAAGCTTG TAG

ATTCTAATCTTAATGGGGCTCCTTTTACTTGGACAAAGGGTAATAATCAGGAAAACT CTT

CAAGATTGGATAGATTTTTTTCCCGGCTAAATGGGCTGAGGAATTAAAGAACAAAAGGC

AAGCAGTACTCCCCAGTGTATTTTCTGATTGTACTCCCGTTTCTTTTCAATGCGGAG ATG

GGGAAGTTTAAAATCTTACTTCAAGTTTGAAAGCTGGTGGTTGGGTGTTGAGAGATTCA

ATGAAATGGTGAAAAGCTGTGGAACTCTTTTGAAGTACAGGGTAGACTAGACTCCATTC

TTTCAAGCTAACTGAAGTTGTTGAATACAAAGCAGTGGGATTGGGAATCTTCAGACTAT

GACAGTGTTGTGAAGAGCCTGAAGCGAGGAAAAGCGATAAGCCCCTTTTCGCTTAAA G

CGAGAAGCGAGAAGCGAAGCGCTCGCTTTTTTGAAGTGAAGCGGTTTAAAAAGATAT TA

AAATAAATAATGCATAGACAACACATGTAACTGTAAGCAAATGTTCAATACTTCAAT GTA

AAAACTAAAGAGTAGCATCAATTAAAGCACAAAATGAGCATCATATTCTTCTTCAAG ATT

GTCAAATTCTTGTATTCCACTATCATTATTATATTGCTCGTCATCTTCTTCAACTTC TTCTT

CTACATCAACTAGAAATGAAGTAACTGCTTCTTTTCCCTTCCTCTGTGAGCTTGAAA TTG

AGGTACTCCCCCTCAAACCATAAATTTTCTCCCCAATTCCACGCGCCTCCGCAACAT CA

CCCCAAGTGAAATCAGAAGTTTCCTCAAATACTTCTTCATTTGCATGATCTTCCGGG ACT

CCAATTAGCCATTCATTAGCATCATCGATGTTGTCCAAACTAATTGGATCAATTACA TTG

CGAGCATTGTAACGACGCCTCATTGTTCTATTGTACTTAATGAAGACTAGATCATTG AGA

CGCTTCAAGGTTAGTTTGTTCCTCTTTTTGGTATGAATTTGCAAATAATAAGTAATT AGTA

AGATTGCATGCGCATAACTGTCTGTCATCATTCAACATTCTAACTTCTTGAAGTTGC AGC

ATATTGGTGATTTCAATTATGCAAGGTTAGCAGATTTTGGAGCAGCAATATCTGGAGCC

AACAAGCTACAATGCCAGCAAGTGCTTGAAGAGCTAGATGTAAGTCCGTGGTTCAAG AA

GTTAGATATTCCCTCTGTTTCAATTTAGATGACACACTTTCCTTCTTAATCCGTTCC AAAA

AGAATGACACATTTCTACAATTGAAAATAATTCAACTTTAAACTTTTCATTTTACCC ATTTA

CCCTTAGTGAGAAGTTTTTATAACCACACAAATGTTATGCCCCCACAAAGCTTTTAC CCC

TTAAGCTTTTAAGTCCACAAGTTTCAGAAGTCTTTTTTCCTCTTAAACTTCCTGCCA AGTC

AAACTACCTCATCTAAATTTAAACGGAGGGAGTACATAATATTTTCTGTATGTGTGC CTT

TTTCAATCCAAACCTCAAAGAGAGAGGTTTACCAGTATTGTGAGTTCTAACCACTCT GGA

AGAGAGCGAGGTTTTATTACTAGGAGGTAGATCTAAGAATTTTTCCTTGCCATTTGC TTC

CTATTGTAACAAAAAAAAAATATATGTTTGGTCAGTGCTTCCCTGCTCTGAAAAGGA AAA

TGGTTGAAAGTTAAAAAAAAGGAGGATCAAAGCGTATTAGCGGTAAAGCTTCATTTCTT

GCTGATGGGTAAAATGGCTTCTCATTGGTACTCTTGCTCGAAAAATTAGGAATCTAA AAA

AGTTATCAGTTTGTTCACCACTTCATAAAAAGAAATTGTCAGTTTGTTCAAGTGGAG CTA

CTTATTTTGGGAAAATTAGAAAGGGAAATATTCAGTTTAGAACAGCTAGCAATATTTTGT

ATTTCCTCGTAAGAATCATAAACTTGCATTACAATGGTTATTTCATGATTTTCATCG AACT

GAGCCTTTTGTCTGCTCAGGGATCAGAATATAGTGTGTGCATGAGATAAAATGAATG TT

TCCCGCTTGTTTCATAGCTAACACCATAATACCTGCATCAGGTGCATAAGCGGCTAC AG

CTTACCCTGGAGCTAGTGAAGAAAGAAATGGAGATTAGTAAGATTCAGGTAAATGCA CA

TCAAGACGCATACCTGAACTTTAAATAGGTGCTATGCATGCATTTAGCATTTTACGT CTT

TTCTGTTGTCTGCAGGAATCAATAGCAAGAGCAATTGAAGAAAAAATAAGTGGAGAG CA

ACGCCGTTATTTGTTGAATGAACAATTAAAGGCCATAAAGAAGGCATGTGGATTATG TG

CAGCTTTTTTTGTGTTATCATCCTTAAACTTAGTACTTACATATGTTTATTCCTAGA TAATA

TATATGTTAAGCTTCTGATTCTTATGTTTGATTCACAGTTTAAACAAAGAGTTGATT TGAA

AAAAAAGTGCTTCCTTATGACGTGATTTTGATTGGCTTAATGCAGGAACTAGGTTTGGA

GACTGATGACAAGACAGCTCTTTCTGGTTCGTTGTCTCTTAATTACTACTGAAATGA ATA

ATGTTCTTTTTGGATTTATTACGGCCAGACGTGTTTTCCCATCTGGCCAATGAAACA TCT

TATGTTGGCCCTGGAAAAATTCCATGTAGCAGTAGCATAGAAAAGGCCATTTGCAAT AC

GTTTTGCCCTTTCTGTTGTAACAGTTTTATGGTGGCTGATTGATCTCCTGTTTACCA TTG

AAAATCTTCACTAGAAGCATGAATAGCATTCAGGCGATAATTGGCTTTACAGATAGGAAA

TTGAAGGTTGAATTTTCTTTGTACACGGTCCAACCAACTTTATTGACATGCTTTCTA AAG

CTATTTGAGACATACCAAACTTTTGACAGATAAGAATACATTGTACCTATAGGGGGCTCT

TTGTGTTTTCACTAAAGCACTTAATAAGACAATACTGGAATCCTTCAACTTCAATTTCCGA

AAGGTTACTGTCTACCATATTTGGTAATGTGGCACTAAATCAACATGGATTTACTGG AAG

GCCACATTGTTAGCACTGCATTGTTGTTTATTCAAGTTATGGACTGCTCATGATTCA GAT

TTATCTTCCTCCTCCAATGTTTCTTTTTCTCTTGAACTCTGTCCTGTTTCCCTTTTC AATG

CACTTCCTTATTAGAAGCCTACTTGCTTTTTCTGTCATGTACTCTTTCTTCTCCCCT TTGC

TCCTGTTGAGGAGTGTAATGCTTATGAAGGTGCTTCTTCCAGGTTTTGGTGATAATAGA

CATTACTGCAAACCCTAAAACGCGGTTCTCATTGTTTTTTCTCTATAATTTTCACAA AAAT

AGAGTTAAGCTGCGTGTTGTAACATTGTTTTGGTAGACCTGGTTTTGGTGTTGAAAT TGA

GAAGAAACTGCCTAATTATTAGGTTTGCTTTAAGGGCTTCGATTTTTCTGGTCAATACTA

ACGAAAACCTGCAGTTTTTTGTTCCCTTTTTTATTACATCTAATGGTGTGACACTAT ATTT

GTTTATTACTGCAGCAAAGTTCAGGGAAAGATTGGAGCCTAATAAAGAAAAAATACCAG

TACATGTTATGCAAGTTATTGAAGAAGAACTGACAAAACTGCAACTGTTGGAAGCTA GTT

CCAGTGAATTTAACGTAACACGTAATTATCTTGATTGGTTGACTGCCTTGCCATGGG GT

AATTACAGGTTTGTTGTCTATCGATTCTGCCTTACATTGTCTTGGGTTCAACCCAAC TGA

TGTTATCCTTATCCTTGGCTAGCTGTACTAGAGGAATCTGTTTGAGAAGCTGGCTAA AC

AGTCCAGCGAGAAATAAAAATGTTATTCTCTGAAATTTGCTGCTTCCAAGTTAACCT TAC

TGCCTAGTGATGTGACTTGCCTAAATATCTATCGAGTAATATCCATTTGTCTTTAAC TTTT

CTTTCTCCTCCAGTTCCTTATTTTGGGTTCTTACATGTCATGCTTCTGGCTTTGAGGATG

CTTACTTGACATCCCAATGTATGAGTTTAGACCAGGATCTCATGAGAGCAGCAAAAC TA

GGATTGTACTTATGATGAGCTCCTTAAGATGGGGGCTTGATTTGCCGTAGTTCGTGT TG

TTTGCTGCTGATGGTGGTGGTGTTGGCTTTATAGTTTTGTTCTCTGCCATGGGTGTA TT

GCATTGGTTCCTGAAGTTTTCTTTTTATGATCCAAATGCAGTGATGAAAACTTTGAT GTA

CTACGGGCAGAACAAATTCTTGATGAAGACCACTATGGGTTAACCGATGTTAAGGAA AG

GATCTTGGAATTTATAGCTGTGGGAAAACTCAGAGGAACCTCGCAAGGTTGGTAAAT GC

CTTTTTTTAAAAATAATAACCCTCATTTTTATTAAAAAAAATCCTATTTTATAAGGT TCAGC

CATAATCATATTAAAAGAACGGAAAATGATCCAGCCATCTCCTTACTGTCCATTGTC ATA

ACATTATAATGGACCAATGGAAAATATATCCATAGAACATGAGATTTATGGTTCCCA AAT

ACTTTATTGACATCAAATTGAAACGAGTAAACGGAAAGAAGTGAACATTTTAGGGAA TTT

GAGAAATATTTATTGGTCAAACTTAGGTAATACTTTTTGTGTCAGTCTAGAGTTCCT CCA

ATGTTTTCTTGTGATTATCTGTGGAGTAAAGAAATATATCTTGAGCTTAATTTCTTC CCTT

GAAAAGCAACTAATGTGAATTAAACTGCTGCACCTTGGGCCATAGTTTGTTGGTGTT CTT

CTTACATTCTGATTTTGTGCTGTCCATGATTGGGCACTCGCTGTGTGGTATTCGATT GAT

AACTTACTTTCACCACCAGTTGTACTTGTATATCTTTGGGACATTGAACTTGAGATG TAG

TTGTTTGTTGAGGATAATCTTTGGAAACTATGAAGTGTTGAGAAAAAAACAGGTTGA ATG

AAAGTTAACAATATAATCCAAAGACAAAGGTTAACTCTAGAAAAATGTGAATTGCAT CAT

AGCTAGGACAATATTAGGTTCAAATGATAATATTAATCCCAATTATACACTGGCACT GTC

TTTCATATGTGCGGCTTCACCTGATTTATCTCAGTTTAATTTTGAATCTGAGTCAGG AAC

TAGAAACAGACTCATTGCTTATTTTTGTTTGAACAGGGAAAATCATATGCCTCTCTG GCC

CTCCTGGGGTGGGCAAAACCAGTATAGGTCGTTCAATTGCACGTGCATTGAACCGCA A

ATTTTACCGATTTTCTGTTGGAGGGCTGTCTGATGTTGCTGAAATAAAGGTAATGGG AAT

ATCTGGCCAGCTAAAACAGAGTTGTTTTGTGGCGCACAGAATCTTGAACTTTCATGACT

AACTTTGGGATACACTTCAAGGGACATCGACGAACTTATATCGGTGCCATGCCGGGG A

AGATGGTGCAATGTTTAAAAAGTGTGGGAACCGCTAATCCTCTTGTTTTGATAGACG AA

ATTGACAAGGTATTTTATGGTTTGTGAGTTCATGCTTCAATTGTATGGCTTTGACTA TGA

GAGGAAGTCTAACTTCTTTTTCACCATTTAATCTCGTTTTTCTGTATATGACCACTG GAA

GAATCTTGAGCCTGAACATATTATGTTTTTGCTTGGATTTCCTCTGCAATCTAAATG TTTG

AAGAAAGTGTTTATCGATCAGTTTAATAATAGCCTTGTATTTTTTCTTCTATGGCAG TTGG

GAAGAGGACATGCTGGTGATCCAGCAAGTGCTATGTTGGAGCTTCTTGATCCAGAAC A

GAATGCAAATTTCTTGGATCATTATCTTGATGTTCCTATTGACTTATCAAAGGTAGT TGTT

TTCTGGAGCACTTATCAAATTATTGTGGCTGTTGATTGGCCCTTATGAAATGCCCTC TAA

CATCATTTGATGAATGGGGACTAATGTGATATTAAAAATCTTGCAAATATCTACTATCATT

TTGCTTTTTCACCTTTTGATTTCCCCCCCTTTTTCTTTACTGATTGATGTTCTCTTTCGTC

TCTTACCTTAGTTAAGTTTGGAAAAGGTTTCGTGACTGAGCCTGTTTCTTTTATTGT CCA

TAATGGAAGGTTATCTGGAAGTATTTTATTTCACGTCTGTGTTACCTTTGTCTCTGT CATT

ATGACTGTAATATGATTAGTGTAAGTAGAATGTTGTTCTCTTGATACTTGAGAAAAA ACT

AGGTTTCATTGGCATTGGTTTTGGTGATCATTCAATAGGAAAAAGGTTGATGCATTG AGT

TTTCCTACTGTCTGTTCATATTATCTTTGATGGCTTTACTGATGAGGGATATGGTTT TATA

CCTCTTGGAGTTACCGATAATTGGAGCGATAGACTTTGAGTGTTGAGTGTTAGTTTA CC

TATTGATTAAAGAATTTGGCCCATAATTCAAATATGACAGTTATCGATACTATTTCA TTTT

ATTATATCAAACTTCAGCAAATCAACATGGCTACAGAGAGAAGGTTGATGGGTATAT TC

GCATACAATTTCATTTATTTTGTCCTTTAATGTGATAATTTACTGTGTGTTCCTATG TCTTT

TCATAATGGTAAATATTTGTGTTATATTTCCAAAGGGCTTTGTCAAAAGAAGATAGC ATC

TTTCGATTATTTTGGTAGTATTTTGGGTCTGACTTGGTTATGGGGAGGGAGGCTATA TCA

AAGCCCTGTTATGGATGTTGAGTTATATATGAACCTGAAGAAATTATAAAATCCAAC CTC

TAGGTTTTAATGTACTTCTAAAATTTGAAAAAATCGAGCTTTGGCACCTTCAGTGTT TTCT

ATCCTCTTTGCTTAATACAGTTACCTATTCCACTGTAGTTTATACGAGAATGACATC TGC

ACATTTATCAATTTTTTGAGTGCATAATTTGAGGTCGATCATTTTCCTTTTTGCGGC TGCA

GGTTTTGTTTGTCTGCACAGCCAATGTTGTAGAAATGATACCTAATCCTCTTTTGGA TAG

AATGGAAGTAATTTCAATTGCTGGTTACATTACGGATGAGAAAATGCACATAGCCAGGG

ATTATTTGGAGAAAGCTACTCGTGAAACATGTGGGATCAAGCCTGAGCAGGTATGTC TT

ATAGAAACATCTCTAGTTGCATCTCTTTCATTATCTCCTGTGCATATTCATTATCGG GTG

AATAGTTTTGTATTTTTTCCGCTCCCAATTTTTGACATTCAGAAGCAGACATTGGTC TGT

CGGGAAAATGTTCGAAACATGATTGGGCAAGAATGTAACTTCTTAAAGTAAAATGTG AA

AAGCTTTATTCAGTTAGAAGTTTATGGAATAAAACATATGATTGGAACATTCAGGGG ATC

AAGCTCTTTACACCCTTCCATATAGAACATGGCTAAAGGAAAAAATCGCAGTAAATCTCA

TTATAGGTTTTTTGCCATATGCTTTAATATTTCCTAGAAAATATACTGTTTGATGTA TTGAT

ATCATTTTCATCTTAAGCCTCTACATTTAGAATATACTTGCCTCCAAAGCTAATGGA TTAT

AAAGTTTAATACAAACTCTTCCATCACTATCCTTTTCAAGAAGTATTTAAGTGGTTC ACAT

AAACAACTGAACGAGTCCACATTGCTGTATATATCCTCTCAAGTGCTGCTTTCTCTT TGA

GGAATGTTTGCTTGACCGAGCCAAACTGTGACTGCCTATGCTAGCCAATGTGCTCGT G

CATCATTAACACAACCACTGTGCCACACTTTGCTGGGAGTATAAACGCAATTGAAGG TG

GTTCTGTCGTGGTAGTCGCCAATCTCTATTAGTAAATGGAGGTATAATGAGTTTCTT AGT

GGTAGGGATACATGTGCCTGGTGGGTGGGGTTGCTGGATGGTATTGTCTAACAGATT T

GAGCGGGGCGTGAACTAAAGCAGGATGGATGTTTAAAATATGCCAAGTGTTTTGGTG TT

TTTGCTGTTGACTTCAATTTGACCATCGTGTAAAAGAGAAACTAGACTAGTTTGTTA CAC

AGTAGTGAATTTTTTATTACTGTACAAGGAATCATCATACGATGCCAAGATTGTGGT GAT

GACTCCAAAAAACACTTGACAATGTCTATTGAGTGAGCTTAAGTTATAATATGTCTG CAG

TAAACATGTTCTCCCACAGCCTTAAAGATGAGATCCGTGGAATTCGGAGAACCTTAT ATT

TAGCTCTCTAATGTGATCTTCAGTTGAGTTACATGCCATATGGATCTCTGGTGAAGA AAA

GGGAGCTGAAGACTTACTGATTAAGGGCATCAAAATGACATTTGACCTCATCTAACT CT

GTGGTGTCATGTGGTGTCATTTACTGTGCATATATACTCTAATAATTAACGTTTTCC ACTT

TTCCGGGAAAAAACACCCTAAATATGGGTGTCAGAAGTTAATAAGTTGATATGTGGG AG

GGCTGATGGCTACTTTGGTTTGTTTTGGCCCTCCATAGCGGATAGGAGCAGTTCTAA TT

TTTGAAGTATCTTGTAGGCTGTTGGTACTTTTTCATGACATAGTAGTGGCGTTATGC TTA

TCTAATGGCAAATGTATGACAGGACGAGGGTAGAGCTTTGGGTACCATTTCTCGGGA AA

TGTTAACTCCCTAATGCTCCTGTAAAATTACTCGAGTTGATATGTAATCAAGTTGAA GAG

ATGCAACGTTTGAGCCCATGTATATGTCATCTCTCTGTCTCTCAGGCTTTTAACATG CTG

AGGGCCGTTTCTCAGATGCAGAGATGTGATATCAAGCCGGAAAATTAAGTGTGCATG AT

TTATGTTAGCAATTGATTGCAATTCAAAGCCTTGAATGGTTGATGTTTGGTTTAAAT ATTT

CAACACGTATAACCATCCTGGCAGTTAGTATATGTGGTTATAGGAAGACTTTCAATT GAT

GCCTTGATGTTGTTTTTCTAAATATTTCACAAGTTACACTGAAGCGTTACTTGCAAT AAG

CTAATTTTATTGGCTTCATGAATTCTTTATTTCAGGTTGAAGTGACCAATTCAGCTCTTCT

TGCTTTAATAGAAAATTACTGCAGAGAAGCTGGTGTACGCAATCTGCAGAAGCAGATTG

AAAAGATTTATCGCAAGGTTCTGTGACCTGTCCTCTTTTGTATAATACATCTATGAT GAA

ATTGTCTCGAGTCTCGTACCTAGTCAACTTTTGATGAAAACTGATGGTTATTTCATG AAA

TCCAATACGAACACTTTGAATCGTTTTCATGCTCTGATACTTTGCACATCCTGTTCT TGC

TTTTCAGATAGCTCTAAAGCTTGTCAGGGAAGATGGAGAGATTGAGCCTCAGAATGCAG

AGGTAGGTGAGGTAGAAGCAGAATCTATCCATCTATCAGACGAAATCAAGTCTAAGG AA

GAAATTCAAGCTGGAGCTGAGTCCGCAAACGGTAGCAATGATGACAAGGCCTCTGAA A

ATAATGCTGAAGCTGAAGCACAGGGAGCACCAGTGAATCAAACACAGAAATCTGCTA AT

GAAGATGCTTGTTTACAGGTAAATGAAAAACATTAAAAAGCAAAATTATAATGTTTA GTA

CTTCAGGTGATTCTTGCCAGTTGTAACTATGTATGCTACAATGTATTTTAATGCTTC TAA

GTTTTATCTATGCTCAAAATAAAAATACAAGATGCAGTGTAATAGGTAGTTTTGTAG GCA

CAGAAGTGTCTTTTTACAACTTGTCTTACTGTCACATGTTGGATTAAATATGGTTAA CAAA

TGAATGTAAGAAATCATTTATCTGGCAATATAACACCAAACAAGCAGGGGAAGCACTTT

CTTTTGAAATGTTAACTAAGAACAAGCAAGGGAAGCGCTTGGTATTCATTTTCTACGCTA

AGTCCCCCCTGTATCTTAAAAAGGTATCTATGCATAATTTGCATTTATATCAGACTG TAA

GACAAGAGTGGGTTGCTCTAGTGGTGAGCACCCTCCACTTCCAACCAAGAGGTTGTG A

GTTCGAGTCACCCCAAGAGCAAGGTGGTGAGTTCTTGGAGGGAGGGAGCCGAGGGTC

TATCGGATACAACCTCTCTACCTCAGGGTAGGGGTAAGGTCTGCGTACACACTACCC TC

CCCAGACCCCACTAGTGGGATTATATTGGGTTGTTGTTGTTATCATACACTGTAATC AGT

TAAAATCAAATTCTTTGGGATGGAGGGGTTTCTTTAGTCTGGCTAATTAGACTTGATAGT

CAAATTTGTTTTAAGAAGTTGTGCCAAATGTTAACTCACTTGTTATAATCTAGCGCG CGC

ACACACATTTTGATATGTTTGATGTTAAGCTGCTAATGAATAACGTCCTTATATTTT CCTT

GACATTGCATGATTTTATGTTTACATATTTATCTGCGTTTAATTTAATGCTATTGTG TCCT

TGCAACTGAATTTTGGTTCTGAGTGGTTGTGATCTTAACAGGATACTCAAGAAACTGAG

AAAGCAACAGAAAGTGAAGCGAGTAAAACAGTAAATAAAGTGGTTGTTGACTCGCCA AA

CCTAGCTGATTATGTTGGCAAACCTGTTTTCCATGCGGAGCGCATATACGATCAGAC AC

CAGTTGGAGTTGTGATGGGTCTTGCTTGGACTTCAATGGGTGGCTCAACACTCTATATA

GAAACATCTCTGGTGGAGCAAGGAGAAGGGAAAGGGGCTCTCAATGTAACAGGACAAC

TAGGCGACGTTATGAAAGAAAGTGCCCAAATTGCCCATACGGTTGCCAGGACCATTT TG

CAGGAAAAGGAGCCTGATAACCAATTCTTTGCAAATAGTAAGCTTCATCTTCATGTT CCT

GCAGGTGCTACCCCTAAGGATGGCCCTAGTGCTGGTTGTACTATGATAACGTCCTTG TT

GTCTCTTGCCATGAAAAAGCCTGTTAAAAAGGACCTGGCAATGACAGGGGAAGTCAC G

CTAACTGGCAAAATTCTTCCTATCGGCGGGGTATGTTAACAATTCTTACACCTCTCC TTA

TAATTTCATGCAGCTTTTGTGTCTGATCATCTATCATGTTTTCTTTTTATTTTTCGT TGATT

TTGTCTTTAATGTTCTTTATGCTTTAATTATTTCCTGTGTCTGTATGTGTTACATGC ATGC

GCATGTAAGCATAATAAGAGTGGTCTTTTCTTTTTGACCCAACCAGTGTTGGGTTCC TTT

CTTGATTTTACAAAAGCTTTACCTTTTGGTTCAATAATAGGTCAAGGAGAAAGCCAT AGC

AGCGCGAAGAAGTGATGTGAAAACTATAATATTCCCTTCAGCCAATCGCAGAGATTT TG

ACGAGCTTGCTCCTAATGTCAAGGAAGGCCTTGATGTACACTTTGTGGATGACTACA AG

CAAATATTTGATTTGGCATTT

SEQ 77

ATGCAGTTTTTCCGAAGAAACCCATCACTTCACAGAATCTCCTCCAGATTCCTTAATCAA

GTTCGTTTTCTTTTCTTTTCCTTTTCCGAAGTATAACTAGCTTTTCAATTTTTGTTT GGCTT

TTCGATTATCTTACTAGTGTAACATATATTTCATATGTCTTGAGGTTCATTGCAAAA ACTC

GTTATATTTCTAAATGGGGTTGAGCACGTGGTCCAATACAATTCAGTAGGACTTACA GC

TGTAGCCTGTAGTTAGAGGACATTGGATTAATTAATTATATGGCTGCAATTCAGATA TTC

AAAACGTTTCTTTTCCCTGTTTGCAATTTTTTCCTCCAAGTAGTGAAACAGTGGAAT TTTC

TCCCCATTCTTAGGTCAAGCTATCATCTTTTTGCTTAAGAGTTGGTTTGGATGTTTACATT

TATTTTCTAACACAATTTGTTTTGGTTTGCTGCTGATATCTCATGAATTGGATACGATAGG

TAGTCAAAACCAGTGCATATTCAACCAAGAAAGTTTACAATGCTGGGCAGCCGACTG CT

GCTACTCACCCTCAGGTACTTCTACACTGCATATGTAGTATTGACATTTGGTAGCAT AGA

ATTAAGACCGTTGACATTAACAAATGATAAATGTGCAGTAAATTTAAAATCTTGTTT TCTT

GGTGGTTGTTTTCCTTGTATGGGACATACTTCGCTGTCCTTTGGAGCTCCTTTGTGA ATT

TCTGTTAAATGTTGTTACATCACTGCATGCAGTTAATGAAGGAAGGGGAGATTACTC CT

GGCATTACCAGTGAAGAATATATGCAGAGAAGGAAGAAATTATTGGAGTTTCTTCCG GA

GAATAGCTTAGCAATTGTTGCAGCCGCTCCCATAAAAATGATGACTGATGTTGTACC AT

ACAATTTTAGGCAGGATGCTGACTATTTGTACATCACCGGATGCCAACAACCTGGTG GT

GTTGCAGTTCTAGGGCATGACTGTGGTTTATGCATGTTCATGCCAGAACAAAGCCCC CA

GGTATTTCAGGAACCATTCACTTGCTTCCTTCTTGTTGACAAGAAGCTGTTAATAAG AGA

AAAGCTTCGTCCTATAATTTAGTGACATTTTTCTTTAGATTCAGTTACTACCATGAT TTTT

TGGTAGTTAGTATACATTGTAGCAAGTTAAAGATTGTTTCCATACTAAAAGTGAAAA AGT

ATTTTTAGGACGCTCTTTGGCAAGGAGAAACTGCTGGAGTTGATGCAGCTCTACAGATA

TTCAAGGCTGACCTTGCTTACCCTATTAACAGATTGCCTCAGGTAAATCTTTTTTAA AAT

CATATCTCCAACTGCAAATAAGTTTGAGATTCTTTTTAGAAGCGAATACCTCTCACA CTG

ATAAGTAAAAGGGCATATGATAACATCCCTTCTTTTATTCCTTTCAATAGGACAATG AAG

TACTTTATCTAAAAAGGGAGTGGAGACCTATTTGTCTCCTTTCCACTTGATTATAGA AAT

TTATGTCAGAGAATTGAATCTGTTAGAGTTGGCTTGTAGATACCTTTTGACTGTTGA TGC

AATTCTTAATATGCGTAAAAGAATTGTTTTTCTCCTTTTTCTCTTTTCTTGCCGGGGAAAA

GAATTGTTTTCCTCCTTTAATATGCGTAAAAGGTATAGAGGGAACAAAGTGGATGGA AG

TTAGAGTTTTCACCTAAGTTGCTCCGACACGGCAATTTAGGTGCCGCACCCATATCG AC

ACGACACTAGTATGGGTGTGGGTATGGGATCCGTACCGGATCTGGTCAAACAATTTT G

GGTACTTTGACCACGACGGATGGAAAAATTCGAGACGAGATACAATTTGATTCCCAA AA

TCAGAATCTAAGGTAAATTTAAATAAAATAATATACCTTATCTAGAAAATCAATCCT TTAC

TTATCTATAACTTGAAAATAAAAAGGAAATCCACACTTTACAAGCTATACGTAAGTA ATCC

ACAAAATTTCTCATAATTTAAAAATATTTTTATTTTTTTTGAATTATTTTTAGTCGG ATCCC

CGCACCCATATCTGTACTAGGATCTGTATCCCCGAATCTTAGAATTTACATCTCGAA GG

ATCCAACCTCTAGATTCGCACCCATGTCGGACACCCGCACCCGTGTCCGAGCAACTT A

GGTTTTCACATATATAGGAGTCGGGCCTGGCTTATTACTATAAATTCATGTTTGATAGGA

CCTATTACTGGATGTAGCCTTTCCTCATAATTTTGAAAATCAAGCAGGCATCACGCT AGG

ATCGGTTGAGAATATAATATTATGGGTAAGAAATTGAAGAATGAAGGTTAACAGAAA GTG

GACACTGTGTTCCAAATGGAAACTAGGTAAATGATTTAGGCAGACGGAATATTTTTTGGT

TGGCTGTATTTGGCTCTCAGGTTAACCGTTTGACCACTTAATGGTAATTTACTATTT AAA

TCGGCAACACAGAGAACAGGAAGTGAAGATGTATATATGACTGTGTTATTTGTAGAG AA

CCAGTTTATGGTGTAGGTTTTCTAGTTATTGTAGAGCACTTGCGTACAAGAGTTTAA ATT

CCGCAACATCGATAAATTCTTACCTTATAAAAAAGAGCAGGAAATGAAGACGAGATGCC

GATATCATGATTAGATCTATGTCAGCAAAGAAAAAATGTCAATTATTTCCTTCTAAG CTGT

CTTTCCTGTACATGGCTTCAATGTAGTGACTTTGTTTCACTTTCTCCATGTTCCAAT TTCT

CTTCTCTAATTTTTGCTGGACTTGTCAGATTCTCTCCAGGATGATAGAAAGTTCTTC CAC

TGTGTTCCATAATGTGAAGACAAGGACTTCATCCTACCTGGAGCTTGAGGCCTATAAAA

AAGCAGTTAGCAATTACAAAGTGAAAGATTTCTCTGTGTACACTCATGAAGCCCGAT TTG

TGAAGTCTCCAGCAGAGCTGAAATTGATGAGAGATTCTGCATCTATAGCTTGTCAGGTA

ATGGTAGTTCTTTCATTTTTGTCAGGTTCATGGGTTAGAGTGGTAGTTCTTACTCAT AGA

GGTTCTTGTTTTTATTGGACATGGAAAAGCAGTCTCTGGGTTATAGAGATGGAAAGATA

GAGTGTACACTTGACACTACTTTTGTATGTTTATTTGTTTTCCATTGAAGTTGATAC TCTT

CACACAGTTAACATGTGACTAAGGTATTGATATGCCAGCGAGGTGTTTTCAGAATTT TAA

AAAGCTTATTTGCAGGTCAGCTGTTATACAATCTTAAGTAACATGTTTGTCATTTTG CTAT

ACGACAAAATTTTTTAGAAAGGTAAAATAGGTATTTGCATTTCCCTTTTTTTCCTCT TCTT

CTTTCATGTCTAGGGTGGTGTCTTCAGGTTGAAGATACTACACTTCTGAGGATCTAAAAA

ATATTTCACAAAAG GAAAAAG G GTACAGTCAGAT AAAAG G ATCACCAGTCTAAAAGAAG ACGGTTCTTAATATTCCAAAAGTTGGAGTCCCAGCTTTCTTACTTGGGTCAACATATTCT

TGGTCTAATTGTGAAGGAACAGTTCTTGCATGTACAATCCTTTCTTTGATAATGTGC TTC

TGTGTTAAGTAGTTCAGAAGCTCTAGGCATGCTTAACCAAAAGATGTGTATATACTA CTC

ATTCATTCTATTTCACAATCATGATTTGCATGTTTTCTTATGAGAGACTGGTCTAGA AAAT

GCTTCTTCCTATTCCTGGATTTGTATGCAGTTGCCTAGCAATAAAGTTGCCAGTTAT ATG

GGAGTTGAGATATTTTCCTTTCACTAATTCAGTCCTTTTTTATACTGTATAAAGGAT ATTT

TTTATTTCTTGATCTTTTAATGTCTGTCTTGTCTTTCGGAAACAGCCTCTCTACCCC TCG

GGGTAGGGGTAAGGTCGGCGTACACACTACCTTCCCCAGACCCCACTAGTGGGATTT C

ACAGGGTCGTTGTTATTATTGTTGTTCTTTTAATGTATATATTTTTGGTAGGCACTT GTCC

AGACCATGTTGTACTCGAAGTTGTTTCCTGATGAAGGAATGCTGTCAGCCAAATTTG AA

TATGAATGCAGAGTTAGAGGTGCCCAAAGAATGGCGTAAGCTTTTTCTTGTAATAAT TTT

TGGAAGTTTGTATATAGAGAGGAGCACGTTGCAATTTCTAAGTATTTTAGTCTAACA TGA

GTTGCAGGAGAGTAAATCAAAATGCCACTAAGACCTCATGTGTAAACATGCAATTGA TTT

TCTTTTTCTTCTATTCTCTGCGATTCTGATAATTTGTTGTTTTTCCTGACTGTTAGT TTTG

GTCATACTTCTGGTTGAGATAGTTTCAAGGATTATACATTTTTCTTTTCCTGTTCCA CAG

GTTTAATCCTGTTGTTGGTGGCGGACCTAATGGCAGTGTCGTGCATTATTTTCGTAA TG

ACCAGAAAGTATGTTTACTGTCTTTAAGCACAGTTGAATTTGAATATCAAGCATATT GAG

TAGTAGTATCTAATTTGTTGTTTTAACAGATTGAAGATGGGGACTTTGTTGTTTTAA CAGA

TTGAAGATGGTAACCTTGTCCTCATGGATGTTGGATGCGAGCTCCACGGTTATGTCA GT

GATCTTACTCGTGTTTGGCCGCCCTTTGGAAAATTTTCTTCTGTTCAAGTAAGTATA GAA

TCCATGATTTTCTTCTCCGTTTTCCCCTTAAAACTCAAGTCAAACCCCACTCCTCTG GGT

AAAAACCCTGTTAGCTGATCAAAGTCATAGACAACCTTCCATTTCAGAAAGAATGCA CTG

AC CTAAATTG AAAC C C C AG ACTAC C AAAAG C AATCTAG AAAG AC AAAC G GTAAAATG AA

AATCATCAGGTAATGTAGCCTAGCAGCTAGCTTCACCCTCCAGTGGTATGAGTTATG AA

TCTTAATTCCAGATCCTCAATGGCCTTGCTGATTTGATGTGGATGTGAAGAATGAGA AT

GATATAATCATATAAAGTTCCTCCTTATTAGAAAAACAAAATTTCAATTTTACTTCT AAGC

TACAGATTATGCTTGAGAAATAAATTCTTCCCTTGGTATGGTTTTAAATTGCTACTT TTCT

GTGATATAGTCTCTATACATTATTGTCTGGAACTTAGTGATACGCTCCAAGATACAT TTC

AGG AGG AACTTTATAATCTTATTTTGG AGACAAACAAG GAATG CGTG GAG CTGTG CAGA

CCTGGCACAACCATCCGAGAAATACACCACTACTCGGTACTATTTTAGTTAATCCAT CTC

GTAATTTCTTTTG GTTTATATTCAAGG G GTAG CTG AG AGTAGG AATTTAATTTTTTTTCTC

TTGCCTTTCATAGACTTAGACCCAGTTATATTGCCAAGTTACATTGGTAGTCTCGGT GAT

AGAAATTTGGGTACAGTTGTGAAGGCCCTACTCTTGCTTTATGTTTTGCCTAATTCT CAA

GTTACACTGACTTCCACCTCCTATTGTGAATAGCAATGTTGCTTCAAAGTTTTCGTT CCT

ATGCATGAGCCGAGGAAAATGAGGCTTGATGGCATTTTCGAAGATAAGGAAAAAAGT TT

AATTCCTTTTAAGCCTTTGAGTATAGAGGTGTTGGGAAGAGATAGATGCCTTTTGAT TGC

CCTCCCTTGATTTGAAAATAGTATTTGTTCCCCCATTTCTTCATATATGATGAATAA TGCT

TTGTAAAATAAGCCAGTAAGGTAATGATTAGAGGTGTCTAATTAGTGTAGTGTGTGG TAA

TTGATTCTCAGTGAAGGGTAGTATTACCTGGTTGATGCAAATGCTTTATGAATTAAG GGT

CATTCTCTGCATTTGTTGTTAGGGCAGCTGTTAAAGGTCTTATTACTGTTCAAGATA TGG

GCGTATATGGTATATATTCTGTAGTGAGTAATTGCCCTATATCAGCATGCTCTTTTC TTTA

G ATACTTG AG G ACTG CCAAG GTCTCATTCTTTTTTTTTTATTTG ATGTGTATAG GTAG AA

ACGCTGCGAAGAGGATTCAAAGAAATTGGGATACTAAAAAATGATCGGCGTGGAAGA TA

TGAAATGTTAAATCCTACAAATATAGGTCTTTCCTTTTAACCCTTACTCTTCCGCTG CAG

ATATAAGTACAAATGCATGTGCAATAGCAGCAGAAACCTGCTCCCTCTCAATTGTCT TCA

CAGTTGCTAATGCTATTCCTATTATGCTTTTTGCTGAAAAGAGAAATGATTTCTTGT ACC

GCTGCAACCATCTCTGAATGAATTTGGTTTGTTCAATTATATTTCCAGGTCACTATC TAG

GAATGGACGTTCATGATTGTTCTACAATTGGAAATGATCGACCTCTGAAACCTGGTG TA

GTAAGTTTCCTTCCTTACTGATGATTGCTTTGATATTTGAAAATAATCGGGAAACTG CTA

GGTTTGCAAAAGAAATTGGTTGTCATTATTTTGAAATCCTCCTAACGAATGAGGACC AGT

G ACTTG CTCATTTG GAAACAAATGAG CTTTGC CATAATG CATCACTCTCTTTTAG CAATT TACAATGAGTTTTCCTCCATAGGATAGTTCAGTCATTCTCTCTTTGCTTCTTGACTGGAT

TACAATATGAACCAACTAAGAATGCTTATGTTTTTAGAGCCATGTGGTCAAATTTCC CTT

TTCCTTCACTTTTTTCATTTTTATAGAGATGCAAAGGGTTAAAAGAGAGGATTGAAT GAT

GATATATATGAATATTTTCTAAACTGCTTTTGTCATATGCACCTCTTTGGTCCTATT GCAG

CTTCTAATTTCATCACTTCTACTAGTAGTTACATGGAGAAAGTTAAATTCAGAAATG AAGT

TGGTTAGGTTACATGGAGAAAATTAAATTCAGAAATGAAGTTGGTTAGGTTTATTTC TAT

TGAAGGATGTGTACATCAGGGTAGTGCGTGTATTTGCATAAAAATTATGTTTGAAAT ATC

TGACGGTCCAATTAGCAGAAAGATCAAAGTATCTTTTGCTTCTCTCTAGATTCTATA TAG

ACCTTTGTTTTGATTGATTAATTTGAAATATTTGAAATGATTAATTCCTCGACCGCT TAGT

TCATTAACCCCGGCTTCATACACGATGCAATTTTTGTTATTGATAAAGGTTTCGTTC CTG

GTATGTCTACTCTTTATCAAAAAGTAAAGCTCTTGTATATCTTTTCATTCTGAGTCA CAAG

GAATGTAATATTCCTGCAGAATGTGTAATGTTATATGCAATTGTAGTAAAATTTCTC AGTA

GCTCGCGCATCTTGTTTTCATGTTGGTACTGCAAACTGTTAACTTATACTTTATGTA TAAT

CCATTTCATGACAAGTACCTTGCTCTTGTGAAAGGTACTTAACGTCCAACATGCTTG CTT

CCATGTATGAAAAATTATTAGTTATCCCATTTTGCTCCTTTTTCCTTCATTCTTCTA ATCAT

AAAAAATTGGAATATGCTCCCGACCTGTCTGATATGACAATAAAAACATACACAATA TAT

ATCAAGTCAGCTGTTATATGCAAAATTACTGAAGGTAATTCCAGTAATACAGCTTAT TGG

TGTCAGCGGTAGATTTATGTTAAATTATGCTTTAACTGAGGTCTATTTTGCCAGTGA TAT

CTGTATCCATGCATTTGTTTTTAGTCCTTACAAAAAACAATTTCAGAACTACTATGT TTTT

GAATAAGAAGCACCATGAACATGCTACTTAGAGGTCTTAGTTTGTATAATTTATGTT GAT

CATATCTGTGGAGAAATATGCTAATTTTGCGTGGGCCGTGGCATTTGTATTTGAAAG AA

GTACATGACTTTTGATTGTTCTGAGATTATGTGCAGAGTCTAGTTCTTTGACTTCGG GAC

TATGGAAAACTTTGTGTTTACTCTACATTCATACATCTAGACAAGGTCATGGCCAAT TGC

GAAACATGCTTTACGTTTTTTTAAAAGTGACGGAGACGCATATAGCTCAGAATGATG AG

ATGAAATTGGACAAAGCTAATAGTCTAGGATAAAATTGCCTGGTCATTGTTTAGACA TTT

GTAAACTCCTGTTCCCTCTGTTTGTTTCTACATTAACCTTGATGAGGCGGTCATTGA TAA

GAAGAATCTCTGACCCCATAAATAGATGAGTCCTTTTCATCTTAGCTTCCAATTACT TGT

GATTTCTGCAAGAACTTGTGATCAATCTTCATGGACTGTTATATGTAGGTCATCACA ATT

GAACCAGGAGTATACATCCCTTCATGCTTTGATTGTCCAGAAAGGTAATACTTGTTA CCT

CATCAAATTAATGTTCCTTTTGGCATGCATTCAGAAGTTACTGTATCTTAGATCATC CTC

CAGATTCTTGGTTTATTGAACTGGTTCTATCTGCCAAAATGTATTATGGTGGATGGA CGA

AGAGTTACTCTTTCATGCAGAAAAGATTGAGAATCATATAAAGATGTCTGTTCTAGG TGG

GTCAGGTAGATCTTGTAGGTCTAGTCAACATGTTAATCTAAGGACATTGTAGCAGAA GT

AGTCGTATCGTGCAAAGGAGTCCCAGTTTGGTTGGGCTTTCAGTGAATGACTGATGG CT

ACTTGTATGTTCAACCGAGAATGGGATCAGGAACCTTTACTTCGAAATTTCTTTTGC AAG

GTTCGAGAAAACTTCCCAGAAACAAATAGCTCGAAAGAAATATGAAAGAAATATTCC AC

GTATCAATGGCATATTCTGGCTGTCCCATTCTAGGGAAAATTGTTGCTTTTGTCATA AAT

CTTAGGGAGAAAGAATTATTGCTCACTTGAACTAAGAAGCTTCATCTGCTGATTCAT CTA

TTGTTAAGGAGCTAGATTATTCTCTTACCCCAATAAGGAACAACGTGTCTGTTTCCT TAG

AATTCTTGATTTTTACTCACTTATATATGTCTATATTCACTTATGCTCATTCAGGTT CCAA

GGCATTGGATTTAGGATTGAAGATGAAGTCCTTATTACAGAATCAGGTTATGAGGTA TA

GTTACAGAAATCGTTCAATTGTTTGAACAACCGAGTTATACAAGTACCAGTTCATAT GAT

CTCTGATACTTTGATCACTTCCGACACTTGTTAGCATTCAAGACCTGATTTTCTGCC CTA

CTGGAAACAGGTACTTACTGCATCCATACCGAAGGAAATTAAACACCTCGAGTCCTT GT

TGAACAACTTTGGCAGTGGGAGAGGAACAGAAATTAGAGCTGCTCTCAGT

SEQ 78

CTACTCACCTCTCACAAAAACCATATAATTCTCCTTCCCTTTCTTCTCTACAAAATCTTC A TTTCTCTCCAAAAACAAACTCTCATGGCTTCTTCTACTAGAGTTTTTGTTCTTCTCCTTC T CATAATCTTCAACTTTCTCTACATCTCAGCACAAAAAACCATTAAACATAAGCCTTTTTC A

ATGTCATTTCCTCTTACTTCAACATCTTTATCACATAACTCTTCTTCTAAAGCTCTT TTTCT

TTCTTCCCTTTTGGCTTCTAATCAAAGAAAACAAGCTCCAAACACAAAAACTGTGTC TAG

AATTCCATCTTTGAACTATAAATCAACTTTCAAATATTCAATGGCTTTAATTGTTAC ACTT

CCAATAGGGACACCACCACAAAATCAACAAATGGTTTTGGACACAGGCAGCCAACTT TC

TTGGATTCAATGTCACAAGAAAATTCCAAAAAGACCCCCACCAACGACGTCGTTTGA TC

CTTCTTTGTCCTCCACTTTTTCTGTTCTTCCTTGTACTCATCCTTTATGTAAGCCAA GAAT

TCCCGATTTTACCCTTCCAACTACTTGTGACCAAAATCGCTTGTGCCACTATTCTTA CTT

TTATGCTGATGGTACTTTAGCTGAGGGTAATCTTGTCCGTGAAAAAATTACATTTTC ACG

TTCCCAAAGTACCCCTCCTTTGATTCTTGGTTGTGCTACGGAGTCCGAAGATGCCGA GG

GTATTTTGGGAATGAATCTTGGACGGTTTTCTTTTGCCTCCCAAGCTAAGGTACAAA AAT

TCTCATATTGCGTGCCAATTAGACAAGGTAGCCATGCAGTTAAACCTAGTGGAACAT TTT

ACCTAGGCCAAAACCCTAATTCCCATACATTTCAATATATAAATCTTTTGACTTTTC CTCA

AAGTCAACGCATGCCAAATTTGGATCCACTAGCTTTCACTGTTGGCATGGTAGGGAT AA

AAATTGGCGGCAAAAAATTAAACATCTCCGGTAGGGTTTTCCGGCCAAATGCTGGTG GT

TCTGGCCAGACGATCATTGATTCCGGCACGGAATACACTTTCTTAGTGGAAGAAGCG TA

CAATAAGGTCAGAGAAGAAATTGTTAGGTTAGTTGGTCCAAGATTGAAAAAAGGTTA CG

TTTATGGTGGTGCACTTGACATGTGCTTCGATAACCGTCCGATGGAAATCGGACGGT TG

AT AG GTG ATATG ACATTG CAATTTG AG AACG G GGTTG AG ATTTTG ATCAATAAG GAAAG

GATGTTGGATGAAGTAGAAGGTGGGATCCATTGTGTTGGAATCGGACGGTCAGAATC A

CTCG G AATAG CAAG CAATATTATTG GTAATTTCCATCAG CAAAATTTATG GGTAGAATTT

G ATATGAG AAATCG AAG AGTAGGTTTTG GCAAAG GAG AGTGTAGTAG G CAAATG

SEQ 79

ATGGCTGCACTCAATTTCTTCATAATCTTCACATCACTAGTCTTACCAATTGCATCTGAT

CCTCTGTTGTCAACTTATGTTGTCCATGTTGACACCAAAGCCAAGCCATCTCATTAC TTA

ACTCAAGATGAATGGTATAATTCAGTGGTTGAGTCAGTTCTTGCAAACAAAATGGAC TCA

GATTCTACTTCTCCAAGATTGTTCTACTCATATGATGTAGTGTTACAAGGTTTTGCA GCA

AGATTGACTGATCAAGAATCTGAAAAACTAAATAAATTTCCAGAAGTCATTCACATT TTCA

AAGATCAGTCTAGAATCAAGCTTGACACAACACGTTCGCCGAATTTTCTTGGCCTAA AC

ACAGGTTATGGTCTGTGGCCACAATCTAACTTTGGAGATGATGTTATAATTGGCCTT GTT

GATACAGGGATTTGGCCTGAGAGTGAGAGTTTCAAGGACAATGGTATTGGTCCTATT CC

AACAAGGTGGAAAGGTAAATGTGTTGATGGAATTGAATTCAACGCGACGAGTAGTTG TA

ACAG AAAACTTATTG GTG CTAG GAATTTC GTTAAG G GTGTTG AG AATG ACTATCATCATC

AATCG G CACG AGATCAAAATG G ACATGG AACACATACTG CTTCAACTG CAGC AG GTACA

GAGGTAAATGGTGCCAATGTATTTGGTTTTGCTAAAGGGAAAGCACGAGGGATTGCG A

GTAAAGCTAGGATTGCAATGTACAAAGCTTGTGGGAGTAGTTCTTGTGCAGAATCTG AT

ATTTTAGCAGCTATTGAAAGTGCTATAAAAGATGGCGTAGACATACTTTCGCTCTCT TTA

GGATACGATGATGCTCCGTTTTATGAAAATCCAGTGGCAATTGCAACATTTGCTGCT GT

TAAAAGGAACATATTTGTTGCTTCTTCAGCTGGAAATCTTGGACCTTATCCATTTTC AGT

TCACAATACAGCACCTTGGGTTACAACAGTTGGAGCTGGATCACTTGATCGCGATTT CC

CCGTTG AAATC AACTTATC AAACAACAAG ACTTTTGTTG GTTCTTCTCTTTATC CAG G GA

GAATCAGTGGTAAAAGTTACTCTCTTGTTTATATTGAAAATTGTTCTATAATGACAA TCGA

TCGTTCTAAAGTTGAACGAAAGATTGTAGTTTGCAACACTAGTAAAATCGAAGCTCT TAG

AAATGGGATTTTAATTCAGAAAGCAGGTGGTTTTGGACTGATTCAATTAAATCTTCC AAC

TG AAG GAG AAGG G ATTAG AG CAATG G CTTACACATTGC CTTCTG CAACATTGG GTTATA

AAGAAGGTATAGAGCTTCTTTCTTATATCAAATCCAATGCTAATCCAAGAGCAGGGT TCG

TACGTC GAAAG GATAC AGTAATTG GG AAAAAAGTTAG AG CTCCAATTGTTG CTAG CTTTT

CTTCAAGAGGGCCTAATGTTGTTGTTCCTGAAGTCCTCAAACCTGACCTCATTGCTC CG GGTTTGAACATTCTTGCTGCATGGCCAGGTAACCAGAGACGGATCCAGGATTTATACCT

TATGCATTCAACCTTTATTCTTTACCATTGACCCCACGACACTTTTAAACTTATGAG GTA

GGAATTTTATACTTTTTGAAATTGTTGTGATTTTTCATATTGCGTGGAAGCAACCAC TAAT

GCTTATGGTAGGATAGGCTGTCTACATCACACTCCTTAAGTGCGGCCCTTCGCCCGA C

CCTGCGTGAGCAAGGGATACTTTATGCACTAGTCTACCGCTTTTCTTTATTTAGTGA TTC

TTCACATTGTGTGTGTCTATGCAGGTGACATTTCCCCAACACGTCTCAAGATGGATC CA

AGGAGAGTGAAGTTCAATATAAACTCGGGAACATCAATGGCGTGCCCTCACATAGCC G

GAGTAGCTGCATTAGTCCGCGCTGTTCATCCAGATTGGTCCCCGGCTGCTATAAAAT CC

GCACTCATGACTACATCCACAGCATTCGACAATGCACAACTCCCTATCATAAAACAC GA

AGACATGGAGCTAGCAACTCCGATCAGCATTGGAGCCGGGCACGTGAACCCTGAATC G

GCTATTGATCCGGGCCTAATATACGACACTGATACATCAGACTACATCAACCTACTA TG

CAGCTTGAATTACACAGAGAAACAAATGAAACTTTTCACGAACGAGTCAAATCCTTG CTC

GGGTTTCACTGGATCTCCACTTGATCTTAACTATCCATCACTTTCTGTTATGTTCAG GCC

TGATTCCTATGTTCATGTAGTTAAGAAGACACTGACACATGTCGCGGTATCTAAGCC CG

AGGTGTACAAAGTAAAGATAGTGAATCTGAATTCTGAAAAGGTGAGTTTAAGTATAG AG

CCAAGGAAGCTGATTTTCAATGAATCTTTACAGAAACAAAGCTATGTGGTCAAATTT GAG

AGCCATTATGCATTCAACAGCAGCAGGAAAATAGCTGAGCAAATGGCGTTTGGTTCG AT

ATTGTGGGAGAGTGAAAAGCACAATGTTAGGAGCCCCTTCGCTGTTATGTGGGTTCA G

CAAAATTTCAATAACAGTAGATTATACAAA

SEQ 80

TCAAAATGCCAAATCAAATATTTGCTTGTAGTCATCCACAAAGTGTACATCAAGGCCTTC

CTTG ACATTAGG AG CAAG CTCGTCAAAATCTCTG CG ATTG G CTG AAGG G AATATTATAG

TTTTCACATCACTTCTTCTCGCTGCTATGGCTTTCTCCTTGACCTATTATTGAACCA AAA

GGTAAAGCTTTTGTAAAATCAAGGCACTTCAGAAAAGGAACCCAACACTGGTGGGGT CA

AAAAGAAAAGACCACTCTTATTATGCTTACATGCACATGCATGTAAACACATACACA CAG

AG G AAATAATTAAAG C ATAAAG AAC ATTAAAG G C AAAATC AAC AAAAAATAAAAAG AAAA

CATGATAGACGATCAGACATAAAAGCTGCATGAGATTATAAGGAGGTGTAAGAATTG TT

AACATACCCCACCAATAGGAAGAATTTTTCCAGTTAGTGTGACTTCCCCTGTCATTG CCA

G GTC CTTTTTAACAGG CTTTTTCATG GCAAGAG ACAACAAG G ACGTTATCATAGTAC AA

CCAGCACTAGGGCCATCCTTGGGGGTAGCACCTGCAGGAACATGAAGATGAAGCTTA C

TATTTGCAAAGAATTGGTTATCAGGCTCCTTTTCCAGCAAAATGGTCCTGGCAACCG TAT

GGGCAATTTGGGCACTTTCTTTCATAACGTCGCCTAGTTGTCCTGTTACATTGAGAG CC

CCTTTCCCTTCTCCTTGCTCCACCAGAGATGTTTCTATATAGAGTGTTGAGCCACCC ATT

GAAGTCCAAGCAAGACCCATCACAACTCCAACTGGTGTCTGATCGTATATGCGCTCC G

CATGGAAAACAGGTTTGCCAACATAATCAGCTAGGTTTGGCGAGTCAACAACCACTT TA

TTTACTGTTTTACTCGCTTCACTTTCTGTTGCTTTCTCAGTTTCCTGAGTATCCTGT TAAG

ACCACAACCACTCAGAACCAAAATTCAGTTGCAAAGACACAATAGTCATTATATTAA ATG

CAG ATAAATATGTAAACATAAAATCATG CAACG CCAAG G AAATACAAG G ACATTATTCAT

TAGCAATTTAAATTGGTAGAATTCTATATATTTTTCTTTCAGTACAGACCAATATGA GGGT

AGAAATAACATCAAACATATCAAAATGTATGTGCGCGCGCTAGATTATAACAAGTGA GTT

AACATTTGGCACAACTTCTTAAAACGAATTTGACTATCAAGTCTAATTAGCCAGACT AAA

GAAACCCCTCCATCCCAAAGAATTTGATTAACTGATTACAGTGTATGATATAAATGC AAA

TC ATG C ATAG ATACTTTTTAAG ATAC AAG G G G G AC AT AG C ATAG AAAATG AATAC CAG G

CGCTTCCCTTGCTTGTTCTTAGTTAACATTTCAAAAGAAAGTGCTTCCCCTGCTTGT TTG

GTGTGATATTGCCAGATAAGTGATTTCTTACGTTTTGTTAACCATATTTAATCCAAC ATGA

GACAGTAAGACAAGTTGTAAAAAGACACTTCTGTGCCTACAAAACTACCTATTACAC TGC

ATCTTGTATTTTTATTTTACGCGTAGATAAAACTTAGAAGCACTAAAATACATTGTA GCAT

ACATAATTACAAATAGCAAGAATTACCTGAAGTACTAAACACTATATTTTTGCTTTT TAAT GTTTTTCATTTACCTGTAAACAAGTATCTTCATTAGCAGACTTCTGCGTTTGATTCTCTG C

TCCCTGTGCTTCAGCTTCAGCATTATTTTCAGAGGCCTCATCATCATTGCTACCGTT TGC

TGACTCAGCTCCAGCTTGAATTTCTTCCTTAGACTTGATTTCGTCTGATAGATGGAT AGA

TTCTGCTTTTACCTCATCTACCTCCGCATTCTGAGGCTCAATCTCTCCATCTTCTCT GAC

AAG CTTTAGAG CTATCTG AAAAG CAAG AACAG GATGTGCAAAGTATCAG AG CATGAAAA

CGATTCAAAGTGTTCGTGTTGGATTTCATGAAATAACCATCAGTTTTCATCAAAAGT TGA

CTAGGCACGAGACTCGGACAATTTCATCATTAGATGTGTTATACAAAAGAGGACAGG TC

ACAGAACCTTGCGATAAATTTTTTCAATCTGCTTCTGCAGATTGCGTACACCGGCTT CTC

TGCAGTAATTTTCTATTAAAGCAAGAAGAGCTGAATCGGTCACTTCAACCTGAAATA AAG

AATTCATGAAGCCAAATAAAATTAGCTTATTGCAAGAAACACTTCAAGTGTAACTTG TGA

AATATTTTGAAAAACAACATCAAGGCATCAATTGAAAATCTTTCTACAACCACATAT ACTA

ACTGCTAGGATAGTTACACGTGTTAAAATATTTAAACCAAACATCAACCATTCAAGG CTT

TGAATTGCAATCAATTGCTAATATAAATCATGCACACTTAATTTTCCAGCTTGATAT CACA

TCTTTTTTTTGATAAGGTGAAGATTTTATTAAAAACAGTATCAAGCTGATACTGTAA AAAT

ACAAGGACACTGCTGGCTTAAAAACATTAAAATCCTAAGCGGTCTAGCATGTCCAGC TT

G ATATCACATCTCTG CATCTG AG AAACAG CCTCAG CATGTT AAAAG CCTG AG AGACAAA

GAGATGACATATACATGGACTCAAATGTTGCATCTTTTCAACTAGATTACATATCAA CTC

GAGTAATTTTACAGGAGCATTAGGGAGCTAGCATTTCCCGAGAAATGGTACCCAAAG CT

CTACCCTAGTCCTGTCATATGTTTGCCATTAGATAAGCATAAAGCCACTACTATGTC ATC

AAAAGTACCAACAGCCTCAAGATACTGAAAAAATTAGAACTGCTCCTATCCGCTCTG GA

GGGCCAAAACAAACCAAAGTAGCCATCAGCCCTCCCACATATCAACTTATTAACTTC TG

AC G C C C AT ATTT AG GGTGGATTTTTTTTTTTTTTTTTTTTTTTTTGGGGGGGGGGGGGGG

GTAGGCACAGTAAATGACACCGCAGATATAGATGAGGTCAAATGTGTCATTTTGATG CC

CTTAATCAGTAAGTATGCAGCTCCCTTTTCTTCACCAGACATCCATATGGCATGTAA CTC

AACTGAAGATCACATTAGAGAGCAAAATATAAGATCCACCGAATTCCATTGATCTCA TCT

TTAAGGCTGTGGGAGAACATATTTACTGCAGTCATCTTATAACTTAAGCTCCCTCAA TAG

ACATTATCAAGTGTTTTTTGGAGTCATCACCACCAATCTTGGCATTGTATGATGATT CCT

TGTACAGTAATAAAACATTCACTACTGTGTACTAGTCTAGTTTTTCTTTTACACGAT GGTC

AAATTGAAGTCAACAACAAAAAAAACAAAACACTTGGCATATTTTAAACATCCATCC TGC

TTTAGTCCATGCCACACTCAAATCTGTTAGACAATACCATCCAGCAACCCCAGCCCA CC

AGGCACATGTATCCCTACCAGTAAGAAACTCATTATACCTCCATCTACCAATAGAGA TTG

GCGACTACCACGACAGATCCACCTTCAATTGCGTTTATATTCCCAGTAAAGTGTGGC AC

AGTG GTTGTGTTAATG ATGCACG AG CACATTG G CTAG CAAAG GCAGTCAC AGTTTG GCT

CGGTCAAGCAAACATTCCTCAAGAGAAAGCAGCACTTGAGAGGATGTATACAGCAAT GT

GGACTCATTCAGTTGTTTATGTGAACCACTTAAATACTTCTTGAAAAGTATAGTGAT GGA

AGAGTTTGTATTAAACTTTATAATCCATTAGCTTTGGAGGTAAGTATATTCCAAATG TAGA

GGCTTAAGATGAAAATGATATCAATACATCAAACAGTATATTTCCTAGGAAATATTA AAG

CATATGGCCAAAAACCTATAATGGGATTTACTATATCCGTGCGATTTTTTTCCTTTA GCC

ATGTTCCATATGGAAGGGTGTAAAGAGCTTGATCCCCTGAACTTTCCAATCATATGT TTT

ATTCCATAAACTTCTAACTGAATAAAGCTTTTCACATTTTACTTTAAGAAGTTACAT TCTT

GCCCAATCATGTTTCAAACATTTTCCCAACAGACCAATGTCTGCTTCTGAAATGTCA AAA

ATTGGGAGCGGAAAAAATACAAAACTATTCACCCGATAATGAATATGCACAGGAGAT GA

TGAAAGAGATGCAACTAGAGACGTTTCTATAAGACATACCTGCTCGGGCTTGATCCC AC

ATGTTTCACGAGTAGCTTTCTCCAAATAATCCCTGGCTATGTGCACTTTCTCATCCG TAA

TGTAACCAGCAATTGAAATTACTTCCATTCTATCCAAAAGAGGATTAGGTATCATTT CTA

CAACATTGGCTGTGCAGACAAACAAAACCTACAGCCGCAAAAAGGAAAATGATCTAC CT

CAAATTATGCACTCAAAAAATGGATAATGTGCAGATGTCATTCTCATATAAAGTACA GTG

G AATAG GTAATTGTATTAAG CAAAG AG G ATAGAAAACACTGAAG GTGTC AAAG CTCCTT

TTTTTCAAATTTTG GAAGTACATTAAAACCTAGAG GTTG G ATTTTATAATTTCTTCAG GTT

CATATATAACTCAACATCCATAACAAGGTTTTGATATAGCCTCCCTCCCCATAACCA AGC

CAGACCCAAAATGCTACTAAAATAATCGAAAGATGCCATCTTCTTTCGACAGAGCCC TTT GGAAATATAACACAAATATTTACCATTATGAAAAAGACATAGTAACACACAGTAAATTAT C

ACATTAAAGGACAAAACAAATGAAATTGTATGCGAATATACCCATCATCCTTCTCTC TCT

AGCCATGTTGTTGATTTGCTGATGTTTGATATAATAAAATGAAATAGTACCGGTAAC TGC

AAAGTCGTCATATTTGAATTATGGGCCAAATTCTTTAATCAATAGGTAAACTAACGC TCA

ACACTCAAAGTCTATCGCATAATTGTAAAGTCGTCATATTTGATTTATGGGCCAAAT TCT

TTAATCAATAGGCAAACTAACGCTCAACACTCAAAGTCTATCGCTCCAATTATCGGC AAC

TTCAAGAGGAATAAAACCATATCCCTCATCAGTAAAGCCATCAAAGATAATATGAAC AGA

CAGTAGGAAATCTCAATGCATCAACCTTTTTCCTATTGAATGATCACCAAAACCAAT GCC

AATGAAACCTAGTTTTTCTCAAGTGTTAGATATAGAAATGTAGTTGTCCCACATTGG AAT

AGGTGTAGTATGCCTTTGTATAGAGTAGCTATAAATAAGCCCATCTTGTATTGCATT AGA

CACACAATATCAATATATCATATTTTCTCCCGTGTCTTCTCACATGGTATCAAAGCA ATC

GTGAGAGATTTATCGTTGTGCATAAATTCCAGCGACTCCGGGAAGGAAAATCAGTTG AC

CGGAAGCCTTTTCCGGCAGGTCTGCCGCAAGTAAAAAAAAAGCCACTTCGTCAGTGT T

GTGCAAAAAAACCAACACCACCACGAAGTAGATCGGGCTCTGGCAACCAACCCATAA A

AAAATCTCCGTCAGAATACCCTCCACGCGCCGTCACTTGCTACCGGAAGAAAATTTT CC

GGCGAAGTTCCGACGTCGCGTGGGCCACCTTCCAGCCATTTTTTGGCGACGACTCTT C

AGGACAAATTATTCCCCTTGCAATTCCGAGCCTACCCATCCAGGTTACACCAAATTC CA

G ACAACTTATATATTTTTTCCAG CATG CATAGTGATTTCAAAAGTG G ACTTCC GG CAATT

TTTTGAAAACGTTTCTTCAGAACAGTTGGGTCATCTGGTAATTCCGATCCTACCCCT ACT

GTTTTTATTTCATTCCGACCACTTTGAATTTTCCCGGCAGCTACAGTACTATTCCGA CTG

CTACAGTAATATTCCGATAGCTACAGTATTTCCTTATTCTGTTTCACTGTTCCTTAC TCTG

TTTTAGTGGATTAAATTTGATTATTTCTATAATTTGGTAATAATTTGCAACGATGTC TATG

GGAATTGATGCTTTTGGGTCTAAAAACATGAGTTCTGGAAGCTCTAGTGTTATGATT ACT

TCAAAACCTTTAATGTG AG GTTCAAACTACTTAG CTTG GG CTTCATCTGTCGAGTTGTG G

TGTAAAGGTGAAGGTGTTCAAGATCATCTAATTAAACAGTCTAGCGAAGGAGATGAA AA

GGCGATAGCGCTTTGGGCAAAGATTGATGCTCAATTATGTAGCATCTTGTGCCGTTC TA

TTGATTCCAAGTTGATGCCTTTGTTTCGTCCATTCCAGACATGTTATTTGGTTTGGG CAA

AGGCTCGTACCTTATACACTAATGACATATCTCGCTTCTATAATGTGATATCACGGA TGA

CAAACTTAAAGAAGCAAGAATTAGATATGTCTACTTAATTGGGTCAAGTACAAGCAA TCA

TG GAG GAATTTGAG ACATTAATG CCAGTTTCTG CTAGTGTG G CAAAACAACAAGAG CAG

CGACAAAAGATGTTCTAGTTCTTACACTCGCTAGACTTCCTAATGATCTTGATTCAG TGC

GAG AC CAG ATTTTG G CTAGTCCG ACTGTTC CCACAGTTGATG AATTATTCTCTC GATTAC

TCCGCCTTGCCGCACCACCAAGTCACCCAGTGATCTCATCACAAATACTTGATTCCT CT

CTCACATCGCAGACGGTGGATGTTCGGGCGTCTCAAACTATGAAGAACAGAGGAGGA C

GAGGTCGTTTTGGGAGATCTAGACCCAAGTGTTCTTATTGTCACAAACTTGGATACA CT

CGTG AAATGTG CTATTCCTTAC ATG GTCGTC CACC CAAAAATCTTAC GTTG CTCAG ACT

GAGACTACATGTAACCAAGGTTTTTCTGTATCTAAAGAAGAATATAATGAGCTCCTT CAG

TATCGAGCAAGTAAGCAGACATCTCCACAAGTAGCCTCAATTGCCCAGACTGATACT CC

AGTTGTTGGTAATTCTTTTGCTTGTGTTTCCCAGTCTAGTACTCTTGGACCATGGGT CAT

GGACTCAGGCGCTTCTGATCACATCTCTGGTAATAAATCACTTTTGTCGAATATTGT ATA

TTCACAGTCTCTTCCCACTGTTACTTTAGCCAAGGGATGTCAAACTAAGGCACAAGG AG

TTGGACAAGCTAACCCATTGTCTTCTATCACCCTAGATTCCGTTCTTTATGTCCTTG GTT

GTCCTTTTAGTCGTGCATCTGTTAGTCGTTTGACTTGTGCCCTCCATTGTGGTATAT ATT

TATTAATGATTCTTTTATTATGCAGGACCGCAGTACGGGACAGACAATTGGTACAGG AC

GTGAATCAGAAGGCCTTTACTACCTTAATTCACTCAGTCCTTCCACAACATGTCTAG TTA

CTGATCCTCCGGACCTAATCCACTGTCGTTTAGGACACCCAAGTTTATCCAAACTTC AG

AAGATGGTGCCTCTTTTAGGACACCCAAGTTTATCCAAACTTCACAGTCTACATTAG ATT

GTAAGTCGTGTCAGCTTGGGAAACATACCTGAGCTTCCTTTCCGCGTAGTGTTGAGA GT

CATGTAGAGTCTGTTTTCTCCTTGGTTCATTCTGATATATGGGGTCCTAGTAGAGTC AGT

TCAACCTTGGGATTTCGTTATTTTGTTAGTTTCATTGATGATTACTCAAGATGTACT TGGC

TTTTCTTAATGAAAGATCGTTCTGAGTTATTCTCTATATTCTAGAATTTTTGTGCTG AAAT AAAAAATAAATTTAGTGTCTCTATTTGCATTTTTCGTAGTGATAATGCCTTAGAATATGT A

TCTTCTCAGTTTCAGCAATTTATGACTTCTCATGGAATTATTCATCAGACATCTTGC CTTA

TACCCCTCAGCAAAATGGGGTTGCAGAGAGAAAGAATAGGCACCTTATTGAGACTGC T

CGTACACTTCTAATTGAATCTCGTGTTCCGTTGTGTTTTTGCGGCGATGTAGTTCTC ACA

GCTTGTTATTTGATTAATAGGATGCCTTCATCTCCCATCAAGGATCAGATTCCGCTT TCA

GTATTGTTTCCCCAGTCAGCCTTATACCCTCTTCCACCTCGTGTTTTTGGGAGCACA TAT

TTTGTTCATAACTTAGCCCCTAGGAAAGATAAGTTAGCTCCTCGTACTCTCAAGTGT ATC

TTCCTTGGCTATTCTCGTGTTCAGAAGGGATATCGTTGTTATTCACTTGATCTCCGT AGG

TATCTTATGTCAGCTGACGTCACATTTTTTGAGTCTAAACCTTTCTTTGCTTCTGCT GAC

CACCATGATATATCTGAGGTCTTACCTATACCGACCTTTGAGGAGTTTCCTATAGCT CCT

CCTCCACCTTCGAACACAGAGGTTTCACCCATACTAACCATTGAGGAGTCTAGTGTT GT

TCCTCCTAGTTCCCCAGTCACAGGAACATCACTCTTGACTTATCATCGTCGTCTGCG CC

CTACATCAGGCCCAACTGGTTCTCGTCCTGCACCTAACCCTGCTCCTACTGCGGACC C

TGCTCCTAGGACACTGATTGCACTTCGAAAAGGTATACGGACCACACTTAACCCTAA TC

CTCATTATGTTGGTTTGAGTTATCATCGTCTGTCATCTCCCCATTATGTTTTTATAT CTTC

TTTGTCCTCGGTTTCCATCTCTAAGTCTACAGGTGAAGCGTTGTCTCATCCAGGATG GC

G ACAG GCTAG GAGTGATG AG ATGTCTGTTTTACATACAAGTG GTACTTG G GAG CTTGTT

CCTCTTCCTTCGGGTAAATCTACTGTTGGCTGTCGTTGGGTTTATGCGGTCAAAGTT GG

TCCCGATGGCCAGATTGATCGACTTAAGGCCCATCTTGTTGCCAAAGGATATACTTA GA

TATTTGGGCTCGATTACAGTGATACCTTCTCTCTTGTGGCTAAAGTGGCATCAGTCC GC

CTTTTTCTATCCATGGCTGCGGTTCGTCATTGGCCCCTCTATCAGCTGGACATTAAG AA

TGTCTTTTTTCACGGTGATCTTGAGGATAAGGTTTATATGGAGCAACCACCTGGTTT TGT

TGCTCAGGGGGAGTCTCGTGGCCTTGTATGTCGCTTGCGTCGGTCACTTTATGGTCT TA

AGCAATCTCCTCGAGCCTGGTTTGGTAAGTTCAGCACGGTTATCCAGGAGTTTGGCA TG

ACTCGTAGTGAAGCTGATCACTCTGTATTTTATCGGCACCCTGCTTCAAGTCTATGT ATT

TATCAGGTAGTCTATGTTGATGATATTGTTATTACTCGCAATGATCAGGATGGTATT ACT

AATCTGAAGAAGCATCTCTTCCAGCATTTTCAAACTAAGGATCTAGGCAGATTGAAG TAC

TTTCTAGGTATTGAGGTTGCTCAATCTAGCTCAGGTATTGTTATTTCTCAAAGGAAA TAT

GCTTTAGACATTCTTGAGAAGATAGGGATGATAGGTTGCAGACCTGTTGATACTCCA AT

GGATCCGAATTCTAAACTTCTGCCAGGACAGGGGGAGCCGCTTAGCGATCCTGCAAG C

TATAGGCGGTTGGTTGGTAAATAAAATTATTTCACAGTGACTAGACCCGACATTTCT TAT

CCTGTGAATGTTGTAAGTCAGTTTATAAATTCTCCCTATGATAGTCATTGGGATGCA GTC

GTCCGCATTATCCGGTATATAGAATCGGCTCCAGGCAAAGGATTACTGTTTGAGGAT CG

AGGTCATGAGCAGATCGTTGGGTACTCAAATGCTGATTGGGCAGGATCACCTTCTGA TA

GACGTTCTACGTCTGGATGTTGTGTTTTAGTAGGAGGAAATTTGGTGTCCTGGAAAA GC

AAG AAACAG AATGTAGTTG CTCG GTCTAGTG CAG AAG CAG AATATCG AG CAATG G CTAT

G GTAAC ATGTG AACTAGTCTG GACCAAACAATTG CTCAAG G AGTTGAAATTTG GTG AAA

TCGGTTAGATGGAACTTGTGGAACTTGTGTGCGATAATCAAGATGCCCTTCATATTG CA

TCAAATCTGGTGTTTCATGAGAGAACTAAACACATTGAGATTGATTGTCACTTCGTA AGA

GAGAAGATACTTTCAGGAGATATTACTACGAAGTTTGTGAGGTCGAATGATCAACTT GC

AGATATTTTCACCAAGTCCTTCACCGATCCTTGCATTGGTTATATATGTAACAAGCT CGG

TACATATGATTTGTATGCTCCGGCTTGAGGGGGAGTGTTAGATATAGATATGTAGTT GC

CCCACATTGGAATAGGTGTAGTATGCCCTTTGTATAGAGTAGCTATAAATAAGCTCA TCT

TGTATTGCATTAGACACACAATATCAATATATCATATTTTCTCCCGTGCCTTCTCAC ATCA

AGTATCAAGAGAACAACATTCTACTTACGCTAATCTTATTACAGTCATAAAGACAGA GAC

AAAGGTAACACAGACGTGAAATAAAATACTTCCAGATAACCTTTCATTATGGACAAT AAA

AGAAACATGTTCAGTCACGAAACTTTTTCCAAACTTAATTAAGGTAAGAGACGAAAG AAA

ACATCAATCAGTAAAGAAAAAGAAGGGGAAATTAAAAGGTGAAAAAGCAAAATGATA TTA

GATATTTGCAAGATTTTTAATATCACATTAGTCCCCACTCATCAAATGATGGTAGAG GGC

ATTTCATAAGGGCCAATCAACAGCCACAACAATTCGATAAGTGCTCCAGGAAACAAC TA

ACCTTTGATAAATCAATAGGAACATCAAGATAATGATCTAAGAAATTTGCATTCTGT TCT GGATCAAGAAGCTCCAACATAGCACTTGCTGGATCACCAGCATGTCCTCTTCCCAACTG

CCATAGAAGAAAAAATACAAAGCTATTATTAAATTTGATCGATAATTATACACATTC TTTA

AACATTTAGATTGCAGAAGAAATCCAAGCAAAAACATAATATGTTAATGCCCAAGAT TCT

TCCGGTGGTCATACACAGAAAAGCAAGATTTAAATAGTGGAAAAGAAGTTACACTTC AC

TCATAGTCAAAGCATATAATTGAAGTATGAACTCACAAACCATAAAATACCTTGTCA ATTT

CATCGATCAAAACAAGAGGATTAGCGGTTCCCACACTTTTTAAACATTGCACCATCT TCC

CCGGCATGGCACCAATATAAGTTCGTCGATGTCCCTTGAAGTGTATCCCAAAGTTAG TC

ATGAAAGTTCAAGATTATGTGCACAACAAAACAACTCTGTTTTAGCTGGTTAGATAT TTC

TGTTACCTTTATTTCAGCAACATCAGACAGCCCTCCAACAGAAAATCGGTAAAATTT GCG

GTTCAATGCACGTGCAATTGAACGACCTATACTGGTTTTGCCCACCCCAGGAGGGCC A

GAGAGGCATATGATTTTCCCTGTTCAAACAAAAATAAGCAATGAGTTTGTTTAGTTC CTG

ACTCAGATTCACAAATTAAACTGAGATAAATCAGATAAAGCCGCACATATGAAAGAC AGT

GCCAGTGTATAATTGCGATTAATATCATCACTTGAATCTAACATTGTCCTAGCTATG GTG

CAATTCACATTTTTCTAGAGTTGCCCTTTGTCTGTTTTTTCCTCAACACTTCATAGT TTTC

AAAGATTTTCCTCAACAAACAACTACATCTCAAGTTCAAATGTCCCAAAGATATACA AGT

ACAACTGGCAGTGAAAGTAAGTTATCAATAAAATACCATAGAGCGAGTGCCCAATCA TG

GACAACACAATATCAGAATTTAAGAAGAACACCAACAAAGTATGGCCCAAGGTGCAG CT

ATTTAACTCACATTAATTGCTTTTCAAGGGAAGAAATTAAGCTCAAGATATATTTCT TTAC

TCCACAGATAATCACAAGAAAACATTGGAGGAACTCTCGACTGTCACAAAAAGTATT AC

CTAAGTTTGACCAATAAATATTTCTCAAATGCCCTAAAATGTTCACTTCTTCCAATT TACT

CGTTCCAATTTGATGTCAATATAGTATGTGGGAACCATAAATCCCATGTTCTATGGA TAT

ATTTTCCATTGGTATATTATAATGTTATGAAAATGGACGGTAAGGAGATGGCTGGAT CAT

TTTCCGTTCTTTTAATATGATTATAGCTGAACCTTATAAAACTGAGGATTTTATTAA AAAT

GAG G GTCATTATTTTTAAAAAATAAG G CATTTACCAAC CTTGTG AGGTTCCTCTG AGTTT

TCCCACAGCTATAAATTCCAAGATCCTTTCCTTAACATCGGTTAACCCATAGTGGTC TTC

ATCAAGAATTTGTTCTGCCCGTAGTACATCAAAGTTTTCATCACTGCATTTTGGATC ATA

AAAG AAG AAAAGTTC AG G AAC C AATG C AATAC AC C C AG G C AG AG ATC AAAACTATAAAG

C C AAC AC C AC C AC CATC AG C AG C AAAC AAC AC G AACTATG AC AAATC AAG C C C C C ATCT

TAACCTACATAAGGAGCTCATCATAAGTACAATCCTAGTTTTGCTGCTCTCATGAGA TCC

TG GTCTAAACTGATACATTG GG ATGTCAAGG AAG CATC CTCAAAG CCAG AAGCATG ACA

TGTAAG AAC C C AAAATAAG G AAC AG GAG GAG AAAG AAAAGTT AAAG AC AAATG GAT ATT

ACTCGATAGAAATTTAGGCAAGTCACATCACTAGGCAGTAAGGTTAACTTGGAAGCC TA

G CAAATTTCAG AC AATAACAATTTTATTTCTCATAACTGTTTAG CCAG CTTCTAAC AAACA

GATTCCTCTAGTACAGCTAACCAAGGATAAGGTTAACATCAGTTGGATTGAACCCAA GA

CAATGTAAGGTAGAATAGATAGACAACAAACCTGTAACTACCCCATGGCAAGGCAGT CA

ACCAATCAAGATAATTACGTGTCACGTTAAATTCACTGGAACTAGCTTCCAACAGTT GCA

GTTTTGTCAGTTCTTCTTCAATAACTTGCATAACATGTACTGGTATTTTTTCTTTAT TAGG

CTCCAATCTTTCCCTGAACTTTGCTGCAGTAATAAACAAATATAGTGTCACACCATT AGA

TGTAATTAAAAAGGGAACAAAAAACTGCAGGTTTCCCTTAGTATTGACCAGAAAAAT CCA

AGCCCTTAAAGCGAACCTAATCATTAGGCAGTTTCTTCTCTCAATTTCAACACCAAA ACC

AGGTCTACCAAAACAATG CTACAATG CG GAG CTCAACTTTATCTTTGTG AAAATTATAGA

GAAAAAATAATGAGAACCGCATTTTAGGGTTTGCAGTAACGTCTATTATCACCAAAA CCT

G GAAGAAG CACCTTCTTAAG CATCACACTCCTCAACAG GAG CAAAGG AGAG AAGAAAG

AGCACATAACAG AAAAAG CAAGTAG GCTTCTAATTAG G AAGTG CACTG AAGG G AAAC AG

GACAGAATTCATGAGAAAAAGAAGACATTGGAGGAGGAAGATAATCTGAATCATGAG CA

GTCCATAACATGAATAAACAACAATGCAGTGCTAAAACGGAAAATGTGGCCTTCCAG TA

AATCCATGTTGAATTTGTGCCACATTACCAAATATGGTAGACAGTAACCTTTCGGAA TTG

AAGTTGAAGGATTCCAGTATTGTCTTATTAAGTGCTTTCGTGAAAACACAAAACAGC CCC

CTATAGGTACAATGTATTCTAATCTGTCAAAAGTTTGGAATGTCTCAAATAGTTTTA GAAA

GCATGTCAATAAAGTTGGTTGGACTGTGTACAAAGAAAATTCAACCTTCAATTTCCT ATA

TGTAAAGCCAATTATCGCTTGAATGCTATTCATGCTTCTAGTGAAGATTTTCAATGG TAA AC AG G AG ATC AATC AG C C AC C ATAAAACTTTTAC AAC AG AAAG G G C AAAAC ATATTG C A

AATG G CCTTTTCTATG CTACTG CTAAATG G AACTTTACCAGG G ACAAC ATAAG ATGTTTC

ACTGGCCAGATGGGAAAACACGTCTGGCCATAATAAATCCAAAAAGAACATTATCCA TT

TCAATAATAATTAAGAGACAACTAACCAGAAAGAGCTGTCTTGTCATCAGTCTCCAA ACC

TAGTTCCTGCATTAAGCCAATCAAAACCACGTCATAAGTACTTTTTTTTCAAATCAA CTCT

TTGTTTAAACTGTGAATCAAACATAAGAACCAGAAGCTTAACATATATATTATCTAG GAAT

AAACATATGTAAGTACTAAGTTTAAGGATGATAACACAAAAAAAGCTGCACATAATC CAC

ATGCCTTCTTTATGGCCTTTAATTGTTCATTCAACAAATAACGGCGTTGCTCTCCGC TTA

TTTTTTCTTCAATTGCTCTTGCTATTGATTCCTGCAGAAAACAAAGACGACGTAAAA TGC

TAAATGCATGCATAACAACTATTCAAAGTTCTGGTATGCGTCTTGACGTGCATTTAC CTG

AATCTTACTAATCTCCAT

SEQ 81 to 160 are putative protein sequences related to SEQ 1 to 80.

SEQ 81

MALRFSLIFLFSLFLTTSLLLSVNGNINGGEDDDILIRQVVGDDDDHLLNADHHFTIFKR RFGKTYA

SDEEHHYRFSVFKANLRRAMRHQKLDPSAVHGVTQFSDLTPAEFRRNFLGVNRRLRL PSDANK

APILPTEDLPSGFDWRDHGAVTSVKNQGSCGSCWSFSTTGALEGATYLSTGKLVSLS EQQLVD

CDHECDPEEKDSCDAGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTCKFDNTK VAAKV

ANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKKLDHGVLLVG YGTGFSPI

RMKEKPYWIIKNSWGEKWGENGYYKICRGRNVCGVDSMVSTVSAVSTSSH

SEQ 82

MGAKVFLVALFLSALLFPLASSSNDGLMRIGLKKMKFDQNNRLAARIESKEGDVLRASIR KYNFR

GKLGDSEDTDIVALKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSV PCFFHSK

FKSSESSTYKKNGKSAAIQYGSGAISGFFSQDNVKVGDLVVTDQEFIEATREPSVTF LVAKFDGIL

GLGFQEISVGNAVPVWYNMVQQGLIKDPVFSFWLNRNTEEEQGGEIVFGGVDPNHYK GEITYV

PVTHKGYWQFDMGDVLIEGKATGYCESGCSAIADSGTSLLAGPTTIITMINQAIGAS GVASQQCK

SVVEQYGQTIMDLLLAEAHPKKICSQVGVCTFDGNRGVSMGIESVVDEKAGRSTGLQ DGMCSA

CEMAVIWMENQLRQNQTQDRILNYVNELCERLPSPLGESAVDCGKLSSMPTVSFTIG GKVFDLV

PKEYILKVGEGAKAQCISGFTGLDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFA EAA

SEQ 83

MGSFLCFSVIVVLLVLQPCLAKKVYIVHMKNHQIPSSFATHHDWYNAQLQSLSSSSTSDE SSLLY

SYDTAYSGFAASLDPHEAELLRQSDDVVGVYEDTVYTLHTTRTPEFLGLNNELGLWA GHSPQEL

NNAAQDVVIGVLDTGVWPESKSYNDFGMPDVPSRWKGECESGSDFDPKVHCNKKLIG ARFFS

KGYQMSASGSFTNQPRQPESPRDQDGHGTHTSSTAAGAPVANASLLGYASGVARGMA PRAR

VATYKVCWPTGCFGSDILAGMERAILDGVDVLSLSLGGGSGPYYRDTIAIGAFSAME KGIVVSCS

AGNSGPAKGSLANTAPWIMTVGAGTIDRDFPAFATLGNGKKITGVSLYSGKGMGKKV VPLVYST

DSSASLCLPGSLDPKMVRGKIVLCDRGTNARVEKGLVVKEAGGVGMILANTAESGEE LVADSHL

LPAVAVGRKLGDFIRQYVKSEKNPAAVLSFGGTVVNVKPSPVVAAFSSRGPNTVTPQ ILKPDVIG

PGVNILAAWSEAIGPTGLEKDTRRTKFNIMSGTSMSCPHISGLAALLKAAHPEWSPS AIKSALMT

TAYVRDTTNSPLRDAEGGQLSTPWAHGSGHVDPHKALSPGLIYDITPEDYIKFLCSL DYELNHIQ

AIVKRPNVTCTKKFADPGQINYPSFSVLFGKSRVVRYTRAVINVGAAGSVYEVTVDA PPSVTVTV

KPSKLVFKRVGERLRYTVTFVSKKGVNMMRKSAFGSISWNNAQNQVRSPVSYSWSQL LD

SEQ 84

MGTKFILFILLFIFLFSSGFVACGGFYSFRNLNSSVSGIEFPNHPSFNAVSSSADSDCNY GVSQKS

KTHSIAQEVDGVDVKNGENEEVSIFGNQKKEAVKFQLRHRSAGKKIEAKDSVFESRA RDLSRIQT

LHTRIVEKKNQNYNSRLAKSNEKHVDKHKPVIAPAAVSLESYELSGKLMATLESGVS LGSGEYF

MDVFVGTPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPHYNPQDSTSFRNISCHDP RCKFVTS

PDPPQLCKSENQTCPYYYWYGDSSNTTGDFALETFTVNLTTTSGSEFRKVENVMFGC GHWNR

GLFHGAAGLLGLGRGPLSFASQLQSLYGHSFSYCLVDRNSNSSVSSKLIFGEDKELL KHPQLNF

TSLVGGKEVETFYYVQIKSVIVGGEVLNIPEETWNLSLEGLGGAIIDSGTTLSYFAD PAYEIIKEAF

VNKVKGYPIVQDFPILNPCYNVSGVKNLEFPSFGIVFGDGAVWNFPVENYFIKLEPE DIVCLAVLG

TPRSALSIIGNYQQQNFHILYDTKRSRLGYAPTRCADA

SEQ 85

MALTLKSLATPLLFGALFILILQVVAEQPISEAKVESAILQESIIKEVNENAKAGWKAAF NPRFSNFT

VSQFKRLLGVKPAREGDLEGIPILTHPKLLELPKEFDARKAWPQCSTIGRILDQGHC GSCWAFGA

VESLSDRFCIHHNLNISLSVNDLLACCGFLCGSGCDGGYPITAWRYFIRRGVVTEEC DPYFDNE

GCSHPGCEPGYPTPKCQRKCVKEILLWGKSKHYGVNAYRIHHDPNSIMTEIYKNGPV EVSFTVY

EDFAHYKSGVYKHVTGQSMGGHAVKLIGWGTSEQGEDYWLIANSWNRGWGDDGYFKI RRGT

NECGIEHNVVAGLPSAKNLNVELDDVSNAFLDASM

SEQ 86

TLVLHTSFYLLLSVASPGDCLLLSIFPFSFSSPRYFPYKQNTVKIISSNFLFSPFFQMGS FLCFSVIV

LFLVFQPCFSKKVYIVHMKNHQIPSSFATHHDWYNAQLQSLSSSSTSDESSLLYSYD TAYSGFA

ASLDPHEAELLRQSDDVVGVYEDTVYTLHTTRTPEFLGLNNELGLWAGHSPQELNNA AQDVVIG

VLDTGVWPESKSFNDFGMPNVPSRWKGECESGPDFDPKVHCNKKLIGARFFSKGYQM SASGS

FTNQPRQPESPRDQDGHGTHTSSTAAGAPVANASLLGYASGVARGMAPRARVATYKV CWPTG

CFGSDILAGMERAILDGVDVLSLSLGGGSGPYYHDTIAIGAFSAMEKGIVVSCSAGN SGPAKASL

ANTAPWIMTVGAGTIDRDFPAFATLGNGKKITGVSLYSGKGMGKKVVPLVYSTDSSA SLCLPGS

LDPKIVRGKIVLCDRGTNARVEKGLVVKEAGGVGMILANTAESGEELVADSHLLPAV AVGRKLG

DFIRQYVKSEKNPAAVLSFGGTVVNVKPSPVVAAFSSRGPNTVTPQILKPDVIGPGV NILAAWSE

AIGPTGLEKDTRRTKFNIMSGTSMSCPHISGLAALLKAAHPEWSPSAIKSALMTTAY VHDTTNSP

LRDAEGGQLSTPFAHGSGHVDPHKALSPGLIYDITPEDYIKFLCSLDYELNHIQAIV KRPNVTCAK

KFADPGQINYPSFSVLFGKSRVVRYTRAVTNVAAAGSVYEVVVDAPPSVLVTVKPSK LVFKRVG

ERLRYTVTFVSNKGVNMMRKSAFGSISWNNAQNQVRSPVSYSWSQLLD

SEQ 87

MASSCLHAILLCFLLFITSTTAQNQTSFRPKGLILPITKDASTLQYLTQIHQRTPLVPVS LTLDLGGQ

FLWLDCDQGYVSSSYKPARCRSAQCSLAGAGSGCGQCFSPPKPGCNNNTCSLLPDNT ITRTAT

SGELASDTVQVQSSNGKNPGRNVTDKDFLFVCGATFLLEGLASGVKGMAGLGRTIIS LPSQFSA

EFSFPRKFAVCLSSSTNSKGVVLFGDGPYSFLPNREFSNNDFSYTPLFINPVSTASA FSSGEPSS

EYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSIYNAVTNFFV KELVNITRVASV

APFGACFDSRNIVSTRVGPAVPSIDLVLQNENVFWRIFGANSMVQVSENVLCLGFVD GGVNPRT

SIVIGGYTIENNLLQFDLAGSRLGFTSSILSRLTTCANFNFTSIT

SEQ 88

MNPEKFTHKTNEALAGAHELALSAGHAQFTPLHMAVALISDHNGIFRQAIVNAGGNEEVA NSVE

RVLNQAMKKLPSQTPAPDEIPPSTSLIKVLRRAQSSQKSCGDSHLAVDQLILGLLED SQIGDLLKE

AGVSASRVKSEVEKLRGKEGRKVESASGDTTFQALKTYGRDLVEQAGKLDPVIGRDE EIRRVVR

ILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLADVRLIALDMGALVAGA KYRGEFEE

RLKAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGAT TLEEYRKYV

EKDAAFERRFQQVYVAEPSVTDTISILRGLKERYEGHHGVKIQDRALVVAAQLSSRY ITGRHLPD

KAIDLVDEACANVRVQLDSQPEEIDNLERKRIQLEVELHALEKEKDKASKARLVEVR KELDDLRD

KLQPLMMRYKKEKERIDELRRLKQKRDELIYALQEAERRYDLARAADLRYGAIQEVE TAIANLES

TSAESTMLTETVGPDQIAEVVSRWTGIPVSRLGQNEKEKLIGLGDRLHQRVVGQDHA VRAVAEA

VLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDDKLMIRIDMSEYMEQ HSVARLI

GAPPGYVGHDEGGQLTEAVRRRPYSVVLFDEVEKAHPTVFNTLLQVLDDGRLTDGQG RTVDFT

NTVIIMTSNLGAEYLLSGLMGKCTMETAREMVMQEVRKQFKPELLNRLDEIVVFDPL SHEQLRQ

VCRYQMKDVALRLAERGIALGVTEAALDVILSESYDPVYGARPIRRWLERKVVTELS KMLVKEEI

DENSTVYIDAGVGRKDLTYRVEKNGGLVNAATGQKSDILIQLPNGPRSDAVQAVKKM RIEEIEED

EMED

SEQ 89

MQSFKSASILRRLLQNSRLVSHSRSFCSVSTNALVDESQSTVLVEGKASSRTAILNRPHA LNALN

FSVVDRLLKLYKNWEDDPDIGFVVLKGSGKAFSAGGDIVTIYNLLKQDAGNLQDCKD FCWTINNL

VYVVGTLLKPHVALLNGITMGGGAGISIPGTFRVATEKTVFATPETLIGYHPDAGAS FYLSHLPGY

LGEYLALTGDKINGAEMISCGLATHYLHSAKLPLIEEQLGKLMTDDPSVIERSLENC GEIVHPDPT

SVLHRIETLNKCFSHDTVEEIIDALESEAAKKQDAWCVSTLRKLQETAPLSLKVSLR SIREGRHQT LDQCLIREYRMSVQAFSGQITNDFCEGVRARLVDRDFAPKWDPPSLDKVTDDMVDQYFSR LTA FEPELELPTQQREAFT

SEQ 90

MALTLKSLATPLLLGAFFILVLQVVAEKPISEAKVESAILKESIIKEVNENAKAGWKAAF NPQFSNF

TVSQFKRLLGVKPAREGDLEGIPLLTHPKLSELPKEFDARKAWPQCSTIGRILDQGH CGSCWAF

GAVESLSDRFCIHHNLNISLSVNDLLACCGFLCGSGCDGGYPISAWRYFIRRGVVTE ECDPYFD

NEGCSHPGCEPGYPTPKCQRKCVKENLLWGKSKHYGVNAYRIHRDPYSIMTEIYKNG PVEVSF

TVYEDFAHYKSGVYKHVTGQSMGGHAVKLIGWGTSEQGEDYWLIANSWNRGWGDDGY FKIRR

GTNECGIEHNVVAGLPSAKNLNVELDDVSDAFLDASM

SEQ 91

MGVLKKTLLLLFLCVFLGDISLCFSSKLYVVYMGSKDSDEHPDEILRQNHQMLTAIHKGS IEQAKT

SHVYSYRHGFKGFAAKLTEAQASEISKMPGVVSVFPNTKRSLHTTHSWDFMGLSDDE TMEIPG

FSTKNQINVIIGFIDTGIWPESPSFSDTNMPPVPAGWKGQCQSGEAFNASICNRKII GARYYMSG

YEAEEENGKTMFYKSARDSSGHGSHTASTAAGRYVANMNYKGLANGGARGGAPMARI AVYKT

CWSSGCYDVDLLAAFDDAIRDGVHVISLSLGPDAPQGDYFNDAISVGSYHAVSRGIL VVASVGN

EGSTGSATNLAPWMITVAASSTDRDFTSDILLGNGVRLKGESLSLSQMNTSTRIIPA SEAYAGYF

TPYQSSYCLDSSLNRTKAKGKVLVCLHAGSSSESKMEKSIIVKEAGGVGMILIDDAD KGVAIPFVI

PAATVGKKIGNKILAYINNTRLPMARILSARTVLGAQPAPRVAAFSSRGPNSVTPEI LKPDIAAPGL

NILAAWSPAASTKLNFNVLSGTSMACPHITGVVALLKAVHPSWSPSAIKSAIMTTAK LSDKHHKPII

VDPEGKRATPFDFGSGFVNPTNVLDPGLIYDAQPADYRAFLCSIGYDEKSLHLITRD NSTCDQTF

ASPNGLNYPSITIPNLRSTYSVTRTVTNVGKARSIYKAVVYAPTGVNVTVVPRRLAF TRYYQKMN

FTVNFKVAAPTQGYVFGSLTWRNKRTSVTSPLVVRVAHSNMGMMV

SEQ 92

MGAKAFLVAMFLSALLFPFASSSNDGLMRIGLKKMKFDQNNRLAARIESKEGDVLRGSIR KYNF

RGKLGDFEDTDIVALKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFS VPCFFHS

KYKSSESSTYKKNGKSAAIQYGSGAISGFFSQDNVKVGDLVVTDQEFIEATREPSVT FLVAKFDG

ILGLGFQEISVGNAVPVWYNMVKQGLIKDPVFSFWLNRNTEEEQGGEIVFGGVDPNH YKGEITY

VPVTQKGYWQFDMGDVLIDGKATGYCESGCSAIADSGTSLLAGPTAIITMINQAIGA SGVASQQC

KSVVEQYGQTIMDLLLAEAHPKKICSQVGVCTFDGNRGVSMGIDSVVDEKAGRSTGL QDGMCS

ACEMAVIWMANQLRQNQTQDRILNYVNELCERLPSPLGESAVDCGKLSSMPKVSFTI GGKVFDL

SPNEYILKVGEGAKAQCISGFTGLDIPPPRGPLWILGDIFMGRYHTVFDYGKLRVGF AEAA

SEQ 93

MTFFRSFLFFLLTLFVISSALDMSIISYDEQHGQMGTTHHRTDDEVRELYESWLVKHGKN YNAIG

EKERRFEIFNDNLRFIDEHNAENRSYKLGLNRFSDLTNEEYRAMFVGGRLDRKTRLM KSPKSNR

YAFQAGEKLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVTGELISLS EQELVD

CDRSYNQGCNGGLMDYAFDFIKNNGGIDTEDDYPYHAQDGTCDPYRKNARVVSIEGY EDVPEN

DEKSLMKAVANQPVSVAIEGGGRAFQHYSSGVFTGYCGTQLDHGVVVVGYGTENGED YWIVR

NSWGANWGESGYIKLQRNFANSTTGKCGIAMQASYPLKSGANPPNPGPSPPTPVTPS TVCDEY

YSCPQGTTCCCIYQYGEYCFGWGCCPYESATCCDDNYSCCPHDYPVCDVDAGTCLMS KDNPL

KVKALKRGPARVNWSGMKSNRKVSYV

SEQ 94

MANSYTSFNFFLAPIIFLAILGLQLQSSDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGS VEYYSA

WTQRDRFIKGRRLADTTNPTPLSFSGGNETFRLSSLGFLHYANVTVGTPGLSFLVAL DTGSDLF

WLPCDCSNCVRALETRSGRRINLNIYSPNTSSTGQIVPCNGTLCGQRRRCLSSQNAC AYGVAYL

SNNTSSSGVLVEDILHLETDNAQQKSVEAPIALGCGIRQTGAFLSGAAPNGLFGLGL ESISVPSM

LASKGLAANSFSMCFGPDGIGRIVFGDKGSPDQGETPLNLDQLHPTYNISLTGITVG NKITDVDFT

AIFDSGTSFTYLNDPAYKVITENFDSQAKQLRIQPDGEIPFEYCYGLSANQTTFEVP DLNLTMKG

GNQFFLFDPIIMLSLQDGSRAFCLAVVKSGDVNIIGQNFMTGYRVVFDREKMVLGWK PSDCYDS

RESNDKSTTLPVNKRNSTEAPSPSSVVPEATKGNGSGNEPATSFPSVPSSRPAINHA PAHFNSY

ICQLMMALFSLFSYYLIIVSS

SEQ 95

MVTKFSIFILVVLLRLFSFGSVASREIHNSGLNLNSSASGIEFPQHPSFNSVTASGNSDC SYGTSK

KSTTTHVITQEENRSDEKEDEDLMVSKNQPREAVKFHLRHRSAGQNIEAKDSIFEST TRDLGRIQ

TLHTRIVEKKNQNSISRQTKNSEKPTQSSSFEFSGKLMATLESGVSHGSGEYFMDVF VGTPPKH

FSLILDTGSDLNWIQSVPCYDCFEQNGPHYDPKDSISFKNISCHDPRCHLVSSPDPP QPCKSEN

QTCPYYYWYGDSSNTTGDFALETFTVNLTTPSGDSEIKKVENVMFGCGHWNRGLFHG AAGLLG

LGRGPLSFSSQLQSLYGHSFSYCLVNRNSNSSVSSKLIFGEDKELLKHANLNFTSLV GGKENHL

ETFYYVQIKSVIAGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIKQA FVNKVKHYPV

LEDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWNFPVENYFIKLEPEDIVCLAMLG TPHSAMSIIG

NYQQQNFHILYDTKRSRLGFAPTRCADA

SEQ 96

MPSSFSLLFLTLLLASISLSFSSTLNSNDDDFFLSSTPKFPLTMAEKLIRQLNLFPKHDI NKAAATG

DSAAVTEQRLFEKKLNLSYVGNSGSTVQDLGHHAGYYRLPHTKDARMFYFFFESRSR KNDPVVI

WLTGGPGCSSELAVFYENGPFKIADNMSLVWNDFGWDKVSNLIYVDQPTGTGFSYSS NDDDIR

HDERGVSNDLYDFLQAFFKAHPQYAKNDFYITGESYAGHYIPAFASRVHQGNKNKEG IYVNLKG

FAIGNGLTDPEIQYKAYTDYALDMKLIKKSDYNAIEKSYPKCQLAIKLCGKDGGTAC MAAYLVCTS

IFNKIMDIAGDKNYYDVRKRCEGDLCYDFSKMETFLNDQQVKKALGVGDIEFVSCSS EVYQAMQ

LDWMRNLELGIPSLLEDGIKLLVYAGEYDLICNWLGNSRWVHAMKWTGQKAFGKATQ VSFAVD

GVEKGVQKNYGPLTFLKVHDAGHMVPMDQPKAAMEMLQRWMQDKLSKEGHLAPM

SEQ 97

MTLTLKSLAAPLFLGAFCILILQVVAEKPISEAKVESAILQESIIKEVNENAKAGWKAAF NPRFSNFT VSQFKRLLGVKPAREGDLEGIPILTHPKLLELPKEFDARKAWPQCSTIGRILDQGHCGSC WAFGA VESLSDRFCIHHNLNISLSVNDLLACCGFLRGSGCDGGYPISAWRYFIRRGVVTEECDPY FDNE GFHTRVVNQDIPPQSVV

SEQ 98

MFRLVMVTKFSIFILVVLLRLFSFGFVASREIHNFGINLNFSASGIEFPQHPSFNSVTAS GNSDCSY

GTSKKSTTTHVITQEENNSDEKEDEDLMVSENQPREAVKFHLRHRSAGQNIEAKDSI FESTTRDL

GRIQTLHTRIVEKKNQNFISRQTKNSEKTTQSSSFEFSGKLMATLESGVSHGSGEYF MDVFVGT

PPKHFSLILDTGSDLNWIQSVPCYDCFEQNGPHYDPKDSISFKNISCDDPRCHLVSS PDPPQPC

KSENQTCPYYYWYGDSSNTTGDFALETFTVNLTTPNGDSEIKKVENVMFGCGHWNRG LFHGA

AGLLGLGRGPLSFSSQLQSLYGHSFSYCLVNRNSNSSVSSKLIFGEDKELLKHLNLN FTSLVGGK

ENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEI IKQAFVNKVK

RYPILDDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIKLEPEDIVCL AILGTPHSAM

SIIGNYQQQNFHILYDTKRSRLGFAPRRCADA

SEQ 99

MSGFRLPLLFHLLLPLTLFLQYVQSLPQNSSTVEFLPGFDGPLPFYLETGYIGVGKSEEV QLFYYF

VKSESNPKKDPLLLWLTGGPGCSSFTGVAYEVGPLAFGQKAYNGSLPILVSTPYSWT KFASILFL

EQPVNTGFSYATTSAASKCTDLQACDQVYEFLLKWFNNHPEFISNPFYVSGDSYSGI TVPVIVQL

ISDGIEAGKKPLINLKGYSLGNPLTFPEESNYQIPFCHGMGLISNELYESLKETCKG DCRNIDPTN

KLCLENFKMFKKLVSSINDQQILEPFCGTDSESPNPRQLSGERRSLEEDFIFLKHDD FICRESRVA

TRKLSNHWANDPSVQEALHVRKGTIRRAWARCRQSIMGTTYRVTFMNSIPYHVNLSS KGYRSLI

YSGDHDMVVPFQSTQAWIKYLNYSIIDDWRPWTIDGQVAGYTRSFSNHMTYATVKGG GHTAPE

YKREESFHMFKRWIAQQPL

SEQ 100

METNGLIKEILPRDAVNNMTRLILSNALYFKGEWNEKFDVSETKDHDFHLLNGGSIQAPF MTSKK KQYIAAFDCFKILRLPYKQGTDTRRFCMYFILPDAHDGLPALLEKISLEPGFLNNHVPYG KVRARK FLIPKFKITFGFEASNILKGLGLTLPFCGGSLTEMVDSPMPQNLSVSQVFHKSFIEVNEE GTEAAA VTATVIMTMSLIIEKEMDFVADHPFLFLIRDESTGAVLFIGSVMNPLAG

SEQ 101

MNESYGNSRASSSSTTSSLNSSSHGTEDDHTIARILAEEEENALKYGGNKLGRRLSHLDS IPHTP RVIGEIPDPNDATLDHGRLSSRLATYGLAEMQIEGDGNCQFRALSDQLYHNPEYHKHVRK EVVK QLKRFRKLYEGYVPMRYKSYLRKMKRLGEWGDHVTLQAAADRFGVKICLVTSFRDNGYID ILPK DIQPSRELWLSFWSEVHYNSLYEIGEVPARVRRKKHWLFF

SEQ 102

MSWLCPSLVLVLLIFQGPICTCSSISDLFESWCQQNGKTYSSEQERVYRLEVFEENYAYI IEHNS

KGNSTYTLNLNAFSDLTHHEFKNSFLGLSSSANDFIRLKTGSSSAGVFNDVGVVDIP SSLDWREK

GAVTKVKNQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDKSYNDGCGGG LMDYAFE

FVKKNGGIDTEEDYPFNEREGTCNKNKLQRRVVTIDGYTDVPQYDEDKLLKAVANQP VSVGICG

SERAFQSYSKGIFTGPCSTVLDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHM QRNSGN

QEGICGINKLASYPTKSSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWRLLGVCVSW KCCGL

DSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQPQKEAFSGKFGGLIYPF

SEQ 103

MCEPESEAARGVLSFLDVDQLFSSNYYGDGRKHDVEICHEQYARENHYHTSYCNVDNDEA IAH

VLQEDLSELSIAEDAESSHADEQYLQASTGVQHWHTPPREYYAGHDTSLEADDVGPS SSCSSP

GDRSYDGEEYTYTLEIQDEFELDGEVGKRINQLSAVPHVPRINGDIPSVDEATSDHQ RLLNRLQL

FDLVEHKVQGDGNCQFRALSDQFYRTPEHHKFVRQQVVSQFQHHPEMYEGYVPMEYG EYLTR

MSKSGEWGDHVTLQAAADSYGVKILVITSFKDTCYIEILPKNQKSNRVIYLSFWAEV HYNSIYPQ

GDFLPFDFKKKKKKWSFWNKH

SEQ 104

MPSLLQIFLPLFPFFFLVSFSVSHGPFLPKAIILPVNKDLSTFQYVTQVYMGAHLVPTNL VVDLGG

SFLWTNCGLTSVSSSQKLVPCNSLKCSMAKPNGCTNKICGVQSENPFTKVAATGELA EDMFAV

EFIDELKTGSIASIHEFLFSCASTTLLQGLARGAKGMLGLGNSRIALPSQLSDTFGF QRKFALCLS

SSNGAIISGESPYLSLLGHDVSRSMLYTPLISSKDGVSEEYYINVKSIKINGKKLSL NTSLFAMDEG

VGGTKISTIPPFTTMKSSIYKSFIEAYEKFAISMELNKVEAIAPFELCFSTKGIDVT KVGPNVPTTDL

VLQSEMVKWRIYGRNSMVKVSDEVMCLGFLNGGVNQKASIVIGGYQLEDNLLEFNLG TSMLGF

TSSLSMAETSCSDFMFHSVSKDSAFDS

SEQ 105

MGAKEVLILVLVCMFIVFPSCHGDDECLNPFLVDQNCYVKDYITKLANATETVKWMMKIR RQIHE

NPELAYEEFKTSGLIREELDRMGVKYRWPVAKTGVVATIGSGKPPFVALRADMDALP IQELAKW

EHKSKVDGKMHACAHDAHTAMLLGAAKILQQLRHNLQGTVVLIFQPAEERGHGAKDM IEEGVLE

NVEAIFGMHLVHKYESGVVASRPGEFLAGCGSFKATIRGKGGHAAVPHDSVDPILAA STSVISLQ

SIVSRETDPLESQVVSVAMIEGGHAFNIIPELATISGTYRAFSKKSFYGLRKRIEEV IRAQAAVHRC

TVEIDFDGRENPTLPPTINDERIYEHARKVSKMIVGEESFKIAPSFMGSEDFAVFLE KVPGSFFLL

GTKNEKIGAIYPPHNPHFIIDEDVLPIGAAIHATFAYSYLLNSTNKFTSHSS

SEQ 106

MKLNPYSWTKVASIIFLDLPVGTGFSYARTPTALQSSDLQASDQAYEFLYKWFLDHPEFL KNPLY

VGGDSYSGMVVPIITQIIATKNEMGIKPFVDLQGYLLGNPSTFKGEKNYEIPFAYGM GLISDELYE

SLTRNCKGEYQNTDPSNTQCLQDVHTFQELLKRINNPHILEPKCQFASPKPHLLFGQ RRSLNVK

FHQLNNPQQLPALKCRNDWYKLSSHWADDGQVREALHIRKGTIGKWVRCASLQYQKT IMSSIP

YHANLSAKGYRSLIYSGDHDKVVTFLSTQAWIKSLNYSIVDDWRPWIVDNQVAGYTR SYSNRMT

FATVKGAGHTAPEYKPRECLAMLKRLMSYKPL

SEQ 107

MCEPESEATRGVLSFLDVDQLFSSNYYGDGRKHDVEICHEQYARENQYHTSYCNVDSDEA IAH

LLQEELSELSIAEDAESSHADEQYFQASTGVQHWHTPPREYYAGHDTGLEADDVGPS SSCSSP

GDRSYDGEEYTYTLEIQDEFELDGEVGKRINQLSAVPHVPRINGDIPSVDEATSDHQ RLLDRLQL

FDLVEHKVQGDGNCQFRALSDQFYRTPEHHKFVRQQVVSQLKHHPEMYEGYVPMEYG EYLKR

MSKSGEWGDHVTLQAAADSYGVKILVITSFKDTCYIEILPKNQKSNRVIYLSFWAEV HYNSIYPQ

GDFLPFDLKKKKKKWSFWNKH

SEQ 108

MPSLLQIFLPLFPFFFFVSFSVSHGPFLPKAIILPVNKDLSTFQYVTQVYMGAHLVPTNL VVDLGG

SFLWTNCGLTSVSSSQKLVPCNSLKCSMAKPNGCTNKICGVQSENPFTKVAATGELA EDMFAV

EFIDELKTGSIASIHEFLFSCASTTLLQGLARGAKGMLGLGNSRIALPSQLSDTFGF QRKFALCLS

SSNGAIISGESPYLSLLGHDVSRSMLYTPLISSKNGVSEEYYINVKSIKINGNKLSL NISLFTMDEE

GVGGTKISTISPFTSMKSSIYRTFMEAYEKIAISVNLTKVESIAPFELCFSTEGIDV TKVGPNVPTM

DLVLQSEMVKWRIYGRNSMVKVSDEVMCWGFLDGGVNQKASIVIGGYQLENNLLEFN LGTSML

GFTSSLSTAETSCSDFMIHSVSKDSAFDS

SEQ 109

MKMSPALSLSVIQFPLCKSQDLSKDTNNPKIFSKETPCQKSYSDTRINRRKLLSGSGLSL VAGTL

AKPARAETEAPIEATSSRMSYSRFLEYLNEGAVKKVDFFESSAVAEIFNPALNKVQR VKVQLPGL

PPELVRKLREKDVDFAAHLPEMNVIGPLLDLLGNLAFPLILLGSLLLRTSSSNTPGG PNLPFGLGR

SKAKFQMEPNTGVTFDDVAGVDDAKQDFQEIVEFLKTPEKFAAVGAKIPKGVLLVGP PGTGKTL

LAKAIAGEAEVPFLSLSGSEFVEMFVGVGASRVRDLFNKAKENSPCLVFIDEIDAVG RQRGTGIG

GGNDEREQTLNQLLTEMDGFTGNTGVIVIAATNRPEILDQALLRPGRFDRQVSVGLP DIRGREEI

LKVHSNNKKLDKDVSLSVIAMRTPGFSGADLANLMNEAAILAGRRGKDKITSKEIDD SIDRIVAGM

EGTKMTDGKNKILVAYHEVGHGVCATLTPGHDAVQKVTLIPRGQARGLTWFIPGEDP TLISKQQ

LFARIVGSLGGRAAEEIIFGEAEITTGAAGDLQQITQIARQMVTMFGMSEIGPWALT DPATQSGD

VVLRMLARNQMSEKLAEDIDASVRHIIERAYEIAKNHIRNNREAIDKLVDVLLEKET LTGDEFRAIL

SEFTNIPSANINSKPIRELIEA

SEQ 110

MEGLHQLKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKG EDGV CNKKKSALSAAKITGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCS TWLN HAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA

SEQ 111

MGCRMKFLNVVLVVAAVMAAAAAVAFGAEKLPAGVLSLERIFPLNGKMELEEVRARDRAR HAR

MLQSFAGGIVNFPVVGSSDPYLVGLYFTKVRLGTPPREYNVQIDTGSDILWVTCSSC DDCPRTS

GLGVELNFYDATISSTASPISCADQVCASIVQTASAECSTETNQCGYSFQYGDGSGT TGHYVAD

LLYFDTVLGTSLIANSSAPIIFGCSTSQSGDLTKTDRAIDGIFGFGQQGLSVISQLS SHRITPKVFS

HCLKGEGNGGGILVLGEILDPRIVYSPLVPSQAHYNVYLQSIAVNGQLVPVDPSVFA TSGNRGTI

VDSGTTLAYIATEAYDPFVNAITAAVSPSVRPIISRGKPCFLVSSSIAEIFPPVSLN FDGGASMALR

PSDYLVHMGFVEGAAMWCIGFEKQDQGVTILGDLVLKDKIFVYDLARQRIGWADYDC SSSVNVS

ITSGKDEFINAGQLSVNRASGSLLFNPRHTRTIFHLLSLVLMIGSPFLT

SEQ 112

MTRASIILLLLLIATSIAAAQGGALTFDDDNPIRQVVVSDGLQELENGILQLIGQTRRAL SFVRFVR

RYGKRYDSVEEIKQRFEIYLDNLKMIRSHNKQRLSYKLGVNEFTDLTWDEFRRERLG APQNCSA

TTKSDLQLTNVNLPETKDWREAGIVSPVKKQGKCGSCWTFSTTGALEAAYAQAFGKN ISLSEQQ

LLDCAGAFNNFGCHGGLPSQAFEYIKYSGGLDTEEEYPYAGKAGVCKFSSENVAVKV VDSVNIT

KGAEDELKYAIAFIRPVSVAYQVVKGFKQYKGGIYSSTVCGNTPQDVNHAVLAVGYG VDNGTPY

WLIKNSWGAEWGDNGYFKMEMGKNMCGIATCASYPIVA

SEQ 113

MNPEKFTHKTNEALAEAHELAISAGHAQFTPLHMALALISDHNGIFRQAIVNAAGSEETA NSVER

VFKQAMKKIPSQTPAPDQIPPSTSLIKVLRRAQSLQKSRRDTHLAVDQLILGLLEDS QIGDLLKEA

GIGAARVKSEVEKLRGKDGKKVESASGDTNFQALKTYGRDLVEQAGKLDPVIGRDEE IRRVIRIL

SRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLSDVRLIALDMGALIAGAKY RGEFEERL

KAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTL EEYRKYVEK

DAAFERRFQQVYVAEPSVPDTISILRGLKEKYEGHHGVKIQDRALVVAAQLSARYIT GRHLPDKAI

DLVDEACANVRVQLDSQPEEIDNLERKRIQLEVELHALEKEKDKASKARLVEVRKEL DDLRDKLQ

PLTMRYKKEKERIDELRRLKQKRDELTYALQEAERRYDLARAADLRYGAIQEVEAAI ANLESSTD

ESTMLTETVGPDQIAEVVSRWTGIPVSRLGQNEKDKLIGLANRLHQRVVGQDDAVRA VAEAVLR

SRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDDKLMVRIDMSEYMEQHSV ARLIGAP

PGYVGHEEGGQLTEAVRRRPYSVVLFDEVEKAHPTVFNTLLQVLDDGRLTDGQGRTV DFTNTVI

IMTSNLGAEYLLSGLMGKCTMEKARDMVMQEVRKQFKPELLNRLDEIVVFDPLSHEQ LRQVCR HQLKDVASRLAERGIALGVTEAALDVILAQSYDPVYGARPIRRWLEKKVVTELSKMLVKE EIDEN STVYVDAASSGKDLSYRVEKNGGLVNAATGKKSDILIQLPNGVRSDAAQAVKKMKIEEIV DE

SEQ 114

MPEAPKKSFFTLSLVPFLPVYTLIRFNPPIESEPLISSSSDECQHDQKQQSDSRNYIVRF YHYKEP

EDHWNYLQNNLKFKGWQWIERKNPAARFPTDFGLVEIDESMKELLLEKFRKMNLVKD VSLDLS

YQRIVLEEKSEKNGAFANGKKRPGKIFTAMSFSEGQNYAVANTSIMRISWSRHLLMQ KSRVTSL

FGAHELWSKGHTGAKVKMAIFDTGIRADHPHFRNIKERTNWTNEDTLNDNVGHGTFV AGVIAG

QDEECLGFAPDAEIYAFHVFTDAQVSYTSWFLDAFNYAIATNMDVLNLSIGGPDYLD LPFVEKVW

ELTANNIIMVSAIGNDGPLYGTLNNPADQSDVIGVGAIDQSNHLASFSSRGMSTWEI PHGYGRVK

PDIVAYGREIMGSKISTRCKRLSGTSVASPVVTGIVCLLVSIIPESK

SEQ 115

MAQMKLSLSLFLSLVLLLAFSPSSFAKVSISSKLASKQAEKLIHELNLFPKESDNIVDRD PFPTAAS

RIVEKRFNFANLTNSSVISFEDLGHHAGYYKIKHSHAARLFYFFFESRGSKDDPVVI WLSGGPGC

SSELALFYENGPFSISNNLSLVRNEYGWDKVSNLIYVDQPTGTGFSYSSDRHDIRHS EAGVSDD

LYDFLQAFFEEHPELVKNDFYITGESYAGHYIPAFAARVHKGNKAKEGIHINLKGFA IGNGLTDPKI

QYAAYTDYALDMGLISKSDHDRINKILPVCEVAINLCGTDGKISCLAAYFVCNSIFS AVRARAGADI

NHYDIRKKCVGALCYDFSNMEKLLNMHSVKQALGVEDIEFVSCSTTVYQAMLVDWMR NLEAGIP

TLLEDGIKLLVYAGEYDLICNWLGNSRWVQAMEWSGQKEFVASPDVPFEVDSSEAGL LKSHGP

LSFLKVHDAGHMVPMDQPKVALEMLKRWIGGTLSQQTTETEDLVASI

SEQ 116

MAIHTSTLSISILVMLMFSAVTSSAEDMSIISYNEKHHTNGESTVWRTDDEIVSLYESWL VEHKKV

YNALGEKDKRFQIFKDNLRYIDEQNSAPEKSYKLGLTQFADLTNEEYKSIYLGTKPD GRSRLSYT

QSDRYAPKVGDSLPDSVDWRKKGVLVDVKNQGQCGSCWAFSAVASIEAVNKIMTGNL ISLSEQ

ELVDCDTADNQGCQGGLMDDAFKFVIQNGGIDTEEDYPYKAKDGKCDQARKNAKVVT IDGYED

VPANDEKALKKAVAGQPVSVAIEAGGKDFQHYKSGIFTGKCGAAVDHGVVAVGYGSE NGMDY

WIVRNSWGASWGENGYLRMQRNIGNPKGLCGIATIASYPVKTGQNPPKPAPSPPSPV KPPTQC

DDYNECPAGTTCCCVYKYYNYCFAWGCCPMEGATCCKDHNSCCPHDYPVCNVKAGTC SISKN

NPLGVKAMQHILAKPIGTFGNEGKKTPSS

SEQ 117

MACNRLHTELGNWQVNPPSGFNLEPSDYLQRWLIEVNGAPGTLYANETYQLQAEFPEHYP IKA

PQVIFLPPAPLHPDIYRDGHICLDILYDSWSPTMTVSSICISILSMLSSSTVKFPSS EMMDVPLILSK

HVFFSKFKADEDESNNANMVFSPVSIQIIFALIAAGSSGSTLDQLLAFLKFNSVEEL NSVYSRVITD

VLADGSPMGGPRLSVTNWAWVDQSLSFKHSFKQVMDNVYKAASASVDFRNKGDEVTG EVNK

WAEEKTNGLIKQILPPVAVNSGTSLILANALYFKGAWTEKLNASDTKDHEFHLLNGG SVQAPLMT

SKKRQYVKAFDGFQVLRLRYKQGEDKRFLNMYVYLPNARDGLPTLLEKISSEPGFLD RHVPYEK

VKVHEFLIPKFKISLGIEALEVLKGLELTLPFKGGLTEMVGENYPLAVANVFHKAFI EVNEEGAEAP

AAKAFHKAFIEVNEEAPVAPAVTVATMMFGCSMMKVEEEIDFVADHPFMFLVKDETAGVVLFVG

TLLNPLAVSPS

SEQ 118

LKVGSFFSSLIYSCNKASPNFYSYSFSLLSCFIELVNMGAKAFLVTILLSSLLFPLALST SNDGLVRI

GLKKIKFDQNNRLAARVESKEGEAVRASIRKYNNFHGNLGASEDTDIVALKNYMDAQ YFGEIGIG

SPPQKFTVIFDTGSSNLWVPSSKCYFSVPCFFHSKYKSSQSSTYKKNGKSAAIRYGT GAISGFFS

QDSVKVGDLIVQNQEFIEATREPSVTFLVAKFDGILGLGFQEISVGNAVPVWYNMVK QGLVKEPV

FSFWLNRNTKEDEGGEIVFGGVDPNHYKGKHTYVPVTRKGYWQFDMGDVLIDGQATG YCDNG

CSAIADSGTSLLAGPTTVITMINHAIGASGVVSQQCKAVVEQYGQTIMDMLLAEAHP KKICSQVG

LCTFDGTRGISMGIESVVDENAGKSSGLHDAMCSACEMAVVWMQNQLRQNQTQERIL NYVNEL

CERLPSPMGQSAVDCGKLSGMPSVSFTIGGRTFDLSPEEYILKVGEGPAAQCISGFI ALDVPPPR

GPLWILGDVFMGRYHTVFDFGKLRVGFAEAA

SEQ 119

MSKQNLEAPLLDPSPATFNRRKKWSFALCFLFALTAISFIGLRHHGHVGIWLIGDVERYN GKLQQ

NADVVESEQAVVAADDGRCSEIGISMLKIGGHAVDAAVATALCLGVVNPMASGLGGG GFMVVR

SSSTSEVQAIDMRETAPLAASQNMYDNNGKSKLEGALSMGVPGELAGLHAAWSKHGR LPWKT

LFQPAIKLARDGFVVAPYLAHHIASKAKLILKDPGLRQVIAPEGKLLRAGDICHNVK LSHSLELIAE

QGPEAFYNGEVGEKLVEDVKKAGGILTMDDLRNYKVETPEAVTVNAMGYTIVGMPPP SSGTLGI

SLILKILESYNAAEGSLGLHRLIEAMKHMFAFRMDLGDPDFVNISKTVSDMLSPSFA KAIRQKIFDN

TTFPPEYYMPRWSQLRDHGTSHFCIVDSDRNAVSVTTTVNYPFGAGVLSPSTGIVLN DEMGDF

STPSEISPDELPPAPANFIQPKKRPLSSMAPIIVLKDNQLAGVIGGSGGMKIIPAVV QVFINHFILGM

DPLAAVQSPRVYHELIPNVVLYENWTCIDGDHIELSDEKKHFLEERGHQLEAHNGGA ICQLIVQN

LPNSHLKLGRRSGKEYKNGVFHGMLVAVSDPRKDGRPAAI

SEQ 120

MLKKISSFNILLNMASHITLCIWLLFFFISIISLAKPETYIIHMDLSAMPKAFASHHNWY LTTLASLSD

SSTNHKEFLSSKLVYAYTNAINGFSASLSPSEFEAIKNSPGYVSSIKDMSVKIDTTH TSQFLGLNS

ESGVWPTSDYGKDIIIGLVDTGIWPESKSYSDYGISEVPSRWKGECESGIEFNSSLC NKKIIGARY

FNKGLLANNPNLNISMNSARDTDGHGTHTSSTAAGSYVEGASYFGYATGTAIGIAPK AHVAMYK

ALWEEGVYLSDVLAAIDQAITDGVDVLSLSLGIDAIPLHEDPVAIAAFAALEKGIFV STSAGNEGPY

YETLHNGTPWVLTVAAGTVDREFIGALTLGNGVSVTGLSLYPGNSSSSESSIVYVEC QDDKELQ

KSAHNIVVCLDKNDSVSEHVYNVRNSKVAGAVFITNITDLEFYLQSEFPAVFLNLQE GDKVLEYIK

SNSAPKGKLEFRVTHIGAKPAPKVATYSSRGPSPSCPSILKPDLMAPGALILASWPQ QSPVTDVT

SGKLFSNFNIISGTSMSCPHASGVAALLKAAHPEWSPAAIRSAMMTTSNAMDNTQSP IRDIGSKN

AAATPLAMGAGHIDPNKALDPGLIYDATPQDYVNLLCALNFTSKQIKTITRSSSYTC SNPSLDLNY

PSFIGFFNGNSSESDPRRIQEFQRTVTNIGDGMSVYTAKLTTMGKFKVNLVPEKLVF KEKYEKLS

YKLRIEGPLVMDDIVVYGSLSWVETEGKYVVRSPIVATSIKVDPLTGHN

SEQ 121

MEFYQKLATCSHLSLLCFILLHSIQVQGSYFDQEYGKQVLSSAIQDKDWLVSIRRIIHEY PELRFQ

EYNTSALIRTELDKLGIYYEYPFAKTGLVALIGSSSPPVVALRADMDALPLQELVEW EHKSKVTGK

MHGCGHDAHTAMLLGAAKLLNERKDKLNGTVRLVFQPAEEGGAGAYHMINEGALGDA EAIFGM

HVDFKRPTGSIGTSPGPILAAVSFFEAKIEGKGGHAAEPHATVDPILAASFAVVALQ QLISREVDP

LHSQVLSVTYVRGGSASNVIPPYVEFGGTLRSLTTEGLLQLQKRVKEVIEGQAAVHR CKAYIDMK

EEDFPAYPACINDERLHQHVGRVGKLLLGSENIKETEKVMAGEDFAFYQELIPGVMF QIGIRNEK

LGSTHAPHSPHFFLDEDVLPIGAALHTAIAEMYLNDYQHPIAV

SEQ 122

RHYIYGKLTSNMKTFGIPLAAHSRVLTGSYIRSLYLQILTPFLVHTTAQADNLNCDRSAT LNCDRS

ATEVCTDSEVSTDMEPGNSIVNGVPESIAEEDTAEPLDMDFEFYLSDDKATFKGSEI VMNEPLQS

TDISGRLNVLVSWSPKMLEQYNTGLFSSLPEVFKSGFFAKRPQESVSLYKCLEAFLK EEPLGPE

DMWYCPACKQHRQATKKLDLWRLPEILVIHLKRFSYNRFLKNKLETYVDFPTHDLDL SSYLAYK

DGKSSYRYMLYAISNHYGSMGGGHYTAFVHQGADRWYDFDDSHVYPISQDKLKTSAA YVLFYR

RVEEI

SEQ 123

MSRNSLKIHLSIGKIQPGSENKNGSPVYTDSGTCEHLSELRSRVGSNPFFNFRGCVKVRP LGRA

SIRREPPNELVRCGACGQAPPRLYACVTCAAVFCRVHAPSHPVGNASDPSLHSIAVD IDRAELF

CCGCRDQVYDRDFDAAVVLAQTEATVIGSIQDPPPQPENTRKRRRVEYKPWTPDVKE QVLIVG

NSSPLPSQLGNDSTTPEVQWGLRGLNNLGNTCFMNSVLQALLHTPPLRNYFLSDKHN RYFCQR

KNNSVITRSSSDNGNKNSTMLCLACDLDAMFSAVFSGDWTPISPAKFLYSWWKHASN LASYEQ

QDAHEFFISVLDGIHERMQNDKGKALSPGSGDCCIAHRVFSGILRSDVMCTACGFTS TTYDPCID

ISLDLELSQGSSAKMTSKKSHNTHKKEAESGKFSQNGRISTLMGCLDHFTRPEKLGS DQKFFCQ

HCQVRQESLKQMSIRKLPLVSCFHIKRFEHSVIKKMSRKVDHYLQFPFSLDMSPYLS SSILRSRF

GNRIFSFDGDEQDASCESSSEFELFAVITHTGKLDAGHYVTYLRLSNQWYKCDDAWI TQVSESI

VRAAQGYMMFYVQKMLYYKASENQVS

SEQ 124

MATHSSTLTISISLLLLLFFFFFSTLSSASDMSILTYDENQHFRTDDEVMSLYESWLLEH GKSYNA LDEKDKRFQIFKDNLRYIDEQNSVPNKSYKLGLTKFADLTNEEYRSMYLGTKTSDRRRLL KNKSD RYLPKVGDSLPDSVDWREKGVLVGVKDQGSCGSCWAFSAIASVEAVNSIVTGDVISLSEQ ELVD

CDTSYNDGCNGGLMDYAFDFIIKNGGIDTEEDYPYTGRDGRCDQSRKNAKVVTIDGY EDVPAN

NEKALQKAVANQPVSIAIEAGGHDFQHYVSGIFTGKCGTAVDHGVVAVGYGSENGMD YWIIRNS

WGASWGEKGYLRVQRNVASSKGLCGLAIEPSYPVKTGVNPPKPGPSPPSPIKPPTQC DDYAQC

PEGTTCCCVFEYYNSCFSWGCCPLEGATCCEDHYSCCPHDYPVCNIRAGTCSISKDN PLGVKA

MKHIHAEPIEAFINGGRKSSS

SEQ 125

MKKLFLVLFSLALVLRLGESFDFHEKELETEEKLWELYERWRSHHTVSRSLDEKDKRFNV FKAN

VHYVHNFNKKDKPYKLKLNKFADMTNHEFRHHYAGSKIKHHRSFLGASRANGTFMYA NVEDVP

PSVDWRKKGAVTPVKDQGKCGSCWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTS QNQGC

NGGLMDMAFEFIKKKGGINTEENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDED SLLKAVA

NQPVSVAIQASGSDFQFYSEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWG PEWGEK

GYIRMQREIDAEEGLCGIAMQPSYPIKTSSSNPTGSPATAPKDEL

SEQ 126

MARPQFTVILAIISLLIHYGVVSGFRLSDVTNGSSVFLPSPADGSRHTTMLLPLFPPKDT SRRAEIS

RRHLQKSPASARMSLHDDLLLNGYYTTHIWIGTPPQKFALIVDTGSTVTYVPCSECK KCGNHQD

PKFQPEMSSTYQSVKCNKACPCDHKRQQCIYERRYAEMSASYGLLGEDIISFGNLSE LAPQRAV

FGCEIAETGDLYSQRADGIMGLGRGDLSIVDQLVEKHVISDSFSLCYGGMDFGGGAM VLGGVK

PPADMAFTKSDFGHSPYYNIDLKEIHVAGKPLNLNPRVFGGKHGTILDSGTTYAYLP EAAFAAFK

NAVVKELHSLKQIEGPDPSFKDICFSGAGSNISELSKNFPRVDMVFSDGKKLTLSPE NYLFQHFK

VRGAYCLGIFPNGKNPASLLGGIVVRNTLVTYDRENKRIGFWKTNCSELWDRLNLSP PSPPSPS

VSSLDNTNSSAHLSPSSAPSGPPGYNTPVEIKVGLITFYLSLSVNCSELKPRIPELA HFIAQELDV

NVSQVGF

SEQ 127

MGAKSFLVAFFLSLLLFPLAFCTSNDGLVRIGLKKIKFDQNNRLAARVESKEGEALRASF RKYNN

LRGNLGASEDTDIVALKNYMDAQYFGEIGIGSPPQKFTVIFDTGSSNLWVPSSKCYF SVPCLFHS

KYKSSQSSTYKKNGKSAAIRYGTGAISGFFSQDSVKVGDLVVKNQEFIEATREPSVT FLVAKFDG

ILGLGFQEISVGNAVPVWYNMVKQGLVKEPVFSFWLNRNTEEDEGGEIVFGGVDPNH YKGKHT

YVPVTRKGYWQFDMGDVLIDGQATGYCDNGCSAIADSGTSLLAGPTTVVTMINHAIG ASGVVSQ

QCKAVVEQYGQTIMDMLLAEAHPKKICSQVGLCTFDGTRGVSMGIESVVDENAGKSS GLHDAM

CSACEMAVVWMQNQLRQNQTQERILNYVNELCERLPSPMGQSAVDCGKLSGMPSVSF TIGGR

TFDLSPEEYILKVGEGPAAQCISGFIALDVPPPRGPLWILGDVFMGRYHTVFDSGKL RVGFAEAA

SEQ 128

MVVAFVGIAKSIGQQCLRRSKPYSYSYFSSYVRSSNSKYGLQNWQFQSHRTLILQSASES VKLE

RLSDSDSGILEVKLDRPEARNAIGKDMLRGLQQAFEAVSNERSANVLMICSSVPKVF CAGADLK

ERKTMILSEVQDFVSTLRSTFSFLEGLHIPTIAAIEGIALGGGLEMAMSCDIRICGE DAVLGLPETG

LAVIPGAGGTQRLPRLVGKSIAKDIIFTGRKISGKDAVSIGLVNYCVPAGEARLKTL ELARDINQKG

PVALRMAKCAIDKGVELNMESALALEWDCYEQLLDTKDRLEGLAAFAERRKPRYKGE

SEQ 129

MCSSNSLYINPKPCKHLADYKVKNGMSGYSLIQECFKTTPYGRTTLEISKSELPRCSICS GHEGR

FYMCLICSSVLCCLSPESNHALLHSQCKAGHEISVDMERAELYCSVCCDQVYDPDFD KVVMCK

HIMGFPRTEIGVVESELRLSKRRRLSFGMDLDSKNMKTLFLRRDQKSKSCFPLVLRG LNNLGNT

CFMNSVLQVLLHAPPLRNYFLSDRHNRDICRKMSSDRLCLPCDIDLIFSAVFSGDRT PYSPARFL

YSWWQHSENLATYEQQDAHEFFISVMDRIHDKEGKASLATKDNGDCQCIAHRTFYGL LRSDVT

CTSCGFTSTTHDPCMDISLDLNSCNSSPKDFANKSSKPNESLVGCLDLFTRPEKLGS DQKLYCE

NCQEKQDALKQMSIKKLPLVLSFHIKRFEHSPTRKMSRKIDRHLQFPFSLDMKPYLS SSIVRKRY

GNRIFSFDGDESDISTEFEIFAVVTHSGMLESGHYVTYLRLRNQWYKCDDAWITEVD EEVVRAS

QCYLMYYVQKMLYHKSCEDVSCQPMSLRADTFVPIAGCC

SEQ 130

MKELHSLREIEGPDPNYKDICFSGAGSDISELSKSFPPIDMVFSNGKKLSLTPENYLFRH SKVRG AYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRENERIGFWKTNCSELWDRLNLSPSPPPPP LPSGL DNTNSSANLTPALAPSLPLEHAPGKIKIGLVSFDMSLSVDYSALKPRVPELAHFIAQELE VNVSQV

HLMNFSTEGNDSLIRWAIFPAGSANYMPNATATEIINRLAENRFHLPDTFGSYKLVK WDIEPPPK

RIRWQQNYLVVVFALLVVLIIGLSASLGWLIWRRRQEIPYNPVGSAETHEKELQPLN

SEQ 131

MVTVSVKWQKEVYPAVEIDTSQPPYVFKAQLYDLTGVPPERQKIMVKGGLLKDDADWSKV GVK

EGQRLMMMGTADEIVKAPEKGPVFAEDLPEEEQVVNVGHSAGLFNLGNTCYMNSTVQ CLHSV

PELKSALTEYNQLGRSNDLDHSSHLLTVATRDLFNDLDKNVKPVAPMQFWTVLRKKY PQFGQQ

SNGAFMQQDAEECWTQLLYTLSQSLKSPNSSGSPDIVKALFGIEFDNRIHCAESGEE STETETV

YSLKCHISQEVNHLHEGLKRGLKSELEKASPSLGRSAVYVKDSRINGLPRYLTIQFV RFFWKRES

NQKAKILRKVDYPLSLDVYDFCSEDLRKKLEGPRQVLRDAEGKKAGLKTSEKTSSST DGDVKMT

EAEESSSGSGEASKTTQEGVLPEKEHHLTGIYDLVAVLTHKGRSADSGHYVAWVKQE NGKWV

QFDDDNPIPQREEDIPKLSGGGDWHMAYICMYKARVVPM

SEQ 132

MEKKKEVIRLERESVIPVLKPRLIMALADLIEHSSDRAEFLKLCKRVEYTIHAWYLLQFE DLMQLYS LFDPVNGAKKLEQQKLSPEEIDILEQNFLTYLFQIMHKSNFKIASDEEIDVAHSGQYLLN LPITVDE SKLDKKLLEKYFAEHPHEDLPEFADKYVIFRRGIGIDRTTDYFFMEKVDMIIGRTWAWIL RKTRID RLFSRRSSSRRKKDPKKDDEINSEAEDHDLYVERIRIENMELSARSNQFSLHQVK

SEQ 133

MELTCSSPLSVNSTISFNPQLRRYGSVYPHKRCQTVFSLFPYCPSSSSHITITTATTAAC STSSST

SSLFGISLSHRPCSSIPRKIKRSLYIVSGVFERFTERSIKAVMFSQKEAKALGKDMV YTQHLLLGLI

AEDRSPGGFLGSRITIDKAREAVRSIWHDDVEDDKEKLASQDSGSATSATDVAFSSS TKRVFEA

AVEYSRTMGHNFIAPEHMAFGLFTVDDGNATRVLKRLGVNVNRLAAEAVSRLQGELA KDGREPI

SFKRSREKSFPGKITIDRSAEKAKAEKNALEQFCVDLTARVSEGLIDPVIGREIEVQ RIIEILCRRTK

NNPILLGQAGVGKTAIAEGLAINIAEGNIPAFLMKKRVMSLDIGLLISGAKERGELE GRVTTLIKEVK

KSGNIILFIDEVHILVGAGTVGRGNKGSGLDIANLLKPALGRGELQCIASTTMDEFR LHIEKDKAFA

RRFQPVLINEPSQADAVQILLGLREKYESHHKCIYSLEAINAAVQLSARYIPDRYLP DKAIDLIDEA

GSKSRMQAHKRRKEQQISVLSQSPSDYWQEIRAVQAMHEVILASKLTENDDASRLND GSELHL

QPASPSTSDEDEPPVVGPEEIAAVASLWTGIPLKQLTVDERMLLVGLDEQLKKRVVG QDEAVAAI

CRAVKRSRTGLKDPNRPISAMLFCGPTGVGKSELAKALAASYFGSESAMLRLDMSEY MERHTV

SKLIGSPPGYVGYGEGGTLTEAIRRKPFTVVLLDEIEKAHPDIFNILLQLFEDGHLT DSQGRRVSF

KNALIVMTSNVGSTAIVKGRQNTIGFLLADDESAASYAGMKAIVMEELKTYFRPELM NRLDEVVV

FRPLEKPQMLQILDLMLQEVRARLVSLEISLEVSEAVMELICQQGFDRNYGARPLRR AVTQMVE

DLLSESFLSGDLKPGDVAIINLDESGNPVVANKSTQSIHLSDANGNPVVTNR

SEQ 134

MKNIERLANVALLGLSLAPLVVNVDPNVNVIVTACLTVFVGCYRSVKPTPPSETMSNEHA MRFPL

VGSAMLLSLFLLFKFLSKDLVNAVLTCYFFVLGIAALSATLLPAIRRFLPKKWNDDL IIWHFPYFRS

LEIEFTRSQIVAAIPGTIFCVWYAKQKHWLANNVLGLAFCIQGIEMLSLGSFKTGAI LLAGLFVYDIF

WVFFTPVMVSVAKSFDAPIKLLFPTADAKRPFSMLGLGDIVIPGIFVALALRFDVSR GKGPQYFKS

AFLGYTFGLALTIFVMNWFQAAQPALLYIVPAVIGFLAVHCIWNGDVKPLLEFDEGK TKGAEEAD

AKESKKVE

SEQ 135

MAFSSSYFSFIFLILLFIISFVVGEIKPIYLPGTYQSSLEKQHVKSKIPFKVHYFPQILD HFTFLPKSS

KVFKQKYLINDNYWKQGGPIFVYTGNEGNIDWFAANTGFMLDIAPKFHALLVFIEHR FYGDSMPF

GKKSYKSPKTLGYLNSQQALADYAVLIRSLKQNLSSESSPVVVFGGSYGGMLASWFR LKYPHIAI

GAVASSAPILQFDKITPWSSFYDAVSQDFKEVSLNCYRVIKGSWTELDALSKHEEGL TEVSKLFR

TCKGLHSVYSARDWLWEAFVYTAMVNYPTKANFMMPLPAYPVQEMCKIIDGLPKGAS KISRAFA

AASLYYNYTKREKCFNLEGGDDAHGLRGWDWQACTEMVMPMTCSNESMFPPSSYSYK EFKE

DCKKKYGVEPRPHWITTEFGGYRIEQVLKRFGSNMIFSNGMQDPWSRGGVLKNISAS IVALVTQ

KGAHHVDFRSETKNDPGWLIMQRKQEVAIIQKWLEEYYRDLKQN

SEQ 136

MSRFSLLLALVVAGGLFASALAGPATFADENPIRQVVSDGLHELENAILQVVGKTRHALS FARFA

HRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRL GAAQNC

SATTKGNLKVTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAFG KGISLSE

QQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNGLCKFSSENVGV KVIDSV

NITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYTSTECGNTPMDVNHAVLAV GYGVENGV

PYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCASYPVVA

SEQ 137

MEKEHKYSLFLTKLKLFFLVTLSTFHGLSHGFQMDQARTLMSWRRSKMHAQTTTYATNED ETE

NLVFSDEKHVGNMEDDLIKDGLPAQPSNVMFKQYAGYVNVDVKNGRSLFYYFAEASS GNASSK

PLVLWLNGGPGCSSLGFGAMLELGPFGVNPDGKTLYSRRFAWNKVANVMFLESPAGV GFSYS

NTTSDYSKSGDKRTAEDAYRFLVNWFKRFPHYKGRDFYIMGESYAGFYVPELADIIV KRNMLPT

TNFYIQFKGIMIGNGIMNDETDEKGTLDYLWSHALISDETHRGLLQHCKTETETCQH FQNIAEAEL

GNVDPYNIYGPQCSINSKSRSSSPKLKNGYDPCEQQYVQNYLNLPHVQKALHANLTN LPYLWN

PCSNLDWKDTPATMFPIYKRLIASGLRILLYSGDVDAVVSVTSTRYSLSAMNLKVIK PWRPWLDD

TQEVAGYMVVYDGLAFATVRGAGHQVPQFQPRRAFALLNMFFANHS

SEQ 138

MANSYTSINFFLAPIIFLAILGLQLQSSDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGS VEYYSA

WTQRDRFIKGRRLAEADTANSTPLSFSGGNETFRLSSLGFLHYANVTVGTPGLSFLV ALDTGSD

LFWLPCDCSNCVRALETRSGRRINLNIYSPNTSSTGQIVPCNSTLCGQRRRCLSSQN ACAYGVA

YLSNNTSSSGVLVEDILHLETDNAQQKSVEAPIALGCGIRQTGAFLSGAAPNGLFGL GLENISVPS

MLASKGLAANSFSMCFGPDGIGRIVFGDKGSPAQGETPLNLDQLHPTYNISLTGITV GNKITDVD

FTAIFDSGTSFTYLNDPAYKVITENFDSQAKQPRIQPDGEIPFEYCYGLSANQTTFE VPDVNLTMK

GGNQLFLFDPIIMLSLQDRSGAYCLAVVKSGDVNIIGQNFMTGYRVVFDREKMVLGW KPSDCYD

SRGSNDKSTTLPVNKRNSTEAPSPSSVVPEATKGNGSGNEPATSFPSVQSSKPAANQ APAHFI

CQLMMALFSLFSYYLIIISS

SEQ 139

MAIHTSTLSISILVMLMFSVVSSSAAEDMSIISYNEKHHTNGESTVWRTDDEVMSLYESW LVEHK

KVYNALGEKDKRFQIFKDNLRYIDEHNSVPDKSYKLGLTQFADLTNEEYKSIYLGTK PDGRSRLL

NTQSDRYAPKVGDSLPDSVDWRKKGVLVDVKNQGQCGSCWAFSAVASIEAVNKIVTG NLISLS

EQELVDCDTSDNQGCQGGLMDDAFKFVIQNGGIDTEEDYPYKAKDGKCDQARKNARV VTIDGY

EDVPDNDEKALKKAVAGQPVSVAIEAGGKDFQHYKSGIFTGKCGAAVDHGVVAVGYG SENGM

DYWIVRNSWGASWGEKGYLRMQRNIGNPKGLCGIATIASYPVKTGQNPPKPAPSPPP VKPPTQ

CDDYNECPAGTTCCCVYEYYKYCFAWGCCPMEGATCCKDHNSCCPHDYPVCNVKAGT CSISK

NNPLGVKAMQHILAKPIGTFGNEGKKSPSS

SEQ 140

MEIKILLASLVIWYITCINVYADDMVRIELKRQSLDLSSISDARIYAKDLRGRNRNLAAP NDQIVYLK

NYHDVQYFAEIGIGSPPQRFIVVFDTGSSNLWVPSSRCFFSIACYLRSRYKSRLSNT YTKIGKSSK

IPFGTGSVHGFFSQDNVKVGGAVLKQQVFTEVTREGYLTLLRARFDGVLGLGFDQST TSRNVTP

VWYNMLLQHMVTKSIFSFWLNRDPTSKIAGEIIFGGMDWTHFRGQHTYVPVAQNGYW EIEIGDL

FIGSNSTGLCKDGCPAIVDTGTSFIAGPTTILTQINHAIGAEGIISLECKKVVSSYG DSIWERLIAGL

QPENVCNRIGLCTNNGSLCSSCEMIVFWIQVEIRKERSKEKAFQYANQLCEKLPNPG GKSFINC

DVFALPHITFTIGDKSFPLSPDQYVIRVDDSQGVHCISGFTTLNAHPRRPLWVLGDA FLRAYHTV

FDFGSSQIGFAESA

SEQ 141

MASIFALSLFFIIISFCITSITIPVQSDGHETFIIHVSKSDKPRVFATHHHWYSSIIRSV SQHPSKILYT

YSRAAVGFSARLTAAQADQLRRIPGVISVLPDEVRHLHTTHTPTFLGLADSFGLWPN SDYADDVI

IGVLDTGIWPERPSFSDEGLSPVPSSWKGKCATGPDFPETSCNKKIIGAQMFYKGYE ASHGPMD

ESKESKSPRDTEGHGTHTASTAAGSVVANASFYQYAKGEARGMAIKARIAAYKICWK NGCFNS

DILAAMDQAVNDGVHVISLSVGANGYAPHYLLDSIAIGAFGASEHGVLVSCSAGNSG PGAYTAV

NIAPWILTVGASTIDREFPADVILGDNRIFGGVSLYSGDPLTDAKLPVVYSGDCGSK YCYPGKLD

HKKVAGKIVLCDRGGNARVEKGSAVKQAGGVGMILLNLADSGEELVADSHLLPATMV GQKAGD

KIRHYVKSDPSPTATIVFRGTVIGKSPAAPRVAAFSSRGPNHLTPEILKPDVIAPGV NILAGWTGS VGPTDLDIDTRRVEFNIISGTSMSCPHASGLAALLKRAHPKWTPAAVKSALMTTAYNLDN SGKVF TDLATGQESTPFVHGSGHVDPNRALDPGLVYDIETSDYVNFLCSIGYDGDDVAVFVRDSS RVNC SEQNLATPGDLNYPSFSVVFTGESNGVVKYKRVMKNVGKNTDAVYEVKVNAPSSVEVSVS PAK LVFSEEKKSLSYEISFKSKSSGDLEMVKGIESAFGSIEWSDGIHNVRSPIAVRWRHYSAA SI

SEQ 142

MPSSLFLTLLLASISLSFSSTLNSNDDEFFLSSTPKFPLTMAEKLIRQLNLFPKHDINKA AATGDSE

QRLFERKLNLSYVGNSGSTVQDLGHHAGYYRLPHTKDARMFYFFFESRSRKNDPVVI WLTGGP

GCSSELAVFYENGPFKIADNMSLVWNDFGWDKVSNLIYVDQPTGTGFSYSSNDDDIR HDERGV

SNDLYDFLQAFFKAHPQYAKNDFYITGESYAGHYIPAFASRVHQGNKNKEGIYVNLK GFAIGNGL

TDPEIQYKAYTDYALDMKLIKKSDYNAIEKSYPKCQLAIKLCGKDGGTACMAAYLVC TSIFNKIMDI

AGDKNYYDVRKRCEGDLCYDFSKMETFLNDQQVKKALGVGDIEFVSCSSEVYQAMQL DWMRN

LEEGIPSLLEDGIKLLVYAGEYDLICNWLGNSRWVHAMKWSGQKAFGKATQVSFAVD GVEKGV

QKNYGPLTFLKVHDAGHMVPMDQPKAALEMLHRWMQDKLSKQGHLAPM

SEQ 143

MLVISDCYINSCKAFNFVINLPVMGHSHSHSSHSHSHFHSSKSSDDQNMDMGESITTQTD VSFM

LAKHVFSKEVKGDSNLVFSPLSIQIVLGLIAAGSKGPTKDQLLCFLKSKSIDELNSL YSHFVSVVFV

DGSPNGGPRLSVVNGVWIDQTLPFKPSYKKVVDKVYKAASNSVDFQCKAAEVANQVN QWAKM

KTNNLIKEILPHGTVNNMTRLIFANALYFKGVWNDKFNASETKDHKFHLLSGGSIKA PFMTSKNK

QYAVAFDGFKVLGLHYKQGKDMRRFCMYLILPDARDELPALLDKISSEPGFIDHHIP FEKAKMRK

FLIPKFKTTFGFEASKVLKGLGLTLPFSSGGLTEMVDSPLAGRLFVSQIFHKSFIEV NEEGTEAAA

VTASVIMTKSLIIEKEMEFVADHPFLFLIRDESTGAVFFIGSVLNPLAG

SEQ 144

MLRIGPSLRTARKLLNRNLHFQSPIIAGDVAPVHHRRQELHRFVRRCNYSSTVGNTSASA SFFST

LNNSNSSTTSTTPHVERAEENDSLQSNASEVEPVAAVEQRLSSGMVDAYLAIELALD SVVKIFTV

SSSPNYFLPWQNKSQRETTGSGFVIRGKRILTNAHVVADHTFVLVRKHGSPTKYRAT VQAVGHE

CDLAILVVESEEFWEGMNSLELGDVPFLQEAVAVVGYPQGGDNISVTKGVVSRVEPT QYVHGA

SQLLAIQIDAAINPGNSGGPAIMGDKVAGVAFQNLSGAENIGYIIPVPVIKHFIAGV EERGEYIGFC

SLGLSCQPTENAQIREYFQMQSKLTGVLVSRINPLSDASRVLKKDDIILSFDGVPIA NDGTVPFRN

RERITFDHLVSMKKPNETAELKVLRNGKVHDFKITLHPLQPLVPVHQFDKLPSYFIF AGLVFIPLTQ

PFLHEYGEDWYNASPRRLCERALRELPKKPGEQFIILSQVLMDDINAGYERLAELQV KKVNGVE

VLNLKHLRQLVEDGNQKNVRFDLDDEKVIVLNYESARIATSRILKRHRIPHAMSSDL TDDENAVEL

QSACSS

SEQ 145

ICREPPNELVRCGACGHAPPRLYACVTCAEVFCRVHAPSHPAGNAADPSLHCIAVDIDRA ELFC

CGCRDQVYNSDFDAAVALAQTEATVIGSIQDPPPHPESTRKRRRVEYKPWTPDVKEQ VLIVGNS

SPLPSQLGNDSTTPEVQWGLRGLNNLGNTCFMNSVLQALLHTPPLRNYFLSDKHNRY FCQRKN

SSVITRSSSDNGNKNSTMLCLACDLDAMFSAVFSGDRTPISPAKFLYSWWKHASNLA SYEQQD

AHEFFISVLDGIHERMQNDKGKALSPGSGDCCIAHRVFSGILRSDVMCTACGFTSTT YDPCIDISL

DLELSQGSSSKMTSKKSHNTHKKEAESGKFSQNGRISSLMGCLDHFTRPEKLGSDQK FFCQHC

QVRQESLKQMSIRKLPLVSCFHIKRFEHSVIKKMSRKVDHYLQFPFSLDMSPYLSSS ILRSRFGN

RIFSFDGDEQDASCESSSEFELFAVITHTGKLDAGHYVTYLRLSNQWYKCDDAWITQ VSENIVRA

AQGYMMFYVQKMLYYKASEKQVS

SEQ 146

MEGSPVLGEHAELIGVLSRPLRQRATAAEIQMVIPWEAITSACGSLLKEELQTRRKIHFD NGNLIS

VKNESPSNNIRNGPSNDTREHLLIDPVPPSLIEKAMTSICLITVDDGAWASGVLLNK QGLLLTNAH

LLEPWRFGKTSVNGSGYNTKSDVVLIPSDQSEHPGVEKFDIQRRNKHLIQKELKTPH FLVDNEQ

GSFRVNLAKTGSRIIRVRLDFMDPWVWTNAKVVHVSRGPLDVALLQLELVPDQLCPI TADFMCP

SPGSKAYILGHGLFGPRCDFLPSACVGAIAKVVEAKRSLLNQSSLGEHFPAMLETTA AVHPGGS

GGAVVNSEGHMIALVTSNARHGGGTVIPHLNFSIPCAALEPIFKFVEDMQNLSLEYL DKPDEQLS

SVWALTPPLSSKQSPSMLHLPMLPRGDSDGDTKGSKFAKFIADSEAMLKSATQLGKV ERLSNKL

VHSKL

SEQ 147

MLKALTSSCLQNRFHAVTTAFTPQVRRGTDSNTPLLRVLGSLRSSNRRGPYLSRRFFCSD STD

GSESNSEAAASEAKPAEKGGDADSKASAAIVPTVFKPEDCLTVLALPLPHRPLFPGF YMHIYVKD

PKVLAALLESRKRQAPYAGAFLMKDEQGTDPNVVSASDTEKNIYELKGKDMLNRLHE VGTLAQI

TSIKDDQVILIGHRRIRMTEVVSEEPLTVKVDHLKEQPYNKDDDVIKATSFEVLSTL RDVLKTSSL

WKDHVQTYIQ

SEQ 148

MERKHLWAALLLLAIACFVFPASSDSLLRISLKKRQLDISSLNVANVARLEDRYGKHVMK DIEKKK

KKKKSDTNSDIVSLKNYLDAQYYGDISIGSPPQNFTVIFDTGSSNLWVPSSRCYFSI ACWIHSKYK

ARKSSTYTKKGESCSIHYGSGSISGFLSQDNVQVGDLVVTDQVFIEATRESSVTFIV AKFDGILGL

GFKEIAVGNTTPVWYNMVKQDLVKEPVFSFWLNRDINAKEGGELVFGGVDPKHFKDK HTYVPL

TQKGYWQFKMGDFSIGNQSTGFCEGGCAAIVDSGTSLLAGPTAVVTQVNHAIGAEGV LSMECK

ETISQYGEMIWDLLVSGVTPDQICLQVGLCYLNGAQHLSSNIRSVVEKENEGSSIGE APLCTACE

MAVIWMQNQLKQKTTKESVLEYVNQLCEKLPSPMGQSVIDCNSISSMPNVTFNIGDK DFVLTPD

QYILKTGEGIATICLSGFVALDVPPPRGPLWILGNVFMGVYHTVFDYGNLQLGFAEA A

SEQ 149

SRSYYNILLLQYLFLFVMALILGWKILFILLFVIIGMCTSQVTSRNIQALSMLEKHELWM SSHGRTY

KNEAEKEKRLNIFKENVKFIESFNNNGTKKPYKLGINAFADLTAEEFLSYYTTGLKL SNSYSQIQS

SFKYENLSDVPSVMDWRKSGAVTRIKHQGQCGCCWAFSAVAALEGANKLSTNNLISL SEQQLL

DCTTENNGCNGGLMTTAYDFIIQNGGIATESNYPYEEYQDSCKSQEMNSAVKINRYE TLPSTES

ALLKAVAKQPVSIGIAVNEDFHLYQNGVYNGNCEGQELNHAVTVIGYGTENDGTKYW LIKNSWG

TSWGENGYMKIARDTGIEGGLCGITTLASYPVL

SEQ 150

MGLPEVVDVARNYAVMVRIQGPDPKGLKMRKHAFHLYNSGKTTLSASGMLLPSSFVNASV SKQ

IQGESKLHSFGGHFLVLTVASVIEPFVVQQDRGDISKDKPELIPGAQIDILWEGGNT LQNDIKVTN

KEGLNWLPAELLRVVDIPVSSAAVQSLVEGSSSSIEHGWEVGWSLAAYGNSRQSFTN TKRTQV

EKISFPSQTPMMEAQSSLPSVIGTSTTRIALLRVSSNPYEDLPALKVATWSRRGDLL LGMGSPFG

ILSPSHFFNSISVGSIANSYPPSPQNKALLIADIRCLPGMEGSPVLGEHAELIGVLS RPLRQRATAA

EIQMVIPWEAITSACGSLLKEELQTRRKIHFGNGNLISVKKESFSNNIQDGHANDTQ EHLQIDPVP

PSLIEKAMTSICLIAVDDGAWASGVLLNKQGLLLTNAHLLEPWRFGKTSVNGSGYNT KSDVVLIP

SDQSEHPGVEKFDIQRRNKHLIQKELKTPHFLVDNEQCSFRVNLANTGSRTIRVRLD FMDPWV

WTNAKVVHVSRGPLDVALLQLELVPDQLCPIIVDFMCPSPGSKAYILGHGLFGPRCD FLPSACV

GAIAKVVEAKRPLLNQSSLGGHFPAMLETTAAVHPGGSGGAVVNSEGHMIALVTSNA RHGGGT

VIPHLNFSIPCAALEPIFKFAEDMQNLSLEYLDKPDEQLSSVWALTPPLSSKQSPSM LHLPMLPR

GDSDGDTKGSKFAKFIADSEAMLKSATQLGKVERLSNKLVHSKL

SEQ 151

MDNPSEDSSDSPQQQPESPVNDDQRVYLVPYRWWKEAQESSPSDGKSVTLYAAAPAPSYG G

PMKIINNIFSPDVAFNLRREEESLSQSQENGEVGVSGRDYALVPGDIWLQALKWHSN SKAAAKN

GKSFSATDEDIADVYPLQLRLSVLRETSSLGVRISKKDNTVECFKRACRIFSVDTEP LRIWDLSGQ

TALFFSDENNKILKDSQKQSEQDMLLELQVYGLSDSVKNKVKKDEMSMQYPNGSSFL MNGTGS

GITSNLTRSSSSSFSGGPCEAGTLGLTGLQNLGNTCFMNSALQCLAHTPKLVDYFLG DYKREIN

HDNPLGMNGEIASAFGDLLKKLWAPGATPVAPRTFKLKLAHFAPQFSGFNQHDSQEL LAFLLDG

LHEDLNRVKNKPYVEAKDGDDRPDEEIADEYWNNHLARNDSIIVDVCQGQYRSTLVC PVCKKVS

IMFDPFMYLSLPLPSTSMRSMTVTVIKNGSDIQISAFTITVSKDGRLEDLIRALSTA CSLDADETLL

VAEIYNNRIIRYLEEPADSLSLIRDGDRLVAYRLHKGTEEAPLVVFTHQQIDEHYIY GKLTSNMKTF

GIPLAAHSRVLTGSDIRSLYLQILTPFLVHNTAQADNLNCDRSATEACTDSEVITDM EPGNSIVNG

VPESIAEEDTAEPLDMEFQFYLSDDKATFKGSEIVMNEPLQSTDISGRLNVLVSWSP KILEQYNT

GLFSSLPEVFKSGFFAKRPQESVSLYKCLEAFLKEEPLGPEDMWYCPACKQHRQATK KLDLWR

LPEILVIHLKRFSYNRFLKNKLETYVDFPTHDLDLSSYLAYKDGKSSYRYMLYAISN HYGSMGGG

HYTAFVHQGADRWYDFDDSHVYSISQDKLKTSAAYVLFYRRVEEI

SEQ 152

MASSSRVFVLLLLIIFNFLYISAQKTIKHKPFSMSFPLISTSLSHNSSSKALFLSSFMAS NNRRQTQ

NTKTMSRIPSLNYKSTFKYSMALIVTLPIGTPPQNQQMVLDTGSQLSWIQCHKKIPK RPPPTTSF

DPSLSSTFSVLPCTHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTLAEGNLVREK ITFSRSQS

TPPLILGCATESEDAEGILGMNLGRFSFASQAKVQKFSYCVPIRQGSHAVKPSGTFY LGQNPNS

HTFQYINLLTFPQSQRMPNLDPLAFTVGMVGIKIGGKKLNISGRVFRPNAGGSGQTI IDSGTEYTF

LVEEAYNKVREEIVRLVGPRLKKGYVYGGALDMCFDNRPIEIGRLIGDMTLQFENGV DILINKERM

LDEVEGGIHCVGIGRSESLGIASNIIGNFHQQNLWVEFDMRNRRVGFGKGECSRQV

SEQ 153

YTIIIFSLNMKIFSIFSLLLLLLLPILASCHEKQVYIVYFGGHKGEKALHEIEENHHSYL MSVKESEEE

ARYSLIYSYKHSINGFAALLTPHEASKLSELEEVVSVYKSEPRKYRLQTTRSWEFSG VEESVQPN

SLNKDNLLLKARYGKDVIIGVLDSGLWPESKSFSDEGLGPIPKSWKGICQSGDAFNS SNCNKKII

GARYYIKGYEQYYGPLNRTLDYLSPRDKDGHGTHTSSTAGGRKVPNVSAIGGFASGT ASGGAP

LARLAMYKVCWAIPKEGKEDGNTCFDEDMLAAMDDAIADGVDVISISIGTKEPQPFD QDSIAIGAL

YAVKKNIVVSCSAGNSGPAPSTLSNTAPWIITVGASSVDRAFLSPVILGNGKKFTGQ TVTPYKLE

KEMYPLVYAGQVINSNVTKDVAGQCLPGSLSPKKAKGKIVICLRGNGTRVGKGGEVK RAGGIGY

ILGNNKANGAELVADPHFLPATAVDYKSAMQILNYINSTKSPVAYIVPAKTVLHSKP APYMASFTS

RGPSAVAPDILKPDITAPGLNILAAWSGGSSPTKLDIDDRVVEYNIISGTSMSCPHV GGAAALLKAI

HPTWSSAAIRSALITSAGLRNNVGEQITDASGKPADPFQFGGGHFRPSKAADPGLVY DASYQDY

LLFLCASGIKDLDKSFKCPKKSHLPNNLNYPSLAIPNLNGTVTVSRRLTNVGAPKSV YFASAKPPL

GFSVEISPPVLSFKHVGSKRTFTITVKVRSDMIDSIPKDQYVFGWYSWNDGIHNVRS PIAVKLA

SEQ 154

MATRRSSSSALTALAASRSRLLSRFRPAVSRLSQNTLLGTGRCPPPNSGFFVAETTAALW PNYN

VLSKSFVHSYSTTAASSGQINNMDYTEMALEGIVGAVEAARTSKQQVVETEHLMKAL LEQKDGL

ARRIFTKAGLDNSSVLQETDQFISQQPKVVGDTSGPILGSHLSSLLENAKKHKKEMG DSFVSVEH

MLLSFLSDTRFGQKLFRNLQLTEKALKDAVNAVRGSQRVTDPNPEGKYEALEKYGND LTELARR

GKLDPVIGRDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEP LMNRKLMSL

DMGALLAGAKYRGDFEERLKAVLKEVSSSNGQIILFIDEIHTVVGAGATSGAMDAGN LLKPMLGR

GELRCIGATTLNEYRKYIEKDPALERRFQQVYCGQPSVEDAISILRGLRERYELHHG VKISDSALV

SAAVLADRYITERFLPDKAIDLVDEAAAKLKMEITSKPTELDEIDRAVLKLEMEKLS LKNDTDKASK

ERLNKLESDLKSLKAKQKELNEQWEREKDLMTRIRSIKEEIDRVNLEMEAAEREYDL NRAAELKY

GTLISLQRQLGEAEKNLADYRKSGSSLLREEVTDLDITEIVSKWTGIPLSNLQQSER DKLVFLENE

LHKRVVGQDMAVKSVADAIRRSRAGLSDPNRPIASFMFMGPTGVGKTELGKALAAYL FNTENAL

VRIDMSEYMEKHAVSRLVGAPPGYVGYEEGGQLTEVVRRRPYSVVLFDEIEKAHHDV FNILLQL

LDDGRITDSQGRTVSFTNTVVIMTSNIGSHYILETLQNTRDSQEAVYDAMKKQVIEL ARRTFRPEF

MNRIDEYIVFQPLDLKQVSRIVELQMRRVKDRLKQKKIDLHYTQEAISLLANMGFDP NYGARPVK

RVIQQMVENEVAMGVLRGDFSEEDMIIVDADASPQGKDLLPEKRLLIRRIENGSNMD AMVAND

SEQ 155

VNVKCFFVSFFFSFSCMSLFFLQGWNFETFCLKTQSFAVTNKNHRPHLHSHHSSFLCFHT SYLL

FFLILYIYIAKTTSRFAKTQQPPQKMSRFTMLVVLVLLLLCLCHLSVATIGSSSNKK STYIVHVAKS

QMPESFENHKHWYDSSLKSVSDSAEMLYVYNNVVHGFSARLTVQEAESLERQSGILS VLPEMK

YELHTTRTPSFLGLDRSADFFPESNAMSDVIVGVLDTGVWPESKSFDDTGLGPVPDS WKGECE

SGTNFSSSNCNRKLIGARYFSKGYETTLGPVDVSKESKSARDDDGHGTHTATTAAGS IVQGASL

FGYASGTARGMATRARVAVYKVCWIGGCFSSDILAAMDKAIDDNVNVLSLSLGGGNS DYYRDS

VAIGAFAAMEKGILVSCSAGNAGPGPYSLSNVAPWITTVGAGTLDRDFPAYVSLGNG KNFSGVS

LYKGDLSLSKMLPFVYAGNASNTTNGNLCMTGTLIPEKVKGKIVLCDRGINPRVQKG SVVKEAG

GVGMVLANTAANGDELVADAHLLPATTVGQTTGEAIKKYLTSDPNPTATILFEGTKV GIKPSPVV

AAFSSRGPNSITQEILKPDIIAPGVNILAGWTGGVGPTGLAEDTRRVGFNIISGTSM SCPHVSGLA

ALLKGAHPDWSPAAIRSALMTTAYTVYKNGGALQDVSTGKPSTPFDHGAGHVDPVAA LNPGLV

YDLRADDYLNFLCALNYTSIQINSIARRNYNCETSKKYSVTDLNYPSFAVVFLEQMT AGSGSSSS

SVKYTRTLTNVGPAGTYKVSTVFSSSNSVKVSVEPETLVFTRVNEQKSYTVTFTAPS TPSTTNVF

GRIEWSDGKHVVGSPVAISWI

SEQ 156

MLKALTSSCLQNRFHAVTTAFTPQVRRGTDSNTPLLRVLGSLRSSNRRVPYLSRRFFCSD STDG

SESNSEAAASEAKPAEEGGDADSKASAAMVPTVFKPEDCLTVLALPLPHRPLFPGFY MHIYVKD

PKVLAALLESRKRQAPYAGAFLMKDEQGTDPNVVSASDTEKNIYELKGKDMLNRLHE VGTLAQI

TSIKDDQVILIGHRRIRMAEVVSEEPLTVKVDHLKEQPYNKDDDVIKATSFEVLSTL RDVLKTSSL

WKDHVQTYIQHIGDFNYARLADFGAAISGANKLQCQQVLEELDVHKRLQLTLELVKK EMEISKIQ

ESIARAIEEKISGEQRRYLLNEQLKAIKKELGLETDDKTALSAKFRERLEPNKEKIP VHVMQVIEEE

LTKLQLLEASSSEFNVTRNYLDWLTALPWGNYSDENFDVLRAEQILDEDHYGLTDVK ERILEFIA

VGKLRGTSQGKIICLSGPPGVGKTSIGRSIARALNRKFYRFSVGGLSDVAEIKGHRR TYIGAMPG

KMVQCLKSVGTANPLVLIDEIDKLGRGHAGDPASAMLELLDPEQNANFLDHYLDVPI DLSKVLFV

CTANVVEMIPNPLLDRMEVISIAGYITDEKMHIARDYLEKATRETCGIKPEQVEVTN SALLALIENY

CREAGVRNLQKQIEKIYRKIALKLVREDGEIEPQNAEVGEVEAESIHLSDEIKSKEE IQAGAESAN

GSNDDKASENNAEAEAQGAPVNQTQKSANEDACLQDTQETEKATESEASKTVNKVVV DSPNL

ADYVGKPVFHAERIYDQTPVGVVMGLAWTSMGGSTLYIETSLVEQGEGKGALNVTGQ LGDVMK

ESAQIAHTVARTILQEKEPDNQFFANSKLHLHVPAGATPKDGPSAGCTMITSLLSLA MKKPVKKD

LAMTGEVTLTGKILPIGGVKEKAIAARRSDVKTIIFPSANRRDFDELAPNVKEGLDV HFVDDYKQIF

DLAF

SEQ 157

MQFFRRNPSLHRISSRFLNQVVKTSAYSTKKVYNAGQPTAATHPQLMKEGEITPGITSEE YMQR

RKKLLEFLPENSLAIVAAAPIKMMTDVVPYNFRQDADYLYITGCQQPGGVAVLGHDC GLCMFMP

EQSPQDALWQGETAGVDAALQIFKADLAYPINRLPQILSRMIESSSTVFHNVKTRTS SYLELEAY

KKAVSNYKVKDFSVYTHEARFVKSPAELKLMRDSASIACQALVQTMLYSKLFPDEGM LSAKFEY

ECRVRGAQRMAFNPVVGGGPNGSVVHYFRNDQKIEDGNLVLMDVGCELHGYVSDLTR VWPPF

GKFSSVQEELYNLILETNKECVELCRPGTTIREIHHYSVETLRRGFKEIGILKNDRR GRYEMLNPT

NIGHYLGMDVHDCSTIGNDRPLKPGVVITIEPGVYIPSCFDCPERFQGIGFRIEDEV LITESGYEVL

TASIPKEIKHLESLLNNFGSGRGTEIRAALS

SEQ 158

LLTSHKNHIILLPFLLYKIFISLQKQTLMASSTRVFVLLLLIIFNFLYISAQKTIKHKPF SMSFPLTSTSL

SHNSSSKALFLSSLLASNQRKQAPNTKTVSRIPSLNYKSTFKYSMALIVTLPIGTPP QNQQMVLD

TGSQLSWIQCHKKIPKRPPPTTSFDPSLSSTFSVLPCTHPLCKPRIPDFTLPTTCDQ NRLCHYSY

FYADGTLAEGNLVREKITFSRSQSTPPLILGCATESEDAEGILGMNLGRFSFASQAK VQKFSYCV

PIRQGSHAVKPSGTFYLGQNPNSHTFQYINLLTFPQSQRMPNLDPLAFTVGMVGIKI GGKKLNIS

GRVFRPNAGGSGQTIIDSGTEYTFLVEEAYNKVREEIVRLVGPRLKKGYVYGGALDM CFDNRPM

EIGRLIGDMTLQFENGVEILINKERMLDEVEGGIHCVGIGRSESLGIASNIIGNFHQ QNLWVEFDM

RNRRVGFGKGECSRQM

SEQ 159

MAALNFFIIFTSLVLPIASDPLLSTYVVHVDTKAKPSHYLTQDEWYNSVVESVLANKMDS DSTSPR

LFYSYDVVLQGFAARLTDQESEKLNKFPEVIHIFKDQSRIKLDTTRSPNFLGLNTGY GLWPQSNF

GDDVIIGLVDTGIWPESESFKDNGIGPIPTRWKGKCVDGIEFNATSSCNRKLIGARN FVKGVEND

YHHQSARDQNGHGTHTASTAAGTEVNGANVFGFAKGKARGIASKARIAMYKACGSSS CAESDI

LAAIESAIKDGVDILSLSLGYDDAPFYENPVAIATFAAVKRNIFVASSAGNLGPYPF SVHNTAPWV

TTVGAGSLDRDFPVEINLSNNKTFVGSSLYPGRISGKSYSLVYIENCSIMTIDRSKV ERKIVVCNT

SKIEALRNGILIQKAGGFGLIQLNLPTEGEGIRAMAYTLPSATLGYKEGIELLSYIK SNANPRAGFV

RRKDTVIGKKVRAPIVASFSSRGPNVVVPEVLKPDLIAPGLNILAAWPGDISPTRLK MDPRRVKFN

INSGTSMACPHIAGVAALVRAVHPDWSPAAIKSALMTTSTAFDNAQLPIIKHEDMEL ATPISIGAG

HVNPESAIDPGLIYDTDTSDYINLLCSLNYTEKQMKLFTNESNPCSGFTGSPLDLNY PSLSVMFR

PDSYVHVVKKTLTHVAVSKPEVYKVKIVNLNSEKVSLSIEPRKLIFNESLQKQSYVV KFESHYAFN

SSRKIAEQMAFGSILWESEKHNVRSPFAVMWVQQNFNNSRLYK

SEQ 160

MEISKIQESIARAIEEKISGEQRRYLLNEQLKAIKKELGLETDDKTALSAKFRERLEPNK EKIPVHV

MQVIEEELTKLQLLEASSSEFNVTRNYLDWLTALPWGSYSDENFDVLRAEQILDEDH YGLTDVK

ERILEFIAVGKLRGTSQGKIICLSGPPGVGKTSIGRSIARALNRKFYRFSVGGLSDV AEIKGHRRTY

IGAMPGKMVQCLKSVGTANPLVLIDEIDKLGRGHAGDPASAMLELLDPEQNANFLDH YLDVPIDL

SKVLFVCTANVVEMIPNPLLDRMEVISIAGYITDEKVHIARDYLEKATRETCGIKPE QVEVTDSALL

ALIENYCREAGVRNLQKQIEKIYRKIALKLVREDGEIEPQNAEVDEVKAESIHLSDEIKS KEEIQAG

AESANGSNDDEASENNAEAEAQGAENQTQKSANEDTCLQDTQETEKATESEASKTVN KVVVD

SPNLADYVGKPVFHAERIYDQTPVGVVMGLAWTSMGGSTLYIETSLVEQGEGKGALN VTGQLG

DVMKESAQIAHTVARTILLEKEPDNQFFANSKLHLHVPAGATPKDGPSAGCTMITSL LSLAMKKP

VKKDLAMTGEVTLTGKILPIGGVKEKAIAARRSDVKTIIFPSANRRDFDELAPNVKE GLDVHFVDD

YKQIFDLAF