FLORACK DION (CH)
BATTEY JAMES (CH)
US20040106198A1 | 2004-06-03 |
G. BEYENE: "Two new cysteine proteinases with specific expression patterns in mature and senescent tobacco (Nicotiana tabacum L.) leaves", JOURNAL OF EXPERIMENTAL BOTANY, vol. 57, no. 6, 1 March 2006 (2006-03-01), GB, pages 1431 - 1443, XP055220617, ISSN: 0022-0957, DOI: 10.1093/jxb/erj123
BOVET, L. ET AL.: "Gene expression changes during tobacco curing", 2013, XP002746297, Retrieved from the Internet
KAWASHIMA N ET AL: "STUDIES ON PROTEIN METABOLISM IN HIGHER PLANTS V SOME PROPERTIES OF A TOBACCO-D LEAF ENZ PROTEASE INCREASED DURING CURING INST COLUMN CHROMATOGRAPHY", AGRICULTURAL AND BIOLOGICAL CHEMISTRY, vol. 32, no. 9, 1968, pages 1141 - 1145, XP002746298, ISSN: 0002-1369
FLORIAN MARTIN ET AL: "Design of a tobacco exon array with application to investigate the differential cadmium accumulation property in two tobacco varieties", BMC GENOMICS, BIOMED CENTRAL LTD, LONDON, UK, vol. 13, no. 1, 28 November 2012 (2012-11-28), pages 674, XP021140791, ISSN: 1471-2164, DOI: 10.1186/1471-2164-13-674
Claims
1. A mutant, non-naturally occurring or transgenic tobacco plant cell comprising: (i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to any one of SEQ ID NO:1 to SEQ ID No: 80; (ii) a polypeptide encoded by the polynucleotide set forth in (i); (iii) a polypeptide comprising, consisting or consisting essentially of a sequence encoding a protease and having at least 95% sequence identity to SEQ ID NO:81 to SEQ ID No: 160; or (iv) a construct, vector or expression vector comprising the isolated polynucleotide set forth in (i), and wherein the expression or activity of said protease is modulated as compared to a control tobacco plant cell in which the expression or activity of said protease has not been altered.
2. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 1, wherein the expression or activity of said protease is upregulated compared to the control tobacco plant cell.
3. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 1, wherein the expression or activity of said protease is downregulated compared to the control tobacco plant cell.
4. A mutant, non-naturally occurring or transgenic tobacco plant cell according to any preceding claim, wherein the expression or activity is modulated of a protease selected from: at least one of SEQ ID NO: 1 to 16; or at least one of SEQ ID NO: 30 to 41; or at least one of SEQ ID NO: 17 to 22; or at least one of SEQ ID NO: 42 to 44; or at least one of SEQ ID NO: 45 to 61; or at least one of SEQ ID NO: 62 to 80; or at least one of SEQ ID NO: 23 to 29.
5. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 30 to 41 is modulated in an Oriental type tobacco.
6. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from SEQ ID NO: 17 to 22 is modulated in a Virginia type tobacco.
7. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 42 to 44 is modulated in a Burley type tobacco.
8. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 45 to 61 is modulated in a Virginia or Oriental type tobacco.
9. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 62 to 80 is modulated in a Burley or Oriental type tobacco.
10. A mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 23 to 29 is modulated in a Burley or Virginia type tobacco.
11. The mutant, non-naturally occurring or transgenic tobacco plant cell according to any preceding claim, wherein said mutation(s) is a heterozygous or homozygous mutation.
12. The mutant, non-naturally occurring or transgenic tobacco plant cell according to any preceding claim, wherein the expression of one or more proteases is increased by about 10% to about 1000%.
13. The mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 12, wherein the expression of one or more proteases is increased by at least 10%, at least 20%, at least 25%, at least 50%, at least 100%, at least 200%, at least 500%, at least 750% or up to 1000%.
14. A mutant, non-naturally occurring or transgenic plant or component or part thereof comprising the plant cell according to any preceding claim.
15. Plant material including biomass, seed, stem, flowers or leaves from the plant of claim 14.
16. A tobacco product comprising the plant cell of any of claims 1 to 13, at least a part of the plant of claim 14 or the plant material according to claim 15.
17. A method for preparing a tobacco plant with modulated levels of protease, said method comprising the steps of: (a) providing a plant comprising (i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to at least one of SEQ ID NO:1 to SEQ ID No: 80; (b) inserting one or more mutations into said polynucleotide of said tobacco plant to create a mutant tobacco plant; and (c) curing the tobacco plant material.
18. The method according to claim 17, wherein the tobacco plant in step (b) is a mutant tobacco plant, preferably, wherein said mutant tobacco plant comprises one or more mutations in one or more further sequence encoding a functional protease and having at least 95% sequence identity to at least one of SEQ ID NO:1 to SEQ ID No: 80.
19. A method according to claim 17 or claim 18, wherein the genome of a cell of a tobacco plant is modified by a genome editing technology or by genome engineering techniques selected from CRISPR/Cas technology, zinc finger nuclease-mediated mutagenesis, chemical or radiation mutagenesis, homologous recombination, oligonucleotide-directed mutagenesis and meganuclease-mediated mutagenesis.
20. A method for producing cured plant material, preferably cured leaves, or flowers with an altered flavour profile as compared to control plant material comprising the steps of: (a) providing a plant according to claim 14 or the plant material according to claim 15; (b) optionally harvesting the plant material therefrom; and (c) curing the plant material for a period of time such that the levels of at least one protease are modulated compared to control cured plant material.
21. The use of (i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to any one of SEQ ID NO:1 to SEQ ID No: 80; (ii) a polypeptide encoded by the polynucleotide set forth in (i); (iii) a polypeptide comprising, consisting or consisting essentially of a sequence encoding a protease and having at least 95% sequence identity to SEQ ID NO:81 to SEQ ID No: 160; or (iv) a construct, vector or expression vector comprising the isolated polynucleotide set forth in (i), for the modulation of the expression or activity of one or more proteases in tobacco during a tobacco curing procedure.
22. Use according to claim 21, wherein the curing procedure is selected from the group consisting of air curing, fire curing, smoke curing and flue curing.
FIELD OF THE INVENTION
The present invention concerns the use of proteases expressed in tobacco to alter the characteristics of cured tobacco products. In particular, the invention provides processes for altering the curing of tobacco leaf and modulating tobacco leaf composition by modulating the expression of one or more tobacco protease genes.
BACKGROUND OF THE INVENTION
Tobacco curing is a process of physical and biochemical changes that bring out the aroma and flavor of each variety of tobacco. After tobacco has been harvested, it is necessary to cure it and then age it before consumption, to improve its flavour. There are four common methods of curing, and the method used depends on the type of tobacco and its intended use.
Air-cured tobacco is sheltered from wind and sun in a well-ventilated chamber, where it air-dries for six to eight weeks. Air-cured tobacco is low in sugar, which gives the tobacco smoke a light, sweet flavor, and is high in nicotine. Cigar and burley tobaccos are air cured. In fire curing, smoke from a low-burning fire permeates the leaves. This gives the leaves a distinctive smoky aroma and flavour. Fire curing takes three to ten weeks and produces a tobacco low in sugar and high in nicotine. Pipe tobacco, chewing tobacco, and snuff are fire cured.
Flue-cured tobacco is kept in an enclosed heated area, but it is not directly exposed to smoke. This method produces cigarette tobacco that is high in sugar and has medium to high levels of nicotine. It is the fastest method of curing, requiring about a week. Virginia tobacco that has been flue cured is also called bright tobacco, because flue curing turns its leaves gold, orange, or yellow.
Sun-cured tobacco dries uncovered in the sun. This method is used in Turkey, Greece and other Mediterranean countries to produce oriental tobacco. Sun-cured tobacco is low in sugar and nicotine and is used in cigarettes.
Curing produces various compounds in the tobacco leaves that give cured tobacco its specific flavour and taste, such as for example a sweet hay, tea, rose oil, or fruity aromatic flavor.
During the first phase of curing, corresponding to the so-called yellowing phase and also known as color curing, the chlorophyll content is reduced. This phase takes between 2 and 8 days depending on the tobacco type. During this phase leaf metabolic activities are drastically changed. Not only is chlorophyll degraded but also, for example, starch and proteins. To date, the only methods for altering the curing process which have been proposed are based on altering the actual conditions to which the tobacco is exposed in the chosen curing procedure. Very little is known about gene expression in tobacco during curing, and moreover few data have been reported on the activities of proteases in tobacco leaf and their resulting products.
We have identified 80 protease genes that are activated during leaf curing in the three main tobacco types, Burley, Virginia and Oriental. We have found that specific protease expression is associated with particular flavour profiles in tobacco.
SUMMARY OF THE INVENTION
80 protease genes (SEQ ID NO: 1-80) were identified that are up-regulated in Burley tobacco upon air curing, Virginia tobacco upon flue curing and Oriental tobacco upon sun curing. Details on such up-regulation in one or more of the different tobacco types are summarised in Figure 2 and Tables 1 and 2.
Such gene sequences and their regulatory sequences can be used to modulate or modify protease activity during curing. The polynucleotide sequences SEQ ID NO: 1-80 include exon and intron sequences. The protein sequences relating to the coding sequence part of the polynucleotide sequences SEQ ID NO: 1-80 are depicted in SEQ ID NO: 81-160.
Accordingly, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell comprising:
(i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to any one of SEQ ID NO:1 to SEQ ID No: 80;
(ii) a polypeptide encoded by the polynucleotide set forth in (i);
(iii) a polypeptide comprising, consisting or consisting essentially of a sequence encoding a protease and having at least 95% sequence identity to SEQ ID NO:81 to SEQ ID No: 160; or
(iv) a construct, vector or expression vector comprising the isolated polynucleotide set forth in (i),
and wherein the expression or activity of said protease is modulated as compared to a control tobacco plant cell in which the expression or activity of said protease has not been altered.
Alteration of protease expression in tobacco cells during the curing process imparts different flavours to the cured tobacco and products manufactured therefrom. The effects of different genes on different tobacco flavour profiles are further discussed below.
In embodiments, the expression or activity of said protease is upregulated compared to the control tobacco plant cell. However, in certain embodiments, the expression or activity of said protease is downregulated compared to the control tobacco plant cell. In still further embodiments, at least one protease can be upregulated at the same time as at least one protease is downregulated in the same cell.
In an exemplary embodiment, therefore, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to any preceding claim, wherein the expression or activity is modulated of a protease selected from:
at least one of SEQ ID NO: 1 to 16; or
at least one of SEQ ID NO: 30 to 41; or
at least one of SEQ ID NO: 17 to 22; or
at least one of SEQ ID NO: 42 to 44; or
at least one of SEQ ID NO: 45 to 61; or
at least one of SEQ ID NO: 62 to 80; or
at least one of SEQ ID NO: 23 to 29.
In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 30 to 41 is modulated in an Oriental type tobacco. In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from SEQ ID NO: 17 to 22 is modulated in a Virginia type tobacco.
In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 42 to 44 is modulated in a Burley type tobacco. In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 45 to 61 is modulated in a Virginia or Oriental type tobacco.
In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 62 to 80 is modulated in a Burley or Oriental type tobacco.
In a specific embodiment, there is provided a mutant, non-naturally occurring or transgenic tobacco plant cell according to claim 4, wherein the expression or activity of a protease selected from at least one of SEQ ID NO: 23 to 29 is modulated in a Burley or Virginia type tobacco.
The mutant, non-naturally occurring or transgenic tobacco plant cell can be a tobacco plant cell wherein said mutation(s) is a heterozygous or homozygous mutation. In embodiments of the invention, the expression of the one or more proteases is increased by about 10% to about 1000%, for example by at least 10%, at least 20%, at least 25%, at least 50%, at least 100%, at least 200%, at least 500%, at least 750% or up to 1000%.
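The stated range of increase can be illustrated with a short calculation. The following is a minimal sketch only, assuming that expression is available as a single positive number per sample (for example a normalised transcript level); the function names `percent_increase` and `within_stated_range` are hypothetical and are not part of the disclosure.

```python
def percent_increase(control: float, modified: float) -> float:
    """Increase of the modified sample over the control, in percent.

    Assumption: both values are expression measurements on the same
    linear scale, and the control value is strictly positive.
    """
    if control <= 0:
        raise ValueError("control expression must be positive")
    return (modified - control) * 100.0 / control


def within_stated_range(control: float, modified: float,
                        low: float = 10.0, high: float = 1000.0) -> bool:
    """True if the increase lies between `low` and `high` percent.

    The defaults mirror the 'about 10% to about 1000%' range in the text;
    the tolerance implied by the word 'about' is not modelled here.
    """
    return low <= percent_increase(control, modified) <= high


if __name__ == "__main__":
    # A control level of 2.0 and a modified level of 6.0 is a 200% increase.
    print(percent_increase(2.0, 6.0), within_stated_range(2.0, 6.0))
```

On this convention an increase of 100% corresponds to a doubling of expression relative to the control, and an increase of 1000% to an eleven-fold level.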
In a second aspect, there is provided a mutant, non-naturally occurring or transgenic plant or component or part thereof comprising the plant cell according to the preceding aspect of the invention.
In a third aspect, there is provided plant material including biomass, seed, stem, flowers or leaves from the plant of the second aspect of the invention.
In a fourth aspect, there is provided a method for preparing a tobacco plant with modulated levels of protease, said method comprising the steps of:
(a) providing a plant comprising (i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to at least one of SEQ ID NO:1 to SEQ ID No: 80;
(b) inserting one or more mutations into said polynucleotide of said tobacco plant to create a mutant tobacco plant; and
(c) curing the tobacco plant material.
In some embodiments, the tobacco plant in step (b) is a mutant tobacco plant, preferably, wherein said mutant tobacco plant comprises one or more mutations in one or more further sequence encoding a functional protease and having at least 95% sequence identity to at least one of SEQ ID NO:1 to SEQ ID No: 80. Thus, a plant can be constructed in which one or more cells comprise multiple mutated proteases.
The mutated cells comprising modulated protease expression or activity impart a different flavour profile to tobacco leaf during the curing process. By replicating a leaf chemistry of one tobacco type in another, it is possible to transfer flavour characteristics to a tobacco type which does not normally possess those characteristics.
In embodiments, the genome of a cell of a tobacco plant is modified by a genome editing technology or by genome engineering techniques selected from CRISPR/Cas technology, zinc finger nuclease-mediated mutagenesis, chemical or radiation mutagenesis, homologous recombination, oligonucleotide-directed mutagenesis and meganuclease-mediated mutagenesis.
In a further aspect, therefore, there is provided a method for producing cured plant material, preferably cured leaves, or flowers with an altered flavour profile as compared to control plant material comprising the steps of:
(a) providing a plant or the plant material according to the foregoing aspects of the invention;
(b) optionally harvesting the plant material therefrom; and
(c) curing the plant material for a period of time such that the levels of at least one protease are modulated compared to control cured plant material.
In a still further aspect, there is provided use of
(i) a polynucleotide comprising, consisting or consisting essentially of a sequence encoding a functional protease and having at least 95% sequence identity to any one of SEQ ID NO:1 to SEQ ID No: 80;
(ii) a polypeptide encoded by the polynucleotide set forth in (i);
(iii) a polypeptide comprising, consisting or consisting essentially of a sequence encoding a protease and having at least 95% sequence identity to SEQ ID NO:81 to SEQ ID No: 160; or
(iv) a construct, vector or expression vector comprising the isolated polynucleotide set forth in (i),
for the modulation of the expression or activity of one or more proteases in tobacco during a tobacco curing procedure.
The curing procedure according to this aspect of the invention can be selected from the group consisting of air curing, fire curing, smoke curing and flue curing.
Modification or modulation of protease activity during curing can be through (further) up-regulation or down-regulation. Modification or modulation can be through genetic engineering using, for example, certain promoter sequences that are (at least) active during such curing. Modulation can also be through, for example, mutagenesis as claimed above of such sequences and/or their regulatory region, resulting in either up- or down-regulation, or complete knock-out, of the protease activity encoded thereby under the respective curing conditions.
In another embodiment there is provided the use of at least one of the 16 gene sequences SEQ ID NO: 1 to 16 (see Table 2), and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 16 sequences that are up-regulated in all three types of tobacco during curing for modifying the flavour of cured tobacco.
In another embodiment there is provided the use of at least one of the 12 gene sequences SEQ ID NO: 30 to 41, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 12 sequences, that are up-regulated both in air-cured Burley and flue-cured Virginia, in an Oriental type tobacco to modify the flavour of said tobacco during curing.
In another embodiment there is provided the use of at least one of the 6 gene sequences SEQ ID NO: 17 to 22, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 6 sequences, that are up-regulated both in air-cured Burley and sun-cured Oriental, in a Virginia type tobacco to modify the flavour of said tobacco during curing.
In another embodiment there is provided the use of at least one of the 3 gene sequences SEQ ID NO: 42 to 44, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 3 sequences that are up-regulated in both flue-cured Virginia and sun-cured Oriental tobacco, to modify the flavour of a Burley type tobacco during curing.
In another embodiment there is provided the use of at least one of the 17 gene sequences SEQ ID NO: 45 to 61, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 17 sequences, that are uniquely up-regulated in air-cured Burley, to modify the flavour in a Virginia or Oriental type tobacco during curing.
In another embodiment there is provided the use of at least one of the 19 gene sequences SEQ ID NO: 62 to 80, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 19 sequences, that are uniquely up-regulated in flue-cured Virginia, to modify the flavour in a Burley or Oriental type tobacco during curing.
In another embodiment there is provided the use of at least one of the 7 gene sequences SEQ ID NO: 23 to 29, and sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of said 7 sequences, that are uniquely up-regulated in sun-cured Oriental, to modify the flavour of a Burley or Virginia type tobacco during curing.
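The grouping set out in the preceding embodiments can be summarised as a lookup from SEQ ID NO ranges to the tobacco types in which each group is up-regulated during curing. The following is a minimal sketch for illustration only: the ranges and tobacco types are taken from the embodiments above, whereas the names `GROUPS`, `upregulated_in` and `candidate_targets` are hypothetical and not part of the disclosure.

```python
from typing import Dict, FrozenSet, Tuple

# The three main tobacco types, each paired in the text with one curing method:
# Burley (air-cured), Virginia (flue-cured), Oriental (sun-cured).
TYPES: FrozenSet[str] = frozenset({"Burley", "Virginia", "Oriental"})

# (first SEQ ID NO, last SEQ ID NO) -> types in which the group is up-regulated.
GROUPS: Dict[Tuple[int, int], FrozenSet[str]] = {
    (1, 16): frozenset({"Burley", "Virginia", "Oriental"}),
    (17, 22): frozenset({"Burley", "Oriental"}),
    (23, 29): frozenset({"Oriental"}),
    (30, 41): frozenset({"Burley", "Virginia"}),
    (42, 44): frozenset({"Virginia", "Oriental"}),
    (45, 61): frozenset({"Burley"}),
    (62, 80): frozenset({"Virginia"}),
}


def upregulated_in(seq_id: int) -> FrozenSet[str]:
    """Tobacco types in which the given polynucleotide SEQ ID NO is up-regulated."""
    for (first, last), types in GROUPS.items():
        if first <= seq_id <= last:
            return types
    raise ValueError(f"SEQ ID NO {seq_id} is outside the range 1-80")


def candidate_targets(seq_id: int) -> FrozenSet[str]:
    """Types in which the sequence is NOT up-regulated during curing, i.e. the
    types named in the embodiments as targets for flavour modification."""
    return TYPES - upregulated_in(seq_id)


if __name__ == "__main__":
    # SEQ ID NO: 68 belongs to the group uniquely up-regulated in Virginia.
    print(sorted(candidate_targets(68)))
```

The seven ranges cover 16, 6, 7, 12, 3, 17 and 19 sequences respectively, which together account for all 80 sequences. For SEQ ID NO: 1 to 16 the set of candidate targets is empty, since these sequences are up-regulated in all three types.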
As certain gene sequences are only up-regulated in one or two of the three tobacco types (as defined according to tobacco type and curing method), certain gene sequences can potentially be used to modify or modulate protease activity during curing such that the outcome with respect to leaf chemistry (for example the metabolite content of the cell) and the properties of the obtained tobacco leaf cell are changed, such that for example an air-cured Burley tobacco acquires certain characteristics of a flue-cured Virginia-type tobacco or sun-cured Oriental tobacco upon curing. This can be done, for example, by modulating the expression of one or more of the gene sequences that are up-regulated in one or two of the tobacco types and not in the other. For example, 17 gene sequences are uniquely up-regulated in air-cured Burley, 19 in flue-cured Virginia, and 12 in both types of tobacco during curing. By selectively modulating, in flue-cured Virginia, one or more of the 17 gene sequences that are only up-regulated in air-cured Burley, the leaf cell composition of the flue-cured Virginia tobacco upon curing can be altered towards a more Burley type. Using a genetic engineering approach this can be achieved using for example a promoter sequence that is active under the curing conditions of the targeted tobacco type. Promoter sequences of use therefor are, for example, the regulatory sequences driving the expression of the gene sequences listed here. Using a mutagenesis, genome editing or engineering approach, the mutated gene sequence can be active under the curing conditions of the targeted tobacco type.
In one example, a regulatory sequence is mutated such that the gene sequence downstream is active under the desired curing conditions. For example, by selectively modifying or modulating the expression of one or more of the 19 sequences that are uniquely up-regulated in flue-cured Virginia in an air-cured Burley type of tobacco, the leaf cell composition of the Burley type tobacco upon curing can be altered towards a more Virginia type. Also, by selectively modulating the expression of one or more of the 12 sequences that are up-regulated both in air-cured Burley and flue-cured Virginia, in a sun-cured Oriental tobacco, the leaf cell composition of the sun-cured Oriental tobacco upon curing can be altered such that it acquires Burley and Virginia characteristics.
In one embodiment, one of the gene sequences listed in SEQ ID Nos: 1-80, or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to one or more of the listed sequences, is up-regulated. In another embodiment, more than one of the gene sequences listed in SEQ ID Nos: 1-80, or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to such sequences, are up-regulated. In another embodiment, one or more of the gene sequences listed in SEQ ID Nos: 1-80, or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to such listed sequences, are down-regulated. In another embodiment, one or more of the sequences listed in SEQ ID Nos: 1-80, or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to such listed sequences, are up-regulated, and one or more sequences listed in SEQ ID Nos: 1-80, or sequences comprising, consisting or consisting essentially of a sequence having at least 95% sequence identity to such listed sequences, are down-regulated.
As curing conditions determine the ultimate leaf cell chemistry, such modification or modulation affects the way a consumer experiences a product made from such leaf material. Hence, the invention also provides tobacco leaves and products comprising such leaves, obtained according to the methods claimed above. Such products include but are not limited to chewing tobacco, tobacco sticks, extracts obtained therefrom and other smoking articles comprising such leaf material or a material derived therefrom.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1. Expression of CYP82E4 (AGD93125.1 / GI:444237502) increased after 48h curing in the three main tobacco types. SC, sun-cured; FC, flue-cured; AC, air-cured.
Figure 2. The expression of 80 senescence-activated protease genes increased in the three main tobacco types.
Figure 3. One APA 1 tobacco gene (SEQ 68) is only expressed during Virginia curing.
DEFINITIONS
The technical terms and expressions used within the scope of this application are generally to be given the meaning commonly applied to them in the pertinent art of plant and molecular biology. All of the following term definitions apply to the complete content of this application. The word "comprising" does not exclude other elements or steps, and the indefinite article "a" or "an" does not exclude a plurality. A single step may fulfil the functions of several features recited in the claims. The terms "about", "essentially" and "approximately" in the context of a given numerical value or range refer to a value or range that is within 20%, within 10%, or within 5%, 4%, 3%, 2% or 1% of the given value or range.
The term "isolated" refers to any entity that is taken from its natural milieu, but the term does not connote any degree of purification.
An "expression vector" is a nucleic acid vehicle that comprises a combination of nucleic acid components for enabling the expression of nucleic acid. Suitable expression vectors include episomes capable of extra-chromosomal replication such as circular, double-stranded nucleic acid plasmids; linearized double-stranded nucleic acid plasmids; and other functionally equivalent expression vectors of any origin. An expression vector comprises at least a promoter positioned upstream and operably-linked to a nucleic acid, nucleic acid constructs or nucleic acid conjugate, as defined below.
The term "construct" refers to a double-stranded, recombinant nucleic acid fragment comprising one or more polynucleotides. The construct comprises a "template strand" base-paired with a complementary "sense or coding strand." A given construct can be inserted into a vector in two possible orientations, either in the same (or sense) orientation or in the reverse (or anti-sense) orientation with respect to the orientation of a promoter positioned within a vector, such as an expression vector.
A "vector" refers to a nucleic acid vehicle that comprises a combination of nucleic acid components for enabling the transport of nucleic acid, nucleic acid constructs and nucleic acid conjugates and the like. Suitable vectors include episomes capable of extra-chromosomal replication such as circular, double-stranded nucleic acid plasmids; linearized double-stranded nucleic acid plasmids; and other vectors of any origin.
A "promoter" refers to a nucleic acid element/sequence, typically positioned upstream and operably-linked to a double-stranded DNA fragment. Promoters can be derived entirely from regions proximate to a native gene of interest, or can be composed of different elements derived from different native promoters or synthetic DNA segments.
The terms "homology, identity or similarity" refer to the degree of sequence similarity between two polypeptides or between two nucleic acid molecules compared by sequence alignment. The degree of homology between two discrete nucleic acid sequences being compared is a function of the number of identical, or matching, nucleotides at comparable positions. The percent identity may be determined by visual inspection and mathematical calculation. Alternatively, the percent identity of two nucleic acid sequences may be determined by comparing sequence information using a computer program such as ClustalW, BLAST, FASTA or Smith-Waterman.
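By way of illustration, the mathematical calculation of percent identity can be sketched for two sequences that have already been aligned. The following is a minimal sketch only: it assumes the alignment has been produced elsewhere (for example by one of the programs named above), uses "-" as the gap character, and counts identical positions over all aligned columns. Other conventions for the denominator exist, and the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumptions: '-' marks a gap; columns in which both sequences have a
    gap are ignored; a column with a gap in only one sequence counts as
    a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # shared gap column carries no information
        columns += 1
        if a == b:
            matches += 1
    if columns == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / columns


def meets_identity_threshold(aligned_a: str, aligned_b: str,
                             threshold: float = 95.0) -> bool:
    """True if the identity is at least `threshold` percent (95% as in the text)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    # Toy 20-nucleotide sequences differing at one position (19 of 20 identical).
    ref = "ATGGCTAGCTAGGATCCGTA"
    var = "ATGGCTAGCTAGGATCCGTT"
    print(percent_identity(ref, var), meets_identity_threshold(ref, var))
```

The toy sequences are invented for the example and do not correspond to any sequence of the sequence listing.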
A "variant" means a substantially similar sequence. A variant can have a similar function or substantially similar function as a wild-type sequence. For a protease, a similar function is at least about 50%, 60%, 70%, 80% or 90% of wild-type enzyme function under the same conditions. For a protease, a substantially similar function is at least about 90%, 95%, 96%, 97%, 98% or 99% of wild-type enzyme function under the same conditions. For example, wild-type protease sequences are set forth in SEQ ID Nos: 81-160. The variants can have one or more mutations that result in the enzyme having a reduced level of protease activity as compared to the wild-type protease. The variants can have one or more mutations that result in their protease activity being knocked out (i.e. a 100% inhibition, and thus a nonfunctional polypeptide). Variants can also have increased activity, leading to a more active protease enzyme function.
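The categories of variant function described above can be illustrated by comparing a measured variant activity with the wild-type activity under the same conditions. The following is a minimal sketch only: it assumes activity is available as a single non-negative number per enzyme, and it adopts the lowest listed thresholds (50% for a similar function, 90% for a substantially similar function) as one possible choice among the alternatives recited. The function name `classify_variant` and the category labels are hypothetical.

```python
def classify_variant(variant_activity: float, wild_type_activity: float,
                     similar_min: float = 50.0,
                     substantially_similar_min: float = 90.0) -> str:
    """Classify a protease variant by its activity relative to wild type.

    Assumptions: both activities were measured under the same conditions,
    wild-type activity is positive, and variant activity is non-negative.
    """
    if wild_type_activity <= 0:
        raise ValueError("wild-type activity must be positive")
    if variant_activity < 0:
        raise ValueError("variant activity cannot be negative")
    relative = 100.0 * variant_activity / wild_type_activity
    if relative == 0.0:
        return "knocked out"            # 100% inhibition
    if relative > 100.0:
        return "increased"              # more active than wild type
    if relative >= substantially_similar_min:
        return "substantially similar"
    if relative >= similar_min:
        return "similar"
    return "reduced"


if __name__ == "__main__":
    # Illustrative values only: variant activity as a fraction of wild type.
    for activity in (0.0, 0.3, 0.6, 0.95, 1.5):
        print(activity, classify_variant(activity, 1.0))
```

A stricter reading of the definition can be obtained by passing a higher value, for example `similar_min=80.0` or `substantially_similar_min=99.0`.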
The term "plant" refers to any plant or part of a plant at any stage of its life cycle or development, and its progenies. In one embodiment, the plant is a "tobacco plant", which refers to a plant belonging to the genus Nicotiana. Preferred species of tobacco plant are described herein.
"Plant parts" include plant cells, plant protoplasts, plant cell tissue cultures from which a whole plant can be regenerated, plant calli, plant clumps and plant cells that are intact in plants or parts of plants such as embryos, pollen, anthers, ovules, seeds, leaves, flowers, stems, branches, fruit, roots, root tips and the like. Progeny, variants and mutants of regenerated plants are also included within the scope of the disclosure, provided that they comprise the introduced polynucleotides described herein.
A "plant cell" refers to a structural and physiological unit of a plant. The plant cell may be in the form of a protoplast without a cell wall, an isolated single cell or a cultured cell, or as a part of higher organized unit such as but not limited to, plant tissue, a plant organ, or a whole plant. The term "plant material" refers to any solid, liquid or gaseous composition, or a combination thereof, obtainable from a plant, including biomass, leaves, stems, roots, flowers or flower parts, fruits, pollen, egg cells, zygotes, seeds, cuttings, secretions, extracts, cell or tissue cultures, or any other parts or products of a plant. In one embodiment, the plant material comprises or consists of biomass, stem, seed or leaves. In another embodiment, the plant material comprises or consists of leaves.
The term "variety" refers to a population of plants that share constant characteristics which separate them from other plants of the same species. While possessing one or more distinctive traits, a variety is further characterized by a very small overall variation between individuals within that variety. A variety is often sold commercially.
A "type" of tobacco is defined by origin and curing method. Flue-cured tobacco, which accounts for 40% of global production, is also known as "Bright" and "Virginia" tobacco. It is used almost entirely in cigarette blends. Some of the heavier leaves may be used in mixtures for pipe smoking. Some English cigarettes are 100% flue-cured. Flue-cured leaf is characterized by a high sugar: nitrogen ratio. This ratio is enhanced by the picking of the leaf in an advanced stage of ripeness, and by the unique curing process which allows certain chemical changes to occur in the leaf. Cured leaves vary from lemon to orange to mahogany in colour.
Burley is a light air-cured type derived from the White Burley which arose as a mutant on a farm in Ohio in 1864. Burley is used primarily in cigarette blends. Some of the heavier leaf is used in pipe blends and also for chewing.
Cured burley leaf is characterized by low sugar content and a very low sugar to nitrogen ratio (high nicotine). This is enhanced by high Nitrogen fertilizer, harvesting at an early stage of senescence, and the air curing process which allows oxidation of any sugars which may have occurred.
Maryland is another light air-cured type. It is used to some extent in American blended cigarettes and to a greater extent in certain Swiss cigarette blends.
Maryland tobacco is extremely fluffy, has good burning properties, low nicotine, and neutral aroma.
Dark air-cured tobacco encompasses a number of types used mainly for chewing, snuff, cigar, and pipe blends. Most of the world production is confined to the tropics.
Oriental tobacco gives a mild smoke with very characteristic aroma. Resins, waxes and gum exuded by glandular hairs (trichomes) furnish the aroma. Nicotine is low, averaging around 1.0%.
Dark-fired tobacco is used in the production of snuff, chewing tobacco, and pipe blends. Dark-fired leaves are subjected to smoke from smoldering wood during the early stage of curing. The type of wood used is very important in determining taste and grown. Cured leaves are very dark in color and are long and heavy bodied.
The term "modulating" may refer to reducing, inhibiting, increasing or otherwise affecting the expression or activity of a polypeptide. The term may also refer to reducing, inhibiting, increasing or otherwise affecting the activity of a gene encoding a polypeptide which can include, but is not limited to, modulating transcriptional activity.
The term "reduce" or "reduced" as used herein, refers to a reduction of from about 10% to about 99%, or a reduction of at least 10%, at least 20%, at least 25%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, or at least 100% or more of a quantity or an activity, such as but not limited to polypeptide activity, transcriptional activity and protein expression. The term "inhibit" or "inhibited" as used herein, refers to a reduction of from about 98% to about 100%, or a reduction of at least 98%, at least 99%, but particularly of 100%, of a quantity or an activity, such as but not limited to polypeptide activity, transcriptional activity and protein expression.
The term "increase" or "increased" as used herein, refers to an increase of from about 5% to about 99%, or an increase of at least 5%, at least 10%, at least 20%, at least 25%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, at least 95%, at least 98%, at least 99%, at least 100%, at least 500% or at least 1000% or more of a quantity or an activity, such as but not limited to polypeptide activity, transcriptional activity and protein expression.
The term "control" in the context of a control plant means a plant or plant cell in which the expression or activity of an enzyme has not been modified (for example, increased or reduced) and so it can provide a comparison with a plant in which the expression or activity of the enzyme has been modified. The control plant may comprise an empty vector. The control plant or plant cell may correspond to a wild-type plant or wild-type plant cell. For example, the control plant or plant cell can be the same genotype as the starting material for the genetic alteration that resulted in the subject plant. In all such cases, the subject plant and the control plant are cultured and harvested using the same protocols for comparative purposes. Changes in levels, ratios, activity, or distribution of the genes or polypeptides described herein, or changes in tobacco plant phenotype, particularly reduced production of proteases, can be measured by comparing a subject plant to the control plant, where the subject plant and the control plant have been cultured, harvested and cured using the same protocols. The control plant can provide a reference point for measuring changes in phenotype of the subject plant. Changes in phenotype can be measured at any time in a plant, including during plant development, senescence, or preferably after curing. Changes in phenotype can be measured in plants grown under any conditions, including plants grown in a growth chamber, in a greenhouse, or in a field. Changes in phenotype can be measured by determining the expression or activity of proteases identified herein in SEQ ID Nos: 81-160.
DETAILED DESCRIPTION
In one embodiment, there is provided an isolated polynucleotide comprising, consisting or consisting essentially of a polynucleotide sequence having at least 95% sequence identity to any of the sequences described herein, including any of the polynucleotides shown in the sequence listing. Suitably, the isolated polynucleotide comprises, consists or consists essentially of a sequence having at least 95%, 96%, 97%, 98%, 99% or 100% sequence identity thereto.
Suitably, the polynucleotide(s) described herein encode a protein with protease activity that is at least about 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, 100% or more of the activity of the protein set forth in SEQ ID NOs: 81-160.
A polynucleotide as described herein can include a polymer of nucleotides, which may be unmodified or modified deoxyribonucleic acid (DNA) or ribonucleic acid (RNA). Accordingly, a polynucleotide can be, without limitation, a genomic DNA, complementary DNA (cDNA), mRNA, or antisense RNA or a fragment(s) thereof. Moreover, a polynucleotide can be single-stranded or double-stranded DNA, DNA that is a mixture of single-stranded and double-stranded regions, a hybrid molecule comprising DNA and RNA, or a hybrid molecule with a mixture of single-stranded and double-stranded regions or a fragment(s) thereof. In addition, the polynucleotide can be composed of triple-stranded regions comprising DNA, RNA, or both or a fragment(s) thereof. A polynucleotide can contain one or more modified bases, such as phosphothioates, and can be a peptide nucleic acid. Generally, polynucleotides can be assembled from isolated or cloned fragments of cDNA, genomic DNA, oligonucleotides, or individual nucleotides, or a combination of the foregoing. Although the polynucleotide sequences described herein are shown as DNA sequences, the sequences include their corresponding RNA sequences, and their complementary (for example, completely complementary) DNA or RNA sequences, including the reverse complements thereof.
A polynucleotide as described herein will generally contain phosphodiester bonds, although in some cases, polynucleotide analogues are included that may have alternate backbones, comprising, for example, phosphoramidate, phosphorothioate, phosphorodithioate, or O-methylphosphoroamidite linkages; and peptide polynucleotide backbones and linkages. Other analogue polynucleotides include those with positive backbones; non-ionic backbones, and non-ribose backbones. Modifications of the ribose-phosphate backbone may be done for a variety of reasons, for example, to increase the stability and half-life of such molecules in physiological environments or as probes on a biochip. Mixtures of naturally occurring polynucleotides and analogues can be made; alternatively, mixtures of different polynucleotide analogues, and mixtures of naturally occurring polynucleotides and analogues may be made.
A variety of polynucleotide analogues are known, including, for example, phosphoramidate, phosphorothioate, phosphorodithioate, O-methylphosphoroamidite linkages and peptide polynucleotide backbones and linkages. Other analogue polynucleotides include those with positive backbones, non-ionic backbones and non-ribose backbones. Polynucleotides containing one or more carbocyclic sugars are also included.
Other analogues include peptide polynucleotides, which are peptide polynucleotide analogues. These backbones are substantially non-ionic under neutral conditions, in contrast to the highly charged phosphodiester backbone of naturally occurring polynucleotides. This may result in advantages. First, the peptide polynucleotide backbone may exhibit improved hybridization kinetics. Peptide polynucleotides have larger changes in the melting temperature for mismatched versus perfectly matched base pairs. DNA and RNA typically exhibit a 2-4 °C drop in melting temperature for an internal mismatch. With the non-ionic peptide polynucleotide backbone, the drop is closer to 7-9 °C. Similarly, due to their non-ionic nature, hybridization of the bases attached to these backbones is relatively insensitive to salt concentration. In addition, peptide polynucleotides may not be degraded, or may be degraded to a lesser extent, by cellular enzymes, and thus may be more stable.
Among the uses of the disclosed polynucleotides, and fragments thereof, is the use of fragments as probes in nucleic acid hybridisation assays or primers for use in nucleic acid amplification assays. Such fragments generally comprise at least about 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more contiguous nucleotides of a DNA sequence. In other embodiments, a DNA fragment comprises at least about 10, 15, 20, 30, 40, 50 or 60 or more contiguous nucleotides of a DNA sequence. Thus, in one aspect, there is also provided a method for detecting a polynucleotide encoding a protein with nicotine N-demethylase activity member or encoding a nicotine N-demethylase enzyme comprising the use of the probes or primers or both.
The basic parameters affecting the choice of hybridization conditions and guidance for devising suitable conditions are described by Sambrook, J., E. F. Fritsch, and T. Maniatis (1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.). Using knowledge of the genetic code in combination with the amino acid sequences described herein, sets of degenerate oligonucleotides can be prepared. Such oligonucleotides are useful as primers, for example, in polymerase chain reactions (PCR), whereby DNA fragments are isolated and amplified. In certain embodiments, degenerate primers can be used as probes for genetic libraries. Such libraries would include but are not limited to cDNA libraries, genomic libraries, and even electronic expressed sequence tag or DNA libraries. Homologous sequences identified by this method would then be used as probes to identify homologues of the sequences identified herein.
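By way of illustration only, the size of a degenerate oligonucleotide set can be estimated from the standard genetic code. The following Python sketch counts how many distinct nucleotide sequences could encode a short peptide by multiplying the number of codons available for each residue. The function name and the example peptides are hypothetical and do not correspond to any sequence in the sequence listing.

```python
# Number of codons per amino acid in the standard genetic code
# (one-letter codes; the 61 sense codons in total).
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide: str) -> int:
    """Count the distinct coding sequences that could encode `peptide`."""
    total = 1
    for residue in peptide.upper():
        if residue not in CODON_COUNTS:
            raise ValueError(f"unknown amino acid: {residue!r}")
        total *= CODON_COUNTS[residue]
    return total


if __name__ == "__main__":
    # Met and Trp have one codon each; Lys has two.
    print(degeneracy("MWK"))
    # Leu, Ser and Arg have six codons each.
    print(degeneracy("LSR"))
```

Peptide stretches rich in single-codon or two-codon residues give the smallest primer sets, which is one common consideration when choosing where to place a degenerate primer.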
Also of potential use are polynucleotides and oligonucleotides (for example, primers or probes) that hybridize under reduced stringency conditions, typically moderately stringent conditions, and commonly highly stringent conditions to the polynucleotide(s) as described herein. The basic parameters affecting the choice of hybridization conditions and guidance for devising suitable conditions are set forth by Sambrook, J., E. F. Fritsch, and T. Maniatis (1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.) and can be readily determined by those having ordinary skill in the art based on, for example, the length or base composition of the polynucleotide. One way of achieving moderately stringent conditions involves the use of a prewashing solution containing 5x Standard Sodium Citrate, 0.5% Sodium Dodecyl Sulphate, 1.0 mM Ethylenediaminetetraacetic acid (pH 8.0), hybridization buffer of about 50% formamide, 6x Standard Sodium Citrate, and a hybridization temperature of about 55 °C (or other similar hybridization solutions, such as one containing about 50% formamide, with a hybridization temperature of about 42 °C), and washing conditions of about 60 °C, in 0.5x Standard Sodium Citrate, 0.1% Sodium Dodecyl Sulphate. Generally, highly stringent conditions are defined as hybridization conditions as above, but with washing at approximately 68 °C, 0.2x Standard Sodium Citrate, 0.1% Sodium Dodecyl Sulphate. SSPE (1x SSPE is 0.15 M sodium chloride, 10 mM sodium phosphate, and 1.25 mM Ethylenediaminetetraacetic acid, pH 7.4) can be substituted for Standard Sodium Citrate (1x Standard Sodium Citrate is 0.15 M sodium chloride and 15 mM sodium citrate) in the hybridization and wash buffers; washes are performed for 15 minutes after hybridization is complete.
It should be understood that the wash temperature and wash salt concentration can be adjusted as necessary to achieve a desired degree of stringency by applying the basic principles that govern hybridization reactions and duplex stability, as known to those skilled in the art and described further below (see, for example, Sambrook, J., E. F. Fritsch, and T. Maniatis (1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.)). When hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed to be that of the hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, the hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region or regions of optimal sequence complementarity. The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should be 5 to 10 °C less than the melting temperature of the hybrid, where melting temperature is determined according to the following equations. For hybrids less than 18 base pairs in length, melting temperature (°C) = 2(number of A+T bases) + 4(number of G+C bases). For hybrids above 18 base pairs in length, melting temperature (°C) = 81.5 + 16.6(log10 [Na+]) + 0.41(% G+C) - (600/N), where N is the number of bases in the hybrid, and [Na+] is the concentration of sodium ions in the hybridization buffer ([Na+] for 1x Standard Sodium Citrate = 0.165 M). Typically, each such hybridizing polynucleotide has a length that is at least 25% (commonly at least 50%, 60%, or 70%, and most commonly at least 80%) of the length of a polynucleotide to which it hybridizes, and has at least 60% sequence identity (for example, at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or 100%) with a polynucleotide to which it hybridizes.
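The two melting temperature equations above can be applied as in the following Python sketch. The function names are hypothetical. The default sodium concentration of 0.165 M is the value given above for 1x Standard Sodium Citrate. Because the text does not state which equation applies to a hybrid of exactly 18 base pairs, the sketch assumes the second equation is used in that case.

```python
import math


def melting_temperature(hybrid: str, na_molar: float = 0.165) -> float:
    """Melting temperature (degrees C) of a DNA hybrid.

    Assumption: hybrids of exactly 18 bases use the long-hybrid equation.
    """
    seq = hybrid.upper()
    n = len(seq)
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if n == 0 or at + gc != n:
        raise ValueError("hybrid must be a non-empty string of A, C, G, T")

    if n < 18:
        # Short hybrids: 2 degrees per A/T base, 4 degrees per G/C base.
        return 2.0 * at + 4.0 * gc

    percent_gc = 100.0 * gc / n
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * percent_gc - 600.0 / n


def hybridization_window(hybrid: str, na_molar: float = 0.165) -> tuple[float, float]:
    """Temperature range 5 to 10 degrees C below the melting temperature."""
    tm = melting_temperature(hybrid, na_molar)
    return (tm - 10.0, tm - 5.0)


if __name__ == "__main__":
    print(melting_temperature("ATGCATGCATGC"))               # 12-mer, short equation
    print(round(melting_temperature("ATGC" * 5), 2))         # 20-mer, long equation
    print(hybridization_window("ATGCATGCATGC"))
```

The hybridization window is meaningful only for hybrids anticipated to be less than 50 base pairs in length, as stated above; the sketch does not enforce that limit.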
As will be understood by the person skilled in the art, a linear DNA has two possible orientations: the 5'-to-3' direction and the 3'-to-5' direction. For example, if a reference sequence is positioned in the 5'-to-3' direction, and if a second sequence is positioned in the 5'-to-3' direction within the same polynucleotide molecule/strand, then the reference sequence and the second sequence are orientated in the same direction, or have the same orientation. Typically, a promoter sequence and a gene of interest under the regulation of the given promoter are positioned in the same orientation. However, with respect to the reference sequence positioned in the 5'-to-3' direction, if a second sequence is positioned in the 3'-to-5' direction within the same polynucleotide molecule/strand, then the reference sequence and the second sequence are orientated in anti-sense direction, or have anti-sense orientation. Two sequences having anti-sense orientations with respect to each other can be alternatively described as having the same orientation, if the reference sequence (5'-to-3' direction) and the reverse complementary sequence of the reference sequence (reference sequence positioned in the 5'-to-3') are positioned within the same polynucleotide molecule/strand. The sequences set forth herein are shown in the 5'-to-3' direction.
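The relationship between a sequence and its reverse complement can be illustrated with a short Python sketch. The function names and the example sequences are hypothetical, and the orientation check is a simplification that looks only for an exact match of the reference sequence, or of its reverse complement, within a strand written in the 5'-to-3' direction.

```python
# Translation table for the Watson-Crick complement of each DNA base.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'-to-3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def orientation(reference: str, strand: str) -> str:
    """Classify how `reference` occurs within `strand` (both written 5'-to-3')."""
    if reference in strand:
        return "same"
    if reverse_complement(reference) in strand:
        return "anti-sense"
    return "absent"


if __name__ == "__main__":
    print(reverse_complement("ATGC"))          # GCAT
    print(orientation("ATGCC", "TTGGCATAA"))   # anti-sense
```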
Recombinant constructs provided herein can be used to transform plants or plant cells in order to modulate protein expression and/or activity levels. A recombinant polynucleotide construct can comprise a polynucleotide encoding one or more polynucleotides as described herein, operably linked to a regulatory region suitable for expressing the polypeptide. Thus, a polynucleotide can comprise a coding sequence that encodes the polypeptide as described herein. Plants or plant cells in which protein expression and/or activity levels are modulated can include mutant, non-naturally occurring, transgenic, man-made or genetically engineered plants or plant cells. Suitably, the transgenic plant or plant cell comprises a genome that has been altered by the stable integration of recombinant DNA. Recombinant DNA includes DNA which has been genetically engineered and constructed outside of a cell and includes DNA containing naturally occurring DNA or cDNA or synthetic DNA. A transgenic plant can include a plant regenerated from an originally-transformed plant cell and progeny transgenic plants from later generations or crosses of a transformed plant. Suitably, the transgenic modification alters the expression or activity of the polynucleotide or the polypeptide described herein as compared to a control plant.
The polypeptide encoded by a recombinant polynucleotide can be a native polypeptide, or can be heterologous to the cell. In some cases, the recombinant construct contains a polynucleotide that modulates expression, operably linked to a regulatory region. Examples of suitable regulatory regions are described herein.
Vectors containing recombinant polynucleotide constructs such as those described herein are also provided. Suitable vector backbones include, for example, those routinely used in the art such as plasmids, viruses, artificial chromosomes, bacterial artificial chromosomes, yeast artificial chromosomes, or bacteriophage artificial chromosomes. Suitable expression vectors include, without limitation, plasmids and viral vectors derived from, for example, bacteriophage, baculoviruses, and retroviruses. Numerous vectors and expression systems are commercially available. The vectors can include, for example, origins of replication, scaffold attachment regions or markers. A marker gene can confer a selectable phenotype on a plant cell. For example, a marker can confer biocide resistance, such as resistance to an antibiotic (for example, kanamycin, G418, bleomycin, or hygromycin), or an herbicide (for example, glyphosate, chlorsulfuron or phosphinothricin). In addition, an expression vector can include a tag sequence designed to facilitate manipulation or detection (for example, purification or localization) of the expressed polypeptide. Tag sequences, such as luciferase, beta-glucuronidase, green fluorescent protein, glutathione S-transferase, polyhistidine, c-myc or hemagglutinin sequences typically are expressed as a fusion with the encoded polypeptide. Such tags can be inserted anywhere within the polypeptide, including at either the carboxyl or amino terminus.
A plant or plant cell can be transformed by having the recombinant polynucleotide integrated into its genome to become stably transformed. The plant or plant cell described herein can be stably transformed. Stably transformed cells typically retain the introduced polynucleotide with each cell division. A plant or plant cell can be transiently transformed such that the recombinant polynucleotide is not integrated into its genome. Transiently transformed cells typically lose all or some portion of the introduced recombinant polynucleotide with each cell division such that the introduced recombinant polynucleotide cannot be detected in daughter cells after a sufficient number of cell divisions.
A number of methods are available in the art for transforming a plant cell which are all encompassed herein, including biolistics, gene gun techniques, Agrobacterium-mediated transformation, viral vector-mediated transformation and electroporation. The Agrobacterium system for integration of foreign DNA into plant chromosomes has been extensively studied, modified, and exploited for plant genetic engineering. Naked recombinant DNA molecules comprising DNA sequences corresponding to the subject purified tobacco protein operably linked, in the sense or antisense orientation, to regulatory sequences are joined to appropriate T-DNA sequences by conventional methods. These are introduced into tobacco protoplasts by polyethylene glycol techniques or by electroporation techniques, both of which are standard. Alternatively, such vectors comprising recombinant DNA molecules encoding the subject purified tobacco protein are introduced into live Agrobacterium cells, which then transfer the DNA into the tobacco plant cells. Transformation by naked DNA without accompanying T-DNA vector sequences can be accomplished via fusion of tobacco protoplasts with DNA-containing liposomes or via electroporation. Naked DNA unaccompanied by T-DNA vector sequences can also be used to transform tobacco cells via inert, high velocity microprojectiles.
If a cell or cultured tissue is used as the recipient tissue for transformation, plants can be regenerated from transformed cultures if desired, by techniques known to those skilled in the art.
The choice of regulatory regions to be included in a recombinant construct depends upon several factors, including, but not limited to, efficiency, selectability, inducibility, desired expression level, and cell- or tissue-preferential expression. It is a routine matter for one of skill in the art to modulate the expression of a coding sequence by appropriately selecting and positioning regulatory regions relative to the coding sequence. Transcription of a polynucleotide can be modulated in a similar manner. Some suitable regulatory regions initiate transcription only, or predominantly, in certain cell types. Methods for identifying and characterizing regulatory regions in plant genomic DNA are known in the art.
Suitable promoters include tissue-specific promoters recognized by tissue-specific factors present in different tissues or cell types (for example, root-specific promoters, shoot-specific promoters, xylem-specific promoters), or present during different developmental stages, or present in response to different environmental conditions. Suitable promoters include constitutive promoters that can be activated in most cell types without requiring specific inducers. Examples of suitable promoters for controlling RNAi polypeptide production include the cauliflower mosaic virus 35S (CaMV/35S), SSU, OCS, Iib4, usp, STLS1, B33, nos or ubiquitin- or phaseolin-promoters. Persons skilled in the art are capable of generating multiple variations of recombinant promoters.
Tissue-specific promoters are transcriptional control elements that are only active in particular cells or tissues at specific times during plant development, such as in vegetative tissues or reproductive tissues. Tissue-specific expression can be advantageous, for example, when the expression of polynucleotides in certain tissues is preferred. Examples of tissue-specific promoters under developmental control include promoters that can initiate transcription only (or primarily only) in certain tissues, such as vegetative tissues, for example, roots or leaves, or reproductive tissues, such as fruit, ovules, seeds, pollen, pistils, flowers, or any embryonic tissue. Reproductive tissue-specific promoters may be, for example, anther-specific, ovule-specific, embryo-specific, endosperm-specific, integument-specific, seed and seed coat-specific, pollen-specific, petal-specific, sepal-specific, or combinations thereof.
Suitable leaf-specific promoters include pyruvate, orthophosphate dikinase (PPDK) promoter from a C4 plant (maize), cab-m1 Ca+2 promoter from maize, the Arabidopsis thaliana myb-related gene promoter (Atmyb5), the ribulose bisphosphate carboxylase (RBCS) promoters (for example, the tomato RBCS1, RBCS2 and RBCS3A genes expressed in leaves and light-grown seedlings, RBCS1 and RBCS2 expressed in developing tomato fruits or ribulose bisphosphate carboxylase promoter expressed almost exclusively in mesophyll cells in leaf blades and leaf sheaths at high levels).
Suitable senescence-specific promoters include a tomato promoter active during fruit ripening, senescence and abscission of leaves, a maize promoter of a gene encoding a cysteine protease, the promoter of 82E4 and the promoter of SAG genes. Suitable anther-specific promoters can be used. Suitable root-preferred promoters known to persons skilled in the art may be selected. Suitable seed-preferred promoters include both seed-specific promoters (those promoters active during seed development such as promoters of seed storage proteins) and seed-germinating promoters (those promoters active during seed germination). Such seed-preferred promoters include, but are not limited to, Cim1 (cytokinin-induced message); cZ19B1 (maize 19 kDa zein); milps (myo-inositol-1-phosphate synthase); mZE40-2, also known as Zm-40; nuclc; and celA (cellulose synthase). Gamma-zein is an endosperm-specific promoter. Glob-1 is an embryo-specific promoter. For dicots, seed-specific promoters include, but are not limited to, bean beta-phaseolin, napin, β-conglycinin, soybean lectin, cruciferin, and the like. For monocots, seed-specific promoters include, but are not limited to, a maize 15 kDa zein promoter, a 22 kDa zein promoter, a 27 kDa zein promoter, a g-zein promoter, a 27 kDa gamma-zein promoter (such as gzw64A promoter, see Genbank Accession number S78780), a waxy promoter, a shrunken 1 promoter, a shrunken 2 promoter, a globulin 1 promoter (see Genbank Accession number L22344), an Itp2 promoter, cim1 promoter, maize end1 and end2 promoters, nud promoter, Zm40 promoter, eep1 and eep2; led, thioredoxin H promoter; mlip15 promoter, PCNA2 promoter; and the shrunken-2 promoter.
Examples of inducible promoters include promoters responsive to pathogen attack, anaerobic conditions, elevated temperature, light, drought, cold temperature, or high salt concentration. Pathogen-inducible promoters include those from pathogenesis-related proteins (PR proteins), which are induced following infection by a pathogen (for example, PR proteins, SAR proteins, beta-1,3-glucanase, chitinase). In addition to plant promoters, other suitable promoters may be derived from bacterial origin, for example, the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from Ti plasmids, or may be derived from viral promoters (for example, 35S and 19S RNA promoters of cauliflower mosaic virus (CaMV), constitutive promoters of tobacco mosaic virus, cauliflower mosaic virus (CaMV) 19S and 35S promoters, or figwort mosaic virus 35S promoter).
Preferred promoters include the control elements provided herein, as part of SEQ ID Nos: 1-80, which demonstrate desirable expression during curing procedures in tobacco leaf.
In another aspect, there is provided an isolated polypeptide comprising, consisting or consisting essentially of a polypeptide sequence having at least 95% sequence identity to any of the polypeptide sequences described herein, including any of the polypeptides shown in the sequence listing. Suitably, the isolated polypeptide comprises, consists or consists essentially of a sequence having at least 95%, 96%, 97%, 98%, 99%, 99.1%, 99.2%, 99.3%, 99.4%, 99.5%, 99.6%, 99.7%, 99.8%, 99.9% or 100% sequence identity thereto.
The polypeptide can include sequences comprising a sufficient or substantial degree of identity or similarity to SEQ ID NOs: 81-160 to function as proteases. Fragments of the polypeptide(s) typically retain some or all of the activity of the full length sequence.
As discussed herein, the polypeptides also include mutants produced by introducing any type of alterations (for example, insertions, deletions, or substitutions of amino acids; changes in glycosylation states; changes that affect refolding or isomerizations, three-dimensional structures, or self-association states), which can be deliberately engineered or isolated naturally provided that they still have some or all of their function or activity as a protease. Suitably, the function or activity as a protease is modulated, increased or reduced. Polypeptides include variants produced by introducing any type of alterations (for example, insertions, deletions, or substitutions of amino acids; changes in glycosylation states; changes that affect refolding or isomerizations, three-dimensional structures, or self-association states), which can be deliberately engineered or isolated naturally. The variant may have alterations which produce a silent change and result in a functionally equivalent protein. Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity and the amphipathic nature of the residues as long as the secondary binding activity of the substance is retained. For example, negatively charged amino acids include aspartic acid and glutamic acid; positively charged amino acids include lysine and arginine; and amino acids with uncharged polar head groups having similar hydrophilicity values include leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, serine, threonine, phenylalanine, and tyrosine. Conservative substitutions may be made, for example according to the Table below. Amino acids in the same block in the second column and preferably in the same line in the third column may be substituted for each other:
The polypeptide may be a mature protein or an immature protein or a protein derived from an immature protein. Polypeptides may be in linear form or cyclized using known methods. Polypeptides typically comprise at least 10, at least 20, at least 30, or at least 40 contiguous amino acids.
A tobacco plant or plant cell comprising a mutation in a gene encoding a protease as described herein is disclosed, wherein said mutation results in modulated expression or modulated function of said protease. The expression or function of the protease(s) may be enhanced. Aside from one or more mutations in said protease, the mutant plants or plant cells can have one or more further mutations in one or more other genes or polypeptides. In certain embodiments, aside from the one or more mutations in a protease gene, the mutants can have one or more further mutations in one or more other genes or polypeptides - such as one or more other protease genes or polypeptides as described in the Sequence Listing. Suitably, a protease is expressed in the leaves of the mutant plant during the curing procedure.
There is also provided a method for modulating the level of a protease in a (cured) tobacco plant or in (cured) tobacco plant material, said method comprising introducing into the genome of said plant one or more mutations that modulate expression of at least one protease gene, wherein said at least one protease gene is selected from SEQ ID NOs: 1-80. There is also provided a method for identifying a tobacco plant with increased levels of protease, said method comprising screening a nucleic acid sample from a tobacco plant of interest for the presence of one or more mutations in SEQ ID NOs: 1-80, and optionally correlating the identified mutation(s) with mutation(s) that are known to modulate levels of protease. There is also disclosed a tobacco plant or plant cell that is heterozygous or homozygous for mutations in a gene encoding a protease, wherein said mutation results in modulated (enhanced or reduced) expression or function of said protease.
A number of approaches can be used to combine mutations in one plant including sexual crossing. A plant having one or more favourable heterozygous or homozygous mutations in a protease gene that enhances or reduces protease expression or activity can be crossed with a plant having one or more favourable heterozygous or homozygous mutations in one or more other protease genes that enhance or reduce protease activity. In one embodiment, crosses are made in order to introduce one or more favourable heterozygous or homozygous mutations within a protease gene within the same plant.
The activity of one or more protease polypeptides in a tobacco plant is reduced or enhanced according to the present disclosure if the protease activity is statistically lower or higher than the protease activity of the same protease(s) in a tobacco plant that has not been modified to modulate the activity of that protease polypeptide and which has been cultured, harvested and cured using the same protocols.
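The comparison described above does not prescribe a particular statistical test. As a minimal sketch, assuming replicate activity measurements are available for modified and control plants grown, harvested and cured under the same protocol, a two-sided permutation test on the difference of group means could be used. The function name, the number of resamples and the activity values below are hypothetical and serve only as an illustration.

```python
import random
from statistics import mean


def permutation_p_value(modified, control, n_resamples=5000, seed=0):
    """Two-sided permutation test on the absolute difference of group means."""
    rng = random.Random(seed)
    observed = abs(mean(modified) - mean(control))
    pooled = list(modified) + list(control)
    n_mod = len(modified)
    hits = 0
    for _ in range(n_resamples):
        rng.shuffle(pooled)  # random relabelling of the measurements
        diff = abs(mean(pooled[:n_mod]) - mean(pooled[n_mod:]))
        if diff >= observed:
            hits += 1
    # add-one correction so the estimate is never exactly zero
    return (hits + 1) / (n_resamples + 1)


# Hypothetical protease activity readings (arbitrary units), not measured data.
modified_activity = [4.1, 3.8, 4.4, 4.0, 3.9, 4.3]
control_activity = [6.2, 5.9, 6.5, 6.1, 6.3, 6.0]

p = permutation_p_value(modified_activity, control_activity)
print(f"p = {p:.4f}")
```

A small p-value together with a lower mean in the modified group would be consistent with reduced activity in the sense used above; the significance threshold is a choice left to the practitioner.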
In some embodiments, the mutation(s) is introduced into a tobacco plant or plant cell using a mutagenesis approach, and the introduced mutation is identified or selected using methods known to those of skill in the art - such as Southern blot analysis, DNA sequencing, PCR analysis, or phenotypic analysis. Mutations that impact gene expression or that interfere with the function of the encoded protein can be determined using methods that are well known in the art. Insertional mutations in gene exons usually result in null-mutants. Mutations in conserved residues can be particularly effective in inhibiting the metabolic function of the encoded protein.
Methods for obtaining mutant polynucleotides and polypeptides are also disclosed. Any plant of interest, including a plant cell or plant material, can be genetically modified by various methods known to induce mutagenesis, including site-directed mutagenesis, oligonucleotide-directed mutagenesis, chemically-induced mutagenesis, irradiation-induced mutagenesis, mutagenesis utilizing modified bases, mutagenesis utilizing gapped duplex DNA, double-strand break mutagenesis, mutagenesis utilizing repair-deficient host strains, mutagenesis by total gene synthesis, DNA shuffling and other equivalent methods.
Fragments of protease polynucleotides and polypeptides encoded thereby are also disclosed. Fragments of a polynucleotide may encode protein fragments that retain the biological activity of the native protein and hence are involved in the metabolic conversion of nicotine to nornicotine. Alternatively, fragments of a polynucleotide that are useful as hybridization probes or PCR primers generally do not encode fragment proteins retaining biological activity. Furthermore, fragments of the disclosed nucleotide sequences include those that can be assembled within recombinant constructs as discussed herein. Fragments of a polynucleotide sequence may range from at least about 25 nucleotides, about 50 nucleotides, about 75 nucleotides, about 100 nucleotides, about 150 nucleotides, about 200 nucleotides, about 250 nucleotides, about 300 nucleotides, about 400 nucleotides, about 500 nucleotides, about 600 nucleotides, about 700 nucleotides, about 800 nucleotides, about 900 nucleotides, about 1000 nucleotides, about 1100 nucleotides, about 1200 nucleotides, about 1300 nucleotides or about 1400 nucleotides and up to the full-length polynucleotide encoding the polypeptides described herein. Fragments of a polypeptide sequence may range from at least about 25 amino acids, about 50 amino acids, about 75 amino acids, about 100 amino acids, about 150 amino acids, about 200 amino acids, about 250 amino acids, about 300 amino acids, about 400 amino acids, about 500 amino acids, and up to the full-length polypeptide described herein.
Mutant polypeptide variants can be used to create mutant, non-naturally occurring or transgenic plants (for example, mutant, non-naturally occurring, transgenic, man-made or genetically engineered plants) or plant cells comprising one or more mutant polypeptide variants. Suitably, mutant polypeptide variants retain the activity of the unmutated polypeptide. The activity of the mutant polypeptide variant may be higher, lower or about the same as the unmutated polypeptide.
Mutations in the nucleotide sequences and polypeptides described herein can include man-made mutations or synthetic mutations or genetically engineered mutations. Mutations in the nucleotide sequences and polypeptides described herein can be mutations that are obtained or obtainable via a process which includes an in vitro or an in vivo manipulation step. Mutations in the nucleotide sequences and polypeptides described herein can be mutations that are obtained or obtainable via a process which includes intervention by man. By way of example, the process may include mutagenesis using exogenously added chemicals - such as mutagenic, teratogenic, or carcinogenic organic compounds, for example ethyl methanesulfonate (EMS), that produce random mutations in genetic material. By way of further example, the process may include one or more genetic engineering steps - such as one or more of the genetic engineering steps that are described herein or combinations thereof. By way of further example, the process may include one or more plant crossing steps.
A polypeptide may be prepared by culturing transformed or recombinant host cells under culture conditions suitable to express a polypeptide. The resulting expressed polypeptide may then be purified from such culture using known purification processes. The purification of the polypeptide may include an affinity column containing agents which will bind to the polypeptide; one or more column steps over such affinity resins; one or more steps involving hydrophobic interaction chromatography; or immunoaffinity chromatography. Alternatively, the polypeptide may also be expressed in a form that will facilitate purification. For example, it may be expressed as a fusion polypeptide, such as those of maltose binding polypeptide, glutathione-S-transferase, his-tag or thioredoxin. Kits for expression and purification of fusion polypeptides are commercially available. The polypeptide may be tagged with an epitope and subsequently purified by using a specific antibody directed to such epitope. One or more liquid chromatography steps - such as reverse-phase high performance liquid chromatography - can be employed to further purify the polypeptide. Some or all of the foregoing purification steps, in various combinations, can be employed to provide a substantially homogeneous recombinant polypeptide. The polypeptide thus purified may be substantially free of other polypeptides and is defined herein as a "substantially purified polypeptide"; such purified polypeptides include polypeptides, fragments, variants, and the like. Expression, isolation, and purification of the polypeptides and fragments can be accomplished by any suitable technique, including but not limited to the methods described herein.
It is also possible to utilise an affinity column such as a monoclonal antibody generated against polypeptides, to affinity-purify expressed polypeptides. These polypeptides can be removed from an affinity column using conventional techniques, for example, in a high salt elution buffer and then dialyzed into a lower salt buffer for use or by changing pH or other components depending on the affinity matrix utilized, or be competitively removed using the naturally occurring substrate of the affinity moiety.
Isolated or substantially purified polynucleotides or protein compositions are disclosed. An "isolated" or "purified" polynucleotide or protein, or biologically active portion thereof, is substantially or essentially free from components that normally accompany or interact with the polynucleotide or protein as found in its naturally occurring environment. Thus, an isolated or purified polynucleotide or protein is substantially free of other cellular material, or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. Optimally, an "isolated" polynucleotide is free of sequences (optimally protein encoding sequences) that naturally flank the polynucleotide (for example, sequences located at the 5' and 3' ends of the polynucleotide) in the genomic DNA of the organism from which the polynucleotide is derived. For example, in various embodiments, the isolated polynucleotide can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotide sequence that naturally flanks the polynucleotide in genomic DNA of the cell from which the polynucleotide is derived. A protein that is substantially free of cellular material includes preparations of protein having less than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of contaminating protein.
A polypeptide may also be produced by known conventional chemical synthesis. Methods for constructing the polypeptides or fragments thereof by synthetic means are known to those skilled in the art. The synthetically-constructed polypeptide sequences, by virtue of sharing primary, secondary or tertiary structural or conformational characteristics with native polypeptides may possess biological properties in common therewith, including biological activity.
The term 'non-naturally occurring' as used herein describes an entity (for example, a polynucleotide, a genetic mutation, a polypeptide, a plant, a plant cell and plant material) that is not formed by nature or that does not exist in nature. Such non-naturally occurring entities or artificial entities may be made, synthesized, initiated, modified, intervened, or manipulated by methods described herein or that are known in the art. Such non-naturally occurring entities or artificial entities may be made, synthesized, initiated, modified, intervened, or manipulated by man. Thus, by way of example, a non-naturally occurring plant, a non-naturally occurring plant cell or non-naturally occurring plant material may be made using traditional plant breeding techniques - such as backcrossing - or by genetic manipulation technologies - such as antisense RNA, interfering RNA, meganuclease and the like. By way of further example, a non-naturally occurring plant, a non-naturally occurring plant cell or non-naturally occurring plant material may be made by introgression of or by transferring one or more genetic mutations (for example one or more polymorphisms) from a first plant or plant cell into a second plant or plant cell (which may itself be naturally occurring), such that the resulting plant, plant cell or plant material or the progeny thereof comprises a genetic constitution (for example, a genome, a chromosome or a segment thereof) that is not formed by nature or that does not exist in nature. The resulting plant, plant cell or plant material is thus artificial or non-naturally occurring. Accordingly, an artificial or non-naturally occurring plant or plant cell may be made by modifying a genetic sequence in a first naturally occurring plant or plant cell, even if the resulting genetic sequence occurs naturally in a second plant or plant cell that comprises a different genetic background from the first plant or plant cell.
In certain embodiments, a mutation is not a naturally occurring mutation that exists naturally in a nucleotide sequence or a polypeptide - such as a gene or a protein.
Differences in genetic background can be detected by phenotypic differences or by molecular biology techniques known in the art - such as nucleic acid sequencing or detecting the presence or absence of genetic markers (for example, microsatellite markers).
Antibodies that are immunoreactive with the polypeptides described herein are also provided. The polypeptides, fragments, variants, fusion polypeptides, and the like, as set forth herein, can be employed as "immunogens" in producing antibodies immunoreactive therewith. Such antibodies may specifically bind to the polypeptide via the antigen-binding sites of the antibody. Specifically binding antibodies are those that will specifically recognize and bind with a polypeptide, homologues, and variants, but not with other molecules. In one embodiment, the antibodies are specific for polypeptides having an amino acid sequence as set forth herein and do not cross-react with other polypeptides. More specifically, the polypeptides, fragment, variants, fusion polypeptides, and the like contain antigenic determinants or epitopes that elicit the formation of antibodies. These antigenic determinants or epitopes can be either linear or conformational (discontinuous). Linear epitopes are composed of a single section of amino acids of the polypeptide, while conformational or discontinuous epitopes are composed of amino acids sections from different regions of the polypeptide chain that are brought into close proximity upon polypeptide folding. Epitopes can be identified by any of the methods known in the art. Additionally, epitopes from the polypeptides can be used as research reagents, in assays, and to purify specific binding antibodies from substances such as polyclonal sera or supernatants from cultured hybridomas. Such epitopes or variants thereof can be produced using techniques known in the art such as solid-phase synthesis, chemical or enzymatic cleavage of a polypeptide, or using recombinant DNA technology.
Both polyclonal and monoclonal antibodies to the polypeptides can be prepared by conventional techniques. Hybridoma cell lines that produce monoclonal antibodies specific for the polypeptides are also contemplated herein. Such hybridomas can be produced and identified by conventional techniques. For the production of antibodies, various host animals may be immunized by injection with a polypeptide, fragment, variant, or mutants thereof. Such host animals may include, but are not limited to, rabbits, mice, and rats, to name a few. Various adjuvants may be used to increase the immunological response. Depending on the host species, such adjuvants include, but are not limited to, Freund's (complete and incomplete), mineral gels such as aluminium hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol, and potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum. The monoclonal antibodies can be recovered by conventional techniques. Such monoclonal antibodies may be of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD, and any subclass thereof.
The antibodies can also be used in assays to detect the presence of the polypeptides or fragments, either in vitro or in vivo. The antibodies also can be employed in purifying polypeptides or fragments by immunoaffinity chromatography.
Other than mutagenesis, compositions that can modulate the expression or the activity of one or more of the proteases described herein include, but are not limited to, sequence-specific polynucleotides that can interfere with the transcription of one or more endogenous gene(s); sequence-specific polynucleotides that can interfere with the translation of RNA transcripts (for example, double-stranded RNAs, siRNAs, ribozymes); sequence-specific polypeptides that can interfere with the stability of one or more proteins; sequence-specific polynucleotides that can interfere with the enzymatic activity of one or more proteins or the binding activity of one or more proteins with respect to substrates or regulatory proteins; antibodies that exhibit specificity for one or more proteins; small molecule compounds that can interfere with the stability of one or more proteins or the enzymatic activity of one or more proteins or the binding activity of one or more proteins; zinc finger proteins that bind one or more polynucleotides; and meganucleases that have activity towards one or more polynucleotides. Gene editing technologies, genetic editing technologies and genome editing technologies are well known in the art.
One method of gene editing involves the use of transcription activator-like effector nucleases (TALENs) which induce double-strand breaks which cells can respond to with repair mechanisms. Non-homologous end joining reconnects DNA from either side of a double-strand break where there is very little or no sequence overlap for annealing. This repair mechanism induces errors in the genome via insertion or deletion, or chromosomal rearrangement. Any such errors may render the gene products coded at that location nonfunctional. Another method of gene editing involves the use of the bacterial CRISPR/Cas system. Bacteria and archaea exhibit chromosomal elements called clustered regularly interspaced short palindromic repeats (CRISPR) that are part of an adaptive immune system that protects against invading viral and plasmid DNA. In Type II CRISPR systems, CRISPR RNAs (crRNAs) function with trans-activating crRNA (tracrRNA) and CRISPR-associated (Cas) proteins to introduce double-stranded breaks in target DNA. Target cleavage by Cas9 requires base-pairing between the crRNA and tracrRNA as well as base pairing between the crRNA and the target DNA. Target recognition is facilitated by the presence of a short motif called a protospacer-adjacent motif (PAM) that conforms to the sequence NGG. This system can be harnessed for genome editing. Cas9 is normally programmed by a dual RNA consisting of the crRNA and tracrRNA. However, the core components of these RNAs can be combined into a single hybrid 'guide RNA' for Cas9 targeting. The use of a noncoding RNA guide to target DNA for site-specific cleavage promises to be significantly more straightforward than existing technologies - such as TALENs. Using the CRISPR/Cas strategy, retargeting the nuclease complex only requires introduction of a new RNA sequence and there is no need to reengineer the specificity of protein transcription factors.
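The PAM requirement described above lends itself to a simple sequence scan. The following Python sketch lists candidate target sites that are immediately followed by an NGG motif on either strand. The 20-nucleotide protospacer length, the function names and the example sequence are assumptions made for illustration; the sketch does not assess off-target matches or guide efficiency.

```python
import re


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_protospacers(seq: str, length: int = 20):
    """Yield (strand, start, protospacer, pam) for every site followed by NGG.

    `start` is 0-based and counted along the strand that was scanned, so
    minus-strand hits are given in reverse-complement coordinates.
    """
    seq = seq.upper()
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(rf"(?=([ACGT]{{{length}}})([ACGT]GG))")
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for match in pattern.finditer(strand_seq):
            yield strand, match.start(), match.group(1), match.group(2)


# Hypothetical sequence used only to exercise the function.
example = "A" * 20 + "TGG"
for hit in find_protospacers(example):
    print(hit)
```

Each reported protospacer would correspond to the target-matching portion of a guide RNA, subject to the assumptions stated above.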
Antisense technology is another well-known method that can be used to modulate the expression of a polypeptide. A polynucleotide of the gene to be repressed is cloned and operably linked to a regulatory region and a transcription termination sequence so that the antisense strand of RNA is transcribed. The recombinant construct is then transformed into a plant cell and the antisense strand of RNA is produced. The polynucleotide need not be the entire sequence of the gene to be repressed, but typically will be substantially complementary to at least a portion of the sense strand of the gene to be repressed.
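As a minimal illustration of the complementarity described above, the following Python sketch derives the antisense RNA that would pair with the transcript of a sense-strand DNA fragment. The function name and the six-nucleotide example are hypothetical; a fragment used in practice would be considerably longer.

```python
def antisense_rna(sense_dna: str) -> str:
    """Return the antisense RNA (written 5' to 3') for a sense-strand DNA fragment."""
    # Complement each base using RNA letters, then reverse to read 5' to 3'.
    complement = str.maketrans("ACGT", "UGCA")
    return sense_dna.upper().translate(complement)[::-1]


# Hypothetical fragment: sense DNA 5'-ATGGCC-3' gives the transcript 5'-AUGGCC-3'.
print(antisense_rna("ATGGCC"))  # GGCCAU
```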
A polynucleotide may be transcribed into a ribozyme, or catalytic RNA, that affects expression of an mRNA. Ribozymes can be designed to specifically pair with virtually any target RNA and cleave the phosphodiester backbone at a specific location, thereby functionally inactivating the target RNA. Heterologous polynucleotides can encode ribozymes designed to cleave particular mRNA transcripts, thus preventing expression of a polypeptide. Hammerhead ribozymes are useful for destroying particular mRNAs, although various ribozymes that cleave mRNA at site-specific recognition sequences can be used. Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form complementary base pairs with the target mRNA. The sole requirement is that the target RNA contains a 5'-UG-3' nucleotide sequence. The construction and production of hammerhead ribozymes is known in the art. Hammerhead ribozyme sequences can be embedded in a stable RNA such as a transfer RNA (tRNA) to increase cleavage efficiency in vivo.
In one embodiment, the sequence-specific polynucleotide that can interfere with the translation of RNA transcript(s) is interfering RNA. RNA interference or RNA silencing is an evolutionarily conserved process by which specific mRNAs can be targeted for enzymatic degradation. A double-stranded RNA (dsRNA) is introduced or produced by a cell (for example, double-stranded RNA virus, or interfering RNA polynucleotides) to initiate the interfering RNA pathway. The double-stranded RNA can be converted into multiple small interfering RNA duplexes of 21-24 bp length by RNases III, which are double-stranded RNA-specific endonucleases. The small interfering RNAs can be subsequently recognized by RNA-induced silencing complexes that promote the unwinding of small interfering RNA through an ATP-dependent process. The unwound antisense strand of the small interfering RNA guides the activated RNA-induced silencing complexes to the targeted mRNA comprising a sequence complementary to the small interfering RNA anti-sense strand. The targeted mRNA and the anti-sense strand can form an A-form helix, and the major groove of the A-form helix can be recognized by the activated RNA-induced silencing complexes. The target mRNA can be cleaved by activated RNA-induced silencing complexes at a single site defined by the binding site of the 5'-end of the small interfering RNA strand. The activated RNA-induced silencing complexes can be recycled to catalyze another cleavage event. Interfering RNA expression vectors may comprise interfering RNA constructs encoding interfering RNA polynucleotides that exhibit RNA interference activity by reducing the expression level of mRNAs, pre-mRNAs, or related RNA variants. The expression vectors may comprise a promoter positioned upstream and operably-linked to an interfering RNA construct, as further described herein.
Interfering RNA expression vectors may comprise a suitable minimal core promoter, an interfering RNA construct of interest, an upstream (5') regulatory region, a downstream (3') regulatory region, including transcription termination and polyadenylation signals, and other sequences known to persons skilled in the art, such as various selection markers. Various embodiments are directed to methods for modulating the expression level of one or more of the polynucleotide(s) described herein (or any combination thereof as described herein) by integrating multiple copies of the polynucleotide(s) into a (tobacco) plant genome, comprising: transforming a plant cell host with an expression vector that comprises a promoter operably-linked to a polynucleotide.
Various compositions and methods are provided for modulating the endogenous gene expression level by modulating the translation of mRNA. A host (tobacco) plant cell can be transformed with an expression vector comprising: a promoter operably-linked to a polynucleotide, positioned in anti-sense orientation with respect to the promoter to enable the expression of RNA polynucleotides having a sequence complementary to a portion of mRNA.
Various expression vectors for modulating the translation of mRNA may comprise: a promoter operably-linked to a polynucleotide in which the sequence is positioned in anti-sense orientation with respect to the promoter. The lengths of anti-sense RNA polynucleotides can vary, and may be from about 15-20 nucleotides, about 20-30 nucleotides, about 30-50 nucleotides, about 50-75 nucleotides, about 75-100 nucleotides, about 100-150 nucleotides, about 150-200 nucleotides, and about 200-300 nucleotides. As discussed herein, the expression of one or more polypeptides can be modulated by non-transgenic means - such as creating one or more mutations in one or more genes, as discussed herein. Methods that introduce a mutation randomly in a gene sequence can include chemical mutagenesis, EMS mutagenesis and radiation mutagenesis. Methods that introduce one or more targeted mutations into a cell include but are not limited to genome editing technology, particularly zinc finger nuclease-mediated mutagenesis and targeting induced local lesions in genomes (TILLING), homologous recombination, oligonucleotide-directed mutagenesis, and meganuclease-mediated mutagenesis. In one embodiment, TILLING is used. This is a mutagenesis technology that can be used to generate and/or identify polynucleotides encoding polypeptides with modified expression and/or activity. TILLING also allows selection of plants carrying such mutants. TILLING combines high-density mutagenesis with high-throughput screening methods. Methods for TILLING are well known in the art (see McCallum et al., (2000) Nat Biotechnol 18: 455-457 and Stemple (2004) Nat Rev Genet 5(2): 145-50).
Specific mutations in polynucleotides can be created that can result in modulated gene expression, modulated stability of mRNA, or modulated stability of protein. Such plants are referred to herein as "non-naturally occurring" or "mutant" plants. Typically, the mutant or non-naturally occurring plants will include at least a portion of foreign or synthetic or man-made nucleic acid (for example, DNA or RNA) that was not present in the plant before it was manipulated. The foreign nucleic acid may be a single nucleotide, two or more nucleotides, two or more contiguous nucleotides or two or more non-contiguous nucleotides - such as at least 10, 20, 30, 40, 50, 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400 or 1500 or more contiguous or non-contiguous nucleotides.
The mutant or non-naturally occurring plants or plant cells can have any combination of one or more mutations in one or more genes which results in modulated protein levels. For example, the mutant or non-naturally occurring plants or plant cells may have a single mutation in a single gene; multiple mutations in a single gene; a single mutation in two or more or three or more or four or more genes; or multiple mutations in two or more or three or more or four or more genes. Examples of such mutations are described herein. By way of further example, the mutant or non-naturally occurring plants or plant cells may have one or more mutations in a specific portion of the gene(s) - such as in a region of the gene that encodes an active site of the protein or a portion thereof. By way of further example, the mutant or non-naturally occurring plants or plant cells may have one or more mutations in a region outside of one or more gene(s) - such as in a region upstream or downstream of the gene(s) provided that they modulate the activity or expression of the gene(s). Upstream elements can include promoters, enhancers or transcription factors. Some elements - such as enhancers - can be positioned upstream or downstream of the gene that they regulate. The element(s) need not be located near to the gene that they regulate since some elements have been found located several hundred thousand base pairs upstream or downstream of the gene that they regulate.
The mutant or non-naturally occurring plants or plant cells may have one or more mutations located within the first 100 nucleotides of the gene(s), within the first 200 nucleotides of the gene(s), within the first 300 nucleotides of the gene(s), within the first 400 nucleotides of the gene(s), within the first 500 nucleotides of the gene(s), within the first 600 nucleotides of the gene(s), within the first 700 nucleotides of the gene(s), within the first 800 nucleotides of the gene(s), within the first 900 nucleotides of the gene(s), within the first 1000 nucleotides of the gene(s), within the first 1100 nucleotides of the gene(s), within the first 1200 nucleotides of the gene(s), within the first 1300 nucleotides of the gene(s), within the first 1400 nucleotides of the gene(s) or within the first 1500 nucleotides of the gene(s). The mutant or non-naturally occurring plants or plant cells may have one or more mutations located within the first, second, third, fourth, fifth, sixth, seventh, eighth, ninth, tenth, eleventh, twelfth, thirteenth, fourteenth or fifteenth set of 100 nucleotides of the gene(s) or combinations thereof. Mutant or non-naturally occurring plants or plant cells (for example, mutant, non-naturally occurring or transgenic plants or plant cells and the like, as described herein) comprising the mutant polypeptide variants are disclosed.
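The assignment of a mutation to a numbered set of 100 nucleotides can be made explicit. The following Python sketch assumes that positions are counted from 1 at the first nucleotide of the gene, which is a convention chosen for illustration; the function name is hypothetical.

```python
def hundred_nucleotide_set(position: int) -> int:
    """Map a 1-based nucleotide position to its 1-based set of 100 nucleotides."""
    if position < 1:
        raise ValueError("position must be a positive, 1-based coordinate")
    return (position - 1) // 100 + 1


# Positions 1-100 fall in the first set, 101-200 in the second, and so on.
for pos in (1, 100, 101, 1500):
    print(pos, hundred_nucleotide_set(pos))
```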
In one embodiment, seeds from plants are mutagenised and then grown into first generation mutant plants. The first generation plants are then allowed to self-pollinate and seeds from the first generation plant are grown into second generation plants, which are then screened for mutations in their loci. Though the mutagenized plant material can be screened for mutations, an advantage of screening the second generation plants is that all somatic mutations correspond to germline mutations. One of skill in the art would understand that a variety of plant materials, including but not limited to, seeds, pollen, plant tissue or plant cells, may be mutagenised in order to create the mutant plants. However, the type of plant material mutagenised may affect when the plant nucleic acid is screened for mutations. For example, when pollen is subjected to mutagenesis prior to pollination of a non-mutagenized plant the seeds resulting from that pollination are grown into first generation plants. Every cell of the first generation plants will contain mutations created in the pollen; thus these first generation plants may then be screened for mutations instead of waiting until the second generation.
Mutagens that create primarily point mutations and short deletions, insertions, transversions, and/or transitions, including chemical mutagens or radiation, may be used to create the mutations. Mutagens include, but are not limited to, ethyl methanesulfonate, methyl methanesulfonate, N-ethyl-N-nitrosourea, triethylmelamine, N-methyl-N-nitrosourea, procarbazine, chlorambucil, cyclophosphamide, diethyl sulfate, acrylamide monomer, melphalan, nitrogen mustard, vincristine, dimethylnitrosamine, N-methyl-N'-nitro-N-nitrosoguanidine, nitrosoguanidine, 2-aminopurine, 7,12-dimethylbenz(a)anthracene, ethylene oxide, hexamethylphosphoramide, busulfan, diepoxyalkanes (diepoxyoctane, diepoxybutane, and the like), 2-methoxy-6-chloro-9[3-(ethyl-2-chloro-ethyl)aminopropylamino]acridine dihydrochloride and formaldehyde.
Spontaneous mutations in the locus that may not have been directly caused by the mutagen are also contemplated provided that they result in the desired phenotype. Suitable mutagenic agents can also include, for example, ionising radiation - such as X-rays, gamma rays, fast neutron irradiation and UV radiation. Any method of plant nucleic acid preparation known to those of skill in the art may be used to prepare the plant nucleic acid for mutation screening.
Prepared nucleic acid from individual plants, plant cells, or plant material can optionally be pooled in order to expedite screening for mutations in the population of plants originating from the mutagenized plant tissue, cells or material. One or more subsequent generations of plants, plant cells or plant material can be screened. The size of the optionally pooled group is dependent upon the sensitivity of the screening method used.
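The dependence of pool size on assay sensitivity can be illustrated with a simple calculation. The following Python sketch assumes that sensitivity is stated as the ability to detect one mutant allele among a given number of allele copies, that a mutant plant carries a single mutant copy of the amplified locus (heterozygous), and that each plant contributes an equal number of copies to the pool. These assumptions, the function name and the example figures are hypothetical.

```python
def max_pool_size(detectable_one_in: int, allele_copies_per_plant: int = 2) -> int:
    """Largest pool in which one heterozygous mutant plant remains detectable.

    detectable_one_in: the assay detects one mutant allele among this many copies.
    allele_copies_per_plant: copies of the amplified locus contributed per plant.
    """
    if allele_copies_per_plant < 1:
        raise ValueError("each plant must contribute at least one allele copy")
    if detectable_one_in < allele_copies_per_plant:
        raise ValueError("assay cannot detect a heterozygous mutant even unpooled")
    return detectable_one_in // allele_copies_per_plant


# Hypothetical example: an assay detecting 1 mutant allele in 16 copies.
print(max_pool_size(16))      # two copies per plant
print(max_pool_size(16, 4))   # four copies per plant amplified by the primers
```

In a polyploid species, primers that amplify more than one homeologous copy would raise the number of copies per plant and correspondingly lower the usable pool size under these assumptions.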
After the nucleic acid samples are optionally pooled, they can be subjected to polynucleotide-specific amplification techniques, such as Polymerase Chain Reaction. Any one or more primers or probes specific to the gene or the sequences immediately adjacent to the gene may be utilized to amplify the sequences within the optionally pooled nucleic acid sample. Suitably, the one or more primers or probes are designed to amplify the regions of the locus where useful mutations are most likely to arise. Most preferably, the primer is designed to detect mutations within regions of the polynucleotide. Additionally, it is preferable for the primer(s) and probe(s) to avoid known polymorphic sites in order to ease screening for point mutations. To facilitate detection of amplification products, the one or more primers or probes may be labelled using any conventional labelling method. Primer(s) or probe(s) can be designed based upon the sequences described herein using methods that are well understood in the art.
Polymorphisms may be identified by means known in the art, and some have been described in the literature.
In a further aspect there is provided a method of preparing a mutant plant. The method involves providing at least one cell of a plant comprising a gene encoding a functional polynucleotide described herein (or any combination thereof as described herein). Next, the at least one cell of the plant is treated under conditions effective to modulate the activity of the polynucleotide(s) described herein. The at least one mutant plant cell is then propagated into a mutant plant, where the mutant plant has a modulated level of polypeptide(s) described (or any combination thereof as described herein) as compared to that of a control plant. In one embodiment of this method of making a mutant plant, the treating step involves subjecting the at least one cell to a chemical mutagenising agent as described above and under conditions effective to yield at least one mutant plant cell. In another embodiment of this method, the treating step involves subjecting the at least one cell to a radiation source under conditions effective to yield at least one mutant plant cell. The term "mutant plant" includes mutant plants in which the genotype is modified as compared to a control plant, suitably by means other than genetic engineering or genetic modification.
In certain embodiments, the mutant plant, mutant plant cell or mutant plant material may comprise one or more mutations that have occurred naturally in another plant, plant cell or plant material and confer a desired trait. This mutation can be incorporated (for example, introgressed) into another plant, plant cell or plant material (for example, a plant, plant cell or plant material with a different genetic background to the plant from which the mutation was derived) to confer the trait thereto. Thus by way of example, a mutation that occurred naturally in a first plant may be introduced into a second plant - such as a second plant with a different genetic background to the first plant. The skilled person is therefore able to search for and identify a plant carrying naturally in its genome one or more mutant alleles of the genes described herein which confer a desired trait. The mutant allele(s) that occur naturally can be transferred to the second plant by various methods including breeding, backcrossing and introgression to produce lines, varieties or hybrids that have one or more mutations in the genes described herein. Plants showing a desired trait may be screened out of a pool of mutant plants. Suitably, the selection is carried out utilising the knowledge of the nucleotide sequences as described herein. Consequently, it is possible to screen for a genetic trait as compared to a control. Such a screening approach may involve the application of conventional nucleic acid amplification and/or hybridization techniques as discussed herein. Thus, a further aspect of the present invention relates to a method for identifying a mutant plant comprising the steps of: (a) providing a sample comprising nucleic acid from a plant; and (b) determining the nucleic acid sequence of the polynucleotide, wherein a difference in the sequence of the polynucleotide as compared to the polynucleotide sequence of a control plant is indicative that said plant is a mutant plant.
In another aspect there is provided a method for identifying a mutant plant which accumulates increased or reduced levels of protease as compared to a control plant comprising the steps of: (a) providing a sample from a plant to be screened; (b) determining if said sample comprises one or more mutations in one or more of the polynucleotides described herein; and (c) determining at least the protease content of said plant during or after a curing procedure.
In another aspect there is provided a method for preparing a mutant plant which has increased or reduced levels of protease as compared to a control plant comprising the steps of: (a) providing a sample from a first plant; (b) determining if said sample comprises one or more mutations in one or more of the polynucleotides described herein that result in modulated levels of a protease; and (c) transferring the one or more mutations into a second plant. Suitably at least the protease content is determined in cured leaf material. The mutation(s) can be transferred into the second plant using various methods that are known in the art - such as by genetic engineering, genetic manipulation, introgression, plant breeding, backcrossing and the like. In one embodiment, the first plant is a naturally occurring plant. In one embodiment, the second plant has a different genetic background to the first plant. In another aspect there is provided a method for preparing a mutant plant which has increased or reduced levels of a protease as compared to a control plant comprising the steps of: (a) providing a sample from a first plant; (b) determining if said sample comprises one or more mutations in one or more of the polynucleotides described herein that result in modulated levels of the protease; and (c) introgressing the one or more mutations from the first plant into a second plant. Suitably at least the protease content is determined in cured leaf material. In one embodiment, the step of introgressing comprises plant breeding, optionally including backcrossing and the like. In one embodiment, the first plant is a naturally occurring plant. In one embodiment, the second plant has a different genetic background to the first plant. In one embodiment, the first plant is not a cultivar or an elite cultivar. In one embodiment, the second plant is a cultivar or an elite cultivar.
A further aspect relates to a mutant plant (including a cultivar or elite cultivar mutant plant) obtained or obtainable by the methods described herein. In certain embodiments, the "mutant plants" may have one or more mutations localised only to a specific region of the plant - such as within the sequence of the one or more polynucleotide(s) described herein. According to this embodiment, the remaining genomic sequence of the mutant plant will be the same or substantially the same as the plant prior to the mutagenesis.
In certain embodiments, the mutant plants may have one or more mutations localised in more than one region of the plant - such as within the sequence of one or more of the polynucleotides described herein and in one or more further regions of the genome. According to this embodiment, the remaining genomic sequence of the mutant plant will not be the same or will not be substantially the same as the plant prior to the mutagenesis. In certain embodiments, the mutant plants may not have one or more mutations in one or more, two or more, three or more, four or more or five or more exons of the polynucleotide(s) described herein; or may not have one or more mutations in one or more, two or more, three or more, four or more or five or more introns of the polynucleotide(s) described herein; or may not have one or more mutations in a promoter of the polynucleotide(s) described herein; or may not have one or more mutations in the 3' untranslated region of the polynucleotide(s) described herein; or may not have one or more mutations in the 5' untranslated region of the polynucleotide(s) described herein; or may not have one or more mutations in the coding region of the polynucleotide(s) described herein; or may not have one or more mutations in the non-coding region of the polynucleotide(s) described herein; or any combination of two or more, three or more, four or more, five or more, or six or more parts thereof.
In a further aspect there is provided a method of identifying a plant, a plant cell or plant material comprising a mutation in a gene encoding a polynucleotide described herein comprising: (a) subjecting a plant, a plant cell or plant material to mutagenesis; (b) obtaining a nucleic acid sample from said plant, plant cell or plant material or descendants thereof; and (c) determining the nucleic acid sequence of the gene encoding a polynucleotide described herein or a variant or a fragment thereof, wherein a difference in said sequence is indicative of one or more mutations therein.
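The sequence comparison of step (c) can be sketched, purely by way of example, as a position-by-position comparison of the determined sequence against that of a control plant. The sketch assumes that the two sequences are already aligned and of equal length, so that only substitutions are reported; insertions and deletions would require a proper alignment step. The function name is hypothetical.

```python
def find_substitutions(control: str, sample: str) -> list[tuple[int, str, str]]:
    """List (1-based position, control base, sample base) for every mismatch.

    Assumes both sequences are pre-aligned and of equal length.
    """
    if len(control) != len(sample):
        raise ValueError("sequences must be aligned and of equal length")
    return [
        (index + 1, ref_base, alt_base)
        for index, (ref_base, alt_base) in enumerate(zip(control.upper(), sample.upper()))
        if ref_base != alt_base
    ]
```

A non-empty result indicates a difference relative to the control sequence, which under the method above is indicative of one or more mutations.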
Zinc finger proteins can be used to modulate the expression or the activity of one or more of the polynucleotides described herein. In various embodiments, a genomic DNA sequence comprising a part of or all of the coding sequence of the polynucleotide is modified by zinc finger nuclease-mediated mutagenesis. The genomic DNA sequence is searched for a unique site for zinc finger protein binding. Alternatively, the genomic DNA sequence is searched for two unique sites for zinc finger protein binding wherein both sites are on opposite strands and close together, for example, 1, 2, 3, 4, 5, 6 or more basepairs apart. Accordingly, zinc finger proteins that bind to polynucleotides are provided.
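The search for two unique binding sites on opposite strands can be sketched as follows. This is an illustrative sketch only: it assumes that each recognition sequence is written 5' to 3' on the strand it binds, so that the second site appears on the top strand as its reverse complement, and it uses a default spacer window of 1 to 6 basepairs. All function names, parameters and the example sequences are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]

def find_all(seq: str, motif: str) -> list[int]:
    """Return every 0-based start position of motif in seq, overlaps included."""
    hits = []
    position = seq.find(motif)
    while position != -1:
        hits.append(position)
        position = seq.find(motif, position + 1)
    return hits

def paired_unique_sites(genomic, left_site, right_site, min_gap=1, max_gap=6):
    """Locate a left site on the top strand and a right site on the bottom strand.

    Returns (left_start, right_start, gap) in top-strand coordinates, or None
    if either site is absent or not unique, or if the spacer between them is
    outside the allowed window.
    """
    left_hits = find_all(genomic, left_site)
    # A site bound on the bottom strand reads as its reverse complement on top.
    right_hits = find_all(genomic, reverse_complement(right_site))
    if len(left_hits) != 1 or len(right_hits) != 1:
        return None
    gap = right_hits[0] - (left_hits[0] + len(left_site))
    if min_gap <= gap <= max_gap:
        return left_hits[0], right_hits[0], gap
    return None
```

Requiring exactly one hit for each site reflects the uniqueness condition stated above; in practice uniqueness would be assessed against the whole genome rather than a single locus.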
A zinc finger protein may be engineered to recognize a selected target site in a gene. A zinc finger protein can comprise any combination of motifs derived from natural zinc finger DNA-binding domains and non-natural zinc finger DNA-binding domains by truncation or expansion or a process of site-directed mutagenesis coupled to a selection method such as, but not limited to, phage display selection, bacterial two-hybrid selection or bacterial one-hybrid selection. The term "non-natural zinc finger DNA-binding domain" refers to a zinc finger DNA-binding domain that binds a three-base pair sequence within the target nucleic acid and that does not occur in the cell or organism comprising the nucleic acid which is to be modified. Methods for the design of zinc finger proteins which bind specific nucleotide sequences that are unique to a target gene are known in the art.
In other embodiments, a zinc finger protein may be selected to bind to a regulatory sequence of a polynucleotide. More specifically, the regulatory sequence may comprise a transcription initiation site, a start codon, a region of an exon, a boundary of an exon-intron, a terminator, or a stop codon. Accordingly, the invention provides a mutant, non-naturally occurring or transgenic plant or plant cells, produced by zinc finger nuclease-mediated mutagenesis in the vicinity of or within one or more polynucleotides described herein, and methods for making such a plant or plant cell by zinc finger nuclease-mediated mutagenesis. Methods for delivering zinc finger protein and zinc finger nuclease to a tobacco plant are similar to those described below for delivery of meganuclease.
Plants suitable for use in genetic modification include, but are not limited to, monocotyledonous and dicotyledonous plants and plant cell systems, including species from one of the following families: Acanthaceae, Alliaceae, Alstroemeriaceae, Amaryllidaceae, Apocynaceae, Arecaceae, Asteraceae, Berberidaceae, Bixaceae, Brassicaceae, Bromeliaceae, Cannabaceae, Caryophyllaceae, Cephalotaxaceae, Chenopodiaceae, Colchicaceae, Cucurbitaceae, Dioscoreaceae, Ephedraceae, Erythroxylaceae, Euphorbiaceae, Fabaceae, Lamiaceae, Linaceae, Lycopodiaceae, Malvaceae, Melanthiaceae, Musaceae, Myrtaceae, Nyssaceae, Papaveraceae, Pinaceae, Plantaginaceae, Poaceae, Rosaceae, Rubiaceae, Salicaceae, Sapindaceae, Solanaceae, Taxaceae, Theaceae, or Vitaceae.
Suitable species may include members of the genera Abelmoschus, Abies, Acer, Agrostis, Allium, Alstroemeria, Ananas, Andrographis, Andropogon, Artemisia, Arundo, Atropa, Berberis, Beta, Bixa, Brassica, Calendula, Camellia, Camptotheca, Cannabis, Capsicum, Carthamus, Catharanthus, Cephalotaxus, Chrysanthemum, Cinchona, Citrullus, Coffea, Colchicum, Coleus, Cucumis, Cucurbita, Cynodon, Datura, Dianthus, Digitalis, Dioscorea, Elaeis, Ephedra, Erianthus, Erythroxylum, Eucalyptus, Festuca, Fragaria, Galanthus, Glycine, Gossypium, Helianthus, Hevea, Hordeum, Hyoscyamus, Jatropha, Lactuca, Linum, Lolium, Lupinus, Lycopersicon, Lycopodium, Manihot, Medicago, Mentha, Miscanthus, Musa, Nicotiana, Oryza, Panicum, Papaver, Parthenium, Pennisetum, Petunia, Phalaris, Phleum, Pinus, Poa, Poinsettia, Populus, Rauwolfia, Ricinus, Rosa, Saccharum, Salix, Sanguinaria, Scopolia, Secale, Solanum, Sorghum, Spartina, Spinacea, Tanacetum, Taxus, Theobroma, Triticosecale, Triticum, Uniola, Veratrum, Vinca, Vitis, and Zea.
Suitable species may include Panicum spp., Sorghum spp., Miscanthus spp., Saccharum spp., Erianthus spp., Populus spp., Andropogon gerardii (big bluestem), Pennisetum purpureum (elephant grass), Phalaris arundinacea (reed canarygrass), Cynodon dactylon (bermudagrass), Festuca arundinacea (tall fescue), Spartina pectinata (prairie cord-grass), Medicago sativa (alfalfa), Arundo donax (giant reed), Secale cereale (rye), Salix spp. (willow), Eucalyptus spp. (eucalyptus), Triticosecale (triticale; wheat x rye), bamboo, Helianthus annuus (sunflower), Carthamus tinctorius (safflower), Jatropha curcas (jatropha), Ricinus communis (castor), Elaeis guineensis (palm), Linum usitatissimum (flax), Brassica juncea, Beta vulgaris (sugarbeet), Manihot esculenta (cassava), Lycopersicon esculentum (tomato), Lactuca sativa (lettuce), Musa paradisiaca (banana), Solanum tuberosum (potato), Brassica oleracea (broccoli, cauliflower, Brussels sprouts), Camellia sinensis (tea), Fragaria ananassa (strawberry), Theobroma cacao (cocoa), Coffea arabica (coffee), Vitis vinifera (grape), Ananas comosus (pineapple), Capsicum annuum (hot & sweet pepper), Allium cepa (onion), Cucumis melo (melon), Cucumis sativus (cucumber), Cucurbita maxima (squash), Cucurbita moschata (squash), Spinacea oleracea (spinach), Citrullus lanatus (watermelon), Abelmoschus esculentus (okra), Solanum melongena (eggplant), Rosa spp. (rose), Dianthus caryophyllus (carnation), Petunia spp. (petunia), Poinsettia pulcherrima (poinsettia), Lupinus albus (lupin), Uniola paniculata (oats), bentgrass (Agrostis spp.), Populus tremuloides (aspen), Pinus spp. (pine), Abies spp. (fir), Acer spp. (maple), Hordeum vulgare (barley), Poa pratensis (bluegrass), Lolium spp. (ryegrass) and Phleum pratense (timothy), Panicum virgatum (switchgrass), Sorghum bicolor (sorghum, sudangrass), Miscanthus giganteus (miscanthus), Saccharum sp. (energycane), Populus balsamifera (poplar), Zea mays (corn), Glycine max (soybean), Brassica napus (canola), Triticum aestivum (wheat), Gossypium hirsutum (cotton), Oryza sativa (rice), Helianthus annuus (sunflower), Medicago sativa (alfalfa), Beta vulgaris (sugarbeet), or Pennisetum glaucum (pearl millet).
Various embodiments are directed to mutant tobacco, non-naturally occurring tobacco or transgenic tobacco plants or plant cells modified to modulate gene expression levels thereby producing a plant or plant cell - such as a tobacco plant or plant cell - in which the expression level of a polypeptide is modulated within tissues of interest as compared to a control. The disclosed compositions and methods can be applied to any species of the genus Nicotiana, including N. rustica and N. tabacum (for example, LA B21, LN KY171, TI 1406, Basma, Galpao, Perique, Beinhart 1000-1, and Petico). Other species include N. acaulis, N. acuminata, N. africana, N. alata, N. ameghinoi, N. amplexicaulis, N. arentsii, N. attenuata, N. azambujae, N. benavidesii, N. benthamiana, N. bigelovii, N. bonariensis, N. cavicola, N. clevelandii, N. cordifolia, N. corymbosa, N. debneyi, N. excelsior, N. forgetiana, N. fragrans, N. glauca, N. glutinosa, N. goodspeedii, N. gossei, N. hybrid, N. ingulba, N. kawakamii, N. knightiana, N. langsdorffii, N. linearis, N. longiflora, N. maritima, N. megalosiphon, N. miersii, N. noctiflora, N. nudicaulis, N. obtusifolia, N. occidentalis, N. occidentalis subsp. hesperis, N. otophora, N. paniculata, N. pauciflora, N. petunioides, N. plumbaginifolia, N. quadrivalvis, N. raimondii, N. repanda, N. rosulata, N. rosulata subsp. ingulba, N. rotundifolia, N. setchellii, N. simulans, N. solanifolia, N. spegazzinii, N. stocktonii, N. suaveolens, N. sylvestris, N. thyrsiflora, N. tomentosa, N. tomentosiformis, N. trigonophylla, N. umbratica, N. undulata, N. velutina, N. wigandioides, and N. x sanderae. The use of tobacco cultivars and elite tobacco cultivars is also contemplated herein. The transgenic, non-naturally occurring or mutant plant may therefore be a tobacco variety or elite tobacco cultivar that comprises one or more transgenes, or one or more genetic mutations or a combination thereof.
The genetic mutation(s) (for example, one or more polymorphisms) can be mutations that do not exist naturally in the individual tobacco variety or tobacco cultivar (for example, elite tobacco cultivar) or can be genetic mutation(s) that do occur naturally provided that the mutation does not occur naturally in the individual tobacco variety or tobacco cultivar (for example, elite tobacco cultivar).
Particularly useful Nicotiana tabacum varieties include Burley type, dark type, flue-cured type, and Oriental type tobaccos. Non-limiting examples of varieties or cultivars are: BD 64, CC 101, CC 200, CC 27, CC 301, CC 400, CC 500, CC 600, CC 700, CC 800, CC 900, Coker 176, Coker 319, Coker 371 Gold, Coker 48, CD 263, DF911, DT 538 LC, Galpao tobacco, GL 26H, GL 350, GL 600, GL 737, GL 939, GL 973, HB 04P, HB 04P LC, HB3307PLC, Hybrid 403LC, Hybrid 404LC, Hybrid 501 LC, K 149, K 326, K 346, K 358, K394, K 399, K 730, KDH 959, KT 200, KT204LC, KY10, KY14, KY 160, KY 17, KY 171, KY 907, KY907LC, KY14xL8 LC, Little Crittenden, McNair 373, McNair 944, msKY 14xL8, Narrow Leaf Madole, Narrow Leaf Madole LC, NBH 98, N-126, N-777LC, N-7371 LC, NC 100, NC 102, NC 2000, NC 291, NC 297, NC 299, NC 3, NC 4, NC 5, NC 6, NC7, NC 606, NC 71, NC 72, NC 810, NC BH 129, NC 2002, Neal Smith Madole, OXFORD 207, PD 7302 LC, PD 7309 LC, PD 7312 LC, 'Perique' tobacco, PVH03, PVH09, PVH19, PVH50, PVH51, R 610, R 630, R 7-11, R 7-12, RG 17, RG 81, RG H51, RGH 4, RGH 51, RS 1410, Speight 168, Speight 172, Speight 179, Speight 210, Speight 220, Speight 225, Speight 227, Speight 234, Speight G-28, Speight G-70, Speight H-6, Speight H20, Speight NF3, TI 1406, TI 1269, TN 86, TN86LC, TN 90, TN 97, TN97LC, TN D94, TN D950, TR (Tom Rosson) Madole, VA 309, VA359, AA 37-1, B13P, Xanthi (Mitchell-Mor), Bel-W3, 79-615, Samsun Holmes NN, KTRDC number 2 Hybrid 49, Burley 21, KY8959, KY9, MD 609, PG01, PG04, P01, P02, P03, RG11, RG 8, VA509, AS44, Banket A1, Basma Drama B84/31, Basma I Zichna ZP4/B, Basma Xanthi BX 2A, Batek, Besuki Jember, C104, Coker 347, Criollo Misionero, Delcrest, Djebel 81, DVH 405, Galpao Comum, HB04P, Hicks Broadleaf, Kabakulak Elassona, Kutsage E1, LA BU 21, NC 2326, NC 297, PVH 2110, Red Russian, Samsun, Saplak, Simmaba, Talgar 28, Wislica, Yayaldag, Prilep HC-72, Prilep P23, Prilep PB 156/1, Prilep P12-2/1, Yaka JK-48, Yaka JB 125/3, TI-1068, KDH-960, TI-1070, TW136, Basma, TKF 4028, L8, TKF 2002, GR141, Basma xanthi, GR149, GR153, Petit Havana. Low converter subvarieties of the above, even if not specifically identified herein, are also contemplated.
Embodiments are also directed to compositions and methods for producing mutant plants, non-naturally occurring plants, hybrid plants, or transgenic plants that have been modified to modulate the expression or activity of a polynucleotide(s) described herein (or any combination thereof as described herein). Advantageously, the mutant plants, non-naturally occurring plants, hybrid plants, or transgenic plants that are obtained may be similar or substantially the same in overall appearance to control plants. Various phenotypic characteristics such as degree of maturity, number of leaves per plant, stalk height, leaf insertion angle, leaf size (width and length), internode distance, and lamina-midrib ratio can be assessed by field observations.
One aspect relates to a seed of a mutant plant, a non-naturally occurring plant, a hybrid plant or a transgenic plant described herein. Preferably, the seed is a tobacco seed. A further aspect relates to pollen or an ovule of a mutant plant, a non-naturally occurring plant, a hybrid plant or a transgenic plant that is described herein. In addition, there is provided a mutant plant, a non-naturally occurring plant, a hybrid plant or a transgenic plant as described herein which further comprises a nucleic acid conferring male sterility.
Also provided is a tissue culture of regenerable cells of the mutant plant, non-naturally occurring plant, hybrid plant, or transgenic plant or a part thereof as described herein, which culture regenerates plants capable of expressing all the morphological and physiological characteristics of the parent. The regenerable cells include but are not limited to cells from leaves, pollen, embryos, cotyledons, hypocotyls, roots, root tips, anthers, flowers and a part thereof, ovules, shoots, stems, stalks, pith and capsules or callus or protoplasts derived therefrom.
A still further aspect relates to a cured plant material - such as cured leaf or cured tobacco - derived or derivable from a mutant, non-naturally occurring or transgenic plant or cell, wherein expression of one or more of the polynucleotides described herein or the activity of the protein encoded thereby is modulated. Suitably the visual appearance of said plant (for example, leaf) is substantially the same as the control plant. Suitably, the plant is a tobacco plant.
Embodiments are also directed to compositions and methods for producing mutant, non-naturally occurring or transgenic plants or plant cells that have been modified to modulate the expression or activity of the one or more of the polynucleotides or polypeptides described herein which can result in plants or plant components (for example, leaves - such as green leaves or cured leaves - or tobacco) or plant cells with modulated levels of proteases.
In another aspect, there is provided a method for modulating (e.g. increasing) the amount of protease in at least a part of a plant (for example, the leaves - such as cured leaves - or in tobacco), comprising the steps of: (i) modulating (e.g. increasing) the expression or activity of one or more of the polypeptides described herein (or any combination thereof as described herein), suitably, wherein the polypeptide(s) is encoded by the corresponding polynucleotide sequence described herein; (ii) measuring the protease content in at least a part (for example, the leaves - such as cured leaves - or tobacco or in smoke) of the mutant, non-naturally occurring or transgenic plant obtained in step (i); and (iii) identifying a mutant, non-naturally occurring or transgenic plant in which the protease content therein has been modulated (e.g. increased) in comparison to a control plant. Suitably, the visual appearance of said mutant, non-naturally occurring or transgenic plant is substantially the same as the control plant. Suitably, the plant is a tobacco plant.
In another aspect, there is provided a method for modulating (e.g. increasing) the amount of protease in at least a part of cured plant material - such as cured leaf - comprising the steps of: (i) modulating (e.g. increasing) the expression or activity of one or more of the polypeptides (or any combination thereof as described herein), suitably, wherein the polypeptide(s) is encoded by the corresponding polynucleotide sequence described herein; (ii) harvesting plant material - such as one or more of the leaves - and curing for a period of time; (iii) measuring the protease content in at least a part of the cured plant material obtained in step (ii) or during step (ii); and (iv) identifying cured plant material in which the protease content therein has been modulated (e.g. increased) in comparison to a control plant.
An increase in expression as compared to the control may be from about 5 % to about 100 %, or an increase of at least 10 %, at least 20 %, at least 25 %, at least 30 %, at least 40 %, at least 50 %, at least 60 %, at least 70 %, at least 75 %, at least 80 %, at least 90 %, at least 95 %, at least 98 %, or 100 % or more - such as 200%, 300%, 500%, 1000% or more, which includes an increase in transcriptional activity or polynucleotide expression or polypeptide expression or a combination thereof. An increase in activity as compared to a control may be from about 5 % to about 100 %, or an increase of at least 10 %, at least 20 %, at least 25 %, at least 30 %, at least 40 %, at least 50 %, at least 60 %, at least 70 %, at least 75 %, at least 80 %, at least 90 %, at least 95 %, at least 98 %, or 100 % or more - such as 200%, 300%, 500%, 1000% or more.
A reduction in expression as compared to a control may be from about 5 % to about 100 %, or a reduction of at least 10 %, at least 20 %, at least 25 %, at least 30 %, at least 40 %, at least 50 %, at least 60 %, at least 70 %, at least 75 %, at least 80 %, at least 90 %, at least 95 %, at least 98 %, or 100 %, which includes a reduction in transcriptional activity or polynucleotide expression or polypeptide expression or a combination thereof.
A reduction in activity as compared to a control may be from about 5 % to about 100 %, or a reduction of at least 10 %, at least 20 %, at least 25 %, at least 30 %, at least 40 %, at least 50 %, at least 60 %, at least 70 %, at least 75 %, at least 80 %, at least 90 %, at least 95 %, at least 98 %, or 100 %.
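For illustration, the percentage increases and reductions referred to above can be computed from a measured value and the corresponding control value as shown in the sketch below. The measurement units and the function name are assumptions; any expression or activity measure can be used provided that the sample and the control are measured in the same way.

```python
def percent_change(control_level: float, sample_level: float) -> float:
    """Percentage change of sample_level relative to control_level.

    Positive values are increases and negative values are reductions.
    A reduction of 100 % corresponds to no detectable expression or activity.
    """
    if control_level <= 0:
        raise ValueError("control_level must be positive")
    if sample_level < 0:
        raise ValueError("sample_level must not be negative")
    return (sample_level - control_level) / control_level * 100
```

For example, a sample measuring 125 units against a control of 100 units shows an increase of 25 %, whereas a sample measuring 50 units shows a reduction of 50 %.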
Polynucleotides and recombinant constructs described herein can be used to modulate the expression of the proteases described herein in a plant species of interest, suitably tobacco. A number of polynucleotide based methods can be used to increase gene expression in plants and plant cells. By way of example, a construct, vector or expression vector that is compatible with the plant to be transformed can be prepared which comprises the gene of interest together with an upstream promoter that is capable of overexpressing the gene in the plant or plant cell. Exemplary promoters are described herein. Following transformation and when grown under suitable conditions, the promoter can drive expression in order to modulate (for example, reduce) the levels of this enzyme in the plant, or in a specific tissue thereof. In one exemplary embodiment, a vector carrying one or more polynucleotides described herein (or any combination thereof as described herein) is generated to overexpress the gene in a plant or plant cell. The vector carries a suitable promoter - such as the cauliflower mosaic virus CaMV 35S promoter - upstream of the transgene driving its constitutive expression in all tissues of the plant. The vector also carries an antibiotic resistance gene in order to confer selection of the transformed calli and cell lines.
In a preferred embodiment, a promoter and regulatory sequences are derived from one or more of SEQ ID NOs: 1-80. These regulatory sequences can be used in conjunction with cognate or non-cognate expression sequences to increase expression of said sequences in a tobacco plant during the curing procedure.
The expression of sequences from promoters can be enhanced by including expression control sequences, including enhancers, chromatin activating elements, transcription factor responsive elements and the like. Such control sequences may be constitutive, and upregulate transcription in a universal manner; or they may be facultative, and upregulate transcription in response to specific signals. Signals associated with senescence and signals which are active during the curing procedure are specifically indicated.
Various embodiments are therefore directed to methods for modulating (for example, increasing) the expression level of one or more polynucleotides described herein (or any combination thereof as described herein) by integrating multiple copies of the polynucleotide into a plant genome, comprising: transforming a plant cell host with an expression vector that comprises a promoter operably-linked to one or more polynucleotides described herein. The polypeptide encoded by a recombinant polynucleotide can be a native polypeptide, or can be heterologous to the cell.
A tobacco plant carrying a mutant allele of one or more polynucleotides described herein (or any combination thereof as described herein) can be used in a plant breeding program to create useful lines, varieties and hybrids. In particular, the mutant allele is introgressed into the commercially important varieties described above. Thus, methods for breeding plants are provided, that comprise crossing a mutant plant, a non-naturally occurring plant or a transgenic plant as described herein with a plant comprising a different genetic identity. The method may further comprise crossing the progeny plant with another plant, and optionally repeating the crossing until a progeny with the desirable genetic traits or genetic background is obtained. One purpose served by such breeding methods is to introduce a desirable genetic trait into other varieties, breeding lines, hybrids or cultivars, particularly those that are of commercial interest. Another purpose is to facilitate stacking of genetic modifications of different genes in a single plant variety, lines, hybrids or cultivars. Intraspecific as well as interspecific matings are contemplated. The progeny plants that arise from such crosses, also referred to as breeding lines, are examples of non-naturally occurring plants of the invention.
In one embodiment, a method is provided for producing a non-naturally occurring tobacco plant comprising: (a) crossing a mutant or transgenic tobacco plant with a second tobacco plant to yield progeny tobacco seed; (b) growing the progeny tobacco seed, under plant growth conditions, to yield the non-naturally occurring tobacco plant. The method may further comprise: (c) crossing the previous generation of non-naturally occurring tobacco plant with itself or another tobacco plant to yield progeny tobacco seed; (d) growing the progeny tobacco seed of step (c) under plant growth conditions, to yield additional non-naturally occurring tobacco plants; and (e) repeating the crossing and growing steps of (c) and (d) multiple times to generate further generations of non-naturally occurring tobacco plants. The method may optionally comprise, prior to step (a), a step of providing a parent plant which comprises a genetic identity that is characterized and that is not identical to the mutant or transgenic plant. In some embodiments, depending on the breeding program, the crossing and growing steps are repeated from 0 to 2 times, from 0 to 3 times, from 0 to 4 times, from 0 to 5 times, from 0 to 6 times, from 0 to 7 times, from 0 to 8 times, from 0 to 9 times or from 0 to 10 times, in order to generate generations of non-naturally occurring tobacco plants. Backcrossing is an example of such a method wherein a progeny is crossed with one of its parents or another plant genetically similar to its parent, in order to obtain a progeny plant in the next generation that has a genetic identity which is closer to that of one of the parents. Techniques for plant breeding, particularly tobacco plant breeding, are well known and can be used in the methods of the invention. The invention further provides non-naturally occurring tobacco plants produced by these methods. Certain embodiments exclude the step of selecting a plant.
In some embodiments of the methods described herein, lines resulting from breeding and screening for variant genes are evaluated in the field using standard field procedures. Control genotypes including the original unmutagenized parent are included and entries are arranged in the field in a randomized complete block design or other appropriate field design. For tobacco, standard agronomic practices are used, for example, the tobacco is harvested, weighed, and sampled for chemical and other common testing before and during curing. Statistical analyses of the data are performed to confirm the similarity of the selected lines to the parental line. Cytogenetic analyses of the selected plants are optionally performed to confirm the chromosome complement and chromosome pairing relationships. DNA fingerprinting, single nucleotide polymorphism, microsatellite markers, or similar technologies may be used in a marker-assisted selection (MAS) breeding program to transfer or breed mutant alleles of a gene into other tobaccos, as described herein. For example, a breeder can create segregating populations from hybridizations of a genotype containing a mutant allele with an agronomically desirable genotype. Plants in the F2 or backcross generations can be screened using a marker developed from a genomic sequence or a fragment thereof, using one of the techniques listed herein. Plants identified as possessing the mutant allele can be backcrossed or self-pollinated to create a second population to be screened. Depending on the expected inheritance pattern or the MAS technology used, it may be necessary to self-pollinate the selected plants before each cycle of backcrossing to aid identification of the desired individual plants. Backcrossing or other breeding procedure can be repeated until the desired phenotype of the recurrent parent is recovered.
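A randomized complete block design of the kind referred to above can be sketched as follows. The example is illustrative only: each block contains every entry (selected lines plus the unmutagenized parental control) exactly once, in an independently randomized order. The entry names, the function name and the fixed seed are hypothetical.

```python
import random

def randomized_complete_blocks(entries: list[str], n_blocks: int, seed: int = 0) -> list[list[str]]:
    """Return one randomized ordering of all entries per block.

    Every entry appears exactly once in each block; the order within each
    block is shuffled independently. A fixed seed keeps the layout reproducible.
    """
    if n_blocks < 1:
        raise ValueError("n_blocks must be at least 1")
    rng = random.Random(seed)
    layout = []
    for _ in range(n_blocks):
        block = list(entries)  # copy so the caller's list is not modified
        rng.shuffle(block)
        layout.append(block)
    return layout

# Hypothetical entries: three selected lines and the parental control.
field_plan = randomized_complete_blocks(["line_1", "line_2", "line_3", "parent_control"], n_blocks=4)
```

Including the parental control in every block allows each selected line to be compared with the parent under the same local field conditions.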
In a breeding program, successful crosses yield F1 plants that are fertile. Selected F1 plants can be crossed with one of the parents, and the first backcross generation plants are self-pollinated to produce a population that is again screened for variant gene expression (for example, the null version of the gene). The process of backcrossing, self-pollination, and screening is repeated, for example, at least 4 times until the final screening produces a plant that is fertile and reasonably similar to the recurrent parent. This plant, if desired, is self-pollinated and the progeny are subsequently screened again to confirm that the plant exhibits variant gene expression. In some embodiments, a plant population in the F2 generation is screened for variant gene expression, for example, a plant is identified that fails to express a polypeptide due to the absence of the gene according to standard methods, for example, by using a PCR method with primers based upon the nucleotide sequence information for the polynucleotide(s) described herein (or any combination thereof as described herein).
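As an illustration of the F2 screening step, the following minimal sketch estimates how many F2 plants would need to be screened to find at least one plant homozygous for a null allele. It assumes, purely for illustration, a single recessive allele segregating 1:2:1 in the F2 (so an expected fraction of 1/4 homozygous null plants) and independent sampling of plants; the function name and parameters are hypothetical and are not taken from the specification.

```python
import math


def plants_to_screen(target_fraction: float, confidence: float) -> int:
    """Smallest n with P(at least one target plant among n plants) >= confidence.

    Assumes each plant independently carries the target genotype with
    probability `target_fraction` (0.25 for a homozygous null in an F2
    under simple 1:2:1 Mendelian segregation, an illustrative assumption).
    """
    if not 0.0 < target_fraction < 1.0:
        raise ValueError("target_fraction must lie strictly between 0 and 1")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must lie strictly between 0 and 1")
    # P(no target plant among n) = (1 - f)^n, which must be <= 1 - confidence.
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - target_fraction))


if __name__ == "__main__":
    for conf in (0.95, 0.99):
        n = plants_to_screen(0.25, conf)
        print(f"confidence {conf:.2f}: screen at least {n} F2 plants")
```

Other inheritance patterns (for example, two unlinked loci) would change only the value passed as `target_fraction`.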
Hybrid tobacco varieties can be produced by preventing self-pollination of female parent plants (that is, seed parents) of a first variety, permitting pollen from male parent plants of a second variety to fertilize the female parent plants, and allowing F1 hybrid seeds to form on the female plants. Self-pollination of female plants can be prevented by emasculating the flowers at an early stage of flower development. Alternatively, pollen formation can be prevented on the female parent plants using a form of male sterility. For example, male sterility can be produced by cytoplasmic male sterility (CMS), or transgenic male sterility wherein a transgene inhibits microsporogenesis and/or pollen formation, or self-incompatibility. Female parent plants containing CMS are particularly useful. In embodiments in which the female parent plants are CMS, pollen is harvested from male fertile plants and applied manually to the stigmas of CMS female parent plants, and the resulting F1 seed is harvested.
Varieties and lines described herein can be used to form single-cross tobacco F1 hybrids. In such embodiments, the plants of the parent varieties can be grown as substantially homogeneous adjoining populations to facilitate natural cross-pollination from the male parent plants to the female parent plants. The F1 seed formed on the female parent plants is selectively harvested by conventional means. One also can grow the two parent plant varieties in bulk and harvest a blend of F1 hybrid seed formed on the female parent and seed formed upon the male parent as the result of self-pollination. Alternatively, three-way crosses can be carried out wherein a single-cross F1 hybrid is used as a female parent and is crossed with a different male parent. As another alternative, double-cross hybrids can be created wherein the F1 progeny of two different single-crosses are themselves crossed. A population of mutant, non-naturally occurring or transgenic plants can be screened or selected for those members of the population that have a desired trait or phenotype. For example, a population of progeny of a single transformation event can be screened for those plants having a desired level of expression or activity of the polypeptide(s) encoded thereby. Physical and biochemical methods can be used to identify expression or activity levels. These include Southern analysis or PCR amplification for detection of a polynucleotide; Northern blots, S1 RNase protection, primer-extension, or RT-PCR amplification for detecting RNA transcripts; enzymatic assays for detecting enzyme or ribozyme activity of polypeptides and polynucleotides; and protein gel electrophoresis, Western blots, immunoprecipitation, and enzyme-linked immunoassays to detect polypeptides. Other techniques such as in situ hybridization, enzyme staining, and immunostaining and enzyme assays also can be used to detect the presence or expression or activity of polypeptides or polynucleotides.
Mutant, non-naturally occurring or transgenic plant cells and plants are described herein comprising one or more recombinant polynucleotides, one or more polynucleotide constructs, one or more double-stranded RNAs, one or more conjugates or one or more vectors/expression vectors.
Without limitation, the plants described herein may be modified for other purposes either before or after the expression or activity has been modulated according to the present invention. One or more of the following genetic modifications can be present in the mutant, non-naturally occurring or transgenic plants. In one embodiment, one or more genes that are involved in the conversion of nitrogenous metabolic intermediates are modified, resulting in plants or parts of plants (such as leaves) that, when cured, produce lower levels of at least one tobacco-specific nitrosamine than control plants. Non-limiting examples of genes that can be modified include genes encoding a nicotine demethylase, such as CYP82E4, CYP82E5 and CYP82E10, which participate in the conversion of nicotine to nornicotine and are described in WO2006091194, WO2008070274, WO2009064771 and PCT/US2011/021088 and as described in detail herein. In another embodiment, one or more genes that are involved in heavy metal uptake or heavy metal transport are modified, resulting in plants or parts of plants (such as leaves) having a lower heavy metal content than control plants or parts thereof without the modification(s). Non-limiting examples include genes in the family of multidrug resistance associated proteins, the family of cation diffusion facilitators (CDF), the family of Zrt-, Irt-like proteins (ZIP), the family of cation exchangers (CAX), the family of copper transporters (COPT), the family of heavy-metal P-type ATPases (for example, HMAs, as described in WO2009074325), the family of homologs of natural resistance-associated macrophage proteins (NRAMP), and the family of ATP-binding cassette (ABC) transporters (for example, MRPs, as described in WO2012/028309), which participate in transport of heavy metals, such as cadmium. The term heavy metal as used herein includes transition metals.
Examples of other modifications include herbicide tolerance. For example, glyphosate is an active ingredient of many broad spectrum herbicides. Glyphosate resistant transgenic plants have been developed by transferring the aroA gene (a glyphosate EPSP synthetase from Salmonella typhimurium and E. coli). Sulphonylurea resistant plants have been produced by transforming the mutant ALS (acetolactate synthetase) gene from Arabidopsis. QB protein of photosystem II from mutant Amaranthus hybridus has been transferred into plants to produce atrazine resistant transgenic plants; and bromoxynil resistant transgenic plants have been produced by incorporating the bxn gene from the bacterium Klebsiella pneumoniae. Another exemplary modification results in plants that are resistant to insects. Bacillus thuringiensis (Bt) toxins can provide an effective way of delaying the emergence of Bt-resistant pests, as recently illustrated in broccoli where pyramided cry1Ac and cry1C Bt genes controlled diamondback moths resistant to either single protein and significantly delayed the evolution of resistant insects. Another exemplary modification results in plants that are resistant to diseases caused by pathogens (for example, viruses, bacteria, fungi). Plants expressing the Xa21 gene (resistance to bacterial blight) with plants expressing both a Bt fusion gene and a chitinase gene (resistance to yellow stem borer and tolerance to sheath blight) have been engineered. Another exemplary modification results in altered reproductive capability, such as male sterility. Another exemplary modification results in plants that are tolerant to abiotic stress (for example, drought, temperature, salinity), and tolerant transgenic plants have been produced by transferring acyl glycerol phosphate enzyme from Arabidopsis; genes coding for mannitol dehydrogenase and sorbitol dehydrogenase, which are involved in synthesis of mannitol and sorbitol, improve drought resistance.
Other exemplary modifications can result in plants with improved storage proteins and oils, plants with enhanced photosynthetic efficiency, plants with prolonged shelf life, plants with enhanced carbohydrate content, and plants resistant to fungi; plants encoding an enzyme involved in the biosynthesis of alkaloids. Transgenic plants in which the expression of S-adenosyl-L-methionine (SAM) and/or cystathionine gamma-synthase (CGS) has been modulated are also contemplated.
One or more such traits may be introgressed into the mutant, non-naturally occurring or transgenic tobacco plants from another tobacco cultivar or may be directly transformed into it. The introgression of the trait(s) into the mutant, non-naturally occurring or transgenic tobacco plants of the invention may be achieved by any method of plant breeding known in the art, for example, pedigree breeding, backcrossing, doubled-haploid breeding, and the like (see, Wernsman, E. A., and Rufty, R. C. 1987. Chapter Seventeen. Tobacco. Pages 669-698 In: Cultivar Development. Crop Species. W. H. Fehr (ed.), MacMillan Publishing Co, Inc., New York, N.Y 761 pp.). Molecular biology-based techniques described above, in particular RFLP and microsatellite markers, can be used in such backcrosses to identify the progenies having the highest degree of genetic identity with the recurrent parent. This permits one to accelerate the production of tobacco varieties having at least 90%, preferably at least 95%, more preferably at least 99% genetic identity with the recurrent parent, yet more preferably genetically identical to the recurrent parent, and further comprising the trait(s) introgressed from the donor parent. Such determination of genetic identity can be based on molecular markers known in the art. The last backcross generation can be selfed to give pure breeding progeny for the nucleic acid(s) being transferred. The resulting plants generally have essentially all of the morphological and physiological characteristics of the mutant, non-naturally occurring or transgenic tobacco plants of the invention, in addition to the transferred trait(s) (for example, one or more single gene traits). The exact backcrossing protocol will depend on the trait being altered to determine an appropriate testing protocol. Although backcrossing methods are simplified when the trait being transferred is a dominant allele, a recessive allele may also be transferred. In this instance, it may be necessary to introduce a test of the progeny to determine if the desired trait has been successfully transferred.
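To put the 90%, 95% and 99% genetic identity thresholds in context, the following sketch computes the expected proportion of recurrent-parent genome after a given number of backcrosses. It uses the standard textbook expectation for unselected backcrossing, 1 - (1/2)^(n+1) after n backcrosses, which is an assumption introduced here and not a figure from the specification; marker-assisted selection as described above is intended to reach a given threshold in fewer generations than this average. The function names are hypothetical.

```python
def expected_recurrent_fraction(backcrosses: int) -> float:
    """Expected recurrent-parent genome fraction after `backcrosses` backcrosses.

    backcrosses = 0 corresponds to the F1 (expected fraction 0.5).
    Assumes no selection on the genetic background (illustrative only).
    """
    if backcrosses < 0:
        raise ValueError("backcrosses must be non-negative")
    return 1.0 - 0.5 ** (backcrosses + 1)


def backcrosses_needed(threshold: float) -> int:
    """Smallest number of backcrosses whose expected fraction meets `threshold`."""
    if not 0.0 <= threshold < 1.0:
        raise ValueError("threshold must be in [0, 1)")
    n = 0
    while expected_recurrent_fraction(n) < threshold:
        n += 1
    return n


if __name__ == "__main__":
    for threshold in (0.90, 0.95, 0.99):
        n = backcrosses_needed(threshold)
        print(f"{threshold:.0%} identity: {n} backcrosses "
              f"(expected {expected_recurrent_fraction(n):.4f})")
```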
Various embodiments provide mutant plants, non-naturally occurring plants or transgenic plants, as well as biomass in which the expression level of a polynucleotide (or any combination thereof as described herein) is modulated to modulate the protease activity therein.
Parts of such plants, particularly tobacco plants, and more particularly the leaf lamina and midrib of tobacco plants, can be incorporated into or used in making various consumable products including but not limited to aerosol forming materials, aerosol forming devices, smoking articles, smokable articles, smokeless products, and tobacco products. Examples of aerosol forming materials include but are not limited to tobacco compositions, tobaccos, tobacco extract, cut tobacco, cut filler, cured tobacco, expanded tobacco, homogenized tobacco, reconstituted tobacco, and pipe tobaccos. Smoking articles and smokable articles are types of aerosol forming devices. Examples of smoking articles or smokable articles include but are not limited to cigarettes, cigarillos, and cigars. Examples of smokeless products comprise chewing tobaccos, and snuffs. In certain aerosol forming devices, rather than combustion, a tobacco composition or another aerosol forming material is heated by one or more electrical heating elements to produce an aerosol. In another type of heated aerosol forming device, an aerosol is produced by the transfer of heat from a combustible fuel element or heat source to a physically separate aerosol forming material, which may be located within, around or downstream of the heat source. Smokeless tobacco products and various tobacco-containing aerosol forming materials may contain tobacco in any form, including as dried particles, shreds, granules, powders, or a slurry, deposited on, mixed in, surrounded by, or otherwise combined with other ingredients in any format, such as flakes, films, tabs, foams, or beads. As used herein, the term 'smoke' is used to describe a type of aerosol that is produced by smoking articles, such as cigarettes, or by combusting an aerosol forming material.
In one embodiment, there is also provided cured plant material from the mutant, transgenic and non-naturally occurring tobacco plants described herein. Processes of curing green tobacco leaves are known to those skilled in the art and include without limitation air-curing, fire-curing, flue-curing and sun-curing as described herein.
In another embodiment, there is described tobacco products including tobacco-containing aerosol forming materials comprising plant material - such as leaves, preferably cured leaves - from the mutant tobacco plants, transgenic tobacco plants or non-naturally occurring tobacco plants described herein. The tobacco products described herein can be a blended tobacco product which may further comprise unmodified tobacco.
The mutant, non-naturally occurring or transgenic plants may have other uses in, for example, agriculture. For example, mutant, non-naturally occurring or transgenic plants described herein can be used to make animal feed and human food products.
The invention also provides methods for producing seeds comprising cultivating the mutant plant, non-naturally occurring plant, or transgenic plant described herein, and collecting seeds from the cultivated plants. Seeds from plants described herein can be conditioned and bagged in packaging material by means known in the art to form an article of manufacture. Packaging materials such as paper and cloth are well known in the art. A package of seed can have a label, for example, a tag or label secured to the packaging material or a label printed on the package, that describes the nature of the seeds therein.
Compositions, methods and kits for genotyping plants for identification, selection, or breeding can comprise a means of detecting the presence of a polynucleotide (or any combination thereof as described herein) in a sample of polynucleotide. Accordingly, a composition is described comprising one or more primers for specifically amplifying at least a portion of one or more of the polynucleotides and optionally one or more probes and optionally one or more reagents for conducting the amplification or detection.
Accordingly, gene specific oligonucleotide primers or probes comprising about 10 or more contiguous nucleotides corresponding to the polynucleotide(s) described herein are disclosed. Said primers or probes may comprise or consist of about 15, 20, 25, 30, 40, 45 or 50 or more contiguous nucleotides that hybridise (for example, specifically hybridise) to the polynucleotide(s) described herein. In some embodiments, the primers or probes may comprise or consist of about 10 to 50 contiguous nucleotides, about 10 to 40 contiguous nucleotides, about 10 to 30 contiguous nucleotides or about 15 to 30 contiguous nucleotides that may be used in sequence-dependent methods of gene identification (for example, Southern hybridization) or isolation (for example, in situ hybridization of bacterial colonies or bacteriophage plaques) or gene detection (for example, as one or more amplification primers in nucleic acid amplification or detection). The one or more specific primers or probes can be designed and used to amplify or detect a part or all of the polynucleotide(s). By way of specific example, two primers may be used in a polymerase chain reaction protocol to amplify a nucleic acid fragment encoding a nucleic acid - such as DNA or RNA. The polymerase chain reaction may also be performed using one primer that is derived from a nucleic acid sequence and a second primer that hybridises to the sequence upstream or downstream of the nucleic acid sequence - such as a promoter sequence, the 3' end of the mRNA precursor or a sequence derived from a vector. Examples of thermal and isothermal techniques useful for in vitro amplification of polynucleotides are well known in the art. The sample may be or may be derived from a plant, a plant cell or plant material or a tobacco product made or derived from the plant, the plant cell or the plant material as described herein.
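The two-primer amplification described above can be illustrated in silico. The sketch below is a simplified model only: it assumes exact, full-length primer matches (real primers hybridise with some tolerance), enforces the "about 15 to 30 contiguous nucleotides" range as a default, and uses hypothetical function names and a toy template rather than any sequence or protocol from the specification.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_amplicon(template: str, forward: str, reverse: str,
                  min_len: int = 15, max_len: int = 30):
    """Return the region a primer pair would amplify, or None if no product.

    `forward` must match the template strand as written; `reverse` must match
    the opposite strand downstream of the forward primer. Exact matching is
    an illustrative simplification.
    """
    for primer in (forward, reverse):
        if not min_len <= len(primer) <= max_len:
            raise ValueError(f"primer length must be {min_len} to {max_len} nt")
    template = template.upper()
    start = template.find(forward.upper())
    if start < 0:
        return None
    # The reverse primer binds where its reverse complement occurs on the template.
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end < 0:
        return None
    return template[start:end + len(site)]


if __name__ == "__main__":
    toy_template = "ATGGCTCTTCGTTTC" + "TCTTTAATTTTCCTA" + "TTTTCTCTTTTCTTA"
    fwd = toy_template[:15]
    rev = reverse_complement(toy_template[-15:])
    product = find_amplicon(toy_template, fwd, rev)
    print(len(product) if product else "no product")
```

A `None` result corresponds to the absence of an amplification product, for example when a plant lacks the target polynucleotide.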
In a further aspect, there is also provided a method of detecting a polynucleotide(s) described herein (or any combination thereof as described herein) in a sample comprising the steps of: (a) providing a sample comprising, or suspected of comprising, a polynucleotide; (b) contacting said sample with one or more primers or one or more probes for specifically detecting at least a portion of the polynucleotide(s); and (c) detecting the presence of an amplification product, wherein the presence of an amplification product is indicative of the presence of the polynucleotide(s) in the sample. In a further aspect, there is also provided the use of one or more primers or probes for specifically detecting at least a portion of the polynucleotide(s). Kits for detecting at least a portion of the polynucleotide(s) are also provided which comprise one or more primers or probes for specifically detecting at least a portion of the polynucleotide(s). The kit may comprise reagents for polynucleotide amplification - such as PCR - or reagents for probe hybridization-detection technology - such as Southern Blots, Northern Blots, in-situ hybridization, or microarray. The kit may comprise reagents for antibody binding-detection technology such as Western Blots, ELISAs, SELDI mass spectrometry or test strips. The kit may comprise reagents for DNA sequencing. The kit may comprise reagents and instructions for determining at least the protease content. Suitably, the kit comprises reagents and instructions for determining at least protease content in plant material, cured plant material or cured leaves.
In some embodiments, a kit may comprise instructions for one or more of the methods described. The kits described may be useful for genetic identity determination, phylogenetic studies, genotyping, haplotyping, pedigree analysis or plant breeding, particularly with co-dominant scoring.
The present invention also provides a method of genotyping a plant, a plant cell or plant material comprising a polynucleotide as described herein. Genotyping provides a means of distinguishing homologs of a chromosome pair and can be used to differentiate segregants in a plant population. Molecular marker methods can be used for phylogenetic studies, characterizing genetic relationships among crop varieties, identifying crosses or somatic hybrids, localizing chromosomal segments affecting monogenic traits, map based cloning, and the study of quantitative inheritance. The specific method of genotyping may employ any number of molecular marker analytic techniques including amplification fragment length polymorphisms (AFLPs). AFLPs are the product of allelic differences between amplification fragments caused by nucleotide sequence variability. Thus, the present invention further provides a means to follow segregation of one or more genes or nucleic acids as well as chromosomal sequences genetically linked to these genes or nucleic acids using such techniques as AFLP analysis.
In one embodiment, there is also provided cured plant material from the mutant, transgenic and non-naturally occurring plants described herein. For example, processes of curing tobacco leaves are known to those skilled in the field and include without limitation air-curing, fire-curing, flue-curing and sun-curing.
In another embodiment, there is described tobacco products including tobacco products comprising plant material - such as leaves, suitably cured plant material - such as cured leaves - from the mutant, transgenic and non-naturally occurring plants described herein or which are produced by the methods described herein. The tobacco products described herein may further comprise unmodified tobacco.
In another embodiment, there is described tobacco products comprising plant material, preferably leaves - such as cured leaves, from the mutant, transgenic and non-naturally occurring plants described herein. For example, the plant material may be added to the inside or outside of the tobacco product and so upon burning a desirable aroma is released. The tobacco product according to this embodiment may even be an unmodified tobacco or a modified tobacco. The tobacco product according to this embodiment may even be derived from a mutant, transgenic or non-naturally occurring plant which has modifications in one or more genes other than the genes disclosed herein.
The invention is further described in the Examples below, which are provided to describe the invention in further detail. These examples, which set forth a preferred mode presently contemplated for carrying out the invention, are intended to illustrate and not to limit the invention.
EXAMPLES
The following examples are provided as an illustration and not as a limitation. Unless otherwise indicated, the present invention employs conventional techniques and methods of molecular biology, plant biology, bioinformatics, and plant breeding.
Example 1
A time point 48 h after the start of curing was selected to screen for curing-activated genes based on Affymetrix data, essentially as described by Martin et al. (2012), BMC Genomics, 13:674. In brief, exon candidates from genomic DNA and from EST contigs were joined and the genomic candidates were cleaned for redundancies (98% threshold). This resulted in a set of 312,053 exon candidates, 12,925 of which were represented by ESTs but were not included in the genome assembly. Data sets were verified as described by the manufacturer (Affymetrix). In addition, quality checks included probe-level models, Normalized Unscaled Standard Error (NUSE) and Relative Log Expression (RLE) plots, and the analysis of DABG results as described by the manufacturer.
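For orientation, the Relative Log Expression (RLE) quality check mentioned above can be sketched as follows. This is a minimal illustration of the general idea (each probeset's log2 expression is centred on its median across arrays, and arrays whose RLE values are not centred near zero are flagged for inspection); it is not the Affymetrix or Bioconductor implementation, and the data layout and function names are assumptions made for the example.

```python
from statistics import median


def relative_log_expression(log_expr):
    """Centre each probeset's log2 values on its median across arrays.

    `log_expr` maps a probeset identifier to a list of log2 expression
    values, one per array, in a fixed array order (hypothetical layout).
    """
    rle = {}
    for probeset, values in log_expr.items():
        centre = median(values)
        rle[probeset] = [value - centre for value in values]
    return rle


def array_medians(rle):
    """Median RLE per array; values far from zero suggest a problematic array."""
    n_arrays = len(next(iter(rle.values())))
    return [median(values[i] for values in rle.values()) for i in range(n_arrays)]


if __name__ == "__main__":
    toy = {
        "probeset_a": [8.0, 8.0, 10.0],
        "probeset_b": [5.0, 5.0, 7.0],
        "probeset_c": [3.0, 3.0, 5.0],
    }
    print(array_medians(relative_log_expression(toy)))
```

In the toy data the third array is shifted upwards for every probeset, which is the kind of systematic offset an RLE plot is meant to reveal.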
As the exon array design had no mismatch probes, summarization was performed using the Robust Multi-array Average (RMA) method. A total of 272,342 probeset expression values were generated, and DABG P-values were computed to assess the significance of the signal obtained for each probeset. This involved the background probes that are spread over the chip. These random probes have a varying GC content. Quality checks involved a combination of Affymetrix Power Tools (APT) and Bioconductor packages, for which the Tobacco Exon Array (TobArray520623F) cdf environment was created. Once the expression values were available, differential gene expression analysis was performed using moderated t-statistics in linear models (LIMMA).
Example 2
Differential expression. The tissue samples were sequenced using RNA-seq; reads were mapped to the genomes of the 3 varieties using Tophat2. Previously published gene models were used as the basis for the differential gene expression analysis. Expression changes during curing were calculated using the Cuffdiff2 software based on the mapped reads. Genes were considered up-regulated if their expression levels increased significantly during the first 48h of curing, and were not considered up-regulated if the change was not significant or if expression decreased. Tobacco proteins were identified by a BLAST search against a database of transcripts for the 3 varieties and equivalent genes in the 3 varieties were identified by a mutual best BLAST hit search of the transcripts of the 3 varieties Burley, Virginia and Oriental (e-value cutoff 1e-80).
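The mutual best BLAST hit criterion used to pair equivalent genes between varieties can be sketched as follows. The example assumes that hits have already been parsed into (query, subject, e-value) tuples and that the best hit is the one with the lowest e-value; both are assumptions made for illustration (BLAST output is often ranked by bit score), and the identifiers and function names are hypothetical. Only the 1e-80 cutoff is taken from the text.

```python
def best_hits(hits, max_evalue=1e-80):
    """Map each query to its best subject among hits passing the e-value cutoff.

    `hits` is an iterable of (query, subject, evalue) tuples. The lowest
    e-value wins; the first such hit is kept on ties.
    """
    best = {}
    for query, subject, evalue in hits:
        if evalue > max_evalue:
            continue
        if query not in best or evalue < best[query][1]:
            best[query] = (subject, evalue)
    return {query: subject for query, (subject, _) in best.items()}


def mutual_best_hits(a_vs_b, b_vs_a, max_evalue=1e-80):
    """Return sorted (a, b) pairs that are each other's best hit."""
    forward = best_hits(a_vs_b, max_evalue)
    backward = best_hits(b_vs_a, max_evalue)
    return sorted((a, b) for a, b in forward.items() if backward.get(b) == a)


if __name__ == "__main__":
    burley_vs_virginia = [
        ("burley_1", "virginia_1", 1e-120),
        ("burley_1", "virginia_2", 1e-90),
        ("burley_2", "virginia_2", 1e-50),  # fails the 1e-80 cutoff
    ]
    virginia_vs_burley = [
        ("virginia_1", "burley_1", 1e-110),
        ("virginia_2", "burley_1", 1e-95),
    ]
    print(mutual_best_hits(burley_vs_virginia, virginia_vs_burley))
```

Extending the pairing to three varieties would amount to applying the same test to each pair of varieties.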
The data (Figure 2) shows the number of senescence-activated genes in the 3 cured varieties.
Example 3
The protease genes identified in Example 2 were analysed for membership of known protease families. The results are set forth in Table 1.
The 80 curing-activated protease genes were found to belong to 21 different protease families. In the table, AC, air-cured; FC, flue-cured; SC, sun-cured. AC+FC+SC, up-regulated in all three types of tobacco; AC+FC, up-regulated in air-cured and flue-cured tobacco; AC+SC, up-regulated in air-cured and sun-cured tobacco; FC+SC, up-regulated in flue-cured and sun-cured tobacco; AC, FC and SC, up-regulated only in the respective tobacco type.
Protease coding genes AC+FC+SC AC+FC AC+SC FC+SC AC FC SC
Alpha/beta-Hydrolases superfamily protein 1
Aspartic proteinase A1 (APA1) 2 1 2 1 *
CLP protease/crotonase family protein 1 1
Cysteine proteinases superfamily protein 3 2 3 1 2 1 2
DegP protease 3 - - - - - 1 -
Eukaryotic aspartyl protease family protein 4 1 1 1 3 2 2
FTSH protease 8 - - - - - - 1
Gamma-glutamyl transpeptidase 4 1
Heat shock protein 101 1 1 - - 1 1 -
Lon protease 1 & 3 - - - - - 3 -
Metallopeptidase M24 family protein 1
Papain family cysteine protease 1
Peptidase M20/M25/M40 family protein 1 1
Protease-related - - - - 1 2 -
SAG 12 - 1 - - - - -
Serine carboxypeptidase-like 1 1 1 1 1 1
SERPIN - 1 1 - - 1 -
Signal peptide peptidase - - - - 1 - -
SITE-1 protease - 1 - - - -
Subtilisin-like ser endopeptidase fam prot. 3 1 - 1 3 -
Ubiquitin-specific proteases 1 3 2
Total 16 12 6 3 17 19 7
Table 1.
Example 4
APA1 is encoded by a single gene in Arabidopsis thaliana and by four genes in tomato. The gene activated in flue-cured Virginia tobacco (see Table 1) is close to APA1-Tomato-1. Two gene copies from both ancestors N. sylvestris (S) and N. tomentosiformis (T) exist in N. tabacum. Affymetrix data confirmed the activation of the S form (upper panel) and apparently not the T form during Virginia flue-curing (lower panel).
Example 5
Table 2 illustrates the differential up-regulation of SEQ ID NO:1 to 80 in the three tobacco types air-cured Burley (AC), flue-cured Virginia (FC) and sun-cured Oriental (SC).
Table 2
SEQ ID NO: AC FC SC AC-FC AC-SC FC-SC AC-FC-SC
1 X
2 X
3 X
4 X
5 X
6 X
7 X
8 X
9 X
10 X
11 X X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
SEQUENCE LISTING
SEQ ID NO: 1
ATGGCTCTTCGTTTCTCTTTAATTTTCCTATTTTCTCTTTTCTTAACGACGTCGTTATTG TT
GTCCGTTAACGGCAACATTAACGGCGGTGAAGATGACGATATTTTGATCCGTCAAGT CG
TAGGCGACGACGACGATCACTTGTTAAACGCCGATCATCACTTCACGATTTTTAAGA GG
AGGTTCGGCAAAACCTACGCGTCCGATGAGGAGCATCATTACAGATTCTCGGTGTTC AA
GGCTAACTTGCGCCGTGCAATGCGCCACCAGAAGCTTGATCCCTCCGCCGTTCACGG T
GTGACTCAGTTTTCCGATTTGACTCCGGCCGAGTTCCGCCGGAATTTTCTAGGAGTT AA
CCGTCGGCTCCGGCTTCCTTCTGATGCCAATAAAGCTCCTATTCTTCCTACTGAGGA TC
TCCCTTCAGGTTTCGATTGGAGAGATCACGGTGCCGTCACGTCAGTAAAGAATCAGG TA
CTAGTATATATCAATGTTTGTGTAAAGTTTATCTTTTTTTGGATAGGCGAAGTGTTC GTCA
TTAATGAATAATTACATAATTTCTATTTGTATCGATTGAAAAACTAGGGTTCATGTG GCTC
GTGCTGGTCATTTAGTACCACTGGTGCGTTAGAAGGTGCCACCTATCTTTCTACAGG GA
AGCTTGTAAGCCTCAGCGAGCAACAACTTGTGGACTGTGATCACGAGGTTTGACGTT CT
TCCTCTTTATCTTAGCTTAAAATCATGAATATATTGTCAATAGAGTTACTGTTTTTC TTTTT
TCTTTTTTTCTGGGACGTTTGAATGTGTAAAATAATTTTCGCTGTGGTGTGTCACAG GAT
TTG GTC CATAG CTGTC ATCTTTTTCTAGTTAAAGAAAATTGATAGCGTG AAG GACACTAA
CCGCATAAATTAAAGTGCTTTCTCTGATTCCGTCTCACTTTAAAGTTTAAGAACCCG TTT
GGCCATGAAATTTCTTTTTTTTTTCCGTAAAATTTAACTTTTCTTCTAAATCAATGT TTGGC
CATCAAATTTTTTATTTTCACTTGAAGATAATTTTACAATTTTTCAAAAATTTGAAA AACTT
CAAAAACTGTTTTTCAAAATTTTGAATATTGTTGTTGATGTAAAAAACAGACACTAA TTTA
TAAGAGTAATCTCCTCTTCTTTGTTTGGTGGATGGCCAGGGGTGGGGACTGGGGACC C
ATCTTAAG GG AGCG G AG G AAAAGTTGTTTTATTATTAGTTTATG G CTG GTTATG AAATTC
AACTAATTGATACTCTGAGGATAACCACGGACAAAATTGTTTGGATGATGAGGAAAT CG
CATCCAAAAATTGTCTGCATCTGAATATACTTTTAACATTACTTGAAGTTTCAAGTT TAAG
CTCGTGTATGCAACGTGGTGGGAGATGTACAAGGATAAATAGAAAGGCGTTGAGTTA TT
GAGATAGGTTTGTAAAACTCTTCTTAAATTTTCCATTGTTTGATTGCCATTATATAA TCAT
TTGTATAATTTCCAACTTGGAAAAAGCTGTTCAAACTCAAAATAAGGTTTAGGCTTG AAC
TTATTGCTATTTACGGTGTCTGCCATTTTATAATCAGAAATGGGATTGAATACAGAG TTA
ATAAGACCACTGACTCGCCTTATTTACCTCACTCGTCTCAGATGAATTTTATACTTC CAA
ATTTCAGTGTTCCCCATCTCCCTGAAAAATGTATAATTTGGCCTTGCATTTATCTGC AGT
GTGATCCAGAAGAAAAAGATTCATGTGACGCAGGGTGCAATGGTGGCCTAATGAATA G
TGCCTTTGAATACACTCTGAAAGCTGGTGGACTTATGCGAGAAGAAGATTATCCATA CA
CTGGCACCGATCGTGGAACCTGCAAATTTGACAACACCAAGGTTGCTGCTAAAGTTG CT
AACTTTAGCGTTGTCTCCCTTGACGAAGAACAAATCGCTGCTAATCTTGTCAAGAAT GG
TCCTCTCGCTGGTAAATAGTCTCTCAAAACACTTTTCAATTTGCCTATCATTATGCT TCTT
CTTTGTCCTTACTTGATATTGTCAAAGTATATACTTGGATTGTCATATTTATGCACT GGAA
TGTAAAAGGTATTTACACAATTAAGTCACTTATTAGGTAATTACAAGTAACTATTTT GATA
AGTTTTAATTAGTAATGTGTTAAAATGATAATTAACTTGCTATTTAAATTCACTGAT AGCC
GTAACAAAATCTTTTAACTATTAATATATATAATATAAATATTTGTTTTTTAATAAA CAACA
AATATTATTTGTGAAAGATCCAGTTATGTAGCTTGAAACTACATTTTGGGATTTTGA ATTA
TGTACTACTCTTCTTATGCTAATGGTTTTCAATTTTTCACTGATGTAAACTTCTGAA AGCA
TTTTTGTTGCTTGGCTTGCAGTGGCGATCAATGCAGTGTTCATGCAGACATACGTTG GC
GGAGTTTCCTGCCCATATATATGCTCTAAGAAGTTGGATCATGGTGTCTTATTAGTT GGT
TATGGTACTGGCTTTTCTCCCATTAGAATGAAAGAGAAACCATACTGGATCATCAAG AAC
TCATGGGGAGAGAAATGGGGTGAAAACGGATACTACAAAATCTGTAGAGGCCGCAAT G
TTTGCGGAGTGGATTCAATGGTTTCAACAGTTTCAGCTGTTAGTACCAGCTCACAC
SEQ ID NO: 2
TTAAGCTGCTTCAGCAAATCCAACTCTGAGTTTGCCATAATCGAAGACTGTGTGATATC
GACCCATGAAAACATCACCCAAGATCCTGTAACCAAAGGAATACCATAGAGAACTCA GT
G AACAAAAG AACTG CAG G CTCAG GTTTAATTGTGCTGTAGCTCTATAGTTCG G ATTAAA
CTTATATTTGGATTAACTGCATTGCTGATATTTATCTCTAAAACATAATTATAAACT AAAAT
AGAGAGAACATATAAAGATAACTTTACCAGAGTGGTCCGCGGGGAGGAGGAATGTCC A
AGCCAGTGAAACCACTAATACACTGTGCCTTAGCACCCTCGCCCACCTTGAGTATGT AC
TGATCACGGGAAAAGAATCCAAAATTAGAACATAGATCAATCTAGGTCAGCAATCTA AA
CAGACAACTGAAACAAGTAAAAGGGAACCACACATCGGATTACAACTTCACTCTTTC AA
TCTTG AAAAAATTTGTTG AAAG G GTG G G AAAAGACTAGAGTGATAGTCTAGTAG AG AAA
AGTTTTG CATATG GTCAAG GG GTTTG GTGTATCACTTG G ATTTTTTC CTTTGTAAGATGT
GGTCTATCCTGAATTATTCAAAGCTCAACCTCTTTATGTTACTAAACCACAAAACAA ACA
AATTCAGAAAAAATGCAAATGATCAAATTGATTTGGTGTACACTTGATGAATTTCTT CTTT
GTAAGATTTGTTCTATACTGAATCCTTCAGACATAAAAAAAAATATTTTTTTTTTGG GGGG
N CG G CCTGAATTAG CAAG GTCAG CAAGTAATACACTTCCATAAAAATAG CAAAGG GTAA
CTTTTTCACGGCACAAAGATCTTATGCAGGTTTTCTTAGATTACTTAGCTGGAAAAT GAG
ACATCTAAATTTAAGTAAAGTCGAAATACTCACAACCTCAAAATATAGAAGTACTTC TTGA
TGACAACAAACATCTACTTCTCTGTAGAAACTGAAAACCTTAAACACTAGAATCGGT TTT
GTAATATGACAATTAGTTGTAATGCCACAAAAGGACTCTATGATGAGCCACTTAATT TTT
TCTCTCTTTGACAATGTTGAATTAGAAGAGGAATAGCAATGTTTATTACTGTCAAAG ACC
ATTATAAAGCATACCTCCTTCGGGACGAGGTCAAAAACTTTGCCACCAATTGTGAAA GA
GACTGTAGGCATTGAAGAAAGCTTTCCACAGTCAACAGCTGATTCCCCCAATGGGCT TG
G GAG ACG CTCG CAAAG CTGTC AGTCAGTCAG CAATCTTTTCAAG AAAAG AAG AAAATTG
CAGTGACAGATGTTTTACCTCATTCACATAGTTTAATATGCGATCTTGAGTCTGGTT TTG
TCTCAGTTGATTCTCCATCCATATGACCGCCATTTCACAAGCAGAGCACATACCATC CT
G CAGTCCTGTG GATCTG CCAGCTTTCTC GTCTACAACACTCTCAATTCC CATACTG CAG
AAAAAAGGCCACAGAATTATTCAGTATTTATATCAACATTATGGATTAACCAAATGC ACT
ATATGTTCACATGAAGGGGCAAAGAGAGCCTGAAGACTAACCTAACTCCGCGGTTTC CA
TCGAAAGTGCATACTCCAACCTGTGAGCAAATCTTCTTTGGATGTGCCTGTTTAAAT ACT
GACCAATTAGATAACCGGGAAAGGCAACTAGATTGCCAAGTGTCATTTTGCTGTAAC TG
CACAGGAAACTGCATATCAAACAAATGAAAATGCAGTTACACAGTTGAATGCTCACC TC
TGCTAACAGCAAATCCATGATTGTCTGCCCGTACTGCTCCACTACAGATTTGCATTG TT
GGCTAGCAACTCCAGAGGCTCCAATGGCTTGATTAATCATAGTGATTATGGTCTGTA CG
GAAAGGGGTGGGTTTAAGATTGCTCAACCTTGGAAGTGTTTTAATCGTACAATTGTA GA
G AC AAAAG G CAG C AG ATTTTTACTTAATTTATATTGTC AAC ATTTC C AAG C C AAC AG GAT
AAAACTTGG CTACAGTTTTCG G GTTG G ATAATTTTCTTTTCAAATAG AAG AGG G GTAAAT
AAATAAGTCGACAGAAGACCAGGACTACAGCAGAAGTAAAAGCATCATCCTCATTGA AA
C GTAAT AAAAG C AAGTAAC AC AAAAC AAC AAGTAC CTACTG AG G CAG CTTTAAAC ATATT
AAACTG AAAG AC AG G G AAG AAAAAG C AG ATTTAC AG ACTTC G G C C C AGTG ATAAG CTAA
GATGGTATATCCAAAGGTAACCTCAGAAATGAAAAACCAATTTCACTACCATCCTCT CTG
TGATGAAAAATTAAAACACAACACAGATCAGATGATGGATTCGTGCTATTTAACTCA TGA
ATCTTAG G AAAATGTTACTTTTCTTGCTG AGCTGTTG AAG GTT CAAAG GAACAAG GAAAT
CAATAATCGAATTGCGGTTGACTTTGATGATGCAAACAAATAACAAAAACATAACAA ACA
AGCGATATGTCCCAAATCAAGGCTATAGATAATACCGTTGGACCAGCCAAGAGAGAA GT
CCCTGAATCCGCTATTGCAGAGCACCCACTTTCACAGTAACCTACACCCGATATGTT CA
TGTTAAAAACTCAAAGGAAGGGAAAATTCTATATCCAGGCACATAGCCTTCATCTAT ATT
CCCGAAATTCGGCAATCCAATTCAAAAGGTACATAGCAAAACATACCAGTAGCTTTA CC
CTCGATAAGAACATCACCCATATCAAACTGCCAATCATATTACTAAGATCAAGATAG CAT
TTGTACAAAAAATGAACATACATAGTATCGAATTGACCGAATGACAAACCTGCCAAT AAC
CTTTGTGTGTGACTGGGACATAAGTGATTTCTCCCTTATAGTGATTAGGATCAACCC CA CCAAACACGATTTCTCCGCCTTGTTCTTCCTCTGTATTTCGGTTGAGCCAAAATGAGAA
GACAGGATCCTTGATAAGACCCTGTTGGACCATGTTGTACCTGGAAAAGACAGGAGA T
GCTGCCCAGATGAATGTCAAATCAAATTTAAACAGAAAGAGACATCCAGCCTATCCT GC
ATTTATGGAAATCTAATCCTTCAATGTGTTAAACCTCTTCTGGAAAGGAAATTGTCT AGA
GCTTTAATTTGGTTTGTGGGAAAGAAATAGAGCAAACTAAATACCGCCCACGTACCA AA
CTGGAACAGCATTGCCAACTGAAATCTCCTGGAATCCAAGACCCAATATACCGTCAA AC
TTGGCTACCAAAAATGTCACGCTGGGTTCTCTGGTTGCCTCAATAAATTCCTAGTAC AT
GAACACCTTGAGATATAAGATTTCCACTTTCAAGAGATTTAAAACAAGTGAGGAGCC TCA
CTAACCTGATCTGTTACAACAAGGTCACCAACTTTGACGTTGTCTTGACTGAAGAAT CCA
GAAATAGCTCCACTACCATACTGAATTGCAGCAGACTTCCCTGTATGGCAAATCAAA AA
TTTATCACGAACTAAATCACATTAAATTACAATGCCAAATACGATCTCAGTCTTGTG GAA
ACATTCGATAAGATCTTAATGTTGTTCATTAAGGTAGGAGTGACCTAGTGTTCTTAA AAG
CAAAACGTGCAAAAAAATAAAACAAGGTCCACGGACTTGCATTGAATTGCGAGAAGT GA
AG C G C AAAATTAAC AG G AAAC AAG AAAATATC G AAC ATTTATG AATTTACTCTAC C ATAA
AAATTAAACTGCAGGTTAAATAACTAATTTCGGCATCCAATATACAATAATCCCAGT ATTA
ATTCAACTCCTCAAAATTGAGATTCAAAGAAGCAACCAATTCTAGTTGGAATCACTT TGT
GCACCATTATTTGAAGCGCAACTTCTCTAAAGCGCATGGCTTCAGCAATGAAGTGAT AG
CCCTTGCTGCATCGCTTCATAACTTTAAACGACCAAGCAATGGCTTTCAATAACACT GG
AGTGAACTCACCTGGCAAACCCATTCAACTGCTATAGACTGTTCAATCCATTTTCTT TGA
GCAACATATATAATTGTAATAGAACAAAAAATAAAGAATAACTAGTTCTCTGCGGAA AAT
TTCTTATGTCACAGACCTACATGGATACAAACCGAGTTAATAGGGAAGAAGAAAGAC CA
TCTAAAAAG G CATTG CATAG GTTAAG ACTTAAGACTATAC AAG GTG CAACG AAAAGG CA
CTAATCGCAGAGAGATATAAGGATATTGATGTTTCTTTTCCAAAACCTCACTAGTTA CAG
TAATATACTAAGAAACACAACATAAACATTAAACAGCCTCGTTTTATGTCTTAACAG TCAA
CTACATGTACTCGTCAATTAACCTTTCCAAGGGAATCCCTTGATGCACCGTGAGAAA CA
CATAAGGACAATACAAAAGATGTTCCATAATGAACAAGATGGCACGTATTCTAAACA ATA
ACAG G CATTAG AAGG AAG CATATGTTTCATGC AG CAATAAACAAG CAAATGGTAGAG AG
AAACAATTTGCATCAACATACAGAAATGGAAACATAACATACCATTCTTCTTATAAG TACT
TG ATTCG CTTG ATTTG AACTTG G AATGAAAG AAACAG G GAACCTACACCAAG ATAG GCA
GTCATCAAATTTTACATCACTCAAGATGGATGTACAATGCTATGCTTTGTATCATTT GCAT
GTATAG AAG CTTAC AG AG AAATAG C ACTTC G AC G AC G G C AC C C AC AAATTC G AG CTAC C
AGTGTCAAAGATTACAGTGAACTTCTGAGGTGGAGTGCCTACACCAATCTCCCCAAA AT
ATTGAGCATCCATATAGTTCTTCAGTGCTACAATGTCTGTATCCTCAGAGTCCCCGA GTT
TACCACGGAAGTTATACTTCCTAATAGACGCCCTCAAAACGTCCCCTTCCTTTGACT CAA
TGCGTGCAGCAAGCCGGTTATTTTGATCAAATTTCATTTTTTTCAAGCCAATTCTCA TCA
AGCCATCATTGGATGAGGAGGCCAAAGGAAAGAGCAGTGCTGAGAGAAACAGGGCAA
CAAGAAATACTTTTGCTCCCAT
SEQ 3
ATGGGTTCTTTCCTCTGTTTCTCCGTCATTGTTGTTCTCCTTGTTCTTCAGCCATGTTTA
GCCAAGAAAGTTTACATTGTTCACATGAAAAATCACCAAATACCTTCTTCTTTTGCT ACC
CATCACGATTGGTACAATGCTCAGCTCCAATCTTTGTCCTCTTCTTCTACCTCTGAT GAA
TCATCCCTTCTTTACTCTTACGACACTGCTTATTCTGGCTTTGCTGCTTCTCTTGAC CCA
CATGAAGCTGAACTACTCCGTCAATCTGATGATGTTGTTGGAGTTTACGAGGATACT GT
TTATACACTCCATACAACAAGGACTCCTGAGTTTCTGGGGTTGAATAATGAGCTCGG CC
TTTGGGCTGGTCACAGTCCACAGGAACTCAACAACGCTGCTCAGGATGTTGTTATCG G
AGTTCTTGACACCGGCGTTTGGCCGGAGTCGAAGAGCTATAACGATTTCGGTATGCC C
GATGTGCCGTCGAGGTGGAAGGGTGAATGTGAATCGGGTTCCGATTTCGATCCGAAA G
TACATTG CAACAAAAAG CTG ATAG GTG CTCGTTTTTTCTCCAAAG GTTATCAAATGTC GG CCTCTGGCTCGTTCACGAACCAACCTAGACAGCCGGAGTCACCTCGTGACCAAGACGG
TCATGGCACCCACACATCCAGCACCGCCGCTGGTGCACCTGTGGCGAACGCTAGCCT T
CTCGGGTACGCTAGTGGGGTCGCGCGTGGTATGGCACCTCGAGCGCGTGTAGCTACG
TACAAGGTATGCTGGCCTACTGGTTGTTTTGGTTCTGATATTCTAGCTGGTATGGAA CG
TGCTATTTTAGATGGAGTTGATGTACTTTCATTATCTTTGGGTGGTGGATCGGGTCC TTA
TTATCGTGATACAATTGCTATTGGTGCTTTCTCTGCTATGGAAAAAGGAATTGTTGT TTC
CTGTTCAGCTGGAAATAGCGGTCCAGCTAAAGGCTCACTTGCAAATACAGCTCCTTG GA
TCATGACCGTTGGTGCTGGTACCATAGATCGTGATTTCCCTGCATTTGCTACTTTAG GT
AACGGGAAAAAAATTACCGGAGTTTCGTTATACAGTGGAAAAGGAATGGGTAAAAAG GT
AGTTC CATTAGTTTACAG CACAG ACAGTAGTG CAAGTCTTTGTTTG CCG GGTTCACTTG
ACCCGAAAATGGTCCGAGGGAAAATAGTGTTATGTGATAGAGGGACAAATGCGAGAG T
AGAAAAGG GTTTAGTAGTG AAG GAAGCTG GTG GAGTTG G G ATG ATATTG GCTAATACG
GCGGAGAGCGGCGAGGAATTGGTGGCGGATAGTCATTTGTTGCCGGCGGTAGCTGTA
GGTAGGAAATTGGGAGATTTTATAAGGCAGTATGTAAAGAGTGAAAAGAATCCGGCC G
CCGTGCTCAGCTTTGGTGGGACGGTGGTGAATGTGAAACCGTCGCCGGTGGTGGCTG
CGTTTAGTTCAAGAGGGCCCAATACTGTAACTCCACAGATTTTGAAGCCCGATGTTA TT
GGGCCTGGAGTTAATATTTTGGCTGCTTGGTCTGAGGCTATTGGGCCCACTGGGCTT G
AAAAGGATACCAGAAGGACCAAGTTCAACATCATGTCTGGTAAGTATTACCAACAAC GG
CTAGTTTCTTAATTTAATCTTTTTCATGCTTAGCTTAATTATGGCCTTAATTATATT TTTAT
TAG ATCTC G C AATTATTAATACTAAC C GTAC AC ACTTAAAAAG G AAAAG AG G AAC G C GTA
GAATAAAGACACCTGTGGGTGATCTGGAATTATGTACTATGCACATTCCTAAACTTT AGA
GGGGTTCACATGTGTAGCATTGATAAGTTAATCCTAAATTACATTAGTTATAATTAA ATAT
TAATGCAGTTTCCAAGAAAATAGATGGACTAAAATTTAGACTTATTTGTATGATGTG ACG
TGTGGAATTAAATTTAAAAACTGCCCAAGCCTATATCAAATTTATGGCTAAAATAGC AAG
AAACGTCCCTTTAATAGGCACAGAAGAAATCCAAGAGGGGCTCGCTGTAGGAGTGTT A
AGAGTTTCGATATGAACAAGGTCTAGAGAAGAATTTATTAATTAATTTCAATAATAT ACGC
TAATGGTATTTGAAAACAATATATTGTAATTTATCGTAACAAGTTACTAATTTCGCT TATTA
TAGACCATTATTGTGAAGTTATTTCTATAGATAAGCCAATAGCATAAAATTCATCCG TCG
GAATGTGCAAGGTGTAGTGGTAGGAGTGCTACTCATGATGTGACAAGTGCATGTCAC G
GGGTTTGAATCATAATGCAAACAAAAGCCTGATATGTTAGTGAAAAATGATAGAGGG AC
G GGTTCATTATTC ACACAAAAG CTTG ATATTTAAGTGAAAAATG ATAG G GGAACG AGTTC
ATTATCCAAAGAGTTTCGAACCTAACCTTACACCATGGCCCTTCTTGGTATAATTTA CAC
TAGTTTTATGAGGCTCCTTTTTGTCTCACAAATTTGTGGATCCCAATCTTACACTTC TGG
GTCCACAAAATTGTGAGACAAAAAAGTTACCTCATACAACTAGTCTCGGTTGTGACT AGT
TGTATGAGACACAAAATAAAATTTTCCGAAAAAGTAGTATGGTCTGTCTTCCGCTAT GAG
ACTAGTTGTTTGAAACACAAAATAAAATTTTCCGAAAAAGTAGTATCGTCTGTCTTC CAC
TAGTGGGTCCTGGTCCCCTTGGAATCCCAGATTATTGGTCCCTACATAACTATAAAG GT
CATAACCTTATCATGGATTTAACATCAACCCTTTGCCCCATCTGAGCACTCTGGACC TAC
CTTAATCACTTTATTGGCTGGAAATAAGTTGATGAACTTTTTGAATTTTTCTTGAAA AAAC
AACAACAAAAAACCACTTGTGATCCCACAAGTGGGTCCGGGGATTAGTGTGTTATAA AG
AGGATGTTTCGGATAGACTTTCGGCTTAGGAAAGATCAATAAAGTAGTAGAAACAAG CA
ATAACAATAGCAAAATACTGAATTTTTCTTGAAAATCCTACACAAATCTCATACTTT GAAA
ATTGTATTTTGTTACATAATTTGATCATTTTTCACTTCGAACTCTTGTAGGCACATC CATG
TCCTGTCCTCATATCAGTGGCCTAGCTGCACTGCTGAAAGCAGCACATCCTGAATGG A
GTCCAAGCGCGATCAAATCTGCACTTATGACGACTGCCTATGTTCGCGACACCACCA AC
TCTCCTCTCCGCGACGCTGAAGGTGGCCAACTCTCCACTCCTTGGGCTCATGGATCA G
GTCATGTTGATCCCCATAAGGCACTTTCCCCCGGTCTAATCTATGATATTACCCCAG AG
GACTACATCAAATTCTTATGCTCCTTGGACTATGAGTTGAACCACATACAAGCCATT GTC
AAGCGCCCGAATGTCACTTGTACTAAGAAATTTGCAGATCCTGGGCAGATTAACTAC CC
TTCATTCTCAGTTTTGTTCGGGAAATCAAGGGTTGTTCGTTACACCCGTGCAGTCAT CAA
TGTAGGAGCTGCAGGATCCGTCTATGAGGTGACCGTTGATGCTCCCCCGTCTGTTAC T GTAACCGTGAAGCCATCAAAACTTGTATTCAAAAGGGTAGGAGAGAGGCTGCGTTACA CCGTTAC ATTCGTGTCAAAGAAG G GTGTTAACATG ATG AG AAAG AGTG CATTTG G CTCC ATTTCTTGGAATAATGCTCAAAACCAAGTTAGGAGTCCAGTTTCATATTCCTGGTCACAA CTATTAGAC
SEQ 4
TCAAGCATCAGCACATCTTGTTGGTGCATATCCCAGTCTGGACCTTTTGGTGTCATATAA
GATATGAAAATTCTGCTGCTGATAGTTTCCAATTATCGACAAAGCAGATCGAGGAGT CC
CTAAAACTGCCAAACAAACGATATCCTCTGGTTCGAGTTTGATAAAGTAGTTCTCTA CTG
GAAAATTCCATACAGCTCCATCACCAAACACGATCCCAAACGAGGGAAATTCCAAGT TC
TTCACACCAGACACATTGTAACACGGATTCAAAATAGGAAAGTCTTGTACAATGGGA TAT
CCCTTAACCTTATTGACAAATGCCTCTTTTATAATCTCATAAGCAGGATCCGCGAAA TAA
CTCAATGTGGTACCTGAATCAATGATTGCACCACCAAGACCTTCTAGCGATAAATTC CA
CGTCTCCTCGGGTATATTCAGTACCTCTCCTCCAACTATGACAGACTTTATCTGCAC ATA
GTAGAATGTTTCCACTTCTTTGCCTCCAACCAATGAAGTAAAATTCAACTGTGGATG TTT
CAAAAGTTCCTTATCTTCACCAAAAATCAACTTACTACTAACACTAGAATTGCTATT CCTA
TCAACAAGACAATACGAAAACGAATGACCATATAAAGATTGAAGCTGAGAAGCAAAC GA
AAGCGGCCCTCTCCCTAATCCTAACAAACCAGCAGCACCATGAAATAATCCTCTATT CC
AATGACCACAACCAAACATCACATTTTCCACCTTCCTAAATTCACTCCCACTCGTCG TCG
TGAGGTTAACAGTAAATGTCTCTAGCGCGAAATCGCCAGTAGTATTAGAACTATCAC CA
TACCAATAGTAATAAGGACAAGTTTGATTCTCGGATTTACAAAGCTGAGGAGGATCA GG
GGATGTAACAAATTTACACCTAGGATCATGACAACTTATATTTCTAAATGAAGTAGA GTC
TTGAGGATTATAATGAGGTCCATTTTGTTCAAAACAATCAAAACAAGGAACACATTG AAT
CCAATTAAGATCACTACCAGTATCAAGAATTAAAGAAAAATGCTTAGGTGGTGTACC AAC
AAACACATCCATAAAATACTCACCAGAGCCAAGGCTTACACCTGACTCCAAAGTCGC CA
TTAGTTTGCCGGAAAGTTCATAAGATTCCAGCGAAACTGCTGCCGGAGCAATCACAG G
CTTATGTTTGTCCACATGTTTTTCATTACTTTTTGCAAGTCTTGAATTGTAATTCTG ATTTT
TCTTCTCAACAATTCTTGTATGGAGTGTCTGAATTCTGCTTAAATCCCTTGCTCTTG ACT
CAAAGACTGAATCCTTAGCCTCAATTTTTTTACCAGCTGATCTGTGCCTTAACTGAA ACT
TTACAGCTTCTTTTTTCTGGTTTCCAAAAATGGAAACTTCTTCATTTTCTCCATTTT TTACA
TCAACACCATCTACTTCTTGAGCTATTGAATGGGTTTTTGATTTTTGAGAAACTCCA TAG
TTG CAATCTG AGTCAGCTGAAG AAG AAACAG CATTAAAG CTTG G ATG GTTAG GG AATTC
AATACCCGAAACACTAGAATTTAGATTTCTGAAGCTGTAAAATCCTCCACAGGCAAC AAA
ACCAGAGGAAAACAAGAATATAAACAACAAAATGAAAAGAATGAACTTTGTCCCCAT
SEQ 5
TTACATTGAGGCATCAAGGAAAGCATTAGAGACATCATCTAATTCCACATTCAGATTTTT
GGCTGAAGGCAATCCAGCCACCACATTATGTTCAATGCCACATTCATTTGTTCCTCT TCT
GATCTTGAAGTAACCATCCTGTTATCATAGAGATAGACATACATTTAGACAAGAAGC TTA
TACAAATAGAGTTTAACTTTTGTGTATTGATAGTTTTCAGTTTGTTTAATCATTCAG GCTA
AGATGACTTATGCAACTGTCTAAAGTAATTCTAATTTAGTAACTTTAAGAGTGCAGT AAAT
AACTTGCTTTAACTTTTAAGATACACAGATAGTGTAAAAATTATTTACGCTGTCAGT ATAG
TTAGCG CATCTACAACAAGG AGTTAG GATGTC GATGGTTTAAG G AAATTTGTG CTCATA
TATGAAAGCATAAGGAGAAAATAGAGTACATACATCACCCCAGCCTCTGTTCCAAGA AT
TAGCAATAAGCTGCAGAAGATAATTAAAAAAAAAGGTGATGATTAGATTTTGATAAA ATG
ATATGATTTTAGATAGAAAATAAGACACCACAACTTGAACACATCAACATACCCAAT AGT
CCTCTCCCTGCTCACTGGTTCCCCATCCGATAAGCTTAACAGCATGGCCTCCCATAC TT TGCCCTGTTACATGCTTGTAAACTCCAGACTTGTAGTGAGCAAAATCCTGTGATTGACA
AAAAAGTTTTAAGTCATTGGGTTAGCGGAAAGCATTGCTAAGAAAAAGAGAAAATAA CAA
TTATAATCAACGAGAACAATCAAATTCAAACAAGCATTAGTTTATACTAGAGAAACT GAG
AAACTAACAATTATAACCAGATAACTTCTCATATTGTGCTAAGCATTAAATTATCTC ATTT
ACAAGTATTAATTTGCAAGTAAATCTCGGACAACATAAAATGAGAAGGTACATCGGT AGT
TAAAGTTTCTCAATTATCATAATAGTTACTATCAGGTCACTTAAAAGATCATTACAT GTAT
CACATTATG GG ATTGG CAGGTAATTGTTTCTATGTACACATAC CACAG GACCAG CAT AT
AAGTTTACCAGCTCAGACAATGTCATGTGTGAAAACTATATCAAATAATTTTAGTAT CAG
AACTTAACCCTCCCCCTTATCCTCATTACCTCGTAGACGGTAAAAGAGACCTCGACT GG
TCCATTTTTGTAAATTTCTGTCATGATACTGTTGGGATCATGGTGGATCCTGTATGC ATT
GACACCATAATGCTTTGATTTCCCCCATAGTAGAATCTCCTTCACACACTTCCTCTG ACA
CTTTGGGGTGGGATATCCTGGTTCACAACCAGGGTGGGAACATCCCTCATTATCAAA GT
AAG G GTCACACTGC CAG AAAAAAC AACTTTATTAGTGATTG ATCATAAAGATCCACG GT
AGCTAATGGTTTTAGAGGAAGCTGTAATCTCTTTTGGTGAAAATAAGTCACCATTTA CCT
CTTCTGTGACCACACCCCTACGGATAAAGTATCGCCATGCTGTTATTGGATATCCAC CA
TCACAACCACTCCCACATAAAAAGCCACAGCATGCTAACAGATCATTTACAGACAGA GA
GATATTCTGCATTACACATTAGAAGTTTAACATCAGTGACCATAACTACAGAAATAG ATT
CACATACGTTTTGTGCTAGGTGAGATGGTTTCCACATGCTTAGACCATGAAAAAGAA TC
ATGAGGCTGGCACGTGAGAGCACTTGCTGAAGATATATAATTAAGTACATAAAAATG TG
TCATCTAAAGCTTTTAAATGAGGCGGTCACACAGTTCAACAAAATTTACCAAGTTAT GAT
G G ATAC AG AAAC G AT CAG AC AG C G ATTC AAC AG C AC C AAAAG C C C AAC AAG AAC C G C A
ATGTCCCTGATCTGTGACGCAAATTTGTCCCGGTGATTGATGTGCAAAGACGGAAAG CA
TTAGGATCACTATCATAGAATTATAATTCAATAGTAGTAAGCAAGAACAAGAAGAGA CTG
ACCCAGAATTCTTCCGATAGTACTACATTGAGGCCAAGCTTTTCGTGCATCAAACTC TTT
TGGTAGCTCCAAAAGTTTTGGATGAGTTAGAATAGGAATTCCCTCCAAATCACCTTC TCT
TGCGGGCTTAACTCCAAGAAGGCGCTTAAATTGCGAAACCTGTGAAAACCAAAGAAA AA
CATAATTATAAATCGACGTTTTCGGTGTATGGCTGCAAAATAAGCAACTGTTATTGA TGG
TTAGAGAAAGCGCTAAGCCATATGATGATCTGACCGTGAAATTCGAGAATCGAGGGT TG
AATGCAGCTTTCCACCCAGCTTTGGCATTTTCATTAACCTCTTTAATGATTGATTCC TAC
ATGATTATCCAAAAAGCTCTCTTAGTTTTAAATTTGAAGCAACAAGGGCAATAACAT CTT
CTCTAAACATGAAAAGAAGAATATACCTGAAGGATTGCAGATTCAACTTTAGCTTCA GAT
ATTGGCTGCTCTGCAACAACCTGTTTTCCATAAGAACAAAGAGATTCTACTCATCAA AAG
ACATCCTTAAAGCTTTTAGGAAACAGAGCTGCAACTCCAGGAACAAAAGCATGACAC AA
TGAGTGACAAACGAAGAACTTCGGGCTCGTTTGGTACGAGGGATAAGGGATAATTAA TC
TCGGGATTAAATTTGAGATGAGTTTATCCCATGTTTGATTGTAGTGTTATTTTAATA ATTA
TGGGAGGGTGGGATAAACAATCGCGGGATAACTAATTTCGGGATAATTAATCCTGCG AA
CCAAACAATCCCTAAAGGTTTCACTTTAATCAAGATGAAACTCTTCCACAACTTTTA TTTT
CAACATTATAATACTATTAGCCTGGAAAATTAATCAAAAGTTTGTAGGAAATTCATC ATAT
GTCTAAAGCACTATAACGTAGAGGAAAAAGAATCATAGAACAAGCAGAAATTGTAAT TA
GTCCATTATTTCTCCTCCTTCTTCTCCCCTTTCTGTATTTCTTTGTGAAGCAATACT TCCT
CTCATGTTATTATATTTCGACAAGTAAGTTAGCTAACACATAACATAAGTAATTTGC ATCA
AACCATATAATTAACTTCAGAAACATGTGTATACTTCTCTTTTCTCATTCTCACTAG GTAA
TGAGAAAATCATTAAAATTTGCTTCTACTCATGATTTCTAGTCAACGCTTAACTAAA GCAT
AAGAAGTCCAAAATACCCAACAATATTTGATCTTTCTGAAGAAACACAAAAAGGCTA ATC
CTTGTGTTCATCAAAAGCTATACAAATCAAATCAATACGCTAAATCCACCTAAAACA AAA
TCATCAATTCAATAGGCAAGAACTACCCATAAGACATACTCCTACTGTGAAAGGTTC AAA
GAATG AAG AAAC AAAC CTG C AATATAAG GAT AAAC AAAG C AC C AAAAAG C AAAG G AGTT
G CTAAAG ACTTC AG G GTC AAG G C C AT
SEQ 6
ACATTAGTCCTCCATACTTCTTTCTATCTTCTTCTGTCAGTCGCATCTCCCGGCGACTGT
CTCCTCCTCTCCATTTTTCCTTTCTCTTTTTCCTCACCGAGATATTTTCCCTATAAA CAAA
ACACCGTAAAAATCATCTCCTCTAATTTCCTATTTTCCCCATTTTTCCAAATGGGTT CTTT
CCTCTGTTTCTCTGTCATTGTTCTTTTCCTTGTTTTTCAGCCATGTTTTTCCAAGAA AGTT
TACATTGTTCACATGAAAAACCACCAAATACCTTCTTCTTTTGCTACACACCATGAT TGG
TACAATGCTCAGCTCCAATCCTTGTCCTCTTCTTCAACCTCTGACGAATCATCACTT CTT
TACTCTTACGACACTGCTTATTCTGGCTTTGCTGCTTCTCTTGACCCACATGAAGCT GAA
CTACTCCGTCAATCTGATGATGTTGTTGGAGTTTACGAGGATACTGTTTATACACTT CAT
ACAACAAGGACTCCTGAGTTTCTGGGGCTGAATAATGAGCTCGGTCTTTGGGCTGGT C
ACAGTCCGCAGGAACTCAACAACGCTGCTCAGGATGTTGTTATCGGAGTTCTCGACA C
CGGTGTTTGGCCGGAGTCGAAGAGCTTTAACGATTTCGGTATGCCCAATGTGCCGTC G
AGGTGGAAAGGTGAATGTGAATCGGGTCCTGATTTCGATCCGAAAGTACATTGCAAC AA
AAAGTTAATCGGTGCTCGATTTTTCTCCAAAGGTTACCAAATGTCGGCTTCTGGTTC ATT
TACGAACCAACCTAGACAGCCGGAGTCACCTCGGGACCAGGACGGTCATGGGACTCA
CACATCCAGTACCGCCGCTGGTGCACCGGTGGCGAACGCTAGCCTTCTCGGTTACGC T
AGCGGGGTCGCGCGTGGTATGGCACCGCGAGCGCGTGTAGCTACGTACAAGGTGTGC
TGGCCTACTGGTTGTTTTGGTTCTGATATTCTAGCTGGTATGGAACGTGCTATTTTA GAT
GGCGTTGATGTACTTTCTTTATCTTTGGGTGGTGGATCGGGTCCTTATTATCATGAT ACA
ATTGCTATTG GTG CTTTCTCTG CTATGG AAAAAG GAATTGTTGTTTCCTGTTCAG CTG GA
AATAGCGGTCCAGCCAAAGCTTCACTTGCAAATACAGCTCCTTGGATTATGACCGTT GG
TGCTGGTACCATAGATCGTGATTTCCCTGCTTTTGCTACTTTAGGTAACGGGAAAAA GA
TTACCGGAGTTTCGTTGTACAGTGGAAAAGGAATGGGTAAAAAGGTAGTTCCCTTAG TT
TACAGCACAGATAGTAGTGCAAGTCTTTGTTTGCCGGGTTCACTTGACCCGAAAATA GT
CCGTGGGAAAATAGTGTTATGTGATAGAGGGACAAATGCGAGAGTAGAAAAGGGTTT A
GTAGTGAAAGAAGCTGGTGGGGTTGGGATGATATTGGCGAACACGGCGGAGAGCGGC
GAGGAATTGGTGGCGGATAGTCATTTGTTACCGGCGGTAGCTGTAGGGAGGAAATTG G
GTGATTTTATAAGGCAGTATGTGAAGAGTGAGAAGAATCCGGCCGCCGTGCTCAGCT TT
GGTGGGACGGTGGTGAATGTGAAACCGTCGCCGGTGGTGGCTGCGTTTAGTTCAAGA
GGGCCCAATACTGTAACTCCACAGATTTTGAAGCCCGATGTTATTGGGCCTGGAGTT AA
TATTTTGGCTGCTTGGTCTGAAGCTATTGGGCCCACTGGGCTTGAAAAGGATACTAG AA
GAACTAAGTTCAACATCATGTCTGGTAAGTATTACCAACAACGGCTAGTTTTTGTCA TAA
TCTTTTTATTTATGCTTAGATTAATTATGGCCTTAATTATATTTTTATTAGATCTTG CAATT
ATTAATACTAATCGTACACACTTGAAAGGAAAAAGAGGAACATGTTTAATTAGTGCG TAG
TG ATCTG G AG CACATG CCTAAAGTTTAG AG G GGTTCACATGTGTTG CATTG ATAAGTTA
ATCCTAAATTACATTAGTTATAATTAAATATTAATGCGCTTCCAAGAAAAAGTTGAC TAAA
TTTATCATATATTTCCAAATTTGTTTTGAAAAATATGATTTTGGTGAAGTTTGGCTT GAAG
ATGAAAATGTGTTTGGACATCAATTTTCAAAACATATTTCCCAAATTTATTTTGGAA AAAC
ATGAAACATTTCTTATACCCACAAGTTTAAAAAACTATCACAAATATCCAACGGTAC CATT
ATCAATAACATTCATTATATTATCTCAAACCATAATCCTGAATATAAATAAATTTGG CACA
ATATTATCATTTTTATAATTAACTATATGATACACTATTAGATGATCGAGAATACGA AGCA
ACATCGTTTCAAAATAATAAATGAAAAATGGTGGACTCTTTTATATAATACAAAAGT TTGG
AATAATTTTTAAAAAATATAATAATGATATTTTGACCCAAAACCAACATGTAGTCAA AATC
TATGACCAAACATGTGTTTGCCAAATAAAACCCAAATTTATTTTGACAAAATATATG GCC
AAACGGGGCTTAGTTGTATGATGTGTCGTGTGATATTAATAAAAGAACTGCCGAAGC CT
AT AC C AAATTTATG G CTAAAATAG C AAG AAAC GTC C CTTTAAC AG G G AC AG AAG AAATC
CAAG AGG G GG CTCG CTG GTCTAG AG AAAAATTTATTAATTAATTTCAATAATACG CTG AT
GGTGTAAAAAATATTGACACCATCAATATATTGTAATATATCGTAAAAGTTTATTAA TTTC
ACTTATTATAGACAATTATTTGAAGTTATCTTTATAGCTAACCCAATAGTGTAAAAT TCAT
CCGTTGGAATGTGCAATATGATGTTTGTTTTCAGCTTTTGTGCAGTAGTGATTTTAA ATA
GGTATTACTTGGAGCTTTTGTGCGATGTGACAAAGTGCATGTCACAAGGTTAGAGTC AT
AATGAAGGCAAAAACATGATATTTAAGTGAAAAGTGATAGAGGGACGAGTTCATTGT CC G CAC AAAAG CCTGATATTTAAGTTAAAAAAAATAG GTG ACCAG CTTG ATATTTAAGTG AA
AAAGGATAGAAAGACGGGTTCATTATCCACCGAAAGTCGAACCTAACCTTTTGCCAT GG
CCTTTCTTGGTCATCAAAATAATTTAGGAGACTACCTAGGAAAAGTAGTATGGTCTG TCT
TCCACTAGTGGGTCATAACCTTAATATCATCCCCCTTGCCCCGTTGAGTACTCTGGA CC
TATCTTAATCACTTCATTAGCTGGAAATAAGTTGATGAACTTTTTGAATCATTCTTG AAAA
TTCACAAATTC GAACC GTG G AAACAATCTATTACAG G AATGC AGTCTAAGTCTTCGTACA
ATAGACCCTGTGGTCCGGCCCTTATAGCAGGAGCCTACTGCACTGGGCTGACCTTTT T
CTTTAAAATCTTACAGAGCTCAAAATTTGGACTTTGTACTGTTTCGTTACATTATTT GATC
CTTTTTGTACGTCAAACTCTTTCAGGCACATCCATGTCCTGTCCTCATATCAGTGGC CTA
GCTGCACTTCTGAAAGCAGCGCATCCCGAGTGGAGTCCAAGCGCGATCAAATCTGCA C
TTATGACGACTGCCTATGTTCACGACACCACCAACTCTCCTCTCCGTGACGCTGAAG GT
GGCCAACTCTCCACTCCTTTCGCTCATGGATCAGGTCATGTTGATCCCCACAAGGCA CT
TTCCCCGGGTCTCATCTATGATATTACTCCAGAGGACTACATCAAATTCTTATGCTC CTT
GGACTATGAGTTGAACCACATACAAGCCATTGTCAAGCGCCCGAATGTCACTTGTGC TA
AGAAATTTGCAGATCCCGGGCAGATTAACTACCCTTCGTTCTCAGTTTTGTTTGGGA AAT
CAAGGGTTGTTCGTTACACCCGTGCAGTGACCAATGTAGCAGCTGCAGGATCCGTTT AT
GAGGTAGTCGTTGATGCTCCCCCATCCGTTCTGGTGACCGTGAAGCCATCAAAGCTT G
TGTTCAAAAGGGTAGGAGAGAGGCTGCGCTACACCGTTACATTTGTGTCCAACAAGG G
TGTTAACATGATGAGAAAGAGTGCATTTGGTTCCATTTCTTGGAATAATGCTCAAAA CCA
AGTTAGGAGTCCAGTCTCATATTCCTGGTCACAACTATTAGAC
SEQ 8
ATGAATCCTGAAAAATTCACCCACAAGACTAACGAGGCCCTTGCTGGGGCACACGAGC
TAGCACTATCCGCAGGGCATGCTCAATTTACGCCTCTGCATATGGCTGTGGCCTTAA TA
TCTGATCACAATGGTATTTTTCGACAAGCGATTGTCAATGCTGGTGGGAATGAAGAA GT
AGCTAATTCAGTGGAGCGGGTATTGAATCAAGCGATGAAGAAGCTACCTTCTCAGAC AC
CGGCTCCAGACGAAATCCCACCTAGCACTTCCCTTATCAAGGTGTTACGACGAGCAC A
ATCGTCG CAGAAGTCTTGTG GTG ACAG CCATTTAG CAGTG G ATCAGTTGATTTTAG G AC
TGCTAGAAGACTCTCAAATCGGAGATCTTTTGAAAGAAGCTGGGGTGAGTGCATCAA GA
GTGAAATCAG AG GTAG AG AAACTTAG AGG AAAG GAAG G AAG AAAAGTGG AAAGTG CTT
CAGGGGATACCACATTCCAAGCACTCAAGACTTATGGCCGTGATCTTGTGGAACAAG C
AGGAAAGCTTGATCCCGTGATTGGTAGGGATGAAGAAATTAGAAGAGTCGTTCGGAT TT
TATCAAGGAGGACGAAGAACAACCCGGTTCTTATTGGAGAGCCTGGTGTGGGTAAAA C
AGCAGTTGTTG AAG G GCTAG CACAG AG G ATTGTAC GTG GTGATGTTCCAAGTAATTTAG
CTGATGTTAGGCTTATAGCATTGGATATGGGAGCGCTAGTTGCTGGAGCTAAGTACA GA
G GTG AATTTG AAG AG AG G CTG AAGG CTGTG CTGAAAG AAGTTG AAG AAG CAG AAG G GA
AAGTGATACTTTTCATTGACGAGATACATTTAGTCCTTGGTGCTGGTCGGACAGAAG GG
TCTATGGATGCTGCTAATCTGTTTAAGCCAATGCTAGCCAGAGGTCAATTACGGTGT AT
TG GTG CAACTACACTTGAAGAGTACAGG AAGTATGTTG AG AAG G ATG CTG CATTCG AGA
GGCGTTTCCAGCAGGTGTATGTGGCTGAGCCTAGTGTTACTGACACTATTAGTATTC TC
CG CG GGTTG AAG GAGAG GTATG AAG GG CATCATG GTGTTAAAATTCAG G AC AG AG CTC
TTGTGGTGGCTGCCCAGCTCTCATCTCGGTACATTACAGGTATCTATACTTTTGCTA TTT
TTACATAGCACCTTGTTTTGATGTCTTTTCTCCGTCAATAACTAAGCATGTATATGC ACTA
CTTTTTCCTCGTGCATTTCATTAACTCTATAAATCAGAATGGGACTTAGATTCGGTT AAG
CGAATGAAGGTGAATTTTAACCTAAAATGTTATGGTGTCGGAGCTATAGATGTATAT TTG
TCTG GTACTAAAATGACTTCTTG AAG CAGTAG CCAG AATTTTG ATTCATTTAAGCAG GTA
GGGCATGAGACTTAATTAGCATATCATTGTCTGCACTTCCTTCTGGACCTTTACCAG TGT
ATGAGTTGTTTTTGTGTTACAAGCTGCTCCCCATCTGGATAATGGTGGATTAAGACT TAT
ATGATTGTCAGAAGTGTACTAAAACTTCTTGAGGATAATTAAAAATTGCTCAAATCA AAT CCGTAGCTCGTTTTCCACTGTCAGTTTTTGCAAAATGCTTTTTATGTCTGTGTCGTGACA
AATTAAG CAGTCAG CCAGTTAAATTTTG GCAGTTTGG CATGC AAATTGTCTTTG CTG CAC
ATTTCAGGTGCAAAAATCACTAACCTCTTTGTATTTTCAGGTCGACATCTGCCAGAT AAG
GCTATTGACCTAGTTGATGAAGCTTGTGCAAATGTTAGAGTTCAGCTTGATAGTCAA CCT
GAGGAAATTGACAATCTCGAAAGGAAGAGAATTCAGCTAGAAGTTGAACTTCACGCT CT
CGAGAAGGAAAAAGACAAAGCTAGCAAAGCACGTCTAGTAGAAGTAAGTATTATATA CT
ACCAATGCTTTTACTGGTAATTGCTCTATTTTCTAAAAGATATGTTAAGAATTATAC TGAC
TCGAATTATACTGACACTGGTCCAGGTGAGGAAAGAACTTGATGATTTGAGAGACAA AC
TCCAGC CCTTGATG ATG AG GTACAAAAAAG AG AAG GAAAG GATAGATGAG CTTCG CAG
G CTCAAG CAAAAG CG CGATG AG CTCATTTATG CTTTAC AAGAAGCTG AAAGG AGATATG
ATCTGG CTAGG G CAG CAG ATCTG AGATATG G GG CAATTCAAGAAGTG G AAACTG CAAT
AGCAAATCTTGAGAGTACCTCAGCTGAGAGTACAATGCTAACAGAGACTGTGGGTCC T
GATCAGATTGCCGAAGTTGTGAGTCGCTGGACTGGTATTCCGGTCTCAAGGCTTGGG C
AGAATGAGAAAGAGAAACTGATTGGTCTTGGCGATAGGTTGCACCAAAGAGTGGTCG G
G CAAG ATCATG CAGTTAGAGCTGTTG CTG AAG CCGTGTTAAG ATCTAG AGCTG GTTTAG
GAAGGCCACAGCAACCAACTGGTTCATTCCTTTTCTTGGGGCCAACTGGTGTTGGAA A
G AC GG AG CTCG CTAAAGCTCTTG CAGAG CAG CTCTTTG ATG ATGATAAACTGATG ATCA
GAATAGACATGTCCGAGTACATGGAACAACACTCTGTTGCCCGGCTGATTGGTGCTC C
ACCAGGGTAAGTTTGAATCTAATTCTTTTCTTTTAATGTCATGTCATATTATTACAG TATT
CAATCACAGATTCTCATGTGTTCCACATCTGCAGTTATGTTG GG CACG ATG AG G GAG GA
CAACTTACTG AAG CTGTTAG GAG G CGG CCTTACAGTGTTGTG CTCTTTG ATG AAGTTGA
GAAAGCCCATCCTACGGTGTTTAATACATTGCTTCAAGTTTTGGATGATGGAAGGTT AA
CAGATGGTCAAGGCCGCACAGTTGATTTCACCAACACCGTGATTATTATGACTTCAA AC
TTGGGAGCAGAGTATCTGTTGTCTGGATTAATGGGAAAATGTACGATGGAGACGGCT C
GTGAAATGGTAATGCAGGAGGTAAATAGTCTCAAACTAGTAACTTCCCCTTTGCTGA TA
AAACTGGAAGAATACAGTGAAATAGTTTACCTTATTAGCTAGAATGACAACTGTTTA CAT
GTGTGTATGCTTTGTGATAGGTGCGAAAGCAGTTTAAGCCCGAGCTCCTGAATCGGC T
GGATGAGATTGTTGTGTTTGATCCCCTGTCCCACGAGCAGTTGAGGCAAGTATGCCG C
TACCAGATGAAAGACGTTGCACTACGGCTGGCTGAGAGGGGTATTGCATTGGGTGTT A
CTGAGGCAGCTCTAGATGTCATACTCTCAGAGAGTTATGACCCGGTAAGTGTTATAT CT
TGTAATCTAGTCCAATATTTTAGGATTATTTTGCGAACTTGTACTTATTGTGGTGAT CATG
G CATTCAG GTTT ATG GTG CAAG ACCTATTAG GAG ATG GTTGG AGAGG AAG GTGGTG AC
CGAGCTATCCAAGATGCTTGTGAAGGAGGAGATTGATGAGAACTCAACGGTTTACAT AG
ATGCTGGTGTCGGCAGGAAAGATCTAACCTACAGGGTGGAGAAGAATGGAGGTCTTG T
GAATGCTGCCACCGGGCAAAAATCTGATATATTGATTCAGCTTCCTAATGGTCCCAG GA
GTGATGCTGTCCAAGCAGTCAAGAAAATGAGGATTGAAGAAATTGAAGAAGACGAAA TG
GAAGAT
SEQ 9
TTATGTAAATGCTTCACGTTGCTGTGTAGGTAGCTCCAGTTCAGGCTCAAATGCTGTTA
GCCGAGAAAAATACTGATCTACCATGTCATCTGTTACTTTATCCAAGCTTGGAGGAT CC
CACTG CACGTTTTAACATAAATCAAG AAACTCTCTCTAAG GTTAAAGTTG ACTCTTAG GA
AAATTCCTCCAGGAAGGGGCTCATAATTCATAAAAATAGCATATTAGTTCGCAATAA TAT
TGATTTACCTTAGGTGCAAAATCTCTATCCACAAGCCGTGCACGAACCCCCTGATAG GA
AGATATTAGTACACTGTGATGATGATCTGTGGAATTTAGTACTGAGACTTTTTTTAG CAG
CATATATTACCTCACAGAAGTCATTAGTTATTTGTCCAGAGAAAGCTTGTACTGACA TTC
GATACTCACGAATTAGACACTGGTCCAGAGTCTGATGTCTGCCTTCGCGTATCTGAT CA
GAAGCATTATTGGAAGGTCATCTCACAATGCCCTGGTGGCAGAAGGGAAAAGGAGAA C
ATTCTATCTAGAGAAACTTACAGATCTCAGTGAAACCTTCAAGCTCAGTGGGGCTGT TT CTTGTAGTTTTCGCAATGTTGAAACACACCATGCATCTTGCTTCTTGGCTGCCTCACTTT
CCTGGCAACATTTAAATTTCTTTTTAAAAATGTAATCACACCTGTCCTTAAAAACTT CCAC
ACATGTAGCAGACTTTATATGTTTACTTAAATACTTTACTGTCAGTATTTTAAAAAC AAGA
GAAAATCCATCTCGCTCGAAAGAGCACCCAGAGGTCGTAGATGCAAACGCAATGACT C
AGCTTGTACTATGGTGCAACAAAGTGCAGCACTTAAAGGCTTCTCCGTGCAATTTAT AC
GTGATGTTAGAATCTCCACAATATGCTTTGCCCAAAGAATGATAGATATAAGTTAGT GTG
ACAACAGCCCCAGGACCTATAGGTACTTTTTGAACCTCAAGGAGCAAACAAAGAGTA TG
CTGAGCAATGATCATTGCATTGTTTCAGATTTTTCAAATAGTGGCAACTTGTGATGG CCG
AGGTATTAAAAGCAAAGCAATCCGATGCTGAACATTATTGCATTGATCAACTAAAAT ATG
TTTCCACTGGCACCTAAGGGGGTAGACTGTAGCCAAATAGTCAATGAAGTCGGTGAA AA
ACACGGGAGACTAGGTTTCAAATCTAGCAAAGAGTCGGAGAGAGTCAGAGACAGAAA G
GCTAGGTGATTTCTTCCTATCTGCCTAAAATTTGGTAGATAGCAAGTACCTGTTGGA ATA
GTCGAGGTGCGAGCAAATTGGCCCACACCACCGTTATAGAAAAGAATAATGTTTCTA CT
GGCCACACTTTGATTGTATGCCCATCCGATCGAGTTTCTTACCAGGTAATAGCCTTC AC
ACCCCAGGCTAGTGAATTAAGATGTGCTATCCTATAAAGACTTACTTCGTCCTCCAC TAA
TATGATCGTATCTTGTTACTGGAGTCGAAGATGGTATGTTGGCGAGGCTTCCACCTT CT
GAATACCACATGTTTAATCCCTGTTCTTTCTTTTCTCACTAGAGATATTAAGCAAAT TAAA
ATCAATTAACATCAGAATTTTGTATCTTTCAGAGACCTTTTCAGTATATTTGGGAAA CAAG
TTACACACTCAGGAACTATATGCTCACCAAAGCATCAATAATTTCTTCGACTGTGTC ATG
GCTGAAACATTTATTGAGAGTTTCAATCCTAAAGAAAGAGAAAAGATGGATTAGATT AAG
CTATCAAATACTCATTGATCTGCAAGCAAGATCACACAAACTTGATAAGAATTTATT TTTA
TAAAAACAAAAGGTTAAAGAACAAGACTAAATACAAAAACCAAAAGATCCATGTGAA ACT
ATGACAAAAATTATCATGAACCACAAAGTTTTAAGAATGTAGTGTGAAGTTTTAGAG AGC
TGAAGTTTGTTCCACTAGTTGATGTATTTTATATAACTCATGGAAAAGATGGTGCCA CTA
CCAGTTTCTCTATAATGGAAAAAACAATTTCGCCACTTACAAATCTAAGTGAAGCTA AAT
ATGAAGAGCTAATTGCAGGATTTTGTGTTTACTGAATTAAACAATTTACCAAAAATT ACCA
ATTTGATTCCTCGAAGAAGAAAGTAAGAGATTGATTTGCAATTCTTTCTTTTTGAGT AGTA
TGAAAAATGTTATGAAATGAGGGCAAGTGTCTCATCCAATTAAACCTGTAACTTGCT TTC
CAACAG CAG GTAAAATATTGAAGCAAG G CTG CAG CGTGTCCACTTATG ATTTCAATTAG
CCAAGGGATTCTTTCGATGTTCTATAAGAAACGTGAACGGGATTCCCCGAAGATGTA GG
TAGGGAATCCGCTGACTGGTGTGGTATATATGTTTGGTTAACAACTAGAAAAGGTGT TT
CAAATCCAAAAGCAGCCCTTAACATTAAACGGTAATATGTATCAGTCCACCCCCTTT CAA
ACTGTAG CAG G AACTAATATATTTATG GACATTCCAATTTCCATATTTAG CAAG GATG AC
AGGTACCTGTGAAGTACACTTGTTGGATCTGGATGGACAATCTCTCCACAATTTTCA AG
AGATCTTTCAATCACTGAAGGATCATCAGTCATCAATTTACCAAGTTGTTCCTCAAT TAA
GGGAAGCTTCTGAGAGGAGTGGAGAAGGCAGAGAAGAAGTTTTCAGAGCAATAGTCA A
ATAACCTAAGTAGGCTATTGGTACCTAGGTGCAAAGCAAATTAAGCAAAGGTAGTAG AC
TTACTGCACTGTGTAAGTAGTGAGTAGCAAGCCCACAGGATATCATTTCTGCTCCAT TG
ATCTTGTCTCCAGTTAGGGCCAGGTACTCTCCTGCAAATTGAAAACATTTAAATAGC TTT
CTTCCATGCTGATTTTTTCTTTCATTTACGCAGTAGCAAATGTCCAAACAGATGAGA GAC
ACAGAGAGAGGCAGCATACTAAAAACATATTCCACTTGAACCACAAACTAGTATAGC AT
AAGATAGATAGGGTTTACATGAGCTGACATGCGTGCATCAGGTTGCTTAACATTTAT AT
GCCATAGAATATGAAGTCATCAAAGAGTCAGCCATGTAAATGCAGTGCTACATGATC TA
GCATGAGTTATCAGTTATCTACAAAAAAAAGGCTTGTAACCCACTAATTTTAGTCCG CCA
CCATTTAAAGCTAAATATTAAATAGATAGAACCACATATTAACATCACCGTATGAAG TTAA
ATTTTTAAAAAAAAAGGCAGTCCAGACAAATTCCGAACATAGGTAGAATTCAAAGTC ATA
ATCTCCAAAATACATGAAACAGGAGGAGGAACACATGCTTACTTCCCAATGAGTTAG GG
AAAAATAAGATGAAGTAAAAAGAACATCACGTTAGTACAAATATTTTGATAACAAGC ATA
TAAGAAGGGAAACGTCCAGATATGACATGCTTATTTTTTCCCCATAGAAGATGCGCA TC
TCCCCAGAGAAGAGGAGATCGGGGAAGGGGGTGTGTGTGTGTGTGTGTAGAGAGAGA
GAG AG AG AG AG GGGGCGCGGG G ATC AG AAAAGTAGTTTC C AACTAG ATAATTC ATC AT CAAAGACGATCAGCTACATGACATGATAGAAGTTACTGATGGATCCAAACCACAGCATG
AGATCAGCAAAAGAATATACCCAAATAACCAGGGAGGTGTGAAAGGTAAAATGACGC C
CCAGCATCGGGATGGTAACCAATCAATGTTTCTGGTGTGGCAAAAACCTACATTGAT AC
AAGACCCAATACCCTTGTAAGAACGGTGAAAAATCGATCTAACAGCACTTAAAACAC CT
AAACCATCACCTCGTGCACCGGCCAAATTTGATAGACCTCTTTGACAATTAAGTTCG ATA
AATTTTTTATTTCCTGCACATTCCTTTAGTTCTTGAGTAATTTAAAATCAGGTTATA GCATT
TCAGTAAGACCAAGATATCTTAAATTTTTCAGCACTTTAAACGCTAAGCCTGTGGTT ATA
AGTCATATATAACCGAAATAAACAAGCTTCATCACGAAAAGTCAAAAGAATTGATAA TAT
ACAATTGAAATTACTATTTAACTTATTGGTACAAAGAAACTTCATCAATATAGAGCA TTCA
ATTTAAGAAGATTAAAAGATGAATTTTATCAAATTAACTCGGTACGAACTCATAAAT AGAT
AATAATAACTG ATAAG G C C AAAACTATC AATAAAG G AAAAG C AAG AG AAG G G G C AG AG G
AAAAATTTGCTGAATAAGTTTGTAGCACTAGGATTTGAACTTCTCTCTGAAGAGAAA TAA
TTAAGATTTAAAGAACAGAAAATTCATGATATGTGAAGTCACTAAGCTGTATATAAG AAT
GAGCACAAATGGAACTTCACATTACTTAGCAATGTAGTCTAGCAGTTCTTGAAGTAG GA
GAATTTATTCTGAACCAACAAATGAAAAGCTTAAACAAATAAAGCATGACTAATCTT TTCC
ATACAGTTTTCTCAGTTGCAACACGGAAAGTTCCAGGAATTGAGATGCCAGCCCCAC CA
CCCATGGTAATTCCATTCAAAAGAGCAACCTGCATATGGCATCTGAGGATTAATATC TAT
ACTTGGGAACCGCATCCAGGGACCAGCTAATGTCAACAAAAGTATGACAAGTACTGA AT
ATGCAATGACTGAATTCCTTCAACATGGAATTGAGGAGGTGAGAGCGAGAAGTAACT CA
AATCCGAAAGTGCGCAAAAAGCTCTAATCGATAAAAAGATACAAAGAAATATGGATG AT
AACTACTGATAAATGCACACCGGGTTTCTGAACTAAAAAAAGACGTATATTATGATT AAT
AAAAGAATGAAAAGACCATTATGGCTGCATCTTGTCAAAGCAAAAATGATCACATGA TTT
TAACAAAATAAACAACTTCCAAAATTGAGGAGATATTATTTCATGGGCAAAAAGGAA CAG
ACAAACTATAGGTCTAAACAGGTGACCACTTTGTTGAGCATCAACAACAGCTTATCC CA
AATCTTTCATTTCAGAAGTCAAAGAACACAGCAGCAAGAGAACTATGGATGAACTAA GA
AAATGGAGTTGAATATCTATGAATAATTGGATCAAATTTGCATTTGTCCTAAGGAAG TTC
TTTAAGCTTATGAATCATAATGTTAATGTGACTGATTATGTTTTCCTCAACAGGCAT CGTT
ATGAAGATTGTATAGCAGCCAGCATGATAAAGTAGTGTTGCTACCATTTCAAGTTTA TGA
ATAAGATCCAATCAAACTTTGGCAAGACACATCTAACCTCTCTGAATACCATTTTAA TTC
AGAAAGAAGTGATTGGTGTAATTAACTGTCGAGATGCTGTCTTTGCAGCAGCTTAGT CA
CAGTAAGATGAGAGAGAATCCAATTAACAGAACAAGCGATCTTCTAATAAATCCAGA TAT
TTCTATAAGAGTATCTTTAAAACTGCCAGCATAAAGTACAAAGGTGTTGAAATTTCA ATA
AGCAATGGCGAATATAGGATGAAAATGTTGATTCAACATCTAATAGAACTTACATAT ATT
TTTTAGAAATTCTTAGGTTCCAACTACAGAAGCATATTTTATTATGCGGAAGTCTAG ACG
CACAGTTAGATCACAACATAAAAGACATGTAACTACAAAATTTATAAGACGCTGGCC TCT
ATCAAGGTTTAGTAAATACAACAAAAGTTCCTGATACATATATAAGAGGCAGAAAAC AGA
AGAAAATTCAAGTCAACAGTTCTTTTAGTATCTGATGTTTGAACAATAATGATACTT ACAT
GTGGCTTCAAGAGTGTGCCGACAACGTATACTAAGTTATTTATTGTCCAACAAAAAT CTT
TACAGTCTTGAAGATTCCCTGCATCTATCCAATACAAAAAATATTACACAAAGAATG GAG
AAGATCAAATTAAAATTGAGGTAATAGTAAATTGGTAGCATGAAGATGCCGGCTGCA GT
TTTTAAAACGATGTAGAAACGTTTTAATTGTTCCCAAAAATAAATAGATATCAATAC CCTA
CTCAATAAAAACAACAGAACATCCAGCTATCAGCACAAATAGCTTAATTAAGATAAT TCA
CATCTAAACAACTTCTTCACACCCCAATAATACAATTATCGACAATGTTTAAATATT TAAA
ATTCCTTTAACCATGTATCACTTGCAACATAGCAGAATATGAACAAGTTCGCCGTGA AGT
ACAACAGTCAAAATGACACAACATACCTTGTTTTAACAAATTATAAATAGTGACAAT GTC
ACCCCCAGCAGAAAATGCCTTGCCACTTCCCTTCAAAGTTAAAGAATAACTGAGTTG CT
ACTCAGTAGATTAACATATGTAACTTCACCTTAAGATAATCCAGAAGAACAACAATA GGA
AAAG C C ATAAAATAATTC AAAC AATAATTTACTTAAC AAATTAC CTTC AATAC C AC G AATC
CAATATCAGGATCATCTTCCCAATTTTTGTACAGCTTTAGCAACCTATCCACCTAAA CAA
CAGTATCATGCAGAGTTTTTATTTATTATACAAGGTGGAAATTAAGATTGCCAACCA ACT
GGAATATCAAGATTCCCCAGTGTCATGTAGGGTTCATTCCCAAAGCCAAAACAAAAT CA ATTCAG CATAAAAACACG ATTTTGATG GTTG G ATATTAATTAAAACACG ATTTTACTTG GT GGGTATCAATAAAAAAAATCCAAAAAGAGAGCAATAGATTTCAAAAATGATCTTCTTGTG CGTACACACTGTAGATTTCAAAAATGGCAAAAACAGAAGCAAAAAGTAAAGGTCTTTAAA AGG CAGG AAACTAACAACTG AAAAATTG AG AG CATTTAACG CATG G GGTCTGTTAAG GA TTG CTGTTCTCGAAGAAG CTTTTCC CTCCACTAACAC CTGAAAAATG CAAACTCATATTT TATAACAAAATG G AAATTTTGATAATG AATCATAAATTG G ATTG ACAAATTTTCTTTTAAA AAAAATTCAGAACTCACAGTGCTTTGGGATTCATCAACAAGGGCATTGGTAGAGACACT GCAAAAGCTTCTGGAGTGAGAAACCAAGCGCGAATTCTGCAGTAAGCGCCTCAAAATA CTTGCTGATTTGAAGCTCTGCAT
SEQ 10
ATGG CCTTG ACTCTGAAGTCTTTAG CAACTCCTTTG CTTTTG G GTG CTTTCTTTATCCTT
GTATTGCAGGTTTGCTTCTTCATTCTTTGAACCTTTCTACAGTAGGAGTATTTCTAA TTAT
GGGTAGTCCTTGCCTATTGAATTGATGATTTTGTTTAGGTGGATTTAGCGTATTGGT TTA
ATAGCTTTTGATGATTTTACAATTTATATGGATTACCCTTTTCGTATTTCTTCAGAA AGAT
CAAAGATTATTGGGTATTTAGGACATCCTATGCTTTAGTTAAGCGTTGACTTGAAAT CAT
GAGTAGAAGTAAATTTTAATAATTTTCTCATTACCTAGTGAGAATGAGAAAAGAGAA GTA
TAGACATGTTCCTGCAGTTAAATATATGGTTTGATGCAAATTACTTATGTGTTAGTT AACT
TACATGTTTCTATATATAACATGAGAGGAAGTATTGCTTCACAAAGAAATACAGAAA GGG
CAGAAAATGGACGAAAAACAATGGACTAGTTACAGTTTCTGCTTTGCTCTATGATTC TTC
CTCTACGTTATAGTGCTTTAGACATATGATGCATTTGCTACAAATTTTTAATTAATT TTCC
AGGTTAATAGTATTATAATGTTGAAAATAAAAATTGTGGAAGTGTTTCATCTTGATT AAAG
TGAAACCTTTAGTTCTGCGTTTGTGACCCACTGTGTCATACTTTTGTTCCTGGAGTT TCA
ACTCTGTTTCCTAAAAGCTTTTAGCTTGTCTATTGATGAATAGAATTGATGTGTTCT TATG
GAAAGCAGGTTGTTGCAGAGAAGCCAATATCTGAAGCTAAAGTTGAGTCTGCAATCC TT
AAGGTATATTCTTCTGTTCATGATTAAAGAAGATGTTAGTGCCCTTGTTGCTTCAAA TTTA
AAACTTAAGAGCGCTTTTTGGATGATCGTGTAGGAATCTATCATCAAAGAGGTTAAT GAA
AATGCCAAAGCTGGATGGAAAGCTGCATTCAACCCTCAATTCTCGAATTTCACGGTC AG
ATCATCATATATCTTAGCGCTTTCTCTAACCATCAACAACAGTTGCTTATTTTGTTG CTAT
ACACTGAAAACATGCATTTATAATTATGTCCATCTTTGGTATTCACAGGTTTCACAA TTTA
AGCGCCTTCTTGGAGTTAAGCCCGCACGAGAAGGTGATTTGGAGGGAATTCCACTTC T
AACTCATCCTAAACTTTCGGAGCTACCAAAAGAGTTTGATGCACGAAAAGCTTGGCC TC
AATGTAGCACTATCGGAAGAATTCTGGGTCAGTTTCTTCTTGTTCTTGCTTACTACT ATT
GAATTATAATTCTATGATAGTGATCCTAATGCTTTCCGTCTTTGCACATCAATCACT GGG
ACAAATTTG CATCAC AG ATC AG G GACATTG CG GTTCTTGTTG G GCTTTTGGTG CTGTTG
AATCGTTGTCTGATCGTTTCTGTATCCATCACAACTTGGTAAATTCTGTTGAACTGT GTG
ACCACCTCATTTAAAAGCTTTAGATGACGCATTTTTATTTACTTATTTATATATCTT CAGC
ATACTCTCTCATGTGCGAGCCCTGATTCTTTCTCATGGGCCAAGCACGTGGAAACTA TC
TTATATTAGCACAAAATGCTTGTGAAGTTTTCACTATAGTTAATGTCACTAATGTTA ACTT
TTAATGTGTAATGCAGAATATCTCTCTGTCTGTAAATGATCTGCTAGCATGCTGTGG CTT
TTTATGTGGATCCGGTTGTGATGGTGGATATCCTATATCAGCATGGCGATACTTTAT CC
GTAGGGGTGTGGTCACAGAAGAGGTAAATGTTGTCTTATTTTCACCTCAAAAGAGAT TA
CAGCTTTCAGTAAAACCATTAGTTACCGTGGATCTTTATGATCAATCACTAATAAAG TTG
TTTTTATTCTTGCAGTGTGACCCTTACTTTGATAATGAGGGATGTTCGCACCCGGGT TGT
GAACCAGGATATCCCACCCCAAAGTGCCAGAGGAAGTGTGTGAAGGAGAACCTACTA T
GGGGGAAATCAAAGCATTATGGTGTCAATGCATACAGAATCCACCGTGATCCCTACA GT
ATCATGACAGAAATTTACAAAAATGGACCAGTTGAGGTCTCGTTTACAGTGTACGAG GT
AATGACGATAAGGAAGAATGTTAAGTTCTGATCCTAAAACTATTTGATACAGCTTTC CGT
ACATGACATTATCTGAGCTGGTAACCTTATATGTGGTTGCCTACCTATCCCAAAATG AGA TACATGTAATTATTTTTAGGTGACCTATAGTGTAACTGTTATGATAATTGAGAAACTTTA A
CTACCGATGTACCTTCCCAATTTATGTTTGCCCGAGATTTACTTGCAAACTAATATC TGT
AAATGAGATATTTAATGCTAACCACAAGACAATATCAGAAGTTACCTGTTGTCGTAA AAC
TGCATCATCTCTTTCTCGGTGCAAGTAGATTTGTTTAGATTTTGTTTGTTGTCTTTG ATCA
TAACTGTTATCATCTCTTTTTCTCAGCAATGCTTTCCTCTAACCAATGAGTCAATTT TTTT
TATTTTTTTTTTGTCAATCACAGGATTTTGCTCACTACAAGTCAGGAGTTTACAAGC ACG
TAACAG GTCAAAGTATG G GAG GC CATG CTGTTAAG CTTATCG G ATG G GG AACTAGTGA
ACAGGGAGAGGACTATTGGGTATGTAGATGTGTTCAAGTTCTGGTGTCCTGTTTTCT AT
TTAAAAGCATATCTTTTTGTCAAAATCTAATCACCTTATATATCATCTGCAGCTTAT CGCA
AATTCTTGGAACAGAGGCTGGGGTGATGTATGTCCTTAAATTCATCCCTATGTTTTC ATA
TATGAGCAAAAAGTCCTTAGACATAGGCATGCTAGCTTCTTGTTGTTGATGCACTAA CTG
G CAC ATCAATAAATG G ATTTCAACTTATATAAACTAACAACGTAAACAATTTTTG CACTAT
ATTTCAACTGGTAAAGTTATCTCTGTGTGACCTATTGGTCACGGGTTCGAGCCGTGG AA
GCAGCCACTAATGCTTGCATTTGGGTAGGCTGTCTACATCATACCCCTTGGGGCTAC G
GCCCTTCCCAGGACCCTGCGTGAACGCGGGATGCCTTGTGCACCAGACTGCCCCTTT T
ATATTTTAAC CAGTTAAG G CAAGTTATTTACTG CATTTTTTGAAGTTACTCATTTAG G ATT
ACTATAGAGAGTTACATGCCGTCGTATGTCATTTAACCTAATGATGCAAATAAATTG TAT
ACTATTTTAATGCACAGAAGTTAAAGTAGCTTCTTCTCTAAATGAATGTATATCTCC AATA
TGACAGGATGGTTACTTCAAGATCAGAAGAGGAACAAATGAGTGTGGCATTGAACAT AA
TGTGGTGGCTGGATTGCCTTCTGCAAAAAATCTGAATGTGGAACTTGATGATGTATC TG
ATGCTTTCCTTGATGCCTCAATG
SEQ 11
CTATACCATCATACCCATGTTGGAATGTGCCACTCTGACAACAAGTGGAGATGTTACCG
ATGTTCTTTTGTTCCTCCATGTCAAAGAACCAAATACATATCCCTGTGTTGGTGCAG CCA
CCTTGAAGTTCACTGTAAAATTCATCTTCTGGTAATATCTAGTGAAGGCTAATCTTC GGG
GAACCACAGTGACATTGACACCCGTAGGTGCATAGACAACTGCCTTGTAAATGCTTC TT
GCTTTTCCCACGTTAGTAACAGTTCGAGTTACTGAATATGTGCTTCTGAGGTTTGGT ATT
GTGATGGAGGGATAATTTAGTCCATTTGGTGATGCAAAGGTTTGATCACAGGTGCTA TT
GTCCCTTGTAATCAGATGCAGAGATTTCTCATCATAACCAATTGAACAAAGAAATGC TCT
GTAATCTGCTGGCTGTGCATCGTATATAAGACCAGGATCCAGGACATTCGTAGGGTT AA
CAAAGCCAGAACCAAAATCAAATGGAGTAGCTCTCTTCCCTTCAGGATCTACTATTA TG
GGTTTGTGATGCTTATCTGACAGTTTAGCTGCATTAATATTGATGAGATTAGTGCAT TGA
AGGCTTGAATGAAAGAGTTAGATTATGTAAAAGCTTTTATTCTACCTGTCGTCATGA TCG
CGGATTTAATTGCAGAGGGAGACCAAGATGGATGCACAGCTTTTAACAAGGCAACAACT
CCTGTTATGTGGGGGCAAGCCATAGAAGTTCCGGATAGTACATTGAAGTTCAACTTA GT
AGAAGCTGCTGGAGACCATGCTGCCAGGATATTTAATCCAGGAGCTGCAATATCAGG C
TACAAAGGCAAATCAGCCAGGAAATTACTTGCAAGAAAAAGCCAAATCCTCAAATAAGG
TAAGAACCAAAGAAAATGCAAAGTAACAAAGAACTAATCCACAACCACATTGACGAT CC
AGGAAGTAAATATCAGGATGCAACATAAAACTTTGTTGGTCTACATGACAAAGGCAG AG
AGAGATCATTGTTGAAAACAAGTGGCAGTTGAAATTAAGTCCCTATATTACTATTTT TAG
CGCACAAATTACCTTCAAAATTTCTGGTGTTACAGAATTAGGACCTCTTGAAGAAAA TGC
TGCTACTCGAGGAGCAGGTTGAGCTCCCAAAACGGTTCTAGCAGAGAGAATCCTTGC C
ATGGGGAGGCTGTTAATATAATAAGATGTGAGATTGGGAAGAGATGATAGTTAACAA CA
TCAAAAAGTCAAGAAGAAGGAAGTGAACCGTGTATTGTTAATGTAAGCTAGGATCTT GT
TTCCAATCTTTTTCCCAACAGTTGCTGCAGGAATGACAAAAGGGATGGCCACACCCT TG
TCTGCGTCATCTATAAGGATCATCCCAACTCCACCGGCTTCTTTAACTATAATGCTT TTC
TCCATCTTTGACTCACTTGAGCTTCCAGCATGTAGGCACACAAGCACCTTCCCTTTG GC
CTTAGTTCTATTCAAAGAACTATCTAAGCAATAACTAGTGGAAAAGAGAGGGAAGGA AA AAAGTAAATTAAGATAATTGTCAATGCATACATATCTACATTTAACAAACAGCGAAAGTA
CCTTATGATTCACCTGGATTGATAGGGAGTGAAGTATCCAGCATAAGCTTCAGAAGC AG
GTATGATTCTTGTAGATGTATTCATTTGAGATAAGCTAAGACTTTCACCCTTCCAAA TATC
CACAACTTAATTAGAAAATAGAAATTGAATAATAATAACATGTAAACTTGTCGAAAA CTG
GGATTACCTTGAGCCGAACTCCATTTCCTAGTAAAATATCAGAAGTAAAATCTCTAT CAG
TTGAACTGGCTGCAACTGTGATCATCCAAGGAGCTAAATTTGTGGCTGAACCAGTGC TG
CCTTCATTTCCAACTGAAGCCACCACAAGTATTCCGCGGCTAACAGCATGATATGAC CC
CACAGAAATGGCATCATTGAAATAATCTCCTTGGGGAGCATCAGGGCCCAAAGATAG A
GAAATGACATGAACCCCATCTCTAATTGCATCATCAAATGCAGCCAATAAATCAACA TCA
TAGCAACCAGAACTCCAGCAGGTTTTATACACTGCTATCCTGGCCATTGGGGCACCA CC
TCTGGCTCCTCCATTTGCCAAACCTTTGTAATTCATATTAGCTACGTAACGCCCCGC TG
CTGTTGAAGCTGTGTGACTCCCATGACCAGAACTGTCCCTAGCAGACTTGTAAAACA TG
GTCTTCCCATTTTCTTCTTCAGCTTCATAGCCACTCATATAATATCTTGCCCCAATT ATTT
TCCTACATAAGACAAATCATATTGCACTTATCATCTAAATAACAAAAGAAGAGATGG TTG
CCAATCAAAAGAAAACCTGTTGCATATAGAGGCATTGAATGCTTCTCCTGATTGGCA TT
GTCCTTTCCATCCAGCTGGCACTGGAGGCATGTTGGTATCACTAAAACTTGGAGACTCA
GGCCAAATTCCTGTTTCATTAAATATCTTAAGAGCTTAACCTCAAGTTCTAATTAGC TCG
AAAAACAAGGGAAACAGTGGAGCTGGGGACCAGGTTGAGGATAAACTGATAAGGTTGT
GAACAGAGATAATATTTGCATTGAAACATGTCACTCTAATGTATAGTGGCTCTTCCC ATA
AAGTAACATTTACAAGTAGTTCAAGCACACTGTTAGGCATAAATGCAAATGGCAAAATAT
GGGAAGAGGTGAGAAAGATGAACTGGGAAGCTAAGAAATTGCATAAACTTGAGTTTTAA
AAAAATCTAAGCAAACATATTTCATCATTCGAAAATGATTGAAACAAGAATTGATTG ATGA
AAGGAACTACTTTCCTCAGGTTCAGCCATATGTACCCAAATGACAAACTTAGCACTT TTG
CAAAGTCATGTTATTGTACTCTTCTTAAGAAAACTAACAGAGACAAGAGCCCTTTTA AGT
GACAATACATTAAATGAAGGGACCAACACTAACTTGGTTGGTGCATTCCATCATTAA AC
GATCATATCTTTCACCTAACTCGGAAAAGATTGCTAGAATTTAAGATAATTAAAGCA AAA
GGAACAGAGAAACCACCTGTATCAATGAAACCAATGATTACATTAATTTGGTTCTTG GTA
GAAAAACCTGGAATTTCCATTGTTTCATCATCACTGAGCCCCATAAAATCCCATGAA TGA
GTTGTGTGTAGGCTCCTCTTAGTATTTGGAAACACGGATACCACTCCAGGCATTTCT GT
TTTATTTTAACATTAAAGACAAATTTCTCAGTACTTATTCATATCATTACCTTAAAG AAAAA
CATTTGGCATGGGCTTACTGGATATTTCAGAAGCCTGTGCCTCAGTCAACTTGGCTG CA
AAGCCTTTAAAACCATGCCTATAACTATATACATGTGAAGTCTTGGCTTGTTCAATG CTG
TAAAGTTACAACTTCAGTTTTTTTGCTAAAAGCAACCAGTGTAAAACCCAATGAACCAGC
TCAAAAAAGGGAAAAACCCTCATAGCTAAAGGTAACAAGAACTGACCTTCCTTTATG AAT
AGCAGTCAGCATTTGATGGTTTTGCCTCAAAATCTCATCTGGGTGTTCATCACTATCTTT
G CTTC C C ATGTAC AC C AC ATATAACTG G AAAAAAAAAAAAAC C AAG AAC AC AACTTTACT
AACTATTCATCATTTCAAAACATCATTCAAATATACACCCTCAATAACAGCACATAA AAAC
CCATATCAACATACAGACTACAGAGCCAAAGTTTATTTACCTTGGAAGAAAAGCAGA GG
CTAATATCTCCAAGAAAAACACAAAGAAAGAGTAAAAGAAGAGTCTTTTTTAGAACA CCC
AT
SEQ 12
ATGGGAGCAAAAGCATTTCTTGTTGCTATGTTTCTCTCAGCACTGTTATTTCCTTTTGCC
TCCTCATCCAATGATGGCTTGATGAGAATTGGCTTGAAAAAAATGAAATTTGATCAA AAT
AATCGGCTTGCTGCACGCATTGAGTCAAAGGAAGGGGATGTTTTGAGGGGGTCGATT A
GGAAGTATAACTTCCGTGGTAAACTGGGGGACTTTGAGGATACAGACATTGTAGCATTG
AAGAACTATATGGATGCTCAATACTTTGGGGAGATTGGTGTAGGCACTCCACCTCAGAA
GTTCACTGTAATCTTTGACACAGGTAGCTCGAATTTGTGGGTGCCGTCGTCGAAGTG CT
ATTTCTCTGTAAGCTTCTATACATGCAAATGATACAAAGGATAGCATTGAACATCCA TCT TGAGTGATGTAAAATTTGATGACTGCCTATCTTGGTGTAGGTTCCCTGTTTCTTTCATTC
CAAGTACAAATCAAGTGAATCAAGTACTTATAAGAAGAATGGTATGTTATGTTTCCA TTTT
TGTATATTGCTTCTCTCTACCATCTGGTTGTTTATTGCTGCATGAAACATATATATG CTTC
CTTCTAGTGCCGGTTATTGTTTAGAATATGTGCCATCTTGTTCATTTTAGAACACTT TTTG
TATTGTCCTTATGTGTTTCTCACGGTGCATCAAGGGATTACATTGGAAAAGTTAAAT GAT
GAGTACATGTAGTTGACTGTTGAGACATAAAAAGAGGCTGTTTATGTTATGTTTCTT AGT
ATATTACTGTAACTAGTGAGGTTCCAGAAAAGAAACACCAATATCCTTATCTCTCTC TGC
GATTAGTGCTTTTTGGTTGCGAGTTGTATAGTTTTAACCTCTGCAATGCCTCTTTAG GTG
GCCTTTCTTCTTCCCTATTAACTAGGTTTTGTATCCATGTCTTGGCATGTCTGTGAC ATA
AGAAATTTTCCGCAGAAAACTAGTTATTCTTGATTTTTTTGTTCTATTACAATTATA TATGT
TGCTCAAAGAAAATGAATTGAACAGTCTGTAGCAATTGAATGGGTTTGCCAGATGAG TT
CACTCCAGTGTTATTGAAGGCCATTGCTTGGTCGTTTAAAGTTATGAAGCGATACAG CA
AGGGCTATCGCTTCATCGCTGAAGCCATGCACTTTAGAGAAGTTTGCACTTCAAATA AT
GGTGCACAAAGTGATTCCAACGAGAATTTATTGTTTCATTGAACATCAACTTTTAGA AGT
TGGATTAATGTTGGCATTTTCACAAAGTGATTCCTGTTGGGTATAAAAATAATATTC ACG
GTATTAGTGATAAACGCGGAACACTAAGTTATGCTTAAATCAGTAAGAATAAAAATG CAG
CAAAAATGACACCAAGATTTTACCTAGAAACCCTTCTGAATAAGGGAAAAACCACGGCC
AAGAAGAGCAACTGATATCACTATAGCGAGGATTTTACACTGTGTAGTAACGAGTACGA
ATACTCCTAAGACCACTACACCCTCAAAAGAAATAAACACTCTTTTGCTTTTTCACC TCA
CTACAATATCTCTCACACTCTATTTTCTTTACAAACTATTTTCTTATAGTTTATGGA ATACC
TTGCTCTCTCTTTTTTCTCTCTTTGTTGGTGTGTAGAAATGAGAGTTAAAGCTCTCC TTTT
ATAGCCAAAACCTCACTCTCTAAAGCCTACAATATTTGACAATTTGCACTCCCTTTT CAC
AATTTCAACAAGGTTGGCTACCAAACCAAACCAAATCAATAAAATTGTCTACCAAAT CAT
ACCAATTCAACAAGGTTGGCTACCAAACCAAACCAAGTCAACAAAATTGGCTACCAA AT
CATTTGAATGAGATGGAAACCATATCAATCTCCCCCTCCAGTCTCATTCATCTAGAG GA
GGTAGCACTGTCTTCTAGTCTGAGTGCATGCCGACAAGTTCTTTGCATAGCTCGAAC TT
GTCTCTTGGTACCACCTTGGTCAACATATCAGCAGGATTTTCACTTGTGTGAATCTT TTT
GACCTGAAGTGATTCGTTCTCCACTTGCTCACGAATCCAATGATATCTGACGTCGAT GT
GTTTTGTTCTCGCATGGTACATGGAGTTTTTGCTTAGGTCTATTGCACTCTGACTGT CGC
AATAGACAACATACTCCATCTGTTGCAATCCAAGTTCTTGAAGAAATCTCTTGAGCC ATA
TCATCTCTTTGCCAGCTTCAGTAGCCGCAATATACTCTGCTTCAGTTGTAGATAGTG CG
CACTTCTGCAACTTCGACTGCTATGATATAGCTCCCCTTGAAAAAGTAAACAAATAT CCG
GTAGTGGATTTTCTGTTATCAAGGTCACCTGCCATATCAAAATCTGTATAACCTTTC AAA
ATTGGATTTGATCCTCCAAAACATAAGCATTCACGAGAGCTTCCTCTTAGATACCTG AGT
ATCCACTTTACAACTTCCCAATGCTCTTTTCCTGGATTTTCGAGAAATTTGCTAATA ACAC
CGACTACATGAGCAATATCTGGTCGATTGCATACCATTGCATACATCAAACTTCCGA CA
GCAGAAGAATAAGGAATCTTGGCCATTCTCTCTTTTTCCTCCGTTGTTGTAGGACACATC
TTCTTACTCAACTTCAAATGACCAGGAAGGGGTGTGCGAACCAACTTAGCACTTTTCAT
ATTGAAGCGCTCCACTACACGTTCTATGTACTTCTCCGGTGACAAGTAAAGCTTTCT TTC
GTTTCTCGAACGAGTAATTCTCATGCCCAAAATCTGCTTAGCATGACCCAAGTCTTT TAT
TGCAAAAGACTTATTCAACTGTTTCTTCAACTCGTCAATCTTGGATGCATTCCTGCC CAC
AATCAACATACCATCCACATATAGCAAGAGGATGATAAAATCATCATCAGAAAATCT TTG
TACAAATACACAGTAATCTGAAGAAGTCTTCTTGTAGCCTTGCTCCCCCATAACAGA CTC
AAACTTCTTGTACCACTGTCTGGGAGCTTGCTTCAATCCATATAGACTCTTCTTAAG TTT
GCATACAAGATTTTCTTTACCTTTTGCATTGAAGCCTTCAGGTTGTTCCATATAAAT CTCC
TCTTCTAAGTCACCGTGAAGAAAAACAGTCTTCACATCCATCTGCTCAATCTCCAAA TCA
AGATTGGCAGTTAAACCAAGAACTGTCGGAATGGAGGACATTTTCACGATAGGAGAA AA
TATTTCGTCAAAGTCAATACCTTTCCTTTGACCAAATCCCTTGACAACCAATCTAGC TTT
GTATCTGGGCTTCAAACTATGTTCTTCAGCTTTAACTTTGAACACCCACTTGTTCTT CAA
AGCTCTCATGCCCTTAGGCAATTTCACCAACTCATAAGTATGGTTCTCATGCAGAGA TTT
CATCTCATCTTGCATGGCTTCAATCCATTGATCCTTGTGCTCATCTTCTATGGCCTC CGC ATAACATTCAGGTTCTCCCCCATCAGTGAGTAATACATATTCATTGGGTGAATAACGGG
AGGAAGGAGTACGAGGTCTAGAAGACCTCCTGAGTGGAATATCTAACTCGTCCACAG C
TTCGTGAGTAGGAGCATCTACCTCATCAACATTAGCATTGTTATCACCATCACCATC AAT
ATGCTGATCCAGAATATGGTTCTGGGCATCACCATCATCATTGAGCCCACCAACGTC AT
CCACATTTGTATGAGGAACTTGATCAAGATTAACTAAACCTTCAGAACTTGAAGATT TTA
GTTTCTCCGCTTTGTTAATATCTTCAATGGTTTGATCCTCCACGAAGATAACATCAC GGC
TTCTCACGACCTTCTTCTCAATTGGATCATATAACTTGTAACCAAACTCATCAAGGT CAT
AACCAATGAAGATGCATTGCCTTGTCTTGGCAGTTAATTTTGACCTCTCATCTTTAGGCA
CATGTACAAAAGCTTTGCAACCAAACACTTTCAAGTGGTCATAGGAAATATCCTTGC CAT
ACCAAACTTTGTTTGGAACATCACTTTGCAAAGCAACCACAGGGGAAAGATTAATAACAT
GTGCGGCGGTCAACAAAGCCTCACCCTAAGAGGAATTCGGCAACTTTGCTTCAGAAA A
CAAACATCTGACTCTTTCCATCAAGGTCCTATTCATCTTTTCTGCTAAACTATTAAG CTGA
GGAGTCTTAGGAGGAGTCTTCTGGTGTCTGATACCCTGTTGTTTGCAGTATTCGTCA AA
CAGTCCACAATATTCACCACCGTTATCAGTACGAATACACTTCAGCTTCTTTCCAGT TTC
TCTTTTAGCTGAAGCCTAGAACTGCTTAAAGACACCCAACACTTGGTCTTTAGTCTT CAA
GATGTAGACCCAAAGTTTCCTTGAGCAATCATCAATAAAGGTAGCAAAATAAAGTGC AC
CACCCAAAGTC CTTGTCTTCATTG G ACCACATACGTCTG AATG CAC CAACTCAAG CAAC
TCGTCTTTCTTGAAAGAAGATGAGACTGGAAAGAAACTCTTTTTTGTTTTCCAGCCA AGA
AGTG CTCACATTTTTCTAATTTTGCACTTTCAAAATTTG ACAACAATTTCTTCTTG G CTAG
AACATTTAGTCCTTTCTCGCTAATGTGGCTAAACCTCTTATGCCATAACGTTGAAGA GTT
ATGGCTCTCAACGGCATTCACCATATCAACACAGGTAGAGGTCGTAGTCCAATATAG AC
CACGACGCTTTTCCCCACGAGCCATAATCATGGAGCCCTTAGTGAGCTTCCACTTTC TA
GCACCATTGGTACTGACATATCCCTCATCATCCAAAACACCAACAGAGATCAAGTGC AA
ACGAACATCAGGTGCGTGCTTTACATTGTTTAAAACTAGTTTAGTTCCAATACTAGT TTC
CAAACAAATCATTCCAACACCAGTCACCCTAGATAAGTTCCAAAGTCACCCTGAGTA TA
GGATGAGAAAATCCTTCCTTGATGTCACATGAGATGCGGCACCACTATCCACAACCC AG
CTTGACTCATCACAAGCAATATTTATCAAATCCGCATCAAGGACAATAACAAGATCT TCT
GTAGTGACGGTGGCCATACGATTGCCATCTTCTTTCTGTTCTTCCTTGTCTCTATTC TCC
TTTTTCAAAATCCGCAGAACTTCTTTGTGTGCCCTTTCTTCCCGCAATGATAACACT TAA
TATCTTTAAGTCTGCTTCTGGATTTGCTTCTATTATGTTCTCTATTTTGAGAACCCC GATT
CTTGCTTCTCCCCCTAGAGTCAGTCACCAAGACATCTGATGGGGAGGAACCTTGAGA TT
TTCTTCTCATCTCTTCATTTAAAAGACTGCTTTTGGCAAGATCCATAGAGATCACAC CAT
CCGGAGCAGAATTTGATAATGAAGTTCTAAGAATTTCCCAAGAACTTGGTAGGGAAC CA
AGTAG AAACAG GC CTTGAATTTCTTCATCAAATTTAATGCTC ATAG CAG ATAACTG GTTC
ATGATCCCCTGAAAATTATTCAGATGATCTGTCATCGCAGAACCATCATGGTATTTT AAA
CCCAACATCTGCTTTATCAGAAACATCTTGTTGTTTCCAGTTTTCCGAGCATACAAA CTT
TCAAGGTGCTCCCATAGGGTCCGAGCATGTGTCTCCCCAGAAATATGGTTCAAAACA TT
ATCGTCAACTCACTGTCTAATAAAGCCGCAAACCTGCCTGTGTAACAGATTCCACTC TT
CATCTGATTTATTATCAGGCTTTACAGTGGCGAAGACAGGTTGATGAAAATTCTTGA CAT
AGAGCAAATCTTCCATTTTGCCCTTCCAAATGGCATAATTTGTGCCATTCAAAGTAA CCA
TTCTACTAGTGTTGGCTTTCATCGTTTATCACAAATACAAATACTATTTATTATGAG ACCA
AAGTAATTCTTTTCTGATGTGGAAGTTCAGACTGTGCTGCAACCACAGAGCATACTC AN
N N N NTATTTATTATGAGACCAAAGTAATTCTTTTCTGATGTGGAAGTTCAGACTGTGCTG
CAACCACAAAG CATACTCAAACAG AACCTTG G CTCTG ATACCACTTGTTG G GAATAAAC
CCCGTAAAAATAATATTCACGATATTAGTGATAAACGCGGAACACTAAGTTATGGTT AAA
TCAGTAAGAATAAAAATGCAGCAAAAATGACACCAAGATTTTACGTGGAAACCCTTC TGA
AT AAG G G AAAAACTAC G G C C AAG AAG AG C AACTG ATATC ACTATAG C AAG G ATTTTAC A
CTGTGTAGTAACGAGTACGAATACTCCTAAGACCACTACACTCTCAAAAGAAATAAA CA
CTCTTTTGCTTTTTCACCTCACTACAATATCTCTCACACTCTATTTTTCTTCACAAA CTATT
TTCTTATAGTTTATGGAATATCTTGCTCTTTTTTCTCTCTTTGTTGGTGTGTAGAAA TGAG
AGTTAAAGCTCTCCTTTTATAGCCAAAACCTCACTCTCTAAAGCCTACAATATTTGA CAA TTTG CAC ATCCTTTTCACAATTTCAACAAG GTTG G CTAC CAAACC AAACC AAGTC AATAA
AATTGGCTACCAAATCATACCAATTAGTAATGGTGCACAAAGTGATTCCAACGAGAA TTT
ATTGTTTCATTGAACCTCAACTTTTAGAAGTTGGATTAATGTTGGCATTTTCACAAA GTGA
TTCCAACTAGAATTGGTTGCTTCTTTGAACCTCAATTTTGAGGAGTTGCATTAATAC GGG
GATTATTGTATATTGGATGCTGAAATTAGTTATTTCAACTGCAATTTGATTTTTATT GTAG
AGTAAATTAATAAATGTTTGATATTTTCTTGTTTTCTGTTAATTGTGCGCCTCACTT CTCG
CTATTCACTGCAAGTCTGTGGACCTTGTTTTATTTTGTTGCACGTTTTGGTTTTAAG AAC
ACTAGGTCACTCCTACCTAGGGGTGTCAATGGATATTAGAAAACCGACTTAACCGAC CG
AACCGTACCGTACCGAACCGATTTTTAGGTTTCTTTTAAAGAAACCGTAGGTTTTTA TAT
AAATCTATAATCGTACCGATAATTAGGGTAGGTTTTTTATTTTATAAAAATAAACCG AAAA
AATACCGAACCGTACCGAATAAGTTTTACATATGAAAAATATATTCATATAGTAAGT TTAA
AACTAGTAAAGTATTAAATTTTTCATTGGGTCTTGGAATTATGAAAACTGTTACAAG CCAA
TAAGTAATTAAACTCAAAATACTAATTCCTAAAACN N NNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNN CTTTTATATAATTTAGATTTATCTTTTTAAATATTTAATATAGACTTTA
TTCTTGAGTCCCAACTTGGTTAATATCTTTCCACTCGTGTGATTTATATCTTCTTTG CTTT
TACTTGGTTTCTTTTCGTTGGTGTCGAATAGTTGTGTATTTATACTCTAGCCATCTT TCAT
GTTTTTTAATTCATTATCCTTTAAACAGTAAAAATGTCTAGAGAGTTTCGCTAAGTC CTAT
AAAAGAACGTACGTTATTGCATTCTATTTTTACTGGTGAATTTTATATGACATTTAA AAAA
TACCGAAAATTAACCGAACCGTACCGATACCGAAGTGAAACCGACATGATTGGGACG G
TTTCGAAAAGTCTAGTTTTGGTTATACATAATAAAATAACCGAAAAATTGGTATGGT ATAA
ATTTTATAAAATAAG C C G AAC C G AAC C ATTG AC AC C C CTACTC CTAC CTTAATG AAC AAC
ATTTATAACTTATCGAATGTTTCCACAAGACAGAGATCGTATTTGGCATAGTAATTT TAAT
GTGATTTCATTCGTGATAAATTTTTGATTTTCCATATAGGGAAGTCTGCTGCAATTC AGT
ATGGTAGTGGAGCTATTTCTGGATTCTTCAGTCAAGATAACGTCAAAGTTGGTGACC TT
GTTGTAACAGATCAGGTGAGTGAGGCTCCTCACTTGTTTTAAGTGTCTTGAAGGTAG AA
ATCTTACATCTCAAGGTGTTCATGTACTAGGAATTTATTGAGGCAACCAGAGAACCC AG
CGTGACATTTTTGGTAGCCAAGTTTGACGGTATATTGGGTCTTGGATTCCAGGAGAT TT
CAGTTG G AAATG CTGTTCCAGTGTG GTACGTG GACAGCATTTAGTTTG CTCTCTTTCTTT
CCCACAAACCAAATTAAAGATCTAGACAATTCTTTTTTCCAGATGAGGTTTAACACA TTG
AAGGATTAGATTTCCATAAATGCAGGATAGGCTGGATGTCTCTTTCTGTTTAAATTT GAT
TTGGCATTCATCTGGGCAGCATCTCCTGTCTTTTCCAGGTACAACATGGTCAAACAG GG
TCTTATCAAGGATCCTGTCTTCTCGTTTTGGCTCAACCGAAATACAGAGGAAGAACA AG
GCGGAGAAATCGTATTTGGTGGGGTTGATCCTAATCACTATAAGGGAGAAATAACTT AT
GTTCCAGTCACACAGAAAGGTTATTGGCAGGTTTGTCATTCCGTCAATTCGTTACTA TGT
ATGTTCATGTTTTGTACAAATGCTATCTTAATCTTAGTAATATGATTGGCAGTTTGA TATG
GGTGATGTTCTTATCGATGGTAAAGCTACTGGTATGTTTTGCTCTGTACCTTTTGAA TTG
GATTGCTGAATTTTGCGAATATAGATGAGGGCTATGTGCCTGGATATAGTCTTCCTT TGA
GTTTTTAACATGAACATATCGGGTGTAGGTTACTGTGAAAGTGGGTGCTCTGCAATA GC
GGATTCAGGGACTTCTCTCTTGGCTGGTCCAACGGTATTATCTATAACCTTGATTTT GGA
CATATCGCTTTTTTGTTATGTTTTTGTTATTTGTTTCCACCATCAAAGTCAACCGCA ATTC
GATTATTGATTTCCTTGTTCCTTTGAACCTTCAACAGGTTGCTCAGCAAGAAAAGTA ACA
TTTTCCTAAGATTCATGAGTTAAATAGAACGAATCCAGTGTCTGATCTGTGTTGTGT TTT
AATTTTTCATCACAAAGAGGATGTTAGTGAAATTGGTTTTTCATTTCTGAGGTTACC TTTG
GATATACCATCTTAGCTTATCACTGGGTCGATGTCTGTAAATCTGCTTTCTCTTTCC TGT
CTTTCAGCTTAATATGTGTAAACCTGCCTCAGTAGGTACTTGTTGTTTTGTGTTACT TGC
TTTTATTACG CTTCAATGAG GATG ATG CTTTTACTTCTG CTTTAGTCCTAGTGTTCTGTC
GACTTATTTATTTACCCCTCTTCTATTTGAAAGAAAATTATCCAACCCGAAAACTGT AGC
CAGGTTTTATCCTGTTG ACTTG G AAATG CTGACAATTAAAT GAAATAAAAATCTG CTG CC
TTTTGTCTCTACAAATTCAGGGGCGAACGTTTACCCAAACGGTGTCATCCGACACCG CT TGGTCGAAATTTTTTACTGTATAGACATATATATGTGGGAAAAAACAGTACGCAATATAA
ATTATAAATGACCATTTATGCGTGTAGCTGTCTCTGGTATAATGGTCAAGTGCTGTT TTT
TCCCTACTTGACTTGATGTCGTGGGATTGACCTCAACGGATGGCATTTTTTTATTTC AAA
TTTTTTAGCATGGTCCTTTTAAAAACTAGAGTTTATATAGAGTTGAACCTTCAATAT CATC
TTAAAAAAACAACTAAATCTGGGGGACATGACGGCTTCCACGTTATAAGTGATAAAA AAT
TTATTTGAATG CATAAAGG AAG CTGTTTAAG CATACATAATATAATTTAG AATAATAATTT
TTAAAAAATTATTCGACACCGCTTACTAAAAGTTCTGCGTACGCCCTTGTCTACAAT TGT
ACGATTAAAACACTTCCAAGGTTGAGCAATCTTAAGCCCACCCTTTTCCGTACAGGC CA
TAATCACTATG ATTAATCAAG CCATTG G AG CCTCTG G AGTTG CTAG CCAACAATG CAAA
TCTGTAGTGGAGCAGTACGGGCAGACGATCATGGATTTGCTGTTAGCAGAGGTGAGC A
TTCAACTGTGTAACAGCATTTTCATTTGTTTGATATGCAGTTTCCTGTGCAGTTACA GCA
AAATGACACTTGGCAATCTAGTTGCCTTTCCCGGTTATCTAATTGGTCCGCATTTAA ACA
GGCACATCCAAAGAAGATTTGTTCACAGGTTGGAGTATGCACCTTTGATGGAAACCG CG
GAGTTAGGTTAGTCTTCAGGCTCTCTCTGCCCCTTCACGTGAACATATTGTGCATTT TGT
TAATCCATAATGTTGATATAAATACTGAATAATTCTGTGGCCTTTTTTCTGCAGTAT GGGA
ATTGACAGTGTTGTAGACGAGAAAGCTGGCAGATCCACAGGACTGCAGGATGGTATG T
GCTCTGCTTGTGAAATGGCGGTCATATGGATGGCGAATCAACTGAGACAAAACCAGA C
TCAAGATCGCATATTAAACTATGTGAATGAGGTAAAACATCTGTCACTGCAATTTTC TCC
TTTTCTTTGAAAAGAATGCTGACTGACTGACAGCTTTGCGAGCGTCTCCCAAGCCCA TT
G GG G GAATCAG CTGTTGACTGTG GAAAG CTTTCTTCAATG CCTAAAGTCTCTTTCAC AA
TTGGTGGCAAAGTGTTTGATCTCTCCCCAAATGAGGTATGCTTTATAATGGTGTTTG CG
AGTAATAAACTTTGCTATGCCTCTTGTAATTCAACATTGTCAGAGAGAAAAATTAAG TGG
CTCATCATAGAGTCTTTTTGTGGCATTACAACTAATTGTCATATTACAAAACCGATT CTAG
TGTTTAAGGTTTTCAGTTTCTACAGAGAAGTAGATGTTTGTTGTCATCAAGAAGTAC TTC
TATATTTTGAGGTTGTGAGTATTTCGATTTTACATCAATTTAGATGTCTCATTTTCC AACT
AAGTAATCTAAGAGAACCTCCGTAAGATCTTTGTGCCGTGAAAAAGTTACTCTTTGC TAT
TTGTATGGAAGTGTATTACATGCTGACCTTGCTAATGCAAGCCGCTTGAGAAGGCCC AA
TGAATCGAGATTTAGATGTCGTCCCCCCCACCCCCCGGGCCCCAAAAATAATAATAA TT
TATATCTGGATGATTCAGCATGGAACAATTCTCTACCTTCATAGACAGGGGTAAGGT CT
GCGTACACACTACCCTCCCACACCCCACTTGTGGATTCCACTGGGTTGTTGTTGTTG TT
GTTGATTCAGCATAGAACAAATCTTACAAAGAAGAAAATTCATCAAGTGTACACCAA ATC
AATTTGATCATTCACGTTTTTTCTAAATTTGTTTGTTTTGTGGTTTAGTAAACATAA AGAG
ATTGAGCTTTGAATAATTCAGGATAGATCACATCTTATAAAGGAAAAAATTCAAGCG AGA
CACCAAATCACTTGACCATACGCAAAACTTTTCTTTACCAGACCATCACTCTAGTCT TTT
CCTGCCCTTTCAACAAATTTTATTCAAGAATGAGAGTAAAGTTGTGATCCGATATGT GGT
TCTGTTTTACTTGTTTCAGTTGTTTGTTTAACTTGCTGACCTAGATTGATCTATATT CTAA
TTTTGGATTCTTTTCCCGTGATCAGTACATACTCAAGGTGGGCGAGGGTGCTAAGGC AC
AATGTATTAGTGGTTTCACTGGCTTGGACATTCCTCCTCCCCGCGGACCTCTCTGGT AA
AGTTATCTTTATATGTTCTCTCTATTTTAGTTTATAATTATGTTTTTTTGAGATAAC TATCA
GCAATGTAGTTAATCCAACAATAAGCTTAATCCGAATTATAGAGCTACGGCACAATT AAA
CCCGAGCCAGCAATTCTTTTGTTCACTGAGTTCTCTATGGTATTCCTTTGGTTACAG GAT
CTTGGGTGATATTTTCATGGGTCGATATCACACAGTTTTCGATTATGGCAAACTCAG AGT
TGGATTTGCTGAAGCAGCT
SEQ 13
ATGACTTTTTTCAGGTCGTTCTTATTCTTTCTTCTCACCTTATTTGTTATTTCATCTGCA CT CGACATGTCCATCATTAGTTACGACGAACAGCACGGCCAGATGGGGACAACACATCAT CGTACTG ACGATG AAGTCAG AG AATTGTAC GAATC GTG G CTTGTTAAG CACG GAAAG AA TTACAATGCCATCGGAGAGAAAGAGAGAAGATTTGAGATTTTTAACGATAATTTAAGATT CATCGACGAGCACAACGCTGAGAACCGCTCATATAAACTTGGGTTGAATCGATTCTCTG
ATCTTACCAACGAGGAATACCGTGCCATGTTCGTAGGTGGACGGTTGGATAGAAAGA C
GAGGTTGATGAAGAGCCCTAAAAGTAACCGTTACGCTTTTCAGGCCGGCGAAAAGTT G
CCGGAATCCGTTGATTGGAGAGAGAAAGGCGCCGTTGCCCCTGTTAAAGATCAAGGC C
AATG CG GTG AGTTTTTTTCTTCTTCAAAACTTTCCTACTATAAAG GAAAG CTCTG CTCTTT
ATCGTAAACATGTACTTTTGTTTTGTCTGCTTACGGAGTGAGACCAAGAGGAAGAGT TT
GGATAGATTGTTGAAAGGAGTCATATGTAGGTCAAAAGTTTTTGATTTTTAGGTTGT TTT
TTGACCTATGTTGTCGTCTTATACGGTCAATGATCTGTTATTGGGTAACTAATGATT CTG
TTTTCATGTTTATTTCAGTCAACAAATTGGAGAATAAATTAATTGCTGCTCTGTCTG GTAG
TTAATCTTCATGATATACACCTAAAGCTTACATCCTGATTTAGTATTTGGTGTCTCC AATT
GGAATGTTTATTTGCTTTGCTAGTGTTTCCTCTCTCTCTCTCTAGGGTAAATATAAA AAG
ATCTAAAATTTAGAGGTACCTGGTGTATATCTTAATATATTCCATGTACAAACTTTA AAAA
ATTATTTAAGCTTCCCCTAATTTGTTTAATACGCTGATAAGGGGTAATCAAAAAGCA TAA
AGATTAGATTGAACGGACACAGTATATATTTTGCTTTTGCAAGTTGATCAGTTTCTT TCTC
CATTCTAAATCGGAATCGACCAGAATTTAAAGCGGTATAACTTAAGATTAAGCCATG AAG
ACATATTTGGCTATTCTAGGTGTTATAAATTTTAACCCAAGTGTCCTAGGGAATTGA TGT
TTAATCTTGCTTTGATTATGACGAAACCCATATCTCGATTGGTTAGATATCAGTATA TCTA
TGTTATGTATAG AATCCTC GTTTG AAATTTG AG ATTTTCTTATG AAG G GAGTTGTTG G GC
ATTCTCAACGGTTGGCGCTGTTGAAGGAATAAATAAAATTGTAACGGGTGAATTAAT TAG
TCTGTCAGAGCAAGAGCTTGTTGATTGTGATAGGAGTTATAACCAGGGATGTAATGG CG
GTCTCATGGATTACGCCTTTGATTTCATCAAAAATAACGGTGGCATTGACACTGAAG AT
GACTACCCTTACCATGCTCAAGATGGCACTTGTGATCCATACAGGGTAAGTAATTAA CC
ATACTATCAAGAAAACATCCAAATATTAATTATGTACTATTTCAGAATGTAAGTCTA TATA
GCAAGTAATTAATAGTATTTGCTGACAAAATTTGGTCATTCAGAAAAATGCCCGTGT TGT
CTCCATTGAAGGGTATGAAGATGTTCCAGAAAACGATGAGAAGTCGTTGATGAAGGC A
GTGGCAAATCAACCAGTTAGTGTTGCTATTGAAGGTGGTGGCAGAGCTTTCCAGCAC TA
CTCTTCGGTATGGTGGGCGGATCTTGACTAATATATCCTTCTGAATATATATGTTAT TTG
TGTCTGAACTCACTGGCCCTAAATTCTGGATTCGTTATTGCATTTTAGTATGCCTGT GTC
CCTAATCTGCAAACACGGCTGCATTGTGCCTTGTTTTACTACTTAAAGCTAGTATAC TCA
TTTACCCTTCCAATTTTTATCAAATCATGCAGGGTGTTTTCACTGGATATTGTGGAA CGC
AACTAG ACCATG GTGTAGTTGTAGTTG G CTATG G AACAG AAAATG G CG AAGATTACTGG
ATTGTGAGGAATTCATGGGGTGCTAACTGGGGAGAAAGTGGTTACATCAAGCTTCAG C
GCAATTTCGCTAATTCTACAACTGGAAAGTGTGGAATTGCAATGCAGGCATCTTATC CT
CTTAAGTCTGGCGCAAATCCTCCTAATCCTGGTCCATCTCCTCCTACTCCTGTAACA CC
ATCAACTGTTTGCGATGAGTACTATAGCTGCCCACAGGGCACTACTTGCTGCTGCAT TT
ATCAATATGGCGAATACTGTTTTGGCTGGGGATGCTGTCCTTATGAGTCTGCTACCT GT
TGTGATGATAACTACAGCTGCTGTCCCCATGATTATCCTGTATGTGATGTTGATGCT GG
CACTTGCCTTATGGTAAATATTTTTTCCCTCCCATTCTGCTTTTTTCTCCTTTATAA TAAT
GATCGTCAATTTCACTTATTACGTGTAATATTCTACCAGCACAGGATTAATTAGATA ACT
CTGTCTACCAAAACTTTGGCAGATATTTAAACCTTCGTCTTCACTCGTTTATTGACC GCT
AGACCCACGTACAGATTCAACCTTTTATAGGTTTAATCATCAATGCAAGACTACTTA TCA
CAATCTTTTTTCTTTTTATGTGACAGAGCAAGGACAATCCATTAAAAGTAAAAGCAT TGA
AGAGAGGTCCAGCTAGAGTAAACTGGTCAGGGATGAAATCTAACAGGAAAGTGAGTT A
CGTT
SEQ 14
TCATGAAGAAACAATGATCAAATAATAGCTAAAAAGGGAAAACAGAGCCATCATAAGTT
GGCAAATGTAGGAATTGAAATGTGCTGGTGCATGGTTTATTGCAGGTCTAGATGACG GA
ACAGATGGAAACGAAGTAGCGGGTTCATTTCCACTTCCATTTCCCTTGGTGGCCTCT GG CACCACACTGGAGGGCGACGGCGCTTCAGTAGAATTACGTTTGTTCACTGGCAGAGTT
GTCGATTTGTCGTTGGATTCTCTAGAATCATAACCTGACAAGAATGGTTTCTAGTTA AAA
TATGG ACAG GTGTG CACACTAAAAAG GTCATACTCATGAATGCAAACTCACAATCG G AT
GGTTTCCAACCCAAAACCATCTTCTCTCGATCAAAAACCACGCGATAGCCTGTCATA AA
ATTTTCTGCAAAAAGATCCATGTATTAGTTTTCTGTCATAAATTCCAAATACAGAGA CAAA
ATCGAAGTAAATCAACAAATAGCTCAGATTTTTGACTATGTAACCAGTTTTACTAGT TGG
TTATGGACTTGATTCTTACTAATTTACTTTGTCACATTCACATTTCCAATAATATAT AAGCT
AGAAGTATGTAAAAAACTTGATAGGAACCAAAACTTCTAAAAGTTGGTAATTGTGAG ATC
ATTAGCTGGCATGGTGCAAGTTATATTGCAAAATTCCATGGATAGAAACAAAATCTA CGA
GCAAGCAACTGATAGATACTTACGTCCAATGATGTTGACATCCCCACTTTTCACAAC AG
CTAAGCAAAATGCGCGAGAACCATCCTGATGATCATGACTGGGTATTAGAATATTCT AA
AAGAGAACTTCTGTAATAAAGAAGGAGCAGAAACCATCTTACCTGGAGCGAGAGCAT AA
TTATCGGATCGAAAAGAAAAAACTGGTTGCCGCCTTTCATTGTCAAATTTAAATCAG GAA
CTTCGAATGTAGTTTGATTTGCACTGCGTATGTAGCATGTTAATCCTGAGAGTTCAA AGG
ACATCAAGAAAGTAAAAGTGATGAAGATTATAAAAGATGGTTCACCTTAGCCCGTAG CA
GTATTCAAAAGGAATTTCGCCATCAGGTTGAATACGTAGCTGTTTTGCTTGAGAATC AAA
CTGAAAAAAGAAACAAAATATCTTCAGTTTCACAATACAAAGAACAAACTCCCAACT AAC
AAATCATACAGTCAGCCTGTCACTCACGTTCTCTGTAATGACTTTGTAAGCTGGGTC GTT
CAAGTATGTGAATGAGGTGCCAGAGTCAAAAATGGCTGTGAAATCAACATCAGTGAT CT
TGTTTCCCACTGTTATTCCTGTCAAGCTGATGTTATAGGTTGGGCTGCAATTCAAGG AA
G GATAAG AGTG AATACATATTTG GTCG AATTCTCCTGAATC AAGC CAG G AACAG AGG CA
AACGCTAAACCTAGAATATCAAATTGACTGCTTACTGTAGTTGATCAAGATTGAGTG GTG
TTTCTCCTTGGTCTGGACTCCCTTTATCTCCAAACACTATTCTTCCAATACCATCAG GGC
CAAAGC ACATG GAGAAAG AATTTG CAG CAAG AC CTTTACTTG CTAACATG CTCGG AACA
GATATACTTTCCAAGCCAAGTCCAAATAGACCATTAGGAGCAGCGCCACTTAAAAAT GC
ACCGGTTTGTCTTATCCCACACCTGAAATTGAACAAGTAGAAAATTAACATCATGGT GGA
TAAAGATTG CATC CAAATTACACTG CATTTCTCTCAAAC CATACCCACCCTAGAG CAATT
GGAGCCTCAACACTTTTTTGTTGAGCATTATCTGTCTCTAAGTGCAAGATGTCTTCC ACC
AGTACCCCTGATGATGAGGTATTATTGGAGAGATATGCAACTCCATAAGCACATGCG TT
TTGTGAAGATAAGCATCGCCTCCTTTGTCCACACAGAGTGCCGTTGCAAGGAACAAT CT
GACCCGTTGACGACGTATTAGGGCTGTAAATATTGAGATTTATTCGCTGTTTCAAAG CA
AACAAAG GATTATAAGTGTCAAATAATACATAAAG AAACATTAAACG GAAAAGG CAAAAG
AGCAAAGAAAATTTTGTTCCTGAAGGTGAAATCCTGTGGAGTATGAAGTAATTGAGC TA
GTAAGAATCTCAAAATTCTTCAATGACAAACAAGCCACGGAACAATATGGATCAGAT ATT
TCATTTCAGAGTCAGTATATTACATGCATCAAACTCACTTTGCAAATATTGATTATG TTGA
ATTTTGCGCTAGTTATGTATCTTCCTTAACAGAAAAACAATTTTCTCAATACATTCT CCAC
CCCATATCTTGTTAACTAAGAGAATATATTATTGTCATAATGACGGAAAAAGACAGT TGA
AGAGAAATCCACTGTGTTACAGATTCAGCCATTGTGTTATCAGCAGGTTCCTTTCTG AAA
AAGGTTATCAGACGAGGATGATCATCTTCCTTGTAGCAGAAGGCAACATACAGACGA G
GATGATCATCGAAAAAGAAAAACTCTTTAAACATTTGAAAGTAGAAAGAAAAAGGTA CTA
GAATAAAGCAAACATACTCGTCCAGAGCGTGTCTCGAGGGCGCGCACACAATTGCTG C
AATCACAGGGTAGCCAAAACAAGTCACTGCCAGTGTCAAGTGCCACCAGAAATGATA G
CCCAG GAGTG CCCACTGTCACATTTG CATAATG CAAACTG CAAATTG G CAAAG AGTATT
AGTCACACACCTTAAGAAGAAAAAATCACAACTACAGATACTACATATTTTGCATTC AAC
TCTATCTTTATAACATATAAATTACAACATGCTACTGCAAGGAATTTTCAGAACAAT TCCT
TGTCTAG AGAG GAGATAAGTG GCAG CAG AGG AACG G AAAATCAG AAAAAAAAAAATG G
AATTTATTTC C G GAG CAT G G AATTC G AACTAG AAG AAG ACTATAATTAAATTTAG AGTC A
GTACTTTTAATATAGGAGTGAAATCGCCAAAATTCCAGTCCGATGAAACACACAAAC AG
AATTAAAGAACAGAAACAGGCCTAAATCTTTCTTTTTTTTGTTTTATCATATTTTCC TCCA
CATGAATCTCGTAAGAACTATTAATGGTACATGGAATTTATTTAAGTTAGGTAACCT ATTT
TTCCTGAACTGACACATCCAACTAAACAGACAAAAACAAACGCAAAGCTCAGTCAAC TC TAACATCACACTAAACGGACAAAAGTAAAAGACAGTAAACAAGAATTTCCAAAAACGTAC
TAGTAGTTCACAATCAGGAATAACAACAAAAATATTTTAAAAAAAAAATAGAGCAGC AAA
TAAACAACTGCAAAAGCAATCAGAAAAGAAAAATAGAGTGAGCTTACAATCCCAAAG AA
CTGAGGCGGAAAGTTTCATTTCCTCCGGAAAAAGAGAGAGGAGTGGGATTAGTTGTA T
CAGCAAGGCGGCGACCTTTGATAAAGCGATCACGCTGAGTCCAAGCTGAATAATACT C
AACACTTCCCTTCTCAGGCAATCCATGAAGGTCCAAAATACCCTTCACCGGATCCGA AT
ACCGGTGATGGATATCAAACCCGAACGTCCCAAACCCATCGCTGCTCTGCAATTGCA AT
CCCAGAATCGCCAAGAAAATAATAGGGGCAAGGAAAAAATTAAAACTTGTATAAGAATT
AGCCAT
SEQ 15
ATGGTGACAAAGTTTAGTATTTTTATTTTGGTGGTGTTGTTGAGGTTATTTTCATTTGGT T
CTGTAGCCTCAAGGGAAATTCACAATTCTGGTCTTAATCTGAATTCTAGTGCTTCTGGTA
TTGAATTCCCTCAACATCCAAGTTTCAACTCAGTTACTGCTTCTGGAAATTCAGATT GCA
GTTATGGAACATCCAAGAAATCAACAACCACCCATGTAATAACTCAAGAAGAAAATA GAT
CTGATGAAAAAGAAGATGAAGATTTAATGGTATCTAAAAACCAGCCAAGAGAAGCAG TC
AAGTTTCACCTAAGGCACAGATCAGCTGGTCAAAATATAGAGGCCAAAGACTCAATA TT
TGAGTCCACAACAAGGGACTTAGGTAGAATTCAGACATTGCATACAAGGATTGTAGAGA
AAAAGAATCAGAACTCTATTTCAAGGCAAACAAAAAATAGTGAAAAACCTACACAAT CTT
CTTCATTTGAATTCTCAGGCAAGCTCATGGCAACATTAGAGTCAGGTGTAAGTCATGGT
TCAGGGGAGTATTTCATGGATGTTTTTGTCGGTACACCTCCTAAGCACTTCTCTTTG ATT
CTTGATACTGGTAGTGATCTTAATTGGATTCAGTCTGTTCCTTGTTATGATTGTTTT GAAC
AAAATGGTCCTCATTATGATCCTAAGGATTCTATCTCTTTCAAAAATATAAGCTGCC ATG
ATCCTAGGTGTCACCTTGTTTCATCTCCTGACCCTCCACAGCCTTGCAAGTCTGAAA AC
CAGACTTGCCCTTATTACTATTGGTACGGAGACAGCTCGAACACGACTGGTGATTTC GC
GCTTGAGACGTTTACGGTTAATCTCACAACCCCTAGTGGGGATTCAGAGATCAAGAA GG
TGGAAAATGTGATGTTTGGTTGTGGACATTGGAATAGAGGCTTGTTTCATGGTGCTG CT
GGTTTGTTAGGACTTGGTAGAGGACCGCTTTCGTTTTCGTCTCAGCTTCAATCTTTA TAT
GGCCATTCTTTTTCGTATTGTTTGGTTAATAGGAACAGCAATTCTAGCGTAAGCAGC AAA
TTGATTTTTGGTGAAGATAAGGAACTCTTGAAACACGCGAATTTGAACTTCACTTCA CTG
GTTGGTGGGAAAGAAAATCATTTGGAAACATTCTACTATGTGCAGATAAAATCAGTT ATA
GCTGGAGGTGAAGTGCTGAATATACCTGAGGAGACATGGAATTTGTCTACAGAAGGT G
TTG GTG G AACAATCATTG ATTCAG GAACTACTTTG AGCTATTTTG CAGAAC CAG CATATG
AGATTATAAAACAGGCATTTGTTAACAAGGTGAAGCACTATCCTGTTTTAGAAGATT TTC
CAATTTTGAAACCATGTTACAATGTTTCTGGAGTGGAGAAACTTGAATTGCCTTCAT TTG
G GATAGTTTTTG GTGATG GAG CTATATG GAATTTTCCAGTAGAG AACTACTTCATC AAAC
TTGAACCAGAGGATATTGTTTGTTTGGCAATGTTAGGAACTCCTCATTCGGCCATGT CG
ATAATTGGCAACTACCAACAGCAGAATTTTCATATCTTATATGACACCAAAAGGTCA AGG
CTGGGATTTGCACCAACAAGATGTGCTGATGCC
SEQ 16
TCACATAGGAGCAAGATGACCTTCTTTAGACAATTTATCTTGCATCCACCTCTGAAGCAT
TTCCATTGCTGCCTTAGGCTGATCCATTGGAACCATGTGACCTGCATCATGGACCTT AA
GGAAAGTTAAAGGTCCATAGTTTTTTTGAACTCCTTTCTCTACACCATCTACTGCAA AAG
AAACTTGTGTGGCTTTTCCAAAGGCTTTTTGCCCTGTCCATTTCATTGCATGCACCC ATC
TCGAATTCCCTGTCCATATAACAAGAAATAGAGTTTTATAATATTATTGTTAGTTGG TAGC
TTTGAATTGACTTATAAACAATATGATAGAGTTCAACTTCTATATTTTGACAGACGT AAAA G ATAACTTGAAGTAACTTTTGTTAAAAGTAG AATTACTAG CATG AAAAAATAAG GTAG GT
TAAAATACACTAATATAGTATGAAAATCCTCTTTTGTGTATATAAGTTAAATCCATA TGAT
AATATAAAG CTTAC C AAG C C AATTG C AG ATAAG GTC ATATTC C C C AG C ATAC ACTAGTAG
CTTG ATACCATCCTCAAG G AGTG AAG GAATTCC CAATTCAAG ATTC CTCATC CAGTCCA
ACTGCATTGCCTGGTAAACTTCAGAGCTACATGAAACAAACTCAATATCCCCAACAC CA
AGAGCCTTTTTAACTTGTTGATCATTGAGGAAAGTTTCCATTTTGGAGAAATCATAG CAT
AGATCGCCCTCACATCTCTTCCGCACATCATAGTACTGCAACTCGGAAATTACAAAA TTC
ATATGACTTTAACTTTTGTATACTGACAGTGTAAAAAAAACTTTATTCTATCACGTC ACTT
AAAGGATTGTAACTATAGATACCCGTTCATTATAAGTGAGATCAGTAATTTGGAAAA TAA
GACAAGTAATATGTTGTACATCATTAGCTCAGAGAAAATGAGATTGGTCTTTCTTAC GTT
TTTGTCACCAGCAATGTCCATAATCTTGTTGAAGATGCTTGTACAAACAAGATATGC AGC
CATGCAAGCAGTTCCGCCATCTTTTCCTAATATTATTTGCATAGAAGAAAGCTAATG TAA
AGACTAG CTG CTG CTATATGAAGAAG G AAAAG GACATTG AGG ATAAACAAAATGAATTA
CCACAAAGCTTAATTGCTAGTTGACATTTTGGATATGACTTCTCTATGGCATTGTAA TCA
GATTTTTTGATCAATTTCATATCCAGAGCATAGTCAGTGTAGGCTTTGTATTGAATT TCTG
GATCAGTGAGTCCATTACCAATAGCAAATCCCTAAAAAAAATTGTACTTTGTTAAGT CAT
TGGCATGACGACAAATTCAAATTAAACCTAACTAAAGGTAATTACTGGATAAAGAAA AAG
GGATGATATATGGTAAGAGTTAGAAATACCTTGAGATTTACGTAGATTCCTTCTTTA TTTT
TGTTTCCTTGGTGAACCCGAGAAGCAAATGCAGGAATGTAATGCCCAGCATATGATT CT
CCAGTAATATAGAAATCATTTTTTGCATACTGTGGATGTGCCTTGAAGAAGGCCTAT CAT
CAAAAGAATTTGAAAAAGTTTGAATTAAATTTTATTAATTATATCAGTTAAACTTTA GAGAT
TTATCACGAGCTAAAAAAAAGGAATGAAAGAATAGGATCAACCTGCAAGAAGTCATA GA
GATCATTGCTTACGCCCCTTTCATCGTGACGAATATCATCATCGTTTGAACTATAAC TGA
AACCAGTTCCAGTTGGCTGATCGACGTATATAAGATTTGAGACCTGTAAAATTGCAA TTT
ATCATATGTTATCATTCTTCAACTAACAAAGGAAAGTTGCATGTTTGATTATAGGAT TTAA
CCGGTGTAAACGATTTTTACACTATTGTTATATTTTAACATGTTGTAACATGTTGTA TTCG
TCCCACTTAAATAAAGTGAAGAGAAGCGTAGTAGTCATTGATGTCAATAAACGTTGA ACT
ACTTTCGAATTTTTGAAATTCTACAAGTCACAGCTAATGAACAACAAGTGTTAAAGA AAA
AAATGCTAGTAGGTAAAAAGGTATTTTGCATGATGGAGAAAGGTTGAATAACAAATA AAA
ACATGGAGGGAATTCTTTTAGATTTTTACCATATTCAAAAGATCTAACTGACGTTTC TTGA
G AAATTAATTGG GTAAAATAAAAAGAATAAACTG AAAAAAAG AG AG G AAAAAACAAAAGA
AAAAG C AAAAG G AAG AAAAC AAG AAC CTTGTC C C AG C C G AAATC ATTC C AG AC AAG AG A
CATGTTATCTGCAATTTTGAATGGTCCATTTTCATAAAACACAGCCAATTCACTGCT ACA
TCCTGGCCCTCCAGTTAGCCATATAACTACTGGATCATTCTTCCTGCTCCTCGATTC AAA
G AAAAAGTAAAAC ATC CTG C C AAAAAC AG ATAATTTAG C ATTAATTAATAATAC C CAT AAA
TTCATTTTTTTACCAAAATGAAGCAAGAAGAACATTTAATCCAATTCAAACCTTGCA TCTT
TAGTATGTGGAAGACGATAATAACCAGCGTGATGACCCAAGTCTTGAACTGTAGACC CA
GAATTACCAACATAAGATAAATTCAATTTCTTTTCAAAAAGTCTCTGTTCAGTAACT GCTG
CAGAATCCCCTGTTGCTGCAGCCTTGTTGATATCATGCTTAGGGAATAAATTAAGCT GT
CTG ATTAG CTTTTCTG CCATTGTTAATG G GAATTTTG G AGTAGAAG ATAG G AAAAAATCA
TCATCATTAGAATTTAAAGTTGATGAGAAAGATAAGGAAATAGAAGCAAGAAGCAGA GT
AAGAAAGAGAAGAGAGAAAGATGAAGGCAT
SEQ 17
TCACACGACACTTTGGGGTGGGATATCCTGGTTCACAACCCGGGTGTGAAATCCCTCA
TTATCAAAGTAAGGGTCACACTGCCAGAAAAAACAACTTTATTAGTGGTTGATCAAA AAG
ATCCACAGTAGCTAATGGTTTTAGTGGAAGCTGTAACCTCTGTGAGGTGAAAATAAG TC
AGCATTTACCTCTTCTGTGACCACACCCCTACGGATAAAGTATCGCCATGCTGATAT AG
GATATCCACCATCACAACCACTCCCACGTAAAAAGCCACAGCATGCTAACAGATCAT TT ACGGACAGAGAGATATTCTGCAATACACATTAAAAGTTTAGCATCAGTGACCATAACTAC
AGAAATACTTCACAAACATTTTGTGCTAATTAAGATAAGATGGTTTCCATGTGCTTG GCC
CATGAAAAAG AATCAGG G CTCG CACGTG AG AGAG CATG CTG AAG ATATATAAATAACTA
AATAAAAATGTGTCATCTAAAGCTTTTAAATGAGGCGGTCACACAGTGCAACAGAAT TTA
CCAAGTTATGATGGATACAGAAACGATCAGACAGAGATTCAACAGCACCAAAAGCCC AA
CAAGAACCGCAATGTCCCTGATCTGTGATGCAAATTTGTCCCGGTGATTGATGTGCA AA
GATGGAAAGCATTAGGATCACTAAAATAGAATTATAATTCAGTAGTAGTAAGCAAGA ACA
AGAAGAAACTGACCTAGAATTCTTCCGATAGTGCTACATTGAGGCCAAGCTTTTCGT GC
ATCAAACTCTTTTGGTAGCTCCAAAAGCTTTGGATGAGTTAGAATCGGAATTCCCTC CAA
ATCACCTTCTCTTGCGGGCTTAACTCCAAGAAGGCGCTTAAATTGTGAAACCTGTGA AT
ACCAAAGATGG AGATAATTATAAAAG CATATTTTCATTGTATAG CTG CAAAATAAG CAAC
TGTTATTGATGGTTCGAGAAAGCGCTAAACTATATGATGATATGACGACAAGAGGGG GT
TGCTCCGATGGTCAGCATCCTCCACCTACGACCCCAGGATTGTGGGTTCGAGTCACC A
AAG GAG C AATAG CTC C AAC AAAG AG G AT C AC AG G G G AATC AAAAG G G GAG G G G AATTT
TAAAAAAATGATGATCTGACCGTGAAATTCGAGAATCGAGGGTTGAATGCAGCTTTC CA
TCCAGCTTTGGCATTTTCATTAACCTCTTTGATGATTGATTCCTGCATGATCATCCA AAA
AAG CTCTCTCAGTTTTCGAATTG AAG G ACAAG G GCTATAACATCTTG AATCATGAAAAGA
AGAATAATGTACCTGAAGGATTGCAGATTCAACTTTAGCTTCAGATATTGGCTTCTC TGC
AACAACCTGCTTTCCATAAGAACACATCAATTCTATTCATCAATAGACAAGCTAAAA GCT
TTTAG GAAACAGAGTTGCAATTC CAG G AACAAAAGTATG AC AGTACTGTG ACAAACAAA
GAACTAAAGGTTTCACTTTAATCAAGATGAAACGCTTCCAACATTTCTTATTTTCGA CATT
ATAATCCTCTTAACTTAGGAAAATTAATAAAAAATTTGTAGCAAATGCATCATATGT CTAA
AGCACTATAACATAGAGGAAGAACCATAGAACAAGCATAAATTGTAACTATTCCATT ATT
TGTCCTCCTTTTTCTCCCCTTTCTGTATTTCTTTGTGAAGCAATACTTCCTCTCATG TTAT
ATATAGAAACATGTAAGTTAGCTAACACATAAGTAATTTGCATCAAACCATATATTT AACT
TCAGAAACATGTCTATACTTCTGTTTTCTCATTCTCACTAGGTAATAAGAAAATCAT TAAA
ATTTATTTCTACTCATGATTTCAAGTCAACGCTTAACTAAAGCATAAAAAGTCCAAA ATAC
CCAACAATATTTGATCTTTCTGAAGAAATACAAAAAGGGTAATCCATGTAATCATCA AAA
CCTATATAAATTAAACCAATAATCTAAATCCATCTAAACAAAGAAATACTCTTACTG TAGA
AAGGTTCAACGAATGAAGAAACAAACCTGCAATATAAGGATACAAAAAGCACCCAAAAA
CAAAGGAGCTGCTAAAGACTTCAGAGTCAAGGTCAT
SEQ 18
ATGTTCCGACTAGTAATGGTGACAAAGTTTAGTATTTTTATTTTGGTGGTGTTGTTGAGG
TTATTTTCATTTGGTTTTGTAGCCTCAAGAGAAATTCACAATTTTGGTATTAATCTG AATT
TTAGTGCTTCTGGTATTGAATTCCCTCAACATCCAAGCTTCAACTCTGTTACTGCTT CTG
GAAATTCAGATTGCAGTTATGGAACATCCAAGAAATCAACAACCACCCATGTAATAA CTC
AAGAAGAAAATAATTCTGATGAAAAAGAAGATGAAGATTTAATGGTATCTGAAAACC AGC
CAAGAGAAGCAGTCAAGTTTCACTTAAGGCACAGATCAGCTGGTCAAAATATAGAGG CC
AAAGACTCAATATTTGAGTCCACAACAAGGGACTTGGGTAGAATTCAGACATTGCATAC
AAGGATTGTAGAGAAAAAGAATCAGAACTTTATTTCAAGGCAAACAAAAAATAGTGA AAA
AACTACACAATCTTCTTCATTTGAATTCTCAGGTAAGCTCATGGCAACATTAGAGTC AGG
TGTGAGTCATGGTTCAGGGGAGTATTTCATGGATGTTTTTGTTGGTACACCTCCTAA AC
ACTTCTCTTTGATTCTTGATACTGGTAGTGATCTTAATTGGATTCAATCTGTTCCTT GTTA
TGATTGTTTTGAACAAAATGGTCCTCATTATGATCCTAAGGATTCTATCTCTTTCAA GAAT
ATAAGTTGCGATGATCCGAGGTGTCACCTTGTTTCATCTCCTGACCCTCCACAGCCT TG
CAAGTCTGAAAACCAGACTTGCCCTTATTACTATTGGTATGGAGACAGCTCGAACAC GA
CTGGTGATTTCGCGCTTGAGACGTTCACGGTTAATCTCACAACCCCTAATGGGGATT CA
GAG AT CAAGAAAGTG GAAAATGTG ATGTTTG GTTGTG G ACATTG GAATAG AG G CTTATT TCATGGTGCTGCTGGTTTGTTAGGACTTGGTAGAGGACCTCTTTCGTTTTCGTCTCAGC
TTCAATCTTTATATGGCCATTCCTTTTCGTATTGTTTGGTTAATAGGAACAGCAATT CTAG
TGTAAGCAGCAAGTTGATTTTTGGTGAAGATAAGGAACTCTTGAAACACCTGAATTT GAA
TTTCACTTCATTGGTTGGTGGGAAAGAAAATCATTTGGAAACATTCTATTATGTGCA GAT
AAAATCAGTTATAGTTGGAGGTGAAGTGCTGAATATACCTGAGGAGACATGGAATTT GT
CTACAGAAGGTGTTGGTGGAACGATCATCGATTCAGGAACCACTTTGAGCTATTTTG CA
GAACCAGCATATGAGATTATAAAACAGGCATTTGTTAACAAGGTGAAGCGCTATCCT ATT
TTAGATGATTTTCCAATTTTGAAACCATGTTACAATGTTTCTGGAGTGGAGAAACTT GAA
TTGCCTTCATTTGGGATAGTTTTTGGTGATGGAGCTATATGGACTTTTCCAGTAGAG AAC
TACTTCATCAAACTTGAACCAGAGGACATTGTTTGTTTGGCAATTTTAGGAACTCCT CAT
TCGGCCATGTCGATAATTGGCAACTACCAACAGCAGAATTTTCATATCTTATATGAC ACC
AAAAGGTCAAGGCTGGGATTTGCACCAAGAAGATGTGCTGATGCC
SEQ 19
TTACAAAGGTTGCTGAGCTATCCATCTTTTAAACATATGAAAGCTCTCTTCACGTTTGTA
CTCAGGAGCTGTGTGCCCTCCTCCCTAAAGAGAAAAAAAGACAAGAAGGAAGAACAA A
AACATCTGAGAACTGTGAAAATGTGAGCACAAGACAATTTTACTATCGAGTGTGAGT AC
AAAATTCTAACCTTTACTGTTGCATATGTCATATGATTAGAGAAAGATCTTGTGTAA CTG
CACATCAAAGCTATAAATTTAGAATTTCATAACAATTATATCGTCTATCCAACTGGA ATAG
AGATCCAGAGAAATTAAAATGGAGAATCATACCCTGCAACTTGACCATCAATTGTCC AA
GGGCGCCAATCATCAATGATAGAATAATTTAGATACTTTATCCATGCTTGCGTCGAT TGG
AAAGGAACAACCATGTCATGATCACCACTGGGCAAAAAAAAAAAAAAAAAAAAAAAA CTT
TGTTATGTATATAATAACTTCAGCTAAATCTTTTTTGCATAAAGCACAAATTAGAGA ACTA
CAAACATTGGTTTAGGTTTAAAGTATCAAACCTGTATATGAGTGATCGATAACCTTT GGA
ACTAAGGTTAACATGGTAAGGTATACTATTCATGAAAGTAACTCTATAAGTTGTACC CAT
AATACTTTGCCTACATCTCGCCCACGCTCTTCTTATAGTTCCCTAATCAAAGAAGAA CCA
AAAAGAAAAGAAAAAAAGCCTTTTAATATCTTGCACCTTAATGCCAAGCAACCTTCACTA
TGTACAAGTACAAACAATTGAAATACCTTTCTAACATGGAGAGCCTCTTGAACACTA GG
GTCATTTGCCCAATGATTGGAGAGCTTGCGTGTCGCAACCTAAAATCAGAAAGCATA TG
TATATCTAAGTATCCTACATTTATCTGTTTATCCCGAGAAATCATGAAAAAGAACCG GGC
ATATATGCCTCTTTTAACATAAATAGGTATATATCTTAAAATTATTAAACAATATTC AAACT
TTATTCGGTGTAAAGGCATTTAGTTGCTGCAATATTCAAACTTTATATAGTAATATT TGAG
GGACTTACACGACTTTCTCGACAAATGAAGTCATCATGTTTTAGAAAGATAAAATCC TCT
TCAAGAGATCTCCTTTCACCAGACAATTGGCGTGGGTTTGGCGATTCTGAGTCCGTT CC
ACAAAAAGG CTCTAGTATTTG CTG ATCATTTATACTG CTTACAAG CTACAAAGTTG CATT
ACATGAACATTGTGGTATTACCTAATTACAAACTTAGTACCAATTCAAAACATTAAA ACAA
GAATATGTAAGAAGGAATATAACCTTACCTTCTTGAACATTTTAAAGTTTTCTAAAC ATAG
TTTATTGGTAGGATCAATGTTTCGGCAATCGCCTTTGCAAGTCTCCTTCAGTGACTG ATA
CAACAAGATCCAAATTATCATTTTTGTGCCAAACTGTGTTGGTATGATACAAAGTTA ATAT
ATGAATTGAAGTAAGAGATGAACCAACCTCATAAAGTTCATTAGATATTAGTCCCAT ACC
ATGACAGAAAG GAATTTG GTAATTG CTTTCTTCAG G AAATGTTAG CGG ATTGCCAAGTG
AATAACCCTGAAAGGATATTAATTACCATTAAAAATAATAATTTTTCATGTATGAGA TATA
TTTTAAAT AAAAG G AAAG G AATAG AAAAC CTTAAG GTTG ATTAGTG GCTTTTTG CCTG CT
TCAATTCCTGCATTTTCATATCACTCTTGAAGTGTCAACAGAGAACACTTTGGCTAG AAA
TGTTTTGAGAATCTTCTCTTCAACCAAGTGAGGCTACACATTCAGTTTTGAAGAGAC AAC
AAAAGAATGTAGTGTTCTATATATCTTATCTAATATCTGTGGAAGTAATGATCAATA ATAA
CAAAGCATAAACTTGGCTTAGAGCCCTAGGAATGGAGAACCTTTGATCAATAGAGTC AC
TTTACTCCCCCTAGTGGACCAATTCAGCATTAATTGCCAGCGGGCTTCAAATAACGA AT
GGCTAAACCGGAAAAATAATAATAACAAAGCATACACTAAAGAGTGATGGGGTGATG AG AACGATCACAACCGCACATTATATGAAAATACTAAATGACACTTACCATCTGATATTAGT
TGAACAATAACTGGAACTGTAATGCCTGAATATGAGTCCCCAGAAACATAGAAAGGG TT
G GAAATG AATTCTG GATG ATTATTGAACCACTG CAAAATAAATCACAACTTATTTCGC GA
TACATTTTTGTTATATTTGGTTGCTATTTAATTATTACTGTCTTCATTTCTTCACGG TTCAT
TTG ATCAATCCAC CAAGATACACATATATTAGG ATTTAG GTGG GTG GAGG G AATTCAAA
TACGTATATATTGAGCCTAAAAAATTTATATCCTAGATCCTCTACCAATGAACAATG CAAA
ATGAGAAGTATATTGAGGAATACATGAAAATTCAGAGTTAACTCTGCTTACTAATTC AAT
CTAAAAAATACTCAGTTTACTAATTCATATGTATCTAACTAAATGAACTTGAAATGA TTTT
CAAATACCTTTAGTAGAAATTCATAGACCTGGTCGCACGCTTGTAGATCAGTACACT TG
GATGCCGCTGAAGTTGTTGCATATGAGAACCCAGTATTTACAGGCTGTTCCAAGAAA AG
TATGCTCGCAAACTACCAAGTTTAAGGAACACATTAATTTGATGATCTAATGTTAAC CAT
AAGAGGATAAAAGAGAACATGAAGTGTGGATTAGAATATATATGCAGTTTGTTGACA GC
AAAATGAAGTGTGCAATAGAAAAATAACATGCTCTTCAGTGTTACCTTTGTCCAGGA ATA
TGGAGTTGAAACAAGAATTGGTAGGCTCCCATTGTATGCCTTCTGACCAAAAGCCAA TG
GCCCTACAAAAGAGAATTCAGAAGTTAATTTTCTCCAACTATGAGTTACACGTAATA CCA
AGCTTTTCCACACCTATATATTTACTACTAATTGACTAATTAGAAACATCGACTAAA AGTA
AATTGCTTTTGTGATACTAACAAAAGCATTTCTCAAATGAGTTAAGTTAATCACAAA AGGT
TAAACTCTATTTGACAAAAATTACATTTGAACACAAACAATAATGGTAACTGTTGGG ATAA
AGATAACCTCCAGACTATGATTACATATTTAAGGTGAGTTACTGATAAACATACTTG ATA
ATACAATAGTAGTATAACTAACATACTATCATAGGTTAAATAATATTTTATAAAAAA TATTT
ACATTGTCAGTATATACAATATAAATATTTACCTACTTCATACGCCACACCCGTGAA GGA
TGAGCAACCAGGCCCTCCCGTTAGCCATAGCAAGAGTGGATCTTTTTTAGGGTTGGA TT
CTGATTTGACAAAGTAATAGAATAGTTGCACTTCCTCGGATTTGCCAACTCCAATAT ATC
TAACAATTGTATTCAAAATACATCACTTCAACAAACTTGTTTTACTACTCCACTATA TATG
TAGCCAGTATGTTCTGAATGAAGTAAATTACCTAAGAAAAAGTTTAGATTCTTTTTA TACT
AATTGATCTTTTGATCAAATACAAGTTAAAATTCAAAGGGTGGTAAAATTAACGTAC CCA
GTCTCAAG ATAAAAAG G AAG AG G GCCATCAAAACCAG G AAG AAACTC AACAGTTG AG C
TATTCTGAG GAAG ACTTTGTACATATTGTAG AAAG AG AGTAAG AG G AAG AAGAAGATGA
AACAATAGTGGCAGGCGAAAACCAGACAT
SEQ 20
TTAACCAGCTAGAGGATTCATCACACTGCCAATGAATAACACCGCACCAGTAGATTCGT
CTCTTATAAGGAATAGGAATGGATGGTCCGCAACAAAATCCATTTCCTTCTCAATAA TCA
AGGACATGGTCATTATTACAGTAGCGGTAACAGCTGCAGCTTCGGTTCCTTCCTCAT TT
ACCTCAATGAAAGACTTGTGAAAAACCTGTGAAACAGACAGGTTCTGAGGCATAGGA GA
ATCAACCATCTCAGTGAGGCTACCACCACAAAAAGGCAACGTGAGGCCGAGTCCCTT T
AGAATGTTGGAAGCTTCAAATCCAAAAGTTATTTTAAATTTAGGGATAAGAAACTTG CGC
GCTCTAACTTTTCCATATGGAACATGGTTATTTAAAAATCCTGGTTCTAAGCTGATT TTTT
CCAGTAAAGCAGGTAATCCATCATGGGCATCTGGGAGAATGAAATACATACAGAAGC G
ACGCGTATCCGTGCCTTGTTTATAAGGAAGCCTCAATATTTTAAAGCAATCAAACGC TG
CTATGTACTGCTTCTTCTTGCTAGTCATAAATGGTGCTTGAATAGACCCTCCATTGA GGA
GATGGAAGTCATGATCTTTCGTTTCTGACACATCGAACTTCTCATTCCATTCTCCTT TGA
AATATAGTG CATTG G AC AAGATCAGC CTTGTC ATGTTGTTCACTG CATCG CGAG GAAGA
ATCTCTTTGATAAGACCATTTGTCTCCAT
SEQ 21
TTAGAAAAAAAGCCAATGCTTCTTTCTGCGCACTCTAGCTGGAACCTCTGTGGCACATT
ATGTTCAAAGTAAAGCAAAAATTAGTATTCAAGAATAGCTATGACAAAAAATTCTGA ACT
CAGAAATAGTTAAAGCAAGAAGACACTTACCTCCAATTTCGTATAACGAATTGTAGT GAA
CTTCGCTCCAGAAACTCAGCCACAATTCTGAAAATCAAGAATAACAAGTCAAAAGTT TTG
TCTTCAAGAGATTAAACATATCGAGAAGAAATGTTCGTCAGCGCCACACCAACACAT TT
CCGAACAAGTGATTTGGAATTTATTTACTTGTTAAGATAAGAGTTAATCTGATCAGG TTT
ACTCAGTCTTTCCTAAGATCACCATGGATGTGTTTGTTTATGAAAATATAATCAGCA GAA
CAT CATC AAACATGTTAAAG G G GAG CTTTGG AGCAACGGTAAAGTTGTCTC CATGTG AC
CTATAGGTCACGGGTTCGAGCCGTGAAAGCGGCCACTAATGTTCGCATTAGGATAGA C
TGGCTACATCACACTCCTTGGGATACATCCCTTCCTCGGACCCTGCATGAACACGGG AT
GCCTTATGCACCGGGCTGCCTTTTTTTAATCATTAAACATGTTCAGCATATTCAATT TCTT
GAGAAAACAATTTTAGCATAACAAAAGAGAACTATTAAAACAGGAGCGACGATCTGA TG
TTACTTTATTG G CAG CG GATCAAGTTTATCGTCTGTAACAAATAACG CTTAATAAGTG CT
TGCATCAGTAAATTGTGTGTTTCACGTGAAGCAAGACTCTGGAGATAAATTCTCCTG ATA
GCAACTAATACTGCTCTTAGTAAGAGCATCAGGAAACTCTGTACAAAGCTTACATTC ACT
AATTGTAAACATACAAATGATCCACAGTTCACAAATGACAGCGAAGGATTCTTGTCA TTA
GAATCAGCCTCGTTCAATGATGCAACAATGTCATTACAAATTAAAGGTGTTTTAGTT TTA
CTATTTAAAAATTAGTTAGTACCAATTCACTTCGACATACTTCCACTCACAATCTAT TGTG
TGTGGTACCAAGCAATACGATGATACTTTCTCCTAAATGAGAAAAGCACATCCATTT GTC
AAAATAAACAAAATGATTCATTTTTGTTTTCCTTTTTTTTCATTTTTGTTTTCCTTT TTTTTT
CATTTTTGTTTTATATATGTATTCTTATAATAAAGTGGGAGCATGCTAGAGAGTTCA AAGT
AGGACCATGCTACAAAGTTCAGAAGAATACTTTTGGTTTAGTGTCTCAAACAAAACC AG
CAAGTACTATTATTTAAATTCTGGAATTTATATCATAATATCATTATTTTAGAGTTA TTTGT
AAATTTCAAGTATTTATTTTATTTTTTGAGTTTAAAAACTCAAG CAG G AAGTTAAAG AACT
AATAATCAGAAGTATTGATTGTGCCATGATCATTAGATACAAATAACATAAAATGTA TGTA
CCCCTAGAAGGTTGTATGTCCTTGGGAAGAATATCAATGTAGCCGTTGTCTCGGAAA GA
CGTGACCAAACATATTTTTACCCCAAACTGCAAACAAAATACAAGCAAACACAAGTACCA
TCAGTCACCCAAAACGGGAAAGCAAAAAAATAAAAATGCAGAGGGCATAAGGGCGAAA
GGCTGATTAACAACACAAATATCTAAAGCAGTTCCACAAAGGTAAAAGAGCAAGAGAAT
GGTCATATTGCGGGAAATTCTTTAGGTAGTAGATCTCCTGCTTTTAAGTTGCTTTCG CGT
TTGATGCTATGTTGAACTTCAAATATATTCAAGTTTGAACCCATAATTTCTAAAGTG TAGC
AAATTTAGTGGTAAGAACCTAAAAGCTGAACCCACCAAACTTAAATCCTGAATCCGC CTT
CGTATTTGATGTCTTAGACATTTGTATGCCTTCTAAGCGTAAATGTAGGTTGACAAG ATG
AAACAGGGCAAATTTAACTTTTTTTCTCATTTCTGCTCTAATATCTACAGTTAAATT TCAG
TAACCTACATGCAACACAGGCATACTGCACAAAAAGTCAGGCTAAAAAATAAGAAGA TG
CCGGCGTTTGTAAGAAAGTAAGAGGATTATTAGAAATGTTCTTGGTTATTTTCTCCG ACA
GCCTTGGGCAAGAATAAAGCTCTAAATTTCACCGTCAAAGAGTTCTAATCAGGAACC AT
TTTAATACATTG AGG G GTTTTCTCTTAAG GAAAAATAATTTTCATAAAAG CCAAATG GTTA
AAGGTTAACAGCTGAGCCGAATTGCCTTAATACTCATAACTCTTAATGCTACCATAT TAT
ATACCAACACCAACTCAAAAATTCACTTCCTAAAATTTTTCAACTTTTTCACAGGAA ACTA
TTCCACTATGTGTATTATGCAATGTTAGATACTATTAACAAAGTTGCTTCATTGTTT TTTTT
CTCATTTAAAAAGCTTAAATTTGCAATGCAACAGTTCCACTATCCAGGCAATACACC TTT
ATTGACAGTATAATTTGTTGGCTTTTCTTGTTCGTAGTGTTCAAATGCTGTCTTACA TACT
AACAAGTGTACCGTATATATGCCTCAAAAGCTAGTTGAGGACAAAAGAGAAACTTTC AA
CTTACTCTATCGGCAGCGGCTTGTAGAGTGACATGATCTCCCCACTCTCCCAACCTG GT
TAGAGCAAGAATATGTTGGTAATATATTGAGATTTCCCGCAATAAGAAGCACTGAAA ATT
GACCAGACAGCTTACCTCTTCATTTTCCTCAAGTAGCTTTTGTATCTCATGGGCACA TAA
CCTTCATATAACTTTCTAAAGCGCTTTAGCTGAAATGGATTGCCAACTAGGATAAGA AAC
GTGTAGGACATAAACTAGACATAGGCGTCAAACTCAAGCAAGTATAGGACTCTGTTG AT
GTTTAATAGATCCAGAAATTAGGTTAAGCAGTTGAGCTGATGAGTCTCAAGCAAACA CC
GGGATATTTTATTTTTAGTACTCTATTATTATGCACAAGTTACTTTAGTTCTACTAA AACT CACGAAAACGAATATAAACTGCAGGATTTGTTAATCTAGGATGAACACAATCCTCAAACA
TAATCATTATGTAATCTCAACTTTTCACCCATCCTGTTTCACAACCAAAGGCAAGAG ATG
AGAAGCAAATATTGAAAAAATGCAAACAGACCTGTTTAACGACCTCCTTCCTTACAT GCT
TATGATACTCTGGATTATGATACAACTGATCCGAAAGGGCCCGAAACTACAATAAGC CA
AGTAGACAGTCATATAGCAGGATTAATCACCTGTGAATTAATAAGTAGCAGGTGGGT CT
CAAGTTCAATGAGAGTTTGTCGACTAGTCAAAAGAGACCGACCTGGCAATTTCCATC TC
CTTCAATTTGCATTTCAGCAAGACCATATGTCGCTAACCTGAAACAAAGAAGTTACA AAT
AGGATCCGAGTTGCCAAAAGGCGAAAAACATAGAAAAGCACCAAGAAATTTCACCCTTT
AAGTGCTAAGTGCAATTAAAGCATGAATTAAGCTATATAGAAAGACACTAATGAAGA AAA
AGATAACCTATGAGCAGAAACTATAATAATTTGGACAAAGAATTGAAAAACTAAAAT TAA
TCAATAAATATGAAGTAAAAATCATGTCAATTAAGTTCCATAATACTGGATTCACAA AATA
AGTCAACCAATGACCTCAAGCTGAACAACTATAGCATGCTCAGTAGTCAGTACAATG AT
GGATAACCTGCTAGAGAGCCTCCCATGGTCTAGTGTGGCATCATTTGGGTCAGGTAT CT
CCCCAATTACCCGTGGAGTGTGCTGCAAGGATAATAAGACTATAAGTAGCATGTAGA GA
TAGTGCAACATGGGTTTTCATGGGTTAAAGCTCAAAAAGGAATATATAGAGAAGCAA CC
TTTGGATACATACATGTGGTCACATTATACTCATTTAATTTGAAAGAGTTGCCCCCA AAA
ATGTAAATTTAGATAATTGTTCATAAAAAGGTCAAAAGTTCCCCGGATTTCTAAACT TCTT
GAACATTCTCCTTCTTCCTCTATGGATAGGGTGACGTGACGCGCAATGGATCAGTTA GA
AAGGAGAAGAAACTGATCAAGTGTCCCCTTCGCAACAACCTTTTTCCCTTTCTCGCA TC
CTTGGTCAAGCTTGAGATTTTCTTCTATCTGGATTTAGTTACCCGAACCTATGCAAA GCA
AGCCTACACTCTAGCTGGGGCCAAGCCATCTTTGCCTAAGTTCCATTACCTCATTAA CT
CG ATAAAG CTG GC ATTAAG AGAAG CTTG GTAG CATAAAGTGTCAAG GAGGTG GTTTTCG
CCTATCTATAAGGACAGTTGGAGTTGATGGTCCTGGTTTTGTTACAGTAAGGGATGT TG
CAGGAATCACTTGCATTGATGGAACAGAAGCTCTTCCTGCTCTTGTATTCGACGAAG AA
TAGGCCCAAGGGACATCAATGGACAATCGCCAAGCACATGGTTTAGCTAATTCAATA GC
CTTCGGAGAAAAGTGATAAGTACCATCGAACAAGCAAATCACTTCTGGATTATTAAC CTT
TAAGGATCCACTTATGGTGAAATTTTGGAATGATGCGGGGATAACCCAAGGGAGTCA G
CTAAACAGAGACGGAAAATATCAGTACCATTCTGGTCGGCGTATGCTTGTTGAAGAG AA
CAAGCTCACTTCTAGCTAGCTAAAGGGAAAGAGAGATACTATAGAATACCTAAGTCA AT
TTATCATTGAGCTGCAAAGATCATAGCAGACACTACGAATTAATAGCAAATCTCGGC AC
AAACCTCTACATGTCAAGATAGTGAGTCCATCAGATCACAAAAGACTTAATACCAGA TAC
CTTTCTAGATAAAGGGGTAAAATCACTTTCTTTCTTCCACAAGATCTCCATTTAAAG AGA
CTATCAACTGTACTTAAATGTAATCATATCAACTGTACTTAAATGTAATCATACCGG GATT
GAGTCCAAATGAGAAAGTCTCCTCCCAAGTTTATTGCCACCATATTTGAGAGCATTT TCT
TCTTCTTCAGCTAAAATTCTTGCAATGGTATGATCATCCTCAGTACCGTGTGAACTA CTA
TTCAAACTAGATGTAGTAGAACTCGAGCTTGCTCTTGAATTTCCATAGGATTCATTC AT
SEQ 22
CTAGAAAGGGTAAATGAGACCTCCGAACTTCCCAGAAAATGCTTCCTTCTGAGGCTGTT
G CACTATAGTTG CATTGCTCATTCG CTG CATTTAAAACAAATTAACTGTGAAAACTACAG
TAGCAAAAGGTTAAAGAAAACGAACATGAATAGCACGTCAAGAGAAATTGGCTTTGC TT
TAACGGTTATTTCATCTCTGTCAACAATGAAATGGCAATGAGTGAACCTTTTCAGAA TAA
GTTGGCTTATCTCATTAATGAGAGACAGATAACAAGAGATGTCTCCTCTAATCTCTA AAT
TGATATTTCATGTTGTATGGATCCTAATAGGATGAGAAATGCATCAAATACAGAAAG GAA
TGGCAGAAGTGGAGATATACCTTGAGGCAGAGGTTCCGACTTGTGTCACAAATAGGA T
AATCATGTGGGCAACAGTGGCGTCCATCTTTGCAACACACTGCAGAATCCAACCCAC AA
CATTTCCAAGAAACGCAAACTCCAAGAAGCCTCCACCCACAGCAGCAGGTTTCACCT TG
ACCACATGAGGTAAACATACTGCATTTGCTTGGACCAGGAGATGGAGGAGATGGTGG A
TTTGGGCTACTCTTAGTTGGATATGAAGCTAGCTTATTGATCCCACATATCCCTTCT TGA TTCCCACTATTACGCTGCATGTGCATATAACCATTTATTCCCCAGCTTGTTCCCCATGAA
TTTTTTATAATCCAGTAATCAACTCCATTTTCAGAACCATAGCCCACAATCAGTACC GCA
TGATCAAGTACTGTAGAACACGGTCCAGTGAATATCCCCTGCAACACAAAGATAGCA CC
TTTATATTTCTCTC CGACAAACAATTTAACTG ATTAG GAG ATTG GTAATTTG GAG ATG GA
AGATACCTTTGAATATGATTGAAATGCTCTCTCACTGCCGCATATCCCAACACTCAC GG
GTTGATTTGCCACCGCCTTTAGAAGCTTGTCCTCATCATATTGGGGAACATCAGTAT ATC
CATCAATGGTTACAACACGTCTTTGTAGCTGCAAAGTCGACAAGTTAAGCCAAGCAA TC
ATATGTTAACGTTGTTCATTATGTTTTATCTGGAATAAACTTGTCCTAGGTTCTCTA TATT
AATTATG AAATC C AG AAG C AG GAG G G CAT AT AT AG G AC AAAAAAAG ATTTAGTAC AAG A
ATAGGAGCAAGAAGATGAGAGAACAATTATGTACCTTGTTTTTGTTGCATGTTCCTT CTC
TTTCATTAAAGGGGTAATCCTCTTCAGTGTCAATACCACCATTCTTTTTGACAAATT CAAA
AGCATAGTCCATCAATCCACCTCCACAGCCGTCATTGTAACTTTTGTCGCAATCAAT TAA
CTCCTGCTCAGAGAGACTTACAAGAGATCCAGTGACAATCTTATTGATACCTTCGAT TG
CTCCAGTGGCTGAGAATGACCAGCAAGCACCTGGGAAAATGAAACAGAAGTAACTGG T
TTTAGTTACAGAAGCTAGTTGCTGAGATTAAGTATATGGAATGACAATAGAATGACA GTG
TTGTGGACAAGGGCAAATTTGATTCATATTATCTCAGAACAAATTCACAAAAAGGCT AGA
TCTTCACTTCCGTCCTATATTCAGGCTGACATTACCAGACATATCTACAGAAAATAA TTA
CTTGAGAAACATAAAGGCAGTGATAAATTTTAAGAATAAACTATACTTAGTGAGAAT TGT
GTGCAGTCATAAAAGTAAC AAGTCTAAGTCCCTGAAG CAAATTCTG CATTG G GAG GAAA
GTATATTTCCGCGTATATGACAGCCAAATTAGTTGCTATAAAACATCACACTAGTAT GTG
ACTCAATATTGACAGTAAAATTATAAAACATGTTCCTTCGATTGCACAAGCAAGTAG AGA
ATCATAGGGTACAATTGTGTACACAGTTCCAAAAACAAGAAAGAAGAGCTAAAACAA TG
AATTGTGAGTCAATGATTCAATGGTCGAAAACAGGACCAAGAAATGGATCAGTGGAT AT
TTATATTTATTTCTATTTTTAAAACTTAAAGGGCATGATGTGAAGACTGAAGCTGGT AATC
CAGTTTTGATTGTGATGCACATGAATGGATGTGGAAAGTAATATTCTCTGGAAGACA GA
AGACCACTAACCACCTCAGTTGCTCAGACCAAGATAGTGAGTAGATCTCCCCTAATC TA
TTCAACAAAGTTCATTGGAAAGAAACAAACAATGAAATGGCGGATCTCCGAGCAGTC TG
GTGAAGTTTATGGTCCAGTGGTTAAAAAGAATGACAACTCAACCATATTTTACTCCT CCG
ATGTGCTCCGTTCATGATCTTTTTAATATTTCTCGCTAATCCGCTAATCAATAAAGA ATGA
GATACTGTATCAGTATGTCCTATTATTGTTGTCTTCCAGTCACTCTGAAGAAATGAT TTTC
ACATACATAGAGACAAAAATTGAAAGTAAGAAACAACAACAACAAACCAGTGGAATC AC
ATAAGTGGGGTCCGGGAGTATAATGTGTACGCAGACCTTAGAGGTTGTTTCTGATAG AC
CCTCGGCTCAAGAACAGTGAGAAAATTGAAAGTAAGAAACAAACAGTATATTCATTC CTA
ATCAACTCATGAAAGGACGAGCTCATGAGACTAAGTTTCAACAACAACAACCATATG ATT
GTTTATTCCACTTCATCTTGATTCCAATACCTAATAATTTGTCTTTTGGGGCAACTC AAG
GGTTCCTAAAGCTAAGAATTCTCTAAATCTCACACTTCTCCTTATACAAACATTCAA ATCC
TAACCAAACTGAAAGTGCTCCTGTCTAATACTGATGAACTAAAGTAAGTGCTGAGGC TA
GGTTTCAATGAAGTAAATTAGTCCTGAACTTCAACCTGTCAAATAATACAAGGAAAA GCA
AAAAAGGGTAGCTCCCAGACAAGAGAAAAAGGCAAAACTAACATCACAAGTTTCCAT TG
TCATTTGAGAAAAAAATCATCAAAATCCAAACTTTGTAAAAATTTCTAATGTTGGCT CTAC
TATGCACAAGTTATATATCCTCCACATAAATGAAATCACTATAAAGATACAACTAAA AGAT
AACGCAATAAACTGAGCATACCACAACTGCCTTGATTCTTGACTTTAGTAACAGCTC CTT
TCTCTCTCCAATCCAAAGAAGAAGGAATATCAACAACACCAACATCATTAAAAACTC CAG
CAGAAGACGACCCAGTTTTCAATCTAATAAAATCATTAGCAGAAGAGGACAAACCCA AA
AAAGAGTTCTTGAATTCATGGTGAGTGAGATCAGAAAAGGCATTGAGATTAAGGGTA TA
AGTGGAATTCCCCTTACTATTATGCTCTATAATATAAGCATAATTTTCTTCAAACAC CTCG
AGTCTGTACACCCTTTCTTGTTCAGAAGAATATGTCTTTCCATTTTGCTGACACCAA CTT
TCAAAAAGATCAGAAATTGATGAACAAGTGCAAATTGGTCCTTGAAAAATTAGAAGT ACA
AGAACCAAAGATGGACATAACCAACTCAT
SEQ 23
TTAATG CTTATTCCAGAAACTCCACTTCTTCTTCTTCTTCTTAAAGTCAAATG G CAG G AA
GTCTGGAATGGAGCATCAAACTACAGTATTAGAATAATATGATAAGGGTAGTGTGTA CG
CCGGTGGCGGACCCAGGATTTTGTGCAAGCGGGTTCAATCTTAGAAGTATATAACTT TA
GTTGTAAAATAGTAGTTGTCAAGTGGGTTCAAATAAAATATTTAAACAAAATTTACG CAG
CTTTAATCCTAATTTATACATATATACAGTATTAGTTTTTGATGCTTGCCACCACGT GCGT
CCACCACTGTGTACACATACCCCTTACCCCCTACCTTGTGAGGATAGAAATCTAAAT GA
AGCAAGAGCAGCAAACTTCCGGTCAACTCTCAATGTTCATCGCTTTGTCCATAAGCA TG
TGATAAACAAAAAGTGTTATTCCGTAATGCCCATGGTAACCCCCCCCCCCCGGGGGG G
GNGGTTTAAACATGTAATTAATCAGATATAGGCCAATTAATAATAGTTGAGCGACCATGC
TAAAACCACGGAACTCCGGAGTACCTAACCCCCCCCCCCCGGGGGGGGGATGTCCAT
G CTAAG ACAAACTAAACAGAAACG G GACATAAAAGTACAAG CAACTACCTCCTTGAG GA
TAGATTGAGTTGTAGTGCACCTCTGCCCAGAAACTCAAGTATATGACTGCAAGAAGA GA
AAAATAAGAGAAACAATTAAATTGGTGTGAGCGAATGAATATCAAGACTTCAAAACA CCG
CTCTTACAAAGTGCACATGCACAAAGAAATTGCATTCATACTTTAATTTCTCTTCCA AGC
AATCTCAAGATTTGCTTGCCCACACTTGGCTTCATAGTATAGGTATGATACAATGGC ATA
G AATAAATG AC ATG C ATAC ATAAC C AATATAAAG CTTG C C C C C ATTC ATTAAACTTAC AA
CCGTCTCATAATCATTCAATATGTTAAAACAGACAAATTCCGGTCTCTAAAAGGAGA ATG
TGAATGTCAAAGCATCCATGTTATGAGATGGAATTTAGATTTCAAAAGAGCTAAAAC GGA
CGACTCTTCAAAAATCAAAATCTCCTTCTCATGAAACGCAAAATCGAATTTGCTTAA GAT
TGTCCTTAAGGGTTCATAGTCATCCATTCATCCCTCCTTCCCCTTGCGCAATTTTTT GGT
CAAG G CAG G CCG AG GTACTAACTTTACAGTCCAAAGATCAAAAGTACTATTTGC ATTCT
TCACGACTCATGTAATAAACTTATGTTGTCTCTTTAACTCCAGTGGTCCTACTTTAT CAG
AGTCGTTATCTGATTTTGGACTTCTGAAAAAGTTTGATACAAAGATCGTACTAACTT TTC
CATAGTTGGCACAAATTTCAAGAATCAAATTCATCCAGTAAATCAGGTTGCTCTGGT ACC
AGCTACTTCTATAATTTAATTTACTACTATTACTACAATATGCATAATCAAATTATC TGCTT
CATCTCCATGTGTAGCCTGTGTCGTCTGAAACGCCAATGGGGGAGTACTAATTTGGT G
GTTACGAATTATCATACTATCTCCCCTTTTTGAATTGTTGAATTTGGTCCTGAAAAA TGTT
G CGGTTTTG G CTAAAAGTCTAAAACTG CATTG CAGAAAGTATC AATAG AACG AC ATAAG
AACTCGATAATGGTTTCTCAGTTAGTGGAATTACAGCTGAGGAAAGCATCTTTAACC GC
AAACTGGAAATACGTACAGCATTAAGCGACTCGTGACTTTTTATTTGAGACACATGG AAA
TTGAGAAATAGGATCTATGTCACTCCCACTTCCAAATATTTTTGTAATAAAAACTTG TTCA
ATCGCATTTTGTGAAGTAGAGGATATTACAGAAAGGTAAAAGCAATTCAAAGTTTGA GAA
CTAGCCTACCTCTGTTTGACTTCTGATTCTTCGGAAGAATCTCGATGTAACATGTAT CCT
TGAATGACGTTATAACAAGAATTTTCACACCATACTGCCCACAAGACAAAAAGACAA AAT
CAATGTGCAACACAAGTAGGTTCATTCACAAACCAATGCATCCTAGTCTAGCATCAT CA
CAAATAAAATCTTCATAAAAG GAG CTG CG CAT ATAC AATAAATAAAAAATG CATC ATAAC
CACTCAAAATG GAG AGTG AAAG AAAGAAATAGCAAAATAG AG G CAC ATG AATTAACAAA
AGCTAGTAAAGCACCCAATGGAGGCACTATACCAGGACATCCAAATTATGGTCTGGC CA
ACAATAGCTTAAGCTTCTTATATTCCAAGGTTAAAAAGTAAACCAAAGTAATCAAAT GGA
G AG AAAAAAC CAAG G AAG C AAATAAG G G G AATAATC AATAC C G AGTC AG CAG CAG C CT
GCAACGTAACATGATCGCCCCATTCCCCACTCCTGGTTCAACCAATATATAAGCAGA TG
TTGATTTAAAGGAGAAAAGATAATGCAGAATTTCAAAATGGAGAAACAAAGGTCTGA GA
ATTACTTGGACATCCTCGTCAAGTACTCTCCATACTCCATTGGGACATATCCCTCAT ACA
TCTCCGGATGATGTTGAAACTGACAAGACAAAATGATTTTAAAAAACATACTCAGCT AAA
ATGTATGTGAAATAATTCAAAAATGAAATCGAAATATCATAAAGAAGATTATCTTAT TCAC
TTAGACAAG CAC CTGG CTG ACTACTTG CTGTCTGACAAATTTGTG GTG CTCTGGTGTAC
G ATAG AATTG ATCTG ATAAAG CAC GG AACTG GAACAAAAGAG GACATGTG AATAGATGT
G C ATTAG AAAG AAATAG G ATG G AAC CTTATTTC AAC AAACTTAG AATC G GAG G C G AAG C
CTAATCCACTTGGGAAAAATAGGCAAAGTTCCCATACCCCATAAAACTAAAGAGTTG AG AAAAAAATTGAATTATCCTTTTGTCAAACTACTAAAAATCCACAAATATTTTTAATAAAG TT
GGAATGGCAAGCAGCCTGACAATACGTCACTTGAGTTTCATATTCCAATTTTTTTAA AAA
TCCTATTATGAACCACTTCATTCACCATTTCACTGTCACAAACACCAACAACAAGTT TCT
ATCAGAGCCAAAGAGTTAATTCCAAAGTAAGGAATACTGATAAACCGTCATCAAACA TTA
TGTTTTTTGTTTTTCATTCCCTTTCTTCTTTGAACCAGAGAGTAAGACCTCCATACC ACCT
AGCACCTTGATATACTGAGATTTTTCATGAGACAAACTTAAAGAATTGGGGATCCTT CTT
TTGTTTATGGTTAAAAATTATGCTACAAGTAGTTTAAGGAAAGGGAAAACAATTTTT TTCT
TTCACTAGAAATCAAATCATGAGCGTCTCTAGACAGTTTGATATCATTACTGCAAGA CAA
ATCAACCAAGTAATACAGTAACCTGGCAGTTGCCATCTCCTTGCACTTTGTGCTCCA CC
AAGTCAAATAATTGCAATCTGAAGCAAGAAAAACCATAGCATGGGATCCTTAGCAAA GA
ATAACTTGCAGCAATAATATTACCATTCAATTGCATTGGCAAAAATATCAACTTCAC AGG
AAACTTAGGTGGCACAAAACCTAATAAAAAACACAACAATATCAACTTCATAGGAAG CTT
AGGTGGCACAAAATCTAATAAAGAAACACAACAAAAGAAAATAAAATGAAAAACCAC ATG
CATTGTCCTGTGACTTTAAGAAAACAAAGGGTTAATAGTATCTATTGGAATGTGCTT CAT
AAGTGTTTATTTACAAAGCACAAAATACTAGTTGGCTCATTCCACACCAATTATTTT CTGT
GCTTGTCTCGGCTCCCATCTCTCCCATTTGATTCTCGTTTTCTCAAAATGCTTGAGG GGT
CAGATGTTACTTTTCGAATAGCAGGCAGTAAGAACCACCAGTAAAAGAATGAGTTAA TA
AAGAAAGAGAAAACCTAACAAAATAAAAATAAAGAAATGTTTCTGGAGACACAGGAC CC
CTATCACAATAGAGGTATGCATTTCTCACAGTGAAGCTATATTTCATTACCTTGAAA AGT
GCTCAGAATTTGGAGAGAATTAAGCAAAGCACTATCATAAGATCATATGTCATTAAC TTG
TTCATACACAATTTTTATCTACTTGAAAACCTAAGTCGGAGGGATCAGGAGATACCT ATT
TAGCAGCCTTTGATGATCAGAAGTTGCTTCATCGACTGAAGGTATGTCCCCATTTAT TCT
AGG AACATG CTG CATG GTTAATAACACAAATTGTTATAAGAAACTAAAGCAG CCTCAAG
AAAATG G CCATAG GTGCAAAG CACCACACATGTC CTCGTACACAAAGTG AATTAGTGTT
CTCAATATACTAACAG ACACTACACTTAC AG GAACAG CACTCAG CTG GTTTATTCTCTTC
CCTACTTCTCCATCAAGCTCAAATTCATCTTGTATTTCCAATGTGTAGGTATACTCT TCTC
CATCGTATGATCTGTCTCCAGGACTCGAACAAGAACTTGAAGGCCCTACATCATCAG CT
TCTAGGCTGGTGTCATGCCCTGTGAATCATATATAATTAGCAAATGTTTAAACTCAA GGA
ACATCACAGAATTGAAAACAAGAAATGTACCAGCATAATACTCTCTTGGAGGAGTAT GC
CAATGTTGTACACCAGTGGAGGCTTGCAAATACTGCTCGTCTGCATGTGAAGATTCA GC
ATCTTCTGCGATGGACAACTCTGACAAATCTTCTTGTAGAACATGAGCAATAGCCTC ATC
ATTGTCAACATTGCAATATGATGTGTGATAGTGGTTTTCTCTGGCATATTGTTCATG ACA
TATCTCAACGTCATGCTTTCTACCATCACCGTAATAGTTGGAACTAAAAAGTTGGTC CAC
ATCGAGAAAACTAAGAACGCCACGAGCAGCTTCAGATTCCGGCTCACACAT
SEQ 24
ATGCCTTCACTTCTTCAAATTTTCCTTCCTTTGTTTCCATTCTTTTTCTTGGTTTCTTTC TC
AGTTTCTCACGGACCCTTTTTGCCAAAGGCCATTATTCTTCCTGTAAACAAAGATCT GTC
AACTTTTCAGTATGTTACTCAAGTTTACATGGGTGCTCATCTTGTTCCTACCAATTT AGTT
GTAGATCTTGGAGGTTCATTTCTCTGGACTAATTGTGGCTTAACTTCTGTATCTTCA AGT
CAGAAACTTGTCCCCTGTAATTCACTCAAATGCTCAATGGCTAAACCTAATGGTTGC ACT
AACAAGATTTGTGGTGTACAATCAGAAAATCCTTTTACAAAAGTGGCTGCAACAGGG GA
ATTAGCAGAGGACATGTTTGCTGTGGAATTCATAGATGAGTTAAAAACAGGTTCAAT TG
CTTCAATACATGAATTCTTGTTTTCTTGTG CATCAACTACTTTGTTG CAAG GTCTTG CTAG
AGGTGCCAAAGGAATGTTAGGACTTGGAAATTCAAGAATTGCATTGCCATCTCAGTT GT
CTGATACATTTGGTTTCCAGAGGAAATTTGCTCTCTGTTTGTCTTCTTCAAATGGTG CTA
TAATATCTGGTGAAAGTCCTTACTTGTCACTTTTGGGTCATGATGTTTCAAGATCTA TGC
TTTATACACCTTTGATTTCATCTAAAGATGGTGTTTCAGAAGAGTATTATATCAACG TTAA
ATCCATCAAAATTAATGGCAAGAAACTGTCGTTAAACACATCTTTGTTTGCAATGGA TGA AGGTGTTGGAGGGACAAAGATTAGTACAATTCCCCCTTTTACCACCATGAAAAGCTCAA
TTTATAAGTCATTTATTGAAGCTTATGAGAAATTTGCTATTTCCATGGAATTGAATA AAGT
G GAAGCTATAG CACCATTTGAG CTTTGCTTTAG CAC AAAG G GG ATAG ATGTCACAAAAG
TGGGGCCAAATGTGCCAACTACGGATCTTGTGTTGCAAAGTGAAATGGTTAAGTGGA G
GATTTATGGGAGAAATTCAATGGTGAAAGTAAGTGATGAAGTGATGTGTTTGGGATT CT
TGAATGGAGGGGTGAATCAAAAGGCTTCAATTGTTATAGGGGGTTACCAGTTGGAGG A
TAATCTTTTG G AGTTTAACTTG G GAACTTCTATG CTTG G ATTTACTTCTTCACTTTCAATG
GCAGAAACAAGCTGTTCTGACTTTATGTTCCATTCTGTATCAAAAGATTCAGCTTTT GAT
TCT
SEQ 25
TTAAGAAGAATGAGAAGTAAACTTATTTGTTGAATTTAAGAGGTAAGAATATGCAAAAGT
AGCATGAATTGCAGCACCAATTGGAAGGACATCCTCATCAATGATGAAATGTGGATT GT
GTGG AGG GTAAATAG CAC CAATCTTTTCATTTTTTGTTC CCAAAAG GAAGAAG G AACCA
G GAACTTTCTCTAAAAACACTG CAAAATCTTCACTTC CCATG AAG CTAGGTG CTATTTTG
AAACTCTCTTCCCCAACAATCATTTTTGAAACTTTTCGGGCATGTTCGTATATTCTC TCAT
CGTTTATTGTTGGAGGAAGTGTTGGATTTTCTCGACCATCAAAGTCAATCTCGACCG TA
CATCGATGTACTGCTGCTTGTGCTCGTATCACCTGAAAATTTTACCAATAAAAAGTT TAA
TTACCAAATATTGAATATAATAATATGTTCTAAAAATAACATGGATGTCTATTCCTA ATTAT
TAGCAAGTTATTTCATTCTCCCTAGTTGATTAGTGAATTACTGAAAGGTTATGGCGT TCT
GATATTGTTAAATGTACCACTTATTTTATTGAAAAGTTATATTGCATCTCTAAAGAA CTGA
AAAGTCATTTGACCTCTTGGCTCGTGACCCTTCTCAAAAAACAGTTCTTTGGCTATA AAT
AAGATTATTTTGTGTTGAATGAATATATCAAGCAACTTGAAAATATTAAAACCTCTT TCCC
GAAACAAATCCTACAATTTCCTCAAGTACTCGTCTTTTCAAACAAAGTATTAATGAA AAA
GAAACGTAACTTGTTTGAATAAATAAAATTTGCACATATAAACTTTGTAAAGAGACG TGT
GGGACTTTGAATATTGGTCAAAGTATCCAAGATTTTTGTTCTAAATTACAAGTAGTT TAC
CTCTTCAATTCTTTTCCTCAAACCGTAGAAACTCTTCTTACTGAATGCTCTATAGGT CCC
GGAAATTGTAGCTAATTCTGGTATGATATTAAATGCATGCCCCCCTTCAATCATGGC AAC
AGAAACTACCTGAAATTTCAAAAACTAAATATATAAGAATTGATAAAATAAAAATTT AAAA
TTTGTTTGAATAAGTTTGGAAAAGAAAATTGTTAATCTCTATGTTTCAAAAAGATTG TTCT
AGTTTGACTTGACACAAATTTTAATAAGGAAGAAAAGACTTTGAGATATGTGGTCCT AAA
TAAACCATATCATTTGTGTGACTGTAAAACTTTTGAAACTTGTGATCTTAAACTTAC TATA
ACATTTGTGTAACTATAAATGCTTCTAATAAAAAAAATATTAAAATTTGTCAATTTT TTTGA
AACAG ACCAATAAATAAATAGTGTCAATG CTTTTG AAACG G AG GTAGTAC CTGG G ATTC
AAGAGGATCAGTCTCTCTAGAGACAATACTTTGCAAACTAATAACAGAAGTAGAAGC AG
CCAAAATTGGATCAACAGAATCGTGTGGAACAGCAGCATGACCTCCTTTTCCTCTAA TT
GTAGCTTTAAAGCTTCCACATCCAGCCAAGAATTCACCAGGCCTAGATGCAACTACT CC
ACTTTCATACTTATGAACTAAGTGCATTCCAAAAATGGCTTCCACATTTTCAAGAAC TCC
TTCTTCTATCATATCTTTAGCCCCATGCCCTCGTTCTTCAGCTGGTTGAAAAATTAA CAC
CACTGTTCCCTGCAATATTATTATACACGGTAATTAAATTCATTACTTCAACTAATC CATT
AGCTTAGAAGTATGTATTTAGAGCTTAATTAAGGGTTTTATTTAACCTGTAAATTGT GTCG
GAGTTGTTGTAATATCTTGGCAGCACCAAGAAGCATGGCAGTATGGGCATCATGAGC A
CAAGCATGCATTTTTCCATCAACTTTGCTCTTGTGCTCCCATTTCGCCAATTCCTAC ATT
AGAAGAATTCAACTTTGACTCACGACTCTTTTATTGATCAAATTATTCACTTTATAG ATTT
TTGAAGAATTGATTAATCGAGAATAAATATAGAGTCCTACTGTAGAGGCATATTATA TGA
TATTGACCTCTACAACTTATAAAACCCGACTTATGATCTTTATTTTCTTTTTCTGTT GTTTA
GATCACAATTGATATTTGATGTTCAAATTAAATGTTTTAGCGGTGTAATATTATTAC TTAT
GGTACTTTCGGCCATCCTATCCAATTTTACTACTAGGAAAATAAAAAACGTGTTGAC CCT
TTATTCCACAACATATGAAACTAAAAGTAAAAAGAGATGGTCACCATAGAAGAAAAC TAG CTAAAGTATATACCTACGAATTGAAGTGTTTTCTCTTTCCCAATGAAGTTCCAAAATTCA A
GAATCTCTTTGTTTTAGGTATAATTAAGCTGTTTCGAACTCTATACTTAATTCAATA TTAA
GAGAGATCTGATTTATTACTTTCCTTTCATGGCTTAAATATTACCGCCGCCGCCGCC ATT
TCTGACAAAAACGGAAAGTAAACTGCCGCAAGTAATTTCTTCTTCTGCCATAGTTAA TTT
AGTCGCCCACAAAATTAATAAAATGACTCAAATTTACTGCCTACACCCTAGTTCCGA CCG
AATACAACATATAAATGATCCCCGTGCTGTTGTCATCTCGAACATCCTTAATAACAA TCT
CCAAAACCATTAATGAAATACAGACAAAGGTAAGAAGTAAATTTGAAGATATATAGT ACT
ATATTAGGCCTATAGATCTACCTCCTAAACTCCACAAACTGTTTAAAGTGAATAAAA CAT
TTAAAGAGTTCATATCAATTTTTTTTGATATGAAGAGTTATCCGTGGTTATAAATGA ACTA
AACGTGATACTAGTATAAATATTCTTACCGTTTTTTGTTTTGAATATAATTGCAGGG TTGA
GAAAATTTCCAAGCAGAGACTACTAACCTGAATAGGCAAAGCATCCATGTCTGCTCT GA
GAGCCACAAATGGCGGCTTACCGGAGCCGATGGTGGCAACAACTCCGGTCTTAGCCA
CCGGCCACCGGTACTTTACTCCCATCCGATCAAGCTCCTCTCTGATCAAACCACTCG TC
TTAAATTCTTCATAAGCAAGTTCTGGGTTCTCGTGAATTTGTCTCCTTATTTTCATC ATCC
ACTTCACTGTCTCCGTAGCATTTGCTAATTTTGTAATATAATCTTTCACGTAACAGT TTTG
ATCTACCAAAAACGGATTCAAGCACTCATCATCGCCGTGACACGAAGGAAAAACAAT GA
ACATACATACAAGCACCAAAATTAGAACTTCCTTAGCACCCAT
SEQ 26
ATGAAACTGAATCCTTACTCATGGACAAAGGTAAGTACTTGATTGTGAATTATAACTGTA
TTATGTACATAAGGTCGCTGCACAACACAAAATGTTGAAAATAAGATGGAATTATTA GGT
GGCAAGCATTATTTTCTTAGACTTACCAGTAGGCACTGGATTTTCCTATGCAAGAAC TCC
AACAGCTTTACAGTCATCTGATTTACAAGCAAGTGATCAAGCATATGAGTTCCTTTA CAA
GGTAATTAGATTCTTCACGAAATTATTAGTTAAATGTATTTTCTCCTTTGCCCCTCA ATGT
TGTTCAATATGTAGTAGAACAGTCAATAATTTTATGTTGTTTGCAGTGGTTCCTTGA TCA
CCCAGAATTCTTAAAGAATCCATTGTATGTTGGCGGCGACTCATATTCAGGGATGGT TG
TTCCCATCATTACTCAAATTATAGCAACTAGTAAGACTATATTTTCCCTCAAATAGT TGTG
AAACAAGTAATGGCAGCCTAAGGTAGTAAGGTGTTCTGTTCTTGTACTATAACATTT TGT
GGCCTTGTGATAATGCAGAAAATGAGATGGGAATAAAACCTTTTGTGGATCTTCAGG TT
TGTCATTTTTCTTGTATATATTCTCTTTTCCCTACGGATAAGCAGACGGATTACATA CCAA
CTCAGAATTTGTAACGAAATTGTTATGAGAATGTCACGACCCAAGCCCATAGCATGT ATT
GTCTGCTTTGGGCCTAGGCTCGCACGGATTTGTCTTTCGGGCTACGCCACCTCGAGC C
CCAAAAGCGCGTGCACCATGTGAACTTGTGTCATACCTTATAAAGTTCATCACTTTC CTC
TATTATTCCGATATGGGGATTCGTCTAAGGTGACATGTGCACCGCTTATTCAGAAGT TT
GGCAGCCTAGAAGCTAGTCAGTCCTACTTAACTTGCCCTCATCAGCCCCCTCCTTCA TG
GGCATCACACAGAATCAAAAGTCACTGTAGAATGTGAGTTGATTTGCAAAATGTATG AC
CTGATATCTCTCGTCAAGTGGTTTCAGGGATATTTACTCGGAAATCCATCGACTTTT AAA
G GTG AAAAG AATTATG AG ATTCCATTTG CTTATG GAATG G G ACTTATTTCTG ATG AACTC
TATGAGGTTGGTTTTCCTTTGGTGTTATATAGTACAGTCAAACCTTTCTATAATAGC TACA
TTTGTTCCGATATTTTTTGGATGCTATAATGAAGTGTTGTTATAGAGGATATATATT AGTA
TAACATAACATACAAAATCGGCTCCGAGAAAAACTTGGCTTTATAGTAAATGACTAT TAT
ATATGGATGCTGTTATACAGAGGTTTGACCGTAAGATCTTAAATATCCTCCAGTTAT GCG
CTTTAATTTAGTTTGCTTACATTGTCCTTAGAACTAATTGATTTCCCTTTCTCAAAT AGTC
CTTGACGAGAAATTGTAAAGGAGAGTATCAAAACACTGATCCAAGCAATACACAATG TTT
GCAAGATGTTCATACTTTTCAAGAGGTTGGATCCTATTTTGAGGAAAATCAAATATC ATC
TGTTTGTTTTATGATAGGTTCATTAACATACTGACCTTATGCAGCTTCTGAAAAGAA TTAA
TAATCCCCATATTCTGGAGCCCAAATGTCAGTTTGCTTCACCAAAGCCACACCTATT GTT
TGGCCAAAGAAGATCTCTTAATGTGAAGTTTCATCAACTTAACAATCCTCAACAACT CCC
TGCGCTAAAGTGTCGCGTGGGTACTCATCAACAAACTCTAGCATTCTTTATGCTATT GAT TTTTTGTTTCACTGAGATACTTACGAGAATTTACAACTTGCAATTGATTTAGAATGATTG G
TACAAACTTTCTTCTCATTGGGCTGATGATGGCCAAGTTAGAGAGGCCCTCCATATC CG
AAAGGTACGTTAGTTCTTGTTGGAAGGGGAACCTTGGAGCAACGGTAAAAATATCTC TG
TGTGATCTATAGAGCACGGATTTGAGCCATGAAAGCAGTAATGCTTGCATTATGATA GG
CTGTCTATATCACACCCTTGAGATGCGGCCACCTTGCATGAATGCGTGATACTTTGT GC
ATCATGCTGCCTTTTTTTTTGAAGAACAACAAAATTTAACAAAGTGTGCTACACAAA ACTA
AAAATATGATCAATTTGATTACAGGGAACTATTGGAAAATGGGTGAGATGTGCAAGT TTG
CAATACCAAAAGACAATCATGAGTAGCATACCATATCATGCAAACCTCAGTGCTAAA GG
TTACAGATCTCTTATATACAGGTTGAGTAAGATTGTTGTGTTTGCAAGATTGGAATA ACT
ACATAAATAGTTGAAGATTATTATCTCTGTGAAACTATTTACTTAGTTTTCTATGTT TTTTG
AATTAAGCAGTGGAGATCATGACAAGGTTGTTACCTTCCTATCAACTCAAGCATGGA TA
AAATCTCTTAACTACTCCATTGTTGATGATTGGCGACCGTGGATCGTTGACAATCAA GTT
GCCGGGTTAGTTTATGATGAAAACATTGTACGCTAGTCATAAGCTCTGTCAAGGTAT AG
AAGTTAAACTCATTTTTTGTCTTTTGCATGATTGTAGTTACACGAGAAGTTACTCAA ATCG
GATGACATTTGCCACAGTAAAGGCAAGATATCTCTTTCACTTGCTTTTCTCAGTTAA GTT
TGAAGATAAAAAATTTTGTTAAATAGTTGGTGTTTAAATTGCACTATTTTGTTACAG GGAG
CAGGGCATACTGCACCAGAGTATAAGCCTCGTGAATGTCTGGCCATGCTCAAAAGGT T
GATGTCTTACAAGCCTTTG
SEQ 27
ATGTGTGAACCGGAGTCTGAAGCAACTCGTGGGGTTCTTAGTTTTCTCGATGTGGACCA
ACTTTTCAGTTCCAACTATTACGGCGATGGTAGAAAGCATGACGTTGAGATATGTCA TG
AACAATATGCCAGAGAAAACCAGTATCACACATCATATTGCAATGTTGACAGTGATG AG
G CTATTG CTCATCTTTTACAAG AAGAATTGTCAG AGTTGTCCATCG CAG AAGATG CTGA
ATCTTCACATGCAGATGAGCAGTATTTTCAAGCCTCCACTGGTGTACAACATTGGCA TA
CTCCTCCAAGGGAGTACTATGCCGGTACATTTCTTGTTTTCAGTTTTGTGATTTTTC CTC
GAGTTTAAACATTTGCTAATTTATATATGATTCACAGGGCATGACACTGGTCTAGAA GCT
GATGATGTGGGGCCTTCAAGTTCTTGTTCTAGTCCTGGCGACAGATCATACGATGGA GA
AGAGTATACCTAC ACATTG GAAATACAAG ATG AATTTG AG CTTG ATG G AG AAGTAG G GA
AGAGAATAAACCAGCTGAGTGCTGTTCCTGTAAGTGTAGTGTCTGTTAGTATATCAA GA
ACACTAATTCACTTTGTGTACGAGGACATGTGCGGCGCTCTGCAACTTTGGCCATTT TC
TTGTCACTGCTTTAGTTTCTTATAACAATTTGTGTTATTAACCGTGCAGCATGTTCC TAGA
ATAAATGGAGACATACCTTCAGTCGATGAAGCAACTTCTGATCATCAAAGGCTGCTA GA
TAGGTATCTCCTGATCCCTCCGACTTAGGTTTTCAAGTTGACAGAAATTTTGTGTAT GAA
CAAGTTAATGACATATGATCTTATGGTAGTGCTTTGCTTAATTCTCTCTCAGATTAG CAC
TTTCCAAGGTAATGAAATATAAGTTCACTGCGAGAAATGCATACCTCTATTGTGATT AGG
TGTCCTGTGTCTCCAGAATCATTTCTGTATTTTTTTTAGGTTTTCTCTTTCTTTATT AATTC
ATTCTTTTCCCGGTGGTTCTTACTGCCTGCTATTTGAAAAGTAACATCAAACCCCTC ATG
C ATTTTG AG AAAAG AG AATC AAATG G GAG AG ATG G G AC C C G G G AC AAG C AC AG AAAAT
AATTGTTGTGGAATGAGCCAACTAGTATGTTGTGCTTTCTAAATAAACACTTACGAA GCA
CATTCCAGTAGATACTGTTAACCCTTTGTTTGCTTAAAGTCACAGGACAATGCATGC GGT
TTTTCATTTTGTTCTGTTTTTTTATTAGGTTTTGTGCCACCTAAGTTTCCTAGGAAG TTGA
TATTGTTGTGTTTTTTCATTAGGTTTTGTGCCACCTAGTTTCCTATGAAGTTGATAT TTTT
GCTAATTCATTTGAATGGTAATACTATTGCTATAATAACTTATTTTCTGCTAAGCAT CCCA
TG CTGTGATTTTTCTTG CTTCAG ATTGC AATTATTTG ACTTGGTG GAG CAC AAAGTG CAA
GGAGATGGCAACTGTCAGGTTATCATATTACCTGGTTGATTTATCTTGCAGTAATGA TAT
CAAACTGTCTAGATGCGCTCATGATTTGATTTCTAGTGGAAGAAAAAAACTGTATTC CCT
TTCCTTAAACTACTTACAGCATAAGTATTAATCTTAAACATAATGTTTATCAGTATT CCTT
CCTTTTGGAATTGTTCTGGTAGAAACTTGTTCTTGGTGTTTGTGACAATGTCTTAGC TTT CTTTATTACTTTTTAGTTATGCTTGAAAACAGTGGAAACAGTAAAGTTATCTCCATATAA A
GTTGTCTCTGTGTG ACATATAG GTC ATG AGTTTG AG CCGTG GAAG CAG CCATTAATGCT
TGCATTAGGTTAGGCTATCTATATCACACCCCTTGGGTGAGGCTCTTCTCGGGACCC TG
CGTGAATGTGGTCGGGACCCTGCGTGAATGTGGGATGCTTTGTGCACTGGGCTGCCA T
TTTAGTTATGCTTGAAATTCTCAACTTTTTAATTTTCATATTTGGTTTTTACTTGTC TATTC
TTTCCATTAGCCTTAAGCAGTTGCTCACTGTTCATCATATTTCATTTAAGTTTGTGA AGTG
TGTGAGACCATATACAATATTGCTGAATTATGATATACATTGGGGATTGGCAATTTC ATT
TAAATTGAATTCTTTAGTGATTAGTTCAATAAAGTCACAAAAAGAAAATCGGACTTG AATT
ATTGATTTGGGAGTTATTTAATTATGAAATGAATACTAGTAAGAAGCGAGTCAAGAA ATT
TGAGACTGAATGTGAAAATTGGATGGAAGATGTTCACGGAGAAAAGCTGATTAATAG TA
ATGTTGGTAAAATAGGAAGGGATTAGAACTCGGATAATGAATGTAGAGCGAACTACA AA
ATATAAGAAGTTGAGAGTTCGGATGGAGTTGGGGGGATGGGTGGTGAATGGAAGTGG T
TCATAATAGGATTTTGGAGAAAAACTGGAATATGAAAACTCAAGTGATATATCGGCA GGT
TG CTTGC CATG CCAAGTG CCAACTTTATGAAAATTATTTGTG GATTTTCAGTTAGTTTGA
CAAAAG G ACAATTCAAATTTTTTCTTAACTCTTTAGTATTATG G G GGTATG G G AACTTAG
CCCGTTTTTCCTTTCTGTGAATTAGGTTTCACCTCCGATTCTAAGTTTGTTGAGATA AGG
TTCCATCCTATCTCTTTCTAATGCACATCTATTCACCTGTCTTCTTTTGTTCCAGTT CCGT
GCTTTATCAGATCAATTCTATCGTACACCGGAGCACCACAAATTTGTCAGACAGCAA GT
AGTCAGTCAGGTGCTTGTCTAAGTGAATAAGATTATCTTCTCTATGATATTTCGGTT TTC
ATTTTTGAATTATTTCACATACATTTTAGCTGAGTAGGTTTTTTAAAATCATTTTGT TTTGT
CAGCTTAAACATCATCCAGAGATGTATGAGGGATATGTCCCAATGGAATATGGAGAG TA
CTTGAAGAGGATGTCCAAGTAATTCTCAGACCTTTGTTTTTCCATTTTGAAATCGTG CAT
TACCTTCTCTCCTTTAATTTACATCTGACTTTTATATTGGTTGAACCAGGAGTGGGG AAT
GGGGCGATCATGTTACGTTGCAGGCTGCTGCTGACTCGGTACTGATTATTGCCCTTA CT
TTGGTTCCTTGGTTTTTCTCTCCATTTGATTACTGCCTTTTGGTTTGTTTCTTAACC TTGG
AAATAAGAACCGTAAGCTATTGTTGGCCAAACCATAACTTGGATGTCCTCATATGGT GC
CTTCTTTGGATATTGTAGTAGCTTGTTAATTGCAAGTTGATGGTATGTAGGAAGTAA CAT
GCTTCTATAGAATTTGTGATCCTGTAGTTTTTCATGAGTATGTGTTAATCCTTTATT TTGT
AGTGTG G AAGAAAATGTGTGTTTATGTG CCTCTCTTG CTATTTTTTTCTTTCAGTCTC CAT
TGTGTGGTCGTGTTGCATTTTTTATTGTATATGCTTAGCTCCTTTACGAAGATTTTG CTTG
TGATAATATTAGATGAGGATGGATTGGTTTGTGAATGACCTATTTGTGCTGCACATT GAA
TTGTCTTTTTGTCTTGTGAGCAGTATGGTGTGAAAATTCTCGTTATAACGTCATTCA AGG
ATACATGTTACATCGAGATTCTTCCGAAGAATCAAAAGTCAAACAGAGGTAACTAGT TCT
CAAATTTTGAATTGCTTTTACCTTTCTGTAATATCCTCTACTTCATAGAATGTGATT GAAC
AAGTTTTCATTAACAAAATATTTGGAAGTGGTAGTGGCATGAATTCTCAATTTCCAT GTAT
CTCATATACAAATACATGTGTCACCGAATTGTGTACGTACTTCCAGTTCGCAGTTAA GGA
TTTTCCCCAACTTTAATTCCACTATGTGAGCAACCCTTACAAAGTTCTTCAAACATT CTTT
ATTGGTTATTTCTGAAATGTGGTTTTAGACTTATAGATAATACCAGAATATTATCCA GGG
CCAAATTTCAACAATTCAAACAAGGAAAGATAGTATGATCATTCTTAACACCACTGT AGT
TACCCCCCATTGAATTTTCCGATAACGAGGGCTACACATGGAGATGAAGGAGATGAT CT
G ATTATG CATATTGTAGTAATAGTAGTAG ATAAATTTATAAAAGTAG CTAATACTAG ATAA
CCCGATTTATTATCTGAGTTTGATTCTTGAAACTTGTTCGAGTTATGGAGAAACTAG TAT
GGCCTTTGTATAGGACCTTTCCGAAAGTCCCAGATCCGATAACGATCTGATAAAGTA GG
ACCACTGGAGTTATGAGACAACATTAGTCTATAATATAAATGATCAGAATATTGCAA CAG
AAAACATCAGTTGTCTCTTTCCTCTCTTCGATAGAGGCAAAGGAGATTGAATCTAAT TGA
TTCCGAAATGCTTCATTGGATATTCAATAGTTAAATCGAACTATTCATCTTGCAACT CTGA
AAACAG ACGTG CTATACATG CAGTATAAGAG CAACATAATTAACATACACTAGGTTG GA
GGTTTACTTATCTATGTTTAGGTGGTCGGTTTATGTGCAAGTTTTCCATTTTTCAAC AATT
TAGAGTTATCAGAG CTACTTATAAG ACATG ATACTTTTG CTGTATTTAACTTTTTTTGTAA
AGTTCAGCAAGAGTTTTTGCTAGTCCGAGTGGAAATAATTATTTGTTGACTACCCAT TTT
CCCTTTTTACTTGAGAAAAAGATTGAGAGGGGGGAGGCAGAGCAGCATCATGAGTCA T CTGGAGAATGCAAAGAGTACTTTTGATCTTTGGACTGAAGAGTTAGAACGTTGGCCTTC
CTTGATAACAAAATTTTCAAGGGGAGGGGGGATTAATGGATGACTATGAACCGTTAT GG
ACAATCTTAAGCAAATCCGATCTTGGGTTTCATGAAAAGGAGATTCCCAAGGGTTGA CC
AGTTTTG GTTGTTTTG AAATCTAAAAG ATG G ACTG ATGAG CATCCTTATTG CCTTTTTAG
AGACCTAAAGTTGTCTACTTTAACATATTGAATGATTATGAGACAGTTCTAAGTTTA ATGA
ATGGGGACAGCTTTATATCGGTCATGTATGCATGACATTTATTCCATGCCATTATAT GAT
ACCTCTACTGTG AAG CCAAG G GTG G GC ACAACTTAG CAATG GATACTTAAATCTGG AGA
TTGCTTGGAACATCTTTGTGCATATGCACTTTGTAAGAGCGATGTTTGAAGTCTTGA TAT
TG ATTCG CTTACAC CAATTTAATTGTTTTGTTTCTCTC GTTTTTCTCTTG GTG CAGTCATA
TACTTAAGTTTCTGGGCAGAGGTGCACTACAACTCAATCTATCCTCAAGGAGGTAGT TG
CTTGTACATTTATCTCCTGTTCCTGTTTATTTTGTCTTAGCATTGAGAGTTGAGTGG GAG
TTTGCTGATCTTGCTTAGTTTAGAATTCCATTCTATTATCATATTATTCTAATACTG TAGTT
TGATGCTCCATTCTAGACTTCCTGCCATTTGATCTTAAGAAGAAGAAGAAGAAGTGG AG
TTTCTGGAACAAGCAT
SEQ 28
TCAAGAATCAAAAGCTGAATCTTTCGATACAGAATGGATCATGAAGTCAGAACAGCTTG
TTTCTGCTGTAGACAGTGAAGAAGTAAATCCAAGCATAGAAGTTCCCAAGTTAAACT CC
AAAAGATTATTCTCCAACTGGTAACCCCCTATAACAATTGAAGCCTTTTGATTCACC CCT
CCATCCAAAAATCCCCAACACATCACTTCATCACTTACTTTCACCATTGAATTTCTC CCA
TAAATCCTCCACTTAACCATTTCACTTTGCAACACAAGATCCATAGTTGGAACATTT GGC
CCCACTTTTGTGACATCTATCCCCTCTGTGCTAAAGCAAAGCTCAAATGGTGCTATG GA
TTCCACTTTAGTCAAATTCACGGAAATAGCAATTTTTTCATAAGCTTCCATAAATGT CCTA
TAAATTGAGCTTTTCATGCTAGTAAAAGGGGAAATTGTACTAATCTTTGTCCCGCCA ACA
CCTTCTTCATCCATTGTAAACAAAGATATGTTTAAAGACAGTTTATTGCCATTAATT TTTA
TGGATTTGACATTGATGTAATACTCTTCTGAAACACCATTTTTAGATGAAATCAAAG GTG
TGTAAAGCATAGATCTTGAAACATCATGACCCAAAAGTGACAAGTAAGGACTTTCAC CA
GATATTATAGCACCATTTGAAGAAGACAAACAGAGAGCAAATTTCCTCTGGAAACCA AAT
GTATCAGACAACTGAGATGGCAATGCAATTCTTGAATTTCCAAGTCCTAACATTCCT TTG
GCACCTCTAGCAAGACCTTGTAACAAAGTAGTTGATGCACAAGAAAACAAGAATTCA TG
TATTGAAGCAATTGAACCTGTTTTTAACTCATCTATGAATTCCACAGCAAACATGTC CTC
TGCTAATTCCCCTGTTGCAGCCACTTTTGTGAAAGGATTTTCTGATTGTACACCACA AAT
CTTGTTAGTG CAACCATTAG GTTTAG CCATTGAG CACTTG AGTG AATTACAG G G GACAA
GTTTCTGACTTGAAGATACAGAAGTTAAGCCACAATTAGTCCAGAGAAACGAACCTC CA
AGATCTACAACTAAATTGGTAGGAACAAGATGAGCACCCATGTAAACTTGAGTAACA TA
CTG AAAAGTG GACAGATCTTTGTTTACAG GAAG AATAATG G CCTTAGG CAAAAAG GGTC
CATGAG AAACTG AGAAAG AAACAAAG AAAAAG AACG GAAACAAAG GAAG G AATATTTGA
AGAAGTGAAGGCAT
SEQ 29
TTAGGCCTCAATCAGTTCTCTAATTGGTTTGCTGTTTATGTTAGCTGATGGGATATTAGT
GAATTCAGAGAGTATAGCTCTGAATTCATCTCCTGTAAGAGTCTCCTTTTCTAGCAA CAC
ATCCACTAATTTGTCGATTGCCTCCCTGTTGTTCCTTATGTGGTTCTTTGCAATTTC ATAT
GCTCTCTCAATTATGTGCCTTACCGATGCATCAATGTCTTCTGCTAGTTTCTCTGAC ATT
TGATTCCTCGCCAGCATTCTCAGCACCACATCACCACTCTGTGTTGCTGGATCTGTT AA
CGCCCATGGTCCTATCTCAGACATCCCGAACATTGTCACCATCTGCTCATGATATAA AC
ATTGGCAAGTTAATACTTGTGTGTATTCGAATATGTTGTTCTCTTTTAATGTGGTGC AAC AAGATGATGTGTTAAGTAAATACCTGTCTTGCTATTTGAGTTATTTGTTGCAAGTCTCCG
GCTGCACCAGTAGTGATTTCTGCTTCACCAAAAATTATTTCCTCTGCTGCTCTACCT CCT
AAGCTTCCAACTATTCTAGCAAAAAGTTGCTGCTTAGATATCAAGGTTGGATCTTCA CCA
GGAATAAACCATGTAAGACCGCGAGCTTGCCCTCTTGGGATCAATGTAACTTTCTGT AC
TGCATCATGGCCAGGGGTCAATGTCCTATAAGCACAAGGACACATTCTTTAGTACTG TG
TCTTTTGATTACAAATAACAAACTGAAAAGATTGAATACTTAGACAGCTTCTTAAAT TTGT
CCGGTTTTTTCATCTAAACACCTTGTCTAAGGGCCTGATATATTGAACACTTGATGT TAG
TTGAAAATTCAATAAGGAGCAAATTACTCCCTTTTTTTGCTTCATGTATAATCTAGT ATAA
ATGAAAATAATGAGAGGAAAGAAATGATTGTTAACTTACGCGCAGACACCATGTCCA AC
TTCATGATATGCTACCAAAATCTTGTTTTTGCCATCTGTCATCTTGGTTCCTTCCAT TCCA
GCAACAATTCTATCGATGGAATCATCAATCTCTTTCGAGGTAATCTTATCTTTTCCT CTTC
TTCCAGCTAGAATAGCAGCTTCATTCATGAGGTTTGCAAGATCTGCACCACTGAATC CT
GGAGTTCTCATTGCAATAACACTTAGAGACACATCTTTATCAAGCTTCTTGTTGTTA CTA
TGAACCTTCAATATTTCTTCCCTTCCTCTTATATCAGGCAGTCCAACACTTACCTAA TAAA
ATGAAATATCAATATAAGTGAAGTGTATTCTGGAAACTGTATAATACACCTCATTTT ATTG
GAATTTTACAATCAAAATCTCATTTTATACCTGTCTATCAAATCTTCCAGGTCGAAG CAAA
G CTTGATCAAG AATTTCAG GCCTATTAGTG G CAG CAATG ACAATG ACTCCAGTGTTTCC
AGTGAAACCATCCATTTCAGTGAGAAGTTGGTTAAGTGTCTGCTCTCTTTCATCATT TCC
ACCGCCAATACCAGTTCCTCTTTGCCTCCCAACAGCATCAATCTCATCAATAAAGAC TAA
ACAAGGTGAATTTTCCTTTGCCTTGTTGAATAAGTCCCTAACTCTAGAAGCTCCCAC ACC
AACAAACATCTCAACAAACTCTGAACCAGAGAGAGATAAGAATGGAACCTCTGCTTC TC
CGGCAATCGCCTTAGCTAGCAATGTCTTCCCTGTCCCTGGTGGCCCTACTAAGAGAA CT
CCCTTTGGTATCTTTGCCCCAACTGCTGCAAACTTTTCTGGGGTTTTCAAGAACTCA ACA
ATCTCTTGAAAATCTTGCTTTGCATCATCTACCCCAGCCACATCATCAAATGTTACT CCT
GTATTTGGTTCCATCTGGAATTTTGCTTTGCTCCTGCATTATTCACAAACAAATACT AGTT
ATTAGTAGTTGTTGAAGATTACATCACTAGACATAATGTTCAATCTTGATCATGTTT ATGG
AATTTCTATTATAGCATACTGTTGGGTTTCTTAAAGAGATGGAAATGATTGAAATTG TCTC
TCCTAAGTTTTATTAACTATAGAGCGATTTAAATAGCCAACTTGAAAATAAAATACACAAA
TTTATAAAATATTGAAAAACCTAAAATATCTCAACAACCTAAAATATCTAACCGAAA TTTA
AATTCAAACAAAGTAGACTACTTTTACCACTAAAAATTACTCCTTCTATTTCAATTT AGAT
GATACAATTTCCTATTAGTACGTTCCAAAAAGAATTATACATTTCTATAATTGAAAA TAAT
TCAACTTTAAACTCTTTATTTTATCTATTTTAACCTTAATAAAAAACTTTTATAACT ACACA
AATATCATGCCCCCCACAAAGCTTTTACCTCTTAAACTTTTTCAAAAGTCTTCTGTT TTTT
TTTTTTAAACTACGTGCCGAGTCAAACTAACTAATTTAAATTTAAACCGAGGAAGTA TTAT
TCTAGTAAATTAACAGTAACAGAAGCTATATACAAGACATACCTTCCTAATCCAAAA GGC
AGGTTTGGCCCTCCAGGAGTATTTGAAGAAGAGGTTCTCAACAGCAAAGAGCCAAGC A
ATATCAATGGAAAAGCTAAATTCCCAAGTAAATCAAGAAGTGGCCCTATGACATTCA TTT
CAGGGAGATGAGCAGCAAAATCTACATCCTTCTCTCTAAGTTTTCTCACCAATTCTG GT
GGCAATCCTGGCAACTGAACTTTAACTCTCTGGACTTTGTTAAGAGCAGGATTGAAT AT
CTCAG CAACAG CACTACTCTCAAAAAAATCAACTTTTTTC ACAG CACCTTC ATTCAAGTA
TTCCAAGAATCTTGAATATGACATTCTACTTGAAGTTGCTTCAATTGGTGCTTCAGT TTCT
GCTCTTGCTGGTTTAGCCAAAGTCCCTGCTACAAGGCTCAAACCACTACCACTCAAC AG
CTTCCTCCTATTTATTCTGGTGTCTGAATATGATTTTTGACATGGGGTTTCTTTACT AAAG
ATTTTAG G ATTGTTAGTATCCTTAGAAAG ATCTTG G GATTTG CATAG GG G AAATTGAATG
ACAGACAAAGAAAGGGCAGGGGACATTTTCAT
SEQ 30
TTAGGCAGTGGGATAAGAAGCGTCCATAGCAAGTCCACAAAGGCCTTCTTTCTCATGAA CATCCCTTTTGATGCGCATATATCCACTGTCACCCCATTTACTGCCCCATGAATTCTTTA TAATCCAATATTTTGTACCGTCAGTTGTTGCACCATATCCCACTGCTGTAACAGCGTGGT
TAAGCCAAGTGCTGCATGATCCACTGAATACACCACTTGAATAGAACTGGAAATCGA AG
CTACTCCCGTCTATTGCCACCGAAACAGGTTGATTAGCCACTGCCTGCAATAGAGCC TT
CTCACTGTTCGCTGGCACATCTTCATATCCTGTAATAAGAGGCGTAAGTCATAATTT CAA
GCTTATGGATTCGGAATATTTATCGTTTGAAGTTGCTGGGCTAGATCATAATTAAAC CAA
CTCACCCAATTGAAGATTTGTTCTACCCCTTATATTTTTATGGGCTTACCTGTAATT TTGG
CTGCTGAAAGAGCTGACTTTTTCTTGTTGCAGACACCATCTTCTCCTTTGTATGGAT AGT
TTACTTCTGTTGTGAGGCCCTTGTTTTTCAGGATGAAATCAAAGGCAGTGTCCAAGA GT
CCACCGCTGCAACCTTCGTCCTCGCCTTCGACATCACAGTCTACAAGCTCTTGCTCT GA
TAAAGGGATCAACTCTCCTGTTTTCAGTTGGTGTAGCCCTTCCAT
SEQ 31
ATGGGATGCCGCATGAAATTCTTGAATGTGGTTTTGGTGGTGGCGGCGGTGATGGCTG
CTGCCGCCGCCGTGGCCTTCGGAGCTGAGAAATTGCCGGCGGGAGTGCTTAGTTTGG
AAAGG ATTTTTCCTTTG AATGG G AAG ATG G AG CTG GAG GAG GTTAG AG CAAG GG ACAG
AGCTAGGCATGCTCGAATGTTGCAGAGTTTTGCTGGTGGTATTGTTAATTTTCCTGT TGT
CGGTTCATCTGACCCTTATCTTGTCGGGTAATTACTTTGTTACGACCAATTTGATAA GAT
TATATTTGTGATGTTTTTAGTGTTTTCTTCCTTTTTCTAATGTGGAGTTATATTGCT ATATT
TGCTATATTTTATTTGGTATGATGACGATGATATGGCTTGAGCTTAAATGGAGAAGT GAT
GATTGGTATAGCGGACTCCAACTTGTTTGGGACCGAGGCGTTGTTGTTGTTGAGTTT TG
TTTGGTTAATTTAGTCATTTTTTGGAAAGTTTGATTCTTTATGATGTTAAAACTTGG AACT
TTTGGTGAATGTATGGAAGCTATGGACTATTTGATGTGTTATTAACGTCTTATGAAT TTG
ATCTCATGAATTCTGATGTAAATTTTGTTTTAGTTTGAGGGTAATTGATTTTAAGTG TATT
AAAGTACTTGTAACACAATG AATTTTG GTGTG CTGTTTTTCTTTTCTAG GTGCTTTCTTGT
TAATTATCGGTTGGATGGTGTTGTTGATGGTAGTGATAGTTCTGTTTGCTTTATCTT GTA
TCGTTTCTTGTTCAGGTGCAAAATATTTAGGTACAATTGAATGATGAGTTTCGTGTT GCT
CTGAATATGAACATTAGCTATATCAGTTTGGCATTTGCTTTCTGTTATTTGTGGATG AGG
GATATTCTTATTGATTTGACTATCAATTTTGTTGACCATCGTCTCTTTCTCTCTCTT CTAT
GCTTTTGGTGTATTTGTAGCCTTTATTTTACAAAAGTAAGACTGGGAACTCCACCAA GAG
AATACAATGTGCAGATCGACACTGGCAGTGATATCCTATGGGTCACATGTAGTTCCT GC
GATGATTGTCCTCGGACAAGTGGACTTGGGGTAACTCATCTTCCCTTCATCTTGTTA TTA
CTTTTTTAGTTTCTTGTTTAAAGTGTGGTGAAGGAATAAACTGTTACGTGGGTGCAG GTT
GAGCTCAACTTCTATGATGCTACCATCTCGTCAACTGCTTCTCCCATTTCTTGTGCA GAC
CAAGTGTGCGCCTCTATAGTTCAAACTGCCTCCGCTGAGTGCTCTACGGAAACCAAT CA
GTGTGGTTACTCCTTTCAATATGGAGATGGGAGTGGCACAACTGGCCATTACGTAGC C
GATTTACTATATTTTGACACAGTCCTGGGAACTTCTTTGATTGCCAACTCTTCAGCA CCG
ATTATTTTTGGGTGAGTTCTTATTTTTTAAATACCCCTATATCTATACTTAAAATTT CATTA
GAAATAGTTGTGGGTCATTTGAACCAGAAATATCTTTGGCCCAATTTACAAAAAAAC CAT
GTTTGTTTACTCAAAG CTTATACTTG GATATG ATTTAAAACAGGTG CAG CACCTCTCAGT
CTGGGGACTTGACCAAGACGGACAGAGCAATTGATGGGATATTTGGGTTTGGTCAAC A
GGGTCTTTCAGTAATATCTCAACTGTCTTCTCATCGGATTACTCCTAAAGTATTTTC ACAT
TG CTTG AAAG G AG AGG G AAATGGTG GAG GTATACTAGTCCTTG GTGAG ATTTTG G ATC
CGAGAATCGTATATAGTCCCCTTGTTCCGTCACAGTACGTATTGTTACAGTACAATG AA
GTTTCTTTTCTTGCTTATGACGAATATAGAGATTTAATTGTTTTCATCTTTAGTGTG CCTT
GTGCTACATGATATAAAACAGTTGTGTTCTTTATAGTTTGTGATCCAGCTTGAGCAT GTG
AAATATACCTCTCATGCGCTACATCCTGATTTTATTGAAATTTCGTCACTATATTAT TGGT
TTTGCATCTACAGATATATAGTAGTTGGGTCTTGGGAAGATGACATCAATGAAACTT TAC
TTTGTACATATAAAAAAGGGCAGCCCGGTGCACAATTTTGAGTGTTATATATATATA TAT
ATATNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNN NTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTG
TGAACATGTCGATATATGTGTTTATTTTTTATGTTTTTCTACATTATTTGTTTTCTA ACTAT
AAGCACGGGGATTGGCTGGGCACTAGGAAAGAAAATGGTTTGTAGCAAGCTTGATTG T
ATCCGCTTTCCACTTTTGCAGGGCGCATTACAATGTATATCTGCAGAGCATTGCTGT TAA
TGGACAGTTGGTGCCTGTTGATCCATCAGTGTTTGCGACATCTGGCAATCGAGGAAC TA
TTGTGGATTCTGGTACAACTTTGGCTTATATTGCCACAGAAGCTTATGATCCCTTTG TCA
ATGCTGTAAGTTCCTACATTTTGCCAATTTATTTACTCCCTCCGTCTCAACTTGAAT GTC
CAATTGCTCTCTTTGTCTATACCAAAATTATATCCACATCTCCTAAATATAAAAACT GACC
ACATAAC AACAAG AACAACTACG CCTGTTG G G GTTG G CGAAAAAG G GCAG ATAACTGTT
GATCAAAACCCCTGAAAAATGTTTTATCATTCATCTGCAGATAACTGCTGCTGTTTC ACC
ATCAGTTAGGCCAATCATCTCACGAGGAAAACCGTGCTTTCTAGTGTCCTCGAGGTT CA
TCCTTGTATAATTCAATAGATATTTACTTTGAGCTTTTAATGACAAAAATGTCTTTA CCCA
ACTGCTTTGGTGACTCGAATATGACACCCTGTTTTGTTCTGAAAACAGCATAGCAGA GA
TATTTCCCCCAGTTTCTCTAAACTTTGATGGTGGTGCATCGATGGCTTTAAGACCAT CAG
ACTACCTTGTGCATATGGGCTTTGTCGTGAGTACCAGAATCTGTATTGTGTTTGAAT GTC
TTCTTGAAGTCTCATCTGAGACTATATCAACAATGCTATATGCAGCACTATTGTCTT TTTG
ATGAACTACAAAG CTAAGTG ACTCAATTGAATGTTTCTAACAG G AAG GTG CTG CTATGT
G GTG CATCG GCTTTG AAAAACAGG ATCAAG GTGTAACAATTTTAG GAG GTTG GTTCCTT
GTTTACTACAATATTTATGCCTCAACCCCAAGTGGGTTCGCTGGTGTATTTATCAAC ATA
TCTTCTAGGACTATGGTTACATCTAATATCTGCTGCATCATTACGTGTGCTAGCGTC CTT
AATGGGTGCTTAAGCACTCCATCACTCAAACCATCTAAAAAGGACCTAACTATACCT TGT
CCTGTATTAATTTAAAAAAAAATATAACTTGGCGAGTGTCTGCCTAGATGGAATGTA CCT
TAGTGATTGCATGCTTAGATAAAGTCATCATTCCTGCAGATATCGATGTTGATGAAA CAC
GGCCATCCATTTTCTATTAGAATATTAGGTCATTGATTTGCAATTTGAAAATTGTAG CTTG
AAGATTGCAGTCCCTTTGTCTTTTCTAATTTTGCATCCGTTCTTTTATTGTGATTGA CCAG
AGGATAGGTATACTTGTAGTACCAGTAATTCGGTTAGATATGTTCCGTAGCGTGCCA AG
GAATTTATATTTCCTGTTTTGCCTAGTTTTTCGAAGTTTATGTCAAAGTTACTTGCT TTGG
ATTCCGTGCGTGGGACAACTATATATCTTGTAGAGGCAGTGATTGTTACTTTGATGT TGA
AAG ACAG CAG AGG GTGG AATTCATAG AAAGTTAATACGTG GAAAG GG GTGTTAG ACTG
GATACATGACTGCAGTCCTAAATCAAGGTTGGAGTTGCATTTAATATGGCTATTCCT AAA
GGCTAAAAGACACAGTCTCATCAAGATTGTATGTTGAGAACAAGTCTACGAGAGTGC GC
TCTGACTGTTCTCAGAGATAGGTGCCTCAGACTCTAGGGATTAAGCTAACTACGTGT TT
TG AGTG GTGTTTTGTTCTTTCTTTTCCTTTTTCACTG CTG G CTAACACAACTTG AAG ACTT
GAATTCCAAACTGCTTCAAGTTTTAGGTCTATTATTCATGCCTTCTAACTTCTTAGA GTGT
TGGTTCCCTAGTCCTCTTTTTTTTCCAACTCGTGCATGCACGCTCATGCCCGTATAC GTA
CATGCAAGACATCTACAGTTGTAAACACAATTTATGACCAAATATTAACGGAAAGGT ATT
AATCCCTTTCTATTTCTTGTTTGTGTGTGTATGTGTAAACCAAGCACAATGTTATGA CCA
ACATTAAGTGAAGAGAATTAATCTTCTTTTCTTTTGGCCCATGTCCGTGTATTGTAT GCG
CATTGAATGTGTGAGCCTTTCCTTGTTTATCCTGGTTATGAGTTAATCGGTAGCATA CAT
GGTGAGGTTTCAGGATGCATGATAGTTGCAGATGATAGTGTGAATTAACAAAATTAG TC
AAAAG CAGCTC CATG GTGAAACACTTAG CAAG GTTTTGG AATAAGTG G AAACAAGATAC
TATCCAGGCAATGCAAGTTTACTCCGCCTAAAAAGTATTAAGGTGAAATGATATTAG ATT
ATTATGGTTCCTAAATGCTGGCAGTCAAGTTATTTAAGCTCAAATCTTCCAGGAGAA TAG
AGAAATGTCGTG GTAG ATGAG GAAG G AACACATAG AACTAAAATATG CTG GTTG AC ATG
AAAGAGTGTTGCATGAGTGTTATTAGACAGAAAGATTCCTAGAAAATTGAAATATAA GTT
AGTATAACGGTTGTGAGATCAACAATAATATGTGAGAGTGAACATTACATGTCCGCT AAT
TGAATATCATAAAAATGCGGATGTAAAGATGGTTGTGCAATCATATAAGATTGAACA TGA
TTAGAAACAATCACTTTTGTTGAAGG GTGCAAGTAGG G CAC ATATAG AG CATGAAAAG G
ACGTCACATGAGATGGTTTGGCCATATCCTATTAGTCCGCCATATGAACCGGTTATT AA GTGTG G CAATATTGTGTTTAAAGTGCTG AAAG GAAC GAG GTAGACTG ATG ACTAC ATTC
AAAAATTTGTCTCAAATGACTTAGAATCTCATGGAGTCAATACGGCTTGAACTATAA ACA
AAACCTTATGGAAGAAATGGATCCATATAGGCAATACTAACTAGTTGAAATAAGGCT TAC
TTGATTTTACTCTACTGGGTGCAGTATTTGTCAGGAGTCTTTATAATTTGGTTAGAG ACA
TGTTTGTAAGTGTTGGTATAAGTTAGAGATTTAGAGAACCTCTAGATTTAAGAGAAC CCC
TTGTCTTAAAAAGATTATGTACTATCGGAGTGAATTATTTCAAAAAAAGAAGATTAT TTAC
TATCAGAGTGAATTGGGTTCAAATAGAGCAGAATGGCCCCAAATGATATTAGACCTC AA
CTAGTTTGGGACAGAAGTGTAGTTGATTTATTGATATGCATCTTATGTGCAAATTTT ATTT
TAGTCATGCGTGTGCCCTTGAGCTCCTTCCTCTTCCCTCTTCCCTCCTAGTCATTCT ACT
AAATTTGGCTATTCAATTAGTTCTGGCTATGCTTTGGCATTTGAACCATGTATTCCT TGT
CAGTTGACTTATTGCATTGCTCTGCATCTTCATTTGTCGAGTATTCTGGATCCATAG ATA
AACAAGAATCTAGGTTGGTCTAACTGTATTTTTCTTGATTTTCAGATCTTGTTCTGA AAGA
TAAGATCTTTGTGTATGACTTAGCTCGGCAAAGGATTGGATGGGCAGATTATGATTG TA
AGTACCTCTTTTCAAAAATGAAGTCAACATTTGCTTTTGTCTCTATTTTGCCCCCTT TTCT
TTTGGGGTGGGGAGGGGGTTGTTTGTTGTGACATGACTTACTTCTTCCCTCTGATTT TC
TCTCTATTTTTAATGGTTTTCATGATTAATGTTTTTATTATAGGTTCATCATCTGTG AATGT
GTCTATAACCTCTGGCAAGGATGAATTTATCAATGCTGGACAGTTAAGCGTGAACCG TG
CATCAGGCAGTTTGCTGTTCAATCCGCGGCACACTAGAACTATATTTCATCTGCTAT CG
TTGGTTCTGATGATTGGTTCCCCATTTTTAACT
SEQ 32
TCAAG CAACAATAG GGTATGATG CG CAAGTTGCAATACCTACAAACATATCATTAG GAT
CAGTTATTTACCTCATCAAAATCATGACTACATTGACTAGAAAGTTTTTATTATTAT TTGT
GTCGATCTAAGATGAACTCAACTTGAACTCGTATATAGAATAGCATTATAGTTCAGA TTT
TGTGGCCTCTTTTTTTATGGTCCAAGTTGAAAGTTATTTTTCATTAATTTATCCTAA TAGA
ATAAGCTTTCACAATCCGAAAGCAGTGTGCATCCAGTATGCAGAAAGGGGGAAAGGA G
ACTTGATGATGTGGGAGAAACTTACCACACATGTTCTTTCCCATCTCCATTTTGAAG TAT
CCATTGTCACCCCATTCAGCTCCCCACGAGTTCTTGATCAGCCAATATGGAGTACCA TT
ATCAACACCATATCCCACAGCAAGAACAGCATGGTTCACATCCTGTCAATGCAATAG AA
AATGTTAATGTAGTATTATGTCATTGTCTCGCATATTTAGTGAAGGTCGGACCATAT ACT
TTGTTGATGTCATTGATATTTAACAAAAGAAGAAAAAGAATATGTGCCTCTTCATTG CAG
CAAAATCAATCATGTCTCAGTGCCAGATTAGTTGGGTTGGGTGGGGGTATGAATCCT CT
GTGTACACTGCTCATCAGACCTTTTTATTCCAATACTACATAATCTGTATATTAAGG CAG
ATTAGGAGTTTCTAAAAGTAGAGACTATTTCAATCTTTCCTATCATCCAGCATCTTT TTTT
GGTTGACAAAGCATGGTATCTTTCCTATCAGTATTATGACACTCGAATTGTTAAAAA TCA
ATAAACACAGAGAGAGAGAGACACACACACACATTACGCATAACTTAAAGATAATTG CA
GACACCCTTCTCGACGAGTAATCAAGCACTGGGTTATAACTAACTACTGGGTACAAT TC
TTTCCCCTACGGTGAGTTACATGGAAAATGTGCTTCTTCATCGACAAAGAATAGCAG ATT
AAAAGGGCAATATCTCAAATTCATCGGAACTAAAAACTCACCTGGGGAGTGTTGCCA CA
TACGGTGCTGCTGTAAATTCCACCCTTGTACTGTTTGAAACCTTTTACCACCTGATA AGC
TACACTAACCGGTCTAATAAATGCAATCGCGTATTTTAGTTCATCTTCAGCACCCTG TAA
TTATAAATGTTTAATTGTAAGGAATCCAACCAATGCCCATTGACCTTTTCAGGTCAA ATA
ACGCATATTAAAAGATCCACTCAACTGAAAATCATCAACAATGAAACACTAGAACTA TGA
AAGGTTTTTCCCGTCGGAAAACTACAATAATCTTCCTTTAGTGTGTCAAATACAAGA AAT
G ATAAG CACTG ATGTAAG CAGTTAAAAG CTG ATC CAAG ACATTGCTGTTCCAG AGTAAA
GAGGTATCCTACCTTGGTAATATTAACAGAATCAACGACTTTAACAGCAACATTTTC TGA
CGAGAATTTGCATACACCAGCCTTTCCAGCATAAGGATATTCTTCTTCAGTGTCAAG AC
CACCACTGTATTTAATGTACTCAAAAGCCTGTGATGGAAGCCCGCCATGGCATCCAA AG
TTATTAAAAGCTCCAGCACAGTCCAAGAGCTGCTGTTCAGACAGAGAGATGTTCTTC CC AAATGCCTGGGCATATGCTGCCTCCAGAGCACCAGTAGTGCTGCCAAAACATCATGCG
CCGACTCCGTCATTTTTCAACTTCAGAAACCCGAATAGAATGGAAAGGATATAGAAT TTA
CATGTTGTTAAGAATAAGTAGTTTAGGCTTAACGCATAGTTGATTAACGATTGATAG GTA
GAATGCCATTCCAAACTGAAAATTTAGGAACCTAGTGACATGATCCTAACTACTCCT TAC
CTGAATGTCCAGCAAGACCCGCACTTGCCCTGCTTCTTCACTGGGCTTACTATCCCT GC
TTCCCTCCAGTCTTTCTGAAATGACACATTCGGAGATCAAAATTAGGAACAAGAGCT ACT
TTTGGTAACCAACAATTCGTTCACTTAGCACATGCTCTTCTAGGCAGCGGCTGAATG GA
AGGCAACTGTTCCTGAACTTACTACTACTTTTAGCTTGGAAAAGAAGAAAAAGTAAA TAT
ATATAGATAGACAATACACACACACGTTTTTCTAAACACACATATATACCAAATTTT TATT
TCATTCAGATACAGTGGATGCAGCAGTACTCTGCACCTCTTGACTTAAAATCCTACG TC
CAGCGTTGCTCCTAGATGCAACAGCTATCTATGTTGTCATCTCAAGAAATTCCATCA TCT
TGAAAGTAATAAAAGTTAAAACATAACAAAGTGATAAAAATCCAATGAAAGAAGAAT GAC
AGCAATATTAGGCTGCAAAATGGACAAGACTAAGCATTTATGAATTTTCCTTTTAGG AGA
ACATACAAGATAGAGGCACAAAGAAAGCACTGAAAGCTAGATTTCCAATAACAGGATTT
CCGGTTGGAGTGAATGGATGCAAATGGTAATGATTTTTGAGAAAGTATCATAAACAA CT
GCAGAAAAAGATATAATGGAGTTCTGATTGCAAATACCGTCTCTGGTAGGTTGACAT TA
GTGAGCTGAAGATCGCTCTTTGTGGTAGCGGAACAGTTTTGAGGAGCTCCTAGCCTT TC
TCTCCTAAACTCATCCCATGTTAGGTCAGTAAACTCTGCCAAGAAGCATAGTCATTG TCA
AGAG CAAAATACAG CAAG CAAAATATG CAGTG CCTTCTTGTTTCTTTATTTTGTTTGTTCT
TCTCCTTTACCTTGACCTTGGTTTTCATATTTTTATTTTCCCTTTCTTCACTCACGT ACAA
GAAATCTGCTAGGAGTTCTAATTAGAAGTAACAAAAACATTAATCTACTACATTTTG AAC
CACAAATCAAGTTTTTGGCGTCGTTGAAGTCTTAAGAAGGCATCCACCTTAGCATCC GC
ACCTGGGCACAACAAGGTTGAGGGAACAGAGGGTGGACGCGCAGCATCCACCCCAGC
ATCCGATGCTGAGAAGATCAAGATGAGGCGGACGCGAAGCATCCAACTCAACATTAA T
CCCTGAAGCTGATTCGGGAAAGCAAAAGGAAAACTTTGGCCCATAACTTTTGGACGC AA
TATATAAGCCAAAAACGGCTCTTTTAGGTCATCGAACACACTTTTTGAAGGGGATTC GA
CCTAGGGAGAGCAAGGAGCCGCCGTGGAGGCCGAATTTCATCTTCTTCCGCCAAACT T
AGTAATTTTTATGTTTCTTTGTATGATTTGTTGTTTGGCTACCATGTCTATGTGGAG CTAA
ACTTCAC GTTCTAATGTTCTG GTTCTTTCATGACTATTGTTATTCG AGTTG ATTTTC GTTT
CTTGATTTATCATATTAGTTTATTTATTCAATCCTGCGCTTAATTATTTGATTGCTT GATCA
CCAATTAAAACTATCTACGAATCTAGAATTGAACTCGAAAGTGTGAATTCTAGATTG CAT
ATAGGATTAAATAGAGCAAGTTCTTGAACCTGGGTATCGGGGAACGGATTTGCGGTT AG
GATAAACATATATACCCGATTGCCTTGCTTGGTTGATTTACACGAATTTCAAATGCG TTC
TTGTTAGTTCTAATTC CATAG ACATATTG G CGTTAG GTTAG CTTG AATAG AC GAGTAAGA
ACTCGAGAGATTCTTATGAGCAATATTAACACTGTCAACCAATAAACTAGATAAATT AGT
TAGTCAATTCAATTGAAGAATACAATAGGAATGTTAGATAACTCATAACCCTAGATC GTT
TTCATTACACTGATAAT AT AAAAATCAGCTCTTCCTTTGTTCAGAGTTCATT ATTT ATTTTC
TTTTTAGTTTAGTTACTTTTGCATCACTACTTTTGGGTTTAATCCTTGTTTAGATAA TTAAC
AAGTCCTCATGGGTTCGACACTCTATCTTATCACTTTATTACTTGACGACCGCATAT ACT
ATACAAGTCAACTTTATGTATCCACTCTATTCAGATCATTTCATGAACTATAAGTAA GAAG
AAAAACCAAAACGAAAAAGGGCAAATTGTCCATAAAGGCATATATTGTCCAGCCTAT AG
CTAATAGGAAACCATATAGTATACACAAAGGCAATTACCATTGACACCAAGTTTGTAGGA
AAGTCGTTGCTTGTTGTGAGACCTAATCATCTTCAGATTGTCCAAGTATATCTCAAA CCT
TTGCTTGATCTCCTCAACTGAGTCGTATCTCTTCCCATACCTTTTAAAGTTAACAGA TAT
CCCAAAAATAAATAATTTTTAAGTAAAAAGGAAAACAAAGCTTATTCATTCAAATAG GAG
GAAATTAGATGAATCGAACTGACCTGCGAACAAAGCGAACGAAGGAGAGAGCACGGC G
CGTTTGGCCGATGAGTTGGAGAATTCCATTCTCCAGCTCCTGCAAACCGTCGGATAC TA
CTACTTGCCTGATCGGATTATCATCGTCAAACGTCAACGCTCCGCCTTGTGCGGCGG C
GATTGAGGTCGCGATGAGTAATAGTAATAGTATAATCGAGGCGCGAGTCAT
SEQ 33
ATGAATC CTGAAAAGTTTACTCACAAG AC CAATG AG G CACTTG CTG AG G CACATG AACT
AGCTATATCAGCAGGGCATGCTCAATTTACCCCCTTACATATGGCACTGGCCTTAAT AT
CCGATCACAACGGTATTTTCCGGCAAGCTATTGTGAATGCTGCTGGTAGTGAAGAAA CA
GCTAATTCAGTTGAAAGGGTATTCAAACAAGCCATGAAGAAAATCCCTTCTCAAACA CC
AGCACCTGATCAAATCCCACCTAGCACATCACTGATTAAGGTGCTCCGACGAGCTCA GT
CGTTGCAAAAGTCTCGCAGAGACACCCATTTGGCAGTTGATCAGTTGATTTTAGGCC TT
CTAGAAGATTCCCAAATTGGTGATCTTTTAAAAGAAGCTGGGATTGGTGCAGCAAGA GT
G AAATCAG AAGTAG AG AAACTTAG G GG AAAAG ATGG CAAAAAG GTTG AAAGTG CTTCA
G GG G ACACTAATTTC CAAG CACTTAAGACTTATGGTCGTG ATCTTGTTG AACAAG CAGG
AAAACTTGATCCTGTGATCGGTAGGGATGAAGAAATTCGAAGAGTAATTCGGATTTT GT
CGAGGAGGACGAAGAATAATCCGGTGCTTATTGGTGAGCCTGGTGTTGGTAAAACAG C
AGTAGTTGAAGGGCTAGCACAAAGGATTGTTCGAGGCGATGTCCCGAGTAATTTGTC T
G ATGTTAGACTTATAG CATTGG ATATG GG G G CATTAATTG CTG G AG CAAAATATAG AGG
TG AATTTG AAGAG AG GTTGAAG G CAGTGTTAAAG G AAGTGG AAG AAG CAG AAG G GAAA
GTGATCCTTTTTATTGATGAGATTCACTTGGTTTTAGGTGCTGGTAGGACTGAAGGG TC
TATGGATGCTGCCAATTTGTTTAAGCCAATGCTTGCTAGGGGCCAATTAAGGTGCAT TG
GTGCAACAACTCTCGAGGAGTATAGGAAGTATGTCGAAAAGGATGCTGCGTTCGAAA G
GCGTTTCCAGCAGGTATACGTGGCTGAGCCTAGTGTTCCTGACACTATTAGTATCCT TC
GTGGGTTGAAGGAGAAGTATGAAGGGCATCATGGTGTCAAAATTCAAGATAGAGCTC TT
GTGGTGGCAGCCCAGCTTTCGGCTCGATACATTACAGGTATGTCCTTTTTTGGATTG TC
ATTGTATTTTATGAATTTTACCTTTGATCTTTAATCGAGTAAAGATGCCACTACAGG AATA
TAGCAATGTATGTAATGTTGAAATGTGATGTGTCACACGTTTGTATTGTGGTTGTCA AAA
CATTTCCTAAAATTTTGAGGAGATAGTCCCTTTCCTTTATGTCTATGCAGGATGGAT GTG
AATCTAGTTTTATACTTAATTTAGCTGAATCACGTCCCATTTGAATGATAAAGTTAT TTTC
TG CTTCATTGTG CTTTTCAAG GTG ATAAC CTCTAACCTTTG GTTTGTAGTTTCAG ACTTAT
AAAAGTATGATTGGTGCGTGCTCACCTTAATTGATTGGATGGGATTATGTGTTTGCT CTC
TATTAATAC GAATTTTCTTTAAAG CTTTTTCTCTC CCTTG CTATG G AG AATTG CTACTGTT
GTTTTGCGTATCATTTGCCAGTTTGCCATAATTTTGTGCATATAGGGATTACTAATC TGT
GAATTTACGTTCAGGTCGTCATTTGCCAGATAAGGCTATTGACCTAGTTGATGAGGC TT
GTGCAAATGTAAGAGTTCAACTTGATAGTCAACCTGAAGAAATTGATAATCTTGAAA GAA
AGAG GATTCAG CTAGAAGTTGAACTTCATG CACTTG AGAAG G AAAAG GACAAG G CTAG
CAAAGCTCGACTCGTTGAAGTAAGTATACATCCCGGAAATGCTTTGACCTATAATTC TAG
AACCTGTGTAGGAAATGTGGACAAATAACGTAATTACTATTTCAGGTGAGAAAAGAA CTT
G ATG ATTTGAG GG AC AAACTC CAG CCTTTG AC GATG AG GTATAAG AAAGAG AAG GAAA
GAATTGACGAGCTTCGCAGGCTCAAACAAAAGCGTGATGAACTCACGTATGCTTTAC AA
G AAG CTGAAAG GAG ATATG ATCTTG CTAGAG CAG CAGATCTTAGGTATGG G GCTATCC
AAGAAGTGGAAGCTGCTATAGCAAATCTCGAGAGTAGCACAGATGAGAGTACAATGT TA
ACTGAGACTGTTGGACCTGATCAAATCGCGGAAGTAGTCAGTCGGTGGACTGGTATT C
CTGTGTCAAGGCTTGGTCAGAATGAGAAAGACAAATTGATTGGTCTTGCTAATAGAT TG
CACCAAAGAGTGGTTGGGCAGGATGATGCAGTTAGAGCTGTTGCTGAGGCTGTATTA A
G GTCTAG AG CTG GGTTG G GAAG G CCACAACAACCAACTG GTTCATTC CTTTTCTTGG G
ACCAACTGGTGTTGGAAAAACTGAACTTGCTAAGGCTCTCGCTGAGCAGCTCTTTGA TG
ACGACAAGTTGATGGTCAGAATTGACATGTCCGAATACATGGAACAGCATTCTGTTG CC
AGGTTGATTGGTGCTCCACCAGGGTAAGGACCCTTTAACTATTGATAGGATAAAAGA AC
AAATCATACTTTTACGAGTAAACTGTATCTGCCATAATGAGATTGTGGATTGCACCT TTT
GTAGAACTCTGTAGCCTCATATTTGTCTAGGTACTTAATAGTTTTACGTCTGAAGTG ATG
AATG CTG AACATGTTATGTGTGTG CAGTTATGTTG G AC ATG AG GAAG GAG GACAACTCA
CTGAAGCTGTGAGGAGGCGCCCTTACAGTGTAGTGCTTTTTGACGAAGTGGAAAAAG C
TCATCCCACTGTATTCAATACCTTGCTCCAAGTGCTGGACGATGGACGATTAACAGA TG GCCAGGGTCGTACCGTTGATTTTACTAATACAGTCATCATTATGACCTCAAATCTAGGA
GCAGAGTATCTCTTGTCAGGATTAATGGGCAAGTGCACCATGGAGAAGGCCCGCGAT A
TG GTC ATG CAGG AG GTAAG CTAG AACAG CCTATTTTCTGCTAATTTTCTG AG CATTGTTT
CCTAGTTTACATCTTTATTTGAGGAAGGATTGTTCACATATATCTTTTTGTGACAGG TGA
GGAAGCAGTTTAAGCCTGAGTTATTGAACCGGCTAGATGAGATTGTAGTGTTTGATC CT
TTATCACACGAGCAGTTGAGGCAAGTATGCCGTCACCAACTGAAAGATGTAGCAAGC C
GTTTAGCTGAGAGGGGTATCGCCTTGGGCGTTACCGAGGCCGCGTTAGATGTCATAC T
TGCTCAGAGTTATGACCCTGTAAGTATCACCATCTGGTATTTCAACCTGACATTTCA TGG
TGATTAGACTAGGGTCTGAGTTGAGATACCAACTATGCAGATTTTTGCATTTATCTT GCT
GTGGCGGGTTACACTTGTTTTTTCAGTTGCTAATTTCACTTATTATGGAAAATTATT TGTA
GTTACATTTTAGGTGATCTAACATTCTAAAAATTATCTTAGAACCGTTGGCGTATAG AAG
CGAAATACTTTTGACAATTGATTGTGCTAACTTTTGTTACAATTACATCACAGGTTT ATGG
TGCAAGACCTATTAGAAGGTGGTTGGAGAAAAAAGTGGTAACTGAGTTATCCAAGAT GC
TCGTGAAAGAGGAGATTGATGAGAATTCTACCGTCTACGTCGATGCTGCATCCAGTG G
GAAAGATCTAAGCTACCGAGTGGAGAAAAATGGAGGGCTTGTCAATGCTGCCACTGG G
AAAAAATCTGATATATTGATTCAGCTCCCTAATGGAGTGAGGAGTGATGCTGCTCAA GC
AGTGAAAAAGATGAAGATTGAAGAAATAGTAGACGAA
SEQ 34
TCACTTG CTTTCAG GTATG ATACTAACAAG GAG ACATACTATG CCAGTAACAACAG GG C
TCGCCACACTTGTGCCAGATAATCGTTTACAACGTGTGCTGATTTTGGATCCCATAA TCT
CACGCCCATATGCAACAATGTCTGGCTTTACACGACCATAACTGTCAAGGCACAGTA CA
GTTAAGAATCTATGGTTTGATAGACGAGTGTAATTATTTAACTGTTATTAGAAAAGC AAG
ATCCTAAAATATTCAAAGAAAAAGAAAGAAAAAAAAATGAAGCAAAGAACCATAATT TTG
AACTTAATTCTTTTTCAAGAAAAAAAAGAAGCAACAATAACAACACCAATAACAAGCCCA
GTATTTTTCCACAAGTGGGGTCTGGAGAGGGTGGGAAGTACGTACCCTTACCCCTAC C
CTAGAAGGACAGAGAGCTTGTTTCCGATAGACCCTCGGCTGGAGAATGGATGACAAA A
ATAATGGCAACAATAAGGAATAACAACAAGATAAAAATACTGAAGCCAAGAAAGCAGCT
AAACTCTAGGTAATAATAGCAATCTATGAATAAAAGGATATCATACTAACACTGATG CTA
GCGAACTGGGAAAGACAAAGAGATACGTTCGACTACCTACTAGCCTTCTACCCTAAT TC
TCGACCTCCACACCCTCCTATCTAGGGTCATGTCCTCAGTCAACTCCAGTTGCGCCA TG
TGTTGTAACCTCGCCCCAAGACTTCTTAGGCCTGCCTCTACCCCTCCCGATACCCAT TG
TGGCTAACCTCTCGCACCTTCTAACTGGGGTTTCTATACTTCTCCTCTTAACATGCC CGA
ACCATCTCAACCTCGTCTCCCGCATCTTTTCCTCCACCGAAGCCACTCCCACCTTAT CC
CGAATGATTTCATTCTTAATACCTAGTATGCCCACACATCCATCTTAACATCCTCAT CTC
AACTACTTTTATCTTCTGGACATGAGTGTTCTTGACCGGCCGACACTCTGTGTCATA CAA
CATAGTTGGTCTAACTACAGCCTTGTAGAACTTACCTTTAAGTCTCATCGGCACATT TTT
ATTACACAAGACACCGGTAGCGAGCCTCCACTTCATCCATCCAGCTCTGATACTGTG TG
TGACATCCTCATCAATCTCCACATTACCTTGTATTATAGATCTAAGGTACTTAAAAC TACC
TCTCTTAGGGATAACTTGTGTATCAAGCTTCACCTCCATGTCCGCTTCCCGGGTAAC GT
TGTTGAACTTGCACTCCAAGTATTCCGTCTTGATCTTGCTCAACTTGAAACCCTTAG ACT
CTAGAGTCTGCCTCCAAACCTCCAGCCTCTCATTAACACCACCCCGACTCTCGTCAA TC
AGAACTATGTCATCAGCAAATAACATACACCATGGCACCTCTCCTTGAATGTGTTGC GT
CAATG CGTCCAAAAAAAAAAAG AAG CAAAGAG CTTAATTGTGACTTTTTTCTATTTCATG
TTTACGGTTCATCTTTCTTCCTTTCCTTTTTTCCTTAGAAGCTGAGTGGATTGTACA AGA
GGCATTCAACAGATGTCATGCTCCTATTCATCCATAAAGTTTTTGCCATTTTCACCC ATC
ATTTTCCACTCAGCAGAATTTTACTCGAAGCATCACAACCATGGATAGAATAAAGCT CAT
AGAATGCTCGTTTGTTTCACAAGAGCTGATTTTTAAAGGCATCTTTTTTAAATGAAG TTG
GTACATCCAACACACTCATTGACTTCCTATGTGGTCATATAGTAGAACACAACTTTA ATA C AG AG AAG GAG AG G G C AG AAAAATAAAG AATAC AG ACTATACTC C ATTG C AAAG AGTAA
TACATAGCAAAGAAAGGAGAAAGAGATACCCATGAGGAATCTCCCAGGTACTCATTC CA
CGTGAAGAAAATGAGGCTAAATGATTACTTTGATCAATGGCACCAACACCAATAACA TC
ACTTTGATCAGCAGGATTGTTAAGAGTACCATAAAGTGGTCCATCATTTCCAATAGC AGA
AACCATGATAATATTGTTGGCAGTAAGCTCCCAAACCTAGCGGAAAATATTGATTAC GTC
CATCAATATAAAGCAACTATGAGGAGAGACTCCAGAGAGTAAAGGTGTTAAGACAAG TA
GAAATAAGACTGAATATATGCATGCTAAGTTTAGCAAGCATGAGAAGAGTGAAGTTG AG
GTAAGATTAGATAAGATTCCCATACCTAAAATGCACACAATTCAGATATCTAGACTC ATT
TTCCATGAAATTAGTATGATCCGTGAAGATATCACATCAAATTTGAATAGAATTGGT TGT
AATGGAGGTGCGCTACAGAGGGTTATGTGATAAGAGGATACCTATCAAAGTTAAAGA CA
AGTTTATATTGGGGTAAATGTTTGGCTTTGTATTAAAGATATGAGCATCACCAAAAT GCA
TATGCAAGATGGGTATTATAAAAGTTTCACAAGATAAGAAATGATCAAATTTGACAG AAC
ATAAAGGATAAAATAAGAATGTCGTTTGAGATGGTCTCATCATGTCCTAAATAAATC TCC
AAAGGCACTGGTCCTTAGATGGAAACCATGATGATTGGAGGTGCTAAAAGAGATGTA TA
CCTAAAATCACATGGAAGGAAGTTGTCCCAAAAGACTTATAATCTCGTTGAATTCAT ACA
GACTCAAAACAGAACACAACAGAAGCAAAAGTCTGTTATACGCGATATCGACTATTA AG
AATCAAGGTGTAGTCATGCTAGTGCACTTACTTTAGGTCCAATGTCTACTAGGAATC TTT
TTAGTCTGTCTGCACTTCTGAAGTTTGTATGTCAGTAGAAAAAGAACCTCCAGATTT ATA
GATATCCAAACTACTGAATCTTTGATGACTCCAAATGGAGCAGAATGGATGGTGAGG AT
TCATTAGCCAACCCAACTAGCTTGGAATTAAGGAGTAATTATTCGTGTTGTTGTACA TCT
CATCAATATAAAGGTGAAAAGTTCTGCTAATGTTGTTTCAGGTCCTGCTGAAAGTAA TGT
TAATTTCAGTGAAATGACAGGCTTAAACAACTCCGAATCTCTTTTACAAAATTGAGT ACT
AGATATAATATACAACCCTTGTGTTTAAGACATCCATGACATAGTTCAGCTTGCAAA ATT
AATAGATCTCATGAAACAAACGCCCCATTAAGCTCAAGAAAGCCAAGTAAAATCCAT GC
ACTGCATAAGAAATATAAGATTACATGCTGCCAATTATAATCAAACTTCTAATACTT CCGA
G ACCACATATTATACAG AAACTTAG ACAATAAGG G GTTATG GAAC AACAG CAAG ATCAT
TTCAATGCCTATGCTGTAGACAAAATGCAATCCAGTATCATACCACATAAAAACAAT AAA
GATATAAACCAATAGATAAGTGACCTCACCTTTTCCACAAAAGGGAGATCCAAATAA TCA
GGTCCACCTATGCTCAAATTCAGAACATCCATGTTGGTTGCAATTGCGTAATTAAAT GCA
TCGAGAAACCACGATGTGTAAGAGACCTGCAATAAACTGAAGAGCCACTTCTTATAA TG
CTAAATTGGTCATTACAAGATTGATCTTTTTATTTCTAACTTTTTTATAGGTCGCCT AGCG
TTGTCCTTGTCTGTAACAGTAGCTTTAGTACATGAGTTAGTGTTATTTATGTATTTT CGTA
TTCCTTGACTTATGTGATTACTTGTCGTTGCTTTCGTTCCGGCCTTCTAATTGCAAT ACT
CAGTTTTAGTTTTGTTCCTTTGTATTTTTTGCTTCGGTTTTCTAATTGGTGTGCTTG TTGC
TGCTCTTCCTTTTATCTTTCCTAAACCAAGGGTCTTCCGGAAATAACCTCTGCCTTC TTG
AAGGTAGGGGTAAGGTCTGTGTATGTACTACCCTCCCTAGACCCCACTTGTGGGATT AC
ACTGGGTCTGTTGTTGTTGTTGTTGATAATGATGGTGTCAAAGCAAAACTTGTCTCG ACT
ATTCCAAGGATACCTGCAACCTCCCACTAGCACAGGTACCGGGTATCTCAACCCACC AA
GGCTTAGGCAGATGGGTAGATATCACCTAGCATTTTTTATCTAGGCAAGGATTTGAACC
ATAGTCTCCAAAATTTTAACCCACTTCATTGAACGCTACCCAACACCCTTGGGTGCT ACA
AGATTGTTCCTTTTTGTGTGAATAGACTCTCTTTCAAACCCCAACATCAAGGATTCA AAC
CCATCGAACCCATGATGTGCGTCTAACTCACACATCACTTGTTGCGCTCTTACCACT AC
ACCAAAGCCCTGGGGGTGAATACTCCATATCATCTTGATTGTCCTTACTTGCGGATA TG
GTTTGTAGCTACTGAAATCAATAGATTGCACAAAGCCAATGATAGGTAGCTTAATTG GAA
AAACTGAGGTCCAAGAACGATAAGGTCTTGGCCTCAAAATCTCCATGAAAGGAAAAT AG
TATTTAATGTGTGCCTTATGTAAGTAATTTTGTTTTCCAATTTACTATTAGCAACAC TGTT
ACTTGTTATAGTATCTCAAACAGCGCATACAGTTTTTATAATATTTCAAACTGCTTA TACC
TCCCCAATAGTGGGCTAATACTAAGTATGGGCTTCCTTGAAAACAAAATAGGAATTA TAG
ATTGCACATAATTCGCAGACAAGTTCCTGGCTTTTCTAAAACATAAAAGTAAACAAT GTC
CTCCCTCCCCAATCCCCTGAAAAATAGTTGCAATCTTATCACTAAAGTCATAATAAG ATG
GCAGAAGAAATATTATATGTTCAATAACATAGCATGTAACATGGACTCCACCACTAA ATC CAATCAGTGGGTTTGGCCGCTGGAAAGGGTATGAGAAAACTGTGTATATTTAGGTGCAT
ATTCTTCC AAG ATATTGTAG CTTAATG ATG AGAAGTTAAAG CTAG CACAAAATAAG GTGC
AGAAG CAGAACTTGTCATTTACAG AGACTAG G CAGTCTAAAGTATTTTTTCTTC CATTCC
AGAGGACTTTTCACTAAAAACTATGACTGCAAGAATTTGCTATATTAGGTTCACCAC TCA
TGAGGTGGATGTGGCACACTCTACTAGCAGAAAACTGGAAGGGAACGGGGGAAGGAT
CTTAACACATCAAGTATTTGCTTTGCTGCAATTAACAACGAAAGGACCGTTTGATCA TAG
GAATCATCATTAGCACTCAGCAAGAAGCAGACTTGTATAAAACATCAGTACAATAAA TTA
GAGGCAATAATCCAAGACATCAGATTGTTGAAGATCTTCAAGTCTCAGCTTACTTAA ACA
GTTTAAGAAAATAAAGCCCCGTCACCCCCCAAAGAAAAGGAATTGGAATACTCGTTC AA
AACAATCCATTACCTGTGCATCTGTAAATACATGGAAAGCATAGATTTCCGCATCTG GA
GCAAAACCGAGGCATTCTTCATCCTGACCAGCAATAACACCAGCTACAAATGTCCCG TG
TCCAACATTGTCATTCAATGTATCTTCGTTGGTCCAATTTGTGCGTTCCTGAAATTT AAG
GTACCAAGCCCACTGTCTGTTATTAATGACGTAAAGATGGTACGAAGATATATACAT TG
GCATATATGTAAATGTCAAGCCTCAAACCACATTGGATTCAACTAAAGCATTTCTTT GTT
ACAGAAAATATATTTTTAAACGACAAAAGACAAGATACCTTGATATTACGAAAATGT GGG
TGATCTGCACGGATGCCTGTATCAAAAATTGCCATTTTGACCTTAGCACCAGTATGC CC
TTTTGACCAAAGCTCATGTGCCCCAAAGAGGGATGTGACTCGAGATTTCTGCAAAAT AA
TCATACAGGACAGACTGTATATCAAGAAAAATACAGCGAACAACAACAATATGTTCA ATC
TAAAAAAGAAATAGAAAATAGAGCAGCAACATGAGACCCAGACACTCAAAGAATGCA GA
CCATGTTTTTCAAAAAGGAAGTATGCCCTGACAACCTTGAGGAAATAAGAATGACAA ATT
ATAAACCTATTAACACTCTCCGATCTGAATGTTACGCTCCAACGGCCAACCAAGTTA AA
GTGAGCCACTTAATGATCTAAGGATATATAATCCATTCTCGTTAGAGGTACTTTAGT GTC
TTG AAAG ACTAAAAAAACAAAAC ATG GTCATATATC CAGTG CAAAAGAGAATATTG GG G
CATAGCAGAGACAACTTGTGAAATTATATGGATCACAGGGTCGCACAAAGATTAAAC TT
TATAGTGATCAGTAAGGTGCAGCTTCTGTGTAATAATCAAGTTTCCCTTCATATTGC GTT
GAATTCAGTGTGTCATGAAAGATATAAAAGAATATTATCAATTGTAACTTCGTCACA GAG
AAGATAATCTCAGAAGTCATTTTCACAAGTTCGTGAAGTCGAATGCTTAGATTGTAG ATA
TATCCACGAGTTCTTCACTAGTCCCTGAATTAGTCACATATGTAACAAGCACTAGAA AGG
GACTGTTAGATAGTTACGGGAAAATAGCTAAATGTAAATACTTATATTTATTATAAG TGTC
CCACTTCGGGAAAACACAGGTATAGATATTATTACATTGTCAAGTGTGCCCATAAAA GG
AATCAGTTGTAAGATATTAGTCTTCAAGCATTCTCTAATCTTTCTCTTATTTTTCTC ACTAT
GGAGTCTCAGCATCATGACATATTAATGGAATAACAGACACTTGATTAGACCAAAAT AG
G AAAAG GACAAAGACAAAAG G GAACTGAAAG AG ATTAATTTCCTTTGAACATATAC CAT
GCAATAAAGTTGCAACTATCATATGTCATGAATGCAAAGAGAAGAGTTGCATACATT CCA
ATGATGTAAATTATCAGTAAACGCATACAAAATAAAACACAAATAATCAGTGGTCTT GCC
TGCATCAATAGATGTCTGCTCCAGCTAATTCTCATTATACTAGTGTTGGCCACCGCA TAG
TTTTG ACCTTCACTG AAG G ACATAG CAGTAAAAATCTTTC CTGG CCTCTTCTTC CCATTG
GCAAAAGCCCCATTCTTCTCACTCTTCTCTTCAAGAACTATCCTTTGATAGCTCAAA TCC
AACGAAACGTCTTTTACAAGATTCATTTTTCTGAACTTTTCTAGCAAGAGTTCTTTC ATTG
ACTCGTCGATTTCCACCAATCCAAAGTCAGTAGGAAATCTCGCAGCCGGATTTTTCC GC
TCAATCCATTGCCAACCCTTAAATTTCAAGTTGTTTTGAAGATAATTCCAGTGATCC TCA
GGTTCCTTATAATGATAGAATCGAACAATATAATTTCTGCTATCAGATTGTTGCTTC TGG
TCATGTTGGCACTCATCGGAGCTACTAGAAATTAATGGCTCAGACTCTATTGGTGGG TT
GAAGCGGATGAGTGTATATACCGGAAGGAAGGGGACAAGTGAGAGGGTGAAGAATGA
TTTCTTAG GAG CTTCAG GCAT
SEQ 35
TCATATTGAAGCGACCAAGTCTTCAGTCTCAGTCGTCTGCTGAGAAAGGGTGCCTCCAA TCCACCTCTTGAGCATCTCTAAGGCAACTTTAGGTTGATCCATTGGAACCATGTGTCCA GCATCGTGAACCTGTTATAAGACACCAAAACAGTTAGCTCAAACATCCATCAGTAGAATT
TGAACAATAACATCGACAAAGAAAGGCACCTTCAGGAAACTCAGAGGCCCATGGCTT TT
CAACAATCCAGCTTCAGAACTGTCAACTTCAAAAGGAACATCGGGAGATGCTACAAA CT
CTTTCTGACCACTCCATTCCATAGCCTGAACCCATCTTGAGTTACCTGGTAAAAGAG AG
GCGTTATAATATCCGAAATATTTATGTGAAAAGTTTCCATCATTAGGCTTAGAGTTG AGT
CAAAGCTTACCAAGCCAGTTGCAAATAAGATCATATTCTCCAGCATAAACAAGCAAC TTT
ATTCCATCCTCGAGCAAGGTTGGAATGCCAGCCTCAAGATTCCTCATCCAATCAACA AG
CATGGCCTGGTACACAGTAGTGCTGCATGAGACAAACTCTATATCCTCAACTCCAAG AG
CCTGCTTAACAGAGTGCATATTCAGCAATTTCTCCATGTTTGAGAAGTCATAGCAGA GT
GCTCCAACGCATTTCTTTCTGATGTCGTAATGCTGCAGGTGAAAGCTCAATGGATCA GA
AATATGGTTAATCAGTCATTTGTTCCAAACTTTGGAAGGCATGCCAATGACAATGTG ACC
TCTTGAGTAATTTAACATATTCAACATGAAATGATATGGAGTAGCGATTTAGAAGAA ATA
G ATTTCTG G G ATCATTTCTACTCTTTCTG AG G CTAGTAACACCTATTTCTCTACGAAGTA
CAGAATAAGTGTAATAGGCACTAGAATTAGTAGAAAACGGGAAACAGACAGAAAGGGCT
GATAACTTACATTGATGTCAGCCCCAGCACGTGCACGAACAGCAGAGAATATAGAAT TG
CAAACAAAATAGGCAGCCAAGCAAGAGATTTTCCCATCAGTACCTAAAAGAATAAAAAA
GACAGAGAAACTGAGAAGCAAAACAAAGTACAGAGAATTGATTTGCTGTGGTCAAAG AA
CATCCATTTACTTCATCAGCTCCTTCCCCTTTTCTTTTTCACCCAGGGAAAGCCCGA ATG
AGTCAAATGATATGGAGGAAAGAAAGATAGTAAACAGTAAATTAAATAATGTACCAC AAA
GGTTTATTGCAACTTCACAAACTGGAAGTATTTTGTTGATACGATCATGATCAGACT TTG
AAATTAATCCCATGTCCAATGCATAGTCAGTATACGCAGCGTATTGTATTTTGGGAT CTG
TAAGCCCATTCCCAATGGCAAATCCCTGTTCAAATACTTTCAATGAAGTAAAGACAC ATG
ATTAAGGAAATAAGAATTCAATAACTGGGAAAATGAGGTACCTTTAAGTTTATATGT ATT
CCTTCTTTAGCCTTGTTTCCCTTGTGTACTCTAGCAGCAAAAGCAGGAATATAGTGC CC
AGCATATGATTCTCCAGTTATGTAGAAGTCATTCTTTACAAGCTCAGGATGCTCTTC AAA
GAAAGCCTACACCATTATTAATGATCATACCAACACAAACAAGTCAGGATCATATTA TCT
CTGTTACGCATCAATTTAGAAAAATGCTAACTGTGTCACATAAAAGAACGAATCTGA AAT
AGCCAATGTGTCACAAATGCTCAAGAGAATTCATACCACATTCGGCCTTCAGAATTT GC
AAAGCAGAAAAATACAACAATAAAAGCAACATATAACAATATTTCCAACTAGAAATC TGT
TGAAAAATTCACGTTCCGAATAGGTAATGTATAGTCTTAAGGCGGCTAAGCCAAGTT CT
GCTAAATATTCTGGTTTAAAAGCTGTTATACATGCTAACAAAATGCATCATGAGGGA AAC
AACTG AC CAACAAGTTACC CAG CCAAAATTCAG GG ATCAG CTG CAGTTTGTAGTAAAAA
ACAG G AAAACCAG CCCATGAAG AAG G GTATTGAATACTG CAAAAAG GTTGAG GG AC AG
GGTTTTCTAGCAATGTGATCACATCTTTTGCCCCTAATGCATTTGGCAGATAAATGG AGT
CAAAATATTTTAACGCCTCCATTTTTGTTGGGATCAGAACCCAGGCATCAACCAACT ATC
ATTTCATAAGCACAATATAAGACTCAAGTCCTAGTATATGACATCTCTCCAATTATC TATA
TG GTAAAAGTATTAAGTG ACCATGTTTCTTTTG ACAAG AGTG G GTTG CTCTAGTG GTGA
GCACCCTCCACTTCCAACCAAGAGGTTGTGAGTTCGAGTCACCCCAAGAGCAAGGTG G
GGAGTTCTTGGAGGGAGGGAGCCTAGGGTCTATCGGAAACAGCCTCTCTACCCCAGG
GTAGGGGTAAGGTCTGCGTACACACTACCCTCTCCTGACCCCACTAGTGGGATTATA CT
GGGTTGTTGTTGTTGTTCTTGTTGTTTCTGTAAGGCTCGAGTTCTAGCATTAGAAAT CAG
CCTTTTGAGCTCCTGTAGACCTATTGTACTGTACCCCGTCTTTGTATCACATGTACA CAG
GTGATCAACACACAGACAAGAGAACTGACAAGCAAACCGCACTCTGAGAGTCCGAGC A
ATTACTTGAAAGGGATCCCAGACAACCCTTACTGGCACCAGTACTTTAATTTGTCAC CA
CTCACCAGGCGGCCAATTCTAGACAGGTCAGCAAGGCGATAAACAGAGGTGTCTTAA T
CATCACAATATCATCCCG CACACAG GAGTGG G GAT CAG CAAACACTTCTTTT AT AACTG
AGAAACTATTGTTTCCAGCATTGAAACTGTGGATAATGAGCCTGCCCCTGTACTTTC TTC
AATTTTTTTCAGAACAGGATTCAAACTACTGACATGCGCCTACCACACATCCCATGT TCG
AAATTGAAACCAAAGCTCTGGGGCACTGGAGATTAGCAAATAGGCTAGGTGATTATA AA
GATATCATTCAAGAGTTCTCCTACTAATTCACGGTACTTTCTACAAACCCCTCCCTC CTT
CCACAGTTGATCACAATAAGCTTGACTACTGACGTATATGTCAATACCACAGCCTCT GT G AG ATAG AAAAG CTTCCATAATG ACTACTTG AAAG G AG ACCAAG GG G GTTTAG AAAGTA
TTTATCATTCTGTAAG CTACTGCAACAATAATG ATTTTACTTAACG G AAAG G AATGC CAT
AAATGAATTTGTATTCTTGAGGATGTTCACAACCAGGACTGAAGTTGCTTCCACCCC TA
GCTACATTCTTTATCCGTATTAAGGAAAAGTTTACCATCCTTTTTTTCCAGGTGAAA TGTT
TTATTGGCCTTTAGAAGCAGGACAAATTGTCCAGTGCAAGCCTCATGAATAACATAC AT
GAAACTAGGAATTGATAGGTGAAGAAAATATAGGAAACCATCCATATCATTAAGTTA AAC
AATTGACTCTCTGTCATCATACGACAATGATTACAACGGTTGATTCCAGAGGAATAA ATA
TGGTTCCAAGTTGCTTTAGGGGTTTAATTTCACACAGAACAAACCTGTAGGAAGTCA TA
CAAGTCGTCGCTAACACCTGCTTCACTGTGACGGATGTCATGTCTGTCAGAACTGTA AC
TAAAGCCAGTACCTGTAGGTTGGTCCACATAGATAAGGTTTGATACCTGCAGAAATA AC
ACGTCAACATCATTTCTTCAGGNCGTCAACATCATTTCTCTAGGTTTGGAAAATCTA TTA
AGGTATTCTAGTGTCCTTGTAGGAGAAAAAGGAAAATCGATAATAAAAATGAAACAT CTA
CTTTACAAG GAACAAATGTGG AACACAAG G CAAACTTG ACACTCTAG GAGTCAG CAATA
AAAGACCCAAACCACAAAAACCAAAACTCAAGATCTTATGGAACATAAAGCACTTTC CTC
TCTG CATTCTTGAATTG CCGTG CAG GTAATTTTTTTCAGTAATGAGAAAAAAG AACATTA
AAACAG G CAG AAG CATGACATG G AATTAGG G AAGTG CAGTATCAGAG GTCTAATG AAA
AAAATATG GCTGACATGTTTCCTATG CAAAG CATTAAG ATTTCAGTAAAACACAAAG CTC
CTTCCG G GAAAAAAAGTTTTCTTG CAGTCTG GTT GAT GAT AACTACATAAATG CTG AAAG
TGTAACTATCTAAAGGCTAAAAAGGACATTTCTCCTGACAGATGTATCATGTGCAAG AAA
GAAATGGTCCCCACAGATTTTCTTCACATTGATGAAGTATCAATTTGTGCCTGTGAA TTG
GCATTGCATTCATTTCAGGTTGGTTACTGGTACATCTCGAAGAAGGATAATGGTGAA GC
TTGGATTATCAAAGCTTAATCACAAACCAGTATCCTTATACTTATGATTCTTCGATT TTAC
ACAACATTGAACTAGATATAGACATGTTATTAACTGTTTTATCTTGATTTCCTTTTC AAGT
AATCTTTTCAAAACTAAAAATGATAATAACAAAGAGGAACATCCACAACGAACAAGC TTT
TTCG CTAATTGG CACTTTACAAATAAATTAG CAGG CTG CAAAACTACTTATCATG AAAAC
GACTGGATAAAGGACATGCAAAAGATGTTGCCAATGCAAAATCTAATAATCTATACG CC
CATGCACCATTATACTCTTAACTTTCATGTACCATTGAGTAAAAGCAAGAGAAAGTC TTA
TACTATCGAGCTGAATCATCTACTAAAGAGCAAAGAAGGTAAATTGCTACTATTGCT TAA
GCAAAGCTGTGTTTACATACCACAATAAACTGCTGTTAAATAGCATAAACCATTAAG ACT
ATAAGGTGGTCTTAAAGATAGTCAATTACCTTGTCCCACCCATATTCATTCCGCACA AGT
GACAAATTATTTGAAATAGAG AAAG GTCCATTTTCATAG AAAAG GG CCAACTCACTG CT
GCAACCAGGCCCTCCACTCAACCAGATGACAACAGGATCGTCCTTACTACCGCGTGA T
TCAAAGAAGAAATAGAACAACCTACAAAATGAAAGATAACACAGTTTCAAGCTAAAT AAG
AG C C C AAC C AAC AATAAG AAG AAG CTAGTTG ATAC GTTAG AC AG G AAG AC G AAG AT ATA
AAGCCTTATTACTCCGGAGATGTATATTTTGGTAAATGCTCACGAAAGACAAATATT TAA
ATGAAACAATGTTGCAGCATTCCTTTCAATTCTTACAACTGATTTGGAAACTTCTTA AGC
GCACTGATTGAGTTAAACAGATAAGTGGTCAAACTGTAACAATCTGATGGTTCCCCA CA
AAAGCTACATGTAATGACATTTGAAGGTAAATAAATCAAGCAGACAGAATATTTCTC CAT
TAATCAACCAATCAAATAATTCCAAATTAATTGGGTTGACTAAATGAATACTCTTTA ACAA
TTCTGCTATATTTAAATCCAATTTGTTGCAATATTAAATAAATATCCTTTGAATTCA TTTCC
ACACCAGTTACAACCGTTTTATTCGAATTTCCGAAGTGGAAAGTGAATTTGTAGGAG GA
TAGAG GG G AATG CAG G GGTG CACTG CACG ATAGAAG ACTACTG AGAACTTGG CTACAT
TTTGTAGACAACAAAAACCTATTACCCCAGTACACCACAATGCAATGCTGAATCCGA CA
GTATTTTAACATAAAAGAGAAAAGAAAAAATTGAAACTATAATTTGGTTGGATGCCT ATG
ATAATGCAAATCGGAACAAGAATCACCAAAAACCAGGAAGTGTTTGAAGCAACAAAT AT
GGTACTAGTCTGTTCCAATTTGATAAACTTTCTATTTTTGGGACGTCCGGGAACATT TGA
CACCAGTTTGCTAGTATCTTAACAATTTTCAAGATACTCTAATTCATTTCTTTGAAA AGAA
TAGACATTAGGCCTAACTCAACATCAAAAGCTAGCTCATGAGGTGATTGATTGTTCA TTC
AATATATAAGGAGACAACAGTCCACTCACTCCACCAATTTGGGACACTTTAATATTT CCA
CACGACGAGGCCGTGGACAACTGGAGTGTGGACAGCATAACTTGCGACCCCAACATG
GAGAAACACAATGTCGACCTTACCCTACCACAACCTCAAAAACTAGCTCACGAGGTG AG GAACGGAGGATTGCTTGATACCATAGGAGACAACACTCCATTCCCTCAACCAATGTGAC
ACTTCAACATTATGTGTTATTTTTATCGAGTTACATTTTCTACAGTTATCGAAATAT GTTTC
TTACAATACTACATCGTACATACTCTAATAATTAAATCAAATCAACATCTTAAATTC CGTG
TTC AACTAAAC ACTAC C ATAAAAATC G AG AC AG AG ATTG AAAAAAAAACTTAAC CTAG G G
TTTGAAAAGGTACCTAGCAGCATGAGAATGCTTAATCTTATAATAACCGGCATGATG CC
CCAAATCTTCAAAGGATATAACACTCGAATTCGTCAAATTAGCGAAATTAAATCGCT TCT
CGACGATTCTAGAAGCAGCGGTGGGAAATGGATCCCGATCGACAATGTTATCGGATT C
TTTCGGGAACAAATTTAGCTCGTGTATCAACTTCTCAGCTTGCTTCGATGCTAACTT CGA
AG ATATTG AAAC CTTC G C G AATG AAG AAG G AG AAAAAG C AAG G AG AAG AAC AAG AG AG
AGAAAAAGGGAAAGTGAAAGCTTCATTTGCGCCAT
SEQ 36
TCAAGAAGAAGGGGTCTTCTTTCCCTCATTTCCGAAGGTTCCAATAGGTTTGGCAAGAA
TGTGCTGCATTGCTTTGACTCCTAGTGGATTGTTCTTGCTCTGTACCAACAACATTT AAA
ACACAATCACGTAAGTAAATGAAACAACCATATCCTTCTGAACAAAACTGTTAAAAG ATA
AACCTTGGCCTAACTAAAGGGCAGCACGGGGCACTAAGCTCCCGCTATGCGCGGAGT
C C G GAG AAG G G C C G G ACTAC AAG G AC CTATTGTAC G C AAACTTAC C CTG C ATTTTTG C
AAGAGAATGTTTCTAAAGCTCGAACATATGATAGCTACTTTACCAGTTAAACCTTGG CCT
AACTCAAACAAAAATCTAG CTCATGAGATGAG AATTG CCAAAGAAC CCATTC CCTTAATC
AATGTGCGACAATCTAACAATAACAAGCAGAGATCACTTACAATTGAGCAGGTGCCT GC
TTTAACATTG CAG ACAG G ATAATCATGTGG G CAACAACTGTTATG GTCTTTACAGCAAGT
AGCTCCTTCCATGGGACAACAACCCCAAGCAAAACAGTAGTTATAATACTTGTAGAC AC
AGCAGCACGTTGTTCCGGCTGGGCATTCGTTATAATCATCACATTGAGTGGGTGGCT TG
ACTGGTGATGGAGGAGATGGAGCTGGTTTTGGGGGGTTTTGGCCTGTCTTTACAGGG T
AAG AAG CAATTGTAG CAATACCACACAAACCTTTG G G GTTG CCAATGTTTCG CTG CATC
CTGAGGTAACCATTTTCTCCCCACGAAGCACCCCATGAGTTCCTCACGATCCAATAA TC
CATGCCATTTTCACTACCATATCCTACTGCAACCACACCATGGTCCACTGCTGCACC AC
ATTTTC C G GTAAAG ATAC C CTG GAG CAT AG G C G G ATC C AAG ATTTAAACTTAATAG GAT
CAACATTTAAATTTTTTAATACTGAACTCATTGTGACTTTGAAAATACAGAAATATT TGTT
GAATCCGTGTAAATACTGGCTAATTCGATCAAAAAGATAACAACTTTATTGACAATA GGC
AAATCTGAAGTAATACCGATTTATAGTGCTGGAAGTCTTTGCCGCCAGCTTCGATAG CA
ACGCTGACGGGTTGACCCGCGACGGCCTTTTTCAGTGCCTTTTCATCATTAGCAGGA A
CATCTTCATACCCGTCGATGGTGACAACCTTGGCATTTTTCTATAATTGTTTCAAAC AAA
ATATTAATAGTCATAACTTGTATTTAGTTCATTTCCAATAATCCCTTTTAAGACAGA GAAC
ACCATACTGACCCTTGCTTGATCGCATTTTCCATCTTTGGCTTTGTAGGGGTAGTCT TCT
TCAGTGTCTATTCCTCCATTTTGAATGACGAATTTAAAGGCATCGTCCATTAGACCC CCT
TGGCAGCCTTGGTTATCGGCAGTATCACAATCTACCAGCTCTTGTTCAGATAACGAG AT
CAGATTACCTGTCATTATCTTGTTTACTGCTTCAATTGAAGCAACTGCTGAGAAAGC CCA
GCAACTCCCTATTTCATTTCCCAATTGTAACACAATTAAGACGAAAATCATTAGTTG CTTT
ACGTAATATAAAAATATCATTATTTCTTCAGCATTTATTTATGCATAAAATGGAAGA TAAT
CCAATCAATCAAG GCTG CATAATTTG CCAAAACAATTG CACAAAAATACATACTGATG AA
CATAGTGTTTTGTTCAAGGGAAAAAACATTACTTATATTGCCATAATTAGATGCGGA AAT
TG GAG CAAAAGTCCTG CTAACCTAATTTATCATACTTAACAACAACAATAACAAATC CAC
TGAAATCCCATCGTGTGGAATCTCCTTACCCCATCTAGATAAAGTATAGAGACACTT TCT
G AC AG AC C C CAAAAG ATAC AAAG GTAAAG AG G AAAAAAG G C C CAC G AAAG GTAAAAAG
CAGGGAATTAAATAACAATAAACAGCAATACCAACAAAGCAATCAGAAAATTTAAAA ATT
AAACATG CTTAACCTTAG CTAAAATTAACAATAATGAAGG GTCAACTG AACTTG CCATTG
ATCTTAGGTTAAAACTTATGACACTGTGATCATTTGAATCCCTTACCCTAGCAAACT ACT
AAATTTTCCGAATTTCCATCAAAGCCATCAGTAAAGAAAGCATTGACAAATGTTAAA CAG AAAATAAGTATTAACAAAAAGGAAAACAAAAACAAGTAAGAGGTGCAGAGATGAAGAAG
GCAGAGTACGAAAGGAAAATACCACATTGTCCTTGATTTTTGACGTCAACAAGAACA CC
TTTCTTCCTCCAGTCAACGGAATCCGGCAAACTATCTCCGACCTTGGGGGCATAACG GT
CACTTTGGGTATACGACAACCTACTACGACCATCGGGCTTAGTACCCAAGTAGATGG AC
TTGTACTC CTCGTTG GTCAAATCTG CAAACTGAGTC AAACC CAG CTTGTAACTTTTTTCA
GGCGCAGAGTTCTGTTCATCGATGTATCTAAGGTTGTCCTTAAAGATCTGGAACCGC TT
GTCCTTTTCTCCTAAGGCGTTATACACTTTTTTATGTTCAACTAGCCAAGATTCATA CAAA
GACACGATTTCATCGTCTGTTCGCCAGACCGTTGACTCGCCGTTCGTGTGATGTTTT TC
GTTGTAGCTTATAATGGACATGTCCTCCGCCGATGATGTTACGGCGGAGAACATGAG C
ATTACGAGTATGGAGATGGAGAGGGTGGAAGTATGAATCGCCAT
SEQ 37
TC AG G AAG G C G AAAC AG C AAG AG G ATTC AAC AAAGTG C C C AC G AAC AAAAC AAC AC C A
GCAGTCTCATCTTTCACAAGGAACATGAAGGGATGATCAGCAACAAAGTCTATCTCC TC
TTCAACCTTCATCATCGAGCACCCGAACATCATTGTAGCAACAGTAACAGCTGGAGC TA
CTGGAGCCTCCTCATTTACTTCGATAAAGGCCTTGTGAAAAGCTTTTGCAGCTGGAG CT
TCTGCACCTTCCTCATTTACTTCAATAAAGGCCTTGTGAAAAACGTTTGCAACTGCC AGA
GGATAATTCTCGCCCACCATCTCAGTGAGACCACCTTTAAAAGGTAATGTGAGCTCG AG
TCCTTTTAGAACTTCTAAAGCTTCAATCCCCAAAGATATTTTGAACTTAGGGATAAG AAA
CTCGTGCACTTTAACTTTTTCATATGGAACATGGCGATCCAAAAATCCAGGTTCCGA ACT
AATTTTCTCCAGTAAAGTTGGTAATCCATCACGGGCATTTGGGAGATACACATACAT GTT
GAGAAATCGCTTGTCCTCGCCCTGTTTATAACGAAGCCTTAACACTTGGAAACCATC AA
AGGCCTTCACGTATTGCCTTTTTTTGCTGGTCATTAAGGGTGCTTGAACAGATCCTC CAT
TAAG G AG ATG G AACTCATG GTCTTTTGTATCTGAAGCATTCAACTTTTCAGTCCATG CTC
CTTTG AAATATAGTG CATTCG CTAAG ATCAG ACTTGTACCG CTATTG ACTG CAACAG GA
GGAAGAATTTGTTTGATAAGACCATTCGTTTTCTCTTCAGCCCACTTATTGACTTCA CCA
GTAACCTCATCACCCTACATAAGAAAAATTAAAACAAACAGAGAACATAGAATCAGC AG
GCTAGTAACAGGATAGAGTTGAAAGTAAAAGAAACAAAAAAGATGTCCTATATGACA TTA
GATTCGTTTGTTTGTAATCTTATTCGAAAAGCATAATAAACGTTTCTAATGTGCTTA CTAA
TTTGAAAATATTCTGTTTACCAACATGCCCATAATATTTTGTTATACATTATAATAC TCCCT
TTGTTTCGTACTTCGAGGGTCAAACTTTTCAATTTTAACCGTGAATTCGAACATGAA ATTT
TAATTTTTGACATAAAAGTCACATATTTAGAGACTGTAAAAGTAATATAAGTCATTT ATAA
GGAATATAAGAAAAATCGCAGTCAAGAAAAACTCGACTCTCGAAATCCAAAAGGTGT CA
CATAAATTGGGATGGAGGGAGTATCATGTAATTTCAAACATCAATTTCTTTTCTAGT GAG
TG AAG G ATTAAG ACTAAC CTTGTTC C G AAAAT CAACAG AAG C C G AAG CAG C CTTATAAA
CATTGTCCATAACCTGTTTGAAAGAATGCTTAAAAGACAAAGATTGGTCAACCCAGG CC
CAATTAGTGACAGACAAACGAGGACCTCCCATGGGGCTGCCATCGGCTAAGACGTCG G
TGATGACCCGAGAATAAACAGAGTTAAGTTCTTCAACAGAGTTGAATTTGAGAAAAG CC
AACAGTTGATCCAATGTGGAACCACTGGAGCCTGCTGCAATAAGAGCAAAAATTATT TG
AATGGAGACCGGGGAAAACACCATGTTTGCGTTGTTGGACTCGTCCTCGTCGGCTTT A
AACTTGCTGAAGAATACATGCTTTGAAAGAATCAATGGAACATCCATCATTTCTGAT GAG
GGAAACTAAAATTGAAACAGAGAACGTAAATCCATAAGATGAGGGAGACTGAAACTG AA
ACGGAGAACAGAAATCCATAATTTCTGATGAGGGGAACTGAAACTGAAACTGAAACG GA
GAACGGAAATCCATAATTTCTGACGAGGGAAACTAAGGTACCTTCACAGTTGAACTT GA
CAGCATGGAGAGAATGCTGATGCATATAGAACTAACAGTCATGGTAGGCGACCACGA G
TCATACAGAATATCTGCAAGACAGAAAATAGTTGTCAACTGCTGCTTTTATTAGATG CAT
TCTAACTTTTTTCATGAGTCCGGTAGAAAATCTTAATTATTAAAAAAAAAAACTTAT TCTG
GAAGATGAGAGGAATAACTGTGAATGAAAGGCGTTTTGCTTTCACTTTATTAGGCAG GT
AACTATAG AAAGTTTAAG AG ATTAATAAG C AAG C CAT AG G C ATAG AG CAG AAAC C AAAA TTATGCAGGAAAAGAAACTAGTATGATCTCTGCAATTCTTAGATGAAATGACAAATTTGA
C GAG AC C C AC AG AC AAAC AAAAAAC AG ATC C AAAGTTTATTCTACTAG G G CTAAG AGTT
CTGAATCAGTAAACAACTGTGTTTTTTAATCATCCGTCAACCATCATTGACTATTAC TCCA
TAATTAGATACGAATCAACAAATATTATCGGGACATGAAAGGGGAAAAGAATCAATA ACA
AAAAC AC C AAG AC C AAGTAAAAAG CTC AC G G ATATAG C G AG AC AATCTAC C ATC ATAC C
CATCATCAACTAAGTTGGCATAGAGCATTTCCAAAACTATAATACCTGTCATCTAAT CTTT
ATTAAGAGTTCATATCACAAATCATCCATAATTGCCAAGGATTTTCGTTAAATGCTT CATC
TAATGCATTCTAAATCAAGCCATTAAGCTCGAAGAGTAACCAAATAAATAAATTTCT GTT
GGTGATAAGAAATAGAGGAGACTTGAGCTGTTGAATAAATACCTAAACAAATGTGCC CG
TCCCTATAGATATCAGGGTGCAAAGGAGCTGGAGGCAAAAATATCACCTGCATCAAT GT
AGTCAGCCCTAATTAGTATGATAGGAATTTCAATCAAGAACTTTAACAAGAAACCAC ATC
TTATATCCACCTGGGGAGCTTTAATAGGATAATGTTCAGGGAATTCGGCTTGAAGCT GA
TAAGTTTCATTAGCATACAGCGTCCCAGGAGCACCATTCACTTCAATTAACCACCTT TTC
AAAAATCAAAAGTTCAGTAATAATAGCATGGGACTTCGAATTCAATATTAGAAATTT ATTG
GAAAACGACAAAAAACTTAACAACAAACAGGAAAAATGAAGACCTTTGAAGATAATC GG
AGGGTTCAAGATTGAAGCCAGACGGGGGATTGACCTGCCAGTTCCCTAGCTCTGTAT G
GAGTCGATTGCATGCCAT
SEQ 38
CTGAAAGTTGGTTCCTTTTTTTCTTCTCTTATTTATTCATGCAATAAAGCATCTCCAAAC T
TCTATTCTTATTCATTCTCTCTGCTTTCTTGCTTCATCGAACTGGTGAGTAGTTGTT TTGT
TTCCTTTTTCTTCTATTTAAGAAAAAATTAACTCTCTTTTTGTCGATATATTTTATC CTTTTT
TTTTCTTTTTTCTTTTTTTGTTTCTGTGGGTATTAGAGGTTTTGTGCTTCATTTCAT ATATT
TGTCTCATGATTTTACTACTTTCAAGGTTGGGCTTTTTCCTTAGCAAGAAAGAACTA TTTC
TGTTTATGTTTCATTTTTCTTTTGGACCTTGGTTTTCTGTTCTCGAGGATTGTATCT GTTA
AAAATTGAAGTACTTTTTTTTCCCTTCATCTTTTTAATTGATGTTCTGTTTAGTGTT ATTTT
CACCTTTTATGGCATTTAGCAATGTTTGTGCTTTGACGGGTTGCTGTTATAAACATA AAT
TTTGGGAAAATAATTACCAGGTAAACTTGTTATTATGCAAGTGCAATTTGTGTGCGT GTG
GTG GTTTTGTTG CTAG G GAG CAAGG CATGTG ATTAGTG ATAAG AG G GTTAAAAGG G GA
GTAG ATAAACAAAG CTCCACTTTTTAGG CTATTGTTTTTACTTG G GTTCTTC CATTTTTTA
TTATAGCTTGATGAAGTAATATGTAGCTTATAAATTTCCCAGAATAAGAATCATCTC TTGC
CTTAGAAAAAATAATTTACCAGTAAGAGCAGAATATATGGTAGGATTCATCCACTCA ACT
CCAATTAGTTTGTGACTGAGGCAAAGTTGATTGAGTGATCGATTGAGTTTAGTCTCA TTA
GATTGTCATTTATCCATTAAAAACATGCAGCAGGCATAACATGAGTGATTTGATCTT CTG
AGCATTTTCTCTTGTTTGTTGAATTTAATATATCTTCACTAATTGCTTGGCCTAAAT TTTAT
TAACTCAAAGTGATGATTTGCCTAGGTCAATATGGGAGCAAAAGCTTTTCTTGTCAC CAT
TTTACTCTCATCGCTGTTATTTCCTTTGGCCTTGTCTACGTCAAATGATGGCTTGGT TAG
AATTGGACTGAAAAAGATAAAATTTGATCAAAACAATCGACTTGCTGCACGCGTCGA GT
CCAAGGAGGGCGAGGCTGTGAGAGCCTCTATTAGGAAGTATAATAACTTCCATGGTA AT
CTTG GG G CCTCTG AG G ATACAGACATTGTAGC ACTG AAG AACTATATG G ATG CTCAGTA
CTTTGGGGAGATTGGTATAGGCTCTCCCCCTCAGAAGTTCACAGTCATCTTTGATAC TG
GTAGCTCTAATTTGTGGGTGCCTTCATCAAAGTGCTACTTCTCAGTAAGTTATTTTT TTC
CTTAAAAGAATGCATAATAGAGAAAGCTAGTATTGGCTACATAATTTGATGATCATC AAT
ATTTATGTTTCTCTATGTTTGTGCAGGTTCCCTGTTTTTTCCATTCCAAGTATAAGT CAAG
CCAATCAAGCACTTATAAGAAAAATGGTCTGTTTCTGACCTTTGTCTATATTTGATA ATTG
CAACACGACACGTGCTTTTCTCTTATACTTGTTATTTATGCTCAATGCTTGCTTGTA AGA
GAAAGCGTTCCATTATTGGCATTATACATGACATGTCTTAGGTTTTGAGATCAAAAC TAT
TAACTCTG CTACCAACTTAG G ATTTTTTTAAAAAG AAAATAAAG G AAACC CTCAC CATTTT
TATTGTTGTCATCCAATTATGTGCCTTGTATCAAAGTTTTTTGTTGAAAAATATAAT TTGG CAAGTTTATGTTGTTGGCTTTCCCTGCCAAAAATGTGCTAATGTTATCTCTCTGATTTTT T
TTACTCATGATTTGCAATAAAAGCTTGTGCCTTTTAAACTGTTTTGTCTATCAAGGA ATCT
GTTATGCTGGAGTTCCTTTATTGAGTTTTGATATCTATCATAATTTACTTTCCTGGA AAAT
TGATGTCTGCTGTGTGTTTGATATGACCTTTGAATATTCTTCTCTGTCGTTGAGTTG GTC
AACGTGTTCAATTGGTTGTTGACCTAAGAACCTGTTCATCCAAACCTTTTTCTGTTT AATA
TGCCATACAGGGAAGTCTGCTGCAATTCGTTATGGTACTGGAGCAATATCTGGATTT TT
CAGTCAAGATAG CGTTAAAGTCG GTGACCTTATTGTG CAAAATC AG GTG AATGTG GCTT
CTCACTTCCTTTTTTTTAATTTTTTTTTATGTTTCTTGAATATATGGTCTCTCATCT GTCGA
GATTGTTAATGACATCAGGAGTTCATTGAGGCAACAAGAGAACCCAGTGTGACTTTT TT
G GTAG CCAAGTTTGATG GTATATTGG GTCTTG GTTTCC AG G AG ATTTCTGTTG G AAATG
CTGTTCCAGTATGGTATGTGGGTTTATTTTGTTTGCGTTCTCTTCTTTCCAAATGTT TCTT
CAATTTCCTATTAACCAAGTGCGTGCCTTGTGAATTTCATTATTATTGAAATGATTT TATC
TTCTGGATTGCAGAATTTCATGAACATTTTCTTCTATATAAAGTTTTAAGTGATACC GGTC
TTG ACG GTTTCTTCTGTGTTTTATAG GTACAACATG GTCAAACAG GGTCTTGTCAAG GA
GCCTGTCTTCTCATTTTGGCTCAACCGAAATACAAAGGAAGACGAAGGGGGCGAAAT T
GTGTTTGGTGGGGTTGATCCTAACCACTATAAGGGAAAGCACACCTATGTCCCAGTC AC
ACGGAAAGGTTATTGGCAGGTAAATATCCCTATATCTTCGGAAGATTGATGTTTTGC TTT
CTGCAACTGTTTTCTTACTCTTCAGAATATAATATGCAGTTTGACATGGGTGATGTT CTG
ATTGATGGTCAAGCTACTGGTATGTTACGTTACTTCCTTTTCTATTTTTTTGTGTGT GGA
GATTTCGAGGATATTGATGAGAGCACTTTCCCATGATTTCCCTGCTTTTTCGTTGTA TTG
ACATACTGAATAATGTAGGTTACTGTGACAATGGATGTTCTGCAATAGCGGATTCTG GG
ACTTCTCTCTTGGCTGGTCCAACGGTATTCTCCAAAGCATATTCCACTTTTTGTCCC TAT
TATTCAGCTATTTTCAATAGTGAACTAGCTCAGAATATTTTTTGTACCTTCTTGTTC ATGT
GTAGCTTCAACAATCTTCGAGCGATGAATAGGTTTAGTTTTTGGTTGGAATATCAGT TAA
ATAATAATCAGCCATTCCTTTGAACTTTTCTCGTTTTTTCCTTTTCCTATTCAAAAA AAGG
ACGACGGGAAGTGCAGTGGAATTGATGTTCATCCCAGTATCAGGACAAACTACCTTG TT
GATTGTCATACCTAAGAAATGTTTTTTTTTAACTTTTGCCTGTTGTTTCTGTCTTAT TAAAT
TAATGCAACTTGAGAACTGCTTCTTTCTTCTCATCTTTAAGGCATGGTTGACAAATA TGA
TACAAGG AAAAAG CTGC AG CTTTATTTGTCTAGACAATTG CAGTAGTGAAATG CTTTACT
ACTACATTTTCTAGTTCTCATCACTGTATCCTTCCTCCTCTATCTTGCAGACTGTTA TCAC
TATGATTAATCATGCCATTGGCGCCTCGGGGGTTGTAAGCCAACAATGCAAAGCTGT TG
TTG AACAGTATG GACAAACAATAATG GATATGCTTTTAG CG GAG GTG AGCAATTAATTAT
TTTAGTTGATAGTTTGTTTTTGTTTTTACCAATAGTTTTCCGTGGTATCTGCAAAGA GGGT
GGTTTCGTGCTACTAGTTGCCTTCCCAATATTCTGATGGATTGGCGTCTTAACAGGC AC
ATCCAAAGAAGATCTGCTCGCAGGTTGGGTTATGCACCTTTGATGGAACTCGTGGCA TT
AGGTTAGGCTAATCATTTCTTTCCTAACCTTGGCCAATCATTTGATATGTTAAATCC TATT
ATAAAATGTGTGCTGAGTGGATTTATGTCCTCCACGTGTAGTATGGGCATTGAGAGT GT
TGTAGATGAGAATGCTGGCAAATCTTCAGGACTGCATGATGCTATGTGCTCCGCTTG TG
AAATGGCGGTTGTCTGGATGCAGAACCAACTTAGACAGAACCAGACCCAAGAACGCA T
CTTGAACTATGTGAATGAGGTAAATAGCATCAGTCACATGCTTTCTCTTCTCATCTT AGG
TTAG ATTACTG AC C ATCTTTAAC AG CTTTG C GAG C G ACTAC C AAG C C C AATG G G AC AAT
CTGCTGTTGATTGTGGAAAACTTTCTGGCATGCCTAGTGTTTCCTTCACAATTGGTG GC
AGAACATTTGACCTCTCTCCTGAGGAGGTATGTCTGATATCAATCTTCTGTAGTATA CAT
GGTGTCTTCTCAACTTGTAAATGGCTTTTGATTCTTCTGAACGACGTGGTTGGTTGT AGA
ATCCTTTTGTCATGTTTCAGTTTGGCAGTTCAATTCTTTTTGGTTTTCACTAGATTA GCTA
GCAAGGTGTTACGCTGCTTTCAAGAGAAGTACACTTGTCTTGTAGAAAATTTCAACC AT
GACAGCTAAGTGTAGTTTGGATAATTAATGATATTGAATGTGTCGAGCTTCAATATC AGT
TTCTTTGCTTGATAAGTTAACTTATGATTGGATAATTAATGTCATTGAAGTGTGTCG AGCT
TTGATATCAGTTTCTTTGCTTGGTAAGTTCATATGATTGTACTAAGCTTGCATGCTT GTCT
TGTCACCAGTACATACTCAAGGTGGGCGAGGGTCCTGCTGCACAATGTATTAGTGGC T
TCATTGCCTTGGATGTTCCTCCACCCCGTGGACCTCTCTGGTATGTTTTCTTTTCGT CTT AACACACGTGCAGATTCTGTTATTCTAGAAAAGTTATACCAGCTCCCTTTTGATAATGCT GTTTGCTTATGGCTTTGGTGGTGCAGGATCTTGGGGGATGTTTTCATGGGTCGATATCA CACCGTCTTTGATTTTGGCAAACTTAGAGTTGGATTTGCAGAAGCAGCT
SEQ 39
TCATATGGCTGCAGGTCTTCCATCTTTCCTAGGATCACTTACCGCCACAAGCATCCCAT
GAAAAACCCCGTTTTTATATTCTTTCCCGCTCCTTCTTCCCAACTTTAAATGAGAAT TTG
G AAG GTTTTG AACAATTAG CTGACAGATGG CTCCTC CGTTATGTGCTTC GAGTTG ATGA
CCCCTCTCTTCCAAGAAATGCTTTTTCTCATCCGATAGCTCAATGTGATCGCCATCG ATG
CATGTCCAGTTCTCATACAGAACTACATTCGGAATTAGCTGCAAAAAGAGGAACACG AA
TCAATACTCGGGGTTTCAGAATTCGTCCGTATCAAGAAATACGTGAATTGCTCAGTA GT
GCTTCGCCACTACGCTAGCTGGCGGAAATAGCTTAATGGTAGAGCATAGCCTTTCCA A
GGCTGAGGTTGAGGGTTCAAGTCCCTCCGCTCCTGGCTTCGTCGTTTAGTGGTAACA A
GTTCCGTGCATAAGCCACTTTAGAGATAGGTGATCCTTAAAAATACTCCCTCTGTTC CAC
TTTATGTGAGCCTTTTCGGAGCACGAGGTTCAAATTGACCAATTTTCCTTGTGGATT GAG
ACATAGAATTTTCAAAAATTACTACATAAAAAGTACTATAAGTCATACTAATAGTTA ACAA
TTCAAAATAAGAAAACTTTGTCTGACTCCCTAAATAGTAATAGATTCACATAAAGTG GAA
CAGAAGGAGTAATATACATTGCTGATCAGGCAAAGGGACCGACCTCGTGGTAGACTC T
TGGACTCTGCACTGCTGCTAAAGGATCCATTCCCAAGATGAAATGGTTGATGAAAAC CT
G G AC C AC C G C G G G G ATTATTTTC ATG C C AC C ACTAC C AC C AATTAC AC C AG C C AACTG A
TTATCCTGTTAACAAGAAGAAGAAAGAATCAATAAAAGATATACTAAACAACAACTA CTG
AGGAAGTTCGGATTGTCCATATACTAAGCAGTTAAGAAAGATGACGCATTGCTCAAA GG
ATCGTTTCCTGGATTGGATACCTTGAGAACAATGATTGGAGCCATGGACGACAACGG TC
TCTTTTTTGGTTGAATAAAATTAGCCGGGGCAGGAGGGAGTTCATCAGGGGATATCT CA
CTAGGTGTTGAGAAATCTCCCATTTCGTCGTTGAGTACAATACCAGTTGATGGAGAG AG
CACACCGGCTCCAAATGGATAGTTTACTGTGGTAGTTACTGATACAGCATTTCGATC AG
AATCTACAATACAAAAGTGACTTGTTCCGTGATCTCTTAGCTGACTCCACCTACAAA TCA
AGGACCAAAACCGCTTTAGAGCCATTGGTGTGCAATCAATAGGCTTGTTAGTTCTGG GC
TCAGTCTGAAATACTCCTTCTGTCCCAATTTACGTGGCGGTGTTGGATTTCGAGAAT CA
AAAAAGTTTTTCTTTGACTGCGATTTTTTCATAAGCCTTTTAAATATTTTGATTTAA TTATT
ATTGTGACTTATAGTACTTTTTGCGTAGTTTCCAAAGATTTAAATTTTATTTCAAGA CTAA
AAGATTCTATGTCCAAATTCATGGTCAAAGTTAACTTATTTGACTCTCGAAATTCAC AAAC
CGCCACATAATTGGGACGGAGGAAGTAGAATTTATCTAACCTGGGCATATAGTATTC AG
GAGGAAAGGTGGTATTGTCGAAAATCTTCTGTCGAATTGCTTTGGCAAAAGATGGGG AA
AGCATGTCTGATACAGTTTTGCTGATATTTACAAAGTCGGGATCACCGAGGTCCATC CG
AAATGCAAACATGTGTTTCATCGCCTCAATTAGTCGATGCAGACCTAAAGAACCTTC TG
CAGCATTATAGCTTTCAAGGATTTTAAGAATCTGCATTTCCCCAATCGCAATGGTTT AGT
CAG ACAATTG CGTTGTATTAAGTTTAG ACATAATTG AG G AAGATTG GAACCAATAC GTCT
ACGTCTGTGTTATTAGAAGGAAAATAGCAGTACAAGCTAAGTTGGTTTATATACATA CCA
GAGAAATCCCCAGTGTTCCACTGGACGGAGGTGGCATTCCAACGATGGTGTAGCCCA T
AGCATTAACGGTAACTGCTTCTGGAGTTTCCACTTTGTAATTCCTCAAATCGTCCAT TGT
CAAAATTCCACCCGCTTTTTTCACATCTTCGACAAGCTTTTCACCAACCTCTCCATT ATA
GAATGCTTCAGGCCCTTGTTCAGCAATAAGCTCTAAGCTGTGGCTAAGTTTTACATT ATG
GCAAATATCACCTGCCCGTAACAATTTCCCCTCTGGTGCAATTACTTGTCGTAAACC AG
G ATCTTTAAGTATCAACTTTG CTTTTG AC GCAATATG ATGTG CAAG ATATG GAG CAACCA
CGAATCCATCTCTAGCAAGTTTAATCGCTGGTTGAAATAGGGTCTTCCACGGCAACC TG
CCATGTTTTGACCAAGCGGCGTGAAGACCAGCTAACTCACCGGGAACTCCCATGGAC A
ATGCTCCCTCTAACTTGGATTTTCCATTATTATCATACATGTTCTGCTTGAGGACAA ACA
AAACAAATTCAGAACCTGCAACTTCGTGTTGTCTATGTTTATATACTAAATACTAAT TCCA ATTCCCAATAAGAGCAGAACTGCAGTTTTCTCATGTAGACAGCTACAACAGCTATTACTA
CTATGCCTCAATTCCAAGCAAGTTGGATCAGCTACATGAATCCTCACTGTCCATTTC GCT
TCATTAAGCCACAGTTTACTTATGCCGATACAAATTAAAGAATTTAACTTCTATACA CTAA
TAACCTAATTGTATTTTACAATATCAGTACTTCAATCCAAAGCAAGTTGGGATCGAT TATA
CGAATCCTTAGTGTTCATGTCTCTCCATTTGAGCCAAAGTTTACTTATGTCGATACA AAT
TAAAGAATATAAACGTAATACACTAACAACCTAATTGTCTTTTTACAATATCAGTAC TTCA
ATCCAGAGCAAGTTGAGATCCGCAATATGAATCCTCAATGTTCATGTCGCTCCATTT AA
GCCACAATTTACTCATGTAGGTACAAGTTAAAGAATTTTAACTTATATACACTGATA ATAT
AATTCTTTTATACAATATCGGTGTATTTAACTTGGTGTAATAGGCAGCGTGTCTTAT TTTC
CACATTAGTATTCCCACTATTTATGGACGATTGCATGTAATTTTCGGTTAGGTGACC TGA
TACTGTAAAAATTCATTTCTCCAAATTAAATAGTTATTTGATGATGTTTGGATGAGT TGAA
ACTTGAATGAGAAAAAG CAATG CTAAACG AGTGAATTACG AACCTGTG AAG CAG CTAAA
GGAGCAGTTTCCCTCATATCAATAGCTTGAACTTCTGATGTTGATGAAGATCTAACA ACC
ATAAAACCTCCACCGCCAAGTCCGCTGGCCATTGGATTGACAACTCCAAGGCAAAGT G
CTGTGGCAACTGCAGCATCAACAGCATGTCCACCAATTTTAAGCATGGATATACCAA TT
TCCGAGCATCGACCATCATCAGCAGCAACAACTGCTTGCTCCGATTCAACAACGTCA GC
ATTTTGCTGCAGTTTTCCATTATATCTCTCAACATCTCCGATTAGCCAAATACCAAC GTG
TCCATGGTGTCTAAGGCCTATAACTAAATTAAGATGGAGGAAAAAAAGGAACTTAAA AAT
CAAATG ACAAAG AAGTGCAAATTG CAGGTG CTAAAATAATTGTGTAAG CTCG AG CAACT
TCTTGATTCTTATTTCAGAAGATAATTAGACAATCGATGTTAAAGGTAGGAATATAC ATG
CCAATTCTTTATATTTTTTATAATGGAAATATGCCCGAGAGCTAATGGCGCATTGTT CGA
AACTCAATGGATAGTGGGCCCGCTCCTCTAATTCTCACTTAAAGTAGGATTTTTGTC TAT
GACAATGTTCCACTGCTAATTTTGGTTTAAAAAAGTAGGAATGTACATGCCAAATTT TATT
TTTTGATAACCGACAAGCGATTATCAGAAAAGTGCAACCCGGTGCACTAAGCTTCCG CT
ATGCGCGGGGTCCGCAAAAAGGCCCTTGTGGTCTGGCCCTTTCCTGGACCCGTTGCA T
AGCGGGAGCTTAGTGCACTGAGTTGCCTTTTTTGGTAACCGACAAATCCCAGGGTCA TT
AGCGTATTGTTCGAAACTCAACGGATAATGGGTCCGCTCCTCTAAATTCTCACTTAA ATA
CTAGGATTTTTGTGTATGACAAGGTTTGAACCTGTGAAATGCGCACTCACACATCAC AA
GTTGTGCTTTTACCACTAGACCAAAACCCCACAAGCTTGTACATGCCAATTTGCCAG GT
CAAATATTATCCAATTCAAGAAAGCCTGTAAATTTGACAATTTTTAGGCAAAAACCC AGA
ATTTGATTTTTTTTTAAAATATGTCCTTACGATAAAGGGTCAGCAAAATTTTACGCT CTAC
GGTAATTACTCAACAAATTCTTTCACAATTTCAAGATTTATAACATAACACTTCACT TCTC
TTAAGTAAAGCTAAAGAGAGAAATTCCTGTTAGGATCAAAAAGTCACGTGTCATGCG GA
AGCTAGTAACACAAATCTTGAACGACGATAAATCAAACAACAAAAGAGAAATATACC AAA
AGAGACACAAACATTTAACGTGGTTCGGTCAACTGACATACGTCCACGGCGGAGATG A
GCAATCCACTATATATAAAAGAGAGTTCAAAATATCGAGATAACAACCTCACGAAGA GG
CAAACACAAGTGATACACTAACATTTGTCCCGTAAAATTCTCCCCCTAAACACGACT CTC
AAACCTCATATGGCTACATCGTGGATGTTAGAGATAAAGTTCAATCTCTATAAGTTA GGA
TAGAAATCTCTATTAGTTAGGATAGCTATGTTCTGTTAGCTATATTTTAGGATATAT GATT
GTTCTATTAGTTACCTTATCTCCCTAGTCTTCTATAGTGTGTTGTAGACTGTTGTAT ATAT
ATTCAACTATGTACTCAATAGAAAATCATCGAATTCTCTCAACATCATCTCTCATAA TGCT
ACTGAATGGGAAAGAAAGATCTCAATTTATAGAAGTTCAAACATTTTTCTACCAGAA AAG
G GACTAGCC AACTATG G AAG CATTATATTTTC CTTCTAG G AAAAG AAAAACTGAATTATG
GTAAATATGTTGTTCTTTCCTCCGTGAAATAGGAAAATCAATTATAGTAAAAAAATC TAGA
CAAACACGTAACAATTCCATAATCATGGTGTTAATTAACTTCATTTTTCATAGCTTT TTAA
AGCCCAATTAACGAAATTCCTACAGAATTCAACTGAATATTCTGTTAACAGAATTGC AAA
TACTAAGAAAACAAAGAAGAAGACAAAAAGTCAAAGGTGAAAAACTCACATGAAATG GC
AGTGAGTGCAAATAAAAAGCAAAGAGCAAAACTCCATTTCTTCCTTCTATTGAAAGT AGC
AGGAGAAGGGTCCAACAATGGAGCTTCTAAATTCTGTTTACTCAT
SEQ 40
TCAGTTGTGTCCTGTCAAAG G ATCTACTTTTATGCTTGTG G CAACAATTG GACTTCTAAC
CACATATTTACCTTCAGTTTCTACCCAGCTCAAAGAACCATAAACCACAATATCATC CAT
AACTAATG G AC CTTCTATTCTTAG CTTGTAG CTCAACTTTTC ATACTTCTCTTTG AAAACC
AACTTTTCAGGTACAAGATTAACTTTAAATTTACCCATTGTGGTCAATTTTGCTGTG TATA
CTGACATACCATCTCCAATATTAGTCACGGTCCTCTGGAATTCTTGTATCCTTCTAG GAT
CCGACTCGCTGCTGTTCCCATTGAAAAATCCAATGAAAGATGGATAGTTTAAGTCCA AT
GATGGGTTGGAGCAAGTATAAGATGAGGATCTTGTGATGGTTTTTATTTGTTTGGAT GT
GAAGTTCAGAGCACAGAGAAGATTGACATAATCTTGTGGTGTCGCATCATAGATAAG TC
CAGGATCTAGTGCCTTGTTTGGATCGATATGGCCAGCTCCCATGGCTAGAGGAGTAG C
AGCAGCATTCTTACTACCTGTTGAGATATAATATAATTAAATGATTAAATATATGCT CTCT
CTAACATATAAGCTTACTTGATTCAACATAGTATCAGAGCATGCAAGAGGTCCTAGG TTC
AAATCTCACCGCCACCAAAAAAGTCATAAAATAATTCCAAGTGTTTGGTCCATGAAA AAA
AATCAAACTTTTAGATGAGATGGTCACACAATTCAATATTACCTATGTCTCGGATGG GAC
TTTGTGTGTTGTCCATCGCATTGGAAGTGGTCATCATGGCAGATCGGATGGCTGCAG G
GCTCCATTCAGGGTGTGCGGCTTTTAGAAGTGCTGCTACACCAGAAGCATGTGGACA T
GACATTGATGTACCAGATATAATATTGAAGTTACTAAAAAGTTTTCCTGAGGTAACA TCA
GTCACTGGTGATTGTTGTGGCCATGAAGCTAGTATTAAGGCACCAGGAGCCATGAGA T
CAGGCTTGAGGATACTTGGACAGCTCGGTGACGGTCCTCTTGAGCTATAGGTAGCAA C
TTTTG GTG CTG GTTTAG CACCAATATGTGTCACTCG GAATTC AAGTTTTC CTTTAG GTGC
AGAGTTGCTCTTAATGTACTCTAGAACTTTATCACCCTCTTGTAAGTTCAAGAACAC AGC
CGGGAATTCGCTTTGGAGGTAGAATTCCAAATCAGTTATATTAGTTATGAAGACAGC CC
CAGCAACTTTTGAATTTCTCACATTGTACACATGCTCACTGACCGAATCATTCTTGT CAA
GGCAGACAACAATATTGTGTGCACTTTTTTGCAGTTCCTTGTCATCTTGGCATTCAA CAT
AGACAATG G AG CTTTCACTTG AACTAGAATTC CCAG G GTAG AG CG ATAAGCC AGTG ACT
GAAACTCCATTTCCAAGAGTTAATGCGCCAATAAATTCGCGGTCAACTGTGCCAGCT GC
AACAGTTAGCACCCAAGGTGTTCCATTGTGCAAAGTCTCATAATAAGGCCCTTCATT TC
CTGCAGAGGTGGAAACAAATATACCTTTCTCCAATGCAGCAAATGCGGCAATTGCCA CA
GGATCTTCGTGTAGTGGAATCGCGTCTATGCCTAATGACAAGGATAAAACATCTACA CC
ATCTGTAATTGCTTGATCAATTGCAGCAAGAACATCAGACAAGTATACACCTTCTTC CCA
TAGAGCCTTGTACATAGCCACATGAGCCTTTGGTGCTATGCCAATAGCAGTGCCGGT G
GCATAGCCAAAATAAGATGCACCCTCGACATAACTTCCCGCAGCTGTGGAAGAAGTG T
GAGTTCCATGTCCATCTGTATCTCTAGCAGAATTCATTGAAATGTTAAGATTTGGAT TGT
TGGCAAGTAGGCCTTTATTGAAGTAACGAGCGCCAATGATTTTCTTGTTACACAAAG AG
GAATTGAACTCAATGCCACTTTCACATTCTCCTTTCCATCTTGATGGTACTTCACTA ATC
CCATAATCACTATAGCTTTTACTCTCTGGCCATATTCCAGTATCAACTAAGCCAATT ATG
ATATCTTTACCATAGTCGGACGTTGGCCATACACCAGACTCAGAGTTTAGGCCAAGG AA
TTGGGATGTGTGAGTTGTGTCAATTTTAACTGACATATCCTTAATTGAAGAAACATA ACC
TG GAG AATTTTTTATG G CTTCAAATTCAG AAG GAG AAAG ACTTGCACTAAAACC ATTG AT
GGCATTAGTATAAGCATAGACTAGTTTTGAGGACAAGAATTCTTTGTGATTTGTACT ACT
GTCTGATAAAGAAGCAAGTGTTGTCAAGTACCAATTATGATGGCTAGCAAAAGCTTT TG
GCATGGCTGACAAATCCATATGAATGATATATGTTTCTGGCTTTGCTAGTGAAATTA TAG
AAATAAAGAAGAAAAGCAACCAAATACACAAGGTAATATGACTTGCCATGTTGAGTA ATA
TATTGAAGGAGGATATTTTTTTTAACAT
SEQ 41
ATGGAATTTTACCAAAAACTGGCAACATGTTCTCATTTGTCGCTTTTGTGCTTCATCCTC TTACATTCCATTCAAGTTCAAGGTAGCTACTTTGATCAAGAATATGGTAAGCAGGTACTG AGCTCAGCAATACAAGATAAAGATTGGTTAGTATCCATAAGAAGGATAATTCATGAATAC
CCAGAACTCAGATTCCAAGAATATAACACCAGTGCTCTCATTCGTACTGAACTTGAT AAA
CTTGGCATTTATTATGAATACCCTTTTGCCAAAACTGGTCTTGTTGCTCTAATTGGC AGC
AGTTCTCCTCCTGTTGTTGCTTTACGAGCTGATATGGATGCCCTTCCTCTCCAGGTT CAT
ACACAATTTTTTTACTATCAATCAATTATACCTCAATCGTCAATTAGTTGGGCAGTT ATAT
G CAGTTCG GAG CTAG GTTGTTCC CTAAGG G GAATCAACATATAAAGAAGTAAAG ACG AA
AAAGCCACGGAGATTCAATATATAGTGTATATACAAAAAAAAAATAAAAAAATTGAC CTA
TTTACCCTGTGTAATTTTCGACCCAAAGGGTATCAGTTAACTCCCCTTGGATAAGGT TGC
TCTGCCCCTAGTTATATGAATCTTCTTGTATCTAATTGAGAGGATTCAATATAGTTA AATT
ATTTATGCACCGGTCGTCAACCTAGCACAATCCTCCAACTTTATTTGAATCTGCAAC TGG
CTATGCTTTGTGAAGCTTAAATAGGTGTAGTTAGAAAGAAATATTCTTAATAGTGTG CAT
ATTTAGTTATGGAATGTCTCTAACATTATTCTCGAGTGAATATAACCATAGGAGCTT GTT
GAATGGGAGCATAAGAGCAAAGTTACTGGCAAAATGCATGGATGTGGACATGATGCC C
ACACGGCGATGCTTCTTGGCGCTGCTAAGCTGCTGAATGAGCGAAAGGACAAACTTA A
TGTAAGTTTGTTAACCTTACCCACTTCACTAATGCTGATTCATTTGGAATGTAATTT GTGC
TTGTGTGATTCTTTAACAAAAGATTTTTTGCACAATGTTGACCAATGACCAGATTGT CTT
GTTCTCAGAAGTAATAATATTAGGTTTGCGCTATAGTGATTATGCTGATCATTTTAT CCG
TTGTGCTTTGACTTCTTATCTAGGTTTGCATGTACACTAGGCCTTTGGAGCTTATTC TAA
AAGGGGGTATTTCTTAAACATAGAGGACTGTAAGAAGATAGATGAAAACATTCTTTA ATA
GAGGGGGGTATTAAGTGTACTTTGTCGAGATAATGAAAGAACAGACTCAAAAGGAAT AG
ACCAAAAAGGATATCTTTTTGCTTTGTTATCAGATTTAGTTCACTTATTCACATGTC TCCC
TCGGAACAGTCCAAATTTCATAGCAGTGTCGCAAAAAAGGAATAGTTGTGCTGTTTG TT
ATCGATGATGCTTCTTAACTTGGATATGACCATGTTATTCTTTGATTCTTTAAATCT GAAA
CTTGGATCGTCCTTCTGTGGGTGACTAGCAGTGCCTGTGGGTAATCATTTTTGCCTT TT
CCTTAGATGAACATATAAAGTGATTTTGCCCATTGAACATAGTTGTGACCATTCATG ATT
CATCAATTGTCTCGATGTGGAGAACCTAGCCCTCTGATCCTCCATGGCTTGCGAGTT CA
CATCCAGATGAAGCAACCAGAGAAACTAATTCAGGCATGACGAGAAATTTTCCGGTC AA
GAGAGAGGATCGATCAGAACCTGTTGAAGGAAATGGTAGATGACGGAGCATTGGCCC A
AATCAATTTCTCTCTGGAACCACGAAAAAGAAGCTGAGAAGACCGATAACTTCTATC TAC
ATTACAATAACAATACATGGCTGCATGTATAGGAAACGAGGAAACCATGAATGTTTT TTT
GAATTCTTTTTTGCTTGACCAATAAAAAGGAATTCAAGACTGAACCACACTTTCTAA TTAC
TTGTTAGTCTGTAATTGTCTGACTGATACTATTAGATATTTCTTTTCAACTTTATAA GAATA
CATTTGTCACATGACACTCGTAAAGCACTGTTCGAATTGACTTAATCTGTTTTTGCC CTT
TGTGTGGCATCATTCATTATCTATCCATTCTTGGGGTAGTCTACAATAGAAAGTTGA TTT
GTTGCTTGTCTCTATTTTTATTTTTTGAACCCGAAAAGGGAACGGTAAGACTTGTTT TCC
AACCTGCGGAGGAGGGAGGAGCTGGTGCATATCATATGATCAACGAAGGGGCTCTAG
GTGATG CAG AAG CTATATTTG GAATGCATGTTGATTTTAAAAG AC CTACAG GG AG CATC
GGTACTAGTCCTGGGCCGATTTTAGCTGCTGTTTCCTTCTTTGAGGCAAAAATAGAA GG
AAAAGGTGGGCATGCTGCAGAACCCCATGCTACTGTGGATCCAATACTTGCTGCATC AT
TTG CAGTTGTGG CATTGCAG CAG CTCATCTCAAG AG AAGTAG ATCC CCTTCATAGTCAA
GTATGTAGCCTAATCTCAATTAGAAGTATAAATCTTTGGTTTACACACACACAGAGA CAC
ACAGACACATAATTATGTAGGTACATATATTCCCTTCAGGAACATTTCTTGTTTTAG AAA
G CAG TAT AG CATTTG AGACCTG AAG CCTCATTGACAGTTAAG CTG ACTG AGATTG AAAT
TCTCATTTCTG CCTG AAGGTTCTTTCTGTTACTTATGTCAGAG GTG G ATC AG CATC AAAC
GTAATTCCGCCTTATGTTGAATTTGGGGGAACTCTGAGGAGTCTTACAACTGAAGGC TT
GCTTCAACTTCAAAAGAGGGTGAAAGAGGTAGGTTGCTTACATGAACCTTTGACTGT TG
TTGACTATCAACATCTGCACACTAGATTGTCTGCCAGATGTCTTCAACATGTAGTTT TCT
GTTAAAAAATTTAGTGATTTTTTTGAGTGATGTTTAATAGCCTTAAACTGAGCCTTC TTAG
GTACTGAGAGCTACGTAATCAAATTAATAAGATTAAGGGTGAATAATTCTCGAACAC GTG
TTCACATGAATATAGAAGTCTCAGCTGAATGAATGATATAACTTGTGGTCTGCTTGC AAT
TTTCCCATGAAAATGCCATGTAACTCTAGCATTCATAACTGATCATCTTTCCCTGCT TTG CTTCTCTTTCTTTGTCAAAATCAATTTTATGCCTGTCCTCAACATAGAAGCTTATCATTT T
TATTATTGAATCCTCTATTTCTATTTCGCATTGTTGAATTAGATGCTAATCGTCTTC AATG
TCAAGTATTGCGGCAAGATCTTACTAATTAATGTGAACAGAACCTAGATTTCTTGTG GCA
ATTTTGTGCATTTGTAACAC ATATTTACATG GAGC CTGCAG GTAATTG AAG G ACAGG CT
GCTGTGCATAGGTGTAAGGCGTACATTGACATGAAAGAGGAGGATTTCCCAGCATAT C
CAGCTTGCATAAATGATGAGCGCTTACATCAACATGTAGGGAGGGTTGGCAAACTCC TG
CTTG GTTC CGAG AACATC AAGG AAACTG AAAAG GTTATGG CAG GTG AG G ACTTTG CCTT
CTATCAAGAATTGATCCCTGGAGTTATGTTTCAAATTGGAATCAGAAATGAAAAACT GGG
CTCTACCCACGCTCCACACTCCCCTCACTTCTTTCTCGATGAGGATGTCCTGCCAAT TG
GAGCAGCGTTGCACACAGCCATAGCAGAGATGTATCTGAATGATTACCAACATCCCA TT
GCGGTT
SEQ 42
TTAGATTTCCTCAACTCGTCTATAAAATAGGACATAGGCGGCCGAGGTTTTGAGCTTGT
CCTGGCTGATGGGATACACATGGCTGTCATCGAAGTCATACCACCGATCAGCACCTT G
CTAGATTATTAGAAGAAAAAAACACAAAGTTAGAATATCTGGATTAAACTGGGAAGA CTG
TAAAAGTCTGAATATTTGACCTTCTTGCATATTCTTCATGATGAAGAAATAAAAACA ATGA
AGATGCATGACCAAAGTTAAATATATAATAGATGCACATATGTGCAATATGTGATTA ATTT
AGATG CAG ATGATG CATTG G AACTAAAAAATACATCAAAG GAACACTCACTACG AAATA
GGAAGTTTCTATATTTGCCCTTGGAGGTAGGTTTCAGTGACCGCCCACCACCCACCC TT
CTCCCCAAGTACTGATCTTTAGATGCCAATTTTATCAGGTATAAAAGAATCCTATTT ACC
ATAG AATAG AAAC AAAAC C AAG AAAAAG AAG C AAG GTAATC C AAGTC G CAG C AC C ACTT
ACATGAACAAACGCAGTGTAGTGACCCCCTCCCATGCTTCCATAATGGTTGCTAATT GC
ATAAAGCATATACCGGTAGGAAGATTTGCCATCTTTGTAGGCCAAATATGAGGATAA AT
CAAGATCATGAGTTGGGAAGTCAACATACGTCTCCAACTTGTTCTTCAGAAACCGGT TG
TACGAGAACCTCTTCAGGTGGATGACCAGAATCTCCGGCAGTCTCCAAAGATCCAAC TT
TTTAGTAGCTTGGCGATGCTGCTTGCATGCAGGGCAGTACCTAATGTTGGATATAAG AC
AAAGAAAGTTAGGCAGATCATATTCATACTTCCCAAGCGAATGACACAAAATTAGAT GAT
AAGATAGAATACTAGAAAGATTTCATGAAAGAAGTCTTTTCCCCCATGAGTGGCCCC AG
GTAAATAGCTGAATTTTATTATCTCAATTACTGTCTGTAGTTGCTATGAATATACAA GAAA
CAAAGAGCAACACAAAAACTATTTCAGAAGCACAATGTGCAGAAAATCAATAGGTGT TA
CATAAGATCATCAGATGCTGTCATTGTTTCTTTAACATAACTGAACTAGAAGTTGGA AGG
GTACTACTCATGCCATGTTGCAAAGTCAAGGTCCAATTTGGTTGAGGGGAGGCTAAT AA
GAATGGTTTATCCACAATAAGCAATCACGACGTGGTGGAAAATATTGACAGAAATGA AA
TAAAATG GTATAATTG G AAAAATACAATTATGTAACTACAAAAGTTGAG GGTTTATAATAT
TAGCATAAACCAAAGAAGAAGCAAAAAGGGAAGCATTACGAGAAATCTCTGAACATT CA
TATGTTATGAATAGTGAATACATAGCTTGCATATGAATGGTGTACAATCACAGAAGT GAG
GGAGTTGCATGCTTACCACATATCTTCTGGCCCTAGAGGCTCTTCCTTCAGAAATGC CT
CAAGACATTTATACAGAGAGACAGATTCTTGTGGTCTTTTGGCAAAAAACCCAGATT TAA
AAACTTCTGGCAGTGAGCTGAAAAGGCCTGTATTGTACTGTTCAAGCATTTTAGGTG AC
CAACTTACAAGTACATTTAACCGTCCAGAGATATCTGTGGACTGTAATGGCTCATTC ATT
ACAATCTCGGAGCCTTTAAAGGTTGCCTTATCATCTGATAGGTAAAATTCAAAGTCC ATG
TCTAAAGGTTCGGCAGTATCTTCTTCAGCAATGCTTTCTGGAACCCCGTTAACTATT GAG
TTGCCAGGTTCCATGTCTGTGCTGACTTCTGAATCTGTACATACTTCAGTAGCACTT CTA
TCACAGTTAAGAGTAGCACTTCTATCACAGTTAAGATTATCTGCTTGGGCTGTAGTG TG
GACTAAGAATGGTGTAAGTATCTGTAGATAAAGACTACGGATATAAGATCCTGTAAG AA
CTCTACTATGCGCGGCAAGCGGAATTCCAAATGTCTTCATATTTGAGGTCAGCTTTC CG
TATATGTAATGCCT
SEQ 43
CTAGCTAACCTGGTTTTCACTAGCCTTGTAATACAGCATTTTCTGCACATAAAACATCAT
GTATCCTTGCGCAGCTCTCACAATACTCTCGCTCACCTGAGTTATCCAGGCATCGTC AC
ATTTGTACCATTGATTGCTTAACCTCAGATATGTTACGTAATGACCAGCATCAAGTT TAC
CAGTATGGGTGATGACAGCAAACAACTCAAATTCCGAGGACGATTCACAGGACGCAT C
TTGCTCGTCCCCGTCAAAGGAGAAGATTCTATTTCCAAATCGACTCCTCAAGATAGA TG
AAG AG AG ATAAG G CG ACATGTCCAAG GAAAAAG G AAACTGTAG GTAGTG ATCAACCTT
CCTTGACATTTTCTTAATCACAGAATGCTCAAACCTTTTGATATGGAAGCAAGAAAC TAA
AGGCAGTTTTCTTATGGACATCTGTTTAAGAGATTCCTGTCTCACTTGACAATGTTG GCA
GAAGAACTTCTGATCAGAACCCAATTTCTCAGGTCTTGTGAAATGATCTAAGCATCC CAT
CAACGTAGAAATTCGACCGTTTTGGCTAAACTTTCCAGATTCTGCCTCCTTTTTGTG AGT
ATTATGAGACTTCTTTGATGTCATCTTGGCGGAACTCCCCTGGCTCAGTTCCAAGTC CA
AGGAGATGTCTATACATGGATCATATGTAGTAGATGTGAAGCCACAAGCTGTACACA TG
ACATCAGACCGCAAGATCCCAGAAAATACTCTATGAGCAATGCAACAGTCTCCGCTG CC
TGCACAAAAGAAAAAAATTAGTCAAGATTAGATTACAGATGACAAAGTGCAATGTGT ACT
GCTCCTAATAAGTTACATACAGTAACATAGTATGCACAGCTTAAAGTGCTCCTCAAC ACA
TTTCATAAAAAGCAAAAGTCCAGTTATGCCATAAAGACAAAACAACAATGGTCCTAT GAG
GAATGGCAACCAACAGAATGAGTAGCTACAAGCAATGGAGTACAATTGACTTTTGTA AA
AAAAACCTCAGTAAGAAAAGTATCCGCAAGACACTTTATTGCCTCAAAAGGTCTATT ATT
CCATTTACACAAAACCATTCAAAATCATAGATCCTTGTATTCATTATTTAACTGCAA CAAC
CAGGTGTTAACTAAATTTTGCAAACTCAGGGTAGCCCTGTACTAATCCTCTAATTAA GAG
GAATAAGAATAGGTGTTAACTCTCTCAATTAAAGTACGTGTGACTTAAGCGGGAATC AAA
TCAGCATATTTACAGTGGCAGAAATGAATTGTTTGCTAAGGAGTTACCCTACTAATG ATT
GCTCTGCTAAAGAAATTTGAGGTACCGAGGAGGTAGACTCACAAGAATAATAAGACA CA
CAGGGGCTTATAGAAAAAAGTAAGCAAAAAGTTGTCAACCTAGTAAAGACTTCAAGC TT
TCTTGAAGGTTGCGATCAGCTCACTCGAGAGTAATACCCTAAAACATGGTAAAAGTG CG
AGTTAATATGAGAACTATTGTACCAGAAGAAAACTCCTATGCTGAAATCAATAAGAC TAA
CCAAACTGAGACTTACTGGGAAGAGTAGGAGGAAATCTTTTTCCTTTTGAGAAACATCT
CTAAGCCTGACAGCGAATATCAACTGGTAAATCGCCTATGGCAAAGAAAAGACCTAA AC
CATAACCTGCATTCAAATATTTCTATCTTTTCTCAGTGACAACGGAAGTTGGGATTG CCA
TGAGAGATGAGATGCTAGAACAACAAAGATAAGCCTATCAAGTAGGCCCTGAGTGCATT
TTAGTTAGACGTTACTTGACATCACAAAGATGCTTGAACACACTATTCTACTTCCTA CAG
AAATTACTTTTTCCACCCCCTCTCCACCAACAAAAAAGTTCAAAAAATTCACCTACT GAA
GTACTTGACCTTTGCAAGTAACTATACAAATTTCAGTAATCCACTTATTTGAGGTTT TTAC
TGCACTAGCCTCCCTTGACTATGTTAGATTTATGTTGCTTTATTAAAATATGCATAA ACAA
TGTTGCCAATAATTTTCACAGCACAATAAAAAATATTTAATCTCAATGATTGGTCTA ATTC
GGAAGAAAAGGAAAAAAGAAAGAACAAGAAACTAACTATGCAAGGTGATGGGGGGAG A
AAAGATGGGTAACTAATAGATACTCTAACGCAACAAACAAATATACCTGGACTCAAC GC
CTTTCCTTTATCGTTCTGCATCCTTTCATGAATCCCATCAAGCACGGAAATGAAAAA CTC
ATGAGCATCCTGCTGTTCATAACTTGCAAGATTTGATGCATGCTTCCACCAACTGTT CCA
AAGAAAGGTTTACATCCATCAGAATCAGATACAGAACAGACTGCAATTAGATACTCA AAA
AAAAAAAAAAAAAAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN N
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNN CAATATAACAAATTACAAATGTATTATATCACAACCAAGTTACACACC
ACTAGCTAATTTTACTTCAATGAAGGGATAAGGGTGGTACATATAACCCCAACAATA AAA
AATTAATCTTGTCCCTACATCATTTGCAAATAAGGGCACAACCAAGCTAAATAAGGA ATC
TTTTACCAATATATCATTATCCTTAAAAAAGAATTATAAGTATACAACAACTATGCC TCAA
TTCCAAGCAAATCTGGATCACCTATACGAACTTCTCACCCAAAAAAAAAAAAGAACG AGT
GCACATTATAAGAAAGCCAAACATGATCTCAACAATCCAAGAAAACATGATCAAATG CA GACCTGTAGAGGAACTTTGCAGGACTAATAGGGGTCCAATCGCCGGAGAAAACAGCAG
AAAACATTGCATCCAAATCACAAGCTAAACACAGCATTGTTGAGTTCTTATTCCCAT TAT
CACTACTACTCCTTGTTATAACACTGTTATTCTTTCGCTGGCAAAAATATCTGTTAT GCTT
GTCACTCAGAAAGTAATTCCTCAATGGTGGTGTATGAAGCAATGCTTGAAGCACTGA AT
TCATAAAACACGTGTTTCCAAGATTGTTAAGGCCCCTCAAACCCCATTGTACTTCTG GG
GTTGTTGAGTCATTGCCGAGCTGACTCGGCAACGGACTCGAATTCCCAACGATCAAG A
CCTGCTCTTTCACATCAGGCGTCCACGGCTTATACTCCACGCGCCTCCTCTTGCGCG T
GTTCTCCGGCTGCGGCGGAGGGTCTTGTATTGATCCGATCACGGTAGCCTCCGTCTG C
GCTAGTACTACGGCGGCGTCGAAGTCTCGATCGTATACCTGGTCCCTACACCCGCAG C
AGAACAGCTCGGCCCTGTCGATGTCCACGGCGATGCTGTGCAGCGAGGGATCGGAAG
CGTTCCCCACCGGATGCGACGGCGCGTGCACACGGCAGAATACCGCGGCGCACGTGA
CGCAGGCGTACAACCGCGGCGGCGCCTGTCCGCACGCACCACATCTCACCAGCTCAT
TCGGCGGCTCACGGCGGATGGACGCACGCCCCAATGGCCGGACCTTCACGCAGCCCC
GGAAGTTGAAAAATGGGTTCGACCCAACTCTGGATCGGAGCTCCGACAAATGCTCGC A
TGTACCACTATCGGTATAAACAGGCGACCCGTTTTTATTTTCGGATCCGGGTTGGAT CT
TACCGATTGACAAATGGATCTTAAGACTGTTCCTAGACAT
SEQ 44
TCAAGAACTGCTCTTCCTTCCTCCATTTATGAAGGCCTCAATTGGTTCGGCATGGATGT
GCTTCATTGCCTTCACTCCCAGCGGGTTGTCCTTGCTCTGTATCAACAAATGCAATC AC
GAG CATTGTTTAG AAAG AATG G CGAG CAACATG CTACAACCAGTTAAATTACACTG ATA
ATATCAAAATTCTTTGGACGATCAATGTATAGAAATCAAACTTCTCTTTAGAAATAA TAAG
TGCGAGTCACTTACTATTGAGCAGGTGCCTGCACGAATATTGCAGACAGGATAGTCG T
GTGGGCAACAACTGTAGTGATCTTCACAGCAAGTGGCTCCTTCAAGAGGGCAACATC C
CCAAGAAAAGCAGGAGTTATAGTATTCAAAGACACAACAGCAAGTGGTACCCTCGGGG
CATTGAGCATAATCATCACACTGAGTAGGTGGCTTGATTGGAGATGGAGGAGAAGGTC
CAGGTTTAGGGGGATTTACGCCTGTCTTGACTGGGTAAGAAGGCTCTATGGCTAATC CA
CACAAACCTTTAGAACTGGCAACGTTACGTTGAACCCGAAGGTAACCTTTCTCTCCC CA
GGAAGCACCCCAAGAGTTCCTGATAATCCAATAATCCATACCATTCTCACTACCATA TCC
AACAGCAACCACGCCATGGTCCACTGCAGTTCCACATTTTCCAGTAAAGATACCCTG CA
AAATATATGTATTAGAACCATATAGCTACTAGCCAATAAGAACTTCATTATAATAAG ATTT
AAATATAGAATTGTAAGTAGTGATAATTGGTAAATCATAGTTTTTGAATTATAACAG GAAT
CTATTAACTCATGCAGCCAGCGTTATCTTTTGCTACATTCTTTGGTCGAAACAAAAC TAT
TATATCTGATCGAAAGGTAACGTATTACTGGAACGACGATGTCTTTCTTTGAATTCT GTG
ACTAACTAAAAGACTTCAGATTTTGCATGTTCACTTGTATTTATTACTTTCAGAAGA AAAA
GATAGGTCAATGCGAATAAAAACATTACCGATACATAGTGTTGAAAGTCGTGGCCAC CA
G CTTCAATG G CAATG CTCACAG GTTG ATTTGC AACAG CCTTTTG CAGTGCCTTTTC ATTA
TTTGCAGGAACATCTTCATACCCATCTATGGTAACAACCTTGGCATTTTTCTGCATG ACA
CAAATGTTTCAATAAATAAATATTCAACTCACTACATAAATCAATATATAAAGTGAA TTTA
GTCATTTCAAGAATGCAAAATTTACCCTTGACTGGTCACATCTGCCATCACGGCCTG TG
TAAGGGTAGTCTTCCTCAGTGTCAATTCCTCCATTTTTGATGATGAAATCAAAGGCA TAG
TCCATAAGACCGCCATTGCAGCCGTCATTGTAGGAAGTATCACAATCCACCAGTTCT TG
CTCCGATAGTGAAATCACATCTCCGGTGACTATCGAGTTCACTGCTTCAACGGAAGC AA
TTGCAGAGAATGCCCAACAGCTCCCTGGTTCATTAACCGATAGATTAATACACAAAT ATT
GAGCCATACAACATAAATAATGAATTGAGAAATTTGAACTCAGTGATTTATAAAAGA AAC
TGATATGCAGGCATTAATAAAGAATACACGTCAGTGTCACATGGTATTTAGAAGGCC CG
TCGGGTCACTTCGTGTAACATTTGAGCACATTAAGAGGTCGTTTGGTAGAATGTGTT AG
AGAAAATAATGCATGCATTAGCTTTGTGTATTAGTAATGCTTTGTTTGATACACTTT TTCA
ACCTATGTATAACGGATACAAGCATTAGTTATACAGTCTATTTGGTATTATCCTATG TATA G CTAATG CATAG AAAACCATGACATTAG CTATATCG AGG CTATTAATACTTG CATTAG CA
TGGTTAAAGACAAAATTATCCTTAAAGTCCCTTAAGTAAAGAATATGGAGGGCATTT TTG
TAAACAATTAAATATCTAAAAAATTATGCAATGCATTTTAATTTTTAATACACCACA CCAAA
CAATGCATAAAAAATAATCTCTGTATAACTAATGCTTGCATTACAAACCCCTGCATT ACTA
ATGCACCTTATTTAGCATTATTCTTATACGCCCTACCAAACGACCCCTAAATGTTGT AGC
AACTAGCAAATATTCTAGTGCCAAAACTCAGCATCAATTGGTCAGCATTTTCTATCA TTA
TCAAGTTGTGCTGTTAAGCTTTTCACCCTATAATAATTGCTCATGCGTATAGAAAGA TGG
GGTCGGGGAATAATGACTAGGAAAAAAGAAAAGATGGGGTCCTGGAATAATATGATT AA
AAGGGAAATTCGGTGCCCTAAGCTCAGGGTATGTGCGAGGTGCGAAGAAGAAGTGGA T
CATAAGGATTTATTGTACGCGACCTTACCCTGCATCTTTCTATACACCAAAAAATGT AAA
GAATTGTTACATATTACTAGGTAAAATTTGATCTATAACAAATAACTCTTCATATAT ATAAT
ACCGGCATAGGACTATTTAAACAATAAAATAACAAACTGTTACACCCAATTAAATTACAC
TAATACTATAAAAAATTATACTATCAAAGTATAGAATTTAAACTCGAAGGAAATTGT AAAA
ATAGCACGGTATAGCCAGTTTTCGGATTGGTCATTCAAAAATAGTCAGCGTTTATCA AGT
CAATGAAAAATAGCCACTATTTTGCTGCAACAAAGACCGATCCAACATAATATACTG GAG
TTCGGTGCAACTGTGTATGAACTACAACATATTATGCTGGACCGATATACTTTGTTA GCT
CCAGTATATTACTGTAGCACCGGTGCTCCAAACTCCAGTATATTATGTTGGACCGGT AT
ACTTGATGGAACTCCAGTATATTATGCTGGAGTTCTAGTGCGCTTATGCAGATAGAG TT
CCAGCATACTTATCCTGGAACTCCAGTATAATATGTTGGAGTTCAAGTATACTTATG CTG
GAACTCCAGCATAATATACTGGCGTATTTTCTGAGTTTTAAACAGTGTTTTCGCTCA AAT
TTATCTTTACATAAAAAGTGGCTAAATTTCGATTACTTTTAAAATTTGGCTATTTTT GAAC
GACCAGCTATTTTTTATTTTCACACAAGGAAATCATAAAGGTATCCTACGTTTATGC GTA
GACACACAAATTGGCTTTCTACTCATAGAGGACTAGAACTGAACGCCACCATACTAT TG
ACATGAGAGTTTATTTAAGTGCAATGAAAGTGTAAAGAACAGAAAAAATATTTAAAT CTG
ATATAATAACATAGAAAATAATTCTTGATATCGATTCTAATCTAATAAAATAGGAAA TAAT
AAATAACTAGTTATATTTAGAAACAATTTAACTTTAACGTTTTCATTTTATCTATTT TATCA
TTAGAGAGAAACTTTTATAATCACACGAATGTTACCCACAAATCTTTTGCCCTTGAC CTT
TTAGGACCACATGATCAAAAGTTTTCTTTTCTTCTTTTTTTTTAAAAACTTTATATC AAGTC
AAATTATATCATTTAAATTG AAACG GGTAGAGTATTTATATATTTTAG CACAATAAG G CAC
GTATGATTTCCTTTGTTTGTCAAGTTCGTAGAAATACTTTCTACATATAAATTAATT ATGG
TAGTG G AATACC AATAG CCTATCTCTATATG CTTTAATAACAAATTAAATCAG G AAAATAT
CATCTAAAACGCCGTCTAATTAATTATAGATCCATAACCCAAAAACCGGAGATAAACGAA
AAAGACAAAGCCATTGCTAATTAACTTACCACAGCTTCCTTGATCCTTAACTCCAAC AAG
AACACCTTTCTCTCTCCAATCAACTGAGTCCGGCAAGCTATCCCCAACTTTAGGAAG AT
ACCGATCGCTTTTGTTTTTCAACAACCTGCGACGATCACTGGTCTTAGTACCTAAGT ACA
TG GACCTGTACTC CTCGTTG GTCAG ATCAG CAAATTTG GTTAAACCAAG CTTGTAACTC
TTGTTTGGAACGGAGTTTTGTTCATCGATGTATCTTAAGTTATCTTTAAAGATCTGA AAC
CGCTTGTCTTTTTCGTCTAAGGCGTTGTACGATTTTCCATGTTCGAGTAGCCATGAC TC
GTACAAGGACATGACTTCATCGTCCGTTCGAAAGTGTTGGTTTTCGTCGTAGGTTAA GA
TGGACATGTCGGAAGCGGAAGATAAGGTGGAGAAGAAGAAGAAGAAGAGGAGGAGAA
GTAGGGATATGGATATGGTGAGAGTGGAGCTATGAGTTGCCAT
SEQ 45
TTAGAGTTCATCCTTAGGTGCTGTTGCAGGAGACCCTGTGGGGTTGCTTGATGAAGTCT
TGATGGGGTAGGATGGTTGCATTGCTATACCACACAATCCCTCTTCAGCATCAATCT CG
CGTTGCATCCTAATGTATCCTTTTTCTCCCCATTCAGGTCCCCACGAGTTCCTCACA ATC
CAGTATTTGGTTCCATCAAGGGTTGTGCCATAGCCCACAATTGCCACACCATGGTCC AA
CTCAGTACCACAGTCTCCGGTGAATACACCCTGCCATATTTACAGTTCGTAAATGTT TAT
ACCTAGTAAAAAACTTTTTAACTTGAGATAAATGGTCTATCATCATTTACCTCAGAG TAGA ACTGG AAGTCAGAAC CTGAAGCTTGTATAG CTACAG AAACAG G CTG GTTG G CTACTGCT
TTAAGTAGGGAATCCTCATCATTAGGAGGAACATCCTCATATCCGTCAATTGATACC ACA
G GAG AATTCCTCTG CCAATTCC ATAAAATTC ATG CACGTG G ATTAG AAACAAGACTG GT
TCGATCTGACAGACTGACACCCTACAGATGTAACAGAATCTTACCTTTTGAATATCA CAC
TCGCCACCTTCAGCCATGTATGGATAGTTCTCTTCAGTATTGATGCCTCCCTTCTTC TTG
ATGAATTCAAATGCCATGTCCATCAACCCTCCATTGCATCCTTGGTTTTGACTAGTG TCA
CAGTCAACAAGTTCTTGTTCTGATAAAGATACTAACTCATTTGTTTTGATTTGGTTT ATCC
CCTCTACTGCAACGACAGTTGAAAATGCCCAGCAACTTCCTATAACAGGCAAAAGGT CA
GTTTCCATCAGCTATAATATTTTGAAAGAACATATCATATGGTTTACCCTTATGTTA TTAT
G CTAG AGGTGTAAAG CG G CATAG AATTAATG ACATG CTACTCTTTTCTTACCACATTTGC
CTTGGTCTTTGACAGGAGTAACAGCACCCTTCTTCCTCCAGTCAACAGAGGGAGGGA C
ATCTTCCACATTGGCGTACATGAAAGTTCCATTTGCTCGTGAAGCTCCAAGAAAAGA AC
GATGATGCTTAATCTTGGAACCAGCATAATGGTGTCTGAATTCATGGTTAGTCATGT CTG
CAAACTTGTTCAATTTCAACTTATAAGGCTTATCCTTCTTGTTGAAGTTGTGAACAT AGTG
TACATTAGCCTTGAACACATTGAACCTCTTGTCTTTCTCATCAAGGCTCCTCGATAC AGT
GTGATGGCTTCTCCATCTCTCATACAACTCCCACAATTTTTCCTCAGTTTCCAACTC CTT
CTCGTGGAAATCGAAACTCTCCCCAAGCCTAAGTACCAAAGCCAAAGAGAAAAGAAC CA
GAAATAACTTCTTCAT
SEQ 46
AAAACCAACCTGTGAGACATTAACATCCAACTCTTGGGCAATGAAATGGGCAAGTTCTG
GAATGCGAGGCTTCAACTCTGAGCAGTTGACACTTAGTGACAAGTAAAATGTAATGA GC
CCAACTTTAATTTCCACTGCAAGTCGACATAAAAGATGAATGTGATTACAACCATAA GTC
TTTGTAATGGAATTATCTAATTTCAATAGCCATCATATCTGCACCGAAGCCTAGCTC AAG
TTTGTGAGAATAGATGTAGTGAAGTAGAAAAAGGGGACTAATACTTGCAAAACTAAA TTG
AAATCTTGAAAAGTTTTACAGCAGATAAAAGTCAAAGCATTTGAGATTATGCAAACC ATT
GAAGAGGTACATCAAATTGAAATAATACAAAACAGGGCTATGTTTCAACAATGCAAA CA
GGAAATATTAGGCAGGAAAAATTTTGCGATTCTGTCATTACTTTAAGGTCTTGCCAC AAA
TTTCTCATGCTTGTCGTTGTCCAGTCACAAAATTCACTAGAAATTTGACAATTGATT ACTA
TAACTTAGTGGATGGATTTTCAGATAGTCGGTATATGGTCAATGCATGTTCACTTGG TAT
CAGTTGTCGTAGTCCTTAGAAATAACTTTTTGGTCCCTTGATTATACCATATTTGTA CTTT
AGATCCCTCAACTATTCAGCTTTACACATTAAGCCTACAATTTAACGAACTTTACAG ATG
TAGTCCAATAATAAACAAAACTAACTAACCCACGACATATTCATATTTCAAGTCTCC TTTT
TTAATAATACAAATTTCAAAAGGAGCATTAATGTTGTAAAAGTACCGTTGCCTACTA AAAT
ATCCCAAAAAGATGAACACGCTGCTTTGGAAATGGAACGCACCAATCACACGAGTTG C
GGATTAACAAAATCTAAAAATTCTAGTTCTAATTAAGATCTGACTAAATCTGCAAAC TCGA
CAAATTAAAAG GCAGATGTCAAAGTC GG AG AGTTG GACTAAAAGTG G AAACAG GG G GA
TAACAGGGGACCAAAGATTTTCTCAGTATTCCTTGAAGTTATTATAATAAATTTCCA GTTT
AAGTAAATCTTTCTCAAACTACAAGAAGGCTGAGAATGCTGTGACATCAGTACAATT TTA
TG CAG G CAGTTG CTTCTTAGAAATTTTAAAATACCAGG G AAACAAGATG ATTACAAG ACT
AATTTCAG AG AAAG GTCAGATGTC AACTTGAATAG G ATT AT AAACAG GGATCTTT AC ACA
AATAGCCGGCTATATTCATGTTTACTTTTTCTAGCCATATACACAGATTATACATTG ATGA
TACACAATTATGCACATATAATACATAAATTATGCATTCACACAAATACCAGCATTC TGGA
CATAAGAGACAGAATGTTGATTGCCCAAAAATGATCTAATCGAAGGCAACACATCAA AA
TCAGCATGATGCAATTCTAATTTTGATTCTCATTTTCATAAAAGAAACCACAAACCA TTAT
AGTTCAAAATTGAAG G G AAAATTG AAAAGG G AATTGTATTAATCTATTAGAAAACAG AGC
TAGTAGAGATGCGAAAAATGAAACCAGTTCAGGTTATGACAGCATTCTAATGGGCAG AG
TAACCTTACAGTAAAAATACTATGTGAAGAAAAGCTGTCCCTTCATACCAGGTGTGT TGT
ACCCAGGGGGTCCACTAGGAGCTGATGAAGGAGACAGGTGCGCACTGGAGTTTGTGT TATCCAAGCTTGAGACCGATGGTGAAGGTGGAGATGGAGGGGATAAATTGAGTCTGTC
CCATAACTCAGAACAATTAGTTTTCCAAAAACCAATTCTTTTATTTTCACGATCGTA AGTA
ACAAGAGTGTTGCGAACAACGATTCCTGTTGCAATTAAAAGCACGCATGTTACAGCA CC
GAAGACCTAGGACGATGAACTTTGTCAAGTAGTTAGAAATACCTCCAAGAAGACTAG CT
G GATTCTTTCCATTCG GG AAAATTC CTAG G CAATAAG CAC CACGTACTTTG AAGTG CTTT
ATAGCATACAAGTTACAATGAAGAAGGGGGAAATTAATCAATACTAAATCTTATAAA GAA
AAAGAAGCCAGTAGATGTGACGCCAAAATAGTGTCGGCAGATAGTTAAGAAAGAACT AA
AACAGAATAAAATGCCTACCTGAAACAAGTAATTTTCAGGAGAGAGAGTTAGTTTCT TTC
CATCGCTGAATACCATATCGACACGCGGAAAGTTCTTTGAGAGTTCTGATATGTTGC TG
GAATATATAAGACATTAAATAATAAGGGTAACACCAACAATAGGAAAAAAAAACAAATTT
AGACAAGAAAATCATAGACGTCTAATTAATTTTGGAACTCTCCTTATAAATACCTTC CAG
CACCAGAAAAGCAGATATCTTTAAAACTAGGATCTGGCCCTTCAATCTGTTTTAAAG AAT
GAAGCTCTTTCACTACCTGAAAAGAAAAGTTGAAGAGGTTAATCCCAAAAAGGAAAA TA
CTTAAGTTTATCTAAAGATAATCGCCACAAGTTACATACTGTATAAATTCACAACCA GTG
ACTCCACTTTTCCTTCGCTAAATGTAACATAGCAAAAGCAATGAAAGACAACATCAA AGA
TATGCAGAAACTTGCAAGGAACTCTGTATATGTAGATAATTACTACCTGCTAACATC ATT
GGAAGAAAATGTTGTGTGGTGTATTACCAAATTCAGAAGTTTAAAACAGGAAAAATC AAC
TATTATAAGTGGTTAAGCAATCTAATTAGGTCCTTTAATAGCAGAGAAACTAACAAC AAA
AGAAGGGATAGAGGCCATAATATAGCCTTCCTTTCTTTTGATAAAGTACATAGCCTC CAA
CTCATCAC AG G AATAAGTTAG GTG G CAGAGTTATAGG ATCACAAGATAC CTGGTTCTTA
CTTTTCTTCAAAAACATGTTCTAGATATCAAAGTGTTAGCTCTCAAACTATTATTGA CCAA
GATGTTTTAGTTTCAGTCCCACTCATGTACAGGAATCTCCAACTATATATTTGATCC ACG
TCACAATAGGACATGATGGTTTGTTTCACCAATCAGTGGTGTAAATAGAACTTTGAA TGG
CAAGAAAAATTTAACAGATTGAGTTACATATAGAAAACACATTATTCAAATGAATAT CTTT
TCACATCTAATTTCTTCTAGGACATTCTGCTCATCAATAGCGTGCTATTCGCCACTA GTC
ACTTTCTCAGACAGAACGAATCAAACAAACAGACAGAAACATTGGCAGATATGTAGA TA
AACTATATCTTACAGCATTCTTGAAAGCTGCAAATGCTGCTTCTGGAAGGTACGCAT AG
GTGGTACCACTATCAAGTATAGTCCCATGTTTTCCACCAAAAACCCGTGGATTTAGG TTT
AGCGGCTTCCCAGCGACATGTATCTCCTTCAGGTCAATATTGTAGTACGGGCTGTTG CC
ATCCGAGACTAAAGTAAATGCGTAAGATTTATCAAGTGTATTTGCTAGGAGGAGACC TA
ACATCACTGTGTCTCTAAACAATTGTTTCCAATACATACCTGTGACCAAAATCTGAT TTG
GTAAAGGCCATGTCAGCAGGGGGTTTTACTCCACCAAGAACCATTGCCCCGCCACCA A
AATCCATCCCTCCATAGCACAAGGAGAAAGAATCACTAATTACATGTTTTTCAACAA GTT
GATCAACTATACTAAGATCACCTCGGCCCAAACCCATTATACCATCAGCACGTTGGC TG
TAAAGATCACCAGTTTCCGCAATTTCACATCCAAAAACAGCTCGTTGTGGTGCAAGC TC
ACTTAGATTTCCAAAAGATATGATGTCCTCTCCAAGCAACCCATAACTTGCACTCAT CTC
AGCGTACCGTCTCTCATAAATACATTGCTGCCTCTTATGGTCGCAGGGACAAGCCTT AT
TGCATTTCACAGATTGATAAGTGCTTGACATTTCCGGCTGAAACTTAGGATCCTAAA TAA
GACAACATACAGAGTACCACCGATCAAAAGACAAAAATCAAATGCCAAGCAATTCTA AG
TGCTATATTAAGCATTTCTCACAAAATTAATAGGCTGACTCAAAGTAAAATTACTAG TCAT
GAAGTTTCTTAAGTGCTGATATTTTCTCCATTAGGAGATCTTTTATGATTACAAGGC AAAT
TGAGCAAGAATCACATTTAAAACATCATGAAACTATACAATGGATTTGTACAGCTTA TCA
ACAAAGAGAGGCTTAGAACTATTTGTATCCTAGTTAGATTGGTTTGTTCTTGTTTAC CCT
CTTTCCCTTAATACTTACAAATAACTGCATCCTACTAAGCGATTTCCTCACAAACAA AAA
GTATATTAAGTAATGTTTATGAAATAGCACCTATACATAAACAGTTTCAAATTTTAA TTTC
CATATCAAGCTATCAAAACACACTAACTGCAAAATTAAGATAAATATGTAACCTTTA ATGT
TTATCCAGAAAAAGAAAAGAGGGAAAAAACCTGATGGTTGCCACACTTTTTACACTC AG
AGCAAGGGACATAGGTAACTGTACTCCCTGTATCAACAATAAGAGCGAACTTCTGCG GT
GGTGTTCCAATCCAAATATGAGTTGTATAGTATCTGCATCACAAGTTACTTGGAATC CAT
TCAACAAAATAAAAATTACTAATAAAGAAATTAGTCTGAACTAGAGGTATATAGAAA TATA
TTCCACAGTGAAAACGCAAAATACGCATGTGAATCAGCAGCCAAAAGAGTTAGTAAC AG TGAATTTAAATTTTCTGAGCAAAAGCTACGAATTTGAACCCGTTGAGGAGGAGATCATCA
TGGAGAGACATGCGAGCGCTGGCAGGACTTTTCTGGAGGTGGCGACGGGAGATTTCC
GCACGGCGTGAAGTGTCTTTCGGAGGAAAGAGCGGCAGCAGCATGGTTGTGTGACGG
CTGCCGTCGGCCGGCGAAGGGAGGAAAACGGAGCTGCCGTTAGTAACATCAGATAAT
CGGAAGCCGGAAACGACACCGTAATGGATCAACAGAGAGATGATCGCGAGAATAACG G
TGAACTGTGGCCGTGCCAT
SEQ 46
TCAATATTGTAGTACGGGCTGTTGCCATCCGAGACTAAAGTAAATGCGTAAGATTTATCA
AGTGTATTTGCTAGGAGGAGACCTAACATCACTGTGTCTCTAAACAATTGTTTCCAA TAC
ATACCTGTG ACC AAAATCTG ATTTG GTAAAG G CCATGTCAG CAG G GG GTTTTACTC CAC
CAAGAACCATTGCCCCGCCACCAAAATCCATCCCTCCATAGCACAAGGAGAAAGAAT CA
CTAATTACATGTTTTTCAACAAGTTGATCAACTATACTAAGATCACCTCGGCCCAAA CCC
ATTATACCATCAGCACGTTGGCTGTAAAGATCACCAGTTTCCGCAATTTCACATCCA AAA
ACAGCTCGTTGTGGTGCAAGCTCACTTAGATTTCCAAAAGATATGATGTCCTCTCCA AG
CAACCCATAACTTGCACTCATCTCAGCGTACCGTCTCTCATAAATACATTGCTGCCT CTT
ATGGTCGCAGGGACAAGCCTTATTGCATTTCACAGATTGATAAGTGCTTGACATTTC CG
GCTGAAACTTAGGATCCTAAATAAGACAACATACAGAGTACCACCGATCAAAAGACAAA
AATCAAATG CCAAG CAATTCTAAGTG CTATATTAAG CATTTCTCACAAAATTAATAGG CT
GACTCAAAGTAAAATTACTAGTCATGAAGTTTCTTAAGTGCTGATATTTTCTCCATT AGG
AGATCTTTTATGATTACAAGGCAAATTGAGCAAGAATCACATTTAAAACATCATGAA ACT
ATACAATGGATTTGTACAGCTTATCAACAAAGAGAGGCTTAGAACTATTTGTATCCT AGT
TAGATTGGTTTGTTCTTGTTTACCCTCTTTCCCTTAATACTTACAAATAACTGCATC CTAC
TAAGCGATTTCCTCACAAACAAAAAGTATATTAAGTAATGTTTATGAAATAGCACCT ATAC
ATAAACAGTTTCAAATTTTAATTTCCATATCAAGCTATCAAAACACACTAACTGCAA AATT
AAGATAAATATGTAACCTTTAATGTTTATCCAGAAAAAGAAAAGAGGGAAAAAACCT GAT
GGTTGCCACACTTTTTACACTCAGAGCAAGGGACATAGGTAACTGTACTCCCTGTAT CA
ACAATAAGAGCGAACTTCTGCGGTGGTGTTCCAATCCAAATATGAGTTGTATAGTAT CT
GCATCACAAGTTACTTGGAATCCATTCAACAAAATAAAAATTACTAATAAAGAAATT AGTC
TGAACTAGAGGTATATAGAAATATATTCCACAGTGAAAACGCAAAATACGCATGTGA ATC
AGCAGCCAAAAGAGTTAGTAACAGTGAATTTAAATTTTCTGAGCAAAAGCTACGAAT TTG
AACCCGTTGAGGAGGAGATCATCATGGAGAGACATGCGAGCGCTGGCAGGACTTTTC T
GGAGGTGGCGACGGGAGATTTCCGCACGGCGTGAAGTGTCTTTCGGAGGAAAGAGCG
GCAGCAGCATGGTTGTGTGACGGCTGCCGTCGGCCGGCGAAGGGAGGAAAACGGAG
CTGCCGTTAGTAACATCAGATAATCGGAAGCCGGAAACGACACCGTAATGGATCAAC A
GAGAGATGATCGCGAGAATAACGGTGAACTGTGGCCGTGCCAT
SEQ 47
ATGGGAGCAAAATCTTTTCTTGTCGCCTTTTTCCTTTCATTGCTGTTATTTCCTTTGGCC T
TCTGTACATCAAATGATGGCTTGGTTAGAATTGGTTTAAAAAAGATAAAATTCGATC AAA
ACAACCGACTTGCTGCACGCGTCGAGTCCAAGGAGGGGGAGGCTTTGAGGGCCTCTT
TTAGGAAGTATAATAATCTCCGTGGTAATCTTGGGGCCTCTGAGGATACAGACATTG TA
GCACTGAAGAATTATATGGATGCTCAGTACTTTGGGGAGATTGGTATAGGCAGTCCC CC
TCAGAAGTTCACTGTCATCTTTGATACTGGTAGCTCTAATTTGTGGGTGCCTTCATC AAA
GTGCTACTTCTCAGTAAGCTTTCTATTACATTTTTACTGTCATAAAACATAACAGAG AAAG
CTAATGTTGGCGTATGCATAATTGACGAGCATCCATATTTATGCGTCTCTGTATTTA TGC
AGGTTCCATGCCTTTTCCATTCTAAGTACAAGTCAAGCCAATCAAGCACTTATAAGA AAA ATGGTTTGTGTCTTGACCTTTGTCTATAGCTGAAATTGCTGCATGAAAACATGCTTTTCT
CTTAAACTTGTTATTACGCTCAATGCTTGCTTGTAAGAGAAAGTGTTCAATTATTGC GTTT
TGAGATCAAAACTGTTAACCCTGCTCCCAACTTAGGAGATTTAAAAAAAAAAAAGAA AAT
AAAGAAGACCCTTACCATTCTTATTGTTGTCATCCAATTATGTGCCTTGCACCAAAG ATT
TCTGTTGAAAAATATAACATGCGAGATTATGTTGTTGGCTTTCCCTCCCAAAAGATG TGC
TAATGTTATATCTCTGATTTTTTTCTTTCAATTATTGGCAATAAAAGCTTGTGCCTT TTGAA
CCGTTTTGTCTATCGAGGAACCTGTTATGGTGGAGTTCCTTTATTGAGTTTTGGTAT CCA
TCATAATTTACTTTCCGGGAAAATTGGAGTCTGCTGTGTGATTGACATGACATGATT TTT
GATTATTCTTCTCTGTCTGCTTTCTAAGTTTCTACATTCTCGGTAGAGGTAAGATAT GCG
TACTATCTACCCTCCCCGGACCCCACTTATGGGACTAGGTTTTTTTTGTTGTTGTTG TCG
TCATCTACTTTCTAAGTTG GTCAACGTGTTCACTTG GTTGTTG ACATAAGAAC CTGTTCA
TTCAAACTTTTTTCCTGTTTAATATGCCATACAGGGAAGTCTGCTGCCATACGTTAT GGT
ACTGGAGCAATATCTGGATTTTTCAGTCAAGATAGCGTTAAAGTTGGTGATCTGGTT GT
GAAAAATCAGGTGAATGTGGCTTCCCACTTTGTGTGTGTGTGTGTGTGTGTGTTTTA AA
ATGTTTCTCGAGCATATAGTCTCTCATCTTGTTAATGACATCAGGAGTTCATCGAGG CAA
CCAGAGAACCCAGTGTAACTTTTTTGGTAGCCAAGTTTGATGGTATATTGGGTCTTG GT
TTCCAGGAGATTTCTGTTGGAAATGCTGTACCAGTATGGTATGTGGGTTTATTTTGT TTG
TGTTCTCTTCTTTCCAAATGTTTCTTCAATTTCCTATTATCCAAGTGCGTGCCTTGT GAAT
TTCATTATTACATTGAAATGATTTTATCTTCTGGACAGAATTTCATTAACATCTCCT TCTG
TATAAAGGTTTAAGTGATACTGGTCTTGACAGTTTCTTCTGTGTTTTATAGGTACAA CAT
G GTC AAACAG GGTCTTGTCAAG G AG CCTGTCTTCTCATTTTGG CTCAACC GAAATAC AG
AGGAAGATGAAGGGGGCGAAATTGTGTTTGGTGGGGTTGATCCTAACCACTATAAGG G
AAAGCACACTTATGTCCCAGTCACACGGAAAGGTTATTGGCAGGTAGATATCCCTAT AT
CTTTGGGAGATTGATGTTTGGCTTTTGCAACCGTTTTCTTACTCTCAGAATATAATT TGC
AGTTTGACATGGGTGATGTTCTGATTGATGGTCAAGCTACTGGTATGTTATGTTACT TCC
TTTTCTATTTTTTTGTGTG G AG ATTTCGAG GATAAG ATGAGAG CACTTTCACATG ATTTC
CATGCTTTTTCGTTGTATTGACATACTGAATACTGTAGGTTACTGTGACAATGGATG TTC
TGCAATAGCGGATTCTGGGACTTCTCTCTTGGCTGGTCCAACGGTATTCTCAAAAAC AT
GTTCCATTTTTTGTTCCTCTTATTCAGCTATTATCAATAATGAACTGTCTCATAATT TTTTT
TGTACCGTCCTGTTCATGTGTAGGTTTAATTTTTTCGCTGGAATATGAGTTGAATAA TAA
TCAGCCATTCATTTGAAGTATTCTCATTTTTTCCGTTTCTATTCAAAAAAAAAGGAG GATG
GCAAGTGCAGTGATATTGATATTCATTCCAGTATCTGGACATACTTCCTTGTTGATT TTC
ATACCTAAGAAATGTTTCTTTTTACTTTTGATCTGTTGTTTCTGTCTTCTTTGTGTG CTCTT
CTTCTTTATTAGGAAAAAAATTGTGCATCTTGAGAACTGCTTCTTAATTGTTTTCTT TTAT
G GCATG GTTG ACAATATG ATACAAG G AAAAACTGC AG CTTCTTTTGTCTAG ACAATTGTA
GTAGTGAAATGCTTTACTACTACATTTCTAGTTCTCATCATTCTTCCCTGTATCCTT CCTC
CTCTATCTTGCAGACTGTAGTCACTATGATTAATCATGCCATTGGCGCCTCGGGGGT TG
TAAGCCAACAATGTAAAGCTGTTGTTGAACAGTATGGACAAACAATAATGGATATGC TTT
TAGCAGAGGTGAGCAATTATTTGTTTTAGTTGATAGTTTTTTGTTGTTTTTACCAAT AGTT
TTCCGTGGTATCTGCAAAGAGGATGGTTTCATGCTACTAGTTGCCTTCCCAATATTC TGA
TGCATTGGCGTCTTAACAGGCACATCCAAAGAAGATCTGCTCACAGGTTGGGTTATG CA
CCTTTGATGGAACTCGTGGCGTTAGGTTAGGCTTCAGACCCTTTCTTTCCTCGCCTT GG
CCAATCATTTGATATGGTAAATCCTATTATAAAATGTGTGCTGAGTGGATTTATGTC CTC
CACGTGTAGTATGGGCATTGAGAGTGTTGTGGATGAGAATGCTGGCAAATCTTCAGG A
CTGCATGATGCTATGTGCTCCGCTTGTGAAATGGCGGTTGTCTGGATGCAGAACCAA CT
TAGACAGAACCAGACCCAAGAACGCATCTTGAACTATGTGAATGAGGTAAATAGCAT CA
GTCACATGCTTTCTCTTCTCATCTTAAGTTAGATTACTGACCATCTTTAACAGCTTT GCG
AGCGACTACCAAGCCCAATGGGACAATCAGCTGTTGATTGTGGAAAGCTTTCTGGCA T
G CCTAGTGTTTCCTTCACAATTG GTGG CAGAAC ATTTG AC CTCTCTCCTG AG G AG GTAT
GTCTGATATCAATCTTGCGTAGTGTACATGGCGTCTTCTCATTTTGTAAATGGCTTT GAT
TTTTCTGAACAAAGTGATTGGTTGTAGAATCCTTTTGTCATGTTTCAGTTAGGCAGT TCA TTTCTTTGTGGTTTTCACTAGATTAGCTAGCAAGGTGTTACTCTGCTTTCAAGAGAAGTA
CACTTGTCTTTTAGAAAATTTCAACCATGACAGCTAAGTGTCGTTTGGATAATTAAT GAT
ATTGAAGCGTGTCGAGCTTTAATATCAGTTTCTTTGCTTGATAAGTTAACTTGTGAT CGG
ATAATTAATGTTATTGAAGTGTGTCGAGCTTTGATATCAGTTTCTTTGCTTGATAAG TTCA
TATGATTGTACTAAGCTTGCATGCTTTTCTTGTCACCAGTACATACTCAAGGTGGGC GA
GGGTCCTGCTGCACAATGTATTAGTGGCTTCATTGCCTTGGATGTTCCTCCACCCCG TG
GACCTCTCTGGTATGTTTTCTTTTCGTCTTAACGCACAAATGCGTGGATTCTGTTAT TAC
CAGCTCCCTTTTGATAATGTTGTTTGCTTATGGCTTTGGTGGTGCAGGATCTTGGGG GA
TGTTTTCATGGGTCGATATCACACCGTCTTTGATTCTGGAAAACTTAGAGTTGGATT TGC
AGAAGCAGCT
SEQ 48
ATGGTTGTTGCATTTGTGGGCATAGCCAAGTCTATCGGGCAACAATGCTTGAGGCGATC
AAAACCCTACTCTTACTCTTACTTCTCCAGCTATGTTCGTTCCTCAAATTCTAAGTA TGG
ACTCCAAAATTGGCAATTTCAGAGTCATAGAACTCTAATTTTACAATCGGCTTCTGA ATC
CGTCAAATTAGAAAGACTCTCCGATTCCGATTCCGGTAATTCCACTCACTTTGTCCT ATT
TTACG G CGTACTGTTACTATTTG G GG AATCAAACTTCTTTTAATTTTGG GTACAATTG CT
TTCTGGGGTTAATTAACAGGGATTTTGGAGGTTAAATTGGATAGGCCCGAAGCGAGA AA
TGCAATAGGGAAGGATATGTTAAGAGGATTACAGCAAGCGTTTGAAGCCGTGAGTAA TG
AACGTTCAGCAAATGTTTTGATGATCTGCAGTTCGGTTCCCAAAGTGTTTTGTGCTG GA
G CTGATTTG AAG GTATAACAGCTTCTCTTCATGTTGTGTTTTTAG G AAAAAATG AG G CAA
AAAAAAAACTTTTGAAATCTTGTGCAGAGTATAGGACACACTATTTGGTTAACAAAA AAT
GTGTAATCAAAGGGTTCGAAGTCAGATTAAATAACAAAAATTATGGGTAAATATTAA AAA
TTTTAAAACTTTTAAAAAATCATACTCGTTCAAATTATATTAAATTTTTTCATTTCT CCTCTT
TGCTTCTTCTTCTCCTTCTTTCTTCGTTCTCCTCCTTAATTTCTTATTTCTCTTCAA CTTTC
GTTGTTGTTGCTGCTGGTTCGTCCATGTCATCTTCTTCTTCTTCATTTTATTTTTCC ATCT
TCGTCTCATTTTCATCTTTCTTCTTTTTTTCATTTTTTCTTTTCTTTCTAGTGGTTT AAAATA
TACGAGAAAAAGAGAAAAATATGAAATTTTACAAAGTAAGTATTTTGCAAAATACCC TCG
AGATATTTTGAGACATACCCATAAATGAGTATTCTACTGAAACATCCCCGGGTTGTT GTT
TGAAACATCCTACGGGATATTTCTTCTACTTCTTTCTTTTTTTCTAGTGTTGTTCTC ATTG
ATCTTTTTTTCACAGATGTTTCAAACGTTCCCACAGATGTGTGCAGATGTTTGAAAC ATT
TCAATAAATGTTTGAAACATTTCTTTAGATATTTGAAACATCTTTCCTAAATGTTTG AGGA
TTAGGAGTGGCGGGGATAGAGAGAGGAGCGTTGCGAGATGGAGTAACAAGGGGAGGA
G GAG GAGTG GCG G GAG GAG GAG GAG GGTTTTTTTTTTTGTAAATAAG AAAACTTTG GG
GGTTTTAAAAATAGTGTCATTAACCCTAATCACAGAAGTGTCCCTTTTACCCCATTC TTA
ACACTTTTGTCTTAAAAAGTGATATAAGTTTTGAAGTGTCTTAAAAATTTAAATGCC CCGT
GTTTTTAGTTGTAATATGTTTTTTCTTGCAAGCAGTATCACAACTTGTAGATATTAC ATCT
TCCTTTTCGTTATGTATCCTCTTCTCTTCATTGTGCCAACCATTGTATTTGTGTTTT CAAG
CAAGAGACAGATCCAAGATATGAACTTTATGTGTTCGAGTTCTAAATTCTGCCTAGT CCA
TTTGATTTACGGGGTTTGAAATCTATATTTGTACAAGTTTTGGTGAGTTTTTTAACA CATA
TATGTTTCGTTGATTCTCTCTGTAATGTCTTGTTGTCTTCTAGTATATTTTTCTTTT GTATC
GCGTGACATGTTGACAAGAAAAACAATAAAATTTCACAAGCTAGGAGCATCTGCCAA AC
ATACAACTTTCAATGTTGAAAAGTTTCTCTGTGGTTTGATCTCTTCGCTTGTGCAGC CTC
TGTGG G ACTC AAACATCTTCACAG ATTCAATTTTCTTTTGTG GG G ACTG CATTTATG ATT
GGCATGGCATATGATTGAATAGCTGATGTTTTTGGTGTCAGATGTTGATTTACGCAC TTT
AGTAGTCTTTCTACTGTTTGTTATCTGTGCTGTATTTTGTACCTACTGATAGCTAAT TCAA
TTG ATTG G CATTTG CTACATAG G AACG AAAAACTATG ATTCTTTCTG AAGTCCAG G ATTT
TGTAAGCACTTTGAGATCAACTTTTTCCTTTTTGGAGGTACGTGATTTTTATTGATG TTTT
GTTTAATATATTAAAGATCATAGTGTCTTAAAGCTCAAGAAGAAGTTTTTTTTATCA ATTT ATGAAAAGCAGAAGTTATCAGTTTATAATCTTCTGAATTCTTCCTTCAAAATGATTGGTT C
ATGAGATCATACTATGTCTCGTTTCTTCTTCCTCACTTATTACTTTATCAACCATAA AGTG
GTCCAGTTGTACATATCCTCTTCTTTTCTTTCACTCTATTTGAGTAAATATTTTCTT GCTC
AGGGTCTTCATATTCCTACAATTGCTGCCATTGAAGGTATAGCATTGGGTGGGGGGC TT
GAAATGGCGATGTCTTGTGATATCCGTATATGTGGTACGTGCTTTCTTGCACTTCTG GG
TGTACCATATTTTCTCCTTCTTGCTCTCTAGTTTGATAATGTGTTAACAGGTGAAGA TGC
AGTGCTGGGCTTGCCAGAAACAGGACTTGCTGTAATACCAGGGTAGGTATGCCTTAA TT
ACGCATTATATGTTTGCTTATGCAAAATCCCAAAATTCTTTGAAGGATGTGTTAGCT ATG
TGTGGTTTATTTCTAAATTTATCAGATCAGTGGGACGCATTTCCACTATCTTTTTGT CACT
TTGTAATCTTTTACTATTCAAAGGTTTCCAACTTCAGAAGTTGCTATAAATACTCTG TATG
CATAGATGATCTTCTTAATGGTATCCTCTTATTTCATACTGGCATTGTGCAGAGCAG GAG
GAACACAACGGCTTCCTAGATTGGTTGGAAAATCAATTGCAAAAGATATAATATTTA CTG
GCCGAAAGATAAGTGGGAAAGATGCTGTATCAATAGGTACGTGTATGACTTGTCAGA GC
TCATTTGTCAAGAGACAGGACTCCTTTGTCTTTCCAAGTTCTCTCTTGTTAATATAA AGAT
AGCAGTGATGTCAGCACTTCATTACAAATTATGGGTTAACAGTGTCCTCCAAGGTTT AG
GCAGATAGAAAGAAATCATCTAATTTTTGCTTCTGCTGTAATTTTGGACCTTGATCT CCT
ATGGTTTTCTTTTCCAATTTCTTAGTGAATAATACATTGTATGCAGGGCTTGTCAAT TACT
GTGTTCCTGCTGGTGAGGCTCGCCTCAAGACACTTGAACTTGCTAGGGATATTAATC AG
AAGGTTAGACTTTAGTTATTGAGATAAAGAGGATGTGATGTATTTATCCAGTGTGCC ACC
CATATGACTTCCAATTGCAATTTAGTCACGAACAAAGAAGAAACATAAAAGAAGTCC AAC
TCTTCCTATAAACAAAATGATTTCAAACTGTACTGTACATAGATAATTGTAAAGATT CGTT
AGCAGTAACGTGTACTCTTTTGTACCCTTTTCACCTTTTATGAGTTATGCACCCTCT TTT
GTACCCTTTTCACCTTTTATGAGTTATGCACCCTCAAGGCCATGAAATATGCTTGTC ACT
GGATTTTCTTTTCTTTTGTGTGTGTTGAATGAAGTTGAGGCTCTTGTTTGAACTTTT TATT
GTCATCCATGGACCTTAATTTAATGGCATTTACTAATCCTATGCTTGTTTGTTTTCC ACTT
TGTGCACTGCACCTTCATTTTTTGTGACAAGCTTTGTTTCTTGCTTTTGGTCTTTTT CTGT
CTTGTTTTTTCTTAGGTGGAGGATCATTGACTATTGCATAGTTCCTTGCTTTGGTTT CTTT
GTTTTCCTTTTCCCCTTCTTTTTCGATTTTTAGCTATTTTATGGCAGGTTCACATAG AAAA
CAAGTGTTACTCTATTGTTTTTCTTTCCCTTTTTTCTCCATGCATCTTATAGAAACC GAAG
CTTAAGGTTTCTCCACGCAAGCTGCAACGTTCTGTTTTATAAGCTTCATATATTTCT GGT
TTTCATGTAGTATGAAATGATATTGAGTGGGATTATTAGGAAGCTGAGACAGATTGA AAA
GAACAAGTAAAAGCCACATTGGTGATTCCCTCATGCTTTCAACCTTAAAAGGTCATT TCA
ATGTCCAGGGTCCAGAAAAGGGACTCACTCTATTCATGTTCTATAAGAATGGAGTAA TC
CACTTGACAAGTTCTGGTGGTACTACTTCTTCATAAGGTTTTATTATACTAGTCAGG GCT
GTGTTGAAAGGATATGGTAGCATCAACTAAGTCCAATTGTTTCGTATTGTATAGAGC CTT
CCTTTTTCCTTTTCCTTTTCCCTTGTTGACTCTGCTTTTATACCTCCAAATGGAATG GTCG
TAAAGCTTTTGCTCTTTATTCAATAAGCTAAAACTTCTGATGAATAAATTTGGTTTA AGGA
TGCACGAGGAGTGAAAGTAAAATAAATATTGATGAAGGTTTTGCTAAAGATGCTCTT TTT
TATGCTCGGGTTTTGCATGTCAACTGACATATACCTTATCAATGTCGACTGACATAT ATT
CTCTGACAGGGTCCGGTGGCGTTAAGGATGGCAAAATGTGCTATTGACAAAGGAGTG G
AGCTAAATATGGAGTCAGCCTTAGCTTTAGAGTGGGATTGCTATGAACAACTGTTAG AC
ACAAAAGATCGGTTAGAAGGCCTTGCTGCATTTGCCGAGAGGAGAAAACCTAGGTAT AA
GGGTGAA
SEQ 49
CTAGC AG CAACCAG CTATAG GAACAAATGTGTCAG CTCG AAGTGACATTGGTTG G CAG CTAACATCCTCACAACTCTTGTGGTAAAGCATCTTTTGTACATAGTACATCAGATAGCAT TGTGAGGCCCTGACAACTTCTTCATCGACTTCAGTAATCCATGCATCGTCGCATTTGTA CCATTGGTTTCTCAAACGCAAATAAGTCACATAGTGACCTGATTCTAACATCCCTGAATG TGTGACCACAGCGAAAATTTCAAATTCCGTAGAAATATCTGATTCATCACCGTCGAATGA
AAAGATTCTGTTCCCGTATCTCTTTCGTACAATTGAAGATGATAAATATGGTTTCAT GTCT
AAAGAAAAAGGAAATTGCAGGTGGCGGTCAATCTTTCTGGACATTTTTCGGGTGGGA GA
ATGTTCAAAGCGTTTTATATGAAAAGATAGCACCAGCGGAAGCTTCTTGATGGACAT TT
GTTTCAATGCATCTTGCTTTTCCTGACAATTTTCACAATACAGTTTCTGATCAGATC CCAA
CTTTTCTG GTCGTGTG AAG AG GTC CAAG CAACCTACAAGAG ATTCATTTG GCTTACTCG
ACTTATTAGCAAAATCCTTTGGGCTGGAGTTGCAGCTATTCAAGTCAAGAGAAATGT CC
ATACAAGGATCATGAGTTGTTGAAGTGAATCCGCACGATGTACATGTGACATCAGAT CT
CAAGAGCCCATAGAAAGTCCTATGAGCAATACACTGGCAATCTCCATTATCTGTTTA AG
AAATGTTATTTTTATCAGAGAAAAAGAACTAGAGGAAGCCACAAATCTTCAGTTCTC AAG
GTTATATATGAATCATCCCTCTTCTGATCAATGCATTGACAGCAAATGGAAGTACAG GCC
AAAATTGATATCCCAGAAAGGAAATTGCGCATGTACATAGACAAGATCTGTACTATA TTT
ACTCACTGCGTAGCATAATGTCCTACATAAACAAGAAGAATACCAGCCATTCTCAGC AA
AAGATACCTTTGGTTGCCAAACTAGCTTTCCCCTCTTTATCATGGATCCTGTCCATA ACT
GAAATGAAGAACTCATGAGCATCCTGCTGCTCGTAGGTAGCAAGATTTTCTGAATGC TG
CCACCAACTGCAAAAATAAACAACATTTCAATTAAAAGAATCACTACATAAGTTTTC AACA
TGTCAACGTCCAATAAAAGTTAACATTTTCCACGTTTAAGCATCCAATCTAAATTAA ACAA
ATGATGATATCTTAGGTGGACATAGCAGCATAGAGAAAATTTCAACAGATTTATTTT ATC
ACGATAAGAGAATAGCAATCTTGCTTATTGTCTGATTTTAGCGGTAATGCACCAAAT CTT
GTTTATCCAAAAACTTCAAAGTGAAAACCTATGCAGTTAGCGCTAAAAATTGTAAAC ATT
ATTTACAATTTTGCAGGCCATGTTATAATCAAATATCCAGTATCAATACATTAAATG GTGA
G CACAG ATAAAAAG CAATTAAG AGATAATG ACAG GAGATAAATCCTTTTAACACTTAC AG
ACCTGTAAAGAAACCGAGCTGGACTATAAGGGGTCCGATCACCAGAAAAGACAGCTG A
GAAGATAAGGTCAATATCACAAGGCAGGCACAACCGATCCGACGACATCTTTCTGCA AA
TATCTCGGTTATGCCTATCGCTAAGGAAGTAATTTCTCAAAGGGGGTGCATGAAGTA AC
ACTTGCAACACAGAGTTCATGAAACAAGTATTCCCCAAATTGTTCAAACCCCTTAAT ACT
AAAGGAAAACATGATTTCGACTTCTGATCCCTCCTTAAAAACAACGTCTTCATATTC TTT
GAATCTAAATCCATCCCAAAACTCAACCTTCTCCTCTTGCTCAACCTCAACTCACTC TCC
ACAACCCCAATTTCTGTTCTAGGAAACCCCATTATGTGTTTACACATCACAACCTTA TCA
AAATCAGGATCATACACCTGATCACAACACACTGAGCAATAAAGCTCAGCCCTTTCC AT
GTCTACTGAAATCTCATGCCCAGCTTTACACTGACTATGCAAAAGGGCATGGTTTGA TT
CAGGTGACAAACAACATAACACTGATGAACAGATCAAACACATGTAAAATCTACCCT CAT
GTCCACTACAAATACTACATCTAGGTAGCTCTGATTTGGATATTTCTAATGTGGTCC TAC
CATATGGGGTTGTCTTAAAACATTCTTGGATCAAACTATACCCACTCATACCATTTT TCA
CCTTGTAATCTGCAAGATGCTTACAGGGCTTTGGATTTATATATAAAGAGTTACTTG AGC
ACAT
SEQ 50
ATGAAAGAACTTCATTCTCTAAGAGAGATCGAAGGGCCTGACCCGAATTATAAAGATAT
ATGCTTTTCTGGTGCTGGAAGGTAATAAATTAACTATAGTAATGTTAGATCATTAAC TTTT
TCTTTCCTTTATTTTTGGTGTTGTTTCTTGTATTGAATGTCTTGTATATTGCAGTGA CATC
TCAGAGCTCTCAAAATCATTTCCTCCTATCGACATGGTATTTAGCAATGGAAAGAAA CTA
TCTCTCACTCCTGAAAACTACTTATTCAGGGTAGGCATCTTAATCCACATGGTTTTT ATC
TTACTACCTTGCCTTTAGAATCTGTATCTCCTTTTGGCTTCATCTCTCCTAGTGGCT ACA
TTTTTTGTCTTTGTTTTGATAAGTGGCTCCATTTTCTCTCTGTGTTTCTATATTGAC TAATT
CTGCCCTTTTGCTTACTGTAACTGATTGCTATAAAGCACTCAAAGGTGCGTGGGGCT TA
CTGCTTGGGAATTTTTCAGAATGGGAAGGATCCAACTACTCTTCTTGGAGGTATTTG TC
ATATATATCTTTTAGAATCTTGGGAAAGTTCATCTGCCAAATTCTTCAGTTGTATAA GCTG
TAACATGCGTGCCTTTGCTTTTAATTGCAACAGGTATTGTTGTCCGCAACACTCTTG TAA CCTACGATCGTGAAAATGAAAGGATTGGTTTTTGGAAAACCAATTGTTCTGAGTTATGG
GACCGACTAAATTTATCTCCTTCACCTCCACCTCCACCATTGCCCTCAGGCTTGGAC AA
CACAAACTCCAGTGCAAATTTGACTCCAGCACTGGCACCTAGTTTACCTCTGGAGCA TG
CACCTGGTACGAAGAAACTGTTCTCCTATCTTTTTGTCACCATTAGTATGCCTTTCA GTC
ATGCTTTTATCCAGTTTTGTAGTGGAACTGGTTTTATTTCAATTATTCTACCGGAAG GGG
GGAGCCTTAGAGCAACGGTAATGTTGTCTCCGTCTGGCCTATATGTCATGGGTTCGA G
AAGTGGAAGCAGCCACTATTGCTTGCATTAGGGTAGGCTGTCTACATCATACTCACA CC
CCTTAGGGTACGGCCCTTCCCGGAACCACATCAATCCGAGATGCTTTGTGCATCGGG C
TGTCCATTATAATTCCGCCAGGCTGTTTTGCATCATTTCCCCCTAATATTTTTAATC CATT
TTGGTTTCTGATTTGCTATGCTGGTTTTTTGCTATATCGCCTAAGATTAGGTTAGCT TTG
ATGATTTCACATCCTTTCTTTGATTAAGGTCCATGAATGTTCCTGTGTCTCCAAATG TCA
GCTTTCAAAATGACATTTGAGCTTGCGTTTTGTATTGTTTCATCAAGTTTTGTATTC ATCT
ATCTCCTTAGCATTCCAGAGTTCCTGAGAAGCACTCGCTAGTAAAGACGTATTCTGA TG
TCATGACATTTTTAACCTTGTTGGAGTTTGGACCCAAACAAACTTTTTTTACAGAAG GAA
ACTATAATTTTAAGGAGTACAACAGTTGCTGTATATAGAACATGGTGAGTAACTCCA CTC
TTGAGATGCCTTCTCTTCACTGAAGTCAATTTCTAAAAACCTCCGTGCTTGACACAG ATT
TGTTGGTATAGATTTGTGCTCCAGAGATGCAGATGGGTCCAGCGAATTTTTCACTAG AA
TTTTTCTTTTTTTTCTCACCACCTGTCCAGCATAGTGCTGTCAAAAGTGACAAGCTT AGA
AAAAAACCATGTGCTTGGTGGGGCTTTAACTGCAACATGCTATAAAAACGTTCACTA TAA
ATGTAATGTAAATAAATAACCATAAAC ACAAAATAG CAATTGTTG GAAAAATTG CAATTTA
GTGAAATACACGAGGTGTCAATCAAGTTCAGATCATATTGTAAGTCTTGATTCAAGG TG
CATTTTTAAATTAGTAATTTGGATCAATGATTAGTTTTCTTAGACTATAACTTTCAA TTTTC
ATACTCGTAAGAACTGATATACTCATATATAATATAACGTTTTTCTTAATAACTAAT AAAT
GCTTTCCTAACTATATTTATTTTTGTGCTATCCTATTAACAACAGAGCCTGTGGATG TTAG
GCACCCACTTTAAGGCCTTTTTCCTCGCACTGAAGCCCTACTTTTAAGGTTTACTGT CAC
GACCCAAAATTCCACCTTAAGGATCGTGATTTCACCTAGTCTCTAAAACTAGGTAAG TC
GATCACTTACAACAGTTAAACCATTAAAACATGATATTATGAAGCGGAGTTTAATAT AAAT
GCGAAAATAAAGGTGATACAAGCCAACACGGCGTTAATCACAACAAATCCCCAAGAC TA
GGTAATACAGAGTCACGAACTCTAACTGAATACATAGAAATATTTCAAAACAAAGAT ACA
ATACTGTTCTG GC AG ATAATTGACAGTATAAAGATAAG GAAAG ACTACAAG G G ACTTCG
ACGATCAAGCAGCTCTACCTTGAATCCTCGTGATCAAAAAGCTAACTCTGCCTAGGT CC
TATGCCTCCAACACCTTGATTTGCACAAAATGTGCAGAAGTGTAGTTTGAATACACC ATG
GTTGGTACCCAGTAAGTATCAAGGCTAACCTCGATGGAATATTGGCGAGGTTCAAGT AA
AGACACTCACTAGTCAAATAACCTGTGAAAAATATCAAAAATGGGCAAATGGAATAA TAA
CATAAAGTCATAACTGTAATCTCTTCAAATTAAACGATACCTATTTAGAATAATTAA AGGT
CCCGTTCTGACAATAAGCCATCAAATAGAATCACGCACACCCGGCACCTCGTACCCA CA
TTAACAATCACCCTCGCACGGCAAAGGCCTCGTGCCACAACATAAGATATACCTCGC AC
GACGAAGAGCTTGTGCCACAATATAAGTCACAACCGCATGGATAACTCATATGCCAA TA
TCACAATCCGCCTGGCGTGGTCACATGCTCAATATCACAATTCGCCCGGCGTGGTCA C
ATGCTCAATATCACAATCCACCCGGTGTGGTCACATGCTCAATATCCCAATCCGGCT GG
CATGGTCACCGGCTACCTGTCCAAATGTACATGATCAATGGACATCAAGTTTCATAC TC
CTGGACTGATATTAATGACATGTTATGGTATATGCATGTGCAAGTGTATTATCACAG CTT
AAATCATCTAAGTAATATCAGAGACACCAAGTGGCACATTAGGAACAACACAACAAA TC
ACGTAATATGTATGACACACACAAGGAAGTCAAAAGCAACAACCAGAATACTCCTCT TT
CATCAACAACATGCCCCCAGGCCATCACATAACATCCCCTTATTGCCACCCTTATGT CA
CCACGTTGACAATATCATAATAGCCACCCGTATCGCTCCGCCTAGGCAGTATATCAA TA
GCCACCCGTGTCACTGCGCGCAGACAATATATCAATAGCCATCCGTGTCACTCCGCA C
AATCAACAACAGTGAATTGTCATCCTTGTGCTCCGGATAACAACAATCGATCCACAC AT
GTCCACATATGCCACAATATCACAGGATAGTAGTATTAGAGATTTATCACGATACAA GCT
CACCACTCATCAACAAAGTGCACAAGGACATATCATTAATATAGAATTGCTGAGGGG TA
TTCAACATTTAAGCATGAAAGCTACTCAAATTAACAAGAGTCTCACAAGCGCCCAAC TTG GCCAAATAAGGAATTAAGATCCTAAAACATGATTTGTACATGGAATATAAATAACTTAAT
GTCAAAAATAACTTGATGTCATAAATAAAAGCCATAGGAAACGATTCTGAATAATAA AGC
TTCTATCTTGAACAAGAATAAAAAGTAATCCCAAAAAGTCAACCCCGGGCCCACACC GT
GGAATCCGACAAAACTCACAAATTCCGAACACCCGTTCAAATACGAGTCCAACCATA CC
AAAATCATCCAATTCCGGCCTCAAATCGGCCTTCAAATCATCAATTTATGTTTTAAA AAA
GTTTTTACTATGATCTCCAATTTCTCCCATTCAAATCATCAATCAAACACTAAAATT GAGA
TTGGAATCATGAGAATAAACAAATCCGAGTAAAAAATACTTACCCCAATCCAAATCG TGG
AAATTCCCCCAAAATCGCCCAAATCCGAGCTCTATAACTCAAAATGTGATAAAATAA CCA
AAACCTTTGAAATAGAGTACTTATAGATCTGCTCCAGGTAAACCCTTCTCAATTGCA GGA
CCAGCTTCGCAATCGCAAAGCACAAACTTAACTGACCACAGAAATACCCTTCGCGTT CG
CGGTACATACCTCGCGAACGCGATGCATGGCTGAGCCAGACCTACGCGAACGCGGCG
TAGACCACGTGACCGCGAAGACAATACCACCAGCTCCCAGTTCTTCATCGCGAACGC G
TCATTGCCATCGCGAACGCATTGACCAAGCCCCACAAAGCTACGGGAACGCGACCCT C
CAGTTGTGAATGCGAAGAGGAAAAACACTCAGCTCCAATCATACACTGCGCGATCGC G
GTTAGCCCCTTGCGATCGCTAAGAACGTCAGCAACAACAGAAAACCAGCAACACAAC AT
GAAGGAAAATGGTCCGAAATCACCCCGAAACTCACCCGAGCCCCTCGGGGCCCCGTT
CGAACATACCAACAAGTCCCAAAACATAGACAAACCTACTCGAGGTCCCAAATGACA CC
AAACAACATCAAAACTACGAATCACACCGCAAATTCAAGCCTAATGAACTAATGAAC TTT
CAATTTCCAAAACTCATGCCGAACCATACCAAATCAACTCAGAATGATCTCAAATTT TGC
ATGCAAGTCCCAGATGACATAAACGGACCTATACCAACTCTCAGAACCGCAATCCGA AC
CCGATATCAACAAAGTCAACTCTCGGTCAAATCTATCAACCTTCCAAACCTTCAACT ATC
CAACTTTTGCCGGTTCAAGCCAAAACAACCTAGGAGACTCCAAATCCACATCCGGAC AC
ACGCTAAATCCAAAATCACCATCCAGACCTAACAGAACCATCCAAACTCTGATCCGA GA
TCAAATACGCAAAAGTCAAACTTGGTCAACTCTTCCAATTTAAAGCTTCTAAAATGA GAA
TTATTCTTCCAAATCAATCCCGAAATGCTCGAAAACCGAAACCGACCATACACGCAA GT
TGTAATACATCATATGAAGCTACTCACGACCTCGAACCACCGAACAGAAATGCAAAT GA
TCAAAACGACCGATCGGGTCGTTACATTTATGTATGCTTCAAATGAGCATTCAGTGACA
CTGTTCAGCAAAAGGAGAAACTCTACTAGCCACTTGTAGCCACCTCCAGGGACCCTC TC
TGTCTCGGCCATGGATTACTTTGAGGAGTAATAGGGCTTCTCCAAGCGGAAACATTC CA
CGCATGCTGTGATCCTCCATGTTTTCTCTGCTAATCTTTGCTACTTTTTCTGGCGGT CCA
ACTGGTTGATCAATTCTACTAACACCATCCGAATGAGCCCTCAGGAATTCATCTCCC TA
CTTCTTTGCCAGATACAAGGAAGGCGTTGTCCTCATAGGGTTGGCCTTGGCATAACT CT
CTGATAATTTGACACGTTGCTTTAATTTGGTAACAATTTACAATTTTGGGGGTGCTGCAC
TTGATTCCATTGAGTTACACCATTTCTCATATTTAGGAATGGTCCTGTGCAAGATTG AAA
TGTACTGAACTCAGTTTTTCCCTGTGCAGATATTTATGATTGTTATTATTATTTCAT CTTT
GACCTGAATTGGCAGGGAAAATCAAAATTGGACTCGTATCATTTGATATGTCACTGAGT
GTTGATTACTCAGCATTGAAGCCTCGTGTTCCAGAGCTTGCCCATTTTATTGCGCAA GA
GTTGGAGGTTAACGTCTCACAGGTAGTTTTTGCATGACCCAAAGTTGTGTCAGTCTG AT
GTAATCTAAAACTGTATATCCCATTTTCTTTAAGTTACTTAACTGTATTTTAATTTT GTTCA
ATATGATATGTCACTTATTGGAAGATACCTTGCAGGTTCACTTAATGAACTTTTCGA CAG
AAGGAAATGATTCCCTCATTAGATGGGCCATCTTTCCTGCAGGATCTGCAAACTACA TG
CCAAATGCCACTGCAACAGTAACTCTAAACATCTAGAATATGTGAGGACTATTTCTT GAT
TGAAGAACCCTTTATTCATCATTTACCTATTTGCAGGAAATAATAAACCGGTTGGCT GAG
AATCGTTTTCATCTTCCTGATACATTTGGAAGTTATAAATTAGTCAAATGGGACATT GAA
CCCCCACCAAAGAGGTATAAAAGCTATCTCCATTCTTTGCATGTTCATAAAATATTG AGT
TCTGCTGTACAAACTTTTAGCATCATAGCATTACTTATAAAATTATTCTGAATTGTC AAAA
CAAATGTGCCTTTTCTTTTCAAAATGCAAAATAAATCTCCGCATTGCATTTCAGATGGGA
AAAACATGACACGCATCTTTTCATCTTGCCTTAAACACATGTTTGTAAGTTACATTC TAAA
TTAGGAAACGTGAATGAGTCTACATTGCATCGCACCAGTTCGACTGCATATTCCAAG GA
TAATGATGAATAGGTGATGACTTTCGTCTCCATTTTTCATTGTTTCAATTTTTCTCA AAGT
TTCTTACTTG GATTGGTG G ATAAAGTGG CAAAGC CAG AATTTTTTTATAAG G GATTCG AA AATACTAGAATGTCATAATTGAGATCTGAACTTGTGACTTGAAAGCAACTTTTGAATCCT
CTTTG CTACTAAACTAAAAAATTTCCCCTATG G CAAG GAG ATTCAATAG CTTATATATAA
CCAAAAAACTTCATTTTTACCCTATTCGCATACTATAATTTGAAATGTTTTTGGTCA AAGT
TTAATTTG CTG CATCTCAAAATCTTAATAG CAAAATATTAC CTTAATTAACTCTAATGTAA
AGAGATTGGATAACACACCACAACAATATTTTTGGTAGGTGAATATTACTTTAATTT TTTT
GACAATTAATTGAGATAGGAGTTCTTGATATTTTTTTTTTGGTTTTGGTAACTATCA AGTT
GTTGGTTTGATATGGTTTCATTGCAACGATTTAGGATACGATGGCAGCAAAATTACC TTG
TTGTAGTGTTTGCGCTACTAGTTGTCCTGATAATTGGATTATCAGCTTCTCTGGGAT GGT
TAATTTGGAGACGAAGGCAAGAAATCCCATATAATCCTGTTGGAAGCGCTGAAACAC AT
GAAAAAGAACTCCAGCCGCTAAAT
SEQ 51
ATGGTCACAGGTCTGAACTTCCGCCATAACTTTCTCGTGTCAGCTTTACTATACTCTAAA
TTTGAACTATAATACCACATTGATGTGAAAATTCACACTTAGGTATCGATTTTTTAA CACA
GAGATTTATTTTGTGTTCATGCTTTGGTTTCAAGTATTGGAGAACCTCGTAATCGTT CTC
TATAAGCTTCTGGTTTAACAGATCCTAATTTTTCTTAGAAGCTCGAATTATTTTGTA TTGG
AATGAAATGAACCTGAATATTGTGGACGATACAGAGGAATTATTGTGGTATAGTTGA TTG
ATTGATTGATAGTCTTAAGTAAGAAAAAGACCTATTGGAGATTATGGTGAAGCTTAT ACA
G GAG CAG CTG G ACTTG GTTTTCACAATATCTTTTTTGTTAAGGTTAGAATAAACCTG CTA
AAATTTTTTACTTATCAAAATAAATAAATAAACTTGCTAGAATTTTTTTCAAGTTGG TGATT
GTTTAAGTTTTTTCGATTGTTTTTTCCTTTGGTAAAAACGTTTTTGGCAAGAACTAT ATTTT
GAAGTTGTGGTTTGAGAGTGTTTGTCAAATAATCTTTTCAAACAAACTCTCTTTTCA AATA
TCCGAACATCTTCAACTTCCACGAAATAGGTGACAACTGGATTAAATTTGGGGGGGG G
GGGGAGTGGTTGATGGTGTATTAAGTTCAAACATCATTTATCTTTTTCTCTAGGAGC AGA
TTTTTAAATCATTATAGATACATTGCTTGATGTATGTTTGAGAAATACCATTGGTGT TTCA
TTTAGGCATCATCACTGTAATAGTTTGGTTAATGTTTTGTTAATTCATCATGGTGGT TCAT
TCAGACAGCATCATTCGGCTATGATGTTGATGTTGATTTGGTTACACAAGCAACTCA TTG
GTGGAAACAGTTTTCTAGGATACCCCTTTTATCATTTTCATTTGATTGTACCTCCTG TTTA
TTTTTG CACTTG G AC AATTACG G GCTACAATTCTCTCCTTGC AAATCTG GTGATTG GTTG
CAGTAAGTGTAAAGTGGCAAAAAGAAGTCTATCCTGCTGTGGAAATTGACACTAGCCAG
CCTCCATATGTTTTCAAAGCCCAGCTGTATGATCTAACAGGGGTACCACCTGAAAGG CA
AAAGATAATGGTCAAAGGTGGTTTGCTTAAGGTATAAAATTTCGTTTCACTTAGCTT GTT
ATGCCATTTTTCACTTTGCAAATAAAGCACAAAACTCATTTGTGTTTTGAGGAACGC TGA
AATTCTCAGCATCTGATGCTTTGCTTGTTAATTTTGTTGTTAACTCTTCGGTTATTT CTAT
GGTTGTTTAGGACGATGCCGACTGGTCGAAAGTAGGAGTAAAAGAGGTACACGGCTA C
TCATTGAATTACTCTCTATTTTTATGCAATGAAGTGCCAATTATCTAGAAGCATCTG TTAT
TTATTATTTCCAGGGTCAAAGGCTGATGATGATGGGAACTGCAGATGAGATTGTGAA GG
CCCCCGAGAAGGGTCCTGTTTTTGCTGAAGATTTACCTGAAGAAGAGCAAGTGGTTA AT
GTAGTAAGTTTTTTGACACTGATGTTGTTGCATCAAATCGAATGATCCGGAGATGTG TGA
TTCCTTATGTTTAACTGCTTACATAGTTAGTCTTGTCTCATATGCTGTACTTATACC AGCA
CTG G ATCC CTAGTAGATTTATTG GTATAACTTTACCG CAATTG CTTTGTTCATTTTTTTTC
AAAAGCAGTTGCCTTTTCCAACTTCTACATGCAAATAAGCTTTAATATATAATTCTC TATT
CTTTTTCCGCTGGCACAAGTGATTTTGTGGATGCCAAGCGCTTGTCGAATGCGTTTC TT
GTTCCGCTGGCACAAGTGATTTTGTGGATTCCAAGCGCTTGTTGAATGCGTAATTTC TT
CATTTACACATTATGAATCGGCCCTTCCCAAGACCCCGCACATAGCAGGAGCTTAGTGC
ACTGGGCTGCCTTTTTACACATTGTGAATCAGATTACTATGTTGTTTTAGAGTCCTG TCT
AAAAG AACTG CTAACTTTTATAATG G CAAG GCTTAGTTTTGTACTTTTAATCAGTAAATG
GGTGATGAGAATTTTTATAATTTTGTTTCCTCCAGGGTCATTCTGCTGGATTATTTA ATCT
CG GAAATACATG CTACATG AACTC CACAGTACAGTG CTTG CATT CAG TTCCAG AACTG A AGTCTGCTCTAACAGAGTGAGCATTTGCTTCTTTCATCCTTTCCTTCATTTTTGGGAGTC
TTTTG GTTTAG GCTTTTTTTTG GTCCTTTTG CTTTAG CCTTG ATTTC CCAAAACTTG ATC A
AATTCAATATGGTTGCTTTTAAGTCTAGTTCTGAAAATATTTAGGTCTATTTTGATT GCGT
TGCACTTTTTTGGTTAGGCAATTCGATCTATTTGCACCTAATCCGTAATTCCTGTTT TTGC
TAAAATTGATAATTCTGATTTTACTGTTTATATTTGTCAAATCTATTAATCATAATT TAACT
TATATAATGTGTCGCGTTGTATACACCTAGATAGTATGTATTTACGGAGACAAAGCG GA
GAAAACAGTAATTAGTAGAGGAGACATAAATTATCCTGTTTTAATTCCTATATATCC TCC
CTTATATAAATATGGACTCGTTTCTCGGCATGTTCTCCTTTGGATGAAATCAATCCA AAA
TGTAATCCACTTTGAATCAATTTGGACTCCGAAACTGTGGATCTTTCCCGAACATTA TCA
GAAAAAAGATCAAAATGGCTCCTGTTAAATACCAAGTGTAGGAGTTCCAAAAACAAC TC
CGTTAGGTACATTTCTTTTTGTGTTCCTGAGATTCTGAGTTTATTTATTCTTCCTGT TAGG
TATAACCAGCTTGGTAGAAGCAATGATTTGGATCACTCATCTCATCTCTTGACAGTT GCA
ACAAGAGATCTGTTTAATGACCTGGATAAAAATGTCAAACCAGTGGCACCAATGCAA TT
CTGGACGGTACTTTATTTGTCTTTATTCTACTCCTAATATTTTTGGTTACGACTTAG TATT
CCTGACTTTGTATTCTTAGAAAATGTGTTTGGATTTCGAACAAAGTTACCATACCTT TGA
AGGAGAATACGTATGCTGAGTAGGAGATAGTGTTTGCCAATAATTTCCTATTGGCAG AC
TTCAAAATATACTCGTTTAGCGTTGAACACTGAACTCGATATATTTTGTCATGACTT TTGT
GTGCAAATGGATTTGCTCTTAGAGAGCAAGGATGAAGTACTTTGTATAGGATGCGAA TA
GAATGACTTAAGCTTGGGCCTGTTGTCTACATGGTCAAAACTTTGTGCATTATCATT CTT
GCACAAGGTTACTTGGATTTATATGAGAACTATAAATGTAGCCTATGTTGATATGTT TGT
CTTTTTAGTGTTTTCGTCACATGAGCTACTCGGGCATCAACATTTGATTAGGTTTAT GTT
CACAGGTTTTGCGGAAGAAATATCCTCAATTTGGCCAGCAGAGCAATGGAGCTTTCA TG
CAACAGGTTCCAAGCTTACCTAAGCTACCACAATGCTTCCTTGTTATTAAAAAAAAA AAA
GTGTACCACTATTGCAATTGCTATATAGAGGTCCTACTGACATGTCCTGGATAATAA CAG
GATGCTGAAGAATGTTGGACGCAACTACTTTACACCCTTTCTCAGTCTCTTAAATCA CCG
AACTCTAGGTACTACATCTCCTCTCGGGATATTTCTTGCAGATGAAAGTCCCTTTTC TAA
ATAATTTCCATGTTTTGTTTCGCTAGTTGTTTTCTTTGTTTCAGTTTGGACATATGG TCCC
TATTTTGTAAAAATGTGAAGGAAAACTCTCCTATATATACATTGTGCTTCTTTATGT CATG
ATTGTGACTGCTCTTTATCTCGGTACTTGCAGTGGAAGTCCGGATATTGTGAAGGCT CT
CTTCG GTATTG AGTTTG ACAACAG GTATTTCTG CAGTCAAATGTTGTTTACCTTC CAGTT
ATTCTGTTACCTTATCCCCTTTGCATAGAGTTGTTCTGCACCTAAATATTATAAGAG GCA
TGTGAACTTACTGCTGTATATGTATTGAGATAGGAAGGAATGCAGCTAGTGGTCCTA GG
ATGTAGGATGTTCCCTGTTCTGACTTTGAGTATCTTCTGGGCAACCTGATGAGAATC AA
CATCCTCAACTTTTACTCTGTCATATTGTGAATCATGTAGTTGACAATAAGAGATGA ATTA
CTGAAGTTGTTTTGAAAGTTGAAGCTAAAAATCATGTTTATGTTGACTTCTTTTAGT TTCT
CCTACTGTTAGTTAAGTGTACTATAGTGCTACTAGTGTGTATTTGTATTACTGCTAA TGA
AAGGTCTGGCTGGTTTATGCCGTTCTAGGGTATTTTCAACAGGCTGTGCTTTTCTGC TG
TAGCCTAGACCTCTGGGCTAATATTTCTTGTCTAGGACCTGACCTACTGCAATGAGG TT
GGGAAGATCCCAATGCCCATCCCAGAGTTCTCATGTGCTCAAATTCATCTACTGATA TA
CAAAATTTAATTTTCGATAGTGTGGGAAGCTGTTTATCATTCATGTCTGACTTGATC TATA
CTGTTCTGACTGGATTATGGTGGTTGTGCTAGAGTCCACTCTGTGACTATGTTCCAT AT
CTGATATTAACTGCTAGATACTGAAGAAAAATGACTTAACTCTGCCCCTTATTCTCA TGG
TACTGATATGGACCAGGATTCATTGTGCTGAAAGTGGTGAAGAAAGCACAGAAACAG AA
ACTGTATATTC CCTTAAATG CCACATTTCACAG GAAGTG AACCATTTGCATG AG G GTTTG
AAACGTGTAAGTTCGGTTCTTTTCCTCCTTGTATGTCCCAACTTCTAACTTTAGTCT TGTT
TCCTCCCAAATGTTTCATATTACTGCTAAGTTCTGTCTCAATTTTTTCTGTTGTGCC AATC
CAGTAATCATCCAATTTGATTAAGAGGACAGTCCCAAAGTGAAAAATGACGTATCTA ATT
CATAGAAATTCCTTTGGTAGCTGTAACCTTTAAGGATAACTAACAGTTAGTCCTGAA ATG
GTTGGTTGGATGGAGAAACTATTATATAAGATGCCTCACGGGCGGCACATTGGGGGG G
GGGGGGGCTTTTNGGGGGGGGTCTTTTGCTTTAGAATTTTTTCATGCCATGATTGGA CC
AAATGTG G GG CCTG CATGTTG AG GTTCATG CAATAGTTTCACATCAG AG GTAG G CTGA GCCATTGGAGCCGACTCATAATGCTTTTGGTGGGAAAAGATGCTTTGGAGATTTTGTTT
AACTTGGTGCAAAATCGTTGATGATGTTTATACTGGAAATCTGGTATTTATCCCCTC AAA
ATAATTTAAATG CACTCATATG G CCATTG CTTTTTCTG AC AG GGTCTG AAATCAG AACTG
GAGAAGGCGTCTCCGTCACTTGGACGGAGTGCAGTTTATGTGAAAGACTCCCGAATC A
ATGGCTTGCCAAGGTATTAACTGGCTCGATTAAATTCCATGGCGATGTAGCGACATA TG
TATGATCCGTAGCTTCTGCATATAGACTATCTTAATCCACGCCTTTCACATAACAAA AAT
GCCTTTTGATGTGTTGAAGTAGTTCACCTCATTTTTGGCATCACTTCTTTCTCATTC TCCT
TTCTTTTTTTTTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN N
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNN N N G GG G CTTTTAATTCTCACACTTTTCTGG ATAGTTT
CTTTACTATTTGGTGCTGATTATTGACTGGTCTATGTCAAAATCTTCTTCTATTTTT ACTA
TTTCTTTTTCATTTGATGAATAGATACTTGACCATTCAGTTCGTCCGGTTTTTCTGG AAGA
GGGAATCAAATCAAAAGGCAAAGATTTTGCGGGTATGTTGACTTACTCATCTTCCTT TTC
ACATTGATCAAATAGTTGGTCCTCCTTGAAAATGTGTCGGAAGAGGGAAAAAGGATC AG
GTTACCATGTCCTGAACCGAGAAAGGGGGATGGAGAAAGGGAGTTCACTGTTTTACT TT
GTTTTGTGAGGATGAGCGGTTCTGTCATGTTTAGGTGCTCTAAAGTCTCGCCTATTC TC
TCGATTGTCAAATTCTGAACTGTATATTTTAACTTTAGGCTGGTTTGTGATGGTGTC GGT
GTAGGAGTCTTGGTCTATAATCCCTTCCCAATAAAAAATTAACTGAAACTTCTCTTA AGT
TTCTAATTG ATCTTGG AGTAG GTGTAACTTG G G CACAATTACAAAG AG G AGTTCTTATGC
ACCAATGACTAGATTTCAGCTACTAGAAGTAAGAAGTAAGAGAAGGATGTGATATTA ATA
CTTTGCTATTCTAGTGAGATAGTATGTACATAATATTTTTTTGAAAGAGAGTTCTGG TGA
GAAGCTGATATGTTTTTTTTTCTTTTTTCCTTTGTGTTGATACAAGATCTTCTTAAC AATCA
AAAATATCTGAAAAGCTTTTTCCTGGATTCGGCCATTCGGGATAATAACCCACCCAT TGC
CATAAGTTCTGGGATCTATAAAGTAGTTGTACTGGGTGTTTCAGATATCTTTGTGTT TGT
GAGAATGCACAGGAGAAATCTATTCTATTGATTAAAGTGTTGTACAACCCTATTTAT ATA
CAGTAATTACATAATAATAGGTATCTACTTCCCGATGTGGGACACTATAATACGAGA ACC
AGTAAGAGACTTAGTGAAAATATCTGCTATGCTAGCTAATCATTCTACTTTACAAAC TTT
GTAACAATATCTCCTGAGAGTATCTTTTCTCTGCCAAAGTGACAGTTGATCTCAATG TGT
TTAGTCCTCTCATGGAACACCAGATATGACGCAATATGAAGAGCAACTTGATTATCA CA
CGCCAGTTCCATCGTGCTGATTTCTCCGAACTTCAACTCCTTAAGCAACCGCTTGAT CC
AAACTAGCTCACACGTTGCCACAGCCATGGCCCGATATTCGGCTTCGGCGCTAGATC A
AGCAACTACATTTTGTTTCTTGCTCTTCCAAGACACCAAATTACTTCCTACTAGAAC GCA
ATATCCAGACGTAGAACGTCTATCAGAAGGTGATCCTGCCCAATCAGCATCTGTGTA CC
CAACAATCTGCTCGTGGCCTTGATCCTCGAATAGTAACCCTTTGCCTAGAGCTGACT TT
ATATACCGAAGAATGCGAACAGCTGCAATCCATAACTGACTTACAACACTCACCGGA AA
AGAAATGTCAGGTCTACTCACAGTGAGGTAATTCAATTTGCCAACCAACCTCCTATA TCT
TGTAGGATCTCTAAGAGGCTCCCCCTGTCCAGGCAGAAGCTTAGCATTCGGATCCAT A
GGAGTGTCAACCGGTCTACAGCCCATCATTCCCGTCTCCTCAAGAATGTCTTAAGGC AT
ACTTTCGTTGTGAAATAACAATACCTGAGCTAGACTGAGCGACAGTCTCGAAGTGCT GA
AAGAGTTGCTGCTTCAGATTAGTAATACCATCCTGATCATTGCCAGTAATAACAATA TCA
TCAACATAAACCACCAGATAATACACAGATTAGGAGCAGAATGCCGATAAAACACAAAG
TGATCAGGTTCACTACGAGTCATGCCGAACTCCTGAATAATTGTGCTGAACTTACCA AA
CCAAGTTC GAG GTG ATTGTTTCAAAC CATATAGTG ACCTG CG CAATCG G CATACAAG AT
CACTAGACTCCCCCTAGCAATAAAACCAGGTGGTTGCTCCATATAAACTTCTTCCTC AA
GATCACCATGGAGGAAAGCATTCTTAATGTCTAACTGATAAAGAGGCCAATGACGTA CA
ACAACCATGGACAAGAAGAGACGAGCAGACTTTAGCCGCGGGAGAGAAAGTGTCACT A
TAATCAAGCCCAAAAATCTGAGTGTATCCTTTTGCAACAAGACGAGCCTTAAGACGA TC
AACTTGGCCATCCGGGCCGACTTTGACTGCATAAACTCAACGACAACCAACAATAGA CT
TACCTGAAGGAAGAGGAACAAGCTCCCAAGTGCCACTCGCATGTAAAGCAGACATCT C
CTCAATCATAGCATGTCGCCATCCTGGATGAGATAGTGCCTCACCTGTAGACTTGGG AT AGAGACAGTTGACAAAGATGATATAAAAGCATAATGAGGTGATGACAGACGATGATAAC
TTAAACCGACATAATGAGGATTAGGATTAAGAGTGGATCGCGCACCTTTCCAAAGTG CA
ATCGGTTTACTAGGAAGAGGCAAGTCCGCAGTAGGAGCAGGATTAGGTGCAAAACGT G
AATCAGCTGGGCCTGATGCTGGCTGCGGACGACGATGATATGTCAAGAGTGGTGTTC C
TGTGGCGGGGAATCTAGGAGGAGTCTGTGGCGAAAGGTGAAGGAGGAGCTATAGTAA
GCTCCTTAAAGGTCGATACAGGTAAGACCTCAGATATATCAAGGTGGTCAGAAGAGGTA
AAGAAAGGTTTAGACTCAAAAAATATGACGTCAGATGATATAAAGTACTTATGAAGA TCA
GGTGAGTAACAACGATATTCCTTCTGAACACGAGAATAACCAAGGAAGACACACTTG AG
AGCACGAGGAGCTAACTTATCTTTCCCAGGGGCTAAGTTATGAACGAAGCAAATGCT CC
CTAAAACACGAGGAGGAACAGAGTATAAGGGTGATTGGGGAACAATACTGCATACGGA
ATCTGATTCTGGATGGGAGATGAAGGCATCCGTTTAACCAAATAACAAGCTGTGAGA AC
TG CATCAG GTG GTCAG AAGAGGTAAAGAAAG GTTTAG ACTAAAAAAATGTG ATGTCAGA
TGGCATAAAGTACTTATGAAGATCAGGTGAGTAACAACGATATCCCTTCTGAACACG AG
AATAACCAAGGAAGACACACTTGAGAGCACGAGGAGCTAATTTATCTTTTCCAAGGGCT
AAGTTATGAACGAAGCAAGTGCTACCAAAAACACGAGAGGAACAGAGTAGAAGGGTGA
CTGGGAAATAGTACTGCATACGGAATGTGATTCTGGATGGGAGATGAAGGCATCCGA T
TAACCAAATAACAATCTGTGAGAACTGCATCGCCCCAAAAACGCAACAGAACATGAG AT
TCAATGAGAAGTGTGCGAGCAATCTCAATGATGTGCCTATTCTTTCTCTCTGCAACC CTA
TTTTACTGAGGGGTATAAGAACAAGAGGTCTGATGAATAATTCCTTGAGAAGTCATA AAC
TGCTGAAATTGAGAGGATAAATATTCTAAGGCATTATCACTGCGAAAAGTGCGAATG GA
AACACCAAATTGATTTTTAATTTCAGCACAAAAATTCTGGAATATAGAAAACAACTC AGAA
CGATCTTTCATTAAGAAAATCCAAGTAATCTTGAATGATCATCAATGAGACTAACAA AAT
AACGAAATCCCAAGGTTGAACTGGCTCTACTAGGACCCCATATATCAGAATGAACTA AA
GAAAAAACAGACTCTGCATGACTCTCAATACTACGAGAAAAGGTTTGGGAATGTTTT CC
GAGCTGACATGACTCACACTATAATCTAGATAAACTAGACAAACTAGGCACCATCCT CT
GAAGCTTGGATAAGCTTGGATGTCCTAAACTTATGTGAATTAGGTCCGGAGGATCTG TA
G CTAG ACATGTCTTG G AG GAATTGAGTGAGTTAAGGTAGTAAAG GC CTTCTGATTCAAG
TCATGTTCCAATCGTCTGTCCCGTACTACGGTCCTGCATAATAAAAGAATCATCAAT AAA
ATATATACCACAATGGAGGGCACGAGTCAAATGACTAGCAGATGCAAAGGACAGCCAG
GGACATAAAGAACAGAATCTAGAGTGACAGAGGGTGGGGGATTTGCTTGTCCAACTCC
TTTTGCTTTAGTTTGAGACCCATTGGCTAAAATAATAGTGGGAAGAGACTGTGAATA TGC
AATATTTGACAAAAGTGATTTATTACCAGAGATATGATCAGAAGCTGCTGAGTCCAC AAC
CCATTATCCAAGAGTACTAGACTGGGAAACACAAGCAAAAGAATTATCAACAACAGA AG
TATCAGTCTGAGCAATAGAGGCTACTTGTGGAGATGTCTGCTTGCTCGATACGGAGG G
AACTCATTATATTCCCCTTCAAATAAAGAAAATCCCTGGTTACCTGTAGTCTCGGTC TGA
GCAACATAAGCATTTTTGAGTGGACGACCTTGTAAAGAATAGCACACGTCACGAGTG TG
TCCAAGTTTATGACAATAAGAGCAATTGGGCCTAGATCTTCCAAAACGACCACCTCC TC
GTCTATTCTCCATAGTTTGAGATACCTGATTGTCCACTGACTGGGATACGAGAACAG AT
GAGTCAAGTGTCTGTGATGAGCTTACTAGGTGACTTGGTATTGCAGCAAGGCGAAGT AA
TTGAGAGAATAATTCATCAACATTGGGGACAGTCGGACTAGCCAAAATCTGGTCACA TA
CTGAATCAAGGTCATTAGGGAGTTCAGCGAGTGTAAGAACTAGAAACATCTTCTGTC GT
TGCTCTTGTTGCTTTTCAATACTAGCAGAAACTGACATCAATTGCTCAAATTCTTCC ATG
ACTGCCTGTACTTGTCCTAAGTGTGTAGACATATCCAATTCCTGTTTCTTCAAGCTT GTC
ATTCGCGATATTACATCATAGAAACGAGATAGTCATTAGTGTATAAAGTACGAGCCT TCT
CCCAAACTAAATAACATGTCTGGAATGGATGGAACAAAGGCATCAACTTGGAATCAA TA
GATCGCCACAAGATACTACATAACTGAGCATCGACCTTCTCCCAAAGTGTTTTGGCC TT
TTCATCACCGTCGCTAGCCCTTTTTGTTAAATGATCTTGAACTCCTTGACCTTTACA CCA
CAACTCGACAGACGAAGCCCAAGCTAAGTAGTTTGAACTTCCCATTAAAGGTTCTGA GG
CAATCATAATACCATAACTTCCAGAACCCGTGTTTTTAGACCCAAATACATCCACTC CCA
AAGACATTATTGGATTGAAAAGAGATCTAGCAAATTAGCACCAAATAAAACAAAGAA TCA
ACTGTGGTTGCCGAAAAACTGCCGGAAAAAATACTGTAGTTGCAGGAAAATTTTCAA AG TGCTCGGAATCAAAAAATAAAAATATGGGAAGGCTCGGAATTGCAGGGCGATCAGACT
GTTCTAAAGAAGTTTTCTGAAAAAATGGACGGAACGGGCTCCACGCGCCGGCGCGTG G
AGTAGATCTTGCCGGCGAAAATTGTCTTCGGGCGGCGCGTGAGGCGGAGTCTGACGG
AGTTGTTTGCTGGGGTTTGGTCGCCGGAGGTTGGGGACCTTATGGTGGTGTTGGTTT T
TGCACAACACCGATGGAATTGGTTTTGACGAAAAAATAGCCCTAAAAGGTCACCGGG AT
GAAGCACGTCGACGACTGGGTTTTCATTCCCGGATGTTTTCTCACTGCCGCTCTGAT AC
CATGTGAGAATGCACGGGAGAAAAATCTATTCTATTGATTAAAGTGTTGTACAACCC TAT
TTATATACAGTAATTACATAATAATAGGTATCTACTTCCCGATGTGGGACACTAAAC ATG
ACTAACTACTTAACAGTGTTGAACATG G GTAAC CCAG G GG G GTCTTCACCTAATTCCAA
AACTGAAGTGAAAAGAGGAAAAAGAAAGCAGGCTTGTGAGGTTCCGCGTTCCTCTCC A
TCTGATATGACCTTTCTATATATATATATTCTTTTAAATGATGCTTCCCGGCTAGCT TATG
CGCACCTCGATTATTCTATTTAGTACATGCTACCTCCCATCAGAACATGCACAAGGT AAC
TCTGTCCATCAAGGCTTAGGAAAATAGAAGAAATCACCTACTCTCTCCGTTCCAATT TAT
GTGATCCTGTTTGACTGGGCACAGAGTTTAAGAAAAAATGAAGACTTTTAGAATTTG TG
GTCCTAAACAAGTCAAAAAGGGGCCTAGATTATTTGTGTGGTTATAAAAGCTTCTCA TTA
ATAGTAGAATTGTAAGTTTAAGCTAAATTGTTACCAAATTTAGAAATGGGTCATTCT TTTT
G AAAC GG AC CAAAAAGG AAATAG GTTCACATAAACTGG AACAG AG G GAGTAGTATTTTT
TGTTTCCATTGGGATTTGTGATTGAGATCTCATGGTTATGTATGCATATGTTGTGAT GTA
GTTCCATATCTTTACTGTTTAGATGGTTGACAGGGAAATAGTAAGTTCTTTTTTAAC TTAA
TTACGAAAATAAATTGTCTTCTTTTATTTAAGCTTATGTGACATTATTTCCTTTTTA GTTTG
CTTATAAAAGAATTAACCCTTTCTAAGTTTGGAGAACTAAATGTTCTCATTTTACGC TTAA
TGGCAAGCATTTATAGCCACACATTCGTTCTGCATATTTAAAACCTGAAGTTTCAAA AGT
CTTATTTAACCATAGCATGTTTAAGACTACAAGTTTCAAAAGTCATTCTTTTTTCCT TATTA
AACTTC CTGTGTAGTTAAACAAG GTCG CAATAAAATGAAATAGAG GG AGTACTATATTTA
AGAATG GTATC ATACTTG GTAGTTTTTCTCTTCTCGTCTCTCCTTTTTTG G GTAG GG G AT
GCATCAAGCTGTAGGTTCAATTGTTTATAACTTTTAAATAGCTAAAGAACTTGCTCT AGT
AGTGATTTCGGGGGTAAGATAATCTTTGTGGTTAAGAATCCATTGAATATGGGAATA ATA
AAAAAAAAAAAAG AAAAAG AATG C ATTG G ATTAG AG ATC AC AAC C ATTTTTAG C C GAT AT
TGGCTGCGATGTGTTCTGGCTAATTTTTTCTTGAAAGGTAACAGAGGATCGGTTCTG GC
GAATTAACTAAGGTTCATTTTAAATATCACCCATATCCAACCTAGCCACCCATCTGT CAC
GTATAAAACTTATTGTGGACAAAAATAGAATGGTCACTTCCTTTTCCACCCACCGCT AGT
TGCGACGTGACAACAAGGAGTTGTAAATGTGATGTGATGGATTTTAGTTATAGCAGG AA
CTTAAATGATAATCTAGGAAGCTACAATTTTGCTATGCTTCAAACAACAGTCATATC TCG
TATTAAAACTCAGCAAGTCTTGCTTTTTTAGTAATATAGGTTTGCTAATATTTCAAA ATCC
TATTTTATATTTTCTGCATGTATTGGATCTCTCATTGCTTTAAGATATAAGAAATGG TAAT
CTTAAACTATGTCTATGCCATCAGAAAGTGGATTACCCGCTGTCGTTGGATGTATAT GAT
TTTTGTTCGGAAGACCTTCGCAAGAAACTGGAAGGTCCTCGCCAGGTACTGTCTTTT TC
CCATTGATCAATGTCTTTTAAGAAATGAGGAAAGACCAGACCCTCTTTGGCCCCTCT TTC
TCTTTCTTGTTCTGTTATTACATGACTCTAAATTTGCTG CTAG GTTTTG AG G GATG CTG A
AGGTAAGAAGGCCGGTTTAAAAACCAGTGAGAAAACTTCAAGTTCAACTGACGGCGA C
GTTAAAATGACTGAGGCTGAGGTATGAATTAATCTTTGTAATGTAGGAGTGACTTAA GG
GGATAAAGAGGGACCTTTCGGGCCCACCTATGGCGGATTGTACAAGGTGGTTTAAAG T
GGGAAATTAAGAATGTCTATAAGTAACTTCTGCCTTTTCTCCTTTTTTTTCCTATTG TTTA
TG CAG GAATCATCTAGTG GAAGTG GAG AAG CGTCTAAAACAACCCAAGAAG GTAG AGA
AACACCTCCTTTTCTTGATAACTTGATGACTTGATAAACATATGCTGCTGCTGTATT TTAA
TTGGTAACAATGTCTGGCATTAAAATTGTAATATTTGGGAGAGAAGTTATTGTCATG AAA
TTACCTTCCAACTCACATATCCTTTTCAATGCTTTAAATGAAACTCTGTTAGTTAAT TGTC
AGATATAATTCTGCAGTAATATTGCGGTTCCATGAGGTTTTACAGTTCTTATGACAA CAA
TGTTCTGCCCTGGGTATTCAAACTTCTTTTCACAAAGTCACTGTTAGTATCTTTGAT TACA
AG CG ATTG AC CTTCTATTAACAATTTTG G GATCCCATAAAG ATTATTAACTTG GATCAGA
TTTATTCCTTTTTAAATTACTATATGTCCCTAGACACCGGCGGACGTTTGCCAAGTT TCT TTTGAAGGGGGCGCCTTTTAATTTTTAGAACTATGGAAGATCCTTAAGGTTTAGCTCTGC
TGATACAAGTACATTTTAATTTGTTTGGACATTTGTTTGTATGTAGAAAATGTAGAG AACC
TCTAGCATAGATAACCCCGTACTTGCCTTTGAATTTATATAACTACGAAAGATTCTA AAG
GTTTAG CTCTG CTG ATACACTTAC ATTAAGTTG GTGTTTTCTC GG AGTCCTTTAATTTGTT
TAGACATTTATTTGTATGTGGAAAATGTAGAGAACCTCTAGCATTGGATAACCCCAT AAA
TTGTCTTAAAAGAAAATTTTCTGAGTATTGGAATGAACTAGGGCCCAGATGGAGCAG AA
TG AATGTCG G GG ATTAAATAAG G GACTCTAACTG GTTCTGG ATTG G GAG CAGTAGTTGT
GAATGATCAATTTTATCAACAGTTTAGTGTTTCTGATATATAAGGGGAAGTTAGTCT AAG
AGCTCAATTTTGGAATTCTGTTATGAATGAGGAGTCGCAATGATTAGTGCTTTTTTT TTTT
TAATGATGCCCGATTAGTAAACTCCATTTCAGGTGTTCTGCCTGAGAAGGAACACCA CT
TG ACTG G AATATATG ATTTG GTG GCC GTG CTG ACTC ACAAG G G AAGAAGTG CTGACTCT
GGGCATTATGTTGCCTGGGTCAAGCAAGAAAACGGTCAGTTTAACTGGGAAGAGATT TT
GTTCTAGTAATCGTTGCTCTTGGACTACCATCTGATACAATATATTGAAAATCTCTT TGTA
AACCACAGGAAAGTGGGTTCAATTTGATGATGACAATCCAATTCCGCAGAGAGAAGA G
GACATCCCTAAACTTTCAGGAGGTGGTAAGTGAATCACTTGTGTATTACGTCTTCGG CA
AATTTTCAAAGTCTGG CAAG CATATCCTTTCTTATAACAACAAG ATGTAAAG CAGATG GA
ATATTTTGTTGCTTGTGTGCCTGAATGTGTTTTTCGTTCTGTCAGTTTATAGAAGTG CTTT
ATTTTTGGTTTCAGGTGATTGGCATATGGCTTATATTTGCATGTACAAGGCCCGTGT TGT
TCCCATG
SEQ 52
CTATTTCACTTGATGCAAGGAAAATTGATTACTCCTGGCACTACGAAGAAATGTTTGGTT
AAAAGGACAAAGGTCAGTGCAAAAACCAGAATAGTTCATTGCTACACTAGGTCCAACTA
GCCAGATAGATGCACACAATTTGCGGCAGAGATAACTATAAGACACCACAGTGCTTAAC
TGCCTTTTAACCAAAGACCAAAAACACTCATGAAGAAGAAAAATGACAACCTTTTTA TAG
CTATGGACTGTCCATTTCATACGATTCCAGTTGTTAATACTTTATGCATGAGTCAAA TAG
AATTTGCATACAAAAAAATAGGTCCATTTGCAACAAATCCGAGTATAACTGAATGAA CAG
ATGAGAGCCATTAAAACCTTAATGTCAAATCCTACAAAACAATTGGATCATCTCCAC AAT
GCATGACGTAATTCATCTCATGCTGTAAATATATCAGTCTGTGGTTAATATGAAAGT TAT
AGATTAAAGATTCACAAGACACAAATGTGCCTCCTGAACTTCCTAGAGCAGCATATC AG
CTAAAGCAAAGAATAGAAATACACTAATACAGAACAGCAAAGAGAGACTTAATAGAC CT
GAGTTCCATATTTTCAATGCGGATACGTTCAACATATAAGTCATGATCTTCTGCTTC ACT
GTTGATCTCATCGTCTTTTTTGGGATCCTTTTTGCGCCGTGAACTTGATCTCCTCGA AAA
TAGTCTATCAATCCTG G CAG CATAAG AACCTAAG AGTG CTAAG ACACAATG CATTCACT
AAAACATTGAAAAAAGGCCCAAAACAGAGGCAAAGTGGAGATGAACTTTAGAATTCT AT
CTTCACATTTTCACCAGAAGGGTTCAGAAAGTTATAGGAACTTCATTTCAAGTACAA TGC
AAACAGTAAACAGGTTTTTACTATACTAAAGTATCTAAAAGTCACTTACCTTGTTTT TCTC
AAAATCCATGCCCAAGTGCGTCCAATAATCATGTCCACTTTTTCCATGAAAAAGTAA TCA
GTAGTCCTATCAATTCCAATGCCGCGTCGAAAGATTACATACTGGAAAGAAAAGGTA CG
TATATAGAGATTCAGTGGCATGATCAAACTCTTGACTCTCATCCTTTTCCCTCGAGA AAT
ATATGTTCATATTACACATTGATATACTTAATAAATTTCGAGGTAAAACATTGATAT ACCA
TATAAGTTGAACAGCTCAAATACAATATAATACATCCATGATACTATTTGCAGGTTC AAAA
AGATATAAGTG ACAACACGTG CACAAGTCACTCTTTAC CTTTGTTG AAG CTTGTGTG CA
CTATTACAG GTTTG AG GAG CAGACCTTG AACAATG ATTAATCTTTTGAAAACTAG CAAG A
AGATGAGAGGATAAAGATTCCAGTTTGTGCTCAGTCATAACACACTTTTTTGAATCC GTG
GTGAAAATAAAAATCTTTCATCAAAGAAAAAGAAGAAAGGGCCAGTAAGCACAATAT AAC
TCTAAAGTTCTTCGCAATAAATGATCGCATACTTTTGCTTGTGTTTTATTGGAGTGC AAAT
TCCTTCTTGCCAAAATCATGGGCAGTAATGGACATCCTGACACCTAAGAGCTAATGT TG
TATAGGGACAGTGGTTACTTCAAGATTCACTTTAAACAGCCAATCATACTTGCTGCA AAG CAACAATGAGATAGGGAAATAAATGGACACCATATGTGGATATCAAGGATTCAAGTCGC
ATTCTTGAACCCCAAACAATGCAAGGCACCAACAAAAAGCAGATTGGGAAATATTCA AT
TTGGTGAAAATTTCATAAATACAACCAGAAGCTACACTTGTCGACCTAGCATCTGGG AG
CCATTAACTTCATACCAAACGTTCCAGCAAAAGTAAAGGCCTCATTTCAAATTGTAA CTT
G AAG CTACACTAAG AG ATATACCTTGTCAGCAAACTCTG G AAG GTCCTCATG AG G ATG C
TCTG CAAAATACTTCTCTAAAAGTTTTTTGTCAAG CTG CATCC AG GAAAAGGTAGAGTGA
ATCGCATAGCGAACATGACAAAATGGAACTATATCAAGAGAGCCAATAATTGTCGGA AC
CAAATTATATGCGGAAGCAGCTGGTTAAGAACCTTAGATTCATCAACAGTGATTGGA AG
ATTTAGAAGATACTGTCCTGAATGTGCAACATCAATCTCTTCATCACTAGCTATTTT AAAA
TTGCTTTTGTGCATAATCTGCATCAGCATAAGAGAAATGATGATAGAAAGTGGAGAA TG
AAATAAGGGTGCCAGAACACAAATGTCATACCCATTTCCCTTAATCCTCCAGCTAGA TG
TGGAAAAGCATTCAGAATTCACTGCACTTAGAGTTTGATGCAAGTTTAAAATCCATT ACA
GGCTTGGCCTCAATAGCATGATAATGCAGTGCAAAATAACGGAAAACGACTTTAGCG AT
AAATGAATAATATATAGCCGTTCCCCATGCTTGAAAATAAAGAAATATCCCCTTCCA TTT
CGATTTTGCAACAAAACAGCATAAATATGCGAAACTTTCTTATTGTAAGGCTACTCC TTA
TG ATTAAAAATG G GAAG CATGTGTTAGAAAAAACAAAGTAAAAGAG AAAAAAAACAG GA
AAATGAAGGACGAAGCAACTCCCACTCCCTGAAGACAAAAACCTCGACCTCTTACCT TT
CACCTCACTGGTTGTCTCCCTCCCCTGAGAAAAAACAATTATTCGCTATGTCCCAAT TTA
TGTGATGCACTTTCCTTTTTAGTATGTCCCAAAAAGAATTATACCTTACTATATTTA AAAA
AAAATTAAAACTTTCCATTTTACCCTTAATGAGATAATCTATAGCCACAAAAATATC TATG
ACTTGTTTAGACCACAAGTTCCAAAAGCTTTCTTTCTTTCTTAAATTTTGTGCCCAG CTAA
ACAACATCACATAAAATGGGACGGAAGGAGTAGTTTTTCCTCTCAATTTAATCCAAT AGA
ATTTTCTTCCATTTCAGTGGAGATCTTACACTTAATAACTGATGCAGGATTATATCT TTTG
TTATATTTTTATTTCCTAGGCTCGGGAAGGAGAGGATAGCATGTTTTCACCTCCGCT GG
GGATTTTCTTTTGAAGGTACATAAATGTCAGGGCCCTCATACAAGGGTTGTAGATCG AA
AGTATTCTAACTATCTTCTTTTTTTCGGGTAAATTGTTGAGCCATGGGTACACACAT ACA
CAAATATATAGAGAGAGATATCTTCATATAAGCAACTGCAGACTATAAATATGCACA CAC
AGAAGACAGATATGAGATTGGATTTTTTATCAAATTTAAACTGGAACACCAATATCC CGA
CTAAAGATACGCGGATACAATAGAATATAGCAGCTGATGGTGTTACTGAAAGGTCCA AC
ACGGGTGCTCAAAGTCCACTTGGGCAAATGCGGCTAGGGAGTCAGAAAGAATGAGAT G
GGCGACGGACTTGCCCTACTGTCAGAGCTAGTGGTGTGTTCTTGAGATGAAGGCTTA G
CATTCACCACCTTCCACGTTTCCGACTTTCTCTTATTTGAACTTTTTAGATCAGTCA ACTT
G AGTAAG CAACTTG GG CCAAG G GATG AAACTCAAGTCTCTAG CTCTCAAGTATACTG AG
TAGGCAACTTGGGCTTGCATGTTATACATATTTTTTTAACTAGTATACTGAGTAGGC AAC
TGGGGCTTGCATGTTATATAATGAATCCAAACAGCCCATATAGCTGGCTCAGGTACC AG
AGGAAAACAAACTTTAGATGAAGCTGCAGATTTTTTACCTGAAACAAATATGTCAAG AAA
TTCTGTTCAAGAATGTCAATCTCTTCTGGAGATAACTTCTGTTGTTCCAATTTCTTA GCC
CCGTTCACAGGATCAAACAAAGAGTATAATTGCTGCAATCAGACAAATATTATTGAT CAA
AACACAATATCAACATCAGCATATTATATTCATTTGCAGCAGACTGATGAGAAAAGA TGT
CTGATCAAATAAAAAGTAGCATAAGATTATATTACCATGAGATCCTCAAATTGTAGA AGA
TACCAAGCATGAATTGTGTACTCAACCCTCTTGCAAAGCTTCAGAAATTCAGCCCGG TC
AGAACTATGTTCTG GG CAAG GAG GAAGGTCG GATCATTAG AACCTCAAAAAAGACTTTT
ACTTAAATAACCAATCATAAAAAAGCACTAAAGGGACTAGAAGCACAGATATTCATG CAG
CATTTTACAGTTTCAAGAAAAGTAGATTTTCTGAAGAAGAAAAAAAGAGTGGAAATG AGC
TGCCAAGACGCAGACATGATTTACGTATATCCAGAAGGAAAACTTGTTTCTGTTAAA AAT
TCTAAACATATAACACGTCAAAGCTTATCAGTGAAGGAAGGGTTTTGAAACCTTTTG GG
G ATTTTG ATCCATG ACC ATCAAGTGTCTCCGTAAAATTCACATG G AG G GTGTACAG G GT
CCCCCCTTCTTTTGAACTGGGGTCTCCTCCTTTTGAAATGTCCAGCACTAATTTACT GCG
TTGTTTTTACTGTGAGTAATACTGTTACTTATCTATCCAAAAAATTACATAGAGGGA AGTC
GTTTCAAAAG CTTAC AATTTCTTG AAATCC ATG AG G ACTTG ATGAAAAGATG GAAG CCTT
TCTATTTCGGTTAGAGTTTATATGCTCGAGTGACCGTGTAAGGAGAAGTGTAAGATT TA GACAATTCACGGTTATAGAGAAACCCTTATTTGTCCTAAAAGACAATTTATTGAGTAGTG
GAATGAAACAAGCCAATGGAGCAGAATGATAATGAGAAAATCACATAACCAACTCCA AC
TAGTTATGAATCAGGGTGGAGTTGCAAAAGTAAAATCTCTATTAAGCATCTAATTTC TTA
AATTCAAGAGATTGTGAACTTCTGAGTTGAGTGAAAGTGACTTACTCTAAGAAAGTA TTT
AATGATAAAACTCGCAATTGAGATAAGTCAAATGGTAAACCTTGGAAAATAATCTAA TTT
AATCTGGGAAAATATCTAATTCACACAACAGTAAATCTAATTAATCTGGGAAAATAA TCA
AGGTCAAATTGTCTGCTTTGTCACAATACAATTAGAGTGCCTCAGAAAACAACATTA AAT
TTTG CAAAATTAAACCATTAAGG G GGTTG CTGTTTACACGAAAAAAG CTAAAACTCAGAC
GTCACCAAAACTATGAGATCGTGTAAAGTTAGTAACATTTTAGGTTTAAATTCATGG CAC
GCAATAATTGCTTGACATTCAGGAGAAAGACAAAAAAGTGACTGCCGTTCCCTATAG AG
TCTAGCGTAGAAAGTTGTATTATTTGCTTCAAAGATCTCCTCTTGGAGAAAGAAGCA TTG
TTTCCTTACACTGCCATTTTCTATGGTCTAGTATAGGAAGTTCTATTGTTTGCTTTA AAAG
TCTCCTCGGATTATAGCAGAGTTGTACTTGTGCTAGTCAAATTTGGTAGTTTTTTCC ACA
GAAGTAACAGAAAGTGAAAATATCAGAGATATTATCCAATTAAATTAGAGGAGAAGA ATT
GTTTCAAAATTCAAACAACTGATCGACCACAAAATAGAGAGGGAAGAAAGAGTAGCC GG
TTGTTTGGGAGCATCCAATTTTTACCTATAAGGTCGGCGAGGGCCATGATGAGCCTG G
GTTTGAGGACTGGAATAACAGACTCGCGCTCCAAACGGATCACCTCTTTCTTCTTCT CC
AT
SEQ 53
TCACCGGTTTGTGACAACTGGATTTCCGTTCGCATCAGACAAATGGATACTCTGGGTTG
ACTTGTTAGCGACAACAGGATTTCCAGACTCATCTAAATTAATTATAGCAACATCAC CAG
GCTTGAGATCCCCAGAAAGGAAAGATTCACTCAGGAGATCTTCAACCATTTGAGTAA CA
GCCCTCCTAAGAGGGCGTGCACCGTAGTTTCTGTCAAATCCTTGTTGGCATATAAGC TC
CATTACTGCTTCTGACACCTCCAAGCTTATTTCCAATGAAACAAGCCTAGCCCTCAC CTC
CTG CAGCATCAGGTCTAGTATCTG GAG CATCTGAAAG GG AAAAAACAAAG CAGTTACTC
GCAGTAGCCGACTGCTTTCTAAAGAGATGTGCCAGAATAAAAGATCACCCCAATGTA CA
TTCAGTTATCAGATCAATGCAACTTTCCAAATCAAACAAGAGGTATATTATACGAAC CTG
GGGCTTCTCTAGAGGACGGAATACTACTACTTCGTCTAGCCTATTCATCAACTCAGG GC
GGAAATATGTCTTGAGCTCTTCCATCACTATTGCTTTCATACCAGCATAGGAGGCTG CT
GATTCATCATCAGCAAGCAAGAAGCCAATAGTATTCTGTCTACCCTTTACTATGGCT GTA
GAACCCACATTAGAAGTCATCACTATCAGGGCATTCTTAAATGACACTCTTCTTCCC TGT
TGAATCAAGTTTCATAATTATCAACGCCAATCTTCCAAAACAAGGTTTACACCCCAT ATT
GTGGAACATTCTAGAACAATAAGCCAGATGTAAAAACGGATCTAACCTGAGAGTCTG TT
AGGTGACCATCTTCAAACAACTGAAGGAGAATATTGAATATGTCAGGATGAGCCTTT TC
AATCTCATCTAGCAGCACTACAGTGAAGGGCTTTCTTCTGATAGCTTCAGTAAGTGT TC
CTCCTTCTCCATAGCCTACATAACCAGGAGGCGATCCAATTAACTTGCTCACAGTAT GC
CGCTCCATGTATTCACTCATATCCAATCTTAGCATGGCAGATTCCTGTGCAACACAA AAA
GATACTTCACTGAGTACATAGAAATTCACAAAACCAAGTGATTGTTAAACTGAAACA GAA
CCGAAGATAGCAAATATAACTCTAGCAATTTGATGATGAACTTCTAAGAAAAGTAGA TGC
ATAACTCTTGATGATGAAATTCTTTTTGATAAAATCCTAGCCACAATTTTCATCTAG AAGG
TTGAACAGAGAATGTCGCTCAGATGAATTTGATTTGATTCAATCTCTGCACTGAAGT CAC
AAACCGTCTACGACCAAGAGATCCAAATTCAGTGTGATGACAAGATTAAAAGCAGCG CT
AACTATATGACTATGATCCATTATCAAAGCTCACTTAGTACTTTTCATTTATTTTGC CAAA
TTACGTGCCTCCACCATAGGTGATTAGAGACTCCTTCTGTTCTTTCTTTTGGGGCGG GC
TGAGAAGGGGTGGAAGGTGCAAGGATCTAGTAAATGTAACAGACTTATCAGATATAT AT
ACAATGGTTGACTTCATTTCCCATTGAAATGGATGAAGGAATAATCTGATCCTGGCA ACA
GGGAAAGAGATTTGAAATAAGCCAGTAATAGAGCACTACCTAGTACCTAGTATGTCT AA
ACTTGAAGTACTTATTAGTGCCCACCAATAATCAGAAGTCGTGCTCCAACAAGATTA ATT GGTGGTATTTCCAACGCTACATTTGACCATGCCGCAATAGCCATAGAAACCTAGAATGG
ACACACAGCACAGCCTCCGGTACCCTTTCCTTCTCTCCTTTTGTTTTTATTATGTGT GTC
CGACCACCTAAAACACAGTCACCACATCTTACTATCATAGTATAATACTTTCTTTAC CCG
AATCACCACTACCAACAACATAGAAAATTCCCAAGAACACCAGAACCAAAAAAAGTC CT
GAGAACACCAGTACTATCACCAAAACCTATCCATTGTCACTGAAGTGGCTGGATTGC AT
AACTTCAGTGG CAAAACATG GTG ACTGTCTG GTGAAACAATTAAGG CATCTAG AATG AA
AAGATGAAGCACATTTCTTATCTTACATAATAATTCTTCTCAAATTTAGACAAACTA AAAA
GAGCAAGATTGTGTTTGTGCAAATTATGCTGCCAGAACTCTTGGTCAACACGATTCA ATT
TCAGAGTTCCAC AATTTCTACTCAATTG CTTAATCTG GAG ACG CATCTTTG GAG GAATAA
TGCAAAACAGCTCATTTTATTAATACTTACAGAACCAAAATAAGACGCTGCCAAAGCTTT
AGCTAGTTCAGATTTTCCAACTCCAGTAGGACCACAGAAGAGCATTGCCGAAATTGG TC
TATTTGGGTCCTTAAGACCAGTTCTAGATCTCTTAACAGCCCGACAAATGGCTGCAA CA
GCCTCATCCTGACCAACAACCCTTTTTTTAAGCTGCTCATCAAGACCAACCAGAAGC AT
TCTTTCATCAACAGTAAG CTGCTTAAGG G GAATG CCTGTCCAGAGTGAAG CAACTG CTG
CTATTTCCTCAGGTCCAACTACCGGAGGTCTAGAGTAAAAATCAAAGGATTTAAGTT AAT
TTACTTCTGATTTAAATTTATGACTTCAGACAATCTTTATTCTACTATAATGCCCTT GTAG
GAACATGGAGGCAGATACGTACTCATCTTCATCAGATGTAGAAGGTGATGCTGGCTG TA
AATGAAGTTCACTGCCATCATTCAAACGAGATGCATCATCATTTTCTGTCAGCTTGC TTG
CCAAG ATCTGTTTATG GAG GG G GC ACAGGTAATTGTATTTAG AC CACATAAG GAAGTTC
AAATTTTTCAAG AC GTG CAAAATCTAG AG AGTTTGTATAACATTGAGTACTATTAC CACTT
CATGCATGGCTTGAACAGCTCTAATCTCCTGCCAATAATCACTTGGTGATTGTGAGA GT
ACAGATATCTGCTGTTCCTTTCTTCTTTTGTGAGCTTGCATACGAGATTTACTACCA GCC
TCATCAATAAGATCAATAGCTTTGTCAGGAAGATACCTATCCGGTATATATCTTGCT GAC
AGTTGCACAGCAGCATTTATGGCTTCCAAACTGTATATACACTTATGATGTGACTCA TAT
TTCTCACGCAATCCCAACAGTATCTGGACAGCATCCGCCTATATAATTTAGGAAACA AG
AATATCTGTTAG G GC CTTAG ACACC CACTTG AGC ATACAC ATG CATAG AGTATTATTCAC
CTGACTTGGTTCATTAATCAAGACAGGCTGGAATCTTCGGGCAAAGGCCTTGTCCTT CT
CAATATGCAATCTGAACTCATCCATGGTGGTAGATGCAATACACTGTTACAAGAACA AAA
TTTAAG CG CAG AGTCC ATAAAAG CCAACATTATG AATGC AACTAC ATG GAGTAATG CAT
GGAACCAGAATAGGAAATTACCTGCAGTTCGCCCCGCCCAAGTGCTGGCTTTAGCAAA
TTAGCAATGTCAAGACCAGAACCCTTATTTCCCCTTCCAACTGTACCAGCACCAACA AG
GATGTGGACCTCATCTATGAATAGAATGATATTGCCTGCAAATTATGTCACAAGCTA AGC
CAATGATCTAAGAATTTGACCAAATTTTACTTCCATTCTATGCTCACCTGACTTTTT GACC
TCCTTAATTAATGTAGTCACACGCCCCTCTAGTTCGCCCCTCTCCTTTGCACCTGAA ATG
AGTAGGCCAATGTCTAAAGACATTACCCGCTTTTTCTGTGTATATAGCAAATAACCA CTA
GTTAAGTAG GTG CAG CAG CTAGTG G AACAAG AGTATAG G AG ATG CATATTAAATTACAA
AATAGTTCAAAGAATCCCTGGAAAAAAAATGAATTCATTGAAAAGCCCACCATTAAA AAT
GCAGGAATATTTCCCTCAGCAATGTTTATCGCCAGCCCTTCGGCTATCGCTGTTTTC CC
AACCCCAGCTTGACCAAGCAGAATAGGATTGTTTTTGGTTCGACGGCAGAGAATCTC GA
TAATTCGCTGAACTTCAATCTCTCTGCCAATTACTGGGTCTATAAGGCCCTCACTCA CAC
G GG CAGTAAG ATCTACACAGAATTG CTCCAG CGC ATTTTTCTCTG CTG GAAATG AAATG
CATTTGAGACAGCGTGCCAAACCAAACACTAACAAGAGAGAGGAACTGGCCATACCTTT
TGCTTTCTCAGCGGATCTGTCGATAGTTATTTTTCCAGGAAAGGATTTCTCACGCGA CC
TTTTG AATG AAATTG G CTCTCTACCATCTTTAG CAAG CTCTCCTTG AAG CCTG GAAACTG
CCTCAGCTGCCAAACGATTTACATTTACTCCTAACCTGAAATTAGGTCAGTCGACCA TG
CAAATCTTAATTTCTTATATAAATAGCGTGTAATAGATGACAGAAAGATAATCATAT TGCA
AGAGAGAAGGCATATTAAACTCAATCAGGAAGGTGAAGAAGGATCCTATTCCCAAGC CA
GTAGCATTGTAAGATAAATTATGCAGCAAAAAGGAAATCCGTAAGCATTTGTTTATT TAT
TAACAAGTACATCAGTATGCTTGGTTTAGAGCACAACATGTGCTCATGCATAGCGTT CC
ATCAGAAACAG AAATTG CTT AT CAACAG G AG GTCAAAATATTAAGCTAG G GTTGTCAGG
CGTACAACTTGAAAAAGAATATGTACACATACATGTACACAACACAAGAAGACACCA AAA AAAAAAGAGAGGCAGATACCATTCATAAGTTAAGAACAGAATACTAAGGTAACAACAAC
AATCCAGTGTATTCCCACAGAGGATAAGATGTACGTAGCCTTACCCCTACCCCGGAA AG
GCTGAGAGACTGTTTCCGGTAGACCCTTGGCAAACCAAAATATCTCTTGGTATTTTG TAT
ACGCTTGGGAGGGAAAACAACACCAAATGGAATGTAAAATATGGCAAGCAAGGATAA AT
GAGTGAAGGATAAATTCTACCAATAAGTTCATTGCTTCCTAGCAACAGTGGGAGACA AC
ATCTCATTCCTCCTCAATTGTAGAAGCTGTTTTGGACTCAAAATAAGCAACAATAGC TAA
TCTATTCATGATCCCAAGATTGTTTCAGCTTACCAAAAGCCTATTACTTTTGTTCTT AATA
ACCTCATCATCCACTACTAGAGGCAGATGTAGTGCTTGAGTTAAAGGTTCATCTAAA CC
CATTAACTTTGGTTCAAACTTGTATGTATGTTAAAATATGCACCAAATAAGTACAAA TAAT
ATATTTCAAACCAAGAATGGGCTGCGGAACGCATATTCAAATCGTTGATCTGCCTCT AA
CTACCTACTTGCTCTACCAAAAAAACTAGCAGTTCCTATACTGAAAACAAAACTACT TGC
CAAAAAAAGTTAAGAAAAAAAAAAGCATTCTTGACTGACTATCTGTAATACCTCTTG AGC
ACACGAGTGGCGTTACCATCATCAACAGTAAACAAACCAAAGGCCATATGCTCGGGA G
CAATAAAATTATGCCCCATGGTCCTTGAATACTCAACCGCAGCCTCAAAAACGCGCT TC
GTACTTGAAGAAAACGCCACATCAGTAGCCGACGTAGCAGAACCGGAGTCCTGAGAA G
CCAATTTTTCTTTATCATCCTCCACGTCATCATGCCATATGCTCCGAACAGCTTCGC GG
GCTTTATCAATTGTTATTCGAGAACCAAGGAATCCACCAGGGCTACGATCCTCTGCG AT
CAGACCCAGCAAAAGATGCTGTGTATACACCATATCTTTGCCCAAAGCCTTTGCTTC TTT
TTGAGAAAACATCACAGCTTTGATTGATCTCTCAGTAAATCTCTCGAACACTCCAGA GAC
AATATACAAAGAGCGCTTGATTTTACGAGGAATTGAGCTGCAGGGCCTATGAGAAAGG
GAAATTCCAAAAAGGGACGAAGTAGAACTGCTAGTACTACAAGCAGCGGTAGTGGCG G
TAGTAATAGTAATATGAGAGGAGGAAGAAGGGCAATATGGGAAAAGCGAAAACACGGT
TTG ACATCTCTTGTGAG GGTACACAG AG CCATAG CG ACG AAG CTGAG GATTGAAG CTG
ATTGTTGAGTTCACAGAAAGTGGAGAAGAACACGTTAATTCCAT
SEQ 54
ATGAAGAATATCGAGCGTCTCGCAAATGTTGCTTTATTAGGTATGGTTTCTTTTGATTTT
GATTCATACATTATATCTTTTGATGATAGCTGAATTGCATAGTATACTTGTTTGATT TGTG
CTTGTGAAGTTCAGAAAGTAAAGTAAATCCTGTTTGATTTATAGCTTCGTTTTTTGC CCC
TTAGTTTGTGTTGGTTACTCATTCAGATCATTTTTCCGCTAGATAGATTGCTAAAGC TTTT
GATGCTAATTCTTTGTTGTTAATTGGAAGGAGGCTACCCTTCCAGGGTAGGGGTAAGAC
TACGTACATCTTATCCTCCCCAGACCCCACTCGTGGGAATTCACTGGGTTTATTGTT GTT
GTTGTTGTTAATTGGAAGGACTCAATGTAGGGAAAGGTGCTAATTATTGTGTAGTTG GA
ATTTGAGGTGTGGTTGATGGTTACCCTAAATATATCTATCCAGCATGTTGGAGTAGA TTC
TATAGCGGTTGGAATGAATATTCAAATCCCCTTGGGCAGTCACATTACTACTGTTAC CC
GCTTTCCTTTATGTCACAGTAGGTTCCACTTCCACAGTTCCAGTTCAATCGGTAGAC AAA
GATGGTCATGTGGGTTCTTTATATCAGTTTTAGCATTTTCTTATATGTTGGATGTTT GTTT
CATCATATTGCCTTTTTGAGGACATTTCACTACGTAATAGCAGCTATGCCGTTCTTG GAA
ATTTACAATGTACGATTATTTGGTCATGGCAATTTCACATCACTTTCCAAATTTTAT GTTG
ACGCAATTACCTTGAAACTCTTGCTTTTTTGGTGGATTTCAGGTTTGAGTCTGGCAC CAC
TGGTGGTGAATGTGGATCCAAATGTAAATGTCATAGTAACAGCTTGCCTTACTGTCT TTG
TGGGATGCTACCGTTCTGTCAAGCCTACTCCACCTTCAGTATATCTTCTGTACTCCA AGT
TGCAGCTTCCCTTTTTCTTAGATCTGTTTTGATGTCACTTAAACATATTCTACTGCT GTTT
TCCAGGAAACAATGTCTAATGAACACGCAATGAGGTTCCCCTTGGTTGGAAGTGCAA TG
CTCTTGTCATTGTTCTTGCTTTTTAAGTTCCTGTCAAAAGACCTGGTTAATGCCGTA TTG
ACATGCTACTTCTTCGTTCTTGGCATTGCTGCACTTTCGTATGTTCTCTCCGTATGG ATC
ATTCTGTGATGCTTAATATTTTCTATAACAAGTTCTTGAATAGTAGTTTTTCTGTGG GTGT
ATTGGATGTCATCTCTTTCTTTGTGTCTTTGCAGGGCGACATTGTTACCTGCTATCA GAC
GATTCTTGCCCAAAAAGTGGAATGATGATCTCATAATATGGCACTTCCCATATTTCC GCT GTAGGCACCACCTTTCTTGTCTCTTTTGAAATGCCAATTGATCCTTTAGAATCCTTGGGC
ATACAGATCTCATCTTAGTTATTTTGTTTCGTCTTTTTTCAGCTTTGGAGATTGAGT TCAC
AAGATCTCAGATTGTTGCCGCAATTCCTGGAACCATCTTCTGTGTTTGGTATGCTAA ACA
GAAGCATTGGCTAGCTAACAACGTTTTGGGCCTTGCCTTTTGCATTCAGGTTTGTCG GC
ATATCCATCCAAGTTACATTCTCATTCTTCAGGATATCTCAAAATGAAAAGTTGTGT AAAA
TAGTATTATTAGTACAATG GTAATATACAATTTTG GATATTTCAAAGTGAAAAGAGTATCA
TATAAATTGGGATAGAGGAAGTACTAAGACACTTGAATGAAGAGATCATATTTCATC ACT
AAAAAAGTTGCACTTATCTGTCCATACATGTTCTCGTAACCAAGCATGGTTGCTCTT TAA
TACCAGATGCAAAGGCTACCCGCCTTTATATCTAGCATTTAAATCCACGATAGCACT TGA
TGGCTTCCTCTTTAATTTGTTTGATAACTAGAATTCTCCCAGGAGTTGGCCTACATT TATT
AAACTATGGGAGTATAATAGGCCTCCTCTATCATGCTCCCACTAATATAGCGGCTCC TT
GTTAGTGATAGGGTTCTAACTCATGACGTGGACCCATATTCTGACATTGCGTCATTA CAT
TGAACCGGAGCCCCAGGGGCTTACTATTTGTGATTTTCTATTTATATATACATTGAG TTA
ATGGAGATTTTTGCAAGGGAGAAAAGGTTTGATCCTCTCTTATGTCATGTCTACATC AAT
GATTGATATTGATTTTCCCATTGCGATTTTGATTTTCAGGGTATTGAAATGCTTTCA CTTG
GATCATTTAAGACTGGCGCCATACTATTGGTAAGAAAGAAAATTTGTTTTCTAATTT CTAT
CTGTAATTATACATGGCTGACAGCTGTATTCTGTTTATGTGTTTCGCCTTAAACTAC ATAT
TGCTTGTCTTTTTGAATTTGATGCTAACCACATATCTCTTTATTCAAGCGGGAAGAG AAT
TTCATGAAATGAGCTATTAATGATTGTTATTGTTGAGCTATTAATGATTTTACATAC AAAA
ATACATAACATTTGCATGGATTATCCCTAATTGCAGAGTTTTTAGACATTTTTGAGG TATT
CTTTTATGTTGGCATTTTTGCTTGTTTATGCAACTATTTATATCCATTAACTTGTAG CTGA
TGTTGAATG CACATG GTTTTCG AG AATG CAG G CAG G ACTTTTTGTGTATG ACATCTTCT
GGGTCTTTTTTACCCCAGTGATGGTCAGTGTTGCCAAATCTTTTGATGCTCCTATCA AG
GTGTGCATACTGATTTTCTCATATAGCTATTTCTTTTGAATTTTCATTTCATGCCTT TATTA
GTTACAGAGTCCTGATTATAACTTCGCTTTCTCTGCAGCTTTTGTTCCCCACAGCAG ATG
CTAAACGCCCCTTCTCAATGTTGGGTCTTGGAGACATAGTTATCCCCGGTATAACCT CC
ATTTGCGTGAAAACTCCATTCACTTTATGTGGTTAGAACAGAGAGGTTTAGCATTTT GCC
TAGCGGAGGGATCCTCCACCTCAAACCATGTGGTTTGGGGTTTGAGGCAGTAGGGAA A
CGGTGGGAAAAGCCACTGTTGGTCCCTGGAGGGGAAAAAATGGGGGTGGGTGGGGG
GATGAGGTTTACCATATTAAAAATGAAAAGCTTACTTTTTGTGAGTAGCTACTTGAA TGT
ATTTTTCTGTTTCTTACACATGCTTATTATTAGCTTTTGCCATGATGCTGTATTTGT TTTCA
TTTTTCAACTTGTTTTTTGCTTGAATAGATACACTTGGTAACATTTGATCACTTCAA TCAT
GCAGGTATTTTTGTTGCATTGGCCCTCCGCTTTGACGTTTCCAGAGGGAAGGGGCCC C
AATACTTTAAG AGTG CATTTTTAG GATACACATTTG GTTTG G CTCTTACCATATTTGTTAT
G AACTG GTTTCAAGCTG CACAGGTTG GTG AATCAAAATAAAG CTTTTACACTTTATTTCT
CTTGCTAGAATTGCAGCGCCCTTATGTTTGACTTGGCCTTTGTTTTTTCCAGCCTGC TCT
GCTATATATTGTTCCAGCAGTGATTGGATTCTTAGCCGTACACTGCATATGGAACGG GG
ACGTGAAGCCTGTACGTTTTTTCTTTTGACAATCTGTTTCAACTTCATCCACTTGCT AACT
TTACTGTTTATGTTCTTTATATGCTGTTTACTACCATTTTAGCTACACACTATTTGT AGATT
ATATTTTCTAGGAGTTAATAGATATGAGAAAATGCATCTTATGGTTTACCTTTAATT CATC
CAAAGAAAACATGCATGTCATGATTTGTTTGGAAACTATGGACAATAAGTTAAAGGT AGG
GAAGGGAAGTTTTCTCGTTTCTTCGAGTTGGAAGCAAAAAAGGACGATCAAGAACTT CT
TTTTCGCCTAACCTTTGATAGAGAGAGTACAAACCCAAACCTTCATTGCCTTTTCTA GTT
TATACGGATACAGAGTTAACGAAATTTTCGTTTATGGAAGTAAGATTGGGGTCTATT CTC
AAGTCGTAGAAAATTACATTTTCGTTTTGATGCTGAGTTTGTATTCTTGATTTTCTG TTGA
GCAGTTGTTGGAGTTCGACGAGGGAAAGACGAAAGGCGCTGAAGAAGCCGATGCCAA
AGAAAGCAAGAAGGTAGAA
SEQ 55
CTAATTTTGTTTGAGATCTCTATAGTACTCTTCTAACCACTTTTGAATAATAGCCACTTCT
TGCTTCCTTTGCATGATCAACCAGCCAGGGTCATTCTTTGTCTCAGAACGAAAGTCA AC
ATGGTGTGCACCTAATTAAATAAGAAAAATTAAAACATGAAACTAGTAATAAAATAA ATCA
ATGTCTCTAATCAACAGCAATCGTTTTTCATTATGCTTTAATTCAATGAATATCATG ATAT
ACAAAAGCATATATTACACAAATAGCCGGTCATATAGCCGATGTACATAGATTATAC AGT
AATTATATATAGTTATACACATTTTATATATGAATTATACATAAATTGTACATGCGC TAGTT
ATTTTTAATTTAAGAAATCAGATCAGTGGCTATTTGGGTTAACTCTTCGTACCTTTT TGAG
TTACGAGTG CCACAATG CTAG CTG ATATATTTTTCAG CACACTGTCACAAAAAG AG AAA
GAAAAAAATCAATTTAATAACAATAAACAAAGACTTATTCAAGATTTAGAGTCTATA TGAG
TTTATAATATAAGTCGAAAATAATAGATTCAATTAAAACAACAGGACTTTACCATCA CAGA
TAGATATAGGACCACATTTAGTGGTTTTAATTGTATCTATCTATTGTTTTCCACTCC AATT
ATAAACTCATATGGACTCTAAATCTTGAATTTACGAGTCCTCACATTATTATATTTT TATTT
TTAAATTTTGAACTCACCCTCCTCTGCTCCATGGATCTTGCATTCCGTTAGAGAATA TCA
TATTACTGCCAAATCTCTTAAGAACTTGCTCAATTCTCTGAAATATAAACATATTTT TTTTA
ATTTCTTATTTTCGTGAATAAAAAAATATGTAAAGAGAATTTTCATTTGTTAACAAA AAAA
GAATGTAAATTAGAAAACTTACATAGCCACCAAATTCAGTAGTGATCCAATGTGGTC GA
GGCTCTACTCCATATTTCTTTTTGCAATCTTCTTTGAATTCCTTGTAACTATAGGAA GATG
GAGGAAACATGCTTTCATTTGAACAAGTCATTGGCATAACCATCTCTGTACATGCCT TAC
TCAAAAACCATGGAAAGAAACACACTAAAATTAGGCATACATTAGACCAAGATTTCA AGA
ATCATTTAATTTTAATCACACCATAACCAAACTAAAGTTTAATATAAGTACCAGTGG CGAA
TCCAACAATGCATTTACGGATTCGATCGAACTTAGTATTTACAGTATAGAAAAATTT GTAT
ATACGACAACAACATATCAAGCCCACCAAGCAGGGGCGTACGCATGAATTTTTGTAAGT
GATGTCAAAATTTATAGAAGAACGGGAATTATAACTTTAATTATCATACTTCTAGAC AAGA
AATAACTTTAATCTATTGGTTTTGTTGAGTCTTAAGTTAGAAAAAGTCCAGAATTAA ATTC
TACTTCATGACAGTTTTAACCACATAGATTCTCTAAAATATTTGCAAAAAAAAAAAA AAAA
AAAAAAAAAAAAAAAAAAAAAAAAG GAAAG CAAAAG AAG CCTGTCAGTTCTC AAATTCAC
AACTTACAATTGAACAGAAACAACCTTCTTTTTTCGTNAGCAAAATCAAGCACATAA CCA
ATGCATCAAGAAATAATTTATGTCAAGTGGTGTCATTTTTTTCTACTTATCTGCTTC TTCA
CAGTATTAACACATATATTTAGTAAAAAAATTCGACGAAGCGGTGTCGCGTGACACC GC
TTCGATACATCTGCATCCGCCCCTACCACCAAGTGGGATTTGGGGGGTGGGTAGGAC A
TATGCAAACATTATCCCTACTTTTGTGAAGGAAAAAATATTATTTCCGGTAGACCCC CAA
CTCAAAGAAAGATAAAAAGAGATAGAAACAACAAATAGTAACAATCACAAGATAAGG AG
ATAAGGTGATAGAATAATGAGAGATAGAATAACATTTAATAGAAAAATCTGCGAATA AAA
AGCTACAAAAATACCATACTAAAAAAATACGTATAGATAGAGTTAGATATATTTTTCAAAA
TCTACACGTTTAAATTCTGAATCTGCCACTAATAAGTACAAAGAATAATAAATGGAA TAG
AATAAAAGGTAATTAAGGTAAAGAATTCTAAACCTGCCAATCCCAACCACGAAGACC AT
GAGCATCATCACCACCTTCTAAATTGAAACATTTTTCTCTTTTTGTATAATTGTAAT ATAA
ACTTGCCGCAGCAAATGCCCGGCTGATTTTGGAAGCTCCTTTTGGTAATCCATCAAT TA
TCTTGCACATCTTAATTTAAAAAAAAAAAAAAGTTTAAGAATTTTTTCACTCGACAA TATA
ATTTTTTTACACAATTAGTCATTTATAAGATAATTACAAATAAATTTTTATGATAAG TATTA
ATTGATGAACTGATAAAAATGATAAATAACCTGTTATAATATGTTAAACTACACTAA TCAT
GTACAAAATCTCCACATTGTCATTGCATTAAGTCTTATGTACAAATGGTATAAAAAA TATT
TATGCCAATCAGATAAATTATAAAGTACCTACAAGTAACCTTTGTGATAAACATTAA TTGG
TAAATGGCTAAAAATTGTATCTAGCATGTCATATCAGATTAAACTATACCCACTTTT GTAA
AAATATTTATCCTGTCAATTAGGGGCGGACCTACGTGGGGAGGGGTCACTAGACCCC G
TCAGTCTCGACAAAAAAACTGTATATAAATTTCATATATATCTATATATACAGTAAA GACG
CCTTAAATACTTTATGCGCCCCCCTAAAAGCACAAAAACTGGACAGAGGCACTGGTT TG
TAGGAGTGCTTAATCCAAGTTCGAATCTATGCTCCTACAACTTATATTTTTTATTTT ATTT
TTAAAGTGGTGTCGCCGTAATACTCAAATCCCAGGTCCGCATCTGTTATCAATGTAT ATC
AAGTTCAACAAAAGTTGATCAGGGGCGAAGCTATATATCCCAAAGGGTAGTCAACTG AC
CACCCTTCCTC GAAAAATTTACTTTG CGTATATAG GTAACATATTAG GTTTTAG AG GTAT ATAACATATATGAATACTCTTTATTAGAGAATTTTTTCCACTTCTTTAAGTTTGAACACC C
TTGAGCCTTGAGAAAATTACTGATTTCGCCACTGATGCTGATAGAAAAAAAGGAAAT GTA
ATTAAATGCAATGATTTTGAAACCAATTTAGTGCTAATTAAGCAGAATTGTCTACCT CTTG
TACTGGATATGCTGGCAATGGCATCATAAAGTTGGCTTTAGTAGGATAATTCACCAT TG
CTGTATATACAAAAGCTTCCCATAGCCAATCTCTAGCTGAATAAACTGAATGTAAAC CCC
TGAAATATAATATAAAACAAATCATTTTATTGAATCAAACTTGACCTTGTAATTTGC TACA
ATAAAAAATTATCATGTTTCTATTTTTATATTCAAATATACTTACTTGCAAGTTCTG AAAAG
TTTACTAACTTCAGTCAAGCCTTCTTCATGTTTTGATAAAGCATCCAACTCTGTCCA ACTT
CCCTTTATCACTCTATAACAATTCAAGCTTACCTCCTATTCGTGAGGGCAAATAAAT AAA
TAATAATAAGGTCTCTTATCCAAGAAAGATTTTTATGAATATATCCTATTCGAGATA AAAG
ATAATTACAGTAATCGCTCATAATAAAGTGAGATTAGTAATCTGCAAATAAGTCAAG TTA
CTTGTTACAACATGTTGAAAATATTGAGTGTAAAAATTTATTTACATTGTCAATAAC GGTG
GATCCTGATGGGTTCATCCTCTAGAGTTCAAATAATTTAAGGGTTTGTTTGGCCATG ATT
TTTTTTTTTACTTTTTTTTTGGAATCAGTGTTTGGCGATGAAAAATTCTAACATTTG AATTT
CTAAATTTTTTCGAATTTGAAAAACTTCAAAAAACTATTTTTCAAGATTTTCACTTC AAAAC
ACTTAAAAAAATTTAAAAACAACCCCAAATTATATTCATGTCCAAACACAATTCTAA TTTT
AAAATACCATTTTCAACTTGAAAACAAAAATTACTTGTTTAAGGAATTTCACAATTC TTAT
GTCCAAACACCCACAAATTTGTTGCGCTTTTAGAAGTAGAGTTTCAAAATTTAATAT TTAT
TAAAATTTACAATCTTTTTGCATTTCTTTATAAACCTAGTAAATAAGGCGATATTCG TCCC
AGTGTATATAAATTAAATCCTATTATAAAAGGAGCATCTAGGTGCATGAATGTGACC ATG
TAGGATCAGAACATTTTAAACAAAGGGGCTTAATAGATTGTAAAGGCTTTACTATTA CTA
ACGACAAGAGCGGGTTGCTCCAGTGGTGAGCACCCTCCACATTCAATCAAGAGGTTG T
GAATTCGAGTCACCCCAAAAGCAAGGTTAGGAGTTCTTCGAGGAAGGGAGCCGAGGG T
CTATCGGAAACAACCTCTCTATCCCAAGATAAAAGTAAAGTTTACGTACACACTACC CTC
CTCAGACCCTACTAGTAAAATTTTACCTGGTTATTATTGTTGTTGCTGCTTTACTAT TACT
AACCTTG AAATCTTG GG AAACAG CATCATAAAAG CTTGACCATGG G GTGATTTTGTCAA
ACTGC AAGATTG GTG CTG AAG ATG CCACTG CACCTATTG CTATATGTG G GTACTTCAAT
CTAAACCAAGAAGCCAACACTGGTTGACCAGAAAAAACAAAGTTAAAAGAGGAAAGG AA
AACAAGTATTTAAGAGTGAGAGCAATTTATTAATTATTGGATACTACTCGAGAAGAA TGA
ACCTAAATAGCCGTTCACTCAACTGCTTAAACTAAAATAGCTGACGGATGTATAAAA TAG
GAAAATTTTATTTTGTGTCTCAGACAACTTACACTAGTTGTATGAAGCATTTTTTTT ATCT
CACAGTTTTGTGGACCCGAATTTTTGTGGACCTAGAAGTGTAAGATTGGGGTCCACA AA
TTTATGAGACAACAAAGAGCCTCGCACAACTAATGTCAGTTGTGTGAGGCACACAAT AA
AACTTCTCTATAAAATATATAATTCATACTCTTATATAATACACCATCCGGTCCACT TTCA
TTGATTTTTTGACTCTTTTCACATATATTAGAAAATCACATTTTAGCATTAATTCAC AATGA
AATTGACCATATTAACCTTATTTTGTTCCTTGAAAATATAACAAATGCTCCTATGCT CTTT
ACTTCAAATGCAACTTTAAAAAAAAAATTAACTTATTCTTAATATCTGGAAAAAATC AAAT
ATTGTGGACCACAAAAAAAATTAAAAATTCAATTAAAATTGACCGGAGAGAGTATAT GCA
TAACTATGTATAATCTATATATATCGGCTAGAAACAAACAGTAAATTGAACTGGCTA TTTG
TGTAAAGATTCCTACTTAACAAATGCAAAAGTGGAAGAAAAGTTCTGTTTATTTGAA TAAT
TGAATGCATCTAATGCTAATGCTAAATTCATACAAAAGAGAACTTTCCATGAACATT TAG
CAACCATAGAATGTAATTATCATTGATTCACATGGATTGGACACTCAATAAGTCAAT ATG
TCCACACATGTAATGTCATGTCATTTCCATCTATCATTATGTCAAGGCAAAAAATTA GCT
AAAAGTTAAAACTTTTTCACTTATATTATTACTTTTCTTTCATTACTTTTTTTTTGT TTGTTT
GTGTGGTGTTCTACTATATTAGTGGCAGTTTGGACATAAGAATTGTAAAATTTCAAA AAA
AAAAAAATTAACAAAATTTAAGTAAAAATAATATTTGAAAATTAGAGTTGTATTAGA ATAT
GAACATAATTTAAAGCTGCTTTTGATTTTTTTTATGAATGATTTGAAATGAAAATTT TGAAA
AACAGCTTTTTGAAGTTTTTAAAATTTTCGAAAAATTCCAAAATTCAACTTCAAGTG AAAT
TTAAAATTTGCATGGCCAAACACTGATTTCGGGAAAAGTGAATGTTTTTTATGGCCA AAC
AGTTCCTTACTTACTTCCTCCATAAGAGCCACCAAAAACCACAACCGGTGATGATTC AG
AAG AAAGATTCTGCTTTAAACTCCTTATTAGAAC AG CATAATCAG CCAATG CTTG CTGTG AATTCAAGTATCCCAAAGTCTTTGGTGACTTGTAAGATTTCTTTCCAAATGGCATTGAAT
CCCCATAAAACCTATGCTAAATTATTACAATACAAAAAACCATTATCAATTTCATTCCCAA
CAAAGATAAATAATAATAATAATAATAAAATATAAAAAAGGTTCAATTTTACCTTTC TATAT
GAAGTATCTTAATATTACAATTCATTATACTTTGGGCCACTAATATCTTATTTTTGG AAAA
AATTCTTGTATTTGTCTTGATTCTAACGAAGTTCCAACTTGAAGTATAATAGATGGT AATT
TTAAATCATAGTGAATAGCTGGATAAATTTGGATTTTTTCTAGTAGTATTTTGATAC GTAG
AATCTACCAAATCAATATTGGAGTTTCATTAAACGTAGTATAAATACGATTCGATTT AATA
ACGGCAAGAATATAAATAATCCCTTAAATAAAACGAAGTGTAAAACTAAAATACTTC GTA
TAGTACAACAACAATAAATTCAGTGTAATTTCACATGTGAAGTTTGGGGAGGATAGT GTG
TACGTAGATCTTACACATATCTTGGGAAGATAAAGAAGTTGTTTTCGATAGACCCTC GG
CTCAACGAATAGTGAAAACAAAGTAACAAACAGTAGCAACAACAACATAATATGAAC AAA
AGGCAAAATACTTCGTATAGTATAGGAGTAAATTTAAATATTTTTCTCAAAAATAAA TACT
TCAAATAAAAAAACATTTCAAGATTATATACATACTTCAATGAAGACTAGAAGAGCA TGA
AACTTAGGAGCAATATCAAGCATAAATCCAGTATTTGCAGCAAACCAATCAATATTT CCT
TCATTTCCAGTGTAGACAAAGATAGGGCCTCCTTGTTTCCAATAATTATCATTTATG AGA
TATTTCTGTTTAAAAACTTTAGAACTCTTTGGTAGAAAAGTGAAATGGTCAAGAATT TGA
GGAAAGTAATGGACTTTAAATGGTATTTTTGACTTGACATGTTGTTTTTCTAATGAA GATT
GATAAGTTCCAGGTAGATAAATTGGCTTAATTTCTCCAACTACAAAAGAGATAATAA ACA
GTAAAATCAAGAAAATGAAAGAAAAATAAGAAGAAGAAAAAGCCAT
SEQ 56
ATGTCTCGTTTCTCACTCCTATTGGCTCTCGTCGTCGCCGGTGGCCTTTTCGCCTCCGC
ACTCGCCGGACCGGCGACCTTTGCCGATGAGAATCCGATCAGACAAGTCGTTTCTGA C
GGTTTACATGAGCTGGAGAACGCAATTCTCCAAGTCGTCGGCAAGACCCGCCATGCT C
TCTCCTTCGCTCGCTTTGCTCACAGGTACGATGATCTCTACATGGAAATGAGATTTT TTT
TTGTTATTTGCTTATTAATAGTAATTGTTTTATTTTGAGTTTAAGTTCTATATATGC AGTGA
GCATATTTTTTTTTTTACATAAATAAGATAATAACAAATAAATCACTTAATATGTAT TAGTT
GGTAATGATAGTGTAAAAAAATATTATACTGTAATGTGTATATAACTTAAATCTTTT TATTT
TTGGGACGATATTTAAGGTATGGGAAGAGGTACGAGTCAGTTGAGGAGATAAAGCAA A
G GTTCGAG GTATTTTTG GACAATTTG AAG ATG ATTC GATCG CACAACAAG AAAG G ACTA
TCATACAAACTCGGTGTCAATGGTATAATTAATATTATGGCATAACGCTAAGGCCCT GCT
CTTTTCCTTTTTTCTCTTTTGCTTAAGTGGAGTCTTAATTTGTTGATTTGGAGGTAA CAAG
TTATAGTTTTGTGGTTCCTTTACCGGAATACTCTTTGTTTTTATCTTCAGCTAAGGT AACA
GATTAAGGCGTAATTATAGTTATTATTGTAAAATAAGGTAATTTTTATTTAGAAGCT TCAA
AATTAAGTACAAGCAATTGAATACTACTCTTTGTAAGTAGACTTTGTATATATGTTT TTAT
TTCATTCTCTTTTTTCATTTGGAGAGATGTGGACAAATTAAAATTATAATATAATGC GATA
AAACATATGTTCCACTACAGTATCAGTATGGTATTTATAGTTTGCATATTTTATTAG TAAT
TAATTGGTCTAGTGCCTTATCATGTGTAGATATTTCATTCATATTGTGTGGCTAGTG GGT
ACCCTTTCTCTCTCCAATCAAAAAACTTTTTTTAAAAGCTCAATTCAAAAGCTTTTC TTCA
TTACAACTGATCCTGCTTAAAGACTAAAAACAATCTAAATTGAATTCTTAATTCTTC TCTA
TTCATTCATATATGGACATAAAAACAAAATCACAGTACATGGAAAGAATATAAGCAC CTA
AGCATTGGACTGCCCAAATGAAAAGTTTTTGCAACTTAATCTAGTTGTGCATAGATT CAA
CAACAAAAAGTAAAGAAATAAGTATGCATTTTATGCTTCTAAGTTCTAGTATATATG GCC
CTTATTGTTTATCGATTATTATGTTTCATGACAGAGTTTACCGACCTAACATGGGAC GAG
TTCCGGAGAGACAGGTTGGGGGCAGCTCAAAACTGTTCAGCCACCACAAAGGGCAAT C
TCAAAGTCACTAACGTTGTTCTGCCGGAGACGGTATATGCACTCAGAACTCCTCTGT AT
CTATTTCTGGAGTTAGTGATCATTAGAGTTAAACTACTTTCTGATGATTTATTATTT CCAG
AATTGTGGAGTGCTCTGAGTTTAATTATGCTGTAACTATAGAAACACTAACTAAAAA GAT
CTTGAATAGGTATCCTACAACAATAAATAGAATCCTCATAAGAAATACCACTAGATC GAG CACCAGTCATGATTTCATATCTGGTAAAAATCTTGGCTAATTGATCGAAGTGGAGTAGAC
TAGC GAG CATGTACTGAGCTAATG CACAATTG GTTG CAAAAAGAAGTTTTTTCTTTCCTA
ACCGAAATTTCCAATTTCGTAATTATAGAAAGACTGGCGGGAAGCTGGGATTGTCAG CC
CAGTCAAGAAC CAG G GCAAGTG CG GATCTTG CTGG ACATTC AG GTAAGAATTAGTTAG
AATCTCACATCATTGGACTCTTAAATTGTAAGTCTTGAAATTGCACTCTTAAGCTGA AATA
TAACGGAGAAGGCACTTGGCAGCACTACTGGTGCACTAGAAGCAGCATATAGCCAAGC
ATTTGGGAAGGGAATCTCTCTATCTGAGCAGCAGCTTGTGGACTGTGCTGGAGCTTT TA
ATAACTTTGGCTGCAATGGTGGGCTCCCATCACAAGCCTTTGAGTATATTAAATCCA AT
G GTG GTCTTGACACTGAAGAAG CATATCC ATACACTG G CAAG AATG GCTTATGTAAATT
CTCATCAGAAAATGTTGGTGTCAAAGTCATCGATTCCGTCAATATTACCCTGGTATG ATA
TCTCTTTCCTCCAGTATGCAACCAATCTTTGCCAGTGTTAATATCCAACCTTAATGG TCA
ATAAGGATTGGTTAAGTTCCTTACATACGTGTCATTACAGGGTGCTGAAGATGAACT AAA
ATACGCGGTTGCATTGGTTAGGCCCGTTAGTATAGCTTTTGAGGTGATAAAAGGTTT CA
AACAATACAAGAGTGGTGTTTACACCAGCACCGAATGCGGCAACACTCCCATGGTAA GT
CATCTGTCCCTAGGAACGTGATATGCAAATATATTGACATAGTTACCTAAATACAGG GG
AAAGCTACAGCCGACCAAGGGTCGTCAGTTGAACACCCTTCACTTCACTGTCGTGCA TA
TATTAAATCTTGAACACCCTAAGTGAAATTTATAACTTCGCTAAATAGGCATATACA CAAT
ATTACAAACATTGTGTGTTGCATTGGCAGGATGTAAACCATGCTGTTCTTGCTGTGG GT
TACGGTGTTGAAAATGGTGTTCCCTATTGGCTCATCAAGAATTCATGGGGAGCAGAT TG
GGGTGACAATGGATACTTCAAAATGGAGATGGGAAAGAACATGTGTGGTATTGCCAC TT
GCGCATCCTACCCTGTCGTTGCC
SEQ 57
ATGGAGAAGGAACACAAATACTCTTTGTTTCTCACAAAGTTGAAGTTGTTTTTTCTTGTT A
CATTAAGTACTTTCCATGGCCTTAGCCATGGCTTCCAAATGGATCAGGCACGTACAT TA
ATGTCTTGGCGTCGTTCTAAAATGCATGCTCAGACAACTACTTATGCTACTAATGAG GAT
GAGACAGAAAACTTAGTATTTTCCGATGAAAAACATGTCGGAAATATGGAGGATGAT CT
TATTAAAGATGGTCTTCCAGCGCAGCCTTCAAATGTGATGTTTAAGCAATATGCAGG ATA
TGTTAATGTTGATGTAAAGAATGGAAGAAGCCTTTTCTATTACTTTGCTGAAGCTTC TTC
TGGAAATGCTTCTTCAAAACCTCTTGTTCTTTGGCTAAATGGAGGTAAATTATATGT GTT
GATGATTCTTTCTCAACTTAATTTTGTCTTACTAATTACTCATCTTCTCTTAATTCT TTTGT
CATGCACCTAATTTGATTAAGTACTCATTTGTTTTGTTTCGATTTAATCTAACTTAC CCTT
TATGCACATATATTCTGCATCAAATTAAGTTGAATATTACTCCACGTGTTCCACATT ATAT
ACTTAACATTTTTTTTTTCCAATCTATTTTACACATTTTATATATTTGAATACTTTT TTAACT
TTAGACATTTCAATTTACCCTTAATAGTATATTCTTGTAGCCGATCAAATATCTATG AGAT
ATTTTTGAAGTCTTTTTTCTTCCTAAATCAAGTCAAATGTTAATGTATAAAATAAAG CAGA
GGGAGCAATAACTTTCGTTTTATGTTTGTAATTTTTCTTAATAGTGATACTCATTAC TCTC
CCCGGTCCACAATAAGTGACTATTTTACTTTTTTATTTTGGTCAAAAATAAGTATCC ATTT
ACCTAATCAATAAGGAATTAATTTTATTTTTCTAAAATTTACCCTTATTTACATATT CCAAC
GTGTCAAGGAAATAATTAATTAAGGTTAATTTAGTGAATATATTTTTTTTCTCTAAG AGTT
CGTATTTCTTTAATGGATGTGCCAACTATAAAATGGTCACTTATTAGGGACCAGAAG AGT
AACTCTTTATTATTTGAAATTTTG ATTTC CCAAAAG GTATAAATG GTCC GAG GAAC ATCTA
ATTTTCGCTCATTTG CAAG AAAGTG GTTCATAGACAAATG GAGTT ATT AAGTG GG GACG
CGCAACAGAGAATTATTGGTCACTTTATTCGTTTTGTTCGCTCTTTTTCTTTCTTTA CTTC
TTTACTAAAGTAGAAGAGAAAAAAAGGAAGTTTAAAAAATTGTTAGTGTATATGAAG ATA
AGAG CTGTCATTTTCTTCG G CTATTG AATG ACGAATAAAAG CACAATTGG GTACAG GTC
CAGGATGTTCATCATTAGGATTCGGGGCCATGCTAGAGCTTGGGCCTTTTGGTGTAA AC
CCTGATGGTAAAACCCTTTATTCCAGAAGATTTGCATGGAACAAAGGTACATTTCAT TTG
CTAAACTAATATAGACCTACTTATAATTAATGAAACTAATTTCTCAAGAAATAAGAC AACT ATTTTTTGTAATAACGTATACTCTATCTTTTTCAATTTATGTGACAACATTATGTTAAAG GT
GTCACGTGAGTAGGAAGCCAGCTGACACTTAGTAGGCAAAGAGTCTGTTAGATTAGT TG
TTAATTATACAATTAGAAATTAGTTAGAATCAGTTGGATTACATTGTATATGTATGT GTAT
AGACGGTTATTCAATACAACAGTAATTTTCTCATCTTCTCTTTTCCTCTCTAAGCTG CGAT
CTCTCTTAG CTCAATCTAG AAGC ATCCACG AC AG ATGTTTG G CATG GTATCAG AG CTTT
GTGCGATCATTGCTCTCGTCTAATTCTCCTCTGAGTTCATGTGAACGAAACTTCAAC TCG
TTCGTCTTCATCTCCTTCCCTCGAGAACATCGAATCGACGATGACGACGGAAAAAAT TG
ACCACATTCATCCTCTGTTTGTGCATCCCTCAGATACTCCAAGTTTCATGTTGATTC CAG
TCCAACTCACTG G ATCTG AGAATTACG AATTATGG CG G AG ATC GATG AAAATTG CACTT
AGGCAAAACGAAAGTTAGGGTTCGTCAATGGCACACCCACTAAGGATCAGTTTAGGT CA
GAG CTACATG AAG ACTG G G AG ACATGTAATG CGATTGTGCTCTC GTG G ATTATGAACAC
AGTATCTCCAAATTTACGTACTTAGTGGAATTGTGTATGCTTTTAATGCTCACCTAG TAT
GAGAAGATCTAAAGGAGATGTTTGATAAGGTGAATACGATGAGGATCTTTCAATTTC ATA
GAGAAATTGCTACAATTTTCCAACGAACAGATTCAGTGTCCATGTATTTTATAAAAT TGA
AGGAGCTCTGGCTGAGTATGATGCAATGGTACCCTCAACAAATTCGAAGTAGTATGC TG
ATCATCTTCAGCGGCAGAGGCTATTACAATTTCTAAGTGGACTGAATGATTCCTATG CTC
AAG CTAGAAG ACAG ATTCTAATG AAATC AGTAGAAC CTACTTTG AATCG G CCTTATG CTC
CAATTGTTG AAGACG GAAGTCAAATG AGTACATCG G GAACTTTATCACAC ATTG GG CTG
AACTCAATAGCCGAGGAAAATGACATTACAACATTGTGGAGCTCAGCAGTAAAATGA GG
TTCAATCAAGAAGAACAAAAGGAATTACAGTATATTTTGTGAACATTGCAAGATGAA AGG
ACATAGTAAAGAAAATTAGTACCAGCTCATTGGTTATCCGACAGACTTTAAAGATAG AAG
AAAACAAG GAG CACCTACTGGTTAC CAATGAG CACCTATTG GTC ACCAAG G AACAATTG
AGG AAG ATG CAATG CAAG G GAAACAG GTATG ACTGTAGATTTTG GG AATCTTTATG CAG
GCATATCATATGGGGACAACAACATATGCAGGTGCAAAGGCAAGGGACTCATAATCC T
GTACACATGGAAGATGCTCAATCTCAGGGACAATCTTAGGGATATACAGGTGGTGTC AC
TGTTATATTTACTCCGGAACAGTATAGTCAAATCTTACAAATGCTCAACAAAGATTA TGTT
CCAGAAACATCAGCTAATATGGCAGGTACTATTTGTTCTTTTCTGGCTAGTAAAACC GG
GCACAATTGGATAATGGACATAGGAGCAACAGATCATATGGTATCTACTCCTTAAAT GTT
ATTTGATTTGAATGACTATG CTAAG CAAG GCTCACTGTTG CATTTACCTG ATG GAAAAAG
TTGCCTATTAGTTATGTTGGTAAATGTAGATTGGCACAAGGGGACATCAGGGATGTG TT
GTGTGTACCAGACTTCAAGTTTAACTTGTTGTCAGTGGCTAAACTAACTAGAGAATG CA
GTGTTTCATGTCTTTCTATCTTGATTTTTTTCTGATGCAGGACCTTCACATTGGGAA GGT
GAAAGGGACTGATAGAATGCACAATGACTTGTACTATTGGAGAAATAATATAGAGAA TA
AGATAC CACAATCATTG G CTACTACTTTG ACTCAATCTG CAG CATTGTG G CAT AAG AG G
TTG GG G CATGTTCATCATAG AATACTACAACAAATGAACTTTTTTAAAGATATC AAGACA
AATACTGGCAGAACTTGTTCTATATGTCCTTTAGCTAAGCAAACTAGGCTTTCTTTT CCT
CAAAGTACTAGTGGAACTACTACACTGTTTGAGCTAGTTCATGGTGATGTATGGGGT CC
ATACAATGTACCTACATATGGTGGTCATAGATTCTTTCTTACACTTGTAGACGATTG TAG
CAG GATG GTCTG G GTTTTCTTGTTAAG GTTGAAGAGTG ATGTCTCATTTGTATTAAAAGA
TTTTATGTCATTAATAAAGACACAGTTTTATAGTTCAATCAAGGTTTTCAGAAGTGA TAAT
GGTACAGAGTTATTTAACTCACATTGTATAGATTTGTTCAGTGGTGCATGAATTGTA CAT
CAAAACTCATGTGTTCATACTCCACAGCAGAATGAAGTTGTTGAACGAAAGCACATA TAA
TTTTTTG AG GTAGG AAG AGGTTTCAG G GTTG CATTCCTCTAACTTTCTGG G GATTATGT
GTTCAGAATGCTGCGTATCTGATTAACAGGATTCCATCCACTACTGTGGCAAGAAAG TC
ACCATTTGAGGCATTCTATAGGAGGAGTCCTAACCTACAACACCTAAGGGGTGCTTA TG
TTATGCCATAAGTGTGGGTGCCAAAAGTGACAAATTTGGAGCAAAAGCAATCCCAAC AG
TGCATATGGGATACTCTACCACTCAGAAAGGCTATAGGTTGTATAACACAGCCAATA AA
CTGATCTTTGTCAGCAGGGATGTTTCATTTAGAGAAGATATATTTCCCTTCAAGTCC TCC
TACTATCAACTTAGACCACCTAATCTTGTGGAGTATTGGAATGGTCGCCATGATCCC TTT
GTTCTTGAAACTACTATTGATGCAGCTCCATTGGAGACTTCATCTATAGTTGAGCCA GTC
TTTGTCCCCTCTAGTCCTTCTATTCCTACTTCTTTGAATTTAGGAGACTCTACAGCT GGT GTCTCTGAGAATGCTACTACTGTATCAGTCCCTGCTGCTAGTACTGATTCTCTCATTCTT
AGTAAGGCTCCTTATGATAATGTAGCAGATATTACTGTTGCTCCAGATTCTTGAGAG CTT
ACAGTCACAAGAAAGTCATGCAGAACCTCCAAGACTCCTAGTTGGCTTAGTGACTAT GT
TCATAAGGGGTCCAAGCCTCTATCACATGCTGTAATGGGCACAAGTTATCCTTTATC AG
TATATATGTCATATCCTTCACTTTCAGACCCCTATTACAAGGTCATTTATAGCATCT CATC
TGTGAGGGAGCCTGATACTCATGAAGAAGCTCTTTATGATCCACAGTGGGTAGTAGC TA
TG CAACAAG AACTG CAAG CCTTTCAAG ACAATCACACTTG ACAG CTG GTTAATATACCT
CCTGAAAAGAGAGTCATTGGTTGTAAATGAGTATTCAAAGTCAAATACAATGCTAAA GGT
GAGGTGGATAGATACAAAGTTCGTTTGGTAGCCAAGGGATATACTCAGCAGGAGGGG T
TG GATTACTAAGAG ACTTTTTCTCCTGTG G ATAAGATG GTCACTGTGAG GACTATCTTAT
CCTTGGCTGCAATGCATGGTTGAAGGTTGCATCAAATGGATATATTCAATGCATTCC TC
CAGGGTGATCTTGTAGAGGATGTTTACATGGTTCTACCTCCTGGTCTTCTAGGACAT GG
GGGGGAGGGNNNNNAGGGGGGGGGGATGTAGGAGAGTATGCAAGCTACATAAGTCTA
TGTGTGGCTTGAAACAAGCCTCTCGACAGTGAAATCTTAAGCTTTGTGAGGCACTTC TC
TCCTCAGGCTTTATTCTAAGTCATCATGACTAGTCCCTCTTCACTCAAAGATCAGGG AAT
GAGCTGTTCCTCATCCTAGTTTATGTGGATAACCTCCTCATCACATGTTCTTCTCCT TCT
CTCATTCATGCAG CTAACTCATGCTC CATCAG CATTTCAAGATCAAG GATCTG G GG G AG
ATGAGATACTTTCTTGGTCTTGAAATTGCAAAGAGCACAATGGAATACTAGTATGTT AAA
G AAAGTTTG CACTAG CTCCTTATTGCAG ACTTATG AGTGG CTG CTTCTAAG CCTACTAG
CATACCTACGGAGGTCAATCAAAGGTTCACTAGTGAATAATTTGATCACAACTATAA GAC
TGAGGGCAATACTGATGAGTTGTTGTCCGATCCTACTGGCTATCAGAAACTAGTAGG GA
AGCTGCTATACCTAACAATGACTCGACCAGATATAAGATACACAGTGCAGAACCTGA GT
CAATTTATG CATAAACCAAAG AG ATC ACACGTGG AAG G GG CTCTAAG G GTG GTGAAGT
ACTTAAAGAATGCACCTGGTTTGGGCATCTTGTTACCTTCTAAGCCATCCTCACAAC TTA
CAGTCTACCGTGATGCAGACTAGGCCAATTGTCCCATGACAAGAAGGTTAGTT AGTGG
CTTCATAGTCAAGCTGGGAGACTCCTTGATTTCTTGAAAATCAAAGAAGCAAAGTAC AG
TGTCAAGAAGTTCAGCAGAGGCATAATACAGAAGTATGGCCAATGCAATTGCAGAAA TA
GTTTGGCTCATTAGACTGTGTGAGGAACTGAAGGTGAAGCTGGAGTTGCCTGTTAAA CT
ATATTGTGATAGCAAGGGAGCACTTCAAATTGCTGCTAATCCTATCTATCATGAACG AAC
GAAGCACATAGAAATCGACCGTCACTTCATTAGGGAAAAGATACATGAGGGCATTAT AC
ACACAGAACATGTGTCCACAAGTTTGCAGCTGGCAGATATTCTAACTAAAGGTTTAG GA
AAGGCGCAATATGACTTCCTATTATCCAAGCTAGGAATGTTCAATTTGTTCATATTA CAT
AGCTTGAGGGGGAGTGTTAAAGGTGTCACATGAGTAGGAAGCCAGATGACACTTAGC T
GGTAAAGAGTGTTAGATTAGTTGCTAATTATACAATTAGAAGTTAGTTAGAGTCAGT TGG
ATTACACTTTATATGTATGTGTATAGACGGTTATTCAATACAATAGTGAAAATAATT TTCT
CATCTTCTCTTTTCCTCTCTAAGTTGCGATCTCTCTTAGCTTAATCTAGAAGCATCC ATG
ACAG ATGTTTG ACACATTACTATTTG G G GAG CCAAAGAG GTTCTTCTTTATCATGTGTTT
TCTTAAATGTTTTATAAATATTTTGAATTATAATTTTTTTTATGAATTATAGTACTT TTTATG
TAAAAAAAATGAATTTGTATCTAAATTTACGGTGTAAAGTAAGCTAGCGTTTGGCCA TAG
ATTCCCAAATTTGTTCTG AAAAATCTG ATTTG G GTG AAGTTTG GTTTG G AG ATG AAAATG
CGTTTGGACATCAGTTTTCAAAACATATTTCCCAAATTTATTTTGGAAAAACATGAA ATAT
GATTTATACCCACAAGTTCTAAAAACTATCACAAATACCCAACAGTACCATTATCAA TAA
CATTCATTAAAAAACTTTGATTCTCGTAAAAACTTTGATTATCAATCACAAATATCC AAAT
TTATTTTGGCAAAATCTATGGTCAAACGGGTATTAAGAACTTTGATTCTCGTGCTAT GTA
CCTTGCCCGGAATGGAATTGTACGGGTATTAGAAAAATACATTGTGCTGCCAAATGC AT
TGTACAATAACTATAACTACTTATAGTTATTGTATGTTTTTTCTTTCTTTTATTTAT TTACAT
ATGTAATGATGGTATACAGTTGCGAATGTGATGTTTCTGGAGTCGCCGGCAGGGGTT G
GGTTCTCTTATTCCAACACTACCTCGGACTATTCAAAGTCAGGCGATAAGAGGACTG GT
ACACACCGAAAAATCTCGTTAATACAATAGTAATAATTGTCAGTTTCATTATTATTT TTTTA
AAACAATTTTAACAGTCAAAATGATGAAATTTTACTCTTTCATTTAACTCCTCAACT TCAA
TTTCAACTTCACATGCTCTATTCGTCAACACTCAACTCCAATCAAACATTGTGCAAA CAG TTATATTATTATCGTTTGTAGTCTGTAACTATTTTTTAATTTTTTTTAAAGACTACACTT GA
GTATCGTTAAAAACATGGTCAAATCTTTTGGTCACTTAAAGTGAGGCAGAGGTGGTG TC
TTTTCTTATCAAACGAGATTTTTCATTTTTTTATTTATCATTAATTCAGTTATATAT TTATTT
CTTTTCCTTAAACTATATTCTTTTTTATTGGGTGACAGCTGAAGATGCATATAGGTT TCTA
GTGAATTGGTTCAAGAGGTTTCCACATTACAAAGGCAGGGATTTCTACATCATGGGA GA
AAGCTATGCAGGTATCTAGTACAGTATCAGTAATTAACAAACGAAAAAATACAAAAC AAA
AACTTTTGATATTCTTGACTTATCCTTCTAGTAGTGAGAGCCCTCATTATTGAGTTC TGTC
CAATAAAATTTGTAAGAATTAAGGACCACTGTATCAGAGACAGCTTGTGCATATTTC AGA
CCATTCACAGAAATGTCCTCCTGTACAGTCTCAGCTCAAAGCCGAAATGAGCATTCT GA
TTGAAAATTTTGGCTATATTTGTAATACAATCTTATATAAGTAGTCTATATCTAAAG TCTAA
TATTGACATGACAGCTAATATTGCTGTGACTGCTCATCACAGGATTCTACGTACCAG AG
CTAGCAGATATCATTGTCAAGAGGAACATGTTGCCTACCACAAACTTCTACATCCAA TTC
AAAGGAATCATGGTATTATATCATTTAATTTGTTGACCTTTTAATTTGTTTGATCTC TCTG
TTATCAAATCTTACTTGTATACCTAGTGATGAGGGGCGGATTTAGGGGTGCAAGGGT GT
TCACCCGAATCCCTTCGCCGAAAAATTACACGGTATATATAAGAAAAAGTCTGATAT TTA
CCTTTATATATTATGTTTTGAATTTCCTTTACACAGCCCAAAAGTCTACTCTATGTC ATGA
CATAAATTATTTCTTTATATTG CAG ATAG G GAATG GTATAATG AATG ATG AAACAG ACGA
GAAAGGGACATTGGATTATTTATGGAGTCATGCACTAATCTCAGACGAGACTCATCG AG
GTCTCCTACAACACTGCAAAACGGAGACCGAAACATGCCAACATTTTCAGAACATAG CA
GAGGCTGAGTTGGGAAACGTCGATCCTTACAACATCTATGGTCCCCAATGTTCCATT AA
TTCAAAGAGCAGATCTTCTTCTCCGAAACTGAAGAATGGATATGATCCTTGCGAACA AC
AATACGTTCAGAATTATCTCAATCTTCCTCATGTGCAGAAGGCCTTGCATGCTAACC TCA
CTAACCTTCCTTATCTTTGGAACCCATGCAGGTAATCCAACTAAGTAAATATTATGT ATA
GCATATCGATTTAACTTATATATACCGATAGTATAAACAATTTTTACACTGTCGTTG TATA
TGTATTGATTTATATATATACACTGTTAGTGTAAAATGTGTTGTAACAAATAATCTA TGTT
ATTTTTCTATTTATAAATTAAATTCTACTTTTTATAAATAATAACTTGTACTTATCT TTTTGG
TCACCTGATAGAAACTCTTTTATATCATCCAATGTGTATTTAAATCTGTTGCGGCAA TGAT
ATTTCTTATTTTCAAGATTACAAAATCTCACTCTTTATGTTTAGTTTATGTCACTTT TAATA
TGTAGAAGGTAATTGAACTCATATAAAAAATAGTGTATATGATATGATATGATGATT TTTT
TTCTTTTTTTTTTTTCATTTGGTATGGTAGCAATTTGGATTGGAAGGATACTCCAGC AACC
ATGTTTCCGATATACAAGAGACTTATTGCATCTGGTCTACGTATACTTCTTTACAGG TAA
CTTTATTATGGGCTTATCTTAGACTTTGGTTTATGTTCATGATACAATATTTTTAAT TGTTC
GAATAAAGAACAAGTGGATTTGTATTGTTTGGAAACAGTGGAGATGTTGATGCAGTA GT
TTCAGTTACTTCAACTCGCTATAGCCTTAGTGCTATGAACCTTAAGGTGATCAAACC TTG
GCGTCCTTGGCTTGATGACACACAAGAAGTACGTTCTTCGAATATATTTTTTAATGA TAA
TTTTATATATTTGTGGTGAGAAATAAATCTTATTGTTTCGTTCTTTGTTTTTTTTTT ATAATT
TAAAGGTAGTTTGTATAATTTCTGCAGGTAGCTGGATATATGGTGGTTTATGATGGA TTA
GCTTTCGCAACAGTTAGGGGAGCAGGGCACCAAGTTCCACAATTTCAACCACGTCGA G
CTTTTG CTTTGTTG AATATGTTCTTTG CCAATCATTCT
SEQ 58
ATGGCTAATTCTTATACAAGTATTAATTTTTTCCTTGCCCCTATTATTTTCTTGGCGATT C
TGGGATTGCAGTTGCAGAGCAGCGATGGTTTTGGGACATTCGGGTTTGATATCCATC AC
CGGTATTCGGATCCGGTGAAGGGTATTTTGGACCTTCATGGATTGCCTGAGAAGGGC A
GTGTTGAGTATTATTCAGCTTGGACTCAGCGTGATCGCTTTATCAAGGGTCGCCGCC TT
GCTGAAGCTGATACAGCTAATTCCACTCCCCTCTCTTTTTCAGGAGGGAATGAAACT TT
CCGCCTCAGTTCTTTGGGATTGTAAGCTTCCCTCTATGCATTTTTCTGATTGCTTTT TGC
ACTTGTCTATATCTTTATTGTTTACTTTTTCTAGTCATATACATAGATTATATACTA ATTAT
ACATAATTATACATATATAATACAAAAATTATACCTTTTAAGTGGTTGGGTGGGCGG CTA TTTGGGTTAATTCTTCTTCTTTTTTTGTATGTGTGTTTTGTATCTGTGTTATTATTCCTG AT
TGTGAACTAGTACGTCTTTGGAAATTCTTGTTTACTGTCTTTTCCTTTTGTCTGTTT AGTG
TGATGTTAGAGTTGACTGAGCTTTACGTTTGTTTTTGTCTGTTTAGTTGGATGGCAG TTC
AGGAAAAATAGGTTACCTAACTTAAATAAGTTCCATGTGCCATTTTTAACGAGATTC AAG
TGGAGAAAATATGAAGAAGAAAAAGAATGATTTAGGCCTGTTCTGTTCTATAATTCT GTT
TGTGTGTTTGATTGGACTGGAATTTTGTCGATTTAACTACTACATAAAATACTGACT CTTA
ATTTGATTTTACTTTTCTTCTATTTCGAATTCCAAGCTCCGGAAATAAATTCCGTTC TTTTT
TCTGATTTTCCTCTCCTCTGCCGCCACTTAACTCCTCTCCAGACAAGGAATTGTTCT GAA
GTTTCTGGCAGTAGCATGTTGTAATTTATGTGTTATAAAGATAGAGTTGCAAAATCT GTA
GTATCTGTAGTTGTGATTTTTTCTTCTTAAGGTGTGTGACTAAATATTCTTTGGCAA TTTG
CAGTTTG CATTATG CAAATGTGACAGTG G GCACTCCTG G ACTATCATTTCTAGTG G CAC
TTGACACTGGCAGTGACTTGTTTTGGCTACCCTGTGATTGCAGCAATTGTGTGCGTG CC
CTCGAGACACGCTCTGGACGAGTATGTTTGCTTCATTCTAGTACCTTTTTCTTTCTA CTT
TCAAATGTTTAAAGAGTTTTTCTTTTTTTTGATCGTCATCCTCGTCTGTATATTGCC TTCT
GCTACAAGGAAGTTGTGCATACTTCTCTTCCTTTTGTAATTATGAGACTTTCTGATA ACC
TTTTTCAGAAAG G AACCTG CTGATAACACAATG G CTGAATCTGAAACACAGTG G ATTTCT
CTTCAACTGTCTTTTTCGGTCATTATGACAATAATATATTCTCTTAGTTAACAAGAT ATGG
G GTAG AG AATGTATTGAG G AAATTGTTTTTCTGTTAAG GAAG ATACATAACTAGCG CAAA
AAAGAAGATTTAAACATAATCAATATTTGCAAAGTGAGTCTGATGCATGTAATATAC TGA
CTCTGAAATGAAATTTCTGATCCATATTGTTCCGTGGCTTGTTTGTCCTTGAAGAAT TTT
GAGATTCTTACTAGCTCAAGTACTTCAACTTGTCACGACCCAAAAATCCCACCACAG GC
GTCGTGATGGCACCTAGTCTCTAAAACTAGGTAAGCCGATTTCAATTACATTTTTGG AG
CCATTTTTTTTTTAATTAAATAAGTAACCAAAACTAACAGCGGAACAAATATGAATG TACA
ATCTCCCAAGACTGGTAGTACTAAGTCACGAACTCTAACTGAATACATGGAATGATC AC
GAGGACCGAATATACAATACTGTTTGATTAAAAACTCCACAGGAGTTCACCTTGAAG AA
CAAAATTTTCTTTGCTCTTTTGCCTTTTCCTTTTAATGTTTCTGCATGTATTATTTG ACACT
TGTAATCTTTTGTTTGCTTTTGAAACAGCGAATAAATCTCAATATTTACAGCCCTAA TACG
TCGTCAACGGGTCAGATTGTTCCTTGCAACAGCACTCTGTGTGGACAAAGGAGACGA T
GCTTATCTTCACAAAATGCATGTGCTTATGGAGTTGCATATCTCTCCAATAACACCT CAT
CATCAG G GGTACTGGTG GAAG ACATCTTG CACTTAG AGACAGATAATG CTCAACAAAAA
AGTGTTGAGGCTCCAATTGCTCTGGGGTGGGTATGCTTTAGTTTTTTCTCTTTATCT TTG
GAAGAGATTATCTTTGGATCTTCTGATGCATTTCTTTATCCGCCATGATTTTTTATA TTCT
ACTTGTTCAATTTCAG GTGTG G GATAAG ACAAACTG GTGCATTTTTAAGTG GCG CAGCT
CCTAATGGTCTATTCGGACTTGGCTTGGAAAATATATCTGTTCCGAGCATGTTAGCA AG
TAAAGGTCTTGCTGCAAATTCTTTCTCCATGTGCTTTGGGCCTGATGGTATTGGAAG AAT
AGTCTTTG G AG ATAAAG G GAGTCCAG CCCAAG G AG AAACACCACTC AATCTTGATCAAC
TACAGTAAG CAAGTCACTTTG ATATTCTG G GTTTATCG GTTG CTTCTGTTTCTG G CTTGA
TTTAGGAGAATGCGACTGAATATTTATTAACTCTTACCCTTTCCTGAATTGCAGCCC AAC
TTATAACATCAGCTTGACAGGAATAACAGTGGGAAACAAGATCACTGATGTTGATTT CAC
AGCCATTTTTGACTCTGGCACTTCATTCACATACTTGAATGACCCAGCTTACAAAGT CAT
TACAGAGAACGTGAGCGACAAGCTGACTGTATGATTTTAAGTTGGAGTTTGTAACTT TG
TATTGTAAAACTGAAGATATTTTTTTTCTTTTTTCAGTTTGATTCTCAAGCAAAACA GCCA
CGTATTCAACCTGATGGCGAAATTCCTTTTGAATACTGCTACGGGCTAAGGTGAACC AT
CTTTTATAATCTTCATCATTTATTACTTTCTTGACGTCCTTTGAACTCTCAGGATTA ACAT
GCTACATACGCAGTGCAAATCAAACTACCTTCGAAGTTCCTGATGTAAATTTGACAA TGA
AAGGCGGCAACCAATTATTTCTTTTTGATCCGATAATAATGCTCTCGCTCCAGGTAA GAT
GGTTTCTGCTCCTTTTATATTACAAAAGTTCTCTTTTAGAATATCCTAATATCCAGT GATG
ATCATCAGGATCGTTCTGGCGCATATTGCTTAGCTGTTGTGAAAAGTGGGGATGTCA AC
ATCATTGGACGTAAGTATCTATCAGTTGCTTGCTCGTAAGATTTTGTTTCTATCCAT GGA
ATTCTGCAATATAACTTGCACCATGCCAGCTAATGATCTCACAATTACCAACTTTTA GAA
GTTTTGGTTCCTATCGAGTTTTTTACATACTTCTAGCTTATGTATAATTGGAAATGT GAAT GTGACAAAGTAAATTAGTAAAAACCAACTAGTAAAACTGGTTCCATTGTCAAAAGTCTGA
GCTATTTGTTGATTTACTTGGATTTTGTCTCTCTATTTGGAATTCATGACAGAAAAC TAAT
ACACGGATGTTTTTGCAGAAAATTTTATGACAGGCTATCGCGTGGTTTTCGATCGGG AG
AAGATGGTTTTGGGTTGGAAACCATCGGATTGTGAGTTCGCATTCCTGAGTATGACC TC
TTTAGTGTGCACACCTGCTCATATAATTTAACTATAAACCTTTCTTGGCAGGTTATG ATTC
TAGAGGATCCAACGACAAATCGACAACTCTGCCAGTGAACAAGCGTAATTCTACTGA AG
CGCCTTCGCCCTCCAGTGTGGTGCCAGAGGCCACCAAGGGAAATGGAAGTGGAAATG
AACCCGCTACTTCGTTTCCATCTGTTCAATCATCTAAACCTGCAGCAAACCAAGCAC CA
GCACATTTCATTTGCCAACTTATGATGGCTCTGTTTTCCCTTTTTAGCTATTATTTG ATCA
TTATTTCTTCA
SEQ 59
ATGGCGATTCATACTTCCACTCTCTCCATCTCCATACTTGTAATGCTCATGTTCTCCGTC
GTATCATCATCGGCGGCGGAGGACATGTCCATTATAAGCTACAACGAAAAACATCAC AC
GAACGGCGAGTCAACGGTCTGGCGAACAGACGATGAAGTCATGTCTTTATATGAATC TT
GGCTAGTTGAACATAAGAAAGTGTACAACGCCTTAGGAGAAAAGGACAAACGGTTTC AG
ATCTTTAAAGATAACCTTAGATACATCGATGAACATAACTCTGTGCCCGATAAAAGT TAC
AAGCTGGGTTTGACCCAGTTTGCAGATTTGACCAACGAGGAGTACAAGTCCATCTAC TT
GGGTACTAAGCCCGATGGTCGTAGCAGGTTGTTAAATACCCAAAGTGACCGTTATGC C
CCTAAGGTCGGAGATAGTTTGCCGGATTCCGTTGACTGGAGGAAGAAAGGTGTTCTT G
TTGACGTCAAAAATCAAGGGCAATGTGGTATTTTCCTTTTACCCTCTGCCTTGACTC TGC
ACCTGTTGTTTTTGTTTTCCTTTTTGTTCGTACTTATTTTCTGTTTAAAGTTTGTCC ATGCT
TTCTTTACTGATGGCTTTGATGGAAATTTGGAAACTTTAGTAGTTTGATAAGGTAAG ATA
TTAAAATAATCACAGAGTCATGAGTTTTAATCTAAGATCAATTTTAATGGCAAGTTC AGTT
GACCCTGCATTATTGTAAATTTTAGCTTAACATTAAGTATGATTAATTAGGTCAGCA CGA
TG AAGTTG ACAACTTTTG CTCCAATTTCCG CATCTAATTGTG G CAATATAAGTAATG CTT
TTTTCCCTTGGACAAAACACTAGTTTCCGGAATTGAGCTATTTTATTCAATTTAAAA TGAA
AATTTTCTGTTTTAATGTATTAGAACTATAAAGAAACCGAAACATTAAGTAAACTTC GGAT
TGATCTGTGTTTTTCGGGAATTTAGTTGTTAGTGGTCTAATTTTCGGTTTAAATGCA GTT
CTTAATATTGGATAGGCATTTTGGCACTTTTCTTGGCTGTCGCTTCTCTTACCTTAA AATT
AAAATTATGGAGTACCTACCAAGTTCAAGATCTTATGGTTGTAAATTGAATTTGTAA AAG
GGGTTCTTCTTCGTTTGCTCTGAGATCCTTCTTTTAGCTCGCTCCTTAAATATTTAC TAAT
CAGTGGTTTGTAGCTCCAACCGAGTGTCTATCGGAAACAAACTCTTTACCCTTCTAG GG
TAGGGGTAAGGCTGCGTCACTTGTGTGAACTCACTGGGTTTGTTGTTGGTCTGTAGT CC
GATATACCCCCATCAAACACCCTTGGAGTTGTTTCACTATGTCTAGTTGTGTCAATT GTT
TTGGCAAATTATGCAGCCTTGATTGATTGGATTATCTTCCATTTTATGCATAAGTAA ATG
CTGAGGAAAAAATGATATGTTTATATCACATAAAGCAACTAATAATTTTCTTCGTAA TTGG
TGTTGC AATTGG G AAATGAAACAG GG AGTTGTTG GG CTTTCTCAG CAGTTG CTTCAATT
G AAG CAGTAAACAAG ATAGTG ACAG GTAATCTG ATCTCGTTATCTGAACAAGAG CTG GT
AGATTGTGATACGTCCGATAACCAAGGCTGTCAAGGGGGTCTAATGGACGATGCCTT TA
AATTCGTCATTCAAAATG GAG GAATAG ACACTG AG G AAG ATTATCCTTAC AAAG CCAAA
GATGGAAAATGCGACCAAGCAAGGGTCAGTATGGTGTTCTCTGTCTTAAAGGGATTA TA
G GAAATG AACTAAATACAAGTTGTG ACTATTAATATTTTGTTTG CAGAAAAATG CCAG GG
TTGTCACCATCGACGGGTATGAAGATGTTCCTGATAATGATGAAAAGGCACTGAAAA AG
GCCGTTGCTGGTCAACCCGTCAGCGTTGCTATCGAAGCTGGTGGCAAAGACTTCCAG C
ACTATAAATCGGTATTACTTCAGATTTGCCTATTGTCAGTAAAGTTGTTTTCTTTTA ATCG
AATTAGCTAGTGTTTACACAGGCTCAACAAATATTTCTGTATTTTCAAAGTTACAGT GAG
TTCAGTATTAAAATTTTTAAATGTTGATCCTATTAAGTTTAAATGTTGGATCCGCCT ATGC
CCCAGGGTATCTTTACCGGAAAATGTGGTGCAGCAGTGGACCATGGTGTGGTTGCAG T AGGGTATGGTAGTGAAAATGGCATGGATTATTGGATTGTGAGGAACTCGTGGGGTGCT
TCGTGGGGTGAAAAGGGCTACCTCAGGATGCAGCGAAACATTGGCAACCCCAAGGGT T
TGTGTGGTATTGCTACGATTGCTTCTTACCCTGTAAAGACAGGCCAAAACCCTCCAA AA
CCAGCTCCATCTCCTCCACCAGTCAAGCCGCCCACTCAATGTGATGATTATAACGAA TG
CCCAGCTGGAACGACGTGCTGCTGTGTCTACGAGTACTATAAATACTGCTTTGCTTG GG
GTTGTTGTCCCATGGAAGGAGCTACTTGCTGTAAAGACCATAACAGTTGCTGCCCAC AT
GATTATCCTGTCTGCAATGTTAAAGCAGGCACCTGCTCAATTGTAAGTGATCTCTGC TT
GTTATTGTTAGATTGTCCCGCATTGGTTGAGGGGAAGTGTTGTTGTCTCCTTATATA GTC
TTCGGCAAGTCTTTTTAACAGTTAAGGTTGTTTCCTTTACTTATGGAATCATGTTTT TGTT
GATACAGAGCAAGAACAACCCACTAGGAGTCAAAGCAATGCAGCACATTCTGGCCAAA
CCTATTGGTACCTTCGGAAATGAGGGAAAGAAGAGCCCTTCTTCT
SEQ 60
CTAAGCACTTTCTGCAAATCCAATTTGTGAACTGCCAAAGTCGAAAACTGTGTGATATGC
TCTCAAGAATGCATCTCCAAGAACCCTGCAAAAGTAGAGTTGAATATATCATACAAC TG
GATCTTCATGAATATATAATATATTATAACTTATGGCAGTGAAAATAGTCTTACCAG AGG
GGACGTCGCGGATGCGCGTTTAAAGTTGTAAATCCACTAATACAGTGGACACCTTGG CT
GTCATCAACTCTGATAACATACTATTACACAGAAAGGAAGATAGTAAGTGGAAGAAG AG
AAAG C ACTG CTATTTAAAACTATAATATGTTACTAATATAC G G C CTAAAAAC AAG AC G CT
AACGGCTTTTCCAATGTACCTTACTGGAAAACCAGTTCAAGTTGTCGAAGCAATTTT AAT
CTACATTGCTCCGATCCTCCAAAAATGCTAACCGCACTCATGTTGGATCCTCCAAAA AG
TGCAATGAGATTTTTGCGGGATCCCAGCAATCAGTGGCGAATCCAGGATTTGAATTT TA
TGGGTTCAATCTTTAAGATTTTTAGTATTGAACTCATTGTATTTTGAAGTTATTGCT TCAG
TACTACTATTTATTAGATTTGACTGAACCCGGTACTAATATGATGCATCTGCCTCTG CCA
GCAACATAAATTTTAATGGGAAGATAAGAACTTTTCTCCGATATTATGTCATCTCCG ATA
AGAACAGATGTAATTCTAGCACCAGTTGCTACCTTTAACACATATTGATAATGGAGG AAT
AGTATCAG ATAACAACAAACCTG ATCTG G AG AAAGG G GAAAAGATTTGTCTCCAATG GT
AAATGTTATATGTGGCAGGGCAAAGACATCACAGTTGATAAATGATTTTCCCCCGGG AT
TCGGAAGCTTCTCACACAGCTTCAGCACAAATTACATGTATTAGTAGTAAGTAACTA ATG
AGAACTCGAAAAACAAAAAGACACAACTGTGACATGAATACCTGATTGGCATATTGA AA
CGCTTTTTCTTTTGATCTCTCTTTTCTGATCTCTACTTGTATCCAGAACACTATCAT CTCA
CAAGAAGAACATAACGACCCATTATTTGTACAGAGTCCAATTCTGTTGCATACGTTC TCC
G GTTGTAACTG CATGCAG CCAAACAAAAG GATG ACATCTAAATAAGAG AACAC CTAAAA
TACCCGCACACAACAATTAGAAGTTAAAAAGCAGCTACCCCTGCTATCAAGCGTTCC CA
GATCGAATCCCCATAACTCGAGACAACTTTTTTGCATTCCAAACTAATAATTCCTTC CGC
TCCAATGGCATGATTTATTTGAGTTAAAATAGTCTGAAATGGAGATGGCTGAGAAGA AAT
TGATCTTGCAAAAAAGAATTAAAATCAACAATGGTTTAAATACAGACAGTTGGACCA GCG
ATAAATGATGTCCCTGTATCCACAATAGCTGGACATCCATCCTTACAAAGGCCTGAA AG
G GAAAG AG CAG AATTACACTAG AAATG CATTGTTTTTATATAGTAAATCACTTATGTATAT
GAAGATATTACCTGTTGAATTGCTTCCTATAAAAAGATCCCCTATCTCAATCTGGAA TTTT
TGAAAAAAAACTCAGTTTTTTCAGGTTACCAAGTAAACAATACAAATAAAGTTCCAT TAAA
CGGAGGCGCAGGTTACCTCCCAATAACCATTTTGAGCGACTGGTACGTATGTATGCT G
ACCCCTGAAGTGAGTCCAATCCATGCCTCCAAAGATAATTTCACCCGCTATCTTAGA CG
TAGGATCTCGATTTAGCCAGAATGAGAAGATTGACTTGGTAACCATATGCTGAAGCA AC
ATGTTATACCTGAAATTAACTCACACAAAAGAATGTGGATTTCAATTTCAATGACAC AGG
TAAGAAATGAATGAGAGAACAAGCTCATTAAAACTATGTCAGAAAGCATAAATTACT ATG
TCAAAAGTTATTATAATATGAAGAGATAATAATTTACCATACTGGTGTGACATTCCT TGAT
GTCGTGCTCTGATCAAATCCAAGTCCTAGTACTCCATCAAATCGTGCACGCAACAAT GT
CAAGTATCCCTCCCGTGTTACCTCAGTGAAAACCTGTTTAATAAATTTTATTCAACA TGT AACTTG AAAACATATATATCTAC AATTTCAG CTG CAAAGACCG G CACCTG CTG CTTTAAG
ACAGCACCTCCAACTTTCACATTGTCTTGGCTGAAGAATCCATGAACTGAACCAGTG CC
AAAAGGGATTTTGCTAGACTTTCCTATCATAAAAGCCCATGTATAGAAGTGATAAGT ATT
TCGATTAAGCTATTCTACGCTTAAATATCAACAATGCTCTAACCAAAATAAAGGTTT GGA
GGTCCATTGGGTAACTACCAATTTTTGTATACGTATTTGATAGTCTTGATTTGTACC TGG
AACGAAGATAACATGCAATCTGCATTGGATAAATCCATTCAAAAGTTAGTTAGTTTC ATG
TAGTGAAAATTGTTAATCCACAACTTGACAAAGACTAACCGAGAAGAAACATCTGGA AG
AAGGGACCCAAAGATTGGAACTTCCAGTATCAAACACAACAATGAAGCGTTGGGGCG G
TGAACCAATACCAATCTCCGCGAAGTACTGAACATCATGATAATTTTTGAGGTAAAC TAT
CTGGTCATTCGGAGCAGCCAAATTTCTATTGCGACCCCTGAGATCTTTAGCGTAGAT TC
TTGCATCGCTTATGCTAGAAAGGTCCAACGATTGCCTTTTTAGCTCAATCCTAACCA TAT
CATCAG CATATACGTTGATG CAG GTTATATAC CATATTAC AAGTG ATGCAAGAAG G ATTT
TGATCTCCAT
SEQ 61
ATGGCGTCAATTTTCGCTCTTTCATTATTTTTCATTATTATCTCTTTCTGCATCACTTCG A
TCACCATTCCCGTTCAATCCGACGGTCACGAAACTTTCATCATTCACGTTTCTAAAT CCG
ATAAGCCCCGTGTGTTCGCCACCCACCACCATTGGTACTCCTCCATCATCCGATCCG TT
TCTCAACACCCTTCTAAAATCCTCTACACCTACTCACGCGCTGCCGTGGGCTTCTCC GC
CCGCCTCACCGCCGCGCAGGCCGATCAGCTCCGCCGTATTCCCGGCGTAATCTCCGT
CCTTCCCGACGAAGTACGCCACCTCCACACCACCCATACCCCTACCTTCTTAGGCCT TG
CTGACTCCTTCGGCCTTTGGCCCAACTCCGATTACGCCGATGACGTCATCATCGGAG TT
CTGGACACAGGTATATGGCCGGAAAGACCGAGTTTTTCCGATGAGGGTCTCTCTCCT G
TTCCTTCAAGTTGGAAAGGGAAGTGCGCTACTGGACCGGATTTTCCTGAAACCTCAT GT
AATAAAAAAATCATAGGTGCCCAAATGTTTTACAAAGGCTATGAAGCTTCACATGGC CCA
ATGGATGAATCAAAAGAATCGAAATCGCCAAGAGATACTGAAGGACATGGAACACAC AC
AGCATCAACTGCAGCTGGTTCTGTAGTGGCAAATGCTAGCTTTTATCAATATGCCAA AG
GTGAAG CTAG AG GTATG GCTATAAAAG CAAG AATAGCTG CTTACAAG ATTTG CTG GAAA
AATGGTTGTTTTAATTCTGATATATTGGCTGCCATGGATCAAGCTGTTAACGATGGT GTG
CATGTGATTTCACTTTCCGTTGGGGCTAACGGTTATGCTCCACATTATCTCCTTGAT TCT
ATTGCAATTGGAGCTTTTGGTGCATCTGAACATGGCGTCCTCGTCTCATGTTCAGCT GG
AAATTCTGGTCCCGGCGCTTATACGGCAGTGAACATTGCCCCCTGGATTCTCACCGT TG
GTGCTTCAACTATAGATCGTGAGTTCCCTGCAGATGTTATTCTAGGAGATAATAGAA TAT
TTGGTGGCGTATCATTGTACTCCGGCGATCCATTGACCGATGCCAAATTGCCGGTGG TT
TATTCCGGCGACTGTGGTAGCAAATACTGTTATCCAGGAAAGCTAGACCATAAAAAA GT
CGCTGGAAAAATTGTTTTGTGCGATAGGGGAGGCAACGCTAGGGTTGAAAAAGGGAG T
GCAGTGAAGCAGGCAGGCGGAGTAGGGATGATACTCCTTAATTTGGCCGACTCCGGT G
AAGAGCTCGTCGCCGATTCACATCTTCTCCCCGCGACGATGGTAGGTCAAAAAGCAG G
AGACAAAATAAGACACTACGTAAAGTCTGATCCTTCACCGACGGCGACGATCGTGTT CA
GAGGAACCGTGATCGGAAAATCACCGGCGGCGCCACGTGTAGCGGCGTTCTCGAGCA
GGGGACCGAATCATTTGACGCCGGAGATTCTCAAACCGGATGTTATTGCACCTGGAG T
TAACATTTTGGCCGGTTGGACCGGATCTGTTGGACCGACCGATTTGGATATTGACAC GA
GAAGAGTGGAATTTAATATTATTTCTGGAACTTCCATGTCGTGCCCTCACGCTAGTG GA
TTGGCTGCGTTACTTAAAAGGGCCCACCCTAAATGGACCCCAGCAGCGGTAAAGTCA G
CACTCATGACAACAGCTTACAATTTGGACAATTCTGGTAAAGTATTTACAGATCTTG CCA
CTGGCCAAGAATCTACTCCTTTCGTTCATGGATCAGGTCATGTAGACCCGAACCGAG CA
TTGGATCCGGGTTTGGTTTACGATATCGAAACTAGCGATTACGTGAATTTCCTATGC TC
CATTGGCTATGACGGCGACGATGTCGCCGTGTTCGTGAGAGATTCTTCTCGAGTGAA TT
GCAGTGAACAGAATTTGGCTACTCCAGGAGACCTGAATTACCCGTCGTTCTCTGTTG TT TTTACCGGTGAGAGTAACGGTGTGGTTAAATACAAGCGGGTGATGAAAAATGTAGGGA
AAAATACAGATGCTGTTTATGAAGTGAAGGTGAACGCGCCGTCGTCTGTGGAGGTGA G
TGTGTCGCCGGCGAAGCTTGTATTCAGTGAGGAAAAGAAAAGCTTGTCGTATGAGAT TA
G CTTTAAG AGTAAAAGCAGTG GTG ATTTGG AG ATGGTG AAGG G G ATTG AATCTG CATTT
GGGTCGATTGAGTGGAGTGATGGAATTCACAATGTGAGAAGCCCAATTGCAGTGCGT T
GGCGTCACTATTCTGCGGCATCCATT
SEQ 62
TCACATAGGAGCAAGATGACCTTGTTTGGACAATTTATCTTGCATCCACCTGTGAAGCA
TTTCCAGTGCTGCCTTAGGTTGATCCATTGGAACCATGTGACCAGCATCATGGACCT TT
AAGAAAGTTAAAGGCCCATAGTTCTTTTGAACACCTTTCTCTACACCATCTACTGCA AAA
GAAACTTGTGTGGCTTTTCCAAAGGCTTTTTGCCCTGACCATTTCATTGCATGCACC CAT
CTCGAGTTTCCTGCCCATTATAGCAAGAAATTGAGTTTAGTTGTCAATTACTAGGTT GTT
TTCATCTTTCAGTTTATTGTAACAAAATTATGTTTATATATCACATATAAAATAAAA ATAGT
TACCGGAAATATATACTAAATCCGGTCAAAGAGAGATCATATCCATGTAACACAATG TAT
AGGAAGGCCCATTTTTTTTTTGGTTGGTCGGGTAATGTTTTGTTTTGACAATAAGTA GTG
TCACATGGATATTCTGAATGTAAGGTCAGTTGCCGATATGACTATATTATTACATGT AAA
TGTTATACATTTGGCAGCCTACTTGCACTTTCTTAGAACCGGCTTTAACATTTTTTG ACTT
TGTATTTTAATTTTCTGTGATACATAGGTATAATTAAGTTATTTTGCTTTACAGAAA CATG
CTTAATTTTGTCCATATATGAAATTAGATGTTGGGTCAGAGATGAATGAAGCTTACC AAG
CCAATTGCAGATAAGGTCATATTCCCCAGCATACACAAGTAGCTTGATACCATCCTC AA
GGAGTGAAGGAATTCCCTCTTCAAGATTCCTCATCCAGTCCAACTGCATTGCCTGGT AA
ACCTCAGAGCTACATGAAACAAACTCAATATCCCCAACACCAAGTGCCTTTTTAACT TGT
TGGTCATTGAGGAAAGTTTCCATTTTTGAGAAGTCATAGCAGAGATCACCCTCACAT CT
CTTCCGCACGTCATAGTACTGCAACTCGGAAATTACATGACTTTAACTTCTCTACAC TAA
CTAACAATGTAAAGAAATTTTTATTGTTACTTAAAAGATAACTACAGATAATAAGTG AGAT
CAGTAATTTGGAAAATGAGACTGCAAGTAGCGTGTTGTACATCATTAGCTCAAAGAA AAT
GAGATTGCGCTTTCTTACATTTTTGTCACCAGCAATGTCCATAATCTTGTTGAAGAT GCT
TGTACAAACAAGATATGCAGCCATGCAAGCAGTTCCGCCATCTTTTCCTAATATTAT TTG
CATAG AAATAAAG CTAATTAATGTCAAG ATTATATTAG CTG CTGCTATATGAAG GAG AAA
GAACTTACAGGATAAACAAAAAATTAAGAATTACCACAAAGCTTAATTGCCAACTGA CAT
TTTGGATATGATTTCTCTATGGCATTGTAATCAGATTTTTTTATCAATTTCATATCC AGAG
CATAGTCAGTGTAGGCTTTGTATTGAATTTCTGGATCAGTGAGTCCATTACCAATAG CAA
ATCCCTATACAAATTAAATACACTTGGTTAAGTTATCGGCATGATGACAAATTTAAA TTAA
ACCTACCTAAAAGTTTAACTGAAAAAAAAAAAGAATGGTGGAGGAGCTAATGAGTTA GA
AATACCTTGAGATTTACGTAGATGCCTTCTTTATTTTTGTTTCCTTGGTGGACCCGA GAA
GCAAATGCAGGAATGTAATGCCCAGCATATGATTCTCCAGTAATATAGAAATCATTC TTT
G CATACTG AG G GTGTG CCTTGAAG AAGG CCTAT CAT CAAAAGAATTTGAATTAAATTTTA
TTAATTATATCAGTTAAACTTTAGAGACTTATCACGAGCTAAAAAAAGAAGAATGAA AGA
ATAAGATCAACCTGCAAGAAGTCATAGAGATCATTGCTTACGCCCCTTTCATCGTGA CG
AATATCATCATCGTTTGAACTATAACTGAAACCAGTTCCAGTTGGCTGATCGACGTA TAT
AAGGTTTGAGACCTGTCAAATTGCAATTTATCTTATGTTATCATCATTCTTCAACTA ACAA
ATGAAAGTTGCATGTTTGATTATAGGATTTAACCAATGTAAACGACTTTTACAGTAT TGTT
ATATATATATATATATATATATATATATTAACATGTTGTATTAGTCCGGTTACTAAT CCCAC
TTAAATAAAGAGAAGCGTAGTAGTCATTGCTGTCAATAAGCGATGAACTACTTTTAA ACT
TTTG AATTCTACAAGTCACAACTAATG AACAAGTGATAAAGAAAG GAAATG CTAGTAG GT
AAAAAG GTACTTTG CATGATG GAG CAAG GTTG AGTAAC AAATAAAAACATG GAG GG AAT
TCTTTTAGACTTTTACCATATTCAAAAGATCTAACCGACGTTTCTTGAAAAATTAAT TGGG
TAAAATAAAAAAAATAAAAAAT AAAAAG C AAAAG G AAG AAG AC AAG AAC CTTGTC C C AG C CGAAATCATTCCAGACAAGAGACATATTATCTGCAATTTTGAATGGTCCGTTTTCGTAAA
ACACAGCCAATTCACTGCTACATCCTGGCCCTCCTGTTAGCCATATAACTACTGGAT CA
TTCTTCCTGCTTCTCGATTCAAAGAAAAAGTAAAACATCCTGCGAAAAACATATTAA AAA
AAC AC AC AG ATAATTTAG C ATTAACTAATAATAC C C ATAAATG AAG C AAAAAG AG C ATTT
AATCCAATTCAAACCTTGCATCTTTAGTATGTGGAAGACGATAATAACCAGCGTGAT GAC
CCAAGTCTTGAACTGTAGACCCAGAATTACCAACATAAGATAAATTCAATTTCCTCT CAA
AAAGCCTCTGTTCAGAATCCCCTGTTGCTGCAGCCTTGTTAATATCATGTTTAGGGA ATA
AATTAAGCTGTCTGATTAGCTTTTCTGCCATTGTTAATGGGAATTTAGGAGTAGAAG ATA
GGAAAAACTCATCGTCATTAGAATTTAAAGTTGATGAGAAAGATAAGGAAATAGAAG CAA
GAAGCAGAGTAAGAAAGAGGGATGAAGGCAT
SEQ 63
ATGTTAGTTATCAGTGATTGTTATATAAATTCTTGCAAAGCTTTCAACTTTGTGATCAAT T
TGCCCGTCATGGGACACTCTCACTCTCATTCTTCTCATTCTCACTCTCACTTTCACT CAT
CTAAATCTTCCGATGATCAAAATATGGATATGGGGGAATCGATCACCACCCAAACAG AC
GTTTCTTTCATGCTCGCTAAGCATGTTTTCTCCAAAGAAGTTAAGGGCGATTCCAAC CTG
GTGTTTTCTCCTCTCTCAATTCAAATAGTACTTGGCCTGATTGCGGCCGGTTCTAAG GG
GCCAACTAAGGATCAGCTGCTCTGCTTCCTCAAGTCCAAATCCATTGATGAACTCAA CT
CTCTTTATTCTCATTTTGTCAGCGTCGTCTTTGTTGATGGCAGCCCCAATGGAGGTC CT
CGTTTGTCTGTTGTTAATGGTGTTTGGATCGACCAAACACTGCCTTTTAAGCCTTCT TAC
AAAAAG GTTGTG GAT AAAG TTTACAAAGCAG CTTCCAATTCTGTTGATTTTCAGTG CAAG
GTTAGGCCTTTATTCGTTTGTTTCATTCAAATCTTGTTTCTTTTGTGCTGGGGTTTA ATAT
TCTTTGTTCATGCTGACTGCTGAAAATTGGTTCTTTAACTAGTATAATTGACCCTGC ATAT
TACTCTCATCATAAGCCCTCCAAATATATCATATAAAATGGATATACATATAGTAAA CTGC
AACTAATTAACTTGGGATTGAGGTATAAATGATTGATTGACCAATCTGACTTTAAAA TAAT
GAAAAGTGTTAAACAATTAGGACAGAAGCTATATTGCTTAGCCTCAAGTAGTAACAA AAC
TAAATAATGTCAACGGTTGATACTCGTTTCACAGAATTGAGGCAGTTTAAAGAGTAA AAA
GTATTGGTTGTTAATTTGAAAAGTAAGATGAAAGAGTCACAATTCATCTTCATCAAT GCT
TATTGTTTAGCAGGTTAGTTGACTAGTTCGACATTTTACTGAGTGGTAATAAGCTTC TTTT
TTGTAGGTAGCTAAAGAAGCCTAAGTAGTTCTAAGCTCAACTGGATATGTGGCCGTG CT
TAATTTTGTAAAACTTCAGTTTTTGGCCTAAATCTACACCCAATCAGTGCTTAAAAT ATAC
CATGTAAAGCATCCAAATTCTCACTTACCCCTTGCAAGTACTGTAATCAATCTTCTT ACT
GCAAACTCCCTTTGTTGGCTAAAGCATATACGTGTTAATTCTGTCGTATACTCTGTT TGT
CTTGCTAATTGAATAAGGCTGCTGAGGTTGCCAATCAAGTCAATCAGTGGGCTAAAA TG
AAGACAAATAATCTCATTAAAGAGATTCTTCCTCATGGAACAGTAAACAATATGACA AGG
CTCATCTTTGCAAATGCATTATATTTTAAAGGAGTATGGAATGACAAGTTCAATGCT TCA
GAAACAAAAGACCATAAATTCCATCTCCTCAGTGGAGGGTCTATTAAAGCGCCGTTC AT
G ACTAG CAAG AACAAG CAATATG CAGTAG CCTTTG ATG GCTTCAAAGTGTTGG G ACTTC
ATTACAAGCAAGGCAAAGATATGCGTCGTTTCTGCATGTATTTAATTTTGCCAGATG CTC
GTGATGAATTACCAGCTCTATTGGACAAGATTAGTTCAGAACCTGGTTTTATAGATC ATC
ACATTCCGTTTGAAAAAGCTAAAATGCGCAAGTTTCTTATCCCTAAATTCAAAACAA CTTT
TGGTTTTGAAGCTTCCAAGGTTCTAAAGGGACTTGGCCTCACATTGCCTTTCTCCAG TG
GTGGCCTCACTGAGATGGTGGATTCCCCGTTAGCTGGGAGGTTGTTTGTTTCGCAGA T
TTTTCACAAGTCCTTCATTGAGGTAAATGAGGAAGGAACAGAAGCTGCAGCTGTTAC AG
CTAGTGTAATAATGACCAAGTCCTTGATAATTGAGAAGGAAATGGAGTTTGTTGCTG AC
CATCCATTTCTATTCCTTATAAGAGACGAATCTACCGGTGCTGTGTTTTTCATAGGG AGC
GTGCTGAATCCTCTAGCTGGT
SEQ 64
TTAGCTTGAGCAAGCTGACTGAAGTTCAACTGCATTTTCATCATCAGTAAGGTCACTAGA
CATGGCATGAGGTATTCTATGGCGTTTCAATATACGCGATGTGGCTATTCTTGCCGA TT
CATAGTTCAAAACAATCACCTTCTCATCATCCAAATCAAATCTTACATTCTTTTGGT TACC
ATCTTCCACAAGCTGACGTAAATGTTTCAAATTCAGAACTTCTACGCCATTAACCTT CTT
CACCTGTACAAAACGGTTCAAAATTTCCCCGAAAACAAAGAGGAAAAATAATTATAA GG
CGCCCTTATGTTGTCTTTTAAGGGAGAATGAAGTAATAGAAGCAACAAGTGTTTTAC CTG
CAACTC GG CAAG G CGCTC ATAACC AG CATTAATATCATC CATCAACAC CTAATTAATTG C
AAACG GTCATATTAG CAG CATATAG CTCATTG ATCTGCG AGTACAATCCTAAAG G AAAA
ACAGACCTCCGTCTCTATGTATAGGTTAGGAATCATGTTGCGAAGCAAAAGTTAAGG CA
CAAAAAATTGAAGTAAACAAATACAAAGTTTATTATTTATGAAGTTAGCCAAATGAC ATAG
ATTGTCAAGTAAAATAAGACATGCCTTCCTCCCATGTCAGAAATCTTACTGGAAACA TTT
ACAAAAGGTTATCTAGCAGCAAATATCTTAATACGTTAACAATGTCTGCCCATTTGG ATA
GACCCGTCCACCATCAGTTTTGTGTGTCTCAGTATGTATTTAGCATGTGAAATTATG GTG
GAACTTCATGGCGTCTTTATGCTTTTTTGTTTTTCTATTTTTATAAGAGATCAAATT ATTTG
AGTTTTCTATTTGACAAAAGGATAAGAAACTCTAGATACCTGAGAAAGGATGATGAA TTG
TTCACCAGGTTTCTTAGGTAGTTCCCGAAGGGCTCGTTCACACAACCGACGGGGTGA G
GCATTATACCAGTCTTCTCCATACTCGTGAAGGAATGGTTGAGTTAATGGAATAAAG AC
GAG AC CAG CAAATATG AAATAACTCG GAAGCTTGTCAAATTGATG AACTG GAAC AAGTG
GCTGCAACTGCATAATGTTAACTAGTTGTTAGATACAGTAGTCAGTTACACAGCATC AAA
GAAACCTTTAGATTACCAATCATACAAGTTGATAAAGTGTTAAATCTAAGGCTTAGC AAA
AAGTGCAGTCACCTTCCCAACACCTTACTCCTATGCATCAGATTCCCAAGAGCGACA AA
GCAAAAAATAACCTTGGTCCACTCGCACACCAAAATACTTCTAACAAAACAAAAATT TAC
CTTGCTCCAATACCGCACCAAAATACTTCTAACTACATAAATTATGTGACACCCCCA AAT
ACTTCAAGGCTTCTTTCATCTTTTCTATTTTTCTATTTTTGTAAAAGATCAGATTGT TTAAG
TTTTCCATTTAACAAAAGGATACTTGATGAGAGCAACTATAGATACCTGAAAGAGGA TGA
TGCATTGTTCAACCTTCATATTATAACTTGTGCAAATGATGAATTTACTTCTACCGG AGC
GATAATGTCAATAAGTTATTTTACCAGAGTAATTCTAATTGAACAATGTAGATGCAG TGA
TTTCGATCTATTCGTCTGATTACATGCACCCAGGATATGAGCAGAAGTTTGAAAAAT CAC
TGGGAGATACATTCGAAACAACAATTAATTATGAAGGGTTAATAGAAGGGTAATTAT GAA
GGGTAAATAAAAGTGACTTACAGGATGAAGCGTGATTTTGAAGTCATGCACTTTGCC AT
TTCTCAAGACTTTAAGTTCAGCAGTTTCATTAGGTTTCTTCATAGATACCAGATGGT CAA
ATGTGATCCTCTCTCTGTTTCGGAAAGGAACTGCAGATCATAAGCAATCTCAAAACT TTA
GTCCAGG G AAATAAG GTTG CTAATTTTG ATATG CATTTTAGTCAAAATG CAG AAG GG CA
AACTTTTCTAACAGAAAATAAGTTATTCTAGCCTTACAATTTTCATATCCTGCACAC CCAC
TTATCTGGTTTCTGGTATAATTATCAGCTATTCACATGGCAAGAGAAGAAACTCAGA ATT
AAAAACGACTAGACTCTAGGCTTTTCCATCTCTCAAAAGGAGGTCATTAATTTGTTA CTC
ATAGTTGCTCGTTGAACATGAATACTTTTTAGTTGGTGCTGGCCTGCCTAAAAGAGC CT
CTACAGCAAGAACCACATCCATGTTTCATCCAAATCCGTCATTTTACGTCACGAAAC ATT
TAGAGAAAGAAAAAGGCATGTGCCATAAAATGTAAGAAAAAGCTGTGAAGAAGAATG TT
GTACTAGTTTAACTGTC GTACATAAGAATTACAAG AAATAG AG AGAAG CAAG G CAATAA
GAGAAAGCCTTACCTGTTCCATCATTTGCTATGGGTACGCCATCAAATGAGAGGATT AT
GTCGTCTTTCTTTAATACTCTAGAAGCATCAGAAAGTGGGTTGATTCGGCTAACAAG CA
CACCTGTCAATTTGGACTGCATTTGGAAGTACTCTCGAATTTGTGCATTTTCAGTAG GTT
GGCATGACAAGCCCAGAGAGCAAAACCCAATGTATTCACCCCGTTCTTCTACTCCAG CT
ATAAAATGCTTTATCACAGGAACAGGAATAATGTAGCTGCAAAATCCAAATAGAACT GAA
ACTTTTAAGGCCAACTATGCACACTGTAAATTTCCTTTCCAATTACAACAGTTTTTC TCCA
AGGTAGATTGTTTAATCAAGGGTATCAGTTTAGATTTGATTGTTTGCAGCACTGAGG AAG
AGTCGAAATAGAATGAACTTGAATTCGAGACCGCTATTTGAAATCATAGTGAAATAC TGG
AATTTTTATCTCATGTCTAAGAGCTACTAAATGTTCCCACAAGCTAAGCAAATGTTG ATTA AAACTAGTAAATGTCATCAACCAAATCTCCTTATCAACTGCATGTCATCACAACTAAAAG
CTTTCAGGCATTCCCACATATGCCATCCTTTGTAACCCCTCTGAGATGAAAAGAATA ATA
TTATGAAG CTAGAG CCAAAGG G CTACAACTCAAG CTTCAAATTTGTG AATG CATG AACA
AGGACTGCGTGAAGGGAAAAATCTGAATATTATGACGAAGAAAAATGGAAGAGAAGA TT
TG CAG G AG AATACATGTGTGAAG G G AAGAATAACATG CAGTCAG ATTCAGGTAAAGG A
GAGAAAAATCTGAATTTTGTGATGATGCAAGTGGATATTAGAGTATATACCCCATCA ATA
TTTAACTAAATAATATATG GTAG AC CCAG CAATAAATG ATG CAATG G AACTAAATCTGTA
TAGTAGTTCTATCCCTCCGGGGTAGGGGTAAGGTCTGCGTACACTCTACCCTCCCCA G
ACCCCACTTGTGGGATCCTACTGGGTTGTTGTTGTTGTTGTAGTTACAGAGACACAA AG
TACCAAAATAAAATTCTAATTTACCCAATATTCTCTGCACCAGAGAGGTTTTGGAAA GCA
ACTCCAGCAACTTTGTCACCCATAATTGCTGGTCCTCCACTATTCCCTGGATTTATA GCC
GCATCAATTTGTATTGCCAATAGTTGACTAGCGCCGTGTACATATTGCGTAGGTTCT AC
CCTTGAGACAACACCTTTTGTCACGGATATATTATCTCCCCCTAGAAAAAAGCAAAA ATT
ATTAGAGAACTCCTCAGGTGAAGGTATTTGTGGCTTACAATTAAGTAAAGAAAAAAA AAA
AGAGAACATAGTATAGTGAAGAAAACAAAAATATAACTAGACGTCAACAAAAGATTA AGA
AGGATCTGCACTATTGAAGACAAGAATCTAGTATATGCAAGCTACAAATATCCAGCC TT
GCACCTAGTTGACACCAGAGAGAAACAAAATACATCATGAAAGTTTCCTTTTCACTC TTC
TGGATCTTATTTGTTCGCTGCTCGTATGAGCCCTCGAAAAGGGTACACCGAATCCGA GA
AAATAGTGCACTCTCTCGCGGAGTATAGCTCACATCCAAACATCTGATTAGGGAATG GG
GCAATGCCCATGAAGCTCTGGCGGAAAGGGAAGGCATGCCAGGCCGTATGCCTATGG
GTGCACAATTCTTCGAAAAAGCGCATGCTACCTCGGAGACCTGGGACCTTGGCTTAG T
AATGAATGAAGGGAAGCTCTTCGAGCTTTCTCCGCCAGCGGCTTATGTAGTGGTCGG C
CTTATAAAGCTCGCTAAGCCTCGCTTCCCTCTCCCCTTCACTTATTAATTAAGTGGA AAA
TAGTCGTCGGCATTCTATAAGCGACTTGACCGAGTCTACGAAGCTTTGCTTTTCTTG TA
GTCGGCCCGTAATGCCTCCCTTCATTTGCTTGCCTCCTTTCAGCTCAGAATGCCTAA TG
CCTATTATTTAGTGCTAGAAGCTAACCGCCATAAGCTCACCCTTCGGGTCTCGCTTC CA
GCGCAGGAGGCCAAGCATTCTGCCAGACGTCCCGGCCTGGGAGCCCCGTTCGCCTTT
GGTNGCTTATTAATTAAGTGAAAAATAGTCGTCGGCATTCTATAAGCGACTTGAGCA GA
G GAG ACAAG AAAAG G CTCAGTG GTAACTGTTATG AATGTGG AGG AGTG G GG AATTTTT
GTCGTTACTGTTACAATGTTCAGACGGACGAATGCTGCTAGGGGAGTGCAGAATAGA C
CAGCTGAAC GTAGTC GG AGAAG CTCTAAAG GTAAGTTC AAGTCG GAAACATTGTGTTTC
CACAAGAAAAGAAGCACATATTTTTTATAGGATTTGATTTGATAAATGGAAGGAAAT TGT
CAATTGAATTGGCAGAAAACTATGGTAAAGGGGAGCCCAACGTCCTCAGCAGAGAGG A
ATGGCAGATGCATTCCCCCCNAATGACATTAGCCTAATTTATTTGAACTATAGACCA AAA
AGAGCCCCATTGGAGTGAAATATATATAGTACTCATACTGCTGTTCCCAATTGATTT GGA
TTGAGGTGTAGCTGATTGGTTGATTTCGTGAGAAAACATGAATAAAAGGAGCTCACT TG
TGAATTGTGATTCATCTGCCGTTGTATTTTAAGACACGAAAAGAAGAATGTGAATTT CAT
ATTAACTCGACACCGTATCAACAACTCAGTTATTTTTATTTTTCTGAAAATTATATC AATA
ACTCAGTTATTTCGAATAGGTATAGCATCCCATTGCTAAACATGAATGTTGTATATC ACA
CAAACAATAGGGGGGAAAGGGAGGTTCAGAGTTCGATTCAAGTGTATGGGGGGTTGG
GTGTTTACATTACATCTTGAATCAGTTACAAGAAAAATAATTATCTTGAGGATGACC GCT
G ATTAAAAAAAAAATTAC CTTG AG G ATAAC C AAC AAC AG C C AC AG CTTCTTG G AG AAAT
GGAACATCACCAAGCTCCAAAGAGTTCATGCCCTCCCAGAATTCTTCACTTTCTACC AC
CAGAATAGCCAAGTCACATTCATGACCAACAGCTTGCACTGTTGCTCTATACTTGGT AG
GAGAACCATGCTTTCTTACAAGTACAAACGTATGATCAGCCACAACATGAGCATTTG TTA
GGATCCTCTTTCCCCGAATAACAAAACCTATAACATTAAGAATGCAAAATCCGAAAG TAA
GCTTTATGTTCTCTTAAATTATTAGTAGGTCAGATACTTGATAAGTTGATATGAGCT CTGT
ATTCTCTTCTTTTAAG AAAAAGAAATTATTACACATAG AG G GTG G GG AAG GG G AAATG G
G G G AG G G G ATTAC AAGTG G G G AATC G AAAC C CAAACTTGTAAC AAAAG C AG ATAAG AG
AATTAACCTTCCAAAATCGAGTCCCCATAACAAATCAATCAACAATCCTCAATCCTA AAC
TAATTGTAGTTACAACATCAATAATTTCATGCTTTAATCCTAAACTAGTTGGCGTTG GAGT TG GG ACAACAAATCTCTAATACTTG CAG CCCTTG G GACAACAACAAACTATTTATCTCAA
TCAGTGGCGGAGTCACCTTATACCAAGGGGTGTCAATTTGACACCTGACACCCCTTC AC
GGGAAAAAAATATACTACATAGGTAGGTAAAAAAAATTATATATATGTTGACTCCCC TTA
ATTTTTTCGTCTATTTACTTATATATATTTTGACACCCCAATGAAAAGCATGCCTCC GCCA
CTGATCTCAATGAACAACAACTTCATAAAAAGTAGCCTTTTGACAAGGCTTCCTTAG TAA
ATGAAGTGCCAATGTAAGATTTTCACAATAACCAAATGGCTAGTAAAAAAACGGAAG CAT
TACACTGATAGAGAATAACATATTTAGAAAGTAAATGAAAGAGAATAAAAATACCAG AGC
CCGTAGTTTCACGCTGGGACTTGTTCTGCCATGGAAGGAAGTAATTAGGACTACTGG AA
ACAGTGAATATTTTAACTACAGAATCCAATGCTAGCTCTATTGCTAAATAAGCATCC ACC
ATTCCACTACTTAATCGCTGCTCCACCGCCGCCACCGGTTCTACTTCCGATGCATTG GA
TTGAAGACTATCATTTTCCTCAGCTCTCTCGACATGAGGTGTCGTAGAAGTTGTTGA GC
TATTGCTGTTATTCAAGGTGGAAAAAAATGAGGCGGAAGCAGAAGTATTACCAACAG TG
CTGCTATAATTGCAGCGCCGAACAAATCGATGAAGCTCTTGACGTCGGTGATGTACC G
GAGCTACATCTCCGGCGATAATAGGGGATTGAAAGTGGAGATTTCTGTTTAAGAGCT TT
CGTGCTGTACGGAGACTTGGACCTATTCGTAACAT
SEQ 65
CTAGCTAACTTGCTTTTCACTAGCCTTGTAATACAGCATTTTCTGCACATAGAACATCAT
GTATCCTTGCGCAGCTCTCACAATGTTTTCGCTCACTTGAGTGATCCAAGCATCATC AC
ATTTGTACCATTGATTACTTAACCTAAGATATGTTACGTAATGACCAGCATCAAGTT TAC
CGGTATGGGTGATGACAGCAAACAGCTCAAACTCTGAGGACGATTCACAGGACGCAT C
TTGCTCGTCCCCATCAAAGGAGAAGATTCTATTTCCAAATCGACTCCTCAAGATAGA TG
AAGAAAGGTAAGGCGACATGTCCAAGGAAAAAGGAAACTGTAGGTAGTGATCAACCT T
CCTTGACATTTTCTTAATCACAGAATGCTCAAACCTTTTGATATGGAAGCAAGAAAC CAA
AGGCAATTTTCTTATGGACATCTGTTTAAGAGATTCCTGTCTCACTTGACAATGTTG GCA
GAAGAACTTCTGATCAGAACCCAATTTCTCAGGTCTTGTGAAATGATCTAAACATCC CAT
CAACGAAGAAATTCGACCGTTTTGGCTAAACTTTCCAGATTCTGCTTCCTTCTTGTG AGT
ATTATGAGACTTCTTTGATGTCATCTTTGAGGAACTCCCCTGGCTCAGTTCCAAGTC CAA
GGAGATGTCTATACATGGATCATATGTAGTAGATGTGAAGCCACAAGCTGTACACAT GA
CATCAGACCGCAAGATTCCAGAAAATACTCTATGAGCAATGCAACAGTCTCCACTAC CT
GTGCCAAAGAAAAAGATTTGATAGAATGAGATCACAGCTGAAAAAACACAATGTGTA CT
CCTAATAAGTTACACAGAGTAACATACTTAATATGCACAGCTTAACGTGCTCCTCGA CAC
ACTTCATAACAAGTAAAAGTCCAGTTATGCCATAAAGTTCTATCAGCTATTGCATTA CTA
GACAAAACAAAAATCGTCCTATCAGGAATGGCAACCAACAGAATGAGTAGCTAAAAG CA
ACGGAGTACTATTGAAATAAAAGTAAAAAACTCAGTAAGAAATGTATCCGCAAGACA CAT
TTATTGCCTCAAAAGCTCTATCTATTCCATTTACACAAAACCATTCAGAATCATAGA TAAT
GTGTATTCATTATTTAAATGCAATACCCAGATGTTAACTAAATTTTGCAAACTCAGG GCA
GATAGCCCTGCACCAATCCTCTAATTAAGAGGAATGAGAATAGGTGTGAACTCTCAC AA
TCAAAGTACGTGTCACTTAAGTGAGAACCAAATCAGCATATTTACAATGGTAGAAAT GAA
TTGTTGCTAAGGAGTTACCCCTCTAATGATTGCTCTGCTAAAGAAATGCGAGGTACC GA
G GAG CTAG ACTC AC AAG AATAATAAG G C AC ATAAG G G CTTATAG AAAG GAG G AAG C AA
AAG G CTGTCAACCTAGTAAAG ATTTCAAG CTTTCTTG AAG GTTGCG ATCAG CTCACTCG
AGAGTAATACCCTAAAACATGGTAAAATTGCGAGTTAATACGAGAACTATTGTACCA GAA
G AAAACTCCTATG CTGAAATCAATAAGACTAACCAAACTG AG ACTTACTG G G GAGAG AG
TAGGAGGAAATCTTTTTCCTTTTGAGAAACATCTCTAAGCCTGACAGCGAATATCAA CTT
GTAAATCGCCTATGGCAAAGAAAAGACCTAAACCATAACCTGCATTCAAATATTACT ATT
TTTTCTCAGTGACAATGGAAGTTGGGATTGCCATGAGAGATGAGATGCTAGAACAAA GA
TAAGCCCATCAAGCAGGCCCTGAGTGCCTTTTAGTCAGACGTTACTAGACATCACAA AG
ATGCTTGAACACACTATTCTGCTTCTGACAGAAATTGCTTCTTCCACCCCCTCCCCA CCA ACAAAAGAAATTCAAAAAATTCACCTACTGAAGGACTTGACCTTTGCAAGTACCTGTACA
AGTTTCAGTAATCCACTTGTTTGAGGTTTTTACAATACTAGCCTCCCTTGGCTATGT TAC
ATTTATGTTACTTTAAAGTTGCTGCCATGTGACCTGGAGGTCACGGGTTCGAGCCGT GG
AAACAACCTCTGCAGAAATGCATGGTAAGGCTGCGTCCGATAGACGCCTGTGGTCCA G
CCCTTCCCCGGACCCCGCGCATAGCGGGAGCTTAGTGCACCGGGCTTCCTTTTTTTT T
TATTAAAATATATATAAACAATGTTGTCAATAATTTTCCCAGTACAACAAAAAAAAG AAAT
CTCAATGATTGGTCTAATTCGGAAGAAAAGGGAAAAAGGAAGTATAAGAAACTAATA TA
GGCAAGGTGATGGGCGGAGAAACGATGGGCAACTAATAGGTACTCTAATGCAACAAA C
AAATTTACCTGGACTCAACGCCTTTCCCTTATCGTTCTGCATCCTTTCATGAATCCC GTC
AAG CAC GG AAATGAAAAACTCATG AG CATC CTGCTGTT CAT AACTTG CAAGATTTGATG
CATGCTTCCACCAGCTGTTCCAAAGAAAGGTTTACATCCATCTGAATGAGACACAGC AC
AGACTGCAACTAGATACTCAAAAAGTCGAAATCCACATCTAATAAAACAAATTACAA ATG
TATGTATATCACAACCAAGTTACACACCACCAGTGGCGGAGCCAGGATCTCCGCGAA G
GGGGTTCAAGAAAAAAAAAATCGTAGCTAGTGGGAATTGAACCTATGACCTTTCAAA GA
TTTTGAACCCCCTTGACCACTAAGCTACACTTATGGTTGTGTCAAGGGGGTTCAAAA CT
TAATATATAG AG GTAAAAAACAGATTTTG CCTTATATATACAGTGTAATTTTTC GG CG AA
GGGGGTTCGGGCGAACCCCCTTTCGCCCCCCTAAATCCGCCCCAGCACACCACTGTC T
AATTTCACCTCTATGAAGGGAAAAGCGTGGTACATACAATCCCAATAATAAAAAACT AAT
CTTGTCCCTACATCATTTTCAAAGAAGTGCACAACCCAAGCTAAATTAAGGAATCTT TTA
CCAATGTATTGTTATCCTTATAAAAAAGAATTATATACAACTATACCTCAATCCCAA GCAA
ATCGGGATCAGCTATATGAACTTCACAACACACACACACACACACACAAAAAAAAAA AA
CGTGCACATTATAACAAAGCCAAACATTATCTCAACAAACCAAGAAAACATGATCAA ATG
CAGACCTGTAAAGGAACTTTGCAGGACTAATAGGGGTCCGATCGCCAGAGAAAACAG C
AGAAAACATTGCATCCAAATCACAAGCCAAACACAGCATTGTTGAGTTCTTATTCCC ATT
ATCACTACTACTCCTTGTTATAACACTGCTATTCTTTCGCTGGCAAAAATATCTGTT ATGC
TTGTCACTCAGAAAGTAATTCCTCAATGGTGGTGTATGAAGCAATGCTTGAAGCACT GA
ATTCATAAAACACGTGTTTCCAAGATTGTTAAGGCCCCTCAAACCCCATTGTACTTC TGG
GGTTGTTGAGTCATTGCCGAGCTGACTCGGCAACGGACTCGAGTTCCCAACGATCAA G
ACTTGCTCTTTCACATCAGGCGTCCACGGTTTGTACTCCACGCGCCTCCTCTTGCGC GT
GCTCTCCGGATGCGGCGGAGGGTCTTGTATTGATCCGATCACGGTGGCCTCCGTCTG C
GCTAGTGCCACGGCGGCGTCGAAGTCGCTATTGTACACCTGGTCCCTACACCCGCAG C
AGAACAGCTCGGCCCGGTCGATGTCCACCGCGATGCAGTGCAGCGAGGGGTCCGCAG
CGTTTCCCGCCGGATGTGACGGCGCGTGCACGCGGCAGAATACCTCGGCGCACGTGA
CGCAGGCGTACAACCGCGGCGGCGCGTGTCCGCACGCACCACATCTCACCAGCTCAT
TCGGCGGCTCGCGGCAGAT
SEQ 66
TCATAACTTACTGTGCACGAGCTTATTTGAAAGACGTTCAACCTTGCCAAGTTGAGTTGC
GCTCTTCAGCATTGCCTCGCTGTCAGCTATAAACTTTGCAAACTTAGAACCCTTTGT ATC
ACCATCACTATCTCCTCGAGGAAGCATTGGCAGATGAAGCATAGAGGGACTTTGCTT G
G AAGATAAAGG CG G AGTTAATG CCCACACAGAAGAAAG CTGTTCATCTG GTTTGTCAAG
ATATTCCAATGATAGATTCTGCATGTCTGTNTAACGGTGATAGATCCTGCATGTCTG TCA
GGAAGAACAGAACAGAAATATCAGACGTTGGTATCGTGCCTACCAAGTGTTATTCGG AA
CTG ATTTCTTG AG AGTTG CAAAACTTATTAAG AG CTTG CAG ATC AAATGTTACTGCTTTT
ATTCTTCATACGGGAAAAGACCACCCATTATCTGAAAATGGAAGTATCAGGAACAGT CG
AATAAAGTACCTTCAACAAACTTG AAG ATG G GCTCTAAAG CTG CACATG GAATG CTG AA
GTTCAAATGTGGAATGACCGTCCCTCCACCATGTCTAGCATTACTGCATCAAAAATA AAA
CTACAAGACATGAAGTTTCAACAAGAAAGATATTAATAGATGGAAAAACTATAGAAA GAC
CATGATATGTGAAGATTAACATAAATGAGATGACTAAAAGATCTTCTGTATGAGATC ATT CAAATTGGACACCATTATTTTTCCTGTATGAAAAGCGTATAACCAATATACATTTTTGGT A
AGGACAATTAATATACATTCAAACAGGAATATCTTTCTTCAAGGACTTCTCAAAGTA CTC
CAGGACGCAGTGTACCACAATATCATTAGATTGAACTTCAAGAACAAACACATGTAA ATA
TACATAAGCTGAAAAAGAAATATCCTCAATTATAAGCATCCCCAGTTGTCATCCAAA AGT
TAATTACCTTGTTACAAGGGCAATCATGTGTCCTTCTGAATTAACAACAGCTCCACC ACT
ACCACCAGGGTGTACAGCAGCCGTTGTTTCAAGCATTGCCGGAAAATGTTCTCCTAA AC
TTGATTGGTTGAGCAGAGACCGCTTGGCTTCAACTACCTTAGCTATTGCACCCACAC AA
G CAGATG GAAG GAAGTCTATATTATAAG AAAAAGTAAG ACAAATTACAAATACAACTAAA
GAAGTCTAATATAGGTATGAAACATATGAAATAAATATACGTAGTATATCATGTTCA AAAT
GAAAGAACTTAACAAAATTATTCACATGAAAAGCAATTTAACCTCCAGAACAACATG ATT
TAGTACTATTGGGCGCACAAAGATAGTCAGTTCCAGAAAATTATGTTCAGCAAAGGT TAT
GGAACAGACAAGTTATCTGTATCAACGAAAAAAGATGGAACAGACAAGTTAAGATTG CA
TCAATAAACAATAGTAGCACTTGCAACAACCTAGCTACTATTAAAATATCCTTGAGA TAC
AGCCCGACTCGAATAAGTGAGTTACCAGGAATTTCCTATTCAAAAATCCCATTCTTT AAA
GCTGATCATTTGTACTTGCTTTCACAATAGAAAACATCAATTTAATGCTCCAGAAAT TTAC
CTTTTTTCGTGATAACTGATAAGTGACTTCAAAACTCTAGATTTGATTCCCCAATTC CACT
TTGTTAGCATAGGTATTAGGTATATATCATTCTTATGGATGAAGATCTGAATTAGTG CCT
ATGGCTTTTATTAGCCCACGAAAGGAAAACGCTTCTTTTTAATTTCGTCTACCTTTC TCC
TTGTTCTGCTAGCCTTGTTTGAGCCCTACAACAACCTCGCTATTCTTAATCTGACGT GCA
ATTTTTTTTAACCAGAAGATCAAAACACTGACTTGGACTACAAATCAAATTCAGTAT CAGT
AAACAATGTCTTCACCTAAAAGATTACCCAGTTTTGAGCCCTCCCGACCAATCTGGT TAT
TATTTCTCCATTGGAAGACACCTCAAGTTCCCTTGCGGAGGCGTCGACATCACCTCA TA
ACTACTCAATCAGTCATCAAATGATCATCATTTGGTGAAGGAAGAAACATCAAGTAT TCC
AGCAGTAACAAGGAGATGAAAATGATATAATACGACCCAATCCTGCAATTGATTATA ATG
ACACTTCAACAAATTCTTAACACAAGAGAAGCAAGGTGGAAGAGAGAGAAATTCAAG AT
AAACAAAGTTTTTGTAGAATATTCTAAAATTTCAGATTTACTGTGATGCGTGTGTCA AAAT
AAAAGTAAAGGCAAAAATATTTTATTTAGACAACAAATCTAAAGCAAGATTTACCAC ATC
GTGGTCCAAATAGCCCATGTCCGAGAATGTATGCTTTTGATCCAGGGGACGGGCACA T
G AAGTCAG CAGTAATG G G ACAG AG CTG ATCAGG AACTAGCTC AAGTTGTAGTAATG CA
ACATCCAGAGGTCCTCTGGAGACATGAACTACCTTTGCATTTGTCCATACCCAGGGA TC
CATAAAATCCAAGCGAACACGAATGATCCTACTGCCTGTCTTTGCCAGGTTAACTCT AA
AGCTACCTTGCTCATTGTCAACCAAGAAATGCGGAGTTTTTAGTTCCTTTTGAATCA AAT
GTTTATTCCTACGCTGAATGTCAAATTTCTCAACCCCTGGATGCTCAGATTGATCAG AAG
GGATGAGAACTACATCAGATTTGGTGTTATATCCTGAACCGTTTACAGATGTTTTTC CAA
ATCTCCATGGCTCTAGAAGATGAGCATTTGTAAGAAGAAGACCCTGCTTGTTGAGCA AA
ACTCCAGAAGCCCATGCTCCATCATCAACAGTGATAAGGCAGATAGATGTCATTGCC TT
CTCAATCAAG G ATG G GG G AACAG G ATCTATTAG G AG ATGCTCTCG AGTATCATTG G AA
GGTCCATTCCGAATATTATTGGAGGGTGATTCATTTTTAACACTGATTAGGTTTCCA TTA
TCAAAATG G ATCTTTCTCCTAGTTTGTAGCTCTTCTTTAAG CAG G CTACC ACAAG CAG AT
GTAATAGCTTCCCATGGAATCACCATCTGCCACCCAATGCTTCAATTTCAAATTAAA GCA
ATTTATGTGAAGATAGATCTCAGCAAGAACATAGAGCACATTATGAAATGTTTACCT GAA
TTTCAGCAGCAGTAGCCCTTTGTCTGAGTGGCCGAGACAGAACACCAATAAGCTCTG C
ATGTTCCCCTAACACTGGGCTACCTTCCAT
SEQ 67
CTGGATATAAGTTTGAACATGATCCTTCCAGAGAGAACTTGTCTTCAAAACATCCCTTAG GGTTGATAGAACTTCAAAAGATGTCGCCTTTATAACATCGTCATCCTTGTTGTACGGCTG TTCCTGCAACATAATTAAGAAAGCAAATTTTAGAAACAATTTCAAAACTGAACATCAAGC TTTATGAGTTTATCATTGCTGGGTAGAAAGAATGCACAAAAAACTGAGGCATGCCTTGA GATGATCAACTTTCACTGTAAGGGGTTCTTCACTGACCTGCAAATGAATGCCACTGTTT
CAGTCCTCAAAGTGTTACTCTTGTACATATCACAAAATGGGCTATCGAGACGGGAAA AA
ACAACATGGTGAATAAAGTAAGGAAATCTGACAGTGAACATTCAAGTGTGACAAATC TA
CTGCATAGTCATCAAATTGTTACACTAGTATATTGAAGAAAATTTTTATTGAAACAA AAGA
AATAACTACATAAGGGAATCTGACTGCAAACATCCAACAATAAATAAACCCAGTGTA ATC
CCACATGTGGGGGTCCAAGGAGGGTAGTGTGTGCGCAAACCTTACCCTACCTTGTGA A
GGTAGAGAGGTAAACATCCAACAGTATGTATAATCTAATAGGAGAATTATTAATAAG GAG
CTCAGG G GTTG AGAG ATTTAAGTCCAATAGAAGAG GATAAG AG GTC CTCAATCACAAAA
TAAAAC G AAAAAATAAG AAG AC ATAAG AC C CTC C AATG G CTAAGTAAG C C C ATAAATG G
CTGACCATGCTTGAGTACTCCAGCTTTACAGCTAAAGCCAAGAATACGACTCTATAC CA
AATTAGAGCTCAAACAAATAGGATTCTATTCAATCTTTTATGTGTTTTATATATGTT TAAG
CCCCAGGGCTTTGGTCTAGTGGTAAGAGTACAGCCCGTGATATGTAGGTTGGGTGCA C
ATCACAGTTCGTGCTCTGGCGCAAACAAAAGCCTAGTATTTAGGTAGACAATGGTAG AA
TGGCGAGCCCATTATCCACCGAGTTTGAAACCATGAGCCACTTACCCTCAGATTTCT CG
GTTATCAAAAGTCTAGGATTTCATTCAAACCAAATGCATCGGCAATCAAACAGATGC TAA
ACTAGCACATCATAGAAGAAACTCAATAGCTTTTCTCTTCTACAACCCCGAAGTGCA ATA
AGAAAGACATTCCTATGACTTTGCAGGAAACCAGTTGTCCAAAATTATTACAAAGTG CTA
CTTTCTGTTGCAAACAAGCAATCAAGAATAACTAAAAGAGTGTTTCATTCTATTGGT AAC
ACATCATACACAACTGG G AAAAACAG CTCATTTAG CCTCAAATTG CAACAG ACCTCTATC
GCTTGACAACGCAGCTGCTCAGTTATATGAAGCCAAAAGGAGAAAGACTGGGTGTTG G
GACACAAAAGGTAGGAAAGAGAGAAAAGTATACAAGTGCAATGAACCCAAGATGTCA AA
TCGGCAGTCTCAAAATCAGGAGATGATGGATATGTAGATGAAAAATACAACTAAATA GG
AC AC AAAG AAAAC ATATG G AG CTACTTATAAC C AAAAAATAAC G C AG AC ATTTACTTAC C
ACCTCTGTCATACGTATCCGCCTGTGACCAATAAGAATAACCTGGTCGTCTTTAATA CTT
GTTATCTGGCTCATCATAAGTTGCATCAACCTTGATCAGAAACTTCTTCAATAACTT AAA
GCAACAGAATAACACAAGGAACAACAAGATGTACCTGAGCAAGTGTACCAACTTCAT GA
AGACGGTTCAACATGTCTTTTCCTTTAAGCTCATAGATGTTTTTTTCTGTATCTGAG GCA
GACACAACATTAGGATCAGTCCCTTGCTCGTCTTTCATAAGGAAAGCGCCAGCATAA GG
TGCTTGCCTTTTTCGACTTTCCAGCAAGGCTGCTAATACCTTGGGATCCTGCAGTTA AAT
TTACGGATTATGTGGAGCCTCAGATCAATTTGTTGCATGCCATTACAGTATAATTTA TCT
TTTTCTGGAAACTAACAAATGAGGCTAAAGAATATTACAATTGAGCAAAGACGCTCT CCG
AATTG AAG GTTTTAG CTTTTCAAG CTTACAACTTCAG CCAAG AACTTTCCATCAATTAAA
GTAGAGCATTCTTAAAGCTGAATGCTGGTTACAGACTTAAGAGGAAATAGGGTCCAA AA
CACTAGCTGAAACAGTACAACTCTCAGAAAGTTGACAGTAATTATACACTTAATGTT TAA
AGCTTATATCTATTTTATTTTGAAAATTACGAAGACTAAGACATGGTTTAACCCATC CTAG
ACAAGGTACCATACCTTCACATAAATATGCATATAAAACCCTGGAAATAACGGTCTG TGT
G GAAGTG G CAGTG CTAG AACCTG CACCAAG AAAACAACATGC ATCATTATAAATTATAA
AGAATTAAAAAAGTACAGCCCACACAGACTTTATTTATTTTTAGTATAATAAATGAC ATAA
AGCCCTTAATTGCAAGTCTTACAGGTAAATCTTATCCAAAGATAAAACCATAACCAA CCA
AGAAGGCAAGAAATGGGTCGCATCCATTAAATTTACAAACAGATGAATTATGAGCAA AC
AAGAAGCCTCCAAGAAACAGTGTTTTCTTTGCATTCGTTCAGTATCTCTCTTTTTTA GGG
AAGGTTGATAAATACCATTTTTACACTGATCTTGCAACAACTAGCAAAAACTGAAAT ACA
CATATGAATCTTCCTTTCAAAGAAAAACGAGATAGTAGGGCAACCTATTACTTTATC CTC
ACATGTACTTTCCAATGCTTTACTTCTTAAATAACCTAGAAGCTGTTGATTAAATTG AAAC
TAAAGCATGAAGTAACCCTCTCTAGGCCTCCAAATATAGGATGAAATGTTAAAAATT AAC
ATCCATCAAGGTGCCTTCTGTACGGCCAGAAGTCAGAGGCATGAGGCATAACTGGTC G
AATGTTGACTCTGAAAAGAGCATATGGGAAGGAGAGTATTCTAATTACAATGGACAA GC
AAACAAAAAAAAATTCTTTTGCTGAACAATGTGAATCACCAGCTTCTTGCAGGTGAG AAA
TCATTCAAAATCATTAGGAACTTTCTGAAGCTTCAGTCTTGACATAGAGCTGATCAA TTT
G CCTCTACAAGGTG GG AGTTACACATG GTCCAG AGAGAG ACAAG G CAG G AAGTACTCA
AGGATCGACGGGTTTCTAATATCAGGAGATTTCAAAGAAACAATCTCAACCAATACG GA TG GTTC AACTG ACAATGTG CTCAAAACAAATCAATCTTCTTG ATTATG G GATTCTCTGTA
AAAAAAATAAGCCATTTAAATCTGAAAATTGTGAGACAATAGAAGAAGCAGATGAAA AGA
TAAGTTGATGAATACTTATTACATAATACATGACACTTAATTTTCAGGTGTGAAAAG AACA
ATAATCTTTGATGTTGGACTAACTTAACATTAACATTCTACTAACTAACATTGCCAT TGTA
TTATGTGTCCTCGAACTCAAGGTAAGAAGAAGTGAAGAACACCTTGAATTTGCTTTT CTG
GAAAACACGGAATAGTGAACAATAGTCGTAATAGACTAATAGTAAAGCATCACCTAG AA
AGTGAACAGTTCAAATCGTGATCAGTCACCAGAACACCATAATGGATTTTTTCGCTG AAA
GTCACGAAAATAGCCCAGCTAAAGAAATCATTAATATTCAGCATAGACTAAGGGAAC CG
ACTCTCAATGACCAGAGAGAACGCGATACCAGGAATCTCAAAACCAACCTTGGTGTG G
CGTAAGGTGGCAGCTAATGTGGCTCGAATTGGCGGCTAAACAGTGACTAGAATAGCA A
TGAAATAGCGTCCAAAATAGATATCTGGAACAATAATCATAATACTGGCCTGAAAGA TG
GTAG AAAACTCATTG GAAG AATG G CCAAAATAGTTCTTTTACACTGTAAATG GG ACATG
GTTGGGATGGTGGCAGAAACTCAATGGAAGAATGGTCAAAAAGGAGGAATTTTACAG C
TAAAAAGATGACTGAGAAAGAGGCATGATACAGTCAAATGGTCTACAGAAATAGGTA TG
AGGTTAACGATCACCAAAAGTGGAAAGAATAGTAAAAAAGTTGACCAATCGGGTAGC AT
AGGCTGCCCCTTATTAAGATCACCAATGGTATGTGATACGATCTGAGAAAATAAGAA AG
AGGATGAATTGGTAACTTGATGAAAATTGGAATACAAGACACATCTATTTGTACCTA ACA
ATCAAATAGAAAAGACACATCTATGTGTAGGTGAGAAATACATTAACTTTATGATAT CTTT
CCAATGTGAGACTAATTTAACACTACTAAGATAACACACATTAACATTTGAACAAGA ACA
AGTG G GTG CTACAAG ATG GTCAAG GG ATGACAAG AATAATATCAC ATTG CCG AAAATCC
AAGCTGTGTGATTGGAAAAAAACTTGAAATTACTTACAGATGATCTTAGAAGATGAA AAT
AGAGAGGTATATGCTTGAGCGGAGATGGAGATGAAAGAATTGTTGGAAAAATTGAAA CC
TTTAGAAATCATTAG GTGAAG CAAAAGAG ATG ATTG CAAATGAATTACCC CTAATGGAAG
ACTTGATTAAG GAGTTGG CAGACTGTG CCTT AGTG GAG GAG GTATCAAG G AAGCAAAA
ATCAATAACATTGTGGTTGAAGGAGGGGTACAGAAATACCACATTCTTTCTTCCTCC TCT
TCTTTTTTTTTTTTG G AAGAAAGTG GAATACCACATTCTTTCATTG AAAG GGTAATG CTAA
TTCAAG CTACAATAGC ATTAAAAAAGTTG CTCACTG ATG AG AG G CTCTCTG ACG ATCCT
AAGAAGATTCAGGACAAGAAATCTTTAAAAAGACGACTAGATTTTAAAAATTTGTCA ATT
GTCATTTCATTAAAATACTAATGCTAATTGAAGCTACAATAACATTAAAAGTGTCGG TCAT
CGATGTGAGGTTGTCTGAGGATCTTAAGAAGATTGATATCAAGATTCTTTACCACTA CAG
CCTG ATTTAAAAAAGTCTACACTACTATTCTTTC GTCTCAAACG G AATGG CTG G GTTCAA
AATATGAGAATCAAAGCATCTGATAATAAAACCGTTCTTTTTGAGACAAAGAGGGTA TCA
AAAGCTGTTCTTTGGAAAATAAAATGTTGAAGACCTTCACAGCATTATATCACGTCA AGG
TTG GG AAG G CCTTTTACTAAAG AATAATTG AGTGTACAAGTCTAC GAG AAATATAATCTT
GGCGATCCACCTTTTATATCCTAGAAAGTTAACTTCTTAGCCAACCATCCCTTTTGA TTG
CCGGCAGAGTGTTGCAACCAAAGTTAAAAATATTTTCTATATCAAATTGATCCCAAT TCT
GCGTCAATGCCAACCACAAAGATTAGAAATTTCATCATCAGCGTCCATGTTTGCATC CAT
ATGTACAAATTTGTATAATAATGAATAAAATCATATCACAACTACCCAAAGAAGACA TAAA
ACCAGCACTAACAACAGAAGGAACTCACTTAATATCTTCTCTTTTGGTGCACAATCT GAC
ATGGCCACGCCGTTATAACAAAAAGTTGACATCTTTTCTTAAGGAGTCATTTTACCA GCC
ATCAAATATTG GAATAAAATGTACTTATATAATGATG GAG CAG G AAAATACTTAATTATGC
GCCAAATTATATTAGTTAATAACAATGAAGTGATAAATTGAACTGAGAATGTACATT AGTT
AACAAAAGTCATAAAATTTATTTTAAAGAGACGTGTAATTCTATTGGTGCGGGTGCA AGA
AATTATTATG AACTAACC GTAAG G CAATCTTCAGG CTTAAAAACAGTAG GAACAATAG CA
GCGGAAGCCTTAGAATCAGCATCTCCGCCTTTCTCGGCCGGCTTGGCTTCGGATGCA G
CAGCTTCGGAATTCGATTCGGACCCATCAGTAGAATCCGAACAAAAGAATCGTCGAG AC
AAATAAG GGCCCCTGC G ATTC G AACTTCTTAG C G AAC CTAAAAC C C GAAG C AAG G G C G
TATTCGAGTCAGTGCCACGGCGAACTTGAGGGGTAAATGCCGTTGTGACGGCGTGGA A
ACG ATTCTG CAG ACATG AG G ATGTG AG AG CCTTCAACAT
SEQ 68
TCACGCAGCTTCAGCAAAACCCAGTTGAAGATTACCATAGTCAAATACAGTATGATACA
CCCCCATGAATACATTACCAAGAATCCTGCATGTACATGAATCTTTTCGTATGAAAC AGA
TCGAGAAGCTTTTGAATAAATTGAGGTATTCTTGTCTATGCATGCAATTTCTATGAC CTT
ATTAAAGCGAAATGGTTGTAAGAATAATAAAAACGTACCAAAGAGGACCACGAGGTG GT
GGCACATCCAAAGCAACAAACCCACTGAGGCAAATGGTAGCAATCCCCTCTCCAGTT TT
CAAAATATACTGGAAACAGAAACAAAGCCACAGTTGAAACAAGGCTGAGTATGCAAC AT
AACAATGTAACATATATAAAATTAGGAAGAGAGGCATAACCTGATCTGGAGTCAGGA CA
AAATCCTTATCACCAATATTGAATGTAACATTTGGCATGGATGATATGCTATTGCAG TCG
ATTACGGATTGCCCCATAGGACTTGGTAATTTCTCACAAAGCTGTGAAATACCAAAC ATT
AGGCATTAAGACAATGCTCTGCACTACTTAAAGAGAAAATTCACTCACCTGATTCAC ATA
TTCTAGCACACTCTCCTTTGTTGTCTTCTGTTTCAGCTGGTTCTGCATCCAAATTAC AGC
CATCTCACAGGCAGTGCACAATGGGGCCTCTCCTATGGAACTTCCTTCATTTTCCTT CT
CAACCACACTTCTGATATTCGAGCTGCAAAATTGGTTGTAGATTGTCACATGGTTAC ACA
ATTCAACATGTATTGTAACAGACATAGATCCTTGGTTCAAATCTCATTACCAACCTT CAT
CATAATGAATTTCCAAGTACTAGGCCCATGAAAAGATCATACTTACACGTAAGAGAG CAT
GTCCAAGATATAATAATAATTGAATAAGATTGTGACATCTTTGACAGCTTAAGTTTT TAAA
CGAGATGGTTACACAATTCAACGTAAACAACATAGCTCTAGAGCCTTGACATATGTT CC
AAAGACAGATTGTCCAGTTCAATTTAGTAACTTCACCAAAACCACCTTAATTCACAT ATG
CTCCAAAGAACAGGAGAAAAGCATCCATACCTCAAATGCTGAGCTCCATTAAGATAA CA
TAAACCTACTTGTAAACAAATTTGATCTGGTGTGACCTGCAGACAATTACAAGTAGC ATT
ATATCTACAATGTCACATGCAGTAGTGGCCGGCTCAAATAAATAAAAGAAGAGAAAT AG
AACAAACCCCTGATACTAGTAAATCCCAAATCATTTCCCCATATTGAGAAATGGTTT CTT
TACATTCCATGCTCAATACTCCTTCTGCTCCAATGGCATGGTTGACTTGTGTCACAA CAG
CCTGTTATGTACTTGCTATAATTACTTTCAAGAATTTGTTCAAGATAACTAGATAGT CGAA
AGGTCTACAACTTCTGCAACAACCCAACTAAAGTAAATCATACAGTTGGACCAGCAA GC
AATGATGTTCCAGAATCAACTATAGCAGCACAACCGCCTTCACAGAAGCCTGTAAGG GA
AAAGTATTAAG AC AAAAAG AAATTATTCTTATCTTAG GCTC GTAATC AACAG CAAC GCTC
ACCTGTTGATTGGTTCCCAATAGAGAAATCTCCCATTTTAAACTGAAACAGAAAAAG GAA
AAG AACAGATGG G GG G ACC GTG AAAAAAG G CAAAAAG G AATG GAAACCATAGTG GAAA
AAACAATTCAAGTACCTGCCAGTAACCTTTCTGAGTCAAAGGAACATAAGTATGTTT ATC
CTTGAAGTGTTTTGGATCAACACCACCAAAAACAAGTTCACCTCCCTCTTTTGCATT TAT
ATCGCGATTAAGCCAGAAAGAGAACACAGGCTCCTTTACGAGATCTTGCTTCACCAT AT
TGTACCTGCACATATATCATGAATCAAACAGACAAGATTCCAAAAACTGAGAAGAAA GG
AAAACAAGATAGAATTGGTTAAAAACTGAACTGAACAAATCAATTGCAGGCCTTATT ACC
AGACAGGTGTAGTGTTTCCAACAGCAATTTCCTTGAAACCAAGCCCAAGTATTCCAT CA
AACTTTGCAACTATAAATGTAACACTTGATTCCCGTGTCGCCTCAATAAAGACCTGA AGA
AATTGATGTAAAAAATTCTCATCCATTGTGTTTTCAGAAGAGCAGAAAGGACCATAA TAT
GAG G CAGTG ATG ACTTATTG CCAAG CAAG ATTTCAC CTGATCCGTG ACTACAAGATCG C
CAACTTGAACATTATCTTGACTGAGAAATCCTGAAATTGATCCAGATCCATAGTGGA TTG
AACAAGATTCTCCTGGATTAGAATAAACATCAAATATAATCAGAAGCCATCAATAAA TAA
CTTCTTAGTCTTTCAATTAATGTGAAAGAAATATAACATTAAACTATGATATGAACA TCAC
CTTTTTTTGTGTATGTACTAGACTTCCTTGCCTTGTATTTGGAATGGATCCAGCATG CAA
TCTGGAAATAATTCAAGTTTAAGGAAAAATTCTGTATAAACCGGTAATTCAACAAAG GAA
CAAATAACTAAAAGAATTCAAGTTATATACACTAAAAGTGTAAGGATTTTTGTTACT ATCA
GTATAGTTTAACTTCTGATAGAAATAATTAAGTACCAATTATTGTTACCAATTAATG TTTT
CTTATAGAGATTTACATGTAATTACCTTATAAGTGACCTGATTATATAACTAACCTT TGCA
C C GTTAG C AC ATATAG AAC GTAAAC ATTAACTAAAAAG AAG C AC AACTTAC AG AG AAATA
ACATCTTGATGATGGAACCCAGAGATTAGAACTTCCTGTATCAAATATGACAGTGAA ATT
TTGAGGGGGTGAACCAATACTAATATCTCCATAATATTGAGCATCCAAGTAGTTCTT TAA
GGACACTATATCTGAATTTGTGTCAGATTTCTTCTTCTTCTTCTTCTTCTCTATGTCCTT C
ATCACATGCTTTCCATATCTGTCTTCAAGTCTTGCTACATTGGCTACATTTAAGCTACTG
ATATCTAATTGTCGCTTCTTCAGACTAATTCTTAGCAAACTATCAGAGGAAGCAGGAAAT
AC AAAG C AG G C AATG G C C AATAAAAG GAG AG C AG C C C AAAG ATG CTTC CTTTC C AT
SEQ 69
TCAAGATCTTATTACAACATACTTCTATTACAATATCTTTTTCTTTTTGTAATGGCTTTG AT
CCTTGGATGGAAAATACTATTTATCCTTCTTTTTGTGATAATTGGGATGTGTACATC TCAA
GTCACTTCTCGTAATATTCAAGCTTTATCCATGTTAGAAAAGCACGAGTTATGGATG TCA
AGTCATGGACGTACTTACAAAAATGAAGCAGAGAAGGAAAAGAGATTGAATATATTT AAA
GAGAATGTGAAATTTATTGAGTCTTTCAACAATAATGGGACTAAAAAGCCATACAAA TTA
G GC ATCAATG CATTTG CTGATCTTACTG CAG AGG AATTCTTG AGTTATTATACTACTG GA
CTTAAGTTGTCTAATTCCTACTCTCAAATTCAATCATCATTTAAGTATGAAAACTTG AGTG
ATGTTCCATCTGTTATGGACTGGAGAAAGAGTGGTGCTGTCACTAGAATCAAACATC AA
GGTCAATGTGGTAAGGCACAGTTTCCTATTCAAGAAAAGTTTCATATTCTCTTCTTA TTA
AGTG CTG AC GTAACTAGTAAAGTTG ATGATATGTG ACCAG CAG GTCACG GGTTCAAGTC
ATAGAAACATTCTCTTGCAGAAATGTAGGGTAAGGCTGCGTACAATAGACCCTTGTG GT
CCGGCCCTTCCCCAGACTCCGTACATAGCGAGAGTTTAGTGCACTGAGCTGCCCTTT TT
ATTAAGTATTGAGAAAGGATTTAAGTAAAATACTACATACTCCTTTCAAATTTGTGA TCTT
AAACATGTTTTATCATTGTATTATAACGGAGTATCACTAAGGTTAAAATGAGAATAT TAGA
AGCAAGCATACTAAATATAAAAATACATTCTTTCTGTAATAGACTAAAATGGAAAAT AAGA
TATGCATAGAGTACTCTCTTCTTGTCCAATAATGTTGACAAGGCACTTAAATTATGA GTG
TGTGAAGTCTCACATTGGTAACTGAAAAAATTAGGAGTCTACATATAAGCCTACATA TAA
GGTTTAGAGTTTTTTTATGGTGTGAGGTCTTTTGAAAAAAATCGTGCGGACTTAATC CAA
AGTGGATAATATCACACTATTCTAAGAGTATCTTTGAGCTGTTTTAGCTCAACAACT CGT
ATCAGATCCCAGGTTCTGCGGACGAGCATAGCGATGGCGACCTGTGGATCGTGGTAA T
AGCCACATGAAACTGGTTCGACGGGGAGACCCGTGGATCATGATCATGGTAGTGAGC C
ACATAAAACTTAGTTCGAGAGAAGGATTATTGGGTATGCAAACAAAGTCTCACATTA ATA
G CTAAAAAGTTTG G GAG CCTG CATATAAG GCGTAGAG AACTTTTAATATTGTG AGTCCT
TTTGGGGAAACCGTACAGTTTGGCCAAAGCGGACTATATCATACTAAGTTAAGAGTA TC
TTTGAGCCATTTTAGCCCAACAAATCATATATGATAATTTAAATTTGTTTTACACTA CCAA
TAATGTATTTGACCTACTTTGCAGTATAGTTACTATTTTTGTATGTTTATCATAAAA GTTAA
CCTTTAAAACAATACAAGTGATATGATTTGTATAAATATGTGCATAGAACTTCCAAC TCAT
TAATAAATTGCATGAAATATAGGATGTTGCTGGGCATTTTCAGCAGTTGCAGCCTTA GAA
GGAGCAAACAAACTCTCAACGAACAACTTGATTTCACTCTCCGAACAACAACTGTTA GA
TTGCACCACCGAAAATAACGGTTGCAACGGCGGTTTAATGACCACAGCCTACGATTT CA
TCATTCAAAATGGCGGCATTGCCACAGAATCCAACTACCCTTACGAGGAATATCAAG AT
TCATGCAAAAGCCAAGAGATGAACTCTGCAGTGAAAATCAATCGTTACGAAACTCTG CC
CTCGACTGAATCAGCATTGTTAAAAGCCGTAGCTAAACAACCGGTCTCTATCGGTAT TG
CAGTG AATG AAG ATTTTC ATCTGTACCAAAATG GTGTTTACAATG GAAATTG CG AG G GT
CAAGAACTAAATCATGCAGTTACTGTAATTGGTTATGGGACAGAAAATGATGGTACA AAA
TATTGGTTGATCAAGAATTCTTGGGGGACAAGTTGGGGTGAAAATGGTTACATGAAA AT
TGCTAGAGATACTGGAATTGAAGGAGGTCTTTGTGGGATCACCACTTTAGCTTCCTA TC
CTGTTCTT
SEQ 70
TCATAACTTACTGTGCACGAGTTTATTTGAAAGACGTTCAACCTTGCCAAGTTGAGTTGC
ACTCTTCAGCATCGCCTCGCTGTCAGCTATAAACTTTGCAAACTTAGAACCCTTTGT ATC
ACCATCACTATCTCCTCGAGGAAGCATTGGCAGATGAAGCATAGAGGGACTTTGCTT G
G AAG ATAAAG G CGG AGTTAATG CCCACACAG AAGAAAG CTGTTCATCTG GTTTGTCAAG
ATATTCCAATGATAGATTCTGCATGTCTGTCAGGAAAAACAAAACAGAAACATCAGA CGT
TGGTATCGTGCCTACCAAGTGTTATTCGGAACAGATTTCGTGAGAGTTGCGAAACTT AT
TAAG AG CTTAC G G ATC AAAG ATTACTG CTTTTATTCTTC ATAC G G G AAAAAAC C AC C CAT
TATCTGAAAATGGAAGTATCAGGAATAGTCGAATAAAGTACCTTCAGCAAACTTGAA GAT
GGGCTCTAAAGCTGCACATGGAATGCTGAAGTTCAAATGTGGAATGACCGTCCCTCC A
CCATGTCTAG CATTACTG CAT CAAAAAT AAAG CTACAAG AC ATG AAGTTTCAACAAG AAA
GACATTAATCGATGGAAAAACTATAGAAAGACCATGATATGTGAAGATTAGCATAAA TGA
GATGACTGAAAGATCTTCCATATGAGATCATTCAAATTGGACACCATTATTTTTTTC CTGT
ATGCAAAGCGTATAATTAATATACATTTTTGGTAAGGACAATTAATATACATTCAAA CAGG
AATATCTTTCTTCAAGGACTTCTCAAAGTACTCCAGGACGCAGTGTACCACAATATC ATT
AGATTGAACTTCAAGAACAAACACACGTAAACATACATAAGCTGAAAAAGAAATATC CTC
AATTATAAGCATCCCCAGTTGTCATCCAAAAGTTAATTACCTTGTTACAAGGGCAAT CAT
GTGTCCTTCTGAATTAACAACAGCTCCACCACTACCACCAGGGTGTACAGCAGCCGT TG
TTTCAAGCATTGCCGGAAAATGTCCTCCTAAACTTGATTGGTTGAGCAGAGGCCGCT TT
G CTTC AACTAC CTTAG CTATTG C AC C C AC AC AAG C AG ATG G AAG G AAGTCTATATTATA
AGAAAAAGTAAGACAAATTACAAATATAACTAAAGATGTCTAAATAGGTATGAAACA TAT
GAAATAAATATACGGTATTATATCATGTTCAAAATGAAAGAACTTAACAAAATTATT TACA
TGAAAAGCTATTTAACCTCCAGAGCAACATGATTTAGTACTATTGGGCGCACAAAGA TA
GTCAGTTCCAGAAAATTATGTTCAGCAAAGGTTATGGAACAGACAAGTTAACTTTAT CAA
CGAAAAAAGATGGAACAGACAAGTTAAGATTGCATCAATAAACAATAGTAGCACTTC CA
ACAACCAAGCTACTATTAAAATATCCTTGAGATACAGCCGACTCGATTAAGTGAGTT ACC
AGGAATTTCCTATTTTAAAACCCCATTCTTTAAAGCTGATCATTTGTACTTGCTTTC ACCA
TAGAAAATATCAATTTAATGCTCCAGAAATTTACCTCTTTTCGTGATAAGTGACTTC AAAA
CTCTAGATTTGATTCCCCAATTCCGCTTTGTTAGCATAGGTATTAGGTATATGATCA TTC
TTATGGATGAAGATCTGAATTAGTGCCTATGGCTTTTATTAGCCCACGAAAAGAAAA CG
CTTTTTTGTTTTTTAATTTGGTCTACCTTTCTCCTTGTTCTACTAGCCTTGTTTGAG CCCA
ACAACAACCTCGCTATTCTTAATCTGACAAGTGCAATTTTTTTTAACCGGAAGATCA AAA
CGTTAACCTGGACTACAAATCAAATTCAGTATCAATAAACAATGTCTTCACCTAAAA GAT
TACCCAGTTTTGAGCCCTCCCGACCAATCTGGTTACTATTTCTCCACTGGAAGACAC CT
CAAGTTCCCTTGCGGAGGCATCGACATCACCTCATAACTACTCAATCAGTCATCAAA TG
GTCATCATTTG GTG AAG G AAGAAACATCAAGTATTCCAG CAGTAACAAG G ACATG AAAA
TGATATAATACGACCCAATCCTGCAATTGATTATAATGACACTTCAACAAATTCTTA ACAC
G AG AGAAG CAAG GTG GAAG AGAG AG AAATTCAAG ATAAACAAAG ATTTTGTAGAATATT
CTAAAATTTCAGATTTACTGTGATGCGTGTGTCCAAATAAAAGTAAAGGCACAAATT TTT
TATTTAGACAAGAACATATCTAAAGCAAGATTTACCACATCGTGGTCCAAATAGCCC ATG
TCCGAGAATGTATGCTTTTGATCCGGGGGATGGGCACATGAAGTCAACAATAATGGG A
CAGAG CTGATCTG GAACTAG CTCAAGTTGTAGTAATGC AACATCC AG AG GTCCTCTG GA
GACATGAACTACCTTTGCATTTGTCCATACCCAGGGATCCATAAAATCCAAGCGAAC AC
G AATG GTCCTACTG CCTGTGTTTG CCAG GTTAACTCTAAAG CTACATTG CTCATTGTCAA
CCAAGAAATGCGGAGTTTTTAGTTCCTTTTGAATCAAATGTTTATTCCTACGCTGAA TGT
CAAATTTCTCAACCCCTGGATGCTCAGATTGATCAGAAGGGATGAGAACTACATCAG AT
TTGGTGTTATATCCTGAACCGTTTACAGATGTTTTTCCAAATCTCCATGGCTCTAGA AGA
TGAGCATTTGTAAGAAGAAGACCCTGCTTGTTGAGCAAAACTCCAGAAGCCCATGCT CC
ATCATCAACAGCGATAAGACAGATAGATGTCATTGCTTTCTCAATCAAGGATGGGGG AA
CAGGATCTATTTGGAGATGCTCTTGAGTATCATTGGCATGTCCATCCTGAATATTAT TGG
AGAATGATTCTTTTTTAACGCTGATTAGGTTTCCATTACCAAAATGGATCTTTCTCC TAGT
TTGTAGCTCTTCTTTAAGCAGGCTACCACAAGCAGATGTAATAGCTTCCCATGGAAT CA
CCATCTGCCACCCAATGCTTCAATTTCAAATTAAAGCAATACATGTGAATGTAGATCTCA
GCAAGAACATAGAGCACATTATGAAATGTTTACCTGAATTTCAGCAGCAGTAGCCCT TT
GTCTGAGTGGCCGAGACAGAACACCAATAAGCTCTGCATGTTCCCCTAACACTGGGC T
ACCTTCCATTCCTGAAACATAAGGTACCATACGTTATCAGGTACCCAAAGATGGAAA GG
ATTAATTAACTAAAGTTGCAAG G CAAAAG CTCAAACCAGG G AG ACAACG GATGTCAG CA
ATCAACAGTG CTTTATTCTGTG G ACTAG GTG G ATAGCTGTTTG CAATG G ACCCAACTGA
TATGCTGTATAGCAACAAAAGATCAAATTGAGCTTTCACAAGGAGATGTAACAAGAA TCA
AAGTGAATATTAATCCTATGTATGATCACAACTTGGCAGACAAATAATCATCGAACA ACA
TCTGCATACACCAGATACAGCTTATATAGCTTACTCAAGTAAAGATGAGCTAAAACA TCT
TTAGCACTAGCAACAAGAATTACACCAGTGCTTTCATATTCAGATAGTCTATTAACA CAG
CACAAATGTTCAGTCATCATACATTTTGATGCTGACAAGCAAGGATGGAAAGATTAC AA
GAAAGTTGTAATTCCAACATATGATAGAAACAATACCAAAGCAACTACCTGTTGAAA AAG
TGACTGGGAGACAAGATACCGAAAGGAGAACCCATACCCAGAAGAAGATCACCTCTC C
TACTCCAG GTAG CCACTTTTAATG CAG G CAG GTC CTGAAAG GAAATACCTATG CAGTCA
TTATCATCAAAAACCTACTAAATG GAG CTTTCAAAAATTACCAG CAAAAAAG CAAG CCCA
TTCAGCAAAATGCAATGCCATGCTTCATTTCACAAGTTTGATGCCACTTAGTGCAGT AAA
AGCCTGAAACCTCATATGGATTTGAAGAAACTCTGAGAAGAGCAATTCTTGTAGTTG AT
GTTCCTATCACACTTGGCAGGCTGGACTGCGCCTCCATCATTGGAGTTTGACTTGGG AA
AGAAATTTTCTCAACCTGAAATTATCAGTTGACAGTTCCTGTTAAAAAATAGTCAAT CTAC
AAAATCAATCTCCGTGGTCCTTTGAGACTAGCATCTCATAGACCTTTGGCTATGAAA ACT
AGAATG CTTTGTG CAAAAATTGTTTG G GC CATCTTG AACATAGTACTAGG G ACAATTG A
GATGCCACCGTTGAAGAACACCATGGTAAGAGTTCCTTATGTCATGCCATTTTGCAC TT
GACAGCGTGAGACTCAAATGTGCTCCTTTCCAATAACACTGCCGAATGCTTATACCA TT
ACAAATATGTAACCAGAGTAGCTTACTGAGTTGGAGTTCTTAATATGAATTATTTTA AGA
AATGCCTCCAAGTTTTACGGGTGGTAGACTACCTTGTATTCAGACACGGGAAGGTGA TA
TTCAATCATTAATCTTTGAATTGGAATTAGGTGTTGAGACATTCTATGATGATAAAG AGG
CAACTATTTTGGCATGAGTAGAGACATGAAACAATGTGATGTCACTCTATTGTAAAT ATA
GGGAGCATGAAATAAGGAACTCATTACATGAATTTCCATTATTCTCCTATAAAAAGA GGT
TACAGAACATATAAAAGGTTCTGCTTGGAAACAACTCAACTGATAATGCAATTAATG CTA
AATATATGGAGGAAACTTGCATGTTCCACAACTCGAAGAAGTGGTGTGCAACATATA TT
AGTTCAAATCCATACTGCTCTTATGCCAGAAAAGAGAAGAAAAAACAGAACCCTGAA AG
TG CAAAGTG GTCATG ATTAG AC GTAAG ACAG G AATAAAATG G CATTTCTG CTCAAAAGA
AATAGGTGCTTGATTTATTTAGTTATTTAAGAATGATAAATGATATGCCTTTCAGTT GATA
GAACTTTAAAGTGTTAGCCTGATAAAATTATAGTTATTTGATGAAGTCTTTTGAAAT TTGA
ACCACAAAGTGGGATCATAGCAGAAGTTAGCTCATGAAAAATGACCAAGGATCTACA GC
ATCCAATCAAAATATGCATGCAAGAGAATTTGGCTTATCTTGGTCTCGGCGACATTT AAT
TATCTTTGAATGGAAGTATTGTCATGTTAATCCTATTTATGACTATGTTATGCATTA ATGA
AACAATCACCTAATTCAATAATAACACAGCTAGATAGTGTCAGGAGTATTTAGAAAG CGA
GTGAGAGGCTCGGACACATCTATCAATAGGTTGTACCAATCATATATAGATGAAGTA CA
GAGACTATGGTTACATATTTACTCATTAATACATAAAGGTACAGAGATTGTTATTGG GTA
CACATTTACTGCATGTAAGTGCCAAACAGAGGAAACATCATACCTGTGTGCGCTTAG TG
TTAGTAAATG ATTG AC G G G AATTTC C ATAG G CAG CTAG G G AC C AAC C AAC CTC C C AC C C
ATGTTCAATTGAACTAGACGAACCTTCAACTAGGGACTGGACAGCAGCAGATGACAC AG
GGATGTCAACCTTCAGCAAGAACACCAAATAGACGGTGTCAAGTGCAACTTTTACGT TA
GATCAGTAACAAAATAGCGACAAAGAAGAACACCAGCCATAAAAGTGATCAGTTAAC CC
ATGCTAAAAACTAGGATAAAGTTCAAAACTAGGTATTCCTTCTCCATTTTATACGGC ACA
CTTTCCTTTTTAGTATGTTCCAAAAAGAATAGAACCCTTCTCTATTTGGAATATCTT AAAA
CTTTAAACTTCCCACTTTACCTTAATGACATGCTCTTATAGCCATAGAAGTGTTATG AAAT
GTTTAAAACCACAACTTCTAAAG GTAATTTG GTATGTGTCAAAATCTTTTGTG CACG G GC
ACAAAACACCAGATGACATCAAACTAGAATTTATATGCATGCATCAAAATGAGAGCA CTT
ATCAATTCAATATTGCAAATAAAAAACATATATAAGATAAAGTAACAGGTTTTATGA TAAT
CAGCATTCAGATTACAAAATCCTTTCAGTCTCCTACTAAATCCTACCACTCTTAGGAGTT
CTGCAGGTAGCCAGTTCAAACCCTCTTTGTTGGTCACTTTGATATCATTTTGCAATG TAT
TTCCTCCCTGTGAACCACGAGCAGGACATCCAATGTCAATTTAATATATCAATGAAC CTA
AAAAACGTATTTTCTTCCGTAGGCTCAAATAAATACCTCCCACAGTATATCAATTTG AGC
ACCAGGAATCAGCTCCGGCTTATCCTGCAAGATGGTACATTGTAAAAGAATTATGAA AA
AGCATAACGACAAAGAAATAATCATGTTGTTCCATTTTATACTACATCCATTCAAAA ACCT
ACCTTTCTATACATAGAATTTTTTTAATCTTGTAATTCTCATTTTGCACTTAATGGC ATGCT
CTTATAGGAATTGACATGGCATGCTAAAGACAACTAGATAACTTCTTACACATGTAA TTA
AATATGTGACAAAAGTGGTTCTTTCTTTATTAAACTCCTTCTCCAATCAAACACCAT CATA
TAAAGTGAAACCAAACAGAGGGAGTAATTATCAACTGAAAGAAGACTAAAGATCCAA AC
CTTTGATATGTCCCCTCTATCCTGTTGTACAACAAAAGGCTCAATAACAGAAGCAAC TGT
TAAAACC AAGAAGTG AC CTCCAAAAG AGTG CAACTTG CTTTCACCTTG AATCTG CTTAG
ACACTGAAGCATTAACAAAGGAACTGGGCAAAAGCATCCCAGATGCTGATAGTGTCG TC
TTC C C AG AACTAC AC C AC AAAATTAC AAC C C AAATTTAAAACTTTC AGTTC AAAAC AC AT
AACATAAACAACTATAATATAGAGACAGAGAGAGCTATGTAAAATCACTTACTTGTA CAG
GTGG AAAG CATGTTTTCG CATTTTTAG G CCTTTAG G GTC CTTCAAG AAG AAGAATTTATT
TGTATTTTTTCAAAAATTAATTAGTAGAATAAGCAAAGTGATTGAAAATTACAGTAT CTTG
GCTTAAGAAAAGGGACTCACTGGGCCTTGAATTCTGACCATGACGGCATAATTGCGG G
CAACATCAACCACTTCAGGAAGACCCAT
SEQ 71
ATGGATAACCCATCGGAGGATTCCTCGGATTCTCCTCAACAGCAGCCCGAATCTCCTGT
AAACGATGACCAACGTGTTTATTTAGTTCCTTACAGGTAAAATCTCCCTTCCCCGTT TTG
ACCCATTCCTCATGCAACTGTTTGTTTATGTATATCAACATAAAAGTAAAAATAAAT AAAA
ATAAAGAATTGAATTCTCGGATTTTGCTTTCCCAATTGATTTTATGATTTGGTTTGA TCCA
ATTCAGCTAAACCCGAATCTGAACCCATGAGATAACGAGAAAGTCGAAACAAGTTCT AG
TTTTTTTTTTCTTTTTTCTTTTTGTTTAAATTACTTATATTTTTATTTGTATTACTT GTCATTT
AGATTGGTAATTGTATTAGCTTCCCTACATTGGAATGTTGTAGTTTTTTTAATCAAG TCTT
ATTATCTGGATCAAATCGTGTTGTGAGTTTTTTTATTTTTTTTATTAGTTGCCATTT GGATT
GGTAATTGTATTAGCTTTTGTACATTGAACTAGTGTTGGTTTTTAATCAATGTTGTT GGTT
TTTGTTATCTGTTAACCGGTGGATCAAATCATGTTGTGGGTTGTATATTTTTGTTTT GTGA
G CTTAAG CATAAG AAAGTATCG G CCTTG GATTTTCAGTTGTGTTTTTTTGATG AAGTAAA
TAGTTTCACCAATGTCATCAAGAAGATGCAAGTATTACGAAAGATTAGGCCAGAGAG TA
TCAGCTTC AATTACATTG GTCTAG ATTG CTAAG GAG CTGATAAAGTCCAG AAAGTTAACA
GGGTAAGTTACAATAGATAGTTTTGCCAACTAAATAAAAGTAAAAGACACCTAGCTA TCA
GTTGTTAACAATGGAGAAGTAGTATAGCAAAGTGCCGGCAAGATCTGAAAGTGGTGG TT
ATAGGGACCTGTTTAATAACTTAGTAAACCTTAGAAGAAGCTGACAAATTGTTCCAT CTA
CAATTTGTCAACCTTAATAGAGGTGCACACAAGCTGGTCGGACACCACGGTTATCAA AT
TTTTTGTTTAAAAAATGTTCCATCTCGATAAAATATCAATTGATTATGCATTATGTT GTCA
GTTCAAATATTGTTTCTCGCAATTATTATAAAAAGTGCATATCTGTGGAGAAGTGCT CCG
CGGGCTAGTGCGGTGGTAGGGGAGAGTGGTAAAAATGACACAAATGATGCTTTCCAC T
TGCTAGTGGTTGTTAAGAAGAGAGAGAATGTTTGAGCGGGAAGGACGGGGTAAATAG C
ATGGAAATGTTAATTGAAAGAAGTTAAAAGTTACCCTTTGCAGCATCTTCTCTAGGT AAG
AATTTTTTGTCTGTGTTTTCCCGAGTAGAGGGTTAAAGTGTTGCACACACATATATT ACA
GGTGCCACAGACACGTATATGTTTAGAGTACTATATAAGAAAGCGTGTTTGTGTTCT AG
GTGGTGGAAAGAAGCACAGGAGTCATCACCATCAGATGGGAAGTCAGTGACTTTGTA C
G C AG C G G C AC C AG CTC C ATCTTAT G GAG G G C C AATG AAAATC ATTAAC AAC ATATTTAG
CCCAGACGTCGCATTTAACTTGAGGAGAGAGGAGGAATCTTTATCACAGAGTCAGGAG
AATGGTGAAGTTGGGGTATCTGGTCGGGACTATGCTTTGGTCCCTGGCGACATTTGG C
TGCAGGCACTCAAATGGTCAGTATTTTAGAGCAGTTTCCAATTTGTATTCCTTGAAG TGT
GTTAGATAAAGCCTCTTCTGACGGAGATTTACGCCATAGTTGTTGAGCATTCTGAGG AT
ACCATTTGCATATGTGTTTTTCTCGACTTCAAATAAAACATTGATTTTTCACTTCTG GTTA
CAACAACCACTTGCAATTTGTTGTTTGGTTTCTTCTGCTTTTCAGACCATTCACATT TTCA
TTTCACATGAAAGAGGCCTCAAGCCTTTCGAGGCTTCATTGTTGTTGCTAGTCCGAT GG
CAATTCCCAGTTATAAATATATATTGTTAAATGCCTTGTGAATGCATATGGAAGCTC GTTT
TTTAAAGCATTTTGAGATTTCATTCTAAAAAGACCACTGTTTATTCTTTCAGCTTTA AAGT
GCTAAGCTCAATCTATTAATTCGCTTCCTTATTTTCTTTGTCTCTTTCATATATTTT TTTTG
TGTGTGTGGGGGGTGGGGATTGGTGTTAACTTATAACTGATTATTTCACTTTCCTTT TTG
GTGTTTTTGCACATCTAAGAAAGGGAATTTGTCTTTTGATCCTAGTAACATGTTATT TAG
CACGTTAATTTCATACATCTGGCACTATGTAAAAGTTGATCTTTTGATTATAGAGTT CTGA
TTAGTTTGATTGGAATTGCTCCTTTCCATCCAGGCACAGTAACTCTAAAGCTGCGGC TA
AGAATGGAAAAAGCTTTTNCAAAATTGCTCCTTTCCATCCAGGCACAGTAACTCTAA AGC
TGCGGCTAAGAATGGAAAAAGCTTTTCAGCTACAGATGAGGATATTGCAGATGTCTA TC
CTTTACAGCTGAGGCTTTCTGTTTTGCGGGAAACCAGTTCCTTGGGAGTCAGGATAA GC
AAAAAGGTTAATAATAACTTTGGAATTTCTAGTTTCATCTACAATTCCCATGAGATT TGTA
CTGTCATAATATCCATAGAGTGCATAACCACATGTGATCTTTTTGGTACAGACCTTG CCA
CAATTTGTGAAGCTGACTTTCTTTTTCATCAGCTGCTTTGCCATCTTTCATCCTTCA CTTT
TGTTGTTGCTGTTTATTGTTGAAACTGAAGGATCTAAGATGGACAAGTCCAAAATAT CAA
CTTTAAGAACAAGGGTTATGCATGAAGCCCTCTATTCTCATCCTCATATTAATTACT CGG
AATGGGCATAGCTGTTAAGTGCTTCCATTTTTGTTTGATATTTAAATATTAACAGTA AGAG
CTTTTTATG GTTCTG G GTTGTG CAAAAAG AGG ACAAGTATGTTAG CTGG ATAC GTATCTT
TTTGCTGACAAAGTGGGATAAATTTTGCATGGTAATTTTTGGCTTTACATGAATTTG TTG
ACAAGGATATCAATCCTATTGATATTTATGTAATTCACTTGAAGCATTTAATACTTC TATT
TCCCAATTACAGAATTGCTCATCAGCAATTTTTGACCACAGTGAGTTCTAGAGAAGC GA
GTCTTTTGAAAATGATAGTAGAAAGAGCGCTTCCTTTCTCTAGATTGTCTTTCTAGC AAA
GTAAATTAATCTAGAGTTAAATTGTATTGACAGGAATAAACAATATAAGCGGTATTT TCAT
CCAGACACCCCTCCCCTGTTTGTGGAGCAGAAAGAAACTATGTAATTGGGAATCACT TA
GTTTTGAGATAACATAGTAGACTATGTGACAGTTTTATTCTTTTAATTTAAAACTAA CAAG
TTGTCTTATATCTAAAGTTTTAGCAGAATCATTTATTCTGCCTCTAATAGCTTGGAA GAAC
TATATATCATTTGAGGTCTTTACTTGCCGAAAGACATCGGAGATGAAGTTAGTTTTT TATT
AGATCAGATATGAAATTAGTTCTAGCTTTTTTTTATATACTCAAGGATGCCCTGACT TCCT
CCATCTTTATCTATTTTTGAGAAATTCTCTTTCTTGACTGCCAAATGCTAAGAGGAA ATG
GTACCAAGCGGTTACATGCAAACCCTGGTCTGAAGAAGTAAAATAGCTAGAGTTCTA AT
TTTCATAAAGCTAATAGGAAATAATTTGATCACTTGTGAAAATAAGCCAAAAGAATG ATG
CTCATTCGAACAAAGTTCTCTTAGAGTTACTACATATTTCGTTGTGTATACATGATC CTTC
AATGCTACTTCATATTTATTATTTACCCGAAAGTTGATGTTAATTTGAGCTCTTTTT TTCTT
AACAATGTTATTGCTGACTTGTCCGTTTACTGCCTCAGCTTGTGTACATAAAAATGT AGT
TTCCAAAATGTTGTTTGTATTTTGTAACTGTTGCATGTTAAACATTTGCAATATTGC GGTG
GACAACGTTTCTTTTTTTTTTTCTTGCAATAATATCCTCACCGATGTTCTTTTTTTT TTTTT
AACTATCCTCACCGATGCTTTGGAGTCAGTTAAGCTGCTTTATTTCCTTTAGAGTCT AGT
TTTACATTTGTCTTCTCACATTTGATTCAGGACAATACAGTTGAATGCTTTAAAAGA GCCT
GCAGAATTTTTAGTGTCGATACAGAACCCGTAAGTTTCAATACTGTTGTTAATCAAT TGC
AATGGTATCTCTTTCAGGAGATTAATGGTATTTTGGTTCTCTGCAGTTACGGATTTG GGA
TTTATCTGGGCAGACGGCATTGTTTTTTTCAGATGAAAACAATAAGATCCTCAAAGA CTC
TCAGAAACAGTCAGAGCAAGATGTATGTACTTTCAACTGTGTCATACTTCATGACTA ACC
AATAAACAAGTCGACCAATGCTTCTGCGGCATTCACTATTTTTCCTGTCTTTACTAA GGA
AATAATTTAGTTATGCTTTTTTCTAATTGTTTTCTAATTAAGTGTTTTTAGCAGATT TTTCC
ATTTCTTTATCTAGTGTTTGTGTCGTAAAAAGATATATAATGATTGAGGTGATGAAT ATGC
TTACTTAACACTTCATCTAGGAATGAAGTGAGACAATGATTTTTCTCCATTTTCTATATA A
GTGTTGTTTTTTCTTGAGCATGGACAATGCTAAGCCCACCAAAATTCAGTTTTATGC GAC
TCTCTTTCATTTTAGGGTTCGTTCTGGAGTTATTCATTTATAAGCAGTAGATGTGCT CTTT
CCTTGTACTTTCAATGATGTACACTCTAAGAAACTTTAGCTCTTTTATTACCCTGGG ACA
AAAGAAACACATAATAAACGGGACTGTCATGTCTAACGACCAGCTTATACACCCATT CG
TCTTG GAG AACAGG CG GGTAAATTCAGCTTAGTTATG CTGTTTTTCAG CTG G ATCTAAG
ATTACAAAAAGAGCAACTGTTTTGTTTTTTGTTTTTTTCCATTTGTGGCAGTTATTA CCGG
TGTGGTTCATCATTGATTGTTTTTGTATTTTCTTAGGCTGTTTATCTGTGAAACAAA TTGA
AAGAGATGCCAAAGTTTTATCTGTTTTATGTTTCTTTTTTCCTTGTGGGTTACCATT TAAC
TGAGAGCAAGGTAAACCTTTACTGTTGAAGGCATTTTGCTGGTTATGGGTTGCCTTA TA
GTCTTATTACTGACTCTTGAATTAACTCTAGAATTTAGTGTTTAATGGTTCGCACCG CTT
GTAAG CAAGAAATG ATTTG G AC AAACTTCTTATTTTGTCCTCTTATGTTTTTG CTTG CAGA
TGCTCTTGGAGTTGCAGGTCTATGGGTTATCAGATTCTGTTAAAAATAAAGTGAAAA AAG
ATGAGATGTCAATGCAATACCCTAATGGTTCTTCTTTTCTGATGAATGGTACTGGCA GTG
GTATAAC CTCTAATCTCACTAG GAG CAGTTCTTCATCATTTTCTG G AG GTCCATGTG AAG
CTGGTACCTTGGGCTTGACTGGATTGCAAAACCTAGGGAACACCTGTTTCATGAACA GT
GCTCTTCAGTGCCTTGCACATACGCCAAAGCTTGTTGATTACTTTCTCGGGGACTAC AA
GAGAGAAATAAATCATGATAACCCTTTGGGAATGAATGTAAGCAATCTTGAATATTT CAA
GATCATTACGTGCTGCTTTAGATGTTTTCTTCAGTTCTCTCTGAATAAGTCAATGTT GAC
ATCCCTTAACCTATTCTACATTATATGTGGTTGGAAAAGTAAAAGAAAAAGAGAAAT TCA
TTTGATTACTCTCCAGGTGAGGAATTCTTTATTTACCTCCAATTGTTTTGTTAGCCC GGA
CAAAAGAAAACGATATGCTTATCCGTTCCATTCAATTTAGTAGGGGTTGAGAAAATT GAC
TCGGAGGGTATTCAATATCTCCACTTTTTGTTTCGTACCAAACAAGGGGAATAAACT TTA
CCTCTTTTACTTTTCCTCCTCCTTCCACCTCATCTCATCCCAATCAAACATTGTGTT CTAA
TCTGTCTCCTTACATATTTTATTGTCTAAGTTCCTCTCTTTAAATTCTTTCAGGGTG AAAT
TGCATCTGCTTTTGGTGACCTTTTGAAGAAATTATGGGCTCCTGGAGCGACTCCTGT GG
CACCTAGAACATTCAAATTAAAGCTTGCTCATTTTGCTCCTCAATTCAGCGGCTTTA ATC
AGCATGATTCTCAGGTCCTTTCAGTCCTTCCTGTTGGATTTAGTTTCCCAGTTTTAG GTC
ACTTATTAACGCTCTCTTTTCTGTCCTCTCATTTTGTGGGCATCTTTTGACATCTAA TTCT
CCTATTTATATCTGCAGGAGCTCCTAGCTTTTCTATTGGATGGACTCCACGAAGATT TGA
ACCGTGTCAAGAATAAACCTTATGTTGAAGCTAAGGATGGAGATGATCGTCCAGATG AA
GAAATTGCTGATGAATACTGGAATAATCATCTGGCTCGTAATGATTCCATCATAGTG GAC
GTTTGC CAG GTAAGTAACATCC GATG GTCTCTTGTATCTCACTAG AAGTAG G AAACATTT
GATATCACCGGCACTCAGTGGTCTCTCGTCTCTCACAGGCAAATGTGAATTATTGAT CT
CATTTCAACATTGACTTGAAAAAGCAAGAAGAATGAAGTGGCATATTTTTTAAAAAT ATCT
GAACTCTACTGTATTTGTGCTGCGAGAGTTGTTCTAGGATGAGAGAGTAATTATACC CC
AACTGTTTG G G GAAGTTTAACC AGTGTTC CTAAAG CTTG CTTCAAATTTCTCAG ATATTT
TTGTCTAGATTCTCTGCCTTTTTCATCAATAAGATTTCTTACCTTACTCAAAAGAAA TTGT
AAGTAATGGAAATTGAATTCAACTCTTACCATAAGTAATGGAAATTGAATTAGTCGT CCC
TTTCAAATCAATCCGATAACCAACTTGGTTCAATAATTCGGAATAGTGGGAGTACTA TTT
GTTAACAACTGACATACTATTTTCCTAGAATGCAGTCCTGAACTAAGAGCTGAATTT TGG
ATTCAGGTTTATTTTAAAACAAATTAGTTTTATTTGTAGTGCTCCCTGTTTTCTCTA TATG
GTCTCCGCATTTTATCTTTGATGTCTTTTTGAGTTTTACTGAAATTTCCTAAAAGAA GAAG
AATATTCCAGCCTTTAATCTCCATAAGAAAGTTAAATTTTTGTTCTTCCATAGTTTA CAAG
TTTAATTATATAAAAACTTGAACCTACCTTATCAAAAAAGAAAAAGAATTATGTGAA AACC
TTGAACATTGTGAATGTTTCTGACATTTGTGCACTCTATAGGTGTGTTTGGAAGTAA TTG
GTCTTTTTATGTGTTATGTAGTCTATGGCGGTATGTAATTTCTAAATTCCTCTTTGG CCAA
TCATCTGTGAGATAAAGCTCTGCAATTCTAAGATAATTTCGAATACCACCAATTGAT GAT
AGCTTGTCATTTTTTTTAGTTTTTTTATTCTGTAATTTTTTGCATTAATAAAGTTGA CTCTT
AGCCATGTCTATCATTAGATGTCGTAAATTTTGAAATTTCACTTTAGAAACCATGTA CATA
GTACATGTTTCCTAGCAGGTCGTGATTTGTCTATGACTTTTAGGAGGATTTAAAAGT TTA
CTTCATCTG ACTC CCTTTTTCTG CAC ATTTATACAAGTTTCTTTTCTAGTCTTG CTTCTCA
GAGCTTATCTCGATTGTTGCAGGGTCAATATCGTTCCACATTGGTCTGTCCTGTTTG CA
AAAAGGTCTCCATCATGTTTGATCCTTTCATGTATTTGTCACTGCCTCTTCCATCTA CAT
CTATGAGGTCAATGACTGTCACAGTTATAAAAAATGGCAGTGATATTCAGATATCTG CCT
TTACAATCACTGTTTCCAAGGATGGAAGACTTGAAGATCTTATTCGTGCTTTAAGCA CTG
CATGCTCTTTGGACGCTGATGAGACCCTTTTGGTGGCTGAGGTAAAGTGCAGAATTT CC
AGTGATGAGAAATGGTTATGGATTTCAAGTTGTTGCTTTATTGTTTCCTAAATAGAA CTTA
TTACATACTGTGTATTGGATAGTCAAGTAGAGTCCTTTTTCCTATTTCCAAAATTTT ATTT
CCAGCTCTTGCTGGGTTGTTGTTGTTGTATTTCCAGCTCTTACTCCATTTAATGTTA CAG
ATATACAACAACCGCATTATACGTTATCTTGAGGAGCCAGCCGATTCATTATCCTTA ATA
AGAGATGGTGACCGACTTGTTGCTTATCGGTTGCACAAGGGTACTGAAGAAGCCCCC T
TGGTTGTGTTTACGCATCAACAGATTGATGAGTATGTCTTGACTTCATAATTTGGGC ATT
ATCTTTTTTTGCTTTAAAGTTCATCAAACATTACTAGCCATTACTCAGATGTGTCTT GCAT
GCACAGCTATGTTTCATAAGTAATAAGTTGGGGGAAAAAGTACTCCAAGGGTGGTGC TT
CCACATCATCACTCTTAATC ATG G CAG G GTTTG G ATGTG GG CGTACTTGATG ACTACAT
TGCTTAAAAGAATTGACAAAATATTTTCGCAGATGACATATGTAGTAATATCTCAGT CTAT
TAGTTTGCTTTATGGAGATCGGGTGATTAATTCATGATCGACACAACTCCAGTTAGT TAA
TAGAGTAGGCTGTTAGTTGTCATATACTTCTATCTTGTATAAAGTAAAAATGTGAGG TGG
TTTATTTAGTACTGTTGAGCATCCTCAGTCTCAATTCGCTTCACTTGAATACATTAC AAAT
CATTGTTATGCATGGTTCGTCGAGCAACATGTAGTTCAGATGATGTGTGTGATCCTT CTA
GATTATTTGGACAATCATGAAACTATTGCTTCTTCCATGCATCTTACTGCTGAAGCT GTA
TATGATATGGAATTTCATGCTGTTTTGTTTGCTGATTATGTTTAGTTTAACTTTTGA TCCA
TTGAAAGATTCTCATAGTGGTCCTTGACTCATTAAATGAGATGGCTGATATTTATTT TGG
CAAAATATCATTTCCTTCTTGATTTCCTCTTCCATTCTAGCAATCTTATGAAGACTG CATG
TGCAGGCATTATATATACGGAAAGCTGACCTCAAACATGAAGACATTTGGCATTCCG CT
TGCCGCGCATAGTAGAGTTCTTACAGGATCTGATATCCGTAGTCTTTATCTACAGAT ACT
TACACCATTCTTAGTCCACAATACAGCCCAAGCAGATAATCTTAACTGTGATAGAAG TGC
TACTGAAGCATGTACAGATTCAGAAGTCATCACAGACATGGAACCTGGCAACTCAAT AG
TAAACGGGGTTCCAGAAAGCATTGCTGAAGAAGATACTGCCGAACCTTTAGACATGG AA
TTTCAATTTTACCTATCAGATGATAAGGCAACCTTTAAAGGCTCCGAGATTGTAATG AAT
GAGCCATTACAGTCCACAGATATCTCTGGACGGTTAAATGTACTTGTAAGTTGGTCA CC
TAAAATTCTTGAACAGTACAATACAGGCCTTTTCAGCTCACTGCCAGAAGTTTTTAA ATC
TGGTTTTTTTGCCAAAAGACCACAAGAATCTGTCTCTCTGTATAAATGTCTTGAGGC ATT
TCTGAAGGAAGAGCCTCTAGGGCCAGAAGATATGTGGTAAGTATGCAACTCCCTCAC TT
CTGTGATTGTACACCATTCATATGCAAGCTATGTATTCATAACATATGAAATTTCTC GTAA
TGCTTCCCTTTTTGCTTCTTCTTTGGTTTGTGCTAATATTATAAACCCTCAACTTTT GTAA
TTACATAATTGTATTTTTCCAATTATACCATTTTATTTCATTTCTGTCAATATTTTC CACCG
CGTCGTGATTGCTTATTGTGGATAAACCATTCTTATTAGCCTCCCCTCAACCAAATT GGA
CCTTGACTTTGCAACATGGCATGAGTAGTATCCTTCCAACTTCTAGTTCAGTTATGT TAA
AGAAACAATGACAGCATCTGATGATCTTATGTAGCACCTATTGATTTTCTGCACATT GTG
CTTCTGAAATGCTTTATGTGTTGCTCTTTGTTTCTTGTATATTCATAGCAACTACAG ACAG
TAATTGAGATAATAATATTCAGCTATTTACCTGGGGCCACTCATGGGAGGAAAGACT TCT
TTCATGAAATGTTTCTAGTATTCTATCTTATCATCTAATCATTTATTTGTGTCATTC GCTTG
GGAAGTATGAATATAATCTGCCTAACTTTCTTTGTCTTATATCCAACATTAGGTACT GCC
CTGCATGCAAGCAGCATCGCCAAGCTACTAAAAAGTTGGATCTTTGGAGACTGCCGG A
GATTCTGGTCATCCACCTGAAGAGGTTCTCGTACAACCGGTTTCTGAAGAACAAGTT GG
AGACGTATGTTGACTTCCCAACTCATGATCTTGATTTATCCTCATATTTGGCCTACA AGG
ATGGCAAATCTTCCTATCGGTATATGCTTTATGCAATTAGCAACCATTATGGAAGCA TGG
GAGGGGGTCACTACACTGCGTTTGTTCATGTAAGTGGTGCTGCGACTTGGATTACCT TG
CTTCTTTTTCTTGGTTTTGTTTCTATTCTATGGTAAATAGGATTCTTTTATACCTGA TAAAA
ATGGCATCTTAAGATCAGTACTTGGGGAGAAGGGTGGGTGGTGGGCGGTCACTGAAA C
CTACCTCCAAGGGCAAATATAGAAATTTCCTCTATTGGTCTTATTCTTATTGTTCGTAGT
GAGTGTTCCTTTGATGTATTTTTTAGTTCCAATGCATCATCTGCATCTAAATTAATC ACAT
ATTGCACACATGTGCATCTATTATATATTTAACTTTGGTCATGCGTCTTCATTTTTT TTATT
TCTTCATCATGAAGAATATGCAAGAAGGTCAAATATTCAGACTTTTACAGTCTTCCT AGT
TTAATCCAGATATTCTAACTTTGTGTTTTTCTTCTTCTAATAATCTAGCAAGGTGCT GATC
GGTGGTATGACTTCGATGACAGCCATGTGTATTCCATCAGCCAGGACAAGCTCAAAA CC
TCGGCCGCCTATGTTCTATTTTATAGACGAGTTGAAGAAATC
SEQ 72
TTACACTTGCCTACTACACTCTCCTTTGCCAAAACCTACTCGTCGATTTCTCATATCAAA
TTCTACCCATAAATTTTGCTGGTGAAAATTACCAATAATATTGCTTGCTATTCCAAG TGAT
TCTGACCGTCCGATTCCAACACAATGGATCCCACCTTCTACTTCATCCAACATCCTT TCC
TTATTGATCAAAATATCAACCCCGTTTTCAAATTGCAATGTCATATCACCTATCAAC CGTC
CGATTTCGATCGGACGGTTATCGAAGCACATGTCGAGTGCACCACCATAAACGTAAC CT
TTTTTCAATCTTGGACCTACTAACCTAACAATTTCTTCTCTGACCTTATTGTACGCT TCTT
CCACTAAGAAAGTGTACTCCGTGCCGGAATCAATGATCGTCTGGCCGGAACCACCAG C
GTTTGGCCGGAAAACCCTCCCGGAGATGTTTAATTTTTTGCCGCCAATTTTTATCCC CA
CCATGCCAACAGTAAAAGCTAGTGGATCCAAATTTGGCATGCGTTGACTTTGAGGAA AA
GTCAAAAGATTTATGTATTGAAATGTATGGGAATTAGGGTTTTGGCCTAGGTAAAAT GTT
CCACTAGGTTTAACTGCATGGCTACCTTGTCTAATTGGCACGCAATATGAGAATTTT TGT
ACCTTAGCTTGGGAGGCAAAAGAAAACCGTCCAAGATTCATTCCCAAAATACCCTCA GC
ATCTTCGGACTCGGTCGCACAACCAAGAATCAAAGGAGGGGTACTTTGGGAACGTGA A
AATGTAATTTTTTCACGGACAAGATTACCCTCAGCTAAAGTACCATCAGCATAAAAG TAG
G AATAGTG G CACAAACG ATTTTG GTCACAAGTAGTTG GAAG G GTAAAATCG GG AATTCT
TGGCTTACATAAAGGATGAGTACAAGGAAGAACAGAGAAAGTAGAAGACAAAGAAGG A
TCAAACGACGTCGTTGGTGGGGGTCTTTTGGGAATTTTCTTATGACATTGAATCCAA GA
AAGTTGGCTACCAGTGTCCAAAACCATTTGTTGATTTTGTGGTGGTGTTCCTATTGG TAG
TGTAACAATTAAAGCCATTGAATATTTAAAAGTTGATTTATAGTTCAAAGATGGAAT TCTA
GACATAGTTTTTGTATTTTGAGTTTGTCTTCTATTATTAGAAGCCATAAAAGAAGAA AGAA
AAAGAGCTTTAGAAGAAGAGTTATGTGATAAAGATGTTGAAATAAGAGGAAATGACA TA
GAAAAAGGCTTATGTTTAATGGTTTTTTGTGCTGAGATGTAGAGAAAATTGAAGATT ATG
AG AAG AAG AAG AAC AAAAACTCTAG AAG AAG AAG C CAT
SEQ 73
TACACTATAATTATATTTTCGTTAAATATGAAGATTTTTTCCATATTCTCTTTGCTTCTT CT
CCTTCTCCTTCCCATCTTGGCTTCATGTCATGAAAAACAGGTACAAGCATATACAAT TCT
AGTTTCTCATTGATTCTTTAATCGCAGTTCTACTTCTGTTTATTCTTTGTTTTAATT ATGGG
GTTTTGTTTTG CAG GTTTATATAGTGTATTTTG GAG GACATAAAGG G GAG AAAG CATTGC
ATGAGATTGAAGAAAACCATCACTCATATCTCATGTCAGTGAAGGAAAGTGAAGAAG AA
GCCAGATATTCTCTTATTTACAGTTACAAACATAGCATCAATGGCTTTGCTGCACTT CTC
ACCCCACATGAAGCCTCCAAGTTATCTGGTATAATAACCACGAAAAAAGTTCACTCT TTC
AAAGAAAGAGTTTAAGTTACATATAGTAAAATTTAATTGGTTATAGCAGGTTATTGC TCTA
TTTTCTAGGTCAGAGTAACTTGTTTTCATATGTCAAATTAATCTGATAGTGTAAAAA ATCC
TGTATAAGAAACACAAGGTTCTTGTATGTAGAAGAACTTACCTTATGTATTATTTGA ACA
CAGAATTGGAAGAAGTGGTATCGGTGTATAAAAGTGAGCCAAGGAAATACAGATTGC AA
ACAACAAGGTCATGGGAATTTTCTGGAGTGGAAGAGTCAGTGCAACCAAATTCCTTG AA
CAAGGATAACTTGCTACTGAAAGCCAGATATGGCAAAGATGTCATTATTGGCGTTCT TG
ACAGCGGTACATACATATATATTTGCTTACCATTATTTCCAATATGGCATTATTTTCCCT T
TGTTTTAAATTTTAAATGTATTTCCACAAAGGGCTACATAATCTAGCATGTGATTAT CGTT
TCTCCAATAGTGATACAGACAATCTTATTAGTAAGACTAATGCCTTGTATGTATAAT AGTA
GAAAGGGATAACACGTGAGGAATCAACCTATATATATATATATATATATATATATAA ATGT
ATTTCAAAAAATACTACTTATAG ACATATAAG GAAAATTGTG AGAAG CCTTGTAC CAAAG
GGAGTCTAAAGTTAAAATAAAAATTCAACATGTTTAAGGATTATGGTTATATAGGAT GGA
CGTGTAACTGTGTCTATCCTCCGGCTTATCACTGGCAACTGAACACGAGGGTTGCGC T
CGTTGCGGGACTCATTAATTATGAGATTATCAACTGTAACTAGTGTTAATTGACTAG TCT
GATACTTAAAAAAAAATTGGAGTATGATATTATGTGATGAATGTTGTTGGATGATTT ACC
AGG G CTATG G CCAGAATCTAAGAG CTTTAGTG ATG AAG G GTTG G GACCGATTC CAAAG
TCATGGAAAGGAATCTGCCAATCTGGAGATGCTTTCAACTCTTCAAACTGTAATAAG TGA
GTGTAATTCCTCTTCCATATGTTTTATATCTTTCCTTTAACTTTTTCTTTCTTTCTT TATCTT
ATCCCTTTTTATTATCTCGATGATCTGATGTCTACCTGTTTTACAATGATTTAATGT GGAT
TTTAGCCATTCTTGGGTTAGAAAATGTTCAGCTGCTCTACAACCCTAGACCACATTC TTT
TTGTTTTGGGAATTCCTGCTAAAATAAGCTGATTTACTACCTTAGACGTTTGGTTTA TCAA
ATATACCAACCTATACGTATTTCTTTATTTTTCTTTTTTTAATAAACTTTATTAAAT TTTATA
AGGCTGAGATGACTTTGAACGAAAAATATGATTCATTTAGTTTAAATCCAACTTATT TGG
AACTGGCATAATAGTTGTTGTTGCTATTAAATTTCATAAGTAGGCTTAATAAACATG TCAT
CAAGTTTTGTGCGCACCTATCATATGATGCCTTGTTTATCCAATTATGGATTTCAGG ATT
TGCTTGGAAATGAAGTGTTTGGCACTTATCTCTTGTCTCTTATACATTGATCTAACT TCG
TAAGATTAATTGTATTTAAATGGCTTGTAATAGAAAAGGCCAAAGGTCAATTTCAAG GCC
GATTTTTGGAAGTTTTCCTTTGTCTTCTTTATCAGTTGACCCTAAACCATTCTCATA ATTT
AGCTTAATTAAAATCAATTAAAAGAAAGCAGATACATGTTTAGTTTTTTAATCTTGT ACCT
CTCTAAAGAGTGAAAGAGAGTTTTTTTGAGAGGACAGGACCCATTGGGTGTCCATGC CT
GTCCTTTGGTGGCCTTAGGATATCAGTGTAATAATTTCAATATTGTCCATTTCAATC AAA
CCAAG AGAG GTTATG CTGACAAGTTG CTAATTGTTTTTTG GATTCTTG CTTTG CCATCTT
GTGAACTTTGTATCCTTCCAATGCTTTGTTGTGCAGTAATTTGTTTTTTGCATGTGT GTTG
TCATTATGGTTATTGTGAAGTCTATAGTGAAATTTTGTGAGGCCCTTACTTCCAGTT TTG
CACGGATATTCTCAGTAGTAGCCAGTAATATTATCCATTTTGACTATCTCATGACTT CCA
TGCAGCAGCTTTTTGACCTTTAGAAAGTTGATGATGAAATTCTACCATTTTAGAATG ATA
AGTCATTTTCTAGCTGTTAAGTCACAAAAAGAGCACTAGAGCAGTAAAACTTTTGAA GTT
TCATTGTGAGGTTGGGAGGAGTGGGCCTGATTATCACATTCTTGTCCTAATTTGTTA CT
GCTACTATCCTTTTTTTTTTTTCTTATTAAGAAGAAGAAAGCCTTTTCTTCCCTTCT TTTCA
AAGGGTAGGGGGTGGGTGGAATATATTAGCCTAATTTGTCATATTTTCCTTCTCGTA TAT
AACCATGCTACTATATATGTTGTACTCAAAATATAAGATTTTGTATACCTTTTCCTC TATA
TACTAGATAGTGTGATCCCCTCATGCATCATCTTCTTTTCTCTAGAAGAAAATGTTT TATT
CATGGTGACAGGGGAGGGAGAGGGTGGGAATGTTGGGATCATATCTTGATATCTTGT C
TAATTGATCATCTCAGGCAAATTTAGGGTGGTCATGTGAGTTAAACTAAATAATTTT ATTT
CATAGGATCAGCCCGCCCTGATCAAGAATTACTTACTAGCTAGCCAGACTAGTGGAG C
CCTAGCCGGAGACATTCTCTAAATCATGCCTTAACGCGCCCATCTTCCAAATAAAAA AG
GGCTAGTTAGTAAGAAAGATGGAAAGACCTTTATCCATAATTCTTTCCCAGTCTACC TCC
TTCCTTAATTGTGACATGTCCCGTTGATCCCACCTACGAGCTATCTGTCTTTGCCTA GCA
AGATAATTTTTGGTCTCCTATTCTTGCCTATTTTTATAGCCTGTCTTTATCAAGCGA GATA
ATTCTAGTTCTTTTATTTTTGCCTATCATGGTAGGAAATTGGTTCGGCTTGATTGAA ATTT
TTTAAAATGTTTACATATAAAAAGAGTACACGCATTCTGAACCCACCAACTCTAAAT CCT
GAACTTGCTTCTCCTAATTATGTAAGATAACTTTAATATTTATTCTCCTATGCTACT TTGG
G ACTTCTATTG CAGG AAAATAATTG G AG CTAGGTACTAC AT CAAAG GTTACG AG CAATA
TTATGGCCCTCTAAACCGAACTCTAGATTATCTATCTCCACGAGACAAGGATGGACA TG
G AACTCATACATCATCAACAG CAG GAGG CAGAAAG GTTCCAAATGTCTCTG CCATTG GT
GGCTTTGCATCTGGCACCGCCTCGGGTGGCGCGCCACTCGCACGGCTAGCAATGTAC
AAAGTCTGCTGGGCTATTCCGAAGGAGGGCAAAGAAGATGGAAACACTTGCTTTGAC G
AAGATATGTTAGCAGCAATGGATGATGCTATTGCAGATGGTGTTGATGTTATTAGCATTT
CTATTGGAACAAAAGAACCTCAGCCTTTTGATCAAGATAGCATTGCTATTGGAGCAC TTT
ATGCTGTGAAGAAAAACATTGTTGTGTCTTGTAGTGCAGGGAATTCAGGACCTGCAC CT
TCTACATTGTCTAACACAGCTCCCTGGATTATCACTGTTGGTGCTAGCAGTGTTGAC AG
AGCATTCTTGTCACCTGTTATCCTAGGAAATGGCAAGAAATTTACGGTAACACGATA ATC
TATTCATTTTCTGTACACTATTTCATCTAAAATGTTGTAACACTAGGATCATAACGT TTTC
CTTTATCTATTTAATTACATTCATATTGGAATGAAATTGAATCCATTTTTCGTTTGC TTAAT
ATCAGGGACAAACAGTTACACCTTACAAGCTCGAGAAGGAGATGTACCCTCTAGTTT AT
GCAGGACAAGTAATCAACTCTAACGTAACCAAAGATGTAGCAGGGTACTCTCCTTGC CT
CAAAGTTTCAATATTTTTAATTAATAATCATAATTTTCTTTTGGTTGATTATGTTAA ACACT
ATCTGAAACTTTTTCAAAAAAAAAATTCAGGCAATGTTTACCAGGTTCCCTTTCGCC GAA
AAAGGCCAAGGGGAAGATAGTAATATGCTTGAGAGGGAACGGGACAAGAGTAGGAAA A
G GTG G AG AG GTG AAAAG G G C AG GAG G AATTG GTTAC ATACTAG G AAATAATAAAG C AA
ATGGAGCTGAATTAGTAGCTGATCCTCACTTTCTTCCAGCCACTGCAGTGGACTATA AA
AGTGCAATGCAGATTCTCAACTACATCAATTCTACAAAGTCCCCAGTGGCATATATT GTC
CCAGCTAAAACAGTTTTGCATTCTAAACCAGCACCTTACATGGCTTCCTTCACTAGT AGA
GGTCCAAGTGCAGTTGCACCTGATATCCTCAAGGTCAGAATTTACATAACAAACTTA AG
ATATTTACCTGACTTATGATTTATGCTTCCTCATCTAAATTAAATTCTGATTTTCGC TACTT
CCACAGCCTGATATCACCGCACCAGGGCTGAATATATTGGCAGCATGGAGTGGCGGA T
CTTCCCCAACGAAACTAGATATCGATGATCGTGTGGTTGAGTATAACATAATCTCAG GT
ACTTCCATGTCTTGCCCACATGTCGGTGGCGCCGCTGCACTTTTGAAGGCTATACAT CC
CACTTGGAGCAGTGCTGCAATAAGATCTGCTCTTATAACCTCAGGTACCTCTCAACT AC
TTTTGAACTTAACTTATATACACTAACTACAGTATTTTAACCTGTTATAACATATAT AGTTA
TTTTGCTGCAGGTGACCTGATAGTGTGTAAACATTATTTTACATTGTCGGTGTATAG AAT
TTAAACTCCTTTTTCGTCCAAAATTTTGTATTTTGAACTGATCAATCGTTATATTTT CAGCT
GGATTACGAAATAATGTTGGTGAGCAAATAACGGATGCATCAGGGAAGCCAGCAGAT C
CATTCCAATTCGGAGGAGGGCATTTCAGGCCATCAAAGGCAGCAGATCCTGGACTTG T
CTACGATGCTTCCTACCAAGACTATCTTCTCTTCCTTTGCGCTTCTGGTATTAAGGA TCT
TGACAAATCCTTCAAGTGTCCCAAGAAATCACATTTACCTAACAACCTAAATTATCC ATC
TCTG G CTATTCCC AATCTCAATG GTACTGTTACTGTTAG CAG AAG GTTGACAAATGTTG
GTGCACCAAAGAGTGTTTACTTTGCCAGTGCTAAACCTCCATTGGGATTCTCTGTTG AG
ATTTCTCCTCCCGTCTTGTCTTTTAAGCACGTTGGTTCGAAGAGGACGTTCACTATT ACA
GTGAAAGTTCGAAGTGATATGATTGACAGTATTCCGAAAGATCAGTATGTGTTTGGA TG
GTATTCCTGGAATGATGGAATCCATAATGTTAGGAGTCCAATTGCAGTCAAATTGGC A
SEQ 74
ATGGCAACACGTAGAAGCTCTAGCTCTGCTCTCACGGCCCTTGCGGCGTCTCGTTCCC
GCCTACTCTCGCGGTTTCGTCCTGCAGTTTCTCGTCTCTCTCAGAATACTTTACTCG GC
ACCGGCAGGTGTCCACCTCCCAATAGTGGATTTTTTGTTGCAGAAACAACTGCTGCA CT
TTGGCCGAATTATAACGTGTTGTCCAAAAGTTTCGTGCACTCTTACTCTACTACTGC TGC
TAGCTCCGGACAGGCACGACTTTCTTCTTCCTAATTGCATTCTTCTCTGTTCAACGA CTT
TTCTTCTTCCTAATTGCATTATTCTGTCCGTTCAATTGGAAGTGCTAATAGAATTAA CTCT
AATTG AC GTTTAG ATTAAACTTG AATG AATG CTGTTG GTTCTTTTATTTAG CTTTTG ATG C
GAAGTGAAGTAATCTCTATTTAGATATTGTCAGTTAGAGAACTATTTTCTCAACGTT AAG
G AACATCATTTC CAG CCTTTTTTTTTTTTG CAG AGTG GAAG CCTTAAATTGTGTATTTTTG
GACGAGAAATAACAAAAATGGTCCCTTATATGTGGGGTAGAATAAAATAGTCCCTTA ATA
TACTCCTGAGCAGTTTTGGTTCTTCAAGTTTGCAAAAAAGTGAGCAGTTTTAGTAGT CGT
CAATTATTTTAACAAACTCTGGTTGTTTAATTTGACGAATACGAAGTCGCATTTGGA GGT
GCATTTTTTGCCGTTTATGATATATTTGGTCTATTTCTGGTGTCTGATGGAGTTTCT GGG ATTTAATGGGTCTTTCCTAGTGGTTCTAGTTAAATCCTTTTCTTTTTCATAGTTTCGCTA A
ATCTAG CAGC CAAATTTTGATAAACAGTTG ACAAGATAAAAAATG CTCATG CGTGG CAAA
CATAG ATCCTTCTG ATAAGC GTCAAGCAGTG GAAAACACTTTTAG GTG CTG AAGTG G AT
TTTTATAAATTGGCAGTTACGTGTTAAGTGAGAAGTGGAACTGATAATCAATTAGTA TGG
TTGGTAAAAAAACTGTTGATAAACACTTTTTTTGCTAAAATAACTGTAATGACCTTA AAGT
TATTTACAAATTCTATAATTTTAAAGTATTTATTACATAAAAAGACGAAAAATAGAG GTAAT
TAAAAGTTATGTTAGAAGAATATATTGGAGATTACAAAAGATCATAGGGATAAAATC GTA
AAAGG CTTG GTCAAACAAAAAATGTTTATAAG GTATAACTTTTG ACTGATTTTG G CTTAC
AAGTTCTTCTCGTACGAGCACTTTTGATGTTTATCAAACGTGTAGATAAGCCAAAAT GTG
CTTACAAGCTAGTAGGACCCTCTTATAGCTTAGACAAATACATGTATTTAAGAGTCT ATT
TTATACCTACCTGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNN N AAAAAAATAAAAGTTACTACTTTTGTCATTTTCTAAT
TGTTGGACAGGCGGTTTCATTCTACTATTTTTTGACAATTTCGTGGGTCATCCTTGG GGT
AGGGTAGGCTGCCTTCGTCACACACCCTTGGGGTGCGACCCTTTCTCGGGCCCTACA T
GAATGCACGATGTTTCGTGCACTGGACTTCCCTTTGTGGAGTACACTTTATCAAAGT TA
CGTATAAACACATGCACTAAACTAGACTTTTGTGATAGTTGTTTCACTTGAGGCTGT CAG
TTTACTTCTTTTTTTGTTTGGCTGCTATTGGTGTTGCCATAAAAACTACTCTAGAGT TTAC
TATTGATGCTTAGAACTTATGGGGTGAATGAAAGGACGGGTGACTGAAAGGACATCT CC
TTCCTCAAACGTGTTATATCTTCTTAAACAAGAAATTTTGATGTACTCAAACAATAC TAAG
GGGGGTTGTGCAGATAAAAACAGCTATTTATAAAGGTCCATTGGTCTCCCTGGCCTT CT
CTAGAATGAGGAGTGCCATGATTGTAGATATTTACCTATACCATATCGCCATTTGTC AAT
TCTCTTGGATTTGTCCTGATGATTACTTATCTTTAATTTTGGGTAATTTTAAATTGT TTTG
G CATATTATTTGTTGTTATGTTAATTAGTATG AAG ATTTAAC AG ATTAATAATATG G ACTA
CACCGAGATGGCTCTTGAGGGTATTGTTGGTGCTGTAGAGGCGGCACGGACTAGCAA G
CAACAAGTAGTTG AGACTG AG CACTTAATG AAAG CTCTTTTG G AG CAGAAG G ATG G GTT
G GCTC GAAG AATATTCACTAAGG CTG GGTTG GACAACTCATCAGTTCTG CAAG AAACAG
ATCAATTTATATCTCAGCAGCCAAAGGTATGAAAAATGGAGACTGATTGTGGATTCT GAT
GAGTTCTTGGACTAGAGAATCAGATATTTTTTCTTGCAGGTAGTAGGTGATACTAGT GG
CCCCATATTGGGGTCACATCTTAGTTCTCTCCTAGAGAATGCGAAGAAGCACAAGAA AG
AAATGGGAGATTCCTTTGTGTCTGTGGAGCATATGTTGTTATCTTTTTTGTCAGACA CAA
G ATTTG GTCAAAAGTTATTC AG GAATCTCCAGCTTACG GAG AAG GCTTT GAAG G ATG CT
GTCAATGCTGTTCGTGGAAGTCAGAGAGTAACTGATCCAAGTATGTATATATTTATA TAG
CTTCATGTTTCGTGGCCATGTCTCTTATGATTTCATTCGTTCTGGTTGAATGGTAAA TCC
CAGTTGGCGAAAGGATCTTACTTTATCACAGCATGAGATGATCCTTACTTTGGTTGA TTG
GGTGGATCGTGCAATATTGTCTCTCTGGTTTGTCATGGACATTAAATTTTTCCTATT GAT
GGGGTGGGGGGGGGAGTCTTGTAAGGTGGTTAGAGGCTTAGAGCTGATAATATTTCA A
AAAGTACTTGGCAAAAAAGGCTAATTCCAAGTAGATGGTATAATACTCCATTCACAT GAA
CTTAGATTT AAATCTGAATTT GAAG AATGACTTTT GAT ATCAAAAGGTAGAAT GAG GAAT
TAGCAAAATTGCATTGAGGGAGTATGGATGTCCTTGATATATATGACAATGCATATT ACT
GTCTCTTAGTGACTACATCATGAAGTTTCAAAATGATTGACCATGCTCAGTGCAAAT AAG
ATATCTAATTGTGTCAAAATAGATTTTAGTATGGCTCCGATATTTATCAAGATCTTC CAGC
GGCTCTTCCTGTCTAGGAATGAGTTTGAAATCTCGATCCTAAGGGTGTGAACATGTT TG
CAATCTGACACGTTTGTTTACTATAGTCGTCAAAAGCATGCTTTGTGAGATAGCAAT GCC
TATCTTTGATTGTGCTACCTCAATCTCCTGTAATTTTCTCAAATCCTGAGCCTTACA GCA
ATTGAACAAGTGTTGCTAGAGTTAAGAAACTATTTGGTCATTGCCTTCTATAATAAA ATG
TCATCAACATAGTCCACTAGTAAATACATTAACCACATTGTTGTGAAGAGCGTGAAG TGA GAAAAAGCGACAACCCCCATTTCGCTTAAAGCGAGAAGCGTAGCACTCACTTTTTTGAA
GTGAAGCCGAATTTTCAAAAAAAAATTAAAATAAATACTGCATAGACAACACATGTA ATT
GTAAGCAAATGTTCAATACTTCAATGTAAAAACTAAATAGTAGCATCAATTAAAGCA CAA
AATGAGCATCCTATTCTTCTACAAGATTGTGAAATTCTTGTATTCCACTATCATTAT TATA
TTGCTCGTCATCTTCTTCAACTTCTTCTTCTTCATCAACTAGGGACAAAGTAGCCGC CTC
TTTTCCCTTCCTCTGTGAACTTGAAGTTGAGGTACTCCCCTTCAAACCATAAATCCT CTC
CCCAATTTCACACGCCTCCGCAACATCACCCCAAGTGAAATCAGAAGTTTCCTCAAA TA
CTTCTTCATCTGCATGATCTTCCGGGACTCCAATTAGCCATTCATTAGCATCATTGA TGT
TGTCCAAACTAATTGGATCAATTACATTGCGAGCATTGTAACGACGCCTCAATGTTC TAT
TGTACTTAATGAAGACTAGATCATTGAGACGCTTCAAGGTTACTTTGTTCCTCTTTT TGG
TGTGGATCTGCAAATAATAAGTAATTAGTAAGATTAGGACAATTGAGATATAATTTA ATTA
TCTCAATATACTAAATCTTTCTTCTTAGTTCTTACATGTTCAAACACGCTCCAATTC CTTT
CACACCCGGATGAACTACATATTAGACTTAGAACCTTAATGGCAAACTTTTGTAAAT CTG
GGGTGAAATGGCCATATTGCTTTCACCATTCAACTGTAGAAGTAGAACAAATTTTCA AAA
CATAATATTGATAAATTAATAACTTATTAACTATAAAACAACATATAAGCACTCGTT CACC
TGGTGACTTCGTCTTTTTTTGTCTAATCGCCATGTTTTTTCCAAAAGTTGCTCAGCA TTC
CTATAAATACTAAATTGCTCTGTTATTTTATCTTGCACGGATTCTTTGGGTATCAAC TTCT
CAGTACATTCATAGTATCCATTCCACAAATGTTCATCTCCTAGAATCCTCTCTTCAT TGTC
ATAAAACAGTTCCGGGTTCAAAATAAGTCCAGCTACATGCAAAGAGCTATGAAGCTT AC
TATCCCACCTTTTATTTATGAAATACACTTTTGTATTTTCTTTGATCACTAAAAGAG ACTT
GAATAGCCTTCTTTGCCCTATCCATTGCTTCGTACATGTAGCCCATTGGTGGCCTTT GC
TCCCCATCCACCAAACAAAGCACTTTAGCTAAAGGACCACCAATCTTCAATGCATGA AC
CACATTGTTCAAGAATGAAGGAGAAAGTATAATATCTGCAAATTCTCTCCCTCGAGC TTC
CCTTCCATAGGCACCGTTAGTGTACTCATCTGAAACAAACAACTTCTTCAAATTGCT TTT
TTGCTCATACATCCTATGCAAAGTCAAGAAAGCGGTAGTAAATTTTGTCTTTGCAGT TTC
ACCAAGCTTCTTTGTTTAGTGAATCTCTTCATCATATTCAATAACAAAGGCCTTTGA ACAA
TAGAG GAATG CACTCTAATTG CCTG ATTAAAG ACTG AATTG ATGG GTCTTTCCTTG AAAA
TGTCACCGAAAATCAAATTAATGCAATGTGCTGCACACAGAGTCCAATAAATATGCG CG
TACCCAACAGACATCAAATCACCAGCTTTAACATTTTCACTGGCGTTGTCCATGACA ACA
TGAACAACATTTTCTGCTCCAATAGAGTCTATTGTACTCTTGAACAAGGAGTATATT TTG
GTTGAATCAGTCAAAGAGTCGCTTGCATTAACGGACTCAAGAAACATACTTCTTTTA GGA
GAATTCACCAAGATATTGATGATCATTTTTCCATTTCTCGCCGTCCACTTATCCATC ATAA
TGGAACAACCAAACTTGTTCCATTCTACTTTGTGATCCTCCCCGATTTTGTTCAACT CTG
CCGCCTCTTTTTTTTAGATATAGACCTCTAACTTCATGATAAGTTGGAGGCTTCATT CCT
G GACCATATTG G CCTAC GACATC AATAAAAG CAG AAAAAGTGTCAGTATAATTAACACA
ATTAAAAGGAAGACCTGCATCATACATCCACCACGCAAACATTGTGACTGCACGAAC CC
TCAAAATCGCTTTGGCATCAATTTGAGGATTACCACTTTTTCCTCCCTTATCCCCAG ATT
TTTGCGGGAAGTAACAATCCATAGGACCTTTGGTCTTGCCAGTAGATCCACAGCTAG AA
GACATTGGTTGCATCCTTCCCTGCTTTTGTGATTTTGGTGGAAGCGACGAAGCATCG TC
ACCTTCTTCTGTTTCATCATCGTCATCATCATGATTATACAGTTCTTGTTCATGAAT CATT
TG AGTCTTTAACTCTTTTTTTTTTTG AAG GAATG CTTTCAATTCTG CCTTCACATG CGATG
GAACTTTAGGACAATATGCGACATTTGGATCACCACCGATTAGATGCGCTATTTGAC CG
ATAGATTCCCCCATTTGAAATCTTGTCACAAAAAAGACATCTAATTGCCATCTTGTT TTGT
TGGTCTCGCTAACTCTTTCAGAGTAAGTCCAAGCCGGATCTTTCCTATCTTCTTTTG GTG
CCATTAAAAGAACAACTACATAACACATAAGAAAGACAATAAGAACTAAGAATAAAG GAT
ATAAAAATTAG AG G AAG GG CAGTAATTCTCTTTTAAAAATTGG G CAG AGTTAAAAAAAAA
AAAACTAGGCAGTAATTCTCTGTTAAAAATTGGCATCAATTCTGAGAAAGAACCGAA GGT
TAAAAAAAAAAATTTGCCAGCATTTCGCTGCTTTTTGGACGTTTAAACAGTGAAAGA AAA
AGAAAAGAAGAAGAAGAAGGAGGAAAATCAGAACATACCTGTTGCTGTTGAAGACTT GA
ACTTGAAGTTGAAGACTTGAACTTGAAGTTGAAGTTGAACTCTTGAAGAAGAAGAAG AA
CCCCAGTCGATGCTTGTCGATGCTTTCAGAAGTTGATGAAGAAGAAGAAGTCGACTA GT GTCTCTGCTTTAAAACCCTAGTCGCATTTGTTCGTTTAATGAAAGACCAGACATCTTGTT
TTTAATAAAATAGGGTTGAGTCTGATTTAAAACACAGAAGCGATCGCTTCTCTCGCA TCG
CATCGCTTTCCTGCTTCTCGTTTTTTAGTGGGAAGCGGTCGCTTTTCTACACCTAAG TC
GCTTCACCCTGTTGAAGCGTGCACTTTCTTGCTTCGCTTCGCTTCTCGCTTAAAGCG AG
GAAGCGGACGCTTTTTTAAACACTGATTAACCATGACCATTATAATTGTATACGGGT AAA
ACCGAGCCCATGATGCACCTCGATTTCCGACAAGAGAAGCCAGGCTCGAGATGTGAT G
GCAAGGGACAAATATCAAGCCGAAAGTCCCATTGAGCCAGAGCCCTGGGACACGATG C
CTGCCCTCGGGAATATCGAGGTCATAATTACAGAATCGGTCCTAACCTCGAACAACT TC
GAGGAACATTATCGGACGATCAAGCGTAGCCAACAGAAAGCCGAAATATCCATGACCG
GCCGAGTATCACGACGGGGATCTCGGCACGTATCGATAAGGAACCTTCAACCAGTTA A
TCAGAAGACCTTTTACCTTTTACAGAGTTGTACCTAAAGTAGGACTCCCCTACTATA TAA
AGGGGGTTTGATAATTCATGTAACACATTGAAAACACGCGTTCCAAGGAAATATATT ATC
ATTTTCTCTTTTATCTAGCTTTTTTCACTTGTTCATCAGTGTTGACTATAGCAAGCC CGGG
ATCGAGGGTGAACAATTTTACTAAGGTTGAATCTGTCTTATTCGCATGGTTTGAATT CAT
TTTATCTTTACTAGTTCAATCTAATCCAATTTATAGCTTTGTGTCAAATTAATCCGC GTAT
CCTTAAAACCACTTATAAATTCAATTGTTATCCGATTTTGAGGGTAAACAATAATGA AAG
GAAATTGAACGATCAGCTTCACTTTGGACACTTCCCATGATATTGGACCACGAACTG AA
TTAGAGAAATTCCAAGCCACACTCCCTATAAGTGTTTTAGACCATATAAGTGACTGT CAA
CAGCATACTAAGCCTGAATCCTTTGAGCAACAAGCTAGAGTGGTTGCTTCATGTACA CC
TCCATGGCTAATTCTCCATGCAAGAAGGCATTTCAGATATCCTGTTTATAAAGTGTA CAT
TAAAACATTGACGCTAAAAAGTTAAACAGTTGTGTGGGTGGATACCATATAAGTAGC GA
CGTCCAAAAAACACTTACCAGAATTAGCATGAGTTTCAGATTCAAATTCCTGCGGAG GC
AAAAGACACTAGGTGATTTCTTTTTGTATGTCCAAGTCTTGGTGAACATATGTGAGA AAG
AAAGAGTTTGAG AG AGAACTCAG CTTG ATTATTAC CTTAAG CAGTG GG CTTTAAATAGCT
AAGTGCACATATTCTAAGTATTAGATCAAGTTTGAGACCTAGTAAACTTCCACAGGA AAA
GGGATAGAGTCAGATACCTAAACCTAGTTATACTCTAAAGTTTTTATGTAATTAGAC CTC
CTTAGTCTCTATCCTAATGGTATAGTCTCATCTGTGCTGTCCCAAATTAATATATTG AAG
AGAAATTCAATCCCAAAGTTGTGTGTGAATTTATATGGTATCAGAAGCCATGTCGAT ATC
TTCCTCCTCCTCCCCTCCATCACCTACACTCGTTAACCCTCTTTCTTCGTCATCTTC CTC
GCATGCACCCCTTGACCATGCTCATCACTTCATTTCAGTTAAATTAACTTCTACGAA ATT
TTTCTTTTTTGGAAGACGCAACTATTACATTTTCTTCGAGGACAAAATCTCCACAAA AACT
CCACGGCTATATTGATGGAACTAATCCTTGCCCACCATCACACACTACGGTTGAAGG CA
AAGAAATACCAAATACAACCTATGTATAATGGATCTAACAGGACCAATTGATTCTTA GCC
TGTTGATTTCATCACTTTCCAAAGAAATGTTGCCCATGAAAATTGGTTTAAATACCT CCA
AAGCAGTTTGTGATGCACTTGAGGCAGCCCTATCCGAACCTTCAAATGCACGAATCC TC
AATCTTCATATGCAACTTTAAAACTTGAAGCAAGAAGATCTTTCGGTTACTCAATAC TTG
CACAAGGCCAAACTCATCTCCGACGAGTTGGCAGTTGCTGCAAGGCCCCTTCGTCTT G
CCGATCAAAATGTGTACATCTTTAAGGGACTGAGATCTGATTTCAAGGACATTGTTA CAA
CTCTCTCAGAACGACATGAACCAATCACATTCTCAGAACTTCACAGCCTCTTGCTTA ACC
ATCAATTTAGACATGGTTCCTCTATCTCCTCACTTTCCTTAACCACCCCAAAACCAC CTG
CTCTTCAACAATACCCACAGCTAACTTCAATCAACGAACTACAAATCTGATCGTAAT AAT
G GTTTCAATTCAAATAG GG G ACG AG G CAG ATCTTCGTGTG AAAGAG GG G GTAGAG GTG
GTTGTTCATCCTCAAGGAATTTCTCTAACAATGGACAATCTTGGTCTCAATATGATC AGC
GAACCCGGTGTCAAATATGCAATGGTACCAACCATCTTGCATCAACTTGCTTCCAGA GG
TACAATCACTTGATTAACCCTATGGCTTATTTGTCTAACCAAGCTCCTTTACCCTCA ACTT
TGCAATGGTTTGCGGACATTGGAGCCACTCGCTACATCACTTTGGATCTCACAAATA TT
CATCAAGTTGAAGATTATAGGGGTTCAGATCAGGTCCAAATTGGCTATAGACATGGC CT
TTCTATCCATCGCACTGGTAACTCCTCTCTCTGATCACCCTCTTGGTCTCTCTATCT TAA
GAATATCCTTCATGTTCCTTCAATTACCAAACGTTTACTCTTTGTTCAACTTTTGCT CGTC
ACAATAATGTCTTCGAACTTCATCCCTTTCATTTTGTTGTCAAGGATCTACAATCCA GGA
CACCTCTTTTTACAGGGCAGAGTGATGGCGATTTATACACACTTCCATCCAAGTCTT CTT CTTCTTCCATCTCCCAGCCAGTTCCAGCCTCTCCAACAGCTTCTCTATCCATCAACACAT
CACCTTCATGCTGACATCTTCATCTTGGTCACCCCCATCAACTAGTACTTACGCAGA TTC
TTAGGACCTACTACAATCTGAAATGAATGCTTTGCTACGAAATAATACCTGGTCTTT GGT
TCCTCATAATCTTTCAATGAATGTTTTAGGATGCAAATGGGTGTTTCTCATTTAAAA AAAT
TTCTATTGGGGCAATTAAGAGATGAAAAGCCCATCTTGTGGCTAAAGGTTTTCATCA ACT
TGAAGGCCAGGACTACTCTAAGACTTTCAGTCCAGTTGTAAGGCTGCAACCATTTGC AT
TGTTCTATTTTTAGCAGCTTCACATGGGCGGTCTCTCCAACAATTTGATGTGCAAAA TGC
AGTTTTACATGGTGAGCTTCAAGACCATGTGTTCATGAGCCAGCCTTCAGGTTTCAT CC
ATCCTCTTTTTCCTCATCATGTTTGTCAACGTAAGAAGTCACTATACGAGCTCAAAT GGC
TCCCAAGGCATGGTATATGCGTCTCCATAAGTTCTTGCTCAGCGTAGGCTTCATCAC CT
CTAGATCGGACACTTCCCTGTTTGTCTGCAACTCAAATGGTGTTGTCGCCTACCTCT TA
GTATACGTTGATGATACATAGTCACTGGCAGTGGTACCTCCTTTTTAGAATCCATTT TCC
TCAAACTTGGAGATGTCTTTTCCATATGTAATCTTGGTCCTCTCAGTTTCTTTCTTG GTCT
TCAGGTTTCACGTGATCACCATGGCATCTCTATGTCCCAAGCTGAACACATTAAGAC TA
TTCTTGCAAGAGCACGTATGTAGCACTGCAAACCTTTAATTACTCCCATGGAAGTGA AT
GTCAAACTTCACAATGGAGAAAGTCTTAGCTTTCATGATCCTACCTTGTACTGTCAT ATT
GTGGGCCTTACAGTATGTTACTCTCACTTGGCCGGACTTAGCTTTTGTGGTGAATAA AG
CTTGTCAATTCATGC ACAATC CTACTATG AGTCAGTG G GCAG CAGTCAAG CG CATACTC
TGCTATTTGATGCATACCCAACGTATGTGTTTTCACATTCCTAGGTCTCTTACACTC ACT
TTTCAAGCCTTCACACACTCAGATTGGGCAGGTTCACTCGATGATCGTAAGTCCACT AC
GGTTATGCCATTATCTTGGGTGAAGCTATTCTCATGGTCGTTCAAAAAGCGGCGCAT TG
TAGTAGATCTTCCACAGGTTCAGAGTATAAAGCTTTAGTAGATGCAGCTGCCGAGCT GA
CTTG GATTCTGTCTCTCTTGTTTG AG CTTG GTGTTTAACTTCC CAAAG CTCCAATTCTAT
GGTTTGACTACCTATCTTTCGGTAATCCTGTGTTTCATGCACGAACCAAGCATGCGG AA
ATTAATTTTCACCTTGTTAGACAAAGTAGCTCGAAAGGATCTCACAGTTCAATTTTT ATCC
TCCAAAGATCAGCTTGGTGATGTCTTCACAAAGCCACTAGCTTCCTCTAGATTTGAG TTC
CTTTG GTC GAAG CTCAATGTG GTTTATCCACCTCAGCTTGCAAG G GAGTATTGTATCAA
CTTTGAGTCCTGGTAAACTTAGGATATAGTCGGGTACCTAAACCTAGTTATACTTTG AAG
TTCTTATGTAGTTAGATCTCCTTAGTCCCTATCCTATAGTATATACTCATCTCTATA AATG
TACGACCGCTGTACCAAATTAATACATTGAAGAGAAATTCAATCACAAAGTTGTGTG TGA
ATTTATACTAAAAGGAAAGAGAATTAATCACAATGAAAATACAATCAAGCTATACTC TATT
TACAAGATCCTAGAATATTCTAGAATATTGACAAGATTCTAATAAGAGTTAGCCCGT ATC
TGTGCTGGTGGGAGGTAGCAGGTATCCTTTAGAATTAGTGGAGGTGTGCGCAAGCGC C
AGAACACCGTGGTTATTAAAAAAAATCCCTACGAATTATTGGTATGAATAACAACAA TCT
CCTCTATCATGGAATGACTCCATATTAGATGTATCAATAATTTTCCAACGTCTTTTA AATG
TGGAAACACTAGAAAAAGTTGACACATTGATTAAGAAAGTAACAAAATGATACGGGA AG
TACGAATATCTTTGCGTGAGGTAATGAATGGAGTTGTTATGTCTGTTGAAGGCATGA TTA
G GATTAGTG ATG G AG GAG GAG GAG AAAGTCATTGAATG CTG GC GATGTACTTGTATCA
TGAGAAAAATCTCATCTGTATTAGTAGGAGGATGTTTATGAACTGATTAGACTCGGA CA
G GACAAACTTGTGTTTTAAGAAGTTAGG G ACTAGTAG ATGTCAG GAG GG AAATTTTATTT
CATAG AATTTG GTG AAAAGTATAG AGATGAGTGG G AG GTTACATTTG CTGG G ACG AACT
TTACTTTTTGGGGGATTATAAACATTTGCAACTTTTCTTGAAAATGGGTGAGTAAAA AAA
GACACAATTCTGATTAGGATTGAGGTCTGTCTTTTCCAGTAGTGTGATCATGAAGGA AG
CAAGTACATCCAAGCACTTCCAATTGTAGATGGTGGAGATTTTGATCTGGAAATTTT AGT
GTGGACAGTAAAGTTGCAACATAGAAAAGATTGATTTGTACATTTGCCATCGATAAC AAT
GCTTCTAGCAACATGATTCTAGGGTGAATGGGTTTGTTGGGGCTCTATGAGTCTTCC TC
GAGTGGCGGTGAGGTCTAGAAGTCGATGCAAAGAGAATTGAGTGGTACAGTCAGTGA C
TATGACTCATTCTGTCAGCCTCTGTAATTTTTTGCCGCTATAGCTTTTCAGTAAGTA GTTT
CTTCCCTGGTTTTAATGTCATGAAAACTAGAAATAAAAGAATGAAAAATGATGGAAA AAC
TTGATTATTTCCTCAAGTTTGATTAAGAACTTAAACTAGTTACAATGTTGGTAAAAT ACAG
AATATGGTGAAGTAATTCTCCTAATGTGACTCATTCATAGCAACTGAGTTAACCCAA TAA ATGCAAATGACTGGACGATCTCGACAAATCCAAGTCTATCAATTTCAACAAGTTTCGCTG
CCTTAATCATGACAATATATTTAAGTGATGGTCATTAAATTGGAAAGAGTTGCTGTT GCT
CTTGTTTTTGCCATCATTCAGCTGTTCACTGTGGTAGATTATGGTTTCCTACCAAGT CCA
ATGAAACTGAGCAGTCTTGACAATGCCTGATGTTCAGTTTCTAAAGTCTGTTCTCTC TCC
AAAAGTAGAGGAACATAATGTTATCTGATTGGCTCGGGAAAAAGTTGTATGTAGGGT GA
AACCTTAATGACATAATGAAACATGTAATGGTCTTTGTGCCTTTGGTTCATTATGTC TGC
TACTAGATACTGAAATTGCTGCTGAAAGTGCTTTTTGAGGTGTCACTCATTTTTTCT TGC
TG CTATTAATACATAG CGTTCTG ATTCTTTTCAG AC CCAG AGG G AAAGTATGAG GC ACTT
GAGAAATATGGAAATGACTTAACTGAACTTGCCAGACGTGGAAAACTTGACCCGGTG AT
AGGAAGAGATGATGAAATACGGCGCTGCATCCAAATATTAAGTCGGAGGACAAAGAA TA
ATCCTGTTATTATTGGTGAGCCTGGAGTGGGGAAAACTGCAATTGCCGAAGGGTATG AT
CTCTAGCCTTTTTTGGTCTCACGGGGTGATGTATGAACATGTTTTTCCTTATATTTA TTTG
TCTGGATCCTGGTTCAGTTGAAAACAATCCAAAAACAGTAATGGAATAGCAGATCTG TG
GAGGACATTTTTATTATTACTTGTCTACAATGATATTCTTTTGGTTGGATATGCTGT TTTA
ATTAGTTTTGTTGAATAATGCTGCCTGGCAAACCAAACATTGAACTTTAAAAGAGTT ATG
TTTCTAGAAAAGATATGCTCGGAATAGCATGATTATCCACTCAGAGAGGTTTGTGAT ATT
TAGCAGACTGTATGTGGGGTTTACAGGAAAAAATCATTGCTAATACATATTGTTTCT GGA
CAATACGTTATTCCTGTTAACTATTTGAATTGTGAGATGGGTCAGGGTGTGTTATGT GGA
GTTGTTGAAAATTAGTAATTGTAGGATGGAATGAAACAATTATAATCTTTTTTGTTT ACCA
TGGTTCCTTGTGTTTATTTTTAATGAGCAGTGGCGTGGTAATGAAATGTAGTTATAG AAC
TTCTTACTAAG GG GTCTGC CTATATGTTG CAG AATCAG G ATCG CTG AC AAGGTTCAGTG
AAGAGTTCTGATGGATGCATTCAACATTTCTTGAAAATATATATCAAAATAATGCTG TTGT
TAGTGAATCACTGAATTTGTGTTAACTTCCCTTTGATTTGAAACTACCTTACCCGTA AGC
CTGTAAAATTGAGGTGGCTTGCAGAGCCTTTGAATGATCAAATTTCTATTTAATAAT TAAT
ATAATGCCAAAATTGTGGCAGCTTGTAGAGCCTCTGTGTGACCTCGTTTTAACTTTT TAC
ATAGGTCCAAGGTAGTGTTAGCTGTAATATTTGCGTGTACTTAAGAGTCCTGTTCAA GT
GGTGGCATTTTTCACGTCATCTACTTTATGCAATATGTTATGTTTCCAGCCTTCGTG AAA
TGGGGATGTGTTTTTGACAGATTAACTGATAAAAGTCAATCAGTTCTGTCTCTTGTA ATG
ATCTTTTTCCAG CAAAG GAG GCTTTTTATTTATTACTTGTTAC CAGTAC GTTATG AG ATTT
AGAGCCTTTGTGGTTTGGATATTTAAAAAGTTAATCAATACTTACTTTTTATTAGAT CAGT
TGTGCTAGTTGTAACTTACTTATCTTACATATCGGAATAATTAGTTTGGTTTAACTG CCAG
ACGAATGTGTTTCTGCACCGGAAGTGAATAATTCTAAATTGATATTGGATAGTGACT TAT
TTGAGTGTATCTGCAATATAGTTTTTTTTTGCTTAGACGTAGAGAAAGCATATGATG AGT
TGGGAAAAAATGTCGTTTAGTGGGTGCTTGAGAACAAAGAAATTTCTACTCAAAATA TAT
AAGTACCATAAAAGATATATATGAAGGAGTGGTCACAAGTGAGGGACCAGTGGGCGG A
GATATTGAGGAGTTCCTTGTAACCGTGAGTCCATAATAGGGAGCTGCCCCATAGTTG CT
TACCCTGGCTATGGAATAAGTTATTCAGTAGCAAATAGGACGGCATTATGGGATATG CT
ATTTGTCGATGATATTTTGTTATTTGACGAGGGGCAGGGCAAGAAAGTAACAATCAT AC
CCGAGAGAGTCAATTGTTAATTGACGAAATTAGTTAGGAGCCGATCAAAAGTTGGAA TT
ATGGAGAAGCACGCTAGAGAATAAGGATTTTACAATAGGTAGAAGTTAAACAGAAGA TA
TG CAATG CAAGTTTAG CTTTTG AAGTG AG GTTAG ATAG GATACTAGTGTCG AG AC AAAA
AATTCAGATATCTAGACTCGATCTTTCAATAGAATGACATGATAGATGAATATGTAA CAC
AATAATATGATGGCCGAATTGAAGGAAGCCCATGGAAATGCTTTGCGATAATGATAT AG
CTACTAAAGTAGAACATAAGGTCTATGGAAACTGGTGATATAAACAGGGTTATATAG GA
GTAAAGGCTGGAATTTTAAGACCCACAATATCGGCAAGATTAGCATCGTAAATATGT GG
TGTCAGATGGATGTACTATACGACCCAAAAATGGTTGAGTCCAAGTCACTACATTGA CA
AGATGAGCATCGCAAATATGTTGCGTCAGATGGATGTATACACACCAAAAATGGTTG TG
TCCATGTGAAGGTGCATGTAGCACACAGTGATAGTAAATTGAGAGATGGCCACATTT CC
ATCATGTCTTTCCTAGGCCTTCCAAGTGCATTGGTTTATTGGTGAGACTATGATGAC TAA
AGCTGTTGAAAAGGTATGAACTAGATCTAAAATTACATGAAGAGAAGTCATCTAGAA TCA
CCTACAATCTCACAGAATCTGCGTGGATTCATTATGAACATAGCACAATGAAAGCAA ATT ATCAAAAG CATG CAATAGTAATTAG CTAG AGTTG AAG CCTAGTCG CTATTG CTCTTAC AT
TAGGTTGTGTGTTTTCCAGGAGCTTTTAATTTGGTTAGAGATTTGTATATCAAATGT GAG
ATATAGAGACCCTCTAGTTTTAACAAACACCCTTAAGTTATCTCAAAAGAGAGCAAA ATA
GATAGATGATTCATATAACGATCCCAACTAGGTTGGATTTGAGGTATTGATTAGAAT GAT
TGATATAGTTATCCAAATTTTAAAAATCAACCTTATTAGTAAGGCAAAGATGCTCTT AACA
TGTTAAAAAGAAGTCGAACAAGAAATTGTTCTTCCTTTCCTTTGATATAAGATATTT CTCT
CCCACCATCCTGGAAGGAGAAGGTAAGGATTGTGAGAATGCATGGGAGAAAAATCTA T
CTTTTTAAATATATGATACAACGAGCCCTATATATAATATATATTCTACTCCTACTA CATAT
AGGACTAGGACATATTCTACTCCCACTGACTGGAATGATGCACACAACGGCTAGAAT AG
CCTCCAGAGAGGGTGGAGCAGCAACCGAAGAAAATGTTGGCGGGTTGGGCTGCCGGT
AGACAGAAGTTATTGAGTCTCTACTAAGAGAATAGAGGTTTTGGTATCATGAGAAAA GA
AGAGAGTACTTATTGACTTGATTATTGACACAATGAGAGTGTTTTTTATAAAGGATT CTTA
TTCTAGTGATATAAGCTTAAGTATTTATATTATGCTAATGATATGAATAGTGATTTT TCTCT
TGTAGTTAAGTAAAGAATACTCCAAAATATCATATAAAATATTTTACATATTCTCCT ATTCA
G ATACAATG AG ACTAG G CAATATTAACCTATAATTACTTTGATTCTTTTG GTATTTTAACA
CG GTTTTTAGTCTTTAAAAG AAATCAG G AAAAAGAG AG GTG AAAG ATG GTATCAGC CAA
ATCTATATGACAATTAACCTCGGAATGATTATGCTTTCTGCTCCTCATCCATGCAGG TTA
GCTCAAAGGATAGTCCGCGGTGATGTTCCTGAACCTTTGATGAATCGGAAGGTTATA AC
TTCTCTTCCTTGGTCTTAATTTGATTGACTTTATTTTATTGTGAAAAGGTCAAGTGA TTGT
GCATATGCAAGATCTTGGAAGATGGCCTTTGCGTAGTGTGCATTGGATTGCTTTCTC TT
CAATTTGAAAAAGATAGTATCTGGGAAAGGCATGTCTTGAAGAAATAAATCAATGGT TGA
AAACCTGCTGCCTTTCTCTTGAGAACCATGAGCCGTAACTTGTTGCTTACTGCAAAT AGT
TCTGTTTCTCGTGTTGCAGTGATTCATTTGTATGACGCCAGTATTATTGGATCATAA TTTT
CTTTTCAAGATATGAGTCTGCTAAAGACAGTTGATGCAGTCTTTACTCAAATGAACG ACC
GTGGTTAAGTAACATATTGTTTTAATCATTTCTGGGAGCAATTTCAGTCGAGTTTGA TAC
TAAGTGAGATATGCATAACATTGCACTCAAATGTCAAACTAAATTTTGATCATCTAA GCT
AGTCCGATGAGTTAGTCTCTTTTCTGAAACCCCCAACCCCTCACTCAGCAAAGGGGT AG
GAGAAAAGGAAACGAACCACCTCATTCTCTAAAAAGAGTGGAACTAAATTATGATGC CT
CCTAAAGTAAAGTAGTTGGACATAAGTTTCTGAAGTTTCAAATTTGAGAGTGGGATT CCG
AGGAAAGCCTTCTGTAGCTTGAACCTTAATTGTCTACTAGGATTTGTTAATTTCGAC TTT
TAAGTGCTAATACTGTAATTTGATTTCCTTTTTCCTTTTCATTTTATTTAGTTGATG TCTCT
CGATATGGGTGCCTTGCTTGCTGGTGCAAAGTACCGTGGAGATTTTGAGGAAAGGCT G
AAAGCTGTTTTAAAGGAAGTCTCTTCATCCAATGGGCAGATAATATTGTTTATTGAT GAG
ATACACACTGTAGTTGGTGCAGGTCTGGTACTTTTTTTTTAATATCCATTTCTCCAT GAA
GGAAGAAGTTTATTTCTACCGACTGGTTAGAAAATTTGCCAAATGTATTCTTTCTCT CTA
AGATCAAATCTCTATTATTTATAGAGTTCGATATTAAAGAAAAGTGCTGACTCAAAC TGC
TTGTCTGCTTTCAAATCTCATGGCGTGTGGACAGCAAGTAGTGACTGCTCTTTTTGT TG
GATGATTTTTATCCTTGCTTAATTCAATCTTAAACAATCCAAAGTTCATGATTTAAT TCATT
TGATGTCACTGGGAAACACTTGTTCTTCATTCTAGTGAGGTTAAAAAGCACTATTGC TTC
GTGATTGTGTTTTAGGTGATCCATTTTTAAATTTGAGCCTGGAGCTGGGAAATGCTT GAA
CTAAAGTTTACTTTTCTGTTAATTTGAGCCTGGACCTTGGAAATGATTGGAATAGGT TTA
TTTTTCTGTTCCATAAAGCTGATAAAACATTACTGATGATGTCCTTTATGTATGCAA CCGA
GTAAAG GAG AAG GTACCTTTACAC CCAAG ATATAGTTTCTGTTTGTAG CTGG CTTATATT
AGTAACTAATTCAGCGAATGTTACTTGGTCAAATTATGTTGATACGTTATATTCTAA AACT
TTTGTTTGTTTATACCTTCCTAATCATTGTGTTTGCTTATGCTTTGTGTCTAGGAGC TACT
AGTGGGGCCATGGATGCAGGGAATTTGTTGAAACCCATGCTTGGTCGGGGTGAACTT A
GATGTATCGGAGCAACCACTTTGAATGAATATAGGAAGTACATTGAGAAGGACCCTG CT
CTGGAGCGCAGATTTCAACAAGTATATTGTGGCCAACCATCTGTGGAAGATGCAATT TC
CATCCTCCGTGGATTGCGTGAACGATATGAGCTGCATCATGGTGTTAAAATATCAGA CA
GCGCTCTTGTATCAGCTGCAGTTCTTGCAGATCGATATATCACTGAGCGATTTTTGC CG
GACAAGGGTAGGCTAATGTATCCTTAGAACTGCAAGTTGTCTGAAATACTTGCTTTT CAT TCCTATAAAATTCTTGTGAACGTTTTTCATGATATCTTCAAATAATACAGCAGCCTAATG T
TACTTTTACATAATAAG AAAGTTACAG GGTTACAAGTAG CTTATTTTTATG G CTTCTTTAC
ATGTTTTATTG CATTG AGTG G ATCAATGG GTCCAG ATTTTCAAG CTTCTTCTAAATGTTTT
TAGCTGTGCGTGATCTGACATACGTTACTTGGGGCTTTTCACTTATGCTCAGTTCTT TCT
TTTCAGCC ATTG ATCTTGTTG ATG AAG CTG CTG CAAAACTAAAAATG GAAATTACTTCAA
AG CCAACTG AATTG GATGAG ATAGATAGG G CAGTGCTAAAGTTG GAAATG GAGAAACT
CTCCCTGAAAAATGACACGGATAAAGCATCTAAAGAAAGACTTAACAAGCTAGAAAG TG
ATTTGAAGTCCCTTAAGGCAAAGCAGAAAGAGTTAAACGAACAGTGGGAACGCGAGA A
AGATCTGATGACACGTATACGTTCTATAAAGGAGGAGGTAAATTGCATCTTTCATTG ATG
AGGTCAAATCAAAGTTGCAGTTTTTCTTTGTTTTCTCATGATTACTGTTCAATTTTT TCCG
TTGCGTAGATTGACAGGGTGAACTTAGAGATGGAAGCTGCTGAACGTGAGTATGACT T
GAATCGTGCTGCTGAACTCAAGTATGGCACCCTAATCTCCCTTCAACGGCAGCTAGG A
GAAGCAGAGAAAAACCTGGCAGACTACCGGAAGTCTGGGAGTTCGTTGCTTCGTGAA G
AAGTAACAGATCTTGATATTACTGAAATTGTTAGCAAGTGGACGGGTATACCACTAT CAA
ACCTTCAG CAGTCTGAG AG G GACAAG CTTGTCTTTCTAG AGAATG AACTTC ACAAAAGA
GTTGTTGGTCAGGATATGGCAGTAAAATCTGTGGCTGATGCAATCAGGCGATCTCGG G
CAGGCCTGTCCGATCCAAATCGGCCCATTGCAAGCTTCATGTTCATGGGTCCCACTG G
AGTTG G CAAAACTG AACTTG G AAAAG CTCTTG CTGC GTACCTTTTCAATACTG AAAATG
CTCTGGTGCGTATTGACATGAGTGAATACATGGAAAAACATGCTGTTTCACGGTTGG TT
GGTGCACCACCAGGTTATGTTGGATATGAAGAGGGTGGGCAACTCACTGAAGTGGTC C
GTCGGAGGCCTTACTCTGTGGTCCTTTTTGATGAAATTGAGAAAGCGCATCATGATG TT
TTTAACATTCTCTTACAGTTGTTGGATGATGGAAGAATAACTGATTCTCAAGGGAGG ACT
GTTAGTTTCACAAACACTGTTGTAATAATGACATCAAACATCGGGTCACATTACATT CTT
GAGACGCTGCAAAACACTCGAGATAGCCAGGAGGCAGTTTATGATGCGATGAAAAAG C
AGGTTATTGAATTGGCAAGACGGACTTTCCGGCCTGAGTTCATGAATCGGATTGATG AA
TACATTGTTTTCCAACCTCTGGACCTTAAGCAAGTTAGCAGAATTGTTGAGCTCCAG GTA
ATACAGATCTGTAATCTGTTGAATTCTGATTCTCCTGACTTCATACGTTTTTCTTCT GTGT
TGTTTTCTGTTTGCTGCGGTGTCATCTGCTTTCTGATTACTTTGACTTTAAGAGTTT TATA
AGCACTACAGCAGATTACTGTTTGTGCGTTATCTCTGTAAATTTCAGTTTTTCTGTG TGA
GAACAAAAAAATGTTTTAGTGTGCATTAGATCTCAAAATTACACATAAGTACATCTC ATTT
GCTTGGTGGTCGTCGTCCTAGTTTGTCCTCCTTGCTGCTTTCTGATGAGTGCATGGT TG
AGTATGTCAAGCTTGAGAACTGCAGCGCACTGCGCATCCTGTCTAATGTCTGCTCTT GC
AGTAGTTTTCTAACAGAGTATAATGTAAAATATATCATTTCATCTGGTGGTTAAGCT TTCT
CCAAGATGAAACATAATTTGATATCTGTTCTTTGTGGTTCTTAATTTGGGGAAAGTG TTT
GGCTATTTCTTATTTTAACCTTATCATCGCATCTGCAGCCATAGCATAACCTTGGTG AGG
TTCATGGGAAAGAAAGTTACCGAGGCTACCTGACAATATCGTTAACTGATGAAAATA TTT
GTAGAACAAAACTTTGTGTTTTCATTATCATTACTATATTAGTTGCTCTCTTTTATC TTTTT
TTCGTCCATCTTTTCTGTTTGAAGAGATTTTTCTTCTGTTTCATGAGATAATTCGAG GTG
GAACTGCTGAGTGCTATGTATTACATGCGGCTGATTATCTATTTCATTTTCTATAAA ACC
TTCTCTCTATCGTGAGAAAGCGAAGGTCTCCCTCTGAAGTTATACTGCTGATATTAG AGT
TTCTTAGAACCTGACGACTAGTTCTCTTTTTCTCTTCGCAGATGAGAAGGGTGAAAG AC
AGACTCAAACAG AAG AAAATTG ATCTTCATTACAC GCAG GAAG CTATCAGTCTACTGG C
AAATATGGGCTTCGACCCTAACTATGGAGCTCGACCCGTTAAACGAGTGATTCAGCA GA
TGGTTGAGAACGAAGTAGCAATGGGTGTTTTAAGAGGAGATTTTTCGGAGGAAGACA TG
ATTATCGTTGATGCTGATGCTTCTCCTCAGGGGAAGGACCTTCTTCCCGAGAAGAGA CT
GTTGATACGAAGAATTGAAAATGGTTCCAACATGGATGCCATGGTTGCCAACGAT
SEQ 75
GTGAATGTGAAATGTTTCTTTGTTTCTTTCTTTTTTTCTTTTTCTTGTATGTCACTTTTT TTT
TTG CAAG GCTG G AACTTTG AAACTTTTTGTTTG AAAAC ACAATC ATTCG CAGTAACAAAC
AAGAACCACCGTCCCCATCTTCACTCCCATCACTCTTCTTTTCTTTGTTTTCACACT TCAT
ATTTACTCTTCTTTCTCATCCTTTATATTTACATAGCAAAAACAACGTCAAGATTTG CAAA
AACACAGCAACCCCCCCAAAAAATGTCAAGATTTACAATGCTAGTAGTTCTTGTTCT TCT
TCTTCTATGTCTATGCCATTTATCAGTAGCAACAATAGGAAGTAGTAGTAATAAGAA GAG
TACTTACATAGTACACGTGGCAAAATCCCAAATGCCGGAGAGTTTTGAAAACCATAAAC
ACTGGTATGATTCATCACTAAAATCAGTTTCTGATTCAGCAGAAATGTTGTATGTTT ACA
ACAACGTTGTACATG GTTTCTC AG CAAGACTGACTGTTCAAG AAG CAG AATCACTTG AG
AGACAAAGTGGGATTCTGTCTGTTTTGCCGGAGATGAAATATGAACTTCACACGACA AG
AACACCATCTTTTCTGGGTCTTGATCGAAGTGCTGATTTTTTCCCAGAATCAAATGC TAT
GAGTGATGTGATTGTTGGGGTTCTTGATACTGGAGTTTGGCCAGAAAGTAAGAGTTT TG
ATGATACTG G ACTTG G AC CTGTTCCTG ATTCTTGG AAAG GAGAGTGTG AATCTG GTACC
AATTTCAGTTCTTCAAATTGCAATAGGAAATTAATTGGTGCAAGGTAAAACTTTTCT AAAA
GTTTATGCGGTTAGAGACAAGACATTTTTAAGTTAGTTAATTATATTATATCTCAAA TTGT
GGTCGCGAGGATTCATATTGCTTACTTCAACTTTTTTGGGACTGGGACGTAGCAGTT GT
TATTATATGTTAAACTCGTCCTCTCGCATGTTGGTCTGATTAATTTTATGATTTCTC TAGT
TG G CAGTGAAATTTG AATCTG G GATTTTTTG CTTG GTTTG ATACCATGTTGAGTTGTCTG
ATTAGTTGCATAACTAAGTGGTAACTGGTAAAGCTGCTCCCATATGATCGGAAGGTC AC
GGGTTCGAATCGTGAAACCAGCCTCTTGCTGAAATGCACGGTAAGGCTGTATACAAT AA
ACCTTTTGTGGTCCGGTCCTTACCCGGATACTGCTATGGTATAGCGGGAGCTTAGTG CA
CGGGGCTCCCTTTTGGCATAGCTAAGTGTGTTGAAGAGGGTTTAGTTAAATCCCATT CA
TCAGAGTTGTACTGTACAAATAAGCTAAAAATGAATTATTTTTGTGTATGTAAATTG GTGT
ATCCCATTGATAATACAGGTTTGTACTTTTTTGAATTTCCTTGTTAGAATTATTTTA AAAAA
AAATAAAAAATATCATGGCTCTGCCACTGTTGTGCTCAACTTATCTAAAAGCTAAAA CTA
TTAGAGATAAGATATACTTTTAATTACTTAATCATATTATGTCTGTTGATAGGTACT TCTC
GAAAGGTTATGAGACCACTTTGGGTCCAGTTGATGTATCCAAAGAGTCGAAATCTGC GA
GGGACGATGACGGACATGGAACACACACTGCTACTACTGCAGCTGGTTCAATTGTTC A
GGGCGCTAGTCTCTTTGGTTATGCTTCTGGAACTGCTCGTGGAATGGCAACACGCGC T
AGAGTTGCTGTGTACAAAGTTTGCTGGATTGGTGGTTGTTTTAGTTCTGATATATTA GCA
GCTATGGACAAAGCAATTGATGATAATGTGAATGTGCTTTCTTTGTCACTTGGTGGT GG
CAATTCAGATTATTATAGAGATAGCGTCGCAATTGGAGCATTTGCTGCTATGGAGAA AG
GGATTCTAGTCTCTTGCTCTGCAGGTAATTATGCTAGTCGGAAAATATGAAGAACTT CTA
GTACTTCTTAATTATTACATTTTATTTTATACTAGACCAGACTAGTTTAAAACTGAG CGAC
ATTAACAATGAAGATTCATTCATATTGCCGATTCTAACTTGCTTGGGATTGAGACGT AAT
TGTTGTTGTTGCTCTGCAGGTAACGCTGGTCCTGGTCCCTATAGTTTGTCCAATGTA GC
GCCGTGGATAACTACTGTGGGTGCAGGAACATTGGACCGTGATTTTCCTGCATATGT AA
G CCTTG GCAATGGTAAG AATTTCTCTG GTGTTTCACTTTACAAAG G G GATTTGTC GCTG
AGTAAAATGCTTCCGTTTGTGTACGCTGGTAATGCTAGTAATACTACAAATGGAAAT CTT
TGCATGACGGGTACCTTGATTCCTGAGAAGGTTAAAGGGAAAATTGTTCTATGTGAC CG
CGGGATAAATCCCAGGGTCCAAAAAGGTTCTGTGGTAAAAGAAGCTGGTGGGGTCGG T
ATGGTTTTGGCTAACACTGCCGCCAACGGGGATGAGCTGGTGGCTGATGCCCATTTG C
TTCCAGCAACGACAGTTGGTCAGACGACAGGGGAAGCAATCAAGAAATACTTAACCT C
GGATCCTAATCCAACCGCTACAATTCTTTTCGAGGGAACTAAGGTGGGGATCAAACC AT
CACCAGTG GTTG CTG CATTTAG CTCCAGAG GACCAAACTCAATCACG CAG G AAATTCTC
AAACCGGACATCATAGCACCAGGTGTTAACATTCTCGCAGGGTGGACAGGTGGTGTT G
GACCAACAGGGTTGGCCGAGGACACGAGACGTGTCGGGTTCAACATTATCTCGGGCA
CGTCTATGTCTTGCCCGCACGTGAGTGGTTTGGCTGCTTTGCTTAAAGGAGCGCACC C
CGATTGGAGTCCAGCGGCTATTCGCTCGGCTCTTATGACCACGGCTTATACAGTGTA CA
AGAACGGCGGTGCACTCCAAGATGTCTCGACGGGAAAGCCATCCACACCATTTGATC A
TGGTGCAGGACATGTAGACCCTGTTGCAGCACTAAACCCCGGACTTGTTTACGACTT GA GGGCTGATGATTATCTGAATTTCCTCTGTGCCTTGAACTACACATCAATCCAGATTAATA
GCATTGCTAGAAGAAACTACAACTGTGAAACAAGTAAGAAATACAGTGTCACTGATT TG
AATTACCCTTCATTTGCTGTTGTTTTTCTAGAACAAATGACTGCAGGCAGTGGAAGC AGT
TCTAGCTCCGTTAAATATACACGAACGCTTACTAATGTTGGACCAGCAGGAACATAC AA
AGTTAGTACTGTTTTTTCATCAAGCAACTCAGTAAAAGTCTCGGTTGAGCCTGAAAC ATT
GGTTTTTACTCGTGTGAACGAGCAGAAGTCATATACTGTGACTTTCACTGCTCCTTC AAC
TCCATCAACTACG AATGTGTTTG GTAG AATCG AGTG GTCAG ATG G CAAG CATGTAGTTG
GTAGTCCAGTGGCCATTAGTTGGATA
SEQ 76
ATGTTGAAGGCTCTTACATCCTCATGTCTGCAGAATCGTTTCCACGCCGTCACAACGGC
ATTTACCCCTCAAGTTCGCCGTGGCACTGACTCGAATACGCCCTTGCTTCGGGTTTT AG
GTTCGCTAAGAAGTTCGAATCGCAGGGTCCCTTATTTGTCTCGACGATTCTTTTGTT CG
GATTCTACTGATGGGTCCGAATCGAATTCCGAGGCTGCTGCATCCGAAGCCAAGCCG G
CCGAGGAAGGTGGAGATGCTGATTCTAAGGCTTCGGCTGCTATGGTTCCCACTGTTT TT
AAGCCTGAAGATTGCCTTACGGTTAGTTCAAAATAATTCTTTGCACCCGCACCGATA GA
TTTAGACGTGTCTTTAAAATAAATTGTATGACTTTTGTTAACTAATGTACATTCTCA GTTC
AATTTATCACTTCATCATTATTAACTAACATAATTTGGTGCATAATTATGTATTTTC CTGCT
CCATCATTATATAAGTACATTTTATGCTAATATTTGATAACTGCTAAATGACTCCTT AAGA
AAAGATGTTAACTTTTTGTTATAACG GTGTG GC CATGTCTG CTTGTG CACCAAAAAAG AA
GATATTAAGTGAGTTCCTTCTGTTGTTAGTGCTGGTTTTATGTCTTCTTTGGGTAGT TTT
G ATATG ATTTTATTCATTATTATTCAAATTTGTACATATG G ATG CAAACATGACG CAG AAT
TGGGAACAATTTGATATAGAAATTATTTTTAACTTTGGTTGCAACACTCTGTTAAGA TTTC
GGCAATCGAAAGGGATGGTTGTTTAAGAAGTTAACTTTCTAGGATATAAAAAAGGTG GA
TCACCAAGATTATATTTCTCGTAGATTCGTACACTCAATCATTCTTTACTAAAAGAC CTTC
CCGACCTTGACTTGATATAATGCTGTGAAGGTCAACATTTTATTTTTCCAAAGAGAG GCT
TTTGATACCCTCTTTGTCTCTAAAAGAATGGTTTTATTATTAGATGCTTCGATCCTC ATAT
TTTGAACCCAGCCATTCCGTTTGAGATGAAAGAATAGTATTGTAGACTTTTTTAAAA TCA
GGCCGTAGTATTAAAGAATCTCGATATCAATCTTCTTAGGATCCTCAGACACCTCAC ATC
GATGACCAACACTTGTAATGTTATTGTAGCTTCAATTAGCATTAGTATTTTAATGAA ATGA
GAATTGACAAATTTTTATAATCTAGTCGTCTTTTTAAAGATTTCTTGTCCTGAATCT TCTTA
GGATCATCAGAGAGCCTCACATCAGTGAGCAACTCTTTTAATGCTATTGTAGCAATG TA
GCTTCAATTAGCATTACCCTTTCAATGAAAAAATGTGGTATTTCAGTACCCCTCCTT CAA
CCACAATGTTATTGATTTTTGCTTCCTTGATATCTCCTCCACTATTTATTCTGCTTT TATAT
GGCTTTTTGGTACTATCCCTTCTTGTCTATATTTTCATTAATGTGGTGCTTATGCTT TCCT
G AG CCG AG G GTCTATTG GAAACAACCTCTCTTT CAT CACAAG GTAG GG GTAAG GTCTG
CGTACACACTACCCTCCCCAGACTCCACGGGGTGGGATAAGACTGGGTATGTTGTTG T
TGTTGATACTTCCTCCACTAAGGCACAATCCGCCAACTCCTTAATCAAGTCTTCCAT TAG
GGGTAATTCATTTGCAATCATCTCTTTTGCTTCACCTAATGATTTCTAAAGGTTTCG ATTT
TTCCAACAATTCTTTCATCTCCATCTCCACTCAAGCATATACCTCTCCATTTTCATC ATCT
ATGATCAGCTCTAAGTAATTTCAAGTTTTTTCCCAATCACACAGCTTGGATTTTCGG CAA
TGCACTTGTTCTTGTTCAAATGTTAATGTGTGTTATCTTAGTAGTGTTAAATTAGTC TCAT
TGGAAAGATATCATAAAATTTATGTATTTCTCACCTACACATAGATGTGTCTTTTCT ATTT
GATTGTTAGATTTTCTCAGATTGTATCACATACCCTTGGTGATCTTAATAAGGGGCA GCC
TATGCTACCCGATTGGTCAACTTTTTCACTATTCTTTCCGCTTTTGGTGATCTTTAA CCTC
ATCCCTATTTCCGTAGACCGTTTGACAGTATCATGCCTCTTTCTCAGTCATCTTTTT AGC
TATAAAATTCCTCCTTTTTGACCATTCTTCCAGTGAGTTTCTGCCACCATTCCAACC ATG
TCCCATTTACAGTGTAAAATTACTATTTTGGCCATTCTTCCAATGAGTTTTCTACCA TCTT
TCAGGCCAGTATTATGATTATTGTTCCAGATACTATTTTGGACGCTATTCCATTGCT ACT C C AGT C ACTGTTTAG C C G C C AATTTG AG C C AC ATTAG CTG C C AC CTTAC G C C AC AC C AA
TGATGGTCTTGATATTCGTGGTATCGCGTTCTCTCTGGTCATTGAGAATCGGTTCCC TTT
TTTTTTTGATAAGGTAAATTGTATTAATCAAAAGGGAGAAAAAAAAAACTCCCGCAT ACA
AGAAGTATACAAAAAGTAGAGAATTTACATCAGAACATGATTCTCTACAAATGACGC CCA
ATCTTCTACGCAAGTAGGGGCTACATGTGTGCACCATTAAGCGATCAAAGATAAAAG AC
TATTCTTAAGATGTGAAAACGGAGACTCAATCCCCTCAAAAACTCTCTTATTTCTCT CTC
CCCAGACTAACCACATGAAAGCTAACGGGACGAACTTCCACGCCTTTTGCTTGCTTC TT
CTACGGAAATTGGCCCAGCTATATAGCATTTCCTTTACAGTGTTTGGCATCACCCAT TGA
ACACCAAAAAGATTCAGGATTGCCCTCCATAAACCCCAGGACACAAGGCAACGCATCAT
AAGATGATCAACTTCTTCCCCTGAGCTTTTACACATGTAGCACCAACTAACATGTGT AAT
TCCTCTCTTTCACAGATTTTCAGATGTCAAGATCACTTCCCTCGCTGCTAGCCATGC AAA
GAAGCACACCTTCGTGGGCGCCTTAGGAATCCAGATCGATGAGTATGGGAAAGTAGC T
TCCTGTCTCNTGCCGCATCCCAGAGAAAATCCCTTTGAAGCCGCTCCAGTTTTTCTG TG
ATGCTCACAGGTGCTTGCAACAGGGATAAGTAATAGGTAGGGATACTTGACAAAGTG CT
TTTAATAAGCACTTCCTTACCGCCTTTTGACAAATACCGTTTCTGCCAGCCTGCTAA TCG
TTTTTCAACCCTTTCAATGACTGGATTCCAAACNGAATTTGTTGGACCATCAATTCC TTC
TGGTTTGAAACTCTGAAGACGTGGGGGAAAGAGACCCTCAGAGTAGCCTCTCCACAC C
ACCTGTGATTCCAAAAGCTGATTCTCCTTCCATCACCCACCTTGTAAGTGATGTTGC CAT
AGAAAGCTTCCCAGTTCTTCATGATGTTCCTCCACATGCCACACCCGAACGGTGTTG TG
ATTGCCTTGGTCCTCCAACCCCCTCCCGTGGAATCATACTTTTCTGCTATGACCTCC CT
CCATAGAGCATGCTCTTCTACCCCGAATCTCCACAGCCACTTCCCCAGCAAAGCTCT GT
TGAATACCCTGAGATCTTTTACTCCAAGTCCACCCCACTTCTTTGGGGAAGTGACTG TC
TGCCAATTCACTAGATGAAACTTTCTAGTTCCATCTGCCGCATCCCAGAGAAAATCC CTT
TG AAGTCGCTC CAGTTTTTCTGTGATG CTCACAG GTG CTTG CAACAGG G ATAAGTAATA
GGTAGGGATACTTGACAAAGTGCTTTTAATAAGCACTTCCTTACCGTCTTTTGACAA ATA
CCGTTTCTGCCAGCCTGCTAATCGTTTTTCAACCCTTTCAATGACTGGATTCCAAAC AGT
AGTATCCTTTTGCAAAGCACCCAATGGTAGACCCAGGTAGGTAGTGGGGAGAGAGCC C
ATCTTGCATCCGAGAACATGAGACAAAGCATCAATGTTAGCAACCTCATCCACCGGG AA
AATCTCACACTTGCTGAGGTTGATTTTGAGTCCTGATACTATCTGAAATCACTGCAG TAG
CTGCTTCAGGCAGGTCAACTGATCCATATCGGCATCACAGAAAACTAGGGTGTCATC C
GCAAAAAGCAAATGAGAGACCCTTCGGGCACTGAGCACCTCGATCGGAGCTGAGAAA C
CTCTCAAGAAGCCTCCACTCGCTGCACGATCCATCATTTTACTCAGAGCATCCATCA CT
AAAATGAATAGCATGGGGATAAGGGGTCACCTTTCCTAAGCCCCCTGGAGCTGCCAA A
GAAACCACACGAGCTACCATTAACCAGGACAGAGAATCTGACTGATGAAATGCAAAACT
TGATCCATCCCCTCCATCTTTCCCCAAACCCCATCCGTTTCATAATGAAGTCCAGGA AC
TCCCAATTGACATGATCAAAGCCTTCTCAAGGTCCAACTTGCACAGTAATCCGGATT CT
CTATTTTTCCTTTTGGAGTCTACAAGTTCATTTGCCACCAGAGCAGCATCCAGGATC TGC
CTACCTTCCACAAACGCATTCTGGGAGGACGAAACAGACACGTCAAGAACCTTCTTT AG
TCTGTTAGAGAGCACTTTAGAAATAATCTTGTAAATGCTCCCCACTAGACTGATAGG CCT
ATAGTCTCTGATACAAGATGCACCTTCTTTCTTAGGCACAATGGTAATAAAAGAAGC ATT
GATGCTTCTCTCGAAAGCACCATTCACGTGGAAGTATTCGATGGCTTCCATCAACTN NN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGCATC
GTGCACCTCCTCCTCCTCGAAAGCTCTTTCTAGCCACTCATTGTCTCCCTCCCCTAT GT
GGTTGAACTCTATTCCCCCCAAAGTAGGCCTCCAACTAACTTCCTCTTTATATAAGT TCT
CATAAAACCCCACTATCGTCCCGTTTACCTCCTCCTCACCCTCGATCCTCACCCCATCT
ACAACTAAGGACTCTATGAAGTTTCTCCTTCGATTTGCAATCGCAACCCTGTGAAAAAAC
TTAGTATTCGAATCCCCCTCCTTTAACCAGAGCGCCCTTGATTTTTGTCGCCAACTA GTC
TCCTGAGCTATTGCTAGCTCGACTATTTCACTCTTAACCTCCCTCAATCTCACTTTC TCA
GATTCATCTAGCTCTCTCGCCCCTTCCCCGCTCTCTAATTCCCCCAATTCATGCATA AG
CTCCCTCATTTTGACCTTCACCCTCCCGAAGACCTCCTTGTTCCACCTAATAATATC TCC
CTTGAGCAATTTGAGTTTCTTCAACAGACGGAATGAGGGTGACCCTGATACCCCGTA GT
TGGTCCACCATTCTTTCACTTTGTCACAAAATCCTGGCACTTTCAACCACATGTTCT CGA
ACCGGAAGGGAGCACGGACCCCCCTCCCTCTGCATCCATCTAGCAAGATAGGCAAGT G
ATCCGAAGTCAGCCTAGGTAAAGGGATCTGCAACACATNTTGCAACACATTCGGTAC CA
GCTCATCCCAGGAGGACGATATCAAAAATCTATCGAGTCTGGACCTTGAGTTTGAAT CC
TCCGCCCTGGTCCAAGTAAACCTTTCCCCTGACAGAGGAAGATCAATGAAAAAGTGA TC
ATTGATGAAGTCAGAAAACTCCTCCATGGCCGTCGAAATCAAATTGCACCCTAATCT CT
CCTCCGGGAATCTAATTGTGTTAAAGTCTCCCCCTAGAACCCATGGAATGTCCCATT CC
CCCATAATATTAGCAAGTTCCTCCCAGAAAGACACTTTGGACTCTTCCCCTACCGGC CC
ATACACTCCCCCAAATCTCCACTCCACCCCACTCACTCTATCTTTGACGAGAGCTGC CA
AAGAGAAAGCTCCCTTCCTAATCTCCTTTACCTCCAAACTTCTATCATCCCACATAA GTA
GAATACCCCTGGCACTTCCTGCTGCAGGGACCCAGTCATATTTCACCCAACAACCCC C
CAGACACTTCTCACGATCGCATCCGACAAAACTTCCATTTTGGTCTCTTGAAAACAA ACT
AGGTTGGCACCCCACTCCTTAACTCCCACTTTGATGATGGCCCTTTTGTTTGGGTCA TT
CATCCCCCTGACATTCCATGAAAGGATCTTAACTTCCATAAGCAACAATAGCCCCCT TTT
CCCCACCCCCCTAGACGAGATCCCTTTCTCCATGTCACACACTCGACCCCTTCTCTG AT
ACACCACTACATCCCCTATTTTATTTTGATTAACACCTTTACCCAACTCCCTCCCCG CGC
CTTCGTAACAAGGCTGTTGCTCAATCTCCCTCAACAGTTCTAGCACCCTATCTTCCT TCC
CTTCAAAAGAAACACCTAAGAACTTCCCAAAGTTTAATAGTTTATCCTCCATCCATA AGG
ACATGTCCACTTCTCCAATCCGTGACGAAATTGGACATGCGTCTTCTGTAATTACCT TTT
CCTTGCCCCTACCGGAGGAATGCAAGACCATAGCATTGCTAGTAGTAGGAACTCTGG G
TGTCGGGTGGATCACGTCGTTTCGTTGAGAGGCACCAACCTCAGGAATTAATGCCTC T
CTTGGAGGAGATTGTATTGTAACCTCAGCACATCTAAGCACAGTGGGAGATGAGGGGG
CGATGGAGCTGGGTCCATCAGAGGCGGGGGGCCGTGGAATAACACCGCTAGCTTTTG
CCGGTGCTGCCCCTGGAACAGTGGCTGAAGAGGAGCACAAGTCTCTAGTGTTTGGTA T
AGCACTGTTTGAGGCATCATTGGCAGAAGGAGGAGGAGTGACTTGCTCCCATATGTT TA
GGCGCCTTATCGGACGTCGCTTGATATAATTTGGGCCCTTTTGGATCGATTCTAATA GA
ATTGGGAAAAGTTGCTGGGCCCATGTGTAGAGGCCCAGACCTATGCTTGCTGAAATC TT
GCCCGACCCTATGATAGTTAAGAGAGGACGCATCACTATCCCATTCACGTCTTTTCC TC
CTGTTAAAATTTTGTCTTCTTCTCCCAAGTTCAAACCTCCCGTCACTTCTCGATCTA CCG
TCTGATTCCAGCTTAGTCCACTGGTCCGGTGAGCCCTCCCCTCCTCTCAGAAACTGG TT
TCCCCTTCCCTCCCTTCTATTTGACCTTTCCTTTTCTTCCGCCAAAATCGCAGGCAT AAA
GACAGGGCGAAATTCAGGGCTCAACCAAACCCTGTAGCTCAAGTAGCCATCTTCCAC A
ACAGTGGAGACAGGGATTCGGCCCCCTTTCCTCACCACAATCCTTACTCGCGAGAGA T
CGTGCAAGCTGCAAGTGACATCAATGAAACCCCCACAGATTTCCCCAACCTTCTTGA AT
AGGTCAAGACACCAGAGATGCACCGGGAGACCCACCATGTTGACCACCAATGTTTCG G
CAGGGAAACGAGGGTCGAGACACCCATCGTGCTCCACCCATCTGTCTAACTTTAAGA A
ATTCCCATCAAACCACCTGTTTCCTCTTACGAGAATTTCTGAAGCTTGAGACTCGGT GA
CAAATCGAAAGAGATATTGCGTGTCACCCAGTTTCGATATTTTCAGCCCGTCCTTGA CG
TTCCATGCCTCTGCTGTCCAGTTCCTAACAGTTTCTGGAGAACCAACCCATTTGGAG AA
AGACCCAATCAAACAAGAGTTCAAGAAACCTTTACGCCTGAGAGTGTGCTCCTCGGA CA
GCGAGAAGTGTGGCTCCTGCCCTACGTCAAGCTTTCGCTGGCATCTTCCCCCAAGTT C
GCGGGCTGGCCAAGATGCAGCTTTGCTAAAAGACATGCTTTCTAATTTTCTAGATTCTA
GGGAGAAGTAGGAGCTTCTCTCTGTAGAGTCCGTCATAGCTAAACCAAGAGCCTCGC A
AGGAGACTGACTGAAGAAGGGAAAGTGAGAGGGAGTGAAATCTGGCCGGAAAAAGAC
ACCTCCGTCGCCGAAAAATAGCAAAAGTGGTCGGAAGTTGGAGGTTGATGGATGGGTG
AGGTCGGAATAGCCTCGCGAAGAGGCTGTCTGAAGGAAAAAAGTTGGGATCTGGCCTG
AAAAGGCGTCACGTGCCGGCGCGTGGGGTGTCAGATCCCGGCGATTCTTTCTGGGGT
AGTGTCGCGACAGGGTTGGCTCTCGGATGGTGGTGGTGGAATCCTAGGATTCATGGT T
CATGGTGGTGGTGAAGGGGTGGGATGGGGTGTCTGTGTGCTACAGTGGTTTTTTTCC G
CTTCATGCACATTGAGTTCTGGTGACTGATCTCGTTTTGAACTGTTCACTTTCTAGG TGA
CGCTTTACTATTACGACTATTGTTCACTATTCCGTGTTTTCCAGAAAAAGCAAACTC AAG
GTGTTCTTCACTTCTTCTTACCTTGAGTTCAAGGACACATAATACAATGGTAATGTT AGT
TAGTAGAATGTTAGTGTTAAGTTAGTCCAACATCAAAGATTATTGTGCTTTTCACAC CTG
AAAATTAAGTGTCATGTATTATGTAATAAGTATTCATCAACTTATCTTTTCATCTGC TTCTT
CTATTGTCTCACAATTTTCAGATTAAAATGGAGTATTTTTTTTACAAAGAATCCCAT TAGC
AAGAAGATTGTGATTGTGTCTTGGGCACATTGTCAGTTGAACCATCCGTATTGGTTG AG
ATTGTTTCTTTGAAATCTCCTGATATTAGAAACCCGTCGATCCTTGAGTACTTCCTG CCT
TGTTTCTCTCTGGACCATATGTAACTCCTACCTTGCAGAGGCAAATTGATCAACTCT ATG
TCAAGACTGAAGCTTCAGAAAGTTCCTAATGATTTTGAATGATTTCTCACCTGCAAG AAG
CTGGTGATTCACATTGTTCAGCAAAAGATTTTTTTTTTGTTTGCTTGTCAAATTAACATTG
TAATTAGAATACTCTCCTTCCCATATGCTCTTCTCAAAGTCAGCATTCAACCAGTTA TGC
CTCTTACTTCTGGCCGTACAGAAGGCACCTTGATGGATGTTAACTTTTAACACTTCA TCC
TATATTTGGAGGCCTAGAGAGGGTTACTTCATGCTTTAGTTTCAATTTAATCAACAGCTT
CTAGGTTATTTAAGAAGTAAAGCATTGGAAAATACATGCGCGGTTTAAGTAATAGGA TTG
CCCTAGTATCTCATTTTTCTTTGAAAAGAAGTTTGATTCATATGTGTATTTCAGTTT TTGC
TAGTTGTTGCAGGATCAGTGTAAAAATGGTATTTATCAGCCTTCCCTAAAAAAAGAG AGA
TACTGAACGAATGCAAAGAAAACACTGTTTCTTGGAGGCTTCTTATTTGCTCATAATTCA
TCTGTTTGTAAGATTTATGGGTGCGACCCATTTCTTGCCTTCTTGGTTGGTTATAGT TTT
ATCTTTGGATAAGATTTACCTGTAAGACTTGCAATTAAGGGCTTTATGTCATTTATC ATG
CTAAAAATAACATAAAGTCTGTGTGGGCTGTACTTTTTTAATTCTATATAATTTATA ATGA
TGCATGTTGTTTTCTTGGTGCAGGTTCTAGCACTGCCACTTCCACACAGACCGTTATTTC
CAGGGTTTTATATGCATATCTATGTGAAGGTATGGTACCTTGTCTAGAATGGGTTAA ACT
ATGTCTTAGTTAGTCTTCGTGATTTTTGAAATAGAATAGATATAAGCTTTAAACATT AAGA
GTAAAATTACTGTCAACTTTCTGAGAGTTGATACTGTTTCAGCTAGTGTTTTGGAGC CTA
TTTCCTCTTAAGTCTATAAAACCAGCATTCAGCTTTAACAATGCTACTTTAATTGAT GGAA
AGTTCTTGGCTGAAGTTGTAAGCTTGAAAAGCTAAAACCTTCAATTAGGAGAGCGTC TG
TTGCTCAATTGTAATATTCTTTAGCCTCATTTGGTAGTTTCCAGAAAAATATAAATT CTGC
TGTAATGGCATGCAACAAATTGATCTGAGGCTCCACATAATCCATAAATTTCACTGC AG
GATCCCAAGGTATTAGCAGCCTTGCTGGAAAGTCGAAAAAGGCAAGCACCTTATGCT G
GCGCTTTCCTTATGAAAGATGAGCAAGGGACTGATCCTAATGTTGTGTCTGCCTCAG AT
ACAGAAAAAAACATCTATGAGCTTAAAGGAAAAGACATGTTGAACCGTCTTCATGAA GTT
GGTACACTTGCTCAGGTACATCTTGTTGTTCCTTGTGTTATTCTGTTGCTTTAACTT TATT
GAAGAAGTTTCTGATCAAGGTTGATGCAACTTATGATGAGCCAGATAACAAGTATTA AA
GACGACCAGGTTATTCTTATTGGTCACAGGCGGATACGTATGGCAGAGGTGGTAAGT A
GTTGTCTGCTTTATTTTTTGGTTATAAGCAGCTCCATATGTTTTCTTTTTGTCCTAT TTAG
TTCTATTTTTCATCTACATATCCATCATCTCCTGATTTTGAGACTGCCCAATTGACA TCTT
GGGTTCATTGCACTTGTATATTTTTCTCTCTTTCCTACCTTTTGTGTCCCAACACCG AGT
CCTTCCCCTTTGGCTTCATATAACTGAGCAGCTGCATTGTCAAGCGATAGAGGTCTA TT
GCAATTTGAGGCTAAATGAGCTGTTTTCCCCAGTTGGTTATGACGTGTTATCAATAG AAT
GAAACCCTCTTTTAGTTATTCTTGATTGCTTGTTTGCAAGAGAAAGTAGCACTGATT TTG
TAATAATTTTTGGACAACTGGTTTCCTGCAAAGTCATAGGAATGTCTTTCTTATTGC ACTT
CGGGGTTGTAGAAGAGAAAAGCAATTGAGTTTCTTCTATGATGTGCTAGTTTTGCATCT
GTTTGATTGCTGATGCATTTGGTTTGAATGAAATCCTAGACTTTTGATAACCTAGAA ATC
TGAGGGTAAGTGGCTCACGGTTTGAAACTCAGTGGATAATGGGCTCACCATTCTACC AT
TGTCCACCTAAATACTAGGCTTTTGTTTGCGCCACGGCACGAACTGTGATGTGCACC CA
ACCTACATATCACGGGCTGTACTCTTACCACTAGACAAAAGCCCCGGAGGCTTAAACAT
ATATTAAAACACATAAAAGTTTGAATAGAATCCTATATATTTGGGCTGTAATTTGGTATAG
AGTCGTATTCTTGGCTTTAGCTGTAAAGCTGGAGTACTCATGCATGGTCAGCCATTA AT
GGGCTTACTTAGCCACTGGAGGGTCTTATGTCTTCTTATTTTTTCGTTTTATTTTGT GATT
GAGGACCTCTTATCCTCTTCTATTGGACTTAAATCTCTCATCCCCTGAGCTCCTCAG TAA
TAATTCTCCTATTATACTATACTTACTGTTGGATGTTTACCTCTCTACCTTCACATG GTAG
GGTAAGGTCTGCGTACACAGTACCTTTCTTGGACCCCACCTGCACGATTACACTGGG TT
TGTTTGTTGTTGGATGTTTGCAGTCAGATTTCCTTATGTAGTCATTTCTTTTGTTTC AATA
AACAATTTCTTCAATATACTAGTGTAACAATTTGATGACTATGCAGTAGATTTGTCA CACT
TGAATGTTCACTATCAGATTTCCTTACTTTATTCACCATGTTGTTCTTTTTCCGTCT CAAT
AACCCATTTTGTGAAATGTACAAGAGTAACACCTTGAGGACTGAAACAGTGACATTC ATT
TGCAGGTCAGTGAGGAACCCCTTACAGTGAAAGTTGATCATCTCAAGGCATGCCTCA GT
TTCTTGTGCATTCTTTCTACCCAGCAATGATAAACTCATAAAGCTTGATGTTCAGTT TTGA
AATTGTTTCTAAAATTTGCTTTCTTAATTATGTTGCAGGAACAGCCGTACAACAAGG ATG
ACGATGTTATAAAGGCGACATCTTTTGAAGTTCTATCAACCCTAAGGGATGTTTTGA AGA
CAAGTTCTCTCTGGAAGGATCACGTTCAAACTTATATCCAGGTGTTAGTCATTTCTT TCT
AAATGTTAAGTCCTAATTGTTTGATTTGGTGATAACTCCAAAAAAAAAAAATTTACT CTCT
CAAAATGCATTGCCTTTTGATATTCTAGCCCACACTATTGTGTGAAGCTCCACAATG CCG
GTGGATTGTGTTAAACTTAGATCATGCCCTAAGTTAGGCATTTCATGCCATTCTTTT TAA
TGGAAAATTATGTTCGTGTTGGTGGTAATCAAATCCATGACTAGATTCTCTGCCTTA TAG
TTGATGCCCTGCTGAAAGAGGAGACTAATTAAAGATGTCAAGACTGCTAGGAAATGGAA
GGTCATAAATGTTCTCTGAACCAACTGTATACATATTGTTATACATATTTCTAAAGA GAC
GCTAGGAAGTTTATACAACTGAACAACTGTATAAGATGTATTTGGAGCTAGGAAGTT TAT
ACAATACCTAATTCTCTCTCTGCTGAATTTCTAAAAAGATGCTACTGATTGTTATGA ATAG
TGTCAAAATCAAAAGATAATATTGTGGCATGTTTTGCTGAAGAAGTGGGAGTGGAGGGA
TAAAGGGATAGGGACTTACTGAATACTTCTAAGGGAAGCATTTGTGTCCCTCCAAGT TA
TTGGTACACTAGTAATTGGAGTAAGAGACATTACTCCTTTCGACCAATTTCAGAGGGGT
TATCAAGTCTTCAAAGCGTGGCGATATTTGTGGGGGTTGGCTGGAAATTGAAAAAAA GA
CAACTCTAAGAAATCATCCTAGATGGGCTTCGATCAAGGAGAGGAGGCAGATGGAGAA
GATTTCGAAAACCCTAGAAGTGGAGAAGAAGGGTTTTCTTTATAAATTACATGTTTG GAG
TGTAGCGTTGGTAACAATTATCTCCAATCGACGAAATGGAGAAGAAGATGGACCTGC TA
GGTACTATGATGAAGTTTGAGATAGGCCTGCTACCGTAGAAAAGATCGGTCATGTGGG
GGTTGGGAACAATATTTCAAATTCTAAAAAAGTGTGGACAGATGTGGGAGTGGCACGTG
GAAAGGTCTTAAGGTAGAATGGGTCAGTAGTAAAGGAAAGGGCTCCAGAATTGGGCC A
CTTATGTCATGATGTTTGGAAGCTTAACATAACTTTTGGGTCATAAAAAATAATAGA TATA
GGCTCAGATGAGTCAACCTTTATTTTCAATAAGGGGCTTGGTCTTTTAGCCATGAAC CTT
TAGGGTCTCAGTTTTGATACTCTTTTAGTGGAGATTAATGTCATTAATTTGTCTGAA ACAT
GATACATTTTAGACTCCTTTCACAACACCATGATCGGCAAACCGATGAAAGAGCAGGAA
AGCCCTAGTCTGCCACAAATTTCACAGAACTCACATGCAGCTGATTACACAAAAATT GT
CATTCAAGATTAAGAAAACTCTTTCTTCGTACAATCGAAAGAGGCTGACCAAAATCA GAT
AGTTTTTTGAATTTGAAAGAACTAAACCATGGCTTATCGAAGAGGCTACTCAAGCTT TCT
TATGGCTGCATAATAATGTTACAAAGATGAGCAAACAATATGGATTGAATTTTTGTG AAT
GTGAGATTGAGGGTCTAGCTCTGTTCACGAAGCTTGACAGGAGAAGGCATAAGAGAAA
TGAAGCTACCATGTCCAGATTCACAATTCCAAAAGTGATAGGTATAAAGGAGCTCCAAA
AACTGTTTTTTAATGTGAATTATGGGGAGCTCGGATCAATGATGGGAAGGGGGATCA CA
AACACTAGGTACCCATGAAGCTGAATATTCTCACTTGGAATATTAGGGGGTTGAATGAC
AGGGAAAAGATAAAGGTGATAAAAAGTTAATCCATAAATTGAAGGCAAATATTTATT GCT
TTCAAAGACGAACTTAGAAGGGGGTGTGGAAATACTAGTTAAACAAATGTGGTCAGA CC
CATCTCAAGTGTTGTTTGGAGTCCAACGACAGGAAAGGGAAAATATTGGTGATGTGGGA
AAAGAATGTTTGGACAAGAGAAACTATCAACAAGGGATGTATACTATCACTCGCAAAATT
TCCTCATTATCACAGAATTTCTCCTGGCACCTCACAGGGGTGTATGAATCACACTGC AA
GTTGGAGAAACAAGAACGCTGGTGGGAGATAGTAGCATCTAAGGCATTTGTGCAGGGC
CTTGGGTGGTGTATAGAGATTTTAACACTAATAGATTCATAGCAAAAAGAAAGAACAACA
ATAAACTCACTAGGGCTATGATGGACTTCTCTAATTTTATAGATCATTAGAAGCTTG TAG
ATTCTAATCTTAATGGGGCTCCTTTTACTTGGACAAAGGGTAATAATCAGGAAAACT CTT
CAAGATTGGATAGATTTTTTTCCCGGCTAAATGGGCTGAGGAATTAAAGAACAAAAGGC
AAGCAGTACTCCCCAGTGTATTTTCTGATTGTACTCCCGTTTCTTTTCAATGCGGAG ATG
GGGAAGTTTAAAATCTTACTTCAAGTTTGAAAGCTGGTGGTTGGGTGTTGAGAGATTCA
ATGAAATGGTGAAAAGCTGTGGAACTCTTTTGAAGTACAGGGTAGACTAGACTCCATTC
TTTCAAGCTAACTGAAGTTGTTGAATACAAAGCAGTGGGATTGGGAATCTTCAGACTAT
GACAGTGTTGTGAAGAGCCTGAAGCGAGGAAAAGCGATAAGCCCCTTTTCGCTTAAA G
CGAGAAGCGAGAAGCGAAGCGCTCGCTTTTTTGAAGTGAAGCGGTTTAAAAAGATAT TA
AAATAAATAATGCATAGACAACACATGTAACTGTAAGCAAATGTTCAATACTTCAAT GTA
AAAACTAAAGAGTAGCATCAATTAAAGCACAAAATGAGCATCATATTCTTCTTCAAG ATT
GTCAAATTCTTGTATTCCACTATCATTATTATATTGCTCGTCATCTTCTTCAACTTC TTCTT
CTACATCAACTAGAAATGAAGTAACTGCTTCTTTTCCCTTCCTCTGTGAGCTTGAAA TTG
AGGTACTCCCCCTCAAACCATAAATTTTCTCCCCAATTCCACGCGCCTCCGCAACAT CA
CCCCAAGTGAAATCAGAAGTTTCCTCAAATACTTCTTCATTTGCATGATCTTCCGGG ACT
CCAATTAGCCATTCATTAGCATCATCGATGTTGTCCAAACTAATTGGATCAATTACA TTG
CGAGCATTGTAACGACGCCTCATTGTTCTATTGTACTTAATGAAGACTAGATCATTG AGA
CGCTTCAAGGTTAGTTTGTTCCTCTTTTTGGTATGAATTTGCAAATAATAAGTAATT AGTA
AGATTGCATGCGCATAACTGTCTGTCATCATTCAACATTCTAACTTCTTGAAGTTGC AGC
ATATTG GTG ATTTCAATTATGCAAG GTTAG CAGATTTTG GAG CAG CAATATCTGG AGCC
AACAAGCTACAATGCCAGCAAGTGCTTGAAGAGCTAGATGTAAGTCCGTGGTTCAAG AA
GTTAGATATTCCCTCTGTTTCAATTTAGATGACACACTTTCCTTCTTAATCCGTTCC AAAA
AGAATGACACATTTCTACAATTGAAAATAATTCAACTTTAAACTTTTCATTTTACCC ATTTA
CCCTTAGTGAGAAGTTTTTATAACCACACAAATGTTATGCCCCCACAAAGCTTTTAC CCC
TTAAGCTTTTAAGTCCACAAGTTTCAGAAGTCTTTTTTCCTCTTAAACTTCCTGCCA AGTC
AAACTACCTCATCTAAATTTAAACGGAGGGAGTACATAATATTTTCTGTATGTGTGC CTT
TTTCAATCCAAACCTCAAAGAGAGAGGTTTACCAGTATTGTGAGTTCTAACCACTCT GGA
AGAGAGCGAGGTTTTATTACTAGGAGGTAGATCTAAGAATTTTTCCTTGCCATTTGC TTC
CTATTGTAACAAAAAAAAAATATATGTTTGGTCAGTGCTTCCCTGCTCTGAAAAGGA AAA
TG GTTG AAAGTTAAAAAAAAG GAG G ATCAAAG CGTATTAGCG GTAAAGCTTC ATTTCTT
GCTGATGGGTAAAATGGCTTCTCATTGGTACTCTTGCTCGAAAAATTAGGAATCTAA AAA
AGTTATCAGTTTGTTCACCACTTCATAAAAAGAAATTGTCAGTTTGTTCAAGTGGAG CTA
CTTATTTTG GG AAAATTAG AAAG G GAAATATTCAGTTTAG AACAG CTAG CAATATTTTGT
ATTTCCTCGTAAGAATCATAAACTTGCATTACAATGGTTATTTCATGATTTTCATCG AACT
GAGCCTTTTGTCTGCTCAGGGATCAGAATATAGTGTGTGCATGAGATAAAATGAATG TT
TCCCGCTTGTTTCATAGCTAACACCATAATACCTGCATCAGGTGCATAAGCGGCTAC AG
CTTACCCTGGAGCTAGTGAAGAAAGAAATGGAGATTAGTAAGATTCAGGTAAATGCA CA
TCAAGACGCATACCTGAACTTTAAATAGGTGCTATGCATGCATTTAGCATTTTACGT CTT
TTCTGTTGTCTGCAGGAATCAATAGCAAGAGCAATTGAAGAAAAAATAAGTGGAGAG CA
ACGCCGTTATTTGTTGAATGAACAATTAAAGGCCATAAAGAAGGCATGTGGATTATG TG
CAGCTTTTTTTGTGTTATCATCCTTAAACTTAGTACTTACATATGTTTATTCCTAGA TAATA
TATATGTTAAGCTTCTGATTCTTATGTTTGATTCACAGTTTAAACAAAGAGTTGATT TGAA
AAAAAAGTG CTTCCTTATGACGTG ATTTTG ATTG G CTTAATG CAG G AACTAGGTTTG GA
GACTGATGACAAGACAGCTCTTTCTGGTTCGTTGTCTCTTAATTACTACTGAAATGA ATA
ATGTTCTTTTTGGATTTATTACGGCCAGACGTGTTTTCCCATCTGGCCAATGAAACA TCT
TATGTTGGCCCTGGAAAAATTCCATGTAGCAGTAGCATAGAAAAGGCCATTTGCAAT AC
GTTTTGCCCTTTCTGTTGTAACAGTTTTATGGTGGCTGATTGATCTCCTGTTTACCA TTG
AAAATCTTC ACTAGAAGCATG AATAG CATTCAG G CG ATAATTG G CTTTACAG ATAG G AAA
TTGAAGGTTGAATTTTCTTTGTACACGGTCCAACCAACTTTATTGACATGCTTTCTA AAG
CTATTTGAGACATACCAAACTTTTGACAGATAAGAATACATTGTACCTATAGGGGGCTCT
TTGTGTTTTCACTAAAGCACTTAATAAGACAATACTGGAATCCTTCAACTTCAATTTCCGA
AAGGTTACTGTCTACCATATTTGGTAATGTGGCACTAAATCAACATGGATTTACTGG AAG
GCCACATTGTTAGCACTGCATTGTTGTTTATTCAAGTTATGGACTGCTCATGATTCA GAT
TTATCTTCCTCCTCCAATGTTTCTTTTTCTCTTGAACTCTGTCCTGTTTCCCTTTTC AATG
CACTTCCTTATTAGAAGCCTACTTGCTTTTTCTGTCATGTACTCTTTCTTCTCCCCT TTGC
TCCTGTTG AG G AGTGTAATGCTTATG AAG GTG CTTCTTCCAG GTTTTG GTGATAATAGA
CATTACTGCAAACCCTAAAACGCGGTTCTCATTGTTTTTTCTCTATAATTTTCACAA AAAT
AGAGTTAAGCTGCGTGTTGTAACATTGTTTTGGTAGACCTGGTTTTGGTGTTGAAAT TGA
G AAG AAACTG CCTAATTATTAGGTTTG CTTTAAG G GCTTCG ATTTTTCTG GTCAATACTA
ACGAAAACCTGCAGTTTTTTGTTCCCTTTTTTATTACATCTAATGGTGTGACACTAT ATTT
GTTTATTACTG CAG CAAAGTTC AG G GAAAG ATTG G AG CCTAATAAAG AAAAAATACCAG
TACATGTTATGCAAGTTATTGAAGAAGAACTGACAAAACTGCAACTGTTGGAAGCTA GTT
CCAGTGAATTTAACGTAACACGTAATTATCTTGATTGGTTGACTGCCTTGCCATGGG GT
AATTACAGGTTTGTTGTCTATCGATTCTGCCTTACATTGTCTTGGGTTCAACCCAAC TGA
TGTTATCCTTATCCTTGGCTAGCTGTACTAGAGGAATCTGTTTGAGAAGCTGGCTAA AC
AGTCCAGCGAGAAATAAAAATGTTATTCTCTGAAATTTGCTGCTTCCAAGTTAACCT TAC
TGCCTAGTGATGTGACTTGCCTAAATATCTATCGAGTAATATCCATTTGTCTTTAAC TTTT
CTTTCTCCTCCAGTTC CTTATTTTG G GTTCTTACATGTCATG CTTCTG G CTTTG AG G ATG
CTTACTTGACATCCCAATGTATGAGTTTAGACCAGGATCTCATGAGAGCAGCAAAAC TA
GGATTGTACTTATGATGAGCTCCTTAAGATGGGGGCTTGATTTGCCGTAGTTCGTGT TG
TTTGCTGCTGATGGTGGTGGTGTTGGCTTTATAGTTTTGTTCTCTGCCATGGGTGTA TT
GCATTGGTTCCTGAAGTTTTCTTTTTATGATCCAAATGCAGTGATGAAAACTTTGAT GTA
CTACGGGCAGAACAAATTCTTGATGAAGACCACTATGGGTTAACCGATGTTAAGGAA AG
GATCTTGGAATTTATAGCTGTGGGAAAACTCAGAGGAACCTCGCAAGGTTGGTAAAT GC
CTTTTTTTAAAAATAATAACCCTCATTTTTATTAAAAAAAATCCTATTTTATAAGGT TCAGC
CATAATCATATTAAAAGAACGGAAAATGATCCAGCCATCTCCTTACTGTCCATTGTC ATA
ACATTATAATGGACCAATGGAAAATATATCCATAGAACATGAGATTTATGGTTCCCA AAT
ACTTTATTGACATCAAATTGAAACGAGTAAACGGAAAGAAGTGAACATTTTAGGGAA TTT
GAGAAATATTTATTGGTCAAACTTAGGTAATACTTTTTGTGTCAGTCTAGAGTTCCT CCA
ATGTTTTCTTGTGATTATCTGTGGAGTAAAGAAATATATCTTGAGCTTAATTTCTTC CCTT
GAAAAGCAACTAATGTGAATTAAACTGCTGCACCTTGGGCCATAGTTTGTTGGTGTT CTT
CTTACATTCTGATTTTGTGCTGTCCATGATTGGGCACTCGCTGTGTGGTATTCGATT GAT
AACTTACTTTCACCACCAGTTGTACTTGTATATCTTTGGGACATTGAACTTGAGATG TAG
TTGTTTGTTGAGGATAATCTTTGGAAACTATGAAGTGTTGAGAAAAAAACAGGTTGA ATG
AAAGTTAACAATATAATCCAAAGACAAAGGTTAACTCTAGAAAAATGTGAATTGCAT CAT
AGCTAGGACAATATTAGGTTCAAATGATAATATTAATCCCAATTATACACTGGCACT GTC
TTTCATATGTGCGGCTTCACCTGATTTATCTCAGTTTAATTTTGAATCTGAGTCAGG AAC
TAGAAACAGACTCATTGCTTATTTTTGTTTGAACAGGGAAAATCATATGCCTCTCTG GCC
CTCCTGGGGTGGGCAAAACCAGTATAGGTCGTTCAATTGCACGTGCATTGAACCGCA A
ATTTTACCGATTTTCTGTTGGAGGGCTGTCTGATGTTGCTGAAATAAAGGTAATGGG AAT
ATCTGG CCAG CTAAAACAG AGTTGTTTTGTG G CG CACAG AATCTTG AACTTTCATGACT
AACTTTGGGATACACTTCAAGGGACATCGACGAACTTATATCGGTGCCATGCCGGGG A
AGATGGTGCAATGTTTAAAAAGTGTGGGAACCGCTAATCCTCTTGTTTTGATAGACG AA
ATTGACAAGGTATTTTATGGTTTGTGAGTTCATGCTTCAATTGTATGGCTTTGACTA TGA
GAGGAAGTCTAACTTCTTTTTCACCATTTAATCTCGTTTTTCTGTATATGACCACTG GAA
GAATCTTGAGCCTGAACATATTATGTTTTTGCTTGGATTTCCTCTGCAATCTAAATG TTTG
AAGAAAGTGTTTATCGATCAGTTTAATAATAGCCTTGTATTTTTTCTTCTATGGCAG TTGG
GAAGAGGACATGCTGGTGATCCAGCAAGTGCTATGTTGGAGCTTCTTGATCCAGAAC A
GAATGCAAATTTCTTGGATCATTATCTTGATGTTCCTATTGACTTATCAAAGGTAGT TGTT
TTCTGGAGCACTTATCAAATTATTGTGGCTGTTGATTGGCCCTTATGAAATGCCCTC TAA
CATCATTTGATGAATGGGGACTAATGTGATATTAAAAATCTTGCAAATATCTACTATCATT
TTGCTTTTTCACCTTTTGATTTCCCCCCCTTTTTCTTTACTGATTGATGTTCTCTTTCGTC
TCTTACCTTAGTTAAGTTTGGAAAAGGTTTCGTGACTGAGCCTGTTTCTTTTATTGT CCA
TAATGGAAGGTTATCTGGAAGTATTTTATTTCACGTCTGTGTTACCTTTGTCTCTGT CATT
ATGACTGTAATATGATTAGTGTAAGTAGAATGTTGTTCTCTTGATACTTGAGAAAAA ACT
AGGTTTCATTGGCATTGGTTTTGGTGATCATTCAATAGGAAAAAGGTTGATGCATTG AGT
TTTCCTACTGTCTGTTCATATTATCTTTGATGGCTTTACTGATGAGGGATATGGTTT TATA
CCTCTTGGAGTTACCGATAATTGGAGCGATAGACTTTGAGTGTTGAGTGTTAGTTTA CC
TATTGATTAAAGAATTTGGCCCATAATTCAAATATGACAGTTATCGATACTATTTCA TTTT
ATTATATCAAACTTCAGCAAATCAACATGGCTACAGAGAGAAGGTTGATGGGTATAT TC
GCATACAATTTCATTTATTTTGTCCTTTAATGTGATAATTTACTGTGTGTTCCTATG TCTTT
TCATAATGGTAAATATTTGTGTTATATTTCCAAAGGGCTTTGTCAAAAGAAGATAGC ATC
TTTCGATTATTTTGGTAGTATTTTGGGTCTGACTTGGTTATGGGGAGGGAGGCTATA TCA
AAGCCCTGTTATGGATGTTGAGTTATATATGAACCTGAAGAAATTATAAAATCCAAC CTC
TAGGTTTTAATGTACTTCTAAAATTTGAAAAAATCGAGCTTTGGCACCTTCAGTGTT TTCT
ATCCTCTTTGCTTAATACAGTTACCTATTCCACTGTAGTTTATACGAGAATGACATC TGC
ACATTTATCAATTTTTTGAGTGCATAATTTGAGGTCGATCATTTTCCTTTTTGCGGC TGCA
GGTTTTGTTTGTCTGCACAGCCAATGTTGTAGAAATGATACCTAATCCTCTTTTGGA TAG
AATG G AAGTAATTTCAATTG CTG GTTACATTACG GATG AG AAAATG CACATAG CCAGG G
ATTATTTGGAGAAAGCTACTCGTGAAACATGTGGGATCAAGCCTGAGCAGGTATGTC TT
ATAGAAACATCTCTAGTTGCATCTCTTTCATTATCTCCTGTGCATATTCATTATCGG GTG
AATAGTTTTGTATTTTTTCCGCTCCCAATTTTTGACATTCAGAAGCAGACATTGGTC TGT
CGGGAAAATGTTCGAAACATGATTGGGCAAGAATGTAACTTCTTAAAGTAAAATGTG AA
AAGCTTTATTCAGTTAGAAGTTTATGGAATAAAACATATGATTGGAACATTCAGGGG ATC
AAG CTCTTTACACC CTTCCATATAGAACATG GCTAAAG G AAAAAATCG CAGTAAATCTCA
TTATAGGTTTTTTGCCATATGCTTTAATATTTCCTAGAAAATATACTGTTTGATGTA TTGAT
ATCATTTTCATCTTAAGCCTCTACATTTAGAATATACTTGCCTCCAAAGCTAATGGA TTAT
AAAGTTTAATACAAACTCTTCCATCACTATCCTTTTCAAGAAGTATTTAAGTGGTTC ACAT
AAACAACTGAACGAGTCCACATTGCTGTATATATCCTCTCAAGTGCTGCTTTCTCTT TGA
GGAATGTTTGCTTGACCGAGCCAAACTGTGACTGCCTATGCTAGCCAATGTGCTCGT G
CATCATTAACACAACCACTGTGCCACACTTTGCTGGGAGTATAAACGCAATTGAAGG TG
GTTCTGTCGTGGTAGTCGCCAATCTCTATTAGTAAATGGAGGTATAATGAGTTTCTT AGT
GGTAGGGATACATGTGCCTGGTGGGTGGGGTTGCTGGATGGTATTGTCTAACAGATT T
GAGCGGGGCGTGAACTAAAGCAGGATGGATGTTTAAAATATGCCAAGTGTTTTGGTG TT
TTTGCTGTTGACTTCAATTTGACCATCGTGTAAAAGAGAAACTAGACTAGTTTGTTA CAC
AGTAGTGAATTTTTTATTACTGTACAAGGAATCATCATACGATGCCAAGATTGTGGT GAT
GACTCCAAAAAACACTTGACAATGTCTATTGAGTGAGCTTAAGTTATAATATGTCTG CAG
TAAACATGTTCTCCCACAGCCTTAAAGATGAGATCCGTGGAATTCGGAGAACCTTAT ATT
TAGCTCTCTAATGTGATCTTCAGTTGAGTTACATGCCATATGGATCTCTGGTGAAGA AAA
GGGAGCTGAAGACTTACTGATTAAGGGCATCAAAATGACATTTGACCTCATCTAACT CT
GTGGTGTCATGTGGTGTCATTTACTGTGCATATATACTCTAATAATTAACGTTTTCC ACTT
TTCCGGGAAAAAACACCCTAAATATGGGTGTCAGAAGTTAATAAGTTGATATGTGGG AG
GGCTGATGGCTACTTTGGTTTGTTTTGGCCCTCCATAGCGGATAGGAGCAGTTCTAA TT
TTTGAAGTATCTTGTAGGCTGTTGGTACTTTTTCATGACATAGTAGTGGCGTTATGC TTA
TCTAATGGCAAATGTATGACAGGACGAGGGTAGAGCTTTGGGTACCATTTCTCGGGA AA
TGTTAACTCCCTAATGCTCCTGTAAAATTACTCGAGTTGATATGTAATCAAGTTGAA GAG
ATGCAACGTTTGAGCCCATGTATATGTCATCTCTCTGTCTCTCAGGCTTTTAACATG CTG
AGGGCCGTTTCTCAGATGCAGAGATGTGATATCAAGCCGGAAAATTAAGTGTGCATG AT
TTATGTTAGCAATTGATTGCAATTCAAAGCCTTGAATGGTTGATGTTTGGTTTAAAT ATTT
CAACACGTATAACCATCCTGGCAGTTAGTATATGTGGTTATAGGAAGACTTTCAATT GAT
GCCTTGATGTTGTTTTTCTAAATATTTCACAAGTTACACTGAAGCGTTACTTGCAAT AAG
CTAATTTTATTGGCTTCATGAATTCTTTATTTCAGGTTGAAGTGACCAATTCAGCTCTTCT
TGCTTTAATAGAAAATTACTGCAGAGAAGCTGGTGTACGCAATCTGCAGAAGCAGATTG
AAAAGATTTATCGCAAGGTTCTGTGACCTGTCCTCTTTTGTATAATACATCTATGAT GAA
ATTGTCTCGAGTCTCGTACCTAGTCAACTTTTGATGAAAACTGATGGTTATTTCATG AAA
TCCAATACGAACACTTTGAATCGTTTTCATGCTCTGATACTTTGCACATCCTGTTCT TGC
TTTTCAGATAGCTCTAAAG CTTGTCAG G GAAG ATGG AGAG ATTG AG CCTCAG AATG CAG
AGGTAGGTGAGGTAGAAGCAGAATCTATCCATCTATCAGACGAAATCAAGTCTAAGG AA
GAAATTCAAGCTGGAGCTGAGTCCGCAAACGGTAGCAATGATGACAAGGCCTCTGAA A
ATAATGCTGAAGCTGAAGCACAGGGAGCACCAGTGAATCAAACACAGAAATCTGCTA AT
GAAGATGCTTGTTTACAGGTAAATGAAAAACATTAAAAAGCAAAATTATAATGTTTA GTA
CTTCAGGTGATTCTTGCCAGTTGTAACTATGTATGCTACAATGTATTTTAATGCTTC TAA
GTTTTATCTATGCTCAAAATAAAAATACAAGATGCAGTGTAATAGGTAGTTTTGTAG GCA
CAGAAGTGTCTTTTTACAACTTGTCTTACTGTCACATGTTGGATTAAATATGGTTAA CAAA
TG AATGTAAGAAATCATTTATCTG G CAATATAACAC CAAACAAGCAG GG G AAG CACTTT
CTTTTG AAATGTTAACTAAG AACAAG CAAG GG AAGC GCTTG GTATTCATTTTCTAC GCTA
AGTCCCCCCTGTATCTTAAAAAGGTATCTATGCATAATTTGCATTTATATCAGACTG TAA
GACAAGAGTGGGTTGCTCTAGTGGTGAGCACCCTCCACTTCCAACCAAGAGGTTGTG A
GTTCGAGTCACCCCAAGAGCAAGGTGGTGAGTTCTTGGAGGGAGGGAGCCGAGGGTC
TATCGGATACAACCTCTCTACCTCAGGGTAGGGGTAAGGTCTGCGTACACACTACCC TC
CCCAGACCCCACTAGTGGGATTATATTGGGTTGTTGTTGTTATCATACACTGTAATC AGT
TAAAATCAAATTCTTTG G GATG GAG GG GTTTCTTTAGTCTG G CTAATTAGACTTG ATAGT
CAAATTTGTTTTAAGAAGTTGTGCCAAATGTTAACTCACTTGTTATAATCTAGCGCG CGC
ACACACATTTTGATATGTTTGATGTTAAGCTGCTAATGAATAACGTCCTTATATTTT CCTT
GACATTGCATGATTTTATGTTTACATATTTATCTGCGTTTAATTTAATGCTATTGTG TCCT
TG CAACTG AATTTTG GTTCTG AGTG GTTGTG ATCTTAACAGG ATACTCAAGAAACTG AG
AAAGCAACAGAAAGTGAAGCGAGTAAAACAGTAAATAAAGTGGTTGTTGACTCGCCA AA
CCTAGCTGATTATGTTGGCAAACCTGTTTTCCATGCGGAGCGCATATACGATCAGAC AC
CAGTTG G AGTTGTGATG GGTCTTG CTTG GACTTCAATG GGTG GCTCAACACTCTATATA
G AAAC ATCTCTG GTGG AGCAAGG AGAAG G GAAAG G GG CTCTCAATGTAACAG GACAAC
TAGGCGACGTTATGAAAGAAAGTGCCCAAATTGCCCATACGGTTGCCAGGACCATTT TG
CAGGAAAAGGAGCCTGATAACCAATTCTTTGCAAATAGTAAGCTTCATCTTCATGTT CCT
GCAGGTGCTACCCCTAAGGATGGCCCTAGTGCTGGTTGTACTATGATAACGTCCTTG TT
GTCTCTTGCCATGAAAAAGCCTGTTAAAAAGGACCTGGCAATGACAGGGGAAGTCAC G
CTAACTGGCAAAATTCTTCCTATCGGCGGGGTATGTTAACAATTCTTACACCTCTCC TTA
TAATTTCATGCAGCTTTTGTGTCTGATCATCTATCATGTTTTCTTTTTATTTTTCGT TGATT
TTGTCTTTAATGTTCTTTATGCTTTAATTATTTCCTGTGTCTGTATGTGTTACATGC ATGC
GCATGTAAGCATAATAAGAGTGGTCTTTTCTTTTTGACCCAACCAGTGTTGGGTTCC TTT
CTTGATTTTACAAAAGCTTTACCTTTTGGTTCAATAATAGGTCAAGGAGAAAGCCAT AGC
AGCGCGAAGAAGTGATGTGAAAACTATAATATTCCCTTCAGCCAATCGCAGAGATTT TG
ACGAGCTTGCTCCTAATGTCAAGGAAGGCCTTGATGTACACTTTGTGGATGACTACA AG
CAAATATTTGATTTGGCATTT
SEQ 77
ATGCAGTTTTTCCGAAGAAACCCATCACTTCACAGAATCTCCTCCAGATTCCTTAATCAA
GTTCGTTTTCTTTTCTTTTCCTTTTCCGAAGTATAACTAGCTTTTCAATTTTTGTTT GGCTT
TTCGATTATCTTACTAGTGTAACATATATTTCATATGTCTTGAGGTTCATTGCAAAA ACTC
GTTATATTTCTAAATGGGGTTGAGCACGTGGTCCAATACAATTCAGTAGGACTTACA GC
TGTAGCCTGTAGTTAGAGGACATTGGATTAATTAATTATATGGCTGCAATTCAGATA TTC
AAAACGTTTCTTTTCCCTGTTTGCAATTTTTTCCTCCAAGTAGTGAAACAGTGGAAT TTTC
TCCCCATTCTTAGGTCAAGCTATCATCTTTTTGCTTAAGAGTTGGTTTGGATGTTTACATT
TATTTTCTAACACAATTTGTTTTGGTTTGCTGCTGATATCTCATGAATTGGATACGATAGG
TAGTCAAAACCAGTGCATATTCAACCAAGAAAGTTTACAATGCTGGGCAGCCGACTG CT
GCTACTCACCCTCAGGTACTTCTACACTGCATATGTAGTATTGACATTTGGTAGCAT AGA
ATTAAGACCGTTGACATTAACAAATGATAAATGTGCAGTAAATTTAAAATCTTGTTT TCTT
GGTGGTTGTTTTCCTTGTATGGGACATACTTCGCTGTCCTTTGGAGCTCCTTTGTGA ATT
TCTGTTAAATGTTGTTACATCACTGCATGCAGTTAATGAAGGAAGGGGAGATTACTC CT
GGCATTACCAGTGAAGAATATATGCAGAGAAGGAAGAAATTATTGGAGTTTCTTCCG GA
GAATAGCTTAGCAATTGTTGCAGCCGCTCCCATAAAAATGATGACTGATGTTGTACC AT
ACAATTTTAGGCAGGATGCTGACTATTTGTACATCACCGGATGCCAACAACCTGGTG GT
GTTGCAGTTCTAGGGCATGACTGTGGTTTATGCATGTTCATGCCAGAACAAAGCCCC CA
GGTATTTCAGGAACCATTCACTTGCTTCCTTCTTGTTGACAAGAAGCTGTTAATAAG AGA
AAAGCTTCGTCCTATAATTTAGTGACATTTTTCTTTAGATTCAGTTACTACCATGAT TTTT
TGGTAGTTAGTATACATTGTAGCAAGTTAAAGATTGTTTCCATACTAAAAGTGAAAA AGT
ATTTTTAG G ACG CTCTTTG G CAAG GAG AAACTGCTG G AGTTG ATG CAGCTCTACAG ATA
TTCAAGGCTGACCTTGCTTACCCTATTAACAGATTGCCTCAGGTAAATCTTTTTTAA AAT
CATATCTCCAACTGCAAATAAGTTTGAGATTCTTTTTAGAAGCGAATACCTCTCACA CTG
ATAAGTAAAAGGGCATATGATAACATCCCTTCTTTTATTCCTTTCAATAGGACAATG AAG
TACTTTATCTAAAAAGGGAGTGGAGACCTATTTGTCTCCTTTCCACTTGATTATAGA AAT
TTATGTCAGAGAATTGAATCTGTTAGAGTTGGCTTGTAGATACCTTTTGACTGTTGA TGC
AATTCTTAATATG CGTAAAAG AATTGTTTTTCTC CTTTTTCTCTTTTCTTG CCG GG G AAAA
GAATTGTTTTCCTCCTTTAATATGCGTAAAAGGTATAGAGGGAACAAAGTGGATGGA AG
TTAGAGTTTTCACCTAAGTTGCTCCGACACGGCAATTTAGGTGCCGCACCCATATCG AC
ACGACACTAGTATGGGTGTGGGTATGGGATCCGTACCGGATCTGGTCAAACAATTTT G
GGTACTTTGACCACGACGGATGGAAAAATTCGAGACGAGATACAATTTGATTCCCAA AA
TCAGAATCTAAGGTAAATTTAAATAAAATAATATACCTTATCTAGAAAATCAATCCT TTAC
TTATCTATAACTTGAAAATAAAAAGGAAATCCACACTTTACAAGCTATACGTAAGTA ATCC
ACAAAATTTCTCATAATTTAAAAATATTTTTATTTTTTTTGAATTATTTTTAGTCGG ATCCC
CGCACCCATATCTGTACTAGGATCTGTATCCCCGAATCTTAGAATTTACATCTCGAA GG
ATCCAACCTCTAGATTCGCACCCATGTCGGACACCCGCACCCGTGTCCGAGCAACTT A
G GTTTTCACATATATAG G AGTCG GG CCTG G CTTATTACTATAAATTCATGTTTG ATAG G A
CCTATTACTGGATGTAGCCTTTCCTCATAATTTTGAAAATCAAGCAGGCATCACGCT AGG
ATCGGTTGAGAATATAATATTATGGGTAAGAAATTGAAGAATGAAGGTTAACAGAAA GTG
G ACACTGTGTTC CAAATG G AAACTAG GTAAATG ATTTAG G CAG ACGG AATATTTTTTG GT
TGGCTGTATTTGGCTCTCAGGTTAACCGTTTGACCACTTAATGGTAATTTACTATTT AAA
TCGGCAACACAGAGAACAGGAAGTGAAGATGTATATATGACTGTGTTATTTGTAGAG AA
CCAGTTTATGGTGTAGGTTTTCTAGTTATTGTAGAGCACTTGCGTACAAGAGTTTAA ATT
CCG C AAC ATC G ATAAATTCTTAC CTTATAAAAAAG AG CAG G AAATG AAG AC GAG ATG C C
GATATCATGATTAGATCTATGTCAGCAAAGAAAAAATGTCAATTATTTCCTTCTAAG CTGT
CTTTCCTGTACATGGCTTCAATGTAGTGACTTTGTTTCACTTTCTCCATGTTCCAAT TTCT
CTTCTCTAATTTTTGCTGGACTTGTCAGATTCTCTCCAGGATGATAGAAAGTTCTTC CAC
TGTGTTC CATAATGTG AAG ACAAG G ACTTCATCCTACCTG G AG CTTG AG G CCTATAAAA
AAGCAGTTAGCAATTACAAAGTGAAAGATTTCTCTGTGTACACTCATGAAGCCCGAT TTG
TG AAGTCTCCAG CAGAG CTGAAATTG ATG AGAG ATTCTG CATCTATAG CTTGTC AG GTA
ATGGTAGTTCTTTCATTTTTGTCAGGTTCATGGGTTAGAGTGGTAGTTCTTACTCAT AGA
G GTTCTTGTTTTTATTG G ACATGG AAAAG CAGTCTCTGG GTTATAGAG ATG GAAAG ATA
GAGTGTACACTTGACACTACTTTTGTATGTTTATTTGTTTTCCATTGAAGTTGATAC TCTT
CACACAGTTAACATGTGACTAAGGTATTGATATGCCAGCGAGGTGTTTTCAGAATTT TAA
AAAGCTTATTTGCAGGTCAGCTGTTATACAATCTTAAGTAACATGTTTGTCATTTTG CTAT
ACGACAAAATTTTTTAGAAAGGTAAAATAGGTATTTGCATTTCCCTTTTTTTCCTCT TCTT
CTTTCATGTCTAGGGTGGTGTCTTCAGGTTGAAGATACTACACTTCTGAGGATCTAAAAA
ATATTTCACAAAAGGAAAAAGGGTACAGTCAGATAAAAGGATCACCAGTCTAAAAGAAG
ACGGTTCTTAATATTCCAAAAGTTGGAGTCCCAGCTTTCTTACTTGGGTCAACATATTCT
TGGTCTAATTGTGAAGGAACAGTTCTTGCATGTACAATCCTTTCTTTGATAATGTGC TTC
TGTGTTAAGTAGTTCAGAAGCTCTAGGCATGCTTAACCAAAAGATGTGTATATACTA CTC
ATTCATTCTATTTCACAATCATGATTTGCATGTTTTCTTATGAGAGACTGGTCTAGA AAAT
GCTTCTTCCTATTCCTGGATTTGTATGCAGTTGCCTAGCAATAAAGTTGCCAGTTAT ATG
GGAGTTGAGATATTTTCCTTTCACTAATTCAGTCCTTTTTTATACTGTATAAAGGAT ATTT
TTTATTTCTTGATCTTTTAATGTCTGTCTTGTCTTTCGGAAACAGCCTCTCTACCCC TCG
GGGTAGGGGTAAGGTCGGCGTACACACTACCTTCCCCAGACCCCACTAGTGGGATTT C
ACAGGGTCGTTGTTATTATTGTTGTTCTTTTAATGTATATATTTTTGGTAGGCACTT GTCC
AGACCATGTTGTACTCGAAGTTGTTTCCTGATGAAGGAATGCTGTCAGCCAAATTTG AA
TATGAATGCAGAGTTAGAGGTGCCCAAAGAATGGCGTAAGCTTTTTCTTGTAATAAT TTT
TGGAAGTTTGTATATAGAGAGGAGCACGTTGCAATTTCTAAGTATTTTAGTCTAACA TGA
GTTGCAGGAGAGTAAATCAAAATGCCACTAAGACCTCATGTGTAAACATGCAATTGA TTT
TCTTTTTCTTCTATTCTCTGCGATTCTGATAATTTGTTGTTTTTCCTGACTGTTAGT TTTG
GTCATACTTCTGGTTGAGATAGTTTCAAGGATTATACATTTTTCTTTTCCTGTTCCA CAG
GTTTAATCCTGTTGTTGGTGGCGGACCTAATGGCAGTGTCGTGCATTATTTTCGTAA TG
ACCAGAAAGTATGTTTACTGTCTTTAAGCACAGTTGAATTTGAATATCAAGCATATT GAG
TAGTAGTATCTAATTTGTTGTTTTAACAGATTGAAGATGGGGACTTTGTTGTTTTAA CAGA
TTGAAGATGGTAACCTTGTCCTCATGGATGTTGGATGCGAGCTCCACGGTTATGTCA GT
GATCTTACTCGTGTTTGGCCGCCCTTTGGAAAATTTTCTTCTGTTCAAGTAAGTATA GAA
TCCATGATTTTCTTCTCCGTTTTCCCCTTAAAACTCAAGTCAAACCCCACTCCTCTG GGT
AAAAACCCTGTTAGCTGATCAAAGTCATAGACAACCTTCCATTTCAGAAAGAATGCA CTG
AC CTAAATTG AAAC C C C AG ACTAC C AAAAG C AATCTAG AAAG AC AAAC G GTAAAATG AA
AATCATCAGGTAATGTAGCCTAGCAGCTAGCTTCACCCTCCAGTGGTATGAGTTATG AA
TCTTAATTCCAGATCCTCAATGGCCTTGCTGATTTGATGTGGATGTGAAGAATGAGA AT
GATATAATCATATAAAGTTCCTCCTTATTAGAAAAACAAAATTTCAATTTTACTTCT AAGC
TACAGATTATGCTTGAGAAATAAATTCTTCCCTTGGTATGGTTTTAAATTGCTACTT TTCT
GTGATATAGTCTCTATACATTATTGTCTGGAACTTAGTGATACGCTCCAAGATACAT TTC
AGG AGG AACTTTATAATCTTATTTTGG AGACAAACAAG GAATG CGTG GAG CTGTG CAGA
CCTGGCACAACCATCCGAGAAATACACCACTACTCGGTACTATTTTAGTTAATCCAT CTC
GTAATTTCTTTTG GTTTATATTCAAGG G GTAG CTG AG AGTAGG AATTTAATTTTTTTTCTC
TTGCCTTTCATAGACTTAGACCCAGTTATATTGCCAAGTTACATTGGTAGTCTCGGT GAT
AGAAATTTGGGTACAGTTGTGAAGGCCCTACTCTTGCTTTATGTTTTGCCTAATTCT CAA
GTTACACTGACTTCCACCTCCTATTGTGAATAGCAATGTTGCTTCAAAGTTTTCGTT CCT
ATGCATGAGCCGAGGAAAATGAGGCTTGATGGCATTTTCGAAGATAAGGAAAAAAGT TT
AATTCCTTTTAAGCCTTTGAGTATAGAGGTGTTGGGAAGAGATAGATGCCTTTTGAT TGC
CCTCCCTTGATTTGAAAATAGTATTTGTTCCCCCATTTCTTCATATATGATGAATAA TGCT
TTGTAAAATAAGCCAGTAAGGTAATGATTAGAGGTGTCTAATTAGTGTAGTGTGTGG TAA
TTGATTCTCAGTGAAGGGTAGTATTACCTGGTTGATGCAAATGCTTTATGAATTAAG GGT
CATTCTCTGCATTTGTTGTTAGGGCAGCTGTTAAAGGTCTTATTACTGTTCAAGATA TGG
GCGTATATGGTATATATTCTGTAGTGAGTAATTGCCCTATATCAGCATGCTCTTTTC TTTA
G ATACTTG AG G ACTG CCAAG GTCTCATTCTTTTTTTTTTATTTG ATGTGTATAG GTAG AA
ACGCTGCGAAGAGGATTCAAAGAAATTGGGATACTAAAAAATGATCGGCGTGGAAGA TA
TGAAATGTTAAATCCTACAAATATAGGTCTTTCCTTTTAACCCTTACTCTTCCGCTG CAG
ATATAAGTACAAATGCATGTGCAATAGCAGCAGAAACCTGCTCCCTCTCAATTGTCT TCA
CAGTTGCTAATGCTATTCCTATTATGCTTTTTGCTGAAAAGAGAAATGATTTCTTGT ACC
GCTGCAACCATCTCTGAATGAATTTGGTTTGTTCAATTATATTTCCAGGTCACTATC TAG
GAATGGACGTTCATGATTGTTCTACAATTGGAAATGATCGACCTCTGAAACCTGGTG TA
GTAAGTTTCCTTCCTTACTGATGATTGCTTTGATATTTGAAAATAATCGGGAAACTG CTA
GGTTTGCAAAAGAAATTGGTTGTCATTATTTTGAAATCCTCCTAACGAATGAGGACC AGT
GACTTGCTCATTTGGAAACAAATGAGCTTTGCCATAATGCATCACTCTCTTTTAGCAATT
TACAATGAGTTTTCCTCCATAGGATAGTTCAGTCATTCTCTCTTTGCTTCTTGACTGGAT
TACAATATGAACCAACTAAGAATGCTTATGTTTTTAGAGCCATGTGGTCAAATTTCC CTT
TTCCTTCACTTTTTTCATTTTTATAGAGATGCAAAGGGTTAAAAGAGAGGATTGAAT GAT
GATATATATGAATATTTTCTAAACTGCTTTTGTCATATGCACCTCTTTGGTCCTATT GCAG
CTTCTAATTTCATCACTTCTACTAGTAGTTACATGGAGAAAGTTAAATTCAGAAATG AAGT
TGGTTAGGTTACATGGAGAAAATTAAATTCAGAAATGAAGTTGGTTAGGTTTATTTC TAT
TGAAGGATGTGTACATCAGGGTAGTGCGTGTATTTGCATAAAAATTATGTTTGAAAT ATC
TGACGGTCCAATTAGCAGAAAGATCAAAGTATCTTTTGCTTCTCTCTAGATTCTATA TAG
ACCTTTGTTTTGATTGATTAATTTGAAATATTTGAAATGATTAATTCCTCGACCGCT TAGT
TCATTAACCCCGGCTTCATACACGATGCAATTTTTGTTATTGATAAAGGTTTCGTTC CTG
GTATGTCTACTCTTTATCAAAAAGTAAAGCTCTTGTATATCTTTTCATTCTGAGTCA CAAG
GAATGTAATATTCCTGCAGAATGTGTAATGTTATATGCAATTGTAGTAAAATTTCTC AGTA
GCTCGCGCATCTTGTTTTCATGTTGGTACTGCAAACTGTTAACTTATACTTTATGTA TAAT
CCATTTCATGACAAGTACCTTGCTCTTGTGAAAGGTACTTAACGTCCAACATGCTTG CTT
CCATGTATGAAAAATTATTAGTTATCCCATTTTGCTCCTTTTTCCTTCATTCTTCTA ATCAT
AAAAAATTGGAATATGCTCCCGACCTGTCTGATATGACAATAAAAACATACACAATA TAT
ATCAAGTCAGCTGTTATATGCAAAATTACTGAAGGTAATTCCAGTAATACAGCTTAT TGG
TGTCAGCGGTAGATTTATGTTAAATTATGCTTTAACTGAGGTCTATTTTGCCAGTGA TAT
CTGTATCCATGCATTTGTTTTTAGTCCTTACAAAAAACAATTTCAGAACTACTATGT TTTT
GAATAAGAAGCACCATGAACATGCTACTTAGAGGTCTTAGTTTGTATAATTTATGTT GAT
CATATCTGTGGAGAAATATGCTAATTTTGCGTGGGCCGTGGCATTTGTATTTGAAAG AA
GTACATGACTTTTGATTGTTCTGAGATTATGTGCAGAGTCTAGTTCTTTGACTTCGG GAC
TATGGAAAACTTTGTGTTTACTCTACATTCATACATCTAGACAAGGTCATGGCCAAT TGC
GAAACATGCTTTACGTTTTTTTAAAAGTGACGGAGACGCATATAGCTCAGAATGATG AG
ATGAAATTGGACAAAGCTAATAGTCTAGGATAAAATTGCCTGGTCATTGTTTAGACA TTT
GTAAACTCCTGTTCCCTCTGTTTGTTTCTACATTAACCTTGATGAGGCGGTCATTGA TAA
GAAGAATCTCTGACCCCATAAATAGATGAGTCCTTTTCATCTTAGCTTCCAATTACT TGT
GATTTCTGCAAGAACTTGTGATCAATCTTCATGGACTGTTATATGTAGGTCATCACA ATT
GAACCAGGAGTATACATCCCTTCATGCTTTGATTGTCCAGAAAGGTAATACTTGTTA CCT
CATCAAATTAATGTTCCTTTTGGCATGCATTCAGAAGTTACTGTATCTTAGATCATC CTC
CAGATTCTTGGTTTATTGAACTGGTTCTATCTGCCAAAATGTATTATGGTGGATGGA CGA
AGAGTTACTCTTTCATGCAGAAAAGATTGAGAATCATATAAAGATGTCTGTTCTAGG TGG
GTCAGGTAGATCTTGTAGGTCTAGTCAACATGTTAATCTAAGGACATTGTAGCAGAA GT
AGTCGTATCGTGCAAAGGAGTCCCAGTTTGGTTGGGCTTTCAGTGAATGACTGATGG CT
ACTTGTATGTTCAACCGAGAATGGGATCAGGAACCTTTACTTCGAAATTTCTTTTGC AAG
GTTCGAGAAAACTTCCCAGAAACAAATAGCTCGAAAGAAATATGAAAGAAATATTCC AC
GTATCAATGGCATATTCTGGCTGTCCCATTCTAGGGAAAATTGTTGCTTTTGTCATA AAT
CTTAGGGAGAAAGAATTATTGCTCACTTGAACTAAGAAGCTTCATCTGCTGATTCAT CTA
TTGTTAAGGAGCTAGATTATTCTCTTACCCCAATAAGGAACAACGTGTCTGTTTCCT TAG
AATTCTTGATTTTTACTCACTTATATATGTCTATATTCACTTATGCTCATTCAGGTT CCAA
GGCATTGGATTTAGGATTGAAGATGAAGTCCTTATTACAGAATCAGGTTATGAGGTA TA
GTTACAGAAATCGTTCAATTGTTTGAACAACCGAGTTATACAAGTACCAGTTCATAT GAT
CTCTGATACTTTGATCACTTCCGACACTTGTTAGCATTCAAGACCTGATTTTCTGCC CTA
CTGGAAACAGGTACTTACTGCATCCATACCGAAGGAAATTAAACACCTCGAGTCCTT GT
TGAACAACTTTGGCAGTGGGAGAGGAACAGAAATTAGAGCTGCTCTCAGT
SEQ 78
CTACTCACCTCTCACAAAAACCATATAATTCTCCTTCCCTTTCTTCTCTACAAAATCTTC A TTTCTCTCCAAAAACAAACTCTCATGGCTTCTTCTACTAGAGTTTTTGTTCTTCTCCTTC T CATAATCTTCAACTTTCTCTACATCTCAGCACAAAAAACCATTAAACATAAGCCTTTTTC A
ATGTCATTTCCTCTTACTTCAACATCTTTATCACATAACTCTTCTTCTAAAGCTCTT TTTCT
TTCTTCCCTTTTGGCTTCTAATCAAAGAAAACAAGCTCCAAACACAAAAACTGTGTC TAG
AATTCCATCTTTGAACTATAAATCAACTTTCAAATATTCAATGGCTTTAATTGTTAC ACTT
CCAATAGGGACACCACCACAAAATCAACAAATGGTTTTGGACACAGGCAGCCAACTT TC
TTGGATTCAATGTCACAAGAAAATTCCAAAAAGACCCCCACCAACGACGTCGTTTGA TC
CTTCTTTGTCCTCCACTTTTTCTGTTCTTCCTTGTACTCATCCTTTATGTAAGCCAA GAAT
TCCCGATTTTACCCTTCCAACTACTTGTGACCAAAATCGCTTGTGCCACTATTCTTA CTT
TTATGCTGATGGTACTTTAGCTGAGGGTAATCTTGTCCGTGAAAAAATTACATTTTC ACG
TTCCCAAAGTACCCCTCCTTTGATTCTTGGTTGTGCTACGGAGTCCGAAGATGCCGA GG
GTATTTTGGGAATGAATCTTGGACGGTTTTCTTTTGCCTCCCAAGCTAAGGTACAAA AAT
TCTCATATTGCGTGCCAATTAGACAAGGTAGCCATGCAGTTAAACCTAGTGGAACAT TTT
ACCTAGGCCAAAACCCTAATTCCCATACATTTCAATATATAAATCTTTTGACTTTTC CTCA
AAGTCAACGCATGCCAAATTTGGATCCACTAGCTTTCACTGTTGGCATGGTAGGGAT AA
AAATTGGCGGCAAAAAATTAAACATCTCCGGTAGGGTTTTCCGGCCAAATGCTGGTG GT
TCTGGCCAGACGATCATTGATTCCGGCACGGAATACACTTTCTTAGTGGAAGAAGCG TA
CAATAAGGTCAGAGAAGAAATTGTTAGGTTAGTTGGTCCAAGATTGAAAAAAGGTTA CG
TTTATGGTGGTGCACTTGACATGTGCTTCGATAACCGTCCGATGGAAATCGGACGGT TG
AT AG GTG ATATG ACATTG CAATTTG AG AACG G GGTTG AG ATTTTG ATCAATAAG GAAAG
GATGTTGGATGAAGTAGAAGGTGGGATCCATTGTGTTGGAATCGGACGGTCAGAATC A
CTCG G AATAG CAAG CAATATTATTG GTAATTTCCATCAG CAAAATTTATG GGTAGAATTT
G ATATGAG AAATCG AAG AGTAGGTTTTG GCAAAG GAG AGTGTAGTAG G CAAATG
SEQ 79
ATGGCTGCACTCAATTTCTTCATAATCTTCACATCACTAGTCTTACCAATTGCATCTGAT
CCTCTGTTGTCAACTTATGTTGTCCATGTTGACACCAAAGCCAAGCCATCTCATTAC TTA
ACTCAAGATGAATGGTATAATTCAGTGGTTGAGTCAGTTCTTGCAAACAAAATGGAC TCA
GATTCTACTTCTCCAAGATTGTTCTACTCATATGATGTAGTGTTACAAGGTTTTGCA GCA
AGATTGACTGATCAAGAATCTGAAAAACTAAATAAATTTCCAGAAGTCATTCACATT TTCA
AAGATCAGTCTAGAATCAAGCTTGACACAACACGTTCGCCGAATTTTCTTGGCCTAA AC
ACAGGTTATGGTCTGTGGCCACAATCTAACTTTGGAGATGATGTTATAATTGGCCTT GTT
GATACAGGGATTTGGCCTGAGAGTGAGAGTTTCAAGGACAATGGTATTGGTCCTATT CC
AACAAGGTGGAAAGGTAAATGTGTTGATGGAATTGAATTCAACGCGACGAGTAGTTG TA
ACAG AAAACTTATTG GTG CTAG GAATTTC GTTAAG G GTGTTG AG AATG ACTATCATCATC
AATCG G CACG AGATCAAAATG G ACATGG AACACATACTG CTTCAACTG CAGC AG GTACA
GAGGTAAATGGTGCCAATGTATTTGGTTTTGCTAAAGGGAAAGCACGAGGGATTGCG A
GTAAAGCTAGGATTGCAATGTACAAAGCTTGTGGGAGTAGTTCTTGTGCAGAATCTG AT
ATTTTAGCAGCTATTGAAAGTGCTATAAAAGATGGCGTAGACATACTTTCGCTCTCT TTA
GGATACGATGATGCTCCGTTTTATGAAAATCCAGTGGCAATTGCAACATTTGCTGCT GT
TAAAAGGAACATATTTGTTGCTTCTTCAGCTGGAAATCTTGGACCTTATCCATTTTC AGT
TCACAATACAGCACCTTGGGTTACAACAGTTGGAGCTGGATCACTTGATCGCGATTT CC
CCGTTG AAATC AACTTATC AAACAACAAG ACTTTTGTTG GTTCTTCTCTTTATC CAG G GA
GAATCAGTGGTAAAAGTTACTCTCTTGTTTATATTGAAAATTGTTCTATAATGACAA TCGA
TCGTTCTAAAGTTGAACGAAAGATTGTAGTTTGCAACACTAGTAAAATCGAAGCTCT TAG
AAATGGGATTTTAATTCAGAAAGCAGGTGGTTTTGGACTGATTCAATTAAATCTTCC AAC
TG AAG GAG AAGG G ATTAG AG CAATG G CTTACACATTGC CTTCTG CAACATTGG GTTATA
AAGAAGGTATAGAGCTTCTTTCTTATATCAAATCCAATGCTAATCCAAGAGCAGGGT TCG
TACGTC GAAAG GATAC AGTAATTG GG AAAAAAGTTAG AG CTCCAATTGTTG CTAG CTTTT
CTTCAAGAGGGCCTAATGTTGTTGTTCCTGAAGTCCTCAAACCTGACCTCATTGCTC CG GGTTTGAACATTCTTGCTGCATGGCCAGGTAACCAGAGACGGATCCAGGATTTATACCT
TATGCATTCAACCTTTATTCTTTACCATTGACCCCACGACACTTTTAAACTTATGAG GTA
GGAATTTTATACTTTTTGAAATTGTTGTGATTTTTCATATTGCGTGGAAGCAACCAC TAAT
GCTTATGGTAGGATAGGCTGTCTACATCACACTCCTTAAGTGCGGCCCTTCGCCCGA C
CCTGCGTGAGCAAGGGATACTTTATGCACTAGTCTACCGCTTTTCTTTATTTAGTGA TTC
TTCACATTGTGTGTGTCTATGCAGGTGACATTTCCCCAACACGTCTCAAGATGGATC CA
AGGAGAGTGAAGTTCAATATAAACTCGGGAACATCAATGGCGTGCCCTCACATAGCC G
GAGTAGCTGCATTAGTCCGCGCTGTTCATCCAGATTGGTCCCCGGCTGCTATAAAAT CC
GCACTCATGACTACATCCACAGCATTCGACAATGCACAACTCCCTATCATAAAACAC GA
AGACATGGAGCTAGCAACTCCGATCAGCATTGGAGCCGGGCACGTGAACCCTGAATC G
GCTATTGATCCGGGCCTAATATACGACACTGATACATCAGACTACATCAACCTACTA TG
CAGCTTGAATTACACAGAGAAACAAATGAAACTTTTCACGAACGAGTCAAATCCTTG CTC
GGGTTTCACTGGATCTCCACTTGATCTTAACTATCCATCACTTTCTGTTATGTTCAG GCC
TGATTCCTATGTTCATGTAGTTAAGAAGACACTGACACATGTCGCGGTATCTAAGCC CG
AGGTGTACAAAGTAAAGATAGTGAATCTGAATTCTGAAAAGGTGAGTTTAAGTATAG AG
CCAAGGAAGCTGATTTTCAATGAATCTTTACAGAAACAAAGCTATGTGGTCAAATTT GAG
AGCCATTATGCATTCAACAGCAGCAGGAAAATAGCTGAGCAAATGGCGTTTGGTTCG AT
ATTGTGGGAGAGTGAAAAGCACAATGTTAGGAGCCCCTTCGCTGTTATGTGGGTTCA G
CAAAATTTCAATAACAGTAGATTATACAAA
SEQ 80
TCAAAATGCCAAATCAAATATTTGCTTGTAGTCATCCACAAAGTGTACATCAAGGCCTTC
CTTG ACATTAGG AG CAAG CTCGTCAAAATCTCTG CG ATTG G CTG AAGG G AATATTATAG
TTTTCACATCACTTCTTCTCGCTGCTATGGCTTTCTCCTTGACCTATTATTGAACCA AAA
GGTAAAGCTTTTGTAAAATCAAGGCACTTCAGAAAAGGAACCCAACACTGGTGGGGT CA
AAAAGAAAAGACCACTCTTATTATGCTTACATGCACATGCATGTAAACACATACACA CAG
AG G AAATAATTAAAG C ATAAAG AAC ATTAAAG G C AAAATC AAC AAAAAATAAAAAG AAAA
CATGATAGACGATCAGACATAAAAGCTGCATGAGATTATAAGGAGGTGTAAGAATTG TT
AACATACCCCACCAATAGGAAGAATTTTTCCAGTTAGTGTGACTTCCCCTGTCATTG CCA
G GTC CTTTTTAACAGG CTTTTTCATG GCAAGAG ACAACAAG G ACGTTATCATAGTAC AA
CCAGCACTAGGGCCATCCTTGGGGGTAGCACCTGCAGGAACATGAAGATGAAGCTTA C
TATTTGCAAAGAATTGGTTATCAGGCTCCTTTTCCAGCAAAATGGTCCTGGCAACCG TAT
GGGCAATTTGGGCACTTTCTTTCATAACGTCGCCTAGTTGTCCTGTTACATTGAGAG CC
CCTTTCCCTTCTCCTTGCTCCACCAGAGATGTTTCTATATAGAGTGTTGAGCCACCC ATT
GAAGTCCAAGCAAGACCCATCACAACTCCAACTGGTGTCTGATCGTATATGCGCTCC G
CATGGAAAACAGGTTTGCCAACATAATCAGCTAGGTTTGGCGAGTCAACAACCACTT TA
TTTACTGTTTTACTCGCTTCACTTTCTGTTGCTTTCTCAGTTTCCTGAGTATCCTGT TAAG
ACCACAACCACTCAGAACCAAAATTCAGTTGCAAAGACACAATAGTCATTATATTAA ATG
CAG ATAAATATGTAAACATAAAATCATG CAACG CCAAG G AAATACAAG G ACATTATTCAT
TAGCAATTTAAATTGGTAGAATTCTATATATTTTTCTTTCAGTACAGACCAATATGA GGGT
AGAAATAACATCAAACATATCAAAATGTATGTGCGCGCGCTAGATTATAACAAGTGA GTT
AACATTTGGCACAACTTCTTAAAACGAATTTGACTATCAAGTCTAATTAGCCAGACT AAA
GAAACCCCTCCATCCCAAAGAATTTGATTAACTGATTACAGTGTATGATATAAATGC AAA
TC ATG C ATAG ATACTTTTTAAG ATAC AAG G G G G AC AT AG C ATAG AAAATG AATAC CAG G
CGCTTCCCTTGCTTGTTCTTAGTTAACATTTCAAAAGAAAGTGCTTCCCCTGCTTGT TTG
GTGTGATATTGCCAGATAAGTGATTTCTTACGTTTTGTTAACCATATTTAATCCAAC ATGA
GACAGTAAGACAAGTTGTAAAAAGACACTTCTGTGCCTACAAAACTACCTATTACAC TGC
ATCTTGTATTTTTATTTTACGCGTAGATAAAACTTAGAAGCACTAAAATACATTGTA GCAT
ACATAATTACAAATAGCAAGAATTACCTGAAGTACTAAACACTATATTTTTGCTTTT TAAT GTTTTTCATTTACCTGTAAACAAGTATCTTCATTAGCAGACTTCTGCGTTTGATTCTCTG C
TCCCTGTGCTTCAGCTTCAGCATTATTTTCAGAGGCCTCATCATCATTGCTACCGTT TGC
TGACTCAGCTCCAGCTTGAATTTCTTCCTTAGACTTGATTTCGTCTGATAGATGGAT AGA
TTCTGCTTTTACCTCATCTACCTCCGCATTCTGAGGCTCAATCTCTCCATCTTCTCT GAC
AAG CTTTAGAG CTATCTG AAAAG CAAG AACAG GATGTGCAAAGTATCAG AG CATGAAAA
CGATTCAAAGTGTTCGTGTTGGATTTCATGAAATAACCATCAGTTTTCATCAAAAGT TGA
CTAGGCACGAGACTCGGACAATTTCATCATTAGATGTGTTATACAAAAGAGGACAGG TC
ACAGAACCTTGCGATAAATTTTTTCAATCTGCTTCTGCAGATTGCGTACACCGGCTT CTC
TGCAGTAATTTTCTATTAAAGCAAGAAGAGCTGAATCGGTCACTTCAACCTGAAATA AAG
AATTCATGAAGCCAAATAAAATTAGCTTATTGCAAGAAACACTTCAAGTGTAACTTG TGA
AATATTTTGAAAAACAACATCAAGGCATCAATTGAAAATCTTTCTACAACCACATAT ACTA
ACTGCTAGGATAGTTACACGTGTTAAAATATTTAAACCAAACATCAACCATTCAAGG CTT
TGAATTGCAATCAATTGCTAATATAAATCATGCACACTTAATTTTCCAGCTTGATAT CACA
TCTTTTTTTTGATAAGGTGAAGATTTTATTAAAAACAGTATCAAGCTGATACTGTAA AAAT
ACAAGGACACTGCTGGCTTAAAAACATTAAAATCCTAAGCGGTCTAGCATGTCCAGC TT
G ATATCACATCTCTG CATCTG AG AAACAG CCTCAG CATGTT AAAAG CCTG AG AGACAAA
GAGATGACATATACATGGACTCAAATGTTGCATCTTTTCAACTAGATTACATATCAA CTC
GAGTAATTTTACAGGAGCATTAGGGAGCTAGCATTTCCCGAGAAATGGTACCCAAAG CT
CTACCCTAGTCCTGTCATATGTTTGCCATTAGATAAGCATAAAGCCACTACTATGTC ATC
AAAAGTACCAACAGCCTCAAGATACTGAAAAAATTAGAACTGCTCCTATCCGCTCTG GA
GGGCCAAAACAAACCAAAGTAGCCATCAGCCCTCCCACATATCAACTTATTAACTTC TG
AC G C C C AT ATTT AG GGTGGATTTTTTTTTTTTTTTTTTTTTTTTTGGGGGGGGGGGGGGG
GTAGGCACAGTAAATGACACCGCAGATATAGATGAGGTCAAATGTGTCATTTTGATG CC
CTTAATCAGTAAGTATGCAGCTCCCTTTTCTTCACCAGACATCCATATGGCATGTAA CTC
AACTGAAGATCACATTAGAGAGCAAAATATAAGATCCACCGAATTCCATTGATCTCA TCT
TTAAGGCTGTGGGAGAACATATTTACTGCAGTCATCTTATAACTTAAGCTCCCTCAA TAG
ACATTATCAAGTGTTTTTTGGAGTCATCACCACCAATCTTGGCATTGTATGATGATT CCT
TGTACAGTAATAAAACATTCACTACTGTGTACTAGTCTAGTTTTTCTTTTACACGAT GGTC
AAATTGAAGTCAACAACAAAAAAAACAAAACACTTGGCATATTTTAAACATCCATCC TGC
TTTAGTCCATGCCACACTCAAATCTGTTAGACAATACCATCCAGCAACCCCAGCCCA CC
AGGCACATGTATCCCTACCAGTAAGAAACTCATTATACCTCCATCTACCAATAGAGA TTG
GCGACTACCACGACAGATCCACCTTCAATTGCGTTTATATTCCCAGTAAAGTGTGGC AC
AGTG GTTGTGTTAATG ATGCACG AG CACATTG G CTAG CAAAG GCAGTCAC AGTTTG GCT
CGGTCAAGCAAACATTCCTCAAGAGAAAGCAGCACTTGAGAGGATGTATACAGCAAT GT
GGACTCATTCAGTTGTTTATGTGAACCACTTAAATACTTCTTGAAAAGTATAGTGAT GGA
AGAGTTTGTATTAAACTTTATAATCCATTAGCTTTGGAGGTAAGTATATTCCAAATG TAGA
GGCTTAAGATGAAAATGATATCAATACATCAAACAGTATATTTCCTAGGAAATATTA AAG
CATATGGCCAAAAACCTATAATGGGATTTACTATATCCGTGCGATTTTTTTCCTTTA GCC
ATGTTCCATATGGAAGGGTGTAAAGAGCTTGATCCCCTGAACTTTCCAATCATATGT TTT
ATTCCATAAACTTCTAACTGAATAAAGCTTTTCACATTTTACTTTAAGAAGTTACAT TCTT
GCCCAATCATGTTTCAAACATTTTCCCAACAGACCAATGTCTGCTTCTGAAATGTCA AAA
ATTGGGAGCGGAAAAAATACAAAACTATTCACCCGATAATGAATATGCACAGGAGAT GA
TGAAAGAGATGCAACTAGAGACGTTTCTATAAGACATACCTGCTCGGGCTTGATCCC AC
ATGTTTCACGAGTAGCTTTCTCCAAATAATCCCTGGCTATGTGCACTTTCTCATCCG TAA
TGTAACCAGCAATTGAAATTACTTCCATTCTATCCAAAAGAGGATTAGGTATCATTT CTA
CAACATTGGCTGTGCAGACAAACAAAACCTACAGCCGCAAAAAGGAAAATGATCTAC CT
CAAATTATGCACTCAAAAAATGGATAATGTGCAGATGTCATTCTCATATAAAGTACA GTG
G AATAG GTAATTGTATTAAG CAAAG AG G ATAGAAAACACTGAAG GTGTC AAAG CTCCTT
TTTTTCAAATTTTG GAAGTACATTAAAACCTAGAG GTTG G ATTTTATAATTTCTTCAG GTT
CATATATAACTCAACATCCATAACAAGGTTTTGATATAGCCTCCCTCCCCATAACCA AGC
CAGACCCAAAATGCTACTAAAATAATCGAAAGATGCCATCTTCTTTCGACAGAGCCC TTT GGAAATATAACACAAATATTTACCATTATGAAAAAGACATAGTAACACACAGTAAATTAT C
ACATTAAAGGACAAAACAAATGAAATTGTATGCGAATATACCCATCATCCTTCTCTC TCT
AGCCATGTTGTTGATTTGCTGATGTTTGATATAATAAAATGAAATAGTACCGGTAAC TGC
AAAGTCGTCATATTTGAATTATGGGCCAAATTCTTTAATCAATAGGTAAACTAACGC TCA
ACACTCAAAGTCTATCGCATAATTGTAAAGTCGTCATATTTGATTTATGGGCCAAAT TCT
TTAATCAATAGGCAAACTAACGCTCAACACTCAAAGTCTATCGCTCCAATTATCGGC AAC
TTCAAGAGGAATAAAACCATATCCCTCATCAGTAAAGCCATCAAAGATAATATGAAC AGA
CAGTAGGAAATCTCAATGCATCAACCTTTTTCCTATTGAATGATCACCAAAACCAAT GCC
AATGAAACCTAGTTTTTCTCAAGTGTTAGATATAGAAATGTAGTTGTCCCACATTGG AAT
AGGTGTAGTATGCCTTTGTATAGAGTAGCTATAAATAAGCCCATCTTGTATTGCATT AGA
CACACAATATCAATATATCATATTTTCTCCCGTGTCTTCTCACATGGTATCAAAGCA ATC
GTGAGAGATTTATCGTTGTGCATAAATTCCAGCGACTCCGGGAAGGAAAATCAGTTG AC
CGGAAGCCTTTTCCGGCAGGTCTGCCGCAAGTAAAAAAAAAGCCACTTCGTCAGTGT T
GTGCAAAAAAACCAACACCACCACGAAGTAGATCGGGCTCTGGCAACCAACCCATAA A
AAAATCTCCGTCAGAATACCCTCCACGCGCCGTCACTTGCTACCGGAAGAAAATTTT CC
GGCGAAGTTCCGACGTCGCGTGGGCCACCTTCCAGCCATTTTTTGGCGACGACTCTT C
AGGACAAATTATTCCCCTTGCAATTCCGAGCCTACCCATCCAGGTTACACCAAATTC CA
G ACAACTTATATATTTTTTCCAG CATG CATAGTGATTTCAAAAGTG G ACTTCC GG CAATT
TTTTGAAAACGTTTCTTCAGAACAGTTGGGTCATCTGGTAATTCCGATCCTACCCCT ACT
GTTTTTATTTCATTCCGACCACTTTGAATTTTCCCGGCAGCTACAGTACTATTCCGA CTG
CTACAGTAATATTCCGATAGCTACAGTATTTCCTTATTCTGTTTCACTGTTCCTTAC TCTG
TTTTAGTGGATTAAATTTGATTATTTCTATAATTTGGTAATAATTTGCAACGATGTC TATG
GGAATTGATGCTTTTGGGTCTAAAAACATGAGTTCTGGAAGCTCTAGTGTTATGATT ACT
TCAAAACCTTTAATGTG AG GTTCAAACTACTTAG CTTG GG CTTCATCTGTCGAGTTGTG G
TGTAAAGGTGAAGGTGTTCAAGATCATCTAATTAAACAGTCTAGCGAAGGAGATGAA AA
GGCGATAGCGCTTTGGGCAAAGATTGATGCTCAATTATGTAGCATCTTGTGCCGTTC TA
TTGATTCCAAGTTGATGCCTTTGTTTCGTCCATTCCAGACATGTTATTTGGTTTGGG CAA
AGGCTCGTACCTTATACACTAATGACATATCTCGCTTCTATAATGTGATATCACGGA TGA
CAAACTTAAAGAAGCAAGAATTAGATATGTCTACTTAATTGGGTCAAGTACAAGCAA TCA
TG GAG GAATTTGAG ACATTAATG CCAGTTTCTG CTAGTGTG G CAAAACAACAAGAG CAG
CGACAAAAGATGTTCTAGTTCTTACACTCGCTAGACTTCCTAATGATCTTGATTCAG TGC
GAG AC CAG ATTTTG G CTAGTCCG ACTGTTC CCACAGTTGATG AATTATTCTCTC GATTAC
TCCGCCTTGCCGCACCACCAAGTCACCCAGTGATCTCATCACAAATACTTGATTCCT CT
CTCACATCGCAGACGGTGGATGTTCGGGCGTCTCAAACTATGAAGAACAGAGGAGGA C
GAGGTCGTTTTGGGAGATCTAGACCCAAGTGTTCTTATTGTCACAAACTTGGATACA CT
CGTG AAATGTG CTATTCCTTAC ATG GTCGTC CACC CAAAAATCTTAC GTTG CTCAG ACT
GAGACTACATGTAACCAAGGTTTTTCTGTATCTAAAGAAGAATATAATGAGCTCCTT CAG
TATCGAGCAAGTAAGCAGACATCTCCACAAGTAGCCTCAATTGCCCAGACTGATACT CC
AGTTGTTGGTAATTCTTTTGCTTGTGTTTCCCAGTCTAGTACTCTTGGACCATGGGT CAT
GGACTCAGGCGCTTCTGATCACATCTCTGGTAATAAATCACTTTTGTCGAATATTGT ATA
TTCACAGTCTCTTCCCACTGTTACTTTAGCCAAGGGATGTCAAACTAAGGCACAAGG AG
TTGGACAAGCTAACCCATTGTCTTCTATCACCCTAGATTCCGTTCTTTATGTCCTTG GTT
GTCCTTTTAGTCGTGCATCTGTTAGTCGTTTGACTTGTGCCCTCCATTGTGGTATAT ATT
TATTAATGATTCTTTTATTATGCAGGACCGCAGTACGGGACAGACAATTGGTACAGG AC
GTGAATCAGAAGGCCTTTACTACCTTAATTCACTCAGTCCTTCCACAACATGTCTAG TTA
CTGATCCTCCGGACCTAATCCACTGTCGTTTAGGACACCCAAGTTTATCCAAACTTC AG
AAGATGGTGCCTCTTTTAGGACACCCAAGTTTATCCAAACTTCACAGTCTACATTAG ATT
GTAAGTCGTGTCAGCTTGGGAAACATACCTGAGCTTCCTTTCCGCGTAGTGTTGAGA GT
CATGTAGAGTCTGTTTTCTCCTTGGTTCATTCTGATATATGGGGTCCTAGTAGAGTC AGT
TCAACCTTGGGATTTCGTTATTTTGTTAGTTTCATTGATGATTACTCAAGATGTACT TGGC
TTTTCTTAATGAAAGATCGTTCTGAGTTATTCTCTATATTCTAGAATTTTTGTGCTG AAAT AAAAAATAAATTTAGTGTCTCTATTTGCATTTTTCGTAGTGATAATGCCTTAGAATATGT A
TCTTCTCAGTTTCAGCAATTTATGACTTCTCATGGAATTATTCATCAGACATCTTGC CTTA
TACCCCTCAGCAAAATGGGGTTGCAGAGAGAAAGAATAGGCACCTTATTGAGACTGC T
CGTACACTTCTAATTGAATCTCGTGTTCCGTTGTGTTTTTGCGGCGATGTAGTTCTC ACA
GCTTGTTATTTGATTAATAGGATGCCTTCATCTCCCATCAAGGATCAGATTCCGCTT TCA
GTATTGTTTCCCCAGTCAGCCTTATACCCTCTTCCACCTCGTGTTTTTGGGAGCACA TAT
TTTGTTCATAACTTAGCCCCTAGGAAAGATAAGTTAGCTCCTCGTACTCTCAAGTGT ATC
TTCCTTGGCTATTCTCGTGTTCAGAAGGGATATCGTTGTTATTCACTTGATCTCCGT AGG
TATCTTATGTCAGCTGACGTCACATTTTTTGAGTCTAAACCTTTCTTTGCTTCTGCT GAC
CACCATGATATATCTGAGGTCTTACCTATACCGACCTTTGAGGAGTTTCCTATAGCT CCT
CCTCCACCTTCGAACACAGAGGTTTCACCCATACTAACCATTGAGGAGTCTAGTGTT GT
TCCTCCTAGTTCCCCAGTCACAGGAACATCACTCTTGACTTATCATCGTCGTCTGCG CC
CTACATCAGGCCCAACTGGTTCTCGTCCTGCACCTAACCCTGCTCCTACTGCGGACC C
TGCTCCTAGGACACTGATTGCACTTCGAAAAGGTATACGGACCACACTTAACCCTAA TC
CTCATTATGTTGGTTTGAGTTATCATCGTCTGTCATCTCCCCATTATGTTTTTATAT CTTC
TTTGTCCTCGGTTTCCATCTCTAAGTCTACAGGTGAAGCGTTGTCTCATCCAGGATG GC
G ACAG GCTAG GAGTGATG AG ATGTCTGTTTTACATACAAGTG GTACTTG G GAG CTTGTT
CCTCTTCCTTCGGGTAAATCTACTGTTGGCTGTCGTTGGGTTTATGCGGTCAAAGTT GG
TCCCGATGGCCAGATTGATCGACTTAAGGCCCATCTTGTTGCCAAAGGATATACTTA GA
TATTTGGGCTCGATTACAGTGATACCTTCTCTCTTGTGGCTAAAGTGGCATCAGTCC GC
CTTTTTCTATCCATGGCTGCGGTTCGTCATTGGCCCCTCTATCAGCTGGACATTAAG AA
TGTCTTTTTTCACGGTGATCTTGAGGATAAGGTTTATATGGAGCAACCACCTGGTTT TGT
TGCTCAGGGGGAGTCTCGTGGCCTTGTATGTCGCTTGCGTCGGTCACTTTATGGTCT TA
AGCAATCTCCTCGAGCCTGGTTTGGTAAGTTCAGCACGGTTATCCAGGAGTTTGGCA TG
ACTCGTAGTGAAGCTGATCACTCTGTATTTTATCGGCACCCTGCTTCAAGTCTATGT ATT
TATCAGGTAGTCTATGTTGATGATATTGTTATTACTCGCAATGATCAGGATGGTATT ACT
AATCTGAAGAAGCATCTCTTCCAGCATTTTCAAACTAAGGATCTAGGCAGATTGAAG TAC
TTTCTAGGTATTGAGGTTGCTCAATCTAGCTCAGGTATTGTTATTTCTCAAAGGAAA TAT
GCTTTAGACATTCTTGAGAAGATAGGGATGATAGGTTGCAGACCTGTTGATACTCCA AT
GGATCCGAATTCTAAACTTCTGCCAGGACAGGGGGAGCCGCTTAGCGATCCTGCAAG C
TATAGGCGGTTGGTTGGTAAATAAAATTATTTCACAGTGACTAGACCCGACATTTCT TAT
CCTGTGAATGTTGTAAGTCAGTTTATAAATTCTCCCTATGATAGTCATTGGGATGCA GTC
GTCCGCATTATCCGGTATATAGAATCGGCTCCAGGCAAAGGATTACTGTTTGAGGAT CG
AGGTCATGAGCAGATCGTTGGGTACTCAAATGCTGATTGGGCAGGATCACCTTCTGA TA
GACGTTCTACGTCTGGATGTTGTGTTTTAGTAGGAGGAAATTTGGTGTCCTGGAAAA GC
AAG AAACAG AATGTAGTTG CTCG GTCTAGTG CAG AAG CAG AATATCG AG CAATG G CTAT
G GTAAC ATGTG AACTAGTCTG GACCAAACAATTG CTCAAG G AGTTGAAATTTG GTG AAA
TCGGTTAGATGGAACTTGTGGAACTTGTGTGCGATAATCAAGATGCCCTTCATATTG CA
TCAAATCTGGTGTTTCATGAGAGAACTAAACACATTGAGATTGATTGTCACTTCGTA AGA
GAGAAGATACTTTCAGGAGATATTACTACGAAGTTTGTGAGGTCGAATGATCAACTT GC
AGATATTTTCACCAAGTCCTTCACCGATCCTTGCATTGGTTATATATGTAACAAGCT CGG
TACATATGATTTGTATGCTCCGGCTTGAGGGGGAGTGTTAGATATAGATATGTAGTT GC
CCCACATTGGAATAGGTGTAGTATGCCCTTTGTATAGAGTAGCTATAAATAAGCTCA TCT
TGTATTGCATTAGACACACAATATCAATATATCATATTTTCTCCCGTGCCTTCTCAC ATCA
AGTATCAAGAGAACAACATTCTACTTACGCTAATCTTATTACAGTCATAAAGACAGA GAC
AAAGGTAACACAGACGTGAAATAAAATACTTCCAGATAACCTTTCATTATGGACAAT AAA
AGAAACATGTTCAGTCACGAAACTTTTTCCAAACTTAATTAAGGTAAGAGACGAAAG AAA
ACATCAATCAGTAAAGAAAAAGAAGGGGAAATTAAAAGGTGAAAAAGCAAAATGATA TTA
GATATTTGCAAGATTTTTAATATCACATTAGTCCCCACTCATCAAATGATGGTAGAG GGC
ATTTCATAAGGGCCAATCAACAGCCACAACAATTCGATAAGTGCTCCAGGAAACAAC TA
ACCTTTGATAAATCAATAGGAACATCAAGATAATGATCTAAGAAATTTGCATTCTGT TCT GGATCAAGAAGCTCCAACATAGCACTTGCTGGATCACCAGCATGTCCTCTTCCCAACTG
CCATAGAAGAAAAAATACAAAGCTATTATTAAATTTGATCGATAATTATACACATTC TTTA
AACATTTAGATTGCAGAAGAAATCCAAGCAAAAACATAATATGTTAATGCCCAAGAT TCT
TCCGGTGGTCATACACAGAAAAGCAAGATTTAAATAGTGGAAAAGAAGTTACACTTC AC
TCATAGTCAAAGCATATAATTGAAGTATGAACTCACAAACCATAAAATACCTTGTCA ATTT
CATCGATCAAAACAAGAGGATTAGCGGTTCCCACACTTTTTAAACATTGCACCATCT TCC
CCGGCATGGCACCAATATAAGTTCGTCGATGTCCCTTGAAGTGTATCCCAAAGTTAG TC
ATGAAAGTTCAAGATTATGTGCACAACAAAACAACTCTGTTTTAGCTGGTTAGATAT TTC
TGTTACCTTTATTTCAGCAACATCAGACAGCCCTCCAACAGAAAATCGGTAAAATTT GCG
GTTCAATGCACGTGCAATTGAACGACCTATACTGGTTTTGCCCACCCCAGGAGGGCC A
GAGAGGCATATGATTTTCCCTGTTCAAACAAAAATAAGCAATGAGTTTGTTTAGTTC CTG
ACTCAGATTCACAAATTAAACTGAGATAAATCAGATAAAGCCGCACATATGAAAGAC AGT
GCCAGTGTATAATTGCGATTAATATCATCACTTGAATCTAACATTGTCCTAGCTATG GTG
CAATTCACATTTTTCTAGAGTTGCCCTTTGTCTGTTTTTTCCTCAACACTTCATAGT TTTC
AAAGATTTTCCTCAACAAACAACTACATCTCAAGTTCAAATGTCCCAAAGATATACA AGT
ACAACTGGCAGTGAAAGTAAGTTATCAATAAAATACCATAGAGCGAGTGCCCAATCA TG
GACAACACAATATCAGAATTTAAGAAGAACACCAACAAAGTATGGCCCAAGGTGCAG CT
ATTTAACTCACATTAATTGCTTTTCAAGGGAAGAAATTAAGCTCAAGATATATTTCT TTAC
TCCACAGATAATCACAAGAAAACATTGGAGGAACTCTCGACTGTCACAAAAAGTATT AC
CTAAGTTTGACCAATAAATATTTCTCAAATGCCCTAAAATGTTCACTTCTTCCAATT TACT
CGTTCCAATTTGATGTCAATATAGTATGTGGGAACCATAAATCCCATGTTCTATGGA TAT
ATTTTCCATTGGTATATTATAATGTTATGAAAATGGACGGTAAGGAGATGGCTGGAT CAT
TTTCCGTTCTTTTAATATGATTATAGCTGAACCTTATAAAACTGAGGATTTTATTAA AAAT
GAG G GTCATTATTTTTAAAAAATAAG G CATTTACCAAC CTTGTG AGGTTCCTCTG AGTTT
TCCCACAGCTATAAATTCCAAGATCCTTTCCTTAACATCGGTTAACCCATAGTGGTC TTC
ATCAAGAATTTGTTCTGCCCGTAGTACATCAAAGTTTTCATCACTGCATTTTGGATC ATA
AAAG AAG AAAAGTTC AG G AAC C AATG C AATAC AC C C AG G C AG AG ATC AAAACTATAAAG
C C AAC AC C AC C AC CATC AG C AG C AAAC AAC AC G AACTATG AC AAATC AAG C C C C C ATCT
TAACCTACATAAGGAGCTCATCATAAGTACAATCCTAGTTTTGCTGCTCTCATGAGA TCC
TG GTCTAAACTGATACATTG GG ATGTCAAGG AAG CATC CTCAAAG CCAG AAGCATG ACA
TGTAAG AAC C C AAAATAAG G AAC AG GAG GAG AAAG AAAAGTT AAAG AC AAATG GAT ATT
ACTCGATAGAAATTTAGGCAAGTCACATCACTAGGCAGTAAGGTTAACTTGGAAGCC TA
G CAAATTTCAG AC AATAACAATTTTATTTCTCATAACTGTTTAG CCAG CTTCTAAC AAACA
GATTCCTCTAGTACAGCTAACCAAGGATAAGGTTAACATCAGTTGGATTGAACCCAA GA
CAATGTAAGGTAGAATAGATAGACAACAAACCTGTAACTACCCCATGGCAAGGCAGT CA
ACCAATCAAGATAATTACGTGTCACGTTAAATTCACTGGAACTAGCTTCCAACAGTT GCA
GTTTTGTCAGTTCTTCTTCAATAACTTGCATAACATGTACTGGTATTTTTTCTTTAT TAGG
CTCCAATCTTTCCCTGAACTTTGCTGCAGTAATAAACAAATATAGTGTCACACCATT AGA
TGTAATTAAAAAGGGAACAAAAAACTGCAGGTTTCCCTTAGTATTGACCAGAAAAAT CCA
AGCCCTTAAAGCGAACCTAATCATTAGGCAGTTTCTTCTCTCAATTTCAACACCAAA ACC
AGGTCTACCAAAACAATG CTACAATG CG GAG CTCAACTTTATCTTTGTG AAAATTATAGA
GAAAAAATAATGAGAACCGCATTTTAGGGTTTGCAGTAACGTCTATTATCACCAAAA CCT
G GAAGAAG CACCTTCTTAAG CATCACACTCCTCAACAG GAG CAAAGG AGAG AAGAAAG
AGCACATAACAG AAAAAG CAAGTAG GCTTCTAATTAG G AAGTG CACTG AAGG G AAAC AG
GACAGAATTCATGAGAAAAAGAAGACATTGGAGGAGGAAGATAATCTGAATCATGAG CA
GTCCATAACATGAATAAACAACAATGCAGTGCTAAAACGGAAAATGTGGCCTTCCAG TA
AATCCATGTTGAATTTGTGCCACATTACCAAATATGGTAGACAGTAACCTTTCGGAA TTG
AAGTTGAAGGATTCCAGTATTGTCTTATTAAGTGCTTTCGTGAAAACACAAAACAGC CCC
CTATAGGTACAATGTATTCTAATCTGTCAAAAGTTTGGAATGTCTCAAATAGTTTTA GAAA
GCATGTCAATAAAGTTGGTTGGACTGTGTACAAAGAAAATTCAACCTTCAATTTCCT ATA
TGTAAAGCCAATTATCGCTTGAATGCTATTCATGCTTCTAGTGAAGATTTTCAATGG TAA AC AG G AG ATC AATC AG C C AC C ATAAAACTTTTAC AAC AG AAAG G G C AAAAC ATATTG C A
AATG G CCTTTTCTATG CTACTG CTAAATG G AACTTTACCAGG G ACAAC ATAAG ATGTTTC
ACTGGCCAGATGGGAAAACACGTCTGGCCATAATAAATCCAAAAAGAACATTATCCA TT
TCAATAATAATTAAGAGACAACTAACCAGAAAGAGCTGTCTTGTCATCAGTCTCCAA ACC
TAGTTCCTGCATTAAGCCAATCAAAACCACGTCATAAGTACTTTTTTTTCAAATCAA CTCT
TTGTTTAAACTGTGAATCAAACATAAGAACCAGAAGCTTAACATATATATTATCTAG GAAT
AAACATATGTAAGTACTAAGTTTAAGGATGATAACACAAAAAAAGCTGCACATAATC CAC
ATGCCTTCTTTATGGCCTTTAATTGTTCATTCAACAAATAACGGCGTTGCTCTCCGC TTA
TTTTTTCTTCAATTGCTCTTGCTATTGATTCCTGCAGAAAACAAAGACGACGTAAAA TGC
TAAATGCATGCATAACAACTATTCAAAGTTCTGGTATGCGTCTTGACGTGCATTTAC CTG
AATCTTACTAATCTCCAT
SEQ 81 to 160 are the putative protein sequences related to SEQ 1 to 80
SEQ 81
MALRFSLIFLFSLFLTTSLLLSVNGNINGGEDDDILIRQVVGDDDDHLLNADHHFTIFKR RFGKTYA
SDEEHHYRFSVFKANLRRAMRHQKLDPSAVHGVTQFSDLTPAEFRRNFLGVNRRLRL PSDANK
APILPTEDLPSGFDWRDHGAVTSVKNQGSCGSCWSFSTTGALEGATYLSTGKLVSLS EQQLVD
CDHECDPEEKDSCDAGCNGGLMNSAFEYTLKAGGLMREEDYPYTGTDRGTCKFDNTK VAAKV
ANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCPYICSKKLDHGVLLVG YGTGFSPI
RMKEKPYWIIKNSWGEKWGENGYYKICRGRNVCGVDSMVSTVSAVSTSSH
SEQ 82
MGAKVFLVALFLSALLFPLASSSNDGLMRIGLKKMKFDQNNRLAARIESKEGDVLRASIR KYNFR
GKLGDSEDTDIVALKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFSV PCFFHSK
FKSSESSTYKKNGKSAAIQYGSGAISGFFSQDNVKVGDLVVTDQEFIEATREPSVTF LVAKFDGIL
GLGFQEISVGNAVPVWYNMVQQGLIKDPVFSFWLNRNTEEEQGGEIVFGGVDPNHYK GEITYV
PVTHKGYWQFDMGDVLIEGKATGYCESGCSAIADSGTSLLAGPTTIITMINQAIGAS GVASQQCK
SVVEQYGQTIMDLLLAEAHPKKICSQVGVCTFDGNRGVSMGIESVVDEKAGRSTGLQ DGMCSA
CEMAVIWMENQLRQNQTQDRILNYVNELCERLPSPLGESAVDCGKLSSMPTVSFTIG GKVFDLV
PKEYILKVGEGAKAQCISGFTGLDIPPPRGPLWILGDVFMGRYHTVFDYGKLRVGFA EAA
SEQ 83
MGSFLCFSVIVVLLVLQPCLAKKVYIVHMKNHQIPSSFATHHDWYNAQLQSLSSSSTSDE SSLLY
SYDTAYSGFAASLDPHEAELLRQSDDVVGVYEDTVYTLHTTRTPEFLGLNNELGLWA GHSPQEL
NNAAQDVVIGVLDTGVWPESKSYNDFGMPDVPSRWKGECESGSDFDPKVHCNKKLIG ARFFS
KGYQMSASGSFTNQPRQPESPRDQDGHGTHTSSTAAGAPVANASLLGYASGVARGMA PRAR
VATYKVCWPTGCFGSDILAGMERAILDGVDVLSLSLGGGSGPYYRDTIAIGAFSAME KGIVVSCS
AGNSGPAKGSLANTAPWIMTVGAGTIDRDFPAFATLGNGKKITGVSLYSGKGMGKKV VPLVYST
DSSASLCLPGSLDPKMVRGKIVLCDRGTNARVEKGLVVKEAGGVGMILANTAESGEE LVADSHL
LPAVAVGRKLGDFIRQYVKSEKNPAAVLSFGGTVVNVKPSPVVAAFSSRGPNTVTPQ ILKPDVIG
PGVNILAAWSEAIGPTGLEKDTRRTKFNIMSGTSMSCPHISGLAALLKAAHPEWSPS AIKSALMT
TAYVRDTTNSPLRDAEGGQLSTPWAHGSGHVDPHKALSPGLIYDITPEDYIKFLCSL DYELNHIQ
AIVKRPNVTCTKKFADPGQINYPSFSVLFGKSRVVRYTRAVINVGAAGSVYEVTVDA PPSVTVTV
KPSKLVFKRVGERLRYTVTFVSKKGVNMMRKSAFGSISWNNAQNQVRSPVSYSWSQL LD
SEQ 84
MGTKFILFILLFIFLFSSGFVACGGFYSFRNLNSSVSGIEFPNHPSFNAVSSSADSDCNY GVSQKS
KTHSIAQEVDGVDVKNGENEEVSIFGNQKKEAVKFQLRHRSAGKKIEAKDSVFESRA RDLSRIQT
LHTRIVEKKNQNYNSRLAKSNEKHVDKHKPVIAPAAVSLESYELSGKLMATLESGVS LGSGEYF
MDVFVGTPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPHYNPQDSTSFRNISCHDP RCKFVTS
PDPPQLCKSENQTCPYYYWYGDSSNTTGDFALETFTVNLTTTSGSEFRKVENVMFGC GHWNR
GLFHGAAGLLGLGRGPLSFASQLQSLYGHSFSYCLVDRNSNSSVSSKLIFGEDKELL KHPQLNF
TSLVGGKEVETFYYVQIKSVIVGGEVLNIPEETWNLSLEGLGGAIIDSGTTLSYFAD PAYEIIKEAF
VNKVKGYPIVQDFPILNPCYNVSGVKNLEFPSFGIVFGDGAVWNFPVENYFIKLEPE DIVCLAVLG
TPRSALSIIGNYQQQNFHILYDTKRSRLGYAPTRCADA
SEQ 85
MALTLKSLATPLLFGALFILILQVVAEQPISEAKVESAILQESIIKEVNENAKAGWKAAF NPRFSNFT
VSQFKRLLGVKPAREGDLEGIPILTHPKLLELPKEFDARKAWPQCSTIGRILDQGHC GSCWAFGA
VESLSDRFCIHHNLNISLSVNDLLACCGFLCGSGCDGGYPITAWRYFIRRGVVTEEC DPYFDNE
GCSHPGCEPGYPTPKCQRKCVKEILLWGKSKHYGVNAYRIHHDPNSIMTEIYKNGPV EVSFTVY
EDFAHYKSGVYKHVTGQSMGGHAVKLIGWGTSEQGEDYWLIANSWNRGWGDDGYFKI RRGT
NECGIEHNVVAGLPSAKNLNVELDDVSNAFLDASM
SEQ 86
TLVLHTSFYLLLSVASPGDCLLLSIFPFSFSSPRYFPYKQNTVKIISSNFLFSPFFQMGS FLCFSVIV
LFLVFQPCFSKKVYIVHMKNHQIPSSFATHHDWYNAQLQSLSSSSTSDESSLLYSYD TAYSGFA
ASLDPHEAELLRQSDDVVGVYEDTVYTLHTTRTPEFLGLNNELGLWAGHSPQELNNA AQDVVIG
VLDTGVWPESKSFNDFGMPNVPSRWKGECESGPDFDPKVHCNKKLIGARFFSKGYQM SASGS
FTNQPRQPESPRDQDGHGTHTSSTAAGAPVANASLLGYASGVARGMAPRARVATYKV CWPTG
CFGSDILAGMERAILDGVDVLSLSLGGGSGPYYHDTIAIGAFSAMEKGIVVSCSAGN SGPAKASL
ANTAPWIMTVGAGTIDRDFPAFATLGNGKKITGVSLYSGKGMGKKVVPLVYSTDSSA SLCLPGS
LDPKIVRGKIVLCDRGTNARVEKGLVVKEAGGVGMILANTAESGEELVADSHLLPAV AVGRKLG
DFIRQYVKSEKNPAAVLSFGGTVVNVKPSPVVAAFSSRGPNTVTPQILKPDVIGPGV NILAAWSE
AIGPTGLEKDTRRTKFNIMSGTSMSCPHISGLAALLKAAHPEWSPSAIKSALMTTAY VHDTTNSP
LRDAEGGQLSTPFAHGSGHVDPHKALSPGLIYDITPEDYIKFLCSLDYELNHIQAIV KRPNVTCAK
KFADPGQINYPSFSVLFGKSRVVRYTRAVTNVAAAGSVYEVVVDAPPSVLVTVKPSK LVFKRVG
ERLRYTVTFVSNKGVNMMRKSAFGSISWNNAQNQVRSPVSYSWSQLLD
SEQ 87
MASSCLHAILLCFLLFITSTTAQNQTSFRPKGLILPITKDASTLQYLTQIHQRTPLVPVS LTLDLGGQ
FLWLDCDQGYVSSSYKPARCRSAQCSLAGAGSGCGQCFSPPKPGCNNNTCSLLPDNT ITRTAT
SGELASDTVQVQSSNGKNPGRNVTDKDFLFVCGATFLLEGLASGVKGMAGLGRTIIS LPSQFSA
EFSFPRKFAVCLSSSTNSKGVVLFGDGPYSFLPNREFSNNDFSYTPLFINPVSTASA FSSGEPSS
EYFIGVKSIKINQKVVPINTTLLSIDNQGVGGTKISTVNPYTILETSIYNAVTNFFV KELVNITRVASV
APFGACFDSRNIVSTRVGPAVPSIDLVLQNENVFWRIFGANSMVQVSENVLCLGFVD GGVNPRT
SIVIGGYTIENNLLQFDLAGSRLGFTSSILSRLTTCANFNFTSIT
SEQ 88
MNPEKFTHKTNEALAGAHELALSAGHAQFTPLHMAVALISDHNGIFRQAIVNAGGNEEVA NSVE
RVLNQAMKKLPSQTPAPDEIPPSTSLIKVLRRAQSSQKSCGDSHLAVDQLILGLLED SQIGDLLKE
AGVSASRVKSEVEKLRGKEGRKVESASGDTTFQALKTYGRDLVEQAGKLDPVIGRDE EIRRVVR
ILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLADVRLIALDMGALVAGA KYRGEFEE
RLKAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGAT TLEEYRKYV
EKDAAFERRFQQVYVAEPSVTDTISILRGLKERYEGHHGVKIQDRALVVAAQLSSRY ITGRHLPD
KAIDLVDEACANVRVQLDSQPEEIDNLERKRIQLEVELHALEKEKDKASKARLVEVR KELDDLRD
KLQPLMMRYKKEKERIDELRRLKQKRDELIYALQEAERRYDLARAADLRYGAIQEVE TAIANLES
TSAESTMLTETVGPDQIAEVVSRWTGIPVSRLGQNEKEKLIGLGDRLHQRVVGQDHA VRAVAEA
VLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDDKLMIRIDMSEYMEQ HSVARLI
GAPPGYVGHDEGGQLTEAVRRRPYSVVLFDEVEKAHPTVFNTLLQVLDDGRLTDGQG RTVDFT
NTVIIMTSNLGAEYLLSGLMGKCTMETAREMVMQEVRKQFKPELLNRLDEIVVFDPL SHEQLRQ
VCRYQMKDVALRLAERGIALGVTEAALDVILSESYDPVYGARPIRRWLERKVVTELS KMLVKEEI
DENSTVYIDAGVGRKDLTYRVEKNGGLVNAATGQKSDILIQLPNGPRSDAVQAVKKM RIEEIEED
EMED
SEQ 89
MQSFKSASILRRLLQNSRLVSHSRSFCSVSTNALVDESQSTVLVEGKASSRTAILNRPHA LNALN
FSVVDRLLKLYKNWEDDPDIGFVVLKGSGKAFSAGGDIVTIYNLLKQDAGNLQDCKD FCWTINNL
VYVVGTLLKPHVALLNGITMGGGAGISIPGTFRVATEKTVFATPETLIGYHPDAGAS FYLSHLPGY
LGEYLALTGDKINGAEMISCGLATHYLHSAKLPLIEEQLGKLMTDDPSVIERSLENC GEIVHPDPT
SVLHRIETLNKCFSHDTVEEIIDALESEAAKKQDAWCVSTLRKLQETAPLSLKVSLR SIREGRHQT LDQCLIREYRMSVQAFSGQITNDFCEGVRARLVDRDFAPKWDPPSLDKVTDDMVDQYFSR LTA FEPELELPTQQREAFT
SEQ 90
MALTLKSLATPLLLGAFFILVLQVVAEKPISEAKVESAILKESIIKEVNENAKAGWKAAF NPQFSNF
TVSQFKRLLGVKPAREGDLEGIPLLTHPKLSELPKEFDARKAWPQCSTIGRILDQGH CGSCWAF
GAVESLSDRFCIHHNLNISLSVNDLLACCGFLCGSGCDGGYPISAWRYFIRRGVVTE ECDPYFD
NEGCSHPGCEPGYPTPKCQRKCVKENLLWGKSKHYGVNAYRIHRDPYSIMTEIYKNG PVEVSF
TVYEDFAHYKSGVYKHVTGQSMGGHAVKLIGWGTSEQGEDYWLIANSWNRGWGDDGY FKIRR
GTNECGIEHNVVAGLPSAKNLNVELDDVSDAFLDASM
SEQ 91
MGVLKKTLLLLFLCVFLGDISLCFSSKLYVVYMGSKDSDEHPDEILRQNHQMLTAIHKGS IEQAKT
SHVYSYRHGFKGFAAKLTEAQASEISKMPGVVSVFPNTKRSLHTTHSWDFMGLSDDE TMEIPG
FSTKNQINVIIGFIDTGIWPESPSFSDTNMPPVPAGWKGQCQSGEAFNASICNRKII GARYYMSG
YEAEEENGKTMFYKSARDSSGHGSHTASTAAGRYVANMNYKGLANGGARGGAPMARI AVYKT
CWSSGCYDVDLLAAFDDAIRDGVHVISLSLGPDAPQGDYFNDAISVGSYHAVSRGIL VVASVGN
EGSTGSATNLAPWMITVAASSTDRDFTSDILLGNGVRLKGESLSLSQMNTSTRIIPA SEAYAGYF
TPYQSSYCLDSSLNRTKAKGKVLVCLHAGSSSESKMEKSIIVKEAGGVGMILIDDAD KGVAIPFVI
PAATVGKKIGNKILAYINNTRLPMARILSARTVLGAQPAPRVAAFSSRGPNSVTPEI LKPDIAAPGL
NILAAWSPAASTKLNFNVLSGTSMACPHITGVVALLKAVHPSWSPSAIKSAIMTTAK LSDKHHKPII
VDPEGKRATPFDFGSGFVNPTNVLDPGLIYDAQPADYRAFLCSIGYDEKSLHLITRD NSTCDQTF
ASPNGLNYPSITIPNLRSTYSVTRTVTNVGKARSIYKAVVYAPTGVNVTVVPRRLAF TRYYQKMN
FTVNFKVAAPTQGYVFGSLTWRNKRTSVTSPLVVRVAHSNMGMMV
SEQ 92
MGAKAFLVAMFLSALLFPFASSSNDGLMRIGLKKMKFDQNNRLAARIESKEGDVLRGSIR KYNF
RGKLGDFEDTDIVALKNYMDAQYFGEIGVGTPPQKFTVIFDTGSSNLWVPSSKCYFS VPCFFHS
KYKSSESSTYKKNGKSAAIQYGSGAISGFFSQDNVKVGDLVVTDQEFIEATREPSVT FLVAKFDG
ILGLGFQEISVGNAVPVWYNMVKQGLIKDPVFSFWLNRNTEEEQGGEIVFGGVDPNH YKGEITY
VPVTQKGYWQFDMGDVLIDGKATGYCESGCSAIADSGTSLLAGPTAIITMINQAIGA SGVASQQC
KSVVEQYGQTIMDLLLAEAHPKKICSQVGVCTFDGNRGVSMGIDSVVDEKAGRSTGL QDGMCS
ACEMAVIWMANQLRQNQTQDRILNYVNELCERLPSPLGESAVDCGKLSSMPKVSFTI GGKVFDL
SPNEYILKVGEGAKAQCISGFTGLDIPPPRGPLWILGDIFMGRYHTVFDYGKLRVGF AEAA
SEQ 93
MTFFRSFLFFLLTLFVISSALDMSIISYDEQHGQMGTTHHRTDDEVRELYESWLVKHGKN YNAIG
EKERRFEIFNDNLRFIDEHNAENRSYKLGLNRFSDLTNEEYRAMFVGGRLDRKTRLM KSPKSNR
YAFQAGEKLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVTGELISLS EQELVD
CDRSYNQGCNGGLMDYAFDFIKNNGGIDTEDDYPYHAQDGTCDPYRKNARVVSIEGY EDVPEN
DEKSLMKAVANQPVSVAIEGGGRAFQHYSSGVFTGYCGTQLDHGVVVVGYGTENGED YWIVR
NSWGANWGESGYIKLQRNFANSTTGKCGIAMQASYPLKSGANPPNPGPSPPTPVTPS TVCDEY
YSCPQGTTCCCIYQYGEYCFGWGCCPYESATCCDDNYSCCPHDYPVCDVDAGTCLMS KDNPL
KVKALKRGPARVNWSGMKSNRKVSYV
SEQ 94
MANSYTSFNFFLAPIIFLAILGLQLQSSDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGS VEYYSA
WTQRDRFIKGRRLADTTNPTPLSFSGGNETFRLSSLGFLHYANVTVGTPGLSFLVAL DTGSDLF
WLPCDCSNCVRALETRSGRRINLNIYSPNTSSTGQIVPCNGTLCGQRRRCLSSQNAC AYGVAYL
SNNTSSSGVLVEDILHLETDNAQQKSVEAPIALGCGIRQTGAFLSGAAPNGLFGLGL ESISVPSM
LASKGLAANSFSMCFGPDGIGRIVFGDKGSPDQGETPLNLDQLHPTYNISLTGITVG NKITDVDFT
AIFDSGTSFTYLNDPAYKVITENFDSQAKQLRIQPDGEIPFEYCYGLSANQTTFEVP DLNLTMKG
GNQFFLFDPIIMLSLQDGSRAFCLAVVKSGDVNIIGQNFMTGYRVVFDREKMVLGWK PSDCYDS
RESNDKSTTLPVNKRNSTEAPSPSSVVPEATKGNGSGNEPATSFPSVPSSRPAINHA PAHFNSY
ICQLMMALFSLFSYYLIIVSS
SEQ 95
MVTKFSIFILVVLLRLFSFGSVASREIHNSGLNLNSSASGIEFPQHPSFNSVTASGNSDC SYGTSK
KSTTTHVITQEENRSDEKEDEDLMVSKNQPREAVKFHLRHRSAGQNIEAKDSIFEST TRDLGRIQ
TLHTRIVEKKNQNSISRQTKNSEKPTQSSSFEFSGKLMATLESGVSHGSGEYFMDVF VGTPPKH
FSLILDTGSDLNWIQSVPCYDCFEQNGPHYDPKDSISFKNISCHDPRCHLVSSPDPP QPCKSEN
QTCPYYYWYGDSSNTTGDFALETFTVNLTTPSGDSEIKKVENVMFGCGHWNRGLFHG AAGLLG
LGRGPLSFSSQLQSLYGHSFSYCLVNRNSNSSVSSKLIFGEDKELLKHANLNFTSLV GGKENHL
ETFYYVQIKSVIAGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIKQA FVNKVKHYPV
LEDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWNFPVENYFIKLEPEDIVCLAMLG TPHSAMSIIG
NYQQQNFHI LYDTKRSRLG FAPTRCADA
SEQ 96
MPSSFSLLFLTLLLASISLSFSSTLNSNDDDFFLSSTPKFPLTMAEKLIRQLNLFPKHDI NKAAATG
DSAAVTEQRLFEKKLNLSYVGNSGSTVQDLGHHAGYYRLPHTKDARMFYFFFESRSR KNDPVVI
WLTGGPGCSSELAVFYENGPFKIADNMSLVWNDFGWDKVSNLIYVDQPTGTGFSYSS NDDDIR
HDERGVSNDLYDFLQAFFKAHPQYAKNDFYITGESYAGHYIPAFASRVHQGNKNKEG IYVNLKG
FAIGNGLTDPEIQYKAYTDYALDMKLIKKSDYNAIEKSYPKCQLAIKLCGKDGGTAC MAAYLVCTS
IFNKIMDIAGDKNYYDVRKRCEGDLCYDFSKMETFLNDQQVKKALGVGDIEFVSCSS EVYQAMQ
LDWMRNLELGIPSLLEDGIKLLVYAGEYDLICNWLGNSRWVHAMKWTGQKAFGKATQ VSFAVD
GVEKGVQKNYGPLTFLKVHDAGHMVPMDQPKAAMEMLQRWMQDKLSKEGHLAPM
SEQ 97
MTLTLKSLAAPLFLGAFCILILQVVAEKPISEAKVESAILQESIIKEVNENAKAGWKAAF NPRFSNFT VSQFKRLLGVKPAREGDLEGIPILTHPKLLELPKEFDARKAWPQCSTIGRILDQGHCGSC WAFGA VESLSDRFCIHHNLNISLSVNDLLACCGFLRGSGCDGGYPISAWRYFIRRGVVTEECDPY FDNE GFHTRVVNQDIPPQSVV
SEQ 98
MFRLVMVTKFSIFILVVLLRLFSFGFVASREIHNFGINLNFSASGIEFPQHPSFNSVTAS GNSDCSY
GTSKKSTTTHVITQEENNSDEKEDEDLMVSENQPREAVKFHLRHRSAGQNIEAKDSI FESTTRDL
GRIQTLHTRIVEKKNQNFISRQTKNSEKTTQSSSFEFSGKLMATLESGVSHGSGEYF MDVFVGT
PPKHFSLILDTGSDLNWIQSVPCYDCFEQNGPHYDPKDSISFKNISCDDPRCHLVSS PDPPQPC
KSENQTCPYYYWYGDSSNTTGDFALETFTVNLTTPNGDSEIKKVENVMFGCGHWNRG LFHGA
AGLLGLGRGPLSFSSQLQSLYGHSFSYCLVNRNSNSSVSSKLIFGEDKELLKHLNLN FTSLVGGK
ENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEI IKQAFVNKVK
RYPILDDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIKLEPEDIVCL AILGTPHSAM
SIIGNYQQQNFHILYDTKRSRLGFAPRRCADA
SEQ 99
MSGFRLPLLFHLLLPLTLFLQYVQSLPQNSSTVEFLPGFDGPLPFYLETGYIGVGKSEEV QLFYYF
VKSESNPKKDPLLLWLTGGPGCSSFTGVAYEVGPLAFGQKAYNGSLPILVSTPYSWT KFASILFL
EQPVNTGFSYATTSAASKCTDLQACDQVYEFLLKWFNNHPEFISNPFYVSGDSYSGI TVPVIVQL
ISDGIEAGKKPLINLKGYSLGNPLTFPEESNYQIPFCHGMGLISNELYESLKETCKG DCRNIDPTN
KLCLENFKMFKKLVSSINDQQILEPFCGTDSESPNPRQLSGERRSLEEDFIFLKHDD FICRESRVA
TRKLSNHWANDPSVQEALHVRKGTIRRAWARCRQSIMGTTYRVTFMNSIPYHVNLSS KGYRSLI
YSGDHDMVVPFQSTQAWIKYLNYSIIDDWRPWTIDGQVAGYTRSFSNHMTYATVKGG GHTAPE
YKREESFHMFKRWIAQQPL
SEQ 100
METNGLIKEILPRDAVNNMTRLILSNALYFKGEWNEKFDVSETKDHDFHLLNGGSIQAPF MTSKK KQYIAAFDCFKILRLPYKQGTDTRRFCMYFILPDAHDGLPALLEKISLEPGFLNNHVPYG KVRARK FLIPKFKITFGFEASNILKGLGLTLPFCGGSLTEMVDSPMPQNLSVSQVFHKSFIEVNEE GTEAAA VTATVIMTMSLIIEKEMDFVADHPFLFLIRDESTGAVLFIGSVMNPLAG
SEQ 101
MNESYGNSRASSSSTTSSLNSSSHGTEDDHTIARILAEEEENALKYGGNKLGRRLSHLDS IPHTP RVIGEIPDPNDATLDHGRLSSRLATYGLAEMQIEGDGNCQFRALSDQLYHNPEYHKHVRK EVVK QLKRFRKLYEGYVPMRYKSYLRKMKRLGEWGDHVTLQAAADRFGVKICLVTSFRDNGYID ILPK DIQPSRELWLSFWSEVHYNSLYEIGEVPARVRRKKHWLFF
SEQ 102
MSWLCPSLVLVLLIFQGPICTCSSISDLFESWCQQNGKTYSSEQERVYRLEVFEENYAYI IEHNS
KGNSTYTLNLNAFSDLTHHEFKNSFLGLSSSANDFIRLKTGSSSAGVFNDVGVVDIP SSLDWREK
GAVTKVKNQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDKSYNDGCGGG LMDYAFE
FVKKNGGIDTEEDYPFNEREGTCNKNKLQRRVVTIDGYTDVPQYDEDKLLKAVANQP VSVGICG
SERAFQSYSKGIFTGPCSTVLDHAVLIVGYGSENGVDYWIIKNSWGTSWGINGYMHM QRNSGN
QEGICGINKLASYPTKSSPNPPSPPSPGPSKCSMFTSCGQGETCCCGWRLLGVCVSW KCCGL
DSAVCCKDGRHCCPHDYPICDTSRNLCLKRMSNATIVQQPQKEAFSGKFGGLIYPF
SEQ 103
MCEPESEAARGVLSFLDVDQLFSSNYYGDGRKHDVEICHEQYARENHYHTSYCNVDNDEA IAH
VLQEDLSELSIAEDAESSHADEQYLQASTGVQHWHTPPREYYAGHDTSLEADDVGPS SSCSSP
GDRSYDGEEYTYTLEIQDEFELDGEVGKRINQLSAVPHVPRINGDIPSVDEATSDHQ RLLNRLQL
FDLVEHKVQGDGNCQFRALSDQFYRTPEHHKFVRQQVVSQFQHHPEMYEGYVPMEYG EYLTR
MSKSGEWGDHVTLQAAADSYGVKILVITSFKDTCYIEILPKNQKSNRVIYLSFWAEV HYNSIYPQ
GDFLPFDFKKKKKKWSFWNKH
SEQ 104
MPSLLQIFLPLFPFFFLVSFSVSHGPFLPKAIILPVNKDLSTFQYVTQVYMGAHLVPTNL VVDLGG
SFLWTNCGLTSVSSSQKLVPCNSLKCSMAKPNGCTNKICGVQSENPFTKVAATGELA EDMFAV
EFIDELKTGSIASIHEFLFSCASTTLLQGLARGAKGMLGLGNSRIALPSQLSDTFGF QRKFALCLS
SSNGAIISGESPYLSLLGHDVSRSMLYTPLISSKDGVSEEYYINVKSIKINGKKLSL NTSLFAMDEG
VGGTKISTIPPFTTMKSSIYKSFIEAYEKFAISMELNKVEAIAPFELCFSTKGIDVT KVGPNVPTTDL
VLQSEMVKWRIYGRNSMVKVSDEVMCLGFLNGGVNQKASIVIGGYQLEDNLLEFNLG TSMLGF
TSSLSMAETSCSDFMFHSVSKDSAFDS
SEQ 105
MGAKEVLILVLVCMFIVFPSCHGDDECLNPFLVDQNCYVKDYITKLANATETVKWMMKIR RQIHE
NPELAYEEFKTSGLIREELDRMGVKYRWPVAKTGVVATIGSGKPPFVALRADMDALP IQELAKW
EHKSKVDGKMHACAHDAHTAMLLGAAKILQQLRHNLQGTVVLIFQPAEERGHGAKDM IEEGVLE
NVEAIFGMHLVHKYESGVVASRPGEFLAGCGSFKATIRGKGGHAAVPHDSVDPILAA STSVISLQ
SIVSRETDPLESQVVSVAMIEGGHAFNIIPELATISGTYRAFSKKSFYGLRKRIEEV IRAQAAVHRC
TVEIDFDGRENPTLPPTINDERIYEHARKVSKMIVGEESFKIAPSFMGSEDFAVFLE KVPGSFFLL
GTKNEKIGAIYPPHNPHFIIDEDVLPIGAAIHATFAYSYLLNSTNKFTSHSS
SEQ 106
MKLNPYSWTKVASIIFLDLPVGTGFSYARTPTALQSSDLQASDQAYEFLYKWFLDHPEFL KNPLY
VGGDSYSGMVVPIITQIIATKNEMGIKPFVDLQGYLLGNPSTFKGEKNYEIPFAYGM GLISDELYE
SLTRNCKGEYQNTDPSNTQCLQDVHTFQELLKRINNPHILEPKCQFASPKPHLLFGQ RRSLNVK
FHQLNNPQQLPALKCRNDWYKLSSHWADDGQVREALHIRKGTIGKWVRCASLQYQKT IMSSIP
YHANLSAKGYRSLIYSGDHDKVVTFLSTQAWIKSLNYSIVDDWRPWIVDNQVAGYTR SYSNRMT
FATVKGAGHTAPEYKPRECLAMLKRLMSYKPL
SEQ 107
MCEPESEATRGVLSFLDVDQLFSSNYYGDGRKHDVEICHEQYARENQYHTSYCNVDSDEA IAH
LLQEELSELSIAEDAESSHADEQYFQASTGVQHWHTPPREYYAGHDTGLEADDVGPS SSCSSP
GDRSYDGEEYTYTLEIQDEFELDGEVGKRINQLSAVPHVPRINGDIPSVDEATSDHQ RLLDRLQL
FDLVEHKVQGDGNCQFRALSDQFYRTPEHHKFVRQQVVSQLKHHPEMYEGYVPMEYG EYLKR
MSKSGEWGDHVTLQAAADSYGVKILVITSFKDTCYIEILPKNQKSNRVIYLSFWAEV HYNSIYPQ
GDFLPFDLKKKKKKWSFWNKH
SEQ 108
MPSLLQIFLPLFPFFFFVSFSVSHGPFLPKAIILPVNKDLSTFQYVTQVYMGAHLVPTNL VVDLGG
SFLWTNCGLTSVSSSQKLVPCNSLKCSMAKPNGCTNKICGVQSENPFTKVAATGELA EDMFAV
EFIDELKTGSIASIHEFLFSCASTTLLQGLARGAKGMLGLGNSRIALPSQLSDTFGF QRKFALCLS
SSNGAIISGESPYLSLLGHDVSRSMLYTPLISSKNGVSEEYYINVKSIKINGNKLSL NISLFTMDEE
GVGGTKISTISPFTSMKSSIYRTFMEAYEKIAISVNLTKVESIAPFELCFSTEGIDV TKVGPNVPTM
DLVLQSEMVKWRIYGRNSMVKVSDEVMCWGFLDGGVNQKASIVIGGYQLENNLLEFN LGTSML
GFTSSLSTAETSCSDFMIHSVSKDSAFDS
SEQ 109
MKMSPALSLSVIQFPLCKSQDLSKDTNNPKIFSKETPCQKSYSDTRINRRKLLSGSGLSL VAGTL
AKPARAETEAPIEATSSRMSYSRFLEYLNEGAVKKVDFFESSAVAEIFNPALNKVQR VKVQLPGL
PPELVRKLREKDVDFAAHLPEMNVIGPLLDLLGNLAFPLILLGSLLLRTSSSNTPGG PNLPFGLGR
SKAKFQMEPNTGVTFDDVAGVDDAKQDFQEIVEFLKTPEKFAAVGAKIPKGVLLVGP PGTGKTL
LAKAIAGEAEVPFLSLSGSEFVEMFVGVGASRVRDLFNKAKENSPCLVFIDEIDAVG RQRGTGIG
GGNDEREQTLNQLLTEMDGFTGNTGVIVIAATNRPEILDQALLRPGRFDRQVSVGLP DIRGREEI
LKVHSNNKKLDKDVSLSVIAMRTPGFSGADLANLMNEAAILAGRRGKDKITSKEIDD SIDRIVAGM
EGTKMTDGKNKILVAYHEVGHGVCATLTPGHDAVQKVTLIPRGQARGLTWFIPGEDP TLISKQQ
LFARIVGSLGGRAAEEIIFGEAEITTGAAGDLQQITQIARQMVTMFGMSEIGPWALT DPATQSGD
VVLRMLARNQMSEKLAEDIDASVRHIIERAYEIAKNHIRNNREAIDKLVDVLLEKET LTGDEFRAIL
SEFTNIPSANINSKPIRELIEA
SEQ 110
MEGLHQLKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKG EDGV CNKKKSALSAAKITGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCS TWLN HAVTAVGYGATTDGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA
SEQ 111
MGCRMKFLNVVLVVAAVMAAAAAVAFGAEKLPAGVLSLERIFPLNGKMELEEVRARDRAR HAR
MLQSFAGGIVNFPVVGSSDPYLVGLYFTKVRLGTPPREYNVQIDTGSDILWVTCSSC DDCPRTS
GLGVELNFYDATISSTASPISCADQVCASIVQTASAECSTETNQCGYSFQYGDGSGT TGHYVAD
LLYFDTVLGTSLIANSSAPIIFGCSTSQSGDLTKTDRAIDGIFGFGQQGLSVISQLS SHRITPKVFS
HCLKGEGNGGGILVLGEILDPRIVYSPLVPSQAHYNVYLQSIAVNGQLVPVDPSVFA TSGNRGTI
VDSGTTLAYIATEAYDPFVNAITAAVSPSVRPIISRGKPCFLVSSSIAEIFPPVSLN FDGGASMALR
PSDYLVHMGFVEGAAMWCIGFEKQDQGVTILGDLVLKDKIFVYDLARQRIGWADYDC SSSVNVS
ITSGKDEFINAGQLSVNRASGSLLFNPRHTRTIFHLLSLVLMIGSPFLT
SEQ 112
MTRASIILLLLLIATSIAAAQGGALTFDDDNPIRQVVVSDGLQELENGILQLIGQTRRAL SFVRFVR
RYGKRYDSVEEIKQRFEIYLDNLKMIRSHNKQRLSYKLGVNEFTDLTWDEFRRERLG APQNCSA
TTKSDLQLTNVNLPETKDWREAGIVSPVKKQGKCGSCWTFSTTGALEAAYAQAFGKN ISLSEQQ
LLDCAGAFNNFGCHGGLPSQAFEYIKYSGGLDTEEEYPYAGKAGVCKFSSENVAVKV VDSVNIT
KGAEDELKYAIAFIRPVSVAYQVVKGFKQYKGGIYSSTVCGNTPQDVNHAVLAVGYG VDNGTPY
WLIKNSWGAEWGDNGYFKMEMGKNMCGIATCASYPIVA
SEQ 113
MNPEKFTHKTNEALAEAHELAISAGHAQFTPLHMALALISDHNGIFRQAIVNAAGSEETA NSVER
VFKQAMKKIPSQTPAPDQIPPSTSLIKVLRRAQSLQKSRRDTHLAVDQLILGLLEDS QIGDLLKEA
GIGAARVKSEVEKLRGKDGKKVESASGDTNFQALKTYGRDLVEQAGKLDPVIGRDEE IRRVIRIL
SRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLSDVRLIALDMGALIAGAKY RGEFEERL
KAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTL EEYRKYVEK
DAAFERRFQQVYVAEPSVPDTISILRGLKEKYEGHHGVKIQDRALVVAAQLSARYIT GRHLPDKAI
DLVDEACANVRVQLDSQPEEIDNLERKRIQLEVELHALEKEKDKASKARLVEVRKEL DDLRDKLQ
PLTMRYKKEKERIDELRRLKQKRDELTYALQEAERRYDLARAADLRYGAIQEVEAAI ANLESSTD
ESTMLTETVGPDQIAEVVSRWTGIPVSRLGQNEKDKLIGLANRLHQRVVGQDDAVRA VAEAVLR
SRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDDKLMVRIDMSEYMEQHSV ARLIGAP
PGYVGHEEGGQLTEAVRRRPYSVVLFDEVEKAHPTVFNTLLQVLDDGRLTDGQGRTV DFTNTVI
IMTSNLGAEYLLSGLMGKCTMEKARDMVMQEVRKQFKPELLNRLDEIVVFDPLSHEQ LRQVCR HQLKDVASRLAERGIALGVTEAALDVILAQSYDPVYGARPIRRWLEKKVVTELSKMLVKE EIDEN STVYVDAASSGKDLSYRVEKNGGLVNAATGKKSDILIQLPNGVRSDAAQAVKKMKIEEIV DE
SEQ 114
MPEAPKKSFFTLSLVPFLPVYTLIRFNPPIESEPLISSSSDECQHDQKQQSDSRNYIVRF YHYKEP
EDHWNYLQNNLKFKGWQWIERKNPAARFPTDFGLVEIDESMKELLLEKFRKMNLVKD VSLDLS
YQRIVLEEKSEKNGAFANGKKRPGKIFTAMSFSEGQNYAVANTSIMRISWSRHLLMQ KSRVTSL
FGAHELWSKGHTGAKVKMAIFDTGIRADHPHFRNIKERTNWTNEDTLNDNVGHGTFV AGVIAG
QDEECLGFAPDAEIYAFHVFTDAQVSYTSWFLDAFNYAIATNMDVLNLSIGGPDYLD LPFVEKVW
ELTANNIIMVSAIGNDGPLYGTLNNPADQSDVIGVGAIDQSNHLASFSSRGMSTWEI PHGYGRVK
PDIVAYGREIMGSKISTRCKRLSGTSVASPVVTGIVCLLVSIIPESK
SEQ 115
MAQMKLSLSLFLSLVLLLAFSPSSFAKVSISSKLASKQAEKLIHELNLFPKESDNIVDRD PFPTAAS
RIVEKRFNFANLTNSSVISFEDLGHHAGYYKIKHSHAARLFYFFFESRGSKDDPVVI WLSGGPGC
SSELALFYENGPFSISNNLSLVRNEYGWDKVSNLIYVDQPTGTGFSYSSDRHDIRHS EAGVSDD
LYDFLQAFFEEHPELVKNDFYITGESYAGHYIPAFAARVHKGNKAKEGIHINLKGFA IGNGLTDPKI
QYAAYTDYALDMGLISKSDHDRINKILPVCEVAINLCGTDGKISCLAAYFVCNSIFS AVRARAGADI
NHYDIRKKCVGALCYDFSNMEKLLNMHSVKQALGVEDIEFVSCSTTVYQAMLVDWMR NLEAGIP
TLLEDGIKLLVYAGEYDLICNWLGNSRWVQAMEWSGQKEFVASPDVPFEVDSSEAGL LKSHGP
LSFLKVHDAGHMVPMDQPKVALEMLKRWIGGTLSQQTTETEDLVASI
SEQ 116
MAIHTSTLSISILVMLMFSAVTSSAEDMSIISYNEKHHTNGESTVWRTDDEIVSLYESWL VEHKKV
YNALGEKDKRFQIFKDNLRYIDEQNSAPEKSYKLGLTQFADLTNEEYKSIYLGTKPD GRSRLSYT
QSDRYAPKVGDSLPDSVDWRKKGVLVDVKNQGQCGSCWAFSAVASIEAVNKIMTGNL ISLSEQ
ELVDCDTADNQGCQGGLMDDAFKFVIQNGGIDTEEDYPYKAKDGKCDQARKNAKVVT IDGYED
VPANDEKALKKAVAGQPVSVAIEAGGKDFQHYKSGIFTGKCGAAVDHGVVAVGYGSE NGMDY
WIVRNSWGASWGENGYLRMQRNIGNPKGLCGIATIASYPVKTGQNPPKPAPSPPSPV KPPTQC
DDYNECPAGTTCCCVYKYYNYCFAWGCCPMEGATCCKDHNSCCPHDYPVCNVKAGTC SISKN
NPLGVKAMQHILAKPIGTFGNEGKKTPSS
SEQ 117
MACNRLHTELGNWQVNPPSGFNLEPSDYLQRWLIEVNGAPGTLYANETYQLQAEFPEHYP IKA
PQVIFLPPAPLHPDIYRDGHICLDILYDSWSPTMTVSSICISILSMLSSSTVKFPSS EMMDVPLILSK
HVFFSKFKADEDESNNANMVFSPVSIQIIFALIAAGSSGSTLDQLLAFLKFNSVEEL NSVYSRVITD
VLADGSPMGGPRLSVTNWAWVDQSLSFKHSFKQVMDNVYKAASASVDFRNKGDEVTG EVNK
WAEEKTNGLIKQILPPVAVNSGTSLILANALYFKGAWTEKLNASDTKDHEFHLLNGG SVQAPLMT
SKKRQYVKAFDGFQVLRLRYKQGEDKRFLNMYVYLPNARDGLPTLLEKISSEPGFLD RHVPYEK
VKVHEFLIPKFKISLGIEALEVLKGLELTLPFKGGLTEMVGENYPLAVANVFHKAFI EVNEEGAEAP
AAKAFHKAFIEVNEEAPVAPAVTVATMMFGCSMMKVEEEIDFVADHPFMFLVKDETAGVVLFVG
TLLNPLAVSPS
SEQ 118
LKVGSFFSSLIYSCNKASPNFYSYSFSLLSCFIELVNMGAKAFLVTILLSSLLFPLALST SNDGLVRI
GLKKIKFDQNNRLAARVESKEGEAVRASIRKYNNFHGNLGASEDTDIVALKNYMDAQ YFGEIGIG
SPPQKFTVIFDTGSSNLWVPSSKCYFSVPCFFHSKYKSSQSSTYKKNGKSAAIRYGT GAISGFFS
QDSVKVGDLIVQNQEFIEATREPSVTFLVAKFDGILGLGFQEISVGNAVPVWYNMVK QGLVKEPV
FSFWLNRNTKEDEGGEIVFGGVDPNHYKGKHTYVPVTRKGYWQFDMGDVLIDGQATG YCDNG
CSAIADSGTSLLAGPTTVITMINHAIGASGVVSQQCKAVVEQYGQTIMDMLLAEAHP KKICSQVG
LCTFDGTRGISMGIESVVDENAGKSSGLHDAMCSACEMAVVWMQNQLRQNQTQERIL NYVNEL
CERLPSPMGQSAVDCGKLSGMPSVSFTIGGRTFDLSPEEYILKVGEGPAAQCISGFI ALDVPPPR
GPLWILGDVFMGRYHTVFDFGKLRVGFAEAA
SEQ 119
MSKQNLEAPLLDPSPATFNRRKKWSFALCFLFALTAISFIGLRHHGHVGIWLIGDVERYN GKLQQ
NADVVESEQAVVAADDGRCSEIGISMLKIGGHAVDAAVATALCLGVVNPMASGLGGG GFMVVR
SSSTSEVQAIDMRETAPLAASQNMYDNNGKSKLEGALSMGVPGELAGLHAAWSKHGR LPWKT
LFQPAIKLARDGFVVAPYLAHHIASKAKLILKDPGLRQVIAPEGKLLRAGDICHNVK LSHSLELIAE
QGPEAFYNGEVGEKLVEDVKKAGGILTMDDLRNYKVETPEAVTVNAMGYTIVGMPPP SSGTLGI
SLILKILESYNAAEGSLGLHRLIEAMKHMFAFRMDLGDPDFVNISKTVSDMLSPSFA KAIRQKIFDN
TTFPPEYYMPRWSQLRDHGTSHFCIVDSDRNAVSVTTTVNYPFGAGVLSPSTGIVLN DEMGDF
STPSEISPDELPPAPANFIQPKKRPLSSMAPIIVLKDNQLAGVIGGSGGMKIIPAVV QVFINHFILGM
DPLAAVQSPRVYHELIPNVVLYENWTCIDGDHIELSDEKKHFLEERGHQLEAHNGGA ICQLIVQN
LPNSHLKLGRRSGKEYKNGVFHGMLVAVSDPRKDGRPAAI
SEQ 120
MLKKISSFNILLNMASHITLCIWLLFFFISIISLAKPETYIIHMDLSAMPKAFASHHNWY LTTLASLSD
SSTNHKEFLSSKLVYAYTNAINGFSASLSPSEFEAIKNSPGYVSSIKDMSVKIDTTH TSQFLGLNS
ESGVWPTSDYGKDIIIGLVDTGIWPESKSYSDYGISEVPSRWKGECESGIEFNSSLC NKKIIGARY
FNKGLLANNPNLNISMNSARDTDGHGTHTSSTAAGSYVEGASYFGYATGTAIGIAPK AHVAMYK
ALWEEGVYLSDVLAAIDQAITDGVDVLSLSLGIDAIPLHEDPVAIAAFAALEKGIFV STSAGNEGPY
YETLHNGTPWVLTVAAGTVDREFIGALTLGNGVSVTGLSLYPGNSSSSESSIVYVEC QDDKELQ
KSAHNIVVCLDKNDSVSEHVYNVRNSKVAGAVFITNITDLEFYLQSEFPAVFLNLQE GDKVLEYIK
SNSAPKGKLEFRVTHIGAKPAPKVATYSSRGPSPSCPSILKPDLMAPGALILASWPQ QSPVTDVT
SGKLFSNFNIISGTSMSCPHASGVAALLKAAHPEWSPAAIRSAMMTTSNAMDNTQSP IRDIGSKN
AAATPLAMGAGHIDPNKALDPGLIYDATPQDYVNLLCALNFTSKQIKTITRSSSYTC SNPSLDLNY
PSFIGFFNGNSSESDPRRIQEFQRTVTNIGDGMSVYTAKLTTMGKFKVNLVPEKLVF KEKYEKLS
YKLRIEGPLVMDDIVVYGSLSWVETEGKYVVRSPIVATSIKVDPLTGHN
SEQ 121
MEFYQKLATCSHLSLLCFILLHSIQVQGSYFDQEYGKQVLSSAIQDKDWLVSIRRIIHEY PELRFQ
EYNTSALIRTELDKLGIYYEYPFAKTGLVALIGSSSPPVVALRADMDALPLQELVEW EHKSKVTGK
MHGCGHDAHTAMLLGAAKLLNERKDKLNGTVRLVFQPAEEGGAGAYHMINEGALGDA EAIFGM
HVDFKRPTGSIGTSPGPILAAVSFFEAKIEGKGGHAAEPHATVDPILAASFAVVALQ QLISREVDP
LHSQVLSVTYVRGGSASNVIPPYVEFGGTLRSLTTEGLLQLQKRVKEVIEGQAAVHR CKAYIDMK
EEDFPAYPACINDERLHQHVGRVGKLLLGSENIKETEKVMAGEDFAFYQELIPGVMF QIGIRNEK
LGSTHAPHSPHFFLDEDVLPIGAALHTAIAEMYLNDYQHPIAV
SEQ 122
RHYIYGKLTSNMKTFGIPLAAHSRVLTGSYIRSLYLQILTPFLVHTTAQADNLNCDRSAT LNCDRS
ATEVCTDSEVSTDMEPGNSIVNGVPESIAEEDTAEPLDMDFEFYLSDDKATFKGSEI VMNEPLQS
TDISGRLNVLVSWSPKMLEQYNTGLFSSLPEVFKSGFFAKRPQESVSLYKCLEAFLK EEPLGPE
DMWYCPACKQHRQATKKLDLWRLPEILVIHLKRFSYNRFLKNKLETYVDFPTHDLDL SSYLAYK
DGKSSYRYMLYAISNHYGSMGGGHYTAFVHQGADRWYDFDDSHVYPISQDKLKTSAA YVLFYR
RVEEI
SEQ 123
MSRNSLKIHLSIGKIQPGSENKNGSPVYTDSGTCEHLSELRSRVGSNPFFNFRGCVKVRP LGRA
SIRREPPNELVRCGACGQAPPRLYACVTCAAVFCRVHAPSHPVGNASDPSLHSIAVD IDRAELF
CCGCRDQVYDRDFDAAVVLAQTEATVIGSIQDPPPQPENTRKRRRVEYKPWTPDVKE QVLIVG
NSSPLPSQLGNDSTTPEVQWGLRGLNNLGNTCFMNSVLQALLHTPPLRNYFLSDKHN RYFCQR
KNNSVITRSSSDNGNKNSTMLCLACDLDAMFSAVFSGDWTPISPAKFLYSWWKHASN LASYEQ
QDAHEFFISVLDGIHERMQNDKGKALSPGSGDCCIAHRVFSGILRSDVMCTACGFTS TTYDPCID
ISLDLELSQGSSAKMTSKKSHNTHKKEAESGKFSQNGRISTLMGCLDHFTRPEKLGS DQKFFCQ
HCQVRQESLKQMSIRKLPLVSCFHIKRFEHSVIKKMSRKVDHYLQFPFSLDMSPYLS SSILRSRF
GNRIFSFDGDEQDASCESSSEFELFAVITHTGKLDAGHYVTYLRLSNQWYKCDDAWI TQVSESI
VRAAQGYMMFYVQKMLYYKASENQVS
SEQ 124
MATHSSTLTISISLLLLLFFFFFSTLSSASDMSILTYDENQHFRTDDEVMSLYESWLLEH GKSYNA LDEKDKRFQIFKDNLRYIDEQNSVPNKSYKLGLTKFADLTNEEYRSMYLGTKTSDRRRLL KNKSD RYLPKVGDSLPDSVDWREKGVLVGVKDQGSCGSCWAFSAIASVEAVNSIVTGDVISLSEQ ELVD
CDTSYNDGCNGGLMDYAFDFIIKNGGIDTEEDYPYTGRDGRCDQSRKNAKVVTIDGY EDVPAN
NEKALQKAVANQPVSIAIEAGGHDFQHYVSGIFTGKCGTAVDHGVVAVGYGSENGMD YWIIRNS
WGASWGEKGYLRVQRNVASSKGLCGLAIEPSYPVKTGVNPPKPGPSPPSPIKPPTQC DDYAQC
PEGTTCCCVFEYYNSCFSWGCCPLEGATCCEDHYSCCPHDYPVCNIRAGTCSISKDN PLGVKA
MKHIHAEPIEAFINGGRKSSS
SEQ 125
MKKLFLVLFSLALVLRLGESFDFHEKELETEEKLWELYERWRSHHTVSRSLDEKDKRFNV FKAN
VHYVHNFNKKDKPYKLKLNKFADMTNHEFRHHYAGSKIKHHRSFLGASRANGTFMYA NVEDVP
PSVDWRKKGAVTPVKDQGKCGSCWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTS QNQGC
NGGLMDMAFEFIKKKGGINTEENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDED SLLKAVA
NQPVSVAIQASGSDFQFYSEGVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWG PEWGEK
GYIRMQREIDAEEGLCGIAMQPSYPIKTSSSNPTGSPATAPKDEL
SEQ 126
MARPQFTVILAIISLLIHYGVVSGFRLSDVTNGSSVFLPSPADGSRHTTMLLPLFPPKDT SRRAEIS
RRHLQKSPASARMSLHDDLLLNGYYTTHIWIGTPPQKFALIVDTGSTVTYVPCSECK KCGNHQD
PKFQPEMSSTYQSVKCNKACPCDHKRQQCIYERRYAEMSASYGLLGEDIISFGNLSE LAPQRAV
FGCEIAETGDLYSQRADGIMGLGRGDLSIVDQLVEKHVISDSFSLCYGGMDFGGGAM VLGGVK
PPADMAFTKSDFGHSPYYNIDLKEIHVAGKPLNLNPRVFGGKHGTILDSGTTYAYLP EAAFAAFK
NAVVKELHSLKQIEGPDPSFKDICFSGAGSNISELSKNFPRVDMVFSDGKKLTLSPE NYLFQHFK
VRGAYCLGIFPNGKNPASLLGGIVVRNTLVTYDRENKRIGFWKTNCSELWDRLNLSP PSPPSPS
VSSLDNTNSSAHLSPSSAPSGPPGYNTPVEIKVGLITFYLSLSVNCSELKPRIPELA HFIAQELDV
NVSQVGF
SEQ 127
MGAKSFLVAFFLSLLLFPLAFCTSNDGLVRIGLKKIKFDQNNRLAARVESKEGEALRASF RKYNN
LRGNLGASEDTDIVALKNYMDAQYFGEIGIGSPPQKFTVIFDTGSSNLWVPSSKCYF SVPCLFHS
KYKSSQSSTYKKNGKSAAIRYGTGAISGFFSQDSVKVGDLVVKNQEFIEATREPSVT FLVAKFDG
ILGLGFQEISVGNAVPVWYNMVKQGLVKEPVFSFWLNRNTEEDEGGEIVFGGVDPNH YKGKHT
YVPVTRKGYWQFDMGDVLIDGQATGYCDNGCSAIADSGTSLLAGPTTVVTMINHAIG ASGVVSQ
QCKAVVEQYGQTIMDMLLAEAHPKKICSQVGLCTFDGTRGVSMGIESVVDENAGKSS GLHDAM
CSACEMAVVWMQNQLRQNQTQERILNYVNELCERLPSPMGQSAVDCGKLSGMPSVSF TIGGR
TFDLSPEEYILKVGEGPAAQCISGFIALDVPPPRGPLWILGDVFMGRYHTVFDSGKL RVGFAEAA
SEQ 128
MVVAFVGIAKSIGQQCLRRSKPYSYSYFSSYVRSSNSKYGLQNWQFQSHRTLILQSASES VKLE
RLSDSDSGILEVKLDRPEARNAIGKDMLRGLQQAFEAVSNERSANVLMICSSVPKVF CAGADLK
ERKTMILSEVQDFVSTLRSTFSFLEGLHIPTIAAIEGIALGGGLEMAMSCDIRICGE DAVLGLPETG
LAVIPGAGGTQRLPRLVGKSIAKDIIFTGRKISGKDAVSIGLVNYCVPAGEARLKTL ELARDINQKG
PVALRMAKCAIDKGVELNMESALALEWDCYEQLLDTKDRLEGLAAFAERRKPRYKGE
SEQ 129
MCSSNSLYINPKPCKHLADYKVKNGMSGYSLIQECFKTTPYGRTTLEISKSELPRCSICS GHEGR
FYMCLICSSVLCCLSPESNHALLHSQCKAGHEISVDMERAELYCSVCCDQVYDPDFD KVVMCK
HIMGFPRTEIGVVESELRLSKRRRLSFGMDLDSKNMKTLFLRRDQKSKSCFPLVLRG LNNLGNT
CFMNSVLQVLLHAPPLRNYFLSDRHNRDICRKMSSDRLCLPCDIDLIFSAVFSGDRT PYSPARFL
YSWWQHSENLATYEQQDAHEFFISVMDRIHDKEGKASLATKDNGDCQCIAHRTFYGL LRSDVT
CTSCGFTSTTHDPCMDISLDLNSCNSSPKDFANKSSKPNESLVGCLDLFTRPEKLGS DQKLYCE
NCQEKQDALKQMSIKKLPLVLSFHIKRFEHSPTRKMSRKIDRHLQFPFSLDMKPYLS SSIVRKRY
GNRIFSFDGDESDISTEFEIFAVVTHSGMLESGHYVTYLRLRNQWYKCDDAWITEVD EEVVRAS
QCYLMYYVQKMLYHKSCEDVSCQPMSLRADTFVPIAGCC
SEQ 130
MKELHSLREIEGPDPNYKDICFSGAGSDISELSKSFPPIDMVFSNGKKLSLTPENYLFRH SKVRG AYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRENERIGFWKTNCSELWDRLNLSPSPPPPP LPSGL DNTNSSANLTPALAPSLPLEHAPGKIKIGLVSFDMSLSVDYSALKPRVPELAHFIAQELE VNVSQV
HLMNFSTEGNDSLIRWAIFPAGSANYMPNATATEIINRLAENRFHLPDTFGSYKLVK WDIEPPPK
RIRWQQNYLVVVFALLVVLIIGLSASLGWLIWRRRQEIPYNPVGSAETHEKELQPLN
SEQ 131
MVTVSVKWQKEVYPAVEIDTSQPPYVFKAQLYDLTGVPPERQKIMVKGGLLKDDADWSKV GVK
EGQRLMMMGTADEIVKAPEKGPVFAEDLPEEEQVVNVGHSAGLFNLGNTCYMNSTVQ CLHSV
PELKSALTEYNQLGRSNDLDHSSHLLTVATRDLFNDLDKNVKPVAPMQFWTVLRKKY PQFGQQ
SNGAFMQQDAEECWTQLLYTLSQSLKSPNSSGSPDIVKALFGIEFDNRIHCAESGEE STETETV
YSLKCHISQEVNHLHEGLKRGLKSELEKASPSLGRSAVYVKDSRINGLPRYLTIQFV RFFWKRES
NQKAKILRKVDYPLSLDVYDFCSEDLRKKLEGPRQVLRDAEGKKAGLKTSEKTSSST DGDVKMT
EAEESSSGSGEASKTTQEGVLPEKEHHLTGIYDLVAVLTHKGRSADSGHYVAWVKQE NGKWV
QFDDDNPIPQREEDIPKLSGGGDWHMAYICMYKARVVPM
SEQ 132
MEKKKEVIRLERESVIPVLKPRLIMALADLIEHSSDRAEFLKLCKRVEYTIHAWYLLQFE DLMQLYS LFDPVNGAKKLEQQKLSPEEIDILEQNFLTYLFQIMHKSNFKIASDEEIDVAHSGQYLLN LPITVDE SKLDKKLLEKYFAEHPHEDLPEFADKYVIFRRGIGIDRTTDYFFMEKVDMIIGRTWAWIL RKTRID RLFSRRSSSRRKKDPKKDDEINSEAEDHDLYVERIRIENMELSARSNQFSLHQVK
SEQ 133
MELTCSSPLSVNSTISFNPQLRRYGSVYPHKRCQTVFSLFPYCPSSSSHITITTATTAAC STSSST
SSLFGISLSHRPCSSIPRKIKRSLYIVSGVFERFTERSIKAVMFSQKEAKALGKDMV YTQHLLLGLI
AEDRSPGGFLGSRITIDKAREAVRSIWHDDVEDDKEKLASQDSGSATSATDVAFSSS TKRVFEA
AVEYSRTMGHNFIAPEHMAFGLFTVDDGNATRVLKRLGVNVNRLAAEAVSRLQGELA KDGREPI
SFKRSREKSFPGKITIDRSAEKAKAEKNALEQFCVDLTARVSEGLIDPVIGREIEVQ RIIEILCRRTK
NNPILLGQAGVGKTAIAEGLAINIAEGNIPAFLMKKRVMSLDIGLLISGAKERGELE GRVTTLIKEVK
KSGNIILFIDEVHILVGAGTVGRGNKGSGLDIANLLKPALGRGELQCIASTTMDEFR LHIEKDKAFA
RRFQPVLINEPSQADAVQILLGLREKYESHHKCIYSLEAINAAVQLSARYIPDRYLP DKAIDLIDEA
GSKSRMQAHKRRKEQQISVLSQSPSDYWQEIRAVQAMHEVILASKLTENDDASRLND GSELHL
QPASPSTSDEDEPPVVGPEEIAAVASLWTGIPLKQLTVDERMLLVGLDEQLKKRVVG QDEAVAAI
CRAVKRSRTGLKDPNRPISAMLFCGPTGVGKSELAKALAASYFGSESAMLRLDMSEY MERHTV
SKLIGSPPGYVGYGEGGTLTEAIRRKPFTVVLLDEIEKAHPDIFNILLQLFEDGHLT DSQGRRVSF
KNALIVMTSNVGSTAIVKGRQNTIGFLLADDESAASYAGMKAIVMEELKTYFRPELM NRLDEVVV
FRPLEKPQMLQILDLMLQEVRARLVSLEISLEVSEAVMELICQQGFDRNYGARPLRR AVTQMVE
DLLSESFLSGDLKPGDVAIINLDESGNPVVANKSTQSIHLSDANGNPVVTNR
SEQ 134
MKNIERLANVALLGLSLAPLVVNVDPNVNVIVTACLTVFVGCYRSVKPTPPSETMSNEHA MRFPL
VGSAMLLSLFLLFKFLSKDLVNAVLTCYFFVLGIAALSATLLPAIRRFLPKKWNDDL IIWHFPYFRS
LEIEFTRSQIVAAIPGTIFCVWYAKQKHWLANNVLGLAFCIQGIEMLSLGSFKTGAI LLAGLFVYDIF
WVFFTPVMVSVAKSFDAPIKLLFPTADAKRPFSMLGLGDIVIPGIFVALALRFDVSR GKGPQYFKS
AFLGYTFGLALTIFVMNWFQAAQPALLYIVPAVIGFLAVHCIWNGDVKPLLEFDEGK TKGAEEAD
AKESKKVE
SEQ 135
MAFSSSYFSFIFLILLFIISFVVGEIKPIYLPGTYQSSLEKQHVKSKIPFKVHYFPQILD HFTFLPKSS
KVFKQKYLINDNYWKQGGPIFVYTGNEGNIDWFAANTGFMLDIAPKFHALLVFIEHR FYGDSMPF
GKKSYKSPKTLGYLNSQQALADYAVLIRSLKQNLSSESSPVVVFGGSYGGMLASWFR LKYPHIAI
GAVASSAPILQFDKITPWSSFYDAVSQDFKEVSLNCYRVIKGSWTELDALSKHEEGL TEVSKLFR
TCKGLHSVYSARDWLWEAFVYTAMVNYPTKANFMMPLPAYPVQEMCKIIDGLPKGAS KISRAFA
AASLYYNYTKREKCFNLEGGDDAHGLRGWDWQACTEMVMPMTCSNESMFPPSSYSYK EFKE
DCKKKYGVEPRPHWITTEFGGYRIEQVLKRFGSNMIFSNGMQDPWSRGGVLKNISAS IVALVTQ
KGAHHVDFRSETKNDPGWLIMQRKQEVAIIQKWLEEYYRDLKQN
SEQ 136
MSRFSLLLALVVAGGLFASALAGPATFADENPIRQVVSDGLHELENAILQVVGKTRHALS FARFA
HRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGLSYKLGVNEFTDLTWDEFRRDRL GAAQNC
SATTKGNLKVTNVVLPETKDWREAGIVSPVKNQGKCGSCWTFSTTGALEAAYSQAFG KGISLSE
QQLVDCAGAFNNFGCNGGLPSQAFEYIKSNGGLDTEEAYPYTGKNGLCKFSSENVGV KVIDSV
NITLGAEDELKYAVALVRPVSIAFEVIKGFKQYKSGVYTSTECGNTPMDVNHAVLAV GYGVENGV
PYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCASYPVVA
SEQ 137
MEKEHKYSLFLTKLKLFFLVTLSTFHGLSHGFQMDQARTLMSWRRSKMHAQTTTYATNED ETE
NLVFSDEKHVGNMEDDLIKDGLPAQPSNVMFKQYAGYVNVDVKNGRSLFYYFAEASS GNASSK
PLVLWLNGGPGCSSLGFGAMLELGPFGVNPDGKTLYSRRFAWNKVANVMFLESPAGV GFSYS
NTTSDYSKSGDKRTAEDAYRFLVNWFKRFPHYKGRDFYIMGESYAGFYVPELADIIV KRNMLPT
TNFYIQFKGIMIGNGIMNDETDEKGTLDYLWSHALISDETHRGLLQHCKTETETCQH FQNIAEAEL
GNVDPYNIYGPQCSINSKSRSSSPKLKNGYDPCEQQYVQNYLNLPHVQKALHANLTN LPYLWN
PCSNLDWKDTPATMFPIYKRLIASGLRILLYSGDVDAVVSVTSTRYSLSAMNLKVIK PWRPWLDD
TQEVAGYMVVYDGLAFATVRGAGHQVPQFQPRRAFALLNMFFANHS
SEQ 138
MANSYTSINFFLAPIIFLAILGLQLQSSDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGS VEYYSA
WTQRDRFIKGRRLAEADTANSTPLSFSGGNETFRLSSLGFLHYANVTVGTPGLSFLV ALDTGSD
LFWLPCDCSNCVRALETRSGRRINLNIYSPNTSSTGQIVPCNSTLCGQRRRCLSSQN ACAYGVA
YLSNNTSSSGVLVEDILHLETDNAQQKSVEAPIALGCGIRQTGAFLSGAAPNGLFGL GLENISVPS
MLASKGLAANSFSMCFGPDGIGRIVFGDKGSPAQGETPLNLDQLHPTYNISLTGITV GNKITDVD
FTAIFDSGTSFTYLNDPAYKVITENFDSQAKQPRIQPDGEIPFEYCYGLSANQTTFE VPDVNLTMK
GGNQLFLFDPIIMLSLQDRSGAYCLAVVKSGDVNIIGQNFMTGYRVVFDREKMVLGW KPSDCYD
SRGSNDKSTTLPVNKRNSTEAPSPSSVVPEATKGNGSGNEPATSFPSVQSSKPAANQ APAHFI
CQLMMALFSLFSYYLIIISS
SEQ 139
MAIHTSTLSISILVMLMFSVVSSSAAEDMSIISYNEKHHTNGESTVWRTDDEVMSLYESW LVEHK
KVYNALGEKDKRFQIFKDNLRYIDEHNSVPDKSYKLGLTQFADLTNEEYKSIYLGTK PDGRSRLL
NTQSDRYAPKVGDSLPDSVDWRKKGVLVDVKNQGQCGSCWAFSAVASIEAVNKIVTG NLISLS
EQELVDCDTSDNQGCQGGLMDDAFKFVIQNGGIDTEEDYPYKAKDGKCDQARKNARV VTIDGY
EDVPDNDEKALKKAVAGQPVSVAIEAGGKDFQHYKSGIFTGKCGAAVDHGVVAVGYG SENGM
DYWIVRNSWGASWGEKGYLRMQRNIGNPKGLCGIATIASYPVKTGQNPPKPAPSPPP VKPPTQ
CDDYNECPAGTTCCCVYEYYKYCFAWGCCPMEGATCCKDHNSCCPHDYPVCNVKAGT CSISK
NNPLGVKAMQHILAKPIGTFGNEGKKSPSS
SEQ 140
MEIKILLASLVIWYITCINVYADDMVRIELKRQSLDLSSISDARIYAKDLRGRNRNLAAP NDQIVYLK
NYHDVQYFAEIGIGSPPQRFIVVFDTGSSNLWVPSSRCFFSIACYLRSRYKSRLSNT YTKIGKSSK
IPFGTGSVHGFFSQDNVKVGGAVLKQQVFTEVTREGYLTLLRARFDGVLGLGFDQST TSRNVTP
VWYNMLLQHMVTKSIFSFWLNRDPTSKIAGEIIFGGMDWTHFRGQHTYVPVAQNGYW EIEIGDL
FIGSNSTGLCKDGCPAIVDTGTSFIAGPTTILTQINHAIGAEGIISLECKKVVSSYG DSIWERLIAGL
QPENVCNRIGLCTNNGSLCSSCEMIVFWIQVEIRKERSKEKAFQYANQLCEKLPNPG GKSFINC
DVFALPHITFTIGDKSFPLSPDQYVIRVDDSQGVHCISGFTTLNAHPRRPLWVLGDA FLRAYHTV
FDFGSSQIGFAESA
SEQ 141
MASIFALSLFFIIISFCITSITIPVQSDGHETFIIHVSKSDKPRVFATHHHWYSSIIRSV SQHPSKILYT
YSRAAVGFSARLTAAQADQLRRIPGVISVLPDEVRHLHTTHTPTFLGLADSFGLWPN SDYADDVI
IGVLDTGIWPERPSFSDEGLSPVPSSWKGKCATGPDFPETSCNKKIIGAQMFYKGYE ASHGPMD
ESKESKSPRDTEGHGTHTASTAAGSVVANASFYQYAKGEARGMAIKARIAAYKICWK NGCFNS
DILAAMDQAVNDGVHVISLSVGANGYAPHYLLDSIAIGAFGASEHGVLVSCSAGNSG PGAYTAV
NIAPWILTVGASTIDREFPADVILGDNRIFGGVSLYSGDPLTDAKLPVVYSGDCGSK YCYPGKLD
HKKVAGKIVLCDRGGNARVEKGSAVKQAGGVGMILLNLADSGEELVADSHLLPATMV GQKAGD
KIRHYVKSDPSPTATIVFRGTVIGKSPAAPRVAAFSSRGPNHLTPEILKPDVIAPGV NILAGWTGS VGPTDLDIDTRRVEFNIISGTSMSCPHASGLAALLKRAHPKWTPAAVKSALMTTAYNLDN SGKVF TDLATGQESTPFVHGSGHVDPNRALDPGLVYDIETSDYVNFLCSIGYDGDDVAVFVRDSS RVNC SEQNLATPGDLNYPSFSVVFTGESNGVVKYKRVMKNVGKNTDAVYEVKVNAPSSVEVSVS PAK LVFSEEKKSLSYEISFKSKSSGDLEMVKGIESAFGSIEWSDGIHNVRSPIAVRWRHYSAA SI
SEQ 142
MPSSLFLTLLLASISLSFSSTLNSNDDEFFLSSTPKFPLTMAEKLIRQLNLFPKHDINKA AATGDSE
QRLFERKLNLSYVGNSGSTVQDLGHHAGYYRLPHTKDARMFYFFFESRSRKNDPVVI WLTGGP
GCSSELAVFYENGPFKIADNMSLVWNDFGWDKVSNLIYVDQPTGTGFSYSSNDDDIR HDERGV
SNDLYDFLQAFFKAHPQYAKNDFYITGESYAGHYIPAFASRVHQGNKNKEGIYVNLK GFAIGNGL
TDPEIQYKAYTDYALDMKLIKKSDYNAIEKSYPKCQLAIKLCGKDGGTACMAAYLVC TSIFNKIMDI
AGDKNYYDVRKRCEGDLCYDFSKMETFLNDQQVKKALGVGDIEFVSCSSEVYQAMQL DWMRN
LEEGIPSLLEDGIKLLVYAGEYDLICNWLGNSRWVHAMKWSGQKAFGKATQVSFAVD GVEKGV
QKNYGPLTFLKVHDAGHMVPMDQPKAALEMLHRWMQDKLSKQGHLAPM
SEQ 143
MLVISDCYINSCKAFNFVINLPVMGHSHSHSSHSHSHFHSSKSSDDQNMDMGESITTQTD VSFM
LAKHVFSKEVKGDSNLVFSPLSIQIVLGLIAAGSKGPTKDQLLCFLKSKSIDELNSL YSHFVSVVFV
DGSPNGGPRLSVVNGVWIDQTLPFKPSYKKVVDKVYKAASNSVDFQCKAAEVANQVN QWAKM
KTNNLIKEILPHGTVNNMTRLIFANALYFKGVWNDKFNASETKDHKFHLLSGGSIKA PFMTSKNK
QYAVAFDGFKVLGLHYKQGKDMRRFCMYLILPDARDELPALLDKISSEPGFIDHHIP FEKAKMRK
FLIPKFKTTFGFEASKVLKGLGLTLPFSSGGLTEMVDSPLAGRLFVSQIFHKSFIEV NEEGTEAAA
VTASVIMTKSLIIEKEMEFVADHPFLFLIRDESTGAVFFIGSVLNPLAG
SEQ 144
MLRIGPSLRTARKLLNRNLHFQSPIIAGDVAPVHHRRQELHRFVRRCNYSSTVGNTSASA SFFST
LNNSNSSTTSTTPHVERAEENDSLQSNASEVEPVAAVEQRLSSGMVDAYLAIELALD SVVKIFTV
SSSPNYFLPWQNKSQRETTGSGFVIRGKRILTNAHVVADHTFVLVRKHGSPTKYRAT VQAVGHE
CDLAILVVESEEFWEGMNSLELGDVPFLQEAVAVVGYPQGGDNISVTKGVVSRVEPT QYVHGA
SQLLAIQIDAAINPGNSGGPAIMGDKVAGVAFQNLSGAENIGYIIPVPVIKHFIAGV EERGEYIGFC
SLGLSCQPTENAQIREYFQMQSKLTGVLVSRINPLSDASRVLKKDDIILSFDGVPIA NDGTVPFRN
RERITFDHLVSMKKPNETAELKVLRNGKVHDFKITLHPLQPLVPVHQFDKLPSYFIF AGLVFIPLTQ
PFLHEYGEDWYNASPRRLCERALRELPKKPGEQFIILSQVLMDDINAGYERLAELQV KKVNGVE
VLNLKHLRQLVEDGNQKNVRFDLDDEKVIVLNYESARIATSRILKRHRIPHAMSSDL TDDENAVEL
QSACSS
SEQ 145
ICREPPNELVRCGACGHAPPRLYACVTCAEVFCRVHAPSHPAGNAADPSLHCIAVDIDRA ELFC
CGCRDQVYNSDFDAAVALAQTEATVIGSIQDPPPHPESTRKRRRVEYKPWTPDVKEQ VLIVGNS
SPLPSQLGNDSTTPEVQWGLRGLNNLGNTCFMNSVLQALLHTPPLRNYFLSDKHNRY FCQRKN
SSVITRSSSDNGNKNSTMLCLACDLDAMFSAVFSGDRTPISPAKFLYSWWKHASNLA SYEQQD
AHEFFISVLDGIHERMQNDKGKALSPGSGDCCIAHRVFSGILRSDVMCTACGFTSTT YDPCIDISL
DLELSQGSSSKMTSKKSHNTHKKEAESGKFSQNGRISSLMGCLDHFTRPEKLGSDQK FFCQHC
QVRQESLKQMSIRKLPLVSCFHIKRFEHSVIKKMSRKVDHYLQFPFSLDMSPYLSSS ILRSRFGN
RIFSFDGDEQDASCESSSEFELFAVITHTGKLDAGHYVTYLRLSNQWYKCDDAWITQ VSENIVRA
AQGYMMFYVQKMLYYKASEKQVS
SEQ 146
MEGSPVLGEHAELIGVLSRPLRQRATAAEIQMVIPWEAITSACGSLLKEELQTRRKIHFD NGNLIS
VKNESPSNNIRNGPSNDTREHLLIDPVPPSLIEKAMTSICLITVDDGAWASGVLLNK QGLLLTNAH
LLEPWRFGKTSVNGSGYNTKSDVVLIPSDQSEHPGVEKFDIQRRNKHLIQKELKTPH FLVDNEQ
GSFRVNLAKTGSRIIRVRLDFMDPWVWTNAKVVHVSRGPLDVALLQLELVPDQLCPI TADFMCP
SPGSKAYILGHGLFGPRCDFLPSACVGAIAKVVEAKRSLLNQSSLGEHFPAMLETTA AVHPGGS
GGAVVNSEGHMIALVTSNARHGGGTVIPHLNFSIPCAALEPIFKFVEDMQNLSLEYL DKPDEQLS
SVWALTPPLSSKQSPSMLHLPMLPRGDSDGDTKGSKFAKFIADSEAMLKSATQLGKV ERLSNKL
VHSKL
SEQ 147
MLKALTSSCLQNRFHAVTTAFTPQVRRGTDSNTPLLRVLGSLRSSNRRGPYLSRRFFCSD STD
GSESNSEAAASEAKPAEKGGDADSKASAAIVPTVFKPEDCLTVLALPLPHRPLFPGF YMHIYVKD
PKVLAALLESRKRQAPYAGAFLMKDEQGTDPNVVSASDTEKNIYELKGKDMLNRLHE VGTLAQI
TSIKDDQVILIGHRRIRMTEVVSEEPLTVKVDHLKEQPYNKDDDVIKATSFEVLSTL RDVLKTSSL
WKDHVQTYIQ
SEQ 148
MERKHLWAALLLLAIACFVFPASSDSLLRISLKKRQLDISSLNVANVARLEDRYGKHVMK DIEKKK
KKKKSDTNSDIVSLKNYLDAQYYGDISIGSPPQNFTVIFDTGSSNLWVPSSRCYFSI ACWIHSKYK
ARKSSTYTKKGESCSIHYGSGSISGFLSQDNVQVGDLVVTDQVFIEATRESSVTFIV AKFDGILGL
GFKEIAVGNTTPVWYNMVKQDLVKEPVFSFWLNRDINAKEGGELVFGGVDPKHFKDK HTYVPL
TQKGYWQFKMGDFSIGNQSTGFCEGGCAAIVDSGTSLLAGPTAVVTQVNHAIGAEGV LSMECK
ETISQYGEMIWDLLVSGVTPDQICLQVGLCYLNGAQHLSSNIRSVVEKENEGSSIGE APLCTACE
MAVIWMQNQLKQKTTKESVLEYVNQLCEKLPSPMGQSVIDCNSISSMPNVTFNIGDK DFVLTPD
QYILKTGEGIATICLSGFVALDVPPPRGPLWILGNVFMGVYHTVFDYGNLQLGFAEA A
SEQ 149
SRSYYNILLLQYLFLFVMALILGWKILFILLFVIIGMCTSQVTSRNIQALSMLEKHELWM SSHGRTY
KNEAEKEKRLNIFKENVKFIESFNNNGTKKPYKLGINAFADLTAEEFLSYYTTGLKL SNSYSQIQS
SFKYENLSDVPSVMDWRKSGAVTRIKHQGQCGCCWAFSAVAALEGANKLSTNNLISL SEQQLL
DCTTENNGCNGGLMTTAYDFIIQNGGIATESNYPYEEYQDSCKSQEMNSAVKINRYE TLPSTES
ALLKAVAKQPVSIGIAVNEDFHLYQNGVYNGNCEGQELNHAVTVIGYGTENDGTKYW LIKNSWG
TSWGENGYMKIARDTGIEGGLCGITTLASYPVL
SEQ 150
MGLPEVVDVARNYAVMVRIQGPDPKGLKMRKHAFHLYNSGKTTLSASGMLLPSSFVNASV SKQ
IQGESKLHSFGGHFLVLTVASVIEPFVVQQDRGDISKDKPELIPGAQIDILWEGGNT LQNDIKVTN
KEGLNWLPAELLRVVDIPVSSAAVQSLVEGSSSSIEHGWEVGWSLAAYGNSRQSFTN TKRTQV
EKISFPSQTPMMEAQSSLPSVIGTSTTRIALLRVSSNPYEDLPALKVATWSRRGDLL LGMGSPFG
ILSPSHFFNSISVGSIANSYPPSPQNKALLIADIRCLPGMEGSPVLGEHAELIGVLS RPLRQRATAA
EIQMVIPWEAITSACGSLLKEELQTRRKIHFGNGNLISVKKESFSNNIQDGHANDTQ EHLQIDPVP
PSLIEKAMTSICLIAVDDGAWASGVLLNKQGLLLTNAHLLEPWRFGKTSVNGSGYNT KSDVVLIP
SDQSEHPGVEKFDIQRRNKHLIQKELKTPHFLVDNEQCSFRVNLANTGSRTIRVRLD FMDPWV
WTNAKVVHVSRGPLDVALLQLELVPDQLCPIIVDFMCPSPGSKAYILGHGLFGPRCD FLPSACV
GAIAKVVEAKRPLLNQSSLGGHFPAMLETTAAVHPGGSGGAVVNSEGHMIALVTSNA RHGGGT
VIPHLNFSIPCAALEPIFKFAEDMQNLSLEYLDKPDEQLSSVWALTPPLSSKQSPSM LHLPMLPR
GDSDGDTKGSKFAKFIADSEAMLKSATQLGKVERLSNKLVHSKL
SEQ 151
MDNPSEDSSDSPQQQPESPVNDDQRVYLVPYRWWKEAQESSPSDGKSVTLYAAAPAPSYG G
PMKIINNIFSPDVAFNLRREEESLSQSQENGEVGVSGRDYALVPGDIWLQALKWHSN SKAAAKN
GKSFSATDEDIADVYPLQLRLSVLRETSSLGVRISKKDNTVECFKRACRIFSVDTEP LRIWDLSGQ
TALFFSDENNKILKDSQKQSEQDMLLELQVYGLSDSVKNKVKKDEMSMQYPNGSSFL MNGTGS
GITSNLTRSSSSSFSGGPCEAGTLGLTGLQNLGNTCFMNSALQCLAHTPKLVDYFLG DYKREIN
HDNPLGMNGEIASAFGDLLKKLWAPGATPVAPRTFKLKLAHFAPQFSGFNQHDSQEL LAFLLDG
LHEDLNRVKNKPYVEAKDGDDRPDEEIADEYWNNHLARNDSIIVDVCQGQYRSTLVC PVCKKVS
IMFDPFMYLSLPLPSTSMRSMTVTVIKNGSDIQISAFTITVSKDGRLEDLIRALSTA CSLDADETLL
VAEIYNNRIIRYLEEPADSLSLIRDGDRLVAYRLHKGTEEAPLVVFTHQQIDEHYIY GKLTSNMKTF
GIPLAAHSRVLTGSDIRSLYLQILTPFLVHNTAQADNLNCDRSATEACTDSEVITDM EPGNSIVNG
VPESIAEEDTAEPLDMEFQFYLSDDKATFKGSEIVMNEPLQSTDISGRLNVLVSWSP KILEQYNT
GLFSSLPEVFKSGFFAKRPQESVSLYKCLEAFLKEEPLGPEDMWYCPACKQHRQATK KLDLWR
LPEILVIHLKRFSYNRFLKNKLETYVDFPTHDLDLSSYLAYKDGKSSYRYMLYAISN HYGSMGGG
HYTAFVHQGADRWYDFDDSHVYSISQDKLKTSAAYVLFYRRVEEI
SEQ 152
MASSSRVFVLLLLIIFNFLYISAQKTIKHKPFSMSFPLISTSLSHNSSSKALFLSSFMAS NNRRQTQ
NTKTMSRIPSLNYKSTFKYSMALIVTLPIGTPPQNQQMVLDTGSQLSWIQCHKKIPK RPPPTTSF
DPSLSSTFSVLPCTHPLCKPRIPDFTLPTTCDQNRLCHYSYFYADGTLAEGNLVREK ITFSRSQS
TPPLILGCATESEDAEGILGMNLGRFSFASQAKVQKFSYCVPIRQGSHAVKPSGTFY LGQNPNS
HTFQYINLLTFPQSQRMPNLDPLAFTVGMVGIKIGGKKLNISGRVFRPNAGGSGQTI IDSGTEYTF
LVEEAYNKVREEIVRLVGPRLKKGYVYGGALDMCFDNRPIEIGRLIGDMTLQFENGV DILINKERM
LDEVEGGIHCVGIGRSESLGIASNIIGNFHQQNLWVEFDMRNRRVGFGKGECSRQV
SEQ 153
YTIIIFSLNMKIFSIFSLLLLLLLPILASCHEKQVYIVYFGGHKGEKALHEIEENHHSYL MSVKESEEE
ARYSLIYSYKHSINGFAALLTPHEASKLSELEEVVSVYKSEPRKYRLQTTRSWEFSG VEESVQPN
SLNKDNLLLKARYGKDVIIGVLDSGLWPESKSFSDEGLGPIPKSWKGICQSGDAFNS SNCNKKII
GARYYIKGYEQYYGPLNRTLDYLSPRDKDGHGTHTSSTAGGRKVPNVSAIGGFASGT ASGGAP
LARLAMYKVCWAIPKEGKEDGNTCFDEDMLAAMDDAIADGVDVISISIGTKEPQPFD QDSIAIGAL
YAVKKNIVVSCSAGNSGPAPSTLSNTAPWIITVGASSVDRAFLSPVILGNGKKFTGQ TVTPYKLE
KEMYPLVYAGQVINSNVTKDVAGQCLPGSLSPKKAKGKIVICLRGNGTRVGKGGEVK RAGGIGY
ILGNNKANGAELVADPHFLPATAVDYKSAMQILNYINSTKSPVAYIVPAKTVLHSKP APYMASFTS
RGPSAVAPDILKPDITAPGLNILAAWSGGSSPTKLDIDDRVVEYNIISGTSMSCPHV GGAAALLKAI
HPTWSSAAIRSALITSAGLRNNVGEQITDASGKPADPFQFGGGHFRPSKAADPGLVY DASYQDY
LLFLCASGIKDLDKSFKCPKKSHLPNNLNYPSLAIPNLNGTVTVSRRLTNVGAPKSV YFASAKPPL
GFSVEISPPVLSFKHVGSKRTFTITVKVRSDMIDSIPKDQYVFGWYSWNDGIHNVRS PIAVKLA
SEQ 154
MATRRSSSSALTALAASRSRLLSRFRPAVSRLSQNTLLGTGRCPPPNSGFFVAETTAALW PNYN
VLSKSFVHSYSTTAASSGQINNMDYTEMALEGIVGAVEAARTSKQQVVETEHLMKAL LEQKDGL
ARRIFTKAGLDNSSVLQETDQFISQQPKVVGDTSGPILGSHLSSLLENAKKHKKEMG DSFVSVEH
MLLSFLSDTRFGQKLFRNLQLTEKALKDAVNAVRGSQRVTDPNPEGKYEALEKYGND LTELARR
GKLDPVIGRDDEIRRCIQILSRRTKNNPVIIGEPGVGKTAIAEGLAQRIVRGDVPEP LMNRKLMSL
DMGALLAGAKYRGDFEERLKAVLKEVSSSNGQIILFIDEIHTVVGAGATSGAMDAGN LLKPMLGR
GELRCIGATTLNEYRKYIEKDPALERRFQQVYCGQPSVEDAISILRGLRERYELHHG VKISDSALV
SAAVLADRYITERFLPDKAIDLVDEAAAKLKMEITSKPTELDEIDRAVLKLEMEKLS LKNDTDKASK
ERLNKLESDLKSLKAKQKELNEQWEREKDLMTRIRSIKEEIDRVNLEMEAAEREYDL NRAAELKY
GTLISLQRQLGEAEKNLADYRKSGSSLLREEVTDLDITEIVSKWTGIPLSNLQQSER DKLVFLENE
LHKRVVGQDMAVKSVADAIRRSRAGLSDPNRPIASFMFMGPTGVGKTELGKALAAYL FNTENAL
VRIDMSEYMEKHAVSRLVGAPPGYVGYEEGGQLTEVVRRRPYSVVLFDEIEKAHHDV FNILLQL
LDDGRITDSQGRTVSFTNTVVIMTSNIGSHYILETLQNTRDSQEAVYDAMKKQVIEL ARRTFRPEF
MNRIDEYIVFQPLDLKQVSRIVELQMRRVKDRLKQKKIDLHYTQEAISLLANMGFDP NYGARPVK
RVIQQMVENEVAMGVLRGDFSEEDMIIVDADASPQGKDLLPEKRLLIRRIENGSNMD AMVAND
SEQ 155
VNVKCFFVSFFFSFSCMSLFFLQGWNFETFCLKTQSFAVTNKNHRPHLHSHHSSFLCFHT SYLL
FFLILYIYIAKTTSRFAKTQQPPQKMSRFTMLVVLVLLLLCLCHLSVATIGSSSNKK STYIVHVAKS
QMPESFENHKHWYDSSLKSVSDSAEMLYVYNNVVHGFSARLTVQEAESLERQSGILS VLPEMK
YELHTTRTPSFLGLDRSADFFPESNAMSDVIVGVLDTGVWPESKSFDDTGLGPVPDS WKGECE
SGTNFSSSNCNRKLIGARYFSKGYETTLGPVDVSKESKSARDDDGHGTHTATTAAGS IVQGASL
FGYASGTARGMATRARVAVYKVCWIGGCFSSDILAAMDKAIDDNVNVLSLSLGGGNS DYYRDS
VAIGAFAAMEKGILVSCSAGNAGPGPYSLSNVAPWITTVGAGTLDRDFPAYVSLGNG KNFSGVS
LYKGDLSLSKMLPFVYAGNASNTTNGNLCMTGTLIPEKVKGKIVLCDRGINPRVQKG SVVKEAG
GVGMVLANTAANGDELVADAHLLPATTVGQTTGEAIKKYLTSDPNPTATILFEGTKV GIKPSPVV
AAFSSRGPNSITQEILKPDIIAPGVNILAGWTGGVGPTGLAEDTRRVGFNIISGTSM SCPHVSGLA
ALLKGAHPDWSPAAIRSALMTTAYTVYKNGGALQDVSTGKPSTPFDHGAGHVDPVAA LNPGLV
YDLRADDYLNFLCALNYTSIQINSIARRNYNCETSKKYSVTDLNYPSFAVVFLEQMT AGSGSSSS
SVKYTRTLTNVGPAGTYKVSTVFSSSNSVKVSVEPETLVFTRVNEQKSYTVTFTAPS TPSTTNVF
GRIEWSDGKHVVGSPVAISWI
SEQ 156
MLKALTSSCLQNRFHAVTTAFTPQVRRGTDSNTPLLRVLGSLRSSNRRVPYLSRRFFCSD STDG
SESNSEAAASEAKPAEEGGDADSKASAAMVPTVFKPEDCLTVLALPLPHRPLFPGFY MHIYVKD
PKVLAALLESRKRQAPYAGAFLMKDEQGTDPNVVSASDTEKNIYELKGKDMLNRLHE VGTLAQI
TSIKDDQVILIGHRRIRMAEVVSEEPLTVKVDHLKEQPYNKDDDVIKATSFEVLSTL RDVLKTSSL
WKDHVQTYIQHIGDFNYARLADFGAAISGANKLQCQQVLEELDVHKRLQLTLELVKK EMEISKIQ
ESIARAIEEKISGEQRRYLLNEQLKAIKKELGLETDDKTALSAKFRERLEPNKEKIP VHVMQVIEEE
LTKLQLLEASSSEFNVTRNYLDWLTALPWGNYSDENFDVLRAEQILDEDHYGLTDVK ERILEFIA
VGKLRGTSQGKIICLSGPPGVGKTSIGRSIARALNRKFYRFSVGGLSDVAEIKGHRR TYIGAMPG
KMVQCLKSVGTANPLVLIDEIDKLGRGHAGDPASAMLELLDPEQNANFLDHYLDVPI DLSKVLFV
CTANVVEMIPNPLLDRMEVISIAGYITDEKMHIARDYLEKATRETCGIKPEQVEVTN SALLALIENY
CREAGVRNLQKQIEKIYRKIALKLVREDGEIEPQNAEVGEVEAESIHLSDEIKSKEE IQAGAESAN
GSNDDKASENNAEAEAQGAPVNQTQKSANEDACLQDTQETEKATESEASKTVNKVVV DSPNL
ADYVGKPVFHAERIYDQTPVGVVMGLAWTSMGGSTLYIETSLVEQGEGKGALNVTGQ LGDVMK
ESAQIAHTVARTILQEKEPDNQFFANSKLHLHVPAGATPKDGPSAGCTMITSLLSLA MKKPVKKD
LAMTGEVTLTGKILPIGGVKEKAIAARRSDVKTIIFPSANRRDFDELAPNVKEGLDV HFVDDYKQIF
DLAF
SEQ 157
MQFFRRNPSLHRISSRFLNQVVKTSAYSTKKVYNAGQPTAATHPQLMKEGEITPGITSEE YMQR
RKKLLEFLPENSLAIVAAAPIKMMTDVVPYNFRQDADYLYITGCQQPGGVAVLGHDC GLCMFMP
EQSPQDALWQGETAGVDAALQIFKADLAYPINRLPQILSRMIESSSTVFHNVKTRTS SYLELEAY
KKAVSNYKVKDFSVYTHEARFVKSPAELKLMRDSASIACQALVQTMLYSKLFPDEGM LSAKFEY
ECRVRGAQRMAFNPVVGGGPNGSVVHYFRNDQKIEDGNLVLMDVGCELHGYVSDLTR VWPPF
GKFSSVQEELYNLILETNKECVELCRPGTTIREIHHYSVETLRRGFKEIGILKNDRR GRYEMLNPT
NIGHYLGMDVHDCSTIGNDRPLKPGVVITIEPGVYIPSCFDCPERFQGIGFRIEDEV LITESGYEVL
TASIPKEIKHLESLLNNFGSGRGTEIRAALS
SEQ 158
LLTSHKNHIILLPFLLYKIFISLQKQTLMASSTRVFVLLLLIIFNFLYISAQKTIKHKPF SMSFPLTSTSL
SHNSSSKALFLSSLLASNQRKQAPNTKTVSRIPSLNYKSTFKYSMALIVTLPIGTPP QNQQMVLD
TGSQLSWIQCHKKIPKRPPPTTSFDPSLSSTFSVLPCTHPLCKPRIPDFTLPTTCDQ NRLCHYSY
FYADGTLAEGNLVREKITFSRSQSTPPLILGCATESEDAEGILGMNLGRFSFASQAK VQKFSYCV
PIRQGSHAVKPSGTFYLGQNPNSHTFQYINLLTFPQSQRMPNLDPLAFTVGMVGIKI GGKKLNIS
GRVFRPNAGGSGQTIIDSGTEYTFLVEEAYNKVREEIVRLVGPRLKKGYVYGGALDM CFDNRPM
EIGRLIGDMTLQFENGVEILINKERMLDEVEGGIHCVGIGRSESLGIASNIIGNFHQ QNLWVEFDM
RNRRVGFGKGECSRQM
SEQ 159
MAALNFFIIFTSLVLPIASDPLLSTYVVHVDTKAKPSHYLTQDEWYNSVVESVLANKMDS DSTSPR
LFYSYDVVLQGFAARLTDQESEKLNKFPEVIHIFKDQSRIKLDTTRSPNFLGLNTGY GLWPQSNF
GDDVIIGLVDTGIWPESESFKDNGIGPIPTRWKGKCVDGIEFNATSSCNRKLIGARN FVKGVEND
YHHQSARDQNGHGTHTASTAAGTEVNGANVFGFAKGKARGIASKARIAMYKACGSSS CAESDI
LAAIESAIKDGVDILSLSLGYDDAPFYENPVAIATFAAVKRNIFVASSAGNLGPYPF SVHNTAPWV
TTVGAGSLDRDFPVEINLSNNKTFVGSSLYPGRISGKSYSLVYIENCSIMTIDRSKV ERKIVVCNT
SKIEALRNGILIQKAGGFGLIQLNLPTEGEGIRAMAYTLPSATLGYKEGIELLSYIK SNANPRAGFV
RRKDTVIGKKVRAPIVASFSSRGPNVVVPEVLKPDLIAPGLNILAAWPGDISPTRLK MDPRRVKFN
INSGTSMACPHIAGVAALVRAVHPDWSPAAIKSALMTTSTAFDNAQLPIIKHEDMEL ATPISIGAG
HVNPESAIDPGLIYDTDTSDYINLLCSLNYTEKQMKLFTNESNPCSGFTGSPLDLNY PSLSVMFR
PDSYVHVVKKTLTHVAVSKPEVYKVKIVNLNSEKVSLSIEPRKLIFNESLQKQSYVV KFESHYAFN
SSRKIAEQMAFGSILWESEKHNVRSPFAVMWVQQNFNNSRLYK
SEQ 160
MEISKIQESIARAIEEKISGEQRRYLLNEQLKAIKKELGLETDDKTALSAKFRERLEPNK EKIPVHV
MQVIEEELTKLQLLEASSSEFNVTRNYLDWLTALPWGSYSDENFDVLRAEQILDEDH YGLTDVK
ERILEFIAVGKLRGTSQGKIICLSGPPGVGKTSIGRSIARALNRKFYRFSVGGLSDV AEIKGHRRTY
IGAMPGKMVQCLKSVGTANPLVLIDEIDKLGRGHAGDPASAMLELLDPEQNANFLDH YLDVPIDL
SKVLFVCTANVVEMIPNPLLDRMEVISIAGYITDEKVHIARDYLEKATRETCGIKPE QVEVTDSALL ALIENYCREAGVRNLQKQIEKIYRKIALKLVREDGEIEPQNAEVDEVKAESIHLSDEIKS KEEIQAG
AESANGSNDDEASENNAEAEAQGAENQTQKSANEDTCLQDTQETEKATESEASKTVN KVVVD
SPNLADYVGKPVFHAERIYDQTPVGVVMGLAWTSMGGSTLYIETSLVEQGEGKGALN VTGQLG
DVMKESAQIAHTVARTILLEKEPDNQFFANSKLHLHVPAGATPKDGPSAGCTMITSL LSLAMKKP
VKKDLAMTGEVTLTGKILPIGGVKEKAIAARRSDVKTIIFPSANRRDFDELAPNVKE GLDVHFVDD
YKQIFDLAF