Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
GENE EDITING USING A MESSENGER RIBONUCLEIC ACID CONSTRUCT
Document Type and Number:
WIPO Patent Application WO/2021/067394
Kind Code:
A1
Abstract:
Embodiments of the present disclosure are directed to a method that includes contacting a population of plant cells with a messenger ribonucleic acid (mRNA) construct including a sequence encoding a rare‑cutting endonuclease and a detectable label, wherein the rare‑cutting endonuclease is configured to induce a mutation at a target genomic locus. The method further includes screening the population of plant cells for the detectable label to identify target plant cells that are genetically transformed with the mRNA construct.

Inventors:
SHAN QIWEI (US)
Application Number:
PCT/US2020/053469
Publication Date:
April 08, 2021
Filing Date:
September 30, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
CALYXT INC (US)
International Classes:
C12N15/01; C12N9/22; C12N15/62
Foreign References:
US20150064789A12015-03-05
US20140075593A12014-03-13
Other References:
FERNÁNDEZ‐ÁBALOS JOSÉ MANUEL, FOX HELEN, PITT CHRIS, WELLS BRIAN, DOONAN JOHN H: "Plant-adapted green fluorescent protein is a versatile vital reporter for gene expression, protein localization and mitosis in the filamentous fungus, Aspergillus nidulans", MOLECULAR MICROBIOLOGY, vol. 27, no. 1, January 1998 (1998-01-01), pages 121 - 130, XP055813590
MA ET AL.: "Visualization of repetitive DNA sequences in human chromosomes with transcription activator-like effectors", PNAS, vol. 110, no. 52, 24 December 2013 (2013-12-24), pages 21048 - 21053, XP055392927, DOI: 10.1073/pnas.1319097110
See also references of EP 4022055A4
Attorney, Agent or Firm:
LORFING, Abigail A. et al. (US)
Download PDF:
Claims:
CLAIMS

1. A method, comprising: contacting a population of plant cells with a messenger ribonucleic acid (mRNA) construct including a sequence encoding a rare-cutting endonuclease and a detectable label, wherein the rare-cutting endonuclease is configured to induce a mutation at a target genomic locus; and screening the population of plant cells for the detectable label to identify target plant cells that are genetically transformed with the mRNA construct.

2. The method of claim 1, wherein contacting the population of plant cells includes delivering the mRNA construct into the population of plant cells derived using at least one of polyethylene glycol (PEG)-mediated transformation, electroporation, particle bombardment, and microinjection mediated protoplast transformation.

3. The method of claim 1, further including preparing the mRNA construct using in- vitro transcription, wherein the mRNA construct includes a transcription activator like effector nuclease (TALEN) mRNA including the sequence encoding the rare-cutting endonuclease and the detectable label.

4. The method of claim 1, wherein the rare-cutting endonuclease is conjugated to the detectable label or is a fusion protein including the rare-cutting endonuclease and the detectable label.

5. The method of one of claims 1, wherein screening the population of plant cells for the detectable label includes isolating the target plant cells that have the detectable label from a remainder of the population of plant cells.

6. The method of claim 5, wherein the detectable label includes a first detectable label and a second detectable label and wherein the rare-cutting endonuclease includes a first half-transcription activator like effector nuclease (TALEN) that is labeled with the first detectable label and a second half-TALEN that is labeled with the second detectable label, and wherein isolating the target plant cells from the remainder includes isolating the target plant cells that have the first detectable label and the second detectable label.

7. The method of claim 5, wherein isolating the target plant cells includes using fluorescence activated cell sorting (FACS) with a nozzle having a diameter of at least 100 um and up to 200 um.

8. The method of claim 1, wherein the plant cells are plant protoplasts and the method further includes: culturing the target plant cells that are transformed with the mRNA construct; and regenerating plants from the cultured target plant cells, wherein the regenerated plants express the mRNA construct.

9. A non-naturally occurring plant, generated by a genomic editing technique, wherein the genomic editing technique comprises: contacting a population of plant cells with a messenger ribonucleic acid (mRNA) construct that includes a sequence encoding a rare-cutting endonuclease and a detectable label, wherein the rare-cutting endonuclease is configured to induce a mutation at a target genomic locus; screening the population of plant cells for the detectable label to identify target plant cells that are transformed with the mRNA construct; and regenerating a non-naturally occurring plant from the target plant cells.

10. The non-naturally occurring plant of claim 9, wherein the mRNA construct comprises an mRNA coding sequence including: a rare-cutting endonuclease sequence encoding the rare-cutting endonuclease; and a detectable label sequence encoding the detectable label.

11. A messenger ribonucleic acid (mRNA) construct, comprising: an mRNA coding sequence including: a rare-cutting endonuclease sequence; and a detectable label sequence; and a promoter sequence upstream from the mRNA coding sequence.

12 The mRNA construct of claim 11 , further including a first untranslated region (UTR) upstream from the mRNA coding sequence and downstream from the promoter sequence.

13. The mRNA construct of clam 12, further include a second UTR that is downstream from the mRNA coding sequence.

14. The mRNA construct of claim 11, wherein the rare-cutting endonuclease sequence includes a sequence encoding a transcription activator like effector nuclease (TALEN).

15. The mRNA construct of claim 14, wherein the detectable label sequence encodes a first detectable label and a second detectable label, and the rare-cutting endonuclease encodes a first half-TALEN that is labeled with the first detectable label and a second half-TALEN that is labeled with the second detectable label.

16. The mRNA construct of claim 15, wherein the first detectable label and the second detectable label are different.

17. The mRNA construct of claim 15, wherein: the first half-TALEN includes a first binding domain and a first endonuclease domain that forms a first fusion protein with the first detectable label; and the second half-TALEN includes a second binding domain and a second endonuclease domain that forms a second fusion protein a second detectable label.

18. The mRNA construct of claim 15, wherein the first detectable label and the second detectable label each include a fluorescent protein.

19. The mRNA construct of claim 11, wherein the mRNA construct encodes the rare- cutting endonuclease sequence and the detectable label sequence separated by a flexible linker sequence.

20. The mRNA construct of claim 11 , wherein the detectable label sequence includes a detectably labeled nucleotide.

Description:
GENE EDITING USING A MESSENGER RIBONUCLEIC ACID CONSTRUCT

INCORPORATION-BY-REFERENCE OF MATERIAL SUBMITTED ELECTRONICALLY

[0001] Incorporated by reference in its entirety is a computer-readable nucleotide/amino acid sequence listing, an ASCII text file which is 113 kb in size, submitted concurrently herewith, and identified as follows: “C1633108111_SequenceListing_ST25” and created on September 29, 2020.

BACKGROUND

[0002] Genome editing technologies using engineered nucleases, such as Transcription activator-like effector nucleases (TALEN), Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and related CRISPER associated protein 9 (Cas9) or Cpfl systems, have accelerated basic biology research, biotechnology, breeding, and gene therapy. Plant genome editing typically starts with transforming explant tissue with a deoxyribonucleic acid (DNA) genome editing vector either by Agrobacterium spp. or biolistic methods. Transformation is followed by tissue culture, including antibiotic or herbicide selection and regeneration of edited plantlets. The resulting primary generation plantlets are transgenic as exogenous nucleic acids are incorporated in the plant genome. For sexually reproducing plants, the transgene element can be segregated out in following generations by self-pollination or crossing with a wild-type plant. Such segregation efforts require significant time and resources to ultimately obtain plants without transgenes.

[0003] Scientists have tried several different methods to conduct genome editing without transgenic DNA integration. Non-transgenic approaches to gene editing are desirable for multiple reasons. Many plant species, especially root, tuber, and fruit bearing species including potato, strawberry, apple, grapes, and bananas are propagated asexually and can present a challenge for gene editing because exogenous nucleic acids cannot be removed by segregation. Previous approaches for non-transgenic gene editing are burdensome, require significant screening efforts to identify plants with the intended edits, and produce inconsistent results.

[0004] Accordingly, there remains a need for efficient techniques that allow for enrichment of gene edited events and that avoid exogenous DNA integration into the target cell genome.

SUMMARY

[0005] The present disclosure is directed to overcoming the above-mentioned challenges and needs related to gene editing. This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description.

[0006] In some embodiments, a method of gene editing comprises contacting a population of plant cells with a messenger ribonucleic acid (mRNA) construct including a sequence encoding a rare-cutting endonuclease and a detectable label. The rare-cutting endonuclease is configured to induce a mutation at a target genomic locus. The method further includes screening the population of plant cells for the detectable label to identify target plant cells that are genetically transformed with the mRNA construct.

[0007] In some embodiments, contacting the population of plant cells includes delivering the mRNA construct into the population of plant cells derived using at least one of polyethylene glycol (PEG) mediated transformation, electroporation, particle bombardment, and microinjection mediated protoplast transformation, as well as various combinations thereof.

[0008] In some embodiments, screening the population of plant cells for the detectable label includes isolating the target plant cells that have the detectable label from a remainder of the population of plant cells. In some embodiments, isolating the target cells includes using fluorescence activated cell sorting (FACS) with a nozzle having a diameter of at least 100 micrometers (um) and up to 200 um.

[0009] In some embodiments, the method further includes preparing the mRNA construct using in-vitro transcription, where the mRNA construct includes a TALEN mRNA including the sequence encoding the rare-cutting endonuclease and the detectable label. [0010] In some embodiments, the rare-cutting endonuclease is a fusion protein and the sequence includes an endonuclease sequence encoding the rare-cutting endonuclease and a detectable label sequence encoding the detectable label. In some embodiments, the rare-cutting endonuclease includes a first half-TALEN that is labeled with a first detectable label and a second half-TALEN that is labeled with a second detectable label. [0011] In some embodiments, the first detectable label and the second detectable label are different. In some embodiments, the first half-TALEN includes a first binding domain and a first endonuclease domain, and the first half-TALEN forms a first fusion protein with the first detectable label. In some embodiments, the second half-TALEN includes a second binding domain and a second endonuclease domain, and the second half-TALEN forms a second fusion protein with a second detectable label. The first detectable label and second detectable label can be label domains of the first and second fusion proteins, respectively. In some embodiments, the endonuclease domains and detectable label domains are separated by a flexible linker. In such embodiments, isolating the target plant cells from the population includes isolating the target plant cells that have or exhibit the first detectable label and the second detectable label.

[0012] In some embodiments, the detectable label sequence includes a fluorescent protein sequence. In some embodiments, the fluorescent protein is yellow fluorescent protein (YFP), red fluorescent protein (RFP), blue fluorescent protein (BFP), and the like. [0013] In some embodiments, the rare-cutting endonuclease is conjugated to a detectable label. In some embodiments, the first half-TALEN is conjugated to a first detectable label and the second half-TALEN is conjugated to a second detectable label. In further embodiments, the first detectable label and the second detectable label are different. The detectable label can be a fluorophore, such as, Alexa Fluor 488, Alexa Fluor 647, Texas Red, FITC, or the like.

[0014] In some embodiments, the plant cells are plant protoplasts. In such embodiments, the method can further include culturing the target plant cells that are transformed with the mRNA construct and regenerating plants from the cultured target plant cells, where the regenerated plants express the mRNA construct.

[0015] Some embodiments are directed to a non-naturally occurring plant, generated by a genomic editing technique. In such embodiments, the genomic editing technique comprises contacting a population of plant cells with an mRNA construct that includes a sequence encoding a rare-cutting endonuclease and a detectable label. The rare-cutting endonuclease can be configured to induce a mutation at a target genomic locus. The genomic editing technique further includes screening the population of plant cells for the detectable label to identify target plant cells that are transformed with the mRNA construct, and regenerating a non- naturally occurring plant from the target plant cells.

The mRNA construct can include an mRNA coding sequence including a rare-cutting endonuclease sequence encoding the rare-cutting endonuclease, and a detectable label sequence encoding the detectable label.

[0016] Some embodiments are directed to an mRNA construct comprising an mRNA coding sequence and a promoter sequence. The mRNA coding sequence includes a rare- cutting endonuclease sequence and a detectable label sequence. The promoter sequence is upstream from the mRNA coding sequence. The promoter sequence can be operatively linked to the rare-cutting endonuclease sequence.

[0017] In some embodiments, the mRNA construct further includes a first untranslated region (UTR) upstream from the mRNA coding sequence and downstream from the promoter sequence. In some embodiments, the mRNA construct further includes a second UTR downstream from the mRNA coding sequence.

[0018] In some embodiments, the rare-cutting endonuclease sequence includes a sequence encoding a TALEN. For example, the rare-cutting endonuclease sequence can encode a binding domain and an endonuclease domain of the TALEN.

[0019] In some embodiments, the detectable label includes a first detectable label and a second detectable label, and the rare-cutting endonuclease includes a first half-TALEN that is labeled with the first detectable label and a second half-TALEN that is labeled with the second detectable label. In some embodiments, the first detectable label and the second detectable label are different.

[0020] In some embodiments, the first half-TALEN includes a first binding domain and a first endonuclease domain that forms a first fusion protein with the first detectable label. In such embodiments, the second half-TALEN includes a second binding domain and a second endonuclease domain that forms a second fusion protein with a second detectable label. The first detectable label can be a first label domain of the first fusion protein and the second detectable label can be a second label domain of the second fusion protein. In some embodiments, the first detectable label and the second detectable label each include a fluorescent protein.

[0021] In some embodiments, the first half-TALEN is conjugated to the first detectable label, and the second half-TALEN is conjugated to the second detectable label. [0022] In some embodiments, the rare-cutting endonuclease sequence and the detectable label sequence are separated by a flexible linker sequence.

[0023] In some embodiments, the detectable label sequence includes a detectably labeled nucleotide. In further embodiments, the detectably labeled nucleotide includes a fluorophore.

[0024] In some embodiments, the plant cells are plant protoplasts.

[0025] In some embodiments, the plant cells are, or are derived from, protoplasts, callus, immature embryos, somatic embryos, embryo axis, meristematic tissue, leaf tissue, stem tissue, or root tissue.

[0026] In some embodiments, the plant cells are dicotyledonous plant cells. In some embodiments, the dicotyledonous plant cells are soybean, canola, alfalfa, potato, and the like. In other embodiments, the plant cells are monocotyledonous plant cells. In some embodiments, the monocotyledonous plant cells are corn, wheat, oats, and the like.

BRIEF DESCRIPTION OF THE DRAWINGS [0027] Various example embodiments can be more completely understood in consideration of the following detailed description in connection with the accompanying drawings, in which:

[0028] FIG. 1 is a flow diagram illustrating an example method for gene editing a population of plant cells, consistent with the present disclosure.

[0029] FIGs. 2A-2B are diagrams illustrating example mRNA constructs, consistent with the present disclosure.

[0030] FIGs. 3A-3F are diagrams illustrating example mRNA coding sequences of mRNA constructs, such as the mRNA constructs illustrated by FIGs. 2A-2B, consistent with the present disclosure.

[0031] FIG. 4 is a flow diagram illustrating an example method for gene editing a population of plant cells, consistent with the present disclosure. [0032] FIG. 5 is a flow diagram illustrating another example method for gene editing a population of plant cells, consistent with the present disclosure.

[0033] FIGs. 6A-8C illustrate example flow cytometry data demonstrating sorting of protoplasts transformed using the nucleic acid constructs of Table 2, consistent with the present disclosure.

[0034] FIGs. 9A-9D illustrate microscopy images of plant cells transformed using the nucleic acid constructs of Table 4, consistent with the present disclosure.

[0035] FIG. 10 illustrates detected deletions of plants transformed using the nucleic acid constructs of Table 4, consistent with the present disclosure.

DETAILED DESCRIPTION

[0036] Aspects of the present disclosure are directed to a variety of methods, constructs, and plants involving and/or developed using non-DNA constructs that encode rare-cutting endonucleases and a detectable label. These methods include direct delivery of RNA and/or protein to the plant cells. Example embodiments include contacting a population of plant cells with an mRNA construct to transform the plant cells. The mRNA construct encodes the rare-cutting endonuclease and the detectable label, and the rare-cutting endonuclease can induce a mutation at a target genomic locus. The contacted population of plant cells can be screened for cells with the mutation at the target genomic locus. While the present invention is not necessarily limited to such applications, various aspects of the invention may be appreciated through a discussion of various embodiments using this context.

[0037] Accordingly, in the following description various specific details are set forth to describe specific embodiments presented herein. It should be apparent to one skilled in the art, however, that one or more other examples and/or variations of these embodiments can be practiced without all the specific details given below. In other instances, well known features have not been described in detail so as not to obscure the description of the embodiments herein. For ease of illustration, the same reference numerals can be used in different diagrams to refer to the same elements or additional instances of the same element.

[0038] Plant transformation and tissue culture present significant limitations to genome editing efforts and are costly in terms of time, labor and materials to develop and implement specialized protocols. Non-DNA gene editing, sometimes herein referred to as “DNA-free editing”, typically requires time-consuming and expensive dedicated protocols to generate and deliver reagents but can save time by not requiring incorporation of transgenic DNA. Methods consistent with embodiments of the present disclosure can include delivering an in vitro- purified mRNA construct into plant tissues or plant cells derived from plant tissues. The mRNA construct includes the non-DNA gene editing reagents, such as the encoded rare-cutting endonuclease, and a detectable label used to identify plant cells and/or plant tissue transformed by and/or including the mRNA construct. The plant cells transiently exposed to the non-DNA gene editing reagents can be screened to identify plant cells and/or plant tissue transformed by and/or that include the mRNA construct through physical means, such as FACS. The plant cells that contain the intended gene edit(s) can be separated from the remainder of the plant cell population. Example methods in accordance with the present disclosure can reduce the laborious process of screening for desired mutations or edits. In some embodiments, example methods directed to gene edits on sexually reproduced plants or other types of plants can avoid any requirement for imposed segregation and avoid transformants that include DNA integrations into the genome.

[0039] Turning now the figures, FIG. 1 is a flow diagram illustrating an example method 100 for gene editing a population of plant cells, consistent with the present disclosure. The plant cells can be derived from a variety of different types of plants and/or plant tissue. As non-limiting examples, the plant cells can include and/or can be derived from protoplasts, callus, immature embryos, somatic embryos, embryo axis, meristematic tissue, leaf tissue, stem tissue, root tissue, etc. The plants can include dicotyledonous plants and plant cells, such as soybean, canola, alfalfa, potato, and the like, as well as monocotyledonous plants and plant cells, such as com, wheat, oats, and the like.

[0040] At 102, the method 100 includes contacting a population of plant cells with an mRNA construct. As used herein, an mRNA construct includes and/or refers a nucleic acid sequence including one or more binary vectors carrying genome editing reagents, a detectable label, and a promoter. The genome editing reagents can include or encode an endonuclease, such as a TALEN mRNA. For example, the mRNA construct includes a sequence encoding a rare-cutting endonuclease and a detectable label. The rare-cutting endonuclease can include a TALEN and related Fokl protein, or CRISPR and related Cas9 or Cpfl, among other endonucleases. The detectable label can include a fluorescent protein, a fluorophore, or nucleotide bound to a fluorophore, among other types of labels. In some embodiments, the rare-cutting endonuclease is a TALEN that includes an endonuclease domain and a binding domain (sometimes referred to as a “TALE domain”). The binding domain can be configured to bind a target location and the endonuclease domain is configured to induce a mutation at a target genomic locus associated with the target location.

[0041] As used herein, a domain includes and/or refers to a conserved part of a protein sequence and tertiary structure of the protein that can form a three-dimensional structure. The domains can be encoded by the mRNA constructs, as further described below.

[0042] The mRNA construct can include a variety of nucleic acid segments, selected and arranged to facilitate transport of genome editing reagents in the plant cells. For instance, the mRNA construct can include a TALEN mRNA that includes the sequence encoding the rare-cutting endonuclease and the detectable label. In some embodiments, the mRNA construct includes an mRNA coding sequence, a UTR, and the promoter sequence. The UTR can be upstream from the mRNA coding sequence, such as a 5’

UTR. In some embodiments, the mRNA construct can include the mRNA coding sequence, the promoter sequence, and a UTR downstream from the mRNA coding sequence, such as a 3’ UTR. In various embodiments, the mRNA construct can include the mRNA coding sequence, a first UTR upstream from the mRNA coding sequence (e.g., a 5’ UTR), a second UTR downstream from the mRNA encoding sequence (e.g., a ‘3 UTR), and a promoter sequence that is upstream the first UTR. Example mRNA constructs are illustrated in FIGs. 2A-2B and discussed further herein.

[0043] Example mRNA constructs in accordance with the present disclosure can have a variety of forms, as further illustrated herein. In some embodiments, the detectable label can include a nucleotide of the mRNA construct that is labeled with a fluorophore. In some embodiments, a plurality of nucleotides of the mRNA construct are labeled with a fluorophore.

[0044] Contacting the population of plants cells with the mRNA construct can include delivering the mRNA construct into the population of plant cells. The mRNA construct can be delivered into the plant cells via different approaches including, but not limited to, PEG-mediated transformation, electroporation, particle bombardment, or microinjection mediated protoplast transformation, as well as combinations thereof. Specific examples of the delivery approaches are further described below.

[0045] In various embodiments, prior to contacting a population of plant cells with the mRNA construct at 102, the method 100 can include preparing the mRNA construct using in-vitro transcription. For example, the gene editing reagents can be prepared as a DNA vector that encodes the rare-cutting endonuclease and a promotor to stimulate transcription. In some embodiments, the DNA vector further encodes the detectable label. The gene editing reagents can be mixed with RNA nucleotides and polymerase in a tube and purified, resulting in transcription of the DNA vector to an mRNA construct. In some embodiments, rather than the DNA vector encoding the detectable label, one or more nucleotides of the mRNA construct can be labeled, such as with a fluorophore.

[0046] At 104, the method 100 includes screening the population of plant cells for the detectable label to identify target plant cells that are genetically transformed with the mRNA construct. Target plant cells, as used herein, include and/or refer to plant cells that express the mRNA construct and/or that otherwise exhibit or express the detectable label. The target plant cells can include the intended mutation at the target genomic locus. In some embodiments, the population of plant cells can be screened and target plant cells can be selected for expression of the mRNA construct via the detectable label. Screening the population of plant cells for the detectable label can include isolating target plant cells that have the detectable label from a remainder of the population of plant cells. Various embodiments include FACS based selection of transformed protoplasts. As further described below, isolating target cells can include using FACS with a nozzle having a diameter of at least 100 um and up to 200 um.

[0047] FACS applied to plant protoplasts can be difficult because maintaining live protoplasts after sorting is challenging, plant regeneration from protoplasts is difficult to perform, and debris generated during enzymatic treatment of plant tissue can clog the instrument and hinder the FACS process. For example, with no cell wall for protection, protoplasts are extremely fragile during transportation and sorting. Somewhat surprisingly, various embodiments of the present disclosure include implementing FACS protocols that successfully segregate transformed plant protoplasts and allow for plant regeneration. Method embodiments in accordance with the present disclosure can include a FACS based screening or selection of protoplasts using a 100-200 um diameter nozzle to reduce pressure on the protoplasts as compared to smaller nozzles, such as 85 um and 70 um nozzles. In some specific embodiments, the nozzle can have a diameter of between 100-150 um, between 100-130 um, or between 120-130 um. In more specific embodiments, the nozzle diameter is 120 um, 130 um, 150 um, or 200 um. The larger nozzle size can reduce sorting speed as compared to the smaller nozzles. For example, the larger nozzle size can reduce the sorting speed by about 2-5 million events per hour as compared to the smaller nozzles. However, larger nozzle size can provide increased stability and viability.

[0048] In some embodiments, the detectable label includes a first detectable label and a second detectable label. The rare-cutting endonuclease can include a first half-TALEN (e.g., left-half TALEN (LHT)) that is labeled with the first detectable label and a second half-TALEN that is labeled with the second detectable label (e.g., right-half TALEN (RHT)). In such embodiments, the method 100 can further include isolating the target plant cells that have the first detectable label and the second detectable label. In some embodiments, the first detectable label and second detectable label can be different labels. In other embodiments, the first detectable label and second detectable label can be the same. Although embodiments are not so limited, and the mRNA construct can encode and/or the rare-cutting endonuclease can be labeled with a single detectable label and/or more than two detectable labels. In some embodiments, the mRNA construct itself can be labeled with a fluorophore.

[0049] Accordingly, a number of embodiments are directed to the combination of non-DNA-mediated plant cell editing of protoplast plant cells, along with the selection of target cells receiving both half TALENs using FACS and fluorescent proteins or fluorophore labelling of the two TALENs. Such a combination can allow for a highly efficient method to overcome the obstacle of a non-DNA editing method, where use of traditional selectable markers cannot be employed. Plants regenerated from FACS selected protoplasts can enriched for the intended gene edits, thus reducing the screening efforts typically required with transient gene expression.

[0050] As described above, the individual half TALEN constructs can contain the detectable labels. For example, the individual half TALEN constructs can be fusion proteins that contain fluorescent protein domains, with or without intervening flexible linker domains. Example detectable labels, such as fluorescent proteins, can be incorporated into such a fusion protein. Non- limiting examples of fluorescent proteins include YFP, RFP, and BFP, among others. Although examples are not so limited, and other fluorescent proteins can be used, such as cyan-linker yellow (CLY).

[0051] In various embodiments, the first individual half TALEN construct has a fluorescent protein domain, such as YFP, attached at the N-terminus of the left half TALEN (LHT) separated with a peptide linker, such as GGGGSGGGGS. In such embodiments, the corresponding other individual half TALEN construct has a fluorescent protein, such as RFP attached at the N-terminus of the right half TALEN (RHT) separated with a flexible (peptide) linker, such as GGGGSGGGGS. To improve the mRNA stability and overall expression, UTR sequences, e.g., from the Arabidopsis gene AtlG09740, can be added, flanking the TALEN coding sequences. These expression cassettes can be used for in-vitro transcription to obtain high-quality purified mRNA encoding the TALEN subunits, or for protein expression and purification in a bacterial or insect cell expression system using standard methods.

[0052] In some embodiments, instead of creating fusion proteins with detectable label domains, the purified nuclease proteins can be labeled by a conjugation-based method with a commercial labeling kit such as Alexa Fluor 488 Protein Labeling Kit (Thermo Fisher Scientific, Cat # A10235).

[0053] In some embodiments, the mRNA encoding the nuclease can itself be chemically labeled by incorporating labeled nucleotides into the mRNA during the in vitro transcription process. This incorporation-based labeling method can achieve uniformity and consistency in labeling the mRNA. For example, fluorophore-labeled ChromaTide™ (Thermo Fisher Scientific) uridine-5 '-triphosphates (UTPs) can be enzymatically incorporated into RNA or probes. Cells transformed with the labeled mRNA can then be detected.

[0054] The present disclosure addresses contamination problems through use of antibiotics and fungicides in liquid media, frequent media changes after sorting, and cell sorter sterilization using bleach and ethanol. For example, embodiments in accordance with the present disclosure can avoid the use of antibiotics and/or fungicides as transformed cells are selected based on a detectable label, and not based on resistant gene expression to an antibiotic and/or fungicide. Table 3 as further illustrated herein is an example of FACS canola protoplasts with nucleic acid vectors that include a fluorescent protein, such as a fluorescent protein expression DNA vector.

[0055] Various embodiments of the present disclosure are directed to a non-naturally occurring plant generated by the method 100 described by FIG. 1 and/or the methods 450, 570 described further herein by FlGs. 4-5. For example, the method 100 can further include culturing the identified target plant cells that are transformed with the mRNA construct, and regenerating plants from the cultured target plant cells, where the regenerated plants express the mRNA construct. The plants can be generated using example mRNA constructs, such as those illustrated by FIGs. 2A-2B.

[0056] In some embodiments and consistent with method 100, a non-naturally occurring plant can be generated by a genomic editing technique that includes using an mRNA construct. The mRNA construct can include a rare-cutting endonuclease sequence which encodes the rare cutting endonuclease and a detectable label sequence which encodes or includes the detectable label. The genomic editing technique can include contacting a population of plant cells with the mRNA construct, screening the population of plant cells for the detectable label to identify target plant cells that are transformed with the mRNA construct, and regenerating a non-naturally occurring plant from the identified target plant cells. Other example embodiments of the disclosure are directed to naturally occurring seed, reproductive tissue, or vegetative tissue generated by the method 100 of FIG. 1.

[0057] FIGs. 2A-2B are diagrams illustrating example mRNA constructs 210, 211, consistent with the present disclosure. As shown by FIG. 2A, the mRNA construct 210 includes an mRNA coding sequence 212 and a promoter sequence 214 upstream from the mRNA coding sequence 214. As non-limiting examples, the promoter can include a nopaline synthase promoter (NosPro) or a T7 promoter, among others. Other example promoters can include Sp6 promoter, a T3 promoter, Ubi promoter, a CaMV35S promoter, an ADHI promoter, and ADH1 promoter, a GDS promoter, a TEF1 promoter, a Gall promoter, a CaMKlla promoter, a T71ac promoter, an araBAD promoter, a trp promoter, a lac promoter, a Ptac promoter, among others.

[0058] The mRNA coding sequence 212 can include a detectable label sequence 216 and a rare-cutting endonuclease sequence 218. As further illustrated by FIGs. 3A-3F, the rare-cutting endonuclease sequence 218 can include a sequence encoding a TALEN. In some embodiments, the rare-cutting endonuclease sequence 218 can encode a binding domain and endonuclease domain. The binding domain can be configured to bind to a target sequence. The rare-cutting endonuclease domain can be configured to induce a mutation at a target genomic locus associated with the target location. However, embodiments are not so limited. The detectable label sequence 216 encodes or includes the detectable label, such as a fluorescence protein sequence, a fluorophore, and/or a nucleotide (e.g., an RNA nucleotide) that is labeled with a fluorophore, as further described herein.

[0059] In the embodiments illustrated by FIG. 2 A, the detectable label sequence 216 is upstream from the rare-cutting endonuclease sequence 218. However, embodiments are not so limited, and the rare-cutting endonuclease sequence 218 can be upstream from the detectable label sequence 216. As may be appreciated, upstream can include a location proximal to and/or closer to the 5’ end of the mRNA construct 210 and/or mRNA coding sequence 212 as compared to the referenced sequence. Conversely, downstream can include a location proximal to and/or closer to the 3’ end of the mRNA construct 210 and/or mRNA coding sequence 212 as compared to the referenced sequence. As used herein, a sequence with adjectives listed in front, such as the detectable label sequence 216 and the rare-cutting endonuclease sequence 218, includes or refers to a nucleotide sequence that encodes or is the adjectives (e.g., encodes or is the detectable label).

[0060] In some embodiments and as shown by the mRNA construct 211 of FIG. 2B, the promoter sequence 214 can be upstream from the mRNA coding sequence 212, and at least one UTR 215, 217 can be downstream from the promoter sequence 214 and upstream from the mRNA coding sequence 212. For example, the mRNA coding sequence 211 of FIG. 2B includes a first UTR 215 upstream from the mRNA coding sequence 212, and the promoter sequence 214 is upstream the first UTR 215. In some embodiments, the mRNA construct 211 includes a second UTR 217 that is downstream from the mRNA coding sequence 212. However, embodiments are not so limited, and additional and/or different mRNA constructs are contemplated. For example, the mRNA construct can include no UTR and/or a single UTR as described above.

[0061] As further shown and described by FIGs. 3A-3F, the mRNA coding sequence of example mRNA constructs can have a variety of forms. In a number of embodiments, the detectable label sequence 216 and the rare-cutting endonuclease sequence 218 can form a fusion protein when translated. In some embodiments, the detectable label sequence 216 includes a nucleotide of the mRNA construct that is detectably labeled, such as with a fluorophore.

[0062] FIGs. 3A-3F are diagrams illustrating example mRNA coding sequences of mRNA constructs, such as the mRNA constructs illustrated by FIGs. 2A-2B, consistent with the present disclosure. Each of the mRNA coding sequences illustrated by FIGs. 3A- 3F include the detectable label sequence 216 and the rare-cutting endonuclease sequence 218 as illustrated by FIGs. 2A-2B.

[0063] In some embodiments and as shown by FIG. 3A, the mRNA coding sequence 320 can include the detectable label sequence 322 and the rare-cutting endonuclease sequence 324 which are separated by a flexible linker sequence 326. The flexible linker sequence 326 can include a plurality of nucleotides. For example, the flexible linker sequence 326 can encode a flexible peptide linker. As shown by FIG. 3A, the detectable label sequence 322 can be upstream from the rare-cutting endonuclease sequence 324, however embodiments are not so limited and the detectable label sequence 322 can be downstream from the rare-cutting endonuclease sequence 324.

[0064] FIG. 3B illustrates an example mRNA coding sequence 330 that includes a first half-TALEN sequence 334 and a second half-TALEN sequence 338, which can encode a LHT and a RHT. In some examples, the detectable label sequence can include a first detectable label sequence 332 that labels the first half-TALEN (e.g., the first half- TALEN sequence 334) and a second detectable label sequence 336 that labels the second half-TALEN (e.g., the second half-TALEN sequence 338). As previously described, the first detectable label encoded by the first detectable label sequence 332 and the second detectable label encoded by the second detectable label sequence 336 can be different, such as sequences encoding different florescent proteins and/or fluorophores.

[0065] Each of the first half-TALEN sequence 334 and second half-TALEN sequence 338 can encode a binding domain 325, 335 and an endonuclease domain 327, 337. In some embodiments, the half-TALEN sequences 334, 338 and the detectable label sequences 332, 336 can form and/or encode a first fusion protein and a second fusion protein. For example, the first half-TALEN sequence 334 can encode a first binding domain 325 and a first endonuclease domain 327 that form a first fusion protein with the first detectable label encoded by the first detectable label sequence 332 when translated. The second half-TALEN sequence 338 can encode a second binding domain 335 and a second endonuclease domain 337 that form a second fusion protein with the second detectable label encoded by the second detectable label sequence 336 when translated. [0066] The mRNA coding sequence 330 of FIG. 3B illustrates the detectable label sequences 332, 336 upstream from the TALEN sequences 334, 338, respectively. However, embodiments are not so limited. For example, FIG. 3C illustrates an example mRNA coding sequence 331, which is similar to the mRNA coding sequence 330 but with the first half-TALEN sequence 334 upstream of the first detectable label sequence 332 and the second half-TALEN sequence 338 upstream of the second detectable label sequence 336.

[0067] As previously described, the rare-cutting endonuclease sequence and detectable label sequence can be separated by a flexible linker sequence which encodes or includes a flexible linker. FIG. 3D illustrates an example of an mRNA construct 340 which is similar to the mRNA coding sequence 330 of FIG. 3D with the addition of flexible linker sequences 343, 345 between the detectable label sequences 332, 336 and the half-TALEN sequences 334, 338. For example, the mRNA construct 340 includes a first detectable label sequence 332 and a first half-TALEN sequence 334 that are separated by a first flexible linker sequence 343. The mRNA construct 340 can further include a second detectable label sequence 336 and a second half-TALEN sequence 338 that are separated by a second flexible linker sequence 345. Although not illustrated, the first half-TALEN sequence 334 and the second detectable label sequence 336 can be separated by a third flexible linker sequence.

[0068] The mRNA coding sequence 340 of FIG. 3C illustrates the detectable label sequences 332, 336 upstream from the TALEN sequences 334, 338, respectively. However, embodiments are not so limited. For example, FIG. 3E illustrates an example mRNA coding sequence 341, which is similar to the mRNA coding sequence 340 but with the first half-TALEN sequence 334 upstream of the first detectable label sequence 332 and the second half-TALEN sequence 338 upstream of the second detectable label sequence 336. Similarly, although not illustrated, the first detectable label sequence 332 and the second half-TALEN sequence 338 can be separated by a third flexible linker sequence. [0069] FIG. 3F illustrates an example mRNA coding sequence 347 in which the detectable label sequence includes a detectably labeled nucleotide 349. As shown, the mRNA coding sequence 347 includes the detectably labeled nucleotide 349 which is upstream from the first half-TALEN sequence 334 and the second half-TALEN sequence 338. For example, detectably labeled nucleotide 349 can include a nucleotide of the mRNA construct that is bound to a fluorophore or other detectable label. Although embodiments are not so limited and the detectably labeled nucleotide 349 can be downstream of the second half-TALEN sequence 338. In some embodiments, at least one flexible linker sequence 343, 345 can separate the detectably labeled nucleotide 349 from the first half-TALEN sequence 334 and/or separate the first half-TALEN sequence 334 from the second half-TALEN sequence 338. As may be appreciated, the detectably labeled nucleotide 349 can include a plurality of detectably labeled nucleotides, which can increase the signal strength of the detectable label as compared to a single detectably labeled nucleotide.

[0070] Different example approaches for enriching and/or screening the plant cells for the intended gene edit(s) are now described. Enriching and/or screening the plant cells can increase the representation of plant cells likely to contain the intended genomic edit. [0071] FIG. 4 is a flow diagram illustrating an example method 450 for gene editing a population of plant cells, consistent with the present disclosure. At 452, the method 450 can include developing components of the construct (e.g., mRNA or protein). For an mRNA construct, the components can include the TALEN vector, such as a sequence including a TALEN, a Fusion Protein (FP)-TALEN, a detectable label, a TALE-activator and/or Trex2. Similar components can be prepared for a protein construct. The components can be prepared separately by different techniques. At 454, the method can include identifying whether the construct is an mRNA construct or protein construct. As may be appreciated, step 454 may not occur but is shown to illustrate that different method steps can occur for the developing an mRNA construct or a protein construct. In response to a determination that the construct is an mRNA construct, the method 450 at 456 includes performing in-vitro mRNA transcription and purification, as previous described. At 458, the method 450 optionally includes labeling the mRNA construct with chemical dyes, such as to increase a signal strength of the detectable label and/or to label the nucleotide(s) to include or form the detectable label(s). In some embodiments, in response to determining the construct is a protein construct, at 455, the method 450 includes performing E. coli expression of the protein and column purification. At 457, the method can optionally include labeling the protein construct with chemical dyes, similar to the mRNA construct as described above.

[0072] At 460, the method 450 includes performing PEG-mediated protoplast transformation using the mRNA construct or protein construct. After a period of time, such as around twenty-four hours, at 462, the protoplasts can be sorted with FACs for fluorescent positive cells. At 464, the method 450 can further include collecting the positive cells by culturing on liquid and solid mediums and regenerating into plants. At 466, the plants can be screened by genotyping for the mutation of the target gene.

[0073] In some specific embodiments, the PEG-mediated transformation can start with the isolation of protoplasts from healthy plant tissues that are regenerable, for example, canola young leaf blade, wheat immature embryos, or soybean somatic embryos, embryo axis etc. Next, the tissues can be digested in buffer with enzymes such as cellulose, macerozyme (and/or) pectolyase. After a few hours of digestion, round and intact protoplasts can be isolated in a first buffer, such as mannitol magnesium (MMG), for transformation. The mRNA/protein reagents (e.g., the mRNA construct) can be added into a tube with protoplasts and polyethylene glycol, such as 40% PEG4000. The tube is mixed and incubated, such as for 20-30 minutes. The protoplasts can be washed with a second buffer (e.g., W5 buffer) and transferred into a third buffer (e.g., M8P buffer). The TALENs can be fused with a detectable label, such as a fluorescent protein. After incubation (such as for 16-36 hours), the fluorescent signal can be detected under microscope and/or FACS. If the mRNA construct or protein are labeled with chemical dyes, the mRNA construct or protein can be sorted after transformation. Fluorescent positive cells are collected and transferred into regeneration medium. The protoplasts can be cultured in several rounds of liquid medium, then moved to callus inducing medium (CIM), shoot inducing medium (SIM) and rooting medium (RM).

[0074] Although FIG. 4 illustrates use of PEG-mediated transformation, embodiments are not so limited. In some embodiments, fluorescently labeled TALEN constructs (mRNA and/or protein constructs) are delivered into plant protoplast cells or other tissues using other methods such as electroporation, bombardment, or microinjection mediated protoplast transformation. For larger plant tissues with cell walls such as embryos, bombardment (or biolistics) with gold particles coated with mRNA can be used as delivery methods. Following delivery of the fluorescently labeled endonucleases, e.g., mRNA constructs encoding the endonucleases, FACS can be used to select fluorescent colored positive protoplast cells. In embodiments where two differentially-labeled half TALEN constructs are used, FACS can be used to select dual fluorescent colored positive protoplast cells. And, the selected protoplasts can be regenerated into whole plants, as described above.

[0075] For particle bombardment transformation, the mRNA constructs or proteins can be coated onto particles, such as gold particles. To coat the mRNA or protein(s) on the gold particles, different volumes of mRNA or protein solution are mixed with a fixed amount of gold suspension by pipetting.

[0076] Ammonium acetate and 2-propanol can be used to precipitate the mRNA TALEN onto gold particles. For example, the following protocol can be used:

2 microliters (pi) of TALEN mRNA 1 pi Left half TALEN at 1 micrograms (pg)/pl, and 1 mΐ Right half TALEN at 1 mg/ml) and 1 mΐ of TALE- activator (1 mg/ml),

1 mΐ Ammonium acetate (5 moles (M)),

20 mΐ 2-propanol, and

5 mΐ gold nanoparticles (40 milligrams (mg)/milliliter (ml) for single delivery. [0077] For protein bombardment, the following example protocol can be used:

2 mΐ of TALEN protein (1 mΐ Left half TALEN at 2 pg/mΐ, and 1 mΐ Right half TALEN at 2 pg/mΐ),

1 mΐ of TALE- activator (2 pg/mΐ), and 5 mΐ gold nanoparticles (40 mg/ml) for one delivery.

A PDS-1000/He gene gun (Bio-Rad) can be used according to general settings. Various embodiments include at least substantially the same features and attributes, include Bio- Rad settings, as discussed within Kikkert, et al. Plant Cell, Tissue and Organ Culture, volume 33, pages 221-226 (1993), which is hereby incorporated by reference in its entirety for its general teachings related to Bio-Rad the specific teachings related to example general settings for Bio-Rad.

[0078] Although embodiments are not so limited, and various particle bombardment transformation protocols can be used. [0079] In some embodiments, the detectably labeled endonuclease or the detectably labeled mRNA construct encoding the nuclease can be co-delivered with an in vitro purified exonuclease or mRNA encoding the exonuclease. An example exonuclease is Trex2. Co-delivery of an exonuclease (or an encoding mRNA) and the mRNA construct can increase the efficiency of non-homologous end joining (NHEJ)-mediated deletions at the endonuclease target cutting site, thus further increasing the likelihood and/or the efficiency of the deletion. Some embodiments include the triple co-delivery of the endonuclease reagent (e.g., TALEN), an exonuclease (e.g., Trex2), and a TALE-activator (as further described herein) to further increase efficiency (e.g., frequency) in inducing deletions.

[0080] FIG. 5 is a flow diagram illustrating another example method for gene editing a population of plant cells, consistent with the present disclosure. The method 570 can include steps 452, 454, 456, 458, 455, and 457 as previously described by method 450, and which are not repeated herein. At 580, the method 570 includes delivering the mRNA or protein construct by performing particle bombardment transformation. At 582, the plant tissues can be cultured on solid mediums and regenerated into plants. And, at 584, the plants can be screened by genotyping for the mutation of the target gene.

[0081] In some embodiments, in addition to contacting the population of target cells with an mRNA or protein construct including a sequence encoding the rare-cutting endonuclease, the method 570 (or method 550) further includes contacting the population of target cells with an agent that confers a selective advantage on transiently transformed cells. By conferring a selective advantage, co-administration of the additional agent promotes enhanced growth and proliferation of cells that are transformed with the non-DNA gene editing reagents (see, e.g., Table 3, which indicates this effect). In some embodiments, the agent that confers a selective advantage includes a TALE activator.

The TALE activator can include a TALE DNA binding domain (e.g., a TALEN reagent) and an activator agent. Example activator agents include TALE-VP128, 6TAD and a 6TAD-VP128 fusion. Example activator agents include nucleotide and amino acid sequences set forth in SEQ ID NOs: 22-27. The TALE DNA binding domain (e.g., a TALEN reagent) and the TALE-activator together target genes that promote morphogenic traits. These morphogenic traits can include hormone regulators that regulate cell division. Example target regulator proteins include BBM, WUS, LEC2, GRF5, STM, E2Fa and AGL15 (SEQ ID NOs: 1-7 for example encoding nucleotide sequence and SEQ ID NOs: 8-14 for example protein sequences). The TALE DNA binding component can be configured to specifically bind the promoter sequences of the target regulator gene.

For example, the TALE DNA-binding domain can be configured to selectively bind to a promoter of BBM, WUS, LEC2, GRF5, STM, E2Fa and AGL15, such as a promoter sequence with at least 90% sequence identity to one of the sequences set forth in SEQ ID NOs: 15-21. The combination of the activator agent and the promoter sequence- specific TALE DNA-binding domain facilitate the ability of the associated TALE activator to promote enhanced expression of the target regulator gene in cells that are also transformed with the non-DNA gene editing reagent. The TALE DNA binding domain and associated activator agent (e.g., the TALE activator) can be delivered in the form of an mRNA construct or a protein, so that the method and the product produced thereby remain non-transgenic and/or DNA-free.

[0082] For example, SEQ ID NOs: 1-7 can include coding sequences (CDSs) for BBM, WUS, LEC2, GRF5, STM, E2Fa and AGL15 and SEQ ID NOs: 8-14 can include the protein sequences for BBM, WUS, LEC2, GRF5, STM, E2Fa, and AGL15, which can be derived from SEQ ID NOs: 1-7 and can include protein CDSs. SEQ ID NOs: 15-21 can include nucleic acid sequences of promoters for BBM, WUS, LEC2, GRF5, STM, E2Fa and AGL15. SEQ ID NOs: 22-24 can include CDSs for the activator genes VP128, 6TAD and a 6TAD-VP128 fusion and SEQ ID NOs: 25-27 can include the protein sequences for VP128, 6TAD and a 6TAD-VP128 fusion, which can be derived from SEQ ID NOs: 22-24 and can include the protein CDSs.

[0083] As with FIG. 4, in various embodiments, the method 570 includes co-delivery of a TALEN and an in vitro purified exonuclease, such as a Trex2 mRNA or protein. Co-delivery of an exonuclease increases the efficiency of NHEJ mediated deletions at the endonuclease target cutting site, thus further increasing the likelihood/efficiency of the deletion. Further example embodiments include the triple co-delivery of the endonuclease reagent (e.g., TALEN), an exonuclease such as Trex2, and a TALE- activator, to further increase efficiency (frequency) in inducing deletions.

[0084] For convenience, certain terms employed in the specification, examples, and appended claims are provided here. The definitions are provided to aid in describing particular embodiments and are not intended to limit the claimed invention, as the scope of the invention is limited only by the claims.

[0085] The use of the term "or" in the claims and specification is used to mean "and/or" unless explicitly indicated to refer to alternatives only or the alternatives are mutually exclusive, although the disclosure supports a definition that refers to only alternatives and "and/or."

[0086] The words "a" and "an," when used in conjunction with the word "comprising" or “including” in the claims or specification, denotes one or more, unless specifically noted.

[0087] Unless the context clearly requires otherwise, throughout the description and the claims, the words “include”, “including”, "comprise," "comprising," and the like, are to be construed in an open and inclusive sense as opposed to a closed, exclusive or exhaustive sense. For example, the term "comprising" can be read to indicate "including, but not limited to." The term "consists essentially of" or grammatical variants thereof indicate that the recited subject matter can include additional elements not recited in the claim, but which do not materially affect the basic and novel characteristics of the claimed subject matter.

[0088] Words using the singular or plural number also include the plural and singular number, respectively. The word "about" indicates a number within range of minor variation above or below the stated reference number. For example, "about" can refer to a number within a range of 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, or 1% above or below the indicated reference number.

[0089] As used herein, the term "polypeptide" or "protein" refers to a polymer in which the monomers are amino acid residues that are joined together through amide bonds. When the amino acids are alpha-amino acids, either the L-optical isomer or the D- optical isomer can be used, the L-isomers being typical. The term polypeptide or protein as used herein encompasses any amino acid sequence and includes modified sequences such as glycoproteins. The term polypeptide, unless noted otherwise, is specifically intended to cover naturally occurring proteins, as well as those that are recombinantly or synthetically produced.

[0090] One of skill will recognize that individual substitutions, deletions or additions to a peptide, polypeptide, or protein sequence which alters, adds or deletes a single amino acid or a percentage of amino acids in the sequence is a "conservatively modified variant" where the alteration results in the substitution of an amino acid with a chemically similar amino acid. Conservative amino acid substitution tables providing functionally similar amino acids are well known to one of ordinary skill in the art. The following six groups are examples of amino acids that are considered to be conservative substitutions for one another: i. Alanine (A), Serine (S), Threonine (T), ii. Aspartic acid (D), Glutamic acid (E), iii. Asparagine (N), Glutamine (Q), iv. Arginine (R), Lysine (K), v. Isoleucine (I), Leucine (L), Methionine (M), Valine (V), and vi. Phenylalanine (L), Tyrosine (Y), Tryptophan (W).

[0091] The term "nucleic acid" refers to a DNA or RNA nucleic acid and sequences of nucleic acids in either single- or double-stranded form, and unless otherwise limited, encompasses known analogs of natural nucleotides that hybridize to nucleic acids in manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence includes the complementary sequence thereof.

[0092] Reference to sequence identity addresses the degree of similarity of two polymeric sequences, such as protein sequences or nucleic acid sequences. Determination of sequence identity can be readily accomplished by persons of ordinary skill in the art using accepted algorithms and/or techniques. Sequence identity is typically determined by comparing two optimally aligned sequences over a comparison window, where the portion of the peptide or polynucleotide sequence in the comparison window can include additions or deletions (e.g., gaps) as compared to the reference sequence (which does not include additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical amino-acid residue or nucleic acid base occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the window of comparison and multiplying the result by 100 to yield the percentage of sequence identity. Various software driven algorithms are readily available, such as BLAST N or BLAST P to perform such comparisons. [0093] Disclosed are materials, compositions, and components that can be used for, can be used in conjunction with, can be used in preparation for, or are products of the disclosed methods and compositions. It is understood that, when combinations, subsets, interactions, groups, etc., of these materials are disclosed, each of various individual and collective combinations is specifically contemplated, even though specific reference to each and every single combination and permutation of these compounds may not be explicitly disclosed. This concept applies to all aspects of this disclosure including, but not limited to, steps in the described methods. Thus, specific elements of any foregoing embodiments can be combined or substituted for elements in other embodiments. For example, if there are a variety of additional steps that can be performed, it is understood that each of these additional steps can be performed with any specific method steps or combination of method steps of the disclosed methods, and that each such combination or subset of combinations is specifically contemplated and should be considered disclosed. Additionally, it is understood that the embodiments described herein can be implemented using any suitable material such as those described elsewhere herein or as known in the art.

[0094] Various embodiments are implemented in accordance with the underlying provisional application, U.S. Provisional Application No. 62/908,499, filed on September 30, 2019 and entitled “DNA-Free Gene Editing”, to which benefit is claimed and is fully incorporated herein by reference. For instance, embodiments herein and/or in the provisional application may be combined in varying degrees (including wholly). Embodiments discussed in the Provisional Application are not intended, in any way, to be limiting to the overall technical disclosure, or to any part of the claimed invention unless specifically noted.

[0095] While illustrative embodiments have been illustrated and described, it will be appreciated that various changes can be made therein without departing from the scope of the invention. EXPERIMENTAL EMBODIMENTS

[0096] Various experimental embodiments were directed to designing different nucleic acid plasmid vectors, sometimes herein referred to as vectors for ease of reference and which can include the previously described nucleic acid constructs or a portion thereof, such as a DNA or mRNA construct. The vectors include a rare-cutting endonuclease and a detectable label. Specific experiments were designed to show the addition of a detectable label to the plasmid vectors, sorting of transformed protoplasts using FACS, identification and sorting of cells via a detectable label and using FACs, and genetic editing by the plasmid vectors that include the rare-cutting endonuclease and the detectable label. A number of experiments conducted are described herein.

[0097] An experiment was conducted to illustrate different nucleic acid vector designs. The different vectors are shown below in Table 1. The nucleic acid constructs in Table 1 include DNA constructs. However, as may be appreciated, the various DNA vectors can be transcribed to form an mRNA construct using the above-described in-vitro transcription techniques. The constructs include TALEN nucleic acid constructs.

[0098] Table 1. Constructs [0099] The constructs in Table 1 that were generated in the experimental embodiments are described in detail below. The vectors pCLS3 and pCLS4 are vectors that were generated and that include a TALEN that targets the gene BnFAD2 and which are tethered to fluorescent proteins. Vector pCLS3 includes a promoter NosPro, a fluorescent protein YEP, a linker sequence 2xGGGGS, and a LHT tethered to the YEP and that targets the gene BnFAD2. Vector pCLS4 includes a promoter NosPro, a fluorescent protein RFP, a linker sequence 2xGGGGS, and a RHT tethered to the YEP and that targets the gene BnFAD2. The vectors pCLS3 and pCLS4 are complete TALEN constructs. In experimental embodiments, vectors pCLS3 and pCLS4 were used to demonstrate TALEN activity for a TALEN-Fluorescent fusion protein. Vectors pCLS14 and pCLS15 are vectors that were generated and that can be used for in-vitro transcription to generate an mRNA construct encoding a TALEN-fluorescent fusion protein. Vector pCLS14 includes a promoter T7, a 5’ UTR, a fluorescent protein YEP, a linker sequence 2xGGGGS, a LHT tethered to the YFP and that targets the gene BnFAD2, a 3’ UTR, and a poly-A tail. Vector pCLS15 includes a promoter T7, a 5’ UTR, a fluorescent protein RFP, a linker sequence 2xGGGGS, a RHT tethered to the RFP and that targets the gene BnFAD2, a 3’ UTR, and a poly-A tail. Vectors pCLS16 and pCLS17 were generated and used as controls in various experimental embodiments. Vector pCLS16 includes a promoter NosPro and a LHT that targets the gene BnFAD2. Vector pCLS17 includes a promoter NosPro and a RHT that targets the gene BnFAD2.

[0100] A full map sequence of vector pCLS3 is set forth in SEQ ID NO: 28 and an expression cassette from vector pCLS3 is set forth in SEQ ID NO: 29. A full map sequence of vector pCLS4 is set forth in SEQ ID NO: 30 and an expression cassette from vector pCLS4 is set forth in SEQ ID NO: 31. A full map sequence of vector pCLS14 is set forth in SEQ ID NO: 32 and an expression cassette from vector pCLS14 is set forth in SEQ ID NO: 33. A full map sequence of vector pCLS15 is set forth in SEQ ID NO: 34 and an expression cassette from vector pCLS15 is set forth in SEQ ID NO: 35. For example, the promoters, NosPro and T7, are based on Agrobacterium tumefaciens sequence (e.g., an Agrobacterium tumefaciens Ti plasmid), YFP is based on Aequorea victoria sequence, RFP is based on Discosoma sp sequence, and the UTRs and/or polyA tail are based on Arabidopsis thaliana sequence. The TALENs (e.g., T03(BnFAD2)-L and T03(BnFAD2)-R) are based on Brassica napus sequence, Xanthomonas sequence, and Flavobacterium okeanokoites sequence. The TALENS include a TALE effector based on Xanthomonas sequence that is further based on and targets Brassica napus sequence (e.g., targets a gene) and a Fokl based on Xanthomonas sequence.

[0101] The remaining example constructs of Table 1 are described below. Vector pCLSl includes a promoter NosPro, a fluorescent protein YFP, a linker sequence 2xGGGGS, and a TALEN backbone for a LHT. Vector pCLS2 includes a promoter NosPro, a fluorescent protein RFP, a linker sequence 2xGGGGS, and a TALEN backbone for a RHT. The vectors pCLS 1 and pCLS2 include entry level vectors having Bsal cutting sites for TALE GG cloning. Bsal is a type II restriction endonuclease and a non-limiting example of a Bsal cutting site includes GGTCTCN'NNNN. Vectors pCLS5- pCLS13 include entry level vectors and/or portions of vectors which can be used for in- vitro transcription to generate an mRNA construct. For example, vector pCLS5 includes a promoter T7, a 5’ UTR, a TALEN backbone for a LHT, a ‘3 UTR, and a poly-A tail. Vector pCLS6 includes a promoter T7, a 5’ UTR, a TALEN backbone for a RHT, a ‘3 UTR, and a poly-A tail. Vector pCLS7 includes a promoter T7, a 5’ UTR, a fluorescent protein YFP, a linker sequence 2xGGGGS, a TALEN backbone for a LHT, a ‘3 UTR, and a poly-A tail. Vector pCLS8 includes a promoter T7, a 5’ UTR, a fluorescent protein RFP, a linker sequence 2xGGGGS, a TALEN backbone for a RHT, a ‘3 UTR, and a poly-A tail. Vector pCLS9 includes a promoter T7, a fluorescent protein YFP, and a poly- A tail. Vector pCLSIO includes a promoter T7, a 5’ UTR, a fluorescent protein YFP, a ‘3 UTR, and a poly-A tail. Vector pCLSl 1 includes a promoter T7, a 5’ UTR, Trex2, a ‘3 UTR, and a poly-A tail. Vector pCLS12 includes a promoter T7, a 5’ UTR, a LHT that targets the gene BnFAD2, a 3’ UTR, and a poly-A tail. Vector pCLSl 3 includes a promoter T7, a 5’ UTR, a RHT that targets the gene BnFAD2, a 3’ UTR, and a poly-A tail. Embodiments are not limited to targeting of a specific gene, such as BnFAD2.

[0102] Various embodiments are directed to constructs that include activator agents, such as illustrated by vectors pCLSl 8- pCLS20. For example, vector pCLSl 8 includes a promoter T7, a ‘5 UTR, a TALEN, an activator agent VP128, a 3’ UTR, and a poly-A tail. Vector pCLS19 includes a promoter T7, a ‘5 UTR, a TALEN, an activator agent 6TAD, a 3’ UTR, and a poly-A tail. Vector pCLS20 includes a promoter T7, a ‘5 UTR, a TALEN, a first activator agent VP128, a second activator agent 6TAD, a 3’ UTR, and a poly-A tail. [0103] Another example experiment was conducted to illustrate transformation of protoplasts with detectable labels. More specifically, canola protoplasts were transformed using the nucleic acid constructs illustrated in Table 2. As shown in Table 2, the constructs were DNA constructs that encoded fluorescent proteins used to label the canola protoplast. Table 3 illustrates example results of sorting the transformed canola protoplasts by the florescent proteins using FACS.

[0104] Table 2. DNA vectors [0105] Table 3. FACS canola protoplasts with fluorescent protein expression DNA vector

[0106] FIGs. 6A-8C illustrate example flow cytometry data demonstrating sorting of protoplasts transformed using the nucleic acid constructs of Table 2, consistent with the present disclosure. For example, FIGs. 6A-8C show raw data from flow cytometry experiments demonstrating the ability to sort plant protoplasts using fluorescence. FIGs. 6A-6C show raw flow cytometry data from experimental results of sorting Sample 2 in Table 2 which included canola protoplasts transformed to express YFP using vector pCLS21. FIGs. 7A-7C show raw flow cytometry data from experimental results of sorting Sample 4 in Table 2 which included canola protoplasts transformed to express RFP using vector pCLS23. FIGs. 8A-8C show raw flow cytometry data from experimental results of sorting Sample 5 in Table 2 which included canola protoplasts transformed to express YFP and RFP using vectors pCLS21 and pCLS23.

[0107] A further example experiment was conducted to show protoplast transformed with a nucleic acid construct that has a rare-cutting endonuclease and a detectable label. For example, canola protoplasts were transformed using plasmid vectors illustrated by Table 4.

[0108] Table 4. Canola Protoplasts Transformation

[0109] Table 4 illustrates example nucleic acid constructs used to transform canola protoplasts. The constructs generated included previously described vectors pCLS3, pCLS4, pCLS16, pCLS17, p pCLS21, and pCLS23. Each of the plasmid vectors 1 and 2 (e.g., referred to as “Plasmid 1” and “Plasmid 2”) of Samples A-F included DNA and a quantity of 20 ug. Samples A-E of Table 4 included a 200,000 protoplasts. Samples A-D were prepared using the same Illumina sequence for analysis. Samples E-F were used as controls. The vectors were used to transform canola protoplasts to compare the gene editing efficiency of fluorescently labeled TALEN nucleic acid constructs as compared to constructs without fluorescent labels. As described above, vectors pCLS3 and pCLS4 included the fluorescent proteins YFP and RFP, and vectors pCLS16 and pCLS17 did not. Vectors pCLS21 and pCLS23 were used as controls and included fluorescent labels. [0110] FIGs. 9A-9D illustrate microscopy images of plant cells transformed using the nucleic acid constructs of Table 4, consistent with the present disclosure. FIG. 9A illustrates a microscopy image of canola plant cells from Sample A of Table 4. Sample A included canola protoplasts transformed using vectors pCLS3 and pCLS4. FIGs. 9B-9C illustrate microscopy images of canola plant cells from Sample C of Table 4. Sample C included canola protoplasts transformed using vectors pCLS3 and pCLS4. FIG. 9D illustrates a microscopy image of canola plant cells from Sample F of Table 4. Sample F included a control group of canola protoplasts transformed using vector pCLS21. The images of FIGs. 9A-9C demonstrate expression of YFP-TALEN fusion protein located in the nucleus of protoplasts.

[0111] FIG. 10 illustrates detected deletions of plants transformed using the nucleic acid constructs of Table 4, consistent with the present disclosure. The gene editing efficiencies of Samples A, B, C, and D from Table 4 were compared. Samples A and C included canola protoplasts transformed with constructs encoding TALENs fused to florescent protein (e.g., fusion proteins). Samples B and D included canola protoplasts transformed with constructs encoding TALENs without a detectable label. The TALENs in all Samples A-D targeted the gene BnFAD2. The graph of FIG. 10 illustrates results of a NHEJ mutation assay used to detect deletions in a population of protoplast cells that were transformed with the TALEN or Fluor-TALEN vector plasmids. As shown, Samples A and C resulted in detected deletions representative of activity of TALENS without detectable labels, such as Samples B and D.

[0112] The above described experimental embodiments demonstrate detectable labels being expressed by protoplasts, successfully sorting protoplasts expressing the detectable labels via FACS, and TALEN activity resulting from protoplasts expressing the detectable labels. Embodiments in accordance with the present disclosure are not limited to that demonstrated by the experimental embodiments and can include a variety of different types of constructs including different types of endonucleases, detectable labels, target genes, and mutations. SEQUENCE LISTING FREE TEXT

[0113] SEQ ID NOs: 1-21 are each based on Glycine max sequence. SEQ ID NOs: 22 and 25 are each based on herpes simplex vims sequence. SEQ ID NOs: 23 and 26 are each on based on Xanthomonas campestris sequence. SEQ ID NOs: 24 and 27 are each based on herpes simplex virus sequence and Xanthomonas campestris sequence. SEQ ID NOs: 28 and 29 are each a synthetic construct based on Agrobacterium tumefaciens sequence, Aequorea victoria sequence, Brassica napus sequence, Xanthomonas sequence, and Flavobacterium okeanokoites sequence. SEQ ID NOs: 30 and 31 are each a synthetic construct based on Agrobacterium tumefaciens sequence, Discosoma sp sequence, Brassica napus sequence, Xanthomonas sequence, and Flavobacterium okeanokoites sequence. SEQ ID NOs: 32 and 33 are each a synthetic construct based on Agrobacterium tumefaciens sequence, Aequorea victoria sequence, Arabidopsis thaliana sequence, Xanthomonas sequence, and Flavobacterium okeanokoites sequence. SEQ ID NOs: 34 and 35 are each a synthetic construct based on Agrobacterium tumefaciens sequence, Discosoma sp sequence, Arabidopsis thaliana sequence, Xanthomonas sequence, and Flavobacterium okeanokoites sequence.

SEQUENCE LISTING [0114] SEP ID NO: 1 (GmBBMl CDS)

ATGGGGTCTATGAATTTGTTAGGTTTTTCTCTCTCTCCTCACGAAGAACACCCT

TCTAGTCAAGATCACTCTCAAACGACACCTTCTCGTTTTAGCTTCAACCCTGA

TGGATCAATCTCAAGCACTGATGTAGCAGGAGGCTGCTTTGATCTCACTTCTG

ACTCAACTCCTCATTTACTTAACCTTCCTTCTTATGGCATATACGAAGCATTTC

ACAGAAACAATAGTATTAACACCACTCAAGATTGGAAGGAGAACTACAACA

GCCAAAATTTGCTATTGGGAACTTCGTGCAATAAACAAAACATGAACCAAAA

CCAACAGCAACAGCCAAAGCTTGAAAACTTCCTCGGTGGACACTCATTTGGC

GAACATGAGCAAACCTACGGTGGTAACTCAGCCTCTACAGATTACATGTTTCC

TGCTCAGCCAGTATCGGCTGGTGGTGGTGGTAGTGGTGGTGGCAGTAACAAT

AACAACAACAGTAACTCCATAGGGTTATCCATGATAAAGACATGGTTGAGGA

ACCAACCACCGAACTCAGAAAACATCAACAACAACAATGAAAGTGGTGGCA

ATATTAGAAGCAGTGTGCAGCAAACTCTATCACTTTCCATGAGTACTGGTTCA

CAATCAAGCACATCACTGCCCCTTCTCACTGCTAGTGTGGATAATGGAGAGA GTTCTTCTGATAACAAACAACCAAACACCTCGGCTGCACTTGATTCCACCCAA

ACCGGAGCCATTGAAACTGCACCCAGAAAGTCCATTGACACTTTTGGACAGA

GAACTTCTATCTACCGTGGTGTAACAAGGCATAGGTGGACGGGGAGGTACGA

GGCTCACCTGTGGGATAATAGTTGTAGAAGAGAGGGACAGACTCGCAAAGG

AAGGCAAGGTGGTTATGATAAAGAAGAAAAGGCAGCTAGAGCCTACGATTT

GGCAGCACTAAAATACTGGGGAACAACCACAACAACAAATTTTCCAATTAGC

CACTATGAGAAAGAGTTGGAAGAAATGAAGCACATGACTAGGCAAGAGTAC

GTTGCGTCATTGAGAAGGAAGAGTAGTGGGTTTTCTCGCGGTGCATCCATTTA

TCGAGGAGTGACGAGACACCACCAACATGGAAGGTGGCAAGCGAGGATTGG

AAGAGTTGCTGGCAACAAGGATCTTTACTTGGGAACTTTTAGCACCCAAGAA

GAGGCAGCGGAAGCATATGATGTAGCAGCAATCAAATTCCGAGGACTAAGTG

CTGTTACAAACTTTGACATGAGCAGATATGACGTGAAAAGCATACTTGAGAG

CACCACTTTGCCAATAGGTGGTGCTGCAAAGCGTTTGAAGGATATGGAGCAG

GTTGAACTGAGTGTGGATAATGGTCATAGAGCAGATCAAGTAGATCATAGTA

TCATCATGAGTTCTCACCTAACTCAAGGAATCAATAACAACTATGCAGGAGG

GGGAACAGCAACTCATCATAACTGGCACAATGCTCATGCATTCCACCAACCT

CAACCTTGCACCACCATGCACTACCCTTATGGACAAAGAATTAATTGGTGCA

AGCAAGAACAACAAGACAACTCTGATGCCCCTCACTCTTTGTCTTATTCAGAT

ATTCATCAACTTCAGCTAGGGAACAATGGAACACATAACTTCTTTCACACAA

ATTCAGGGTTGCACCCTATGTTGAGCATGGATTCTGCTTCCATTGACAATAGC

TCTTCTTCTAACTCGGTTGTTTATGATGGTTATGGAGGTGGTGGGGGCTACAA

TGTGATGCCTATGGGAACTACTACTGCTGTTGTTGCAAGTGATGGTGATCAAA

ATCCAAGAAGCAATCATGGTTTTGGTGATAATGAGATAAAAGCACTTGGTTA

TGAAAGTGTGTATGGCTCTGCAACTGATTCTTATCATGCACATGCAAGGAACT

TGTATTATCTTACTCAACAGCAATCATCTTCTGTTGATACAGTGAAGGCTAGT

GCATATGATCAAGGGTCTGCATGCAATACTTGGGTTCCAACTGCTATTCCAAC

TCATGCACCCAGATCAACTACTAGTATGGCTCTCTGCCATGGGGCTACTACAC

CCTTCTCTTTATTGCATGAATAG

[0115] SEP ID NO: 2 (GmWUS CDS)

ATGATGGAACCTCAACAACAACAACAACAAGCACAAGGGAGCCAACAACAA

CAACAAAACGAGGATGGTGGCAGTGGAAAAGGGGGGTTTCTGAGCAGGCAA

AGTAGTACACGGTGGACTCCAACAAACGACCAGATAAGAATATTGAAGGAA CTTTACTACAACAATGGAATTAGATCCCCGAGTGCAGAGCAGATTCAGAGGA

TCTCTGCTAGGCTGAGGCAGTACGGTAAGATTGAAGGCAAGAATGTCTTTTAT

TGGTTCCAGAACCACAAAGCTCGAGAAAGGCAGAAGAAAAGGTTCACTTCTG

ATCATAATCATAATAATGTCCCCATGCAAAGACCCCCAACTAATCCTTCTGCT

GCTTGGAAACCTGATCTAGCTGATCCCATTCACACCACCAAGTATTGTAACAT

CTCTTCTACTGCAGGGATCTCTTCGGCATCATCTTCTGTTGAGATGGTTACTGT

GGGACAGATGGGGAATTATGGGTATGGTTCTGTGCCCATGGAGAAAAGTTTT

AGGGACTGCTCGATATCAGCTGGGGGTAGCAGTGGCCATGTTGGATTAATAA

ACCACAACTTGGGGTGGGTTGGTGTGGACCCATATAATTCCTCAACCTATGCC

AACTTCTTTGACAAAATAAGGCCAAGTGATCAAGAAACCCTTGAAGAAGAAG

CAGAGAACATTGGTGCTACTAAGATTGAAACCCTCCCTTTATTCCCTATGCAC

GGTGAGGACATCCATGGCTATTGCAACCTCAAGTCTAATTCGTATAACTATGA

TGGAAACGGCTGGTATCATACTGAAGAAGGGTTCAAGAATGCTTCTCGTGCT

TCCTTGGAGCTCAGTCTCAACTCCTACACTCGCAGGTCTCCAGATTATGCTTA

A

[0116] SEP ID NO: 3 (GmLEC2 CDS)

ATGGAAAACTTTTTTGTGCCATTTTTAAAAAAAAACCCCAACCCATCAATCAC

CACTACTGGTGGCAATGGCTCATCTTCATCAAACCAAACAAGCCTTGTACAAC

C A AGC AC AT ATCCTC A AA ATTTCCCTT AC A AT ACT AGTGTA A AACTT A ACTTT

CCAGAACAACCTTATTTCATTCCTTTGTATCCCTTTCCAACAGGACAAGTTAG

CTTTTCTAATCAACCCTATGGAATGCCAAATTCGGAACTTCAAGGTTCGAGGG

CATGCATGACCAAAGCTACAAGGGAGAGATGGAGACAAGTAAGACAAAGGA

GTAAAAATTCTACTCTTGTCGCTCCTAATTCAGTTCTAGAAAGGACAACAAGA

GAACAATTTGTTCCTAATGGAGGGTCAAATGTGAGGATCACAGTCAAACAAC

ACAATGCAACCAAGTTTTTTAACACCCCAAACGGGAAGAAGCTAGAAGAAAT

TTTGACAAAGAAGTTGAATAATAGTGATGTTGGCGTCCTAGGCCGCATTGTGC

TCCCAAAGAGAGAGGCTGAGGATAAGCTTCCGACACTGTGGAAGAAGGAAG

GAATCAATATTGTACTAAAGGATGTATATTCTGAGATTGAATGGAGCATCAA

ATACAAGTACTGGACTAATAACAAAAGCAGAATGTATATTCTTGATAATACA

GGGGATTTTGTTAACCATTATAAACTTCAAACAGGAGATTTCATAACCCTTTA

CAAGGACGAGTTGAAAAATCTGTATGTGTCGGCTCGAAAGGATCAAGAAAAT

CTAGAAGAATCTAAGTCCTCGTCAAACACAGGAATGTCACATGAACCAGATG CATATTTAGCTTACTTGACGAAGGAACTTAGCCATAAGGGGAAAGCAGAAGC

TGCCAACAACCTTTTGAACAATGTTGAGGAAGAGGCACCAAATCAAGCAAAT

CAATTACATCAATTCATGCCGATGAACAATATTGTTGGGGAGGGGGCATCAA

ACCAAGCAATTCAAGAAGCCGCACCAGCCGCACCCGTCAATGTTAATCAAGA

AAACAAAGTTGTTGACGACGATGATGATGATATCTATGGTGGCCTTGACAAT

ATTTTCGAAATTGGAAATACTTATCAAATTTGGTAG

[0117] SEP ID NO: 4 (GmGRF5 CDS)

ATGATGAGTGCAAGTGCAAGAAATAGGTCTCCTTTCACGCAAACTCAGTGGC

AAGAGCTTGAGCATCAAGCTCTTGTTTTTAAGTACATGGTTACAGGAACACCC

ATCCCACCAGATCTCATCTACTCTATTAAAAGAAGTCTAGACACTTCAATTTC

TTCAAGGCTCTTCCCACATCATCCAATTGGGTGGGGATGTTTTGAAATGGGAT

TTGGCAGAAAAGTAGACCCAGAGCCAGGGAGGTGCAGAAGAACAGATGGCA

AGAAATGGAGATGCTCAAAGGAGGCATATCCAGACTCCAAGTACTGTGAAAG

ACACATGCACAGAGGCAGAAACCGTTCAAGAAAGCCTGTGGAAGTTTCTTCA

GCAATAAGCACCGCCACAAACACCTCCCAAACAATCCCATCTTCTTATACCCG

AAACCTTTCCTTGACCAACCCCAACATGACACCACCCTCTTCCTTCCCTTTCTC

TCCTTTGCCCTCTTCTATGCCTATTGAGTCCCAACCCTTTTCCCAATCCTACCA

AAACTCTTCTCTCAATCCCTTCTTCTACTCCCAATCAACCTCCTCTAGACCCCC

AGATGCTGATTTTCCACCCCAAGATGCCACCACCCACCAGCTATTCATGGACT

CTGGGTCTTATTCGCATGATGAAAAGAATTATAGGCATGTTCATGGAATAAG

AGAAGATGTGGATGAGAGAGCTTTCTTCCCAGAAGCATCAGGATCAGCTAGG

AGCTACACTGAATCATACCAGCAACTATCAATGAGCTCCTACAAGTCCTATTC

AAACTCCAACTTTCAGAACATCAATGATGCCACCACCAACCCAAGACAGCAA

GAGCAGCAACAACAACAACACTGCTTTGTTTTGGGGACAGACTTCAAATCAA

CAAGACCAACTAAAGAGAAAGAAGCTGAGACAGCTACGGGTCAGAGACCCC

TTCACCGTTTCTTTGGGGAGTGGCCACCAAAGAACACAACAGATTCATGGCT

AGATCTTGCTTCCAACTCCAGAATCCAAACCGATGAATGA

[0118] SEP ID NO: 5 (GmSTM CDS)

ATGGAGGGTAGTAGTTGCTCTAATGACACTTCTTATTTGTTGGCTTTTGGAGA

AAACAGTGGTGGGCTATGCCCAATGACGATGATGCCTTTGGTAACTTCCCATC

ATGCAACAAATCCTAGTAATCCTAGTAATAATACTAATAATAATGAAAACAC

AAACTGTCTCTTCATTCCCAACTGCAGTAACAGTTCTGGAACTCCTTCTATCA TGCTCCACAACAACAACAACACTGATGATGATAACAACAAAACCAGCACTAA

CACTGGGTTAGGGTACTATTTCATGGAGAGTGACCACCATCACCGCAACAAC

AACAACAATGGAAGCTCCTCCTCCTCTTCCTCTTCTGCTGTCAAGGCCAAGAT

CATGGCTCATCCTCACTATCACCGTCTCTTGGCAGCTTACGTCAATTGTCAGA

AGGTTGGAGCCCCACCGGAAGTGGTGGCAAGGTTAGAAGAAGCATGTGCTTC

TGCAGCGACAATGGCTGGTGATGCAGCAGCAGCAGCTGGATCAAGCTGCATA

GGTGAAGATCCAGCTTTGGATCAGTTCATGGAGGCTTACTGTGAGATGCTCAC

CAAGTATGAGCAAGAACTCTCCAAACCCTTAAAGGAAGCCATGCTCTTCCTTC

AAAGGATTGAGTGCCAGTTCAAAAATCTTACAATTTCTTCCACCGACTTTGCT

TGCAACGAGGGTGCTGAGAGGAATGGATCATCTGAAGAGGATGTTGATCTAC

ACAACATGATAGATCCCCAGGCAGAGGACAGGGAATTAAAGGGTCAGCTTTT

GCGCAAGTACAGCGGATACCTGGGCAGTCTGAAGCAAGAATTCATGAAGAA

GAGGAAGAAAGGAAAGCTACCTAAAGAAGCAAGGCAACAATTACTTGAATG

GTGGAGCAGACATTACAAATGGCCTTACCCATCCGAGTCACAGAAGCTGGCC

CTTGCAGAGTCGACAGGTCTGGATCAGAAGCAAATCAACAACTGGTTTATTA

ATCAAAGGAAACGGCACTGGAAGCCTTCAGAGGACATGCAGTTTGTGGTGAT

GGATCCAAGCCATCCACACTATTACATGGATAATGTTCTGGGCAATCCATTTC

CCATGGATCTCTCCCATCCAATGCTCTAG

[0119] SEP ID NO: 6 (GmE2FA CDS)

ATGTCCAGCGCCGCCGGAGTTCCCGACCGCCTCGCTTCGCAGCCGCGGGGGG

CTGCCGGCGCCCCTGCCCTCCCGCCGCTCAAGCGCCACCTTGCCTTCGTCACG

AAACCGCCCTTCGCCCCGCCCGATGAGTACCACAGCTTCTCCAGTGCCGACTC

CCGCCGCGCCGCGGATGAAGCCGTCGTCGTTAGATCTCCGTACATGAAGCGG

AAGAGTGGAATGACTGACAGTGAAGGGGAGTCACAAGCACAAAAGTGGAGT

AACAGCCCAGGATACACTAATGTTAGTAATGTAACGAATAATAGTCCCTTCA

AAACTCCTGTGTCTGCAAAAGGGGGAAGGGCACAGAAGGCAAAGGCTTCCA

AAGAAGGCAGATCATGTCCTCCGACACCCATGTCAAATGCTGGTTCCCCTTCT

CCTCTTACTCCTGCTAGCAGCTGTCGCTATGACAGTTCCTTAGGTCTCTTGACA

AAAAAGTTCATCAATTTGGTCAAACATGCGGAGGATGGTATTCTTGACCTAA

ATAAAGCAGCAGAAACTTTGGAGGTGCAAAAGAGGAGGATATATGACATAA

CTAATGTTTTGGAAGGCATTGGTCTCATTGAAAAGAAGCTCAAGAACAGAAT

ACATTGGAAGGGAATTGAATCTTCTACGTCTGGTGAGGTGGATGGTGATATCT CTGTGCTTAAGGCAGAAGTTGAGAAACTTTCTTTGGAGGAGCAGGGATTAGA

TGATCAAATAAGGGAAATGCAAGAAAGGCTGAGGAATTTGAGTGAAAATGA

AAACAACCAGAAGTGCCTTTTCGTGACTGAAGAAGATATTAAGGGCCTGCCT

TGCTTCCAGAATGAAACTTTAATAGCAATTAAAGCTCCGCATGGAACCACCCT

GGAAGTCCCTGATCCTGAGGAAGCTGTAGACTATCCGCAGAGAAGATATAGA

ATCATTCTTAGAAGCACAATGGGCCCCATTGATGTCTACCTTATCAGTCAATT

TGAAGAGAAATTTGAAGAGGTTAATGGTGCTGAGCTCCCCATGATCCCACTT

GCTTCCAGTTCTGGTTCCAATGAGCAACTAATGACGGAAATGGTTCCTGCTGA

ATGCAGCGGAAAAGAACTTGAACCTCAAACTCAGCTCTCTTCTCATGCATTCT

CTGATCTAAATGCTTCACAGGAGTTTGCTGGTGGCATGATGAAGATTGTCCCT

TCAGATGTTGATAATGATGCAGATTATTGGCTTCTATCAGATGCTGACGTTAG

TATAACAGATATGTGGAGAACAGATTCTACTGTTGATTGGAATGGTATAGAC

ATGCTTCATCCTGATTTTGGAATCATTTCGAGGCCTCAAAGTCCATCATCTGG

GCTTGCTGAAGTGCCATCAACAGGAGCAAACTCTATTCAGAAGTGA

[0120] SEP ID NO: 7 (GmAGL15 CDS)

ATGGGTCGAGGGAAAATCGAGATCAAAAGAATCGACAATGCTAGCAGCAGA

CAAGTCACGTTCTCGAAGCGGAGAACAGGGTTGTTCAAGAAGGCTCAGGAAC

TTTCCATTCTCTGTGACGCCGAGGTTGCTGTCATAGTTTTCTCCAACACTGGCA

AGCTCTTCGAGTTTTCCAGTTCCGGTATGAAGCGAACACTTTCAAGATACAAC

AAATGCCTTGGTTCTACAGATGCTGCTGTAGCAGAAATTATGACACAGAAGG

AAGATTCTAAGATGGTGGAGATTCTAAGAGAGGAAATTGAAAAGCTAGAAA

CAAAGCAATTACAGTTGGTGGGTAAGGATCTGACAGGATTGGGTTTAAAGGA

ATTGCAAAATTTAGAGCAGCAACTTAATGAGGGGTTATTGTCTGTCAAGGCG

AGAAAGGAGGAATTACTCATGGAGCAACTAGAGCAATCTAGAGTTCAGGAA

CAGCGGGTTATGTTGGAGAATGAAACTTTGCGAAGACAGATTGAGGAGCTTC

GGTGTCTGTTTCCACAATCAGAAAGCATGGTCCCATTCCAATACCAACATACT

GAAAGAAAGAATACTTTTGTAAATACTGGCGCCAGATGTCTCAACTTGGCTA

ATAACTGTGGAAATGAGAAAGGGAGTTCAGATACAGCATTTCATTTGGGGTT

GCCTGCTGGTGTTCAAGAGGAAGGCCCCCAAGAAAGAAACCTTTTCAAATGA

[0121] SEP ID NO: 8 (GmBBMl Protein)

MGSMNLLGFSLSPHEEHPSSODHSpTTPSRFSFNPDGSISSTDVAGGCFDLTSDST

PHLLNLPSYGIYEAFHRNNSINTTpDWKENYNSONLLLGTSCNKpNMNONOOO QPKLENFLGGHSFGEHEQTYGGNSASTDYMFPAQPVSAGGGGSGGGSNNNNNS

NSIGLSMIKTWLRNQPPNSENINNNNESGGNIRSSVQQTLSLSMSTGSQSSTSLPLL

TASVDNGESSSDNKQPNTSAALDSTQTGAIETAPRKSIDTFGQRTSIYRGVTRHR

WTGRYEAHLWDNSCRREGQTRKGRQGGYDKEEKAARAYDLAALKYWGTTTT

TNFPISHYEKELEEMKHMTRQEYVASLRRKSSGFSRGASIYRGVTRHHQHGRWQ

ARIGRVAGNKDLYLGTFSTQEEAAEAYDVAAIKFRGLSAVTNFDMSRYDVKSIL

ESTTLPIGGAAKRLKDMEQVELSVDNGHRADQVDHSIIMSSHLTQGINNNYAGG

GTATHHNWHNAHAFHQPQPCTTMHYPYGQRINWCKQEQQDNSDAPHSLSYSDI

HQLQLGNNGTHNFFHTNSGLHPMLSMDSASIDNSSSSNSVVYDGYGGGGGYNV

MPMGTTTAVVASDGDQNPRSNHGFGDNEIKALGYESVYGSATDSYHAHARNLY

YLTQQQSSSVDTVKASAYDQGSACNTWVPTAIPTHAPRSTTSMALCHGATTPFSL

LHE

[0122] SEP ID NO: 9 (GmWUS Protein)

MMEPQQQQQQAQGSQQQQQNEDGGSGKGGFLSRQSSTRWTPTNDQIRILKELY

YNNGIRSPSAEQIQRISARLRQYGKIEGKNVFYWFQNHKARERQKKRFrSDHNH

NNVPMQRPPTNPS AAWKPDLADPIHTTKY CNIS STAGISS ASS S VEMVTVGQMGN

YGYGSVPMEKSFRDCSISAGGSSGHVGLINHNLGWVGVDPYNSSTYANFFDKIRP

SDQETLEEEAENIGATKIETLPLFPMHGEDIHGYCNLKSNSYNYDGNGWYHTEEG

FKNASRASLELSLNSYTRRSPDYA

[0123] SEP ID NO: 10 (GmLEC2 Protein)

MENFFVPFLKKNPNPSITTTGGNGSSSSNQTSLVQPSTYPQNFPYNTSVKLNFPEQ PYFIPLYPFPTGQVSFSNQPYGMPNSELQGSRACMTKATRERWRQVRQRSKNSTL VAPNSVLERTTREQFVPNGGSNVRITVKQHNATKFFNTPNGKKLEEILTKKLNNS DVGVLGRIVLPKREAEDKLPTLWKKEGINIVLKDVYSEIEWSIKYKYWTNNKSR MYILDNTGDFVNHYKLQTGDFITLYKDELKNLYVS ARKDQENLEES KSS SNTGM SHEPDAYLAYLTKELSHKGKAEAANNLLNNVEEEAPNQANQLHQFMPMNNIVG EGASNQAIQEAAPAAPVNVNQENKVVDDDDDDIYGGLDNIFEIGNTY QIW [0124] SEP ID NO: 11 (GmGRF5 Protein)

MMSASARNRSPFTQTQWQELEHQALVFKYMVTGTPIPPDLIYSIKRSLDTSISSRL FPHHPIGWGCFEMGFGRKVDPEPGRCRRTDGKKWRCSKEAYPDSKYCERHMHR GRNRSRKPVEVSSAISTATNTSQTIPSSYTRNLSLTNPNMTPPSSFPFSPLPSSMPIE SQPFSQS YQNSSLNPFFY S QSTS SRPPDADFPPQD ATTHQLFMDS GS YSHDEKNYR HVHGIREDVDERAFFPEASGSARSYTESYQQLSMSSYKSYSNSNFQNINDATTNP

RQQEQQQQQHCFVLGTDFKSTRPTKEKEAETATGQRPLHRFFGEWPPKNTTDSW

LDLASNSRIQTDE

[0125] SEP ID NO: 12 (GmSTM Protein)

MEGSSCSNDTSYLLAFGENSGGLCPMTMMPLVTSHHATNPSNPSNNTNNNENTN CLFIPNCSNSSGTPSIMLHNNNNTDDDNNKTSTNTGLGYYFMESDHHHRNNNNN GSSSSSSSSAVKAKIMAHPHYHRLLAAYVNCQKVGAPPEVVARLEEACASAATM AGDAAAAAGSSCIGEDPALDQFMEAYCEMLTKYEQELSKPLKEAMLFLQRIECQ FKNLTIS STDFACNEG AERN GS SEED VDLHNMIDPQ AEDRELKGQLLRKY S G YLG SLKQEFMKKRKKGKLPKEARQQLLEWWSRHYKWPYPSESQKLALAESTGLDQK QINNWFINQRKRHWKPSEDMQFVVMDPSHPHYYMDNVLGNPFPMDLSHPML [0126] SEP ID NO: 13 (GmE2FA Protein)

MSSAAGVPDRLASQPRGAAGAPALPPLKRHLAFVTKPPFAPPDEYHSFSSADSRR

AADEAVVVRSPYMKRKSGMTDSEGESQAQKWSNSPGYTNVSNVTNNSPFKTPV

SAKGGRAQKAKASKEGRSCPPTPMSNAGSPSPLTPASSCRYDSSLGLLTKKFINL

VKHAEDGILDLNKAAETLEVQKRRIYDITNVLEGIGLIEKKLKNRIHWKGIESSTS

GEVDGDISVLKAEVEKLSLEEQGLDDQIREMQERLRNLSENENNQKCLFVTEEDI

KGT PGFQNETT TATKAPHGTTT EVPPPFEAVPYPPRR YRTTT RSTMGPTPVYT TSQF

EEKFEEVNGAELPMIPLASSSGSNEQLMTEMVPAECSGKELEPQTQLSSHAFSDL

NASQEFAGGMMKIVPSDVDNDADYWLLSDADVSITDMWRTDSTVDWNGIDML

HPDFGIISRPQSPSSGLAEVPSTGANSIQK

[0127] SEP ID NO: 14 (GmAGL15 Protein)

MGRGKIEIKRIDN AS S RQ VTFS KRRTGLFKKAQELS ILCD AE V A VI VFSNTGKLFE

FSSSGMKRTLSRYNKCLGSTDAAVAEIMTQKEDSKMVEILREEIEKLETKQLQLV

GKDLTGLGLKELQNLEQQLNEGLLSVKARKEELLMEQLEQSRVQEQRVMLENE

TLRRQIEELRCLFPQSESMVPFQYQHTERKNTFVNTGARCLNLANNCGNEKGSSD

T AFHLGLPAG V QEEGPQERNLFK

[0128] SEP ID NO: 15 (GmBBMl Promoter)

AATATTATTAATATACTCTTAATATATTGGTTAATGAAATAAAATTAATTATT

GATTTCTTAATTACTTATTCTTGAAGTATACAGATTCATAAAATCTCTTCTTAC

AATGGACACAAAAACTAAGCATCTTTTCGTTTACAATGTGTCATTAGCATCTT

CTTAATCTTCTTAATTAATGAATCTCTATTAGCGATTACAATGTGTCATTAACA TCTTATTCGATAGTACTATTAATTGAGATTCCTCTCATTCAACCACTTTTATAA

AAAAATAAAGTTTTAACAAAAAAGAAAATCATAGTTCATAATATCTAACTTT

ATACTTTATGAAAAAAAAGTAATGTATCACATATCACATCAGAATTTATTTTC

CATGAAACATGAAGGCAGTGATGCATCAATCAGCACATTAGTGATTTTGTGT

CACAAGTCACAACTGTTCAGAAAAAGCTCTTAGAGTGAATCGTAACACCGTA

TCACAAGGGCGCATTATATTTTTCAATACCGCGAGCAACTAGTAGTACTAGTG

TGTTTGGACTACCACATTAATTACGAAATGGTCCCCGTGTGTGGATCTTTTCA

TTAGCCCTTGAAGTAATTTTTTTTTTCTGATTCAAAGATTTCAAGTGCCCTAGA

ATGTATAAGACGCGTCCCATTTCTATTGTGTGCGCGTGTGTGGTGTGTACGTG

C ATATC AGCC AG A AG A A AG AG A A A AT A ACTC A A A AT AT AGT A ACTT A A AGT A

TACTATAAATGTTCTCTCATCTCTATGCTATAAATGTTTTTTTTTCAATTTTTTG

AGCTCTTCAAGAATTTGACCCTTCTCCTCCTCCTCCTTCTTCTTTTCTTTCAAAC

CTCCTCATATAAACTAGTACTATATGCTTCTTCTTCTTCTTCTCCTTCATGCAC

A A ACTGCT ATTTTC ACCCTTTAT AT ATCT ATCT ACTCCTG A AG ATT AG ATT ACC

TTGAGGGCTTTGTGCTCTCTGTGTAATATTCTTCAATATC

[0129] SEP ID NO: 16 (GmWUS Promoter)

TGAAATGCCTATAGAATATGCGGACCAATGCACAACACAAAAAATAAATAGC

CCTGATGGAAAGGGAAATTCGATCTAAATCTACATCTCATCTTTTAATAAGTG

TATGTACGGAAAGAGGAGAGATATAAAAAAAATAAAATAATAGATATAATA

AATTACTTATTTGAT G A A A A AT AAAAGTTAAAATATAAAAAGA G A ATT G A AG

TAAAAGTGAGATGGAAAAAAAAAATGGATGTATCACCAATTGACCATAATAA

CTCTATATGCTTCATGCATTGGTTGGGACCCATGAAATGCACAATAAGTTCAC

AAATACATTTTTACCCTCCAATTCATCAGGTAAGTACAGAATATATATCTTGG

TAGCTTGCTGATTCG ACTT AAT AATTATAG AGT AAGAATTTAAAAAAAAAAT

GTATGTGTGTGTATAGGGGCCATGTCTGATATCTCCATCAAAAGAAGAACCT

ATTGAACTCCCAAATCACAACCCGCATCATTCCATTGCCATTCATTCATTCAT

TCAGAAAATCTACTCTTTTTTTTTTCTTTCCTTCCATCCAATATATCATTTCATG

CCTCATTTTTCTACCTTTTCCCACTGTCTCTGTGTGCAAATACTTTATTTCACA

CATACCTGGTCATGCCTTTTCGTCCAAGTAATTCCTGATAGTACCCTCACTTTC

TAAGCTCTCTTTTGTCCCTTCCCTTTTTATGAACACCACTCTGTCACCCTCAGT

CCTTCTCTCTCAGATATTTATTTATGATTTTCTCTCTTTATCACTCCATGTACTA

TATGTGCCTGTGCCTCATCTATCATCTATCATCTATCATCTATCATCACCTATT ATAAGTTTATAACCCCCCTCACCCTTTCCTCCCCTTCATAATTCATGCAGTAGT AATCTCTCTTCTCACCTATATACCCTCTAATATTCTAATTCTCTCTCTTGATCC AACAAACAAACACTACCATTTTGTTTGTTCTGAGTAGTGATCC [0130] SEP ID NO: 17 lGmLEC2 Promoter]

ACACTTATTTTTTTCTTCAATCACATTCACGTATATTATTATATATTCTATAAT

ATTTGTATTTATTCAATTCAATTATTTATTATTTTTTTATATTTATTAACATATA

TAAATGATAATTAAAAACATATTCAATTCAATAATAATATTATATATTATTAT

ACACTAATTAATAAGTCACATTTATGTGTATATACCAATTGACTGTAATATTA

TCTTTT AG ATTTT A AT A AGTC AC AC ACGC ATGC AT AA AG ACG ATTTT A ATC AG

ACATATTCATGTATATTATCATATACTAATTAATAAATACCTATGTGATATTTT

CATTGATTGCTTATGAAACTCTCAACCCCACACATGAAGCCAAAACCATGGC

CAAACCAAAACCCCAGCCATTTTCACACCTCTATCTTCCCATAGTCACTTCCT

ATATTATTATCCTCTCTTCGTAACTGCAATTCATGTTCCTCTAGGCATCTTACA

AACACATGGGGCACACACCTTTCTTTGGCTTTATGCAACACATGAAGACAAT

GTCCATCTTGCATACCATTTATAAGTCAGCAAGTCTCAACTTTATGATACCAT

AACGCTCACTTTCACTGCAATGACATTTCATCTTCTCTTGTTTTTTCTGCTTCA

TCCATCTCAACACTCTCAATTTTTTTTTATATTTTGAACTTGCAATTTATGTGTT

TTTGTTCAGTGCATTTGATTACAACTCAGATGAGTATTCCAATGTCACAACGT

TCCCTCCACTTGTTACCCACTTCAACATCTTCCTTCCTCTCTCTTGTTTCCTTTT

CCTTCCTTTTCTTTATTCTCGTTCACAATCCTTGCATTTATTTTTGTCATACTTT

TTTTTTTATATTTTTGTTTGCTTAATTGGCACTACCACTGCACCTAAACAACTT

CTTATAAGAGCCTCATACACACACACACTCTCTCAATTCACTCAACACTCAAA

ACAAAAACCTTGAAGCCTGTTAATTTCTCACCAAA

[0131] SEP ID NO: 18 lGmGRF5 Promoter)

ATTATCATTGAGTTAAAACTCTAACTCAAGCATGAAAAAATACATTAAAGTTT

TGTGTTTTTCAATTACCATAAAGTTTGATGAATATTGGTTTTGACGTTTTGTGG

TTATGGAAATGATTAAGGAGAAAACATGTAAAGGGTTATGATGGCCTATTGA

CAAGACGGTGGCCAATAGAGAGTTAAAGGCCAAATTGACTGTAACCCAAATT

CCACTGATGAAAGTGAGATGCTTGGGTTTGGGGGGTGAAATGAAAAAAGGA

GAAAGGAGAAAGCATCAATCCGTGGCCAAAAAAAGCAGGATTCAGCTCTAG

CCTTGGCCTCCAAATCTATCAATGAGATAACGCCACGCATGCTTCAAGCCAA

AAAAGATTAAAAATGACACGTACGAGACTTTCTCTTATTCAAAAAGTTACTA CAATTGCAAAGAGAGATTGATAATTTGATATACTAATGGCCACTATTGCTCAG

CAGCTTACACTTCACATAACCGGATGGCATGGCACTGTTTTCCATGAAGTGAT

GTGGAGACAGCAAAACCAAAGGTGCATGGACTAACATGCATTTGAATTTAAT

TTTTCTTCTTTTCCTTTGTACATTTGTTTATGGATTTCTGTAAAGATGTTAGAG

ACAAGGGCAGCAACAAAGGCAGCTGCAGAGAAAAAACAGAAGCAACAGAG

GTGCAGTCATTATAAAGAGCAGACTCACTCACTCACCCATCATCCAGCACATT

AGAGAAATAGAGAGGAGGTGGCAGCAAAGCCAGAAAGCATCATCAGACTCT

CAGACCCATTAGTATTATCCGTGCACAGGAGAAGAATCTCTACCCTTGAAAA

AT AT AT ATA A A AAT AAA AT A AT A ATG ACCCTCC A A AGTCC A A ATT ACT ATC A

CCCCATCTAGAGAATTTATTTCACTCTTTCAAATCTTATATCTTCTTGTTCTTC

ACTTCCCCACTATTTTAGAGAGAGACACACACACTCTTCCTTCCTTTTGTTGTC

TCAAA

[0132] SEP ID NO: 19 (GmSTM Promoter)

[0133] TGCACATGCAATTTAATTGTGATATCATTATTATCACTCATATGAAG

CTATTGCT AGCTC A A AT AGT AGT ATT A ATTTATT ATT AG A ACTTTC A AG A ACT

AAGCGTACGTTCAAGTATCAATCAATCAACACAATTTGCTCGATAATGATAA

CATACTCGTATACACCTAGCTCACATAAGTTACGGTATTAAACATTTATAATC

TGACACAATTTAATATCATTATCGAGCTGTTATCATATTTAAGTTAAGGATTT

CTTT AATT AGTATTTTT A AGAT ATT A ATT A AAA A A A AT A A A A A A AT ATTT ATT

GTGTAAATCAAGATAAAAAATTATATCTCTCAATAAAAATATTTTTACTTTAA

ATTTCTTAACT AATATTCTT AAAACACTTATT AAT ATTT ATTTTTAGGTTAAAA

GTAAAAGTATTTATAAGAAACAGTAATAGAAAAATTAAATATATAATAGTTA

ATAATTAATAATTTGTTATTAAAATGACATCATACCTTACTGGCTCTTAGAAA

ATC A ATTCTTAT AGTTGT AGT ACTTTTT AT A AC AGA A A AC ATTAT ATTTC A AA

TTGAAGTGTACTCAAGAAAAAAAATGAAATGAAGAGTATAACCGGGAGAGG

GGGACAATGGGAAGCGACAATGTGTACGTAACCTGATGGAGGTGCTTTCACT

ACGGTATTTTACGGGAAGTGATGCTACGCTAGGCCTTTATTAATTATTATATT

AGGGACGAGGGATATCATATGGGATATAGAGATGAACTATGGTGCTGGAAAT

AGATCGAGAAAAAAGGGGTTGCTGAGAGGAAGAGACATTCGGACTGTCCCA

CAAACTTTACCAGCTTTATTTACTCACCTGCAGACGCGCTTTTTCCATGGTTAA

TTATACTGTATCGTATTAAATTAGATCATACTAGTATACTATATACTACCATA

GGAAGAGAGAGAAGTAAGCATCATCATATAGTAAATATTCATGTTTAGACTT TAGTATTAATAGTAACTAACGCTAATGTTAAAACACTAAATACATCTATTTTG

GAGCTAACAAGAAGAACAAATTAGGTTTGATAAATTAAATCCCTAATGTTCT

GTTAAATGTTGGTACTTGTTTGTGGGACTAGAGAATTTTTTAATCACTGTGGT

GAGAAGATCGAGGACAAATAGGGTGAGAATATTAAATGAGTGGAGGGATTG

CCATCAAAGTGTAGAGAGAGAGAGAAGGAAGGGTTGATTTTGATTCCGTGCC

CCATAAACATAAACATAAACATAAACCATCTCATCTTTCTCCATTGATGGCCA

GTAGTGGGTAACTTGTTTTTCTTCCTCGATTTGATCGTTCCTTCTCTCTCTCTAT

TGTGTTTTGTTTTATGCCAGGAATGGCAGCGTATCAGTGGCAGTGCAGGAAA

AGAGAGGGAGAGTTTTCATTGGGAAGGTAAAAGCTTTTGTTTGTAGCAGTGA

AACCTCGCCCCCTTCTCTTCATCGCTACTAGTAGTAACTCATCGTTTTCTCGGT

GTGCCCCGCGTGCGCTCTGCTGTGTCTTCTCACTCACACCAGAGGTGTAACCG

TGTAACCACTAGAATCATTTATTCATTAATGCTGGCAACAGTGGCATGGAAA

GAAAGATTAATTTTTCCAAAGGAAAGAAAAACCCTCTGCAGGCTTTGCCAGA

TAAGCCAAGTGGGAAAACCAAACCCTCTATTAGTACTTACTTCATGTAACTGA

CTATAGCCACCACTATCACTATTTAGGATTTTCTGTAAAAAGCCTGATACTCT

TTTACCATAAAACCCGGGAGAGCCCTGGAAGACAAACATCTTCATTCAGACT

TCATAAAATAAAATAGAGAAGTGTTTTTTTGTTTTTTTGGTTTGTTGTAATTAA

GGCTAGCTAGTGAGTGTGTTCTACAACTGTAGTGAGCTACAGAAGGTGGTGG

TAGTAGTAGGCAAAAAGGATAAGACAGTGAGTGTGTATGTTGTTGACAAGCA

AAAGCC

[0134] SEP ID NO: 20 (GmE2FA Promoter)

AATTAGTCTTATTGAATACTTATAATTTAATAAGTTAACTTCCCAATTTTAGAT

T ATC A A ATTTCT AGTTTC ACGG A AC AT A ACCT ATTTTC A A A A AT A ATTT A AC A

T AAC ACTT A ATTTGGT AT ACT AAC AC AC ATGT AC ATTC ATT A A A A A ATAG ACT

AAGTAATTGATAATATATTACAAAATTAAAACATATAAACTAATTATAAATT

ATTAAATATGATTTTATACCTGTGCTAGACATGTGGTATCACGCTAGTAATTA

AT A AT AT ATTA A A A ATT A A A AT A AT AT A AC AAGTT ACT ACT AT A A ATT AT A A

AATATAAATGTAATATCAATATAAGCCACAAGAGTTAAACTTGTCCATATGT

ATAACTTTTAAGTAGTTAGAAAACTTGTTAAAGATATAAAATTTATTGACGAT

ATAAATTTTGTTTACACCAGTATCAATGCATATCAATTAAATCCTTTTTCTATT

AATTTTAACATATACATCACATTAATCACACTAATGAAGGTAAGCAAAGAAT

TTAACAAGTTTTTTTTTTTTTAAAATCTAATATAAACTAAAAAGTAAGGCAGC GAAAAAGGAAATAAGATAATTTCATGATAATAATCTAAAAATACAATAACCC

CGTACCAAAAAAACATGTGTAATTACAGGAACACTTAAAATTTCTTCTTTTAT

TATTATTATTTTTTTTTTCGCGCATGCAGTTCCCTCCACATCTATCCGAAACCA

AATTCCCTCCTTCCCTCGTTTTCTGCTCTCGCCTCCTCTACGTTCCATAACGCC

CTCTCTCTCTCTCTCTCTCTCTCTCTCTTTTTTTTTTTTTTTTCCAAACCCTTTTC

CCCTCCCTCTCACTTTCTCTCTCTAAACCCCACTCTTTCTCTCTCTAAAACCCT

ACACTGTACTCTCCTTCCTTCGGATCCTTCTCCCGTTTCCCTCCAATTTCCCCC

CAATTCCGCTGGCCCCACCTCCGCCCCTTTTCCCGCTTCCTC

[0135] SEP ID NO: 21 (GmAGL15 Promoter)

TCTAAATGCCCAGAGAACACAACACGGAGCCATGCAAAGTTGCCGTTTCCAG

CAAACCTCTCTGGTTATTTGAGGTAAAACGCTTTGCAGTCTCGCAAATCGCAA

CAACCCCTTCGTCTTCTCAGTAAAAGGGGTCTTACTTACTTAGTGTCTTCGTTC

GTATCTTCAACCCTGAATTCGCTTCTCCTCCCAAAGCACCACCACCACCTCTA

ATTAATTCCTCGTTCAGTTGGGCATGTTTGCGCATTTCTGAGAGAGCGAGAAA

ATAAA

[0136] SEP ID NO: 22 (VP128 CDS)

GGAGGGTCCGGAGGTGACGCTTTGGATGATTTCGATCTCGATATGCTCGGCTC

CGACGCCCTTGATGACTTCGACCTGGATATGCTTGGAAGCGACGCTCTCGATG

ACTTCGATCTTGACATGCTTGGTAGTGATGCCCTGGACGACTTTGACTTGGAT

ATGCTCGCTCGGGGGTCCGACGCTTTGGATGACTTCGATCTGGACATGCTGGG

CTCAGACGCACTTGACGACTTCGACCTCGACATGCTGGGATCAGACGCCCTC

GATGATTTTGATCTTGACATGCTTGGAAGTGACGCGTTGGACGATTTTGATCT

CGATATGCTT

[0137] SEP ID NO: 23 (6TAD CDS)

[0138] GGAGGGTCCGGAGGTCTGTTGGACCCTGGTACGCCTATGGATGCGG

ATTTGGTCGCGTCTAGTACCGTTGTGTGGGAGCAAGACGCGGACCCGTTTGCA

GGAACAGCAGATGATTTTCCCGCGTTTAATGAAGAAGAGTTGGCCTGGTTGA

TGGAACTTCTTCCTCAGGGAGGTTCGGGAGGACTTCTTGACCCCGGCACTCCG

ATGGATGCCGACCTCGTCGCATCCTCTACTGTCGTTTGGGAACAGGATGCAGA

CCCGTTCGCAGGCACCGCAGATGATTTCCCTGCCTTTAACGAAGAGGAACTC

GCTTGGCTGATGGAATTGCTTCCGCAAGCGAGAGGGGGTTCAGGCGGGTTGC

TCGATCCGGGTACACCGATGGACGCCGACTTGGTTGCATCGTCAACAGTCGTC TGGGAACAGGACGCGGACCCCTTTGCGGGCACAGCGGACGACTTCCCGGCTT

TTAATGAGGAGGAACTCGCATGGCTTATGGAGCTTTTGCCACAGGGTGGTTC

AGGTGGTCTACTTGATCCTGGGACTCCTATGGACGCCGACTTGGTAGCTAGCT

CAACAGTTGTTTGGGAGCAAGACGCTGACCCTTTCGCCGGCACTGCAGACGA

TTTTCCCGCTTTCAATGAAGAAGAGCTCGCCTGGCTCATGGAGCTTCTGCCCC

AGGCTAGAGGAGGCTCAGGTGGATTGCTGGATCCAGGCACCCCAATGGACGC

AGATCTCGTCGCTAGTAGCACTGTAGTGTGGGAACAGGATGCAGATCCCTTT

GCTGGCACTGCCGACGACTTCCCCGCATTCAACGAGGAGGAACTGGCTTGGC

TTATGGAACTCCTCCCTCAGGGGGGGTCCGGCGGCTTGCTGGATCCCGGCACT

CCCATGGACGCAGACCTGGTTGCTTCTAGTACCGTCGTCTGGGAGCAAGACG

CCGATCCATTCGCAGGTACCGCCGATGATTTTCCTGCCTTTAATGAAGAAGAG

TTGGCATGGTTGATGGAGCTCCTTCCTCAA

[0139] SEP ID NO: 24 (6TAD-VP128 CDS)

GGAGGGTCCGGAGGTCTGTTGGACCCTGGTACGCCTATGGATGCGGATTTGG

TCGCGTCTAGTACCGTTGTGTGGGAGCAAGACGCGGACCCGTTTGCAGGAAC

AGCAGATGATTTTCCCGCGTTTAATGAAGAAGAGTTGGCCTGGTTGATGGAA

CTTCTTCCTCAGGGAGGTTCGGGAGGACTTCTTGACCCCGGCACTCCGATGGA

TGCCGACCTCGTCGCATCCTCTACTGTCGTTTGGGAACAGGATGCAGACCCGT

TCGCAGGCACCGCAGATGATTTCCCTGCCTTTAACGAAGAGGAACTCGCTTG

GCTGATGGAATTGCTTCCGCAAGCGAGAGGGGGTTCAGGCGGGTTGCTCGAT

CCGGGTACACCGATGGACGCCGACTTGGTTGCATCGTCAACAGTCGTCTGGG

AACAGGACGCGGACCCCTTTGCGGGCACAGCGGACGACTTCCCGGCTTTTAA

TGAGGAGGAACTCGCATGGCTTATGGAGCTTTTGCCACAGGGTGGTTCAGGT

GGTCTACTTGATCCTGGGACTCCTATGGACGCCGACTTGGTAGCTAGCTCAAC

AGTTGTTTGGGAGCAAGACGCTGACCCTTTCGCCGGCACTGCAGACGATTTTC

CCGCTTTCAATGAAGAAGAGCTCGCCTGGCTCATGGAGCTTCTGCCCCAGGCT

AGAGGAGGCTCAGGTGGATTGCTGGATCCAGGCACCCCAATGGACGCAGATC

TCGTCGCTAGTAGCACTGTAGTGTGGGAACAGGATGCAGATCCCTTTGCTGGC

ACTGCCGACGACTTCCCCGCATTCAACGAGGAGGAACTGGCTTGGCTTATGG

AACTCCTCCCTCAGGGGGGGTCCGGCGGCTTGCTGGATCCCGGCACTCCCATG

GACGCAGACCTGGTTGCTTCTAGTACCGTCGTCTGGGAGCAAGACGCCGATC

CATTCGCAGGTACCGCCGATGATTTTCCTGCCTTTAATGAAGAAGAGTTGGCA TGGTTGATGGAGCTCCTTCCTCAAGCACGCGGGGGGTCTGGTGGTGGTGGAT

CTGGCGGTGACGCTTTGGATGATTTCGATCTCGATATGCTCGGCTCCGACGCC

CTTGATGACTTCGACCTGGATATGCTTGGAAGCGACGCTCTCGATGACTTCGA

TCTTGACATGCTTGGTAGTGATGCCCTGGACGACTTTGACTTGGATATGCTCG

CTCGGGGGTCCGACGCTTTGGATGACTTCGATCTGGACATGCTGGGCTCAGAC

GCACTTGACGACTTCGACCTCGACATGCTGGGATCAGACGCCCTCGATGATTT

TGATCTTGACATGCTTGGAAGTGACGCGTTGGACGATTTTGATCTCGATATGC

TT

[0140] SEP ID NO: 25 (VP128 Protein)

GSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDML

ARGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLDMLGSDALDDFDLD

ML

[0141] SEP ID NO: 26 (6TAD Protein)

GGS GGLLDPGTPMD ADLV AS ST V VWEOD ADPFAGT ADDFPAFNEEELAWLMEL

LPpGGSGGLLDPGTPMDADLVASSTVVWEpDADPFAGTADDFPAFNEEELAWL

MELLPpARGGSGGLLDPGTPMDADLVASSTVVWEpDADPFAGTADDFPAFNEE

ELAWLMELLPpGGSGGLLDPGTPMDADLVASSTVVWEpDADPFAGTADDFPAF

NEEELAWLMELLPpARGGSGGLLDPGTPMDADLVASSTVVWEpDADPFAGTAD

DFPAFNEEELAWLMELLPOGGSGGLLDPGTPMD ADLV ASSTV VWEOD ADPFAG

T ADDFPAFNEEEL AWLMELLPQ

[0142] SEP ID NO: 27 (6TAD-VP128 Protein)

GGS GGLLDPGTPMD ADLV AS ST VVWEQD ADPFAGT ADDFPAFNEEELAWLMEL LPQGGSGGLLDPGTPMD AD LVASSTVVWEQD ADPFAGT ADDFPAFNEEELAWL MELLPQARGGSGGLLDPGTPMDADLVASSTVVWEQDADPFAGTADDFPAFNEE ELAWLMELLPQGGSGGLLDPGTPMD AD LVASSTVVWEQD AD PFAGTADDFPAF NEEELAWLMELLPQARGGSGGLLDPGTPMDADLVASSTVVWEQD ADPFAGT AD DFPAFNEEELAWLMELLPQGGSGGLLDPGTPMD ADLV ASSTV VWEQD ADPFAG TADDFPAFNEEELAWLMELLPQARGGSGGGGSGGDALDDFDLDMLGSDALDDF DLDMLGSDALDDFDLDMLGSDALDDFDLDMLARGSDALDDFDLDMLGSDALD DFDLDMLGSDALDDFDLDMLGSDALDDFDLDML [0143] SEP ID NO: 28 (full map of pCLS3)

CGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAG

TGAGCGCAACGCAATTAATACGCGTACCGCTAGCCAGGAAGAGTTTGTAGAA

ACGCAAAAAGGCCATCCGTCAGGATGGCCTTCTGCTTAGTTTGATGCCTGGCA

GTTTATGGCGGGCGTCCTGCCCGCCACCCTCCGGGCCGTTGCTTCACAACGTT

CAAATCCGCTCCCGGCGGATTTGTCCTACTCAGGAGAGCGTTCACCGACAAA

CAACAGATAAAACGAAAGGCCCAGTCTTCCGACTGAGCCTTTCGTTTTATTTG

ATGCCTGGCAGTTCCCTACTCTCGCGTTCGAATACATCTAGATCCAAGTACAT

GGCAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCGTTGCAACAAATT

GATGAGCAATGCTTTTTTATAATGCCAACTTTGTACAAAAAAGCAGGCTTAGG

TACCTCGCGAATGCATCTAGATCCAATGATCATGAGCGGAGAATTAAGGGAG

TCACGTTATGACCCCCGCCGATGACGCGGGACAAGCCGTTTTACGTTTGGAAC

TGACAGAACCGCAACGTTGAAGGAGCCACTCAGCCGCGGGTTTCTGGAGTTT

AATGAGCTAAGCACATACGTCAGAAACCATTATTGCGCGTTCAAAAGTCGCC

TAAGGTCACTATCAGCTAGCAAATATTTCTTGTCAAAAATGCTCCACTGACGT

TCCATAAATTCCCCTCGGTATCCAATTAGAGTCTCATATTCACTCTCAATCCA

AATAATCTGCACCGGATCTCGCCCTTACCTGCTAGTCATGGGCGATCCTAAAA

AGAAACGTAAGGTCATCGATTACCCATACGATGTTCCAGATTACGCTATGGCT

CCTAAGAAGAAGAGAAAGGTTATAACAATGGTGAGCAAGGGCGAGGAGCTG

TTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGGCGACGTAAACGGCC

ACAAGTTCAGCGTGTCCGGCGAGGGCGAGGGCGATGCCACCTACGGCAAGCT

GACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCCTGGCCCACCC

TCGTGACCACCTTCGGCTACGGCCTGCAGTGCTTCGCCCGCTACCCCGACCAC

ATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAGG

AGCGCACCATCTTCTTCAAGGACGACGGCAACTACAAGACCCGCGCCGAGGT

GAAGTTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCATCGAC

TTCAAGGAGGACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTACAACA

GCCACAACGTCTATATCATGGCCGACAAGCAGAAGAACGGCATCAAGGTGAA

CTTCAAGATCCGCCACAACATCGAGGACGGCAGCGTGCAGCTCGCCGACCAC

TACCAGCAGAACACCCCCATCGGCGACGGCCCCGTGCTGCTGCCCGACAACC

ACTACCTGAGCTACCAGTCCGCCCTGAGCAAAGACCCCAACGAGAAGCGCGA

TCACATGGTCCTGCTGGAGTTCGTGACCGCCGCCGGGATCACTCTCGGCATGG ACGAGCTGTACAAGCCGCGGTTCCCGGGAGATCTTGGAGGGGGCGGTAGCGG

CGGTGGCGGGAGCATCGATATCGCCGATCTACGCACGCTCGGCTACAGCCAG

CAGCAACAGGAGAAGATCAAACCGAAGGTTCGTTCGACAGTGGCGCAGCAC

CACGAGGCACTGGTCGGCCACGGGTTTACACACGCGCACATCGTTGCGTTAA

GCCAACACCCGGCAGCGTTAGGGACCGTCGCTGTCAAGTATCAGGACATGAT

CGCAGCGTTGCCAGAGGCGACACACGAAGCGATCGTTGGCGTCGGCAAACAG

TGGTCCGGCGCACGCGCTCTGGAGGCCTTGCTCACGGTGGCGGGAGAGTTGA

GAGGTCCACCGTTACAGTTGGACACAGGCCAACTTCTCAAGATTGCAAAACG

TGGCGGCGTGACCGCAGTGGAGGCAGTGCATGCATGGCGCAATGCACTGACG

GGTGCCCCGCTCAACTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATA

ATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTG

CCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGAT

GGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCC

AGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGG

TGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAG

GCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCAATATTGGTG

GCAAGCAGGCGCTGGAGACGGTGCAGGCGCTGTTGCCGGTGCTGTGCCAGGC

CCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGC

AAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCC

ACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCAATATTGGTGGCAA

GCAGGCGCTGGAGACGGTGCAGGCGCTGTTGCCGGTGCTGTGCCAGGCCCAC

GGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGC

AGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGG

CTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCAATATTGGTGGCAAGCAG

GCGCTGGAGACGGTGCAGGCGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCT

TGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGC

GCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTG

ACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCG

CTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGA

CCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCT

GGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACC

CCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTG GAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCC

CGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGG

AGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCC

GGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGA

GACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCC

CAGCAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAG

ACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCTCA

GCAGGTGGTGGCCATCGCCAGCAATGGCGGCGGCAGGCCGGCGCTGGAGAG

CATTGTTGCCCAGTTATCTCGCCCTGATCCGGCGTTGGCCGCGTTGACCAACG

ACCACCTCGTCGCCTTGGCCTGCCTCGGCGGGCGTCCTGCGCTGGATGCAGTG

AAAAAGGGATTGGGGGATCCTATCAGCCGTTCCCAGCTGGTGAAGTCCGAGC

TGGAGGAGAAGAAATCCGAGTTGAGGCACAAGCTGAAGTACGTGCCCCACG

AGTACATCGAGCTGATCGAGATCGCCCGGAACAGCACCCAGGACCGTATCCT

GGAGATGAAGGTGATGGAGTTCTTCATGAAGGTGTACGGCTACAGGGGCAAG

CACCTGGGCGGCTCCAGGAAGCCCGACGGCGCCATCTACACCGTGGGCTCCC

CCATCGACTACGGCGTGATCGTGGACACCAAGGCCTACTCCGGCGGCTACAA

CCTGCCCATCGGCCAGGCCGACGAAATGCAGAGGTACGTGGAGGAGAACCA

GACCAGGAACAAGCACATCAACCCCAACGAGTGGTGGAAGGTGTACCCCTCC

AGCGTGACCGAGTTCAAGTTCCTGTTCGTGTCCGGCCACTTCAAGGGCAACTA

CAAGGCCCAGCTGACCAGGCTGAACCACATCACCAACTGCAACGGCGCCGTG

CTGTCCGTGGAGGAGCTCCTGATCGGCGGCGAGATGATCAAGGCCGGCACCC

TGACCCTGGAGGAGGTGAGGAGGAAGTTCAACAACGGCGAGATCAACTTCGC

GGCCGACTGATAACTCGAGAAGGGCGCGATCGTTCAAACATTTGGCAATAAA

GTTTCTTAAGATTGAATCCTGTTGCCGGTCTTGCGATGATTATCATATAATTTC

TGTTGAATTACGTTAAGCATGTAATAATTAACATGTAATGCATGACGTTATTT

ATG AG ATGGGTTTTT ATG ATT AG AGTCCCGC A ATT AT AC ATTT A AT ACGCG AT

AGAAAACAAAATATAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTCA

TCTATGTTACTAGATCGGGAATTCGTAATCATGGTCATAGCATTGGATCGGAT

CCCGGGCCCGTCGACTGCAGAGGCCTGCATGCAACAACTTTGTATACAAAAG

TTGAACGAGAAACGTAAAATGATATAAATATCAATATATTAAATTAGATTTT

GCATAAAAAACAGACTACATAATACTGTAAAACACAACATATCCAGTCACTA

TGTCGATTGTCTTCATCGGATCCCATCCCCTATAGTGAGTCGTATTACATGGT CATAGCTGTTTCCTGGCAGCTCTGGCCCGTGTCTCAAAATCTCTGATGTTACA

TTGCACAAGATAAAAATATATCATCATGCCTCCTCTAGACCAGCCAGGACAG

AAATGCCTCGACTTCGCTGCTGCCCAAGGTTGCCGGGTGACGCACACCGTGG

AAACGGATGAAGGCACGAACCCAGTGGACATAAGCCTGTTCGGTTCGTAAGC

TGTAATGCAAGTAGCGTATGCGCTCACGCAACTGGTCCAGAACCTTGACCGA

ACGCAGCGGTGGTAACGGCGCAGTGGCGGTTTTCATGGCTTGTTATGACTGTT

TTTTTGGGGTACAGTCTATGCCTCGGGCATCCAAGCAGCAAGCGCGTTACGCC

GTGGGTCGATGTTTGATGTTATGGAGCAGCAACGATGTTACGCAGCAGGGCA

GTCGCCCTAAAACAAAGTTAAACATCATGAGGGAAGCGGTGATCGCCGAAGT

ATCGACTCAACTATCAGAGGTAGTTGGCGTCATCGAGCGCCATCTCGAACCG

ACGTTGCTGGCCGTACATTTGTACGGCTCCGCAGTGGATGGCGGCCTGAAGC

CACACAGTGATATTGATTTGCTGGTTACGGTGACCGTAAGGCTTGATGAAAC

AACGCGGCGAGCTTTGATCAACGACCTTTTGGAAACTTCGGCTTCCCCTGGAG

AGAGCGAGATTCTCCGCGCTGTAGAAGTCACCATTGTTGTGCACGACGACAT

CATTCCGTGGCGTTATCCAGCTAAGCGCGAACTGCAATTTGGAGAATGGCAG

CGCAATGACATTCTTGCAGGTATCTTCGAGCCAGCCACGATCGACATTGATCT

GGCTATCTTGCTGACAAAAGCAAGAGAACATAGCGTTGCCTTGGTAGGTCCA

GCGGCGGAGGAACTCTTTGATCCGGTTCCTGAACAGGATCTATTTGAGGCGCT

AAATGAAACCTTAACGCTATGGAACTCGCCGCCCGACTGGGCTGGCGATGAG

CGAAATGTAGTGCTTACGTTGTCCCGCATTTGGTACAGCGCAGTAACCGGCA

AAATCGCGCCGAAGGATGTCGCTGCCGACTGGGCAATGGAGCGCCTGCCGGC

CCAGTATCAGCCCGTCATACTTGAAGCTAGACAGGCTTATCTTGGACAAGAA

GAAGATCGCTTGGCCTCGCGCGCAGATCAGTTGGAAGAATTTGTCCACTACG

TGAAAGGCGAGATCACCAAGGTAGTCGGCAAATAACCCTCGAGCCACCCATG

ACCAAAATCCCTTAACGTGAGTTACGCGTCGTTCCACTGAGCGTCAGACCCCG

TAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGC

TGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATC

AAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGAT

ACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACT

CTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCT

GCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTAC

CGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCA GCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTG

AGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAG

CGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGC

CTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATT

TTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCG

GCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTG

CGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGAT

ACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAA

GCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGC

[0144] SEQ ID NO: 29 (expression cassette from pCLS3)

GATCATGAGCGGAGAATTAAGGGAGTCACGTTATGACCCCCGCCGATGACGC

GGGACAAGCCGTTTTACGTTTGGAACTGACAGAACCGCAACGTTGAAGGAGC

CACTCAGCCGCGGGTTTCTGGAGTTTAATGAGCTAAGCACATACGTCAGAAA

CCATTATTGCGCGTTCAAAAGTCGCCTAAGGTCACTATCAGCTAGCAAATATT

TCTTGTCAAAAATGCTCCACTGACGTTCCATAAATTCCCCTCGGTATCCAATT

AGAGTCTCATATTCACTCTCAATCCAAATAATCTGCACCGGATCTCGCCCTTA

CCTGCTAGTCATGGGCGATCCTAAAAAGAAACGTAAGGTCATCGATTACCCA

TACGATGTTCCAGATTACGCTATGGCTCCTAAGAAGAAGAGAAAGGTTATAA

CAATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGT

CGAGCTGGACGGCGACGTAAACGGCCACAAGTTCAGCGTGTCCGGCGAGGGC

GAGGGCGATGCCACCTACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCG

GCAAGCTGCCCGTGCCCTGGCCCACCCTCGTGACCACCTTCGGCTACGGCCTG

CAGTGCTTCGCCCGCTACCCCGACCACATGAAGCAGCACGACTTCTTCAAGTC

CGCCATGCCCGAAGGCTACGTCCAGGAGCGCACCATCTTCTTCAAGGACGAC

GGCAACTACAAGACCCGCGCCGAGGTGAAGTTCGAGGGCGACACCCTGGTGA

ACCGCATCGAGCTGAAGGGCATCGACTTCAAGGAGGACGGCAACATCCTGGG

GCACAAGCTGGAGTACAACTACAACAGCCACAACGTCTATATCATGGCCGAC

AAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAACATCGAG

GACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGCG

ACGGCCCCGTGCTGCTGCCCGACAACCACTACCTGAGCTACCAGTCCGCCCTG

AGCAAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCGTGA

CCGCCGCCGGGATCACTCTCGGCATGGACGAGCTGTACAAGCCGCGGTTCCC GGGAGATCTTGGAGGGGGCGGTAGCGGCGGTGGCGGGAGCATCGATATCGC

CGATCTACGCACGCTCGGCTACAGCCAGCAGCAACAGGAGAAGATCAAACCG

AAGGTTCGTTCGACAGTGGCGCAGCACCACGAGGCACTGGTCGGCCACGGGT

TTACACACGCGCACATCGTTGCGTTAAGCCAACACCCGGCAGCGTTAGGGAC

CGTCGCTGTCAAGTATCAGGACATGATCGCAGCGTTGCCAGAGGCGACACAC

GAAGCGATCGTTGGCGTCGGCAAACAGTGGTCCGGCGCACGCGCTCTGGAGG

CCTTGCTCACGGTGGCGGGAGAGTTGAGAGGTCCACCGTTACAGTTGGACAC

AGGCCAACTTCTCAAGATTGCAAAACGTGGCGGCGTGACCGCAGTGGAGGCA

GTGCATGCATGGCGCAATGCACTGACGGGTGCCCCGCTCAACTTGACCCCCC

AGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGA

CGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGA

GCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGAC

GGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAG

CAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACG

GTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGC

AGGTGGTGGCCATCGCCAGCAATATTGGTGGCAAGCAGGCGCTGGAGACGGT

GCAGGCGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAG

GTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCC

AGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGT

GGTGGCCATCGCCAGCAATATTGGTGGCAAGCAGGCGCTGGAGACGGTGCAG

GCGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGG

TGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCG

GCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTG

GCCATCGCCAGCAATATTGGTGGCAAGCAGGCGCTGGAGACGGTGCAGGCGC

TGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGC

CATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTG

TTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCA

TCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTT

GCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATC

GCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGC

CGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGC

CAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCG GTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCA

GCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGT

GCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGC

CACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGC

TGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAA

TGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTG

TGCCAGGCCCACGGCTTGACCCCTCAGCAGGTGGTGGCCATCGCCAGCAATG

GCGGCGGCAGGCCGGCGCTGGAGAGCATTGTTGCCCAGTTATCTCGCCCTGA

TCCGGCGTTGGCCGCGTTGACCAACGACCACCTCGTCGCCTTGGCCTGCCTCG

GCGGGCGTCCTGCGCTGGATGCAGTGAAAAAGGGATTGGGGGATCCTATCAG

CCGTTCCCAGCTGGTGAAGTCCGAGCTGGAGGAGAAGAAATCCGAGTTGAGG

CACAAGCTGAAGTACGTGCCCCACGAGTACATCGAGCTGATCGAGATCGCCC

GGAACAGCACCCAGGACCGTATCCTGGAGATGAAGGTGATGGAGTTCTTCAT

GAAGGTGTACGGCTACAGGGGCAAGCACCTGGGCGGCTCCAGGAAGCCCGA

CGGCGCCATCTACACCGTGGGCTCCCCCATCGACTACGGCGTGATCGTGGAC

ACCAAGGCCTACTCCGGCGGCTACAACCTGCCCATCGGCCAGGCCGACGAAA

TGCAGAGGTACGTGGAGGAGAACCAGACCAGGAACAAGCACATCAACCCCA

ACGAGTGGTGGAAGGTGTACCCCTCCAGCGTGACCGAGTTCAAGTTCCTGTTC

GTGTCCGGCCACTTCAAGGGCAACTACAAGGCCCAGCTGACCAGGCTGAACC

ACATCACCAACTGCAACGGCGCCGTGCTGTCCGTGGAGGAGCTCCTGATCGG

CGGCGAGATGATCAAGGCCGGCACCCTGACCCTGGAGGAGGTGAGGAGGAA

GTTCAACAACGGCGAGATCAACTTCGCGGCCGACTGATAACTCGAGAAGGGC

GCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCC

GGTCTTGCGATGATTATCATATAATTTCTGTTGAATTACGTTAAGCATGTAAT

AATTAACATGTAATGCATGACGTTATTTATGAGATGGGTTTTTATGATTAGAG

TCCCGC A ATT AT AC ATTT A AT ACGCG AT AG A A AAC A A A AT ATAGCGCGC A A A

CTAGGATAAATTATCGCGCGCGGTGTCATCTATGTTACTAGATCGGGAATTCG

TAATCATGGTCATAGC

[0145] SEP ID NO: 30 (full map of PCLS4)

CGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAG

TGAGCGCAACGCAATTAATACGCGTACCGCTAGCCAGGAAGAGTTTGTAGAA

ACGCAAAAAGGCCATCCGTCAGGATGGCCTTCTGCTTAGTTTGATGCCTGGCA GTTTATGGCGGGCGTCCTGCCCGCCACCCTCCGGGCCGTTGCTTCACAACGTT

CAAATCCGCTCCCGGCGGATTTGTCCTACTCAGGAGAGCGTTCACCGACAAA

CAACAGATAAAACGAAAGGCCCAGTCTTCCGACTGAGCCTTTCGTTTTATTTG

ATGCCTGGCAGTTCCCTACTCTCGCGTTCGAATACATCTAGATCCAAGTACAT

GGCAAATAATGATTTTATTTTGACTGATAGTGACCTGTTCGTTGCAACAAATT

G ATG AGC A AT GCTTTTTT ATA ATGCC A ACTTTGT AT AC A AA AGTT GT AGGT AC

CTCGCGAATGCATCTAGATCCAATGATCATGAGCGGAGAATTAAGGGAGTCA

CGTTATGACCCCCGCCGATGACGCGGGACAAGCCGTTTTACGTTTGGAACTG

ACAGAACCGCAACGTTGAAGGAGCCACTCAGCCGCGGGTTTCTGGAGTTTAA

TGAGCTAAGCACATACGTCAGAAACCATTATTGCGCGTTCAAAAGTCGCCTA

AGGTCACTATCAGCTAGCAAATATTTCTTGTCAAAAATGCTCCACTGACGTTC

CATAAATTCCCCTCGGTATCCAATTAGAGTCTCATATTCACTCTCAATCCAAA

TAATCTGCACCGGATCTCGCCCTTACCTGCTAGTCATGGGCGATCCTAAAAAG

AAACGTAAGGTCATCGATAAGGAGACTGCCGCTGCCAAGTTCGAGAGACAGC

ACATGGACAGCATGGTGTCTAAGGGCGAAGAGCTGATTAAGGAGAACATGC

ACATGAAGCTGTACATGGAGGGCACCGTGAACAACCACCACTTCAAGTGCAC

ATCCGAGGGCGAAGGCAAGCCCTACGAGGGCACCCAGACCATGAGAATCAA

GGTGGTCGAGGGCGGCCCTCTCCCCTTCGCCTTCGACATCCTGGCTACCAGCT

TCATGTACGGCAGCAGAACCTTCATCAACCACACCCAGGGCATCCCCGACTT

CTTTAAGCAGTCCTTCCCTGAGGGCTTCACATGGGAGAGAGTCACCACATAC

GAAGACGGGGGCGTGCTGACCGCTACCCAGGACACCAGCCTCCAGGACGGCT

GCCTCATCTACAACGTCAAGATCAGAGGGGTGAACTTCCCATCCAACGGCCC

TGTGATGCAGAAGAAAACACTCGGCTGGGAGGCCAACACCGAGATGCTGTAC

CCCGCTGACGGCGGCCTGGAAGGCAGAAGCGACATGGCCCTGAAGCTCGTGG

GCGGGGGCCACCTGATCTGCAACTTCAAGACCACATACAGATCCAAGAAACC

CGCTAAGAACCTCAAGATGCCCGGCGTCTACTATGTGGACCACAGACTGGAA

AGAATCAAGGAGGCCGACAAAGAGACGTACGTCGAGCAGCACGAGGTGGCT

GTGGCCAGATACTGCGACCTCCCTAGCAAACTGGGGCACAAACTTAATGGAG

GGGGCGGTAGCGGCGGTGGCGGGAGCATCGATATCGCCGATCTACGCACGCT

CGGCTACAGCCAGCAGCAACAGGAGAAGATCAAACCGAAGGTTCGTTCGAC

AGTGGCGCAGCACCACGAGGCACTGGTCGGCCACGGGTTTACACACGCGCAC

ATCGTTGCGTTAAGCCAACACCCGGCAGCGTTAGGGACCGTCGCTGTCAAGT ATCAGGACATGATCGCAGCGTTGCCAGAGGCGACACACGAAGCGATCGTTGG

CGTCGGCAAACAGTGGTCCGGCGCACGCGCTCTGGAGGCCTTGCTCACGGTG

GCGGGAGAGTTGAGAGGTCCACCGTTACAGTTGGACACAGGCCAACTTCTCA

AGATTGCAAAACGTGGCGGCGTGACCGCAGTGGAGGCAGTGCATGCATGGCG

CAATGCACTGACGGGTGCCCCGCTCAACTTGACCCCCCAGCAGGTGGTGGCC

ATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGT

TGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCAT

CGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTG

CCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCG

CCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCC

GGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCC

AGCAATATTGGTGGCAAGCAGGCGCTGGAGACGGTGCAGGCGCTGTTGCCGG

TGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAG

CAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTG

CTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCA

ATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCT

GTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAAT

AATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGT

GCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGA

TGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGC

CAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATGGCG

GTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCA

GGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGT

GGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGG

CCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGTGG

CAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCC

CACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCA

AGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCA

CGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAG

CAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACG

GCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCA

GGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGC TTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGG

CGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTT

GACCCCTCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGCGGCAGGCCGGCG

CTGGAGAGCATTGTTGCCCAGTTATCTCGCCCTGATCCGGCGTTGGCCGCGTT

GACCAACGACCACCTCGTCGCCTTGGCCTGCCTCGGCGGGCGTCCTGCGCTGG

ATGCAGTGAAAAAGGGATTGGGGGATCCTATCAGCCGTTCCCAGCTGGTGAA

GTCCGAGCTGGAGGAGAAGAAATCCGAGTTGAGGCACAAGCTGAAGTACGT

GCCCCACGAGTACATCGAGCTGATCGAGATCGCCCGGAACAGCACCCAGGAC

CGTATCCTGGAGATGAAGGTGATGGAGTTCTTCATGAAGGTGTACGGCTACA

GGGGCAAGCACCTGGGCGGCTCCAGGAAGCCCGACGGCGCCATCTACACCGT

GGGCTCCCCCATCGACTACGGCGTGATCGTGGACACCAAGGCCTACTCCGGC

GGCTACAACCTGCCCATCGGCCAGGCCGACGAAATGCAGAGGTACGTGGAGG

AGAACCAGACCAGGAACAAGCACATCAACCCCAACGAGTGGTGGAAGGTGT

ACCCCTCCAGCGTGACCGAGTTCAAGTTCCTGTTCGTGTCCGGCCACTTCAAG

GGCAACTACAAGGCCCAGCTGACCAGGCTGAACCACATCACCAACTGCAACG

GCGCCGTGCTGTCCGTGGAGGAGCTCCTGATCGGCGGCGAGATGATCAAGGC

CGGCACCCTGACCCTGGAGGAGGTGAGGAGGAAGTTCAACAACGGCGAGAT

CAACTTCGCGGCCGACTGATAACTCGAGAAGGGCGCGATCGTTCAAACATTT

GGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCCGGTCTTGCGATGATTATC

ATATAATTTCTGTTGAATTACGTTAAGCATGTAATAATTAACATGTAATGCAT

GACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATTT

AATACGCGATAGAAAACAAAATATAGCGCGCAAACTAGGATAAATTATCGCG

CGCGGTGTCATCTATGTTACTAGATCGGGAATTCGTAATCATGGTCATAGCAT

TGGATCGGATCCCGGGCCCGTCGACTGCAGAGGCCTGCATGCAAACCAGCTT

TCTTGTACAAAGTTGGCATTATAAGAAAGCATTGCTTATCAATTTGTTGCAAC

GAACAGGTCACTATCAGTCAAAATAAAATCATTATTTGTCGATTGTCTTCATC

GGATCCCATCCCCTATAGTGAGTCGTATTACATGGTCATAGCTGTTTCCTGGC

AGCTCTGGCCCGTGTCTCAAAATCTCTGATGTTACATTGCACAAGATAAAAAT

ATATCATCATGCCTCCTCTAGACCAGCCAGGACAGAAATGCCTCGACTTCGCT

GCTGCCCAAGGTTGCCGGGTGACGCACACCGTGGAAACGGATGAAGGCACG

AACCCAGTGGACATAAGCCTGTTCGGTTCGTAAGCTGTAATGCAAGTAGCGT

ATGCGCTCACGCAACTGGTCCAGAACCTTGACCGAACGCAGCGGTGGTAACG GCGCAGTGGCGGTTTTCATGGCTTGTTATGACTGTTTTTTTGGGGTACAGTCT

ATGCCTCGGGCATCCAAGCAGCAAGCGCGTTACGCCGTGGGTCGATGTTTGA

TGTTATGGAGCAGCAACGATGTTACGCAGCAGGGCAGTCGCCCTAAAACAAA

GTTAAACATCATGAGGGAAGCGGTGATCGCCGAAGTATCGACTCAACTATCA

GAGGTAGTTGGCGTCATCGAGCGCCATCTCGAACCGACGTTGCTGGCCGTAC

ATTTGTACGGCTCCGCAGTGGATGGCGGCCTGAAGCCACACAGTGATATTGA

TTTGCTGGTTACGGTGACCGTAAGGCTTGATGAAACAACGCGGCGAGCTTTG

ATCAACGACCTTTTGGAAACTTCGGCTTCCCCTGGAGAGAGCGAGATTCTCCG

CGCTGTAGAAGTCACCATTGTTGTGCACGACGACATCATTCCGTGGCGTTATC

CAGCTAAGCGCGAACTGCAATTTGGAGAATGGCAGCGCAATGACATTCTTGC

AGGTATCTTCGAGCCAGCCACGATCGACATTGATCTGGCTATCTTGCTGACAA

AAGCAAGAGAACATAGCGTTGCCTTGGTAGGTCCAGCGGCGGAGGAACTCTT

TGATCCGGTTCCTGAACAGGATCTATTTGAGGCGCTAAATGAAACCTTAACGC

TATGGAACTCGCCGCCCGACTGGGCTGGCGATGAGCGAAATGTAGTGCTTAC

GTTGTCCCGCATTTGGTACAGCGCAGTAACCGGCAAAATCGCGCCGAAGGAT

GTCGCTGCCGACTGGGCAATGGAGCGCCTGCCGGCCCAGTATCAGCCCGTCA

TACTTGAAGCTAGACAGGCTTATCTTGGACAAGAAGAAGATCGCTTGGCCTC

GCGCGCAGATCAGTTGGAAGAATTTGTCCACTACGTGAAAGGCGAGATCACC

AAGGTAGTCGGCAAATAACCCTCGAGCCACCCATGACCAAAATCCCTTAACG

TGAGTTACGCGTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGG

ATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAA

AACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTT

TTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCT

AGTGT AGCCGT AGTT AGGCC ACC ACTTC A AG A ACTCTGT AGC ACCGCCT AC AT

ACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCG

TGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGT

CGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTA

CACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCC

GAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGA

GAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTG

TCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGG

GGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGG CCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGT

GGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGA

ACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATA

CGCAAACCGCCTCTCCCCGCGCGTTGGC

[0146] SEQ ID NO: 31 (expression cassette from pCLS4)

GATCATGAGCGGAGAATTAAGGGAGTCACGTTATGACCCCCGCCGATGACGC

GGGACAAGCCGTTTTACGTTTGGAACTGACAGAACCGCAACGTTGAAGGAGC

CACTCAGCCGCGGGTTTCTGGAGTTTAATGAGCTAAGCACATACGTCAGAAA

CCATTATTGCGCGTTCAAAAGTCGCCTAAGGTCACTATCAGCTAGCAAATATT

TCTTGTCAAAAATGCTCCACTGACGTTCCATAAATTCCCCTCGGTATCCAATT

AGAGTCTCATATTCACTCTCAATCCAAATAATCTGCACCGGATCTCGCCCTTA

CCTGCTAGTCATGGGCGATCCTAAAAAGAAACGTAAGGTCATCGATAAGGAG

ACTGCCGCTGCCAAGTTCGAGAGACAGCACATGGACAGCATGGTGTCTAAGG

GCGAAGAGCTGATTAAGGAGAACATGCACATGAAGCTGTACATGGAGGGCA

CCGTGAACAACCACCACTTCAAGTGCACATCCGAGGGCGAAGGCAAGCCCTA

CGAGGGCACCCAGACCATGAGAATCAAGGTGGTCGAGGGCGGCCCTCTCCCC

TTCGCCTTCGACATCCTGGCTACCAGCTTCATGTACGGCAGCAGAACCTTCAT

CAACCACACCCAGGGCATCCCCGACTTCTTTAAGCAGTCCTTCCCTGAGGGCT

TCACATGGGAGAGAGTCACCACATACGAAGACGGGGGCGTGCTGACCGCTAC

CCAGGACACCAGCCTCCAGGACGGCTGCCTCATCTACAACGTCAAGATCAGA

GGGGTGAACTTCCCATCCAACGGCCCTGTGATGCAGAAGAAAACACTCGGCT

GGGAGGCCAACACCGAGATGCTGTACCCCGCTGACGGCGGCCTGGAAGGCA

GAAGCGACATGGCCCTGAAGCTCGTGGGCGGGGGCCACCTGATCTGCAACTT

CAAGACCACATACAGATCCAAGAAACCCGCTAAGAACCTCAAGATGCCCGGC

GTCTACTATGTGGACCACAGACTGGAAAGAATCAAGGAGGCCGACAAAGAG

ACGTACGTCGAGCAGCACGAGGTGGCTGTGGCCAGATACTGCGACCTCCCTA

GCAAACTGGGGCACAAACTTAATGGAGGGGGCGGTAGCGGCGGTGGCGGGA

GCATCGATATCGCCGATCTACGCACGCTCGGCTACAGCCAGCAGCAACAGGA

GAAGATCAAACCGAAGGTTCGTTCGACAGTGGCGCAGCACCACGAGGCACTG

GTCGGCCACGGGTTTACACACGCGCACATCGTTGCGTTAAGCCAACACCCGG

CAGCGTTAGGGACCGTCGCTGTCAAGTATCAGGACATGATCGCAGCGTTGCC

AGAGGCGACACACGAAGCGATCGTTGGCGTCGGCAAACAGTGGTCCGGCGC ACGCGCTCTGGAGGCCTTGCTCACGGTGGCGGGAGAGTTGAGAGGTCCACCG

TTACAGTTGGACACAGGCCAACTTCTCAAGATTGCAAAACGTGGCGGCGTGA

CCGCAGTGGAGGCAGTGCATGCATGGCGCAATGCACTGACGGGTGCCCCGCT

CAACTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAG

CAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACG

GCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCA

GGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGC

TTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGG

CGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTT

GACCCCGGAGCAGGTGGTGGCCATCGCCAGCAATATTGGTGGCAAGCAGGCG

CTGGAGACGGTGCAGGCGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGA

CCCCCCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCT

GGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACC

CCCCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGG

AGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCC

CCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAG

ACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGG

AGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGA

CGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCA

GCAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGAC

GGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAG

CAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACG

GTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGC

AGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGG

TCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCA

GGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTC

CAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGG

TGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCA

GCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTG

GTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGC

GGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGT

GGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCG GCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCTCAGCAGGTGGTG

GCCATCGCCAGCAATGGCGGCGGCAGGCCGGCGCTGGAGAGCATTGTTGCCC

AGTTATCTCGCCCTGATCCGGCGTTGGCCGCGTTGACCAACGACCACCTCGTC

GCCTTGGCCTGCCTCGGCGGGCGTCCTGCGCTGGATGCAGTGAAAAAGGGAT

TGGGGGATCCTATCAGCCGTTCCCAGCTGGTGAAGTCCGAGCTGGAGGAGAA

GAAATCCGAGTTGAGGCACAAGCTGAAGTACGTGCCCCACGAGTACATCGAG

CTGATCGAGATCGCCCGGAACAGCACCCAGGACCGTATCCTGGAGATGAAGG

TGATGGAGTTCTTCATGAAGGTGTACGGCTACAGGGGCAAGCACCTGGGCGG

CTCCAGGAAGCCCGACGGCGCCATCTACACCGTGGGCTCCCCCATCGACTAC

GGCGTGATCGTGGACACCAAGGCCTACTCCGGCGGCTACAACCTGCCCATCG

GCCAGGCCGACGAAATGCAGAGGTACGTGGAGGAGAACCAGACCAGGAACA

AGCACATCAACCCCAACGAGTGGTGGAAGGTGTACCCCTCCAGCGTGACCGA

GTTCAAGTTCCTGTTCGTGTCCGGCCACTTCAAGGGCAACTACAAGGCCCAGC

TGACCAGGCTGAACCACATCACCAACTGCAACGGCGCCGTGCTGTCCGTGGA

GGAGCTCCTGATCGGCGGCGAGATGATCAAGGCCGGCACCCTGACCCTGGAG

GAGGTGAGGAGGAAGTTCAACAACGGCGAGATCAACTTCGCGGCCGACTGAT

AACTCGAGAAGGGCGCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGA

TTGAATCCTGTTGCCGGTCTTGCGATGATTATCATATAATTTCTGTTGAATTAC

GTTAAGCATGTAATAATTAACATGTAATGCATGACGTTATTTATGAGATGGGT

TTTTATGATTAGAGTCCCGCAATTATACATTTAATACGCGATAGAAAACAAAA

TATAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTCATCTATGTTACTA

GATCGGGAATTCGTAATCATGGTCATAGC

[0147] SEP ID NO: 32 (full map of pCLS14)

TGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGT

CAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGC

GTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTT

GCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGA

GCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTT

CAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAG

TGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACG

ATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACA

CAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTG AGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATC

CGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGG

GAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAG

CGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCA

GCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGT

TCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGT

GAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAG

CGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGG

CCGATTCATTAATGCAGGTTAACCTGGCTTATCGAAATTAATACGACTCACTA

TAGGGAGCCCGGCAGATCTGATCTCTTGAACTTTCCAAGAGTTGAAGAAAAT

CACAGAAAGCCTTAGCACAGAGAAGAGAGATTGAAGAAGTCGACGGCCATC

GCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGC

CGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGC

CAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCG

GTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCA

GCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGT

GCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGC

AATATTGGTGGCAAGCAGGCGCTGGAGACGGTGCAGGCGCTGTTGCCGGTGC

TGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAA

TAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTG

TGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCAATA

TTGGTGGCAAGCAGGCGCTGGAGACGGTGCAGGCGCTGTTGCCGGTGCTGTG

CCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGAT

GGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCC

AGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCAATATTGG

TGGCAAGCAGGCGCTGGAGACGGTGCAGGCGCTGTTGCCGGTGCTGTGCCAG

GCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCG

GCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGC

CCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGC

AAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCC

ACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAA

GCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCAC GGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGC

AGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGG

CTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAG

GCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCT

TGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGC

GCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTG

ACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGC

TGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGAC

CCCTCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGCGGCAGGCCGGCGCTG

GAGAGCATTGTTGCCCAGTTATCTCGCCCTGATCCGGCGTTGGCCGCGTTGAC

CAACGACCACCTCGTCGCCTTGGCCTGCCTCGGCGGGCGTCCTGCGCTGGATG

CAGTGAAAAAGGGATTGGGGGATCCTATCAGCCGTTCCCAGCTGGTGAAGTC

CGAGCTGGAGGAGAAGAAATCCGAGTTGAGGCACAAGCTGAAGTACGTGCC

CCACGAGTACATCGAGCTGATCGAGATCGCCCGGAACAGCACCCAGGACCGT

ATCCTGGAGATGAAGGTGATGGAGTTCTTCATGAAGGTGTACGGCTACAGGG

GCAAGCACCTGGGCGGCTCCAGGAAGCCCGACGGCGCCATCTACACCGTGGG

CTCCCCCATCGACTACGGCGTGATCGTGGACACCAAGGCCTACTCCGGCGGC

TACAACCTGCCCATCGGCCAGGCCGACGAAATGCAGAGGTACGTGGAGGAG

AACCAGACCAGGAACAAGCACATCAACCCCAACGAGTGGTGGAAGGTGTAC

CCCTCCAGCGTGACCGAGTTCAAGTTCCTGTTCGTGTCCGGCCACTTCAAGGG

CAACTACAAGGCCCAGCTGACCAGGCTGAACCACATCACCAACTGCAACGGC

GCCGTGCTGTCCGTGGAGGAGCTCCTGATCGGCGGCGAGATGATCAAGGCCG

GCACCCTGACCCTGGAGGAGGTGAGGAGGAAGTTCAACAACGGCGAGATCA

ACTTCGCGGCCGACTGATAACCATGGAGAGGATATATATGTACATATGCAAA

GGGATATCAAGACCATCTGTAATCTTTTGAAGTTTTGTGAAGCTATAGAAGCC

AAGCAAGAATTCTACCAGATTACTTCCCAAATAAGTGGTGTGAATGTAAATT

AATAAGAGCTACAGAAACATTGATTGGCTCAGTGTATGTGTTGTATTCATATT

CGTTGTTTTATTTTATACGGTTGAGAATTGAATAATGTTGTTGCATCAAATCA

CTATGAAGGACATTTACAGTCAGCTGCTCGATCGAGGCGGCCAACAACAAAA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

AAAAAAAAAAAGAAAAGCCAATTGGGATCNNAGTTCTATAGTGTCACCTAAA

TCGTATGTGTATGATACATAAGGTTATGTATTAATTGTAGCCGCGTTCTAACG ACAATATGTCCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG

TTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTT

GTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGC

ATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGACGAAAGGGCC

TCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAG

ACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATT

TTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAA

TGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTC

GCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAA

ACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTT

ACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGA

AGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTAT

TATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCT

CAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATG

GCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACAC

TGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCT

TTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGA

GCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCA

ATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTC

CCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTT

CTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGG

TGAGCGTGGATCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCC

TCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAAC

GAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACT

GTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTA

ATTT A A A AGG ATCT AGGTGA AG ATCCTTTT

[0148] SEQ ID NO: 33 (expression cassette from pCLS14)

TAATACGACTCACTATAGGGAGCCCGGCAGATCTGATCTCTTGAACTTTCCAA

GAGTTGAAGAAAATCACAGAAAGCCTTAGCACAGAGAAGAGAGATTGAAGA

AGTCGACGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGT

CCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAG

GTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCC AGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGT

GGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAG

CGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGG

TGGCCATCGCCAGCAATATTGGTGGCAAGCAGGCGCTGGAGACGGTGCAGGC

GCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTG

GCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGC

TGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGC

CATCGCCAGCAATATTGGTGGCAAGCAGGCGCTGGAGACGGTGCAGGCGCTG

TTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCA

TCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTT

GCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATC

GCCAGCAATATTGGTGGCAAGCAGGCGCTGGAGACGGTGCAGGCGCTGTTGC

CGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGC

CAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCG

GTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCA

GCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGT

GCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGC

AATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGC

TGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCA

CGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTG

TGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACG

ATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTG

CCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCCACGAT

GGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCC

AGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATGGCGG

TGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAG

GCCCACGGCTTGACCCCTCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGCG

GCAGGCCGGCGCTGGAGAGCATTGTTGCCCAGTTATCTCGCCCTGATCCGGC

GTTGGCCGCGTTGACCAACGACCACCTCGTCGCCTTGGCCTGCCTCGGCGGGC

GTCCTGCGCTGGATGCAGTGAAAAAGGGATTGGGGGATCCTATCAGCCGTTC

CCAGCTGGTGAAGTCCGAGCTGGAGGAGAAGAAATCCGAGTTGAGGCACAA

GCTGAAGTACGTGCCCCACGAGTACATCGAGCTGATCGAGATCGCCCGGAAC AGCACCCAGGACCGTATCCTGGAGATGAAGGTGATGGAGTTCTTCATGAAGG

TGTACGGCTACAGGGGCAAGCACCTGGGCGGCTCCAGGAAGCCCGACGGCGC

CATCTACACCGTGGGCTCCCCCATCGACTACGGCGTGATCGTGGACACCAAG

GCCTACTCCGGCGGCTACAACCTGCCCATCGGCCAGGCCGACGAAATGCAGA

GGTACGTGGAGGAGAACCAGACCAGGAACAAGCACATCAACCCCAACGAGT

GGTGGAAGGTGTACCCCTCCAGCGTGACCGAGTTCAAGTTCCTGTTCGTGTCC

GGCCACTTCAAGGGCAACTACAAGGCCCAGCTGACCAGGCTGAACCACATCA

CCAACTGCAACGGCGCCGTGCTGTCCGTGGAGGAGCTCCTGATCGGCGGCGA

GATGATCAAGGCCGGCACCCTGACCCTGGAGGAGGTGAGGAGGAAGTTCAA

CAACGGCGAGATCAACTTCGCGGCCGACTGATAACCATGGAGAGGATATATA

TGTACATATGCAAAGGGATATCAAGACCATCTGTAATCTTTTGAAGTTTTGTG

AAGCTATAGAAGCCAAGCAAGAATTCTACCAGATTACTTCCCAAATAAGTGG

TGTGAATGTAAATTAATAAGAGCTACAGAAACATTGATTGGCTCAGTGTATG

TGTTGTATTCATATTCGTTGTTTTATTTTATACGGTTGAGAATTGAATAATGTT

GTTGCATCAAATCACTATGAAGGACATTTACAGTCAGCTGCTCGATCGAGGC

GGCCAACAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

AAAAAAAAAAAAAAAAAAAAAAAAGAAAAGCCAATTGGGATCNNAGTTCTA

TAGTGTCACCTAAATCGTATGTGTATGATACATAAGGTTATGTATTAATTGTA

GCCGCGTTCTAACGACAATATGTCCATATGGTGCACTCTCAGTACAATCTGCT

CTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGC

GCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTA

[0149] SEP ID NO: 34 (full map of pCLS15)

TGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGT

CAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGC

GTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTT

GCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGA

GCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTT

CAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAG

TGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACG

ATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACA

CAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTG

AGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATC CGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGG

GAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAG

CGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCA

GCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGT

TCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGT

GAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAG

CGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGG

CCGATTCATTAATGCAGGTTAACCTGGCTTATCGAAATTAATACGACTCACTA

TAGGGAGCCCGGCAGATCTGATCTCTTGAACTTTCCAAGAGTTGAAGAAAAT

CACAGAAAGCCTTAGCACAGAGAAGAGAGATTGAAGAAGTCGACATGGGCG

ATCCTAAAAAGAAACGTAAGGTCATCGATAAGGAGACTGCCGCTGCCAAGTT

CGAGAGACAGCACATGGACAGCATGGTGTCTAAGGGCGAAGAGCTGATTAA

GGAGAACATGCACATGAAGCTGTACATGGAGGGCACCGTGAACAACCACCA

CTTCAAGTGCACATCCGAGGGCGAAGGCAAGCCCTACGAGGGCACCCAGACC

ATGAGAATCAAGGTGGTCGAGGGCGGCCCTCTCCCCTTCGCCTTCGACATCCT

GGCTACCAGCTTCATGTACGGCAGCAGAACCTTCATCAACCACACCCAGGGC

ATCCCCGACTTCTTTAAGCAGTCCTTCCCTGAGGGCTTCACATGGGAGAGAGT

CACCACATACGAAGACGGGGGCGTGCTGACCGCTACCCAGGACACCAGCCTC

CAGGACGGCTGCCTCATCTACAACGTCAAGATCAGAGGGGTGAACTTCCCAT

CCAACGGCCCTGTGATGCAGAAGAAAACACTCGGCTGGGAGGCCAACACCG

AGATGCTGTACCCCGCTGACGGCGGCCTGGAAGGCAGAAGCGACATGGCCCT

GAAGCTCGTGGGCGGGGGCCACCTGATCTGCAACTTCAAGACCACATACAGA

TCCAAGAAACCCGCTAAGAACCTCAAGATGCCCGGCGTCTACTATGTGGACC

ACAGACTGGAAAGAATCAAGGAGGCCGACAAAGAGACGTACGTCGAGCAGC

ACGAGGTGGCTGTGGCCAGATACTGCGACCTCCCTAGCAAACTGGGGCACAA

ACTTAATGGAGGGGGCGGTAGCGGCGGTGGCGGGAGCATCGATATCGCCGAT

CTACGCACGCTCGGCTACAGCCAGCAGCAACAGGAGAAGATCAAACCGAAG

GTTCGTTCGACAGTGGCGCAGCACCACGAGGCACTGGTCGGCCACGGGTTTA

CACACGCGCACATCGTTGCGTTAAGCCAACACCCGGCAGCGTTAGGGACCGT

CGCTGTCAAGTATCAGGACATGATCGCAGCGTTGCCAGAGGCGACACACGAA

GCGATCGTTGGCGTCGGCAAACAGTGGTCCGGCGCACGCGCTCTGGAGGCCT

TGCTCACGGTGGCGGGAGAGTTGAGAGGTCCACCGTTACAGTTGGACACAGG CCAACTTCTCAAGATTGCAAAACGTGGCGGCGTGACCGCAGTGGAGGCAGTG

CATGCATGGCGCAATGCACTGACGGGTGCCCCGCTCAACTTGACCCCCCAGC

AGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGT

CCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAG

GTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCC

AGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGT

GGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAG

CGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGG

TGGCCATCGCCAGCAATATTGGTGGCAAGCAGGCGCTGGAGACGGTGCAGGC

GCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTG

GCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGG

CTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGG

CCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCT

GTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCC

ATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGT

TGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCAT

CGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTG

CCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCG

CCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCC

GGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCC

AGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGG

TGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAG

CAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTG

CTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGGTGGTGGCCATCGCCAGCC

ACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCT

GTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAAT

GGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGT

GCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATGG

CGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGC

CAGGCCCACGGCTTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATG

GTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCA

GGCCCACGGCTTGACCCCTCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGC GGCAGGCCGGCGCTGGAGAGCATTGTTGCCCAGTTATCTCGCCCTGATCCGG

CGTTGGCCGCGTTGACCAACGACCACCTCGTCGCCTTGGCCTGCCTCGGCGGG

CGTCCTGCGCTGGATGCAGTGAAAAAGGGATTGGGGGATCCTATCAGCCGTT

CCCAGCTGGTGAAGTCCGAGCTGGAGGAGAAGAAATCCGAGTTGAGGCACA

AGCTGAAGTACGTGCCCCACGAGTACATCGAGCTGATCGAGATCGCCCGGAA

CAGCACCCAGGACCGTATCCTGGAGATGAAGGTGATGGAGTTCTTCATGAAG

GTGTACGGCTACAGGGGCAAGCACCTGGGCGGCTCCAGGAAGCCCGACGGC

GCCATCTACACCGTGGGCTCCCCCATCGACTACGGCGTGATCGTGGACACCA

AGGCCTACTCCGGCGGCTACAACCTGCCCATCGGCCAGGCCGACGAAATGCA

GAGGTACGTGGAGGAGAACCAGACCAGGAACAAGCACATCAACCCCAACGA

GTGGTGGAAGGTGTACCCCTCCAGCGTGACCGAGTTCAAGTTCCTGTTCGTGT

CCGGCCACTTCAAGGGCAACTACAAGGCCCAGCTGACCAGGCTGAACCACAT

CACCAACTGCAACGGCGCCGTGCTGTCCGTGGAGGAGCTCCTGATCGGCGGC

GAGATGATCAAGGCCGGCACCCTGACCCTGGAGGAGGTGAGGAGGAAGTTC

AACAACGGCGAGATCAACTTCGCGGCCGACTGATAAAGAGGATATATATGTA

CATATGCAAAGGGATATCAAGACCATCTGTAATCTTTTGAAGTTTTGTGAAGC

TATAGAAGCCAAGCAAGAATTCTACCAGATTACTTCCCAAATAAGTGGTGTG

AATGTAAATTAATAAGAGCTACGAAACATTGATTGGCTCAGTGTATGTGTTGT

ATTCATATTCGTTGTTTTATTTTATACGGTTGAGAATTGAATAATGTTGTTGCA

TCAAATCACTATGAAGGACATTTACAGTCAGCTGCTCGATCGAGGCGGCCAA

CAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGA

AGAGCCAATTGGGATCNNAGTTCTATAGTGTCACCTAAATCGTATGTGTATGA

TACATAAGGTTATGTATTAATTGTAGCCGCGTTCTAACGACAATATGTCCATA

TGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCG

ACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCAT

CCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTT

TCACCGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTAT

TTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACT

TTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTC

AAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATT

GAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTT

TTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGT AAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGAT

CTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAAT

GATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACG

CCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGT

TGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGA

GAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTAC

TTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACAT

GGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCC

ATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGT

TGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTA

ATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCC

TTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGATCT

CGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAG

TTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGAT

CGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTT

ACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCT

AGGTGAAGATCCTTTT

[0150] SEP ID NO: 35 (expression cassette from nCLS 15 )

TAATACGACTCACTATAGGGAGCCCGGCAGATCTGATCTCTTGAACTTTCCAA

GAGTTGAAGAAAATCACAGAAAGCCTTAGCACAGAGAAGAGAGATTGAAGA

AGTCGACATGGGCGATCCTAAAAAGAAACGTAAGGTCATCGATAAGGAGACT

GCCGCTGCCAAGTTCGAGAGACAGCACATGGACAGCATGGTGTCTAAGGGCG

AAGAGCTGATTAAGGAGAACATGCACATGAAGCTGTACATGGAGGGCACCGT

GAACAACCACCACTTCAAGTGCACATCCGAGGGCGAAGGCAAGCCCTACGAG

GGCACCCAGACCATGAGAATCAAGGTGGTCGAGGGCGGCCCTCTCCCCTTCG

CCTTCGACATCCTGGCTACCAGCTTCATGTACGGCAGCAGAACCTTCATCAAC

CACACCCAGGGCATCCCCGACTTCTTTAAGCAGTCCTTCCCTGAGGGCTTCAC

ATGGGAGAGAGTCACCACATACGAAGACGGGGGCGTGCTGACCGCTACCCA

GGACACCAGCCTCCAGGACGGCTGCCTCATCTACAACGTCAAGATCAGAGGG

GTGAACTTCCCATCCAACGGCCCTGTGATGCAGAAGAAAACACTCGGCTGGG

AGGCCAACACCGAGATGCTGTACCCCGCTGACGGCGGCCTGGAAGGCAGAA

GCGACATGGCCCTGAAGCTCGTGGGCGGGGGCCACCTGATCTGCAACTTCAA GACCACATACAGATCCAAGAAACCCGCTAAGAACCTCAAGATGCCCGGCGTC

TACTATGTGGACCACAGACTGGAAAGAATCAAGGAGGCCGACAAAGAGACG

TACGTCGAGCAGCACGAGGTGGCTGTGGCCAGATACTGCGACCTCCCTAGCA

AACTGGGGCACAAACTTAATGGAGGGGGCGGTAGCGGCGGTGGCGGGAGCA

TCGATATCGCCGATCTACGCACGCTCGGCTACAGCCAGCAGCAACAGGAGAA

GATCAAACCGAAGGTTCGTTCGACAGTGGCGCAGCACCACGAGGCACTGGTC

GGCCACGGGTTTACACACGCGCACATCGTTGCGTTAAGCCAACACCCGGCAG

CGTTAGGGACCGTCGCTGTCAAGTATCAGGACATGATCGCAGCGTTGCCAGA

GGCGACACACGAAGCGATCGTTGGCGTCGGCAAACAGTGGTCCGGCGCACGC

GCTCTGGAGGCCTTGCTCACGGTGGCGGGAGAGTTGAGAGGTCCACCGTTAC

AGTTGGACACAGGCCAACTTCTCAAGATTGCAAAACGTGGCGGCGTGACCGC

AGTGGAGGCAGTGCATGCATGGCGCAATGCACTGACGGGTGCCCCGCTCAAC

TTGACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGG

CGCTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTT

GACCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCG

CTGGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGA

CCCCCCAGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCT

GGAGACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACC

CCGGAGCAGGTGGTGGCCATCGCCAGCAATATTGGTGGCAAGCAGGCGCTGG

AGACGGTGCAGGCGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCC

CCAGCAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGA

GACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCC

CAGCAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAG

ACGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCC

AGCAGGTGGTGGCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGA

CGGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGA

GCAGGTGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGAC

GGTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAG

CAGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACG

GTCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGC

AGGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGG

TCCAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCA GGTGGTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTC

CAGCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCGGAGCAGG

TGGTGGCCATCGCCAGCCACGATGGCGGCAAGCAGGCGCTGGAGACGGTCCA

GCGGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTG

GTGGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGC

GGCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGT

GGCCATCGCCAGCAATGGCGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCG

GCTGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCCCAGCAGGTGGTG

GCCATCGCCAGCAATAATGGTGGCAAGCAGGCGCTGGAGACGGTCCAGCGGC

TGTTGCCGGTGCTGTGCCAGGCCCACGGCTTGACCCCTCAGCAGGTGGTGGCC

ATCGCCAGCAATGGCGGCGGCAGGCCGGCGCTGGAGAGCATTGTTGCCCAGT

TATCTCGCCCTGATCCGGCGTTGGCCGCGTTGACCAACGACCACCTCGTCGCC

TTGGCCTGCCTCGGCGGGCGTCCTGCGCTGGATGCAGTGAAAAAGGGATTGG

GGGATCCTATCAGCCGTTCCCAGCTGGTGAAGTCCGAGCTGGAGGAGAAGAA

ATCCGAGTTGAGGCACAAGCTGAAGTACGTGCCCCACGAGTACATCGAGCTG

ATCGAGATCGCCCGGAACAGCACCCAGGACCGTATCCTGGAGATGAAGGTGA

TGGAGTTCTTCATGAAGGTGTACGGCTACAGGGGCAAGCACCTGGGCGGCTC

CAGGAAGCCCGACGGCGCCATCTACACCGTGGGCTCCCCCATCGACTACGGC

GTGATCGTGGACACCAAGGCCTACTCCGGCGGCTACAACCTGCCCATCGGCC

AGGCCGACGAAATGCAGAGGTACGTGGAGGAGAACCAGACCAGGAACAAGC

ACATCAACCCCAACGAGTGGTGGAAGGTGTACCCCTCCAGCGTGACCGAGTT

CAAGTTCCTGTTCGTGTCCGGCCACTTCAAGGGCAACTACAAGGCCCAGCTG

ACCAGGCTGAACCACATCACCAACTGCAACGGCGCCGTGCTGTCCGTGGAGG

AGCTCCTGATCGGCGGCGAGATGATCAAGGCCGGCACCCTGACCCTGGAGGA

GGTGAGGAGGAAGTTCAACAACGGCGAGATCAACTTCGCGGCCGACTGATAA

AGAGGATATATATGTACATATGCAAAGGGATATCAAGACCATCTGTAATCTT

TTGAAGTTTTGTGAAGCTATAGAAGCCAAGCAAGAATTCTACCAGATTACTTC

CCAAATAAGTGGTGTGAATGTAAATTAATAAGAGCTACGAAACATTGATTGG

CTCAGTGTATGTGTTGTATTCATATTCGTTGTTTTATTTTATACGGTTGAGAAT

TGAATAATGTTGTTGCATCAAATCACTATGAAGGACATTTACAGTCAGCTGCT

CGATCGAGGCGGCCAACAACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

AAAAAAAAAAAAAAGAAGA