HEALEY JOSEPH (GB)
HAPESHI ALEXIA (GB)
WO2014138324A1 | 2014-09-12
WO1992006204A1 | 1992-04-16
US5223409A | 1993-06-29
JIANG FENG ET AL: "Cryo-EM Structure and Assembly of an Extracellular Contractile Injection System", CELL, vol. 177, no. 2, 4 April 2019 (2019-04-04), pages 370, XP085654989, ISSN: 0092-8674, DOI: 10.1016/J.CELL.2019.02.020
KIM DOHLICH ET AL: "A Substrate-Fusion Protein Is Trapped inside the Type III Secretion System Channel in Shigella flexneri", PLOS PATHOGENS, vol. 10, no. 1, 16 January 2014 (2014-01-16), pages e1003881, XP055151154, DOI: 10.1371/journal.ppat.1003881
ABE A ET AL: "Type-III effectors: Sophisticated bacterial virulence factors", COMPTES RENDUS - BIOLOGIES, ELSEVIER, PARIS, FR, vol. 328, no. 5, 1 May 2005 (2005-05-01), pages 413 - 428, XP027629480, ISSN: 1631-0691, [retrieved on 20050501]
G. BRIONES ET AL: "Cre Reporter System To Monitor the Translocation of Type III Secreted Proteins into Host Cells", INFECTION AND IMMUNITY, vol. 74, no. 2, 20 January 2006 (2006-01-20), pages 1084 - 1090, XP055151152, ISSN: 0019-9567, DOI: 10.1128/IAI.74.2.1084-1090.2006
SORY M-P ET AL: "IDENTIFICATION OF THE YOPE AND YOPH DOMAINS REQUIRED FOR SECRETION AND INTERNALIZATION INTO THE CYTOSOL OF MACROPHAGES USING THE CYAA GENE FUSION APPROACH", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, NATIONAL ACADEMY OF SCIENCES, vol. 92, 1 December 1995 (1995-12-01), pages 11998 - 12002, XP002921387, ISSN: 0027-8424, DOI: 10.1073/PNAS.92.26.11998
"GenBank", Database accession no. FM 162591.1
WILKINSON, FEMS MICROBIOLOGY LETTERS, vol. 309, no. 2, August 2010 (2010-08-01), pages 136 - 143
"The Molecular Biology of Photorhabdus Bacteria", 2017, SPRINGER INTERNATIONAL PUBLISHING AG, pages: 159 - 177
WILKINSON ET AL., BMC GENOMICS, vol. 10, 2009
JULIE D. THOMPSON ET AL.: "CLUSTAL W: Improving the Sensitivity of Progressive Multiple Sequence Alignment Through Sequence Weighting, Position-Specific Gap Penalties and Weight Matrix Choice", NUCLEIC ACIDS RESEARCH, vol. 22, no. 22, 1994, pages 4673 - 4680, XP002956304
OSAMU GOTOH: "Significant Improvement in Accuracy of Multiple Protein Sequence Alignments by Iterative Refinement as Assessed by Reference to Structural Alignments", J. MOL. BIOL., vol. 264, no. 4, 1996, pages 823 - 838
ERIC DEPIEREUX AND ERNEST FEYTMANS: "Match-Box: A Fundamentally New Algorithm for the Simultaneous Alignment of Several Protein Sequences", CABIOS, vol. 8, no. 5, 1992, pages 501 - 509
C. E. LAWRENCE ET AL.: "Detecting Subtle Sequence Signals: A Gibbs Sampling Strategy for Multiple Alignment", SCIENCE, vol. 262, no. 5131, 1993, pages 208 - 214, XP001152872, DOI: 10.1126/science.8211139
IVO VAN WALLE ET AL.: "Align-M - A New Algorithm for Multiple Alignment of Highly Divergent Sequences", BIOINFORMATICS, vol. 20, no. 9, 2004, pages 1428 - 1435
ALTSCHUL ET AL., BULL. MATH. BIO., vol. 48, 1986, pages 603 - 16
HENIKOFF AND HENIKOFF, PROC. NATL. ACAD. SCI. USA, vol. 89, 1992, pages 10915 - 19
ROBERTSON ET AL., J. AM. CHEM. SOC., vol. 113, 1991, pages 2722
ELLMAN ET AL., METHODS ENZYMOL., vol. 202, 1991, pages 301
CHUNG ET AL., SCIENCE, vol. 259, 1993, pages 806 - 9
CHUNG ET AL., PROC. NATL. ACAD. SCI. USA, vol. 90, 1993, pages 10145 - 9
TURCATTI ET AL., J. BIOL. CHEM., vol. 271, 1996, pages 19991 - 8
KOIDE ET AL., BIOCHEM., vol. 33, 1994, pages 7470 - 6
WYNN AND RICHARDS, PROTEIN SCI., vol. 2, 1993, pages 395 - 403
CUNNINGHAM AND WELLS, SCIENCE, vol. 244, 1989, pages 1081 - 5
VOS ET AL., SCIENCE, vol. 255, 1992, pages 306 - 12
SMITH ET AL., J. MOL. BIOL., vol. 224, 1992, pages 899 - 904
WLODAVER ET AL., FEBS LETT., vol. 309, 1992, pages 59 - 64
REIDHAAR-OLSON AND SAUER, SCIENCE, vol. 241, 1988, pages 53 - 7
BOWIE AND SAUER, PROC. NATL. ACAD. SCI. USA, vol. 86, 1989, pages 2152 - 6
LOWMAN ET AL., BIOCHEM., vol. 30, 1991, pages 10832 - 7
DERBYSHIRE ET AL., GENE, vol. 46, 1986, pages 145
NER ET AL., DNA, vol. 7, 1988, pages 127
SINGLETON ET AL.: "DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY", 1994, JOHN WILEY AND SONS
HALE AND MARHAM: "THE HARPER COLLINS DICTIONARY OF BIOLOGY", 1991, HARPER PERENNIAL
YANG ET AL., J BACTERIOL., vol. 188, no. 6, March 2006 (2006-03-01), pages 2254 - 2261
CZEKANSKA, METHODS IN MOLECULAR BIOLOGY, vol. 740, 2011, pages 27 - 32
HUCH ET AL., REGEN MED., vol. 8, no. 4, July 2013 (2013-07-01), pages 385 - 7
CLAIMS
1. Use of a Photorhabdus Virulence Cassettes (PVC) effector leader sequence, for packaging a payload into a PVC Needle Complex; wherein the payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof; and wherein the leader sequence and the payload form an effector fusion that is distinct from a wild-type PVC effector protein.
2. The use according to claim 1, wherein the leader sequence comprises amino acid residues 1-50 of a PVC effector.
3. The use according to claim 1 or claim 2, wherein the leader sequence comprises an amino acid sequence having at least 60% sequence identity to one or more sequence selected from SEQ ID NO.: 47 - SEQ ID NO.: 92.
4. The use according to any one of the preceding claims, wherein the PVC effector comprises an amino acid sequence of one or more sequence selected from SEQ ID NO.: 1 - SEQ ID NO.: 46.
5. The use according to any one of the preceding claims, wherein the PVC effector comprises a sequence selected from SEQ ID NO: 4, SEQ ID NO: 22, SEQ ID NO: 25, SEQ ID NO: 30, SEQ ID NO: 32 and SEQ ID NO: 46.
6. The use according to any one of the preceding claims, wherein the leader sequence is covalently fused to the payload, preferably at an N-terminus of the payload.
7. A method for manufacturing a PVC Needle Complex comprising a payload, the method comprising:
a. contacting a PVC Needle Complex with an effector fusion comprising a PVC effector leader sequence fused to a payload;
b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and
c. wherein the effector fusion is distinct from a wild-type PVC effector protein.
8. The method according to claim 7, wherein said contacting occurs within a cell, in a cell lysate, or in a purified cell lysate.
9. An in vitro and/or ex vivo method for delivering a payload into a cell, the method comprising:
a. contacting a cell with a PVC Needle Complex comprising an effector fusion;
b. wherein the effector fusion comprises a PVC effector leader sequence fused to a payload;
c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and
d. wherein the effector fusion is distinct from a wild-type PVC effector protein.
10. A method for suppressing a pest, the method comprising:
a. contacting a pest, or a target area comprising a pest, with a PVC Needle Complex comprising an effector fusion;
b. wherein the effector fusion comprises a PVC effector leader sequence fused to a payload;
c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and
d. wherein the effector fusion is distinct from a wild-type PVC effector protein.
11. A PVC Needle Complex, for use in a method of treatment;
a. wherein the PVC Needle Complex comprises an effector fusion which comprises a PVC effector leader sequence fused to a payload;
b. wherein said payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof; and
c. wherein the effector fusion is distinct from a wild-type PVC effector protein.
12. A PVC Needle Complex comprising an effector fusion;
a. wherein said effector fusion comprises a PVC effector leader sequence fused to a payload;
b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and
c. wherein the effector fusion is distinct from a wild-type PVC effector protein.
13. An effector fusion, comprising a PVC effector leader sequence fused to a payload;
a. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and
b. wherein the effector fusion is distinct from a wild-type PVC effector protein.
14. An isolated PVC effector leader sequence.
15. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion, or isolated PVC effector leader sequence according to any one of claims 7-14, wherein the leader sequence comprises amino acid residues 1-50 of a PVC effector.
16. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion or isolated PVC effector leader sequence according to any one of claims 7-15, wherein the leader sequence comprises an amino acid sequence having at least 60% sequence identity to one or more sequence selected from SEQ ID NO.: 47 - SEQ ID NO.: 92.
17. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion or isolated PVC effector leader sequence according to any one of claims 7-16, wherein the PVC effector comprises an amino acid sequence of one or more sequence selected from SEQ ID NO.: 1 - SEQ ID NO.: 46.
18. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion or isolated PVC effector leader sequence according to any one of claims 7-17, wherein the PVC effector comprises a sequence selected from SEQ ID NO: 4, SEQ ID NO: 22, SEQ ID NO: 25, SEQ ID NO: 30, SEQ ID NO: 32 and SEQ ID NO: 46.
19. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion or isolated PVC effector leader sequence according to any one of claims 7-18, wherein the leader sequence is covalently fused to a payload.
20. An isolated nucleic acid comprising a nucleotide sequence which encodes the isolated PVC effector leader sequence of any one of claims 14-19.
21. An expression vector comprising an isolated nucleic acid molecule of claim 20.
22. A host cell comprising an isolated nucleic acid molecule of claim 20, or an expression vector of claim 21.
23. The host cell of claim 22, wherein said host cell is one or more selected from a mammalian cell, an insect cell, a yeast cell, a bacterial cell, and/or a plant cell; preferably wherein said bacterial cell is an E. coli cell.
24. The host cell of claim 22, wherein said host cell is a Photorhabdus cell.
25. The host cell of claim 24, wherein said Photorhabdus cell comprises a Photorhabdus PVC operon operably linked to an inducible promoter.
The present invention relates to a leader sequence, and use of a leader sequence for packaging molecules into protein complexes.
Biological molecules (e.g. peptides, proteins and nucleic acids) have great potential as broadly applicable therapeutics. Indeed, there has been a trend in recent years for the pharmaceutical industry to move away from ‘small molecule’ drugs, toward more complex macromolecular therapeutics (also known as “biologics”). Such biologics include protein-based therapeutics (notably antibodies, hormones, growth factors and cytokines) and nucleic acid-based treatments (such as short-interfering RNAs, DNA/RNA vaccines and gene therapies).
While the biologics market has developed significantly in recent years, the low availability of effective delivery systems (and practicable methods for manufacturing such delivery systems) has limited the diversity of molecular targets of such bio-therapeutics, especially when the target is cytosolic. Indeed, the majority of approved peptide therapeutics on the market act by targeting extracellular components, such as membrane receptors or secreted molecules (e.g. present in the interstitial space). For example, Humira (the most successful therapeutic monoclonal antibody) targets the extracellularly secreted cytokine TNFα. Insulin acts by binding its cognate receptor present on the cell membrane (the same being true of other hormone peptide therapeutics).
Similar problems exist in the agricultural industry, where protein-based pesticides are typically toxins which must target an extracellular component of a cell of a pest. By way of example, Bacillus thuringiensis toxins are commonly used natural pesticides which must bind membrane receptors to exert their toxic effects.
Methods for cytosolic delivery of biological molecules have been developed for laboratory research, which generally involve delivering the molecules within lipid vehicles which fuse with the plasma membrane of a cell, before emptying their payload into the cytosol. However, such methods find limited use in medicine and veterinary practice, e.g. due to the non-specific nature in which they deliver molecules to cells.
Bacterial secretion systems have been explored as potential delivery systems, given their natural ability to secrete (or more particularly ‘inject’) molecules into target cells. The most studied of such secretion systems is the Type III secretion system (T3SS), a “protein appendage” found in several Gram-negative bacteria. However, a significant drawback of these systems is that they remain associated with the bacterial membrane at all times, requiring use of actual bacterial cells (comprising the secretion system) as the delivery system. As such, it is difficult to fully control what molecules are transferred from the bacteria to the target cell (even when the biologic of interest is overexpressed), as these secretion systems function by providing a connection (e.g. channel) between the bacteria’s cytosol and the target cell’s cytosol, through which other components (potentially harmful to the host) may flow. Therefore, there exists a need not only for improved delivery systems, but also for means of producing such systems which are compatible with molecules (payloads) having a range of sizes and molecular properties.
The present invention solves one or more of the above-mentioned problems.
The present invention is predicated on the surprising finding that toxigenic Photorhabdus Virulence Cassettes (PVC) effector proteins of Photorhabdus bacteria comprise a previously unknown “leader sequence” (or “leader peptide”), which functions to package (or “load”) PVC effectors into a so-called PVC Needle Complex (e.g. “nanosyringe”), which subsequently delivers the PVC effector to a target cell where it exerts its toxigenic effect(s) (the PVC effectors representing a payload of such nanosyringes). Moreover, the inventors have found that such leader sequences can be practically utilized to direct a payload linked thereto to be packaged into a PVC Needle Complex (and related/homologous complexes), a well characterized molecular delivery system of Photorhabdus. Thus, the newly discovered leader sequence surprisingly functions to load the PVC Needle Complex with a molecular payload (or “warhead”).
Further to this finding, the inventors have developed an advantageous, practical utility for such leader sequence for packaging/loading ‘heterologous’ payloads (including non-Photorhabdus molecules) into PVC Needle Complexes, independent of the size, molecular properties or provenance of the heterologous payload.
In a first aspect the invention provides use of a Photorhabdus Virulence Cassettes (PVC) effector leader sequence, for packaging a payload into a PVC Needle Complex;
wherein the payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof (preferably a polypeptide); and
wherein the leader sequence and the payload form an effector fusion that is distinct from a wild-type PVC effector protein.
One aspect of the invention provides use of a PVC effector leader sequence, for packaging a payload into a PVC Needle Complex;
wherein the payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof (preferably a polypeptide); and
wherein the leader sequence and the payload form a fusion that is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).
In other words, the invention provides in one aspect a method for packaging a payload into a PVC Needle Complex with a PVC effector leader sequence, comprising contacting an (effector) fusion with a PVC Needle Complex, wherein the payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof (preferably a polypeptide); and wherein the leader sequence and the payload form the (effector) fusion, that is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).
The terms “fusion” and “effector fusion”, in the context of a (effector) fusion formed by the leader sequence and the payload (and that is distinct from a wild-type PVC effector protein), are used interchangeably herein. This use (of the leader sequence) was demonstrated, as outlined in the examples, by expressing an effector fusion (tagged with a detection label) and a PVC Needle Complex in a cell (e.g. host bacterial cell) wherein the effector fusion is packaged into the PVC Needle Complex (via the leader sequence), isolating the PVC Needle Complex, then detecting the presence or absence of the payload within the PVC Needle Complex (e.g. a disrupted version thereof) via Western blot detection of the detection label. The payload is detected only when it is fused to a leader sequence, and not when it lacks a leader sequence.
The term “PVC effector leader sequence” means the leader region (polypeptide region) from a PVC effector polypeptide which is capable of packaging a payload (e.g. effector) into a PVC Needle Complex, and is preferably amino acids 1-50 of a PVC effector, or amino acids 2-50 when omitting the initial methionine. The inventors have demonstrated that the leader sequence is encompassed within (or may consist essentially of) amino acids 1-50 of a multitude of identified PVC effector polypeptide sequences. However, leader sequences having alternative lengths and positioning within a PVC effector are intended to be encompassed (e.g. with the proviso that said leader sequence is capable of packaging a payload into a PVC Needle Complex).
The remaining (non-leader sequence) portion of a PVC effector is referred to as an “effector portion” (e.g. payload) herein. The effector portion preferably comprises or consists essentially of amino acids 51-C terminus of a PVC effector protein.
Thus, in one embodiment, a PVC effector leader sequence is encompassed within amino acids 1-50 or 2-50 (preferably 1-50) of a PVC effector polypeptide.
In one embodiment, a PVC effector leader sequence comprises (or consists essentially of) amino acids 1-50 or 2-50 (preferably 1-50) of a PVC effector polypeptide.
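The residue numbering used in these definitions can be illustrated with a minimal Python sketch. It assumes that a PVC effector is handled as a one-letter amino acid string; the function name `split_pvc_effector`, the `omit_initial_met` flag and the toy sequence are hypothetical and are not taken from the sequence listing. Residue numbers in the text are 1-based and inclusive, so residues 1-50 correspond to the Python slice `[0:50]`.

```python
from typing import NamedTuple


class EffectorParts(NamedTuple):
    leader: str    # preferred leader region (residues 1-50, or 2-50)
    effector: str  # effector portion (residue 51 to the C-terminus)


def split_pvc_effector(sequence: str, leader_end: int = 50,
                       omit_initial_met: bool = False) -> EffectorParts:
    """Split an effector sequence using 1-based, inclusive residue numbers."""
    if len(sequence) <= leader_end:
        raise ValueError("sequence is too short to contain an effector portion")
    if omit_initial_met and sequence[0] != "M":
        raise ValueError("sequence does not start with methionine")
    start = 1 if omit_initial_met else 0  # residues 2-50 versus 1-50
    return EffectorParts(leader=sequence[start:leader_end],
                         effector=sequence[leader_end:])


# Toy 60-residue sequence, for illustration only (not a real PVC effector).
toy_effector = "M" + "A" * 49 + "G" * 10
parts = split_pvc_effector(toy_effector)
parts_no_met = split_pvc_effector(toy_effector, omit_initial_met=True)
print(len(parts.leader), len(parts.effector), len(parts_no_met.leader))
```

The sketch covers only the preferred positions; leader sequences of other lengths or positions, which the description also encompasses, would need a different `leader_end` or a different slicing rule.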
The term “wild-type PVC effector protein” is used synonymously with the term “endogenous PVC effector protein”, or simply “PVC effector protein”, and refers to an (e.g. intact) PVC effector sequence having an endogenous leader sequence (i.e. endogenous to the given PVC effector, preferably amino acids 1-50 of the PVC effector) associated with the effector portion (e.g. the payload, preferably amino acids 51-C terminus of a PVC effector protein). Examples of wild-type PVC effectors may comprise (or consist essentially of) an amino acid sequence of one or more sequence selected from SEQ ID NO.: 1 - SEQ ID NO.: 46. The fusion/effector fusion of the invention described herein is thus distinct from a PVC effector protein (e.g. wild-type PVC effector protein), as the leader sequence is not fused to the effector portion to which it is fused in a wild-type PVC effector protein. By way of example, the fusion/effector fusion may comprise the leader sequence of the “Pnf” PVC effector protein (e.g. the leader of SEQ ID NO.: 78) fused to the effector portion of the hvnA (gene Plu1649) PVC effector protein (e.g. amino acids 51-295 of SEQ ID NO.: 46), but is not intended to refer to the leader sequence of the “Pnf” PVC effector protein (e.g. the leader of SEQ ID NO.: 78) fused to the effector portion of the Pnf PVC effector protein (e.g. amino acids 51-340 of SEQ ID NO.: 32). On the other hand, the fusion/effector fusion may comprise the leader sequence of, e.g., the “Pnf” PVC effector protein (e.g. the leader of SEQ ID NO.: 78) fused to a non-effector portion, for example a non-Photorhabdus protein such as Cre recombinase. Thus, the leader sequence finds utility in packaging a range of e.g. heterologous (non-wild-type) agents into a PVC Needle Complex, opening the possibility to use the PVC Needle Complex as a modular, diverse delivery system for delivering not only natural effectors, but also ‘unnatural’ payloads to a cell for the first time.
As such, it is possible to manufacture a PVC Needle Complex having a payload of choice.
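The distinction between an effector fusion and a wild-type PVC effector can be sketched as follows, under the simplifying assumptions that sequences are plain one-letter strings and that the leader is residues 1-50. The names `make_effector_fusion` and `is_distinct_from_wild_type`, and all sequences shown, are hypothetical placeholders rather than entries from the sequence listing, and exact string comparison stands in for whatever comparison would be used in practice.

```python
from typing import Iterable

LEADER_LENGTH = 50  # preferred leader region: residues 1-50


def make_effector_fusion(leader_source: str, payload: str) -> str:
    """Fuse the leader of one effector to the N-terminus of a payload."""
    return leader_source[:LEADER_LENGTH] + payload


def is_distinct_from_wild_type(fusion: str,
                               wild_type_effectors: Iterable[str]) -> bool:
    """True when the fusion is not identical to any wild-type effector."""
    return fusion not in set(wild_type_effectors)


# Toy placeholder sequences, for illustration only.
effector_a = "M" + "K" * 49 + "D" * 20   # leader A + effector portion A
effector_b = "M" + "L" * 49 + "E" * 30   # leader B + effector portion B
heterologous_payload = "S" * 25          # stands in for a non-Photorhabdus protein
wild_types = [effector_a, effector_b]

# Leader A on effector portion B: a leader/effector swap.
swapped = make_effector_fusion(effector_a, effector_b[LEADER_LENGTH:])
# Leader A on its own effector portion: simply the wild-type protein again.
self_fused = make_effector_fusion(effector_a, effector_a[LEADER_LENGTH:])
# Leader A on a heterologous payload.
heterologous = make_effector_fusion(effector_a, heterologous_payload)

print(is_distinct_from_wild_type(swapped, wild_types))       # True
print(is_distinct_from_wild_type(self_fused, wild_types))    # False
print(is_distinct_from_wild_type(heterologous, wild_types))  # True
```

The three cases mirror the examples in the text: a leader joined to another effector's effector portion, a leader joined to its own effector portion (excluded, since it reproduces the wild-type protein), and a leader joined to a non-effector payload.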
Another aspect of the invention provides a method for manufacturing a PVC Needle Complex comprising a payload (e.g. in other words, a method for manufacturing a packaged PVC Needle Complex), the method comprising:
a. contacting (e.g. within a host cell) a PVC Needle Complex with an effector fusion comprising a PVC effector leader sequence fused to a payload;
b. wherein the payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
c. wherein the effector fusion is distinct from a wild-type PVC effector protein.
An aspect of the invention provides a method for manufacturing a PVC Needle Complex comprising a payload (e.g. in other words, a method for manufacturing a packaged PVC Needle Complex), the method comprising:
a. contacting (e.g. within a host cell) a PVC Needle Complex with a fusion, the fusion comprising a PVC effector leader sequence fused to a payload, wherein the leader sequence and the payload form a fusion that is distinct from a PVC effector protein (e.g. wild-type PVC effector protein); and
b. wherein the payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide).
In one embodiment, said contacting may occur within a cell (e.g. bacterial host cell), in a cell lysate, or in a purified cell lysate (preferably within a cell). In one embodiment, said contacting may occur within a cell-free expression system. Similarly, a use described herein may comprise a contacting step (between the fusion/effector fusion and PVC Needle Complex) occurring within a cell (e.g. bacterial host cell), in a cell lysate, cell-free expression system, or in a purified cell lysate (preferably within a cell, more preferably a bacterial host cell).
A cassette (operon) encoding the PVC Needle Complex may be operably linked to a first promoter, and a gene encoding the fusion/effector fusion (payload) may be operably linked to a second (preferably different) promoter. In one embodiment, said first and/or second promoter is an inducible promoter (e.g. an arabinose inducible promoter such as pBAD, and/or an IPTG inducible promoter). Thus, the invention embraces an expression system wherein an operon encoding the PVC is present within a first vector/plasmid (optionally operably linked to a first promoter), and the sequence encoding the effector fusion (leader sequence fused to payload) is present within a second (preferably different) plasmid (optionally linked to a second promoter). In one embodiment, the PVC Needle Complex and/or (preferably and) effector fusion may be expressed in one or more host selected from a bacterial cell, a yeast cell, an insect cell and/or a mammalian cell. In a preferable embodiment, the PVC Needle Complex and effector fusion may be expressed together in a host cell selected from a bacterial cell, a yeast cell, an insect cell and a mammalian cell (preferably a bacterial cell). Suitable mammalian cells include a HEK293 cell and/or a CHO cell.
The PVC Needle Complex and/or (preferably and) the effector fusion (payload) may be expressed in a heterologous bacterial expression system (preferably E. coli). In one embodiment, the PVC Needle Complex and/or (preferably and) the PVC effector may be expressed in a Photorhabdus cell, optionally wherein the PVC operon of the Photorhabdus cell is endogenous to the cell (and optionally wherein the PVC operon is operably linked to an inducible promoter which may be incorporated into the genome to be operably linked to the PVC operon via genetic engineering). For example, an inducible promoter may be introduced into the genome of a Photorhabdus cell 5’ to a PVC (operon), preferably by recombineering as described in the examples (e.g. Example 3).
The payload may be, for example, a therapeutic payload, such that a PVC Needle Complex finds utility in medical treatment.
In a further aspect, the invention provides a (packaged) PVC Needle Complex, for use in a method of treatment;
a. wherein the PVC Needle Complex comprises (e.g. is packaged with) an effector fusion which comprises (or consists essentially of) a PVC effector leader sequence fused to a payload;
b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
c. wherein the effector fusion is distinct from a wild-type PVC effector protein.
A further aspect of the invention provides a (packaged) PVC Needle Complex, for use in a method of treatment;
a. wherein the PVC Needle Complex holds (e.g. is packaged with) a fusion which comprises (or consists essentially of) a PVC effector leader sequence fused to a payload;
b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
c. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).
In one aspect, the invention provides a method of treating a subject, the method comprising administering a (packaged) PVC Needle Complex to a subject (e.g. a patient);
a. wherein the PVC Needle Complex comprises (e.g. is packaged with) an effector fusion which comprises (or consists essentially of) a PVC effector leader sequence fused to a payload;
b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
c. wherein the effector fusion is distinct from a wild-type PVC effector protein.
In other words, an aspect of the invention provides a method of treating a subject, the method comprising administering a (packaged) PVC Needle Complex to a subject (e.g. a patient);
a. wherein the PVC Needle Complex holds (e.g. is packaged with) a fusion which comprises (or consists essentially of) a PVC effector leader sequence fused to a payload;
b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
c. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).
In a preferable embodiment, the payload is a polypeptide.
The subject may be a mammalian subject, preferably a human subject.
The terms “PVC Needle Complex holds an effector fusion” and “PVC Needle Complex comprising an effector fusion” mean a PVC Needle Complex having a packaged effector fusion, or in other words, a PVC Needle Complex that is packaged with an effector fusion.
The terms “packaged effector fusion”, “fusion” and “effector fusion” (e.g. wherein the fusion/effector fusion is distinct from a wild-type PVC effector protein) embrace a combination of a PVC effector leader sequence and a payload which remain in contact (e.g. fused) subsequent to packaging into a PVC Needle Complex (e.g. the leader sequence has not been cleaved off the payload), as well as a combination of a PVC effector leader sequence and a payload which are no longer in direct contact (e.g. no longer fused, such as following cleavage of the leader sequence from the payload).
The term “treat” or “treating” as used herein encompasses prophylactic treatment (e.g. to prevent onset of a disease) as well as corrective treatment (treatment of a subject already suffering from a disease). Preferably “treat” or “treating” as used herein means corrective treatment. The term “treat” or “treating” encompasses treating both the disease and a symptom thereof. In some embodiments “treat” or “treating” refers to treating a symptom of a disease.
Therefore, a PVC Needle Complex may be administered to a subject in a therapeutically effective amount or a prophylactically effective amount.
A “therapeutically effective amount” is any amount of the (packaged/laden) PVC Needle Complex, which when administered alone or in combination (e.g. with another therapeutic, administered parallel or in series and acting additively or synergistically) to a subject for treating a disease (or a symptom thereof) is sufficient to effect such treatment of the disease, or symptom thereof.
A “prophylactically effective amount” is any amount of the (packaged/laden) PVC Needle Complex that, when administered alone or in combination (e.g. with another therapeutic, administered parallel or in series and acting additively or synergistically) to a subject inhibits or delays the onset or reoccurrence of a disease (or a symptom thereof). In some embodiments, the prophylactically effective amount prevents the onset or reoccurrence of a disease entirely. “Inhibiting” the onset means either lessening the likelihood of disease onset (or symptom thereof), or preventing the onset entirely.
In a related aspect, there is provided a (packaged) PVC Needle Complex comprising (e.g. that holds/ that is packaged with) an effector fusion;
a. wherein said effector fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said effector fusion is formed by a PVC effector leader sequence and a payload);
b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and
c. wherein the effector fusion is distinct from a wild-type PVC effector protein.
In other words, one aspect of the invention provides a (packaged) PVC Needle Complex that holds (e.g. is packaged with) a fusion;
a. wherein said fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said fusion is formed by a PVC effector leader sequence and a payload);
b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
c. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).
In a preferable embodiment, the (packaged) PVC Needle Complex is an isolated (e.g. non-natural) PVC Needle Complex.
As explained below, the PVC Needle Complex typically functions in nature to deliver toxigenic PVC effectors to insect targets. By expanding greatly the number and variety of payloads which may be packaged into a PVC Needle Complex, the invention concomitantly expands the number and variety of invertebrates (e.g. pests), such as amoeba, nematodes, helminths and insects, which may be targeted and killed.
In a further aspect of the invention, there is provided a method for suppressing a pest, the method comprising:
a. contacting a pest, or a target area comprising a pest, with a (packaged) PVC Needle Complex comprising (e.g. holding/ packaged with) an effector fusion;
b. wherein the effector fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said effector fusion is formed by a PVC effector leader sequence and a payload);
c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
d. wherein the effector fusion is distinct from a wild-type PVC effector protein.
An aspect of the invention provides a method for suppressing a pest, the method comprising:
a. contacting a pest, or a target area comprising a pest, with a (packaged) PVC Needle Complex holding (e.g. packaged with) a fusion;
b. wherein the fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said fusion is formed by a PVC effector leader sequence and a payload);
c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
d. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).
The terms “PVC Needle Complex holds an effector fusion” and “PVC Needle Complex comprising an effector fusion” mean a PVC Needle Complex having a packaged effector fusion.
The term “target area” refers to an area where a pest is present and/or where a pest may be (e.g. is expected to be, or suspected of being) present.
Thus, in one embodiment, a target area may be contacted before, and/or when a pest is present. The target area may be in the vicinity of (e.g. close proximity to) a pest. Alternatively, the target area may be an area that a user wishes to protect from a pest. For example, a target area may comprise a plant and/or plant product.
The term “suppressing a pest” embraces “pest control”, “inhibiting the growth of a pest”, “inhibiting the proliferation of a pest”, and/or “mortality of a pest”.
Examples of such pests include one or more insect(s), mite(s), sowbug(s), pillbug(s), centipede(s), mollusk(s), millipede(s), protist(s), fungus (fungi), helminth(s) and/or bloodborne parasite(s). The pest may be at any stage of development e.g. may be a larva and/or an adult pest (e.g. imago).
The invention may be used to target a variety of agricultural, commercial, home and garden pests.
In one embodiment the pest is an insect, a mite, a sowbug, a pillbug, a centipede, a mollusk and/or a millipede. Suitably the pest may be an insect and/or a mite (preferably insect).
Examples of suitable insects include an insect of the order Lepidoptera, Coleoptera, Diptera, Blattodea, Hymenoptera, Isoptera, Orthoptera, Thysanura, and/or Dermaptera. In one embodiment an insect of the order Lepidoptera may be one or more of a moth and/or a butterfly. Suitable moths include Manduca sexta and/or Galleria mellonella.
In one embodiment an insect of the order Coleoptera may be one or more of a European chafer grub, a northern masked chafer grub, a southern masked chafer grub, a Japanese beetle grub, a June beetle grub, a black vine weevil, a strawberry root weevil, a clay-colored weevil, a Colorado potato beetle, and/or a wireworm. In another embodiment an insect of the order Diptera may be one or more of a leatherjacket (e.g. larvae of a crane fly), an onion maggot, a cabbage maggot, a carrot rust fly maggot, a fungus gnat, and/or a mosquito. In another embodiment an insect of the order Blattodea may be a cockroach, suitably one or more cockroach selected from an American cockroach, and/or a German cockroach. In one embodiment an insect of the order Hymenoptera may be an ant. Suitably, the ant may be one or more of a carpenter ant, an odorous house ant, a pavement ant, an Argentine ant, a Pharaoh ant, a tawny crazy ant, a harvester ant, a red imported fire ant, a Southern fire ant, a European fire ant, and/or a little fire ant. In another embodiment an insect of the order Hymenoptera may be a yellowjacket.
In one embodiment an insect of the order Isoptera may be a termite. Suitably the termite may be one or more of a damp wood termite, a dry wood termite, and/or a subterranean termite. In another embodiment an insect of the order Orthoptera may be one or more of a cricket, a grasshopper, and/or a locust. In one embodiment an insect of the order Thysanura may be a silverfish. In another embodiment an insect of the order Dermaptera may be an earwig.
Examples of suitable mollusks include a slug and/or a snail.
In one embodiment, the pest is a protist. In one embodiment, said protist is one or more selected from Chaos carolinense, Amoeba proteus, Naegleria fowleri, Dictyostelium discoideum, Entamoeba histolytica, Trichomonas vaginalis, Blastocystis hominis, Leishmania Spp., and Giardia lamblia. In one embodiment, said protist is one or more selected from Fonticula alba, Dictyostelium discoideum, Chlamydomonas reinhardtii, Cryptomonas paramecium, Paulinella chromatophora, Nannochloropsis gaditana, and/or Tetrahymena Spp.
In one embodiment, the pest is a fungus. In one embodiment, said fungus is one or more fungus selected from Encephalitozoon cuniculi, Nosema apis, Nosema ceranae, Vittaforma corneae, Enterocytozoon bieneusi, Spraguea lophii, Vavraia culicis, Edhazardia aedis, Nematocida parisii, Rozella Spp., Parasitella parasitica, Lichtheimia ramosa, Sporisorium scitamineum, Trametes versicolor, and/or Punctularia strigosozonata.
In one embodiment, said fungus is a Candida spp. Said Candida spp. may be one or more selected from C. albicans, C. ascalaphidarum, C. amphixiae, C. antarctica, C. argentea, C. atlantica, C. atmosphaerica, C. auris, C. blattae, C. bromeliacearum, C. carpophila, C. carvajalis, C. cerambycidarum, C. chauliodes, C. corydalis, C. dosseyi, C. dubliniensis, C. ergatensis, C. fructus, C. glabrata, C. fermentati, C. guilliermondii, C. haemulonii, C. humilis, C. insectamans, C. insectorum, C. intermedia, C. jeffresii, C. kefyr, C. keroseneae, C. krusei, C. lusitaniae, C. lyxosophila, C. maltosa, C. marina, C. membranifaciens, C. mogii, C. oleophila, C. oregonensis, C. parapsilosis, C. quercitrusa, C. rugosa, C. sake, C. shehatea, C. temnochilae, C. tenuis, C. theae, C. tolerans, C. tropicalis, C. tsuchiyae, C. sinolaborantium, C. sojae, C. subhashii, C. viswanathii, C. utilis, C. ubatubensis, and/or C. zemplinina. Suitably, said Candida spp. may be C. albicans.
In another embodiment, the pest is a helminth. Said helminth may be one or more selected from the phyla Annelida, Platyhelminthes, Nematoda and/or Acanthocephala. In one embodiment, said helminth is a parasitic flatworm. Said parasitic flatworm may be one or more selected from a Cestoda, a Trematoda and/or a Monogenea. In one embodiment, said helminth is a parasitic nematode. Said parasitic nematode may be one or more selected from an ascarid (Ascaris), a filaria, a hookworm, a pinworm (Enterobius), and/or a whipworm (Trichuris trichiura).
In one embodiment, the pest is a bloodborne parasite. Said bloodborne parasite may be one or more selected from Trypanosoma Spp. (e.g. Trypanosoma brucei and/or T. cruzi), Babesia Spp. (e.g. Babesia microti), Leishmania Spp., Plasmodium Spp. (e.g. P. falciparum), and/or Toxoplasma Spp. (e.g. Toxoplasma gondii).
The PVC Needle Complex for pest control is suitably environmentally safe (e.g. an environmentally safe pesticidal composition).
Other advantageous utilities include delivering a payload to a cell, for example, during laboratory research. Such cell may be part of an in vitro cell line, or may be a cell of an animal (e.g. a research animal model). Additionally or alternatively, the cell may be comprised within an ex vivo system, such as an organoid.
Another aspect of the invention provides an in vitro (and/or ex vivo) method for delivering a payload into a cell, the method comprising:
a. contacting a cell with a (packaged) PVC Needle Complex comprising (e.g. holding/ packaged with) an effector fusion;
b. wherein the effector fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said effector fusion is formed by a PVC effector leader sequence and a payload);
c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
d. wherein the effector fusion is distinct from a wild-type PVC effector protein.
An aspect of the invention provides an in vitro (and/or ex vivo) method for delivering a payload into a cell, the method comprising:
a. contacting a cell with a (packaged) PVC Needle Complex holding (e.g. packaged with) a fusion;
b. wherein the fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said fusion is formed by a PVC effector leader sequence and a payload);
c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
d. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).
In one aspect, the invention provides an effector fusion comprising (or consisting essentially of) a PVC effector leader sequence fused to a payload (or in other words, an effector fusion formed by a PVC effector leader sequence and a payload);
a. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and
b. wherein the effector fusion is distinct from a wild-type PVC effector protein.
An aspect of the invention provides a fusion comprising (or consisting essentially of) a PVC effector leader sequence fused to a payload (or in other words, a fusion formed by a PVC effector leader sequence and a payload);
a. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and
b. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).
In one embodiment, the fusion/ effector fusion is an isolated fusion/ effector fusion (e.g. an isolated, non-naturally occurring fusion/ effector fusion).
The present invention embraces a nucleic acid comprising a nucleotide sequence which encodes the fusion/ effector fusion, and/or an expression vector comprising said nucleic acid. Also embraced is a host cell comprising said nucleic acid and/or expression vector.
As discussed above, the present inventors have discovered and practically utilised the leader sequence(s) described herein for the first time.
Thus, another aspect of the invention provides an isolated PVC effector leader sequence (e.g. wherein the isolated PVC effector leader sequence is capable of packaging a payload into a PVC Needle Complex).
In a related aspect there is provided an isolated nucleic acid comprising a nucleotide sequence which encodes a PVC effector leader sequence.
The isolated PVC effector leader sequence may be recombinant, synthetic, and/or purified. The isolated nucleic acid encoding a PVC effector leader sequence may be recombinant, synthetic, and/or purified.
Further details on the background of the invention, and the terminology used herein, are provided below.
Photorhabdus is a bacterial genus of the family Enterobacteriaceae, represented by three formally recognized (to date) species - namely P. luminescens, P. asymbiotica, and P. temperata. Important strains include P. asymbiotica subsp. australis, and P. luminescens subsp. laumondii. Genome sequences are currently available on GenBank (Photorhabdus asymbiotica ATCC43949 complete genome - GenBank Accession Number: FM162591.1; Photorhabdus laumondii subsp. laumondii strain TT01 chromosome, complete genome - GenBank Accession Number: CP024901.1).
Reference to “Photorhabdus luminescens subsp. laumondii” may be used interchangeably with “Photorhabdus luminescens subsp. laumondii TT01”, “Photorhabdus laumondii subsp. laumondii strain TT01” and “P. luminescens TT01” herein.
The genome sequence for a further strain of P. asymbiotica, namely P. asymbiotica Kingscliff, is described in Wilkinson et al. (FEMS Microbiology Letters, Volume 309, Issue 2, August 2010, Pages 136-143), incorporated herein by reference. Further genome sequences are described in Thanwisai et al. (PLoS ONE 7(9): e43835), incorporated herein by reference.
Each of these species comprises at least one operon known as a Photorhabdus Virulence Cassette (PVC) operon, encoding a PVC Needle Complex, which may be referred to as a “nanosyringe” herein. Given that Photorhabdus is typically found in nature as an insecticidal bacterium following regurgitation from a (symbiont) entomopathogenic Heterorhabditis sp. nematode (e.g. in order to avoid competition for food and resources from insects), it is understood that the PVC Needle Complex functions in nature to suppress insects. Indeed, it has been shown that an isolated PVC Needle Complex (holding/ packaged with a natural effector toxin, such as Pnf) can be used to kill insect larvae - see Example 2. The Photorhabdus Virulence Cassettes represent one of at least four well-characterised toxin delivery systems of Photorhabdus. Other major classes of Photorhabdus protein insecticidal toxins include the “Toxin Complexes” (Tcs), the “binary PirAB toxins”, and the “makes caterpillars floppy” (Mcf) toxins.
The term “Photorhabdus Virulence Cassette” (PVC) (used synonymously with the term “PVC operon” herein) means a discrete operon of a Photorhabdus genome comprising genes encoding polypeptide subunits which, when expressed, assemble to provide the macromolecular PVC Needle Complex. The molecular architecture of these cassettes has been well characterized and described, for example in The Molecular Biology of Photorhabdus Bacteria (Springer International Publishing AG 2017, ISBN: 978-3-319-52714-7, Chapter 10, pages 159-177), incorporated herein by reference. A PVC (operon) typically comprises around sixteen genes (pvc1-pvc16) encoding structural proteins which assemble to provide a “PVC Needle Complex”, which are typically followed by one or more genes at the 3’ end which encode PVC effectors having toxic activity (and typically being homologues of typical T3SS-like effectors). A Photorhabdus genome typically comprises a plurality of such cassettes (e.g. at least four), which are often associated with different effector payloads, or even a plurality of effector payloads.
Three classes of PVC structural operons (Classes I, II and III) have been observed in the genomes of Photorhabdus, and members of other genera. PVCs within each class are similar in terms of the number and type of genes encoding structural proteins they contain (see Figure 1(B)). In more detail, Class I PVCs (which may be referred to as a “prototypical PVC” herein) comprise 16 conserved genes (pvc1-16). Class II PVCs lack the pvc13 host cell binding fibres and pvc3, which (without wishing to be bound by theory) the inventors believe may be a minor specialised sheath subunit that attaches pvc13 fibre proteins onto the PVC Needle Complex (nanosyringe). As such, it is believed this class may be “non-specific”, injecting payloads into multiple (potentially any) cell types. Class III is similar to Class I, but has an additional pvc0 gene at the start of the operon (of unknown function) and two additional genes encoded between pvc13 and pvc14 that resemble “invasion” type protein genes. This class is typically seen in the human clinical isolate strains of Photorhabdus - the inventors have shown that optimal transcription of PVC Class III may occur when the strain (harboring the Class III PVC operon) is grown at 37°C and exposed to human serum, suggesting this class may be a mammalian adapted version of a PVC Needle Complex.
An example cassette (PVC) is shown in Figure 1(D), which shows a map of the model “Class I” PVC operon of Photorhabdus asymbiotica ATCC43949 (obtainable from the ATCC, accession number: ATCC 43949), said operon being associated with the downstream effector gene “PAU_03332” (encoding a Pnf protein effector, e.g. SEQ ID NO.: 32). This model operon is referred to as Pa ATCC43949 PVCpnf. This operon comprises sixteen structural genes (pvc1-16), and two genes (3’ end) encoding effectors (in this case the pvc17 / Rhs-like, encoding an Rhs-like effector, and pvc21, encoding a Pnf effector).
Said genes pvc1-16 correspond to genes PAU_03353 to PAU_03338 of the sequence of GenBank accession no. FM162591.1, and are represented by the sequence of SEQ ID NO.: 93.
An example PVC operon (e.g. encoding the structural genes, but not a/the PVC effector) is provided in SEQ ID NO: 93 (which encodes the operon shown schematically in Figure 1(D)), with other examples being SEQ ID NO: 94 and SEQ ID NO: 95. These sequences begin at the ATG start codon of the first structural gene (pvc1) of the PVC cassette / operon, and end at the TAA stop codon of the final structural gene (pvc16).
A PVC Needle Complex from any one of Classes I-III may be used for a variety of applications. However, PVC Needle Complexes of a certain class may be particularly suitable for delivery to a defined cell type. For example, a PVC Needle Complex for delivery of a payload to a mammalian cell may suitably be a member of Class III. A PVC Needle Complex for delivery of a payload to an insect cell (e.g. to an insect) may suitably be a member of Class I (such as P. asymbiotica PVC pnf, encoded by SEQ ID NO.: 93, e.g. as expressed in E. coli from a cosmid clone).
Thus, as will be understood by the skilled person, the term “PVC Needle Complex” (used synonymously with the terms “PVC Needle Complex delivery system” and “nanosyringe” herein) means a macromolecular protein complex comprising polypeptide subunits encoded by a PVC (operon) of a Photorhabdus bacterium. A PVC Needle Complex is assembled in a nanosyringe structure, having a physical structure (superficially) similar to the antibacterial R-type pyocins (see Figure 3). Functional and molecular studies have shown that a PVC Needle Complex becomes packaged (loaded) with a PVC effector protein(s) (i.e. the PVC effector proteins are packaged therein, or thereon), the packaged PVC Needle Complex is released from the bacterium, and then injects the PVC effector into a target cell such that the PVC effector protein may exert toxicity.
The term “PVC Needle Complex” preferably encompasses PVC Needle Complex-like structures/complexes, encoded by operon(s) comprising genes which are homologous to genes of a Photorhabdus PVC operon. PVC-like elements are not restricted to Photorhabdus, and a well characterized homologous operon (to a PVC operon) is present on the pADAP plasmid of the insect pathogenic bacterium Serratia entomophila. Furthermore, an analogous, and (at least partially) homologous, PVC-like ‘injectosome’ Needle Complex system is employed by the bacterium Pseudoalteromonas luteoviolacea (e.g. used to control the metamorphosis of the marine worm Hydroides elegans). Structures exist in other Enterobacteriaceae (such as Yersinia Spp.) which are encoded by operons having homology to a PVC operon, and may be used with a leader sequence described herein. Each of these (PVC-like) structures is embraced by the term “PVC Needle Complex” as used herein. Thus, a PVC Needle Complex is a “nanosyringe” complex, with the polypeptide encoded by the effector gene being packaged (loaded) within, or at the end (tip) of, the PVC Needle Complex, thus representing a “payload” or “warhead” of the PVC Needle Complex. The present inventors have demonstrated that the PVC Needle Complex itself (with the payload still loaded) is freely released (e.g. secreted) from Photorhabdus cells, before interacting with the membrane of a target cell and injecting the payload into the cell’s cytosol. Indeed, the inventors have successfully expressed and loaded PVC Needle Complexes in heterologous expression systems, before isolating/ purifying the PVC Needle Complexes and using them to suppress (e.g. kill) insect larvae (see Example 2). Thus, the PVC Needle Complexes act as long-range protein delivery systems.
In one embodiment, the PVC Needle Complex is encoded by a sequence having at least 75% sequence identity (preferably at least 85% sequence identity; more preferably at least 95% sequence identity) to a sequence selected from SEQ ID NO.: 93, SEQ ID NO.: 94, and SEQ ID NO.: 95 (for example, SEQ ID NO.: 93).
In one embodiment, the PVC Needle Complex is encoded by a sequence selected from SEQ ID NO.: 93, SEQ ID NO.: 94, and SEQ ID NO.: 95 (for example, SEQ ID NO.: 93).
Leader/ signal sequences are typically peptides, often 10-30 amino acids long, present at the N-terminus of the majority of (newly) expressed proteins that are destined for the secretory pathway (e.g. for directing said proteins to a protein-conducting channel on the cell membrane). Many proteins require a signal sequence for Golgi or endoplasmic reticulum entry.
The term “leader sequence” (used interchangeably with the terms “leader peptide”, “signal sequence”, “targeting signal”, “localization signal”, “localization sequence”, and “transit peptide” herein), used in the context of a “PVC effector leader sequence” herein, means a polypeptide sequence which functions to direct the PVC effector into the interior, or the end (tip), of a PVC Needle Complex - as such, the leader sequence functions to package a PVC effector into a PVC Needle Complex. The PVC Needle Complex can subsequently deliver (e.g. inject) the PVC effector into a target cell. The PVC Needle Complex may be an assembled PVC Needle Complex. The term “PVC Needle Complex” may refer to a fragment of a PVC Needle Complex (e.g. wherein the leader sequence contacts said fragment, and optionally the PVC Needle Complex assembles around the leader sequence-payload ‘effector fusion’).
A PVC leader sequence is typically present in the N-terminus (characterized by or encompassed within the first 50 amino acids) of a PVC effector or homologue thereof. However, the invention embraces leader sequences of PVC effectors and PVC effector homologues, which may be found in regions other than the N-terminal region of such PVC effectors/ homologues (e.g. in the C-terminal region).
In one embodiment, the leader sequence comprises (or consists essentially of) amino acid residues 1-50 of a PVC effector (e.g. PVC effector protein). Reference to “amino acid residues 1-50” embraces “amino acid residues 2-50”, wherein the N-terminal methionine is omitted e.g. has been cleaved. The leader sequence may be a fragment of the N-terminal 50 amino acids of a PVC effector (e.g. a fragment comprising or consisting essentially of ≤ 45, ≤ 35, ≤ 25, or ≤ 15 amino acids), with the proviso that the fragment is capable of packaging a payload into a PVC Needle Complex.
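By way of non-limiting illustration, the residue ranges described above can be sketched in Python. The function names and the placeholder sequence below are hypothetical and are not taken from the sequence listing; taking N-terminal truncations as the "fragments" is merely one illustrative choice, and whether any given fragment is capable of packaging a payload would need to be determined experimentally.

```python
def leader_from_effector(effector: str, length: int = 50,
                         drop_initial_met: bool = False) -> str:
    """Return residues 1..length of an effector (or 2..length if the
    N-terminal methionine is to be omitted)."""
    seq = effector.strip().upper()
    if len(seq) < length:
        raise ValueError("effector is shorter than the requested leader length")
    leader = seq[:length]              # residues 1..length (1-based numbering)
    if drop_initial_met and leader.startswith("M"):
        leader = leader[1:]            # residues 2..length
    return leader


def n_terminal_fragments(leader: str, lengths=(45, 35, 25, 15)) -> dict:
    """Illustrative assumption: fragments are N-terminal truncations."""
    return {n: leader[:n] for n in lengths if n <= len(leader)}


# Placeholder sequence for demonstration only (not a real PVC effector).
toy_effector = "M" + "ACDEFGHIKL" * 6

full_leader = leader_from_effector(toy_effector)
met_less_leader = leader_from_effector(toy_effector, drop_initial_met=True)
print(len(full_leader), len(met_less_leader))
print(sorted(n_terminal_fragments(full_leader)))
```

Slicing with 0-based Python indices `[:50]` corresponds to the 1-based residue range 1-50 used in the text.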
In one embodiment, a leader sequence (e.g. isolated leader sequence) of the invention comprises (or consists essentially of) an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 100% sequence identity to one or more sequence selected from SEQ ID NO.: 47 - SEQ ID NO.: 92 (preferably SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71, SEQ ID NO.: 76, SEQ ID NO.: 78, or SEQ ID NO.: 92) - e.g. with the proviso that the leader sequence is capable of packaging a payload into a PVC Needle Complex. In a preferable embodiment, a leader sequence comprises (or consists essentially of) an amino acid sequence having at least 60% sequence identity to one or more sequence selected from SEQ ID NO.: 47 - SEQ ID NO.: 92 (preferably SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71, SEQ ID NO.: 76, SEQ ID NO.: 78, or SEQ ID NO.: 92) - e.g. with the proviso that the leader sequence is capable of packaging a payload into a PVC Needle Complex. In a more preferable embodiment, a leader sequence comprises (or consists essentially of) an amino acid sequence of one or more selected from SEQ ID NO.: 47 - SEQ ID NO.: 92 (preferably SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71, SEQ ID NO.: 76, SEQ ID NO.: 78, or SEQ ID NO.: 92). In one embodiment, a leader sequence comprises (or consists essentially of) an amino acid sequence selected from SEQ ID NO.: 47 - SEQ ID NO.: 92 (preferably SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71, SEQ ID NO.: 76, SEQ ID NO.: 78, or SEQ ID NO.: 92).
In one embodiment, a leader sequence comprises (or consists essentially of) an amino acid sequence selected from SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71, SEQ ID NO.: 76, SEQ ID NO.: 78, and SEQ ID NO.: 92.
In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 50. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 68. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 71. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 76. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 78. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 92.
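As a hedged illustration of how a percentage sequence identity threshold of the kind recited above might be checked, the following Python sketch computes identity over two sequences that have already been aligned (e.g. by an alignment program). The function name, the gap character and the choice of denominator (all alignment columns other than gap-gap columns) are assumptions made for this example only; other conventions for calculating identity exist and may give different values.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumed convention: identical non-gap columns divided by the number of
    alignment columns, ignoring columns where both sequences have a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("alignment contains no comparable columns")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Toy aligned strings for demonstration only (not from the sequence listing).
identity = percent_identity("AC-EKR", "ACDEKK")
print(round(identity, 1), identity >= 60.0)
```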
Without wishing to be bound by theory, it is believed that the leader sequences share a “chemical composition consensus”, based on amino acid properties. More particularly, the leader sequences comprise similar charge patterns, the pattern comprising 2x negatively charged regions, each followed by a positively charged region (e.g. [-ve] [+ve] [-ve] [+ve]) - see Figure 9. This is consistent with leader sequences of toxins of the type 2 secretion system, which comprise a charge / property pattern of [+ve] [Hydrophobic] [+ve] [C]. A further theory posits that the leader sequences share a typical “helix-turn-helix” structure. Another theory is that the leader sequences form a structure recognised by an ATPase enzyme (e.g. encoded by the gene PAU_03339 (pvc15) in the model operon of Figure 1(D)) present in the interior, or at the end (e.g. tip), of a PVC Needle Complex.
The term “PVC effector” (used synonymously with the term “PVC operon-encoded effector”, and “PVC effector protein”) means an effector polypeptide encoded by a Photorhabdus PVC operon, more particularly (and typically) found shortly downstream (3’) of the structural genes of said operon (preferably shortly or immediately downstream of pvc16, and typically within 5 kb). The term “PVC effector” preferably embraces homologues thereof. Thus, the leader sequence may also be from a polypeptide encoded by a gene which is a homologue of a gene encoding a PVC effector - see Table 1 for examples of such homologues. Indeed, identification of PVC effectors is aided by detecting homology of a gene downstream of pvc16 with a known toxin polypeptide (e.g. a gene which encodes said toxin polypeptide). As will be understood by the skilled person, the term “homologue” preferably means a gene that descended from the same ancestral gene, and shares similar function - such gene (or polypeptide encoded thereby) is homologous to a gene encoding the PVC effector.
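The charge-pattern hypothesis described above ([-ve] [+ve] [-ve] [+ve]) can be explored with a simple sketch such as the following. It is offered only as a crude illustration: the residue classes (D/E as negative, K/R as positive, histidine treated as neutral), the way "regions" are approximated by merging runs of like-charged residues, and the toy peptide are all assumptions of this example rather than the analysis underlying Figure 9.

```python
from itertools import groupby

NEGATIVE = set("DE")   # aspartate, glutamate
POSITIVE = set("KR")   # lysine, arginine (histidine treated as neutral here)


def residue_charges(seq: str) -> str:
    """Map each residue to '-', '+' or '0' (neutral)."""
    return "".join(
        "-" if r in NEGATIVE else "+" if r in POSITIVE else "0"
        for r in seq.upper()
    )


def charge_runs(seq: str) -> list:
    """Ordered charge signs after dropping neutral residues and merging
    consecutive residues of the same sign into one 'region'."""
    signs = [c for c in residue_charges(seq) if c != "0"]
    return [sign for sign, _ in groupby(signs)]


# Toy peptide for demonstration only (not a sequence from the listing).
toy_leader = "DDAEKKGRDEAAKR"
print(residue_charges(toy_leader))
print(charge_runs(toy_leader) == ["-", "+", "-", "+"])
```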
A homologue may be from the genome of a Photorhabdus species or from a species other than a Photorhabdus species. Examples of suitable homologues are outlined in Table 1.
The present inventors have elucidated and characterised, in detail, genes that encode PVC effectors of these PVC Needle Complexes in the three most common (best characterised) strains of Photorhabdus, as well as the P. asymbiotica Thai strain PB68.1. This was conducted based on analysing proximity of genetic linkage to the 3’ end of the PVC structural genes of the operons, and predicted function of the protein sequence of the effector (e.g. a homologue of a known effector/ toxin protein). In more detail, the PVC effectors (e.g. genes encoding the PVC effectors) were typically identified as open reading frames (ORFs) having homology to genes encoding known toxin polypeptides (e.g. homologues as outlined in Table 1), and being typically present within a distance of 1 kilobase (kb) to 5 kb (e.g. within 1 kb) downstream of the final structural gene of a PVC operon (e.g. pvc16) (typically with few or no intervening genes). Typically, there are no “non-toxin-like” ORFs between the end of the operon (encoding the PVC Needle Complex) and the PVC effector gene(s). Although there may be (e.g. one, or two) other small predicted genes present in these regions, these other genes are not assigned as PVC effectors (due to lack of homology to a known effector/ toxin gene, as described above).
In order to assign a putative PVC effector gene (e.g. an ORF within a distance of 5 kb, for example within 1 kb, downstream of the final structural gene of a PVC operon) as a PVC effector gene, the inventors used a combination of BlastP and HHPRED (https://toolkit.tuebingen.mpg.de/#/tools/hhpred). Putative PVC effector genes were assigned as PVC effector genes based on direct homology to known toxin encoding genes, similarity to a toxin protein family, proximity to the PVC operon (e.g. within 1-5 kb downstream of the final structural gene of a PVC operon, pvc16) and/or based on domain similarities of predicted secondary structures to that of known toxins.
Thus, a PVC effector (gene) may be identified (within a Photorhabdus genome) by (i) identifying pvc16 (e.g. via sequence homology to a known pvc16), (ii) identifying an ORF 3’ to pvc16 (preferably ≤ 5 kb downstream of pvc16), and (iii) confirming said ORF encodes a PVC effector through identification of sequence homology to a known gene encoding a toxin polypeptide (for example, a toxin protein described in the column of Table 1 labelled “Homologue(s)”). By way of example, the PVC effector gene PAU_03337 (referred to herein as “sepC” due to homology to virulent sep genes) is positioned 325 base pairs (bp) downstream of pvc16 (PAU_03338) of the PVC operon referred to herein as PVCpnf (e.g. of SEQ ID NO. 93), which is found in P. asymbiotica ATCC43949. That is, the start codon of PAU_03337 begins 325 bp downstream of the end of the stop codon of PAU_03338.
This can be illustrated by reference to the P. asymbiotica ATCC43949 complete genome, accessible via GenBank accession no. FM162591.1 (see also e.g. Wilkinson et al., BMC Genomics volume 10, article number: 302 (2009), incorporated herein by reference), in which effector gene PAU_03337 is annotated as being positioned in the genome as follows: complement (3913237..3914247) - that is, at nucleotide positions 3913237..3914247; and PAU_03338 is annotated as being positioned in the genome as follows: complement (3914573..3915454). No other ORF (encoding an effector or otherwise) is found between these two genes.
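The 325 bp figure can be reproduced from the annotated coordinates given above. The following Python sketch is illustrative only: the function name is hypothetical, and it assumes GenBank-style 1-based, inclusive coordinates and counts the bases lying strictly between the two genes. Both genes are annotated on the complement strand, so the gene "downstream" of pvc16 has the lower coordinates.

```python
def intergenic_gap(gene_a: tuple, gene_b: tuple) -> int:
    """Number of bases strictly between two non-overlapping genes.

    Each gene is a (start, end) pair of 1-based inclusive coordinates,
    given in either order of genes.
    """
    (first_start, first_end), (second_start, second_end) = sorted([gene_a, gene_b])
    if second_start <= first_end:
        raise ValueError("genes overlap; no intergenic gap")
    return second_start - first_end - 1


# Coordinates as annotated in the text (complement strand).
pau_03337 = (3913237, 3914247)   # effector gene ("sepC")
pau_03338 = (3914573, 3915454)   # pvc16

print(intergenic_gap(pau_03338, pau_03337))  # 325
```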
A further PVC effector gene associated with the PVC operon referred to herein as PVCpnf (e.g. of SEQ ID NO. 93), namely PAU_03332 (referred to herein as “pnf”), is positioned 3535 bp downstream of pvc16 (PAU_03338).
The PVC effector gene PAU_02095 (referred to herein as “Rhs-like toxin effector” due to homology to virulent Rhs toxin genes) is positioned 3961 bp downstream of pvc16 (PAU_02099) of a PVC operon referred to herein as PVClopT (e.g. of SEQ ID NO. 94), which is found in P. asymbiotica ATCC43949. That is, the start codon of PAU_02095 begins 3961 bp downstream of the end of the stop codon of PAU_02099.
In a further example, the PVC effector gene PAU_02009 (referred to as “cif” herein due to predicted function as a cell cycle inhibiting factor/ ATP/GTP binding protein) is positioned 157 bp downstream of pvc16 (PAU_02008) of the associated PVC operon, referred to herein as PVCcif, found in P. asymbiotica ATCC43949.
In yet further examples: with regard to a PVC operon of P. luminescens TT01 referred to as a PVC unit4 operon herein, PVC effector gene “pvc17” (e.g. “plu1651”) is positioned 104 bp downstream of pvc16 (gene “plu1655”); and with regard to a PVC operon of Photorhabdus temperata subsp. temperata Meg1 referred to as a PVCcif operon herein, PVC effector gene “CIF toxin effector” (e.g. MEG1 DRAFT_03529) is positioned 4216 bp downstream of the relevant pvc16 gene.
These examples illustrate that a gene encoding a PVC effector is typically positioned within a distance of ≤ 5 kb downstream of the final gene of a PVC operon (e.g. of pvc16), more typically within a distance of ≤ 1 kb downstream of the final gene of a PVC operon.
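For convenience, the distances recited in the examples above can be tabulated against the 1 kb and 5 kb thresholds. The following Python sketch is illustrative only; the dictionary labels and the function name are shorthand chosen for this example, and 1 kb is taken as 1000 bp.

```python
# Distances (bp) downstream of pvc16, as recited in the examples above.
distances_bp = {
    "PAU_03337 (sepC)": 325,
    "PAU_03332 (pnf)": 3535,
    "PAU_02095 (Rhs-like toxin effector)": 3961,
    "PAU_02009 (cif)": 157,
    "pvc17 (PVC unit4, P. luminescens TT01)": 104,
    "CIF toxin effector (P. temperata Meg1)": 4216,
}


def proximity_class(distance_bp: int, near_bp: int = 1000, far_bp: int = 5000) -> str:
    """Classify a distance against the 1 kb and 5 kb thresholds."""
    if distance_bp <= near_bp:
        return "within 1 kb"
    if distance_bp <= far_bp:
        return "within 5 kb"
    return "beyond 5 kb"


for effector, distance in sorted(distances_bp.items(), key=lambda item: item[1]):
    print(f"{effector}: {distance} bp ({proximity_class(distance)})")
```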
In summary, there exist 46 PVC effectors that have been identified in these four strains (based on currently available sequence data) (see Table 1). The first 50 amino acids of each of these PVC effectors represent (or encompass) their endogenous leader sequence, and the inventors have demonstrated the leader sequences may be cloned and fused to a variety of payloads to be packaged into a PVC Needle Complex - see Examples 3 and 4. Thus, a PVC effector (as translated) comprises at least two principal domains: the leader sequence (amino acids 1 to 50) and the actual effector polypeptide (amino acids 51 to C-terminal amino acid) - the latter of which may be referred to as the “effector” (e.g. “effector portion”) or “payload” herein. Although the Photorhabdus genome sequence(s) continues to be revised, this consolidated list of PVC effector genes represents a comprehensive description of such effectors, and is based on currently available sequence data of the most common (best characterised) Photorhabdus strains, and provides the skilled person with an understanding of the term “PVC effector” as well as the sequences of these PVC effectors (as well as how to search/mine for further PVC effectors, e.g. in alternative (genome) sequences). As described above, the inventors have found that the PVC effector proteins comprise a leader sequence which is necessary (and sufficient) for directing the PVC effector protein (e.g. payload) to be packaged/ loaded into a PVC Needle Complex.
Table 1
The accession numbers provided in Table 1 are provided for exemplary purposes, providing example amino acid sequences of (or having high similarity to) PVC effectors described herein. The sequences of said accession numbers may be accessed through GenBank (https://www.ncbi.nlm.nih.gov/genbank/).
The locus tag (beginning with “PAU” or “Plu”) corresponds to the locus tag assigned to the effector in genome sequences available through GenBank above. Locus tags beginning with “PAT” (referring to strain P. asymbiotica Thai strain PB68.1) and “PAK” (referring to strain P. asymbiotica Kingscliff) have been assigned by the present inventors upon identification of the PVC effector genes within the genomes of said strains (in a consistent manner with the locus tags of publicly available sequences). These locus tags may be used herein to refer to the corresponding PVC effector polypeptide.
In one embodiment, the PVC effector is encoded by one or more gene (with the SEQ ID NO. of the encoded PVC effector protein in parentheses) selected from PAK_1985 (SEQ ID NO: 1), PAK_1987 (SEQ ID NO: 2), PAK_1988 (SEQ ID NO: 3), PAK_2075 (SEQ ID NO: 4), PAK_2077 (SEQ ID NO: 5), PAK_2892 (SEQ ID NO: 6), PAK_2893 (SEQ ID NO: 7), PAK_2894 (SEQ ID NO: 8), PAK_3525 (SEQ ID NO: 9), PAT_00148 (SEQ ID NO: 10), PAT_00149 (SEQ ID NO: 11), PAT_00150 (SEQ ID NO: 12), PAT_00152 (SEQ ID NO: 13), PAT_02308 (SEQ ID NO: 14), PAT_02309 (SEQ ID NO: 15), PAT_02310 (SEQ ID NO: 16), PAT_02956 (SEQ ID NO: 17), PAT_02957 (SEQ ID NO: 18), PAT_03171 (SEQ ID NO: 19), PAT_03172 (SEQ ID NO: 20), PAT_03177 (SEQ ID NO: 21), PAU_02009 (SEQ ID NO: 22), PAU_02010 (SEQ ID NO: 23), PAU_02095 (SEQ ID NO: 24), PAU_02096 (SEQ ID NO: 25), PAU_02097 (SEQ ID NO: 26), PAU_02098 (SEQ ID NO: 27), PAU_02230 (SEQ ID NO: 28), PAU_02805 (SEQ ID NO: 29), PAU_02806 (SEQ ID NO: 30), PAU_02807 (SEQ ID NO: 31), PAU_03332 (SEQ ID NO: 32), PAU_03337 (SEQ ID NO: 33), Plu1651 (SEQ ID NO: 34), Plu1671 (SEQ ID NO: 35), Plu1672 (SEQ ID NO: 36), Plu1690 (SEQ ID NO: 37), Plu1691 (SEQ ID NO: 38), Plu1712 (SEQ ID NO: 39), Plu1713 (SEQ ID NO: 40), Plu1714 (SEQ ID NO: 41), Plu2400 (SEQ ID NO: 42), Plu2401 (SEQ ID NO: 43), Plu2514 (SEQ ID NO: 44), Plu2515 (SEQ ID NO: 45), Plu1649 (SEQ ID NO: 46), or a combination thereof.
In one embodiment, the PVC effector is encoded by one or more gene (with the SEQ ID NO. of the encoded PVC effector protein in parentheses) selected from PAU_02009 (SEQ ID NO: 22), PAU_02010 (SEQ ID NO: 23), PAU_02095 (SEQ ID NO: 24), PAU_02096 (SEQ ID NO: 25), PAU_02097 (SEQ ID NO: 26), PAU_02098 (SEQ ID NO: 27), PAU_02230 (SEQ ID NO: 28), PAU_02805 (SEQ ID NO: 29), PAU_02806 (SEQ ID NO: 30), PAU_02807 (SEQ ID NO: 31), PAU_03332 (SEQ ID NO: 32), PAU_03337 (SEQ ID NO: 33), Plu1651 (SEQ ID NO: 34), Plu1671 (SEQ ID NO: 35), Plu1672 (SEQ ID NO: 36), Plu1690 (SEQ ID NO: 37), Plu1691 (SEQ ID NO: 38), Plu1712 (SEQ ID NO: 39), Plu1713 (SEQ ID NO: 40), Plu1714 (SEQ ID NO: 41), Plu2400 (SEQ ID NO: 42), Plu2401 (SEQ ID NO: 43), Plu2514 (SEQ ID NO: 44), Plu2515 (SEQ ID NO: 45), Plu1649 (SEQ ID NO: 46), or a combination thereof. These gene names correspond to the ‘locus tags’ of PVC effector genes in the Photorhabdus genome sequences accessible via GenBank, as described above. The PAT and PAK locus tags were generated by the present inventors, such that terminology is consistent with the PAU and Plu locus tags of publicly available genome sequences.
Thus, the PVC effector may be encoded by one or more gene listed above.
In one embodiment, the PVC effector is encoded by one or more gene (with the SEQ ID NO. of the encoded PVC effector in parentheses) selected from PAK_02075 (SEQ ID NO: 4), PAU_02009 (SEQ ID NO: 22), PAU_02096 (SEQ ID NO: 25), PAU_02806 (SEQ ID NO: 30), PAU_03332 (SEQ ID NO: 32), Plu1651 (SEQ ID NO: 34), Plu1649 (SEQ ID NO: 46), or a combination thereof.
In a preferable embodiment, the PVC effector is encoded by one or more gene (with the SEQ ID NO. of the encoded PVC effector in parentheses) selected from PAU_02806 (SEQ ID NO: 30), PAU_03332 (SEQ ID NO: 32), Plu1651 (SEQ ID NO: 34), Plu1649 (SEQ ID NO: 46), or a combination thereof. The PVC effector may have a sequence having at least 80% sequence identity (preferably at least 90% sequence identity; more preferably 100% sequence identity) to an amino acid sequence selected from SEQ ID NO: 1 - SEQ ID NO: 46. For example, the PVC effector may have a sequence having at least 80% sequence identity (preferably at least 90% sequence identity; more preferably 100% sequence identity) to an amino acid sequence selected from SEQ ID NO: 22 - SEQ ID NO: 46.
The present inventors have identified the leader sequences of the gogB1 (PAU_02806) and Pnf (PAU_03332) PVC effectors as being particularly efficient at packaging a (fused) payload into a PVC Needle Complex. In one embodiment, the PVC effector is encoded by PAU_02806 (e.g. has an amino acid sequence of SEQ ID NO: 30). In one embodiment, the PVC effector is encoded by PAU_03332 (e.g. has an amino acid sequence of SEQ ID NO: 32).
In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of one or more selected from SEQ ID NO: 1 - SEQ ID NO: 46 (for example SEQ ID NO: 22 - SEQ ID NO: 46), or a combination thereof. For example, the PVC effector may comprise (or consist essentially of) a sequence selected from SEQ ID NO: 4, SEQ ID NO: 22, SEQ ID NO: 25, SEQ ID NO: 30, SEQ ID NO: 32 and SEQ ID NO: 46.
In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 4. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO. 22. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO. 25. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO: 30. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO: 32. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO. 46.
The term “packaging” (used synonymously with the terms “trans-packaging” and “loading”) means the directing of a payload, by a leader sequence of the invention (to which the payload is linked/fused), into the interior, or end (tip), of an assembled PVC Needle Complex, such that the PVC Needle Complex is subsequently configured for delivering (e.g. injecting) the payload into a target cell. Thus, the payload may be packaged within a PVC Needle Complex, or may be packaged at the end (or tip) of the PVC Needle Complex (e.g. at least a portion of the payload may be external to the PVC Needle Complex).
The term “payload” (used synonymously with the term “warhead” herein) means a molecule which is packaged into the interior, or end (tip), of an assembled PVC Needle Complex, and subsequently delivered (e.g. injected) into a (target) cell. In wild-type Photorhabdus, the payload is a PVC effector (more particularly, the effector portion of said PVC effector), encoded (as described above) by a gene that is downstream to (3’ to) the structural genes of a PVC operon. For example, see model PVC operon of Figure 1(D), having effector genes PAU_03337 (listed as PVCpnf 17), encoding an adenylate cyclase effector (e.g. SEQ ID NO.: 33); and PAU_03332 (listed as PVCpnf 21), encoding a Pnf effector (e.g. SEQ ID NO.: 32). A leader sequence and a payload of the present invention form an “effector fusion” (or simply “fusion”) that is “distinct from a (e.g. wild-type) PVC effector” (e.g. a polypeptide encoded by one of the genes outlined in Table 1). For example, the effector fusion may be a chimaera, formed of a leader sequence from a first PVC effector fused to (an/the effector portion of) a second (different) PVC effector (preferably amino acids 51 to the C-terminal amino acid of said second PVC effector), wherein said first PVC effector and said second PVC effector are different. The effector fusion may be a chimaera, comprising (or consisting essentially of) a leader sequence described herein fused to a non-PVC effector polypeptide. The effector fusion may be a chimaera, comprising (or consisting essentially of) a leader sequence described herein fused to a non-Photorhabdus polypeptide. The effector fusion may be a leader sequence-nucleic acid fusion (preferably conjugate), comprising a leader sequence described herein fused to a nucleic acid.
An effector fusion is not limited to a fusion complex comprising a leader sequence fused to a toxic payload (e.g. the leader could be fused to a therapeutic payload). Thus, the term “effector” as used in the context of “effector fusion” means the payload which is packaged into the PVC Needle Complex (which could provide a variety of effects, including toxigenic and/or therapeutic effects). Thus, the term “effector fusion” may be used interchangeably with the term “fusion” herein.
The term “effector fusion” may be used synonymously with the term “leader sequence-payload fusion”, and/or “leader sequence-payload complex”.
Alternatively or additionally, the payload may be distinct from a PVC effector protein (e.g. distinct from amino acids 51 to the C-terminal amino acid of a PVC effector). For example, the payload may be a polypeptide or nucleic acid that is not found in a wild-type Photorhabdus bacterium.
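The leader/effector split and the chimaeric effector fusion described above can be sketched as simple string operations on amino acid sequences. This is an illustrative sketch only: the function names are hypothetical, the 50-residue boundary follows the description above, and the sequences used are placeholders rather than any of SEQ ID NO: 1 to SEQ ID NO: 46.

```python
from typing import Tuple

LEADER_LENGTH = 50  # amino acids 1 to 50, as described above


def split_effector(effector: str) -> Tuple[str, str]:
    """Split a translated PVC effector into (leader, effector portion)."""
    if len(effector) <= LEADER_LENGTH:
        raise ValueError("sequence is too short to contain a leader and a payload")
    return effector[:LEADER_LENGTH], effector[LEADER_LENGTH:]


def fuse(leader: str, payload: str, spacer: str = "") -> str:
    """Fuse a leader to a polypeptide payload, directly or via an optional spacer."""
    return leader + spacer + payload


def make_chimaera(first_effector: str, second_effector: str) -> str:
    """Leader of a first effector fused to the effector portion of a second."""
    leader, _ = split_effector(first_effector)
    _, payload = split_effector(second_effector)
    return fuse(leader, payload)


# Placeholder sequences (hypothetical, for illustration only).
effector_a = "M" + "A" * 49 + "K" * 30
effector_b = "M" + "G" * 49 + "W" * 20
chimaera = make_chimaera(effector_a, effector_b)
print(len(chimaera))  # 50 leader residues + 20 payload residues = 70
```

The same `fuse` helper covers the case of a non-PVC effector or non-Photorhabdus polypeptide payload, since it accepts any payload sequence; a nucleic acid conjugate is not modelled here.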
Analysis of the size (e.g. polypeptide length) and structure of the various natural PVC effector payloads encoded by Photorhabdus shows that there exists a wide variety of different PVC effector lengths and structures, demonstrating that the applicability of the PVC Needle Complex delivery system of the present invention is not limited by the size or properties of the payload of interest. To summarise, there is no requirement for particular secondary structure, biophysical property, or length of cargoes, confirming that the PVC Needle Complex can be utilised as a versatile multifunctional delivery vehicle.
The payload may be one or more selected from a polypeptide (e.g. a polypeptide payload), a nucleic acid (e.g. a nucleic acid payload), or a combination thereof. In a preferable embodiment, the payload is a polypeptide.
Examples of polypeptide payloads include an antibody (e.g. an anti-MDM antibody), a nanobody, a peptide vaccine (e.g. a tyrosinase-related protein 2 (TRP2) peptide vaccine), a nuclear factor-κB inhibitor, a T3SS payload (e.g. a T3SS payload which inhibits the NF-κB and/or MAPK pathways), an anti-apoptotic peptide (e.g. BH4), nicotinamide adenine dinucleotide quinone internal oxidoreductase (Ndi1), a PHOX complex subunit, a myotubularin, a nucleic acid (preferably DNA)-modifying enzyme, or a combination thereof. Examples of suitable nucleic acid-modifying enzymes include a recombinase (e.g. Cre recombinase), a transposase, a Cas enzyme (e.g. Cas9), and/or a Mad7 (preferably Mad7, more preferably Cre recombinase). The payload may be, for example, tBid (SEQ ID NO.: 109) and/or BaxBH3 peptide (aa59-73) (SEQ ID NO.: 111).
Any polypeptide having enzymatic activity may be a payload.
A nucleic acid payload may be conjugated/ crosslinked to a leader sequence of the invention. For example, copper-free click chemistry (e.g. strain-promoted alkyne azide cycloaddition (SPAAC)) may be used to crosslink a nucleic acid to a leader sequence. Examples of nucleic acid payloads include a primer, an mRNA, a nucleic acid analogue, an aptamer, a small interfering RNA (siRNA), a microRNA therapeutic inhibitor (antimiR), a microRNA therapeutic mimic (promiR), a long non-coding RNA modulator, a single guide RNA (sgRNA), or a combination thereof.
The leader sequence may be fused directly or indirectly (e.g. by means of a spacer) to the payload. The leader sequence may be fused covalently or non-covalently to the payload. In a preferable embodiment, the leader sequence is covalently fused to the payload. For example, the fusion/ effector fusion may be a (recombinant) fusion protein comprising (or consisting essentially of) a PVC effector leader sequence fused to a (polypeptide) payload.
Another aspect of the invention provides an isolated nucleic acid comprising a nucleotide sequence which encodes a PVC effector leader sequence of the invention. Another aspect of the invention provides an isolated nucleic acid comprising a nucleotide sequence which encodes an effector fusion (e.g. fusion) of the invention, and optionally a nucleotide sequence which encodes a PVC Needle Complex.
Another aspect of the invention provides an expression vector comprising: a nucleic acid (preferably an isolated nucleic acid) comprising a nucleotide sequence which encodes a PVC effector leader sequence of the invention. Another aspect of the invention provides an expression vector comprising: a nucleic acid (preferably an isolated nucleic acid) comprising a nucleotide sequence which encodes an effector fusion (e.g. fusion) of the invention, and optionally a nucleotide sequence which encodes a PVC Needle Complex.
Another aspect of the invention provides a host cell comprising an isolated nucleic acid, the isolated nucleic acid comprising a nucleotide sequence which encodes a PVC effector leader sequence of the invention. Another aspect of the invention provides a host cell comprising an isolated nucleic acid, the isolated nucleic acid comprising a nucleotide sequence which encodes an effector fusion (e.g. fusion) of the invention, and optionally a nucleotide sequence which encodes a PVC Needle Complex.
The term “nucleic acid” may be used synonymously with the term “polynucleotide”.
Another aspect of the invention provides a host cell comprising an expression vector, the expression vector comprising a nucleotide sequence which encodes a PVC effector leader sequence of the invention. Another aspect of the invention provides a host cell comprising an expression vector, the expression vector comprising a nucleotide sequence which encodes an effector fusion (e.g. fusion) of the invention, and optionally a nucleotide sequence which encodes a PVC Needle Complex.
Said host cell may be a mammalian cell, an insect cell, a yeast cell, a bacterial cell (e.g. E. coli), or a plant cell. In a preferable embodiment, the host cell is a bacterial cell (preferably E. coli).
In one embodiment, the host cell is a Photorhabdus cell, optionally wherein the Photorhabdus cell comprises a PVC operon operably linked to an inducible promoter (e.g. see Example 3). The PVC operon may be endogenous to the Photorhabdus cell (e.g. the PVC operon may be PVCu4). Suitably, the Photorhabdus cell may be obtainable from the ATCC under accession no. ATCC 29999.
The sequences (e.g. leader sequence and/or nucleic acid sequence) of the present invention include sequences that have been removed from their naturally occurring environment, recombinant or cloned (e.g. DNA) isolates, and chemically synthesized analogues or analogues biologically synthesized by heterologous systems.
The leader sequence(s) and/or polynucleotide(s) of the present invention may be prepared by any means known in the art. For example, large amounts of the leader sequence(s) and/or polynucleotide(s) may be produced by replication and/or expression in a suitable host cell. The natural or synthetic DNA fragments coding for a desired fragment will typically be incorporated into recombinant nucleic acid constructs, typically DNA constructs, capable of introduction into and replication in a prokaryotic or eukaryotic cell. Usually the DNA constructs will be suitable for autonomous replication in a unicellular host, such as yeast or bacteria, but may also be intended for introduction to and integration within the genome of cultured bacterial, insect, mammalian, plant or other eukaryotic cell lines.
The leader sequence(s) and/or polynucleotide(s) of the present invention may also be produced by chemical synthesis, e.g. a polynucleotide by the phosphoramidite method or the tri-ester method, and may be performed on commercial automated oligonucleotide synthesizers. A double-stranded (e.g. DNA) fragment may be obtained from the single stranded product of chemical synthesis either by synthesizing the complementary strand and annealing the strands together under appropriate conditions or by adding the complementary strand using DNA polymerase with an appropriate primer sequence.
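As a minimal sketch of the first route above (synthesizing the complementary strand for annealing), the sequence of the complementary DNA strand can be derived from the single-stranded product as its reverse complement. The function name is hypothetical, the input sequence is a placeholder, and only unambiguous DNA bases are assumed.

```python
# Translation table for Watson-Crick base pairing (DNA, unambiguous bases only).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complementary_strand(strand_5to3: str) -> str:
    """Return the complementary strand, itself written 5' to 3'."""
    unexpected = set(strand_5to3.upper()) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return strand_5to3.translate(_COMPLEMENT)[::-1]


top = "ATGCCG"  # placeholder single-stranded synthesis product
bottom = complementary_strand(top)
print(top, bottom)
```

Both strands are written left to right in the 5' to 3' orientation, so the second strand is the reverse complement of the first rather than its base-by-base complement.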
When applied to a leader sequence or nucleic acid sequence, the term “isolated” in the context of the present invention denotes that the leader sequence and/or polynucleotide sequence has been removed from its natural genetic milieu and is thus free of other extraneous or unwanted coding sequences (but may include naturally occurring 5' and 3' untranslated regions such as promoters and terminators), and is in a form suitable for use within genetically engineered protein production systems. Such isolated molecules are those that are separated from their natural environment.
SEQUENCE HOMOLOGY
Any of a variety of sequence alignment methods can be used to determine percent identity, including, without limitation, global methods, local methods and hybrid methods, such as, e.g., segment approach methods. Protocols to determine percent identity are routine procedures within the scope of one skilled in the art. Global methods align sequences from the beginning to the end of the molecule and determine the best alignment by adding up scores of individual residue pairs and by imposing gap penalties. Non-limiting methods include, e.g., CLUSTAL W, see, e.g., Julie D. Thompson et al., CLUSTAL W: Improving the Sensitivity of Progressive Multiple Sequence Alignment Through Sequence Weighting, Position-Specific Gap Penalties and Weight Matrix Choice, 22(22) Nucleic Acids Research 4673-4680 (1994); and iterative refinement, see, e.g., Osamu Gotoh, Significant Improvement in Accuracy of Multiple Protein Sequence Alignments by Iterative Refinement as Assessed by Reference to Structural Alignments, 264(4) J. Mol. Biol. 823-838 (1996). Local methods align sequences by identifying one or more conserved motifs shared by all of the input sequences. Non-limiting methods include, e.g., Match-box, see, e.g., Eric Depiereux and Ernest Feytmans, Match-Box: A Fundamentally New Algorithm for the Simultaneous Alignment of Several Protein Sequences, 8(5) CABIOS 501-509 (1992); Gibbs sampling, see, e.g., C. E. Lawrence et al., Detecting Subtle Sequence Signals: A Gibbs Sampling Strategy for Multiple Alignment, 262(5131) Science 208-214 (1993); Align-M, see, e.g., Ivo Van Walle et al., Align-M - A New Algorithm for Multiple Alignment of Highly Divergent Sequences, 20(9) Bioinformatics 1428-1435 (2004).
Thus, percent sequence identity is determined by conventional methods. See, for example, Altschul et al., Bull. Math. Bio. 48:603-16, 1986 and Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-19, 1992. Briefly, two amino acid sequences are aligned to optimize the alignment scores using a gap opening penalty of 10, a gap extension penalty of 1, and the "blosum 62" scoring matrix of Henikoff and Henikoff (ibid.) as shown below (amino acids are indicated by the standard one-letter codes).
The "percent sequence identity" between two or more nucleic acid or amino acid sequences is a function of the number of identical positions shared by the sequences. Thus, % identity may be calculated as the number of identical nucleotides / amino acids divided by the total number of nucleotides / amino acids, multiplied by 100. Calculations of % sequence identity may also take into account the number of gaps, and the length of each gap that needs to be introduced to optimize alignment of two or more sequences. Sequence comparisons and the determination of percent identity between two or more sequences can be carried out using specific mathematical algorithms, such as BLAST, which will be familiar to a skilled person.
ALIGNMENT SCORES FOR DETERMINING SEQUENCE IDENTITY
The percent identity is then calculated as:
$$\text{percent identity} = \frac{\text{total number of identical matches}}{\text{length of the longer sequence} + \text{number of gaps introduced into the longer sequence in order to align the two sequences}} \times 100$$
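The percent identity calculation above can be sketched for a pair of sequences that have already been aligned (e.g. by one of the alignment methods mentioned earlier). This is a minimal sketch under stated assumptions: the function name is hypothetical, gaps are represented by "-", each gap position is counted individually, and when both sequences have the same ungapped length the first is treated as the "longer" one. The example sequences are placeholders.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences.

    identical matches / (length of the longer sequence
                         + gaps introduced into the longer sequence) x 100
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # A column counts as a match only if both residues are identical and not gaps.
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap)
    gaps_a, gaps_b = aligned_a.count(gap), aligned_b.count(gap)
    len_a, len_b = len(aligned_a) - gaps_a, len(aligned_b) - gaps_b
    if len_a >= len_b:
        longer_len, gaps_in_longer = len_a, gaps_a
    else:
        longer_len, gaps_in_longer = len_b, gaps_b
    denominator = longer_len + gaps_in_longer
    if denominator == 0:
        raise ValueError("cannot compute identity of empty sequences")
    return 100.0 * matches / denominator


# 7 identical positions; longer sequence has 9 residues and no gaps: 7/9 x 100.
identity = percent_identity("ACDEFGHIK", "ACD-FGHLK")
print(round(identity, 1), identity >= 80.0)
```

In this placeholder example the result (about 77.8%) would fall below an "at least 80% sequence identity" threshold of the kind recited above.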
Substantially homologous polypeptides are characterized as having one or more amino acid substitutions, deletions or additions. These changes are preferably of a minor nature, that is conservative amino acid substitutions (see below) and other substitutions that do not significantly affect the folding or activity of the polypeptide; small deletions, typically of one to about 30 amino acids; and small amino- or carboxyl-terminal extensions, such as an amino-terminal methionine residue, a small linker peptide of up to about 20-25 residues, or an affinity tag.
CONSERVATIVE AMINO ACID SUBSTITUTIONS
Basic: arginine, lysine, histidine
Acidic: glutamic acid, aspartic acid
Polar: glutamine, asparagine
Hydrophobic: leucine, isoleucine, valine
Aromatic: phenylalanine, tryptophan, tyrosine
Small: glycine, alanine, serine, threonine, methionine
In addition to the 20 standard amino acids, non-standard amino acids (such as 4-hydroxyproline, 6-N-methyl lysine, 2-aminoisobutyric acid, isovaline and α-methyl serine) may be substituted for amino acid residues of the polypeptides of the present invention. A limited number of non-conservative amino acids, amino acids that are not encoded by the genetic code, and unnatural amino acids may be substituted for polypeptide amino acid residues. The polypeptides of the present invention can also comprise non-naturally occurring amino acid residues.
Non-naturally occurring amino acids include, without limitation, trans-3-methylproline, 2,4-methano-proline, cis-4-hydroxyproline, trans-4-hydroxy-proline, N-methylglycine, allo-threonine, methyl-threonine, hydroxy-ethylcysteine, hydroxyethylhomo-cysteine, nitro-glutamine, homoglutamine, pipecolic acid, tert-leucine, norvaline, 2-azaphenylalanine, 3-azaphenyl-alanine, 4-azaphenyl-alanine, and 4-fluorophenylalanine. Several methods are known in the art for incorporating non-naturally occurring amino acid residues into proteins. For example, an in vitro system can be employed wherein nonsense mutations are suppressed using chemically aminoacylated suppressor tRNAs. Methods for synthesizing amino acids and aminoacylating tRNA are known in the art. Transcription and translation of plasmids containing nonsense mutations is carried out in a cell free system comprising an E. coli S30 extract and commercially available enzymes and other reagents. Proteins are purified by chromatography. See, for example, Robertson et al., J. Am. Chem. Soc. 113:2722, 1991; Ellman et al., Methods Enzymol. 202:301, 1991; Chung et al., Science 259:806-9, 1993; and Chung et al., Proc. Natl. Acad. Sci. USA 90:10145-9, 1993. In a second method, translation is carried out in Xenopus oocytes by microinjection of mutated mRNA and chemically aminoacylated suppressor tRNAs (Turcatti et al., J. Biol. Chem. 271:19991-8, 1996). Within a third method, E. coli cells are cultured in the absence of a natural amino acid that is to be replaced (e.g., phenylalanine) and in the presence of the desired non-naturally occurring amino acid(s) (e.g., 2-azaphenylalanine, 3-azaphenylalanine, 4-azaphenylalanine, or 4-fluorophenylalanine). The non-naturally occurring amino acid is incorporated into the polypeptide in place of its natural counterpart. See, Koide et al., Biochem. 33:7470-6, 1994.
Naturally occurring amino acid residues can be converted to non-naturally occurring species by in vitro chemical modification. Chemical modification can be combined with site-directed mutagenesis to further expand the range of substitutions (Wynn and Richards, Protein Sci. 2:395-403, 1993).
A limited number of non-conservative amino acids, amino acids that are not encoded by the genetic code, non-naturally occurring amino acids, and unnatural amino acids may be substituted for amino acid residues of polypeptides of the present invention.
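The conservative amino acid substitution groups tabulated above can be encoded as sets of one-letter codes, which allows a substitution to be checked programmatically. This is an illustrative sketch only: the dictionary and function names are hypothetical, and residues not listed in any group (e.g. cysteine and proline) are simply treated as having no conservative partner.

```python
# Groups from the table above, written with standard one-letter codes.
CONSERVATIVE_GROUPS = {
    "basic": set("RKH"),        # arginine, lysine, histidine
    "acidic": set("ED"),        # glutamic acid, aspartic acid
    "polar": set("QN"),         # glutamine, asparagine
    "hydrophobic": set("LIV"),  # leucine, isoleucine, valine
    "aromatic": set("FWY"),     # phenylalanine, tryptophan, tyrosine
    "small": set("GASTM"),      # glycine, alanine, serine, threonine, methionine
}


def is_conservative_substitution(original: str, replacement: str) -> bool:
    """True if both residues differ and belong to the same group above."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # no substitution has taken place
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS.values())


print(is_conservative_substitution("R", "K"))  # both basic
print(is_conservative_substitution("L", "D"))  # hydrophobic vs acidic
```

A check of this kind only reflects the grouping in the table; whether a given substitution affects folding or activity would still need to be assessed experimentally, e.g. by the mutagenesis methods described herein.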
Essential amino acids in the polypeptides of the present invention can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, Science 244: 1081-5, 1989). Sites of biological interaction can also be determined by physical analysis of structure, as determined by such techniques as nuclear magnetic resonance, crystallography, electron diffraction or photoaffinity labeling, in conjunction with mutation of putative contact site amino acids. See, for example, de Vos et al., Science 255:306-12, 1992; Smith et al., J. Mol. Biol. 224:899-904, 1992; Wlodaver et al., FEBS Lett. 309:59-64, 1992. The identities of essential amino acids can also be inferred from analysis of homologies with related components (e.g. the translocation or protease components) of the polypeptides of the present invention.
Multiple amino acid substitutions can be made and tested using known methods of mutagenesis and screening, such as those disclosed by Reidhaar-Olson and Sauer (Science 241:53-7, 1988) or Bowie and Sauer (Proc. Natl. Acad. Sci. USA 86:2152-6, 1989). Briefly, these authors disclose methods for simultaneously randomizing two or more positions in a polypeptide, selecting for functional polypeptide, and then sequencing the mutagenized polypeptides to determine the spectrum of allowable substitutions at each position. Other methods that can be used include phage display (e.g., Lowman et al., Biochem. 30:10832-7, 1991; Ladner et al., U.S. Patent No. 5,223,409; Huse, WIPO Publication WO 92/06204) and region-directed mutagenesis (Derbyshire et al., Gene 46:145, 1986; Ner et al., DNA 7:127, 1988).
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Singleton, et al., DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY, 2D ED., John Wiley and Sons, New York (1994), and Hale & Markham, THE HARPER COLLINS DICTIONARY OF BIOLOGY, Harper Perennial, NY (1991) provide the skilled person with a general dictionary of many of the terms used in this disclosure.
This disclosure is not limited by the exemplary methods and materials disclosed herein, and any methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of this disclosure. Numeric ranges are inclusive of the numbers defining the range. Unless otherwise indicated, any nucleic acid sequences are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxy orientation, respectively.
The headings provided herein are not limitations of the various aspects or embodiments of this disclosure.
Amino acids are referred to herein using the name of the amino acid, the three letter abbreviation or the single letter abbreviation. The term “protein”, as used herein, includes proteins, polypeptides, and peptides. As used herein, the term “amino acid sequence” is synonymous with the term “polypeptide” and/or the term “protein”. In some instances, the term “amino acid sequence” is synonymous with the term “peptide”. In some instances, the term “amino acid sequence” is synonymous with the term “enzyme”. The terms "protein" and "polypeptide" are used interchangeably herein. In the present disclosure and claims, the conventional one-letter and three-letter codes for amino acid residues may be used. The 3-letter code for amino acids is as defined in conformity with the IUPAC-IUB Joint Commission on Biochemical Nomenclature (JCBN). It is also understood that a polypeptide may be coded for by more than one nucleotide sequence due to the degeneracy of the genetic code.
Other definitions of terms may appear throughout the specification. Before the exemplary embodiments are described in more detail, it is to be understood that this disclosure is not limited to particular embodiments described, and as such may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present disclosure will be defined only by the appended claims.
Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limits of that range is also specifically disclosed. Each smaller range between any stated value or intervening value in a stated range and any other stated or intervening value in that stated range is encompassed within this disclosure. The upper and lower limits of these smaller ranges may independently be included or excluded in the range, and each range where either, neither or both limits are included in the smaller ranges is also encompassed within this disclosure, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in this disclosure.
It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “an effector” includes a plurality of such effectors and reference to “the effector” includes reference to one or more effectors and equivalents thereof known to those skilled in the art, and so forth.
The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that such publications constitute prior art to the claims appended hereto.
BRIEF DESCRIPTION OF THE DRAWINGS
Embodiments of the invention will now be described, by way of example only, with reference to the following Figures and Examples.
Figure 1 shows (A) a schematic representation of one PVC operon layout (gene clusters present in varying regions of the originating genome) encoding a PVC Needle Complex. (B) A schematic representation of Class I, II and III PVC operon layouts. Homologous subunit types amongst the classes are shown as having similar shading (in grey scale). (C) An illustration of an assembled PVC Needle Complex. The numbering shown is used to correlate a gene cluster in (A) with the position of the encoded proteins in the structure in (C) (e.g. the cap ‘16’ cluster in A is shown as ‘16’ in the left-most cap region of (B)). (D) A map of the model Class I PaATCC 43949 PVCpnf operon (e.g. encoded by SEQ ID NO.: 93), showing two effector genes in the payload region (Rhs-like adenylate cyclase, and PAU_03332).
Figure 2 shows an overview of a cloning procedure for preparation of PVC Needle Complex expressing plasmids, based on overlapping PCR. PCR fragments (having overlapping regions) are provided from template gDNA of P. asymbiotica ATCC43949 (available from the ATCC under accession no. ATCC 43949) with relevant primers targeting the PVC operon.
Figure 3 shows a transmission electron micrograph of an (in vitro) sample of PVC Needle Complexes (e.g. prepared from cells having the expression vector described above). The PVC Needle Complexes assemble in a distinct ‘nanosyringe’ structure, consistent with its role as a contractile structure. A 3D rendered model of a PVC Needle Complex as derived from high resolution single particle cryo-EM tomography structure is shown in (B).
Figure 4 shows (A) a transmission electron micrograph of a PVC Needle Complex comprising a Pnf payload following immuno-gold staining with an anti-Pnf (immunogold) antibody, confirming the Pnf-payload toxin is associated with the PVC Needle Complex (referred to as PVCpnf). PVCpnf Needle Complexes were prepared from supernatants of an E. coli cosmid clone, which encodes the PVCpnf operon. Anti-peptide antibodies against the Pnf (TGQKPGNNEWKTGR, SEQ ID NO: 96) epitope were used to localise the payload toxin protein. The Pnf toxin could only be detected at the ends of broken or contracted needle complexes, providing evidence that the toxins are contained within the complex (arrows). (B) Western blot analysis confirms that the Pnf protein (toxin) can only be detected using the anti-peptide antibody if the PVC Needle Complex is either chemically or physically disrupted. These preparations were taken from Pa ATCC 43949 supernatants. The inability to detect Pnf in clarified supernatants confirms all the protein is associated with the PVC Needle Complex enrichment preparations. Lanes 1+5: sonicated samples; 2+6: 1 M NaCl treatments; 3+7: 1% SDS treatments; 4+8: 1 M urea treatments. Note the PVC Needle Complex appears stable in 1 M NaCl.
Figure 5 shows cryo-SEM images of ex vivo hemocytes (insect macrophage/neutrophil equivalents) from 5th instar Manduca sexta that had been injected with a native (A) or heat inactivated (B) enriched preparation of Pa ATCC 43949 PVCpnf Needle Complexes (nanosyringes) heterologously produced by an E. coli cosmid clone. Note the abundant linear structures corresponding to PVC Needle Complexes (nanosyringes) (small arrows) and membrane ruffling effect (large arrows), consistent with the mode of action of the Pnf payload toxin, which are absent from the control treatment. Scale bar = 50 µm. 25 kV; magnification 40K (A) and 50K (B).
Figure 6 shows experimental results demonstrating that the (toxic) cellular phenotype following contact with a PVC Needle Complex is due to intracellular toxin delivery. (A) A Pnf loaded PVC Needle Complex was injected into insects (Galleria mellonella insect larvae), showing potent activity within 15 minutes for the given dose (explained in the examples) - note mortality/morbidity is typically associated with the “melanisation” immune response in these dead/dying insects. (B) A control, denatured (via boiling) Pnf loaded PVC Needle Complex injected into animals showed no activity. (C) Purified Pnf (payload), absent the PVC Needle Complex (i.e. Pnf not packaged into the complex), showed no activity against either animals (left) or a HeLa cell line (right). (D) Pnf (payload) delivered into the cytosol of HeLa cells - via ‘BioPorter’ liposomal preparations containing the protein, or by intracellular expression following transfection with an appropriate plasmid (E) - showed potent activity/toxicity, as evidenced by multi-nucleation in the cells. (F) The effect of PVCpnf+Pnf on the respiration rate of THP1 derived human macrophages as measured by Resazurin plate reader assay. Note the heat denatured and empty PVCpnf nanosyringes showed no strong adverse effect. These same samples were tested by injection into Galleria larvae. The PVCpnf+Pnf samples showed more than approximately 50% mortality within minutes (darkened larvae in the bottom two panels) while the heat denatured and empty PVCpnf injected insects all remained healthy (no darkened larvae in the top two panels).
Figure 7 shows (A) in silico predicted secondary structures of a range of the endogenous payloads (toxins) associated with various PVC operons, demonstrating the large variety of structure types. (B) The amino acid length of various payloads (toxins) plotted against predicted isoelectric point.
Figure 8 shows confirmation that leader sequences (e.g. having 50 amino acids) of the invention are necessary and sufficient for (trans-)packaging payload proteins/peptides into PVC Needle Complexes (nanosyringes) expressed in Photorhabdus. (A) 1-6: Schematic maps of chimeric effector protein expression constructs (trans-expressed in the arabinose-inducible pBAD30 vector), including those expressing Pnf and non-native Cre recombinase and Myc-tags. C-terminal Myc-tag epitopes are shown as black arrows. (B) Western blots using anti-Myc mouse antibody. Samples are from purified PVC(u4) Needle Complexes (nanosyringes) overexpressed from chromosomally engineered P. luminescens TT01 which harbour the trans-packaging expression constructs 1-6 shown in (A). A blank pBAD30 plasmid was used as a negative control and showed no signal. Arrows show correct band sizes for expected products.
Figure 9 shows an alignment of the leader sequences, demonstrating the presence of a chemical composition consensus amongst the leader sequences, based on amino acid properties. More particularly, the leader sequences comprise similar charge patterns, of 2x negatively charged regions, each followed by a positively charged region [-ve] [+ve] [-ve] [+ve].
Figure 10 shows (A) western blot analysis of PVC Needle Complexes and payloads from particulate preparations (Caesium Chloride gradient and Monolith FPLC preparations, as described in Materials and Methods). In [1] (pBADPVCpnf, in which PVC16 of the nanosyringe is FLAG-tagged providing PVC16::FLAG detectable with AntiFLAG Ab), a signal from the tagged cap protein of “PVCPnf” (PVC Needle Complex with a Pnf payload) can be seen, confirming the presence of PVC Needle Complexes in the purified fraction. In [2] (pBADPVCpnf + Cre::Myc, detectable with AntiMyc Ab, the Cre having an N-term fusion of the Pnf leader e.g. SEQ ID NO.: 78), a signal from the Myc-tagged payload protein, packaged in abundance, can be seen in the same sample as [1], confirming the presence of Cre payload in purified PVC Needle Complexes (nanosyringes). In [3] (PVCU4 + Cre::Myc, detectable with AntiMyc Ab, the Cre having an N-term fusion of the Pnf leader e.g. SEQ ID NO.: 78), a different PVC Needle Complex chassis (“PVCU4”) purification is probed for Myc-tagged Cre, revealing a corresponding band (packaged Myc-tagged Cre). This is highlighted in the blot for clarity. (B) Transmission electron micrographs of PVC Needle Complexes show that packaging an atypical (non-native) recombinase (Cre) payload, in any chassis tested, does not affect the morphology of the PVC Needle Complexes relative to wild-type complexes (having a Pnf payload), confirming that they are not assembled aberrantly. (C) provides additional/complementary data to that of (A). In more detail, (C) provides further proof via Western blot analysis of (trans-)packaging of the Cre recombinase into purified PVCpnf expressed in E. coli. The Western blot demonstrates that for a given amount of Anti-FLAG antibody Western signal (a specific probe for the nanosyringe due to the incorporation of PVC16::FLAG), a much higher amount of the Cre payload is detected (using the Anti-Myc tag antibody). The numbers denote 2-fold dilutions.
Note, upon dilution, the anti-FLAG signal from the nanosyringe is lost, while the payload remains intense in most lanes. CsCl denotes purification by Caesium Chloride density gradient centrifugation. “Mon” denotes the samples were additionally anion exchanged via “Monolithic” columns. “Post-Elution”, “Interphase”, “Sub-Interph.” denote the liquid fractions where the signal is detected from the purification process. (D) Western blot analysis of Cre trans-packaged into PVCpnf in E. coli. Payloads are probed for their incorporated ‘Myc’ tags (C-terminal fusions) after purification of the nanosyringe-payload complex. Western blot analysis of particle preps confirms that all four leaders could efficiently trans-package the exogenous Cre enzyme. (E) A phylogenetic tree, demonstrating the exemplified leader sequences are well distributed throughout and are therefore at or close to maximally sequentially diverse (see Example 4.2).
Figure 11 shows western blot analysis of PVC Needle Complexes expressed without (1) and with (2) concomitant expression of (Myc-tagged) Pnf from a separate plasmid, probed simultaneously with an anti-FLAG and anti-Myc antibody. In the lanes marked 1, the PVC Needle Complex (nanosyringe) was expressed and purified without the presence of a ‘payload plasmid’ (an expression plasmid encoding a payload protein linked to a leader sequence) within E. coli. This leads to a band corresponding only to the FLAG tag present on the syringe (PVC Needle Complex) itself. For lanes 2, the same approach was undertaken, but using cultures which also included a (separate) plasmid bearing a tagged payload (Myc-Pnf). Bands can be seen which correspond to the FLAG and Myc tags, confirming presence of the Pnf payload (the four lanes within 1 and 2 are simply different purification fractions from Caesium Chloride gradients).
Figure 12 shows western blot analysis of trans-packaging experiments in the P. luminescens TT01 PVCu4 over-expression strain. Results demonstrate the trans-packaging of a Myc-tagged Pvc17 (Plu1651whole::Myc).
Figure 13 shows further western blot analysis of trans-packaging experiments in the P. luminescens TT01 PVCunit4 over-expression strain (as explained in the Examples). Results demonstrate trans-packaging of Myc-tagged Pvc17 (Plu1651::Myc) and a Myc tag alone using the leader of Pnf (PAU_03332 leader), and that the leader is necessary. (A) Lane 1 shows packaging of the leader fused to a Myc-tag (PAU_03332::Myc); lane 3 shows a lack of packaging when the leader sequence is absent (Myc only is not packaged); lane 4 shows lack of packaging of HvnA (a natural effector) when the leader sequence is absent; lane 6 shows packaging of Myc-tagged PAU_03332::Plu1649, i.e. a chimaera of the leader from PAU_03332 (i.e. amino acids 1-50 of PAU_03332) and the effector (i.e. amino acids 51 to the C-terminus) from Plu1649. The high intensity of bands in lanes 1 and 6 demonstrates that the Pnf (PAU_03332) leader is particularly effective at packaging a payload. (B) Lane 1 shows packaging of Plu1651 with a C-terminal Myc tag using an anti-Myc antibody Western blot.
Figure 14 shows further Western blot analysis demonstrating the very high level of trans-packaging of Myc-tagged Pnf (PAU_03332::Myc) using the PAU_02806 (GogB) leader (second lane, not including the ladder lane). The first lane demonstrates use of the Plu1649 leader for packaging the PAU_03332 effector (Myc-tagged Plu1649::PAU_03332). The band appears weak due to the relative intensity of the band in the second lane. The experiment involved filter sterilisation of 50 mL culture, with urea added to a final concentration of 8 M to break down PVCs. Samples were collected from 10 mL supernatant.
Figure 15 shows further western blot analysis demonstrating trans-packaging of Plu1651 (pvc17) with a C-terminal Myc tag as described in Figure 13 into PVCunit4 expressed from Photorhabdus. Raw represents particulate preps from supernatants; Be, Be2 and IP represent different “cuts” from a Caesium Chloride gradient purification.
Figure 16 (A) provides a diagrammatic explanation of the mechanism of action of Cre in the mouse organoid experiment (of Example 6), and how the positive control (TAM) facilitates Cre activation. White arrows show the location of cells expressing the tdTom fluorescent reporter gene. (B) Demonstration of delivery of active trans-packaged Cre-recombinase into murine bile duct organoids by PVCpnf expressed and purified from E. coli. White circles show the location of groups of cells expressing the fluorescent reporter gene. The upper images show a direct grey scale conversion of images obtained via light microscopy. The lower image shows a corresponding image with false-colour enhancement of positive cells, which is provided simply to aid identification of the difference between affected cells and surrounding unaffected ones within the former grey scale conversion.
Figure 17 shows a dot-blot analysis of nanosyringe expression both with a payload (the Cas9-like protein MAD7) and without. Some leaky expression of the IPTG inducible MAD7 is seen before induction (T1) as is common with this expression system. There is no Myc signal from the PVC only sample at any time point as expected, and the MAD7 signal grows throughout the expression over a ~24 hour period. Strong Myc signal is maintained post purification via ultracentrifugation as described elsewhere, indicating that the protein is incorporated into the nanosyringe chassis system. FLAG signal is robust in the MAD7 sample, and occurs as expected post-induction and persists post-purification, as this promoter system has reduced leaky expression. It is concluded that the nanosyringes and MAD7 are compatible with one another in terms of expression, and that MAD7, the largest protein tested to date, can be packaged into the nanosyringe system.
Figure 18 shows western dot-blot analysis confirming trans-packaging of the pro-apoptotic tBid protein domain and BaxBH3 peptide (both having the leader sequence of SEQ ID NO.: 78 fused to the N-term) into purified PVCpnf expressed from E. coli (7 & 8). The nanosyringe with its cognate toxin “Pnf” is shown, as purified by 2 different methods (5 & 6), as a positive control. The blots at the bottom of the panel represent the same examples as in 7 & 8 in the panels above. These blots were made from another purification of the same constructs, demonstrating reproducibility of purification. This experiment demonstrated that “tBid protein domain and BaxBH3 peptide” packed samples (nanosyringes) can be successfully prepared, e.g. for use in the apoptosis delivery assays in Example 9.
Figure 19 (A) shows TUNEL-stain microscopic analysis from cells exposed to the packaged nanosyringes for 20 minutes only. First (left) bar = DNase I treated cells (+ control); second bar = no DNase I or nanosyringe treatment (- control); third bar = cells were exposed to nanosyringes packaged with tBid (via leader sequence of SEQ ID NO.: 78 fused to the N-term); fourth (right) bar = cells were exposed to nanosyringes packaged with Bax_BH3 domain (via leader sequence of SEQ ID NO.: 78 fused to the N-term). (B) Representative micrographs as described in Example 9, showing TUNEL staining of PBMCs, following treatment with nanosyringes and controls. PBMCs were treated with tBID, Bax loaded nanosyringes, and the positive (DNase I treated cells) and negative (no DNase I treatment) controls for 20 minutes at room temperature before performing TUNEL staining to determine an apoptotic response. In the original (non-grayscale) micrographs: cells negative for apoptotic response show blue or light brown staining. Blue staining (Methyl green) or light brown staining indicates healthy cells with absence of apoptotic signal. Dark brown staining indicates cells undergoing apoptosis.
EXAMPLES
Materials and Methods
Cloning
Plasmids encoding PVC Needle Complexes were prepared using standard molecular techniques known in the art. Briefly, genomic DNA from P. asymbiotica ATCC43949 (obtainable from the ATCC under accession no. ATCC 43949) was used in PCR (with appropriate primers) to amplify multiple (e.g. four) overlapping regions of the PVC operon. Overlap/extension PCR was employed to prepare a whole operon, which was fused (again using overlapping PCR) into an appropriate expression vector as detailed in Figure 2 (using the primers of SEQ ID NO: 101 - SEQ ID NO: 106).
Briefly: four overlapping PVC fragments (generated with primers of SEQ ID NO: 101 (F1) and SEQ ID NO: 105 (R1); SEQ ID NO: 102 (F2) and SEQ ID NO: 106 (R2); SEQ ID NO: 103 (F3) and SEQ ID NO: 107 (R3); and SEQ ID NO: 104 (F4) and SEQ ID NO: 108 (R4), respectively) were made covering the PVC operon (e.g. of SEQ ID NO: 93). The target cloning vector was cut at the required insertion site. These 5 DNA fragments were then assembled by overlapping PCR (using primers of SEQ ID NO: 101 and of SEQ ID NO: 108), and the resulting fragment was ligated into the cloning vector. Products were transformed into laboratory E. coli and recovered with vector marker selection (e.g. due to ampicillin resistance).
The operons are typically operably linked to an inducible promoter (e.g. arabinose inducible, and/or IPTG inducible) as is known in the art. This is generally achieved by cloning into pBAD family plasmids (inducible via arabinose) (Invitrogen, catalog number: V43001) and pVTRa (inducible via IPTG) (Biomedal, S.L.) vectors (although any combination of compatible expression vector systems should suffice).
A PVC Needle Complex can be expressed independently of the payload (toxin), and vice versa. Separate expression vectors (e.g. having differing inducible promoters) may harbour the PVC Needle Complex and the payload, respectively.
Expression (e.g. laboratory scale expression) / Purification of PVC Needle Complexes in E. coli
A typical process to purify a PVC Needle Complex from a 1 L culture of an E. coli expression strain (transformed with an appropriate expression vector/ cosmid) is as follows:
1- An overnight culture of the bacteria (transformed with PVC Needle Complex expression vector) is prepared by picking a colony from a plate and inoculating 100 mL of LB media. The culture is grown at 37°C with shaking.
a. Typically, the media may be routinely supplemented with 0.2% d-Glucose to aid repression of the genetic constructs for optimal cell health.
b. The media is also supplemented with the relevant antibiotics for maintenance of the expression (PVC Needle Complex) vector. If a payload vector is also being used, the relevant antibiotic for that vector is also supplied.
2- The next day, a 1 L flask is inoculated via dilution in a 1:100 ratio from the overnight culture. The media for the 1 L flask is identical to the overnight media but typically does not contain glucose.
3- Cultures are grown to approximately mid-to-late exponential (an OD600nm of ~0.8) at which point the plasmids are induced.
a. For the PVC Needle Complex (nanosyringe) plasmid, typically 0.2% arabinose is added to induce expression. For the payload plasmid (plasmid encoding for the payload, such as Pnf), IPTG concentrations may typically be optimised on a per-protein basis, and a typical starting figure of 0.1 mM is preferable.
4- The cultures are returned to the incubator post-induction and cultured at 18°C until the following day.
5- Cultures are harvested by centrifugation in appropriate centrifuges/bottles/rotors at 5000xg for 30 mins.
6- Cell pellets are then lysed to release PVC Needle Complexes (nanosyringes).
a. The following lysis methods may be used:
(i) Lysozyme incubation overnight
(ii) Sonication with a needle sonicator (with or without first treating with lysozyme)
(iii) Cell disruptor/homogenisers.
7- Optionally, DNAse, and protease inhibitors can be added to the lysate.
8- Cell debris is removed by centrifugation at 50,000xg, 4°C, for 20 minutes in a high speed centrifuge.
9- Concentrate the lysate through a 100 kDa (100,000 Da) MWCO centrifugation column to reduce volumes and remove small proteins. Once the volume has been reduced to a manageable level, centrifuge several times, replacing the retentate solution with an appropriate sample buffer such as TM (20 mM Tris-HCl, 8 mM MgCl2, pH 7.4) to dialyse.
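The inoculation and induction figures in the expression steps above can be worked through numerically. The following is a minimal sketch only: the stock concentrations (20% w/v arabinose, 1 M IPTG) and the function name are illustrative assumptions that do not appear in the protocol, and the small volume change on adding each stock is ignored.

```python
def stock_volume_ml(final_conc: float, stock_conc: float, culture_ml: float) -> float:
    """Volume of stock (mL) needed to reach final_conc in culture_ml.

    Uses C1*V1 = C2*V2. Both concentrations must be in the same unit.
    The added volume itself is ignored (reasonable for small additions).
    """
    return final_conc * culture_ml / stock_conc


CULTURE_ML = 1000.0  # 1 L expression culture, as in the protocol

# 1:100 dilution from the overnight culture
inoculum_ml = CULTURE_ML / 100

# 0.2% arabinose final, from an ASSUMED 20% (w/v) stock
arabinose_ml = stock_volume_ml(0.2, 20.0, CULTURE_ML)

# 0.1 mM IPTG final, from an ASSUMED 1 M (1000 mM) stock
iptg_ml = stock_volume_ml(0.1, 1000.0, CULTURE_ML)

print(f"inoculum: {inoculum_ml} mL, arabinose: {arabinose_ml} mL, IPTG: {iptg_ml} mL")
```

Other stock concentrations can be substituted directly, provided the final and stock values share a unit.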
A subsequent process for purification via Caesium Chloride density gradient is as follows:
1. Prepare CsCl density solutions as follows:
(A) 1.7 g/mL CsCl in H2O; (B) 1.5 g/mL CsCl in H2O; (C) 1.45 g/mL CsCl in H2O
2. Gradients (from bottom-to-top of the tube) are then set up in ultracentrifuge tubes like so:
(1) (bottom of tube) - 2 mL density 1.7 CsCl; (2) - 3 mL density 1.5 CsCl; (3) - 3 mL density 1.45 CsCl; (4) (top of tube) - sample in TM buffer. Suitably, apply each density carefully to the side of the tube so as not to blend the boundary with the previous density layer.
3. Balanced tubes are then subjected to ultracentrifugation at 35,000 RPM in an SW40Ti swinging bucket rotor, equivalent to 155,000 xg, for 2 hours, 4°C.
4. The correct gradient fraction will be the region just above a ‘blue-ish-white’ halo that appears. Fractions are extracted via puncturing the tube with a syringe and needle.
5. PVC Needle Complexes of good purity can be obtained in this manner, and stored in buffer at 4°C. Suitably, dialyse back into TM buffer to remove the CsCl.
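The relationship between rotor speed and relative centrifugal force quoted in step 3 can be checked with the standard formula RCF = ω²r/g. The sketch below is illustrative only; the average rotor radius used (112.7 mm) is an assumed figure for an SW 40 Ti type rotor and should be confirmed against the rotor manual before use.

```python
import math

G_STANDARD = 9.80665  # m/s^2, standard gravity


def rcf_from_rpm(rpm: float, radius_mm: float) -> float:
    """Relative centrifugal force (multiples of g) at a given speed and radius."""
    omega = 2 * math.pi * rpm / 60.0          # angular velocity in rad/s
    return omega ** 2 * (radius_mm / 1000.0) / G_STANDARD


def rpm_from_rcf(rcf: float, radius_mm: float) -> float:
    """Rotor speed (RPM) needed to reach a given RCF at a given radius."""
    omega = math.sqrt(rcf * G_STANDARD / (radius_mm / 1000.0))
    return omega * 60.0 / (2 * math.pi)


R_AV_MM = 112.7  # ASSUMED average radius; check the rotor documentation

rcf = rcf_from_rpm(35_000, R_AV_MM)
print(f"35,000 RPM at {R_AV_MM} mm is roughly {rcf:,.0f} x g")
```

Under this assumed radius the result is within a few percent of the 155,000 xg stated above; the inverse function can be used to set an equivalent speed on a different rotor.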
Following, or in place of CsCI gradient purification, PVCs can be extracted via Monolith anion exchange chromatography, as follows (note all steps can be performed manually with a peristaltic pump or syringe apparatus, or via F/HPLC):
1. Unless already done, dialyse the sample extract into the binding mobile phase (typically TM buffer) with a low concentration of salt (20 mM NaCl).
2. Equilibrate the column according to the manufacturer’s guidelines, briefly:
a. At least 5 Column Volumes (CV) of dH2O;
b. At least 5 CV of binding buffer (TM, with low salt);
c. At least 5 CV of elution buffer (TM with high salt, >= 1 M NaCl);
d. At least 10 CV of binding buffer once more.
3. Apply the sample to the column at a low flow rate (1-2 mL/min).
4. Wash the column with up to 200 mM NaCl-containing TM buffer.
5. Elute with 1 M NaCl-containing TM buffer (alternatively, use a gradient elution if using an FPLC machine).
6. PVC Needle Complexes are present in the elution fractions. If a fraction collector is used, subsequent SDS-PAGE or similar may be needed to identify the correct fraction.
The column (of e.g. step 2) was a CIMmultus™ Quaternary Amine anion exchange column (BIA Separations d.o.o.), for example the CIMmultus™ QA-1, which is a monolithic column with a 1.3 µm channel size and a column volume of 1 mL.
Alternatively, a DEAE (a weak anion exchanger) column may be used.
Alternatively, for use with a Photorhabdus expression system, PVC Needle Complexes can be purified from supernatants as well as/instead of cell pellets, with the following additions/modifications:
1. Following cell harvest from the standard protocol above, supernatants are transferred to a pyrex bottle, and can optionally be concentrated via 100,000 MWCO columns if necessary.
a. DNAse (0.25 U/mL) and protease inhibitors can optionally be added.
2. NaCl is added to a final concentration of 0.5 M, and 80 g/L of PEG6000 is also added. The solution is mixed at 4°C overnight.
3. The solution is centrifuged to pellet the PEG6000 at 8000xg, 4°C for 30 mins.
4. The pellet is resuspended in a small volume (~5 mL) of TM buffer (or similar) and incubated for 2 hours at room temperature, shaking.
5. Pellet by centrifugation at 13,000 xg for 10 mins, and collect the supernatant to a new tube. Proceed with purification method of choice.
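The quantities of NaCl and PEG6000 required in step 2 scale linearly with supernatant volume. A minimal sketch of that arithmetic follows; the function name and the example volume are hypothetical, and the calculation ignores the volume increase caused by dissolving the solids.

```python
NACL_MW = 58.44  # g/mol, molar mass of sodium chloride


def precipitation_additions(volume_ml: float,
                            nacl_molar: float = 0.5,
                            peg_g_per_l: float = 80.0) -> dict:
    """Grams of NaCl and PEG to add to a supernatant of volume_ml.

    Defaults follow the protocol text (0.5 M NaCl, 80 g/L PEG6000).
    """
    litres = volume_ml / 1000.0
    return {
        "nacl_g": nacl_molar * NACL_MW * litres,
        "peg_g": peg_g_per_l * litres,
    }


# Example: a hypothetical 500 mL of clarified supernatant
amounts = precipitation_additions(500.0)
print(f"NaCl: {amounts['nacl_g']:.2f} g, PEG6000: {amounts['peg_g']:.1f} g")
```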
Other methods for purifying PVC Needle Complexes have been described elsewhere, for example in Yang et al (J Bacteriol. 2006 Mar; 188(6): 2254-2261), incorporated herein by reference.
Construction of an arabinose inducible over-expression strain for P. luminescens TT01 PVCunit4 (chassis encoded by genes plu1667 - plu1652)
Photorhabdus strains overexpressing a PVC Needle Complex were prepared using chromosomal recombineering to place a PVC (operon) of choice (operon encoding PVCunit4 Needle Complex was used here, as an example) under the control of an arabinose inducible transcription promoter. The recombineered strains are then genetically transformed with effector expression plasmids (e.g. based on the arabinose inducible expression vector pBAD30) to facilitate PVC Needle Complex over-expression, PVC effector expression, PVC effector trans-packaging, and secretion of the whole complex simply through the addition of the arabinose sugar.
Recombinant Photorhabdus PVC over-expression strain construction
The promoter region of PVCunit4 was amplified using primers PVCpromF (5’-TATCATATGTCTACAACTCCAGAACAAATTGCTG-3’, SEQ ID NO: 97) and PVCpromR (5’-ATCTCTAGAACAGATATTCCAGCCAGC-3’, SEQ ID NO: 98) using genomic DNA from P. luminescens strain DJC (aka strain TT01) as a template. A suitable P. luminescens strain is obtainable from the ATCC under accession no. ATCC 29999. The PCR product was digested with NdeI and XbaI and introduced by ligation into the suicide vector pCEP (ThermoFisher, catalog number: V04450), using E. coli DH5α λ-pir (Biomedal S.L.) as the carrier strain. The resulting plasmid was transferred to the E. coli donor strain S17.1 λ-pir (Biomedal S.L.) for conjugation into Photorhabdus. Briefly, overnight cultures of the donor strain and a rifampicin resistant (RifR) isolate of P. luminescens DJC were diluted in LB supplemented with 10 mM MgSO4 and grown to mid-exponential (OD600 ~0.5). Then, 3 ml of each culture were harvested, washed twice and re-suspended in 100 µl of LB supplemented with 10 mM MgSO4. 80 µl of P. luminescens DJC RifR were mixed with 20 µl of the donor bacteria (resulting in a recipient to donor ratio of 4:1) and placed in the centre of an LB agar plate supplemented with 0.1 % pyruvate and 10 mM MgSO4. The plate was incubated overnight at 30°C and the resulting growth was harvested in 1.5 ml LB. Aliquots were plated on plates containing rifampicin (50 µg/ml) and chloramphenicol (25 µg/ml) to select for transconjugants and the plates were incubated at 30°C for 3 days. Possible transconjugants were re-streaked and confirmed by PCR using primers ParaINF (5’-GGCGTCACACTTTGCTATG-3’, SEQ ID NO: 99) and tPVCpR (5’-TCGGTGGCAGTAAATTGTCC-3’, SEQ ID NO: 100).
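As a quick consistency check on the cloning design described above, the primer sequences can be scanned for the recognition sites of the enzymes used for digestion. The sketch below is illustrative; the helper name is hypothetical, and the recognition sequences used are the standard ones for NdeI (CATATG) and XbaI (TCTAGA). Both sites are palindromic, so only one strand needs to be searched.

```python
# Primer sequences as given in the text (SEQ ID NO: 97 and 98)
PRIMERS = {
    "PVCpromF": "TATCATATGTCTACAACTCCAGAACAAATTGCTG",
    "PVCpromR": "ATCTCTAGAACAGATATTCCAGCCAGC",
}

# Standard recognition sequences of the two restriction enzymes
SITES = {"NdeI": "CATATG", "XbaI": "TCTAGA"}


def find_sites(seq: str) -> dict:
    """Return {enzyme: 0-based position} for each site present in seq."""
    seq = seq.upper().replace(" ", "")
    return {name: seq.find(site) for name, site in SITES.items() if site in seq}


for name, seq in PRIMERS.items():
    print(name, len(seq), "nt", find_sites(seq))
```

Each primer carries exactly one of the two sites near its 5' end, which is consistent with directional cloning of the NdeI/XbaI-digested PCR product.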
PVC Needle Complex over-expression and purification from Photorhabdus
Overnight cultures of P. luminescens DJC PVCunit4::pCEP were diluted in 2x 250 ml LB supplemented with chloramphenicol (25 µg/ml) and incubated at 28 °C, 180 rpm. After 2-3 h, arabinose (0.2 %) was added and the cultures were returned to the incubator for another 26 h. The cells were pelleted by centrifugation (7000 g for 30 min) and the supernatant was collected. DNAse I was added to the supernatant at a concentration of 0.25 U/ml to degrade any extracellular DNA. Following an incubation of 30 min at room temperature, polyethylene glycol 8000 (8 %) and NaCl (0.5 M) were added to precipitate the proteins. The supernatants were incubated overnight at 4°C, stirring. The precipitated proteins were then collected by centrifugation at 8000 g for 30 min at 4°C. The pellets were re-suspended in 8 ml TM buffer (20 mM Tris-HCl, 20 mM MgCl2, pH 7.4) and incubated at room temperature for 2 h with gentle shaking. Any remaining debris was removed by centrifugation at 13000 g for 10 min and the supernatant containing PVC Needle Complexes was applied to a CsCl density gradient and centrifuged at 35000 rpm for 2 h in a Beckman Coulter Optima L-90K or XPN-80K ultracentrifuge. The CsCl density gradient was made by layering TM buffer containing CsCl at ρ = 1.7 (2 ml), 1.5 (3 ml), and 1.45 (3 ml) from the bottom of the tube, respectively. The fraction containing PVC Needle Complexes was collected and Ultracel 100K devices (Amicon) were used to remove the CsCl and exchange the buffer for TMS (20 mM Tris-HCl, 8 mM MgSO4, pH 7.4). The PVC Needle Complexes were further purified using a CIMmultus™ quaternary amine 2 µm pore anion exchange column (BIA Separations). The column was washed with TMS buffer containing 200 mM NaCl and the PVC Needle Complexes were eluted in TMS containing 1 M NaCl. The NaCl was removed by buffer exchange using an Ultracel 100K device and the sample was applied to a CIMmultus™ DEAE 2 µm pore column (BIA Separations) for a final purification. The column was washed in TMS containing 200 mM NaCl and the sample was eluted in TMS containing 500 mM NaCl. This procedure can be performed with or without lysis of the cells (to release the PVC Needle Complexes), e.g. because the PVC Needle Complexes appear to be secreted from live cells and can be collected in the supernatant.
Transmission electron microscopy
For transmission electron microscopy (TEM), pioloform-covered 300-mesh copper grids that were coated with a fine layer of carbon were used as substrates for the protein fractions. A preferred aqueous negative stain is 3% methylamine tungstate. The coated grids were exposed to UV light for 16 h immediately prior to use to ensure adequate wetting of the substrate. A 10 µl drop was applied to the TEM grid, and the protein was allowed to settle for 5 min. Liquid was absorbed with filter paper from the edge of the grid and replaced immediately with 10 µl of filtered negative stain. The drop was partially removed with filter paper, and the grids were allowed to air dry thoroughly before they were viewed with a JEOL 1200EX transmission electron microscope (JEOL, Tokyo, Japan) operating at 80 kV.
BioPORTER assay and actin stress fibre analysis.
For BioPORTER assays (Genlantis), 80 µl of purified wild-type and mutant Pnf proteins (500 µg ml-1), or PBS as a negative control, were added to one BioPORTER tube (Genlantis) and re-suspended in 920 µl of DMEM. The samples were added to HeLa cells grown in 6-well plates and incubated for 4 h. BioPORTER/protein or PBS mixes were replaced by fresh complete medium and the cells were incubated for 20-48 h. To visualize cell morphology and actin cytoskeleton, cells were fixed for 15 min in 4% PBS-formaldehyde, permeabilized with 0.1% Triton X-100 and stained with Tetramethylrhodamine B isothiocyanate (TRITC)-phalloidin (Sigma) and DAPI dihydrochloride (Sigma). Images were acquired with a LSM510 confocal microscope (Leica).
EXAMPLE 1
Cloning and expression of PVC Needle Complexes
The inventors have successfully excised (cloned) the required expression genes from the host bacterium, Photorhabdus (e.g. which are comprised within SEQ ID NO: 93, SEQ ID NO: 94 and/or SEQ ID NO: 95), and have devised a reliable, scalable expression system in laboratory E. coli as explained above. It has been demonstrated that trans-expression on separate plasmids enables incorporation of payloads (e.g. Pnf) into the syringes, creating a multi-plasmid (modular) platform.
Following purification from E. coli, electron microscopy analysis demonstrated that the purified PVC Needle Complexes retained the correct ‘nanosyringe’ structure (see Figure 3). Furthermore, PVC Needle Complexes remained correctly associated with the payload (e.g. Pnf) following purification (see Figure 4), demonstrating that the inventors have successfully prepared the PVC Needle Complexes (nanosyringes) having the correct structure for payload delivery to cells.
Furthermore, electron microscopy analysis demonstrated that the purified complexes appropriately localise to the surface of cells, and that PVC Needle Complexes with a Pnf payload (PVCpnf) induce a phenotype (ruffling) consistent with the postulated mechanism of the effector (PVC) - see Figure 5.
EXAMPLE 2
2.1 Demonstrating PVC Needle Complexes exert effect via intracellular delivery of effector
The polypeptide Pnf was identified as a PVC effector as follows. This was identified within the Photorhabdus asymbiotica ATCC43949 complete genome - GenBank Accession Number: FM 162591.1.
The final gene of the PVC operon (P. asymbiotica ATCC43949 PVCpnf operon, which has a sequence of SEQ ID NO: 93) was identified, namely pvc16 (e.g. PAU_03338). The position of the pvc16 genes of a PVC locus is illustrated in Figures 1(A), (B) and (D). ORFs shortly 3’ of pvc16 (e.g. within about 5 kb downstream of pvc16) were identified - one such ORF (PAU_03332) being 3535 bp downstream of pvc16. The predicted function of the polypeptide (having a sequence of SEQ ID NO.: 32) encoded by this putative effector ORF was obtained by a combination of BlastP and HHPRED (https://toolkit.tuebingen.mpg.de/#/tools/hhpred). This ORF could then be assigned as a PVC effector based on direct homology to a known bacterial toxin (e.g. of the CNF1 family from E. coli).
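The downstream-window search described above (ORFs within about 5 kb 3’ of pvc16) can be sketched as a simple coordinate filter. This is an illustration only and not the inventors' pipeline: the coordinates and ORF names below are placeholders rather than values from the GenBank record, and all features are assumed to lie on the forward strand with the operon read left to right.

```python
from dataclasses import dataclass


@dataclass
class Orf:
    name: str
    start: int  # 1-based start coordinate (placeholder values below)
    end: int    # 1-based end coordinate


def downstream_orfs(anchor: Orf, orfs: list, window_bp: int = 5000) -> list:
    """Return (name, gap_bp) for ORFs starting within window_bp 3' of anchor.

    Assumes forward-strand features; gap is measured from the anchor's end
    to the candidate's start. Results are sorted by increasing gap.
    """
    hits = []
    for orf in orfs:
        gap = orf.start - anchor.end
        if orf.name != anchor.name and 0 <= gap <= window_bp:
            hits.append((orf.name, gap))
    return sorted(hits, key=lambda hit: hit[1])


# Placeholder coordinates chosen only so that one gap equals 3535 bp
pvc16 = Orf("pvc16", 9_000, 10_000)
candidates = [
    pvc16,
    Orf("orf_near", 10_500, 12_000),
    Orf("orf_3535", 13_535, 16_000),
    Orf("orf_far", 20_000, 21_000),
]

print(downstream_orfs(pvc16, candidates))
```

Candidates passing this filter would then be taken forward for functional prediction (e.g. BlastP and HHPRED as stated above).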
A Pnf loaded PVC Needle Complex was then prepared according to Example 1.
The inventors have demonstrated that these packaged (e.g. laden) PVC Needle Complexes exert cellular effects consistent with the provenance of the cargoes they carry. By way of example, cells and whole insect animals exposed to PVC Needle Complexes loaded with the cytoskeleton toxin Pnf undergo cell death in a manner consistent with cytoskeleton toxicity.
Injection experiments (injection into the insect larvae) were performed by injection of 10 µl of supernatant, provided following centrifugation (pelleting) of an overnight culture (typically 1 L) of E. coli harbouring a cosmid clone encoding the PVC Needle Complex with Pnf (PVCPnf) - e.g. a PVC encoded by SEQ ID NO.: 93, packaged with a PVC effector of SEQ ID NO.: 32.
Demonstrating that the PVC Needle Complexes are responsible for the phenotype due to intracellular delivery (e.g. injection) of the Pnf payload, the toxic effect could only be reconstituted when the same protein (Pnf) is provided with another route to access the cell cytosol (transfection and expression of an expression plasmid, or conductance via liposomal preparations containing the protein) - see Figure 6. Conversely, denatured (via boiling) PVC Needle Complex preparations, toxin proteins overlaid on tissue culture cells or toxin proteins injected into whole animals showed no activity.
2.2. Evidence of delivery of the toxic effector enzyme Pnf into cultured human macrophages
To complement the data outlined above, the inventors conducted additional experimentation providing further evidence of delivery of the toxic effector enzyme Pnf into cultured human macrophages.
Concept: The inventors tested PVCpnf expressed and purified from E. coli, (trans-)packaged with the native Pnf toxin, on cultured human THP1 derived macrophages. Unlike the lethal effect of the Pnf toxin in insect models, previous liposome mediated Pnf protein transfection experiments indicated a subtler phenotype in human HeLa cells. In those experiments the cells showed actin stress fibre formation at 24 h and multinucleation at 48 h. The inventors therefore tested the effect of the purified PVCpnf (the nanosyringe) holding/packaged with the Pnf PVC effector on macrophage respiration rate using a Resazurin colourimetric assay.
Methods:
Background behind Resazurin assays. The blue compound resazurin was explored for use in assays to determine the activity of PVCs on macrophages (M0). Resazurin is metabolically reduced in cell mitochondria, producing a pink and highly fluorescent compound, resorufin. The effect of PVCs on macrophage metabolism can be determined by introducing resazurin into the culture media. The number of macrophages affected by PVCs can be inferred by comparing the fluorescence measured to that of the cell density optimisation curve (see Czekanska, Methods in Molecular Biology, 2011, 740, 27-32, incorporated herein by reference).
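The comparison of a measured fluorescence to the cell density optimisation curve can be sketched as a simple linear interpolation. This is an illustrative sketch only: the `cells_from_fluorescence` helper and the curve values are hypothetical, the readings are assumed to be blank-corrected, and the curve is assumed to increase monotonically with cell density.

```python
def cells_from_fluorescence(signal, curve):
    """Linearly interpolate a cell density from (density, fluorescence) standard points."""
    pts = sorted(curve, key=lambda p: p[1])  # order the points by fluorescence
    if not pts[0][1] <= signal <= pts[-1][1]:
        raise ValueError("signal lies outside the standard curve")
    for (d0, f0), (d1, f1) in zip(pts, pts[1:]):
        if f0 <= signal <= f1:
            if f1 == f0:
                return d0
            # Straight-line interpolation between the two bracketing points.
            return d0 + (d1 - d0) * (signal - f0) / (f1 - f0)


# Hypothetical blank-corrected readings: (cells per mL, fluorescence units).
standard_curve = [(0.0, 0.0), (5_000.0, 100.0), (10_000.0, 200.0), (20_000.0, 380.0)]

print(cells_from_fluorescence(150.0, standard_curve))
```

A reading outside the range of the curve raises an error rather than being extrapolated, since the resazurin signal need not remain linear beyond the measured densities.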
Optimisation of use of Resazurin for THP1 derived macrophages. The metabolism of macrophages over 18 h was assessed at different seed densities to determine the optimum cell density for use of this assay with PVCs. A 30 mL culture of THP-1 cells was pelleted at 1000 rpm for 4 min, before resuspension in 2 mL of RPMI media (also containing 10 % FBS (v/v) and 2 mM L-glutamine). Cells were counted using a cell haemocytometer, then diluted in media to a density of 2x10⁶ cells mL⁻¹. THP-1 cells were then activated with phorbol 12-myristate 13-acetate (PMA) immediately before plating. 200 µL of the cells were plated in quadruplicate in a 96-well plate, and a 2-fold serial dilution was performed until reaching a final cell density of 1.5625x10³ cells mL⁻¹. 125 µL of the starting cell dilution was also plated in quadruplicate on the same plate, for a 5-fold serial dilution, until reaching a cell density of 0.32x10³ cells mL⁻¹. Four blank wells were also prepared, containing RPMI and PMA. The plate was incubated at 37 °C with 5 % CO2 for 48 h. Media was aspirated from the wells and replaced with fresh RPMI, and the macrophages were incubated for a further 24 h. A resazurin tablet (VWR) was dissolved in RPMI (12.5 mg/mL), and 10 µL added to each well in quick succession (well concentration of 1.25 mg/mL). The fluorescence produced was measured on a plate reader every 30 min for 18 h (excitation: 530-570 nm, emission: 580-620 nm, maintained at 37 °C and 5 % CO2). The optimum cell density over time was then determined for use with PVCs.
Use of assay for PVC testing. THP-1 cells, diluted to 1.25x10⁵ cells mL⁻¹, were activated and seeded in a 96-well plate, where wells contained 100 µL of cells at a final well density of 1.25x10⁴ cells mL⁻¹. Blank wells were also prepared in quadruplicate, containing cells without PVC samples, as well as wells containing media and PMA only. The plate was incubated for 48 h at 37 °C with 5 % CO2. The media was then replaced with fresh RPMI, before addition of 10 µL of each PVC sample. The plate was incubated for a further 24 h, before the addition of 10 µL resazurin (12.5 mg/mL) to each well, and the fluorescence was measured every 30 min for 18 h (excitation: 530-570 nm, emission: 580-620 nm, maintained at 37 °C and 5 % CO2).
Results: Figure 6F shows that challenge with PVCpnf+Pnf did indeed lower the respiration rate of the macrophage, while heat denatured or empty PVC pnf nanosyringes had no strong adverse effect. Nevertheless, control cells with no sample addition still showed the best respiration rates. The effects on macrophage were correlated with insect injection toxicity assays. In this case the two PVCpnf+Pnf preparations showed lethality to over half the insect cohort, while the heat denatured and empty PVCpnf injected insects all remained healthy.
EXAMPLE 3
Demonstrating that a leader sequence is responsible for payload packaging into PVC Needle Complexes
Surprisingly, the inventors have found that the provision of a ‘leader’ peptide sequence, preferably on the N-terminus of a payload (toxin) protein, can direct the payload to the PVC complex and allow for (e.g. trigger) the packaging of the payload into the PVC Needle Complex. The inventors have demonstrated that amino acid residues 1-50 of a PVC effector protein constitute or comprise a leader sequence.
To demonstrate this, an expression construct (overexpression in chromosomally engineered P. luminescens TT01) was prepared, in which the leader sequence (the N-terminal amino acid residues 1-50) was ablated such that the payload expressed by Plu1649 (referred to as “hvnA” in the figure, and having a sequence of SEQ ID NO.: 46) (Myc-tagged for detection purposes) was absent a leader sequence (see Figure 8A - construct 1). Following expression (of both the payload and PVC Needle Complex) and isolation of the PVC Needle Complex (and running the components thereof, which includes any packaged payload, on a gel), no (Myc-tagged) Plu1649 (“hvnA”) was detectable within the PVC Needle Complex via western blot analysis, demonstrating that the payload (absent the leader sequence) was not packaged into the complex (see Figure 8B, lane 1), and thus not associated with the isolated complex. Successful packaging was seen, however, for hvnA which did retain the leader sequence, see lane 2 (note that the band appears weak, due to the relative intensity of the band of lane 3).
Surprisingly, hvnA having a leader sequence from a different (non-hvnA) PVC effector (i.e. corresponding to the N-terminal amino acid residues 1-50 from the PAU_03332 effector) (see Figure 8A, construct 3) was correctly packaged into the complex and remained associated with the PVC Needle Complex upon isolation/ purification, as demonstrated by Western blot detection of the Myc-tagged hvnA (see Figure 8B, lane 3). Thus, the inventors have demonstrated the surprising ability of the ‘PAU_03332’ leader sequence (which is associated with a different payload, Pnf) for packaging of a hvnA payload (i.e. a different payload to that of PAU_03332). This demonstrates the ability to swap the leader sequences of the PVC effector, allowing use of an optimal leader sequence (having optimal packaging activity) for packaging.
EXAMPLE 4
4.1 Demonstrating that a leader sequence directs packaging (into PVC Needle Complexes) of atypical/ exogenous payloads
In an unexpected technical effect of the invention, the inventors have found that fusing a leader sequence described herein to exogenous (non-Photorhabdus) polypeptides (preferably at the N-terminus) allows for packaging of said exogenous polypeptides into a PVC Needle Complex, with the exogenous polypeptides remaining associated with the PVC Needle Complex upon isolation/purification. By way of example, see Figure 8B (lane 4) demonstrating that a non-Photorhabdus ‘Myc’ polypeptide (<10kDa) is packaged into the PVC Needle Complex when fused to a leader sequence, and lane 6, demonstrating a much larger non-Photorhabdus ‘Cre-recombinase’ polypeptide (>32kDa) can likewise be appropriately packaged into a PVC Needle Complex when fused to a leader polypeptide of the invention.
The inventors performed in-depth analysis of the size (e.g. polypeptide length) and structure of the various natural PVC effector payloads encoded by Photorhabdus (see Figure 7), which show a wide variety of different lengths and structures, demonstrating that the applicability of the PVC Needle Complex (nanosyringe) delivery system of the present invention is not limited by the size or properties of the payload protein of interest. To summarise, there is no requirement for particular secondary structure, biophysical property, or length of cargoes, confirming that the PVC Needle Complex (nanosyringe) chassis can be utilised as a versatile multifunctional delivery vehicle.
Furthermore, this packaging of exogenous polypeptides is independent of the chosen PVC Needle Complex chassis, e.g. has been accomplished using both a “PVCpnf” chassis (SEQ ID NO.: 93) and a “PVC U4” (e.g. PVCunit4) chassis (endogenous to the Photorhabdus overexpression strain) (see Figure 10A). Importantly, the inventors have demonstrated that packaging exogenous payloads in either chassis does not affect morphology of the PVC Needle Complexes, ensuring they are not assembled aberrantly (see Figure 10B).
In data shown herein, payload proteins are supplied in ‘trans’ on separate genetic constructs. The leader sequences are surprisingly sufficient to target these separately synthesised proteins for packaging into the PVC Needle Complex vehicle (see Figure 11). This applies in E. coli when the chassis (PVC) genes themselves are also present on a plasmid, as well as with chassis genes being integrated into the chromosome, as is the case in Photorhabdus, the host organism.
Further exemplification of trans-packaging of high levels of the Cre site specific recombinase into the PVCpnf nanosyringe expressed in E. coli is provided in Figure 10(C). In more detail, the inventors constructed a laboratory E. coli expression strain harbouring (i) the arabinose inducible expression plasmid for the P. asymbiotica ATCC43949 PVCpnf operon e.g. of SEQ ID NO.: 93 (with a C-terminal FLAG tag on Pvc16, e.g. immediately 3’ to SEQ ID NO.: 93) and (ii) a second IPTG inducible expression plasmid containing the Cre recombinase with an N-terminal fusion of the natural Pnf effector 50 amino acid leader sequence (e.g. leader of SEQ ID NO.: 78) and a C-terminal Myc-tag epitope. The PVC operon and effector (Cre + leader sequence) were co-induced for 24 hours and the chimeric nanosyringes purified. Western blot analysis was used to confirm the presence of the FLAG-tagged Pvc16 cap protein (and therefore the nanosyringe chassis) and the trans-packaged Myc-tagged Cre recombinase post purification.
4.2 Trans-packaging using additional leaders demonstrating functionality of a larger, diverse sequence space
Complementing the data outlined in Example 3, Figure 10D demonstrates (trans-)packaging of Cre into PVCpnf (in E. coli) using the following four additional leader sequences (thus demonstrating the functionality of a larger sequence space):
Lane 1: the leader of PAU_02096 (leader sequence = SEQ ID NO.: 71), experiment referred to as “NanoSyringe + lopt50::cre::Myc” in Figure 10D;
Lane 2: the leader of PAK_02075 (leader sequence = SEQ ID NO.: 50), experiment referred to as “NanoSyringe + cnf50::cre::Myc” in Figure 10D;
Lane 3: the leader of PAU_02009 (leader sequence = SEQ ID NO.: 68), experiment referred to as “NanoSyringe + cif50::cre::Myc” in Figure 10D; and
Lane 4: the leader of PAU_02806 (leader sequence = SEQ ID NO.: 76), experiment referred to as “NanoSyringe + gog50::cre::Myc” in Figure 10D.
These results also demonstrate the utility of leader sequences showing greater sequence diversity for (trans-)packaging a payload. Indeed, to provide further validation, the inventors performed a CLUSTALW sequence comparison of a panel of leader sequences to determine diversity. PVC effectors are identified as proteins encoding recognisable toxin-like domains that are encoded immediately downstream of the pvc16 structural gene. Each PVC operon can encode just a single effector, or several different effector genes in tandem array. A phylogenetic tree is shown in Figure 10E, with the identities of leader sequences exemplified herein for packaging payload proteins into the nanosyringe complexes being elaborated by either the P. asymbiotica ATCC43949 PVC pnf operon (solid arrows) or the P. luminescens TT01 PVC unit4 operons (dashed arrows) or both.
As can be seen from the tree of Figure 10E, the exemplified leader sequences are well distributed throughout and are therefore at or close to maximally sequentially diverse.
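As a rough illustration of how sequence diversity within an aligned panel of leaders might be quantified, the following sketch computes pairwise percent identity from already-aligned sequences. It is not the CLUSTALW procedure itself: the `percent_identity` helper is hypothetical, the aligned fragments are toy sequences (NOT the leader sequences of the SEQ ID NOs), and identity is here assumed to be counted only over columns in which neither sequence has a gap (other conventions exist).

```python
from itertools import combinations


def percent_identity(a, b):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


# Toy aligned fragments for illustration only (hypothetical names and residues).
aligned = {
    "leader_A": "MKT-LSNEQ",
    "leader_B": "MKTALSDEQ",
    "leader_C": "MRS-LTNDK",
}

for (n1, s1), (n2, s2) in combinations(aligned.items(), 2):
    print(n1, n2, round(percent_identity(s1, s2), 1))
```

Low pairwise identities across a panel would be consistent with the leaders being well separated on a tree such as that of Figure 10E.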
EXAMPLE 5
Tail Fibre / binding domain modification
PVC Needle Complexes are known to comprise tail fibres (see the 3D rendered PVC structure, left most asterisk of the rightmost image) which are believed to allow for cell-type specific targeting of the PVC complexes. The inventors have successfully demonstrated that modification of a tail fibre region to incorporate non-natural amino acids (e.g. a substitution of an amino acid in the wild-type sequence for an alternative amino acid of the 20 standard amino acids) does not affect expression of tail fibres.
EXAMPLE 6
Demonstrating delivery of an active (exogenous) enzyme/payload into ex vivo murine organoids with a leader sequence-packaged PVC Needle Complex
Concept: Obtaining data for the delivery of an exogenous functional enzyme to a mammalian tissue. The inventors have demonstrated the delivery of a trans-packaged bacteriophage derived recombinase protein known as “Cre” into ex vivo mouse bile duct organoids. The organoids are derived from a mouse line in which the expression of a chromosomally encoded red fluorescent protein (RFP) reporter is normally prevented by a stop signal flanked by loxP recognition sites for the Cre-recombinase. If the recombinase is present, the stop signal is recombined out and the cells then go on to express the reporter protein. The general principle behind this experimental demonstration is summarised in Figure 16A.
Method: The Bile Duct organoid preparation: murine primary bile ducts were isolated and expanded as organoids in matrigel using “BD expansion media” for 12 passages following the protocol of Huch et al (Regen Med. 2013 Jul;8(4):385-7. PMID: 23826690; DOI: 10.2217/rme.13.39). Cells were then plated in 2D and cultured in BD expansion media. Mouse Genotype: LSL-Tom reporter in Rosa26 locus + Axin2CreRT (inducible upon 4OHT treatment). Cells were cultured in uncoated polystyrene plates at a seeding density of 10,000 cells/well. Nanosyringes were prepared as 30% volume syringe preparation in PBS + 70% culture media, in a total volume of 100 µl per well. The positive control for the recombination was 500nM 4OHT (in ethanol) at 1:1000 (v/v). The negative control was a 1:1000 (v/v) ethanol dilution only. Cells were seeded and grown for 48h, nanosyringes added and then cultured for another 24h before fixing (4% PFA fixation 15min RT) and staining for microscopic examination. Staining: Primary antibody Anti-RFP (1:1000) from Rockland. Secondary Anti-Rabbit 568 (used at 1:500 v/v). Samples were visualized on a laser-confocal microscope.
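The reporter logic set out under the Concept above (a stop signal flanked by loxP sites preventing expression of the reporter until Cre is present) can be modelled in a deliberately simplified way. This is a toy sketch, not a description of the actual construct: the element names, the `cre_excise` and `reporter_expressed` helpers and the list representation of the locus are all hypothetical, and the two loxP sites are assumed to be in the same orientation so that recombination excises (rather than inverts) the intervening segment.

```python
def cre_excise(locus):
    """Remove the segment between the first two loxP sites, leaving a single loxP."""
    sites = [i for i, element in enumerate(locus) if element == "loxP"]
    if len(sites) < 2:
        return list(locus)  # fewer than two sites: nothing to recombine
    first, second = sites[0], sites[1]
    # Keep everything up to and including the first loxP, then resume after the second.
    return list(locus[:first + 1]) + list(locus[second + 1:])


def reporter_expressed(locus):
    """The reporter is expressed only if no STOP lies between promoter and RFP."""
    between = locus[locus.index("promoter") + 1:locus.index("RFP")]
    return "STOP" not in between


# Hypothetical, simplified representation of the reporter locus.
reporter_locus = ["promoter", "loxP", "STOP", "loxP", "RFP"]

print(reporter_expressed(reporter_locus))              # before Cre delivery
print(reporter_expressed(cre_excise(reporter_locus)))  # after Cre delivery
```

In this model the change is permanent at the DNA level, which is why reporter signal in a cell is read as evidence that functional Cre reached its nucleus.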
Result: Figure 16B includes representative micrographs from these experiments demonstrating signal for the RFP protein could be detected in a number of cells when treated with the Cre loaded PVCpnf nanosyringe. As these are ex vivo organoids, rather than simple cell monolayers, some stochasticity in the number of cells that are dosed is expected, and this is even observed in the positive control, which is a small molecule inducer (rather than a large protein complex). It is anticipated that, as these are organoids, there will be some level of cellular differentiation present which may alter the binding characteristics of the nanosyringes. A further interesting observation from this preliminary run, is that while information on total amounts of nanosyringes applied to the system is not yet available, the inventors demonstrate that the TAM small molecule inducer does not appear to have appreciably greater tissue penetration than the nanosyringes, suggesting their ability to distribute is not majorly hampered by their size.
Additional interpretation: To summarise, the inventors have demonstrated the ability to deliver (e.g. dose) exogenous enzymes to a cellular target. Moreover, this “nanosyringe + Cre” experiment is a promising proof of concept for a biotechnology tool/aid, by demonstrating the ability to provide a DNA change leading to a transformed cell. This experiment therefore demonstrates the use of exogenous payloads (a protein of viruses rather than bacteria), and nucleic acid modifying enzymes in particular. It is evident that the Cre enzyme is delivered in a functional manner and is capable of traversing the cellular interior to the nucleus to effect its DNA modifying changes.
EXAMPLE 7
Trans-packaging of MAD7 site specific recombinase (exogenous payload) into the PVCpnf nanosyringe expressed in E. coli
Concept: As with the Cre data (of Example 6), and other examples of packaged payloads provided herein, the inventors have demonstrated packaging of the Cas-like enzyme MAD7 into a nanosyringe via a leader sequence. This is the largest exogenous example (MAD7 = 147.9 kDa) of a payload described herein.
Methods: Briefly, the chassis genes and the MAD7 gene (the latter being tagged with a C-terminal Myc tag for detection, and a leader sequence for nanosyringe incorporation described herein), were expressed (upon induction) simultaneously in E. coli. Upon harvesting and purification of the nanosyringe complex, payload packaging was probed via dot blot analysis (e.g. for detection of the Myc tag). The purification method described herein (using ultracentrifugation) can be employed to select for (e.g. exceedingly) high molecular weight protein complexes/biological matter, enabling recovery of the nanosyringes and any cargo (payload) they carry. ‘Loose’/unpackaged payload remains in solution and is not subject to sufficient centrifugal force and as such is lost during purification, unless contained within the much larger nanosyringe ‘shell’ (that is, when successfully packaged). Successful packaging of MAD7 is demonstrated by Figure 17.
EXAMPLE 8
Trans-packaging of apoptosis inducing payloads into PVC pnf, expressed in E. coli
Using the E. coli PVCpnf leader::payload::Myc trans-packaging system described in Figure 10C (PVCpnf leader = SEQ ID NO.: 78), the inventors demonstrated the ability to trans-package at least two pro-apoptotic human derived protein sequences or peptides (e.g. the sequences of SEQ ID NO.: 109 and SEQ ID NO.: 111). The Pnf effector protein leader sequence (e.g. SEQ ID NO.: 78) was fused to the N-terminus, and a Myc epitope tag was fused to the C-terminus. Western dot blot analysis (similar to that of Example 7) confirmed the presence of these human derived proteins in purified nanosyringes (Figure 18).
EXAMPLE 9
Demonstration of the induction of apoptosis in cultured ex vivo human cells by nanosyringe delivery of (trans-)packaged pro-apoptotic human polypeptides
A preliminary test has confirmed the ability to use the PVC pnf nanosyringe, produced in E. coli, to deliver trans-packaged human protein sequences (e.g. packaged according to Example 8) and induce apoptosis in ex vivo circulating PBMC cells from human donors. The assay is a TUNEL-stain microscopic analysis from cells exposed to the packaged nanosyringes for 20 minutes only. Results are shown in Figure 19A, demonstrating (via successful induction of apoptosis) delivery of tBid p15 fragment and BaxBH3 domain.
• tBid p15 fragment (SEQ ID NO: 109) is part of the normal human apoptosis regulation pathway. Cellular effects: a pro-apoptotic member of the Bcl-2 family. The C-terminal part of Bid (tBid) translocates to the mitochondria, where it induces the release of cytochrome c. Bid is normally cleaved by caspase 8 from its latent cytosolic full-length pro-Bid form.
• BaxBH3 (aa59-73) (SEQ ID NO: 111) is a minimal BH3 domain synthetic peptide, comprising the critical 15 residues of the defined Bax BH3 domain. Cellular effects: these 15 residues contain sufficient information to bind to, and functionally antagonize, Bcl-xL and to induce specifically Bax/Bak. Appears to abrogate Bak/Bcl-2 interactions - freeing up pro-apoptosis factors.
A more detailed test of the delivery of pro-apoptotic human peptides into ex vivo Peripheral Blood Mononuclear Cells (PBMCs) is now described. The aim of this study was to investigate whether the pro-apoptotic peptide loaded PVC nanosyringes could induce apoptosis in ex vivo human Peripheral Blood Mononuclear Cells. The nanosyringes were first assessed for any immediate cell toxicity using Trypan blue dye exclusion assays and then for apoptosis response by using the TUNEL assay.
Trypan Blue Exclusion Test for cell viability: Trypan blue is a Diazo dye commonly used to selectively colour dead tissue or cells; hence, dead cells are shown as a distinctive blue colour under a microscope while live cells or tissues with intact cell membranes remain uncoloured. Since live cells are excluded from staining, this staining method is also described as a Dye Exclusion Method. Trypan blue is commonly used for assessment of tissue or cell viability. A suitable number of cells (2 x 10⁵) were exposed to the nanosyringes and empty nanosyringe for 20 minutes. A suitable volume of cells (30 µL) was added to an equal volume of 0.4% Trypan blue and the number of viable (unstained) and dead (stained) cells counted using a hemocytometer. Each compound was tested at 3 concentrations. Blood cells from two independent human donors were tested for each compound at each concentration and each sample was tested in duplicate.
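The arithmetic behind a dye exclusion count can be sketched as follows. This is a minimal illustration under stated assumptions: the helper names and the example counts are hypothetical, the dilution factor of 2 reflects mixing cells with an equal volume of Trypan blue as described above, and the 1e4 conversion assumes a standard haemocytometer in which one large square corresponds to 1e-4 mL.

```python
def percent_viability(unstained, stained):
    """Viable (unstained) cells as a percentage of all counted cells."""
    total = unstained + stained
    if total == 0:
        raise ValueError("no cells counted")
    return 100.0 * unstained / total


def cells_per_ml(mean_count_per_square, dilution_factor=2.0):
    """Convert a mean count per large square to cells/mL (assumes 1e-4 mL per square)."""
    return mean_count_per_square * dilution_factor * 1e4


# Hypothetical counts for a single sample.
unstained_count, stained_count = 80, 20
print(percent_viability(unstained_count, stained_count))
print(cells_per_ml(50))
```

Duplicate samples would typically be summarised per donor and per concentration; how replicates were combined for Table 2 is not stated in the text, so no averaging is shown here.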
Treatment and preparation of cells for microscopy: The viability of Peripheral Blood Mononuclear Cells (PBMCs) from two independent healthy human donors was determined after 20-minute treatment with the two chimeric nanosyringes (e.g. loaded with the exogenous pro-apoptotic peptides) at 3 test concentrations in 2 independent tests. PBMCs were harvested by centrifugation and resuspended in media at 1 x 10⁶ cells/ml. Cells were fixed in 2.5% formalin and incubated for 20 mins at room temperature. Poly-L-lysine coated slides were prepared by spraying with 70% ethanol and leaving to air dry. Cells were centrifuged for 30 seconds. Supernatant was removed and cells were resuspended in 200 µl dH2O. 5 µl of cell suspension was added to each slide/fixation. Two fixations were performed per slide to allow staining to be performed in duplicate. Cell suspension was left to air dry.
Results of PBMC cell viability assay: The Trypan blue viability assays confirmed that the PVC preparations were not immediately toxic in themselves to PBMCs taken from healthy human donors (Table 2). Nanosyringe treatment showed > 60% viability indicating low toxicity at maximum dose concentration (Table 2). The inventors then moved on to test the ability of the chimeric nanosyringes to induce apoptosis.
Table 2. Viability of Peripheral Blood Mononuclear Cells (PBMCs) from two independent human blood donors after exposure to each compound for 20 minutes at 3 test concentrations (v/v dilutions). PBMC controls are untreated.
Testing for chimeric nanosyringe induced apoptosis using the TUNEL assay: The TUNEL assay was then used to identify apoptotic nuclei in single cell suspensions fixed on slides. In the assay Terminal deoxynucleotidyl Transferase (TdT) binds to the exposed 3’-OH ends of DNA fragments which are generated in response to apoptotic signal factors. This in turn catalyses the addition of biotin-labelled deoxynucleotides which can be detected using a streptavidin-horseradish peroxidase (HRP) conjugate. Diaminobenzidine (DAB) reacts with the HRP-labelled sample to generate an insoluble brown substrate at the site of DNA fragmentation. Methyl green counterstaining enables the visualisation of normal and apoptotic cells.
The induction of apoptosis following exposure of human PBMCs to the nanosyringes was determined. A TUNEL assay kit (Abcam) was used for detection of apoptotic cells. The assay was performed following the manufacturer’s instructions. Briefly, slides were covered with 100 µL proteinase K solution for 5 minutes, and then rinsed with 1x Tris-buffered saline (TBS). The treatment of nanosyringes or the DNase I positive kit control was performed for 20 minutes at room temperature. Slides were rinsed with TBS. Slides were then incubated with TdT equilibrium buffer for 30 minutes before the addition of TdT labelling reaction mix. Slides were incubated at 37 °C for 19 minutes. Slides were then washed with TBS before application of the stop buffer and incubation at room temperature for 5 minutes. Slides were washed again with TBS before addition of the blocking buffer for 10 minutes at room temperature. Detection was performed by application of the conjugate to the samples for 30 minutes. Slides were rinsed with TBS before application of the DAB solution for 15 minutes. Slides were rinsed with dH2O followed by counterstaining with methyl green. Slides were dehydrated in 100% ethanol followed by xylene and mounted with a glass cover slip. All staining was performed in duplicate. An apoptosis endpoint, indicative of positive staining in the apoptosis detection assay, is represented by dark brown (DAB) signal. Lighter shades of brown and/or shades of blue/green to green/brown indicate a non-reactive negative cell for apoptosis.
Analysis was performed by selecting 5 random sections of cells on the slide. Positive stained cells (dark brown) and negative stained cells (blue or light brown) were counted, and the percentage of cells showing apoptotic bodies was determined.
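The percentage calculation described above can be sketched as follows. The `percent_apoptotic` helper and the counts are hypothetical, and the sketch assumes that counts from the 5 sections are pooled before the percentage is taken; the text does not state whether pooling or per-section averaging was used.

```python
def percent_apoptotic(fields):
    """fields: list of (positive, negative) cell counts, one pair per section."""
    positive = sum(p for p, _ in fields)
    negative = sum(n for _, n in fields)
    total = positive + negative
    if total == 0:
        raise ValueError("no cells counted")
    # Pooled percentage of TUNEL-positive (dark brown) cells.
    return 100.0 * positive / total


# Hypothetical counts for 5 randomly selected sections of one slide.
sections = [(8, 2), (9, 1), (7, 3), (10, 0), (6, 4)]
print(percent_apoptotic(sections))
```

Pooling weights each counted cell equally, so sparsely populated sections contribute less to the result than they would under per-section averaging.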
To generate a positive control, slides were treated with 1 µg/µl DNase I (the kit positive control) for 20 minutes at room temperature following the proteinase K treatment step detailed above. The DNase I treatment fragments DNA in normal cells to generate free 3’-OH groups identical to those generated during apoptosis. A negative control was generated by substituting DNase I with dH2O in the reaction mix during the treatment stage.
Results of PBMC apoptosis assays: TUNEL staining using PBMCs was performed following treatment with the intact tBID and Bax loaded nanosyringes, with appropriate positive and negative kit controls. Treatment was performed for 20 mins to determine if the nanosyringes elicited an apoptotic signal. A positive control (DNase I treatment) and negative control (no DNase I treatment) were included. Results showed that both nanosyringes, containing either tBID or Bax, gave strong apoptotic signals (89% and 78% positive, respectively) on the PBMCs. The positive control showed a strong apoptotic signal (79%), whereas the negative control showed no apoptotic signal (100% negative). Also observed was a significant loss of the numbers of attached cells in the nanosyringe treated samples, presumably indicative of a rapid and comprehensive apoptosis response, and a failure to be retained after washing. Note this effect is even more pronounced than the kit positive control, suggesting a more rapid response. Representative micrographs are shown in Figure 19B.
Conclusion: It is concluded that the tBID and Bax loaded nanosyringes are able to rapidly induce extensive apoptosis in human Peripheral Blood Mononuclear Cells. Furthermore, Trypan Blue dye exclusion assays have confirmed that these chimeric nanosyringes do not cause rapid lethal lysis or extensive membrane damage to the cells.
EXAMPLE 10
Exemplification of practical utility of leader sequences and PVC Needle Complexes - Intracellular delivery of atypical (non-Photorhabdus) payload
(1) An anti-MDM (p53 inhibitor) antibody is linked to a leader sequence described herein, and expressed together with a PVC Needle Complex for packaging therein. Isolated PVC Needle Complex (comprising the antibody payload) is contacted with a tumour for intracellular delivery of the antibody (said tumour cells being characterised by having high MDM-suppression of p53 activity for MDM inhibition). The tumour is suppressed by the activity of the anti-MDM antibody.
(2) A PVC Needle Complex is used to (intracellularly) deliver anti-tumour peptide vaccine to activate the MHC-I dependent cytotoxic T-cell lymphocyte (CTL) response. A tyrosinase-related protein 2 (TRP2) peptide vaccine is delivered, such that enhanced cross-presentation to CTLs occurs, with antitumor effects against TRP2-expressing tumours. The tumour is suppressed by the activity of the peptide vaccine.
(3) A PVC Needle Complex is used to (intracellularly) deliver a nuclear factor-kB inhibitor (such inhibitors being used for the control of inflammatory disorders, such as rheumatoid arthritis) to a cell. The cell subsequently demonstrates a reduced expression of pro-inflammatory cytokines.
(4) A PVC Needle Complex is used to (intracellularly) deliver a T3SS payload (which inhibits NF-kB and MAPK pathways). This is completed with an isolated (purified) PVC Needle Complex, without any need for the PVC Needle Complex to remain associated with the bacterial cell from which it derives.
(5) A PVC Needle Complex is used to (intracellularly) deliver, to a cell, anti-apoptotic peptides including BH4, the Bcl-xL-protein, and/or a peptide inhibitor of c-Jun N-terminal kinase (which can protect the heart and brain against ischemic injuries (a restriction in blood supply to tissues, causing a shortage of oxygen and glucose needed for cellular metabolism)). For example, Jun-kinase inhibition via a 20 amino-acid binding motif of the JUN kinase is sufficient. A release of e.g. cytochrome c in the cell is inhibited.
(6) A PVC Needle Complex is used to (intracellularly) deliver nicotinamide adenine dinucleotide quinone internal oxidoreductase (Ndi1), the single-subunit yeast analog of complex I (which provides significant cardioprotective effects) to complex I-deficient mutant cells. The Ndi1 protein is correctly targeted to the matrix side of the inner mitochondrial membranes, and restores the NADH oxidase activity to the complex I-deficient cells.
(7) A PVC Needle Complex is used to deliver one of two of the essential subunits of the PHOX complex (which are used in enzyme replacement therapy to restore production of ROS in chronic granulomatous disease) to a chronic granulomatous disease cell. A restoration in production of ROS is observed.
(8) A PVC Needle Complex is used to (intracellularly) deliver (e.g. intramuscularly) a myotubularin (which is used for improving local and distant muscle performance in X-linked myotubular myopathy patients). Myotubularin-dephosphorylation of phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bisphosphate is observed.
(9) A PVC Needle Complex is used to (intracellularly) deliver a recombinase “Cre” (which is capable of excising defined genetic cassettes) into a mouse cell line, in which the genome has loxP recombination sites flanking a stop signal upstream of an mCherry gene. The Cre payload excises the recombination sites, and removes the stop signal, allowing for expression of the mCherry gene in the cell.
(10) A PVC Needle Complex is used to (intracellularly) deliver a ~15kDa nanobody (antibody fragment) with affinity for an intracellular component. A nanobody-intracellular complex is detected.
(11) A PVC Needle Complex is used to intracellularly deliver (e.g. into insect cells) an atypical (non-Photorhabdus) polypeptide toxin for insect crop pests and animal parasites. Suppression of the pests is observed.
(12) A PVC Needle Complex is used to (intracellularly) deliver a nuclease (e.g. Cas9 and/or Mad7) into a target cell comprising a guide RNA. The nuclease performs site-directed gene inactivation.
All publications mentioned in the above specification are herein incorporated by reference. Various modifications and variations of the described methods and system of the present invention will be apparent to those skilled in the art without departing from the scope and spirit of the present invention. Although the present invention has been described in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention which are obvious to those skilled in biochemistry and biotechnology or related fields are intended to be within the scope of the following claims.
SEQUENCES
Where an initial Met amino acid residue or a corresponding initial codon is indicated in any of the following SEQ ID NOs, said residue/codon may be optional.
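As an illustration of the note above, the following minimal sketch enumerates the two forms of a listed polypeptide sequence, with and without the initial Met. The function name `met_variants` is hypothetical and not part of this specification; whitespace is removed first because the sequences below are printed with line breaks and spaces.

```python
from __future__ import annotations


def met_variants(printed_seq: str) -> list[str]:
    """Return the sequence as listed and, if it starts with Met, without that Met.

    `met_variants` is an illustrative helper name, not a term used in the
    specification.
    """
    # Remove the spaces and line breaks introduced by the printed layout.
    residues = "".join(printed_seq.split()).upper()
    variants = [residues]
    # The initial Met residue is stated to be optional, so also offer the
    # form that lacks it.
    if len(residues) > 1 and residues.startswith("M"):
        variants.append(residues[1:])
    return variants


if __name__ == "__main__":
    # Short made-up fragment used only to show the behaviour.
    print(met_variants("MKT KLA"))
```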
SEQ ID NO: 1 (PAK 1985)
MMREYSNEDDFIKEKTNLVKSENVEADNYLETEYLTYLAKLIGMTERENHHLNSIKLIDD IIELHNDRKGNKLL
WNDNWQDKIIDRDLQSIFKKIDEMVSEFGGLEAYKDIVGENPYDPTEPVCGYSAQNI FKLMTEGEYAVDPVK
MAKTGKINGNQFAEKLEHLNSSNNYVALINDHRLGHMFLVDIPSTNRERVGYIYQSD LGDGALPALKIADWL
KSRGKESINVNKLKKFLNDEFTMLPDNEQKGLIAEIFDLNKDIDSVKSGKIKKDKAV DIYLREYDINDFISNIEKL
KTKLA
SEQ ID NO: 2 (PAK 1987)
MFQNRIRNEKTTQSGKGKTLDRMTDSLYLEIPNVEAVTLAYQKLTSKYRKFDNKTKLILD SSDEFSQLKSEK
QRKGFSKSGLKNNGVSDRKFIYTKNALKNFAAHAGYEHNGHYEDEFVNFKDNNKNLA KGKLFPGISLIERR
KLSIVKNKEGKWEHKETDEAEAYKVTDIEKFISGVRSMYLQGNTFLHAKTEALIRKH IANNENILPTMAGIAGL
HAEVQALNNLFISGDKGTKKREKWKYIRNMLESSIFTQRLTTGQAGKDFAACHNCSG ILSSPVNVITGKVES
AGDNFLSTLSRYKTSQESPI
SEQ ID NO: 3 (PAK 1988)
MEREYSEKQKNPSKLSRKTAISERIAALERSGLSNSNQPVPQFARPYTSNRPVVNINPGR SSIAVATANSTS
PVNIPTPAPASPDKLLPSTSCDTTSSILIVGKYNLELTSQGKIVVFRGDNRTPEQIV AAGGFYPWSKQDVGKI
KKELIDEFIEIGPSAHMMGHVRSPNKNYVSTGMNMDSGGFGEQSNYLYKMEIPGLKP QDMNERTLGEKIRQ
DKRGINYPHFLMSHLTLAESEFVAMIPARSEELTFITPIPLSYITSYRKRGTNTWLP MPLKK
SEQ ID NO: 4 (PAK 2075)
MSNYEYDIVTQHDTYQIKDNEYTVVNGKYWQYEQEGNKNNNKVSISLMKENQNDPVWITS DIKEISLYIIENL
FSYHKFSAELQHTLKNAVKAVFNEYSEIKYSELLHNINNIFNLFFIKIYNTSDIDTA INILTAKIEIYDKLEKINQDK
TDSNNTNVDIWEELGINAEEPLLKIYRQAFSTGDIDDEVYSDALLTFMSDGNLELGD KEKSDYNQRIKDKTDL
FESYKKGIEKVASLITTNNINPGIPITYPETEKSINIGDDLLLAQLAKEEIALKKQN RTEYSQQDIFELQTLQAAK
YHLLILSSLGALLYQIAPNVEKMTKGHGDYRDIIFSQEQAESLFKKHNIQYDTNHVL SQESKHIEMEGCIILTAA
IIYRMRKENATVEQALNYSTLETIKLFENDKKKLNPFNTNNVKPAGYFSFIDFKKRD KFDSQYNFNEQFNVYK
NKYSHYESISFSKLILSSPAAQLTAEEIVNPPEEAFLYSVEQGMGNVAMIKMYQGNW LVISTIQGGVKAKKYS
RQQVDSNPTLRAMSKPNALFLIERKMETGMGILMPNMMVNTGKRLFPTGYERAKTLS GFAETSRYKNSYN
AFWNDYYGITSGMNVGISFTGSPKFNFYKEENLLSVTATIIQQGLNDIAIKSKQALD ITSGWHIAATILIPFYNVI
YKSTTDSEYELTGEDIGSIVFDTANVLLVVATLGMSLTESMAAKVTQTTLRLRQAGL TGRALITAVVRTLPEH
GIITLRQSSGIILGGLIDLIEPLPIRSTLTLTYRGVINAVGAMRNSIKLEKSFADIF GKSTRGLGKLKNEWKVSNL
PLEEIVPHSNGGEIYKGIYSIRPTNPETAVKQNFYIKEAGANYQVKWDDANHTWRVV NPTYPEQFSYWPAV
KLDKNGHWVTHADVSNKFLILEQSKRIDQELEAAHSNINNDNILDAFIHINTAFKDC ERYDIDKLSDITDTLTHF
FEKSLKPGDKKAIFSTEIMSIQQAWIREVILPLQNNSSISIEKINAIKTELPYLLRK TFPIESQLPNQLVANKIALAI
EEIPNTRIPKYTSGNISKTVQYTSLLENNHVDIPPVGITITGNDTFINQVTRVLSEI DEIPSGNIVIQELEKQGLNI
QPPTMNDIVREKNGQFYANNSAGSHIAFDPENHLIGTEEKLIDEPWRTREPAIALYH EMLHIYYNRYPTWFT
SIDNKVIDQKVSGGFSLLEESRIVGTKYYVNDKNTLFDFNDSDYLLENNSALLTENR FRAEYAIFKNKSEYVIR
PYSGKGDSQIPLTKTKININESHRNVMGVGSGKPEKMPNESATDYRNRVREWRKANK QPEADIGTGDMRK
TKAEARVKLLKENYPQFEPQKIELGGAFQLWTVPNEPANKLMLSSHGYFFSDSAATQ VPAGKTIQFLGPHG
KTLLEAPENPLYSPFDVTLGNSGFTVQPYATIESGNKAGLGSVKIGDKTFTVNDIQN IATDDVENYLLATGVE
ANASNHGKVRNYGIKYYEKMPDEEVKAAIWKNRADETSTHKYDALLVSPEAGNRKKL SDIFALMKTDERMS
KYDEITFVACREELNRINMKSIHDTGLGGGYEPKLEPTVILSRRRREATFTADGAII YSIIAVNLHHNFITEEIVG
IAPFLFINN
SEQ ID NO: 5 (PAK 2077)
MEHEYNEKEKQRNSAIKLNDAIRNNEENMDMTSPLELNFQNTNRKSRGLRERFSATLQRN LPGHSMLDRE
LTTDGQKNQESRFSPGMIMDRLMHFGVRTRLGKVRNSASKYGGQVTFKFAQTKGTFL DQIMKHKDTSGGV
CESISAHWISAHAKGDSIFNQLYVGGKKGKFHIDTLFSIKQLQMDGYLDDEQSTMTE YWLGTQGMQPNIQR
NDDTDEHSSKVVGETGNRGTKDLLHAILDTGDKGSGYKKISFLGKMAGHTVAAYVDD QKGVTFFDPNFGE
FSFPDKTSFSHWFTDDFWPKSWYSLEIGLGQEFEVFNYAPEAP
SEQ ID NO: 6 (PAK 2892)
MPNKKYSENTHQGKKPLMKSEANNEHDIQNSSLGIGLDLNSMMGNSSTSLSHIQDYSFWK ENISEYYKWM
VVVKAHLKQLDWTLKSMDSPESAGTNIAKNTGTTALQTLLNTGGSIAGAAIGGAIGS AIAPGVGTIAGMGIGA
LAGTGLNYLNDTVIEKLNEKLEIAYPYPKTRNMIFDINNYDKNPIIKAIKKKTNKDN LKVTAGSSLTSQLVGKVT
SPIKFPAYKLADLAIALAGLSSDKARHILDFTDSIREVLNESHSDAVAFMRKNYGDN AMGLAGLSSRIK
SEQ ID NO: 7 (PAK 2893)
MEREYSEKEKHKKRPIQLRNSIEQHEEETANNSLGLGLDLNQATNPPKVPKDNYNEENGD LFYGLANQRG
RYIKSVNPNFDPDKINSSPMIIDVYNNNVSNTILNKYPLDKLVKLSGNPQKYANNIK VENSLQQDVASSKRGW
YPLWNDYFKTGNENKKFNIADIYKETRNQYGSDYYHTWHTPTGAAPKLLWKRGSKLG IEMAASNEKTKIHF VLDGLNIQEVVNKQKGSTPLEQGRGESITASELRYAYRNRERLAGKIHFYENDQETVAPW EKSPELWQNYI
PKNKNQNESSTPQRNNGTLYRLGGPFRKLRASLRKRS
SEQ ID NO: 8 (PAK 2894)
MMEHEYSKEEEKKRQQSKPNNATHDESNLPLELEKHFNARTPATAHSKWFTYENDTEVEL TTERIKEIFSN
KQPKIIIAGDGHNKPPFQYAKNIPDVNSSFDAGTLQLYIEATDEQINENNPEYIPKE FMAKPGLFTNKNRRAEI
VGWEDSELSNAMKEMFELSDKSTREKLTPEETSSFYKLHETAIRHFFRPEFNQLRDE FFEILAKAGSNRELD
KIALEMIGFTSGTWRDEYINPTLAEKIAKHAAEKENHTFVVSIGDAHLSENPMQEYL NKRRNGGEFKHQIIFT
RDKRPILPDNMKTGNKNS
SEQ ID NO: 9 (PAK 3525)
MLKYANPQAVPTQRTKNTAKKPSSSSSFDGQLELSNGEWSKHSEMGLKRGGLINSIRRRI ARNGNIGRFNE
LIDSEAKKWPSEPVDKNIHMIWIGTRNISEKNIKLSIDTAKKNPDYNTSIIYDSGIS GHEGARNFMLEKFEGSN
VNXSLAFPKGIGVMREYAPEAGKATAFPNTPIAVTKNNPIINKTLDLAVGNYQRGEK NVLKLAGPDVFTQALY
QEIPGLNSKVLNAQLDQFELAKRQALGLPLEKPKSFADEKLTSVEKEKINRPYQSMR GLSGHVMNGADHS
WAVDTEVLGH
SEQ ID NO: 10 (PAT 00148)
MMREYSNEDDCTKEKTNLVKSENVEADNYLEMEHLTYLAKLISMTERENHHLNSIKLIDD IIELHNDRKGNKL
LWNDNWQDKIIDRDLQSIFKKIDEMVSEFGGLEAYKDIVGESPYDPTEPVCGYSAQN IFKLMTEGEYAVDPV
KMAKTGKINGNQFAEKLEHLNSSNNYVALINDHRLGHMFLVDIPSTNRERVGYIYQS DLGDGALPALKIADW
LKSRGKESINVNKLKKFLNDEFTMLPENEQKGLIAEIFDLNKDIDSVKSGKIKKDKA VDIYLREYDINDFISNVE
KLKTKLA
SEQ ID NO: 11 (PAT 00149)
MIFKMLNLAVFYLLGNIFHYLICQKFICYFCSVLKSVTMFLTKVAVQIALYLNILPTMAG IAGLHAEVQALNNI.FI
SGDRGTEKRENWKYIRNMLESTIFTQRLTAGQAGKDFAACHNCSGILSSPVNVITGK VESAGGNFFINIISI
SEQ ID NO: 12 (PAT 00150)
MEREYSEKPKNLSQLSRKTAISERRAMFERNASSNNEQPVPQFARSYTSNRSVVNINPGR SSIAVVTANST
SPVNISTPAAASPDKLLPSTSCDTTSSTLTVGKYKLELTSQGKVVVFRGDNRTPEQI VAAGGFGEQSNYLYK
MEIPGLKPQDMNERTLGEKIRQDSRGN
SEQ ID NO: 13 (PAT 00152)
MKYDPRLRTWVEDDFDYEKNFKKQTDYINYKDLEKQLKENVDYYALLDENEAIIFLKELG CDIKSFLNDTAFP
VTDVLSNFAGNIKDALGVFKVAKNFKPINIGIFTYIINELKGKGIKAIEYLGKNGER YIKLTDRPGIRKYLNATRY
LINNKKIMEVGIGSVAMEGSIVKGARFGVIYSAAYRSVELMFKSEYDLTNFFVNLSM DMAKIIVATIIAKSTVAA
ATSFVVTAALSTTAIAIGVFIIGALVVWGLMWLDDEFKISETIIRRLKEHKVKTPIS TYHSDQIFNAWGRYYRG
SEQ ID NO: 14 (PAT 02308)
MPNKKHSENTHQGRKPLIKSEANNEHDIENSSLGIGLDLNSTIGNNSASLSQIQDYSFWK ENISEYYKWMVV
VKAHLKQLDWTLKSMDSSESAGTNIAKNIGTTALQTLLNTGGSIAGGAIGGAIGSAI APGVGTIAGMGIGALA
GTGLNYLNDTVIEKLNEKLEIAYPYPKTRNMIFDINNYDKNPIIKAIKKKTNKDNLK VTAGSSLTSQLVGKVTSP
IKFPAYKLSDLAISHNRALAGLSSDKARHILDFTDSIREVLNESHSDAVAFMRKNYG DNAMGLSGLSSRIKGE
KLTLATLARTRNKIENRINSINKQTLKLSSKNSNE
SEQ ID NO: 15 (PAT 02309)
MEREYSEKEKHKKRPIQLRNSIEQHEEETANNSLGLGLDLNQATNPPKVPKDNYNEENGD LFYGLATQRGR
YIKSVNPNFDPDKINSSPMIIDVYNNNVSNTILNKYPLDKLVKLSGNPQKYANNIKV ENNLQQDVASSKRGWY
PLWNDYFKIGNENKKFNIADIYKETRNQYGSDYYHTWHTPTGAAPKLLWKRGSKLGI EMAASNEKTKIHFVL
DGLNIQEVVNKQKGSTPLEQGRGESITASELRYAYRNRERLAGKIHFYENDQETVAP WEKSPELWQNYIPK
NKNQNESSTPQRNNGALYRLGGPFRKLRASLRKRS
SEQ ID NO: 16 (PAT 02310)
MMEHEYSKEEEKKRQQSKPNNATHDESNLPLELEKHSNARTSATAYSKWFTYENDMEVEL TTERVREIFS
NKQPKIIIAGDGHNKPPFQYTKNIPDVNSSFDAGTLQLYIEATDEQINENNPEYIPK EFMAKPGLFTNKNRRA
EIVGWEDSELSNAMKEMFELSDKSTREKLTPEETSSFYKLHETAIRHFFRPEFNQLR DEFFEILAKAGSNRE
LDKIALEMIGFTSGTWRDEYINPTLAEKIAKHAAEKENHTFVVSIGDAHLSENPMQE YLNKRRNGGEFKHQII
FTRDKRPILPDNMKTGKKNS
SEQ ID NO: 17 (PAT 02956)
MSNYEYDIVTQHDTYQIKDNEYTVVNGKYWQYEQEGNKNNNKISISLMKDNQNDPVWITS DIKEISLYIIENL
FSYHKFSAELQHTLKNAVKAVFNEYSEIKYSELLHNINNIFNLFFIKTYNTSDINTA INILTAKIEIYDKLEKINQD
KTDLNNTKVDIWEELGINAEEPLLKIYRQAFSTGDIDDEVYSDALLTFMSDGNLKLG DKEKSDYNQRIKDKTD
LFESYKKGIEKVASLITTNNINPGIPITYPETEKSINIGDDLLLAQLAKEEIALKKQ NRTEYSQQDIFELQTLQAA
KYHLLILSSLGALLYQIAPNVEKMTKGHGDYRDIIFSQEQAESLFKKHNIQYDTNHV LSQESKHIEMEGCIILTA
AIIYRMRKENATVEQALNYSTLETIKLFENDKKKLNPFNTNNVKPAGYFSFIDFKKR DKFDSQYNFNEQFNVY
KNKYSHYESISFSKLILSSPAAQLTAEEIVNPPEETFLYSVEQGMGNVAMIKMYQGN WLVVSTIQGGVKARK
YSQQQVDSQPTLRAMSRPNALFLIERKIMIGIGIFMENQIVNTGKRLFPTGYERAKT LSGFAETSRYKNSYNA
FWNDYYGITSGMNVGISFTGSPKFNFYKEENLLSVTATIIQQGLNDIAIKSKQALDI TSGWHIAATILIPFYNVIY
KSTTDSEYELTGEDIGSIVFDTANVLLVVATLGMSLTESMAAKVTQTTLRLRQAGLT GRALITAVVRTLPEHGI
ITLRQSSGIILGGLIDLIEPLPIRSTLTLTYRGVISAVGAMRNSIKLEKSFADIFGK STRGLGKLKHEWKVSNLPL EEIVPHSNGGEIYKGIYSIRHTNPETAVKQNFYIKEAGANYQVKWDDANHTWRVVNPTYP EQFSYWPAVKL
DKNGHWVTHADISNKFLILEKSKRIDQELEAAHSNINNDNILDAFIHINTAFKDCER YDIDKLSDITDTLTHFFE
KSLKPGDKKAIFSTEIMSIQQAWIREVILPLQNNSSISIEKINAIKTELPYLLRKTF PIESQLPNQLVANKIALAIEE
IPNTRIPKYTSGNISKTVQYTSLLENNHVDIPPVGITITGNDTFINQVTRVLSEIDE IPSGNIVIQELEKQGLNIQP
PTMNDIVREKNGQFYANNSAGSHIAFDPENHLIGTEEKLIDEPWRTREPAIALYHEM LHIYYNRYPTWFTSID
NKVIDQKVSGGFSLLEESRIVGTKYYVNDKDTLFDFNDSDYLLENNSALLTENRFRA EYAIFKNKSEYVIRPY
SGKGDSQIPLTKTKININESHRNVMGVGSGKPEKMPNESATDYRNRVREWRKANKQP EADIGTGDMRKTK
AEARVKLLKENYPQFEPQKIELGGAFQLWTVPNEPANKLMLSSHGYFFSDSAATQVP AGKTIQFLGPHGKT
LLEAPENPLNSPFDVTLGNSGFTVQPYATIESGNKAGLGSVKIGDKTFTVNDIQNIA TDDVENYLLATGVEAN
ASNHGKVRNYGIKYYEKMPDEEVKAAIWKNRADETSTHKYDALLVSPEAGNRKKLSD IFALMKTDERMSKY
DEITFVACREELNRINMKSIHDTGLGGGYEPKLEPTVILSRRRREATFTADGAIIYS IIAVNLHHNFITEEIVGIA
PFLFIDN
SEQ ID NO: 18 (PAT 02957)
MEHEYNEKEKQRNSAIKLNDAIRNNEENMDMTSPLELNSQNTNRKSRGLRERFSATLQRN LPGHSMLDRE
LTTDGQKNQESRFSPGMIMDRLMHFGVRTRLGKVRNSASKYGGQVTFKFAQTKGTFL DQIMKHKDTSGGV
CESISAHWISAHAKGDSIFNQLYVGGQKGKFHIDTLFSIKQLQMDGYLDDEQSTMTE YWLGTQGMQPNIQR
NDDTDEHSSKVVGETGTKGTKDLLHAILDTGDKGSGYKKISFLGKMAGHTVAAYVDD QKGVTFFDPNFGEF
SFPDKTSFSHWFTDDFWPKSWYSLEIGLGQEFEVFNYAPKEP
SEQ ID NO: 19 (PAT 03171)
MFKYDTSEKMAKFGKGKTSDGMLLDTLYLEIPDEKAVMSAYKSQILDELRNFSEKTHSFF SGKKPLYSKKYL
ANLAAHAGYVHVTDYNSIGNYKDGFVNFKDNSRNLAEGKLFPGIRLIKRPKLSIVRD KETERWKKQESDEAD
AYEITDIESFISGVRDMYSRANVDLHPVIESLIRNHIVNNDHVLPTMAGIAGLHAEV QALNNLLILADGRAGKIV
GGRKIEEYMQDMLKSFIFTQRLTTKQAGNDFAACHNCSGILSVPANVITGKVASAGS NFSLILSRYKNSQES
PI
SEQ ID NO: 20 (PAT 03172)
MLKHANPQTVSTQRTKSTAKKPSSSSSFDRQFELSNSENQPGEGNKDWTIKGWRQRFADR SLNKGHISPL
MNKGLLVGSEEALINVPVVAHRYDSSHQLTDAGPLKADSHSNNLDPFYGVVTGFRGD QVTSSESGSGSIG
GHWGKNTLDSNITGINVVNGASGTVGIRIALKDIQHGAPVIVTSGALSGCTMVYAVK NGYFFAYHTGQKPGD
KEWKTGRQGVVATYRSHQALSPDSEPMAVGEQNNDLVNIFASYDQGIITYMGKPGVI IDNTAENVGVFNYD
EVKLEKPDIRAGYSYALLAKDDKGKVNVKVLSEDVIVPLGNKGKTIKAINSLKKRLL
SEQ ID NO: 21 (PAT 03177)
MPRYANYQINPKQNTKNSHGKSSSSNFSSGYFSSSNNSLDDSLIRQQVKREFIWEGHMKE IEEASRLGNFA
VSFRAAGGPTLRALGKGAAAKGHDILEKTIKPGSINKAYPKDEASNVIKKVQEAGIE GYVGHWDKKTGRLLGI
YMSSGHGLSDEQVNGKIYPIDLNNLEASLSALKTKENWAALPFTGDYDMHDMISFTG QPHSVPSNSSEERK
IIDRINRLVARSDPNRPFGDIEHNVIRHGAQVSYPAFAMDKEKEEIKKHGGIVKAVA EPGEFPVAIVSKGKWTI
ANNIDELNQFYNSIGAKMKVSWKPGAENPGFVSNPQRPGMARFSRKR
SEQ ID NO: 22 (PAU 02009)
MMREYSKEDDCVKEKTNLAESENVEADNYLEMDCLNYLAKLNGMPERKDHSLNSTKLIDD IIKLHNDRKGN
KLLWNDNWQDKIIDRDLESIFKKIDEMVSEFGGIEIYKDIVGENPYDPTEPVCGYSA QNIFKLMTEGEHAVDP
VKMAQTGKINGNEFAEKLEQLNSSNNYVALINDHRLGHMFLVDIPSTNREKVGYIYQ SDLGDGALPALKIAD
WLKSRGKESINVNKLKKFLSNEFTMLSESEQKELIAEIFDINKDIANVKLGKIKKDK AVDVYLREYDLNDFISNI
EKLKTKLV
SEQ ID NO: 23 (PAU 02010)
MPIIGHKEDLIRTERSSVDLTRSSNNRQTDNLELNIPQHKRDNKDIEHAVIYGFSQHRGP EMQKAFADNKNP
VTIDEYNAGLGIMGELSLSDYFRISQDLKENRLPELNEKNIQNHSLKYFDAMGVNMK SADPNVKEEAKEQQ
RAYTRSWGFYMMENKEKLDIQSKINNLIPKKKSFFSKSPGEDEYKKLDEFILKNSNG SNLTIPKQRKILMKFA
SAKNAVDVTKNLSGEEQTWLKDIIATAFFRQTSKLGMSWFIEQLASPDFRFVIVGFN GEELTTDQIRSNKPW
KHGNRRKEGASEYAEPITFSEIRHAHRKGYDSKINFIKK
SEQ ID NO: 24 (PAU 02095)
MISTFDPAICAGTPTVTVLDNRNLTVREIVFHRAKAGGDTDTLITRHQYDLRGNLTQSLD PRLYDLMQKDNT
VQPNFYWQHDLLGRVLHTVSIDAGGTVTLSDIEDRPALNVNAMGVVKTWQYEANSLP GRLLSVSEQSANE
AVPRVIEHFIWAGNSQAEKDLNLAGQYMRHYDTAGLDQLNSLSLTGAHLSQSLQLLK DDQMPDWAGDNES
VWQNKLKNEVHTTQSTTDATGAPLTQTDAKENMQRLAYNVTGQLKSSWLTLNGQLEQ IIVKSLAYSESGQK
IREEHGNGVVTKYSYEPDTQRLINITTQRSKGHVFSEKLLQDLLYEYDPVGNIVSIL NRAEATHFWRNQKVSP
RNTYTYDSLYQLIQSTGREMADIGQQNNKMPTPLVPLSSDDKVYTTYTRTYSYDRGN NLTKIQHRAPASHNI
YTTEITVSNRSNRAVLSHNGLTPREVDAQFDASGHQISLPTGQNLSWNQRGELQQAT TINRDNSATDREW
YRYNAGSARILKVSEQQTGNSTQQQQVTYLPGLELRTTKSGTNTTEDLQVITMVETE RTQVRILHWSAGKP
NDIANNQVRYSYDNLIESNVMELDTKGKIISQEEYYPYGGTAIWTARNQIEASYKTV RYSGKERDKTGLYYY
RHRYYQPWLGRWLSADPAGTVDGLNLYRMVKNNPIRYQDESGTNANDKAQAIFKEGK KIAINQLKIASNFL
KDSKNSENALEIYRIFFGGHQDIEQLPQWKKRIDSVIYGLDKLKTTKHVHYQQDKSG SSSTVADLNVDEYKK
WSEGNKSIYVNVYADALKRVYEDPLLGREHVAHIAIHELSHGVLRTQDHKYIGVLSS PGSHDLTDLLSILMPP
ANEQDRTEKQRRATGARKALENADSFTLSARYLYYTAQDPNFLSSLRKAHRDFNNKK TDRLIIRPPERR
SEQ ID NO: 25 (PAU 02096)
MEREYNKKEKQKKSAIKLDDAVGNNEENMDMTSPLELNSQYTNRKRPGLRERFSATLQRN LPGHSMLDRE
LTTDGQKNQESRFSPGMIMDRIMHLGVRTRLGKVRNSASKYGGQVTFKFAQTKGTFL DQIMKHKDTSGGV
CESISAHWISAHAKGESIFDQLYVGGQKGKFHIDTLFSIKQLQMDGYLDDEQSTMTE YWLGTQGIQPNRQK
NDNMNEHSSKIVGETGTRGTKDLLRAILDTGDKGSGYKKISFLGKMAGHTVAAYVDD QKGVTFFDPNFGEF
NFPDKVSFSHWFTDDFWPKSWYSLEIGLGQEFEVFNYEPKEP
SEQ ID NO: 26 (PAU 02097)
MVYEYAKTNDRKRKLSTQSDNYEEKSFSPVLDLSRNNQNTPNMEDEYETPQNFINRTGRE KLFRAIRMVAS
NKRDPITKDQVSVPPDGNLFTELKDKHLDRAAEYKKLKTWPTHASIIATSPSANTPI AQHVSGDDALSPYIST
GDKPGAVQNTVRNWNGIGPASERRLRPEKTWSPIIEIDVNKLPDTTKIFDLNKPNNT FFSTTNSDIAQNAFAD
KEVLISPEIPGLAITRVINDPEEIKQIANLNPSQSLIEKKNTIPEEKIIFEEKKSVP IHDSDADIPSSSFVFPKRKKP
RNIRSRTDS
SEQ ID NO: 27 (PAU 02098)
MVFEHDKTVERKRKPSIQLGNDKEKSSEQALELPQSKQNNPLLHDLITSNNLRKEAAVFA KQIGPSYQGILD
GLEHLHNLSGNEQLTAGFELHRRITRYLEEHPDSKRNAALRRTQTQLGDLMFTGTLQ EVRHPLLEMAETRP
AMASQIYQIARDEAKGNTPGLTDLMVRWVKEDPYLAAKSGYQGKIPNDLPFEPKFHV ELGDQFGEFKTWLD
TAQNQGLLTHTRLDEQNKQVHLGYSYNELLDMTGGVESVKMAVYFLKEAAKQAEPGS AKSQEAILLNRFA
NPAYLTQLEQGRLAQMEAIYHSSHNTDVAAWDQQFSPDALTQFNHQLDNSVDLNSQL SFLLKDRQGLLIGE
SHGSDLNGLRFVEEQMDALKAHGVTVIGLEHLRSDLAQPLIDKFLTSENEPMPAELA AMLKTKHLSVNLFEQ
ARSKQMKIIALDNNSTTRPAEGEHSLMYRAGAANNVAVERLQQLPAEEKFVAIYGNA HLQSHEGIDHFLPGI
THRLGLPALKVDENNRFTAQADNINQRKCYDDVVEVSRIQLTS
SEQ ID NO: 28 (PAU 02230)
MKGIEGVIMLSHDILPEKLLVSEKKHENVGSYFSDDIGEQSEQTEVSHFNLSLDDAFDIY ADISIENQQELKNK
DNNTNIWSSLGRGDDDHNLKKIINDAFKEKLPQLMEYRRKGYNVIGLDKEGIKKLEG MLKAVPPEIQQPTMK
NLYSAAQELLNTLKQHPLLPENQDMIQQSNLVIRNLSDALEAINAVSKVNQVEWWEE VHKTNKAQSDRLIAA
TLEELFFKVKDKRLPGSNDDYCQQEREETERKIKDLLLYDGYQLTAEHFKFGRLRKS LLAESRVTRLKLAEY
LEKKSVGILTAARDAKMYAMKILLAQTRNNGFNAKDLINAGQVNDRLLSFQQYARHI RAVDGEIDGIILSNPLV
VACIKETNDEPAHIKIARAILPVSEELGTVSKVLRETKEKVQPSKPKEELNHPHQDW WNRGDELWKYIKKTS
WNIKETSVHVTQMVGYEASKTASRAKHKLKESSYSESINGAVKGTALLLLDEIQQAE NRIRQIPQFAWDVQE
AVEQHSSVIQRTAYPDELPELSELLNEQLKHEEARWQAVKKQSRDKLQELIAPITRL AQEKWAQDLYFQLG
EELRKERQDRWKDIQQFDEIMAEAVGQFAEMARELDSEAVRLAEHGHSGGKELQEKV AKWLRDLSKLKGK
VKAGVAKITGTSLDNFSRSGMLARGMSEWAEDLKQSYLQETLQEGSAVAAELFERTL MEVVEENRTHFAK
ESDPEAERFLKRLALALKHAAENTTVYPPTPEEILAGSRSLPEDIRHWAEKKVVSGA ISAAFRGGFKLVTGTF
SLPVRVVIRGAKTGGTLYRGVRAINRSVRLGQGPATQVKSKFINQELSKTAFRLTLS LSPLVAWGMAASITA
GRLYNEKDYPEKIIKNIVIDLPEELLWIGGYAGINAAIRAHAEKAIQQAIQHALDEQ ADKLALRINKEIAGKSADV
NVEIIPQETSVSPAETAQSTPEPLSDFASTSQLTMPELIDIQDNNSAQQPKVRRKRD VSVESEISIDNLNIINA
NTREDKVNSEIKSELRSELKRFENSDANSPMSDVERAIFIDLFLYKNKYEVSESQQD YKNTWLKFRRELESQ
ENKEIKEYLRFRSIIEAYEIYDKKRLDDDTIPEAGTIIKEVIDFFQKLKKENPITFM KLAEAMVKFQYYYEEEDEN
EDRYFKMAEIYYFLNKTENEKKSKTFHLDIIDKYPNENNRLLDEFFLNKNNNNPDLD EIIYKLQSMQEKYRES
YEMLSKVENIHQVLSDDSKNEENIFLDNRIIAAQVFDGSINISLQDKKKWLNRYDQI RNEEGSDGWKLMHIES
ILINLRRINTAINLTAMKSESALLLIDKLLNFQKKARENILHISETPHEDFTSYSQF KTRKELGNDDSKYYAQFD
NYKDNHDAEKEAKEILSQVVARASLSFSELFDKVESIKLFSFVYKNRDGGAPLAAPG RTVVIKFPGKDTGGL
VISNLFLRNHVKRISTKEMEDLKPLTEGMYTRATQHRSLGSYYHIGSQSEHTNALEI LSGMNKEELKTHLKK
QGIWFGEPALFSNEYPKQENTGHLENTTLKNAIIGVSTIQNNAAANYLRSTMYESTG WEKLGDRFIPFYEIGR
RKHYDREYEINSEQLTLDIITSIAIAYPAARGIVATIRSSAIPSILKSGLRGSALFK SLSLELGKMGFNASKVFGG
AVYELIEPYPINSHLNRHNVFNKVKDTAWEFHTDVGLKGGGLKDFIDRFTKEPKEIT ISGYKFKRIKYNQENF
DTMQRMALDYAYNPDSKGKIAQAQQAYKTGKEDYNAPQYDNFNGLSLDKKIERYISP DTDATTKGVLAGK
MNESIKDINAFQTAKDAQSWKKSANKANKVVLTPQNLYLKGKPSECLPESVLMGWAL QSSQDAKLSKMLM
GIYSSNDITSNPLYKSLKELHANGNASKFNASATSISNINVSNLATSETKLFPTEIS SVRVDAPKHTMLISKIKN
RENKIKYVFYDPNYGMAYFDKHSDMAAFFQKKMQQYDFPDDSVSFHPLDYSNVSDIK ISGRNLNEIIDGEIP
LLYKQEGVQLEGITPRDGIYRVPPKNTLGVQETKHYIIVNNDIYQVEWDQTNNTWRV FDPSNTNRSRPTVPV
KQDTNGEWFKHSETGLKGGGPIDDIRKYIARKSAIKIFNQSINYSATKWPPEPIDKN IHMIWIGTKNISEKNIKL
SIDTAKKNPDYNTSIIYDSGISGHEGAKKFMLEKFQDSNVNIIDFRKKSYFSQLKQE PSFAYYEQVIAENKYAQ
ASDILRLLVLKYEGGIYKDIDDIQVKGFGSLTFPKGIGVMREYAPEAGKATAFPNTP IAVTKNNPIINKTLDLAV
SNYQRGEKNVLKLAGPDVFTQALYQEIPGLDSKVLNAQLYQLELAKRQALGVPLEKP KNFADEQLTSAEKE
KINRPYQSIRGLSGYVENGADHSWAVDTNIPSTSTQTSTIVTPLAPKTEMLPPVPSS STKSSTSAPVLQEKIS
YNLATDIDATDYLNQLKQKTNINNKISSPAGQCESLMKPVSDFMRENGFTDIRYRGM FIWNNATEQIPMNHF
VVVGKKVGKDYVFDVSAHQFENKGMPDLNGPLILAAEDWAKKYRGATTRKLIYYSDF KNASTATNTYNALP
RELVLESMEGKTFITSPNWYQTFKRTHNIHPEVTVSDPATFSLNYSVNPTAENLSPP PPPPIPSHGQVPKTV
TPPPPPMRSPLSLSQPLERLPANKTKPIGFNPGENKASFSKLEEAGKHYYKDDKSRQ AAPVNTMSDFDNRY
LSHTTEAPAPSNVAHLAPGNIYNTKVTAKGAEKPAYDIYISKDGESLITSSSYKVDD ITTDSKFGKPLPYSEIM FNSLKKSGVDPKNLKRSVQASIENKVTQDVISAIGTRIQRGQVIRVSPTENPDAFYTLLG TDNCKATLHMLNQ
HAEEFGHKVVTSIEFKGTGYLVMNIGTSTQTSTIVTPPPMPGTSQLVQ
SEQ ID NO: 29 (PAU 02805)
MPNKKYSENTHQGKKPLIKSEANNEHAIDNSPLGIGLDLNSILGNNSASLSQIHDYSFWK ENISEYYKWMVV
VKAHLKQLDWTLKSMDSPESAGANIAKNIGTTTLQTLLNTGGSIAGGAIGGAIGSAI APGVGTIAGMGIGALA
GTGLNYLNDTAIEKLNEKLEIAYPYPKTRNMIFDINNYDKNPLIKAIKKKTKKDNLK VMAGSSLTSQLLGRITPI
KIPAYKLADLAVSHHRALAGLSSDKARHILDFTNSIREVLNESHSDAVAFMRKNYGD NAMGLSGLSSKIKGD
KLTLDTLARTRNKIENRINSINKQTLKLSSKNSNE
SEQ ID NO: 30 (PAU 02806)
MEREYSEKEKHKKHPIQLRDAIEQHAEETANNSLGLGLDLHQAINTPKVPKDNYNEENGD LFYGLAAQRGR
YIKSVNPNFDPDKTNSSPMVIDVYNNHVSNTILNKYPLDKLGKLYGNPQKYAKDIKV TNSLQQDVAASKRGW
YPLWNDYFKAGNENKKFNIADIYKETRNQYGSDYYHTWHEPTGAAPKLLWKRGSKLG IAMAASNEKTKIHF
VLDGLNIQEVVNKQKGSTPLEQGRGESITASELRYAYRNRERLAGKIHFYENDQETI APWEKSPELWQNYIP
KNKSQNESSTPQRNNGALYRLGGPFRKLRASLRKRS
SEQ ID NO: 31 (PAU 02807)
MVHEYSINDRQKRHSFSSANPIDPEVTNRENSRHRFPKDNYNKGHGDLFYGLAPERGKYI KEANPKFDPNN
PENAAMIIDVYNDEISRVILNNNANKISTNRLLNFIYNFRKNRLENLMKNPEKYAKD IKVKDNLRENISPKKIEK
YPLWNDYFEAGIRNKKFNIAEIFKETASQYNSDYYHAWHIGGNSAPRLLWKRGSKLG IEIAASNQRTKIHFIL
DGLKIEDVVNKTKGPAPLKAGPGESITASELRYAYRNRARLAGRIHFYENGKETIAP WDKDPELWQKYTPKN
RSGMEL
SEQ ID NO: 32 (PAU 03332)
MLKYANPQTVATQRTKNTAKKPPSSTSFDGHLELSNGENQPYEGHKIRKIKGLRQHLADR SLNKGHISPLM
NKGLLVGSKDVSIDIPVIAHRYDSSHQLTDAEPLKADSHSNHLDPFYGVIAGFRGDQ VTSSESGSGSIGVHW
GKNTLDSNIMGVNVVNGASGTVGIRIALKDIQHGSPVIVTSGALSGCTMVYSVKNGY FFAYHTGQKPGNNE
WKTGRQGVVATYLSHQALSPDSEPMTVGEQNNDLVNIFANYDQSVITYMGKPGVLID KMAENVGVFNYDEI
KPEKPAIRAGYSYALLAKDDKGKVNVKVLSEDVIVSSGKQGNTVKAINSLKKRLL
SEQ ID NO: 33 (PAU 03337)
MPRYANYQINPKQNIKNSHGKSSSSDFSSGYLSFSNNSLDDPFIRQQVKREFIWEGHMKE IEEASRLGNFA
VSFRAAGGPTLRALGKGAAAKGHDILEKTIKPGSINKAYPKDEASDVIKKVQEAGIE GYVGHWDKKTGRLLGI
YMSSGHGLSDEQVNGKIYPIDLNNLEASLSALKAKENWAALPFTGDYDMHDMISFTG QPHSVPSNSSEERK
IIDRINRLVARSDSNRPFGDIEHNVIRHGAQVSYPAFAMDKEKEEIKKHGGIVKAVA EPGEFPVAIVSKGKWri
ANNIDELNQFYNSIGAKMKVSWKPGAENPGFVSNPQRPGMARFSRKR
SEQ ID NO: 34 (Plu1651)
MPNKKYSENTHQGKNPLMKSGANNEHDLQDSPLGIGLDLNSMLVNSSTSLSQIQDYSFWK ENISEYYKWM
VVVESHLKQLDWTLKSMDSPESAGTNVAKNMGVTALQSLLNTGSSIAGGAIGGAIGS AIAPGVGTIAGAGIG
ALAGTGLNYLNDTAMSKLSKKLEIAHPYPKTRNMILDINNYDKNPIIKAIKKNVNKD NLKVTAGSSLTSKLVGT
VTSPIKFPAYKFAELAVSHHRALEGLSDDKARHILDFTNSIREVLKESHSDAVAFMR KNYGDNAMGLSGFSS
KIKREKLTLNTLAKTKNEIENRINSINKQTLKVSSRSRNE
SEQ ID NO: 35 (Plu1671)
MLSTEKHNKDTKHPRNREKKFSIQPENSTQDDEDIKNNSLGVGLDLDQMIRNTSSTLTNA PQKPEDGYYYHI
SRGNNLQSFLQNGFKPQGSPGPTLSEEDFSRRKIGIIKLIYSIIATTINKNRKAKKI SKDNFLMPQEFWHEFKN
FYQNIPTQTNIDDQLLKKSITESIDKLDQNKFMEKHSDRKQTIINNEREAILQQDER INEIISSRAKMIQQREAE
NTEGYIYLAPHKNTLLEYMKHLQEEKNLFLILAVKEDIFTEKGLEQDPQEPHGAVRY KGALSTEELNFVNQE
GQICAIPASIGEMDYGDFILNQQQVIDFCKK
SEQ ID NO: 36 (Plu1672)
MPINDLKKKFEISPQAAQAIGAPARSNSSKQAEHQTEHLELDTSKNRRDRKDLNAQATPN QQHTKKLETEV
NNGGNKSKAQAHTPDLVMKKESSVTPNTRKSPNEKIKAEDIFHRYKDRFSPSDRELP FEIMNEITNNGIAFS
SEKAPESHLDKVKDKKFTLRHYTSGNGQEKPTFNEIGSNFNLVNEGIKTLKRTQGSN TNEDDWNRLGNTAF
TFFLLAIDGEVSDRKFLSNTTHFAEIDIENPAELKELGLDETEFFASPDLLHEKNLS QAPAVKGKLSDLKSLLL
KQSGIKPVQLQSLGAKGILERIDSKFNGSLEIKIPGNVKVKEWKKVEK
SEQ ID NO: 37 (Plu1690)
MPNSKYSEKVNHSANGAEKCSIHSNQYNINNCTLGLGLDLNKKLRTGNERNIEGAQPFIP FPSKQKQYSTSP
IAMADILNESALTSQPIITDLINPQKIKMSDGVKNILNNKEGGGDLVFKALQIKPSD ETLPFNALKIVDTYQEEM
PNKDMSISAYWAPQGGYVDIPAQPDISRHPQYVFTPNFSGCSFVVDKMNEDTLRVRH VQGGQEDVEYNN
QNIDHGMGMITAMEFRDYGYHEADDKVIENTYGFAFLKFNQEKKQWQLHYQKIAAAP NIINIKTKSSWLPFS
KPSIEADTFTFKNMKVPGYSRKNINNN
SEQ ID NO: 38 (Plu1691)
MPKLTELLSRFENPIQNQPNHISKKNPISNSKVLNNSEEKTAPLELKHDDSKIKSQVSIP NLVKKNEKPAASNT
PNNSHEKVKAEDIFNRFKSKFDPYDRELPFDIMNKITNNEIKFSSEKSKDDYLAKVK DKKFTLRHYTAGTGQE
KPTFDEISSNFNLVNKGIKTLNRTQGSNTNEDDWNRLGNTAFTFYLLAIDGEVSNRK FLSNTTHFAEINIEDS
EELKELGLDQAEFFASPDLLHEKNLSQAPAVKGKLSDLKSLLLKRSGISSVQLGRLD AKAILKSIDNEFGNSL
EIKIPGNVKVNKWNKI
SEQ ID NO: 39 (Plu1712)
MPRYSNSQRTPTQSTKNTRRTSPSSNSSTEHLSLSNAPTNDSSVRQEVKEKFIWEGHWEG HMEAIEKASIL
GNFAVSFRAAGKPTLEALGKGAAAKGHDILEKTIKPGSIEKAYPENEASDVIKKVRE AGIEGYVGHWNKETG
RLEGIYMSSGHGLPNGQVNGKIYPIDLNNLEASLAPLKEKKNWAALPFTGDYDMHDM ISFTTQPHSVPSNS
SEEKKIIDRINEYIAKSDSNRPFEDIEHNVIRHGPQVSYPAFAMDKEKKEIKERGGI VKAVAEPGEFPVAIVSK
GKWTIANNINELEQFYNSIGAKMKASWKPGAGNPGFVSNPQKPGMARFSRKK
SEQ ID NO: 40 (Plu1713)
MFSTYSSKNDNQTINKINTEEKHENTETDNHLEINLEHTGKSKPDIEPKDVTTGTINAGT LLYKTTAIPEFLDN
AKSLGLAEYEKRHKDIQDYLNLGKAEDAEKLKNKSQWAGQYFALEKSYDEYANEAPD SYNNLLKNAGKDLL
ENTEEVKVFLYTFKVTKDIKVLKPHNNSNSYYVGDTEGWEKAKEIMNDVQSQSEKND NPFPELKNLEDKNF
LLEELGEKGYAWMGPLHAKEGAEKGTEFSYELAISPNLLRQHLTLESEELLGTYKNR YGYWDKK
SEQ ID NO: 41 (Plu1714)
MKKTDEKYGQYEYKDEDITSYPIAWTNPDNGKIYIGINSPEYSHLNNKGESELNLAKIIS TIIHESLHASSHQH
KGLQSQTDTGADNLNYDEYVTDYFAREVYKQILPDKDYVANCFTKGLGGENKIWGGN IVEFMIQ
SEQ ID NO: 42 (Plu2400)
MVYEYDKTIERRRNPSIQLNNNEKSSEQALELSQNNPLLHDLITSNNLRKEAAVFAKRIG PSYQEILDELEHL
HHLSGNEQLAAGFELHRRITHYLEEHPDSKRNTALRRTQTQFGDLMFTGTLQKIRHS LLEMAETRPEMASHI
YQIAREEVKGNTPGLTDLMVRWVKEDPYLAAKTGYQGKIPNDLPFEPKFHVELGAQF DDFKKWLDTAQSK
ELLTHTRLDEQNKQVHLGYSYNELLDMTGVESVQMAVYFLKEAAKQAEPGSTKSQED ILLHRFANPTYLAQ
LEHSRLAQIEAIYHSSHDTDVTAWDQQFASDALTQFNHQLNNTVDLNSQLSLLLKDR QGLLIGESHGSDLNG
LRFVEEQMEVLKAHGVTVIGLEHLRSDLAQPLIDKFLASGNEPMPAELAALLKTKHL SANLFEQARSKQMKII
ALDNNSTTRPTVEGTQHGLMYRAGAANNVAVERLRQLPAGEKFVAIYGNAHLQSHEG IDHFLPGITHRLGL
PALKVDENNRFTAQVDNINQRKRYDDVVELPRIQLTS
SEQ ID NO: 43 (Plu2401)
MEHEYSEKEKPQKCPIQLRDSIEHDKEDINTTTPLELNSQYTNRKRAGLRERFSTTLQRN LPGHSMLDRELT
TDGMKNQESRFSPAMIMDRMMHFGVRTRLGKVRNSASKHGGQVTFKFAQTKGTFLDQ IMKHKDTSGGVC
ESISAHWISAHAKGESIFDQLYVGGQKGKFHIDSLVSIKQLQMDSYLDDEQSTMTEY WLGTQGIQPIMQKND
VDEHSSKVVGQTGNKGTTDLLRAILDTGDKGSGYKKISFLGKMAGHTVAAYVDDQKG VIFFDPNFGEFSFP
SITSFSRWFTDDFWPKSWYNLEIGLGQQFEVFNYELKKS
SEQ ID NO: 44 (Plu2514)
MYDSKKKNSEPTTKKKFERSNYSQWDDSINHYEDMNRARIKNRNDILTTVDYFGEKKKTM HTFEYQSDIKH
DTNFNNKNKSLFESFAASFVLQNPSFFSGVIDKLSKKLFNIISKIDERNNFQKKLYD FIEKDTSPEGQFGRFTL
GKNEILNVLQVKSDTPQLFVKKMLLIKSLGAFIIDFSSKDIGNYDFIFDGKGREVND IIEKNRPTNLFKVRGRTN
IKSSQHRSDIGILDTPTFDSLTEEQKSFLTIPELTKRRPLFRTFTHELDAEDKRVVE SVFVNRTFDCDSPLIGS
VSGSTSCVLVAADILFPDMTMVERKKLAIATFAFLVGGGYHSATEVFDVAYPGLDLN KEIEELIENNPIQENA
GVATLRQLIGNSGF
SEQ ID NO: 45 (Plu2515)
MPISNLAKESEVRAVKDIPCKNIETDNHLEIGLSSGLSRSKDTSKFKKNSINTIKLIDDI IALHNDPKGNKLLWN
DNWQDKIINRDLANIFEKIDESVSELGGLEMYQEMVGVNPYDPTEPVCGLSAQNIFK LMTEGEHAVDPVEM
AQTGKIDGNEFAESVDQLSSAKNYVALVNDRRLGHMFLIDIPSNDQETVGYIYQSDL GQGALPPLKIADWLN
SRGKDAVSLNKLKKLLSREFNLLSDDEKRALISETLDIHKDVSNVELDRIKRDRGVD IYLTEYDVNNFYENIET
LKSKLSNYDKKLSKPK
SEQ ID NO: 46 (Plu1649)
MLANVLPNLASFLKYEKETPLFFIEDGFNFQNLNPGRVPLIKTPEQRKAGDTQSPAFLCS GVILRGTIHSNDY
KFWQPSPSSIKSGGVSFSYLRKDAKFKRLAYGYKNGFIIFPEHIAPEDRVDFSVLCA FPIDGYTNERANQGC
GENITKAKDKGKSCQEQNVTNSDDWIKNYRKVNSQDFFQCGFNVTKDVNNPAIAFYQ MLESIKKLPRTPNT
PPKQNEIRISTWEESDPNKLPIEALFYSENSGLADAQKDQRDYKNATGKFLPIVKML LPRTLNEDALFKFNIK
DQVINP
Leader Sequences (e.g. with SEQ ID NO: 47 - 92 corresponding to amino acids 1-50 of
SEQ ID NO: 1 - SEQ ID NO: 46, respectively)
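The correspondence stated above (leader SEQ ID NO: 47 - 92 being amino acids 1-50 of SEQ ID NO: 1 - 46, respectively) can be sketched as follows. This is a minimal illustration only: the names `leader_sequence`, `leader_seq_id` and `LEADER_LENGTH` are hypothetical and not part of this specification, and the input shown is just the first printed line of SEQ ID NO: 1 as it appears in this listing.

```python
LEADER_LENGTH = 50        # amino acids 1-50, as stated in the heading above
NUM_PARENT_SEQUENCES = 46  # SEQ ID NO: 1 - 46


def leader_sequence(printed_seq: str, length: int = LEADER_LENGTH) -> str:
    """Return the first `length` residues of a sequence as printed in the listing."""
    # The listing wraps sequences with spaces and line breaks; remove them first.
    residues = "".join(printed_seq.split())
    if len(residues) < length:
        raise ValueError(f"sequence has only {len(residues)} residues")
    return residues[:length]


def leader_seq_id(parent_seq_id: int) -> int:
    """Map a parent SEQ ID NO (1-46) to its leader SEQ ID NO (47-92)."""
    if not 1 <= parent_seq_id <= NUM_PARENT_SEQUENCES:
        raise ValueError("parent SEQ ID NO must be between 1 and 46")
    return parent_seq_id + NUM_PARENT_SEQUENCES


# First printed line of SEQ ID NO: 1, copied from the listing (spaces included).
seq1_first_line = (
    "MMREYSNEDDFIKEKTNLVKSENVEADNYLETEYLTYLAKLIGMTERENHHLNSIKLIDD IIELHNDRKGNKLL"
)
leader = leader_sequence(seq1_first_line)

if __name__ == "__main__":
    print(f"SEQ ID NO: {leader_seq_id(1)} -> {leader}")
```

Stripping whitespace before slicing matters here because the printed listing inserts spaces inside the sequences, so a naive slice of the printed text could return fewer than 50 residues.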
SEQ ID NO: 93 (Photorhabdus asymbiotica strain ATCC43949 PVCPnf operon, pvc1 - pvc16; e.g. corresponding to genes PAU 03353 to PAU 03338 of the sequence of GenBank accession no. FM 162591.1)
ATGTCTACAAGTACATCTCAAATTGCGGTTGAATATCCTATTCCTGTCTATCGCTTTATT GTTTCTGTCGGAGA T G AG AAAATT C C ATTT AAT AGTGTTTCAGGATTAGAT ATT AGTTATGACACCATT G AAT ACCGAGATGGTGTTG GTAATTGGTTCAAAATGCCGGGTCAGAGTCAGAGCACTAATATCACCTTGCGTAAAGGCG TTTTCCCGGGGA AAACAGAACTGTTTGATTGGATTAACTCTATTCAGCTTAATCAGGTAGAGAAAAAGGATA TTACCATCAGTTTA ACTAATGATGCAGGTACCGAATTATTAATGACCTGGAATGTTTCTAATGCTTTTCCCACT TCATTGACTTCACC TTCATTTGATGCCACCAGTAATGATATTGCAGTACAGGAAATTACGCTGATGGCAGATCG GGTGATTATGCAG GCT GTTT GAAGCATT GAT ATTT AATCATCTC AT AT AAGGGAACTTTT ATG ACAACCGTT ACCAGTT ATCCTGGC GTTTATATTGAAGAATTAAATAGCCTGGCCTTGTCAGTTTCAAATAGCGCCACAGCGGTT CCTGTTTTTGCTGT GGACGAACAAAACCAATATATTAGTGAAGATAATGCAATCCGTATTAATTCGTGGATGGA TTATCTTAATCTGA TTGGCAATTTT AAT AAT GAAGACAAATT AGAT GTTTCT GTGCGTGCTT ATTTTGCCAATGGAGGTGGAT ATT GT TATCTCGTCAAAACAACGAGTTTAGAAAAAATTATTCCAACCTTGGATGATGTAACCTTA TTGGTTGCTGCGG GCGAAGATATTAAAACGACAGTAGATGTTTTATGTCAGCCAGGAAAAGGGTTATTCGCAG TCTTTGATGGCCC
TGAAACAGAGTTGACTATCAACGGTGCGGAAGAGGCAAAACAAGCCTATACCGCCAC ACCATTCGCTGCGGT
TTATTATCCTTGGTT G AAAG CGGATTGGGCTAACATAGATATTCCACCCAGTGCAGTGATGGCGGGAGTTTAT
GCATCGGTGGATTTATCCCGTGGTGTATGGAAAGCGCCTGCCAATGTTGCGTTGAAA GGGGGCCTGGAACC
TAAATTTTTAGTCACGGATGAATTGCAGGGTGAATATAACACTGGCCGCGCTATCAA TATGATTCGTAATTTCA
GT AACACAGGTACT ACGGTTTGGGGTGCAAGAACCCTGGAAGAT AAAGACAATTGGCGTT AT GTTCC AGTGC
GACGCTTGTTTAATTCTGTGGAGCGGGATATCAAGCGTGCCATGAGCTTTGCTATGT TCGAGCCTAATAATCA
GCCTACTTGGGAGCGGGTACGGGCGGCGATTAGCAACTACCTTTATAGCCTGTGGCA ACAGGGGGGATTAG
CTGGCAGCAAAGAAGAAGACGCTT ATTTT GTGCAAATTGGTAAAGGT AT AACGAT G AC AC AG GAG C AG ATT G
ATGCAGGGCAAATGATTGTTAAAGTCGGTTTGGCTGCTGTACGGCCTGCGGAATTTA TCATTCTCCAGTTTAC
GCAAGATGTAGAACAGCGTTAATCATATGATTATGAGGAGTTATCATGTCTGCTATT CTGAAAGCGCCTGGCG
TTTATATTGAAGAAGACGCTTCCCTAGCGTTGTCTGTCAGTAACAGCGCGACTGCCG TGCCTGTTTTTATCGG
AAAATTTACTCCGACAGTGGTTGATTCAATCCAAGTCTGTACCCGTATCAGCAACTG GCTTGAATTCACTTCC
TCTTTTTCCCTAGCTCCAACAGTTGAGATTGTTGTCCAATCTAACACTGAATCTGAA TCTGAATCTGAAACTTA
CCACTATATTGAGACAATCAATTTATCTCCAGCTGTGGAAGCATTGCGACTCTATTT TCAAAATGGCGGAGGA
GCTTGCTATATCTACCCATTAAATGATGCTGAAGATGAATTGGTTCTGGCGGCCATA CCAGAAGTCATTGAAC
AGAAAGGTGATATTACTCTGTTGGTTTGCCCGGAACTCGATCTGGATTACAAAACTA AGATCTATGGCGCAGT
GAGCTCACTGTTGAATGATAACAAAGTGGGCTATTTCCTGATTGCGGATAGCAATGA TGGAGAATCTGTGTCA
GGAGTATGGAATAGTGCTAAGGCCGCCGCCTATTATCCCCAGTTGGAAACTAACCTA AAATTTTCCACGTTGC
CTGGGGATAAGGACATTCGTATCAGCGGTTATCAGGATGATGATGAAACACATAAAC CGAAAAACTTGGATG
AGCTCAGGACAATCAACGAGGCGTTGGCACAGGATATTGATGCAAGATTGCTCGAGG AGAAACAACGTGCT
GTCATCATTCCGCCAAGTGCTGCCATTGCGGGCATTTATTGCCAAACGGATAATCGT CGCGGTGTTTGGAAA
GCGCCAGCCAACGTTGCGCTCACAGGGATCGGGAGTTTGCTTGATAAGGTAGACGAT GAACGGCAGGGAGA
GATGAATGACAAGGGAATCAATGTCATCCGTTCATTTACCGACCGTGGTTTTATGGT CTGGGGAGCCCGTAC
TTGTGTGGACGCTGCCAACATCAGCTGGCGTTATATTCCTGTTCGTCGCCTGTTCAA TTCCGTTGAACGAGAT
ATCCGCCAGGCGCTGCGCGCTGTGTTGTTTGAAACTAATAGTCAGCCTACCTGGGTA CGTGCTAAGGCTGC
CGTTGATCAATATCTTTATACCCTTTGGCAGAAAAATGCATTGATGGGTGCTCGCCC GGAAGAAGCTT ATTTT
GTGCAAATTGGTCAGGATATCACCATGTCCGAGGCTGATATTAAACAGGGTAAGATG ATCATGACTGTTGGTT
TGGCAGCAGTGCGGCCAGCTGAGTTCATCATTCTGCAATTTACGCAGGATGTTGTTC AGTAATCTCCATGACT
AAACGCCAGGCACTGTATTGACAGTGCCTACTCTAACCATCTTGGAGGAGGTGATGA TGATGGAGAGACTCC
AACCGGGTGTGACTTTAACAGAAAGTATAATCACGATGGGTCAGCAAGAGATACCCA GTGCTGTGCCGGTGT
TTATTGGTTACACCGTTCGTTATCCGGAACAATCGGAAGCATCAGTCCGTATCGACA GTTTGGCCGAGTATAC
CAGCCTGTTTGGTGACGACCATGTGATGATGTTTGCTGTCAGGCACTATTTTGATAA TGGCGGGCAACAGGC
ATTT GTTTT ACCCCT GAAGGACAAT ATGCCATCAGTGG AGAT GACCACAGCT GAAGCGGAAAATCT GAT AGC
CGCATTGCGCTCTGCTACGGTTAGCGAAGCCATTGGTGGGCATAGTCAGATTACACT GATTTTGGTACCGGA
TATGGCTCGGCTTAATGACAGTGATATTGATGACTCCTCAACCCAGGTAAGCCTGTG GTCCCAAGGCTGGGA
GGCGCTGCTGCAATTGAGTCAGGTTAGGCCCAACCTCTTTGTGCTGTTAGATGCGCC GGATAATGTTGAACA
GGCGCAGAAGTGTATGACAACGCTATCGTCAGATTATCGTCAATGGGGGGCAGCATA TTGGCCTCGTCTGG
AAACTACCTATCAGAAAGAAATATCTGGCAAGGACAATGAATCTCAGGGAATTTTCC AGGGGACTGTTCTGTC
ACCCACAGCCGCGGTCGCAGCGGTAATTCAACGCACGGATAACGACGCGGGTGTTTG GAAAGCACCGGCC
AATATTGCCTTATCCCAGGTTATTCGACCTGTTAAATCTTATCTTCAGGGAAGTGTA CTGTTTAACAGCAGCG
GCACTTCGCTCAATGTGATCCGCAGTTTCCCAGGTAAGGGCATACGGGTATGGGGAT GCCGCACTCTGGAA
AACACGGATAATACGCAGTGGCGCTATCTGCAAACACGTCGGCTGGTTTCCTATGTA ACAGCGCATTTGACC
CAATTGGCTCGCAT GT AT GTCTTT GAGCCAAAT AAT GAACTT ACCTGGAT GAAGTT AAAAGG ACAAAGTT ACA
ACTGGTTACGGCAATTATGGTTGCAGGGTGGCTTGTATGGTTCACAGGAGGATGAGG CATTTAACATTCTGT
T AGGCGT AAACGAGACGAT GACT GAGGAT GAT GTTCGTGCAGGAAAAAT GATCAT GAAAGTT GAGTTGGCT G
TGTTGTTTCCTGCCGAATTTATTGAGATCAGTTTGGTGTTTAATACCCAAACAGAGG CGCTGTCTTAAGAAGG
AAAAAGT ACGATGAACGATTATTACACACCCGTGGTATCCCATCGTTTTATGGCGAGTTTTATTTTT AACCGCA
TTCCCGATCCGCTGGATATTCGTTTTCAGCGTATCTCTGGCCTTAGTCGGGAACTAC AGGTGACTCAGTACA
GT GAGGG AGGAGAAAATGCCCGT AAT AACT ATTT AGCT GAGAAAATCCAACACGGTACGTT GACTTTGGAAC
GGGGCGTGATGACAGTCTCGCCATTGACCTGGATGTTTGATCGGGTATTGAGTGGTG AAAAAATCGCTTATG
CCGATGTGGTGGTGATGCTACTGAATGAAAATTCACTGCCATTGTCCAGTTGGACGT TGAGCAATGCGCTGC
CGGT ACGCTGGCAAACCAGCGACTTT GACGCTAAC AGCAATGCCAT ATTGGT GAAT ACCCTT GAATTGCGTT
ACCAGGATATGCGCTGGCTTGGAGTCAAAATATGACAGTAGAAATCAGAGAGTTACT TATCCAGGCAAAGGT
AGTGCCATCAACACGACCGACT GAAT C AG AAC G G C AAAAC CATTCTTTGATACAG G AAAG TCTGGATGAGGC
GACTTGGGTGGAAACGAT AAAACGCGAAGT GTTGGCCGCATT ACGCG AT GAGGAAGGGTGGCGTCCAT GAG
TCT GATT GAACGTGGTTT AGCT AAGCT GACAATT AATGCTT AT AAGGAT AGGGAAGGGAAGAT ACGGGCAGG
AACGTTGCAGGCCATGTATAACCCTGACTCCTTGCAACTGGATTACCAAACGGATTA TCAGCAATCCCAAGC
GATT AAT AG C G AAAAG C AAAG TAGCATTTATGTACAGGCCAAGCCCGCAGGGTTATCACTT G AATT AATTTTT
GATGCCACGATGCCGGGTAACAAAACCCCCATTGAAGAGCAGCTCATGCAGCTCAAG CAACTGTGCAGTGT
GGATGCAACCAGTAACGAGACGCGATTCCTGCAAGTTAAATGGGGCAAAATGCGTTG GGAAAGTCGGGGTT
ACTTTGCTGGCAGGGCCAAGAGTTTGTCT GT GAATT ACACTTT GTTT GATCGT G ATGCG ACTCCCTT GAGGGT
ACGGGTAATATTGGCATTAGTGGCTGATGAAAGTCTGGTGTTGCAGGAGACTGAACA AAATCTGCAATCTCC
GGCAAAAATCGCATTACGCATACAGGATGGGGTATCTCTGGCTCTGATGGCAGCCAG TACGGCATCAACATT
GTCAGGCGGTGTGGATTATCTGACGCTGGCCTGGCAAAACGGTCTGGATAATCTCAA TGGGTTCGTTCCGG
GTGAAATATTGCAGGCCACCAGGGGAGACGAATCATGAGCCACCAACTGAAAATTAT TGCAGATGGTAAGGC ACTGTCACTTTTGGCCGCGGTAGATGTGGACACCTGTTATCGGGTTAACAGTATACCTTC TGCGACATTGAAA
CTGAGCGTACCGGATAGGCCACTCTCTTCTTTCAGTCAGACGGATGTTCAGACAGAA CTGGCCCACTGTCAG
GTAGGGAAAACCCTGCGTCTGGAATTGATTGATGGTAGCAAAAAATGGGTGCTGTTT AATGGTCTTATTACCC
GTAAGGCTCTGAGAATTAAGAATAAGCAATTATTGCTCACTCTGGTTGTCAAGCATC GGTTGCAACTGATGGT
GGAT ACCCAGC ATTCACAGCT GTTT AAAG ACAAAAGCGAAAAAGCGATCTT AAGCACGCT ATT GAATCAGACC
GGAATCAATGCTCGCTTCGGAAAGATAGCGGCGTTAGATCAAAAGCATGAACAGATG GTGCAATTTCGTTGT
TCAGACTGGCATTTTCTGTTGTGCCGACTGTCGGCAACCGGTGCATGGTTGTTACCT GCCATAGAAGACGTT
CAGTTT GTTCAACCT GATGCTCT GAAATCAAACTCAGCCT AT ACCTT GAAGAGC AGGGGGG AT GAGAACAAA
GACATCGTTGTCAAGGATGCTTACTGGCAGTTTGACAATCAAATCAACCCCGCTTTG CTGGAAGTCAGTGGC
TGGGATATCAGTAAGCAGCAGGTACAATCAGGCGGTCGCTACGGAAAAATCGCGTTG GGTAAGGCGGCACT
CTCTCCTGATGGATTGGCATCCCTTAATAAAACGGGTTGGGACATTTGTTATAGCAG TCCGTTAACAACCCAG
GAAAGCGGTTATCTGGCACAGGGATTATTGCTTAACCAGCGCATTTCTGGGGTGACA GGAGAATTTTTGCTC
AAAGGAGATGGGCGTT ACCAGTTGGG AG ACAACATTCAGCT GACTGGATTTGGTTC ACAGTT AGATGGT ACG
GCAAGCATTACTGAGGTTCGCCACCGTCTTAATCGGCGAATTGATTGGGAAACCACG GTGAGCATTGGTTTA
CAACATGAATATTTGCCGATATTACCTGATGCTCCCGAACTACATATTGCGACAGTA GCGAAATATCAGCAGG
ACAGTGCGGTGTTAAACCGTATCCCCATTATTCTGCCGGTACTGAATCGTCCCAATG AATTTTTGTGGGCCAG
ATTGGGGAAACCTT ATGCT AGCCAT GAAAGCGGTTTCT GTTTTT ACCCAGAGCCAGGT GACGAAGTT ATT ATT
GGTTTTTTTGAAAATGATCCGCGTTATCCAGTTATTTTAGGTGCTATGCATAATCCG AAAAATAAGGCCCCTTT
TGAACCAACCCAAGATAATAGGGAAAAAGTATTGATCGTTAAAAAAGGTGAAGCGCA ACAACAATTAGTCATT
G ATG G C AAAG AG AAAAT G ATC C G AATT AAT G C G G G T G AAAAT C AAAT AAT G CTT C AG C AAG AT AAAG AC ATTT
CTCT GTCAACGAAAAAAGAATT AACACT GAAAGCGCAG ACAAT GAATGCCACGATGG AT AAATCATTGGCAAT
GTCCGGGAAAAACAGTGTTGAAATCAAAGGCGCAAAAATTAATCTTACCCAATGAAA GGTGACGATGAATGG
AAAATCAAATACTGACACAACTCTATGGTCGTGGTTGGGCTTTTCCTCCGGTCTTTT CCCTTGAAAAGGGGGT
AGAGATGGCT GAAGGGGCGGAAGAT GT GAGACAAAGTTTGCAGATTCT GTTT AGT ACT GAGCCGGGGGAAC
GTCTTATGCGTGAAAATTATGGCTGCGGATTAAATGATTTTATGTTTGAAAATATCC GCAATGAACTTATTGCT
GAAATTGAATCCCATATCCATGACAACGTATTACGATATGAACCCCGGGCTGATATG ACTGATATTCAGGTTC
GTCAATCCCCTGGCATGGGGAATACTTTGCAAGTGCAGGTCATGTATCGCCTGAGAG GGAGTGATATCAATC
AACAAATCCAGGGAGT ACTTGCACT GAGT GAAGGCCGGGT G ACGGAGGT AGT AT GAGT GAAGCGATT GTGG
TGGATGGTGACGTGTTACAGTTTGATCCCAACTTTGGCAATCGGCAGGTGACGGTTC CCAGCCCAGGAAAAA
TTAGCGGCACAGGACATGCGCAGGTAAGTGGAAAAAAAGTGTGTATTCTGGGGGATG AGAAACAGGTCAGG
GTTTCTGCAACCTATATTACAACAACACATACTACGCCGGGAACAGGAACCATTACT ATCAGTGCTCTGGATG
CTGGCCAGCAGGCCCTTCAGTGTACCAGTGGGGCGGCTTTAATTATCAAGGGGCAGC AATTTACGGCGATG
TTTACGCCT G AATT GCCAGCCAT G AAT AAT ACAGTGACTCCGCCACAACCGGATGTTACGACACCTTCATCAG
GAAAAGG AC GTTTT AT C ACT C AACAAAATTTTGCT ACCGT AAATT AG AGT ATT G ACT GAATT AAAT AGAATT AA
CGAAGGT GT AAAT AATT ATTT ATTTGCT GACGAATCGCT GT GACAAATAAAC ACAGGT GAT GTT ATGGAATT AA
AT GAGTT AACT AACAAATT GTCAAATTTGGTGCCAAT G ACCGATTTT AAATT AGAT AATCGAGCCAGTTTGCAA
TTGCTTAAATATATTGAAGCGTATACGAAGATAATACCCTTTAATTCTGGCGATAAA TATTGGAATGACTTTTTC
TTTATGTCAG G AAAT AC G C C AG AG AAACTT G C AAAATT AT AT C AG AAAG AAAT AG AAC C C AAT GGGGAGTTAT
TACCTCAGCAGGCTTTTTTGTTGGCGGTTTTGCGTTTATTGGAAACACCAATATCCT TATTAAATGTATTACCT
GCTGCTCATCGTGAGCTCTATTATCGGGAGCTTTTAGGCTTGTCTTCCCATGCGGCA CAGCCTGATCAGGTT
GCTTT ATCT ATGGAACT GAATTCGACAGT GATGGAACAGCTGCTCCCT GAAGGAACCCT GTTT GAGGCTGGT
CAGGATGAACAAGGCAATGCATTGCAATATGCCCTGGATGCCAGTTTGCTGGCTAAT CGTGGATATATCAGT
GACTTGCGCTGGTTACGGAATGACGGGGAAAAGCAATGGGTTACTTCTGCTCCATGG GATTTACAGGCACAG
GT GTCACTGCCGTCT GATGGGAT ACG ATT ATTTGGT AAGACAAAT AGTGATCAGCAGGT ATTTGGTGGGGT G
TT GAT AACGTCATCACTTCTGGCG ATGGAAGCGGGG AT AAGGAAGATCATT GTT ACTTTT GAGCAGGAG AT G
AACACCCAAGAACTGGTGGCACAGGTCAGCAGTGGAAATCAATGGCTAACATTGACG TCTGAGGTAAATAAG
AAAGAGGTCACACTGACACTGTCAGACAAAGAACCGGCAATCAGTGCGCCAGAGGAT CTGGATAATCTCTTT
TTCACGCAACCGGTACTCAGGCTACAGGGAAAGGATAGTCAGGCACTGCCGGAGGTG ACGGGTATCAGCGT
TTCGGAAAAGGATG AT ACTAAGGAT ACCTCTTTT GAGAT GT ATCACTT AACACCATTTGGTT AT AGCAGT GAT A
T AGAGCCATTGGAGGAAAATCCAGCGTT AT ATTT AGGCTTT ACT GAT GT AAAGCCAGGGCAAACACTGGCGC
TGTATTGGAAATTAAAATCCCCGCAGCAACCAACCGTTTCCTGGTATTACCTGGATC AACATAATCAATGGGC
TGAATTGGATTCATGGGTCAGTGATGGAACCCAGAATCTGTATCAGGATGGTACTTG GCACGTTGAGTTGCC
TGTGGATGCATCCAATCAGGCAGAGCAGATGCCAGTTGGACGCTATTGGTTGCGGGC AGTGGTGGAGGTAC
CCGCTCATGAGGGGGCGTTGGGGAAGGCTCCTTGGCTATATGGTCTAATCTATAACG CCATGACGGCAACC
TTG GTT AAT GTAGATAGCATCAGTGACAGCCATTTCTTAACCCCTTTGCCTGCCAGCAGCATACAGCGG CCC
GTTGAACCCATCATTGTGTTGGCATCGGTCAACCAGCCTTGGGCATCATGGGGTGGA CGTATACCTGAATCC
TACAGTGCCTTTTTTGAACGGATAGCTCAAAACCTGTCTCATCGAAACCGGTCCTTA ACCTGGGGAAATATGG
TGACATTACTCAAAGAGCGTTATGTCAGCATCTTTGATGTTAAGTATCCAGGTAATG ATGAACTCACCAGAGT
GCCAGCATTGG AGCAGCAGCAACT AACAGT G ATTCC AGCAAACCGGT ACAACG AT AGCGAT GATTCTCTGCG
TCCGGTACTGAATCCTGCTCGTCTGCAAGAGATGGCTGATTGGTTGCAGCAGAAAGA CTCTCCCTGGGCCTC
TATTGAGGTCAGGAATCCAGAATACTTGGATGTGAAAATCCATTACGAGGTGATTTT TAAACCTGATGTGAAC
GAAGATTTTGGCT ATCGCCAGCTACAGCAGCAACT GT GT G AGGT GT AT ATGCCTTGGAGC AT AGAT G AGCAG
CGGCCCGTTGTATTGAATAACAGCATTAATTATTTCCAGTTGTTAGCCACTATTCAA CAGCAACCGCTGGTTG
AGCGAGTCACTCGTCTGACACTACATCGGGCTGATTCTTCTGATGAGAGTGATGGTA CAGCATCTGTGGAAG
CCAAAGATAATGAAGTGCTTATTTTAGTCTGGGAAGAGGACGATAATCTGCAATACC GAGGAAATGACTATGA
GT AATCAGGATGC ACT GTTTC AT AGCGTT AAAGACGAT ATTCACTTT GAT ACCTTGCTGGAACAAGCTCATCA GGTGATTGAAAAACAGGCTGAAAAACTGTGGAGTGATACGGCAGAGCATGATCCGGGTAT CACATTTTTGCA
GGGAATCAGTTACGGTGTGTCAGATTTGGCTTACCGACATACATTACCCCTGAAAGA TTTACTGACTCCGGC
GCCGGATGAGCAGCAGCAAGAGGGAATTTTTCCTGCCGAATTTGGCCCGCATAATAC ACTGACTTGTGGGC
CGGTGACAGCGGATGATTATCGCAAGGCATTGTTAGATCTACACAGCAGCGACAGCC TGGATGGTACTCAG
CAGGATGAGGGGGATTTTCTGTTCCGGAGTGTGCAACTGGTGCGTGAACCGGAAAAA CAGCGTTATACCTAT
TGGTAT GATGCAACCAAG AGGGAAT AT AGCTTT GTCAACAGT GAAGGGGCT AAAGAGTTT ACCTTGCGGGGG
AATT ACTGGTT GT ATCTGGAACCAACCCGTTGGACTCAGGGTAAT ATTGCCGCTGCT ACCAGACAACT GACA
GAATTTTTGACTAAAAATCGCAATATTGGTGAATCTGTCAGCAACATTATCTGGCTA CAACCGGTTGATCTGC
CACTGTTGCTGGATGTTGAACTGGATGATGATGTAGGTGCACAGGATGTCCCCGGTA TTTTTGCGGCGGTGT
AT AGCACCGC AGAGCAGT ATCT GATGCCTGG AGCACAGCGTT ACCGT ACGGAAGT ACTGCAAAATGCTGGG
ATGAGCAATGATCAAATCTTCGAAGGTCCATTATTGGAACATGGCTGGATACCAGAG CTGCCGGCAGCCCGT
GATTATACTCAAAGGCTCACTCTCAATCTTAGCCGGTTGGTAAATAGTCTGCTTGAG ATTGAGGGCATTAAAC
ATGTGAATCGTCTTCGTCTGGATGATAGCTTCGATAAAACTGCTATTGAACCCGTTA AGGGGGATACCTGGTC
GTGGTCGATCAAAGAGGGCTATTATCCACGTCTTTGGGGAGAAGACCCACTTAACCA ATTGGCGCAACAAAA
TGGCCCGCTT AGGGT GAT AGCCAAAGG AGGGATT AGCGTCAGT GT GAGT AAAGAGCAAATCCAGGCCAGTT
TACCCAGTCAATCACTGATTCAAAATGAGCCGGTAATATTGGCTTACGGCCAGCACC GTGACGTTGGCAGCT
ATT ATCCCGTCAGT GAT ACTTTGCCGCCTTGCT ATGG ACT ACAAC ATTCTTT GTCT GAAAGT GAACACTT ATT G
CCACTTCATCAATTTATGTTGCCATTTGAACAATTATTGGCCTGTGGTTGTCAACAG ATAGCCATGCTCCCGC
GGTTACTGGCTTTTCAGCGCGAAGGTTATGAGGTTTGGGGTGATCAGTGGCCCTTTA AGTCAGGCTCAGTGA
ATGATGACGCCCATCAAGATTATGCCCCTGCATTAAAGGATTTGTTAGGACAGATTG CGCTGGATAGTGATCA
T GAATTGGAT ATT ATT AATT ACTTGCTGGGTT ACTTTGGCAC ACAGCGGGCACCGCGT ACCTTT ACGACACAA
CTCGATGATTTTCGTGCGGTCCAACAGGGTTATCTGGCCCAGCAACCGACATTGACT TACCACCGCTCCAAT
ATTCGTATCGATCAGGTATCGTCGCTACAAAAACGTATTGCTGCTCGCATGGGGCTG GGCGGTGAGTTGTTT
AAACCTCAACCGGATCTGAGCCAACTGCCTTTTTATTTGATTGAACATCGAGCGTTG CTGCCAGTCAAACCCA
ATAGTCAGTTTGATAAGGAACAGAAACCAGCCTCGGTGACAGAGGAGGGGGGCAGCC AAACAGGTCAACAT
TATGTGGTCATTGAACAGAAGGGCATTGATGGCAAGCTGACACAGGGGCAAGTGATC AATTTAATTCTGTAT
GAAGGAGAGCAGGGAGAAACCCAATTTACGATACGCGGTCAGATGGTATTCAAAACC GAGGGGGATAAGTT
TTGGTTGGATGTGAATAATAGTGCGCAACTGGAATATAATCTGGCGCGGGTAATGAC AGCAGCCAAGGCGAG
TAAACTCTTTTGGCAAAACAGCCCGGTATGGATGGAGGATATGGGCTATCGTCTGGC CTATGCTAGTGACCA
ATCCTCATTGCCTGTGAATCAACGGCGCTTGACCCGCACAGTGCAAACTCCATTCCC GCCGATGGTTGTTGT
AGGT AGCGAAATCACCCTGTT AAAGCAGGT GGGGAT AGTCAATTT AAAAAAAGCGG AGTCAGAAAAACTTT AT
GCAAAAGTTGTTAGCTTTGATCGCATTGAAGGGACCTTGATTATTGAGCGTTTGGGT AATTCCACTCTGGCTT
TTCCTACCTCGGAAGAGGCGTGGCGGTATAGTTGGTATTTTTCGGGGGAGAAATATG AAAGGACTGACCGCT
TTTC ATTT GT GATT AGCGT AGT AGT GAACAGT GACTT AATT AAATTGCCCGGT GTT GATCCCTATAAATTGGAA
G AAT G G GT G AAAG AAAC GATTCTTACC G AATTT CCAGCTCATATTTCTATGATTATCCATTGGATGGATCGGG
AAGCCTTTTTAAATTTCGCCAATACCTATCAGCGTTGGCAAAATAATGGTACGCCAC TGGGGGATGCGGCTTA
TTCCATTCTAGAAAGTTTGACACTTGGTAAATTGCCATCTGCCTTAAAAGGTGTTGG CACAATGCGTATTGCC
ACATCTAGTCAAAGAGAAGAAGTGGTGGGTAGTAATGGTGATCAATGGAATACAGAT GGAATAACCCAGAAT
GAATT ATTCT AT GTTCCT AAAGAGAGCT AGGAAAAATAAAT ATCTGCC ACT AAT GAT GTT GAATT AAAT AT GTTT
TCTGGAGTTAATCATGAACGAAACTCGTTATAATGCAACTGTACAAGAACAACAAAC ATTATCTAATCCAAAAG
CTGTTGGACCTGACATCGATAAATTAAAGGATAAATTTAAAGAGGGCAGTATTCCCC TGCAAACCGATTTCAA
TGAGTTAATTGATATTGCCGATATTGGACGTAAAGCCTGTGGTCAAGCGCCACAACA AAATGGCCCAGGAGA
AGGATTGAAATTGGCTGATGACGGTACGCTTAATTTAAAAATAGGCACTTTTTCCAA TAAAGACTTTTCTCCAT
TAATATTAAAAGATGATGTTTTATCTGTAGATCTTGGTAGTGGTCTGACTAATGAAA CCAATGGAATCTGTGTC
GGTCAGGGCGATGGT ATT ACAGTT AACACT AGCAATGTAGCT GT AAAACAAGGT AACGGAATT AGCGTT ACT A
GT AGTGGTGGTGTTGCCGTT AAAGTT AGTGCT AAT AAGGG ACTT AGCGTT GAT AGT AGTGGTGTTGCAGTT AA
AGTT AAT ACT GAT AAGGGAATT AGCGTTG ATGGT AATGGT GTTGCAGTT AAAGTT AAT ACT AGT AAAGGAATT A
GCGTT GAT AAT ACAGGTGTTGCAGTT AT AGCT AATGCT AGT AAGGGAATT AGCGTTG ATGGT AGTGGT GTTGC
AGTT AT AGCT AAT ACT AGT AAAGGAATT AGCGTT G ATGGTAGTGGT GTTGCAGTT AT AGCT AAT ACT AGT AAA
GGAATT AGCGTTG AT AAT ACAGGT GTTGCAGTT AT AGCT AATGCT AGT AAGGGAATT AGCGTT GATGGT AGT G
GTGTTGCAGTTATAGCTAATACTAGTAAAGGAATTAGCGTTGATGGTAGTGGTGTTG CAGTTATAGCTAATAC
T AGT AAAGGAATT AGCGTT G ATAGT AGTGGT GTTGCAGTT AAAGTT AAAGCT AATGGCGGAATT AAAGT AGAT
GCTAATGGTGTTGCAATTGATCCTAATAATGTACTCCCCAAGGGAGTGATTGTAATG TTCTCTGGCAGTACTG
CACCAACTGGTTGGGCGTTATGTGATGGCAATAATGGTACACCAAATTTAATCGATC GATTTATTTTAGGTGG
GAAAGGG ACT GAT ATT AAT G GAGT GAGT ACT AAT AC AGCTT C AGGTACT AAAAAT AGT AAGTT ATT CG ATTT C A
GTTCT GAT GAAGCT ACATT AACT ATT GATGGT AAAACACTGGGG AGAGC ATTATCGTTAC AGCAAAT ACCT AA
TCATGCACACTTT AGTGGAAT AATT ATGGAT ACAG AGAAAGTT AATTATT ATGGAAGT AAAAAAATCACAACAA
ATGTGTGGGGTGTAACAACAGGAGATAATACTTCAGTACGATATATTTATAAGTCAT CAGGTGTACTTGACTC
TAACAATAATGTCTCCAACAGTACCTTAGGCGGAAACAGTCTGCAGACGCACGATCA TGATATTAAGATAACG
GGCACAGGAAAACATTCTCACAAAAACAAAGTAACAGTCCCTTATTATATTCTGGCT TTCATCATAAAGCTTTA
AT AT AT AT G AAAAATT G AAAAT AT AAATT ATC C ATT AAT AAT AAAG AG GATATTAGCATGACTTCGGAGC C AAAT
CTGTTAAACCGGATTACAATTACTATTGAAGCTAATAATCAACAAGTAGCTAGAAAA GTATTGCATGGCTCCTT
GCTT AATCAAGCT AAT AT AAAT AAATT ATTT AATTCAT ACTTT AAT GAAT AT GAAATT AAT AGGGGTGTTT ATTT A
GAAACATTAATCCTGAATCTTGGTACGATAAATTTCCATGATTTTAATTCATTGTTT CCTACTCTCCTAAAAGCT
G C ATT GAAT AAAG AATT C AGT C AAT ATC AG AT AAAC AAC CAT AG G G AAG AAAT G CT ATTT AAT GAG AC AAT AT C
AAAT CAAGCT ACT GAT AAGT CTT AC AT ATTTGGCG AT AACAAATT AATT GAT G C AG AGAATTT C ATT C ACTTTTT AT AT C AAAAG CATTCCACATT AAAT CTAGTAGAAG C AAT G G G AAAT AAT G GT ATT G AAAAATT AAC AAAT C AG T
TAACACAAATAGAAAATAAATTTGCGTTATTATTGGCAAAAAGTTGTTTGTCTGAGG AAGGCTTAAAACGACTC
TTG G CT AT C AAAC AAC C C G ATTT ATT AAT C G CT AT C AAT CGCAGATTATCT G AAAG AAT AAAT AG AC C AC AAT A
TCAGGAGAAGCTTGTTTCCTGCGGACAACTGATATTTAGTGCTCTGGGATATATACA ACAGTACAATATACAG
GAAATTCCT AAACCGGAT GAAAAAGTTATTGC ACGCAT AACAACT GAACTT AAT AAT AATGGTTTGCTT AAT AC
AATACCTATTATTACACTATTTCGTCAGAGTGGGATTAACGATTCATCACTAAATGA TTGGCTAAAGAAAATCT
GGCAGGTGAGATCAATTTCACAGTTATGCAGAAAGTATCTTTCTGCTAAGGAATACC AATATCTGTCAGAACA
TTTTGTTTCAAAGAGCGTCGATAAAAATAGATATGATGAAGAGCCCGTAAATCAGAG CATATTATCAAGGTTG
AATAATAATTCCATTAAAGAAGGAAATAATCACAGTCAACTCTGTACTCTCAGTAGA CTATATTCTGAACCCGT
TGTATTACCTGAACAAACCATTCTACGTCAGGTTAGTAATACAGTAGATCAGAGCAT ATTATCAAGGTTGAATA
ATGCCTCCATTAAAGAAGGAAATAACCAAAGTCAACTTCGCACTCTCAGTAGACTAT ATTCTGAGCCCGTTGC
ATT ACCT GAACAAACCATTCCACGTCAGGTT AGT AAT ACAGGT AT ATT AATTCT ATGGCCAATGCT ACCT ACAC
TATTTAACCAGCTTGGTCTACTTGAGAAAAAGAAATTTATCCATCGTCAGGCCCAGT TTAATGCCGTTGATTTT
CTT GATT ACCTG ATTTGGGGAACCGAAGAT GT GAAAGTGGAACGAAAGGTTTT GAAT AAT GTTCT AT GTGGGT
TAATGGCTGATGAAATTACTGAACCAATGCCTATTGAACCAGAAAAACAATGGATAA TAATTCAATGGCTGGA
CGCTATTATCTCCCAACTTTCTGGCTGGAAAAAGTTAAGTCGTAATGACGTCCGTCA ATTATTTCTACAACGAC
C AGG AGAATT ACT GAT CAAT GAAC AGGAAATT AAAAT CAC AAT AC AGCAACAACC ATTT G ATG CTCTGTT AACT
GATTGGCCGTGGCCAATGAATATGGCTTGTTTTAGCTGGTTGAGTCAACCATTAACC ATTACGTGGTTATAAC
C ATTG AC CAC AAT GACTTAGTCTGAGT AAAAAAT AT GAAT AT ATC G C C TG TTTTTT ATG ATTC ATT GAAT C AG G
ATAACGACCGTGATCTATCGTTTTTATTTAGCGAACTGGAACGAATAGATCTCGCTC TTCAACACCATTTTTAT
TGTGTAGAAAGTCAGCGAAGTGAGCTCCTGGATGAGTTTCTGCTCACTGAGGCGGAA GTGGTGACCAGGCT
GGATAAGCCACTTGGTAAACCTCATTGGATAAATGATGATTATCTGGCGATATCGCA AAAGGGCAATGTAAGC
CT AATGGCAGCGTCCAGATT AATGGATCT GATCGAACGCTTT GAACT GACT GATTTT GAGCGCGAT GTTTT AC
TATTAGGCTTATTGCCCCATTTTGATAGCCGCTATTATCGACTGTTTTCGCTGATTC AAGGGGGACAACAGGG
TCGATTACCTTCTTTTGCGCTGGCATTGGAACTGTTTTGCCACTCGGCGCTGGAGAA ACAGGTACAGCAAGC
GAGTTTTCTGCACCGGGCACCTTTGATGGGTTGCCAGCTATTATCCATCGATACTAG TCAAAAAACGCTGGC
CTGGCTCCAGACTCCCTTTATTACTGACAGCGGGGTATATCACTTTTTACTGGGGCA TCACTACATTATGCCG
GCTTT AGAACATT GTGCT GAGTGGTT AACACCGACAGGGATTGGCT GTT ATCCT GAAGG ATT AAAACAAGTAC
TGGGTAACGTATTGTTATCTGACAACGATAATATTAGACCGATTGTCTTATTACGGG GAATGGCCGGCAGTGC
CAGAGCTT AT ACCATT ACT AAT AT GATGGCTTCAGAAGGGAAGCAAACACTGCTGGT AGAT AT ATCCAAACTT
GCTGATAGCGATGAAAAAAACATTATTCTTCAGATAAAGCATATTTTGCGGGAAACC CGCATGCATGGAGCAT
GTTT ATT ATT ACG GAATTTTT G CTTGTT AGT G GAAC AGAAT AAACAACT ATTGG ACTCCCT GT C AG AGTT ATT G
AATCAACCTGAATTAAGAATTGTTTGCCTGATTGAGCCTTATTCCCCATTGGTATGG CTGAAAAAGATACCGG
TATTACTGATTGAGATGCCACTTTTAACGCCTGCGGAAAAAGCCAGATTGTTAATTG CCAGCTTACCGGATAA
TT GTTCCGAGGAT ATT GAT ACGAT AACTTT AAGCC AGCGTT ACACTTTTAACCCAGAAACCCTGCCATT G ATTT
TGCAAGAGGCCCAGCTTTATCAACAGCAGCGAGATCCGCTGGATATCTTGCAGCAAT GCGATATACGCCAGG
CATTAAATTTGCGTGCTCAACAAAATTTCGGTCAATTGGCACAGCGGATTATTCCTA AGCGCTCATTAAAGGA
TTTATTGGTATCCGATGAGATTGCTCAGCAGTTACGGGAAATACTCATAGCAATTAA GTATCGGGAACAGGTT
CTGGCGGGAGGGTTTAAAGATAAAATTGCCTATGGCACTGGTATCAGCGCCCTGTTT TATGGTGATTCAGGC
ACTGGAAAAACCATGGCAGCAGAAGTGATTGCTGACCACATTGGCGTTGACTTAATA AAAGTGGATTTATCTA
CAGT AGT GAAT AAAT ACATCGGT GAAACAGAAAAAAACTT ATCCCGT ATTTTCGATTTGGCGGAACAGGATGC
AGGGGT ATT ATTCTTT GAT GAAGCT G ACGCACT GTTTGGT AAACGCAGT GAAACT AAAGATTCCCAGGACAG A
CATGCCAATATTGAAGTTTCTTACTTATTACAGCGCCTGGAGAATTACCCGGGTCTG GTCATTTTATCCACCA
ATAATCGTGGTCATTTAGACAGTGCTTTTAATCGTCGTTTTACTTTCATTACCCGTT TTACTTACCCGGATGAA
AAAAT C C GT AAAAAAAT GTG G C AG G AAATTT G G C CT AG AAAT AT AAAAAT ATC G G AAG AT ATC G ATTTTAAC G A
ATTAGCTCAACGAACAAGCGTGACTGGCGCGAATATCCGCAATATTGCTTTATTGTC TTCATTCTTTGCTTCA
GAGCAGGGGAATGATGAAGTCAGTAATGAAAATATTGAAATTGCATTGAAGCGTGAA TTAGCTAAAGTCGGA
CG ATT AAC ATTTT AAAAGTT AT C ACAAT GAAAGT ATT GAAAT ATT AAAT AAATTT ATT ACCAAAAAGTT AT C ACG
AT AT AATTT AAG AGAGGTTTTTT AT GTT AAACACGCAAACT ATT ATTG AT GTCAAT AAGGCAATGGATGCC AT G
CTGCGCGCATATCTGAATCAAGATATTGCCATTCGTTTTGATCTACCTGAATTGGAT ACTATGCAATCTGATGC
GATGGTAAGTATCTTTCTTTATGACATTCATGAAGATTTACAGCTTCGCTCGGCAGA ATCAAGAGGGTTTGAT
GTTTATGCCGGGAGGTTATTGCCTGGTTGGGTAAATATTAAATGTAACTATCTGATT ACCTATTGGGAAGCTT
CT AAGCCAGCG ACT GATGCCAGCAGTCCGGAT AGCCAACCT GAT AACCAGGCAAT ACAAGT GAT GTCACAAG
TATTAAATGCCTTGATTAATAATCGTCAATTGGCAGGTATTCCTGGTGCTTATACTC AGGTTGTACCGCCTAAA
GAGAGTTTAAATAGCCTGGGGAATTTCTGGCAATCACTGGGTAATCGCCCACGGCTT TCTCTCAATTATTCAG
T GACAGT ACCT GTT AGCCT AAACGATGGTCAGGAT AGCGCG ACTCCGGTT ACCGCGGTTTCTT CT ACAGTGG
AACAAACGGCATCGCTCAGTCAAGAAGTGGTTAGTCATGCTTTACGCGAATTACTCA TTACGGAATTAGGAG
GAGGAG AGG AT AACCGGTTGGT ACT G AGTAAAGTT GAATT ATCCGCAGT GAAAG AG ACGAT GACTCAAGACA
GTCCGGCTCAGATGATTATATTGTTGTCTGTTTCAGGCATTACACGACAGGAATATT TGAAGGAAATTGATAAT
ATCTTT GATCGTTGGGT AAAT AATGCTGAAGTT ATT ACCACT ATT GAT GATT GTGGGATT AGAATT GAAAGT AT
AACGAAAGATAATCTTGTAGGAATTTAA
SEQ ID NO: 94 (Photorhabdus asymbiotica strain ATCC43949 PVClopT operon, pvc1-pvc16, e.g. corresponding to genes PAU_02112 to PAU_02099 of the sequence of GenBank accession no. FM162591.1)
ATGGCCACAACCACAGTTGACTATCCAATACCGGCTTATCGATTTGTTGTCTCCGTTGGT GATGAACAAATCC
CTTTTAACAGCGTTTCGGGGCTGGATATTACTTATGATGTCATCGAGTATAAAGATG GCACCGGTAATTATTAT
AAAATGCCGGGTCAACGTCAGTTAATCAATATTACACTGCGTAAAGGGGTATTCCCT GGCGACACTAAACTTT
TT GATTGGCTT AATTCCATTCAGCTT AATCAGGTT GAGAAAAAAG AT GTTTCAATT AGCTT GACCAACGAAGTT
GGAACTGAAATTTTAATGACCTGGAGCGTAGCCAATGCATTCCCAACCTCATTAACA TCTCCTTCTTTTGATG
CCACCAGCAATGATATCGCTGTTCAAGAAATAAAACTGACTGCCGATCGAGTCACTA TTCAGGCAGCTTAAAG
CATCACGATGATTGATATATCAGACGGGACAAAATGATCCTCAAAATTTGGCACAAC GGCTACCCGTCCAACT
AAATTTACCCTCTTACAGTTCACGCAAAATATCGCACAATACAATTGGAGGCAATAT GCCAACAACAACTTATC
CCGGCGTTTATATTGAAGAAGACGCCTCACTGTCACTTTCCGTTCGCTCAAGTGCAA CGGCGGTGCCCGTTT
TTACCGTTGAAGATGACAGTCAACTTCATACTCCTACCAGAGTGAATAGTTGGTTAG AATATCTGACAAAAAAA
G C AG AT AAAAAATT C AATT CT AC C G AC AAACTT GATATCGCATTGCGCGCTTATTTTATTAACGGCGGCGGAT
ATGGTTATCTCGTCAAAGCGGGTGAATTAACAAATCAAATTCCAAAACTTAACGATG TCACATTACTGGTCGC
GGCTGGAGAAAATATCAAAGATGCTGTGAGTACACTTTGTCAACCGGGCAAAGGCTT ATTTGCCATTCTGGAT
GGCCCAACCGAAGAGTTAAAGTCTGATGGCAAATCCAGAGATCCGTATGATCAAAGC CCTTTTGCCGCCGTT
TATTACCCCTGGCTAGTTGCTGATTGGGCAGACAATATTCCGCCAAGCGCGGCCATT GCCGGTATCTATTGT
TCAGTTGACCGTACCCGCGGTGTCTGGAAAGCCCCAGCAAATGTCATATTACAAGGC GGGGTGAAACCGAA
GTTT AAAGTCACCGAT GACTT ACAAGGT ATTT ACAACACCGGT AAAGCCATCAAT AT GATCCGT GAATTTCCG
AATACCGGTGTCACCATCTGGGGCGCCCGCACACTTAAGGACGAAGATAACTGGCGT TACATCCCAGTTCG
CCGCCT GTTT AACAGTGCAG AGCGAGACATT AAAAATGCCAT GAGTTTCGCGGTCTTT GAACCT AACAGCCA
ACCCACCTGGAAAGCTGTACACCGAGCTATTGATAATTATCTCTATGCCCTTTGGCA ACAAGGAGGGCTAGC
AGGAAACAAAGCT GAACAAGCTT ACTTT GTGCAAATTGGT AAAGGGAT AACCAT GACCGAT G ATG AT ATCAAG
CAAGGGAAAATGATTGTTAAAGTGGGTATGGCCGCAGTGCGCCCGGCTGAATTTATC ATCCTTCAATTTTCAC
AAAATGTAGCACAGTAACCGTACTGAGGCGCGGTTTAACACCGCGTCCATTCAGTCT ATTGAATGGAGGAGA
C AAT AAT GATAACGGAGAT AAAAC AG CCGGGCGTCACCATCACG G AAAATT C G AT ATC C C C G AAATC AG AT A
ATGAATTTATCGGCGTCCCCGTTTTTATTGGCCATACCGAAAAAAATTCAAGCCATA AAACGGCTGTTAAACTA
AAT AGCCTGATGGACTTTACCCAAGCTTTCGGTGCATCAGGATTAACCTATTATTCAGTACGC CACTTTTTTGA
AAATGGTGGACAGCAAGCTTATATCTTGTCACTGGGGATTAATCAACAGCTAAAAGA TTTTCAATCATTGATTA
CCGCCCTGCAATGGAACTGGGTAAAACAAGCCATTGCCGCAGAAAACGAAATCACAT TGATTGTTGTGCCTG
ATATTACCCGTTTTAATGATCTCAGCGCTCAAAAAAGCCTTTGGCTACAACTCTGGC AATCAATACTTGAACTG
T GT AAAAGTCGGCGTGGCATCATGGG ATT ACTGG ACGCGCCT GAT GATCCAACATT AGCAACT GAGT GTTT A
AAACAATTCTCTTCCACTGATCGCCAATGGGGCGCCGTATACTGGCCAAGGCTAAAA AGTACCTACCAAGAA
AACGGTACATACATTGTACTTTCACCTACTGCTGCGGTCGCCGCCGTTATGCAACGC AATGACAGTCAGAAA
GGCATATGGACTGCTCCCGCCAATGTGGCTTTAGCCAACGTCATCGGTCCGGTACGT TCTTACATTGAAGCT
GGAACCTTGCTGAATCAAGAAGGCACTTCGTTGAATCTGGTGCGTAGCTTCCCCGGC AAAGGCATTAAAATC
TGGGGCTGCCGCACTCTGGATAACATACCTCATTCTCCCTGGCGTTATATCCAAATT CGCCGTTTGGTTTCCT
ATATCGAAGCTCATATAACCCAACTTGGCCGCGCCTTTGTCTTTGAACCCAACAACG CCATCACCTGGATGAA
ATTTAAAGGTCAGGCCCACAACTGGCTACGTCAATTATGGCTAAAAGGTGGATTACG GGGCACTCAGGAAGA
TCAAGCATTTGAGGTGTTACTGGGTGTTAATGAATCCATGAGTGAAACGGATATCTT GGCCGGAAAAATGATC
ATGAAAATCAGGCTGGCGCTGTTAATTCCGGCAGAATTTATTGAGCTGAGTCTGACG TTTGATATCCGTAACA
ATACCGTACCTAGCTAATCTAAACAGGGGAAAAACATGTACAACTTATACACCCCGT CAGTATCTCACCGTTT
TATCGCCAGTTTTCTGTTTAACAACATTCCCAGCCCACTTGATATCGCCTTTCAGCG TATATCTGGCCTGAGC
CGAGAACTGCAAACCACCCAACAT AGCCAAGGTGGAGAAAACGCCAGAAACGTCTGGTT ATCCGAGAAG AT
CCAACATGGCAGCCTGGTGCTGGAGCGCGGTGTTATGACCATCACTCCCCTCACCTT GGTTTTTGATCGCGT
GCTGCGCGGTGAAAAAGCCGTGTATGCCGATGTTGTCATCATGCTACTGAATGAAAA TGCGTTACCCGTGGC
GAGCTGGACAGTCAGTAACGCGCTACCGGTTCGTTGGTCCACCAGCGACTTTGATGC TAATAGCAACACCGT
ACTGGTGAGTTCTCTGGAATTACGTTATCAGGATATGCGCTGGTTAGGAGTAAAAGC ATGACGGTAGAAATTA
AAGAACTGATTATTCAGGCTAAAGTCACCGATTCTACGAGTGATCAACTCGCCCCAA GAACATTAGCCCAAGA
AAAGCTGGAT AACGCCCGTTT GATT GACAT AGT GAAACGGGAAGT GTT AGAGGCATT ACGTGAAGGAGGCCA
TCAT GAGTTT AATT GAACGTGGTTT ATCCAG ACTCACCCT AACCGCTTTT AAAGACCGAGAAGGT AAAGTTTC
CGTGGGTCGCTTACAAGCCATGTATAACCCCGATACGATCCAGCTTGACTACCAAAC CCGCTACCAACAGGA
TGAAAGTGTTAATCGTGCCAGCCAAAGCAGCCGTTATGTATTATCCCAACCCGCCGG ATTATCCTTAGTTCTG
CTGTTTGATGCCTCGATGCCCGATAATAACATGCCGATAGAAACCCAGCTTGCGACC CTGAAATCCCTGTGT
GCGATTGATGCCAGCACCAAAGTACCCCACTTCCTTAAAATCAAATGGGGCAAAATG CGCTGGGAAAACAAA
GGTTATTTCGCCTGCCGAGCCAGTAGCCTGGCCGTCAACTATACCCTGTTTGACCGG GATGCCACACCATTG
CGGGCCAGCGCCACTCTATCTCTGGTAGCGGACGAAAGCTTTATTATTCAAGCTACC GAACGGCAGTTAAAA
TCACCGCCGGCCACTGCGGTTAGCGTAACTGATATGCTCTCCCTGCCTTTGATTGCT TTAGATGCTGGAGCG
TCTCTGGCTGGTGGCATTGATTATCTCTCGCTGGCCTGGCAAAACGGTCTGGATAAT CTTGATGACTTTACCC
CCGGACAAACACTGCAAGCGCGGGGGGATGCATGAAGATACCCATGATAACCCTCAA AATAGGTGGCAAAA
CGCTCAATCAATTGACTGTCATCAGTCTGACAATAAACCATCAAATCAATGGCATTC CCTCGACCAACATCAC
CTTGGGGATCGCTGGCGATGCGAGCCATATTTTCGACACCAAAGCCCAAGCTGAACT GGCAAGTTGTCGCC
CCAAT AAT GAACTCACCCT ACAG ATCCAAAAAACCGTGGT GTTT AAAGGG AGCATCGTTCGACAAGCACTT GA ACTGAAAGGTCAAGACAGCATCATTACCCTGACAGCAAAACATCCACTACAAAAGTTAAC TCATAGCCTCCAT
TCACAATTATTCAGTCAACAGAGTGATGAAGCGATTATCAGGAAATTATTCAATCAG GCGGGTATCCAAACAA
CGATAAAGCAGGCTCCTCAACTTAAAACCGTTCATGAACAAATGGTGCAATTTCGTT GCAATGACTGGGCATT
CCTAAAAAGCCGATTGATTGCCACTAATACCTGGCTGTTGCCCGGCAATGAATCGGT TACTTTGATAACACCT
AAGGCCCTGAATCAATCGACAGTGCATACTCTTCATCGACAGGCCAGTGCTGAAGAT ATTGTGTTATTTGCAG
CGGATCTCCAATGGAATAACCAATATAGCCCTAAAACGGTGAGTGTACGTGCCTGGG ATATTGCTCAACAAA
AGCTTTCCCCAGCAATT AAT ACCCAAAAC AGTCAGCTTGGCAGTC AT AAATTGGCCGTGGACAGT ATCGCCG
CACTGGCTGATAAAGAGTGGCAATGGGCTTACAGCTATCCATTAGATAATGAACAAG CCAAACACCTTGCTCA
AGGCATTATGAATAACCTGCGAAGCCATAATATATCTGGCAGTTTTGAAATCGAAGG TAATCACCGTTATCAA
CCGGGGGATGTCTTGGCGTTAAATGGTTTTGGTCAGGGGATGGACGGTCAAGGGATT ATCACCGGAGTCAG
TCAGATAATTAATCAGCGGCAAGGCTGGCACACCCTATTAACCTTAGGCATGTTACC CGATGTAGAACCGCC
GGTGCCTCAGGTGAAAGAGTTGCATATCGGTATCGTGGAAAAATACCAGCAAGACCG CCAATCACTAAGCCG
TATCCCAGTCAGAATACCCGCATTAAACTTGACCAAAGGTGTCCTTTTTGCCCGGCT AGGTAAACCTTATGCC
AGTCATGAAAGCGGATTTTGCTTTTATCCCGAACCGGGAGATGAAGTGATTATCGGA TTCTTTGAATGTGATC
CTCGTTTTCCAGT GAT ATT AGGTTCCATGCAT AATCCGAAAAAT AAACC ACCGTT AGAACCCAGT GAAAAAAAT
CCGGTGAAAACTTTAGTTATCAAGCAAGGGGATAAACAACAAGCATTAATATTCGAT AATAAAGAAAACACGG
TGGCACTTAATAGCGGCGAAAATAAAGTCTCTCTGCAACAGGATAAAAACATTACGC TCAATTCAACTAAAAA
TCTCATCACTCAGGCCCAAGAAATTAATATACAAGCGGAAAAATCTCTGTCAGCCAC AGGAAAATCTGGCGTC
GAT ATT AAG G G C G C G AAAATT AACTTAACCCAGT AAT G AG GT ATT G AAAT G AC AAG C C AAAT ATT AG C C AAT A
TTTACGGTTGCGGCTGGAAATTTCCGCCACAGTTTTCTATTGAAACTGGCGTAGAAA TGGCCGAAGGTGCCG
AAAACGTTCGCCAAAGTATGAAAATCCTTTTTTTAACTGAACCCGGTGAACGAATTA TGCGTGAAGATTATGG
TTGTGGTCTGAATGATTACATGTTTGAAAATATCAGTGATGAATTATTATCGGAGAT TCAAACCCGCATTGAAG
AACGAGTATTGCGCTATGAACCCCGTGCTGAAATCACAGATATCCAAGTAACTCAGA AAACAGACTCACCGA
ATACTTTACATATTCAAGTGACCTATGCCCTGAGAGGCAGCCAAATCAGTCAACAGC TTGAAGGGGTTCTTGA
GATCAACGAAGGTCAGGCAAAGGTGAGTCTATGAGCAAACAACTCATTATTGATGGC GACAGCCTGCTATTC
GAGCCATTATTCGGCAACCGGCAGGTCACTATTTTGATGCCAGCGACCATCAGAGGC AGCGGACACGCGCA
AATCCAAGGCAGAAAGATAGCGATTGTCGGCGATGAAAAAAAGGTACAACTTCAAGC GCAATACATTACCCC
AAGCCACCCGGTACCTGGCATAGGCACAGTTACCATTGCTCAATTAGATACCAGCCA GCAAGTCAACTTTTG
CCACAGCCCTGCCACAGTGATAGTTGTCGGGCAGCAATTTACCGCTCGATTTACCCC ATCACAGCCGGCAAT
T AAT CCGTCAACCGGGCCAGATGTCACAACACCCAGTATGGG C AAAG GCCGTTTTATTGCCAGTCAACATAC
TATCAACGCCGGATAAATAACTCTGCAAAATCATTATTCAATAACGTTCCTATTCTG CAATAGCTATCAGCAAT
ATATTCAAATAACAGGTGGTATAATATGGGACTCACCGAATTAAAAAATAAACTCTC TGCTATCGTACTCGATA
CGGATTTT AAACTT GAT GAAAGAAGT ACACT GGAT ATTTT AAACTGGCT ACAAGAAT ATGCT AAAAAAATCCCT
TTCAATCAAGAGAAAAAACAGTTCTGGGATAGTTTCTATTTTATTCAGGAAAATAGT CCTGAGAAATTAGCCGA
TCTTTACCAAAACGTTAATAAAACGAATGGCCATTTACCGGCCCATCAAGCTTTTGT TTTAGCCTTTTTAAAAC
TTTT AGAAACCACCAAAGT ATT ATTT AAT ACTTTTCCGGCACGACATCGT GATCTTT ATT ACCGGGAATT ATT A
GGTCTAAAACCCAGAAATGCCCAAGCAGATAGTGTTGCTTTAGGCATTACCTTAAAT ACAGATAACACAGAAC
ATCTTATTCCTAAAGGAACCTTGTTCGATGCCGGGCAGGACAGGGCCGGAAATCCGC TACAATACGCATCAA
ATGCAGATTTACTGGCGAATCAAGGAAAATTGAGCGATCTGCGTTGGTGTCGAAAAG ATAATGATAGCTGGC
AATCTGCAAT ACT ACT GAACCACTCAGAT AAT ATT GAATT ACCT GAAAACAGT ATTCGACTTTTT AGTCCAACG
CCGGATGATATTCCCGTTTTATCCGGTTATTTGATAACTTCGTCTTTATTTGCTATG CCAACGGGGGAACGCA
GT ATT ACATTG ACTTT AGCAGAT AATTGGCATGGT GAT ATT AAGCACATCACCGCT AAAATC AGTTCGGGAG A
T C ACTGGCTTT C ACT AT C AGT AAAAAAAGAACAAG ACAAT AGT ATT C ACT AT CTT AAACTTT ATTT AT CAACCAA
TGATGACCCCATCGGTCCTCCTGATGCTTTGGATAATATAGCGTTTGATGTACCGGT ATTAAAGCTGGGCACT
GTTCAGGGACCT AT ACT ACCCAAGATT ACGGGT ATT GAAATT AGCATT AACGGCAACAGT AAT GT ACATT ATT
CCTCTGATAACGGTATTGAAAAAATAGATGCAGCTAGTTTTCCCTTTGGACAATCAC CGTCACCAGGTTCCGG
TTTTAATCTGATTGCCCCTGAATGGTATGGTACAGAAAGCGCCAAAATTACTCTTAC TCCTCAATGGACTGGA
TTACCCAAAGAGGGGTTTAAAGAGTGGTATCAAGGATATAGTTCTACCCCCGAAAAT AATGCATTTAAAGTAC
AGGCTT ATTT AAT C AC ACCT CAAAAG AG AGAAAAATTT AAT GAAGCT C AGT C ATT ATTT AAT G AAAGT AAAG AC
AAGAAACCACAAGGAAAAAGCCT AACTTTT ACCTT ACCTGCAATGG ATT ATTCCTTTGCAAACAGCCCATC AT
CT AAT AACTGGCCCGCATCAATACGCATAGAACTAACC GAAC AG G ATTTT ATG C ATG C C C AAT ATT G G C AAAA
TCCTACGGGTAAAAAACAGCCCTATACCCCCAAAATGAACACATTACAAATTCAGTT CAGTGCCAAAGTTAAA
CCCGAACAATTTTCCGTTTATTCTCTCACGCCTTTTGGTTGGGGAAAAACAGGAGAA AATAGAACATCATTAA
CCCAT GAT ACATTCT ATTT AGGTTTT ACCGAT GT ATT ACCAGGACAAACTTT ATCCCT GT ACTGGCAGTT AGAA
GGTATTAAAAAGCTCCCTTTATCCTGGTCTTATCTGAATCAAGAAAATACCTGGAGT CCATTGGATAATCAGGT
G C ATG AC C AAAC C C AC A AC CT ATTTGATCGAG G AAT CTGGCGTACCTCATTGCCACATGATGCTT C AAAC C AA
GCCTCTCAAATGCCAAAAGGACAAT ATTGGGT GAAGGCACAC ATTTT ACAAACGAATCAAGCAACCCT GACT
GATCTGTATTGGTATCGAAAAGATAATGATGTCTGGAAATCCGCAACACCTCTTAGC CTTTCAAATAACATGAA
ATTACCCGCAAACGGTATTCAGATTTTTAGCCCAACATCTCATGATGTTCCAGTTCG ATACGGCTACCTAATTA
CTTCATCTTTATTCTCATTCCTCAAGAAAGGACGCAATATCACATTAATTTTAGCAG GAGATAGCTGGGAGGG
TAATCCTGAAAACATCACCGCTAAAATCAGTTCAGGAAATCACTGGTTAACACTATC CGTCGAATATCTGAGT
AATACTAATAGTCTTAAGTTGCAATTATCAGATAATAATAATGATCCCATCAGCCCC CCTAATGCTCTGGATAA
TATGACGTTTGACACGCCATTGTTAAAACTAGAAGCCACTCAGGATTTCACTTTGCC CTGGATTTATAAGGTAT
GCGTTAATAGCAACAATATACTCTCTACCTCTGACAGCTCAGATGCAGCGATTACTC GTTTCCCCTTTGGCCA
ATCACCATCGTTGGGTTCCAGCTTTAGTCCGAAAATCGTTTTCCCGGAATGGTTTGA ATCTGAATACGCATCA
GACACCACGATCACGATTACCCCTCAATGGGTTAACCTGCCCACAGAAAACTTTTCA TCGTGGTATGACGGA TATATTAATAAACCTGCCGATAATAGCGTATTTAAAATAGAGGGTTATTTACTTACTCAT TATCAGGGAAAAATC
AAACT CACAGAAGCTGAGACAGGAAGC G AAAC C C AAG C ATT ATT C AAT G G AAAC AAT GCACCACAAG G AAAA
AGCCTGACTTTCACTTTACCTAATAGGTATAACTTCTATCCGCGCAACCATCAGTCA ATGAAGATAGAAATAAA
ACTCGTTAAACAAGACTTTATGCACACTCAACATAAGAGCAATCCCACAGGCAAAAA ACCACCCTATACCCCG
CAAATCAGTGCCTTACAGGTGGAATTCAATGCTACAGCTTTCCATCGAAAATTCTCC GTTTATCCTCTCACGC
CTTTTGGCTGGGGCAAAACAGGAGAAAAT AGCACACCATT AATTCAT GAT ACATTTT ATTT AGGCTT GACCG A
TATATCACCAGAGCAAACTTTTTCTCTGTATTGGCAGCTAAAGGGCCTTAAAGAGCT ACCTTTGTCTTGGTTTT
ATCTAAGTGAAGAAAATAGCTGGAAATCATTAAATAGATCAACTTACAACCAAACCC ACAACCTGTTTGAATCA
G C AG AAC AAAG TATC CT ATT ACCACGGGATGCTT C AAAC CAAGCCTCT C AAAT GCCATTAGGACGGT ATT G G
CT GAAAGCACAGAT AGAACAGGAGAAAAAACAG AT AAAGAT AGCGCTTCCT GATT ATT ATCCAAGAATCAGG
GGGCTGTTGTATAACGCTACCATCGCCACTTTAATCAACGCTGAAGCTGTTGAGCAA TCTCACCTTATCAACG
GATTGGCTGCTAACAACATTAAACAACCGGTTAACTCATCCGTTGCCATCAACGAAG TTATTCAACCCTGGAC
ATCCTGGAACGGTCGCCCAAAAGAAACCGAGTCAGCATTCCTGGCACGAGTTCCTGC CCGGCTCTCTCATC
GTAACCGAGTGCTAAGCTGGGGTAACATTGCCACTTTATTAAAAGAGAATTTTAGTA GCTTATTCGATGTCAA
ATACCCTTCTGTCAGTGAATTAACCAAAATTCCAGCGCCAGAAAAGCGACAATTAAC CATCATCCCCGACAAC
CGCTAT AAAGAT AATGATGATTCACTACGCCCAGTATTGAACCAAGCCAGACTGACCGAGATGGTCGAATGG
TT AGATCGATT AAGT AGCCCTTGGACAACT ATT GAAATT AAAAATCCC ACAT AT GTT AACGTTCTG ATCCACT A
TGAACTGATATTTACCTCGGATGTTAACCCCGATTATGGCCTCCATCAGCTACAACA AGAACTCAGTCGAAAA
TATATGCCGTGGGGAGAAAATGCAGCTATTGGCGTAACACCCGGTAATCGTATTGAC TACTTCCAGTTATTAG
CCTCAATTCAACAATCACCGCTGGTTGAACGGGTCACCAACTTAACGTTAAAAAAAG GCAGCCAGCCTACCG
TAAGTGAAAGTATAGAAGCCGCCGATGATGAAGTACTGATTTTAGTCTGGTCATAAA AACTTCCCCAACCTAA
GGAATTAACAAATGAATAATCGAGATATGCTATTTCCTATCATTAAAGACGATATTA CCTTTGATTCTTTATTCG
C C C AG G C AAAAG C C GTT ATT G AAC AAC AAT CGGGGCAGCTCTG G AAT AAT AC AG G T G AAAAT GATCCCGGCA
TTACTTTATTAGAAGCCTGTTGTTATGGCGCATCCGATCTGGCCTATCGCCACACAT TGCCACTGCGAGATTT
GCTTACTCCTCAAGAAAATGAACGAATAGATGATGGCATTTTTCCCAAAGAATTTGG TCCACAACAAATACTG
ACCTGCGGCCCAATTACCGCGGAAGATT ACCGTCGAGCTTT GTT AGATTTGCGT AGT GAT AAC ACCGTT GAA
GGTTATTTTTTCTTTAATGATGCACAGCTCATTCGTGAACCGGAAAATCAACGCTAT TCATATTGGTATAACAA
AGAAAAACGCGAATAC AGTTTT ACTCAAG ACCAAT ACAGCGAACAATT ACAGTT AACACT GAG AGGAAACT AT
TGGCTCTATTTACTTCCCAGTCGGAAAACCCAGCTCGATAACACCCTGGCTGAAGAA AGACTCAACATTTTTC
T GAAAGAT AACCGAAACTT AGGAGAATCGGTCAGT AAAATT ATTTGGCT AGAACCCATT AAACT GTCATT GAA
AATTGATATTCAGCTTGATGATGACGCCAAAGATATTGCTGATATATTTGCTAAAGT TTATATGATTGCAGAAC
AAATGGTGCTT GAAAAACCATT ACGTT AT ACCACTCAAGCGAT GAAAGAACTGGGTT ACAGTCAGGAACAAAT
ATTTGAAGGCCCTTATTTACACCACGGTTGGATACCGAAATTACCTCAAACCAAAGA TTATACTCACCCTACC
GTATTAAATCTCAGTCCTTTAATTAATCAGTTACTGGCTATCAAAGGGGTGAAACAT ATTACCCAATTTACATT
GGATAAGCCTGATAAAAAAATTTCTAAGTTACCAAATGATAATTGGTCTTGGGAAAT CGCTCCGGGATATTAC
CCAAAACTATGGGGAGATACTCCATTAGAATTAATTACCTCACCAACAAGCCCACTC ACCATCACGGCAAAAG
GGGGAATT AAAATTGCT ATT ACT AAACAACAG ATAGAAAAAAACAT AAT GACAGAACCACT AATT AAT ACACAG
CCAGAATTATTGAACTGGGGTAAACATCGCAAAGTCCTGGATTACTATCCGATAAGC AATAAATTACCCGCTT
GCTATGGATTACAAACTAATACCCAACAACAGCTACAGTTGCATCAATTTATGCTGC CTTTTGAACAAATGCTA
GCGAAT AACTGCGCT GAACTTGCTTT ATTGCCAAGACT ATT AGCTTTT AAACAACGAGGAAAT ACGGT ACAT G
GCATTCAATGGCCTTTTAAAGAAAATACGGTTGGTCAACATGTTCATAAGGACATAG TATCTAATTTAAACAAT
AATGCT ACGAAAATCGAT AAT AATGCCGAT GACT ACGACAAGGAACTCGTT ATTCT AG ATT ATTT GTT AAGAT A
TTTTGGGGCTCAATGTGCAATCCCACGACTATCACCAGACCCACCACAATCATCATT AACAGAACCTCAGACT
AAAAAAGATTTTCTATCTACTCAGCGCGAATATCTGGCTCAACAGCCAAAACTGACT TATCAGCGTAACAATAT
TCGGATTGATAAAGTATCAGCACTGCAAAAACGTATCGCTGCCCGATTAGGTCTGGG AGGAGAATGTTTCAA
AGCAGAGCCTGACTTAGCTCACCTTCCTTTCTACCTCATTGAACATCGTAGGCTCTT ACCAGTAAAACCTGAT
AT AAAATT CTAT ATT GAG CAACAACCT AATT CT CTGGAAATT GAAAAT GAT AAATT AAAAAT C AC AC AGAAAG AT
TCAGCGGGTCGGTTACTGCAAGGTCAAGTTATTAACCTGGAATTTCGTGAGGGCTAT GATGAATTTACATTGC
T AAACTT AAT GAT AACT GAAGT G ACAAG AG AT AC ATTC ACC ATT AG C ATT AAT AAT AGC CGT GAT CT C AG AG AC
AATCTGGACAAAGTGCAACACGCGTTTGAACAAACGAATAATCTGAGCTGGCACAAT AGCTTAATATGGATGG
AAG AT ATG GATT AT C AATT GGTTTATGC C AAT GGAGAACAACTG G AAAAAG C G GAAAAT G AAC GAT G G ATT AC
CATTAACAATCAAAGTGCTTTCCCTGCTATGATCGGAGAGAATGATGAAATCACACT AAAAATTCAATCCGATT
AT GAACTT AAAACCAAAGTCGTGCGGCTT GATT AT AACAACAAAAAAATTCT GATT AT AAAAGATGCGACATCA
ATAAATAATTTTCCGCCAAAAAGAGAAGCATCATATTATTCTTGCTCTTCTCTAAAA GACAATGGGTACGGATA
TTCGG AT GAAT AT AAAT AT GAACTT ACTT AT ATT GAT AC AG ATT CT ACAAAAGAAAAT G AGT G CTGG ATT ACT AT
CAGCGATCCAAATAATTTGTTTTCTCCTGATATCATCGCAGAGAATGACGAAATTAT ATTGAAAGCTAACCCTA
ATT AT GAGTTTAAAACGCACGT AGTAAAATTT GATCGT ATT AAT AGACAAAT ATT ACTT AGGAAAAAT ACAG AC
CTGGAAAATAATTTTCCATCAGAAAACAACACATCGCACTATCGCTGGCATTTCTCT GGTGAAAAATATGCCC
AAACTGACCATTTTTCATTTGTTGTCAGTGCAGTACTGAATCGAGAATTAATTGAGA GGGGCACAGTCGATCT
CTAT AAATT AGAGTCTTGGGTAAAAACTGAGATTTTATCTGAATTACCCGCGCATATCTCACTCGTTAT TCATT
GGCTATCATCGGAAGAATTCGAAAAATTTGCCAGTACTTATAAAGTTTGGCAAAATA ATGGCGCTCCTTTAGG
TGATCACGCATATAAAATTCTAGAAACATTAACACTTGGGAAAAAACCTTCTACTTC AGCAAGAAGGTCCAGC
AGCTATATAGAAGCACAGTAAT AATT CTTACAGAACATT AAC C CAT ATTT ATCTTAT AAT AT C AAAC AT CAT AAA
AACAAT CTT C AGCT C ATT AT AAT G AC AT ATTT CAT ACT C AGGTTT CTT CAT ATCTGTT AATT ACAAAG AGAAT AT
TAATATGATCTCAGCACCAAATCTGTTAAATCGGATTATCATTACTATTGAAGCGAA TAACGCACAGGCAGCTA
AAAAAGTATTGCATGGCTCCCTGCTTAATCAATCCAGTATAAACAAACTCTTTGATT CATACTTTAACCAATAT GTTGTTAATCAGACTATCTACCTGAAGACACTCACCCTGAATCTTGGCGAAATACGATTA AATAGTTTTAATTC
AC AGTTT GTT ATTCGGCTT AAT ACT ATT CT G AGT CAAG CATT G AG CCAAT AT C AG GT AAAT AAT CAAACT GAT A
TT GAGAAATTT ATTT ATT ACTT AT ATCGAAAAG ATTCT AT ATT AAACCCAAT AG AGGAAATCAAT AATCGT GAAA
TTACTGACAT C AAT ATTAAG C AATT AATT AAC C AATT ACCCCAGAT AC AAAAC AATT GGACACTATTATTGGCA
AAAAGCT GTTT ATCCACACAT AGCCTGAAAAAACTCCTGGCT ATCAAAAAAACAGCTTT ATT AACCGCCATT AA
TCGTAAATTATCTGAAAAGATCAATATATCACCCTATCAGCAGGAATCGGTTTCCAC CTGGCAATTGATACTGA
ATGCGCTGAAATATATACAGCGACATAATACACAGGAAATACCTGAACCCGATGCGA AAGTCATATCACTCAT
TACAACGGAACT C AAT G AC AAT G C C ATT AAT AC AG C AC C AATT ATTGCATTATTTCGCCAAGTTATAACCAACC
ATTCCCCACT GAAT AAGTGGCTGGAACAACT GTGGCAAACAAAGCGAATTTCACAGTT AT GT AAAAAAC AGCT
GTCAATTGAAGAATACCAACATCTATCGGAGCGCTTTATTGCCAAACACGGGAATAA AAATAAATCTGATAAA
AAATCATCCATGACTTCCGAACCGCTGTTATTACCTGAACACCCTCCACCACGTCAG GTCAATAATGCTGGAA
TATTAGTTCTGTGGCCGATGTTACCTACTCTATTTAACCAATTCGGCCTGTTTGAAA AACAAAAATTTATTCATC
GTCAAGCTCAATTTAGGGCTGTTAATCTACTTGATTATCTCATTTGGGGAAACGAAG AAACACAGACAGAACG
AAAAATATTGAATTGCGTTCTGTGTGGGTTAATTGCCGATGAGGACACGGAATCAAT CCCTATTGAGCCAGAA
AAAC AAC AG GT AAT AG AAC AAT GGTTAGATGCAGTTATCAGT C A ACTT CCTGCCTG G AAAAAATT AAG C C G C A
ATGATAGCCGC C AATT GTTTTTACAACGCCCGGGG G AATT G CT G AC AAAT G AG C AG G AAAT C AAAATT AC G G
TACAACCTCAACCATTCGATGCACTGTTAAATAACTGGCCCTGGCCGTTAAATATCG CCAAACTTCCCTGGCT
GGATCGCCCTTTATTAATCAACTGGTAAAACATTGACAAGGTTTATATGAAAGAACA TCAATATAGAATAGTCG
ATCTACGCTGGATTTATTCCCATTTGGAGCGCATCGATCTGCTGTTACAACGTCACT ATTACCAAAAGAGAGA
CAAAT ACGATTCATTGCCAGAAAGTTTTTTGCTTGAAGAAGAT GAATT AGAACAACGTCT AGCAAAACCGTT G
GGTATTCCTCATTGGCTAACAGCAAATACCGGCGCTGGTGATACAGAAACAGAAAAT CATTCTGCTTCCGGC
ACATT ATCACTGCT AGTCACGCGTTTT AAACTCACT GAATTT GAACGT GAT GT GTT ATTGCT AGGTTTATT ACC
GCATTTTGACAACCGCTATCATGCGTTATTTGCTACTCTGCACGGTAACAGTAAAAA ACAGTGGCCCAGTTTT
GATTTAGCGATTGAATTATTTAGCCAACATCAAAGTAACTGGCAATTATTTCAACAC CACTTTTTACCGCAAGC
TC C ATT AAT C AAT C AC C ATTT ATT AC G ACT C AAT AACCAAGAGGAACCCATTTGGCT AC AAACT C AATTTTT AA
CTCACAATGCAGTCTGGTCTTTTTTATCCGGTCAGCGCGTCATTTTACCTCCCTTAA TATCCTGCGCTTACTG
GCATATTCCAACCTCACAGACTTGGTATCCACCAATCCTTGGTCATGCATTTGAAAA AATATTGCTGAATGAAA
CGGACGAAATACGCCCGCTGGTGGTTCTTAAAGGAAAACAGGACAGCGCCAGAGAAC TGGCAGTCAGTAAT
ATT ATGGGAATTCACGGCATT AAC ACTTT AACGTTCGATTT ATTTCACCTGCCAGAT GAAGAGTGCACCACCT
CAATACTCAATCTGCTAATAGATGCAATACGAGAAACCCGGCTACATAATGCCTGTT TATTAATCCGTAACTTT
TCTTTGCTGGCAGAGGAAAAGAGAATATCGCATAGAGAATTATCAGCTCTACTGAAT CAACCCAAATTACGTG
TGGTTTGTCTGGCAGAGTCAGAAGAATCATTAGCATGGGTTAAACACCTGCCGATAG TGCAAATTAATATGCC
ACCGGCGACGCTGGCAGATAAAAAAACGATGCTGGAAGCCAGTTTGCCAGATAATGT CACTAAAGGAATTAA
TATAACTCAATTATGTCAACGTTTTTCATTTACAGCAGAAACATTACCGTTAATTAT CAAGGAAGCTCATCAATA
CCAAATCCTCCGACAACCGGAAGATCAATTGAAAGAATCTGATCTACGTAAGGCATT AAATTGCCGCGCCCA
ACAAAATTTCGGT AAATT AGCCCAGCGT AT GACACCAAAACGAAGTTTT AAT GATTTGGTT ATTTCCGCT GACT
TAACTCAACAGTTGAAAGAAATCATCGCAGCAATTAATTACCGTGACCAAATTCTGG GCGCAGGTTTTCGGGA
AAAAATCAGCTATGGTACTGGTATTAGCGCCCTATTTTACGGTGAATCCGGGACGGG GAAAACCATGGCCGC
AGAAGTGATTGCCAGCTATCTTGGTGTTGATCTGATTAAGGTAGATCTTTCTACCGT GGTGAATAAATACATC
GGTGAAACCGAAAAAAATATCTCCCGTATTTTCGATCTGGCCGAAGCGGATTCCGGG GTGCTGTTTTTCGAT
GAAGCCG ATGCCTT ATTCGGT AAACGCAGT GAAACCAAAG ATGCCCAAGAT AGACATGCCAAT ATTGAAGTTT
CTT ATTT ATT ACAGCG ACT AGAAAATT ATCCGGGATT AGT GATTTT AGCGACT AACAATCGCAACCATTTGGAT
AGTGCGTTTAATCGCCGCTTTACCTTTATTACCCGCTTTACTTATCCCGATGAAGCA TTACGCAAAGCAATGT
GGCAGGCAATTTGGCCTGAACAACTTAAGTTATCAGATCAACTTGATTTTGAGCATT TGGCTAAACAGGCAAA
TCTGACCGGTGCTAATATCAGAAATATTGCCTTATTATCATCAATATTAGCTACAGA TAATAATAGTGATCAAAT
T G AAAAT AAAC AT ATAGCGCGAGCATTGATACTT GAATT AAAT AAAAC GGGCCGATTGATTTTTT AAT C ATTT A
TACCCAATAAATTTCGAGTTGCAGCGCGGCGGCAAGTGAACGAATCCCCAGGAGCAT AGATAACTATGTGAC
TGGGGT GAGT GAAAGCAGCCAACAAAGCAGCAACTT GAAGGAT GAAGGGT AT AT AGAATTGGAGT GAAT AT G
ACAAAT AT AATT AACCCT AAT AATGCGATTCTT GAAGTT AAT AACGCATT AAAT GAT ATTTT ATCTCAGT ATTT A
ACTAATATTGATATCCGCTTTGATCTACCAGAAATAAATTCAATCCCATCAACCCCT ACAGTGAGTATATTTCTT
TATGATATACATGAAGACCTACAATTACGTTCTGCTGAACCAAGAAGTTATCATCCT ACCACCAGCTCATTATT
GCCGGGATGGGTAAATATTAATTATAACTATTTAATTACTTACTGGCATTCAAGTAA TCCATCAAGCGACAGTT
CTACCCCTGATAGTCAACCCAATAATCAAGCGGCACAAGTCATGACTGCTATTTTAA ATGCATTGGTTAACAA
CCGACAATTACCTAAAATTCCTGGCGCATATACCAGAGTCATTCCACCTCAAGAAAA TCTAAATAGCTTAGGT
AACTTTTGGCAAGCGCTTGGCAATCGCCCTCGCCTTTCTTTATTATATTCAATTACC GCACCGGTAAAACTGC
AAAAT ATTAAAGATGTCATAAAGCCCATTAGCCAAATTTCCACTTCTGTGGATCAAAAATCAAAT CTGGAT AAT
TCGCAAATCAACCAAGCCTTATTTAGCAAATTGGGTGCCGATTTAGGTGGCACACAA GATGTTCGTCTTGCTC
TTGCGAAAGT GAATCTG ACAACCAAACCTGCT AAAGAAAAT AAT GAAAATCAAAAT AAT AAAAAT GT AATT ATT
GAAGTTTCTGGCATT ACCCATTTGGATT ATTT ACCCAGAAT AAAAGGT ATTCTTTCAACATGGGT AAAT AGTCA
TAGTGCTGTTGTTAGGATAAATGATATTGGTATTATTGTTTCAGAATATAAATATGATAAATTAACAGGCGTTTAA
SEQ ID NO: 95 (Photorhabdus asymbiotica strain ATCC43949 PVCPaTox operon, pvc1 - pvc16)
ATGAATACAGCTCAAGAAATTATTAACCGTTTATCGGGGAGAGCCGTTACGCTTGGTTGG GATGTTGTTATTG
CTT AT GACCGAAAAAAAATT AACACTCTGTT AGAGCAACAAT AT GTT GAAAAGGT AAAAAACGGGGAGAACTT
CCCGCTTATCAACTGGGAGAACCAGAGAAAAACACTTCAATTTAAAGATCTTCAATT AGGTGTTCCACTTATTT
CTTTTGAGAATTCAACACTGGAAAATTCAAGGGCGCTTGCCACGATAGAATTTATTT CAGGAGCTATTATTGAA
TTT AGT GACTCCGGGCAAATAATCAACT AT AAGAAGATT GAACCT AGTCATGGTT ATGGCATGGTGCT GACT A
TCGATCTCATGGCTGGTACAGGTTCAGTAGAAGAACAAGGTCGGGTGATAATAAATC TTAACGAAGGCGCCA
TACTCGATTTGCATGTTATCCAACAACCGCCAGCAGAAGTGGTAGAATTTTTCCGCA CTTGGTTGATGGCTAA
TAAAATGACTTATGAATTAGGTAAGCTGGATCTGAGTAGTCAAGCTGGTCTAGTGCC TCGTTCTTTTCGTATTC
GTACTCAGCGGGCGCCTGAAAAAATTCGTAAAGCGACGAGCGATGAAGGAAATGGCG CTGTTTTGTTGTTTG
TTGCCACTAACTATAACCCTACAAGTGGAACTTTACCTGCCAAGGATTATCCGTGGC TAATCCCTGAGGAATA
TTCAGGCGCATTGCTTATCGGTAATAAATGCTTATTTAAAGACATTCTGAAACCGAA TCTGGATCAGTTGTTTG
ATAAAGGGGAATGGACATTAAAAGTTCAGCAAACGGATTCTGATCAACTGCTGCATT ATCTGGAGGCAAACTC
TGCAT AT AT AACAGAT AAGCCTT AT ATGGCAG ACTTT GAAGGAACTCAGGATGGAGTCTGG ACAGGACGTT AT
AAATTTGAGACTGGCCGGGGACATTATGGGGTGTATGAAAATGTACGCTTTCCTATC AATGGAATGTTGATGA
AACCGGCT AAAACTGGATT ACAGTT ATCAAT AGATTCACC ACAAAGCCATCAATTT AAT GTT GATTTCGGAAT G
AAGTGGTTCCATTGTGCTAATATAATGTGTGGTTATTCCTGGTTTAACGAGACTTAC CCATTTTATCTTGATGG
AAAATCATTTTATCAAGTTCATATTGACCCTGATAAAGAGGTGATTTATTTTACTGG GCCAGATGAAGATATTA
ATATTGTAGGAAATTACAGCCCGCCTGCGTGGTGGCAATCTAAATGGCAAAAACATA TCAGTGATGATTTTAC
GGATATTTCCTCG G AAAAATTT AAGCGACTCAGT C AAAT AAAATT G C C AG AAAT ATGCATGTTTGCCGTGAAC
CATTTATTATTTCCTGGTCATAATACTTTGCTGTTGAAAGACGTTTATTTACCGGGT GATATGGTGATTTTCGG
TGATATTAACCCATCACTTACCGCTTTTCGGGTTACGCCATTAAAAGCAACAGTGGT GGCAAAGGGAACCCAA
CAATTTAAAGCCATAGAAACTAATTGATGATTATACCCTTCATCCTTCAAGTTGCTG CTTTGTTGGCTACGTTC
ACTCACCCCAGTCACATAGTTAGCTATGCTCCCGGGGATTCGCTCCCTGGCCGTCGC GATGCATCTTGAAAT
CCAT AGGGTAT AT ATTT AATTGGAT AAGTCTTTTTT ATTTT AACATT AT AACCT GATTCTTTTTGGAT AAAATT AA
AGGATTATTAACATGTCTATTACACAAGAACAAATCGCTGCTGAATATCCTATTCCT AGTTACCGTTTTATGGT
TTCT AT AGGAGAT GTGCAAGTCCCTTTT AAT AGT GTTTCGGGATT AGAT AGGAAAT AT GAGGTT ATT GAAT AT A
AAGATGGCATTGGT AATT ATT AT AAAAT G C C AG G AC AAAT ACAGAGGGTTGATATTACACTTCG G AAAG G C AT
ATTCTCTGGGAAAAATGATTTATTTAATTGGATTAATTCCATTGAACTCAATCGGGT AGAAAAAAAGGATATTA
CAATTAGTTTAACTAATGATACTGGCAGTAAAGTCTTAATGAGTTGGGTTGTTTCGA ACGCCTTTCCGAGCTC
ACTGACGGCCCCTTCATTTGATGCTTCAAGTAATGAAATTGCAGTACAAGAAATTTC ATTAGTTGCTGATCGG
GTAACAATTCAGGTTCCCTGATAACTAAAAACTTTAAGGAAAAATAATGTCTGTACA AACAACTTATCCCGGAA
TTTATATTGAAGAAGATGCATCATTGTCTCTATCTATCAATAATAGTCCAACAGCAA TCCCTGTTTTTATCGGTA
AATTTTACAACTTGGATGGTTCCTTACCTAAAGTGGGAACATGTTCTAGAATTACCA GTTGGTTAGATTTCACT
AAAAAATTTTCGGTAGCTCCTCCTCAAACCATTTCATTGATCGCGTCGCCAATTGCT GACACACAAGAAAGTG
TACCCAAAGCAGTTCAATATACTTATAAGGCCGAGTTTGAAACCTCAGAAAATCTGG CAAATGGTGCCTATGC
GGT ACAACATT ATTTCCAGAATGGCGGTGGT ATTTGCT AT ATCAT ACCTTT AGTT AGCGT GAAAAAAGAGG AT
GCTGCGATT GAGTTAACAAAATT ACCT GAATT AATT GAAAG ACAACAAGAGATT ACGTT AATCGTCTGCCCGG
AGGACGATAAGACGCTCACTGTTGATAGCAGTAAAAAATCGGATGTTTATAACAGCA TCAATACATTATTGAG
TAATAAGGTAGGTTATTTTCTCATTGCAGATTCAGATGATGGCAAAGCAGTTCCTGA TACGTTGCCGGAAAAA
ACTGCGGTCTATTATCCTGGTTTACTAACTTCTTTTACACAACGCTATGCCCGACCT GCCGATTCTGCTATCAA
AGTGACCGGTATTACAAATATATCAACTCTGGCTGATATTCACACCAACTTGGCCGA TGACTACTCAACAGCA
AGTCAGGTTATTAATGATGTTTTGGAAAAAAAT AAT AAGCTCGCATCGTCTCCCATT ATTTT ACCTCCCAGCGC
CGCTGTTGCTGGTGCTTATGCCGCTGTTGATGTGAGTCGTGGTGTTTGGAAAGCACC TGCGAATGTGATGTT
AAGTAATGCCACGCCAATCATTAGTATTTCCGATGCGGAACAAGGTGTGATGAACCC ATTAGGTATTAATGCT
ATTCGT AGTTTT ACTGGT AGAGGT ACTTT GATTTGGGGAGCTCGT ACTCTGGAT AAAACGG AT AACTGGCGCT
AT GTTCCT GT ACGTCGTTT ATTCAAT AGCGCAG AGCGAGAT ATT AAGTT AGCAATGCGTTTTGCAGTTTTT GA
GCCTAACTCCCAACCAATTTGGGAAAAGGTCAAGGCTGCTATCAATAGCTATTTGCA GTCACTTTGGCAGCAA
GGTGCACTGCAAGGCAAT AAACCCGAT GAAGCCTGGTTT GT ACAAATTGGT AAAGGCGT GACCAT GACAGAT
G ATG AT ATT AAG AAT G G GAG AAT GATT AT C AAAAT CGGCATGGCGGCAGTACGTCCGG C AG AATTC ATT ATTT
TACAGTTTACGCAGAATATCGCCCAGTAACTTAGGTCTATACCCTATAGATTTCAAG ATGCATCGCGGCGGCA
AGGGAGCGAATCCCCGGGAGCATATACCCAATAGATTTCAAGTTGCAGTGCGGCGGC AAGTGAACGCATCC
CCAGGAGCATAGATAACTATGTGACTGGGGTAAGTGAACGCAGCCAACAAAGCAGCA GCTTGAAAGATGAA
GGGTAT AGAT AACG AT GT GACCGGGGT GAGT GAGTGCAGCCAACAAAGAGGCAACTTGAAAGAT AACGGGT
ATATTTAATATGGGCGATTTATTGCCCATTTTTGTGAAAGGAAATGAGTTATGTCGC CAACGCTACCCGGTGT
AACGATGACTCAGGCGCAGATAACAGCGTTCGGTGTCAGTACATTAAATATGCCCGT ATTCATAGGGTATTGT
ACGAGATTGCCTGCCTTTTCAGCGCCT GT AAAAGT AAACAGTTT AGCT GAAACAGAACAAAT AAT AGGGAAAG
AAGGGCGTTTGTATGCTCTATTGCGCCACTTTTTCGATAACGATGGGATACAAGCTT TTATTCTGTCGTTAGG
CGCACCTGCTGGGGAAAATGCT AAT AGTTGGCTT G AGGCATT ACAACAGCCCGATTT GTATGCGGCT GTTGC
AGCAGAGCCGCTAATTACACTTTTAGCCGTCGTTGAGGCAAGTGAACTGAACCAAAA AGAAGGTAATGAGGC
T GTGGAAGCTTGGCG ACAGT ACTGGAAAGCAGTATT AGCGTT AT GTCAGGCACGC AGT GACTT GTTTGCCAT
ATTGGAGGCACCAGATGATACCGCATTAATCAAGCGTAGTTTGCAGGATTTTCATCA TAAGGCACGTCAGTTT
GGCGCTCTCTACTGGCCAAGGCTAGAAACATCTTATCAATCCTCTCAGTTAAAAATT TTGTCTCCTATTGGTG
CAGTAGCAGCGGTTATTCAAAGTAATGATGTCCGGCGAGGGGTAGGACATGCACCTG CCAATATAGCGTTAA
AACAGACGATTCGCCCGATAAAGTCCCGCCTGGAATTAGAAGAGTTGTATGAAGAAT CGGATGGTTCACTGA ATCTGATTTGTAGTTTTCCAGCTCGTGGTACTCGTATTTGGGGATGTCGTACGTTGGCGG GTATTGATTCACC
TTGGCGTT AT ATTCAAACCCGATT ATT GACTTCACACGTGGAAAGGCAACTCAGCC AGTT AGGGTGCAT GTT G
AT GTTT GAACCT AAT AACGCAGTCACTTGGAT GAAGTTT AAAGGCCATGCTGGGAATCT ATT AAGGCAGCTTT
GGTTACAAGGGGTGCTGTATGGGCAGCGTGAAGATGAAGCCTTTTCCGTTGAAATAG ATGAAAACGAAACGA
TGACTCGCCAGGATATTGATGAAGG C AG AAT GATTGCTCGTATTCATTTGGCATTGTTAGCACCGGCAGAGTT
TATCGCTGTGACTTTGAATTTTGATACTCGCTCAGGCATTGCGACGAGTACATAATA AATCGGAATATCTCCAT
GACACTACCAGCAGAGCTTTATACCCCAGCGGTTTCACATCGTTTTATTGTT AATTTT CTTTTT AAAG G TTT AC
TTCCTTCTCCCGTAGATATTCGATTTCAACGTGTTTCTGGTTTAGGGCGTGAGTTAC AGGTTGAACAGCGCCA
TCAGGGGG G AG AAAAC G C AC G G AAT CATTGGTTGGCTGAACGTATACAGCAT AAT AGCTTGATATT AG AAAG
AGGGGTTATGGTCGTTACCCCTTTAACACTGATGTTTGATCAGGTGATGCGGGGGGA AACTCTCAATTGGGC
AGATGTGGTAATTATTCTTCTCGATCAGGCTCAACGTCCGATAACAAGTTGGACCTT GAGTCATGCGCTACCG
GTTCGCTGGCAAACAGGAGATTT AG ATGCCAACAGT AACCAAGTGCT GATT AACACCTT AGAGCTGCGTT AT
GAAGATATGCGCATTATAGGGGTAAAATTATGACTATCGAAATCCGTGAACTCATTG TTCAAGCCCGTGTTGT
CGGGACT GAT ACCAAAACAACACGAACCGTTCCTTT ATCT ATT GTGCAAATGGAAACACTT AT AGAACAACGT
CTGGTTGAAAAAGTGAAGCGGGAGATATTAGACGTACTCCGGGAAGAACAAGGTGGT GGGTTATGAGCTTG
CTT GAACGAGGTCTGGCT AAACTCACGATT ACGGGTTGGAAGGAGCGT GAGCGTAAACATCAGATTGGT AAA
CTAGAAGCAATGTATAACCCGGAAACACTTCAACTGGATTATCAAACTGATTATCTC CCTGATGTTAGCAATAA
TCAGGTAACAGTGAGTAACCGCTACGTTTTGTCAAAGCCCGCAGGGTTAACACTATC CTTGTTATTTGATGCC
AATATGGCTGGTCTTACGACAACCGTCGAGTCCCAAATCACTACCCTCAAATCGCTT TGTTTAGTTAATGCAA
GTACTGATGAACC C AATTTTTT G G AAATT AATT G G G G G G C AAT G C GTTG G G AAAAT AAAAATT ATTTTG TTG GT
CGGGCTAGTGGATT GTCTCT GACTT ATTTGCGCTTT GATCGT AACGCAACACC ATTGCGT GT GAGTGCGCAG
CTCACATTAGTCGCAGATGAAAGCTTTGTGCTCCAGGATAACCAAGCCAAGTTAGAT GCGCCGCCGGTATCA
GTAGTTAATGTCCCGGATCTGACTTCATTACCTGCACTGGCGAATATCGCTAGCGTA ACCACTATGTTGGGA
GTGGATTATTTAATGTTAGCCCGCACCAATGATATGGATAATTTGGATGATATGCAG CCAGGTCAGACATTGC
GAACACCGGAGGCATCATGAGTTTTTTAGATAACAGTAACTTCAAGCCATCAGATAT CAAACTGTTCGTTAAC
ATTCAGGGAGTGGAGAAGGAACTCAACGAACTGATAGTAAGCGAATTGAAAATCTCC CGACGTATCAATGCC
ATTCCGCAGGC AGTT GT AAAGCT AAGAGCGAAAG AGAGT GAAAGTGGTGT AT ATC AGTCT GAT GT ACAGCGG
ATGTTGAAGAGTTGCCGTCCGGGAGTAAAGGCAGAGCTTCGTATTTTGAATACCCGG CTATTCAGTGGCGAT
ATT GTGCAGCAAAAAACAGAGTT AGT GT ATGCGAAAACACACACT ATCAAATTGGTGCT ACGCCAT GACTT AC
AGCGCATCACCGGTAATTTTCGTACCAGAGTGTTTGCGAATACCCGTGATCGTAAAG TGATAGCCGATCTATT
GAATACCGCAACATTAAAGCCGGCATTTTCGGGGACATCACATTGGGATATAGATCA TGAGCAACTGGTTCA
GT ATCGTTGCAGT GATTGGCAATTTTT GTTGCAACGGCTCT ATGCT ACGAAT AGCTGGTTGTT AGCT GAAGAA
GAT AAAGAT AACACTCAGGGGAAAGT GACCATT ATTGCTCCAAATTCTTTGCCCCT GAAT GAGCGTTGGACAC
TGCAACATCAGGCTGATCATCAGGCTATCCGGCTTTACAGCACGGAGCTGATGCTGG ATAACCGGTTTGATA
CAGCGG AGGCT GTT GTT AGTGCTTGGGAT ATT GAT GATCAGGCATT ACTCGTGGCGTGGAAAGAAACCCTT A
GTCAAGTTGGGAAAGATGCGTT AGCGTCAGAT AATTTT AGCCAGACAAATAAAGATTCGAGTGAACTGTT ATT
AAGTTGTCCGCTCTCTACAAAAGAAGTTCAATTTTTAACGCGTAGCCAATTAGTCAT GCGGCGCTTGACGGCC
GTTCGTGGTTCACTGAAGGTTGAAGGCAGTACTAAGTACCGTTTAGGGCATGAACTG ATGTTGTCAGGTTTT
GGTGAAAATATGGATGGCTCACAAATACTGACGGGAGTGGATCATCGAATAACGGCA GAAGAAAGTTGGAAA
ACAACCTTACATGTGGGATTAGAACTGCCGTTAAAGGCAGAGTATGTCACTCAGGTT AACGGTGTTCATATCG
GCAAGGTTGCTGATTATCAATCAGATAGCAAAAAATGGGATCGTATTCCTGTTTTGA TCCCTGCATTTGGAAC
GAAT ATTCCCTT GTTTGCCCGATTGGGAAAACCCT ACGCCAGCCACCAAAGTGGATTTT GTTTCT ATCCT GAA
ACGGGTGATGAAGTCATTCTCAGTTTTTTGGAAGGGGACCCTCGTTATCCTGTCATT ATTGATTCCCTGCATA
ATCCTAAACAACAGACTCCATTGCAAATCAGCAAAGAGAATAATCTCAAAATGTTGA TGATTAAGCAGAGCGA
TAAAGATGAGCAACAATTGTTATTTGATAGCCAGCAACAAACAGTCGCGTTAATCGG TAAGAAAAATATCGAG
GTTAAAGGTGAGTATATCAACCTGACTAAATCAAAGGGGACTCGATAATGGCAAATA CGCTTATTGGCCAGGT
ATATGGTCAAGGATGGGCTTTTCCCATT AAATTT ATTC CTG AT AAT AAAG AAAC C G C AG AT C AAAC AG C C G GT
ATTGTTATGGCTCAAGGGATTGAAGATGTCAGTCAATCGCTGGAAATATTATTTCTT ACCGAGCCTGGCGAAC
GAATTATGCGTGAAGATTTTGGTTGTGGTTTACAAGATTTTGTTTTTGAAAATATTA GTGATACGCTAATTTCTG
CCATCAAAAATCGTATTCAGCAAGCAATATTACGTTATGAACCTCGCGCATATTTAT TGAACGTTGATATTCAA
ACCAAAGAAAACCAACCTGGACATCTGCTCATTCAGATTAATTGGAAATTACGTGGT AGTGATATATCTCAGC
GTTT AG ACGGAGTGCTT AGACTCCATTCAGGTCAAGCATTGGAACTGTT AT GACCAATT AT ATT ATT ATCGAC
GGGGATCTCATTCAAATAAATCCCAAATTTGAGGGTGATCGAACTCTTACGATTAAT GGTATTCCTAAAATAAG
CGGGAATGGAGATGCGCAAATTGAAGGAAAAAATATTTGTGTGTCAGGTGATCACTT AACTGTCTCAATTCCA
GCCATTT AT AT AACCTCCAGAC ATCCT GTTGCAGGT AGTGGAAAAGT GAAAATT ACAAATTT ATCT GACGACC
AACTAGCAGAATTTTGTGTTAGTGGGGATGTTGTGATTATTGAAGGCAGTCAGTTTG AAGCTCAGTTTACACC
GGATAAGCCGGCCACTAATCCAAGTAACCAAGATGCAGATAATCCTGCGCCTTCGAA TGGGAGTGGGAGATT
TATACACTCACAGAACTTCGTTAAGGCAGAAAAATAAAAAATTTTGCCGAAGCGGTT AATAAGTATGAATAAG
CGGGGCGGATAAAAACATGGATCTTGCTGAATTAAATAATACGTTGATGAATGACTT ACCAACGACCAATTTT
AAGTT AG AAAC AAAG G AC CC ATT AACGCAATT AAAGTGGTT ACAACGTT AT ACAGAAAAT ATTCGTTTTT ATGC
GAATGATGATTATTTCTGGCATCAATTCTGGTTCTTAAAAAATCACACACCAGAAGC GCTCTTTGCTCGTTTGC
AAGGTGAAACGTTGGCTGATGGAGAATTGCCTCCTCATCAAGCGCTATTGCTGGCCT TTTTACAACAGCTTAA
GACGCCAGGAATCATGCTTGATACTTTTTCAGCCCGTCATCGGCAATTGTACTATCA GGAATTGCTAGGGATA
ACGCAGAAAG ATGCACAACCT GATCAT GTGGCGCTTGGCGTGGTATT AAGT ACTGGTATTGCAGAAT ATTT AT
TACCGACAGGCACATTAGTGGATGGTGGACAAGACAGCAGCGGAAATTCACTGCAAT ATGCGTTGGATACCG
ATTTATTGGTTAATCCAGGGCAATTAACAGATGTTCGCTACAGCTATTTGGATCATA AGACCTATAAAATCTTC ATCTTGCAAGATGATAAAGCGAATATCAGTTGGCCCTCTTCAGGCGCTCGTTTATTTGTA GCACCTGAGGGCA
ACGGACAGGAAAAGGCACCT GAACAAAAGTTGGCACTTT ACCTGGGATTT GAT GAT AT ACAGCC AGGGCAAA
CTCTTTCTTTATTTTGGCAATTCATTGCATCAACTCCCCTGACATTAAAATGGTTTT ATCTGAACGAGATAAATA
ACTGGGTGAAGCTAGATAGTGTCAGAGATAACACGGATGGCTTTTTTATCAGTGGAT TATGGCAAGCGATATT
ACCT GAT GATGCGGT GAAAAT GT ATTTTCCAG AGACAACTTCT GT AAAACGCT ACTGGATT AAAGCT G AGGT G
GAATCGCTTACTGAATCTGGCGATTTGTGGCAACCGCTATTAGAAGGCATCTTGTAT AACGCTCAAACAGCAA
CGCTGGTT GATGCAGACAACACAGAT GAAAAGCACTTTC AT GATGGGCT GATGCCTTTT AGCGTGCAGCATT
TGGTCAACACCGTTTCAGAGGTAAAAAAAATTGAGCAGCCCTGGTCTTCTTGGGGGG GAACGCCACAGGAA
GACACTACTGATTTCTTCCATCGAGCGGCAACACGTCTTCAGCATCGCCAGCGTGCG TTAACTTGGGATAAC
C AAATT GCCATGTTGAAGGCT G AATTT CCGCGGATTTATGATGTCATCTCAC C AAAT ATCACGTGGATGAACC
AACTTCAGACATCAAATACGCAAACGCTGATCGTTATTCCTGATGTGAACTACAGCG ACAACAAGGATCGCTT
ACGGCCACAATTCAGCCCTGCCAGCTTGCGACAAATGAGTGACTGGTTACAGATTCA CACTAGCGCATGGGC
GAATCCACAAGTGGAAAATCCAATTTATATTGATGTCTCTGTGACCTATGAGGTGCA ATTTAGTGCGGGTGTG
AATCCTGATTATGCCCTCCGGCAATTACAACAATGGTTGAGTTCAATTTATATGCCA TGGTATCACGCAGATA
AAAAAGGT GTTGCCGCTGGCGATCAAATCG ATTTTT ACCAACT GTTTGCAGAT ATTCAGCG AGT ACCTT ACGT
GGAGCATGTCAAAACATTGACATTGACCACAAAAGACACCTCATTAACCAATGGCGG GGTTATTAAGGCACA
GCAAAAT GAAGTGCTGGT GTTGGT ATG G CAACAAG G AG AAC AAATT AGGCAGGGAGAATCGAAATG AGGCA
GCAT AAT GAGTT ATTTCCT GT AGT AAAAGACGCGAT AAGCTTT GAAAACCTGCAAGCTCAGGGT GAGAAGGTT
ATTAGTGATCAGTCCGGTAACATATGGAGCGATAAAGATAAACATGATCCTGGTATA ACATTACTAGACTCTTT
AAGTTACGGTGTTTCGGATTTAGCGTATCGGCACTCATTACCTTTAACCGATTTATT AACCATTGCTGGAAAAG
ATACGCTTTTTCCAGCC G AATT CGGGCCACAGCAGACGCTAACTTGTGGCCCTATAACACTGGATGATTACC
GGCGTGCGTTACTTGATTTACATGGTAATGATGCATTTAAAATATCAGCTAGTGACC CCAGAGACTTTTTGTTT
CAGGAT AT ACAGTT AATTT GT GAGCCAAAAAGTAAGCGTT AT AAAT ACT ATTTCAATCCCGAAACGCTT GAAT A
TACATTCACGCCACCTTCAGGGGATAAATTTAAAACTTTAACACTACGAGGGAATTA TTGGCTTTATTGGATAC
CAACCCGTTGGGCAGGTAAATCAGCTAATTTGCCGTTAGTTAAGCGGGTGATGGAAG ATTTTCTCCGTGAAA
ATCGAAATTTGGGGGAAAATGTTGTTCAAGTGACACGGGTGATATCAACGCCTATTT ATCCTGAGCTGGTCAT
TGAGCTGGCGGATGATATTACAGATGCGGCATCAGTATTAGCATCAATCTATATGCT ATTAGAACAGTGGGC
GATGCCGATGCCTGCTCGCTTTACTACCGAAGCATTACAGGCCAAGGGATTAACAAA CGAAGAGATCTTTGA
TGGGCCGTGGTTGCGTCATGGTTGGATACCTCAGTTACCGACCTCTCAAAACTACCA TACAGGCATGGTTCT
GAAGATGAATCATCTGATTAACCAATTGCTGGCGGTTGAAGGTATAAAGCGCGTAGT TAGCCTGACGTTGCC
AGAAACAGAAT ATTTGCATCAGAT AAAAG AT GAT AATTGGTCCTGGCAATT AGAT GTTGGTT ATT ATCCATT AT
TATGGGGAGCTAATCCACTAGAGGTAATTACAGAGAAAAATAACAATTATGTCAAAT TGTTCGCAAAAGGTGG
GGTACGATTACAACCTGATCAGAAAAGTGTTGAGCGGTTATTATCACAGGAATCACT CATTAATAATGCTGCA
TCCACGTTACCGGCTGGTAAGGTGCGTGATCTCAAAGCCTATACACCTATAAGCCGC AGGTTGCCTGCCTGT
T ATGGTTTGCAGAAT ACTTTGCAAAAGTT AAAACCT GAACAACG ACACTT AT ATCAGTTCCT ATT ACCATTGGA
GCAAATGCTTGCTGATGGATGTGCGCGGCTTGCATTTTTGCCACATTTGTTAGCATT TAGGGACCGAAGCGG
AAATATCAGTGATACACTCTGGCCTTTCAAGAATACAGAGGACACAATTGCCCAACA GGTTCATCAGGAATAT
GCCGGTACATTAAAAGCCTTTCAACAGCAGGAAATTAGCCTGTTTGATGATAAAAAT AGACCGCATCATGGCA
ATATCAATCGGGAATTAGATATTCTTGATTATCTGCTAGGGTATTTTGGTACACAAC GTGCAAAGCGTCCATTA
ACGCAGGATATTCATGATTTTCTGCAAACCCAGCGAGGTTATTTGGCACAGCAGCCG GAGTTGGGTTATCAG
CGTGATAATATCCGTATTGATCGAGTTTCAGCTTTACAAAAACGTATAGCAGCCCGA ATTGGGCTAGATGGTA
CTATTTTCAAAGAATCGGTTGATTTAAGTAAGTTACCTTTTTATTTGATTGAACATC GTCAGCTTTTACCAAATT
TACCCCATCTTGACTTTCAACATGATCAAACTCCCCAATCTTTTGTGATTTCCGACA ACATTGTTAAAGTGAAA
CAAGCGGGAAT AGCAGAT AAAATCGTTCGTGG ACAGCTT ATTG ATTTT AT AGAT ATT GAAAGCAAATTT ACCG
TTCGTGCCCAAATGATTGTCGCTGTAGAGGGAAATGAATTTTCTCTGGATACAAAAA ATAGTATTCAACTTGAA
AAGAATCTGCAGTTATTACAATCAGCGTCTGAGAAAAACAATTTACGATGGAGAAAT AGCACGGCGTGGTTAG
AGGATATGACGTATCGTATCAATTATACTGACGATCAGGTTATAGACGATAAAACAA AACAATGTCGTTTACAA
AGTAATACTAAATCGCCTTTTCCAGCCTTAATTGCACCAAAAAATAAGATTACGATT ATTAAGCAATCTTCTCC
ACTCTCCAGT ATTGCT GAATTT ACT GAT G AAC C AG AATTC AAATT AGTTGCAACGGT GACAGAGATT GATCGG
ATT GAAGGGAT ATT GACTATCGAACGGGAT GACAACCAACTCCCTTTCCCG ACT AAAGAAGAG AGT AATCAAT
ATATATGGTACATATCTGATGAAAACTATATTTCAAGTGATCGTTTCTCTTTTGTGG TGAGCGTCGTGCTGAAT
CGCGGTTTGGTT GAAAGGGAAGAT ATT GATCAAT AT AAGCT AGAGGAATGGAT AGAGCGT GAAACACTTGCA
GAGTTTCCTGCACATATTTCGTTAATTACTCATTGGCTGGCATCTGAAAATTTCGAT GATTTTGCGAAGACATA
TCAACGTTGGCAAAACAATGGGGCGCAGTT AGGGGAT GAATCCT ACACC ATTTTGGAAAAACT GACATT AGG
GCATTTACCAACAGGACTTACTGGCATTAGTAATATGTTTATTGCTACAGAAGCTCA GCGTCTAGAAGTTGTT
GGCGAGAGTGGTAATGAGTGGAATACCCAGGCAATTATTAACAACGAACTATTCTAT GTTCCCTCACAGAATA
GTT AAT ACCGAGT GTT GT GATCAACTTTT ATT AT AAGCCGGAGGAT AAATGGACAACAAAAAT AACAAACCTAC
TGATCAAGAGATTCTAAAAACATCACGGGCTGTCGGAGAAATTCCTTCAGCGGATAA TTTAAAAAATCGTTTTA
AAGCTCGTTCGATTCCATTAGAGACGGATTTTACTAATCTCATTGACCTTGCTGAAG TTGGACGATTGGCTAT
CGGCCAGTCACCATCGCAGCAAAGTAAAACGCCTGGCACCGGAATGGAATTAACTTC GGATGGTAAATTACA
AGTCAAGGCTGGGGCAGGTGTTGATATCGATAATAATAATCGTATTACTATTAAGTC TGGTCATGGAATTAAG
GTTGATGGAAACGGCATTTCCGTTAAACCAGGTTCGGGTATTAAGGTTGATAGTAAT GGTGTAAATGTCAATA
TTGATGATTTTTGGGAG G AAAT AC G C AAT AAAATT ATG CCT AAAG GAACCATGCTGCCTATTTATGGCACACC
TAACCCCTCTGCGCTGCCAACAGGATGGGAATGGTGTGATGGTAAAGATGGCAGACC TAATTTAAAAAAAGG
GAAATATAACTTACTATCAGGTCAGTCTTCAGGTACTGATACTTTTTGGGCAGATAA TAAGAATGGAGATACA
GAG AT CAAC GTGTT ATTT GTTT ACT AT AT GATT AAGGTT GTGT AAT AT CTT AAGT AAT ATGC ATT ACTCT AAAAT GAAT GATTT AT ATTT AAGTAAC AT AAT AATTAAGTT GT GTTGTAGGGCT GTTTTT ATG AGAAAT AT AAAAACGGA
GGTAATAATTGGCTTCAAAATATCAGTGATGAAATAGAGTTATTTCGCTTTATAAAA ATTTTGTTTTATTTCTTTT
AAT AATT ATTT AT AGAAGGT AAT GAT AT GTGCACACAAAAAAACGTGTT AG AT AGACTGAAAGAT AGAAAT ATT
ACATTGGGTTGGG ATGTT GTT GTTGCAT AT AACCAAGAAAGT GTT AAT AAGTT ATT GAAGCAACAAT AT GTT GA
AAAAGTTT ACTCAAAT GAACATTTT GTTTTT AAAG ATTGGCAT GAT GAT AAT AAAACGAAATTT ATT GAGGG ATT
AACAGTAGGCGCTCCACTAGTTTCATTTGAGGAGGCGTCTTTATCCGATGCTAATGT AAAAGTGACACTTAAC
TTTCTTTCTGGTAGATGGAGAGTTATACAAGCAAATACCGGCACACCAATTGAATGG AAAGAAATTGTTCCTG
GCAGTGGCTATAAAGCAGAATTAGTTGTTCCGCTTAAATCAATAACTGGTAGTGTAA GTAAAAAAGATATCATA
TT AAAATTCAAAGATGCT GTCGT AAAAAAAAT AAATTTATTT GACAATCAAGAGCCT GATTTT ATT AATT ATTTC
AAGCAATCG ATCAGT GAGGGAAATT AT ACTTT AGGGCAACTGGTGACAGACAGCACACCGGGATT AATTCCT
GCTGAATTTCATATTCGTACTCAACCCCATCCAAAAACACGTGAGCGTGGTTCTCAA TATGTAGGAAATGGTG
CGGT ACT GTT GTTT ATT AAAACGCAAT ATGGCGGAAGTGGAACATTGCCT GT AAAT GATTTT GATTGGTT AATT
CCTGATGATCATACTAGCGCATTAGTCATTTCGAGTAAGACCATGATGGGGCAAATA TTGCCAAAACAATACA
AAGATAAATTGCCTGGTGATCCTCAGTTTAGCCCACCAAAAAGAGTCAATGATAAAC AAGACTCTGCTTATTAT
ATTACGATTACCGATGGTGGATTTGATGGTAATAGCCCTATAGAGAAGTCATGGTTA CGTTCTGATTATAGCA
ATGGGATTTGGACTGGTGAACGTGGT AATGCT ATT ATTGGT GAAAAAGGAAAGCGGAT ACCACCACGTTTTC
CAT ACCAAAATTTT GTT ATT AAACCTCATGGT GAATCGTT ATTTCAAGGATGGGAGAAT AAGAT AAATT ACACT
CAAAAGTGTGCAAGATATTTCCGACATCATAGTAATAGTATAACTTTCGAAGATACT GCATTAATGGATCTCAG
T ATTGGTGGACAAGGT AGT ATCAATTGCCAGATT GATGGT GAACATTTCT ATTT AAAATCAGAT GATTTTTCCC
CCAATGTCAGCTATGAACCAACTTCATTCTGGGATAAATTTATCGGTGGGGTGGATG CAAATGTGAAAGATGA
ATTC AG AG AT G AATT AGCACAACAGGCAGAAG C AAAG TT AAAAC AG GT ATTT AAT ATT G AATT G C CT G AAAT C
AGTCTGTTTTCTATTAAACATCTGCTCTTTCCTGGCATGGATGTTATGCAACTTAAA CAGGGTTATTTCCCAGG
AGATTTGATTATCTTTGGGGATATTTCACCT AAATT G AC C AC AATT CAGGTGGCTCCTTTGGAAGCCATGGTT
GCCCTT AAAGAAAATCAAAAATTCACT GTCGT ACCT GAAAAT AAAAAT GTT AGTTGGAAGTTGGATCAT AAT AG
TGAGGCTAT C AAT G ATC C G G G AAAT ATTG ATG AT AAAG GTATTTATACGGCACCGGG C AG AAT CAGATCTGG
TTCTGAAGTCATTAAAGTCACTGCAACTGACGGCGATGGAAATCAGGCATCGGCGGC GCTGACGTTGGTTCC
TTCTTCTGTTGCATTAACACCTTCTTTTGCTTTTATCTCTGAAGCAGATAAGAAACC TATATTATTATTGGCGAA
TGTCCTAGACGGAAAAGCAGTAACATGGAATGTGGAAAGCTGTACAGGCAGCCAATG TGGTTCTGTTGATCA
GAAT GGGCTTTATACTCCACCAGCAGGGCGTTTTAACGATGGATTTACTTTTGCATCCATCACC GCAACTGCA
AAAGATGGTAGTCAAGCACGAACCATTATTTGTCTAATGGCATCAATGCCAGGACAT GGTTTTTACAAGGTTG
AACCTAATTTACGTTTGAATGTGAAAGTAGGGGAAGAAATTATCTTTAAAGCGCAGG CAGATAGCTATAATGG
TGATCCTGATACTTGGGAAATTTTCCCTCCTCGCGGAAAATTAAGTGAACCTGAGTT TGAACCCAATAATGAT
CCTGAAACTAATGATACAATTTTTGGTCATTATAAGGTGACCTATACCGCGCCGACT AATGTTACCTCACCTG
AATTGCTT GTT GTCCAT GT ATGGGAGAAAAAT AGGCATAAT G AGAAAAACAAAGGT AAGGCAGGAT ATGCACT
T ATT GAAATT ATCCC AGAT GAT AAAT AGAAAATTT ATTT AAAT AAAAATC ACAGCGGGTTT AT CTCGCT GT GATT
AAAGT CAT CTTTTTTT AT AG ATTGTTT ATCTCT AAT AAT AATTTT ATTTT AT AAT AT AAAGGAAATT AAAAT GAAT A
ATGAAT AT AAAAAT AACACCGTGAATTGGCGTATTTCACCTGAT ACGGTAGGAAGTATTGAT AAT AACGGTTTA
TATACAGCACCTAATCGGGTAAAGAATATCGAATTTGTCCAAGTAATGGCAAGCGAT GCTAATAATAATCAAT
CTTCTGCGATTATTACTGTTATTCCCTCTTCTGTTGCGTTAACGCCATCGTTTACTT TTATCTCTGAGGCAAAA
AAAAC ATC AGTCACTTTT AAAGCGAC AGAACTT GAAGGGAAAAAAGT GACATGGAGT AT AAAT AATT AT ACCA
GTAATCAGTATGGTTCCATCGATCAAAATGGTATCTACACACCACCGGAAAGTCGTT TTAACGATGGATATAC
TTTTGTATCTATTACAGCAAAAGCGGAAAATGGCGCTGAAGCGCAAGCGCTTATTTG CTTGATGGCCAAAATT
CCAGGGCATGCCTTTTTCGAT GTTCAGCCT AAT AT AT GTTT AAGT GT GAAGCCTGGAGAAGAAATCATTTTT A
GAGCTAACGCAGATCGTTATAATGGTGATCCTGATTCCTGGGAAATTTTCCCGTCTC TTGGTAAATTGGGTGA
GCCTGAGTATATAAAAAATAACGATCCAGAAATTCCTATTTATGGATATTATCAAGT GAAATATATTGCGCCAA
CCAATATAAATTCTTCCCAAATACTCGTTGTGCGTACTTGGGAATATGACAAACATG ATGAGCATAATCAAGGT
AAAG CAGGATATGC ATTC ATT GAAATT GTG C C AG AAAAT G AG CTTT AAT AT AT AT AC C C AAT AGATTTCGAGCC
GCAGCGCGGCGGCAAGTGAATGAATCCCCAGGAGCATAGATAACGATGTGACTGGGG TGAGTGAACGCAG
CCAACAAAG AGGT AACTT GAAAGAT AAT GAGTAT AAAT GACTTT AGT AAGAGAAATT ATGGCTTCATTCAGAAC
T ATTT ATT AGAGT AATT AACTTT AT AAAGACATTT AATGGAAAAT AT AAT AGAAAAATTT AAT ATT AAT ATT GAAG
TCTCATCTGAAATTATTGGAGAGAGTTTATTAAACTCCCCTTTATTGATGAGTAGAG AAATCAGCAATCAATTA
TCT GAAAT ATT ATT AG ATT AT AAAG AAT AT AAT ATT G C ATT G GAT AAGTT AGT GTT AAAT AT AGG AGAAAT ACCC
T AT GAAAT ATTT GAACAACAATTCT ATGGTCGTTTGGGAAAATT ATT AAAT GAAAAGTT AACAAT AAT AAT AAAT
GAT AAATT ATTGGT AAAAAACAT ATCAACCTCGTT ATTTCCT GAAT GTTTT AGT GAAAAAAGAAACCC ATT ATTA
AAT AGAGT CAT AAAAAATTT ACCTTCT AATTTGGTTTTT G AAGTT C ATTCAAT G GT AAAAAT AGAAT C AGT AAAT
AACAAAAAACAAGCT AAT AT ATT G AC AT CTTATCTGGCTT ATT CTTTTTTT AAT AAAAG CAAATT ACAACAAC AT
TT ATTTTCCACT AGT AAT AAT AAATT AATT GAGAGCTT AT ACGC ACTTTTTCT AACGGATC AGAATCGAAT ACCT
ACTGCTCATAAAATAGGAAAAGGTGCACTTATACTATCTGCCCTTATTTGGCTTTAT TCTAATTCCAATGATTAT
CTGCCCAAACCAGAAAGCACTCTGTTGTTACAAATAGAACAGGATATAAAACAAGGA TATTTGCCTTTAACGT
TGTTAATCACTTTCTTCCAGAACAGAAATGGCGGGCGTGTTTTTTGCGATTGGCAGT ATGCGTTATGGCAAAT
C GAT ATC AT C AAAAAT CACTTAGGCATT AAAAT AAC AT C G AAAG AAC CCCATTTACGG GAG AAAAT AAT GTT AC
AACCAGTTAATGCTTCTGATCGATCCTCTGTGCTGATATCAGACGAAAAATTGACAA TACCGTTAACAATTACA
GGTGCGGGATTAGTGCTTCTCTGGCCACTATTAACTCCACTATTTTCGTCTTTTGAT TTGTTAGATAAGAAAAG
TTTTTCAG ACAATTTGGCACAGGAAAT AGCATTT AATTT ATTGGAATGGTT AGTCTGGGG AGATG AG ATGCT G
TTACATCAGGAATCATCATTATCTTTATTACTCTGCGGAATAGATCACCAAACAATA CTGGAGCGCCAGGTTCT
TATTCCTGAGCACAAGGAAAAATTAAATAACTGGTTGCAAGGTATTTGTACTCAACT TTTCTCTTGGAAAAAGC TAGGGATCGATGATATGCGCCAACTTTTTTTGCAGCGTCAGGCTGCACTTTATTATGAAG ATGATGGCCGTTG
GTTATTAACGGTGCAGCGTGAAGCTTATGATGTATTACTGACTCAAATGCCTTGGCC GTGGCCATTGAATATT
GTGACATTACCTTGGCTAGCTGAGCCGATTAGTATCACTTGGGAAGGTATCTCTGAA CCAACGGATTTGTCAT
TTTGGTAATCCAATATCTCATTAGGAACTCTATGCATGTACGATTTATCTGATGATC TTGCCAGACAGAATATT
TCACCGGAATATGAATTGACGGTTTTGCTGTCTCAGACTGCTATATTGGATAAACGA ATTCGTTTACGAATTCA
GGAATTAATGCAACAGCAAACACTATTGGGAGAAAGTGGACAGACGTCTTTTGATGA TATTTCATTTTCATTC
GTTTCGAGTGAACAACAAAAATCATCTTATTTGGTGTCACCGCATCAAAATTGGACG AAAGAGGATTTTCCTC
CTGAGCCGATCCCATCTCGTAGCCGTCTAGGACAATTAGTTGAACGGTTTGACTTAA CTCAATTTGAAATTGA
TTTGATTTTATTGTGCCTGTTGCCTCATCTTGACAGACGTTATCTAACGTTATTTTC TCTTGTTCCGGTAAGTG
GAGGT AAT AACAGCAAAAAGCAGAT GTT AACGTTGGG ATTGGCTTTGGAGTTGCTTT GTCCGAGTGTAGT AG
AG C G C AAT GCGCAACGTGCCAGTTTATTACCACAGGCACCGCTTTGGGATTATCGTTTATTTCAGTTG CGCG
GTGATATGTCTGTTTCCTACGATGAAATACCGTTAGCAATCGATAATTCTCTTATGC ATTGGTTATTGGGGCAT
GATGCTCTCCCGATTTCTCTTCTCTCCCGGGCTCATTGGCTTCCTGTTCCTGAAGTG CCTGATATTTTGCCTG
ATTTCACCAACCAATTGATAGAACTCTGCCAAATGGAACAAGAGGGGATGCTGACAA TAATCGCCGGCGGAG
CCGGAAGTGGCAGCAAAACAAGTGTTGCACGCGCAGCATCACAAGTAGGGCGCTCTG TATTGTTGTTATCGT
T AGCATCAGT GACACT GAGT GAAC ATGAAACT ATT ACACT GAT AACACTGGC ATTACGTGAAGCACAACT AAG
AAATGCCT GTCTT AT GTTT GAAGCTTTGGAT GAGTTTT GT GAAGCACGCCCCGCTTTGCAGCTCTGGCTAGGA
AATCGACTGGCTCGTTGTTCGATTCCGCTGTTTTGTCAATTACCTAAGCAAGCATCA TTATTGCCATTGGATG
CAATTTCACAAGTT GT ATT GTCT ATGCCAATGCCTTCTTT AATGGT GAAGGCTGCAGCATT AGCTTCAAT GAT G
ACGAATTATTTTCCAGACAATTCATTGGATGTTGAAAGTTTAGTGACATGTTTCCAT CCTTCTCCATTGATATTG
AAAAAGGCCCTT AGT GAAGCAGAAATTT ATCGCCGACT ACGGGGGGAAACGGCT AGTTT GAGATT AGAT GAT
GTGCAAATGTCCCTGCGTTTTCGGTTACAGCAGAATTTTGGACGTTTAGCACAGAGA ATTACACCACAACGAA
CCTTTGATGATTTGATCATCAGTGAATCTCAACAGCAACAATTACAAGAAATCCTGG CGGCTATTCGGCAACG
AGATAGGATGCTAGAGCAAGGATTTGCTCGTAAAGTGAGCTATGGGACGGGTATCAG CACGCTATTTTTTGG
TGAATCTGGCACAGGAAAAACGATGGTAGCAGAAGTGTTAGCTGGTGTTTTAGGTGT GGATTTGATCAAGGT
AGATTTGTCCACTGTGGTTAACAAATATATTGGTGAAACTGAAAAAAATCTGGCTCG TGTTTTTGATTATGCCC
AAGAAGACGCCGGGGTATTGTTCTTTGATGAGGCAGATGCATTGTTTGGCAAACGAA GTGAAACTAAAGATG
CAAAAGATCGTCATGCTAATATTGAAGTTTCCTACCTATTGCAACGCCTTGAAAGTT ATCCAGGGCTGGTGAT
ATTAGCCACCAATTACCGTAATCATTTAGACTCAGCATTTAGTCGTCGCCTGACTTT TTCGGTACGATTCTCTT
TTCCAGATGTTTCCTTACGG GAAC GGATGTGGCGGATTATCTGGCCATCGG G AATT C AATT AGCCGACGACA
TCAGTTTTTCAGCGTTGGCAAAACGGGCTGAATTAACGGGGGCGAATATCCGTAATA TTGCGCTACTCGCTA
GTTGGCTGGCAGT AGAT GAAGGAAAT GAAAAAATT ACT ATGGCTCAT ATT GAATGCGCATT ACGACGT GAACT
GAGT AAAGTTGGGCGCATT GATTT ACCTT AATTTTTCTTT GT AATCGGGAGACAACT ATGGTT AAAAAT ATCAA
ATCAGAT GAAACCTT ACT GAT ATT AAAT AGT AAAAT AGAAGATGC ATTAAAAGCGT ATTT ACCGGGCGAAG AT
GTCGTTATTCGGTTCGATATGTTTGGTAAAAATGAAAATCCAGATTCTCCTACCGTG TGCGTTTTTCTTTATGA
TATTCAGGAAGATCTGCAATTACGCGTGGGAGAAGGGCGGCAATACCTGCCTGCGAC AGGAAATTTTGTCCC
GGGATGTGTCAATGTTCGTTGTAATTATCTTATTTCCTACTGGGAGCCGGAACAGAG CGGAGGGCAGGGATC
GCCAACCATACGTTCTAATAGTCAATCAATGAAGATAATGAACTGTGTATTGAATGC ATTAATTAATCATCGTT
CATTTCCTGGTTTACCCAGAACTTATACGAGAGTTCTTCCTCCTAATGAACAATTAA ATAGCTTAGGAAACTTT
TGGCAATCATTAGATAATAAGCCTCGACTATGTTTAAGTTATATGGTGACTATTCCT ATTCAACTTACCCCGCC
GACAGAGAAGGTATCTCCTGTCATTACCTCAAAAACTGATATTACTCGAAAACCATC GCTTAACTTTTATCTTG
AGGCAGATGAAATTATCCGTCAGGCATTAGTTGATGCCTTAATATCTCAAACAACAG AATCTATGGATACGAT
AACTAGCTGGCTGGCAAAAGTTGTTATTATTTGTCGACCACCAGAAATAATGAATAAACAAATGATTGAACAAA
CTGTGAAATTAATTATTGCTGGAATTACAGAAGAGGGATTAGCTGGAAATATAAAGACAATCACTCAAAAGTG
GGTGGAAGAGAAGACGATTATTGGTGAAATCGACGATGTTTCTCTAGTTATTTCCCAAGTTGACACGACAGC
GTTGTCTGCTGTAACAATACCGACATCTGTTTAA
SEQ ID NO: 96 (Pnf epitope)
TGQKPGNNEWKTGR
SEQ ID NO: 97 (PVCpromF)
TATCATATGTCTACAACTCCAGAACAAATTGCTG
SEQ ID NO: 98 (PVCpromR)
ATCTCTAGAACAGATATTCCAGCCAGC
SEQ ID NO: 99 (ParaINF)
GGCGTCACACTTTGCTATG
SEQ ID NO: 100 (ParaINF)
TCGGTGGCAGTAAATTGTCC
SEQ ID NO: 101 (F1 primer)
ATGTCTACAAGTACATCTCAAATTGCG
SEQ ID NO: 102 (F2 primer)
GACTCCCTTGAGGGTACGG
SEQ ID NO: 103 (F3 primer)
TTCTGATGAGAGTGATGGTAC
SEQ ID NO: 104 (F4 primer)
TGAATAAAGAATTCAGTCAATATC
SEQ ID NO: 105 (R1 primer)
TAGTGGCTGATGAAAGTCTG
SEQ ID NO: 106 (R2 primer)
GGAAGCCAAAGATAATGAAGTG
SEQ ID NO: 107 (R3 primer)
CATTTCTTCCCTATGGTTG
SEQ ID NO: 108 (R4 primer)
TTAAATTCCTACAAGATTATCTTT
SEQ ID NO: 109 (tBid amino acid sequence)
RSSHSRLGRIEADSESQEDIIRNIARHLAQVGDSMDRSIPPGLVNGLALQLRNTSRSEEDRNRDLATAL
EQLLQAYPRDMEKEKTMLVLALLLAKKVASHTPSLLRDVFHTTVNFINQNLRTYVRSLARNGMD
SEQ ID NO: 110 (E. coli Sequence Optimised tBid bases)
CGGTCAAGTCACTCGCGTCTGGGGAGAATCGAGGCTGATAGTGAGAGCCAAGAGGATATCATAA
GAAACATAGCACGCCATTTGGCACAGGTAGGCGATTCTATGGATCGCTCCATCCCGCCTGGACTT
GTCAATGGTCTTGCGCTTCAACTTCGTAACACTTCCCGGTCCGAGGAAGACAGAAATCGGGACCT
TGCGACTGCTCTGGAACAACTGCTTCAAGCATATCCTCGTGACATGGAGAAAGAAAAGACTATGT
TAGTATTAGCTCTTCTTTTAGCTAAAAAGGTAGCTTCGCACACTCCAAGTTTATTGCGGGACGTTT
TTCACACCACTGTTAATTTCATCAATCAGAACCTGCGTACTTATGTGAGATCTTTGGCGAGAAATG
GTATGGAT
SEQ ID NO: 111 (BaxBH3 peptide (aa59-73))
LSESLKRIGDELDSN
SEQ ID NO: 112 (E. coli Sequence Optimised BaxBH3 bases)
CTGTCGGAGAGTTTGAAGCGTATAGGTGACGAGCTGGACAGCAAT
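
The pairing of SEQ ID NO: 111 (peptide) with SEQ ID NO: 112 (its E. coli sequence-optimised coding bases) can be checked by translating the nucleotide sequence with the standard genetic code. The following is a minimal sketch, not part of the original listing. The helper names `clean` and `translate` are hypothetical, and the use of the standard codon table is an assumption. Whitespace is stripped first because extracted copies of the listing may contain stray spaces.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard table applies to these E. coli-optimised sequences).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove all whitespace (hypothetical helper) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding DNA sequence, reading frame 1, into one-letter amino acids."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# SEQ ID NO: 112 as it may appear after text extraction (with stray spaces)
SEQ_ID_112 = "CT GTCGGAGAGTTT GAAGCGT AT AGGT GACGAGCTGGACAGCAAT"
# SEQ ID NO: 111 (BaxBH3 peptide, aa59-73)
SEQ_ID_111 = "LSESLKRIGDELDSN"

if __name__ == "__main__":
    print(translate(SEQ_ID_112) == SEQ_ID_111)
```

The same check could be applied to SEQ ID NO: 110 against SEQ ID NO: 109 by joining the wrapped lines before translation.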