Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
LEADER SEQUENCE
Document Type and Number:
WIPO Patent Application WO/2020/245611
Kind Code:
A1
Abstract:
The present invention provides use of a Photorhabdus Virulence Cassettes (PVC) effector leader sequence, for packaging a payload into a PVC Needle Complex, and related methods for manufacturing a packaged PVC Needle Complex. The payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof, and the leader sequence and the payload form an effector fusion that is distinct from a wild-type PVC effector protein.

Inventors:
WATERFIELD NICHOLAS R (GB)
HEALEY JOSEPH (GB)
HAPESHI ALEXIA (GB)
Application Number:
PCT/GB2020/051380
Publication Date:
December 10, 2020
Filing Date:
June 05, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV WARWICK (GB)
International Classes:
C12N15/74; A61P1/04; A61P3/04; A61P11/06; A61P29/00; A61P35/00; A61P37/06; C07K14/24; C12N15/70
Domestic Patent References:
WO2014138324A12014-09-12
WO1992006204A11992-04-16
Foreign References:
US5223409A1993-06-29
Other References:
ISABELLA VLISIDOU ET AL: "PHOTORHABDUS VIRULENCE CASSETTES: EXTRACELLULAR MULTI-PROTEIN NEEDLE COMPLEXES FOR DELIVERY OF SMALL PROTEIN EFFECTORS INTO HOST CELLS.", BIORXIV, 15 February 2019 (2019-02-15), XP055716354, Retrieved from the Internet [retrieved on 20200720], DOI: 10.1101/549964
JIANG FENG ET AL: "Cryo-EM Structure and Assembly of an Extracellular Contractile Injection System", CELL, vol. 177, no. 2, 4 April 2019 (2019-04-04), pages 370, XP085654989, ISSN: 0092-8674, DOI: 10.1016/J.CELL.2019.02.020
KIM DOHLICH ET AL: "A Substrate-Fusion Protein Is Trapped inside the Type III Secretion System Channel in Shigella flexneri", PLOS PATHOGENS, vol. 10, no. 1, 16 January 2014 (2014-01-16), pages e1003881, XP055151154, DOI: 10.1371/journal.ppat.1003881
ABE A ET AL: "Type-III effectors: Sophisticated bacterial virulence factors", COMPTES RENDUS - BIOLOGIES, ELSEVIER, PARIS, FR, vol. 328, no. 5, 1 May 2005 (2005-05-01), pages 413 - 428, XP027629480, ISSN: 1631-0691, [retrieved on 20050501]
G. BRIONES ET AL: "Cre Reporter System To Monitor the Translocation of Type III Secreted Proteins into Host Cells", INFECTION AND IMMUNITY, vol. 74, no. 2, 20 January 2006 (2006-01-20), pages 1084 - 1090, XP055151152, ISSN: 0019-9567, DOI: 10.1128/IAI.74.2.1084-1090.2006
SORY M-P ET AL: "IDENTIFICATION OF THE YOPE AND YOPH DOMAINS REQUIRED FOR SECRETION AND INTERNALIZATION INTO THE CYTOSOL OF MACROPHAGES USING THE CYAA GENE FUSION APPROACH", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, NATIONAL ACADEMY OF SCIENCES, vol. 92, 1 December 1995 (1995-12-01), pages 11998 - 12002, XP002921387, ISSN: 0027-8424, DOI: 10.1073/PNAS.92.26.11998
"GenBank", Database accession no. FM 162591.1
WILKINSON, FEMS MICROBIOLOGY LETTERS, vol. 309, no. 2, August 2010 (2010-08-01), pages 136 - 143
"The Molecular Biology of Photorhabdus Bacteria", 2017, SPRINGER INTERNATIONAL PUBLISHING AG, pages: 159 - 177
WILKINSON ET AL., BMC GENOMICS, vol. 10, 2009
JULIE D. THOMPSON ET AL.: "CLUSTAL W: Improving the Sensitivity of Progressive Multiple Sequence Alignment Through Sequence Weighting, Position- Specific Gap Penalties and Weight Matrix Choice", NUCLEIC ACIDS RESEARCH, vol. 22, no. 22, 1994, pages 4673 - 4680, XP002956304
OSAMU GOTOH: "Significant Improvement in Accuracy of Multiple Protein. Sequence Alignments by Iterative Refinement as Assessed by Reference to Structural Alignments", J. MOL. BIOL., vol. 264, no. 4, 1996, pages 823 - 838
ERIC DEPIEREUXERNEST FEYTMANS: "Match-Box: A Fundamentally New Algorithm for the Simultaneous Alignment of Several Protein Sequences", CABIOS, vol. 8, no. 5, 1992, pages 501 - 509
C. E. LAWRENCE ET AL.: "Detecting Subtle Sequence Signals: A Gibbs Sampling Strategy for Multiple Alignment", SCIENCE, vol. 262, no. 5131, 1993, pages 208 - 214, XP001152872, DOI: 10.1126/science.8211139
IVO VAN WALLE ET AL.: "Align-M - A New Algorithm for Multiple Alignment of Highly Divergent Sequences", BIOINFORMATICS, vol. 20, no. 9, 2004, pages 1428 - 1435
ALTSCHUL ET AL., BULL. MATH. BIO., vol. 48, 1986, pages 603 - 16
HENIKOFFHENIKOFF, PROC. NATL. ACAD. SCI. USA, vol. 89, 1992, pages 10915 - 19
ROBERTSON ET AL., J. AM. CHEM. SOC., vol. 113, 1991, pages 2722
ELLMAN ET AL., METHODS ENZYMOL., vol. 202, 1991, pages 301
CHUNG ET AL., SCIENCE, vol. 259, 1993, pages 806 - 9
CHUNG ET AL., PROC. NATL. ACAD. SCI. USA, vol. 90, 1993, pages 10145 - 9
TURCATTI ET AL., J. BIOL. CHEM., vol. 271, 1996, pages 19991 - 8
KOIDE ET AL., BIOCHEM., vol. 33, 1994, pages 7470 - 6
WYNNRICHARDS, PROTEIN SCI., vol. 2, 1993, pages 395 - 403
CUNNINGHAMWELLS, SCIENCE, vol. 244, 1989, pages 1081 - 5
VOS ET AL., SCIENCE, vol. 255, 1992, pages 306 - 12
SMITH ET AL., J. MOL. BIOL., vol. 224, 1992, pages 899 - 904
WLODAVER ET AL., FEBS LETT., vol. 309, 1992, pages 59 - 64
REIDHAAR-OLSONSAUER, SCIENCE, vol. 241, 1988, pages 53 - 7
BOWIESAUER, PROC. NATL. ACAD. SCI. USA, vol. 86, 1989, pages 2152 - 6
LOWMAN ET AL., BIOCHEM., vol. 30, 1991, pages 10832 - 7
DERBYSHIRE ET AL., GENE, vol. 46, 1986, pages 145
NER ET AL., DNA, vol. 7, 1988, pages 127
SINGLETON ET AL.: "DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY", 1994, JOHN WILEY AND SONS
HALEMARHAM: "THE HARPER COLLINS DICTIONARY OF BIOLOGY", 1991, HARPER PERENNIAL
YANG ET AL., J BACTERIOL., vol. 188, no. 6, March 2006 (2006-03-01), pages 2254 - 2261
CZEKANSKA, METHODS IN MOLECULAR BIOLOGY, vol. 740, 2011, pages 27 - 32
HUCH ET AL., REGEN MED., vol. 8, no. 4, July 2013 (2013-07-01), pages 385 - 7
Attorney, Agent or Firm:
MACLEAN, Martin (GB)
Download PDF:
Claims:
CLAIMS

1. Use of a Photorhabdus Virulence Cassettes (PVC) effector leader sequence, for packaging a payload into a PVC Needle Complex;

wherein the payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof; and

wherein the leader sequence and the payload form an effector fusion that is distinct from a wild-type PVC effector protein.

2. The use according to claim 1 , wherein the leader sequence comprises amino acid residues 1-50 of a PVC effector.

3. The use according to claim 1 or claim 2, wherein the leader sequence comprises an amino acid sequence having at least 60% sequence identity to one or more sequence selected sequence from SEQ ID NO.: 47 - SEQ ID NO.: 92.

4. The use according to any one of the preceding claims, wherein the PVC effector comprises an amino acid sequence of one or more sequence selected from SEQ ID NO.: 1 - SEQ ID NO.: 46.

5. The use according to any one of the preceding claims, wherein the PVC effector comprises a sequence selected from SEQ ID NO: 4, SEQ ID NO: 22, SEQ ID NO: 25, SEQ ID NO: 30, SEQ ID NO: 32 and SEQ ID NO: 46.

6. The use according to any one of the preceding claims, wherein the leader sequence is covalently fused to the payload, preferably at an N-terminus of the payload.

7. A method for manufacturing a PVC Needle Complex comprising a payload, the method comprising:

a. contacting a PVC Needle Complex with an effector fusion comprising a PVC effector leader sequence fused to a payload;

b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and

c. wherein the effector fusion is distinct from a wild-type PVC effector protein.

8. The method according to claim 7, wherein said contacting occurs within a cell, in a cell lysate, or in a purified cell lysate.

9. An in vitro and/or ex vivo method for delivering a payload into a cell, the method comprising:

a. contacting a cell with a PVC Needle Complex comprising an effector fusion;

b. wherein the effector fusion comprises a PVC effector leader sequence fused to a payload;

c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and

d. wherein the effector fusion is distinct from a wild-type PVC effector protein. 10. A method for suppressing a pest, the method comprising:

a. contacting a pest, or a target area comprising a pest, with a PVC Needle Complex comprising an effector fusion;

b. wherein the effector fusion comprises a PVC effector leader sequence fused to a payload;

c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and

d. wherein the effector fusion is distinct from a wild-type PVC effector protein.

11. A PVC Needle Complex, for use in a method of treatment;

a. wherein the PVC Needle Complex comprises an effector fusion which comprises a PVC effector leader sequence fused to a payload;

b. wherein said payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof; and

c. wherein the effector fusion is distinct from a wild-type PVC effector protein.

12. A PVC Needle Complex comprising an effector fusion;

a. wherein said effector fusion comprises a PVC effector leader sequence fused to a payload;

b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and

c. wherein the effector fusion is distinct from a wild-type PVC effector protein.

13. An effector fusion, comprising a PVC effector leader sequence fused to a payload; a. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and

b. wherein the effector fusion is distinct from a wild-type PVC effector protein.

14. An isolated PVC effector leader sequence.

15. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion, or isolated PVC effector leader sequence according to any one of claims 7-14, wherein the leader sequence comprises amino acid residues 1-50 of a PVC effector.

16. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion or isolated PVC effector leader sequence according to any one of claims 7-15, wherein the leader sequence comprises an amino acid sequence having at least 60% sequence identity to one or more sequence selected from SEQ ID NO.: 47 - SEQ ID NO.: 92.

17. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion or isolated PVC effector leader sequence according any one of claims 7-16, wherein the PVC effector comprises an amino acid sequence of one or more sequence selected from SEQ ID NO.: 1 - SEQ ID NO.: 46.

18. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion or isolated PVC effector leader sequence according any one of claims 7-17, wherein the PVC effector comprises a sequence selected from SEQ ID NO: 4, SEQ ID NO: 22, SEQ ID NO: 25, SEQ ID NO: 30, SEQ ID NO: 32 and SEQ ID NO: 46.

19. The method, PVC Needle Complex for use, PVC Needle Complex, effector fusion or isolated PVC effector leader sequence according to any one of claims 7-18, wherein the leader sequence is covalently fused to a payload.

20. An isolated nucleic acid comprising a nucleotide sequence which encodes the isolated PVC effector leader sequence of any one of claims 14-19.

21. An expression vector comprising an isolated nucleic acid molecule of claim 20.

22. A host cell comprising an isolated nucleic acid molecule of claim 20, or an expression vector of claim 21.

23. The host cell of claim 22, wherein said host cell is one or more a selected from a mammalian cell, an insect cell, a yeast cell, a bacterial cell, and/or a plant cell;

preferably wherein said bacterial cell is an E. coli cell. 24. The host cell of claim 22, wherein said host cell is a Photorhabdus cell.

25. The host cell of claim 24, wherein said Photorhabdus cell comprises a Photorhabdus

PVC operon operably linked to an inducible promoter.

Description:
Leader Sequence

The present invention relates to a leader sequence, and use of a leader sequence for packaging molecules into protein complexes.

Biological molecules (e.g. peptides, proteins and nucleic acids) have great potential as broadly applicable therapeutics. Indeed, there has been a trend in recent years for the pharmaceutical industry to move away from‘small molecule’ drugs, toward more complex macromolecular therapeutics (aka. “biologies”). Such biologies include protein-based therapeutics (notably antibodies, hormones, growth factors and cytokines) and nucleic acid- based treatments (such as short-interfering RNAs, DNA/RNA vaccines and gene therapies).

While the biologies market has developed significantly in recent years, the low availability of effective delivery systems (and practicable methods for manufacturing such delivery systems) has limited the diversity of molecular targets of such bio-therapeutics, especially when the target is cytosolic. Indeed, the majority of approved peptide therapeutics on the market act by targeting extracellular components, such as membrane receptors or secreted molecules (e.g. present in the interstitial space). For example, humira (the most successful therapeutic monoclonal antibody) targets the extracellularly secreted cytokine TNFa. Insulin acts by binding its cognate receptor present on the cell membrane (the same being true of other hormone peptide therapeutics).

Similar problems exist in the agricultural industry, where protein-based pesticides are typically toxins which must target an extracellular component of a cell of a pest. By way of example, Bacillus thuringiensis toxins are commonly used natural pesticides which must bind membrane receptors to exert their toxic effects.

Methods for cytosolic delivery of biological molecules have been developed for laboratory research, which generally involve delivering the molecules within lipid vehicles which fuse with the plasma membrane of a cell, before emptying their payload into the cytosol. However, such methods find limited use in medicine and veterinary, e.g. due to the non specific nature in which they deliver molecules to cells.

Bacterial secretion systems have been explored as potential delivery systems, given their natural ability to secrete (or more particularly‘inject’) molecules into target cells. The most studied of such secretion systems is the Type III secretion system (T3SS), a “protein appendage” found in several Gram-negative bacteria. However, a significant drawback of these systems is that they remain associated with the bacterial membrane at all times, requiring use of actual bacterial cells (comprising the secretion system) as the delivery system. As such, it is difficult to fully control what molecules are transferred from the bacteria to the target cell (even when the biologic of interest is overexpressed), as these secretion systems function by providing a connection (e.g. channel) between the bacteria’s cytosol and the target cell’s cytosol, through which other components (potentially harmful to the host) may flow. Therefore, there exists not only a need for improved delivery systems, but also means for producing such systems which find compatibility with molecules (payloads) having a range of sizes and molecular properties.

The present invention solves one or more of the above-mentioned problems.

The present invention is predicated on the surprising finding that toxigenic Photorhabdus Virulence Cassettes (PVC) effector proteins of Photorhabdus bacteria comprise a previously unknown“leader sequence” (or“leader peptide”), which functions to package (or“load”) PVC effectors into a so called PVC Needle Complex(e.g. “nanosyringe”), which subsequently delivers the PVC effector to a target cell where it exerts its toxigenic effect(s) (the PVC effectors representing a payload of such nanosyringes). Moreover, the inventors have found that such leader sequences can be practically utilized to direct a payload linked thereto to be packaged into a PVC Needle Complex (and related/ homologous complexes), a well characterized molecular delivery system of Photorhabdus. Thus, the newly discovered leader sequence surprisingly functions to load the PVC Needle Complex with a molecular payload (or“warhead”).

Further to this finding, the inventors have developed an advantageous, practical utility for such leader sequence for packaging/ loading ‘heterologous’ payloads (including non- Photorhabdus molecules) into PVC Needle Complexes, independent of the size, molecular properties or provenance of the heterologous payload.

In a first aspect the invention provides use of a Photorhabdus Virulence Cassettes (PVC) effector leader sequence, for packaging a payload into a PVC Needle Complex;

wherein the payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof (preferably a polypeptide); and

wherein the leader sequence and the payload form an effector fusion that is distinct from a wild-type PVC effector protein.

In one aspect, an aspect of the invention provides use of a PVC effector leader sequence, for packaging a payload into a PVC Needle Complex;

wherein the payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof (preferably a polypeptide); and

wherein the leader sequence and the payload form a fusion that is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).

In other words, the invention provides in one aspect a method for packaging a payload into a PVC Needle Complex with a PVC effector leader sequence, comprising contacting an (effector) fusion with a PVC Needle Complex, wherein the payload is one or more selected from a polypeptide, a nucleic acid, or a combination thereof (preferably a polypeptide); and wherein the leader sequence and the payload form the (effector) fusion, that is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).

The terms“fusion” and“effector fusion”, in the context of a (effector) fusion formed by the leader sequence and the payload (and that is distinct from a wild-type PVC effector protein) are used interchangeably herein. This use (of the leader sequence) was demonstrated, as outlined in the examples, by expressing an effector fusion (tagged with a detection label) and a PVC Needle Complex in a cell (e.g. host bacterial cell) wherein the effector fusion is packaged into the PVC Needle Complex (via the leader sequence), isolating the PVC Needle Complex, then detecting the presence or absence of the payload within the PVC Needle Complex (e.g. a disrupted version thereof) via Western blot detection of the detection label. The presence of the payload is detected when fused to a leader sequence only, but not when the payload lacks a leader sequence.

The term“PVC effector leader sequence” means the leader region (polypeptide region) from a PVC effector polypeptide which is capable of packaging a payload (e.g. effector) into a PVC Needle Complex, and is preferably amino acids 1-50 of a PVC effector, or amino acids 2-50 when omitting the initial methionine. The inventors have demonstrated that the leader sequence is encompassed within (or may consist essentially of) amino acids 1-50 of a multitude of identified PVC effector polypeptide sequences. However, leader sequences having alternative lengths and positioning within a PVC effector are intended to be encompassed (e.g. with the proviso that said leader sequence is capable of packaging a payload into a PVC Needle Complex).

The remaining (non-leader sequence) portion of a PVC effector is referred to an“effector portion” (e.g. payload) herein. The effector portion preferably comprises or consists essentially of amino acids 51-C terminus of a PVC effector protein.

Thus, in one embodiment, a PVC effector leader sequence is encompassed within amino acids 1-50 or 2-50 (preferably 1-50) of a PVC effector polypeptide.

In embodiment, a PVC effector leader sequence comprises (or consists essentially of) amino acids 1-50 or 2-50 (preferably 1-50) of a PVC effector polypeptide.

The term“wild-type PVC effector protein” is used synonymously with the term“endogenous PVC effector protein”, or simply“PVC effector protein”, and refers to an (e.g. intact) PVC effector sequence having an endogenous leader sequence (i.e. endogenous to the given PVC effector, preferably amino acids 1-50 of the PVC effector) associated with the effector portion (e.g. the payload, preferably amino acids 51-C terminus of a PVC effector protein). Examples of wild-type PVC effectors may comprise (or consist essentially of) an amino acid sequence of one or more sequence selected from SEQ ID NO.: 1 - SEQ ID NO.: 46. The fusion/ effector fusion of the invention described herein is thus distinct from a PVC effector protein (e.g. wild-type PVC effector protein), as the leader sequence is not fused to an effector portion with which it may be fused in the case of a wild-type PVC effector protein. By way of example, the fusion/ effector fusion may comprise the leader sequence of the“Pnf” PVC effector protein (e.g. the leader of SEQ ID NO.: 78) fused to the effector portion of the hvnA (gene Plu1649) PVC effector protein (e.g. amino acids 51-295 of SEQ ID NO.: 46), but is not intended to refer to the leader sequence of the“Pnf’ PVC effector protein (e.g. the leader of SEQ ID NO.: 78) fused to the effector portion of the Pnf PVC effector protein (e.g. amino acids 51-340 of SEQ ID NO.: 32). On the other hand, the fusion/ effector fusion may comprise the leader sequence of, e.g., the “Pnf” PVC effector protein (e.g. the leader of SEQ ID NO.: 78) fused to a non-effector portion, for example a non- Photorhabdus protein such as Ore recombinase. Thus, the leader sequence finds utility in packaging a range of e.g. heterologous (non-wild-type) agents into a PVC Needle Complex, opening the possibility to use the PVC Needle Complex as a modular, diverse delivery system for delivering not only natural effectors, but also‘unnatural’ payloads to a cell for the first time. As such, it is possible to manufacture a PVC Needle Complex having a payload of choice.

Another aspect of the invention provides a method for manufacturing a PVC Needle Complex comprising a payload (e.g. in other words, a method for manufacturing a packaged PVC Needle Complex), the method comprising:

a. contacting (e.g. within a host cell) a PVC Needle Complex with an effector fusion comprising a PVC effector leader sequence fused to a payload;

b. wherein the payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

c. wherein the effector fusion is distinct from a wild-type PVC effector protein.

An aspect of the invention provides a method for manufacturing a PVC Needle Complex comprising a payload (e.g. in other words, a method for manufacturing a packaged PVC Needle Complex), the method comprising:

a. contacting (e.g. within a host cell) a PVC Needle Complex with a fusion, the fusion comprising a PVC effector leader sequence fused to a payload, wherein the leader sequence and the payload form a fusion that is distinct from a PVC effector protein (e.g. wild-type PVC effector protein); and

b. wherein the payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide).

In one embodiment, said contacting may occur within a cell (e.g. bacterial host cell), in a cell lysate, or in a purified cell lysate (preferably within a cell). In one embodiment, said contacting may occur within a cell free expression system. Similar, a use described herein may comprise a contacting step (between the fusion/ effector fusion and PVC Needle Complex) occurring within a cell (e.g. bacterial host cell), in a cell lysate, cell free expression system, or in a purified cell lysate (preferably within a cell, more preferably a bacterial host cell).

A cassette (operon) encoding the PVC Needle Complex may be operably linked to a first promoter, and a gene encoding the fusion/ effector fusion (payload) may be operably linked to a second (preferably different) promoter. In one embodiment, said first and/or second promoter is an inducible promoter (e.g. an arabinose inducible promoter such a pBAD, and/or an IPTG inducible promoter). Thus, the invention embraces an expression system wherein an operon encoding the PVC is present within a first vector/ plasmid (optionally operably linked to a first promoter), and the sequence encoding the effector fusion (leader sequence fused to payload) is present within a second (preferably different) plasmid (optionally linked to a second promoter). In one embodiment, the PVC Needle Complex and/or (preferably and) effector fusion may be expressed in one or more host selected from a bacterial cell, a yeast cell, an insect cell and/or a mammalian cell. In a preferable embodiment, the PVC Needle Complex and effector fusion may be expressed together in a host cell selected from a bacterial cell, a yeast cell, an insect cell and a mammalian cell (preferably a bacterial cell). Suitable mammalian cells include a HEK293 cell and/or a CHO cell.

The PVC Needle Complex and/or (preferably and) the effector fusion (payload) may be expressed in a heterologous bacterial expression system (preferably E. coli). In one embodiment, the PVC Needle Complex and/or (preferably and) the PVC effector may be expressed in a Photorhabdus cell, optionally wherein the PVC operon of the Photorhabdus cell is endogenous to the cell (and optionally wherein the PVC operon is operably linked to an inducible promoter which may be incorporated into the genome to be operably linked to the PVC operon via genetic engineering). For example, an inducible promoter may be introduced into the genome of a Photorhabdus cell 5’ to a PVC (operon), preferably by recombineering as described in the examples (e.g. Example 3).

The payload may be, for example, a therapeutic payload, such that a PVC Needle Complex finds utility in medical treatment.

In a further aspect, the invention provides a (packaged) PVC Needle Complex, for use in a method of treatment;

a. wherein the PVC Needle Complex comprises (e.g. is packaged with) an effector fusion which comprises (or consists essentially of) a PVC effector leader sequence fused to a payload;

b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

c. wherein the effector fusion is distinct from a wild-type PVC effector protein.

A further aspect of the invention provides a (packaged) PVC Needle Complex, for use in a method of treatment;

a. wherein the PVC Needle Complex holds (e.g. is packaged with) a fusion which comprises (or consists essentially of) a PVC effector leader sequence fused to a payload;

b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

c. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).

In one aspect, the invention provides a method of treating a subject, the method comprising administering a (packaged) PVC Needle Complex to a subject (e.g. a patient);

a. wherein the PVC Needle Complex comprises (e.g. is packaged with) an effector fusion which comprises (or consists essentially of) a PVC effector leader sequence fused to a payload;

b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

c. wherein the effector fusion is distinct from a wild-type PVC effector protein. In other words, an aspect of the invention provides a method of treating a subject, the method comprising administering a (packaged) PVC Needle Complex to a subject (e.g. a patient);

a. wherein the PVC Needle Complex holds (e.g. is packaged with) a fusion which comprises (or consists essentially of) a PVC effector leader sequence fused to a payload;

b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

c. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).

In a preferable embodiment, the payload is a polypeptide.

The subject may be a mammalian subject, preferably a human subject.

The terms “PVC Needle Complex holds an effector fusion” and “PVC Needle Complex comprising an effector fusion” means a PVC Needle Complex having a packaged effector fusion, or in other words, a PVC Needle Complex that is packaged with an effector fusion.

The term“packaged effector fusion”,“fusion” and“effector fusion” (e.g. wherein the fusion/ effector fusion is distinct from a wild-type PVC effector protein) embraces a combination of a PVC effector leader sequence and a payload which remains in contact (e.g. fused) subsequent to packaging into PVC Needle Complex (e.g. the leader sequence has not been cleaved off the payload), as well as combination of a PVC effector leader sequence and a payload which are no longer in direct contact (e.g. no longer fused, such as following cleavage of the leader sequence from the payload).

The term“treat” or“treating” as used herein encompasses prophylactic treatment (e.g. to prevent onset of a disease) as well as corrective treatment (treatment of a subject already suffering from a disease). Preferably“treat” or“treating” as used herein means corrective treatment. The term “treat” or“treating” encompasses treating both the disease and a symptom thereof. In some embodiments “treat” or“treating” refers to a symptom of a disease.

Therefore, a PVC Needle Complex may be administered to a subject in a therapeutically effective amount or a prophylactically effective amount.

A“therapeutically effective amount” is any amount of the (packaged/ laden) PVC Needle Complex, which when administered alone or in combination (e.g. with another therapeutic, administered parallel or in series and acting additively or synergistically) to a subject for treating a disease (or a symptom thereof) is sufficient to effect such treatment of the disease, or symptom thereof.

A“prophylactically effective amount” is any amount of the (packaged/ laden) PVC Needle Complex that, when administered alone or in combination (e.g. with another therapeutic, administered parallel or in series and acting additively or synergistically) to a subject inhibits or delays the onset or reoccurrence of a disease (or a symptom thereof). In some embodiments, the prophylactically effective amount prevents the onset or reoccurrence of a disease entirely. “Inhibiting” the onset means either lessening the likelihood of disease onset (or symptom thereof), or preventing the onset entirely.

In a related aspect, there is provided a (packaged) PVC Needle Complex comprising (e.g. that holds/ that is packaged with) an effector fusion;

a. wherein said effector fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said effector fusion is formed by a PVC effector leader sequence and a payload);

b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and

c. wherein the effector fusion is distinct from a wild-type PVC effector protein.

In other words, one aspect of the invention provides a (packaged) PVC Needle Complex that holds (e.g. is packaged with) a fusion;

a. wherein said fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said fusion is formed by a PVC effector leader sequence and a payload);

b. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

c. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).

In a preferable embodiment, the (packaged) PVC Needle Complex is an isolated (e.g. non natural) PVC Needle Complex.

As explained below, the PVC Needle Complex typically functions in nature to deliver toxigenic PVC effectors to insect targets. By expanding greatly the number and variety of payloads which may be packaged into a PVC Needle Complex, the invention concomitantly expands the number and variety of invertebrates (e.g. pests), such as amoeba, nematodes, helminths and insects, which may be targeted and killed.

In a further aspect of the invention, there is provided a method for suppressing a pest, the method comprising:

a. contacting a pest, or a target area comprising a pest, with a (packaged) PVC Needle Complex comprising (e.g. holding/ packaged with) an effector fusion;

b. wherein the effector fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said effector fusion is formed by a PVC effector leader sequence and a payload);

c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

d. wherein the effector fusion is distinct from a wild-type PVC effector protein.

An aspect of the invention provides a method for suppressing a pest, the method comprising: a. contacting a pest, or a target area comprising a pest, with a (packaged) PVC Needle Complex holding (e.g. packaged with) a fusion; b. wherein the fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said fusion is formed by a PVC effector leader sequence and a payload);

c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

d. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).

The terms “PVC Needle Complex holds an effector fusion” and “PVC Needle Complex comprising an effector fusion” means a PVC Needle Complex having a packaged effector fusion.

The term“target area” refers to an area where a pest is present and/or where a pest may be (e.g. is expected to be, or suspected of being) present.

Thus, in one embodiment, a target area may be contacted before, and/or when a pest is present. The target area may be in the vicinity of (e.g. close proximity to) a pest. Alternatively, the target area may be an area that a user wishes to protect from a pest. For example, a target area may comprise a plant and/or plant product.

The term“suppressing a pest” embraces“pest control”, “inhibiting the growth of a pest”, “inhibiting the proliferation of pest”, and/or“mortality of a pest”.

Examples of such pest include one or more insect(s), mite(s), sowbug(s), pillbug(s), centipede(s), mollusk(s), millipede(s), protist(s), fungus (fungi), helminth(s) and/or bloodborne parasite(s). The pest may be at any stage of development e.g. may be a larvae and/or adult pest (e.g. imago).

The invention may be used to target a variety of agricultural, commercial, home and garden pests.

In one embodiment the pest is an insect, a mite, a sowbug, a pillbug, a centipede, a mollusk and/or a millipede. Suitably the pest may be an insect and/or a mite (preferably insect).

Examples of suitable insects include, an insect of the order Lepidoptera, Coleoptera, Diptera, Blattodea, Hymenoptera, Isoptera, Orthoptera, Thysanura, and/or Dermaptera. In one embodiment an insect of the order Lepidoptera may be one or more of a moth and/or a butterfly. Suitable moths include Manduca Sexta and/or Galleria mellonella.

In one embodiment an insect of the order Coleoptera may be one or more of a European chafer grub, a northern masked chafer grub, a southern masked chafer grub, a Japanese beetle grub, a June beetle grub, a black vine weevil, a strawberry root weevil, a clay-colored weevil, a Colorado potato beetle, and/or a wireworm. In another embodiment an insect of the order Diptera may be one or more of a leatherjacket (e.g. larvae of a crane fly), an onion maggot, a cabbage maggot, a carrot rust fly maggot, a fungus gnat, and/or a mosquito. In another embodiment an insect of the order Blattodea may be a cockroach, suitably one or more cockroach selected from an American cockroach, and/or a German cockroach. In one embodiment an insect of the order Hymenoptera may be an ant. Suitably, the ant may be one or more of a carpenter ant, an odorous house ant, a pavement ant, an Argentine ant, a Pharaoh ant, a tawny crazy ant, a harvester ant, a red imported fire ant, a Southern fire ant, a European fire ant, and/or a little fire ant. In another embodiment an insect of the order Hymenoptera may be a yellowjacket.

In one embodiment an insect of the order Isoptera may be a termite. Suitably the termite may be one or more of a damp wood termite, a dry wood termite, and/or a subterranean termite. In another embodiment an insect of the order Orthoptera may be one or more of a cricket, a grasshopper, and/or a locust. In one embodiment an insect of the order Thysanura may be a silverfish. In another embodiment an insect of the order Dermaptera may be an earwig.

Examples of suitable molluscs include a slug and/or a snail.

In one embodiment, the pest is a protist. In one embodiment, said protist is one or more selected from Chaos carolinense, Amoeba proteus, Naegleria fowleri, Dictyostelium discoideum, Entamoeba histolytica, Trichomonas vaginalis, Blastocystis hominis, Leishmania Spp., and Giardia lamblia. In one embodiment, said protist is one or more selected from Fonticula alba, Dictyostelium discoideum, Chlamydomonas reinhardtii, Crytomonas paramedium, Paulinella chromatophora, Nannochloropsis gaditana, and/or Tetrahymena Spp.

In one embodiment, the pest is a fungus. In one embodiment, said fungus is one or more fungus selected from Encephalitozoan cuniculi, Nasema apis, Namema ceranae, Vittaforma carneae, Enterocytosoan bieneusi, Spraguea lophii, Vavra culiculis, Edharzardia aedes, Nematocida parisii, Razeiia Spp., Parasitella parasitica, Lichteimia ramose, Sporisorium scitamineum, Trametes versicolor, and/or Punctularia strigosozonata.

In one embodiment, said fungus is a Candida spp. Said Candida spp. may be one or more selected from C. albicans, C. ascalaphidarum, C. amphixiae, C. Antarctica, C. argentea, C. atlantica, C. atmosphaerica, C. auris, C. blattae, C. bromeliacearum, C. carpophila, C. carvajaiis, C. cerambycidarum, C. chauliodes, C. corydalis, C. dosseyi, C. dubliniensis, C. ergatensis, C. fructus, C. glabrata, C. fermentati, C. guilliermondii, C. haemulonii, C. humilis, C. insectamens, C. insectorum, C. intermedia, C. jeffresii, C. kefyr, C. keroseneae, C. krusei, C. lusitaniae, C. lyxosophila, C. maltose, C. marina, C. membranifaciens, C. mogii, C. oleophila, C. oregonensis, C. parapsilosis, C. quercitrusa, C. rugose, C. sake, C. shehatea, C. temnochilae, C. tenuis, C. theae, C. tolerans, C. tropicalis, C. tsuchiyae, C. sinolaborantium, C. sojae, C. subhashii, C. viswanathii, C. utilis, C. ubatubensis, and/or C. zemplinina. Suitably, said Candida spp. may be C. albicans.

In another embodiment, the pest is a helminth. Said helminth may be one or more selected from the phyla Annelida, Platyhelminthes, Nematoda and/or Acanthocephala. In one embodiment, said helminth is a parasitic flatworm. Said parasitic flatworm may be one or more selected from a Cestoda, a Trematoda and/or a Monogenea. In one embodiment, said helminth is a parasitic nematode. Said parasitic nematode may be one or more selected an ascarid ( Ascaris ), a filaria, a hookworm, a pinworm ( Enterobius ), and/or a whipworm {Trichuris trichiura).

In one embodiment, the pest is a bloodborne parasite. Said bloodborne parasite may be one or more selected from Trypanosoma Spp (e.g. Trypanosoma brucei and/or T cruzi), Babesia Spp (e.g. Babesia microti), Leishmania Spp, Plasmodium Spp (e.g. P. falciparum), and/or Toxoplasma Spp. (e.g. Toxoplasma gondii).

The PVC Needle Complex for pest control is suitably environmentally safe (e.g. an environmentally safe pesticidal composition).

Other advantageous utilities include delivering a payload to a cell, for example, during laboratory research. Such cell may be part of an in vitro cell line, or may be a cell of an animal (e.g. a research animal model). Additionally or alternatively, the cell may be comprised within an ex vivo system, such as an organoid.

Another aspect of the invention provides an in vitro (and/or ex vivo) method for delivering a payload into a cell, the method comprising:

a. contacting a cell with a (packaged) PVC Needle Complex comprising (e.g. holding/ packaged with) an effector fusion;

b. wherein the effector fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said effector fusion is formed by a PVC effector leader sequence and a payload);

c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

d. wherein the effector fusion is distinct from a wild-type PVC effector protein.

An aspect of the invention provides an in vitro (and/or ex vivo) method for delivering a payload into a cell, the method comprising:

a. contacting a cell with a (packaged) PVC Needle Complex holding (e.g. packaged with) a fusion;

b. wherein the fusion comprises (or consists essentially of) a PVC effector leader sequence fused to a payload (or in other words, wherein said fusion is formed by a PVC effector leader sequence and a payload);

c. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

d. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).

In one aspect, the invention provides an effector fusion comprising (or consisting essentially of) a PVC effector leader sequence fused to a payload (or in other words, an effector fusion formed by a PVC effector leader sequence and a payload);

a. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof; and

b. wherein the effector fusion is distinct from a wild-type PVC effector protein. An aspect of the invention provides a fusion comprising (or consisting essentially of) a PVC effector leader sequence fused to a payload (or in other words, a fusion formed by a PVC effector leader sequence and a payload);

a. wherein said payload is one or more selected from a polypeptide, a nucleic acid or a combination thereof (preferably a polypeptide); and

b. wherein the fusion is distinct from a PVC effector protein (e.g. wild-type PVC effector protein).

In one embodiment, the fusion/ effector fusion is an isolated fusion/ effector fusion (e.g. an isolated, non-naturally occurring fusion/ effector fusion).

The present invention embraces a nucleic acid comprising a nucleotide sequence which encodes the fusion/ effector fusion, and/or an expression vector comprising said nucleic acid. Also embraced is a host cell comprising said nucleic acid and/or expression vector.

As discussed above, the present inventors have discovered and practically utilised the leader sequence(s) described herein for the first time.

Thus, another aspect of the invention provides an isolated PVC effector leader sequence (e.g. wherein the isolated PVC effector leader sequence is capable of packaging a payload into a PVC Needle Complex).

In a related aspect there is provided an isolated nucleic acid comprising a nucleotide sequence which encodes a PVC effector leader sequence.

The isolated PVC effector leader sequence may be recombinant, synthetic, and/or purified. The isolated nucleic encoding a PVC effector leader sequence may be recombinant, synthetic, and/or purified.

Further details on the background of the invention, and terminology used herein, is provided below.

Photorhabdus is a bacterium of the genus Enterobacteriacae, represented by three formally recognized (to date) species - namely P. luminescens, P. asymbiotica, and P. temperata. Important strains include P. asymbiotica subsp. australis, and P. luminescens subsp laumondii. Currently available genome sequences are available on GenBank ( Photorhabdus asymbiotica ATCC43949 complete genome - GenBank Accession Number: FM 162591.1 ; Photorhabdus laumondii subsp. laumondii strain TT01 chromosome, complete genome - GenBank Accession number: CP024901.1).

Reference to“Photorhabdus luminescens subsp. laumondii’ may be used interchangeably with“Photorhabdus luminescens subsp. laumondii TT01”,“Photorhabdus laumondii subsp. laumondii strain TT01” and“P. luminescens TT01” herein.

The genome sequence for a further strain of P. asymbiotica, namely P. asymbiotica Kingscliff, is described in Wilkinson et. al. (FEMS Microbiology Letters, Volume 309, Issue 2, August 2010, Pages 136-143), incorporated herein by reference. Further genome sequences are described in Thanwisai et. at. (PLoS ONE 7(9): e43835), incorporated herein by reference.

Each of these species comprise at least one operon known as a Photorhabdus Virulence Cassette (PVC) operon, encoding a PVC Needle Complex, which may be referred to as a “nanosyringe” herein. Given that Photorhabdus is typically found in nature as an insecticidal bacterium following regurgitation from a (symbiont) entomopathogenic Heterorhabditis sp. nematode (e.g. in order to avoid competition for food and resources from insects), it is understood that the PVC Needle Complex functions in nature to suppress insects. Indeed, it has been shown that an isolated PVC Needle Complex (holding/ packaged with a natural effector toxin, such as Pnf) can be used to kill insect larvae - see Example 2. The Photorhabdus Virulence Cassettes represent one of at least four well-characterised toxin delivery systems of Photorhabdus. Other major classes of Photorhabdus protein insecticidal toxins include the “Toxin Complexes” (Tcs), the “binary PirAB toxins”, and the “makes caterpillars floppy” (Mcf) toxins.

The term“Photorhabdus Virulence Cassette” (PVC) (used synonymously with the term“PVC operon” herein) means a discrete operon of a Photorhabdus genome comprising genes encoding for polypeptide subunits which, when expressed, assemble to provide the macromolecular PVC Needle Complex. The molecular architecture of these cassettes have been well characterized and described, for example in The Molecular Biology of Photorhabdus Bacteria (Springer International Publishing AG 2017, ISBN: 978-3-319-52714- 7, Chapter 10, pages 159-177), incorporated herein by reference. A PVC (operon) typically comprises around sixteen genes ( pvd-pvc16 ) encoding structural proteins which assemble to provide a“PVC Needle Complex”, which are typically followed by one or more genes at the 3’ end which encode PVC effector genes, having toxic activity (and typically being homologues of typical T3SS-like effectors). A Photorhabdus genome typically comprises a plurality of such cassettes (e.g. at least four), which are often associated with different effector payloads, or even a plurality of effector payloads.

Three classes of PVC structural operons (Classes I, II and III) have been observed in the genomes of Photorhabdus, and members of other genera. PVCs within each class are similar in terms of the number and type of genes encoding structural proteins they contain (see Figure 1(B)). In more detail, Class I PVCs (which may be referred to as a“prototypical PVC” herein) comprise 16 conserved genes ( pvd-16 ). Class II lack pvc13 host cell binding fibres and pvc3, which (without wishing to be bound by theory) the inventors believe may be a minor specialised sheath subunit that attaches pvc13 fibre proteins onto the PVC Needle Complex (nanosyringe). As such, it is believed this class may be“non-specific”, injecting payloads into multiple (potential any) cell types. Class III is similar to Class I, but has an additional PvcO gene at the start of the operon (of unknown function) and two additional genes encoded between pvc13 and pvc14 that resemble“invasion” type protein genes. This class is typically seen in the human clinical isolate strains of Photorhabdus - the inventors have shown that optimal transcription of PVC Class III may occur when the strain (harboring the PVC operon encoding a PVC Class III operon) is grown at 37°C and exposed to human serum, suggesting this class may be a mammalian adapted version of a PVC Needle Complex. An example cassette (PVC) is shown in Figure 1 (D), which shows a map of the model“Class I” PVC operon of Photorhabdus asymbiotica ATCC43949 (obtainable from the ATCC, accession number: ATCC 43949), said operon being associated with the downstream effector gene“PAU_03332” (encoding a Pnf protein effector, e.g. SEQ ID NO.: 32). This model operon is referred to as Pa AT cc 43949 p\/C pnf. This operon comprises sixteen structural genes ( pvd-16 ), and two genes (3’ end) encoding effectors (in this case the pvc17 / Rhs- like, encoding an Rhs-like effector, and pvc21, encoding a Pnf effector). Said genes pvd-16 correspond to genes PAU_03353 to PAU_03338 of the sequence of GenBank accession no. FM 162591.1 , and are represented by the sequence of SEQ ID NO.: 93.

An example PVC operon (e.g. encoding the structural genes, but not a/the PVC effector) is provided in SEQ ID NO: 93 (which is encodes the operon shown schematically in Figure1(D)), with other examples being SEQ ID NO: 94 and in SEQ ID NO: 95. These sequences begin at the ATG start codon of the first structural gene ( pvd ) of the PVC cassette / operon, and end at the TAA stop codon of the final structural gene (pvd 6).

A PVC Needle Complex from any one of Classes l-lll may be used for a variety of applications. However, PVC Needle Complexes of a certain class may be particularly suitable for delivery to a defined cell type. For example, a PVC Needle Complex for delivery of a payload to a mammalian cell may suitably be a member of Class III. A PVC Needle Complex for delivery of a payload to an insect cell (e.g. to an insect) may suitably be a member of Class I (such as P. asymbiotica PVC pnf, encoded by SEQ ID NO.: 93, e.g. as expressed in E. coli from a cosmid clone).

Thus, as will be understood by the skilled person, the term“PVC Needle Complex” (used synonymously with the terms“PVC Needle Complex delivery system” and“nanosyringe” herein) means a macromolecular protein complex comprising polypeptide subunits encoded by a PVC (operon) of a Photorhabdus bacterium. A PVC Needle Complex is assembled in a nanosyringe structure, having a physical structure (superficially) similar to the antibacterial R- type pyocins (see Figure 3). Functional and molecular studies have shown that a PVC Needle Complex becomes packaged (loaded) with a PVC effector protein(s) (i.e. the PVC effector proteins are packaged therein, or thereon), the packaged PVC Needle Complex is released from the bacterium, and then injects the PVC effector into a target cell such that the PVC effector protein may exert toxicity.

The term “PVC Needle Complex” preferably encompasses PVC Needle Complex-like structures/complexes, encoded by operon(s) comprising genes which are homologous to genes of a Photorhabdus PVC operon. PVC-like elements are not restricted to Photorhabdus, and a well characterized homologous operon (to a PVC operon) is present on the pADAP plasmid of the insect pathogenic bacteria Serratia entomophila. Furthermore, an analogous, and (at least partially) homologous, PVC-like ‘injectosome’ Needle Complex system is employed by the bacterium Pseudoaiteromonas luteoviolacea (e.g. used to control the metamorphosis of the marine worm Hydroides elegans). Structures exist in other Enterobacteriaceae (such as Yersinia Spp.) which are encoded by operons having homology to a PVC operon, and may be used with a leader sequence described herein. Each of these (PVC-like) structures are embraced by the term“PVC Needle Complex” as used herein. Thus, a PVC Needle Complex is a“nanosyringe” complex, with the polypeptide encoded by the effector gene being packaged (loaded) within, or at the end (tip) of, the PVC Needle Complex, thus representing a“payload” or“warhead” of the PVC Needle Complex. The present inventors have demonstrated that the PVC Needle Complex itself (with the payload still loaded) is freely released (e.g. secreted) from Photorhabdus cells, before interacting with the membrane of a target cell and injecting the payload into the cell’s cytosol. Indeed, the inventors have successfully expressed and loaded PVC Needle Complexes in heterologous expression systems, before isolating/ purifying the PVC Needle Complexes and using them to suppress (e.g. kill) insect larvae (see Example 2). Thus, the PVC Needle Complexes act as long-range protein delivery systems.

In one embodiment, the PVC Needle Complex is encoded by a sequence having at least 75% sequence identity (preferably at least 85% sequence identify; more preferably at least 95% sequence identity) to a sequence selected from SEQ ID NO.: 93, SEQ ID NO.: 94, and SEQ ID NO.: 95 (for example, SEQ ID NO.: 93).

In one embodiment, the PVC Needle Complex is encoded by a sequence selected from SEQ ID NO.: 93, SEQ ID NO.: 94, and SEQ ID NO.: 95 (for example, SEQ ID NO.: 93).

Leader/ signal sequences are typically peptides, often of 10-30 amino acids long present at the N-terminus of the majority of (newly) expressed proteins that are destined towards the secretory pathway (e.g. for directing said proteins to a protein-conducting channel on the cell membrane). Many proteins require a signal sequence for Golgi or endoplasmic reticulum entry.

The term“leader sequence” (used interchangeably with the terms“leader peptide”,“signal sequence”, “targeting signal”, “localization signal”, “localization sequence”, and “transit peptide” herein), used in the context of a“PVC effector leader sequence” herein, means a polypeptide sequence which functions to direct the PVC effector into the interior, or the end (tip), of a PVC Needle Complex - as such, the leader sequence functions to package a PVC effector into a PVC Needle Complex. The PVC Needle Complex can subsequently deliver (e.g. inject) the PVC effector into a target cell. The PVC Needle Complex may be an assembled PVC Needle Complex. The term“PVC Needle Complex” may refer to a fragment of a PVC Needle Complex (e.g. wherein the leader sequence contacts said fragment, and optionally the PVC Needle Complex assembles around the leader sequence-payload ‘effector fusion’).

A PVC leader sequence is typically present in the N-terminus (characterized by or encompassed within the first 50 amino acids) of a PVC effector or homologue thereof. However, the invention embraces leader sequences of PVC effectors and PVC effector homologues, which may be found in regions other than the N-terminal region of such PVC effectors/ homologues (e.g. in the C-terminal region).

In one embodiment, the leader sequence comprises (or consists essentially of) amino acid residues 1-50 of a PVC effector (e.g. PVC effector protein). Reference to“amino acid residues 1-50” embraces“amino acid residues 2-50”, wherein the N-terminal methionine is omitted e.g. has been cleaved. The leader sequence may be a fragment of the N-terminal 50 amino acids of a PVC effector (e.g. a fragment comprising or consisting essentially of £ 45, £ 35, £ 25, or £ 15 amino acids), with the proviso that the fragment is capable of packaging a payload into a PVC Needle Complex.

In one embodiment, a leader sequence (e.g. isolated leader sequence) of the invention comprises (or consists essentially of) an amino acid sequence having at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% or 100% sequence identity to one or more sequence selected from SEQ ID NO.: 47 - SEQ ID NO.: 92 (preferably SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71 , SEQ ID NO.: 76, SEQ ID NO.: 78, or SEQ ID NO.: 92) - e.g. with the proviso that the leader sequence is capable of packaging a payload into a PVC Needle Complex. In a preferable embodiment, a leader sequence comprises (or consists essentially of) an amino acid sequence having at least 60% sequence identity to one or more sequence selected from SEQ ID NO.: 47 - SEQ ID NO.: 92 (preferably SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71 , SEQ ID NO.: 76, SEQ ID NO.: 78, or SEQ ID NO.: 92) - e.g. with the proviso that the leader sequence is capable of packaging a payload into a PVC Needle Complex. In a more preferable embodiment, a leader sequence comprises (or consists essentially of) an amino acid sequence of one or more selected from SEQ ID NO.: 47 - SEQ ID NO.: 92 (preferably SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71 , SEQ ID NO.: 76, SEQ ID NO.: 78, or SEQ ID NO.: 92). In one embodiment, a leader sequence comprises (or consists essentially of) an amino acid sequence selected from SEQ ID NO.: 47 - SEQ ID NO.: 92 (preferably SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71 , SEQ ID NO.: 76, SEQ ID NO.: 78, or SEQ ID NO.: 92).

In one embodiment, a leader sequence comprises (or consists essentially of) an amino acid sequence selected from SEQ ID NO.: 50, SEQ ID NO.: 68, SEQ ID NO.: 71 , SEQ ID NO.: 76, SEQ ID NO.: 78, and SEQ ID NO.: 92.

In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 50. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 68. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 71. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 76. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 78. In one embodiment, the leader sequence comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 92.

Without wishing to be bound by theory, it is believed that the leader sequences share a “chemical composition consensus”, based on amino acid properties. More particularly, the leader sequences comprise similar charge patterns, the pattern comprising 2x negatively charged regions, each followed by a positively charged region (e.g. [-ve] [+ve] [-ve] [+ve]) - see Figure 9. This is consistent with leader sequences of toxins of the type 2 secretion system, which comprise a charge / property pattern of [+ve] [Hydrophobic] [+ve] [C]. A further theory posits that the leader sequences share a typical“helix-turn-helix” structure. Another theory is that the leader sequences form a structure recognised by an ATPase enzyme (e.g. encoded by the gene PAU_03339 ( pvc15) in the model operon of Figure 1 (D)) present in the interior, or at the end (e.g. tip), of a PVC Needle Complex. The term“PVC effector” (used synonymously with the term“PVC operon-encoded effector”, and“PVC effector protein”) means an effector polypeptide encoded by a Photorhabdus PVC operon, more particularly (and typically) found shortly downstream (3’) of the structural genes of said operon (preferably shortly or immediately downstream of pvc16 , and typically within 5kb). The term“PVC effector” preferably embraces homologues thereof. Thus, the leader sequence may also be from a polypeptide encoded by a gene which is a homologue of gene encoding a PVC effector - see Table 1 for examples of such homologues. Indeed, identification of PVC effectors is aided by detecting homology of a gene downstream of pvd 6 with a known toxin polypeptide (e.g. a gene which encodes said toxin polypeptide). As will be understood by the skilled person, the term“homologue” preferably means a gene that descended from the same ancestral gene, and shares similar function - such gene (or polypeptide encoded thereby) is homologous to a gene encoding the PVC effector. A homologue may be from the genome of a Photorhabdus species or from a species other than a Photorhabdus species. Examples of suitable homologues are outlined in Table 1.

The present inventors have elucidated and characterised, in detail, genes that encode PVC effectors of these PVC Needle Complexes in the three most common (best characterised) strains of Photorhabdus, as well as the P. asymbiotica Thai strain PB68.1. This was conducted based on analysing proximity of genetic linkage to the 3’ end of the PVC structural genes of the operons, and predicted function of the protein sequence of the effector (e.g. a homologue of a known effector/ toxin protein). In more detail, the PVC effectors (e.g. genes encoding the PVC effectors) were typically identified as open reading frames (ORF) having homology to genes encoding known toxin polypeptides (e.g. homologues as outlined in Table 1), and being typically present within a distance of 1 kilobase to 5 kilobase (kb) (e.g. within 1 kb) downstream of the final structural gene of a PVC operon (e.g. pvc16) (typically with few or no intervening genes). Typically, there are no“non-toxin-like” ORFs between the end of the operon (encoding the PVC Needle Complex) and the PVC effector gene(s). Although there may be (e.g. one, or two) other small predicted genes present in these regions, these other genes are not assigned as PVC effectors (due to lack of homology to a known effector/ toxin gene, as described above).

In order to assign a putative PVC effector gene (e.g. ORF within a distance of 5kb, for example within 1 kb downstream of the final structural gene of a PVC operon) as a PVC effector gene, the inventors used a combination of BlastP and HHPRED (https://toolkit.tuebingen.mpg. de/#/tools/hhpred). Putative PVC effector genes were assigned as PVC effector genes based on direct homology to known toxin encoding genes, similarity to a toxin protein family, proximity to the PVC operon (e.g. within 1-5kb downstream of the final structural gene of a PVC operon, pvc16) and/or based on domain similarities of predicted secondary structures to that of known toxins.

Thus, a PVC effector (gene) may be identified (within a Photorhabdus genome) by (i) identifying pvc16 (e.g. via sequence homology to a known pvc16), (ii) identifying an ORF 3’ to pvc16, preferably £5kb downstream of pvc16), and (iii) confirming said ORF encodes a PVC effector through identification of sequence homology to a known gene encoding a toxin polypeptide (for example, a toxin protein described in the column of Table 1 labelled “Homologue(s)”). By way of example, the PVC effector gene PAU_03337 (referred to herein as“sepC” due to homology to virulent sep genes) is positioned 325 base pairs (bp) downstream of pvc16 (PAU_03338) of the PVC operon referred to herein as PVCpnf (e.g. of SEQ ID NO. 93), which is found in P. asymbiotica ATCC43949. That is, the start codon of PAU_03337 begins 325 bp downstream of the end of the stop codon of PAU_03338.

This can be illustrated by reference to the P. asymbiotica ATCC43949 complete genome, accessible via GenBank accession no. FM 162591.1 (see also e.g. Wilkinson et al, BMC Genomics volume 10, article number: 302 (2009), incorporated herein by reference), in which effector gene PAU_03337 is annotated as being positioned in the genome as follows: complement (3913237..3914247) - that is, at nucleotide positions 3913237..3914247; and PAU_03338 is annotated as being positioned in the genome as follows: complement (3914573..3915454). No other ORF (encoding an effector or otherwise) is found between these two genes.

A further PVC effector gene associated with the PVC operon referred to herein as PVCpnf (e.g. of SEQ ID NO. 93), namely PAU_03332 (referred to herein as “pnf’), is positioned 3535bp downstream of pvc16 (PAU_03338).

The PVC effector gene PAU_02095 (referred to herein as“Rhs-iike toxin effector” due to homology to virulent Rhs toxin genes) is positioned 3961 bp downstream of pvc16 (PAU_02099) of a PVC operon referred to herein as PVC lopT (e.g. of SEQ ID NO. 94), which is found in P. asymbiotica ATCC43949. That is, the start codon of PAU_02095 begins 3961 bp downstream of the end of the stop codon of PAU_02099.

In a further example, the PVC effector of gene PAU_02009 (referred to as“cif” herein due to predicted function as a cell cycle inhibiting factor/ ATP/GTP binding protein) is positioned 157bp downstream of pvc16 (PAU_02008) of the associated PVC operon, referred to herein as PVC c/7, found in P. asymbiotica ATCC43949.

In yet further examples: with regard to a PVC operon of P. luminescens TT01 referred to as a PVC unit4 operon herein, PVC effector gene “pvc17” (e.g.“plu165T’) is positioned 104bp downstream of pvc16 (gene“piu1655’)\ and with regard to a PVC operon of Photorhabdus temperata subsp. temperata Meg1 referred to as a PVCc/7 operon herein, PVC effector gene “CIF toxin effector” (e.g. MEG1 DRAFT_03529) is positioned 4216bp downstream of the relevant pvc16 gene.

These examples, illustrate that a gene encoding a PVC effector is typically positioned within a distance of £ 5 kb downstream of the final gene of a PVC operon (e.g. of pvc16), more typically within a distance of £ 1 kb downstream of the final gene of a PVC operon.

In summary, there exists 46 PVC effectors that have been identified in these four strains (based on currently available sequence data) (see Table 1). The first 50 amino acids of each of these PVC effectors represent (or encompass) their endogenous leader sequence, and the inventors have demonstrated the leader sequences may be cloned and fused to a variety of payloads to be packaged into a PVC Needle Complex - see Examples 3 and 4. Thus, a PVC effector (as translated) comprises at least two principle domains: the leader sequence (amino acids 1 to 50) and the actual effector polypeptide (amino acids 51 to C-terminal amino acid) - the latter of which may be referred to as the“effector” (e.g.“effector portion”) or “payload” herein. Although the Photorhabdus genome sequence(s) continues to be revised, this consolidated list of PVC effector genes represents a comprehensive description of such effectors, and is based on currently available sequence data of the most common (best characterised) Photorhabdus strains, and provides the skilled person with an understanding of the term “PVC effector” as well as the sequences of these PVC effectors (as well as how to search/mine for further PVC effectors, e.g. in alternative (genome) sequences). As described above, the inventors have found that the PVC effector proteins comprise a leader sequence which is necessary (and sufficient) for directing the PVC effector protein (e.g. payload) to be packaged/ loaded into a PVC Needle Complex. Table 1

The accession numbers provided in Table 1 are provided for exemplary purposes, providing example amino acid sequences of (or having high similarity to) PVC effectors described herein. The sequences of said accession numbers may be accessed through GenBank (https://www.ncbi.nlm.nih.gov/genbank/).

The locus tag (beginning with“PAU” or“Plu”) corresponds to the locus tag assigned to the effector in genome sequences available through GenBank above. Locus tags beginning with “PAT” (referring to strain P. asymbiotica Thai strain PB68.1) and“PAK” (referring to strain P. asymbiotica Kingscliff) have been assigned by the present inventors upon identification of the PVC effector genes within the genomes of said strains (in a consistent manner with the locus tags of publicly available sequences). This locus tags may be used herein to refer to the corresponding PVC effector polypeptide.

In one embodiment, the PVC effector is encoded by one or more gene (with the SEQ ID NO. of the encoded PVC effector protein in parentheses) selected from PAK_1985 (SEQ ID NO: 1), PAK_1987 (SEQ ID NO: 2), PAK_1988 (SEQ ID NO: 3), PAK_2075 (SEQ ID NO: 4), PAK_2077 (SEQ ID NO: 5), PAK_2892 (SEQ ID NO: 6), PAK_2893 (SEQ ID NO: 7), PAK_2894 (SEQ ID NO: 8), PAK_3525 (SEQ ID NO: 9), PAT_00148 (SEQ ID NO: 10), PAT_00149 (SEQ ID NO: 11), PAT_00150 (SEQ ID NO: 12), PAT_00152 (SEQ ID NO: 13),

PAT_02308 (SEQ ID NO: 14), PAT_02309 (SEQ ID NO: 15), PAT_02310 (SEQ ID NO: 16),

PAT_02956 (SEQ ID NO: 17), PAT_02957 (SEQ ID NO: 18), PAT_03171 (SEQ ID NO: 19),

PAT_03172 (SEQ ID NO: 20), PAT_03177 (SEQ ID NO: 21), PAU_02009 (SEQ ID NO: 22),

PAU_02010 (SEQ ID NO: 23), PAU_02095 (SEQ ID NO: 24), PAU_02096 (SEQ ID NO: 25),

PAU_02097 (SEQ ID NO: 26), PAU_02098 (SEQ ID NO: 27), PAU_02230 (SEQ ID NO: 28),

PAU_02805 (SEQ ID NO: 29), PAU_02806 (SEQ ID NO: 30), PAU_02807 (SEQ ID NO: 31),

PAU_03332 (SEQ ID NO: 32), PAU_03337 (SEQ ID NO: 33), Plu1651 (SEQ ID NO: 34), Plu1671 (SEQ ID NO: 35), Plu1672 (SEQ ID NO: 36), Plu1690 (SEQ ID NO: 37), Plu1691 (SEQ ID NO: 38), Plu1712 (SEQ ID NO: 39), Plu1713 (SEQ ID NO: 40), Plu1714 (SEQ ID NO: 41), Plu2400 (SEQ ID NO: 42), Plu2401 (SEQ ID NO: 43), Plu2514 (SEQ ID NO: 44), Plu2515 (SEQ ID NO: 45), Plu1649 (SEQ ID NO: 46), or a combination thereof.

In one embodiment, the PVC effector is encoded by one or more gene (with the SEQ ID NO. of the encoded PVC effector protein in parentheses) selected from PAU_02009 (SEQ ID NO: 22), PAU_02010 (SEQ ID NO: 23), PAU_02095 (SEQ ID NO: 24), PAU_02096 (SEQ ID NO:

25), PAU_02097 (SEQ ID NO: 26), PAU_02098 (SEQ ID NO: 27), PAU_02230 (SEQ ID NO:

28), PAU_02805 (SEQ ID NO: 29), PAU_02806 (SEQ ID NO: 30), PAU_02807 (SEQ ID NO:

31), PAU_03332 (SEQ ID NO: 32), PAU_03337 (SEQ ID NO: 33), Plu1651 (SEQ ID NO: 34),

Plu1671 (SEQ ID NO: 35), Plu1672 (SEQ ID NO: 36), Plu1690 (SEQ ID NO: 37), Plu1691 (SEQ ID NO: 38), Plu1712 (SEQ ID NO: 39), Plu1713 (SEQ ID NO: 40), Plu1714 (SEQ ID NO: 41), Plu2400 (SEQ ID NO: 42), Plu2401 (SEQ ID NO: 43), Plu2514 (SEQ ID NO: 44), Plu2515 (SEQ ID NO: 45), Plu1649 (SEQ ID NO: 46), or a combination thereof. These gene names correspond to the‘locus tags’ of PVC effector genes in the Photorhabdus genome sequences accessible via GenBank, as described above. The PAT and PAK locus tags were generated by the present inventors, such that terminology is consistent with the PAU and Plu locus tags of publicly available genome sequences.

Thus, the PVC effector may be encoded by one or more gene listed above.

In one embodiment, the PVC effector is encoded by one or more gene (with the SEQ ID NO. of the encoded PVC effector in parentheses) selected from PAK_02075 (SEQ ID NO: 4), PAU_02009 (SEQ ID NO: 22), PAU_02096 (SEQ ID NO: 25), PAU_02806 (SEQ ID NO: 30), PAU_03332 (SEQ ID NO: 32), Plu1651 (SEQ ID NO: 34), Plu1649 (SEQ ID NO: 46), or a combination thereof.

In a preferable embodiment, the PVC effector is encoded by one or more gene (with the SEQ ID NO. of the encoded PVC effector in parentheses) selected from PAU_02806 (SEQ ID NO: 30), PAU_03332 (SEQ ID NO: 32), Plu1651 (SEQ ID NO: 34), Plu1649 (SEQ ID NO: 46), or a combination thereof. The PVC effector may have a sequence having at least 80% sequence identity (preferably at least 90% sequence identity; more preferably 100% sequence identity) to an amino acid sequence selected from SEQ ID NO: 1 - SEQ ID NO: 46. For example, the PVC effector may have a sequence having at least 80% sequence identity (preferably at least 90% sequence identity; more preferably 100% sequence identity) to an amino acid sequence selected from SEQ ID NO: 22 - SEQ ID NO: 46.

The present inventors have identified the leader sequences of the gogB1 (PAU_02806) and Pnf (PAU_03332) PVC effectors as being particularly efficient at packaging a (fused) payload into a PVC Needle Complex. In one embodiment, the PVC effector is encoded by PAU_02806 (e.g. has an amino acid sequence of SEQ ID NO: 30). In one embodiment, the PVC effector is encoded by PAU_03332 (e.g. has an amino acid sequence of SEQ ID NO: 32).

In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of one or more selected from SEQ ID NO: 1 - SEQ ID NO: 46 (for example SEQ ID NO: 22 - SEQ ID NO: 46), or a combination thereof. For example, the PVC effector may comprise (or consist essentially of) a sequence selected from SEQ ID NO: 4, SEQ ID NO: 22, SEQ ID NO: 25, SEQ ID NO: 30, SEQ ID NO: 32 and SEQ ID NO: 46.

In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO.: 4. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO. 22. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO. 25. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO: 30. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO: 32. In one embodiment, the PVC effector comprises (or consists essentially of) an amino acid sequence of SEQ ID NO. 46.

The term“packaging” (used synonymously with the terms“trans-packaging” and“loading”) means the directing of a payload, by a leader sequence of the invention (to which the payload is linked/ fused), into the interior, or end (tip), of an assembled PVC Needle Complex, such that the PVC Needle Complex is subsequently configured for delivering (e.g. injecting) the payload into a target cell. Thus, the payload may be packaged within a PVC Needle Complex, or may be packaged at the end (or tip) of the PVC Needle Complex (e.g. at least a portion of the payload may be external to the PVC Needle Complex).

The term“payload” (used synonymously with the term“warhead” herein) means a molecule which is packaged into the interior, or end (tip), of an assembled PVC Needle Complex, and subsequently delivered (e.g. injected) into a (target) cell. In wild-type Photorhabdus, the payload is a PVC effector (more particularly, the effector portion of said PVC effector), encoded (as described above) by a gene that is downstream to (3’ to) the structural genes of a PVC operon. For example, see model PVC operon of Figure 1(D), having effector genes PAU_03337 (listed as PVCpnf 17), encoding an adenylate cyclase effector (e.g. SEQ ID NO.: 33); and PAU_03332 (listed as PVCpnf 21), encoding a Pnf effector (e.g. SEQ ID NO.: 32). A leader sequence and a payload of the present invention form an“effector fusion” (or simply “fusion”) that is“distinct from a (e.g. wild-type) PVC effector” (e.g. a polypeptide encoded by one of the genes outlined in Table 1). For example, the effector fusion may be a chimaera, formed of a leader sequence from a first PVC effector fused to (an/the effector portion of) a second (different) PVC effector (preferably amino acids 51 to the C-terminal amino acid of said second PVC effector), wherein said first PVC effector and said second PVC effector are different. The effector fusion may be a chimaera, comprising (or consisting essentially of) a leader sequence described herein fused to a non-PVC effector polypeptide. The effector fusion may be a chimaera, comprising (or consisting essentially of) a leader sequence described herein fused to a non -Photorhabdus polypeptide. The effector fusion may be a leader sequence-nucleic acid fusion (preferably conjugate), comprising a leader sequence described herein fused to a nucleic acid.

An effector fusion is not limited to a fusion complex comprising a leader sequence fused to a toxic payload (e.g. the leader could be fused to a therapeutic payload). Thus, the term “effector” as used in the context of “effector fusion” means the payload which is packaged into the PVC Needle Complex (which could provide a variety of effects, including toxigenic and/or therapeutic effects). Thus, the term“effector fusion” may be used interchangeably with the term“fusion” herein.

The term “effector fusion” may be used synonymously with the term “leader sequence- payload fusion”, and/or“leader sequence-payload complex”.

Alternatively or additionally, the payload may be distinct from a PVC effector protein (e.g. distinct from amino acids 51 to the C-terminal amino acid of a PVC effector). For example, the payload may be a polypeptide or nucleic acid that is not found in a wild-type Photorhabdus bacterium.

Analysis of the size (e.g. polypeptide length) and structure of the various natural PVC effector payloads encoded by Photorhabdus, shows that there exists a wide variety of different PVC effector lengths and structures, demonstrating that the applicability of the PVC Needle Complex delivery system of the present invention is not limited by the size or properties of the payload of interest. To summarise, there is no requirement for particular secondary structure, biophysical property, or length of cargoes, confirming that that the PVC Needle Complex can be utilised as a versatile multifunctional delivery vehicle.

The payload may be one or more selected from a polypeptide (e.g. a polypeptide payload), a nucleic acid (e.g. a nucleic acid payload), or a combination thereof. In a preferable embodiment, the payload is a polypeptide.

Examples of polypeptide payloads include an antibody (e.g. an anti-MDM antibody), a nanobody, a peptide vaccine (e.g. a tyrosinase-related protein 2 (TRP2) peptide vaccine), a nuclear factor-kB inhibitor, a T3SS payload (e.g. a T3SS payload which inhibits the NF-kB and/or MAPK pathways), an anti-apoptotic peptide (e.g. BH4), nicotinamide adenine dinucleotide quinone internal oxidoreductase (Ndi1), a PHOX complex subunit, a myotubularin, a nucleic acid (preferably DNA)-modifying enzyme, or a combination thereof. Examples of suitable nucleic acid-modifying enzymes include a recombinase (e.g. Cre recombinase), a transposase, a Cas enzyme (e.g. Cas9), and/or a Mad7 (preferably Mad7, more preferably Cre recombinase). The payload may be, for example, tBid (SEQ ID NO.: 109) and/or BaxBH3 peptide (aa59-73) (SEQ ID NO.: 111).

Any polypeptide having enzymatic activity may be a payload.

A nucleic acid payload may be conjugated/ crosslinked to a leader sequence of the invention. For example, copper-free click chemistry (e.g. strain-promoted alkyne azide cycloaddition (SPAAC)) may be used to crosslink a nucleic acid to a leader sequence. Examples of nucleic acid payloads include a primer, an mRNA, a nucleic acid analogue, an aptamer, a small interfering RNA (siRNA), a microRNA therapeutic inhibitor (antimiR), a microRNA therapeutic mimic (promiR), a long non-coding RNA modulator, a single guide RNA (sgRNA), or a combination thereof.

The leader sequence may be fused directly or indirectly (e.g. by means of a spacer) to the payload. The leader sequence may be fused covalently or non-covalently to the payload. In a preferable embodiment, the leader sequence is covalently fused to the payload. For example, the fusion/ effector fusion may be a (recombinant) fusion protein comprising (or consisting essentially of) a PVC effector leader sequence fused to a (polypeptide) payload.

Another aspect of the invention provides an isolated nucleic acid comprising a nucleotide sequence which encodes a PVC effector leader sequence of the invention. Another aspect of the invention provides an isolated nucleic acid comprising a nucleotide sequence which encodes an effector fusion (e.g. fusion) of the invention, and optionally a nucleotide sequence which encodes a PVC Needle Complex.

Another aspect of the invention provides an expression vector comprising: a nucleic acid (preferably an isolated nucleic acid) comprising a nucleotide sequence which encodes a PVC effector leader sequence of the invention. Another aspect of the invention provides an expression vector comprising: a nucleic acid (preferably an isolated nucleic acid) comprising a nucleotide sequence which encodes an effector fusion (e.g. fusion) of the invention, and optionally a nucleotide sequence which encodes a PVC Needle Complex.

Another aspect of the invention provides a host cell comprising an isolated nucleic acid, the isolated nucleic acid comprising a nucleotide sequence which encodes a PVC effector leader sequence of the invention. Another aspect of the invention provides a host cell comprising an isolated nucleic acid, the isolated nucleic acid comprising a nucleotide sequence which encodes an effector fusion (e.g. fusion) of the invention, and optionally a nucleotide sequence which encodes a PVC Needle Complex.

The term“nucleic acid” may be used synonymously with the term“polynucleotide”.

Another aspect of the invention provides a host cell comprising an expression vector, the expression vector comprising a nucleotide sequence which encodes a PVC effector leader sequence of the invention. Another aspect of the invention provides a host cell comprising an expression vector, the expression vector comprising a nucleotide sequence which encodes an effector fusion (e.g. fusion) of the invention, and optionally a nucleotide sequence which encodes a PVC Needle Complex.

Said host cell may be a mammalian cell, an insect cell, a yeast cell, a bacterial cell (e.g. E. coli), or a plant cell. In a preferable embodiment, the host cell is a bacterial cell (preferably E. coli).

In one embodiment, the host cell is a Photorhabdus cell, optionally wherein the Photorhabdus cell comprises a PVC operon operably linked to an inducible promoter (e.g. see Example 3). The PVC operon may be endogenous to the Photorhabdus cell (e.g. the PVC operon may be PVCu4). Suitably, the Photorhabdus cell may be obtainable from the ATCC under accession no. ATCC 29999.

The sequences (e.g. leader sequence and/or nucleic acid sequence) of the present invention include sequences that have been removed from their naturally occurring environment, recombinant or cloned (e.g. DNA) isolates, and chemically synthesized analogues or analogues biologically synthesized by heterologous systems.

The leader sequence(s) and/or polynucleotide(s) of the present invention may be prepared by any means known in the art. For example, large amounts of the leader sequence(s) and/or polynucleotide(s) may be produced by replication and/or expression in a suitable host cell. The natural or synthetic DNA fragments coding for a desired fragment will typically be incorporated into recombinant nucleic acid constructs, typically DNA constructs, capable of introduction into and replication in a prokaryotic or eukaryotic cell. Usually the DNA constructs will be suitable for autonomous replication in a unicellular host, such as yeast or bacteria, but may also be intended for introduction to and integration within the genome of a cultured bacterial, insect, mammalian, plant or other eukaryotic cell lines.

The leader sequence(s) and/or polynucleotide(s) of the present invention may also be produced by chemical synthesis, e.g. a polynucleotide by the phosphoramidite method or the tri-ester method, and may be performed on commercial automated oligonucleotide synthesizers. A double-stranded (e.g. DNA) fragment may be obtained from the single stranded product of chemical synthesis either by synthesizing the complementary strand and annealing the strand together under appropriate conditions or by adding the complementary strand using DNA polymerase with an appropriate primer sequence.

When applied to a leader sequence or nucleic acid sequence, the term“isolated” in the context of the present invention denotes that the leader sequence and/or polynucleotide sequence has been removed from its natural genetic milieu and is thus free of other extraneous or unwanted coding sequences (but may include naturally occurring 5' and 3' untranslated regions such as promoters and terminators), and is in a form suitable for use within genetically engineered protein production systems. Such isolated molecules are those that are separated from their natural environment. SEQUENCE HOMOLOGY

Any of a variety of sequence alignment methods can be used to determine percent identity, including, without limitation, global methods, local methods and hybrid methods, such as, e.g., segment approach methods. Protocols to determine percent identity are routine procedures within the scope of one skilled in the art. Global methods align sequences from the beginning to the end of the molecule and determine the best alignment by adding up scores of individual residue pairs and by imposing gap penalties. Non-limiting methods include, e.g., CLUSTAL W, see, e.g., Julie D. Thompson et al., CLUSTAL W: Improving the Sensitivity of Progressive Multiple Sequence Alignment Through Sequence Weighting, Position- Specific Gap Penalties and Weight Matrix Choice, 22(22) Nucleic Acids Research 4673-4680 (1994); and iterative refinement, see, e.g., Osamu Gotoh, Significant Improvement in Accuracy of Multiple Protein. Sequence Alignments by Iterative Refinement as Assessed by Reference to Structural Alignments, 264(4) J. Mol. Biol. 823-838 (1996). Local methods align sequences by identifying one or more conserved motifs shared by all of the input sequences. Non-limiting methods include, e.g., Match-box, see, e.g., Eric Depiereux and Ernest Feytmans, Match-Box: A Fundamentally New Algorithm for the Simultaneous Alignment of Several Protein Sequences, 8(5) CABIOS 501 -509 (1992); Gibbs sampling, see, e.g., C. E. Lawrence et al., Detecting Subtle Sequence Signals: A Gibbs Sampling Strategy for Multiple Alignment, 262(5131 ) Science 208-214 (1993); Align- M, see, e.g., Ivo Van Walle et al., Align-M - A New Algorithm for Multiple Alignment of Highly Divergent Sequences, 20(9) Bioinformatics: 1428-1435 (2004).

Thus, percent sequence identity is determined by conventional methods. See, for example, Altschul et al., Bull. Math. Bio. 48: 603-16, 1986 and Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA 89:10915-19, 1992. Briefly, two amino acid sequences are aligned to optimize the alignment scores using a gap opening penalty of 10, a gap extension penalty of 1 , and the "blosum 62" scoring matrix of Henikoff and Henikoff (ibid.) as shown below (amino acids are indicated by the standard one-letter codes).

The "percent sequence identity" between two or more nucleic acid or amino acid sequences is a function of the number of identical positions shared by the sequences. Thus, % identity may be calculated as the number of identical nucleotides / amino acids divided by the total number of nucleotides / amino acids, multiplied by 100. Calculations of % sequence identity may also take into account the number of gaps, and the length of each gap that needs to be introduced to optimize alignment of two or more sequences. Sequence comparisons and the determination of percent identity between two or more sequences can be carried out using specific mathematical algorithms, such as BLAST, which will be familiar to a skilled person. ALIGNMENT SCORES FOR DETERMINING SEQUENCE IDENTITY

The percent identity is then calculated as:

Total number of identical matches

x 100

[length of the longer sequence plus the

number of gaps introduced into the longer

sequence in order to align the two sequences]

Substantially homologous polypeptides are characterized as having one or more amino acid substitutions, deletions or additions. These changes are preferably of a minor nature, that is conservative amino acid substitutions (see below) and other substitutions that do not significantly affect the folding or activity of the polypeptide; small deletions, typically of one to about 30 amino acids; and small amino- or carboxyl-terminal extensions, such as an amino- terminal methionine residue, a small linker peptide of up to about 20-25 residues, or an affinity tag.

CONSERVATIVE AMINO ACID SUBSTITUTIONS

Basic: arginine, lysine, histidine

Acidic: glutamic acid, aspartic acid

Polar: glutamine, asparagine

Hydrophobic: leucine, isoleucine, valine

Aromatic: phenylalanine, tryptophan, tyrosine

Small: glycine, alanine, serine, threonine, methionine In addition to the 20 standard amino acids, non-standard amino acids (such as 4- hydroxyproline, 6-N-methyl lysine, 2-aminoisobutyric acid, isovaline and a -methyl serine) may be substituted for amino acid residues of the polypeptides of the present invention. A limited number of non-conservative amino acids, amino acids that are not encoded by the genetic code, and unnatural amino acids may be substituted for polypeptide amino acid residues. The polypeptides of the present invention can also comprise non-naturally occurring amino acid residues.

Non-naturally occurring amino acids include, without limitation, trans-3-methylproline, 2,4- methano-proline, cis-4-hydroxyproline, trans-4-hydroxy-proline, N-methylglycine, allo- threonine, methyl-threonine, hydroxy-ethylcysteine, hydroxyethylhomo-cysteine, nitro- glutamine, homoglutamine, pipecolic acid, tert-leucine, norvaline, 2-azaphenylalanine, 3- azaphenyl-alanine, 4-azaphenyl-alanine, and 4-fluorophenylalanine. Several methods are known in the art for incorporating non-naturally occurring amino acid residues into proteins. For example, an in vitro system can be employed wherein nonsense mutations are suppressed using chemically aminoacylated suppressor tRNAs. Methods for synthesizing amino acids and aminoacylating tRNA are known in the art. Transcription and translation of plasmids containing nonsense mutations is carried out in a cell free system comprising an E. coli S30 extract and commercially available enzymes and other reagents. Proteins are purified by chromatography. See, for example, Robertson et al. , J. Am. Chem. Soc. 113:2722, 1991 ; Ellman et al., Methods Enzymol. 202:301 , 1991 ; Chung et al., Science 259:806-9, 1993; and Chung et al., Proc. Natl. Acad. Sci. USA 90:10145-9, 1993). In a second method, translation is carried out in Xenopus oocytes by microinjection of mutated mRNA and chemically aminoacylated suppressor tRNAs (Turcatti et al., J. Biol. Chem. 271 :19991-8, 1996). Within a third method, E. coli cells are cultured in the absence of a natural amino acid that is to be replaced (e.g., phenylalanine) and in the presence of the desired non-naturally occurring amino acid(s) (e.g., 2-azaphenylalanine, 3-azaphenylalanine, 4-azaphenylalanine, or 4-fluorophenylalanine). The non-naturally occurring amino acid is incorporated into the polypeptide in place of its natural counterpart. See, Koide et al., Biochem. 33:7470-6, 1994. Naturally occurring amino acid residues can be converted to non-naturally occurring species by in vitro chemical modification. Chemical modification can be combined with site-directed mutagenesis to further expand the range of substitutions (Wynn and Richards, Protein Sci. 2:395-403, 1993).

A limited number of non-conservative amino acids, amino acids that are not encoded by the genetic code, non-naturally occurring amino acids, and unnatural amino acids may be substituted for amino acid residues of polypeptides of the present invention.

Essential amino acids in the polypeptides of the present invention can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, Science 244: 1081-5, 1989). Sites of biological interaction can also be determined by physical analysis of structure, as determined by such techniques as nuclear magnetic resonance, crystallography, electron diffraction or photoaffinity labeling, in conjunction with mutation of putative contact site amino acids. See, for example, de Vos et al., Science 255:306-12, 1992; Smith et al., J. Mol. Biol. 224:899-904, 1992; Wlodaver et al., FEBS Lett. 309:59-64, 1992. The identities of essential amino acids can also be inferred from analysis of homologies with related components (e.g. the translocation or protease components) of the polypeptides of the present invention.

Multiple amino acid substitutions can be made and tested using known methods of mutagenesis and screening, such as those disclosed by Reidhaar-Olson and Sauer (Science 241 :53-7, 1988) or Bowie and Sauer (Proc. Natl. Acad. Sci. USA 86:2152-6, 1989). Briefly, these authors disclose methods for simultaneously randomizing two or more positions in a polypeptide, selecting for functional polypeptide, and then sequencing the mutagenized polypeptides to determine the spectrum of allowable substitutions at each position. Other methods that can be used include phage display (e.g., Lowman et al. , Biochem. 30:10832-7, 1991 ; Ladner et al., U.S. Patent No. 5,223,409; Huse, WIPO Publication WO 92/06204) and region-directed mutagenesis (Derbyshire et al., Gene 46:145, 1986; Ner et al., DNA 7:127, 1988).

Multiple amino acid substitutions can be made and tested using known methods of mutagenesis and screening, such as those disclosed by Reidhaar-Olson and Sauer (Science 241 :53-7, 1988) or Bowie and Sauer (Proc. Natl. Acad. Sci. USA 86:2152-6, 1989). Briefly, these authors disclose methods for simultaneously randomizing two or more positions in a polypeptide, selecting for functional polypeptide, and then sequencing the mutagenized polypeptides to determine the spectrum of allowable substitutions at each position. Other methods that can be used include phage display (e.g., Lowman et al., Biochem. 30:10832-7, 1991 ; Ladner et al., U.S. Patent No. 5,223,409; Huse, WIPO Publication WO 92/06204) and region-directed mutagenesis (Derbyshire et al., Gene 46:145, 1986; Ner et al., DNA 7:127, 1988).

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Singleton, et al., DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY, 20 ED., John Wiley and Sons, New York (1994), and Hale & Marham, THE HARPER COLLINS DICTIONARY OF BIOLOGY, Harper Perennial, NY (1991) provide the skilled person with a general dictionary of many of the terms used in this disclosure.

This disclosure is not limited by the exemplary methods and materials disclosed herein, and any methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of this disclosure. Numeric ranges are inclusive of the numbers defining the range. Unless otherwise indicated, any nucleic acid sequences are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxy orientation, respectively.

The headings provided herein are not limitations of the various aspects or embodiments of this disclosure.

Amino acids are referred to herein using the name of the amino acid, the three letter abbreviation or the single letter abbreviation. The term“protein", as used herein, includes proteins, polypeptides, and peptides. As used herein, the term“amino acid sequence” is synonymous with the term“polypeptide” and/or the term“protein”. In some instances, the term“amino acid sequence” is synonymous with the term“peptide”. In some instances, the term“amino acid sequence” is synonymous with the term“enzyme”. The terms "protein" and "polypeptide" are used interchangeably herein. In the present disclosure and claims, the conventional one-letter and three-letter codes for amino acid residues may be used. The 3- letter code for amino acids as defined in conformity with the lUPACIUB Joint Commission on Biochemical Nomenclature (JCBN). It is also understood that a polypeptide may be coded for by more than one nucleotide sequence due to the degeneracy of the genetic code.

Other definitions of terms may appear throughout the specification. Before the exemplary embodiments are described in more detail, it is to be understood that this disclosure is not limited to particular embodiments described, and as such may vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present disclosure will be defined only by the appended claims.

Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limits of that range is also specifically disclosed. Each smaller range between any stated value or intervening value in a stated range and any other stated or intervening value in that stated range is encompassed within this disclosure. The upper and lower limits of these smaller ranges may independently be included or excluded in the range, and each range where either, neither or both limits are included in the smaller ranges is also encompassed within this disclosure, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in this disclosure.

It must be noted that as used herein and in the appended claims, the singular forms“a”,“an”, and“the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to“an effector” includes a plurality of such effectors and reference to“the effector” includes reference to one or more effectors and equivalents thereof known to those skilled in the art, and so forth.

The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that such publications constitute prior art to the claims appended hereto.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the invention will now be described, by way of example only, with reference to the following Figures and Examples.

Figure 1 shows (A) a schematic representation of one PVC operon layout (gene clusters present in varying regions of the originating genome) encoding a PVC Needle Complex. (B) A schematic representation of Class I, II and III PVC operon layouts. Homologous subunit types amongst the classes are show as having similar shading (in grey scale). (C) An illustration of an assembled PVC Needle Complex. The numbering shown is used to correlate a gene cluster in (A) with the position of the encoded proteins in the structure in (C) (e.g. the cap Ί6’ cluster in A is shown as Ί6’ in the left-most cap region of (B)). (D) A map of the model Class I PaATCC 43949 PVCpnf operon (e.g. encoded by SEQ ID NO.: 93), showing two effector genes in the payload region (Rhs-like adenylate cyclase, and PAU_03332). Figure 2 shows an overview of a cloning procedure for preparation of PVC Needle Complex expressing plasmids, based on overlapping PCR. PCR fragments (having overlapping regions) are provided from template gDNA of P. asymP/of/ca ATCC43949 (available from the ATCC under accession no. ATCC 43949) with relevant primers targeting the PVC operon. Figure 3 shows a transmission electron micrograph of an (in vitro) sample of PVC Needle Complexes (e.g. prepared from cells having the expression vector described above). The PVC Needle Complexes assemble in a distinct‘nanosyringe’ structure, consistent with its role as a contractile structure. A 3D rendered model of a PVC Needle Complex as derived from high resolution single particle cryo-EM tomography structure is shown in (B).

Figure 4 shows (A) a transmission electron micrograph of a PVC Needle Complex comprising a Pnf payload following immuno-gold staining with an anti-Pnf (immunogold) antibody, confirming the Pnf-payload toxin is associated with the PVC Needle Complex (referred to as PVC pnf). PVCpnf Needle Complexes were prepared from supernatants of an E. coli cosmid clone, which encodes the PVCpnf operon. Anti-peptide antibodies against the Pnf (TGQKPGNNEWKTGR, SEQ ID NO: 96) epitope were used to localise the payload toxin protein. The Pnf toxin could only be detected at the ends of broken or contracted needle complex, providing evidence that the toxins are contained within the complex (arrows). (B) Western blot analysis confirms that the Pnf protein (toxin) can only be detected using the anti-peptide antibody if the PVC Needle Complex is either chemically or physically disrupted. These preparations were taken from Pa AT cc 43949 supernatants. The inability to detect Pnf in clarified supernatants confirms all the protein is associated with the PVC Needle Complex enrichment preparations. Lanes 1 +5; sonicated samples, 2+6; 1 M NaCI treatments, 3+7; 1 % SDS treatments 4+8; 1 M Urea treatments. Note the PVC Needle Complex appears stable in 1 M NaCI.

Figure 5 shows cryo-SEM image of ex vivo hemocytes (insect macrophage/neutrophil equivalents) from 5th instar Manduca sexta that had been injected with a native (A) or heat inactivated (B) enriched preparation of PaATcc43949 pvc pnf Needle Complexes (nanosyringes) heterologously produced by an E. coli cosmid clone. Note the abundant linear structures corresponding to PVC Needle Complexes (nanosyringe) (small arrows) and membrane ruffling effect (large arrows), consistent with the mode of action of the Pnf payload toxin, which are absent from the control treatment. Scale bar = 50pm. 25kV; magnification 40 K (A) and 50K (B).

Figure 6 shows experimental results demonstrating the (toxic) cellular phenotype following contact with a PVC Needle Complex is due to intracellular toxin delivery. (A) A Pnf loaded PVC Needle Complex was injected into insects ( Galleria mellonella insect larvae), showing potent activity within 15 minutes for the given dose (explained in the examples) - note mortality/morbidity is typically associated with the“melanisation” immune response in these dead/dying insects. (B) A control, denatured (via boiling) Pnf loaded PVC Needle Complex injected into animals showed no activity. (C) Purified Pnf (payload), absent the PVC Needle Complex (i.e. Pnf not packaged into the complex), showed no activity against either animals (left) or a HeLa cell line (right). (D) Pnf (payload) delivered into the cytosol of HeLa cells - via ‘BioPorter’ liposomal preparations containing the protein, or by intracellular expression following transfection with an appropriate plasmid (E) - showed potent activity/toxicity, as evidenced by multi-nucleation in the cells. (F) - The effect of PVCpnf+Pnf on the respiration rate of THP1 derived human macrophages as measured by Resazurin plate reader assay. Note the heat denatured and empty PVCpnf nanosyringes showed no strong adverse effect. These same samples were tested by injection into Galleria larvae. The PVCpnZ+Pnf samples showed over around 50% mortality within minutes (darkened larvae in the bottom two panels) while the heat denatured and empty PVCpnf injected insects all remained healthy (no darkened larvae in the top two panels).

Figure 7 shows (in silico ) predicted secondary structures of a range of the endogenous payload (toxin) associated with various PVC operons, demonstrating the large variety of structure types. (B) The amino acid length of various payloads (toxins) plotted against predicted isoelectric point.

Figure 8 shows confirmation that leader sequences (e.g. having 50 amino acids) of the invention are necessary and sufficient for (trans-)packaging payload proteins/peptides into PVC Needle Complexes (nanosyringes) expressed in Photorhabdus. (A) 1-6: Schematic maps of chimeric effector protein expression constructs (trans-expressed in the arabinose- inducible pBAD30 vector), including those expressing Pnf and non-native cre-recombinase and Myc-tags. C-terminal Myc-tag epitopes are shown as black arrows. (B) Western blots using anti-Myc mouse antibody. Samples are from purified PVC(u4) Needle Complexes (nanosyringes) overexpressed from chromosomally engineered P. luminescens TT01 which harbour the trans-packaging expression constructs 1-6 shown in (A). A blank pBAD30 plasmid was used as a negative control and showed no signal. Arrows show correct band sizes for expected products.

Figure 9 shows an alignment of the leader sequences, demonstrating the presence of a chemical composition consensus amongst the leader sequences, based on amino acid properties. More particularly, the leader sequences comprise similar charge patterns, of 2x negatively charged regions, each followed by a positively charged region [-ve] [+ve] [-ve] [+ve].

Figure 10 shows (A) western blot analysis of PVC Needle Complexes and payloads from particulate preparations (Cesium Chloride gradient and Monolith FPLC preparations, as described in Materials and Methods). In [1] (pBADPVCpnf, in which PVC16 of the nanosyringe is FLAG-tagged providing PVC16::FLAG detectable with AntiFLAG Ab), a signal from the tagged cap protein of“PVCPnf” (PVC Needle Complex with a Pnf payload) can be seen, confirming the presence of PVC Needle Complexes in the purified fraction. In [2] (pBADPVCpnf + Cre::Myc, detectable with AntiMyc Ab, the Cre having an N-term fusion of the Pnf leader e.g. SEQ ID NO.: 78), a signal from the Myc-tagged payload protein packaged in abundance, in the same sample as (1), confirming presence of Cre payload in purified PVC Needle Complexes (nanosyringes). In [3] (PVCU4 + Cre::Myc, detectable with AntiMyc Ab, the Cre having an N-term fusion of the Pnf leader e.g. SEQ ID NO.: 78), a different PVC Needle Complex chassis (“PVCU4”) purification is probed for Myc-tagged Cre revealing a packaged (packaged Myc-tagged Cre) corresponding band. This is highlighted in the blot for clarity. (B) Transmission electron micrograph of a PVC Needle Complex, shows both wild- type (having a Pnf payload) PVC Needle Complexes and PVC Needle Complexes having an atypical (non-native) recombinase (Cre) payload, in any chassis tested, does not affect morphology of the PVC Needle Complexes, ensuring they are not assembled aberrantly. Figure 10 (C) provides additional/ complementary data to that of (A). In more detail, (C) provides further proof via Western blot analysis of (trans-)packaging of the Cre recombinase into purified PVC pnf expressed in E. coli. The Western blot demonstrates that for a given amount of Anti-FLAG antibody Western signal (a specific probe for the nanosyringe due the incorporation of PVC16::FLAG), a much higher amount of the Cre payload is detected (using the Anti-Myc tag antibody). The numbers denote 2-fold dilutions. Note, upon dilution, the anti- FLAG signal from the nanosyringe is lost, while the payload remains intense in most lanes. CsCI denotes purification by Caesium Chloride density gradient centrifugation. “Mon” denotes the samples were additionally anion exchanged via“Monolithic” columns. “Post- Elution”,“Interphase”,“Sub-lnterph.”, denote the liquid fractions where the signal is detected from the purification process. D - Western blot analysis of Cre trans-packaged into PVCpnf in E. coli. Payloads are probed for their incorporated ‘Myc’ tags (C-terminal fusions) after purification of the nanosyringe-payload complex. Western blot analysis of particle preps confirms that all four leaders could efficiently trans-package the exogenous Cre enzyme. E - A phylogenetic tree, demonstrating the exemplified leader sequences are well distributed throughout and are therefore at or close to maximally sequentially diverse (see Example 4.2). Figure 11 shows western blot analysis of PVC Needle Complexes expressed without (1) and with (2) concomitant expression of (Myc-tagged) Pnf from a separate plasmid, probed simultaneously with an anti-FLAG and anti-Myc antibody. In the lanes marked 1 , the PVC Needle Complex (nanosyringe) was expressed and purified without the presence of a ‘payload plasmid’ (an expression plasmid encoding a payload protein linked to a leader sequence) within E. coli. This leads to a band corresponding only to the FLAG tag present on the syringe (PVC Needle Complex) itself. For lanes 2, the same approach was undertaken, but using cultures which also included a (separate) plasmid bearing a tagged payload (Myc- Pnf). Bands can be seen which correspond to the FLAG and Myc tags, confirming presence of the Pnf payload (the four lanes within 1 and 2 are simply different purification fractions from Caesium Chloride gradients).

Figure 12 shows western blot analysis of trans-packing experiments in P. luminescens TT01 PVCu4 over-expression strain. Results demonstrate the trans-packaging of a myc-tagged Pvc17 (Plu1651whole::Myc).

Figure 13 shows further western blot analysis of trans-packaging experiments in P. luminescens TT01 PVCunit4 over-expression strain (as explained in the Examples). Results demonstrate trans-packing of Myc-tagged Pvc17 (Plu 1651 :: Myc) and a Myc tag alone using the leader of Pnf (PAU_03332 leader), and that the leader is necessary. (A) Lane 1 shows packaging of the leader of fused to a Myc-tag (PAU_03332::Myc); Lane 3 shows a lack of packaging when the leader sequence is absent (Myc only is not packaged); lane 4 shows lack of packaging of HvnA (a natural effector) when the leader sequence is absent; lane 6 shows packaging of Myc-tagged PAU_03332:: Plu 1649, i.e. a chimaera of the leader from PAU_03332 (i.e. amino acids 1-50 of PAU_03332) and the effector (i.e. amino acids 51-C- terminus) from Plu1649. The high intensity of bands in lanes 1 and 6 demonstrate that the Pnf (PAU_03332) leader is particularly effective at packaging a payload). (B) Lane 1 shows packaging of Plu1651 with a C-terminal Myc tag using an anti-Myc antibody Western blot. Figure 14 shows further Western blot analysis demonstrating the very high level of trans packaging of Myc-tagged Pnf (PAU_03332::Myc) using the PAU_02806 (GogB) leader (second lane, not including the ladder lane). The first lane demonstrates use of the Plu1649 leader for packaging the PAU_03332 effector (Myc-tagged Plu1649::PAU_03332). The band appears weak due to the relative intensity of the band in the second lane. The experiment involved filter sterilisation of 50 mL culture, 8 M final concentration of urea added to break down PVCs. Samples collected from 10 mL supernatant.

Figure 15 shows further western blot analysis demonstrating trans-packaging of Plu 1651 (pvc17) with a C-terminal Myc tag as described in Figure 13 into PVCunit4 expressed from Photorhabdus . Raw represents particulate preps from supernatants, Be, Be2 and IP represent different“cuts” from a Caesium chloride gradient purification.

Figure 16 (A) provides a diagrammatic explanation of the mechanism of action of Cre in the mouse organoid experiment (of Example 6), and how the positive control (TAM) facilitates Cre activation. White arrows show the location of cells expressing the tdTom fluorescent reporter gene. B - Demonstration of delivery of active trans-packaged Cre-recombinase into murine bile duct organoids by PVC pnf expressed and purified from E. coli. White circles show the location of groups of cells expressing the fluorescent reporter gene. The upper images show a direct grey scale conversion of an images obtained via light microscopy. The lower image shows a corresponding image with false-colour enhancement of positive cells, which is provided simply to aid identification of the difference between effected cells and surrounding unaffected ones within the former grey scale conversion.

Figure 17 shows a dot-blot analysis of nanosyringe expression both with a payload (the Cas9-like protein MAD7) and without. Some leaky expression of the IPTG inducible MAD7 is seen before induction (T1) as is common with this expression system. There is no Myc signal from the PVC only sample at any time point as expected, and the MAD7 signal grows throughout the expression over a ~24 hour period. Strong Myc signal is maintained post purification via ultracentrifugation as described elsewhere, indicating that the protein is incorporated into the nanosyringe chassis system. FLAG signal is robust in the MAD7 sample, and occurs as expected post-induction and persists post-purification, as this promoter system has reduced leaky expression. It is concluded that the nanosyringes and MAD7 are compatible with one another in terms of expression, and that MAD7, the largest protein tested to date, can be packaged in to the nanosyringe system.

Figure 18 shows western dot-blot analysis confirming trans-packaging of the pro-apoptotic tBid protein domain and BaxBH3 (both having the leader sequence of SEQ ID NO.: 78 fused to the N-term) peptide into purified PVCpnf expressed from E. coli {7 & 8). The nanosyringe with its cognate toxin“Pnf” is shown, as purified by 2 different methods (5 & 6) as a positive control. The blots at the bottom of the panel represent the same examples as in 7 & 8 in the panels above. These blots were made from another purification of the same constructs, demonstrating reproducibility of purification. This experiment demonstrated that“tBid protein domain and BaxBH3 peptide” packed samples (nanosyringes) can be successfully prepared, e.g. for used in the apoptosis delivery assays in Example 9.

Figure 19 (A) shows TUNEL-stain microscopic analysis from cells exposed to the packaged nanosyringes for 20 minutes only. First (left) bar = DNase I treated cells (+ control); Second bar = no DNAse I or nanosyringe treatment (- control); Third bar = cells were exposed to nanosyringes packaged with tBid (via leader sequence of SEQ ID NO.: 78 fused to the N- term); fourth (right) bar = cells were exposed to nanosyringes packaged with Bax_BH3 domain (via leader sequence of SEQ ID NO.: 78 fused to the N-term). B - Representative micrographs as described in Example 9, showing TUNEL staining of PBMC’s, following treatment with nanosyringes and controls. PBMCs were treated with tBID, Bax loaded nanosyringes, and the positive (DNase I treated cells) and negative (no DNase I treatment) controls for 20 minutes at room temperature before performing TUNEL staining to determine an apoptotic response. In the original (non-grayscale) micrographs: Cells negative for apoptotic response show blue or light brown staining. Blue staining (Methyl green) or light brown staining indicates healthy cells with absence of apoptotic signal. Dark brown staining indicates cells undergoing apoptosis. EXAMPLES

Materials and Methods

Cloning

Plasmids encoding PVC Needle Complexes were prepared using standard molecular techniques known in the art. Briefly, genomic DNA from P. asymb/of/ca ATCC43949 (obtainable from the ATCC under accession no. ATCC 43949) was used in PCR (with appropriate primers) to amplify multiple (e.g. four) overlapping regions of the PVC operon. Overlap/extension PCR was employed to prepare a whole operon, and fused (again using overlapping PCR) into an appropriate expression vector as detailed in Figure 1 (using the primers of SEQ ID NO: 101 - SEQ ID NO: 106).

Briefly: four overlapping PVC fragments (generated with primers of SEQ ID NO: 101 (F1) and SEQ ID NO: 105 (R1); SEQ ID NO: 102 (F2) and SEQ ID NO: 106 (R2); SEQ ID NO: 103 (F3) and SEQ ID NO: 107 (R3); and SEQ ID NO: 104 (F4) and SEQ ID NO: 108 (R4), respectively) were made covering the PVC operon (e.g. of SEQ ID NO: 93). The target cloning vector was cut at the required insertion site. These 5 DNA fragments were then assembled by overlapping PCR (using primers of SEQ ID NO: 101 and of SEQ ID NO: 108), and the resulting fragment was ligated into the cloning vector. Products were transformed into laboratory E. coli and recovered with vector marker selection (e.g. due to ampicillin resistance).

The operons are typically operably linked to an inducible promoter (e.g. arabinose inducible, and/or IPTG inducible) as is known in the art. This is generally achieved by cloning into pBAD family plasmids (inducible via arabinose) (Invitrogen, catalog number: V43001) and pVTRa (inducible via IPTG) (Biomedal, S.L.) vectors (although any combination of compatible expression vector systems should suffice).

A PVC Needle Complex can be expressed independently of the payload (toxin), and vice versa. Separate expression vectors (e.g. having differing inducible promoters) may harbour the PVC Needle Complex and the payload, respectively.

Expression (e.g. laboratory scale expression) / Purification of PVC Needle Complexes in E. coli

A typical process to purify a PVC Needle Complex from a 1 L culture of an E. coli expression strain (transformed with an appropriate expression vector/ cosmid) is as follows:

1- An overnight culture of the bacteria (transformed with PVC Needle Complex expression vector) is prepared by picking a colony from a plate and inoculating 100 mL of LB media. The culture is grown at 37°C with shaking.

a. Typically, the media may be routinely supplemented with 0.2% d-Glucose to aid repression of the genetic constructs for optimal cell health.

b. The media is also supplemented with the relevant antibiotics for maintenance of the expression (PVC Needle Complex) vector. If a payload vector is also being used, the relevant antibiotic for that vector is also supplied.

2- The next day, a 1 L flask is inoculated via dilution in a 1 :100 ratio from the overnight

culture. The media for the 1 L flask is identical to the overnight media but typically does not contain glucose. 3- Cultures are grown to approximately mid-to-late exponential (an OD600nm of -0.8) at which point the plasmids are induced.

a. For the PVC Needle Complex (nanosyringe) plasmid, typically 0.2% arabinose is added to induce expression. For the payload plasmid (plasmid encoding for the payload, such as Pnf), IPTG concentrations may typically be optimised on a per- protein basis, and a typical starting figure of 0.1 mM is preferable.

4- The cultures are returned to the incubator post-induction and cultured at 18°C until the following day.

5- Cultures are harvested by centrifugation in appropriate centrifuges/bottles/rotors at

5000xg for 30 mins.

6- Cell pellets are then lysed to release PVC Needle Complexes (nanosyringes).

a. The following lysis methods may be used:

(i) Lysozyme incubation overnight (ii) Sonication with a needle sonicator (with or without first treating with lysozyme (iii) Cell

disruptor/homogenisers.

7- Optionally, DNAse, and protease inhibitors can be added to the lysate.

8- Cell debris is removed by centrifugation at 50,000xg, 4°C, for 20 minutes in a high speed centrifuge.

9- Concentrate the lysate through a 100,000 kDa MWCO centrifugation column to reduce volumes and remove small proteins. Once the volume is down to a manageable volume, centrifuge several times replacing the retentate solution with an appropriate sample buffer such as TM (20 mM Tris-HCI, 8 mM MgCI 2 , pH 7.4) to dialyse.

A subsequent process for purification via Caesium Chloride density gradient is as follows:

1. Prepare CsCI density solutions as follows:

(a) 1.7 g/mL CsCI in H 2 0; (B) 1.5 g/mL CsCI in H 2 0; (C) 1.45 g/mL CsCI in H 2 0

2. Gradients (from bottom-to-top of the tube) are then set up in ultracentrifuge tubes like so:

(1) (bottom of tube) - 2mL density, 1.7 CsCI; (2) - 3ml density, 1.5 CsCI; (3) - 3mL density, 1.45 CsCI; (4) (top of tube) - sample in TM buffer. Suitably, apply each density carefully to side of tube so as not to blend the boundary with the previous density layer.

3. Balanced tubes are then subjected to ultracentrifugation at 35,000 RPM in an SW40Ti swinging bucket rotor, equivalent to 155,000 xg, for 2 hours, 4°C.

4. The correct gradient fraction will be the region just above a‘blue-ish-white’ halo that appears. Fractions are extracted via puncturing the tube with a syringe and needle.

5. PVC Needle Complexes of good purity can be obtained in this manner, and stored in buffer at 4°C. Suitably, dialyse back in to TM buffer to remote the CsCI.

Following, or in place of CsCI gradient purification, PVCs can be extracted via Monolith anion exchange chromatography, as follows (note all steps can be performed manually with a peristaltic pump or syringe apparatus, or via F/HPLC):

1. Unless already done, dialyse the sample extract into the binding mobile phase (typically TM buffer) with a low concentration of salt (20 mM NaCI).

2. Equilibrate the column according to the manufacturer’s guidelines, briefly:

a. At least 5 Column Volumes (CV) of dH 2 0;

b. At least 5 CV of binding buffer (TM, with low salt);

c. At least 5 CV of elution buffer (TM with high salt, >= 1M NaCI);

d. At least 10 CV of binding buffer once more. 3. Apply the sample to the column at a low flow rate (1-2ml_/min)

4. Wash the column with up to 200 mM NaCI-containing TM buffer.

5. Elute with 1 M NaCI-containing TM buffer (alternatively, use a gradient elution if using an FPLC machine).

6. PVC Needle Complexes are present in the elution fractions. If a fraction collector is used, subsequent SDS-PAGE or similar may be needed to identify the correct fraction.

The column (of e.g. step 2) was of the CIMmultus(™) Quaternary Amine anion exchange columns (BIA Separations d.o.o.). For example, the CIMmultus™ QA-1 , which is a monolithic column with 1.3 pm channel size and a column volume of 1 ml_.

Alternatively, a DEAE (a weak anion exchanger) column may be used.

Alternatively, for use with a Photorhabdus expression system, PVC Needle Complexes can be purified from supernatants as well as/instead of cell pellets, with the following

additions/modifications:

1. Following cell harvest from the standard protocol above, supernatants are transferred to a pyrex bottle, and can optionally be concentrated via 100,000 MWCO columns if necessary.

a. DNAse (0.25U/ml_) and protease inhibitors can optionally be added.

2. NaCI is added to a final concentration of 0.5M, and 80 g/L of PEG6000 is also added.

The solution is mixed at 4°C overnight.

3. The solution is centrifuged to pellet the PEG6000 at 8000xg, 4°C for 30 mins.

4. The pellet is resuspended in a small volume (~5 ml_) of TM buffer (or similar) and

incubated for 2 hours at room temperature, shaking.

5. Pellet by centrifugation at 13,000 xg for 10 mins, and collect the supernatant to a new tube. Proceed with purification method of choice.

Other methods for purifying PVC Needle Complexes have been described elsewhere, for example in Yang et al (J Bacteriol. 2006 Mar; 188(6): 2254-2261), incorporated herein by reference.

Construction of an arabinose inducible over-expression strains for P. luminescens TT01

PVCunit4 (chassis encoded by genes plu1667 - plu1652)

Photorhabdus strains overexpressing a PVC Needle Complex were prepared using chromosomal recombineering to place a PVC (operon) of choice (operon encoding PVCunit4 Needle Complex was used here, as an example) under the control of an arabinose inducible transcription promoter. The recombineered strains are then genetically transformed with effector expression plasmids (e.g. based on the arabinose inducible expression vector pBAD30) to facilitate PVC Needle Complex over-expression, PVC effector expression, PVC effector trans-packaging, and secretion of the whole complex simply through the addition of the arabinose sugar.

Recombinant Photorhabdus PVC over-expression strain construction

The promoter region of PVCunit4 was amplified using primers PVCpromF (5’- T AT CAT AT GT CT ACAACTCCAGAACAAATTGCT G-3’ , SEQ ID NO: 97) and PVCpromR (5’- ATCTCTAGAACAGATATTCCAGCCAGC-3’, SEQ ID NO: 98) using genomic DNA from P. luminescens strain DJC (aka strain TT01) as a template. A suitable P. luminescens strain is obtainable from the ATCC under accession no. ATCC 29999. The PCR product was digested with Ndel and Xbal and introduced by ligation into the suicide vector pCEP (ThermoFisher, catalog number: V04450), using E. coli DH5a l-pir (Biomedal S.L.) as the carrier strain. The resulting plasmid was transferred to the E. coli donor strain S17.1 l-pir (Biomedal S.L.) for conjugation into Photorhabdus. Briefly, overnight cultures of the donor strain and a rifampicin resistant (RifR) isolate of P. luminescens DJC were diluted in LB supplemented with 10 mM MgSCL and grown to mid-exponential (OD600 -0.5). Then, 3 ml of each culture were harvested, washed twice and re-suspended in 100 pi of LB supplemented with 10 mM MgSCL. 80 mI of P. luminescens DJC RifR were mixed with 20 mI of the donor bacteria (resulting in a recipient to donor ratio of 4:1) and placed in the centre of an LB agar plate supplemented with 0.1 % pyruvate and 10 mM MgSCL. The plate was incubated overnight at 30°C and the resulting growth was harvested in 1.5 ml LB. Aliquots were plated on plates containing rifampicin (50 pg/ml) and chloramphenicol (25 pg/ml) to select for trans-conjugants and the plates were incubated at 30°C for 3 days. Possible transconjugants were re-streaked and confirmed by PCR using primers ParaINF (5’- GGCGTCACACTTTGCT AT G-3’ , SEQ ID NO: 99) and tPVCpR (5’- TCGGTGGCAGTAAATTGTCC-3’, SEQ ID NO: 100).

PVC Needle Complex over-expression and purification from Photorhabdus

Overnight cultures of P. luminescens DJC PVCunit4::pCEP were diluted in 2x 250 ml LB supplemented with chloramphenicol (25 pg/ml) and incubated at 28 °C, 180 rpm. After 2-3 h, arabinose (0.2 %) was added and the cultures were returned to the incubator for another 26 h. The cells were pelleted by centrifugation (7000 g for 30 min) and the supernatant was collected. DNAse I was added to the supernatant at a concentration of 0.25 U/ml to degrade any extracellular DNA. Following an incubation of 30 min at room temperature, polyethylene glycol 8000 (8 %) and NaCI (0.5 M) were added to precipitate the proteins. The supernatants were incubated overnight at 4°C, stirring. The precipitated proteins were then collected by centrifugation at 8000 g for 30 min at 4°C. The pellets were re-suspended in 8 ml TM buffer (20 mM TrisHCI, 20 mM MgCI2, pH7.4) and incubated at room temperature for 2h with gentle shaking. Any remaining debris was removed by centrifugation at 13000g for 10 min and the supernatant containing PVC Needle Complexes was applied to a CsCI density gradient and centrifuged at 35000 rpm for 2h in a Beckman coulter Optima L-90K or XPN-80K ultracentrifuge. The CsCI density gradient was made by layering TM buffer containing CsCI at p = 1.7 (2 ml), 1.5 (3 ml), and 1.45 (3 ml) from the bottom of the tube, respectively. The fraction containing PVC Needle Complexes was collected and UltraceMOOK devices (Amicon) were used to remove the CsCI and exchange the buffer for TMS (20 mM TrisHCI, 8 mM MgS04, pH7.4). The PVC Needle Complexes were further purified using a CIMmultus™ quarternary amine 2 pm pore anion exchange column (BIAseparations). The column was washed with TMS buffer containing 200 mM NaCI and the PVC Needle Complexes were eluted in TMS containing 1 M NaCI. The NaCI was removed by buffer exchange using an UltraceMOOK device and the sample was applied to a CIMmultus™ DEAE 2 pm pore column (BIA separations) for a final purification. The column was washed in TMS containing 200 mM NaCI and the sample was eluted in TMS containing 500 mM NaCI. It is possible to perform this with and without lysis (e.g. because the PVC Needle Complexes appear to be secreted from live cell, and can be collected in supernatant) of the cells (to release the PVC Needle Complexes).

Transmission electron microscopy

For transmission electron microscopy (TEM) pioloform-covered 300-mesh copper grids that were coated with a fine layer of carbon were used as substrates for the protein fractions. A preferred aqueous negative stain is 3% methylamine tungstate. The coated grids were exposed to UV light for 16 h immediately prior to use to ensure adequate wetting of the substrate. A 10 pi drop was applied to the TEM grid, and the protein was allowed to settle for 5 min. Liquid was absorbed with filter paper from the edge of the grid and replaced immediately with 10 mI of filtered negative stain. The drop was partially removed with filter paper, and the grids were allowed to air dry thoroughly before they were viewed with a JEOL 1200EX transmission electron microscope (JEOL, Tokyo, Japan) operating at 80 kV.

BioPORTER assay and actin stress fibre analysis.

For BioPORTER assays (Genlantis), 80 mI of purified wild-type and mutant Pnf proteins (500 pg ml-1), or PBS as a negative control, were added to one BioPORTER tube (Genlantis) and re-suspended in 920 mI of DMEM. The samples were added to HeLa cells grown in 6- well plates and incubated for 4h. BioPORTER/protein or PBS mixes were replaced by fresh complete medium and the cells were incubated for 20-48 h. To visualize cell morphology and actin cytoskeleton, cells were fixed for 15 min in 4% PBS-formaldehyde, permeabilized with 0.1% Triton X-100 and stained with Tetramethylrhodamine B isothiocyanate (TRITC)- phalloidin (Sigma) and DAPI dihydrochloride (Sigma). Images were acquired with a LSM510 confocal microscope (Leica).

EXAMPLE 1

Cloning and expression of PVC Needle Complexes

The inventors have successfully excised (cloned) the required expression genes from the host bacterium, Photorhabdus (e.g. which are comprised within SEQ ID NO: 93, SEQ ID NO.:94 and/or SEQ ID NO:.95), and have devised a reliable, scalable expression system in laboratory E. coli as explained above. It has been demonstrated that trans-expression on separate plasmids enables incorporation of payloads (e.g. Pnf) into the syringes, creating a multi-plasmid (modular) platform.

Following purification from E. coli, electron microscopy analysis demonstrated that the purified PVC Needle Complexes retained the correct‘nanosyringe’ structure (see Figure 3). Furthermore, PVC Needle Complexes remained correctly associated with the payload (e.g. Pnf) following purification (see Figure 4), demonstrating that the inventors have successfully prepared the PVC Needle Complexes (nanosyringes) having the correct structure for payload delivery to cells.

Furthermore, electron microscopy analysis demonstrated that the purified complexes appropriately localise to the cell surface of cells, and PVC Needle Complexes with a Pnf payload (PVC rhή induces a phenotype (ruffling) consistent with the postulated mechanism of the effector (PVC) - see Figure 5. EXAMPLE 2

2.1 Demonstrating PVC Needle Complexes exert effect via intracellular delivery of effector

The polypeptide Pnf was identified as a PVC effector as follows. This was identified within the Photorhabdus asymbiotica ATCC43949 complete genome - GenBank Accession Number: FM 162591.1.

The final gene of the PVC operon (P. asymbiotica ATCC43949 PVCpnf operon, which has a sequence of SEQ ID NO: 93) was identified, namely pvc16 (e.g. PAU_03338). The position of the pvc16 genes of a PVC locus is illustrated in Figures 1(A), (B) and (D). ORFs shortly 3’ of pvc16 (e.g. within about 5kb downstream of pvc16) were identified - one such ORF (PAU_03332) being 3535bp downstream of pvc16. The predicted function of the polypeptide (having a sequence of SEQ ID NO.: 32) encoded by this putative effector ORF was obtained by a combination of BlastP and HHPRED (https://toolkit.tuebingen.mpg. de/#/tools/hhpred). This ORF could then be assigned as a PVC effector based on direct homology to a known bacterial toxin (e.g. of the CNF1 family from E. coli).

A Pnf loaded PVC Needle Complex was then prepared according to Example 1.

The inventors have demonstrated that these packaged (e.g. laden) PVC Needle Complexes exert cellular effects consistent with the provenance of the cargoes they carry. By way of example, cells and whole insect animals exposed to PVC Needle Complexes loaded with the cytoskeleton toxin Pnf undergo cell death in a manner consistent with cytoskeleton toxicity.

Injection experiments (injection into the insect larvae) were performed by injection of 10mI of supernatant, provided following centrifugation (pelleting) of an overnight culture (typically 1 L) of a culture of E. coli harbouring a cosmid clone encoding the PVC Needle Complex with Pnf (PVCPnf) - e.g. a PVC encoded by SEQ ID NO.: 93, packaged with a PVC effector of SEQ ID NO.: 32.

Demonstrating that the PVC Needle Complexes are responsible for the phenotype due to intracellular delivery (e.g. injection) of the Pnf payload, the toxic effect could only be reconstituted when the same protein (Pnf) is provided with another route to access the cell cytosol (transfection and expression of an expression plasmid, or conductance via liposomal preparations containing the protein) - see Figure 6. Conversely, denatured (via boiling) PVC Needle Complex preparations, toxin proteins overlaid on tissue culture cells or toxin proteins injected into whole animals showed no activity.

2.2. Evidence of delivery of the toxic effector enzyme Pnf into cultured human macrophages

To complement the data outlined above, the inventors conducted additional experimentation providing further evidence of delivery of the toxic effector enzyme Pnf into cultured human macrophages.

Concept: The inventors tested PVCpnf expressed and purified from E. coli, (trans-)packaged with the native Pnf toxin on cultured human THP1 derived macrophages. Unlike the lethal effect of the Pnf toxin in insect models, previous liposome mediated Pnf protein transfection experiments indicated a subtler phenotype in human Hela cells. In those experiments the cells showed actin stress fibre formation at 24h and multinucleation at 48 h. The inventors therefore tested the effect of the purified PVC pnf (the nanosyringe) holding/ packaged with the Pnf PVC effector on macrophage respiration rate using a Resazurin colourimetric assay.

Methods:

Background behind Resazurin assays. The blue compound resazurin was explored for use in assays to determine the activity of PVCs on macrophages (M0). Resazurin is metabolically reduced in cell mitochondria, producing a pink and highly fluorescent compound, resorufin. The effect of PVCs on macrophage metabolism can be determined by introducing resazurin into the culture media. The number of macrophages affected by PVCs can be inferred by comparing the fluorescence measured to that of the cell density optimisation curve (see Czekanska, Methods in Molecular Biology, 2011 , 740, 27-32, incorporated herein by reference).

Optimisation of use of Resazurin for THP1 derived macrophages. The metabolism of macrophages over 18 h was assessed at different seed densities to determine the optimum cell density for use of this assay with PVCs. A 30 ml_ culture of THP-1 cells was pelleted at 1000 rpm for 4 min, before resuspension in 2 ml_ of RPMI media (also containing 10 % FBS (v/v) and 2 mM L-glutamine). Cells were counted using a cell haemocytometer, then diluted in media to a density of 2x10 6 cells mL· 1 . THP-1 cells were then activated with phorbol 12- myristate- 13-acetate (PMA) immediately before plating. 200 pl_ of the cells were plated in quadruplicate in a 96-well plate, and a 2-fold serial dilution was performed until reaching a final cell density of 1.5625x10 3 cell mL· 1 . 125 mI_ of the starting cell dilution was also plated in quadruplicate on the same plate, for a 5-fold serial dilution, until reaching a cell density of 0.32x10 3 cells mL· 1 . Four blank wells were also prepared, containing RPMI and PMA. The plate was incubated at 37 °C with 5 % CO2 for 48 h. Media was aspirated from the wells and replaced with fresh RPMI, and the macrophages were incubated for a further 24 h. A resazurin tablet (VWR) was dissolved in RPMI (12.5 mg/ml_), and 10 mI_ added to each well in quick succession (well concentration of 1.25 mg/ml_). The fluorescence produced was measured on a plate reader every 30 min for 18 h (excitation: 530-570 nm, emission: 580- 620 nm, maintained at 37 °C and 5 % CO2). The optimum cell density over time was then determined for use with PVCs.

Use of assay for PVC testing. THP-1 cells, diluted to 1.25x10 5 mL· 1 , were activated and seeded in a 96-well plate, where wells contained 100 mI_ of cells at a final well density of 1.25x10 4 cells mL· 1 . Blank wells were also prepared in quadruplicate, containing cells without PVC samples, as well as wells containing media and PMA only. The plate was incubated for 48 h at 37 °C with 5 % CO2. The media was then replaced with fresh RPMI, before addition of 10 pL of each PVC sample. The plate was incubated for a further 24 h, before the addition of 10 mI_ resazurin (12.5 mg/ml_) to each well, and the fluorescence was measured every 30 min for 18 h (excitation: 530-570 nm, emission: 580-620 nm, maintained at 37 °C and 5 % C0 2 ).

Results: Figure 6F shows that challenge with PVCpnf+Pnf did indeed lower the respiration rate of the macrophage, while heat denatured or empty PVC pnf nanosyringes had no strong adverse effect. Nevertheless, control cells with no sample addition still showed the best respiration rates. The effects on macrophage were correlated with insect injection toxicity assays. In this case the two PVCpnf+Pnf preparations showed lethality to over half the insect cohort, while the heat denatured and empty PVCpnf injected insects all remained healthy.

EXAMPLE 3

Demonstrating that a leader sequence is responsible for payload packaging into PVC Needle Complexes

Surprisingly, the inventors have found that the provision of a‘leader’ peptide sequence, preferably on the N-terminus of a payload (toxin) protein, can direct the payload to the PVC complex and allow for (e.g. trigger) the packaging of the payload into the PVC Needle Complex. The inventors have demonstrated that amino acid residues 1-50 of a PVC effector protein is/ comprises a leader sequence.

To demonstrate this, an expression construct (overexpression in chromosomally engineered P. luminescens TT01) was prepared, in which the leader sequence (the N-terminal amino acid residues 1-50) was ablated such that the payload expressed by Plu1649 (referred to as “hvnA” in the figure, and having a sequence of SEQ ID NO.: 46) (Myc-tagged for detection purposes) was absent a leader sequence (see Figure 8A - construct 1). Following expression (of both the payload and PVC Needle Complex) and isolation of the PVC Needle Complex (and running the components thereof, which includes any packaged payload, on a gel), no (Myc-tagged) Plu1649 (“hvnA”) was detectable within the PVC Needle Complex via western blot analysis, demonstrating that the payload (absent the leader sequence) was not packaged into the complex (see Figure 8B, lane 1), and thus not associated with the isolated complex. Successful packaging was seen, however, for hvnA which did retain the leader sequence, see lane 2 (note that the band appears weak, due to the relative intensity of the band of lane 3).

Surprisingly, hvnA having a leader sequence from a different (non-hvnA) PVC effector (i.e. corresponding to the N-terminal amino acid residues 1-50 from the PAU_03332 effector) (see Figure 8A, construct 3) was correctly packaged into the complex and remained associated with the PVC Needle Complex upon isolation/ purification, as demonstrated by Western blot detection of the Myc-tagged hvnA (see Figure 8B, lane 3). Thus, the inventors have demonstrated the surprising ability of the ‘PAU_03332’ leader sequence (which is associated with a different payload, Pnf) for packaging of a hvnA payload (i.e. a different payload to that of PAU_03332). This demonstrates the ability to swap the leader sequences of the PVC effector, allowing use of an optimal leader sequence (having optimal packaging activity) for packaging.

EXAMPLE 4

4.1 Demonstrating that a leader sequence directs packaging (into PVC Needle Complexes) of atypical/ exogenous payloads

In an unexpected technical effect of the invention, the inventors have found that fusing a leader sequence described herein to exogenous (non -Photorhabdus) polypeptides (preferably at the N-terminus) allows for packaging of said exogenous polypeptides into a PVC Needle Complex, with the exogenous polypeptides remaining associated with the PVC Needle Complex upon isolation/ purification. By way of example, see Figure 8B (lane 4) demonstrating that a non -Photorhabdus‘Myc’ polypeptide (<10kDa) is packaged into the PVC Needle Complex when fused to a leader sequence, and lane 6, demonstrating a much larger non- Photorhabdus ‘Cre-recombinase’ polypeptide (>32kDa) can likewise be appropriately packaged into PVC Needle Complex when fused to a leader polypeptide of the invention.

The inventors performed in-depth analysis of the size (e.g. polypeptide length) and structure of the various natural PVC effector payloads encoded by Photorhabdus (see Figure 7), which show a wide variety of different lengths and structure, demonstrating that the applicability of the PVC Needle Complex (nanosyringe) delivery system of the present invention is not limited by the size or properties of the payload protein of interest. To summarise, there is no requirement for particular secondary structure, biophysical property, or length of cargoes, confirming that that the PVC Needle Complex (nanosyringe) chassis can be utilised as a versatile multifunctional delivery vehicle.

Furthermore, this packaging of exogenous polypeptides is independent of the chosen PVC Needle Complex chassis e.g. has been accomplished using both a“PVCpnf” chassis (SEQ ID NO.: 93) and a“PVC U4” (e.g. PVCunit4) chassis (endogenous to the Photorhabdus overexpression strain) ( see Figure 10A). Importantly, the inventors have demonstrated that packaging exogenous payloads in either chassis does not affect morphology of the PVC Needle Complexes, ensuring they are not assembled aberrantly (see Figure 10B).

In data shown herein, payload proteins are supplied in‘trans’ on separate genetic constructs. The leader sequences are surprisingly sufficient to target these separately synthesised proteins for packaging into the PVC Needle Complex vehicle (see Figure 11). This applies in E. coli when the chassis (PVC) genes themselves are also present on a plasmid, as well as with chassis genes being integrated into the chromosome, as is the case in Photorhabdus, the host organism.

Further exemplification of trans-packaging of high levels of the Cre site specific recombinase into the PVCpnf nanosyringe expressed in E. coli is provided in Figure 10(C). In more detail, the inventors constructed a laboratory E. coli expression strain harbouring (i) the arabinose inducible expression plasmid for the P. asymbiotica ATCC43949 PVCpnf operon e.g. of SEQ ID NO.: 93 (with a C-terminal FLAG tag on Pvc16, e.g. immediately 3’ to SEQ ID NO.: 93) and (ii) a second IPTG inducible expression plasmid containing the Cre recombinase with a N-terminal fusion of the natural Pnf effector 50 amino acid leader sequence (e.g. leader of SEQ ID NO.: 78) and a C-terminal Myc-TAG epitope. The PVC operon and effector (Cre + leader sequence) were co-induced for 24 hours and the chimeric nanosyringes purified. Western blot analysis was used to confirm the presence of the FLAG-tagged Pvc16 cap protein (and therefore the nanosyringe chassis) and the trans-packaged Myc-tagged Cre recombinase post purification.

4.2 Trans-packaging using additional leaders demonstrating functionality of a larger, diverse sequence space

Complementing the data outlined in Example 3, Figure 10D demonstrates (trans-) packaging of Cre into PVC pnf (in E. coli) using the following four additional leader sequences (thus demonstrating the functionality of a larger sequence space): Lane 1 : the leader of PAU_02096 (leader sequence = SEQ ID NO.: 71), experiment referred to as“NanoSyringe + lopt50::cre::Myc in Figure 10D;

Lane 2: the leader of PAK_02075 (leader sequence = SEQ ID NO.: 50), experiment referred to as“NanoSyringe + cnf50::cre::Myc in Figure 10D;

Lane 3: the leader of PAU_02009 (leader sequence = SEQ ID NO.: 68), experiment referred to as“NanoSyringe + cif50::cre::Myc in Figure 10D; and

Lane 4: the leader of PAU_02806 (leader sequence = SEQ ID NO.: 76), experiment referred to as“NanoSyringe + gog50::cre::Myc in Figure 10D.

These results also demonstrate the utility of leader sequences showing greater sequence diversity for (trans-)packaging a payload. Indeed, to provide further validation, the inventors performed a CLUSTALW sequence comparison of a panel of leader sequences to determine diversity. PVC effectors are identified as proteins encoding recognisable toxin-like domains that are encoded immediately downstream of the pvc16 structural gene. Each PVC operon can encode just a single effector, or several different effector genes in tandem array. A phylogenetic tree is shown in Figure 10E, with the identities of leader sequences exemplified herein for packaging payload proteins into the nanosyringe complexes being elaborated by either the P. asymbiotica ATCC43949 PVC pnf operon (solid arrows) or the P. luminescens TT01 PVC unit4 operons (dashed arrows) or both.

As can be seen from the tree of Figure 10E, the exemplified leader sequences are well distributed throughout and are therefore at or close to maximally sequentially diverse.

EXAMPLE 5

Tail Fibre / binding domain modification

PVC Needle Complexes are known to comprise tail fibres (see the 3D rendered PVC structure, left most asterix of the rightmost image) which are believed to allow for cell-type specific targeting of the PVC complexes. The inventors have successfully demonstrated that modification of a tail fibre region to incorporate non-natural amino acids (e.g. a substitution of an amino acid in the wild-type sequence for an alternative amino acid of the 20 standard amino acids) does not affect expression of tail fibres.

EXAMPLE 6

Demonstrating delivery of an active (exogenous) enzyme/ payload into ex vivo murine organoids with a leader seguence-packaged PVC Needle Complex

Concept: Obtaining data for the delivery of an exogenous functional enzyme to a mammalian tissue. The inventors have demonstrated the delivery of a trans-packaged bacteriophage derived recombinase protein known as“Cre” into ex vivo mouse bile duct organoids. The organoids are derived from a mouse line in which the expression of a chromosomally encoded red fluorescent protein (RFP) reporter is normally prevented by a stop signal flanked by loxP recognition sites for the Cre-recombinase. If the recombinase is present, the stop signal is recombined out and the cells then go on to express the reporter protein. The general principle behind this experimental demonstration is summarised in Figure 16A. Method: The Bile Duct organoid preparation: murine primary bile ducts were isolated and expanded as organoids in matrigel using“BD expansion media” for 12 passages following Huch et al (Regen Med. 2013 Jul;8(4):385-7. PMID: 23826690; DOI :10.2217/rme.13.39) protocol. Cells were then plated in 2D and cultured in BD expansion media. Mouse Genotype: LSL-Tom reporter in Rosa26 locus + Axin2CreRT (inducible upon 40HT treatment). Cells were cultured in uncoated polystyrene plates at a seeding density: of 10,000 cells/well. Nanosyringes were prepared as 30% volume syringe preparation in PBS + 70% culture media. Total volume of 100 pi per well. The positive control represented 500nM 40HT (in ethanol) at 1 :1000 (v/v) as positive control for the recombination. The negative control represents 1 :1000 (v/v) ethanol dilution only. Cells were seeded and grown for 48h, nanosyringes added and then cultured for another 24h before fixing (4% PFA fixation 15min RT) and staining for microscopic examination. Staining: Primary antibody Anti-RFP (1 :1000) from Rockland. Secondary Anti-Rabbit 568 (used at 1 :500 v/v). Samples were visualized on a laser-confocal microscope.

Result: Figure 16B includes representative micrographs from these experiments demonstrating signal for the RFP protein could be detected in a number of cells when treated with the Cre loaded PVCpnf nanosyringe. As these are ex vivo organoids, rather than simple cell monolayers, some stochasticity in the number of cells that are dosed is expected, and this is even observed in the positive control, which is a small molecule inducer (rather than a large protein complex). It is anticipated that, as these are organoids, there will be some level of cellular differentiation present which may alter the binding characteristics of the nanosyringes. A further interesting observation from this preliminary run, is that while information on total amounts of nanosyringes applied to the system is not yet available, the inventors demonstrate that the TAM small molecule inducer does not appear to have appreciably greater tissue penetration than the nanosyringes, suggesting their ability to distribute is not majorly hampered by their size.

Additional interpretation: To summarise, the inventors have demonstrated the ability to deliver (e.g. dose) exogenous enzymes to a cellular target. Moreover, this“nanosyringe + Cre” experiment is a promising proof of concept for a biotechnology tool/aide, by demonstrating the ability to provide a DNA change leading to a transformed cell. This experiment therefore demonstrates the use of exogenous payloads (a protein of viruses rather than bacteria), and nucleic acid modifying enzymes in particular. It is evident that the Cre enzyme is delivered in a functional manner and is capable of traversing the cellular interior to the nucleus to affect its DNA modifying changes.

EXAMPLE 7

Trans-packaging of MAD7 site specific recombinase (exogenous payload) into the PVCpnf nanosyringe expressed in E. coli

Concept: As with the Cre data (of Example 6), and other examples of packaged payloads provided herein, the inventors have demonstrated packaging of the Cas-like enzyme MAD7 into a nanosyringe via a leader sequence. This is the largest exogenous example (MAD7 = 147.9 kDa) of a payload described herein. Methods: Briefly, the chassis genes and the MAD7 gene (the latter being tagged with a C- terminal Myc tag for detection, and a leader sequence for nanosyringe incorporation described herein), were expressed (upon induction) simultaneously in E. coli. Upon harvesting and purification of the nanosyringe complex, payload packaging was probed via dot blot analysis (e,g. for detection of the Myc tag). The purification method described herein (using ultracentrifugation) can be employed to select for (e.g. exceedingly) high molecular weight protein complexes/ biological matter, enabling recovery of the nanosyringes and any cargo (payload) they carry. ‘Loose’/ unpackaged payload remains in solution and is not subject to sufficient centrifugal force and as such is lost during purification, unless contained within the much larger nanosyringe‘shell’ (that is, when successfully packaged). Successful packaging of MAD7 is demonstrated by Figure 17.

EXAMPLE 8

Trans-packaging of apoptosis inducing payloads into PVC pnf, expressed in E. coli

Using the E. coli PVCpnf leader/.payload:: Myc trans-packaging system described in Figure 10C (PVC pnf leader = SEQ ID NO.: 78), the inventors demonstrated the ability to trans package at least two pro-apoptotic human derived protein sequences or peptides (e.g. the sequences of SEQ ID NO.: 109 and SEQ ID NO.: 111). The Pnf effector protein leader sequence (e.g. SEQ ID NO.: 78) was fused to the N-terminus, and a Myc epitope tag was fused to the C-terminus. Western dot blot analysis (similar to that of Example 7) confirmed the presence of these human derived proteins in purified nanosyringes (Figure 18).

EXAMPLE 9

Demonstration of the induction of apoptosis in cultured ex vivo human cells by nanosyringe delivery of (trans-)packaged pro-apoptotic human polypeptides

A preliminary test has confirmed the ability to use the PVC pnf nanosyringe, produced in E. coli, to deliver trans-packaged human protein sequences (e.g. packaged according to Example 8) and induce apoptosis in ex vivo circulating PBMC cells from human donors. The assay is a TUNEL-stain microscopic analysis from cells exposed to the packaged nanosyringes for 20 minutes only. Results are shown in Figure 19A, demonstrating (via successful induction of apoptosis) delivery of tBid p15 fragment and BaxBH3 domain.

• tBid p15 fragment (SEQ ID NO: 109) is part of the normal human apoptosis regulation pathway. Cellular effects: a pro-apoptotic member of the Bcl-2 family. The C-terminal part of Bid (tBid) translocates to the mitochondria, where it induces the release of cytochrome c. Bid is normally cleaved by caspase 8 from its latent cytosolic full-length pro-Bid form.

• BaxBH3 (aa59-73) (SEQ ID NO: 111) is a minimal BH3 domain synthetic peptide, comprising critical 15 residues of the defined Bax BH3 domain. Cellular effects: these 15 residues contain sufficient information to bind to, and functionally antagonize, Bcl- xL and to induce specifically Bax/Bak. Appears to abrogate Bak/Bcl-2 interactions - freeing up pro-apoptosis factors.

A more detailed test of the delivery of pro-apoptotic human peptides into ex vivo Peripheral Blood Mononuclear Cells (PBMCs) is now described. The aim of this study was to investigate whether the pro-apoptotic peptide loaded PVC nanosyringes could induce apoptosis in ex vivo human Peripheral Blood Mononuclear Cells. The nanosyringes were first assessed for any immediate cell toxicity using Trypan blue dye exclusion assays and then for apoptosis response by using the TUNEL assay.

Trypan Blue Exclusion Test for cell viability: Trypan blue is a Diazo dye commonly used to selectively colour dead tissue or cells, hence, dead cells are shown as a distinctive blue colour under a microscope while live cells or tissues with intact cell membranes remain uncoloured. Since live cells are excluded from staining, this staining method is also described as a Dye Exclusion Method. Trypan blue is commonly used for assessment of tissue or cell viability. A suitable number of cells (2 X 10 5 ) were exposed to the nanosyringes and empty nanosyringe for 20 minutes. A suitable volume of cells (30mI_) were added to an equal volume of 0.4% Trypan blue and the number of viable (unstained) and dead (stained) cells counted using a hemocytometer. Each compound was tested at 3 concentrations. Blood cells from two independent human donors was tested for each compound at each concentration and each sample was tested in duplicate.

Treatment and preparation of cells for microscopy: The viability of Peripheral Blood Mononuclear Cells (PBMCs) from two independent healthy human donors was determined after 20-minute treatment with the two chimeric nanosyringes (e.g. loaded with the exogenous pro-apoptotic peptides) at 3 test concentrations in 2 independent tests. PBMCs were harvested by centrifugation and resuspended in media at 1 X 10 6 cells/ml. Cells were fixed in 2.5% formalin and incubated for 20 mins at room temperature. Poly-L-lysine coated slides were prepared by spraying with 70% ethanol and leaving to air dry. Cells were centrifuged for 30 seconds. Supernatant was removed and cells were resuspended in 200mI dH 2 0. 5mI of cell suspension was added to each slide/fixation. Two fixations were performed per slide to allow staining to be performed in duplicate. Cell suspension was left to air dry.

Results of PBMC cell viability assay: The Trypan blue viability assays confirmed that the PVC preparations were not immediately toxic in themselves to PBMCs taken from healthy human donors (Table 2). Nanosyringe treatment showed > 60% viability indicating low toxicity at maximum dose concentration (Table 2). The inventors then moved on to test the ability of the chimeric nanosyringes to induce apoptosis.

Table 2. Viability of Peripheral Blood Mononuclear Cells (PBMCs) from two independent human blood donors after exposure to each compound for 20 minutes at 3 test concentrations (v/v dilutions). PBMC controls are untreated.

Testing for chimeric nanosyringe induced apoptosis using the TUNEL assay: The

TUNEL assay was then used to identify apoptotic nuclei in single cell suspensions fixed on slides. In the assay Terminal deoxynucleotidyl Transferase (TdT) binds to the exposed 3’-OH ends of DNA fragments which are generated in response to apoptotic signal factors. This in turn catalyses the addition of biotin-labelled deoxynucleotides which can be detected using a streptavidin-horseradish peroxide (HRP) conjugate. Diamineobenzidine (DAB) reacts with the HRP-labelled sample to generate an insoluble brown substrate at the site of DNA fragmentation. Methyl green counterstaining enables the visualisation of normal and apoptotic cells.

The induction of apoptosis following exposure of human PMBCs to the nanosyringes was determined. A TUNEL assay kit (Abeam) was used for detection of apoptotic cells. The assay was performed following the manufacturer’s instructions. Briefly, slides were covered with 100pL proteinase K solution or 5 minutes, slides were rinsed with 1x TRIS buffer saline (TBS). The treatment of nanosyringes or the DNase I positive kit control was performed for 20 minutes at room temperature. Slides were rinsed with TBS. Slides were then incubated with TdT equilibrium buffer for 30 minutes before the addition of TdT labelling reaction mix. Slides were incubated at 37° for 19 minutes. Slides were then washed with TBS before application of the stop buffer and incubation at room temperature for 5 minutes. Slides were washed again with TBS before addition of the blocking buffer for 10 minutes at room temperature. Detection was performed by application of the conjugate to the samples for 30 minutes. Slides were rinsed with TBS before application of the DAB solution for 15 minutes. Slides were rinsed with dH 2 0 followed by counterstaining with methyl green. Slides were dehydrated in 100% ethanol followed by xylene and mounted with a glass cover slip. All staining was performed in duplicate. An apoptosis endpoint, indicative of positive staining in the apoptosis detection assay is represented by dark brown (DAB) signal. Lighter shades of brown and/or shades of blue/green to green/brown indicate a non-reactive negative cell for apoptosis.

Analysis was performed by selecting 5 random sections of cells on the slide, positive stained cells (dark drown) and negative stain cells (blue or light brown) were counted and the percentage of cells showing apoptotic bodies was determined.

To generate a positive control, slides were treated with 1 pg/mI DNase I (the kit positive control) for 20 minutes at room temperature following the proteinase K treatment step detailed below. The DNase I treatment fragments DNA in normal cells to generate free 3ΌH groups identical to those generated during apoptosis. A negative control was generated by substituting DNase I with dH 2 0 in the reaction mix during the treatment stage.

Results of PMBC apoptosis assays: TUNEL staining using PBMCs was performed following treatment with the intact tBID and Bax loaded nanosyringes, with appropriate positive and negative kit controls. Treatment was performed for 20 mins to determine if the nanosyringes elicited an apoptotic signal. A positive control (DNase I treatment) and negative control (no DNase I treatment) was included. Results showed both nanosyringes, containing either tBID or Bax, showed strong apoptotic signals (89% and 78% positive, respectively) on the PBMCs. The positive control showed a strong apoptotic signal (79%), whereas the negative control showed no apoptotic signal (100% negative). Also observed was a significant loss of the numbers of attached cells in the nanosyringe treated samples, presumably indicative of a rapid and comprehensive apoptosis response, and a failure to be retained after washing. Note this effect is even more pronounced than the kit positive control suggesting a more rapid response. Representative micrographs are shown in Figure 19B.

Conclusion: It is concluded that the tBID and Bax loaded nanosyringes are able to rapidly induce extensive apoptosis in human Peripheral Blood Mononuclear Cells. Furthermore, Trypan Blue dye exclusion assays have confirmed that these chimeric nanosyringes do not cause rapid lethal lysis or extensive membrane damage to the cells.

EXAMPLE 10

Exemplification of practical utility of leader sequences and PVC Needle Complexes -

Intracellular delivery of atypical {non-Photorhabdus ) payload

(1) An anti-MDM (p53 inhibitor) antibody is linked to a leader sequence described herein, and expressed together with a PVC Needle Complex for packaging therein. Isolated PVC Needle Complex (comprising the antibody payload) is contacted with a tumour for intracellular delivery of the antibody (said tumour cells being characterised by having high MDM- suppression of p53 activity for MDM inhibition). The tumour is suppressed by the activity of the anti-MDM antibody.

(2) A PVC Needle Complex is used to (intracellularly) deliver anti-tumour peptide vaccine to activate the MHC-I dependent cytotoxic T-cell lymphocyte (CTL) response. A tyrosinase- related protein 2 (TRP2) peptide vaccine is delivered for enhancing cross-presentation to CTLs occurs and antitumor effects against TRP2-expressing tumours. The tumour is suppressed by the activity of the peptide vaccine.

(3) A PVC Needle Complex is used to (intracellularly) deliver a nuclear factor-kB inhibitors (which are used for the control of inflammatory disorders, such as rheumatoid arthritis) to a cell. The cell subsequently demonstrates a reduced expression of pro-inflammatory cytokines.

(4) A PVC Needle Complex is used to (intracellularly) deliver a T3SS payload (which inhibits NF-kB and MAPK pathways). This is completed with an isolated (purified) PVC Needle Complex, without any need for the PVC Needle Complex to remain associated with the bacterial cell from which it derives.

(5) A PVC Needle Complex is used to (intracellularly) deliver, to a cell, anti-apoptotic peptides including BH4, the Bcl-xL-protein, and/or a peptide inhibitor of c-Jun N-terminal kinase (which can protect the heart and brain against ischemic injuries (a restriction in blood supply to tissues, causing a shortage of oxygen and glucose needed for cellular metabolism)). For example, Jun-kinase inhibition via a 20 amino-acid binding motif of the JUN kinase is sufficient. A release of e.g. cytochrome c in the cell is inhibited. (6) A PVC Needle Complex is used to (intracellularly) deliver nicotinamide adenine dinucleotide quinone internal oxidoreductase (Ndi1), the single-subunit yeast analog of complex I (which provides significant cardioprotective effects) to complex l-deficient mutant cells. The Ndi1 protein is correctly targeted to the matrix side of the inner mitochondrial membranes, and restores the NADH oxidase activity to the complex l-deficient cells.

(7) A PVC Needle Complex is used to deliver one of two of the essential subunits of the PHOX complex (which are used in enzyme replacement therapy to restore production of ROS in chronic granulomatous disease) to a chronic granulomatous disease cell. A restoration in production of ROS is observed.

(8) A PVC Needle Complex is used to (intracellularly) deliver (e.g. intramuscularly) a myotubularin (which is used for improving local and distant muscle performance in X-linked myotubular myopathy patients). Myotubularin- dephosphorylation of phosphatidylinositol 3- phosphate and phosphatidylinositol (3,5)-bi-phosphate is observed.

(9) A PVC Needle Complex is used to (intracellularly) deliver a recombinase“Cre” (which is capable of excising defined genetic cassettes) into a mouse cell line, in which the genome has loxP recombination sites flanking a stop signal upstream of an mCherry gene. The Cre payload excises the recombination sites, and removes the stop signal, allowing for expression of the mCherry gene in the cell.

(10) A PVC Needle Complex is used to (intracellularly) deliver a ~15kDa nanobody (antibody fragment) with affinity for an intracellular component. A nanobody-intracellular complex is detected.

(11) A PVC Needle Complex is used to intracellularly deliver (e.g. into insect cells) an atypical (non -Photorhabdus) polypeptide toxin for insect crop pests and animal parasites. Suppression of the pests is observed.

(12) A PVC Needle Complex is used to (intracellularly) deliver a nuclease (e.g. Cas9 and/or Mad7) into a target cell comprising a guide RNA. The nuclease performs site-directed gene inactivation

All publications mentioned in the above specification are herein incorporated by reference. Various modifications and variations of the described methods and system of the present invention will be apparent to those skilled in the art without departing from the scope and spirit of the present invention. Although the present invention has been described in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention which are obvious to those skilled in biochemistry and biotechnology or related fields are intended to be within the scope of the following claims. SEQUENCES

Where an initial Met amino acid residue or a corresponding initial codon is indicated in any of the following SEQ ID NOs, said residue/codon may be optional.

SEQ ID NO: 1 (PAK 19851

MMREYSNEDDFIKEKTNLVKSENVEADNYLETEYLTYLAKLIGMTERENHHLNSIKLIDD IIELHNDRKGNKLL

WNDNWQDKIIDRDLQSIFKKIDEMVSEFGGLEAYKDIVGENPYDPTEPVCGYSAQNI FKLMTEGEYAVDPVK

MAKTGKINGNQFAEKLEHLNSSNNYVALINDHRLGHMFLVDIPSTNRERVGYIYQSD LGDGALPALKIADWL

KSRGKESINVNKLKKFLNDEFTMLPDNEQKGLIAEIFDLNKDIDSVKSGKIKKDKAV DIYLREYDINDFISNIEKL

KTKLA

SEQ ID NO: 2 (PAK 1987)

MFQNRIRNEKTTQSGKGKTLDRMTDSLYLEIPNVEAVTLAYQKLTSKYRKFDNKTKLILD SSDEFSQLKSEK

QRKGFSKSGLKNNGVSDRKFIYTKNALKNFAAHAGYEHNGHYEDEFVNFKDNNKNLA KGKLFPGISLIERR

KLSIVKNKEGKWEHKETDEAEAYKVTDIEKFISGVRSMYLQGNTFLHAKTEALIRKH IANNENILPTMAGIAGL

HAEVQALNNLFISGDKGTKKREKWKYIRNMLESSIFTQRLTTGQAGKDFAACHNCSG ILSSPVNVITGKVES

AGDNFLSTLSRYKTSQESPI

SEQ ID NO: 3 (PAK 1988)

MEREYSEKQKNPSKLSRKTAISERIAALERSGLSNSNQPVPQFARPYTSNRPVVNINPGR SSIAVATANSTS

PVNIPTPAPASPDKLLPSTSCDTTSSILIVGKYNLELTSQGKIVVFRGDNRTPEQIV AAGGFYPWSKQDVGKI

KKELIDEFIEIGPSAHMMGHVRSPNKNYVSTGMNMDSGGFGEQSNYLYKMEIPGLKP QDMNERTLGEKIRQ

DKRGINYPHFLMSHLTLAESEFVAMIPARSEELTFITPIPLSYITSYRKRGTNTWLP MPLKK

SEQ ID NO: 4 (PAK 2075)

MSNYEYDIVTQHDTYQIKDNEYTVVNGKYWQYEQEGNKNNNKVSISLMKENQNDPVWITS DIKEISLYIIENL

FSYHKFSAELQHTLKNAVKAVFNEYSEIKYSELLHNINNIFNLFFIKIYNTSDIDTA INILTAKIEIYDKLEKINQDK

TDSNNTNVDIWEELGINAEEPLLKIYRQAFSTGDIDDEVYSDALLTFMSDGNLELGD KEKSDYNQRIKDKTDL

FESYKKGIEKVASLITTNNINPGIPITYPETEKSINIGDDLLLAQLAKEEIALKKQN RTEYSQQDIFELQTLQAAK

YHLLILSSLGALLYQIAPNVEKMTKGHGDYRDIIFSQEQAESLFKKHNIQYDTNHVL SQESKHIEMEGCIILTAA

IIYRMRKENATVEQALNYSTLETIKLFENDKKKLNPFNTNNVKPAGYFSFIDFKKRD KFDSQYNFNEQFNVYK

NKYSHYESISFSKLILSSPAAQLTAEEIVNPPEEAFLYSVEQGMGNVAMIKMYQGNW LVISTIQGGVKAKKYS

RQQVDSNPTLRAMSKPNALFLIERKMETGMGILMPNMMVNTGKRLFPTGYERAKTLS GFAETSRYKNSYN

AFWNDYYGITSGMNVGISFTGSPKFNFYKEENLLSVTATIIQQGLNDIAIKSKQALD ITSGWHIAATILIPFYNVI

YKSTTDSEYELTGEDIGSIVFDTANVLLVVATLGMSLTESMAAKVTQTTLRLRQAGL TGRALITAVVRTLPEH

GIITLRQSSGIILGGLIDLIEPLPIRSTLTLTYRGVINAVGAMRNSIKLEKSFADIF GKSTRGLGKLKNEWKVSNL

PLEEIVPHSNGGEIYKGIYSIRPTNPETAVKQNFYIKEAGANYQVKWDDANHTWRVV NPTYPEQFSYWPAV

KLDKNGHWVTHADVSNKFLILEQSKRIDQELEAAHSNINNDNILDAFIHINTAFKDC ERYDIDKLSDITDTLTHF

FEKSLKPGDKKAIFSTEIMSIQQAWIREVILPLQNNSSISIEKINAIKTELPYLLRK TFPIESQLPNQLVANKIALAI

EEIPNTRIPKYTSGNISKTVQYTSLLENNHVDIPPVGITITGNDTFINQVTRVLSEI DEIPSGNIVIQELEKQGLNI

QPPTMNDIVREKNGQFYANNSAGSHIAFDPENHLIGTEEKLIDEPWRTREPAIALYH EMLHIYYNRYPTWFT

SIDNKVIDQKVSGGFSLLEESRIVGTKYYVNDKNTLFDFNDSDYLLENNSALLTENR FRAEYAIFKNKSEYVIR

PYSGKGDSQIPLTKTKININESHRNVMGVGSGKPEKMPNESATDYRNRVREWRKANK QPEADIGTGDMRK

TKAEARVKLLKENYPQFEPQKIELGGAFQLWTVPNEPANKLMLSSHGYFFSDSAATQ VPAGKTIQFLGPHG

KTLLEAPENPLYSPFDVTLGNSGFTVQPYATIESGNKAGLGSVKIGDKTFTVNDIQN IATDDVENYLLATGVE

ANASNHGKVRNYGIKYYEKMPDEEVKAAIWKNRADETSTHKYDALLVSPEAGNRKKL SDIFALMKTDERMS

KYDEITFVACREELNRINMKSIHDTGLGGGYEPKLEPTVILSRRRREATFTADGAII YSIIAVNLHHNFITEEIVG

IAPFLFINN

SEQ ID NO: 5 (PAK 2077)

MEHEYNEKEKQRNSAIKLNDAIRNNEENMDMTSPLELNFQNTNRKSRGLRERFSATLQRN LPGHSMLDRE

LTTDGQKNQESRFSPGMIMDRLMHFGVRTRLGKVRNSASKYGGQVTFKFAQTKGTFL DQIMKHKDTSGGV

CESISAHWISAHAKGDSIFNQLYVGGKKGKFHIDTLFSIKQLQMDGYLDDEQSTMTE YWLGTQGMQPNIQR

NDDTDEHSSKVVGETGNRGTKDLLHAILDTGDKGSGYKKISFLGKMAGHTVAAYVDD QKGVTFFDPNFGE

FSFPDKTSFSHWFTDDFWPKSWYSLEIGLGQEFEVFNYAPEAP

SEQ ID NO: 6 (PAK 2892)

MPNKKYSENTHQGKKPLMKSEANNEHDIQNSSLGIGLDLNSMMGNSSTSLSHIQDYSFWK ENISEYYKWM

VVVKAHLKQLDWTLKSMDSPESAGTNIAKNTGTTALQTLLNTGGSIAGAAIGGAIGS AIAPGVGTIAGMGIGA

LAGTGLNYLNDTVIEKLNEKLEIAYPYPKTRNMIFDINNYDKNPIIKAIKKKTNKDN LKVTAGSSLTSQLVGKVT

SPIKFPAYKLADLAIALAGLSSDKARHILDFTDSIREVLNESHSDAVAFMRKNYGDN AMGLAGLSSRIK

SEQ ID NO: 7 (PAK 2893)

MEREYSEKEKHKKRPIQLRNSIEQHEEETANNSLGLGLDLNQATNPPKVPKDNYNEENGD LFYGLANQRG

RYIKSVNPNFDPDKINSSPMIIDVYNNNVSNTILNKYPLDKLVKLSGNPQKYANNIK VENSLQQDVASSKRGW

YPLWNDYFKTGNENKKFNIADIYKETRNQYGSDYYHTWHTPTGAAPKLLWKRGSKLG IEMAASNEKTKIHF VLDGLNIQEVVNKQKGSTPLEQGRGESITASELRYAYRNRERLAGKIHFYENDQETVAPW EKSPELWQNYI

PKNKNQNESSTPQRNNGTLYRLGGPFRKLRASLRKRS

SEQ ID NO: 8 (PAK 2894)

MMEHEYSKEEEKKRQQSKPNNATHDESNLPLELEKHFNARTPATAHSKWFTYENDTEVEL TTERIKEIFSN

KQPKIIIAGDGHNKPPFQYAKNIPDVNSSFDAGTLQLYIEATDEQINENNPEYIPKE FMAKPGLFTNKNRRAEI

VGWEDSELSNAMKEMFELSDKSTREKLTPEETSSFYKLHETAIRHFFRPEFNQLRDE FFEILAKAGSNRELD

KIALEMIGFTSGTWRDEYINPTLAEKIAKHAAEKENHTFVVSIGDAHLSENPMQEYL NKRRNGGEFKHQIIFT

RDKRPILPDNMKTGNKNS

SEQ ID NO: 9 (PAK 3525)

MLKYANPQAVPTQRTKNTAKKPSSSSSFDGQLELSNGEWSKHSEMGLKRGGLINSIRRRI ARNGNIGRFNE

LIDSEAKKWPSEPVDKNIHMIWIGTRNISEKNIKLSIDTAKKNPDYNTSIIYDSGIS GHEGARNFMLEKFEGSN

VNXSLAFPKGIGVMREYAPEAGKATAFPNTPIAVTKNNPIINKTLDLAVGNYQRGEK NVLKLAGPDVFTQALY

QEIPGLNSKVLNAQLDQFELAKRQALGLPLEKPKSFADEKLTSVEKEKINRPYQSMR GLSGHVMNGADHS

WAVDTEVLGH

SEQ ID NO: 10 (PAT 00148)

MMREYSNEDDCTKEKTNLVKSENVEADNYLEMEHLTYLAKLISMTERENHHLNSIKLIDD IIELHNDRKGNKL

LWNDNWQDKIIDRDLQSIFKKIDEMVSEFGGLEAYKDIVGESPYDPTEPVCGYSAQN IFKLMTEGEYAVDPV

KMAKTGKINGNQFAEKLEHLNSSNNYVALINDHRLGHMFLVDIPSTNRERVGYIYQS DLGDGALPALKIADW

LKSRGKESINVNKLKKFLNDEFTMLPENEQKGLIAEIFDLNKDIDSVKSGKIKKDKA VDIYLREYDINDFISNVE

KLKTKLA

SEQ ID NO: 11 (PAT 00149)

MIFKMLNLAVFYLLGNIFHYLICQKFICYFCSVLKSVTMFLTKVAVQIALYLNILPTMAG IAGLHAEVQALNNI.FI

SGDRGTEKRENWKYIRNMLESTIFTQRLTAGQAGKDFAACHNCSGILSSPVNVITGK VESAGGNFFINIISI

SEQ ID NO: 12 (PAT 00150)

MEREYSEKPKNLSQLSRKTAISERRAMFERNASSNNEQPVPQFARSYTSNRSVVNINPGR SSIAVVTANST

SPVNISTPAAASPDKLLPSTSCDTTSSTLTVGKYKLELTSQGKVVVFRGDNRTPEQI VAAGGFGEQSNYLYK

MEIPGLKPQDMNERTLGEKIRQDSRGN

SEQ ID NO: 13 (PAT 00152)

MKYDPRLRTWVEDDFDYEKNFKKQTDYINYKDLEKQLKENVDYYALLDENEAIIFLKELG CDIKSFLNDTAFP

VTDVLSNFAGNIKDALGVFKVAKNFKPINIGIFTYIINELKGKGIKAIEYLGKNGER YIKLTDRPGIRKYLNATRY

LINNKKIMEVGIGSVAMEGSIVKGARFGVIYSAAYRSVELMFKSEYDLTNFFVNLSM DMAKIIVATIIAKSTVAA

ATSFVVTAALSTTAIAIGVFIIGALVVWGLMWLDDEFKISETIIRRLKEHKVKTPIS TYHSDQIFNAWGRYYRG

SEQ ID NO: 14 (PAT 02308)

MPNKKHSENTHQGRKPLIKSEANNEHDIENSSLGIGLDLNSTIGNNSASLSQIQDYSFWK ENISEYYKWMVV

VKAHLKQLDWTLKSMDSSESAGTNIAKNIGTTALQTLLNTGGSIAGGAIGGAIGSAI APGVGTIAGMGIGALA

GTGLNYLNDTVIEKLNEKLEIAYPYPKTRNMIFDINNYDKNPIIKAIKKKTNKDNLK VTAGSSLTSQLVGKVTSP

IKFPAYKLSDLAISHNRALAGLSSDKARHILDFTDSIREVLNESHSDAVAFMRKNYG DNAMGLSGLSSRIKGE

KLTLATLARTRNKIENRINSINKQTLKLSSKNSNE

SEQ ID NO: 15 (PAT 02309)

MEREYSEKEKHKKRPIQLRNSIEQHEEETANNSLGLGLDLNQATNPPKVPKDNYNEENGD LFYGLATQRGR

YIKSVNPNFDPDKINSSPMIIDVYNNNVSNTILNKYPLDKLVKLSGNPQKYANNIKV ENNLQQDVASSKRGWY

PLWNDYFKIGNENKKFNIADIYKETRNQYGSDYYHTWHTPTGAAPKLLWKRGSKLGI EMAASNEKTKIHFVL

DGLNIQEVVNKQKGSTPLEQGRGESITASELRYAYRNRERLAGKIHFYENDQETVAP WEKSPELWQNYIPK

NKNQNESSTPQRNNGALYRLGGPFRKLRASLRKRS

SEQ ID NO: 16 (PAT 02310)

MMEHEYSKEEEKKRQQSKPNNATHDESNLPLELEKHSNARTSATAYSKWFTYENDMEVEL TTERVREIFS

NKQPKIIIAGDGHNKPPFQYTKNIPDVNSSFDAGTLQLYIEATDEQINENNPEYIPK EFMAKPGLFTNKNRRA

EIVGWEDSELSNAMKEMFELSDKSTREKLTPEETSSFYKLHETAIRHFFRPEFNQLR DEFFEILAKAGSNRE

LDKIALEMIGFTSGTWRDEYINPTLAEKIAKHAAEKENHTFVVSIGDAHLSENPMQE YLNKRRNGGEFKHQII

FTRDKRPILPDNMKTGKKNS

SEQ ID NO: 17 (PAT 02956)

MSNYEYDIVTQHDTYQIKDNEYTVVNGKYWQYEQEGNKNNNKISISLMKDNQNDPVWITS DIKEISLYIIENL

FSYHKFSAELQHTLKNAVKAVFNEYSEIKYSELLHNINNIFNLFFIKTYNTSDINTA INILTAKIEIYDKLEKINQD

KTDLNNTKVDIWEELGINAEEPLLKIYRQAFSTGDIDDEVYSDALLTFMSDGNLKLG DKEKSDYNQRIKDKTD

LFESYKKGIEKVASLITTNNINPGIPITYPETEKSINIGDDLLLAQLAKEEIALKKQ NRTEYSQQDIFELQTLQAA

KYHLLILSSLGALLYQIAPNVEKMTKGHGDYRDIIFSQEQAESLFKKHNIQYDTNHV LSQESKHIEMEGCIILTA

AIIYRMRKENATVEQALNYSTLETIKLFENDKKKLNPFNTNNVKPAGYFSFIDFKKR DKFDSQYNFNEQFNVY

KNKYSHYESISFSKLILSSPAAQLTAEEIVNPPEETFLYSVEQGMGNVAMIKMYQGN WLVVSTIQGGVKARK

YSQQQVDSQPTLRAMSRPNALFLIERKIMIGIGIFMENQIVNTGKRLFPTGYERAKT LSGFAETSRYKNSYNA

FWNDYYGITSGMNVGISFTGSPKFNFYKEENLLSVTATIIQQGLNDIAIKSKQALDI TSGWHIAATILIPFYNVIY

KSTTDSEYELTGEDIGSIVFDTANVLLVVATLGMSLTESMAAKVTQTTLRLRQAGLT GRALITAVVRTLPEHGI

ITLRQSSGIILGGLIDLIEPLPIRSTLTLTYRGVISAVGAMRNSIKLEKSFADIFGK STRGLGKLKHEWKVSNLPL EEIVPHSNGGEIYKGIYSIRHTNPETAVKQNFYIKEAGANYQVKWDDANHTWRVVNPTYP EQFSYWPAVKL

DKNGHWVTHADISNKFLILEKSKRIDQELEAAHSNINNDNILDAFIHINTAFKDCER YDIDKLSDITDTLTHFFE

KSLKPGDKKAIFSTEIMSIQQAWIREVILPLQNNSSISIEKINAIKTELPYLLRKTF PIESQLPNQLVANKIALAIEE

IPNTRIPKYTSGNISKTVQYTSLLENNHVDIPPVGITITGNDTFINQVTRVLSEIDE IPSGNIVIQELEKQGLNIQP

PTMNDIVREKNGQFYANNSAGSHIAFDPENHLIGTEEKLIDEPWRTREPAIALYHEM LHIYYNRYPTWFTSID

NKVIDQKVSGGFSLLEESRIVGTKYYVNDKDTLFDFNDSDYLLENNSALLTENRFRA EYAIFKNKSEYVIRPY

SGKGDSQIPLTKTKININESHRNVMGVGSGKPEKMPNESATDYRNRVREWRKANKQP EADIGTGDMRKTK

AEARVKLLKENYPQFEPQKIELGGAFQLWTVPNEPANKLMLSSHGYFFSDSAATQVP AGKTIQFLGPHGKT

LLEAPENPLNSPFDVTLGNSGFTVQPYATIESGNKAGLGSVKIGDKTFTVNDIQNIA TDDVENYLLATGVEAN

ASNHGKVRNYGIKYYEKMPDEEVKAAIWKNRADETSTHKYDALLVSPEAGNRKKLSD IFALMKTDERMSKY

DEITFVACREELNRINMKSIHDTGLGGGYEPKLEPTVILSRRRREATFTADGAIIYS IIAVNLHHNFITEEIVGIA

PFLFIDN

SEQ ID NO: 18 (PAT 02957)

MEHEYNEKEKQRNSAIKLNDAIRNNEENMDMTSPLELNSQNTNRKSRGLRERFSATLQRN LPGHSMLDRE

LTTDGQKNQESRFSPGMIMDRLMHFGVRTRLGKVRNSASKYGGQVTFKFAQTKGTFL DQIMKHKDTSGGV

CESISAHWISAHAKGDSIFNQLYVGGQKGKFHIDTLFSIKQLQMDGYLDDEQSTMTE YWLGTQGMQPNIQR

NDDTDEHSSKVVGETGTKGTKDLLHAILDTGDKGSGYKKISFLGKMAGHTVAAYVDD QKGVTFFDPNFGEF

SFPDKTSFSHWFTDDFWPKSWYSLEIGLGQEFEVFNYAPKEP

SEQ ID NO: 19 (PAT 03171 )

MFKYDTSEKMAKFGKGKTSDGMLLDTLYLEIPDEKAVMSAYKSQILDELRNFSEKTHSFF SGKKPLYSKKYL

ANLAAHAGYVHVTDYNSIGNYKDGFVNFKDNSRNLAEGKLFPGIRLIKRPKLSIVRD KETERWKKQESDEAD

AYEITDIESFISGVRDMYSRANVDLHPVIESLIRNHIVNNDHVLPTMAGIAGLHAEV QALNNLLILADGRAGKIV

GGRKIEEYMQDMLKSFIFTQRLTTKQAGNDFAACHNCSGILSVPANVITGKVASAGS NFSLILSRYKNSQES

PI

SEQ ID NO: 20 (PAT 03172)

MLKHANPQTVSTQRTKSTAKKPSSSSSFDRQFELSNSENQPGEGNKDWTIKGWRQRFADR SLNKGHISPL

MNKGLLVGSEEALINVPVVAHRYDSSHQLTDAGPLKADSHSNNLDPFYGVVTGFRGD QVTSSESGSGSIG

GHWGKNTLDSNITGINVVNGASGTVGIRIALKDIQHGAPVIVTSGALSGCTMVYAVK NGYFFAYHTGQKPGD

KEWKTGRQGVVATYRSHQALSPDSEPMAVGEQNNDLVNIFASYDQGIITYMGKPGVI IDNTAENVGVFNYD

EVKLEKPDIRAGYSYALLAKDDKGKVNVKVLSEDVIVPLGNKGKTIKAINSLKKRLL

SEQ ID NO: 21 (PAT 03177)

MPRYANYQINPKQNTKNSHGKSSSSNFSSGYFSSSNNSLDDSLIRQQVKREFIWEGHMKE IEEASRLGNFA

VSFRAAGGPTLRALGKGAAAKGHDILEKTIKPGSINKAYPKDEASNVIKKVQEAGIE GYVGHWDKKTGRLLGI

YMSSGHGLSDEQVNGKIYPIDLNNLEASLSALKTKENWAALPFTGDYDMHDMISFTG QPHSVPSNSSEERK

IIDRINRLVARSDPNRPFGDIEHNVIRHGAQVSYPAFAMDKEKEEIKKHGGIVKAVA EPGEFPVAIVSKGKWTI

ANNIDELNQFYNSIGAKMKVSWKPGAENPGFVSNPQRPGMARFSRKR

SEQ ID NO: 22 (PAU 02009)

MMREYSKEDDCVKEKTNLAESENVEADNYLEMDCLNYLAKLNGMPERKDHSLNSTKLIDD IIKLHNDRKGN

KLLWNDNWQDKIIDRDLESIFKKIDEMVSEFGGIEIYKDIVGENPYDPTEPVCGYSA QNIFKLMTEGEHAVDP

VKMAQTGKINGNEFAEKLEQLNSSNNYVALINDHRLGHMFLVDIPSTNREKVGYIYQ SDLGDGALPALKIAD

WLKSRGKESINVNKLKKFLSNEFTMLSESEQKELIAEIFDINKDIANVKLGKIKKDK AVDVYLREYDLNDFISNI

EKLKTKLV

SEQ ID NO: 23 (PAU 02010)

MPIIGHKEDLIRTERSSVDLTRSSNNRQTDNLELNIPQHKRDNKDIEHAVIYGFSQHRGP EMQKAFADNKNP

VTIDEYNAGLGIMGELSLSDYFRISQDLKENRLPELNEKNIQNHSLKYFDAMGVNMK SADPNVKEEAKEQQ

RAYTRSWGFYMMENKEKLDIQSKINNLIPKKKSFFSKSPGEDEYKKLDEFILKNSNG SNLTIPKQRKILMKFA

SAKNAVDVTKNLSGEEQTWLKDIIATAFFRQTSKLGMSWFIEQLASPDFRFVIVGFN GEELTTDQIRSNKPW

KHGNRRKEGASEYAEPITFSEIRHAHRKGYDSKINFIKK

SEQ ID NO: 24 (PAU 02095)

MISTFDPAICAGTPTVTVLDNRNLTVREIVFHRAKAGGDTDTLITRHQYDLRGNLTQSLD PRLYDLMQKDNT

VQPNFYWQHDLLGRVLHTVSIDAGGTVTLSDIEDRPALNVNAMGVVKTWQYEANSLP GRLLSVSEQSANE

AVPRVIEHFIWAGNSQAEKDLNLAGQYMRHYDTAGLDQLNSLSLTGAHLSQSLQLLK DDQMPDWAGDNES

VWQNKLKNEVHTTQSTTDATGAPLTQTDAKENMQRLAYNVTGQLKSSWLTLNGQLEQ IIVKSLAYSESGQK

IREEHGNGVVTKYSYEPDTQRLINITTQRSKGHVFSEKLLQDLLYEYDPVGNIVSIL NRAEATHFWRNQKVSP

RNTYTYDSLYQLIQSTGREMADIGQQNNKMPTPLVPLSSDDKVYTTYTRTYSYDRGN NLTKIQHRAPASHNI

YTTEITVSNRSNRAVLSHNGLTPREVDAQFDASGHQISLPTGQNLSWNQRGELQQAT TINRDNSATDREW

YRYNAGSARILKVSEQQTGNSTQQQQVTYLPGLELRTTKSGTNTTEDLQVITMVETE RTQVRILHWSAGKP

NDIANNQVRYSYDNLIESNVMELDTKGKIISQEEYYPYGGTAIWTARNQIEASYKTV RYSGKERDKTGLYYY

RHRYYQPWLGRWLSADPAGTVDGLNLYRMVKNNPIRYQDESGTNANDKAQAIFKEGK KIAINQLKIASNFL

KDSKNSENALEIYRIFFGGHQDIEQLPQWKKRIDSVIYGLDKLKTTKHVHYQQDKSG SSSTVADLNVDEYKK

WSEGNKSIYVNVYADALKRVYEDPLLGREHVAHIAIHELSHGVLRTQDHKYIGVLSS PGSHDLTDLLSILMPP

ANEQDRTEKQRRATGARKALENADSFTLSARYLYYTAQDPNFLSSLRKAHRDFNNKK TDRLIIRPPERR SEQ ID NO: 25 (PAU 02096)

MEREYNKKEKQKKSAIKLDDAVGNNEENMDMTSPLELNSQYTNRKRPGLRERFSATLQRN LPGHSMLDRE

LTTDGQKNQESRFSPGMIMDRIMHLGVRTRLGKVRNSASKYGGQVTFKFAQTKGTFL DQIMKHKDTSGGV

CESISAHWISAHAKGESIFDQLYVGGQKGKFHIDTLFSIKQLQMDGYLDDEQSTMTE YWLGTQGIQPNRQK

NDNMNEHSSKIVGETGTRGTKDLLRAILDTGDKGSGYKKISFLGKMAGHTVAAYVDD QKGVTFFDPNFGEF

NFPDKVSFSHWFTDDFWPKSWYSLEIGLGQEFEVFNYEPKEP

SEQ ID NO: 26 (PAU 02097)

MVYEYAKTNDRKRKLSTQSDNYEEKSFSPVLDLSRNNQNTPNMEDEYETPQNFINRTGRE KLFRAIRMVAS

NKRDPITKDQVSVPPDGNLFTELKDKHLDRAAEYKKLKTWPTHASIIATSPSANTPI AQHVSGDDALSPYIST

GDKPGAVQNTVRNWNGIGPASERRLRPEKTWSPIIEIDVNKLPDTTKIFDLNKPNNT FFSTTNSDIAQNAFAD

KEVLISPEIPGLAITRVINDPEEIKQIANLNPSQSLIEKKNTIPEEKIIFEEKKSVP IHDSDADIPSSSFVFPKRKKP

RNIRSRTDS

SEQ ID NO: 27 (PAU 02098)

MVFEHDKTVERKRKPSIQLGNDKEKSSEQALELPQSKQNNPLLHDLITSNNLRKEAAVFA KQIGPSYQGILD

GLEHLHNLSGNEQLTAGFELHRRITRYLEEHPDSKRNAALRRTQTQLGDLMFTGTLQ EVRHPLLEMAETRP

AMASQIYQIARDEAKGNTPGLTDLMVRWVKEDPYLAAKSGYQGKIPNDLPFEPKFHV ELGDQFGEFKTWLD

TAQNQGLLTHTRLDEQNKQVHLGYSYNELLDMTGGVESVKMAVYFLKEAAKQAEPGS AKSQEAILLNRFA

NPAYLTQLEQGRLAQMEAIYHSSHNTDVAAWDQQFSPDALTQFNHQLDNSVDLNSQL SFLLKDRQGLLIGE

SHGSDLNGLRFVEEQMDALKAHGVTVIGLEHLRSDLAQPLIDKFLTSENEPMPAELA AMLKTKHLSVNLFEQ

ARSKQMKIIALDNNSTTRPAEGEHSLMYRAGAANNVAVERLQQLPAEEKFVAIYGNA HLQSHEGIDHFLPGI

THRLGLPALKVDENNRFTAQADNINQRKCYDDVVEVSRIQLTS

SEQ ID NO: 28 (PAU 02230)

MKGIEGVIMLSHDILPEKLLVSEKKHENVGSYFSDDIGEQSEQTEVSHFNLSLDDAFDIY ADISIENQQELKNK

DNNTNIWSSLGRGDDDHNLKKIINDAFKEKLPQLMEYRRKGYNVIGLDKEGIKKLEG MLKAVPPEIQQPTMK

NLYSAAQELLNTLKQHPLLPENQDMIQQSNLVIRNLSDALEAINAVSKVNQVEWWEE VHKTNKAQSDRLIAA

TLEELFFKVKDKRLPGSNDDYCQQEREETERKIKDLLLYDGYQLTAEHFKFGRLRKS LLAESRVTRLKLAEY

LEKKSVGILTAARDAKMYAMKILLAQTRNNGFNAKDLINAGQVNDRLLSFQQYARHI RAVDGEIDGIILSNPLV

VACIKETNDEPAHIKIARAILPVSEELGTVSKVLRETKEKVQPSKPKEELNHPHQDW WNRGDELWKYIKKTS

WNIKETSVHVTQMVGYEASKTASRAKHKLKESSYSESINGAVKGTALLLLDEIQQAE NRIRQIPQFAWDVQE

AVEQHSSVIQRTAYPDELPELSELLNEQLKHEEARWQAVKKQSRDKLQELIAPITRL AQEKWAQDLYFQLG

EELRKERQDRWKDIQQFDEIMAEAVGQFAEMARELDSEAVRLAEHGHSGGKELQEKV AKWLRDLSKLKGK

VKAGVAKITGTSLDNFSRSGMLARGMSEWAEDLKQSYLQETLQEGSAVAAELFERTL MEVVEENRTHFAK

ESDPEAERFLKRLALALKHAAENTTVYPPTPEEILAGSRSLPEDIRHWAEKKVVSGA ISAAFRGGFKLVTGTF

SLPVRVVIRGAKTGGTLYRGVRAINRSVRLGQGPATQVKSKFINQELSKTAFRLTLS LSPLVAWGMAASITA

GRLYNEKDYPEKIIKNIVIDLPEELLWIGGYAGINAAIRAHAEKAIQQAIQHALDEQ ADKLALRINKEIAGKSADV

NVEIIPQETSVSPAETAQSTPEPLSDFASTSQLTMPELIDIQDNNSAQQPKVRRKRD VSVESEISIDNLNIINA

NTREDKVNSEIKSELRSELKRFENSDANSPMSDVERAIFIDLFLYKNKYEVSESQQD YKNTWLKFRRELESQ

ENKEIKEYLRFRSIIEAYEIYDKKRLDDDTIPEAGTIIKEVIDFFQKLKKENPITFM KLAEAMVKFQYYYEEEDEN

EDRYFKMAEIYYFLNKTENEKKSKTFHLDIIDKYPNENNRLLDEFFLNKNNNNPDLD EIIYKLQSMQEKYRES

YEMLSKVENIHQVLSDDSKNEENIFLDNRIIAAQVFDGSINISLQDKKKWLNRYDQI RNEEGSDGWKLMHIES

ILINLRRINTAINLTAMKSESALLLIDKLLNFQKKARENILHISETPHEDFTSYSQF KTRKELGNDDSKYYAQFD

NYKDNHDAEKEAKEILSQVVARASLSFSELFDKVESIKLFSFVYKNRDGGAPLAAPG RTVVIKFPGKDTGGL

VISNLFLRNHVKRISTKEMEDLKPLTEGMYTRATQHRSLGSYYHIGSQSEHTNALEI LSGMNKEELKTHLKK

QGIWFGEPALFSNEYPKQENTGHLENTTLKNAIIGVSTIQNNAAANYLRSTMYESTG WEKLGDRFIPFYEIGR

RKHYDREYEINSEQLTLDIITSIAIAYPAARGIVATIRSSAIPSILKSGLRGSALFK SLSLELGKMGFNASKVFGG

AVYELIEPYPINSHLNRHNVFNKVKDTAWEFHTDVGLKGGGLKDFIDRFTKEPKEIT ISGYKFKRIKYNQENF

DTMQRMALDYAYNPDSKGKIAQAQQAYKTGKEDYNAPQYDNFNGLSLDKKIERYISP DTDATTKGVLAGK

MNESIKDINAFQTAKDAQSWKKSANKANKVVLTPQNLYLKGKPSECLPESVLMGWAL QSSQDAKLSKMLM

GIYSSNDITSNPLYKSLKELHANGNASKFNASATSISNINVSNLATSETKLFPTEIS SVRVDAPKHTMLISKIKN

RENKIKYVFYDPNYGMAYFDKHSDMAAFFQKKMQQYDFPDDSVSFHPLDYSNVSDIK ISGRNLNEIIDGEIP

LLYKQEGVQLEGITPRDGIYRVPPKNTLGVQETKHYIIVNNDIYQVEWDQTNNTWRV FDPSNTNRSRPTVPV

KQDTNGEWFKHSETGLKGGGPIDDIRKYIARKSAIKIFNQSINYSATKWPPEPIDKN IHMIWIGTKNISEKNIKL

SIDTAKKNPDYNTSIIYDSGISGHEGAKKFMLEKFQDSNVNIIDFRKKSYFSQLKQE PSFAYYEQVIAENKYAQ

ASDILRLLVLKYEGGIYKDIDDIQVKGFGSLTFPKGIGVMREYAPEAGKATAFPNTP IAVTKNNPIINKTLDLAV

SNYQRGEKNVLKLAGPDVFTQALYQEIPGLDSKVLNAQLYQLELAKRQALGVPLEKP KNFADEQLTSAEKE

KINRPYQSIRGLSGYVENGADHSWAVDTNIPSTSTQTSTIVTPLAPKTEMLPPVPSS STKSSTSAPVLQEKIS

YNLATDIDATDYLNQLKQKTNINNKISSPAGQCESLMKPVSDFMRENGFTDIRYRGM FIWNNATEQIPMNHF

VVVGKKVGKDYVFDVSAHQFENKGMPDLNGPLILAAEDWAKKYRGATTRKLIYYSDF KNASTATNTYNALP

RELVLESMEGKTFITSPNWYQTFKRTHNIHPEVTVSDPATFSLNYSVNPTAENLSPP PPPPIPSHGQVPKTV

TPPPPPMRSPLSLSQPLERLPANKTKPIGFNPGENKASFSKLEEAGKHYYKDDKSRQ AAPVNTMSDFDNRY

LSHTTEAPAPSNVAHLAPGNIYNTKVTAKGAEKPAYDIYISKDGESLITSSSYKVDD ITTDSKFGKPLPYSEIM FNSLKKSGVDPKNLKRSVQASIENKVTQDVISAIGTRIQRGQVIRVSPTENPDAFYTLLG TDNCKATLHMLNQ

HAEEFGHKVVTSIEFKGTGYLVMNIGTSTQTSTIVTPPPMPGTSQLVQ

SEQ ID NO: 29 (PAU 02805)

MPNKKYSENTHQGKKPLIKSEANNEHAIDNSPLGIGLDLNSILGNNSASLSQIHDYSFWK ENISEYYKWMVV

VKAHLKQLDWTLKSMDSPESAGANIAKNIGTTTLQTLLNTGGSIAGGAIGGAIGSAI APGVGTIAGMGIGALA

GTGLNYLNDTAIEKLNEKLEIAYPYPKTRNMIFDINNYDKNPLIKAIKKKTKKDNLK VMAGSSLTSQLLGRITPI

KIPAYKLADLAVSHHRALAGLSSDKARHILDFTNSIREVLNESHSDAVAFMRKNYGD NAMGLSGLSSKIKGD

KLTLDTLARTRNKIENRINSINKQTLKLSSKNSNE

SEQ ID NO: 30 (PAU 02806)

MEREYSEKEKHKKHPIQLRDAIEQHAEETANNSLGLGLDLHQAINTPKVPKDNYNEENGD LFYGLAAQRGR

YIKSVNPNFDPDKTNSSPMVIDVYNNHVSNTILNKYPLDKLGKLYGNPQKYAKDIKV TNSLQQDVAASKRGW

YPLWNDYFKAGNENKKFNIADIYKETRNQYGSDYYHTWHEPTGAAPKLLWKRGSKLG IAMAASNEKTKIHF

VLDGLNIQEVVNKQKGSTPLEQGRGESITASELRYAYRNRERLAGKIHFYENDQETI APWEKSPELWQNYIP

KNKSQNESSTPQRNNGALYRLGGPFRKLRASLRKRS

SEQ ID NO: 31 (PAU 02807)

MVHEYSINDRQKRHSFSSANPIDPEVTNRENSRHRFPKDNYNKGHGDLFYGLAPERGKYI KEANPKFDPNN

PENAAMIIDVYNDEISRVILNNNANKISTNRLLNFIYNFRKNRLENLMKNPEKYAKD IKVKDNLRENISPKKIEK

YPLWNDYFEAGIRNKKFNIAEIFKETASQYNSDYYHAWHIGGNSAPRLLWKRGSKLG IEIAASNQRTKIHFIL

DGLKIEDVVNKTKGPAPLKAGPGESITASELRYAYRNRARLAGRIHFYENGKETIAP WDKDPELWQKYTPKN

RSGMEL

SEQ ID NO: 32 (PAU 03332)

MLKYANPQTVATQRTKNTAKKPPSSTSFDGHLELSNGENQPYEGHKIRKIKGLRQHLADR SLNKGHISPLM

NKGLLVGSKDVSIDIPVIAHRYDSSHQLTDAEPLKADSHSNHLDPFYGVIAGFRGDQ VTSSESGSGSIGVHW

GKNTLDSNIMGVNVVNGASGTVGIRIALKDIQHGSPVIVTSGALSGCTMVYSVKNGY FFAYHTGQKPGNNE

WKTGRQGVVATYLSHQALSPDSEPMTVGEQNNDLVNIFANYDQSVITYMGKPGVLID KMAENVGVFNYDEI

KPEKPAIRAGYSYALLAKDDKGKVNVKVLSEDVIVSSGKQGNTVKAINSLKKRLL

SEQ ID NO: 33 (PAU 03337)

MPRYANYQINPKQNIKNSHGKSSSSDFSSGYLSFSNNSLDDPFIRQQVKREFIWEGHMKE IEEASRLGNFA

VSFRAAGGPTLRALGKGAAAKGHDILEKTIKPGSINKAYPKDEASDVIKKVQEAGIE GYVGHWDKKTGRLLGI

YMSSGHGLSDEQVNGKIYPIDLNNLEASLSALKAKENWAALPFTGDYDMHDMISFTG QPHSVPSNSSEERK

IIDRINRLVARSDSNRPFGDIEHNVIRHGAQVSYPAFAMDKEKEEIKKHGGIVKAVA EPGEFPVAIVSKGKWri

ANNIDELNQFYNSIGAKMKVSWKPGAENPGFVSNPQRPGMARFSRKR

SEQ ID NO: 34 (Plu1651 )

MPNKKYSENTHQGKNPLMKSGANNEHDLQDSPLGIGLDLNSMLVNSSTSLSQIQDYSFWK ENISEYYKWM

VVVESHLKQLDWTLKSMDSPESAGTNVAKNMGVTALQSLLNTGSSIAGGAIGGAIGS AIAPGVGTIAGAGIG

ALAGTGLNYLNDTAMSKLSKKLEIAHPYPKTRNMILDINNYDKNPIIKAIKKNVNKD NLKVTAGSSLTSKLVGT

VTSPIKFPAYKFAELAVSHHRALEGLSDDKARHILDFTNSIREVLKESHSDAVAFMR KNYGDNAMGLSGFSS

KIKREKLTLNTLAKTKNEIENRINSINKQTLKVSSRSRNE

SEQ ID NO: 35 (Plu1671 )

MLSTEKHNKDTKHPRNREKKFSIQPENSTQDDEDIKNNSLGVGLDLDQMIRNTSSTLTNA PQKPEDGYYYHI

SRGNNLQSFLQNGFKPQGSPGPTLSEEDFSRRKIGIIKLIYSIIATTINKNRKAKKI SKDNFLMPQEFWHEFKN

FYQNIPTQTNIDDQLLKKSITESIDKLDQNKFMEKHSDRKQTIINNEREAILQQDER INEIISSRAKMIQQREAE

NTEGYIYLAPHKNTLLEYMKHLQEEKNLFLILAVKEDIFTEKGLEQDPQEPHGAVRY KGALSTEELNFVNQE

GQICAIPASIGEMDYGDFILNQQQVIDFCKK

SEQ ID NO: 36 (Plu1672)

MPINDLKKKFEISPQAAQAIGAPARSNSSKQAEHQTEHLELDTSKNRRDRKDLNAQATPN QQHTKKLETEV

NNGGNKSKAQAHTPDLVMKKESSVTPNTRKSPNEKIKAEDIFHRYKDRFSPSDRELP FEIMNEITNNGIAFS

SEKAPESHLDKVKDKKFTLRHYTSGNGQEKPTFNEIGSNFNLVNEGIKTLKRTQGSN TNEDDWNRLGNTAF

TFFLLAIDGEVSDRKFLSNTTHFAEIDIENPAELKELGLDETEFFASPDLLHEKNLS QAPAVKGKLSDLKSLLL

KQSGIKPVQLQSLGAKGILERIDSKFNGSLEIKIPGNVKVKEWKKVEK

SEQ ID NO: 37 (Plu1690)

MPNSKYSEKVNHSANGAEKCSIHSNQYNINNCTLGLGLDLNKKLRTGNERNIEGAQPFIP FPSKQKQYSTSP

IAMADILNESALTSQPIITDLINPQKIKMSDGVKNILNNKEGGGDLVFKALQIKPSD ETLPFNALKIVDTYQEEM

PNKDMSISAYWAPQGGYVDIPAQPDISRHPQYVFTPNFSGCSFVVDKMNEDTLRVRH VQGGQEDVEYNN

QNIDHGMGMITAMEFRDYGYHEADDKVIENTYGFAFLKFNQEKKQWQLHYQKIAAAP NIINIKTKSSWLPFS

KPSIEADTFTFKNMKVPGYSRKNINNN

SEQ ID NO: 38 (Plu1691 )

MPKLTELLSRFENPIQNQPNHISKKNPISNSKVLNNSEEKTAPLELKHDDSKIKSQVSIP NLVKKNEKPAASNT

PNNSHEKVKAEDIFNRFKSKFDPYDRELPFDIMNKITNNEIKFSSEKSKDDYLAKVK DKKFTLRHYTAGTGQE

KPTFDEISSNFNLVNKGIKTLNRTQGSNTNEDDWNRLGNTAFTFYLLAIDGEVSNRK FLSNTTHFAEINIEDS

EELKELGLDQAEFFASPDLLHEKNLSQAPAVKGKLSDLKSLLLKRSGISSVQLGRLD AKAILKSIDNEFGNSL

EIKIPGNVKVNKWNKI SEQ ID NO: 39 (Plu1712)

MPRYSNSQRTPTQSTKNTRRTSPSSNSSTEHLSLSNAPTNDSSVRQEVKEKFIWEGHWEG HMEAIEKASIL

GNFAVSFRAAGKPTLEALGKGAAAKGHDILEKTIKPGSIEKAYPENEASDVIKKVRE AGIEGYVGHWNKETG

RLEGIYMSSGHGLPNGQVNGKIYPIDLNNLEASLAPLKEKKNWAALPFTGDYDMHDM ISFTTQPHSVPSNS

SEEKKIIDRINEYIAKSDSNRPFEDIEHNVIRHGPQVSYPAFAMDKEKKEIKERGGI VKAVAEPGEFPVAIVSK

GKWTIANNINELEQFYNSIGAKMKASWKPGAGNPGFVSNPQKPGMARFSRKK

SEQ ID NO: 40 (Plu1713>

MFSTYSSKNDNQTINKINTEEKHENTETDNHLEINLEHTGKSKPDIEPKDVTTGTINAGT LLYKTTAIPEFLDN

AKSLGLAEYEKRHKDIQDYLNLGKAEDAEKLKNKSQWAGQYFALEKSYDEYANEAPD SYNNLLKNAGKDLL

ENTEEVKVFLYTFKVTKDIKVLKPHNNSNSYYVGDTEGWEKAKEIMNDVQSQSEKND NPFPELKNLEDKNF

LLEELGEKGYAWMGPLHAKEGAEKGTEFSYELAISPNLLRQHLTLESEELLGTYKNR YGYWDKK

SEQ ID NO: 41 (Plu1714)

MKKTDEKYGQYEYKDEDITSYPIAWTNPDNGKIYIGINSPEYSHLNNKGESELNLAKIIS TIIHESLHASSHQH

KGLQSQTDTGADNLNYDEYVTDYFAREVYKQILPDKDYVANCFTKGLGGENKIWGGN IVEFMIQ

SEQ ID NO: 42 (Plu2400>

MVYEYDKTIERRRNPSIQLNNNEKSSEQALELSQNNPLLHDLITSNNLRKEAAVFAKRIG PSYQEILDELEHL

HHLSGNEQLAAGFELHRRITHYLEEHPDSKRNTALRRTQTQFGDLMFTGTLQKIRHS LLEMAETRPEMASHI

YQIAREEVKGNTPGLTDLMVRWVKEDPYLAAKTGYQGKIPNDLPFEPKFHVELGAQF DDFKKWLDTAQSK

ELLTHTRLDEQNKQVHLGYSYNELLDMTGVESVQMAVYFLKEAAKQAEPGSTKSQED ILLHRFANPTYLAQ

LEHSRLAQIEAIYHSSHDTDVTAWDQQFASDALTQFNHQLNNTVDLNSQLSLLLKDR QGLLIGESHGSDLNG

LRFVEEQMEVLKAHGVTVIGLEHLRSDLAQPLIDKFLASGNEPMPAELAALLKTKHL SANLFEQARSKQMKII

ALDNNSTTRPTVEGTQHGLMYRAGAANNVAVERLRQLPAGEKFVAIYGNAHLQSHEG IDHFLPGITHRLGL

PALKVDENNRFTAQVDNINQRKRYDDVVELPRIQLTS

SEQ ID NO: 43 (Plu2401>

MEHEYSEKEKPQKCPIQLRDSIEHDKEDINTTTPLELNSQYTNRKRAGLRERFSTTLQRN LPGHSMLDRELT

TDGMKNQESRFSPAMIMDRMMHFGVRTRLGKVRNSASKHGGQVTFKFAQTKGTFLDQ IMKHKDTSGGVC

ESISAHWISAHAKGESIFDQLYVGGQKGKFHIDSLVSIKQLQMDSYLDDEQSTMTEY WLGTQGIQPIMQKND

VDEHSSKVVGQTGNKGTTDLLRAILDTGDKGSGYKKISFLGKMAGHTVAAYVDDQKG VIFFDPNFGEFSFP

SITSFSRWFTDDFWPKSWYNLEIGLGQQFEVFNYELKKS

SEQ ID NO: 44 (Plu2514>

MYDSKKKNSEPTTKKKFERSNYSQWDDSINHYEDMNRARIKNRNDILTTVDYFGEKKKTM HTFEYQSDIKH

DTNFNNKNKSLFESFAASFVLQNPSFFSGVIDKLSKKLFNIISKIDERNNFQKKLYD FIEKDTSPEGQFGRFTL

GKNEILNVLQVKSDTPQLFVKKMLLIKSLGAFIIDFSSKDIGNYDFIFDGKGREVND IIEKNRPTNLFKVRGRTN

IKSSQHRSDIGILDTPTFDSLTEEQKSFLTIPELTKRRPLFRTFTHELDAEDKRVVE SVFVNRTFDCDSPLIGS

VSGSTSCVLVAADILFPDMTMVERKKLAIATFAFLVGGGYHSATEVFDVAYPGLDLN KEIEELIENNPIQENA

GVATLRQLIGNSGF

SEQ ID NO: 45 (Plu2515>

MPISNLAKESEVRAVKDIPCKNIETDNHLEIGLSSGLSRSKDTSKFKKNSINTIKLIDDI IALHNDPKGNKLLWN

DNWQDKIINRDLANIFEKIDESVSELGGLEMYQEMVGVNPYDPTEPVCGLSAQNIFK LMTEGEHAVDPVEM

AQTGKIDGNEFAESVDQLSSAKNYVALVNDRRLGHMFLIDIPSNDQETVGYIYQSDL GQGALPPLKIADWLN

SRGKDAVSLNKLKKLLSREFNLLSDDEKRALISETLDIHKDVSNVELDRIKRDRGVD IYLTEYDVNNFYENIET

LKSKLSNYDKKLSKPK

SEQ ID NO: 46 (Plu1649>

MLANVLPNLASFLKYEKETPLFFIEDGFNFQNLNPGRVPLIKTPEQRKAGDTQSPAFLCS GVILRGTIHSNDY

KFWQPSPSSIKSGGVSFSYLRKDAKFKRLAYGYKNGFIIFPEHIAPEDRVDFSVLCA FPIDGYTNERANQGC

GENITKAKDKGKSCQEQNVTNSDDWIKNYRKVNSQDFFQCGFNVTKDVNNPAIAFYQ MLESIKKLPRTPNT

PPKQNEIRISTWEESDPNKLPIEALFYSENSGLADAQKDQRDYKNATGKFLPIVKML LPRTLNEDALFKFNIK

DQVINP

Leader Sequences (e.q. with SEQ ID NO: 47 - 92 corresponding to amino acids 1-50 of

SEQ ID NO: 1 - SEQ ID NO: 46. respectively)

SEQ ID NO: 93 ( Photorhabdus asvmbiotica strain ATCC43949 PVCPnf operon, pyd - DVC16; e.g. corresponding to genes PAU 03353 to PAU 03338 of the sequence of GenBank accession no. FM 162591.1)

ATGTCTACAAGTACATCTCAAATTGCGGTTGAATATCCTATTCCTGTCTATCGCTTTATT GTTTCTGTCGGAGA T G AG AAAATT C C ATTT AAT AGTGTTTCAGGATTAGAT ATT AGTTATGACACCATT G AAT ACCGAGATGGTGTTG GTAATTGGTTCAAAATGCCGGGTCAGAGTCAGAGCACTAATATCACCTTGCGTAAAGGCG TTTTCCCGGGGA AAACAGAACTGTTTGATTGGATTAACTCTATTCAGCTTAATCAGGTAGAGAAAAAGGATA TTACCATCAGTTTA ACTAATGATGCAGGTACCGAATTATTAATGACCTGGAATGTTTCTAATGCTTTTCCCACT TCATTGACTTCACC TTCATTTGATGCCACCAGTAATGATATTGCAGTACAGGAAATTACGCTGATGGCAGATCG GGTGATTATGCAG GCT GTTT GAAGCATT GAT ATTT AATCATCTC AT AT AAGGGAACTTTT ATG ACAACCGTT ACCAGTT ATCCTGGC GTTTATATTGAAGAATTAAATAGCCTGGCCTTGTCAGTTTCAAATAGCGCCACAGCGGTT CCTGTTTTTGCTGT GGACGAACAAAACCAATATATTAGTGAAGATAATGCAATCCGTATTAATTCGTGGATGGA TTATCTTAATCTGA TTGGCAATTTT AAT AAT GAAGACAAATT AGAT GTTTCT GTGCGTGCTT ATTTTGCCAATGGAGGTGGAT ATT GT TATCTCGTCAAAACAACGAGTTTAGAAAAAATTATTCCAACCTTGGATGATGTAACCTTA TTGGTTGCTGCGG GCGAAGATATTAAAACGACAGTAGATGTTTTATGTCAGCCAGGAAAAGGGTTATTCGCAG TCTTTGATGGCCC

TGAAACAGAGTTGACTATCAACGGTGCGGAAGAGGCAAAACAAGCCTATACCGCCAC ACCATTCGCTGCGGT

TTATTATCCTTGGTT G AAAG CGGATTGGGCTAACATAGATATTCCACCCAGTGCAGTGATGGCGGGAGTTTAT

GCATCGGTGGATTTATCCCGTGGTGTATGGAAAGCGCCTGCCAATGTTGCGTTGAAA GGGGGCCTGGAACC

TAAATTTTTAGTCACGGATGAATTGCAGGGTGAATATAACACTGGCCGCGCTATCAA TATGATTCGTAATTTCA

GT AACACAGGTACT ACGGTTTGGGGTGCAAGAACCCTGGAAGAT AAAGACAATTGGCGTT AT GTTCC AGTGC

GACGCTTGTTTAATTCTGTGGAGCGGGATATCAAGCGTGCCATGAGCTTTGCTATGT TCGAGCCTAATAATCA

GCCTACTTGGGAGCGGGTACGGGCGGCGATTAGCAACTACCTTTATAGCCTGTGGCA ACAGGGGGGATTAG

CTGGCAGCAAAGAAGAAGACGCTT ATTTT GTGCAAATTGGTAAAGGT AT AACGAT G AC AC AG GAG C AG ATT G

ATGCAGGGCAAATGATTGTTAAAGTCGGTTTGGCTGCTGTACGGCCTGCGGAATTTA TCATTCTCCAGTTTAC

GCAAGATGTAGAACAGCGTTAATCATATGATTATGAGGAGTTATCATGTCTGCTATT CTGAAAGCGCCTGGCG

TTTATATTGAAGAAGACGCTTCCCTAGCGTTGTCTGTCAGTAACAGCGCGACTGCCG TGCCTGTTTTTATCGG

AAAATTTACTCCGACAGTGGTTGATTCAATCCAAGTCTGTACCCGTATCAGCAACTG GCTTGAATTCACTTCC

TCTTTTTCCCTAGCTCCAACAGTTGAGATTGTTGTCCAATCTAACACTGAATCTGAA TCTGAATCTGAAACTTA

CCACTATATTGAGACAATCAATTTATCTCCAGCTGTGGAAGCATTGCGACTCTATTT TCAAAATGGCGGAGGA

GCTTGCTATATCTACCCATTAAATGATGCTGAAGATGAATTGGTTCTGGCGGCCATA CCAGAAGTCATTGAAC

AGAAAGGTGATATTACTCTGTTGGTTTGCCCGGAACTCGATCTGGATTACAAAACTA AGATCTATGGCGCAGT

GAGCTCACTGTTGAATGATAACAAAGTGGGCTATTTCCTGATTGCGGATAGCAATGA TGGAGAATCTGTGTCA

GGAGTATGGAATAGTGCTAAGGCCGCCGCCTATTATCCCCAGTTGGAAACTAACCTA AAATTTTCCACGTTGC

CTGGGGATAAGGACATTCGTATCAGCGGTTATCAGGATGATGATGAAACACATAAAC CGAAAAACTTGGATG

AGCTCAGGACAATCAACGAGGCGTTGGCACAGGATATTGATGCAAGATTGCTCGAGG AGAAACAACGTGCT

GTCATCATTCCGCCAAGTGCTGCCATTGCGGGCATTTATTGCCAAACGGATAATCGT CGCGGTGTTTGGAAA

GCGCCAGCCAACGTTGCGCTCACAGGGATCGGGAGTTTGCTTGATAAGGTAGACGAT GAACGGCAGGGAGA

GATGAATGACAAGGGAATCAATGTCATCCGTTCATTTACCGACCGTGGTTTTATGGT CTGGGGAGCCCGTAC

TTGTGTGGACGCTGCCAACATCAGCTGGCGTTATATTCCTGTTCGTCGCCTGTTCAA TTCCGTTGAACGAGAT

ATCCGCCAGGCGCTGCGCGCTGTGTTGTTTGAAACTAATAGTCAGCCTACCTGGGTA CGTGCTAAGGCTGC

CGTTGATCAATATCTTTATACCCTTTGGCAGAAAAATGCATTGATGGGTGCTCGCCC GGAAGAAGCTT ATTTT

GTGCAAATTGGTCAGGATATCACCATGTCCGAGGCTGATATTAAACAGGGTAAGATG ATCATGACTGTTGGTT

TGGCAGCAGTGCGGCCAGCTGAGTTCATCATTCTGCAATTTACGCAGGATGTTGTTC AGTAATCTCCATGACT

AAACGCCAGGCACTGTATTGACAGTGCCTACTCTAACCATCTTGGAGGAGGTGATGA TGATGGAGAGACTCC

AACCGGGTGTGACTTTAACAGAAAGTATAATCACGATGGGTCAGCAAGAGATACCCA GTGCTGTGCCGGTGT

TTATTGGTTACACCGTTCGTTATCCGGAACAATCGGAAGCATCAGTCCGTATCGACA GTTTGGCCGAGTATAC

CAGCCTGTTTGGTGACGACCATGTGATGATGTTTGCTGTCAGGCACTATTTTGATAA TGGCGGGCAACAGGC

ATTT GTTTT ACCCCT GAAGGACAAT ATGCCATCAGTGG AGAT GACCACAGCT GAAGCGGAAAATCT GAT AGC

CGCATTGCGCTCTGCTACGGTTAGCGAAGCCATTGGTGGGCATAGTCAGATTACACT GATTTTGGTACCGGA

TATGGCTCGGCTTAATGACAGTGATATTGATGACTCCTCAACCCAGGTAAGCCTGTG GTCCCAAGGCTGGGA

GGCGCTGCTGCAATTGAGTCAGGTTAGGCCCAACCTCTTTGTGCTGTTAGATGCGCC GGATAATGTTGAACA

GGCGCAGAAGTGTATGACAACGCTATCGTCAGATTATCGTCAATGGGGGGCAGCATA TTGGCCTCGTCTGG

AAACTACCTATCAGAAAGAAATATCTGGCAAGGACAATGAATCTCAGGGAATTTTCC AGGGGACTGTTCTGTC

ACCCACAGCCGCGGTCGCAGCGGTAATTCAACGCACGGATAACGACGCGGGTGTTTG GAAAGCACCGGCC

AATATTGCCTTATCCCAGGTTATTCGACCTGTTAAATCTTATCTTCAGGGAAGTGTA CTGTTTAACAGCAGCG

GCACTTCGCTCAATGTGATCCGCAGTTTCCCAGGTAAGGGCATACGGGTATGGGGAT GCCGCACTCTGGAA

AACACGGATAATACGCAGTGGCGCTATCTGCAAACACGTCGGCTGGTTTCCTATGTA ACAGCGCATTTGACC

CAATTGGCTCGCAT GT AT GTCTTT GAGCCAAAT AAT GAACTT ACCTGGAT GAAGTT AAAAGG ACAAAGTT ACA

ACTGGTTACGGCAATTATGGTTGCAGGGTGGCTTGTATGGTTCACAGGAGGATGAGG CATTTAACATTCTGT

T AGGCGT AAACGAGACGAT GACT GAGGAT GAT GTTCGTGCAGGAAAAAT GATCAT GAAAGTT GAGTTGGCT G

TGTTGTTTCCTGCCGAATTTATTGAGATCAGTTTGGTGTTTAATACCCAAACAGAGG CGCTGTCTTAAGAAGG

AAAAAGT ACGATGAACGATTATTACACACCCGTGGTATCCCATCGTTTTATGGCGAGTTTTATTTTT AACCGCA

TTCCCGATCCGCTGGATATTCGTTTTCAGCGTATCTCTGGCCTTAGTCGGGAACTAC AGGTGACTCAGTACA

GT GAGGG AGGAGAAAATGCCCGT AAT AACT ATTT AGCT GAGAAAATCCAACACGGTACGTT GACTTTGGAAC

GGGGCGTGATGACAGTCTCGCCATTGACCTGGATGTTTGATCGGGTATTGAGTGGTG AAAAAATCGCTTATG

CCGATGTGGTGGTGATGCTACTGAATGAAAATTCACTGCCATTGTCCAGTTGGACGT TGAGCAATGCGCTGC

CGGT ACGCTGGCAAACCAGCGACTTT GACGCTAAC AGCAATGCCAT ATTGGT GAAT ACCCTT GAATTGCGTT

ACCAGGATATGCGCTGGCTTGGAGTCAAAATATGACAGTAGAAATCAGAGAGTTACT TATCCAGGCAAAGGT

AGTGCCATCAACACGACCGACT GAAT C AG AAC G G C AAAAC CATTCTTTGATACAG G AAAG TCTGGATGAGGC

GACTTGGGTGGAAACGAT AAAACGCGAAGT GTTGGCCGCATT ACGCG AT GAGGAAGGGTGGCGTCCAT GAG

TCT GATT GAACGTGGTTT AGCT AAGCT GACAATT AATGCTT AT AAGGAT AGGGAAGGGAAGAT ACGGGCAGG

AACGTTGCAGGCCATGTATAACCCTGACTCCTTGCAACTGGATTACCAAACGGATTA TCAGCAATCCCAAGC

GATT AAT AG C G AAAAG C AAAG TAGCATTTATGTACAGGCCAAGCCCGCAGGGTTATCACTT G AATT AATTTTT

GATGCCACGATGCCGGGTAACAAAACCCCCATTGAAGAGCAGCTCATGCAGCTCAAG CAACTGTGCAGTGT

GGATGCAACCAGTAACGAGACGCGATTCCTGCAAGTTAAATGGGGCAAAATGCGTTG GGAAAGTCGGGGTT

ACTTTGCTGGCAGGGCCAAGAGTTTGTCT GT GAATT ACACTTT GTTT GATCGT G ATGCG ACTCCCTT GAGGGT

ACGGGTAATATTGGCATTAGTGGCTGATGAAAGTCTGGTGTTGCAGGAGACTGAACA AAATCTGCAATCTCC

GGCAAAAATCGCATTACGCATACAGGATGGGGTATCTCTGGCTCTGATGGCAGCCAG TACGGCATCAACATT

GTCAGGCGGTGTGGATTATCTGACGCTGGCCTGGCAAAACGGTCTGGATAATCTCAA TGGGTTCGTTCCGG

GTGAAATATTGCAGGCCACCAGGGGAGACGAATCATGAGCCACCAACTGAAAATTAT TGCAGATGGTAAGGC ACTGTCACTTTTGGCCGCGGTAGATGTGGACACCTGTTATCGGGTTAACAGTATACCTTC TGCGACATTGAAA

CTGAGCGTACCGGATAGGCCACTCTCTTCTTTCAGTCAGACGGATGTTCAGACAGAA CTGGCCCACTGTCAG

GTAGGGAAAACCCTGCGTCTGGAATTGATTGATGGTAGCAAAAAATGGGTGCTGTTT AATGGTCTTATTACCC

GTAAGGCTCTGAGAATTAAGAATAAGCAATTATTGCTCACTCTGGTTGTCAAGCATC GGTTGCAACTGATGGT

GGAT ACCCAGC ATTCACAGCT GTTT AAAG ACAAAAGCGAAAAAGCGATCTT AAGCACGCT ATT GAATCAGACC

GGAATCAATGCTCGCTTCGGAAAGATAGCGGCGTTAGATCAAAAGCATGAACAGATG GTGCAATTTCGTTGT

TCAGACTGGCATTTTCTGTTGTGCCGACTGTCGGCAACCGGTGCATGGTTGTTACCT GCCATAGAAGACGTT

CAGTTT GTTCAACCT GATGCTCT GAAATCAAACTCAGCCT AT ACCTT GAAGAGC AGGGGGG AT GAGAACAAA

GACATCGTTGTCAAGGATGCTTACTGGCAGTTTGACAATCAAATCAACCCCGCTTTG CTGGAAGTCAGTGGC

TGGGATATCAGTAAGCAGCAGGTACAATCAGGCGGTCGCTACGGAAAAATCGCGTTG GGTAAGGCGGCACT

CTCTCCTGATGGATTGGCATCCCTTAATAAAACGGGTTGGGACATTTGTTATAGCAG TCCGTTAACAACCCAG

GAAAGCGGTTATCTGGCACAGGGATTATTGCTTAACCAGCGCATTTCTGGGGTGACA GGAGAATTTTTGCTC

AAAGGAGATGGGCGTT ACCAGTTGGG AG ACAACATTCAGCT GACTGGATTTGGTTC ACAGTT AGATGGT ACG

GCAAGCATTACTGAGGTTCGCCACCGTCTTAATCGGCGAATTGATTGGGAAACCACG GTGAGCATTGGTTTA

CAACATGAATATTTGCCGATATTACCTGATGCTCCCGAACTACATATTGCGACAGTA GCGAAATATCAGCAGG

ACAGTGCGGTGTTAAACCGTATCCCCATTATTCTGCCGGTACTGAATCGTCCCAATG AATTTTTGTGGGCCAG

ATTGGGGAAACCTT ATGCT AGCCAT GAAAGCGGTTTCT GTTTTT ACCCAGAGCCAGGT GACGAAGTT ATT ATT

GGTTTTTTTGAAAATGATCCGCGTTATCCAGTTATTTTAGGTGCTATGCATAATCCG AAAAATAAGGCCCCTTT

TGAACCAACCCAAGATAATAGGGAAAAAGTATTGATCGTTAAAAAAGGTGAAGCGCA ACAACAATTAGTCATT

G ATG G C AAAG AG AAAAT G ATC C G AATT AAT G C G G G T G AAAAT C AAAT AAT G CTT C AG C AAG AT AAAG AC ATTT

CTCT GTCAACGAAAAAAGAATT AACACT GAAAGCGCAG ACAAT GAATGCCACGATGG AT AAATCATTGGCAAT

GTCCGGGAAAAACAGTGTTGAAATCAAAGGCGCAAAAATTAATCTTACCCAATGAAA GGTGACGATGAATGG

AAAATCAAATACTGACACAACTCTATGGTCGTGGTTGGGCTTTTCCTCCGGTCTTTT CCCTTGAAAAGGGGGT

AGAGATGGCT GAAGGGGCGGAAGAT GT GAGACAAAGTTTGCAGATTCT GTTT AGT ACT GAGCCGGGGGAAC

GTCTTATGCGTGAAAATTATGGCTGCGGATTAAATGATTTTATGTTTGAAAATATCC GCAATGAACTTATTGCT

GAAATTGAATCCCATATCCATGACAACGTATTACGATATGAACCCCGGGCTGATATG ACTGATATTCAGGTTC

GTCAATCCCCTGGCATGGGGAATACTTTGCAAGTGCAGGTCATGTATCGCCTGAGAG GGAGTGATATCAATC

AACAAATCCAGGGAGT ACTTGCACT GAGT GAAGGCCGGGT G ACGGAGGT AGT AT GAGT GAAGCGATT GTGG

TGGATGGTGACGTGTTACAGTTTGATCCCAACTTTGGCAATCGGCAGGTGACGGTTC CCAGCCCAGGAAAAA

TTAGCGGCACAGGACATGCGCAGGTAAGTGGAAAAAAAGTGTGTATTCTGGGGGATG AGAAACAGGTCAGG

GTTTCTGCAACCTATATTACAACAACACATACTACGCCGGGAACAGGAACCATTACT ATCAGTGCTCTGGATG

CTGGCCAGCAGGCCCTTCAGTGTACCAGTGGGGCGGCTTTAATTATCAAGGGGCAGC AATTTACGGCGATG

TTTACGCCT G AATT GCCAGCCAT G AAT AAT ACAGTGACTCCGCCACAACCGGATGTTACGACACCTTCATCAG

GAAAAGG AC GTTTT AT C ACT C AACAAAATTTTGCT ACCGT AAATT AG AGT ATT G ACT GAATT AAAT AGAATT AA

CGAAGGT GT AAAT AATT ATTT ATTTGCT GACGAATCGCT GT GACAAATAAAC ACAGGT GAT GTT ATGGAATT AA

AT GAGTT AACT AACAAATT GTCAAATTTGGTGCCAAT G ACCGATTTT AAATT AGAT AATCGAGCCAGTTTGCAA

TTGCTTAAATATATTGAAGCGTATACGAAGATAATACCCTTTAATTCTGGCGATAAA TATTGGAATGACTTTTTC

TTTATGTCAG G AAAT AC G C C AG AG AAACTT G C AAAATT AT AT C AG AAAG AAAT AG AAC C C AAT GGGGAGTTAT

TACCTCAGCAGGCTTTTTTGTTGGCGGTTTTGCGTTTATTGGAAACACCAATATCCT TATTAAATGTATTACCT

GCTGCTCATCGTGAGCTCTATTATCGGGAGCTTTTAGGCTTGTCTTCCCATGCGGCA CAGCCTGATCAGGTT

GCTTT ATCT ATGGAACT GAATTCGACAGT GATGGAACAGCTGCTCCCT GAAGGAACCCT GTTT GAGGCTGGT

CAGGATGAACAAGGCAATGCATTGCAATATGCCCTGGATGCCAGTTTGCTGGCTAAT CGTGGATATATCAGT

GACTTGCGCTGGTTACGGAATGACGGGGAAAAGCAATGGGTTACTTCTGCTCCATGG GATTTACAGGCACAG

GT GTCACTGCCGTCT GATGGGAT ACG ATT ATTTGGT AAGACAAAT AGTGATCAGCAGGT ATTTGGTGGGGT G

TT GAT AACGTCATCACTTCTGGCG ATGGAAGCGGGG AT AAGGAAGATCATT GTT ACTTTT GAGCAGGAG AT G

AACACCCAAGAACTGGTGGCACAGGTCAGCAGTGGAAATCAATGGCTAACATTGACG TCTGAGGTAAATAAG

AAAGAGGTCACACTGACACTGTCAGACAAAGAACCGGCAATCAGTGCGCCAGAGGAT CTGGATAATCTCTTT

TTCACGCAACCGGTACTCAGGCTACAGGGAAAGGATAGTCAGGCACTGCCGGAGGTG ACGGGTATCAGCGT

TTCGGAAAAGGATG AT ACTAAGGAT ACCTCTTTT GAGAT GT ATCACTT AACACCATTTGGTT AT AGCAGT GAT A

T AGAGCCATTGGAGGAAAATCCAGCGTT AT ATTT AGGCTTT ACT GAT GT AAAGCCAGGGCAAACACTGGCGC

TGTATTGGAAATTAAAATCCCCGCAGCAACCAACCGTTTCCTGGTATTACCTGGATC AACATAATCAATGGGC

TGAATTGGATTCATGGGTCAGTGATGGAACCCAGAATCTGTATCAGGATGGTACTTG GCACGTTGAGTTGCC

TGTGGATGCATCCAATCAGGCAGAGCAGATGCCAGTTGGACGCTATTGGTTGCGGGC AGTGGTGGAGGTAC

CCGCTCATGAGGGGGCGTTGGGGAAGGCTCCTTGGCTATATGGTCTAATCTATAACG CCATGACGGCAACC

TTG GTT AAT GTAGATAGCATCAGTGACAGCCATTTCTTAACCCCTTTGCCTGCCAGCAGCATACAGCGG CCC

GTTGAACCCATCATTGTGTTGGCATCGGTCAACCAGCCTTGGGCATCATGGGGTGGA CGTATACCTGAATCC

TACAGTGCCTTTTTTGAACGGATAGCTCAAAACCTGTCTCATCGAAACCGGTCCTTA ACCTGGGGAAATATGG

TGACATTACTCAAAGAGCGTTATGTCAGCATCTTTGATGTTAAGTATCCAGGTAATG ATGAACTCACCAGAGT

GCCAGCATTGG AGCAGCAGCAACT AACAGT G ATTCC AGCAAACCGGT ACAACG AT AGCGAT GATTCTCTGCG

TCCGGTACTGAATCCTGCTCGTCTGCAAGAGATGGCTGATTGGTTGCAGCAGAAAGA CTCTCCCTGGGCCTC

TATTGAGGTCAGGAATCCAGAATACTTGGATGTGAAAATCCATTACGAGGTGATTTT TAAACCTGATGTGAAC

GAAGATTTTGGCT ATCGCCAGCTACAGCAGCAACT GT GT G AGGT GT AT ATGCCTTGGAGC AT AGAT G AGCAG

CGGCCCGTTGTATTGAATAACAGCATTAATTATTTCCAGTTGTTAGCCACTATTCAA CAGCAACCGCTGGTTG

AGCGAGTCACTCGTCTGACACTACATCGGGCTGATTCTTCTGATGAGAGTGATGGTA CAGCATCTGTGGAAG

CCAAAGATAATGAAGTGCTTATTTTAGTCTGGGAAGAGGACGATAATCTGCAATACC GAGGAAATGACTATGA

GT AATCAGGATGC ACT GTTTC AT AGCGTT AAAGACGAT ATTCACTTT GAT ACCTTGCTGGAACAAGCTCATCA GGTGATTGAAAAACAGGCTGAAAAACTGTGGAGTGATACGGCAGAGCATGATCCGGGTAT CACATTTTTGCA

GGGAATCAGTTACGGTGTGTCAGATTTGGCTTACCGACATACATTACCCCTGAAAGA TTTACTGACTCCGGC

GCCGGATGAGCAGCAGCAAGAGGGAATTTTTCCTGCCGAATTTGGCCCGCATAATAC ACTGACTTGTGGGC

CGGTGACAGCGGATGATTATCGCAAGGCATTGTTAGATCTACACAGCAGCGACAGCC TGGATGGTACTCAG

CAGGATGAGGGGGATTTTCTGTTCCGGAGTGTGCAACTGGTGCGTGAACCGGAAAAA CAGCGTTATACCTAT

TGGTAT GATGCAACCAAG AGGGAAT AT AGCTTT GTCAACAGT GAAGGGGCT AAAGAGTTT ACCTTGCGGGGG

AATT ACTGGTT GT ATCTGGAACCAACCCGTTGGACTCAGGGTAAT ATTGCCGCTGCT ACCAGACAACT GACA

GAATTTTTGACTAAAAATCGCAATATTGGTGAATCTGTCAGCAACATTATCTGGCTA CAACCGGTTGATCTGC

CACTGTTGCTGGATGTTGAACTGGATGATGATGTAGGTGCACAGGATGTCCCCGGTA TTTTTGCGGCGGTGT

AT AGCACCGC AGAGCAGT ATCT GATGCCTGG AGCACAGCGTT ACCGT ACGGAAGT ACTGCAAAATGCTGGG

ATGAGCAATGATCAAATCTTCGAAGGTCCATTATTGGAACATGGCTGGATACCAGAG CTGCCGGCAGCCCGT

GATTATACTCAAAGGCTCACTCTCAATCTTAGCCGGTTGGTAAATAGTCTGCTTGAG ATTGAGGGCATTAAAC

ATGTGAATCGTCTTCGTCTGGATGATAGCTTCGATAAAACTGCTATTGAACCCGTTA AGGGGGATACCTGGTC

GTGGTCGATCAAAGAGGGCTATTATCCACGTCTTTGGGGAGAAGACCCACTTAACCA ATTGGCGCAACAAAA

TGGCCCGCTT AGGGT GAT AGCCAAAGG AGGGATT AGCGTCAGT GT GAGT AAAGAGCAAATCCAGGCCAGTT

TACCCAGTCAATCACTGATTCAAAATGAGCCGGTAATATTGGCTTACGGCCAGCACC GTGACGTTGGCAGCT

ATT ATCCCGTCAGT GAT ACTTTGCCGCCTTGCT ATGG ACT ACAAC ATTCTTT GTCT GAAAGT GAACACTT ATT G

CCACTTCATCAATTTATGTTGCCATTTGAACAATTATTGGCCTGTGGTTGTCAACAG ATAGCCATGCTCCCGC

GGTTACTGGCTTTTCAGCGCGAAGGTTATGAGGTTTGGGGTGATCAGTGGCCCTTTA AGTCAGGCTCAGTGA

ATGATGACGCCCATCAAGATTATGCCCCTGCATTAAAGGATTTGTTAGGACAGATTG CGCTGGATAGTGATCA

T GAATTGGAT ATT ATT AATT ACTTGCTGGGTT ACTTTGGCAC ACAGCGGGCACCGCGT ACCTTT ACGACACAA

CTCGATGATTTTCGTGCGGTCCAACAGGGTTATCTGGCCCAGCAACCGACATTGACT TACCACCGCTCCAAT

ATTCGTATCGATCAGGTATCGTCGCTACAAAAACGTATTGCTGCTCGCATGGGGCTG GGCGGTGAGTTGTTT

AAACCTCAACCGGATCTGAGCCAACTGCCTTTTTATTTGATTGAACATCGAGCGTTG CTGCCAGTCAAACCCA

ATAGTCAGTTTGATAAGGAACAGAAACCAGCCTCGGTGACAGAGGAGGGGGGCAGCC AAACAGGTCAACAT

TATGTGGTCATTGAACAGAAGGGCATTGATGGCAAGCTGACACAGGGGCAAGTGATC AATTTAATTCTGTAT

GAAGGAGAGCAGGGAGAAACCCAATTTACGATACGCGGTCAGATGGTATTCAAAACC GAGGGGGATAAGTT

TTGGTTGGATGTGAATAATAGTGCGCAACTGGAATATAATCTGGCGCGGGTAATGAC AGCAGCCAAGGCGAG

TAAACTCTTTTGGCAAAACAGCCCGGTATGGATGGAGGATATGGGCTATCGTCTGGC CTATGCTAGTGACCA

ATCCTCATTGCCTGTGAATCAACGGCGCTTGACCCGCACAGTGCAAACTCCATTCCC GCCGATGGTTGTTGT

AGGT AGCGAAATCACCCTGTT AAAGCAGGT GGGGAT AGTCAATTT AAAAAAAGCGG AGTCAGAAAAACTTT AT

GCAAAAGTTGTTAGCTTTGATCGCATTGAAGGGACCTTGATTATTGAGCGTTTGGGT AATTCCACTCTGGCTT

TTCCTACCTCGGAAGAGGCGTGGCGGTATAGTTGGTATTTTTCGGGGGAGAAATATG AAAGGACTGACCGCT

TTTC ATTT GT GATT AGCGT AGT AGT GAACAGT GACTT AATT AAATTGCCCGGT GTT GATCCCTATAAATTGGAA

G AAT G G GT G AAAG AAAC GATTCTTACC G AATTT CCAGCTCATATTTCTATGATTATCCATTGGATGGATCGGG

AAGCCTTTTTAAATTTCGCCAATACCTATCAGCGTTGGCAAAATAATGGTACGCCAC TGGGGGATGCGGCTTA

TTCCATTCTAGAAAGTTTGACACTTGGTAAATTGCCATCTGCCTTAAAAGGTGTTGG CACAATGCGTATTGCC

ACATCTAGTCAAAGAGAAGAAGTGGTGGGTAGTAATGGTGATCAATGGAATACAGAT GGAATAACCCAGAAT

GAATT ATTCT AT GTTCCT AAAGAGAGCT AGGAAAAATAAAT ATCTGCC ACT AAT GAT GTT GAATT AAAT AT GTTT

TCTGGAGTTAATCATGAACGAAACTCGTTATAATGCAACTGTACAAGAACAACAAAC ATTATCTAATCCAAAAG

CTGTTGGACCTGACATCGATAAATTAAAGGATAAATTTAAAGAGGGCAGTATTCCCC TGCAAACCGATTTCAA

TGAGTTAATTGATATTGCCGATATTGGACGTAAAGCCTGTGGTCAAGCGCCACAACA AAATGGCCCAGGAGA

AGGATTGAAATTGGCTGATGACGGTACGCTTAATTTAAAAATAGGCACTTTTTCCAA TAAAGACTTTTCTCCAT

TAATATTAAAAGATGATGTTTTATCTGTAGATCTTGGTAGTGGTCTGACTAATGAAA CCAATGGAATCTGTGTC

GGTCAGGGCGATGGT ATT ACAGTT AACACT AGCAATGTAGCT GT AAAACAAGGT AACGGAATT AGCGTT ACT A

GT AGTGGTGGTGTTGCCGTT AAAGTT AGTGCT AAT AAGGG ACTT AGCGTT GAT AGT AGTGGTGTTGCAGTT AA

AGTT AAT ACT GAT AAGGGAATT AGCGTTG ATGGT AATGGT GTTGCAGTT AAAGTT AAT ACT AGT AAAGGAATT A

GCGTT GAT AAT ACAGGTGTTGCAGTT AT AGCT AATGCT AGT AAGGGAATT AGCGTTG ATGGT AGTGGT GTTGC

AGTT AT AGCT AAT ACT AGT AAAGGAATT AGCGTT G ATGGTAGTGGT GTTGCAGTT AT AGCT AAT ACT AGT AAA

GGAATT AGCGTTG AT AAT ACAGGT GTTGCAGTT AT AGCT AATGCT AGT AAGGGAATT AGCGTT GATGGT AGT G

GTGTTGCAGTTATAGCTAATACTAGTAAAGGAATTAGCGTTGATGGTAGTGGTGTTG CAGTTATAGCTAATAC

T AGT AAAGGAATT AGCGTT G ATAGT AGTGGT GTTGCAGTT AAAGTT AAAGCT AATGGCGGAATT AAAGT AGAT

GCTAATGGTGTTGCAATTGATCCTAATAATGTACTCCCCAAGGGAGTGATTGTAATG TTCTCTGGCAGTACTG

CACCAACTGGTTGGGCGTTATGTGATGGCAATAATGGTACACCAAATTTAATCGATC GATTTATTTTAGGTGG

GAAAGGG ACT GAT ATT AAT G GAGT GAGT ACT AAT AC AGCTT C AGGTACT AAAAAT AGT AAGTT ATT CG ATTT C A

GTTCT GAT GAAGCT ACATT AACT ATT GATGGT AAAACACTGGGG AGAGC ATTATCGTTAC AGCAAAT ACCT AA

TCATGCACACTTT AGTGGAAT AATT ATGGAT ACAG AGAAAGTT AATTATT ATGGAAGT AAAAAAATCACAACAA

ATGTGTGGGGTGTAACAACAGGAGATAATACTTCAGTACGATATATTTATAAGTCAT CAGGTGTACTTGACTC

TAACAATAATGTCTCCAACAGTACCTTAGGCGGAAACAGTCTGCAGACGCACGATCA TGATATTAAGATAACG

GGCACAGGAAAACATTCTCACAAAAACAAAGTAACAGTCCCTTATTATATTCTGGCT TTCATCATAAAGCTTTA

AT AT AT AT G AAAAATT G AAAAT AT AAATT ATC C ATT AAT AAT AAAG AG GATATTAGCATGACTTCGGAGC C AAAT

CTGTTAAACCGGATTACAATTACTATTGAAGCTAATAATCAACAAGTAGCTAGAAAA GTATTGCATGGCTCCTT

GCTT AATCAAGCT AAT AT AAAT AAATT ATTT AATTCAT ACTTT AAT GAAT AT GAAATT AAT AGGGGTGTTT ATTT A

GAAACATTAATCCTGAATCTTGGTACGATAAATTTCCATGATTTTAATTCATTGTTT CCTACTCTCCTAAAAGCT

G C ATT GAAT AAAG AATT C AGT C AAT ATC AG AT AAAC AAC CAT AG G G AAG AAAT G CT ATTT AAT GAG AC AAT AT C

AAAT CAAGCT ACT GAT AAGT CTT AC AT ATTTGGCG AT AACAAATT AATT GAT G C AG AGAATTT C ATT C ACTTTTT AT AT C AAAAG CATTCCACATT AAAT CTAGTAGAAG C AAT G G G AAAT AAT G GT ATT G AAAAATT AAC AAAT C AG T

TAACACAAATAGAAAATAAATTTGCGTTATTATTGGCAAAAAGTTGTTTGTCTGAGG AAGGCTTAAAACGACTC

TTG G CT AT C AAAC AAC C C G ATTT ATT AAT C G CT AT C AAT CGCAGATTATCT G AAAG AAT AAAT AG AC C AC AAT A

TCAGGAGAAGCTTGTTTCCTGCGGACAACTGATATTTAGTGCTCTGGGATATATACA ACAGTACAATATACAG

GAAATTCCT AAACCGGAT GAAAAAGTTATTGC ACGCAT AACAACT GAACTT AAT AAT AATGGTTTGCTT AAT AC

AATACCTATTATTACACTATTTCGTCAGAGTGGGATTAACGATTCATCACTAAATGA TTGGCTAAAGAAAATCT

GGCAGGTGAGATCAATTTCACAGTTATGCAGAAAGTATCTTTCTGCTAAGGAATACC AATATCTGTCAGAACA

TTTTGTTTCAAAGAGCGTCGATAAAAATAGATATGATGAAGAGCCCGTAAATCAGAG CATATTATCAAGGTTG

AATAATAATTCCATTAAAGAAGGAAATAATCACAGTCAACTCTGTACTCTCAGTAGA CTATATTCTGAACCCGT

TGTATTACCTGAACAAACCATTCTACGTCAGGTTAGTAATACAGTAGATCAGAGCAT ATTATCAAGGTTGAATA

ATGCCTCCATTAAAGAAGGAAATAACCAAAGTCAACTTCGCACTCTCAGTAGACTAT ATTCTGAGCCCGTTGC

ATT ACCT GAACAAACCATTCCACGTCAGGTT AGT AAT ACAGGT AT ATT AATTCT ATGGCCAATGCT ACCT ACAC

TATTTAACCAGCTTGGTCTACTTGAGAAAAAGAAATTTATCCATCGTCAGGCCCAGT TTAATGCCGTTGATTTT

CTT GATT ACCTG ATTTGGGGAACCGAAGAT GT GAAAGTGGAACGAAAGGTTTT GAAT AAT GTTCT AT GTGGGT

TAATGGCTGATGAAATTACTGAACCAATGCCTATTGAACCAGAAAAACAATGGATAA TAATTCAATGGCTGGA

CGCTATTATCTCCCAACTTTCTGGCTGGAAAAAGTTAAGTCGTAATGACGTCCGTCA ATTATTTCTACAACGAC

C AGG AGAATT ACT GAT CAAT GAAC AGGAAATT AAAAT CAC AAT AC AGCAACAACC ATTT G ATG CTCTGTT AACT

GATTGGCCGTGGCCAATGAATATGGCTTGTTTTAGCTGGTTGAGTCAACCATTAACC ATTACGTGGTTATAAC

C ATTG AC CAC AAT GACTTAGTCTGAGT AAAAAAT AT GAAT AT ATC G C C TG TTTTTT ATG ATTC ATT GAAT C AG G

ATAACGACCGTGATCTATCGTTTTTATTTAGCGAACTGGAACGAATAGATCTCGCTC TTCAACACCATTTTTAT

TGTGTAGAAAGTCAGCGAAGTGAGCTCCTGGATGAGTTTCTGCTCACTGAGGCGGAA GTGGTGACCAGGCT

GGATAAGCCACTTGGTAAACCTCATTGGATAAATGATGATTATCTGGCGATATCGCA AAAGGGCAATGTAAGC

CT AATGGCAGCGTCCAGATT AATGGATCT GATCGAACGCTTT GAACT GACT GATTTT GAGCGCGAT GTTTT AC

TATTAGGCTTATTGCCCCATTTTGATAGCCGCTATTATCGACTGTTTTCGCTGATTC AAGGGGGACAACAGGG

TCGATTACCTTCTTTTGCGCTGGCATTGGAACTGTTTTGCCACTCGGCGCTGGAGAA ACAGGTACAGCAAGC

GAGTTTTCTGCACCGGGCACCTTTGATGGGTTGCCAGCTATTATCCATCGATACTAG TCAAAAAACGCTGGC

CTGGCTCCAGACTCCCTTTATTACTGACAGCGGGGTATATCACTTTTTACTGGGGCA TCACTACATTATGCCG

GCTTT AGAACATT GTGCT GAGTGGTT AACACCGACAGGGATTGGCT GTT ATCCT GAAGG ATT AAAACAAGTAC

TGGGTAACGTATTGTTATCTGACAACGATAATATTAGACCGATTGTCTTATTACGGG GAATGGCCGGCAGTGC

CAGAGCTT AT ACCATT ACT AAT AT GATGGCTTCAGAAGGGAAGCAAACACTGCTGGT AGAT AT ATCCAAACTT

GCTGATAGCGATGAAAAAAACATTATTCTTCAGATAAAGCATATTTTGCGGGAAACC CGCATGCATGGAGCAT

GTTT ATT ATT ACG GAATTTTT G CTTGTT AGT G GAAC AGAAT AAACAACT ATTGG ACTCCCT GT C AG AGTT ATT G

AATCAACCTGAATTAAGAATTGTTTGCCTGATTGAGCCTTATTCCCCATTGGTATGG CTGAAAAAGATACCGG

TATTACTGATTGAGATGCCACTTTTAACGCCTGCGGAAAAAGCCAGATTGTTAATTG CCAGCTTACCGGATAA

TT GTTCCGAGGAT ATT GAT ACGAT AACTTT AAGCC AGCGTT ACACTTTTAACCCAGAAACCCTGCCATT G ATTT

TGCAAGAGGCCCAGCTTTATCAACAGCAGCGAGATCCGCTGGATATCTTGCAGCAAT GCGATATACGCCAGG

CATTAAATTTGCGTGCTCAACAAAATTTCGGTCAATTGGCACAGCGGATTATTCCTA AGCGCTCATTAAAGGA

TTTATTGGTATCCGATGAGATTGCTCAGCAGTTACGGGAAATACTCATAGCAATTAA GTATCGGGAACAGGTT

CTGGCGGGAGGGTTTAAAGATAAAATTGCCTATGGCACTGGTATCAGCGCCCTGTTT TATGGTGATTCAGGC

ACTGGAAAAACCATGGCAGCAGAAGTGATTGCTGACCACATTGGCGTTGACTTAATA AAAGTGGATTTATCTA

CAGT AGT GAAT AAAT ACATCGGT GAAACAGAAAAAAACTT ATCCCGT ATTTTCGATTTGGCGGAACAGGATGC

AGGGGT ATT ATTCTTT GAT GAAGCT G ACGCACT GTTTGGT AAACGCAGT GAAACT AAAGATTCCCAGGACAG A

CATGCCAATATTGAAGTTTCTTACTTATTACAGCGCCTGGAGAATTACCCGGGTCTG GTCATTTTATCCACCA

ATAATCGTGGTCATTTAGACAGTGCTTTTAATCGTCGTTTTACTTTCATTACCCGTT TTACTTACCCGGATGAA

AAAAT C C GT AAAAAAAT GTG G C AG G AAATTT G G C CT AG AAAT AT AAAAAT ATC G G AAG AT ATC G ATTTTAAC G A

ATTAGCTCAACGAACAAGCGTGACTGGCGCGAATATCCGCAATATTGCTTTATTGTC TTCATTCTTTGCTTCA

GAGCAGGGGAATGATGAAGTCAGTAATGAAAATATTGAAATTGCATTGAAGCGTGAA TTAGCTAAAGTCGGA

CG ATT AAC ATTTT AAAAGTT AT C ACAAT GAAAGT ATT GAAAT ATT AAAT AAATTT ATT ACCAAAAAGTT AT C ACG

AT AT AATTT AAG AGAGGTTTTTT AT GTT AAACACGCAAACT ATT ATTG AT GTCAAT AAGGCAATGGATGCC AT G

CTGCGCGCATATCTGAATCAAGATATTGCCATTCGTTTTGATCTACCTGAATTGGAT ACTATGCAATCTGATGC

GATGGTAAGTATCTTTCTTTATGACATTCATGAAGATTTACAGCTTCGCTCGGCAGA ATCAAGAGGGTTTGAT

GTTTATGCCGGGAGGTTATTGCCTGGTTGGGTAAATATTAAATGTAACTATCTGATT ACCTATTGGGAAGCTT

CT AAGCCAGCG ACT GATGCCAGCAGTCCGGAT AGCCAACCT GAT AACCAGGCAAT ACAAGT GAT GTCACAAG

TATTAAATGCCTTGATTAATAATCGTCAATTGGCAGGTATTCCTGGTGCTTATACTC AGGTTGTACCGCCTAAA

GAGAGTTTAAATAGCCTGGGGAATTTCTGGCAATCACTGGGTAATCGCCCACGGCTT TCTCTCAATTATTCAG

T GACAGT ACCT GTT AGCCT AAACGATGGTCAGGAT AGCGCG ACTCCGGTT ACCGCGGTTTCTT CT ACAGTGG

AACAAACGGCATCGCTCAGTCAAGAAGTGGTTAGTCATGCTTTACGCGAATTACTCA TTACGGAATTAGGAG

GAGGAG AGG AT AACCGGTTGGT ACT G AGTAAAGTT GAATT ATCCGCAGT GAAAG AG ACGAT GACTCAAGACA

GTCCGGCTCAGATGATTATATTGTTGTCTGTTTCAGGCATTACACGACAGGAATATT TGAAGGAAATTGATAAT

ATCTTT GATCGTTGGGT AAAT AATGCTGAAGTT ATT ACCACT ATT GAT GATT GTGGGATT AGAATT GAAAGT AT

AACGAAAGAT AATCTT GT AGGAATTT AA SEQ ID NO: 94 ( Photorhabdus asvmbiotica strain ATCC43949 PVCIopT operon, pyd -

DVC16\ e.q. corresponding to genes PAU 02112 to PAU 02099 of the sequence of

GenBank accession no. FM 162591.1)

ATGGCCACAACCACAGTTGACTATCCAATACCGGCTTATCGATTTGTTGTCTCCGTTGGT GATGAACAAATCC

CTTTTAACAGCGTTTCGGGGCTGGATATTACTTATGATGTCATCGAGTATAAAGATG GCACCGGTAATTATTAT

AAAATGCCGGGTCAACGTCAGTTAATCAATATTACACTGCGTAAAGGGGTATTCCCT GGCGACACTAAACTTT

TT GATTGGCTT AATTCCATTCAGCTT AATCAGGTT GAGAAAAAAG AT GTTTCAATT AGCTT GACCAACGAAGTT

GGAACTGAAATTTTAATGACCTGGAGCGTAGCCAATGCATTCCCAACCTCATTAACA TCTCCTTCTTTTGATG

CCACCAGCAATGATATCGCTGTTCAAGAAATAAAACTGACTGCCGATCGAGTCACTA TTCAGGCAGCTTAAAG

CATCACGATGATTGATATATCAGACGGGACAAAATGATCCTCAAAATTTGGCACAAC GGCTACCCGTCCAACT

AAATTTACCCTCTTACAGTTCACGCAAAATATCGCACAATACAATTGGAGGCAATAT GCCAACAACAACTTATC

CCGGCGTTTATATTGAAGAAGACGCCTCACTGTCACTTTCCGTTCGCTCAAGTGCAA CGGCGGTGCCCGTTT

TTACCGTTGAAGATGACAGTCAACTTCATACTCCTACCAGAGTGAATAGTTGGTTAG AATATCTGACAAAAAAA

G C AG AT AAAAAATT C AATT CT AC C G AC AAACTT GATATCGCATTGCGCGCTTATTTTATTAACGGCGGCGGAT

ATGGTTATCTCGTCAAAGCGGGTGAATTAACAAATCAAATTCCAAAACTTAACGATG TCACATTACTGGTCGC

GGCTGGAGAAAATATCAAAGATGCTGTGAGTACACTTTGTCAACCGGGCAAAGGCTT ATTTGCCATTCTGGAT

GGCCCAACCGAAGAGTTAAAGTCTGATGGCAAATCCAGAGATCCGTATGATCAAAGC CCTTTTGCCGCCGTT

TATTACCCCTGGCTAGTTGCTGATTGGGCAGACAATATTCCGCCAAGCGCGGCCATT GCCGGTATCTATTGT

TCAGTTGACCGTACCCGCGGTGTCTGGAAAGCCCCAGCAAATGTCATATTACAAGGC GGGGTGAAACCGAA

GTTT AAAGTCACCGAT GACTT ACAAGGT ATTT ACAACACCGGT AAAGCCATCAAT AT GATCCGT GAATTTCCG

AATACCGGTGTCACCATCTGGGGCGCCCGCACACTTAAGGACGAAGATAACTGGCGT TACATCCCAGTTCG

CCGCCT GTTT AACAGTGCAG AGCGAGACATT AAAAATGCCAT GAGTTTCGCGGTCTTT GAACCT AACAGCCA

ACCCACCTGGAAAGCTGTACACCGAGCTATTGATAATTATCTCTATGCCCTTTGGCA ACAAGGAGGGCTAGC

AGGAAACAAAGCT GAACAAGCTT ACTTT GTGCAAATTGGT AAAGGGAT AACCAT GACCGAT G ATG AT ATCAAG

CAAGGGAAAATGATTGTTAAAGTGGGTATGGCCGCAGTGCGCCCGGCTGAATTTATC ATCCTTCAATTTTCAC

AAAATGTAGCACAGTAACCGTACTGAGGCGCGGTTTAACACCGCGTCCATTCAGTCT ATTGAATGGAGGAGA

C AAT AAT GATAACGGAGAT AAAAC AG CCGGGCGTCACCATCACG G AAAATT C G AT ATC C C C G AAATC AG AT A

ATGAATTTATCGGCGTCCCCGTTTTTATTGGCCATACCGAAAAAAATTCAAGCCATA AAACGGCTGTTAAACTA

AAT AGCCTGATGGACTTTACCCAAGCTTTCGGTGCATCAGGATTAACCTATTATTCAGTACGC CACTTTTTTGA

AAATGGTGGACAGCAAGCTTATATCTTGTCACTGGGGATTAATCAACAGCTAAAAGA TTTTCAATCATTGATTA

CCGCCCTGCAATGGAACTGGGTAAAACAAGCCATTGCCGCAGAAAACGAAATCACAT TGATTGTTGTGCCTG

ATATTACCCGTTTTAATGATCTCAGCGCTCAAAAAAGCCTTTGGCTACAACTCTGGC AATCAATACTTGAACTG

T GT AAAAGTCGGCGTGGCATCATGGG ATT ACTGG ACGCGCCT GAT GATCCAACATT AGCAACT GAGT GTTT A

AAACAATTCTCTTCCACTGATCGCCAATGGGGCGCCGTATACTGGCCAAGGCTAAAA AGTACCTACCAAGAA

AACGGTACATACATTGTACTTTCACCTACTGCTGCGGTCGCCGCCGTTATGCAACGC AATGACAGTCAGAAA

GGCATATGGACTGCTCCCGCCAATGTGGCTTTAGCCAACGTCATCGGTCCGGTACGT TCTTACATTGAAGCT

GGAACCTTGCTGAATCAAGAAGGCACTTCGTTGAATCTGGTGCGTAGCTTCCCCGGC AAAGGCATTAAAATC

TGGGGCTGCCGCACTCTGGATAACATACCTCATTCTCCCTGGCGTTATATCCAAATT CGCCGTTTGGTTTCCT

ATATCGAAGCTCATATAACCCAACTTGGCCGCGCCTTTGTCTTTGAACCCAACAACG CCATCACCTGGATGAA

ATTTAAAGGTCAGGCCCACAACTGGCTACGTCAATTATGGCTAAAAGGTGGATTACG GGGCACTCAGGAAGA

TCAAGCATTTGAGGTGTTACTGGGTGTTAATGAATCCATGAGTGAAACGGATATCTT GGCCGGAAAAATGATC

ATGAAAATCAGGCTGGCGCTGTTAATTCCGGCAGAATTTATTGAGCTGAGTCTGACG TTTGATATCCGTAACA

ATACCGTACCTAGCTAATCTAAACAGGGGAAAAACATGTACAACTTATACACCCCGT CAGTATCTCACCGTTT

TATCGCCAGTTTTCTGTTTAACAACATTCCCAGCCCACTTGATATCGCCTTTCAGCG TATATCTGGCCTGAGC

CGAGAACTGCAAACCACCCAACAT AGCCAAGGTGGAGAAAACGCCAGAAACGTCTGGTT ATCCGAGAAG AT

CCAACATGGCAGCCTGGTGCTGGAGCGCGGTGTTATGACCATCACTCCCCTCACCTT GGTTTTTGATCGCGT

GCTGCGCGGTGAAAAAGCCGTGTATGCCGATGTTGTCATCATGCTACTGAATGAAAA TGCGTTACCCGTGGC

GAGCTGGACAGTCAGTAACGCGCTACCGGTTCGTTGGTCCACCAGCGACTTTGATGC TAATAGCAACACCGT

ACTGGTGAGTTCTCTGGAATTACGTTATCAGGATATGCGCTGGTTAGGAGTAAAAGC ATGACGGTAGAAATTA

AAGAACTGATTATTCAGGCTAAAGTCACCGATTCTACGAGTGATCAACTCGCCCCAA GAACATTAGCCCAAGA

AAAGCTGGAT AACGCCCGTTT GATT GACAT AGT GAAACGGGAAGT GTT AGAGGCATT ACGTGAAGGAGGCCA

TCAT GAGTTT AATT GAACGTGGTTT ATCCAG ACTCACCCT AACCGCTTTT AAAGACCGAGAAGGT AAAGTTTC

CGTGGGTCGCTTACAAGCCATGTATAACCCCGATACGATCCAGCTTGACTACCAAAC CCGCTACCAACAGGA

TGAAAGTGTTAATCGTGCCAGCCAAAGCAGCCGTTATGTATTATCCCAACCCGCCGG ATTATCCTTAGTTCTG

CTGTTTGATGCCTCGATGCCCGATAATAACATGCCGATAGAAACCCAGCTTGCGACC CTGAAATCCCTGTGT

GCGATTGATGCCAGCACCAAAGTACCCCACTTCCTTAAAATCAAATGGGGCAAAATG CGCTGGGAAAACAAA

GGTTATTTCGCCTGCCGAGCCAGTAGCCTGGCCGTCAACTATACCCTGTTTGACCGG GATGCCACACCATTG

CGGGCCAGCGCCACTCTATCTCTGGTAGCGGACGAAAGCTTTATTATTCAAGCTACC GAACGGCAGTTAAAA

TCACCGCCGGCCACTGCGGTTAGCGTAACTGATATGCTCTCCCTGCCTTTGATTGCT TTAGATGCTGGAGCG

TCTCTGGCTGGTGGCATTGATTATCTCTCGCTGGCCTGGCAAAACGGTCTGGATAAT CTTGATGACTTTACCC

CCGGACAAACACTGCAAGCGCGGGGGGATGCATGAAGATACCCATGATAACCCTCAA AATAGGTGGCAAAA

CGCTCAATCAATTGACTGTCATCAGTCTGACAATAAACCATCAAATCAATGGCATTC CCTCGACCAACATCAC

CTTGGGGATCGCTGGCGATGCGAGCCATATTTTCGACACCAAAGCCCAAGCTGAACT GGCAAGTTGTCGCC

CCAAT AAT GAACTCACCCT ACAG ATCCAAAAAACCGTGGT GTTT AAAGGG AGCATCGTTCGACAAGCACTT GA ACTGAAAGGTCAAGACAGCATCATTACCCTGACAGCAAAACATCCACTACAAAAGTTAAC TCATAGCCTCCAT

TCACAATTATTCAGTCAACAGAGTGATGAAGCGATTATCAGGAAATTATTCAATCAG GCGGGTATCCAAACAA

CGATAAAGCAGGCTCCTCAACTTAAAACCGTTCATGAACAAATGGTGCAATTTCGTT GCAATGACTGGGCATT

CCTAAAAAGCCGATTGATTGCCACTAATACCTGGCTGTTGCCCGGCAATGAATCGGT TACTTTGATAACACCT

AAGGCCCTGAATCAATCGACAGTGCATACTCTTCATCGACAGGCCAGTGCTGAAGAT ATTGTGTTATTTGCAG

CGGATCTCCAATGGAATAACCAATATAGCCCTAAAACGGTGAGTGTACGTGCCTGGG ATATTGCTCAACAAA

AGCTTTCCCCAGCAATT AAT ACCCAAAAC AGTCAGCTTGGCAGTC AT AAATTGGCCGTGGACAGT ATCGCCG

CACTGGCTGATAAAGAGTGGCAATGGGCTTACAGCTATCCATTAGATAATGAACAAG CCAAACACCTTGCTCA

AGGCATTATGAATAACCTGCGAAGCCATAATATATCTGGCAGTTTTGAAATCGAAGG TAATCACCGTTATCAA

CCGGGGGATGTCTTGGCGTTAAATGGTTTTGGTCAGGGGATGGACGGTCAAGGGATT ATCACCGGAGTCAG

TCAGATAATTAATCAGCGGCAAGGCTGGCACACCCTATTAACCTTAGGCATGTTACC CGATGTAGAACCGCC

GGTGCCTCAGGTGAAAGAGTTGCATATCGGTATCGTGGAAAAATACCAGCAAGACCG CCAATCACTAAGCCG

TATCCCAGTCAGAATACCCGCATTAAACTTGACCAAAGGTGTCCTTTTTGCCCGGCT AGGTAAACCTTATGCC

AGTCATGAAAGCGGATTTTGCTTTTATCCCGAACCGGGAGATGAAGTGATTATCGGA TTCTTTGAATGTGATC

CTCGTTTTCCAGT GAT ATT AGGTTCCATGCAT AATCCGAAAAAT AAACC ACCGTT AGAACCCAGT GAAAAAAAT

CCGGTGAAAACTTTAGTTATCAAGCAAGGGGATAAACAACAAGCATTAATATTCGAT AATAAAGAAAACACGG

TGGCACTTAATAGCGGCGAAAATAAAGTCTCTCTGCAACAGGATAAAAACATTACGC TCAATTCAACTAAAAA

TCTCATCACTCAGGCCCAAGAAATTAATATACAAGCGGAAAAATCTCTGTCAGCCAC AGGAAAATCTGGCGTC

GAT ATT AAG G G C G C G AAAATT AACTTAACCCAGT AAT G AG GT ATT G AAAT G AC AAG C C AAAT ATT AG C C AAT A

TTTACGGTTGCGGCTGGAAATTTCCGCCACAGTTTTCTATTGAAACTGGCGTAGAAA TGGCCGAAGGTGCCG

AAAACGTTCGCCAAAGTATGAAAATCCTTTTTTTAACTGAACCCGGTGAACGAATTA TGCGTGAAGATTATGG

TTGTGGTCTGAATGATTACATGTTTGAAAATATCAGTGATGAATTATTATCGGAGAT TCAAACCCGCATTGAAG

AACGAGTATTGCGCTATGAACCCCGTGCTGAAATCACAGATATCCAAGTAACTCAGA AAACAGACTCACCGA

ATACTTTACATATTCAAGTGACCTATGCCCTGAGAGGCAGCCAAATCAGTCAACAGC TTGAAGGGGTTCTTGA

GATCAACGAAGGTCAGGCAAAGGTGAGTCTATGAGCAAACAACTCATTATTGATGGC GACAGCCTGCTATTC

GAGCCATTATTCGGCAACCGGCAGGTCACTATTTTGATGCCAGCGACCATCAGAGGC AGCGGACACGCGCA

AATCCAAGGCAGAAAGATAGCGATTGTCGGCGATGAAAAAAAGGTACAACTTCAAGC GCAATACATTACCCC

AAGCCACCCGGTACCTGGCATAGGCACAGTTACCATTGCTCAATTAGATACCAGCCA GCAAGTCAACTTTTG

CCACAGCCCTGCCACAGTGATAGTTGTCGGGCAGCAATTTACCGCTCGATTTACCCC ATCACAGCCGGCAAT

T AAT CCGTCAACCGGGCCAGATGTCACAACACCCAGTATGGG C AAAG GCCGTTTTATTGCCAGTCAACATAC

TATCAACGCCGGATAAATAACTCTGCAAAATCATTATTCAATAACGTTCCTATTCTG CAATAGCTATCAGCAAT

ATATTCAAATAACAGGTGGTATAATATGGGACTCACCGAATTAAAAAATAAACTCTC TGCTATCGTACTCGATA

CGGATTTT AAACTT GAT GAAAGAAGT ACACT GGAT ATTTT AAACTGGCT ACAAGAAT ATGCT AAAAAAATCCCT

TTCAATCAAGAGAAAAAACAGTTCTGGGATAGTTTCTATTTTATTCAGGAAAATAGT CCTGAGAAATTAGCCGA

TCTTTACCAAAACGTTAATAAAACGAATGGCCATTTACCGGCCCATCAAGCTTTTGT TTTAGCCTTTTTAAAAC

TTTT AGAAACCACCAAAGT ATT ATTT AAT ACTTTTCCGGCACGACATCGT GATCTTT ATT ACCGGGAATT ATT A

GGTCTAAAACCCAGAAATGCCCAAGCAGATAGTGTTGCTTTAGGCATTACCTTAAAT ACAGATAACACAGAAC

ATCTTATTCCTAAAGGAACCTTGTTCGATGCCGGGCAGGACAGGGCCGGAAATCCGC TACAATACGCATCAA

ATGCAGATTTACTGGCGAATCAAGGAAAATTGAGCGATCTGCGTTGGTGTCGAAAAG ATAATGATAGCTGGC

AATCTGCAAT ACT ACT GAACCACTCAGAT AAT ATT GAATT ACCT GAAAACAGT ATTCGACTTTTT AGTCCAACG

CCGGATGATATTCCCGTTTTATCCGGTTATTTGATAACTTCGTCTTTATTTGCTATG CCAACGGGGGAACGCA

GT ATT ACATTG ACTTT AGCAGAT AATTGGCATGGT GAT ATT AAGCACATCACCGCT AAAATC AGTTCGGGAG A

T C ACTGGCTTT C ACT AT C AGT AAAAAAAGAACAAG ACAAT AGT ATT C ACT AT CTT AAACTTT ATTT AT CAACCAA

TGATGACCCCATCGGTCCTCCTGATGCTTTGGATAATATAGCGTTTGATGTACCGGT ATTAAAGCTGGGCACT

GTTCAGGGACCT AT ACT ACCCAAGATT ACGGGT ATT GAAATT AGCATT AACGGCAACAGT AAT GT ACATT ATT

CCTCTGATAACGGTATTGAAAAAATAGATGCAGCTAGTTTTCCCTTTGGACAATCAC CGTCACCAGGTTCCGG

TTTTAATCTGATTGCCCCTGAATGGTATGGTACAGAAAGCGCCAAAATTACTCTTAC TCCTCAATGGACTGGA

TTACCCAAAGAGGGGTTTAAAGAGTGGTATCAAGGATATAGTTCTACCCCCGAAAAT AATGCATTTAAAGTAC

AGGCTT ATTT AAT C AC ACCT CAAAAG AG AGAAAAATTT AAT GAAGCT C AGT C ATT ATTT AAT G AAAGT AAAG AC

AAGAAACCACAAGGAAAAAGCCT AACTTTT ACCTT ACCTGCAATGG ATT ATTCCTTTGCAAACAGCCCATC AT

CT AAT AACTGGCCCGCATCAATACGCATAGAACTAACC GAAC AG G ATTTT ATG C ATG C C C AAT ATT G G C AAAA

TCCTACGGGTAAAAAACAGCCCTATACCCCCAAAATGAACACATTACAAATTCAGTT CAGTGCCAAAGTTAAA

CCCGAACAATTTTCCGTTTATTCTCTCACGCCTTTTGGTTGGGGAAAAACAGGAGAA AATAGAACATCATTAA

CCCAT GAT ACATTCT ATTT AGGTTTT ACCGAT GT ATT ACCAGGACAAACTTT ATCCCT GT ACTGGCAGTT AGAA

GGTATTAAAAAGCTCCCTTTATCCTGGTCTTATCTGAATCAAGAAAATACCTGGAGT CCATTGGATAATCAGGT

G C ATG AC C AAAC C C AC A AC CT ATTTGATCGAG G AAT CTGGCGTACCTCATTGCCACATGATGCTT C AAAC C AA

GCCTCTCAAATGCCAAAAGGACAAT ATTGGGT GAAGGCACAC ATTTT ACAAACGAATCAAGCAACCCT GACT

GATCTGTATTGGTATCGAAAAGATAATGATGTCTGGAAATCCGCAACACCTCTTAGC CTTTCAAATAACATGAA

ATTACCCGCAAACGGTATTCAGATTTTTAGCCCAACATCTCATGATGTTCCAGTTCG ATACGGCTACCTAATTA

CTTCATCTTTATTCTCATTCCTCAAGAAAGGACGCAATATCACATTAATTTTAGCAG GAGATAGCTGGGAGGG

TAATCCTGAAAACATCACCGCTAAAATCAGTTCAGGAAATCACTGGTTAACACTATC CGTCGAATATCTGAGT

AATACTAATAGTCTTAAGTTGCAATTATCAGATAATAATAATGATCCCATCAGCCCC CCTAATGCTCTGGATAA

TATGACGTTTGACACGCCATTGTTAAAACTAGAAGCCACTCAGGATTTCACTTTGCC CTGGATTTATAAGGTAT

GCGTTAATAGCAACAATATACTCTCTACCTCTGACAGCTCAGATGCAGCGATTACTC GTTTCCCCTTTGGCCA

ATCACCATCGTTGGGTTCCAGCTTTAGTCCGAAAATCGTTTTCCCGGAATGGTTTGA ATCTGAATACGCATCA

GACACCACGATCACGATTACCCCTCAATGGGTTAACCTGCCCACAGAAAACTTTTCA TCGTGGTATGACGGA TATATTAATAAACCTGCCGATAATAGCGTATTTAAAATAGAGGGTTATTTACTTACTCAT TATCAGGGAAAAATC

AAACT CACAGAAGCTGAGACAGGAAGC G AAAC C C AAG C ATT ATT C AAT G G AAAC AAT GCACCACAAG G AAAA

AGCCTGACTTTCACTTTACCTAATAGGTATAACTTCTATCCGCGCAACCATCAGTCA ATGAAGATAGAAATAAA

ACTCGTTAAACAAGACTTTATGCACACTCAACATAAGAGCAATCCCACAGGCAAAAA ACCACCCTATACCCCG

CAAATCAGTGCCTTACAGGTGGAATTCAATGCTACAGCTTTCCATCGAAAATTCTCC GTTTATCCTCTCACGC

CTTTTGGCTGGGGCAAAACAGGAGAAAAT AGCACACCATT AATTCAT GAT ACATTTT ATTT AGGCTT GACCG A

TATATCACCAGAGCAAACTTTTTCTCTGTATTGGCAGCTAAAGGGCCTTAAAGAGCT ACCTTTGTCTTGGTTTT

ATCTAAGTGAAGAAAATAGCTGGAAATCATTAAATAGATCAACTTACAACCAAACCC ACAACCTGTTTGAATCA

G C AG AAC AAAG TATC CT ATT ACCACGGGATGCTT C AAAC CAAGCCTCT C AAAT GCCATTAGGACGGT ATT G G

CT GAAAGCACAGAT AGAACAGGAGAAAAAACAG AT AAAGAT AGCGCTTCCT GATT ATT ATCCAAGAATCAGG

GGGCTGTTGTATAACGCTACCATCGCCACTTTAATCAACGCTGAAGCTGTTGAGCAA TCTCACCTTATCAACG

GATTGGCTGCTAACAACATTAAACAACCGGTTAACTCATCCGTTGCCATCAACGAAG TTATTCAACCCTGGAC

ATCCTGGAACGGTCGCCCAAAAGAAACCGAGTCAGCATTCCTGGCACGAGTTCCTGC CCGGCTCTCTCATC

GTAACCGAGTGCTAAGCTGGGGTAACATTGCCACTTTATTAAAAGAGAATTTTAGTA GCTTATTCGATGTCAA

ATACCCTTCTGTCAGTGAATTAACCAAAATTCCAGCGCCAGAAAAGCGACAATTAAC CATCATCCCCGACAAC

CGCTAT AAAGAT AATGATGATTCACTACGCCCAGTATTGAACCAAGCCAGACTGACCGAGATGGTCGAATGG

TT AGATCGATT AAGT AGCCCTTGGACAACT ATT GAAATT AAAAATCCC ACAT AT GTT AACGTTCTG ATCCACT A

TGAACTGATATTTACCTCGGATGTTAACCCCGATTATGGCCTCCATCAGCTACAACA AGAACTCAGTCGAAAA

TATATGCCGTGGGGAGAAAATGCAGCTATTGGCGTAACACCCGGTAATCGTATTGAC TACTTCCAGTTATTAG

CCTCAATTCAACAATCACCGCTGGTTGAACGGGTCACCAACTTAACGTTAAAAAAAG GCAGCCAGCCTACCG

TAAGTGAAAGTATAGAAGCCGCCGATGATGAAGTACTGATTTTAGTCTGGTCATAAA AACTTCCCCAACCTAA

GGAATTAACAAATGAATAATCGAGATATGCTATTTCCTATCATTAAAGACGATATTA CCTTTGATTCTTTATTCG

C C C AG G C AAAAG C C GTT ATT G AAC AAC AAT CGGGGCAGCTCTG G AAT AAT AC AG G T G AAAAT GATCCCGGCA

TTACTTTATTAGAAGCCTGTTGTTATGGCGCATCCGATCTGGCCTATCGCCACACAT TGCCACTGCGAGATTT

GCTTACTCCTCAAGAAAATGAACGAATAGATGATGGCATTTTTCCCAAAGAATTTGG TCCACAACAAATACTG

ACCTGCGGCCCAATTACCGCGGAAGATT ACCGTCGAGCTTT GTT AGATTTGCGT AGT GAT AAC ACCGTT GAA

GGTTATTTTTTCTTTAATGATGCACAGCTCATTCGTGAACCGGAAAATCAACGCTAT TCATATTGGTATAACAA

AGAAAAACGCGAATAC AGTTTT ACTCAAG ACCAAT ACAGCGAACAATT ACAGTT AACACT GAG AGGAAACT AT

TGGCTCTATTTACTTCCCAGTCGGAAAACCCAGCTCGATAACACCCTGGCTGAAGAA AGACTCAACATTTTTC

T GAAAGAT AACCGAAACTT AGGAGAATCGGTCAGT AAAATT ATTTGGCT AGAACCCATT AAACT GTCATT GAA

AATTGATATTCAGCTTGATGATGACGCCAAAGATATTGCTGATATATTTGCTAAAGT TTATATGATTGCAGAAC

AAATGGTGCTT GAAAAACCATT ACGTT AT ACCACTCAAGCGAT GAAAGAACTGGGTT ACAGTCAGGAACAAAT

ATTTGAAGGCCCTTATTTACACCACGGTTGGATACCGAAATTACCTCAAACCAAAGA TTATACTCACCCTACC

GTATTAAATCTCAGTCCTTTAATTAATCAGTTACTGGCTATCAAAGGGGTGAAACAT ATTACCCAATTTACATT

GGATAAGCCTGATAAAAAAATTTCTAAGTTACCAAATGATAATTGGTCTTGGGAAAT CGCTCCGGGATATTAC

CCAAAACTATGGGGAGATACTCCATTAGAATTAATTACCTCACCAACAAGCCCACTC ACCATCACGGCAAAAG

GGGGAATT AAAATTGCT ATT ACT AAACAACAG ATAGAAAAAAACAT AAT GACAGAACCACT AATT AAT ACACAG

CCAGAATTATTGAACTGGGGTAAACATCGCAAAGTCCTGGATTACTATCCGATAAGC AATAAATTACCCGCTT

GCTATGGATTACAAACTAATACCCAACAACAGCTACAGTTGCATCAATTTATGCTGC CTTTTGAACAAATGCTA

GCGAAT AACTGCGCT GAACTTGCTTT ATTGCCAAGACT ATT AGCTTTT AAACAACGAGGAAAT ACGGT ACAT G

GCATTCAATGGCCTTTTAAAGAAAATACGGTTGGTCAACATGTTCATAAGGACATAG TATCTAATTTAAACAAT

AATGCT ACGAAAATCGAT AAT AATGCCGAT GACT ACGACAAGGAACTCGTT ATTCT AG ATT ATTT GTT AAGAT A

TTTTGGGGCTCAATGTGCAATCCCACGACTATCACCAGACCCACCACAATCATCATT AACAGAACCTCAGACT

AAAAAAGATTTTCTATCTACTCAGCGCGAATATCTGGCTCAACAGCCAAAACTGACT TATCAGCGTAACAATAT

TCGGATTGATAAAGTATCAGCACTGCAAAAACGTATCGCTGCCCGATTAGGTCTGGG AGGAGAATGTTTCAA

AGCAGAGCCTGACTTAGCTCACCTTCCTTTCTACCTCATTGAACATCGTAGGCTCTT ACCAGTAAAACCTGAT

AT AAAATT CTAT ATT GAG CAACAACCT AATT CT CTGGAAATT GAAAAT GAT AAATT AAAAAT C AC AC AGAAAG AT

TCAGCGGGTCGGTTACTGCAAGGTCAAGTTATTAACCTGGAATTTCGTGAGGGCTAT GATGAATTTACATTGC

T AAACTT AAT GAT AACT GAAGT G ACAAG AG AT AC ATTC ACC ATT AG C ATT AAT AAT AGC CGT GAT CT C AG AG AC

AATCTGGACAAAGTGCAACACGCGTTTGAACAAACGAATAATCTGAGCTGGCACAAT AGCTTAATATGGATGG

AAG AT ATG GATT AT C AATT GGTTTATGC C AAT GGAGAACAACTG G AAAAAG C G GAAAAT G AAC GAT G G ATT AC

CATTAACAATCAAAGTGCTTTCCCTGCTATGATCGGAGAGAATGATGAAATCACACT AAAAATTCAATCCGATT

AT GAACTT AAAACCAAAGTCGTGCGGCTT GATT AT AACAACAAAAAAATTCT GATT AT AAAAGATGCGACATCA

ATAAATAATTTTCCGCCAAAAAGAGAAGCATCATATTATTCTTGCTCTTCTCTAAAA GACAATGGGTACGGATA

TTCGG AT GAAT AT AAAT AT GAACTT ACTT AT ATT GAT AC AG ATT CT ACAAAAGAAAAT G AGT G CTGG ATT ACT AT

CAGCGATCCAAATAATTTGTTTTCTCCTGATATCATCGCAGAGAATGACGAAATTAT ATTGAAAGCTAACCCTA

ATT AT GAGTTTAAAACGCACGT AGTAAAATTT GATCGT ATT AAT AGACAAAT ATT ACTT AGGAAAAAT ACAG AC

CTGGAAAATAATTTTCCATCAGAAAACAACACATCGCACTATCGCTGGCATTTCTCT GGTGAAAAATATGCCC

AAACTGACCATTTTTCATTTGTTGTCAGTGCAGTACTGAATCGAGAATTAATTGAGA GGGGCACAGTCGATCT

CTAT AAATT AGAGTCTTGGGTAAAAACTGAGATTTTATCTGAATTACCCGCGCATATCTCACTCGTTAT TCATT

GGCTATCATCGGAAGAATTCGAAAAATTTGCCAGTACTTATAAAGTTTGGCAAAATA ATGGCGCTCCTTTAGG

TGATCACGCATATAAAATTCTAGAAACATTAACACTTGGGAAAAAACCTTCTACTTC AGCAAGAAGGTCCAGC

AGCTATATAGAAGCACAGTAAT AATT CTTACAGAACATT AAC C CAT ATTT ATCTTAT AAT AT C AAAC AT CAT AAA

AACAAT CTT C AGCT C ATT AT AAT G AC AT ATTT CAT ACT C AGGTTT CTT CAT ATCTGTT AATT ACAAAG AGAAT AT

TAATATGATCTCAGCACCAAATCTGTTAAATCGGATTATCATTACTATTGAAGCGAA TAACGCACAGGCAGCTA

AAAAAGTATTGCATGGCTCCCTGCTTAATCAATCCAGTATAAACAAACTCTTTGATT CATACTTTAACCAATAT GTTGTTAATCAGACTATCTACCTGAAGACACTCACCCTGAATCTTGGCGAAATACGATTA AATAGTTTTAATTC

AC AGTTT GTT ATTCGGCTT AAT ACT ATT CT G AGT CAAG CATT G AG CCAAT AT C AG GT AAAT AAT CAAACT GAT A

TT GAGAAATTT ATTT ATT ACTT AT ATCGAAAAG ATTCT AT ATT AAACCCAAT AG AGGAAATCAAT AATCGT GAAA

TTACTGACAT C AAT ATTAAG C AATT AATT AAC C AATT ACCCCAGAT AC AAAAC AATT GGACACTATTATTGGCA

AAAAGCT GTTT ATCCACACAT AGCCTGAAAAAACTCCTGGCT ATCAAAAAAACAGCTTT ATT AACCGCCATT AA

TCGTAAATTATCTGAAAAGATCAATATATCACCCTATCAGCAGGAATCGGTTTCCAC CTGGCAATTGATACTGA

ATGCGCTGAAATATATACAGCGACATAATACACAGGAAATACCTGAACCCGATGCGA AAGTCATATCACTCAT

TACAACGGAACT C AAT G AC AAT G C C ATT AAT AC AG C AC C AATT ATTGCATTATTTCGCCAAGTTATAACCAACC

ATTCCCCACT GAAT AAGTGGCTGGAACAACT GTGGCAAACAAAGCGAATTTCACAGTT AT GT AAAAAAC AGCT

GTCAATTGAAGAATACCAACATCTATCGGAGCGCTTTATTGCCAAACACGGGAATAA AAATAAATCTGATAAA

AAATCATCCATGACTTCCGAACCGCTGTTATTACCTGAACACCCTCCACCACGTCAG GTCAATAATGCTGGAA

TATTAGTTCTGTGGCCGATGTTACCTACTCTATTTAACCAATTCGGCCTGTTTGAAA AACAAAAATTTATTCATC

GTCAAGCTCAATTTAGGGCTGTTAATCTACTTGATTATCTCATTTGGGGAAACGAAG AAACACAGACAGAACG

AAAAATATTGAATTGCGTTCTGTGTGGGTTAATTGCCGATGAGGACACGGAATCAAT CCCTATTGAGCCAGAA

AAAC AAC AG GT AAT AG AAC AAT GGTTAGATGCAGTTATCAGT C A ACTT CCTGCCTG G AAAAAATT AAG C C G C A

ATGATAGCCGC C AATT GTTTTTACAACGCCCGGGG G AATT G CT G AC AAAT G AG C AG G AAAT C AAAATT AC G G

TACAACCTCAACCATTCGATGCACTGTTAAATAACTGGCCCTGGCCGTTAAATATCG CCAAACTTCCCTGGCT

GGATCGCCCTTTATTAATCAACTGGTAAAACATTGACAAGGTTTATATGAAAGAACA TCAATATAGAATAGTCG

ATCTACGCTGGATTTATTCCCATTTGGAGCGCATCGATCTGCTGTTACAACGTCACT ATTACCAAAAGAGAGA

CAAAT ACGATTCATTGCCAGAAAGTTTTTTGCTTGAAGAAGAT GAATT AGAACAACGTCT AGCAAAACCGTT G

GGTATTCCTCATTGGCTAACAGCAAATACCGGCGCTGGTGATACAGAAACAGAAAAT CATTCTGCTTCCGGC

ACATT ATCACTGCT AGTCACGCGTTTT AAACTCACT GAATTT GAACGT GAT GT GTT ATTGCT AGGTTTATT ACC

GCATTTTGACAACCGCTATCATGCGTTATTTGCTACTCTGCACGGTAACAGTAAAAA ACAGTGGCCCAGTTTT

GATTTAGCGATTGAATTATTTAGCCAACATCAAAGTAACTGGCAATTATTTCAACAC CACTTTTTACCGCAAGC

TC C ATT AAT C AAT C AC C ATTT ATT AC G ACT C AAT AACCAAGAGGAACCCATTTGGCT AC AAACT C AATTTTT AA

CTCACAATGCAGTCTGGTCTTTTTTATCCGGTCAGCGCGTCATTTTACCTCCCTTAA TATCCTGCGCTTACTG

GCATATTCCAACCTCACAGACTTGGTATCCACCAATCCTTGGTCATGCATTTGAAAA AATATTGCTGAATGAAA

CGGACGAAATACGCCCGCTGGTGGTTCTTAAAGGAAAACAGGACAGCGCCAGAGAAC TGGCAGTCAGTAAT

ATT ATGGGAATTCACGGCATT AAC ACTTT AACGTTCGATTT ATTTCACCTGCCAGAT GAAGAGTGCACCACCT

CAATACTCAATCTGCTAATAGATGCAATACGAGAAACCCGGCTACATAATGCCTGTT TATTAATCCGTAACTTT

TCTTTGCTGGCAGAGGAAAAGAGAATATCGCATAGAGAATTATCAGCTCTACTGAAT CAACCCAAATTACGTG

TGGTTTGTCTGGCAGAGTCAGAAGAATCATTAGCATGGGTTAAACACCTGCCGATAG TGCAAATTAATATGCC

ACCGGCGACGCTGGCAGATAAAAAAACGATGCTGGAAGCCAGTTTGCCAGATAATGT CACTAAAGGAATTAA

TATAACTCAATTATGTCAACGTTTTTCATTTACAGCAGAAACATTACCGTTAATTAT CAAGGAAGCTCATCAATA

CCAAATCCTCCGACAACCGGAAGATCAATTGAAAGAATCTGATCTACGTAAGGCATT AAATTGCCGCGCCCA

ACAAAATTTCGGT AAATT AGCCCAGCGT AT GACACCAAAACGAAGTTTT AAT GATTTGGTT ATTTCCGCT GACT

TAACTCAACAGTTGAAAGAAATCATCGCAGCAATTAATTACCGTGACCAAATTCTGG GCGCAGGTTTTCGGGA

AAAAATCAGCTATGGTACTGGTATTAGCGCCCTATTTTACGGTGAATCCGGGACGGG GAAAACCATGGCCGC

AGAAGTGATTGCCAGCTATCTTGGTGTTGATCTGATTAAGGTAGATCTTTCTACCGT GGTGAATAAATACATC

GGTGAAACCGAAAAAAATATCTCCCGTATTTTCGATCTGGCCGAAGCGGATTCCGGG GTGCTGTTTTTCGAT

GAAGCCG ATGCCTT ATTCGGT AAACGCAGT GAAACCAAAG ATGCCCAAGAT AGACATGCCAAT ATTGAAGTTT

CTT ATTT ATT ACAGCG ACT AGAAAATT ATCCGGGATT AGT GATTTT AGCGACT AACAATCGCAACCATTTGGAT

AGTGCGTTTAATCGCCGCTTTACCTTTATTACCCGCTTTACTTATCCCGATGAAGCA TTACGCAAAGCAATGT

GGCAGGCAATTTGGCCTGAACAACTTAAGTTATCAGATCAACTTGATTTTGAGCATT TGGCTAAACAGGCAAA

TCTGACCGGTGCTAATATCAGAAATATTGCCTTATTATCATCAATATTAGCTACAGA TAATAATAGTGATCAAAT

T G AAAAT AAAC AT ATAGCGCGAGCATTGATACTT GAATT AAAT AAAAC GGGCCGATTGATTTTTT AAT C ATTT A

TACCCAATAAATTTCGAGTTGCAGCGCGGCGGCAAGTGAACGAATCCCCAGGAGCAT AGATAACTATGTGAC

TGGGGT GAGT GAAAGCAGCCAACAAAGCAGCAACTT GAAGGAT GAAGGGT AT AT AGAATTGGAGT GAAT AT G

ACAAAT AT AATT AACCCT AAT AATGCGATTCTT GAAGTT AAT AACGCATT AAAT GAT ATTTT ATCTCAGT ATTT A

ACTAATATTGATATCCGCTTTGATCTACCAGAAATAAATTCAATCCCATCAACCCCT ACAGTGAGTATATTTCTT

TATGATATACATGAAGACCTACAATTACGTTCTGCTGAACCAAGAAGTTATCATCCT ACCACCAGCTCATTATT

GCCGGGATGGGTAAATATTAATTATAACTATTTAATTACTTACTGGCATTCAAGTAA TCCATCAAGCGACAGTT

CTACCCCTGATAGTCAACCCAATAATCAAGCGGCACAAGTCATGACTGCTATTTTAA ATGCATTGGTTAACAA

CCGACAATTACCTAAAATTCCTGGCGCATATACCAGAGTCATTCCACCTCAAGAAAA TCTAAATAGCTTAGGT

AACTTTTGGCAAGCGCTTGGCAATCGCCCTCGCCTTTCTTTATTATATTCAATTACC GCACCGGTAAAACTGC

AAAAT ATTAAAGATGTCATAAAGCCCATTAGCCAAATTTCCACTTCTGTGGATCAAAAATCAAAT CTGGAT AAT

TCGCAAATCAACCAAGCCTTATTTAGCAAATTGGGTGCCGATTTAGGTGGCACACAA GATGTTCGTCTTGCTC

TTGCGAAAGT GAATCTG ACAACCAAACCTGCT AAAGAAAAT AAT GAAAATCAAAAT AAT AAAAAT GT AATT ATT

GAAGTTTCTGGCATT ACCCATTTGGATT ATTT ACCCAGAAT AAAAGGT ATTCTTTCAACATGGGT AAAT AGTCA

T AGTGCT GTT GTT AGGAT AAAT GAT ATTGGT ATT ATT GTTTCAGAAT AT AAAT AT GAT AAATT AACAGGCGTTT A

A SEQ ID NO: 95 ( Photorhabdus asvmbiotica strain ATCC43949 PVCPaTox operon, pyd

- DVC16 )

ATGAATACAGCTCAAGAAATTATTAACCGTTTATCGGGGAGAGCCGTTACGCTTGGTTGG GATGTTGTTATTG

CTT AT GACCGAAAAAAAATT AACACTCTGTT AGAGCAACAAT AT GTT GAAAAGGT AAAAAACGGGGAGAACTT

CCCGCTTATCAACTGGGAGAACCAGAGAAAAACACTTCAATTTAAAGATCTTCAATT AGGTGTTCCACTTATTT

CTTTTGAGAATTCAACACTGGAAAATTCAAGGGCGCTTGCCACGATAGAATTTATTT CAGGAGCTATTATTGAA

TTT AGT GACTCCGGGCAAATAATCAACT AT AAGAAGATT GAACCT AGTCATGGTT ATGGCATGGTGCT GACT A

TCGATCTCATGGCTGGTACAGGTTCAGTAGAAGAACAAGGTCGGGTGATAATAAATC TTAACGAAGGCGCCA

TACTCGATTTGCATGTTATCCAACAACCGCCAGCAGAAGTGGTAGAATTTTTCCGCA CTTGGTTGATGGCTAA

TAAAATGACTTATGAATTAGGTAAGCTGGATCTGAGTAGTCAAGCTGGTCTAGTGCC TCGTTCTTTTCGTATTC

GTACTCAGCGGGCGCCTGAAAAAATTCGTAAAGCGACGAGCGATGAAGGAAATGGCG CTGTTTTGTTGTTTG

TTGCCACTAACTATAACCCTACAAGTGGAACTTTACCTGCCAAGGATTATCCGTGGC TAATCCCTGAGGAATA

TTCAGGCGCATTGCTTATCGGTAATAAATGCTTATTTAAAGACATTCTGAAACCGAA TCTGGATCAGTTGTTTG

ATAAAGGGGAATGGACATTAAAAGTTCAGCAAACGGATTCTGATCAACTGCTGCATT ATCTGGAGGCAAACTC

TGCAT AT AT AACAGAT AAGCCTT AT ATGGCAG ACTTT GAAGGAACTCAGGATGGAGTCTGG ACAGGACGTT AT

AAATTTGAGACTGGCCGGGGACATTATGGGGTGTATGAAAATGTACGCTTTCCTATC AATGGAATGTTGATGA

AACCGGCT AAAACTGGATT ACAGTT ATCAAT AGATTCACC ACAAAGCCATCAATTT AAT GTT GATTTCGGAAT G

AAGTGGTTCCATTGTGCTAATATAATGTGTGGTTATTCCTGGTTTAACGAGACTTAC CCATTTTATCTTGATGG

AAAATCATTTTATCAAGTTCATATTGACCCTGATAAAGAGGTGATTTATTTTACTGG GCCAGATGAAGATATTA

ATATTGTAGGAAATTACAGCCCGCCTGCGTGGTGGCAATCTAAATGGCAAAAACATA TCAGTGATGATTTTAC

GGATATTTCCTCG G AAAAATTT AAGCGACTCAGT C AAAT AAAATT G C C AG AAAT ATGCATGTTTGCCGTGAAC

CATTTATTATTTCCTGGTCATAATACTTTGCTGTTGAAAGACGTTTATTTACCGGGT GATATGGTGATTTTCGG

TGATATTAACCCATCACTTACCGCTTTTCGGGTTACGCCATTAAAAGCAACAGTGGT GGCAAAGGGAACCCAA

CAATTTAAAGCCATAGAAACTAATTGATGATTATACCCTTCATCCTTCAAGTTGCTG CTTTGTTGGCTACGTTC

ACTCACCCCAGTCACATAGTTAGCTATGCTCCCGGGGATTCGCTCCCTGGCCGTCGC GATGCATCTTGAAAT

CCAT AGGGTAT AT ATTT AATTGGAT AAGTCTTTTTT ATTTT AACATT AT AACCT GATTCTTTTTGGAT AAAATT AA

AGGATTATTAACATGTCTATTACACAAGAACAAATCGCTGCTGAATATCCTATTCCT AGTTACCGTTTTATGGT

TTCT AT AGGAGAT GTGCAAGTCCCTTTT AAT AGT GTTTCGGGATT AGAT AGGAAAT AT GAGGTT ATT GAAT AT A

AAGATGGCATTGGT AATT ATT AT AAAAT G C C AG G AC AAAT ACAGAGGGTTGATATTACACTTCG G AAAG G C AT

ATTCTCTGGGAAAAATGATTTATTTAATTGGATTAATTCCATTGAACTCAATCGGGT AGAAAAAAAGGATATTA

CAATTAGTTTAACTAATGATACTGGCAGTAAAGTCTTAATGAGTTGGGTTGTTTCGA ACGCCTTTCCGAGCTC

ACTGACGGCCCCTTCATTTGATGCTTCAAGTAATGAAATTGCAGTACAAGAAATTTC ATTAGTTGCTGATCGG

GTAACAATTCAGGTTCCCTGATAACTAAAAACTTTAAGGAAAAATAATGTCTGTACA AACAACTTATCCCGGAA

TTTATATTGAAGAAGATGCATCATTGTCTCTATCTATCAATAATAGTCCAACAGCAA TCCCTGTTTTTATCGGTA

AATTTTACAACTTGGATGGTTCCTTACCTAAAGTGGGAACATGTTCTAGAATTACCA GTTGGTTAGATTTCACT

AAAAAATTTTCGGTAGCTCCTCCTCAAACCATTTCATTGATCGCGTCGCCAATTGCT GACACACAAGAAAGTG

TACCCAAAGCAGTTCAATATACTTATAAGGCCGAGTTTGAAACCTCAGAAAATCTGG CAAATGGTGCCTATGC

GGT ACAACATT ATTTCCAGAATGGCGGTGGT ATTTGCT AT ATCAT ACCTTT AGTT AGCGT GAAAAAAGAGG AT

GCTGCGATT GAGTTAACAAAATT ACCT GAATT AATT GAAAG ACAACAAGAGATT ACGTT AATCGTCTGCCCGG

AGGACGATAAGACGCTCACTGTTGATAGCAGTAAAAAATCGGATGTTTATAACAGCA TCAATACATTATTGAG

TAATAAGGTAGGTTATTTTCTCATTGCAGATTCAGATGATGGCAAAGCAGTTCCTGA TACGTTGCCGGAAAAA

ACTGCGGTCTATTATCCTGGTTTACTAACTTCTTTTACACAACGCTATGCCCGACCT GCCGATTCTGCTATCAA

AGTGACCGGTATTACAAATATATCAACTCTGGCTGATATTCACACCAACTTGGCCGA TGACTACTCAACAGCA

AGTCAGGTTATTAATGATGTTTTGGAAAAAAAT AAT AAGCTCGCATCGTCTCCCATT ATTTT ACCTCCCAGCGC

CGCTGTTGCTGGTGCTTATGCCGCTGTTGATGTGAGTCGTGGTGTTTGGAAAGCACC TGCGAATGTGATGTT

AAGTAATGCCACGCCAATCATTAGTATTTCCGATGCGGAACAAGGTGTGATGAACCC ATTAGGTATTAATGCT

ATTCGT AGTTTT ACTGGT AGAGGT ACTTT GATTTGGGGAGCTCGT ACTCTGGAT AAAACGG AT AACTGGCGCT

AT GTTCCT GT ACGTCGTTT ATTCAAT AGCGCAG AGCGAGAT ATT AAGTT AGCAATGCGTTTTGCAGTTTTT GA

GCCTAACTCCCAACCAATTTGGGAAAAGGTCAAGGCTGCTATCAATAGCTATTTGCA GTCACTTTGGCAGCAA

GGTGCACTGCAAGGCAAT AAACCCGAT GAAGCCTGGTTT GT ACAAATTGGT AAAGGCGT GACCAT GACAGAT

G ATG AT ATT AAG AAT G G GAG AAT GATT AT C AAAAT CGGCATGGCGGCAGTACGTCCGG C AG AATTC ATT ATTT

TACAGTTTACGCAGAATATCGCCCAGTAACTTAGGTCTATACCCTATAGATTTCAAG ATGCATCGCGGCGGCA

AGGGAGCGAATCCCCGGGAGCATATACCCAATAGATTTCAAGTTGCAGTGCGGCGGC AAGTGAACGCATCC

CCAGGAGCATAGATAACTATGTGACTGGGGTAAGTGAACGCAGCCAACAAAGCAGCA GCTTGAAAGATGAA

GGGTAT AGAT AACG AT GT GACCGGGGT GAGT GAGTGCAGCCAACAAAGAGGCAACTTGAAAGAT AACGGGT

ATATTTAATATGGGCGATTTATTGCCCATTTTTGTGAAAGGAAATGAGTTATGTCGC CAACGCTACCCGGTGT

AACGATGACTCAGGCGCAGATAACAGCGTTCGGTGTCAGTACATTAAATATGCCCGT ATTCATAGGGTATTGT

ACGAGATTGCCTGCCTTTTCAGCGCCT GT AAAAGT AAACAGTTT AGCT GAAACAGAACAAAT AAT AGGGAAAG

AAGGGCGTTTGTATGCTCTATTGCGCCACTTTTTCGATAACGATGGGATACAAGCTT TTATTCTGTCGTTAGG

CGCACCTGCTGGGGAAAATGCT AAT AGTTGGCTT G AGGCATT ACAACAGCCCGATTT GTATGCGGCT GTTGC

AGCAGAGCCGCTAATTACACTTTTAGCCGTCGTTGAGGCAAGTGAACTGAACCAAAA AGAAGGTAATGAGGC

T GTGGAAGCTTGGCG ACAGT ACTGGAAAGCAGTATT AGCGTT AT GTCAGGCACGC AGT GACTT GTTTGCCAT

ATTGGAGGCACCAGATGATACCGCATTAATCAAGCGTAGTTTGCAGGATTTTCATCA TAAGGCACGTCAGTTT

GGCGCTCTCTACTGGCCAAGGCTAGAAACATCTTATCAATCCTCTCAGTTAAAAATT TTGTCTCCTATTGGTG

CAGTAGCAGCGGTTATTCAAAGTAATGATGTCCGGCGAGGGGTAGGACATGCACCTG CCAATATAGCGTTAA

AACAGACGATTCGCCCGATAAAGTCCCGCCTGGAATTAGAAGAGTTGTATGAAGAAT CGGATGGTTCACTGA ATCTGATTTGTAGTTTTCCAGCTCGTGGTACTCGTATTTGGGGATGTCGTACGTTGGCGG GTATTGATTCACC

TTGGCGTT AT ATTCAAACCCGATT ATT GACTTCACACGTGGAAAGGCAACTCAGCC AGTT AGGGTGCAT GTT G

AT GTTT GAACCT AAT AACGCAGTCACTTGGAT GAAGTTT AAAGGCCATGCTGGGAATCT ATT AAGGCAGCTTT

GGTTACAAGGGGTGCTGTATGGGCAGCGTGAAGATGAAGCCTTTTCCGTTGAAATAG ATGAAAACGAAACGA

TGACTCGCCAGGATATTGATGAAGG C AG AAT GATTGCTCGTATTCATTTGGCATTGTTAGCACCGGCAGAGTT

TATCGCTGTGACTTTGAATTTTGATACTCGCTCAGGCATTGCGACGAGTACATAATA AATCGGAATATCTCCAT

GACACTACCAGCAGAGCTTTATACCCCAGCGGTTTCACATCGTTTTATTGTT AATTTT CTTTTT AAAG G TTT AC

TTCCTTCTCCCGTAGATATTCGATTTCAACGTGTTTCTGGTTTAGGGCGTGAGTTAC AGGTTGAACAGCGCCA

TCAGGGGG G AG AAAAC G C AC G G AAT CATTGGTTGGCTGAACGTATACAGCAT AAT AGCTTGATATT AG AAAG

AGGGGTTATGGTCGTTACCCCTTTAACACTGATGTTTGATCAGGTGATGCGGGGGGA AACTCTCAATTGGGC

AGATGTGGTAATTATTCTTCTCGATCAGGCTCAACGTCCGATAACAAGTTGGACCTT GAGTCATGCGCTACCG

GTTCGCTGGCAAACAGGAGATTT AG ATGCCAACAGT AACCAAGTGCT GATT AACACCTT AGAGCTGCGTT AT

GAAGATATGCGCATTATAGGGGTAAAATTATGACTATCGAAATCCGTGAACTCATTG TTCAAGCCCGTGTTGT

CGGGACT GAT ACCAAAACAACACGAACCGTTCCTTT ATCT ATT GTGCAAATGGAAACACTT AT AGAACAACGT

CTGGTTGAAAAAGTGAAGCGGGAGATATTAGACGTACTCCGGGAAGAACAAGGTGGT GGGTTATGAGCTTG

CTT GAACGAGGTCTGGCT AAACTCACGATT ACGGGTTGGAAGGAGCGT GAGCGTAAACATCAGATTGGT AAA

CTAGAAGCAATGTATAACCCGGAAACACTTCAACTGGATTATCAAACTGATTATCTC CCTGATGTTAGCAATAA

TCAGGTAACAGTGAGTAACCGCTACGTTTTGTCAAAGCCCGCAGGGTTAACACTATC CTTGTTATTTGATGCC

AATATGGCTGGTCTTACGACAACCGTCGAGTCCCAAATCACTACCCTCAAATCGCTT TGTTTAGTTAATGCAA

GTACTGATGAACC C AATTTTTT G G AAATT AATT G G G G G G C AAT G C GTTG G G AAAAT AAAAATT ATTTTG TTG GT

CGGGCTAGTGGATT GTCTCT GACTT ATTTGCGCTTT GATCGT AACGCAACACC ATTGCGT GT GAGTGCGCAG

CTCACATTAGTCGCAGATGAAAGCTTTGTGCTCCAGGATAACCAAGCCAAGTTAGAT GCGCCGCCGGTATCA

GTAGTTAATGTCCCGGATCTGACTTCATTACCTGCACTGGCGAATATCGCTAGCGTA ACCACTATGTTGGGA

GTGGATTATTTAATGTTAGCCCGCACCAATGATATGGATAATTTGGATGATATGCAG CCAGGTCAGACATTGC

GAACACCGGAGGCATCATGAGTTTTTTAGATAACAGTAACTTCAAGCCATCAGATAT CAAACTGTTCGTTAAC

ATTCAGGGAGTGGAGAAGGAACTCAACGAACTGATAGTAAGCGAATTGAAAATCTCC CGACGTATCAATGCC

ATTCCGCAGGC AGTT GT AAAGCT AAGAGCGAAAG AGAGT GAAAGTGGTGT AT ATC AGTCT GAT GT ACAGCGG

ATGTTGAAGAGTTGCCGTCCGGGAGTAAAGGCAGAGCTTCGTATTTTGAATACCCGG CTATTCAGTGGCGAT

ATT GTGCAGCAAAAAACAGAGTT AGT GT ATGCGAAAACACACACT ATCAAATTGGTGCT ACGCCAT GACTT AC

AGCGCATCACCGGTAATTTTCGTACCAGAGTGTTTGCGAATACCCGTGATCGTAAAG TGATAGCCGATCTATT

GAATACCGCAACATTAAAGCCGGCATTTTCGGGGACATCACATTGGGATATAGATCA TGAGCAACTGGTTCA

GT ATCGTTGCAGT GATTGGCAATTTTT GTTGCAACGGCTCT ATGCT ACGAAT AGCTGGTTGTT AGCT GAAGAA

GAT AAAGAT AACACTCAGGGGAAAGT GACCATT ATTGCTCCAAATTCTTTGCCCCT GAAT GAGCGTTGGACAC

TGCAACATCAGGCTGATCATCAGGCTATCCGGCTTTACAGCACGGAGCTGATGCTGG ATAACCGGTTTGATA

CAGCGG AGGCT GTT GTT AGTGCTTGGGAT ATT GAT GATCAGGCATT ACTCGTGGCGTGGAAAGAAACCCTT A

GTCAAGTTGGGAAAGATGCGTT AGCGTCAGAT AATTTT AGCCAGACAAATAAAGATTCGAGTGAACTGTT ATT

AAGTTGTCCGCTCTCTACAAAAGAAGTTCAATTTTTAACGCGTAGCCAATTAGTCAT GCGGCGCTTGACGGCC

GTTCGTGGTTCACTGAAGGTTGAAGGCAGTACTAAGTACCGTTTAGGGCATGAACTG ATGTTGTCAGGTTTT

GGTGAAAATATGGATGGCTCACAAATACTGACGGGAGTGGATCATCGAATAACGGCA GAAGAAAGTTGGAAA

ACAACCTTACATGTGGGATTAGAACTGCCGTTAAAGGCAGAGTATGTCACTCAGGTT AACGGTGTTCATATCG

GCAAGGTTGCTGATTATCAATCAGATAGCAAAAAATGGGATCGTATTCCTGTTTTGA TCCCTGCATTTGGAAC

GAAT ATTCCCTT GTTTGCCCGATTGGGAAAACCCT ACGCCAGCCACCAAAGTGGATTTT GTTTCT ATCCT GAA

ACGGGTGATGAAGTCATTCTCAGTTTTTTGGAAGGGGACCCTCGTTATCCTGTCATT ATTGATTCCCTGCATA

ATCCTAAACAACAGACTCCATTGCAAATCAGCAAAGAGAATAATCTCAAAATGTTGA TGATTAAGCAGAGCGA

TAAAGATGAGCAACAATTGTTATTTGATAGCCAGCAACAAACAGTCGCGTTAATCGG TAAGAAAAATATCGAG

GTTAAAGGTGAGTATATCAACCTGACTAAATCAAAGGGGACTCGATAATGGCAAATA CGCTTATTGGCCAGGT

ATATGGTCAAGGATGGGCTTTTCCCATT AAATTT ATTC CTG AT AAT AAAG AAAC C G C AG AT C AAAC AG C C G GT

ATTGTTATGGCTCAAGGGATTGAAGATGTCAGTCAATCGCTGGAAATATTATTTCTT ACCGAGCCTGGCGAAC

GAATTATGCGTGAAGATTTTGGTTGTGGTTTACAAGATTTTGTTTTTGAAAATATTA GTGATACGCTAATTTCTG

CCATCAAAAATCGTATTCAGCAAGCAATATTACGTTATGAACCTCGCGCATATTTAT TGAACGTTGATATTCAA

ACCAAAGAAAACCAACCTGGACATCTGCTCATTCAGATTAATTGGAAATTACGTGGT AGTGATATATCTCAGC

GTTT AG ACGGAGTGCTT AGACTCCATTCAGGTCAAGCATTGGAACTGTT AT GACCAATT AT ATT ATT ATCGAC

GGGGATCTCATTCAAATAAATCCCAAATTTGAGGGTGATCGAACTCTTACGATTAAT GGTATTCCTAAAATAAG

CGGGAATGGAGATGCGCAAATTGAAGGAAAAAATATTTGTGTGTCAGGTGATCACTT AACTGTCTCAATTCCA

GCCATTT AT AT AACCTCCAGAC ATCCT GTTGCAGGT AGTGGAAAAGT GAAAATT ACAAATTT ATCT GACGACC

AACTAGCAGAATTTTGTGTTAGTGGGGATGTTGTGATTATTGAAGGCAGTCAGTTTG AAGCTCAGTTTACACC

GGATAAGCCGGCCACTAATCCAAGTAACCAAGATGCAGATAATCCTGCGCCTTCGAA TGGGAGTGGGAGATT

TATACACTCACAGAACTTCGTTAAGGCAGAAAAATAAAAAATTTTGCCGAAGCGGTT AATAAGTATGAATAAG

CGGGGCGGATAAAAACATGGATCTTGCTGAATTAAATAATACGTTGATGAATGACTT ACCAACGACCAATTTT

AAGTT AG AAAC AAAG G AC CC ATT AACGCAATT AAAGTGGTT ACAACGTT AT ACAGAAAAT ATTCGTTTTT ATGC

GAATGATGATTATTTCTGGCATCAATTCTGGTTCTTAAAAAATCACACACCAGAAGC GCTCTTTGCTCGTTTGC

AAGGTGAAACGTTGGCTGATGGAGAATTGCCTCCTCATCAAGCGCTATTGCTGGCCT TTTTACAACAGCTTAA

GACGCCAGGAATCATGCTTGATACTTTTTCAGCCCGTCATCGGCAATTGTACTATCA GGAATTGCTAGGGATA

ACGCAGAAAG ATGCACAACCT GATCAT GTGGCGCTTGGCGTGGTATT AAGT ACTGGTATTGCAGAAT ATTT AT

TACCGACAGGCACATTAGTGGATGGTGGACAAGACAGCAGCGGAAATTCACTGCAAT ATGCGTTGGATACCG

ATTTATTGGTTAATCCAGGGCAATTAACAGATGTTCGCTACAGCTATTTGGATCATA AGACCTATAAAATCTTC ATCTTGCAAGATGATAAAGCGAATATCAGTTGGCCCTCTTCAGGCGCTCGTTTATTTGTA GCACCTGAGGGCA

ACGGACAGGAAAAGGCACCT GAACAAAAGTTGGCACTTT ACCTGGGATTT GAT GAT AT ACAGCC AGGGCAAA

CTCTTTCTTTATTTTGGCAATTCATTGCATCAACTCCCCTGACATTAAAATGGTTTT ATCTGAACGAGATAAATA

ACTGGGTGAAGCTAGATAGTGTCAGAGATAACACGGATGGCTTTTTTATCAGTGGAT TATGGCAAGCGATATT

ACCT GAT GATGCGGT GAAAAT GT ATTTTCCAG AGACAACTTCT GT AAAACGCT ACTGGATT AAAGCT G AGGT G

GAATCGCTTACTGAATCTGGCGATTTGTGGCAACCGCTATTAGAAGGCATCTTGTAT AACGCTCAAACAGCAA

CGCTGGTT GATGCAGACAACACAGAT GAAAAGCACTTTC AT GATGGGCT GATGCCTTTT AGCGTGCAGCATT

TGGTCAACACCGTTTCAGAGGTAAAAAAAATTGAGCAGCCCTGGTCTTCTTGGGGGG GAACGCCACAGGAA

GACACTACTGATTTCTTCCATCGAGCGGCAACACGTCTTCAGCATCGCCAGCGTGCG TTAACTTGGGATAAC

C AAATT GCCATGTTGAAGGCT G AATTT CCGCGGATTTATGATGTCATCTCAC C AAAT ATCACGTGGATGAACC

AACTTCAGACATCAAATACGCAAACGCTGATCGTTATTCCTGATGTGAACTACAGCG ACAACAAGGATCGCTT

ACGGCCACAATTCAGCCCTGCCAGCTTGCGACAAATGAGTGACTGGTTACAGATTCA CACTAGCGCATGGGC

GAATCCACAAGTGGAAAATCCAATTTATATTGATGTCTCTGTGACCTATGAGGTGCA ATTTAGTGCGGGTGTG

AATCCTGATTATGCCCTCCGGCAATTACAACAATGGTTGAGTTCAATTTATATGCCA TGGTATCACGCAGATA

AAAAAGGT GTTGCCGCTGGCGATCAAATCG ATTTTT ACCAACT GTTTGCAGAT ATTCAGCG AGT ACCTT ACGT

GGAGCATGTCAAAACATTGACATTGACCACAAAAGACACCTCATTAACCAATGGCGG GGTTATTAAGGCACA

GCAAAAT GAAGTGCTGGT GTTGGT ATG G CAACAAG G AG AAC AAATT AGGCAGGGAGAATCGAAATG AGGCA

GCAT AAT GAGTT ATTTCCT GT AGT AAAAGACGCGAT AAGCTTT GAAAACCTGCAAGCTCAGGGT GAGAAGGTT

ATTAGTGATCAGTCCGGTAACATATGGAGCGATAAAGATAAACATGATCCTGGTATA ACATTACTAGACTCTTT

AAGTTACGGTGTTTCGGATTTAGCGTATCGGCACTCATTACCTTTAACCGATTTATT AACCATTGCTGGAAAAG

ATACGCTTTTTCCAGCC G AATT CGGGCCACAGCAGACGCTAACTTGTGGCCCTATAACACTGGATGATTACC

GGCGTGCGTTACTTGATTTACATGGTAATGATGCATTTAAAATATCAGCTAGTGACC CCAGAGACTTTTTGTTT

CAGGAT AT ACAGTT AATTT GT GAGCCAAAAAGTAAGCGTT AT AAAT ACT ATTTCAATCCCGAAACGCTT GAAT A

TACATTCACGCCACCTTCAGGGGATAAATTTAAAACTTTAACACTACGAGGGAATTA TTGGCTTTATTGGATAC

CAACCCGTTGGGCAGGTAAATCAGCTAATTTGCCGTTAGTTAAGCGGGTGATGGAAG ATTTTCTCCGTGAAA

ATCGAAATTTGGGGGAAAATGTTGTTCAAGTGACACGGGTGATATCAACGCCTATTT ATCCTGAGCTGGTCAT

TGAGCTGGCGGATGATATTACAGATGCGGCATCAGTATTAGCATCAATCTATATGCT ATTAGAACAGTGGGC

GATGCCGATGCCTGCTCGCTTTACTACCGAAGCATTACAGGCCAAGGGATTAACAAA CGAAGAGATCTTTGA

TGGGCCGTGGTTGCGTCATGGTTGGATACCTCAGTTACCGACCTCTCAAAACTACCA TACAGGCATGGTTCT

GAAGATGAATCATCTGATTAACCAATTGCTGGCGGTTGAAGGTATAAAGCGCGTAGT TAGCCTGACGTTGCC

AGAAACAGAAT ATTTGCATCAGAT AAAAG AT GAT AATTGGTCCTGGCAATT AGAT GTTGGTT ATT ATCCATT AT

TATGGGGAGCTAATCCACTAGAGGTAATTACAGAGAAAAATAACAATTATGTCAAAT TGTTCGCAAAAGGTGG

GGTACGATTACAACCTGATCAGAAAAGTGTTGAGCGGTTATTATCACAGGAATCACT CATTAATAATGCTGCA

TCCACGTTACCGGCTGGTAAGGTGCGTGATCTCAAAGCCTATACACCTATAAGCCGC AGGTTGCCTGCCTGT

T ATGGTTTGCAGAAT ACTTTGCAAAAGTT AAAACCT GAACAACG ACACTT AT ATCAGTTCCT ATT ACCATTGGA

GCAAATGCTTGCTGATGGATGTGCGCGGCTTGCATTTTTGCCACATTTGTTAGCATT TAGGGACCGAAGCGG

AAATATCAGTGATACACTCTGGCCTTTCAAGAATACAGAGGACACAATTGCCCAACA GGTTCATCAGGAATAT

GCCGGTACATTAAAAGCCTTTCAACAGCAGGAAATTAGCCTGTTTGATGATAAAAAT AGACCGCATCATGGCA

ATATCAATCGGGAATTAGATATTCTTGATTATCTGCTAGGGTATTTTGGTACACAAC GTGCAAAGCGTCCATTA

ACGCAGGATATTCATGATTTTCTGCAAACCCAGCGAGGTTATTTGGCACAGCAGCCG GAGTTGGGTTATCAG

CGTGATAATATCCGTATTGATCGAGTTTCAGCTTTACAAAAACGTATAGCAGCCCGA ATTGGGCTAGATGGTA

CTATTTTCAAAGAATCGGTTGATTTAAGTAAGTTACCTTTTTATTTGATTGAACATC GTCAGCTTTTACCAAATT

TACCCCATCTTGACTTTCAACATGATCAAACTCCCCAATCTTTTGTGATTTCCGACA ACATTGTTAAAGTGAAA

CAAGCGGGAAT AGCAGAT AAAATCGTTCGTGG ACAGCTT ATTG ATTTT AT AGAT ATT GAAAGCAAATTT ACCG

TTCGTGCCCAAATGATTGTCGCTGTAGAGGGAAATGAATTTTCTCTGGATACAAAAA ATAGTATTCAACTTGAA

AAGAATCTGCAGTTATTACAATCAGCGTCTGAGAAAAACAATTTACGATGGAGAAAT AGCACGGCGTGGTTAG

AGGATATGACGTATCGTATCAATTATACTGACGATCAGGTTATAGACGATAAAACAA AACAATGTCGTTTACAA

AGTAATACTAAATCGCCTTTTCCAGCCTTAATTGCACCAAAAAATAAGATTACGATT ATTAAGCAATCTTCTCC

ACTCTCCAGT ATTGCT GAATTT ACT GAT G AAC C AG AATTC AAATT AGTTGCAACGGT GACAGAGATT GATCGG

ATT GAAGGGAT ATT GACTATCGAACGGGAT GACAACCAACTCCCTTTCCCG ACT AAAGAAGAG AGT AATCAAT

ATATATGGTACATATCTGATGAAAACTATATTTCAAGTGATCGTTTCTCTTTTGTGG TGAGCGTCGTGCTGAAT

CGCGGTTTGGTT GAAAGGGAAGAT ATT GATCAAT AT AAGCT AGAGGAATGGAT AGAGCGT GAAACACTTGCA

GAGTTTCCTGCACATATTTCGTTAATTACTCATTGGCTGGCATCTGAAAATTTCGAT GATTTTGCGAAGACATA

TCAACGTTGGCAAAACAATGGGGCGCAGTT AGGGGAT GAATCCT ACACC ATTTTGGAAAAACT GACATT AGG

GCATTTACCAACAGGACTTACTGGCATTAGTAATATGTTTATTGCTACAGAAGCTCA GCGTCTAGAAGTTGTT

GGCGAGAGTGGTAATGAGTGGAATACCCAGGCAATTATTAACAACGAACTATTCTAT GTTCCCTCACAGAATA

GTT AAT ACCGAGT GTT GT GATCAACTTTT ATT AT AAGCCGGAGGAT AAATGGACAACAAAAAT AACAAACCTAC

TGATCAAGAGATTCTAAAAACATCACGGGCTGTCGGAGAAATTCCTTCAGCGGATAA TTTAAAAAATCGTTTTA

AAGCTCGTTCGATTCCATTAGAGACGGATTTTACTAATCTCATTGACCTTGCTGAAG TTGGACGATTGGCTAT

CGGCCAGTCACCATCGCAGCAAAGTAAAACGCCTGGCACCGGAATGGAATTAACTTC GGATGGTAAATTACA

AGTCAAGGCTGGGGCAGGTGTTGATATCGATAATAATAATCGTATTACTATTAAGTC TGGTCATGGAATTAAG

GTTGATGGAAACGGCATTTCCGTTAAACCAGGTTCGGGTATTAAGGTTGATAGTAAT GGTGTAAATGTCAATA

TTGATGATTTTTGGGAG G AAAT AC G C AAT AAAATT ATG CCT AAAG GAACCATGCTGCCTATTTATGGCACACC

TAACCCCTCTGCGCTGCCAACAGGATGGGAATGGTGTGATGGTAAAGATGGCAGACC TAATTTAAAAAAAGG

GAAATATAACTTACTATCAGGTCAGTCTTCAGGTACTGATACTTTTTGGGCAGATAA TAAGAATGGAGATACA

GAG AT CAAC GTGTT ATTT GTTT ACT AT AT GATT AAGGTT GTGT AAT AT CTT AAGT AAT ATGC ATT ACTCT AAAAT GAAT GATTT AT ATTT AAGTAAC AT AAT AATTAAGTT GT GTTGTAGGGCT GTTTTT ATG AGAAAT AT AAAAACGGA

GGTAATAATTGGCTTCAAAATATCAGTGATGAAATAGAGTTATTTCGCTTTATAAAA ATTTTGTTTTATTTCTTTT

AAT AATT ATTT AT AGAAGGT AAT GAT AT GTGCACACAAAAAAACGTGTT AG AT AGACTGAAAGAT AGAAAT ATT

ACATTGGGTTGGG ATGTT GTT GTTGCAT AT AACCAAGAAAGT GTT AAT AAGTT ATT GAAGCAACAAT AT GTT GA

AAAAGTTT ACTCAAAT GAACATTTT GTTTTT AAAG ATTGGCAT GAT GAT AAT AAAACGAAATTT ATT GAGGG ATT

AACAGTAGGCGCTCCACTAGTTTCATTTGAGGAGGCGTCTTTATCCGATGCTAATGT AAAAGTGACACTTAAC

TTTCTTTCTGGTAGATGGAGAGTTATACAAGCAAATACCGGCACACCAATTGAATGG AAAGAAATTGTTCCTG

GCAGTGGCTATAAAGCAGAATTAGTTGTTCCGCTTAAATCAATAACTGGTAGTGTAA GTAAAAAAGATATCATA

TT AAAATTCAAAGATGCT GTCGT AAAAAAAAT AAATTTATTT GACAATCAAGAGCCT GATTTT ATT AATT ATTTC

AAGCAATCG ATCAGT GAGGGAAATT AT ACTTT AGGGCAACTGGTGACAGACAGCACACCGGGATT AATTCCT

GCTGAATTTCATATTCGTACTCAACCCCATCCAAAAACACGTGAGCGTGGTTCTCAA TATGTAGGAAATGGTG

CGGT ACT GTT GTTT ATT AAAACGCAAT ATGGCGGAAGTGGAACATTGCCT GT AAAT GATTTT GATTGGTT AATT

CCTGATGATCATACTAGCGCATTAGTCATTTCGAGTAAGACCATGATGGGGCAAATA TTGCCAAAACAATACA

AAGATAAATTGCCTGGTGATCCTCAGTTTAGCCCACCAAAAAGAGTCAATGATAAAC AAGACTCTGCTTATTAT

ATTACGATTACCGATGGTGGATTTGATGGTAATAGCCCTATAGAGAAGTCATGGTTA CGTTCTGATTATAGCA

ATGGGATTTGGACTGGTGAACGTGGT AATGCT ATT ATTGGT GAAAAAGGAAAGCGGAT ACCACCACGTTTTC

CAT ACCAAAATTTT GTT ATT AAACCTCATGGT GAATCGTT ATTTCAAGGATGGGAGAAT AAGAT AAATT ACACT

CAAAAGTGTGCAAGATATTTCCGACATCATAGTAATAGTATAACTTTCGAAGATACT GCATTAATGGATCTCAG

T ATTGGTGGACAAGGT AGT ATCAATTGCCAGATT GATGGT GAACATTTCT ATTT AAAATCAGAT GATTTTTCCC

CCAATGTCAGCTATGAACCAACTTCATTCTGGGATAAATTTATCGGTGGGGTGGATG CAAATGTGAAAGATGA

ATTC AG AG AT G AATT AGCACAACAGGCAGAAG C AAAG TT AAAAC AG GT ATTT AAT ATT G AATT G C CT G AAAT C

AGTCTGTTTTCTATTAAACATCTGCTCTTTCCTGGCATGGATGTTATGCAACTTAAA CAGGGTTATTTCCCAGG

AGATTTGATTATCTTTGGGGATATTTCACCT AAATT G AC C AC AATT CAGGTGGCTCCTTTGGAAGCCATGGTT

GCCCTT AAAGAAAATCAAAAATTCACT GTCGT ACCT GAAAAT AAAAAT GTT AGTTGGAAGTTGGATCAT AAT AG

TGAGGCTAT C AAT G ATC C G G G AAAT ATTG ATG AT AAAG GTATTTATACGGCACCGGG C AG AAT CAGATCTGG

TTCTGAAGTCATTAAAGTCACTGCAACTGACGGCGATGGAAATCAGGCATCGGCGGC GCTGACGTTGGTTCC

TTCTTCTGTTGCATTAACACCTTCTTTTGCTTTTATCTCTGAAGCAGATAAGAAACC TATATTATTATTGGCGAA

TGTCCTAGACGGAAAAGCAGTAACATGGAATGTGGAAAGCTGTACAGGCAGCCAATG TGGTTCTGTTGATCA

GAAT GGGCTTTATACTCCACCAGCAGGGCGTTTTAACGATGGATTTACTTTTGCATCCATCACC GCAACTGCA

AAAGATGGTAGTCAAGCACGAACCATTATTTGTCTAATGGCATCAATGCCAGGACAT GGTTTTTACAAGGTTG

AACCTAATTTACGTTTGAATGTGAAAGTAGGGGAAGAAATTATCTTTAAAGCGCAGG CAGATAGCTATAATGG

TGATCCTGATACTTGGGAAATTTTCCCTCCTCGCGGAAAATTAAGTGAACCTGAGTT TGAACCCAATAATGAT

CCTGAAACTAATGATACAATTTTTGGTCATTATAAGGTGACCTATACCGCGCCGACT AATGTTACCTCACCTG

AATTGCTT GTT GTCCAT GT ATGGGAGAAAAAT AGGCATAAT G AGAAAAACAAAGGT AAGGCAGGAT ATGCACT

T ATT GAAATT ATCCC AGAT GAT AAAT AGAAAATTT ATTT AAAT AAAAATC ACAGCGGGTTT AT CTCGCT GT GATT

AAAGT CAT CTTTTTTT AT AG ATTGTTT ATCTCT AAT AAT AATTTT ATTTT AT AAT AT AAAGGAAATT AAAAT GAAT A

ATGAAT AT AAAAAT AACACCGTGAATTGGCGTATTTCACCTGAT ACGGTAGGAAGTATTGAT AAT AACGGTTTA

TATACAGCACCTAATCGGGTAAAGAATATCGAATTTGTCCAAGTAATGGCAAGCGAT GCTAATAATAATCAAT

CTTCTGCGATTATTACTGTTATTCCCTCTTCTGTTGCGTTAACGCCATCGTTTACTT TTATCTCTGAGGCAAAA

AAAAC ATC AGTCACTTTT AAAGCGAC AGAACTT GAAGGGAAAAAAGT GACATGGAGT AT AAAT AATT AT ACCA

GTAATCAGTATGGTTCCATCGATCAAAATGGTATCTACACACCACCGGAAAGTCGTT TTAACGATGGATATAC

TTTTGTATCTATTACAGCAAAAGCGGAAAATGGCGCTGAAGCGCAAGCGCTTATTTG CTTGATGGCCAAAATT

CCAGGGCATGCCTTTTTCGAT GTTCAGCCT AAT AT AT GTTT AAGT GT GAAGCCTGGAGAAGAAATCATTTTT A

GAGCTAACGCAGATCGTTATAATGGTGATCCTGATTCCTGGGAAATTTTCCCGTCTC TTGGTAAATTGGGTGA

GCCTGAGTATATAAAAAATAACGATCCAGAAATTCCTATTTATGGATATTATCAAGT GAAATATATTGCGCCAA

CCAATATAAATTCTTCCCAAATACTCGTTGTGCGTACTTGGGAATATGACAAACATG ATGAGCATAATCAAGGT

AAAG CAGGATATGC ATTC ATT GAAATT GTG C C AG AAAAT G AG CTTT AAT AT AT AT AC C C AAT AGATTTCGAGCC

GCAGCGCGGCGGCAAGTGAATGAATCCCCAGGAGCATAGATAACGATGTGACTGGGG TGAGTGAACGCAG

CCAACAAAG AGGT AACTT GAAAGAT AAT GAGTAT AAAT GACTTT AGT AAGAGAAATT ATGGCTTCATTCAGAAC

T ATTT ATT AGAGT AATT AACTTT AT AAAGACATTT AATGGAAAAT AT AAT AGAAAAATTT AAT ATT AAT ATT GAAG

TCTCATCTGAAATTATTGGAGAGAGTTTATTAAACTCCCCTTTATTGATGAGTAGAG AAATCAGCAATCAATTA

TCT GAAAT ATT ATT AG ATT AT AAAG AAT AT AAT ATT G C ATT G GAT AAGTT AGT GTT AAAT AT AGG AGAAAT ACCC

T AT GAAAT ATTT GAACAACAATTCT ATGGTCGTTTGGGAAAATT ATT AAAT GAAAAGTT AACAAT AAT AAT AAAT

GAT AAATT ATTGGT AAAAAACAT ATCAACCTCGTT ATTTCCT GAAT GTTTT AGT GAAAAAAGAAACCC ATT ATTA

AAT AGAGT CAT AAAAAATTT ACCTTCT AATTTGGTTTTT G AAGTT C ATTCAAT G GT AAAAAT AGAAT C AGT AAAT

AACAAAAAACAAGCT AAT AT ATT G AC AT CTTATCTGGCTT ATT CTTTTTTT AAT AAAAG CAAATT ACAACAAC AT

TT ATTTTCCACT AGT AAT AAT AAATT AATT GAGAGCTT AT ACGC ACTTTTTCT AACGGATC AGAATCGAAT ACCT

ACTGCTCATAAAATAGGAAAAGGTGCACTTATACTATCTGCCCTTATTTGGCTTTAT TCTAATTCCAATGATTAT

CTGCCCAAACCAGAAAGCACTCTGTTGTTACAAATAGAACAGGATATAAAACAAGGA TATTTGCCTTTAACGT

TGTTAATCACTTTCTTCCAGAACAGAAATGGCGGGCGTGTTTTTTGCGATTGGCAGT ATGCGTTATGGCAAAT

C GAT ATC AT C AAAAAT CACTTAGGCATT AAAAT AAC AT C G AAAG AAC CCCATTTACGG GAG AAAAT AAT GTT AC

AACCAGTTAATGCTTCTGATCGATCCTCTGTGCTGATATCAGACGAAAAATTGACAA TACCGTTAACAATTACA

GGTGCGGGATTAGTGCTTCTCTGGCCACTATTAACTCCACTATTTTCGTCTTTTGAT TTGTTAGATAAGAAAAG

TTTTTCAG ACAATTTGGCACAGGAAAT AGCATTT AATTT ATTGGAATGGTT AGTCTGGGG AGATG AG ATGCT G

TTACATCAGGAATCATCATTATCTTTATTACTCTGCGGAATAGATCACCAAACAATA CTGGAGCGCCAGGTTCT

TATTCCTGAGCACAAGGAAAAATTAAATAACTGGTTGCAAGGTATTTGTACTCAACT TTTCTCTTGGAAAAAGC TAGGGATCGATGATATGCGCCAACTTTTTTTGCAGCGTCAGGCTGCACTTTATTATGAAG ATGATGGCCGTTG

GTTATTAACGGTGCAGCGTGAAGCTTATGATGTATTACTGACTCAAATGCCTTGGCC GTGGCCATTGAATATT

GTGACATTACCTTGGCTAGCTGAGCCGATTAGTATCACTTGGGAAGGTATCTCTGAA CCAACGGATTTGTCAT

TTTGGTAATCCAATATCTCATTAGGAACTCTATGCATGTACGATTTATCTGATGATC TTGCCAGACAGAATATT

TCACCGGAATATGAATTGACGGTTTTGCTGTCTCAGACTGCTATATTGGATAAACGA ATTCGTTTACGAATTCA

GGAATTAATGCAACAGCAAACACTATTGGGAGAAAGTGGACAGACGTCTTTTGATGA TATTTCATTTTCATTC

GTTTCGAGTGAACAACAAAAATCATCTTATTTGGTGTCACCGCATCAAAATTGGACG AAAGAGGATTTTCCTC

CTGAGCCGATCCCATCTCGTAGCCGTCTAGGACAATTAGTTGAACGGTTTGACTTAA CTCAATTTGAAATTGA

TTTGATTTTATTGTGCCTGTTGCCTCATCTTGACAGACGTTATCTAACGTTATTTTC TCTTGTTCCGGTAAGTG

GAGGT AAT AACAGCAAAAAGCAGAT GTT AACGTTGGG ATTGGCTTTGGAGTTGCTTT GTCCGAGTGTAGT AG

AG C G C AAT GCGCAACGTGCCAGTTTATTACCACAGGCACCGCTTTGGGATTATCGTTTATTTCAGTTG CGCG

GTGATATGTCTGTTTCCTACGATGAAATACCGTTAGCAATCGATAATTCTCTTATGC ATTGGTTATTGGGGCAT

GATGCTCTCCCGATTTCTCTTCTCTCCCGGGCTCATTGGCTTCCTGTTCCTGAAGTG CCTGATATTTTGCCTG

ATTTCACCAACCAATTGATAGAACTCTGCCAAATGGAACAAGAGGGGATGCTGACAA TAATCGCCGGCGGAG

CCGGAAGTGGCAGCAAAACAAGTGTTGCACGCGCAGCATCACAAGTAGGGCGCTCTG TATTGTTGTTATCGT

T AGCATCAGT GACACT GAGT GAAC ATGAAACT ATT ACACT GAT AACACTGGC ATTACGTGAAGCACAACT AAG

AAATGCCT GTCTT AT GTTT GAAGCTTTGGAT GAGTTTT GT GAAGCACGCCCCGCTTTGCAGCTCTGGCTAGGA

AATCGACTGGCTCGTTGTTCGATTCCGCTGTTTTGTCAATTACCTAAGCAAGCATCA TTATTGCCATTGGATG

CAATTTCACAAGTT GT ATT GTCT ATGCCAATGCCTTCTTT AATGGT GAAGGCTGCAGCATT AGCTTCAAT GAT G

ACGAATTATTTTCCAGACAATTCATTGGATGTTGAAAGTTTAGTGACATGTTTCCAT CCTTCTCCATTGATATTG

AAAAAGGCCCTT AGT GAAGCAGAAATTT ATCGCCGACT ACGGGGGGAAACGGCT AGTTT GAGATT AGAT GAT

GTGCAAATGTCCCTGCGTTTTCGGTTACAGCAGAATTTTGGACGTTTAGCACAGAGA ATTACACCACAACGAA

CCTTTGATGATTTGATCATCAGTGAATCTCAACAGCAACAATTACAAGAAATCCTGG CGGCTATTCGGCAACG

AGATAGGATGCTAGAGCAAGGATTTGCTCGTAAAGTGAGCTATGGGACGGGTATCAG CACGCTATTTTTTGG

TGAATCTGGCACAGGAAAAACGATGGTAGCAGAAGTGTTAGCTGGTGTTTTAGGTGT GGATTTGATCAAGGT

AGATTTGTCCACTGTGGTTAACAAATATATTGGTGAAACTGAAAAAAATCTGGCTCG TGTTTTTGATTATGCCC

AAGAAGACGCCGGGGTATTGTTCTTTGATGAGGCAGATGCATTGTTTGGCAAACGAA GTGAAACTAAAGATG

CAAAAGATCGTCATGCTAATATTGAAGTTTCCTACCTATTGCAACGCCTTGAAAGTT ATCCAGGGCTGGTGAT

ATTAGCCACCAATTACCGTAATCATTTAGACTCAGCATTTAGTCGTCGCCTGACTTT TTCGGTACGATTCTCTT

TTCCAGATGTTTCCTTACGG GAAC GGATGTGGCGGATTATCTGGCCATCGG G AATT C AATT AGCCGACGACA

TCAGTTTTTCAGCGTTGGCAAAACGGGCTGAATTAACGGGGGCGAATATCCGTAATA TTGCGCTACTCGCTA

GTTGGCTGGCAGT AGAT GAAGGAAAT GAAAAAATT ACT ATGGCTCAT ATT GAATGCGCATT ACGACGT GAACT

GAGT AAAGTTGGGCGCATT GATTT ACCTT AATTTTTCTTT GT AATCGGGAGACAACT ATGGTT AAAAAT ATCAA

ATCAGAT GAAACCTT ACT GAT ATT AAAT AGT AAAAT AGAAGATGC ATTAAAAGCGT ATTT ACCGGGCGAAG AT

GTCGTTATTCGGTTCGATATGTTTGGTAAAAATGAAAATCCAGATTCTCCTACCGTG TGCGTTTTTCTTTATGA

TATTCAGGAAGATCTGCAATTACGCGTGGGAGAAGGGCGGCAATACCTGCCTGCGAC AGGAAATTTTGTCCC

GGGATGTGTCAATGTTCGTTGTAATTATCTTATTTCCTACTGGGAGCCGGAACAGAG CGGAGGGCAGGGATC

GCCAACCATACGTTCTAATAGTCAATCAATGAAGATAATGAACTGTGTATTGAATGC ATTAATTAATCATCGTT

CATTTCCTGGTTTACCCAGAACTTATACGAGAGTTCTTCCTCCTAATGAACAATTAA ATAGCTTAGGAAACTTT

TGGCAATCATTAGATAATAAGCCTCGACTATGTTTAAGTTATATGGTGACTATTCCT ATTCAACTTACCCCGCC

GACAGAGAAGGTATCTCCTGTCATTACCTCAAAAACTGATATTACTCGAAAACCATC GCTTAACTTTTATCTTG

AGGCAGATGAAATTATCCGTCAGGCATTAGTTGATGCCTTAATATCTCAAACAACAG AATCTATGGATACGAT

AACT AGCTGGCTGGCAAAAGTT GTT ATT ATTT GTCGACC ACCAGAAAT AAT GAAT AAACAAAT GATT GAACAAA

CT GT GAAATT AATT ATTGCTGGAATT ACAGAAGAGGGATT AGCTGGAAAT AT AAAGACAATCACTCAAAAGTG

GGTGGAAGAGAAGACGATT ATTGGT GAAATCG ACGAT GTTTCTCT AGTT ATTTCCCAAGTT GACACGACAGC

GTT GTCTGCT GT AACAAT ACCGACATCTGTTT AA

SEQ ID NO: 96 (Pnf epitope)

TGQKPGNNEWKTGR

SEQ ID NO: 97 (PVCpromF)

TATC ATATGTCT AC AACT C C AG AAC AAATT G CTG

SEQ ID NO: 98 (PVCpromR)

ATCTCTAGAACAGATATTCCAGCCAGC

SEQ ID NO: 99 (ParaINF)

GGCGTCACACTTTGCTATG

SEQ ID NO: 100 (ParaINF)

TCGGTGGCAGTAAATTGTCC

SEQ ID NO: 101 (F1 primer)

AT GT CT ACAAGTACAT CTCAAATT GCG SEQ ID NO: 102 (F2 primer)

GACTCCCTTGAGGGTACGG

SEQ ID NO: 103 (F3 primer)

TT CT GAT GAGAGTG AT GGT AC

SEQ ID NO: 104 (F4 primer)

T G AAT AAAG AATT C AGT C AAT AT C

SEQ ID NO: 105 (R1 primer)

TAGTGGCTGAT G AAAGT CTG

SEQ ID NO: 106 (R2 primer)

GG AAGCCAAAGAT AAT GAAGT G

SEQ ID NO: 107 (R3 primer)

CATTTCTTCCCTATGGTTG

SEQ ID NO: 108 (R4 primer)

TT AAATTCCT AC AAG ATT AT CTTT

SEQ ID NO: 109 ftBid amino acid sequence)

RSSHSRLGRIEADSESQEDIIRNIARHLAQVGDSMDRSIPPGLVNGLALQLRNTSRSEED RNRDLATAL

EQLLQAYPRDMEKEKTMLVLALLLAKKVASHTPSLLRDVFHTTVNFINQNLRTYVRS LARNGMD

SEQ ID NO: 110 (£. coli Sequence Optimised tBid bases)

CGGTCAAGTCACTCGCGTCTGGGGAGAATCGAGGCTGATAGTGAGAGCCAAGAGGATATC ATAA

GAAACATAGCACGCCATTTGGCACAGGTAGGCGATTCTATGGATCGCTCCATCCCGC CTGGACTT

GTCAATGGTCTTGCGCTTCAACTTCGTAACACTTCCCGGTCCGAGGAAGACAGAAAT CGGGACCT

TGCGACTGCTCTGGAACAACTGCTTCAAGCATATCCTCGTGACATGGAGAAAGAAAA GACTATGT

TAGTATTAGCTCTTCTTTTAGCTAAAAAGGTAGCTTCGCACACTCCAAGTTTATTGC GGGACGTTT

TTCACACCACTGTTAATTTCATCAATCAGAACCTGCGTACTTATGTGAGATCTTTGG CGAGAAATG

GTATGGAT

SEQ ID NO: 111 (BaxBH3 peptide (aa59-73))

LSESLKRIGDELDSN

SEQ ID NO: 112 (E. coli Sequence Optimised BaxBH3 bases)

CT GTCGGAGAGTTT GAAGCGT AT AGGT GACGAGCTGGACAGCAAT