Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
BACTERIOCINS FOR CONTROL OF SALMONELLA ENTERICA
Document Type and Number:
WIPO Patent Application WO/2021/053242
Kind Code:
A2
Abstract:
The present invention relates to bacteriocins for control of Salmonella enterica (salmocins). The bacteriocins are derived from Salmonella. The salmocins can be expressed in plants and can be used in a method of preventing or reducing infection or contamination of an object with Salmonella.

Inventors:
HAHN SIMONE (DE)
SCHNEIDER TOBIAS (DE)
STEPHAN ANETT (DE)
SCHULZ STEVE (DE)
GIRITCH ANATOLI (DE)
GLEBA YURI (DE)
Application Number:
PCT/EP2020/076334
Publication Date:
March 25, 2021
Filing Date:
September 21, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NOMAD BIOSCIENCE GMBH (DE)
International Classes:
C07K14/255; A61K38/16; C12N9/16; C12N15/82
Domestic Patent References:
WO2018172065A12018-09-27
WO2005049839A22005-06-02
WO2012019660A12012-02-16
WO2008028661A22008-03-13
WO2006003018A22006-01-12
WO2005071090A12005-08-04
WO2006012906A12006-02-09
WO2002101006A22002-12-19
WO2007137788A12007-12-06
WO2002068664A12002-09-06
WO2014187571A12014-11-27
WO2013149726A12013-10-10
WO1999022003A11999-05-06
WO2002097080A22002-12-05
WO2002088369A12002-11-07
Foreign References:
EP3097783A12016-11-30
EP2061890A22009-05-27
US201916577484A2019-09-20
US20200010517A12020-01-09
Other References:
MEDINA ET AL., PLOS ONE, vol. 6, no. 8, 2011, pages e23055
NATIONAL ENTERIC DISEASE SURVEILLANCE: SALMONELLA ANNUAL REPORT, 2013
NEDIALKOVA ET AL., PLOS PATHOG., vol. 10, no. 1, January 2014 (2014-01-01), pages e1003844
WERNER ET AL., PROC. NATL. ACAD. SCI. USA, vol. 108, 2011, pages 14061 - 14066
GROSSBRAUN, MOL. GEN. GENET., vol. 251, 1996, pages 388 - 396
BARRETEAU ET AL., MICROBIAL DRUG RESISTANCE, vol. 18, 2012, pages 222 - 229
STEPHEN F. ALTSCHULTHOMAS L. MADDENALEJANDRO A. SCHAFFERJINGHUI ZHANGZHENG ZHANGWEBB MILLERDAVID J. LIPMAN: "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", NUCLEIC ACIDS RES., vol. 25, 1997, pages 3389 - 3402, XP002905950, DOI: 10.1093/nar/25.17.3389
SCHULZ ET AL., PROC. NATL. ACAD. SCI. USA., vol. 112, 2015, pages E5454 - E5460
Attorney, Agent or Firm:
BLODIG, Wolfgang (DE)
Download PDF:
Claims:
CLAIMS

1. A protein comprising at least any one of the following amino acid sequence segments: (a-vii) the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25), (a-viii) the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26), (a-ix) the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27), (a-x) the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28), (a-xi) the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or (a-xii) the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34); or

(b-vii) a segment having at least 70% sequence identity to the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25),

(b-viii) a segment having at least 70% sequence identity to the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26),

(b-ix) a segment having at least 70% sequence identity to the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27),

(b-x) a segment having at least 70% sequence identity to the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28);

(b-xi) a segment having at least 70% sequence identity to the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or (b-xii) a segment having at least 70% sequence identity to the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34); or

(c-vii) a segment having at least 80% sequence similarity to the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25),

(c-viii) a segment having at least 80% sequence similarity to the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26),

(c-ix) a segment having at least 80% sequence similarity to the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27),

(c-x) a segment having at least 80% sequence similarity to the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28),

(c-xi) a segment having at least 80% sequence similarity to the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or (c-xii) a segment having at least 80% sequence similarity to the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34); or (d-vii) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25),

(d-viii) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26),

(d-ix) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27),

(d-x) a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28),

(d-xi) a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or

(d-xii) a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34).

2. The protein according to claim 1 , further comprising a cytotoxic or catalytic domain having any one or more of the following activities: a membrane pore-forming activity, DNase activity, RNase activity, or a cell wall degrading activity.

3. The protein according to claim 1 , comprising a cytotoxic or catalytic domain comprising any one of the following amino acid sequence segments:

(a-vii)’ the segment from amino acid residue 347 to 519 of ScolEIc (SEQ ID NO: 25),

(a-viii)’ the segment from amino acid residue 347 to 519 of ScolEld (SEQ ID NO: 26),

(a-ix)’ the segment from amino acid residue 345 to 517 of ScolEle (SEQ ID NO: 27),

(a-x)’ the segment from amino acid residue 139 to 269 of ScolMa (SEQ ID NO: 28),

(a-xi)’ the segment from amino acid residue 139 to 269 of ScoIMb (SEQ ID NO: 33), or

(a-xii)’ the segment from amino acid residue 139 to 269 of ScolMc (SEQ ID NO: 34); or

(b-vii)’ a segment having at least 70% sequence identity to the segment from amino acid residue 347 to 519 ScolEIc (SEQ ID NO: 25),

(b-viii)’ a segment having at least 70% sequence identity to the segment from amino acid residue 347 to 519 ScolEld (SEQ ID NO: 26),

(b-ix)’ a segment having at least 70% sequence identity to the segment from amino acid residue 345 to 517 ScolEle (SEQ ID NO: 27), (b-x)’ a segment having at least 70% sequence identity to the segment from amino acid residue 139 to 269 ScolMa (SEQ ID NO: 28),

(b-xi)’ a segment having at least 70% sequence identity to the segment from amino acid residue 139 to 269 ScoIMb (SEQ ID NO: 33), or (b-xii)’ a segment having at least 70% sequence identity to the segment from amino acid residue 139 to 269 ScolMc (SEQ ID NO: 34); or

(c-vii)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 347 to 519 ScolEIc (SEQ ID NO: 25),

(c-viii)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 347 to 519 ScolEld (SEQ ID NO: 26),

(c-ix)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 345 to 517 ScolEle (SEQ ID NO: 27),

(c-x)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 139 to 269 ScolMa (SEQ ID NO: 28),

(c-xi)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 139 to 269 ScoIMb (SEQ ID NO: 33), or (c-xii)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 139 to 269 ScolMc (SEQ ID NO: 34); or

(d-vii)’ a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 347 to 519 of ScolEIc (SEQ ID NO: 25),

(d-viii)’ a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 347 to 519 of ScolEld (SEQ ID NO: 26),

(d-ix)’ a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 345 to 517 of ScolEle (SEQ ID NO: 27),

(d-x)’ a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 of ScolMa (SEQ ID NO: 28,

(d-xi)’ a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 of ScoIMb (SEQ ID NO: 33, or

(d-xii)’ a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 of ScolMc (SEQ ID NO: 34. 4. A protein comprising or consisting of any one of the following amino acid sequences: (A-vii) SEQ ID NO: 25,

(A-viii) SEQ ID NO: 26,

(A-ix) SEQ ID NO: 27,

(A-x) SEQ ID NO: 28,

(A-xi) SEQ ID NO: 33, or

(A-xii) SEQ ID NO: 34; or

(B-vii) an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 25,

(B-viii) an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 26,

(B-ix) an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 27,

(B-x) an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 28,

(B-xi) an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 33, or

(B-xii) an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 34; or

(C-vii) an amino acid sequence having at least 80% sequence similarity to the amino acid sequence of SEQ ID NO: 25,

(C-viii) an amino acid sequence having at least 80% sequence similarity to the amino acid sequence of SEQ ID NO: 26,

(C-ix) an amino acid sequence having at least 80% sequence similarity to the amino acid sequence of SEQ ID NO: 27,

(C-x) an amino acid sequence having at least 80% sequence similarity to the amino acid sequence of SEQ ID NO: 28,

(C-xi) an amino acid sequence having at least 80% sequence similarity to the amino acid sequence of SEQ ID NO: 33, or

(C-xii) an amino acid sequence having at least 80% sequence similarity to the amino acid sequence of SEQ ID NO: 34; or

(D-vii) an amino acid sequence having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 25, (D-viii) an amino acid sequence having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 26,

(D-ix) an amino acid sequence having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 27, or (D-x) an amino acid sequence having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 28,

(D-xi) an amino acid sequence having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 33, or

(D-xii) an amino acid sequence having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 34; or

(E-vii) an amino acid sequence comprising or consisting of at least 425 contiguous amino acid residues of SEQ ID NO: 25,

(E-viii) an amino acid sequence comprising or consisting of at least 425 contiguous amino acid residues of SEQ ID NO: 26,

(E-ix) an amino acid sequence comprising or consisting of at least 425 contiguous amino acid residues of SEQ ID NO: 27,

(E-x) an amino acid sequence comprising or consisting of at least 215 contiguous amino acid residues of SEQ ID NO: 28,

(E-xi) an amino acid sequence comprising or consisting of at least 215 contiguous amino acid residues of SEQ ID NO: 33, or

(E-xii) an amino acid sequence comprising or consisting of at least 215 contiguous amino acid residues of SEQ ID NO: 34.

5. The protein according to claim 1 or 4, wherein said protein is that of item (a-vii), (b-vii), (c-vii), or (d-vii), or

- (A-vii), (B-vii), (C-vii), (D-vii) or (E-vii); or wherein said protein is that of item (a-viii), (b-viii), (c-viii), or (d-viii), or

- (A-viii), (B-viii), (C-viii), (D-viii) or (E-viii); or wherein said protein is that of item

- (a-x), (b-x), (c-x), or (d-x), or

- (A-x), (B-x), (C-x), (D-x) or (E-x); or wherein said protein is that of item (a-xi), (b-xi), (c-xi), or (d-xi), or - (A-xi), (B-xi), (C-xi), (D-xi) or (E-xi); or wherein said protein is that of item (a-xii), (b-xii), (c-xii), or (d-xii), or

- (A-xii), (B-xii), (C-xii), (D-xii) or (E-xii).

6. The protein according to any one of claims 1 to 3, wherein the protein is as defined in any one of (b-x), (c-x), (d-x), (b-xi), (c-xi), (d-xi), (b-xii), (c-xii), (d-xii), (b-xi)’, (c-xi)’, (d- xi)’, (b-xii)’, (c-xii)’, or (d-xii)’, and the amino acid residue of said protein corresponding to residue 155 of SEQ ID NO: 33 is Pro and/or the amino acid residue corresponding to residue 246 of SEQ ID NO: 33 is Arg or Lys, preferably Arg.

7. The protein according to claim 4, wherein the protein is as defined in any one of (B-x), (C-x), (D-x), (E-x), (B-xi), (C-xi), (D-xi), (E-xi), (B-xii), (C-xii), (D-xii), or (E-xii), and the amino acid residue of said protein corresponding to residue 155 of SEQ ID NO: 33 is Pro and/or the amino acid residue corresponding to residue 246 of SEQ ID NO: 33 is Arg or Lys, preferably Arg.

8. The protein according to claim 6 or 7, wherein the amino acid residues corresponding to residues 76 and 84 of SEQ ID NO: 33 are both Gin.

9. The protein according to any one of claims 1 to 8, that is capable of exerting a cytotoxic effect on Salmonella.

10. The protein according to any one of claims 1 to 9, wherein the toxicity of a protein of claim 1 , notably of classes (b) to (d) of claim 1 , against Salmonella enterica is such that it and the protein of SEQ ID NO: 1 produce spots free of viable bacteria of Salmonella enterica ssp. enterica serovar Newport strain ATCC® 6962™* of the same diameter 12 hours after spotting 5 microliters of a solution of said protein of classes (b) to (d) and the protein of SEQ ID NO: 1 onto a soft agar overlay plate seeded with 0.14 mL bacterial solution of 1x107 cfu/mL per cm2 of the sensitive Salmonella enterica strain and subsequent incubation of the agar plate at 37°C, wherein the concentration of the protein of classes (b) to (d) is at most 5 times that of the comparative solution of the protein of SEQ ID NO: 1.

11. A composition comprising one or more proteins as defined in any one of claims claim 1 to 10, preferably comprising one or more proteins according to claim 4, 7 or 8.

12. The composition according to claim 11, comprising a protein according to item (a-iv), (b-iv), (c-iv) or (d-iv), and/or according to item (a- v), (b-v), (c-v) or (d-v) of claim 1 ; and a protein according to item (a-i), (b-i), (c-i) or (d-i); said composition further comprising a protein according to item (a-vii), (b-vii), (c-vii) or (d-vii), and/or according to item (a-viii), (b-viii), (c-viii) or (d-viii), and/or according to item (a-ix), (b-ix), (c-ix) or (d-ix) of claim 1, and/or according to item (a-x), (b-x), (c-x) or (d-x), and/or according to item (a-xi), (b-xi), (c-xi) or (d-xi), and/or according to item (a- xii), (b-xii), (c-xii) or (d-xii).

13. The composition according to claim 11 , comprising: a protein according to item (a-iv), (b-iv), (c-iv) or (d-iv), and/or according to item (a- v), (b-v), (c-v) or (d-v); and a protein according to item (a-x), (b-x), (c-xi) or (d-x).

14. The composition according to claim 11, comprising: a protein according to item (a-iv), (b-iv), (c-iv) or (d-iv), and/or according to item (a- v), (b-v), (c-v) or (d-v); and a protein according to item (a-xi), (b-xi), (c-xi) or (d-xi).

15. The composition according to claim 11 , comprising: a protein according to item (a-iv), (b-iv), (c-iv) or (d-iv), and/or according to item (a- v), (b-v), (c-v) or (d-v); and a protein according to item (a-xii), (b-xii), (c-xii) or (d-xii).

16. The composition according to claim 11, comprising a a protein according to item (A-iv), (B-iv), (C-iv), (D-iv) or (E-iv), and/or according to item (A-v), (B-v), (C-v), (D-v) or (E-v) of claim 4; and a protein according to item (A-x), (B-x), (C-x), (D-x) or (E-x), optionally as further defined claims 7 or 8.

17. The composition according to claim 11 , comprising a a protein according to item (A-iv), (B-iv), (C-iv), (D-iv) or (E-iv), and/or according to item (A-v), (B-v), (C-v), (D-v) or (E-v) of claim 4; and a protein according to item (A-xi), (B-xi), (C-xi), (D-xi) or (E-xi), optionally as further defined claims 7 or 8.

18. The composition according to claim 11 , comprising a a protein according to item (A-iv), (B-iv), (C-iv), (D-iv) or (E-iv), and/or according to item (A-v), (B-v), (C-v), (D-v) or (E-v) of claim 4; and a protein according to item (A-xii), (B-xii), (C-xii), (D-xii) or (E-xii), optionally as further defined claim 7 or 8.

19. A composition comprising one or more proteins as defined in claim 4, 7 or 8.

20. The composition according to claim 20, comprising a protein according to item (A-iii), (B- iii), (C-iii), (D-iii) or (E-iii), and/or according to item (A-vii), (B-vii), (C-vii), (D-vii) or (E-vii), and/or according to item (A-viii), (B-viii), (C-viii), (D-viii) or (E-viii), and/or according to item (A-ix), (B-ix), (C-ix), (D-ix) or (E-ix), and/or according to item (A-x), (B-x), (C-x), (D- x) or (E-x), or any combination thereof.

21. The composition according to claim 20, comprising a a protein according to item (A-iv), (B-iv), (C-iv), (D-iv) or (E-iv), and/or according to item (A-v), (B-v), (C-v), (D-v) or (E-v) of claim 4; and a protein according to item (A-x), (B-x), (C-x), (D-x) or (E-x), optionally as further defined claim 7 or 8.

22. The composition according to claim 19, comprising a a protein according to item (A-iv), (B-iv), (C-iv), (D-iv) or (E-iv), and/or according to item (A-v), (B-v), (C-v), (D-v) or (E-v) of claim 4; and a protein according to item (A-xi), (B-xi), (C-xi), (D-xi) or (E-xi), optionally as further defined claim 7 or 8.

23. The composition according to claim 19, comprising a a protein according to item (A-iv), (B-iv), (C-iv), (D-iv) or (E-iv), and/or according to item (A-v), (B-v), (C-v), (D-v) or (E-v) of claim 4; and a protein according to item (A-xii), (B-xii), (C-xii), (D-xii) or (E-xii), optionally as further defined claim 7 or 8.

24. The composition according to claim 19, comprising a a protein according to item (A-x), (B-x), (C-x), (D-x) or (E-x), and/or according to item (A-xi), (B-xi), (C-xi), (D-xi) or (E-xi), and/or (A-xii), (B-xii), (C-xii), (D-xii) or (E- xii), optionally as further defined claim 7 or 8; and a protein according to item (A-viii), (B-viii), (C-viii), (D-viii) or (E-iii).

25. The composition according to any one of claims 11 to 24, wherein said composition is a plant material or extract thereof, wherein the plant material is a material from a plant having expressed said protein; or wherein said composition is an aqueous solution containing said protein.

26. A method of preventing or reducing infection or contamination of an object with Salmonella, comprising contacting said object with a protein as defined in claim 1 or 4, optionally as further defined in dependent claims thereto, or a composition as defined in claim 11 or 25.

27. The method according to claim 26, wherein said object is sprayed with said aqueous solution or is immersed into said aqueous solution, or said object is immersed for at least 10 seconds, preferably for at least 1 minute, preferably for at least 5 minutes into an aqueous solution of said protein.

28. The method according to claim 26 or 27, wherein said object is food or animal feed; or said food is whole animal carcass, meat, eggs, raw fruit or vegetable.

29. A method of treating infection with Salmonella of a subject in need thereof, comprising administering to said subject a protein as defined in claim 1 or 4, optionally as further defined in dependent claims thereto, or a composition as defined in claim 11 or 25.

30. The method according to claim 26 or 29, wherein said Salmonella is Salmonella enterica.

Description:
BACTERIOCINS FOR CONTROL OF SALMONELLA ENTERICA

FIELD OF THE INVENTION

The present invention provides proteins capable of exerting a cytotoxic effect on Salmonella, referred to as salmocins. The invention also provides compositions, including pharmaceutical compositions, comprising one or more of said proteins. Also provided is a method of preventing or reducing infection or contamination of an object with Salmonella, a method of treating infection with Salmonella of a subject or patient in need thereof, and a process of producing a composition comprising the protein.

BACKGROUND OF THE INVENTION

Salmonella is a rod-shaped Gram-positive bacterium of Enterobacteriaceae family. Salmonella enterica is the type species and is further divided into six subspecies with S. enterica ssp. enterica as subspecies that includes over 2500 serovars. Salmonella infections are common and can result in protean clinical manifestations, ranging from an asymptomatic state to very severe diseases. Salmonella enterica causes an estimated 1 million illnesses in the United States each year, resulting in an estimated 19,000 hospitalizations and 380 deaths. Over the last 5 years, 46 Salmonella outbreaks have been recorded in USA, most of the food poisonings being due to contaminated poultry or vegetables, but also red meats and fish (CDC website).

Prevention Salmonella infections or reducing contamination of food with Salmonella requires control measures at all stages of the food chain, from agricultural production on the farm to processing, manufacturing and preparation of foods in both commercial establishments and household kitchens. Good hygienic practices reduce contamination of food with Salmonella, but do not guarantee the absence of Salmonella from products. Preventive measures for Salmonella in the home are similar to those used against other foodborne bacteria. Basic food hygiene practices, such as "cook thoroughly", are recommended as a preventive measure against salmonellosis, cf. WHO at www.who.int/mediacentre/factsheets/fs139/en/.

Antimicrobial therapy may be used to treat humans or animals suffering from Salmonella infections. However, antimicrobial resistance is a global public health concern and Salmonella is one of the microorganisms in which some resistant serotypes have emerged, affecting the food chain.

Most of the above mentioned methods of preventing or treating Salmonella infections or reducing contamination with Salmonella are methods that are essentially independent from a particular pathogenic bacterium or from a particular serotype of Salmonella. This has the advantage that little prior knowledge of the specific Salmonella strain or Salmonella enterica serotype in question is necessary before counter-measures are taken. However, the above mentioned methods of preventing Salmonella infection or reducing contamination with Salmonella such as heating are not always applicable or change the treated good or food in undesirable ways. Other methods may have turned out non-effective with a particular patient. There is therefore a need for further methods of preventing or treating Salmonella infections or contamination, or methods for reducing or preventing contamination of objects with Salmonella, notably with Salmonella enterica ssp. enterica.

Some proteins active against Salmonella were previously described (WO 2018/172065) that were referred to as “salmocins”. While the salmocins described in WO 2018/172065 show high activity against Salmonella species and strains, there are limitations in that their toxic activities are similar, whereby the spectrum of activity is limited even if different ones of the known salmocins are combined. Further, mid- or long-term use of the same or similar salmocin types may select Salmonella strains that are resistant to the salmocins used. Therefore, additional salmocins that act by other mechanisms are highly desired. Notably, compositions comprising two or more salmocins of different types are desired for broadening the range of activity against many different Salmonella species.

For proteins for technical applications, ease of production, purification, and storage are important aspects that can be decisive for the question whether the technical applications are feasible.

It is an object of the invention to provide methods for preventing or treating Salmonella infections such as food-borne Salmonella infections. It is another object to provide methods for preventing or reducing contamination of objects, notably food, with Salmonella. It is a further object to provide methods for preventing or treating Salmonella infections and/or methods for reducing contamination of objects with Salmonella, that are effective against a wide range of Salmonella serogroups. It is another object to provide further salmocins that act by different or additional mechanisms against Salmonella. It is a further object of the invention to provide agents active against Salmonella that can be produced and/or purified and/or stored conveniently with high stability. Further, compounds, agents and compositions for such methods are desired.

SUMMARY OF THE INVENTION

Accordingly, the invention provides the subject-matter defined in the claims. The invention also provides: (1) A protein, preferably capable of exerting a cytotoxic effect on Salmonella, said protein comprising at least any one of the following amino acid sequence segments (a-i) to (a-x) or derivatives thereof as defined in (b-i) to (b-x), (c-i) to (c-x) or (d-i) to (d-x):

(a-i) the segment from amino acid residue 316 to 449 of ScolE2 (SEQ ID NO: 1), (a-ii) the segment from amino acid residue 315 to 483 of ScolE3 (SEQ ID NO: 2), (a-iii) the segment from amino acid residue 318 to 451 of ScolE7 (SEQ ID NO: 3), (a-iv) the segment from amino acid residue 174 to 297 of ScolEla (SEQ ID NO: 4), (a-v) the segment from amino acid residue 198 to 322 of ScolElb (SEQ ID NO: 5), (a-vi) a segment comprising at least 200 contiguous amino acid residues of Spst of SEQ ID NO: 6,

(a-vii) the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25), (a-viii) the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26), (a-ix) the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27), (a-x) the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28), (a-xi) the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or (a-xii) the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34); or

(b-i) a segment having at least 75% sequence identity to the segment from amino acid residue 316 to 449 of ScolE2 (SEQ ID NO: 1),

(b-ii) a segment having at least 70% sequence identity to the segment from amino acid residue 315 to 483 of ScolE3 (SEQ ID NO: 2),

(b-iii) a segment having at least 77% sequence identity to the segment from amino acid residue 318 to 451 of ScolE7 (SEQ ID NO: 3),

(b-iv) a segment having at least 70% sequence identity to the segment from amino acid residue 174 to 297 of ScolEla (SEQ ID NO: 4),

(b-v) a segment having at least 70% sequence identity to the segment from amino acid residue 198 to 322 of ScolElb (SEQ ID NO: 5),

(b-vi) a segment having at least 70% sequence identity to a segment comprising at least 200 contiguous amino acid residues of Spst of SEQ ID NO: 6,

(b-vii) a segment having at least 70% sequence identity to the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25),

(b-viii) a segment having at least 70% sequence identity to the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26),

(b-ix) a segment having at least 70% sequence identity to the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27),

(b-x) a segment having at least 70% sequence identity to the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28), (b-xi) a segment having at least 70% sequence identity to the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or (b-xii) a segment having at least 70% sequence identity to the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34); or

(c-i) a segment having at least 85% sequence similarity to the segment from amino acid residue 316 to 449 of ScolE2 (SEQ ID NO: 1),

(c-ii) a segment having at least 80% sequence similarity to the segment from amino acid residue 315 to 483 of ScolE3 (SEQ ID NO: 2),

(c-iii) a segment having at least 85% sequence similarity to the segment from amino acid residue 318 to 451 of ScolE7 (SEQ ID NO: 3),

(c-iv) a segment having at least 80% sequence similarity to the segment from amino acid residue 174 to 297 of ScolEla (SEQ ID NO: 4),

(c-v) a segment having at least 80% sequence similarity to the segment from amino acid residue 198 to 322 of ScolElb (SEQ ID NO: 5),

(c-vi) a segment having at least 80% sequence similarity to a segment comprising at least 200 contiguous amino acid residues of Spst of SEQ ID NO: 6,

(c-vii) a segment having at least 80% sequence similarity to the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25),

(c-viii) a segment having at least 80% sequence similarity to the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26),

(c-ix) a segment having at least 80% sequence similarity to the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27),

(c-x) a segment having at least 80% sequence similarity to the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28),

(c-xi) a segment having at least 80% sequence similarity to the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or (c-xii) a segment having at least 80% sequence similarity to the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34); or

(d-i) a segment having from 1 to 25 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 316 to 449 of ScolE2 (SEQ ID NO: 1),

(d-ii) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 315 to 483 of ScolE3 (SEQ ID NO: 2), (d-iii) a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 318 to 451 of ScolE7 (SEQ ID NO: 3),

(d-iv) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 174 to 297 of ScolEla (SEQ ID NO: 4),

(d-v) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 198 to 322 of ScolElb (SEQ ID NO: 5),

(d-vi) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to a segment comprising at least 200 contiguous amino acid residues of Spst of SEQ ID NO: 6,

(d-vii) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25),

(d-viii) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26),

(d-ix) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27),

(d-x) a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28),

(d-xi) a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or

(d-xii) a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34).

(2) The protein according to (1), comprising a cytotoxic or catalytic domain having any one or more of the following activities: a membrane pore-forming activity, DNase activity, RNase activity, or a cell wall degrading activity such as muramidase activity.

(3) The protein according to (1) or (2), comprising a cytotoxic or catalytic domain comprising or consisting of any one of the following amino acid sequence segments (a-i)’ to (a- x)’, or derivatives thereof or amino acid sequence segments as defined in any one of (b-i)’ to (b- x)’, (c-i)’ to (c-x)’ or (d-i)’ to (d-x)’:

(a-i)’ the segment from amino acid residue 453 to 582 of ScolE2 (SEQ ID NO: 1),

(a-ii)’ the segment from amino acid residue 501 to 584 of ScolE3 (SEQ ID NO: 2),

(a-iii)’ the segment from amino acid residue 455 to 584 of ScolE7 (SEQ ID NO: 3),

(a-iv)’ the segment from amino acid residue 306 to 478 of ScolEla (SEQ ID NO: 4), (a-v)’ the segment from amino acid residue 350 to 522 of ScolElb (SEQ ID NO: 5), (a-vi)’ the segment from amino acid residue 112 to 288 of Spst (SEQ ID NO: 6),

(a-vii)’ the segment from amino acid residue 347 to 519 of ScolEIc (SEQ ID NO: 25), (a-viii)’ the segment from amino acid residue 347 to 519 of ScolEld (SEQ ID NO: 26), (a-ix)’ the segment from amino acid residue 345 to 517 of ScolEle (SEQ ID NO: 27), (a-x)’ the segment from amino acid residue 139 to 269 of ScolMa (SEQ ID NO: 28), (a-xi)’ the segment from amino acid residue 139 to 269 of ScoIMb (SEQ ID NO: 33), or (a-xii)’ the segment from amino acid residue 139 to 269 of ScolMc (SEQ ID NO: 34); or

(b-i)’ a segment having at least 70% sequence identity to the segment from amino acid residue 453 to 582 of ScolE2 (SEQ ID NO: 1),

(b-ii)’ a segment having at least 70% sequence identity to the segment from amino acid residue 501 to 584 of ScolE3 (SEQ ID NO: 2),

(b-iii)’ a segment having at least 70% sequence identity to the segment from amino acid residue 455 to 584 of ScolE7 (SEQ ID NO: 3),

(b-iv)’ a segment having at least 70% sequence identity to the segment from amino acid residue 306 to 478 of ScolEla (SEQ ID NO: 4),

(b-v)’ a segment having at least 70% sequence identity to the segment from amino acid residue 350 to 522 of ScolElb (SEQ ID NO: 5),

(b-vi)’ a segment having at least 70% sequence identity to the segment from amino acid residue 112 to 288 of Spst (SEQ ID NO: 6),

(b-vii)’ a segment having at least 70% sequence identity to the segment from amino acid residue 347 to 519 ScolEIc (SEQ ID NO: 25),

(b-viii)’ a segment having at least 70% sequence identity to the segment from amino acid residue 347 to 519 ScolEld (SEQ ID NO: 26),

(b-ix)’ a segment having at least 70% sequence identity to the segment from amino acid residue 345 to 517 ScolEle (SEQ ID NO: 27),

(b-x)’ a segment having at least 70% sequence identity to the segment from amino acid residue 139 to 269 ScolMa (SEQ ID NO: 28),

(b-xi)’ a segment having at least 70% sequence identity to the segment from amino acid residue 139 to 269 ScoIMb (SEQ ID NO: 33), or (b-xii)’ a segment having at least 70% sequence identity to the segment from amino acid residue 139 to 269 ScolMc (SEQ ID NO: 34); or

(c-i)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 453 to 582 of ScolE2 (SEQ ID NO: 1),

(c-ii)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 501 to 584 of ScolE3 (SEQ ID NO: 2),

(c-iii)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 455 to 584 of ScolE7 (SEQ ID NO: 3),

(c-iv)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 306 to 478 of ScolEla (SEQ ID NO: 4),

(c-v)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 350 to 522 of ScolElb (SEQ ID NO: 5),

(c-vi)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 112 to 288 of Spst (SEQ ID NO: 6),

(c-vii)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 347 to 519 ScolEIc (SEQ ID NO: 25),

(c-viii)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 347 to 519 ScolEld (SEQ ID NO: 26),

(c-ix)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 345 to 517 ScolEle (SEQ ID NO: 27),

(c-x)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 139 to 269 ScolMa (SEQ ID NO: 28),

(c-xi)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 139 to 269 ScoIMb (SEQ ID NO: 33), or (c-xii)’ a segment having at least 80% sequence similarity to the segment from amino acid residue 139 to 269 ScolMc (SEQ ID NO: 34); or

(d-i)’ a segment having from 1 to 20 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 453 to 582 of ScolE2 (SEQ ID NO: 1),

(d-ii)’ a segment having from 1 to 20 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 501 to 584 of ScolE3 (SEQ ID NO: 2),

(d-iii)’ a segment having from 1 to 20 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 455 to 584 of ScolE7 (SEQ ID NO: 3), (d-iv)’ a segment having from 1 to 20 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 306 to 478 of ScolEla (SEQ ID NO: 4),

(d-v)’ a segment having from 1 to 20 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 350 to 522 of ScolElb (SEQ ID NO: 5),

(d-vi)’ a segment having from 1 to 20 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 112 to 288 of Spst (SEQ ID NO: 6),

(d-vii)’ a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 347 to 519 ScolEIc (SEQ ID NO: 25),

(d-viii)’ a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 347 to 519 ScolEld (SEQ ID NO: 26),

(d-ix)’ a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 345 to 517 ScolEle (SEQ ID NO: 27), or

(d-x)’ a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 ScolMa (SEQ ID NO: 28),

(d-xi)’ a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 ScoIMb (SEQ ID NO: 33), or

(d-xii)’ a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 ScolMc (SEQ ID NO: 34).

(4) The protein according to (1), (2) or (3), comprising a translocation domain comprising (or consisting of) any one of the following amino acid sequence segments (a-i)” to (a-v)”, (a- vii)” to (a-x)”, (a-xi)”, and (a-xii)”, or derivatives thereof (or amino acid sequence segments) as defined in any one of (b-i)” to (b-v)”, (b-vii)” to (b-x)”, (b-xi)” and (b-xii)”, (c-i)” to (c-v)”, (c-vii)” to (c-x)”, (c-xi)” and (c-xii)”, or (d-i)” to (d-v)”, (d-vii)” to (d-x)”, (d-xi)”, and (d-xii)”:

(a-i)” the segment from amino acid residue 43 to 313 of ScolE2 (SEQ ID NO: 1),

(a-ii)” the segment from amino acid residue 35 to 315 of ScolE3 (SEQ ID NO: 2),

(a-iii)” the segment from amino acid residue 43 to 316 of ScolE7 (SEQ ID NO: 3),

(a-iv)” the segment from amino acid residue 1 to 170 of ScolEla (SEQ ID NO: 4),

(a-v)” the segment from amino acid residue 1 to 195 of ScolElb (SEQ ID NO: 5), (a-vii)” the segment from amino acid residue 6 to 194 of ScolEIc (SEQ ID NO: 25), (a-viii)”the segment from amino acid residue 6 to 194 of ScolEld (SEQ ID NO: 26), (a-ix)”the segment from amino acid residue 5 to 192 of ScolEle (SEQ ID NO: 27),

(a-x)” the segment from amino acid residue 1 to 37 of ScolMa (SEQ ID NO: 28),

(a-xi)” the segment from amino acid residue 1 to 37 of ScoIMb (SEQ ID NO: 33), or (a-xii)” the segment from amino acid residue 1 to 37 of ScolMc (SEQ ID NO: 34); or

(b-i)” a segment having at least 75%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 43 to 313 of ScolE2 (SEQ ID NO: 1),

(b-ii)” a segment having at least 75%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 35 to 315 of ScolE3 (SEQ ID NO: 2),

(b-iii)” a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 43 to 316 of ScolE7 (SEQ ID NO: 3),

(b-iv)” a segment having at least 70%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 1 to 170 of ScolEla (SEQ ID NO: 4),

(b-v)” a segment having at least 70%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 1 to 195 of ScolElb (SEQ ID NO: 5),

(b-vii)” a segment having at least 70%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 6 to 194 of ScolEIc (SEQ ID NO: 25),

(b-viii)”a segment having at least 70%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 6 to 194 of ScolEld (SEQ ID NO: 26),

(b-ix)” a segment having at least 70%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 5 to 192 of ScolEle (SEQ ID NO: 27), (b-x)” a segment having at least 70%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 1 to 37 of ScolMa (SEQ ID NO: 28),

(b-xi)” a segment having at least 70%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 1 to 37 of ScoIMb (SEQ ID NO:

33), or

(b-xii)” a segment having at least 70%, preferably at least 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 1 to 37 of ScolMc (SEQ ID NO:

34); or

(c-i)” a segment having at least 85%, preferably at least 90% and more preferably at least 95% sequence similarity to the segment from amino acid residue 43 to 313 of ScolE2 (SEQ ID NO: 1),

(c-ii)” a segment having at least 85%, preferably at least 90% and more preferably at least 95% sequence similarity to the segment from amino acid residue 35 to 315 of ScolE3 (SEQ ID NO: 2),

(c-iii)” a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 43 to 316 of ScolE7 (SEQ ID NO: 3),

(c-iv)” a segment having at least 80%, preferably at least 90, more preferably at least 95% sequence similarity to the segment from amino acid residue 1 to 170 of ScolEla (SEQ ID NO: 4),

(c-v)” a segment having at least 80%, preferably at least 90%, more preferably at least 95% sequence similarity to the segment from amino acid residue 1 to 195 of ScolElb (SEQ ID NO: 5),

(c-vii)” a segment having at least 80%, preferably at least 90%, more preferably at least 95% sequence similarity to the segment from amino acid residue 6 to 194 of ScolEIc (SEQ ID NO: 25),

(c-viii)” a segment having at least 80%, preferably at least 90%, more preferably at least 95% sequence similarity to the segment from amino acid residue 6 to 194 of ScolEld (SEQ ID NO: 26),

(c-ix)” a segment having at least 80%, preferably at least 90%, more preferably at least 95% sequence similarity to the segment from amino acid residue 5 to 192 of ScolEle (SEQ ID NO: 27), (c-x)” a segment having at least 80%, preferably at least 90%, more preferably at least 95% sequence similarity to the segment from amino acid residue 1 to 37 of ScolMa (SEQ ID NO: 28),

(c-xi)” a segment having at least 80%, preferably at least 90%, more preferably at least 95% sequence similarity to the segment from amino acid residue 1 to 37 of ScoIMb (SEQ ID NO: 33), or

(c-xii)” a segment having at least 80%, preferably at least 90%, more preferably at least 95% sequence similarity to the segment from amino acid residue 1 to 37 of ScolMc (SEQ ID NO: 34); or

(d-i)” a segment having from 1 to 50, preferably from 1 to 40, more preferably from 1 to 30, even more preferably from 1 to 20 and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 43 to 313 of ScolE2 (SEQ ID NO: 1),

(d-ii)” a segment having from 1 to 50, preferably from 1 to 40, more preferably from 1 to 30, even more preferably from 1 to 20 and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 35 to 315 of ScolE3 (SEQ ID NO: 2),

(d-iii)” a segment having from 1 to 30, preferably from 1 to 20 and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 43 to 316 of ScolE7 (SEQ ID NO: 3),

(d-iv)” a segment having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 1 to 170 of ScolEla (SEQ ID NO: 4),

(d-v)” a segment having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 1 to 195 of ScolElb (SEQ ID NO: 5),

(d-vii)” a segment having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 6 to 194 of ScolEIc (SEQ ID NO: 25),

(d-viii)”a segment having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 6 to 194 of ScolEld (SEQ ID NO: 26), (d-ix)” a segment having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 5 to 192 of ScolEle (SEQ ID NO: 27), or

(d-x)” a segment having from 1 to 7, preferably from 1 to 5, more preferably from 1 to 3, and most preferably from 1 to 3 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 1 to 37 of ScolMa (SEQ ID NO: 28),

(d-xi)” a segment having from 1 to 7, preferably from 1 to 5, more preferably from 1 to 3, and most preferably from 1 to 3 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 1 to 37 of ScoIMb (SEQ ID NO: 33), or

(d-xii)” a segment having from 1 to 7, preferably from 1 to 5, more preferably from 1 to 3, and most preferably from 1 to 3 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 1 to 37 of ScolMc (SEQ ID NO: 34).

(5) The protein according to any one of (1) to (4), wherein the protein is as defined in any one of items: (b-xi), (c-xi), (d-xi), (b-xii), (c-xii), (d-xii), (b-xi)’, (c-xi)’, (d-xi)’, (b-xii)’, (c-xii)’, (d-xii)’, (b-xi)”, (c-xi)”, (d-xi)”, (b-xii)”, (c-xii)’, or (d-xii)”; the amino acid residue of said protein corresponding to residue 155 of SEQ ID NO: 33 is Pro and/or the amino acid residue corresponding to residue 246 of SEQ ID NO: 33 is Arg or Lys, preferably Arg.

(6) The protein according to (5), wherein the amino acid residues corresponding to residues 76 and 84 of SEQ ID NO: 33 are Gin.

(7) The protein according to any one of (1) to (6), for use in a method of treating infection or contamination with Salmonella such as Salmonella enterica, preferably Salmonella enterica ssp. enterica.

(8) The protein according to any one of (1) to (7), wherein the toxicity of a protein of claim 1 , notably of classes (b) to (d) of claim 1, against Salmonella enterica is such that it and the protein of SEQ ID NO: 1 produce spots free of viable bacteria of Salmonella enterica ssp. enterica serovar Newport strain ATCC® 6962™* of the same diameter 12 hours after spotting 5 microliters of a solution of said protein of classes (b) to (d) and the protein of SEQ ID NO: 1 onto a softagar overlay plate seeded with 0.14 mL bacterial solution of 1x107 cfu/mL per cm2 of the sensitive Salmonella enterica strain and subsequent incubation of the agar plate at 37°C, wherein the concentration of the protein of classes (b) to (d) is at most 5 times that of the comparative solution of the protein of SEQ ID NO: 1.

(9) A protein comprising or consisting of any one of the following amino acid sequences:

(A-x) SEQ ID NO: 28,

(A-xi) SEQ ID NO: 33, or

(A-xii) SEQ ID NO: 34; or

(B-x) an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 28,

(B-xi) an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 33, or

(B-xii) an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 34; or

(C-x) an amino acid sequence having at least 80% sequence similarity to the amino acid sequence of SEQ ID NO: 28,

(C-xi) an amino acid sequence having at least 80% sequence similarity to the amino acid sequence of SEQ ID NO: 33, or

(C-xii) an amino acid sequence having at least 80% sequence similarity to the amino acid sequence of SEQ ID NO: 34; or

(D-x) an amino acid sequence having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 28,

(D-xi) an amino acid sequence having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 33, or

(D-xii) an amino acid sequence having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 34; or

(E-x) an amino acid sequence comprising or consisting of at least 215 contiguous amino acid residues of SEQ ID NO: 28,

(E-xi) an amino acid sequence comprising or consisting of at least 215 contiguous amino acid residues of SEQ ID NO: 33, or

(E-xii) an amino acid sequence comprising or consisting of at least 215 contiguous amino acid residues of SEQ ID NO: 34.

(10) The protein according to (9), wherein the protein is as defined in (B-x), (C-x), (D-x), or (E-x), preferably as defined in (B-xi), (C-xi), (D-xi), (E-xi), (B-xii), (C-xii), (D-xii), or (E-xii), and the amino acid residue (of said protein) corresponding to residue 155 of SEQ ID NO: 33 is Pro and/or the amino acid residue corresponding to residue 246 of SEQ ID NO: 33 is Arg or Lys, preferably Arg.

(11) The protein according to (9) or (10), wherein the amino acid residues corresponding to residues 76 and 84 of SEQ ID NO: 33 are Gin.

(12) A composition comprising one or more proteins as defined in any one of (1) to (11).

(13) The composition according to (12), wherein said one or more proteins is or comprises ScolEIc, ScolEld, ScolEle, ScolMa, ScoIMb, or ScolMc, or a derivative of ScolEIc, ScolEld, ScolEle, ScolMa, ScoIMb, or ScolMc, said derivatives as defined in items (b), (c), and (d) of item (1) or, preferably, as defined in items (B), (C), and (D) of item (9).

(14) The composition according to (12) or (13), comprising two or more proteins selected from at least two different classes (i) to (x), (xi) and (xii) as defined in (1), (2), or (3), preferably two or more proteins selected from at least two different classes (i) to (xii) as defined in (1), (2), or (3), even more preferably two or more proteins selected from at least two different classes (iv), (v) and (x) to (xii) as defined in any one of (1) to (4).

(15) The composition according to (14), comprising at least a protein of a sub-class (x) (of any of classes (a) to (d)) and a protein of a sub-class (v) (of any of classes (a) to (d)); or comprising at least a protein of a sub-class (xi) (of any of classes (a) to (d)) and a protein of a sub-class (v) (of any of classes (a) to (d)); or comprising at least a protein of a sub-class (xii) (of any of classes (a) to (d)) and a protein of a sub-class (v) (of any of classes (a) to (d)).

(16) The composition according to any one of (12) to (15) for use in a method of treating infection with Salmonella ; preferably Salmonella enterica, more preferably Salmonella enterica ssp. enterica.

(17) The composition according to any one of (12) to (16), wherein said composition is a plant material or extract thereof, wherein the plant material is a material from a plant having expressed said protein, preferably an edible plant having expressed said protein.

(18) The composition according to (17), wherein said plant material is material from a plant selected from the group consisting of spinach, chard, beetroot, carrot, sugar beet, leafy beet, amaranth, Nicotiana, and/or said plant material is one or more leaves, roots, tubers, or seeds, or a crushed, milled or comminuted product of said leaves, roots, tubers, or seeds.

(19) The composition according to any one of (12) to (18), wherein said composition is an aqueous solution containing said protein.

(20) The composition according to (19), wherein the concentration of said protein, or if the compositions contains two or more different proteins, said proteins, in said aqueous solution is from 0.0001 to 1 mg/ml, preferably from 0.001 to 0.1 mg/ml, more preferably from 0.005 to 0.05 mg/ml; or from 0.1 to 15 mg/kg food, preferably from 0.5 to 10 mg/kg, more preferably from 0.1 to 5 mg/kg food.

(21) The composition according to any one of (12) to (20), comprising a protein according to item (A-iv), (B-iv), (C-iv), (D-iv) or (E-iv), and/or a protein according to item (A-x), (B-x), (C-x), (D-x) or (E-x); or comprising a protein according to item (A-v), (B-v), (D-v) or (E-v), and/or a protein according to item (A-x), (B-x), (D-x) or (E-x); wherein preferred embodiments defined herein may be combined with the embodiments defined in this item (21).

(22) A method of preventing or reducing infection or contamination of an object with Salmonella, comprising contacting said object with a protein as defined in any one of (1) to (11) or a composition as defined in any one of (12) to (21).

(23) The method according to (22), wherein said object is sprayed with said aqueous solution or is immersed into said aqueous solution.

(24) The method according to (22) to (23), wherein said object is immersed for at least 10 seconds, preferably for at least 1 minute, preferably for at least 5 minutes into an aqueous solution of said protein.

(25) The method according to any one of (22) to (24), wherein said object is food or animal feed.

(26) The method according to (25), wherein said food is whole animal carcass, meat, eggs, raw fruit or vegetable, preferably said food is meet, raw fruit or vegetable, more preferably said food is meat.

(27) A method of treating infection with Salmonella of a subject in need thereof, comprising administering to said subject a protein as defined in any one of (1) to (11) or a composition as defined in any one of (12) to (21).

(28) The method according to any one of (22) to (27), wherein said Salmonella is Salmonella entehca, preferably Salmonella enterica ssp. enterica.

(29) A process of producing a composition comprising a protein as defined in any one of (1) to (11), said process comprising the following steps:

(i) expressing said protein in a plant, preferably an edible plant or Nicotiana,

(ii) harvesting plant material containing expressed protein from said plant,

(iii) extracting said protein from said plant material using an aqueous buffer to obtain a composition containing said protein,

(iv) optionally removing undesired contaminants from said composition.

(30) The protein according to (1) or (9), wherein said protein is that of item

(a-vii), (b-vii), (c-vii), or (d-vii), each optionally in combination with item (3), or

- (A-vii), (B-vii), (C-vii), (D-vii) or (E-vii); or wherein said protein is that of item

(a-viii), (b-viii), (c-viii), or (d-viii), each optionally in combination with item (3), or

- (A-viii), (B-viii), (C-viii), (D-viii) or (E-viii); or wherein said protein is that of item (a-x), (b-x), (c-x), or (d-x), each optionally in combination with item (3), or

- (A-x), (B-x), (C-x), (D-x) or (E-x); or wherein said protein is that of item

(a-xi), (b-xi), (c-xi), or (d-xi), each optionally in combination with item (3), or

- (A-xi), (B-xi), (C-xi), (D-xi) or (E-xi); or wherein said protein is that of item

(a-xii), (b-xii), (c-xii), or (d-xii), each optionally in combination with item (3), or

- (A-xii), (B-xii), (C-xii), (D-xii) or (E-xii).

(31) The composition according to (12), comprising a a protein according to item (A-x), (B-x), (C-x), (D-x) or (E-x), and/or according to item (A-xi), (B-xi), (C-xi), (D-xi) or (E-xi), and/or (A-xii), (B-xii), (C-xii), (D-xii) or (E- xii), optionally as further defined claim 7 or 8; and a protein according to item (A-viii), (B-viii), (C-viii), (D-viii) or (E-iii).

Salmonella bacteriocins, herein together with derivatives thereof referred to as “salmocins” (herein abbreviated as “Scol” or “Sal”), are natural non-antibiotic antimicrobial proteins produced by certain Salmonella strains that kill or inhibit the growth of other Salmonella strains. Unlike relatively well studied Escherichia coli protein analogues termed colicins, salmocins have been given little attention. There is a number of Salmonella sequences with similarity to colicin sequences in the publicly available genome databases, most of them showing high identity to colicins M, la, lb, 5 and 10. The inventors have identified salmocins similar to but different from colicins that can be used to prevent or reduce infection or contamination with Salmonella, notably with Salmonella enterica ssp. enterica.

The inventors have found that salmocins can be expressed efficiently in plants. Expression processes as those used in this study have already been brought to the level of GMP compliance, and are currently being used in different clinical trials as manufacturing processes. Most salmocins are expressed at high yields (up to 1.7 g active protein per kilogram of fresh green biomass), meaning low commercially viable manufacturing costs. Production can be made using, inter alia, tobacco and edible plants such as leaf beets or spinach. Among different salmocins, salmocins Ma (ScolMa), Mb (ScoIMb), and Me (ScolMc) are preferred, and compositions of these M-type salmocins with E1a (ScolEla) and (more preferably) E1b (ScolElb) or their derivatives are more preferred, since they were found to possess very broad antimicrobial activity against major pathogenic Salmonella strains. Each of the two salmocins ScolEla and ScolElb shows also a very high activity against all 36 major pathogenic strains tested. Also, ScolMa was shown to have broad antimicrobial activity against major pathogenic Salmonella strains (cf. Fig. 32). Interestingly, ScolMa has high antimicrobial activity against several pathogenic Salmonella strains against which ScolEI salmocins are less active, making ScolMa, as well as ScoIMb and ScolMc, ideal salmocins to be used in combination with ScolEI salmocins in compositions of the invention and, thus, to be used in the methods of the invention. ScoIMb and ScolMc have antimicrobial activity spectrum against Salmonella strains very similar as ScolMa (cf. Fig. 37). Therefore, ScoIMb and ScolMc have the same or similar utility as ScolMa for the uses and methods described herein. In terms of ease of purification, among the ScolM salmocins, ScoIMb and ScolMc (and their derivatives defined herein) are preferred, and ScolMc (and its derivatives as defined herein) are most preferred. ScolMc (and its derivatives) combines a broad antimicrobial activity spectrum against Salmonella strains with ideal handling properties, as well as high antimicrobial activity.

Treatments with low amounts of colicins (e.g. less than 10 mg colicin per kg of treated food product) reduce the bacterial load of different pathogenic strains by 3 to >6 logs in the assay employed. In spike experiments using poultry meats spiked with two to four pathogen serovars, colicins (colicins M, la and 5) efficiently reduced the titer of pathogenic bacteria. Therefore, it is expected that salmocins possessing higher antimicrobial activity towards S. enterica ssp. enterica serovars mentioned above than the colicins mentioned will reduce the titer of a Salmonella contamination on poultry even more effectively.

The experimental data of the present invention demonstrate that the non-antibiotic antibacterial salmocins can be expressed at very high levels in plants such as Nicotiana benthamiana, a standard manufacturing host for multiple biopharmaceuticals currently undergoing clinical trials, and the plant-expressed proteins are apparently fully active. The expression levels in most cases reach 37% of total soluble protein or 1.74 g/kg of fresh leaf biomass without process optimization, meaning that salmocins are not toxic to plants and that optimized industrial procedures of transfection or induction in transgenic hosts could be developed that are inexpensive. In contrast, attempts to express bacteriocin proteins in bacterial hosts at a high level were usually met with general toxicity of this bacteriocin class even in species other than homologous bacterial species (e.g. Medina et al. , PLoS One, 2011;6(8):e23055; Diaz et al., 1994). Thus, plants are excellent hosts for manufacturing salmocins.

The data of this invention demonstrate that salmocins can efficiently control most or all major pathogenic serotypes of Salmonella enterica ssp. enterica under actual exposure modelling. There is a limited variety among salmocins produced by Salmonella. Studied salmocins have quite diverse structure within the general three-domain (translocation, receptor and cytotoxic domains) architecture, similar to more studied E. coli colicins. Surprisingly, practically all tested salmocins and colicins, alone or together with antitoxin (immunity protein), are expressed very well in plants. This may be explained by a low toxicity of salmocins and colicins to plant cells and by the fact that these bacteriocin proteins, being classical representatives of ‘inherently disordered proteins’ (a feature essential for ability to unfold/refold during the bacterial cell wall and membrane translocation), probably do not impose unusual requirements on translation and post-translational machinery of the plant cell.

Unlike the list of major E. coli strains defined by FDA based on a historical analysis of food poisoning due to E. coli , a list of major foodborne Salmonella strains has not been defined by regulatory agencies, primarily due to higher diversity of the pathovars responsible for the outbreaks. Being confronted with this lack of guidance in the prior art, the inventors decided to pool three existing major studies that ranked the pathovars based on their prevalence and poisoning severity. In our study, 36 serovars have been selected to be analyzed, 29 of them caused at least 100 incidences reported to Centers for Disease Control in 2003-2012 (National Enteric Disease Surveillance: Salmonella Annual Report, 2013 (CDC, June 2016): laboratory- confirmed human Salmonella infections (US) reported to CDC 2003-2012) with 17 most notorious pathovars on CDCs top 20 list; the number being five times higher than the number of E. coli pathovars defined by the FDA (seven).

The data presented herein show that, based on their ability to control major pathogenic Salmonella strains, the five different Salmonella salmocins ScolEla, ScolElb, ScolE2, ScolE3, and ScolE7 can be divided into three groups. Salmonella salmocins E1a and E1b turned out to be universally active, each being able to kill all tested pathovars and showing the highest average activity. Average activity of the two salmocins on all tested strains was over 10 7 AU/pg. For example, the individual activity of salmocin E1a was: >10 3 AU/pg for 35 out of 36 strains, >10 4 AU/pg for 24 out of 36 strains and >10 6 AU/pg for 13 out of 36 strains. The remaining salmocins fell into two groups with Salmocins E2 and E7 being inhibitory to over 80% of strains but having a 100-fold lower average activity (less than 10 5 AU/pg), whereas salmocin E3 inhibited approx. 60% of strains at lower average activity (about 10 2 AU/pg). The inventors have further found that the salmocins ScolEIc, ScolEld, ScolEle, and ScolMa demonstrated significant antimicrobial activities. ScolElb and ScolEld were found to be surprisingly better in handling properties, notably for purification. ScolEld was surprisingly found to have an exceptional storage stability as a solution (notably an aqueous solution when cooled below room temperature, such as at 0° to 10°C, preferably between 3° to 7°C, or at about 4°C).

Among the ScolM salmocins, ScoIMb and ScolMc, in particular ScolMc, were found to be surprisingly better in handling properties, notably for purification, and high in antimicrobial activity against Salmonella species. These results are unexpected, because colicins (salmocin analogues produced by E. coli cells) exhibit a much narrower spectrum of antimicrobial activity against seven E. coli pathovars, and mixtures of two to five colicins had to be preferably used to efficiently inhibit all seven STEC serotypes defined by FDA. Colicins also demonstrated much lower average activity against ‘Big Seven’ STEC strains (average <10 3 AU/pg), although much higher activity has been observed on strain H104:H4 (>10 5 AU/pg) that caused a major outbreak in 2011 in Europe, and a common laboratory strain.

The inventors’ analysis of cross specific activity of salmocins and colicins on E. coli and Salmonella, respectively, demonstrates low activity against bacteria of different genus/species. In particular, activity of salmocins against ‘Big Seven’ STEC strains was low (less than 10 2 AU/pg) although some salmocins (such as E2, E7 and E1b, but surprisingly not E1a) were fairly active against H104:H4 (10 3 AU/pg) and laboratory strain DH10B (10 5 AU/pg). Similarly, activity of colicins on Salmonella pathovars was found to be low, with colicins la and lb being active on over 80% of strains, but with average activity of only colicin la being higher than 3x10 3 AU/pg (or three to four orders of magnitude less than salmocin E1a/b). The inventors conclude from these studies that, to combat both pathogenic species, one has to use mixtures of colicins and salmocins. These results are also in seeming partial disagreement with the recent studies of ecological efficacy of colicin-like proteins in competitions between bacteria of different genera (Nedialkova et al., PLoS Pathog. 2014 Jan; 10(1):e1003844).

The invention provides new agents and compositions for controlling Salmonella. The salmocins of the invention have the advantage that marketing authorization can be obtained in an uncomplicated manner. For example, the FDA recently granted plant-produced colicins GRAS (Generally Regarded As Safe) status (GRN573, FDA website). Because of the unmet need for natural non-antibiotic antibacterials for Salmonella control, the inventors conceived exploring Salmonella bacteriocins (’’salmocins”). Thereby, the present invention was accomplished.

DESCRIPTION OF THE FIGURES

Figure 1 shows schematically viral vectors for the expression of salmocins and corresponding immunity proteins used in the Examples. Constructs for the expression of salmocins are based on Tobacco mosaic virus (TMV), whereas constructs for the expression of immunity proteins are based on Potato Virus X (PVX).

Salmocin expression vectors include pNMD28161, pNMD28151 and pNMD28172 for the expression of salmocins ScolE2, ScolE3 and ScolE7, respectively (Fig. 1A), pNMD28191, pNMD28204, and pNMD28182 for the expression of salmocins ScolEla, ScolElb and Spst, respectively (Fig. 1B). RB and LB indicate the right and left borders of T-DNA of binary vectors. Pact2: promoter of Arabidopsis actin2 gene; o: 5’ end from TVCV (turnip vein clearing virus); RdRp: RNA- dependent RNA polymerase open reading frame (ORF) from cr-TMV (crucifer-infecting tobamovirus); MP: movement protein ORF from cr-TMV; ScolE2: salmocin ScolE2 coding sequence; ScolE3: salmocin ScolE3 coding sequence; ScolE7: salmocin ScolE7 coding sequence; ScolEla: salmocin ScolEla coding sequence; ScolElb: salmocin ScolElb coding sequence; Spst: salmocin Spst coding sequence; N: 3’-non-translated region from cr-TMV; T: Agrobacterium nopaline synthase terminator; white segments interrupting grey segments in the RdRp and MP ORFs indicate introns inserted into these ORFs for increasing the likelihood of RNA replicon formation in the cytoplasm of plant cells, which is described in detail in W02005049839. An intron was also inserted into ScolE2, ScolE3 and ScolE7 ORFs for preventing the cytotoxic effect of these proteins on E. coli cells used for plasmid cloning.

PVX-based vectors for the expression of immunity proteins include pNMD28222 and pNMD28232 for the expression of salmocin ScolE2 and ScolE7 immunity proteins, respectively (Fig. 1A). P35S: cauliflower mosaic virus 35S promoter; PVX-pol: RNA-dependent RNA polymerase from PVX; CP: coat protein ORF; 25K, 12K and 8K together indicate the 25 kDa, 12 kDa and 8 kDa triple gene block modules from PVX; N: 3’-untranslated region from PVX. SlmmE2 and SlmmE7 stand for coding sequences of salmocin ScolE2 and ScolE7 immunity proteins, respectively.

Figure 2 shows comparative SDS-PAGE analysis of expression for salmocins after the infiltration of Nicotiana benthamiana plants with agrobacteria carrying viral vectors. Plant leaf material was extracted with 5 volumes of buffer containing 50 mM HEPES (pH 7.0), 10 mM potassium acetate, 5 mM magnesium acetate, 10% (v/v) glycerol, 0.05% (v/v) Tween-20 and 300 mM NaCI. Protein extracts were resolved in 12% polyacrylamide gels. For gel loading, aliquots containing the extract volumes corresponding to 0.4 mg fresh weight of plant tissue were used. Before loading on the gel, aliquots of protein extracts were mixed with 2 x Laemmli buffer in the proportion 1:1 and incubated at 95°C for 10 min. Numerals above gel lanes stand for protein extracts from plant tissues expressing the following recombinant proteins: 1 - salmocin ScolE2; 2 - salmocin ScolE3; 3 - salmocin ScolE7; 4 - salmocin ScolEla; 5 - salmocin ScolElb; 6 - salmocin Spst. Numeral 7 corresponds to the extract from uninfected leaf tissue used as a negative control. L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671). Arrows indicate specific protein bands corresponding to expressed recombinant colicins.

Figure 3 shows the semi-quantitative evaluation of specific antimicrobial activity of salmocin-containing plant extracts against 36 S. enterica ssp. enterica strains listed in Tables 5A and 5B. The antimicrobial activity was tested using a radial diffusion assay via spot-on-lawn- method. The percentage of salmocin-sensitive Salmonella strains (average of 3 independent experiments) is given for salmocins: 1 - salmocin ScolE2; 2 - salmocin ScolE3; 3 - salmocin ScolE7; 4 - salmocin ScolEla; 5 - salmocin ScolElb; 6 - salmocin Spst.

Figure 4 shows the semi-quantitative evaluation of the average antimicrobial activity of salmocin-containing plant extracts against 36 S. enterica ssp. enterica strains listed in Tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn- method and calculated in arbitrary units (AU) per mg fresh weight (FW) of plant biomass expressing recombinant salmocins (average of 3 independent experiments). Thereby it reflects the yield of specific active agent per unit of biomass; /. e. the specific production capacity of the host. Arbitrary units are calculated as a dilution factor for the highest dilution of protein extract causing a detectable clearing effect in the radial diffusion assay. Tested recombinant salmocins are given as: 1 - salmocin ScolE2; 2 - salmocin ScolE3; 3 - salmocin ScolE7; 4 - salmocin ScolEla; 5 - salmocin ScolElb; 6 - salmocin Spst.

Figure 5 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin-containing plant extracts against 36 S. enterica ssp. enterica strains listed in Tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on- lawn-method and calculated in arbitrary units (AU) per pg of recombinant salmocin (average of 3 independent experiments) which reflects the specific activity of salmocins against particular strains; /. e. the specific antimicrobial potency of salmocins is being evaluated. Tested recombinant salmocins are given as: 1 - salmocin ScolE2; 2 - salmocin ScolE3; 3 - salmocin ScolE7; 4 - salmocin ScolEla; 5 - salmocin ScolElb; 6 - salmocin Spst.

Figure 6 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolE2-containing plant extracts against 36 S. enterica ssp. enterica strains listed in Tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per mg FW plant biomass (average of 3 independent experiments).

Figure 7 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolE2-containing plant extracts against 36 S. enterica ssp. enterica strains listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per pg of recombinant salmocin (average of 3 independent experiments).

Figure 8 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolE3-containing plant extracts against 36 S. enterica ssp. enterica strains listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per mg FW plant biomass (average of 3 independent experiments). Figure 9 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolE3-containing plant extracts against 36 S. enterica ssp. enterica strains listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per pg of recombinant salmocin (average of 3 independent experiments).

Figure 10 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolE7-containing plant extracts against 36 S. enterica ssp. enterica strains listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per mg FW plant biomass (average of 3 independent experiments).

Figure 11 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolE7-containing plant extracts against 36 S. enterica ssp. enterica strains listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per pg of recombinant salmocin (average of 3 independent experiments).

Figure 12 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolEla-containing plant extracts against 36 S. enterica ssp. enterica strains listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per mg FW plant biomass (average of 3 independent experiments).

Figure 13 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolEla-containing plant extracts against 36 S. enterica ssp. enterica strains listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per pg of recombinant salmocin (average of 3 independent experiments).

Figure 14 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolElb-containing plant extracts against 36 S. enterica ssp. enterica strains listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per mg FW plant biomass (average of 3 independent experiments).

Figure 15 shows the semi-quantitative evaluation of the average specific antimicrobial activity of salmocin ScolElb-containing plant extracts against 36 S. enterica ssp. enterica strains listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per pg of recombinant salmocin (average of 3 independent experiments). Figure 16 shows schematically viral vectors based on Tobacco mosaic virus (TMV) for the expression of colicins used in the Examples. Colicin expression vectors include pNMD25856, pNMD15311, pNMD25848, pNMD19141 , pNMD25861 and pNMD10221 for the expression of colicins colS4, col5, col 10, col la, collb and colM, respectively.

RB and LB indicate the right and left borders of T-DNA of binary vectors. Pact2: promoter of Arabidopsis actin2 gene; o: 5’ end from TVCV (turnip vein clearing virus); RdRp: RNA- dependent RNA polymerase open reading frame (ORF) from cr-TMV (crucifer-infecting tobamovirus); MP: movement protein ORF from cr-TMV; colS4: colicin S4 coding sequence; col5: colicin 5 coding sequence; col10: colicin 10 coding sequence; colla: colicin la coding sequence; collb: colicin lb coding sequence; colM: colicin M coding sequence; N: 3’-non-translated region from cr-TMV; T: Agrobacterium nopaline synthase terminator; white segments interrupting grey segments in the RdRp and MP ORFs indicate introns inserted into these ORFs for increasing the likelihood of RNA replicon formation in the cytoplasm of plant cells, which is described in detail in W02005049839.

Figure 17 shows the semi-quantitative evaluation of specific antimicrobial activity of colicin-containing plant extracts against 35 S. enterica ssp. enterica strains (No. 1-35) listed in tables 5A and 5B. The antimicrobial activity was tested using radial diffusion assay via spot-on- lawn-method. The percentage of colicin-sensitive Salmonella strains (average of 3 independent experiments) is given for colicins: 1 - colicin S4; 2 - colicin 5; 3 - colicin 10; 4 - colicin la; 5 - colicin lb; 6 - colicin M.

Figure 18 shows the reduction of S. enterica ssp. enterica cell population in contaminated chicken breast meat pieces by treatment with a three-component colicin blend comprising colicin M, colicin la and colicin 5. Meat was contaminated with a two-strain mixture of S. enterica ssp. enterica strains ATCC ® 14028™* and ATCC ® 13076™* of serotypes Typhimurium and Enteritidis, respectively. Asteriks indicate statistically significant differences in bacterial numbers.

Figure 19 shows the reduction of S. enterica ssp. enterica cell population in contaminated chicken breast meat pieces by treatment with a three-component colicin blend comprising colicin M, colicin la and colicin 5. Meat was contaminated with a four-strain mixture of S. enterica ssp. enterica strains ATCC ® 14028™*, ATCC ® 13076™*, ATCC ® 9270™* and ATCC ® 6962™* of serotypes Typhimurium, Enteritidis, Anatum and Newport, respectively. Asteriks indicate statistically significant differences in bacterial numbers.

Figure 20 A-C shows a multiple sequence alignment of salmocin amino acid sequences generated using Clustal Omega tool (Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. (2011 October 11) Molecular systems biology 7 :539). ScolE2, ScolE3, ScolE7, ScolEla, ScolElb and Spst refer to SEQ ID NOs: 1-6, respectively. Colour labels indicate properties of amino acid residues (red residues as AVFPMILW (small, hydrophobic and aromatic-Y); blue residues as DE (acidic); magenta residues as RK (basic) and green residues as STYHCNGQ (hydroxyl sulfhydryl amine and G)). Consensus symbol * (asterisk) indicates positions in alignment which have a single, fully conserved residue, a : (colon) indicates conservation between groups of strongly similar properties and a . (period) indicates conservation between groups of weakly similar properties.

Figure 21 is a schematic representation of T-DNA regions of ScolElb-encoding plasmid construct (pNMD35541) used for stable plant transformation. The T-DNA region is composed of 4 expression cassettes for: 1) constitutive expression of kanamycine resistance transgenic plant selection marker, 2) constitutive expression of alcR transcriptional activator, 3) ethanol-inducible expression of salmocin ScolElb and 4) ethanol-inducible expression of TMV MP. Arrows indicate orientation of expression cassettes. For tight control of viral replicon activation in non-induced state, the viral vector is deconstructed in the 2 components, replicon and MP (expression cassettes 3 and 4) (Werner et al. Proc. Natl. Acad. Sci. USA 108,14061-14066 (2011). LB and RB, binary left and right borders, respectively; Tnos and Pnos, terminator and promoter of the Agrobacterium nopaline synthase gene; NPTII, neomycin phosphotransferase II; Pstls, promoter of potato ST-LS1 gene; alcR, Aspergillus nidulans alcR ORF; Tact2, Arabidopsis thaliana actin 2 terminator; T35S, CaMV 35S terminator; 3 ' TMV, 3 ' untranslated region of TMV; RdRp, RNA- dependent RNA polymerase; pAIcA, ScolElb, coding sequence of salmocin E1b (ScolElb); Aspergillus nidulans alcohol dehydrogenase (alcA) promoter; MP, movement protein; Toes terminator of Agrobacterium octopine synthase gene; [ ] deletion of MP in expression cassette 3.

Figure 22 shows the results of inducible expression of salmocin ScolElb in stable transgenic Nicotiana benthamiana plants. Loading with crude extracts corresponds to 3 mg FW extracted with 2x Laemmli buffer from (lanes 1, 3, 5, 7) non-induced plant material or (lanes 2, 4, 6, 8) plant material 4 dp induction with ethanol (lanes 1, 2 ) N. benthamiana WT plant, (lanes 3, 4), (lanes 5, 6), (lanes 7, 8) different transgenic plant candidates for single copy T-DNA insertion of TO generation (#4, 12, 37 for ScolElb). Arrows mark recombinant proteins.

Figure 23 shows the results of transient expression of salmocins in Spinacia oleracea cv. Fruhes Riesenblatt upon syringe infiltration with agrobacteria carrying TMV or TMV and PVX vectors. Loading of TSP extracts corresponds to 3 mg FW plant material extracted with 5 vol. 150 mM NaCI. Plant material was harvested (a) 5 dpi (days post infiltration) for ScolEl b, 6 dpi for ScolE3, ScolE7 and ScolEla or 7 dpi for ScolE2 or (b) 4 dpi for ScolElb, 5 dpi for ScolE3, ScolE7 and ScolEla and 6 dpi for ScolE2 or ( d) 8 dpi for ScolE2, ScolE3, ScolE7, ScolEla and ScolElb. (a, b, d) Analyzed extracts were prepared from plant material expressing ScolE2 (lane 1), ScolE3 (lane 2), ScolE7 (lane 3), ScolEla (lane 4) and ScolElb (lane 5) or from (WT) non- transfected leaf tissue. ScolE2 and ScolE7 were co-expressed with their respective immunity proteins. Arrows mark recombinant proteins.

Figure 24 shows activity spectrum of bacteriocins from Salmonella and E. coli against Salmonella enterica ssp. enterica and E. coli Big 7 STEC serotypes. Semi-quantitative evaluation of the specific antimicrobial activity by radial diffusion assay via spot-on-lawn-method of (a, c, e, g) salmocin- and ( b , d, f, h ) colicin-containing plant extracts against 36 (a, e) or 35 ( b , f) S. enterica ssp. enterica strains listed in Table 9 or 7 E. coli Big 7 STEC strains (c, d, g, h) listed in Table 10. Average and STDV of N=3 and N=2 independent experiments is given in (a, b, e, f) and (c, d, g, h), respectively, for the percentage of bacteriocin-sensitive strains (e, f, g, h) and for the specific bacteriocin activity calculated in arbitrary units (AU) per pg of recombinant protein (a, b, c, d) on all tested strains. 1 - ScolE2 (a, c, e, g) or colS4 (b, d, f, h ); 2 - ScolE3 (a, c, e, g) or col5 (b, d, f, h ); 3 - ScolE7 (a, c, e, g) or col10 (b, d, f, h ); 4 - ScolEla (a, c, e, g) or colla (b, d, f, h ); 5 - ScolElb (a, c, e, g) or collb (b, d, f, h ); 6 - colM (b, d, f, h).

Figure 25 shows the reduction of a S. enterica ssp. enterica contamination on fresh chicken breast fillet by salmocins. (a) Bacterial populations recovered from meat upon storage for various periods of time at 10°C upon salmocin treatment (black bar at 0 h, initial contamination level; white bars, carrier treatment; at 1 h, 24 h, 48 h and 72 h: light grey bars, bacteriocin treatment ScolEla in concentration of 3 mg/kg meat; grey bars, bacteriocin treatment ScolE1a+ScolE1b+ScolE2+ScolE7 in concentration of 3+1+1+1 mg/kg meat; dark grey bars, bacteriocin treatment ScolE1a+ScolE1b+ScolE2+ScolE7 in concentration of 0.3+0.1+0.1+0.1 mg/kg meat) of contaminated meat by spray-application. Error bars indicate standard deviation of biological replicates, N=4. (b) Chicken breast trims used in (a).

Figure 26 shows a multiple sequence alignment of salmocin amino acid sequences generated using Clustal Omega tool. ScolEla, ScolEIc, ScolEld, ScolEle refer to SEQ ID NOs: 4, 25, 26 and 27, respectively. Consensus symbol * (asterisk) indicates positions in alignment which have a single, fully conserved residue, a : (colon) indicates conservation between groups of strongly similar properties and a . (period) indicates conservation between groups of weakly similar properties.

Figure 27 shows a multiple sequence alignment of salmocin ScolMa and colicin ColM amino acid sequences generated using Clustal Omega tool. ScolMa and ColM refer to SEQ ID NOs: 28 and 14, respectively.

Figure 28 shows schematically viral vectors pNMD47710, pNMD47720, pNMD48260 and pNMD47730 for the expression of salmocins ScolEIc, ScolEld, ScolEle and ScolMa, respectively. These constructs are based on Tobacco mosaic virus (TMV). ScolEIc: salmocin ScolEIc coding sequence; ScolEld: salmocin ScolEld coding sequence; ScolEle: salmocin ScolEle coding sequence; ScolMa: salmocin ScolMa coding sequence.

Figure 29 shows comparative SDS-PAGE analysis of expression for salmocins after the infiltration of Nicotiana benthamiana plants with agrobacteria carrying viral vectors. Plant leaf material was extracted with 5 volumes of buffer containing 50 mM HEPES (pH 7.0), 10 mM potassium acetate, 5 mM magnesium acetate, 10% (v/v) glycerol, 0.05% (v/v) Tween-20 and 300 mM NaCI. Protein extracts were resolved in 12% polyacrylamide gels. For gel loading, aliquots containing the extract volumes corresponding to 0.4 mg fresh weight of plant tissue were used. Before loading on the gel, aliquots of protein extracts were mixed with 2 x Laemmli buffer in the proportion 1:1 and incubated at 95°C for 10 min. Numerals above gel lanes stand for protein extracts from plant tissues expressing the following recombinant proteins: 1 - salmocin ScolEIc; 2 - salmocin ScolEld; 3 - salmocin ScolEle; 4 - salmocin ScolMa; 5 - salmocin ScolElb; 6 - salmocin ScolEla, 7 - colicin M. Numeral 8 corresponds to the extract from uninfected leaf tissue used as a negative control. L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671). Arrows indicate specific protein bands corresponding to expressed recombinant colicins.

Figure 30 shows a schematic representation of T-DNA regions of ScolEld- and ScolMa- encoding plasmid constructs (pNMD49621 and pNMD49632, respectively) used for stable plant transformation. The T-DNA region is composed of 4 expression cassettes for: 1) constitutive expression of kanamycine resistance transgenic plant selection marker, 2) constitutive expression of alcR transcriptional activator, 3) ethanol-inducible expression of salmocin ScolElb and 4) ethanol-inducible expression of TMV MP. Arrows indicate orientation of expression cassettes. ScolEld stands for coding sequence of salmocin E1d (ScolEld); ScolMa stands for coding sequence of salmocin Ma (ScolMa). For details, see the description to Figure 21.

Figure 31 shows the results of inducible expression of salmocin ScolEld in stable transgenic Nicotiana benthamiana plants of TO generation. Loading with crude extracts corresponds to 3 mg FW extracted with 2x Laemmli buffer from N. benthamiana wild type plants (WT) and individual plant TO plants transformed with pNMD49621 construct (lines 29, 88, 101, 151, 152, 153, 154, 155, 156, 157, 158, and 159) 4 days post induction with ethanol. Arrows mark recombinant proteins.

Figure 32 shows the semi-quantitative evaluation of specific antimicrobial activity of ScolElb-, ScolEld- and ScolMa-containing plant extracts against 36 S. enterica ssp. enterica strains listed in Tables 5A and 5B. The antimicrobial activity was tested using a radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per pg of recombinant salmocin (average of 3 independent experiments). Figure 33 shows the evaluation of antimicrobial activity for salmocins ScolEla, ScolElb and ScolEld upon their storage as lyophilized purified proteins and in a solution. Salmocin activity against susceptible Salmonella Typhimurium strain ATCC14028 was evaluated by radial diffusion assay. Antimicrobial activity was expressed as specific activity units (AU) per mg of recombinant protein. Salmocin concentration was determined using the Bradford assay. (A) Activity of salmocin ScolEla upon the storage as lyophilized dry powder and in the solution at 4°C and at the room temperature (RT, 20-20°C). (B) Activity of salmocin ScolElb (batch 3) upon the storage as lyophilized dry powder and in the solution at 4°C and at the room temperature (RT, 20-20°C). (C) Activity of salmocin ScolEIc (batch 2) upon the storage as lyophilized dry powder and in the solution at 4°C and at the room temperature (RT, 20-20°C).

Figure 34 shows a multiple sequence alignment of salmocin amino acid sequences generated using Clustal Omega (1.2.4) online tool (https://www.ebi.ac.uk/Tools/msa/clustalo/). ScolMa, ScoIMb and ScolMc refer to SEQ ID NOs: 28, 33 and 34, respectively. Consensus symbol * (asterisk) indicates positions in alignment which have a single, fully conserved residue, a : (colon) indicates conservation between groups of strongly similar properties and a . (period) indicates conservation between groups of weakly similar properties.

Figure 35 shows schematically viral vectors pNMD51280 and pNMD51290 for the expression of salmocins ScoIMb and ScolMc, respectively. These constructs are based on Tobacco mosaic virus (TMV). ScoIMb: salmocin ScoIMb coding sequence; ScolMc: salmocin ScolMc coding sequence.

Figure 36 shows SDS-PAGE analysis of expression for salmocins ScoIMb and ScolMc after the infiltration of Nicotiana benthamiana leaves with agrobacteria carrying viral vectors. Plant leaf material was harvested at 4, 5, 6 and 7 days post infiltration (dpi). Plant leaf material was extracted with 5 volumes of buffer containing 50 mM HEPES (pH 7.0), 10 mM potassium acetate, 5 mM magnesium acetate, 10% (v/v) glycerol, 0.05% (v/v) Tween-20 and 300 mM NaCI. Protein extracts were resolved in 12% polyacrylamide gels. Before loading on the gel, aliquots of protein extracts were mixed with 2 x Laemmli buffer in the proportion 1:1 and incubated at 95°C for 10 min. Numerals above gel lanes stand days post infiltration when leaf samples were harvested. ScoIMb and ScolMc indicate protein extracts from plant tissues expressing ScoIMb and ScolMc, respectively. NC - the extract from uninfected leaf tissue used as a negative control. L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671). Arrow indicates specific protein bands corresponding to expressed recombinant salmocins.

Figure 37 shows the semi-quantitative evaluation of specific antimicrobial activity of plant extracts containing ScolMa, ScoIMb and ScolMc salmocins 10 selected S. enterica ssp. enterica strains listed in Tables 5A and 5B. The antimicrobial activity was tested using a radial diffusion assay via spot-on-lawn-method and calculated in arbitrary units (AU) per pg of recombinant salmocin.

Figure 38 shows SDS-PAGE analysis of salmocin ScolMa protein purification fractions when HIC HiTrapTM Phenyl FF (LS) resin (A) and HiTrap™ Butyl FF resin (B) were used. L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671); CL - column load; FT - flow-through; W- column wash; F4, F8-F16, F18 - elution fractions. Arrows indicate specific protein bands corresponding to expressed recombinant salmocin ScolMa and RUBISCO large subunit (RbcL).

Figure 39 shows SDS-PAGE analysis of salmocin ScolMa protein purification fractions for HiTrap™ Capto™ MMC resin. L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671); CL - column load; FT - flow-through; W - column wash; F11-F16, F18, F19 - elution fractions.

Figure 40 shows SDS-PAGE analysis of ScolMa, ScoIMb and ScolMc recombinant proteins extracted from Nicotiana benthamiana leaf biomass using the extraction buffers differing in pH values. The extraction was performed at 4°C (A) and at room temperature (approx. 22°C) (B). Next, buffers were used for protein extraction: 1) 50 mM HEPES (pH 7.0), 10 mM K-acetate, 5 mM Mg-acetate, 10% glycerol, 0.05% Tween-20, 300 mM NaCI; 2) 15 mM Na-acetate (pH 5.0), 20 mM NaH 2 P0 4 , 30 mM NaCI, 0.05% Tween-80 and 3) 15 mM Na-acetate (pH 4.0), 20 mM NaH 2 P0 4 , 30 mM NaCI, 0.05% Tween 80.

L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671); 1 - the extract obtained with the extraction buffer of pH 7.0; 2 - the extract obtained with the extraction buffer of pH 5.0; 3 - the extract obtained with the extraction buffer of pH 4.0.

Figure 41 shows SDS-PAGE analysis of salmocin ScoIMb protein purification fractions for HiTrap™ Capto™ MMC resin. L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671); CL - column load; FT - flow-through; W - column wash; F10-F20 - elution fractions. Arrow indicates specific ScoIMb protein band.

Figure 42 shows SDS-PAGE analysis of salmocin ScoIMb protein purification fractions for HiTrap™ Phenyl FF (LS) resin. L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671); CL - column load; FT - flow-through; W - column wash; F9-F18 - elution fractions. Arrow indicates specific ScoIMb protein band.

Figure 43 shows SDS-PAGE analysis of salmocin ScolMc protein purification fractions for HiTrap™ Capto™ MMC resin. L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671); CL - column load; FT - flow-through; W - column wash; F10-F20 - elution fractions. Arrow indicates specific ScolMc protein band. Figure 44 shows SDS-PAGE analysis of salmocin ScolMc protein purification fractions for HiTrap™ Phenyl FF (LS) resin resin. L - PageRuler™ Prestainded Protein Ladder (Thermo Fisher Scientific Inc. (Waltham, USA), #SM0671); CL - column load; FT - flow-through; W - column wash; F9-F18 - elution fractions. Arrow indicates specific ScolMc protein band.

DETAILED DESCRIPTION OF THE INVENTION

The proteins of the invention are proteins that have a cytotoxic effect on Salmonella and are referred to herein as “salmocins”. The salmocins generally have at least a binding domain (also referred to as “receptor binding domain”) that allows binding of the salmocin to a surface receptor structure of cells of the target Salmonella. Salmocins further have a cytotoxic domain that may be a catalytic or a pore-forming domain. The catalytic domain may have an RNase or DNase catalytic activity, an inhibitory activity against cell wall peptidoglycan (murein) biosynthesis, or may degrade cell wall structures of Salmonella. Further, the salmocins may have a translocation domain that may interact with membrane proteins of cells of the target Salmonella so that the salmocin is translocated to a compartment where the salmocin exerts its cytotoxic function.

The inventors assume that the M-type salmocins (ScolM or SalM) of the invention are peptidoglycanases that specifically cleave the bond between the lipid moiety and the pyrophosphoryl group of the peptidoglycan lipid I and lipid II intermediates, located at the periplasmic side of the inner membrane (by analogy with Gross and Braun, Mol. Gen. Genet. 251 (1996) 388-396; Barreteau et al. , Microbial Drug Resistance 18 (2012), 222-229). The released C55-polyisoprenol no longer translocates MurNAc-pentapeptide-GIcNAc across the cytoplasmic membrane. The ScolM salmocins kill sensitive Salmonella strains after it has been taken up across the outer membrane into the periplasm. The mode of action of ScolM is believed to involve the steps of adsorption to the FhuA outer membrane receptor, energy- dependent translocation through the outer cell membrane into the periplasm by the TonB import machinery (TonB, ExbB and ExbD), and catalytic action of its substrate. Each of these steps is performed by a specific protein domain. Accordingly, also the ScolM salmocins share a three-domain structural organization and a narrow antibacterial spectrum in that the antibacterial activity against bacteria other than Salmonella is limited.

For the specificity to Salmonella, the binding domain is of importance and (inter alia) distinguishes the salmocins from otherwise similar colicins. Thus, the protein of the invention may be defined by having at least a binding domain that comprises or consists of or is contained in any one of the amino acid sequence segments as defined in item (1) above or in claim 1. Items (a-i) to (a-v) of item (1) or claim 1 define binding domains of the salmocins ScolE2, ScolE3, ScolE7, ScolEla, and ScolElb, respectively. Items (a-vii) to (a-x) and (a-xi) and (a-xii) of item (1) above define binding domains of the salmocins ScolEIc, ScolEld, ScolE1e,ScolMa, ScoIMb, and ScolMc, respectively. The binding domain of Spst is contained in the amino acid sequence segment defined in item (a-vi) of item (1) above. The amino acid sequences of salmocins ScolE2, ScolE3, ScolE7, ScolEla, ScolElb, and Spst are given as SEQ ID NO: 1 to 6, respectively. The amino acid sequence of the salmocins ScolEIc, ScolEld, ScolEle, ScolMa, ScoIMb, and ScolMc are given as SEQ ID NO: 25 to 28, 33, and 34, respectively. Items (b) to (d) of item (1) above define derivatives of ScolE2, ScolE3, ScolE7, ScolEla, ScolElb, Spst, ScolEIc, ScolEld, ScolEle, ScolMa, ScoIMb, and ScolMc, having or containing derivative binding domains (or amino acid sequence segments). Analogously, items (B) to (E) and items (a) to (5) (defined below) define derivatives of ScolE2, ScolE3, ScolE7, ScolEla, ScolElb, Spst, ScolEIc, ScolEld, ScolEle, ScolMa, ScoIMb, and ScolMc.

Derivatives of ScolE2, ScolE3, ScolE7, ScolEla, ScolElb, Spst, ScolEIc, ScolEld, ScolEle, ScolMa, ScoIMb, and ScolMc are preferably capable of exerting a cytotoxic effect on Salmonella.

In the present invention, salmocins ScolEIc, ScolEld, ScolEle, ScolMa, ScoIMb, and ScolMc (and their derivatives as defined) are preferred; ScolMa, ScoIMb, and ScolMc (and their derivatives as defined herein) are more preferred. ScoIMb and in particular ScolMc (and their derivatives as defined herein) are most preferred due to their exceptional ease of purification after expression and their antimicrobial activity. ScolMc (and its derivatives as defined herein) is also preferred due to its exceptionally high antimicrobial activity (see e.g. Example 23).

ScolEla and ScolElb as well as ScolEID and derivatives thereof as defined herein are preferred for use in combination with a ScolM salmocin (such as ScolMa, ScoIMb, and ScolMc) and their derivatives as defined herein.

Herein, an amino acid sequence segment (or, briefly, segment) refers to a plurality of contiguous amino acid residues of a protein or polypeptide having a larger number of amino acid residues than the segment. Domains are also referred to herein as “amino acid sequence segments” or briefly “segments”. The terms “protein” and “polypeptide” are used interchangeabley herein.

The protein of the invention comprises at least a binding domain. The following items (i) to (v), (vii) to (x), (xi), and (xii) of each of items (b) to (d) define preferred binding domains. Most preferred binding domains are those of items (a-i) to (a-v), (a-vii) to (a-x), (a-xi), and (a-xii).

Items (vi) of each of the following items (b) to (d), i.e. sub-items (b-vi), (c-vi) and (d-vi), define preferred amino acid sequence segments that contain a binding domain and are derivatives of salmocin Spst. The protein of the invention preferably comprises any one of the following amino acid sequence segments: (b-i) a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 316 to 449 of ScolE2 (SEQ ID NO: 1),

(b-ii) a segment having at least 75%, preferably 80%, more preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 315 to 483 of ScolE3 (SEQ ID NO: 2),

(b-iii) a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 318 to 451 of ScolE7 (SEQ ID NO: 3),

(b-iv) a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 174 to 297 of ScolEla (SEQ ID NO: 4),

(b-v) a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 198 to 322 of ScolElb (SEQ ID NO: 5),

(b-vi) a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to a segment comprising at least 200 contiguous amino acid residues of Spst of SEQ ID NO: 6; in one embodiment, a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to a segment comprising at least 250 contiguous amino acid residues of Spst of SEQ ID NO: 6; in a further embodiment; in a further embodiment, the protein comprises or consists of an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the (entire) amino acid sequence of SEQ ID NO: 6,

(b-vii) a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25),

(b-viii) a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26),

(b-ix) a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27),

(b-x) a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28); (b-xi) a segment having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence identity to the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33); or (b-xii) a segment having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence identity to the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34); or

(c-i) a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 316 to 449 of ScolE2 (SEQ ID NO: 1),

(c-ii) a segment having at least 85%, preferably at least 90%, more preferably at least 95% sequence similarity to the segment from amino acid residue 315 to 483 of ScolE3 (SEQ ID NO: 2),

(c-iii) a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 318 to 451 of ScolE7 (SEQ ID NO: 3),

(c-iv) a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 174 to 297 of ScolEla (SEQ ID NO: 4),

(c-v) a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 198 to 322 of ScolElb (SEQ ID NO: 5),

(c-vi) a segment having at least 90%, preferably at least 95% sequence similarity to a segment comprising at least 200 contiguous amino acid residues of Spst of SEQ ID NO: 6; in one embodiment, a segment having at least 90%, preferably at least 95% sequence similarity to a segment comprising at least 250 contiguous amino acid residues of Spst of SEQ ID NO: 6; in a further embodiment, the protein comprises or consists of an amino acid sequence having at least 80%, preferably at least 90% and most preferably at least 95% sequence similarity to the (entire) amino acid sequence of SEQ ID NO: 6;

(c-vii) a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25),

(c-viii) a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26),

(c-ix) a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27),

(c-x) a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28),

(c-xi) a segment having at least 90%, preferably at least 95%, more preferably at least 97% sequence similarity to the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or (c-xii) a segment having at least 90%, preferably at least 95%, more preferably at least 97% sequence similarity to the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34); or

(d-i) a segment having from 1 to 20, preferably from 1 to 15, more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 316 to 449 of ScolE2 (SEQ ID NO: 1),

(d-ii) a segment having from 1 to 30, preferably from 1 to 29, more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 315 to 483 of ScolE3 (SEQ ID NO: 2),

(d-iii) a segment having from 1 to 20, preferably from 1 to 15, more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 318 to 451 of ScolE7 (SEQ ID NO: 3),

(d-iv) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, even more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 174 to 297 of ScolEla (SEQ ID NO:

4),

(d-v) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, and even more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 198 to 322 of ScolElb (SEQ ID NO:

5),

(d-vi) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, and even more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to a segment comprising at least 200 contiguous amino acid residues of Spst of SEQ ID NO: 6; in one embodiment, a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, and even more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to a segment comprising at least 250 contiguous amino acid residues of Spst of SEQ ID NO: 6; in a further embodiment, the protein comprises or consists of an amino acid sequence having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, and even more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the (entire) amino acid sequence of SEQ ID NO: 6.

(d-vii) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, and even more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 195 to 319 of ScolEIc (SEQ ID NO: 25),

(d-viii) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, and even more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 195 to 319 of ScolEld (SEQ ID NO: 26),

(d-ix) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, and even more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 193 to 317 of ScolEle (SEQ ID NO: 27),

(d-x) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, even more preferably from 1 to 10, and most preferably from 1 to 5 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of ScolMa (SEQ ID NO: 28),

(d-xi) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, even more preferably from 1 to 10, and most preferably from 1 to 5 amino acid residue substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of ScoIMb (SEQ ID NO: 33), or

(d-xii) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, even more preferably from 1 to 10, and most preferably from 1 to 5 amino acid residue substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of ScolMc (SEQ ID NO: 34). Among the above proteins (or polypeptides), those of subclasses identified by numerals (vii) to (xii) are preferred (in all classes (a) to (d)) and those of subclass (x), (xi) and (xii) are more preferred.

In another embodiment, the invention provides a protein that is preferably capable of exerting a cytotoxic effect on Salmonella, wherein the amino acid sequence of said protein is defined by, or by comprising the segments of, any one of items (a-i) to (a-vi), (a-vii) to (a-x), (a- xi), (a-xii), (b-i) to (b-vi), (b-vii) to (b-x), (b-xi), (b-xii), (c-i) to (c-vi), (c-vii) to (c-x), (c-xi), (c-xii), (d-i) to (d-vi), or (d-vii) to (d-x), (d-xi), (d-xii) above. In a preferred embodiment, the amino acid sequence of said protein (or polypeptide) is defined by, or by comprising the segments of, any one of items (a-vii) to (a-x), (a-xi), (a-xii), (b-vii) to (b-x), (b-xi), (b-xii), (c-vii) to (c-x), (c-xi), (c- xii), or (d-vii) to (d-x), (d-xi), (d-xii) defined above. In a more preferred embodiment, the amino acid sequence of said protein is defined by, or by comprising the segments of, any one of items (a-x), (a-xi), (a-xii), (b-x), (b-xi), (b-xii), (c-x), (c-xi), (c-xii), or (d-x), (d-xi), (d-xii) defined above.

Preferably, alternatively or additionally, the protein is as defined in any one of (b-x), (c- x), (d-x), (b-xi), (c-xi), (d-xi), (b-xii), (c-xii), (d-xii), and the amino acid residue corresponding to residue 155 of SEQ ID NO: 33 is Pro and/or the amino acid residue corresponding to residue 246 of SEQ ID NO: 33 is Arg or Lys, preferably Arg. Alternatively or additionally, the amino acid residues corresponding to residues 76 and 84 of SEQ ID NO: 33 may be Gin. Herein, the wording “the amino acid residue corresponding to residue xx ... of SEQ ID NO: yy” means that the amino acid sequence of said protein has, at the position corresponding to residue xx of SEQ ID NO: yy, the indicated amino acid residue. Here, xx stands for the number (from the N-terminus) of an amino acid residue in the amino acid sequence of the protein, and yy stands for the indicated SEQ ID NO:.

Corresponding amino acid residues may be determined by aligning the protein with the amino acid sequence of SEQ ID NO:33 to give the best alignment (as done and shown e.g. Fig. 34). The wording “amino acid residue(s) corresponding to the residue ...” refers to an alignment as shown in Fig. 34 and means amino acid residues in SEQ ID NO: yy (here: SEQ ID NO: 33) having the same position (i.e. written on top of each other) in said alignment as the indicated amino acid residues in SEQ ID NO: yy.

Where a protein is defined herein by a number or number range of amino acid substitutions, additions, insertions or deletions, amino acid substitutions, additions, insertions or deletions may be combined, but the given number or number range refers to the sum of all amino acid substitutions, additions, insertions and deletions. Among amino acid substitutions, additions, insertions and deletions, amino acid substitutions, additions, and deletions are preferred. The term “insertion” relates to insertions within the amino acid sequence of a reference sequence, i.e. excluding additions at the C- or N-terminal end. The term additions means additions at the C- or N-terminal end of the amino acid sequence of a reference sequence. A deletion may be a deletion of a terminal or an internal amino acid residue of a reference sequence. Herein, where the protein or any domain thereof is defined by a number or number range of amino acid substitutions, additions, insertions or deletions relative to an indicated amino acid sequence of segment, in a further embodiment, the protein or domain may have from 1 to several amino acid substitutions, additions, insertions or deletions relative to the indicated amino acid sequence of segment.

The cytotoxic or catalytic domain of the protein of the invention may be as defined in item (3) above. In preferred embodiments, the protein of the invention comprises a cytotoxic or catalytic domain that comprises or consists of any one of the following amino acid sequence segments:

(b-i)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 453 to 582 of ScolE2 (SEQ ID NO: 1),

(b-ii)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 501 to 584 of ScolE3 (SEQ ID NO: 2), (b-iii)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 455 to 584 of ScolE7 (SEQ ID NO: 3),

(b-iv)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 306 to 478 of ScolEla (SEQ ID NO: 4),

(b-v)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 350 to 522 of ScolElb (SEQ ID NO: 5),

(b-vi)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 112 to 288 of Spst (SEQ ID NO: 6),

(b-vii)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 347 to 519 ScolEIc (SEQ ID NO: 25),

(b-viii)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 347 to 519 ScolEld (SEQ ID NO: 26),

(b-ix)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 345 to 517 ScolEle (SEQ ID NO: 27),

(b-x)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90% and most preferably at least 95% sequence identity to the segment from amino acid residue 139 to 269 ScolMa (SEQ ID NO: 28),

(b-xi)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence identity to the segment from amino acid residue 139 to 269 ScoIMb (SEQ ID NO: 33), or (b-xii)’ a segment having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence identity to the segment from amino acid residue 139 to 269 ScolMc (SEQ ID NO: 34); or

(c-i)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 453 to 582 of ScolE2 (SEQ ID NO: 1),

(c-ii)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 501 to 584 of ScolE3 (SEQ ID NO: 2),

(c-iii)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 455 to 584 of ScolE7 (SEQ ID NO: 3), (c-iv)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 306 to 478 of ScolEla (SEQ ID NO: 4),

(c-v)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 350 to 522 of ScolElb (SEQ ID NO: 5),

(c-vi)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 112 to 288 of Spst (SEQ ID NO: 6),

(c-vii)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 347 to 519 ScolEIc (SEQ ID NO: 25),

(c-viii)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 347 to 519 ScolEld (SEQ ID NO: 26),

(c-ix)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 345 to 517 ScolEle (SEQ ID NO: 27),

(c-x)’ a segment having at least 90%, preferably at least 95% sequence similarity to the segment from amino acid residue 139 to 269 ScolMa (SEQ ID NO: 28),

(c-xi)’ a segment having at least 90%, preferably at least 95%, more preferably at least 97% sequence similarity to the segment from amino acid residue 139 to 269 ScoIMb (SEQ ID NO: 33), or

(c-xii)’ a segment having at least 90%, preferably at least 95%, more preferably at least 97% sequence similarity to the segment from amino acid residue 139 to 269 ScolMc (SEQ ID NO: 34); or

(d-i)’ a segment having from 1 to 20, preferably from 1 to 15, more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 453 to 582 of ScolE2 (SEQ ID NO: 1),

(d-ii)’ a segment having from 1 to 20, preferably from 1 to 15, preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 501 to 584 of ScolE3 (SEQ ID NO: 2),

(d-iii)’ a segment having from 1 to 20, preferably from 1 to 15, preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 455 to 584 of ScolE7 (SEQ ID NO: 3),

(d-iv)’ a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 306 to 478 of ScolEla (SEQ ID NO: 4),

(d-v)’ a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 350 to 522 of ScolElb (SEQ ID NO: 5), (d-vi)’ a segment having from 1 to 20, preferably from 1 to 15, more preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 112 to 288 of Spst (SEQ ID NO: 6),

(d-vii)’ a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 347 to 519 ScolEIc (SEQ ID NO: 25),

(d-viii)’ a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 347 to 519 ScolEld (SEQ ID NO: 26),

(d-ix)’ a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 345 to 517 ScolEle (SEQ ID NO: 27),

(d-x)’ a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 ScolMa (SEQ ID NO: 28),

(d-xi)’ a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, even more preferably from 1 to 10, and most preferably at least from 1 to 5 amino acid residue substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 ScoIMb (SEQ ID NO: 33), or (d-xii)’ a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 15, even more preferably from 1 to 10, and most preferably from 1 to 5 amino acid residue substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 ScolMc (SEQ ID NO: 34).

In more preferred embodiments, the protein of the invention comprises a cytotoxic or catalytic domain that comprises, or consists of, any one of the sequence segments (a-vii)’ to (a- x)’, (a-xi)’ or (a-xii)’.

The protein may be as defined in any one of (b-x)’, (c-x)’, (d-x)’, (b-xi)’, (c-xi)’, (d-xi)’, (b- xii)’, (c-xii)’, or (d-xii)’, and the amino acid residue of said protein corresponding to residue 155 of SEQ ID NO: 33 is Pro and/or the amino acid residue corresponding to residue 246 of SEQ ID NO: 33 is Arg or Lys, preferably Arg. Alternatively or additionally, the amino acid residues corresponding to residues 76 and 84 of SEQ ID NO: 33 may be Gin.

Herein, in any item (x-y)’ (wherein x stands for any one of a, b, c, or d, and y stands for any roman numeral i to xii), the prime ‘ indicates catalytic domains or segments. Items (x-y) lacking the prime indicates binding domains or segments. Items (x-y)” carrying the double prime ” indicates translocation domains or segments. Among items (a) to (d), those of items (a), (b) and (d) are preferred and items (a) and (d) are more preferred. Similarly, among items (a)’ to (d)’, those of items (a)’, (b)’ and (d)’ are preferred and items (a)’ and (d)’ are more preferred. Similarly, among items (a)” to (d)”, those of items (a)”, (b)” and (d)” are preferred and items (a)” and (d)” are more preferred:

Where the protein of the invention comprises a binding domain as defined herein and a catalytic domain as defined herein, any binding domain (or segment) as defined above may be combined with any catalytic domain (or segment). In a preferred embodiment, a binding domain of any sub-item from (i) to (x) is combined, in a protein of the invention, with a catalytic domain of sub-item (i)’ to (x)’, respectively (e.g. a binding domain of item (iii) is combined with a catalytic domain of item (iii)’), whereby the catalytic domain may be on the C-terminal side of the protein. In one embodiment, a binding domain of any item (a) to (d) is combined with a catalytic domain of item (a)’ to (d)’, respectively, whereby the catalytic domain may be on the C- terminal side of the protein.

In certain embodiments, the protein of the invention may be capable of exerting a cytotoxic effect on Salmonella, and the protein comprises at least any one of the following combinations of amino acid sequence segments, preferably in the given order from N-terminus to the C-terminus of the protein:

(a-i) the segment from amino acid residue 316 to 449 SEQ ID NO: 1 and the segment from amino acid residue 453 to 582 of SEQ ID NO: 1 ,

(a-ii) the segment from amino acid residue 315 to 483 of ScolE3 of SEQ ID NO: 2 and the segment from amino acid residue 501 to 584 of SEQ ID NO: 2,

(a-iii) the segment from amino acid residue 318 to 451 of SEQ ID NO: 3 and the segment from amino acid residue 455 to 584 of SEQ ID NO: 3,

(a-iv) the segment from amino acid residue 174 to 297 of SEQ ID NO: 4 and the segment from amino acid residue 306 to 478 of SEQ ID NO: 4,

(a-v) the segment from amino acid residue 198 to 322 of SEQ ID NO: 5 and the segment from amino acid residue 350 to 522 of SEQ ID NO: 5,

(a-vi) a segment comprising at least 200 contiguous amino acid residues of SEQ ID NO: 6 including the segment from amino acid residue 112 to 288 of SEQ ID NO: 6,

(a-vii) the segment from amino acid residue 195 to 319 of SEQ ID NO: 25 and the segment from amino acid residue 347 to 519 of SEQ ID NO: 25,

(a-viii) the segment from amino acid residue 195 to 319 of SEQ ID NO: 26 and the segment from amino acid residue 347 to 519 of SEQ ID NO: 26,

(a-ix) the segment from amino acid residue 193 to 317 of SEQ ID NO: 27 and the segment from amino acid residue 345 to 517 of SEQ ID NO: 27,

(a-x) the segment from amino acid residue 38 to 138 of SEQ ID NO: 28 and the segment from amino acid residue 139 to 269 of SEQ ID NO: 28, (a-xi) the segment from amino acid residue 38 to 138 of SEQ ID NO: 33 and the segment from amino acid residue 139 to 269 of SEQ ID NO: 33, or (a-xii) the segment from amino acid residue 38 to 138 of SEQ ID NO: 34 and the segment from amino acid residue 139 to 269 of SEQ ID NO: 34; or

(b-i) a segment having at least 75% sequence identity to the segment from amino acid residue 316 to 449 of SEQ ID NO: 1 and a segment having at least 70% sequence identity to the segment from amino acid residue 453 to 582 of SEQ ID NO: 1,

(b-N) a segment having at least 70% sequence identity to the segment from amino acid residue 315 to 483 of SEQ ID NO: 2 and a segment having at least 70% sequence identity to the segment from amino acid residue 501 to 584 of SEQ ID NO: 2,

(b-iii) a segment having at least 77% sequence identity to the segment from amino acid residue 318 to 451 of SEQ ID NO: 3 and a segment having at least 70% sequence identity to the segment from amino acid residue 455 to 584 of SEQ ID NO: 3,

(b-iv) a segment having at least 70% sequence identity to the segment from amino acid residue 174 to 297 of SEQ ID NO: 4 and a segment having at least 70% sequence identity to the segment from amino acid residue 306 to 478 of SEQ ID NO: 4,

(b-n) a segment having at least 70% sequence identity to the segment from amino acid residue 198 to 322 of SEQ ID NO: 5 and a segment having at least 70% sequence identity to the segment from amino acid residue 350 to 522 of SEQ ID NO: 5,

(b-vi) a segment having at least 70% sequence identity to a segment comprising at least 200 contiguous amino acid residues of SEQ ID NO: 6 including a segment having at least 70% sequence identity to the segment from amino acid residue 112 to 288 of SEQ ID NO: 6,

(b-nϋ) a segment having at least 70% sequence identity to the segment from amino acid residue 195 to 319 of SEQ ID NO: 25 and a segment having at least 70% sequence identity to the segment from amino acid residue 347 to 519 of SEQ ID NO: 25,

(b-viii) a segment having at least 70% sequence identity to the segment from amino acid residue 195 to 319 of SEQ ID NO: 26 and a segment having at least 70% sequence identity to the segment from amino acid residue 347 to 519 of SEQ ID NO: 26,

(b-ix) a segment having at least 70% sequence identity to the segment from amino acid residue 193 to 317 of SEQ ID NO: 27 and a segment having at least 70% sequence identity to the segment from amino acid residue 345 to 517 of SEQ ID NO: 27,

(b-c) a segment having at least 70% sequence identity to the segment from amino acid residue 38 to 138 of SEQ ID NO: 28 and a segment having at least 70% sequence identity to the segment from amino acid residue 139 to 269 of SEQ ID NO: 28,

(b-xi) a segment having at least 70%, preferably at least 80%, more preferably at least 90%, and most preferably at least 95% sequence identity to the segment from amino acid residue 38 to 138 of SEQ ID NO: 33 and a segment having at least 70%, preferably at least 80%, more preferably at least 90%, and most preferably at least 95% sequence identity to the segment from amino acid residue 139 to 269 of SEQ ID NO: 33,

(b-xii) a segment having at least 70%, preferably at least 80%, more preferably at least 90%, and most preferably at least 95% sequence identity to the segment from amino acid residue 38 to 138 of SEQ ID NO: 34 and a segment having at least 70%, preferably at least 80%, more preferably at least 90%, and most preferably at least 95% sequence identity to the segment from amino acid residue 139 to 269 of SEQ ID NO: 34; or

(c-i) a segment having at least 85% sequence similarity to the segment from amino acid residue 316 to 449 of SEQ ID NO: 1 and a segment having at least 80% sequence similarity to the segment from amino acid residue 453 to 582 of SEQ ID NO: 1,

(c-N) a segment having at least 80% sequence similarity to the segment from amino acid residue 315 to 483 of SEQ ID NO: 2 and a segment having at least 80% sequence similarity to the segment from amino acid residue 501 to 584 of SEQ ID NO: 2,

(c-iii) a segment having at least 85% sequence similarity to the segment from amino acid residue 318 to 451 of SEQ ID NO: 3 and a segment having at least 80% sequence similarity to the segment from amino acid residue 455 to 584 of SEQ ID NO: 3,

(c-iv) a segment having at least 80% sequence similarity to the segment from amino acid residue 174 to 297 of SEQ ID NO: 4 and a segment having at least 80% sequence similarity to the segment from amino acid residue 306 to 478 of SEQ ID NO: 4,

(c-n) a segment having at least 80% sequence similarity to the segment from amino acid residue 198 to 322 of SEQ ID NO: 5 and a segment having at least 80% sequence similarity to the segment from amino acid residue 350 to 522 of SEQ ID NO: 5,

(c-vi) a segment having at least 80% sequence similarity to a segment comprising at least 200 contiguous amino acid residues of SEQ ID NO: 6 including a segment having at least 80% sequence similarity to the segment from amino acid residue 112 to 288 of SEQ ID NO: 6,

(c-vii) a segment having at least 80% sequence similarity to the segment from amino acid residue 195 to 319 of SEQ ID NO: 25 and a segment having at least 80% sequence similarity to the segment from amino acid residue 347 to 519 of SEQ ID NO: 25,

(c-viii) a segment having at least 80% sequence similarity to the segment from amino acid residue 195 to 319 of SEQ ID NO: 26 and a segment having at least 80% sequence similarity to the segment from amino acid residue 347 to 519 of SEQ ID NO: 26,

(c-ix) a segment having at least 80% sequence similarity to the segment from amino acid residue 193 to 317 of SEQ ID NO: 27 and a segment having at least 80% sequence similarity to the segment from amino acid residue 345 to 517 of SEQ ID NO: 27, or (c-c) a segment having at least 80% sequence similarity to the segment from amino acid residue 38 to 138 of SEQ ID NO: 28 and a segment having at least 80% sequence similarity to the segment from amino acid residue 139 to 269 of SEQ ID NO: 28,

(c-xi) a segment having at least 80%, preferably at least 90%, and most preferably at least 95% sequence similarity to the segment from amino acid residue 38 to 138 of SEQ ID NO: 33 and a segment having at least 80%, preferably at least 90%, and most preferably at least 95% sequence similarity to the segment from amino acid residue 139 to 269 of SEQ ID NO: 33,

(c-xii) a segment having at least 80%, preferably at least 90%, and most preferably at least 95% sequence similarity to the segment from amino acid residue 38 to 138 of SEQ ID NO: 34 and a segment having at least 80%, preferably at least 90%, and most preferably at least 95% sequence similarity to the segment from amino acid residue 139 to 269 of SEQ ID NO: 34; or

(d-i) a segment having from 1 to 25 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 316 to 449 of SEQ ID NO: 1 and a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 453 to 582 of SEQ ID NO: 1,

(d-N) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 315 to 483 of SEQ ID NO: 2 and a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 501 to 584 of SEQ ID NO: 2,

(d-iii) a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 318 to 451 of SEQ ID NO: 3 and a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 455 to 584 of SEQ ID NO: 3,

(d-iv) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 174 to 297 of SEQ ID NO: 4 and a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 306 to 478 of SEQ ID NO: 4,

(d-n) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 198 to 322 of SEQ ID NO: 5 and a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 350 to 522 of SEQ ID NO: 5,

(d-vi) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to a segment comprising at least 200 contiguous amino acid residues of SEQ ID NO: 6 including a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 112 to 288 of SEQ ID NO: 6,

(d-vii) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 195 to 319 of SEQ ID NO: 25 and a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 347 to 519 of SEQ ID NO: 25,

(d-viii) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 195 to 319 of SEQ ID NO: 26 and a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 347 to 519 of SEQ ID NO: 26,

(d-ix) a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 193 to 317 of SEQ ID NO: 27 and a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 345 to 517 of SEQ ID NO: 27, or (d-c) a segment having from 1 to 30 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of SEQ ID NO: 28 and a segment having from 1 to 40 amino acid substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 of SEQ ID NO: 28,

(d-xi) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 10, and most preferably from 1 to 5 amino acid residue substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of SEQ ID NO: 33 and a segment having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid residue substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 of SEQ ID NO: 33,

(d-xii) a segment having from 1 to 30, preferably from 1 to 20, more preferably from 1 to 10, and most preferably from 1 to 5 amino acid residue substitutions, additions, insertions or deletions to the segment from amino acid residue 38 to 138 of SEQ ID NO: 34 and a segment having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid residue substitutions, additions, insertions or deletions to the segment from amino acid residue 139 to 269 of SEQ ID NO: 34.

The protein comprises preferably at least any one of the combinations of amino acid sequence segments defined in subclasses (vii) to (xii), even more preferably (x), (xi), or (xii).

All these embodiments may be combined with the preferred values for minimum sequence identities or similarities or preferred numbers of amino acid substitutions, additions, insertions or deletions of the respective segments defined herein. Item (4) above (of the Summary of the Invention) defines translocation domains and derivatives thereof of ScolE2, ScolE3, Scol E7, ScolEla, ScolElb, ScolEIc, ScolEld, ScolEle, ScolMa, ScoIMb, and ScolMc. The definitions of the translocation domains and derivatives thereof may be combined with the definitions of the cytotoxic and binding domains or derivatives thereof. The definitions of the translocation domains and derivatives thereof may be combined with the definitions of the protein as inter alia defined below.

A protein of the invention may have a binding domain (or binding segment) according to any one of items (a-i) to (a-x), (a-xi) and (a-xii) or according to any of the derivatives of items (b-i) to (b-x), (b-xi) and (b-xii), (c-i) to (c-x), (c-xi) and (c-xii), or (d-i) to (d-x), (d-xi) and (d-xii). Preferably, a protein of the invention may have a binding domain (or binding segment) according to any one of items (a-vii) to (a-x), (a-xi) and (a-xii) or according to any of the derivatives of items (b-vii) to (b-x), (b-xi) and (b-xii), (c-vii) to (c-x), (c-xi) and (c-xii), or (d-vii) to (d-x), (d-xi) and (d-xii). Any such binding domain may be combined with a catalytic/cytotoxic domain according to any one items (a-i)’ to (a-x)’, (a-xi)’, (a-xii)’, (b-i)’ to (b-x)’, (b-xi)’, (b-xii)’, (c- i)’ to (c-x)’, (c-xi)’, (c-xii)’, or (d-i)’ to (d-x), (d-xi)’, or (d-xii)’. Preferably, any such binding domain may be combined with a catalytic/cytotoxic domain according to any one items (a-vii)’ to (a-x)’, (a-xi)’, (a-xii)’, (b-vii)’ to (b-x)’, (b-xi)’, (b-xii)’, (c-vii)’ to (c-x)’, (c-xi)’, (c-xii)’, or (d-i)’ to (d-x), (d- xi)’, (d-xii)’, whereby this preferred embodiment may preferably be combined with the preferred binding domains given above in this paragraph.

The domain structure of the salmocins allows establishing artificial salmocins wherein domains from different salmocins of the invention, or derivatives thereof as defined herein, are combined to form novel salmocins (chimeric salmocins). In such chimeric salmocins, the domain sequence of natural salmocins, from the N-terminus to the C-terminus, of a translocation domain (if present), a binding domain, and a catalytic or activity domain may or may not be maintained; preferably, it is maintained. Thus, the protein of the invention may comprise, from the N-terminus to the C-terminus, a binding domain of any one of items (a-i) to (a-x), (a-xi), and (a-xii) or according to any of the derivatives of items (b-i) to (b-x), (b-xi), (b-xii), (c-i) to (c-x), (c-xi) , (c-xii), or (d-i) to (d-x), (d-xi), (d-xii), and a catalytic domain (segment) of any one of items (a-i)’ to (a-x)’, (a-xi)’, (a-xii)’, (b-i)’ to (b-x)’, (b-xi)’, (b-xii)’, (c-i)’ to (c-x)’, (c-xi)’, (c-xii)’ or (d-i)’ to (d-x)’, (d-xi)’, (d-xii)’. In a preferred embodiment, the protein of the invention may comprise, from the N-terminus to the C-terminus, a translocation domain of any one of items (a-i)” to (a-ix)”, (b-i)” to (b-ix)”, (c-i)” to (c-ix)” or (d-i)” to (d-ix)”, a binding domain of any one of items (a-i) to (a-x), or according to any of the derivatives of items (b-i) to (b-x), (c-i) to (c- x) or (d-i) to (d-x), and a catalytic domain (segment) of any one of items (a-i)’ to (a-x)’, (b-i)’ to (b-x)’, (c-i)’ to (c-x)’ or (d-i)’ to (d-x)’. Within the three cytotoxic activities of the salmocins nuclease, pore-forming and muramidase (Table 1), domains may be exchanged between salmocins of the same type of cytotoxic activity. For example, a new salmocin with RNase-type cytotoxicity may be formed from the translocation and binding domains of ScolE2 or ScolE7 (or derivatives of these domains) and the cytotoxic domain of ScolE3. Preferably, however, a binding domain of any one of sub-items (i) to (x) is combined with a catalytic domain of any one of sub-items (i)’ to (x)’, respectively, for increased similarity to natural salmocins, preferably each of any of items (a) to (d). More preferably, however, a binding domain of any one of sub-items (i) to (v) or (vii) to (x) may be combined with a catalytic domain of any one of sub-items (i)’ to (v)’ or (vii)’ to (x)’, respectively, and a translocation domain of any one of sub-items (i)” to (v)” or (vii)” to (x)”, respectively, preferably of any of items (a) to (d), for increased similarity to natural salmocins.

In another embodiment, a binding domain of sub-items (i) to (v) is combined with a catalytic domain of sub-items (i)’ to (vi)’ (preferably (i)’ to (v)’), respectively, for increased similarity to natural salmocins, preferably each of any of items (a) to (d). In a further embodiment, a binding domain of any one of sub-items (i) to (v) is combined with a catalytic domain of any one of sub-items (i)’ to (v)’, respectively, and a translocation domain of any one of sub-items (i)” to (v)”, respectively, preferably of any of items (a) to (d), for increased similarity to natural salmocins.

The invention also provides a protein that is preferably capable of exerting a cytotoxic effect on Salmonella, said protein comprising or consisting of the following amino acid sequences:

(A-i) SEQ ID NO: 1,

(A-ii) SEQ ID NO: 2,

(A-iii) SEQ ID NO: 3,

(A-iv) SEQ ID NO: 4,

(A-v) SEQ ID NO: 5,

(A-vi) SEQ ID NO: 6,

(A-vii) SEQ ID NO: 25,

(A-viii) SEQ ID NO: 26,

(A-ix) SEQ ID NO: 27,

(A-x) SEQ ID NO: 28,

(A-x) SEQ ID NO: 33, or (A-x) SEQ ID NO: 34; or (B-i) an amino acid sequence having at least 75%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, and even more preferably at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 1,

(B-ii) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 93%, and even more preferably at least 96% sequence identity to the amino acid sequence of SEQ ID NO: 2,

(B-iii) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, and even more preferably at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 3,

(B-iv) an amino acid sequence having at least 70%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, and even more preferably at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 4,

(B-v) an amino acid sequence having at least 70%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, and even more preferably at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 5,

(B-vi) an amino acid sequence having at least 70%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, and even more preferably at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 6,

(B-vii) an amino acid sequence having at least 70%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, and even more preferably at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 25,

(B-viii) an amino acid sequence having at least 70%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, and even more preferably at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 26,

(B-ix) an amino acid sequence having at least 70%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, and even more preferably at least 95% sequence identity to the amino acid sequence of SEQ ID NO: 27,

(B-x) an amino acid sequence having at least 70%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence identity to the amino acid sequence of SEQ ID NO: 28,

(B-xi) an amino acid sequence having at least 70%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence identity to the amino acid sequence of SEQ ID NO: 33, or

(B-xii) an amino acid sequence having at least 70%, preferably at least 80%, more preferably at least 85%, even more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence identity to the amino acid sequence of SEQ ID NO: 34; or

(C-i) an amino acid sequence having at least 85%, preferably at least 90%, and more preferably at least 95% sequence similarity to the amino acid sequence of SEQ ID NO: 1,

(C-ii) an amino acid sequence having at least 85%, preferably at least 90%, and more preferably at least 95% sequence similarity to the amino acid sequence of SEQ ID NO: 2,

(C-iii) an amino acid sequence having at least 85%, preferably at least 90%, and more preferably at least 95% sequence similarity to the amino acid sequence of SEQ ID NO: 3,

(C-iv) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, and even more preferably at least 95% sequence similarity to the amino acid sequence of SEQ ID NO: 4,

(C-v) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, and even more preferably at least 95% sequence similarity to the amino acid sequence of SEQ ID NO: 5,

(C-vi) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, and even more preferably at least 95% sequence similarity to the amino acid sequence of SEQ ID NO: 6,

(C-vii) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, and even more preferably at least 95% sequence similarity to the amino acid sequence of SEQ ID NO: 25,

(C-viii) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, and even more preferably at least 95% sequence similarity to the amino acid sequence of SEQ ID NO: 26,

(C-ix) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, and even more preferably at least 95% sequence similarity to the amino acid sequence of SEQ ID NO: 27,

(C-x) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence similarity to the amino acid sequence of SEQ ID NO: 28,

(C-xi) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence similarity to the amino acid sequence of SEQ ID NO: 33, or

(C-xii) an amino acid sequence having at least 80%, preferably at least 85%, more preferably at least 90%, even more preferably at least 95%, and most preferably at least 97% sequence similarity to the amino acid sequence of SEQ ID NO: 34; or (D-i) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 1,

(D-ii) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 2,

(D-iii) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 3,

(D-iv) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 4,

(D-v) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 5,

(D-vi) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 6,

(D-vii) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 25,

(D-viii) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 26,

(D-ix) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, and most preferably from 1 to 10 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 27,

(D-x) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, even more preferably from 1 to 10, and most preferably from 1 to 5 amino acid substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 28, (D-xi) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, even more preferably from 1 to 10, and most preferably from 1 to 5 amino acid residue substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 33, or

(D-xii) an amino acid sequence having from 1 to 40, preferably from 1 to 30, more preferably from 1 to 20, even more preferably from 1 to 10, and most preferably from 1 to 5 amino acid residue substitutions, additions, insertions or deletions to the amino acid sequence of SEQ ID NO: 34; or

(E-i) an amino acid sequence comprising or consisting of at least 470, preferably at least 525, more preferably at least 555, contiguous amino acid residues of SEQ ID NO: 1,

(E-ii) an amino acid sequence comprising or consisting of at least 470, preferably at least 525, more preferably at least 555, contiguous amino acid residues of SEQ ID NO: 2,

(E-iii) an amino acid sequence comprising or consisting of at least 470, preferably at least 525, more preferably at least 555, contiguous amino acid residues of SEQ ID NO: 3,

(E-iv) an amino acid sequence comprising or consisting of at least 390, preferably at least 435, more preferably at least 460, contiguous amino acid residues of SEQ ID NO: 4,

(E-v) an amino acid sequence comprising or consisting of at least 425, preferably at least 475, more preferably at least 500, contiguous amino acid residues of SEQ ID NO: 5,

(E-vi) an amino acid sequence comprising or consisting of at least 250, preferably at least 270, more preferably at least 282, contiguous amino acid residues of SEQ ID NO: 6,

(E-vii) an amino acid sequence comprising or consisting of at least 425, preferably at least 475, more preferably at least 500, contiguous amino acid residues of SEQ ID NO: 25,

(E-viii) an amino acid sequence comprising or consisting of at least 425, preferably at least 475, more preferably at least 500, contiguous amino acid residues of SEQ ID NO: 26,

(E-ix) an amino acid sequence comprising or consisting of at least 425, preferably at least 475, more preferably at least 500, contiguous amino acid residues of SEQ ID NO: 27,

(E-x) an amino acid sequence comprising or consisting of at least 215, preferably at least 240, more preferably at least 260, contiguous amino acid residues of SEQ ID NO: 28,

(E-xi) an amino acid sequence comprising or consisting of at least 215, preferably at least 240, more preferably at least 260, contiguous amino acid residues of SEQ ID NO: 33, or (E-xii) an amino acid sequence comprising or consisting of at least 215, preferably at least 240, more preferably at least 260, contiguous amino acid residues of SEQ ID NO: 34.

As is generally understood and for avoiding any doubt, the wording “protein comprising any one of the following amino acid sequences” means that the amino acid sequence of said protein may comprise additional amino acid residues or sequence stretches than those defined (e.g. a purification tag or other tag). The wording “protein consisting of any one of the following amino acid sequences” means that the amino acid sequence of said protein does not have additional amino acid residues than those defined. As above, among the above proteins (or polypeptides), those of subclasses identified by numerals (vii) to (xii) are preferred (in all classes (a) to (d)) and those of subclass (x), (xi) and (xii) are more preferred. This also applies to the preferred embodiments described below. In another embodiment, the invention provides a protein that is preferably capable of exerting a cytotoxic effect on Salmonella, wherein the amino acid sequence of said protein is as defined in any one of items (A-i) to (A-x), (A-xi), (A-xii), (B-i) to (B-x), (B-xi), (B-xii), (C-i) to (C- x), (C-xi), (C-xii), (D-i) to (D-x), (D-xi), (D-xii), or (E-i) to (E-x), (E-xi), or (E-xii). In a preferred embodiment, the invention provides a protein that is preferably capable of exerting a cytotoxic effect on Salmonella, wherein the amino acid sequence of said protein is as defined in any one of items (A-vii) to (A-x), (A-xi), (A-xii), (B-vii) to (B-x), (B-xi), (B-xii), (C-vii) to (C-x), (C-xi), (C-xii), (D-vii) to (D-x), (D-xi), (D-xii), or (E-vii) to (E-x), (E-xi), or (E-xii).

The above definitions of the proteins with respect to the entire sequence of SEQ ID NOs 1 to 6, 25 to 28, 33 or 34 may be combined with the above definitions of the protein based on one or more particular domains such as a binding and/or catalytic or cytotoxic domains and/or translocation domain where available.

Herein, the determination of sequence identities and similarities is done using Align Sequences Protein BLAST (BLASTP 2.6.1+) (Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.).

The derivatives of domains and/or protein of the invention as defined above in items (b) to (d), (b)’ to (d)’, (b)” to (d)”, or items (B) to (D) or (E) may, notwithstanding the sequence varieties allowed by the embodiments defined above, preserve amino acid residues as defined in the following. In preferred embodiments, the amino acid residue(s) corresponding to residue 125 of SEQ ID NO: 4 is Asn or Ser; residue 145 of SEQ ID NO: 4 is Lys or Arg; residue 151 of SEQ ID NO: 4 is Ala or Gly; residue 154 of SEQ ID NO: 4 is Ala, Ser or Gly; residue 155 of SEQ ID NO: 4 is Phe, Leu or lie; residue 158 of SEQ ID NO: 4 is Ala or Gly; residue 163 of SEQ ID NO: 4 is Glu, Asp, Ser, Leu or lie, preferably Glu, Asp, or Ser; residue 165 of SEQ ID NO: 4 is Ala, Thr, Val or Ser, preferably Ala, Thr, or Val; residue 167 of SEQ ID NO: 4 is Arg; residue 172 of SEQ ID NO: 4 is Thr, Ala, or Ser; residue 175 of SEQ ID NO: 4 is Gin; residue 176 of SEQ I D NO: 4 is Val or Leu; residue 178 of SEQ ID NO: 4 is Gin or Leu, preferably Gin; residue 181 of SEQ ID NO: 4 is Glu or Asp, preferably Glu; residue 184 of SEQ ID NO: 4 is Arg or Gin, preferably Arg; residue 192 of SEQ ID NO: 4 is Ala or Thr; residue 195 of SEQ ID NO: 4 is Ala or Val; residue 196 of SEQ ID NO: 4 is Glu or Gin, preferably Glu; residue 198 of SEQ ID NO: 4 is Ala or Thr; residue 209 of SEQ ID NO: 4 is Leu or lie, preferably Leu; residue 273 of SEQ ID NO: 4 is Leu or lie; residue 280 of SEQ ID NO: 4 Arg; residue 283 of SEQ ID NO: 4 Lys; residue 286 of SEQ ID NO: 4 Gin or Lys; residue 290 of SEQ ID NO: 4 Ala, or Thr; residue 299 of SEQ ID NO: 4 Asp, Asn or Glu; residue 301 of SEQ ID NO: 4 Leu; residue 302 of SEQ ID NO: 4 Asn or Asp; residue 346 of SEQ ID NO: 4 Asn, Asp, or Glu; residue 363 of SEQ ID NO: 4 Lys, Asn or Arg; residue 364 of SEQ ID NO: 4 Lys or Gin.

The wording “amino acid residue(s) corresponding to the amino acid residue refers to the alignments shown in Fig. 20A-C and Fig. 26 and means amino acid residues in SEQ ID NO: 4 (ScolEla) or amino acid residues in SEQ ID NO: 1 to 3, 5, 6, 25, 26 or 27 having the same position (i.e. written on top of each other) in said alignment as the indicated amino acid residues in SEQ ID NO: 4 (ScolEla).

In derivatives of ScolEla and ScolElb, and/or in derivative domains of ScolEla and ScolElb, corresponding amino acid residues that are the same in the alignment of ScolEla and ScolElb of Fig. 20 may be the same amino acid residue as in ScolEla and ScolElb; and/or corresponding to amino acid residues that differ among ScolEla and ScolElb, some or all such differing amino acid residues may be an amino acid residue as in ScolEla or in ScolElb (but not another amino acid residue).

In derivatives of ScolE2 and ScolE7, and/or in derivative domains of ScolE2 and ScolE7, corresponding amino acid residues that are the same in the alignment of ScolE2 and ScolE7 of Fig. 20 may be the same amino acid residue as in ScolE2 or ScolE7; and/or corresponding to amino acid residues that differ among ScolE2 and ScolE7, some or all such differing amino acid residues may be an amino acid residue as in ScolE2 or ScolE7 (but not another amino acid residue). A salmocin according to the invention may comprise an additional N- or C-terminal amino acid sequence stretch such as purification tags, e.g. as a His-tag of 6 or more contiguous histidine residues; the derivative has, preferably, no N-terminal amino acid residue addition.

The protein (salmocin) of the invention is preferably capable of exerting a cytotoxic effect on Salmonella, notably of Salmonella enterica and more preferably Salmonella enterica ssp. enterica. Whether this condition is fulfilled can be tested experimentally using a radial diffusion assays via spot-on-lawn-method. The cytotoxicity of a protein to be tested against Salmonella enterica is such that it and the protein of SEQ ID NO: 1 produce spots free of viable bacteria of Salmonella enterica ssp. enterica serovar Newport strain ATCC ® 6962™* of the same diameter 12 hours after spotting 5 microliters of a solution of said protein to be tested and the protein of SEQ ID NO: 1 onto a softagar overlay plate seeded with 0.14 mL bacterial solution of 1x10 7 cfu/mL per cm 2 of the sensitive Salmonella enterica strain and subsequent incubation of the agar plate at 37°C, wherein the concentration of the protein to be tested is at most 5 times that of the comparative solution of the protein SEQ ID NO: 1. In a preferred embodiment, the point of reference is not the protein of SEQ ID NO: 1 , but the protein of SEQ ID NO: 4 or 5 under otherwise identical conditions.

The composition of the invention comprises a protein (salmocin) as described above and optionally further components as the case requires such as a carrier. The composition preferably comprises ScolEla and/or ScolElb or a derivative thereof as described above and optionally further components as the case requires such as a carrier. The composition may comprise one or more different proteins (salmocins) as defined herein, such as two, three or four different proteins (salmocins) as defined herein. “Different” means that the proteins differ in at least one amino acid residue. The composition may comprise two, three or more salmocins from the same subclass represented by any one of items (i) to (x), (xi) or (xii) above or, preferably, from different classes represented by any one of items (i) to (x) above. The composition may further comprise one or more E. coli colicin or a derivative thereof e.g. as described in EP 3097783 A1, e.g. for concomitantly controlling pathogenic E. coli such as EHEC.

As the protein of the invention is preferably produced by expression in plants or cells thereof, the composition may be a plant material or extract thereof, wherein the plant material is a material from a plant having expressed the protein, preferably Nicotiana or an edible plant having expressed said protein. An extract of plant material is an aqueous solution containing water-soluble proteins including a salmocin of the invention that is present or expressed in said plant material, or a dried product of such aqueous solution. The extract preferably has water- insoluble components of the plant material removed e.g. by filtration or centrifugation. The plant material may be a material from a plant selected from the group consisting of spinach, chard, beetroot, carrot, sugar beet, leafy beet, amaranth, Nicotiana, and/or said plant material is one or more leaves, roots, tubers, or seeds, or a crushed, milled or comminuted product of said leaves, roots, tubers, or seeds.

The composition or said extract from a plant material may be a solid or liquid composition, such as a solution or a dispersion, containing said salmocin(s). The liquid composition may be aqueous, such as an aqueous solution. The concentration of said protein in said aqueous dispersion or solution may be from 0.0001 to 1 mg/ml, preferably from 0.001 to 0.1 mg/ml, more preferably from 0.005 to 0.05 mg/ml. If more than one salmocin capable of exerting a cytotoxic effect on Salmonella is employed, these concentrations relate to the total concentration of all such salmocins.

The aqueous solution may, apart from the one or more salmocin, contain a buffer. The buffer may be an inorganic or organic acid or salts thereof. An example of an inorganic acid is phosphoric acid or salts thereof. Examples of the organic acid are HEPES, acetic acid, succinic acid, tartaric acid, malic acid, benzoic acid, cinnamic acid, glycolic acid, lactic acid, citric acid, and ascorbic acid. Preferred organic acids are malic acid, lactic acid, citric acid, and ascorbic acid. The pH of the solution may generally be from 4 to 8, preferably from 5 to 8, more preferably from 6.0 to 7.5. If the object to which the composition is applied is meat, the pH of the solution may generally be from 4 to 8, preferably from 4.5 to 7, more preferably from 5.0 to 6.5, and even more preferably from 5.0 to 6.0. Further, the solution may contain isotonic agents such as glycerol or a salt. A preferred salt to be used is sodium chloride. The aqueous solution containing the one or more salmocin may be a buffered aqueous solution that may contain further solutes e.g. salts such as from 50 to 400 mM NaCI, preferably from 100 to 200 mM NaCI. The aqueous solution may further contain a sulfhydryl compound such as dithiothreitol (DTT), dithioerythritol, thioethanol or glutathione, preferably DTT. The concentration of the total of sulfhydryl compounds in the aqueous solution may be from 1 to 50 mM, preferably from 2 to 20 mM and more preferably from 4 to 10 mM.

If the composition of the invention is a solid composition, it may be a powder such as a lyophilized solid composition obtained by lyophilization of the extract or solution mentioned above. The powder may contain additional solid components such as those mentioned above for the aqueous solution. Before use, it may be reconstituted with a suitable liquid, such as water or buffer. The solid composition may contain buffer, salts or other components as mentioned above, such that the concentrations given above may be achieved upon reconstitution or dissolution of the solid composition.

Examples of carriers of the composition are solvents such as water or an aqueous buffer (as described above), salts, sugars such as monosaccharides and disaccharides, sugar alcohols, and other carriers such as those known from pharmaceutical compositions. Examples of the latter are starch, cellulose and other proteins such as albumin. Examples of sugars are glucose, fructose, lactose, sucrose, and maltose.

The composition of the invention may contain at least 10, preferably at least 20, more preferably at least 30, even more preferably at least 50, even more preferably at least 75 % by weight of one or more salmocins of the invention based on the total weight of protein in the composition. The content of salmocin(s) in the composition may be determined by subjecting the composition to SDS-PAGE and analyzing the obtained gel, after staining, by determining the intensity of bands on the gel. Thereby, intensity of bands due to salmocins can be determined in relation to the sum of intensities of bands due to all proteins in the composition. The total protein content in the composition may be determined using the well-known Bradford protein assay.

In one embodiment, the composition of the invention is a pharmaceutical composition. The pharmaceutical composition may, apart from one or more salmocin(s) of the invention, optionally contain an E. coli colicin, and/or one or more suitable pharmaceutically acceptable excipients.

The invention provides a method of preventing or reducing infection or contamination of an object with Salmonella, comprising contacting said object with one or more proteins (salmocins) as described above or a composition as described above. The object may be a surface of any non-organic object or an organic object such as food. Contamination of an object with Salmonella means adhesion of viable Salmonella cells to the object. Reducing contamination with Salmonella means reducing the number of viable Salmonella cells adhering to the object. Determining contamination of objects with Salmonella is part of the general knowledge. For example, dilution plating of solutions or dispersions of homogenized food as done in the Examples or dilution plating of a rinsing solution of other objects may be used, followed by counting bacterial colonies. Preferably, the object is food or animal feed. The food may be meat such as whole poultry carcasses, raw meat, cooked meat, and minced meat, eggs such raw eggs, whole eggs, peeled cooked eggs, scrambled eggs, fried eggs, raw fruit, or raw or cooked vegetable.

For treating or contacting the object with the protein or composition, a solution of the protein or a liquid composition as described above is generally contacted with the object. For example, said object is sprayed with an aqueous solution or is immersed into the aqueous solution as a composition of the invention. The object may be immersed for at least 10 seconds, preferably for at least 1 minute, preferably for at least 5 minutes into the aqueous solution. Contacting the object with a liquid composition helps to distribute the composition over the surface of the object. Where sufficiently even distribution can be achieved, it is possible to contact the object with a solid composition according to the invention, e.g. upon mincing meat. The invention also provides a method of treating infection with Salmonella of a subject in need thereof, comprising administering to said subject one or more proteins (salmocins) as described above or a composition as described above. The subject may be a human being or a mammal such as a farm animal. Examples of farm animals are poultry and cattle. Generally, a liquid or solid pharmaceutical composition containing the salmocin(s) and optionally further components as described above is prepared for administration to the animal or human. Liquid compositions may be aqueous solutions as described above. Solid compositions may be powder containing the at least one salmocin(s) e.g. in freeze-dried form, or tablets obtained from such powder or capsules filled with such powder. Administration may be oral. In this case, the pharmaceutical preparation is one that allows passage through the stomach without being attacked by the acid medium in the stomach. The salmocin(s) should then be released from the pharmaceutical preparation in the intestine. Such pharmaceutical preparations are known in the art. Examples are tablets and capsules resistant to the acid medium in the stomach. It is further possible to administer orally a biological material such as E. coli or plant material containing expressed salmocin(s) to a patient. The salmocin(s) may be administered to a human adult in amounts of 1 mg to 1000 mg per day, preferably of from 10 mg to 250 mg per day to a human patient. Such amounts may also be administered to an animal. In a probiotic approach, a patient may be treated by administering to the patient a genetically-modified microorganism expressing the at least one salmocin(s). The genetically-modified microorganism may be a genetically-modified non-pathogenic E. coli or a lactic acid-producing microorganism as commonly employed in fermentation of milk products. Examples of lactic acid-producing microorganism are bacteria from the genera Lactobacillus such as Lactobacillus lactis and Bifidobacterium such as Bifidobacterium bifidum or Bifidobacterium breve. Another route of administration is by injection into the blood stream of a patient for preventing infection with Salmonella. For this purpose, the salmocin(s) may be dissolved in a physiological saline and the solution be sterilized.

In the methods described above, the Salmonella is Salmonella enterica, preferably Salmonella enterica ssp. enterica.

Salmocins ScolEla and ScolElb have a particularly wide activity against many different serovars of Salmonella, notably of Salmonella enterica, preferably of Salmonella enterica ssp. enterica, as demonstrated in the Examples below. Therefore, ScolEla and ScolElb, or derivatives thereof, are preferably used for treating infection or for preventing or reducing contamination with any Salmonella enterica, preferably any Salmonella enterica ssp. enterica. Salmocins E2, E3, E7 and Spst also have a wide activity against target Salmonella. However, ScolE2 and derivatives thereof may be preferably used against strains 1, 3, 4, 15, 20, 22 to 30 as defined in Tables 5A and 5B. ScolE3 and derivatives thereof may be preferably used against strains 1, 3, 4, 17, and 20 to 25 as defined in Tables 5A and 5B. ScolE7 and derivatives thereof may be preferably used against strains 1, 3, 4, 5, 15, 20, 22 to 30 and 32 as defined in Tables 5A and 5B.

A salmocin according to the invention may be produced by known methods of protein expression in a standard expression system. For producing the salmocin, a nucleotide sequence encoding it may be expressed in a suitable host organism. Methods usable for producing and purifying a protein of interest have been described in the prior art and any such methods may be used. An E. coli expression system as generally known in the art may, for example, be used. If a eukaryotic expression system is used, one or more introns may be inserted in the coding sequence of the salmocin to prevent toxicity on the bacterial organism used for cloning.

Particularly efficient expression methods are plant expression systems that are also known in the prior art. Plant expression systems usable for expressing a salmocin according to the invention are described in the Examples. A possible way of achieving expression of a nucleotide sequence of interest in plants is the use of self-replicating (viral) replicons containing the nucleotide sequence encoding the salmocin. The coding sequence of the salmocin may be codon optimized for expression in plants or in the particular plant used as expression host.

Plant viral expression systems have been described in many publications, such as in WO2012019660, W02008028661 , W02006003018, W02005071090, W02005049839, W02006012906, W002101006, WO2007137788 or WO02068664 and many more publications are cited in these documents. Various methods for introducing a nucleic acid molecule, such as a DNA molecule, into a plant or plant part for transient expression are known. Agrobacteria may be used for transfecting plants with the nucleic acid molecule (vector) or nucleic acid construct e.g. by agroinfiltration or spraying with agrobacterial suspensions. For references, see WO 2012019660, WO 2014187571 , or WO 2013149726.

In embodiments wherein strong expression of a salmocin as a protein of interest is desired, a nucleic acid construct containing a nucleotide sequence encoding the salmocin may encode a viral vector that can replicate in plant cells to form replicons of the viral vector. In order to be replicating, the viral vector and the replicons may contain an origin of replication that can be recognized by a nucleic acid polymerase present in plant cells, such as by the viral polymerase expressed from the replicon. In case of RNA viral vectors (referred to as “RNA replicons”), the replicons may be formed by transcription under the control of a promoter active in plant cells, from the DNA construct after the latter has been introduced into plant cell nuclei.

In case of DNA replicons, the replicons may be formed by recombination between two recombination sites flanking the sequence encoding the viral replicon in the DNA construct, e.g. as described in WOOO/17365 and WO 99/22003. If the replicon is encoded by the DNA construct, RNA replicons are preferred. Use of DNA and RNA viral vectors (DNA or RNA replicons) has been extensively described in the literature over the years. Some examples are the following patent publications: W02008028661, W02007137788, WO 2006003018, W02005071090, W02005049839, W002097080, WO02088369, WO02068664. Examples of DNA viral vectors are those based on geminiviruses. For the present invention, viral vectors or replicons based on plant RNA viruses, notably those based on plus-sense single-stranded RNA viruses may be preferably used. Accordingly, the viral replicon may be a plus-sense single- stranded RNA replicon. Examples of such viral vectors are those based on tobacco mosaic virus (TMV) and potexvirus X (PVX). “Based on” means that the viral vector uses the replication system such as the replicase and/or other proteins involved in replication of these viruses. Potexvirus-based viral vectors and expression systems are described in EP2061890 or W02008/028661.

The salmocin may be expressed in a multi-cellular plant or a part thereof, notably a higher plant or parts thereof. Both monocot and dicot (crop) plants can be used. Common plants usable for expressing the protein of interest include Nicotiana benthamiana, Nicotiana tabacum, spinach, Brassica campest s, B. juncea, beets ( Beta vulgaris), cress, arugula, mustard, strawberry, Chenopodium capitatum, lettuce, sunflower, cucumber, Chinese cabbage, cabbage, carrot, green onion, onion, radish, lettuce, field peas, cauliflower, broccoli, burdock, turnip, tomato, eggplant, squash, watermelon, prince melon, and melon. Preferred plants are spinach, chard, beetroot, carrot, sugar beet, Nicotiana tabacum, and Nicotiana benthamiana. Expression in edible plants may be used for preventing contamination of the plants or food made therefrom with Salmonella. In one embodiment, plants are used that do not normally enter the human or animal food chain such as Nicotiana species such as N. tabacum and N. benthamiana.

Generally, the salmocin as a protein of interest is expressed in the cytosol of cells of the plants or plant parts. In this case, no signal peptide directing the protein of interest into a particular compartment is added to the protein. Alternatively, the protein of interest can be expressed in or targeted into chloroplasts of the plants; in the latter case, an N-terminal pre sequence, generally referred to as plastid transit peptide or chloroplast targeting peptide, is added to the N-terminal or C-terminal end, preferably the N-terminal end, of the salmocin as the protein of interest.

The salmocin may be co-expressed together with an immunity protein as described in the experimental section, notably if the salmocin has nuclease activity, for preventing toxicity on plant tissue. Suitable immunity proteins that may be co-expressed are those given in Table 2 below. In the process of producing a composition comprising at least one salmocin, a salmocin is, in the first step, expressed in a plant or cells of a plant, such as an edible plant. In the next step, plant material containing expressed salmocin from a plant having expressed the salmocin is harvested. Plant material may e.g. be leaves, roots, tubers, or seeds, or a crushed, milled or comminuted product of leaves, roots, tubers, or seeds. In step (iii), the salmocin is extracted from the plant material using an aqueous buffer. This may include that the plant material is homogenized and insoluble material may be removed by centrifugation or filtration. Soluble components including the salmocin will be extracted into the aqueous buffer to produce a salmocin solution in the aqueous buffer. The aqueous buffer may contain an inorganic or organic acid or salts thereof and may have a pH as defined above for the aqueous solution as a composition of the invention. Further, the aqueous buffer may contain salt and/or a sulfhydryl compound as also described above for the aqueous solution as a composition of the invention.

If a relatively pure salmocin composition is desired, the salmocin solution in the aqueous buffer may be further purified by removing undesired components in step (iv) according to known methods of protein purification.

Accordingly, the invention provides a process of producing a composition comprising a protein according to the invention, said process comprising the following steps:

(i) expressing said protein in a plant as described above, preferably an edible plant or

Nicotiana,

(ii) harvesting plant material containing expressed protein from said plant,

(iii) extracting said protein from said plant material using an aqueous buffer to obtain a composition containing said protein, optionally removing undesired contaminants from said composition.

If a salmocin is expressed in plants, the plants or tissue thereof having expressed protein is harvested, the tissue may be homogenized and insoluble material may be removed by centrifugation or filtration. If relatively pure salmocin is desired, the salmocin may be further purified by generally known method of protein purification such as by chromatographic methods which can remove other host-cell proteins and plant metabolites such as alkaloids and polyphenols. Purified salmocin solutions may be concentrated and/or freeze-dried.

If salmocins are expressed in edible plants, crude protein extracts from the edible plants or semi-purified concentrates may be used for preventing or reducing contamination of an object such as food with Salmonella.

Preferred Embodiments

A protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-x), (B-x), (C-x), (D-x), or (E-x). A protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-xi), (B-xi), (C-xi), (D-xi), or (E-xi).

A protein the amino acid sequence of which comprises or consists of the amino acid sequences (A-xii), (B-xii), (C-xii), (D-xii), or (E-xii).

A protein according to any one of the previous three preferred embodiments (sentences), wherein the protein is as defined in any one of (B-x), (C-x), (D-x), (E-x), (B-xi), (C- xi), (D-xi), (E-xi), (B-xii), (C-xii), (D-xii), or (E-xii), preferably as defined in any one of (B-xi), (C- xi), (D-xi), (E-xi), (B-xii), (C-xii), (D-xii), or (E-xii), and wherein the amino acid residue of said protein corresponding to residue 155 of SEQ ID NO: 33 is Pro and/or the amino acid residue corresponding to residue 246 of SEQ ID NO: 33 is Arg or Lys, preferably Arg, and wherein the amino acid residues corresponding to residues 76 and 84 of SEQ ID NO:

33 are both Gin.

A protein (ScolEld of SEQ ID NO:26 or its derivative) the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-viii), (B-viii), (C-viii), (D-viii), or (E-viii), optionally as further defined above regarding amino acid residues corresponding to SEQ ID NO: 4.

A composition comprising a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-x), (B-x), (C-x), (D-x), or (E-x), optionally as further defined above with regard to specific amino acid residues, and a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-v), (B-v), (C-v), (D-v), or (E-v);

A composition comprising a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-xi), (B-xi), (C-xi), (D-xi), or (E-xi), optionally as further defined above with regard to specific amino acid residues, and a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-v), (B-v), (C-v), (D-v), or (E-v);

A composition comprising a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-xii), (B-xii), (C-xii), (D-xii), or (E-xii), optionally as further defined above with regard to specific amino acid residues, and a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-v), (B-v), (C-v), (D-v), or (E-v);

A composition comprising a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-x), (B-x), (C-x), (D-x), or (E-x), optionally as further defined above with regard to specific amino acid residues, and a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-iv), (B-iv), (C-iv), (D-iv), or (E-iv);

A composition comprising a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-xi), (B-xi), (C-xi), (D-xi), or (E-xi), optionally as further defined above with regard to specific amino acid residues, and a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-iv), (B-iv), (C-iv), (D-iv), or (E-iv);

A composition comprising a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-xii), (B-xii), (C-xii), (D-xii), or (E-xii), optionally as further defined above with regard to specific amino acid residues, and a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-iv), (B-iv), (C-iv), (D-iv), or (E-iv).

The protein or composition of any one of the preceding sentences for use in a method of treating infection of a subject with Salmonella.

A method of preventing or reducing infection or contamination of an object with Salmonella, comprising contacting said object with a protein as defined in any one of the preceding sentences or with a composition as defined in any one of the preceding sentences.

A method of preventing or reducing infection or contamination of an object with Salmonella, comprising contacting said object with a protein comprising or consisting of any one of the amino acid sequences (A-x), (B-x), (C-x), (D-x), or (E-x) (optionally as further defined above with regard to specific amino acid residues), or contacting said object with a composition comprising a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-x), (B-x), (C-x), (D-x), or (E-x) (optionally as further defined above with regard to specific amino acid residues) and a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-v), (B-v), (C-v), (D-v), or (E- v).

A method of preventing or reducing infection or contamination of an object with Salmonella, comprising contacting said object with a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-xi), (B-xi), (C-xi), (D-xi), or (E- xi) (optionally as further defined above with regard to specific amino acid residues), or contacting said object with a composition comprising a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences according to (A-xi), (B-xi), (C-xi), (D-xi), or (E-xi) (optionally as further defined above with regard to specific amino acid residues) and a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-v), (B-v), (C-v), (D-v), or (E-v). A method of preventing or reducing infection or contamination of an object with Salmonella, comprising contacting said object with a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-xii), (B-xii), (C-xii), (D-xii), or (E-xii) (optionally as further defined above with regard to specific amino acid residues), or contacting said object with a composition comprising a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-xii), (B-xii), (C-xii), (D- xii), or (E-xii) (optionally as further defined above with regard to specific amino acid residues) and a protein the amino acid sequence of which comprises or consists of any one of the amino acid sequences (A-v), (B-v), (C-v), (D-v), or (E-v).

Methods of treating infection with Salmonella of a subject in need thereof, comprising administering to said subject a protein as defined in any one of the preceding sentences or a composition as defined in any one of the preceding sentences.

These preferred embodiments can be combined with other embodiments or preferred embodiments described herein.

EXAMPLES

Example 1 : Plasmid constructs (salmocins)

Six salmocins representing four activity groups were selected (Table 1). Table 1. List of Salmonella bacteriocins (salmocins) used in examples.

The list comprises salmocins ScolE2, ScolE3, ScolE7, ScolEla, ScolElb and Spst. Respective amino acid sequences were retrieved from GenBank; corresponding nucleotide sequences with codon usage optimized for Nicotiana benthamiana were synthesized by Thermo Fisher Scientific Inc. In case of salmocins ScolE2, ScolE3 and ScolE7, the coding sequence was interrupted by insertion of the cat 1 intron (the first intron from Ricinus communis cat1 gene for catalase CAT1 (GenBank: D21161.1 , nucleotide positions between 679 and 867)) to prevent the cytotoxicity in Escherichia coli cells used for cloning. Salmocin coding sequences were inserted into TMV-based assembled viral vector pNMD035 (described in detail in WO2012/019660) resulting in plasmid constructs depicted in Fig.1 A-B. In preliminary expression studies, it was found that bacteriocins with nuclease (RNase and DNase) activities are usually highly toxic for plant tissues where they are expressed. Their expression resulted in tissue necrosis and poor accumulation of recombinant protein. However, co-expression with appropriate immunity proteins reduced the toxic effect and increased the accumulation of these bacteriocins dramatically. Salmocin immunity proteins used in our studies are listed in the Table 2.

Table 2. List of immunity proteins used in examples

Immunity proteins SlmmE2 and SlmmE7 for salmocins ScolE2 and ScolE7, respectively. Amino acid sequences of immunity proteins were retrieved from GenBank; corresponding nucleotide sequences with codon usage optimized for Nicotiana benthamiana were synthesized by Thermo Fisher Scientific Inc. and subcloned into PVX-based assembled viral vector pNMD670 as described in WO2012/019660. Resulting plasmid constructs are shown in Fig. 1 A.

Example 2: Salmocin expression screen

6 week-old Nicotiana benthamiana plants were infiltrated using needleless syringe with diluted Agrobacterium tumefaciens cultures carrying TMV-based assembled vectors for cytosolic salmocin expression. In case of salmocins ScolE2 and ScolE7, Agrobacterium cultures carrying TMV-based vector for salmocin expression were mixed in equal proportions with other cultures carrying PVX-based vectors for the expression of the corresponding immunity proteins. Individual overnight cultures were adjusted to OD6oo=1.5 and further diluted 1:100 with infiltration buffer containing 10 mM MES, pH 5.5 and 10 mM MgSCL. Plasmid constructs used in this experiment are summarized in Table 3. For determination of optimal harvesting timepoint, plant material was harvested at several timepoints post infiltration and used for protein extraction with 5 volumes of buffer containing 50 mM HEPES (pH 7.0), 10 mM potassium acetate, 5 mM magnesium acetate, 10% (v/v) glycerol, 0.05% (v/v) Tween-20 and 300 mM NaCI. Total soluble protein (TSP) concentration was determined using the Bradford assay, and TSP extracts were analyzed using SDS-PAGE with Coomasssie staining. In our experiment, all tested salmocins were expressed on reasonably high levels varying between 1.2 and 1.8 mg recombinant colicin/g FW or between 18 and 47% of TSP (Table 4) as determined by comparison with bovine serum albumin (BSA) protein. Table 3. Summary of salmocin expression screen.

Table 4. Yield of recombinant salmocins expressed in Nicotiana benthamiana plants. FW stands for fresh weight, TSP for total soluble protein, dpi for days post infiltration, AV for average, SD for standard deviation, N for number of independent experiments.

Example 3: Salmocin activity screen

We analyzed the antimicrobial activity of plant-made recombinant salmocins against 36 strains of 33 different serotypes of S. enterica ssp. enterica. Details of strains used in the experiments are given in Tables 5A and 5B.

Table 5A. Salmonella enterica ssp. enterica strains used for antimicrobial activity screen.

Table 5B. Salmonella enterica ssp. enterica strains used for antimicrobial activity screen.

Antimicrobial activity of recombinant salmocin-containing plant extracts was tested in radial diffusion assays via spot-on-lawn-method. For this purpose, we prepared agar plates overlaid with soft agar containing cells of tested Salmonella strains. 10x10 cm quadratic petri dishes were poured with 15-20 ml LB agar medium (1.5% w/v agar). LB soft agar medium (0.8% (w/v) agar) was melted, 20 ml aliquots were transferred into 50 ml plastic tubes and their temperature was adapted to 50-55°C. Salmonella overnight cultures adjusted to Oϋboo =1.0 with LB medium were added to the soft agar medium with a ratio of 1:100 resulting in the final OD6OO=0.01 or approximately 1x10 7 cells/ml and 20 ml LB softagar containing Salmonella test strain are poured on the pre-poured LB plate resulting in 0.14 mL bacterial solution of 1x10 7 cfu/mL per cm 2 .

Plant leaf material was extracted as described in Example 2. We prepared 1:1 dilution series of plant extracts starting with undiluted samples by using same extraction buffer. 5 pi aliquots of TSP dilution series were applied to agar plates; plates were incubated at 37°C overnight. Antimicrobial activity was evaluated based on clearing zones.

Among the 6 tested salmocins, one demonstrated narrow antimicrobial activity (Spst - 12% of strains inhibited), one salmocin had medium activity spectrum (ScolE3 - 60% of strains inhibited), and 4 others had broad activity spectrum: ScolE2 and ScolE7 - inhibited about 90% of strains and ScolEla and ScolElb - inhibited 100% of strains (Fig. 3).

Salmocins ScolEI and ScolElb demonstrated not only broad but also remarkably high activity against tested Salmonella strains (Figs. 4, 5). For semi-quantitative comparison, we represented relative antimicrobial activity of recombinant colicins in arbitrary units (AU), calculated as a dilution factor for the highest dilution of protein extract causing a detectable clearing effect in the radial diffusion assay. Salmocin antimicrobial activity against Salmonella strains calculated in AU per mg FW of the plant tissue is shown in Figs. 6, 8, 10, 12 and 14 for ScolE2, ScolE3, ScolE7, ScolEla, and ScolElb, respectively. Thereby, the yield of specific active agent per unit of biomass; /. e. the specific production capacity of the host is reflected.

Figures 7, 9, 11, 13 and 15 demonstrate the same activity calculated in AU per pg of recombinant salmocin proteins ScolE2, ScolE3, ScolE7, ScolEla and ScolElb, respectively, reflecting the specific antimicrobial potency of salmocins.

Example 4: Plasmid constructs (colicins)

Six colicins representing two activity groups were selected (Table 6). The list comprises colicins colS4, col5, col10, colla, collb and colM. Respective amino acid sequences were retrieved from GenBank; corresponding nucleotide sequences with codon usage optimized for Nicotiana benthamiana were synthesized by Thermo Fisher Scientific Inc. Colicin coding sequences were inserted into TMV-based assembled viral vector pNMD035 (described in detail in WO2012/019660) resulting in plasmid constructs depicted in Fig.16. The coding sequence of colicin M was interrupted by insertion of the cat 1 intron (the first intron from Ricinus communis cat1 gene for catalase CAT1 (GenBank: D21161.1 , nucleotide positions between 679 and 867)).

Table 6. List of E. coli bacteriocins (colicins) used in Examples.

Example 5: colicin expression screen

6 week-old Nicotiana benthamiana plants were infiltrated using needleless syringe with diluted Agrobacterium tumefaciens cultures carrying TMV-based assembled vectors for cytosolic colicin expression. Agrobacterium overnight cultures were adjusted to OD 6 oo=1.5 and further diluted 1 :100 with infiltration buffer containing 10 mM MES, pH 5.5 and 10 mM MgS0 4 . Plasmid constructs used in this experiment are summarized in Table 7. For determination of optimal harvesting timepoint, plant material was harvested at several timepoints post infiltration and used for protein extraction with 5 volumes of buffer containing 50 mM HEPES (pH 7.0), 10 mM potassium acetate, 5 mM magnesium acetate, 10% (v/v) glycerol, 0.05% (v/v) Tween-20 and 300 mM NaCI. Total soluble protein (TSP) concentration was determined using the Bradford assay, and TSP extracts were analyzed using SDS-PAGE with Coomasssie staining. In our experiment, all tested colicins were expressed on reasonably high levels varying between 1.5 and 4.7 mg recombinant colicin/g FW or 16 and 41% of TSP (Table 8) as determined by comparison with bovine serum albumin (BSA) protein.

Table 7. Summary of colicin expression screen.

Table 8. Yield of recombinant colicins expressed in Nicotiana benthamiana plants. FW stands for fresh weight, TSP for total soluble protein, dpi for days post infiltration, AV for average, SD for standard deviation, N for number of independent experiments.

Example 6: Colicin activity screen

We analyzed the antimicrobial activity of plant-made recombinant colicins against 35 strains of 32 different serotypes of S. enterica ssp. enterica. Details of strains used in our experiments are given in tables 5A and 5B (strain numbers 1-35).

Antimicrobial activity of recombinant colicin-containing plant extracts was tested in radial diffusion assays via spot-on-lawn-method as described in Example 3.

Among the 6 tested colicins, one demonstrated narrow antimicrobial activity (colS4 - 25% of strains inhibited), three colicins had medium activity spectrum (col5, col10 and colM - 48%, 46% and 42% of strains inhibited, respectively), and two colicins had broad activity spectrum: colla and collb - inhibited 96% and 89% of strains, respectively (Fig. 17). Example 7: Determination of efficacy of bactericidal effect of bacteriocins (colicin blends) on pathogenic strains of S. enterics ssp. enterics applied to meat matrices

Plant-produced colicins were tested for antibacterial activity on samples of chicken breast fillet contaminated with pathogenic Salmonella.

Evaluation of efficacy encompasses the analysis of pathogenic S. enterica ssp. enterica populations on contaminated meat samples subsequently treated with blends of plant-made recombinant colicins or a control carrier solution consisting of plant extract from the same production host but without colicins, and storage of treated meat samples for various time periods at 4°C.

No special sourcing of meat samples is used to ensure that bacteriocin activity is evaluated in representative consumer products. Raw chicken breast fillets are purchased at retail outlets (for these studies, ALDI supermarket, Halle, Germany), one day before the experiment. The meat is stored at 4 °C and the meat is not washed or pre-treated before experimental exposures.

The meat iest matrices are experimentally contaminated with a 1:1 or 1:1 :1 :1 mixture of 2 or 4 Salmonella enterica ssp. enterica strains representing the serotypes Typhimurium and Enteritidis (ATCC ® 9270™*, ATCC ® 13076™*) or Typhimurium, Enteritidis, Newport and Anatum (ATCC ® 9270™*, ATCC ® 13076™*, ATCC ® 6962™* and ATCC ® 9270™*), respectively (Figs. 18 and 19, respectively). Prior to meat contamination, the strains are individually grown to OD 600 = 0.3 and mixed 1 :1 or 1 : 1 : 1 : 1. The strain mix is further diluted to the desired cell number (OD 600 =0.005-0.001 , 2x10 6 -1.8x10 5 cfu/ml) with LB broth for use as meat contamination suspension to achieve an initial inoculum of ~2x10 4 cfu/g meat. Chicken breast trims (3 pieces of ~25 g weight) are dipped into 12 ml of bacterial suspension and inverted and dipped again to inoculate both sides. Contaminated meat and bacteria are allowed to dry and colonize matrix samples, respectively, for 30 min at RT, during which time chicken breast trims are inverted every 15 min.

Contaminated meat is either treated with carrier or colicin blend solution (TSP extracts prepared 50 mM HEPES pH7.0, 10 mM K acetate, 5 mM Mg acetate, 10% (v/v) glycerol, 0.05% (v/v) Tween-20, 300 mM NaCI from N. benthamiana either non-treated plant material or plant material upon syringe-inoculation with Agrobacterium for colicin expression) by low-pressure spraying (2-4 bar) using atomizer flasks. Proposed application rates are 3 mg/kg for colicin M and 1 mg/kg for any other colicin used in the blend (colicin la and colicin 5). The meat is further incubated for 30 min at RT while inverted every 15 min.

Thirty minutes after colicin application, aliquots of ~25 g chicken breast trims are placed into sterile sample bags (BagFilter ® 400 P) in replicates, the exact weight of each sample is recorded, and sample bags are closed using a closing clip (BagClip ® 400). In total, meat samples are incubated at room temperature for 1 h upon colicin treatment before the sealed meat samples are then stored at 4°C.

Meat samples are sampled at 1 h, 48 h and 72 h of storage at 4°C for determination of on- matrix microbial contamination levels. For recovery of pathogenic Salmonella from meat samples, to each ~25 g aliquot of meat sample -100 ml buffered peptone water is added. The samples are homogenized in a laboratory blender (BagMixer ® 400CC ® ; settings: gap 0, time 30 s, speed 4). Microbial suspensions from filtered part of the storage bag are collected and a 1:10 dilution series is prepared. 100 pi aliquots of undiluted or diluted microbial suspensions are plated on XLD agar. The plates are incubated for 18-24 h at 37°C and the CFU (colony forming units) are enumerated. The CFU number per g sample is calculated as follows:

Total CFU=Actual CFU x Concentration Factor x Dilution Factor x Actual ml Peptone Water g Meat 0.1 ml Plating Volume Actual g Sample

The efficacy of the colicin treatment in reducing the number of viable pathogenic Salmonella in the experimentally contaminated meat samples is evaluated by comparing the data obtained with the carrier-treated control samples and colicin-treated samples by one-way ANOVA (Tukey ' s multiple comparisons test) and unpaired parametric t-test using GraphPad Prism v. 6.01.

The results of bacterial counts are shown in Figs. 18 and 19 for a two-strain or four-strain Salmonella contamination, respectively. Most significant reduction of bacterial population (1.8 logs) occurred already after 1 hour storage upon colicin treatment.

In summary, statistically significant reduction of Salmonella populations on contaminated meat could be achieved by treatment of meat with a colicin blend.

Salmocin blends or salmocin/colicin blends will be tested for decontamination of food products from Salmonella and are planned to be used in food industry for reducing Salmonella contamination. Salmocins ScolEla and ScolElb show the broadest antimicrobial activity against tested Salmonella strains. Thus, they can be used as a main ingredient of salmocin cocktails for the control of Salmonella.

Example 8: Production of salmocins in stable transgenic hosts

N. benthamiana was transformed by Agrobacterium-medi atedi leaf disk transformation using vectors for EtOH-inducible transgene expression and induction of detached leaves of TO generation transgenic plants for salmocin expression. This was done as described in Schulz et al. Proc. Natl. Acad. Sci. USA. 112, E5454-E5460 (2015). Stable transgenic Nicotiana benthamiana plants containing the genomic insertion of TMV- based viral vector double-inducible with ethanol for ScolElb expression (Fig. 21) exhibited normal growth and development, and selected transgenic lines accumulated salmocins upon induction with ethanol to the expected levels (Fig. 22).

Example 9: Production of salmocins in spinach

Spinacia oleracea cv. Fruhes Riesenblatt plants were grown in the greenhouse (day and night temperatures of 19-23°C and 17-20°C, respectively, with 12 h light and 35-70% humidity). Six-week-old plants were used for syringe infiltration as described in Example 2. Expression of recombinant proteins was confirmed using SDS-PAGE with Coomassie staining (Fig. 23).

Example 10: Extended salmocin activity screen

We further analyzed the antimicrobial activity of plant-made recombinant salmocins against other strains of Salmonella as described in Example 6. To determine the salmocin antimicrobial activity spectrum, 109 strains representing 105 S. enterica ssp. enterica serotypes were selected and screened (Table 9). The screen included one strain each of all serotypes (except serotypes Typhi and I4,5:12:r>) that are documented at the U.S. Centers for Disease Control and Prevention (CDC) (www.cdc.gov/nationalsurveillance/pdfs/salmonella-annual- report-2013-508c.pdf) as having caused at least 100 incidences of human Salmonella infection from 2003-2012, two strains of serotypes Typhimurium, Enteritidis and Javiana and 6 serotypes causing less than 100 incidences or not reported to CDC.

Table 9. List of Salmonella enterica ssp. enterica strains analysed for antimicrobial susceptibility. Serotype antigenic formula is given in (Subspecies [space] O antigens [colon] Phase 1 H antigens [colon] Phase 2 H antigens) as provided by the supplier. Numbers in source of supply correspond to 1 - Microbiologies, Inc. (St. Cloud, USA), 2 - LGC Standards (Teddington, UK), 3 - Robert Koch Institute, national reference centre for salmonellosis and other enteric pathogens (Wernigerode, Germany), 4 - Leibnitz Institute DSMZ - German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany), 5 - National Collection of Type Cultures (Salisbury, UK). Strains marked with “ were used for antmicrobial susceptibility testing in triplicate experiments. The number of incidences refers to laboratory-confirmed human Salmonella infections (US) reported to CDC 2003-2012 published in National Enteric Disease Surveillance: Salmonella Annual Report, 2013 (CDC, June 2016; www.cdc.gov/nationalsurveillance/pdfs/salmonella-annual-repo rt-2013-508c.pdf.

Table 10. List of E. coli STEC strains used in this study.

In order to estimate the breadth of the activity spectrum, all strains were tested at least once and 36 or 35 strains were subsequently re-screened in triplicate experiments with salmocins and colicins, respectively (Fig. 24). The broadest antimicrobial activity spectrum was again identified for salmocins ScolEla and ScolElb, which showed positive antibacterial activity against 100% and 99% of all strains evaluated, respectively. Significant breadth of activity was also observed for salmocins ScolE2 (94%), ScolE3 (70%) and ScolE7 (95%) as reflected by their activity on the subset of 36 strains represented in Fig. 24e.

The five salmocins analysed were divided into four groups based on their ability to control major pathogenic Salmonella strains. Salmocins ScolEla and ScolElb were universally active, each being able to kill all tested pathovars and showing the highest average activity of higher than 10 5 AU/pg recombinant protein on all tested strains (Fig. 24a) and in most cases higher than 10 3 AU/pg protein against individual strains. The remaining salmocins fell into two groups, with salmocins ScolE2 and ScolE7 in one group having a 100-fold lower average activity (<10 5 AU/pg protein, Fig. 24a), and ScolE3 in another group showing substantially lower average activity (10 2 AU/pg, Fig. 24a).

In contrast to the high potencies of salmocins in inhibiting enteropathogenic S. enterica strains, the specific activities of colicins la, lb, M, 5, 10 and S4 (Table 6) were 2-4 orders of magnitude lower (2-3 logs AU/pg, Fig. 24b), although most of the 109 strains were inhibited by colicins la (92%) and lb (90%) and about one third of strains by colicins S4 (45%), 5 (25%), 10 (29%) and M (34%), as also reflected in the susceptibility pattern of the subset of 35 strains (Fig. 24f). In general, salmocins demonstrated higher and broader activity against Salmonella than E. coli colicins. Conversely, salmocins showed low (below 10 2 AU/pg) inter-specific and narrrow activity against E. coli STEC (Table 10) strains (Fig. 24c, g).

Example 11 : Individual salmocin ScolEla and salmocin blends control Salmonella on contaminated chicken meat matrices

The bactericidal efficacy of plant-produced individual salmocin ScolEla as well as salmocin blends for control of Salmonella- contaminated meat surfaces was analyzed in a simulation study.

Chicken breast fillet was purchased from a local supermarket. Nalidixic acid resistant mutants of strains of S. enterica ssp. enterica serovars Enteritidis (strain ATCC ® 13076™*), Typhimurium (strain ATCC ® 14028™*), Newport (strain ATCC ® 6962™*), Javiana (strain ATCC ® 10721 ™*), Heidelberg (strain ATCC ® 8326™*), Infantis (strain ATCC ® BAA-1675™*) and Muenchen (strain ATCC ® 8388™*) were individually grown in LB medium supplemented with 25 pg/ml nalidixic acid to stationary phase, diluted with fresh LB and grown to exponential phase. For contamination of poultry, bacterial cultures were diluted with LB medium to OD 6 oo=0.001 (~2 x 10 5 cfu/ml) and mixed 1 :1:1:1 :1:1:1. A pool of chicken breast fillets cut into pieces of about 20 g weight was inoculated with 1 ml of a mixture of 7 S. enterica strains at ~2 x 10 5 CFU/ml density per 100 g of meat at room temperature resulting in an initial contamination level of meat matrices of about 3 log CFU/g of a 7-serotype mixture of pathogenic S. enterica ; attachment of bacteria to meat surfaces was allowed for 30 min at room temperature. Subsequently, chicken breast trims were treated by spraying (10 ml/kg) with either plant extract control (TSP extract of WT N. benthamiana plant material with no salmocins, prepared with 50 mM HEPES pH 7.0, 10 mM K acetate, 5 mM Mg acetate, 10% (v/v) glycerol, 0.05% (v/v) Tween-20, 300 mM NaCI), orsalmocin solutions (either individual or mixtures of TSP extracts of N. benthamiana plant material expressing salmocins ScolEla, ScolElb, ScolE2 and ScolE7 prepared with the same buffer as the plant extract control) at concentrations of 3 mg/kg ScolEla, or 3 mg/kg ScolEla, 1 mg/kg ScolElb, 1 mg/kg ScolE2, 1 mg/kg ScolE7 or 0.3 mg/kg ScolEla, 0.1 mg/kg ScolElb, 0.1 mg/kg ScolE2, and 0.1 mg/kg ScolE7. Treated meat trims were further incubated at room temperature for 30 min. Aliquots of meat trims corresponding to ~40 g were packed into BagFilter ® 400P sterile bags (Interscience) and stored for 1 h, 1 d and 3 d at 10°C, which represents realistic industrial meat processing conditions that are permissive but suboptimal for bacterial growth.

In total, meat samples were incubated at room temperature for 1.5 h during salmocin treatment before they were sealed and stored at 10°C. For analysis of bacterial populations, poultry aliquots were homogenized with 4 vol. peptone water using Bag Mixer ® 400CC ® homogenizer (settings: gap 0, time 30 s, speed 4; Interscience) and colony forming units (CFU) of S. enterica were enumerated on XLD medium (Sifin Diagnostics) supplemented with 25 pg/ml nalidixic acid upon plating of serial dilutions of microbial suspensions. Samples were analysed in quadruplicate.

The efficacy of the salmocin treatment in reducing the number of viable pathogenic Salmonella in the experimentally contaminated meat samples was evaluated by comparing the data obtained with the carrier-treated control samples and salmocin-treated samples by two- tailed unpaired parametric t-test with 6 degrees of freedom using GraphPad Prism v. 6.01.

Efficacy of salmocin treatment was assessed for the extent of reduction in the pathogenic bacterial population level on salmocin-treated (individual ScolEla at an application rate of 3 mg/kg meat and salmocin blend consisting of ScolE1a+ScolE1b+ScolE2+ScolE7 applied at 3+1 +1+1 mg/kg meat, respectively), both in relation to plant extract control-treated, meat samples and statistically significant net reductions in viable counts of 2-3 logs CFU/g meat at all timepoints analysed were found (Fig. 25). The highest level of reduction of bacterial populations was observed for the 4-salmocin blend (concentration of 3+1+1+1 mg/kg meat) with up to 3.39 mean log reduction vs. carrier treatment upon 48 h of storage, which corresponds to a 99.6 mean percent reduction of bacteria. A single salmocin, ScolEla (applied at 3 mg/kg meat), was able to control Salmonella contamination on meat with similar efficacy to the blend of four salmocins applied at double the concentration (6 mg/kg meat total salmocin). Even a treatment with salmocins at very low dose (total salmocin 0.6 mg/kg meat; 0.3+0.1+0.1+0.1 mg/kg meat for a blend of ScolE1a+ScolE1b+ScolE2+ScolE7) produced statistically significant reductions of bacterial populations of about 1 log CFU for up to 48 h of storage. Upon initial reduction of bacterial contamination, re-growth of viable bacteria was observed after 72 h, indicating that salmocins act quickly but have no prolonged technical effect on food.

Example 12: Recombinant salmocins are correctly expressed by plants

The primary structure including post-translational modifications of the plant-expressed recombinant salmocins contained in plant TSP extracts was analysed by Matrix-assisted laser desorption/ionization (MALDI) time-of-flight (TOF) mass spectrometry (MS).

For proteolytic digestion, TSP extracts prepared from plant material expressing salmocins with 5 vol. 20 mM Na citrate, 20 mM NahhPCU, 30 mM NaCI, pH 5.5 were subjected to SDS- PAGE and Coomassie-stained SDS gel bands containing 5 pg of protein were excised and destained by consecutive washing with 100 mM NH4HCO3 and 100 mM NH4HCO3 in acetonitrile (ACN)/H 2 0 (50; 50, v/v). Disulfide bonds were reduced with 10 mM DTT for 45 min at 50 °C followed by alkylation with 10 mg/ml of iodoacetamide for 60 min. Destained and alkylated gel bands were then subjected to proteolytic digestion with different sequencing grade endoproteinases (Promega, Madison, USA). Protease: protein ratio in the digestion solutions was adjusted to 1 :20 (w/w) and digestions were carried out for 12 h at 25 °C (chymotrypsin) or 37 °C (Asp-N, Glu-C, Lys-C, trypsin). Proteolytic peptides were extracted by consecutive washing with H2O, ACN/H 2 0/trifluoroacetic acid (50; 45; 5, v/v/v) and ACN, respectively. Extraction solutions were combined, concentrated in a vacuum centrifuge and resolubilized in H 2 0/acetic acid (90; 10, v/v).

Proteolytic salmocin peptides obtained as described above or purified intact plant- produced salmocin ScolEla, ScolElb and ScolE7 proteins were purified for mass spectrometry by solid-phase extraction using C4 or C18 bonded silica material (ZipTip ® , Millipore, Darmstadt, Germany) and elution solutions were co-crystallized on a MALDI ground steel target with 2,5- dihydroxyacetophenone as well as 2,5-dihydroxybenzoic acid matrix (Bruker Daltonics, Bremen, Germany).

Mass spectra were acquired on a MALDI-TOF/TOF mass spectrometer (Autoflex SpeedTM, Bruker Daltonics, Bremen, Germany) with positive polarity in linear mode for molecular mass determination and in reflector mode for protein sequencing by In-source decay (ISD) analysis. The matrix crystals were irradiated with a Nd:YAG laser (Smart beam-IITM, Bruker Daltonics, Bremen, Germany) at an emission wavelength of 355 nm and set to a pulse rate of 1 kHz.

MS and MS/MS spectra were recorded with flexControl (version 3.4, Bruker Daltonics, Bremen, Germany) by accumulation of at least 5000 or 10000 laser shots (per sample spot), respectively. Laser energy was set slightly above the threshold for MS experiments and set to maximum for MS/MS analyses. Spectra processing was carried out with flexAnalysis (version 3.4, Bruker Daltonics, Bremen, Germany) by applying baseline subtraction with TopHat algorithm, smoothing with Savitzky-Golay algorithm and peak detection with SNAP algorithm.

The mass spectrometer was calibrated using a set of standard peptides and proteins with known masses (Peptide Calibration Standard II, Protein Calibration Standard I and II, Bruker Daltonics, Bremen, Germany).

Determination of the intact molecular mass was based on the mass-to-charge-ratios ( m/z ) of single and multiple charged molecular ions.

Sequencing of protein termini was carried out by ISD analysis. The annotation of ISD fragment spectra was carried using BioTools (version 3.2, Bruker Daltonics, Bremen, Germany) by in silico generation of m/z values for fragment ions and their comparison with the m/z values of the fragment signals observed within the acquired ISD spectra. This approach enabled the identification of the terminal amino acid sequences as well as of present modifications.

For protein sequencing analysis, only fragment (MS/MS) spectra were used for the identification of proteolytic peptides and the annotation was carried out with PEAKS Studio (version 7.5, Bioinformatics Solutions Inc., Waterloo, Canada). Identification of proteins and verification of their amino acid sequences was performed by searching the MS/MS data against the NCBI nr database and the UniProt/SwissProt database to which the sequences of the salmocins were appended, respectively. Database search was performed with a parent mass error tolerance of 50 ppm and a fragment mass error tolerance of 0.5 Da. The maximum number for both missed cleavages as well as post-translational modifications for one proteolytic fragment was set to 3. Non-specific cleavage was allowed for both protein termini.

Search results of each MS/MS dataset from proteolytic peptides of salmocins against the UniProt/SwissProt database confirmed the identity of each of the analysed salmocins (Table 11). The integrity of purified salmocins ScolEla, ScolElb and ScolE7 was further analysed by MS- based sequencing of protein termini using ISD and molecular mass determination methods, which confirmed that all salmocin proteins were intact upon plant expression. Post-translational modifications observed were restricted to cleavage of N-terminal methionine in case of ScolE2, ScolE7, ScolEla and ScolElb and N-terminal acetylation for ScolE7 and ScolEla (Table 11). Table 11. Identity and integrity studies on plant-produced salmocins. MALDI-TOF/TOF mass spectrometry analysis of salmocin-containing TSP extracts of N. benthamiana or purified salmocins by peptide mass fingerprinting or by sequencing of protein termini by in-source decay and molecular mass determination, respectively. Proteases used for generation of peptide fragments are indicated. The identity of salmocins was confirmed by searching MS/MS datasets obtained against NCBI non-redundant database. Obtained molecular masses indicate that the proteins were intact. ND, not detected; PTM post-translational modification; aa, amino acid sequence. Example 13: Identification of salmocins ScolEIc, ScolEld ScolEle and ScolMa

As it was shown in Example 10, two pore-forming salmocins ScolEla and ScolElb demonstrated the highest and broadest antimicrobial activity against all tested Salmonella strains. To identify other salmocins to control Salmonella, we performed a homology search in NCBI database for Salmonella proteins similar to ScolEla and ScolElb but different to colicins in the N-terminal part. This search revealed three new sequences, which we called ScolEIc (SEQ ID NO: 25), ScolEld (SEQ ID NO: 26) and ScolEle (SEQ ID NO: 27) (Table 12). The CLUSTAL Omega alignment of these sequences is shown in Fig. 26.

We also searched for Salmonella proteins similar to colicin M in order to have another functional domain expressing antimicrobial activity, which is not related to nuclease (as this often creates the need for co-expression of immunity proteins and most of these proteins were not easy to purify). This search resulted in ScolMa sequence (SEQ ID NO: 28) (Table 12).

Table12. List of Salmonella bacteriocins (salmocins) used in examples.

Example 14: Plasmid constructs for salmocins ScolEIc, ScolEld ScolEle and ScolMa

Amino acid sequences of salmocins ScolEIc, ScolEld, ScolEle and ScolMa were retrieved from GenBank; corresponding nucleotide sequences with codon usage optimized for Nicotiana benthamiana were synthesized by Thermo Fisher Scientific Inc. SEQ ID NO: 29 encoded ScolEIc, SEQ ID NO: 30 encoded ScolEld, SEQ ID NO: 31 encoded ScolEle, and SEQ ID NO: 32 encoded ScolMa.

Salmocin coding sequences were inserted into TMV-based assembled viral vector pNMD035 (described in detail in WO2012/019660) resulting in plasmid constructs depicted in Fig. 28.

Example 15: Expression screen for salmocins ScolEIc, ScolEld ScolEle and ScolMa

Salmocin expression screen was performed as described in Example 2. The accumulation of salmocins ScolEIc, ScolEld and ScolMa in Nicotiana benthamiana leaves was high. In contrast, the expression of ScolEle was poor (Fig. 29). Example 16: Salmocin activity screen for ScolEIc, ScolEld ScolEle and ScolMa

We compared the antimicrobial activity of plant-made recombinant salmocins ScolEIc, ScolEld, ScolEle and ScolMa against ScolEla and ScolElb. For this comparison, we used 10 Salmonella enterica ssp. enterica strains (Table 13). The evaluation of antimicrobial activity in plant extracts containing salmocins was performed using a radial diffusion spot-on-lawn assay as described in Example 6. The extract from untransfected plant tissue (Wt) was used as a negative control. All tested new salmocins demonstrated significant antimicrobial activities, although they were not superior of ScolEla and ScolEb (Table 13).

Based on the expression and antimicrobial activity levels, we selected ScolEld and ScolMa for generation of ethanol-inducible stable transgenic Nicotiana benthamiana hosts.

Table 13. Antimicrobial activity of salmocins ScolEIc, ScolEld ScolEle and ScolMa against selected Salmonella enterica ssp. enterica strains.

Example 17: Production of salmocins ScolEld and ScolMa in stable transgenic hosts.

N. benthamiana was transformed by Agrobacterium-medi atedi leaf disk transformation using vectors for EtOH-inducible transgene expression (pNMD49621 for ScolEld and pNMD49632 for ScolMa, Fig. 30). The ethanol induction of detached leaves of TO generation transgenic plants for salmocin expression was performed. These experiments were done as described in Example 8. For 169 TO transgenic lines transformed with pNMD49621 construct which were used for analysis, 126 lines were positive for EtOH-inducible SalEld expression (Fig. 31).

Example 18: Antimicrobial activity of salmocins ScolElb, ScolEld and ScolMa: comparison.

Antimicrobial activities of plant-made recombinant salmocins ScolElb, ScolEld and ScolMa were analyzed as described in Example 3. In these experiments, we tested the plant extracts containing recombinant salmocins against 36 strains of 33 different serotypes of S. enterica ssp. enterica using radial diffusion assay via spot-on-lawn-method. Bacterial strains are listed in Tables 5A and 5B.

For semi-quantitative comparison, we represented relative antimicrobial activity of recombinant salmocins in arbitrary units (AU), calculated as a dilution factor for the highest dilution of protein extract causing a detectable clearing effect in the radial diffusion assay. Concentration of salmocin proteins in TSP extracts was evaluated by visual comparison with BSA standard on Coomassie-stained polyacrylamide gels after protein electrophoresis. Specific antimicrobial activity was calculated in arbitrary units (AU) per pg of recombinant salmocin as an average of 3 independent experiments.

We have found that ScolEld has an antibacterial activity spectrum very similar to ScolElb, although in most cases the overall activity of ScolEld was lower ,and in just few cases it was comparable with ScolElb (Figs. 32A and 32B).

Antimicrobial activity of ScolMa was lower than ScolEI b for 20 strains, comparable for 10 strains and higher for 6 strains (Figs. 32A and 32B). Based on different (complementary) antimicrobial activity pattern and distinct antimicrobial activity mechanism, ScolMa could be considered as a good candidate for salmocin cocktails in a mixture with ScolEI salmocins.

Example 19: Stability of salmocins ScolEI a, ScolEI b and ScolEI d upon the storage

For stability evaluation, purified salmocin protein samples were stored as dry lyophilized powder and as a solution at 4°C and room temperature (20-25°C). Protein stability was assessed on the basis of antimicrobial activity for 93 days (ScolEla), 387 days (ScolElb) and 200 days (ScolEld). Antimicrobial activity against susceptible bacterium was evaluated by radial diffusion assay. All salmocins were tested with the same bacterial strain: Salmonella Typhimurium ATCC® 14028.

Lyophilized protein samples were resuspended in distilled water (0.2-0.4 mg/ml). Soluble protein concentration was measured for each sample using Bradford assay. For radial diffusion assay, serial 1 :2 dilutions of proteins solubilized in PBS buffer were made. 5 pL of protein dilutions (1-1.8 pg of protein before dilution), were spotted on soft agar plates with susceptible bacterial strain. Residual activity of salmocins was evaluated after overnight incubation of plates. Antimicrobial activity was evaluated as specific activity units (AU) - highest dilution giving a difference to non-affected bacterial growth area determined by visual inspection of plates for bacterial growth inhibition by holding the plate in front of a light source. Highest dilution with growth inhibition was recorded as an activity in AU/mg of salmocins.

All three salmocins remained stable throughout the whole study period when stored as a dry powder either at room temperature or at 4°C (Fig. 33A-C). In the case of solution, all three salmocins have lost their activities at room temperature within one week (Fig. 33A-C). When salmocin solution was stored at 4°C, ScolEld was the most stable one: it retained substantial antimicrobial activity after 183 days of storage (Fig. 33C).

Example 20: Identification of salmocins ScoIMb and ScolMc

To indentify additional salmocins with phosphatase M activity, we performed a homology search in NCBI database for Salmonella proteins similar to ScolMa (SEQ ID NO: 28). As a result of this search, we identified two proteins: ScoIMb (SUF52254.1 ; SEQ ID NO: 33) and ScolMc (OIN38022.1; SEQ ID NO: 34). The CLUSTAL Omega (1.2.4) multiple sequence alignment of ScolMa, ScoIMb and ScolMc sequences is shown in Fig. 34. All three sequences have a high (97-98%) degree of identity with only few amino acid mismatches.

Example 21: Plasmid constructs and expression screen for salmocins ScoIMb and ScolMc

ScoIMb and ScolMc encoding nucleotide sequences with codon usage optimized for Nicotiana benthamiana were synthesized by Thermo Fisher Scientific Inc. SEQ ID NO: 35 encoded ScoIMb, and SEQ ID NO: 36 encoded ScolMc.

Salmocin coding sequences were inserted into TMV-based assembled viral vector pNMD035 (described in detail in WO2012/019660) resulting in plasmid constructs depicted in Fig. 35.

ScoIMb and ScolMc expression screen was performed as described in Example 2. At 7 dpi, plant tissue necrosis developed, suggesting earlier harvesting time points to be preferrable. ScoIMb and ScolMc recombinant proteins accumulated in Nicotiana benthamiana leaves at comparable levels (Fig. 36). Based on visual evaluation by comparison with BSA standard on Coomassie-stained polyacrylamide gels after protein electrophoresis, ScoIMb expression yield was found to be about 2.6 mg/g FW plant material or 30% TSP. For ScolMc, the expression yield was 2.3 mg/g FW or 37.5% TSP. For both these proteins, it was higher than for ScolMa (1.6 mg/g FW or 18.8% TSP). Example 22: Antimicrobial activity of salmocins ScolMa, ScoIMb and ScolMc: comparison

For comparison, antimicrobial activities of plant-made recombinant salmocins ScolMa, ScoIMb and ScolMc were analyzed as described in Example 3. We tested recombinant-salmocin- containing plant extracts against 10 strains of 8 different serotypes of S. enterica ssp. enterica using radial diffusion assay via spot-on-lawn-method. Bacterial strains are listed in Tables 5A and 5B.

For semi-quantitative comparison, we represented relative antimicrobial activity of recombinant salmocins in arbitrary units (AU), calculated as a dilution factor for the highest dilution of protein extract causing a detectable clearing effect in the radial diffusion assay. Concentration of salmocin proteins in TSP extracts was evaluated by visual comparison with BSA standard on Coomassie-stained polyacrylamide gels after protein electrophoresis. Specific antimicrobial activity was calculated in arbitrary units (AU) per pg of recombinant salmocin as an average of 3 independent experiments.

All three tested salmocins had very similar antimicrobial pattern: they were active against Typhimurium, Javiana, Muenchen, Heidelberg, Dublin and Abony strains, and exerted no or very low activity against Enteritidis and Newport strains (Fig. 37).

Example 23: Determination of Minimum Inhibitory Concentrations (MICs) of salmocins ScolElb, ScolE7, ScolEld, ScoIMb and ScolMc

The Minimum Inhibitory Concentration (MIC) is defined as the lowest concentration of bacteriocin, which prevents visible growth of corresponding bacterial strain. We determined MICs of ScolElb, ScolE7, ScolEld, ScoIMb and ScolMc salmocins by agar dilution method using MHB medium (Muller-Hinton Bouillon; Carl Roth GmbH& Co. KG, Karlsruhe, Germany). The tested Salmonella strains (Enteritidis ATCC13076, Typhimurium ATCC14028, Typhimurium ATCC13311 and Dublin ATCC14028) were streaked for single colony onto a MHB agar plate (MHB medium supplemented 0.75% (w/v) agar, bacteriology grade (AppliChem GmbH, Darmstadt, Germany)) and the plate was incubated for at least 16h at 37°C. The main culture was prepared by picking 8 equally sized colonies for inoculation of 4 ml of MHB medium, which was further incubated for 2-3h at 37°C and 180rpm until an OD600 of about 0.2 was reached. The bacterial culture was pre-diluted to OD600=0.02 followed by further dilution to potential 2x10 6 cfu/ml.

For evaluation of the of the MIC, the tested lyophilized purified salmocin proteins were resuspended in PBS buffer (phosphate buffered saline (137 mM NaCI, 2.7 mM KCI, 10 mM Na 2 HP0 4 , 2 mM KH2PO4)) supplemented with 0.1 mg/ml BSA (AppliChem GmbH, Darmstadt, Germany). The protein concentration was adjusted to the appropriate amount (100 fold higher than the concentration which will be tested) and 11 further 1 : 1 protein dilutions in PBS + 0.1 mg/ml BSA were prepared. The different salmocin containing MHB agar plates were prepared using 6- well sterile cell culture plates (TPP Techno Plastics Products AG, Trasadingen, Switzerland) after melting of the MHB agar, dividing into 5ml aliquots and cooling of the agar to 50°C. After addition of 50 pi salmocin/5ml MHB agar the solution was filled into one well of the 6-well plate. After cooling of the plates to room temperature, 4 spots of 5mI diluted Salmonella culture was dropped in each well. The drops were dried and the plates were incubated for ~16h at 37°C. After incubation, the plates were checked for bacterial growth at different concentrations. The MIC is defined as the lowest concentration of the antimicrobial substance that inhibits visible growth of the test microbe.

Table 14. Minimum inhibitory concentrations (MICs) of ScolElb, ScolE7, ScolEld, ScoIMb and ScolMc against selected Salmonella strains. n.d.- not determined

MICs determined for individual salmocins show certain batch-to- batch variation, which may be related to the batch quality, i.e. the presence of proteinaceous impurities or proteolytic degradation products. In our previous experiments, ScolElb and ScolEld showed the broadest antimicrobial activity (Example 3); they have also very low MICs for the tested strains. SalElb has a broader high activity as one can also detect a low MIC for the second tested Typhimurium strain. ScolMc has a higher activity on the tested ATCC14028 strains in comparison to ScoIMb. The lowest MIC (<0.98 ng/ml) was surprisingly detected for ScolMc, see e.g. data with the Typhimurium strain ATCC13311.

Example 24: Purification of ScolMa protein

Salmocin ScolMa containing Nicotiana benthamiana leaf biomass was produced using Agrobacterium-medi atedi delivery of TMV-based viral vector pNMD47730 (Fig. 28). Liquid culture of Agrobacterium tumefaciens strain ICF320 carrying pNMD477390 plasmid was inoculated in LBS medium from glycerol stock and incubated with shaking at 28°C overnight. Agrobacterium overnight culture was adjusted to OD6oo=1.3 and further diluted 1:100 with LBS medium. Leaves of 4-6-week old N. benthamiana plants were infiltrated using the needleless syringe. At 5 dpi, plant material was harvested, frozen with liquid nitrogen, ground using the mortar and the pestle and further stored at -80°C.

For ScolMa protein isolation, we tested various purification strategies. Extraction of recombinant protein of interest from N. benthamiana leaf biomass at pH 4.0 - 5.5 is preferable as in this pH range the RUBISCO, the most abundant plant host protein, is precipitated and can be further easily removed together with other solids. Unfortunately, at pH 4.0 - 5.5 the ScolMa extraction yield was very poor. The ScolMa yield was much higher at pH 6.0, although the resulting extracts contained high amounts of RUBISCO as well. Such extracts were further subjected to column chromatography purification using various approaches. All protein purification experiments were performed using the AKTA™ Pure chromatography system (Cytiva Europe GmbH, Freiburg im Breisgau, Germany).

For example, we tested Hydrophobic Interaction Chromatography (HIC) using HiTrap™ Phenyl FF (LS) and HiTrap™ Butyl FF resins (Cytiva Europe GmbH, Freiburg im Breisgau, Germany). In both cases, the buffer consisting of 20 mM citrate (pH6.0), 20 mM NaH 2 P0 4 and 2 M NaCI was used for extraction. This buffer was also used for column equilibration and for column washing after the sample loading. The elution was performed using the same buffer with 0-100% NaCI gradient. For both resins, no efficient separation between ScolMa and host proteins was achieved (Fig. 38A and 38B).

Other numerous purification attempts using HiTrap™ Capto™ MMC, HiTrap™ Capto™ Adhere, HiTrap™ Capto™ Q, HiTrap™ Capto™ Adhere, HiTrap™ Capto™ MMC, HiTrap™ Capto™ MMC, HiTrap™ Capto™ S, HiTrap™ Capto™ MMC and HiTrap™ Capto™ DEAE resins (all Cytiva Europe GmbH, Freiburg im Breisgau, Germany) did not result in satisfying ScolMa yield and purity. The outcome of these experiments is summarized in the Table 15. As an example, Fig. 39 shows unsuccessful result of ScolMa purification using HiT rap™ Capto™ DEAE resin. 20 mM Tris-HCI (pH 8.0), 0.05% Tween-80 was used for the extraction. The column was washed with 20 mM Tris-HCI (pH 8.0). The elution was performed with 0-100% linear gradient of elution buffer containing 20 mM Tris-HCI (pH 8.0), 1 M NaCI. Although ScolMa extraction was efficient, the protein did not bind properly to the resin; no separation of RUBISCO was achieved as well.

Table 15. Summary of approaches used for ScolMa purification.

CV: column volume

Example 25: Purification of ScoIMb and ScolMc proteins

We tested the extraction of ScolMa, ScoIMb and ScolMc proteins from Nicotiana benthamiana leaf biomass using the buffers with pH 4.0, pH 5.0 and pH 7.0. Leaf biomass containing recombinant proteins of interest was produced as described in Example 26. pNMD47730 (Fig. 28), pNMD51280 and pNMD51290 (Fig. 35) constructs were used for the expression of ScolMa, ScoIMb and ScolMc proteins, respectively. For the extraction, we tested following buffers: 1) 50 mM HEPES (pH 7.0), 10 mM K-acetate, 5 mM Mg-acetate, 10% glycerol, 0.05% Tween-20, 300 mM NaCI; 2) 15 mM Na-acetate (pH 5.0), 20 mM NaH 2 P0 4 , 30 mM NaCI, 0.05% Tween-80, and 3) 15 mM Na-acetate (pH 4.0), 20 mM NaH 2 P0 4 , 30 mM NaCI, 0.05% Tween-80.

The extraction was performed at 4°C or at the room temperature (approx. 22°C). For the extraction at 4°C, frozen ground plant material was thoroughly mixed with the extraction buffer and incubated for 30 min on ice. After 10 min centrifugation at 13000 rpm at4°C, the supernatant was transferred into the new tube (repeated twice). Resulting supernatant was analyzed using SDS-PAGE (15% gel). For the extraction at room temperature, frozen ground plant material was thoroughly mixed with the extraction buffer and incubated for 15 min at room temperature. After 5 min centrifugation at 7100 g at 22°C, the supernatant was transferred into the new tube and centrifuged again for 20 min at 7100 g at 22°C. Resulting supernatant was analyzed using SDS- PAGE (15% gel). Figures 40A and 40B show SDS-PAGE analysis of the extraction efficiency at 4°C and at the room temperature, respectively. The recovery of ScolMa was efficient for the buffer of pH 7.0 at both temperatures. The recovery of this protein was significantly less efficient when the buffers with pH 5.0 and pH 4.0 were used. With these buffers, the ScolMa yield was particularly low at room temperature indicating protein instability at such conditions (Fig. 40A). In contrast, ScoIMb and ScolMc had much higher yield with all three buffers tested and at both temperatures (Fig. 40A and 40B).

ScoIMb was efficiently purified using HiTrap™ Capto™ MMC resin. N. benthamiana leaf material was extracted with 5 volumes of the extraction buffer (15 mM Na-acetate (pH 4.0), 20 mM NaH2PC>4, 30 mM NaCI, 0.05% Tween-80) at room temperature. The same buffer was used for the column equilibration and column washing. The protein elution was performed using a linear 0-100% gradient of the elution buffer consisting of 10 mM Na-acetate (pH 7.5), 20 mM NaH2PC>4, 50 mM NaCI. Protein samples were analyzed using SDS-PAGE (Fig. 41). Elution fraction F18 was highly enriched with ScoIMb protein.

ScoIMb was also efficiently purified using HiTrap™ Phenyl FF (LS) resin. Plant biomass was extracted with 5 volumes of extraction buffer (15 mM Na-acetate (pH 4.0), 20 mM NaH 2 P0 4 , 30 mM NaCI, 0.05% Tween-80) at room temperature. Filtered plant extract was mixed with 50 mM Na-acetate (pH 4.0), 30 mM NaCI, 3 M (NH 4 ) 2 S0 4 to get final 1 M (NH 4 ) 2 S0 4 concentration. The column was equilibrated and washed with the buffer containing 15 mM Na-acetate (pH 4.0), 20 mM NaH 2 P0 4 , 30 mM NaCI, 0.05% Tween 80, 1 M (NH 4 ) 2 S0 4 . A linear 0-100% gradient of the elution buffer containing 10 mM Na-acetate (pH 7.5), 20 mM NaH 2 P0 4 , 50 mM NaCI was used for protein elution. Protein samples were analyzed using SDS-PAGE (Fig. 42). Elution fractions F11-F14 were most efficiently enriched with purified ScoIMb protein.

ScolMc was purified using HiTrap™ Capto™ MMC resin according to the same protocol as in case of ScoIMb. SDS-PAGE analysis of protein purification fractions is shown in Fig. 43. Elution fraction F18 was highly enriched with ScoIMb protein.

Again, ScolMc was purified with HiTrap™ Phenyl FF (LS) resin using the same protocol as for ScolMc. Fig. 43 shows SDS-PAGE analysis of protein purification fractions. Elution fractions F10-F15 were most highly enriched with ScolMc.

Amino acid and nucleotide sequences SEQ ID NO: 1 Amino acid sequence of salmocin ScolE2

MSGGDGIGHN SGAHSTGGVN GSSSGRGGSS SGGGNNPNSG PGWGTTHTPD GHDIHNYNPG EFGGGGHKPG GNGGNHSGGT GDGQPPGAAM AFGFPALVPA GAGGLAVTVS GDALAAAIAD VLAVLKGPFK FGAWGIALYG ILPTEIAKDD PRMMSKIVTS LPADAVTESP VSSLPLDQAT VSVTKRVTDV VKDERQHIAV VAGVPASIPV VDAKPTTHPG VFSVSVPGLP DLQVSTVKNA PAMTALPRGV TDEKDRTVHP AGFTFGGSSH EAVIRFPKES GQAPVYVSVT DVLTPEQVKQ RQDEENRRQQ EWDATHPVEV AERNYRLASD ELNRANVDVA GKQERQIQAA QAVAARKGEL DAANKTFADA KEEIKKFERF AHDPMAGGHR MWQMAGLKAQ RAQNEVNQKQ AEFNAAEKEK ADADAALNVA LESRKQKEQK AKDASDKLDK ENKRNHPGKA TGKGQPVGDK WLEDAGKEAG APVPDRIADK LRDKEFKNFD DFRKKFWEEV SKDPELSKQF IPGNKKRMSQ GLAPRARNKD TVGGRRSFEL HHDKPISQDG GVYDMDNIRV TTPKLHIDIH RGK

SEQ ID NO: 2 Amino acid sequence of salmocin ScolE3

MSGGDGRGHN TGAHSTSGNI NGGPTGLGVS GGASDGSGWS SENNPWGGGS GSGIHWGGGS GRGNGGGNGN SGGGSGTGGN LSAVAAPVAF GFPALSTPGA GGLAVSISAS ELSAAIAGII AKLKKVNLKF TPFGVVLSSL IPSEIAKDDP NMMSKIVTSL PADDITESPV SSLPLDKATV NVNVRVVDDV KDERQNISVV SGVPMSVPVV DAKPTERPGV FTASIPGAPV LNISVNNSTP AVQTLSPGVT NNTDKDVRPA GFTQGGNTRD AVIRFPKDSG HNAVYVSVSD VLSPDQVKQR QDEENRRQQE WDATHPVEVA EREYENARAE LEAENKNVHS LQVALDGLKN TAEGLALSDA GRHPLTSSES RFVAVPGYSG GGVHFDATAT VDSRDRLNSL LSLGGAAYVN NVLELGEVSA PTEDGLKVGN AIKNAMIEVY DKLRQRLITR QNEINHAQVS LNTAIESRNK KEEKKRSAEN KLNEERNKPR KGTKDYGHDY HPAPETEEIK GLGDIKKGIP KTPKQNGGGK RKRWIGDKGR KIYEWDSQHG ELEGYRASDG QHLGSFDPKT GKQLKGPDPK RNIKKYL

SEQ ID NO: 3 Amino acid sequence of salmocin ScolE7

MSGGDGIGHN SGAHSTGGVN GSSSGSGGSS SGSGNNPNSG PGWGTTHTPN GDIHNYNPGE FGGGGNKPGG HGGNSGNHDG SSGNGQPSAA PMAFGFPALA PAGAGSLAVT VSGEALSAAI ADIFAALKGP FKFGAWGIAL YGIMPTEIAK DDPNMMSKIM TSLPADTVTD TPVSSLPLDQ ATVSVTKRVA DVVKDERQHI AVVAGVPMSV PVVDAKPTTR PGIFSATVPG LPALEVSTGK SIPASTALPR GITEDKDRTE HPAGFTFGGS SHDAVIRFPK ESGQAPVYVS VTDVLTPEQV KQRQDEESRR QQEWDATHPV EVAERNYRLA SDELNRVNAD VAGKQERQAQ AGQAVAARKG ELDAANKTFA DAKEEIKKFE HFARDPMAGG HRMWQMAGLK AQRAQNEVNQ KQAEFDAAEK EKADADAALN AALESRKQKE QKAKDTKERL DKENKRNQPG KATGKGQPVS DKWLEDAGKE SGSPIPDSIA DKLRDKEFRN FDDFRKKFWE EVSKDPELSK QFIKGNRDRM QVGKAPKSRK KDAAGKRTSF ELHHDKPVSQ DGGVYDMDNL RITTPKRHID IHRGQ

SEQ ID NO: 4 Amino acid sequence of salmocin ScolEla

MADNTIAYYE DGVPHSADGK VVIVIDGKMP VDTGAGGTGG GGGGKVGGTS ESSAAIHATA KWSTAQLKKT LAEKAARERE TAAAMAAAKA KRDALTQHLK DIVNDVLRHN ASRTPSATDL AHANNMAMQA EAQRLGRAKA EEKARKEAEA AELAFQEAER QREEAVRQLA ETERQLKQAE EEKRLAALSD EARAVENARK NLDTAKSELA NVDSDIERQR SQLSSLDADV KKAEENLRLT MRIKGRIGRK MQAKSQAIVD DKKRIYSDAE NVLNTMTVNR NLKAQQVTDA ENELKVAIDN LNSSQMKNAV DATVSFYQTL TEKYGEKYSL IAQELAEKSK GKKIGNVDEA LAAFEKYKDV LDKKFSKADR DAIVNALKSF NYDDWAKHLD QFAKYLKITG HVSFGYDVVS DVLKASETGD WKPLFITLEQ KVLDTGMSYL VVLMFSLIAG TTLGIFGVAI ITAILCSFVD KYILNALNDA LGI

SEQ ID NO: 5 Amino acid sequence of salmocin ScolEI b

MSDNTIAYYE DGVPYSADGQ VVIVIDGKMP VDTGAGGTGG GGGGKVGGTS ESSAAIHATA KWSKAQLQKS LEEKAARERE TAAAMAAAKA KRDALTQHLK DIVNDVLRYN ASRTPSATDL AHANNMAMQA EAQRLGRAKA EEKARKEAEA AEKSLQEAER QREEAARQRA EAERQLKQAE AEEKRLAALS EEARAVEITQ KNLAAAQSEL SKMDGEIKSL NVRLSTSIHA RDAEMNSLSG KRNELAQESA KYKELDELVK KLEPRANDPL QNRPFFDATS RRARAGDTLA EKQKEVTASE TRINELNTEI NQVRGAISQA NNNRNLKVQQ VTETENALKV AIDNLNSSQM KNAVDATVSF YQTLTAKYGE KYSLIAQELA EQSKGKKISN VDEALAAFEK YKDVLDKKFS KADRDAIVNA LKSVDYADWA KHLDQFSRYL KISGRVSTGY DIYSDIRKGM DTNDWRPLFL TLEKLAVDAG VGYIVALGFS VIASTALGIW GVAIITGVIC SFVDKKDLEK LNEALGI SEQ ID NO: 6 Amino acid sequence of salmocin Spst

MFIKSGGNLT IRTFGGLGVG GDFDSDTWRR RSTDSWVPYS EYIAIECIVA PNQLYQLLTD VAQVETVAAQ LAQVGYQYLQ GRLRLVREDG SCTDFSGKAM LDNLLNKSKD ILDLDFLHVS EGYRSEAYWP GQSSGITIGY GVDIGHQSEE GLHKWGVPQS IIDKIKDYFG ITGEAANTLL KGLKDKTLGL SDREIKQFSD IVKKQATADI INKYNAATKG ITFDKIPYNT RTAIIDLFYQ YSAPKGAPKS WGFIINNDWN GFYNELMNFG DKHTTRRERE AALVLSDIVN NQYIYK

SEQ ID NO: 7 Amino acid sequence of salmocin ScolE2 immunity protein SlmmE2

MELKKSISDY TEAEFKKIIE AIINCEGDEK TQDDNLEFFI RVTEYPSGSD LIYYPEGDND GSTEAIIKEI KEWRAANGKP GFKQADSSYF VSFDYRDGDW

SEQ ID NO: 8 Amino acid sequence of salmocin ScolE7 immunity protein SlmmE7

MELKNSISDY TEAEFIEFMK EIDKENVAET DDKLDLLLNH FEQVTEHPDG TDLIYYAASD AESTPEAITK KIKEWRAANG KPGFKQG

SEQ ID NO: 9 Amino acid sequence of colicin S4

MAKELSVYGP TAGESMGGTG ANLNQQGGNN NSNSGVHWGG GSGSGNGGRE HGSQTGWGWS KTNNPDVPPY VDDNGQVRIT ITNGLVKTPV YGVPGAGGNS DVQGGYIPEN PNDEVARKWD KNNLPREIDV SIDGFKYRVT LNDNGRAIGI LRTGVRPYVG SEKAKAGIME KINHKTPEEI YEALGFNKDE SQRQEKAKQQ AEDAWDRLPP NVRKFDVDVE QFHYLVVLDD YGNVLSVTRT GVRPYVGSEK AKAGIMDKVD HKTPEEIYEA LGFNNEEPQR QNQAKKAAYD VFYSFSMNRD RIQSDVLNKA AEVISDIGNK VGDYLGDAYK SLAREIADDV KNFQGKTIRS YDDAMASLNK VLSNPGFKFN RADSDALANV WRSIDAQDMA NKLGNISKAF KFADVVMKVE KVREKSIEGY ETGNWGPLML EVESWVLSGI ASAVALGVFS ATLGAYALSL GAPAIAVGIV GILLAAVVGA LLDDKFADAL NKEIIKPAH

SEQ ID NO: 10 Amino acid sequence of colicin 5

MDKVTDNSPD VESTESTEGS FPTVGVDTGD TITATLATGT ENVGGGGGAF GGASESSAAI HATAKWSTAQ LKKHQAEQAA RAAAAEAALA KAKSQRDALT QRLKDIVNDA LRANAARSPS VTDLAHANNM AMQAEAERLR LAKAEQKARE EAEAAEKALR EAERQRDEIA RQQAETAHLL AMAEAAEAEK NRQDSLDEEH RAVEVAEKKL AEAKAELAKA ESDVQSKQAI VSRVAGELEN AQKSVDVKVT GFPGWRDVQK KLERQLQDKK NEYSSVTNAL NSAVSIRDAK KTDVQNAEIK LKEAKDALEK SQVKDSVDTM VGFYQYITEQ YGEKYSRIAQ DLAEKAKGSK FSSVDEALAA FEKYKNVLDK KISKVDRDAI FNALESVNYD ELSKNLTKIS KSLKITSRVS FLYDVGSDFK NAIETGNWRP LFVTLEKSAV DVGVAKIVAL MFSFIVGVPL GFWGIAIVTG IVSSYIGDDE LNKLNELLGI

SEQ ID NO: 11 Amino acid sequence of colicin 10

MDKVTDNSPD VESTESTEGS FPTVGVDTGD TITATLATGT ENVGGGGGAF GGASESSAAI HATAKWSTAQ LKKHQAEQAA RAAAAEAALA KAKSQRDALT QRLKDIVNDA LRANAARSPS VTDLAHANNM AMQAEAERLR LAKAEQKARE EAEAAEKALR EAERQRDEIA RQQAETAHLL AMAEAAEAEK NRQDSLDEEH RAVEVAEKKL AEAKAELAKA ESDVQSKQAI VSRVAGELEN AQKSVDVKVT GFPGWRDVQK KLERQLQDKK NEYSSVTNAL NSAVSIRDAK KTEVQNAEIK LKEAKDALEK SQVKDSVDTM VGFYQYITEQ YGEKYSRIAQ DLAEKAKGSK FNSVDEALAA FEKYKNVLDK KFSKVDRDDI FNALESITYD EWAKHLEKIS RALKVTGYLS FGYDVWDGTL KGLKTGDWKP LFVTLEKSAV DFGVAKIVAL MFSFIVGAPL GFWGIAIITG IVSSYIGDDE LNKLNELLGI

SEQ ID NO: 12 Amino acid sequence of colicin la MSDPVRITNP GAESLGYDSD GHEIMAVDIY VNPPRVDVFH GTPPAWSSFG NKTIWGGNEW VDDSPTRSDI EKRDKEITAY KNTLSAQQKE NENKRTEAGK RLSAAIAARE KDENTLKTLR AGNADAADIT RQEFRLLQAE LREYGFRTEI AGYDALRLHT ESRMLFADAD SLRISPREAR SLIEQAEKRQ KDAQNADKKA ADMLAEYERR KGILDTRLSE LEKNGGAALA VLDAQQARLL GQQTRNDRAI SEARNKLSSV TESLNTARNA LTRAEQQLTQ QKNTPDGKTI VSPEKFPGRS STNHSIVVSG DPRFAGTIKI TTSAVIDNRA NLNYLLTHSG LDYKRNILND RNPVVTEDVE GDKKIYNAEV AEWDKLRQRL LDARNKITSA ESAVNSARNN LSARTNEQKH ANDALNALLK EKENIRNQLA GINQKIAEEK RKQDELKATK DAINFTTEFL KSVSEKYGAK AEQLAREMAG QAKGKKIRNV EEALKTYEKY RADINKKINA KDRAAIAAAL ESVKLSDISS NLNRFSRGLG YAGKFTSLAD WITEFGKAVR TENWRPLFVK TETIIAGNAA TALVALVFSI LTGSALGIIG YGLLMAVTGA LIDESLVEKA NKFWGI

SEQ ID NO: 13 Amino acid sequence of colicin lb

MSDPVRITNP GAESLGYDSD GHEIMAVDIY VNPPRVDVFH GTPPAWSSFG NKTIWGGNEW VDDSPTRSDI EKRDKEITAY KNTLSAQQKE NENKRTEAGK RLSAAIAARE KDENTLKTLR AGNADAADIT RQEFRLLQAE LREYGFRTEI AGYDALRLHT ESRMLFADAD SLRISPREAR SLIEQAEKRQ KDAQNADKKA ADMLAEYERR KGILDTRLSE LEKNGGAALA VLDAQQARLL GQQTRNDRAI SEARNKLSSV TESLKTARNA LTRAEQQLTQ QKNTPDGKTI VSPEKFPGRS STNHSIVVSG DPRFAGTIKI TTSAVIDNRA NLNYLLTHSG LDYKRNILND RNPVVTEDVE GDKKIYNAEV AEWDKLRQRL LDARNKITSA ESAINSARNN VSARTNEQKH ANDALNALLK EKENIRSQLA DINQKIAEEK RKRDEINMVK DAIKLTSDFY RTIYDEFGKQ ASELAKELAS VSQGKQIKSV DDALNAFDKF RNNLNKKYNI QDRMAISKAL EAINQVHMAE NFKLFSKAFG FTGKVIERYD VAVELQKAVK TDNWRPFFVK LESLAAGRAA SAVTAWAFSV MLGTPVGILG FAIIMAAVSA LVNDKFIEQV NKLIGI

SEQ ID NO: 14 Amino acid sequence of colicin M

METLTVHAPS PSTNLPSYGN GAFSLSAPHV PGAGPLLVQV VYSFFQSPNM CLQALTQLED

YIKKHGASNP LTLQIISTNI GYFCNADRNL VLHPGISVYD AYHFAKPAPS QYDYRSMNMK

QMSGNVTTPI VALAHYLWGN GAERSVNIAN IGLKISPMKI NQIKDIIKSG VVGTFPVSTK

FTHATGDYNV ITGAYLGNIT LKTEGTLTIS ANGSWTYNGV VRSYDDKYDF NASTHRGIIG

ESLTRLGAMF SGKEYQILLP GEIHIKESGK R

SEQ ID NO: 15 Nucleotide sequence used for salmocin ScolE2 expression in examples atgtctggtggtgatggtatcggtcacaatagcggtgctcattctactggtggtgtgaac ggttcttcat ctggtaggggtggtagttcttcaggtggtggtaacaaccctaactctggtcctggttggg gtactactca tactcctgatggtcacgatatccacaactacaaccctggtgagtttggtggtggtggaca taagcctggt ggaaacggtggtaatcactctggtggtactggtgatggacaacctcctggtgctgctatg gcttttggtt tccctgctcttgttcctgctggtgctggtggtcttgctgttactgtttctggtgatgctc tggctgctgc aattgctgatgtgcttgctgttctgaagggacctttcaagtttggtgcttggggtatcgc tctgtacggt attcttcctaccgagatcgctaaggatgatccaaggatgatgagcaagatcgtgacctct ttgcctgctg atgctgtgactgagtctcctgtgtcatctctgcctcttgatcaggctactgtgagcgtta ccaagagggt taccgatgtggttaaggatgagaggcagcacattgctgttgttgctggtgtgcctgcttc tatccctgtt gttgatgctaagcctactacccaccctggtgtgttctctgtttctgttcctggtctgcct gatctgcagg tttcaactgtgaagaacgctcctgctatgactgctttgcctaggggtgttactgatgaga aggataggac tgttcaccctgctggtttcaccttcggtggttcttctcatgaggctgtgatcaggttccc taaagagtct ggtcaggctcctgtttacgtgtcagtgaccgatgttcttacccctgagcaggttaagcag agacaggatg aagagaatagaaggcagcaagagtgggatgctactcaccctgttgaagtggctgagagga attacaggct ggcttctgatgagctgaacagggctaatgtggatgtggctggtaagcaagagaggcagat tcaagctgct caagctgttgctgctagaaagggtgaactggatgctgctaacaagaccttcgctgatgct aaagaagaga tcaagaagttcgagaggttcgctcacgatcctatggctggtggacacagaatgtggcaaa tggctggtct taaggctcagagggctcagaatgaggttaaccagaaacaagctgagttcaacgctgctga gaaagaaaag gctgatgcagatgctgctctgaacgtggcacttgagtctaggaagcagaaagaacaaaag gcaaaggatg ctagcgataagctggataaggaaaacaagaggaaccaccctggaaaggctactggtaagg gtcaacctgt tggtgataagtggcttgaggatgctggtaaagaagctggagcacctgttccagataggat cgctgataag ctgagagataaggaattcaagaacttcgatgattttaggaagaagttctgggaagaggta aatttctagt ttttctccttcattttcttggttaggacccttttctctttttatttttttgagctttgat ctttctttaa actgatctattttttaattgattggttatggtgtaaatattacatagctttaactgataa tctgattact ttatttcgtgtgtctatgatgatgatgataactgcaggttagcaaggatcctgagctgag caagcagttc atccctggtaacaagaaaaggatgagccagggtcttgctcctagggctagaaacaaggat actgtgggtg gtagaagatccttcgagctgcatcacgataagccaatctctcaggatggtggtgtttacg atatggataa catcagggtgaccaccccaaagctgcacatcgatattcataggggaaagtaa

SEQ ID NO: 16 nucleotide sequence used for salmocin ScolE3 expression in examples atgtctggtggtgatggtaggggtcataataccggtgctcatagcaccagcggtaacatt aacggtggtc ctactggtcttggtgtgtcaggtggtgcttctgatggttctggttggtcctctgagaaca atccttgggg tggtggtagcggttctggtattcactggggaggtggaagtggtagaggtaatggtggtgg aaacggtaac agtggtggtggttctggaactggtggtaacctttctgctgttgctgctcctgttgctttc ggtttccctg ctctttctactcctggtgctggtggtttggctgtgtctatttctgcttctgagctgagcg ctgctatcgc tggtattatcgctaagctgaagaaggtgaacctgaagttcacccctttcggtgtggtgct gtcctctttg attcctagcgagatcgctaaggatgatcctaacatgatgagcaagatcgtgaccagcctg cctgctgatg atattaccgagtctcctgtgtcctctctgcctcttgataaggctactgtgaatgtgaacg tgagggtggt ggatgatgtgaaggatgagaggcagaacatcagcgttgtgtctggtgttcctatgtctgt gcctgttgtg gatgctaagcctactgaaaggcctggtgtgttcaccgcttctattccaggtgctcctgtg ctgaacatct ccgtgaacaattctacccctgctgtgcagactctttctcctggtgtgactaacaacaccg ataaggatgt taggcctgctggtttcactcagggtggtaataccagggatgctgtgatcaggttccctaa ggattctggt cacaacgcagtgtacgtgtccgtgtctgatgtgttgtctccagatcaggttaagcagagg caggatgaag agaatagaaggcagcaagagtgggatgctactcaccctgttgaagttgctgagagagagt acgagaacgc tagagctgaacttgaggctgaaaacaagaacgtgcacagccttcaggtggcacttgatgg tcttaagaat accgctgagggtctggctctttctgatgctggtagacatcctctgaccagcagcgagtct agatttgttg ctgtgcctggttactccggtggtggtgttcattttgatgctaccgctaccgtggatagca gggataggct taactctcttctgtctcttggtggtgctgcttacgtgaacaacgtgttggagcttggtga ggtgtcagct cctactgaggatggtttgaaggtgggaaacgctatcaagaacgctatgatcgaggtgtac gataagctga ggcagaggcttattaccaggcagaacgagatcaaccacgctcaggtgtcacttaacaccg ctatcgagtc taggaacaagaaagaggaaaagaagaggtccgcagagaacaagctgaacgaagagagaaa caagcctaga aagggtactaaggattacggacacgattaccatcctgctccagagactgaagaaatcaag ggtctgggtg atatcaagaagggtatccctaagacccctaagcagaacggtggtggtaagagaaagagat ggatcggaga taagggtagaaagatctacgagtgggatagccagcatggtgagcttgaaggtaaatttct agtttttctc cttcattttcttggttaggacccttttctctttttatttttttgagctttgatctttctt taaactgate tattttttaattgattggttatggtgtaaatattacatagctttaactgataatctgatt actttatttc gtgtgtctatgatgatgatgataactgcaggttatagggcttcagatggtcagcacctgg gaagctttga tcctaagactggtaagcagctgaagggtcctgatccaaagaggaacatcaagaagtacct ttaa

SEQ ID NO: 17 nucleotide sequence used for salmocin ScolE7 expression in examples atgtctggtggtgatggtatcggtcacaatagcggtgctcattctactggtggtgtgaac ggttcctctt ctggttctggtggaagctcatctggaagcggtaacaaccctaattctggtcctggttggg gtactactca tacccctaacggtgatatccacaactacaaccctggtgagtttggtggtggtggaaacaa gcctggtgga catggtggtaactctggtaaccacgatggtagctctggaaacggtcaaccttctgctgct cctatggctt ttggtttccctgctcttgctcctgctggtgctggttctcttgctgttactgtttctggtg aggctctgtc tgctgctatcgctgatattttcgctgctctgaagggacctttcaagttcggtgcttgggg tattgctctg tacggtattatgcctaccgagatcgctaaggatgatcctaacatgatgagcaagatcatg accagcctgc ctgetgatactgtgactgatactcctgtgtcctctctgcctcttgatcaggctactgtgt ctgtgactaa gagggttgcagatgtggtgaaggatgagaggcagcatattgctgttgttgctggtgtgcc tatgtctgtg cctgttgttgatgctaagcctaccactaggcctggtatcttctctgctactgttcctgga cttcctgctt tggaggtgtcaaccggtaagtctattcctgcttctaccgctctgcctaggggtattactg aggataagga taggactgagcaccctgctggtttcactttcggtggttcttctcacgatgctgtgatcag gttccctaaa gagtctggtcaggctccagtttacgtgtcagtgactgatgtgcttacccctgagcaggtt aagcagagac aggatgaagagtctagaaggcagcaagagtgggatgctactcatcctgttgaagtggctg agaggaacta caggcttgcttctgatgagctgaacagggtgaacgctgatgtggctggtaagcaagaaag acaagctcaa gctggacaggctgttgctgctagaaagggtgaacttgatgctgctaacaagaccttcgct gatgctaaag aagagatcaagaagttcgagcacttcgctagggatccaatggctggtggtcatagaatgt ggcagatggc tggtcttaaggctcagagggctcagaatgaggttaaccagaaacaagctgagttcgatgc tgcagagaaa gaaaaggctgatgctgatgcagctctgaacgctgctcttgaatctaggaagcagaaagag cagaaggcta aggataccaaagagaggctggataaggaaaacaagaggaatcagcctggtaaggctaccg gtaagggtca gccagtttctgataagtggcttgaggatgctggtaaagagagcggttctcctatccctga tagcattgct gataagcttagagataaggaattcagaaacttcgatgattttaggaagaagttctgggag gaagttagca aggatcctgagctgagcaagcagttcatcaagggtaacagagataggatgcaggtaaatt tctagttttt ctccttcattttcttggttaggacccttttctctttttatttttttgagctttgatcttt ctttaaactg atctattttttaattgattggttatggtgtaaatattacatagctttaactgataatctg attactttat ttcgtgtgtctatgatgatgatgataactgcaggttggaaaggctcctaagtccagaaag aaggatgctg ctggtaagaggacctctttcgagcttcatcacgataagcctgtgagccaggatggtggtg tttacgatat ggataacctgaggatcaecacccctaagaggcacatcgatattcataggggacagtaa

SEQ ID NO: 18 nucleotide sequence used for salmocin ScolEI a expression in examples atggctgataacaccattgcttactacgaggatggtgtgcctcacagcgctgatggtaag gtggtgattg tgatcgatggtaagatgcctgtggataccggtgctggtggtactggtggtggtggaggtg gtaaggttgg aggaacttctgaaagctctgctgctattcacgctaccgctaagtggtctaccgctcagct taagaaaacc ctggctgagaaggctgctagagagagagaaactgctgctgcaatggctgctgctaaggct aagagagatg ctcttacccagcacctgaaggatatcgtgaacgatgtgcttaggcacaacgcttctagga ccccttctgc tactgatettgetcacgctaacaacatggctatgcaggctgaagctcagagacttggtag agctaaggct gaggaaaaggctagaaaagaggctgaggctgctgagcttgctttccaagaagctgaaaga cagagggaag aggctgttagacagcttgctgaaactgagaggcagcttaagcaagctgaggaagagaaga ggcttgctgc tctttctgatgaggctagggctgttgagaacgctaggaagaatctggataccgcaaagtc cgagctggct aatgtggattctgatatcgagaggcagaggtcccagctgtcatctcttgatgctgatgtg aagaaggctg aagagaacctgaggctgaccatgaggattaagggtaggatcggtaggaagatgcaggcta agtcacaggc tatcgtggatgataagaaaaggatctactccgatgctgagaacgtgctgaataccatgac cgtgaatagg aacctgaaggctcagcaggttaccgatgcagagaatgagcttaaggtggcaatcgataac ctgaacagca gccagatgaagaacgctgtggatgctaccgtgtctttctaccagactctgaccgagaagt acggtgagaa gtacagccttatcgctcaagagctggcagagaagtccaagggtaagaaaatcggaaatgt ggatgaggct ctggctgcattcgagaagtataaggatgtgctggataagaagttcagcaaggctgatagg gatgctattg tgaacgctctgaagtccttcaactacgatgattgggctaagcacctggatcagttcgcta agtacctgaa gatcaccggtcacgtgagcttcggttacgatgttgtgtctgatgtgctgaaggctagcga gactggtgat tggaagcctctgttcattacccttgagcagaaggtgttggatactggtatgagctacctg gtggtgctga tgttctctcttattgctggaaccaccctgggaatcttcggtgtggctattattaccgcta tcctgtgcag cttcgtggataagtacatcctgaacgcactgaacgatgctctgggaatctaa

SEQ ID NO: 19 nucleotide sequence used for salmocin ScolEI b expression in examples atgagcgataacaccattgcttactacgaggatggtgtgccttacagcgctgatggtcaa gtggtgattg tgatcgatggtaagatgcctgtggataccggtgctggtggtactggtggtggtggaggtg gtaaggttgg aggaacttctgaaagctctgctgctattcacgctaccgctaagtggtctaaggctcagct tcagaagtcc ctggaagagaaggctgctagagagagagaaactgctgctgcaatggctgctgctaaggct aagagagatg ctcttacccagcacctgaaggatatcgtgaacgatgtgctgaggtacaacgcttctagga ctccttctgc taccgatcttgctcacgctaacaacatggctatgcaggctgaagctcagagacttggtag agctaaggct gaggaaaaggctagaaaagaggctgaggctgctgagaagtctcttcaagaagctgagaga cagagggaag aagctgctaggcaaagagctgaagcagagaggcaacttaagcaggcagaggctgaagaga agaggttggc tgctctttctgaagaggctagggcagttgagatcacccagaagaatcttgctgctgctca gagcgagctg tccaagatggatggtgagatcaagagccttaacgtgaggctgtctacctctatccatgct agggatgctg agatgaacagcctgtctggtaagaggaacgagctggctcaagagagcgctaagtacaaag aactggatga gctggtgaagaagcttgagcctagggctaatgatcctctgcagaacaggcctttcttcga tgctacatct agaagggcaagggctggtgatactttggctgagaagcagaaagaggtgaccgcttctgag actaggatca acgagcttaacaccgagatcaaccaggtgaggggtgctatttcacaggcaaacaacaata ggaacctgaa ggtgcagcaggttaccgagactgagaacgctcttaaggtggcaatcgataacctgaacag cagccagatg aagaacgctgtggatgctaccgtgtctttctaccagaccctgactgctaagtacggtgag aagtacagcc tgatcgctcaagaacttgctgagcagtccaagggtaagaaaatcagcaatgtggatgagg ctctggctgc attcgagaagtataaggatgtgctggataagaagttcagcaaggctgatagggatgcaat tgtgaacgct ctgaagtccgtggattacgctgattgggctaagcacctggatcagttcagcagatacctg aagatcagcg gtagggtgtcaaccggttacgatatctacagcgatatcagaaagggtatggataccaacg attggaggcc tctgttcctgacccttgagaagcttgctgttgatgctggtgtgggttacatcgtggctct tggtttctct gtgatcgcttctaccgctcttggtatttggggtgtggctattatcaccggtgtgatctgc agcttcgttg ataagaaggatttggagaagctgaacgaggcactgggaatctaa

SEQ ID NO: 20 nucleotide sequence used for salmocin Spst expression in examples atgttcatcaagagcggtggtaacctgaccatcaggacttttggtggtcttggtgtgggt ggtg atttcgatagcgatacttggagaagaaggtccaccgattcttgggtgccatacagcgagt acat tgctatcgagtgcatcgtggctcctaaccagctttaccagcttcttactgatgtggctca ggtg gaaactgtggctgctcaacttgctcaggttggataccagtatcttcagggtaggcttagg ctgg tgagagaggatggttcttgcaccgatttcagcggtaaggctatgctggataacctgctga acaa gagcaaggatattctggatctggatttcctgcacgtgagcgagggttataggtctgaagc ttat tggcctggtcagtcctctggtatcaccattggttacggtgtggatatcggtcaccagtct gaag agggacttcataagtggggtgtgcctcagagcatcatcgataagatcaaggattacttcg gtat taccggtgaggctgctaacacccttcttaagggtctgaaggataagaccctgggactgag cgat agagagatcaagcagttctccgatatcgtgaagaagcaggctaccgctgatatcatcaac aagt acaacgctgctaccaagggtatcacctttgataagatcccttacaacaccaggaccgcta tcat cgatctgttctaccagtacagcgctcctaagggtgctcctaagtcttggggtttcattat caac aacgattggaacggtttctacaacgagctgatgaacttcggtgataagcacaccaccaga agag agagggaagctgctctggttctgtctgatattgtgaacaaccagtacatctacaagtaa

SEQ ID NO: 21 nucleotide sequence used for salmocin ScolE2 immunity protein SlmmE2 expression in examples atggaactgaagaagtccatcagcgattacaccgaggctgagttcaagaagatcatcgag gcta tcatcaactgcgagggtgatgagaaaacccaggatgataaccttgagttcttcatcaggg tgac cgagtacccttctggtagcgatcttatctactaccctgagggtgataacgatggtagcac cgag gcaattatcaaagaaatcaaggaatggagggctgctaacggtaagcctggttttaagcaa gctt aa

SEQ ID NO: 22 nucleotide sequence used for salmocin ScolE7 immunity protein SlmmE7 expression in examples atggaactgaagaacagcatcagcgattacaccgaggctgagttcatcgagttcatgaaa gaaa tcgataaggaaaacgtggcagagactgatgataagctggatctgctgctgaaccacttcg agca ggttacagaacaccctgatggaaccgatctgatctactacgctgcttccgatgctgagtc tacc cctgaggctatcaccaagaaaatcaaagaatggagggctgctaacggtaagcctggtttt aagc aaggttaa

SEQ ID NO: 23 nucleotide sequence of binary TMV-based vector used for salmocin ScolEI a expression in examples catggctgataacaccattgettactacgaggatggtgtgcctcacagcgctgatggtaa ggtggtgatt gtgatcgatggtaagatgcctgtggataccggtgctggtggtactggtggtggtggaggt ggtaaggttg gaggaacttctgaaagctctgctgctattcacgctaccgctaagtggtctaccgctcagc ttaagaaaac cctggctgagaaggctgctagagagagagaaactgctgctgcaatggctgctgctaaggc taagagagat gctcttacccagcacctgaaggatatcgtgaacgatgtgcttaggcacaacgcttctagg accccttctg ctactgatcttgctcacgctaacaacatggctatgcaggctgaagctcagagacttggta gagctaaggc tgaggaaaaggctagaaaagaggctgaggctgctgagcttgctttccaagaagctgaaag acagagggaa gaggctgttagacagcttgctgaaactgagaggcagcttaagcaagctgaggaagagaag aggcttgctg ctctttctgatgaggctagggctgttgagaacgctaggaagaatctggataccgcaaagt ccgagctggc taatgtggattctgatatcgagaggcagaggtcccagctgtcatctcttgatgctgatgt gaagaaggct gaagagaacctgaggctgaccatgaggattaagggtaggatcggtaggaagatgcaggct aagtcacagg ctatcgtggatgataagaaaaggatctactccgatgctgagaacgtgctgaataccatga ccgtgaatag gaacctgaaggctcagcaggttaccgatgcagagaatgagcttaaggtggcaatcgataa cctgaacagc agccagatgaagaacgctgtggatgctaccgtgtctttctaccagactctgaccgagaag tacggtgaga agtacagccttatcgctcaagagctggcagagaagtccaagggtaagaaaatcggaaatg tggatgaggc tctggctgcattcgagaagtataaggatgtgctggataagaagttcagcaaggctgatag ggatgctatt gtgaacgctctgaagtccttcaactacgatgattgggctaagcacctggatcagttcgct aagtacctga agatcaccggtcacgtgagcttcggttacgatgttgtgtctgatgtgctgaaggctagcg agactggtga ttggaagcctctgttcattacccttgagcagaaggtgttggatactggtatgagctacct ggtggtgctg atgttctctcttattgctggaaccaccctgggaatcttcggtgtggctattattaccgct atcctgtgca gcttcgtggataagtacatcctgaacgcactgaacgatgctctgggaatctaagcttact agagcgtggt gcgcacgatagcgcatagtgtttttctctccacttgaatcgaagagatagacttacggtg taaatccgta ggggtggcgtaaaccaaattacgcaatgttttgggttccatttaaatcgaaaccccttat ttcctggatc acctgttaacgcacgtttgacgtgtattacagtgggaataagtaaaagtgagaggttcga atcctcccta accccgggtaggggcccagcggccgctctagctagagtcaagcagatcgttcaaacattt ggcaataaag tttcttaagattgaatcctgttgccggtcttgcgatgattatcatataatttctgttgaa ttacgttaag catgtaataattaacatgtaatgcatgacgttatttatgagatgggtttttatgattaga gtcccgcaat tatacatttaatacgcgatagaaaacaaaatatagcgcgcaaactaggataaattatcgc gcgcggtgtc atctatgttactagatcgacctgcatccaccccagtacattaaaaacgtccgcaatgtgt tattaagttg tctaagcgtcaatttgtttacaccacaatatatcctgccaccagccagccaacagctccc cgaccggcag ctcggcacaaaatcaccactcgatacaggcagcccatcagtcagatcaggatctcctttg cgacgctcac cgggctggttgccctcgccgctgggctggcggccgtctatggccctgcaaacgcgccaga aacgccgtcg aagccgtgtgcgagacaccgcggccgccggcgttgtggatacctcgcggaaaacttggcc ctcactgaca gatgaggggcggacgttgacacttgaggggccgactcacccggcgcggcgttgacagatg aggggcaggc tcgatttcggccggcgacgtggagctggccagcctcgcaaatcggcgaaaacgcctgatt ttacgcgagt ttcccacagatgatgtggacaagcctggggataagtgccctgcggtattgacacttgagg ggcgcgacta ctgacagatgaggggcgcgatccttgacacttgaggggcagagtgctgacagatgagggg cgcacctatt gacatttgaggggctgtccacaggcagaaaatccagcatttgcaagggtttccgcccgtt tttcggccac cgctaacctgtcttttaacctgcttttaaaccaatatttataaaccttgtttttaaccag ggctgcgccc tgtgcgcgtgaccgcgcacgccgaaggggggtgcccccccttctcgaaccctcccggccc gctaacgcgg gcctcccatccccccaggggctgcgcccctcggccgcgaacggcctcaccccaaaaatgg cagcgctggc caattcgtgcgcggaacccctatttgtttatttttctaaatacattcaaatatgtatccg ctcatgagac aataaccctgataaatgcttcaataatattgaaaaaggaagagtatggctaaaatgagaa tatcaccgga attgaaaaaactgatcgaaaaataccgctgcgtaaaagatacggaaggaatgtctcctgc taaggtatat aagctggtgggagaaaatgaaaacctatatttaaaaatgacggacagccggtataaaggg accacctatg atgtggaacgggaaaaggacatgatgctatggctggaaggaaagctgcctgttccaaagg tcctgcactt tgaacggcatgatggctggagcaatctgctcatgagtgaggccgatggcgtcctttgctc ggaagagtat gaagatgaacaaagccctgaaaagattatcgagctgtatgcggagtgcatcaggctcttt cactccatcg acatatcggattgtccctatacgaatagcttagacagccgcttagccgaattggattact tactgaataa cgatctggccgatgtggattgcgaaaactgggaagaagacactccatttaaagatccgcg cgagctgtat gattttttaaagacggaaaagcccgaagaggaacttgtcttttcccacggcgacctggga gacagcaaca tctttgtgaaagatggcaaagtaagtggctttattgatcttgggagaagcggcagggcgg acaagtggta tgacattgccttctgcgtccggtcgatcagggaggatatcggggaagaacagtatgtcga gctatttttt gacttactggggatcaagcctgattgggagaaaataaaatattatattttactggatgaa ttgttttage tgtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaattta aaaggateta ggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagttttcgttcca ctgagcgtca gaccccgtagaaaagatcaaaggatcttcttgagatcctttttttctgcgcgtaatctgc tgcttgcaaa caaaaaaaccaccgctaccagcggtggtttgtttgccggatcaagagctaccaactcttt ttccgaaggt aactggcttcagcagagcgcagataccaaatactgtccttctagtgtagccgtagttagg ccaccacttc aagaactctgtagcaccgcctacatacctcgctctgctaatcctgttaccagtggctgct gccagtggcg ataagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggt cgggctgaac ggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaactgagatacct acagcgtgag ctatgagaaagcgccacgcttcccgaagggagaaaggcggacaggtatccggtaagcggc agggtcggaa caggagagcgcacgagggagcttccagggggaaacgcctggtatctttatagtcctgtcg ggtttcgcca cctctgacttgagcgtcgatttttgtgatgctcgtcaggggggcggagcctatggaaaaa cgccagcaac gcggcctttttacggttcctggcagatcctagatgtggcgcaacgatgccggcgacaagc aggagcgcac cgacttcttccgcatcaagtgttttggctctcaggccgaggcccacggcaagtatttggg caaggggtcg ctggtattcgtgcagggcaagattcggaataccaagtacgagaaggacggccagacggtc tacgggaccg acttcattgccgataaggtggattatctggacaccaaggcaccaggcgggtcaaatcagg aataagggca cattgccccggcgtgagtcggggcaatcccgcaaggagggtgaatgaatcggacgtttga ccggaaggca tacaggcaagaactgatcgacgcggggttttccgccgaggatgccgaaaccatcgcaagc cgcaccgtca tgcgtgcgccccgcgaaaccttccagtccgtcggctcgatggtccagcaagctacggcca agatcgagcg cgacagcgtgcaactggctccccctgccctgcccgcgccatcggccgccgtggagcgttc gcgtcgtctc gaacaggaggcggcaggtttggcgaagtcgatgaccatcgacacgcgaggaactatgacg accaagaagc gaaaaaccgccggcgaggacctggcaaaacaggtcagcgaggccaagcaggccgcgttgc tgaaacacac gaagcagcagatcaaggaaatgcagctttccttgttcgatattgcgccgtggccggacac gatgcgagcg atgccaaacgacacggcccgctctgccctgttcaccacgcgcaacaagaaaatcccgcgc gaggcgctgc aaaacaaggtcattttccacgtcaacaaggacgtgaagatcacctacaccggcgtcgagc tgcgggccga cgatgacgaactggtgtggcagcaggtgttggagtacgcgaagcgcacccctatcggcga gccgatcacc ttcacgttctacgagctttgccaggacctgggctggtcgatcaatggccggtattacacg aaggccgagg aatgcctgtcgcgcctacaggcgacggcgatgggcttcacgtccgaccgcgttgggcacc tggaatcggt gtcgctgctgcaccgcttccgcgtcctggaccgtggcaagaaaacgtcccgttgccaggt cctgatcgac gaggaaatcgtcgtgctgtttgctggcgaccactacacgaaattcatatgggagaagtac cgcaagctgt cgccgacggcccgacggatgttcgactatttcagctcgcaccgggagccgtacccgctca agctggaaac cttccgcctcatgtgcggatcggattccacccgcgtgaagaagtggcgcgagcaggtcgg cgaagcctgc gaagagttgcgaggcagcggcctggtggaacacgcctgggtcaatgatgacctggtgcat tgcaaacgct agggccttgtggggtcagttccggctgggggttcagcagccagcgcctgatctggggaac cctgtggttg gcacatacaaatggacgaacggataaaccttttcacgcccttttaaatatccgattattc taataaacgc tcttttctcttaggtttacccgccaatatatcctgtcaaacactgatagtttaaactgaa ggcgggaaac gacaatctgatctaagctagcttggaattggtaccacgcgtttcgacaaaatttagaacg aacttaatta tgatctcaaatacattgatacatatctcatctagatctaggttatcattatgtaagaaag ttttgacgaa tatggcacgacaaaatggctagactcgatgtaattggtatctcaactcaacattatactt ataccaaaca ttagttagacaaaatttaaacaactattttttatgtatgcaagagtcagcatatgtataa ttgattcaga atcgttttgacgagttcggatgtagtagtagccattatttaatgtacatactaatcgtga atagtgaata tgatgaaacattgtatcttattgtataaatatccataaacacatcatgaaagacactttc tttcacggtc tgaattaattatgatacaattctaatagaaaacgaattaaattacgttgaattgtatgaa atctaattga acaagccaaccacgacgacgactaacgttgcctggattgactcggtttaagttaaccact aaaaaaacgg agctgtcatgtaacacgcggatcgagcaggtcacagtcatgaagccatcaaagcaaaaga actaatccaa gggctgagatgattaattagtttaaaaattagttaacacgagggaaaaggctgtctgaca gccaggtcac gttatctttacctgtggtcgaaatgattcgtgtctgtcgattttaattatttttttgaaa ggccgaaaat aaagttgtaagagataaacccgcctatataaattcatatattttcctctccgctttgaag ttttagtttt attgcaacaacaacaacaaattacaataacaacaaacaaaatacaaacaacaacaacatg gcacaatttc aacaaacaattgacatgcaaactctccaagccgctgcgggacgcaacagcttggtgaatg atttggcatc tcgtcgcgtttacgataatgcagtcgaggagctgaatgctcgttccagacgtcccaaggt aataggaact ttctggatctactttatttgctggatctcgatcttgttttctcaatttccttgagatctg gaattcgttt aatttggatctgtgaacctccactaaatcttttggttttactagaatcgatctaagttga ccgatcagtt agctcgattatagctaccagaatttggcttgaccttgatggagagatccatgttcatgtt acctgggaaa tgatttgtatatgtgaattgaaatctgaactgttgaagttagattgaatctgaacactgt caatgttaga ttgaatctgaacactgtttaaggttagatgaagtttgtgtatagattcttcgaaacttta ggatttgtag tgtcgtacgttgaacagaaagctatttctgattcaatcagggtttatttgactgtattga actctttttg tgtgtttgcaggtccacttctccaaggcagtgtctacggaacagaccctgattgcaacaa acgcatatcc ggagttcgagatttcctttactcatacgcaatccgctgtgcactccttggccggaggcct tcggtcactt gagttggagtatctcatgatgcaagttccgttcggttctctgacgtacgacatcggcggt aacttttccg cgcaccttttcaaagggcgcgattacgttcactgctgcatgcctaatctggatgtacgtg acattgctcg ccatgaaggacacaaggaagctatttacagttatgtgaatcgtttgaaaaggcagcagcg tcctgtgcct gaataccagagggcagctttcaacaactacgctgagaacccgcacttcgtccattgcgac aaacctttcc aacagtgtgaattgacgacagcgtatggcactgacacctacgctgtagctctccatagca tttatgatat ccctgttgaggagttcggttctgcgctactcaggaagaatgtgaaaacttgtttcgcggc ctttcatttc catgagaatatgcttctagattgtgatacagtcacactcgatgagattggagctacgttc cagaaatcag gtaacattccttagttacctttcttttctttttccatcataagtttatagattgtacatg ctttgagatt tttctttgcaaacaatctcaggtgataacctgagcttcttcttccataatgagagcactc tcaattacac ccacagcttcagcaacatcatcaagtacgtgtgcaagacgttcttccctgctagtcaacg cttcgtgtac cacaaggagttcctggtcactagagtcaacacttggtactgcaagttcacgagagtggat acgttcactc tgttccgtggtgtgtaccacaacaatgtggattgcgaagagttttacaaggctatggacg atgcgtggca ctacaaaaagacgttagcaatgcttaatgccgagaggaccatcttcaaggataacgctgc gttaaacttc tggttcccgaaggtgctcttgaaattggaagtcttcttttgttgtctaaacctatcaatt tctttgcgga aatttatttgaagctgtagagttaaaattgagtcttttaaacttttgtaggtgagagaca tggttatcgt ccctctctttgacgcttctatcacaactggtaggatgtctaggagagaggttatggtgaa caaggacttc gtctacacggtcctaaatcacatcaagacctatcaagctaaggcactgacgtacgcaaac gtgctgagct tcgtggagtctattaggtctagagtgataattaacggtgtcactgccaggtaagttgtta cttatgattg ttttcctctctgctacatgtattttgttgttcatttctgtaagatataagaattgagttt tcctctgatg atattattaggtctgaatgggacacagacaaggcaattctaggtccattagcaatgacat tcttcctgat cacgaagctgggtcatgtgcaagatgaaataatcctgaaaaagttccagaagttcgacag aaccaccaat gagctgatttggacaagtctctgcgatgccctgatgggggttattccctcggtcaaggag acgcttgtgc gcggtggttttgtgaaagtagcagaacaagccttagagatcaaggttagtatcatatgaa gaaataccta gtttcagttgatgaatgctattttctgacctcagttgttctcttttgagaattatttctt ttctaatttg cctgatttttctattaattcattaggttcccgagctatactgtaccttcgccgaccgatt ggtactacag tacaagaaggcggaggagttccaatcgtgtgatctttccaaacctctagaagagtcagag aagtactaca acgcattatccgagctatcagtgcttgagaatctcgactcttttgacttagaggcgttta agactttatg tcagcagaagaatgtggacccggatatggcagcaaaggtaaatcctggtccacactttta cgataaaaac acaagattttaaactatgaactgatcaataatcattcctaaaagaccacacttttgtttt gtttctaaag taatttttactgttataacaggtggtcgtagcaatcatgaagtcagaattgacgttgcct ttcaagaaac ctacagaagaggaaatctcggagtcgctaaaaccaggagaggggtcgtgtgcagagcata aggaagtgtt gagcttacaaaatgatgctccgttcccgtgtgtgaaaaatctagttgaaggttccgtgcc ggcgtatgga atgtgtcctaagggtggtggtttcgacaaattggatgtggacattgctgatttccatctc aagagtgtag atgcagttaaaaagggaactatgatgtctgcggtgtacacagggtctatcaaagttcaac aaatgaagaa ctacatagattacttaagtgcgtcgctggcagctacagtctcaaacctctgcaaggtaag aggtcaaaag gtttccgcaatgatccctctttttttgtttctctagtttcaagaatttgggtatatgact aacttctgag tgttccttgatgcatatttgtgatgagacaaatgtttgttctatgttttaggtgcttaga gatgttcacg gcgttgacccagagtcacaggagaaatctggagtgtgggatgttaggagaggacgttggt tacttaaacc taatgcgaaaagtcacgcgtggggtgtggcagaagacgccaaccacaagttggttattgt gttactcaac tgggatgacggaaagccggtttgtgatgagacatggttcagggtggcggtgtcaagcgat tccttgatat attcggatatgggaaaacttaagacgctcacgtcttgcagtccaaatggtgagccaccgg agcctaacgc caaagtaattttggtcgatggtgttcccggttgtggaaaaacgaaggagattatcgaaaa ggtaagttct gcatttggttatgctccttgcattttaggtgttcgtcgctcttccatttccatgaatagc taagattttt tttctctgcattcattcttcttgcctcagttctaactgtttgtggtatttttgttttaat tattgctaca ggtaaacttctctgaagacttgattttagtccctgggaaggaagcttctaagatgatcat ccggagggcc aaccaagctggtgtgataagagcggataaggacaatgttagaacggtggattccttcttg atgcatcctt ctagaagggtgtttaagaggttgtttatcgatgaaggactaatgctgcatacaggttgtg taaatttcct actgctgctatctcaatgtgacgtcgcatatgtgtatggggacacaaagcaaattccgtt catttgcaga gtcgcgaactttccgtatccagcgcattttgcaaaactcgtcgctgatgagaaggaagtc agaagagtta cgctcaggtaaagcaactgtgttttaatcaatttcttgtcaggatatatggattataact taatttttga gaaatctgtagtatttggcgtgaaatgagtttgctttttggtttctcccgtgttataggt gcccggctga tgttacgtatttccttaacaagaagtatgacggggcggtgatgtgtaccagcgcggtaga gagatccgtg aaggcagaagtggtgagaggaaagggtgcattgaacccaataaccttaccgttggagggt aaaattttga ccttcacacaagctgacaagttcgagttactggagaagggttacaaggtaaagtttccaa ctttccttta ccatatcaaactaaagttcgaaactttttatttgatcaacttcaaggccacccgatcttt ctattcctga ttaatttgtgatgaatccatattgacttttgatggttacgcaggatgtgaacactgtgca cgaggtgcaa ggggagacgtacgagaagactgctattgtgcgcttgacatcaactccgttagagatcata tcgagtgcgt cacctcatgttttggtggcgctgacaagacacacaacgtgttgtaaatattacaccgttg tgttggaccc gatggtgaatgtgatttcagaaatggagaagttgtccaatttccttcttgacatgtatag agttgaagca ggtctgtctttcctatttcatatgtttaatcctaggaatttgatcaattgattgtatgta tgtcgatccc aagactttcttgttcacttatatcttaactctctctttgctgtttcttgcaggtgtccaa tagcaattac aaatcgatgcagtattcaggggacagaacttgtttgttcagacgcccaagtcaggagatt ggcgagatat gcaattttactatgacgctcttcttcccggaaacagtactattctcaatgaatttgatgc tgttacgatg aatttgagggatatttccttaaacgtcaaagattgcagaatcgacttctccaaatccgtg caacttecta aagaacaacctattttcctcaagcctaaaataagaactgcggcagaaatgccgagaactg caggtaaaat attggatgccagacgatattctttcttttgatttgtaactttttcctgtcaaggtcgata aattttattt tttttggtaaaaggtcgataatttttttttggagccattatgtaattttcctaattaact gaaccaaaat tatacaaaccaggtttgctggaaaatttggttgcaatgatcaaaagaaacatgaatgcgc cggatttgac agggacaattgacattgaggatactgcatctctggtggttgaaaagttttgggattcgta tgttgacaag gaatttagtggaacgaacgaaatgaccatgacaagggagagcttctccaggtaaggactt ctcatgaata ttagtggcagattagtgttgttaaagtctttggttagataatcgatgcctcctaattgtc catgttttac tggttttctacaattaaaggtggctttcgaaacaagagtcatctacagttggtcagttag cggactttaa ctttgtggatttgccggcagtagatgagtacaagcatatgatcaagagtcaaccaaagca aaagttagac ttgagtattcaagacgaatatcctgcattgcagacgatagtctaccattcgaaaaagatc aatgcgattt tcggtccaatgttttcagaacttacgaggatgttactcgaaaggattgactcttcgaagt ttctgttcta caccagaaagacacctgcacaaatagaggacttcttttctgacctagactcaacccaggc gatggaaatt ctggaactcgacatttcgaagtacgataagtcacaaaacgagttccattgtgctgtagag tacaagatct gggaaaagttaggaattgatgagtggctagctgaggtctggaaacaaggtgagttcctaa gttccatttt tttgtaatccttcaatgttattttaacttttcagatcaacatcaaaattaggttcaattt tcatcaacca aataatatttttcatgtatatataggtcacagaaaaacgaccttgaaagattatacggcc ggaatcaaaa catgtctttggtatcaaaggaaaagtggtgatgtgacaacctttattggtaataccatca tcattgccgc atgtttgagctcaatgatccccatggacaaagtgataaaggcagctttttgtggagacga tagcctgatt tacattcctaaaggtttagacttgcctgatattcaggcgggcgcgaacctcatgtggaac ttcgaggcca aactcttcaggaagaagtatggttacttctgtggtcgttatgttattcaccatgatagag gagccattgt gtattacgatccgcttaaactaatatctaagttaggttgtaaacatattagagatgttgt tcacttagaa gagttacgcgagtctttgtgtgatgtagctagtaacttaaataattgtgcgtatttttca cagttagatg aggccgttgccgaggttcataagaccgcggtaggcggttcgtttgctttttgtagtataa ttaagtattt gtcagataagagattgtttagagatttgttctttgtttgataatgtcgatagtctcgtac gaacctaagg tgagtgatttcctcaatctttcgaagaaggaagagatcttgccgaaggctctaacgaggt taaaaaccgt gtctattagtactaaagatattatatctgtcaaggagtcggagactttgtgtgatataga tttgttaatc aatgtgccattagataagtatagatatgtgggtatcctaggagccgtttttaccggagag tggctagtgc cagacttcgttaaaggtggagtgacgataagtgtgatagataagcgtctggtgaactcaa aggagtgcgt gattggtacgtacagagccgcagccaagagtaagaggttccagttcaaattggttccaaa ttactttgtg tccaccgtggacgcaaagaggaagccgtggcaggtaaggatttttatgatatagtatgct tatgtatttt gtactgaaagcatatcctgcttcattgggatattactgaaagcatttaactacatgtaaa ctcacttgat gatcaataaacttgattttgcaggttcatgttcgtatacaagacttgaagattgaggcgg gttggcagcc gttagctctggaagtagtttcagttgctatggtcaccaataacgttgtcatgaagggttt gagggaaaag gtcgtcgcaataaatgatccggacgtcgaaggtttcgaaggtaagccatcttcctgctta tttttataat gaacatagaaataggaagttgtgcagagaaactaattaacctgactcaaaatctaccctc ataattgttg tttgatattggtcttgtattttgcaggtgtggttgacgaattcgtcgattcggttgcagc atttaaagcg gttgacaactttaaaagaaggaaaaagaaggttgaagaaaagggtgtagtaagtaagtat aagtacagac cggagaagtacgccggtcctgattcgtttaatttgaaagaagaaaacgtcttacaacatt acaaacccga atcagtaccagtatttcgataagaaacaagaaac

SEQ ID NO: 24 nucleotide sequence of binary PVX-based vector used for salmocin ScolE2 immunity protein lmmScolE2 expression in examples gatcggtcgtatcactggaacaacaaccgctgaggctgttgtcactctaccaccaccata actacgtcta cataaccgacgcctaccccagtttcatagtattttctggtttgattgtatgaataatata aataaaaaaa aaaaaaaaaaaaaaaaactagtgagctcttctgtcagcgggcccactgcatccaccccag tacattaaaa acgtccgcaatgtgttattaagttgtctaagcgtcaatttgtttacaccacaatatatcc tgccaccagc cagccaacagctccccgaccggcagctcggcacaaaatcaccactcgatacaggcagccc atcagtcaga tcaggatctcctttgcgacgctcaccgggctggttgccctcgccgctgggctggcggccg tctatggccc tgcaaacgcgccagaaacgccgtcgaagccgtgtgcgagacaccgcggccgccggcgttg tggatacctc gcggaaaacttggccctcactgacagatgaggggcggacgttgacacttgaggggccgac tcacccggcg cggcgttgacagatgaggggcaggctcgatttcggccggcgacgtggagctggccagcct cgcaaatcgg cgaaaacgcctgattttacgcgagtttcccacagatgatgtggacaagcctggggataag tgccctgcgg tattgacacttgaggggcgcgactactgacagatgaggggcgcgatccttgacacttgag gggcagagtg ctgacagatgaggggcgcacctattgacatttgaggggctgtccacaggcagaaaatcca gcatttgcaa gggtttccgcccgtttttcggccaccgctaacctgtcttttaacctgcttttaaaccaat atttataaac cttgtttttaaccagggctgcgccctgtgcgcgtgaccgcgcacgccgaaggggggtgcc cccccttctc gaaccctcccggcccgctaacgcgggcctcccatccccccaggggctgcgcccctcggcc gcgaacggcc tcaccccaaaaatggcagcgctggccaattcgtgcgcggaacccctatttgtttattttt ctaaatacat tcaaatatgtatccgctcatgagacaataaccctgataaatgcttcaataatattgaaaa aggaagagta tggctaaaatgagaatatcaccggaattgaaaaaactgatcgaaaaataccgctgcgtaa aagatacgga aggaatgtctcctgctaaggtatataagctggtgggagaaaatgaaaacctatatttaaa aatgacggac agccggtataaagggaccacctatgatgtggaacgggaaaaggacatgatgctatggctg gaaggaaagc tgcctgttccaaaggtcctgcactttgaacggcatgatggctggagcaatctgctcatga gtgaggccga tggcgtcctttgctcggaagagtatgaagatgaacaaagccctgaaaagattatcgagct gtatgcggag tgcatcaggctctttcactccatcgacatatcggattgtccctatacgaatagcttagac agccgcttag ccgaattggattacttactgaataacgatctggccgatgtggattgcgaaaactgggaag aagacactcc atttaaagatccgcgcgagctgtatgattttttaaagacggaaaagcccgaagaggaact tgtcttttcc cacggcgacctgggagacagcaacatctttgtgaaagatggcaaagtaagtggctttatt gatcttggga gaagcggcagggcggacaagtggtatgacattgccttctgcgtccggtcgatcagggagg atatcgggga agaacagtatgtcgagctattttttgacttactggggatcaagcctgattgggagaaaat aaaatattat attttactggatgaattgttttagctgtcagaccaagtttactcatatatactttagatt gatttaaaac ttcatttttaatttaaaaggatctaggtgaagatcctttttgataatctcatgaccaaaa tcccttaacg tgagttttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgaga tccttttttt ctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttgtttg ccggatcaag agctaccaactctttttccgaaggtaactggcttcagcagagcgcagataccaaatactg tccttctagt gtagccgtagttaggccaccacttcaagaactctgtagcaccgcctacatacctcgctct gctaatcctg ttaccagtggctgctgccagtggcgataagtcgtgtcttaccgggttggactcaagacga tagttaccgg ataaggcgcagcggtcgggctgaacggggggttcgtgcacacagcccagcttggagcgaa cgacctacac cgaactgagatacctacagcgtgagctatgagaaagcgccacgcttcccgaagggagaaa ggcggacagg tatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccagggggaaac gcctggtatc tttatagtcctgtcgggtttcgccacctctgacttgagcgtcgatttttgtgatgctcgt caggggggcg gagcctatggaaaaacgccagcaacgcggcctttttacggttcctggcagatcctagatg tggcgcaacg atgccggcgacaagcaggagcgcaccgacttcttccgcatcaagtgttttggctctcagg ccgaggccca cggcaagtatttgggcaaggggtcgctggtattcgtgcagggcaagattcggaataccaa gtacgagaag gacggccagacggtctacgggaccgacttcattgccgataaggtggattatctggacacc aaggcaccag gcgggtcaaatcaggaataagggcacattgccccggcgtgagtcggggcaatcccgcaag gagggtgaat gaatcggacgtttgaccggaaggcatacaggcaagaactgatcgacgcggggttttccgc cgaggatgcc gaaaccatcgcaagccgcaccgtcatgcgtgcgccccgcgaaaccttccagtccgtcggc tcgatggtcc agcaagctacggccaagatcgagcgcgacagcgtgcaactggctccccctgccctgcccg cgccatcggc cgccgtggagcgttcgcgtcgtctcgaacaggaggcggcaggtttggcgaagtcgatgac catcgacacg cgaggaactatgacgaccaagaagcgaaaaaccgccggcgaggacctggcaaaacaggtc agcgaggcca agcaggccgcgttgctgaaacacacgaagcagcagatcaaggaaatgcagctttccttgt tcgatattgc gccgtggccggacacgatgcgagcgatgccaaacgacacggcccgctctgccctgttcac cacgcgcaac aagaaaatcccgcgcgaggcgctgcaaaacaaggtcattttccacgtcaacaaggacgtg aagatcacct acaccggcgtcgagctgcgggccgacgatgacgaactggtgtggcagcaggtgttggagt acgcgaagcg cacccctatcggcgagccgatcaccttcacgttctacgagctttgccaggacctgggctg gtcgatcaat ggccggtattacacgaaggccgaggaatgcctgtcgcgcctacaggcgacggcgatgggc ttcacgtccg accgcgttgggcacctggaatcggtgtcgctgctgcaccgcttccgcgtcctggaccgtg gcaagaaaac gtcccgttgccaggtcctgatcgacgaggaaatcgtcgtgctgtttgctggcgaccacta cacgaaattc atatgggagaagtaccgcaagctgtcgccgacggcccgacggatgttcgactatttcagc tcgcaccggg agccgtacccgctcaagctggaaaccttccgcctcatgtgcggatcggattccacccgcg tgaagaagtg gcgcgagcaggtcggcgaagcctgcgaagagttgcgaggcagcggcctggtggaacacgc ctgggtcaat gatgacctggtgcattgcaaacgctagggccttgtggggtcagttccggctgggggttca gcagccagcg cctgatctggggaaccctgtggttggcacatacaaatggacgaacggataaaccttttca cgccctttta aatatccgattattctaataaacgctcttttctcttaggtttacccgccaatatatcctg tcaaacactg atagtttaaactgaaggcgggaaacgacaatctgatctaagctaggcatgcctgcaggtc aacatggtgg agcacgacacgcttgtctactccaaaaatatcaaagatacagtctcagaagaccaaaggg caattgagac ttttcaacaaagggtaatatccggaaacctcctcggattccattgcccagctatctgtca ctttattgtg aagatagtggaaaaggaaggtggctcctacaaatgccatcattgcgataaaggaaaggcc atcgttgaag atgcctctgccgacagtggtcccaaagatggacccccacccacgaggagcatcgtggaaa aagaagacgt tccaaccacgtcttcaaagcaagtggattgatgtgatatctccactgacgtaagggatga cgcacaatcc cactatccttcgcaagacccttcctctatataaggaagttcatttcatttggagaggaga aaactaaacc atacaccaccaacacaaccaaacccaccacgcccaattgttacacacccgcttgaaaaag aaagtttaac aaatggccaaggtgcgcgaggtttaccaatcttttacagactccaccacaaaaactctca tccaagatga ggcttatagaaacattcgccccatcatggaaaaacacaaactagctaacccttacgctca aacggttgaa gcggctaatgatctagaggggttcggcatagccaccaatccctatagcattgaattgcat acacatgcag ccgctaagaccatagagaataaacttctagaggtgcttggttccatcctaccacaagaac ctgttacatt tatgtttcttaaacccagaaagctaaactacatgagaagaaacccgcggatcaaggacat tttccaaaat gttgccattgaaccaagagacgtagccaggtaccccaaggaaacaataattgacaaactc acagagatca caacggaaacagcatacattagtgacactctgcacttcttggatccgagctacatagtgg agacattcca aaactgcccaaaattgcaaacattgtatgcgaccttagttctccccgttgaggcagcctt taaaatggaa agcactcacccgaacatatacagcctcaaatacttcggagatggtttccagtatatacca ggcaaccatg gtggcggggcataccatcatgaattcgctcatctacaatggctcaaagtgggaaagatca agtggaggga ccccaaggatagctttctcggacatctcaattacacgactgagcaggttgagatgcacac agtgacagta cagttgcaggaatcgttcgcggcaaaccacttgtactgcatcaggagaggagacttgctc acaccggagg tgcgcactttcggccaacctgacaggtacgtgattccaccacagatcttcctcccaaaag ttcacaactg caagaagccgattctcaagaaaactatgatgcagctcttcttgtatgttaggacagtcaa ggtcgcaaaa aattgtgacatttttgccaaagtcagacaattaattaaatcatctgacttggacaaatac tctgctgtgg aactggtttacttagtaagctacatggagttccttgccgatttacaagctaccacctgct tctcagacac actttctggtggcttgctaacaaagacccttgcaccggtgagggcttggatacaagagaa aaagatgcag ctgtttggtcttgaggactacgcgaagttagtcaaagcagttgatttccacccggtggat ttttctttca aagtggaaacttgggacttcagattccaccccttgcaagcgtggaaagccttccgaccaa gggaagtgtc ggatgtagaggaaatggaaagtttgttctcagatggggacctgcttgattgcttcacaag aatgccagct tatgcggtaaacgcagaggaagatttagctgcaatcaggaaaacgcccgagatggatgtc ggtcaagaag ttaaagagcctgcaggagacagaaatcaatactcaaaccctgcagaaactttcctcaaca agctccacag gaaacacagtagggaggtgaaacaccaggccgcaaagaaagctaaacgcctagctgaaat ccaggagtca atgagagctgaaggtgatgccgaaccaaatgaaataagcgggacgatgggggcaataccc agcaacgccg aacttcctggcacgaatgatgccagacaagaactcacactcccaaccactaaacctgtcc ctgcaaggtg ggaagatgcttcattcacagattctagtgtggaagaggagcaggttaaactccttggaaa agaaaccgtt gaaacagcgacgcaacaagtcatcgaaggacttccttggaaacactggattcctcaatta aatgctgttg gattcaaggcgctggaaattcagagggataggagtggaacaatgatcatgcccatcacag aaatggtgtc cgggctggaaaaagaggacttccctgaaggaactccaaaagagttggcacgagaattgtt cgctatgaac agaagccctgccaccatccctttggacctgcttagagccagagactacggcagtgatgta aagaacaaga gaattggtgccatcacaaagacacaggcaacgagttggggcgaatacttgacaggaaaga tagaaagctt aactgagaggaaagttgcgacttgtgtcattcatggagctggaggttctggaaaaagtca tgccatccag aaggcattgagagaaattggcaagggctcggacatcactgtagtcctgccgaccaatgaa ctgcggctag attggagtaagaaagtgcctaacactgagccctatatgttcaagacctctgaaaaggcgt taattggggg aacaggcagcatagtcatctttgacgattactcaaaacttcctcccggttacatagaagc cttagtctgt ttctactctaaaatcaagctaatcattctaacaggagatagcagacaaagcgtctaccat gaaactgctg aggacgcctccatcaggcatttgggaccagcaacagagtacttctcaaaatactgccgat actatctcaa tgccacacaccgcaacaagaaagatcttgcgaacatgcttggtgtctacagtgagagaac gggagtcacc gaaatcagcatgagcgccgagttcttagaaggaatcccaactttggtaccctcggatgag aagagaaagc tgtacatgggcaccgggaggaatgacacgttcacatacgctggatgccaggggctaacta agccgaaggt acaaatagtgttggaccacaacacccaagtgtgtagcgcgaatgtgatgtacacggcact ttctagagcc accgataggattcacttcgtgaacacaagtgcaaattcctctgccttctgggaaaagttg gacagcaccc cttacctcaagactttcctatcagtggtgagagaacaagcactcagggagtacgagccgg cagaggcaga gccaattcaagagcctgagccccagacacacatgtgtgtcgagaatgaggagtccgtgct agaagagtac aaagaggaactcttggaaaagtttgacagagagatccactctgaatcccatggtcattca aactgtgtcc aaactgaagacacaaccattcagttgttttcgcatcaacaagcaaaagatgagactctcc tctgggcgac tatagatgcgcggctcaagaccagcaatcaagaaacaaacttccgagaattcctgagcaa gaaggacatt ggggacgttctgtttttaaactaccaaaaagctatgggtttacccaaagagcgtattcct ttttcccaag aggtctgggaagcttgtgcccacgaagtacaaagcaagtacctcagcaagtcaaagtgca acttgatcaa tgggactgtgagacagagcccagacttcgatgaaaataagattatggtattcctcaagtc gcagtgggtc acaaaggtggaaaaactaggtctacccaagattaagccaggtcaaaccatagcagccttt taccagcaga ctgtgatgctttttggaactatggctaggtacatgcgatggttcagacaggctttccagc caaaagaagt cttcataaactgtgagacgacgccagatgacatgtctgcatgggccttgaacaactggaa tttcagcaga cctagcttggctaatgactacacagctttcgaccagtctcaggatggagccatgttgcaa tttgaggtgc tcaaagccaaacaccactgcataccagaggaaatcattcaggcatacatagatattaaga ctaatgcaca gattttcctaggcacgttatcaattatgcgcctgactggtgaaggtcccacttttgatgc aaacactgag tgcaacatagcttacacccatacaaagtttgacatcccagccggaactgctcaagtttat gcaggagacg actccgcactggactgtgttccagaagtgaagcatagtttccacaggcttgaggacaaat tactcctaaa gtcaaagcctgtaatcacgcagcaaaagaagggcagttggcctgagttttgtggttggct gatcacacca aaaggggtgatgaaagacccaattaagctccatgttagcttaaaattggctgaagctaag ggtgaactca agaaatgtcaagattcctatgaaattgatctgagttatgcctatgaccacaaggactctc tgcatgactt gttcgatgagaaacagtgtcaggcacacacactcacttgcagaacactaatcaagtcagg gagaggcact gtctcactttcccgcctcagaaactttctttaaccgttaagttaccttagagatttgaat aagatgtcag caccagctagtacaacacagcccatagggtcaactacctcaactaccacaaaaactgcag gcgcaactcc tgccacagcttcaggcctgttcactatcccggatggggatttctttagtacagcccgtgc catagtagcc agcaatgctgtcgcaacaaatgaggacctcagcaagattgaggctatttggaaggacatg aaggtgccca cagacactatggcacaggctgcttgggacttagtcagacactgtgctgatgtaggatcat ccgctcaaac agaaatgatagatacaggtccctattccaacggcatcagcagagctagactggcagcagc aattaaagag gtgtgcacacttaggcaattttgcatgaagtatgccccagtggtatggaactggatgtta actaacaaca gtccacctgctaactggcaagcacaaggtttcaagcctgagcacaaattcgctgcattcg acttcttcaa tggagtcaccaacccagctgccatcatgcccaaagaggggctcatccggccaccgtctga agctgaaatg aatgctgcccaaactgctgcctttgtgaagattacaaaggccagggcacaatccaacgac tttgccagcc tagatgcagctgtcactcgaggaaggatcaccggaacgaccacagcagaggcagtcgtta ctctgcctcc tccataacagaaactttctttaaccgttaagttaccttagagatttgaataagatggata ttctcatcag tagtttgaaaagtttaggttattctaggacttccaaatctttagattcaggacctttggt agtacatgca gtagccggagccggtaagtccacagccctaaggaagttgatcctcagacacccaacattc accgtgcata cactcggtgtccctgacaaggtgagtatcagaactagaggcatacagaagccaggaccta ttcctgaggg caacttcgcaatcctcgatgagtatactttggacaacaccacaaggaactcataccaggc actttttgct gacccttatcaggcaccggagtttagcctagagccccacttctacttggaaacatcattt cgagttccga ggaaagtggcagatttgatagctggctgtggcttcgatttcgagacgaactcaccggaag aagggcactt agagatcactggcatattcaaagggcccctactcggaaaggtgatagccattgatgagga gtctgagaca acactgtccaggcatggtgttgagtttgttaagccctgccaagtgacgggacttgagttc aaagtagtca ctattgtgtctgccgcaccaatagaggaaattggccagtccacagctttctacaacgcta tcaccaggtc aaagggattgacatatgtccgcgcagggccataggctgaccgctccggtcaattctgaaa aagtgtacat agtattaggtctatcatttgctttagtttcaattacctttctgctttctagaaatagctt accccacgtc ggtgacaacattcacagcttgccacacggaggagcttacagagacggcaccaaagcaatc ttgtacaact ccccaaatctagggtcacgagtgagtctacacaacggaaagaacgcagcatttgctgccg ttttgctact gactttgctgatctatggaagtaaatacatatctcaacgcaatcatacttgtgcttgtgg taacaatcat agcagtcattagcacttccttagtgaggactgaaccttgtgtcatcaagattactgggga atcaatcaca gtgttggcttgcaaactagatgcagaaaccataagggccattgccgatctcaagccactc tccgttgaac ggttaagtttccattgatactcgaaagaggtcagcaccagctagcaacaaacaagaacat ggaactgaag aagtccatcagcgattacaccgaggctgagttcaagaagatcatcgaggctatcatcaac tgcgagggtg atgagaaaacccaggatgataaccttgagttcttcatcagggtgaccgagtacccttctg gtagcgatct tatctactaccctgagggtgataacgatggtagcaccgaggcaattatcaaagaaatcaa ggaatggagg gctgctaacggtaagcctggttttaagcaagcttaa

SEQ ID NO: 25 Amino acid sequence of salmocin ScolEIc

MSDNTIAYYEDGVPYSADGKVLIIIDGKMPVDTGGTGGGGGGKAGVTSESSAAIHAT AKWSKAQLQKSLE EKAARERETAAAMAAAKAKRDALTQHLKDIVNDVLYHNAHPPAVIDLAHANNMAMQAEAQ RLGRAKAEEK ARKEAEAAEKSLQEAERQCEEAARQRAEAERQLKQAEAEEKRLAALSEEARAVEIAQKNL AAAQSELSKM DGEIMSLNVRLSTSIHARDAEMNSLSGKRNELAQASAKYKELDELVKKLEPRANDPLQNR PFFDAASRRA RAGDTLAEKQKEVTASETRINELNTEINQVQGAISQANNNRNLNVQQVTETENALKVAID NLNSSQMKNA VDATVSFYQTLTEKYGEKYSLIAQELAEKSKGKKIGNVDEALAAFEKYKDVLDRKFSKAD RDAIVNALKS FNYDDWAKHLDQFAKYLKITGHVSFGYDVVSDVLKARETGDWKPLFITLEQKALDTGMSY LVVLMFSLIA GTTLGIFGVAIITAILCSFVDKYILNALNDALGI SEQ ID NO: 26 Amino acid sequence of salmocin ScolEld

MSDNTIAYYEDGVPYSADGKVLIIIDGKMPVDTGGTGGGGGGKAGVTSESSAAIHAT AKWSKAQLQKSLE EKAARERETAAAMAAAKAKRDALTQHLKDIVNDVLYHNAHPPAVIDLAHANNMAMQAEAQ RLGRAKAEEK ARKEAEAAEKSLQEAERQREEAARQRAEAERQLKQAEAEEKRLAALSEEARAVEIAQKNL AAAQSELSKM DGEIMSLNVRLSTSIHARDAEMNSLSGKRNELAQASAKYKELDELVKKLEPRANDPLQNR PFFDAASRRA RAGDTLAEKQKEVTASETRINELNTEINQVQGAISQANNNRNLNVQQVTETENALKVAID NLNSSQMKNA VDATVSFYQTLTEKYGEKYSLIAQELAEKSKGKKIGNVDEALAAFEKYKDVLDRKFSKAD RDAIVNALKS FNYDDWAKHLDQFAKYLKITGHVSFGYDVVSDVLKARETGDWKPLFITLEQKALDTGMSY LVVLMFSLIA GTTLGIFGVAIITAILCSFVDKYILNALNDALGI

SEQ ID NO: 27 Amino acid sequence of salmocin ScolEle

MSGSIAYYEDGVPYSADGKVVIVITGKLPEGTGGSLTADLGSAGVSESSAAIHATAK WSTAQLQKTKAEQ AVKVKEAAVAQAKAKEKRDALTQYLKDIVNQALSHNSRPPAVTDLAHANNMAMQAEAERL RLAKAEAKAR EEAEAAEKAFQLAEQQRLASEREQAETERQLKLAEAEEKRLAALSEEARAVEIAQKNLAA TQSELTNMDG EIQNLNIRLNNNIHERDAETSSLSARRNELFQVSEQYKEIDAQVKKLEPRANDPLQSRPF FAAMTRRANV YTVVQEKQGLVTASETRINQFNADISRLQEEIVKANEKRNMIITHIHEAEEQLKIAKINL INSQIKDATD SVIGFYQTLTEKYGQKYSLLAQELAEKSKGKKIGNVNEALAAFEKYQDVLNKKFSKADRD AIFNALESVK YDDWAKHLDQFAKYLKITGRVSFGYDLVSDVLKVRDTGDWKPLFLTLEKKALDTGLSYLV VLMFSLIAGT TLGIWGVAIVTGILCSFIDKSMLNDLNEALGI

SEQ ID NO: 28 Amino acid sequence of salmocin ScolMa

MTDTITVVAPVPPSGSALAGNYSASTMSAGNRISSGPTFLQFAYPYYQSPQLAVNCA KWILDFVESHDMK NANNQKIFSENVGHFCFADKNLVNYPAMKVLDAFGGDRKFIYSQDQISRLSGDVTTPITA WAHFLWGDGA ARTVNLTDVGLRIQANQISPVMDLVKGGAVGTFPVNAKFTRDTMLDGIIPASYLGNITLQ TTGTLTINSL GAWSYDGVVKAYNDTYDANPSTHRGLLGEYSTSVLGHFSGTPYEIQMPGMIPVKGNGMR

SEQ ID NO: 29 Nucleotide sequence used for salmocin ScolEIc expression in examples atgagcgacaacaccattgcctactacgaggatggtgtgccttacagcgctgatggtaag gtgctgatca tcatcgatggcaagatgcctgttgataccggtggtactggtggtggtggcggtggtaagg ctggtgttac ttctgaatctagcgctgctattcacgctaccgccaagtggtctaaggctcagcttcagaa gtctctggaa gagaaggctgctagagagagggaaactgctgctgctatggctgctgcaaaggctaagaga gatgctctta cccagcacctgaaggacatcgtgaacgatgtgctttaccataacgctcaccctccagctg tgattgatct tgctcacgctaacaacatggctatgcaggctgaagctcagaggcttggtagagctaaggc tgaggaaaag gctcgtaaagaagctgaggctgctgagaagtcacttcaagaggctgaaagacagtgcgag gaagctgcta gacaaagagctgaagcagagaggcaacttaagcaggctgaggcagaagagaagaggcttg ctgctctttc tgaagaggctagggctgttgagatcgctcagaagaatcttgctgctgcacagagcgagct gtccaagatg gatggtgagatcatgtctctgaacgtgcggctgtctacttctatccatgctagggatgcc gagatgaaca gcctttctggtaagaggaatgagctggctcaggctagcgccaagtacaaagaacttgatg agctggtgaa gaagctcgagcctagggctaatgatccacttcagaacaggccattcttcgacgctgctag tagaagggct agagctggcgatactcttgccgagaagcagaaagaggttaccgcttctgagactcggatc aacgagctta acaccgagattaaccaggtgcagggtgctatctcacaggccaacaacaataggaacctca acgtgcagca ggtcaccgagactgagaacgctcttaaggtggcaatcgacaacctgaacagcagccagat gaagaacgct gtggatgctaccgtgagcttctaccagactttgaccgagaagtacggggagaagtacagc cttattgctc aagagctggccgagaagtccaagggtaagaaaattggtaacgtggacgaggctctcgctg ccttcgaaaa gtacaaggatgtgctggaccggaagttcagcaaggctgatagggatgctattgtgaacgc cctgaagtcc ttcaactacgacgattgggctaagcacctggaccagttcgctaagtaccttaagatcacc ggccacgtgt ccttcggttacgatgttgtttctgacgtgctgaaggctcgtgagactggtgattggaagc ctctgttcat taccctcgagcagaaggctttggataccggcatgtcttaccttgtggtgctgatgttctc tctgatcgct ggtactacccttggcattttcggtgtggctatcatcaccgctatcctgtgctcattcgtg gacaagtaca tcctgaacgctctgaacgatgctctgggaatctaa

SEQ ID NO: 30 Nucleotide sequence used for salmocin ScolEld expression in examples atgagcgacaacaccattgcctactacgaggatggtgtgccttacagcgctgatggtaag gtgctgatca tcatcgatggcaagatgcctgttgataccggtggtactggtggtggtggcggtggtaagg ctggtgttac ttctgaatctagcgctgctattcacgctaccgccaagtggtctaaggctcagcttcagaa gtctctggaa gagaaggctgctagagagagggaaactgctgctgctatggctgctgcaaaggctaagaga gatgctctta cccagcacctgaaggacatcgtgaacgatgtgctttaccataacgctcaccctccagctg tgattgatct tgctcacgctaacaacatggctatgcaggctgaagctcagaggcttggtagagctaaggc tgaggaaaag gctcgtaaagaagctgaggctgctgagaagtcacttcaagaggctgaaaggcagagagag gaagctgcaa gacaaagagcagaggctgagagacaacttaagcaggctgaggcagaagagaagaggcttg ctgctctttc tgaagaggctagggctgttgagatcgctcagaagaatcttgctgctgcacagagcgagct gtccaagatg gatggtgagatcatgtctctgaacgtgcggctgtctacttctatccatgctagggatgcc gagatgaaca gcctttctggtaagaggaatgagctggctcaggctagcgccaagtacaaagaacttgatg agctggtgaa gaagctcgagcctagggctaatgatccacttcagaacaggccattcttcgacgctgctag tagaagggct agagctggcgatactcttgccgagaagcagaaagaggttaccgcttctgagactcggatc aacgagctta acaccgagattaaccaggtgcagggtgctatctcacaggccaacaacaataggaacctca acgtgcagca ggtcaccgagactgagaacgctcttaaggtggcaatcgacaacctgaacagcagccagat gaagaacgct gtggatgctaccgtgagcttctaccagactttgaccgagaagtacggggagaagtacagc cttattgctc aagagctggccgagaagtccaagggtaagaaaattggtaacgtggacgaggctctcgctg ccttcgaaaa gtacaaggatgtgctggaccggaagttcagcaaggctgatagggatgctattgtgaacgc cctgaagtcc ttcaactacgacgattgggctaagcacctggaccagttcgctaagtaccttaagatcacc ggccacgtgt ccttcggttacgatgttgtttctgacgtgctgaaggctcgtgagactggtgattggaagc ctctgttcat taccctcgagcagaaggctttggataccggcatgtcttaccttgtggtgctgatgttctc tctgatcgct ggtactacccttggcattttcggtgtggctatcatcaccgctatcctgtgctcattcgtg gacaagtaca tcctgaacgctctgaacgatgctctgggaatctaa

SEQ ID NO: 31 Nucleotide sequence used for salmocin ScolEle expression in examples atgagcggctctatcgcttactacgaggatggtgttccttacagcgctgatggtaaggtg gtgatcgtga ttaccggcaagcttcctgaaggtactggtggttctcttaccgctgatcttggatctgctg gtgtgtctga aagctctgctgctattcatgctaccgccaagtggtctactgctcagcttcaaaagactaa ggctgagcag gccgtgaaggtgaaagaagctgctgttgctcaggccaaggccaaagaaaagagggatgct cttacccagt acctgaaggatatcgtgaaccaggctctgagccataactctagacctccagctgtgactg atctggctca cgctaacaatatggctatgcaggctgaggctgagaggcttagacttgctaaagctgaggc taaggctcgt gaagaagctgaagctgcagagaaggcttttcagcttgctgaacagcagaggcttgcttct gaaagagaac aggctgagacagagcggcagcttaagttggctgaagcagaggaaaagaggctggctgctc tttctgaaga ggctagggctgttgagatcgctcagaagaatcttgctgctacccagtctgagctgaccaa catggatggt gagatccagaacctgaacatccggctgaacaacaacatccatgagagggacgctgagaca agctctctgt ctgctagacggaacgagcttttccaggttagcgagcagtacaaagagatcgacgcccagg ttaagaagct tgagcctagggctaatgaccctcttcagtctaggcctttcttcgctgctatgaccagacg ggctaatgtg tacactgtggtgcaagagaagcagggtcttgtgactgcttctgagactcggatcaaccag ttcaacgctg acatctctaggctgcaagaagagatcgtcaaggccaacgagaagcggaacatgatcatta cccacatcca cgaggctgaggaacagctgaagatcgctaagatcaacctgatcaactcccagatcaagga cgctaccgat agcgtgatcggtttctaccagactctgaccgagaagtacggccagaagtactctttgctt gctcaagagc tggccgagaagtccaagggtaagaaaatcggtaacgtgaacgaggcccttgctgcctttg agaagtacca ggatgtgctgaacaagaagttctccaaggctgacagggacgctatcttcaacgctcttga gtccgtgaag tacgacgattgggctaagcaccttgaccagttcgccaagtaccttaagatcaccggtagg gtgagcttcg gttacgatcttgtgtccgatgtcctcaaggtgagagatactggtgattggaagccgctgt tcttgaccct tgagaagaaggctcttgataccggcctgtcttaccttgtggtgctgatgttctctcttat cgccggtact acccttggcatttggggtgttgctattgtgaccggtatcctgtgctccttcatcgacaag agcatgctga acgacctcaacgaggctcttggtatctaa

SEQ ID NO: 32 Nucleotide sequence used for salmocin ScolMa expression in examples

Atgaccgacactattactgtggttgctcctgtgcctccttctggttctgctcttgct ggtaactacagcg cctctactatgtctgctggcaacaggatttctagcggtcctacctttctgcagttcgctt acccttacta ccagtctcctcagcttgctgtgaattgcgctaagtggatcctggacttcgttgagagcca cgacatgaag aacgccaacaaccagaaaatcttcagcgagaacgtgggccacttctgcttcgctgataag aaccttgtga actacccggccatgaaggtgttggatgcttttggtggtgaccggaagttcatctacagcc aggatcagat ctctcggctgtctggtgatgtgactactcctattactgcttgggctcacttcctgtgggg tgatggtgct gctaggactgtgaatcttaccgatgttggtctgcggatccaggccaatcagatttctcct gtgatggacc tggtgaaaggtggtgctgttggtactttccctgtgaacgctaagttcaccagggatacca tgctggacgg tatcatccctgctagctaccttggtaacattacccttcagaccaccggcaccttgaccat caattctctt ggtgcttggagctacgatggcgtggtgaaggcttacaacgatacctacgatgctaacccg tctactcaca ggggtttgcttggtgagtacagcacttctgtgctcggtcatttctctggaaccccttacg agattcagat gcctggtatgatcccggtgaaaggcaatggtatgaggtaa

SEQ ID NO: 33 Amino acid sequence of salmocin ScoIMb

MTDTITVVAP VPPSGSALAG NYSASTMSAG NRISSGPTFL QFAYPYYQSP QLAVNCAKWI LDFVESHDMK NANNQQIFSE NVGQFCFADK NLVDYPTMKV LDAFGGDRKF IYSQDQISRL SGDVTTPITA WAHFLWGDGA ARTVNLTDVG LRIQPNQISP VMDLVKGGAV GTFPVNAKFT RDTMLDGIIP ASYLGNITLQ TTGTLTINSL GAWSYDGVVK AYNDTYDANP STHRGLLGEY STSVLRHFSG TPYEIQMPGM IPVKGNGMR

SEQ ID NO: 34 Amino acid sequence of salmocin ScolMc

MTDTITVVAP VPPSGSSLAG NYTASTMSSG NRISSGPTFL QFAYPYYQSP QLAVNCAKWI LDFVESHDMK NANNQQIFSE NVGQFCFADK NLVNYPAMKV LDAFGGDRKF IYSQDQISRL SGDVTTPITA WAHFLWGDGA ARTVNLTDVG LRIQPNQISP VMDLVKGGAV GTFPVNAKFT RDTMLDGIIP ASYLGNITLQ TTGTLTINSL GAWSYDGVVK AYNDTYDANP STHRGLLGEY STSVLRHFSG TPYEIQMPGM IPVKGNGMR

SEQ ID NO: 35 Nucleotide sequence used for salmocin ScoIMb expression in examples atgaccgacactattaccgtggttgctcctgttcctccaagcggttctgctcttgctggt aactactctg cctctaccatgtctgctggcaacaggatttctagcggtcctacctttctgcagttcgctt acccttacta ccagtctcctcagcttgctgtgaattgcgctaagtggatcctggacttcgttgagagcca cgacatgaag aacgctaacaaccagcagatcttcagcgagaacgttggccagttttgcttcgctgataag aacctggtgg actacccgaccatgaaggtgttggatgcttttggtggcgaccggaagttcatctactctc aggatcagat cagcaggctgtctggtgatgtgactactcctattactgcttgggctcacttcctgtgggg tgatggtgct gctaggactgtgaatcttaccgatgttggtctgcggatccagccgaatcagatttctcct gtgatggacc tggtgaaaggtggtgctgttggtactttccctgtgaacgctaagttcaccagggatacca tgctggacgg tatcatccctgctagctaccttggtaacattacccttcagaccaccggcaccttgaccat caattctctt ggtgcttggagctacgatggcgtggtgaaggcttacaacgatacctacgatgctaacccg tctactcaca ggggtttgcttggtgagtactctacctctgtgctgaggcacttttctggaaccccttacg agattcagat gcctggtatgatcccggtgaaaggcaacggtatgaggtaa

SEQ ID NO: 36 Nucleotide sequence used for salmocin ScolMc expression in examples atgaccgacactattaccgtggttgctcctgttcctccaagcggttcttctcttgctggt aactacaccg ccagcaccatgtcatctggtaacaggatttctagcggccctaccttccttcagttcgctt atccttacta ccagtctcctcagctggctgtgaattgcgctaagtggattctggacttcgtcgagagcca cgacatgaag aatgctaacaaccagcagatcttcagcgagaacgttggccagttttgcttcgctgataag aacctggtga actacccggctatgaaggtgttggatgctttcggtggtgaccggaagttcatctactctc aggatcagat cagcaggctgtctggtgatgtgactactcctattactgcttgggctcacttcctgtgggg tgatggtgct gctaggactgtgaatcttaccgatgttggtctgcggatccagccgaatcagatttctcct gtgatggatc tggtgaaaggtggtgctgttggtactttccctgtgaacgctaagttcacccgggatacca tgctggacgg tattatccctgctagctacctgggtaacattacccttcagaccaccggtactctgaccat caattctctt ggcgcttggagctacgatggtgtggtgaaggcttacaacgatacctacgacgctaacccg tctactcaca gaggtttgcttggtgagtacagcacctctgtgcttaggcacttttctggaaccccttacg agatccagat gcctggtatgattccggtgaaaggcaacggcatgaggtaa This patent application claims the priority of US patent application 16/577,484, filed on September 20, 2019, published as US-2020-0010517-A1, the content of which is incorporated herein by reference including entire description, claims, figures, and sequence listing.