Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
BACTERIOCIN POLYPEPTIDES, NUCLEIC ACIDS ENCODING SAME, AND METHODS OF USE THEREOF
Document Type and Number:
WIPO Patent Application WO/2023/235682
Kind Code:
A1
Abstract:
Provided herein are fusion polypeptides that include an amino acid sequence of a bacteriocin flanked at both the N- and C-termini by a split intein that circularizes the bacteriocin. Also provided are nucleic acids and genetic vectors encoding the fusion polypeptide, and microbial cells genetically engineered with the nucleic acids or genetic vectors. Further provided are methods of making a circular bacteriocin, methods of screening using a library of nucleic acids or genetic vectors encoding the fusion polypeptide, and methods of controlling the growth of an organism using circular bacteriocins made by the methods provided herein.

Inventors:
BORRERO DEL PINO JUAN (ES)
GABANT PHILIPPE (BE)
Application Number:
PCT/US2023/067567
Publication Date:
December 07, 2023
Filing Date:
May 26, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SYNGULON SA (BE)
COMPLUTENSE UNIV OF MADRID (ES)
TSUNOZAKI MAKOTO (US)
International Classes:
C07K14/315; C07K14/195
Domestic Patent References:
WO2019046577A12019-03-07
Foreign References:
US9333227B22016-05-10
Other References:
"Non-Natural Amino Acids", vol. 462, 1 January 2009, ELSEVIER, ISBN: 978-0-12-374310-7, ISSN: 0076-6879, article ZHANG XINGANG ET AL: "Chapter 6 Using Expressed Protein Ligation to Probe the Substrate Specificity of Lantibiotic Synthetases", pages: 117 - 134, XP093082389, DOI: 10.1016/S0076-6879(09)62006-1
SCHREIBER CHRISTINE ET AL: "A high-throughput expression screening platform to optimize the production of antimicrobial peptides", MICROBIAL CELL FACTORIES, vol. 16, no. 1, 13 February 2017 (2017-02-13), XP055777618, Retrieved from the Internet DOI: 10.1186/s12934-017-0637-5
SCOTT C P ET AL: "Production of cyclic peptides and proteins in vivo", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, NATIONAL ACADEMY OF SCIENCES, vol. 96, no. 24, 23 November 1999 (1999-11-23), pages 13638 - 13643, XP002137479, ISSN: 0027-8424, DOI: 10.1073/PNAS.96.24.13638
GABANT PHILIPPE ET AL: "PARAGEN 1.0: A Standardized Synthetic Gene Library for Fast Cell-Free Bacteriocin Synthesis", FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, vol. 7, 6 September 2019 (2019-09-06), XP055855697, DOI: 10.3389/fbioe.2019.00213
TAVASSOLI A ET AL: "Split-intein mediated circular ligation used in the synthesis of cyclic peptide libraries in E. coli", NATURE PROTOCOLS, NATURE PUBLISHING GROUP, GB, vol. 2, no. 5, 1 January 2007 (2007-01-01), pages 1126 - 1133, XP001538220, ISSN: 1750-2799, DOI: 10.1038/NPROT.2007.152
JAIME E. TOWNEND ET AL: "Traceless Production of Cyclic Peptide Libraries in E. coli", ACS CHEMICAL BIOLOGY, vol. 11, no. 6, 6 April 2016 (2016-04-06), pages 1624 - 1630, XP055521611, ISSN: 1554-8929, DOI: 10.1021/acschembio.6b00095
INGHAM A.B. ET AL: "A versatile system for the expression of nonmodified bacteriocins in Escherichia coli", JOURNAL OF APPLIED MICROBIOLOGY, vol. 98, no. 3, 1 March 2005 (2005-03-01), GB, pages 676 - 683, XP093082240, ISSN: 1364-5072, DOI: 10.1111/j.1365-2672.2004.02502.x
PEÑA NURIA ET AL: "In vitro and in vivo production and split-intein mediated ligation (SIML) of circular bacteriocins", FRONTIERS IN MICROBIOLOGY, vol. 13, 14 November 2022 (2022-11-14), XP093081573, DOI: 10.3389/fmicb.2022.1052686
R. H. PEREZ, T. ZENDO, AND K. SONOMOTO: "Circular and Leaderless Bacteriocins: Biosynthesis, Mode of Action, Applications, and Prospects", FRONT. MICROBIOL., vol. 9, 2018, pages 2085
MOUNT D.: "Bioinformatics: Sequence and Genome Analysis", 2004, COLD SPRING HARBOR LABORATORY PRESS
HENIKOFFHENIKOFF, PNAS, vol. 89, 1992, pages 915 - 919
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 10
ALTSCHUL ET AL., NUCLEIC ACIDS RES., vol. 25, no. 17, 1997, pages 3389 - 3402
DI VENTURA ET AL., BIOLOGICAL CHEMISTRY, vol. 400, no. 4, 2019, pages 467 - 475
GIBSON ET AL.: "Creation of a Bacterial Cell Controlled by a Chemically Synthesized Genome", SCIENCE, vol. 329, 2010, pages 52 - 56, XP055082599, DOI: 10.1126/science.1190719
"Building-in biosafety for synthetic biology", MICROBIOLOGY, vol. 159, 2013, pages 1221 - 1235
M. YOUNES ET AL.: "Safety of nisin (E 234) as a food additive in the light of new toxicological data and the proposed extension of use", EFSA J., vol. 15, no. 12, December 2017 (2017-12-01)
S. SOLTANI ET AL.: "Bacteriocins as a new generation of antimicrobials: toxicity aspects and regulations", FEMS MICROBIOL. REV., vol. 45, no. 1, January 2021 (2021-01-01)
M. L. CHIKINDASR. WEEKSD. DRIDERV. A. CHISTYAKOVL. M. DICKS: "Functions and emerging applications of bacteriocins", CURR. OPIN. BIOTECHNOL., vol. 49, February 2018 (2018-02-01), pages 23, XP055718331, DOI: 10.1016/j.copbio.2017.07.011
M. ZIMINA ET AL.: "Overview of Global Trends in Classification, Methods of Preparation and Application of Bacteriocins", ANTIBIOT. 2020, vol. 9, no. 9, August 2020 (2020-08-01), pages 553
P. D. COTTERR. P. ROSSC. HILL: "Bacteriocins-a viable alternative to antibiotics?", NAT. REV. MICROBIOL., vol. 11, no. 2, February 2013 (2013-02-01), pages 95 - 105
J. BORRERO ET AL.: "Characterization of garvicin ML, a novel circular bacteriocin produced by Lactococcus garvieae DCC43, isolated from mallard ducks (Anas platyrhynchos", APPL. ENVIRON. MICROBIOL, vol. 77, no. 1, January 2011 (2011-01-01), pages 369 - 373, XP055341919, DOI: 10.1128/AEM.01173-10
P. ALVAREZ-SIEIROM. MONTALBAN-LOPEZD. MUO. P. KUIPERS: "Bacteriocins of lactic acid bacteria: extending the family", APPL. MICROBIOL. BIOTECHNOL., vol. 100, no. 7, April 2016 (2016-04-01), pages 2939 - 2951, XP035870780, DOI: 10.1007/s00253-016-7343-9
J. BORRERO ET AL.: "Plantaricyclin A, a Novel Circular Bacteriocin Produced by Lactobacillus plantarum NI326: Purification, Characterization, and Heterologous Production", APPL. ENVIRON. MICROBIOL., vol. 84, no. 1, January 2018 (2018-01-01)
D. MAJORL. FLANZBAUML. LUSSIERC. DAVIESK. M. P. CALDOJ. Z. ACEDO: "Transporter Protein-Guided Genome Mining for Head-to-Tail Cyclized Bacteriocins", MOLECULES, vol. 26, no. 23, December 2021 (2021-12-01)
B. XIN ET AL.: "In Silico Analysis Highlights the Diversity and Novelty of Circular Bacteriocins in Sequenced Microbial Genomes", MSYSTEMS, vol. 5, no. 3, June 2020 (2020-06-01)
B. VEZINAB. H. A. REHMA. T. SMITH: "Bioinformatic prospecting and phylogenetic analysis reveals 94 undescribed circular bacteriocins and key motifs", BMC MICROBIOL., vol. 20, no. 1, April 2020 (2020-04-01), pages 77
P. GABANTJ. BORRERO: "PARAGEN 1.0: A Standardized Synthetic Gene Library for Fast Cell-Free Bacteriocin Synthesis", FRONT. BIOENG. BIOTECHNOL., vol. 7, 2019, pages 213
C. P. SCOTTE. ABEL-SANTOSM. WALLD. C. WAHNONS. J. BENKOVIC: "Production of cyclic peptides and proteins in vivo", PROC. NATL. ACAD. SCI. U. S. A., vol. 96, no. 24, November 1999 (1999-11-01), pages 13638 - 13643
A. TAVASSOLIS. J. BENKOVIC: "Split-intein mediated circular ligation used in the synthesis of cyclic peptide libraries in E. coli", NAT. PROTOC., vol. 2, no. 5, 2007, pages 1126 - 1133, XP001538220, DOI: 10.1038/nprot.2007.152
J. E. TOWNENDA. TAVASSOLI: "Traceless Production of Cyclic Peptide Libraries in E. coli.", ACS CHEM. BIOL., vol. 11, no. 6, June 2016 (2016-06-01), pages 1624 - 1630, XP055521611, DOI: 10.1021/acschembio.6b00095
M. CHERIYANC. S. PEDAMALLUK. TORIF. PERLER: "Faster protein splicing with the Nostoc punctiforme DnaE intein using non-native extern residues", J. BIOL. CHEM., vol. 288, no. 9, March 2013 (2013-03-01), pages 6202 - 6211, XP055139724, DOI: 10.1074/jbc.M112.433094
A. CHOPINM. C. CHOPINA. MOILLO-BATTP. LANGELLA: "Two plasmid-determined restriction and modification systems in Streptococcus lactis", PLASMID, vol. 11, no. 3, May 1984 (1984-05-01), pages 260 - 263
Attorney, Agent or Firm:
CHRISTENSEN, Michael, R. (US)
Download PDF:
Claims:
WHAT IS CLAIMED IS:

1. A fusion polypeptide comprising an amino acid sequence of a bactcriocin flanked at both the N- and C-termini by a split intein that circularizes the bacteriocin.

2. The fusion polypeptide of claim 1, wherein the bacteriocin is a natively circular bacteriocin.

3. The fusion polypeptide of claim 1 or 2, wherein the amino acid sequence of the bacteriocin is circularly permuted compared to a native amino acid sequence of the bacteriocin.

4. The fusion polypeptide of anyone of claims 1-3, wherein the first residue of the amino acid sequence of the bacteriocin is a serine or a cysteine that is present in the native amino acid sequence of the bacteriocin.

5. The fusion polypeptide of anyone of claims 1-3, wherein the first residue of the amino acid sequence of the bacteriocin is a non-native serine or a non-native cysteine.

6. The fusion polypeptide of claim 5, wherein the non-native serine or the non- native cysteine substitutes a native amino acid residue in the amino acid sequence of the bacteriocin.

7. The fusion polypeptide of claim 5, wherein the length of the amino acid sequence of the bacteriocin is increased by one residue due to the non-native serine or the non- native cysteine compared to the length of the native amino acid sequence of the bacteriocin.

8. The fusion polypeptide of any one of claims 5-7, wherein the native amino acid sequence of the bacteriocin does not comprise a serine or cysteine.

9. The fusion polypeptide of any one of the preceding claims, wherein the split intein is based on an intein from one of the following: Npu DnaE, See VMA, Ssp DnaE.

10. The fusion polypeptide of any one of the preceding claims, wherein the split intein is a conditional split intein.

11. The fusion polypeptide of claim 10, wherein the conditional split intein is pH- or temperature-sensitive.

12. The fusion polypeptide of any one of the preceding claims, wherein the split intein comprises a second amino acid sequence of a C-terminal intein fragment (Ic) at least 80% identical to the Ic shown in Table B, and a third amino acid sequence of a N-terminal intein fragment (IN) at least 80% identical to the split intein IN shown in Table B.

13. The fusion polypeptide of any one of the preceding claims, wherein the bactcriocin is selected from any one of the bactcriocins listed in Table A.

14. The fusion polypeptide of any one of the preceding claims, wherein the amino acid sequence of the bacteriocin is at least 80% identical to any one of the sequences listed in Table A.

15. The fusion polypeptide of any one of the preceding claims, wherein the amino acid sequence of the bacteriocin is selected from any one of the sequences listed in Table A.

16. The fusion polypeptide of any one of claims 1-14, wherein the bacteriocin is an engineered bacteriocin.

17. The fusion polypeptide of any one of the preceding claims, wherein one or more amino acids of the polypeptide in the amino acid sequence is a non-natural amino acid.

18. The fusion polypeptide of any one of the preceding claims, further comprising a degradation tag.

19. The fusion polypeptide of claim 18, wherein the degradation tag is at the C- terminus of the fusion polypeptide.

20. The fusion polypeptide of any one of the preceding claims, wherein the split intein comprises a C-terminal intein fragment (“Ic”) fused N-terminal to the amino acid sequence of the bacteriocin and a N-terminal intein fragment (“IN”) fused C-terminal to the amino acid sequence of the bacteriocin, wherein the polypeptide further comprises a degradation tag C-terminal to the IN.

21. The fusion polypeptide of any one of claims 18-20, wherein the degradation tag comprises a sequence at least 80% identical to AANDENYALAA (SEQ ID NO: 873).

22. The fusion polypeptide of any one of the preceding claims, further comprising a signal peptide and/or a leader sequence.

23. A nucleic acid comprising a nucleotide sequence encoding the fusion polypeptide of any one of the preceding claims.

24. The nucleic acid of claim 23, wherein the nucleotide sequence is operably linked to a promoter sequence.

25. The nucleic acid of claim 23 or 24, wherein the nucleic acid comprises DNA.

26. The nucleic acid of claim 23, wherein the nucleic acid comprises RNA.

27. A genetic vector comprising the nucleic acid of any one of claims 23-26.

28. A genetically engineered microbial cell comprising the nucleic acid of any one of claims 23-26, or the genetic vector of claim 27.

29. The microbial cell of claim 28, wherein the microbial cell is resistant to the bacteriocin.

30. The microbial cell of claim 28 or 29, wherein the microbial cell comprises a second nucleic acid encoding an immunity modulator that confers resistant to the bacteriocin.

31. The microbial cell of claim 30, wherein expression of the immunity modulator from the second nucleic acid is regulatable.

32. The microbial cell of any one of claims 28-31, wherein the microbial cell is a bacteria, fungi, or algae.

33. A composition comprising the fusion polypeptide of any one of claims 1-22.

34. A composition comprising a circular bacteriocin and a split intein.

35. A method of making a circular bacteriocin, comprising contacting the nucleic acid of any one of claims 22-26, or the genetic vector of claim 27 with an in vitro expression system under conditions sufficient to produce a circular bacteriocin.

36. A method of making a circular bacteriocin, comprising culturing the microbial cell of any one of claims 28-31 under conditions sufficient to produce a circular bacteriocin.

37. The method of claim 35 or 36, further comprising purifying the circular bacteriocin.

38. The method of any one of claims 35-37, further comprising purifying the fusion polypeptide.

39. The method of any one of claims 35-38, wherein the split intein is a conditional split intein that circularizes the bacteriocin under a permissive condition but not under a non- permissive condition, and wherein the method further comprises exposing the fusion polypeptide to the permissive condition, following exposure to the non-permissive condition, to induce circularization of the bacteriocin.

40. The method of any one of claims 35-39, further comprising modifying the pH or temperature to induce circularization of the bacteriocin, wherein the split intein is pH- or temperature- sensitive, respectively.

41. The method of any one of claims 35-40, further comprising allowing the split intein to be degraded after the circular bacteriocin is produced.

42. A library comprising a plurality of genetic vectors, each genetic vector comprising the nucleic acid of any one of claims 23-26, wherein at least two of the plurality of genetic vectors comprise nucleotide sequences encoding different bacteriocins.

43. The library of claim 41, wherein the nucleotide sequences encode bacteriocins from different microbial species.

44. The library of claim 41, wherein the nucleotide sequences comprise different sequence variants of a parent bacteriocin.

45. The library of claim 44, wherein the parent bacteriocin is a natively circular bacteriocin, and the sequence variants comprise a first variant that abrogates natural circularization of the parent bacteriocin.

46. A method of screening, comprising: providing the library of any one of claims 42-45 ; expressing a plurality of polypeptides encoded by one of more genetic vectors of the library; generating a plurality of circular bacteriocins from the plurality of expressed polypeptides; and assaying the plurality of circular bacteriocins for a desired activity.

47. The method of claim 46, wherein the desired activity comprises antimicrobial activity.

48. A method of controlling the growth of a microorganism, comprising contacting a composition comprising and/or conducive to supporting the growth of a microorganism with the microbial cell of any one of claims 28-32 under conditions sufficient to produce a circular bacteriocin, to thereby control the growth of the microorganism.

49. A method of controlling the growth of a microorganism, comprising contacting a composition comprising and/or conducive to supporting the growth of a microorganism with a circular bacteriocin made by the method of any one of claims 35-38, to thereby control the growth of the microorganism.

50. A method of controlling the growth of a microorganism, comprising contacting a composition comprising and/or conducive to supporting the growth a microorganism with the fusion polypeptide of any one of claims 1-22, to thereby control the growth of the microorganism.

51. The method of any one of claims 48-50, wherein the microorganism is a bacteria.

52. The method of any one of claims 48-51, wherein the composition is a culture medium, feedstock, or a microbiome.

53. The method of any one of claims 48-52, wherein the split intein is a conditional split intein that circularizes the bacteriocin under a permissive condition but not under a non- permissive condition, and wherein the method further comprises providing the permissive condition to the composition to thereby induce circularization of the bacteriocin.

54. The method of any one of claims 48-53, wherein the method comprises modifying the pH or temperature of the composition to induce circularization of the bacteriocin, wherein the split intein is pH- or temperature- sensitive, respectively.

55. A method of designing a nucleic acid encoding a polypeptide precursor of a bacteriocin, comprising: identifying a native amino acid sequence of a candidate bacteriocin, wherein the native amino acid sequence does not comprise a serine or cysteine at the N- terminus; providing a second amino acid sequence having a serine or cysteine at the N- terminus thereof by at least one of: circularly permuting the native amino acid sequence; or introducing a serine or cysteine to the native amino acid sequence; providing a nucleotide sequence encoding a polypeptide comprising the second amino acid sequence flanked at both the N- and C -termini by a split intein configured to circularize the bacteriocin; and expressing the polypeptide encoded by the nucleotide sequence.

56. The method of claim 55, wherein the candidate bacteriocin is predicted to be a circular bacteriocin based on a genomic sequence of a microorganism that encodes the candidate bacteriocin in its genome.

57. The method of claim 55 or 56, comprising: identifying a plurality of native amino acid sequences of a plurality of different candidate bacteriocins; for each of the plurality of native amino acid sequences: providing the second amino acid sequence; and providing the nucleotide sequence encoding a polypeptide comprising the second amino acid sequence flanked at both the N- and C-termini by a split intein configured to circularize the bacteriocin, thereby generating a library of nucleic acids representing each of the plurality of native amino acid sequences.

58. The method of any one of claims 55-57, wherein the polypeptide further comprises a degradation tag.

59. The method of any one of claims 55-58, wherein the polypeptide further comprises a signal peptide and/or leader sequence.

60. The method of any one of claims 55-59, wherein the polypeptide is expressed in vitro.

61. The method of any one of claims 55-59, wherein the polypeptide is expressed from a genetically engineered microbial cell configured to express the polypeptide encoded by the nucleotide sequence.

62. The fusion protein, nucleic acid, vector, library, microbial cell, or method of any one of the preceding claims, wherein the polypeptide comprises an affinity tag, optionally wherein the affinity tag is chitin binding protein (CBP).

Description:
BACTERIOCIN POLYPEPTIDES, NUCLEIC ACIDS ENCODING SAME, AND

METHODS OF USE THEREOF

REFERENCE TO RELATED APPLICATIONS

[0001] The present application claims benefit of U.S. Provisional Application No. 63/365584, filed May 31, 2022. The entirety of this related application is incorporated herein by reference.

REFERENCE TO SEQUENCE LISTING

[0002] The present application is being filed along with a Sequence Listing in electronic format. The Sequence Listing is provided as a file entitled SeqListSyng.012wo.xml, created on May 25, 2023, which is 1,024,021 bytes in size. The information in the electronic format of the Sequence Listing is incorporated herein by reference in its entirety.

BACKGROUND

Field

[0003] The present disclosure generally relates to antimicrobial peptides, such as bacteriocins.

[0004] Bacteriocins are ribosomally synthesized antimicrobial peptides produced by bacteria. Applications of bacteriocins have been traditionally focused on food preservation, mainly due to the widespread presence of these peptides within the lactic acid bacteria group, and the approval of nisin as food preservative by the regulatory agencies. The use of bacteriocins as antimicrobial agents in human and animal health and non-food industrial applications, among others, are also contemplated.

SUMMARY

[0005] Circular bacteriocins are a class of antimicrobial peptides produced by Gram-positive bacteria that after production undergo a head to tail ligation. Compared to their linear counterparts, circular bacteriocins are, in general, quite stable to temperature and pH changes and more resistant to proteolytic enzymes, being considered as a promising group of antimicrobial peptides for industrial applications. A limited number of circular bacteriocins have been produced and fully characterized, although many operons potentially coding for new circular bactcriocins arc found in genomes in the databases. The activity of several proteins mediate the production and circularization of these bacteriocins and genes encoding these proteins are expressed by the native bacteriocin producing bacteria or can be expressed in a heterologous host. Provided herein are methods of carrying out bacteriocin circularization by using the split-intein circular ligation of peptides and proteins (SICCLOPPS) system. In some embodiments, methods of the present disclosure provide fast and efficient options for in vitro (by a cell-free protein system) and in vivo (by E. coli) production and correct circularization of characterized and/or novel circular bacteriocins. In some embodiments, the present disclosure provides intein-based synthetic biology tools for the production and characterization of new circular bacteriocins, the biosynthesis of variants and/or the production of these peptides in other hosts.

[0006] Provided herein is a fusion polypeptide comprising an amino acid sequence of a bacteriocin flanked at both the N- and C-termini by a split intein that circularizes the bacteriocin. Optionally, the bacteriocin is a natively circular bacteriocin. In some embodiments, the amino acid sequence of the bacteriocin is circularly permuted compared to a native amino acid sequence of the bacteriocin. In some embodiments, the first residue of the amino acid sequence of the bacteriocin is a serine or a cysteine that is present in the native amino acid sequence of the bacteriocin.

[0007] In some embodiments, the first residue of the amino acid sequence of the bacteriocin is a non-native serine or a non-native cysteine. Optionally, the non-native serine or the non-native cysteine substitutes a native amino acid residue in the amino acid sequence of the bacteriocin. Optionally, the length of the amino acid sequence of the bacteriocin is increased by one residue due to the non-native serine or the non-native cysteine compared to the length of the native amino acid sequence of the bacteriocin. In some embodiments, the native amino acid sequence of the bacteriocin does not comprise a serine or cysteine.

[0008] In some embodiments, the split intein is based on an intein from one of the following: Npu DnaE, See VMA, Ssp DnaE. In some embodiments, the split intein is a conditional split intein. In some embodiments, the conditional split intein is pH- or temperature- sensitive. In some embodiments, the split intein comprises a second amino acid sequence of a C-terminal intein fragment (Ic) at least 80% identical to the Ic shown in Table B, and a third amino acid sequence of a N-terminal intein fragment (TN) at least 80% identical to the split intein IN shown in Table B.

[0009] In some embodiments, the bacteriocin is selected from any one of the bacteriocins listed in Table A. In some embodiments, the amino acid sequence of the bacteriocin is at least 80% identical to any one of the sequences listed in Table A. In some embodiments, the amino acid sequence of the bacteriocin is selected from any one of the sequences listed in Table A.

[0010] In some embodiments, the bacteriocin is an engineered bacteriocin. In some embodiments, one or more amino acids of the polypeptide in the amino acid sequence is a nonnatural amino acid.

[0011] In some embodiments, the fusion polypeptide further comprises a degradation tag. Optionally, the degradation tag is at the C-terminus of the fusion polypeptide. In some embodiments, the split intein comprises a C-terminal intein fragment (“Ic”) fused N- terminal to the amino acid sequence of the bacteriocin and a N-terminal intein fragment (“IN”) fused C-terminal to the amino acid sequence of the bacteriocin, wherein the polypeptide further comprises a degradation tag C-terminal to the IN. In some embodiments, the degradation tag comprises a sequence at least 80% identical to AANDENYALAA (SEQ ID NO: 873).

[0012] In some embodiments, the fusion polypeptide further comprises a signal peptide and/or a leader sequence.

[0013] Also provided herein is a nucleic acid comprising a nucleotide sequence encoding the fusion polypeptide of any one of the preceding claims. Optionally, the nucleotide sequence is operably linked to a promoter sequence. In some embodiments, the nucleic acid comprises DNA. Optionally, the nucleic acid comprises RNA. Also provided is a genetic vector comprising the nucleic acid of the present disclosure.

[0014] Also provided is a genetically engineered microbial cell comprising the nucleic acid of the present disclosure, or the genetic vector of the present disclosure. Optionally, the microbial cell is resistant to the bacteriocin. In some embodiments, the microbial cell comprises a second nucleic acid encoding an immunity modulator that confers resistant to the bacteriocin. Optionally, expression of the immunity modulator from the second nucleic acid is regulatable. In some embodiments, the microbial cell is a bacteria, fungi, or algae. [0015] Also provided herein is a composition comprising the fusion polypeptide of the present disclosure. Provided herein is a composition comprising a circular bactcriocin and a split intein.

[0016] Further provided herein is a method of making a circular bacteriocin, comprising contacting the nucleic acid of the present disclosure, or the genetic vector of the present disclosure with an in vitro expression system under conditions sufficient to produce a circular bacteriocin. Also provided is a method of making a circular bacteriocin, comprising culturing the microbial cell of the present disclosure under conditions sufficient to produce a circular bacteriocin.

[0017] In some embodiments, the method further comprises purifying the circular bacteriocin. In some embodiments, the method further comprises purifying the fusion polypeptide. In some embodiments, the split intein is a conditional split intein that circularizes the bacteriocin under a permissive condition but not under a non-permissive condition, and wherein the method further comprises exposing the fusion polypeptide to the permissive condition, following exposure to the non-permissive condition, to induce circularization of the bacteriocin. hr some embodiments, the method further comprises modifying the pH or temperature to induce circularization of the bacteriocin, wherein the split intein is pH- or temperature- sensitive, respectively. In some embodiments, the method further comprises allowing the split intein to be degraded after the circular bacteriocin is produced.

[0018] Also provided is a library comprising a plurality of genetic vectors, each genetic vector comprising the nucleic acid of the present disclosure, wherein at least two of the plurality of genetic vectors comprise nucleotide sequences encoding different bacteriocins. Optionally, the nucleotide sequences encode bacteriocins from different microbial species. Optionally, the nucleotide sequences comprise different sequence variants of a parent bacteriocin. Optionally, the parent bacteriocin is a natively circular bacteriocin, and the sequence variants comprise a first variant that abrogates natural circularization of the parent bacteriocin.

[0019] Also provided herein is a method of screening, comprising: providing the library of the present disclosure; expressing a plurality of polypeptides encoded by one of more genetic vectors of the library; generating a plurality of circular bacteriocins from the plurality of expressed polypeptides; and assaying the plurality of circular bacteriocins for a desired activity. Optionally, the desired activity comprises antimicrobial activity.

[0020] Further provided is a method of controlling the growth of a microorganism, comprising contacting a composition comprising and/or conducive to supporting the growth of a microorganism with the microbial cell of the present disclosure under conditions sufficient to produce a circular bacteriocin, to thereby control the growth of the microorganism. Also provided is a method of controlling the growth of a microorganism, comprising contacting a composition comprising and/or conducive to supporting the growth of a microorganism with a circular bacteriocin made by the method of the present disclosure, to thereby control the growth of the microorganism. Provided herein is a method of controlling the growth of a microorganism, comprising contacting a composition comprising and/or conducive to supporting the growth a microorganism with the fusion polypeptide of the present disclosure, to thereby control the growth of the microorganism.

[0021] In some embodiments, the microorganism is a bacteria. In some embodiments, the composition is a culture medium, feedstock, or a microbiome. In some embodiments, the split intein is a conditional split intein that circularizes the bacteriocin under a permissive condition but not under a non-permissive condition, and wherein the method further comprises providing the permissive condition to the composition to thereby induce circularization of the bacteriocin. In some embodiments, the method comprises modifying the pH or temperature of the composition to induce circularization of the bacteriocin, wherein the split intein is pH- or temperature-sensitive, respectively.

[0022] Further provided is a method of designing a nucleic acid encoding a polypeptide precursor of a bacteriocin, comprising: identifying a native amino acid sequence of a candidate bacteriocin, wherein the native amino acid sequence does not comprise a serine or cysteine at the N-terminus; providing a second amino acid sequence having a serine or cysteine at the N-terminus thereof by at least one of: circularly permuting the native amino acid sequence; or introducing a serine or cysteine to the native amino acid sequence; providing a nucleotide sequence encoding a polypeptide comprising the second amino acid sequence flanked at both the N- and C-termini by a split intein configured to circularize the bacteriocin; and expressing the polypeptide encoded by the nucleotide sequence. Optionally, the candidate bacteriocin is predicted to be a circular bacteriocin based on a genomic sequence of a microorganism that encodes the candidate bacteriocin in its genome. Optionally, the method includes: identifying a plurality of native amino acid sequences of a plurality of different candidate bacteriocins; for each of the plurality of native amino acid sequences: providing the second amino acid sequence; and providing the nucleotide sequence encoding a polypeptide comprising the second amino acid sequence flanked at both the N- and C-termini by a split intein configured to circularize the bacteriocin, thereby generating a library of nucleic acids representing each of the plurality of native amino acid sequences.

[0023] In some embodiments, the polypeptide further comprises a degradation tag. In some embodiments, the polypeptide further comprises a signal peptide and/or leader sequence. In some embodiments, the polypeptide is expressed in vitro. In some embodiments, the polypeptide is expressed from a genetically engineered microbial cell configured to express the polypeptide encoded by the nucleotide sequence.

BRIEF DESCRIPTION OF THE DRAWINGS

[0024] FIG. 1 is a schematic diagram showing a polypeptide of a bacteriocin flanked by a split intein that is spliced to generate a circular bacteriocin, according to some non-limiting embodiments of the present disclosure.

[0025] FIG. 2A is a schematic diagram showing a circular bacteriocin, according to some non-limiting embodiments of the present disclosure.

[0026] FIG. 2B is a schematic diagram showing structure of nucleic acids encoding a bacteriocin with or without a functional split intein, according to some non-limiting embodiments of the present disclosure.

[0027] FIG. 2C depicts an amino acid sequence of a bacteriocin flanked by a split intein , according to some non-limiting embodiments of the present disclosure.

[0028] FIG. 2D is a schematic diagram showing a polypeptide of a bacteriocin flanked by a split intein that is spliced to generate a circular bacteriocin, according to some non-limiting embodiments of the present disclosure.

[0029] FIG. 3 A is an image showing antimicrobial activity of a circular bacteriocin generated by bacteria genetically engineered with a nucleic acid encoding a bacteriocin flanked by a split intein, according to some non-limiting embodiments of the present disclosure. [0030] FTG. 3B is an image showing antimicrobial activity of a circular bacteriocin generated by bacteria genetically engineered with a nucleic acid encoding a bacteriocin flanked by a split intein, according to some non-limiting embodiments of the present disclosure.

[0031] FIG. 4A is a schematic diagram showing in vitro and in vivo production, followed by evaluation of antimicrobial activity and mass spectrometry analysis of circular bacteriocin, according to some non-limiting embodiments of the present disclosure.

[0032] FIG. 4B is a collection of mass spectra from mass spectrometry analysis of purified circular bacteriocin produced by a genetically engineered bacteria, according to some non-limiting embodiments of the present disclosure.

[0033] FIG. 5 is a block diagram showing a method of screening, according to some non-limiting embodiments of the present disclosure.

DETAILED DESCRIPTION

[0034] Bacteriocins can be divided in two main groups: class I bacteriocins that undergo post- translational modifications and class II or unmodified bacteriocins. Bacteriocins such as lantibiotics, thiopeptides, lassopeptides or sactibiotics belong to class I, and pediocin like bacteriocins, two peptide bacteriocins and linear non-pediocin like, single-peptide bacteriocins belong to class II. Some bacteriocins undergo enzymatic modification during biosynthesis, where an amide bond is formed between the N and C-terminal amino acid, thus acquiring a head-to-tail or circular structure. Without being bound by theory, the circular structure of these bacteriocins is thought to contribute to their higher stability against thermal stress, pH variation, and degradation by many proteolytic enzymes, compared to their linear counterparts. Thus, circular bacteriocins may have a variety of industrial applications.

[0035] Biosynthesis of circular bacteriocins involves the action of different proteins encoded by genes that are usually clustered together. Gene organization in head-to- tail cyclized bacteriocins clusters is well conserved and can include a minimum of 5 to 7 genes encoding the bacteriocin precursor peptide, immunity proteins, membrane DUF95 protein (presumably involved in circularization), and one or more other proteins [9] [10].

[0036] A typical biosynthetic gene cluster for head-to-tail cyclized bacteriocins consists of genes encoding the bacteriocin precursor peptide, transporter protein(s), a SpoIIM (stage II sporulation protein M) membrane protein (previously known as DUF95), an immunity protein, and one or more unknown hydrophobic proteins. The inactive precursor peptide has an N-tcrminal leader sequence and C-tcrminal core peptide. During maturation, the leader peptide is cleaved, and a peptide bond is formed between the new N-terminal amino acid and the C-terminal residue, producing the active head-to-tail cyclized bacteriocin.

[0037] Advances in sequencing and bioinformatics have accelerated exponentially the discovery of novel circular bacteriocins. Numerous potential novel circular bacteriocins have been identified across a wide group of gram positive strains by identifying hypothetical circular bacteriocin clusters in microbial genomes.

[0038] Novel bacteriocins can be experimentally confirmed by production and purification of the antimicrobial peptide in the supernatant of either the native strain or an heterologous host carrying all the genes needed for biosynthesis of the mature bacteriocin. This process can be laborious, expensive and time consuming and in most cases requires the native bacteriocin producing bacteria. Alternatively, a cell-free protein synthesis approach can be used for the production of bacteriocins. In vitro production can allow testing of the properties of the bacteriocin including industrially relevant ones that may be more difficult by other approaches, such as by fermentation (see Gabant and Borrero 2019). In vitro production is also compatible with high throughput approaches to screen collection of genes of bacteriocins or collection of variants thereof. Suitable options of in vitro production include PARAGEN 1.0, as described by Gabant and Borrero (2019), which demonstrated the synthetic production of a collection 164 different class II bacteriocins (called PARAGEN 1.0) using a cell-free protein synthesis approach.

[0039] Split inteins (internal proteins) can be used to circularize peptides. Provided herein, in some embodiments, is a fast and reliable method for producing circular bacteriocins by combining the split intein circular ligation of peptides and proteins (SICLOPPS) method with cell-free protein synthesis. In some embodiments, fusion of the C and N-terminal intein fragments from Nostoc punctiforme (Npu) DnaE split intein to the mature peptide of bacteriocin garvicin ML allows for the production and circularization of this peptide, without any other protein involved in circularization of the peptide in the native context needed. In some embodiments, active garvicin ML is produced both in vitro (by cell-free synthesis) and in vivo (by E. coli). Purification and posterior analysis of garvicin ML has proved correct circularization of the peptide thus obtaining a peptide with the same molecular weight of the native one. Tn some embodiments, other circular bacteriocins both characterized or not yet characterized arc produced. In some embodiments, new candidates can be tested, or libraries of circular bacteriocins can be generated.

[0040] Provided herein are fusion polypeptides and nucleic acids encoding same, for generating circular bacteriocins. In general terms, fusion polypeptides of the present disclosure include an amino acid sequence of a bacteriocin that is flanked on both ends of the amino acid sequence by a split intein that can circularize the bacteriocin. In some embodiments, the fusion polypeptides and nucleic acids of the present disclosure facilitate production of circular bacteriocins. The circular bacteriocins made from the fusion polypeptides of the present disclosure, or from the nucleic acid and genetic vectors encoding same, can have antimicrobial activity. In some embodiments, a circular bacteriocin made from the fusion polypeptide of the present disclosure, or from the nucleic acid and genetic vectors encoding same as disclosed herein, has substantial antimicrobial activity. In some embodiments, a circular bacteriocin made from the fusion polypeptide of the present disclosure, or from the nucleic acid and genetic vectors encoding same as disclosed herein, has at least about the same level of antimicrobial activity as that of the corresponding, natively produced circular bacteriocin. As the circularization of the bacteriocin by the split intein can be achieved without any additional components that may have been involved in a native context (e.g., other proteins encoded by genes of the bacteriocin cluster in the native microbial organism), the circular bactericion can be produced or expressed in a variety of heterologous contexts, e.g., in a heterologous organism that does not have the additional proteins, or in vitro in the absence of the additional components). In some embodiments, the fusion polypeptides and nucleic acids of the present disclosure provide for high-throughput expression of known or putative circular bacteriocins for screening. In some embodiments, the fusion polypeptides and nucleic acids of the present disclosure provide for expression of circular bacteriocins variants having mutations that would have affected circularization of the bacteriocin via the native mechanism, and thereby expand the mutational space for screening variant bacteriocins having a desired activity. In some embodiments, the fusion polypeptides and nucleic acids of the present disclosure provide for expression of circular bacteriocins variants that include nonnatural amino acids, and thereby expand the mutational space for screening variant bacteriocins of interest. In some embodiments, use of a split intein to circularize bacteriocins allow s for an additional level of control for regulating bacteriocin activity, by regulating the cyclizing activity of the split intcin. In some embodiments, circularizing a bacteriocin (including circularizing a naturally linear bacteriocin) improves the stability of the bacteriocin, e.g., by making the bacteriocin more resistant to degradation by heat, pH, or protease.

TERMS

[0041] Unless stated otherwise, terms used herein have their customary and ordinary meaning as understood by one of skill in the art in view of this disclosure.

[0042] As used herein, “bacteriocin,” and variations of this root term, has its customary and ordinary meaning as understood by one of skill in the art in view of this disclosure. It refers to a polypeptide that is secreted by a host cell and can neutralize at least one microbial organism other than the individual host cell in which the polypeptide is made, including cells clonally related to the host cell and other microbial cells. “Bacteriocin” refers to naturally circular bacteriocins and naturally linear bacteriocins, unless indicated otherwise. A “circular bacteriocin” denotes a bacteriocin that is circularized when expressed from the natural host from which the bacteriocin is derived, or that is predicted to be circularized based on sequence of the bacterial genome, or that has been designed or engineered to be active when circularized. A “linear bacteriocin” denotes a bacteriocin that is linear (and does not get circularized) when expressed from the natural host from which the bacteriocin is derived, or that is predicted to be linear based on the genomic context, or that has been designed or engineered to be active when in linear form. “Bacteriocin” also encompasses a cell-free or chemically synthesized version of such a polypeptide, for example an engineered bacteriocin in accordance with some embodiments herein. A host cell can exert cytotoxic or growthinhibiting effects on one or a plurality of other microbial organisms by secreting bacteriocins.

[0043] “Circularized” and “cyclized” are used interchangeably and have their customary and ordinary meaning as understood by one of skill in the art in view of this disclosure, and are used to denote a polypeptide that has undergone head-to-tail circularization or cyclization of the peptide backbone, to form an amide bond between the N-terminal amino group and C-terminal carboxyl group of the polypeptide. “Linear” as used herein has its customary and ordinary meaning as understood by one of skill in the art in view of this disclosure, and denotes a polypeptide having a free (non-bonded) amino group at the N- tcrminus and/or a free (non-bonded) carboxyl group at the C-tcrminus.

[0044] As used herein “circularly permuted” denotes modification of a linear sequence of elements by shifting the position of the elements while preserving the position of each element relative to each other, where elements that are shifted past the first or last position in the linear sequence wrap around to the opposite end of the sequence. For example, circular permutation of the sequence “ABCDE” can result in any one of “BCDEA”, “CDEAB”, “DEABC”, and “EABCD”.

[0045] As used herein, the term "operably linked" has its customary and ordinary meaning as understood by one of skill in the art in view of this disclosure, and refers to a linkage of nucleic acid elements in a functional relationship. A nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid. For instance, a transcription regulatory sequence is operably linked to a coding sequence if it affects the transcription of the coding sequence. Operably linked means that the DNA sequences being linked are typically contiguous and, where necessary to join two protein encoding regions, contiguous and in reading frame.

[0046] The terms “protein” or “polypeptide” have their customary and ordinary meaning as understood by one of skill in the art in view of this disclosure, and are used interchangeably and refer to molecules consisting of a chain of amino acids, without reference to a specific mode of action, size, 3- dimensional structure or origin.

[0047] The term "gene" has its customary and ordinary meaning as understood by one of skill in the art in view of this disclosure, and means a DNA fragment comprising a region (transcribed region), which is transcribed into an RNA molecule (e.g. an mRNA) in a cell, operably linked to suitable regulatory regions (e.g. a promoter). A gene will usually comprise several operably linked fragments, such as a promoter, a 5' leader sequence, a coding region and a 3 '-nontranslated sequence (3'-end) e.g. comprising a polyadenylation- and/or transcription termination site.

[0048] In amino acid sequences as described herein, amino acids or “residues” are denoted by three-letter or one-letter symbols. These three-letter symbols as well as the corresponding one-letter symbols are well known to the person skilled in the art and have the following meaning: A (Ala) is alanine, C (Cys) is cysteine, D (Asp) is aspartic acid, E (Glu) is glutamic acid, F (Phe) is phenylalanine, G (Gly) is glycine, H (His) is histidine, I (lie) is isolcucinc, K (Lys) is lysine, L (Leu) is leucine, M (Met) is methionine, N (Asn) is asparagine, P (Pro) is proline, Q (Gin) is glutamine, R (Arg) is arginine, S (Ser) is serine, T (Thr) is threonine, V (Vai) is valine, W (Trp) is tryptophan, Y (Tyr) is tyrosine. A residue may be any proteinogenic amino acid, but also any non-proteinogenic amino acid such as D-amino acids and modified amino acids formed by post-translational modifications, and also any non-natural amino acid. As used herein “natural” and “non-natural” each has its ordinary and customary meaning as understood by one of ordinary skill in the art, in view of the present disclosure. A “natural” amino acid denotes an amino acid naturally occurring in nature. A “non-natural” amino acid denotes a non-genetically encoded amino acid, irrespective of whether it appears in nature or not. Non-natural amino acids that can be present in a peptidomimetic as described herein include: b-amino acids; p-acyl-L-phenylalanine; N-acetyl lysine; O-4-allyl-L-tyrosine; 2-aminoadipic acid; 3-aminoadipic acid; beta-alanine; 4-tert-butyl hydrogen 2-azidosuccinate; beta-aminopropionic acid; 2-aminobutyric acid; 4-aminobutyric acid; 2,4-diamino butyric acid; 6-aminocaproic acid; 2-aminoheptanoic acid; 2-aminoisobutyric acid; 3-aminoisobutyric acid; 2- aminopimelic acid; p-aminophenylalanine; 2,3-diaminobutyric acid; 2,3-diamino propionic acid; 2,2'-diaminopinnelic acid; p-amino-L-phenylalanine; p-azido-L- phenylalanine; D-allyl glycine; p-benzoyl-L-phenylalanine; 3 -benzo thienyl alanine p- bromophenylalanine; t-butylalanine; t-butylglycine; 4-chlorophenylalanine; cyclohexylalanine; cysteic acid; D-citrulline; thio-L-citrulline; desmosine; epsilon-amino hexanoic acid; N-ethylglycine; N-ethylasparagine; 2-fluorophenylalanine; 3- fluorophenylalanine; 4-fluorophenylalanine; homoarginine; homocysteine; homoserine; hydroxy lysine; alio-hydroxy lysine; 3-(3-methyl-4-nitrobenzyl)-L-histidine methyl ester; isodesmosine; allo-isoleucine; isopropyl-L-phenylalanine; 3- methyl-phenylalanine; N- methylglycine; N-methylisoleucine; 6-N-methyllysine; O-methyl-L-tyrosine; N-methylvaline; methionin sulfoxide; 2-napthylalanine; L-3-(2-naphthyl)alanine; isoserine; 3 -phenylserine; norvaline; norleucine; 5,5,5-trifluoro-DL-leucine; ornithine; 3-chloro-tyrosine; N5- carbamoylornithine; penicillamine; phenylglycine; piperidinic acid; pyridylalanine; 1 ,2,3,4- tetrahydro-isoquinoline-3 -carboxylic acid; beta-2-thienylalanine; y-carboxy-DL-glutamic acid; 4-fluoro-DL-glutamic acid; D-thyroxine; allo-threonine; 5-hydroxy-tryptophan; 5- methoxy-tryptophan; 5-fluoro-tryptophan; 3 -fluoro- valine. In some embodiments, a natural amino acid of a fusion polypeptide of the present disclosure is substituted by a corresponding non-natural amino acid. As used herein, a "corresponding non-natural amino acid" refers to a non-natural amino acid that is a derivative of the reference natural amino acid. For instance, a natural amino acid can be substituted by the corresponding beta-amino acid, which have their amino group bonded to the beta-carbon rather than the alpha carbon.

[0049] The terms “homology”, “sequence identity” and the like have their customary and ordinary meaning as understood by one of skill in the art in view of this disclosure, and are used interchangeably herein. Sequence identity is described herein as a relationship between two or more amino acid (polypeptide or protein) sequences or two or more nucleic acid (nucleic acid) sequences, as determined by comparing the sequences. In an embodiment, sequence identity is calculated based on the full length of two given sequences, including those identified by SEQ ID NO’s, or on a part thereof. Part thereof means at least 50%, 60%, 70%, 80%, 90%, or 100% of both SEQ ID NO’s. "Identity" also refers to the degree of sequence relatedness between amino acid or nucleic acid sequences, as determined by the match between strings of such sequences. "Identity" can be readily calculated by known methods, including but not limited to those described in Bioinformatics and the Cell: Modem Computational Approaches in Genomics, Proteomics and transcriptomics, Xia X., Springer International Publishing, New York, 2018; and Bioinformatics: Sequence and Genome Analysis, Mount D., Cold Spring Harbor Laboratory Press, New York, 2004.

[0050] “Sequence identity” can be determined by alignment of two peptide or two nucleotide sequences using global or local alignment algorithms, depending on the length of the two sequences. In embodiments, sequences of similar lengths are aligned using a global alignment algorithms (e.g. Needleman-Wunsch) which aligns the sequences optimally over the entire length, while sequences of substantially different lengths are aligned using a local alignment algorithm (e.g. Smith- Waterman). Sequences may then be referred to as "substantially identical” when they (when optimally aligned by for example the program EMBOSS needle or EMBOSS water using default parameters) share at least a certain minimal percentage of sequence identity (as described below).

[0051] A global alignment is suitably used to determine sequence identity when the two sequences have similar lengths. In embodiments, when sequences have a substantially different overall length, local alignments, such as those using the Smith- Waterman algorithm, can be used. EMBOSS needle uses the Needleman-Wunsch global alignment algorithm to align two sequences over their entire length (full length), maximizing the number of matches and minimizing the number of gaps. EMBOSS water uses the Smith- Waterman local alignment algorithm. Generally, the EMBOSS needle and EMBOSS water default parameters are used, with a gap open penalty = 10 (nucleotide sequences) I 10 (proteins) and gap extension penalty = 0.5 (nucleotide sequences) / 0.5 (proteins). For nucleotide sequences the default scoring matrix used is DNAfull and for proteins the default scoring matrix is Blosum62 (Henikoff & Henikoff, 1992, PNAS 89, 915-919).

[0052] Percentage identity may be determined by searching against public databases, using algorithms such as FASTA, BLAST, etc. Thus, the nucleic acid and protein sequences of some embodiments of the present disclosure can further be used as a “query sequence” to perform a search against public databases to, for example, identify other family members or related sequences. Such searches can be performed using the BLASTn and BLASTx programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be performed with the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences homologous to nucleic acid molecules of the present disclosure. BLAST protein searches can be performed with the BLASTx program, score = 50, wordlength = 3 to obtain amino acid sequences homologous to protein molecules of the present disclosure. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Res. 25(17): 3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., BLASTx and BLASTn) can be used. See the homepage of the National Center for Biotechnology Information accessible on the world wide web at ncbi.nlm.nih.gov/.

[0053] As used herein, “conservative” amino acid substitution has its customary and ordinary meaning as understood by one of skill in the art in view of this disclosure, and refers to the interchange ability of residues having similar side chains. For example, a group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and histidine; and a group of amino acids having sulphur-containing side chains is cysteine and methionine. Suitable conservative amino acids substitution groups include: valinc-lcucinc-isolcucinc, phcnylalaninc-tyrosinc, lysine-arginine, alanine-valine, and asparagine-glutamine. Substitutional variants of the amino acid sequence disclosed herein are those in which at least one residue in the disclosed sequences has been removed and a different residue inserted in its place. In some embodiments, the amino acid change is conservative. Suitable conservative substitutions for each of the naturally occurring amino acids include: Ala to ser; Arg to lys; Asn to gin or his; Asp to glu; Cys to ser or ala; Gin to asn; Glu to asp; Gly to pro; His to asn or gin; He to leu or val; Leu to ile or val; Lys to arg; gin or glu; Met to leu or ile; Phe to met, leu or tyr; Ser to thr; Thr to ser; Trp to tyr; Tyr to trp or phe; and, Val to ile or leu.

[0054] As used herein, "microbial organism", "microorganism" /‘microbial cell” or “microbial host” and variations of these root terms (such as pluralizations and the like) have their customary and ordinary meanings as understood by one of skill in the art in view of this disclosure, including any naturally-occurring species or synthetic or fully synthetic prokaryotic or eukaryotic unicellular organism. Thus, this expression can refer to cells of any of the three domains Bacteria, Archaea and Eukarya.

[0055] “Comprise” and its conjugations is used herein in its non-limiting sense to mean that items following the word are included, but items not specifically mentioned are not excluded. In addition, “consist of’ may be replaced by “consist essentially of’ meaning that a feature as described herein may comprise additional feature(s) than the ones specifically identified, said additional feature(s) not altering the unique characteristic of the described features.

[0056] Reference to an element by the indefinite article "a" or "an" does not exclude the possibility that more than one of the element is present, unless the context clearly requires that there be one and only one of the elements. The indefinite article "a" or "an" thus usually means "at least one".

[0057] As used herein, with "at least" a particular value means that particular value or more. For example, "at least 2" is understood to be the same as "2 or more" i.e., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, ..., etc.

[0058] The word “about” or “approximately” when used in association with a numerical value (e.g., about 10) means that the value may be the given value (e.g., 10) more or less 10 % of the value. As used herein, the term "and/or" indicates that one or more of the stated cases may occur, alone or in combination with at least one of the stated cases, up to with all of the stated cases.

[0059] Each embodiment as identified herein may be combined together unless otherwise indicated.

[0060] All patent applications, patents, and printed publications cited herein are incorporated herein by reference for at least the subject matter referenced and in their entireties, except for any definitions, subject matter disclaimers or disavowals, and except to the extent that the incorporated material is inconsistent with the express disclosure herein, in which case the language in this disclosure controls.

[0061] One skilled in the art will recognize many features similar or equivalent to those described herein, which could be used in the practice of the present subject matter. Indeed, the present disclosure is in no way limited to the features described.

POLYPEPTIDES

[0062] As discussed above, fusion polypeptides for generating circular bacteriocins are provided. With reference to Fig. 1, a schematic diagram of a fusion polypeptide of the present disclosure is provided. The fusion polypeptide can include an amino acid sequence 110 of a bacteriocin, which in some embodiments can be a mature sequence of the bacteriocin. A mature sequence typically includes a full sequence of the bacteriocin without the native signal peptide, leader sequence or other additional N- or C-terminal regulatory sequences (e.g., involved in processing and/or secretion). In some embodiments, the amino acid sequence is circularly permuted compared to the native mature sequence of the bacteriocin. The amino acid sequence 110 can be flanked by a split intein 121, 122 that is arranged such that the split intein circularizes the bacteriocin through cyclization of the peptide backbone. The amino acid sequence can be flanked at the N terminus by the C terminal intein fragment (“Ic”) 121 fused to the first amino acid residue 112 of the amino acid sequence 110 of the bacteriocin, and at the C-terminus by the N-terminal intein fragment (“IN”) 122 fused to the last amino acid residue 114 of the amino acid sequence 110 of the bacteriocin. Thus, when the amino acid sequence 110 of the bacteriocin is flanked at both the N- and C-termini by the split intein, the split intein mediates formation of a peptide bond between the first amino acid residue 112 and the last amino acid residue 114 of the amino acid sequence 110 of the hacteriocin, to generate the circularized hacteriocin 115. The intcin 125 after circularization can be cleaved from the circularized hacteriocin. In some embodiments, the N-terminal amino acid residue 112 of the hacteriocin is a serine (or cysteine) that is directly fused to the Ic. In some embodiments, the amino acid sequence 110 of the hacteriocin is modified (e.g., by circular permutation) from the native sequence (e.g., native mature sequence) such that the first amino acid residue in the sequence is a serine or cysteine, as further provided herein. In some embodiments, the intein 125 after circularization is removed via a C-terminal degradation tag.

[0063] In some embodiments, the fusion polypeptide includes an amino acid sequence of any suitable hacteriocin. In some embodiments, the amino acid sequence is that of a circular hacteriocin (e.g., a hacteriocin known to be circular as produced in a native context, a hacteriocin predicted to be circular based on the genomic context, a hacteriocin designed or engineered to be functional in circular form, etc.). In some embodiments, the hacteriocin has antimicrobial activity only when circularized. In some embodiments, the hacteriocin has substantial antimicrobial activity only when circularized. In some embodiments, the hacteriocin has antimicrobial activity when in linear form. In some embodiments, the hacteriocin has antimicrobial activity when circularized and when in linear form. In some embodiments, the hacteriocin has greater antimicrobial activity when circularized compared to when in linear form. In some embodiments, the hacteriocin has antimicrobial activity that is greater by at least about 10%, at least about 20%, at least about 30%, at least about 40%, at least about 50%, at least about 60%, at least about 70%, at least about 80%, at least about 90%, at least about 100%, at least about 120%, at least about 150%, or at least about 200% or more, or by a percentage in a range defined by any two of the preceding values, when circularized compared to when in linear form, for example, 10%- 200%, 10%-100%, 50%-200%, 50%-100%, 70%-200%, or 50%-150%.

[0064] The fusion polypeptide can include any suitable amino acid sequence of a hacteriocin or a variant thereof (such as a circularly permuted variant thereof as described herein). In some embodiments, the amino acid sequence is or is derived from a naturally occurring hacteriocin. In some embodiments, the amino acid sequence is a mature sequence of a hacteriocin, or a variant thereof (such as a circularly permuted variant thereof as described herein). In some embodiments, the amino acid sequence is an amino acid sequence of a bacteriocin without a native signal peptide sequence. Tn some embodiments, the amino acid sequence is an amino acid sequence of a bactcriocin without any signal peptide sequence. In some embodiments, the amino acid sequence does not include any sequences that would have been required in a native context for processing of the bacteriocin (e.g., intracellular processing, circularization). In some embodiments, the amino acid sequence of a bacteriocin is modified from the native sequence (e.g., native mature sequence) to promote circularization by the split intein. In some embodiments, the amino acid sequence of the bacteriocin includes as the first amino acid residue an amino acid that is preferred by the split intein for circularization. In some embodiments, the amino acid that is preferred by the split intein for circularization depends on the type of split intein in the fusion polypeptide. In some embodiments, the amino acid that is preferred by the split intein for circularization is a cysteine or serine. In some embodiments, the amino acid sequence of the bacteriocin includes as the first amino acid residue a cysteine or serine. In some embodiments, the native amino acid sequence of the bacteriocin is circularly permuted, as disclosed herein, such that a cysteine or serine that is present in the native amino acid sequence is the first amino acid residue of the amino acid sequence of the bacteriocin of the fusion polypeptide.

[0065] In some embodiments, the amino acid sequence of the bacteriocin in the fusion polypeptide is circularly permuted compared to the native amino acid sequence (e.g., native mature sequence) of the bacteriocin. As used herein “circularly permuted” denotes modification of a linear sequence of elements by shifting the position of the elements while preserving the position of each element relative to each other, where elements that are shifted past the first or last position in the linear sequence wrap around to the opposite end of the sequence. For example, circular permutation of the sequence “ABCDE” can result in any one of “BCDEA”, “CDEAB”, “DEABC”, and “EABCD”. In some embodiments, an amino acid residue that is not the N-terminal residue in the native amino acid sequence (e.g., native mature sequence) of the bacteriocin is the first amino acid residue of the circularly permuted amino acid sequence of the bacteriocin in the fusion polypeptide. In some embodiments, the first amino acid residue of the circularly permuted amino acid sequence of the bacteriocin in the fusion polypeptide is an amino acid that is preferred by the split intein to be the first amino acid for circularization. In some embodiments, the preferred amino acid is a cysteine or serine. In some embodiments, where the native amino acid sequence of the bacteriocin includes a cysteine or serine, the amino acid sequence of a bacteriocin is circularly permuted compared to the native amino acid sequence such that the native cysteine or serine is the first amino acid residue of the amino acid sequence of the bacteriocin. In some embodiments, where the native amino acid sequence of the bacteriocin includes a cysteine or serine, the amino acid sequence of a bacteriocin is circularly permuted compared to the native amino acid sequence such that the native cysteine or serine is the first amino acid residue of the amino acid sequence of the bacteriocin, and is directly fused to the Ic. As used herein in the context of a fusion polypeptide, an “amino acid sequence of a bacteriocin” is intended to include circularly permuted sequences of the bacteriocin relative to its native sequence.

[0066] In some embodiments, the first amino acid residue of the amino acid sequence of the bacteriocin of the fusion polypeptide is a non-native amino acid residue. As used herein, “non-native” has its ordinary and customary meaning as understood by one of ordinary skill in the art in view of the present disclosure, and denotes an amino acid that is not present in a native amino acid sequence, or a circularly permuted sequence thereof. In some embodiments, the first amino acid residue of the amino acid sequence of the bacteriocin of the fusion polypeptide is a non-native amino acid residue that is a preferred amino acid for circularization by the split intein. In some embodiments, the native amino acid sequence of the bacteriocin is modified by adding the non-native amino acid residue to the native sequence, or by substituting a native amino acid residue with a non-native amino acid residue. In some embodiments, the native amino acid sequence of the bacteriocin is modified by adding the amino acid preferred by the split intein to a N-terminal of the native sequence, or by substituting the first amino acid residue of the native sequence with the amino acid preferred by the split intein to provide the amino acid sequence in the fusion polypeptide. In some embodiments, the native amino acid sequence of the bacteriocin does not include the amino acid preferred by the split intein. In some embodiments, the length of the amino acid sequence of the bacteriocin is increased by one residue due to addition of the non-native amino acid, compared to the length of the native amino acid sequence of the bacteriocin.

[0067] In some embodiments, the cysteine or serine that is the first amino acid residue of the amino acid sequence of the bacteriocin is a cysteine or serine that is not present in the native amino acid sequence of the bacteriocin (or in a circularly permuted sequence thereof). In some embodiments, the native amino acid sequence of the bacteriocin is modified by adding a cysteine or serine residue to the native sequence, or by substituting a native amino acid residue that is not a cysteine or serine with a cysteine or serine. In some embodiments, the native amino acid sequence of the bacteriocin is modified by adding a N-terminal cysteine or serine to the native sequence, or by substituting the first amino acid residue of the native sequence with a cysteine or serine to provide the amino acid sequence in the fusion polypeptide. In some embodiments, the native amino acid sequence of the bacteriocin can be modified by inserting a cysteine or serine into the native sequence, or substituting an amino acid of the native sequence (other than the first N-terminal residue) with a cysteine or serine to provide the amino acid sequence in the fusion polypeptide, and circularly permuting the modified sequence, as disclosed herein, such that the non-native cysteine or serine is the first amino acid residue of the amino acid sequence of the bacteriocin in the fusion polypeptide. In some embodiments, the native amino acid sequence of the bacteriocin does not include a serine or cysteine. In some embodiments, the length of the amino acid sequence of the bacteriocin is increased by one residue due to the non-native serine or the non-native cysteine compared to the length of the native amino acid sequence of the bacteriocin.

[0068] The amino acid sequence of the bacteriocin can have any suitable length. In some embodiments, the amino acid sequence of the bacteriocin in the fusion polypeptide is 20-30, 30-40, 40-50, 50-60, 60-70, 70-80, 80-90, 90-100, 100-110, 110-120, 120-130, 130- 140, 140-150, 150-175, 175-200, 200-300, 300-400, 400-600, 600-800, 800-1000 amino acids long, or longer, or a length in a range defined by any two of the preceding values, for example 20-1000, 20-800, 20-600, 100-800, 20-150, 80-150, 40-130, 100-150, or 20-80 amino acids long.

[0069] Suitable amino acid sequences of a circular bacteriocin include, without limitation, any one of the sequences set forth in Table A. In some embodiments, the fusion polypeptide includes an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, at least 98%, at least 99%, or about 100%, or by a percentage in a range defined by any two of the preceding values (e.g., 70-100%, 70-90%, 75-95%, 80-90%, 90-98%), identical to any one of the sequences in Table A. In some embodiments, the fusion polypeptide includes an amino acid sequence of any one of the sequences set forth in Table A, having up to 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 amino acid substitutions thereto. In some embodiments, the fusion polypeptide includes an amino acid sequence of any one of the sequences set forth in Table A, having up to 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions thereto. In some embodiments, the fusion polypeptide includes an amino acid sequence of any one of the sequences set forth in Table A, having up to 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 amino acid additions or deletions thereto. In some embodiments, the fusion polypeptide includes an amino acid sequence of any one of the sequences set forth in Table A, having up to 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 amino acid substitutions, additions, and/or deletions thereto. In some embodiments, the fusion polypeptide includes an amino acid sequence of any one of the sequences set forth in Table A, having up to 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 amino acid conservative substitutions, additions, and/or deletions thereto. In some embodiments, the amino acid substitutions or additions include substitution with or addition of a non-natural amino acid. In some embodiments, the amino acid substitutions or additions include substitutions with or additions of natural amino acids only.

[0070] In Table A, each bacteriocin is represented by two amino acid sequences (except for Bacteriocin F9 from Staphylococcus felis, which is represented by three sequences), where for each bacteriocin entry, the native mature sequence is shown on the top and a modified form of the native mature sequence, each modified form having a serine as the first amino acid residue of the bacteriocin by circular permutation of the native sequence and/or insertion or substitution of a serine to the native sequence, is shown on the bottom. For example, for Amylocyclicin (Ale) from Bacillus amyloliquefaciens, the native mature sequence has the sequence:

LASTLGISTAAAKKAIDIIDAASTIASIISLIGIVTGAGAISYAIVATAKTMIKKY GKKYAAAW (SEQ ID NO: 751), and a circularly permuted form of the native mature sequence has the sequence: STAAAKKAIDIIDAASTIASIISLIGIVTGAGAISYAIVATAKTMIKKYGKKYAA AWLASTLGI (SEQ ID NO: 752).

In some embodiments, the fusion polypeptide includes an amino acid sequence of any one of the modified sequences (the bottom row of each bacteriocin entry) in Table A. In some embodiments, the fusion polypeptide includes an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, at least 98%, at least 99%, or about 100%, or by a percentage in a range defined by any two of the preceding values (e.g., 70-100%, 80-100%, 90-95%, 85-95%, or 95-99%), identical to any one of the modified sequences (the bottom row of each bactcriocin entry) in Tabic A. In some embodiments, the fusion polypeptide includes an amino acid sequence of any one of the modified sequences (the bottom row of each bacteriocin entry) in Table A, having up to 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 amino acid substitutions, additions, and/or deletions thereto. In some embodiments, the fusion polypeptide includes an amino acid sequence of any one of the modified sequences (the bottom row of each bacteriocin entry) in Table A, having up to 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1 conservative amino acid substitutions, additions, and/or deletions thereto. In some embodiments, the amino acid substitutions or additions include substitution with or addition of a non-natural amino acid. In some embodiments, the amino acid substitutions or additions include substitutions with or additions of natural amino acids only.

Table A

[0071] Other non-limiting examples of bacteriocins (e.g., linear bacteriocins) suitable for use in the present fusion polypeptides are set forth in the even numbered sequences of SEQ ID NOS: 4-450 and the odd numbered sequences of SEQ ID NOS: 699-737. Detailed descriptions of suitable bacteriocins can be found, for example, in U.S. Patent No. 9,333,227 and International Publication No. WO2019/046577, each of which is hereby incorporated by reference in its entirety. Some examples of suitable bacteriocins and categories of bacteriocins are taught in Tables 1.1 and 1.2 of U.S. Patent No. 9,333,227 and of International Publication No. WO2019/046577. The amino acid sequence of the bacteriocin in the fusion polypeptide can include a sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, at least 98%, at least 99%, or about 100%, or by a percentage in a range defined by any two of the preceding values (e.g., 70-100%, 70-90%, 75-95%, 80- 90%, 90-98%), identical to any of the amino acid sequences of bacteriocins disclosed herein, including those above. Any of the amino acid sequences of bacteriocins disclosed herein, including those above, can be modified by any suitable option as disclosed herein, such that a serine or cysteine (c.g., a native serine or cysteine) is the first amino acid of the bactcriocin sequence. In some embodiments, any of the amino acid sequences of bacteriocins disclosed herein, including those above, can be circularly permuted to place a serine or cysteine as the first amino acid of the bacteriocin sequence in the fusion polypeptide. In some embodiments, a serine or cysteine can be added to or can substitute a native amino acid in any of the amino acid sequences of bacteriocins disclosed herein, including those above, and optionally can be further be circularly permuted, to place a non-native serine or cysteine as the first amino acid of the bacteriocin sequence in the fusion polypeptide.

[0072] In some embodiments the bacteriocin in the present fusion polypeptide is an engineered bacteriocin, e.g., a polypeptide engineered to have antimicrobial activity when circularized. In some embodiments, the fusion polypeptide includes a non-natural amino acid in the amino acid sequence of the bacteriocin and/or split intein. In some embodiments, the fusion polypeptide includes 1, 2, 3, 4, 5, or more non-natural amino acids in the amino acid sequence of the bacteriocin and/or split intein. In some embodiments, 1, 2, 3, 4, 5, or more amino acids in the amino acid sequence of the bacteriocin and/or split intein of the fusion polypeptide is substituted with a corresponding non-natural amino acid.

[0073] The split intein of the present fusion polypeptide can be any suitable intein that can mediate circularization of a bacteriocin. In some embodiments, the split intein includes a C- and N-terminal intein fragments (Ic and IN, respectively) that flank the bacteriocin. In some embodiments, Ic is fused to the N-terminus of the amino acid sequence of the bacteriocin, and IN is fused to C-terminus of the amino acid sequence of the bacteriocin. In some embodiments, the split intein is a constitutively active split intein (e.g., a split intein that can circularize the bacteriocin under conditions in which the fusion polypeptide is expressed from a nucleic acid encoding same). In some embodiments, the split intein is a conditional split intein, e.g., a split intein that circularizes the bacteriocin under permissive conditions, and not under non-permissive conditions. In some embodiments, the split intein circularizes the bacteriocin under permissive conditions, and does not substantially circularize the bacteriocin under non-permissive conditions. In some embodiments, the split intein circularizes the bacteriocin preferentially or specifically under a permissive condition. In some embodiments, the split intein circularizes the bacteriocin under a permissive condition at a faster rate than under a non-permissive condition. In some embodiments, the split intein circularizes the bactcriocin under a permissive condition to a greater extent than under a non- permissive condition. In some embodiments, the conditional split intein is sensitive to pH, temperature, light stimulation, and/or a small molecule ligand. As used herein, “sensitive” has its customary and ordinary meaning as understood by one of skill in the art in view of this disclosure, and with reference to an environmental condition of a conditional split intein, denotes that the split intein’ s circularization activity (e.g., rate and/or extent thereof) is affected by the environmental condition to which the split intein is exposed.

[0074] In some embodiments, the split intein is configured to circularize the bacteriocin preferentially or specifically under a permissive pH (or pH range). In some embodiments, the split intein is configured to circularize the bacteriocin preferentially or specifically under a permissive temperature (or temperature range). In some embodiments, the split intein is pH-sensitive. In some embodiments, the split intein circularizes the bacteriocin at pH below a threshold pH or above a threshold pH, or within a pH range. In some embodiments, the threshold pH is less than 3.0, or about 3.0, 4.0, 4.5, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5,

7.6, 7.7, 7.8, 7.9, 8.0, 8.1, 8.2, 8.3, 8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.5, or about 10.0, or a pH value in a range defined by any two of the preceding values, for example pH 3-5, pH 4-7, pH 4-6, pH 5-6, pH 8-10, pH 8-9. In some embodiments, the pH range is bound by any two of the following pH values: 3.0, 4.0, 4.5, 5.0, 5.1, 5.2, 5.3, 5.4, 5.5, 5.6, 5.7, 5.8, 5.9, 6.0, 6.1, 6.2, 6.3, 6.4, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.1, 7.2, 7.3, 7.4, 7.5, 7.6, 7.7, 7.8, 7.9, 8.0, 8.1, 8.2, 8.3,

8.4, 8.5, 8.6, 8.7, 8.8, 8.9, 9.0, 9.5, 10.0. In some embodiments, the split intein is temperaturesensitive. In some embodiments, the split intein circularizes the bacteriocin at a temperature below or above a threshold temperature, or within a temperature range. In some embodiments, the threshold temperature is less than 15°C, or about 15°C, 16.0°C, 17.0°C, 18.0°C, 19.0°C,

20°C, 21°C, 22°C, 23°C, 24°C, 25°C, 26°C, 27°C, 28°C, 29°C, 30°C, 31°C, 32°C, 33°C, 34°C,

35°C, 36°C, 37°C, 38°C, 39°C, 40°C, 41°C, 42°C, 45°C, 50°C, 55°C, or about 60°C, or a temperature in a range defined by any two of the preceding values, for example 15-20°C, 15- 18°C, 30-40°C, 40-50°C, 25-35°C. In some embodiments, the temperature range is bound by any two of the following temperatures: 15°C, 16.0°C, 17.0°C, 18.0°C, 19.0°C, 20°C, 21°C, 22°C, 23°C, 24°C, 25°C, 26°C, 27°C, 28°C, 29°C, 30°C, 31°C, 32°C, 33°C, 34°C, 35°C, 36°C, 37°C, 38°C, 39°C, 40°C, 41 °C, 42°C, 45°C, 50°C, 55°C, 60°C. Tn some embodiments, the split intcin is configured to circularize the bactcriocin preferentially or specifically in the presence of a small molecule ligand. In some embodiments, the split intein is configured to circularize the bacteriocin preferentially or specifically by light stimulation. Non-limiting examples of suitable conditional inteins are disclosed in Di Ventura et al., (Biological Chemistry, vol. 400, no. 4, 2019, pp. 467-475), which is incorporated herein by reference in its entirety.

[0075] In some embodiments, the split intein is based on an intein from one of the following: Npu DnaE, See VMA, Ssp DnaE. In some embodiments, the split intein is a naturally split intein (e.g., is found as a split intein in the genome of the host microorganism). In some embodiments, the split intein is not a split intein in its native context, and is engineered to be a split intein. In some embodiments, the split intein is a constitutively active split intein (e.g., a split intein that can circularize the bacteriocin under conditions in which the fusion polypeptide is expressed from a nucleic acid encoding same) derived from any one of Npu DnaE, See VMA, Ssp DnaE. In some embodiments, the split intein is a conditional split intein derived from any one of Npu DnaE, See VMA, Ssp DnaE. In some embodiments, the split intein is derived from a Nostoc punctiforme (Npu) DnaE split intein. In some embodiments, the split intein includes C- and N-terminal intein fragments from the Npu DnaE split intein. In some embodiments, the Ic and IN include the respective amino acid sequences set forth in Table B. In some embodiments, the Ic and IN include an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, at least 98%, at least 99%, or about 100%, or by a percentage in a range defined by any two of the preceding values (e.g., 70-100%, 70-90%, 75-95%, 80-90%, 90-98%), identical to the respective sequences set forth in Table B.

Table B [0076] Tn some embodiments, the fusion polypeptide includes one or more additional functional sequences in addition to the bactcriocin flanked by the split intcin. In some embodiments, the fusion polypeptide includes a degradation tag, e.g., configured to degrade the intein after the bacteriocin is circularized and the intein cleaved from the circularized bacteriocin. In some embodiments, the fusion polypeptide includes a C-terminal degradation tag that is fused C-terminal to the N-terminal intein fragment, IN. In some embodiments, the split intein includes a C-terminal intein fragment (“Ic”) fused N-terminal to the amino acid sequence of the bacteriocin and a N-terminal intein fragment (“IN”) fused C- terminal to the amino acid sequence of the bacteriocin, where the polypeptide further includes a degradation tag C-terminal to the IN. The degradation tag can be any suitable peptide that can induce degradation of the split intein after the bacteriocin is circularized and the intein is cleaved from the circularized bacteriocin. In some embodiments, the degradation tag is an SsrA sequence. In some embodiments, the degradation tag includes the sequence: AANDENYALAA (SEQ ID NO: 873). In some embodiments, the degradation tag includes a sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, at least 98%, at least 99%, or about 100%, or by a percentage in a range defined by any two of the preceding values (e.g., 70-100%, 70-90%, 75-95%, 80-90%, 90-98%), identical to AANDENYALAA (SEQ ID NO: 873). In some embodiments, the degradation tag includes a sequence that differs from AANDENYALAA (SEQ ID NO: 873) by at most 4, 3, 2, or 1 amino acids (e.g., substitutions, additions, and/or deletions). In some embodiments, the degradation tag includes a sequence that differs from AANDENYALAA (SEQ ID NO:873) by at most 4, 3, 2, or 1 conservative amino acid substitutions, additions, and/or deletions. In some embodiments, the amino acid substitutions or additions include substitution with or addition of a non-natural amino acid. In some embodiments, the amino acid substitutions or additions include substitutions with or additions of natural amino acids only.

[0077] In some embodiments, the fusion polypeptide includes one or more affinity tags. In some embodiments, the affinity tag is associated with the split intein, which can facilitate purification of the fusion protein, but may be dissociated from the bacteriocin upon circularization. In some embodiments, the affinity tag is associated with the amino acid sequence of the bacteriocin, and may be incorporated into the circular bacteriocin. The affinity tag can be used to purify the circular bacteriocin after circularization. Optionally, one or more cleavage sites can be positioned between the affinity tag and the rest of the fusion polypeptide to facilitate removal of the affinity tag after affinity purification. Affinity tags can be used in purification, for example by contact with a molecule that binds the affinity tag immobilized on a solid phase, such as a bead. Example affinity tags suitable for fusion polypeptides of the present disclosure can comprise, consist essentially of, or consist of His-tags, glutathione-S- transferase (GST) tags, FLAG tags, strep tags, maltose binding protein (MBP), chitin binding protein (CBP), myc tags, HA tags, NE tags, and V5 tags, variants of any of these, or any combination of two or more of these. In some embodiments, the affinity tag is a chitin binding protein (CBP). In some embodiments, the fusion polypeptide or the circularized bacteriocin having a CBP affinity tag is purified using a chitin resin.

[0078] In some embodiments, the fusion polypeptide includes a signal peptide and/or leader sequence. In some embodiments, the signal peptide or leader sequence is configured to facilitate secretion of the fusion polypeptide from a microbial cell genetically engineered to express the fusion polypeptide, as disclosed herein. Any suitable signal peptide and/or leader sequence that may facilitate secretion of the fusion polypeptide or circular bacteriocin from the genetically engineered microbial cell may be used. In some embodiments, the fusion polypeptide further comprises a post-translational or co-translation modification, for example, glycosylation, acetylation, methylation, PEGylation, SUMOylation, ubiquitination, or two or more of any of these.

[0079] Also provided are compositions comprising the fusion polypeptide of the present disclosure. In some embodiments, the composition includes a physiologically compatible carrier, such as water or a buffer solution. In some embodiments, the fusion polypeptide is lyophilized in the composition. Also provided is a composition comprising a circular bacteriocin and a split intein. The circular bacteriocin can be any circular bacteriocin produced from the fusion polypeptide, including those described herein. In some embodiments, the split intein includes a C-terminal intein fragment (Ic) and an N-terminal intein fragment (IN). In some embodiments, the Ic and IN are associated with each other. The split intein can be any suitable split intein as provided herein. In some embodiments, the split intein further comprises a degradation tag. NUCLETC A IDS, VECTORS, GENETICALLY ENGINEERED MICROBIAL CE LS

[0080] Also provided herein is a nucleic acid that includes a nucleotide sequence encoding a fusion polypeptide as described herein. In some embodiments, the nucleic acid (e.g., DNA or RNA) includes regulatory elements that drive expression of the fusion polypeptide under suitable conditions. In some embodiments, the nucleic acid includes DNA. In some embodiments, the nucleic acid (e.g., DNA) includes regulatory elements (e.g., promoter) that drive transcription from the nucleic acid under suitable conditions (e.g., in vivo expression or in vitro transcription). In some embodiments, the nucleotide sequence is operably linked to a promoter sequence, e.g., in a DNA vector, as disclosed herein. Any suitable promoter sequence can be used to drive transcription from the nucleic acid. In some embodiments, the promoter sequence is one suitable for driving transcription from the nucleic acid in vitro (e.g., in an in vitro transcription solution). In some embodiments, the promoter sequence is one suitable for expressing the fusion polypeptide from the nucleic acid in vivo (e.g., in a microbial cell). In some embodiments, the promoter is a constitutive promoter. In some embodiments, the promoter is a conditionally active promoter, e.g., depending on the presence or absence of an environmental condition, chemical compound, gene product, stage of the cell cycle, or the like.

[0081] Non-limiting, example nucleic acids encoding some of these bacteriocins (e.g., linear bacteriocins) are set forth in the odd numbered sequences of SEQ ID NOs: 5-451 and the even numbered sequences of 700-738. Detailed descriptions of suitable bacteriocins and some polynucleotide sequences that encode bacteriocins, including methods and compositions for using bacteriocins to control the growth of microbial cells can be found, for example, in U.S. Patent No. 9,333,227 and International Publication No. WO2019/046577, each of which is hereby incorporated by reference in its entirety.

[0082] In some embodiments, the nucleic acid includes regulatory elements that drive translation of the fusion polypeptide from the nucleic acid under suitable conditions (e.g., in vivo expression or in vitro translation). In some embodiments, the nucleic acid is RNA. Generally, translation initiation for a particular transcript is regulated by particular sequences at or 5' of the 5' end of the coding sequence of a transcript. For example, a coding sequence can begin with a start codon configured to pair with an initiator tRNA. While naturally- occurring translation systems typically use Met (AUG) as a start codon, it will be readily appreciated that an initiator tRNA can be engineered to bind to any desired triplet or triplets, and accordingly, triplets other than AUG can also function as start codons in certain embodiments. Additionally, sequences near the start codon can facilitate ribosomal assembly, for example a Kozak sequence ((gcc)gccRccAUGG, SEQ ID NO: 542, in which R represents "A" or "G") or Internal Ribosome Entry Site (IRES) in typical eukaryotic translational systems, or a Shine-Delgamo sequence (GGAGGU, SEQ ID NO: 543) in typical prokaryotic translation systems. As such in some embodiments, a transcript comprising a "coding" nucleotide sequence of the present disclosure includes an appropriate start codon and translational initiation sequence. In some embodiments, for example if two or more "coding" nucleotide sequences are positioned in cis on a transcript, each nucleotide sequence includes an appropriate start codon and translational initiation sequence(s). In some embodiments, for example, if two or more "coding" nucleotide sequences are positioned in cis on a transcript, the two sequences are under control of a single translation initiation sequence, and either provide a single polypeptide that can function with both encoded polypeptides in cis, or provide a means for separating two polypeptides encoded in cis, for example a 2 A sequence or the like. In some embodiments, a translational initiator tRNA is regulatable, so as to regulate initiation of translation of a bacteriocin from the nucleic acid.

[0083] Also provided is a genetic vector that includes a nucleic acid of the present disclosure. Any suitable genetic vector can be used to include a nucleic acid having a nucleotide sequence encoding a fusion polypeptide as described herein. In some embodiments, the genetic vector is an expression vector. Suitable genetic vectors include, without limitation, plasmids, viruses (including bacteriophage), and transposable elements.

[0084] In some embodiments, a genetic vector can include one or more additional nucleotide sequences encoding a gene product of interest. In some embodiments, a genetic vector can include an additional nucleotide sequence encoding a gene product that confers resistance to the circularized bacteriocin in the microbial cell expressing the circularized bacteriocin from the genetic vector. In some embodiments, the gene product that confers resistance to the circularized bacteriocin is an immunity modulator. Any suitable immunity modulator can be encoded by the additional nucleotide sequence in the genetic vector. Suitable immunity modulators are provided, e.g., without limitation in U.S. Patent No. 9,333,227. In some embodiments, the genetic vector is configured to express the gene product of interest under suitable conditions. Tn some embodiments, a promoter in the genetic vector drives transcription from both the nucleotide sequence encoding the bactcriocin, and from the additional nucleotide sequence encoding the gene product that confers resistance to the circularized bacteriocin in the microbial cell expressing the circularized bacteriocin from the genetic vector. In some embodiments, expression of the nucleotide sequence encoding the bacteriocin and the additional nucleotide sequence encoding the gene product that confers resistance to the circularized bacteriocin are under the control of different promoters. In some embodiments, either one or both of the promoters controlling expression of the nucleotide sequence encoding the bacteriocin and the additional nucleotide sequence encoding the gene product that confers resistance to the circularized bacteriocin is a conditional promoter. In some embodiments, expression from a conditional promoter operably linked to the nucleotide sequence encoding the bacteriocin is regulated by different conditions compared to expression from a conditional promoter operably linked to the additional nucleotide sequence encoding the gene product that confers resistance to the circularized bacteriocin.

[0085] Also provided is a genetically engineered microbial cell that includes a nucleic acid of the present disclosure, or a genetic vector as provided herein. The microbial cell can be genetically engineered by any suitable option. In some embodiments, the microbial cell is transformed with the genetic vector of the present disclosure. In some embodiments, nucleic acid is stably integrated into a chromosome, or can be a self-replicating unit that is independent of the chromosome (e.g., as a plasmid, extrachromosomal array, episome, minichromosome, or the like). In some embodiments, plasmid conjugation can be used to introduce a desired plasmid from a "donor" microbial cell to a recipient microbial cell.

[0086] Any suitable microbial cell can be genetically engineered to include the nucleic acid or genetic vector of the present disclosure. In some embodiments, the microbial cell is one that does not naturally produce the bacteriocin encoded by the nucleic acid or genetic vector. In some embodiments, the microbial cell is one that does not encode the bacteriocin encoded by the nucleic acid or genetic vector in its genome endogenously. In some embodiments, the microbial cell is resistant to the bacteriocin. In some embodiments, the microbial cell expresses a gene product (e.g., an immunity modulator) that confers resistance to the bacteriocin. In some embodiments, the microbial cell is genetically engineered to expresses the gene product (e.g., an immunity modulator) that confers resistance to the bacteriocin. Tn some embodiments, expression of the immunity modulator from the second nucleic acid is rcgulatablc. In some embodiments, expression of the immunity modulator from the second nucleic acid is controlled by a conditional promoter.

[0087] Exemplary microbial cells that can be used in accordance with embodiments herein include, but are not limited to, bacteria, yeast, filamentous fungi, and algae, for example photosynthetic microalgae. Furthermore, fully synthetic microorganism genomes can be synthesized and transplanted into single microbial cells, to produce synthetic microorganisms capable of continuous self-replication (see Gibson et al. (2010), "Creation of a Bacterial Cell Controlled by a Chemically Synthesized Genome," Science 329: 52-56, which is incorporated herein by reference). As such, in some embodiments, the microbial cell is fully synthetic. A desired combination of genetic elements, including elements that regulate gene expression, and elements encoding gene products (for example immunity modulators, poison, antidote, and industrially useful molecules also called product of interest) can be assembled on a desired chassis into a partially or fully synthetic microbial cell. Description of genetically engineered microbial organisms for industrial applications can also be found in Wright, et al. (2013) "Building-in biosafety for synthetic biology" Microbiology 159: 1221-1235, incorporated herein by reference.

[0088] A variety of bacterial species and strains can be used in accordance with embodiments herein, and genetically modified variants, or synthetic bacteria based on a "chassis" of a known species can be provided. Exemplary bacteria (including those with industrially applicable characteristics), which can be used in accordance with embodiments herein include, but are not limited to, Bacillus species (for example Bacillus coagulans, Bacillus subtilis, and Bacillus licheniformis), Paenibacillus species, Streptomyces species, Micrococcus species, Corynebacterium species, Acetobacter species, Cyanobacteria species, Salmonella species, Rhodococcus species, Pseudomonas species, Lactobacillus species, Enterococcus species, Alcaligenes species, Klebsiella species, Paenibacillus species, Arlhrobacler species, Corynebacterium species, Brevibaclerium species, Thermus aquaticus, Pseudomonas stutzeri, Clostridium thermocellus, and Escherichia coli.

[0089] A variety of yeast species and strains can be used in accordance with embodiments herein, and genetically modified variants, or synthetic yeast based on a "chassis" of a known species can be provided. Exemplary yeast with industrially applicable characteri sties, which can be used in accordance with embodiments herein include, but are not limited to Saccharomyces species (for example, Saccharomyces cerevisiae, Saccharomyces bayanus, Saccharomyces boulardii). Candida species (for example, Candida utilis, Candida krusei), Schizosaccharomyces species (for example Schizosaccharomyces pombe, Schizosaccharomyces japonicus ), Pichia or Hansemda species (for example, Pichia pastoris or Hansemda polymorpha) species, and Bretanomyces species (for example, Bretanomyces claussenii).

[0090] A variety of algae species and strains can be used in accordance with embodiments herein, and genetically modified variants, or synthetic algae based on a "chassis" of a known species can be created. In some embodiments, the algae comprises, consists essentially of, or consists of photosynthetic microalgae. Exemplary algae species that can be useful for biofuels, and can be used in accordance with some embodiments herein, include Botryococcus braunii, Chlorella species, Dunaliella tertiolecta, Gracilaria species, Pleurochrysis carterae, and Sargassum species. Additionally, many algae can be useful for food products, fertilizer products, waste neutralization, environmental remediation, and carbohydrate manufacturing (for example, biofuels).

[0091] A variety of filamentous fungal species and strains can be used in accordance with embodiments herein, and genetically modified variants, or synthetic filamentous fungi based on a "chassis" of a known species can be provided. Exemplary filamentous fungi (including those with industrially applicable characteristics), which can be used in accordance with embodiments herein include, but are not limited to an Acremonium, Agaricus, Alternaria, Aspergillus, Aureobasidium, Botryospaeria, Ceriporiopsis, Chaetomidium, Chrysosporium, Claviceps, Cochliobolus, Coprinopsis, Coptotermes, Corynascus, Cryphonectria, Cryptococcus, Diplodia, Exidia, Filibasidium, Fusarium, Gibberella, Holomastigotoides, Humicola, Irpex, Lentinula, Leptospaeria, Magnaporthe, Melanocarpus, Meripilus, Mucor, Myceliophthora, Neocaffimastix, Neurospora, Paecilomyces, Peniciffium, Phanerochaele, Piromyces, Poitrasia, Pseudopleclania, Pseudotrichonympha, Rhizomucor, Schizophyllum , Scytalidium, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trichoderma, Trichophaea, Verticillium, Volvariella, or Xylaria. In some embodiments, filamentous fungus species include, without limitation, Acremonium cellulolyticus, Aspergillus aculeatus, Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenaturn, Humicola grisea, Humicola insolens, Humicola lanuginosa, Irpex lacteus, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium funiculosum, Penicillium purpurogenum, Phanerochaete chrysosporium, Thielavia achromatica, Thielavia albomyces, Thielavia albopilosa, Thielavia australeinsis , Thielavia fimeti, Thielavia microspora, Thielavia ovispora, Thielavia peruviana, Thielavia setosa, Thielavia spededonium, Thielavia subthermophila, Thielavia terrestris, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride.

[0092] Also provided is a library that includes the nucleic acids or genetic vectors of the present disclosure. The library in some embodiments finds use in screening circular bacteriocins whose antimicrobial activity has not been characterized, or for screening circular bacteriocins from different strains for one having a desired antimicrobial activity. In some embodiments, the library finds use in screening different variants of a circular bacteriocin for desired or altered activity. In some embodiments, at least two of the genetic vectors in the library include nucleotide sequences encoding different bacteriocins. The bacteriocins encoded by the genetic vectors of the library can differ in any suitable manner. In some embodiments, the library is a mutational library that includes sequence variants of a bacteriocin that has one or more mutations compared to a parent sequence. The mutations in the sequence variants can include random mutations, in some embodiments. In some embodiments, the mutations in the sequence variants can include targeted mutations. The library can include, in some embodiments, sequence variants that would abolish or abrogate circularization of the bacteriocin in a native context. In some embodiments, the parent bacteriocin is a natively circular bacteriocin, and the sequence variants include a first variant that abrogates natural circularization of the parent bacteriocin.

[0093] In some embodiments, the library includes bacteriocins from different strains or species of microbial organisms (e.g., bacteria). In some embodiments, the library includes different previously uncharacterized bacteriocins, e.g., bacteriocins predicted based on sequence alone or bacteriocins for which antimicrobial activity has not been observed. In some embodiments, the library includes different bacteriocins known to have antimicrobial activity.

[0094] The library can include any suitable number of variants. In some embodiments, the library includes at least about 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 120, 140, 160, 180, 200, 300, 400, 500, 600, 700, 800, 900, 10 3 , 10 4 , 10 5 , 10 6 , 10 7 , 10 8 , 10 9 or more variants, or a number of variants in a range defined by any two of the preceding values.

METHODS

[0095] Also provided are methods of making a circular bacteriocin (which for convenience may be referred to as production methods). In some embodiments, the method includes contacting a nucleic acid having a nucleotide sequence that encodes a fusion polypeptide as described herein with an in vitro expression system under conditions sufficient to produce a circular bacteriocin. In some embodiments, the in vitro expression system is a cell-free transcription/translation solution. In some embodiments, use of the in vitro expression system allows expression of a circular bacteriocin from a nucleic acid encoding the same, where the nucleic acid cannot be expressed in vivo, e.g., from a microbial cell genetically engineered with the nucleic acid. Without being bound by theory, in some cases, toxicity of the nucleic acid or the gene product encoded therein can prevent expression of the gene product from the nucleic acid.

[0096] Any suitable cell-free expression system can be used to transcribe and/or translate nucleic acids in vitro. In some embodiments, the in vitro expression system comprises, consists of, or consists essentially of cell extracts. In some embodiments, the in vitro expression system comprises an RNA polymerase, ribosomes, tRNAs (and the corresponding amino acids), an energy source, and enzymatic cofactors. In some embodiments, the in vitro expression system can further comprise enzymes for co- or post-translational modification, and/or cellular components that mediate protein folding such as heat shock proteins. By way of example, in embodiments in which the nucleic acid comprises RNA transcripts encoding the fusion polypeptide, it is contemplated that an in vitro expression system comprising, consisting essentially of, or consisting of a translation solution is sufficient (since it will be understood that the RNA is already a transcript). By way of example, in embodiments in which the nucleic acid comprises DNA encoding the fusion polypeptide, it is contemplated that an in vitro expression system comprises a transcription solution (for transcribing the DNAs into RNAs) and a translation solution (for translating the RNAs into polypeptides). In some embodiments, the transcription and translation solutions are together in a single solution (e.g., components of the transcription solution and translation solution are distributed evenly within the same volume). In some embodiments, the transcription and translation solutions are in separate solutions, for example in vesicles suspended in a single solution, and/or in separate solutions that are applied sequentially, and/or in separate compartments. In some embodiments, components of the in vitro transcription/translation solution are lyophilized, and configured to be reconstituted into the in vitro transcription/translation solution upon the addition of water. In some embodiments, the in vitro transcription/translation solution is reconstituted by adding water to lyophilized components.

[0097] Translation solutions can be useful for translating the nucleic acids as provided herein. Suitable translation solutions can comprise, consist essentially of, or consist of reagents for in vitro translation (which, for convenience, may be referred to herein as "translation reagents"), and as such can be configured for in vitro translation of a transcript such as an RNA. Some embodiments include a transcription solution comprising reagents for transcription (which, for convenience, may be referred to herein as "transcription reagents"), and thus is configured for in vitro transcription and translation, for example to transcribe and translate the nucleic acid encoding fusion polypeptides as provided herein. It is contemplated that in vitro transcription and translation in a single solution (such as a transcription solution further comprising a translation solution as described herein) can facilitate efficient in vitro production of fusion polypeptides of some embodiments. Thus, in accordance with some embodiments described herein, the in vitro expression system comprises an in vitro transcription reagent and/or an in vitro translation reagent. [0098] Tn accordance with some embodiments described herein, the translation solution comprises, consists essentially of, or consists of one or more translation reagents or in vitro translation reagents. Examples of translation reagents include, but are not limited to, a ribosome, a buffer, an amino acid, a tRNA (which may be conjugated to an amino acid), a lysate or extract such as an E. coli lysate or E. coli extract, and a cofactor or metallic ion such as Mg 2+ , or a combination of two or more of any of the listed items. In accordance with some embodiments described herein the translation solution further comprises a transcription reagents, and thus is configured for in vitro transcription and translation. As described herein, a transcription solution further comprising translation reagents contemplates a single solution that is suitable for in vitro transcription and translation. As such, a transcription solution further comprising translation reagents encompasses a single transcription/translation solution. It will be appreciated that some components of a transcription and/or translation solution, for example ribosomes, may not be liquids, and could potentially be isolated from the transcription and/or translation solution, for example by filtration and/or centrifugation.

[0099] In some embodiments, the translation solution comprises a post- translational modification enzyme. Examples of post-translational modification enzymes include, but are not limited to a cleavage enzyme, a kinase, a phosphatase, a giycosyltransferase, or a mixture of any two of the listed items.

[0100] Transcription solutions of some embodiments described herein (and which can be comprised by translation solutions as described herein) can comprise, consist essentially or, or consist of one or more transcription reagents. Examples of transcription reagents include an RNA polymerase, a buffer, a nucleic acid mix (for example, NTPs including ATP, GTP, CTP, and UTP), a cofactor or metallic ion such as Mg 2+ , a transcription inducer (such as a transcription factor, IPTG, or lactose), a polyadenylation enzyme, a capping enzyme, a lysate or extract such as a bacterial lysate or extract such as an E. coli lysate or E. coli extract, an SP6 polymerase, a T3 polymerase, a T7 RNA polymerase, or a mixture of two or more of any of the listed items. The transcription solution can be useful for transcribing a template, such as a candidate nucleic acid as described herein. Translation solutions of some embodiments include one or more transcription reagents in combination with one or more translation reagents.

[0101] The in vitro expression system can be provided in any suitable volume. In accordance with some embodiments described herein, the in vitro expression system is provided in a volume of 1 pl - 1000 pl, 1 pl - 50 pl, 1 pl - 500 pl, 1 pl - 900 pl, 50 pl - 100 pl, 50 pl - 500 pl, 50 pl -1000 pl, 100 pl - 200 pl, 100 pl - 500 pl, 100 pl - 1000 pl, 200 pl - 500 pl, 200 pl - 1000 pl, 500 pl - 900 pl, 500 pl - 1000 pl, 1ml - 2 ml, 3 ml - 5 ml, 5 ml- 10 ml, 10 ml - 20 ml, 20 ml - 50 ml, 50 ml - 100 ml, or more.

[0102] In accordance with some embodiments described herein, the in vitro transcription/translation solution is lyophilized. In some embodiments, the in vitro transcription/translation solution is configured be reconstituted in a solution such as water.

[0103] The contacting can be carried out for any suitable amount of time. In some embodiment, the contacting is done for at least about 10 minutes, at least about 20 minutes, at least about 30 minutes, at least about 45 minutes, at least about 60 minutes, at least about 1.5 hours, at least about 2 hours, at least about 3 hours, at least about 4 hours, at least about 6 hours, at least about 8 hours, at least about 10 hours, at least about 12 hours, at least about 16 hours, at least about 20 hours, at least about 24 hours, at least about 2 days, at least about 3 days, or more, or by a duration within a range defined by any two of the preceding time periods, for example 10-60 minutes, 1 hour-24 hours, 1-12 hours, 24-48 hours, 1-3 days.

[0104] In any production methods, in some embodiments, the method includes culturing a microbial cell genetically engineered with a nucleic acid or genetic vector encoding a fusion polypeptide as described herein under conditions sufficient to produce a circular bacteriocin. In some embodiments, the method includes culturing a second microbial cell in conjunction with the microbial cell genetically engineered with the nucleic acid or genetic vector encoding the fusion polypeptide. In some embodiments, the second microbial cell is an industrially useful microbial cell that is resistant to the circular bacteriocin.

[0105] In any production methods, in some embodiments, the method includes purifying the circular bacteriocin. In some embodiments, the circular bacteriocin is purified from the in vitro expression system. In some embodiments, the circular bacteriocin is purified after culturing the microbial cell genetically engineered with a nucleic acid or genetic vector encoding a fusion polypeptide as described herein. Any suitable option can be used to purify the circular bacteriocin. In some embodiments, the method includes purifying the fusion polypeptide, e.g., using an affinity tag associated therewith. In some embodiments, where the fusion polypeptide or the circular bacteriocin includes an affinity tag, the circular bacteriocin can be purified by contacting the fusion polypeptide or the circular bacteriocin with a support (e.g., a column, a bead, etc.) having a binding agent attached thereto, where the binding agent binds the affinity tag, and eluting the bound fusion polypeptide or circular bactcriocin. Any suitable affinity tag, such as those disclosed herein, can be used to purify the circular bacteriocin and/or the fusion polypeptide. In some embodiments, the affinity tag is CBP. In some embodiments, the affinity tag is CBP and purifying the circular bacteriocin and/or the fusion polypeptide includes using a chitin resin.

[0106] The contacting or culturing can be done under any suitable condition for producing the circular bacteriocin by the in vitro expression system or the genetically engineered microbial cell. In some embodiments, contacting the nucleic acid with the in vitro expression system involves incubating the nucleic acid in a transcription and/or translation solution at a suitable temperature. In some embodiments, the contacting is done at room temperature. In some embodiments, the contacting is done at less than 15°C, or about 15°C, about 18°C, about 20°C, about 22°C, about 25°C, about 27°C, about 30°C, about 34°C, about 36°C, about 38°C, or about 40°C, or higher, or at a temperature in a range defined by any two of the preceding values. In some embodiments, the culturing is done at a temperature suitable for growth of the genetically engineered microbial cell. In some embodiments, the culturing is done at less than 15°C, or about 15°C, about 18°C, about 20°C, about 22°C, about 25°C, about 27°C, about 30°C, about 34°C, about 36°C, about 38°C, or about 40°C, or higher, or at a temperature in a range defined by any two of the preceding values.

[0107] In any production methods, in some embodiments, contacting the nucleic acid with the in vitro expression system involves incubating the nucleic acid in a transcription and/or translation solution at a suitable pH. In some embodiments, the contacting is done at a pH of less than 3.0, or about 3.0, about 4.0, about 4.5, about 5.0, about 5.5, about 6.0, about 6.5, about 6.7, about 7.0, about 7.2, about 7.5, about 8.0, about 8.5, about 9.0, or about 10.0, or higher, or at a pH in a range defined by any two of the preceding values. In some embodiments, the culturing is done at a pH suitable for growth of the genetically engineered microbial cell. In some embodiments, the culturing is done at a pH of less than 3.0, or about 3.0, about 4.0, about 4.5, about 5.0, about 5.5, about 6.0, about 6.5, about 6.7, about 7.0, about 7.2, about 7.5, about 8.0, about 8.5, about 9.0, or about 10.0, or higher, or at a pH in a range defined by any two of the preceding values. [0108] Tn any production methods, in some embodiments, where the split intein is a conditional intein, the method includes exposing the fusion polypeptide to the permissive condition, following exposure to the non-permissive condition, to induce circularization of the bacteriocin. In some embodiments, the method further includes modifying the temperature during (or after) the contacting or culturing. In some embodiments, where the split intein is temperature- sensitive, the method includes modifying the temperature, from a non-permissive temperature to a permissive temperature, or vice versa. In some embodiments, the method further includes shifting the pH during (or after) the contacting or culturing. In some embodiments, where the split intein is pH-sensitive, the method includes shifting the pH, from a pH which is non-permissive for circularization to a pH permissive for circularization, or vice versa.

[0109] Also provided are methods of screening (which for convenience can be referred to as screening methods). With reference the Fig. 5, the method 500 can include providing a library of nucleic acids or genetic vectors of the present disclosure at block 510. The method can further include expressing a plurality of polypeptides encoded by one of more genetic vectors of the library, at block 520. The method can also include generating a plurality of circular bacteriocins from the plurality of expressed polypeptides, at block 530. Further, the method can include, at block 540, assaying the plurality of circular bacteriocins for a desired activity. The desired activity can be any suitable activity of the circular bacteriocins. In some embodiments, the desired activity is a change in activity relative to a reference, e.g., relative to the activity of a parent bacteriocin when screening a mutational library, or relative to a standard level of activity. In some embodiments, the desired activity is identification of an activity where none or substantially none was known previously, e.g., identifying a bacteriocin that is effective against a microbial species by screening a library of uncharacterized and/or predicted bacteriocins.

[0110] In any screening methods, in some embodiments, the desired activity includes antimicrobial activity. In some embodiments, the desired activity includes an increased antimicrobial activity, e.g., compared to the parent bacteriocin, against one or more microorganisms. In some embodiments, the desired activity includes antimicrobial activity against a specific species or strain of microorganism. In some embodiments, the desired activity includes resistance to degradation, such as, but not limited to, protease, heat, or pH degradation.

[0111] Also provided is a method of controlling the growth of a microorganism (which may be referred to herein as growth-controlling methods). In some embodiments, the method includes contacting a composition (e.g., culture medium, feedstock, a microbiome, etc.) that includes a microorganism (e.g., an undesirable microorganism) with the genetically engineered microbial cell of the present disclosure under conditions sufficient for the genetically engineered microbial cell to produce the circular bacteriocin, to inhibit or slow the growth of the microorganism. In some embodiments, the method includes contacting a composition (e.g., culture medium, feedstock, a microbiome, etc.) that is conducive to supporting the growth of a microorganism (e.g., an undesirable microorganism) with the genetically engineered microbial cell of the present disclosure under conditions sufficient for the genetically engineered microbial cell to produce the circular bacteriocin, to prevent the growth or delay the appearance of the microorganism in the composition.

[0112] In some embodiments, the method includes contacting a composition (e.g., culture medium, feedstock, a microbiome, etc.) that includes a microorganism (e.g., an undesirable microorganism) with a circular bacteriocin made by a production method, as disclosed herein, to inhibit or slow the growth of the microorganism. In some embodiments, the method includes contacting a composition (e.g., culture medium, feedstock, a microbiome, etc.) that is conducive to supporting the growth of a microorganism (e.g., an undesirable microorganism) with a circular bacteriocin made by a production method, as disclosed herein, to prevent the growth or delay the appearance of the microorganism in the composition. In some embodiments, the method includes contacting a composition (e.g., culture medium, feedstock, a microbiome, etc.) that includes a microorganism (e.g., an undesirable microorganism) with a fusion polypeptide, as disclosed herein, to inhibit or slow the growth of the microorganism. In some embodiments, the method includes contacting a composition (e.g., culture medium, feedstock, a microbiome, etc.) that is conducive to supporting the growth of a microorganism (e.g., an undesirable microorganism) with a fusion polypeptide, as disclosed herein, to prevent the growth or delay the appearance of the microorganism in the composition. [0113] Tn any growth-controlling methods, in some embodiments, the microorganism targeted by the circular bactcriocin is any suitable microorganism for which restricting or preventing growth is desirable. In some embodiments, the microorganism is a bacteria. In some embodiments, the microorganism is a pathogenic microorganism.

[0114] In any growth-controlling methods, in some embodiments, the composition can be associated with any environment in which controlling the growth of microorganisms is desired. In some embodiments, the composition includes, without limitation, a culture medium, feedstock, or a microbiome. The microbiome can include any suitable collection of microorganisms associated with an environment. In some embodiments, the microbiome includes that of an animal, a human organ, a plant, a plant root, and/or soil. In some embodiments, the microbiome includes that of a subject, such as a skin, gut, gastrointestinal tract, mammary gland, placenta, tissue, biofluid, seminal fluid, uterus, vagina, ovarian follicle, lung, saliva, oral cavity, mucosa, conjunctiva, or biliary tract. In some embodiments, the composition is associated with a commercially relevant environment, such as, without limitation, an industrial feedstock, or in a fermenter, or in a food, pharmaceutical, or cosmetic manufacturing environment.

[0115] In any growth-controlling methods, in some embodiments, where the split intein is a conditional intein, the method includes exposing the fusion polypeptide to the permissive condition, following exposure to the non-permissive condition, to induce circularization of the bacteriocin. In any growth-controlling methods, in some embodiments, the method includes modifying the pH or temperature of the composition to induce circularization of the bacteriocin, where the split intein is pH- or temperature-sensitive, respectively, as disclosed herein. In some embodiments, the method includes modifying the temperature of the composition from a non-permissive temperature or pH to a permissive temperature or pH, respectively, to induce circularization of the bacteriocin.

[0116] Also provided is a method of designing a nucleic acid encoding a polypeptide precursor of a bacteriocin (which for convenience may be referred to as designing methods). The method can include identifying a native amino acid sequence of a candidate bacteriocin, wherein the native amino acid sequence does not comprise a serine or cysteine at the N-terminus; providing a second amino acid sequence having a serine or cysteine at the N- terminus thereof by at least one of: circularly permuting the native amino acid sequence; or introducing a non-native serine or cysteine to the native amino acid sequence; providing a nucleotide sequence encoding a polypeptide comprising the second amino acid sequence flanked at both the N- and C-termini by a split intein configured to circularize the bacteriocin; and expressing the polypeptide encoded by the nucleotide sequence. In some embodiments, the candidate bacteriocin is a bacteriocin that is predicted to be a circular bacteriocin, e.g., based on the sequence of the bacteriocin or the genomic context. The polypeptide comprising the second amino acid sequence flanked at both the N- and C-termini by a split intein configured to circularize the bacteriocin can be any suitable polypeptide, e.g., a fusion polypeptide as disclosed herein. In some embodiments, the candidate bacteriocin is one that is predicted to be a circular bacteriocin based on a genomic sequence of a microorganism that encodes the candidate bacteriocin in its genome. In some embodiments, introducing a nonnative serine or cysteine to the native amino acid sequence includes substituting a native amino acid residue with a serine or cysteine, or adding or inserting a serine or cysteine to the native amino acid sequence.

[0117] In some embodiments, the nucleic acids encoding a polypeptide precursor of a bacteriocin finds use in generating a library of candidate bacteriocins for screening. In any designing methods, in some embodiments, the method includes: identifying a plurality of native amino acid sequences of a plurality of different candidate bacteriocins; for each of the plurality of native amino acid sequences: providing the second amino acid sequence; and providing the nucleotide sequence encoding a polypeptide comprising the second amino acid sequence flanked at both the N- and C-termini by a split intein configured to circularize the bacteriocin. In some embodiments, nucleic acids or genetic vectors that include the nucleotide sequences encoding the polypeptide can be provided in any suitable library, including a library as disclosed herein.

[0118] In some embodiments, the polypeptide further includes a degradation tag as disclosed herein. In some embodiments, the polypeptide further comprises a signal peptide and/or leader sequence as disclosed herein.

[0119] The split intein can be any suitable split intein as described herein. Expressing the polypeptide encoded by the nucleotide sequence can be done using any suitable option. In some embodiments, the polypeptide encoded by the nucleotide sequence is expressed in an in vitro expression system, as provide herein. In some embodiments, the polypeptide encoded by the nucleotide sequence is expressed by a microbial cell genetically engineered with a nucleic acid having the nucleotide sequence.

KITS

[0120] Also provided is a kit for generating a circular bacteriocin. In some embodiments, the kit includes a fusion polypeptide of the present disclosure. In some embodiments, the kit includes: a lyophilized composition of a fusion polypeptide of the present disclosure; and a liquid (e.g., water or buffer) for reconstituting the lyophilized composition. In some embodiments, the kit includes a panel of fusion polypeptides as disclosed herein having different bacteriocin sequences. In some embodiments, the kit includes a nucleic acid or genetic vector that encodes the fusion polypeptide as disclosed herein. In some embodiments, the kit includes a library of nucleic acids or genetic vectors encoding a plurality of fusion polypeptides as disclosed herein having different bacteriocin sequences. In some embodiments, the kit includes: a nucleic acid or genetic vector that encodes the fusion polypeptide as disclosed herein; and an in vitro transcription solution (or one or more components thereof), or an in vitro transcription solution (or one or more components thereof) and an in vitro translation solution (or one or more components thereof). In some embodiments, the kit includes a microbial cell genetically engineered with the nucleic acid or genetic vector that encodes the fusion polypeptide, as disclosed herein. In some embodiments, the kit includes an indicator strain of microorganism that is known to be susceptible to the circular bacteriocin generated by the kit. In some embodiments, the kit further comprises instructions for generating the circular bacteriocin from the fusion polypeptide, nucleic acid, genetic vector, or genetically engineered microbial cell.

ADDITIONAL EMBODIMENTS

[0121] Examples 1-3 below demonstrate circularization of bacteriocins from a fusion polypeptide that includes the bacteriocin flanked by a split intein. Circular bacteriocins are promising groups of antimicrobial peptides for industrial applications due to their higher stability compared to their linear counterparts. These peptides are, in general, more resistant to proteolytic enzymes and able to retain their full activity at different pH or temperatures. Until now, circular bacteriocins remained as a quite selective group with just 20 candidates discovered and fully characterized. Thanks to the recent advances in the omics technologies, especially in genomic sequencing, a high number of previously undcscribcd putatively circular bacteriocin clusters have been recently identified in different genomes of Gram positive bacteria from general databases, showing that these peptides are probably more prevalent in nature than previously appreciated. However, in some cases, in the absence of the producer strains it is difficult to predict the real functionality of these genes/clusters found.

[0122] Synthetic biology tools can be used for the fast and easy production of bacteriocins. Using cell-free protein synthesis (CFPS) techniques a collection of different mature bacteriocins, both fully characterized as hypothetical non-characterized candidates can be produced [13] . With just one single gene coding for the mature peptide (without the signal peptide or leader sequence), CFPS allows the production of different active bacteriocins outside the bacterial host in less than 4 hours. Due to the fact that protein production occurs outside the cell, there is no need for further leader sequence/signal peptide cleavage nor transport from dedicated proteins. Although this technique is suitable for the production of non-modified class II bacteriocins, production of bacteriocins requiring post-translational modifications (class I) may not be as efficient, without the activity of other dedicated proteins involved in the maturation of the peptides in the native bacterial host. Circular bacteriocins can be included in this last group, where several proteins are known to be involved in maturation (cleavage/circularization) and secretion outside the cell via different dedicated transporter systems in the native bacterial host.

[0123] Examples 1-3 below show that circularization of bacteriocins using split- inteins. Inteins can be used with tags for column purification or protein degradation. Split- intein mediated circular ligation of peptides and proteins (SICLOPPS) that incorporates different improvements, such as the use of an intein from Nostoc punctiforme (Npu), which is faster and also significantly more tolerant of amino acid diversity in the extein sequence and also a Ssra sequence in C-terminus to reduce the toxic effects of Npu by directing the Ssra- tagged protein to the ClpXP machinery for degradation [16] were used to circularize bacteriocins.

[0124] SICLOPPS was tested with Garvicin ML, a known circular bacteriocin from Lactococus garvieae DCC43, a strain isolated from Mallard Ducks [6]. It has been demonstrated that splicing with Npu intein is more efficient when a Cys or a Ser are put in position +1. Ser32 was selected over the other two serines present in GarML, but the other serines could have been chosen. As for other bactcriocins produced by CFPS, the recombinant gene was put under the control of a T7 promoter and terminator sequence [13]. After in vitro production antimicrobial activity was only observed when using the intact SICLOPPS construct, but not in the construct not carrying inteins (linear GarML) or in the construct with two point mutations in the inteins (preventing splicing). A higher activity was observed when the reaction was left overnight at room temperature. Splicing by Npu has been estimated to occur within the first 30 to 60 seconds, therefore the higher activity observed after 24h is not likely to be explained by higher percentage of splicing over time. Instead this higher activity could be due to a higher production of recombinant protein by the in vitro reaction. It has been observed that in some cases, leaving the in vitro reaction longer than 2 hours, can benefit protein production. These results demonstrated that SICLOPPS was working and GarML had been produced and circularized spontaneously without any further accessory protein. In order to confirm the presence of spliced GarML after CFPS a MS analysis of the reaction was run, and a peak with a mass of 6004 Da, corresponding to the exact mass of original GarML, was observed, confirming correct splicing of the inteins from GarML.

[0125] Correct circularization of GarML was further confirmed by bringing the SICLOPPS construct to E. coli and producing recombinant GarML in vivo for higher scale protein production. After protein production and purification a single FPLC fraction with antimicrobial activity was obtained. MS analysis of this fractions revealed again a mass of 6004 Da, thus confirming the results previously obtained with the CFPS method and showing that splicing was also taking place in E. coli. Moreover, trypsin digestion of the purified peptide and LC-MS/MS analysis of the fragments obtained revealed the presence of 6 peptides covering the 100% of mature GarML. The presence in one of the peptide (TIVNAVSAGMDIATALSLFSGAFTAAGGIMALIK) of SI and F60 linked together confirmed correct circularization of GarML in the selected location. These results show that GarML produced with SICLOPPS system not only circularizes and has the exact same sequence as GarML produced by its native producer, but that it most likely adopts the same three-dimensional conformation. This was also observed during the construction of the PARAGEN collection, where different pediocin-like bactcriocins, having 2 or 3 disulfide bonds in their native and active final conformation, were active after in vitro production and showed a similar spectrum of activity as the native ones. This is the first time that a circular bactcriocin is produced in E. coli. Some options to enhance production and facilitate purification might include the use of a different host for protein production, the addition of an immunity gene to the construct in order to prevent toxic effects of the bacteriocin, fusion of a signal peptide to the protein to promote secretion outside the cell, use of switchable inteins for conditional protein splicing or use of fusion tags to facilitate column purification.

[0126] Last, use of SICLOPPS for circularization of other circular bacteriocins was determined. The list included some fully characterized candidates such as enterocin AS-48 or gassericin A, two well studied bacteriocins potential as food biopreservatives or prebiotic compounds in animal health is well documented. Other bacteriocins selected in this study correspond to putative circular bacteriocins found in genome mining studies, but not yet confirmed in the lab. After CFPS most of SICLOPPS was observed to produce bacteriocins showing activity against at least one of the indicators used in the study, thus showing that, most likely, splicing and circularization of these bacteriocins was happening.

[0127] Examples 1-3 demonstrate that circularization of bacteriocins using split- inteins allow the fast production and circularization of bacteriocins, ready to be tested for antimicrobial activity. This is the first time that production and circularization of bacteriocins is carried out using split-inteins, the first time they are produced using CFPS and also the first time a functional circular bacteriocin is produced by E. coli.

[0128] Examples 1-3 below demonstrate an efficient synthetic biology method to carry out circularization of many bacteriocins, even in the absence of the original producer strain. This method also simplifies production of circular bacteriocins as just one single gene is necessary for production and circularization. This work provides for use of inteins with bacteriocins, including:

- [0129] For production, circularization and functional test of other hypothetical circular bacteriocins.

- [0130] For production and circularization of non-circular (linear) bacteriocins to enhance stability.

- [0131] For the generation of circular bacteriocin libraries to test and obtain variants with enhanced features. [0132] For heterologous production of circular bacteriocins in other host for scaling up bacteriocin production.

REFERENCES

[1] M. Younes et al., “Safety of nisin (E 234) as a food additive in the light of new toxicological data and the proposed extension of use,” EFSA J., vol. 15, no. 12, Dec.

2017.

[2] S. Soltani et al., “Bacteriocins as a new generation of antimicrobials: toxicity aspects and regulations,” FEMS Microbiol. Rev., vol. 45, no. 1, Jan. 2021.

[3] M. L. Chikindas, R. Weeks, D. Drider, V. A. Chistyakov, and L. M. Dicks, “Functions and emerging applications of bacteriocins,” Curr. Opin. Biotechnol., vol. 49, p. 23, Feb.

2018.

[4] M. Zimina et al. , “Overview of Global Trends in Classification, Methods of Preparation and Application of Bacteriocins,” Antibiot. 2020, Vol. 9, Page 553, vol. 9, no. 9, p. 553, Aug. 2020.

[5] P. D. Cotter, R. P. Ross, and C. Hill, “Bacteriocins-a viable alternative to antibiotics?,” Nat. Rev. Microbiol., vol. 11, no. 2, pp. 95-105, Feb. 2013.

[6] J. Borrero et al. , “Characterization of garvicin ML, a novel circular bacteriocin produced by Lactococcus garvieae DCC43, isolated from mallard ducks (Anas platyrhynchos).,” Appl. Environ. Microbiol., vol. 77, no. 1, pp. 369-373, Jan. 2011.

[7] P. Alvarez-Sieiro, M. Montalban-Lopez, D. Mu, and O. P. Kuipers, “Bacteriocins of lactic acid bacteria: extending the family,” Appl. Microbiol. Biotechnol., vol. 100, no. 7, pp. 2939-2951, Apr. 2016.

[8] J. Borrero et al., “Plantaricyclin A, a Novel Circular Bacteriocin Produced by Lactobacillus plantarum NI326: Purification, Characterization, and Heterologous Production.,” Appl. Environ. Microbiol., vol. 84, no. 1, Jan. 2018.

[9] R. H. Perez, T. Zendo, and K. Sonomoto, “Circular and Leaderless Bacteriocins: Biosynthesis, Mode of Action, Applications, and Prospects.,” Front. Microbiol., vol. 9, p. 2085, 2018.

[10] D. Major, L. Flanzbaum, L. Lussier, C. Davies, K. M. P. Caldo, and J. Z. Acedo, “Transporter Protein-Guided Genome Mining for Head-to-Tail Cyclized Bacteriocins,” Molecules, vol. 26, no. 23, Dec. 2021.

[11] B. Xin et al., “ In Silico Analysis Highlights the Diversity and Novelty of Circular Bacteriocins in Sequenced Microbial Genomes ,” mSystems, vol. 5, no. 3, Jun. 2020.

[12] B. Vezina, B. H. A. Rehm, and A. T. Smith, “Bioinformatic prospecting and phylogenetic analysis reveals 94 undescribed circular bacteriocins and key motifs.,” BMC Microbiol., vol. 20, no. 1, p. 77, Apr. 2020.

[13] P. Gabant and J. Borrero, “PARAGEN 1.0: A Standardized Synthetic Gene Library for Fast Cell-Free Bacteriocin Synthesis.,” Front. Bioeng. Biotechnol., vol. 7, p. 213, 2019.

[14] C. P. Scott, E. Abel-Santos, M. Wall, D. C. Wahnon, and S. J. Benkovic, “Production of cyclic peptides and proteins in vivo,” Proc. Nall. Acad. Sci. U. S. A., vol. 96, no. 24, pp. 13638-13643, Nov. 1999.

[15] A. Tavassoli and S. J. Benkovic, “Split-intein mediated circular ligation used in the synthesis of cyclic peptide libraries in E. coli.,” Nat. Protoc., vol. 2, no. 5, pp. 1126— 1133, 2007. [16] J. E. Townend and A. Tavassoli, “Traceless Production of Cyclic Peptide Libraries in E. coli., ” ACS Chem. Biol., vol. 11, no. 6, pp. 1624-1630, Jun. 2016.

[17] M. Cheriyan, C. S. Pedamallu, K. Tori, and F. Perler, “Faster protein splicing with the Nostoc punctiforme DnaE intein using non-native extern residues,” J. Biol. Chem., vol. 288, no. 9, pp. 6202-6211, Mar. 2013.

EXAMPLES

[0133] The following materials and methods were used in Examples 1-4.

Bacterial strains, plasmids and culture conditions

[0134] For GarML production the microbial strains and plasmids used in this study are listed in Table 1.

Table 1 a CECT, Coleccion Espanola de Cultivos Tipo. b A. Chopin, M. C. Chopin, A. Moillo-Batt, and P. Langella, “Two plasmid-determined restriction and modification systems in Streptococcus lactis.,” Plasmid, vol. 11, no. 3, pp. 260- 263, May 1984.

[0135] A schematic view of the design of the plasmids is shown in Figs. 2A-2D. All amino acid sequences of the fusion polypeptides are shown in Table 2.1. All amino acid sequences, both native and as modified for use with SICLOPPS (split-intein circular ligation of peptides and proteins), of the characterized and uncharacterized circular bacteriocins with the SICLOPPS method and the controls are shown in Table 2.2. For plasmid construction, all amino acidic sequences were reverse-translated and codon optimized for Escherichia coli (world wide web at bioinformatics.org/sms2/rev_trans.html). The nucleotide sequences were included in a vector backbone containing the T7 promoter region, a start codon (ATG) a stop codon (TAA) and a T7 terminator region. Plasmid synthesis was carried out by Genewiz (New Jersey, USA).

Table 2.1

List of bacteriocins used in this study. Bacteriocins have been grouped according to the classification made by Vezina et al., 2020. In bold are the name of those bacteriocins fully characterized. The first line of the mature amino acidic sequence corresponds to the described or hypothetical linear sequence originated after leader sequence cleavage and before head-to- tail circularization. The second line corresponds to the amino acidic sequence used in this study for circularization using the SICCLOPPS system. The serines used in position 1 are bolded and underlined in the original sequence. Bacteriocin F9 has no serine in its original amino acidic sequence. A serine was added in position one (bolded).

Table 2.2 Components of the STCLOPPS system used in this study. Tn bold are the point residue substitution used in order to generate non-functional inteins.

Cell-free production of bacteriocins

[0136] Recombinant vectors were stabilized and amplified in E. coli DH5a standard strain and used as templates for cell-free protein synthesis using PURExpress® in vitro Protein Synthesis Kit (New England Biolabs). All bacteriocins synthesized were tested on petri dishes for antimicrobial activity against an indicator strain. Bacteriocins showing a halo of inhibition against at least one of the indicators were considered as positive. Production and purification ofGarvicin ML from E. coli BL21-GarvML

[0137] In order to purify the antimicrobial compound produced by E. coli BL21- pUC-Npu-GarvML, the culture was grown in 10 ml LB broth supplemented with Ampicillin at 100 pg/ml (LB-Amp) and grown in a shaking 37°C incubator overnight. 500 ml of LB-Amp were inoculated with the overnight culture to an ODeoo of 0.1 and grown in a shaking 37°C incubator. When the culture had reached an ODeoo of 0.4, IPTG was added to a final concentration of 0.5 mM. Culture was grown for another 3 hours and cells were pelleted by centrifugation (8,000 r.p.m.; 4°C) for 15 minutes. Cells were resuspended in 20 ml ice-cold column buffer (20 mM Phosphate buffer pH 6 and 1 M NaCl) and lysed by sonication (6 cycles of 10 seconds at 45% with 1 minute incubation in ice in between the cycles). The insoluble debris was pelleted by centrifugation (8,000 r.p.m.; 4°C) for 15 minutes and the soluble fraction (SF) obtained was filtered through a 0.45 nm filter.

[0138] SF was further subjected to hydrophobic-interaction (Octyl Sepharose CL- 4B; Merck) chromatography. First Ammonium Sulfate was added to the SF (10% w/v). A column with 2 ml of Octyl Sepharose CL-4B was washed with H2O and equilibrated with 15 ml equilibration buffer (EB ; 20 mM Phosphate buffer pH6 with Ammonium Sulfate [1% w/v]). Then the SF was added to the column, which was washed with 10 ml EB. Bacteriocin was eluted with 10 ml 70% EtOH diluted in 20 mM Phosphate buffer pH 6.

[0139] Finally the fraction obtained was diluted 5 times in H2O + TFA (0.1% v/v) and subjected to reverse phase chromatography (Source 5 RPC ST4.6-150; GE Healthcare) in a fast-protein liquid chromatography system (AKTA, RP-FPLC). Samples were eluted with the mobile phase consisting of 0.1% (v/v) trifluoroacetic acid (TFA) in a mixture of water (eluent A) and isopropanol (eluent B). (Both reagents were HPLC grade). A gradient program was followed: samples were initially eluted with 100% A for 5 min, then with a linear gradient 0 - 70% B over 50 min, followed by a linear gradient to 100% B over 5 min and maintained at 100% B for 7 min. The flow rate was maintained at 1 ml/min, absorbance was monitored at 254 nm and the column was kept at room temperature. Column eluents and FPLC fractions were assayed for antimicrobial activity by spot on agar assays (SPA), using L. garvieae 5806 as the target organism. FPLC fractions displaying antimicrobial activity were subjected to a second run of FPLC under the same conditions.

Characterization of Garvivin ML by directed proteomics combined with Mass Spectrometry

[0140] Active fractions from the second run of FPLC were concentrated with a Speed-vac and subjected to matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) on a 4800 Proteomics Analyzer with TOF/TOF (AB SCIEX) in positive reflectron mode (Unidad de Proteomica-Universidad Complutense de Madrid, Madrid, Spain).

[0141] After drying out and resuspending the sample in 25 pl TEAB 25 mM and buffer S-TRAP to equal parts, it was digested with trypsin in a S-Trap microcolumn (PROTIFITM) as recommended by the manufacturer. Shortly, protein was reduced with 10 DTT for 60 min at 56°C, and then alkylated with 25 mM iodocetamide for 60 min in darkness. Then 20% SDS, TEAB 1 M and Phosphoric Acid were added to final concentration of 10%, 100 mM and 1.2%, respectively. Next S-Trap binding buffer was added in a 6:1 ratio, applied to the column and digested, following the protocol, with 1.5 pg recombinant Trypsin sequencing grade (Roche Molecular Biochemicals) in TEAB 50 mM for 90 min at 47°C in static conditions.

[0142] Finally, the peptides obtained were eluted and dried out on a SpeedVac (Savant), reconstituted in 15 pl 2% ACN, Formic Acid 0.1% and quantified on a Qubit (ThermoFisher Scientific). Then samples were analyzed through LC-MS/MS.

Example 1

[0143] This non-limiting example shows designing a genetic vector encoding a bacteriocin flanked by a split intein, and cell-free production of an active bacteriocin therefrom. Design of the expression vector for production of garvicin ML [0144] Based on the work describing the split intein circular ligation of peptides and proteins (SICLOPPS) system [16] a gene containing the C and N-tcrminal intein fragments from Npu DnaE split intein (Ic and IN, respectively) fused to the mature peptide of bacteriocin garvicin ML (GarML) was synthesized. With reference to Fig. 2C, the split intein sequence is shown with solid underline, and the bacteriocin sequence is shown with dotted underline. Native Garvicin ML circularization occurs after a head-to-tail ligation between residues Leul and Ala60 after leader sequence cleavage [6] (Fig. 2A), but the intein chemistry typically requires the first amino acid of the target peptide to be either a cysteine or a serine. GarML has no cysteine in its mature sequence, but it has 3 serines (Serl9, Ser29 and Ser32). Accordingly, the order of the residues in Garvicin ML was switched, choosing Ser32 as the first residue in the linear conformation (Seri in the new conformation) and Phe31 as the last residue (Phe60 in the new conformation). In addition, a protein degradation tag in C-terminus (SsrA) of the construct was included (rectangular box in Fig. 2C) to overcome potential toxicity of Npu intein after splicing (Figs. 2B-2D) [16]. Based on the amino acid sequence designed, the synthetic gene with the codon usage of E. coll was produced, and was put under the control of promoter T7 in a pUC expression standard expression vector, called pUC-Npu-GarML. In parallel the same construct without the inteins (pUC-GarML) and with an inactive variant of the Npu DnaE inteins having a N36D substitution in Ic and a CIA substitution in IN (pUC- Npu’ - GarML) were produced [17] (Fig. 2B).

Cell-free production of GarML

[0145] Plasmids pUC-Npu-GarML, pUC-GarML and pUC-Npu-GarML were used as templates for cell-free protein production of Npu-GarML, GarML and Npu-GarML, respectively. We then tested the activity of the products of the reactions against L. garvieae strain. Neither GarML nor Npu-ClA-GarML was active against the indicator, demonstrating that neither the linear GarML nor a linear version with the Ic and IN at both sides of GarML was an active form. In contrast, Npu-GarML showed activity against the indicator (Fig. 3A), and this activity was higher when the product was left overnight at room temperature (Fig. 3B). These results demonstrated that in vitro production of GarML fused to Npu inteins allows for circularization of GarML (Fig. 2D) as the linear version of GarML with or without Npu inteins did not show any activity. [0146] Tn some embodiments, designing a nucleic acid encoding a bacteriocin circularized by a split intcin involves circularly permuting the amino acid sequence of a native, mature form of a circular bacteriocin such that a serine or cysteine in the native sequence is positioned as the first amino acid, fusing an N-terminal fragment of a split intein to the N- terminus of the circularly permuted bacteriocin sequence, and fusing a C-terminal fragment of the split intein to the C-terminus of the circularly permuted bacteriocin sequence. In some embodiments, a nucleic acid encoding a bacteriocin flanked at both the N- and C-termini by a split intein that circularizes the bacteriocin is expressed in vitro in a cell-free expression system to produce a gene product that exhibits antimicrobial activity of the encoded bacteriocin, where the encoded bacteriocin is a natively circular bacteriocin.

Example 2

[0147] This non-limiting example shows expression of circular bacteriocin by a genetically engineered bacteria with a vector encoding a bacteriocin flanked by a split intein, and analysis of the expressed bactericin to confirm head-to-tail circularization by the split intein.

Purification and mass spectrometry (MS) and multiple reaction monitoring (MRM) analysis of garvicin ML from the soluble fraction of recombinant E. coli

[0148] In order to demonstrate and prove correct splicing and circularization of GarML Npu-GarML production was scaled up using E. coli BL21 as producer (Fig. 4A). Soluble fraction of E. coli BL21 pUC-Npu-GarML showed antimicrobial activity against Lc. garvieae. The active peptide was further purified from the soluble fraction using hydrophobic interaction (HI) and reversed -phase fast-performance liquid chromatography (FPLC). Antimicrobial activity correlated with a peak eluting at 39% isopropanol in the FPLC chromatogram in two consecutive rounds of purification. Matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) analysis revealed that the corresponding fractions had a mass of 6,004,2 Da (Fig. 4B).This correlates with the mass from native circular garvicin ML [6]. This fraction was further subjected to trypsin digestion and fragments originated where analysed by LC-MRM-MS analysis. Knowing the mass and aminoacidic sequence of garvicin ML, it is possible to predict the precursor z and fragments m/z (MRM transition). Each targeted peptide has a set of accompanying transitions that are then selectively detected in a second stage of MS. All peptides were confirmed by MS/MS covering 100% of the complete GarML sequence. One of the peptides detected and confirmed by MS/MS (TIVNAVSAGMDIATALSFSGAFTAAGGIMALIKK) contained residues SI and F60 from Npu-GarML linked together, thus confirming splicing and head-to-tail circularization of GarML.

Example 3

[0149] This non-limiting example shows cell-free production of known and predicted circular bacteriocins using split inteins, and confirmation of antimicrobial activity thereof.

Plasmid design and cell-free production of characterized and uncharacterized circular bacteriocins

[0150] Following the same scheme as for GarML, different plasmids containing genes coding for recombinant proteins containing Npu Ic and IN fused to the mature amino acid sequence of different known and hypothetical circular bacteriocins were synthesized. The circularization point for all of them was changed with respect to the native peptides so they all would start with a Ser in position 1 (Table 2.1). The Ssra tag was also added to the C-terminus. As for GarML, an Npu inactive version was built for all peptides. Plasmids synthesized were used as templates for cell free protein production. Antimicrobial activity of the peptides obtained after cell free production and overnight incubation at room temperature were tested against indicator L. lactis IL1403. None of the proteins containing the inactive inteins showed antimicrobial activity. Most of the fractions containing native Npu inteins fused to the mature sequence of the characterized, and hypothetical circular bacteriocins showed activity against L. lactis IL1403, thus showing that SICLOPPS method is also applicable for circularization of other circular bacteriocins (Table 3).

Table 3: Antimicrobial activity of different bacteriocins circularized with inteins

[0151] In some embodiments, bacteriocins known or predicted to be circular bacteriocins are expressed in a cell-free system by designing a nucleic acid encoding the bacteriocin flanked by a split intein, where the native amino acid sequence of the bacteriocin is circularly permuted, and/or is mutated to introduce a non-native serine, such that a serine is at position 1 of the bacteriocin encoded by the nucleic acid.

Example 4 [0152] This non-limiting example shows screening a library for hacteriocins having a desired activity.

[0153] A nucleic acid encoding an amino acid sequence of a circular bacteriocin, Enterocin NKR-5-3B, is prepared. The amino acid sequence is modified relative to the native sequence of the circular bacteriocin such that a serine or cysteine is at position 1 of the amino acid sequence, e.g., by circularly permuting the native sequence to place a native serine or cysteine at position 1. The nucleic acid is amplified and mutations are introduced, e.g., by random mutagenesis or selective point mutation, to generate a collection of variants of the nucleic acid encoding the circular bacteriocin. The variants are cloned into an expression vector so that each variant of the nucleic acid encoding the circular bacteriocin is flanked by a split intein configured to circularize the bacteriocin, to generate a library of expression vectors having variant nucleic acids encoding the circular bacteriocin.

[0154] A cell-free expression system is used to express circular bacteriocins from the library of expression vectors, and the produced circular bacteriocins are tested for antimicrobial activity against one or more bacterial strains of interest, to identify those that exhibit a desired activity. For example, in one embodiment variant nucleic acids encoding Enterocin NKR-5 -3B that retain antimicrobial activity against L. lactis, but do not retain antimicrobial activity against L. inocua are isolated and sequenced to identify the mutation(s) responsible for conferring the desired antimicrobial activity to the circular bacteriocin.

Example 5

[0155] This non-limiting example shows controlling the growth of a microbial organism using a circular bacteriocin.

[0156] A polypeptide containing an amino acid sequence of a circular bacteriocin, for example Enterocin AS -48, flanked by a split intein is produced. The split intein is a conditionally active, pH-sensitive split intein, and is configured to circularize the bacteriocin when the pH is below 6.0. The polypeptide is introduced into a culture medium growing a microbial organism of interest, at pH 7.0. The bacteriocin is not circularized at pH 7.0, and does not exhibit antimicrobial activity. When a contaminating microbial species, L. lactis, is detected in the culture medium, the pH of the medium is reduced to below 6.0, which activates the split intein and causes the bacteriocin to circularize. Subsequently, the growth of the contaminating L. lactis in the culture medium is inhibited.

Example 6

[0157] This non-limiting example shows controlling the growth of a microbial organism using a circular bacteriocin.

[0158] A microbial cell is genetically engineered with an expression vector encoding a circular bacteriocin, for example Leucocyclicin Q, flanked by a split intein. The genetically engineered microbial cell is introduced into a culture medium growing a microbial organism of interest. The microbial cell produces the bacteriocin in circularized form, and secretes it into the culture medium. Growth of a contaminating microbial species, L. lactis, is inhibited by the circular bacteriocin.

[0159] In at least some of the embodiments described herein, one or more elements used in an embodiment can interchangeably be used in another embodiment unless such a replacement is not technically feasible. It will be appreciated by those skilled in the art that various other omissions, additions and modifications may be made to the methods and structures described herein without departing from the scope of the claimed subject matter. All such modifications and changes are intended to fall within the scope of the subject matter, as defined by the appended claims.

[0160] With respect to the use of substantially any plural and/or singular terms herein, those having skill in the art can translate from the plural to the singular and/or from the singular to the plural as is appropriate to the context and/or application. The various singular/plural permutations may be expressly set forth herein for sake of clarity.

[0161] It will be understood by those within the art that, in general, terms used herein, and especially in the appended claims (e.g., bodies of the appended claims) are generally intended as “open” terms (e.g., the term “including” should be interpreted as “including but not limited to,” the term “having” should be interpreted as “having at least,” the term “includes” should be interpreted as “includes but is not limited to,” etc.). It will be further understood by those within the art that if a specific number of an introduced claim recitation is intended, such an intent will be explicitly recited in the claim, and in the absence of such recitation no such intent is present. For example, as an aid to understanding, the following appended claims may contain usage of the introductory phrases "at least one" and "one or more" to introduce claim recitations. However, the use of such phrases should not be construed to imply that the introduction of a claim recitation by the indefinite articles "a" or "an" limits any particular claim containing such introduced claim recitation to embodiments containing only one such recitation, even when the same claim includes the introductory phrases "one or more" or "at least one" and indefinite articles such as "a" or "an" (e.g., “a” and/or “an” should be interpreted to mean “at least one” or “one or more”); the same holds true for the use of definite articles used to introduce claim recitations. In addition, even if a specific number of an introduced claim recitation is explicitly recited, those skilled in the art will recognize that such recitation should be interpreted to mean at least the recited number (e.g., the bare recitation of "two recitations," without other modifiers, means at least two recitations, or two or more recitations). Furthermore, in those instances where a convention analogous to “at least one of A, B, and C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “ a system having at least one of A, B, and C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). In those instances where a convention analogous to “at least one of A, B, or C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “ a system having at least one of A, B, or C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). It will be further understood by those within the art that virtually any disjunctive word and/or phrase presenting two or more alternative terms, whether in the description, claims, or drawings, should be understood to contemplate the possibilities of including one of the terms, either of the terms, or both terms. For example, the phrase “A or B” will be understood to include the possibilities of “A” or “B” or “A and B.”

[0162] In addition, where features or aspects of the disclosure are described in terms of Markush groups, those skilled in the art will recognize that the disclosure is also thereby described in terms of any individual member or subgroup of members of the Markush group. [0163] As will be understood by one of skill in tbe art, for any and all purposes, such as in terms of providing a written description, all ranges disclosed herein also encompass any and all possible sub-ranges and combinations of sub-ranges thereof. Any listed range can be easily recognized as sufficiently describing and enabling the same range being broken down into at least equal halves, thirds, quarters, fifths, tenths, etc. As a non-limiting example, each range discussed herein can be readily broken down into a lower third, middle third and upper third, etc. As will also be understood by one skilled in the art all language such as “up to,” “at least,” “greater than,” “less than,” and the like include the number recited and refer to ranges which can be subsequently broken down into sub-ranges as discussed herein. Finally, as will be understood by one skilled in the art, a range includes each individual member. Thus, for example, a group having 1-3 articles refers to groups having 1, 2, or 3 articles. Similarly, a group having 1-5 articles refers to groups having 1, 2, 3, 4, or 5 articles, and so forth.

[0164] While various aspects and embodiments have been disclosed herein, other aspects and embodiments will be apparent to those of skill in the art. The various aspects and embodiments disclosed herein are for purposes of illustration and are not intended to be limiting, with the true scope and spirit being indicated by the following claims.