Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SPLIT RECOMBINASES HAVING INDUCIBLE RECOMBINASE ACTIVITY
Document Type and Number:
WIPO Patent Application WO/2024/036155
Kind Code:
A2
Abstract:
Described herein are split recombinases having inducible recombinase activity and polynucleic acid molecules encoding the same. Also described herein are kits and host cells comprising the split recombinases, as well as methods of their use.

Inventors:
GAM JEREMY (US)
KAUL MAYA (US)
NIELSEN ALEC (US)
Application Number:
PCT/US2023/071850
Publication Date:
February 15, 2024
Filing Date:
August 08, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
ASIMOV INC (US)
International Classes:
C12N9/22; C12N1/20
Attorney, Agent or Firm:
POWELL, Curtis, R. et al. (US)
Download PDF:
Claims:
- I l l -

CLAIMS

What is claimed is:

1. A polynucleic acid molecule encoding a polypeptide dimer having recombinase activity, wherein the nucleic acid sequence encoding the polypeptide dimer comprises, from 5’ to 3’: (i) a sequence encoding for a first polypeptide comprising a first portion of a recombinase and a first dimerization domain; (ii) a sequence encoding for a viral 2A peptide and/or an internal ribosomal entry site (IRES); and (iii) a sequence encoding for a second polypeptide comprising a second portion of a recombinase and a second dimerization domain; wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity.

2. The polynucleic acid molecule of claim 1, wherein the polypeptide dimer is derived from a Flp recombinase, a Bxbl recombinase, a PhiC31 recombinase, a TP901 recombinase, a Cre recombinase, a Vcre recombinase, a R4 recombinase, a Dre recombinase, an Inti recombinase, an Int2 recombinase, an Int3 recombinase, an Int4 recombinase, an Int5 recombinase, an Int6 recombinase, an Int7 recombinase, an Int8 recombinase, an Int9 recombinase, an IntlO recombinase, an Intl l recombinase, an Int 12 recombinase, an Intl3 recombinase, an Int 14 recombinase, an Int 15 recombinase, an Int 16 recombinase, an Int 17 recombinase, an Inti 8 recombinase, an Inti 9 recombinase, an Int20 recombinase, an Int21 recombinase, an Int22 recombinase, an Int23 recombinase, an Int24 recombinase, an Int25 recombinase, an Int26 recombinase, an Int27 recombinase, an Int28 recombinase, an Int29 recombinase, an Int30 recombinase, an Int31 recombinase, an Int32 recombinase, an Int33 recombinase, or an Int34 recombinase.

3. The polynucleic acid molecule of claim 2, wherein the polypeptide dimer is derived from Flp recombinase.

4. The polynucleic acid molecule of claim 3, wherein the first portion of the recombinase corresponds to an N-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 41 and the second portion of the recombinase corresponds to a C-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 42.

5. The polynucleic acid molecule of claim 3, wherein the first portion of the recombinase corresponds to a C-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 42 and the second portion of the recombinase corresponds to an N-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 41.

6. The polynucleic acid molecule of claim 3, wherein the first portion of the recombinase corresponds to an N-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 43 and the second portion of the recombinase corresponds to a C-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 44.

7. The polynucleic acid molecule of claim 3, wherein the first portion of the recombinase corresponds to a C-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 44 and the second portion of the recombinase corresponds to an N-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 43.

8. The polynucleic acid molecule of claims 2, wherein the polypeptide dimer is derived from Bxbl recombinase.

9. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 65 and the second portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 66.

10. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 66 and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 65.

11. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 47 and the second portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 48.

12. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 48 and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 47.

13. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 49 and the second portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 50.

14. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 50 and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 49.

15. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 53 and the second portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 54. 16. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 54 and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 53.

17. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 59 and the second portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 60.

18. The polynucleic acid molecule of claim 8, wherein the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 60 and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 59.

19. The polynucleic acid molecule of claims 2, wherein the polypeptide dimer is derived from PhiC31 recombinase.

20. The polynucleic acid molecule of claim 19, wherein the first portion of the recombinase corresponds to an N-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 67 and the second portion of the recombinase corresponds to a C-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 68.

21. The polynucleic acid molecule of claim 19, wherein the first portion of the recombinase corresponds to a C-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 68 and the second portion of the recombinase corresponds to an N-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 67. 22. The polynucleic acid molecule of claim 19, wherein the first portion of the recombinase corresponds to an N-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 69 and the second portion of the recombinase corresponds to a C-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 70.

23. The polynucleic acid molecule of claim 19, wherein the first portion of the recombinase corresponds to a C-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 70 and the second portion of the recombinase corresponds to an N-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 69.

24. The polynucleic acid molecule of claims 2, wherein the polypeptide dimer is derived from TP901 recombinase.

25. The polynucleic acid molecule of claim 24, wherein the first portion of the recombinase corresponds to an N-terminal portion of TP901 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 71 and the second portion of the recombinase corresponds to a C-terminal portion of TP901 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 72.

26. The polynucleic acid molecule of claim 24, wherein the first portion of the recombinase corresponds to a C-terminal portion of TP901 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 72 and the second portion of the recombinase corresponds to an N-terminal portion of TP901 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 71.

27. The polynucleic acid molecule of claims 2, wherein the polypeptide dimer is derived from Cre recombinase.

28. The polynucleic acid molecule of claim 27, wherein the first portion of the recombinase corresponds to an N-terminal portion of Cre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 73 and the second portion of the recombinase corresponds to a C-terminal portion of Cre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 74.

29. The polynucleic acid molecule of claim 27, wherein the first portion of the recombinase corresponds to a C-terminal portion of Cre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 74 and the second portion of the recombinase corresponds to an N-terminal portion of Cre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 73.

30. The polynucleic acid molecule of claims 2, wherein the polypeptide dimer is derived from Vcre recombinase.

31. The polynucleic acid molecule of claim 30, wherein the first portion of the recombinase corresponds to an N-terminal portion of Vcre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 77 and the second portion of the recombinase corresponds to a C-terminal portion of Vcre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 78.

32. The polynucleic acid molecule of claim 30, wherein the first portion of the recombinase corresponds to a C-terminal portion of Vcre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 78 and the second portion of the recombinase corresponds to an N-terminal portion of Vcre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 77.

33. The polynucleic acid molecule of any one of claims 1-32, wherein, in the first polypeptide, the first dimerization domain is N-terminal to the first portion of the recombinase.

34. The polynucleic acid molecule of any one of claims 1-32, wherein, in the first polypeptide, the first dimerization domain is C-terminal to the first portion of the recombinase. 35. The polynucleic acid molecule of any one of claims 1-34, wherein, in the second polypeptide, the second dimerization domain is N-terminal to the second portion of the recombinase.

36. The polynucleic acid molecule of any one of claims 1-34, wherein, in the second polypeptide, the second dimerization domain is C-terminal to the second portion of the recombinase.

37. The polynucleic acid molecule of any one of claims 1-36, wherein the dimerization of the first polypeptide and the second polypeptide is dependent on the presence of a small molecule inducer.

38. The polynucleic acid molecule of claim 37, wherein the small molecule inducer is selected from the group consisting of gibberellic acid, abscisic acid, and rapalog.

39. The polynucleic acid molecule of claim 37 or claim 38, wherein: the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 80 and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 79; the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 79 and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 80; the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 81 and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 82; the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 82 and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 81; the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 83 and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 84; or the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 84 and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 83. 40. The polynucleic acid molecule of any one of claims 1-39, wherein the nucleic acid sequence encoding the polypeptide dimer comprises a sequence encoding for a viral 2 A peptide.

41. The polynucleic acid molecule of claim 40, wherein the viral 2A peptide comprises an amino acid sequence having at least 85% identity to any one of SEQ ID NOs: 88-89 and 236- 237.

42. The polynucleic acid molecule of any one of claims 1-39, wherein the nucleic acid sequence encoding the polypeptide dimer comprises a sequence encoding for an IRES.

43. The polynucleic acid molecule of claim 42, wherein the IRES comprises a nucleic acid sequence having at least 85% identity to any one of SEQ ID NOs: 85-87.

44. The polynucleic acid molecule of any one of claims 1-39, wherein the nucleic acid sequence encoding the polypeptide dimer comprises a sequence having at least 85% identity to any one of SEQ ID NOs: 90-110.

45. The polynucleic acid molecule of any one of claims 1-44, wherein the polynucleic acid molecule encodes a polycistronic mRNA operably linked to a promoter, wherein the polycistronic mRNA comprises the nucleic acid sequence encoding the polypeptide dimer.

46. The polynucleic acid molecule of claim 45, wherein the nucleic acid sequence encoding for the polycistronic mRNA is operably linked to a constitutive promoter.

47. The polynucleic acid molecule of claim 45, wherein the nucleic acid sequence encoding for the polycistronic mRNA is operably linked to an inducible promoter.

48. The polynucleic acid molecule of claim 45, wherein the polynucleic acid molecule comprises an expression cassette comprising the nucleotide encoding for the polycistronic mRNA operably linked to a promoter, wherein the expression cassette comprises a nucleic acid sequence having at least 85% identity to any one of SEQ ID NO: 132-143. 49. An engineered cell comprising the polynucleic acid molecule of any one of claims 1- 48.

50. An engineered cell comprising:

(a) a first polynucleic acid molecule according to any one of claims 45-48; and

(b) a second polynucleic acid molecule comprising a nucleic acid sequence encoding, from 5’ to 3’: (i) a first recombinase site; (ii) a gene coding segment; and (iii) a second recombinase site; wherein the first and second recombinase sites correspond to the polypeptide dimer having recombinase activity encoded by the polycistronic mRNA of the first polynucleic acid of (a).

51. The engineered cell of claim 50, wherein the first recombinase site of the second polynucleic acid molecule comprises a nucleic acid sequence having at least 85% identity to any one of SEQ ID NOs: 144-235 and the second recombinase site of the second polynucleic acid molecule comprises the nucleic acid sequence having at least 85% identity to any one of SEQ ID NOs: 144-235.

52. The engineered cell of claim 50 or claim 51, wherein the gene coding segment comprises a nucleic acid sequence encoding for at least a portion of Rep52, at least a portion of Rep40, at least a portion of Rep78, at least a portion of Rep68, at least a portion of E2A, at least a portion of E40rf6, at least a portion of VARNA, at least a portion of VP1, at least a portion of VP2, at least a portion of VP3, at least a portion of AAP, or a combination thereof.

53. The engineered cell of claim 52, wherein the engineered cell comprises a stable integration of one or more polynucleic acid molecules collectively comprising nucleic acid sequences encoding for: Rep52 or Rep40; Rep78 or Rep68; E2A; E40rf6; VARNA; VP1; VP2; VP3; and AAP.

54. The engineered cell of claim 52 or claim 53, wherein the engineered cell comprises one or more polynucleic acid molecules collectively comprising nucleic acid sequences encoding for: UL5, UL8, UL29, UL30, UL42, UL52, UL12, ICP10, ICP4, and ICP22.

Description:
SPLIT RECOMBINASES HAVING INDUCIBLE RECOMBINASE ACTIVITY

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application No. 63/370,881, filed August 9, 2022, the entire contents of which are hereby incorporated by reference.

REFERENCE TO AN ELECTRONIC SEQUENCE LISTING

The contents of the electronic sequence listing (A12107001 IWOOO-SEQ-CRP.xml; Size: 376,273 bytes; and Date of Creation: August 8, 2023) is hereby incorporated by reference in its entirety.

FIELD

Described herein are split recombinases having inducible recombinase activity and polynucleic acid molecules encoding the same. Also described herein are kits and host cells comprising the split recombinases, as well as methods of their use.

BACKGROUND OF INVENTION

Recombinases are enzymes that catalyze site-specific recombination events within DNA. Recombinases are widely used in multicellular organisms to manipulate the structure of genomes and, thereby, to control gene expression. The use of recombinases to manipulate expression in engineered cells has been limited by their toxicity.

SUMMARY OF INVENTION

Described herein are split recombinases, which in some embodiments have inducible recombinase activity. The inducible split recombinases described herein are expected to have reduced toxicity relative to the recombinases from which they are derived. These split recombinases can be used to regulate gene expression in various applications. For example, an AAV production system may comprise a split recombinase, wherein the split recombinase mediates inducible control of a gene product(s) required for AAV production, including cytostatic or cytotoxic AAV gene products.

In some aspects, the disclosure relates to polynucleic acid molecules encoding a polypeptide dimer having recombinase activity. In some embodiments, the nucleic acid sequence encoding the polypeptide dimer comprises, from 5’ to 3’: (i) a sequence encoding for a first polypeptide comprising a first portion of a recombinase and a first dimerization domain; (ii) a sequence encoding for a viral 2A peptide and/or an internal ribosomal entry site (IRES); and (iii) a sequence encoding for a second polypeptide comprising a second portion of a recombinase and a second dimerization domain; wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity.

In some embodiments, the polypeptide dimer is derived from a Flp recombinase, a Bxbl recombinase, a PhiC31 recombinase, a TP901 recombinase, a Cre recombinase, a VCre recombinase, a R4 recombinase, a Dre recombinase, an Inti recombinase, an Int2 recombinase, an Int3 recombinase, an Int4 recombinase, an Int5 recombinase, an Int6 recombinase, an Int7 recombinase, an Int8 recombinase, an Int9 recombinase, an IntlO recombinase, an Intl l recombinase, an Int 12 recombinase, an Intl3 recombinase, an Intl4 recombinase, an Intl5 recombinase, an Intl6 recombinase, an Intl7 recombinase, an Intl8 recombinase, an Intl9 recombinase, an Int20 recombinase, an Int21 recombinase, an Int22 recombinase, an Int23 recombinase, an Int24 recombinase, an Int25 recombinase, an Int26 recombinase, an Int27 recombinase, an Int28 recombinase, an Int29 recombinase, an Int30 recombinase, an Int31 recombinase, an Int32 recombinase, an Int33 recombinase, or an Int34 recombinase.

In some embodiments, the polypeptide dimer is derived from Flp recombinase.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 41, and the second portion of the recombinase corresponds to a C -terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 42. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 42, and the second portion of the recombinase corresponds to an N-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 41. In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 43, and the second portion of the recombinase corresponds to a C -terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 44. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 44, and the second portion of the recombinase corresponds to an N-terminal portion of Flp recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 43.

In some embodiments, the polypeptide dimer is derived from Bxbl recombinase.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 65, and the second portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 66. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 66, and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 65.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 47, and the second portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 48. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 48, and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 47.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 49, and the second portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 50. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 50, and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 49.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 53, and the second portion of the recombinase corresponds to a C -terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 54. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 54, and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 53.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 59, and the second portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 60. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 60, and the second portion of the recombinase corresponds to an N-terminal portion of Bxbl recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 59.

In some embodiments, the polypeptide dimer is derived from PhiC31 recombinase.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 67, and the second portion of the recombinase corresponds to a C-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 68. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 68, and the second portion of the recombinase corresponds to an N-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 67.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 69, and the second portion of the recombinase corresponds to a C-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 70. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 70, and the second portion of the recombinase corresponds to an N-terminal portion of PhiC31 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 69.

In some embodiments, the polypeptide dimer is derived from TP901 recombinase.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of TP901 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 71, and the second portion of the recombinase corresponds to a C-terminal portion of TP901 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 72. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of TP901 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 72, and the second portion of the recombinase corresponds to an N-terminal portion of TP901 recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 71.

In some embodiments, the polypeptide dimer is derived from Cre recombinase.

In some embodiments, the first portion of the recombinase corresponds to an N- terminal portion of Cre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 73, and the second portion of the recombinase corresponds to a C-terminal portion of Cre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 74. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of Cre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 74, and the second portion of the recombinase corresponds to an N-terminal portion of Cre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 73.

In some embodiments, the polypeptide dimer is derived from VCre recombinase. In some embodiments, the first portion of the recombinase corresponds to an N-terminal portion of VCre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 77, and the second portion of the recombinase corresponds to a C-terminal portion of VCre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 78. In some embodiments, the first portion of the recombinase corresponds to a C-terminal portion of VCre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 78, and the second portion of the recombinase corresponds to an N-terminal portion of VCre recombinase and consists of an amino acid sequence having at least 85% identity to SEQ ID NO: 77.

In some embodiments, in the first polypeptide, the first dimerization domain is N- terminal to the first portion of the recombinase. In some embodiments, in the first polypeptide, the first dimerization domain is C-terminal to the first portion of the recombinase. In some embodiments, in the second polypeptide, the second dimerization domain is N-terminal to the second portion of the recombinase. In some embodiments, in the second polypeptide, the second dimerization domain is C-terminal to the second portion of the recombinase.

In some embodiments, the dimerization of the first polypeptide and the second polypeptide is dependent on the presence of a small molecule inducer. In some embodiments, the small molecule inducer is selected from the group consisting of gibberellic acid, abscisic acid, and rapalog.

In some embodiments, the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 80, and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 79. In some embodiments, the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 79, and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 80.

In some embodiments, the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 81, and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 82. In some embodiments, the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 82, and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 81.

In some embodiments, the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 83, and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 84. In some embodiments, the first dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 84, and the second dimerization domain comprises an amino acid sequence having at least 85% identity to SEQ ID NO: 83.

In some embodiments, the nucleic acid sequence encoding the polypeptide dimer comprises a sequence encoding for a viral 2A peptide. In some embodiments, the viral 2A peptide comprises an amino acid sequence having at least 85% identity to any one of SEQ ID NOs: 88-89 and 236-237.

In some embodiments, the nucleic acid sequence encoding the polypeptide dimer comprises a sequence encoding for an IRES. In some embodiments, the IRES comprises a nucleic acid sequence having at least 85% identity to any one of SEQ ID NOs: 85-87.

In some embodiments, the nucleic acid sequence encoding the polypeptide dimer comprises a sequence having at least 85% identity to any one of SEQ ID NOs: 90-110.

In some embodiments, the polynucleic acid molecule encodes a polycistronic mRNA operably linked to a promoter, wherein the polycistronic mRNA comprises the nucleic acid sequence encoding the polypeptide dimer. In some embodiments, the nucleic acid sequence encoding for the polycistronic mRNA is operably linked to a constitutive promoter. In some embodiments, the nucleic acid sequence encoding for the polycistronic mRNA is operably linked to an inducible promoter.

In some embodiments, the polynucleic acid molecule comprises an expression cassette comprising the nucleotide encoding for the polycistronic mRNA operably linked to a promoter, wherein the expression cassette comprises a nucleic acid sequence having at least 85% identity to any one of SEQ ID NOs: 132-143.

In some aspects, the disclosure relates to engineered cells comprising a polynucleic acid molecule encoding a polypeptide having recombinase activity, as provided herein.

In some embodiments, an engineered cell comprises: (a) a first polynucleic acid molecule encoding a polypeptide having recombinase activity, as provided herein; and (b) a second polynucleic acid molecule comprising a nucleic acid sequence encoding, from 5’ to 3’: (i) a first recombinase site; (ii) a gene coding segment; and (iii) a second recombinase site; wherein the first and second recombinase sites correspond to the polypeptide dimer having recombinase activity encoded by the polycistronic mRNA of the first polynucleic acid of (a).

In some embodiments, the first recombinase site of the second polynucleic acid molecule comprises a nucleic acid sequence having at least 85% identity to any one of SEQ ID NOs: 144-235 and the second recombinase site of the second polynucleic acid molecule comprises the nucleic acid sequence having at least 85% identity to any one of SEQ ID NOs: 144-235.

In some embodiments, the gene coding segment comprises a nucleic acid sequence encoding for at least a portion of Rep52, at least a portion of Rep40, at least a portion of Rep78, at least a portion of Rep68, at least a portion of E2A, at least a portion of E40rf6, at least a portion of VARNA, at least a portion of VP1, at least a portion of VP2, at least a portion of VP3, at least a portion of AAP, or a combination thereof.

In some embodiments, the engineered cell comprises a stable integration of one or more polynucleic acid molecules collectively comprising nucleic acid sequences encoding for: Rep52 or Rep40; Rep78 or Rep68; E2A; E40rf6; VARNA; VP1; VP2; VP3; and AAP.

In some embodiments, the engineered cell comprises one or more polynucleic acid molecules collectively comprising nucleic acid sequences encoding for: UL5, UL8, UL29, UL30, UL42, UL52, UL12, ICP10, ICP4, and ICP22.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 provides a diagram of an exemplary small molecule-inducible split recombinases. A genetic schematic of a split recombinase is shown (top) containing the N- terminal portion of a recombinase (Rec-N), a dimerization domain (DI), a corresponding dimerization domain (D2), and the C-terminal portion of the same recombinase (Rec-C). The dimerization domains are separated by a 2A peptide or IRES sequence to allow both coding regions to be expressed from the same transcript. The transcript can be driven by a constitutive promoter (e.g., hEFla) or an inducible promoter (e.g., TRE3G). A schematic showing small-molecule-mediated dimerization is also shown (bottom). After both polypeptides of a split recombinase are expressed separately, or otherwise separated by autocatalysis, they remain separated such that minimal recombinase activity is present, until a dimerizing small molecule is added, causing association of the two dimerization domains and reconstituting recombinase function.

FIGs. 2A-2D show the genetic schematics (left) and experimental results (right) for four embodiments of split Cre recombinases (CreN: N-terminal portion of Cre recombinase; CreC: C-terminal portion of Cre recombinase). Experiments were performed in the presence (+SM) and absence (-SM) of small molecule inducers. FIG. 2A provides a genetic schematic (left) and experimental results (right) for a first embodiment (vl) of a split Cre recombinase. FIG. 2G provides a genetic schematic (left) and experimental results (right) for a second (v2) embodiment of a split Cre recombinase. FIG. 2C provides a genetic schematic (left) and experimental results (right) for a third embodiment (v3) of a split Cre recombinase. FIG. 2D provides a genetic schematic (left) and experimental results (right) for a fourth embodiment (v4) of a split Cre recombinase. FIG. 3 shows experimental results for several split recombinases (e.g., “Flp 27/28- GA” and “Flp 396/397-ABA”) compared to their respective non-split forms (e.g., “Flp”). The amino acid position of the split in each split recombinase is shown (e.g., Flp 27/28 indicates a split between amino acids 27 and 28 of Flp), as well as the small molecule inducer that was used to induce dimerization (e.g., GA: gibberellic acid; ABA: abscisic acid; Rap: rapalog). -SM; minus small molecule inducer; +SM; plus small molecule inducer.

FIG. 4 shows results for eleven embodiments of split Bxbl recombinases. GID1 and GAI dimerization domains were used, and dimerization was induced in the presence of GA (gibberellic acid). The amino acid position of the split in each split recombinase is shown (e.g., 468/469 indicates a split between amino acids 468 and 469 of Bxbl). -SM; minus small molecule inducer; +SM; plus small molecule inducer.

FIGs. 5A-5L show FACS results for the eleven embodiments of split Bxbl recombinases of FIG. 4. FIG. 5A: unsplit Bxbl; FIG. 5B: Bxbl 37/38; FIG. 5C: Bxbl 169/170; FIG. 5D: Bxbl 208/209; FIG. 5E: Bxbl 222/223; FIG. 5F: Bxbl 259/260; FIG. 5G: Bxbl 262/263; FIG. 5H: Bxbl 363/364; FIG. 51: Bxbl 370/371; FIG. 5J: Bxbl 399/400; FIG. 5K: Bxbl 440/441; FIG. 5L: Bxbl 468/469. -SM; minus small molecule inducer; +SM; plus small molecule inducer. X-axis shows iRFP720 transfection marker fluorescence measured in the APC-A700 channel. Y-axis shows EGFP reporter expression measured in the FITC channel.

FIGs. 6A-5H shows FACS results for eight embodiments of split recombinases compared to their respective non-split forms. FIG. 6A: Flp 27/28 (GA small molecule inducer); FIG. 6B: Flp 396/397 (ABA small molecule inducer); FIG. 6C: Cre 229/230 (GA small molecule inducer); FIG. 6D: VCre 269/270 (GA small molecule inducer); FIG. 6E: Phi 233/234 (GA small molecule inducer); FIG. 6F: Phi 571/572 (Rap small molecule inducer); FIG. 6G: TP901 326/327 (GA small molecule inducer); FIG. 6H: Bxbl 468/469. -SM; minus small molecule inducer; +SM; plus small molecule inducer. All charts are in terms of transfection marker (iRFP measured in APC-700 channel) and fluorescent readout (EGFP measured in FITC channel). VCre plasmid without split had a nonsense mutation and has no data.

DETAILED DESCRIPTION OF INVENTION

Split recombinases operate by expressing a recombinase in two parts, each part incapable of independent catalytic activity (see e.g., Weinberg et al., Nat Commun. 2019 Oct 24;10(l):4845. Doi: 10.1038/s41467-019-12800-7). Split recombinases may be expressed linked to a domain capable of dimerizing in the presence of a small molecule, reconstituting the recombinase, and restoring catalytic activity.

In addition to providing previously undescribed split recombinases, the instant disclosure provides polynucleic acid molecules that encode a split recombinase in a single expression cassette. This allows one to express the split recombinase from a single transcription unit, rather than from separate promoters or plasmids.

The split recombinases (and polynucleic acid molecules encoding the same) that are described herein have various applications. For example, disclosed herein are adenovirus (AAV) production systems comprising a split recombinase, wherein the split recombinase mediates inducible control of expression of an AAV gene product(s) required for AAV production, such as a cytostatic or cytotoxic AAV gene product(s) (e.g., Rep, E2A and E4). The cytotoxic and cytostatic nature of these gene products has hampered the development of stable AAV producer cell lines.

Also described herein are engineered cells and kits comprising the split recombinases.

I. Split Recombinases

In some aspects, the disclosure relates to split recombinases. A “split recombinase” as described herein is a polypeptide dimer comprising a first polypeptide and a second polypeptide, wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization, but have recombinase activity when dimerized. In particular, the split recombinases described herein comprise: (i) a first polypeptide comprising a first portion of a recombinase and a first dimerization domain; and (ii) a second polypeptide comprising a second portion of the recombinase and a second dimerization domain; wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity.

As used herein, the term “recombinase activity” refers to the ability to catalyze sitespecific recombination events within DNA. Methods of determining whether a split recombinase has recombinase activity (e.g., in the presence and absence of dimerization) are known to those having skill in the art. Exemplary methods of determining whether a split recombinase has recombinase activity are provided herein in the Examples section.

Recombinase activity of a split recombinase can be controlled (e.g., induced) in various ways. For example, in some embodiments, dimerization of the first polypeptide and the second polypeptide of a split recombinase may depend on the presence of a small molecule inducer. Alternatively, or in addition, in some embodiments, the nucleic acid sequence encoding at least one polypeptide of a split recombinase dimer (and optionally the nucleic acid sequence(s) of both polypeptides of the split recombinase dimer) is operably linked to an inducible promoter.

In some embodiments, the first polypeptide of a split recombinase comprises, from N- terminus to C-terminus: the first portion of the recombinase; and the first dimerization domain. In other embodiments, the first polypeptide of a split recombinase comprises, from N-terminus to C-terminus: the first dimerization domain; and the first portion of the recombinase.

In some embodiments, the second polypeptide of a split recombinase comprises, from N-terminus to C-terminus: the second portion of the recombinase; and the second dimerization domain. In other embodiments, the second polypeptide of a split recombinase comprises, from N-terminus to C-terminus: the second dimerization domain; and the second portion of the recombinase.

In some embodiments, the first portion of the recombinase corresponds to the N- terminal portion of the recombinase from which the split recombinase is derived, and the second portion of the recombinase corresponds to the C-terminal portion of the recombinase from which the split recombinase is derived. In other embodiments, the second portion of the recombinase corresponds to the N-terminal portion of the recombinase from which the split recombinase is derived, and the first portion of the recombinase corresponds to the C- terminal portion of the recombinase from which the split recombinase is derived. a. First and Second Portions of a Split Recombinase

A split recombinase may be derived from any previously described recombinase (see e.g., Weinberg et al., Nat Commun. 2019 Oct 24;10(l):4845. Doi: 10.1038/s41467-019- 12800-7). Exemplary recombinase amino acid sequences which have been used herein to derive split recombinases are provided in Table 2.

In some embodiments, a split recombinase described herein comprises a first portion and a second portion of a recombinase selected from the group consisting of a Flp recombinase (e.g., SEQ ID NO: 1), a Bxbl recombinase (e.g., SEQ ID NO: 2), a PhiC31 recombinase (e.g., SEQ ID NO: 3), a TP901 recombinase (e.g., SEQ ID NO: 4), a Cre recombinase (e.g., SEQ ID NO: 5), a Vcre recombinase (e.g., SEQ ID NO: 6), an Inti recombinase (e.g., SEQ ID NO: 7), an Int2 recombinase (e.g., SEQ ID NO: 8), an Int3 recombinase (e.g., SEQ ID NO: 9), an Int4 recombinase (e.g., SEQ ID NO: 10), an Int5 recombinase (e.g., SEQ ID NO: 11), an Int6 recombinase (e.g., SEQ ID NO: 12), an Int7 recombinase (e.g., SEQ ID NO: 13), an Int8 recombinase (e.g., SEQ ID NO: 14), an Int9 recombinase (e.g., SEQ ID NO: 15), an IntlO recombinase (e.g., SEQ ID NO: 16), an Inti 1 recombinase (e.g., SEQ ID NO: 17), an Intl2 recombinase (e.g., SEQ ID NO: 18), an Intl3 recombinase (e.g., SEQ ID NO: 19), an Intl4 recombinase (e.g., SEQ ID NO: 20), an Intl5 recombinase (e.g., SEQ ID NO: 21), an Intl6 recombinase (e.g., SEQ ID NO: 22), an Intl7 recombinase (e.g., SEQ ID NO: 23), an Intl8 recombinase (e.g., SEQ ID NO: 24), an Intl9 recombinase (e.g., SEQ ID NO: 25), an Int20 recombinase (e.g., SEQ ID NO: 26), an Int21 recombinase (e.g., SEQ ID NO: 27), an Int22 recombinase (e.g., SEQ ID NO: 28), an Int23 recombinase (e.g., SEQ ID NO: 29), an Int24 recombinase (e.g., SEQ ID NO: 30), an Int25 recombinase (e.g., SEQ ID NO: 31), an Int26 recombinase (e.g., SEQ ID NO: 32), an Int27 recombinase (e.g., SEQ ID NO: 33), an Int28 recombinase (e.g., SEQ ID NO: 34), an Int29 recombinase (e.g., SEQ ID NO: 35), an Int30 recombinase (e.g., SEQ ID NO: 36), an Int31 recombinase (e.g., SEQ ID NO: 37), an Int32 recombinase (e.g., SEQ ID NO: 38), an Int33 recombinase (e.g., SEQ ID NO: 39), an Int34 recombinase (e.g., SEQ ID NO: 40), an R4 recombinase (e.g., SEQ ID NO: 238), or a Dre recombinase (e.g., SEQ ID NO: 239), wherein the first portion and the second portion individually lack recombinase activity, but collectively have recombinase activity.

In some embodiments, a split Flp recombinase described herein comprises a first portion and a second portion of a Flp recombinase (e.g., SEQ ID NO: 1), wherein the first portion and the second portion individually lack recombinase activity, but collectively (i.e., when bound together covalently or non-covalently) have recombinase activity. In some embodiments, the first portion of Flp recombinase corresponds to the N-terminal portion of Flp recombinase, and the second portion of Flp recombinase corresponds to the C-terminal portion of Flp recombinase. In other embodiments, the second portion of Flp recombinase corresponds to the N-terminal portion of Flp recombinase, and the first portion of Flp recombinase corresponds to the C-terminal portion of Flp recombinase.

In some embodiments, the first portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 41, and the second portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 42. In some embodiments, the first portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 41, and the second portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 42. In some embodiments, the first portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 42, and the second portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 41. In some embodiments, the first portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 42, and the second portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 41.

In some embodiments, the first portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 43, and the second portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 44. In some embodiments, the first portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 43, and the second portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 44. In some embodiments, the first portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 44, and the second portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 43. In some embodiments, the first portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 44, and the second portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 43.

In some embodiments, a split Bxbl recombinase described herein comprises a first portion and a second portion of a Bxbl recombinase (e.g., SEQ ID NO: 2), wherein the first portion and the second portion individually lack recombinase activity, but collectively (i.e., when bound together covalently or non-covalently) have recombinase activity. In some embodiments, the first portion of Bxbl recombinase corresponds to the N-terminal portion of Bxbl recombinase, and the second portion of Bxbl recombinase corresponds to the C- terminal portion of Bxbl recombinase. In other embodiments, the second portion of Bxbl recombinase corresponds to the N-terminal portion of Bxbl recombinase, and the first portion of Bxbl recombinase corresponds to the C-terminal portion of Bxbl recombinase.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 45, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 46. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 45, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 46. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 46, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 45. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 46, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 45.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 47, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 48. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 47, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 48. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 48, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 47. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 48, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 47.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 49, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 50. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 49, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 50. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 50, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 49. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 50, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 49.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 51, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 52. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 51, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 52. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 52, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 51. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 52, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 51.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 53, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 54. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 53, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 54. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 54, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 53. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 54, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 53. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 55, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 56. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 55, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 56. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 56, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 55. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 56, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 55.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 57, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 58. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 57, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 58. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 58, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 57. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 58, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 57.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 59, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 60. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 59, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 60. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 60, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 59. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 60, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 59.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 61, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 62. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 61, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 62. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 62, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 61. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 62, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 61.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 63, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 64. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 63, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 64. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 64, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 63. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 64, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 63.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 65, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 66. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 65, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 66. In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 66, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 65. In some embodiments, the first portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 66, and the second portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 65.

In some embodiments, a split PhiC31 recombinase described herein comprises a first portion and a second portion of a PhiC31 recombinase (e.g., SEQ ID NO: 3), wherein the first portion and the second portion individually lack recombinase activity, but collectively (i.e., when bound together covalently or non-covalently) have recombinase activity. In some embodiments, the first portion of PhiC31 recombinase corresponds to the N-terminal portion of PhiC31 recombinase, and the second portion of PhiC31 recombinase corresponds to the C- terminal portion of PhiC31 recombinase. In other embodiments, the second portion of PhiC31 recombinase corresponds to the N-terminal portion of PhiC31 recombinase, and the first portion of PhiC31 recombinase corresponds to the C-terminal portion of PhiC31 recombinase.

In some embodiments, the first portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 67, and the second portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 68. In some embodiments, the first portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 67, and the second portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 68. In some embodiments, the first portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 68, and the second portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 67. In some embodiments, the first portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 68, and the second portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 67.

In some embodiments, the first portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 69, and the second portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 70. In some embodiments, the first portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 69, and the second portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 70. In some embodiments, the first portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 70, and the second portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 69. In some embodiments, the first portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 70, and the second portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 69.

In some embodiments, a split TP901 recombinase described herein comprises a first portion and a second portion of a TP901 recombinase (e.g., SEQ ID NO: 4), wherein the first portion and the second portion individually lack recombinase activity, but collectively (i.e., when bound together covalently or non-covalently) have recombinase activity. In some embodiments, the first portion of TP901 recombinase corresponds to the N-terminal portion of TP901 recombinase, and the second portion of TP901 recombinase corresponds to the C- terminal portion of TP901 recombinase. In other embodiments, the second portion of PhiC31 recombinase corresponds to the N-terminal portion of TP901 recombinase, and the first portion of TP901 recombinase corresponds to the C-terminal portion of TP901 recombinase.

In some embodiments, the first portion of TP901 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 71, and the second portion of TP901 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 72. In some embodiments, the first portion of TP901 recombinase comprises the amino acid sequence of SEQ ID NO: 71, and the second portion of TP901 recombinase comprises the amino acid sequence of SEQ ID NO: 72. In some embodiments, the first portion of TP901 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 72, and the second portion of TP901 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 71. In some embodiments, the first portion of TP901 recombinase comprises the amino acid sequence of SEQ ID NO: 72, and the second portion of TP901 recombinase comprises the amino acid sequence of SEQ ID NO: 71.

In some embodiments, a split Cre recombinase described herein comprises a first portion and a second portion of a Cre recombinase (e.g., SEQ ID NO: 5), wherein the first portion and the second portion individually lack recombinase activity, but collectively (i.e., when bound together covalently or non-covalently) have recombinase activity. In some embodiments, the first portion of Cre recombinase corresponds to the N-terminal portion of Cre recombinase, and the second portion of Cre recombinase corresponds to the C-terminal portion of Cre recombinase. In other embodiments, the second portion of Cre recombinase corresponds to the N-terminal portion of Cre recombinase, and the first portion of Cre recombinase corresponds to the C-terminal portion of Cre recombinase.

In some embodiments, the first portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 73, and the second portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 74. In some embodiments, the first portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 73, and the second portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 74. In some embodiments, the first portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 74, and the second portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 73. In some embodiments, the first portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 74, and the second portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 73.

In some embodiments, the first portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 75, and the second portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 76. In some embodiments, the first portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 75, and the second portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 76. In some embodiments, the first portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 76, and the second portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 75. In some embodiments, the first portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 76, and the second portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 75.

In some embodiments, a split Vcre recombinase described herein comprises a first portion and a second portion of a Vcre recombinase (e.g., SEQ ID NO: 6), wherein the first portion and the second portion individually lack recombinase activity, but collectively (i.e., when bound together covalently or non-covalently) have recombinase activity. In some embodiments, the first portion of Vcre recombinase corresponds to the N-terminal portion of Vcre recombinase, and the second portion of Vcre recombinase corresponds to the C-terminal portion of Vcre recombinase. In other embodiments, the second portion of Vcre recombinase corresponds to the N-terminal portion of Vcre recombinase, and the first portion of Vcre recombinase corresponds to the C-terminal portion of Vcre recombinase.

In some embodiments, the first portion of Vcre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 77, and the second portion of Vcre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 78. In some embodiments, the first portion of Vcre recombinase comprises the amino acid sequence of SEQ ID NO: 77, and the second portion of Vcre recombinase comprises the amino acid sequence of SEQ ID NO: 78. In some embodiments, the first portion of Vcre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 78, and the second portion of Vcre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 77. In some embodiments, the first portion of Vcre recombinase comprises the amino acid sequence of SEQ ID NO: 78, and the second portion of Vcre recombinase comprises the amino acid sequence of SEQ ID NO: 77. b. Dimerization Domains

As described above, the split recombinases described herein comprise: (i) a first polypeptide comprising a first portion of a recombinase and a first dimerization domain; and (ii) a second polypeptide comprising a second portion of the recombinase and a second dimerization domain; wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity.

Exemplary dimerization domain pairs (i.e., a pair consisting of a first dimerization domain and second dimerization domain) that can be fused to pairs of polypeptides to render the polypeptides capable of dimerization are known to those having ordinary skill in the art.

In some embodiments, a first dimerization domain and a second dimerization domain are capable of dimerizing in the absence of a small molecule inducer.

In some embodiments, dimerization of a first dimerization domain and a second dimerization domain is induced by the presence of a small molecule inducer. For example, in some embodiments, dimerization of a first dimerization domain and a second dimerization domain is induced by the presence of gibberellic acid (GA). In some embodiments, dimerization of a first dimerization domain and a second dimerization domain is induced by the presence of abscisic acid (ABA). In some embodiments, dimerization of a first dimerization domain and a second dimerization domain is induced by the presence of rapalog (Rap).

In some embodiments, a dimerization domain pair comprises a GID1 dimerization domain and a GAI dimerization domain. In some embodiments, the GID1 dimerization domain comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 79 and the GAI dimerization domain comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 80. In some embodiments, the GID1 dimerization domain comprises the amino acid sequence of SEQ ID NO: 79, and the GAI dimerization domain comprises the amino acid sequence of SEQ ID NO: 80. In some embodiments, a dimerization domain pair comprises an ABI dimerization domain and a PYL dimerization domain. In some embodiments, the ABI dimerization domain comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 81 and the PYL dimerization domain comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 82. In some embodiments, the ABI dimerization domain comprises the amino acid sequence of SEQ ID NO: 81, and the PYL dimerization domain comprises the amino acid sequence of SEQ ID NO: 82.

In some embodiments, a dimerization domain pair comprises an FRB dimerization domain and a FKBP dimerization domain. In some embodiments, the FRB dimerization domain comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 83 and the FKBP dimerization domain comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 84. In some embodiments, the FRB dimerization domain comprises the amino acid sequence of SEQ ID NO: 83, and the FKBP dimerization domain comprises the amino acid sequence of SEQ ID NO: 84. c. Exemplary Split Recombinases

In some embodiments, a split recombinase comprises a combination of features provided in Table 1.

Exemplary Split Flp Recombinases

In some embodiments, a split Flp recombinase comprises: (i) a first polypeptide comprising a first portion of a Flp recombinase and a GID1 dimerization domain (as provided in Part lb, above); and (ii) a second polypeptide comprising a second portion of the Flp recombinase and a GAI dimerization domain (as provided in Part lb, above); wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity. In some embodiments, the first polypeptide comprises, from N-terminus to C-terminus: the first portion of Flp recombinase; and the GID1 dimerization domain. In some embodiments, the first polypeptide comprises, from N- terminus to C-terminus: the GID1 dimerization domain; and the first portion of Flp recombinase. In some embodiments, the second polypeptide comprises, from N-terminus to C -terminus: the second portion of Flp recombinase; and the GAI dimerization domain. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the GAI dimerization domain; and the second portion of Flp recombinase. In some embodiments, the first portion of Flp recombinase corresponds to the N-terminal portion of Flp recombinase, and the second portion of Flp recombinase corresponds to the C-terminal portion of Flp recombinase. In other embodiments, the first portion of Flp recombinase corresponds to the C-terminal portion of Flp recombinase, and the second portion of Flp recombinase corresponds to the N-terminal portion of Flp recombinase. In some embodiments, the N-terminal portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 41, and the C-terminal portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 42. In some embodiments, the N-terminal portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 41, and the C-terminal portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 42. In some embodiments, the split Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 90. In some embodiments, the split Flp recombinase comprises the amino acid sequence of SEQ ID NO: 90.

In some embodiments, a split Flp recombinase comprises: (i) a first polypeptide comprising a first portion of a Flp recombinase and an ABI dimerization domain (as provided in Part lb, above); and (ii) a second polypeptide comprising a second portion of a Flp recombinase and a PYL dimerization domain (as provided in Part lb, above); wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity. In some embodiments, the first polypeptide comprises, from N-terminus to C-terminus: the first portion of Flp recombinase; and the ABI dimerization domain. In some embodiments, the first polypeptide comprises, from N- terminus to C-terminus: the ABI dimerization domain; and the first portion of Flp recombinase. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the second portion of Flp recombinase; and the PYL dimerization domain. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the PYL dimerization domain; and the second portion of Flp recombinase. In some embodiments, the first portion of Flp recombinase corresponds to the N-terminal portion of Flp recombinase, and the second portion of Flp recombinase corresponds to the C-terminal portion of Flp recombinase. In other embodiments, the first portion of Flp recombinase corresponds to the C-terminal portion of Flp recombinase, and the second portion of Flp recombinase corresponds to the N-terminal portion of Flp recombinase. In some embodiments, the N-terminal portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 43, and the C-terminal portion of Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 44. In some embodiments, the N-terminal portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 43, and the C-terminal portion of Flp recombinase comprises the amino acid sequence of SEQ ID NO: 44. In some embodiments, the split Flp recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 91. In some embodiments, the split Flp recombinase comprises the amino acid sequence of SEQ ID NO: 91.

Exemplary Split Bxbl Recombinases

In some embodiments, a split Bxbl recombinase comprises: (i) a first polypeptide comprising a first portion of a Bxbl recombinase and a GID1 dimerization domain (as provided in Part lb, above); and (ii) a second polypeptide comprising a second portion of the Bxbl recombinase and a GAI dimerization domain (as provided in Part lb, above); wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity. In some embodiments, the first polypeptide comprises, from N-terminus to C-terminus: the first portion of Bxbl recombinase; and the GID1 dimerization domain. In some embodiments, the first polypeptide comprises, from N- terminus to C-terminus: the GID1 dimerization domain; and the first portion of Bxbl recombinase. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the second portion of Bxbl recombinase; and the GAI dimerization domain. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the GAI dimerization domain; and the second portion of Bxbl recombinase. In some embodiments, the first portion of Bxbl recombinase corresponds to the N-terminal portion of Bxbl recombinase, and the second portion of Bxbl recombinase corresponds to the C- terminal portion of Bxbl recombinase. In other embodiments, the first portion of Bxbl recombinase corresponds to the C-terminal portion of Bxbl recombinase, and the second portion of Bxbl recombinase corresponds to the N-terminal portion of Bxbl recombinase.

In some embodiments, the N-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 45, and the C-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 46. In some embodiments, the N-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 45, and the C-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 46.

In some embodiments, the first portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 47, and the second portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 48. In some embodiments, the N-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 47, and the C-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 48.

In some embodiments, the N-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 49, and the C-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 50. In some embodiments, the N-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 49, and the C-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 50.

In some embodiments, the N-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 51, and the C-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 52. In some embodiments, the N-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 51, and the C-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 52. In some embodiments, the N-terminal portion ofBxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 53, and the C-terminal portion ofBxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 54. In some embodiments, the N-terminal portion ofBxbl recombinase comprises the amino acid sequence of SEQ ID NO: 53, and the C-terminal portion ofBxbl recombinase comprises the amino acid sequence of SEQ ID NO: 54.

In some embodiments, the N-terminal portion ofBxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 55, and the C-terminal portion ofBxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 56. In some embodiments, the N-terminal portion ofBxbl recombinase comprises the amino acid sequence of SEQ ID NO: 55, and the C-terminal portion ofBxbl recombinase comprises the amino acid sequence of SEQ ID NO: 56.

In some embodiments, the N-terminal portion ofBxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 57, and the C-terminal portion ofBxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 58. In some embodiments, the N-terminal portion ofBxbl recombinase comprises the amino acid sequence of SEQ ID NO: 57, and the C-terminal portion ofBxbl recombinase comprises the amino acid sequence of SEQ ID NO: 58.

In some embodiments, the N-terminal portion ofBxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 59, and the C-terminal portion ofBxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 60. In some embodiments, the N-terminal portion ofBxbl recombinase comprises the amino acid sequence of SEQ ID NO: 59, and the C-terminal portion ofBxbl recombinase comprises the amino acid sequence of SEQ ID NO: 60.

In some embodiments, the N-terminal portion ofBxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 61, and the C-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 62. In some embodiments, the N-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 61, and the C-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 62.

In some embodiments, the N-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 63, and the C-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 64. In some embodiments, the N-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 63, and the C-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 64.

In some embodiments, the N-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 65, and the C-terminal portion of Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 66. In some embodiments, the N-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 65, and the C-terminal portion of Bxbl recombinase comprises the amino acid sequence of SEQ ID NO: 66.

In some embodiments, the split Bxbl recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to any one of SEQ ID NOs: 92-102. In some embodiments, the split Bxbl recombinase comprises the amino acid sequence of any one of SEQ ID NOs: 92-102.

Exemplary Split PhiC31 Recombinases

In some embodiments, a split PhiC31 recombinase comprises: (i) a first polypeptide comprising a first portion of a PhiC31 recombinase and a GID1 dimerization domain (as provided in Part lb, above); and (ii) a second polypeptide comprising a second portion of the PhiC31 recombinase and a GAI dimerization domain (as provided in Part lb, above); wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity. In some embodiments, the first polypeptide comprises, from N-terminus to C-terminus: the first portion of PhiC31 recombinase; and the GID1 dimerization domain. In some embodiments, the first polypeptide comprises, from N- terminus to C-terminus: the GID1 dimerization domain; and the first portion of PhiC31 recombinase. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the second portion of PhiC31 recombinase; and the GAI dimerization domain. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the GAI dimerization domain; and the second portion of PhiC31 recombinase. In some embodiments, the first portion of PhiC31 recombinase corresponds to the N-terminal portion of PhiC31 recombinase, and the second portion of PhiC31 recombinase corresponds to the C- terminal portion of PhiC31 recombinase. In other embodiments, the first portion of PhiC31 recombinase corresponds to the C-terminal portion of PhiC31 recombinase, and the second portion of PhiC31 recombinase corresponds to the N-terminal portion of PhiC31 recombinase. In some embodiments, the N-terminal portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 67, and the C-terminal portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 68. In some embodiments, the N-terminal portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 67, and the C-terminal portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 68. In some embodiments, the split PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 104. In some embodiments, the split PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 104.

In some embodiments, a split PhiC31 recombinase comprises: (i) a first polypeptide comprising a first portion of a PhiC31 recombinase and a FRB dimerization domain (as provided in Part lb, above); and (ii) a second polypeptide comprising a second portion of a PhiC31 recombinase and a FKBP dimerization domain (as provided in Part lb, above); wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity. In some embodiments, the first polypeptide comprises, from N-terminus to C-terminus: the first portion of PhiC31 recombinase; and the FRB dimerization domain. In some embodiments, the first polypeptide comprises, from N-terminus to C-terminus: the FRB dimerization domain; and the first portion of PhiC31 recombinase. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the second portion of PhiC31 recombinase; and the FKBP dimerization domain. In some embodiments, the second polypeptide comprises, from N- terminus to C-terminus: the FKBP dimerization domain; and the second portion of PhiC31 recombinase. In some embodiments, the first portion of PhiC31 recombinase corresponds to the N-terminal portion ofPhiC31 recombinase, and the second portion ofPhiC31 recombinase corresponds to the C-terminal portion of PhiC31 recombinase. In other embodiments, the first portion of PhiC31 recombinase corresponds to the C-terminal portion of PhiC31 recombinase, and the second portion of PhiC31 recombinase corresponds to the N- terminal portion of PhiC31 recombinase. In some embodiments, the N-terminal portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 69, and the C- terminal portion of PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 70. In some embodiments, the N-terminal portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 69, and the C-terminal portion of PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 70. In some embodiments, the split PhiC31 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 105. In some embodiments, the split PhiC31 recombinase comprises the amino acid sequence of SEQ ID NO: 105.

Exemplary Split TP901 Recombinases

In some embodiments, a split TP901 recombinase comprises: (i) a first polypeptide comprising a first portion of a TP901 recombinase and a GID1 dimerization domain (as provided in Part lb, above); and (ii) a second polypeptide comprising a second portion of the TP901 recombinase and a GAI dimerization domain (as provided in Part lb, above); wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity. In some embodiments, the first polypeptide comprises, from N-terminus to C-terminus: the first portion of TP901 recombinase; and the GID1 dimerization domain. In some embodiments, the first polypeptide comprises, from N- terminus to C-terminus: the GID1 dimerization domain; and the first portion of TP901 recombinase. In some embodiments, the second polypeptide comprises, from N-terminus to C -terminus: the second portion of TP901 recombinase; and the GAI dimerization domain. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the GAI dimerization domain; and the second portion of TP901 recombinase. In some embodiments, the first portion of TP901 recombinase corresponds to the N-terminal portion of TP901 recombinase, and the second portion of TP901 recombinase corresponds to the C- terminal portion of TP901 recombinase. In other embodiments, the first portion of TP901 recombinase corresponds to the C-terminal portion of TP901 recombinase, and the second portion of TP901 recombinase corresponds to the N-terminal portion of TP901 recombinase. In some embodiments, the N-terminal portion of TP901 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 71, and the C-terminal portion of TP901 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 72. In some embodiments, the N- terminal portion of TP901 recombinase comprises the amino acid sequence of SEQ ID NO: 71, and the C-terminal portion of TP901 recombinase comprises the amino acid sequence of SEQ ID NO: 72. In some embodiments, the split TP901 recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 106. In some embodiments, the split TP901 recombinase comprises the amino acid sequence of SEQ ID NO: 106.

Exemplary Split Cre Recombinases

In some embodiments, a split Cre recombinase comprises: (i) a first polypeptide comprising a first portion of a Cre recombinase and a ABI dimerization domain (as provided in Part lb, above); and (ii) a second polypeptide comprising a second portion of the Cre recombinase and a PYL dimerization domain (as provided in Part lb, above); wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity. In some embodiments, the first polypeptide comprises, from N-terminus to C-terminus: the first portion of Cre recombinase; and the ABI dimerization domain. In some embodiments, the first polypeptide comprises, from N- terminus to C-terminus: the ABI dimerization domain; and the first portion of Cre recombinase. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the second portion of Cre recombinase; and the PYL dimerization domain. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the PYL dimerization domain; and the second portion of Cre recombinase. In some embodiments, the first portion of Cre recombinase corresponds to the N-terminal portion of Cre recombinase, and the second portion of Cre recombinase corresponds to the C-terminal portion of Cre recombinase. In other embodiments, the first portion of Cre recombinase corresponds to the C-terminal portion of Cre recombinase, and the second portion of Cre recombinase corresponds to the N-terminal portion of Cre recombinase.

In some embodiments, the N-terminal portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 73, and the C-terminal portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 74. In some embodiments, the N-terminal portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 73, and the C-terminal portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 74.

In some embodiments, the N-terminal portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 75, and the C-terminal portion of Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 76. In some embodiments, the N-terminal portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 75, and the C-terminal portion of Cre recombinase comprises the amino acid sequence of SEQ ID NO: 76.

In some embodiments, the split Cre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to any one of SEQ ID NOs: 107-109. In some embodiments, the split Cre recombinase comprises the amino acid sequence of any one of SEQ ID NOs: 107-109.

Exemplary Split Vcre Recombinases

In some embodiments, a split Vcre recombinase comprises: (i) a first polypeptide comprising a first portion of a Vcre recombinase and a GID1 dimerization domain (as provided in Part lb, above); and (ii) a second polypeptide comprising a second portion of the Vcre recombinase and a GAI dimerization domain (as provided in Part lb, above); wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity. In some embodiments, the first polypeptide comprises, from N-terminus to C-terminus: the first portion of Vcre recombinase; and the GID1 dimerization domain. In some embodiments, the first polypeptide comprises, from N- terminus to C-terminus: the GID1 dimerization domain; and the first portion of VCre recombinase. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the second portion of Vcre recombinase; and the GAI dimerization domain. In some embodiments, the second polypeptide comprises, from N-terminus to C-terminus: the GAI dimerization domain; and the second portion of Vcre recombinase. In some embodiments, the first portion of Vcre recombinase corresponds to the N-terminal portion of Vcre recombinase, and the second portion of Vcre recombinase corresponds to the C-terminal portion of Vcre recombinase. In other embodiments, the first portion of Vcre recombinase corresponds to the C-terminal portion of Vcre recombinase, and the second portion of Vcre recombinase corresponds to the N-terminal portion of Vcre recombinase. In some embodiments, the N-terminal portion of Vcre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 77, and the C-terminal portion of Vcre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 78. In some embodiments, the N-terminal portion of Vcre recombinase comprises the amino acid sequence of SEQ ID NO: 77, and the C-terminal portion of Vcre recombinase comprises the amino acid sequence of SEQ ID NO: 78. In some embodiments, the split Vcre recombinase comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to SEQ ID NO: 110. In some embodiments, the split Vcre recombinase comprises the amino acid sequence of SEQ ID NO: 110. d. Percent Identity

As used herein, the term “percent identity” (or “% identity”) refers to a relationship between the sequences of two polypeptides or polynucleotides, as determined by sequence comparison (alignment). In some embodiments, identity is determined across the entire length of a sequence. In some embodiments, identity is determined over a region of a sequence. Identity of related polypeptides or nucleic acid sequences can be readily calculated by those having ordinary skill in the art. For example, the percent identity of two sequences (e.g., nucleic acid or amino acid sequences) may be determined using BLAST®, NBLAST®, XBLAST®, Gapped BLAST®, and Clustal Omega programs, using default parameters of the respective programs. In some embodiments, the identity of two polypeptides is determined by aligning the two amino acid sequences, calculating the number of identical amino acids, and dividing by the length of one of the amino acid sequences. In some embodiments, the identity of two nucleic acids is determined by aligning the two nucleotide sequences and calculating the number of identical nucleotides and dividing by the length of one of the nucleic acids.

II. Polynucleic Acid Molecules Encoding a Split Recombinase

In some aspects, the disclosure relates to polynucleic acid molecules (or combinations of polynucleic acid molecules) encoding a split recombinase described herein. In some embodiments, a split recombinase is encoded by a single expression cassette. In other embodiments, a split recombinase is encoded by two expression cassettes.

As used herein, the term “expression cassette” refers to a nucleic acid sequence encoding a gene product (i.e., a mRNA or polypeptide) operably linked to a promoter.

As used herein, the term “promoter” refers to a nucleic acid sequence that is bound by proteins to initiate transcription of RNA from DNA. A promoter may be a constitutive promoter (i.e., an unregulated promoter that allows for continual transcription). Examples of constitutive promoters are known in the art and include, but are not limited to, cytomegalovirus (CMV) promoters, elongation factor 1 a (EFla) promoters, simian vacuolating virus 40 (SV40) promoters, ubiquitin-C (UBC) promoters, U6 promoters, p5 promoters, pl9 promoters, p40 promoters, E2A promoters, E4 promoters and phosphoglycerate kinase (PGK) promoters. See e.g., Ferreira et al. Proc. Natl. Acad. Sci. U.S.A. 2013 Jul; 110(28): 11284-89; Pub. No.: US 2014/377861 Al; Qin et al. PloS one 5.5 (2010): el0611. - the entireties of which are incorporated herein by reference. Alternatively, a promoter may be an inducible promoter (i.e., activates transcription under specific circumstances). An inducible promoter may be a chemically inducible promoter, a temperature inducible promoter, or a light inducible promoter. Additional types of inducible promoters are known to those having ordinary skill in the art. Examples of inducible promoters are known in the art and include, but are not limited to, tetracycline/doxycy cline inducible promoters, cumate inducible promoters, ABA inducible promoters, CRY2-CIB1 inducible promoters, DAPG inducible promoters, pTRE3G promoters, pTREtight promoters, the Gal4 UAS operator sequences and mifepristone inducible promoters, and a promoters containing at least one of VanR, TtgR, PhlF, or CymR operator sequences. See e.g., Stanton et al., ACS Synth. Biol. 2014 Dec 19; 3(12): 880-91; Liang et al., Sci. Signal. 2011 Mar 15; 4(164): rs2; Patent No.: US 7,745,592 B2; Patent No.: US 7,935,788 B2 - the entireties of which are incorporated herein by reference.

In some embodiments, expression of the first polypeptide and/or the second polypeptide of a split-recombinase is under the control of a constitutive promoter. In some embodiments, expression of the first polypeptide and/or the second polypeptide of a splitrecombinase is under the control of an inducible promoter. a. Single Expression Cassette Embodiments

In some embodiments, a polynucleic acid described herein comprises an expression cassette encoding for a polycistronic mRNA, wherein the polycistronic mRNA comprises: (i) a sequence encoding for a first polypeptide of a split recombinase (e.g., as described above); (ii) a sequence encoding for an intercistronic region; and (iii) a sequence encoding for a second polypeptide of a split recombinase (e.g., as described above); wherein the sequence encoding for the intercistronic region is flanked on one end by the sequence encoding for the first polypeptide and on the other end by the sequence encoding for the second polypeptide.

In some embodiments, the intercistronic region comprises a nucleic acid sequence encoding an internal ribosomal entry site (IRES). Various IRES sequences have been described previously and are known to those having ordinary skill in the art. In some embodiments, an IRES comprises a nucleic acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to any one of SEQ ID NOs: 85-87. In some embodiments, an IRES comprises the nucleic acid sequence of any one of SEQ ID NOs: 85-87.

In some embodiments, the intercistronic region comprises a nucleic acid sequence encoding a 2A peptide. Various 2A peptides have been described previously and are known to those having ordinary skill in the art. In some embodiments, a 2A peptide comprises an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to any one of SEQ ID NOs: 88-89 and 236-237. In some embodiments, a 2A peptide comprises an amino acid sequence of any one of SEQ ID NOs: 88-89 and 236-237.

In some embodiments, a polynucleic acid molecule encoding a split recombinase (or polypeptide dimer having recombinase activity) comprises, from 5’ to 3’: (i) a sequence encoding for a first polypeptide comprising a first portion of a recombinase and a first dimerization domain; (ii) a sequence encoding for a viral 2A peptide and/or an internal ribosomal entry site (IRES); and (iii) a sequence encoding for a second polypeptide comprising a second portion of a recombinase and a second dimerization domain; wherein the first polypeptide and the second polypeptide lack recombinase activity in the absence of dimerization; and wherein a recombinase dimer comprising the first polypeptide and the second polypeptide has recombinase activity.

In some embodiments, the first dimerization domain of the first polypeptide is N- terminal to the first portion of the recombinase. In some embodiments, the first dimerization domain is C-terminal to the first portion of the recombinase. In some embodiments, the second dimerization domain of the second polypeptide is N-terminal to the second portion of the recombinase. In some embodiments, the second dimerization domain is C-terminal to the second portion of the recombinase.

In some embodiments, the first portion of the recombinase corresponds to the N- terminal portion of the recombinase, and the second portion of the recombinase corresponds to the C-terminal portion of the recombinase. In other embodiments, the first portion of the recombinase corresponds to the C-terminal portion of the recombinase, and the second portion of the recombinase corresponds to the N-terminal portion of the recombinase.

In some embodiments, a polynucleic acid molecule comprises a nucleic acid sequence encoding a split recombinase, wherein the nucleic acid sequence has at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to any one of SEQ ID NOs: 111-131. In some embodiments, a polynucleic acid molecule comprises a nucleic acid sequence encoding a split recombinase, wherein the nucleic acid sequence comprises the nucleic acid sequence of any one of SEQ ID NOs: 111-131.

In some embodiments, a polynucleic acid molecule comprises an expression cassette encoding a polycistronic mRNA, wherein the polycistronic mRNA comprises a nucleic acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% identity to any one of SEQ ID NOs: 132-143. In some embodiments, a polynucleic acid molecule comprises an expression cassette encoding a polycistronic mRNA, wherein the polycistronic mRNA comprises a nucleic acid sequence of any one of SEQ ID NOs: 132-143. b. Two Expression Cassette Embodiments

In other embodiments, a first expression cassette encodes the first polypeptide of a split recombinase, and a second expression cassette encodes the second polypeptide of a split recombinase. In some embodiments, the first expression cassette comprises a constitutive promoter (as described herein). In some embodiments, the first expression cassette comprises an inducible promoter (as described herein).

In some embodiments, the second expression cassette comprises a constitutive promoter (as described herein). In some embodiments, the second expression cassette comprises an inducible promoter (as described herein).

In some embodiments, a single polynucleic acid comprises the first expression cassette and the second expression cassette. In other embodiments, a first polynucleic acid molecule comprises the first expression cassette, and a second polynucleic acid molecule comprises the second expression cassette.

III. Adeno-Associated Virus Production Systems

In some aspects, the disclosure relates to adeno-associated virus (AAV) production systems which allow for inducible control of a gene product(s) required for AAV production, including an AAV gene product(s) that is cytotoxic or cytostatic to a cell. In the AAV production systems described herein, this inducible control is mediated by a splitrecombinase (e.g., as provided herein). The possibility for near-zero background expression in the absence of dimerization and near-native expression in the presence of dimerization make split recombinases a promising technology for viral platforms which have complex and poorly characterized regulation. In contrast, systems that directly regulate viral genes with synthetic promoters (e.g., Tet-On or cumate) require significant tuning and may result in leaky expression in the off state.

The AAV production systems described herein comprise one or more polynucleic acid molecules comprising: (a) an AAV production component comprising a polynucleic acid molecule encoding an AAV gene product (or a portion thereof) flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site; and (b) a split recombinase (as described herein) corresponding to the first recombinase attachment site and the second recombination attachment site of (a). The one or more polynucleic acid molecules of an AAV production system may further comprise: (c) a transcriptional activator; (d) a transfer polynucleic acid molecule; I a selection marker; or (f) a combination thereof. a. AAV Production Component

The AAV production systems described herein have an AAV production component comprising an AAV production component comprising a polynucleic acid molecule encoding an AAV gene product (or a portion thereof) flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site. AAV gene products required for generation of an AAV in a recombinant host cell (or an “engineered cell” as described herein) are known to those having ordinary skill in the art. Exemplary AAV gene products include Rep52, Rep40, Rep78, Rep68, El, E2A, E40rf6, VARNA, CAP (VP1, VP2, VP3), AAP, and MAAP or functional variants thereof. The Rep gene products (comprising Rep52, Rep40, Rep78 and Rep68) are involved in AAV genome replication and packaging. The El genes upregulate transcription of several adenovirus and AAV genes. The E2A gene product is involved in aiding DNA synthesis processivity during AAV replication. The E40rf6 gene product supports AAV replication. The VARNA gene product plays a role in regulating translation. The CAP gene products (comprising VP1, VP2, VP3) encode viral capsid proteins. The AAP gene product plays a role in capsid assembly. MAAP is a protein residing in an alternate reading from of VP1 and appears to play a role in the viral capsid as described in Ogden et al. Science 366.6469 (2019): 1139-1143, which is incorporated by reference in its entirety.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding Rep52 (or a portion thereof), wherein the nucleic acid sequence encoding for Rep52 (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding Rep40 (or a portion thereof), wherein the nucleic acid sequence encoding for Rep40 (or a portion thereof), is flanked on each end by a recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding Rep78 (or a portion thereof), wherein the nucleic acid sequence encoding for Rep78 (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding Rep68 (or a portion thereof), wherein the nucleic acid sequence encoding for Rep68 (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site. In some embodiments, an AAV production component comprises a nucleic acid sequence encoding E2A (or a portion thereof), wherein the nucleic acid sequence encoding for E2A (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding E4ORF6 (or a portion thereof), wherein the nucleic acid sequence encoding for E4ORF6 (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding VARNA (or a portion thereof), wherein the nucleic acid sequence encoding for VARNA (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding VP1 (or a portion thereof), wherein the nucleic acid sequence encoding for VP1 (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding VP2 (or a portion thereof), wherein the nucleic acid sequence encoding for VP2 (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding VP3 (or a portion thereof), wherein the nucleic acid sequence encoding for VP3 (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding AAP (or a portion thereof), wherein the nucleic acid sequence encoding for AAP (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component comprises a nucleic acid sequence encoding MAAP (or a portion thereof), wherein the nucleic acid sequence encoding for MAAP (or a portion thereof), is flanked on one end by a first recombinase attachment site and on the other end by a second recombinase attachment site.

In some embodiments, an AAV production component is (i.e., the gene products of the AAV component are) encoded on a single nucleic acid molecule. In other embodiments, multiple nucleic acid molecules collectively comprise the AAV production component (i.e., at least two of the gene products of the AAV production component are encoded on different nucleic acid molecules). For example, an AAV production component may comprise at least

2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 10, or at least 11 nucleic acid molecules. In some embodiments, an AAV production component comprises 2,

3, 4, 5, 6, 7, 8, 9, 10, or 11 nucleic acid molecules.

In some embodiments, an AAV production system comprises one or more nucleic acid sequences that collectively encode the gene products: Rep52 or Rep40; Rep78 or Rep68; E2A; E40rf6; VARNA; VP1; VP2; VP3; AAP; and MAAP. In some embodiments, an AAV production system comprises one or more nucleic acid sequences that collectively encode the gene products: Rep52, Rep40, Rep78, Rep68, E2A, E40rf6, VARNA, VP1, VP2, VP3, and AAP. In some embodiments, the one or more nucleic acid molecules that collectively encode the gene products required for generation of an AAV are each operably linked to a promoter as described herein.

Recombinase attachment sites have been described previously and are known to those having ordinary skill in the art. Exemplary recombinase attachment sites and their corresponding recombinases are provided in Table 10.

In some embodiments, an AAV production system comprise: (i) a nucleic acid sequence encoding for a split Flp recombinase (as described herein); and (ii) a nucleic acid sequence encoding for an AAV gene product (or a portion thereof), wherein the nucleic acid sequence encoding for the AAV gene product (or a portion thereof) is flanked on one end by a first Flp recombinase attachment site and on the other end by a second Flp recombinase attachment site, wherein the first Flp recombinase attachment site and the second Flp recombinase attachment are capable of being bound and recombined by the split Flp recombinase of (i).

In some embodiments, the first Flp recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 144-156. In some embodiments, the first Flp recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 144-156.

In some embodiments, the second Flp recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 144-156. In some embodiments, the second Flp recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 144-156.

In some embodiments, an AAV production system comprise: (i) a nucleic acid sequence encoding for a split Bxbl recombinase (as described herein); and (ii) a nucleic acid sequence encoding for an AAV gene product (or a portion thereof), wherein the nucleic acid sequence encoding for the AAV gene product (or a portion thereof) is flanked on one end by a first Bxb 1 recombinase attachment site and on the other end by a second Bxb 1 recombinase attachment site, wherein the first Bxb 1 recombinase attachment site and the second Bxb 1 recombinase attachment are capable of being bound and recombined by the split Bxbl recombinase of (i).

In some embodiments, the first Bxbl recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 157-172. In some embodiments, the first Bxbl recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 157-172.

In some embodiments, the second Bxbl recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 173-188. In some embodiments, the second Bxbl recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 173-188.

In some embodiments, an AAV production system comprise: (i) a nucleic acid sequence encoding for a split PhiC31 recombinase (as described herein); and (ii) a nucleic acid sequence encoding for an AAV gene product (or a portion thereof), wherein the nucleic acid sequence encoding for the AAV gene product (or a portion thereof) is flanked on one end by a first PhiC31 recombinase attachment site and on the other end by a second PhiC31 recombinase attachment site, wherein the first PhiC31 recombinase attachment site and the second PhiC31 recombinase attachment are capable of being bound and recombined by the split PhiC31 recombinase of (i).

In some embodiments, the first PhiC31 recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 189-204. In some embodiments, the first PhiC31 recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 189-204. In some embodiments, the second PhiC31 recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 205-220. In some embodiments, the second PhiC31 recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 205-220.

In some embodiments, an AAV production system comprise: (i) a nucleic acid sequence encoding for a split Cre recombinase (as described herein); and (ii) a nucleic acid sequence encoding for an AAV gene product (or a portion thereof), wherein the nucleic acid sequence encoding for the AAV gene product (or a portion thereof) is flanked on one end by a first Cre recombinase attachment site and on the other end by a second Cre recombinase attachment site, wherein the first Cre recombinase attachment site and the second Cre recombinase attachment are capable of being bound and recombined by the split Cre recombinase of (i).

In some embodiments, the first Cre recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 221-229. In some embodiments, the first Cre recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 221-229.

In some embodiments, the second Cre recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 221-229. In some embodiments, the second Cre recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 221-229.

In some embodiments, an AAV production system comprise: (i) a nucleic acid sequence encoding for a split Vcre recombinase (as described herein); and (ii) a nucleic acid sequence encoding for an AAV gene product (or a portion thereof), wherein the nucleic acid sequence encoding for the AAV gene product (or a portion thereof) is flanked on one end by a first Vcre recombinase attachment site and on the other end by a second Vcre recombinase attachment site, wherein the first Vcre recombinase attachment site and the second Vcre recombinase attachment are capable of being bound and recombined by the split Vcre recombinase of (i).

In some embodiments, the first Vcre recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 230-235. In some embodiments, the first Vcre recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 230-235.

In some embodiments, the second Vcre recombinase attachment site comprises a nucleic acid sequence having at least 80% (e.g., at least 80%, at least 85%, at least 90%, at least 95%, or at least 99%) identity to the nucleic acid sequence of any one of SEQ ID NOs: 230-235. In some embodiments, the second Vcre recombinase attachment site comprises the nucleic acid sequence of any one of SEQ ID NOs: 230-235. b. Transcriptional Activator

In some embodiments, an AAV production system further comprises a transcriptional activator (or a polynucleic acid molecule encoding the same). As used herein, the term “transcriptional activator” refers to a transcription factor that binds to and regulates expression of an inducible promoter of an AAV production system (e.g., an inducible promoter operably linked to a nucleic acid sequence encoding for an AAV gene product, an inducible promoter operably linked to a nucleic acid encoding for a polypeptide of a split recombinase, etc.). Exemplary transcriptional activators, and their corresponding promoter recognition sites, are known to those having skill in the art and include, but are not limited to, TetOn-3G, TetOn-V16, TetOff-Advanced, VanR-VP16, TtgR-VP16, PhlF-VP16, and the cumate cTA and rcTA. In some embodiments, the transcriptional activator is operably linked to a promoter (as described herein). In some embodiments, the transcriptional activator binds to its corresponding promoter recognition site when exposed to a small molecule inducer. In some embodiments, the small molecule inducer is selected from the group consisting of doxycycline, vanillate, phloretin, rapamycin, abscisic acid, gibberellic acid, acetoxymethyl ester, and cumate.

In some embodiments, an AAV production system comprises two or more transcriptional activators (or polynucleic acid molecules encoding the same). c. Transfer Polynucleic Acid Molecule

In some embodiments, an AAV production system further comprises a transfer polynucleic acid molecule. In some embodiments, a transfer polynucleic acid molecule comprises, from 5’ to 3’: (i) a nucleic acid sequence of a 5’ inverted tandem repeat; (ii) a central nucleic acid; and (iii) a nucleic acid sequence of a 3’ inverted tandem repeat. In some embodiments, the nucleic acid sequence is a plasmid or a vector. In some embodiments, a central nucleic acid of the transfer polynucleic acid molecule comprises a multiple cloning site. Exemplary multiple cloning sites are known to those having ordinary skill in the art. A multiple cloning site can be used for cloning a payload molecule (or gene of interest) - or an expression cassette encoding a payload molecule - into the transfer nucleic acid molecule prior to the generation of viral vectors in a host cell.

In some embodiments, a central nucleic acid of the transfer polynucleic acid molecule comprises a gene product of interest. d. Selection Marker

An AAV production system may further comprise a nucleic acid sequence encoding for a selection marker. As used herein, the term “selection marker” refers to a protein that - when introduced into or expressed in a cell - confers a trait that is suitable for selection. As used herein, the term “selection cassette” refers to a nucleic acid sequence encoding a selection marker operably linked to a promoter (as described herein) and a terminator.

A selection marker may be a fluorescent protein. Examples of fluorescent proteins are known in the art (e.g., TagBFP, EBFP2, EGFP, EYFP, mK02, or Sirius). See e.g., Patent No.: US 5,874,304; Patent No.: EP 0969284 Al; Pub. No.: US 2010/167394 A - the entireties of which are incorporated here by reference.

Alternatively, or in addition, a selection marker may be an antibiotic resistance protein. Examples of antibiotic resistance proteins are known in the art (e.g., facilitating puromycin, hygromycin, neomycin, zeocin, blasticidin, or phleomycin selection). See e.g., Pub. No.: WO 1997/15668 A2; Pub. No.: WO 1997/43900 Al - the entireties of which are incorporated here by reference.

Alternatively, or in addition, a selection marker may be an auxotrophic selection marker (e.g., glutamine synthetase).

IV. Engineered Cells

In some aspects, the disclosure relates to engineered cells comprising a split recombinase described herein or a polynucleic acid molecule encoding a split recombinase described herein (optionally wherein the polynucleic acid molecule is stably integrated).

In some aspects, the disclosure relates to engineered cells for AAV production comprising an AAV production described herein. In some embodiments, the engineered cell may comprise any part (and any combination of parts) of the AAV production systems described herein. An engineered cell may comprise at least a portion of the AAV production component (e.g., one or more nucleic acid sequences encoding Rep52, Rep40, Rep78, Rep68, E2A, E40rf6, VARNA, VP1, VP2, VP3, and/or AAP). For example, and as described above, an AAV production component may comprise multiple nucleic acid molecules. In such embodiments, an engineered cell comprises one or more of said multiple polynucleic acid molecules - each of which may be located extra-chromosomally or stably integrated into the genome of the engineered cell. In some embodiments, an engineered cell comprises the entire AAV production component. In some embodiments, an engineered cell further comprises one or more polynucleic acid molecules collectively comprising nucleic acid sequences encoding for: UL5, UL8, UL29, UL30, UL42, UL52, UL12, ICP10, ICP4, and ICP22 (optionally wherein one or more of the polynucleic acid molecules is stably integrated).

In some aspects, the disclosure relates to engineered cells comprising: (a) a first polynucleic acid molecule (optionally stably integrated) encoding a split recombinase (as described herein); and (b) a second polynucleic acid molecule (optionally stably integrated) comprising a nucleic acid sequence encoding, from 5’ to 3’: (i) a first recombinase attachment site; (ii) a gene coding segment; and (iii) a second recombinase attachment site; wherein the first recombinase attachment site and the second recombinase attachment site correspond to the split recombinase (or polypeptide dimer having recombinase activity) of (a).

As used herein, the term “stably integrated” refers to an exogenous nucleic acid sequence, nucleic acid molecule, construct, gene, or nucleic acid sequence that has been inserted into the genome of and organism (e.g. the engineered cell as described herein) and is passed on to future generations after cell division. It is to be understood that any nucleic acid sequence, nucleic acid molecule, construct, gene or nucleic acid sequence described herein may be stably integrated. In some embodiments, any nucleic acid sequence, nucleic acid molecule, construct gene or nucleic acid sequence may be integrated into the genome using random integration, targeted integration, or transposon-mediated integration. It is to be understood that any of the stably integrated nucleic acid molecules described herein may comprise IR/DR sequences that are capable of binding the Sleeping Beauty transposase. Stable integration using the Sleeping Beauty transposase is described in Mates, Lajos, et al. Nature genetics 41.6 (2009): 753-761 which is incorporated by reference in its entirety. In some embodiments, a IR/DR sequence comprises a Sleeping Beauty 100X (SB100X) IR/DR.

An engineered cell described herein may further comprise a landing pad. As used herein, the term “landing pad” refers to a heterologous nucleic acid molecule sequence that facilitates the targeted insertion of a “payload” sequence into a specific locus (or multiple loci) of the cell’s genome. Accordingly, the landing pad is integrated into the genome of the cell. A fixed integration site is desirable to reduce the variability between experiments that may be caused by positional epigenetic effects or proximal regulatory elements. The ability to control payload copy number is also desirable to modulate expression levels of the payload without changing any genetic components.

In some embodiments, the landing pad is located at a safe harbor site in the genome of the engineered cell. As used herein, the term “safe harbor site” refers to a location in the genome where genes or genetic elements can be introduced without disrupting the expression or regulation of adjacent genes and/or adjacent genomic elements do not disrupt expression or regulation of the introduced genes or genetic elements. Examples of safe harbor sites are known to those having skill in the art and include, but are not limited to, AAVS1, ROSA26, COSMIC, Hl 1, CCR5, and LiPS-A3S. See e.g., Gaidukov et al., Nucleic Acids Res. 2018 May 4; 46(8): 4072-4086; Patent No.: US 8,980,579 B2; Patent No.: US 10,017,786 B2; Patent No.: US 9,932,607 B2; Pub. No.: US 2013/280222 A; Pub. No.: WO 2017/180669 Al - the entireties of which are incorporated herein. In some embodiments, the safe harbor site is a known site. In other embodiments, the safe harbor site is a previously undisclosed site. See “Methods of Identifying High-Expressing Genomic Loci and Uses Thereof’ herein. In some embodiments, an engineered cell described herein comprises a landing pad that is integrated at a safe harbor locus selected from the group consisting of AAVS1, ROSA26, COSMIC, Hl l, CCR5, and LiPS-A3S.

In some embodiments, the engineered cell is derived from a HEK293 cell. In some embodiments, the engineered HEK293 cell comprises a landing pad that is integrated at a safe harbor locus selected from the group consisting of AAVS1, ROSA26, COSMIC, Hl l, CCR5, and LiPS-A3S.

Each of the landing pads described herein comprises at least one recombination site. For example, a landing pad may comprise recombination sites corresponding to a Flp recombinase, a Bxbl integrase, a PhiC31 recombinase, a TP901 recombinase, a Cre recombinase, a Vcre recombinase, an Intl-Int34 recombinase, an R4 recombinase, or a Dre recombinase.

The landing pads described herein may comprise one or more expression cassettes. V. Kits

In some aspects, the disclosure relates to kits comprising a split recombinase described herein, a polynucleic acid encoding a split recombinase described herein, an AAV production system described herein, and/or an engineered cell described herein.

In some embodiments, a kit comprises one or more nucleic acid molecules collectively comprising an AAV production system.

In some embodiments, the kit further comprises a small molecule that induces dimerization of an inducible split recombinase described herein. In some embodiments, the small molecule inducer is gibberellic acid (GA), abscisic acid (ABA), or rapalog (Rap).

In some embodiments, a kit comprises a nucleic acid molecule comprising a nucleic acid sequence of a transcriptional activator operably linked to a nucleic acid sequence of a promoter, wherein the transcriptional activator, when expressed in the presence of the small molecule inducer, binds to a chemically inducible promoter of the AAV production system, optionally wherein an engineered cell comprises the nucleic acid molecule comprising the nucleic acid sequence of the transcriptional activator. In some embodiments, the transcriptional activator is selected from the group consisting of TetOn-3G, TetOn-V16, TetOff- Advanced, VanR-VP16, TtgR-VP16, PhlF-VP16, and the cumate cTA and rcTA.

In some embodiments, the kit may further comprise instructions for use of the cells.

VI. Methods of Using Engineered Cells for AAV production

In some aspects, the present disclosure provides methods for producing AAV using an AAV production system described herein, wherein the AAV production system comprises: (a) an AAV production component collectively encode gene products required for generation of an AAV in a recombinant host cell; and (b) a split recombinase described herein (or a polynucleic acid molecule encoding the same, as described herein). In some embodiments, the method of AAV production comprises transfecting or stably integrating into an engineered cell any combination of the one or more nucleic acid molecules collectively comprising the AAV production component and the polynucleic acid molecule encoding the split recombinase. In some embodiments, the method of AAV production further comprises transfecting a nucleic acid molecule comprising a payload for AAV delivery (e.g. a therapeutic DNA sequence) as described above. In some embodiments, the method comprises growing the engineered cell to a confluency that is optimal for AAV production. An optimal confluency may be dependent, for example, on the type of cell the engineered cell is derived from. The skilled person will know or be able to determine the optimal confluency for AAV production. In some embodiments, the method comprises harvesting the AAV produced from the culture of engineered cells using methods that are well known to those of skill in the art.

EXAMPLES

Example 1: Testing Genetic Designs for Split Recombinases

Four genetic designs (VI -V4) were tested for expressing a Cre 270/271 split recombinase (split between amino acids 270 and 271) (FIGs. 2A-2D). The N-terminal portion of the Cre 270/271 split recombinase was fused to an ABI dimerization domain (CreN-ABI) and the C-terminal portion of the Cre 270/271 split recombinase was fused to a PYL1 dimerization domain (PYLl-Cre). In some embodiments, the nucleic acid sequence encoding the split recombinase comprised the structure (from 5’ to 3’): CreN-ABI - P2A - PYLl-CreC (FIGs. 2A and 2C). In other embodiments, the nucleic acid sequence encoding the split recombinase comprised the structure (from 5’ to 3’): PYLl-CreC - P2A - CreN-ABI (FIGs. 2B and 2D). Expression of the Cre 270/271 split recombinase was driven by either a constitutive hEFla promoter (FIGs. 2A-2B) or an inducible TRE promoter with addition of TetOn (FIGs. 2C-2D). Recombinase constructs were transfected into a reporter HEK293FT cell containing an integrated expression construct that expresses TagBFP prior to recombination and EGFP following recombination. An iRFP720 expression construct was also cotransfected to control for transfection efficiency. TagBFP was measured in the PB450- A channel, EGFP was measured in the FITC-A channel, and iRFP720 was measured in the APC-A700-A channel.

VI exhibited a low level of background recombination (0.1%) in the absence of a small-molecule inducer. V2-V4 exhibited near zero background recombination in the absence of a small-molecule inducer. In addition, constructs with 5’ CreN-ABI fusion (i.e., VI and V3) exhibited increased recombination when induced compared to constructs with a 3’ CreN-ABI fusion (i.e., V2 and V4), though the 3’ Cre-N-ABI fusions exhibited lower background recombination. Based on these results, additional split recombinase designs primarily utilize genetic designs in which the sequence encoding the N-terminal half is placed 5’ to the sequence encoding the C-terminal half. Example 2: Testing of Split Recombinase/Dimerization Domain Combinations

Several split recombinase/dimerization domain combinations were tested (FIG. 3). Each combination showed induction of recombinase activity with addition of a respective small molecule inducer. Some combinations showed very high induction. Importantly, several combinations had near background levels of recombinase activity in the absence of small molecule. Recombinase/dimerization domains that performed particularly well (high induction and low background) include: FlpN396-ABI / PYLl-FlpC397 (abscisic acid inducible); Vcre_N269-GIDl / GAI-Vcre_C279 (gibberellic acid inducible); CreN229-GIDl / GAI-CreC230 (gibberellic acid inducible); PhiC31N233-GIDl / GAI-PhiC31C234 (gibberellic acid inducible).

Example 3: Testing of Bxbl Split Recombinase Split Locations

Additional Bxbl split recombinases were designed by selecting amino acid split locations believed likely to result in successful split recombinases, as well as locations likely to disrupt recombinase function (as a negative control) (FIG. 4 and FIGs. 5A-5L). Each Bxbl split recombinase showed induction of recombinase activity with addition of a respective small molecule inducer. Some Bxbl split recombinases showed very high induction (e.g., 169/170, 208/209, 259/260, 370/371, 468/469). Only one Bxbl split recombinase (468/469) showed high basal activity.

Example 4: Testing of Additional Split Recombinases

The activities (basal and induced) of additional split recombinases were tested (FIGs. 6A-6H).

Table 1: List of Split Recombinases Studied in Examples 1-4

Table 2: Exemplary Recombinase Amino Acid Sequences Table 3: Exemplary Split Positions for Split Recombinases

Table 4: Exemplary Dimerization Domain Pair Amino Acid Sequences

Table 5: Exemplary IRES Polycistronic Expression Element Nucleic Acid sequences

Table 6: Exemplary 2A Polycistronic Expression Element Amino Acid Sequences

Table 7: Exemplary Split Recombinases Amino Acid Sequences

Table 8: Exemplary Split Recombinases Nucleic Acid Sequences

Table 9: Exemplary Expression Cassette Nucleic Acid Sequences

Table 10: Exemplary Recombinase Attachment Sites




 
Previous Patent: SEALING CONNECTOR

Next Patent: SUCTION VALVE FOR AN ENDOSCOPE