MATUSZEK ZANETA (US)
ARBAB MANDANA (US)
HARVARD COLLEGE (US)
WO2022067130A2 | 2022-03-31 | |||
WO2022204543A1 | 2022-09-29 | |||
WO2021030344A1 | 2021-02-18 | |||
WO2018098587A1 | 2018-06-07 | |||
WO2022203905A1 | 2022-09-29 | |||
WO2021226558A1 | 2021-11-11 | |||
WO2023081426A1 | 2023-05-11 | |||
WO2001038547A2 | 2001-05-31 | |||
WO2020191239A1 | 2020-09-24 | |||
WO2021226558A1 | 2021-11-11 | |||
WO2021052097A1 | 2021-03-25 | |||
WO2022067130A2 | 2022-03-31 | |||
WO2022150790A2 | 2022-07-14 | |||
WO2023076898A1 | 2023-05-04 | |||
WO2023015309A2 | 2023-02-09 | |||
WO2013045632A1 | 2013-04-04 | |||
WO2014055782A1 | 2014-04-10 | |||
WO2016069774A1 | 2016-05-06 |
US20170224843A1 | 2017-08-10 | |||
US20200318116A1 | 2020-10-08 | |||
AU2021232005A1 | 2022-09-29 | |||
US8394604B2 | 2013-03-12 | |||
EP0011690W | 2000-11-23 | |||
US20200023721W | 2020-03-19 | |||
US20210031439W | 2021-05-07 | |||
US20220012054W | 2022-01-11 | |||
US20220078655W | 2022-10-25 | |||
US20220074628W | 2022-08-05 | |||
US5244797A | 1993-09-14 | |||
EP2877490A2 | 2015-06-03 | |||
USPP61836080P | ||||
US203662634263P | ||||
US201362634910P | ||||
US10202658B2 | 2019-02-12 | |||
US10189831B2 | 2019-01-29 | |||
US10150955B2 | 2018-12-11 | |||
US9932567B1 | 2018-04-03 | |||
US9783791B2 | 2017-10-10 | |||
US9580698B1 | 2017-02-28 | |||
US9534201B2 | 2017-01-03 | |||
US9458484B2 | 2016-10-04 | |||
US20110059502A1 | 2011-03-10 | |||
US4880635A | 1989-11-14 | |||
US4906477A | 1990-03-06 | |||
US4911928A | 1990-03-27 | |||
US4917951A | 1990-04-17 | |||
US4920016A | 1990-04-24 | |||
US4921757A | 1990-05-01 |
SINGLETON ET AL.: "Dictionary of Microbiology and Molecular Biology", 1994
KOTEWICZ, M. L.SAMPSON, C. M.D'ALESSIO, J. M.GERARD, G. F.: "Isolation of cloned Moloney murine leukemia virus reverse transcriptase lacking ribonuclease H activity", NUCLEIC ACIDS RES, vol. 16, 1988, pages 265 - 277
HALEMARHAM: "The Harper Collins Dictionary of Biology", 1991, SPRINGER VERLAG
JINEK M.CHYLINSKI K.FONFARA I.HAUER M.DOUDNA J.A.CHARPENTIER E.: "A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity", SCIENCE, vol. 337, 2012, pages 816 - 821, XP055229606, DOI: 10.1126/science.1225829
FERRETTIMCSHAN W.M.AJDIC D.J.SAVIC D.J.SAVIC G.LYON K.PRIMEAUX C.SEZATE S.SUVOROV A.N.KENTON S.: "Complete genome sequence of an Ml strain of Streptococcus pyogenes", PROC. NATL. ACAD. SCI. U.S.A., vol. 98, 2001, pages 4658 - 4663
DELTCHEVA E.CHYLINSKI K.SHARMA C.M.GONZALES K.CHAO Y.PIRZADA Z.A.ECKERT M.R.VOGEL J.CHARPENTIER E.: "CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III", NATURE, vol. 471, 2011, pages 602 - 607, XP055308803, DOI: 10.1038/nature09886
JINEK ET AL., SCIENCE, vol. 337, 2012, pages 816 - 821
QI ET AL.: "Repurposing CRISPR as an RNA-Guided Platform for Sequence-Specific Control of Gene Expression", CELL, vol. 152, no. 5, 2013, pages 1173 - 83, XP055346792, DOI: 10.1016/j.cell.2013.02.022
CHEN, B. ET AL.: "Dynamic Imaging of Genomic Loci in Living Human Cells by an Optimized CRISPR/Cas System", CELL, vol. 155, no. 7, 2013, pages 1479 - 1471, XP028806611, DOI: 10.1016/j.cell.2013.12.001
CHYLINSKI, RHUN: "The tracrRNA and Cas9 families of type II CRISPR-Cas immunity systems Charpentier", RNA BIOLOGY, vol. 10, no. 5, 2013, pages 726 - 737, XP055116068, DOI: 10.4161/rna.24321
BARANAUSKAS, A. ET AL.: "Generation and characterization of new highly thermostable and processive M-MuLV reverse transcriptase variants", PROTEIN ENG DES SEL, vol. 25, 2012, pages 657 - 668, XP055071799, DOI: 10.1093/protein/gzs034
MAKAROVA ET AL.: "C2c2 is a single-component programmable RNA-guided RNA-targeting CRISPR effector", SCIENCE, vol. 353, no. 6299, 2016, XP055407082, DOI: 10.1126/science.aaf5573
STEVENS ET AL., J. AM. CHEM. SOC., vol. 138, no. 7, 24 February 2016 (2016-02-24), pages 2162 - 5
SHAH ET AL., CHEM. SCI., vol. 5, no. 1, 2014, pages 446 - 461
ANZALONE, A. V. ET AL.: "Search-and-replace genome editing without double-strand breaks or donor DNA", NATURE, vol. 576, 2019, pages 149 - 157, XP055980447, DOI: 10.1038/s41586-019-1711-4
SHAH ET AL.: "Protospacer recognition motifs: mixed identities and functional diversity", RNA BIOLOGY, vol. 10, no. 5, pages 891 - 899
WU ET AL., BIOCHIM. BIOPHYS. ACTA, vol. 35732, 1998, pages 1
PERBAL: "Controlled Drug Bioavailability, Drug Product Design and Performance", 1984, WILEY & SONS
SAUNDERSSAUNDERS: "Microbial Genetics Applied to Biotechnology", 1987, CROOM HELM
BERGER ET AL., BIOCHEMISTRY, vol. 22, 1983, pages 2365 - 2372
GERARD, G. R, DNA, vol. 5, 1986, pages 271 - 279
KOTEWICZ, M. L. ET AL., GENE, vol. 35, 1985, pages 249 - 258
NISHIMASU ET AL., CELL, vol. 156, 2014, pages 935 - 949
STEVENS ET AL.: "A promiscuous split intein with expanded protein engineering applications", PNAS, vol. 114, 2017, pages 8538 - 8543, XP055661453, DOI: 10.1073/pnas.1701083114
IWAI ET AL.: "Highly efficient protein trans-splicing by a naturally split DnaE intein from Nostc punctiforme", FEBS LETT, vol. 580, pages 1853 - 1858
SHINGLEDECKER ET AL., GENE, vol. 207, 1998, pages 187
SOUTHWORTH ET AL., EMBO J., vol. 17, 1998, pages 918
MILLS ET AL., PROC. NATL. ACAD. SCI. USA, vol. 95, 1998, pages 3543 - 3548
AUTIERIAGRAWAL, J. BIOL. CHEM., vol. 273, 1998, pages 14731 - 15890
YAMAZAKI ET AL., J. AM. CHEM. SOC., vol. 120, 1998, pages 5591
EVANS ET AL., J. BIOL. CHEM., vol. 275, 2000, pages 9091
OTOMO ET AL., BIOCHEMISTRY, vol. 38, 1999, pages 16040 - 16044
OTOMO ET AL., J. BIOLMOL. NMR, vol. 14, 1999, pages 105 - 114
SCOTT ET AL., PROC. NATL. ACAD. SCI. USA, vol. 96, 1999, pages 13638 - 13643
ZUKERSTIEGLER, NUCLEIC ACIDS RES., vol. 9, 1981, pages 133 - 148
A. R. GRUBER ET AL., CELL, vol. 106, no. 1, 2008, pages 23 - 24
PA CARRGM CHURCH, NATURE BIOTECHNOLOGY, vol. 27, no. 12, 2009, pages 1151 - 62
JOST, M. ET AL.: "Titrating expression using libraries of systematically attenuated CRISPR guide RNAs", NAT. BIOTECHNOL., vol. 38, 2020, pages 355 - 364, XP037055445, DOI: 10.1038/s41587-019-0387-5
DAVIS, J. R. ET AL., NAT. BIOTECHNOL., 2023
HERZIG, E.VORONIN, N.KUCHERENKO, N.HIZI, A.: "A Novel Leu92 Mutant of HIV-1 Reverse Transcriptase with a Selective Deficiency in Strand Transfer Causes a Loss of Viral Replication", J. VIROL., vol. 89, 2015, pages 8119 - 8129
MOHR, G. ET AL.: "A Reverse Transcriptase-Cas 1 Fusion Protein Contains a Cas6 Domain Required for Both CRISPR RNA Biogenesis and RNA Spacer Acquisition", MOL. CELL, vol. 72, 2018, pages 700 - 714
ZHAO, C.LIU, F.PYLE, A. M.: "An ultraprocessive, accurate reverse transcriptase encoded by a metazoan group II intron", RNA, vol. 24, 2018, pages 183 - 195
ZIMMERLY, S.WU, L.: "An Unexplored Diversity of Reverse Transcriptases in Bacteria", MICROBIOL SPECTR, vol. 3, 2015
OSTERTAG, E. M.KAZAZIAN JR, H. H.: "Biology of Mammalian L1 Retrotransposons", ANNUAL REVIEW OF GENETICS, vol. 35, 2001, pages 501 - 538, XP002474549
PERACH, M.HIZI, A.: "Catalytic Features of the Recombinant Reverse Transcriptase of Bovine Leukemia Virus Expressed in Bacteria", VIROLOGY, vol. 259, 1999, pages 176 - 189, XP004450354, DOI: 10.1006/viro.1999.9761
LIM, D. ET AL.: "Crystal structure of the moloney murine leukemia virus RNase H domain", J. VIROL., vol. 80, 2006, pages 8379 - 8389
ZHAO, C.PYLE, A. M.: "Crystal structures of a group II intron maturase reveal a missing link in spliceosome evolution", NATURE STRUCTURAL & MOLECULAR BIOLOGY, vol. 23, 2016, pages 558 - 565, XP055556551, DOI: 10.1038/nsmb.3224
GRIFFITHS, D. J.: "Endogenous retroviruses in the human genome sequence", GENOME BIOL, vol. 2, 2001, pages 1017, XP002996132
ZIMMERLY, S.GUO, H.PERLMAN, P. S.LAMBOWLTZ, A. M.: "Group II intron mobility occurs by target DNA-primed reverse transcription", CELL, vol. 82, 1995, pages 545 - 554
FENG, Q.MORAN, J. V.KAZAZIAN, H. H.BOEKE, J. D.: "Human L1 retrotransposon encodes a conserved endonuclease required for retrotransposition", CELL, vol. 87, 1996, pages 905 - 916
BERKHOUT, B.JEBBINK, M.ZSIROS, J.: "Identification of an Active Reverse Transcriptase Enzyme Encoded by a Human Endogenous HERV-K Retrovirus", JOURNAL OF VIROLOGY, vol. 73, 1999, pages 2365 - 2375, XP002361440
AREZI, B.HOGREFE, H.: "Novel mutations in Moloney Murine Leukemia Virus reverse transcriptase increase thermostability through tighter binding to template-primer", NUCLEIC ACIDS RES, vol. 37, 2009, pages 473 - 481, XP002556110, DOI: 10.1093/nar/gkn952
BLAIN, S. W.GOFF, S. P.: "Nuclease activities of Moloney murine leukemia virus reverse transcriptase. Mutants with altered substrate specificities", J. BIOL. CHEM., vol. 268, 1993, pages 23585 - 23592, XP055491482
XIONG, Y.EICKBUSH, T. H.: "Origin and evolution of retroelements based upon their reverse transcriptase sequences", EMBO J, vol. 9, 1990, pages 3353 - 3362
HERSCHHORN, A.HIZI, A.: "Retroviral reverse transcriptases", CELL. MOL. LIFE SCI., vol. 67, 2010, pages 2717 - 2747, XP019837855
TAUBE, R.LOYA, S.AVIDAN, O.PERACH, M.HIZI, A.: "Reverse transcriptase of mouse mammary tumour virus: expression in bacteria, purification and biochemical characterization", BIOCHEM. J., vol. 329, 1998, pages 579 - 587, XP055980374, DOI: 10.1042/bj3290579
LIU, M. ET AL.: "Reverse Transcriptase-Mediated Tropism Switching in Bordetella Bacteriophage", SCIENCE, vol. 295, 2002, pages 2091 - 2094, XP002384941, DOI: 10.1126/science.1067467
LUAN, D. D.KORMAN, M. H.JAKUBCZAK, J. L.EICKBUSH, T. H.: "Reverse transcription of R2Bm RNA is primed by a nick at the chromosomal target site: a mechanism for non-LTR retrotransposition", CELL, vol. 72, 1993, pages 595 - 605, XP024245568, DOI: 10.1016/0092-8674(93)90078-5
NOTTINGHAM, R. M. ET AL.: "RNA-seq of human reference RNA samples using a thermostable group II intron reverse transcriptase", RNA, vol. 22, 2016, pages 597 - 613
TELESNITSKY, A.GOFF, S. P.: "RNase H domain mutations affect the interaction between Moloney murine leukemia virus reverse transcriptase and its primer-template", PROC. NATL. ACAD. SCI. U.S.A., vol. 90, 1993, pages 1276 - 1280
HALVAS, E. K.SVAROVSKAIA, E. S.PATHAK, V. K.: "Role of Murine Leukemia Virus Reverse Transcriptase Deoxyribonucleoside Triphosphate-Binding Site in Retroviral Replication and In Vivo Fidelity", JOURNAL OF VIROLOGY, vol. 74, 2000, pages 10349 - 10358
NOWAK, E. ET AL.: "Structural analysis of monomeric retroviral reverse transcriptase in complex with an RNA/DNA hybrid", NUCLEIC ACIDS RES, vol. 41, 2013, pages 3874 - 3887
STAMOS, J. L.LENTZSCH, A. M.LAMBOWITZ, A. M.: "Structure of a Thermostable Group II Intron Reverse Transcriptase with Template-Primer and Its Functional and Evolutionary Implications", MOLECULAR CELL, vol. 68, 2017, pages 926 - 939
DAS, D.GEORGIADIS, M. M.: "The Crystal Structure of the Monomeric Reverse Transcriptase from Moloney Murine Leukemia Virus", STRUCTURE, vol. 12, 2004, pages 819 - 829, XP025941534, DOI: 10.1016/j.str.2004.02.032
AVIDAN, O.MEER, M. E.OZ, I.HIZI, A.: "The processivity and fidelity of DNA synthesis exhibited by the reverse transcriptase of bovine leukemia virus", EUROPEAN JOURNAL OF BIOCHEMISTRY, vol. 269, 2002, pages 859 - 867
GERARD, G. F. ET AL.: "The role of template-primer in protection of reverse transcriptase from thermal inactivation", NUCLEIC ACIDS RES, vol. 30, 2002, pages 3118 - 3129, XP002556108, DOI: 10.1093/nar/gkf417
MONOT, C. ET AL.: "The Specificity and Flexibility of L1 Reverse Transcription Priming at Imperfect T-Tracts", PLOS GENETICS, vol. 9, 2013, pages el003499
MOHR, S. ET AL.: "Thermostable group II intron reverse transcriptase fusion proteins and their use in cDNA synthesis and next-generation RNA sequencing", RNA, vol. 19, 2013, pages 958 - 970, XP055149277, DOI: 10.1261/rna.039743.113
COKOL ET AL.: "Finding nuclear localization signals", EMBO REP., vol. 1, no. 5, 2000, pages 411 - 415, XP072230221, DOI: 10.1093/embo-reports/kvd092
FREITAS ET AL.: "Mechanisms and Signals for the Nuclear Import of Proteins", CURRENT GENOMICS, vol. 10, no. 8, 2009, pages 550 - 7, XP055502464
MAGIN ET AL., VIROLOGY, vol. 274, 2000, pages 11 - 16
TINLAND ET AL., PROC. NATL. ACAD. SCI. U.S.A., vol. 89, 1992, pages 7442 - 46
MOEDE ET AL., FEBS LETT., vol. 461, 1999, pages 229 - 34
PATEL ET AL.: "Flap endonucleases pass 5'-flaps through a flexible arch using a disorder-thread-order mechanism to confer specificity for free 5'-ends", NUCLEIC ACIDS RESEARCH, vol. 40, no. 10, 2012, pages 4507 - 4519
TSUTAKAWA ET AL.: "Human flap endonuclease structures, DNA double-base flipping, and a unified understanding of the FEN1 superfamily", CELL, vol. 145, no. 2, 2011, pages 198 - 211, XP028194588, DOI: 10.1016/j.cell.2011.03.004
KEIJZERS ET AL., BIOSCI REP, vol. 35, no. 3, 2015, pages e00206
JOHANSSON ET AL.: "RNA recognition by the MS2 phage coat protein", SEM VIROL., vol. 8, no. 3, 1997, pages 176 - 185
DELEBECQUE ET AL.: "Organization of intracellular reactions with rationally designed RNA assemblies", SCIENCE, vol. 333, 2011, pages 470 - 474
MALI ET AL.: "Cas9 transcriptional activators for target specificity screening and paired nickases for cooperative genome engineering", NAT. BIOTECHNOL., vol. 31, 2013, pages 833 - 838, XP055693153, DOI: 10.1038/nbt.2675
ZALATAN ET AL.: "Engineering complex synthetic transcriptional programs with CRISPR RNA scaffolds", CELL, vol. 160, 2015, pages 339 - 350, XP055278878, DOI: 10.1016/j.cell.2014.11.052
ZUFFEREY ET AL., J. VIROL., vol. 73, no. 4, 1999, pages 2886 - 92
FLAJOLET ET AL., J. VIROL., vol. 72, no. 7, 1998, pages 6175 - 80
LANGER, SCIENCE, vol. 249, 1990, pages 1527 - 1533
SEFTON, CRC CRIT. REF. BIOMED. ENG., vol. 14, 1989, pages 201
BUCHWALD ET AL., SURGERY, vol. 88, 1980, pages 507
SAUDEK ET AL., N. ENGL. J. MED., vol. 321, 1989, pages 574
"Medical Applications of Controlled Release", 1974, CRC PRESS
RANGERPEPPAS, MACROMOL. SCI. REV. MACROMOL. CHEM., vol. 23, 1983, pages 61
LEVY ET AL., SCIENCE, vol. 228, 1985, pages 190
DURING ET AL., ANN. NEUROL., vol. 25, 1989, pages 351
HOWARD ET AL., J. NEUROSURG., vol. 71, 1989, pages 105
ZHANG Y. P. ET AL., GENE THER., vol. 6, 1999, pages 1438 - 47
TAKAHASHIYAMANAKA, CELL, vol. 126, no. 4, 2006, pages 663 - 76
ROCCA ET AL., METHODS AND CLINICAL DEVELOPMENT, vol. 17, 2020, pages 1026 - 1036
VIRMOUNI ET AL., DIS MODEL MECH, vol. 8, no. 3, 2015, pages 225 - 235
DAVIS ET AL., NBME, 2023
CLAIMS What is claimed is: 1. A pegRNA comprising a spacer sequence comprising the nucleic acid sequence: or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 381-387. 2. The pegRNA of claim 1, wherein the pegRNA comprises the spacer sequence GACCCTGGAAAAGCTGAT (SEQ ID NO: 873), or a spacer sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 873. 3. The pegRNA of claim 1 or 2, wherein the pegRNA comprises a primer binding site (PBS) of about 8 to about 16 nucleotides in length. 4. The pegRNA of any one of claims 1-3, wherein the pegRNA comprises a PBS of about 8, about 9, about 10, about 11, about 12, about 13, about 14, about 15, or about 16 nucleotides in length. 5. The pegRNA of any one of claims 1-4, wherein the pegRNA comprises a PBS of about 10 nucleotides in length. 6. The pegRNA of any one of claims 1-5, wherein the pegRNA comprises a reverse transcription template comprising a sequence of about 15 to about 35 nucleotides, wherein the sequence encodes a sequence that comprises one or more nucleotide edits compared to a sequence directly upstream of a CAG repeat in exon 1 of the HTT gene. 7. The pegRNA of claim 6, wherein the one or more nucleotide edits comprises a PAM mutation that alters a PAM sequence. 8. The pegRNA of claim 7, wherein the PAM sequence is NGG, wherein N is any one of nucleotides A, G, C, or T. 9. The pegRNA of any one of claims 1-8, wherein the pegRNA comprises a reverse transcription template comprising a sequence of about 26 nucleotides, wherein the sequence encodes a sequence that comprises one or more nucleotide edits compared to a sequence directly upstream of a CAG repeat in exon 1 of the HTT gene. 10. The pegRNA of any one of claims 1-9, wherein the pegRNA comprises a reverse transcription template comprising an edit template encoding or comprising one or more repeats of the trinucleotide sequence CAG. 11. The pegRNA of claim 10, wherein the edit template encodes or comprises 4-35 repeats of the trinucleotide sequence CAG, or wherein the edit template encodes or comprises 4-10 repeats of the trinucleotide sequence CAG. 12. The pegRNA of claim 11, wherein the pegRNA comprises a reverse transcription template comprising an edit template encoding or comprising the nucleotide sequence CAGCAGCAGCAG (SEQ ID NO: 874), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 874. 13. The pegRNA of claim 11, wherein the pegRNA comprises a reverse transcription template comprising an edit template encoding or comprising the nucleotide sequence CAGCAGCAGCAGCAGCAGCAGCAGCAG (SEQ ID NO: 875), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 875. 14. The pegRNA of any one of claims 10-13, wherein the edit template further encodes or comprises one or more nucleotides of the trinucleotide sequence CAA. 15. The pegRNA of any one of claims 10-14, wherein the pegRNA comprises a reverse transcription template comprising an edit template encoding or comprising the nucleotide sequence CAGCAGCAGCAGCAACAACAA (SEQ ID NO: 876), or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 876. 16. The pegRNA of any one of claims 1-15, wherein the pegRNA comprises a reverse transcription template comprising a homology arm of about 16 to about 50 nucleotides in length, wherein the homology arm is complementary to a sequence directly downstream of a CAG repeat in exon 1 of a wildtype HTT gene in the coding strand. 17. The pegRNA of claim 16, wherein the homology arm is about 31 nucleotides in length. 18. The pegRNA of claim 16, wherein the homology arm is about 40 nucleotides in length. 19. The pegRNA of any one of claims 1-18, wherein the pegRNA is an engineered pegRNA (epegRNA). 20. The pegRNA of any one of claims 1-19, wherein the pegRNA comprises at its 3' end a structural motif that improves stability of the pegRNA. 21. The pegRNA of claim 19 or 20, wherein the pegRNA comprises an evopreq1 motif. 22. The pegRNA of claim 21, wherein the evopreq1 motif comprises the sequence of SEQ ID NO: 442, a sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence of SEQ ID NO: 442, or a sequence having 1, 2, 3, 4, or 5 mutations relative to SEQ ID NO: 442. 23. The pegRNA of any one of claims 1-22, wherein the pegRNA comprises a UA flip in the pegRNA scaffold sequence. 24. The pegRNA of any one of claims 1-23, wherein the pegRNA comprises the sequence of any one of SEQ ID NOs: 454-815, or a sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence of any one of SEQ ID NOs: 454-815. 25. A composition comprising (i) a prime editor comprising a nucleic acid programmable DNA binding protein (napDNAbp) and a DNA polymerase domain, and (ii) a pegRNA of any one of claims 1-24. 26. The composition of claim 25, wherein the prime editor comprises PE2, PE2max, PE3, PE3max, PE3b, or PE3bmax architecture. 27. The composition of claim 25 or 26, wherein the prime editor comprises a Cas9 protein. 28. The composition of claim 27, wherein the Cas9 protein comprises an inactivating mutation in an HNH nuclease domain. 29. The composition of claim 27 or 28, wherein the Cas9 protein comprises the amino acid sequence of SEQ ID NO: 6, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the amino acid sequence of SEQ ID NO: 6. 30. The composition of any one of claims 27-29, wherein the Cas9 protein comprises a K775R substitution and/or a K918A substitution relative to the amino acid sequence of SEQ ID NO: 6. 31. The composition of any one of claims 27-29, wherein the Cas9 protein comprises a D23G substitution and/or an H754R substitution relative to the amino acid sequence of SEQ ID NO: 6. 32. The composition of any one of claims 25-31, wherein the DNA polymerase is a reverse transcriptase. 33. The composition of claim 32, wherein the reverse transcriptase is an MMLV reverse transcriptase. 34. The composition of claim 33, wherein the RNaseH domain of the MMLV reverse transcriptase is truncated relative to the full-length wild type MMLV reverse transcriptase. 35. The composition of claim 34, wherein the RNaseH domain of the MMLV reverse transcriptase is truncated between amino acids D497 and I498 relative to the amino acid sequence of SEQ ID NO: 33 or 34, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the amino acid sequence of SEQ ID NO: 33 or 34. 36. The composition of any one of claims 33-35, wherein the MMLV reverse transcriptase comprises the amino acid substitution V223Y relative to the amino acid sequence of SEQ ID NO: 33 or 34. 37. The composition of any one of claims 33-36, wherein the MMLV reverse transcriptase comprises the amino acid sequence of SEQ ID NO: 33 or 34, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the amino acid sequence of SEQ ID NO: 33 or 34. 38. The composition of any one of claims 25-36, wherein the reverse transcriptase is a Tf1 reverse transcriptase. 39. The composition of claim 38, wherein the Tf1 reverse transcriptase comprises the amino acid sequence of SEQ ID NO: 55, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the amino acid sequence of SEQ ID NO: 55. 40. The composition of any one of claims 25-40 further comprising a nicking gRNA. 41. The composition of claim 40, wherein the spacer of the nicking gRNA comprises the nucleotide sequence: or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any of these SEQ ID NOs. 42. The composition of claim 40 or 41, wherein the spacer of the nicking gRNA comprises the nucleotide sequence GCTGCTGCTGCTGCTGCTGC (SEQ ID NO: 384), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 384. 43. The composition of claim 40 or 41, wherein the spacer of the nicking gRNA comprises the nucleotide sequence GCTGCTGGAAGGACTTGAG (SEQ ID NO: 406), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 406. 44. A method of treating Huntington’s disease by prime editing comprising contacting a target nucleotide sequence with a composition of any one of claims 25-43. 45. The method of claim 44, wherein the contacting results in the contraction or replacement of a CAG repeat sequence in the target nucleotide sequence. 46. The method of claim 45, wherein the CAG repeat sequence is contracted from greater than 35 repeats to 35 or fewer repeats. 47. The method of claim 45, wherein the CAG repeat sequence is greater than 35 repeats in length and is replaced with a CAG repeat sequence of 35 or fewer repeats in length. 48. The method of any one of claims 45-47, wherein the CAG repeat sequence is contracted to four repeats or replaced with a sequence comprising four CAG repeats. 49. The method of any one of claims 44-48 further comprising nicking the non-PAM- containing strand of the target nucleotide sequence using a nicking gRNA. 50. The method of claim 49, wherein the spacer of the nicking gRNA comprises the nucleotide sequence: or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any of these SEQ ID NOs. 51. The method of claim 49 or 50, wherein the spacer of the nicking gRNA comprises the nucleotide sequence GCTGCTGCTGCTGCTGCTGC (SEQ ID NO: 384), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 384. 52. The method of claim 49 or 50, wherein the spacer of the nicking gRNA comprises the nucleotide sequence GCTGCTGGAAGGACTTGAG (SEQ ID NO: 406), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 406. 53. The method of any one of claims 44-52, wherein the contacting is performed in a cell. 54. The method of claim 53, wherein the cell is a eukaryotic cell. 55. The method of claim 53 or 54, wherein the cell is a human cell. 56. The method of any one of claims 53-55, wherein the cell is in vitro. 57. The method of any one of claims 53-55, wherein the cell is ex vivo. 58. The method of any one of claims 53-55, wherein the cell is in a subject. 59. The method of claim 58, wherein the subject is a human. 60. The method of any one of claims 53-59, wherein one or more polynucleotides encoding the composition are delivered to the cell. 61. The method of claim 60, wherein the one or more polynucleotides encoding the composition are delivered to the cell in one or more adeno-associated virus (AAV) particles. 62. The method of claim 61, wherein the one or more polynucleotides encoding the composition are delivered to the cell in two AAV particles. 63. The method of claim 61 or 62, wherein the AAV particles comprise AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. 64. The method of any one of claims 61-63, wherein the AAV particles comprise AAV9. 65. The method of any one of claims 61-64, wherein a first and second AAV particle are delivered to the cell, wherein the first AAV particle comprises a polynucleotide comprising the structure 5′-[inverted terminal repeat (ITR) sequence]-[promoter]-[napDNAbp N-terminal fragment]-[N-intein]-[terminator sequence]-[ITR sequence]-3′, and wherein the second AAV particle comprises a polynucleotide comprising the structure 5′-[ITR sequence]-[promoter]- [C-intein]-[napDNAbp C-terminal fragment]-[reverse transcriptase]-[terminator sequence]- [optional nicking gRNA]-[pegRNA]-[ITR]-3′. 66. A polynucleotide encoding the pegRNA of any one of claims 1-24. 67. One or more polynucleotides encoding the prime editor and the pegRNA of the composition of any one of claims 25-43. 68. A vector comprising the polynucleotide of claim 66. 69. One or more vectors comprising the one or more polynucleotides of claim 67. 70. One or more AAV particles comprising the polynucleotide of claim 66, the one or more polynucleotides of claim 67, the vector of claim 68, or the one or more vectors of claim 69. 71. The one or more AAV particles of claim 70, wherein the AAV particles comprise AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. 72. The one or more AAV particles of claim 70 or 71, wherein the AAV particles comprise AAV9. 73. The one or more AAV particles of any one of claims 70-72 comprising a first AAV particle and a second AAV particle, wherein the first AAV particle comprises a polynucleotide comprising the structure 5′-[inverted terminal repeat (ITR) sequence]- [promoter]-[napDNAbp N-terminal fragment]-[N-intein]-[terminator sequence]-[ITR sequence]-3′, and wherein the second AAV particle comprises a polynucleotide comprising the structure 5′-[ITR sequence]-[promoter]-[C-intein]-[napDNAbp C-terminal fragment]- [reverse transcriptase]-[terminator sequence]-[optional nicking gRNA]-[pegRNA]-[ITR]-3′. 74. A pharmaceutical composition comprising the pegRNA of any one of claims 1-24, the composition of any one of claims 25-43, the one or more polynucleotides of claim 66 or 67, the one or more vectors of claim 68 or 69, or the one or more AAV particles of any one of claims 70-73. 75. A cell comprising the pegRNA of any one of claims 1-24, the composition of any one of claims 25-43, the one or more polynucleotides of claim 66 or 67, the one or more vectors of claim 68 or 69, or the one or more AAV particles of any one of claims 70-73. 76. A kit comprising the pegRNA of any one of claims 1-24, the composition of any one of claims 25-43, the one or more polynucleotides of claim 66 or 67, the one or more vectors of claim 68 or 69, or the one or more AAV particles of any one of claims 70-73. 77. Use of the pegRNA of any one of claims 1-24, the composition of any one of claims 25-43, the one or more polynucleotides of claim 66 or 67, the one or more vectors of claim 68 or 69, the one or more AAV particles of any one of claims 70-73, or the pharmaceutical composition of claim 74 for the treatment of Huntington’s disease. 78. Use of the pegRNA of any one of claims 1-24, the composition of any one of claims 25-43, the one or more polynucleotides of claim 66 or 67, the one or more vectors of claim 68 or 69, the one or more AAV particles of any one of claims 70-73, or the pharmaceutical composition of claim 74 in the manufacture of a medicament for the treatment of Huntington’s disease. 79. A pegRNA comprising a spacer sequence comprising the nucleic acid sequence: or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 388-392. 80. The pegRNA of claim 79, wherein the pegRNA comprises the spacer sequence GCAAGACTAACCTGGCCAACA (SEQ ID NO: 388), or a spacer sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 388. 81. The pegRNA of claim 79 or 80, wherein the pegRNA comprises a primer binding site (PBS) of about 8 to about 14 nucleotides in length. 82. The pegRNA of any one of claims 79-81, wherein the pegRNA comprises a PBS of about 8, about 9, about 10, about 11, about 12, about 13, or about 14 nucleotides in length. 83. The pegRNA of any one of claims 79-82, wherein the pegRNA comprises a PBS of about 10 nucleotides in length. 84. The pegRNA of any one of claims 79-83, wherein the pegRNA comprises a reverse transcription template comprising a homology arm of about 8 to about 50 nucleotides in length, wherein the homology arm is complementary to a sequence directly downstream of a GAA repeat in a wildtype FXN gene. 85. The pegRNA of any one of claims 79-84, wherein the homology arm is about 40 nucleotides in length. 86. The pegRNA of any one of claims 79-85, wherein the pegRNA is an engineered pegRNA (epegRNA). 87. The pegRNA of any one of claims 79-86, wherein the pegRNA comprises at its 3' end a structural motif that improves stability of the pegRNA. 88. The pegRNA of claim 87, wherein the pegRNA comprises an evopreq1 motif. 89. The pegRNA of claim 88, wherein the evopreq1 motif comprises the sequence of SEQ ID NO: 442, a sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence of SEQ ID NO: 442, or a sequence having 1, 2, 3, 4, or 5 mutations relative to SEQ ID NO: 442. 90. The pegRNA of any one of claims 79-89, wherein the pegRNA comprises a UA flip in the pegRNA scaffold sequence. 91. The pegRNA of any one of claims 79-90, wherein the pegRNA comprises a gRNA core sequence selected from the group consisting of SEQ ID NOs: 393-413. 92. A composition comprising (i) a prime editor comprising a nucleic acid programmable DNA binding protein (napDNAbp) and a DNA polymerase domain, and (ii) a pegRNA of any one of claims 79-91. 93. The composition of claim 92, wherein the prime editor comprises PE2, PE2max, PE3, PE3max, PE3b, or PE3bmax architecture. 94. The composition of claim 92 or 93, wherein the prime editor comprises a Cas9 protein. 95. The composition of claim 94, wherein the Cas9 protein comprises an inactivating mutation in an HNH domain. 96. The composition of claim 94 or 95, wherein the Cas9 protein comprises the amino acid sequence of SEQ ID NO: 6, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the amino acid sequence of SEQ ID NO: 6. 97. The composition of any one of claims 94-96, wherein the Cas9 protein comprises a K775R substitution and/or a K918A substitution relative to the amino acid sequence of SEQ ID NO: 6. 98. The composition of any one of claims 94-96, wherein the Cas9 protein comprises a D23G substitution and/or an H754R substitution relative to the amino acid sequence of SEQ ID NO: 6. 99. The composition of any one of claims 92-98, wherein the DNA polymerase is a reverse transcriptase. 100. The composition of claim 99, wherein the reverse transcriptase is an MMLV reverse transcriptase. 101. The composition of claim 100, wherein the RNaseH domain of the MMLV reverse transcriptase is truncated relative to the full-length wild type MMLV reverse transcriptase. 102. The composition of claim 101, wherein the RNaseH domain of the MMLV reverse transcriptase is truncated between amino acids D497 and I498 relative to the amino acid sequence of SEQ ID NO: 33 or 34, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the amino acid sequence of SEQ ID NO: 33 or 34. 103. The composition of claim 101 or 102, wherein the MMLV reverse transcriptase comprises the amino acid substitution V223Y relative to the amino acid sequence of SEQ ID NO: 33 or 34. 104. The composition of any one of claims 100-103, wherein the MMLV reverse transcriptase comprises the amino acid sequence of SEQ ID NO: 33 or 34, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the amino acid sequence of SEQ ID NO: 33 or 34. 105. The composition of claim 99, wherein the reverse transcriptase is a Tf1 reverse transcriptase. 106. The composition of claim 105, wherein the Tf1 reverse transcriptase comprises the amino acid sequence of SEQ ID NO: 55, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the amino acid sequence of SEQ ID NO: 55. 107. The composition of any one of claims 92-106 further comprising a nicking gRNA. 108. The composition of claim 107, wherein the spacer of the nicking gRNA comprises the nucleotide sequence: or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 410-413. 109. The composition of claim 107 or 108, wherein the spacer of the nicking gRNA comprises the nucleotide sequence GTGTATTTTTTAGTAGATACT (SEQ ID NO: 411), or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 411. 110. The composition of claim 107 or 108, wherein the spacer of the nicking gRNA comprises the nucleotide sequence GCGACACCACGCCCGGCTAAC (SEQ ID NO: 413), or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 413. 111. A method of treating Friedreich’s ataxia by prime editing comprising contacting a target nucleotide sequence with a composition of any one of claims 92-110. 112. The method of claim 111, wherein the contacting results in the deletion of a GAA repeat sequence in the target nucleotide sequence. 113. The method of claim 112, wherein the GAA repeat sequence deleted comprises greater than 65 GAA repeats. 114. The method of any one of claims 111-113 further comprising nicking the non-PAM- containing strand of the target nucleotide sequence using a nicking gRNA. 115. The method of claim 114, wherein the spacer of the nicking gRNA comprises the nucleotide sequence: or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 410-413. 116. The method of claim 114 or 115, wherein the spacer of the nicking gRNA comprises the nucleotide sequence GTGTATTTTTTAGTAGATACT (SEQ ID NO: 411), or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 411. 117. The method of claim 114 or 115, wherein the spacer of the nicking gRNA comprises the nucleotide sequence GCGACACCACGCCCGGCTAAC (SEQ ID NO: 413), or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 413. 118. The method of any one of claims 111-117, wherein the contacting is performed in a cell. 119. The method of claim 118, wherein the cell is a eukaryotic cell. 120. The method of claim 118 or 119, wherein the cell is a human cell. 121. The method of any one of claims 118-120, wherein the cell is in vitro. 122. The method of any one of claims 118-120, wherein the cell is ex vivo. 123. The method of any one of claims 118-122, wherein the cell is in a subject. 124. The method of claim 123, wherein the subject is a human. 125. The method of any one of claims 118-124, wherein one or more polynucleotides encoding the composition are delivered to the cell. 126. The method of claim 125, wherein the one or more polynucleotides encoding the composition are delivered to the cell in one or more adeno-associated virus (AAV) particles. 127. The method of claim 126, wherein the one or more polynucleotides encoding the composition are delivered to the cell in two AAV particles. 128. The method of claim 126 or 127, wherein the AAV particles comprise AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. 129. The method of any one of claims 126-128, wherein the AAV particles comprise AAV9. 130. The method of any one of claims 126-129, wherein a first and second AAV particle are delivered to the cell, wherein the first AAV particle comprises a polynucleotide comprising the structure 5′-[inverted terminal repeat (ITR) sequence]-[promoter]- [napDNAbp N-terminal fragment]-[N-intein]-[terminator sequence]-[ITR sequence]-3′, and wherein the second AAV particle comprises a polynucleotide comprising the structure 5′- [ITR sequence]-[promoter]-[C-intein]-[napDNAbp C-terminal fragment]-[reverse transcriptase]-[terminator sequence]-[optional nicking gRNA]-[pegRNA]-[ITR]-3′. 131. A polynucleotide encoding the pegRNA of any one of claims 79-91. 132. One or more polynucleotides encoding the prime editor and the pegRNA of the composition of any one of claims 92-110. 133. A vector comprising the polynucleotide of claim 131. 134. One or more vectors comprising the one or more polynucleotides of claim 132. 135. One or more AAV particles comprising the polynucleotide of claim 131, the one or more polynucleotides of claim 132, the vector of claim 133, or the one or more vectors of claim 134. 136. The one or more AAV particles of claim 135, wherein the AAV particles comprise AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. 137. The one or more AAV particles of claim 135 or 136, wherein the AAV particles comprise AAV9. 138. The one or more AAV particles of any one of claims 135-137 comprising a first AAV particle and a second AAV particle, wherein the first AAV particle comprises a polynucleotide comprising the structure 5′-[inverted terminal repeat (ITR) sequence]- [promoter]-[napDNAbp N-terminal fragment]-[N-intein]-[terminator sequence]-[ITR sequence]-3′, and wherein the second AAV particle comprises a polynucleotide comprising the structure 5′-[ITR sequence]-[promoter]-[C-intein]-[napDNAbp C-terminal fragment]- [reverse transcriptase]-[terminator sequence]-[optional nicking gRNA]-[pegRNA]-[ITR]-3′. 139. A pharmaceutical composition comprising the pegRNA of any one of claims 79-91, the composition of any one of claims 92-110, the one or more polynucleotides of claim 131 or 132, the one or more vectors of claim 133 or 134, or the one or more AAV particles of any one of claims 135-138. 140. A cell comprising the pegRNA of any one of claims 79-91, the composition of any one of claims 92-110, the one or more polynucleotides of claim 131 or 132, the one or more vectors of claim 133 or 134, or the one or more AAV particles of any one of claims 135-138. 141. A kit comprising the pegRNA of any one of claims 79-91, the composition of any one of claims 92-110, the one or more polynucleotides of claim 131 or 132, the one or more vectors of claim 133 or 134, or the one or more AAV particles of any one of claims 135-138. 142. Use of the pegRNA of any one of claims 79-91, the composition of any one of claims 92-110, the one or more polynucleotides of claim 131 or 132, the one or more vectors of claim 133 or 134, the one or more AAV particles of any one of claims 135-138, or the pharmaceutical composition of claim 139 in the treatment of Friedreich’s ataxia. 143. Use of the pegRNA of any one of claims 79-91, the composition of any one of claims 92-110, the one or more polynucleotides of claim 131 or 132, the one or more vectors of claim 133 or 134, the one or more AAV particles of any one of claims 135-138, or the pharmaceutical composition of claim 139 in the manufacture of a medicament for the treatment of Friedreich’s ataxia. 144. A system for prime editing the HTT gene comprising: a) a pegRNA comprising i) a spacer sequence comprising the sequence GACCCTGGAAAAGCTGAT (SEQ ID NO: 873), and ii) a reverse transcription template comprising an edit template comprising the sequence CAGCAGCAGCAG (SEQ ID NO: 874), CAGCAGCAGCAGCAGCAGCAGCAGCAG (SEQ ID NO: 875), or CAGCAGCAGCAGCAACAACAA (SEQ ID NO: 876); b) a PE3bmax prime editor, optionally wherein the PE3bmax prime editor comprises the amino acid substitutions D23G and H754R within a Cas9 protein and V223Y within an MMLV reverse transcriptase; and c) a nicking guide RNA comprising a spacer sequence comprising the sequence GCTGCTGCTGCTGCTGCTGC (SEQ ID NO: 384) or GCTGCTGGAAGGACTTGAG (SEQ ID NO: 406). 145. A system for prime editing the FXN gene comprising: a) a pegRNA comprising i) a spacer sequence comprising the sequence GCAAGACTAACCTGGCCAACA (SEQ ID NO: 388); b) a PE3bmax prime editor, optionally wherein the PE3bmax prime editor comprises the amino acid substitution V223Y within an MMLV reverse transcriptase; and c) a nicking guide RNA comprising a spacer sequence comprising the sequence GTGTATTTTTTAGTAGATACT (SEQ ID NO: 411) or GCGACACCACGCCCGGCTAAC (SEQ ID NO: 413). 146. The method of any one of claims 60-65 or 125-130, wherein the composition is delivered to the brain of a subject. 147. The method of any one of claims 60-65, 125-130, or 146, wherein the composition is delivered by intracerebroventricular injection (ICV). |
KEY: BIPARTITE SV40 NUCLEAR LOCALIZATION SEQUENCE (NLS) TOP: (SEQ ID NO: 95), CAS9(R221K N39K H840A) (SEQ ID NO: 2) SGGSx2-BIPARTITE SV40NLS-SGGSx2 LINKER (SEQ ID NO: 79) M-MLV reverse transcriptase(D200N T306K W313F T330P L603W) (SEQ ID NO: 34) Other linker sequence (SEQ ID NOs: 82) BIPARTITE SV40NLS (SEQ ID NO: 97) Other linker sequence c-Myc NLS (SEQ ID NO: 98) PE3max and PE3bmax [0146] As used herein, “PE3max” refers to a prime editing composition comprising a PEmax protein, a desired pegRNA, and a second strand nicking guide RNA. In some embodiments, PE3max can be considered as PE3 except wherein the PE2 component is substituted with PEmax. “PE3bmax” refers to a prime editing composition comprising a PEmax protein, a desired pegRNA, and a second strand nicking guide RNA, wherein the second-strand nicking guide RNA is designed for temporal control such that the second strand nick is not introduced until after the installation of the desired edit. This is achieved by designing the second strand nicking guide RNA with a spacer sequence that comprise complementarity to, and hybridizes with, only the edited strand after installation of the desired nucleotide edit(s), but not the endogenous target DNA sequence. PE4max [0147] As used herein, “PE4max” refers to PE4 but the PE2 component is substituted with PEmax. PE5max and PE5bmax [0148] As used herein, “PE5max” refers to PE5, but the PE2 component of PE3 is substituted with PEmax. “PE5bmax” refers to PE5b, but the PE2 component of PE3 is substituted with PEmax. PE6 [0149] The term “PE6” refers to a suite of next-generation prime editors described herein (PE6a, PE6b, PE6c, PE6d, PE6e, PE6f, and PE6g) comprising improved reverse transcriptase and/or Cas9 variants. The improved reverse transcriptase and Cas9 domains of the PE6 variants can also be combined with each other to offer cumulative benefits. For example, a PE6 prime editor comprising an improved reverse transcriptase variant of PE6a and an improved Cas9 variant of PE6e is referred to herein as the prime editor “PE6a-e” (or “PE6e- a”). Any possible combination of PE6 prime editors is contemplated by the present disclosure including, for example, PE6a-e, PE6a-f, PE6a-g, PE6b-e, PE6b-f, PE6b-g, PE6c-e, PE6c-f, PE6c-g, PE6d-e, PE6d-f, and PE6d-g. [0150] PE6a comprises a reverse transcriptase variant comprising the amino acid substitutions E60K, K87E, E165D, D243N, R267I, E279K, K318E, and K343N relative to an Ec48 reverse transcriptase (SEQ ID NO: 59). PE6b comprises a reverse transcriptase variant comprising the amino acid substitutions P70T, G72V, S87G, M102I, K106R, K118R, I128V, L158Q, F269L, A363V, K413E, and S492N relative to a Tf1 reverse transcriptase (SEQ ID NO: 55). PE6c comprises a reverse transcriptase variant comprising the amino acid substitutions P70T, G72V, S87G, M102I, K106R, K118R, I128V, L158Q, S188K, I260L, F269L, R288Q, S297Q, A363V, K413E, and S492N relative to a Tf1 reverse transcriptase (SEQ ID NO: 55). PE6d comprises a reverse transcriptase variant comprising the amino acid substitutions T128N, D200C, and V223Y (and the substitutions T306K, W313F, and T330P used in the MMLV reverse transcriptase of PE2 and PEmax) relative to a MMLV reverse transcriptase (SEQ ID NO: 33) with a truncation of the C-terminal RnaseH domain (e.g., between D497 and I498 of SEQ ID NO: 33). PE6e comprises a Cas9 variant comprising the amino acid substitutions K775R and K918A relative to wild type Streptococcus pyogenes Cas9 or Streptococcus pyogenes Cas9 nickase (SEQ ID NO: 7). PE6f comprises a Cas9 variant comprising the amino acid substitutions H99R, E471K, I632V, D645N, H721Y, and K918A relative to wild type Streptococcus pyogenes Cas9 or Streptococcus pyogenes Cas9 nickase (SEQ ID NO: 7). PE6g comprises a Cas9 variant comprising the amino acid substitutions H99R, E471K, I632V, D645N, R654C, and H721Y relative to wild type Streptococcus pyogenes Cas9 or Streptococcus pyogenes Cas9 nickase (SEQ ID NO: 7). Any of the PE6 prime editors provided herein may also comprise the architecture of the PEmax protein as provided below. In some embodiments, any of the PE6 prime editors provided herein may further comprise additional amino acid mutations, e.g., any of those included in PEmax as provided below. PE7 [0151] The term “PE7” refers to the PE6 prime editors plus a second strand nicking guide RNA. For example, “PE7a” refers to the PE6a prime editor as provided herein, plus a second strand nicking guide RNA. Polymerase [0152] As used herein, the term “polymerase” refers to an enzyme that synthesizes a nucleotide strand and that may be used in connection with the prime editor delivery systems described herein. The polymerase can be a “template-dependent” polymerase (i.e., a polymerase that synthesizes a nucleotide strand based on the order of nucleotide bases of a template strand). The polymerase can also be a “template-independent” polymerase (i.e., a polymerase that synthesizes a nucleotide strand without the requirement of a template strand). A polymerase may also be further categorized as a “DNA polymerase” or an “RNA polymerase.” In various embodiments, the prime editor system comprises a DNA polymerase. In various embodiments, the DNA polymerase can be a “DNA-dependent DNA polymerase” (i.e., whereby the template molecule is a strand of DNA). In such cases, the DNA template molecule can be a PEgRNA, wherein the extension arm comprises a strand of DNA. In such cases, the PEgRNA may be referred to as a chimeric or hybrid PEgRNA which comprises an RNA portion (i.e., the guide RNA components, including the spacer and the gRNA core) and a DNA portion (i.e., the extension arm). In various other embodiments, the DNA polymerase can be an “RNA-dependent DNA polymerase” (i.e., whereby the template molecule is a strand of RNA). In such cases, the PEgRNA is RNA, i.e., including an RNA extension. The term “polymerase” may also refer to an enzyme that catalyzes the polymerization of nucleotides (i.e., the polymerase activity). Generally, the enzyme will initiate synthesis at the 3′-end of a primer annealed to a polynucleotide template sequence (e.g., such as a primer sequence annealed to the primer binding site of a PEgRNA) and will proceed toward the 5′ end of the template strand. A “DNA polymerase” catalyzes the polymerization of deoxynucleotides. As used herein in reference to a DNA polymerase, the term DNA polymerase includes a “functional fragment thereof.” A “functional fragment thereof” refers to any portion of a wild-type or mutant DNA polymerase that encompasses less than the entire amino acid sequence of the polymerase and which retains the ability, under at least one set of conditions, to catalyze the polymerization of a polynucleotide. Such a functional fragment may exist as a separate entity, or it may be a constituent of a larger polypeptide, such as a fusion protein. Prime editing [0153] As used herein, the term “prime editing” refers to an approach for gene editing using napDNAbps, a polymerase (e.g., a reverse transcriptase), and specialized guide RNAs that include a primer binding site and a DNA synthesis template for encoding desired new genetic information (or deleting genetic information) that is then incorporated into a target DNA sequence. Prime editing is described in Anzalone, A. V. et al., Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576, 149–157 (2019), which is incorporated herein by reference. [0154] Prime editing represents a platform for genome editing that is a versatile and precise method to directly write new genetic information into a specified DNA site using a nucleic acid programmable DNA binding protein (“napDNAbp”) working in association with a polymerase (i.e., in the form of a fusion protein or otherwise provided in trans with the napDNAbp), wherein the prime editing system is programmed with a prime editing (PE) guide RNA (“PEgRNA”) that both specifies the target site and templates the synthesis of the desired edit in the form of a replacement DNA strand by way of an extension (either DNA or RNA) engineered onto a guide RNA (e.g., at the 5ʹ or 3ʹ end, or at an internal portion of a guide RNA). The replacement strand containing the desired edit (e.g., a single nucleobase substitution) shares the same sequence as the endogenous strand (or is homologous to it) immediately downstream of the nick site of the target site to be edited (with the exception that it includes the desired edit). Through DNA repair and/or replication machinery, the endogenous strand downstream of the nick site is replaced by the newly synthesized replacement strand containing the desired edit. In some cases, prime editing may be thought of as a “search-and-replace” genome editing technology since the prime editors, as described herein, not only search and locate the desired target site to be edited, but at the same time, encode a replacement strand containing a desired edit that is installed in place of the corresponding target site endogenous DNA strand. The prime editors of the present disclosure relate, in part, to the discovery that the mechanism of target-primed reverse transcription (TPRT) or “prime editing” can be leveraged or adapted for conducting precision CRISPR/Cas-based genome editing with high efficiency and genetic flexibility. TPRT is naturally used by mobile DNA elements, such as mammalian non-LTR retrotransposons and bacterial Group II introns. Cas protein-reverse transcriptase fusions or related systems are used to target a specific DNA sequence with a guide RNA, generate a single strand nick at the target site, and use the nicked DNA as a primer for reverse transcription of an engineered reverse transcriptase template that is integrated with the guide RNA. However, while the concept begins with prime editors that use reverse transcriptase as the DNA polymerase component, the prime editors described herein are not limited to reverse transcriptases but may include the use of virtually any DNA polymerase. Indeed, while the application throughout may refer to prime editors with “reverse transcriptases,” it is set forth here that reverse transcriptases are only one type of DNA polymerase that may work with prime editing. Thus, wherever the specification mentions a “reverse transcriptase,” the person having ordinary skill in the art should appreciate that any suitable DNA polymerase may be used in place of the reverse transcriptase. Thus, in one aspect, the prime editors may comprise Cas9 (or an equivalent napDNAbp), which is programmed to target a DNA sequence by associating it with a specialized guide RNA (i.e., PEgRNA) containing a spacer sequence that anneals to a complementary sequence (the complementary sequence to an endogenous protospacer sequence) in the target DNA. The PEgRNA also contains new genetic information in the form of an extension that encodes a replacement strand of DNA containing a desired nucleotide change which is used to replace a corresponding endogenous DNA strand at the target site. To transfer information from the PEgRNA to the target DNA, the mechanism of prime editing involves nicking the target site in one strand of the DNA to expose a 3′-hydroxyl group. The exposed 3′-hydroxyl group can then be used to prime the DNA polymerization of the edit-encoding extension on PEgRNA directly into the target site. In various embodiments, the extension—which provides the template for polymerization of the replacement strand containing the edit—can be formed from RNA or DNA. In the case of an RNA extension, the polymerase of the prime editor can be an RNA-dependent DNA polymerase (such as a reverse transcriptase). In the case of a DNA extension, the polymerase of the prime editor may be a DNA-dependent DNA polymerase. The newly synthesized strand (i.e., the replacement DNA strand containing the desired nucleotide edit) that is formed by the prime editor would be homologous to the genomic target sequence (i.e., have the same sequence as), except for the inclusion of one or more desired nucleotide changes (e.g., a single nucleotide substitution, a deletion, or an insertion, or a combination thereof). The newly synthesized (or replacement) strand of DNA may also be referred to as a single strand DNA flap, which would compete for hybridization with the complementary homologous endogenous DNA strand, thereby displacing the corresponding endogenous strand. Resolution of the hybridized intermediate (also referred to as a heteroduplex, comprising the single strand DNA flap synthesized by the reverse transcriptase hybridized to the endogenous DNA strand with the exception of mismatches at positions where desired nucleotide edits are installed in the edit strand) can include removal of the resulting displaced flap of endogenous DNA (e.g., with a 5ʹ end DNA flap endonuclease, FEN1), ligation of the synthesized single strand DNA flap to the target DNA, and assimilation of the desired nucleotide changes as a result of cellular DNA repair and/or replication processes. Because templated DNA synthesis offers single nucleotide precision for the modification of any nucleotide, including insertions and deletions, the scope of this approach is very broad and could foreseeably be used for myriad applications in basic science and therapeutics. In certain embodiments, the system can be combined with the use of an error-prone reverse transcriptase enzyme (e.g., provided as a fusion protein with the Cas9 domain, or provided in trans to the Cas9 domain). The error- prone reverse transcriptase enzyme can introduce alterations during synthesis of the single strand DNA flap. Thus, in certain embodiments, error-prone reverse transcriptase can be utilized to introduce nucleotide changes to the target DNA. Depending on the error-prone reverse transcriptase that is used with the system, the changes can be random or non-random. [0155] In various embodiments, prime editing operates by contacting a target DNA molecule (for which a change in the nucleotide sequence is desired to be introduced) with a nucleic acid programmable DNA binding protein (napDNAbp) complexed with a prime editing guide RNA (PEgRNA). In various embodiments, the prime editing guide RNA (PEgRNA) comprises an extension at the 3′ or 5′ end of the guide RNA, or at an intramolecular location in the guide RNA, and encodes the desired nucleotide change (e.g., single nucleotide substitution, insertion, or deletion). First, the napDNAbp/extended gRNA complex contacts the DNA molecule, and the extended gRNA guides the napDNAbp to bind to a target locus. Next, a nick in one of the strands of DNA of the target locus is introduced (e.g., by a nuclease or chemical agent), thereby creating an available 3′ end in one of the strands of the target locus. In certain embodiments, the nick is created in the strand of DNA that corresponds to the R-loop strand, i.e., the strand that is not hybridized to the guide RNA sequence, i.e., the “non-target strand.” The nick, however, could be introduced in either of the strands. That is, the nick could be introduced into the R-loop “target strand” (i.e., the strand hybridized to the protospacer of the extended gRNA) or the “non-target strand” (i.e., the strand forming the single-stranded portion of the R-loop and which is complementary to the target strand). In the next step, the 3′ end of the DNA strand (formed by the nick) interacts with the extended portion of the guide RNA in order to prime reverse transcription (i.e., “target-primed RT”). In certain embodiments, the 3′ end DNA strand hybridizes to a specific RT priming sequence on the extended portion of the guide RNA, i.e., the “reverse transcriptase priming sequence” or “primer binding site” on the PEgRNA. In the next step, a reverse transcriptase (or other suitable DNA polymerase) is introduced that synthesizes a single strand of DNA from the 3′ end of the primed site towards the 5′ end of the prime editing guide RNA. The DNA polymerase (e.g., reverse transcriptase) can be fused to the napDNAbp or alternatively can be provided in trans to the napDNAbp. This forms a single-strand DNA flap comprising the desired nucleotide change (e.g., the single base change, insertion, or deletion, or a combination thereof) and that is otherwise homologous to the endogenous DNA at or adjacent to the nick site. In the next step, the napDNAbp and guide RNA are released. The final two steps relate to the resolution of the single strand DNA flap such that the desired nucleotide change becomes incorporated into the target locus. This process can be driven towards the desired product formation by removing the corresponding 5′ endogenous DNA flap that forms once the 3′ single strand DNA flap invades and hybridizes to the endogenous DNA sequence. Without being bound by theory, the cell’s endogenous DNA repair and replication processes resolve the mismatched DNA to incorporate the nucleotide change(s) to form the desired altered product. The process can also be driven towards product formation with “second strand nicking.” This process may introduce at least one or more of the following genetic changes: transversions, transitions, deletions, and insertions. [0156] The term “prime editor (PE) system” or “prime editor (PE)” or “PE system” or “PE editing system” refers the compositions involved in the method of genome editing using target-primed reverse transcription (TPRT) described herein, including, but not limited to, the napDNAbps, reverse transcriptases, fusion proteins (e.g., comprising napDNAbps and reverse transcriptases), prime editing guide RNAs, and complexes comprising fusion proteins and prime editing guide RNAs, as well as accessory elements, such as second strand nicking components (e.g., second strand nicking sgRNAs) and 5′ endogenous DNA flap removal endonucleases (e.g., FEN1) for helping to drive the prime editing process towards the edited product formation. [0157] Although in the embodiments described thus far the PEgRNA constitutes a single molecule comprising a guide RNA (which itself comprises a spacer sequence and a gRNA core or scaffold) and a 5ʹ or 3ʹ extension arm comprising the primer binding site and a DNA synthesis template, the PEgRNA may also take the form of two individual molecules. For example, in some embodiments, a PEgRNA may comprise a guide RNA and a trans prime editor RNA template (tPERT), which essentially houses the extension arm (including, in particular, the primer binding site and the DNA synthesis domain) and an RNA-protein recruitment domain (e.g., MS2 aptamer or hairpin) in the same molecule which becomes co- localized or recruited to a modified prime editor complex that comprises a tPERT recruiting protein (e.g., MS2cp protein, which binds to the MS2 aptamer). [0158] Prime editing, twin prime editing, and prime editors are further described, e.g., in International Patent Application No. PCT/US2020/023721, filed March 19, 2020, which published as WO 2020/191239; International Patent Application No. PCT/US2021/031439, filed May 7, 2021, which published as WO 2021/226558; International Patent Application No. PCT/2021/052097, filed September 24, 2021, which published as WO 2022/067130; International Patent Application No. PCT/US2022/012054, filed January 11, 2022, which published as WO 2022/150790; International Patent Application No. PCT/US2022/078655, filed October 25, 2022, which published as WO 2023/076898; and International Patent Application No. PCT/US2022/074628, filed August 5, 2022, which published as WO 2023/015309; the contents of each of which is incorporated by reference herein. Prime editor [0159] The term “prime editor” refers to the polypeptide or polypeptide components involved in prime editing as described herein. In some embodiments, a prime editor comprises a fusion constructs comprising a napDNAbp (e.g., Cas9 nickase) and a reverse transcriptase. In some embodiments, a prime editor is capable of carrying out prime editing on a target nucleotide sequence in the presence of a PEgRNA (or “extended guide RNA”). In some embodiments, a prime editor comprises a napDNAbp (e.g., Cas9 nickase) and a reverse transcriptase provided in trans, i.e., the napDNAbp and the reverse transcriptase are not fused. The in trans napDNAbp and the reverse transcriptase may be tethered via a non-peptide linkage, e.g., a MS2 RNA-protein binding RNA sequence and a MS2 coat protein fused to either the napDNAbp or the reverse transcriptase, or may be unlinked to each other and simply recruited by the pegRNA. In some embodiments, a prime editor composition, system, or complex provided herein comprises a fusion protein or a fusion protein complexed with a PEgRNA, and/or further complexed with a second-strand nicking sgRNA. In some embodiments, the prime editor system may also refer to the complex comprising a fusion protein (reverse transcriptase fused to a napDNAbp), a PEgRNA, and a regular guide RNA capable of directing the second-site nicking step of the non-edited strand as described herein. Primer binding site [0160] The term “primer binding site” or “PBS” refers to the portion of a PEgRNA as a component of the extension arm (e.g., at the 3ʹ end of the extension arm), and is a single- stranded portion of the PEgRNA as a component of the extension arm that comprises a region of complementarity to a sequence on the non-target strand of a double stranded target DNA. In some embodiments, the primer binding site is complementary to a region upstream of a nick site in a non-target strand. In some embodiments, the primer binding site is complementary to a region immediately upstream of a nick site in the non-target strand. In some embodiments, the primer binding site is capable of binding to the primer sequence that is formed after nicking of the edit strand (the non-target strand) of the target DNA sequence by the prime editor. When the prime editor (e.g., by a Cas9 nickase component of a prime editor) nicks the edit strand of the target DNA sequence, a free 3′ end is formed in the edit strand, which serves as a primer sequence that anneals to the primer binding site on the PEgRNA to prime reverse transcription. In some embodiments, the PBS is complementary to or substantially complementary to and can anneal to a free 3′ end on the non-target strand of the double stranded target DNA at the nick site. In some embodiments, the PBS anneals to the free 3′ end on the non-target strand can initiate target-primed DNA synthesis. In some embodiments, a PBS comprises complementarity to nucleotides 1 to (n-3) of a spacer sequence. Protein, peptide, and polypeptide [0161] The terms “protein,” “peptide,” and “polypeptide” are used interchangeably herein and refer to a polymer of amino acid residues linked together by peptide (amide) bonds. The terms refer to a protein, peptide, or polypeptide of any size, structure, or function. Typically, a protein, peptide, or polypeptide will be at least three amino acids long. A protein, peptide, or polypeptide may refer to an individual protein or a collection of proteins. One or more of the amino acids in a protein, peptide, or polypeptide may be modified, for example, by the addition of a chemical entity such as a carbohydrate group, a hydroxyl group, a phosphate group, a farnesyl group, an isofarnesyl group, a fatty acid group, a linker for conjugation, functionalization, or other modification, etc. A protein, peptide, or polypeptide may also be a single molecule or may be a multi-molecular complex. A protein, peptide, or polypeptide may be just a fragment of a naturally occurring protein or peptide. A protein, peptide, or polypeptide may be naturally occurring, recombinant, or synthetic, or any combination thereof. Any of the proteins provided herein may be produced by any method known in the art. For example, the proteins provided herein may be produced via recombinant protein expression and purification, which is especially suited for fusion proteins comprising a peptide linker. Methods for recombinant protein expression and purification are well known, and include those described by Green and Sambrook, Molecular Cloning: A Laboratory Manual (4th ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (2012)), the contents of which are incorporated herein by reference. Protospacer [0162] As used herein, the term “protospacer” refers to the sequence (e.g., of ~20 bp) in DNA adjacent to the PAM (protospacer adjacent motif) sequence. The protospacer shares the same sequence as the spacer sequence of the guide RNA (except that a protospacer contains Thymine and the spacer sequence contains Uracil). The guide RNA anneals to the complement of the protospacer sequence on the target DNA (specifically, one strand thereof, i.e., the “target strand” versus the “non-target strand” of the target DNA sequence). In some embodiments, in order for a Cas nickase component of a prime editor to function, it also requires a specific protospacer adjacent motif (PAM) that varies depending on the Cas protein component itself, e.g., the type of Cas protein and the bacterial species from which it is derived. The most commonly used Cas9 nuclease, derived from S. pyogenes, recognizes a PAM sequence of NGG that is directly downstream of the protospacer sequence in the genomic DNA, on the non-target strand. Protospacer adjacent motif (PAM) [0163] As used herein, the term “protospacer adjacent motif” or “PAM” refers to a DNA sequence (e.g., an approximately 2-6 nucleotide sequence) that is an important targeting component of a Cas nuclease, e.g., a Cas9. For example, in some embodiments for a Cas9 nuclease, the PAM sequence is on either strand and is downstream in the 5ʹ to 3ʹ direction of the Cas9 cut site. The canonical PAM sequence (i.e., the PAM sequence that is associated with the Cas9 nuclease of Streptococcus pyogenes or SpCas9) is 5ʹ-NGG-3ʹ, wherein “N” is any nucleobase followed by two guanine (“G”) nucleobases. In some embodiments, SpCas9’s can also recognize additional non-canonical PAMs (e.g., NAG and NGA). [0164] Different PAM sequences can be associated with different Cas9 nucleases or equivalent proteins from different organisms. In addition, any given Cas9 nuclease, e.g., SpCas9, may be modified to alter the PAM specificity of the nuclease such that the nuclease recognizes an alternative PAM sequence. [0165] For example, with reference to the canonical SpCas9 amino acid sequence SEQ ID NO: 6, the PAM sequence can be modified by introducing one or more mutations, including (a) D1135V, R1335Q, and T1337R “the VQR variant,” which alters the PAM specificity to NGAN or NGNG, (b) D1135E, R1335Q, and T1337R “the EQR variant,” which alters the PAM specificity to NGAG, and (c) D1135V, G1218R, R1335E, and T1337R “the VRER variant,” which alters the PAM specificity to NGCG. In addition, the D1135E variant of canonical SpCas9 still recognizes NGG, but it is more selective compared to the wild type SpCas9 protein. [0166] It will also be appreciated that Cas9 enzymes from different bacterial species (i.e., Cas9 orthologs) can have varying PAM specificities. For example, Cas9 from Staphylococcus aureus (SaCas9) recognizes NGRRT or NGRRN. In addition, Cas9 from Neisseria meningitis (NmCas) recognizes NNNNGATT. In another example, Cas9 from Streptococcus thermophilis (StCas9) recognizes NNAGAAW. In still another example, Cas9 from Treponema denticola (TdCas) recognizes NAAAAC. These are examples and are not meant to be limiting. It will be further appreciated that non-SpCas9s bind a variety of PAM sequences, which makes them useful when no suitable SpCas9 PAM sequence is present at the desired target cut site. Furthermore, non-SpCas9s may have other characteristics that make them more useful than SpCas9. For example, Cas9 from Staphylococcus aureus (SaCas9) is about 1 kilobase smaller than SpCas9, so it can be packaged into adeno- associated virus (AAV). Further reference is made to Shah et al., “Protospacer recognition motifs: mixed identities and functional diversity,” RNA Biology, 10(5): 891-899 (which is incorporated herein by reference). Reverse transcriptase [0167] The term “reverse transcriptase” describes a class of polymerases characterized as RNA-dependent DNA polymerases. All known reverse transcriptases require a primer to synthesize a DNA transcript from an RNA template. Historically, reverse transcriptase has been used primarily to transcribe mRNA into cDNA, which can then be cloned into a vector for further manipulation. Avian myoblastosis virus (AMV) reverse transcriptase was the first widely used RNA-dependent DNA polymerase (Verma, Biochim. Biophys. Acta 473:1 (1977)). The enzyme has 5ʹ-3ʹ RNA-directed DNA polymerase activity, 5ʹ-3ʹ DNA-directed DNA polymerase activity, and RNase H activity. RNase H is a processive 5ʹ and 3ʹ ribonuclease specific for the RNA strand for RNA-DNA hybrids (Perbal, A Practical Guide to Molecular Cloning, New York: Wiley & Sons (1984)). Errors in transcription cannot be corrected by reverse transcriptase because known viral reverse transcriptases lack the 3ʹ-5ʹ exonuclease activity necessary for proofreading (Saunders and Saunders, Microbial Genetics Applied to Biotechnology, London: Croom Helm (1987)). A detailed study of the activity of AMV reverse transcriptase and its associated RNaseH activity has been presented by Berger et al., Biochemistry 22:2365-2372 (1983). Another reverse transcriptase that is used extensively in molecular biology is reverse transcriptase originating from Moloney murine leukemia virus (M-MLV or “MMLV”). See, e.g., Gerard, G. R., DNA 5:271-279 (1986) and Kotewicz, M. L., et al., Gene 35:249-258 (1985). M-MLV reverse transcriptase substantially lacking in RNase H activity has also been described. See, e.g., U.S. Pat. No.5,244,797. The invention contemplates the use of any such reverse transcriptases, or variants or mutants thereof. [0168] In addition, the invention contemplates the use of reverse transcriptases that are error- prone, i.e., that may be referred to as error-prone reverse transcriptases or reverse transcriptases that do not support high fidelity incorporation of nucleotides during polymerization. During synthesis of the single-strand DNA flap based on the RT template integrated with the guide RNA, the error-prone reverse transcriptase can introduce one or more nucleotides that are mismatched with the RT template sequence, thereby introducing changes to the nucleotide sequence through erroneous polymerization of the single-strand DNA flap. These errors introduced during synthesis of the single strand DNA flap then become integrated into the double strand molecule through hybridization to the corresponding endogenous target strand, removal of the endogenous displaced strand, ligation, and then through one more round of endogenous DNA repair and/or sequencing processes. In some embodiments, the prime editors used in the complexes and methods provided herein comprise MMLV RT. Reverse transcription [0169] As used herein, the term “reverse transcription” indicates the capability of an enzyme to synthesize a DNA strand (that is, complementary DNA or cDNA) using RNA as a template. In some embodiments, the reverse transcription can be “error-prone reverse transcription,” which refers to the properties of certain reverse transcriptase enzymes that are error-prone in their DNA polymerization activity. Second-strand nicking [0170] Prime editing typically involves the resolution of heteroduplex DNA (i.e., containing one edited and one non-edited strand) formed as a result of installation of one or more desired nucleotide changes in the edit strand but not (yet) in the non-edit strand of the target DNA sequence. Resolution of the heteroduplex DNA (the edited strand paired with the endogenous non-edited strand) and installation of nucleotide changes corresponding to the desired nucleotide edits in the non-edit strand permanently integrates the desired edits in the target DNA sequence. The approach of “second-strand nicking” can be used herein to help drive the resolution of heteroduplex DNA in favor of permanent integration of the edited strand into the DNA molecule. As used herein, the concept of “second-strand nicking” refers to the introduction of a second nick on the unedited strand. In some embodiments, a second nick is introduced at a location on the non-edit strand corresponding to a position downstream of the first nick (i.e., the initial nick site that provides the free 3′ end for use in priming of the reverse transcriptase on the extended portion of the guide RNA) on the edit strand. Thus, the first nick (introduced by the prime editor in combination with the PEgRNA) and the second nick (introduced by the prime editor and a second-strand nicking guide RNA) are on opposite strands. Said another way, the first nick is on the non-target strand (i.e., the strand that forms the single strand portion of the R-loop), and the second nick is on the target strand. Said still another way, the first nick (introduced by the prime editor in combination with the PEgRNA) is on the edit strand, and the second nick (introduced by the prime editor and second strand nicking guide RNA) is on the non-edit strand. The second nick can be introduced in the non- edit strand at a position that is opposite at least 1, 2, 3, 4, or 5 nucleotides downstream or upstream of the first nick of the edit strand, or that is opposite at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, or 150 or more nucleotides downstream or upstream of the first nick of the edit strand. The second nick can also be introduced in the non-edit strand at a position that is opposite at least 1, 2, 3, 4, or 5 nucleotides downstream or upstream of the edit site of the edit strand, or that is opposite at least 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, or 150 or more nucleotides downstream or upstream of the edit site of the edit strand. The second nick, in certain embodiments, can be introduced in the non-edit strand at a position that is opposite about 1-150 nucleotides downstream or upstream of the first nick of the edit strand, or that is opposite about 1-140, or about 1-130, or about 1-120, or about 1-110, or about 1- 100, or about 1-90, or about 1-80, or about 1-70, or about 1-60, or about 1-50, or about 1-40, or about 1-30, or about 1-20, or about 1-10 nucleotides downstream or upstream of the first nick of the edit strand. Without being bound by theory, the second nick induces the cell’s endogenous DNA repair and replication processes towards replacement of the non-edit strand, thereby permanently installing the edited sequence on both strands of the target DNA and resolving the heteroduplex that is formed as a result of PE. [0171] In certain embodiments, the second strand nicking guide RNA (also referred to herein as the nicking guide RNA, ngRNA, secondary nicking RNA, or second strand nicking sgRNA) may include a spacer sequence that preferentially and/or selectively only anneals to the edit strand after the desired nucleotide edit(s) are installed but not to the original strand of DNA the becomes replaced by the edited strand (i.e., the 5′ single-strand DNA flap that is displaced and ultimately removed during heteroduplex resolution). This can operate by designing the second strand nicking guide RNA to comprise a spacer sequence that anneals only to the edited region of the edited strand (and thus, wherein the spacer of the second strand nicking guide RNA comprises a nucleotide sequence that is the complement of the edited sequence or region thereof and includes the complement of the edit) and thus, can discriminate between the edited strand and the original strand of the displaced 5′ single-strand DNA flap that is immediately downstream of the cut site of the edited strand. This can be referred to as “temporal second-strand nicking” because the second strand nicking occurs only after prime editing has generated the new 3′ DNA flap containing the desired edit. This avoids the introduction of a double strand cut during prime editing which would otherwise result from the simultaneous or approximately simultaneous cutting of opposite strands by the PE complex comprising the PEgRNA and the PE complex comprising the second-strand cutting guide RNA. Spacer sequence [0172] As used herein, the term “spacer sequence” in connection with a guide RNA or a PEgRNA refers to the portion of the guide RNA or PEgRNA of about 20 nucleotides that contains a nucleotide sequence that shares the same sequence as the protospacer sequence in the target DNA sequence. The spacer sequence anneals to the complement of the protospacer sequence to form a ssRNA/ssDNA hybrid structure at the target site and a corresponding R loop ssDNA structure of the endogenous DNA strand. Split Cas9 [0173] A “split Cas9” refers to a fusion protein with a split site located within the Cas9 protein that is provided as an N-terminal portion (also referred to as an N-terminal half) and a C-terminal portion (also referred to as a C-terminal half) encoded by two separate nucleotide sequences. The polypeptides corresponding to the N-terminal portion and the C-terminal portion of the Cas9 protein may be combined (joined) to form a complete Cas9 protein. A Cas9 protein is known to consist of a bi-lobed structure linked by a disordered linker (e.g., as described in Nishimasu et al., Cell, Volume 156, Issue 5, pp.935–949, 2014, incorporated herein by reference). In some embodiments, the “split” occurs between the two lobes, generating two portions of a Cas9 protein, each containing one lobe. Split Intein [0174] Although inteins are most frequently found as a contiguous domain, some exist in a naturally split form. In this case, the two fragments are expressed as separate polypeptides and must associate before splicing takes place, so-called protein trans-splicing. [0175] An exemplary split intein is the Ssp DnaE intein, which comprises two subunits, namely, DnaE-N and DnaE-C. The two different subunits are encoded by separate genes, namely dnaE-n and dnaE-c, which encode the DnaE-N and DnaE-C subunits, respectively. DnaE is a naturally occurring split intein in Synechocytis sp. PCC6803 and is capable of directing trans-splicing of two separate proteins, each comprising a fusion with either DnaE- N or DnaE-C. [0176] Additional naturally occurring or engineered split-intein sequences are known in the art or can be made from whole-intein sequences described herein or those available in the art. Examples of split-intein sequences can be found in Stevens et al., “A promiscuous split intein with expanded protein engineering applications,” PNAS, 2017, Vol.114: 8538-8543; Iwai et al., “Highly efficient protein trans-splicing by a naturally split DnaE intein from Nostc punctiforme, FEBS Lett, 580: 1853-1858, each of which are incorporated herein by reference. Additional split intein sequences can be found, for example, in WO 2013/045632, WO 2014/055782, WO 2016/069774, and EP2877490, the contents each of which are incorporated herein by reference. [0177] In addition, protein splicing in trans has been described in vivo and in vitro (Shingledecker, et al., Gene 207:187 (1998), Southworth, et al., EMBO J.17:918 (1998); Mills, et al., Proc. Natl. Acad. Sci. USA, 95:3543-3548 (1998); Lew, et al., J. Biol. Chem., 273:15887-15890 (1998); Wu, et al., Biochim. Biophys. Acta 35732:1 (1998b), Yamazaki, et al., J. Am. Chem. Soc.120:5591 (1998), Evans, et al., J. Biol. Chem.275:9091 (2000); Otomo, et al., Biochemistry 38:16040-16044 (1999); Otomo, et al., J. Biolmol. NMR 14:105- 114 (1999); Scott, et al., Proc. Natl. Acad. Sci. USA 96:13638-13643 (1999)) and provides the opportunity to express a protein as to two inactive fragments that subsequently undergo ligation to form a functional product. Split Prime Editor [0178] A “split prime editor” refers to a prime editor that is provided as an N-terminal portion (also referred to as a N-terminal half) and a C-terminal portion (also referred to as a C-terminal half) encoded by two separate nucleic acids. The polypeptides corresponding to the N-terminal portion and the C-terminal portion of the prime editor may be combined to form a complete prime editor. In some embodiments, for a prime editor that comprises a dCas9 or nCas9, the “split” is located in the dCas9 or nCas9 domain, at positions as described herein in the split prime editor. Accordingly, in some embodiments, the N-terminal portion of the prime editor contains the N-terminal portion of the split prime editor, and the C-terminal portion of the prime editor contains the C-terminal portion of the split prime editor. Similarly, intein-N or intein-C may be fused to the N-terminal portion or the C-terminal portion of the prime editor, respectively, for the joining of the N- and C-terminal portions of the prime editor to form a complete prime editor. In some embodiments, the split site is within a Cas9 protein of a prime editor. In certain embodiments, the split site is between amino acid residues 844 and 845 of a Cas9 protein within a prime editor (e.g., a Cas9 protein of SEQ ID NO: 6). In certain embodiments, the split site is between amino acid residues 1024 and 1025 of a Cas9 protein within a prime editor (e.g., a Cas9 protein of SEQ ID NO: 6). Silent mutation [0179] As used herein, the term “silent mutation” refers to a mutation in a nucleic acid molecule that does not have an effect on the phenotype of the nucleic acid molecule, or the protein it produces if it encodes a protein. Silent mutations can be introduced into coding regions of a nucleic acid (i.e., segments of a gene that encode for a protein), or they can be introduced in non-coding regions of a nucleic acid. A silent mutation in a nucleic acid sequence, e.g., in a target DNA sequence or in a DNA synthesis template sequence to be installed in the target sequence, may be a nucleotide alteration that does not result in expression or function of the amino acid sequence encoded by the nucleic acid sequence, or other functional features of the target nucleic acid sequence. When silent mutations are present in a coding region, they may be synonymous mutations. Synonymous mutations refer to substitutions of one base for another in a gene such that the corresponding amino acid residue of the protein produced by the gene is not modified. This is due to the redundancy of the genetic code, allowing for multiple different codons to encode for the same amino acid in a particular organism. When a silent mutation is in a noncoding region or a junction of a coding region and a non-coding region (e.g., an intron/exon junction), it may be in a region that does not impact any biological properties of the nucleic acid molecule (e.g., splicing, gene regulation, RNA lifetime, etc.). In particular embodiments, a silent mutation may also be a “benign” mutation, for example, where a nucleotide substitution results in one or more alterations in the amino acid sequence encoded, but does not result in detrimental impact on the expression or function of the polypeptide. Silent mutations may be useful, for example, for increasing the length of contiguous changes in a desired nucleotide edit or the number of nucleotide edits made to a target nucleotide sequence using prime editing to evade correction of the edit by the MMR pathway. In certain embodiments, the number of silent mutations installed may be one, or two, or three, or four, or five, or six, or seven, or eight, or nine, or ten, or more. In certain other embodiments involving at least two silent mutations, the silent mutations may be installed within one, or two, or three, or four, or five, or six, or seven, or eight, or nine, or ten, or 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 nucleotides from the intended edit site. In some embodiments, silent mutations are installed in order to alter or optimize the secondary structure that a particular pegRNA will form in cell. In some embodiments, changing some bases of a pegRNA with silent mutations results in changes to the secondary structure of the pegRNA that can improve editing efficiency. Subject [0180] The term “subject,” as used herein, refers to an individual organism, for example, an individual mammal. In some embodiments, the subject is a human. In some embodiments, the subject is a non-human mammal. In some embodiments, the subject is a non-human primate. In some embodiments, the subject is a rodent. In some embodiments, the subject is a sheep, a goat, a cattle, a cat, or a dog. In some embodiments, the subject is a vertebrate, an amphibian, a reptile, a fish, an insect, a fly, or a nematode. In some embodiments, the subject is a research animal. In some embodiments, the subject is genetically engineered, e.g., a genetically engineered non-human subject. The subject may be of either sex and may be at any stage of development. In some embodiments, a subject has or is suspected of having a triplet repeat disorder. In some embodiments, a subject has or is suspected of having Huntington’s disease. In some embodiments, a subject has or is suspected of having Friedreich’s ataxia. Target site [0181] The term “target site” refers to a sequence within a nucleic acid molecule that is edited by a prime editor (PE) disclosed herein. The target site further refers to the sequence within a nucleic acid molecule to which a complex of the prime editor (PE) and gRNA binds. In some embodiments, a target site comprises a trinucleotide repeat sequence. In some embodiments, a target site comprises part of the HTT gene. In some embodiments, a target site comprises part of the FXN gene. Temporal second-strand nicking [0182] As used herein, the term “temporal second-strand nicking” refers to a variant of second strand nicking whereby the installation of the second-strand nick in the unedited strand occurs only after the desired edit is installed in the edited strand by the PE complexed with the PEgRNA. Without being bound by theory, the second-strand nick in the unedited strand induces the cell’s endogenous DNA repair and replication processes towards replacement of the unedited strand, thereby permanently installing the edited sequence on both strands and resolving the heteroduplex that is formed as a result of PE. In some embodiments, a prime editor system comprising a second strand nicking guide RNA designed with the temporal second strand nicking strategy, which can avoid concurrent nicks on both strands that could lead to double-stranded DNA breaks. The second-strand nicking guide RNA is designed for temporal control such that the second strand nick is not introduced until after the installation of the desired edit. This is achieved by designing a gRNA with a spacer sequence that matches only the edited strand, but not the original allele. Using this strategy, mismatches between the spacer of the second-strand nicking guide RNA and the unedited allele should disfavor second-strand nicking until after the editing event on the PAM strand takes place. In certain embodiments, the second strand nicking guide RNA may include a spacer sequence that preferentially and/or selectively only anneals to the edited strand (i.e., after PE synthesizes the edit), but not to the original strand of DNA that becomes replaced by the edited strand (i.e., the 5′ single-strand DNA flap that is displaced and ultimately removed during heteroduplex resolution). This can operate by designing the second strand nicking guide RNA to comprise a spacer sequence that anneals only to the edited region of the edited strand (and thus, wherein the spacer of the second strand nicking guide RNA comprises a nucleotide sequence that is the complement of the edited sequence or region thereof and includes the complement of the edit) and thus, can discriminate between the edited strand and the original strand of the displaced 5′ single-strand DNA flap that is immediately downstream of the cut site of the edited strand. This avoids the introduction of a double strand cut during prime editing that would otherwise result from the simultaneous or approximately simultaneous cutting of opposite strands by the PE complex comprising the PEgRNA and the PE complex comprising the second-strand cutting guide RNA. [0183] In some embodiments, a prime editor system (e.g., a PE3b system or a PE5b system) comprises components that improve temporal second-strand nicking by including PE-based installation of one or more silent mutations around an edit site (e.g., introducing one or more silent mutations located upstream and/or downstream of a non-silent, desired nucleotide edit or adjacent to the non-silent nucleotide edit). In some embodiments, a prime editor system comprises a pegRNA, the DNA synthesis template of which comprises one or more non- silent nucleotide edits and further comprises one or more silent mutations compared to the endogenous sequence of the target strand (and accordingly encodes a single stranded DNA comprising the one or more non-silent nucleotide edits and the silent mutations compared to the endogenous sequence of the edit strand). In some embodiments, the one or more silent mutations are adjacent to or immediately adjacent to a non-silent nucleotide edit in the DNA synthesis template. For example, in some embodiments, the one or more silent mutations are within 5 nucleotides upstream of the non-silent nucleotide edit. In some embodiments, the one or more silent mutations are within 5 nucleotides downstream of the non-silent nucleotide edit. In some embodiments, the one or more silent mutations are immediately adjacent to the non-silent nucleotide edit, such that the DNA synthesis template contains at least 3 contiguous nucleotides that are not complementary to the corresponding endogenous sequence downstream of the nick site on the edit strand of the target DNA sequence. Without wishing to be bound by a particular theory, such silent mutations may improve prime editing efficiency by evading the cellular mismatch repair pathway by avoiding reversion of the PE- installed edit on the edit strand back to the pre-edited sequence. Such silent mutations may also improve prime editing efficiency by altering or optimizing the secondary structure of the pegRNA. In some embodiments, a prime editor system comprising a pegRNA with the one or more silent mutations in addition to the non-silent mutation in the DNA synthesis template can result in improved editing efficiency of the target DNA, as compared to a control prime editor system comprising a pegRNA that only contains the non-silent mutation and not the one or more silent mutations in the DNA synthesis template. In some embodiments, combining PE3b designs with the silent mutations can further improve prime editing efficiency and/or reduce indel frequency resulted from editing. This can operate by designing a second strand nicking guide RNA that comprises a spacer sequence that anneals only to the edited strand, which includes not only a desired edit, but also the one or more installed silent mutations that are installed at proximal continuous or non-continuous positions near the desired edit. The single-strand nicking guide RNA comprises a spacer sequence that is complementary to the PE-edited strand can discriminate between the edited strand and the original strand, which corresponds to the displaced 5′ single-strand DNA flap that is immediately downstream of the first nick site of the edited strand. This improved strategy of temporal second-strand nicking avoids the introduction of a double strand cut during prime editing that would otherwise result from the simultaneous or approximately simultaneous cutting of opposite strands by the PE complex comprising the PEgRNA and the PE complex comprising the second-strand cutting guide RNA. [0184] The silent mutations may be installed in coding regions of the target nucleic acid molecule or in non-coding regions of the target nucleic acid molecule. When the silent mutations are installed in a coding region, they introduce into the nucleic acid molecule one or more alternate codons encoding the same amino acid as the unedited nucleic acid molecule. Alternatively, when the silent mutations are installed in a non-coding region, the silent mutations may be present in a region of the nucleic acid molecule that does not influence splicing, gene regulation, RNA lifetime, or other biological properties of the target site on the nucleic acid molecule. Treatment [0185] The terms “treatment,” “treat,” and “treating” refer to a clinical intervention aimed to reverse, alleviate, delay the onset of, or inhibit the progress of a disease or disorder, or one or more symptoms thereof, as described herein. As used herein, the terms “treatment,” “treat,” and “treating” refer to a clinical intervention aimed to reverse, alleviate, delay the onset of, or inhibit the progress of a disease or disorder, or one or more symptoms thereof, as described herein. In some embodiments, treatment may be administered after one or more symptoms have developed and/or after a disease has been diagnosed. In other embodiments, treatment may be administered in the absence of symptoms, e.g., to prevent or delay onset of a symptom or inhibit onset or progression of a disease. For example, treatment may be administered to a susceptible individual prior to the onset of symptoms (e.g., in light of a history of symptoms and/or in light of genetic or other susceptibility factors). Treatment may also be continued after symptoms have resolved, for example, to prevent or delay their recurrence. In some embodiments, a treatment is administered to treat a triplet repeat disorder. In certain embodiments, a treatment is administered to treat Huntington’s disease. In certain embodiments, a treatment is administered to treat Friedreich’s ataxia. Triplet Repeat Disorder [0186] The terms “triplet repeat disorder,” “trinucleotide repeat disorder,” “triplet repeat expansion disorder,” or “trinucleotide repeat expansion disorder refer to a number of human disease, including Huntington’s Disease, Fragile X syndrome, and Friedreich’s ataxia, associated with expansion of particular trinucleotide repeat sequences. The most common trinucleotide repeat contains CAG triplets, though GAA triplets (Friedreich’s ataxia) and CGG triplets (Fragile X syndrome) also occur. Inheriting a predisposition to expansion, or acquiring an already expanded parental allele, increases the likelihood of acquiring the disease. [0187] Triplet repeat disorders include, for example, those associated with the following genes and numbers of pathogenic polyglutamine trinucleotide repeats: [0188] Triplet repeat disorders include, for example, those associated with the following genes and numbers of pathogenic non-polyglutamine trinucleotide repeats:
[0189] Trinucleotide repeat expansion disorders are complex, progressive disorders that involve developmental neurobiology and often affect cognition as well as sensory-motor functions. The disorders show genetic anticipation (i.e., increased severity with each generation). The DNA expansions or contractions usually happen meiotically (i.e., during the time of gametogenesis, or early in embryonic development), and often have sex-bias meaning that some genes expand only when inherited through the female, and others only through the male. In humans, trinucleotide repeat expansion disorders can cause gene silencing at either the transcriptional or translational level, which essentially knocks out gene function. Alternatively, trinucleotide repeat expansion disorders can cause altered proteins generated with large repetitive amino acid sequences that either abrogate or change protein function, often in a dominant-negative manner (e.g., poly-glutamine diseases). [0190] Without wishing to be bound by theory, triplet expansion is caused by slippage during DNA replication or during DNA repair synthesis. Because the tandem repeats have identical sequence to one another, base pairing between two DNA strands can take place at multiple points along the sequence. This may lead to the formation of “loop out” structures during DNA replication or DNA repair synthesis. This may also lead to repeated copying of the repeated sequence, expanding the number of repeats. Additional mechanisms involving hybrid RNA:DNA intermediates have been proposed. [0191] Depending on the particular trinucleotide expansion disorder, the defect-inducing triplet expansions may occur in “trinucleotide repeat expansion proteins.” Trinucleotide repeat expansion proteins are a diverse set of proteins associated with susceptibility for developing a trinucleotide repeat expansion disorder, the presence of a trinucleotide repeat expansion disorder, the severity of a trinucleotide repeat expansion disorder, or any combination thereof. Trinucleotide repeat expansion disorders are divided into two categories determined by the type of repeat. The most common repeat is the triplet CAG, which, when present in the coding region of a gene, codes for the amino acid glutamine (Q). Therefore, these disorders are referred to as the polyglutamine (polyQ) disorders and comprise the following diseases: Huntington’s Disease (HD); Spinobulbar Muscular Atrophy (SBMA); Spinocerebellar Ataxias (SCA types 1, 2, 3, 6, 7, and 17); and Dentatorubro-Pallidoluysian Atrophy (DRPLA). The remaining trinucleotide repeat expansion disorders either do not involve the CAG triplet or the CAG triplet is not in the coding region of the gene. These disorders, therefore, are referred to as the non-polyglutamine disorders. The non- polyglutamine disorders comprise Fragile X Syndrome (FRAXA); Fragile XE Mental Retardation (FRAXE); Friedreich’s Ataxia (FRDA); Myotonic Dystrophy (DM); and Spinocerebellar Ataxias (SCA types 8 and 12). [0192] The production rate or circulating concentration of a protein associated with a trinucleotide repeat expansion disorder may be elevated or depressed in a population having a trinucleotide repeat expansion disorder relative to a population lacking the trinucleotide repeat expansion disorder. Non-limiting examples of proteins associated with trinucleotide repeat expansion disorders include AR (androgen receptor), FMR1 (fragile X mental retardation 1), HTT (huntingtin), DMPK (dystrophia myotonica-protein kinase), FXN (frataxin), ATXN2 (ataxin 2), ATN1 (atrophin 1), FEN1 (flap structure-specific endonuclease 1), TNRC6A (trinucleotide repeat containing 6A), PABPN1 (poly(A) binding protein, nuclear 1), JPH3 (junctophilin 3), MED15 (mediator complex subunit 15), ATXN1 (ataxin 1), ATXN3 (ataxin 3), TBP (TATA box binding protein), CACNA1A (calcium channel, voltage-dependent, P/Q type, alpha 1A subunit), ATXN80S (ATXN8 opposite strand (non- protein coding)), PPP2R2B (protein phosphatase 2, regulatory subunit B, beta), ATXN7 (ataxin 7), TNRC6B (trinucleotide repeat containing 6B), TNRC6C (trinucleotide repeat containing 6C), CELF3 (CUGBP, Elav-like family member 3), MAB21L1 (mab-21-like 1 (C. elegans)), MSH2 (mutS homolog 2, colon cancer, nonpolyposis type 1 (E. coli)), TMEM185A (transmembrane protein 185A), SIX5 (SIX homeobox 5), CNPY3 (canopy 3 homolog (zebrafish)), FRAXE (fragile site, folic acid type, rare, fra(X)(q28) E), GNB2 (guanine nucleotide binding protein (G protein), beta polypeptide 2), RPL14 (ribosomal protein L14), ATXN8 (ataxin 8), INSR (insulin receptor), TTR (transthyretin), EP400 (E1A binding protein p400), GIGYF2 (GRB10 interacting GYF protein 2), OGG1 (8-oxoguanine DNA glycosylase), STC1 (stanniocalcin 1), CNDP1 (carnosine dipeptidase 1 (metallopeptidase M20 family)), C10orf2 (chromosome 10 open reading frame 2), MAML3 mastermind-like 3 (Drosophila), DKC1 (dyskeratosis congenita 1, dyskerin), PAXIP1 (PAX interacting (with transcription-activation domain) protein 1), CASK (calcium/calmodulin- dependent serine protein kinase (MAGUK family)), MAPT (microtubule-associated protein tau), SP1 (Sp1 transcription factor), POLG (polymerase (DNA directed), gamma), AFF2 (AF4/FMR2 family, member 2), THBS1 (thrombospondin 1), TP53 (tumor protein p53), ESR1 (estrogen receptor 1), CGGBP1 (CGG triplet repeat binding protein 1), ABT1 (activator of basal transcription 1), KLK3 (kallikrein-related peptidase 3), PRNP (prion protein), JUN (jun oncogene), KCNN3 (potassium intermediate/small conductance calcium- activated channel, subfamily N, member 3), BAX (BCL2-associated X protein), FRAXA (fragile site, folic acid type, rare, fra(X)(q27.3) A (macroorchidism, mental retardation)), KBTBD10 (kelch repeat and BTB (POZ) domain containing 10), MBNL1 (muscleblind-like (Drosophila)), RAD51 (RAD51 homolog (RecA homolog, E. coli) (S. cerevisiae)), NCOA3 (nuclear receptor coactivator 3), ERDA1 (expanded repeat domain, CAG/CTG 1), TSC1 (tuberous sclerosis 1), COMP (cartilage oligomeric matrix protein), GCLC (glutamate- cysteine ligase, catalytic subunit), RRAD (Ras-related associated with diabetes), MSH3 (mutS homolog 3 (E. coli)), DRD2 (dopamine receptor D2), CD44 (CD44 molecule (Indian blood group)), CTCF (CCCTC-binding factor (zinc finger protein)), CCND1 (cyclin D1), CLSPN (claspin homolog (Xenopus laevis)), MEF2A (myocyte enhancer factor 2A), PTPRU (protein tyrosine phosphatase, receptor type, U), GAPDH (glyceraldehyde-3-phosphate dehydrogenase), TRIM22 (tripartite motif-containing 22), WT1 (Wilms tumor 1), AHR (aryl hydrocarbon receptor), GPX1 (glutathione peroxidase 1), TPMT (thiopurine S- methyltransferase), NDP (Norrie disease (pseudoglioma)), ARX (aristaless related homeobox), MUS81 (MUS81 endonuclease homolog (S. cerevisiae)), TYR (tyrosinase (oculocutaneous albinism IA)), EGR1 (early growth response 1), UNG (uracil-DNA glycosylase), NUMBL (numb homolog (Drosophila)-like), FABP2 (fatty acid binding protein 2, intestinal), EN2 (engrailed homeobox 2), CRYGC (crystallin, gamma C), SRP14 (signal recognition particle 14 kDa (homologous Alu RNA binding protein)), CRYGB (crystallin, gamma B), PDCD1 (programmed cell death 1), HOXA1 (homeobox A1), ATXN2L (ataxin 2-like), PMS2 (PMS2 postmeiotic segregation increased 2 (S. cerevisiae)), GLA (galactosidase, alpha), CBL (Cas-Br-M (murine) ecotropic retroviral transforming sequence), FTH1 (ferritin, heavy polypeptide 1), IL12RB2 (interleukin 12 receptor, beta 2), OTX2 (orthodenticle homeobox 2), HOXA5 (homeobox A5), POLG2 (polymerase (DNA directed), gamma 2, accessory subunit), DLX2 (distal-less homeobox 2), SIRPA (signal-regulatory protein alpha), OTX1 (orthodenticle homeobox 1), AHRR (aryl-hydrocarbon receptor repressor), MANF (mesencephalic astrocyte-derived neurotrophic factor), TMEM158 (transmembrane protein 158 (gene/pseudogene)), and ENSG00000078687. In certain embodiments, the protein associated with a trinucleotide repeat disorder is the HTT protein. In certain embodiments, the protein associated with a trinucleotide repeat disorder is the FXN protein. Variant [0193] As used herein, the term “variant” should be taken to mean the exhibition of qualities that have a pattern that deviates from what occurs in nature, e.g., a variant Cas9 is a Cas9 comprising one or more changes in amino acid residues as compared to a wild type Cas9 amino acid sequence. The term “variant” encompasses homologous proteins having at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identity with a reference sequence and having the same or substantially the same functional activity or activities as the reference sequence. The term also encompasses mutants, truncations, or domains of a reference sequence that display the same or substantially the same functional activity or activities as the reference sequence. Vector [0194] The term “vector,” as used herein, refers to a nucleic acid that can be modified to encode a gene of interest and that is able to enter a host cell, mutate, and replicate within the host cell, and then transfer a replicated form of the vector into another host cell. Exemplary suitable vectors include viral vectors, such as retroviral vectors or bacteriophages and filamentous phage, and conjugative plasmids. Additional suitable vectors will be apparent to those of skill in the art based on the instant disclosure. Wild type [0195] As used herein the term “wild type” is a term of the art understood by skilled persons and means the typical form of an organism, strain, gene, or characteristic as it occurs in nature as distinguished from mutant or variant forms. DETAILED DESCRIPTION [0196] The present disclosure provides compositions, systems, and methods useful in the treatment of trinucleotide repeat disorders, including Huntington’s disease and Friedreich’s ataxia. The present disclosure also provides pegRNAs designed to target the HTT or FXN genes. Complexes comprising a prime editor and any of the pegRNAs disclosed herein are also provided by the present disclosure. The present disclosure further provides polynucleotides, vectors, AAVs, cells, compositions, and kits. Methods of treating Huntington’s disease and Friedreich’s ataxia, as well as uses of the compositions, pegRNAs, and systems described herein, are also provided herein. PEgRNAs [0197] The prime editing complexes and methods described herein contemplate the use of any suitable PEgRNAs. In various aspects, the present disclosure provides pegRNAs targeting the HTT and FXN genes as described herein. Such pegRNAs may be useful, for example, in prime editing methods targeting pathogenic trinucleotide repeats in HTT and FXN for the treatment of Huntington’s disease and Friedreich’s ataxia. In some embodiments, a trinucleotide repeat is replaced with a trinucleotide repeat of a different length using the pegRNAs provided herein. In some embodiments, a trinucleotide repeat is contracted using the pegRNAs provided herein. In some embodiments, a trinucleotide repeat is deleted using the pegRNAs provided herein. In certain embodiments, CAG repeats in the HTT gene (e.g., at the 5′- end of the HTT gene) are replaced, contracted, or deleted using the pegRNAs provided herein. In certain embodiments, GAA repeats in the FXN gene (e.g., in intron 1 of the FXN gene) are replaced, contracted, or deleted using the pegRNAs provided herein. [0198] In one aspect, the present disclosure provides pegRNAs targeting the HTT gene comprising a spacer sequence comprising the nucleic acid sequence: GACCCTGGAAAAGCTGATGA (SEQ ID NO: 381); GCTGCTGCTGGAAGGACTTG (SEQ ID NO: 382); GCTGCTGCTGCTGCTGCTGGA (SEQ ID NO: 383); GCTGCTGCTGCTGCTGCTGC (SEQ ID NO: 384); GGCGGCGGCGGCGGCGGTGG (SEQ ID NO: 385); TGAGGAAGCTGAGGAGGCGG (SEQ ID NO: 386); or GGCGGCTGAGGAAGCTGAGG (SEQ ID NO: 387), or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 381-387. In certain embodiments, the pegRNA comprises the spacer sequence GACCCTGGAAAAGCTGATGA (SEQ ID NO: 381), or a spacer sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 381. [0199] In some embodiments, the pegRNA comprises a primer binding site (PBS) of about 8 to about 16 nucleotides in length. In some embodiments, the pegRNA comprises a PBS of about 8, about 9, about 10, about 11, about 12, about 13, about 14, about 15, or about 16 nucleotides in length. In certain embodiments, the pegRNA comprises a PBS of about 10 nucleotides in length. [0200] In some embodiments, the pegRNA comprises a reverse transcription template comprising a sequence of about 15 to about 35 nucleotides, wherein the sequence encodes a sequence that comprises one or more nucleotide edits compared to a sequence directly upstream of a CAG repeat in exon 1 of the HTT gene. In some embodiments, the one or more nucleotide edits comprises a PAM mutation that alters a PAM sequence. In certain embodiments, the PAM sequence is NGG, wherein N is any one of nucleotides A, G, C, or T. In certain embodiments, the pegRNA comprises a reverse transcription template comprising a sequence of about 26 nucleotides, wherein the sequence encodes a sequence that comprises one or more nucleotide edits compared to a sequence directly upstream of a CAG repeat in exon 1 of the HTT gene. In some embodiments, the pegRNA comprises a reverse transcription template comprising an edit template encoding or comprising one or repeats of the trinucleotide sequence CAG (for example, 4-35 or 4-10 repeats). In certain embodiments, the pegRNA comprises a reverse transcription template comprising an edit template encoding or comprising the nucleotide sequence CAGCAGCAGCAG (SEQ ID NO: 874), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 874. In certain embodiments, the pegRNA comprises a reverse transcription template comprising an edit template comprising the nucleotide sequence CAGCAGCAGCAGCAGCAGCAGCAGCAG (SEQ ID NO: 875), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 875. In some embodiments, the edit template further comprises one or more nucleotides of the trinucleotide sequence CAA. In certain embodiments, the pegRNA comprises a reverse transcription template comprising an edit template comprising the nucleotide sequence CAGCAGCAGCAGCAACAACAA (SEQ ID NO: 876), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 876. In some embodiments, the pegRNA comprises a reverse transcription template comprising a homology arm of about 16 to about 50 nucleotides in length, wherein the homology arm is complementary to a sequence directly downstream of a CAG repeat in exon 1 of a wildtype HTT gene in the coding strand. In certain embodiments, the pegRNA comprises a reverse transcription template comprising a homology arm of about 31 nucleotides in length. In certain embodiments, the pegRNA comprises a reverse transcription template comprising a homology arm of about 40 nucleotides in length. [0201] In some embodiments, the pegRNA for editing HTT is an engineered pegRNA (epegRNA). In some embodiments, the pegRNA comprises at its 3' end a structural motif that improves stability of the pegRNA. In some embodiments, the epegRNA comprises an evopreq1 motif. In certain embodiments, the evopreq1 motif comprises the sequence of SEQ ID NO: 442, or a sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 442, a sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence of SEQ ID NO: 442, or a sequence having 1, 2, 3, 4, or 5 mutations relative to SEQ ID NO: 442. In some embodiments, the pegRNA comprises a UA flip in the pegRNA scaffold sequence. In certain embodiments, the pegRNA comprises the sequence of any one of SEQ ID NOs: 454-815, or a sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence of any one of SEQ ID NOs: 454-815. [0202] In another aspect, the present disclosure provides pegRNAs targeting the FXN gene comprising a spacer sequence comprising the nucleic acid sequence: GCAAGACTAACCTGGCCAACA (SEQ ID NO: 388); GTCCGGAGTTCAAGACTAACC (SEQ ID NO: 389); GAAGGTGGATCACCTGAGGTC (SEQ ID NO: 390); GTCTGGAGTAGCTGGGATTAC (SEQ ID NO: 391); or GCAGGCGCGCGACACCACGCC (SEQ ID NO: 392), or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 388-392. In certain embodiments, the pegRNA comprises the spacer sequence GCAAGACTAACCTGGCCAACA (SEQ ID NO: 388), or a nucleic acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 388. [0203] In some embodiments, the pegRNA comprises a primer binding site (PBS) of about 8 to about 14 nucleotides in length. In some embodiments, the pegRNA comprises a PBS of about 8, about 9, about 10, about 11, about 12, about 13, or about 14 nucleotides in length. In certain embodiments, the pegRNA comprises a PBS of about 10 nucleotides in length. [0204] In some embodiments, the pegRNA comprises a reverse transcription template comprising a homology arm of about 8 to about 50 nucleotides in length, wherein the homology arm is complementary to a sequence directly downstream of a GAA repeat in a wildtype FXN gene. In certain embodiments, the pegRNA comprises a reverse transcription template comprising a homology arm of about 40 nucleotides in length. [0205] In some embodiments, the pegRNA for editing FXN is an engineered pegRNA (epegRNA). In some embodiments, the pegRNA comprises at its 3' end a structural motif that improves stability of the pegRNA. In some embodiments, the epegRNA comprises an evopreq1 motif. In certain embodiments, the evopreq1 motif comprises the sequence of SEQ ID NO: 442, a sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence of SEQ ID NO: 442, or a sequence having 1, 2, 3, 4, or 5 mutations relative to SEQ ID NO: 442. In some embodiments, the pegRNA comprises a UA flip in the pegRNA scaffold sequence. [0206] In certain embodiments, the pegRNA for editing FXN comprises the sequence of any one of SEQ ID NOs: 816-867, or a sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to the sequence of any one of SEQ ID NOs: 816-867. [0207] In some aspects, the present disclosure also provides gRNAs for nicking the non- PAM-containing strand of the target nucleotide sequence. In some embodiments, the spacer of the nicking gRNA targets the HTT gene and comprises the nucleotide sequence: GGCGGCTGAGGAAGCTGAGG (SEQ ID NO: 387); GGCGGCGGCGGCGGCGGTGG (SEQ ID NO: 385); GCTGCTGCTGCTGCTGCTGC (SEQ ID NO: 384); GAAGGACTTGAGGGACTCGA (SEQ ID NO: 393); GTGAGGAAGCTGAGGAGGCGG (SEQ ID NO: 394); GCTGTTGCTGCTGCTGCTGC (SEQ ID NO: 395); GCTGCTGCTGCTGCTGCTGGA (SEQ ID NO: 383); GCTGCTGCTGGAAGGACTTG (SEQ ID NO: 382); GGCCTTCATCAGCTTTTCC (SEQ ID NO: 396); GGCTTTCATCAGCTTTTCC (SEQ ID NO: 397); GGGACTCGAAGGCCTTCAT (SEQ ID NO: 398); GGAAGGACTTGAGGGACTCG (SEQ ID NO: 399); GCTGCTGCTGGAAGGACTT (SEQ ID NO: 400); GCTGCTGCTGCTGCTGCTGG (SEQ ID NO: 401); GCTGGAAGGACTTGAGGGACT (SEQ ID NO: 402); GCTGCTGCTGCTGGAAGGACT (SEQ ID NO: 403); CTGCTGCTGCTGCTGCTGGAA (SEQ ID NO: 404); GCTGCTGCTGCTGCTGCTGCT (SEQ ID NO: 405); GCTGCTGGAAGGACTTGAG (SEQ ID NO: 406); GCTGCTGGAAGGACTTCAG (SEQ ID NO: 407); GCTGCTGAAAGGACTTCAG (SEQ ID NO: 408); or GCTGTTGGAAGGACTTCAG (SEQ ID NO: 409), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any of these. In certain embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence GCTGCTGCTGCTGCTGCTGC (SEQ ID NO: 384), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 384. In certain embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence GCTGCTGGAAGGACTTGAG (SEQ ID NO: 406), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 406. [0208] In some embodiments, the spacer of the nicking gRNA targets the FXN gene and comprises the nucleotide sequence: GTCCCAAAGTGCTGAGATTAT (SEQ ID NO: 410); GTGTATTTTTTAGTAGATACT (SEQ ID NO: 411); GATTCTCCTGCCGCAGCCTC (SEQ ID NO: 412); or GCGACACCACGCCCGGCTAAC (SEQ ID NO: 413), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any of these. In certain embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence GTGTATTTTTTAGTAGATACT (SEQ ID NO: 411), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 411. In certain embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence GCGACACCACGCCCGGCTAAC (SEQ ID NO: 413), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 413. [0209] As described further below, various gRNA core sequences may be utilized in the pegRNAs provided herein. In some embodiments, a gRNA core that binds to SpCas9 is used in the pegRNAs provided herein. In some embodiments, a gRNA core that binds to SaCas9 is used in the pegRNAs provided herein. In some embodiments, a gRNA core that binds to any Cas9 protein provided herein, or any Cas9 protein known in the art, is used in the pegRNAs provided herein. PEgRNA architecture [0210] In some embodiments, an extended guide RNA, or pegRNA, used in the prime editing complexes and methods disclosed herein includes a spacer sequence (e.g., a ~20 nt spacer sequence) and a gRNA core region, which binds with the napDNAbp. In some embodiments, the pegRNA includes an extended RNA segment, i.e., an extension arm, at the 5′ end, i.e., a 5′ extension. In some embodiments, the 5′ extension includes a reverse transcription template sequence, a primer binding site, and an optional 5-20 nucleotide linker sequence. The RT primer binding site hybridizes to the free 3ʹ end that is formed after a nick is formed in the non-target strand of the R-loop, thereby priming reverse transcriptase for DNA polymerization in the 5′-3′ direction. [0211] In another embodiment, an extended guide RNA (i.e., a pegRNA) used in the prime editing complexes and methods provided herein includes a spacer sequence (e.g., a ~20 nt spacer sequence) and a gRNA core, which binds with the napDNAbp. In some embodiments, the pegRNA includes an extended RNA segment, i.e., an extension arm, at the 3′ end, i.e., a 3′ extension. In some embodiments, the 3′ extension includes a reverse transcription template sequence, and a reverse transcription primer binding site. The RT primer binding site hybridizes to the free 3′ end that is formed after a nick is formed in the non-target strand of the R-loop, thereby priming reverse transcriptase for DNA polymerization in the 5′-3′ direction. [0212] In another embodiment, an extended guide RNA (i.e., a pegRNA) used in the prime editing complexes and methods provided herein includes a spacer sequence (e.g., a ~20 nt spacer sequence) and a gRNA core, which binds with the napDNAbp. In some embodiments, the pegRNA includes an extended RNA segment, i.e., an extension arm, at an intermolecular position within the gRNA core, i.e., an intramolecular extension. In some embodiments, the intramolecular extension includes a reverse transcription template sequence, and a reverse transcription primer binding site. The RT primer binding site hybridizes to the free 3′ end that is formed after a nick is formed in the non-target strand of the R-loop, thereby priming reverse transcriptase for DNA polymerization in the 5′-3′ direction. [0213] In one embodiment, the position of the intermolecular RNA extension is not in the spacer sequence of the guide RNA. In another embodiment, the position of the intermolecular RNA extension is in the gRNA core. In still another embodiment, the position of the intermolecular RNA extension is anywhere within the guide RNA molecule except within the spacer sequence, or at a position which disrupts the spacer sequence. In one embodiment, the intermolecular RNA extension is inserted downstream from the 3′ end of the spacer sequence. In another embodiment, the intermolecular RNA extension is inserted at least 1 nucleotide, at least 2 nucleotides, at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 21 nucleotides, at least 22 nucleotides, at least 23 nucleotides, at least 24 nucleotides, or at least 25 nucleotides downstream of the 3′ end of the spacer sequence. [0214] In other embodiments, the intermolecular RNA extension is inserted into the gRNA core, which refers to the portion of a traditional guide RNA corresponding or comprising the tracrRNA, which binds and/or interacts with the napDNAbp, e.g., a Cas9 protein or equivalent thereof (i.e., a different napDNAbp). Preferably the insertion of the intermolecular RNA extension does not disrupt or minimally disrupts the interaction between the tracrRNA portion and the napDNAbp. [0215] The length of the RNA extension (which includes at least the RT template and primer binding site) can be any useful length. In various embodiments, the RNA extension is at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 21 nucleotides, at least 22 nucleotides, at least 23 nucleotides, at least 24 nucleotides, at least 25 nucleotides, at least 30 nucleotides, at least 40 nucleotides, at least 50 nucleotides, at least 60 nucleotides, at least 70 nucleotides, at least 80 nucleotides, at least 90 nucleotides, at least 100 nucleotides, at least 200 nucleotides, at least 300 nucleotides, at least 400 nucleotides, or at least 500 nucleotides in length. [0216] The RT template sequence can also be any suitable length. For example, the RT template sequence can be at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 30 nucleotides, at least 40 nucleotides, at least 50 nucleotides, at least 60 nucleotides, at least 70 nucleotides, at least 80 nucleotides, at least 90 nucleotides, at least 100 nucleotides, at least 200 nucleotides, at least 300 nucleotides, at least 400 nucleotides, or at least 500 nucleotides in length. [0217] In still other embodiments, the reverse transcription primer binding site sequence is at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 30 nucleotides, at least 40 nucleotides, at least 50 nucleotides, at least 60 nucleotides, at least 70 nucleotides, at least 80 nucleotides, at least 90 nucleotides, at least 100 nucleotides, at least 200 nucleotides, at least 300 nucleotides, at least 400 nucleotides, or at least 500 nucleotides in length. [0218] In other embodiments, the optional linker or spacer sequence is at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 30 nucleotides, at least 40 nucleotides, at least 50 nucleotides, at least 60 nucleotides, at least 70 nucleotides, at least 80 nucleotides, at least 90 nucleotides, at least 100 nucleotides, at least 200 nucleotides, at least 300 nucleotides, at least 400 nucleotides, or at least 500 nucleotides in length. [0219] The RT template sequence, in certain embodiments, encodes a single-stranded DNA molecule which is homologous to the non-target strand (and thus, complementary to the corresponding site of the target strand) but includes one or more nucleotide changes. The one or more nucleotide changes may include one or more single-base nucleotide changes, one or more deletions, and/or one or more insertions. [0220] The synthesized single-stranded DNA product of the RT template sequence is homologous to the non-target strand except that it contains one or more nucleotide changes. The single-stranded DNA product of the RT template sequence hybridizes in equilibrium with the complementary target strand sequence, thereby displacing the homologous endogenous target strand sequence. The displaced endogenous strand may be referred to in some embodiments as a 5′ endogenous DNA flap species. This 5′ endogenous DNA flap species can be removed by a 5′ flap endonuclease (e.g., FEN1) and the single-stranded DNA product, now hybridized to the endogenous target strand, may be ligated, thereby creating a mismatch between the endogenous sequence and the newly synthesized strand. The mismatch may be resolved by the cell’s innate DNA repair and/or replication processes. [0221] In various embodiments, the nucleotide sequence of the RT template sequence corresponds to the nucleotide sequence of the non-target strand that becomes displaced as the 5′ flap species and that overlaps with the site to be edited. [0222] In various embodiments of the extended guide RNAs, the reverse transcription template sequence may encode a single-strand DNA flap that is complementary to an endogenous DNA sequence adjacent to a nick site, wherein the single-strand DNA flap comprises a desired nucleotide change. The single-stranded DNA flap may displace an endogenous single-strand DNA at the nick site. The displaced endogenous single-strand DNA at the nick site can have a 5′ end and form an endogenous flap, which can be excised by the cell. In various embodiments, excision of the 5′ end endogenous flap can help drive product formation since removing the 5′ end endogenous flap encourages hybridization of the single- strand 3′ DNA flap to the corresponding complementary DNA strand, and the incorporation or assimilation of the desired nucleotide change carried by the single-strand 3′ DNA flap into the target DNA. [0223] The terms “cleavage site,” “nick site,” and “cut site,” as used interchangeably herein in the context of prime editing, refer to a specific position in between two nucleotides or two base pairs in the double-stranded target DNA sequence. In some embodiments, the position of a nick site is determined relative to the position of a specific PAM sequence. In some embodiments, the nick site is the particular position where a nick will occur when the double stranded target DNA is contacted with a napDNAbp, e.g., a nickase such as a Cas nickase, that recognizes a specific PAM sequence. For each PEgRNA described herein, a nick site (e.g., the “first nick site” when referred to in the context of PE3, PE5 and similar approaches), is characteristic of the particular napDNAbp to which the gRNA core of the PEgRNA associates with, and is characteristic of the particular PAM required for recognition and function of the napDNAbp. For example, for a PEgRNA that comprises a gRNA core that associates with a SpCas9, the nick site in the phosphodiester bond between bases three (“-3” position relative to the position 1 of the PAM sequence) and four (“-4” position relative to position 1 of the PAM sequence). [0224] In some embodiments, a nick site is in a target strand of the double-stranded target DNA sequence. In some embodiments, a nick site is in a non-target strand of the double- stranded target DNA sequence. In some embodiments, the nick site is in a protospacer sequence. In some embodiments, the nick site is adjacent to a protospacer sequence. In some embodiments, a nick site is downstream of a region, e.g., on a non-target strand, that is complementary to a primer binding site of a PEgRNA. In some embodiments, a nick site is downstream of a region, e.g., on a non-target strand, that binds to a primer binding site of a PEgRNA. In some embodiments, a nick site is immediately downstream of a region, e.g., on a non-target strand, that is complementary to a primer binding site of a PEgRNA. In some embodiments, the nick site is upstream of a specific PAM sequence on the non-target strand of the double stranded target DNA, wherein the PAM sequence is specific for recognition by a napDNAbp that associates with the gRNA core of a PEgRNA. In some embodiments, the nick site is downstream of a specific PAM sequence on the non-target strand of the double stranded target DNA, wherein the PAM sequence is specific for recognition by a napDNAbp that associates with the gRNA core of a PEgRNA. In some embodiments, the nick site is 3 nucleotides upstream of the PAM sequence, and the PAM sequence is recognized by a Streptococcus pyogenes Cas9 nickase, a P. lavamentivorans Cas9 nickase, a C. diphtheriae Cas9 nickase, a N. cinerea Cas9, a S. aureus Cas9, or a N. lari Cas9 nickase. In some embodiments, the nick site is 3 nucleotides upstream of the PAM sequence, and the PAM sequence is recognized by a Cas9 nickase, wherein the Cas9 nickase comprises a nuclease active HNH domain and a nuclease inactive RuvC domain. In some embodiments, the nick site is 2 base pairs upstream of the PAM sequence, and the PAM sequence is recognized by a S. thermophilus Cas9 nickase. [0225] In various embodiments of the extended guide RNAs, the cellular repair of the single- strand DNA flap results in installation of the desired nucleotide change, thereby forming a desired product. [0226] In still other embodiments, the desired nucleotide change is installed in an editing window that is between about -5 to +5 of the nick site, or between about -10 to +10 of the nick site, or between about -20 to +20 of the nick site, or between about -30 to +30 of the nick site, or between about -40 to +40 of the nick site, or between about -50 to +50 of the nick site, or between about -60 to +60 of the nick site, or between about -70 to +70 of the nick site, or between about -80 to +80 of the nick site, or between about -90 to +90 of the nick site, or between about -100 to +100 of the nick site, or between about -200 to +200 of the nick site. [0227] In other embodiments, the desired nucleotide change is installed in an editing window that is between about +1 to +2 from the nick site, or about +1 to +3, +1 to +4, +1 to +5, +1 to +6, +1 to +7, +1 to +8, +1 to +9, +1 to +10, +1 to +11, +1 to +12, +1 to +13, +1 to +14, +1 to +15, +1 to +16, +1 to +17, +1 to +18, +1 to +19, +1 to +20, +1 to +21, +1 to +22, +1 to +23, +1 to +24, +1 to +25, +1 to +26, +1 to +27, +1 to +28, +1 to +29, +1 to +30, +1 to +31, +1 to +32, +1 to +33, +1 to +34, +1 to +35, +1 to +36, +1 to +37, +1 to +38, +1 to +39, +1 to +40, +1 to +41, +1 to +42, +1 to +43, +1 to +44, +1 to +45, +1 to +46, +1 to +47, +1 to +48, +1 to +49, +1 to +50, +1 to +51, +1 to +52, +1 to +53, +1 to +54, +1 to +55, +1 to +56, +1 to +57, +1 to +58, +1 to +59, +1 to +60, +1 to +61, +1 to +62, +1 to +63, +1 to +64, +1 to +65, +1 to +66, +1 to +67, +1 to +68, +1 to +69, +1 to +70, +1 to +71, +1 to +72, +1 to +73, +1 to +74, +1 to +75, +1 to +76, +1 to +77, +1 to +78, +1 to +79, +1 to +80, +1 to +81, +1 to +82, +1 to +83, +1 to +84, +1 to +85, +1 to +86, +1 to +87, +1 to +88, +1 to +89, +1 to +90, +1 to +90, +1 to +91, +1 to +92, +1 to +93, +1 to +94, +1 to +95, +1 to +96, +1 to +97, +1 to +98, +1 to +99, +1 to +100, +1 to +101, +1 to +102, +1 to +103, +1 to +104, +1 to +105, +1 to +106, +1 to +107, +1 to +108, +1 to +109, +1 to +110, +1 to +111, +1 to +112, +1 to +113, +1 to +114, +1 to +115, +1 to +116, +1 to +117, +1 to +118, +1 to +119, +1 to +120, +1 to +121, +1 to +122, +1 to +123, +1 to +124, or +1 to +125 from the nick site. [0228] In still other embodiments, the desired nucleotide change is installed in an editing window that is between about +1 to +2 from the nick site, or about +1 to +5, +1 to +10, +1 to +15, +1 to +20, +1 to +25, +1 to +30, +1 to +35, +1 to +40, +1 to +45, +1 to +50, +1 to +55, +1 to +100, +1 to +105, +1 to +110, +1 to +115, +1 to +120, +1 to +125, +1 to +130, +1 to +135, +1 to +140, +1 to +145, +1 to +150, +1 to +155, +1 to +160, +1 to +165, +1 to +170, +1 to +175, +1 to +180, +1 to +185, +1 to +190, +1 to +195, or +1 to +200, from the nick site. [0229] In various aspects, the extended guide RNAs are modified versions of an extended guide RNA. pegRNAs (i.e., extended guide RNAs) and ngRNAs may be expressed from an encoding nucleic acid, or synthesized chemically. Methods are well known in the art for obtaining or otherwise synthesizing guide RNAs, and for determining the appropriate sequence of the pegRNA, including the protospacer sequence, which interacts and hybridizes with the target strand of a genomic target site of interest. [0230] In various embodiments, the particular design aspects of a pegRNA sequence and ngRNA sequence will depend upon the nucleotide sequence of a genomic target site of interest (i.e., the desired site to be edited) and the type of napDNAbp (e.g., Cas9 protein) present in the prime editing systems utilized in the methods and compositions described herein, among other factors, such as PAM sequence locations, percent G/C content in the target sequence, the degree of microhomology regions, secondary structures, etc. [0231] In general, a spacer sequence (i.e., a guide sequence) of a pegRNA or ngRNA can be any polynucleotide sequence having sufficient complementarity with a target polynucleotide sequence to hybridize with the target sequence and direct sequence-specific binding of a napDNAbp (e.g., a Cas9, Cas9 homolog, or Cas9 variant) to the target sequence. In some embodiments, the degree of complementarity between a spacer and its corresponding target sequence, when optimally aligned using a suitable alignment algorithm, is about or more than about 50%, 60%, 75%, 80%, 85%, 90%, 95%, 97.5%, 99%, or more. Optimal alignment may be determined with the use of any suitable algorithm for aligning sequences, non-limiting examples of which include the Smith-Waterman algorithm, the Needleman-Wunsch algorithm, algorithms based on the Burrows-Wheeler Transform (e.g., the Burrows Wheeler Aligner), ClustalW, Clustal X, BLAT, Novoalign (Novocraft Technologies, ELAND (Illumina, San Diego, Calif.), SOAP (available at soap.genomics.org.cn), and Maq (available at maq.sourceforge.net). In some embodiments, a spacer is about or more than about 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 75, or more nucleotides in length. [0232] In some embodiments, a spacer is less than about 75, 50, 45, 40, 35, 30, 25, 20, 15, 12, or fewer nucleotides in length. The ability of a spacer to direct sequence-specific binding of a prime editor to a target sequence may be assessed by any suitable assay. For example, the components of a prime editor, including the spacer to be tested, may be provided to a host cell having the corresponding target sequence, such as by transfection with vectors encoding the components of a prime editor disclosed herein, followed by an assessment of preferential cleavage within the target sequence, such as by Surveyor assay as described herein. Similarly, cleavage of a target polynucleotide sequence may be evaluated in a test tube by providing the target sequence, components of a prime editor, including the spacer to be tested and a control spacer different from the test spacer, and comparing binding or rate of cleavage at the target sequence between the test and control spacer reactions. Other assays are possible, and will occur to those skilled in the art. [0233] A spacer may be selected to target any target sequence. In some embodiments, the target sequence is a sequence within a genome of a cell. Exemplary target sequences include those that are unique in the target genome. For example, for the S. pyogenes Cas9, a unique target sequence in a genome may include a Cas9 target site of the form MMMMMMMMNNNNNNNNNNNNXGG where NNNNNNNNNNNNXGG (N is A, G, T, or C; and X can be anything). A unique target sequence in a genome may include an S. pyogenes Cas9 target site of the form MMMMMMMMMNNNNNNNNNNNXGG where NNNNNNNNNNNXGG (N is A, G, T, or C; and X can be anything). For the S. thermophilus CRISPR1Cas9, a unique target sequence in a genome may include a Cas9 target site of the form MMMMMMMMNNNNNNNNNNNNXXAGAAW where NNNNNNNNNNNNXXAGAAW (N is A, G, T, or C; X can be anything; and W is A or T). A unique target sequence in a genome may include an S. thermophilus CRISPR 1 Cas9 target site of the form MMMMMMMMMNNNNNNNNNNNXXAGAAW where NNNNNNNNNNNXXAGAAW (N is A, G, T, or C; X can be anything; and W is A or T). For the S. pyogenes Cas9, a unique target sequence in a genome may include a Cas9 target site of the form MMMMMMMMNNNNNNNNNNNNXGGXG where NNNNNNNNNNNNXGGXG (N is A, G, T, or C; and X can be anything). A unique target sequence in a genome may include an S. pyogenes Cas9 target site of the form MMMMMMMMMNNNNNNNNNNNXGGXG where NNNNNNNNNNNXGGXG (N is A, G, T, or C; and X can be anything). In each of these sequences, “M” may be A, G, T, or C, and need not be considered in identifying a sequence as unique. In some embodiments, a target sequence is in the HTT gene. In certain embodiments, CAG repeats in the HTT gene (e.g., at the 5′- end of the HTT gene) are targeted. In some embodiments, a target sequence is in the FXN gene. In certain embodiments, GAA repeats in the FXN gene (e.g., in intron 1 of the FXN gene) are targeted. [0234] In some embodiments, a spacer is selected to reduce the degree of secondary structure within the spacer. Secondary structure may be determined by any suitable polynucleotide folding algorithm. Some programs are based on calculating the minimal Gibbs free energy. An example of one such algorithm is mFold, as described by Zuker and Stiegler (Nucleic Acids Res.9 (1981), 133-148). Another example folding algorithm is the online webserver RNAfold, developed at Institute for Theoretical Chemistry at the University of Vienna, using the centroid structure prediction algorithm (see, e.g., A. R. Gruber et al., 2008, Cell 106(1): 23-24; and PA Carr and GM Church, 2009, Nature Biotechnology 27(12): 1151-62). Further algorithms may be found in U.S. Application Ser. No.61/836,080, incorporated herein by reference. In some embodiments, silent mutations are introduced in a spacer in order to alter its secondary structure and increase the efficiency of prime editing. [0235] In some embodiments, the scaffold or gRNA core portion of a pegRNA comprises sequences corresponding to the tracr sequence and tracr mate sequence of a traditional guide RNA. In general, a tracr mate sequence includes any sequence that has sufficient complementarity with a tracr sequence to promote one or more of: (1) excision of a spacer flanked by tracr mate sequences in a cell containing the corresponding tracr sequence; and (2) formation of a complex at a target sequence, wherein the complex comprises the tracr mate sequence hybridized to the tracr sequence. In general, degree of complementarity is with reference to the optimal alignment of the tracr mate sequence and tracr sequence, along the length of the shorter of the two sequences. Optimal alignment may be determined by any suitable alignment algorithm, and may further account for secondary structures, such as self- complementarity within either the tracr sequence or tracr mate sequence. In some embodiments, the degree of complementarity between the tracr sequence and tracr mate sequence along the length of the shorter of the two when optimally aligned is about or more than about 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97.5%, 99%, or higher. In some embodiments, the tracr sequence is about or more than about 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, or more nucleotides in length. In some embodiments, the tracr sequence and tracr mate sequence are contained within a single transcript, such that hybridization between the two produces a transcript having a secondary structure, such as a hairpin. Preferred loop forming sequences for use in hairpin structures are four nucleotides in length, and most preferably have the sequence GAAA. However, longer or shorter loop sequences may be used, as may alternative sequences. The sequences preferably include a nucleotide triplet (for example, AAA), and an additional nucleotide (for example C or G). Examples of loop forming sequences include CAAA and AAAG. In an embodiment of the invention, the transcript or transcribed polynucleotide sequence has at least two or more hairpins. In preferred embodiments, the transcript has two, three, four or five hairpins. In a further embodiment of the invention, the transcript has at most five hairpins. In some embodiments, the single transcript further includes a transcription termination sequence; preferably this is a polyT sequence, for example six T nucleotides. Further non-limiting examples of single polynucleotides comprising a spacer, a tracr mate sequence, and a tracr sequence are as follows (listed 5′ to 3′), where “N” represents a base of a spacer, the first block of lower case letters represent the tracr mate sequence, and the second block of lower case letters represent the tracr sequence, and the final poly-T sequence represents the transcription terminator: (1)NNNNNNNNGTTTTTGTACTCTCAAGATTTAGAAATAAATCTTGCAGAAGCTACA AAGATAAGGCTTCATGCCGAAATCAACACCCTGTCATTTTATGGCAGGGTGTTTTC GTTATTTAATTTTTT (SEQ ID NO: 137); (2)NNNNNNNNNNNNNNNNNNGTTTTTGTACTCTCAGAAATGCAGAAGCTACAAA GATAAGGCTTCATGCCGAAATCAACACCCTGTCATTTTATGGCAGGGTGTTTTCGT TATTTAATTTTTT (SEQ ID NO: 138); (3)NNNNNNNNNNNNNNNNNNNNGTTTTTGTACTCTCAGAAATGCAGAAGCTACA AAGATAAGGCTTCATGCCGAAATCAACACCCTGTCATTTTATGGCAGGGTGTTTTT T (SEQ ID NO: 139); (4)NNNNNNNNNNNNNNNNNNNNGTTTTAGAGCTAGAAATAGCAAGTTAAAATAA GGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTTTTT (SEQ ID NO: 140); (5)NNNNNNNNNNNNNNNNNNNNGTTTTAGAGCTAGAAATAGCAAGTTAAAATAA GGCTAGTCCGTTATCAACTTGAAAAAGTGTTTTTTT (SEQ ID NO: 141); and (6) NNNNNNNNNNNNNNNNNNNNGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGG CTAGTCCGTTATCATTTTTTTT (SEQ ID NO: 142). [0236] In some embodiments, sequences (1) to (3) are used in combination with Cas9 from S. thermophilus CRISPR1. In some embodiments, sequences (4) to (6) are used in combination with Cas9 from S. pyogenes. In some embodiments, the tracr sequence is a separate transcript from a transcript comprising the tracr mate sequence. [0237] It will be apparent to those of skill in the art that in order to target any of the fusion proteins comprising a Cas9 domain and a single-stranded DNA binding protein, as disclosed herein, to a target site, e.g., a site comprising a point mutation to be edited, it is typically necessary to co-express the fusion protein together with a guide RNA, e.g., an sgRNA. As explained in more detail elsewhere herein, a guide RNA typically comprises a tracrRNA framework allowing for Cas9 binding, and a spacer, which confers sequence specificity to the Cas9:nucleic acid editing enzyme/domain fusion protein. [0238] In some embodiments, a pegRNA comprises a structure 5ʹ-[spacer]- GUUUUAGAGCUAGAAAUAGCAAGUUAAAAUAAAGGCUAGUCCGUUAUCAACU UGAAAAAGUGGCACCGAGUCGGUGCUUUUU(SEQ ID NO: 143)-extension arm-3ʹ, wherein the spacer comprises a sequence that is complementary to the target sequence. The spacer, also referred to herein as the spacer sequence, is typically 20 nucleotides long. The sequences of suitable guide RNAs for targeting Cas9:nucleic acid editing enzyme/domain fusion proteins to specific genomic target sites will be apparent to those of skill in the art based on the instant disclosure. Such suitable guide RNA sequences typically comprise spacers that are complementary to a nucleic acid sequence within 50 nucleotides upstream or downstream of the target nucleotide to be edited. Some exemplary guide RNA sequences suitable for targeting any of the provided fusion proteins to specific target sequences are provided herein. Additional spacers are well known in the art and can be used with the prime editors utilized in the methods and compositions described herein. [0239] In some embodiments, a PEgRNA comprises three main component elements ordered in the 5ʹ to 3ʹ direction, namely: a spacer, a gRNA core, and an extension arm at the 3ʹ end. In some embodiments, the extension arm may further be divided into the following structural elements in the 5ʹ to 3ʹ direction, namely: an edit template , a homology arm, and a primer binding site. In some embodiments, the extension arm may further be divided into the following structural elements in the 5ʹ to 3ʹ direction, namely: a homology arm, an edit template, and a primer binding site. In some embodiments, the extension arm may further be divided into the following structural elements in the 5ʹ to 3ʹ direction, namely: a DNA synthesis template (e.g., a RT template), and a primer binding site. In addition, the PEgRNA may comprise an optional 3ʹ end modifier region and an optional 5ʹ end modifier region . Still further, the PEgRNA may comprise a transcriptional termination signal at the 3ʹ end of the PEgRNA. These structural elements are further defined herein. The depiction of the structure of the PEgRNA is not meant to be limiting and embraces variations in the arrangement of the elements. For example, the optional sequence modifiers and could be positioned within or between any of the other regions shown, and not limited to being located at the 3ʹ and 5ʹ ends. PEgRNA modifications [0240] The PEgRNAs may also include additional design modifications that may alter the properties and/or characteristics of PEgRNAs, thereby improving the efficacy of prime editing. In various embodiments, these modifications may belong to one or more of a number of different categories, including but not limited to: (1) designs to enable efficient expression of functional PEgRNAs from non-polymerase III (pol III) promoters, which would enable the expression of longer PEgRNAs without burdensome sequence requirements; (2) modifications to the core, Cas9-binding PEgRNA scaffold, which could improve efficacy; (3) modifications to the PEgRNA to improve RT processivity, allowing the insertion of longer sequences at targeted genomic loci; and (4) addition of RNA motifs to the 5ʹ or 3ʹ termini of the PEgRNA that improve PEgRNA stability, enhance RT processivity, prevent misfolding of the PEgRNA, or recruit additional factors important for genome editing. [0241] In one embodiment, PEgRNA could be designed with polIII promoters to improve the expression of longer-length PEgRNA with larger extension arms. sgRNAs are typically expressed from the U6 snRNA promoter. This promoter recruits pol III to express the associated RNA and is useful for expression of short RNAs that are retained within the nucleus. However, pol III is not highly processive and is unable to express RNAs longer than a few hundred nucleotides in length at the levels required for efficient genome editing. Additionally, pol III can stall or terminate at stretches of U’s, potentially limiting the sequence diversity that could be inserted using a PEgRNA. Other promoters that recruit polymerase II (such as pCMV) or polymerase I (such as the U1 snRNA promoter) have been examined for their ability to express longer sgRNAs. However, these promoters are typically partially transcribed, which would result in extra sequence 5ʹ of the spacer in the expressed PEgRNA, which has been shown to result in markedly reduced Cas9:sgRNA activity in a site-dependent manner. Additionally, while pol III-transcribed PEgRNAs can simply terminate in a run of 6-7 U’s, PEgRNAs transcribed from pol II or pol I would require a different termination signal. Such signals often also result in polyadenylation, which would result in undesired transport of the PEgRNA from the nucleus. Similarly, RNAs expressed from pol II promoters such as pCMV are typically 5ʹ-capped, also resulting in their nuclear export. [0242] In some aspects, the complexes and methods of present disclosure utilize next- generation modified pegRNAs (also referred to herein as “engineered pegRNAs” or “epegRNAs”) with improved properties, including but not limited to, increased stability and cellular lifespan, and improved binding affinity for a napDNAbp. These modified pegRNAs result in improved genome editing as demonstrated by increased editing efficiency at a wide variety of genomic sites. By appending certain nucleic acid structural motifs to the terminus of the extension arm of a pegRNA, including but not limited to, a prequeosin 1 -1 riboswitch aptamer (“evopreQ 1 -1”) or a variant thereof, a pseudoknot from the MMLV viral genome (“evopreQ1-1”) or a variant thereof, a modified tRNA used by MMLV RT as a primer for reverse transcription, or a variant thereof, and a G quadruplex, or a variant thereof, a consistent increase in editing activity may be achieved. [0243] In one embodiment, the modified pegRNAs include a nucleic acid moiety at the 3′ end of the pegRNA. Optionally, the 3′ end of the pegRNA is fused to the nucleic acid moiety through a nucleotide linker. In various embodiments, it will be appreciated that a wide variety of nucleotide sequences will work reasonably well for each genomic target site. Linker length can also be variable. In some cases, linkers ranging in length from 3-18 nucleotides will work. In other cases, the linker may be at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 21 nucleotides, at least 22 nucleotides, at least 23 nucleotides, at least 24 nucleotides, at least 25 nucleotides, at least 26 nucleotides, at least 27 nucleotides, at least 28 nucleotides, at least 29 nucleotides, or at least 30 nucleotides. [0244] In general, the nucleic acid moieties that may be used to modify a pegRNA, for example, by attaching it to the 3′ end of a pegRNA, may include any nucleic acid moiety, including, for instance, a nucleic acid molecule comprising or which forms a double-helix moiety, toeloop moiety, hairpin moiety, stem-loop moiety, pseudoknot moiety, aptamer moiety, G quadraplex moiety, tRNA moiety, or a ribozyme moiety. The nucleic acid moiety may be characterized as forming a secondary nucleic acid structure, a tertiary nucleic acid structure, or a quadruple nucleic acid structure. In other words, the nucleic acid moiety may form any two-dimensional or three-dimensional structure known to be formed by such structures. The nucleic acid moiety may be DNA or RNA. [0245] Without restriction, the following are specific examples of nucleotide motifs that may be appended to the terminus of the extension arm of a pegRNA. Thus, in the case of a 3′ extension arm, the nucleotide motif would be coupled, attached, or otherwise linked to the 3′ of the pegRNA, optionally via a linker. In the case of a 5′ extension arm, the nucleotide motif would be coupled, attached, or otherwise linked to the 5′ end of the pegRNA, optionally via a linker.
[0246] As indicated above, these motifs may be coupled, attached, or otherwise joined to a canonical pegRNA via a linker. Exemplary linkers include, but are not limited to: [0247] In some embodiments, a linker will be designed and/or selected based on the genomic site being targeted by prime editing and the modified pegRNA. [0248] In various embodiments, it will be appreciated that a wide variety of nucleotide sequences will work reasonably well for each genomic target site. Linker length is also likely to be variable. In some cases, linkers ranging in length from 3-18 nucleotides will work. In other cases, the linker may be at least 3 nucleotides, at least 4 nucleotides, at least 5 nucleotides, at least 6 nucleotides, at least 7 nucleotides, at least 8 nucleotides, at least 9 nucleotides, at least 10 nucleotides, at least 11 nucleotides, at least 12 nucleotides, at least 13 nucleotides, at least 14 nucleotides, at least 15 nucleotides, at least 16 nucleotides, at least 17 nucleotides, at least 18 nucleotides, at least 19 nucleotides, at least 20 nucleotides, at least 21 nucleotides, at least 22 nucleotides, at least 23 nucleotides, at least 24 nucleotides, at least 25 nucleotides, at least 26 nucleotides, at least 27 nucleotides, at least 28 nucleotides, at least 29 nucleotides, or at least 30 nucleotides. [0249] In one embodiment, the linker is 8 nucleotides in length. [0250] The present disclosure also contemplates variants of the above nucleotide motifs and linkers that have at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.9% sequence identity with any of the above motif and linker sequences. [0251] The pegRNAs may also include additional design improvements that may modify the properties and/or characteristics of pegRNAs thereby improving the efficacy of prime editing. In various embodiments, these improvements may belong to one or more of a number of different categories, including, but not limited to: (1) designs to enable efficient expression of functional pegRNAs from non-polymerase III (pol III) promoters, which would enable the expression of longer pegRNAs without burdensome sequence requirements; (2) improvements to the core, Cas9-binding pegRNA scaffold, which could improve efficacy; (3) modifications to the pegRNA to improve RT processivity, allowing the insertion of longer sequences at targeted genomic loci; and (4) addition of RNA motifs to the 5ʹ or 3ʹ termini of the pegRNA that improve pegRNA stability, enhance RT processivity, prevent misfolding of the pegRNA, or recruit additional factors important for genome editing. [0252] In one embodiment, pegRNA could be designed with polIII promoters to improve the expression of longer-length pegRNA with larger extension arms. sgRNAs are typically expressed from the U6 snRNA promoter. This promoter recruits pol III to express the associated RNA and is useful for expression of short RNAs that are retained within the nucleus. However, pol III is not highly processive and is unable to express RNAs longer than a few hundred nucleotides in length at the levels required for efficient genome editing. Additionally, pol III can stall or terminate at stretches of U’s, potentially limiting the sequence diversity that could be inserted using a pegRNA. Other promoters that recruit polymerase II (such as pCMV) or polymerase I (such as the U1 snRNA promoter) have been examined for their ability to express longer sgRNAs. However, these promoters are typically partially transcribed, which would result in extra sequence 5ʹ of the spacer in the expressed pegRNA, which has been shown to result in markedly reduced Cas9:sgRNA activity in a site- dependent manner. Additionally, while pol III-transcribed pegRNAs can simply terminate in a run of 6-7 U’s, pegRNAs transcribed from pol II or pol I would require a different termination signal. Often such signals also result in polyadenylation, which would result in undesired transport of the pegRNA from the nucleus. Similarly, RNAs expressed from pol II promoters such as pCMV are typically 5ʹ-capped, also resulting in their nuclear export. [0253] Exemplary U6 promoters include, but are not limited to: [0254] U6 promoter: [0255] U6v9 promoter: [0256] U6v7 promoter: [0257] U6v4 promoter: [0258] One of ordinary skill in the art will appreciate that these promoter sequences can be trimmed at the 5′ and still function at the same or nearly the same level. For example, any of the U6 promoters could be trimmed at the 5′ end by removing up to 1, 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides from the 5′ end, i.e., approximately 30% of the promoter length. In other embodiments, up to 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, or 30% of the length of the promoter may be trimmed from the 5′ end. [0259] One of ordinary skill in the art will also appreciate that other promoters could be used to improve the expression of longer length pegRNAs with larger extension arms. For example, in different cell types, other promoters may be preferred and result in greater expression of the longer length pegRNAs. [0260] Previously, Rinn and coworkers screened a variety of expression platforms for the production of long-noncoding RNA- (lncRNA) tagged sgRNAs. These platforms include RNAs expressed from pCMV and that terminate in the ENE element from the MALAT1 ncRNA from humans, the PAN ENE element from KSHV, or the 3ʹ box from U1 snRNA. Notably, the MALAT1 ncRNA and PAN ENEs form triple helices protecting the polyA-tail. These constructs could also enhance RNA stability. It is contemplated that these expression systems will also allow the expression of longer PEgRNAs. [0261] In addition, a series of methods have been designed for the cleavage of the portion of the pol II promoter that would be transcribed as part of the PEgRNA, adding either a self- cleaving ribozyme such as the hammerhead, pistol, hatchet, hairpin, VS, twister, or twister sister ribozymes, or other self-cleaving elements to process the transcribed guide, or a hairpin that is recognized by Csy4 and also leads to processing of the guide. Also, it is hypothesized that incorporation of multiple ENE motifs could lead to improved PEgRNA expression and stability, as previously demonstrated for the KSHV PAN RNA and element. It is also anticipated that circularizing the PEgRNA in the form of a circular intronic RNA (ciRNA) could also lead to enhanced RNA expression and stability, as well as nuclear localization. [0262] In various embodiments, the PEgRNA may include various above elements, as exemplified by the following sequences. [0263] Non-limiting example 1 - PEgRNA expression platform consisting of pCMV, Csy4 hairpin, the PEgRNA, and MALAT1 ENE ( Q ) [0264] Non-limiting example 2 - PEgRNA expression platform consisting of pCMV, Csy4 hairpin, the PEgRNA, and PAN ENE [0265] Non-limiting example 3 - PEgRNA expression platform consisting of pCMV, Csy4 hairpin, the PEgRNA, and 3xPAN ENE [0266] Non-limiting example 4 - PEgRNA expression platform consisting of pCMV, Csy4 hairpin, the PEgRNA, and 3ʹ box [0267] Non-limiting example 5 - PEgRNA expression platform consisting of pU1, Csy4 hairpin, the PEgRNA, and 3ʹ box [0268] In various other embodiments, the PEgRNA may be improved by introducing modifications to the scaffold or core sequences. The core, Cas9-binding PEgRNA scaffold can likely be improved to enhance PE activity. Several such approaches have already been demonstrated. For instance, the first pairing element of the scaffold (P1) contains a GTTTT- AAAAC (SEQ ID NO: 146) pairing element. Such runs of Ts have been shown to result in pol III pausing and premature termination of the RNA transcript. Rational mutation of one of the T-A pairs to a G-C pair in this portion of P1 has been shown to enhance sgRNA activity, suggesting this approach would also be feasible for PEgRNAs. Additionally, increasing the length of P1 has also been shown to enhance sgRNA folding and lead to improved activity, suggesting it as another avenue for the modification of PEgRNA activity. Example modifications to the core can include: [0269] PEgRNA containing a 6 nt extension to P1 C GCC C GCG GC C G C G (S Q NO: 5 ) [0270] PEgRNA containing a T-A to G-C mutation within P1 [0271] In various other embodiments, the PEgRNA may be modified at the edit template region. As the size of the insertion templated by the PEgRNA increases, it is more likely to be degraded by endonucleases, undergo spontaneous hydrolysis, or fold into secondary structures unable to be reverse-transcribed by the RT, or that disrupt folding of the PEgRNA scaffold and subsequent Cas9-RT binding. Accordingly, it is likely that modification to the template of the PEgRNA might be necessary to affect large insertions, such as the insertion of whole genes. Some strategies to do so include the incorporation of modified nucleotides within a synthetic or semi-synthetic PEgRNA that render the RNA more resistant to degradation or hydrolysis or less likely to adopt inhibitory secondary structures. Such modifications could include 8-aza-7-deazaguanosine, which would reduce RNA secondary structure in G-rich sequences; locked-nucleic acids (LNA) that reduce degradation and enhance certain kinds of RNA secondary structure; 2′-O-methyl, 2′-fluoro, or 2′-O- methoxyethoxy modifications that enhance RNA stability. Such modifications could also be included elsewhere in the PEgRNA to enhance stability and activity. Alternatively, or additionally, the template of the PEgRNA could be designed such that it is also more likely to adopt simple secondary structures that are able to allow processing by the RT. Such simple structures would act as a thermodynamic sink, making it less likely that more complicated structures that would prevent reverse transcription would occur. Finally, one could also split the template into two separate PEgRNAs. In such a design, a prime editors, e.g., a nCas9-RT fusion protein, would be used to initiate transcription, and also to recruit a separate template RNA to the targeted site via an RNA-binding protein fused to Cas9 or an RNA recognition element on the PEgRNA itself such as the MS2 aptamer. The RT could either directly bind to this separate template RNA, or initiate reverse transcription on the original PEgRNA before swapping to the second template. Such an approach could allow long insertions by both preventing misfolding of the PEgRNA upon addition of the long template, and also by not requiring dissociation of Cas9 from the genome for long insertions to occur, which could possibly inhibit PE-based long insertions. [0272] In still other embodiments, the PEgRNA may be modified by introducing additional RNA motifs at the 5ʹ and 3ʹ termini of the PEgRNAs, or even at positions therein between (e.g., in the gRNA core region, or the spacer). Several such motifs - such as the PAN ENE from KSHV and the ENE from MALAT1 were discussed above as possible means to terminate expression of longer PEgRNAs from non-pol III promoters. These elements form RNA triple helices that engulf the polyA tail, resulting in their being retained within the nucleus. However, by forming complex structures at the 3ʹ terminus of the PEgRNA that occlude the terminal nucleotide, these structures would also likely help prevent exonuclease- mediated degradation of PEgRNAs. [0273] Other structural elements inserted at the 3ʹ terminus could also enhance RNA stability, albeit without allowing for termination from non-pol III promoters. Such motifs could include hairpins or RNA quadruplexes that would occlude the 3ʹ terminus, or self- cleaving ribozymes such as HDV that would result in the formation of a 2ʹ-3ʹ-cyclic phosphate at the 3ʹ terminus, and also potentially render the PEgRNA less likely to be degraded by exonucleases. Inducing the PEgRNA to cyclize via incomplete splicing (to form a ciRNA) could also increase PEgRNA stability and result in the PEgRNA being retained within the nucleus. [0274] Additional RNA motifs could also improve RT processivity or enhance PEgRNA activity by enhancing RT binding to the DNA-RNA duplex. Addition of the native sequence bound by the RT in its cognate retroviral genome could enhance RT activity. This could include the native primer binding site (PBS), polypurine tract (PPT), or kissing loops involved in retroviral genome dimerization and initiation of transcription. [0275] Addition of dimerization motifs, such as kissing loops or a GNRA tetraloop/tetraloop receptor pair, at the 5ʹ and 3ʹ termini of the PEgRNA could also result in effective circularization of the PEgRNA, improving stability. Additionally, it is envisioned that addition of these motifs could allow the physical separation of the PEgRNA spacer and primer, preventing occlusion of the spacer, which would hinder PE activity. Short 5ʹ extensions or 3ʹ extensions to the PEgRNA that form a small toehold hairpin in the spacer region or along the primer binding site could also compete favorably against the annealing of intracomplementary regions along the length of the PEgRNA, e.g., the interaction between the spacer and the primer binding site that can occur. Finally, kissing loops could also be used to recruit other template RNAs to the genomic site and enable swapping of RT activity from one RNA to the other. A number of secondary RNA structures may be engineered into any region of the PEgRNA, including in the terminal portions of the extension arm (i.e., e1 and e2), as shown. [0276] Example modifications include, but are not limited to: [0277] PEgRNA-HDV fusion [0278] PEgRNA-MMLV kissing loop [0279] PEgRNA-VS ribozyme kissing loop [0280] PEgRNA-GNRA tetraloop/tetraloop receptor [0281] PEgRNA template switching secondary RNA-HDV fusion [0282] PEgRNA scaffolds can be further improved via directed evolution, in an analogous fashion to how SpCas9 and prime editors (PE) have been improved. Directed evolution can enhance PEgRNA recognition by Cas9 or evolved Cas9 variants. Additionally, different PEgRNA scaffold sequences are likely optimal at different genomic loci, either enhancing PE activity at the site in question, reducing off-target activities, or both. Finally, evolution of PEgRNA scaffolds to which other RNA motifs have been added would almost certainly improve the activity of the fused PEgRNA relative to the unevolved, fusion RNA. For instance, evolution of allosteric ribozymes composed of c-di-GMP-I aptamers and hammerhead ribozymes led to dramatically improved activity, suggesting that evolution would improve the activity of hammerhead-PEgRNA fusions as well. In addition, while Cas9 currently does not generally tolerate 5ʹ extension of the sgRNA as well as tolerating 3′ extensions, directed evolution will likely generate mutations that mitigate this intolerance, allowing additional RNA motifs to be utilized. [0283] In various embodiments, other scaffolds that have been shown to improve activity relative to canonical sgRNA scaffolds may be used in pegRNAs and epegRNAs as described herein. Such improvements may include, for example, those disclosed in Chen, B. et al., Dynamic Imaging of Genomic Loci in Living Human Cells by an Optimized CRISPR/Cas System. Cell.2013, 155(7), 1479-1471 and Jost, M. et al., Titrating expression using libraries of systematically attenuated CRISPR guide RNAs. Nat. Biotechnol.2020, 38, 355-364, which are herein incorporated by reference in their entirety. These improvements may enhance epegRNA activity through improved binding to the prime editor and/or improved expression. Stabilization of the sgRNA scaffold could also reduce PBS/spacer interactions that inhibit pegRNA and epegRNA activity. [0284] Exemplary epegRNAs incorporating improved sgRNA scaffolds include, but are not limited to: [0285] HEK31-15del standard scaffold evopreQ1 [0286] HEK31-15del cr748 evopreQ1 [0287] HEK31-15del cr289 evopreQ1 [0288] HEK31-15del cr622 evopreQ1 [0289] HEK31-15del cr772 evopreQ1 [0290] HEK31-15del cr532 evopreQ1 [0291] HEK31-15del cr961 evopreQ1 [0292] HEK31-15del flip and extension scaffold evopreQ1 [0293] RNF21-15del cr748 evopreQ1 [0294] RNF21-15del cr289 evopreQ1 [0295] RNF21-15del cr622 evopreQ1 [0296] RNF21-15del cr772 evopreQ1 [0297] RNF21-15del cr532 evopreQ1 [0298] RNF21-15del cr961 evopreQ1 [0299] RNF21-15del flip and extension scaffold evopreQ1 [0300] RUNX11-15del standard scaffold evopreQ1 [0301] RUNX11-15del cr748 evopreQ1 [0302] RUNX11-15del cr289 evopreQ1 [0303] RUNX11-15del cr622 evopreQ1 [0304] RUNX11-15del cr772 evopreQ1 [0305] RUNX11-15del cr532 evopreQ1 [0306] RUNX11-15del cr961 evopreQ1 [0307] RUNX11-15del flip and extension scaffold evopreQ1 [0308] RUNX1 +5G-T standard scaffold evopreQ1 [0309] RUNX1 +5G-T cr748 evopreQ1 [0310] RUNX1 +5G-T cr289 evopreQ1 [0311] RUNX1 +5G-T cr622 evopreQ1 [0312] RUNX1 +5G-T cr772 evopreQ1 [0313] RUNX1 +5G-T cr532 evopreQ1 [0314] RUNX1 +5G-T cr961 evopreQ1 [0315] RUNX1 +5G-T flip and extension scaffold evopreQ1 [0316] DNMT11-15del standard scaffold evopreQ1 [0317] DNMT11-15del cr748 evopreQ1 [0318] DNMT11-15del cr289 evopreQ1 [0319] DNMT11-15del cr622 evopreQ1 [0320] DNMT11-15del cr772 evopreQ1 [0321] DNMT11-15del cr532 evopreQ1 [0322] DNMT11-15del cr961 evopreQ1 [0323] DNMT11-15del flip and extension scaffold evopreQ1 [0324] DNMT1 +5 G--T standard scaffold evopreQ1 [0325] DNMT1 +5 G--T cr748 evopreQ1 [0326] DNMT1 +5 G--T cr289 evopreQ1 [0327] DNMT1 +5 G--T cr622 evopreQ1 [0328] DNMT1 +5 G--T cr772 evopreQ1 [0329] DNMT1 +5 G--T cr532 evopreQ1 [0330] DNMT1 +5 G--T cr961 evopreQ1 [0331] DNMT1 +5 G--T flip and extension scaffold evopreQ1 [0332] FANCF 1-15del standard scaffold evopreQ1 [0333] FANCF 1-15del cr748 evopreQ1 [0334] FANCF 1-15del cr289 evopreQ1 [0335] FANCF 1-15del cr622 evopreQ1 [0336] FANCF 1-15del cr772 evopreQ1 [0337] FANCF 1-15del cr532 evopreQ1 [0338] FANCF 1-15del cr961 evopreQ1 [0339] FANCF 1-15del flip and extension scaffold evopreQ1 [0340] FANCF +5 G--T cr748 evopreQ1 [0341] FANCF +5 G--T cr289 evopreQ1 [0342] FANCF +5 G--T cr622 evopreQ1 [0343] FANCF +5 G--T cr772 evopreQ1 [0344] FANCF +5 G--T cr532 evopreQ1 [0345] FANCF +5 G--T cr961 evopreQ1 [0346] FANCF +5 G--T flip and extension scaffold evopreQ1 [0347] EMX11-15del standard scaffold evopreQ1 [0348] EMX11-15del cr748 evopreQ1 [0349] EMX11-15del cr289 evopreQ1 [0350] EMX11-15del cr622 evopreQ1 [0351] EMX11-15del cr772 evopreQ1 [0352] EMX11-15del cr532 evopreQ1 [0353] EMX11-15del cr961 evopreQ1 [0354] EMX11-15del flip and extension scaffold evopreQ1 [0355] EMX1 +5 G--T standard scaffold evopreQ1 [0356] EMX1 +5 G--T cr748 evopreQ1 [0357] EMX1 +5 G--T cr289 evopreQ1 [0358] EMX1 +5 G--T cr622 evopreQ1 [0359] EMX1 +5 G--T cr772 evopreQ1 [0360] EMX1 +5 G--T cr532 evopreQ1 [0361] EMX1 +5 G--T cr961 evopreQ1 [0362] EMX1 +5 G--T flip and extension scaffold evopreQ1 [0363] RNF2 +1FLAG standard scaffold evopreQ1 [0364] RNF2 +1FLAG cr748 evopreQ1 [0365] RNF2 +1FLAG cr289 evopreQ1 [0366] RNF2 +1FLAG cr622 evopreQ1 ( Q ) [0367] RNF2 +1FLAG cr772 evopreQ1 [0368] RNF2 +1FLAG cr532 evopreQ1 [0369] RNF2 +1FLAG cr961 evopreQ1 [0370] RNF2 +1FLAG flip and extension scaffold evopreQ1 [0371] VEGFA +5 G--T cr748 evopreQ1 [0372] VEGFA +5 G--T cr289 evopreQ1 [0373] VEGFA +5 G--T cr622 evopreQ1 [0374] VEGFA +5 G--T cr772 evopreQ1 [0375] VEGFA +5 G--T cr532 evopreQ1 [0376] VEGFA +5 G--T cr961 evopreQ1 [0377] VEGFA +5 G--T flip and extension scaffold evopreQ1 [0378] VEGFA +1FLAG standard scaffold evopreQ1 ) [0379] VEGFA +1FLAG cr748 evopreQ1 ( Q ) [0380] VEGFA +1FLAG cr289 evopreQ1 [0381] VEGFA +1FLAG cr622 evopreQ1 (S Q NO: 35) [0382] VEGFA +1FLAG cr772 evopreQ1 [0383] VEGFA +1FLAG cr532 evopreQ1 (S Q NO: 35) [0384] VEGFA +1FLAG cr961 evopreQ1 ( Q ) [0385] VEGFA +1FLAG flip and extension scaffold evopreQ1 ( Q ) [0386] VEGFA 1-15 del standard scaffold evopreQ1 [0387] VEGFA 1-15 del cr748 evopreQ1 ( Q ) [0388] VEGFA 1-15 del cr289 evopreQ1 [0389] VEGFA 1-15 del cr622 evopreQ1 ( Q ) [0390] VEGFA 1-15 del cr772 evopreQ1 [0391] VEGFA 1-15 del cr532 evopreQ1 [0392] VEGFA 1-15 del cr961 evopreQ1 [0393] VEGFA 1-15 del flip and extension scaffold evopreQ1 [0394] RUNX1 +1FLAG standard scaffold evopreQ1 [0395] RUNX1 +1FLAG cr748 evopreQ1 NO: 366) [0396] RUNX1 +1FLAG cr289 evopreQ1 ) [0397] RUNX1 +1FLAG cr622 evopreQ1 ( Q ) [0398] RUNX1 +1FLAG cr772 evopreQ1 [0399] RUNX1 +1FLAG cr532 evopreQ1 [0400] RUNX1 +1FLAG cr961 evopreQ1 ) [0401] RUNX1 +1FLAG flip and extension scaffold evopreQ1 [0402] DNMT1 +1FLAG standard scaffold evopreQ1 ) [0403] DNMT1 +1FLAG cr748 evopreQ1 [0404] DNMT1 +1FLAG cr289 evopreQ1 [0405] DNMT1 +1FLAG cr622 evopreQ1 [0406] DNMT1 +1FLAG cr772 evopreQ1 [0407] DNMT1 +1FLAG cr532 evopreQ1 [0408] DNMT1 +1FLAG cr961 evopreQ1 [0409] DNMT1 +1FLAG flip and extension scaffold evopreQ1 [0410] The present disclosure contemplates any such ways to further improve the efficacy of the prime editing systems utilized in the methods and complexes disclosed herein. [0411] In various embodiments, it may be advantageous to limit the appearance of a consecutive sequence of Ts from the extension arm, as consecutive series of T’s may limit the capacity of the PEgRNA to be transcribed. For example, strings of at least three consecutive T’s, at least four consecutive T’s, at least five consecutive T’s, at least six consecutive T’s, at least seven consecutive T’s, at least eight consecutive T’s, at least nine consecutive T’s, at least ten consecutive T’s, at least eleven consecutive T’s, at least twelve consecutive T’s, at least thirteen consecutive T’s, at least fourteen consecutive T’s, or at least fifteen consecutive T’s should be avoided when designing the PEgRNA, or should be at least removed from the final designed sequence. In one embodiment, one can avoid the inclusion of unwanted strings of consecutive T’s in PEgRNA extension arms by avoiding target sites that are rich in consecutive A:T nucleobase pairs. Methods of Treatment and Uses [0412] In some aspects, the present disclosure provides methods of treating trinucleotide repeat disorders, including Huntington’s disease and Friedreich’s ataxia, using prime editing. [0413] In one aspect, the present disclosure provides methods of treating Huntington’s disease by prime editing comprising contacting a target nucleotide sequence with any of the pegRNA-prime editor complexes or compositions disclosed herein. In some embodiments, the contacting results in the contraction or replacement of a CAG repeat sequence in the target nucleotide sequence. In some embodiments, the CAG repeat sequence is contracted from greater than 35 repeats to 35 or fewer repeats (e.g., 35 repeats, 34 repeats, 33 repeats, 32 repeats, 31 repeats, 30 repeats, 29 repeats, 28 repeats, 27 repeats, 26 repeats, 25 repeats, 24 repeats, 23 repeats, 22 repeats, 21 repeats, 20 repeats, 19 repeats, 18 repeats, 17 repeats, 16 repeats, 15 repeats, 14 repeats, 13 repeats, 12 repeats, 11 repeats, 10 repeats, 9 repeats, 8 repeats, 7 repeats, 6 repeats, 5 repeats, 4 repeats, or 3 repeats). In certain embodiments, the CAG repeat sequence is greater than 35 repeats in length and is replaced with a CAG repeat sequence of 35 or fewer repeats in length (e.g., 35 or fewer repeats, 34 or fewer repeats, 33 or fewer repeats, 32 or fewer repeats, 31 or fewer repeats, 30 or fewer repeats, 29 or fewer repeats, 28 or fewer repeats, 27 or fewer repeats, 26 or fewer repeats, 25 or fewer repeats, 24 or fewer repeats, 23 or fewer repeats, 22 or fewer repeats, 21 or fewer repeats, 20 or fewer repeats, 19 or fewer repeats, 18 or fewer repeats, 17 or fewer repeats, 16 or fewer repeats, 15 or fewer repeats, 14 or fewer repeats, 13 or fewer repeats, 12 or fewer repeats, 11 or fewer repeats, 10 or fewer repeats, 9 or fewer repeats, 8 or fewer repeats, 7 or fewer repeats, 6 or fewer repeats, 5 or fewer repeats, 4 or fewer repeats, or 3 or fewer repeats). In certain embodiments, the CAG repeat sequence is contracted to four repeats or replaced with a sequence comprising four CAG repeats. [0414] In some embodiments, the methods further comprise nicking the non-PAM-containing strand of the target nucleotide sequence using a nicking gRNA. In some embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence: GGCGGCTGAGGAAGCTGAGG (SEQ ID NO: 387); GGCGGCGGCGGCGGCGGTGG (SEQ ID NO: 385); GCTGCTGCTGCTGCTGCTGC (SEQ ID NO: 384); GAAGGACTTGAGGGACTCGA (SEQ ID NO: 393); GTGAGGAAGCTGAGGAGGCGG (SEQ ID NO: 394); GCTGTTGCTGCTGCTGCTGC (SEQ ID NO: 395); GCTGCTGCTGCTGCTGCTGGA (SEQ ID NO: 383); GCTGCTGCTGGAAGGACTTG (SEQ ID NO: 382); GGCCTTCATCAGCTTTTCC (SEQ ID NO: 396); GGCTTTCATCAGCTTTTCC (SEQ ID NO: 397); GGGACTCGAAGGCCTTCAT (SEQ ID NO: 398); GGAAGGACTTGAGGGACTCG (SEQ ID NO: 399); GCTGCTGCTGGAAGGACTT (SEQ ID NO: 400); GCTGCTGCTGCTGCTGCTGG (SEQ ID NO: 401); GCTGGAAGGACTTGAGGGACT (SEQ ID NO: 402); GCTGCTGCTGCTGGAAGGACT (SEQ ID NO: 403); CTGCTGCTGCTGCTGCTGGAA (SEQ ID NO: 404); GCTGCTGCTGCTGCTGCTGCT (SEQ ID NO: 405); GCTGCTGGAAGGACTTGAG (SEQ ID NO: 406); GCTGCTGGAAGGACTTCAG (SEQ ID NO: 407); GCTGCTGAAAGGACTTCAG (SEQ ID NO: 408); or GCTGTTGGAAGGACTTCAG (SEQ ID NO: 409), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any of these. In certain embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence GCTGCTGCTGCTGCTGCTGC (SEQ ID NO: 384), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 384. In certain embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence GCTGCTGGAAGGACTTGAG (SEQ ID NO: 406), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 406. [0415] In some embodiments, the contacting is performed in a cell. In some embodiments, the cell is a eukaryotic cell. In certain embodiments, the cell is a human cell. In certain embodiments, the cell is in vitro. In certain embodiments, the cell is ex vivo. In some embodiments, the cell is in a subject. In certain embodiments, the subject is a human. [0416] In another aspect, the present disclosure provides methods of treating Friedreich’s ataxia by prime editing comprising contacting a target nucleotide sequence with any of the complexes provided herein. In some embodiments, the contacting results in the deletion of a GAA repeat sequence in the target nucleotide sequence. In some embodiments, the GAA repeat sequence deleted comprises greater than about 50 GAA repeats, greater than about 51 GAA repeats, greater than about 52 GAA repeats, greater than about 53 GAA repeats, greater than about 54 GAA repeats, greater than about 55 GAA repeats, greater than about 56 GAA repeats, greater than about 57 GAA repeats, greater than about 58 GAA repeats, greater than about 59 GAA repeats, greater than about 60 GAA repeats, greater than about 61 GAA repeats, greater than about 62 GAA repeats, greater than about 63 GAA repeats, greater than about 64 GAA repeats, greater than about 65 GAA repeats, greater than about 66 GAA repeats, greater than about 67 GAA repeats, greater than about 68 GAA repeats, greater than about 69 GAA repeats, greater than about 70 GAA repeats, greater than about 71 GAA repeats, greater than about 72 GAA repeats, greater than about 73 GAA repeats, greater than about 74 GAA repeats, greater than about 75 GAA repeats, greater than about 76 GAA repeats, greater than about 77 GAA repeats, greater than about 78 GAA repeats, greater than about 79 GAA repeats, or greater than about 80 GAA repeats. In certain embodiments, the GAA repeat sequence deleted comprises greater than 65 GAA repeats. [0417] In some embodiments, the methods further comprise nicking the non-PAM-containing strand of the target nucleotide sequence using a nicking gRNA. In some embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence: GTCCCAAAGTGCTGAGATTAT (SEQ ID NO: 410); GTGTATTTTTTAGTAGATACT (SEQ ID NO: 411); GATTCTCCTGCCGCAGCCTC (SEQ ID NO: 412); or GCGACACCACGCCCGGCTAAC (SEQ ID NO: 413), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any of these. In certain embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence GTGTATTTTTTAGTAGATACT (SEQ ID NO: 411), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 411. In certain embodiments, the spacer of the nicking gRNA comprises the nucleotide sequence GCGACACCACGCCCGGCTAAC (SEQ ID NO: 413), or a nucleotide sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 413. [0418] In some embodiments, the contacting is performed in a cell. In some embodiments, the cell is a eukaryotic cell. In certain embodiments, the cell is a human cell. In certain embodiments, the cell is a central nervous system cell. In certain embodiments, the cell is in vitro. In certain embodiments, the cell is ex vivo. In some embodiments, the cell is in a subject. In certain embodiments, the subject is a human. In certain embodiments, the central nervous system is targeted. [0419] In some embodiments, one or more polynucleotides encoding any of the complexes provided herein are delivered to a cell. In certain embodiments, two polynucleotides encoding any of the complexes provided herein are delivered to a cell. In certain embodiments, the two polynucleotides comprise two halves of a prime editor comprising a split intein capable of reassembling into a prime editor molecule. In some embodiments, the one or more polynucleotides encoding a complex provided herein are delivered to a cell in one or more adeno-associated virus (AAV) particles. Delivery of prime editor complexes has been described, for example, in U.S. Provisional Application, U.S.S.N., 63/426,336, filed November 17, 2022, U.S. Provisional Application, U.S.S.N., 63/491,013, filed March 17, 2023, and Davis, J. R., et al. Nat. Biotechnol.2023, each of which is incorporated herein by reference. In certain embodiments, the one or more polynucleotides encoding the complex are delivered to the cell in two AAV particles. In some embodiments, one or both of the AAV particles comprise AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, or AAV9. In certain embodiments, one or both of the AAV particles comprise AAV9. In some embodiments, the AAV particles target central nervous system cells. [0420] In some embodiments, a first and second AAV particle are delivered to a cell. In certain embodiments, the first AAV particle comprises a polynucleotide comprising the structure 5′-[inverted terminal repeat (ITR) sequence]-[promoter]-[napDNAbp N-terminal fragment]-[N-intein]-[terminator sequence]-[ITR sequence]-3′. In certain embodiments, the second AAV particle comprises a polynucleotide comprising the structure 5′-[ITR sequence]- [promoter]-[C-intein]-[napDNAbp C-terminal fragment]-[reverse transcriptase]-[terminator sequence]-[optional nicking gRNA]-[pegRNA]-[ITR]-3′. In certain embodiments, the split site within the napDNAbp is between amino acid residues 844 and 845 of a Cas9 (e.g., a Cas9 protein of SEQ ID NO: 6). In certain embodiments, the split site within the napDNAbp is between amino acid residues 1024 and 1025 of a Cas9 protein (e.g., a Cas9 protein of SEQ ID NO: 6). napDNAbp [0421] In various embodiments, the prime editors utilized in the complexes and methods described herein comprise a nucleic acid programmable DNA binding protein (napDNAbp). [0422] In various embodiments, prime editors may include a napDNAbp domain having a wild type Cas9 sequence, including, for example the canonical Streptococcus pyogenes Cas9 sequence of SEQ ID NO: 6, shown as follows. [0423] In other embodiments, the prime editors may include a napDNAbp domain having a modified Cas9 sequence, including, for example the nickase variant of Streptococcus pyogenes Cas9 of SEQ ID NO: 7 having an H840A substitution relative to the wild type SpCas9 (of SEQ ID NO: 6), shown as follows: [0424] The prime editors described herein may include any of the modified Cas9 sequences described above, or any variant thereof having at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% sequence identity thereto. In some embodiments, the prime editors used in the methods described herein include any of the following other wild type SpCas9 sequences, which may be modified with one or more of the mutations described herein at corresponding amino acid positions:
[0425] The prime editors used in the methods described herein may include any of the above SpCas9 sequences, or any variant thereof having at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% sequence identity thereto. In other embodiments, the Cas9 protein can be a wild type Cas9 ortholog from another bacterial species different from the canonical Cas9 from S. pyogenes. For example, modified versions of the following Cas9 orthologs can be used in connection with the prime editors described in this specification by making mutations at positions corresponding to H840A or any other amino acids of interest in wild type SpCas9. In addition, any variant Cas9 orthologs having at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% sequence identity to any of the below orthologs may also be used with the prime editors.
[0426] In some embodiments, the napDNAbp used in the prime editors described herein comprise one or more amino acid mutations relative to a wild type napDNAbp, for example, a wild type Cas9 protein. In some embodiments, a Cas9 protein comprises an inactivating mutation in an HNH domain. In some embodiments, the prime editors described herein comprise a Cas9 protein comprising one or more amino acid substitutions relative to the amino acid sequence of SEQ ID NO: 6, or an amino acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 6. In certain embodiments, the Cas9 protein comprises a K775R substitution and/or a K918A substitution relative to the amino acid sequence of SEQ ID NO: 6, or an amino acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 6 (or corresponding mutations in a homologous Cas9 protein). In certain embodiments, the Cas9 protein comprises a K775R substitution and a K918A substitution relative to the amino acid sequence of SEQ ID NO: 6, or an amino acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 6 (or corresponding mutations in a homologous Cas9 protein). In certain embodiments, the Cas9 protein comprises a D23G substitution and/or an H754R substitution relative to the amino acid sequence of SEQ ID NO: 6, or an amino acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 6 (or corresponding mutations in a homologous Cas9 protein). In certain embodiments, the Cas9 protein comprises a D23G substitution and an H754R substitution relative to the amino acid sequence of SEQ ID NO: 6, or an amino acid sequence at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 6 (or corresponding mutations in a homologous Cas9 protein). [0427] The napDNAbp used in the prime editors described herein may include any suitable homologs and/or orthologs or naturally occurring enzymes, such as Cas9. Cas9 homologs and/or orthologs have been described in various species, including, but not limited to, S. pyogenes and S. thermophilus. The Cas moiety may be configured (e.g., mutagenized, recombinantly engineered, or otherwise obtained from nature) as a nickase, i.e., capable of cleaving only a single strand of the target double-stranded DNA. In some embodiments, a Cas9 nuclease has an inactive (e.g., an inactivated) DNA cleavage domain; that is, the Cas9 is a nickase. In some embodiments, the Cas9 protein comprises an amino acid sequence that is at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% identical to the amino acid sequence of a Cas9 protein as provided by any one of the Cas9 orthologs in the above tables. [0428] Additional suitable napDNAbp sequences that can be used in prime editors will be apparent to those of skill in the art based on this disclosure, and such Cas9 nucleases and sequences include Cas9 sequences from the organisms and loci disclosed in Chylinski, Rhun, and Charpentier, “The tracrRNA and Cas9 families of type II CRISPR-Cas immunity systems” (2013) RNA Biology 10:5, 726-737; the entire contents of which are incorporated herein by reference. Additional exemplary Cas variants and homologs include, but are not limited to, Cas9 (e.g., dCas9 and nCas9), Cpf1, CasX, CasY, C2c1, C2c2, C2c3, GeoCas9, CjCas9, Cas12a, Cas12b, Cas12g, Cas12h, Cas12i, Cas13b, Cas13c, Cas13d, Cas14, Csn2, xCas9, SpCas9-NG, Nme2Cas9, circularly permuted Cas9, Argonaute (Ago), Cas9-KKH, SmacCas9, Spy-macCas9, SpCas9-VRQR, SpCas9-NRRH, SpaCas9-NRTH, SpCas9-NRCH, LbCas12a, AsCas12a, CeCas12a, MbCas12a, Cas3, CasΦ, and circularly permuted Cas9 domains such as CP1012, CP1028, CP1041, CP1249, and CP1300, and variants and homologs thereof. Reverse transcriptase domain [0429] The prime editors used in the complexes and methods described herein comprise a reverse transcriptase domain. In some embodiments, the reverse transcriptase domain is a wild type MMLV reverse transcriptase. In some embodiments, the reverse transcriptase domain is a variant of wild type MMLV reverse transcriptase having the amino acid sequence of SEQ ID NO: 34 (e.g., at least 85%, at least 90%, at least 92%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 99.5% identical to the amino acid sequence of SEQ ID NO: 34). [0430] For example, PE2 and PEmax comprise a variant reverse transcriptase domain of SEQ ID NO: 34, which is based on the wild type MMLV reverse transcriptase domain of SEQ ID NO: 33 (and, in particular, a Genscript codon optimized MMLV reverse transcriptase having the nucleotide sequence of SEQ ID NO: 33), and which comprises amino acid substitutions D200N, T306K, W313F, T330P, and L603W relative to the wild type MMLV RT of SEQ ID NO: 33. The amino acid sequence of the variant RT domain of PE2 and PEmax is SEQ ID NO: 34. [0431] Prime editors may also comprise other variant RT domains as well. In various embodiments, the prime editors used in the methods and systems described herein (with the RT domain provided as either a fusion partner or in trans) can include a variant RT comprising one or more of the following mutations: P51L, S67K, E69K, L139P, T197A, D200N, H204R, F209N, E302K, E302R, T306K, F309N, W313F, T330P, L345G, L435G, N454K, D524G, E562Q, D583N, H594Q, L603W, E607K, or D653N in the wild type M- MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence. [0432] Some exemplary reverse transcriptases that can be fused to napDNAbp proteins or provided as individual proteins according to various embodiments of this disclosure are provided below. Exemplary reverse transcriptases include variants with at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity to the following wild-type enzymes or partial enzymes: [0433] In various embodiments, the prime editors utilized in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising one or more of the following mutations: P51X, S67X, E69X, L139X, T197X, D200X, H204X, F209X, E302X, T306X, F309X, W313X, T330X, L345X, L435X, N454X, D524X, E562X, D583X, H594X, L603X, E607X, or D653X in the wild type M- MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. [0434] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising a P51X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is L. [0435] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an S67X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is K. [0436] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an E69X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is K. [0437] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an L139X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is P. [0438] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising a T197X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is A. [0439] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising a D200X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is N. [0440] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an H204X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is R. [0441] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an F209X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is N. [0442] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an E302X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is K. [0443] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an E302X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is R. [0444] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising a T306X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is K. [0445] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an F309X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is N. [0446] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising a W313X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is F. [0447] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising a T330X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is P. [0448] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an L345X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is G. [0449] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an L435X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is G. [0450] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an N454X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is K. [0451] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising a D524X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is G. [0452] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an E562X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is Q. [0453] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising a D583X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is N. [0454] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an H594X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is Q. [0455] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an L603X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is W. [0456] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising an E607X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is K. [0457] In various embodiments, the prime editors used in the complexes and methods described herein (with RT provided as either a fusion partner or in trans) can include a variant RT comprising a D653X mutation in the wild type M-MLV RT of SEQ ID NO: 33, or at a corresponding amino acid position in another wild type RT polypeptide sequence, wherein “X” can be any amino acid. In certain embodiments, X is N. [0458] Some exemplary reverse transcriptases that can be fused to napDNAbp proteins or provided as individual proteins according to various embodiments of this disclosure are provided below. Exemplary reverse transcriptases include variants with at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity to the wild-type enzymes or partial enzymes described in SEQ ID NOs: 33- 78, 159-179, and 188-213. [0459] The prime editor (PE) system described here contemplates any publicly available reverse transcriptase (and any variant thereof) described or disclosed in any of the following U.S. patents (each of which is incorporated by reference), U.S. Patent Nos: 10,202,658; 10,189,831; 10,150,955; 9,932,567; 9,783,791; 9,580,698; 9,534,201; and 9,458,484, that can be made using known methods for installing mutations or known methods for evolving proteins. The following references describe reverse transcriptases known in the art. Each of their disclosures are incorporated herein by reference. [0460] Herzig, E., Voronin, N., Kucherenko, N. & Hizi, A. A Novel Leu92 Mutant of HIV-1 Reverse Transcriptase with a Selective Deficiency in Strand Transfer Causes a Loss of Viral Replication. J. Virol.89, 8119–8129 (2015). [0461] Mohr, G. et al. A Reverse Transcriptase-Cas1 Fusion Protein Contains a Cas6 Domain Required for Both CRISPR RNA Biogenesis and RNA Spacer Acquisition. Mol. Cell 72, 700-714.e8 (2018). [0462] Zhao, C., Liu, F. & Pyle, A. M. An ultraprocessive, accurate reverse transcriptase encoded by a metazoan group II intron. RNA 24, 183–195 (2018). [0463] Zimmerly, S. & Wu, L. An Unexplored Diversity of Reverse Transcriptases in Bacteria. Microbiol Spectr 3, MDNA3-0058–2014 (2015). [0464] Ostertag, E. M. & Kazazian Jr, H. H. Biology of Mammalian L1 Retrotransposons. Annual Review of Genetics 35, 501–538 (2001). [0465] Perach, M. & Hizi, A. Catalytic Features of the Recombinant Reverse Transcriptase of Bovine Leukemia Virus Expressed in Bacteria. Virology 259, 176–189 (1999). [0466] Lim, D. et al. Crystal structure of the moloney murine leukemia virus RNase H domain. J. Virol.80, 8379–8389 (2006). [0467] Zhao, C. & Pyle, A. M. Crystal structures of a group II intron maturase reveal a missing link in spliceosome evolution. Nature Structural & Molecular Biology 23, 558–565 (2016). [0468] Griffiths, D. J. Endogenous retroviruses in the human genome sequence. Genome Biol.2, REVIEWS1017 (2001). [0469] Baranauskas, A. et al., Generation and characterization of new highly thermostable and processive M-MuLV reverse transcriptase variants. Protein Eng Des Sel 25, 657–668 (2012). [0470] Zimmerly, S., Guo, H., Perlman, P. S. & Lambowltz, A. M. Group II intron mobility occurs by target DNA-primed reverse transcription. Cell 82, 545–554 (1995). [0471] Feng, Q., Moran, J. V., Kazazian, H. H. & Boeke, J. D. Human L1 retrotransposon encodes a conserved endonuclease required for retrotransposition. Cell 87, 905–916 (1996). [0472] Berkhout, B., Jebbink, M. & Zsíros, J. Identification of an Active Reverse Transcriptase Enzyme Encoded by a Human Endogenous HERV-K Retrovirus. Journal of Virology 73, 2365–2375 (1999). [0473] Kotewicz, M. L., Sampson, C. M., D’Alessio, J. M. & Gerard, G. F. Isolation of cloned Moloney murine leukemia virus reverse transcriptase lacking ribonuclease H activity. Nucleic Acids Res 16, 265–277 (1988). [0474] Arezi, B. & Hogrefe, H. Novel mutations in Moloney Murine Leukemia Virus reverse transcriptase increase thermostability through tighter binding to template-primer. Nucleic Acids Res 37, 473–481 (2009). [0475] Blain, S. W. & Goff, S. P. Nuclease activities of Moloney murine leukemia virus reverse transcriptase. Mutants with altered substrate specificities. J. Biol. Chem.268, 23585– 23592 (1993). [0476] Xiong, Y. & Eickbush, T. H. Origin and evolution of retroelements based upon their reverse transcriptase sequences. EMBO J 9, 3353–3362 (1990). [0477] Herschhorn, A. & Hizi, A. Retroviral reverse transcriptases. Cell. Mol. Life Sci.67, 2717–2747 (2010). [0478] Taube, R., Loya, S., Avidan, O., Perach, M. & Hizi, A. Reverse transcriptase of mouse mammary tumour virus: expression in bacteria, purification and biochemical characterization. Biochem. J.329 ( Pt 3), 579–587 (1998). [0479] Liu, M. et al. Reverse Transcriptase-Mediated Tropism Switching in Bordetella Bacteriophage. Science 295, 2091–2094 (2002). [0480] Luan, D. D., Korman, M. H., Jakubczak, J. L. & Eickbush, T. H. Reverse transcription of R2Bm RNA is primed by a nick at the chromosomal target site: a mechanism for non-LTR retrotransposition. Cell 72, 595–605 (1993). [0481] Nottingham, R. M. et al. RNA-seq of human reference RNA samples using a thermostable group II intron reverse transcriptase. RNA 22, 597–613 (2016). [0482] Telesnitsky, A. & Goff, S. P. RNase H domain mutations affect the interaction between Moloney murine leukemia virus reverse transcriptase and its primer-template. Proc. Natl. Acad. Sci. U.S.A.90, 1276–1280 (1993). [0483] Halvas, E. K., Svarovskaia, E. S. & Pathak, V. K. Role of Murine Leukemia Virus Reverse Transcriptase Deoxyribonucleoside Triphosphate-Binding Site in Retroviral Replication and In Vivo Fidelity. Journal of Virology 74, 10349–10358 (2000). [0484] Nowak, E. et al., Structural analysis of monomeric retroviral reverse transcriptase in complex with an RNA/DNA hybrid. Nucleic Acids Res 41, 3874–3887 (2013). [0485] Stamos, J. L., Lentzsch, A. M. & Lambowitz, A. M. Structure of a Thermostable Group II Intron Reverse Transcriptase with Template-Primer and Its Functional and Evolutionary Implications. Molecular Cell 68, 926-939.e4 (2017). [0486] Das, D. & Georgiadis, M. M. The Crystal Structure of the Monomeric Reverse Transcriptase from Moloney Murine Leukemia Virus. Structure 12, 819–829 (2004). [0487] Avidan, O., Meer, M. E., Oz, I. & Hizi, A. The processivity and fidelity of DNA synthesis exhibited by the reverse transcriptase of bovine leukemia virus. European Journal of Biochemistry 269, 859–867 (2002). [0488] Gerard, G. F. et al. The role of template-primer in protection of reverse transcriptase from thermal inactivation. Nucleic Acids Res 30, 3118–3129 (2002). [0489] Monot, C. et al. The Specificity and Flexibility of L1 Reverse Transcription Priming at Imperfect T-Tracts. PLOS Genetics 9, e1003499 (2013). [0490] Mohr, S. et al. Thermostable group II intron reverse transcriptase fusion proteins and their use in cDNA synthesis and next-generation RNA sequencing. RNA 19, 958–970 (2013). [0491] In some embodiments, the prime editor proteins comprise an MMLV reverse transcriptase comprising one or more amino acid substitutions. The wild-type MMLV reverse transcriptase is provided by the following sequence: [0492] The reverse transcriptases used in the prime editors described herein may comprise one or more mutations relative to the wild-type amino acid sequence. In some embodiments, the reverse transcriptase is the MMLV pentamutant described above (i.e., comprising amino acid substitutions D200N, T306K, W313F, T330P, and L603W). [0493] In some embodiments, the present disclosure provides MMLV reverse transcriptase variants, and prime editors (e.g., fusion proteins and prime editors in which the napDNAbp and reverse transcriptase are provided in trans) comprising MMLV reverse transcriptase variants, wherein the variants comprise one or more mutations relative to SEQ ID NO: 33 selected from the group consisting of T13I, V19I, A32T, G38V, S60Y, P111L, K120R, H126Y, T128N, T128F, T128H, V129S, P132S, G138R, C157F, P175Q, P175S, D200S, D200Y, D200N, D200C, Y222F, V223A, V223M, V223T, V223W, V223Y, L234I, T246I, N249S, T287A, P292T, E302A, E302K, T306K, G316R, E346K, K373N, W388C, V402A, K445N, M457I, and A462S. In some embodiments, an MMLV reverse transcriptase variant comprises two or more of these mutations, three or more of these mutations, four or more of these mutations, or five or more of these mutations. [0494] In some embodiments, the MMLV reverse transcriptase variants used in the prime editors provided herein comprise a single mutation relative to SEQ ID NO: 33. In some embodiments, the single mutations is selected from the group consisting of T13I, G38V, K120R, H126Y, T128N, T128F, T128H, V129S, P132S, P175Q, P175S, D200C, D200Y, V223M, V223T, V223W, V223Y, L234I, P292T, G316R, K373N, M457I, and V402A. [0495] In certain embodiments, the MMLV reverse transcriptase variants used in the prime editors provided herein comprise any one of the following groups of mutations relative to the amino acid sequence of SEQ ID NO: 33: D200Y and E302A; D200Y, V223A, and M457I; V223M, T306K, and A462S; D200N and E302K; D200Y and E302K; T128N and V223A; V19I, A32T, and D200Y; D200S, V223A, E346K, and W388C; S60Y, V223A, and N249S; P111L, V223A, T287A, and G316R; S60Y, G138R, and V223A; S60Y, Y222F, V223A, and K445N; or S60Y, C157F, V223A, and T246I. In certain embodiments, the MMLV reverse transcriptase variant used in the prime editors provided herein comprises the amino acid sequence of any one of SEQ ID NOs: 33-42, 63-78, and 172-179, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 33-42, 63-78, and 172-179, wherein the amino acid sequence comprises at least one of residues 13I, 19I, 32T, 38V, 60Y, 111L, 120R, 126Y, 128N, 128F, 128H, 129S, 132S, 138R, 157F, 175Q, 175S, 200S, 200Y, 200N, 200C, 222F, 223A, 223M, 223T, 223W, 223Y, 234I, 246I, 249S, 287A, 292T, 302A, 302K, 306K, 316R, 346K, 373N, 388C, 402A, 445N, 457I, and 462S. [0496] In other examples, the proteins described herein may comprise an MMLV reverse transcriptase comprising one or more substitutions at amino acid positions V19, A32, S60, P111, T128, G138R, C157F, D200, Y222, V223, T246, N249, T287, G316, E346, W388, and/or K445. In some embodiments, the proteins described herein comprise an MMLV reverse transcriptase comprising one or more substitutions selected from the group consisting of V19I, A32T, S60Y, P111L, T128N, G138R, C157F, D200S, D200Y, Y222F, V223A, T246I, N249S, T287A, G316R, E346K, W388C, and K445N. In certain embodiments, the proteins described herein comprise an MMLV reverse transcriptase comprising any one of the following groups of amino acid substitutions: T128N and V223A; V19I, A32T, and D200Y; D200S, V223A, E346K, and W388C; S60Y, V223A, and N249S; P111L, V223A, T287A, and G316R; S60Y, G138R, and V223A; S60Y, Y222F, V223A, and K445N; or S60Y, C157F, V223A, and T246I. [0497] Exemplary evolved reverse transcriptase enzymes are as follows:
[0498] The use of reverse transcriptase enzymes comprising an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% sequence identity to any of the evolved variants described herein in the prime editors disclosed herein is also contemplated by the present disclosure, provided the RT sequence comprises one of the amino acid substitutions disclosed herein. [0499] The disclosure also contemplates the use of any wild-type reverse transcriptase in the prime editors described herein. Exemplary wild-type reverse transcriptases which may be used include, but are not limited to, the following sequences, or any variant thereof having at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% sequence identity thereto:
[0500] The use of reverse transcriptase enzymes comprising an amino acid sequence having at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% sequence identity to any of the enzymes above in the prime editor proteins disclosed herein is also contemplated by the present disclosure. [0501] In some embodiments, the present disclosure provides reverse transcriptases, and prime editors (e.g. fusion proteins or prime editors in which each component is provided in trans) comprising reverse transcriptases, wherein the reverse transcriptase is an AVIRE reverse transcriptase of SEQ ID NO: 47, or an AVIRE reverse transcriptase variant having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity with SEQ ID NO: 47, wherein the AVIRE reverse transcriptase variant comprises one or more mutations selected from the group consisting of D199N, T305K, W312F, G329P, and L604W. In some embodiments, the AVIRE reverse transcriptase variant comprises two or more, three or more, four or more, or all five of these mutations. In some embodiments, the AVIRE reverse transcriptase variant comprises the mutation D199N. In some embodiments, the AVIRE reverse transcriptase variant comprises the mutation T305K. In some embodiments, the AVIRE reverse transcriptase variant comprises the mutation W312F. In some embodiments, the AVIRE reverse transcriptase variant comprises the mutation G329P. In some embodiments, the AVIRE reverse transcriptase variant comprises the mutation L604W. [0502] In certain embodiments, the AVIRE reverse transcriptase variant comprises the amino acid sequence of any one of SEQ ID NOs: 214-219, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 214-219, wherein the amino acid sequence comprises at least one of the residues 199N, 305K, 312F, 329P, and 604W: AVIRE-RT (D199N): AVIRE-RT (T305K): AVIRE-RT (W312F): AVIRE-RT (G329P): AVIRE-RT (L604W): [0503] In certain embodiments, the AVIRE reverse transcriptase variant comprises an amino acid sequence of SEQ ID NO: 219, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 219, wherein the amino acid sequence comprises the residues 199N, 305K, 312F, 329P, and 604W: AVIRE_penta: [0504] In some embodiments, the present disclosure provides reverse transcriptases, and prime editors (e.g. fusion proteins or prime editors in which each component is provided in trans) comprising reverse transcriptases, wherein the reverse transcriptase is a KORV reverse transcriptase of SEQ ID NO: 50, or a KORV reverse transcriptase variant having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity with SEQ ID NO: 50, wherein the KORV reverse transcriptase variant comprises one or more mutations selected from the group consisting of D197N, T303K, W310F, E327P, and L599W. In some embodiments, the KORV reverse transcriptase variant comprises two or more, three or more, four or more, or all five of these mutations. In some embodiments, the KORV reverse transcriptase variant comprises the mutation D197N. In some embodiments, the KORV reverse transcriptase variant comprises the mutation T303K. In some embodiments, the KORV reverse transcriptase variant comprises the mutation W310F. In some embodiments, the KORV reverse transcriptase variant comprises the mutation E327P. In some embodiments, the KORV reverse transcriptase variant comprises the mutation L599W. [0505] In certain embodiments, the KORV reverse transcriptase variant comprises the amino acid sequence of any one of SEQ ID NOs: 220-225, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 220-225, wherein the amino acid sequence comprises at least one of the residues 197N, 303K, 310F, 327P, and 599W: KORV-RT D197N: KORV-RT T303K: KORV-RT W310F: KORV-RT E327P: KORV-RT L599W: [0506] In certain embodiments, the KORV reverse transcriptase variant comprises an amino acid sequence of SEQ ID NO: 225, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 225, wherein the amino acid sequence comprises the residues 197N, 303K, 310F, 327P, and 599W: KORV_penta: [0507] In some embodiments, the present disclosure provides reverse transcriptases, and prime editors (e.g. fusion proteins or prime editors in which each component is provided in trans) comprising reverse transcriptases, wherein the reverse transcriptase is a WMSV reverse transcriptase of SEQ ID NO: 54, or a WMSV reverse transcriptase variant having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity with SEQ ID NO: 54, wherein the WMSV reverse transcriptase variant comprises one or more mutations selected from the group consisting of D197N, T303K, W311F, E327P, and L599W. In some embodiments, the WMSV reverse transcriptase variant comprises two or more, three or more, four or more, or all five of these mutations. In some embodiments, the WMSV reverse transcriptase variant comprises the mutation D197N. In some embodiments, the WMSV reverse transcriptase variant comprises the mutation T303K. In some embodiments, the WMSV reverse transcriptase variant comprises the mutation W311F. In some embodiments, the WMSV reverse transcriptase variant comprises the mutation E327P. In some embodiments, the WMSV reverse transcriptase variant comprises the mutation L599W. [0508] In certain embodiments, the WMSV reverse transcriptase variant comprises the amino acid sequence of any one of SEQ ID NOs: 226-231, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 226-231, wherein the amino acid sequence comprises at least one of the residues 197N, 303K, 311F, 327P, and 599W: WMSV-RT D197N: WMSV-RT T303K: WMSV-RT W311F: WMSV-RT E327P: WMSV-RT L599W: [0509] In certain embodiments, the WMSV reverse transcriptase variant comprises an amino acid sequence of SEQ ID NO: 231, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 231, wherein the amino acid sequence comprises the residues 197N, 303K, 311F, 327P, and 599W: WMSV_penta: ) [0510] In some embodiments, the reverse transcriptase comprises a PERV reverse transcriptase. For example, the prime editor proteins described herein may comprise a PERV reverse transcriptase comprising one or more mutations relative to the amino acid sequence of SEQ ID NO: 45. In some embodiments, the PERV reverse transcriptase comprises one or more mutations selected from the group consisting of D199N, T305K, W312F, E329P, and L602W relative to the amino acid sequence of SEQ ID NO: 45. In certain embodiments, the PERV reverse transcriptase comprises the mutations D199N, T305K, W312F, E329P, and L602W relative to the amino acid sequence of SEQ ID NO: 45. In some embodiments, the present disclosure provides reverse transcriptases, and prime editors (e.g. fusion proteins or prime editors in which each component is provided in trans) comprising reverse transcriptases, wherein the reverse transcriptase is a PERV reverse transcriptase of SEQ ID NO: 45, or a PERV reverse transcriptase variant having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity with SEQ ID NO: 45, wherein the PERV reverse transcriptase variant comprises one or more mutations selected from the group consisting of D199N, T305K, W312F, E329P, and L602W. In some embodiments, the PERV reverse transcriptase variant comprises two or more, three or more, four or more, or all five of these mutations. In some embodiments, the PERV reverse transcriptase variant comprises the mutation D199N. In some embodiments, the PERV reverse transcriptase variant comprises the mutation T305K. In some embodiments, the PERV reverse transcriptase variant comprises the mutation W312F. In some embodiments, the PERV reverse transcriptase variant comprises the mutation E329P. In some embodiments, the PERV reverse transcriptase variant comprises the mutation L602W. [0511] In certain embodiments, the PERV reverse transcriptase variant comprises the amino acid sequence of any one of SEQ ID NOs: 232-238, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 232-238, wherein the amino acid sequence comprises at least one of the residues 199N, 305K, 312F, 329P, and 602W: PERV variant 21: PERV-RT D199N: PERV-RT T305K: PERV-RT W313F: PERV-RT E329P: PERV-RT L602W: [0512] In certain embodiments, the PERV reverse transcriptase variant comprises an amino acid sequence of SEQ ID NO: 238, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 238, wherein the amino acid sequence comprises the residues 199N, 305K, 312F, 329P, and 602W: PERV variant 21.6 (pentamutant comprising D199N, T305K, W312F, E329P, and L602W substitutions): [0513] In some embodiments, the reverse transcriptase comprises a Tf1 reverse transcriptase. For example, the prime editor proteins described herein may comprise a Tf1 reverse transcriptase comprising one or more mutations relative to the amino acid sequence of SEQ ID NO: 55. In some embodiments, the Tf1 reverse transcriptase comprises one or more mutations selected from the group consisting of V14A, E22K, P70T, G72V, M102I, K106R, K118R, A139T, L158Q, F269L, S297Q, K356E, A363V, K413E, I423V, and S492N relative to the amino acid sequence of SEQ ID NO: 55. In certain embodiments, the Tf1 reverse transcriptase comprises any one of the following groups of amino acid substitutions relative to the amino acid sequence of SEQ ID NO: 55: K118R and S297Q; V14A, L158Q, F269L, and K356E; K106R, L158Q, F269L, A363V, and I423V; E22K, P70T, G72V, M102I, K106R, A139T, L158Q, F269L, A363V, K413E, and S492N; or P70T, G72V, M102I, K106R, L158Q, F269L, A363V, K413E, and S492N. [0514] In some embodiments, the present disclosure provides reverse transcriptases, and prime editors (e.g. fusion proteins or prime editors in which each component is provided in trans) comprising reverse transcriptases, wherein the reverse transcriptase is a Tf1 reverse transcriptase of SEQ ID NO: 171, or a Tf1 reverse transcriptase variant having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity with SEQ ID NO: 171, wherein the Tf1 reverse transcriptase variant comprises one or more mutations selected from the group consisting of V14A, E22K, I64L, I64W, P70T, G72V, M102I, K106R, K118R, L133N, A139T, L158Q, S188K, I260L, F269L, E274R, R288Q, Q293K, S297Q, N316Q, K321R, K356E, A363V, K413E, I423V, and S492N relative to SEQ ID NO: 171. In some embodiments, the Tf1 reverse transcriptase variant comprises a single mutation, wherein the single mutation is an I64L mutation, an I64W mutation, a K118R mutation, an L133N mutation, an S188K mutation, an I260L mutation, an E274R mutation, an R288Q mutation, a Q293K mutation, an S297Q mutation, an N316Q mutation, or a K321R mutation. [0515] In some embodiments, the Tf1 reverse transcriptase variant comprises any one of the following groups of mutations relative to the amino acid sequence of SEQ ID NO: 171: K118R and S297Q; V14A, L158Q, F269L, and K356E; E22K, P70T, G72V, M102I, K106R, A139T, L158Q, F269L, A363V, K413E, and S492N; P70T, G72V, M102I, K106R, L158Q, F269L, A363V, K413E, and S492N; K106R, L158Q, F269L, A363V, and I423V; K118R, S297Q, S188K, I64L, I260L, and R288Q; E22K, P70T, G72V, M102I, K106R, A139T, L158Q, F269L, A363V, K413E, S492N, K118R, S297Q, S188K, I64L, and I260L; K118R and S188K; K118R, S188K, and I260L; K118R, S188K, I260L, and S297Q; or K118R, S188K, I260L, R288K, and S297Q. [0516] In certain embodiments, the Tf1 reverse transcriptase variant comprises the amino acid sequence of any one of SEQ ID NOs: 196-213 and 241-245, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 196-213 and 241-245, wherein the amino acid sequence comprises at least one of residues 14A, 22K, 64L, 64W, 70T, 72V, 102I, 106R, 118R, 133N, 139T, 158Q, 188K, 260L, 269L, 274R, 288Q, 293K, 297Q, 316Q, 321R, 356E, 363V, 413E, 423V, 492N: [0517] In some embodiments, the reverse transcriptase comprises an Ec48 reverse transcriptase. For example, the prime editor proteins described herein may comprise an Ec48 reverse transcriptase comprising one or more mutations relative to the amino acid sequence of SEQ ID NO: 59. In some embodiments, the Ec48 reverse transcriptase comprises one or more mutations selected from the group consisting of A36V, E54K, K87E, R205K, V214L, D243N, R267I, S277F, E279K, N317S, K318E, H324Q, K326E, E328K, and R372K relative to the amino acid sequence of SEQ ID NO: 59. In certain embodiments, the Ec48 reverse transcriptase comprises any one of the following groups of amino acid substitutions relative to the amino acid sequence of SEQ ID NO: 59: R267I, K318E, K326E, E328K, and R372K; K87E, R205K, V214L, D243N, R267I, N317S, K318E, H324Q, and K326E; E54K, K87E, D243N, R267I, E279K, and K318E; A36V, K87E, R205K, D243N, R267I, E279K, and K318E; E54K, K87E, D243N, R267I, E279K, and K318E; or E54K, K87E, D243N, R267I, S277F, E279K, and K318E. [0518] In some embodiments, the present disclosure provides reverse transcriptases, and prime editors (e.g. fusion proteins or prime editors in which each component is provided in trans) comprising reverse transcriptases, wherein the reverse transcriptase is an Ec48 reverse transcriptase of SEQ ID NO: 59, or an Ec48 reverse transcriptase variant having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity with SEQ ID NO: 59, wherein the Ec48 reverse transcriptase variant comprises one or more mutations selected from the group consisting of A36V, E54K, E60K, K87E, S151T, E165D, L182N, T189N, R205K, V214L, D243N, R267I, S277F, E279K, V303M, K307R, R315K, N317S, K318E, H324Q, K326E, E328K, K343N, R372K, R378K, and T385R relative to SEQ ID NO: 59. In some embodiments, the Ec48 reverse transcriptase variant comprises a single mutation, wherein the single mutation is an L182N mutation, a T189N mutation, a K307R mutation, an R315K mutation, an R378K mutation, or a T385R mutation. [0519] In some embodiments, the Ec48 reverse transcriptase variant comprises any one of the following groups of mutations relative to the amino acid sequence of SEQ ID NO: 59: R267I, K318E, K326E, E328K, and R372K; K87E, R205K, V214L, D243N, R267I, N317S, K318E, H324Q, and K326E; E54K, K87E, D243N, R267I, E279K, and K318E; A36V, K87E, R205K, D243N, R267I, E279K, and K318E; E54K, K87E, D243N, R267I, E279K, and K318E; E54K, K87E, D243N, R267I, S277F, E279K, and K318E; E60K, K87E, E165D, D243N, R267I, E279K, K318E, and K343N; E60K, K87E, S151T, E165D, D243N, R267I, E279K, V303M, K318E, and K343N; or R315K, L182N, and T189N. [0520] In certain embodiments, the Ec48 reverse transcriptase variant comprises the amino acid sequence of any one of SEQ ID NOs: 188-195 or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 188-195, wherein the amino acid sequence comprises at least one of residues 36V, 54K, 60K, 87E, 151T, 165D, 182N, 189N, 205K, 214L, 243N, 267I, 277F, 279K, 303M, 307R, 315K, 317S, 318E, 324Q, 326E, 328K, 343N, 372K, 378K, and 385R: [0521] In some embodiments, the present disclosure provides reverse transcriptases, and prime editors (e.g. fusion proteins or prime editors in which each component is provided in trans) comprising reverse transcriptases, wherein the reverse transcriptase is an Ne144 reverse transcriptase of SEQ ID NO: 64, or an Ne144 reverse transcriptase variant having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity with SEQ ID NO: 64, wherein the Ne144 reverse transcriptase variant comprises one or more mutations selected from the group consisting of A157T, A165T, and G288V relative to SEQ ID NO: 64. In some embodiments, the Ne144 reverse transcriptase variant comprises the mutations A157T, A165T, and G288V. [0522] In certain embodiments, the Ne144 reverse transcriptase variant comprises the amino acid sequence of SEQ ID NO: 239, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 239, wherein the amino acid sequence comprises at least one of residues 157T, 165T, and 288V: [0523] In some embodiments, the present disclosure provides reverse transcriptases, and prime editors (e.g. fusion proteins or prime editors in which each component is provided in trans) comprising reverse transcriptases, wherein the reverse transcriptase is a Vc95 reverse transcriptase of SEQ ID NO: 58, or a Vc95 reverse transcriptase variant having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity with SEQ ID NO: 58, wherein the Vc95 reverse transcriptase variant comprises one or more mutations selected from the group consisting of L11M, S75A, V97M, N146D, and N245T relative to SEQ ID NO: 58. In some embodiments, the Vc95 reverse transcriptase variant comprises the mutations L11M, S75A, V97M, N146D, and N245T. [0524] In certain embodiments, the Vc95 reverse transcriptase variant comprises the amino acid sequence of SEQ ID NO: 240, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to SEQ ID NO: 240, wherein the amino acid sequence comprises at least one of residues 11M, 75A, 97M, 146D, and 245T: [0525] In some embodiments, the present disclosure provides reverse transcriptases, and prime editors (e.g. fusion proteins or prime editors in which each component is provided in trans) comprising reverse transcriptases, wherein the reverse transcriptase is a Gs reverse transcriptase of SEQ ID NO: 60, or a Gs reverse transcriptase variant having at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% sequence identity with SEQ ID NO: 60, wherein the Gs reverse transcriptase variant comprises one or more mutations selected from the group consisting of N12D, A16E, A16V, L17P, V20G, L37R, L37P, R38H, Y40C, I41N, I41S, W45R, I67T, I67R, G72E, G73V, G78V, Q93R, A123V, Y126F, E129G, K162N, P190L, D206V, R233K, A234V, R263G, P264S, R267M, K279E, R287I, R291K, P309T, R344S, R358S, R360S, E363G, V374A, and Q412H relative to SEQ ID NO: 60. In some embodiments, the Gs reverse transcriptase variant comprises two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, or ten or more of these mutations. [0526] In some embodiments, the Gs reverse transcriptase variant comprises any one of the following groups of mutations relative to the amino acid sequence of SEQ ID NO: 60: L17P and D206V; N12D, L37R, and G78V; A16E, L37P, and A123V; A16V, R38H, W45R, Y126F, and Q412H; A16V, R38H, W45R, and R291K; N12D, L37R, G72E, E129G, P264S, R344S, and R360S; N12D, Y40C, I67T, G73V, Q93R, R287I, and R358S; N12D, Y40C, I67T, G73V, Q93R, and R358S; N12D, I41N, P190L, A234V, and K279E; N12D, L37R, R267M, P309T, R358S, and E363G; A16V, V20G, I41S, R233K, and P264S; L17P, V20G, I41S, I67R, R263G, P264S, and V374A; or L17P, V20G, I41S, I67R, K162N, R263G, and P264S. [0527] In certain embodiments, the Gs reverse transcriptase variant comprises the amino acid sequence of any one of SEQ ID NOs: 159-171, or an amino acid sequence at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identical to any one of SEQ ID NOs: 159-171, wherein the amino acid sequence comprises at least one of residues 12D, 16E, 16V, 17P, 20G, 37R, 37P, 38H, 40C, 41N, 41S, 45R, 67T, 67R, 72E, 73V, 78V, 93R, 123V, 126F, 129G, 162N, 190L, 206V, 233K, 234V, 263G, 264S, 267M, 279E, 287I, 291K, 309T, 344S, 358S, 360S, 363G, 374A, and 412H: Nuclear localization sequences (NLS) [0528] In various embodiments, the prime editors used in the complexes and methods described herein may comprise one or more nuclear localization sequences (NLS), which help promote translocation of a protein into the cell nucleus. Such sequences are well-known in the art and can include the following examples: [0529] The NLS examples above are non-limiting. The prime editors used in the presently described complexes and methods may comprise any known NLS sequence, including any of those described in Cokol et al., “Finding nuclear localization signals,” EMBO Rep., 2000, 1(5): 411-415 and Freitas et al., “Mechanisms and Signals for the Nuclear Import of Proteins,” Current Genomics, 2009, 10(8): 550-7, each of which are incorporated herein by reference. [0530] In various embodiments, the fusion proteins used in the complexes and methods described herein further comprise one or more (and preferably at least two) nuclear localization sequences. In certain embodiments, the fusion proteins comprise at least two NLSs. In embodiments with at least two NLSs, the NLSs can be the same NLSs or they can be different NLSs. In some embodiments, one or more of the NLSs are bipartite NLSs (“bpNLS”). In certain embodiments, the disclosed fusion proteins comprise two bipartite NLSs. In some embodiments, the disclosed fusion proteins comprise more than two bipartite NLSs. [0531] The location of the NLS fusion can be at the N-terminus, the C-terminus, or within a sequence of a fusion protein (e.g., inserted between the encoded napDNAbp component (e.g., Cas9) and a polymerase domain (e.g., a reverse transcriptase). [0532] The NLSs may be any known NLS sequence in the art. The NLSs may also be any future-discovered NLSs for nuclear localization. The NLSs also may be any naturally- occurring NLS, or any non-naturally occurring NLS (e.g., an NLS with one or more desired mutations). [0533] The term “nuclear localization sequence” or “NLS” refers to an amino acid sequence that promotes import of a protein into the cell nucleus, for example, by nuclear transport. Nuclear localization sequences are known in the art and would be apparent to the skilled artisan. For example, NLS sequences are described in Plank et al., International PCT application PCT/EP2000/011690, filed November 23, 2000, published as WO/2001/038547 on May 31, 2001, the contents of which are incorporated herein by reference. In some embodiments, an NLS comprises the amino acid sequence PKKKRKV (SEQ ID NO: 94), MDSLLMNRRKFLYQFKNVRWAKGRRETYLC (SEQ ID NO: 99), KRTADGSEFESPKKKRKV (SEQ ID NO: 97), or KRTADGSEFEPKKKRKV (SEQ ID NO: 106). In other embodiments, an NLS comprises the amino acid sequences NLSKRPAAIKKAGQAKKKK (SEQ ID NO: 107), PAAKRVKLD (SEQ ID NO: 98), RQRRNELKRSF (SEQ ID NO: 108), or NQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGGY (SEQ ID NO: 109). [0534] In one aspect of the disclosure, a prime editor or other fusion protein may be modified with one or more nuclear localization sequences (NLS), preferably at least two NLSs. In certain embodiments, the fusion proteins are modified with two or more NLSs. The disclosure contemplates the use of any nuclear localization sequence known in the art at the time of the disclosure, or any nuclear localization sequence that is identified or otherwise made available in the state of the art after the time of the instant filing. A representative nuclear localization sequence is a peptide sequence that directs the protein to the nucleus of the cell in which the sequence is expressed. A nuclear localization signal is predominantly basic, can be positioned almost anywhere in a protein's amino acid sequence, generally comprises a short sequence of four amino acids (Autieri & Agrawal, (1998) J. Biol. Chem. 273: 14731-37, incorporated herein by reference) to eight amino acids, and is typically rich in lysine and arginine residues (Magin et al., (2000) Virology 274: 11-16, incorporated herein by reference). Nuclear localization sequences often comprise proline residues. A variety of nuclear localization sequences have been identified and have been used to effect transport of biological molecules from the cytoplasm to the nucleus of a cell. See, e.g., Tinland et al., (1992) Proc. Natl. Acad. Sci. U.S.A.89:7442-46; Moede et al., (1999) FEBS Lett.461:229- 34, which is incorporated herein by reference. Translocation is currently thought to involve nuclear pore proteins. [0535] Most NLSs can be classified in three general groups: (i) a monopartite NLS exemplified by the SV40 large T antigen NLS (PKKKRKV (SEQ ID NO: 94)); (ii) a bipartite motif consisting of two basic domains separated by a variable number of spacer amino acids and exemplified by the Xenopus nucleoplasmin NLS (KRXXXXXXXXXXKKKL (SEQ ID NO: 110)); and (iii) noncanonical sequences such as M9 of the hnRNP Al protein, the influenza virus nucleoprotein NLS, and the yeast Gal4 protein NLS (Dingwall and Laskey 1991). [0536] Nuclear localization sequences appear at various points in the amino acid sequences of proteins. NLS have been identified at the N-terminus, the C-terminus, and in the central region of proteins. Thus, the disclosure provides fusion proteins that may be modified with one or more NLSs at the C-terminus and/or the N-terminus, as well as at internal regions of the fusion protein. The residues of a longer sequence that do not function as component NLS residues should be selected so as not to interfere, for example, tonically or sterically, with the nuclear localization signal itself. Therefore, although there are no strict limits on the composition of an NLS-comprising sequence, in practice, such a sequence can be functionally limited in length and composition. [0537] The present disclosure contemplates any suitable means by which to modify a fusion protein to include one or more NLSs. In one aspect, the fusion proteins may be engineered to express a fusion protein that is translationally fused at its N-terminus or its C-terminus (or both) to one or more NLSs, i.e., to form a prime editor-NLS fusion construct. In other embodiments, a fusion protein-encoding nucleotide sequence may be genetically modified to incorporate a reading frame that encodes one or more NLSs in an internal region of the encoded prime editor. In addition, the NLSs may include various amino acid linkers or spacer regions encoded between the prime editor and the N-terminally, C-terminally, or internally- attached NLS amino acid sequence, e.g., and in the central region of proteins. Thus, the present disclosure also provides for nucleotide constructs, vectors, and host cells for expressing fusion proteins that comprise a prime editor and one or more NLSs, among other components. [0538] The prime editors described herein may also comprise nuclear localization sequences that are linked to a prime editor through one or more linkers, e.g., a polymeric, amino acid, nucleic acid, polysaccharide, chemical, or nucleic acid linker element. The linkers within the contemplated scope of the disclosure are not intended to have any limitations and can be any suitable type of molecule (e.g., polymer, amino acid, polysaccharide, nucleic acid, lipid, or any synthetic chemical linker domain) and can be joined to the prime editor by any suitable strategy that effectuates forming a bond (e.g., covalent linkage, hydrogen bonding) between the prime editor and the one or more NLSs. Linkers [0539] The prime editors used in the complexes and methods described herein may include one or more linkers. As defined above, the term “linker,” as used herein, refers to a chemical group or a molecule linking two molecules or moieties, e.g., a binding domain and a cleavage domain of a nuclease. In some embodiments, a linker joins a gRNA binding domain of an RNA-programmable nuclease and a polymerase (e.g., a reverse transcriptase). In some embodiments, a linker joins a Cas9 nickase and a reverse transcriptase. Typically, the linker is positioned between, or flanked by, two groups, molecules, or other moieties and connected to each one via a covalent bond, thus connecting the two. In some embodiments, the linker is an amino acid or a plurality of amino acids (e.g., a peptide or protein). In some embodiments, the linker is an organic molecule, group, polymer, or chemical moiety. In some embodiments, the linker is 5-100 amino acids in length, for example, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 30-35, 35-40, 40-45, 45-50, 50-60, 60- 70, 70-80, 80-90, 90-100, 100-150, or 150-200 amino acids in length. Longer or shorter linkers are also contemplated. [0540] The linker may be as simple as a covalent bond, or it may be a polymeric linker many atoms in length. In certain embodiments, the linker is a polypeptide, or amino acid-based. In other embodiments, the linker is not peptide-like. In certain embodiments, the linker is a covalent bond (e.g., a carbon-carbon bond, disulfide bond, carbon-heteroatom bond, etc.). In certain embodiments, the linker is a carbon-nitrogen bond of an amide linkage. In certain embodiments, the linker is a cyclic or acyclic, substituted or unsubstituted, branched or unbranched, aliphatic or heteroaliphatic linker. In certain embodiments, the linker is polymeric (e.g., polyethylene, polyethylene glycol, polyamide, polyester, etc.). In certain embodiments, the linker comprises a monomer, dimer, or polymer of aminoalkanoic acid. In certain embodiments, the linker comprises an aminoalkanoic acid (e.g., glycine, ethanoic acid, alanine, beta-alanine, 3-aminopropanoic acid, 4-aminobutanoic acid, 5-pentanoic acid, etc.). In certain embodiments, the linker comprises a monomer, dimer, or polymer of aminohexanoic acid (Ahx). In certain embodiments, the linker is based on a carbocyclic moiety (e.g., cyclopentane, cyclohexane). In other embodiments, the linker comprises a polyethylene glycol moiety (PEG). In other embodiments, the linker comprises amino acids. In certain embodiments, the linker comprises a peptide. In certain embodiments, the linker comprises an aryl or heteroaryl moiety. In certain embodiments, the linker is based on a phenyl ring. The linker may include functionalized moieties to facilitate attachment of a nucleophile (e.g., thiol, amino) from the peptide to the linker. Any electrophile may be used as part of the linker. Exemplary electrophiles include, but are not limited to, activated esters, activated amides, Michael acceptors, alkyl halides, aryl halides, acyl halides, and isothiocyanates. [0541] In some other embodiments, the linker comprises the amino acid sequence (GGGGS)n (SEQ ID NO: 84), (G)n (SEQ ID NO: 85), (EAAAK)n (SEQ ID NO: 86), (GGS)n (SEQ ID NO: 87), (SGGS) n (SEQ ID NO: 81), (XP) n (SEQ ID NO: 88), or any combination thereof, wherein n is independently an integer between 1 and 30, and wherein X is any amino acid. In some embodiments, the linker comprises the amino acid sequence (GGS)n (SEQ ID NO: 87), wherein n is 1, 3, or 7. In some embodiments, the linker comprises the amino acid sequence SGSETPGTSESATPES (SEQ ID NO: 89). In some embodiments, the linker comprises the amino acid sequence SGGSSGGSSGSETPGTSESATPESSGGSSGGS (SEQ ID NO: 90). In some embodiments, the linker comprises the amino acid sequence SGGSGGSGGS (SEQ ID NO: 91). In some embodiments, the linker comprises the amino acid sequence SGGS (SEQ ID NO: 82). In other embodiments, the linker comprises the amino acid sequence SGGSSGGSSGSETPGTSESATPESAGSYPYDVPDYAGSAAPAAKKKKLDGSGSGGSS GGS (SEQ ID NO: 83, 60AA). In some embodiments, the linker comprises the amino acid sequence GGS, GGSGGS (SEQ ID NO: 92), GGSGGSGGS (SEQ ID NO: 93), SGGSSGGSSGSETPGTSESATPESSGGSSGGSS (SEQ ID NO: 80), SGSETPGTSESATPES (SEQ ID NO: 89), or SGGSSGGSSGSETPGTSESATPESAGSYPYDVPDYAGSAAPAAKKKKLDGSGSGGSS GGS (SEQ ID NO: 83). [0542] In certain embodiments, linkers may be used to link any of the peptides or peptide domains or moieties of the invention (e.g., a napDNAbp linked or fused to a reverse transcriptase domain, and/or a napDNAbp linked to one or more NLS). Any of the domains of the fusion proteins used in the complexes and methods described herein may also be connected to one another through any of the presently described linkers. Additional prime editor domains A. Flap endonucleases (e.g., FEN1) [0543] In various embodiments, the prime editors described herein may comprise one or more flap endonucleases (e.g., FEN1), which refers to an enzyme that catalyzes the removal of 5ʹ single stranded DNA flaps (provided in trans or fused to the PE fusion proteins). These are naturally occurring enzymes that process the removal of 5ʹ flaps formed during cellular processes, including DNA replication. The prime editors described herein may utilize endogenously supplied flap endonucleases or those provided in trans to remove the 5ʹ flap of endogenous DNA formed at the target site during prime editing. Flap endonucleases are known in the art and are described, for example, in Patel et al., “Flap endonucleases pass 5ʹ- flaps through a flexible arch using a disorder-thread-order mechanism to confer specificity for free 5ʹ-ends,” Nucleic Acids Research, 2012, 40(10): 4507-4519 and Tsutakawa et al., “Human flap endonuclease structures, DNA double-base flipping, and a unified understanding of the FEN1 superfamily,” Cell, 2011, 145(2): 198-211 (each of which are incorporated herein by reference). An exemplary flap endonuclease is FEN1, which can be represented by the following amino acid sequence:
[0544] The flap endonucleases may also include any FEN1 variant, mutant, or other flap endonuclease ortholog, homolog, or variant. Non-limiting FEN1 variant examples are as follows:
[0545] In various embodiments, the prime editors utilized in the complexes and methods contemplated herein may include any flap endonuclease variant of the above-disclosed sequences having an amino acid sequence that is at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 96% identical, at least about 97% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% identical to any of the above sequences. Other endonucleases that may be utilized by the instant compositions and methods to facilitate removal of the 5′ end single strand DNA flap include, but are not limited to (1) trex 2, (2) exo1 endonuclease (e.g., Keijzers et al., Biosci Rep.2015, 35(3): e00206) Trex 2 [0546] Three prime (3´) repair exonuclease 2 (TREX2) – human Accession No. NM_080701 [0547] Three prime (3′) repair exonuclease 2 (TREX2) – mouse Accession No. NM_011907 [0548] Three prime (3′) repair exonuclease 2 (TREX2) – rat Accession No. NM_001107580 ( Q ) ExoI [0549] Human exonuclease 1 (EXO1) has been implicated in many different DNA metabolic processes, including DNA mismatch repair (MMR), micro-mediated end-joining, homologous recombination (HR), and replication. Human EXO1 belongs to a family of eukaryotic nucleases, Rad2/XPG, which also include FEN1 and GEN1. The Rad2/XPG family is conserved in the nuclease domain through species from phage to human. The EXO1 gene product exhibits both 5′ exonuclease and 5′ flap activity. Additionally, EXO1 contains an intrinsic 5′ RNase H activity. Human EXO1 has a high affinity for processing double stranded DNA (dsDNA), nicks, gaps, and pseudo Y structures and can resolve Holliday junctions using its inherit flap activity. Human EXO1 is implicated in MMR and contains conserved binding domains interacting directly with MLH1 and MSH2. EXO1 nucleolytic activity is positively stimulated by PCNA, MutSα (MSH2/MSH6 complex), 14-3-3, MRN, and 9-1-1 complex. [0550] Exonuclease 1 (EXO1) Accession No. NM_003686 (Homo sapiens exonuclease 1 (EXO1), transcript variant 3) – isoform A ) [0551] Exonuclease 1 (EXO1) Accession No. NM_006027 (Homo sapiens exonuclease 1 (EXO1), transcript variant 3) – isoform B B. Inteins and split-inteins [0553] It will be understood that in some embodiments (e.g., delivery of a prime editor in vivo), it may be advantageous to split a polypeptide (e.g., a reverse transcriptase or a napDNAbp) or a fusion protein (e.g., a prime editor) into an N-terminal half and a C-terminal half, deliver them separately, and then allow their colocalization to reform the complete protein (or fusion protein as the case may be) within the cell. Separate halves of a protein or a fusion protein may each comprise a split-intein tag to facilitate the reformation of the complete protein or fusion protein by the mechanism of protein trans splicing. [0554] Protein trans-splicing, catalyzed by split inteins, provides an entirely enzymatic method for protein ligation. A split-intein is essentially a contiguous intein (e.g., a mini- intein) split into two pieces named N-intein and C-intein, respectively. The N-intein and C- intein of a split intein can associate non-covalently to form an active intein and catalyze the splicing reaction in essentially the same way as a contiguous intein does. Split inteins have been found in nature and have also been engineered in laboratories. As used herein, the term “split intein” refers to any intein in which one or more peptide bond breaks exists between the N-terminal and C-terminal amino acid sequences such that the N-terminal and C-terminal sequences become separate molecules that can non-covalently reassociate, or reconstitute, into an intein that is functional for trans-splicing reactions. Any catalytically active intein, or fragment thereof, may be used to derive a split intein for use in the methods of the invention. For example, in one aspect, the split intein may be derived from a eukaryotic intein. In another aspect, the split intein may be derived from a bacterial intein. In another aspect, the split intein may be derived from an archaeal intein. Preferably, the split intein so-derived will possess only the amino acid sequences essential for catalyzing trans-splicing reactions. [0555] As used herein, the “N-terminal split intein (In)” refers to any intein sequence that comprises an N-terminal amino acid sequence that is functional for trans-splicing reactions. An In thus also comprises a sequence that is spliced out when trans-splicing occurs. An In can comprise a sequence that is a modification of the N-terminal portion of a naturally occurring intein sequence. For example, an In can comprise additional amino acid residues and/or mutated residues, as long as the inclusion of such additional and/or mutated residues does not render the In non-functional in trans-splicing. Preferably, the inclusion of the additional and/or mutated residues improves or enhances the trans-splicing activity of the In. [0556] As used herein, the “C-terminal split intein (Ic)” refers to any intein sequence that comprises a C-terminal amino acid sequence that is functional for trans-splicing reactions. In one aspect, the Ic comprises 4 to 7 contiguous amino acid residues, at least 4 amino acids of which are from the last β-strand of the intein from which it was derived. An Ic thus also comprises a sequence that is spliced out when trans-splicing occurs. An Ic can comprise a sequence that is a modification of the C-terminal portion of a naturally occurring intein sequence. For example, an Ic can comprise additional amino acid residues and/or mutated residues, as long as the inclusion of such additional and/or mutated residues does not render the In non-functional in trans-splicing. Preferably, the inclusion of the additional and/or mutated residues improves or enhances the trans-splicing activity of the Ic. [0557] In some embodiments of the invention, a peptide linked to an Ic or an In can comprise an additional chemical moiety including, among others, fluorescence groups, biotin, polyethylene glycol (PEG), amino acid analogs, unnatural amino acids, phosphate groups, glycosyl groups, radioisotope labels, and pharmaceutical molecules. In other embodiments, a peptide linked to an Ic can comprise one or more chemically reactive groups including, among others, ketones, aldehydes, Cys residues, and Lys residues. The N-intein and C-intein of a split intein can associate non-covalently to form an active intein and catalyze the splicing reaction when an "intein-splicing polypeptide (ISP)" is present. As used herein, "intein- splicing polypeptide (ISP)" refers to the portion of the amino acid sequence of a split intein that remains when the Ic, In, or both, are removed from the split intein. In certain embodiments, the In comprises the ISP. In another embodiment, the Ic comprises the ISP. In yet another embodiment, the ISP is a separate peptide that is not covalently linked to In nor to Ic. [0558] Split inteins may be created from contiguous inteins by engineering one or more split sites in the unstructured loop or intervening amino acid sequence between the -12 conserved beta-strands found in the structure of mini-inteins. Some flexibility in the position of the split site within regions between the beta-strands may exist, provided that creation of the split will not disrupt the structure of the intein, the structured beta-strands in particular, to a sufficient degree that protein splicing activity is lost. [0559] In protein trans-splicing, one precursor protein consists of an N-extein part followed by the N-intein, another precursor protein consists of the C-intein followed by a C-extein part, and a trans-splicing reaction (catalyzed by the N- and C-inteins together) excises the two intein sequences and links the two extein sequences with a peptide bond. Protein trans- splicing, being an enzymatic reaction, can work with very low (e.g., micromolar) concentrations of proteins and can be carried out under physiological conditions. [0560] Exemplary sequences are as follows:
[0561] Although inteins are most frequently found as a contiguous domain, some exist in a naturally split form. In this case, the two fragments are expressed as separate polypeptides and must associate before splicing takes place, so-called protein trans-splicing. [0562] An exemplary split intein is the Ssp DnaE intein, which comprises two subunits, namely, DnaE-N and DnaE-C. The two different subunits are encoded by separate genes, namely dnaE-n and dnaE-c, which encode the DnaE-N and DnaE-C subunits, respectively. DnaE is a naturally occurring split intein in Synechocytis sp. PCC6803 and is capable of directing trans-splicing of two separate proteins, each comprising a fusion with either DnaE- N or DnaE-C. [0563] Additional naturally occurring or engineered split-intein sequences are known in the art or can be made from whole-intein sequences described herein or those available in the art. Examples of split-intein sequences can be found in Stevens et al., “A promiscuous split intein with expanded protein engineering applications,” PNAS, 2017, Vol.114: 8538-8543; Iwai et al., “Highly efficient protein trans-splicing by a naturally split DnaE intein from Nostc punctiforme, FEBS Lett, 580: 1853-1858, each of which are incorporated herein by reference. Additional split intein sequences can be found, for example, in WO 2013/045632, WO 2014/055782, WO 2016/069774, and EP2877490, the contents of each of which are incorporated herein by reference. [0564] In addition, protein splicing in trans has been described in vivo and in vitro (Shingledecker, et al., Gene 207:187 (1998), Southworth, et al., EMBO J.17:918 (1998); Mills, et al., Proc. Natl. Acad. Sci. USA, 95:3543-3548 (1998); Lew, et al., J. Biol. Chem., 273:15887-15890 (1998); Wu, et al., Biochim. Biophys. Acta 35732:1 (1998b), Yamazaki, et al., J. Am. Chem. Soc.120:5591 (1998), Evans, et al., J. Biol. Chem.275:9091 (2000); Otomo, et al., Biochemistry 38:16040-16044 (1999); Otomo, et al., J. Biolmol. NMR 14:105- 114 (1999); Scott, et al., Proc. Natl. Acad. Sci. USA 96:13638-13643 (1999)) and provides the opportunity to express a protein as two inactive fragments that subsequently undergo ligation to form a functional product. RNA-protein interaction domain [0565] In various embodiments, two separate protein domains (e.g., a Cas9 domain and a polymerase domain) may be colocalized to one another to form a functional complex (akin to the function of a fusion protein comprising the two separate protein domains) by using an “RNA-protein recruitment system,” such as the “MS2 tagging technique.” Such systems generally tag one protein domain with an “RNA-protein interaction domain” (a.k.a. “RNA- protein recruitment domain”) and the other with an “RNA-binding protein” that specifically recognizes and binds to the RNA-protein interaction domain, e.g., a specific hairpin structure. These types of systems can be leveraged to colocalize the domains of a prime editor, as well as to recruit additional functionalities to a prime editor, such as a UGI domain. In one example, the MS2 tagging technique is based on the natural interaction of the MS2 bacteriophage coat protein (“MCP” or “MS2cp”) with a stem-loop or hairpin structure present in the genome of the phage, i.e., the “MS2 hairpin.” In the case of the MS2 hairpin, it is recognized and bound by the MS2 bacteriophage coat protein (MCP). Thus, in one exemplary scenario, a reverse transcriptase-MS2 fusion can recruit a Cas9-MCP fusion. [0566] A review of other modular RNA-protein interaction domains are described in the art, for example, in Johansson et al., “RNA recognition by the MS2 phage coat protein,” Sem Virol., 1997, Vol.8(3): 176-185; Delebecque et al., “Organization of intracellular reactions with rationally designed RNA assemblies,” Science, 2011, Vol.333: 470-474; Mali et al., “Cas9 transcriptional activators for target specificity screening and paired nickases for cooperative genome engineering,” Nat. Biotechnol., 2013, Vol.31: 833-838; and Zalatan et al., “Engineering complex synthetic transcriptional programs with CRISPR RNA scaffolds,” Cell, 2015, Vol.160: 339-350, each of which are incorporated herein by reference in their entireties. Other systems include the PP7 hairpin, which specifically recruits the PCP protein, and the “com” hairpin, which specifically recruits the Com protein. See Zalatan et al. [0567] The nucleotide sequence of the MS2 hairpin (or equivalently referred to as the “MS2 aptamer”) is: GCCAACATGAGGATCACCCATGTCTGCAGGGCC (SEQ ID NO: 144). [0568] The amino acid sequence of the MCP or MS2cp is: GSASNFTQFVLVDNGGTGDVTVAPSNFANGVAEWISSNSRSQAYKVTCSVRQSSAQ NRKYTIKVEVPKVATQTVGGEELPVAGWRSYLNMELTIPIFATNSDCELIVKAMQGL LKDGNPIPSAIAANSGIY (SEQ ID NO: 145). C. Other PE elements [0569] In certain embodiments, the prime editors utilized in the methods and complexes described herein may comprise an inhibitor of base repair. The term “inhibitor of base repair” or “IBR” refers to a protein that is capable in inhibiting the activity of a nucleic acid repair enzyme, for example, a base excision repair enzyme. In some embodiments, the IBR is an inhibitor of OGG base excision repair. In some embodiments, the IBR is an inhibitor of base excision repair (“iBER”). Exemplary inhibitors of base excision repair include inhibitors of APE1, Endo III, Endo IV, Endo V, Endo VIII, Fpg, hOGG1, hNEIL1, T7 EndoI, T4PDG, UDG, hSMUG1, and hAAG. In some embodiments, the IBR is an inhibitor of Endo V or hAAG. In some embodiments, the IBR is an iBER that may be a catalytically inactive glycosylase or catalytically inactive dioxygenase or a small molecule or peptide inhibitor of an oxidase, or variants thereof. In some embodiments, the IBR is an iBER that may be a TDG inhibitor, an MBD4 inhibitor, or an inhibitor of an AlkBH enzyme. In some embodiments, the IBR is an iBER that comprises a catalytically inactive TDG or catalytically inactive MBD4. An exemplary catalytically inactive TDG is an N140A mutant of SEQ ID NO: 136 (human TDG). [0570] Some exemplary glycosylases are provided below. The catalytically inactivated variants of any of these glycosylase domains are iBERs that may be fused to the napDNAbp or polymerase domain of the prime editors utilized in the methods and compositions provided in this disclosure. [0571] OGG (human) MPARALLPRRMGHRTLASTPALWASIPCPRSELRLDLVLPSGQSFRWREQSPAHWSG VLADQVWTLTQTEEQLHCTVYRGDKSQASRPTPDELEAVRKYFQLDVTLAQLYHH WGSVDSHFQEVAQKFQGVRLLRQDPIECLFSFICSSNNNIARITGMVERLCQAFGPRL IQLDDVTYHGFPSLQALAGPEVEAHLRKLGLGYRARYVSASARAILEEQGGLAWLQ QLRESSYEEAHKALCILPGVGTKVADCICLMALDKPQAVPVDVHMWHIAQRDYSW HPTTSQAKGPSPQTNKELGNFFRSLWGPYAGWAQAVLFSADLRQSRHAQEPPAKRR KGSKGPEG (SEQ ID NO: 133) [0572] MPG (human) MVTPALQMKKPKQFCRRMGQKKQRPARAGQPHSSSDAAQAPAEQPHSSSDAAQAP CPRERCLGPPTTPGPYRSIYFSSPKGHLTRLGLEFFDQPAVPLARAFLGQVLVRRLPN GTELRGRIVETEAYLGPEDEAAHSRGGRQTPRNRGMFMKPGTLYVYIIYGMYFCMNI SSQGDGACVLLRALEPLEGLETMRQLRSTLRKGTASRVLKDRELCSGPSKLCQALAI NKSFDQRDLAQDEAVWLERGPLEPSEPAVVAAARVGVGHAGEWARKPLRFYVRGS PWVSVVDRVAEQDTQA (SEQ ID NO: 134) [0573] MBD4 (human) MGTTGLESLSLGDRGAAPTVTSSERLVPDPPNDLRKEDVAMELERVGEDEEQMMIK RSSECNPLLQEPIASAQFGATAGTECRKSVPCGWERVVKQRLFGKTAGRFDVYFISP QGLKFRSKSSLANYLHKNGETSLKPEDFDFTVLSKRGIKSRYKDCSMAALTSHLQNQ SNNSNWNLRTRSKCKKDVFMPPSSSSELQESRGLSNFTSTHLLLKEDEGVDDVNFRK VRKPKGKVTILKGIPIKKTKKGCRKSCSGFVQSDSKRESVCNKADAESEPVAQKSQL DRTVCISDAGACGETLSVTSEENSLVKKKERSLSSGSNFCSEQKTSGIINKFCSAKDSE HNEKYEDTFLESEEIGTKVEVVERKEHLHTDILKRGSEMDNNCSPTRKDFTGEKIFQE DTIPRTQIERRKTSLYFSSKYNKEALSPPRRKAFKKWTPPRSPFNLVQETLFHDPWKL LIATIFLNRTSGKMAIPVLWKFLEKYPSAEVARTADWRDVSELLKPLGLYDLRAKTI VKFSDEYLTKQWKYPIELHGIGKYGNDSYRIFCVNEWKQVHPEDHKLNKYHDWLW ENHEKLSLS (SEQ ID NO: 135) [0574] TDG (human) MEAENAGSYSLQQAQAFYTFPFQQLMAEAPNMAVVNEQQMPEEVPAPAPAQEPVQ EAPKGRKRKPRTTEPKQPVEPKKPVESKKSGKSAKSKEKQEKITDTFKVKRKVDRFN GVSEAELLTKTLPDILTFNLDIVIIGINPGLMAAYKGHHYPGPGNHFWKCLFMSGLSE VQLNHMDDHTLPGKYGIGFTNMVERTTPGSKDLSSKEFREGGRILVQKLQKYQPRIA VFNGKCIYEIFSKEVFGVKVKNLEFGLQPHKIPDTETLCYVMPSSSARCAQFPRAQDK VHYYIKLKDLRDQLKGIERNMDVQEVQYTFDLQLAQEDAKKMAVKEEKYDPGYEA AYGGAYGENPCSSEPCGFSSNGLIESVELRGESAFSGIPNGQWMTQSFTDQIPSFSNH CGTQEQEEESHA (SEQ ID NO: 136) [0575] In some embodiments, the fusion proteins described herein may comprise one or more heterologous protein domains (e.g., about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more domains in addition to the prime editor components). A fusion protein may comprise any additional protein sequence, and optionally a linker sequence between any two domains. Other exemplary features that may be present are localization sequences, such as cytoplasmic localization sequences, export sequences, such as nuclear export sequences, or other localization sequences, as well as sequence tags that are useful for solubilization, purification, or detection of the fusion proteins. [0576] Examples of protein domains that may be fused to a prime editor or component thereof (e.g., the napDNAbp domain, the polymerase domain, or the NLS domain) include, without limitation, epitope tags and reporter gene sequences. Non-limiting examples of epitope tags include histidine (His) tags, V5 tags, FLAG tags, influenza hemagglutinin (HA) tags, Myc tags, VSV-G tags, and thioredoxin (Trx) tags. Examples of reporter genes include, but are not limited to, glutathione-5-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT), beta-galactosidase, beta-glucuronidase, luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), and autofluorescent proteins including blue fluorescent protein (BFP). A prime editor may be fused to a gene sequence encoding a protein or a fragment of a protein that binds DNA molecules or binds other cellular molecules, including, but not limited to, maltose binding protein (MBP), S-tag, Lex A DNA binding domain (DBD) fusions, GAL4 DNA binding domain fusions, and herpes simplex virus (HSV) BP16 protein fusions. Additional domains that may form part of a prime editor are described in US Patent Publication No.2011/0059502, published March 10, 2011, and incorporated herein by reference. [0577] In an aspect of the disclosure, a reporter gene that includes, but is not limited to, glutathione-5-transferase (GST), horseradish peroxidase (HRP), chloramphenicol acetyltransferase (CAT) beta-galactosidase, beta-glucuronidase, luciferase, green fluorescent protein (GFP), HcRed, DsRed, cyan fluorescent protein (CFP), yellow fluorescent protein (YFP), and autofluorescent proteins including blue fluorescent protein (BFP), may be introduced into a cell to encode a gene product that serves as a marker by which to measure the alteration or modification of expression of the gene product. In certain embodiments of the disclosure, the gene product is luciferase. In a further embodiment of the disclosure, the expression of the gene product is decreased. [0578] Suitable protein tags provided herein include, but are not limited to, biotin carboxylase carrier protein (BCCP) tags, myc-tags, calmodulin-tags, FLAG-tags, hemagglutinin (HA)-tags, polyhistidine tags, also referred to as histidine tags or His-tags, maltose binding protein (MBP)-tags, nus-tags, glutathione-S-transferase (GST)-tags, green fluorescent protein (GFP)-tags, thioredoxin-tags, S-tags, Softags (e.g., Softag 1, Softag 3), strep-tags, biotin ligase tags, FlAsH tags, V5 tags, and SBP-tags. Additional suitable sequences will be apparent to those of skill in the art. In some embodiments, the fusion protein comprises one or more His tags. [0579] In some embodiments of the present disclosure, the activity of the prime editing system may be temporally regulated by adjusting the residence time, the amount, and/or the activity of the expressed components of the PE system. For example, as described herein, the PE may be fused with a protein domain that is capable of modifying the intracellular half-life of the PE. In certain embodiments involving two or more vectors (e.g., a vector system in which the components described herein are encoded on two or more separate vectors), the activity of the PE system may be temporally regulated by controlling the timing in which the vectors are delivered. For example, in some embodiments, a vector encoding the nuclease system may deliver the PE prior to the vector encoding the template. In other embodiments, the vector encoding the PEgRNA may deliver the guide prior to the vector encoding the PE system. In some embodiments, the vectors encoding the PE system and PEgRNA are delivered simultaneously. In certain embodiments, the simultaneously delivered vectors temporally deliver, e.g., the PE, PEgRNA, and/or second strand guide RNA components. In further embodiments, the RNA (such as, e.g., the nuclease transcript) transcribed from the coding sequence on the vectors may further comprise at least one element that is capable of modifying the intracellular half-life of the RNA and/or modulating translational control. In some embodiments, the half-life of the RNA may be increased. In some embodiments, the half-life of the RNA may be decreased. In some embodiments, the element may be capable of increasing the stability of the RNA. In some embodiments, the element may be capable of decreasing the stability of the RNA. In some embodiments, the element may be within the 3′ UTR of the RNA. In some embodiments, the element may include a polyadenylation signal (PA). In some embodiments, the element may include a cap, e.g., an upstream mRNA or PEgRNA end. In some embodiments, the RNA may comprise no PA such that it is subject to quicker degradation in the cell after transcription. In some embodiments, the element may include at least one AU-rich element (ARE). The AREs may be bound by ARE binding proteins (ARE-BPs) in a manner that is dependent upon tissue type, cell type, timing, cellular localization, and environment. In some embodiments, the destabilizing element may promote RNA decay, affect RNA stability, or activate translation. In some embodiments, the ARE may comprise 50 to 150 nucleotides in length. In some embodiments, the ARE may comprise at least one copy of the sequence AUUUA. In some embodiments, at least one ARE may be added to the 3′ UTR of the RNA. In some embodiments, the element may be a Woodchuck Hepatitis Virus (WHP). [0580] Posttranscriptional Regulatory Element (WPRE), which creates a tertiary structure to enhance expression from the transcript. In further embodiments, the element is a modified and/or truncated WPRE sequence that is capable of enhancing expression from the transcript, as described, for example in Zufferey et al., J. Virol., 73(4): 2886-92 (1999) and Flajolet et al., J. Virol., 72(7): 6175-80 (1998). In some embodiments, the WPRE or equivalent may be added to the 3′ UTR of the RNA. In some embodiments, the element may be selected from other RNA sequence motifs that are enriched in either fast- or slow-decaying transcripts. [0581] In some embodiments, the vector encoding the PE or the PEgRNA may be self- destroyed via cleavage of a target sequence present on the vector by the PE system. The cleavage may prevent continued transcription of a PE or a PEgRNA from the vector. Although transcription may occur on the linearized vector for some amount of time, the expressed transcripts or proteins subject to intracellular degradation will have less time to produce off-target effects without continued supply from expression of the encoding vectors. Pharmaceutical compositions [0582] Other aspects of the present disclosure relate to pharmaceutical compositions comprising any of the guide RNAs (including, e.g., PEgRNAs, ePEgRNAs, and second strand nicking gRNAs), fusion proteins, and polynucleotides described herein. The term “pharmaceutical composition,” as used herein, refers to a composition formulated for pharmaceutical use. In some embodiments, the pharmaceutical composition further comprises a pharmaceutically acceptable carrier. In some embodiments, the pharmaceutical composition comprises additional agents (e.g., for specific delivery, increasing half-life, or other therapeutic compounds). [0583] As used herein, the term “pharmaceutically-acceptable carrier” means a pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, manufacturing aid (e.g., lubricant, talc magnesium, calcium or zinc stearate, or steric acid), or solvent encapsulating material, involved in carrying or transporting the compound from one site (e.g., the delivery site) of the body, to another site (e.g., an organ, tissue, or other part of the body). A pharmaceutically acceptable carrier is “acceptable” in the sense of being compatible with the other ingredients of the formulation and not injurious to the tissue of the subject (e.g., physiologically compatible, sterile, physiologic pH, etc.). Some examples of materials that can serve as pharmaceutically-acceptable carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) starches, such as corn starch and potato starch; (3) cellulose, and its derivatives, such as sodium carboxymethyl cellulose, methylcellulose, ethyl cellulose, microcrystalline cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; (6) gelatin; (7) lubricating agents, such as magnesium stearate, sodium lauryl sulfate and talc; (8) excipients, such as cocoa butter and suppository waxes; (9) oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil, and soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerin, sorbitol, mannitol and polyethylene glycol (PEG); (12) esters, such as ethyl oleate and ethyl laurate; (13) agar; (14) buffering agents, such as magnesium hydroxide and aluminum hydroxide; (15) alginic acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's solution; (19) ethyl alcohol; (20) pH buffered solutions; (21) polyesters, polycarbonates and/or polyanhydrides; (22) bulking agents, such as polypeptides and amino acids; (23) serum component, such as serum albumin, HDL, and LDL; (22) C2-C12 alcohols, such as ethanol; and (23) other non-toxic compatible substances employed in pharmaceutical formulations. Wetting agents, coloring agents, release agents, coating agents, sweetening agents, flavoring agents, perfuming agents, preservatives, and antioxidants can also be present in the formulation. The terms such as “excipient,” “carrier,” “pharmaceutically acceptable carrier,” or the like are used interchangeably herein. [0584] In some embodiments, the pharmaceutical composition is formulated for delivery to a subject, e.g., for gene editing. Suitable routes of administering the pharmaceutical composition described herein include, without limitation: topical, subcutaneous, transdermal, intradermal, intralesional, intraarticular, intraperitoneal, intravesical, transmucosal, gingival, intradental, intracochlear, transtympanic, intraorgan, epidural, intrathecal, intramuscular, intravenous, intravascular, intraosseus, periocular, intratumoral, intracerebral, and intracerebroventricular administration. [0585] In some embodiments, the pharmaceutical composition described herein is administered locally to a diseased site (e.g., tumor site). In some embodiments, the pharmaceutical composition described herein is administered to a subject by injection, by means of a catheter, by means of a suppository, or by means of an implant, the implant being of a porous, non-porous, or gelatinous material, including a membrane, such as a sialastic membrane, or a fiber. [0586] In other embodiments, the pharmaceutical composition described herein is delivered in a controlled release system. In one embodiment, a pump may be used (see, e.g., Langer, 1990, Science 249:1527-1533; Sefton, 1989, CRC Crit. Ref. Biomed. Eng.14:201; Buchwald et al., 1980, Surgery 88:507; Saudek et al., 1989, N. Engl. J. Med.321:574). In another embodiment, polymeric materials can be used. (See, e.g., Medical Applications of Controlled Release (Langer and Wise eds., CRC Press, Boca Raton, Fla., 1974); Controlled Drug Bioavailability, Drug Product Design and Performance (Smolen and Ball eds., Wiley, New York, 1984); Ranger and Peppas, 1983, Macromol. Sci. Rev. Macromol. Chem.23:61. See also Levy et al., 1985, Science 228:190; During et al., 1989, Ann. Neurol.25:351; Howard et al., 1989, J. Neurosurg.71:105). Other controlled release systems are discussed, for example, in Langer, supra. [0587] In some embodiments, the pharmaceutical composition is formulated in accordance with routine procedures as a composition adapted for intravenous or subcutaneous administration to a subject, e.g., a human. In some embodiments, pharmaceutical compositions for administration by injection are solutions in sterile isotonic aqueous buffer. Where necessary, the pharmaceutical composition can also include a solubilizing agent and a local anesthetic such as lignocaine to ease pain at the site of the injection. Generally, the ingredients are supplied either separately or mixed together in unit dosage form, for example, as a dry lyophilized powder or water free concentrate in a hermetically sealed container such as an ampoule or sachette indicating the quantity of active agent. Where the pharmaceutical composition is to be administered by infusion, it can be dispensed with an infusion bottle containing sterile pharmaceutical grade water or saline. Where the pharmaceutical composition is administered by injection, an ampoule of sterile water for injection or saline can be provided so that the ingredients can be mixed prior to administration. [0588] A pharmaceutical composition for systemic administration may be a liquid, e.g., sterile saline, lactated Ringer’s or Hank’s solution. In addition, the pharmaceutical composition can be in solid forms and re-dissolved or suspended immediately prior to use. Lyophilized forms are also contemplated. [0589] The pharmaceutical composition can be contained within a lipid particle or vesicle, such as a liposome or microcrystal, which is also suitable for parenteral administration. The particles can be of any suitable structure, such as unilamellar or plurilamellar, so long as compositions are contained therein. Compounds can be entrapped in “stabilized plasmid-lipid particles” (SPLP) containing the fusogenic lipid dioleoylphosphatidylethanolamine (DOPE), low levels (5-10 mol%) of cationic lipid and stabilized by a polyethyleneglycol (PEG) coating (Zhang Y. P. et al., Gene Ther.1999, 6:1438-47). Positively charged lipids such as N-[1-(2,3-dioleoyloxi)propyl]-N,N,N-trimethyl-amoniummethyls ulfate, or “DOTAP,” are particularly preferred for such particles and vesicles. The preparation of such lipid particles is well known. See, e.g., U.S. Patent Nos.4,880,635; 4,906,477; 4,911,928; 4,917,951; 4,920,016; and 4,921,757; each of which is incorporated herein by reference. [0590] The pharmaceutical compositions described herein may be administered or packaged as a unit dose, for example. The term “unit dose” when used in reference to a pharmaceutical composition of the present disclosure refers to physically discrete units suitable as unitary dosage for the subject, each unit containing a predetermined quantity of active material calculated to produce the desired therapeutic effect in association with the required diluent; i.e., carrier, or vehicle. [0591] Further, the pharmaceutical composition can be provided as a pharmaceutical kit comprising (a) a container containing a compound of the invention in lyophilized form and (b) a second container containing a pharmaceutically acceptable diluent (e.g., sterile water) for injection. The pharmaceutically acceptable diluent can be used for reconstitution or dilution of the lyophilized compound of the invention. Optionally associated with such container(s) can be a notice in the form prescribed by a governmental agency regulating the manufacture, use, or sale of pharmaceuticals or biological products, which notice reflects approval by the agency of manufacture, use, or sale for human administration. [0592] In another aspect, an article of manufacture containing materials useful for the treatment of the diseases described above is included. In some embodiments, the article of manufacture comprises a container and a label. Suitable containers include, for example, bottles, vials, syringes, and test tubes. The containers may be formed from a variety of materials such as glass or plastic. In some embodiments, the container holds a composition that is effective for treating a disease and may have a sterile access port. For example, the container may be an intravenous solution bag or a vial having a stopper pierce-able by a hypodermic injection needle. The active agent in the composition is a compound of the invention. In some embodiments, the label on or associated with the container indicates that the composition is used for treating the disease of choice. The article of manufacture may further comprise a second container comprising a pharmaceutically acceptable buffer, such as phosphate-buffered saline, Ringer’s solution, or dextrose solution. It may further include other materials desirable from a commercial and user standpoint, including other buffers, diluents, filters, needles, syringes, and package inserts with instructions for use. Kits and Cells [0593] The guide RNAs (including pegRNAs and epegRNAs), fusion proteins, and compositions of the present disclosure may be assembled into kits. In some embodiments, the kit comprises polynucleotides for expression of the prime editors and/or pegRNAs, epegRNAs, and second strand nicking gRNAs described herein. In other embodiments, the kit further comprises appropriate guide nucleotide sequences or nucleic acid vectors for the expression of such guide nucleotide sequences, to target the Cas9 protein of the prime editors to the desired target sequence (e.g., a gene associate with a triplet repeat disorder, such as the HTT or FXN genes). [0594] The kits described herein may include one or more containers housing components for performing the methods described herein, and optionally instructions for use. Any of the kits described herein may further comprise components needed for performing the prime editing methods described herein. Each component of the kits, where applicable, may be provided in liquid form (e.g., in solution) or in solid form, (e.g., a dry powder). In certain cases, some of the components may be reconstitutable or otherwise processible (e.g., to an active form), for example, by the addition of a suitable solvent or other species (for example, water), which may or may not be provided with the kit. [0595] In some embodiments, the kits may optionally include instructions and/or promotion for use of the components provided. As used herein, “instructions” can define a component of instruction and/or promotion, and typically involve written instructions on or associated with packaging of the disclosure. Instructions also can include any oral or electronic instructions provided in any manner such that a user will clearly recognize that the instructions are to be associated with the kit, for example, audiovisual (e.g., videotape, DVD, etc.), Internet, and/or web-based communications, etc. The written instructions may be in a form prescribed by a governmental agency regulating the manufacture, use, or sale of pharmaceuticals or biological products, which can also reflect approval by the agency of manufacture, use, or sale for animal administration. As used herein, “promoted” includes all methods of doing business including methods of education, hospital and other clinical instruction, scientific inquiry, drug discovery or development, academic research, pharmaceutical industry activity including pharmaceutical sales, and any advertising or other promotional activity including written, oral, and electronic communication of any form, associated with the disclosure. Additionally, the kits may include other components depending on the specific application, as described herein. [0596] The kits may contain any one or more of the components described herein in one or more containers. The components may be prepared sterilely, packaged in a syringe, and shipped refrigerated. Alternatively, they may be housed in a vial or other container for storage. A second container may have other components prepared sterilely. Alternatively, the kits may include the active agents premixed and shipped in a vial, tube, or other container. [0597] The kits may have a variety of forms, such as a blister pouch, a shrink-wrapped pouch, a vacuum sealable pouch, a sealable thermoformed tray, or a similar pouch or tray form, with the accessories loosely packed within the pouch, one or more tubes, containers, a box, or a bag. The kits may be sterilized after the accessories are added, thereby allowing the individual accessories in the container to be otherwise unwrapped. The kits can be sterilized using any appropriate sterilization techniques, such as radiation sterilization, heat sterilization, or other sterilization methods known in the art. The kits may also include other components, depending on the specific application, for example, containers, cell media, salts, buffers, reagents, syringes, needles, a fabric, such as gauze, for applying or removing a disinfecting agent, disposable gloves, a support for the agents prior to administration, etc. Some aspects of this disclosure provide kits comprising a nucleic acid construct comprising a nucleotide sequence encoding the prime editor systems described herein, or various components thereof (e.g., including, but not limited to, the napDNAbps, reverse transcriptase domains, and pegRNAs/epegRNAs/second strand nicking gRNAs). In some embodiments, the nucleotide sequence(s) comprises a heterologous promoter (or more than a single promoter) that drives expression of the prime editor system components. [0598] Other aspects of this disclosure provide kits comprising one or more nucleic acid constructs encoding the various components of the prime editing system described herein. In some embodiments, the nucleotide sequence comprises a heterologous promoter that drives expression of the prime editing system components. [0599] Cells that may contain any of the guide RNAs, fusion proteins, and compositions described herein include prokaryotic cells and eukaryotic cells. The methods described herein may be used to deliver a prime editor and/or guide RNA into a eukaryotic cell (e.g., a mammalian cell, such as a human cell). In some embodiments, the cell is in vitro (e.g., a cultured cell). In some embodiments, the cell is in vivo (e.g., in a subject, such as a human subject). In some embodiments, the cell is ex vivo (e.g., isolated from a subject and may be administered back to the same or a different subject). [0600] Mammalian cells of the present disclosure include human cells, primate cells (e.g., vero cells), rat cells (e.g., GH3 cells, OC23 cells) or mouse cells (e.g., MC3T3 cells). There are a variety of human cell lines, including, without limitation, human embryonic kidney (HEK) cells, HeLa cells, cancer cells from the National Cancer Institute's 60 cancer cell lines (NCI60), DU145 (prostate cancer) cells, Lncap (prostate cancer) cells, MCF-7 (breast cancer) cells, MDA-MB-438 (breast cancer) cells, PC3 (prostate cancer) cells, T47D (breast cancer) cells, THP-1 (acute myeloid leukemia) cells, U87 (glioblastoma) cells, SHSY5Y human neuroblastoma cells (cloned from a myeloma) and Saos-2 (bone cancer) cells. In some embodiments, prime editors and/or guide RNAs are delivered into human embryonic kidney (HEK) cells (e.g., HEK293 or HEK293T cells). In some embodiments, prime editors and/or guide RNAs are delivered into stem cells (e.g., human stem cells), such as, for example, pluripotent stem cells (e.g., human pluripotent stem cells including human induced pluripotent stem cells (hiPSCs)). A stem cell refers to a cell with the ability to divide for indefinite periods in culture and to give rise to specialized cells. A pluripotent stem cell refers to a type of stem cell that is capable of differentiating into all tissues of an organism, but not alone capable of sustaining full organismal development. A human induced pluripotent stem cell refers to a somatic (e.g., mature or adult) cell that has been reprogrammed to an embryonic stem cell-like state by being forced to express genes and factors important for maintaining the defining properties of embryonic stem cells (see, e.g., Takahashi and Yamanaka, Cell 126 (4): 663–76, 2006, incorporated by reference herein). Human induced pluripotent stem cells express stem cell markers and are capable of generating cells characteristic of all three germ layers (ectoderm, endoderm, mesoderm). [0601] Additional non-limiting examples of cell lines that may be used in accordance with the present disclosure include 293-T, 293-T, 3T3, 4T1, 721, 9L, A-549, A172, A20, A253, A2780, A2780ADR, A2780cis, A431, ALC, B16, B35, BCP-1, BEAS-2B, bEnd.3, BHK-21, BR 293, BxPC3, C2C12, C3H-10T1/2, C6, C6/36, Cal-27, CGR8, CHO, CML T1, CMT, COR-L23, COR-L23/5010, COR-L23/CPR, COR-L23/R23, COS-7, COV-434, CT26, D17, DH82, DU145, DuCaP, E14Tg2a, EL4, EM2, EM3, EMT6/AR1, EMT6/AR10.0, FM3, H1299, H69, HB54, HB55, HCA2, Hepa1c1c7, High Five cells, HL-60, HMEC, HT-29, HUVEC, J558L cells, Jurkat, JY cells, K562 cells, KCL22, KG1, Ku812, KYO1, LNCap, Ma-Mel 1, 2, 3....48, MC-38, MCF-10A, MCF-7, MDA-MB-231, MDA-MB-435, MDA- MB-468, MDCK II, MG63, MONO-MAC 6, MOR/0.2R, MRC5, MTD-1A, MyEnd, NALM- 1, NCI-H69/CPR, NCI-H69/LX10, NCI-H69/LX20, NCI-H69/LX4, NIH-3T3, NW-145, OPCN/OPCT Peer, PNT-1A/PNT 2, PTK2, Raji, RBL cells, RenCa, RIN-5F, RMA/RMAS, S2, Saos-2 cells, Sf21, Sf9, SiHa, SKBR3, SKOV-3, T-47D, T2, T84, THP1, U373, U87, U937, VCaP, WM39, WT-49, X63, YAC-1, and YAR cells. [0602] Some aspects of this disclosure provide cells comprising any of the constructs disclosed herein. In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors described herein. In some embodiments, a cell is transfected as it naturally occurs in a subject. In some embodiments, a cell that is transfected is taken from a subject. In some embodiments, the cell is derived from cells taken from a subject, such as a cell line. A wide variety of cell lines for tissue culture are known in the art. Examples of cell lines include, but are not limited to, C8161, CCRF-CEM, MOLT, mIMCD- 3, NHDF, HeLa-S3, Huh1, Huh4, Huh7, HUVEC, HASMC, HEKn, HEKa, MiaPaCell, Panc1, PC-3, TF1, CTLL-2, C1R, Rat6, CV1, RPTE, A10, T24, J82, A375, ARH-77, Calu1, SW480, SW620, SKOV3, SK-UT, CaCo2, P388D1, SEM-K2, WEHI-231, HB56, TIB55, Jurkat, J45.01, LRMB, Bcl-1, BC-3, IC21, DLD2, Raw264.7, NRK, NRK-52E, MRC5, MEF, Hep G2, HeLa B, HeLa T4, COS, COS-1, COS-6, COS-M6A, BS-C-1 monkey kidney epithelial, BALB/3T3 mouse embryo fibroblast, 3T3 Swiss, 3T3-L1, 132-d5 human fetal fibroblasts; 10.1 mouse fibroblasts, 293-T, 3T3, 721, 9L, A2780, A2780ADR, A2780cis, A 172, A20, A253, A431, A-549, ALC, B16, B35, BCP-1 cells, BEAS-2B, bEnd.3, BHK-21, BR 293, BxPC3, C3H-10T1/2, C6/36, Cal-27, CHO, CHO-7, CHO-IR, CHO-K1, CHO-K2, CHO-T, CHO Dhfr −/−, COR-L23, COR-L23/CPR, COR-L23/5010, COR-L23/R23, COS-7, COV-434, CML T1, CMT, CT26, D17, DH82, DU145, DuCaP, EL4, EM2, EM3, EMT6/AR1, EMT6/AR10.0, FM3, H1299, H69, HB54, HB55, HCA2, HEK-293, HeLa, Hepa1c1c7, HL-60, HMEC, HT-29, Jurkat, JY cells, K562 cells, Ku812, KCL22, KG1, KYO1, LNCap, Ma-Mel 1-48, MC-38, MCF-7, MCF-10A, MDA-MB-231, MDA-MB-468, MDA-MB-435, MDCK II, MDCK 11, MOR/0.2R, MONO-MAC 6, MTD-1A, MyEnd, NCI- H69/CPR, NCI-H69/LX10, NCI-H69/LX20, NCI-H69/LX4, NIH-3T3, NALM-1, NW-145, OPCN/OPCT cell lines, Peer, PNT-1A/PNT 2, RenCa, RIN-5F, RMA/RMAS, Saos-2 cells, Sf-9, SkBr3, T2, T-47D, T84, THP1 cell line, U373, U87, U937, VCaP, Vero cells, WM39, WT-49, X63, YAC-1, YAR, and transgenic varieties thereof. [0603] Cell lines are available from a variety of sources known to those with skill in the art (see, e.g., the American Type Culture Collection (ATCC) (Manassus, VA)). In some embodiments, a cell transfected with one or more vectors described herein is used to establish a new cell line comprising one or more vector-derived sequences. In some embodiments, a cell transiently transfected with the components of a CRISPR system as described herein (such as by transient transfection of one or more vectors, or transfection with RNA), and modified through the activity of a CRISPR complex, is used to establish a new cell line comprising cells containing the modification but lacking any other exogenous sequence. In some embodiments, cells transiently or non-transiently transfected with one or more vectors described herein, or cell lines derived from such cells, are used in assessing one or more test compounds. EXAMPLES Example 1. Using prime editing to reduce the size of CAG repeat tracts to a normal polyQ length [0604] In Huntington’s disease (HD), the length of CAG repeats in HTT are typically ≥40, while repeat length in the general population is described as a normal distribution ranging from 9 to 35 CAGs, with a median of 17 CAGs 14,15 (FIG.1). PolyQ lengths in HTT below this normal range have not been observed, and extreme truncation of polyQ is considered detrimental to HTT protein function 10 . Excision of the locus surrounding the CAG repeats in HTT exon 1 using paired nucleases has been demonstrated to alleviate polyQ proteinopathy in vitro and improve motor deficits in mice 16–19 . However, these strategies result in the additional loss of coding sequence and also cannot differentiate between pathogenic and non- pathogenic HTT alleles, thus resulting in loss of wild type HTT protein expression. 10,11,16–18,20 The outcome of HTT loss in cells is not fully known 21,22 . Recent clinical trial outcomes, however, suggest that loss of HTT expression may in fact induce toxicity in HD 23 . [0605] Prime editing 12 was explored as a means to precisely modify HTT exon 1 to replace CAG repeats with a small, normal number of triplets without disruption of the surrounding sequence. A few hundred prime editing strategies (PE2) were extensively screened using various prime editing sgRNA (pegRNA) designs. Multiple spacer sequences paired with different PBS and RT template (RTT: homology + synthesis template sequence) lengths were tested, which allowed selection of the best spacer sequence (“spacer”) enabling up to 2-4% average replacement of CAG repeats at endogenous loci in HEK293T cells (FIGs.2 and 3A- 3B). To increase editing efficiency, PE3 strategies, which pair PE2 editing with an additional sgRNA to induce a nearby nick near or within the editing target site to bias DNA repair towards incorporation of the prime edit 12,24,25 , were explored. Out of five different nicking conditions, one was selected – nick N5 – that resulted in the highest editing and the best editing:indel ratio (FIGs.3A-3B). [0606] Next, another extensive screen of pegRNA designs with varying PBS and RTT lengths was performed in a PE3 system with an extra N5 nick. From all tested conditions, pegRNAs with PBS greater than 8 nt and homology longer than 25 nt resulted in the highest prime editing efficiency (FIGs.4A-4B). Next, additional prime editing improvements at this locus were further evaluated. pegRNA with 3′ structural motifs (evopreq1 and mpknot) 26 were tested with rationally designed or computed by pegLIT 26 linker sequences, and an ~2- fold increase in editing efficiency was observed after applying evopreq1 motif with a rationally designed linker sequence (FIGs.5A-5B). All designs moving forward include the evopreq1 motif and are referred to as “epegRNA”. [0607] The length of the epegRNA homology sequence was then revisited, and it was determined that epegRNAs with homology of 31 and 40 nt are most efficient (FIG.6). It was also validated that “PEmax” optimized editor architecture resulted in better prime editing at this locus (FIG.7). Significant improvement of editing while using these editors in a combination with overexpression of dominant negative MLH1 protein (PE4 and PE5, without and with an additional nicking sgRNA respectively) was not observed (FIG.8) 27 . All conditions discussed moving forward were performed with the PEmax editor. [0608] To explore additional pegRNA design options, multiple dual epegRNA strategies were screened, including PrimeDel 28 and TwinPE 29 , and no improvement was observed in comparison to the single epegRNA strategy (FIG.10). Next, the optimized epegRNA architecture was used in a PE3 system (N5 nick), and different variants of the sequence were screened, replacing CAG repeats (from 2 CAGs to 9 CAGs, either as pure CAG codons or in a combination with interrupting CAA codons – “CARs”, where R stands for “either A or G”). It was observed that replacement of the repeats with 4CAG units results in the highest editing efficiency (FIGs.10A-10B). To further maximize editing efficiency and maximize the editing:indel ratio in the PE3 system, a few more nicking guides were screened, either recognizing the canonical NGG or non-canonical NGA PAMs. It was determined that N5 nicking guide offers the highest editing efficiency, but N23b nicking guide can be a safer alternative (resulting in lower indels, but also slightly decreased correct editing) if using a PE3b editing system is desired (FIG.11) 12 . Next, the efficiency of epegRNAs with a regular scaffold was compared to ones with a modified scaffold containing a UA flip. This improvement was implemented for all future applications (FIGs.12A-12B) 30 . [0609] A screen of various different prime editors that employ evolved or rationally designed reverse transcriptase (RT) components was performed next. It was determined that a PE variant with a truncation of the RNAseH domain (e.g., between D497 and I498 of SEQ ID NO: 33) and V223Y mutation in its RT (“V223Y”), as well as pRT-5.800 editor with Tf1 RT (“pRT-5.800max”), in a combination with the optimized epegRNA and nicking guide (N5) resulted in editing significantly higher than PEmax, most likely due to their higher activity and processivity. Additionally, these two editors are much smaller in size, which is advantageous for in vivo applications. This trend was true for short and long CAG replacement variants (FIGs.13A-13B and 14). [0610] Finally, an extensive epegRNA screen was performed in which silent mutations were introduced in the RTT sequence (i.e., in the synthesis template portion of the RTT) to relax the secondary structure of the RNA and, consequently, allow for a more efficient reverse transcription of the RTT template sequence. A few mutations that resulted in considerably higher editing efficiency were found for both, 4CAG/4CAR (pure CAG repeats or a mixture of CAG and CAA codons, followed by endogenous CAACAG sequence, resulting in six glutamines total) and 9CAG/9CAR (pure CAG repeats or a mixture of CAG and CAA codons, followed by endogenous CAACAG sequence, resulting in eleven glutamines total) replacement strategies (FIGs.15A-15B, 16A-16C, and 17A-17B). It was found that epegRNA reducing CAG repeat size to 4CAG repeats encoding additional GC2+CT2+GA1 mutations improves editing ~2 fold over the previous epegRNA design (homology 31nt, PBS 10nt, epegRNA, UA scaffold flip) in the PE3 system (with N5 nicking guide) (FIGs.18A- 18B). Additionally, epegRNA reducing CAG repeat size to 9CARs encoding additional GC2+CT2 mutations improves editing > 2-fold over the previous epegRNA design (homology 31nt or 40nt, PBS 10nt, epegRNA, UA scaffold flip) in the PE3 system (N5 nicking guide) (FIGs.19A-19B). [0611] In summary, the final prime editing strategy to reduce the size of CAG repeats in the HTT locus employs the following improvements: PE3 or PE3b system 12 , PEmax 27 , epegRNA 26 , UA scaffold flip 30 , pegRNA RTT sequence modification, and PE RT variants. All the data presented has been analyzed with software – CAGspresso – developed for this study. [0612] Next, prime editors comprising Cas9 variants were tested in an effort to find a more efficient prime editor variant that would increase editing for both the PE3 and PE3b strategies for 4CAG and 9CAG replacement (FIGs.21-28). To make the differences in PE performance more visible, PE plasmid input was decreased to a concentration that becomes limiting for prime editing, and the different PE variants were compared under this more stringent condition. Two Cas9 variants were identified: “eeCas9-1” (SpCas9 with K918A + K775R) and “eeCas9-8” (SpCas9 with D23G + H754R) in a combination with truncated MMLV (RnaseH deletion) with a V223Y mutation (V223YRHdelta) showed higher editing efficiency than a prime editor comprising the V223YRHdelta variant by itself. FIGs.29A-29B provide an updated summary of the PE strategy to reduce CAG repeats in view of the additional Cas9 variants tested. [0613] To validate the prime editing strategy in more relevant cell lines, transgenic mouse embryonic stem cell lines (mESCs) containing the human HTT exon 1 were generated. Using Tol2 transposase, two cell lines expressing HTT exon 1 with either 21 or 72 pure CAG repeats 31 , ending in a terminal ‘CAACAG’ sequence 32,33 , were generated. These transgenic cell lines allowed rapid screening of genome editing at both non-pathogenic (21 CAG) and pathogenic (72 CAG) repeat lengths, that are larger than CAG repeats at endogenous HTT alleles typically found in wild type human cell lines (typically 16-18 CAG) (FIGs.30 and 31A-31B). Notably, prime editing at shorter CAG repeats in HTT-mESC was lower (22% at 21 CAGs), suggesting prime editing may be more efficient at pathogenic repeat lengths than normal alleles, possibly due to unfavored secondary structure formation of long repeat alleles (FIGs.32A-32B). Editing efficiency was somewhat decreased when pegRNAs encode the incorporation of a greater number of CAGs (FIG.32C). [0614] Next, the prime editing strategy to reduce CAG repeats was tested in a HD mouse model with expanded CAG repeats. An Htt.Q111 mouse model was used that carries a human pathogenic HTT allele with ~111 glutamines and exhibits somatic repeat instability in the CNS (including the striatum and cortex) and the liver 71–73 . [0615] A dual-ssAAV delivery system was generated for the HTT-targeting prime editing strategy to replace the pathogenic number of CAG repeats in Htt.Q111 with either 6 or 11 glutamines 54,74,75 . In the first round of preliminary optimizations, intracerebroventricular (ICV) injections were performed in Htt.Q111 neonates using ssAAV9 virus and the htt-v1 (dual ssAAV9-PE-v1). Four weeks post injection, no appreciable prime editing was observed above background (FIG.33). The AAV architecture was reevaluated. and a new prime editing strategy (htt-v2) that utilizes additional improvements to the prime editor and pegRNA was used (FIGs.34A-34B). P0 ICV was injected into HTT.Q111 neonates with a regular dose of dual ssAAV9-PE-v2 (4.0x10 10 vg total) 54 , and editing was evaluated four weeks post injection. Approximately 1% of alleles in the cortex and over 3% of alleles in the liver acquired the desired prime edit (FIGs.35A-35C). However, it was determined that the majority of the edited alleles still carry the endogenously encoded pathogenic CAG repeats (FIGs.35A-35C). The treatment with ssAAV9-PE-v2 was extended, and transduction efficiency in treated animals was assessed. Eight weeks after the injection, ~60% transduction efficiency in the cortex and ~20% GFP+ flow-sorted nuclei in the striatum was observed (FIGs.36A-36C). The editing observed in the examined tissues increased ~2-fold compared to the 4-week time point (FIGs.36A-36C and 37). As the transduction efficiency did not seem to be limiting, the alternative PE3b prime editing strategy was next evaluated (htt-v3, described above), which yielded good editing efficiency with much lower indels (FIGs.38A- 38C). By using this new approach (dual ssAAV9-PE-v3), the goal was to reduce undesired editing outcomes and possibly increase editing efficiency in vivo (FIGs.38A-38C). The mice were P0 ICV injected with a high dose (1.6x10 11 vg total) of the dual ssAAV9-PE-v3 to compensate for the lower editing efficiency of the PE3b system compared to PE3 observed in vitro, and the editing outcomes were assessed 4 weeks post injection. Compared to htt-v2, strategy htt-v3 showed a slight increase in % alleles with the prime edit, including up to 2- fold in the cortex and liver (FIGs.39A-39D). Most edited alleles in the cortex and striatum still contained the endogenous CAG repeats (FIGs.39A-39D). The fraction of correctly edited alleles improved, however, for the liver, in which >50% of all prime edits did not contain the pathogenic CAG tract (FIGs.39A-39D). This unexpected editing outcome may be caused by 1) DNA damage repair mechanisms resolving the transient DSB at the edited site before editing is complete; 2) unsuccessful flap resolution after the desired sequence (edit) gets incorporated into the DNA; or 3) inefficient prime editing in the transduced cells caused by a non-optimal strategy design and/or the complex character of the edit. To address these potential limitations, additional evaluation of the prime editing strategies in vitro was performed by testing new PE variants to further improve editing. It was found that some strategies (htt-v4a and htt-v4b), have the potential to perform better in the endogenous HTT context. These strategies were shown to yield slightly increased editing efficiency in vitro (FIGs.40A-40B). Example 2. Using prime editing to remove long GAA expansions at pathogenic FXN alleles [0616] The length of FXN GAA-repeats in the general population ranges from ~5-60, while FRDA patients may present with 66 to well over 1200 repeats, typically ranging from 600 to 900 repeats 57 . The age of FRDA onset in patients, loss of FXN protein, and severity of symptoms are inversely correlated with the GAA repeat length of the shortest FXN allele (FIG.41). [0617] The first intron in FXN plays a key role in transcriptional regulation, and GAA repeat expansion induces epigenetic changes to the chromatin state that results in silencing of gene expression 81-83 . Excision of the locus containing the expanded GAA repeats has been demonstrated to increase the total mature FXN transcripts in cells and thereby enable rescue of FXN protein levels and some disease phenotypes in FRDA patient cell lines 64-66,84,85 ; however, due to technical considerations regarding target sequence specificity, strategies result in large (1-20kb) deletion of the surrounding sequence, including substantial removal of native FXN regulatory sequences, which may impact long-term and tissue-specific rescue of FRDA-afflicted tissues 86,87 . [0618] Prime editing 73 was explored as a means to precisely modify FXN intron 1 to remove expanded GAA repeats while minimally disrupting surrounding regulatory sequences. A few hundred prime editing strategies (PE2) using various prime editing sgRNA (pegRNA) designs were screened. Multiple spacer sequences paired with different PBS and RT template (RTT: homology + synthesis template sequence) lengths were tested, which allowed selection of the best spacer sequence (here: “spacer”) enabling up to 18% average deletion of GAA repeats at endogenous loci in HEK293T cells (FIG.42). To increase editing efficiency, PE3 strategies, which pair PE2 editing with an additional sgRNA to induce a nick near or within the editing target site to bias DNA repair towards incorporation of the prime edit were explored 73,74,88 . Out of three different nicking conditions, one was selected – “nick C,” which resulted in the highest editing efficiency (FIGs.43A-43B). [0619] Next, another extensive screen of pegRNA designs with varying PBS and RTT lengths was performed in PE2 and PE3 systems with extra nick C. From all tested conditions, pegRNAs with PBS of 10nt and homology of 40nt resulted in the highest prime editing (FIG. 44A-44B). Additional prime editing improvements at this locus were further evaluated. PegRNA with 3′ structural motifs (evopreq1 and mpknot) 89 were tested with rationally designed linker sequences or linker sequences computed by pegLIT 89 , and a slight increase in editing efficiency was observed after adding an evopreq1 motif with a pegLIT-predicted linker sequence (FIGs.45A-45B). All pegRNA designs from here forward include the evopreq1 motif and are referred to as “epegRNA.” [0620] It was then validated that use of PEmax-optimized editor architecture results in slightly better prime editing at this locus (FIG.46), but no significant improvement of editing was observed while using it in combination with overexpression of dominant negative MLH1 protein (PE4 and PE5, without and with an additional nicking sgRNA, respectively) (FIG. 47) 90 . From here forward, all conditions discussed were performed with PEmax editors. [0621] To explore additional pegRNA design options, multiple dual epegRNA strategies were screened, including PrimeDel 91 and TwinPE 92 , and no improvement was observed in comparison to the single epegRNA strategy (FIG.48). To further maximize editing efficiency and maximize the editing:indel ratio in the PE3 system, additional nicking guides were screened, and it was determined that nicking guide 3b outperforms nick C and offers a safer alternative to the PE3 system, resulting in lower indels (FIGs.49A-49B) 73,93 . Next, the efficiency of epegRNAs with a regular scaffold was compared to ones with a modified scaffold containing UA flip, and this improvement was implemented in all pegRNAs tested from here forward (FIGs.50A-50B) 94 . [0622] Finally, a screen of various different prime editors that employ evolved or rationally designed reverse transcriptase (RT) components was performed. It was determined that a PE variant with a truncation of the RNAseH domain and a V223Y mutation in its RT (“V223Y RHdelta”), as well as pRT-5.800 editor with Tf1 RT (“pRT-5.800max”) in a combination with the optimized epegRNA and nicking guide (3b) resulted in editing significantly higher than regular PEmax, most likely due to their higher activity and processivity. Additionally, these two editors are much smaller in size, which is advantageous for in vivo applications (FIGs.51A-51B). [0623] In summary, an optimized prime editing strategy to remove the long GAA repeat expansions in the FXN locus was developed, employing the following improvements: PE3b system 73,93 , PEmax 90 , epegRNA 89 , UA scaffold flip 94 , and PE RT variants. All the data presented has been analyzed with software – FRATAXino - developed for this study (FIGs. 52A-52B). [0624] To validate the prime editing strategy in more relevant cell lines, transgenic mouse embryonic stem cell lines (mESCs) containing a ~300bp fragment of the human FXN intron 1 were generated. Using Tol2 transposase, a cell line expressing FXN intron 1 with 30 GAA repeats was generated. This transgenic cell line facilitated rapid screening of genome editing at non-pathogenic yet larger GAA repeats than at endogenous FXN alleles typically found in wild type human cell lines (typically 8-18 GAAs) (FIGs.53 and 54). Up to 60% and 55% average prime editing was achieved using all the improvements to the pegRNA (epegRNA, UA scaffold flip) and PE (PEmax, V223Y RHdelta) described above in the PE2 and PE3b system, respectively. This strategy also resulted in 70% precise deletion in transgenic cells (FIGs.54 and 55). [0625] To test additional reverse transcriptase variants and Cas9 candidates that could potentially improve editing efficiency, a number of different editors were screened in the context of limited prime editor quantities to pronounce the differences between the tested editors. The best candidate was found to be a prime editor comprising an MMLV reverse transcriptase with a V223Y mutation and a truncation of the RNaseH domain (e.g., between D497 and I498 of SEQ ID NO: 33) (FIG.56). A regular dose of prime editor was also transfected for comparison. Standard Cas9 nickase used in PE2 showed equally good editing efficiency as the Cas9-1 and Cas9-8 variants described herein (FIG.56). Cas9 comprising an N863A mutation abolished editing activity. Cas9-1 and Cas9-8 were included in further optimizations in cell models with pathogenic GAA repeats. PEmax utilizing V223Y RNaseH- truncated MMLV reverse transcriptase or pRT-5.800max (i.e., prime editor comprising Tf1 RT) both outperformed other RT variants in a PE3b system with 3b nicking guide RNA (FIG. 56). Both editors are much smaller than standard PEmax, which provides advantages for delivery and other applications. [0626] Besides FXN-mESC cells carrying 30 GAA repeats, two additional cell lines with an integrated fragment of FXN intron 1 with 60 or 200 GAA repeats were generated (FIG.57). The leading prime editing strategies were tested in all three FXN-mESCs lines with successful deletion of the targeted sequence (FIGs.58A-58D). It was confirmed that standard Cas9 performs very well, but Cas9 variants eeCas9-1 and eeCas9-8 resulted in increased editing efficiency, especially in the context of longer GAA repeats (FIG.58D). [0627] pegRNA with homology of 40 nucleotides resulted in particularly good editing efficiency in FXN mESC cells, and use of an extra nicking guide did not improve editing efficiency further. Additionally, editing of longer alleles (30, 60, and 200 GAAs) was more efficient than editing in HEK293T cells (9GAAs) (FIG.59). PEmax with V223Y RNaseH- truncated MMLV reverse transcriptase performed similar or better than standard PE2max but has the advantage of being smaller sized (FIG.58B). [0628] Next, the leading prime editing strategies were tested in FRDA patient-derived fibroblasts (line 3816 and line 4078) and in control fibroblasts with a healthy number of GAA repeats (FIG.60A). Three different commercially produced synthetic pegRNA with the same sequence but different RNA modifications that influence RNA stability were used, and the results were compared. It was found that across different pegRNA molecules and fibroblast lines, standard Cas9 nickase from PE2 with V223Y RNaseH-truncated MMLV reverse transcriptase performed best (FIG.60B). [0629] Next, the prime edited fibroblast cells were used to assess phenotypic rescue, as measured by FXN expression (RNA expression measured by ddPCR) (FIGs.61A-61C). This was found to be highly reduced in FRDA patient-derived cells (FIG.61C). The prime editing result (FIG.61B) was compared to rescue in cells edited with a Cas9 nuclease strategy (FIG. 61A), previously published by Rocca et al., Methods and Clinical Development, 2020, 17, 1026-1036. It was found that patient cells treated by prime editing showed a substantial increase in FXN expression, further confirming that prime editing has therapeutic potential for rescuing the main phenotypic trait of FRDA (reduced FXN expression). [0630] For in vivo studies of prime editing GAA repeats, a YG8s mouse model was used (Virmouni et al., Dis Model Mech, 2015, 8(3), 225-235). Before testing the V223Y RNaseH- truncated MMLV reverse transcriptase prime editor, PEmax, together with the leading pegRNA design (and no nicking guide RNA) was packaged (FIG.62A) and tested in vivo in two mouse models of FRDA, with either 300 GAAs or 800 GAAs (FIG.62B). The mice were treated by injecting dual-AAV on postnatal day 0 via ICV (intracerebroventricular injection) or on postnatal day 1 via FVI (facial vein injection). It was determined that AAV9 FXN-v1 strategy shows promising results when delivered systemically. On average, 10% editing in the liver was achieved (FIG.62B). As the target tissues in FRDA are the CNS and heart, the delivery strategy was further optimized. [0631] The optimized prime editing strategy included the pegRNA design described herein, PE3b, and PEmax architecture with V223Y RNaseH-truncated MMLV reverse transcriptase. AAV architecture described in Davis et al., NBME, 2023 was also included to optimize delivery (FIG.63A). Mice were treated with dual-AAV9 by P0 ICV injection (FIG.63B), and editing was measured across the CNS and systemic tissues after 4 and 8 weeks (FIG. 63C). This improved prime editing AAV delivery strategy (AAV9 FXN-v2; see FIG.63A) yielded prime editing efficiencies in the liver of approximately 15% and in the heart of approximately 10% and showed non-zero editing in the cortex (FIG.63C). [0632] The transduction efficiency of the AAV9 FXN-v2 strategy was then evaluated in the cortex, and editing efficiency was evaluated across different tissues in transduced and bulk nuclei (FIGs.64A-64C). Additionally, the mice were treated with an increased dose of the AAV9 FXN-v2 to determine the effect of dose on prime editing. The FXN-v2 strategy significantly increased transduction efficiency (FIG.64C). And the use of a higher viral dose (approximately four times higher) increased the transduction efficiency. No increase in transduction efficiency was observed between four week- and eight week-long treatments. Importantly, it was determined that the higher dose FXN-v2 strategy yielded approximately 20% editing efficiency in bulk cortex nuclei and approximately 40% editing efficiency in transduced cortex nuclei (FIG.64B). Additionally, editing efficiency in the liver and heart reached approximately 20% and 10%, respectively, after FXN-v2 treatment (FIG.64B). [0633] It was then determined whether FRDA mice successfully treated with AAV9 FXN-v2 showed any rescue of FXN expression (FIGs.65A-65C). Accordingly, FXN expression levels in these mice was assessed by measuring RNA expression by ddPCR in the liver of mice treated with a low or high dose of FXN-v2 for 4 and 8 weeks. Compared to untreated animals, both GAA.300 and GAA.800 mice were found to present an obvious increase in FXN expression that continues to increase over time (FIGs.65B-65C). [0634] Table 1. Sequences used in Example 1
[0635] Table 2. Sequences used in Example 2
REFERENCES [0636] 1. Roos, R. A. C. Huntington’s disease: a clinical review. Orphanet journal of rare diseases 5, (2010). [0637] 2. Bates, G. P., Dorsey, R., Gusella, J. F., Hayden, M. R., et al. Huntington disease. Nature Reviews Disease Primers vol.1 (2015). [0638] 3. Frank, S. Treatment of Huntington’s Disease. Neurotherapeutics 11, 153 (2014). [0639] 4. Langbehn, D. R., Hayden, M. R., Paulsen, J. S., Johnson, H., et al. CAG- repeat length and the age of onset in Huntington disease (HD): a review and validation study of statistical approaches. American journal of medical genetics. Part B, Neuropsychiatric genetics^: the official publication of the International Society of Psychiatric Genetics 153B, 397–408 (2010). [0640] 5. Gusella, J. F. CAG Repeat Not Polyglutamine Length Determines Timing of Huntington’s Disease Onset. (2019) doi:10.1016/j.cell.2019.06.036. [0641] 6. Gatto, E. M., Rojas, N. G., Persi, G., Etcheverry, J. L., et al. Huntington disease: Advances in the understanding of its mechanisms. Clinical parkinsonism & related disorders 3, (2020). [0642] 7. Swami, M., Hendricks, A. E., Gillis, T., Massood, T., et al. Somatic expansion of the Huntington’s disease CAG repeat in the brain is associated with an earlier age of disease onset. Human Molecular Genetics 18, 3039–3047 (2009). [0643] 8. Kacher, R., Lejeune, F. X., Noël, S., Cazeneuve, C., et al. Propensity for somatic expansion increases over the course of life in Huntington disease. eLife 10, (2021). [0644] 9. Pinto, R. M., Arning, L., Giordano, J. v., Razghandi, P., et al. Patterns of CAG repeat instability in the central nervous system and periphery in Huntington’s disease and in spinocerebellar ataxia type 1. Human molecular genetics 29, 2551–2567 (2020). [0645] 10. Oura, S., Noda, T., Morimura, N., Hitoshi, S., et al. Precise CAG repeat contraction in a Huntington’s Disease mouse model is enabled by gene editing with SpCas9- NG. Communications Biology 4, (2021). [0646] 11. Cinesi, C., Aeschbach, L., Yang, B. & Dion, V. Contracting CAG/CTG repeats using the CRISPR-Cas9 nickase. Nature Communications 7, 1–10 (2016). [0647] 12. Anzalone, A. v, Randolph, P. B., Davis, J. R., Sousa, A. A., et al. Search-and- replace genome editing without double-strand breaks or donor DNA. Nature 576, 149 (2019). [0648] 13. Komor, A. C., Zhao, K. T., Packer, M. S., Gaudelli, N. M., et al. Improved base excision repair inhibition and bacteriophage Mu Gam protein yields C:G-to-T:A base editors with higher efficiency and product purity. Science Advances 3, eaao4774–eaao4774 (2017). [0649] 14. Wright, G. E. B., Collins, J. A., Kay, C., McDonald, C., et al. Length of uninterrupted CAG repeats, independent of polyglutamine size, results in increased somatic instability and hastened age of onset in Huntington disease. Am. J. Hum. Genet.104(6), 1116- 1126 (2019). [0650] 15. Costa, M. D. C., Magalhães, P., Guimarães, L., Maciel, P., et al. The CAG repeat at the Huntington disease gene in the Portuguese population: insights into its dynamics and to the origin of the mutation. Journal of Human Genetics 200551:351, 189–195 (2006). [0651] 16. Monteys, A. M., Ebanks, S. A., Keiser, M. S. & Davidson, B. L. CRISPR/Cas9 Editing of the Mutant Huntingtin Allele In Vitro and In Vivo. Molecular Therapy 25, 12–23 (2017). [0652] 17. Dabrowska, M., Juzwa, W., Krzyzosiak, W. J. & Olejniczak, M. Precise Excision of the CAG Tract from the Huntingtin Gene by Cas9 Nickases. Frontiers in Neuroscience 12, (2018). [0653] 18. Yang, S., Li, S. & Li, X.-J. CRISPR/Cas9-mediated gene editing ameliorates neurotoxicity in mouse model of Huntington’s disease. J Clin Invest 127, 2719 (2017). [0654] 19. Ekman, F. K., Ojala, D. S., Adil, M. M., Lopez, P. A., et al. CRISPR-Cas9- Mediated Genome Editing Increases Lifespan and Improves Motor Deficits in a Huntington’s Disease Mouse Model. Molecular Therapy - Nucleic Acids 17, 829–839 (2019). [0655] 20. Shin, J. W., Kim, K.-H., Chao, M. J., Atwal, R. S., et al. Permanent inactivation of Huntington’s disease mutation by personalized allele-specific CRISPR/Cas9. doi:10.1093/hmg/ddw286. [0656] 21. Saudou, F. & Humbert, S. The Biology of Huntingtin. Neuron vol.89910– 926 (2016). [0657] 22. A Study to Evaluate the Safety, Tolerability, Pharmacokinetics, and Pharmacodynamics of RO7234292 (ISIS 443139) in Huntington’s Disease Patients Who Participated in Prior Investigational Studies of RO7234292 (ISIS 443139). ClinicalTrials.gov. clinicaltrials.gov/ct2/show/NCT03342053. [0658] 23. Kingwell, K. Double setback for ASO trials in Huntington disease. Nature Reviews. Drug Discovery 20, 412–413 (2021). [0659] 24. Komor, A. C., Kim, Y. B., Packer, M. S., Zuris, J. A., et al. Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature 533, 420–424 (2016). [0660] 25. Gaudelli, N. M., Komor, A. C., Rees, H. A., Packer, M. S., et al. Programmable base editing of A•T to G•C in genomic DNA without DNA cleavage. Nature 551, 464–471 (2017). [0661] 26. Nelson, J. W., Randolph, P. B., Shen, S. P., Everette, K. A., et al. Engineered pegRNAs improve prime editing efficiency. Nature biotechnology 40, 402–410 (2022). [0662] 27. Chen, P. J., Hussmann, J. A., Yan, J., Knipping, F., et al. Enhanced prime editing systems by manipulating cellular determinants of editing outcomes. Cell 184, 5635- 5652.e29 (2021). [0663] 28. Choi, J., Chen, W., Suiter, C. C., Lee, C., et al. Precise genomic deletions using paired prime editing. doi:10.1101/2020.12.30.424891. [0664] 29. Anzalone, A. V., Gao, X. D., Podracky, C. J., Nelson, A. T., et al. Programmable deletion, replacement, integration and inversion of large DNA sequences with twin prime editing. Nature biotechnology 40, (2022). [0665] 30. Chen, F., Ding, X., Feng, Y., Seebeck, T., et al. Targeted activation of diverse CRISPR-Cas systems for mammalian genome editing via proximal CRISPR targeting. Nature Communications 8, 14958 (2017). [0666] 31. Narain, Y., Wyttenbach, A., Rankin, J., Furlong, R. A., et al. Original articles A molecular investigation of true dominance in Huntington’s disease. doi:10.1136/jmg.36.10.739. [0667] 32. Kawakami, K. Tol2: A versatile gene transfer vector in vertebrates. Genome Biology vol.8 S7 (2007). [0668] 33. Arbab, M., Srinivasan, S., Hashimoto, T., Geijsen, N., et al. Cloning-free CRISPR. Stem Cell Reports 5, 908–917 (2015). [0669] 34. Massey, T. H. & Jones, L. The central role of DNA damage and repair in CAG repeat diseases. DMM Disease Models and Mechanisms 11, dmm031930–dmm031930 (2018). [0670] 35. Liu, Y. & Wilson, S. H. DNA base excision repair: A mechanism of trinucleotide repeat expansion. Trends in Biochemical Sciences (2012) doi:10.1016/j.tibs.2011.12.002. [0671] 36. Wright, G. E. B., Black, H. F., Collins, J. A., Gall-Duncan, T., et al. Interrupting sequence variants and age of onset in Huntington’s disease: clinical implications and emerging therapies. The Lancet Neurology 19, 930–939 (2020). [0672] 37. Findlay Black, H., Wright, G. E. B., Collins, J. A., Caron, N., et al. Frequency of the loss of CAA interruption in the HTT CAG tract and implications for Huntington disease in the reduced penetrance range. Genetics in Medicine 22, 2108–2113 (2020). [0673] 38. Rolfsmeier, M. L. & Lahue, R. S. Stabilizing Effects of Interruptions on Trinucleotide Repeat Expansions in Saccharomyces cerevisiae. Molecular and Cellular Biology (2000) doi:10.1128/mcb.20.1.173-180.2000. [0674] 39. McGurk Leeanne, Rifai Olivia M, Sherbakova Oksana, Perlegos Alexandra E., Byrns chiuna N., Carranza, Zhou Henry W., Kim Hyung-Jun, Zhu Yongqing, B. N. M. Toxicity of pathogenic ataxin-2 in Drosophila shows dependence on a pure CAG repeat sequence. Human Molecular Genetics 1–5 (2021). [0675] 40. Koblan, L. W., Doman, J. L., Wilson, C., Levy, J. M., et al. Improving cytidine and adenine base editors by expression optimization and ancestral reconstruction. Nat Biotechnol 36, 843–846 (2018). [0676] 41. Gehrke, J. M., Cervantes, O., Clement, M. K., Wu, Y., et al. An APOBeC3A- Cas9 base editor with minimized bystander and off-target activities. Nature Biotechnology 36, (2018). [0677] 42. Huang, T. P., Zhao, K. T., Miller, S. M., Gaudelli, N. M., et al. Circularly permuted and PAM-modified Cas9 variants broaden the targeting scope of base editors. Nature Biotechnology 1 (2019) doi:10.1038/s41587-019-0134-y. [0678] 43. Komor, A. C., Zhao, K. T., Packer, M. S., Gaudelli, N. M., et al. Improved base excision repair inhibition and bacteriophage Mu Gam protein yields C:G-to-T:A base editors with higher efficiency and product purity. Science Advances 3, eaao4774 (2017). [0679] 44. Thuronyi, B. W., Koblan, L. W., Levy, J. M., Yeh, W.-H., et al. Continuous evolution of base editors with expanded target compatibility and improved activity. Nature Biotechnology 1 (2019) doi:10.1038/s41587-019-0193-0. [0680] 45. Arbab, M., Shen, M. W., Mok, B., Wilson, C., et al. Determinants of Base Editing Outcomes from Target Library Analysis and Machine Learning. Cell 182, 463- 480.e30 (2020). [0681] 46. Miller, S., Wang, T., Randolph, P., Arbab, M., et al. Continuous evolution of SpCas9 variants compatible with non-G PAMs. Nature Biotechnology 38, 471–481 (2020). [0682] 47. Nishimasu, H., Shi, X., Ishiguro, S., Gao, L., et al. Engineered CRISPR-Cas9 nuclease with expanded targeting space. Science 361, 1259–1262 (2018). [0683] 48. Iyer, R. R., Pluciennik, A., Napierala, M. & Wells, R. D. DNA triplet repeat expansion and mismatch repair. Annual Review of Biochemistry vol.84199–226 (2015). [0684] 49. Budworth, H. & McMurray, C. T. A Brief History of Triplet Repeat Diseases. Methods Mol. Biol. (2013). doi:10.1007/978-1-62703-411-1_1. [0685] 50. Bidichandani, S. I. & Delatycki, M. B. Friedreich Ataxia. GeneReviews® (2017). [0686] 51. Cnop, M., Mulder, H. & Igoillo-Esteve, M. Diabetes in Friedreich ataxia. Journal of neurochemistry 126 Suppl, 94–102 (2013). [0687] 52. Pandolfo, M. Friedreich Ataxia: New Pathways. doi:10.1177/0883073812448534. [0688] 53. Reetz, K., Dogan, I., Costa, A. S., Dafotakis, M., et al. Biological and clinical characteristics of the European Friedreich’s Ataxia Consortium for Translational Studies (EFACTS) cohort: a cross-sectional analysis of baseline data. Articles Lancet Neurol 14, 174–182 (2015). [0689] 54. Cook, A. & Giunti, P. Friedreich’s ataxia: clinical features, pathogenesis and management. British Medical Bulletin 124, 19–30 (2017). [0690] 55. Silva, A. M., Brown, J. M., Buckle, V. J., Wade-Martins, R., et al. Expanded GAA repeats impair FXN gene expression and reposition the FXN locus to the nuclear lamina in single cells. Human Molecular Genetics 24, 3457 (2015). [0691] 56. Lazaropoulos, M., Dong, Y., Clark, E., Greeley, N. R., et al. Frataxin levels in peripheral tissue in Friedreich ataxia. Annals of Clinical and Translational Neurology 2, 831– 842 (2015). [0692] 57. Delatycki, M. B. & Bidichandani, S. I. Friedreich ataxia- pathogenesis and implications for therapies. Neurobiology of disease 132, (2019). [0693] 58. Long, A., Napierala, J. S., Polak, U., Hauser, L., et al. Somatic instability of the expanded GAA repeats in Friedreich’s ataxia. PloS one 12, (2017). [0694] 59. Neil, A. J., Hisey, J. A., Quasem, I., McGinty, R. J., et al. Replication- independent instability of Friedreich’s ataxia GAA repeats during chronological aging. Proceedings of the National Academy of Sciences of the United States of America 118, (2021). [0695] 60. De Biase, I., Rasmussen, A., Endres, D., Al-Mahdawi, S., et al. Progressive GAA expansions in dorsal root ganglia of Friedreich’s ataxia patients. Annals of neurology 61, 55–60 (2007). [0696] 61. Pinto, R. M., Arning, L., Giordano, J. V., Razghandi, P., et al. Patterns of CAG repeat instability in the central nervous system and periphery in Huntington’s disease and in spinocerebellar ataxia type 1. Human molecular genetics 29, 2551–2567 (2020). [0697] 62. Kacher, R., Lejeune, F. X., Noël, S., Cazeneuve, C., et al. Propensity for somatic expansion increases over the course of life in huntington disease. eLife 10, (2021). [0698] 63. Swami, M., Hendricks, A. E., Gillis, T., Massood, T., et al. Somatic expansion of the Huntington’s disease CAG repeat in the brain is associated with an earlier age of disease onset. Human molecular genetics 18, 3039–3047 (2009). [0699] 64. Rocca, C. J., Rainaldi, J. N., Sharma, J., Shi, Y., et al. CRISPR-Cas9 Gene Editing of Hematopoietic Stem Cells from Patients with Friedreich’s Ataxia. Molecular Therapy - Methods & Clinical Development 17, 1026–1036 (2020). [0700] 65. Mazzara, P. G., Muggeo, S., Luoni, M., Massimino, L., et al. Frataxin gene editing rescues Friedreich’s ataxia pathology in dorsal root ganglia organoid-derived sensory neurons. Nature Communications 11, 1–18 (2020). [0701] 66. Li, Y., Polak, U., Bhalla, A. D., Rozwadowska, N., et al. Excision of Expanded GAA Repeats Alleviates the Molecular Phenotype of Friedreich’s Ataxia. Molecular Therapy 23, 1055–1065 (2015). [0702] 67. McVey, M. & Lee, S. E. MMEJ repair of double-strand breaks (director’s cut): deleted sequences and alternative endings. Trends in genetics^: TIG 24, 529–538 (2008). [0703] 68. Jeggo, P. A.5 DNA Breakage and Repair. Advances in Genetics 38, 185–218 (1998). [0704] 69. Fishman-Lobell, J., Rudin, N. & Haber, J. E. Two alternative pathways of double-strand break repair that are kinetically separable and independently modulated. Molecular and cellular biology 12, 1292–1303 (1992). [0705] 70. Rudin, N. & Haber, J. E. Efficient repair of HO-induced chromosomal breaks in Saccharomyces cerevisiae by recombination between flanking homologous sequences. Molecular and cellular biology 8, 3918–3928 (1988). [0706] 71. Enache, O. M., Rendo, V., Abdusamad, M., Lam, D., et al. Cas9 activates the p53 pathway and selects for p53-inactivating mutations. Nature genetics 52, 662–668 (2020). [0707] 72. Neil, A. J., Hisey, J. A., Quasem, I., McGinty, R. J., et al. Replication- independent instability of Friedreich’s ataxia GAA repeats during chronological aging. Proceedings of the National Academy of Sciences of the United States of America 118, (2021). [0708] 73. Anzalone, A. V., Randolph, P. B., Davis, J. R., Sousa, A. A., et al. Search- and-replace genome editing without double-strand breaks or donor DNA. Nature (2019) doi:10.1038/s41586-019-1711-4. [0709] 74. Gaudelli, N. M., Komor, A. C., Rees, H. A., Packer, M. S., et al. Programmable base editing of T to G C in genomic DNA without DNA cleavage. Nature 551, 464–471 (2017). [0710] 75. Richter, M. F., Zhao, K. T., Eton, E., Lapinaite, A., et al. Phage-assisted evolution of an adenine base editor with improved Cas domain compatibility and activity. Nature Biotechnology (2020) doi:10.1038/s41587-020-0453-z. [0711] 76. Arbab, M., Shen, M. W., Mok, B., Wilson, C., et al. Determinants of Base Editing Outcomes from Target Library Analysis and Machine Learning. Cell 182, 463- 480.e30 (2020). [0712] 77. Newby, G. A., Yen, J. S., Woodard, K. J., Mayuranathan, T., et al. Base editing of haematopoietic stem cells rescues sickle cell disease in mice. Nature 2021 595:7866595, 295–302 (2021). [0713] 78. Koblan, L. W., Erdos, M. R., Wilson, C., Cabral, W. A., et al. In vivo base editing rescues Hutchinson–Gilford progeria syndrome in mice. Nature 1–7 (2021) doi:10.1038/s41586-020-03086-7. [0714] 79. Nethisinghe, S., Kesavan, M., Ging, H., Labrum, R., et al. Interruptions of the FXN GAA Repeat Tract Delay the Age at Onset of Friedreich’s Ataxia in a Location Dependent Manner. International Journal of Molecular Sciences 2021, Vol.22, Page 7507 22, 7507 (2021). [0715] 80. Al-Mahdawi, S., Ging, H., Bayot, A., Cavalcanti, F., et al. Large interruptions of GAA repeat expansion mutations in Friedreich ataxia are very rare. Frontiers in Cellular Neuroscience 12, (2018). [0716] 81. Khristich, A. N., Armenia, J. F., Matera, R. M., Kolchinski, A. A., et al. Large-scale contractions of Friedreich’s ataxia GAA repeats in yeast occur during DNA replication due to their triplex-forming ability. Proceedings of the National Academy of Sciences of the United States of America 117, 1628–1637 (2020). [0717] 82. Soragni, E., Herman, D., Dent, S. Y. R., Gottesfeld, J. M., et al. Long intronic GAATTC repeats induce epigenetic changes and reporter gene silencing in a molecular model of Friedreich ataxia. Nucleic Acids Research 36, 6056–6065 (2008). [0718] 83. Greene, E., Mahishi, L., Entezam, A., Kumari, D., et al. Repeat-induced epigenetic changes in intron 1 of the frataxin gene and its consequences in Friedreich ataxia. Nucleic Acids Research 35, 3383–3390 (2007). [0719] 84. Li, J., Rozwadowska, N., Clark, A., Fil, D., et al. Excision of the expanded GAA repeats corrects cardiomyopathy phenotypes of iPSC-derived Friedreich’s ataxia cardiomyocytes. Stem cell research 40, 101529 (2019). [0720] 85. Li, Y., Li, J., Wang, J., Zhang, S., et al. Premature transcription termination at the expanded GAA repeats and aberrant alternative polyadenylation contributes to the Frataxin transcriptional deficit in Friedreich’s ataxia. Human molecular genetics (2022) doi:10.1093/HMG/DDAC134. [0721] 86. Jiralerspong, S., Liu, Y., Montermini, L., Stifani, S., et al. Frataxin shows developmentally regulated tissue-specific expression in the mouse embryo. Neurobiology of disease 4, 103–113 (1997). [0722] 87. Puspasari, N., Rowley, S. M., Gordon, L., Lockhart, P. J., et al. Long range regulation of human FXN gene expression. PloS one 6, (2011). [0723] 88. Komor, A. C., Kim, Y. B., Packer, M. S., Zuris, J. A., et al. Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature 533, 420–424 (2016). [0724] 89. Nelson, J. W., Randolph, P. B., Shen, S. P., Everette, K. A., et al. Engineered pegRNAs improve prime editing efficiency. Nature biotechnology 40, 402–410 (2022). [0725] 90. Chen, P. J., Hussmann, J. A., Yan, J., Knipping, F., et al. Enhanced prime editing systems by manipulating cellular determinants of editing outcomes. Cell 184, 5635- 5652.e29 (2021). [0726] 91. Choi, J., Chen, W., Suiter, C. C., Lee, C., et al. Precise genomic deletions using paired prime editing. Nature Biotechnology 202140:240, 218–226 (2021). [0727] 92. Anzalone, A. V., Gao, X. D., Podracky, C. J., Nelson, A. T., et al. Programmable deletion, replacement, integration and inversion of large DNA sequences with twin prime editing. Nature biotechnology 40, (2022). [0728] 93. Tao, R., Wang, Y., Jiao, Y., Hu, Y., et al. Bi-PE: bi-directional priming improves CRISPR/Cas9 prime editing in mammalian cells. Nucleic Acids Research 50, 6423–6434 (2022). [0729] 94. Chen, F., Ding, X., Feng, Y., Seebeck, T., et al. Targeted activation of diverse CRISPR-Cas systems for mammalian genome editing via proximal CRISPR targeting. Nature communications 8, 14958 (2017). [0730] 95. Shishkin, A. A., Voineagu, I., Matera, R., Cherng, N., et al. Large-scale expansions of Friedreich’s ataxia GAA repeats in yeast. Molecular cell 35, 82–92 (2009). [0731] 96. Ohshima, K., Sakamoto, N., Labuda, M., Poirier, J., et al. A nonpathogenic GAAGGA repeat in the Friedreich gene: Implications for pathogenesis. Neurology 53, 1854 (1999). [0732] 97. Sakamoto, N., Larson, J. E., Iyer, R. R., Montermini, L., et al. GGATCC- interrupted Triplets in Long GAATTC Repeats Inhibit the Formation of Triplex and Sticky DNA Structures, Alleviate Transcription Inhibition, and Reduce Genetic Instabilities* Downloaded from. Journal of Biological Chemistry 276, 27178–27187 (2001). [0733] 98. Chandran, V., Gao, K., Swarup, V., Versano, R., et al. Inducible and reversible phenotypes in a novel mouse model of Friedreich’s ataxia. eLife 6, (2017). [0734] 99. Larimar Therapeutics Provides Update on CTI-1601 Clinical Program - Larimar Therapeutics, Inc. investors.larimartx.com/news-releases/news-release- details/larimar-therapeutics-provides-update-cti-1601-clinic al-program. [0735] 100. Belbellaa, B., Reutenauer, L., Messaddeq, N., Monassier, L., et al. High Levels of Frataxin Overexpression Lead to Mitochondrial and Cardiac Toxicity in Mouse Models. Molecular Therapy - Methods and Clinical Development 19, 120–138 (2020). [0736] 101. Huichalaf, C., Perfitt, T. L., Kuperman, A., Gooch, R., et al. In vivo overexpression of frataxin causes toxicity mediated by iron-sulfur cluster deficiency. Molecular Therapy - Methods and Clinical Development 24, 367–378 (2022). [0737] 102. Miller, S. M., Wang, T., Randolph, P. B., Arbab, M., et al. Continuous evolution of SpCas9 variants compatible with non-G PAMs. doi:10.1038/s41587-020-0412- 8. [0738] 103. Walton, R. T., Christie, K. A., Whittaker, M. N. & Kleinstiver, B. P. Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants. Science 368, 290–296 (2020). [0739] 104. Nishimasu, H., Shi, X., Ishiguro, S., Gao, L., et al. Engineered CRISPR-Cas9 nuclease with expanded targeting space. Science 361, 1259–1262 (2018). [0740] 105. Chatterjee, P., Lee, J., Nip, L., Koseki, S. R. T., et al. A Cas9 with PAM recognition for adenine dinucleotides. Nature Communications 11, 1–6 (2020). [0741] 106. Massey, T. H. & Jones, L. The central role of DNA damage and repair in CAG repeat diseases. Disease models & mechanisms 11, dmm031930 (2018). [0742] 107. Liu, Y. & Wilson, S. H. DNA base excision repair: a mechanism of trinucleotide repeat expansion. Trends in Biochemical Sciences 37, 162–172 (2012). [0743] 108. Oura, S., Noda, T., Morimura, N., Hitoshi, S., et al. Precise CAG repeat contraction in a Huntington’s Disease mouse model is enabled by gene editing with SpCas9- NG. Communications Biology 4, (2021). [0744] 109. Kawakami, K. Tol2: A versatile gene transfer vector in vertebrates. Genome Biology vol.8 S7 (2007). [0745] 110. Arbab, M., Srinivasan, S., Hashimoto, T., Geijsen, N., et al. Cloning-free CRISPR. Stem Cell Reports 5, 908–917 (2015). EQUIVALENTS AND SCOPE [0746] In the claims articles such as “a,” “an,” and “the” may mean one or more than one unless indicated to the contrary or otherwise evident from the context. Claims or descriptions that include “or” between one or more members of a group are considered satisfied if one, more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process unless indicated to the contrary or otherwise evident from the context. The invention includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process. The invention includes embodiments in which more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process. [0747] Furthermore, the invention encompasses all variations, combinations, and permutations in which one or more limitations, elements, clauses, and descriptive terms from one or more of the listed claims is introduced into another claim. For example, any claim that is dependent on another claim can be modified to include one or more limitations found in any other claim that is dependent on the same base claim. Where elements are presented as lists, e.g., in Markush group format, each subgroup of the elements is also disclosed, and any element(s) can be removed from the group. It should it be understood that, in general, where the invention, or aspects of the invention, is/are referred to as comprising particular elements and/or features, certain embodiments of the invention or aspects of the invention consist, or consist essentially of, such elements and/or features. For purposes of simplicity, those embodiments have not been specifically set forth in haec verba herein. It is also noted that the terms “comprising” and “containing” are intended to be open and permits the inclusion of additional elements or steps. Where ranges are given, endpoints are included. Furthermore, unless otherwise indicated or otherwise evident from the context and understanding of one of ordinary skill in the art, values that are expressed as ranges can assume any specific value or sub–range within the stated ranges in different embodiments of the invention, to the tenth of the unit of the lower limit of the range, unless the context clearly dictates otherwise. [0748] This application refers to various issued patents, published patent applications, journal articles, and other publications, all of which are incorporated herein by reference. If there is a conflict between any of the incorporated references and the instant specification, the specification shall control. In addition, any particular embodiment of the present invention that falls within the prior art may be explicitly excluded from any one or more of the claims. Because such embodiments are deemed to be known to one of ordinary skill in the art, they may be excluded even if the exclusion is not set forth explicitly herein. Any particular embodiment of the invention can be excluded from any claim, for any reason, whether or not related to the existence of prior art. [0749] Those skilled in the art will recognize or be able to ascertain using no more than routine experimentation many equivalents to the specific embodiments described herein. The scope of the present embodiments described herein is not intended to be limited to the above Description, but rather is as set forth in the appended claims. Those of ordinary skill in the art will appreciate that various changes and modifications to this description may be made without departing from the spirit or scope of the present invention, as defined in the following claims.
Next Patent: HUMAN METAPNEUMOVIRUS ANTIBODIES AND USES THEREOF