Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ENGINEERED BACTERIAL TYROSYL-TRNA SYNTHETASE MUTANTS FOR INCORPORATING UNNATURAL AMINO ACIDS INTO PROTEINS
Document Type and Number:
WIPO Patent Application WO/2023/077058
Kind Code:
A1
Abstract:
Compositions of bacterial tyrosyl-tRNA synthetase mutants that enable site-specific incorporation of numerous unnatural amino acids into proteins expressed in eukaryotic cells or in engineered ATM E. coli and methods of use are described. Compositions of novel unnatural amino acids that can be incorporated using these mutant bacterial tyrosyl-tRNA synthetase are also described.

Inventors:
CHATTERJEE ABHISHEK (US)
GRASSO KATHERINE T (US)
ROY SOUMYA JYOTI SINGHA (US)
Application Number:
PCT/US2022/078860
Publication Date:
May 04, 2023
Filing Date:
October 28, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
TRUSTEES BOSTON COLLEGE (US)
International Classes:
C12N15/52; C12N9/00
Domestic Patent References:
WO2010141851A12010-12-09
WO2020219708A12020-10-29
WO2020219708A12020-10-29
Foreign References:
JP5322195B22013-10-23
US10717975B22020-07-21
US0010717A1854-03-28
US0000975A1838-10-10
Other References:
ITALIA JAMES S ET AL: "Resurrecting the Bacterial Tyrosyl-tRNA Synthetase/tRNA Pair for Expanding the Genetic Code of BothE. coliand Eukaryotes", CELL CHEMICAL BIOLOGY, ELSEVIER, AMSTERDAM, NL, vol. 25, no. 10, 2 August 2018 (2018-08-02), pages 1304, XP085531593, ISSN: 2451-9456, DOI: 10.1016/J.CHEMBIOL.2018.07.002
"Gen Bank", Database accession no. ACB02843
"NCBI", Database accession no. NP_416154.1
CELL CHEM. BIOL., vol. 25, 2018
Attorney, Agent or Firm:
HOGLE, Doreen M. (US)
Download PDF:
Claims:
CLAIMS

What is claimed is:

1 . A composition comprising a variant E.coli tyrosyl-tRNA synthetase (EcTyr- RS) wherein the variant EcTyr-RS preferentially aminoacacylates an E. coli tyrosyl-tRNA (EctRNAtyr) with a tyrosine analog over the naturally-occurring tyrosine amino acid, wherein the variant EcTyr-RS comprises the amino acid sequence of SEQ ID NO: 1 or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO: 1, wherein the EcTyr-RS is mutated relative to SEQ ID NO: 1 at amino acid residues tyrosine (Y) 37, leucine (L) 71, aspartic acid (D) 182, phenylalanine (F) 183, leucine (L) 186 and aspartic acid (D) 265.

2. The composition of claim 1, wherein the variant E. coli tyrosyl-tRNA synthetase comprises SEQ ID NO: 1, or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO: 1, wherein the tyrosine (Y) at position 37 is replaced with glycine (G) or cysteine (C), the leucine (L) at position 71 is replaced with valine (V), cysteine (C) or isoleucine (I), the aspartic acid (D) at position 182 is replaced with cysteine (C) or serine (S), the phenylalanine (F) at position 183 is replaced with tyrosine (Y) or methionine ((M), the leucine (L) at position 186 is replaced with cysteine (C), arginine (R) or glycine (G), the aspartic acid (D) at position 265 is replaced with arginine (R) and the asparagine (N) at position 126 is conserved.

3. The composition of either of claims 1-2, wherein the variant E.coli tyrosyl- tRNA synthetase comprises an amino acid sequence selected from the group consisting of SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO: 4 or SEQ ID NO:5.

4. The composition of any of claims 1-3, wherein the tyrosine analogs are selected from the group consisting of: pBPA, pAEY, pAzAcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate.

5. A polynucleotide encoding the variant E.coli tyrosyl- tRNA synthetase of any of claims 1-3. 6. The polynucleotide of claim 5, wherein the nucleotide sequence comprises SEQ ID NO:6. 7. A cell comprising a variant E.coli tyrosyl-tRNA synthetase (EcTyr-RS), wherein the variant EcTyr-RS preferentially aminoacylates an E.coli tyrosyl- tRNA with a tyrosyl analog, and an orthogonal E.coli-tyrosyl tRNA (Ec- tRNATyr) as a pair, wherein the variant EcTyr-RS comprises the amino acid sequence of SEQ ID NO: 1 or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO: 1, wherein the EcTyr-RS is mutated relative to SEQ ID NO: 1 at amino acid residues tyrosine (Y) 37, leucine (L) 71, aspartic acid (D) 182, phenylalanine (F) 183, leucine (L) 186 and aspartic acid (D) 265. 8. The cell of claim 7, wherein the variant EcTyr-RS comprises the amino acid sequence SEQ ID NO: 1, or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO: 1, wherein the tyrosine (Y) at position 37 is replace with glycine (G) or cysteine (C), the leucine (L) at position 71 is replaced with valine (V), cysteine (C) or isoleucine (I), the aspartic acid (D) at position 182 is replaced with cysteine (C) or serine (S), the phenylalanine (F) at position 183 is replaced with tyrosine (Y) or methionine ((M), the leucine (L) at position 186 is replaced with cy steine (C), arginine (R) or glycine (G), the aspartic acid (D) at position 265 is replaced with arginine (R) and the asparagine (N) at position 126 is conserved. 9. The cell of either of claims 7-8, wherein the variant E.coli tyrosyl-tRNA synthetase comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO: 4 or SEQ ID NO:5.

10. The cell of any of claims 7, 8 or 9, wherein the cell comprises a polynucleotide encoding the amino acid sequence of the variant E.coli tyrosyl-tRNA synthetase according to any of claims 7, 8 or 9. 11. The cell of claim 10, wherein the polynucleotide comprises the sequence of SEQ ID NO:6, or a nucleotide sequence with at least about 90% sequence identity of SEQ ID NO:6. 12. The cell of any of claims 7-11, wherein the cell is an E.coli cell or a eukaryotic cell. 13. The cell of claim 12, wherein the eukaryotic cell is a mammalian cell. 14. The cell of claim 12, wherein the E.coli is the ATMY strain of E.coli cell. 15. The cell of any of claims 7-14, wherein the tyrosine analog is selected from the group consisting of: pBPA, pAEY, pAzAcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate. 16. A method of producing a protein or peptide of interest in a cell with one, or more, tyrosyl analogs at specified amino acid residue positions in the protein or peptide, the method comprising, a. culturing the cell of any of claims 7-14 in a culture medium under conditions suitable for growth, wherein the cell comprises a nucleic acid encoding a protein or peptide of interest with one, or more, selector codons incorporated at the one, or more specified positions in the protein or peptide, wherein the cell further comprises a nucleic acid encoding an Ec- t.RNATyr that. recognizes the selector codon, and b. contacting the cell culture medium with one, or more, tyrosyl analogs under conditions suitable for incorporation of the one, or more, tyrosyl analogs into the protein in response to the selector codon, thereby producing the protein or peptide of interest with one, or more tyrosyl analogs at specified positions in the protein or peptide.

17. The method of claim 16, wherein the tyrosyl analog is selected from the group consisting of: pBPA, pAEY, pAzAcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate. 18. The method of claim 16, wherein the cell is an E. coli cell or a eukaryotic cell. 19. The method of claim 18, wherein the eukaryotic cell is a mammalian cell. 20. The method of claim 18, wherein the E. coli cell is the ATMY4 strain of E.coli cell. 21. The method of claim 16, wherein the cell further comprises a second tRNA/RS pair that is orthogonal to the cell, wherein the second pair does not cross-react with the EcTyr-RS/tRNA pair and that recognizes an amber selector codon in the protein, wherein the protein produced contains one, or more tyrosyl analogs and one, or more, distinct unnatural amino acid other than a tyrosyl analog. 22. A method of site-specifically incorporating one, or more, tyrosyl analogs into a protein or peptide of interest in a cell, the method comprising, a. culturing the cell in a culture medium under conditions suitable for growth, wherein the cell comprises a nucleic acid encoding a protein or peptide of interest with one, or more, amber selector codons incorporated at one, or more at specific sites in the protein or peptide, and wherein the cell further comprises a nucleic acid encoding a variant E.coli tyrosyl-tRNA synthetase (EcTyr-RS), wherein the EcTry-RS preferentially aminoacylat.es an E.coli tyrosyl fRNA (Ec-tRNATyr) that recognizes the amber selector codon, wherein the variant EcTyr-RS comprises the amino acid sequence of SEQ ID NO:1, or an amino acid sequence with at least 90% sequence identity with the full-length SEQ ID NO: 1, wherein the EcTyr-RS is mutated relative to SEQ ID NO:1 at amino acid residues tyrosine (Y) 37, leucine (L) 71, aspartic acid (D) 182, phenylalanine (F) 183, leucine (L) 186 and aspartic acid (D) 265; and b. contacting the cell culture medium with one, or more, tyrosyl analogs under conditions suitable for incorporation of the one, or more, tyrosyl analogs into the protein or peptide at the sites of the selector codon(s), thereby producing the protein or peptide of interest with one, or more site- specifically incorporated tyrosyl analogs. 23. The method of claim 22, wherein the variant E.coli tyrosyl-tRNA synthetase (EcTyr-RS) comprises SEQ ID NO: 1, or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO:1, wherein the tyrosi ition 37 is replace with glycine (G) or cysteine (C), the leucin ion 71 is replaced with valine (V), cysteine (C) or isoleucine (I), the aspartic acid (D) at position 182 is replaced with cysteine (C) or serine (S), the phenylalanine (F) at position 183 is replaced with tyrosine (Y) or methionine ((M), the leucine (L) at position 186 is replaced with cysteine (C), arginine (R) or glycine (G), the aspartic acid (D) at position 265 is replaced with arginine (R) and the asparagine (N) at position 126 is conserved. 24. The method of either of claim 22 or 23, wherein the variant E.coli tyrosyl- tRNA synthetase comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO: 4 or SEQ ID NO:5. 25. The method of any of claims 22-24, wherein the tyrosyl analog is selected from the group consisting of: pBPA, p AEY, pAzAc-F, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate. 26. The method of claim 22, wherein the cell is an E.coli cell or a eukaryotic cell. 27. The method of claim 26, wherein the eukaryotic cell is a mammalian cell. 28. The method of claim 26, wherein the E. coli cell is the ATMY4 strain of E. coll cell.

29. The method of claim 22, wherein the cell further comprises a second tRNA/RS pair that is orthogonal to the cell, wherein the second pair does not cross-react with the EcTyr-RS/tRNA pair and that recognizes an amber selector codon in the protein, wherein the protein or peptide of interest produced contains one, or more tyrosyl analogs, and one, or more, distinct unnatural amino acid residues other than a tyrosyl residue. 30. A kit for producing a protein or peptide of interest in a cell, wherein the protein or peptide comprises one, or more tyrosyl analogs, the kit comprising: a. a container containing a polynucleotide sequence encoding an Ec-tRNATyr that recognizes an amber selector codon; and b. a container containing a polynucleotide encoding a variant E. coli tyrosyl tRNA synthetase that preferentially aminoacylates the Ec-tRNATyr with a tryrosyl analog, wherein the EcTry-RS comprises the amino acid sequence of SEQ ID NO: 1 , or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO: 1, wherein the EcTyr-RS is mutated relative to SEQ ID NO: 1 at amino acid residues tyrosine (Y) 37, leucine (L) 71, aspartic acid (D) 182, phenylalanine (F) 183, leucine (L) 186 and aspartic acid (D) 265. 31. The kit of claim 30, wherein the variant E. coli tyrosyl-tRNA synthetase (EcTyr-RS) comprises SEQ ID NO: 1, or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO: 1, wherein the tyrosine (Y) at position 37 is replace with glycine (G) or cysteine (C), the leucine (L) at position 71 is replaced with valine (V), cysteine (C) or isoleucine (I), the aspartic acid (D) at position 182 is replaced with cysteine (C) or serine (S), the phenylalanine (F) at position 183 is replaced with tyrosine (Y) or methionine ((M), the leucine (L) at position 186 is replaced with cysteine (C), arginine (R) or glycine (G), the aspartic acid (D) at position 265 is replaced with arginine (R.) and the asparagine (N) at position 126 is conserved.

32. The kit of either of claims 30 or 31, wherein the variant E.coli tyrosyl-tRNA synthetase comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO: 4 or SEQ ID NO:5. 33. The kit of any of claims 30-32, wherein the kit further comprises one, or more, tyrosyl analogs, wherein the tyrosyl analog is selected from the group consisting of: pBPA, pAEY, pAzAcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate. 34. The kit of any of claims 30-33, wherein the kit further comprises instructions for producing the protein or peptide of interest. 35. A tyrosine analog selected from the group consisting of: pAEY, pAzAcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate.

Description:
ENGINEERED BACTERIAL. TYROSYL-TRNA SYNTHETASE MUTANTS FOR INCORPORATING UNNATURAL AMINO ACIDS INTO PROTEINS

GOVERNMENT SUPPORT [0001] This invention was made with government support under grant number

R35GM136437, awarded by the National Institutes of Health. The government has certain rights in this invention.

RELATED APPLICATIONS [0002] This application claims the benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application No. 63/263,268, filed on October 29, 2021, which is incorporated herein by reference in its entirety.

REFERENCE TO AN ELECTRONIC SEQUENCE LISTING [0003] The contents of the electronic sequence listing (03420012W01 ST26.xml; Size: 10,485 bytes; and Date of Creation: October 24, 2022) is herein incorporated by reference in its entirety.

FIELD OF THE INVENTION [0004] The present disclosure is directed to the field of biotechnology describing compositions of an engineered tyrosyl-tRNA synthetase used for the site-specific incorporation of a variety of unnatural amino acids into recombinant proteins expressed in live cells.

BACKGROUND OF THE INVENTION [0005] The site-specific incorporation of unnatural amino acids (UAAs) through genetic code expansion (GCE) enables the study and manipulation of protein structure/function by introducing novel chemical modalities (i.e., photoaffinity probes, bioconjugation handles, post-translational modifications). To execute GCE, a non-cross reactive (orthogonal) aminoacyl-tRNA synthetase (aaRS), an orthogonal tRNA, and a blank codon are all needed. Traditionally, an orthogonal aaRS is imported into a host organism from an evolutionarily distant domain of life. Then, the substrate specificity of the orthogonal aaRS is altered using directed evolution. First, rationally selected key amino acid residues in the active site of the orthogonal aaRS are randomized to create a library of mutants. Second, from this pool of mutants, members active for the selective incorporation of the desired UAA are enriched through a double sieve selection scheme followed by their characterization. [0006] Evolution of each new mutant UAA-tRNA synthetase (UAA-RS) is not trivial. This is due to the technical complexities of building an orthogonal aaRS library that is large enough to potentially contain a mutant solution. Some evolved UAA-RS variants are poly specific (able to incorporate more than one UAA that are structurally similar). Polyspecific UAA-RS mutants are advantageous because they circumvent the limitations of having to perform time-consuming and laborious directed evolution experiments targeted to each individual UAAs. Consequently, the identification of new poly specific UAA-RSs significantly broadens the toolbox of incorporable UAAs, enabling innovative applications and manipulations of protein structure.

SUMMARY OF THE INVENTION [0007] A bacteria-derived tyrosyl-tRNA synthetase (TyrRS)/tRNA pair has been previously engineered to site-specifically incorporate a small set of UAAs into proteins expressed in eukaryotic cells such as yeast and mammalian cells. (See for example W02020/219708, the teachings of which are incorporated herein in their entirety.) Recently, altered translational machinery (ATM) E.coli strains have been developed, where the endogenous TyrRS/tRNA pair is substituted with an archaeal counterpart. (See for example, U.S Pat. 10,717,975, the teachings of which are incorporated herein in their entirety.) Since this ATM-Tyr (ATMY) E.coli strain lacks the endogenous bacterial TyrRS/tRNA pair, it can be reintroduced therein for incorporating an unnatural amino acid residue. Consequently, developing engineered mutants of the bacterial TyrRS that charges new UAAs would facilitate their incorporation into proteins expressed both in eukaryotic cells as well as any ATMY E.coli. ][0008] The present invention [0 e0n0c8o]m] passes novel engineered mutants/variants of the bacterial aminoacyl-tyrosyl-tRNA synthetase (TyrRS) that selectively charge a series of unnatural amino acids (UAAs or Uaas, also referred to herein a non-canonical amino acids or ncAAs). With the exception of the analog pBPA, the UAAs described herein are novel. Compositions are described herein comprising a variant E.coli tyrosyl-tRNA synthetase (EcTyr-RS) wherein the EcTyr-RS preferentially aminoacylates an E.coli tiyrosyl tRNA (Ec-tRNA Tyr ) with a tyrosine analog over the naturally-occurring tyrosine amino acid. These Uaas can be site-specifically incorporated into proteins expressed in bacterial or eukaryotic ceils using these engineered TyrRS mutants and their cognate tRNA. More specifically, these UAAs are described herein. [0009] In particular, the current invention encompasses a composition comprising a mutant/variant of E.coli tyrosyl-tRNA synthetase (EcTyrRS) wherein the wild-type EcTyr-RS comprises the amino acid sequences as published in the NCBI database for the K-12 E.coli sitrain (ncbi.nlm.nih.gov/protein/BAE77907), or E.coli sitrain (ncbi.nlm.nih.gov/protein/.BAA 15398.2 K-12 E.coii strain, substrain DH10B (ncbi.nlm.nih.gov/protein and Gen Bank accession number ACB02843) or E.coii strain MG1655 NCBI Reference Sequence NP_416154.1. The variant EcTyr-RS preferentially aminoacylates an E.coli tiyrosyltRNA (EctRNA tyr ) with a tyrosine analog over the naturally-occurring tyrosine amino acid, wherein the variant EcTyr-RS comprises the amino acid sequence of SEQ ID NO: 1, or a homologous EcTyr-RS comprising an amino acid sequence with at least about 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the full-length SEQ ID NO:1 and comprising mutated amino acid positions. The variant EcTyrRS is mutated at critical amino acid residues at active-sites as shown in FIG. 5A. Substitutions of suitable replacement amino acid residues at these sites are also show in FIG. 5B (SEQ ID NOS:2-5, polyTyrRS-9; polyYyrRS-6, polyTyrRS-11 and polyTyrRS-16 respectively).

[0010] Specifically, the variant E.coli tiyrosyl-tRNA synthetase comprises SEQ ID NO:1, or an amino acid sequence with at least about 90% sequence identity with the full- length SEQ ID NO:1, with one, or more mutations at the following positions: Y37; L71; N126; D182 ; F183; L186 or D2651. More specifically, encompassed by the present invention is a variant Ec-tyrosyl-RS wherein the tyrosine (Y) at position 37 is replaced with glycine (G) or cysteine (C), the leucine (L) at position 71 is replaced with valine (V), cysteine (C) or isoleucine (I), the aspartic acid (D) at position 182 is replaced with cysteine (C) or serine (S), the phenylalanine (F) at position 183 is replaced with tyrosine (Y) or methionine ((M), the leucine (L) at position 186 is replaced with cysteine (C), arginine (R) or glycine (G), the aspartic acid (D) at position 265 is replaced with arginine (R) and the asparagine (N) at position 126 is conserved. [0011] Examples of variant amino acid sequences of the EcTyr-RS molecules of the present invention encompassing such amino acid mutations/substitutions/conservations (conserved amino acid residues that are the same as in the wild type EcTyr-RS) are shown in FIG. 5B comprising SEQ ID NO:2 through SEQ ID NO:5. Specifically, as shown in FIG. 5B, poly-TyrRS-9 comprises SEQ ID NO: 1 with the following mutations/substitutions/conservations: Y37G; L71V; N126N; D182C; F 183Y; L186C and D265R designated SEQ ID NO:2. [0012 ] Poly-TyrRS-6 of FIG. 5B comprises SEQ ID NO: 1 with the following mutations/conserved amino acid residues: Y37C; L71C; N126N; D182S; D183M; L186A and D265R and designated SEQ ID NO:3. [0013 ] Poly-TyrRS-11 of FIG. 5B comprises SEQ ID NO: 1 with the following mutations/substitutions/conservations:Y37G; L71V; N126N; D182S, F 183M; L186C; and D265R designated as SEQ ID NO: 4. [0014 ] Poly-TyrRS-16 of FIG. 5B comprises SEQ ID NO: 1 with the following mutations/substitutions/conservations: Y37C; L71I, N126N; D182S; F183M, L186G and D265R designated as SEQ ID NO:5. [0015 ] Importantly, the Ec-tyrosyl-RS variants of the present invention are polyspecific-that is, the variant Tyr-RS enables site-specific incorporation of numerous unnatural amino acids (e.g., as shown in FIG. 1) in proteins of interest expressed in eukaryotic cells or in the genetically-engineered ATM E.coli as described herein. The mutant poly specific EcTyr-tRNA-synthetases as described herein can incorporate several UAAs (non-sulfonated tyrosine analogs) into proteins expressed in bacteria and eukaryotes with high fidelity and efficiency (hence the term “polyspecific”). Examples of such tyrosine analogs are selected from the group consisting of: pBPA, pAEY, pAzAcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate (see FIG.1). More specifically, the novel UAAs are selected from the group consisting of: pAEY, pAz.AcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate. [0016 ] The Tyr-tRNA synthetases encompassed by the present invention further include homologous archaeal or bacteria-derived Tyr-RNA synthetases with active-site residues substituted with mutations as described herein. The homologous Tyr-RS comprises an amino acid sequence with at least about 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% sequence identity with the full-length SEQ ID NO:1. The homologous Tyr-RS can be mutated at its active-site residues corresponding to the mutations as described herein for the E.coli Tyr-RS, for example at positions: Y37; L71; N126; D182; F183; L186 or D2651. [0017 ] Such homologous TyrRS genes can be identified by techniques known to those of skill in the art, for example by performing sequence identity/homology searches of TyrRS genetic sequence databases to identify TyrRS gene sequences with, for example, about 80% sequence identity; about 85% sequence identity; about 90% sequence identity; about 95% sequence identity or greater than about 95% sequence identity, which are substantially homologous, or highly homologous to the E.coli TyrRS described herein. Such homologous TyrRS genes suitable for use as described herein may contain sequence variation from the E. coli Tyr-RS wherein such sequence variations do not affect the functionality (aminoacyl activity) of the RNA synthetase. Such nucleotide variations can be defined as conservative sequence variations or substitutions. [0018 ] It is noted that the E.coli tyrosyl-tRNA synthetases and O-sulfotyrosine analogs as described in WO 2020/219708 are not encompassed by the present invention and are specifically excluded. [0019 ] Also encompassed by the present invention are polynucleotide sequences that encode the variant polyspecific tyrosyl-tRNA synthetases as described herein, and polynucleotide sequences that hybridize to those polynucleotide sequences under highly stringent conditions over substantially the entire length of the nucleotide sequence, as well as the polypeptides encoded by the polynucleotides. For example, the nucleotide sequence SEQ ID NO:6 encodes the variant “polyTyrRS-9” as described herein. [0020 ] The present invention further encompasses tRNA compositions wherein the tRNA anti-codon loop is modified (e.g., mutated) to specifically bind to (e.g., recognize) an amber (UAG/TAG) codon as described herein. tRNA compositions comprising the ochre codon (TAA/UAA) or the opal codon (UGA/TGA) are also encompassed by the present invention. In particular, the present invention encompasses compositions wherein the tRNA is the E.coli tyrosyl tRNA, or another homologous bacteria-derived tRNA, wherein the polynucleotide sequence comprises, for example, SEQ ID NO:7, or a tyrosyl- tRNA sequence with about 80%; about 85%; about 90%, about 95% or greater than about 95% sequence identity with SEQ ID NO:7, and with an anti-codon loop comprising a sequence that specifically binds to a selector sequence of an mRNA comprising an amber codon. Importantly, the tRNA EcTyr UAG described herein is an amber suppressor suitable for use in both genetically -engineered bacteria and eukaryotes. For E.coli tyrosyl tRNA sequences suitable for use in this invention, also see, for example, sequences described in Cell Chem. Biol. 25, 13-4-1312 (2018).

[ 0021 ] It i s important to note that, the modified tRNA of E. coli, or a homologous bacteria-derived tRNA, can be combined with an RNA synthetase of another homologous bacteria-derived RNA synthetase to produce novel combinations for unnatural amino acid, e.g., tyrosine analog, incorporation into proteins. Additionally, a combination of two distinct tRNA-RS/tRNA pairs can be combined. For example, the EcTyr-RS/tRNA pair described herein can also be combined with other suitable tRNA-RS/tRNA pairs which respond to other codons such as an opal suppressor, to site-specifically incorporate two distinct unnatural amino acids into polypeptide/proteins expressed in eukaryotic cells. [0022 ] The present invention also encompasses the following novel UAAs as shown in FIG. 1 : pAEY, pAzAcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate. The variant EcTyr-RS enzymes of the present invention preferentially aminoacylate a corresponding E. coli tyrosyl-tRNA with these novel tyrosine analogs over the naturally-occurring tyrosine amino acid reside. [0023 ] Also encompassed by the present invention are bacterial or eukaryotic cells comprising the aminoacyl-Tyr-RS variants described herein, and in particular, comprising nucleic acid sequences encoding these Tyr-RS variants. In one embodiment herein is a cell (as used herein the singular term ‘"cell” also encompasses the plural “cells”) comprising a variant E. coli tyrosyl-tRN A synthetase (EcTyr-RS), wherein the variant EcTyr-RS preferentially aminoacylates an E. coli tyrosyl-tRNA with a tyrosyl analog, and an orthogonal E. coli-tyrosyl tRNA (Ec-tRNA Tyr ) as a pair, wherein the variant EcTyr-RS comprises the amino acid sequence of SEQ ID NO: 1 or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO: 1, wherein the EcTyr- RS is mutated relative to SEQ ID NO:1 at amino acid residues tyrosine (Y) 37, leucine (L) 71, aspartic acid (D) 182, phenylalanine (F) 183, leucine (L) 186 and aspartic acid (D) 265 and N126 is conserved. [0024 ] More specifically, the cell comprises a variant EcTyr-RS comprising the amino acid sequence SEQ ID NO:1, or an amino acid sequence with at least about 90% sequence identity' with the full-length SEQ ID NO:1, wherein the tyrosine (Y) at position 37 is replace with glycine (G) or cysteine (C), the leucine (L) at position 71 is replaced with valine (V), cysteine (C) or isoleucine (I), the aspartic acid (D) at position 182 is replaced with cysteine (C) or serine (S), the phenylalanine (F) at position 183 is replaced with tyrosine (Y) or methionine ((M), the leucine (L) at position 186 is replaced with cysteine (C), arginine (R) or glycine (G), the aspartic acid (D) at position 265 is replaced with arginine (R) and the asparagine (N) at position 126 is conserved. For example, the cell as described herein comprises a variant E. coli tyrosyl-tRNA synthetase comprising an amino acid sequence selected from the group consisting of: SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO: 4 or SEQ ID NO:5. [0025 ] The cells of the present invention can be bacterial or eukaryotic cells. In one embodiment the eukaryotic cell is a mammalian cell. In another embodiment the bacterial cell is E. coli. E. coli cells encompassed herein include any ATMY (altered translational machinery-tyrosine) E.coli and in particular, the ATMY4 strain of E. coli.

[ 0026] Also encompassed is a cell, wherein the cell comprises a polynucleotide encoding the amino acid sequence of the variant E.coli tyrosyl-tRNA synthetase as descried herein. For example, the polynucleotide comprises the sequence of SEQ ID NO:6, or a nucleotide sequence with at least about 90% sequence identity of SEQ ID NO:6. [0027 ] Cells encompassed by the present invention also include the additional components/nucleic acid or amino acid sequences required for the cells to produce/ express the proteins comprising the UAAs described herein. Such components are known to those of skill in the art and can be as described in U.S Pat. 10,717,975, the teachings of which are incorporated herein in their entirety.

[ 0028 ] Further encompassed by the present invention are methods of producing proteins incorporating unnatural amino acids in a site-specific manner using a variant Tyr- RS as described herein. The steps of a method of producing a protein or peptide of interest in a cell with one, or more, tyrosyl analogs at specified amino acid residue positions in the protein or peptide, comprises the following: a) culturing the cell comprising a nucleic acid encoding a variant E.coli tyrosyl-RS as described herein in a culture medium under conditions suitable for growth, wherein the cell comprises a nucleic acid encoding a protein or peptide of interest with one, or more, amber selector codons incorporated at the one, or more specified positions in the protein or peptide, wherein the cell further comprises a nucleic acid encoding an Ec-tRNA Tyr that recognizes the selector codon, and b) contacting the cell culture medium with one, or more, tyrosyl analogs under conditions suitable for incorporation of the one, or more, tyrosyl analogs into the protein in response to the selector codon, thereby producing the protein or peptide of interest with one, or more tyrosyl analogs at specified positions in the protein or peptide.

[ 0029 ] Specifically encompassed herein is a method of site-specifically incorporating one, or more, tyrosyl analogs into a protein or peptide of interest in a cell, the method comprising the steps of: a) culturing the cell in a culture medium under conditions suitable for growth, wherein the cell comprises a nucleic acid encoding a protein or peptide of interest with one, or more, amber selector codons incorporated at one, or more at specific sites in the protein or peptide, and wherein the cell further comprises a nucleic acid encoding a variant E.coli tyrosyl-tRNA synthetase (EcTyr-RS), wherein the EcTry-RS preferentially aminoacylates an E.coli tyrosyl tRNA (Ec-tRNA Tyr ) that recognizes the amber selector codon, wherein the variant EcTyr-RS comprises the amino acid sequence of SEQ ID NO: 1, or an amino acid sequence with at least 90% sequence identity with the full- length SEQ ID NO: 1, wherein the EcTyr-RS is mutated relative to SEQ ID NO: 1 at amino acid residues tyrosine (Y) 37, leucine (L) 71, aspartic acid (D) 182, phenylalanine (F) 183, leucine (L) 186 and aspartic acid (D) 265; and b) contacting the cell culture medium with one, or more, tyrosyl analogs under conditions suitable for incorporation of the one, or more, tyrosyl analogs into the protein or peptide at the sites of the selector codon(s), thereby producing the protein or peptide of interest with one, or more site-specifically incorporated tyrosyl analogs. [0030 ] In particular, the variant E. coli tyrosyl-tRNA synthetase (EcTyr-RS) comprises SEQ ID NO.T, or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO:1, wherein the tyrosine (Y) at position 37 is replace with glycine (G) or cysteine (C), the leucine (L) at position 71 is replaced with valine (V), cysteine (C) or isoleucine (I), the aspartic acid (D) at position 182 is replaced with cysteine (C) or serine (S), the phenylalanine (F) at position 183 is replaced with tyrosine (Y) or methionine ((M), the leucine (L) at position 186 is replaced with cysteine (C), arginine (R) or glycine (G), the aspartic acid (D) at position 265 is replaced with arginine (R) and the asparagine (N) at position 126 is conserved. [0031 ] For example, the variant E.coli tyrosyl-tRNA synthetase can comprise an amino acid sequence selected from the group consisting of: SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO: 4 or SEQ ID NO:5 and the tyrosyl analog is selected from the group consisting of: pBPA, pAEY, pAzAcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate.

[ 0032 ] As described herein, the cell can be an E. coli cell or a eukaryotic cell. In one embodiment the eukaryotic cell is a mammalian cell. In another embodiment the E. coll cell is the ATMY strain of E.coli cell (for example as described in U.S Pat. 10,717,975, the teachings of which are incorporated herein in their entirety). In particular, the strain of E. coli cell is ATMY4, as described herein. [0033 ] Also encompassed by the present invention are methods of incorporating two, or more unnatural amino acids at specified positions in a polypeptide/protein expressed in a cell. In these methods the cell further comprises a second tRNA/RS pair that is orthogonal to the cell, wherein the second pair recognizes a selector codon in the protein but does not cross-react with the first RS/tRNA pair (e.g., EcTyr-RS/tRNA tyr ). The method is performed as above (or in a similar manner) wherein the protein expressed/produced contains one, or more tyrosine analogs and one, or more, distinct unnatural amino acid other than a tyrosine analog incorporated by the first RS/tRNA pair. [0034 ] Further encompassed by the present invention are kits comprising variant Ec- tyrosyl-RS molecules as described herein. Specifically encompassed herein is a kit for producing a protein or peptide of interest in a cell, wherein the protein or peptide comprises one, or more tyrosy l analogs. The components of the kit comprise a container containing a polynucleotide sequence encoding an Ec-tRNA Tyr that recognizes an amber selector codon; and a container containing a polynucleotide encoding a variant E. coli tyrosyl tRNA synthetase that preferentially aminoacylates the Ec-tRNA Tyr with a tryrosyl analog, wherein the EcTry-RS comprises the amino acid sequence of SEQ ID NO:1, or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO.T, wherein the EcTyr-RS is mutated relative to SEQ ID NO:1 at amino acid residues tyrosine (Y) 37, leucine (L) 71, aspartic acid (D) 182, phenylalanine (F) 183, leucine (L) 186 and aspartic acid (D) 265. Also included in the kit are buffers, diluents and other components necessary for producing a protein or peptide of interest with site-specific incorporation of one, or more UAAs as described herein. [ 0035 ] In one embodiment, the kit. comprises a nucleic acid encoding a variants, coli tyrosyl -tRNA synthetase (EcTyr-RS) comprising SEQ ID NO:1, or an amino acid sequence with at least about 90% sequence identity with the full-length SEQ ID NO:1, wherein the tyrosine (Y) at position 37 is replace with glycine (G) or cysteine (C), the leucine (L) at position 71 is replaced with valine (V), cysteine (C) or isoleucine (I), the aspartic acid (D) at position 182 is replaced with cysteine (C) or serine (S), the phenylalanine (F) at position 183 is replaced with tyrosine (Y) or methionine ((M), the leucine (L) at position 186 is replaced with cysteine (C), arginine (R) or glycine (G), the aspartic acid (D) at position 265 is replaced with arginine (R) and the asparagine (N) at position 126 is conserved. An example of such a nucleic acid is SEQ ID NO: 6 encoding the poly specific EcTyr-RS-9 (SEQ ID NO: 2). In another embodiment, the kit comprises a variant E.coli tyrosyl-tRNA synthetase comprising an amino acid sequence selected from the group consisting of: SEQ ID NO:2; SEQ ID NO:3, SEQ ID NO: 4 or SEQ ID NO:5. [0036 ] The kit further comprises one, or more, tyrosyl analogs, wherein the tyrosyl analog is selected from the group consisting of: pBPA, pAEY, pAzAcF, pAzPrAmF, alkene amide, pAcrF, pAlkAcF, pCAcF, pFAcF and propargyl carbamate. The kit can further comprise instructions for producing the protein or peptide of interest. [0037 ] Some of the Uaas incorporated into the proteins using the poly specific tyrosyl- RS variants described herein can further comprise bioconjugation handles that allow site- specific labeling of recombinant proteins. These can be used to generate therapeutically relevant protein conjugates (e.g., antibody-drug conjugates) or research reagents (e.g., antibody-fluorophore conjugates). Others are electrophilic amino acids that can create crosslink with natural nucleophilic amino acids, which is useful for many applications, including the ability to make genetically encoded cyclic peptides in cells. Such genetically encoded cyclic peptides can be evolved to develop novel therapeutics. [ 0038 ] Embodiments of the di sclosure demonstrate features and advantages that will become apparent to one of ordinary skill in the art upon reading the attached Detailed Description.

BRIEF DESCRIPTION OF THE DRAWINGS [0039 ] In the accompanying drawings, reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale; emphasis has instead been placed upon illustrating the principles of the invention. Of the drawings: [0044 ] FIG. 1 . The structures of the UAAs that can be incorporated with the poly specific TyrRS. [0041 ] FIG. 2. The poly specific TyrRS/tRNA pair suppression activity in engineeredATMY E. coli ceils for the multiple UAAs that it incorporates, as can be employed in some embodiments. The activity can be measured in embodiments by co-expressing the polyspecific TyrRS/tRNA pair along with a sf'GFP reporter harboring a TAG codon at a permissive site. The suppression activity can be evaluated as the full-length sfGFP expression measured using its characteristic fluorescence. [0042 ] FIG. 3. Yields of sf'GFP and EGFP reporters as can be employed in some embodiments. The sfGFP expressions can be carried out as described in the FIG. 2. description. The EGFP expressions can be carried out by co-expressing the polyspecific tyrosyl-tRNA synthetase/tRNA pair in HEK293T cells along with a EGFP reporter harboring a TAG codon. The expressed GFP proteins can then be purified by affinity column chromatography and their yields can be calculated by Bradford assay. [0043 ] FIG. 4A-N. MS-analysis of the GFP reporter proteins confirming incorporation of all of the UAAs. (A) sfGFP-pBPA (B) EGFP-pBPA (C) sfGFP- LGA (D) EGFP-LCA (E) sfGFP-alpha (F) EGFP-alpha (G) sfGFP-beta (H) EGFP-beta (I) sfGFP-Michael (J) EGFP-Michael (K) sfGFP-alkyne (L) EGFP-alkyne (M) sf'GFP-chlor (N) sfGFP-fluoro. [0044 ] FIG. 5A-B. FIG. 5 A shows the active site of the polyspecific TyrRS in the crystal structure (PDB: 1X8X) and the key residues that may be randomized in the active site of E. coli TyrRS to potentially generate polyspecific variants, as may be employed in some embodiments. FIG. 5B shows a table of the key mutations that resulted in poly specific activities and the sequences of variant poly-tyrosyl-RS proteins (SEQ ID NOS: 2-5). [0045 ] FIG. 6. Shows the nucleotide sequence of the poly specific EcTyrRS, poly- TyrRS-9 with mutation sites highlighted in red. [0046] FIG. 7 Shows the amino acid sequence of the wild-type EcTyr-RS (SEQ ID NO.T). [00471 FIG. 8 Shows the nucleotide sequence of an E. coli Tyr-tRNA (SEQ ID NO:7) suitable for use in the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS [0048 ] The invention now will be described more fully hereinafter with reference to the accompanying drawings, in which illustrative embodiments of the invention are shown. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.

[ 0049 ] As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items. Also, all conjunctions used are to be understood in the most inclusive sense possible. Thus, the word "or" should be understood as having the definition of a logical "or" rather than that of a logical "exclusive or" unless the context clearly necessitates otherwise. Further, the singular forms and the articles "a", "an" and "the" are intended to include the plural forms as well, unless expressly stated otherwise. It will be further understood that the terms: includes, comprises, including and/or comprising, when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof Further, it will be understood that when an element, including component or subsystem, is referred to and/or shown as being connected or coupled to another element, it can be directly connected or coupled to the other element or intervening elements may be present.

[ 0050 ] It will be understood that although terms such as “first” and “second” are used herein to describe various elements, these elements should not be limited by these terms. These terms are only used to distinguish one element from another element. Thus, an element discussed below could be termed a second element, and similarly, a second element may be termed a first element without departing from the teachings of the present invention. [0051 ] Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein. [0052 ] The following specific embodiments and examples are to be construed as merely illustrative and not limitative of the remainder of the disclosure in any way whatsoever. Embodiments of the disclosure demonstrate features and advantages that will become apparent to one of ordinary skill in the art upon reading the attached Detailed Description.

[ 0053] The present invention encompasses genetically-engineered mutants/variants of a bacteria-derived tyrosyl-tRNA synthetase (TyrRS) for the efficient incorporation of a variety of UAAs (also referred to herein as ncAAs, non-canonical amino acids) into a protein, polypeptide or peptide of interest, that can be expressed in a bacterial cell or a eukaryotic cell. As described herein, these mutant/variant poly specific synthetases can incorporate several novel UAAs into proteins or peptides expressed in eukaryotes in a sitespecific manner with high fidelity and efficiency. Thus, the present invention, as described herein, enables the expression of eukaryotic proteins or peptides in eukaryotic cells with precisely installed (e.g., incorporated) UAAs at specified positions within the expressed protein or peptide. The ability to incorporate an UAA into virtually any site of any protein or peptide in bacterial and eukaryotic cells offers intriguing opportunities for novel synthetic biology applications.

[ 0054 ] To produce the variant Ec-tyrosyl-tRNA synthetases described herein, a library was made to genetically-engineer polyspecific tyrosyl-tRNA synthetase variants wherein key active site amino acid residues are randomized (Tyr37, Leu71, Asnl26, Asp 182, Phe183, Leu186) and can also introduce the point mutation D265 (where in the specific numbering corresponds to the wild-type E. coli tyrosyl-tRNA synthetase (SEQ ID NO:1). (See for example, W02020/219708 and U.S. Pat.No.10,717, 975, the teachings of which are incorporated herein in their entirety.) The library can be created through site-saturation mutagenesis and can be subjected to a selection scheme in engineered ATM E. coli, yeast, or another suitable host cell, to identify those capable of charging target UAAs with acceptable levels of fidelity and efficiency. Following the selection, individual library members can be characterized for their ability to incorporate the target UAA in response to a nonsense or frameshift codon at various reporter proteins, including, but not limited to chloramphenicol acetyl transferase, GFP, luciferase, etc. Evaluation of library members as described above can identify suitable polyspecific bacteria-derived TyrRS variants that can selectively incorporate multiple different UAAs. [0055 ] To evaluate the poly specificity of the engineered tyrosyl-tRNA synthetase, expression of a suitable reporter protein, such as GFP, harboring a nonsen se/frameshift codon can be employed in any ATMY E. coli, HEK293T, or other suitable host cells. The suppression activity of the poly specific TyrRS can be evaluated based on the corresponding reporter protein selectively in the presence of an UAA. The expressed reporter proteins can then be isolated and characterized by mass spectrometry to confirm the incorporation of desired UAAs. [0056] This engineered TyrRS mutant can be used in any eukaryotic cell along with the appropriate cognate tRNA (suppressing a nonsense or frameshift codon, or a codon composed of one or more non-natural nucleobases). Such expression hosts include, but are not limited to, yeast, insect cells, and mammalian cells. Additionally, this TyrRS/tRNA pair can be used for UAA incorporation in engineered ATM E.coli strains, where this bacterial pair has been functionally replaced with a eukaryotic or archaeal counterpart. [0057 ] The substrate UAAs of the polyspecific TyrRS variants include those with a bioconjugation handle such as alkynes, azides, ketones, cyclopropenes, etc. Site-specific incorporation of these UAAs into proteins will enable their subsequent site-specific labeling through bioorthogonal conjugation reactions. This strategy can be used to generate homogeneous, site-specific conjugates of recombinant proteins, including, but not limited to, antibody-drug conjugates, protein conjugates with various biophysical/biochemical probes such as fluorophores, protein-protein conjugates, protein-nucleic acid conjugates, protein-small molecule conjugates, protein-peptide conjugates, etc. [0058 ] The substrate UAAs of the poly specific TyrRS variants include those with electrophilic groups such as a, p-unstaurated carbonyls, halogenated amino acids, etc., which can react with natural amino acid residues in a proximity-dependent manner. Incorporation of such amino acids into proteins can enable covalent capture of a nearby interaction partner. This strategy can also enable the development of proteins (including but not limited to antibodies, protein and peptide ligands for various receptors) that, covalently associate with a target for therapeutic or diagnostic applications. [0059 ] It should be noted that the embodiments described herein should not be limited to these specific bacterial-derived synthetase scaffolds. This disclosure and its embodiments provide a general engineered synthetase active site (Figures 5A-B) that can be used for the incorporation of numerous UAAs in both bacterial and mammalian cells (Figures 1-4); the cell type, synthetase scaffold, and incorporable UAAs can vary as the technology in this field and in this work advance. Using the techniques described herein, other bacterial tyrosyl-tRNA synthetases can be engineered for UAA incorporation in either bacterial or eukaryotic cells and can include mutations at the following active acid residues that corresponds to the A. coli tyrosyl-tRNA synthetase: Y37; L71; D182; F183; L186; and D265 (Figures 5-6).

[ 0060 ] While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims.