Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
PRODUCTION OF SELENOPROTEINS (SELPROT)
Document Type and Number:
WIPO Patent Application WO/2015/005756
Kind Code:
A1
Abstract:
A general method is proposed that permits incorporation of a photocaged selenocysteine residue in any desired position(s) of a protein utilizing UAG codon suppression by a genetically modified yeast strain. The protein carrying the selenocysteine residue(s) caged with a photoreactive nitrobenzyl group is expressed and purified from the yeast cells. The caging group is removed by illumination with UV light leaving the selenocysteine in place (inside cells or in purified protein preparation). The proposed method has a potential to simplify the synthesis of selenoproteins and therefore expand their biotechnological and biopharmaceutical utilization.˙

Inventors:
KLIMAŠAUSKAS SAULIUS (LT)
RAKAUSKAITĖ RASA (LT)
MASEVIČIUS VIKTORAS (LT)
Application Number:
PCT/LT2014/000009
Publication Date:
January 15, 2015
Filing Date:
July 08, 2014
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV VILNIUS (LT)
International Classes:
C07K14/00; A61K41/00; C12N9/99
Other References:
WANG LEI ET AL: "Expanding the genetic code", ANGEWANDTE CHEMIE INTERNATIONAL EDITION, WILEY - V C H VERLAG GMBH & CO. KGAA, DE, vol. 44, no. 1, 17 December 2004 (2004-12-17), pages 34 - 66, XP002548374, ISSN: 1433-7851, [retrieved on 20041215], DOI: 10.1002/ANIE.200460627
YOUNG TRAVIS S ET AL: "Expanding the Genetic Repertoire of the Methylotrophic Yeast Pichia pastoris", BIOCHEMISTRY, AMERICAN CHEMICAL SOCIETY, US, vol. 48, no. 12, 31 March 2009 (2009-03-31), pages 2643 - 2653, XP002631536, ISSN: 0006-2960, [retrieved on 20090305], DOI: 10.1021/BI802178K
WU NING ET AL: "A genetically encoded photocaged amino acid", JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, AMERICAN CHEMICAL SOCIETY, vol. 126, no. 44, 10 November 2004 (2004-11-10), pages 14306 - 14307, XP002459777, ISSN: 0002-7863, [retrieved on 20041019], DOI: 10.1021/JA040175Z
QING SHAO ET AL: "Photoactive molecules for applications in molecular imaging and cell biology", CHEMICAL SOCIETY REVIEWS, vol. 39, no. 8, 1 January 2010 (2010-01-01), pages 2835, XP055157312, ISSN: 0306-0012, DOI: 10.1039/b915574k
D. GFELLER ET AL: "SwissSidechain: a molecular and structural database of non-natural sidechains", NUCLEIC ACIDS RESEARCH, vol. 41, no. D1, 26 October 2012 (2012-10-26), pages D327 - D332, XP055157331, ISSN: 0305-1048, DOI: 10.1093/nar/gks991
RASA RAKAUSKAITE ET AL: "Production of selenoproteins in yeast via genetically encoded incorporation of a photocaged selenocysteine", NEW BIOTECHNOLOGY, vol. 31, 1 July 2014 (2014-07-01), pages S202, XP055157074, ISSN: 1871-6784, DOI: 10.1016/j.nbt.2014.05.969
Attorney, Agent or Firm:
PETNIŪNAITĖ, Jurga (J. Jasinskio g. 16A, Vilnius, LT)
Download PDF:
Claims:
CLAIMS

1. A method for producing in a translation system a protein comprising an unnatural amino acid at one or more selected positions, the method comprising:

(c) providing a translation system comprising:

(j) an unnatural amino acid that is photocaged selenocysteine;

(ii) an orthogonal aminoacyl-tRNA synthetase (O-RS);

(iii) an orthogonal tRNA (O-tRNA), wherein said O-RS preferentially aminoacylates said O-tRNA with said photocaged selenocysteine; and,

(iv) a nucleic acid encoding said protein, wherein said nucleic acid comprises at least one selector codon that is recognized by said O-tRNA; and,

(d) incorporating said unnatural amino acid at said selected position in said protein during translation of said protein in response to said selector codon, thereby producing said protein comprising said unnatural amino acid at the selected position.

2. A method of claim 1, wherein a photocaged selenocysteine comprises formula I:

wherein

R is H, alkyl or alkoxy, carboxymethoxy group,

R1 is H, alkyl, aryl, alkoxy, hydroxy, or a halide group,

2 3

R and R each independently represent H, alkyl, aryl, alkoxy, hydroxy group, C02R or halide; wherein when R and R2 or R and R3 is alkoxy, R2 or R3 may form a five, six or seven membered cyclic structure in combination with R, is H, methyl, alkyl, CH2OR5 or C02R5

is H or alkyl.

3. A method of claim 2, wherein the photocaged selenocysteine is 4,5-dimethoxy-2- nitrobenzylselenocysteine.

4. The method of claim 1, wherein said translation system comprises a polynucleotide encoding said O-tRNA fused to the SNR52 promoter.

5. The method of claim 1 , wherein the selector codon recognized by said O-tRNA is UAG nonsense codon.

6. The method of claim 1, wherein said nucleic acid comprises at least one TAG nonsense codon.

7. The method of claim 1, wherein said translation system comprises a nucleic acid that encodes a mutated orthogonal LeuRS.

8. The method of any of preceding claims, wherein said translation system comprises a host cell comprising said unnatural amino acid, said O-RS and said O-tRNA.

9. The method of claim 8, wherein said host cell is a bacterial, yeast or mammalian host cell.

10. The method of claim 9, wherein said host cell is a yeast host cell.

1 1. The method of claim 10, wherein said yeast host cell is a Saccharomyces cerevisiae host cell.

12. The method of claim 8 - 1 1 , wherein said host cell further comprises a knockout or deletion of UPF1 gene.

13. The method of claim 1, wherein said method further comprises step (c) removing the photocaging group by exposing translation system or purified protein solution to UV light.

14. The method of claims 1-13 wherein said protein is further used for light-controlled protein dimerization.

15. The method of claims 1-14 wherein said protein is further used for selective conjugation, targeted labeling or other chemical modification.

16. A photocaged selenocysteine of the formula I:

wherein

R is H, alkyl or alkoxy, carboxymethoxy group,

R1 is H, alkyl, aryl, alkoxy, hydroxy, or a halide group,

R2 and R3 each independently represent H, alkyl, aryl, alkoxy, hydroxy group, C02R5 or halide; wherein when R and R2 or R and R3 is alkoxy, R2 or R3 may form a five, six or seven membered cyclic structure in combination with R,

R4 is H, methyl, alkyl, CH2OR5 or C02R5

R5 is H or alkyl.

17. A process for producing a selenocysteine of claim 16, which comprises the steps of: reducing of Ν,Ν'-blocked selenocystine and reacting with a compound selected from a corresponding substituted (at positions 1,3,4,5 and/or alpha) 2-nitrobenzyl halide, sulfonate, phosphate or carboxylate, followed by de-protection of amino groups.

18. A process of claim 17, wherein selenocystine is blocked with a group selected from butoxycarbonyl, carboxy benzyl, allyloxycarbonyl, metoxy or ethoxycarbonyl.

19. A photocaged selenocysteine of claim 16, wherein the compound is used in a method of claim 1. 20. A protein obtained by the method of claims 1 -15.

21. A protein of claim 20, wherein said protein is EGFP.

22. A protein of claim 20 in which catalytic or functionally important Cys or Ser is replaced with Sec.

23. Use of the method of claim 1 for production of natural recombinant selenoproteins.

24. Use of the method of claim 1 for production of artificially modified proteins in which one or several Sec residues are inserted in desired positions.

25. Use of the method according to claim 24, wherein one or several Sec residues are inserted at selected positions in or around the active centre. 26. The use of the method according to claim 25, wherein Sec replaces catalytic Cys or Ser in said selected positions.

27. The use of the method according to claim 26, wherein said replacement is performed in an enzyme selected from the group of thiol oxidoreductases, protein-tyrosine phosphatases, Cys peptidases and one carbon pyrimidine-5 transferases.

Description:
PRODUCTION OF SELENOPROTEINS (SelProt)

Field of the Invention The present invention relates to targeted incorporation of rare and unnatural amino acids into proteins, in particular to the use of in vivo technology utilizing UAG codon suppression by genetically encoded unnatural amino acids for recombinant protein synthesis.

Background of the Invention

Engineering and in-cell production of recombinant proteins is widely exploited for structural and functional studies and for numerous practical applications in medicine and industry. However, designing and producing a protein with a desired enzymatic activity or biological function is often a challenging task (Brustad, E.M. and Arnold, F.H. (201 1) Curr. Opin. Chem. Biol. 15(2), 201 -210). Incorporation of selenocysteine (Sec), the 21 st amino acid, into engineered proteins is one of the ways to bring in new properties. Sec is a cysteine (Cys) analog carrying a selenium atom instead of sulfur, which provides unique physicochemical characteristics to Sec. When compared to the thiol group of Cys, the selenol group of Sec has a higher nucleophilicity, lower pKa, and a lower redox potential. Due to its pronounced chemical reactivity, Sec is capable of changing structural and enzymatic qualities of a protein. For example, engineering selenocysteine residues into the catalytic centre offers an attractive means to expand enzyme's naturally evolved catalytic capacity (Huang, X. et al. (201 1) Chem. Soc. Rev. 40, 1 171- 1 184). Alternatively, the unique chemical reactivity of Sec towards electrophiles and oxidation reagents makes it useful for selective conjugation, labeling or creation of dimerization sites in a protein (Heras, I.L. et al. (201 1) Anal. Bioanal. Chem. 400(6), 1717-27; Arner, E. (2010) Exp. Cell. Res. 316(8), 1296-303; Johansson, L. et al (2005) Biochim. Biophys. Acta. 1726( 1), 1- 13). In addition, incorporation of selenocysteine carrying valuable isotopes of Se is useful for facilitation of phase determination by multi-wavelength anomalous diffraction in X-ray crystallography of proteins, positron emission tomography PET ( 73 Se), specific radiolabeling ( 75 Se) and high-resolution NMR studies ( 77 Se) of proteins (Johansson, L. et al (2005) Biochim. Biophys. Acta. 1726(1), 1 -13). Yet another area is efficient production of natural recombinant selenoproteins in bacterial (Arner, E.S.J. (2002) Met. Enzymol. 347, 226-235) or eukaryotic cells (Liu, H. et al. (2012) Protein Expr Purif. 84(1 ), 59-63.

Despite its high technological potential, incorporation of Sec into recombinant proteins is far from trivial. A number of methods have been proposed for making artificial selenoproteins, however, none of them so far enables a precise insertion of Sec with a good yield (Johansson, L. et al (2005) Biochim. Biophys. Acta. 1726( 1), 1 -13). For example, protein ligation approaches are limited by the solid base synthesis which generally allows making peptides no longer than 50 amino acids. If Sec has to be incorporated beyond the 50 th position, ligation of 3 peptide fragments is needed. Since peptide ligation requires an N-terminal Cys residue, this approach unavoidably creates an additional Cys mutation in the protein sequence (Muir, T. W. (2003) Annu. Rev. Biochem. 72, 249-289). Chemical conversion of serine (Ser) residues to Sec targets all Ser residues and therefore is not suitable for making single Sec substitutions in most proteins (Wu, Z.-P., Hilvert, D. (1989) J. Am. Chem. Soc. I l l , 4513-4514). A similar lack of specificity arises when a protein containing several Cys residues is expressed in a Cys auxotrofic strain with Sec supplied in the growth medium (Muller, S. et al. (1994) Biochem. 33, 3404-3412). Expression of recombinant proteins in bacteria by employing native UGA codon suppression is also technically limited. Specifically, bacterial selenoprotein genes carry a hairpin-like selenocystein insertion sequence (SECIS) inside the open reading frame and adjacent to the UGA codon. Not only the SECIS structure but also 4 conserved nucleotides within it are absolutely required to direct the incorporation of Sec into the peptide chain during translation (Yoshizawa, S., Bock, A. (2009) Biochim. et Biophys. Acta. 1790, 1404-1414). Therefore, it is practically impossible to engineer a SECIS structure into a recombinant mRNA sequence without making additional changes in the protein sequence. Mammalian expression systems, in which SECIS is positioned in the 3'UTR, are suitable for making targeted Sec incorporation without mandatory alterations of the protein sequence, but cannot provide good protein yields at a comparable cost (Berry, M.J. et al. ( 1993) EMBO J., 12 (8), 3315-3322). The same is true about in vitro protein expression systems which require expensive components and labor-intensive chemical charging of tRNA with Sec (Kang, T.J., Suga, H. (2008) Biochem. Cell Biol. 86, 92-99).

An alternative approach for targeted incorporation of certain unnatural amino acids is based on in vivo incorporation of genetically encoded unnatural amino acids at a UAG nonsense codon. This technology allowed incorporation of over 70 different unnatural amino acids into protein structures with high specificity (Liu, C.C. and Schultz, P.G. (2010) Annu. Rev. Biochem. 79, 413-444), however, it has not been shown to be suitable for producing Sec insertions/substitutions. Although successful incorporation of phenylselenocysteine into histone H3 protein was achieved (Guo, J. et al. (2008) Angew. Chem. Int. Ed. Engl. 47(34), 6399-6401), but a suitable method for deprotection of the incorporated residue to Sec was not proposed (Clark, K. . et al. (2009) Methods Enzymol. 462, 97- 1 15). This approach was also used for the incorporation of photocaged amino acids such as DMNB-Ser, which can be readily converted to Ser in situ by photolysis (Lemke, E.A. et al. (2007) Nat. Chem. Biol. 3(12), 769-772). The yeast cells are co-transformed with the pescTrpLeuRSBH5T252A plasmid, which encodes for an orthogonal UAG nonsense codon suppressor tRNAcu A (O- tRNA) and an orthogonal aminoacyl tRNA synthetase (O-RS) that is capable of charging the O-tRNA with the unnatural amino acid DMNB-Ser (and apparently DMNB-Cys, Young, T.S. et al. (2009) Biochemistry. 48(12), 2643-2653). However, the amounts of such recombinant proteins obtained from the described yeast cultures were quite low (Lemke, E.A. et al. (2007) Nat. Chem. Biol. 3(12), 769-772; Chin, J.W. et al. (2003) Science. 301 (5635), 964-967). The yield of recombinant proteins was improved when expression of O-tRNA was optimized by fusing its gene to the SNR52 promoter (pSNR-LeuRS plasmid) and expressing the protein in a yeast UPF\ deletion strain (LWUPF 1A), which stabilizes nonsense containing mRNAs (Wang, Q. and Wang, L. (2008) J. Am. Chem. Soc. 130( 19), 6066-6067). However, the described system was not used for expression of DMNB-Ser or DMNB-Cys containing proteins.

So far none of the existing technologies proved to serve as a general method for high yield production of recombinant proteins that contain a Sec residue in a predefined position.

Summary of the Invention

First, the current invention solves the above problem by using a UAG codon engineered at any defined position of mRNA, which is decoded in yeast cells by the O-tRNA charged with the unnatural amino acid, the photocaged selenocysteine. The protective group such as 4,5- dimethoxy-2-nitrobenzyl (DMNB) is then removed from the protein by illumination with UV light leaving Sec in place. This strategy, in principle, can be applied to make any desired Sec incorporation in any protein. Moreover, in live cells, synthesis of a photocaged selenoprotein will permit optical control over biological processes carried out by the protein. As a result, this method offers a new tool for protein engineering and specifically selenology.

Second, the current invention provides a new photocaged selenocysteine compound of formula I and a synthesis scheme for obtaining the same.

Brief Description of the Figures

Figure 1 shows synthetic route to production of DMNB-Cys.

Figure 2 shows synthetic route to production of DMNB-Sec.

Figure 3 illustrates DMNB-Sec-dependent production of functional EGFP protein in yeast S. cerevisiae cells.

Figure 4 illustrates the results of the removal of the photocaging DMNB group from cysteine and selenocysteine residues after exposing the EGFP protein to UV light (340-390 nm).

Figure 5 illustrates the results of the formation of the EGFP protein dimers after protein exposure to uv and reduction of the dimers by DTT treatment.

Figure 6 demonstrates that photodecaging, dimerisation and reductive monomerisation all occur under mild conditions preserving the functionally (fluorescence) of the EGFP protein. Detailed Description of the Invention

Unless defined herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains.

The term "orthogonal" refers to a molecule that functions with endogenous components of a cell with reduced efficiency as compared to a corresponding molecule that is endogenous to the cell or translation system, or that fails to function with endogenous components of the cell. In the context of tRNAs and aminoacyl-tRNA synthetases, orthogonal refers to an inability or reduced efficiency of an orthogonal tRNA to function with an endogenous tRNA synthetase compared to an endogenous tRNA to function with the endogenous tRNA synthetase, or of an orthogonal aminoacyl-tRNA synthetase to function with an endogenous tRNA compared to an endogenous tRNA synthetase to function with the endogenous tRNA. The orthogonal molecule lacks a functionally normal endogenous complementary molecule in the cell. The term "selector codon" refers to codons recognized by the O-tRNA in the translation process and not recognized by an endogenous tRNA. Selector codons can include, e.g., nonsense codons, such as UAG, UAA, UGA.

The term "translation system" refers to the components that incorporate an amino acid into a growing polypeptide chain; these components can include, for example, ribosomes, tRNAs, synthetases, mRNA and the like. The O-tRNA and the O-RS can be added to or be a part of an in vitro or in vivo translation system. The compositions made by the methods of the invention optionally are in a cell. The unnatural amino acid that is added to the system can be a synthetic amino acid, such as a photocaged selenocysteine, which can be exogenously added to the growth medium.

The current invention relates to a method for producing in a translation system a protein comprising an unnatural amino acid at one or more selected positions, the method comprising:

(a) providing a translation system comprising:

(i) an unnatural amino acid that is photocaged selenocysteine;

(ii) an orthogonal aminoacyl-tRNA synthetase (O-RS);

(iii) an orthogonal tRNA (O-tRNA), wherein said O-RS preferentially aminoacylates said O-tRNA with said photocaged selenocysteine; and,

(iv) a nucleic acid encoding said protein, wherein said nucleic acid comprises at least one selector codon that is recognized by said O-tRNA; and,

(b) incorporating said unnatural amino acid at said selected position in said protein during translation of said protein in response to said selector codon, thereby producing said protein comprising said unnatural amino acid at the selected position.

The method may further comprise step (c) removing the photocaging group by exposing translation system or purified protein solution to UV light.

The host cell that is used is preferably a bacterial, yeast or mammalian host cell; more prefereably it is a yeast cell. The yeast cell is preferably Saccharomyces cerevisiae.

The invention further relates to a photocaged selenocysteine of the formula I:

Wherein

R is H, alkyl or alkoxy, carboxymethoxy group,

R 1 is H, alkyl, aryl, alkoxy, hydroxy, or a halide group.

R 2 and R 3 each independently represent H, alkyl, aryl, alkoxy, hydroxy group, C0 2 R 5 or halide; wherein when R and R or R and R is alkoxy, R or R may form a five, six or seven membered cyclic structure in combination with R,

R 4 is H, methyl, alkyl, CH 2 OR 5 or C0 2 R 5

R 5 is H or alkyl.

The photocaged selenocysteine is preferably 4,5-dimethoxy-2-nitrobenzylselenocysteine (DMNB-Sec).

The invention further relates to a process for producing a photocaged selenocysteine of formula I, which comprises the steps of: reducing of Ν,Ν'-blocked selenocystine and reacting with a compound selected from a corresponding substituted (at positions 1 ,3,4,5 and/or alpha) 2-nitrobenzyl halide, sulfonate, phosphate or carboxylate, followed by de-protection of amino groups. The selenocystine may be blocked with a group selected from butoxycarbonyl, carboxybenzyl, allyloxycarbonyl, metoxy or ethoxycarbonyl.

The photocaged selenocysteine of formula I may be further used in the method of invention. The present invention for site-specific Sec incorporation is based on UAG codon suppression technology used to incorporate some other unnatural amino acids into proteins. This method employs yeast cells transformed with plasmids, carrying an engineered UAG nonsense codon at the desired location inside a gene of interest.

The system comprises the following major components:

1. the UPF\ -deficient LWUPF 1 Δ yeast strain;

2. plasmid pRR6;

3. synthetic photocaged amino acid DMNB-Sec 4. plasmid carrying an engineered UAG nonsense codon at the desired location inside a gene of interest (exemplified by pGFP-TAG);

The pRR6 plasmid was constructed by replacing the LeuRS gene in the pSNR-LeuRS plasmid (Wang, Q. and Wang, L. (2008) J. Am. Chem. Soc. 130(19), 6066-6067) with the mutated DMNB-Ser specific LeuRSBH5T252A gene from the pescTrpLeuRSBH5T252A plasmid (Lemke, E.A. et al. (2007) Nat. Chem. Biol. 3(12), 769-772) by genetic engineering. The nucleotide sequence representing LeuRSBH5T252A gene is provided as SEQ ID NO: 1. atgcaagagcaataccgcccggaagagatagaatccaaagtacagcttcattgggatgag aagcgcacatttgaagtaaccgaagacg agagcaaagagaagtattactgcctgtctgggcagccctatccttctggtcgactacaca tgggccacgtacgtaactacaccatcggtg acgtgatcgcccgctaccagcgtatgctgggcaaaaacgtcctgcagccgatcggctggg acgcgtttggtctgcctgcggaaggcgc ggcggtgaaaaacaacaccgctccggcaccgtggacgtacgacaacatcgcgtatatgaa aaaccagctcaaaatgctgggctttggtt atgactggagccgcgagctggcaacctgtacgccggaatactaccgttgggaacagaaat tcttcaccgagctgtataaaaaaggcctg gtatataagaagacttctgcggtcaactggtgtccgaacgaccagaccgtactggcgaac gaacaagttatcgacggctgctgctggcg ctgcgataccaaagttgaacgtaaagagatcccgcagtggtttatcaaaatcactgctta cgctgacgagctgctcaacgatctggataaa ctggatcactggccagacaccgttaaaaccatgcagegtaactggatcggtcgttccgaa ggcgtggagatcaccttcaacgttaacga ctatgacaacacgctgaccgtttacactacccgcccggacgcgtttatgggttgtaccta cctggcggtagctgcgggtcatccgctggc gcagaaagcggcggaaaataatcctgaactggcggcctttattgacgaatgccgtaacac caaagttgccgaagctgaaatggcgacg atggagaaaaaaggcgtcgatactggctttaaagcggttcacccattaacgggcgaagaa attcccgtttgggcagcaaacttcgtattg atggagtacggcacgggcgcagttatggcggtaccggggcacgaccagcgcgactacgag tttgcctctaaatacggcctgaacatca aaccggttatcctggcagctgacggctctgagccagatctttctcagcaagccctgactg aaaaaggcgtgctgttcaactctggcgagtt caacggtcttgaccatgaagcggccttcaacgccatcgccgataaactgactgcgatggg cgttggcgagcgtaaagtgaactaccgc ctgcgcgactggggtgtttcccgtcagcgttactggggcgcgccgattccgatggtgact ctagaagacggtaccgtaatgccgacccc ggacgaccagctgccggtgatcctgccggaagatgtggtaatggacggcattaccagccc gattaaagcagatccggagtgggcgaa aactaccgttaacggtatgccagcactgcgtgaaaccgacactttcgacacctttatgga gtcctcctggctgtatgcgcgctacacttgcc cgcagtacaaagaaggtatgctggattccgaagcggctaactactggctgccggtggata tcggtattggtggtattgaacacgccattat gtttctgctctacttccgcttcttccacaaactgatgcgtgatgcaggcatggtgaactc tgacgaaccagcgaaacagttgctgtgtcagg gtatggtgctggcagatgccttctactatgttggcgaaaacggcgaacgtaactgggttt ccccggttgatgctatcgttgaacgtgacga gaaaggccgtatcgtgaaagcgaaagatgcggcaggccatgaactggtttataccggcat gagcaaaatgtccaagtcgaagaacaac ggtatcgacccgcaggtgatggttgaacgttacggcgcggacaccgttcgtctgtttatg atgtttgcttctccggctgatatgactctcgaa tggcaggaatccggtgtggaaggggctaaccgcttcctgaaacgtgtctggaaactggtt tacgagcacacagcaaaaggtgatgttgc ggcactgaacgttgatgcgctgactgaaaatcagaaagcgctgcgtcgcgatgtgcataa aacgatcgctaaagtgaccgatgatatcg gccgtcgtcagaccttcaacaccgcaattgcggcgattatggagctgatgaacaaactgg cgaaagcaccaaccgatggcgagcagg atcgcgctctgatgcaggaagcactgctggccgttgtccgtatgcttaacccgttcaccc cgcacatctgcttcacgctgtggcaggaact gaaaggcgaaggcgatatcgacaacgcgccgtggccggttgctgacgaaaaagcgatggt ggaagactccacgctggtcgtggtgc aggttaacggtaaagtccgtgccaaaatcaccgttccggtggacgcaacggaagaacagg ttcgcgaacgtgctggccaggaacatct ggtagcaaaatatcttgatggcgttactgtacgtaaagtgatttacgtaccaggtaaact cctcaatctggtcgttggctaa

The photocaged selenocysteine, DMNB-Sec, was chemically synthesized from selenocystine as a trifluoroacetate salt using a newly developed procedure. The same procedure may be used to synthesize other photocaged selenocysteines of the formula I:

Wherein

R is H, alkyl or alkoxy, carboxymethoxy group,

R 1 is H, alkyl, aryl, alkoxy, hydroxy, or a halide group.

R 2 and R 3 each independently represent H, alkyl, aryl, alkoxy, hydroxy group, C0 2 R 5 or halide; wherein when R and R 2 or R 3 is alkoxy, R 2 or R 3 may form a five, six or seven membered cyclic structure in combination with R,

R 4 is H, methyl, alkyl, CH 2 OR 5 or C0 2 R 5

R 5 is H or alkyl.

As an example, we used plasmid pGFP-TAG (Wang, Q. and Wang, L. (2008) J. Am. Chem. Soc. 130(19), 6066-6067), which encodes for the EGFP protein with a UAG mutation at nonessential position Tyr39.

For protein production, the LWUPF I A cells were simultaneously transformed with the pRR6 and pGFP-TAG plasmids and were grown in medium containing 4 mM DMNB-Sec. The full length active protein is formed only when the unnatural amino acid is inserted to its sequence after decoding the UAG codon. The 6-His tagged EGFP protein carrying the unnatural amino acid DMNB-Sec replacing Tyr39 was purified from yeast cells using Immobilized metal affinity chromatography.

The identity of the isolated EGFP protein carrying DMNB-Sec was confirmed by mass spectrometry. The yield of the seleno-EGFP protein was 1 mg/L.

The disclosed approach uses a photocaging DMNB group to protect Sec, such that the reactive selenohydryl (or selenide) group will be protected from chemical reactions with the biopolymers, metabolites and other molecules (including reactive oxygen species) present in cellular environment. After protein synthesis is complete, the photocaging group can be removed by illuminating the producing cells or isolated protein solution in vitro with UV light.

Previously, it was shown that the O-RS optimized for DNMB-Ser can also accept DNMB-Cys (Young, T.S. et al. (2009) Biochemistry. 48(12), 2643-2653). We surprisingly found that this aaRS can also accept DMNB-Sec. Although the DMNB-protected Sec differs by only one atom from DMNB-Ser and DMNB-Cys, it was not directly obvious that such a replacement in the unnatural amino acid would be well tolerated by the mutated orthogonal LeuRS. One of the reasons is a noticeably larger van der Waals radius of selenium as compared to sulfur and oxygen, and the larger amino acid DMNB-Sec is less likely to fit in the pocket of an enzyme. Secondly, a much higher chemical reactivity of selenium towards electrophiles and oxidants might lead to fast decay of DMNB-Sec in the growth medium or inside the yeast cells. Third, the toxicity of 4 mM DMNB-Sec for yeast cells was not previously examined, whereas, some Se compounds are known to be toxic for yeast at much lower concentrations (SeMet is toxic at 0.25 mM, Kitajima, T. et al. (2012) J. Biol. Chem. 287(13), 10032-8). Fourth, although the thiol chemistry in general has certain similarities with selenium chemistry, we found that a synthetic route established for production of DMNB-Cys from cystine was unsuitable and required major changes to be efficient for the synthesis of DMNB-Sec from selenocystine. Finally, photochemical removal of the caging DMNB group from Sec or other Se compounds was not previously described and it was not clear if the reaction would proceed in a similar fashion as observed for Ser (Lemke, E.A. et al. (2007) Nat. Chem. Biol. 3(12), 769-772) or Cys (Rathert, P. et al., (2007) Chembiochem. 8(2), 202-207.).

The described approach can be used for production of any recombinant selenoprotein in yeast. The list of known selenoproteins is provided in the following resource: http://www.selenodb.org/ The described approach can be used for production of any other artificially modified proteins in which one or several Sec residues are inserted in desired positions. Inserted Sec residues can be used for targeted labeling or further chemical modification of the protein.

Sec residues are preferentially labelled with the same reagents used for Cys modification, but the reaction can be selectively performed at Sec at a slightly acid pH (pH<7).

Inserted Sec residues can be used for targeted and light-controlled dimerization of functional proteins as exemplified for EGFP. The described approach can be used for production of any artificially modified enzymes in which one or several Sec residues are inserted at selected positions in or around the active centre. This position can be often selected by replacing catalytic Cys or Ser residues in an enzyme. Preferential classes of such enzymes are thiol oxidoreductases (EC 1.8.3.2), protein- tyrosine phosphatases (EC 3.1.3.48), Cys peptidases (EC 3.4.22) (The MEROPS online database: //merops.sanger.ac.uk/cgi-bin/family_index?type=P#C), one carbon pyrimidine-5 transferases such as deoxycytidylate 5-hydroxymethyltransferases (EC 2.1.2.8), thymidylate synthetases (EC 2.1.1.45), and a large variety of DNA cytosine-5 methyltransferases (EC 2.1.1.37), RNA cytosine-5 methyltransferases (listed in http://modomics.genesilico.p1/reactions/C:m5C/) and RNA uracil-5 methyltransferases (listed in http://modomics.genesilico.p1/reactions/U:m5U/) (Bujnicki J. et al. (2004) Nucleic Acids Res. 32(8), 2453-63). DNA cytosine-5 methyltransferases (exemplified by M.Hhal, M.SssI, M.Hpall, M.AluI, M2.Eco31I) and their engineered variants have also been shown to catalyze sequence-specific non-canonical reactions such as the transfer of extended groups from synthetic co factor analogues (Lukinavicius G. et al. (2012) Nucleic Acids Res., 40(22), 1 1594— 1 1602; US2013130922A 1), addition and removal of formaldehyde at the 5-position of a target cytosine residue (Liutkeviciute Z. et al., (2009) Nat. Chem. Biol. 5, 400-402), and addition of aliphatic thiols or selenols to 5-hydroxymethylcytosine residues in DNA (Liutkeviciute Z. et al., (201 1) Angew. Chem. Int. Ed., 50, 2090-2093). These novel reactions can be used for analysis of epigenetic modifications in genomic DNA which typically occur at CpG dinucleotide sequences. Replacement of the catalytic Cys with Sec may enhance the catalytic power of these enzymes in these practically important non-canonical reactions. EXAMPLES

Example 1. Synthesis of photocaged amino acids

Synthesis of DMNB-Cys (Figure 1.)

Cystine dimethyl ester 1

Suspension of cystine (5 g, 20.8 mmol) in MeOH (100 ml) was cooled to -70°C and thionyl chloride (3.75 ml, 19.24 mmol) was added to reaction vessel. Reaction mixture was left until room temperature is reached, then refluxed for 6 hours. Methanol evaporated under reduced pressure and residue dissolved in methanol, after addition of diethyl ether product precipitates as white crystals. Procedure gave 6.69g (94%) of target product as hydrochloride salt.

N,A^-bis(tert-butoxycarbonyl)cystine dimethyl ester 2

Reaction was performed according procedure described in literature (Busnel et al., (2005) Bioorg. Med. Chem. 13(7), 2373-80).

N-(tert-butoxycarbonyl)cysteine methyl ester 3

Reaction was performed according procedure described in literature (Busnel et al., (2005)).

N-(/ert-butoxycarbonyl)-5'-(4,5-dimethoxy-2-nitrobenzyl)c ysteine methyl ester 4

Suspension of N-(tert-butoxycarbonyl)cysteine methyl ester (1.94 g, 8.25 mmol), 4,5- dimethoxy-2-nitrobenzylbromide (2.40 g, 8.73 mmol) and sodium carbonate (1.51 g, 14.25 mmol) in THF (50 ml) refluxed for 60 hours, then quenched with water (50 ml). THF was evaporated under reduced pressure and product extracted with DCM. DCM evaporated, residue purified by column chromatography (C 6 H :CHC1 3 1 : 1, R f = 0.23). Described procedure yielded 2.16 g (61%) of target product.

Ή NMR (400 MHz, CDC1 3 ): δ = 1.47 (9H, s, 3 CH 3 ), 2.91 (1H, dd J = 14, 5.6Hz, SCH^He), 2.98 (1H, dd J= 14, 5.6Hz, SCH A Hs), 3.78 (3Η, s, OCH 3 ), 3.97 (3H, s, ArOCH 3 ), 4.03 (3H, s, ArOCH 3 ), 4.09 (1H, d J = 13.6Hz, BnCH,H B S), 4.17 (1H, d J = 13.6Hz, BnCH A H s S), 4.56 (1Η, brm, CH), 5.35 (1H, d J = 7.6Hz, NH), 6.94 (1H, s, ArH), 7.69 (1H, s, ArH). 13 C NMR (100 Mz, CDC1 3 ): 6 = 28.3, 34.4, 34.5, 52.7, 53.4, 56.4, 56.5, 80.3, 108.8, 1 13.4, 128.7, 140.5, 148.0, 153.0, 155.2, 174.4. Anal, calcd. for C 18 H 26 N 2 0 8 S: C, 50.22; H, 6.09; N, 6.51 ; S, 7.45. Found: C, 50.39; H, 6.25; N, 6.53; S, 7.22. HRMS: m/z [M+Na] + calcd. for Ci 8 H 26 N 2 0 8 S: 453.1302; Found: 453.1310.

5'-(4,5-dimethoxy-2-nitrobenzyl)cysteine hydrochloride 5 To suspension of N-(tert-butoxycarbonyl)-5'-(4,5-dimethoxy-2-nitrobenzyl)cyst eine methyl ester (0.95 g, 2.21 mmol) in mixture of acetone (25 ml) and water (25 ml) cone, hydrochloric acid (0,92 ml, 1 1.36 mmol) was added. Reaction mixture was stirred for 3 days at 60°C (sand bath) temperature, solvents evaporated under reduced pressure, residue washed with acetone. Described procedure yielded 0.74 g (95%) of target product.

Ή NMR (400 MHz, DMSO-D 6 ): δ = 2.95 (1H, dd J = 14.6, 6.4Hz, SCH,H B ), 3.01 (1H, dd J = 14.6, 4.8, SCH A H0, 3.86 (3Η, s, OCH 3 ), 3.93 (3H, s, OCH 3 ), 4.12 (1H, d J = 13.6 Hz, BnCH f HeS), 4.15-4.25 (2H, m, BnCH A HsS + CH), 7.35 (1 H, s, ArH), 7.70 (1H, s, ArH), 8.62 (3H, brs, NH 3 ), 14.04 (1 H, brs, OH). , 3 C NMR (100 Mz, CDC1 3 ): δ = 31.3, 33.7, 52.3, 56.5, 57.0, 109.4, 1 14.9, 129.3, 140.1, 148.0, 153.2, 170.0. HRMS: m/z [M+H] + calcd. for C, 2 H 16 N 2 0 6 S: 317.0802; Found: 317.0805.

Synthesis of DMNB-Sec (Figure 2.)

N.N'-bis^rt-butoxycarbony selenocystine 1

Solution of selenocystine (0.625 g, 1.88 mmol) in 3.8 ml of alkaline water (1 M NaOH) was combined with solution of di-tert-butyldicarbonate (1.633 g, 7.49 mmol) in 1.8 ml of dioxane and .reaction mixture was stirred for 24 hours at room temperature, concentrated under reduced pressure and washed with ethyl acetate. Water phase was acidified with hydrochloric acid,and product extracted with ethyl acetate. Organic layer washed with brine, dried over sodium sulphate. Ethyl acetate was removed under reduced pressure. Residue was recrystallized form benzene to yield 0.755 g (75.5%) of target product.

Ή NMR (300 MHz, CDC1 3 ): δ = 1.45 (9H, s, 3*CH 3 ), 3.21 (1 H, dd J = 12.6, 4.8Hz, SeCHjHu), 3.46 (1H, dd J= 12.6, 4.8Hz, SeCH A Hi , 4.61 (1Η, brm, CH), 5.37 (1 H, brs, NH). N-(ter/-butoxycarbonyl)-5'e-(4,5-dimethoxy-2-nitrobenzyl)sel enocysteine 2

Mixture of N,jV-bis(tert-butoxycarbonyl)selenocystine (0.755 g, 1.41 mmol) and 4,5- dimethoxy-2-nitrobenzylbromide (1.170 g, 4.24 mmol) in ethanol (30 ml) was cooled to 0°C, then sodium borohydride (0.161 g, 4.26 mmol) poured to reaction vessel. Reaction mixture was stirred for 75 minutes at room temperature then quenched with water (70 ml). Ethyl acetate was added and reaction mixture was acidified with hydrochloric acid. Organic layer was separated and dried over sodium sulphate. Solvents evaporated under reduced pressure, residue purified by column chromatography (CHC1 3 , R f = 0.26). Described procedure yielded 1.133 g (86.5%) of target product. Ή NMR (400 MHz, CDC1 3 ): δ = 1.46 (9H, s, 3xCH 3 ), 3.03 (1H, dd J = 13.6, 4.8Hz, SeC¾H B ), 3.09 (1 H, dd J = 13.6, 4.8Hz, SeCH A H s ), 3.95 (3Η, s, OCH 3 ), 4.01 (3H, s, OCH 3 ), 4.1 1 (1 H, d J = 1 1.6Hz, BnCH4H B Se), 4.21 ( 1 H, d J = 1 1.6Hz, BnCH A H s Se), 4.62 ( 1 Η, brm, CH), 5.46 (1 H, brm, NH), 6.83 (1H, s, ArH), 7.69 (1H, s, ArH). I 3 C NMR (100 Mz, CDC1 3 ): δ = 25.9 (2 overlapping signals, it's clear from HSQC data), 28.3, 53.6, 56.4, 56.6, 80.7, 108.9, 1 13.2, 130.6, 139.7, 147.8, 153.2, 155.6, 174.9. Anal, calcd. for C i 7 H 24 N 2 0 8 Se: C, 44.07; H, 5.22. Found: C, 44.13; H, 5.53. HRMS: m/z [M+Na] + calcd. for C 17 H 24 N 2 0 8 Se: 487.0591 ; Found: 487.0589. &-(4,5-dimethoxy-2-nitrobenzyl)selenocysteine trifluoroacetate 3

N-(tert-butoxycarbonyl)-&-(4,5-dimethoxy-2-nitrobenzyl)s elenocysteine (1.13 g, 2.44 mmol) was dissolved in TFA ( 10 ml) and stirred for 15 min at room temperature. After TFA was evaporated solid residue was washed with DCM, filtered and dried. Crude product was dissolved in methanol, insoluble impurities filtered off. Evaporation of methanol under reduced pressure yielded 0.9 g (77%) of target product.

'H NMR (300 MHz, DMSO-D 6 ): 5 = 2.95-3.1 1 (2H, m, SeCH 2 ), 3.87 (3H, s, OCH 3 ), 3.92 (3H, s, OCH 3 ), 4.14 ( 1H, d J = 12 Hz, BnCHjHeSe), 4.17-4.25 (2H, m, BnCH A H B Se + CH), 7.15 (1H, s, ArH), 7.70 (1H, s, ArH), 8.44 (3H, brs, NH 3 ). 13 C NMR (75 Mz, CDC1 3 ): δ = 23.4, 26.0, 53.1 , 56.7, 57.0, 109.5, 1 14.5, 131.2, 139.8, 148.2, 153.6, 170.4. HRMS: m/z [M+H] + calcd. for Ci 2 H 16 N 2 0 6 Se: 365.0247; Found: 365.0255.

Example 2 Targeted incorporation of a photocaged Sec into a protein produced in yeast cells.

Example 2 demonstrates that the presence of all O-tRNA, O-RS and DMNB-Cys or DMNB- Sec is required for production of a functional protein using the disclosed yeast expression system. Yeast cells were simultaneously transformed with two plasmids (pGFP-TAG and pRR6) as well as single plasmid combinations and selected transformants were grown either without (column 1 ) or with the unnatural amino acid (columns 2 and 3) in selective liquid medium for 24 h at 30°C. Cells were harvested, washed and resuspended in water. The efficiency of the DMNB-Sec-dependent production of a functional EGFP protein was monitored by measuring EGFP fluorescence of the producing yeast cells (Figure 3). Fluorescence readings were taken from 0.1 ml samples in triplicates using Synergy H4 (BioTek Instruments, Inc.) plate reader. Fluorescence values were corrected by absorbance OD595 readings which were taken in parallel for each sample. Plasmid pGFP-TAG encodes an EGFP protein that contains a TAG nonsense codon substitution at functionally permissive Tyr39 position. Plasmid pRR6 encodes a TAG nonsense codon suppressor O-tRNA, and a mutated orthogonal LeuRS. Cells grown without a DMNB-protected amino acid produce mostly truncated nonfluorescent EGFP (column 1). Functional active EGFP protein (fluorescent) is produced after decoding of the TAG codon by a nonsense suppressor O-tRNA charged with unnatural amino acids DMNB-Cys (column 2) or DMNB-Sec (column 3).

Column 1. EGFP fluorescence above the background level in cells grown without unnatural amino acids shows low level production of full length EGFP protein likely due to residual charging of O-tRNA with Leu and its incorporation at the TAG codon.

Column 2. Cells grown with 4 mM DMNB-Cys added to the medium produce a 6 fold higher EGFP fluorescence as compared with column 1 , consistent with charging of the O-tRNA with DMNB-Cys and incorporation of DMNB-Cys at the TAG codon of the reporter protein during translation.

Column 3. Cells grown with 4 mM DMNB-Sec added to the medium produce similar fluorescence levels of the EGFP protein, compared to column 2, consistent with charging of O- tRNA with DMNB-Sec and incorporation of DMNB-Sec at the TAG codon of the reporter protein during translation.

Column 4. Cells transformed with pGFP-TAG alone produce low levels of functional EGFP and correspond to the TAG nonsense codon suppression errors made by cellular translation system.

Column 5. Fluorescence background in cells lacking EGFP reporter plasmid.

Column 6. Yeast autofluorescence background in untransformed cells.

Column 7. Absence of fluorescence of 4 mM DMNB-Cys solution.

Column 8. Absence of fluorescence of 4 mM DMNB-Sec solution.

These observations are consistent with targeted incorporation of a photocaged Sec residues in the EGFP protein produced in yeast cells. Light induced removal of the protecting DMNB group to yield a selenoprotein (Figure 4)

The produced selenoprotein containing a photocaged Sec residue is decaged during a photochemical reaction in which the protective DMNB group is removed from selenium yielding a protein with Sec in place (see Fig. 4). Figure 4 shows the time course of the photolytical decaging of the incorporated DMNB-Sec residue. The 6-His tagged EGFP protein carrying the unnatural amino acid DMNB-Sec at position Tyr39 was purified from yeast cells using standard IMAC procedure. Protein solution (0.2 mg/ml in PBS) was exposed to UV light generated by an Olympus AX70 microscope equipped with excitation filter (340-390 nm band pass). At indicated time points, 20 μΐ samples were taken for ESl-MS analysis and the EGFP products were quantified based on mass spectrograms. The caging DMNB group was removed in high yield after 40-60 min illumination demonstrating utility of the described approach for selenoprotein production.

Example 3 Light-controlled reversible protein dimerization

Chemically reversible dimerization of an engineered sec-protein (Figure 5) Figure 5 demonstrates the formation of (SecEGFP) 2 dimers upon exposure of the DMNB- SecEGFP protein to UV light for 20 min. It also shows subsequent reduction of the dimer with the reducing agent DTT to yield the SecEGFP protein. At 0 min time point, protein solution consisted of photocaged DMNB-SecEGFP (calculated mass: 28936 Da) irrespective of the addition of the 50 mM DTT. After 20 min exposure of the protein to UV light, a new monomeric SecEGFP (calculated mass: 28741 Da) as well as dimeric (SecEGFP) 2 (calculated mass: 57480 Da) protein products were formed. The protein dimers were completely reduced to yield the monomeric SecEGFP after treatment with 50 mM DTT. Consistent with the MS measurements, the dimerization most likely occurs via the formation of a Se-Se bond between the inserted Sec residues under mildly oxidizing conditions (presence of air oxygen), which is readily reversed in the presence of DTT.

Figure 6. Dimerization of a functionally active selenoprotein.

Aliquots of same samples used in ESI-MS were prepared in standard SDS-PAGE sample buffer (Thermo Fisher) and analyzed by electrophoresis on a 12% polyacrylamide gel. In-gel EGFP fluorescence was visualized using FLA-5100 (Fujifilm, Japan) scanner. Fluorescent (SecEGFP)2 dimers were formed after a 20 min exposure of the DMNB-SecEGFP protein to UV light in high yield, which were then converted to fluorescent monomers by treatment with 50 mM DTT. This experiment illustrates that photolytical decaging, oxidant-promoted dimerization and reductive monomerisation all can occur under mild conditions thereby preserving the functionality (fluorescence) of a protein.

This example shows that further utilisation of the photocaged Sec in peptides or proteins can be useful in cases where light-controlled protein dimerization is desired.