Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
IN VITRO PROTEIN EXPRESSION PROCESS COMPRISING CΚ FUSION MOLECULES
Document Type and Number:
WIPO Patent Application WO/2007/148092
Kind Code:
A2
Abstract:
The invention relates to an in vitro protein expression process comprising preparation of a nucleic acid molecule which comprises a fusion of a gene encoding a target protein and a gene encoding an immunoglobulin Κ light chain constant domain (CΚ) followed by cell-free protein expression. The invention also relates to proteins expressed by said process and to a composition and kit for performing said in vitro protein expression process.

Inventors:
HE MINGYUE (GB)
TAUSSIG MICHAEL JOHN (GB)
Application Number:
PCT/GB2007/002309
Publication Date:
December 27, 2007
Filing Date:
June 21, 2007
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
BABRAHAM INST (GB)
HE MINGYUE (GB)
TAUSSIG MICHAEL JOHN (GB)
International Classes:
C12N15/62; C07K1/22; C12P21/02
Domestic Patent References:
WO2005087810A22005-09-22
Foreign References:
EP0505908A11992-09-30
Other References:
HE ET AL: "Antibody-ribosome-mRNA (ARM) complexes as efficient selection particles for in vitro display and evolution of antibody combining sites" NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, SURREY, GB, vol. 25, no. 24, 15 December 1997 (1997-12-15), pages 5132-5134, XP002079231 ISSN: 0305-1048 cited in the application
HE M ET AL: "Single step generation of protein arrays from DNA by cell-free expression and in situ immobilisation (PISA method)." NUCLEIC ACIDS RESEARCH 1 AUG 2001, vol. 29, no. 15, 1 August 2001 (2001-08-01), pages E73-E73, XP002460061 ISSN: 1362-4962 cited in the application
BURSTEIN Y ET AL: "INDEPENDENT EXPRESSION OF THE GENE CODING FOR THE CONSTANT DOMAIN OF IMMUNO GLOBULIN LIGHT CHAIN EVIDENCE FROM SEQUENCE ANALYSES OF THE PRECURSOR OF THE CONSTANT REGION POLY PEPTIDE" PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, vol. 74, no. 8, 1977, pages 3157-3161, XP002460062 ISSN: 0027-8424
SON JEONG-MI ET AL: "Enhancing the efficiency of cell-free protein synthesis through the polymerase-chain-reaction-based addition of a translation enhancer sequence and the in situ removal of the extra amino acid residues" ANALYTICAL BIOCHEMISTRY, vol. 351, no. 2, April 2006 (2006-04), pages 187-192, XP002460063 ISSN: 0003-2697
YANG JUNHAO ET AL: "Rapid expression of vaccine proteins for B-cell lymphoma in a cell-free system" BIOTECHNOLOGY AND BIOENGINEERING, vol. 89, no. 5, 5 March 2005 (2005-03-05), pages 503-511, XP002460064 ISSN: 0006-3592
CHEN HAIQIN ET AL: "Efficient production of a soluble fusion protein containing human beta-defensin-2 in E. coli cell-free system" JOURNAL OF BIOTECHNOLOGY, vol. 115, no. 3, 9 February 2005 (2005-02-09), pages 307-315, XP002460065 ISSN: 0168-1656
PALMER ELIZABETH ET AL: "Enhanced cell-free protein expression by fusion with immunoglobulin C kappa domain" PROTEIN SCIENCE, vol. 15, no. 12, December 2006 (2006-12), pages 2842-2846, XP008086103 ISSN: 0961-8368
Attorney, Agent or Firm:
GIBSON, Mark (St John's Innovation CentreCowley Road, Cambridge CB4 0WS, GB)
Download PDF:
Claims:
CLAIMS

1. An in vitro process of expressing a target protein which comprises the steps of: (a) preparing a nucleic acid construct comprising both a gene encoding said target protein and a gene encoding an immunoglobulin K light chain constant domain (CK) ; and (b) subjecting the construct prepared in step (a) to protein expression in a cell-free protein expression system.

2. A process as defined in claim 1 , wherein the cell-free protein expression system comprises components for transcription and translation.

3. A process as defined in claim 1 , wherein the cell-free system is a cell-free lysate selected from a prokaryotic or eukaryotic system, such as

E. coli, rabbit reticulocyte, wheatgerm lysates, mammalian cells or an artificially constructed system which enables in vitro protein synthesis.

4. A process as defined in claim 3, wherein the cell-free system is an E. coli cell-free system.

5. A process as defined in any preceding claims, wherein the immunoglobulin K light chain constant domain (CK) is a human immunoglobulin K light chain constant domain (CK) .

6. A process as defined in any preceding claims, wherein the target protein is an immunoglobulin protein.

7. A process as defined in any preceding claims, wherein the target protein is a non-immunoglobulin protein.

8. A process as defined in any preceding claims, wherein the target protein is a binding protein.

9. A process as defined in any preceding claims, wherein the gene encoding the CK domain is fused to the gene encoding the target protein.

10. A process as defined in any preceding claims, wherein the gene encoding the CK domain is present at the C-terminus of the gene encoding the target protein.

11. A process as defined in any preceding claims, wherein the nucleic acid construct additionally comprises one or more of the following: a promoter, a transcriptional and translational regulatory sequence, an untranslated leader sequence, a sequence encoding a cleavage site, a recombination site, a transcriptional terminator or a ribosome entry site.

12. A process as defined in any preceding claims, wherein the nucleic acid construct additionally comprises a gene encoding an immobilisation tag configured to attach to a protein immobilisation agent.

13. A process as defined in claim 12, wherein the immobilisation tag is a polyhistidine sequence and said protein immobilisation agent is a chelating agent.

14. A process as defined in claim 13, wherein the immobilisation tag is a peptide, domain or protein and said protein immobilisation agent is an antibody specific to said tag.

15. A process as defined in any preceding claims, which additionally comprises the step of:

(c) isolating the expressed target protein from the protein expression system.

16. A process as defined in claim 15, wherein the isolation step (c) is performed by affinity chromatography.

17. A process as defined in any preceding claims, which additionally comprises the step of:

(d) cleaving the CK domain from the expressed target protein.

18. A process as defined in any preceding claims, wherein the protein expression system contains one or more biomolecules or molecules required to produce modifications such as co- or post-translational modifications , non-natural or chemically modified amino acids.

19. A composition for in vitro expression of a target protein, said composition comprising:

(a) a nucleic acid construct comprising both a gene encoding said target protein and a gene encoding an immunoglobulin K light chain constant domain (CK) ; and

(b) a cell-free protein expression system.

20. A kit for in vitro protein expression of a target protein which comprises: (a) a nucleic acid construct comprising a gene encoding an immunoglobulin K light chain constant domain (CK) ; and (b) a cell-free protein expression system.

21. A kit as defined in claim 20, wherein the nucleic acid construct additionally comprises a gene encoding said target protein.

22. A kit as defined in claim 20 or claim 21 , which additionally comprises instructions to use said kit in accordance with the process as defined in any of claims 1 to 18.

Description:

IN VITRO PROTEIN EXPRESSION PROCESS COMPRISING CK FUSION MOLECULES

TECHNICAL FIELD OF THE INVENTION The invention relates to an in vitro protein expression process comprising preparation of a nucleic acid molecule which comprises a fusion of a gene encoding a target protein and a gene encoding an immunoglobulin K light chain constant domain (CK) followed by cell-free protein expression. The invention also relates to proteins expressed by said process and to a composition and kit for performing said in vitro protein expression process.

BACKGROUND OF THE INVENTION

Protein production in heterologous systems is a major challenge in many areas of biological research and biopharmaceutical development. Cell-free protein synthesis is becoming a widely used alternative to cell-based methods for rapid and parallel production of proteins, providing a rapid route to the translation of genetic information into folded proteins. As in vitro methods, cell free expression systems allow proteins to be expressed and modified during translation under defined conditions that living cells may be incapable of reproducing. As well as their application to protein production for structural and functional studies (Spirin, A. (2004) TIB 22, 538-545) , a number of significant protein selection and display technologies , including ribosome display, mRNA display and in situ protein arrays, also make use of cell-free protein expression systems (He, M. & Taussig, MJ. (1997) Nucleic Acids Res. 25, 5132-5134; Hanes, J. & Plϋckthun, A. (1997) Proc. Natl. Acad. Sci. USA. 94, 4937-4942; He, M. & Taussig, MJ. (2001) Nucleic Acid. Res. 29, e73) .

A number of cell-free protein expression systems are available, such as rabbit reticulocyte, E. coli S30, and wheat-germ lysates, mammalian cells

(Mikami, S. et al. , (2006) Protein Expr. Purif. 46, 348-357) and the artificially assembled PURE system (Shimizu, Y. et al. , (2001) Nat. Biotechnol, 119, 781-755) . Efforts have been made to improve protein yield by identifying key factors affecting in vitro transcription and translation and developing modified protocols . They include the preparation of cell-free extracts using genetically engineered bacterial strains, optimisation of extraction of the cell lysate, supplies of various energy resources or amino acid concentration, defining the composition of the system using isolated components, and the use of dialysis, continuous- flow, continuous exchange, hollow fiber systems, and bilayer and the film-surface (Spirin, A. (2004) TIB 22, 538-545; Calhoun, K. & Swartz, J.R. (2005) Biotechnol. Prog. 21, 1146-1153; Sawasaki, T. et al. , (2002) FEBS Lett. 514, 102-105) . Despite these developments, some proteins are still only weakly expressed in cell-free systems. There is thus a great need to enhance cell-free protein expression.

SUMMARY OF THE INVENTION

According to a first aspect of the invention there is provided an in vitro process of expressing a target protein which comprises the steps of: (a) preparing a nucleic acid construct comprising both a gene encoding said target protein and a gene encoding an immunoglobulin K light chain constant domain (CK) ; and (b) subjecting the construct prepared in step (a) to protein expression in a cell-free protein expression system.

According to a second aspect of the invention there is provided a target protein-Cκ domain fusion protein obtainable by a process as hereinbefore defined.

According to a further aspect of the invention there is provided a protein obtainable by a process as hereinbefore defined.

According to a further aspect of the invention there is provided a composition for in vitro expression of a target protein, said composition comprising: (a) a nucleic acid construct comprising both a gene encoding said target protein and a gene encoding an immunoglobulin K light chain constant domain (CK) ; and

(b) a cell-free protein expression system.

According to a further aspect of the invention there is provided a kit for in vitro protein expression of a target protein which comprises:

(a) a nucleic acid construct comprising a gene encoding an immunoglobulin K light chain constant domain (CK) ; and

(b) a cell-free protein expression system.

According to a further aspect of the invention there is provided a nucleic acid molecule comprising both a gene encoding a non-immunoglobulin protein and a gene encoding an immunoglobulin K light chain constant domain (CK) .

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 shows a representation of a typical nucleic acid construct in accordance with one embodiment of the invention;

Figure 2 shows the effect of CK fusion upon human single chain (sc) antibody expression;

Figure 3 shows the effect of CK fusion upon GTP binding protein expression;

Figure 4 shows the effect of CK fusion upon FK506 binding protein expression;

Figure 5 shows a representation of the nucleic acid constructs with and without CK which were used in the transcription factor analysis; and

Figure 6 shows the effect of CK fusion on expression of human transcription factors.

DETAILED DESCRIPTION OF THE INVENTION

The process of the invention provides the advantage of significantly enhancing protein expression when incorporated into an in vitro protein expression system (e.g. a cell-free system comprising components for transcription and translation) which consequently allows the production of proteins in a quantity which were previously only scarcely produced. CK has been described as a spacer for ribosome display of antibody fragments (He, M. and Taussig, MJ. (1997) supra) . Subsequently, Chen, S. S. et al. (2004) FASEB J. 18, ρC173 , abs no. 73.10 described the use of a CK fusion protein for ribosome display of GFP, however, the use of CK constructs presented herein provides a new approach to enhance in vitro protein production. In vivo fusion of a CK domain to immunoglobulin and closely related proteins (e.g. single chain antibody fragments and T-cell receptor protein) is known (Maynard, J. A. et al. (2002) Nature Biotechnol. 20, 597-601 ; Maynard, J. A. et al. (2005) J. Immunol. Methods. 306, 51-67) . The CK domain has also been used to make fusions with other non-immunoglobulin proteins for in vivo expression (Caswell, R. et al. (1993) Biotechnol. Techniques 7(4), 307-312; WO 2005/087810 (Zymogenetics Inc.) ; US 6, 146,631 (Better et al.) ; WO 01/46232

(Zymogenetics Inc.) and EP 0 505908 (F. Hoffmann La Roche AG)) , however, the results shown in at least Caswell et al. demonstrated that the resultant fusion protein could not be detected. By contrast, data presented herein surprisingly demonstrates that protein expression is significantly enhanced in a cell-free protein expression system.

In one embodiment, the protein expression system comprises an in vitro protein expression system (e.g. a cell-free system) comprising components for transcription and translation. In a further embodiment, the cell-free system is a cell-free lysate selected from a prokaryotic or eukaryotic system, such as E. coli, rabbit reticulocyte, wheatgerm lysates, mammalian cells ((Mikami, S. et al. , (2006) supra) or an artificially constructed system (e.g. the PURE system (Shimizu, Y. et al. , (2001) supra) which enables protein synthesis in vitro . In a further embodiment, the cell-free system is a bacterial cell-free system such as E. coli (e.g. a coupled E. coli S30 cell-free system) . In the embodiment wherein the nucleic acid construct comprises an mRNA construct, the cell-free system used for protein expression in step (b) is suitably an uncoupled cell-free system for translation.

It will be appreciated that the cell-free protein expression system will be capable of full expression of the target protein, for example, the process will result in a solubilised, expressed protein.

References to "nucleic acid" refer to any nucleic acid moiety capable of in vitro protein synthesis when exposed to an in vitro protein expression system (e.g. a cell-free system comprising components for transcription and translation) . In one embodiment, the nucleic acid moiety comprises genomic DNA, cloned DNA fragments, plasmid DNA, cDNA libraries, PCR products, synthetic oligonucleotides or mRNA. The nucleic acid constructs for in vitro transcription/translation may be obtained by PCR

(polymerase chain reaction) or RT (reverse transcription) -PCR amplification, using primers designed on any known DNA sequences, such as those from databases and genome projects.

In one embodiment, the nucleic acid molecule comprises a PCR product.

References to "gene" should be interpreted in an analogous manner to "nucleic acid" and may herein be used interchangeably.

References to "target protein" refer to any protein required to be expressed and/or purified and/or characterised. Data is presented herein to demonstrate the applicability of the invention for enhancing the level of expression of a variety of proteins and therefore the definition of target protein is intended to be defined broadly. In one embodiment, the target protein is an immunoglobulin protein. In an alternative embodiment, the target protein is a non-immunoglobulin protein. In a further embodiment, the non-immunoglobulin protein is a binding protein (e.g. a GTP binding protein, such as Rab22b or an FK506 binding protein, such as FKBP2) . In an alternative embodiment, the non-immunoglobulin protein is a transcription factor (e.g. human transcription factor) such as ERG, E2F-1 , SMAD3 or TCF7L2. In a further embodiment, the human transcription factor is ERG, E2F-1 or SMAD3.

In one embodiment, the immunoglobulin K light chain constant domain (CK) is a human immunoglobulin K light chain constant domain (CK) .

In one embodiment, the gene encoding the CK domain is fused, suitably with a peptide linker, to the gene encoding the target protein. It will be appreciated that the CK domain may be present at either the N-terminus or C-terminus of the gene encoding the target protein. In one embodiment,

the gene encoding the CK domain is present at the C-terminus of the gene encoding the target protein.

Alternative tags, e.g. the chloramphenicol acetyl transferase (CAT) sequence, have been used to engineer proteins to increase their expression level in cell-free systems, however, these have generally been N-terminal fusions (Son, J . M. et al. (2006) Anal. Biochem. 351 , 187-192; Shaki- Loewenstein, S. et al. (2005) J. Immunol. Meth. 303, 19-39) . The resultant construct therefore increases translation initiation, and consequently also production of the overall protein. Contrary to these results, we have surprisingly shown that fusion of CK to the C-terminus of a target protein significantly enhances in vitro expression of the target protein.

The nucleic acid molecule may additionally comprise one or more of the following: a promoter, a transcriptional and translational regulatory sequence, an untranslated leader sequence, a sequence encoding a cleavage site, a recombination site, a transcriptional terminator or a ribosome entry site. The nucleic acid molecule may further comprise a plurality of cistrons (or open reading frames) or a sequence encoding a reporter protein whose abundance may be quantitated and can provide an accurate measure of expressed protein.

In one embodiment of the invention the nucleic acid molecule comprises one or more promoter (e.g. a T7 promoter) , enhancer (e.g. a genelO enhancer) and a ribosome binding site or translation initiation sequence

(e.g. a Shine Dalgarno (SD) sequence for prokaryotic systems or kozak sequence for eukaryotic systems) . Such sequences are either commercially available and may be purchased, for example, from Roche or may be prepared in accordance with standard methodology.

In one embodiment the nucleic acid molecule comprises one or more transcriptional and translational terminators present at the 3' end of the molecule.

In order to enhance the efficiency of isolation of the resultant expressed target protein, the nucleic acid molecule may additionally comprise a gene encoding an immobilisation tag configured to attach (e.g. covalently or non-covalently) to a protein immobilisation agent.

In one embodiment, the immobilisation tag is a polyhistidine sequence, such as one or more hexahistidine and said protein immobilisation agent is a chelating agent such as Ni-NTA. In a further embodiment, said immobilisation tag is a peptide, domain or protein and said protein immobilisation agent is an antibody specific to said tag.

It will be appreciated that preparation of the nucleic acid construct in step (a) may be performed in accordance with standard molecular biology techniques known to those skilled in the art, for example, those described in He, M. & Taussig, MJ. (2001) Nucleic Acid Res. 29, e73, the nucleic acid construct protocols of which are herein incorporated by reference.

In one embodiment, the process of the invention additionally comprises the step of:

(c) isolating the expressed target protein from the protein expression system.

The presence of the CK domain within the resultant expressed fusion protein of target protein-Cκ domain significantly simplifies isolation of the fusion protein from the protein expression system. For example, the isolation step (c) may be performed by known techniques such as affinity chromatography and the like, which would involve an agent capable of

recognising and binding to the CK domain. Thus, the presence of the CK domain not only provides the significant advantage of enhancing protein expression but also synergistically provides a useful immobilisation tag in which the resultant protein may be isolated without the need for incorporation of additional immobilisation tags. In one embodiment, the isolation step (c) involves affinity chromatography or the like with agents having affinity for the CK domain (e.g. antibodies or Protein L) .

It will be appreciated that the isolation step (c) may be replaced with a detection step wherein detection techniques, such as immunodetection, Western blotting and the like, may be employed to detect the presence of the expressed target protein.

In one embodiment, the process of the invention additionally comprises the step of:

(d) cleaving the CK domain from the expressed target protein.

It will be appreciated that cleavage of the CK domain may be required in order to perform structural and functional studies of the target protein. It will also be appreciated that the cleavage step (d) may be performed either before or after isolation step (c) . In the embodiment wherein the protein expression system is an E. coli cell-free system, cleavage may typically be performed by in situ specific cleavage at an engineered protease site in the E. coli cell-free translation mixture. Such cleavage may be performed by standard procedures known to those skilled in the art, for example, those described in Son, J. M. et al. (2006) Anal. Biochem. 351 , 187-192, the tag cleavage protocols of which are herein incorporated by reference.

An advantage of using an in vitro protein expression system (e.g. a cell- free system) is that they provide an environment in which the conditions of protein expression can be adjusted and controlled through addition of

exogenous bioniolecules or molecules. This makes it possible to generate modified proteins, such as those with co- or post-translational modifications, non-natural or chemically modified amino acids (such as fluorescent groups) .

Thus, in one embodiment of the invention, the protein expression system contains one or more additional agents. In an alternative embodiment, the nucleic acid molecule may comprise a gene encoding one or more additional agents (e.g. gene encoded products such as polypeptides or RNA molecules) .

In one embodiment, the additional agents interact with the expressed fusion or target protein or encode additional agents capable of interacting with the fusion or target protein (e.g. nucleic acids capable of being transcribed and/or translated into a protein binding partner by the protein expression system) .

In a further embodiment, the additional agents are biomolecules or molecules required to produce modifications such as co- or post- translational modifications, non-natural or chemically modified amino acids (such as fluorescent groups or biotin) . In a yet further embodiment, the additional agents are reporter proteins such as an enzyme (e.g. β- galactosidase, chloramphenicol acetyl transferase, β-glucuronidase or the like) or a fluorescent protein (e.g. green fluorescent protein (GFP) , red fluorescent protein, lucif erase or the like) . The additional agents are suitably added into the cell-free lysate, such that the resultant expressed proteins are modified during translation or after immobilisation and may allow the rapid detection of such proteins. In one embodiment, the additional agent comprises one or more protein folding promoting agents. These agents have the advantage of ensuring that the expressed protein is correctly folded.

It is envisaged that the kit of the invention will enable the user to incorporate the gene encoding the target protein into the nucleic acid construct of component (a) and then simply express the target protein in accordance with the process as hereinbefore defined.

In one embodiment, the kit additionally comprises instructions to use said kit in accordance with the process as hereinbefore defined.

It will be appreciated that the kit or composition for in vitro protein expression of a target protein may additionally comprise any other component or feature hereinbefore described with reference to the process of the invention.

The invention will now be described, by way of example only, with reference to the accompanying examples:

EXAMPLES

1. Materials Used Primers

(1) RTST7/B (SEQ ID NO: 1)

(2) PET7/F (SEQ ID NO: 2)

(3) Linker-tag/B (SEQ ID NO: 3) (4) Tterm/F (SEQ ID NO: 4)

(5) HuC4/B (SEQ ID NO: 5)

(6) RzpdCκ/F (SEQ ID NO: 6)

(7) PErzpd/B (SEQ ID NO: 7)

(8) Rzpd-L/F (SEQ ID NO: 8)

Molecular biology reagents and cell-free system

Nucleotides , agarose, PCR Gel Extraction Kit and HRP-linked mouse anti-His antibody were obtained from Sigma, UK; Taq DNA polymerase was obtained from Qiagen, UK; HRP-linked anti-human K antibody was obtained from the Binding Site, UK; NuPAGE Bis-Tris gels were obtained from Invitrogen, CA, USA; PVDF Immobilon-P membranes were obtained from Millipore; Western Blot detection SuperSignal Kit was obtained from Pierce, UK; and coupled E. coli S30 cell-free expression system was obtained from Roche, UK. The rab22b and FKBP2 clones were obtained from Dr. Bernhard Korn.

2. Construction of PCR fragments

The general PCR constructs used for cell-free protein synthesis are shown in Figure 1. The 5' end contains a T7 promoter, a gene 10 enhancer and SD sequence (Roche kit) for efficient transcription and translation. The ORF of the gene of interest was placed after the initiation codon ATG, followed by fusion in frame to the following in order: a flexible peptide linker, a double-(His) 6 tag and two consecutive stop codons (TAATAA) (He, M. & Taussig, MJ. (2001) Nucleic Acid. Res. 29, e73) . When human CK was included, it was placed downstream of the gene ORF and before the peptide linker. A transcription termination region was included at the 3' end of the constructs.

3. PCR generation of individual domains

The standard PCR mixture consisted of 5μl 1O x PCR buffer, lOμl 5xQ, 4μl dNTPs mix containing 2.5 mM of each, 1.5μl of forward and backward primers (16μM each) , IU Taq DNA polymerase, 1-lOng template DNA and water to a final volume of 50 μl.

(a) RTST7 domain , comprising T7 promoter, genelO enhancer and SD sequence, was created using primers RTST7/B and PET7/F from a plasmid template used as a control in the cell free system (Roche, UK) .

(b) Double (His) 6 tag domain, comprising a flexible peptide linker, two hexahistidine sequences, separated by an 11-amino acid spacer sequence, and two consecutive stop codons (TAATAA) , was generated using primers Linker-tag/B and Tterm/F on the plasmid template pTA-His (He, M. & Taussig, MJ. (2001) Nucleic Acid. Res. 29, e73) .

(c) C κ-(His) 6 tag domain was produced using primers HuC4/B and Tterm/F on a plasmid template, which encodes the CK domain with the double- (His) 6 tag fused at the C-terminus.

(d) ORFs of genes to be expressed were amplified using their corresponding plasmids (RZPD, Germany) as templates and individually designed primers . For generation of constructs without CK, primer PErzpd/B and Rzpd-L/F were used, while PErzpd/B and RzρdCκ/F were used for constructs with CK.

4. Assembly PCR

The ORF of the gene of interest and the appropriate domain fragments were assembled by mixing in equimolar ratios (total DNA 50-100 ng) after elution from agarose gel (1%) and adding into a PCR solution containing 2.5 μl 10 x PCR buffer, 1 μl dNTPs mix containing 2.5 mM of each, 1 U Tag DNA polymerase and water to a final volume of 25 μl, and thermal cycling for eight cycles (94°C for 30 s, 54°C for 1 rnin and 72°C for 1 min) . For constructs without CK the fragments assembled were the RTST7 domain, gene ORF and double (His) 6 tag domain, while for the constructs with CK they were the RTST7 domain, gene ORF and CK- (HIs) 6 tag domain.

5. Amplification of PCR constructs

Assembled constructs were amplified by transferring 2μl to a second PCR mixture in a final volume of 50μl (as above) for a further 30 cycles using primers RTST7/B and T-term/F. Thermal cycling for 30 cycles (94 0 C for

30 s, 54 0 C for 1 min and 72 0 C for 1 min, finally, 72° C for 8min) . The final PCR construct was analysed by agarose (1%) gel electrophoresis to determine quality and concentration by comparison with a known DNA marker. The PCR products may be used for cell-free expression with or without further purification.

6. Cell-free protein expression

Proteins were expressed from PCR constructs using the coupled E, coli S30 system, incubated at 3O 0 C for 4 hours. A standard reaction comprised 12μl E. coli S30 lysate, 12μl amino acids, 1O-Ul reaction mix, 5μl reconstitution buffer, lμl methionine and 100-500ng PCR DNA, made to 50,αl with water.

7. Detection of proteins by Western blotting Protein expressed in the E. coli S30 lysate were mixed with an equal volume of 2x SDS buffer (10OmM Tris, pH 8.0, 5% SDS, 0.2% bromophenol blue, 20% glycerol) , heated to 9O 0 C for 5 minutes, loaded onto a 10% NuPAGE Bis-Tris gel and run at 200V. The separated proteins were transferred to a PVDF membrane by electroblotting for 2 hours at 8OmA. The membrane was blocked in 1% bovine serum albumin (BSA) in phosphate-buffered saline (PBS) for 1 hour, then incubated with either HRP-linked mouse anti-His antibody (diluted 1 :4000 in PBS/BSA) or HRP-linked mouse anti-κ antibody (1 :500 in PBS/BSA) for 1 hour. The membrane was developed using the SuperSignal kit (PIERCE, UK) as per the manufacturer's instructions.

Example 1 : Effect of CK fusion on human single chain (sc) antibody expression

This experiment compared expression of human single chain (sc) antibody fragment constructs with and without the human CK domain. The two domain (V H , VJ scFv construct is a standard format for cell based

recombinant antibody expression (Holliger, P. & Hudson, PJ. (2005) Nat. Biotechnol. 23 , 1126-1136) . Anti-carcinoembryonic antigen (CEA) and anti-progesterone scFv fragments, created by eukaryotic ribosome display technology (He, M. & Taussig, MJ. (1997) Nucleic Acids Res. 25 , 5132-5134) were assembled as fusions to a double (His) 6 tag (d(His) 6 ) (He, M. & Taussig, MJ. (2001) Nucleic Acid. Res. 29, e73) or additionally to the CK domain (Cκ-d(His) 6 ) by overlap PCR using the constructs as set out in Figure 1 and using the methods described hereinbefore in sections 2-5. After expression in the E. coli S30 system as described in section 6 above and Western blotting detection by anti-His antibody as described in section 7 showed that neither scFv-d(His) s fragment was expressed detectably (lane 1 in Figures 2A and 2B) , whereas both scFv-Cκ-d(His) 6 fusions successfully led to high expression yields (lane 2 in Figures 2A-2B; 2A: anti-progesterone; 2B: anti-CEA) .

Comparison with standards estimated that at least lOOμg/ml protein was produced after inclusion of the CK domain. Sequencing of the PCR constructs confirmed that the reading frames were in all cases correct and that the only sequence differences were the presence of CK. It was also shown that the scFv-Cκ fragments were retained in the soluble fraction after high speed centrifugation (15,000rpm for 20min) and bound their specific antigens.

Example 2: Effect of CK fusion on GTP and FK506 binding protein expression

To test whether C-terminal fusion to CK could also improve expression of other proteins which were known to be synthesised at very low levels, and particularly non-immunoglobulin related proteins, Rab22b (a GTP binding protein) and FKBP2 (FK506 binding protein) were chosen for this analysis. This experiment was performed in an analogous manner to

Example 1 and the results are shown in Figures 3-4. Lane 1 of Figures 3 A and 4A show that both proteins were only poorly expressed in the E. coli S30 lysate as d(His) 6 constructs and were only weakly detected using anti- His antibody. By contrast, the levels of expression of rab22b-Cκ-d(His) 6 and FKBP2-Cκ-d(His) 6 were both strong (lane 2 of Figures 3A and 4A) . Rab22b-Cκ-d(His) 6 and FKBP2-Cκ-d(His) 6 were also detected using an anti-Cκ monoclonal antibody, confirming high level expression of CK fusion (lane 2 of Figures 3B and 4B) .

When it was possible to detect the non-Cκ tagged protein on a Western blot, the increased expression through inclusion of the CK domain was estimated as more than 10-50 fold.

Example 3: Effect of CK fusion on expression of human transcription factors

Materials:

Plasmids encoding the following binding domains of human transcription factors were obtained from National Public Health Institute, Finland:

ERG: Transcriptional regulator ERG

E2F-1 : Transcription factor E2F1

SMAD 3: Mothers against decapentaplegic homolog 3

TCF7L2: Transcription factor 7 like 2

Constructs:

(a) Constructs without CK fusion (TF)

DNA Constructs were generated by PCR. The 5' end of the PCR constructs contained a T7 promoter, an enhancer and SD sequence followed by ATG. For detection of the expression, both a myc-tag and a

double His-tag were added to N-terminus (myc tag) and C-terminus (His- tag) of the target gene. A transcription termination region was also included at the 3 'end of the PCR construct (Figure 5A)

(b) Constructs with CK fusion (TF- CK)

Constructs with CK fusion were produced by inserting a CK domain between the c-terminus of the target gene and the double His-tag (Figure 5B) .

Results

Constructs with or without the CK fusion were subjected to protein synthesis in an E. coli cell-free translation kit (Roche) . After incubation for 3-4 hrs at 3O 0 C, individual translation mixtures were analysed by SDS- PAGE and Western blot probed by either anti-myc, anti-(His) 6 or anti-Cκ antibodies. The results are shown in Figure 6 which demonstrates the Western blot result, probed by anti-(His) 6 antibody. Figure 6 shows that no bands were detected with all four constructs without CK fusion (Figure 6A) and by contrast proteins, which correspond to their respective molecular size, were strongly detected by antibodies in three of the constructs (ERG, E2F-1 and SMAD 3) in the presence of the CK domain at the C-terminus (Figure 6B) . The reason for non-detection of the TCF7L2 construct may be due to degradation at mRNA or protein level. It is believed that CK expression of the TCF7L2 construct may be achieved by optimisation of expression conditions (such as expression duration and temperature) or addition of RNase or protease inhibitors .

The detection by all the three antibodies (anti-myc, anti-His and anti-Cκ) confirms the expression of these designed epitopes located at both N- and C-terminus of the target gene.