Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
AFFINITY PURIFICATION OF PROTEIN
Document Type and Number:
WIPO Patent Application WO/2006/097708
Kind Code:
A3
Abstract:
We describe fusion proteins comprising a bacterial immunity polypeptide and their use in affinity purification of protein complexes.

Inventors:
KLEANTHOUS COLIN (GB)
GEORGIOU THEONIE (GB)
Application Number:
PCT/GB2006/000901
Publication Date:
January 25, 2007
Filing Date:
March 13, 2006
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV YORK (GB)
KLEANTHOUS COLIN (GB)
GEORGIOU THEONIE (GB)
International Classes:
C07K1/22; C07K14/24; C07K19/00; C12N1/21; C12N5/10; C12N15/31; C12N15/63; C12P21/02
Other References:
QUADRI LUIS E N ET AL: "Characterization of the protein conferring immunity to the antimicrobial peptide carnobacteriocin B2 and expression of carnobacteriocins B2 and BM1", JOURNAL OF BACTERIOLOGY, vol. 177, no. 5, 1995, pages 1144 - 1151, XP002394564, ISSN: 0021-9193
VENEMA K ET AL: "MODE OF ACTION OF LCIA, THE LACTOCOCCIN A IMMUNITY PROTEIN", MOLECULAR MICROBIOLOGY, BLACKWELL SCIENTIFIC, OXFORD, GB, vol. 14, no. 3, 1 March 1994 (1994-03-01), pages 521 - 532, XP000561271, ISSN: 0950-382X
GELI V ET AL: "USE OF A FOREIGN EPITOPE AS A TAG FOR THE LOCALIZATION OF MINOR PROTEINS WITHIN A CELL THE CASE OF THE IMMUNITY PROTEIN TO COLICIN A", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, vol. 85, no. 3, 1988, pages 689 - 693, XP002394565, ISSN: 0027-8424
WALLIS R ET AL: "In vivo and in vitro characterization of overproduced colicin E9 immunity protein.", EUROPEAN JOURNAL OF BIOCHEMISTRY / FEBS. 15 JUL 1992, vol. 207, no. 2, 15 July 1992 (1992-07-15), pages 687 - 695, XP002394566, ISSN: 0014-2956
LAU C ET AL: "BEHAVIOR OF COLICIN E-1 COLICIN E-2 AND COLICIN E-3 ATTACHED TO SEPHADEX BEADS", BIOCHEMISTRY, vol. 15, no. 3, 1976, pages 666 - 671, XP002405089, ISSN: 0006-2960
PAYNE M A ET AL: "Biphasic binding kinetics between FepA and its ligands.", THE JOURNAL OF BIOLOGICAL CHEMISTRY. 29 AUG 1997, vol. 272, no. 35, 29 August 1997 (1997-08-29), pages 21950 - 21955, XP002405090, ISSN: 0021-9258
Attorney, Agent or Firm:
HARRISON GODDARD FOOTE (York YO1 8NQ, GB)
Download PDF:
Claims:

Claims

1. A chimeric fusion protein comprising an immunity polypeptide linked to at least one heterologous polypeptide.

2. A protein according to Claim 1 wherein said immunity polypeptide is linked by a linker molecule to said heterologous polypeptide.

3. A protein according to Claim 2 wherein said linker comprises a cleavable peptidic linker.

4. A nucleic acid molecule which encodes a chimeric polypeptide which nucleic acid molecule comprises: i) a first part consisting of a nucleic acid sequence as represented in Figure 1 or 2 and which encodes at least one polypeptide, or active binding part thereof, which has the activity associated with an immunity protein; or a variant nucleic acid molecule which hybridises to the nucleic acid molecule as represented in Figure IA or IB and which encodes a polypeptide which has the activity associated with an immunity polypeptide; and iii) a second part consisting of a nucleic acid sequence which encodes a heterologous polypeptide wherein said first and second parts are linked.

5. A nucleic acid molecule according to Claim 4 wherein said first and second nucleic acid molecules are linked by a linker molecule.

6. A nucleic acid according to Claim 5 wherein said linker encodes a cleavable peptidic linker.

7. A nucleic acid molecule according to any of Claims 4-6 wherein said first part comprises a nucleic acid molecule consisting of a nucleic acid sequence which encodes at least one colicin DNase immunity polypeptide.

8. A nucleic acid molecule according to Claim 7 wherein said immunity polypeptide is selected from the group consisting of: Im2, Im7, Im8 and Im9.

9. A nucleic acid molecule according to any of Claims 4-6 wherein said first part comprises a nucleic acid molecule consisting of a nucleic acid sequence which encodes at least one colicin RNase immunity polypeptide.

10. A nucleic acid molecule according to Claim 9 wherein said immunity polypeptide is selected from the group consisting of: Im3, Im4, Im5 and Im6.

11. A nucleic acid molecule according to any of Claims 4-10 which encodes a chimeric polypeptide comprising at least two immunity polypeptides wherein said polypeptides are in frame translational fusions.

12. A nucleic acid molecule according to Claim 11 wherein said nucleic acid molecule encodes two immunity polypeptides which bind a similar colicin polypeptide.

13. A nucleic acid molecule according to Claim 11 wherein said nucleic acid molecule encodes two immunity polypeptides which bind dissimilar colicin polypeptides.

14. A nucleic acid molecule according to any of Claims 6-13 wherein said cleavable linker comprises at least one protease sensitive site.

15. A nucleic acid according to Claim 14 wherein said site is a cleavage site for a tobacco etch virus protease.

16. A chimeric polypeptide encoded by a nucleic acid molecule according to any of Claims 4-15.

17. A composition comprising a nucleic acid molecule or polypeptide according to any of Claims 1-16.

18. A vector comprising a nucleic acid molecule according to any of Claims 4-15.

19. A cell transfected with a nucleic acid or vector according to any of Claims 4- 15 or Claim 18.

20. A cell according to Claim 19 wherein said cell is a eukaryotic cell.

21. A cell according to Claim 20 wherein said eukaryotic cell is a mammalian cell.

22. A cell according to Claim 20 wherein said cell is a plant cell.

23. A cell according to Claim 20 wherein said cell is a yeast cell.

24. A cell according to Claim 19 wherein said cell is a prokaryotic cell.

25. The use of the chimeric polypeptide according to any of Claims 1-3 or 16 for the isolation of a complex of biological molecules.

26. Use according to Claim 25 wherein said complex of biological molecules comprises a complex of at least one protein.

27. Use according to Claim 26 wherein said complex is a complex of protein molecules.

28. A method, to isolate a complex of biological molecules from a plurality of biological molecules comprising providing a preparation comprising a plurality of biological molecules and at least one chimeric polypeptide according to any of Claims 1-3 or 16 and incubating the preparation under conditions which allow the association of biological molecules in said preparation with said chimeric polypeptide and isolating the complex of biological molecules associated with said chimeric polypeptide.

29. A method to isolate a complex of biological molecules from a plurality of biological molecules comprising the steps of: i) providing a preparation comprising a plurality of biological molecules and at least one chimeric polypeptide according any of Claims 1-3 or 16 and incubating the preparation under conditions which allow the association of biological molecules in said preparation with said chimeric polypeptide; ii) contacting the mixture with a first affinity matrix comprising a binding partner for said chimeric polypeptide to allow the binding of at least part of said chimeric polypeptide to said matrix and washing said matrix to remove biological molecules non-specifically bound to said chimeric polypeptide and said matrix; iii) eluting from said matrix the bound chimeric polypeptide and associated biological molecules; iv) contacting said eluted chimeric polypeptide with a second affinity matrix comprising a second binding partner for said chimeric polypeptide to allow binding of a different part of said chimeric polypeptide to said second affinity matrix; and v) eluting said chimeric polypeptide from said second matrix and optionally releasing said biological molecules associated with said chimeric polypeptide.

30. A method according to Claim 29 wherein said preparation comprises a cell adapted to express said chimeric polypeptide.

31. A method according to Claim 29 or 30 wherein said complex comprises at least one protein molecule.

32. A method according to Claim 31 wherein said complex comprises a complex of protein molecules.

33. A method according to any of Claims 29-32 wherein said first affinity matrix comprises a colicin polypeptide which binds its cognate immunity polypeptide.

34. A method according to any of Claims 29-32 wherein said second affinity matrix comprises a colicin polypeptide different from the colicin polypeptide of the first affinity matrix.

35. A method according to any of Claims 29-34 wherein said elution is obtained by incubation with a protease which cleaves said linker to release said chimeric polypeptide bound to said first matrix.

36. A method according to any of Claims 29-35 wherein said chimeric polypeptide includes a second protease sensitive site, cleavage of which releases said chimeric polypeptide from said second affinity matrix.

37. A method according to any of Claims 33-36 wherein said colicin polypeptide is selected from the group consisting of: E2, E7, E8 and E9.

38. A method according to any of Claims 33-36 wherein said colicin polypeptide is selected from the group consisting of: E3, E4, E5 and E6.

39. An affinity matrix comprising a substrate and associated crosslinked or conjugated thereto at least one polypeptide encoded by a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of: i) a nucleic acid molecule consisting of a nucleic acid sequence as represented in Figure 5A or 5B; ii) a nucleic acid molecule consisting of a nucleic acid sequence which hybridises to the nucleic acid molecule in (i) and which encodes a polypeptide with nuclease activity; iii) a nucleic acid molecule which is degenerate as a result of the genetic code to the sequences defined in (i) and (ii) above.

40. A matrix according to Claim 39 wherein said nuclease is a colicin DNase polypeptide.

41. A matrix according to Claim 40 wherein said colicin DNase polypeptide is selected from the group consisting of: E2, E7, E8 and E9.

42. A matrix according to Claim 39 wherein said nuclease is an RNase polypeptide.

43. A matrix according to Claim 42 wherein said RNase polypeptide is selected from the group consisting of: E3, E4, E5 and E6.

44. A method for the coupling of at least one colicin polypeptide to a substrate comprising the steps of: ii) providing a preparation comprising a colicin polypeptide and a matrix material; and iii) providing conditions which enable the association, cross- linking or conjugation of said colicin to said substrate.

45. A method according to Claim 44 wherein said substrate is an affinity matrix material.

46. A kit comprising: a nucleic acid or vector according to any of Claims 4-15 or 18, or a polypeptide according to any of Claims 1-3 or 17; an agent which cleaves a cleavable linker in said chimeric polypeptide; and affinity matrix materials required to isolate said chimeric polypeptide.

Description:

AFFINITY PURIFICATION OF PROTEIN

The invention relates to a chimeric fusion protein for use in the affinity purification of polypeptides.

Affinity purification of biological molecules, for example proteins, is known in the art and allows the purification of molecules by exploiting the binding affinity of the target molecule for a molecular binding partner. For example, Staphylococcus Protein A will bind immunoglobulin G and this has been used to purify and/or concentrate antibodies from serum. Further examples of "affinity tags" include, maltose binding protein, glutathione S transferase, calmodulin binding protein and the engineering of polyhistidine tracks into proteins that are then purified by affinity purification on nickel containing matrices. In many cases commercially available vectors and/or kits can be used to fuse a protein of interest to a suitable affinity tag that is subsequently transfected into a host cell for expression and subsequent extraction and purification on an affinity matrix.

Genomic sequencing has resulted in a massive increase in the identification of genes encoding proteins. In some examples the function of a gene is not apparent simply by reference to its linear sequence and there is a desire to identify the function of these unassigned proteins and the protein partners with which they interact in a cell. A reverse genetic approach to the identification of gene function is a laborious and time consuming. A physical method that identifies the protein targets of proteins identified through genomic sequencing is attractive and may be a first step toward assigning a function to an unassigned open reading frame. In addition, it might also be desirable to identify protein binding partners of known proteins in an effort to discover new potential activities for a known protein.

A classical technique called "two-hybrid" has been used to identify the binding partner for a known protein that is described in, for example, Bartel and Fields

(1997) The Yeast Two-Hybrid System, Oxford University Press New York.

Although this technique has been successful in identifying single interacting

partners for a given protein it is prone to error resulting in both false positives and false negatives and is limited in so far as the system is unable to identify higher order structures which are sometimes found within a cell.

A system to identify and isolate protein complexes is described in WO00/09716. The technique is referred to as tandem affinity purification (TAP) and makes use of a two part affinity purification method which utilises a fusion protein containing two different affinity tags, e.g. A and B, separated by a cleavable linker. Typically, a nucleic acid encoding a target protein is sub-cloned adjacent to one of the affinity tags. This creates a fusion protein that consists of:

NH target protein: affinity tag B-cleavable linker-affinity tag A COOH.

The fusion construct is transfected into a cell, for example a yeast cell, and is expressed. Proteins which bind the target become associated with the fusion protein and the cells are broken under non-denaturing conditions and the cell extract is applied to an affinity matrix to which tag A binds. The bound complex is then dis-associated from the affinity matrix after washing by cleavage of the linker, typically a protease sensitive linker. A second round of affinity purification is then conducted with a second affinity matrix to which affinity tag B binds. The second selection step is washed and eluted from the second matrix to provide a purified complex of proteins that is bound to the target protein. The two-step selection reduces non-specific binding and allows the isolation of a complex of proteins as opposed to a single binding partner. In WO00/09716 the first and second affinity tags are Protein A and calmodulin binding protein. A further example of TAP is described in WO03/095619. In this example the first and second affinity tags is a protein with a biotinylation recognition motif and a hexapeptide His tag polypeptide respectively.

There is a desire to identify further affinity tags which have increased affinity for an affinity matrix to further reduce background binding by increasing the binding affinity of an affinity tag for its binding partner on the affinity matrix.

The colicins are a family of protein antibiotics that are made by the Enterobacteriacae during times of stress. These proteins kill susceptible bacterial cells either by acting as ionophores and depolarising the inner membrane or by lytic (e.g. nuclease) activity in the periplasm or cytoplasm. Typically, a colicin comprises a central receptor domain, an amino-terminal translocation domain and a carboxyl-terminal cytotoxic domain. A class of colicins, which are referred to as the enzymatic E class colicins, gains entry into a bacterial cell via contact with the vitamin B 12 receptor and the ToI complex located in the periplasm which triggers translocation of the colicin into the cell. Nine group E colicins are known; El is an ionophore forming colicin, the remainder are either endonucleases or ribonucleases. For example the E3, E4, E5 and E6 colicins are ribonucleases and the E2, E7, E8 and E9 are non specific DNases.

The expression of an E type colicin in a host bacterial cell is problematic since the host cell has to be able to tightly control the activity of the nuclease otherwise the host cell nucleic acid becomes sensitive to nuclease attack. This is overcome by the co-expression of so called "immunity proteins" which are inhibitors of colicins that bind the colicin nuclease domain to neutralise its activity. The complex of immunity protein and colicin is secreted into the extracellular environment and it is this complex that binds a target cell. Once translocation is initiated the immunity protein disassociates, leaving the target cell unprotected against the action of the colicin. An example of such an immunity protein is Im9 that binds with extremely high affinity to the endonuclease (DNase) domain of colicin E9 where, the K d of the complex is 10 "16 M at pH 7 and 25 0 C (Wallis et al (1995) Biochemistry 34, 13743). Another example is the immunity protein Im3 that binds to the ribonuclease (RNase) domain of colicin E3, where the K d of the complex is 10 '12 M at pH 7 and

25°C (Walker et al (2003) Biochemistry 42, 4161). These very high affinities make nuclease-immunity protein complexes as ideal affinity-based purification modules.

A further attraction in the use of colicin DNase-Im protein complexes in affinity- based purifications is the similarly high levels of specificity exhibited by the DNase-domains for their cognate Ln protein partners wherein non-cognate hn proteins are highly discriminated against (Li et al (2004) J. MoI. Biol. 337, 743). For example, the mi7 protein, specific for the DNase domain of colicin E7, binds to the non-cognate E9 DNase domain with a K^ of ~ 10 "4 M, 12-orders of magnitude weaker than the cognate immunity protein mi9 under the same conditions. Therefore, and as described in the following, double-immunity protein fusions can be constructed in which the binding to colicin nuclease columns is specific to the cognate partnership.

We herein disclose an affinity purification methodology that exploits the very high affinity and specificity of immunity proteins for colicin nucleases.

According to an aspect of the invention there is provided a chimeric fusion protein comprising an immunity . polypeptide linked to at least one heterologous polypeptide.

Preferably said immunity polypeptide is linked by a linker molecule to said heterologous polypeptide.

hi a preferred embodiment of the invention said linker comprises a cleavable peptidic linker.

According to an aspect of the invention there is provided a nucleic acid molecule which encodes a chimeric polypeptide which nucleic acid molecule comprises: i) a first part consisting of a nucleic acid sequence as represented in

Figure 1 or 2 and which encodes at least one polypeptide, or active binding part

thereof, which has the activity associated with an immunity protein; or a variant nucleic acid molecule which hybridises to the nucleic acid molecule as represented in Figures 1 and 2 which encodes a polypeptide which has the activity associated with an immunity polypeptide ; and ii) a second part consisting of a nucleic acid sequence which encodes a heterologous polypeptide wherein said first and second parts are linked.

hi a preferred embodiment of the invention said first and second nucleic acid molecules are linked by a linker molecule. Preferably said linker encodes a cleavable peptidic linker.

In a further preferred embodiment of the invention said hybridisation conditions are stringent conditions.

Hybridization of a nucleic acid molecule occurs when two complementary nucleic acid molecules undergo an amount of hydrogen bonding to each other. The stringency of hybridization can vary according to the environmental conditions surrounding the nucleic acids, the nature of the hybridization method, and the composition and length of the nucleic acid molecules used. Calculations regarding hybridization conditions required for attaining particular degrees of stringency are discussed in Sambrook et al., Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 2001); and Tijssen, Laboratory Techniques in Biochemistry and Molecular Biology — Hybridization with Nucleic Acid Probes Part I, Chapter 2 (Elsevier, New York, 1993). The T m is the temperature at which 50% of a given strand of a nucleic acid molecule is hybridized to its complementary strand. The following is an exemplary set of hybridization conditions and is not limiting:

Very HiRh Stringency (allows sequences that share at least 90% identity to hybridize)

Hybridization: 5x SSC at 65°C for 16 hours

Wash twice: 2x SSC at room temperature (RT) for 15 minutes each

Wash twice: 0.5x SSC at 65 0 C for 20 minutes each

High Stringency (allows sequences that share at least 80% identity to hybridize) Hybridization: 5x-6x SSC at 65°C-70°C for 16-20 hours

Wash twice: 2x SSC at RT for 5-20 minutes each

Wash twice: Ix SSC at 55°C-70°C for 30 minutes each

Low Stringency (allows sequences that share at least 50% identity to hybridize) Hybridization: 6x SSC at RT to 55°C for 16-20 hours

Wash at least twice: 2x-3x SSC at RT to 55°C for 20-30 minutes each.

hi a preferred embodiment of the invention said first part comprises a nucleic acid molecule consisting of a nucleic acid sequence which encodes at least one colicin DNase immunity polypeptide.

hi a preferred embodiment of the invention said immunity polypeptide is encoded by the nucleic acid sequence as shown in Table IA or the amino acid sequence as shown in Table IB.

Preferably said immunity polypeptide is selected from the group consisting of: Itn2, Im7, hn8 and Ln9.

hi an alternative preferred embodiment of the invention said first part comprises a nucleic acid molecule consisting of a nucleic acid sequence which encodes at least one colicin RNase immunity polypeptide.

hi a preferred embodiment of the invention said immunity polypeptide is encoded by the nucleic acid sequence as shown in Table 2 A or the amino acid sequence as shown in Table 2B.

Preferably said immunity polypeptide is selected from the group consisting of: Im3, Im4, Im5 and Im6

In a further preferred embodiment of the invention said nucleic acid molecule encodes a chimeric polypeptide comprising at least two immunity polypeptides wherein said polypeptides are in-frame translational fusions.

In a preferred embodiment of the invention said nucleic acid molecule encodes two immunity polypeptides that bind a similar colicin polypeptide.

In an alternative embodiment of the invention said nucleic acid molecule encodes two immunity polypeptides that bind dissimilar colicin polypeptides.

In a preferred embodiment of the invention said cleavable linker comprises at least one protease sensitive site. Preferably said site is a cleavage site for a tobacco etch virus protease.

According to a further aspect of the invention there is provided a chimeric polypeptide encoded by a nucleic acid molecule according to the invention.

hi a preferred embodiment of the invention said chimeric polypeptide comprises at least one part that comprises a variant amino acid sequence.

A variant polypeptide may differ in amino acid sequence by one or more substitutions, additions, deletions, truncations that may be present in any combination. Among preferred variants are those that vary from a reference polypeptide by conservative amino acid substitutions. Such substitutions are those that substitute a given amino acid by another amino acid of like characteristics. The following non-limiting list of amino acids are considered conservative replacements (similar): a) alanine, serine, and threonine; b) glutamic acid and aspartic acid; c) asparagine and glutamine d) arginine and lysine; e) isoleucine, leucine, methionine

and valine and f) phenylalanine, tyrosine and tryptophan. Most highly preferred are variants that retain or enhance the same biological function and activity as the reference polypeptide from which it varies.

In a preferred embodiment of the invention said chimeric polypeptide sequence has 40% or greater sequence identity with the polypeptides hereindisclosed and which retain the biological activity associated with said polypeptides, for example colicin nuclease activity or immunity protein activity.

The invention features parts of said chimeric polypeptide sequences having at least 75% identity with the polypeptide sequences as herein disclosed, or fragments and functionally equivalent polypeptides thereof. In one embodiment, the polypeptides have at least 85% identity, more preferably at least 90% identity, even more preferably at least 95% identity, still more preferably at least 97% identity, and most preferably at least 99% identity with the amino acid sequences disclosed herein and which retain the requisite biological activity.

According to a further aspect of the invention there is provided a composition comprising a nucleic acid molecule or polypeptide according to the invention.

According to a further aspect of the invention there is provided a vector comprising a nucleic acid molecule according to the invention.

In a preferred embodiment of the invention said nucleic acid molecule is operably linked to a promoter that controls the expression of said chimeric polypeptide.

Preferably said nucleic acid molecule is adapted for eukaryotic expression. Typically said adaptation includes, by example and not by way of limitation, the provision of transcription control sequences (promoter sequences) which mediate cell/tissue specific expression. These promoter sequences may be cell/tissue specific, inducible or constitutive.

"Promoter" is an art recognised term and, for the sake of clarity, includes the following features which are provided by example only. Enhancer elements are cis acting nucleic acid sequences often found 5' to the transcription initiation site of a gene (enhancers can also be found 3' to a gene sequence or even located in intronic sequences). Enhancers function to increase the rate of transcription of the gene to which the enhancer is linked. Enhancer activity is responsive to trans acting transcription factors that have been shown to bind specifically to enhancer elements. The binding/activity of transcription factors (please see Eukaryotic Transcription Factors, by David S Latchman, Academic Press Ltd, San Diego) is responsive to a number of physiological/environmental cues that include, by example and not by way of limitation, intermediary metabolites (e.g. glucose), environmental effectors (e.g. heat).

Promoter elements also include so called TATA box and RNA polymerase initiation selection sequences that function to select a site of transcription initiation. These sequences also bind polypeptides that function, inter alia, to facilitate transcription initiation selection by RNA polymerase.

Adaptations also include the provision of selectable markers and autonomous replication sequences that facilitate the maintenance of said vector in either the eukaryotic cell or prokaryotic host. Vectors that are maintained autonomously are referred to as episomal vectors.

Adaptations which facilitate the expression of vector encoded genes include the provision of transcription termination/polyadenylation sequences. This also includes the provision of internal ribosome entry sites (IRES) that function to maximise expression of vector encoded genes arranged in bi-cistronic or multi- cistronic expression cassettes.

These adaptations are well known in the art. There is a significant amount of published literature with respect to expression vector construction and recombinant DNA techniques in general. Please see, Sambrook et al (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbour Laboratory, Cold Spring Harbour, NY and references therein; Marston, F (1987) DNA Cloning Techniques: A Practical

Approach VoI m IRL Press, Oxford UK; DNA Cloning: F M Ausubel et al, Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994).

Vectors can be viral based and may be derived from viruses including adenovirus, retrovirus, adeno-associated virus, herpesvirus, lentivirus; vaccinia virus and baculo virus.

According to a further aspect of the invention there is provided a cell transfected with a nucleic acid or vector according to the invention.

In a preferred embodiment of the invention said cell is a eukaryotic cell.

Preferably said eukaryotic cell is a mammalian cell.

hi an alternative preferred embodiment of the invention said cell is a plant cell.

In a still further preferred embodiment of the invention said cell is a yeast cell.

In an alternative preferred embodiment of the invention said cell is a prokaryotic cell.

According to a further aspect of the invention there is provided the use of the chimeric polypeptide according to the invention for the isolation of a complex of biological molecules.

hi a preferred embodiment of the invention said complex of biological molecules comprises a complex comprising at least one protein. Preferably said complex is a complex of protein molecules.

According to a further aspect of the invention there is provided a method to isolate a complex of biological molecules from a plurality of biological molecules comprising providing a preparation comprising a plurality of biological molecules

and at least one chimeric polypeptide according to the invention and incubating the preparation under conditions which allow the association of biological molecules in said preparation with said chimeric polypeptide and isolating the complex of biological molecules associated with said chimeric polypeptide.

According to a further aspect of the invention there is provided a method to isolate a complex of biological molecules from a plurality of biological molecules comprising the steps of: i) providing a preparation comprising a plurality of biological molecules and at least one chimeric polypeptide according to the invention and incubating the preparation under conditions which allow the association of biological molecules in said preparation with said chimeric polypeptide; ii) contacting the mixture with a first affinity matrix comprising a binding partner for said chimeric polypeptide to allow the binding of at least part of said chimeric polypeptide to said matrix and washing said matrix to remove biological molecules non-specifϊcally bound to said chimeric polypeptide and said matrix; iii) eluting from said matrix the bound chimeric polypeptide and associated biological molecules; iv) contacting said eluted chimeric polypeptide with a second affinity matrix comprising a second binding partner for said chimeric polypeptide to allow binding of a different part of said chimeric polypeptide to said second affinity matrix; and v) eluting said chimeric polypeptide from said second matrix and optionally releasing said biological molecules associated with said chimeric polypeptide.

It will be apparent to the skilled artisan that elution of the chimeric polypeptide and associated biological molecules may be achieved by methods well known in the art and include, by example, elution by alteration in pH, ionic conditions or by

incubation with an agent, for example a chemical agent or a protease, which cleaves the bound chimeric polypeptide from the matrix.

In a preferred method of the invention said preparation comprises a cell adapted to express the chimeric polypeptide according to the invention, said cell providing the plurality of biological molecules.

In a preferred method of the invention said complex comprises at least one protein molecule. Preferably said complex comprises a complex of protein molecules.

It will be apparent that the method according to the invention may be adapted to isolate complexes of biological molecules from mixtures. These complexes maybe protein complexes or, for example, a mixture of protein and nucleic acid e.g. chromatin. The method may also be used to isolate organelles or even whole cells or cell membranes from complex mixtures. The complexes may also be formed from in vitro transcription and/or translation assays formed from cell extracts.

hi a preferred method of the invention said first affinity matrix comprises a colicin polypeptide that binds its cognate immunity polypeptide.

hi a further preferred method of the invention said second affinity matrix comprises a colicin polypeptide different from the colicin polypeptide of the first affinity matrix.

hi a preferred method of the invention said elution is obtained by incubation with a protease which cleaves said linker to release said chimeric polypeptide bound to said first matrix.

hi a further preferred method of the invention said chimeric polypeptide includes a second protease sensitive site, cleavage of which releases said chimeric polypeptide from said second affinity matrix.

In a further preferred method of the invention said colicin is selected from the group consisting of: E2, E7, E8 and E9.

In an alternative preferred method of the invention said colicin is selected from the group consisting of: E3, E4, E5 and E6.

According to a further aspect of the invention there is provided an affinity matrix comprising a substrate and associated crosslmked or conjugated thereto at least one polypeptide encoded by a nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of: i) a nucleic acid molecule consisting of a nucleic acid sequence as represented in Figure 5A or 5B; ii) a nucleic acid molecule consisting of a nucleic acid sequence which hybridises to the nucleic acid molecule in (i) and which encodes a polypeptide with nuclease activity; iii) a nucleic acid molecule which is degenerate as a result of the genetic code to the sequences defined in (i) and (ii) above.

In a preferred embodiment of the invention said nuclease is a colicin DNase.

In a preferred embodiment of the invention said colicin DNase is selected from the group consisting of: E2, E7, E8 and E9.

hi an alternative preferred embodiment of the invention said nuclease is an RNase.

In a preferred embodiment of the invention said RNase is selected from the group consisting of: E3, E4, E5 and E6.

According to a further aspect of the invention there is provided a method for the coupling of at least one colicin polypeptide to a substrate comprising the steps of:

i) providing a preparation comprising a colicin polypeptide and a matrix material; and ii) providing conditions which enable the association, cross-linking or conjugation of said colicin to said substrate.

In a preferred method of the invention said substrate is an affinity matrix material.

According to a further aspect of the invention there is provided a kit comprising: a nucleic acid or vector according to the invention or a polypeptide according to the invention; an agent which cleaves a cleavable linker in said chimeric polypeptide according to the invention; and affinity matrix materials required to isolate said chimeric polypeptide.

According to an aspect of the invention there is provided a chimeric fusion protein comprising a colicin polypeptide linked to at least one heterologous polypeptide.

According to a further aspect of the invention there is provided a nucleic acid molecule which encodes a chimeric polypeptide which nucleic acid molecule comprises: i) a first part consisting of a nucleic acid sequence as represented in

Figure 5A or 5B and which encodes at least one polypeptide, or part thereof, which has nuclease activity; or a variant nucleic acid molecule which hybridises to the nucleic acid molecule as represented in Figure 5A or 5B and which encodes a polypeptide which has nuclease activity; and ii) a second part consisting of a nucleic acid sequence which encodes a heterologous polypeptide wherein said first and second parts are linked.

In a preferred embodiment of the invention said chimeric polypeptide is a variant polypeptide which has reduced nuclease activity. Preferably said chimeric polypeptide is a variant which lacks nuclease activity.

Aspects, embodiments, uses and methods applicable to chimeric polypeptides comprising immunity polypeptides are equally applicable to colicin comprising chimeric polypeptides.

Throughout the description and claims of this specification, the words "comprise" and "contain" and variations of the words, for example "comprising" and "comprises", means "including but not limited to", and is not intended to (and does not) exclude other moieties, additives, components, integers or steps.

Throughout the description and claims of this specification, the singular encompasses the plural unless the context otherwise requires, hi particular, where the indefinite article is used, the specification is to be understood as contemplating plurality as well as singularity, unless the context requires otherwise.

Features, integers, characteristics, compounds, chemical moieties or groups described in conjunction with a particular aspect, embodiment or example of the invention are to be understood to be applicable to any other aspect, embodiment or example described herein unless incompatible therewith.

An embodiment of the invention will now be described by example only and with reference to the following figures:

Figure 1 is the nucleic acid sequence which encodes DNase immunity protein Im9;

Figure 2 is the nucleic acid sequence which encodes RNase immunity protein Im3;

Figure 3 is a schematic for double immunity protein tag showing the gene construction for dImP79, where hn7 and lm.9 have been fused and are separated by a cleavage linker, trs: TEV protease recognition sequence based on highly aphid transmissable TEV protease;

Figure 4 shows a 16% SDS-PAGE of TEV protease cleavage of dlmp79 in the presence of colicin DNases (E9 and E7) and alone (last lane). & Imp79 and Imp79 alone were subject to TEV protease cleavage;

Figure 5A is the nucleic acid sequence of a colicin DNase domain E9; Figure 5B is the nucleic acid sequence of a colicin RNase domain E3;

Figure 6 shows a 16% SDS-PAGE of the purification steps using Im9 as a fusion purification tag. 1: uninduced cells of pNC3/BL21 DE3; 2: induced, overexpressed Im9 fused to a 34 residue peptide sequence (Bn9Gol); 3&4: pellet and supernatant respectively, after cell disruption. Supernatant (lane 4) was loaded onto an E9 DNase-linked sepharose 4B column; 5: flow through off column during loading; 6&7: high salt washes of column; 8: eluted and dialysed protein Im9Gol. *, denotes a breakdown product of the fusion protein;

Table IA (amino acid) and B (DNA) illustrates an alignment from the immunity proteins of the DNase family, based on the Im9 protein showing greater than 40% sequence identity. Representative proteins are shown for each sub-family such that OrfUl represents the family of OrfUl proteins that share at least 94% sequence identity. Programmes used were BLAST (at http://www.ncbi.nlm.nih.gov/BLAST) and ClustalW (at http://npsa-pbil.ibcp.fr/cgi-bin/align clustalw.pl);

Table 2A (amino acid) and B (DNA) illustrates proteins from the RNase immunity family as based on the Im3 protein showing greater than 40% sequence identity. Also included within this family is the protein from Pseudomonas fluorescens (alignment not shown), (http://npsa-pbil.ibcp.fr/cgi-bin/align clustalw.pl);

Table 3A (amino acid) and B (DNA) illustrates alignment of the DNase family, based on the E9 DNase protein showing greater than 40% sequence identity. Representative proteins are shown for each sub-family such that Uro_Ecoli represents the family of uropathogenic specific proteins that share at least 78%

sequence identity. Programmes used were BLAST (at http://www.ncbi.nlm.nih.gov/BLAST) and ClustalW (at http://npsa-pbil.ibcp.fr/cgi- bin/align clustalw.pl); and

Table 4A (amino acid) and B (DNA) illustrates an alignment of RNases based on homology to E3 RNase where proteins are identified by greater than 40% homology. Representative sequences are shown for each protein family such that the protein from Pseudomonas fluorescens is typical of the family. Programmes used were BLAST (at http://www.ncbi.nlm.nih.gov/BLAST) and ClustalW (at http://npsa-pbil.ibcp.fr/cgi-bin/align clustalw.pl).

Materials and Methods

Construction of double Immunity genes and variants The plasmids pRJ347 & pRJ345 (coding imml & imm9 genes respectively) were used as the templates. The 5' gene of the double tag was engineered to include an Ndel and a BamHl restriction site in frame to the gene. The 3' gene was designed to include Xbal and EcoRI restriciton sites after the stop codon. Primers were designed to amplify the genes, including overhang to code for the central TEV protease recognition sequence based on HAT TEV protease. The 5' gene in the construct had the stop codon removed, allowing for direct readthrough. The two first round per products were used in a further per round where both products were used to anneal to each other. The final per product was purified and cloned into the Zero Blunt TOPO PCR Cloning Kit (Invitrogen) and used to transform E. coli TOPlO competent cells (Invitrogen). PCR colony screens identified gene fragments of the correct length. These colonies were amplified and the plasmids purified (Qiagen QIAprep Spin Miniprep Kit). DNA sequencing verified the sequence. To further weaken any possible interactions of the 2 nd tag to the E9 DNase resin, the Im7(D52A, Im7 numbering) variant of the tags was constructed using whole plasmid site directed mutagenesis, according to manufacturers instructions (Stratagene, PfU Turbo DNA polymerase). Genes constructed in this way include

dImP97 (imm9-trs-immT), dImP97a, dlmp79 and dImP7a9. A schematic of the gene construction is shown below.

Purification and Assay of the dlmP Tag dlrnP tags were assayed for in vitro viability by excising the genes from the cloning vector with Ndel, EcoRI, ligating into similarly restricted pET22b (Novagen) and transforming competent E. coli BL21 DE3. Vector with insert was identified by colony per screen and these plasmids named pDVIP79, pDVIP7a9, pIMP97 and ρEMP97a. Trials were conducted for protein expression prior to protein purification. dlmP protein was produced in E. coli growing in LB-amp (lOOμg/ml) at 37 °C and induced with ImM IPTG during exponential growth. Cells were harvested, sonicated and purified essentially as for Immunity proteins (Wallis et al., 1992). Purified proteins were dialysed into 20 mM TEA, pH 7.5, snap frozen and stored at -20°C.

TEV protease cleavage of dlmP in the presence and absence of cognate DNase domains Protease cleavage was carried out for E9-mip79, E7-lmp79 & Imp79 to assess cleavage of the dlmP in complex with the cognate DNases. Approximately 40 μg of Imp79 (in complex with E7 DNase, E9 DNase domain, or alone) was incubated with 20 U of TEV protease for 1 hr 40 mins at 30 0 C in buffer (50 mM Tris-HCl, pH7.5, 10O mM NaCl).

Cross linking E-DNase domains to Sepharose Purified E9 and E7 DNase domains (Garinot-Schneider et al., 1996) were cross linked to CNBr-activated Sepharose 4B beads (Amersham Biosciences) according to manufacturers guidelines. mi9 was also cross-linked to the resin in a similar manner.

Complexes in the presence of detergents

As a test of complex formation in the presence of detergents, both the E9 DNase- Im9 and E3 RNase-Im3 complexes were preformed in detergent (β-octylglucoside upto 1% w/v; Triton XlOO upto l%v/v) and analysed for complex formation by size exclusion chromatography. Detergents do not destroy complexation of nuclease domains and their immunity proteins. These experiments show that the complexes survive the addition of detergents and will also form complexes when detergents are added prior to complex formation. The detergent NP40 was used (upto 0.5%v/v) in buffer for E9 DNase cross-linked Sepharose and adding the dlmP protein tag. The tag still binds to the resin in the presence of detergent.

Immunity Proteins as purification fusion tags for polypeptides

The imm9 gene was engineered to extend its sequence to include a TEV protease recognition site C-terminal. The gene construct was designed to include restriction sites for the cloning of target proteins 3 'to the protease site. The test system was the

Im9Gol fusion protein, where GoI is a 35 amino acid polypeptide co-activator in bacteriophage exclusion systems. Restriction sites are (5 '-3'):

NdeI(imm9)(trs)XhoϊHindID.(gor)BamHl. The gene construct was ligated into pETllc and pET15b vectors (Novagen). Purification of overexpressed product is carried out using either anion exchange chromatography or E9 DNase-cross linked resin.