Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SUPEROXIDE DISMUTASE-4
Document Type and Number:
WIPO Patent Application WO/1995/027781
Kind Code:
A1
Abstract:
Polynucleotides which encode the polypeptide SOD-4, as well as such polypeptides, and antibodies against the polypeptide and the use of the polypeptide as a pharmaceutical for treating cerebral ischaemia, ulcers, inflammation, arrhythmia, oedema and paraquat intoxication as well as rheumatoid arthritis, osteoarthritis and radiation injury.

Inventors:
YU GUO-LIANG (US)
ROSEN CRAIG A (US)
FRASER CLAIRE M (US)
GOCAYNE JEANNINE D (US)
Application Number:
PCT/US1994/006099
Publication Date:
October 19, 1995
Filing Date:
May 31, 1994
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
HUMAN GENOME SCIENCES INC (US)
YU GU LIANG (US)
ROSEN CRAIG A (US)
FRASER CLAIRE M (US)
GOCAYNE JEANNINE D (US)
International Classes:
A61K38/44; A61K48/00; A61P1/04; C12N15/09; A61P9/06; A61P9/10; A61P17/02; A61P19/02; A61P29/00; A61P43/00; C07H21/04; C07K16/40; C12N9/02; C12N9/04; C12N15/53; C12P21/08; A61K38/00; C12R1/91; (IPC1-7): C12N9/02; C12N15/53; C12N15/67; A61K37/50
Other References:
NATURE, Volume 362, issued 04 March 1993, D.R. ROSEN et al., "Mutations in Cu/Zn Superoxide Dismutase Gene are Associated with Familial Amyotrophic Lateral Sclerosis", pages 59-62.
ARCH. BIOCHEM. BIOPHYS., Volume 206, No. 2, issued February 1981, J.A. BAUM et al., "Isolation and Characterization of the Cytosolic and Mitochondrial Superoxide Dismutase of Maize", pages 249-264.
CHEMICAL ABSTRACTS, Volume 117, issued 1992, H. HATANAKA et al., "Preparation of Superoxide Dismutase (SOD) Containing Heparin-Binding Site for Protease Resistance", page 375, Abstract No. 166676j; & JP,A,04 108 379.
MOL. GEN. GENET., Volume 219, issued 1989, R.E. CANNON et al., "Two cDNAs Encode Two Nearly Identical Cu/Zn Superoxide Dismutase Proteins in Maize", pages 1-8.
SCIENCE, Volume 239, issued 1988, C.C. LEE et al., "Generation of cDNA Probes Directed by Amino Acid Sequence: Cloning of Urate Oxidase", pages 1288-1291.
Download PDF:
Claims:
WHAT IS CLAIMED IS:
1. An isolated polynucleotide encoding for SOD4, said polynucleotide selected from the groupε conεiεting of (a) a polynucleotide encoding for the SOD4 polypeptide having the deduced amino acid εequence of Figure 1 or a fragment, analog or derivative of εaid polypeptide; (b) a polynucleotide encoding for the SOD4 polypeptide having the amino acid εequence encoded by the cDNA contained in ATCC Depoεit No. 75716 or a fragment analog or derivative of εaid polypeptide.
2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA.
3. The polynucleotide of Claim 1 wherein the polynucleotide is RNA.
4. The polynucleotide of Claim 1 wherein the polynucleotide is genomic DNA.
5. The polynucleotide of Claim 2 wherein said polynucleotide encodes for SOD4 having the deduced amino acid sequence of Figure 1.
6. The polynucleotide of Claim 2 wherein said polynucleotide encodes for the SOD4 polypeptide encoded by the cDNA of ATCC Depoεit No. 75716.
7. The polynucleotide of Claim 1 having the coding sequence for SOD4 as shown in Figure 1.
8. The polynucleotide of Claim 2 having the coding sequence for SOD4 deposited as ATCC Deposit No. 75716.
9. A vector containing the DNA of Claim 2.
10. A host cell genetically engineered with the vector of Claim 9.
11. A procesε for producing a polypeptide compriεing: expressing from the host cell of Claim 10 the polypeptide encoded by said DNA.
12. A process for producing cells capable of expressing a polypeptide comprising genetically engineering cells with the vector of Claim 9.
13. An isolated DNA hybridizable to the DNA of Claim 2 and encoding a polypeptide having SOD4 activity.
14. A polypeptide selected from the group consiεting of (i) a SOD4 polypeptide having the deduced amino acid sequence of Figure 1 and fragments, analogε and derivativeε thereof and (ii) a SOD4 polypeptide encoded by the cDNA of ATCC Depoεit No. 75716 and frag entε, analogs and derivatives of εaid polypeptide.
15. The polypeptide of Claim 14 wherein the polypeptide is SOD4 having the deduced amino acid sequence of Figure 1.
16. An antibody against the polypeptide of claim 14.
17. A method for the treatment of a patient having need of SOD4 comprising: administering to the patient a therapeutically effective amount of the polypeptide of claim 14.
18. A pharmaceutical composition comprising the polypeptide of Claim 14 and a pharmaceutically acceptable carrier.
19. The method of Claim 18 wherein said polypeptide is administered by providing to the patient DNA encoding said polypeptide and expreεεing εaid polypeptide in vivo .
20. A proceεε for detecting a disease related to a mutation of the DNA of Claim 1, comprising: detecting in a sample derived from a hoεt the preεence of a mutation in the DNA of Claim 1.
Description:
SUPEROXIDE DISMUTASE-4

This invention relates to newly identified polynucleotides, polypeptides encoded by such polynucleotides, the use of such polynucleotides and polypeptides, as well as the production of such polynucleotides and polypeptides. More particularly, the polypeptide of the present invention is Superoxide Dismutase- 4 (SOD-4) .

There is a very strong thermodynamic driving force for the reactions between oxygen and biochemical compounds in the body such as proteins, carbohydrates, lipids and nucleic acids. If such reactions go to completion, water, carbon dioxide and a number of waste products are formed as end products with the release of large amounts of energy. Oxidation of biological compounds is the source of energy of living organisms. Such reactions occur spontaneously but are very slow due to reaction barriers. These barriers are overcome by enzymes in intermediary metabolism, and the final reaction with oxygen takes place in the mitochondria, where the oxygen is reduced by four electrons to water without the liberation of any intermediate products. The reaction is accomplished by cytochrome oxidase complex in the electron transport chain and the energy is bound by the formation of ATP.

However, the direct four step reduction of oxygen to water is unique, and when oxygen reacts spontaneously or is catalyzed by enzymes it is forced to react one step at a time. A series of reactive and toxic intermediates are formed, namely the superoxide radical (0 2 "), hydrogen peroxide (H 2 0 2 ), and the hydroxyl radical (OH " ).

Two of these, 0 2 " and OH", have single unpaired electrons and are therefore called free radicals. A few percent of the oxygen consumption in the body has been estimated to lead to the formation of the toxic reduction intermediates. The toxic affects of oxygen are mainly ascribable to the actions of these intermediates.

Oxygen in itself reacts slowly with most biochemical compounds. The toxic reactions are in general initiated by processes giving rise to oxygen radicals, which in themselves cause direct damage to biochemical compounds or start chain reactions involving oxygen.

Some compounds react spontaneously with oxygen, i.e., they autoxidize. Virtually all autoxidations result in the formation of toxic oxygen reduction intermediates. Autoxidation of adrenalin, pyrogallol and several other compounds lead to the formation of the superoxide radical. When ionizing radiation passes through an aqueous solution containing oxygen, the superoxide radical is the radical found in the highest concentration. The toxic oxygen reduction products so formed are of fundamental importance for the killing ability of the cells, but may also lead to damage in the surrounding tissue.

Hydrogen peroxide is always formed when superoxide is formed by way of the dismutation reaction. Most oxidases in the body directly reduce oxygen to hydrogen peroxide.

Organisms living in the presence of oxygen have been forced to develop a number of protective mechanisms against the toxic oxygen reduction metabolites. The protective

factors include superoxide dismutases (SOD) which dismutate the superoxide radical and are found in relatively constant amounts in mammalian cells and tissue. The best known of these enzymes is CuZnSOD which is a dimer with a molecular weight of 33,000 containing two copper and two zinc atoms. CuZnSOD is found in the cytosol and in the intermembrane space of the mitochondria. MnSOD is a tetramer with a molecular weight of 85,000 containing four Mn atoms, and is mainly located in the mitochondrial matrix. Until recently the extra cellular fluids were assumed to lack SOD activity. However U.S. Patent No. 5,248,603 recently disclosed the presence of a superoxide dis utase in extracellular fluids (e.g., blood plasma, lymph, synovial fluid and cerebrospinal fluid) which was termed EC-SOD.

Crystallographic structures of recombinant human CuZnSOD have been determined, refined and analyzed at 2.5 A resolution for wild-type and a designed thermal stable double-mutant enzyme (Cys-6 Ala, Cys-111 Ser). There is a helix dipole interaction with a Zn site, and 14 residues form two or more structurally conserved side-chain to main- chain hydrogen bonds that appear critical to active-site architecture, loop confirmation and the increased stability resulting from the Cys-111 Ser mutation. Parge, H.E. et al, Proc. Natl. Acad. Sci. U.S.A., 89:6109-13 (1992).

Mutations in the CuZnSOD gene occur in patients with the fatal neurodegenerative disorder familial amyotrophic lateral sclerosis. Screening of the CuZnSOD coding region revealed that the mutation Ala 4 to Val in exon 1 was the most frequent one, mutations were identified in exons 2, 4 and 5 but not in the active site region formed by exon 3. Thus, defective CuZnSOD is linked to motor neuron death and carries implications for understanding and possible treatment of familial amyotrophic lateral sclerosis. The polypeptide of the present invention, SOD-4, is structurally and functionally related to CuZnSOD.

Japanese Patent No. 4327541 discloses a therapeutic drug for immuno-reactions with organs after transplantation containing the active substance of human CuZnSOD obtained by gene recombination.

Japanese Patent No. 4312533 discloses a composition for treating cerebral ischaemia which comprises recombinant CuZn human SOD and inhibits delayed nerve necrosis accompanying ischaemia.

Japanese Patent No. 4248984 discloses a superoxide dismutase derivative which has a longer half-life in blood than SOD and therefore helps treat various diseases.

European Patent No. 499621 discloses a method for purifying recombinant CuZnSOD and a method for increasing the yield of the B isoform analog of this polypeptide.

Japanese Patent No. 2156884 discloses a 153 amino acid polypeptide having human superoxide dismutase properties and a DNA sequence encoding such polypeptide, a DNA sequence expressed by the nucleic acid sequence and production of the polypeptide by culture of host cells.

Japanese Patent No. 63313581 discloses a pharmacologically active modified superoxide dismutase which is obtained by reacting SOD with a compound containing an amino or carboxyl group.

Japanese Patent No. 63077822 discloses an agent for improving the function of organs which uses a human SOD-like polypeptide as the active substance.

In accordance with one aspect of the present invention, there is provided a novel mature polypeptide which is SOD-4, as well as fragments, analogs and derivatives thereof. The polypeptide of the present invention is of human origin.

In accordance with another aspect of the present invention, there are provided polynucleotides (DNA or RNA) which encode such polypeptides.

In accordance with yet a further aspect of the present invention, there is provided a process for producing such polypeptides by recombinant techniques.

In accordance with yet a further aspect of the present invention, there is provided a process for utilizing such polypeptides, or polynucleotides encoding such polypeptides for therapeutic purposes, for example, for treating inflammatory pathologies, ulcers, arrhythmia, ischaemia, oedema, paraquat intoxication, rheumatoid arthritis and oεteoarthritis, reducing reperfusion injuries and decreasing blood pressure.

In accordance with yet a further aspect of the present invention, there is provided an antibody against such polypeptides. These and other aspects of the present invention should be apparent to those skilled in the art from the teachings herein.

The following drawings are illustrative of embodiments of the invention and are not meant to limit the scope of the invention as encompassed by the claims.

Fig. 1 shows the cDNA sequence and deduced amino acid sequence for the SOD-4 gene. The amino acid sequence encodes for one of the mature forms of the polypeptide, since there are at least two in-frame ATG start codons. The mature polypeptide could start at either one of the ATG codons. The standard one letter abbreviation for amino acids is used.

Fig. 2 displays the amino acid homology between SOD-4 with eleven other cytosolic CuZnSODs from various species. The copper-zinc-bind sites (in boldface type) are formed by six His residues and one Asp residue. The Arg (R) residue is believed necessary to guide the superoxide to the activity site. Identical residues are represented by dashes and deletions are represented by dots.

Fig. 3 shows the results of bacterial expression and purification of human SOD-4 after separation on an SDS polyaerylamide gel.

Fig. 4 shows the results of expression of recombinant SOD-4 in COS cells after separation on an SDS polyacrylamide gel.

In accordance with an aspect of the present invention, there is provided an isolated nucleic acid (polynucleotide) which encodes for the mature polypeptide having the deduced amino acid sequence of Figure 1 or for the mature polypeptide e.ncoded by the cDNA of the clone deposited as ATCC Deposit No. 75716 on March 22, 1994.

The polynucleotide of the present invention was isolated from an early stage human brain cDNA library. It contains an open reading frame encoding a polypeptide of 255 amino acids. The polypeptide has the highest degree of homology to CuZnSOD isolated from Schistosoma mansoni having 51% identity and 72% similarity over a 151 amino acid overlap.

The polynucleotide of the present invention may be in the form of RNA or in the form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA. The DNA may be double- stranded or single-stranded, and if single stranded may be the coding strand or non-coding (anti-sense) strand. The coding sequence which encodes the mature polypeptide may be identical to the coding sequence shown in Figure 1 or that of the deposited clone or may be a different coding sequence which coding sequence, as a result of the redundancy or degeneracy of the genetic code, encodes the same, mature polypeptide as the DNA of Figure 1 or the deposited cDNA.

The polynucleotide which encodes for the mature polypeptide of Figure 1 or for the mature polypeptide encoded by the deposited cDNA may include: only the coding sequence for the mature polypeptide; the coding sequence for the mature polypeptide and additional coding sequence such as a leader or secretory sequence or a proprotein sequence; the coding sequence for the mature polypeptide (and optionally additional coding sequence) and non-coding sequence, such aε

introns or non-coding sequence 5 ' and/or 3 ' of the coding sequence for the mature polypeptide.

Thus, the term "polynucleotide encoding a polypeptide" encompasses a polynucleotide which includes only coding sequence for the polypeptide as well as a polynucleotide which includes additional coding and/or non-coding sequence.

The present invention further relates to variants of the hereinabove described polynucleotides which encode for fragments, analogs and derivatives of the polypeptide having the deduced amino acid sequence of Figure 1 or the polypeptide encoded by the cDNA of the deposited clone. The variant of the polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a non-naturally occurring variant of the polynucleotide.

Thus, the present invention includes polynucleotides encoding the same mature polypeptide as shown in Figure 1 or the same mature polypeptide encoded by the cDNA of the deposited clone as well as variants of such polynucleotides which variants encode for a fragment, derivative or analog of the polypeptide of Figure 1 or the polypeptide encoded by the cDNA of the deposited clone. Such nucleotide variants include deletion variants, substitution variants and addition or insertion variants.

Aε hereinabove indicated, the polynucleotide may have a coding sequence which is a naturally occurring allelic variant of the coding sequence shown in Figure 1 or of the coding sequence of the deposited clone. As known in the art, an allelic variant iε an alternate form of a polynucleotide sequence which may have a substitution, deletion or addition of one or more nucleotides, which does not substantially alter the function of the encoded polypeptide.

The polynucleotides of the present invention may also have the coding sequence fused in frame to a marker sequence which allows for purification of the polypeptide of the present invention. The marker sequence may be a hexa-

hiεtidine tag supplied by a pDIO vector to provide for purification of the mature polypeptide fused to the marker in the case of a bacterial host, or, for example, the marker sequence may be a hemagglutinin (HA) tag when a mammalian host, e.g. COS-7 cells, is used. The HA tag corresponds to an epitope derived from the influenza hemagglutinin protein (Wilson, I., et al.. Cell, 37:767 (1984)).

The present invention further relates to polynucleotides which hybridize to the hereinabove-described sequences if there is at least 50% and preferably 70% identity between the sequences. The present invention particularly relates to polynucleotides which hybridize under- stringent conditions to the hereinabove-described polynucleotides . As herein used, the term "stringent conditions" means hybridization will occur only if there is at least 95% and preferably at least 97% identity between the εequenceε. The polynucleotides which hybridize to the hereinabove deεcribed polynucleotides in a preferred embodiment encode polypeptides which retain substantially the same biological function or activity as the mature polypeptide encoded by the cDNA of Figure 1 or the deposited cDNA.

The deposit(ε) referred to herein will be maintained under the terms of the Budapest Treaty on the International Recognition of the Deposit of Micro-organisms for purposes of Patent Procedure. These deposits are provided merely as convenience to those of skill in the art and are not an admission that a deposit is required under 35 U.S.C. §112. The sequence of the polynucleotides contained in the deposited materials, as well as the amino acid sequence of the polypeptides encoded thereby, are incorporated herein by reference and are controlling in the event of any conflict with any description of sequences herein. A license may be required to make, use or sell the deposited materials, and no such license is hereby granted.

The present invention further relates to a SOD-4 polypeptide which has the deduced amino acid sequence of Figure 1 or which has the amino acid sequence encoded by the deposited cDNA, as well as fragments, analogs and derivatives of such polypeptide.

The terms "fragment," "derivative" and "analog" when referring to the polypeptide of Figure 1 or that encoded by the deposited cDNA, means a polypeptide which retains essentially the same biological function or activity as such polypeptide. Thuε, an analog includeε a proprotein which can be activated by cleavage of the proprotein portion to produce an active mature polypeptide.

The polypeptide of the present invention may be a recombinant polypeptide, a natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide.

The fragment, derivative or analog of the polypeptide of Figure 1 or that encoded by the deposited cDNA may be (i) one in which one or more of the amino acid residues are substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic code, or (ii) one in which one or more of the amino acid residues includeε a substituent group, or (iii) one in which the mature polypeptide is fused with another compound, such as a compound to increase the half-life of the polypeptide (for example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to the mature polypeptide, such as a leader or secretory sequence or a sequence which is employed for purification of the mature polypeptide or a proprotein sequence. Such fragments, derivatives and analogs are deemed to be within the scope of those skilled in the art from the teachings herein.

The polypeptides and polynucleotideε of the present invention are preferably provided in an isolated form, and preferably are purified to homogeneity.

The term "isolated" means that the material is removed from its original environment (e.g., the natural environment if it is naturally occurring) . For example, a naturally- occurring polynucleotide or polypeptide present in a living animal is not isolated, but the same polynucleotide or polypeptide, separated from some or all of the coexisting materials in the natural system, is isolated. Such polynucleotides could be part of a vector and/or such polynucleotides or polypeptides could be part of a composition, and still be isolated in that such vector or composition is not part of its natural environment.

The present invention also relates to vectors which include polynucleotides of the present invention, host cells which are genetically engineered with vectors of the invention and the production of polypeptides of the invention by recombinant techniques.

Host cells are genetically engineered (transduced or transformed or transfected) with the vectors of this invention which may be, for example, a cloning vector or an expreεεion vector. The vector may be, for example, in the form of a plasmid, a viral particle, a phage, etc. The engineered host cells can be cultured in conventional nutrient media modified as appropriate for activating promoters, selecting transformants or amplifying the SOD-4 genes. The culture conditions, such as temperature, pH and the like, are those previously used with the host cell εelected for expreεεion, and will be apparent to the ordinarily εkilled artisan.

The polynucleotides of the present invention may be employed for producing polypeptides by recombinant techniqueε. Thus, for example, the polynucleotide may be included in any one of a variety of expresεion vectors for expressing a polypeptide. Such vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA;

baculovirus; yeast plasmids; vectors derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, any other vector may be uεed as long as it is replicable and viable in the host.

The appropriate DNA sequence may be inserted into the vector by a variety of procedures. In general, the DNA s.equence is inserted into an appropriate restriction endonuclease site(s) by procedures known in the art. Such procedures and others are deemed to be within the scope of those skilled in the art.

The DNA sequence in the expresεion vector is operatively linked to an appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. As representative examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E. coli. lac or trp. the phage lambda P L promoter and other promoters known to control expression of genes in prokaryotic or eukaryotic cells or their viruses. The expression vector also contains a riboso e binding site for translation initiation and a transcription terminator. The vector may also include appropriate sequences for amplifying expression.

In addition, the expresεion vectors preferably contain one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells such as dihydrofolate reductaεe or neomycin resistance for eukaryotic cell culture, or such aε tetracycline or ampicillin resistance in E. coli.

The vector containing the appropriate DNA sequence as hereinabove described, as well as an appropriate promoter or control sequence, may be employed to transform an appropriate host to permit the host to expresε the protein.

As representative examples of appropriate hosts, there may be mentioned: bacterial cells, such as E. coli. Strepto γces. Salmonella typhimurium; fungal cells, such as

yeast; insect cells such as Drosophila and Sf9; animal cells such as CHO, COS or Bowes melanoma; plant cells, etc. The selection of an appropriate host is deemed to be within the scope of thoεe εkilled in the art from the teachingε herein.

More particularly, the preεent invention alεo includes recombinant constructs comprising one or more of the sequences as broadly described above. The constructs comprise a vector, such aε a plasmid or viral vector, into which a εequence of the invention haε been inserted, in a forward or reverse orientation. In a preferred aspect of this embodiment, the construct further compriseε regulatory εequenceε, including, for example, a promoter, operably linked to the εequence. Large numberε of εuitable vectors and promoters are known to those of skill in the art, and are commercially available. The following vectors are provided by way of example. Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pbs, pDIO, phagescript, psiX174, pbluescript SK, pbεks, pNH8A, pNH16a, pNH18A, pNH46A (Stratagene) ; ptrc99a, pKK223- 3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLNEO, pSV2CAT, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, any other plasmid or vector may be used as long as they are replicable and viable in the host.

Promoter regions can be εelected from any deεired gene uεing CAT (chloramphenicol tranεferaεe) vectors or other vectors with selectable markers. Two appropriate vectors are PKK232-8 and PCM7. Particular named bacterial promoterε include lad, lacZ, T3, T7, gpt, lambda P R , P L and trp. Eukaryotic promoterε include CMV immediate early, HSV thy idine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the level of ordinary skill in the art.

In a further embodiment, the present invention relates to hoεt cellε containing the above-described constructs. The

host cell can be a higher eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such aε a yeaεt cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE- Dextran mediated transfection, or electroporation. (Davis, L., Dibner, M. , Battey, I., Basic Methods in Molecular Biology, (1986)).

The constructs in host cells can be used in a conventional manner to produce the gene product encoded by the recombinant sequence. Alternatively, the polypeptides of the invention can be synthetically produced by conventional peptide synthesizers.

Mature proteins can be expresεed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate promoters. Cell-free translation syεtems can also be employed to produce such proteins using RNAs derived from the DNA constructs of the present invention. Appropriate cloning and expresεion vectors for use with prokaryotic and eukaryotic hostε are deεcribed by Sambrook, et al.. Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989), the diεcloεure of which is hereby incorporated by reference.

Transcription of the DNA encoding the polypeptides of the present invention by higher eukaryotes is increased by inserting an enhancer εequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp that act on a promoter to increase its transcription. Examples including the SV40 enhancer on the late side of the replication origin bp 100 to 270, a cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers.

Generally, recombinant expression vectors will include origins of replication and selectable markers permitting transformation of the host cell, e.g., the ampicillin

resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expresεed gene to direct transcription of a downstream structural sequence. Such promoters can be derived from operons encoding glycolytic enzymeε such as 3-phosphoglycerate kinase (PGK), α-factor, acid phosphataεe, or heat shock proteins, among others. The heterologous structural sequence is assembled in appropriate phase with translation initiation and termination sequenceε, and preferably, a leader εequence capable of directing secretion of translated protein into the periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a fusion protein including an N-terminal identification peptide imparting deεired characteristics, e.g., stabilization or simplified purification of expressed recombinant product.

Useful expression vectors for bacterial use are constructed by inserting a structural DNA sequence encoding a desired protein together with suitable translation initiation and termination signals in operable reading phase with a functional promoter. The vector will comprise one or more phenotypic selectable markers and an origin of replication to ensure maintenance of the vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hoεts for transformation include E. coli. Bacillus subtilis. Salmonella typhimurium and various species within the genera Pseudomonaε, Streptomyces, and Staphylococcus, although others may also be employed as a matter of choice.

As a representative but nonlimiting example, uεeful expreεεion vectors for bacterial use can comprise a selectable marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements of the well known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEMl

(Promega Biotec, Madison, WI, USA). These pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence to be expressed.

Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density, the selected promoter is induced by appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an additional period.

Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further puri ication.

Microbial cellε employed in expreεεion of proteinε can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents, such methods are well know to those skilled in the art.

Various mammalian cell culture syεtemε can also be employed to express recombinant protein. Examples of mammalian expresεion syεterns include the COS-7 lines of monkey kidney fibroblastε, described by Gluzman, Cell, 23:175 (1981), and other cell lines capable of expresεing a compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines. Mammalian expression vectors will comprise an origin of replication, a suitable promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor and acceptor siteε, tranεcriptional termination εequences, and 5' flanking nontranscribed εequenceε. DNA εequenceε derived from the SV40 splice, and polyadenylation sites may be used to provide the required nontranscribed genetic elementε.

The polypeptides can be recovered and purified from recombinant cell cultures by methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography.

affinity chromatography hydroxylapatite chromatography and lectin chromatography. It is preferred to have low concentrations (approximately 0.15-5 mM) of calcium ion present during purification. (Price et al., J. Biol. Chem. , 244:917 (1969)). Protein refolding stepε can be uεed, as necessary, in completing configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be e.mployed for final purification steps.

The polypeptides of the present invention may be a naturally purified product, or a product of chemical synthetic procedures, or produced by recombinant techniques from a prokaryotic or eukaryotic host (for example, by bacterial, yeast, higher plant, inεect and mammalian cells in culture). Depending upon the host employed in a recombinant production procedure, the polypeptides of the present invention may be glycosylated or may be non-glycosylated. Polypeptides of the invention may also include an initial methionine amino acid residue.

SOD-4 may also be employed as an anti-inflammatory agent. Other SOD proteins have been shown to exhibit an anti-inflammatory affect in a serieε of animal models of inflammation aε well aε in inflammatory diεeaεeε in animals (Huber et al, edε. Michelεon el al. Academic Press, 517-549, (1977). SOD-4 may also be used to treat rheumatoid arthritis and the adverse effects of ionizing radiation since, in humans, positive affects have been shown using SOD proteins to treat rheumatoid arthritis and arthroεes as well as adverse affects of treatment with ionizing radiation. The mechanism by which SOD-4 works iε by removing oxidation productε, which productε cause tiεεue degeneration.

SOD-4 may be uεed to treat Crohn'ε diεease, Bechet's disease, dermatitis, ulcers, ulcerative colitis, and against the adverse effects of radiation therapy. Other SOD proteins have been found to be effective against these conditions (Niwa, Y et al. Free Rad. Res. Commε. 1:137-153 (1985)).

If the supply of blood to a tissue is cut off, the tissue will slowly become necrotic. Oxygen radicals formed as a result of the reappearance of oxygen in previously ischaemic tissue appear to contribute to the damage. Thus the removal of these free radicals by SOD-4 helps to protect tissue against damage. SOD-4 may be employed to reduce the incidence of ischaemia and reperfusion induced arrhythmias by a. similar mechanism, since SOD proteinε have been reported to affect theεe conditions (Woodward, B. et al, J. Mol. Cell. Cardiol. 17:485-493 (1985). In the same manner, SOD-4 may be employed to treat cerebral ischaemia and kidney ischaemia, SOD proteins have been demonstrated to protect tissues in ischaemia or anoxiareperfusion models in the kidney (Baker, G.L., et al.. Am. Surg. , 202:628-41 (1985).

Also, SOD-4 may be employed in connection with kidney transplantations and other organ transplantations such as skin, lung, liver and pancreaε.

SOD-4 may be employed to treat burns. The local oedema after an experimental slight burn in rats could be somewhat decreaεed through injection of SOD proteinε (Bjork and Artursson, Burns, 9:249-256 (1983).

Parenterally administered CuZnSOD haε been reported to prevent bronchopulmonary dyεplasia in preterm neonates suffering from infantile respiratory distresε. The CuZnSOD has recently received orphan drug statuε for thiε treatment. Accordingly, SOD-4 may also be employed to treat these diseases also. (Rosenfeld W. et al, J. Pediatr. 105:781-785 (1984) .

In various types of autoimmune diseases, such aε systemic lupus erythematosus, and rheumatoid arthritis an increased frequency of chromosomal breaks in lymphocyteε has been demonstrated. Plasma from such patients contains a chromosome breaking factor, called clastogenic factor. Superoxide radicals in the plasma results in formation of

this factor. SOD-4 may protect against this clastogenic activity by removing the superoxide radicals.

Superoxide radicals tend to damage cells, DNA and proteins by oxidative stress which may disrupt the normal cell cycle and lead to uncontrolled division of cells which is the basiε of a cancer. Accordingly, SOD-4 can be employed to prevent or control cancer by the removal of superoxide radicals from a patient's system.

Oxygen radicals contribute to the damaging affects of a number of toxic subεtances εuch as paraquat and alloxan. SOD-4 may protect against theεe toxic substances through direct injection.

Alloxan has been reported to have diabetogenic activity. SOD-4 may protect against this diabetogenic activity of alloxan in vivo . Beta-cellε of the pancreaε are extremely εensitive to alloxan, and this sensitivity may lead to insulin-dependent diabetes mellituε. It may therefore be contemplated to protect the Beta cells with injections with SOD-4 at the first onset of diabetes mellituε.

The polypeptideε may also be employed in accordance with the present invention by expression of such polypeptides in vivo , which iε often referred to aε "gene therapy."

Thus, for example, cellε from a patient may be engineered with a polynucleotide (DNA or RNA) encoding a polypeptide ex vivo, with the engineered cellε then being provided to a patient to be treated with the polypeptide. Such methodε are well-known in the art. For example, cellε may be engineered by procedures known in the art by use of a retroviral particle containing RNA encoding a polypeptide of the present invention.

Similarly, cellε may be engineered in vivo for expression of a polypeptide in vivo by, for example, procedureε known in the art. Aε known in the art, a producer cell for producing a retroviral particle containing RNA encoding the polypeptide of the present invention may be

-IB-

administered to a patient for engineering cells in vivo and expression of the polypeptide in vivo . These and other methods for administering a polypeptide of the present invention by such method should be apparent to thoεe εkilled in the art from the teachingε of the preεent invention. For example, the expreεεion vehicle for engineering cells may be other than a retrovirus, for example, an adenoviruε which may be used to engineer cells in vivo after combination with a suitable delivery vehicle.

The polypeptides of the present invention may be employed in combination with a suitable pharmaceutical carrier. Such compoεitionε comprise a therapeutically effective amount of the polypeptide, and a pharmaceutically acceptable carrier or excipient. Such a carrier includes but is not limited to saline, buffered εaline, dextroεe, water, glycerol, ethanol, and combinationε thereof. The formulation should suit the mode of administration.

The invention also provides a pharmaceutical pack or kit comprising one or more containers filled with one or more of the ingredients of the pharmaceutical compositions of the invention. Asεociated with such container(ε) can be a notice in the form prescribed by a governmental agency regulating the manufacture, use or εale of phar aceuticalε or biological productε, which notice reflectε approval by the agency of manufacture, use or εale for human adminiεtration. In addition, the polypeptides of the preεent invention may be employed in conjunction with other therapeutic compoundε.

The pharmaceutical compositions may be administered in a convenient manner such as by the oral, topical, intravenous, intraperitoneal, intramuscular, subcutaneous, intranaεal or intradermal routeε. The amounts and dosage regimens of SOD-4 administered to a εubject will depend on a number of factorε such as the mode of administration, the nature of the condition being treated and the judgment of the preεcribing phyεician. Generally speaking, they are given.

for example, in therapeutically effective doses of at least about 10 μg/kg body weight and in most cases they will be administered in an amount not in excesε of about 8 g/Kg body weight per day and preferably the doεage is from about 10 μg/kg to about 1 mg/kg body weight daily, taking into account the routes of administration, symptoms, etc.

The sequences of the present invention are also valuable for chromoεome identification. The sequence is specifically targeted to and can hybridize with a particular location on an individual human chromosome. Moreover, there is a current need for identifying particular εiteε on the chromoεome. Few chromoεome marking reagents based on actual sequence data (repeat polymorphismε) are preεently available for marking chromosomal location. The mapping of DNAs to chromosomes according to the present invention iε an important firεt εtep in correlating thoεe εequences with genes asεociated with diεease.

Briefly, sequences can be mapped to chromosomeε by preparing PCR primerε (preferably 15-25 bp) from the cDNA. Computer analysis of the cDNA iε uεed to rapidly εelect primers that do not span more than one exon in the genomic DNA, thus complicating the amplification process. These primers are then used for PCR εcreening of εomatic cell hybrids containing individual human chromoεomeε. Only those hybrids containing the human gene corresponding to the primer will yield an amplified fragment.

PCR mapping of εomatic cell hybrids is a rapid procedure for assigning a particular DNA to a particular chromosome. Using the present invention with the same oligonucleotide primers, sublocalization can be achieved with panelε of fragments from specific chromoεomeε or pools of large genomic clones in an analogous manner. Other mapping εtrategieε that can similarly be used to map to its chromoεome include in situ hybridization, preεcreening with labeled flow-εorted

chromosomeε and preεelection by hybridization to construct chromosome specific-cDNA libraries.

Fluorescence in situ hybridization (FISH) of a cDNA clones to a metaphase chromoεomal εpread can be uεed to provide a precise chromosomal location in one step. This technique can be used with cDNA as εhort aε 500 or 600 bases; however, clones larger than 2,000 bp have a higher likelihood o.f binding to a unique chromosomal location with sufficient signal intensity for simple detection. FISH requires use of the clones from which the EST was derived, and the longer the better. For example, 2,000 bp is good, 4,000 is better, and more than 4,000 is probably not necessary to get good results a reasonable percentage of the time. For a review of this technique, see Verma et al.. Human Chromosomeε: a Manual of Basic Techniques, Pergamon Press, New York (1988).

Once a sequence has been mapped to a precise chromoεomal location, the phyεical position of the sequence on the chromosome can be correlated with genetic map data. Such data are found, for example, in V. McKusick, Mendelian Inheritance in Man (available on line through Johns Hopkins University Welch Medical Library) . The relationship between genes and diseases that have been mapped to the same chromosomal region are then identified through linkage analysis (coinheritance of physically adjacent genes).

Next, it is necesεary to determine the differenceε in the cDNA or genomic sequence between affected and unaffected individuals. If a mutation is observed in some or all of the affected individuals but not in any normal individuals, then the mutation is likely to be the causative agent of the disease.

With current resolution of physical mapping and genetic mapping techniques, a cDNA precisely localized to a chromosomal region associated with the disease could be one of between 50 and 500 potential causative genes. (This

assumes 1 megabase mapping resolution and one gene per 20 kb) .

Comparison of affected and unaffected individuals generally involves first looking for εtructural alterations in the chromoεomeε, such aε deletionε or tranεlocations that are visible from chromosome spreads or detectable using PCR based on that cDNA sequence. Ultimately, complete sequencing of genes from several individuals is required to confirm the presence of a mutation and to diεtinguiεh mutationε from polymorphiεmε.

The polypeptides, their fragments or other derivativeε, or analogs thereof, or cells expressing them can be used as an immunogen to produce antibodies thereto. These antibodies can be, for example, polyclonal or monoclonal antibodies. The present invention also includes chimeric, single chain, and humanized antibodies, as well as Fab fragments, or the product of an Fab expresεion library. Variouε procedures known in the art may be used for the production of such antibodies and fragments.

Antibodies generated againεt the polypeptideε corresponding to a sequence of the present invention can be obtained by direct injection of the polypeptides into an animal or by administering the polypeptides to an animal, preferably a nonhuman. The antibody so obtained will then bind the polypeptides itself. In this manner, even a εequence encoding only a fragment of the polypeptideε can be used to generate antibodies binding the whole native polypeptides. Such antibodies can then be used to isolate the polypeptide from tisεue expreεsing that polypeptide.

For preparation of monoclonal antibodies, any technique which provides antibodies produced by continuous cell line cultures can be used. Exampleε include the hybridoma technique (Kohler and Milεtein, 1975, Nature, 256:495-497), the trioma technique, the human B-cell hybridoma technique (Kozbor et al., 1983, Immunology Today 4:72), and the EBV-

hybridoma technique to produce human monoclonal antibodies (Cole, et al., 1985, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96).

Techniqueε deεcribed for the production of εingle chain antibodieε (U.S. Patent 4,946,778) can be adapted to produce εingle chain antibodies to immunogenic polypeptide products of this invention.

In accordance with a further aspect of the present invention, there is provided a process for determining suεceptibility to diεorderε directly related to a mutation in the SOD-4 gene product. Such disorders include but are not limited, amyotrophic lateral sclerosis "ALS", and Parkinson's disease. Thus, a mutation in an SOD-4 protein indicates a susceptibility to these disorders, and the nucleic acid sequenceε encoding an SOD-4 polypeptide may be employed in an assay for ascertaining such susceptibility. Thus, for example, the asεay may be employed to determine a mutation in a human SOD-4 protein as herein described, such as a deletion, truncation, insertion, frameship, etc., with εuch mutation being indicative of a susceptibility to the above- mentioned diseases.

A mutation may be ascertained, for example, by a DNA εequencing aεsay. Tissue samples, including but not limited to blood samples, are obtained from a patient. The sampleε are proceεεed by methodε known in the art to capture the RNA. The firεt εtrand cDNA iε εynthesized from the RNA sampleε by adding an oligonucleotide primer consiεting of polythymidine residues which hybridize to the polyadenosine εtretch present on the mRNA's. Reverse transcriptaεe and deoxynucleotideε are added to allow synthesis of the firεt strand cDNA. Primer sequences are synthesized based on the DNA sequence of the SOD-4 polypeptide of the invention. The primer sequence is generally comprised of 15 to 30 and preferable from 18 to 25 consecutive basis of the SOD-4 gene. The primers are used in pairs (one "sense" and one "anti-sense") to amplify the

cDNA from the patients by the PCR method such that three overlapping fragments of the patients' cDNAs are generated. The overlapping fragments are then subjected to dideoxynucleotide εequencing uεing a εet of primer sequences synthesized to correspond to the base pairs of the cDNAs at a point approximately every 200 baεe pairs throughout the gene. The primer sequenceε are uεed for sequencing to determine where a mutation in the patients' SOD-4 protein may be. The sequence information determined from the patient is then compared to non-mutated εequenceε to determine if any mutationε are preεent.

In another embodiment, the primer εequenceε are uεed in the PCR method to amplify a mutated region. The region could be εequenced and used as a diagnostic tool to predict a predispoεition to εuch mutated geneε.

Alternatively, the aεsay to detect mutations in the SOD- 4 gene may be performed by generating cDNA from the RNA and expresεing the protein encoded by the cDNA by in vitro transcription and translation. The expressed protein may then be analyzed by electrophoresis on an SDS, polyacrylamide or other gel. A "normal" SOD-4 gene product is also electrophoresed on the gel, and the gel iε then dried and subjected to auto-radiography and the suεpected mutated gene product and the "normal" gene product are analyzed and any differences in the banding pattern of such gene products are indicative of a mutation in the cDNA. A mutation in the gene product can also be detected by uεing SOD-4 antibody in a Western Blot analysis. Accordingly, the mutations in the genes of the present invention may be determined directly by sequencing or indirectly by examining an expressed protein.

The present invention will be further described with reference to the following examples; however, it is to be understood that the present invention is not limited to such examples. All parts or amounts, unleεε otherwiεe specified, are by weight.

In order to facilitate understanding of the following examples certain frequently occurring methods and/or terms will be described.

"Plasmids" are deεignated by a lower case p preceded and/or followed by capital letterε and/or numberε. The starting plasmidε herein are either commercially available, publicly available on an unreεtricted basis, or can be cpnstructed from available plaεmids in accord with published procedures. In addition, equivalent plaεmidε to thoεe deεcribed are known in the art and will be apparent to the ordinarily εkilled artiεan.

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction enzyme that acts only at certain sequenceε in the DNA. The various restriction enzymes used herein are commercially available and their reaction conditions, cofactors and other requirements were used as would be known to the ordinarily skilled artisan. For analytical purposeε, typically 1 μg of plaεmid or DNA fragment iε used with about 2 units of enzyme in about 20 μl of buffer solution. For the purpoεe of isolating DNA fragments for plasmid construction, typically 5 to 50 μg of DNA are digested with 20 to 250 units of enzyme in a larger volume. Appropriate buffers and substrate amounts for particular restriction enzymes are specified by the manufacturer. Incubation times of about 1 hour at 37 * C are ordinarily used, but may vary in accordance with the supplier's instructionε. After digeεtion the reaction iε electrophoresed directly on a polyacrylamide gel to isolate the desired fragment.

Size separation of the cleaved fragments is performed uεing 8 percent polyacrylamide gel described by Goeddel, D. et al . , Nucleic Acids Res., 8:4057 (1980).

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two complementary polydeoxynucleotide strandε which may be chemically εyntheεized. Such synthetic

oligonucleotideε have no 5 ' phoεphate and thuε will not ligate to another oligonucleotide without adding a phoεphate with an ATP in the presence of a kinase. A εynthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated.

"Ligation" refers to the process of forming phosphodieεter bonds between two double stranded nucleic acid fragments (Maniatis, T. , et al. , Id., p. 146). Unless otherwise provided, ligation may be accomplished uεing known buffers and conditions with 10 units to T4 DNA ligase ("ligaεe") per 0.5 μg of approximately equimolar amounts of the DNA fragments to be ligated.

Unless otherwise stated, transformation was performed as described in the method of Graham, F. and Van der Eb, A., Virology, 52:456-457 (1973).

Example 1 Bacterial Expression and Purification of SOD-4

The DNA sequence encoding for SOD-4, ATCC # 75716 is initially amplified using PCR oligonucleotide primers correεponding to the 5 ' and 3 ' sequences of the processed SOD-4 protein (minus the εignal peptide εequence) and the vector εequenceε 3' to the SOD-4 gene. Additional nucleotideε corresponding to SOD-4 are added to the 5' and 3 ' εequenceε reεpectively. The 5' oligonucleotide primer haε the εequence 5'- CGGGATCCATGGGCAGCGGCCAGTTG-3 ' and contains a Bam HI restriction enzyme site followed by 18 nucleotides of SOD-4 coding εequence starting from one of the presumed terminal amino acids of the processed protein. The 3' sequence, 5'- CGTCTAGAGGTCCTGCTCAAAGGTGGG-3' contains complementary εequenceε to an Xba I reεtriction εite and the laεt 21 nucleotides of SOD-4 and to a pDIO vector sequence located 3 ' to the SOD-4 DNA insert. The restriction enzyme sites correspond to the restriction enzyme sites on the bacterial expresεion vector pDIO (Qiagen, Inc. 9259 Eton

Avenue, Chatsworth, CA, 91311). pDIO encodes antibiotic resistance (Amp r ) , a bacterial origin of replication (ori), an IPTG-regulatable promoter operator (P/0), a ribosome binding site (RBS), a 6-Hiε tag and reεtriction enzyme sites. pDIO was then digested with Bam HI and Xba I. The amplified εequenceε were ligated into pDIO and were inserted in frame with the sequence encoding for the histidine tag and the RBS. The ligation mixture was then used to transform E. coli strain M15/rep4 available from Qiagen under the trademark M15/rep 4 by the procedure described in Sambrook, J et al.. Molecular Cloning: A Laboratory Manual, Cold Spring Laboratory Press, 1989. M15/rep4 contains multiple copies of the plasmid pREP4, which expresses the la repressor and also confers kanamycin resistance (Kan r ) . Transformants are identified by their ability to grow on LB plates and ampicillin/kanamycin resiεtant colonies were selected. Plasmid DNA was isolated and confirmed by restriction analysiε. Clones containing the desired constructε were grown overnight (0/N) in liquid culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). The O/N culture iε uεed to inoculate a large culture at a ratio of 1:100 to 1:250. The cellε were grown to an optical denεity 600 (O.D. 600 ) of between 0.4 and 0.6. IPTG ( "Iεopropyl-B-D-thiogalacto pyranoεide") waε then added to a final concentration of 1 mM. IPTG induceε by inactivating the lad repressor, clearing the P/O leading to increased gene expresεion. Cells were grown an extra 3 to 4 hours. Cells were then harvested by centrifugation. The cell pellet waε solubilized in the chaotropic agent 6 Molar Guanidine HC1. After clarification, solubilized SOD-4 waε purified from this solution by chromatography on a Nickel-Chelate column under conditionε that allow for tight binding by proteinε containing the 6-Hiε tag. Hochuli, E. et al., J. Chromatography 411:177-184 (1984). Proteinε from different εtageε of purification were εeparated on a 12.5% SDS

polyacrylamide gel and stained with Coomassie blue dye. M represents a molecular sizing marker. Lanes 1 and 2 are total extracts from bacteria containing the vector pDIO in the absence (lane 1) and presence (lane 2) of IPTG. Lanes 3 and 4 are total extracts from bacteria containing the expresεion plasmid pD10-SOD-4 in the absence (lane 3) and presence (lane 4) of IPTG. Lanes 5 through 9 represent eJLution fractions from a Nickel-Chelate column. Lane 5 is flow-through; lanes 6 and 7 represent elution fractions washed with 6 M guanidine HC1, 50 mM NaP0 4 , pH 8 and pH 6; lanes 8 and 9 are elution fractions washed with 6 M guanidine HC1 50 mM NaP0 4 pH 5 and pH 2. See Figure 3.

Example 2 Expression of Recombinant SOD-4 in COS cells

The expresεion of plasmid, pSOD-4-HA is derived from a vector pcDNAl/Amp (Invitrogen) containing: 1) SV40 origin of replication, 2) ampicillin resistance gene, 3) E.coli replication origin, 4) CMV promoter followed by a polylinker region, a SV40 intron and polyadenylation site. A DNA fragment encoding the entire SOD-4 precursor and a HA tag fused in frame to its 3' end was cloned into the polylinker region of the vector, therefore, the recombinant protein expression iε directed under the CMV promoter. The HA tag correspond to an epitope derived from the influenza hemagglutinin protein aε previously described (I. Wilson, H. Niman, R. Heighten, A Cherenεon, M. Connolly, and R. Lerner, 1984, Cell 37, 767). The infusion of HA tag to our target protein allows easy detection of the recombinant protein with an antibody that recognizes the HA epitope.

The plasmid construction strategy is described aε followε:

The DNA εequence encoding for SOD-4, ATCC # 75716 waε constructed by PCR using two primers: the 5' primer sequence 5'-AATTAACCCTCACTAAAGGG-3' in pBluescript vector; the 3'

sequence 5 '-CGCTCTAGATCAAGCGTAGTCTGGGACGTCGTATGGGTAAAGGTGGGCA GGGGGCTG-3 ' contains complementary sequenceε to an Xba I restriction enzyme site, translation εtop codon, HA tag and the last 18 nucleotides of the SOD-4 coding sequence (not including the stop codon). Therefore, the PCR product contains a Bam HI site from the pBluescript vector, SOD-4 coding εequence followed by HA tag fused in frame, a translation termination εtop codon next to the HA tag, and an Xba I εite. The PCR amplified DNA fragment and the vector, pBluescript, were digeεted with Bam HI and Xba I restriction enzymeε and ligated. The ligation mixture waε transformed into E. coli strain SURE (available from Stratagene Cloning Systems, 11099 North Torrey Pines Road, La Jolla, CA 92037) the transformed culture waε plated on ampicillin media plates and resiεtant colonieε were εelected. Plaεmid DNA waε isolated from tranεformantε and examined by restriction analysis for the presence of the correct fragment. For expression of the recombinant SOD-4, COS cells were transfected with the expreεεion vector by DEAE-DEXTRAN method. (J. Sambrook, E. Fritsch, T. Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Laboratory Presε, (1989)). The expression of the SOD-4-HA protein was detected by radiolabelling and immunoprecipitation method. (E. Harlow, D. Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Presε, (1988)). Proteins were labelled for 8 hourε with 3s S-cysteine two days poεt tranεfection. Culture media were then collected and cellε were lysed with detergent (RIPA buffer (150 mM NaCl, 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50mM Triε, pH 7.5). (Wilεon, I. et al.. Id. 37:767 (1984)). 35 S-cysteine labeled proteins from COS cell lysateε and supernatants were immunoprecipitated with an HA polyclonal antibody and separated using 15% SDS-PAGE. M equals molecular weight markerε . Laneε 1 through 4 are cell lyεates . Lanes 5 through 8 are supernatants. Lanes 1 and 5 are mock controlε with no DNA. Laneε 2 and 6 are MIP-I7

-29-

SUBSTITUTE SHEET (RULE 26.

control for secreted proteins. Lanes 3 and 7 are control for cell lyεate and laneε 4 and 8 are SOD-4. See Figure 4.

Numerouε modificationε and variationε of the preεent invention are poεεible in light of the above teachingε and, therefore, within the εcope of the appended claimε, the invention may be practiced otherwiεe than aε particularly deεcribed.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: YU, ET AL.

(ii) TITLE OF INVENTION: Superoxide Dismutase-4

(iii) NUMBER OF SEQUENCES: 2

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN,

CECCHI, STEWART & OLSTEIN

(B) STREET: 6 BECKER FARM ROAD

(C) CITY: ROSELAND

(D) STATE: NEW JERSEY

(E) COUNTRY: USA

(F) ZIP: 07068

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: 3.5 INCH DISKETTE

(B) COMPUTER: IBM PS/2

(C) OPERATING SYSTEM: MS-DOS

(D) SOFTWARE: WORD PERFECT 5.1

(Vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 08/225,757

(B) FILING DATE: 11 APR-94

(C) CLASSIFICATION:

(VU) ATTORNEY/AGENT INFORMATION:

(A) NAME: FERRARO, GREGORY D.

(B) REGISTRATION NUMBER: 36,134

(C) REFERENCE/DOCKET NUMBER: 325800-106

(viii) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 201-994-1700

(B) TELEFAX: 201-994-1744

(2) INFORMATION FOR SEQ ID NO:l:

(i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 1080 BASE PAIRS

(B) TYPE: NUCLEIC ACID

(C) STRANDEDNESS: SINGLE

(D) TOPOLOGY: LINEAR

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l:

CTGGTTGGTG CTCCTGCGCC GGAGGAGTTC TGCGTCTCGG GGTGGTGACT GGGTCCAGAA 60

TGGCTTCGGA TTGGGGAACA GGGGACCCTC TGCACGTTGG AGTTCGCGGT GCAGATGACC 120

TGTCAGAGCT GTGTGGACGC GGTGCGCAAA TCCCTGCAAG GGGTGGCAGG TGTCCAGGAT USD

GTGGAGGTGC ACT GGAGGA CCAGATGGTC TTGGTACACA CCACTCTACC CAGCCAGGAG 240

GTGCAGGCTC TCCTGGAAGG CACGGGGCGG CAGGCGGTAC TCAAGGGCAT GGGCAGCGGC 300

CAGTTGCAGA ATCTGGGGGC AGCAGTGGCC ATCCTGGGGG GGGCTGGCAC CGTGCAGGGG 360

GTGGTGCGCT TCCTACAGCT GACCCCTGAG CGCTGCCTCA TCGAGGGAAC TATTGACGGC 420

CTGGAGCCTG GGCTGCATGG ACTCCACGTC CATCAGTACG GGGACCTTAC AAACAACTGC 490

AACAGCTGTG GGAATCACTT TAACCCTGAT GGAGCATCTC ATGGGGGCCC CCAGGACTCT 540

GACCGGCACC GCGGAGACCT GGGCAATGTC CGTGCTGATG CTGACGGCCG CGCCATCTTC 600

AGAATGGAGG ATGAGCAGCT GAAGGTGTGG GATGTGATTG CCCGCAGCCT GATTATTGAT 660

GAGGGAGAAG ATGACCTGGG CCGGGGAGGC CATCCCTTAT CCAAGATCAC AGGGAACTCC 720

GGGGAGAGGT TGGCCTGTGG CATCATTGCA CGCTCCGCTG GCCTTTTCCA GAACCCCAAG 780

CAGATCTGCT CTTGCGATGG CCTCACCATC TGGGAGGAGC GAGGCCGGCC CATCGCTGGC 840

AAGGGCCGAA AGGAGTCAGC GCAGCCCCCT GCCCACCTTT GAGCAGGACC TCACCTTGGC 900

TCTGTTGCTG TCCTCCAGGG CGAGCACTTT CCACTTCCAG AGGGGGCCAG AGGGACTTTG 960

CCTGCCCAGT CTTTGGAGAG CTCAGTACAG GGCAGGAGCT GCTGTGGTGT TCCCTTGGCA TSD

AATGAAAGTT TTATTTTCGT TTGGGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA Iffi

-32-

SUBSTITUTE SHEET (RULE 26.

(2) INFORMATION FOR SEQ ID NO:2: (i) SEQUENCE CHARACTERISTICS

(A) LENGTH: 255 AMINO ACIDS

(B) TYPE: AMINO ACID

(C) STRANDEDNESS:

(D) TOPOLOGY: LINEAR

_(ii) MOLECULE TYPE: PROTEIN

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Thr Cyε Gin Ser Cys Val Asp Ala Val Arg Lys Ser Leu Gin

5 10 15

Gly Val Ala Gly Val Gin Asp Val Glu Val His Leu Glu Asp Gin

20 25 30

Met Val Leu Val Hiε Thr Thr Leu Pro Ser Gin Glu Val Gin Ala

35 40 45

Leu Leu Glu Gly Thr Gly Arg Gin Ala Val Leu Lyε Gly Met Gly

50 55 60

Ser Gly Gin Leu Gin Asn Leu Gly Ala Ala Val Ala lie Leu Gly

65 70 75

Gly Ala Gly Thr Val Gin Gly Val Val Arg Phe Leu Gin Leu Thr

80 85 90

Pro Glu Arg Cys Leu lie Glu Gly Thr lie Asp Gly Leu Glu Pro

95 100 105

Gly Leu Hiε Gly Leu His Val His Gin Tyr Gly Asp Leu Thr Asn

110 115 120

Asn Cys Asn Ser Cyε Gly Aεn Hiε Phe Aεn Pro Aεp Gly Ala Ser

125 130 135

His Gly Gly Pro Gin Asp Ser Asp Arg His Arg Gly Aεp Leu Gly

140 145 150

Asn Val Arg Ala Aεp Ala Asp Gly Arg Ala lie Phe Arg Met Glu

155 160 165

Asp Glu Gin Leu Lys Val Trp Asp Val lie Ala Arg Ser Leu lie

170 175 180

lie Asp Glu Gly Glu Asp Asp Leu Gly Arg Gly Gly His Pro Leu

185 190 195

Ser Lyε lie Thr Gly Aεn Ser Gly Glu Arg Leu Ala Cys Gly lie

200 205 210 lie Ala Arg Ser Ala Gly Leu Phe Gin Asn Pro Lys Gin lie Cyε

215 220 225

Ser Cys Asp Gly Leu Thr lie Trp Glu Glu Arg Gly Arg Pro lie

230 235 240

Ala Gly Lyε Gly Arg Lyε Glu Ser Ala Gin Pro Pro Ala His Leu

245 250 255