Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
POLYNUCLEOTIDES, POLYPEPTIDES EXPRESSED BY THE POLYNUCLEOTIDES AND METHODS FOR THEIR USE
Document Type and Number:
WIPO Patent Application WO/2001/048192
Kind Code:
A1
Abstract:
Novel polynucleotides including partial and extended sequences, and open reading frames, are provided, together with probes and primers, DNA constructs comprising the polynucleotides, biological materials and organisms incorporating the polynucleotides, polypeptides expressed by the polynucleotides, and methods for using the polynucleotides and polypeptides.

Inventors:
WATSON JAMES D (NZ)
MURISON JAMES GREG (NZ)
Application Number:
PCT/NZ2000/000256
Publication Date:
July 05, 2001
Filing Date:
December 21, 2000
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
GENESIS RES & DEV CORP LTD (NZ)
WATSON JAMES D (NZ)
MURISON JAMES GREG (NZ)
International Classes:
C07K14/47; A61K38/00; (IPC1-7): C12N15/11
Other References:
DATABASE GENBANK [online] 28 July 1999 (1999-07-28), Database accession no. AL034558
DATABASE GENBANK [online] 21 March 1995 (1995-03-21), Database accession no. CAA29045
DATABASE GENBANK [online] 5 December 1998 (1998-12-05), Database accession no. AR018857
DATABASE GENBANK [online] 14 December 1999 (1999-12-14), Database accession no. CAB40181
Attorney, Agent or Firm:
Hawkins, Michael Howard (P.O. Box 852, Wellington, NZ)
Download PDF:
Claims:
We claim:
1. An isolated polynucleotide comprising a sequence selected from the group consisting of : (1) sequences recited in SEQ ID NOS : 135 ; (2) complements of the sequences recited in SEQ ID NOS: 135; (3) reverse complements of the sequences recited in SEQ ID NOS: 135; (4) reverse sequences of the sequences recited in SEQ ID NOS: 135 (5) sequences having at least a 99% probability of being the same as a sequence recited in (1) (4) above as determined using computer algorithm BLASTN; (6) sequences having at least 50% identity to a nucleotide sequence recited in (1) (4) above determined using computer algorithm BLASTN ; (7) sequences having at least 75% identity to a nucleotide sequence recited in (1) (4) above determined using computer algorithm BLASTN ; (8) sequences having at least 90% identity to a nucleotide sequence recited in (1) (4) above determined using computer algorithm BLASTN; (9) sequences having at least 95% identity to a nucleotide sequence recited in (1) (4) above determined using computer algorithm BLASTN; (10) nucleotide sequences that hybridize to a sequence recited in (1) (4) above under stringent hybridization conditions; (11) nucleotide sequences that are 200mers of a sequence recited in (1) (4) above; (12) nucleotide sequences that are 100mers of a sequence recited in (1) (4) above; (13) nucleotide sequences that are 40 mers of a sequence recited in (1) (4) above; (14) nucleotide sequences that are 20mers of a sequence recited in (1) (4) above; and (15) nucleotide sequences that are degeneratively equivalent to a sequence recited in (1) (4) above.
2. An oligonucleotide comprising at least 10 contiguous residues complementary to 10 contiguous residues of a nucleotide sequence recited in claim 1.
3. A genetic construct comprising an isolated polynucleotide of claim 1.
4. A host cell transformed with a genetic construct of claim 3.
5. An isolated polypeptide encoded by a polynucleotide of claim 1.
6. An isolated polypeptide comprising an amino acid sequence selected from the group consisting of : (a) sequences provided in SEQ ID NOS: 3665; (b) sequences having at least a 99% probability of being the same as a sequence of SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP; (c) sequences having at least 50% identity to a sequence provided in SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP; (d) sequences having at least 75% identity to a sequence provided in SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP; (e) sequences having at least 90% identity to a sequence provided in SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP ; and (f) sequences having at least 95% identity to a sequence provided in SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP.
7. An isolated polynucleotide encoding a polypeptide of claim 6.
8. An isolated polypeptide comprising at least a functional portion of a polypeptide comprising an amino acid sequence selected from the group consisting of : (a) sequences provided in SEQ ID NOS: 3665; (b) sequences having at least a 99% probability of being the same as a sequence of SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP; (c) sequences having at least 50% identity to a sequence provided in SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP ; (d) sequences having at least 75% identity to a sequence provided in SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP; (e) sequences having at least 90% identity to a sequence provided in SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP and (f) sequences having at least 95% identity to a sequence provided in SEQ ID NOS: 3665, as determined using the computer algorithm BLASTP.
9. A composition comprising a polypeptide according to any one of claims 6 and 8 and at least one component selected from the group consisting of : physiologically acceptable carriers and immunostimulants.
10. A composition comprising a polynucleotide according to claim 1 and at least one component selected from the group consisting of pharmaceutically acceptable carriers and immunostimulants.
11. A method for treating a disorder in a mammal comprising administering a composition according to claim 9.
12. A method for treating a disorder in a mammal comprising. administering a composition according to claim 10.
13. A diagnostic kit comprising at least one oligonucleotide according to claim 2.
14. An organism comprising a host cell according to claim 4.
Description:
POLYNUCLEOTIDES, POLYPEPTIDES EXPRESSED BY THE POLYNUCLEOTIDES AND METHODS FOR THEIR USE Technical Field of the Invention This invention relates to polynucleotides believed to be novel, including partial, extended and full length sequences, as well as probes and primers, genetic constructs comprising the polynucleotides, biological materials incorporating the polynucleotides, polypeptides expressed by the polynucleotides, and methods for using the polynucleotides and polypeptides.

Background of the Invention Sequencing of the genomes, or portions of the genomes, of numerous biological materials, including humans, animals, microorganisms and various plant varieties, has been and is being carried out on a large scale. Polynucleotides identified using sequencing techniques may be partial or full-length genes, and may contain open reading frames, or portions of open reading frames, that encode polypeptides. Putative polypeptides may be determined based on polynucleotide sequences. The sequencing data relating to polynucleotides thus represents valuable and useful information.

Polynucleotides may be analyzed for various degrees of novelty by comparing identified sequences to sequences published in various public domain databases, such as EMBL. Newly identified polynucleotides and putative' polypeptides may also be compared to polynucleotides and polypeptides contained in public domain information to ascertain homology to known polynucleotides and polypeptides. In this way, the degree of similarity, identity or homology of polynucleotides and polypeptides of unknown function may be determined relative to polynucleotides and polypeptides having known functions.

Information relating to the sequences of isolated polynucleotides may be used in a variety of ways. Specified polynucleotides having a particular sequence may be isolated, or synthesized, for use in in vivo or in vitro experimentation as

probes or primers. Alternatively, collections of sequences of isolated polynucleotides may be stored using magnetic or optical storage medium, and analyzed or manipulated using computer hardware and software, as well as other types of tools.

Summary of the Invention The present invention relates to polynucleotide sequences identified in the attached Sequence Listing as SEQ ID NOS: 1-35, variants of those sequences, extended sequences comprising the sequences set out in SEQ ID NOS: 1-35 and their variants, probes and primers corresponding to the sequences set out in SEQ ID NOS: 1-35 and their variants, polynucleotides comprising at least a specified number of contiguous residues of any of the polynucleotides identified as SEQ ID NOS: 1-35 (x-mers), and extended sequences comprising portions of the sequences set out in SEQ ID NOS: 1-35, all of which are referred to herein, collectively, as"polynucleotides of the present invention." The polynucleotide sequences identified as SEQ ID NOS: 1-35 were derived from mammalian sources, namely, from mouse airways induced eosinophilia, rat dermal papilla and mouse stromal cells. Some of the polynucleotides of the present invention are"partial"sequences, in that they do not represent a full-length gene encoding a full-length polypeptide. Such partial sequences may be extended by further analyzing and sequencing the EST clones from which the sequences were obtained, or by analyzing and sequencing various DNA libraries (e. g. cDNA or genomic) using primers and/or probes and well knows hybridization and/or PCR techniques. The partial sequences identified as' SEQ ID NOS : 1-35 may thus be extended until an open reading frame encoding a polypeptide, a full-length polynucleotide and/or gene capable of expressing a polypeptide, or another useful portion of the genome is identified. Such extended sequences, including full-length polynucleotides and genes, are described as "corresponding to"a sequence identified as one of the sequences of SEQ ID NOS: 1-35 or a variant thereof, or a portion of one of the sequences of SEQ ID NOS: 1- 35 or a variant thereof, when the extended polynucleotide comprises an identified

sequence or its variant, or an identified contiguous portion (x-mer) of one of the sequences of SEQ ID NOS: 1-35 or a variant thereof.

The polynucleotides identified as SEQ ID NOS: 1-35 were isolated from mouse and rat cDNA clones and represent sequences that are expressed in the tissue from which the cDNA was prepared. The sequence information may be used to isolate or synthesize expressible DNA molecules, such as open reading frames or full-length genes, that can then be used as expressible or otherwise functional DNA in transgenic mammals and other organisms. Similarly, RNA sequences, reverse sequences, complementary sequences, anti-sense sequences and the like, corresponding to the polynucleotides of the present invention, may be routinely ascertained and obtained using the cDNA sequences identified as SEQ ID NOS: 1-35.

In a first aspect, the present invention provides isolated polynucleotide sequences comprising a polynucleotide selected from the group consisting of : (a) sequences recited in SEQ ID NO: 1-35; (b) complements of the sequences recited in SEQ ID NO: 1-35; (c) reverse complements of the sequences recited in SEQ ID NO: 1-35; (d) reverse sequences of the sequences recited in SEQ ID NO: 1-35; (e) sequences having either 40%, 60%, 75% or 90% identical nucleotides, as defined herein, to a sequence of (a)- (d); probes and primers corresponding to the sequences set out in SEQ ID NO: 1-35; polynucleotides comprising at least a specified number of contiguous residues of any of the polynucleotides identified as SEQ ID NO : 1-35; and extended sequences comprising portions of the sequences set out in SEQ ID NO: 1-35; all of which are referred to herein as "polynucleotides of the present invention". The present invention also provides isolated polypeptide sequences identified in the attached Sequence Listing as SEQ.

ID NO: 36-65; polypeptide variants of those sequences; and polypeptides comprising the isolated polypeptide sequences and variants of those sequences.

In another aspect, the present invention provides genetic constructs comprising a polynucleotide of the present invention, either alone, or in combination with one or more additional polynucleotides of the present invention,

or in combination with one or more known polynucleotides, together with cells and target organisms comprising such constructs.

The polynucleotides identified as SEQ ID NOS: 1-35 may contain open reading frames ("ORFs") or partial open reading frames encoding polypeptides.

Additionally, open reading frames encoding polypeptides may be identified in extended or full-length sequences corresponding to the sequences set out as SEQ ID NOS: 1-35. Open reading frames may be identified using techniques that are well known in the art. These techniques include, for example, analysis for the location of known start and stop codons, most likely reading frame identification based on codon frequencies, etc. Suitable tools and software for ORF analysis are available, for example, on the Internet at http ://www. ncbi. nlm. nih. gov/gorf/gorf. html. Open reading frames and portions of open reading frames may be identified in the polynucleotides of the present invention. Once a partial open reading frame is identified, the polynucleotide may be extended in the area of the partial open reading frame using techniques that are well known in the art until the polynucleotide for the full open reading frame is identified. Thus, polynucleotides and open reading frames encoding polypeptides may be identified using the polynucleotides of the present invention.

Once open reading frames are identified in the polynucleotides of the present invention, the open reading frames may be isolated and/or synthesized.

Expressible DNA constructs may then be constructed that comprise the open ,.. reading frames and suitable promoters, initiators, terminators, etc., which are well known in the art. Such DNA constructs may be introduced into a host cell to express the polypeptide encoded by the open reading frame. Suitable host cells may include various prokaryotic and eulcaryotic cells.

Polypeptides encoded by the polynucleotides of the present invention may- be expressed and used in various assays to determine their biological activity.

Such polypeptides may be used to raise antibodies, to isolate corresponding interacting proteins or other compounds, and to quantitatively determine levels of interacting proteins or other compounds.

In another aspect, the present invention provides isolated polypeptides' encoded, or partially encoded, by the above polynucleotides. As used herein, the term"polypeptide"encompasses amino acid chains of any length, including full- length proteins, wherein the amino acid residues are linked by covalent peptide bonds. The term"polypeptide encoded by a polynucleotide"as used herein, includes polypeptides encoded by a polynucleotide that comprises an isolated polynucleotide sequence or variant provided herein. Polypeptides of the present invention may be naturally purified products, or may be produced partially or wholly using recombinant techniques. Such polypeptides may be glycosylated with bacterial, fungal, mammalian or other eukaryotic carbohydrates or may be non-glycosylated. In specific embodiments, the inventive polypeptides comprise an amino acid sequence selected from the group consisting of SEQ ID NO: 36-65.

Polypeptides of the present invention may be produced recombinantly by inserting a polynucleotide sequence that encodes the polypeptide into a genetic' construct and expressing the polypeptide in an appropriate host. Any of a variety of genetic constructs known to those of ordinary skill in the art may be employed.

Expression may be achieved in any appropriate host cell that has been transformed or transfected with a genetic construct containing a polynucleotide that encodes a recombinant polypeptide. Suitable host cells include prokaryotes, yeast, and higher eukaryotic cells. Preferably, the host cells employed are Escherichia coli, insect, yeast, or a mammalian cell line such as COS or CHO.

The polynucleotide sequences expressed in this manner may encode naturally occurring polypeptides, portions of naturally occurring polypeptides, or other variants thereof.

In a related aspect, polypeptides are provided that comprise at least a functional portion of a polypeptide having an amino acid sequence encoded by a polynucleotide of the present invention. As used herein, the"functional portion"' of a polypeptide is that portion which contains the active site essential for affecting the function of the polypeptide, for example, the portion of the molecule that is capable of binding one or more reactants. The active site may be made up

of separate portions present on one or more polypeptide chains and will generally exhibit high binding affinity.

Functional portions of a polypeptide may be identified by first preparing fragments of the polypeptide by either chemical or enzymatic digestion of the polypeptide, or by mutation analysis of the polynucleotide that encodes the polypeptide and subsequent expression of the resulting mutant polypeptides. The polypeptide fragments or mutant polypeptides are then tested to determine which portions retain biological activity, using, for example, the representative assays provided below.

Portions and other variants of the inventive polypeptides may also be generated by synthetic or recombinant means. Synthetic polypeptides having fewer than about 100 amino acids, and generally fewer than about 50 amino acids, may be generated using techniques well known to those of ordinary skill in the art. For example, such polypeptides may be synthesized using any of the commercially available solid-phase techniques, such as the Merrifield solid-phase. synthesis method, where amino acids are sequentially added to a growing amino acid chain. See Merrifield, J Am. Chem. Soc. 85: 2149-2154,1963. Equipment for automated synthesis of polypeptides is commercially available from suppliers such as Perkin Elmer/Applied BioSystems, Inc. (Foster City, California), and may be operated according to the manufacturer's instructions. Variants of a native polypeptide may be prepared using standard mutagenesis techniques, such as oligonucleotide-directed, site-specific mutagenesis (dunkel, Proc. Natl. Acad. Sci.

USA 82: 488-492, 1985). Sections of polynucleotide sequence may also be removed using standard techniques to permit preparation of truncated polypeptides.

In general, the polypeptides disclosed herein are prepared in an isolated, substantially pure, form. Preferably, the polypeptides are at least about 80% pure, more preferably at least about 90% pure, and most preferably at least about 99% pure. In certain embodiments, described in detail below, the isolated polypeptides are incorporated into pharmaceutical compositions or vaccines.

The present invention also contemplates methods for modulating the polynucleotide and/or polypeptide content and composition of an organism, such methods involving stably incorporating into the genome of the organism a construct containing DNA of the present invention. In one embodiment, the target organism is a mammal, preferably a human, for example for human gene therapy.

In a related aspect, a method for producing an organism having an altered genotype or phenotype is provided, the method comprising transforming a cell with a DNA construct of the present invention to provide a transgenic cell, and cultivating the transgenic cell under conditions conducive to regeneration and mature organism growth.

The isolated polynucleotides of the present invention have utility in genome mapping, in physical mapping, and in positional cloning of genes.

Additionally, the polynucleotide sequences identified as SEQ ID NOS : 1-35 and their variants may be used to design oligonucleotide probes and primers.

Oligonucleotide probes and primers have sequences that are substantially complementary to the polynucleotide of interest over a certain portion of the polynucleotide. Oligonucleotide probes designed using the polynucleotides of the' present invention may be used to detect the presence and examine the expression patterns of genes in any organism having sufficiently similar DNA and RNA sequences in their cells using teclmiques that are well known in the art, such as slot blot DNA hybridization techniques. Oligonucleotide primers designed using the polynucleotides of the present invention may be used for PCR amplifications.

Oligonucleotide probes and primers designed using the polynucleotides of the present invention may also be used in connection with various microarray technologies, including the microarray technology of Affymetrix (Santa Clara, CA).

The polynucleotides of the present invention may also be used to tag or identify an organism or reproductive material therefrom. Such tagging may be accomplished, for example, by stably introducing a non-disruptive non-functional heterologous polynucleotide identifier into an organism, the polynucleotide' comprising one of the polynucleotides of the present invention.

Detailed Description Polynucleotides were isolated by high throughput sequencing of cDNA libraries prepared from mouse airway-induced eosinophilia, rat dermal papilla and mouse stromal cells as described below, in Example 1. Isolated polynucleotides of the present invention include the polynucleotides identified as SEQ ID NOS: 1-.

35; isolated polynucleotides comprising a polynucleotide sequence selected from the group consisting of SEQ ID NOS: 1-35; isolated polynucleotides comprising at least a specified number of contiguous residues (x-mers) of any of the polynucleotides identified as SEQ ID NOS: 1-35; polynucleotides complementary to any of the above polynucleotides ; anti-sense sequences corresponding to any of the above polynucleotides; and variants of any of the above polynucleotides, as that term is described in this specification. The present invention also provides isolated polypeptide sequences identified in the attached Sequence Listing as SEQ ID NO: 36-65; polypeptide variants of those sequences; and polypeptides comprising the isolated polypeptide sequences and variants of those sequences.

The correspondence of isolated polynucleotides encoding isolated polypeptides of the present invention, and the functionality of the polypeptides, are shown, below, in Table 1.

Table 1 SEQ ID SEQ ID Activity Category Functionality NO Poly-NO Poly- nucleotides peptides 1 36. Secretory molecule Hypothetical 131. 1 lcDa protein 2 37 Secretory ZCYT07 belongs to a family of IL- molecule/cytokine/17-related cytokines differing in cell signaling patterns of expression and proinflammatory responses that may be transduced through a cognate set of cell surface receptors. IL-17 is a T cell-derived cytolcine that may play an important role in the initiation or maintenance of the proinflammatory response. Whereas expression of IL- 17 is restricted to activated T cells, the IL-17 receptor is found to be widely expressed, a finding consistent with the pleiotropic activitiesof IL-17. 3 38 Secretory molecule Novel 4 39 Receptor/cytolcine/Tumor endotlzelial marker 1 cellsignaling precursor 5 40 Secretory molecule ERO1-L (ERO1-like protein) is involved in oxidative endoplasmic reticulum (ER) protein folding in mammalian cells. Oxidizing conditions must be maintained in the ER to allow the formation of disulfide bonds in secretory proteins. A family of conserved genes, termed ERO for ER oxidoreductins, plays a key role in this process. EROS-L is a type II integral membrane protein. 6 41 Secretory molecule Novel 7 42 Receptor/transcriptio EMR2 is an EGF-lilce module that is n factor part of the epidermal growth factor (EGF)-TM7 proteins, which also include EMR1, (EGF-like molecule containing mucin-like hormone receptor 1) F4/80, and CD97. These proteins constitute a recently defined class B GPCR subfamily and are predominantly expressed on leukocytes. These molecules possess N-terminal EGF-lilce domains coupled to a seven-span transmembrane (7TM) moiety via a mucin-like spacer domain. EMR2 contains a total of five tandem EGF- like domains and expresses similar protein isoforms consisting of various numbers of EGF-like domains as a result of alternative RNA splicing. EMR2 share many characteristics with CD97, including highly homologous EGF-like domains and identical gene organization, indicating that both genes are the products of a recent gene duplication event. Both EMR2 and CD97 are highly expressed in immune tissues; however, unlike CD97, which is ubiquitously expressed in most cell types, EMR2 expression is restricted to monocytes, macrophages 8 43 Secretory molecule/Bone/cartilage proteoglycan I (BGN) cell is also Icnown as biglycan or PG-S 1. structure/motility, BGN is found in the extracellular extracellular matrix matrices of several connective tissues, especially in articular cartilages. The two glycosaminoglycan chains attached to BGN can be either chondroitin sulfate or dermatan sulfate. BGN belongs to the small interstitial proteoglycans family. BGN is a small leucine-rich proteoglycan and is a nonfibrillar extracellular matrix component with functions that include the positive regulation of bone formation. It is synthesized as a precursor with an NH (2)-terminal propeptide that is cleaved to yield the mature form found in vertebrate tissues. Bone morphogenetic protein- 1 (BMP-1) cleaves proBGN at a single site, removing the propeptide and producing BGN. Soluble BGN purified from rat thymic myoid cells had hemopoietic activity capable of inducing preferential growth and differentiation of monocytic lineage cells from various hemopoietic sources, including brain microglial cells. The haemopoietic BGN plays an important role in generating brain- specific circumstances for development of microglial/monocytic cells 9 44 Secretory molecule Tubulointerstitial nephritis antigen (TIN-ag) is a basement membrane glycoprotein reactive with autoantibodies in some forms of immunologically mediated human tubulointerstitial nephritis. TIN1 and TIN2 are alternatively spliced products of the TIN-Ag gene. The open reading frames of TIN1 and TIN2 indicates the presence of a signal peptide and putative pre- propeptide and both forms contain putative calcium-binding sites. TIN1 additionally contains a characteristic laminin-lilce epidermal growth factor (EGF) motif and significant homology within the carboxy terminus with the cystine proteinase family of enzymes. The EGF motif bears important similarities in the positions of cystines with two motifs in the propeptide of von Willebrand factor. The EGF motif and part of the region that is homologous with the cystine proteinase family are removed from the TIN2 DNA The rest of the TIN1 and TIN2 sequences are identical. TIN-ag is expressed mainly in the kidney and in the intestinal epithelium. 10 Receptor-like Novel molecule 11 45 Secretory molecule/Toso is a cell surface, specific gene/protein regulator of Fas-induced apoptosis in expression, RNA T cells. Fas is a surface receptor that synthesis, can transmit signals for apoptosis. transcription factors Toso is expressed in lymphoid cells and expression is enhanced after cell- specific activation processes in T cells. Toso appeared limited to inhibition of apoptosis mediated by members of the TNF receptor family and was capable of inhibiting T cell self-killing induced by TCR activation processes that up-regulate Fas ligand. Toso inhibits caspase-8 processing, the most upstream caspase activity in Fas-mediated signaling, potentially through activationof cFLIP. Toso therefore serves as a novel regulator of Fas- mediated apoptosis and may act as a regulator of cell fate in T cells and other hematopoietic lineages. 12 46 Secretory molecule/Surface glycoprotein CD59 is a gene/protein phosphatidyl-inositol-glycan- expression, RNA anchored glycoprotein involved in T- synthesis, cell activation and restriction of transcription factors complement-mediated lysis. It is also lcnown as protectin, and is ubiquitously expressed on benign and malignant cells. CD59 inhibits complement (C)-mediated lysis of target cells by preventing the formation of the membrane attack complex, in the terminal step of C- ctivation. Recent experimental evidence demonstrates that CD59 is the main restriction factor of C- mediated lysis of malignant cells of different histotypes. Additionally, a soluble form of CD59, that retains its anchoring ability and functional properties, has been identified in body fluids and in culture supernatants of different malignant cells. CD59 may protect neoplastic cells from C-mediated lysis, contributing to their escape from innate C-control and to tumor progression. The expression of CD59 by neoplastic cells may contribute to impair the therapeutic efficacy of C-activating monoclonal antibodies (mAb) directed to tumor- associated antigens. CD59 can be utilized to improve the therapeutic efficacy of clinical approaches of humoral immunotherapy with C- activating mAb in human malignancies. 13 47 Secretory Cytochrome B561 (cyb561) is a molecules/cell or secretory vesicle-specific electron organism defense, transport protein unique to homeostasis, neuroendocrine secretory vesicles. It detoxification binds two heme groups non- covalent and is an integral membrane protein. It acts as an electron channel and mediates equilibrationof ascorbate- semidehydroascorbate inside the secretory vesicle with the ascorbate redox pair in the cytoplasm. The role for this function is to regenerate ascorbate inside the secretory vesicle for use by monooxygenases. The secretory vesicles contain catecholamines and amidated peptides. Cyb561 belongs to the eulcaryotic b561 family. 14 48 Secretory molecule Novel 15 49 Receptor-lilce High affinity immunoglobulin molecule/gene or epsilon receptor beta-subunit protein expression, (FCER1) is also knows as IgE Fc RNA synthesis, receptor, beta-subunit, FCERlb or transcription factor FCElb. FCER1 binds to the Fc region of immunoglobulins epsilon and is a high affinity receptor. FCER1 plays a role in initiating the allergic response where binding of allergen to receptor-bound IgE leads to cell activation and the release of mediators, such as histamine. FCERI is responsible for the manifestations of allergy and induces the secretion of important lympholcines. It functions as a tetramer consisting of an alpha chain, a beta chain, and two disulfide-linked gamma chains and is an integral membrane protein. Variants of the FCER1B gene have been identified, which are associated with an increased risk of developing atopy and bronchial asthma. Atopic dermatitis is a common slcin disease frequently associated with allergic disorders such as allergic rhinitis and asthma. 16 50 Receptor-like Hypothetical 10. 3 kDa protein molecule 17 51 Secretory Lysosomal thiol reductase IP30 molecule/antigen catalyzes disulfide bond reduction processing both in vitro and in vivo and is optimally active at acidic pH. IP30 is important in disulfide bond reduction of proteins delivered to MHC class 11-containing compartments and consequently in antigen processing. IP30 can be mediated by multiple lysosomal protease. Proteins internalized into the endocytic pathway are usually degraded. Efficient proteolysis requires denaturation, induced by acidic conditions within lysosomes, and reduction of inter-and intrachain disulfide bonds. The active site, determined by mutagenesis, consists of a pair of cystine residues separated by two amino acids, similar to other enzymes of the thioredoxin family. 18 Receptor-lilce RNA binding protein. molecule 19 52 Secretory Notch4-lilçe protein (ZNEU1) is part molecule/cellular of the NOTCH4 family that encodes receptors responsible for cell fate decisions during development. These Notch receptors and their ligands, Delta and Jagged, have been implicated in several diseases. When truncated, constitutively active mutant forms of the Notch receptor appear to be involved in T-cell leulcemia, mammary carcinomas and a tumorous germline phenotype. Notch4 genes are expressed specifically in endothelial cells. 20 53 Secretory molecule Novel 21 54 Secretory Serotransferrin (siderophilin) (Tf) or molecule/transporter beta-1-metal binding globulin is part of the transferrin family. Transferrins are iron binding transport proteins which can bind two atoms of ferric iron in association with the binding of an anion, usually bicarbonate. Tf is responsible for the transport of iron from sites of absorption and heme degradation to those of storage and utilization. Serum transferrin also a further role in stimulating cell proliferation. Tf gene expression is modulated by vitamin A, testosterone, and peptide hormones. 22 55 Secretory molecule/Insulin-lilce growth factor binding gene or protein protein 5 protease (IGFBP-5) expression, RNA modulates the effects of insulin synthesis, growth factors (IGFs) on cells. transcription factor IGFBP-5 is synthesized by smooth muxcle cells and binds to the extracellular matrix. It is also secreted into conditioned medium of cultured cells and is cleaved into fragments by a concomitantly produced protease. These fragments have reduced affinity for the IGFs. IGFBP-5 protease belongs to a family of serine-metallo proteases. 23 56 Secretory Major epididymis-specific protein E4 molecule/cellular is also known as epididymal protein development BE-20. It belongs to WAP-type'four- disulfide core'family and plays a role in the maturation of spermatozoa during its movement through the epididymis and the capacity of sperm to fertilize ova. Expression of E4 was located to the epithelial cells of the cauda epididymis and proximal segment of the ductus deferens by in situ hybridization. No expression was found in sections of the corpus and caput epididymis, testis, and liver. 24 Secretory TNFR-related death receptor-6 DR6 molecule/cell contains an extracellular region signaling containing varying numbers of cysteine-rich domains and an intracellular region that contains the death domain. Death receptors are activated in a ligand-dependent or independent manner and transduce apoptotic signals via their respective intracellular death domains. 25 57 Receptor-lilce Novel molecule 26 58 Secretory Channel inducing factor precursor molecule/regulation (CHIF) or corticosteroid-induced protein induces a potassium channel when expressed in Xenopus oocytes and activates endogenous oocyte transport proteins. It is a type I membrane protein selectively present in the distal parts of the nephron (medullary and papillary collecting ducts and end portions of cortical collecting tubule) and in the epithelial cells of the distal colon. No expression is found in renal proximal tubule, loop of Henle and distal tubule, proximal colon, small intestine, lung, choroid plexus, salivary glands, or brain. CHIF belongs to the ATP 1 Gl/PLM/Mat- 8 family and exhibits significant homologies with proteins that are putatively regulatory (phospholemman, gamma-subunit of Na (+)-K (+)-ATPase, Mat-8). 27 59 Secretory molecule Hepatocellular carcinoma-associated antigen 112. 28 60 Receptor-lilçe Lymphatic endothelium-specific molecule/homeostasi hyaluronan receptor LYVE-1 is a s major receptor for hyaluronan (HA) on the lymph vessel wall molecule that binds both soluble and immobilized HA. LYVE-1 plays a role in the control of the HA pathway. The extracellular matrix glycosaminoglycan hyaluronan (HA) is an abundant component of slcin and mesenchymal tissues where it facilitates cell migration during wound healing, inflammation, and embryonic morphogenesis. Both during normal tissue homeostasis and particularly after tissue injury, HA is mobilized from these sites through lymphatic vessels to the lymph nodes where it is degraded before entering the circulation for rapid uptalce by the liver. LYVE-1 is similar to the CD44 HA receptor, but in contrast to CD44, LYVE-1 colocalizes with HA on the luminal face of the lymph vessel wall and is completely absent from blood vessels. 29 61 Receptor-like G protein-coupled receptor GPR35 is molecule/cell an integral membrane protein that signaling belongs to family 1 of G-protein coupled receptors (GPRC). The GPCR family shares a structural motif of seven transmembrane segments with large numbers of conserved residues in those regions. 30 62 Receptor-like Tumor-associated glycoprotein E4 is molecule also Icnown as Taal or Tage4 and belongs to the immunoglobulin superfamily. This family contains cell adhesion molecules which have wide-ranging functions and mediate a variety of homotypic and heterotypic cellular interactions playing a general role in cell surface recognition. The Tage4 gene product is closely related to the hepatocellular carcinoma antigen TuAg. l. Tage4 is a glycoprotein expressed at the surface of colon carcinoma cell lines, but at a very low level in normal adult colon and lung tissue and not in normal tissues tested. 31 63 Secretory molecule Hypothetical 28.6 lcDa protein is also known as plunc, for palate, lung, and nasal epithelium clone. Expression of plunc is associated with the palate, nasal septum, and nasal conchae. It is also expressed strongly in the trachea and bronchi of the adult lung. No significant homologies with lcnown genes were observed at the nucleotide level and limited amino acid homology with two salivary gland-specific proteins was noted. The amino acid sequence revealed consensus sequences for N- glycosylation, protein kinase C and casein kinase phosphorylation, as well as a leucine zipper. In addition, an unique amino acid sequence repeat sequence is located near the amino-terminal portion of the protein. 32 64 Secretory molecule Claudin-18 (Cldnl8) is a component of tight junction (TJ) strands and belongs to the claudin family. Claudins are integral membrane protein component of tight junctions, a structure controlling cell-to-cell adhesion and, consequently, regulating paracellular and transcellular transport of solutes across epithelia and endothelia. The claudin family also includes occludin and 17 other distinct claudins. Claudin family members are tetra- span transmembrane proteins that are localized in cell-specific TJs. In multicellular organisms, various compositionally distinct fluid compartments are established by epithelial and endothelial cellular sheets. For these cells to function as barriers, TJs are considered to create a primary barrier for the diffusion of solutes through the paracellular pathway. Claudins are therefore responsible for TJ-specific obliteration of the intercellular space. 33 Secretory molecule Glutamine repeat protein 1 (GRP-1) contains simple tandem repeats of the trinucleotide sequence CAG that encode homopolymeric stretches of glutamine. Although polyglutamine has been identified in diverse proteins, it is present predominantly in transcription factors. Greater than two-thirds ofGRP-1 are only two amino acids, namely glutamine (50%) and histidine (18%). There are four polyglutamine motifs interspersed with histidine-rich regions. There is also a putative nuclear localization signal flanked by sites for possible serine phosphorylation. GRP-1 mRNA was expressed constitutively in some macrophage cell lines and B and T cell lines. Interferon-gamma or lipopolysaccharide augmented GRP- 1 mRNA expression in the mouse macrophage cell line ANA-1. Because polyglutamine motifs can cause protein oligomerization and can function as transcriptional activation domains, GRP-1 is a transcription factor associated with interferon-gamma-or lipopolysaccharide-induced activationof macrophages. 34 Secretory molecule Alpha-1 collagen 35 65 Receptor-like Gdnf family receptor alpha 4, molecule/Cell transmembrane isoform (Gfra4) is a signaling members of the Gdnf protein family that signal through receptors consisting of a GPI-linlced GFRalpha subunit and the transmembrane tyrosine kinase Ret. Gfra4 is expressed in many tissues, including nervous system, in which intron retention leads to a putative intracellular or secreted GFRalpha4 protein. Efficient splicing occurs only in thyroid, parathyroid, and pituitary and less in adrenal glands. A splice form that leads to a GPI- linked GFRalpha4 receptor is expressed in juvenile thyroid and parathyroid glands. In newborn and mature thyroid as well as in parathyroid and pituitary glands major transcripts encode for a putative transmembrane isoform of GFRalpha4. GFRalpha4 expression may restrict the inherited cancer syndrome multiple endocrine neoplasia type 2, associated with mutations in RET, to these cells.

The word"polynucleotide (s)," as used herein, means a polymeric collection of nucleotides and includes DNA and corresponding RNA molecules and both single and double stranded molecules, including HnRNA and mRNA molecules, sense and anti-sense strands of DNA and RNA molecules, and comprehends cDNA, genomic DNA, and wholly or partially synthesized polynucleotides. An HnRNA molecule contains introns and"corresponds to"a DNA molecule in a generally one-to-one manner. An mRNA molecule.

"corresponds to"an HnRNA and DNA molecule from which the introns have been excised. A polynucleotide of the present invention may be an entire gene, or any portion thereof. A gene is a DNA sequence which codes for a functional protein or RNA molecule. Operable anti-sense polynucleotides may comprise a fragment of the corresponding polynucleotide, and the definition of "polynucleotide"therefore includes all operable anti-sense fragments. Anti-sense polynucleotides and techniques involving anti-sense polynucleotides are well known in the art and are described, for example, in Robinson-Benion et al., Methods in Enzymol. 254 (23): 363-375,1995 and Kawasalci et al., Artific. Organs 20 (8): 836-848,1996.

Identification of genomic DNA and heterologous species DNA can be accomplished by standard DNA/DNA hybridization techniques, under appropriately stringent conditions, using all or part of a cDNA sequence as a. probe to screen an appropriate library. Alternatively, PCR techniques using oligonucleotide primers that are designed based on known genomic DNA, cDNA and/or protein sequences can be used to amplify and identify genomic and cDNA sequences. Synthetic DNA corresponding to the identified sequences and variants may be produced by conventional synthesis methods. All of the polynucleotides described herein are isolated and purified, as those terms are commonly used in the art.

As used herein, the term"oligonucleotide"refers to a relatively short segment of a polynucleotide sequence, generally comprising between 6 and 60 nucleotides, and comprehends both probes for use in hybridization assays and primers for use in the amplification of DNA by polymerase chain reaction.

As used herein, the term"x-mer,"with reference to a specific value of"x," refers to a polynucleotide comprising at least a specified number ("x") of contiguous residues of any of the polynucleotides identified as SEQ ID NOS: 1- 35. The value of x may be from about 20 to about 600, depending upon the specific sequence.

As used herein, the term"polypeptide"encompasses amino acid chains of any length, including full-length proteins, wherein amino acid residues are linked by covalent peptide bonds. Polypeptides of the present invention may be naturally purified products, or may be produced partially or wholly using recombinant techniques. Such polypeptides may be glycosylated with mammalian or other eukaryotic carbohydrates or may be non-glycosylated.

According to one embodiment,"variants"of the polynucleotides of the present invention, including the polynucleotides set forth as SEQ ID NOS: 1-35, as that term is used herein, comprehends polynucleotides producing an"E"value of 0.01 or less, as described below, or having at least a specified percentage identity to a polynucleotide of the present invention, as described below.

Polynucleotide variants of the present invention may be naturally occurring allelic variants, or non-naturally occurring variants.

Polynucleotide and polypeptide sequences may be aligned, and percentages of identical residues in a specified region may be determined against another polynucleotide or polypeptide, using computer algorithms that are publicly available. Two exemplary algorithms for aligning and identifying the similarity of polynucleotide sequences are the BLASTN and FASTA algorithms.

Polynucleotides may also be analyzed using the BLASTX algorithm, which compares the six-frame conceptual translation products of a nucleotide query sequence (both strands) against a protein sequence database. The percentage identity of polypeptide sequences may be examined using the BLASTP algorithm.

The BLASTN, BLASTP and BLASTX algorithms are available on the NCBI anonymous FTP server (ftp ://ncbi. nlm. nih. gov) under/blast/executables/and are available from the National Center for Biotechnology Information (NCBI), National Library of Medicine, Building 38A, Room 8N805, Bethesda, MD 20894,

USA. The BLASTN algorithm Version 2.0.11 [Jan-20-2000], set to the parameters described below, is preferred for use in the determination of polynucleotide variants according to the present invention. The BLASTP algorithm, set to the parameters described below, is preferred for use in the determination of polypeptide variants according to the present invention. The use of the BLAST family of algorithms, including BLASTN, BLASTP and BLASTX, is described at NCBI's website at URL http ://www. ncbi. nlm. nih. gov/BLAST/newblast. litml and in the publication of Altschul, et al., Nucleic Acids Res. 25 : 3389-3402,1997.

The FASTA and FASTX algorithms are available on the Internet at the ftp' site ftp ://ftp. virginia. edu/pub/, and from the University of Virginia by contacting David Hudson, Vice Provost for Research, University of Virginia, P. O. Box 9025, Charlottesville, VA 22906-9025, USA. The FASTA algorithm, set to the default parameters described in the documentation and distributed with the algorithm, may be used in the determination of polynucleotide variants. The readme files for FASTA and FASTX Version 1. Ox that are distributed with the algorithms describe the use of the algorithms and describe the default parameters. The use of the FASTA and FASTX algorithms is described in Pearson and Lipman, Proc. Natl.

Acad. Sci. USA 85 : 2444-2448,1988; and Pearson, Methods in Enzynzol.

183: 63-98,1990. The following running parameters are preferred for determination of alignments and similarities using BLASTN that contribute to the E values and percentage identity: Unix running command with default parameter values thus: blastall-p blastn-d embldb-e 10-G 0-E 0-r 1-v 30-b 30-i- queryseq-o results; the Parameters are:-p Program Name [String];-d Database [String];-e Expectation value (E) [Real];-G Cost to open a gap (zero invokes default behavior) [Integer] ;-E Cost to extend a gap (zero invokes default behavior) [Integer];-r Reward for a nucleotide match (BLASTN only) [Integer] ;- v Number of one-line descriptions (V) [Integer];-b Number of alignments to show (B) [Integer];-i Query File [File In];-o BLAST report Output File [File Out] Optional.

The"hits"to one or more database sequences by a queried sequence produced by BLASTN or FASTA or a similar algorithm align and identify similar portions of sequences. The hits are arranged in order of the degree of similarity and the length of sequence overlap. Hits to a database sequence generally represent an overlap over only a fraction of the sequence length of the queried sequence.

The BLASTN and FASTA algorithms produce"Expect"values for alignments. The Expect value (E) indicates the number of hits one can"expect"to see over a certain number of contiguous sequences by chance when searching a database of a certain size. The Expect value is used as a significance threshold for determining whether the hit to a database, such as the preferred EMBL database, indicates true similarity. For example, an E value of 0.1 assigned to a hit is interpreted as meaning that in a database of the size of the EMBL database, one might expect to see 0. 1 matches over the aligned portion of the sequence with a similar score simply by chance. The aligned and matched portions of the sequences, then, have a probability of 90% of being the same by this criterion.

For sequences having an E value of 0.01 or less over aligned and matched portions, the probability of finding a match by chance in the EMBL database is 1% or less using the BLASTN or FASTA algorithm.

According to one embodiment,"variant"polynucleotides, with reference to each of the polynucleotides of the present invention, preferably comprise sequences having the same number or fewer nucleic acids than each of the polynucleotides of the present invention and producing an E value of 0.01 or less when compared to the polynucleotide of the present invention. That is, a variant polynucleotide is any sequence that has at least a 99% probability of being the same as the polynucleotide of the present invention, measured as having an E value of 0.01 or less using the, BLASTN or FASTA algorithms set at the default parameters. According to a preferred embodiment, a variant polynucleotide is a sequence having the same number or fewer nucleic acids than a polynucleotide of the present invention that has at least a 99% probability of being the same as the

polynucleotide of the present invention, measured as having an E value of 0.01 or less using the BLASTN or FASTA algorithms set at the default parameters.

Alternatively, variant polynucleotides of the present invention may comprise a sequence exhibiting at least about 40%, more preferably at least about 60%, more preferably yet at least about 75%, and most preferably at least about 90% similarity to a polynucleotide of the present invention, determined as described below. The percentage similarity is determined by aligning sequences using one of the BLASTN or FASTA algorithms, set at default parameters, and identifying the number of identical nucleic acids over the best aligned portion ; dividing the number of identical nucleic acids by the total number of nucleic acids of the polynucleotide of the present invention ; and then multiplying by 100 to determine the percentage similarity. For example, a polynucleotide of the present invention having 220 nucleic acids has a hit to a polynucleotide sequence in the' EMBL database having 520 nucleic acids over a stretch of 23 nucleotides in the alignment produced by the BLASTN algorithm using the default parameters. The 23 nucleotide hit includes 21 identical nucleotides, one gap and one different nucleotide. The percentage similarity of the polynucleotide of the present invention to the hit in the EMBL library is thus 21/220 times 100, or 9.5%. The polynucleotide sequence in the EMBL database is thus not a variant of a polynucleotide of the present invention.

Alternatively, variant polynucleotides of the present invention hybridize to a polynucleotide of the present invention under stringent hybridization conditions.

As used herein,"stringent conditions"mean prewashing in a solution of 6X SSC, 0.2% SDS ; hybridizing at 65°C, 6X SSC, 0.2% SDS overnight; followed by two washes of 30 minutes each in 1X SSC, 0.1% SDS at 65°C and two washes of 30 minutes each in 0.2X SSC, 0.1% SDS at 65°C.

The present invention also encompasses allelic variants of the disclosed sequences, together with DNA sequences that differ from the disclosed sequences but which, due to the degeneracy of the genetic code, encode a polypeptide which is the same as that encoded by a DNA sequence disclosed herein. Thus, polynucleotides comprising sequences that differ from the polynucleotide

sequences recited in SEQ ID NOS : 1-35, or complements, reverse sequences, or reverse complements of those sequences as a result of conservative substitutions are contemplated by and encompassed within the present invention. Additionally, polynucleotides comprising sequences that differ from the polynucleotide sequences recited in SEQ ID NOS: 1-35, or complements, reverse complements, or reverse sequences as a result of deletions and/or insertions totaling less than 10% of the total sequence length are also contemplated by and encompassed within the present invention.

The polynucleotides of the present invention may be isolated from various DNA libraries, or may be synthesized using techniques that are well known in the art. The polynucleotides may be synthesized, for example, using automated oligonucleotide synthesizers (e. g. Beckman Oligo 1000M DNA Synthesizer) to obtain polynucleotide segments of up to 50 or more nucleic acids. A plurality of such polynucleotide segments may then be ligated using standard DNA- manipulation techniques that are well known in the art of molecular biology. One conventional and exemplary polynucleotide synthesis technique involves synthesis of a single stranded polynucleotide segment having, for example, 80 nucleic acids, and hybridizing that segment to a synthesized complementary 85 nucleic acid segment to produce a 5-nucleotide overhang. The next segment may then be synthesized in a similar fashion, with a 5-nucleotide overhang on the opposite strand. The"sticky"ends ensure proper ligation when the two portions are hybridized. In this way, a complete polynucleotide of the present invention may be synthesized entirely in vitro.

SEQ ID NOS : 2,3,5,7-9,11,12,14,15,17,19-21,23,26,28 and 30-32 are full-length sequences. The remaining polynucleotides are referred to as "partial"sequences, in that they may not represent the full coding portion of a gene encoding a naturally occurring polypeptide. The partial polynucleotide. sequences disclosed herein may be employed to obtain the corresponding full- length genes for various species and organisms by, for example, screening DNA expression libraries using hybridization probes based on the polynucleotides of the present invention, or using PCR amplification with primers based upon the

polynucleotides of the present invention. In this way one can, using methods well Icnown in the art, extend a polynucleotide of the present invention upstream and downstream of the corresponding mRNA, as well as identify the corresponding genomic DNA, including the promoter and enhancer regions, of the complete gene. The present invention thus comprehends isolated polynucleotides comprising a sequence identified in SEQ ID NOS: 1-35, or a variant of one of the specified sequences, that encode a functional polypeptide, including full-length genes. Such extended polynucleotides may have a length of from about 50 to about 4,000 nucleic acids or base pairs, and preferably have a length of less than about 4,000 nucleic acids or base pairs, more preferably yet a length of less than about 3,000 nucleic acids or base pairs, more preferably yet a length of less than about 2,000 nucleic acids or base pairs. Under some circumstances, extended polynucleotides of the present invention may have a length of less than about 1,800 nucleic acids or base pairs, preferably less than about 1,600 nucleic acids or base pairs, more preferably less than about 1,400 nucleic acids or base pairs, more. preferably yet less than about 1,200 nucleic acids or base pairs, and most preferably less than about 1,000 nucleic acids or base pairs.

Polynucleotides of the present invention comprehend polynucleotides comprising at least a specified number of contiguous residues (x-mers) of any of the polynucleotides identified as SEQ ID NOS: 1-35 or their variants. According to preferred embodiments, the value of x is preferably at least 20, more preferably at least 40, more preferably yet at least 60, and most preferably at least 80. Thus, polynucleotides of the present invention include polynucleotides comprising a 20- mer, a 40-mer, a 60-mer, an 80-mer, a 100-mer, a 120-mer, a 150-mer, a 180-mer, a 220-mer a 250-mer, or a 300-mer, 400-mer, 500-mer or 600-mer of a polynucleotide identified as SEQ ID NOS: 1-35 or a variant of one of the polynucleotides identified as SEQ ID NOS: 1-35.

Polynucleotide probes and primers complementary to and/or. corresponding to SEQ ID NOS: 1-35, and variants of those sequences, are also comprehended by the present invention. Such oligonucleotide probes and primers are substantially complementary to the polynucleotide of interest. An

oligonucleotide probe or primer is described as"corresponding to"a polynucleotide of the present invention, including one of the sequences set out as SEQ ID NOS: 1-35 or a variant, if the oligonucleotide probe or primer, or its complement, is contained within one of the sequences set out as SEQ ID NOS: 1- 35 or a variant of one of the specified sequences.

Two single stranded sequences are said to be substantially complementary when the nucleotides of one strand, optimally aligned and compared using, for example, the BLAST algorithm as described above, with the appropriate nucleotide insertions and/or deletions, pair with at least 80%, preferably at least 90% to 95%, and more preferably at least 98% to 100%, of the nucleotides of the other strand. Alternatively, substantial complementarity exists when a first DNA strand will selectively hybridize to a second DNA strand under stringent hybridization conditions. Stringent hybridization conditions for determining complementarity include salt conditions of less than about 1 M, more usually less than about 500 mM and preferably less than about 200 mM. Hybridization temperatures can be as low as 5°C, but are generally greater than about 22°C, more preferably greater than about 30°C and most preferably greater than about 37°C. Longer DNA fragments may require higher hybridization temperatures for specific hybridization. Since the stringency of hybridization may be affected by other factors such as probe composition, presence of organic solvents and extent of base mismatching, the combination of parameters is more important than the absolute measure of any one alone. The DNA from plants or samples or products containing plant material can be either genomic DNA or DNA derived by preparing cDNA from the RNA present in the sample.

In addition to DNA-DNA hybridization, DNA-RNA or RNA-RNA hybridization assays are also possible. In the case of DNA-RNA hybridization, the mRNA from expressed genes would then be detected instead of genomic DNA or cDNA derived from mRNA of the sample. In the case of RNA-RNA hybridization, RNA probes could be used. In addition, artificial analogs of DNA hybridizing specifically to target sequences could also be employed.

In specific embodiments, the oligonucleotide probes and/or primers comprise at least about 6 contiguous residues, more preferably at least about 10 contiguous residues, and most preferably at least about 20 contiguous residues complementary to a polynucleotide sequence of the present invention. Probes and primers of the present invention may be from about 8 to 100 base pairs in length or, preferably from about 10 to 50 base pairs in length or, more preferably from about 15 to 40 base pairs in length. The probes can be easily selected using procedures well lcnown in the art, taking into account DNA-DNA hybridization stringencies, annealing and melting temperatures, potential for formation of loops and other factors, which are well known in the art. Tools and software suitable for designing probes, and especially suitable for designing PCR primers, are available on the Internet, for example, URL http ://www. horizonpress. com/pcr/. Preferred techniques for designing PCR primers are also disclosed in Dieffenbach and Dyksler, PCR primer : a Iclboratory manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1995.

A plurality of oligonucleotide probes or primers corresponding to a polynucleotide of the present invention may be provided in a kit form. Such kits generally comprise multiple DNA or oligonucleotide probes, each probe being specific for a polynucleotide sequence. Kits of the present invention may comprise one or more probes or primers corresponding to a polynucleotide of the present invention, including a polynucleotide sequence identified in SEQ ID NOS: 1-35.

In one embodiment useful for high-throughput assays, the oligonucleotide probe kits of the present invention comprise multiple probes in an array format, wherein each probe is immobilized in a predefined, spatially addressable location on the surface of a solid substrate. Array formats which may be usefully employed in the present invention are disclosed, for example, in U. S. Patents No. 5,412,087, 5,545,531, and PCT Publication No. WO 95/00530, the disclosures of which are hereby incorporated by reference.

Oligonucleotide probes for use in the present invention may be constructed synthetically prior to immobilization on an array, using techniques well knows in

the art (see, for example, Oligonucleotide Synthesis : A Practical Approach, Gait, ed., IRL Press, Oxford, 1984). Automated equipment for the synthesis of oligonucleotides is available commercially from such companies as Perkin Elmer/Applied Biosystems Division (Foster City, CA) and may be operated according to the manufacturer's instructions. Alternatively, the probes may be constructed directly on the surface of the array using techniques taught, for example, in PCT Publication No. WO 95/00530.

The solid substrate and the surface thereof preferably form a rigid support and are generally formed from the same material. Examples of materials from which the solid substrate may be constructed include polymers, plastics, resins, membranes, polysaccharides, silica or silica-based materials, carbon, metals and inorganic glasses. Synthetically prepared probes may be immobilized on the surface of the solid substrate using techniques well known in the art, such as those disclosed in U. S. Patent No. 5,412,087.

In one such technique, compounds having protected functional groups, such as thiols protected with photochemically removable protecting groups, are attached to the surface of the substrate. Selected regions of the surface are then irradiated with a light source, preferably a laser, to provide reactive thiol groups.

This irradiation step is generally performed using a mask having apertures at predefined locations using photolithographic techniques well known in the art of semiconductors. The reactive thiol groups are then incubated with the oligonucleotide probe to be immobilized. The precise conditions for incubation, such as temperature, time and pH, depend on the specific probe and can be easily determined by one of skill in the art. The surface of the substrate is washed free of unbound probe and the irradiation step is repeated using a second mask having a different pattern of apertures. The surface is subsequently incubated with a second, different, probe. Each oligonucleotide probe is typically immobilized in a discrete area of less than about 1 mm2. Preferably each discrete area is less than about 10,000 mm2, more preferably less than about 100 mO12. In this manner, a multitude of oligonucleotide probes may be immobilized at predefined locations on the array.

The resulting array may be employed to screen for differences in organisms or samples or products containing genetic material as follows.

Genomic or cDNA libraries are prepared using techniques well lcnown in the art.

The resulting target DNA is then labeled with a suitable marker, such as a radiolabel, chromophore, fluorophore or chemiluminescent agent, using protocols well known for those skilled in the art. A solution of the labeled target DNA is contacted with the surface of the array and incubated for a suitable period of The surface of the array is then washed free of unbound target DNA and the probes to which the target DNA hybridized are determined by identifying those regions of the array to which the markers are attached. When the marker is a radiolabel, such as 32p, autoradiography is employed as the detection method. In one embodiment, the marker is a fluorophore, such as fluorescein, and the location of bound target DNA is determined by means of fluorescence spectroscopy.

Automated equipment for use in fluorescence scanning of oligonucleotide probe arrays is available from Affymetrix, Inc. (Santa Clara, CA) and may be operated according to the manufacturer's instructions. Such equipment may be employed to determine the intensity of fluorescence at each predefined location on the array, thereby providing a measure of the amount of target DNA bound at each location.

Such an assay would be able to indicate not only the absence and presence of the marker probe in the target, but also the quantitative amount as well.

In this manner, oligonucleotide probe kits of the present invention may be employed to examine the presence/absence (or relative amounts in case of mixtures) of polynucleotides in different samples or products containing different materials rapidly and in a cost-effective manner.

Another aspect of the present invention involves collections of a plurality of polynucleotides of the present invention. A collection of a plurality of the polynucleotides of the present invention, particularly the polynucleotides identified as SEQ ID NOS: 1-35, may be recorded and/or stored on a storage medium and subsequently accessed for purposes of analysis, comparison, etc.

One utility for such sets of sequences is the analysis of the set, either alone or together with other sequences sets, for single nucleotide polymorphisms (SNPs)

between sequences from different tissues and/or individuals for genetic studies, mapping and fingerprinting purposes. Suitable storage media include magnetic media such as magnetic diskettes, magnetic tapes, CD-ROM storage media, optical storage media, and the like. Suitable storage media and methods for recording and storing information, as well as accessing information such as polynucleotide sequences recorded on such media, are well blown in the art. The polynucleotide information stored on the storage medium is preferably computer- readable and may be used for analysis and comparison of the polynucleotide' information.

Another aspect of the present invention thus involves storage medium on which are recorded a collection of the polynucleotides of the present invention, particularly a collection of the polynucleotides identified as SEQ ID NOS: 1-35.

According to one embodiment, the storage medium includes a collection of at least 20, preferably at least 50, more preferably at least 100, and most preferably at least 200 of the polynucleotides of the present invention, preferably the polynucleotides identified as SEQ ID NOS: 1-35, or variants of those polynucleotides.

Another aspect of the present invention involves a combination of polynucleotides, the combination containing at least 5, preferably at least 10, more preferably at least 20, and most preferably at least 50 different polynucleotides of the present invention, including polynucleotides selected from SEQ ID NOS: 1-' 35, or variants of these polynucleotides.

In another aspect, the present invention provides DNA constructs comprising, in the 5'-3'direction, a gene promoter sequence ; an open reading frame coding for at least a functional portion of a polypeptide encoded by a polynucleotide of the present invention ; and a gene termination sequence. The open reading frame may be orientated in either a sense or antisense direction.

DNA constructs comprising a non-coding region of a gene coding for an enzyme encoded by the above DNA sequences or a nucleotide sequence complementary to a non-coding region, together with a gene promoter sequence and a gene termination sequence, are also provided. Preferably, the gene promoter and

termination sequences are functional in a host cell. More preferably, the gene promoter and termination sequences are common to those of the polynucleotide being introduced. Other promoter and termination sequences generally used in the art, such as the Cauliflower Mosaic Virus (CMV) promoter, with or without enhancers, such as the Kozak sequence or Omega enhancer, and Agrobacterium tumefaciens nopalin synthase terminator may be usefully employed in the present invention. Tissue-specific promoters may be employed in order to target expression to one or more desired tissues. The DNA construct may further include a marker for the identification of transformed cells.

Techniques for operatively linking the components of the DNA constructs are well knows in the art and include the use of synthetic linkers containing one or more restriction endonuclease sites as described, for example, by Sambrook et al., Molecular Cloning : a laboratory manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. The DNA constructs of the present invention may be linked to a vector having at least one replication system, for example, Escherichia coli, whereby after each manipulation, the resulting construct can be cloned and sequenced and the correctness of the manipulation determined.

Transgenic cells comprising the DNA constructs of the present invention are provided, together with organisms comprising such transgenic cells.

Techniques for stably incorporating DNA constructs into the genome of target organisms, such as mammals, are well known in the art and include electroporation, protoplast fusion, injection into reproductive organs, injection. into immature embryos, high velocity projectile introduction and the like. The choice of technique will depend upon the target organism to be transformed. In one embodiment, naked DNA is injected or delivered orally. Once the cells are transformed, cells having the DNA construct incorporated in their genome are selected. Transgenic cells may then be cultured in an appropriate medium, using techniques well known in the art.

In yet a further aspect, the present invention provides methods for modifying the level (concentration) or activity of a polypeptide in a host organism, comprising stably incorporating into the genome of the organism a

DNA construct of the present invention. The DNA constructs of the present invention may be used to transform a variety of organisms, including mammals, for example to make experimental gene knock out or transgenic animals.

Further, the polynucleotides of the present invention have particular application for use as non-disruptive tags for marking organisms, including commercially valuable animals, fish, bacteria and yeasts. DNA constructs comprising polynucleotides of the present invention may be stably introduced into an organism as heterologous, non-functional, non-disruptive tags. It is then possible to identify the origin or source of the organism at a later date by determining the presence or absence of the tag (s) in a sample of material.

Detection of the tag (s) may be accomplished using a variety of conventional techniques, and will generally involve the use of nucleic acid probes.

Sensitivity in assaying the presence of probe can be usefully increased by using branched oligonucleotides, as described by Horn et al., Nucleic Acids Res.

25 (23): 4842-4849,1997, enabling to detect as few as 50 DNA molecules in the sample.

In particular, the polynucleotides of the present invention encode polypeptides that have important roles in processes such as induction of growth, differentiation of tissue-specific cells, cell migration, cell proliferation, and cell- cell interaction. These polypeptides are important in the maintenance of tissue integrity, and thus are important in processes such as wound healing. Some of these polypeptides act as modulators of immune responses, such as immunologically active polypeptides for the benefit of offspring. In addition, many polypeptides are immunologically active, making them important therapeutic targets in a whole range of disease states. Antibodies to the polypeptides of the present invention and small molecule inhibitors related to the polypeptides of the present invention may also be used for modulating immune responses and for treatment of diseases according to the present invention.

SEQ ID NOS: 1 ; 2; 4; 5; 6; 8 ; 9; 11; 12; 14; 17; 19-24; 26; 27; 31-34 encode secreted polypeptides. SEQ ID NOS: 10; 15; 16; 18; 25; 28; 30; and 35 encode polypeptides acting as receptors. SEQ ID NOS: 2; 4; 24; 29 and 35

encode polypeptides with cell signaling activity, which may be either intracellular or extracellular. Kinase genes, for example, encode polypeptides that phosphorylate specific substrates during cell-to-cell signaling. While some kinases are involved in normal metabolism and nucleotide production, others are significant for altering the activity of many cellular processes through the phosphorylation of specific proteins. Polypeptides encoded by these genes are important in the transmission of intracellular signals resulting from the binding of extracellular ligands such as hormones, growth factors or cytolcines to membrane- bound receptors. The utility of polynucleotides encoding kinases resides in the manipulation of their signaling activities and downstream effects for the diagnosis and treatment of mammalian diseases that may be a consequence of inappropriate expression of these kinase genes.

SEQ ID NOS: 2 and 4 encode polypeptides with cytokine activity.

Cytokine or growth factor polynucleotides encode polypeptides involved in intercellular signaling and represent another important class of molecules.

Polynucleotides encoding such genes have utility in the diagnosis and treatment of disease.

SEQ ID NOS : 7; 11; 12; 15 and 22 encode polypeptides with transcription factor activity. These polynucleotides encode polypeptides required for the control of synthesis of proteins in tissue specific manner and have utility for the modification of protein synthesis for the control of disease.

SEQ ID NOS : 8 encode polypeptides acting in the extracellular matrix.

SEQ ID NOS: 11; 12 ; 15 and 22 encode polypeptides with RNA synthesis activities.

SEQ ID NO: 12 encodes a polypeptide having CD antigen activity. Such. polynucleotides have utility as modulators of the composition, expression level and class of CD antigen expressed, which influence immune responses to self- antigens, neo-antigens and infectious agents.

Further exemplary specific utilities, for exemplary polynucleotides of the present invention, are specified in the Table below. SEQ ID NO : UTILITY 2 Promoting immune response as part of a vaccine or anti-cancer treatment. Inhibitors of this molecule can be useful as anti- inflammatory treatment, e. g. for autoimmune diseases or allergies. 11 ; 19 Utility as a target for cancer treatment and as an immunoregulatory and anti-inflammatory molecule 12 Diagnostic for specific types of cancer and for development of an anti-cancer treatment. 15 As a target for antagonists in the treatment of diseases such as asthma and allergy. 22 Useful to inhibit or enhance the activity of the soluble molecule that binds this protein. 28 Useful to promote or block cell trafficking and therefore in the treatment as anti- inflammatory and/or vaccine adjuvant where it can promoter inflammation. 35 Useful for tissue and neural regeneration.

The following examples are offered by way of illustration and not by way of limitation.

Example 1 ISOLATION OF CDNA SEQUENCES FROM MAMMALIAN EXPRESSION LIBRARIES The cDNA sequences of the present invention were obtained by high- throughput sequencing of cDNA expression libraries constructed mouse airways- induced eosinophilia, rat dermal papilla and mouse stromal cells. The cDNA libraries were prepared as follows. cDNALibraryftom Dermal Papilla (DEPA) Dermal papilla cells from rat hair vibrissae (whiskers) were grown in culture and the total RNA extracted from these cells using established protocols.

Total RNA, isolated using TRIzol Reagent (BRL Life Technologies, Gaithersburg, MD), was used to obtain mRNA using a Poly (A) Quik mRNA isolation kit (Stratagene, La Jolla, CA), according to the manufacturer's

specifications. A cDNA expression library was then prepared from the mRNA by reverse transcriptase synthesis using a Lambda ZAP cDNA library synthesis kit' (Stratagene). cDNAlibrary fom nzouse aiwway-inca'uced eosinophilia (MALA) Airway eosinophilia were induced in BALB/cByJ mice by administering 2 pg ovalbumin in 2 mg alum adjuvant intraperitoneally on day 0 and day 14, and subsequently 100 pg ovalbumin in 50 ul phosphate buffered saline (PBS) intranasally route on day 28. The accumulated eosinophils in the lungs were detected by washing the airways of the anesthetized mice with saline, collecting the washings (broncheolar lavage or BAL), and counting the numbers of eosinophils. The mice were sacrificed and total RNA was isolated from whole lung tissue using TRIzol Reagent (BRL Life Technologies). mRNA was isolated by using a Poly (A) Quik mRNA isolation lcit (Stratagene, La Jolla, CA), according to the manufacturer's specifications. A cDNA expression library was then prepared from the mRNA by reverse transcriptase synthesis using a Lambda ZAP cDNA library synthesis kit (Stratagene). cDNA Expression Library fi^om Peripheral Lymph Node Stromal Cells (MLSA) The peripheral axillary and brachial lymph nodes of BALB/cByJ mice with the flaky skin (fsn) mutation (Jackson Laboratories, Bar Harbour, MN) were dissected out. Single cell suspensions were obtained from the lymph nodes and cultured in tissue culture flasks at 107 cells/ml in 20% fetal calf serum and Dulbecco's Minimum Essential Medium. After 2 days the non-adherent cells were removed. The adherent cells were cultured for a further 2 days and then treated with 0.25 g/100ml Trypsin (ICN, Aurora, OH) and re-cultured. After a further 4 days, non-adherent cells were discarded and adherent cells removed by trypsinization. Remaining adherent cells were physically removed by scraping' with a rubber policeman. All adherent stromal cells were pooled.

DNA Expression Library f om Flaky skin Iymph node stromal cells in pBK (MLSA) Stromal cells from Flal skin mice lymph nodes were grown in culture and the total RNA extracted from these cells using established protocols. Total RNA, isolated using TRIzol Reagent (BRL Life Technologies, Gaithersburg, MD), was used to obtain mRNA using a Poly (A) Quik mRNA isolation kit (Stratagene, La Jolla, CA), according to the manufacturer's specifications. A cDNA expression library was then prepared from the mRNA by reverse transcriptase synthesis using a Lambda ZAP cDNA library synthesis kit (Stratagene). cDNA sequences were obtained by high-throughput sequencing of the cDNA libraries described above using a Prism 377 sequencer (Perlcin Elmer/Applied Biosystems Division, Foster City CA), and are provided in SEQ ID NO: 1-35, with corresponding polypeptide sequences in SEQ ID NOS: 36-65.

EXAMPLE 2 Analysis of cDNA sequences using BLAST algorithms BLASTN Polynucleotide analysis The isolated cDNA sequences were compared to sequences in the EMBL DNA database using the computer algorithm BLASTN. Comparisons of DNA sequences provided in SEQ ID NOS: 1-35, to sequences in the EMBL DNA database (using BLASTN) were made as of November, 2000, using Version 2.0.11 [Jan-20-2000], and the following Unix running command: blastall-p blastn-d embldb-e 10-GO-EO-r 1-v 30-b 30-i queryseq-o.

The sequences of SEQ ID NOS: 1,3,4,6-11,13,18,21,22,24,25,28- 30,33 and 34 were determined to have less than 50% identity, determined as described above, to sequences in the EMBL database using the computer algorithm BLASTN, as described above. The sequences of SEQ ID NOS: 2,12, 14-16,20 and 35 were determined to have less than 75% identity, determined as described above, to sequences in the EMBL database using the computer

algorithm BLASTN, as described above The sequences of SEQ ID NOS: 17,19, 23 and 27 were determined to have less than 90% identity, determined as described above, to sequences in the EMBL database using the computer algorithm BLASTN, as described above. Finally, the sequences of SEQ ID NOS: 5,26 and 32 were determined to have less than 98% identity, determined as described above, to sequences in the EMBL database using the computer algorithm BLASTN, as described above.

BLASTP Polypeptide analysis The sequences of SEQ ID NOS: 37,41,42,44,46-50,55,56 and 59 were determined to have less than 50% identity, determined as described above, to sequences in the SwissProt database using the computer algorithm BLASTP, as described above. The sequences of SEQ ID NOS: 36,38,43,45 and 60 were determined to have less than 75% identity, determined as described above, to sequences in the SwissProt database using the computer algorithm BLASTP, as described above. The sequences of SEQ ID NOS: 39,54 and 58 were determined to have less than 90% identity, determined as described above, to sequences in the SwissProt database using the computer algorithm BLASTP, as described above.' Finally, the sequences of SEQ ID NOS: 53,57,62 and 65 were determined to have less than 98% identity, determined as described above, to sequences in the SwissProt database using the computer algorithm BLASTP, as described above.

BLASTX Polynucleotide Analysis The sequences of SEQ ID NOS: 2-4, 6-16,18,22-24,26-30 and 33-35 were determined to have less than 50% identity, determined as described above, to sequences in the SwissProt database using the computer algorithm BLASTX, as described above. The sequences of SEQ ID NOS: 1, 19, 20,25 and 32 were determined to have less than 75% identity, determined as described above, to sequences in the SwissProt database using the computer algorithm BLASTX, as described above. Finally, the sequences of SEQ ID NOS : 5,17,21 and 31 were determined to have less than 90% identity, determined as described above, to'

sequences in the SwissProt database using the computer algorithm BLASTX, as described above.

Example 2 ISOLATION AND CHARACTERIZATION OF THE HUMAN HOMOLOG OF MUKS 1 This example demonstrates that an isolated cDNA may be used to isolate its homologue from a different species, the corresponding polypeptide may be expressed and the function of the polypeptide can be determined, starting the whole process from an isolated cDNA obtained as described above.

Analysis ofRNA transcripts by Northern Blotting Northern analysis to determine the size and distribution of mRNA for the clone muKSl (SEQ ID NO: 66; isolated from a mouse keratinocyte stem cell cDNA library using high-throughput sequencing as described above) was performed by probing murine tissue mRNA blots with a probe consisting of nucleotides 268-499 of muKSl, radioactively labeled with [oc32P]-dCTP.

Prehybridization, hybridization, washing and probe labeling were performed as described in Sambrook et al., Ibid. mRNA for muKSl was 1.6 kb in size and was observed to be most abundant in brain, lung, muscle and heart. Expression could also be detected in lower intestine, slcin and kidney. No detectable signal was found in testis, spleen, liver, thymus and stomach.

Human homologue of muKSl MuKSl (SEQ ID NO: 66) was used to search the EMBL database (Release 50 plus updates to June, 1998) to identify human EST homologues. The top three homologies were to the following ESTs : accession numbers AA643952, HS1301003 and AA865643. These showed 92.63% identity over 285 nucleotides, 93.64% over 283 nucleotides and 94.035% over 285 nucleotides, respectively. Frame shifts were identified in AA643952 and HS1301003 when

translated. Combination of all three ESTs identified the human homologue huKSl (SEQ ID NO: 67) and translated polypeptide SEQ ID NO : 67. Alignment of muKS 1 and huKS 1 polypeptides indicated 95% identity over 96 amino acids.

Bacíerial expression a7gd purification of muKSl and huKSl Polynucleotides 269-502 of muKSl (SEQ ID NO: 69), encoding amino acids 23-99 of polypeptide muKSl (SEQ ID NO: 70), and polynucleotides 55-288. of huKSl (SEQ ID NO: 71), encoding amino acids 19-95 of polypeptide huKSl (SEQ ID NO: 72), were cloned into the bacterial expression vector pET-16b (Novagen, Madison, WI), which contains a bacterial leader sequence and N- terminal 6xHistidine tag. These constructs were transformed into competent E. coli BL21 (DE3) (Novagen) as described in Sambrook et al., Ibid.

Starter cultures of recombinant E. coli BL21 (DE3) (Novagen) transformed with bacterial expression vector pET16b containing SEQ ID NO: 69 (muKSla) and SEQ ID NO: 71 (huKSla) were grown in NZY broth containing 100 pg/ml ampicillin (Gibco-BRL Life Technologies) at 37°C. Cultures were spun down and used to inoculate 800 ml of NZY broth and 100 pLg/ml ampicillin. Cultures were grown until the ODsgs of the cells was between 0.4 and 0.8. Bacterial expression was induced for 3 hours with 1 mM IPTG. Bacterial expression produced an induced band of approximately 15 kDa for muKS 1 a and huKSIa.

MuKSla and huKSla were expressed in insoluble inclusion bodies. In order to purify the polypeptides, bacterial cell pellets were re-suspended in lysis buffer (20 mM Tris-HCl pH 8.0,10 mM p-Mercaptoethanol, 1 mM PMSF). To the lysed cells, 1% NP-40 was added and the mix incubated on ice for 10 minutes.

Lysates were further disrupted by sonication on ice at 95 W for 4 x 15 seconds and then centrifuged for 10 minutes at 18,000 rpm to pellet the inclusion bodies.

The pellet containing the inclusion bodies was re-suspended in lysis buffer containing 0.5% w/v CHAPS and sonicated for 5-10 seconds. This mix was stored on ice for 1 hour, centrifuged at 14,000 rpm for 15 minutes at 4°C and the supernatant discarded. The pellet was once more re-suspended in lysis buffer

containing 0.5% w/v CHAPS, sonicated, centrifuged and the supernatant removed as before. The pellet was re-suspended in solubilizing buffer (6 M guanidine HC1, 0.5 M NaCl, 20 mM Tris-HCl pH 8. 0), sonicated at 95 W for 4 x 15 sec and centrifuged for 10 minutes at 18, 000 rpm and 4°C to remove debris. The' supernatant was stored at 4°C. MuKSIa and ImKS 1 a were purified by virtue of the N-terminal 6x histidine tag contained within the bacterial leader sequence, using a Nickel-Chelating sepharose column (Amersham Pharmacia, Uppsala, Sweden) and following the manufacturer's protocol. Proteins were purified twice over the column to reduce endotoxin contamination. In order to re-fold the proteins once purified, the protein solution was dialysed in a 4 M-2 M urea gradient in 20 mM Tris-HCl pH 7.5 containing 10% glycerol overnight at 4°C.

The protein was then further dialysed 2x against 2 litres of 20 mM Tris-HCl pH 7.5 containing 10% glycerol.

Injection of bacterially expressed muKSl a into nude mice Two nude mice were anaesthetised intraperitoneally with 75 al of 1/10' dilution of Hypnorm (Janssen Pharmaceuticals, Buckinghamshire, England) in phosphate buffered saline. 20 pg of bacterially expressed muKS I a (SEQ ID NO: 70) was injected subcutaneously in the left hind foot, ear and left hand side of the back. The same volume of phosphate buffered saline was injected in the same sites but on the right hand side of the same animal. Mice were left for 18 hours and then examined for inflammation. Both mice showed a red swelling in the ear and foot sites injected with the bacterially expressed protein. No obvious inflammation could be identified in either back site. Mice were culled and biopsies taken from the ear, back and foot sites and fixed in 3.7% formol saline.

Biopsies were embedded, sectioned and stained with Haemotoxylin and eosin.

Sites injected with muKSla had a marked increase in polymorphonuclear granulocytes, whereas sites injected with phosphate buffered saline had a low background infiltrate of polymorphonuclear granulocytes.

Chemokines are a large superfamily of highly basic secreted proteins with a broad number of functions (Baggiolini et al., 4nnu. Rev. Immunol. 15: 675-705, 1997 ; Ward et al., Immunity 9 : 1-11,1998; Horuk, Nature 393: 524-525,1998).

The polypeptide sequences of muKSl and huis1 have similarity to CXC chemoldnes, suggesting that this protein will act like other CXC chemolcines. The in vivo data from nude mice supports this hypothesis. This chemokine-lilce protein may therefore be expected to stimulate leukocyte, epithelial, stromal and neuronal cell migration, promote angiogenesis and vascular development, promote neuronal patterning, hematopoietic stem cell mobilization, lceratinocyte and epithelial stem cell patterning and development, activation and proliferation of leukocytes, and promotion of migration in wound healing events. It has recently been shown that receptors to chemokines act as co-receptors for HIV-1 infection of CD4+ cells (Cairns et al., Nature Medicine 4: 563-568, 1998) and that high circulating levels of chemokines can render a degree of immunity to those exposed to the HIV virus (Zagury et al., Proc. Natl. Acad. Sci. USA 95: 3857-3861,1998). This novel gene and its encoded protein may thus be usefully employed as regulators of epithelial, lymphoid, myeloid, stromal and neuronal cells migration and cancers; as agents for the treatment of cancers, neuro-degenerative diseases, inflammatory autoimmune diseases such as psoriasis, asthma and Crohns disease ; for use in wound healing; and as agents for the prevention of HIV-1 binding and infection of leulcocytes.

SEQ ID NOS: 1-72 are set out in the attached Sequence Listing. The codes for nucleotide sequences used in the attached Sequence Listing, including the symbol"n,"conform to WIPO Standard ST. 25 (1998), Appendix 2, Table 1.

All references cited herein, including patent references and non-patent publications, are hereby incorporated by reference in their entireties.

While in the foregoing specification this invention has been described in relation to certain preferred embodiments, and many details have been set forth for purposes of illustration, it will be apparent to those skilled in the art that the invention is susceptible to additional embodiments and that certain of the details

described herein may be varied considerably without departing from the basic principles of the invention.