Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD OF IDENTIFYING A PSYCHOTROPIC AGENT USING DIFFERENTIAL GENE EXPRESSION
Document Type and Number:
WIPO Patent Application WO/2000/037685
Kind Code:
A2
Abstract:
Disclosed are methods of identifying psychotropic agents that do not induce motor side effects using differential gene expression. Also disclosed are novel nucleic acid sequences whose expression is differentially regulated by psychotropic agents.

Inventors:
GOULD-ROTHBERG BONNIE (US)
Application Number:
PCT/US1999/030727
Publication Date:
June 29, 2000
Filing Date:
December 21, 1999
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
CURAGEN CORP (US)
GOULD ROTHBERG BONNIE (US)
International Classes:
A61K31/711; A61K38/00; A61K45/00; A61K48/00; G01N33/50; A61P25/18; C07K14/47; C12N1/15; C12N1/19; C12N1/21; C12N5/10; C12N15/09; C12N15/12; C12Q1/02; C12Q1/68; G01N33/15; G01N33/53; G01N33/566; G01N33/68; (IPC1-7): C12Q1/68; G01N33/50; A61K31/00; C12N15/12
Domestic Patent References:
WO1995013369A11995-05-18
WO1998045436A21998-10-15
Other References:
SHIMKETS R.A. ET AL.,: "Gene expression analysis by transcript profiling coupled to a gene database query" NATURE BIOTECHNOLOGY, vol. 17, - August 1999 (1999-08) pages 798-803, XP002130008 cited in the application
DATABASE EMBL [Online] embl AC#AL031781, 29 September 1998 (1998-09-29) MASHREGHI-MOHAMMADI M.: "human DNA sequence from clone 51J12 on chromosome 6q26-27" XP002139924
Attorney, Agent or Firm:
Elrifi, Ivor R. (Levin Cohn, Ferris, Glovsky & Pope, P.C. One Financial Center Boston MA, US)
Download PDF:
Claims:
What is claimed is :
1. A method of identifying a psychotropic agent that does not induce a significant motor side effect, the method comprising : (a) providing a test cell population comprising a cell capable of expressing one or more genes, wherein each gene is selected from the group consisting of HALO 131 and 32 ; (b) contacting said test cell population with said psychotropic agent ; and (c) comparing the expression of said gene in said test cell population to the expression of said gene in a reference cell population ; wherein an alteration in expression of said gene in the test cell population compared to the expression of said gene in the reference cell population indicates the psychotropic agent does not induce a significant motor side effect.
2. The method of claim 1, wherein said reference cell population is treated with a psychotropic agent which induces a motor side effect.
3. The method of claim 1, wherein said method comprises comparing expression of five or more of said genes.
4. The method of claim 1, wherein said method comprises comparing expression of ten or more of said genes.
5. The method of claim 1, wherein said method comprises comparing expression of 25 or more of said genes.
6. The method of claim 1. wherein said method comprises comparing expression of 30 or more of said genes.
7. The method of claim 1, wherein said method further comprises comparing the expression of a gene selected from the group consisting ofNGFIA, JunB, Synaptophysin. and phosphatidylinositol3kinase.
8. The method of claim 7, wherein said method comprises comparing the expression of three or more genes selected from the group consisting of NGF IA, JunB, Synaptophysin, and phosphatidylinositol3kinase.
9. The method of claim 1, wherein an increase in expression of said gene indicates the psychotropic agent does not induce a significant motor side effect.
10. The method of claim 9, wherein said gene is selected from the group consisting of a HALO 18, 10, 11, 14, 1618, 21, 2427, 2931 and 32 nucleic acid.
11. The method of claim 1, wherein a decrease in expression of said gene in the first subset of cells compared to the second subset of cells indicates psychotropic agent does not induce a significant motor side effect.
12. The method of claim 11, wherein said gene is selected from the group consisting of HALO 9, 12, 13, 19, 20, 22, 23, and 28 nucleic acid.
13. The method of claim 1, wherein said cell population is provided in vitro.
14. The method of claim 1, wherein said cell population is provided ex vivo from a mammalian subject.
15. The method of claim 1, wherein said cell population is derived from a human or rodent subject.
16. The method of claim 1, wherein said cell is provided in vivo in a mammalian subject.
17. The method of claim 1, wherein said cell is a neuronal cell.
18. The method of claim 1, wherein said cell is from brain tissue.
19. The method of claim 18, wherein said cell is from striatum brain tissue.
20. The method of claim 1, wherein said cell is a human cell.
21. The method of claim 1, wherein said control agent is a butyrophenone compound.
22. The method of claim 1, wherein said control agent is selected from the group consisting of droperidol and haloperidol.
23. The method of claim 1, wherein said control agent is haloperidol.
24. The method of claim 1, wherein said control agent is a phenothiazine.
25. The method of claim 1, wherein said control agent is chorpromaine.
26. The method of claim 1, wherein said motor side effect is an extrapyramidal motor pathology.
27. The method of claim 26, wherein said motor side effect is a dystonia.
28. A psychotropic agent identified according to the method of claim 1.
29. A pharmaceutical composition comprising the psychotropic agent of claim 28.
30. A method of selecting a psychotropic agent appropriate for a particular subject, the method comprising : (a) providing from said subject a cell population comprising a cell capable of expressing one or more genes, wherein each gene is selected from the group consisting of HALO 131 and 32 ; (c) contacting said cell population with said psychotropic agent ; and (c) comparing the expression of said gene to the expression of said gene in a reference cell population ; wherein an alteration in expression of said gene in the test cell population compared to the expression of said gene in the reference cell population indicates the psychotropic agent is appropriate for said subject.
31. The method of claim 30, wherein said subject is a human.
32. The method of claim 30, wherein said appropriate psychotropic agent does not induce a significant motor defect in said subject.
33. A method of diagnosing or determining susceptibility to a movement disorder in a subject, the method comprising : (a) providing from said subject a cell population comprising a cell capable of expressing one or more genes, wherein said gene is selected from the group consisting of HALO 131 and 32 ; and (b) comparing the expression of said gene to the expression of said gene in a reference cell population comprising cells from a subject not suffering from a movement disorder ; wherein an alteration in expression of said gene in the test cell population compared to the expression of said gene in the reference cell population indicates subject has or is susceptible to a movement disorder.
34. A method of diagnosing or determining susceptibility to a movement disorder in a subject, the method comprising : (a) providing from said subject a test cell population comprising a cell capable of expressing one or more genes, wherein each gene is selected from the group consisting of a quaking gene and a gene encoding VI protein ; and (b) comparing the expression of said gene in said cell population to the expression of said gene in a reference cell population comprising cells from a subject not suffering from a movement disorder ; wherein an alteration in expression of said gene in the test cell population compared to the expression of said gene in the reference cell population indicates subject has or is susceptible to a movement disorder.
35. The method of claim 34, wherein said gene is a human quaking gene.
36. The method of claim 34, wherein said human quaking gene is a human Qk5 quaking gene.
37. The method of claim 34, wherein said gene is human Qk7 quaking gene.
38. The method of claim 34, wherein said gene is a V I gene.
39. The method of claim 34, wherein said alteration in expression is an increase in expression of said quaking gene.
40. The method of claim 39, wherein said alteration in expression is a decrease in expression of said quaking gene.
41. A method of preventing or delaying the onset of a motor pathology in a subject, the method comprising administering to said subject an agent which increases the expression or activity of a gene selected from the group consisting of HALO 18, 10, 11, 14, 1618, 21, 2427, 2931 and 32.
42. The method of claim 41, wherein said motor pathology is associated with administration of a psychoactive agent to said subject.
43. The method of claim 41, wherein said subject is a human.
44. The method of claim 41, wherein said motor pathology is a dystonia.
45. The method of claim 41, wherein said agent increases the expression or activity of a human quaking gene.
46. The method of claim 45, wherein said agent is a human Qk5 nucleic acid or human Qk7 nucleic acid.
47. The method of claim 45, wherein said agent is a human Qk5 or human Qk7 polypeptide, or an agonist thereof.
48. A method of preventing or delaying the onset of a motor pathology in a subject, the method comprising administering to said subject an agent which increases the expression or activity of a gene selected from the group consisting of HALO 9, 12, 13, 19, 20, 22, 23, 28, and HALO 34..
49. The method of claim 48, wherein said motor pathology is associated with administration of a psychoactive agent to said subject.
50. The method of claim 48, wherein said subject is a human.
51. The method of claim 48, wherein said motor pathology is a dystonia.
52. A method of preventing or delaying the onset of a motor pathology in a subject, the method comprising administering to said subject an agent which alters the expression of a quaking gene or a gene encoding a VI polypeptide..
53. The method of claim 52, wherein said gene is a human quaking gene.
54. The method of claim 52, wherein said human quaking gene is a human Qk5 quaking gene.
55. The method of claim 52, wherein said gene is a human Qk7 quaking gene.
56. The method of claim 52, wherein said gene is a V I gene.
57. The method of claim 52. wherein said alteration in expression is an increase in expression of said quaking gene.
58. The method of claim 52, wherein said alteration in expression is a decrease in expression of said quaking gene.
59. A method of identifying a base occupying a polymorphic site in a nucleic acid, the method comprising : (a) obtaining nucleic acid from a subject ; and (b) determining at least a portion of a region of nucleotide sequence corresponding to a contiguous region of any one HALOX nucleotide sequence listed in Table 1 ; (c) comparing the determined nucleotide sequence to a reference sequence of said nucleic acid ; and (d) identifying a difference in the determined nucleic acid sequence relative to the reference sequence, wherein a difference in the determined nucleic acid sequence indicates a polymorphic site in said nucleic acid.
60. The method of claim 59, wherein said subject suffers from, or is at risk for, a psychiatric disorder or a movement disorder.
61. The method of claim 60, wherein the presence of the polymorphic site is correlated with the presence of the psychiatric disorder or movement disorder.
62. The method of claim 59* wherein said nucleic acid is genomic DNA.
63. The method of claim 59, wherein said nucleic acid is cDNA.
64. A nucleic acid sequence 20100 nucleotides in length comprising the polymorphic site identified in the method of claim 59.
65. The method of claim 59, wherein the nucleic acid is obtained from a plurality of subjects, and a base occupying one of the polymorphic sites is determined in each of the subjects.
66. An isolated nucleic acid comprising a nucleic acid which includes a nucleic acid sequence selected from the group consisting of a HALO : 119 gene, or its complement.
67. A vector comprising the nucleic acid of claim 66.
68. A cell comprising the vector of claim 67.
69. A pharmaceutical composition comprising the nucleic acid of claim 66.
70. A polypeptide encoded by the nucleic acid of claim 66.
71. An isolated nucleic acid comprising a nucleic acid which includes a nucleic acid selected from the group consisting of a human Qk5 nucleic acid (SEQ ID NO : 13) and a human Qk7 nucleic acid (SEQ ID NO : 15), or its complement.
72. The nucleic acid of claim 71, wherein said nucleic acid includes a nucleic acid comprising said human Qk5 nucleic acid (SEQ ID NO : 13), or its complement.
73. The nucleic acid of claim 71, wherein said nucleic acid includes a nucleic acid comprising said human Qk7 nucleic acid (SEQ ID NO : 15), or its complement.
74. A polypeptide encoded by the nucleic acid of claim 71.
75. A polypeptide encoded by the nucleic acid of claim 72. 76.
76. A polypeptide encoded by the nucleic acid of claim 73, wherein said polypeptide contains one or more posttranslational modifications of a human quaking protein.
77. A pharmaceutical composition comprising the polypeptide of claim 74.
78. A pharmaceutical composition comprising the polypeptide of claim 75.
79. An antibody which specifically binds to the polypeptide of claim 74.
Description:
METHOD OF IDENTIFYING A PSYCHOTROPIC AGENT USING DIFFERENTIAL GENE EXPRESSION RELATED APPLICATIONS This invention claims priority to USSN 60/113, 127, filed December 21, 1998 and USSN unknown filed December 20, 1999. The contents of the application are incorporated by reference in their entirety.

FIELD OF THE INVENTION The invention relates generally to nucleic acids and polypeptides and in particular to the identification of psychotropic agents using differential gene expression.

BACKGROUND OF THE INVENTION Neuroleptics are agents that are used to treat psychotic disorders such as schizophrenia.

They can cause side effects that cause disruptions of the motor system.

. In humans, they further reduce initiative and interest in environmental stimuli, and suppress manifestations of emotion. An important neuroleptic agent is haloperidol, a member of the butyrophenone (phenylbutylpiperidine) class of heterocyclic antipsychotic agents used in the treatment of schizophrenia. Other members of the butyrophenone class include droperidol, a short-acting highly sedative compound used for anaesthesia induction and pimozide, a potent neuroleptic with prolonged action used to prevent involuntary vocalizations of Tourette's Syndrome. The butyrophenone antipsychotics have been demonstrated to have selective D2 dopaminergic receptor antagonism. (Goodman & Gilman's The Pharmacological Basis of Therapeutics, Ninth Edition, Hardman, JG et al. (eds), McGraw-Hill, New York, 1996, p. 406) Additionally, haloperidol has also been shown to have binding activity with sigma receptors (Seth et al, J Neurochem 70).

In the psychotic patient, following several days of neuroleptic administration,"positive symptoms"such as agitation, hallucinations, delusions, disorganized thought tend to disappear and there are some effects on"negative symptoms"as withdrawn or autistic patients can sometimes become more communicative. Overall, however, haloperidol and its chemical

relatives are most noted for their treatment of"positive symptoms"and have little effect on most catatonic patients. (Goodman & Gilman, Ninth Edition, p. 407) Dosing of haloperidol typically requires a 10 mg-16 mg loading dose followed by maintenance therapy of 12 mg-30 mg per day in divided doses. Dosing is individualized to allow patients to take the minimally necessary dose that alleviates symptoms (Harrison's Principles of Internal Medicine, 13th ed., Fauci, AS et al. (eds.), McGraw-Hill, New York, 1994, p. 2418). Because psychotic disorders are chronic diseases, and controlled studies have demonstrated relapses within 6 months in 60% of all patients, sustained therapy is recommended.

A prevalent side effect of both butyrophenone and phenothiazine (e. g. chlorpromazine) neuroleptics is the induction of extrapyramidal motor pathology. Extrapyramidal symptoms include parkinsonism, akathisia, dystonia and tardive dyskinesia. Such symptoms are apparent with both acute and chronic administration of neuroleptic drugs (Gill, HS et al., J. Clin.

Psychopharm. 17 (5) : 377-389 (1997)). Dystonias typically appear within the first few days of therapy. These can manifest as either Parkinsonian-like tremors or as uncontrollable, spastic muscle contractions that produced abnormal postures. Dystonic movements are typically slow, writhing movements that are transiently sustained. Ones that affect the eye muscles can be particularly disturbing as the patient loses ability to focus visually. In most patients coadministration of haloperidol with benzotropine or trihexyphenidyl (two anti-muscarinic agents) can reduce or alleviate the dystonic and Parkinsonian manifestations. Sustained, chronic use can induce tardive dyskinesia, a broad spectrum of hyperkinesias associated with exposure to neuroleptic drugs within 6 months of the onset of symptoms (although the patient has probably been on the drug for several years) which persists for 1 month after discontinuation of the neuroleptic agent. The most common movement manifestations of tardive dyskinesia involve repeated tongue protrusions and lip smacking. About 30% of all patients exposed to neuroleptic therapy develop some form of persistent movement disorder.

Development of extrapyramidal symptoms, and especially tardive dyskinesia as a consequence of long-term neuroleptic administration has been recognized for almost 4 decades.

Tardive dyskinesia remains the most feared and disconcerting extrapyramidal side-effect of chronic treatment (Walters, VL et al., Schizophrenia Res. 28 : 231-246 (1997)). At the present time, therefore, prevention is best accomplished by intervening prior to the development of extrapyramidal symptoms (Walters et al. (1997)). Alternatively, although a variety of treatment therapies have been attempted in the treatment of tardive dyskinesia, none has become manifest as being successful in most patients (Egan, MF et al., Schizophrenia Bull. 23 (4) : 583-609 (1997)).

From the above description of the manifestations of tardive dyskinesia and related motor dyskinesias, it is apparent that there is a compelling need for identifying alternative neuroleptic agents whose beneficial effects in the treatment of schizophrenia remain essentially undiminished from those in use currently, but which do not induce the symptoms of tardive dyskinesia. There further is a need for developing methods useful in screening pharmaceutical agents that are potential or candidate neuroleptics for their avoidance of the development of tardive dyskinesia. There is additionally a need for identifying molecular and cell biological bases for carrying out such methods. The present invention recognizes these deficiencies, and addresses their resolution.

SUMMARY OF THE INVENTION The invention is based in part on the discovery that certain genes are differentially expressed in the brain striatum regions of animals treated with therapeutic levels of the common neuroleptic, haloperidol. These differentially expressed genes include novel and genes that, while previously described, have not heretofore been identified as haloperidol responsive.

Identification of the differentially expressed genes or gene fragments permits their use in identifying patterns of gene expression that produce the effects of tardive dystonia and similar dyskinesias when previously uncharacterized candidate neuroleptics are administered to a test system or to a test animal. Thus, the discovery allows for the identification of psychoactive agents, e. g. neuroleptic agents, which do not produce a pattern of differential gene expression characteristic of tardive dystonia and similar dyskinesias.

In various aspects, the invention includes methods of a method of identifying psychotropic agents, methods of diagnosing movement disorders, and methods of treating movement disorders. For example, in one aspect, the invention provides a method of identifying a psychotropic agent that does not induce a significant motor side effect by providing a test cell population comprising a cell capable of expressing one or more genes responsive to haloperidol, contacting the test cell population with the psychotropic agent ; and comparing the expression of the gene in the test cell population to the expression of the gene in a reference cell population.

An alteration in expression of the gene in the test cell population compared to the expression of the gene in the reference cell population indicates the psychotropic agent does not induce a significant motor side effect.

The invention in a further aspect includes a method of selecting a psychotropic agent appropriate for a particular subject. The method includes providing from the subject a cell population comprising a cell capable of expressing one or more genes one or more genes responsive to haloperidol, contacting the cell population with the psychotropic agent, and comparing the expression of the gene to the expression of the gene in a reference cell population.

An alteration in expression of the gene in the test cell population compared to the expression of the gene in the reference cell population indicates the psychotropic agent is appropriate for the subject.

In a further aspect, the invention provides a method of diagnosing or determining susceptibility to a movement disorder in a subject. The method includes providing from the subject a cell population comprising a cell capable of expressing one or more haloperidol- responsive genes, and comparing the expression of the gene to the expression of the gene in a reference cell population that includes cells from a subject not suffering from a movement disorder. An alteration in expression of the gene in the test cell population compared to the expression of the gene in the reference cell population indicates subject has or is susceptible to a movement disorder.

Also provided are novel nucleic acids, as well as their encoded polypeptides, whose expression is responsive to the effects of haloperidol. Included are nucleic acids encoding two full-length human quaking homologs were identified. They are named human Qk5 and Qk7, for quaking splice variant 5, and human quaking splice variant 7, respectively.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In the case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.

Other features and advantages of the invention will be apparent from the following detailed description and claims.

BRIEF DESCRIPTION OF THE DRAWINGS Fig. 1 is a representation of the nucleic acid sequence (SEQ ID NO : 13) and the encoded amino acid sequence (SEQ ID NO : 14) of the human Qk5 isoform..

Fig. 2 is a representation of the nucleic acid sequence (SEQ ID NO : 15) and the encoded amino acid sequence (SEQ ID NO : 16) of the human Qk7 isoform.

Fig. 3 is a representation of regions of sequence homologies between murine quaking sequences and the human Qk5 sequences.

Fig. 4 is a representation of regions of sequence homologies between murine quaking sequences and the human Qk7sequences.

DETAILED DESCRIPTION OF THE INVENTION The present invention is based in part on the discovery of changes in expression patterns of multiple nucleic acid sequences in the striatum of the brain in animals treated with therapeutic levels of the neuroleptic haloperidol. Changes in expression are observed in both heretofore undescribed nucleic acid sequences and previously identified nucleic acids. The discovery provides the basis for methods of screening for pharmacological agents which exhibit anti- psychotropic properties but which do not induce the changes in gene expression associated with haloperidol.

Rats treated with haloperidol for 28 days manifest motor disturbances that parallel human pathology, suggesting the usefulness of this treatment as a model for neuroleptic-induced motor disease. Accordingly, the differentially expressed genes were identified by treating Wistar rats with haloperidol (0. 041 mg/kg/d for 3 days via continuous infusion Alza pump), or rats treated with vehicle only for 3 days..

Rats were then sacrificed, their brains were removed, and total RNA was recovered from the microdissected striatum. cDNA was prepared and the resulting samples were processed through 140 subsequences of differential expression analysis as described in U. S. Patent No. 5, 871, 697 and in Shimkets et al., Nature Biotechnology 17 : 198-803 (1999).

101 gene fragments were initially found to be differentially expressed in rat striatum in response to haloperidol. The differential expression of 50 of the gene fragments was confirmed using a unlabeled oligonucleotide competition assay as described in Shimkets et al., Nature Biotechnology 17 : 198-803. 32 single copy nucleic acid sequences genes and 5 repetitive copy nucleic acid sequences differentially expressed in haloperidol and vehicle treated striatum were

selected for further analysis elements. The 32 single nucleic acid sequences identified herein, as well as 4 sequences previously reported to demonstrate haloperidol-responsive gene expression, are referred to herein as HALOX, to denote that they are haloperidol-responsive sequences.

A summary of the sequences analyzed is presented in Table 1. For five of the nucleic acids, designated (HALO1-5), no homology was found to nucleic acid sequences in public databases. Thus, these represent novel gene fragments in rat.

13 sequences (HALO : 6-18) represent novel rat genes for which the sequence identity to sequences found in public databases is either high (i. e., >-85%, observed for 7 fragments), or moderate (i. e., between about 65% and about 80%, observed for 6 genes) suggesting a putative homology.

14 sequences (HALO 19-32) were previously described but have not previously been recognized as being differentially expressed as part of a haloperidol response in the striatum. Of these 14 genes, three genes (quaking, 2'-3'cyclic nucleotide phosphodiesterase 11 and V-1 protein) have significant relationship to the regulation of myelin formation. Thus, they may be relevant to the onset of dystonic reactions associated with haloperidol. Five genes (neurogranin, Ca+2 ATPase, ankyrin isoform, rab 5c-like protein and synaptophysin) have significant relationship to synaptic vesicle release ; and 6 genes (phosphatidyl inositol 3 kinase, inositol 1, 4, 5 triphosphate 3 kinase, NGFI-A, Jun B, Meis 2 and NGFI-B) are directly involved in signal transduction.

Without wishing to be bound by theory, the present inventor believes that motor dystonias, such as that manifested in Parkinsonism, and presumably that found in tardive dyskinesia as well, is due to a dysfunction in certain areas of the brain. In particular, it is believed that the substantia nigra releases dopamine, which is detected at the synapse by neurons whose axons reach the brain striatum. It is believed that neuroleptics achieve their effects by simulating dopamine reception, which helps alleviate Parkinson-like symptoms. Further without wishing to be bound by theory, the present inventor understands that mice that lack the gene termed quaking exhibit tremor and epileptic-type symptoms. The inventor further hypothesizes that normal mammals such as humans and rats are endowed with the ortholog of the quaking gene in the fetal and neonatal stages of life, and that they may experience down-regulation of this, and related genes, as a result of the administration of neuroleptics such as haloperidol. As a further hypothesis, the present inventor believes that the transient reduction or elimination of expression of genes such as quaking is responsible for development of tardive dyskinesia and related pathologies.

For some of the novel sequences (i. e., HALO 1-18), a cloned sequence is provided along with one or more additional sequence fragments (e. g., ESTs or contigs) which contain sequences substantially identical to, the cloned sequence. Also provided is a consensus sequences which includes a composite sequence assembled from the cloned and additional fragments. For a given HALO sequence, its expression can be measured using any of the associated nucleic acid sequences may be used in the methods described herein. For previously described sequences (HAL019-36) database accession numbers are provided. This information allows for one of ordinary skill in the art to deduce information necessary for detecting and measuring expression of the HALO nucleic acid sequences.

The haloperidol-responsive nucleic acids discussed herein include the following:<BR> Table 1 Description of Sequence Contirmed Bands Sequence Database Haloperidol HALOX Reference Effect on Assignment Transcript Level Haloperidol Responsive Novel Nucleic Acid Sequences Novel gene fragment, 86 bp r0w0_85.8 -- -2.9 HALO1 Novel gene fragment, 121 bp r0j0_120.6 -- -2 HALO2 Rat novel gene fragment, 495 bp y0p0_314.6 -- -2.5 HALO3 Novel gene fragment, 649 bp mls032.4 -- -3 HALO4 Novel gene fragment, 539 bp gln0_114.5 -- -1.5 HALO5 Novel gene fragment, 262 bp mls0_354.3 U44940 -2 HALO6 95% similarity to mouse quaking gene type 1 [U44940] Novel gene fragment, 420 bp S0y0_330.3 -1.7 HALO7 Nucleotides 266-31 are 86% similar to M57936 human U1 soRNP 70 kDa protein [M57936 Novel gene fragment, 179 bp l0t0_177.6 U03279 -2 HALO8 91% similar to mouse phosphatidylinositol 3 kinase 1110kD subunit [U03279] Novel gene fragment, 94 bp i0s0_93.4 U57343 +1.6 HALO9 98% similar to mouse meis2 [U57343] subfamily Novel gene fragment, 279 bp m0s0_228.7 U11293 -2.0 HALO10 89% similar to human rab5c-like protein [U11293] Novel gene fragment, 516 bp i0n0_242 Ab002381 -2.0 HALO11 88% similar to human KIAA0383 gene[AB002381] Rat novel gene fragment, 859 bp s0t0_t218.3 Y57368 13 HALO12 83% homologous to mouse EGF repeat transmembrane protein [U57368] Novel gene fragment, 472 bp i0s0_93.4 1.26247 +6.3 HALO13 80% similar to human suilisol [L26247] Novel gene fragment, 408 bp w0n0_402.9 X02822 -5 HALO14 72% similar to rat repetitive ribosomal DNA 11 3' to 45S pre-rRNA [X02822] Novel gene fragment, 138 bp m0r0_128.4 TR:G2827434 #1.0 HALO15 97% similar to mouse gene fragment, 1849 bp with 44% amino acid similarity to human sorting nexin-2 Novel gene fragment, 176 bp r0t0_371.2 P24229 -8 HALO16 70% similar to E. coli putative ATP- dependent RNA helicase RHLB [P24229] Novel gene fragment, 600 bp g1n0_114.5 U349400 -10 HALO17 360 bp region having 65% similarity to 5' region of human NOFI [U39400] Novel gene fragment, 561 bp s0t0_365.5 AL008729 -2.5 HALO18 Encoded polypeptide 80% similar to human predicted protein DJ257A7.1 [AL008729] Previously Described Nucleic Acid Sequences Newly Shown To Be Haloperidol Responsive Ribosomal prtein L18a s0v0_147.6 X14181 +1.8 HALO19 Inositol 1,4,5-triphosphate 3-kinase b1i0_312.0 M29787 _2 HALO20 2'-3' cyclic nucleotide 3' b1i0_218.6 L16532 -3 HALO21 phosphodiesterase [CNPH] NGFI-B U17254 +2.4 HALO22 l0w0_97.2, w0i0_180.8 Neurogranin g1n0_117.5 L09119 +1.1 HALO23 V-1 protein d0v0_180.9 D26179 -2.0 HALO24 190 kDa ankyrin isoform m1l0_366.9 F069525 -2.0 HALO25 Cathepsin S w0ho_124.3 L03201 -1.7 HALO26 D-Amino acid oxidase g1i0_267.4 B003400 -2.0 HALO27 Stomach nonmuscle Ca+2 ATPase J04023 +2 HALO28 s0v0_133.8, m1y0_132 L1 retrotransposon ORF2 i0s0_66.7 U83119 -3.3 HALO29 h0r0_83.9 d0p0_218.8 h0a0_373.9 r0a0_132.0 h0r0_409 i0n0_250.9 i0n0_65.1 Lone interspersed repetitive DNA d0p0_279.9 13100_5 -5 HALO30 sequence LINE3 m0r0_118.4 d0p0_132 Long interspersed repetitive DNA i0a0_82.8 53581_2 -3.5 HALO31 containing 7 ORFs d0g0_218.8 L1 retrotransposon m1vi2-rn38 h0r0_120.4 U87604 -5.5 HALO32 Known Nucleic Acid Sequences Previously Demonstrated To Be Haloperidol Responsive NGF1-A b1i0_218.6 M18416 -3 HALO33 JunB g0c0_264.7 X54686 +4 HALO34 Synaptophysin [p38] f0k0_242.0 X068388 #1.0 HALO35 Phosphatidyl-inositol-3-kinase U03279 HALO36

Below follows additional discussion of nucleic acid sequences whose expression is differentially regulated in the presence of haloperidol.

HALO1, a novel 86 bp gene fragment The nucleic acid has the following sequence : 1 gaattcagcc agggatcgcc cgtgctcaat gacctcactg ccatcctgga cttggcttgc 61 ctagatctcc tgctccagtt gctagc (SEQ ID NO : 1) Its expression is decreased 2. 9-fold in haloperidol-treated rats.

HAL02, a novel 121 bp gene fragment The nucleic acid has the following sequence : 1 gaattcattg gaaagccaaa cgggtcattt gcagttaccc cctccaaccc acccccacag 61 tcttaaagct gtgctcactg ggatagaaca caaatggcta agcacaggga atgtgcgtac 121 g (SEQ ID NO : 2) Its expression is decreased 2-fold in haloperidol-treated rats.

HAL03, a novel rat gene fragment The nucleic acid was identified in a cloned fragment having the following sequence : 1 actagtaaaa gcttctaact cttcttgttg ttcatttttt ttcctttttc ttctttgttt 61 ggattgcagc attctgctct tctgatgatg cgctgtgacc ctgaaagtag cgcaaaggct : 121 gcgcaggtta atgcgcattg cgtgcgaatg agcccctgtg aacggttgac tagatgagta 181 atctgattga ctggctctct cagtcctatt ctgtagcctt tttggataaa attgggtttt 241 aacgtacctt gagtccaact aatctcatta agtaaatatt ctctatgggc ctgtctagta 301 gattaatgga tcy (SEQ ID NO : 3) It is also provided assembled into a contig that includes EST AA875524, to provide the consensus sequence : 1 ACTAGTAAAAGCTTCTAACTCTTCTTGTTGTTCATTTTTTTTCCTTTTTCTTCTTTGTTT GGATTGCAGCATTCTGCTCT 81 TCTGATGATGCGCTGTGACCCTGAAAGTAGCGCAAAGGCTGCGCAGCGTTAATGCGCATT GCGTGCGAATGAGCCCCTGT 161 GAACGGTTGACTAGATGAGTAATCTGATTGACTGGCTCTCTCAGTCCTATTCTGTAGCCT TTTTGGATAAAATTGGGTTT 241 TAACGTACCTTGAGTCCAACTAATCTCATTAAGTAAATATTCTCTATGGGCCTGTCTAGT AGATTAATGGATCNTGGTTG 321 GCCGTTTGCTGCGTCTAGGGGTGTTCTATGTAGCGCAGCAGTTCGCAGCGATTGCGCAGT GCGATGCTGTTAGGTGGCGC 401 CAGCGATGTTTGCGCTCGCATTACAGGGACATCAACCTAGGTGCAATCCTGTCATGTGAG GTTTTATTTTCTTCCTCCTC 481 AGAAGAGAAGTGTTATGAATCTGAAACTTAAAGCCTAAAGGATAATGACCGACTTGGCAG AAAGATTTTTTTA (SEQ ID NO : 4) HAL04, a novel 649 bp new gene fragment

The nucleic acid was initially identified in a cloned fragment having the following sequence : 1 aagcttgtca gtgcacacat agatggtcgg catgtttagc aaactttgtg aaatttaaat 61 aagtttgtag ttacatgtga aactctaaat gcatggtaac cgttgatgtc ataacagttt : 121 agttatttcg ttctgttctg tcatgtgcca caaaataagt ntctttttca cctttttttt 181 gtttttttgg ttttttgttt ttttggtttt tcctgttttt tttgcccttt gtanattant 241 tgaggttaaa actggttcat cctgaaaaaa acgacgaaaa aaancgaaaa agtccattca 301 tattttttaa caattg (SEQ ID NO : 5) The cloned sequence was assembled onto a contig that includes EST AA891494 : caattgtataagtncccaagtcattcactacaccctcangccttgcntttgtaatttgac ttctgaaatgtcggcgatcaaagcatgcacctgtaccaatgacaaaagaaaaagcatttt atattactactcaataaaatgtgcatgaacttaaagaatgctcatcctttcactgagtct gctgaagggaatgccatgcgcaccaccacggtgtcctctgggtgctggcccttccccacc ctgcacacttaggataggctgcttcccagggacctcacgatataaggagcggtacc (SEQ ID NO : 6) The resulting assembled sequence includes : GGTACCGCTCCTTATATCGTGAGGTCCCTGGGAAGCAGCCTATCCTAAGTGTGCAGGGTG GGGAAGGGCCAGCACCCAGAGGACACCGTGGTGGTGCGCATGGCATTCCCTTCAGCAGAC TCAGTGAAAGGATGAGCATTTTTTTTTCTTTAAGTTCATGCACATTTTATTGAGTAGTAA TATAAAATGCTTTTTCTTTTGTCATTGGTACAGGTGCATGCTTTGATCGCCGACATTTCA GAAGTCAAATTACAAAGGCAAGGCTTGAGGGTGTAGTGAATGACTTGGGCACTTATACAA TTGTTAAAAAATATGAATGGACTTTTCTGCTGTTTGTCGTCGTTGTTTTCAGAATGAACC <BR> <BR> <BR> <BR> AGTTGTAACCTCAACTAATATACAAAGGGCAAAAAAAACAAAAAAAAACAAAAAAAAACA <BR> <BR> <BR> <BR> AAAAAACAAAAAACCAAAAAAACAAAAAAAAAGGTGAAAAAAAGTACGTATTTTGTGGCA <BR> <BR> <BR> <BR> CATGACAGAACAGAACGAAATAACTAAACTGTTATGACATCAACGGTTACCATGCATTTA GAGTTTCACATGTAACTACAAACTTAATTAAATTTCACAAAGTTTGCTAAACATGCCGAC CATCTATGTGTGCACTGACAAGCTTATGTTAAAAACTTTTAAGAATACT (SEQ ID NO : 7) or 1 GGTACCGCTCCTTATATCGTGAGGTCCCTGGGAAGCAGCCTATCCTAAGTGTGCAGGGTG GGGAAGGGCCAGCACCCAGA 81 GGACACCGTGGTGGTGCGCATGGCATTCCCTTCAGCAGACTCAGTGAAAGGATGAGCATT TTTTTTTCTTTAAGTTCATG 161 CACATTTTATTGAGTAGTAATATAAAATGCTTTTTCTTTTGTCATTGGTACAGGTGCATG CTTTGATCGCCGACATTTCA 241 GAAGTCAAATTACAAAGGCAAGGCTTGAGGGTGTAGTGAATGACTTGGGCACTTATACAA TTGTTAAAAAATATGAATGG 321 ACTTTTCTGCTGTTTGTCGTCGTTGTTTTCAGAATGAACCAGTTGTAACCTCAACTAATA TACAAAGGGCAAAAAAAACA 401 AAAAAAAACAAAAAAAAACAAAAAAACAAAAAACCAAAAAAACAAAAAAAAGGTGAAAAA AAGTACGTATTTTGTGGCAC 481 ATGACAGAACAGAACGAAATAACTAAACTGTTATGACATCAACGGTTACCATGCATTTAG AGTTTCACATGTAACTACAA

561 ACTTATTTAAATTTCACAAAGTTTGCTAAACATGCCGACCATCTATGTGTGCACTGACAA GCTTATGTTAAAAACTTTTA 641 AGAATACT (SEQ ID NO : 8) HALO5, a novel 539 bp gene fragment The fragment includes the following sequence : 1 tctagattgt ctgggctgga gtattctgta tggcctggta gacgggaatg ttctgcacgt 61 aaatcatgta tcttcagatg ggacatctct taagtattaa tgttgtgtgt aca (SEQ ID NO : 9) HAL06, a novel 363 bp gene fragment similar to mouse quaking gene The 363 bp sequence is provided as the following sequence- 1 caattgggtt tgcctctatt ttggctcctc cttcttttta tccctcatgg agcctttgcn 61 ncggaccatt attttacatc ngtttncgac taaagttgtt tagngtaagt accanaggtc 121 naggattana cccaaaaaat taaaatcagg gtattctttt acaggcacat aaagtttctc 181 ttgtaactga acaatgggtc ccaccgcgtn acgcaattct gcactccttt tctctgtact 241 gccatttaat gtgtcattgt acatgtcttt ccgtactctg ctaatttctt cgtccagcag 301 ccgctcgagg tggttgaaga tcccgcagaa gttgggcagg ctgctcataa gctt (SEQ ID NO : 10) The fragment is assembled in a contig that includes the following sequence : ctcgagcggctgctggacgaagaaattagcagagtacggnaagacatgtncaatgacaca ttaaatggcagtacagagaaaaggagtgcagaattgcctgactcggtgggacccattgtt cagtnacaagagaaactttatntgcctgtaaaagaataccctggattttaattttgttgg gagaatccttggacctagagggacttacagctaaacaacttgaagcagaaacaggatgta aaataatggtccgaggcaaaggctccatgagggataaaaagaaggaggagcaaaatagag ggcaaacccaattg (SEQ ID NO : 11) to provide the consensus sequence : 1 CAATTGGGTTTGCCCTCTATTTTGGCTCCTCCTTCTTTTTATCCCTCATGGAGCCTTTGC CTCGGACCATTATTTTACAT 81 CCTGTTTNCTGACTACAAGTTGTTTAGCTGTAAGTACCACTAGGTCCAAGGATTATACCC AACAAAATTAAAATCCAGGG 161 TATTCTTTTACAGGCACATAAAGTTTCTCTTGTAACTGAACAATGGGTCCCACCGAGTCA CGCAATTCTGCACTCCTTTT 241 CTCTGTACTGCCATTTAATGTGTCATTGTACATGTCTTTCCGTACTCTGCTAATTTCTTC GTCCAGCAGCCGCTCGAGGT 321 GGTTGAAGATCCCGCAGAAGTTGGGCAGGCTGCTCATAAGCTT (SEQ ID NO : 12)

This sequence is 95% similar to the mouse quaking type 1 gene. Its expression is decreased 2-fold in haloperidol-treated rats. The quaking gene is a member of the STAR (signal transduction and activator of RNA) class of proteins. The protein has a domain with homology to hnRNP K (KH domain) which suggests RNA binding activity. The quaking KH domain is most similar to KH domains from C. elegans gld-1, a tumor suppressor gene and to Sam68 from humans, a downstream target of src. A new unique domain, called QUA2, located immediately downstream from KH domain is also common to Sam68 and gld-1.

The quaking gene has been identified in mice as being implicated in the maintenance of normal extent of myelination of nerve cell axons and is therefore essential for both embryogenesis and development of the nervous system (Zorn, AM and Krieg, PA, Genes and Dev. 11 : 2176-2190 (1997) ; Hardy, RJ et al., J. Neurosci. 16 (24) : 7941-7949 (1996)). The STAR (signal transduction and activation of RNA) family of proteins has been implicated in a variety of functions in developmental processes (Vernet, C and Artzt, K, Trends in Gen. 13 (12) : 479-484 (1997)). The cloned mice gene (qui) is transcribed into three messages of 5, 6 and 7 kb (Hardy et al. (1996) ; Ebersole, TA et al., Nat. Gen. 12 : 260-265 (1996)). Transcription is detected in brain, lung, heart and testes. The translated protein is localized to myelinating tracts in the cerebellum among other locations. The quaking protein has a novel function in that it links signal transduction with some aspect of RNA metabolism. The protein may serve as a role for alternative splicing regulation-has been shown that quaking mice have atypical isoforms of necessary myelin proteins.

Quaking was initially characterized as a phenotypic mouse mutant where the mice exhibited tremors and poor coordination. Mice exhibiting the quaking phenotype were found on histologic analysis to have abnormally spliced myelin proteins. Down-regulation of rat quaking in the striatum following haloperidol administration may suggest direct impact on striatal myelin integrity and supports a hypothesis that this regimen leads to the onset of dystonias. The sequence of the human ortholog of the mouse quaking gene had not been determined prior to the time of the present invention.

Accordingly, two full-length human quaking homologs were identified. They are named human Qk5 and Qk7, for quaking splice variant 5, and human quaking splice variant 7, respectively. The nucleotide (SEQ ID NO : 13) and predicted amino acid sequence (SEQ ID NO : 14) of human Qk5 is shown in Fig. 1. The nucleotide (SEQ ID NO : 15) and predicted amino acid sequence (SEQ ID NO : 16) of human Qk5 is shown in Fig. 2. Homology between

human Qk5 and Qk7 and mouse quaking nucleotide and amino acid sequences are shown in Figs.

3A-C and 4A-D. respectively.

These genes are useful as markers for the onset of tardive dyskinesia/dystonias in human subjects taking neuroleptics.

HAL07, Novel 420 bp gene fragment The sequence was initially identified in the following sequence fragment : 1 actagtggga gggcacatgg aatcgagatg gagaacctga ccctagtatt gagtgctggg 61 cctgtaccta gtgaaggtga ttgaggcagt ggtgagcagt aggtgttttt gaggccttga 121 ggccactgtt taggttgggc aggatagata gacccaggtc tcccagccca ggtgcaaatc 181 atccctcaga ttctgaggct cccttttttc cttcatccat gtgtttctag atgntgcggg 241 aaatgtagtc tttccctctc agggttccct gtagctttag ttgccctaat ggtggtgggt 301 gtggggtctg tatgagtact caggtaagct t (SEQ ID NO : 17) In this sequence bp 266-331 (SEQ ID NO : 18) is 86% similar to the human Ul snRNP 70 kDa protein. Its expression is diminished 1. 7-fold in haloperidol-treated rats.

The 70 kDa protein is a member of the pre-mRNA to mRNA spliceosome complex. The protein is the major antigen recognized by many autoimmune antibodies The fragment was assembled to form a contig whose sequence is : gccggcaactcctgggggcctggcgaggaggcgggcttcccgggggtggggtaggggttg ggacacgggactgcttacctggagaccccaagcttacctgagtactcatacagaccccac acccannaccattagggcaactaaagctacagggaaccctgagagggaaagactacattt cccacatcatctaga (SEQ ID NO : 19) The resulting consensus sequence is : 1 ACTAGTGGGAGGGCACATGGAATCGAGATGGAGAACCTGACCCTAGTATTGAGTGCTGGG CCTGTACCTAGTGAAGGTGA 81 TTGAGGCAGTGGTGAGCAGTAGGTGTTTTTGAGGCCTTGAGGCCACTGTTTAGGTTGGGC AGGATAGATAGACCCAGGTC 161 TCCCAGCCCAGGTGCAAATCATCCCTCAGATTCTGAGGCTCCCTTTTTTCCTTCATCCAT GTGTTTCTAGATGATGCGGG 241 AAATGTAGTCTTTCCCTCTCAGGGTTCCCTGTAGCTTTAGTTGCCCTAATGGTGGTGGGT GTGGGGTCTGTATGAGTACT 321 CAGGTAAGCTTGGGGTCTCCAGGTAAGCAGTCCCGTGTCCCAACCCCTACCCCACCCCCG GGAAGCCCGCCTCCTCGCCA 401 GGCCCCCAGGAGTTGCCGGC (SEQ ID NO : 20) HAL08, a novel 179 bp fragment

The cloned sequence is : 1 acgcgtgccg tttgttttga cgcaggattt cttaatagtg attagtaaag gagcacaaga 61 gtacacaaag accagagagt ttgagaggtt tcagngaatg tgttacaagg cgtacctagc 121 aattcggcag catgccaatt ctcttcatca accttttctc catgatgctt ggctccgga (SEQ ID NO : 21) Its expression is diminished 2-fold in haloperidol-treated rats. In a 179 base portion of this band, 91 % of the bases are similar to mouse phosphatidylinositol 3 kinase 110 kDa subunit, which is the catalytic subunit of the PI-3-kinase gene. This kinase phosphorylates the 3'OH group on inositol lipids. The protein has been implicated as participants in signaling pathways regulating cell growth by virtue of their activation in response to various mitogenic stimuli.

PI3Ks are composed of a 110-kDa catalytic subunit and an 85-kDa adaptor subunit.

HAL09, a novel 94 bp gene fragment The cloned sequence is : 1 agatctgctg tggaattggt attgtatgtc catgggatcc tcttttctca gcacgtgttc 61 ctcactagaa gaaaatgctg ttacctttaa gctt (SEQ ID NO : 22) The expression of this sequence is increased 1. 6-fold in haloperidol-treated rats, and is 98% identical to murine homeobox protein Meis2 mRNA The latter protein is also referred to as MRG1, and is a member of pbx-related homeobox genes in mammalian systems. Meis proteins bind DNA as part of a heterodimer. The second half of the heterodimers come from other HOX proteins. Depending which other HOX protein is binding to the meis2 adapter, the heterodimer can determine the set of actively transcribed genes.

HALO10, a novel 279 bp gene fragment The cloned sequence is : 1 tcatgatgga cccttcccct gcccccagtg gtggcccgag ttgttaagtg cgattggtta 61 gagtagattc cagtcaggtc attctgctgg aggagtgggg gcagtggcag gtaaggggct 121 cagttgctgc agcactggct ccggttggct gggttgctct cctgcagatc cacacctctg 181 tttcggcctg gagcaccagc tgcattctgg ggctcaatct tgggaagctt (SEQ ID NO : 23) Its expression is diminished 2-fold in haloperidol-treated rats and is 89% simlar to human rab 5c-like protein.

The human rab 5c-like protein was initially identified as a gene sequenced from the BRCA 1 candidate region on chromosome 17. Rab proteins are small GTPases involved in the regulation of membrane traffic. Rab5a, rab5b, rab5c all regulate transport in the early endocytic pathway and stimulate the homotypic fusion between early endosomes in vitro and increase the rate of endocytosis when overexpressed in vivo. Rab5c-like protein, RABL, represents a putative small GTP-binding protein from a human fetal lung cDNA library. RABL encodes 216 amino acids that are 86% identical to members of the RAB5 subfamily, and it shows 94% homology in nucleotide sequence with RAB5C of dog. The gene is expressed ubiquitously in all human tissues examined.

The cloned sequence was assembled into a contig that includes the fragments aagcttcccaagaatgagccccagaatgcagctggtgctccaggncgaaacagaggtgtg gatctgcaggagagcaacccagccagccggagccagtgctgcagcaactgagccccttac ctgccactgcccccactcctccagcagaatgacctgactggaatctactctaaccaatca cacttaacaactcggaccaccnctgggggcaggggaagggtccatcatgaattctccgca taactttgatcctagg (SEQ ID NO : 24), fragments aagcttcccaagaatgagccccagaatgcagctggtgctccaggccgaaacagaggtgtg gatctgcaggagagcaacccagccagccggagccagtgctgcagcaactgagccccttac ctgccactgccccnnctcctccagcagaatggcctgactggaatctactctaaccaatcg cacttaacaactcgggccaccattgggggcaggggaagggtccatcatgaattc (SEQ ID NO : 25) and fragment ggatccacacctctgtttcnncctggagcaccagctgcattctggggctcattcttggga agcttcttagctatcgccatgaaaattt (SEQ ID NO : 26) to give the consensus sequence : 1 CCTAGGATCAAAGTTATGCGGAGAATTCATGATGGACCCTTCCCCTGCCCCCAGTGGTGG CCCGAGTTGTTAAGTGCGAT 81 TGGTTAGAGTAGATTCCAGTCAGGTCATTCTGCTGGAGGAGTGGGGGCAGTGGCAGGTAA GGGGCTCAGTTGCTGCAGCA 161 CTGGCTCCGGCTGGCTGGGTTGCTCTCCTGCAGATCCACACCTCTGTTTCGGCCTGGAGC ACCAGCTGCATTCTGGGGCT 241 CATTCTTGGGAAGCTTCTTAGCTATCGCCATGAAAATTT (SEQ ID NO : 27)

HALO11, a novel 516 bp gene fragment The cloned sequence is : 1 agatctctct aactttacat tttcattcca tctgtagatt tttctatctt tataaaatat 61 tggagttatt ttttaaggaa aaatagaaaa gtagcttgtg aatagctcaa accaagctta 121 cacatcgccg catgtaaaaa gcaggaaagt tatttgtgtc tgtttatgtt gcttcctttt 181 gtagcctttg taccctggac gggtgacagt aagggccgag caggagaggc gcgaccttgt 241 aca (SEQ ID NO : 28) Its expression is increased 4-fold in haloperidol-treated rats.

This fragment was assembled into a contig that includes EST AA942662 and EST AA964602 to provide the consensus sequence : 1 GAAGTAACTGACTAAAAAGAGAACGAGATACACACAAGAGTGCTGCTGGCTCCTGTTTTG TACAAGGTCGCGCCTCTCCT 81 GCTCGGCCCTTACTGTCACCCGTCCAGGGTACAAAGGCTACAAAAGGAAGCAACATAAAC AGACACAAATAACTTTCCTG 161 CTTTTTACATGCGGCGATGTGTAAGCTTGGTTTGAGCTATTCACAAGCTACTTTTCTATT TTTCCTTAAAAAATAACTCC 241 AATATTTTATAAAGATAGAAAAATCTACAGATGGAATGAAAATGTAAAGTTAGAGAGATC TCCATAAAATAGGGACTTCA 321 CACCACACTCACTGTTCCTTGAATCCTGCTGCGTGTTCCGACATGTATGAAATGCTTCAG AACCTGACAGGCAAACACTG 401 AGATATGCTCATTCAATAAACACAAGTGTGCGCTTATAAAACAGAAAGCTGCCTCTCCCC AAAGGAGCCTGTCGCCAAAA 481 TGGAAAAGGGTCTTCTCAACTTTACACCAAACATTT (SEQ ID NO : 29) The contig is 88% similar to human mRNA for the KIAA0383 gene. whose function is unknown.

HAL012, a 859 bp novel rat gene fragment The cloned sequence is : 1 aagcttttat cacgtaacca gctgaacaac acaccaaaag cagcctaggg atgagcaccg 61 cgctttggta gcgattaggt tttattcacc tggtattaaa actattcact atttcaaaaa 121 tccggaactt ttaagaattc atttgcaagg cagcatcaaa aactgaaaag gaagggaaaa 181 aaaaacaaca gctaataatc ggcttctccg cacgct (SEQ ID NO : 30) Its expression is increased 3-fold in haloperidol-treated rats. The cloned sequence was assembled into a contig including EST AA926216, EST AA685607, EST H35630 and EST AA925503 to provide the consensus sequence : 1 CGTTTTATAAATTTAATCATTTGCTAATGGAAATTTTACCACCTCCCATTTGTGTTACAA ATCTTAGCTCCTGGAGCGGC

81 ACTACAATTCAGGAGTTGTTTTTTCTCACCTCCTCTGTCATTTGTCACAGGAGGTCCCTG CTTGGCAATGACATTTGTGA 161 GTTAGGATAATGACGTTCCTTCTCTCCTTTTTTTTTCCTTTCATACTTCAGATTTAGGAG AAAAAGATTCTGTTTCCACG 241 TGAGAGGAACTGTAAGCTTTTATCACGTAACCAGCTGAACAACACACCAAAAGCAGCCTA GGGATGAGCACCGCGCTTTG 321 GTAGCGATTAGGTTTTATTCACCTGGTATTAAAACTATTCACTATTTCAAAAATCCGGAA CTTTTAAGAATTCATTTCAA 401 AGGCAGCATCAAAAACTGAAAAGGAAGGAAAXAQAQGCTAATAATCGGCTTCTCCGQCGC GTGGAGCTCGCG 481 AAACTGGAGCCCCGGAGAAGTGGCTCTGCTCAGCCGCCCGCCCACGCCGCGGCGGTCCTT GCTTTCCCCGCATGCGCCCG 561 CAGGCAGCGTGCAGTCCTAAGCCCGGCTGTGGAGAAGCTCACTCTCTCTCTTGTTCTGAA TGGTGTTTGTGTCGGTCTGC 641 CTCTGTGTATGGTATTATGTCTTATAATCCTGCATCACTTCCATCCTATCCAGTCATATC TAATGTAGAAAAATTAGTTT 721 CCAGTGAAAGTAATATGTAGTGCTTTTATGGTATTTGTGTGCAATATCCCCTCTTCTATT GAGGATATTTGATGTAAAGG 801 w=AAACTGAGTTCCAQATAAAATACAAAGTGGCAAAAGTTC (SEQ ID NO : 31) This fragment exhibits 83% similarity to mouse EGF repeat transmembrane protein, whose function is unknown but which is regulated by the IGF-1 receptor.

HAL013, a novel 472 bp gene fragment The cloned sequence is : 1 aagcttggta tttgttccct tgtcgtaagt ttaactgata ccaggctggc cttacccttc 61 atgtttcaac atcccttggc taggagagat ct (SEQ ID NO : 32) Its expression is increased 6. 3-fold in haloperidol-treated rats. This novel gene fragment of 472bp is 80% similar to human suilisol (L26247), which is a homolog of the yeast suil translation factor.

The fragment was assembled into a contig that includes EST H35427, EST AA848657, EST AA900144 and EST AA875574 to provide the consensus sequence : 1 CACAGTCCCCAGCCCTAGAAGAGTGTCACCATTTGAACAGCCCAGGTGACTGAGAGTATG GGTAACTGCCCCAGCTATAT 81 CATTAGAGTTGAGTCTCTCTGGCTGTAAAAAGAACCCTTGGTGTCTGACCAGGTAGGCAG AATCCAGAAAGGGCTACCTT 161 TCCAGAGAAGTCATGGACATTAGCTCACCACCAGGGCAGTCTTTTTTAGGCAGATCTCTC CTAGCCAAGGGATGTTGAAA 241 CATGAAGGGTAAGGCCAGCCTGGTATCAGTTAAACTTACGACAAGGGAACAAATACCAAG CTGGTGCTGTTGGTCTTATG <BR> <BR> <BR> <BR> 321 GCTAGCTATAAAGGCTTCAACACAATACAAGCCACTGCCCAGTGCCATGTGAAGGAACAA ACTGGTCTTTTGGTTTTCTT 401 TTCCCTTCCAGTTTTAATGTTATGTAATGTATTTAAATCCTTATTTAAATAAAGCTTGTT TTCAGAAATAAT (SEQ ID NO : 33) HALO14, a novel 408 bp gene fragment The cloned sequence is : 1 gctagctgag agggggtggg gtggggcggg gctggagaat atgcaggttc ctgaaggtca 61 gtcggggaag tactgctgct gccctagcac gcttcagtgc ctctttagag tttagagttt

121 tctaaagttt tctgcctgaa atcagcgagt gatgatttca ctgtgaaatg atgtctgatc 181 atcgctctcg ctgtcctgtc agggctccgg ctcctggcaa atgtctgact gaaggaaacc 241 ttagttagac tcncacccag ctgtttggaa atggtaatgg agttgatagc acaccctggg 301 ggaaaaaggc aaactccctt tttgcnnant ctcaattccc agcctcgcct gcanctcggg 361 gatttnaag (SEQ ID NO : 34) Its expression is diminished 5-fold in haloperidol-treated rats.

The cloned fragment was assembled into a contig that includes : gctagctgagagggggtggggtggggcggggctggagaatatgcaggtccctgaaggtca gtcggggaagtactgctgctgccctagcacgcttcagtgcctctttagagtttagagttt tctaaagttttctgcctgaaatcagcgagtgatgatttcactgtgaaatgatgtctgatc atcgctctcgctgtcctgtcagggctccggctcctggcaaatgtctgactgaaggaaacc ttagttagactcacacccagctgtttggaaatggtaatggagttgatagcacaccctggg ggaaagaggcagactccctttttgctcactctcaattcccagcctcgccctgccagttcg gggatttctaagtaagggtgaatctggaccanatatgtacttcggaga (SEQ ID NO : 35), gctagctgaganggggtggggtggggcggggctggagaatatgcaggttcctgaaggtca <BR> <BR> <BR> gtcggggaagtactgctgctgccctagcacgcttcagtgcctctttagagtttagagttt <BR> <BR> <BR> <BR> tctaaagttttctgcctgaaatcagcgagtgatgatttcactgtgaaatgatgtctgatc a (SEQ ID NO : 36), and tgatcatcgctctcgctgtcctgtcagggctccggctcctggcaaatgngtgactgaagg aaaccttagttagactcacacccagctgtttggaaatggtaatggagttgatagcacacc ctgggggaaagaggcagactccctttttgctcactctcaattcccagcctcgccctgcca gctcggggatttctaagtaagggtgaatctggaccatatatgtaca (SEQ ID NO : 37), to provide the consensus sequence : 1 GCTAGCTGAGAGGGGGTGGGGTGGGGCGGGGCTGGAGAATATGCAGGTCCCTGAAGGTCA GTCGGGGAAGTACTGCTGCT 81 GCCCTAGCACGCTTCAGTGCCTCTTTAGAGTTTAGAGTTTTCTAAAGTTTTCTGCCTGAA ATCAGCGAGTGATGATTTCA 161 CTGTGAAATGATGTCTGATCATCGCTCTCGCTGTCCTGTCAGGGCTCCGGCTCCTGGCAA ATGTCTGACTGAAGGAAACC 241 TTAGTTAGACTCACACCCAGCTGTTTGGAAATGGTAATGGAGTTGATAGCACACCCTGGG GGAAAGAGGCAGACTCCCTT 321 TTTGCTCACTCTCAATTCCCAGCCTCGCCCTGCCAGCTCGGGGATTTCTAAGTAAGGGTG AATCTGGACCATATATGTAC 401 ATTCGGAGA (SEQ ID NO : 38)

In this sequence, 151 bases have 72% similarity to rat repetitive ribosomal DNA II 3'to 45 S pre- rRNA [X02822].

HALO15, a 138 bp novel gene fragment The sequence is 97% similar to amouse gene fragment 1849 bp in length which has 44% simolarity to human sorting nexin-2 [TR : 62827434]. Sorting nexins are a class of molecules that target ligand-bound peptide receptors and appropriately target them to the lysosomes for degradation. They are highly hydrophilic but are found partially associated with the plasma membrane. They are widely expressed but each sorting nexin has its own tissue specificity set.

Sorting nexin 2 has shown affinity for tyrosine kinase receptors including EGFR, PDGF-R and insulin-R. It also has activity against the long form only of the leptin receptor.

HAL016, a 176 bp novel gene fragment The cloned sequence is : 1 gaattcacaa caccgggtgg gtaggaaagc agctaacata gcctaggttg gtgcagaagc 61 tcacaagaag tggccaggat gtagaggtgg ctgaccaggt aggtagtaag ggcctctact 121 tgccctcctt aacacacaca cctcactcac ggctttgtac aggagcagcc aatggt (SEQ ID NO : 39) Its expression is diminished 8-fold in haloperidol-treated rats. A predicted gene product shows 70% similarity over 31 amino acid residues to E coli putative ATP-dependent RNA helicase RHLB, which was identified in the 85-minute region of the E. coli genome. The E. coli gene encodes a protein sequence with the"D-E-A-D"box motif. Proteins in this gene family occur in eukaryotes as well as prokaryotes, and, as far as tested, have been found to participate in ATP-dependent RNA helicase or RNA-dependent ATPase activities.

HALO17, a 600 bp novel gene fragment The cloned sequence is : 1 tgtacagaca atctcttgtg cattctgtgg aagcatcacc tgtcaataaa aagctaatgg 61 ccagtgagct agaggcagga ttagattgtg ggaaattgga cagggaactc taga (SEQ ID NO : 40) Its expression is diminished 10-fold in haloperidol-treated rats. The cloned sequence was assembled into a contig that includes EST H31749, actagttcacaactcatttaacccattaaaactattctatgtcngccacatggctggtta

gttacctttcagtttcatacatctngcttcccatctagagttccctgtccaatttcccac aatctaatcctgcctctagctcactggccattagctttttattgacaggtgatgcttcca cagaatgcacaagagattgtctgtaca (SEQ ID NO : 41) the sequence fragment : tctagagttcccnntccnntttcccacaatctaatcctgcctctnnctcnttgtccgnna ncttttnatngncaggtgatgcttccacagaatgcacaagagatngtctgnacagnnntc angtcngccnngtaagccngatgnttgntgtggcctcctgtnntggacagctttcn (SEQ ID NO : 42) and the fragment accggtatgtataggtatccacttnaaanctgtccaacacaggangccacancaaccatc aggctaacaaggcagacatgactgctgtan (SEQ ID NO : 43) to provide the consensus sequence : 1 TCACCCCNGTTAATGAGNTGACAGGTACCCCTCGAATCAAGGNCCTACTTTGATGAGCAA CTTAAANCCTGNCTTCTTGA 81 GAAAGGCCTTCTGAGNCCTGATGGTCAGCCCATGTGGCAGTGCTCTCCACAGACTGGCAT CCAGAGAGGAAGTGGACTTG 161 GAATCTCTGGAATGGGACACAAAGAACAGAATTTATTCTTAGGATGAAAGGGCTTTGAGA TAAGGCCTTGCTTTCGTCAA 241 GGGGGAGTAGACCGGTATGTATAGGTATCCACTTGAAAGCTGTCCAACACAGGAGGCCAC AGCAACCATCAGGCTAACAA 321 GGCAGACATGACTGCTGTACAGACAATCTCTTGTGCATTCTGTGGAAGCATCACCTGTCA ATAAAAAGCTAATGGCCAGT 401 GAGCTAGAGGCAGGATTAGATTGTGGGAAATTGGACAGGGAACTCTAGATGGGAAGCNAG ATGTATGAAACTGAAAGGTA 481 ACTAACCAGCCATGTGGCNGACATAGAATAGTTTTAATGGGTTAAATGAGTTGTGAACTA GT (SEQ ID NO : 44) In a 36 base portion of this sequence there is a 65% similarity to the 5'region of the human NOF 1 gene. The term"NOF"represents"Neighbor of FAU."The human was NOF 1 gene was isolated during a chromosomal walk along l l q 13 in search of a gene responsible for the translocation breakpoint in a particular clone of B-cell NHL. cDNA clones representing NOF hybridized to a 2. 2-kb mRNA present in all tissues tested. The largest open reading frame appears to contain 166 amino acids and is proline rich. The sequence shows no homology with any known gene in the public databases. The NOF gene consists of 4 exons and 3 introns spanning approximately 5 kh, and the boundaries between exons and introns follow the GT/AG rule. The NOF locus is conserved during evolution, with the predicted protein having over 80% identity to three translated mouse and rat ESTs of unknown function. The NOF 1 gene is not the gene responsible for the translocation in the 11 q 13 chromosomal region.

HALO18, a 561 bp novel gene fragment The cloned sequence is : 1 aagcttcaga cattatggat ggaccagatc ctggcgcccc cgtgaaattg ccttgtctgc 61 cagtgaaact gtcgcctccg ctacccccaa aganagtcct gatctgcatg cctgtagggg 121 gcccagagct ctccctggca ccctacgcag cccagaagag cagccagcag gtgttggccc 181 agcaccacca caccgtcctg ccatcccaga tgnagcacca gctgagttat tcgcagccac 241 ggccagcatc tcccgtcctc caccggcacc ttacccatgc acccctcggg ctgcaggatg 301 atcgatnagc tgaacaagac ncttgctatg accatgcagn ggctggaaag ctccgagnaa (SEQ ID NO : 45) Its expression is diminished 2. 5-fold in haloperidol-treated rats. This fragment was assembled into a contig that includes fragment : acgcgttnctcggagctttccagcctctgcatggtcatagcaagtgtcttgttcagctca tcgatcatcctgcagcccgaggggtgcatgggtaaggtgncggtggaggacgggagatgc tggccgtggctgccatactgcagctggtgctgcatctgggatggcaggacggtgtggtgg tgctgggccacagcctgctggctgctcttctgggctgcgtaggatgccagggagagctct ggggccc (SEQ ID NO : 46) the fragment nncccagagctctccctggcatcctacgcngcccagaagagcanccagcaggttgtggcc cagcaccaccacaccgtcctnccatcccanatgcagcaccagctnagtatggcagccacg gccagcatctcccgtcctccaccggcaccttacccatgcacccctcgggctgcagggatg atcgatgagctgaacaagacacttgctatgaccatgcagaggctggaaagctccgagcaa cgnttcccctgctccacttcttaccacagctctggttttgcacn (SEQ ID NO : 47), the fragment ncccgttnctcgntgctttccagcctctgcatggtcatagcaagngtctttttcggctca ncgatcatcctgcagcccgaggggtgcatgggtaaggtgncggtggaggacgggagatgc tggccgtggcgccatactncagctggtgctgatctgggatgggcaggacggtgtggtgnt gctgggccacagcctgctggctgctcttctgggctgcttaggatgccaggganagctctg ggcn (SEQ ID NO : 48), and the fragment

agatctacgntaaagatggagagctctccatatcaaatgaagatnactccctcacaaacg gccagtccctgagctccagccagctctctttgcctgctctgtcggaaatggagcctgtcc caatgcccagggacccctgctcatatgaggtgctccaagcttcagacattatggatggac cagatcctggcgcc (SEQ ID NO : 49) to generate the consensus sequence : 1 NGTGCAAAACCAGAGCTGTGGTAAGAAGTGGAGCAGGGGAACGCGTTGCTCGGAGCTTTC CAGCCTCTGCATGGTCATAG 81 CAAGTGTCTTGTTCAGCTCATCGATCATCCTGCAGCCCGAGGGGTGCATGGGTAAGGTGC CGGTGGAGGACGGGAGATGC 161 TGGCCGTGGCTGCCATACTGCAGCTGGTGCTGCATCTGGGATGGCAGGACGGTGTGGTGG TGCTGGGCCACAGCCTGCTG 241 GCTGCTCTTCTGGGCTGCGTAGGATGCCAGGGAGAGCTCTGGGGCCCCCTACAGGCATGC AGATCAGGACTNTCTTTGGG 321 GGTAGCGGAGGCGACAGTTTCACTGGCAGACAAGGCAATTTCACGGGGGCGCCAGGATCT GGTCCATCCATAATGTCTGA 401 AGCTTGGAGCACCTCATATGAGCAGGGGTCCCTGGGCATTGGGACAGGCTCCATTTCCGA CAGAGCAGGCAAAGAGAGCT 481 GGCTGGAGCTCAGGGACTGGCCGTTTGTGAGGGAGTNATCTTCATTTGATATGGAGAGCT CTCCATCTTTANCGTAGATC 561 T (SEQ ID NO : 50) In a 103 amino acid fragment of a putative gene product there is 80% amino acid identity to human predicted protein DJ257A7. 1.

HAL019 HALO 19 corresponds to a nucleotides encoding a component of the large ribosomal subunit L18a [X14181]. Its transcription is increased 1. 8 fold in haloperidol-treated rats. This sequence includes two zipper-like domains and has been shown to interact with Jun in the zipper region.

HAL020 HAL020 corresponds to a nucleotides encoding inositol 1, 4, 5, triphosphatase [M29787].

Its expression is increased 2-fold in haloperidol-treated rats. The kinase phosphorylates 1, 4, 5 inositol triposphate on the 3'position to add a fourth phosphate. The kinase functions in signal transduction.

HAL021 HAL021 corresponds to 2', 3' cyclic nucleotide 3'phosphodiesterase (CNPII) [L16532].

Its expression decreases 3. 0 folf in haldoperidol-treated reats.

It exiss in multiple isoforms. A larger isoform of phosphodiesterase localizes to CNS.

The protein is associated with myelination in the CNS. Its conserved motifs include two leucine

repeat heptads, and two consensus motifs for phosphorylation in the N-terminal domain of CNP2.

CNP2 is produced by alternate splicing from the original CNP gene. In central and peripheral nervous system tissues, the enzyme is localized almost exclusively in the two cell types that elaborate myelin, the oligodendrocyte and the Schwann cell, respectively. Nonneural sources of CNPase have also been described, but they all have much lower activities than those found in brain. The freshly isolated brain enzymes appear as closely spaced doublets at approximately 46 and 48 kDa on SDS-PAGE. The primary sequence appears highly conserved between these two proteins, designated CNP 1 and CNP2.

HAL022 HAL023 corresponds to NGFI-B [U17254], which is also known as Nur77. Its expressionis increased 2. 4 fold in haloperidol-treated rats.

NGFI-B was identified by differential hybridization as a gene that is rapidly, but transiently, induced in PC 12 cells by NGF. The nucleotide sequence of the NGFI-B gene reveals that it encodes a 61 kd protein with strong homologies to members of the glucocorticoid nuclear receptor gene family. Transcription of NGFI-B itself is induced in an immediate early response and has been documented as a response to various stimuli including fos/jun and TSH.

HAL023 HAL023 corresponds to neurogranin [L09119]. Its expresssion is increased 1. 1 fold in haloperidol-treated rats.

Neurogranin is also known as the C kinase substrate calmodulin binding protein and the rodent cortex protein (RC3), which is 78 amino acids in length. The RC3 protein amino terminus contains a cysteine-rich domain similar to those found in snake venom neurotoxins. The carboxyl terminus contains a collagen-like motif that may function in the assembly of RC3 subunits into a multimeric protein. RC3 and GAP-43 regulate calmodulin availability in dendritic spines and axons, respectively, and calmodulin regulates their ability to amplify the mobilization of Ca2+ in response to metabotropic glutamate receptor stimulation. These molecules release CaM rapidly in response to large influxes of Ca2+ and slowly in response to small increases. This nonlinear response is analogous to the behavior of a capacitor, hence the name calpacitin. The protein may be involved in the process of neuronal long-term potentiation and dendritic spine remodelling.

HAL024 HAL024 corresponds to V-1 protein. Its expression is decreased 2. 0-fold in haldoperidol-treated rats. It contains 2. 5 contiguous repeats ofthe cdc10/SWI6 motif, which was originally found in products of cell cycle control. Highest levels of expression of this gene are in the hippocampus and cerebellum, followed by cortical expression. The protein has been implicated in differentiating classes of neurons, including cerebellar granule cells. Abnormal temporal profile of V-1 expression during prenatal cerebellar development has been noted in the staggerer mouse mutant, which fails to establish connections between granule and purkinje cells in the cerebellum.

HAL025 HAL025 corresponds to the 190 kDA ankyrin isoform [F069525]. Its expresssion is decreased 2. 0-fold in haloperidol-treated rats.

Ankyrins are a family of adapters that mediate linkages between integral membrane proteins and cytoskeletal components. Such interactions are thought to be important to the polarized distribution of membrane proteins in transporting epithelia. This ankyrin isoform has homology to, but is not identical with, the previously identified larger neuronal isoform. The protein has (a) expression at the lateral plasma membrane, (b) functional assembly with the cytoskeleton, and (c) interaction with at least one membrane protein, the Na, K-ATPase. This latter interaction may support its involvement with the regulation of cell polarity.

HAL026 HAL026 corresponds to cathepsin S [L03201]. Its expression is decreased 1. 7-fold in haloperidol treated rats.

Cathepsin S is a cysteine protease with elastase activity. It was initially described in alveolar macrophages and has a broad range of natural pH activity. The gene contains only 2 Spl sites but contains 18 API sites that may be involved in the regulation of the gene.

HAL027 HAL027 corresponds to D-amino acid oxidase [B003400]. Its expression is decreased 2. 0-fold in haldoperidol-teated rats.

D-amino acid oxidase is one of the principal and characteristic flavoenzymes of peroxisomes, and is found in liver, kidney and brain. The oxidase on a wide range of D-amino acids but is completely inactive on the natural, useful L-amino acids. It requires FAD as a prosthetic group. Its active site is distinct from D-aspartate oxidase. It is thoughthought that the function of the amino acid is for protection against D amino acids of bacteria, fungi.

Alternatively, it is possible that the enzyme is may be an evolutionary relic. Prototypical reaction describes glycine being converted to glyoxylate (HC=OCOOH) with the release of NH3 and the formation of peroxide from °2 and HO.

HAL028 HAL028 corresponds to stomach nonmuscle Ca+2 ATPase [J04023]. Its expression is increased 2-fold in heloperidol-treated rats.

Stomach nonmuscle Ca+2 ATPase is also known as sarcoplasmic reticulum Ca+2 ATPase. The enzyme is a Ca+2 transporting ATPase of the aspartylphosphate class. This ATPase was characterized in rat stomach, brain and kidney tissue and has homology to the slow-twitch isoform of the Ca+2 ATPase. It is distnguishable in that it has a novel, different C-terminus. It localizedsto ER/SR region and regulates intracellular calcium stores. It is possibly a rat homolog for human HK1 channel.

HAL029 HAL029 corresponds to long interspersed reptitive DNA sequence LINE3 [13100_5].

Its expression decreases 5-fold in haloperidol-treated rats.

HAL030 HAL030 corresponds to long interspersed reptititve DNA containing 7 open reading frames (ORF) [535812]. Its expresssion decreases 3. 5-fold in haloperidol-treated rats.

HAL031 HAL031 corresponds to Ll to retrotransposon ml vi2-rn38 [U87605]. Its expresssion decreases 5. 5-fold in haldoperidol-treated rats.

HAL032

HAL032 corresponds to LI retrotransposon ORF2 [U82119]. Its expresssion decreases 3. 3fold in haloperidol-treated rats.

HAL033 HALO33 corresponds to NGF1-A [M 18146], whose expression has previously been reported to be differentially regulated by haldoperidol. In the present studies, its expression increased 1. 6-fold in haloperidol-treated rats.

NGF1-A is also known as EGR-1, krox-24, or zif268. It is an early growth response gene that displays fos-like kinetics following mitogenic stimulation. It includes three DNA-binding zinc fingers and functions as a transcription factor.

HAL034 HAL034 corresponds to JunB [X54686], whose expression has been previously reported to be differentially regulated by haldoperidol. In the present studies, its expresssion increased 4- fold in haldoperidol-treated rats.

JunB is a transcription factor that is a member of the serum response element family (SRE) as is NGFI-A and NGFI-B.

HAL035 HAL035 corresponds to synaptophysin [X06388], whose expression has been previously reported to be differentially regulated by haldoperidol. In the present studies, its expression varied about 1-fold in haldoperidol-treated rats.

Synaptophysin is an integral membrane protein of small synaptic vesicles in brain and endocrine cells. It is also detected in presynaptic vesicles. Complexes of six synaptophysin molecules in the synaptic vesicle membrane may be part of the fusion pore between the synaptic vesicle and the plasma membrane.

HAL036 HAL036 correponds to phophatidyl-inositol-3-kinase. Inisotaol monophsphates are reporded to decrease 4-6 weeks following administeration of heloperidol in deconoate dosing.

The HALOX nucleic acids and encoded polypeptides can be identified using the information provide above. In some embodiments, the HALOX nucleic acids and polypeptide

correspond to nucleic acids or polypeptides which include the various sequences (referenced by SEQ ID NOs) disclosed for each HALOX polypeptide.

Screening for Psychotropic Drugs Lacking Signif cant Side Effects In one aspect, the invention provides a method of identifying a psychotropic agent that does not induce a significant motor side effect. A used herein, a"significant motor side effect"is an unintended motor effect which materially impacts a subject's ability to enjoy or perform a life function. Examples of types of motor effects include, e. g, dystonias. Dystonic movements can include slow writhing movements that are transiently sustained. They can affect several distinct areas of the body, such as the limbs, lips, tongue and eyes. Motor side effects can also include tardive dyskinesias. Symptoms of tardive dyskinesia include, e. g., persistent movement disorders, repeated tongue protrusions and lip smacking.

The psychotropic agent can be identified by providing a cell population that includes cells capable of expressing one or more genes homologous to those listed in Table 1 as HALO 1- 32. The sequences need not be identical to sequences including HALO1-32, as long as the sequence is sufficiently similar that specific hybridization can be detected. Preferably, the cell includes sequences that are identical, or nearly identical to those identifying the HALOX nucleic acids shown in Table 1.

The cell population exposed to, i. e., contacted with, the test psychotropic agent can be any number of cells, i. e., one or more cells, and can be provided in vitro, in vivo, or ex vivo.

The cell population is preferably obtained from or derived from a human or rodent subject, or is provided in vivo in the mammalian subject. The cell population can be, e. g., derived from brain tissue or non-brain neuronal tissue. Preferably, the cell population is from striatum brain tissue.

If desired, the cell population can be divided into two or more subpopulations. In some embodiments, various sub populations can be exposed to a control agent, and/or a test psychotropic agent, multiple test psychotropic agents, or, e. g., varying dosages of one or multiple test agents administered together, or in various combinations.

In general expression of the genes or nucleic acids can be measured using any method known in the art, e. g., using northern based hybridization analysis or methods which specifically, and, preferably, quantitatively amplify specific nucleic acid sequences.

Expression of sequences in test and control populations of cells can be compared using any art-recognized method for comparing expression of nucleic acid sequences. For example, expression can be compared using GENECALLING'methods as described described US Patent Patent 5, 871, 697 and in Shimkets et al., Nat. Biotechnol. 17 : 798-803.

Expression of the gene or genes in the test cell population is then compared to the expression of the gene in a reference cell population, which is a cell population not exposed to the test psychotropic agent, or, in some embodiments, a cell population exposed to a significantly lower dose (e. g., 10, 100, 1000 or more lower dose) of the test psychotropic agent. Comparison can be performed on test and control samples measured concurrently or at temporally distinct times. An example of the latter is the use of compiled expression information, e. g., a sequence database. which assembles information about expression levels of known sequences following administration of various agents. For example, alteration of expression levels following administration of test psychotropic agent can be compared to the expression changes observed in the gene following administration of a control agent, such as haloperidol.

An alteration in expression of the gene in the test cell population compared to the expression of the gene in the reference cell population indicates the psychotropic agent does not induce a significant motor side effect. Preferably, the reference cell population, which can be one or more cells, has been exposed to a psychotropic agent which induces a motor side effect.

For example, the control agent can be a butyrophenone compound, such as droperidol or haloperidol, or can be control agent is a phenothiazine, such as chlorpromaine. In some embodiments, the control agent can be a test vehicle.

For some applications it will be desirable to divide a starting cell population into two or more subpopulations of cells. The subpopulations can be created by dividing the first population of cells to create as identical a subpopulation as possible. This will be suitable, in, for example, in vitro or ex vivo screening methods. Alternatively, subpopulations can be created by exposing two matched populations of cells to the test psychotropic agent and control agent. For in vivo studies, for example, the test psychotropic agent and control agent can be administered to two groups of test animals, after which cells are recovered from animals and gene expression measured. Preferably, the test animals are as similar as possible with respect to genetic background, sex, age, weight, nutritional status and other parameters.

While the expression of any number of sequences shown in Table 1 can be compared, it is preferred that the expression of multiple, e. g, 2, 3, 5, 7, 9, 11, 13, 15, 17, 20, 23, 25, 30 or

even all 32 sequences be compared. In addition, expression of one or more of sequences SEQ ID NOs ; 1-32 can be compared with sequences from of HALO33-36, corresponding to NGF 1-A, JunB, synaptophysin, phosphatidyl-inositl-3-kinase, which have been previously shown to be response to haloperidol.

For some genes whose expression is measured, an increase in expression of the gene in the first subset of cells compared to the second subset of cells indicates the psychotropic agent does not induce a motor side effect. These genes include those sequences whose expression decreases in haloperidol cells vs. control cells, as shown in Table 1. Examples of such genes include, e. g., HALO 1-8, 10, 11, 14, 16-18, 21, 24-27, 29-32, and HAL033.

For other genes, a decrease in expression of the gene in the first subset of cells compared to the second subset of cells indicates psychotropic agent does not induce a motor side effect These genes include those sequences whose expression increases in haloperidol cells vs. control cells, as shown in Table 1. Examples of these genes include, e. g., HALO 10, 13, 14. 20, 21, 23, 24, 29, and 35.

The invention also includes a psychotropic agent identified according to this screening method, and a pharmaceutical composition comprising the psychotropic agent so identified.

Also included in the invention is a method of selecting a psychotropic agent appropriate for a particular subject, e. g., a particular human subject. By appropriate is meant that psychotropic agent does not induce a significant motor defect in the subject.

The method is based in part on the observation that different individuals metabolize pharmaceutical agents due to, in part, differences in their genetic backgrounds. Accordingly, the method identifies agents which, for the given individual, do not induce gene expression patterns characteristic of a haloperidol response.

The method includes providing from the subject a cell population comprising a cell capable of expressing one or more genes, wherein the gene is selected from the group consisting of HALO 1-32. A cell population from the subject is then contacted with the psychotropic agent, and expression of the gene is measured and compared to a reference cell population. An alteration in expression of the gene in the test cell population compared to the expression of the gene in the reference cell population indicates the psychotropic agent is appropriate for the subject.

Any cell can be used, as long as it is capable of expressing one or more genes homologous to those listed in Table 1 as HALO 1-32. The sequences need not be identical to sequences including HALO1-32, as long as the sequence is sufficiently similar that specific hybridization can be detected. Preferably, the cell includes sequences that are identical, or nearly identical to those identifying the HALOX nucleic acids shown in Table 1.

The cell population exposed to, i. e., contacted with, the test psychotropic agent can be any number of cells, i. e., one or more cells, and can be provided in vitro, in vivo, or ex vivo. The cell population is preferably derived from brain tissue or non-brain neuronal tissue. A preferred source for the cell population is striatum brain tissue.

If desired, the cell population can be divided into two or more subpopulations. In some embodiments, various sub populations can be exposed to a control agent, and/or a test psychotropic agent, multiple test psychotropic agents, or, e. g., varying dosages of one or multiple test agents administered together, or in various combinations.

In general expression of the genes or nucleic acids can be measured using any method known in the art, e. g., using northern based hybridization analysis or methods which specifically, and, preferably, quantitatively amplify specific nucleic acid sequences. In some embodiments expression can be measured at the protein level, i. e., by measuring expression levels of the HALOX proteins.

Expression of sequences in test and control populations of cells can be compared using any art-recognized method for comparing expression of nucleic acid sequences. For example, expression can be compared using GENECALLING'K methods as described in US Patent No.

5, 871, 697 and in Shimkets et al., Nat. Biotechnol. 17 : 798-803.

Expression of the gene or genes in the test cell population is then compared to the expression of the gene in a reference cell population, which is a cell population not exposed to the test psychotropic agent, or, in some embodiments, a cell population exposed to a significantly lower dose (e. g., 10, 100, 1000 or more lower dose) of the test psychotropic agent. Comparison can be performed on test and control samples measured concurrently or at temporally distinct times. An example of the latter is the use of compiled expression information, e. g., a sequence database, which assembles information about expression levels of known sequences following administration of various agents. For example, alteration of expression levels following administration of test psychotropic agent can be compared to the expression changes observed in the gene following administration of a control agent, such as haloperidol.

An alteration in expression of the gene in the test cell population compared to the expression of the gene in the reference cell population indicates the psychotropic agent does not induce a significant motor side effect and is an appropriate agent for the subject Preferably, the reference cell population, which can be one or more cells, has been exposed to a psychotropic agent which induces a motor side effect. For example, the control agent can be a butyrophenone compound, such as droperidol or haloperidol, or can be control agent is a phenothiazine, such as chlorpromaine. In some embodiments, the control agent can be a test vehicle.

For some applications it will be desirable to divide a starting cell population into two or more subpopulations of cells. The subpopulations can be created by dividing the first population of cells to create as identical a subpopulation as possible. This will be suitable, in, for example, in vitro or ex vivo screening methods. Alternatively, subpopulations can be created by exposing two matched populations of cells to the test psychotropic agent and control agent. For in vivo studies, for example, the test psychotropic agent and control agent can be administered to two groups of test animals, after which cells are recovered from animals and gene expression measured. Preferably, the test animals are as similar as possible with respect to genetic background, sex, age, weight, nutritional status and other parameters.

While the expression of any number of sequences shown in Table 1 can be compared, it is preferred that the expression of multiple, e. g., 2, 3, 5, 7, 9, 11, 13, 15, 17, 20, 23, 25, 30 or even all 32 sequences be compared. In addition, expression of one or more of sequences SEQ ID NOs ; I-32 can be compared with sequences from of HAL033-36, corresponding to NGFI-A, JunB, synaptophysin, phosphatidyl-inositl-3-kinase. which have been previously shown to be response to haloperidol.

For some genes whose expression is measured, an increase in expression of the gene in the first subset of cells compared to the second subset of cells indicates the psychotropic agent does not induce a motor side effect. These genes include those sequences whose expression decreases in haloperidol cells vs. control cells, as shown in Table 1. Examples of such genes include, e. g., HALO 1-8, 10, 11, 14, 16-18, 21, 24-27, 29-32, and HAL033.

For other genes, a decrease in expression of the gene in the first subset of cells compared to the second subset of cells indicates psychotropic agent does not induce a motor side effect These genes include those sequences whose expression increases in haloperidol cells vs. control cells, as shown in Table 1. Examples of these genes include, e. g., HALO 10, 13, 14, 20, 21, 23, 24, 29, and 35.

Methods of Diagnosing Motor Pathologies Included in the invention is a method of diagnosing, or determining susceptibility to, a movement disorder in a subject, e. g., a human subject.

The method includes providing from the subject a cell population which includes one or more cells capable of expressing one or more HALO genes, e. g. HALOI-36wherein each gene is selected from the group consisting of HALO 1-32. Expression of the gene is compared to an expression pattern of cells that are indicative of the presence of a movement disorder ("diseased reference group"), or indicative of cells known not to suffer from a movement disorder ("healthy reference group"), or both reference groups. A similar expression level of the gene in the test cell population compared to the expression of the gene in the diseased reference cell population indicates subject has or is susceptible to a movement disorder. An inverse expression level of the gene in the test cell population compared to the expression of the gene in the healthy reference group similarly indicates the subject has or is susceptible to a movement disorder.

In a specific aspect, the invention includes a method of diagnosing or determining susceptibility to a movement disorder in a subject by providing a cell population from the subject that includes a cell capable of expressing one or more genes, wherein each gene is selected from the group consisting of a quaking gene and a gene encoding VI protein and comparing the expression of the gene to the expression of the gene in a reference cell population. The gene can be e. g., a human or mouse quaking gene, e. g., a human Qk5 or Qk7 gene.

In some embodiments, the observed alteration in expression is an increase in expression of the quaking gene or VI gene in the test subject relative to a healthy reference group control sample. In other embodiments, the observed alteration in expression is an decrease in expression of the quaking gene or VI gene in the test subject relative to a healthy reference group control sample.

Methods of treating motor pathologies Also included in the invention is a method of preventing or delaying the onset of a motor pathology in a subject, e. g., a human, by administering to the subject an agent which increases the expression or activity of a gene selected from the group consisting of HALO 1-8, 10, 11, 14,

16-18, 21, 24-27, 29-32, and HAL033. In some embodiments, the motor pathology is associated with administration of a psychoactive agent to the subject.

In another aspect, the invention includes a method of preventing or delaying the onset of a motor pathology in a subject by administering to the subject an agent which decreases the expression or activity of a gene selected from the group consisting of HALO 9, 12, 13, 19, 20, 22, 23, 28, and HALO 34.

In some embodiments, the agent increases the expression or activity of a human quaking gene, and can be, e. g., a human Qk5 nucleic acid or protein, or a human Qk7 nucleic acid or protein.

In some embodiments, the motor pathology is associated with administration of a psychoactive agent to the subject. The motor pathology can be any of the motor impairments described herein, e. g., a dystonia.

In some embodiments, the agent increases the expression or activity of a human quaking gene. Thus, the agent can be e. g, a quaking gene, including a human Qk5 nucleic acid or human Qk7 nucleic acid, a human Qk5 or human Qk7 polypeptide, or an agonist of a human Qk5 or Qk7 polypeptide.

The herein described HALO nucleic acids, polypeptides, antibodies, agonists, and antagonists when used therapeutically are referred to herein as"Therapeutics". Methods of administration of Therapeutics include, but are not limited to, intradermal, intramuscular, intraperitoneal, intravenous, subcutaneous, intranasal, epidural, and oral routes. The Therapeutics of the present invention may be administered by any convenient route, for example by infusion or bolus injection, by absorption through epithelial or mucocutaneous linings (e. g., oral mucosa. rectal and intestinal mucosa, etc.) and may be administered together with other biologically-active agents. Administration can be systemic or local. In addition, it may be advantageous to administer the Therapeutic into the central nervous system by any suitable route, including intraventricular and intrathecal injection. Intraventricular injection may be facilitated by an intraventricular catheter attached to a reservoir (e. g., an Ommaya reservoir). Pulmonary administration may also be employed by use of an inhaler or nebulizer, and formulation with an aerosolizing agent. It may also be desirable to administer the Therapeutic locally to the area in need of treatment ; this may be achieved by, for example, and not by way of limitation, local infusion during surgery, topical application, by injection, by means of a catheter, by means of a

suppository, or by means of an implant. In a specific embodiment, administration may be by direct injection at the site (or former site) of a malignant tumor or neoplastic or pre-neoplastic tissue.

Various delivery systems are known and can be used to administer a Therapeutic of the present invention including, e. g. : (i) encapsulation in liposomes, microparticles, microcapsules ; (ii) recombinant cells capable of expressing the Therapeutic ; (iii) receptor-mediated endocytosis (See, e. g., Wu and Wu, 1987. JBiol Chem 262 : 4429-4432) ; (iv) construction of a Therapeutic nucleic acid as part of a retroviral or other vector, and the like. In one embodiment of the present invention, the Therapeutic may be delivered in a vesicle, in particular a liposome. In a liposome, the protein of the present invention is combined, in addition to other pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, and the like. Preparation of such liposomal formulations is within the level of skill in the art, as disclosed, for example, in U. S. Pat. No.

4, 837, 028 ; and U. S. Pat. No. 4, 737, 323, all of which are incorporated herein by reference. In yet another embodiment, the Therapeutic can be delivered in a controlled release system including, e. g. : a delivery pump (See, e. g., Saudek, et al.., 1989. New Engl J led 321 : 574 and a semi-permeable polymeric material (See, e. g, Howard, et al.., 1989. JNeurosurg 71 : 105).

Additionally, the controlled release system can be placed in proximity of the therapeutic target (e. g., the brain), thus requiring only a fraction of the systemic dose. See, e. g., Goodson, In : Medical Applications of Controlled Release 1984. (CRC Press, Bocca Raton, FL).

In a specific embodiment of the present invention, where the Therapeutic is a nucleic acid encoding a protein, the Therapeutic nucleic acid may be administered in vivo to promote expression of its encoded protein, by constructing it as part of an appropriate nucleic acid expression vector and administering it so that it becomes intracellular (e. g., by use of a retroviral vector, by direct injection, by use of microparticle bombardment, by coating with lipids or cell-surface receptors or transfecting agents, or by administering it in linkage to a homeobox-like peptide which is known to enter the nucleus (See, e. g., Joliot, et al.., 1991. Proc Natl Acad Sci USA 88 : 1864-1868), and the like. Alternatively, a nucleic acid Therapeutic can be introduced intracellularly and incorporated within host cell DNA for expression, by homologous recombination.

As used herein, the term"therapeutically effective amount"means the total amount of each active component of the pharmaceutical composition or method that is sufficient to show a meaningful patient benefit, i. e., treatment, healing, prevention or amelioration of the relevant medical condition, or an increase in rate of treatment, healing, prevention or amelioration of such conditions. When applied to an individual active ingredient, administered alone, the term refers to that ingredient alone. When applied to a combination, the term refers to combined amounts of the active ingredients that result in the therapeutic effect, whether administered in combination, serially or simultaneously.

The amount of the Therapeutic of the invention which will be effective in the treatment of a particular disorder or condition will depend on the nature of the disorder or condition, and may be determined by standard clinical techniques by those of average skill within the art. In addition, in vitro assays may optionally be employed to help identify optimal dosage ranges. The precise dose to be employed in the formulation will also depend on the route of administration, and the overall seriousness of the disease or disorder, and should be decided according to the judgment of the practitioner and each patient's circumstances. Ultimately, the attending physician will decide the amount of protein of the present invention with which to treat each individual patient. Initially, the attending physician will administer low doses of protein of the present invention and observe the patient's response. Larger doses of protein of the present invention may be administered until the optimal therapeutic effect is obtained for the patient, and at that point the dosage is not increased further. However, suitable dosage ranges for intravenous administration of the Therapeutics of the present invention are generally about 20-500 micrograms (g) of active compound per kilogram (Kg) body weight. Suitable dosage ranges for intranasal administration are generally about 0. 01 pg/kg body weight to 1 mg/kg body weight.

Effective doses may be extrapolated from dose-response curves derived from in vitro or animal model test systems. Suppositories generally contain active ingredient in the range of 0. 5% to 10% by weight ; oral formulations preferably contain 10% to 95% active ingredient.

The duration of intravenous therapy using the pharmaceutical composition of the present invention will vary, depending on the severity of the disease being treated and the condition and potential idiosyncratic response of each individual patient. It is contemplated that the duration of each application of the protein of the present invention will be in the range of 12 to 24 hours of continuous intravenous administration. Ultimately the attending physician will decide on the

appropriate duration of intravenous therapy using the pharmaceutical composition of the present invention.

Polynucleotides of the present invention can also be used for gene therapy. Gene therapy refers to therapy that is performed by the administration of a specific nucleic acid to a subject.

Delivery of the Therapeutic nucleic acid into a mammalian subject may be either direct (i. e., the patient is directly exposed to the nucleic acid or nucleic acid-containing vector) or indirect (i. e., cells are first transformed with the nucleic acid in vitro, then transplanted into the patient). These two approaches are known, respectively, as in vivo or ex vivo gene therapy. Polynucleotides of the invention may also be administered by other known methods for introduction of nucleic acid into a cell or organism (including, without limitation, in the form of viral vectors or naked DNA).

Any of the methodologies relating to gene therapy available within the art may be used in the practice of the present invention. See e. g., Goldspiel, et al.., 1993. Clin Pharm 12 : 488-505.

Cells may also be cultured ex vivo in the presence of therapeutic agents or proteins of the present invention in order to proliferate or to produce a desired effect on or activity in such cells.

Treated cells can then be introduced in vivo for therapeutic purposes.

HALO Nucleic Acids Also provided in the invention are novel nucleic acid comprising a nucleic acid sequence selected from the group consisting of HALOs : I-19, or its complement, as well as vectors and cells including these nucleic acids.

Thus, one aspect of the invention pertains to isolated HALO nucleic acid molecules that encode HALO proteins or biologically active portions thereof. Also included are nucleic acid fragments sufficient for use as hybridization probes to identify HALO-encoding nucleic acids (e. g., HALO mRNA) and fragments for use as polymerase chain reaction (PCR) primers for the amplification or mutation of HALO nucleic acid molecules. As used herein, the term"nucleic acid molecule"is intended to include DNA molecules (e. g., cDNA or genomic DNA), RNA molecules (e. g., mRNA), analogs of the DNA or RNA generated using nucleotide analogs, and derivatives, fragments and homologs thereof. The nucleic acid molecule can be single-stranded or double-stranded, but preferably is double-stranded DNA.

"Probes"refer to nucleic acid sequences of variable length, preferably between at least about 10 nucleotides (nt) or as many as about, e. g., 6, 000 nt, depending on use. Probes are used in the detection of identical, similar, or complementary nucleic acid sequences. Longer length probes are usually obtained from a natural or recombinant source, are highly specific and much slower to hybridize than oligomers. Probes may be single-or double-stranded and designed to have specificity in PCR, membrane-based hybridization technologies, or ELISA-like technologies.

An"isolated"nucleic acid molecule is one that is separated from other nucleic acid molecules which are present in the natural source of the nucleic acid. Examples of isolated nucleic acid molecules include, but are not limited to, recombinant DNA molecules contained in a vector, recombinant DNA molecules maintained in a heterologous host cell, partially or substantially purified nucleic acid molecules, and synthetic DNA or RNA molecules. Preferably, an"isolated"nucleic acid is free of sequences which naturally flank the nucleic acid (i. e., sequences located at the 5'and 3'ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. For example, in various embodiments, the isolated HALO nucleic acid molecule can contain less than about 50 kb, 25 kb, 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0. 5 kb or 0. 1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. Moreover, an"isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular material or culture medium when produced by recombinant techniques, or of chemical precursors or other chemicals when chemically synthesized.

A nucleic acid molecule of the present invention, e. g., a nucleic acid molecule having the nucleotide sequence of any of HALOS : I-19, or a complement of any of these nucleotide sequences, can be isolated using standard molecular biology techniques and the sequence information provided herein. Using all or a portion of these nucleic acid sequences as a hybridization probe, HALO nucleic acid sequences can be isolated using standard hybridization and cloning techniques (e. g., as described in Sambrook et al., eds., MOLECULAR CLONING : A LABORATORY MANUAL 2"d Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989 ; and Ausubel, et al., eds., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, NY, 1993.)

A nucleic acid of the invention can be amplified using cDNA, mRNA or alternatively, genomic DNA, as a template and appropriate oligonucleotide primers according to standard PCR amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. Furthermore, oligonucleotides corresponding to HALO nucleotide sequences can be prepared by standard synthetic techniques, e. g., using an automated DNA synthesizer.

As used herein, the term"oligonucleotide"refers to a series of linked nucleotide residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a PCR reaction.

A short oligonucleotide sequence may be based on, or designed from, a genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an identical, similar or complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise portions of a nucleic acid sequence having at least about 10 nt and as many as 50 nt, preferably about 15 nt to 30 nt. They may be chemically synthesized and may be used as probes.

In another embodiment, an isolated nucleic acid molecule of the invention comprises a nucleic acid molecule that is a complement of the nucleotide sequence shown in HALO1-20. In another embodiment, an isolated nucleic acid molecule of the invention comprises a nucleic acid molecule that is a complement of the nucleotide sequence shown in any of these sequences, or a portion of any of these nucleotide sequences. A nucleic acid molecule that is complementary to the nucleotide sequence shown in HALO 1-20 is one that is sufficiently complementary to the nucleotide sequence shown, such that it can hydrogen bond with little or no mismatches to the nucleotide sequences shown, thereby forming a stable duplex.

As used herein, the term"complementary"refers to Watson-Crick or Hoogsteen base pairing between nucleotides units of a nucleic acid molecule, and the term"binding"means the physical or chemical interaction between two polypeptides or compounds or associated polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, Von der Waals, hydrophobic interactions, etc. A physical interaction can be either direct or indirect.

Indirect interactions may be through or due to the effects of another polypeptide or compound.

Direct binding refers to interactions that do not take place through, or due to, the effect of another polypeptide or compound, but instead are without other substantial chemical intermediates.

Moreover, the nucleic acid molecule of the invention can comprise only a portion of the nucleic acid sequence of HALOs : I-19 or 20, e. g., a fragment that can be used as a probe or primer or a fragment encoding a biologically active portion of HALO. Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific hybridization in the case of nucleic acids or for specific recognition of an epitope in the case of amino acids, respectively, and are at most some portion less than a full length sequence. Fragments may be derived from any contiguous portion of a nucleic acid or amino acid sequence of choice. Derivatives are nucleic acid sequences or amino acid sequences formed from the native compounds either directly or by modification or partial substitution. Analogs are nucleic acid sequences or amino acid sequences that have a structure similar to, but not identical to, the native compound but differs from it in respect to certain components or side chains. Analogs may be synthetic or from a different evolutionary origin and may have a similar or opposite metabolic activity compared to wild type.

Derivatives and analogs may be full length or other than full length, if the derivative or analog contains a modified nucleic acid or amino acid, as described below. Derivatives or analogs of the nucleic acids or proteins of the invention include, but are not limited to, molecules comprising regions that are substantially homologous to the nucleic acids or proteins of the invention, in various embodiments, by at least about 45%, 50%, 70%, 80%, 95%, 98%, or even 99% identity (with a preferred identity of 80-99%) over a nucleic acid or amino acid sequence of identical size or when compared to an aligned sequence in which the alignment is done by a computer homology program known in the art, or whose encoding nucleic acid is capable of hybridizing to the complement of a sequence encoding the aforementioned proteins under stringent, moderately stringent, or low stringent conditions. See e. g. Ausubel, et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, NY, 1993, and below. An exemplary program is the Gap program (Wisconsin Sequence Analysis Package, Version 8 for UNIX, Genetics Computer Group, University Research Park, Madison, WI) using the default settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2 : 482-489, which in incorporated herein by reference in its entirety).

A"homologous nucleic acid sequence"or"homologous amino acid sequence,"or variations thereof, refer to sequences characterized by a homology at the nucleotide level or amino acid level as discussed above. Homologous nucleotide sequences encode those sequences

coding for isoforms of a HALO polypeptide. Isoforms can be expressed in different tissues of the same organism as a result of, for example, alternative splicing of RNA. Alternatively, isoforms can be encoded by different genes. In the present invention, homologous nucleotide sequences include nucleotide sequences encoding for a HALO polypeptide of species other than humans, including, but not limited to, mammals, and thus can include, e. g., mouse, rat, rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide sequences also include, but are not limited to, naturally occurring allelic variations and mutations of the nucleotide sequences set forth herein. A homologous nucleotide sequence does not, however, include the nucleotide sequence encoding a human HALO protein. Homologous nucleic acid sequences include those nucleic acid sequences that encode conservative amino acid substitutions (see below) in a HALO polypeptide, as well as a polypeptide having a HALO activity. A homologous amino acid sequence does not encode the amino acid sequence of a human HALO polypeptide.

The nucleotide sequence determined from the cloning of human HALO genes allows for the generation of probes and primers designed for use in identifying and/or cloning HALO homologues in other cell types, e. g., from other tissues, as well as HALO homologues from other mammals. The probe/primer typically comprises a substantially purified oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 or 400 consecutive sense strand nucleotide sequence of a nucleic acid comprising a HALO sequence, or an anti-sense strand nucleotide sequence of a nucleic acid comprising a HALO sequence. or of a naturally occurring mutant of these sequences.

Probes based on human HALO nucleotide sequences can be used to detect transcripts or genomic sequences encoding the same or homologous proteins. In various embodiments, the probe further comprises a label group attached thereto, e. g., the label group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes can be used as a part of a diagnostic test kit for identifying cells or tissue which misexpress a HALO protein, such as by measuring a level of a HALO-encoding nucleic acid in a sample of cells from a subject e. g., detecting HALO mRNA levels or determining whether a genomic HALO gene has been mutated or deleted.

"A polypeptide having a biologically active portion of HALO"refers to polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a polypeptide of the

present invention, including mature forms, as measured in a particular biological assay, with or without dose dependency. A nucleic acid fragment encoding a"biologically active portion of HALO"can be prepared by isolating a portion of HALO1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, or 47, that encodes a polypeptide having a HALO biological activity, expressing the encoded portion of HALO protein (e. g., by recombinant expression in vitro) and assessing the activity of the encoded portion of HALO. For example, a nucleic acid fragment encoding a biologically active portion of a HALO polypeptide can optionally include an ATP-binding domain. In another embodiment, a nucleic acid fragment encoding a biologically active portion of HALO includes one or more regions.

HALO variants The invention further encompasses nucleic acid molecules that differ from the disclosed or referenced HALO nucleotide sequences due to degeneracy of the genetic code. These nucleic acids thus encode the same HALO protein as that encoded by nucleotide sequence comprising a HALO nucleic acid as shown in, e. g., HALO1-19 or 20.

In addition to the human HALO nucleotide sequence shown in HALO1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, or 47, it will be appreciated by those skilled in the art that DNA sequence polymorphisms that lead to changes in the amino acid sequences of a HALO polypeptide may exist within a population (e. g., the human population).

Such genetic polymorphism in the HALO gene may exist among individuals within a population due to natural allelic variation. As used herein, the terms"gene"and"recombinant gene"refer to nucleic acid molecules comprising an open reading frame encoding a HALO protein, preferably a mammalian HALO protein. Such natural allelic variations can typically result in 1-5% variance in the nucleotide sequence of the HALO gene. Any and all such nucleotide variations and resulting amino acid polymorphisms in HALO that are the result of natural allelic variation and that do not alter the functional activity of HALO are intended to be within the scope of the invention.

Moreover, nucleic acid molecules encoding HALO proteins from other species, and thus that have a nucleotide sequence that differs from the human sequence of HALO 1-20, are intended to be within the scope of the invention. Nucleic acid molecules corresponding to natural allelic variants and homologues of the HALO DNAs of the invention can be isolated

based on their homology to the human HALO nucleic acids disclosed herein using the human cDNAs, or a portion thereof, as a hybridization probe according to standard hybridization techniques under stringent hybridization conditions. For example, a soluble human HALODNA can be isolated based on its homology to human membrane-bound HALO. Likewise, a membrane-bound human HALODNA can be isolated based on its homology to soluble human HALO.

Accordingly, in another embodiment, an isolated nucleic acid molecule of the invention is at least 6 nucleotides in length and hybridizes under stringent conditions to the nucleic acid molecule comprising the nucleotide sequence of HALOS : 1-19, or 20. In another embodiment, the nucleic acid is at least 10, 25, 50, 100, 250 or 500 nucleotides in length. In another embodiment, an isolated nucleic acid molecule of the invention hybridizes to the coding region.

As used herein, the term"hybridizes under stringent conditions"is intended to describe conditions for hybridization and washing under which nucleotide sequences at least 60% homologous to each other typically remain hybridized to each other.

Homologs (i. e., nucleic acids encoding HALO proteins derived from species other than human) or other related sequences (e. g., paralogs) can be obtained by low, moderate or high stringency hybridization with all or a portion of the particular human sequence as a probe using methods well known in the art for nucleic acid hybridization and cloning.

As used herein, the phrase"stringent hybridization conditions"refers to conditions under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no other sequences. Stringent conditions are sequence-dependent and will be different in different circumstances. Longer sequences hybridize specifically at higher temperatures than shorter sequences. Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength, pH and nucleic acid concentration) at which 50% of the probes complementary to the target sequence hybridize to the target sequence at equilibrium.

Since the target sequences are generally present at excess, at Tm, 50% of the probes are occupied at equilibrium. Typically, stringent conditions will be those in which the salt concentration is less than about 1. 0 M sodium ion, typically about 0. 01 to 1. 0 M sodium ion (or other salts) at pH 7. 0 to 8. 3 and the temperature is at least about 30°C for short probes, primers or oligonucleotides (e. g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers and oligonucleotides.

Stringent conditions may also be achieved with the addition of destabilizing agents, such as formamide.

Stringent conditions are known to those skilled in the art and can be found in CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, N. Y. (1989), 6. 3. 1-6. 3. 6. Preferably, the conditions are such that sequences at least about 65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically remain hybridized to each other. A non-limiting example of stringent hybridization conditions is hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCI (pH 7. 5), 1 mM EDTA, 0. 02% PVP, 0. 02% Ficoll, 0. 02% BSA, and 500 mg/ml denatured salmon sperm DNA at 65°C. This hybridization is followed by one or more washes in 0. 2X SSC, 0. 01% BSA at 50°C. An isolated nucleic acid molecule of the invention that hybridizes under stringent conditions to the sequence of HALOS : 1-19 or 20 corresponds to a naturally occurring nucleic acid molecule. As used herein, a"naturally-occurring"nucleic acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature (e. g., encodes a natural protein).

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic acid molecule comprising the nucleotide sequence of HALOS : 1-19 or 20, or fragments, analogs or derivatives thereof, under conditions of moderate stringency is provided. A non-limiting example of moderate stringency hybridization conditions are hybridization in 6X SSC, 5X Denhardt's solution, 0. 5% SDS and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more washes in 1X SSC, 0. 1% SDS at 37°C. Other conditions of moderate stringency that may be used are well known in the art. See, e. g., Ausubel et al. (eds.), 1993, CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, NY, and Kriegler, 1990, GENE TRANSFER AND EXPRESSION, A LABORATORY MANUAL, Stockton Press, NY.

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule comprising the nucleotide sequence of HALOS : I-19 or 20, or fragments, analogs or derivatives thereof, under conditions of low stringency, is provided. A non-limiting example of low stringency hybridization conditions are hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7. 5), 5 mM EDTA, 0. 02% PVP, 0. 02% Ficoll, 0. 2% BSA, 100 mg/ml denatured salmon sperm DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 25 mM Tris-HCl (pH 7. 4), 5 mM EDTA, and 0. 1% SDS at 50°C. Other conditions of low stringency that may be used are well known in the art (e. g., as employed for cross-species

hybridizations). See, e. g., Ausubel et al. (eds.), 1993, CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, NY, and Kriegler, 1990, GENE TRANSFER AND EXPRESSION, A LABORATORY MANUAL, Stockton Press, NY ; Shilo et al., 1981, Proc Natl Acad Sci USA 78 : 6789-6792.

Conservative mutations In addition to naturally-occurring allelic variants of the HALO sequence that may exist in the population, the skilled artisan will further appreciate that changes can be introduced into a HALO nucleic acid or directly into a HALO polypeptide sequence without altering the functional ability of the HALO protein. In some embodiments, the nucleotide sequence of HALOS : 1-19 or 30 will be altered, thereby leading to changes in the amino acid sequence of the encoded HALO protein. For example, nucleotide substitutions that result in amino acid substitutions at various "non-essential"amino acid residues can be made in the sequence of HALOS : I-19 or 20. A "non-essential"amino acid residue is a residue that can be altered from the wild-type sequence of HALO without altering the biological activity, whereas an"essential"amino acid residue is required for biological activity. For example, amino acid residues that are conserved among the HALO proteins of the present invention, are predicted to be particularly unamenable to alteration.

In addition, amino acid residues that are conserved among family members of the HALO proteins of the present invention, are also predicted to be particularly unamenable to alteration.

As such, these conserved domains are not likely to be amenable to mutation. Other amino acid residues, however, (e. g., those that are not conserved or only semi-conserved among members of the HALO proteins) may not be essential for activity and thus are likely to be amenable to alteration.

Another aspect of the invention pertains to nucleic acid molecules encoding HALO proteins that contain changes in amino acid residues that are not essential for activity. Such HALO proteins differ in amino acid sequence from the amino acid sequences of polypeptides encoded by nucleic acids containing HALOS : 1-19 or 20, yet retain biological activity. In one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the protein comprises an amino acid sequence at least about 45% homologous, more preferably 60%, and still more preferably at least about 70%, 80%, 90%, 95%, 98%, and

most preferably at least about 99% homologous to the amino acid sequence of the amino acid sequences of polypeptides encoded by nucleic acids comprising HALOS : 1-19 or 20.

An isolated nucleic acid molecule encoding a HALO protein homologous to can be created by introducing one or more nucleotide substitutions, additions or deletions into the nucleotide sequence of a nucleic acid comprising HALOS : 1-19 or 20, such that one or more amino acid substitutions, additions or deletions are introduced into the encoded protein.

Mutations can be introduced into a nucleic acid comprising HALOS : 1-19 or 20 by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis.

Preferably, conservative amino acid substitutions are made at one or more predicted non-essential amino acid residues. A"conservative amino acid substitution"is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e. g., lysine, arginine, histidine), acidic side chains (e. g., aspartic acid, glutamic acid), uncharged polar side chains (e. g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e. g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e. g., threonine, valine, isoleucine) and aromatic side chains (e. g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue in HALO is replaced with another amino acid residue from the same side chain family. Alternatively, in another embodiment, mutations can be introduced randomly along all or part of a HALO coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for HALO biological activity to identify mutants that retain activity. Following mutagenesis of the nucleic acid, the encoded protein can be expressed by any recombinant technology known in the art and the activity of the protein can be determined.

In one embodiment, a mutant HALO protein can be assayed for (1) the ability to form protein : protein interactions with other HALO proteins, other cell-surface proteins, or biologically active portions thereof, (2) complex formation between a mutant HALO protein and a HALO ligand ; (3) the ability of a mutant HALO protein to bind to an intracellular target protein or biologically active portion thereof ; (e. g., avidin proteins) ; (4) the ability to bind ATP ; or (5) the ability to specifically bind a HALO protein antibody.

In specific embodiments, the invention includes an isolated polynucleotide comprising a sequence chosen from the group consisting of a first sequence, the first sequence being 80% or more identical to a second sequence that encodes a polypeptide whose expression is modulated in a mammal to which haloperidol is administered ; a fragment of the first sequence ; a complementary polynucleotide sequence comprising a sequence complementary to the first sequence ; and a fragment of the complementary polynucleotide sequence.

Preferably, the second sequence is chosen from the group consisting of a polynucleotide encoding splice variant 5 of a human ortholog of murine quaking type I, a polynucleotide encoding splice variant 7 of a human ortholog of murine quaking type I, a sequence having at least 88% identity to the human KIAA0383 gene, a rat polynucleotide having at least 83% identity to a mouse polynucleotide encoding an EGF repeat transmembrane protein, a sequence having at least 80% identity to a polynucleotide encoding human suilisol, a sequence having at least 72% identity to rat repetitive ribosomal DNA II 3'to 45 S pre-rRNA, a sequence encoding a polypeptide having at least 70% amino acid identity to E. coli putative ATP-dependent RNA helicase RHLB, a sequence having at least 65% identity to a 5'region of human NOF1, a sequence having at least 91% identity to a polynucleotide encoding mouse phosphatidylinositol- 3-kinase 1 l OkD subunit, a sequence having at least 98% identity to a polynucleotide encoding mouse meis2 subfamily protein, and a sequence having at least 89% identity to a polynucleotide encoding a human rab5c-like protein, a polynucleotide encoding a polypeptide having at least 80% amino acid identity to human predicted protein DJ257A7. 1, a polynucleotide similar to rat repetitive ribosomal DNA II 3'to 45S pre-rRNA, fragment rOwO_85. 8, fragment rOjO_120. 6, fragment 10y0158. 5, fragment yOpO_314. 6, fragment mls0321. 4, and fragment gln0-114. 5.

In other embodiment, the second sequence is chosen from the group consisting of a polynucleotide encoding splice variant 5 of a human ortholog of murine quaking type I, a polynucleotide encoding splice variant 7 of a human ortholog of murine quaking type I, a sequence having at least 88% identity to the human KIAA0383 gene, a rat polynucleotide having at least 83% identity to a mouse polynucleotide encoding an EGF repeat transmembrane protein, a sequence having at least 80% identity to a polynucleotide encoding human suilisol, a sequence having at least 72% identity to rat repetitive ribosomal DNA II 3'to 45S pre-rRNA, a sequence encoding a polypeptide having at least 70% amino acid identity to E. coli putative ATP- dependent RNA helicase RHLB, and a sequence having at least 65% identity to a 5'region of human NOF 1.

In yet other embodiments, the second sequence is chosen from the group consisting of a polynucleotide encoding splice variant 5 of a human ortholog of murine quaking type I and a polynucleotide encoding splice variant 7 of a human ortholog of murine quaking type I.

Preferably, the fragment hybridizes to the sequence complementary to the first sequence.

In other embodiment, the fragment of the complementary polynucleotide sequence described in claim 1 wherein the fragment of the complementary polynucleotide sequence hybridizes to the first sequence.

In other specific embodiments, the nucleic acid is RNA or DNA. The fragment or the fragment of the complementary polynucleotide sequence described in claim 1, wherein the fragment is between about 10 and about 100 nucleotides in length, e. g., between about 10 and about 90 nucleotides in length, or about 10 and about 75 nucleotides in length, about 10 and about 50 bases in length, about 10 and about 40 bases in length, or about 15 and about 30 bases in length.

In specific embodiments, the invention includes an isolated polynucleotide comprising a sequence that encodes a polypeptide chosen from the group consisting of splice variant 5 of a human ortholog of murine quaking type I and splice variant 7 of a human ortholog of murine quaking type I.

Antisense Another aspect of the invention pertains to isolated antisense nucleic acid molecules that are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide sequence of a HALO sequence or fragments, analogs or derivatives thereof. An"antisense" nucleic acid comprises a nucleotide sequence that is complementary to a"sense"nucleic acid encoding a protein, e. g., complementary to the coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid molecules are provided that comprise a sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire HALO coding strand, or to only a portion thereof.

Nucleic acid molecules encoding fragments, homologs, derivatives and analogs of a HALO protein, or antisense nucleic acids complementary to a nucleic acid comprising a HALO nucleic acid sequence are additionally provided.

In one embodiment, an antisense nucleic acid molecule is antisense to a"coding region" of the coding strand of a nucleotide sequence encoding HALO. The term"coding region"refers to the region of the nucleotide sequence comprising codons which are translated into amino acid residues. In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region"of the coding strand of a nucleotide sequence encoding HALO. The term "noncoding region"refers to 5'and 3'sequences which flank the coding region that are not translated into amino acids (i. e., also referred to as 5'and 3'untranslated regions).

Given the coding strand sequences encoding HALO disclosed herein, antisense nucleic acids of the invention can be designed according to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary to the entire coding region of HALO mRNA, but more preferably is an oligonucleotide that is antisense to only a portion of the coding or noncoding region of HALO mRNA. For example, the antisense oligonucleotide can be complementary to the region surrounding the translation start site of HALO mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be constructed using chemical synthesis or enzymatic ligation reactions using procedures known in the art. For example, an antisense nucleic acid (e. g., an antisense oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e. g., phosphorothioate derivatives and acridine substituted nucleotides can be used.

Examples of modified nucleotides that can be used to generate the antisense nucleic acid include : 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- (carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2, 2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil,

uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3- (3-amino-3-N-2-carboxypropyl) uracil, (acp3) w, and 2, 6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically using an expression vector into which a nucleic acid has been subcloned in an antisense orientation (i. e., RNA transcribed from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described further in the following subsection).

The antisense nucleic acid molecules of the invention are typically administered to a subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or genomic DNA encoding a HALO protein to thereby inhibit expression of the protein, e. g., by inhibiting transcription and/or translation. The hybridization can be by conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in the major groove of the double helix. An example of a route of administration of antisense nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified to target selected cells and then administered systemically. For example, for systemic administration, antisense molecules can be modified such that they specifically bind to receptors or antigens expressed on a selected cell surface, e. g., by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using the vectors described herein.

To achieve sufficient intracellular concentrations of antisense molecules, vector constructs in which the antisense nucleic acid molecule is placed under the control of a strong pol II or pol III promoter are preferred.

In yet another embodiment, the antisense nucleic acid molecule of the invention is an a-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res 15 : 6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res 15 : 6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett 215 : 327-330).

Ribozymes and PNA moieties In still another embodiment, an antisense nucleic acid of the invention is a ribozyme.

Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a complementary region.

Thus, ribozymes (e. g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) Nature 334 : 585-591)) can be used to catalytically cleave HALO mRNA transcripts to thereby inhibit translation of HALO mRNA. A ribozyme having specificity for a HALO-encoding nucleic acid can be designed based upon the nucleotide sequence of a HALO DNA disclosed herein. For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the active site is complementary to the nucleotide sequence to be cleaved in a HALO-encoding mRNA. See, e. g., Cech et al. U. S. Pat. No. 4, 987, 071 ; and Cech et al. U. S. Pat. No. 5, 116, 742. Alternatively, HALO mRNA can be used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e. g., Bartel et al., (1993) Science 261 : 1411-1418.

Alternatively, HALO gene expression can be inhibited by targeting nucleotide sequences complementary to the regulatory region of a HALO nucleic acid (e. g., the HALO promoter and/or enhancers) to form triple helical structures that prevent transcription of the HALO gene in target cells. See generally, Helene. (1991) Anticancer Drug Des. 6 : 569-84 ; Helene. et al. (1992) Ann. N. Y. Acad. Sci. 660 : 27-36 ; and Maher (1992) Bioassays 14 : 807-15.

In various embodiments, the nucleic acids of HALO can be modified at the base moiety, sugar moiety or phosphate backbone to improve, e. g., the stability, hybridization, or solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) Bioorg Med Chem 4 : 5-23).

As used herein, the terms"peptide nucleic acids"or"PNAs"refer to nucleic acid mimics, e. g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under conditions of low ionic strength. The synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above ; Perry-O'Keefe et al. (1996) PNAS 93 : 14670-675.

PNAs of HALO can be used in therapeutic and diagnostic applications. For example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene expression by, e. g., inducing transcription or translation arrest or inhibiting replication. PNAs of HALO can also be used, e. g., in the analysis of single base pair mutations in a gene by, e. g., PNA directed PCR clamping ; as artificial restriction enzymes when used in combination with other enzymes, e. g., S1 nucleases (Hyrup B. (1996) above) ; or as probes or primers for DNA sequence and hybridization (Hyrup et al. (1996), above ; Perry-O'Keefe (1996), above).

In another embodiment, PNAs of HALO can be modified, e. g., to enhance their stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery known in the art. For example, PNA-DNA chimeras of HALO can be generated that may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes, e. g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, number of bonds between the nucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras can be performed as described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24 : 3357-63. For example, a DNA chain can be synthesized on a solid support using standard. phosphoramidite coupling chemistry, and modified nucleoside analogs, e. g., 5'- (4-methoxytrityl) amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA and the 5'end of DNA (Mag et al. (1989) Nucl Acid Res 17 : 5973-88). PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5'PNA segment and a 3' DNA segment (Finn et al. (1996) above). Alternatively, chimeric molecules can be synthesized with a 5'DNA segment and a 3'PNA segment. See, Petersen et al. (1975) Bioorg Med Chem Lett 5 : 1119-11124.

In other embodiments, the oligonucleotide may include other appended groups such as peptides (e. g., for targeting host cell receptors in vivo), or agents facilitating transport across the cell membrane (see, e. g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U. S. A. 86 : 6553-6556 ; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84 : 648-652 ; PCT Publication No. W088/09810) or the blood-brain barrier (see, e. g., PCT Publication No. W089/10134). In addition, oligonucleotides can be modified with hybridization triggered cleavage agents (See, e. g., Krol et

al., 1988, BioTechniques 6 : 958-976) or intercalating agents. (See, e. g., Zon, 1988, Pharm. Res.

5 : 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e. g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, etc.

HALO polypeptides One aspect of the invention pertains to isolated HALO proteins, and biologically active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided are polypeptide fragments suitable for use as immunogens to raise anti-HALO antibodies. In one embodiment, native HALO proteins can be isolated from cells or tissue sources by an appropriate purification scheme using standard protein purification techniques. In another embodiment, HALO proteins are produced by recombinant DNA techniques. Alternative to recombinant expression, a HALO protein or polypeptide can be synthesized chemically using standard peptide synthesis techniques.

An"isolated"or"purified"protein or biologically active portion thereof is substantially free of cellular material or other contaminating proteins from the cell or tissue source from which the HALO protein is derived, or substantially free from chemical precursors or other chemicals when chemically synthesized. The language"substantially free of cellular material"includes preparations of HALO protein in which the protein is separated from cellular components of the cells from which it is isolated or recombinantly produced. In one embodiment, the language "substantially free of cellular material"includes preparations of HALO protein having less than about 30% (by dry weight) of non-HALO protein (also referred to herein as a"contaminating protein"), more preferably less than about 20% of non-HALO protein, still more preferably less than about 10% of non-HALO protein, and most preferably less than about 5% non-HALO protein. When the HALO protein or biologically active portion thereof is recombinantly produced, it is also preferably substantially free of culture medium, i. e., culture medium represents less than about 20%, more preferably less than about 10%, and most preferably less than about 5% of the volume of the protein preparation.

The language"substantially free of chemical precursors or other chemicals"includes preparations of HALO protein in which the protein is separated from chemical precursors or other chemicals that are involved in the synthesis of the protein. In one embodiment, the

language"substantially free of chemical precursors or other chemicals"includes preparations of HALO protein having less than about 30% (by dry weight) of chemical precursors or non-HALO chemicals, more preferably less than about 20% chemical precursors or non-HALO chemicals, still more preferably less than about 10% chemical precursors or non-HALO chemicals, and most preferably less than about 5% chemical precursors or non-HALO chemicals.

Biologically active portions of a HALO protein include peptides comprising amino acid sequences sufficiently homologous to or derived from the amino acid sequence of the HALO protein, e. g., the amino acid sequence encoded by a nucleic acid comprising HALO 1-20 that include fewer amino acids than the full length HALO proteins, and exhibit at least one activity of a HALO protein. Typically, biologically active portions comprise a domain or motif with at least one activity of the HALO protein. A biologically active portion of a HALO protein can be a polypeptide which is. for example, 10. 25, 50, 100 or more amino acids in length.

A biologically active portion of a HALO protein of the present invention may contain at least one of the above-identified domains conserved between the HALO proteins. An alternative biologically active portion of a HALO protein may contain at least two of the above-identified domains. Another biologically active portion of a HALO protein may contain at least three of the above-identified domains. Yet another biologically active portion of a HALO protein of the present invention may contain at least four of the above-identified domains.

Moreover, other biologically active portions, in which other regions of the protein are deleted, can be prepared by recombinant techniques and evaluated for one or more of the functional activities of a native HALO protein.

In some embodiments, the HALO protein is substantially homologous to one of these HALO proteins and retains its the functional activity, yet differs in amino acid sequence due to natural allelic variation or mutagenesis, as described in detail below.

In specific embodiments, the invention includes an isolated polypeptide comprising an amino acid sequence that is 80% or more identical to the sequence of a polypeptide whose expression is modulated in a mammal to which haloperidol is administered.

For example, in some embodiments, the polypeptide is expressed by splice variant 5 of a human ortholog of murine quaking type I or the polypeptide expressed by splice variant 7 of a human ortholog of murine quaking type I.

Determining homology between two or more sequences To determine the percent homology of two amino acid sequences or of two nucleic acids, the sequences are aligned for optimal comparison purposes (e. g., gaps can be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a second amino or nucleic acid sequence). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are homologous at that position (i. e., as used herein amino acid or nucleic acid"homology"is equivalent to amino acid or nucleic acid"identity").

The nucleic acid sequence homology may be determined as the degree of identity between two sequences. The homology may be determined using computer programs known in the art, such as GAP software provided in the GCG program package. See Needleman and Wunsch 1970 JMol Biol 48 : 443-453. Using GCG GAP software with the following settings for nucleic acid sequence comparison : GAP creation penalty of 5. 0 and GAP extension penalty of 0. 3, the coding region of the analogous nucleic acid sequences referred to above exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with the CDS (encoding) part of a DNA sequence comprising HALOS : 1-19 or 20.

The term"sequence identity"refers to the degree to which two polynucleotide or polypeptide sequences are identical on a residue-by-residue basis over a particular region of comparison. The term"percentage of sequence identity"is calculated by comparing two optimally aligned sequences over that region of comparison. determining the number of positions at which the identical nucleic acid base (e. g., A, T, C, G, U, or I, in the case of nucleic acids) occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the region of comparison (i. e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity. The term "substantial identity as used herein denotes a characteristic of a polynucleotide sequence. wherein the polynucleotide comprises a sequence that has at least 80 percent sequence identity, preferably at least 85 percent identity and often 90 to 95 percent sequence identity, more usually at least 99 percent sequence identity as compared to a reference sequence over a comparison region.

Chimeric and fusion proteins The invention also provides HALO chimeric or fusion proteins. As used herein, a HALO "chimeric protein"or"fusion protein"comprises a HALO polypeptide operatively linked to a non-HALO polypeptide. A"HALO polypeptide"refers to a polypeptide having an amino acid sequence corresponding to HALO, whereas a"non-HALO polypeptide"refers to a polypeptide having an amino acid sequence corresponding to a protein that is not substantially homologous to the HALO protein, e. g., a protein that is different from the HALO protein and that is derived from the same or a different organism. Within a HALO fusion protein the HALO polypeptide can correspond to all or a portion of a HALO protein. In one embodiment, a HALO fusion protein comprises at least one biologically active portion of a HALO protein. In another embodiment, a HALO fusion protein comprises at least two biologically active portions of a HALO protein. In yet another embodiment, a HALO fusion protein comprises at least three biologically active portions of a HALO protein. Within the fusion protein, the term"operatively linked"is intended to indicate that the HALO polypeptide and the non-HALO polypeptide are fused in-frame to each other. The non-HALO polypeptide can be fused to the N-terminus or C-terminus of the HALO polypeptide.

For example, in one embodiment a HALO fusion protein comprises a HALO domain operably linked to the extracellular domain of a second protein. Such fusion proteins can be further utilized in screening assays for compounds which modulate HALO activity (such assays are described in detail below).

In yet another embodiment, the fusion protein is a GST-HALO fusion protein in which the HALO sequences are fused to the C-terminus of the GST (i. e., glutathione S-transferase) sequences. Such fusion proteins can facilitate the purification of recombinant HALO.

In another embodiment, the fusion protein is a HALO protein containing a heterologous signal sequence at its N-terminus. For example, a native HALO signal sequence can be removed and replaced with a signal sequence from another protein. In certain host cells (e. g., mammalian host cells), expression and/or secretion of HALO can be increased through use of a heterologous signal sequence.

In yet another embodiment, the fusion protein is a HALO-immunoglobulin fusion protein in which the HALO sequences comprising one or more domains are fused to sequences derived

from a member of the immunoglobulin protein family. The HALO-immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an interaction between a HALO ligand and a HALO protein on the surface of a cell, to thereby suppress HALO-mediated signal transduction in vivo. The HALO-immunoglobulin fusion proteins can be used to affect the bioavailability of a HALO cognate ligand. Inhibition of the HALO ligand/HALO interaction may be useful therapeutically for both the treatment of proliferative and differentiative disorders, as well as modulating (e. g. promoting or inhibiting) cell survival. Moreover, the HALO-immunoglobulin fusion proteins of the invention can be used as immunogens to produce anti-HALO antibodies in a subject, to purify HALO ligands, and in screening assays to identify molecules that inhibit the interaction of HALO with a HALO ligand.

A HALO chimeric or fusion protein of the invention can be produced by standard recombinant DNA techniques. For example, DNA fragments coding for the different polypeptide sequences are ligated together in-frame in accordance with conventional techniques, e. g., by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using anchor primers that give rise to complementary overhangs between two consecutive gene fragments that can subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for example, Ausubel et al. (eds.) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially available that already encode a fusion moiety (e. g., a GST polypeptide). A HALO-encoding nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the HALO protein.

HALO agonists and antagonists The present invention also pertains to variants of the HALO proteins that function as either HALO agonists (mimetics) or as HALO antagonists. Variants of the HALO protein can be

generated by mutagenesis, e. g., discrete point mutation or truncation of the HALO protein. An agonist of the HALO protein can retain substantially the same, or a subset of. the biological activities of the naturally occurring form of the HALO protein. An antagonist of the HALO protein can inhibit one or more of the activities of the naturally occurring form of the HALO protein by, for example, competitively binding to a downstream or upstream member of a cellular signaling cascade which includes the HALO protein. Thus, specific biological effects can be elicited by treatment with a variant of limited function. In one embodiment, treatment of a subject with a variant having a subset of the biological activities of the naturally occurring form of the protein has fewer side effects in a subject relative to treatment with the naturally occurring form of the HALO proteins.

Variants of the HALO protein that function as either HALO agonists (mimetics) or as HALO antagonists can be identified by screening combinatorial libraries of mutants, e. g., truncation mutants, of the HALO protein for HALO protein agonist or antagonist activity. In one embodiment, a variegated library of HALO variants is generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene library. A variegated library of HALO variants can be produced by, for example, enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences such that a degenerate set of potential HALO sequences is expressible as individual polypeptides, or alternatively, as a set of larger fusion proteins (e. g., for phage display) containing the set of HALO sequences therein. There are a variety of methods which can be used to produce libraries of potential HALO variants from a degenerate oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an appropriate expression vector. Use of a degenerate set of genes allows for the provision, in one mixture, of all of the sequences encoding the desired set of potential HALO sequences. Methods for synthesizing degenerate oligonucleotides are known in the art (see, e. g., Narang (1983) Tetrahedron 39 : 3 ; Itakura et al. (1984) Annu Rev Biochem 53 : 323 ; Itakura et al. (1984) Science 198 : 1056 ; Ike et al. (1983) Nucl Acid Res 11 : 477.

Polypeptide libraries In addition, libraries of fragments of the HALO protein coding sequence can be used to generate a variegated population of HALO fragments for screening and subsequent selection of variants of a HALO protein. In one embodiment, a library of coding sequence fragments can be

generated by treating a double stranded PCR fragment of a HALO coding sequence with a nuclease under conditions wherein nicking occurs only about once per molecule, denaturing the double stranded DNA, renaturing the DNA to form double stranded DNA that can include sense/antisense pairs from different nicked products, removing single stranded portions from reformed duplexes by treatment with S l nuclease, and ligating the resulting fragment library into an expression vector. By this method, an expression library can be derived which encodes N-terminal and internal fragments of various sizes of the HALO protein.

Several techniques are known in the art for screening gene products of combinatorial libraries made by point mutations or truncation, and for screening cDNA libraries for gene products having a selected property. Such techniques are adaptable for rapid screening of the gene libraries generated by the combinatorial mutagenesis of HALO proteins. The most widely used techniques, which are amenable to high throughput analysis, for screening large gene libraries typically include cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the combinatorial genes under conditions in which detection of a desired activity facilitates isolation of the vector encoding the gene whose product was detected. Recursive ensemble mutagenesis (REM), a new technique that enhances the frequency of functional mutants in the libraries, can be used in combination with the screening assays to identify HALO variants (Arkin and Yourvan (1992) PNAS 89 : 7811-7815 ; Delgrave et al. (1993) Protein Engineering 6 : 327-331).

Anti-HALO Antibodies An isolated HALO protein. or a portion or fragment thereof, can be used as an immunogen to generate antibodies that bind HALO using standard techniques for polyclonal and monoclonal antibody preparation. The full-length HALO protein can be used or, alternatively, the invention provides antigenic peptide fragments of HALO for use as immunogens. The antigenic peptide of HALO comprises at least 8 amino acid residues of the amino acid sequence encoded by a nucleic acid comprising the nucleic acid sequence shown in HALOS : 1-19 or 20 and encompasses an epitope of HALO such that an antibody raised against the peptide forms a specific immune complex with H, 4LO. Preferably, the antigenic peptide comprises at least 10 amino acid residues, more preferably at least 15 amino acid residues, even more preferably at least 20 amino acid residues, and most preferably at least 30 amino acid residues. Preferred

epitopes encompassed by the antigenic peptide are regions of HALO that are located on the surface of the protein, e. g., hydrophilic regions.

HALO polypeptides or derivatives, fragments, analogs or homologs thereof, may be utilized as immunogens in the generation of antibodies that immunospecifically-bind these protein components. The term"antibody"as used herein refers to immunoglobulin molecules and immunologically active portions of immunoglobulin molecules, i. e., molecules that contain an antigen binding site that specifically binds (immunoreacts with) an antigen. Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab and F (ab) 2 fragments, and an Fab expression library. Various procedures known within the art may be used for the production of polyclonal or monoclonal antibodies to a HALO protein sequence, or derivatives, fragments, analogs or homologs thereof. Some of these proteins are discussed below.

For the production of polyclonal antibodies, various suitable host animals (e. g., rabbit, goat, mouse or other mammal) may be immunized by injection with the native protein, or a synthetic variant thereof, or a derivative of the foregoing. An appropriate immunogenic preparation can contain, for example, recombinantly expressed HALO protein or a chemically synthesized HALO polypeptide. The preparation can further include an adjuvant. Various adjuvants used to increase the immunological response include, but are not limited to, Freund's (complete and incomplete), mineral gels (e. g., aluminum hydroxide), surface active substances (e. g., lysolecithin, pluronic polyols, polyanions, peptides. oil emulsions, dinitrophenol, etc.), human adjuvants such as Bacille Calmette-Guerin and Corynebacterium parvum, or similar immunostimulatory agents. If desired, the antibody molecules directed against HALO can be isolated from the mammal (e. g., from the blood) and further purified by well known techniques, such as protein A chromatography to obtain the IgG fraction.

The term"monoclonal antibody"or"monoclonal antibody composition", as used herein, refers to a population of antibody molecules that contain only one species of an antigen binding site capable of immunoreacting with a particular epitope of HALO. A monoclonal antibody composition thus typically displays a single binding affinity for a particular HALO protein with which it immunoreacts. For preparation of monoclonal antibodies directed towards a particular HALO protein, or derivatives, fragments, analogs or homologs thereof, any technique that provides for the production of antibody molecules by continuous cell line culture may be

utilized. Such techniques include, but are not limited to, the hybridoma technique (see Kohler & Milstein, 1975 Nature 256 : 495-497) ; the trioma technique ; the human B-cell hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4 : 72) and the EBV hybridoma technique to produce human monoclonal antibodies (see Cole, et al., 1985 In : MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96). Human monoclonal antibodies may be utilized in the practice of the present invention and may be produced by using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80 : 2026-2030) or by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In : MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96).

According to the invention, techniques can be adapted for the production of single-chain antibodies specific to a HALO protein (see e. g., U. S. Patent No. 4, 946, 778). In addition, methods can be adapted for the construction of Fab expression libraries (see e. g., Huse, et al., 1989 Science 246 : 1275-1281) to allow rapid and effective identification of monoclonal Fab fragments with the desired specificity for a HALO protein or derivatives, fragments, analogs or homologs thereof. Non-human antibodies can be"humanized"by techniques well known in the art. See e. g., U. S. Patent No. 5, 225, 539. Antibody fragments that contain the idiotypes to a HALO protein may be produced by techniques known in the art including, but not limited to : (i) an F (ab') 2 fragment produced by pepsin digestion of an antibody molecule ; (ii) an Fab fragment generated by reducing the disulfide bridges of an F (ab) 2 fragment ; (iii) an Fab fragment generated by the treatment of the antibody molecule with papain and a reducing agent and (iv) F fragments.

Additionally, recombinant anti-HALO antibodies, such as chimeric and humanized monoclonal antibodies, comprising both human and non-human portions, which can be made using standard recombinant DNA techniques, are within the scope of the invention. Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA techniques known in the art, for example using methods described in PCT International Application No. PCT/US86/02269 ; European Patent Application No. 184, 187 ; European Patent Application No. 171, 496 ; European Patent Application No. 173, 494 ; PCT International Publication No. WO 86/01533 ; U. S. Pat. No. 4, 816, 567 ; European Patent Application No.

125, 023 ; Better et al. (1988) Science 240 : 1041-1043 ; Liu et al. (1987) PNAS 84 : 3439-3443 ; Liu et al. (1987) Jlmmunol. 139 : 3521-3526 ; Sun et al. (1987) PNAS 84 : 214-218 ; Nishimura et al.

(1987) Cancer Res 47 : 999-1005 ; Wood et al. (1985) Nature 314 : 446-449 ; Shaw et al. (1988) J Natl Cancer Inst. 80 : 1553-1559) ; Morrison (1985) Science 229 : 1202-1207 ; Oi et al. (1986) BioTechniques 4 : 214 ; U. S. Pat. No. 5, 225, 539 ; Jones et al. (1986) Nature 321 : 552-525 ; Verhoeyan et al. (1988) Science 239 : 1534 ; and Beidler et al. (1988) J Immunol 141 : 4053-4060.

In one embodiment. methods for the screening of antibodies that possess the desired specificity include, but are not limited to, enzyme-linked immunosorbent assay (ELISA) and other immunologically-mediated techniques known within the art. In a specific embodiment, selection of antibodies that are specific to a particular domain of a HALO protein is facilitated by generation of hybridomas that bind to the fragment of a HALO protein possessing such a domain. Antibodies that are specific for one or more domains within a HALO protein, e. g., domains spanning the above-identified conserved regions of HALO family proteins, or derivatives, fragments, analogs or homologs thereof, are also provided herein.

Anti-HALO antibodies may be used in methods known within the art relating to the localization and/or quantitation of a HALO protein (e. g., for use in measuring levels of the HALO protein within appropriate physiological samples, for use in diagnostic methods, for use in imaging the protein, and the like). In a given embodiment, antibodies for HALO proteins, or derivatives, fragments, analogs or homologs thereof, that contain the antibody derived binding domain, are utilized as pharmacologically-active compounds [hereinafter"Therapeutics"].

An anti-HALO antibody (e. g., monoclonal antibody) can be used to isolate HALO by standard techniques, such as affinity chromatography or immunoprecipitation. An anti-HALO antibody can facilitate the purification of natural HALO from cells and of recombinantly produced HALO expressed in host cells. Moreover, an anti-HALO antibody can be used to detect HALO protein (e. g., in a cellular lysate or cell supernatant) in order to evaluate the abundance and pattern of expression of the HALO protein. Anti-HALO antibodies can be used diagnostically to monitor protein levels in tissue as part of a clinical testing procedure, e. g., to, for example, determine the efficacy of a given treatment regimen. Detection can be facilitated by coupling (i. e., physically linking) the antibody to a detectable substance. Examples of detectable substances include various enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, ß-galactosidase, or acetylcholinesterase ; examples of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin ;

examples of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein. dansyl chloride or phycoerythrin ; an example of a luminescent material includes luminol ; examples of bioluminescent materials include luciferase, luciferin, and aequorin, and examples of suitable radioactive material include ''5I'3'I 35S or 3H.

HALO Recombinant Expression Vectors and Host Cells Another aspect of the invention pertains to vectors, preferably expression vectors, containing a nucleic acid encoding HALO protein, or derivatives, fragments, analogs or homologs thereof. As used herein, the term"vector"refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a"plasmid", which refers to a linear or circular double stranded DNA loop into which additional DNA segments can be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e. g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e. g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as"expression vectors". In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present specification, "plasmid"and"vector"can be used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms of expression vectors, such as viral vectors (e. g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions.

The recombinant expression vectors of the invention comprise a nucleic acid of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory sequences, selected on the basis of the host cells to be used for expression, that is operatively linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector,"operably linked"is intended to mean that the nucleotide sequence of interest is linked to the regulatory sequence (s) in a manner that allows for expression of the nucleotide sequence (e. g., in an in vitro transcription/translation

system or in a host cell when the vector is introduced into the host cell). The term"regulatory sequence"is intended to includes promoters, enhancers and other expression control elements (e. g., polyadenylation signals). Such regulatory sequences are described, for example, in Goeddel ; GENE EXPRESSION TECHNOLOGY : METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e. g., tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of protein desired, etc. The expression vectors of the invention can be introduced into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as described herein (e. g., HALO proteins, mutant forms of HALO, fusion proteins, etc.).

The recombinant expression vectors of the invention can be designed for expression of HALO in prokaryotic or eukaryotic cells. For example, HALO can be expressed in bacterial cells such as E. coli, insect cells (using baculovirus expression vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, GENE EXPRESSION TECHNOLOGY : METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase.

Expression of proteins in prokaryotes is most often carried out in E. coli with vectors containing constitutive or inducible promoters directing the expression of either fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, usually to the amino terminus of the recombinant protein. Such fusion vectors typically serve three purposes : (1) to increase expression of recombinant protein ; (2) to increase the solubility of the recombinant protein ; and (3) to aid in the purification of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the recombinant protein to enable separation of the recombinant protein from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa. thrombin and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc ; Smith and Johnson (1988) Gene 67 : 31-40), pMAL (New England Biolabs. Beverly, Mass.) and

pRIT5 (Pharmacia, Piscataway, N. J.) that fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the target recombinant protein.

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amrann et al., (1988) Gene 69 : 301-315) and pET 1 ld (Studier et al., GENE EXPRESSION TECHNOLOGY : METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990) 60-89).

One strategy to maximize recombinant protein expression in E. coli is to express the protein in a host bacteria with an impaired capacity to proteolytically cleave the recombinant protein. See, Gottesman, GENE EXPRESSION TECHNOLOGY : METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990) 119-128. Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the individual codons for each amino acid are those preferentially utilized in E. coli (Wada et al., (1992) Nucleic Acids Res. 20 : 2111-2118). Such alteration of nucleic acid sequences of the invention can be carried out by standard DNA synthesis techniques.

In another embodiment, the HALO expression vector is a yeast expression vector.

Examples of vectors for expression in yeast S. cerevisiae include pYepSecl (Baldari, et al., (1987) EMBO J 6 : 229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30 : 933-943), pJRY88 (Schultz et al., (1987) Gene 54 : 113-123), pYES2 (Invitrogen Corporation, San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.).

Alternatively, HALO can be expressed in insect cells using baculovirus expression vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e. g., SF9 cells) include the pAc series (Smith et al. (1983) Mol Cell Biol 3 : 2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170 : 31-39).

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian cells using a mammalian expression vector. Examples of mammalian expression vectors include pCDM8 (Seed (1987) Nature 329 : 840) and pMT2PC (Kaufman et al. (1987) EMBO J 6 : 187-195). When used in mammalian cells, the expression vector's control functions are often provided by viral regulatory elements. For example, commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40. For other suitable expression systems for both prokaryotic and eukaryotic cells. See, e. g., Chapters 16 and 17 of Sambrook et

al.. MOLECULAR CLONING : A LABORATORY MANUAL. 2nd ed.. Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press. Cold Spring Harbor, N. Y., 1989.

In another embodiment, the recombinant mammalian expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e. g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific promoters include the albumin promoter (liver-specific ; Pinkert et al. (1987) Genes Dev 1 : 268-277), lymphoid-specific promoters (Calame and Eaton (1988) Adv Immunol 43 : 235-275), in particular promoters of T cell receptors (Winoto and Baltimore (1989) EMBO J 8 : 729-733) and immunoglobulins (Banerji et ~1. (1983) Cell 33 : 729-740 ; Queen and Baltimore (1983) Cell 33 : 741-748), neuron-specific promoters (e. g., the neurofilament promoter ; Byrne and Ruddle (1989) PNAS 86 : 5473-5477), pancreas-specific promoters (Edlund et crl. (1985), Science 230 : 912-916), and mammary gland-specific promoters (e. g., milk whey promoter ; U. S. Pat. No. 4, 873, 316 and European Application Publication No. 264, 166). Developmentally-regulated promoters are also encompassed, e. g., the murine hox promoters (Kessel and Gruss (1990) Science 249 : 374-379) and the a-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev 3 : 537-546).

The invention further provides a recombinant expression vector comprising a DNA molecule of the invention cloned into the expression vector in an antisense orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in a manner that allows for expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to HALO mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the antisense orientation can be chosen that direct the continuous expression of the antisense RNA molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory sequences can be chosen that direct constitutive. tissue specific or cell type specific expression of antisense RNA. The antisense expression vector can be in the form of a recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the control of a high efficiency regulatory region. the activity of which can be determined by the cell type into which the vector is introduced. For a discussion of the regulation of gene expression using antisense genes see Weintraub et al.,"Antisense RNA as a molecular tool for genetic analysis." Reviews--Trends in Genetics, Vol. 1 (1) 1986.

Another aspect of the invention pertains to host cells into which a recombinant expression vector of the invention has been introduced. The terms"host cell"and"recombinant host cell" are used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell. but are still included within the scope of the term as used herein.

A host cell can be any prokaryotic or eukaryotic cell. For example, HALO protein can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those skilled in the art.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional transformation or transfection techniques. As used herein, the terms"transformation"and "transfection"are intended to refer to a variety of art-recognized techniques for introducing foreign nucleic acid (e. g., DNA) into a host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation.

Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al.

(MOLECULAR CLONING : A LABORATORY MANUAL. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., 1989), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the expression vector and transfection technique used, only a small fraction of cells may integrate the foreign DNA into their genome. In order to identify and select these integrants, a gene that encodes a selectable marker (e. g., resistance to antibiotics) is generally introduced into the host cells along with the gene of interest. Various selectable markers include those that confer resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host cell on the same vector as that encoding HALO or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic acid can be identified by drug selection (e. g., cells that have incorporated the selectable marker gene will survive, while the other cells die).

A host cell of the invention. such as a prokaryotic or eukaryotic host cell in culture, can be used to produce (i. e., express) an HALO protein. Accordingly, the invention further provides methods for producing HALO protein using the host cells of the invention. In one embodiment, the method comprises culturing the host cell of invention (into which a recombinant expression vector encoding HALO has been introduced) in a suitable medium such that HALO protein is produced. In another embodiment, the method further comprises isolating HALO from the medium or the host cell.

Kits and Nucleic Acid Collections for Identifying Psychotropic Agents or Movement Disorders In another aspect, the invention provides a kit useful for examining a pathophysiology associated with a PPARa-mediated pathway. The kit can include nucleic acids that detect two or more HALO sequences. In preferred embodiments, the kit includes reagents which detect 3, 4, 5, 6, 8, 10, 12, 15, 20, 25, 30, 35, or all of the HALOX nucleic acid sequences.

The invention also includes an isolated plurality of sequences which can identify one or more HALOX responsive nucleic acid sequences.

The kit or plurality may include, e. g., sequence homologous to HALOX nucleic acid sequences, or sequences which can specifically identify one or more HALOX nucleic acid sequences.

Single Nucleotide Polymorphisms associated with HALOX Genes The invention also provides nucleic acid sequences nucleic acids containing polymorphisms associated with HALOX-responsive genes. The term"polymorphism"in this context refers to the occurrence of two or more genetically determined alternative sequences or alleles in a population. A polymorphic marker or site is the locus at which divergence occurs.

Preferred markers have at least two alleles, each occurring at frequency of greater than 1%. and more preferably greater than 10% or 20% of a selected population. A polymorphic locus may be as small as one base pair. Polymorphic markers include restriction fragment length polymorphisms. variable number of tandem repeats (VNTR's), hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide repeats, tetranucleotide repeats, simple sequence repeats, and insertion elements such as Alu. The first identified allelic form is arbitrarily designated as a the reference form and other allelic forms are designated as alternative or variant alleles. The allelic form occurring most frequently in a selected population is sometimes referred to as the wild type form. Diploid organisms may be homozygous or

heterozygous for allelic forms. A diallelic polymorphism has two forms. A triallelic polymorphism has three forms.

A single nucleotide polymorphism occurs at a polymorphic site occupied by a single nucleotide, which is the site of variation between allelic sequences. The site is usually preceded by and followed by highly conserved sequences of the allele (e. g., sequences that vary in less than 1/100 or 1/1000 members of the populations). A single nucleotide polymorphism usually arises due to substitution of one nucleotide for another at the polymorphic site. A transition is the replacement of one purine by another purine or one pyrimidine by another pyrimidine. A transversion is the replacement of a purine by a pyrimidine or vice versa. Single nucleotide polymorphisms can also arise from a deletion of a nucleotide or an insertion of a nucleotide relative to a reference allele.

Hybridizations are usually performed under stringent conditions, for example, at a salt concentration of no more than 1 M and a temperature of at least 25. degree. C. For example, conditions of 5X SSPE (750 mM NaCl, 50 mM NaPhosphate, 5 mM EDTA, pH 7. 4) and a temperature of 25. degree.-30. degree. C. are suitable for allele-specific probe hybridizations.

An isolated nucleic acid means an object species invention) that is the predominant species present (i. e., on a molar basis it is more abundant than any other individual species in the composition). Preferably, an isolated nucleic acid comprises at least about 50, 80 or 90 percent (on a molar basis) of all macromolecular species present. Most preferably, the object species is purified to essential homogeneity (contaminant species cannot be detected in the composition by conventional detection methods).

Polymorphisms are detected in a target nucleic acid from an individual being analyzed.

For assay of genomic DNA, virtually any biological sample (other than pure red blood cells) is suitable. For example, convenient tissue samples include whole blood. semen, saliva, tears, urine, fecal material, sweat, buccal, skin and hair. For assay of cDNA or mRNA, the tissue sample must be obtained from an organ in which the target nucleic acid is expressed. Many of the methods described below require amplification of DNA from target samples. This can be accomplished by e. g., PCR. See generally, PCR Technology : Principles and Applications for DNA Amplification (ed. H. A. Erlich, Freeman Press, N. Y., N. Y.. 1992) ; PCR Protocols : A Guide to Methods and Applications (eds. Innis, et al..., Academic Press, San Diego, Calif., 1990) ; Mattila et al..., Nucleic Acids Res. 19, 4967 (1991) ; Eckert et al.., PCR Methods and

Applications 1, 17 (1991) ; PCR (eds. McPherson et al..., IRL Press. Oxford) ; and U. S. Pat. No.

4. 683. 202 (each of which is incorporated by reference for all purposes).

Other suitable amplification methods include the ligase chain reaction (LCR), (See Wu and Wallace, Genomics 4, 560 (1989), Landegren et al..., Science 241 1077 (1988)), transcription amplification (Kwoh et al.... Proc. Natl. Acad. Sci. USA 86, 1173 (1989)), and self- sustained sequence replication (Guatelli et al..., Proc. Nat. Acad. Sci. USA, 87, 1874 (1990)) and nucleic acid based sequence amplification (NASBA). The latter two amplification methods involve isothermal reactions based on isothermal transcription, which produce both single stranded RNA (ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio of about 30 or 100 to 1, respectively.

There are two distinct types of analysis depending whether a polymorphism in question has already been characterized. The first type of analysis is sometimes referred to as de novo characterization. This analysis compares target sequences in different individuals to identify points of variation, i. e., polymorphic sites. By analyzing a groups of individuals representing the greatest ethnic diversity among humans and greatest breed and species variety in plants and animals, patterns characteristic of the most common alleles/haplotypes of the locus can be identified, and the frequencies of such populations in the population determined. Additional allelic frequencies can be determined for subpopulations characterized by criteria such as geography, race, or gender. The second type of analysis is determining which form (s) of a characterized polymorphism are present in individuals under test.

OTHER EMBODIMENTS The present invention is not to be limited in scope by the specific embodiments described herein. Indeed, various modifications of the invention in addition to those described herein will become apparent to those skilled in the art from the foregoing description and accompanying figures. Such modifications are intended to fall within the scope of the appended claims.