Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHODS FOR DIAGNOSIS AND THERAPY OF AUTOIMMUNE DISEASE, SUCH AS INSULIN DEPENDENT DIABETES MELLITUS, INVOLVING RETROVIRAL SUPERANTIGENS
Document Type and Number:
WIPO Patent Application WO/1999/005527
Kind Code:
A2
Abstract:
The invention relates to a process for the diagnosis of a human autoimmune disease, including presymptomatic diagnosis, said human autoimmune disease being associated with human endogenous retrovirus (HERV) having Superantigen (SAg) activity, comprising specifically detecting in a biological sample of human origin at least one of the following: (I) the mRNA of an expressed human endogenous retrovirus having Superantigen (SAg) activity, or fragments of such expressed retroviral mRNA, said retrovirus being associated with a given autoimmune disease, or (II) protein or peptide expressed by said retrovirus, or (III) antibodies specific to the protein expressed by said endogenous retrovirus, or (IV) SAg activity specifically associated with said endogenous retrovirus, detection of any of the species (I) to (IV) indicating presence of autoimmune disease or imminent onset of autoimmune disease.

Inventors:
CONRAD BERNARD (CH)
MACH BERNARD (CH)
Application Number:
PCT/EP1998/004926
Publication Date:
February 04, 1999
Filing Date:
July 22, 1998
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
MEDIGEN S A (CH)
CONRAD BERNARD (CH)
MACH BERNARD (CH)
International Classes:
A61K31/7088; A01K67/027; A61K31/7115; A61K39/21; A61P3/10; A61P37/00; C07K7/00; C07K14/08; C07K14/15; C12N5/10; C12N7/00; C12N15/09; C12P21/02; C12Q1/02; C12Q1/48; C12Q1/68; C12Q1/70; G01N33/50; G01N33/564; G01N33/569; G01N33/68; A61K38/00; (IPC1-7): G01N33/564
Domestic Patent References:
WO1995011975A21995-05-04
WO1996035793A11996-11-14
WO1996023076A11996-08-01
WO1993023560A11993-11-25
WO1997022694A21997-06-26
Foreign References:
US5508166A1996-04-16
Other References:
CONRAD B ET AL: "SUPERANTIGENS AS ETIOPATHOGENETIC FACTORS IN THE DEVELOPMENT OF INSULIN-DEPENDENT DIABETES MELLITUS" DIABETES/METABOLISM REVIEWS, vol. 10, no. 4, December 1994 (1994-12), pages 309-338, 322A, XP000608792
WALCHNER, M. ET AL.,: "Endogene retrovirale sequenzen als Faktor in der pathogenese des systemischen lupus erythemotides" HAUTARZT, vol. 47, July 1996 (1996-07), pages 502-509, XP002058198
ONO M ET AL: "NUCLEOTIDE SEQUENCE OF HUMAN ENDOGENOUS RETROVIRUS GENOME RELATED TO THE MOUSE MAMMARY TUMOR VIRUS GENOME" JOURNAL OF VIROLOGY, vol. 60, no. 2, November 1986 (1986-11), pages 589-598, XP000673638
LOEWER W, ET AL.: "Identification of a Rev-related protein by analysis of spliced transcripts of the human endogenous retrovirus HTDV/HERV-K" JOURNAL OF VIROLOGY, vol. 69, no. 1, January 1995 (1995-01), pages 141-148, XP002058199
CONRAD, B. ET AL.: "A human endogenous retroviral superantigen as candidate autoimmune gene in type I diabetes" CELL, vol. 90, July 1997 (1997-07), pages 303-313, XP002058200
C. BENOIST AND D. MATHIS: "Retrovirus as trigger, precipitator or marker?" NATURE, vol. 388, no. 6645, 28 August 1997 (1997-08-28), pages 833-834, XP002058201
LOEWER R ET AL: "A GENERAL METHOD FOR THE IDENTIFICATION OF TRANSCRIBED RETROVIRUS SEQUENCES (R-U5 PCR) REVEALS THE EXPRESSION OF THE HUMAN ENDOGENOUS RETROVIRUS LOCI HERV-H AND HERV-K IN TERATOCARCINOMA CELLS" VIROLOGY, vol. 192, February 1993 (1993-02), pages 501-511, XP000615447
Attorney, Agent or Firm:
Almond-martin, Carol (rue Chauveau-Lagarde Paris, FR)
Download PDF:
Claims:
CLAIMS
1. Process for the diagnosis of a human autoimmune disease, including presymptomatic diagnosis, said human autoimmune disease being associated with human endogenous retrovirus (HERV) having Superantigen (SAg) activity, comprising specifically detecting in a biological sample of human origin at least one of the following : Ithe mRNA of an expressed human endogenous retrovirus having Superantigen (SAg) activity, or fragments of such expressed retroviral mRNA, said retrovirus being associated with a given autoimmune disease, or IIprotein or peptide expressed by said retrovirus, or IIIantibodies specific to the proteins expressed by said endogenous retrovirus, or IVSAg activity specifically associated with said endogenous retrovirus, detection of any of the species (I) to (IV) indicating presence of autoimmune disease or imminent onset of autoimmune disease.
2. Process according to claim 1 wherein the expressed retroviral mRNA is specifically detected by nucleic acid amplification using primers, one of which is specific for the poly (A) signals present in the 3' Rpoly (A) sequences at the 3'extremity of the retrovirus.
3. Process according to claim 1 wherein the protein or peptide expressed by the endogenous retrovirus is detected using antibodies specific for the said retroviral protein or peptide.
4. Process according to claim 1 wherein the antibodies specific to retroviral protein are detected by use of the retroviral protein, or fragments thereof with which the antibodies specifically react.
5. Process according to claim 1 wherein SAg activity specifically associated with said HERV is detected, the biological sample being a biological fluid containing MHC Class II+ cells or cells induced to express MHC Class II molecules, this sample being contacted with cells bearing one or more variable (V)ß Tcell receptor chains, and detecting preferential proliferation of the V subset, or one of the vß subsets characteristic of said autoimmune disease.
6. Process according to claim 1 wherein the autoimmune disease is type I diabetes and the associated retrovirus having SAg activity is IDDMK,, 2 22 comprising the 5'long terminal repeat shown in Figure 7A, the 3'short terminal repeat shown in Figure 7B, or the env encoding sequences shown in Figure 7C, Figure 7D or Figure 7E, or variants thereof presenting approximately at least 90% sequence identity.
7. Process according to claim 6 wherein the expressed retroviral RNA is specifically detected by nucleic acid amplification using primers, one of which is specific for the poly (A) signals present in the 3' Rpoly (A) sequences at the 3'extremity of IDDMK1, 222.
8. Process according to claim 7 wherein the poly (A) specific primer is <BR> <BR> 5'TTTTTGAGTCCCCTTAGTATTTATT3'or<BR> 5'T (2o GAGTCCCCTTAGTATTTATT 3'.
9. Process according to claim 6 wherein protein expressed by IDDMK,, 222 is detected, said protein being either the protein encoded by the Nterminal moiety of the env coding region of IDDMKi22 as illustrated in Figure 7D or 7G, or the protein encoded by the pol coding region, as illustrated in Figure 7H, or a protein having at least 90% homology with the illustrated protein, or a fragment of said proteins having at least 6 aminoacids.
10. Process according to claim 6 wherein antibodies specific for env or pol proteins expressed by IDDMK1, are detected using the env or pol proteins illustrated in Figure 7D, 7G or 7H, or a protein having at least 90% homology with the illustrated protein, or a fragment of said proteins having at least 6 amino acids.
11. Human endogenous retrovirus having superantigen activity, and being associated with human autoimmune disease, said retrovirus being obtainable from RNA prepared from a biological sample originating from a human autoimmune source, by carrying out the following steps : i) isolation of the 5'RU5 ends of expressed putative retroviral genomes using nucleic acid amplification, the 3'primer being complementary to known « primer binding sites » (pbs) and the 5'primer being an oligonucleotide anchor; ii) isolation of the 3'Rpoly (A) ends corresponding to the 5'RU5 ends, by use of primers specific for the R regions isolated in step i); iii) amplification of the conserved RTRNase H region within the pol gene by using degenerate primers corresponding to the conserved region; iv) amplification of the 5'moiety of the putative retroviral genome by using primers specific for the different U5 regions isolated in step i) in conjunction with a primer specific for the 3'end of the central pol region isolated in step iii) ; v) amplification of the 3'moiety of the putative retroviral genome using primers specific for the central pol region isolated in step iii) in conjunction with primers specific for the poly (A) signals present in the 3'Rpoly (A) sequences isolated in step ii) ; vi) confirmation of the presence of an intact retroviral genome by amplification using primers specific for its predicted U5 and U3 regions.
12. Proviral DNA of a retrovirus according to claim 11.
13. Proviral DNA according to claim 12 obtainable from a biological sample of human origin by : i) obtaining retroviral RNA according to the method of claim 11, and further, ii) generating a series of DNA probes from the retroviral RNA obtained in i); iii) hybridising under stringent conditions, the probes on a genomic human DNA library ; iv) isolation of the genomic sequences hybridising with the probes.
14. Nucleic acid molecule comprising fragments of the retroviral RNA or DNA according to any one of claims 11 to 13, said fragment having a length of at least 15 nucleotides and preferably at least 30 nucleotides.
15. Nucleic acid molecule according to claim 14, encoding SAg activity of the retrovirus.
16. Nucleic acid molecule according to claim 15 derived from an endogenous human retrovirus open reading frame and optionally containing at least one internal stop codon.
17. Nucleic acid molecule according to claim 15 or 16 comprising the retroviral env gene.
18. Nucleic acid molecule comprising a sequence complementary to the nucleic acid molecules of any one of claims 11 to 17.
19. Nucleic acid molecule according to claim 18 comprising a ribozyme or antisense molecule to a human retrovirus having SAg activity to a proviral DNA of said retrovirus or a fragment thereof.
20. Nucleic acid molecule capable of hybridizing in stringent conditions, with the nucleic acid molecules of any one of claims 11 to 19.
21. Vector comprising nucleic acid molecules of any one of claims 11 to 20.
22. Nucleic acid molecule comprising at least one of the sequences illustrated in Figures 7A, 7B, 7C, 7D, 7E, or a nucleic acid sequence encoding the POL protein shown in Figure 7H, or a sequence exhibiting at least 90% homology with any of these sequences, or a fragment of any of these sequences having at least 20 nucleotides, and preferably at least 40 nucleotides.
23. Nucleic acid molecule at least partially complementary to any of the sequences according to claim 22.
24. Nucleic acid molecule according to claim 22 comprising a ribozyme or antisense.
25. Nucleic acid molecule which is HERV IDDMK1 222 comprising each of the sequences illustrated in Figures 7A, 7B, 7C, or sequences having at least 90% identity with these sequences, having a size of approximately 8.5 kb, having SAg activity encoded within the env region illustrated in Figure 7D or 7E, said SAg activity being specific for Vp7TCR chains.
26. Protein or peptide having at least 6 amino acids, characterised in that : it exhibits SAg activity and optionally is capable of giving rise, directly or indirectly, to autoreactive Tcells targeting tissue characteristic of a given autoimmune disease; it is encoded by a human endogenous retrovirus; it is obtainable from biological samples of patients having autoimmune disease.
27. Protein or peptide according to claim 26, encoded by the env gene of the HERV, or a portion thereof.
28. Protein or peptide according to claim 27 corresponding to a protein or peptide resulting from a premature translational stop, and/or from a frame shift in the translation of a retroviral open reading frame.
29. Protein or peptide according to any one of claims 26 to 28 obtainable by introducing viral DNA of claim 13 or fragments thereof, or corresponding synthetic DNA into a eukaryotic cell under conditions allowing the DNA to be expressed, and recovering said protein.
30. Protein according to any one of claims 26 to 29 comprising the amino acid sequence shown in Figure 7D, Figure 7F, Figure 7G, Figure 7H, or an amino acid sequence having at least 80% and preferably at least 90 % homology with the illustrated sequences, or a fragment of said sequence having at least 6 amino acids.
31. Antibodies capable of specifically recognising a protein or peptide according to any one of claims 26 to 30.
32. Antibodies according to claim 31 which are monoclonal.
33. Antibodies according to claim 31 or 32 which specifically recognise a HERV protein having SAg activity and which have the capacity to block SAg activity.
34. Cellline transfected with and expressing a human retrovirus or a portion thereof or a nucleic acid molecule according to any one of claims 11 to 25.
35. Nonhuman cells transformed with and expressing a human retrovirus or a nucleic acid molecule according to any one of claims 11 to 25.
36. Cellline or cells according to claim 34 or 35, said celllines or cells being MHC Class II+ and expressing a protein having SAg activity.
37. Process for identifying substances capable of binding to retroviral protein or peptide according to any one of claims 26 to 30, comprising contacting the substance under test, optionally labelled with detectable marker, with the said retroviral protein or peptide having SAg activity, and detecting binding.
38. Process for identifying substances capable of blocking SAg activity of an endogenous retrovirus associated with autoimmune disease, comprising introducing the substance under test into an assay system comprising i) MHC Class II+ cells functionally expressing retroviral protein or peptide according to any one of claims 26 to 30 and ii) cells bearing VusT cell receptor chains of the family or families specifically stimulated by the HERV SAg expressed by the MHC Class II+ cells, and determining the capacity of the substance under test to diminish or block Vp specific stimulation by the retroviral SAg.
39. Process according to claim 38 wherein the cells bearing VusT cell receptor chains are Tcell hybridoma and Vpspecific stimulation is determined for example by measurement of IL2 release, or measurement of Tcell proliferation.
40. Process according to claim 38 or 39, comprising an additional preliminary screening step for selecting substances capable of binding to retroviral protein having SAg activity, said screening step being according to claim 38.
41. Process for identifying substances capable of blocking transcription or translation of human endogenous retroviral (HERV) SAgencoding nucleic acid sequences, said SAg being associated with a human autoimmune disease, comprising : i) contacting the substance under test with cells expressing endogenous retroviral protein or peptide having SAg activity, according to any one of claims 26 to 30 and ii) detecting loss of SAg protein expression using SAg protein markers such as specific, labelled antiSAg antibodies.
42. Process according to claim 41 the cells expressing HERV protein having SAg activity are MHC Class II+ cells, and the process further comprises detection of loss of SAg activity by the process of claim 38.
43. Kit for screening substances capable of blocking SAg activity of a retrovirus associated with an autoimmune disease, or of blocking transcription or translation of the retroviral SAg protein, comprising : MHC Class II+ cells transformed with and functionally expressing said retroviral SAg; cells bearing Vß Tcell receptor chains of the family or families specifically stimulated by the HERV SAg ; means to detect specific Vp stimulation by HERV SAg ; optionally, labelled antibodies specifically binding to the retroviral SAg.
44. Protein or peptide derived from a retroviral SAg according to claim 26 wherein the protein is modified so as to be devoid of SAg activity and is capable of generating a immune response against SAg, involving either antibodies and/or Tcell responses.
45. Protein according to claim 44 wherein the modification consists of denaturation, or of a truncation, or of a deletion, insertion or replacement mutation of the SAg protein.
46. Protein according to claim 44 or 45 for use as a prophylactic or therapeutic vaccine against autoimmune disease associated with retroviral SAg.
47. Vaccine comprising an immunogenically effective amount of a protein according to claim 44 or 45 in association with a pharmaceutically acceptable carrier and optionally adjuvant.
48. Nucleic acid molecule encoding human retroviral SAg according to claim 15 or a modified form of said molecule for use as a prophylactic or therapeutic DNA vaccine against autoimmune disease associated with the retroviral SAg.
49. Substances identifiable by the process according to any one of claims 37 to 42 for use in therapy and/or prevention of autoimmune disease associated with the HERV SAg.
50. Use of substances capable of inhibiting retroviral function for the preparation of a medicament for use in therapy and/or prevention of autoimmune disease associated with retroviral SAg.
51. Use according to claim 50 wherein the substance capable of inhibiting retroviral function is Azido Deoxythymidine (A. Z. T.).
52. Use of substances capable of inhibiting retroviral SAg function for the preparation of a medicament for use in therapy of autoimmune disease associated with retroviral SAg.
53. Process for detecting human autoimmune disease associated with expression of human endogenous retrovirus Superantigen (SAg), said process comprising at least one of the following steps : i) detecting the presence of any expressed retrovirus in a biological sample of human origin; ii) detecting the presence of SAg activity in a biological sample of human origin containing MHC Class II cells.
54. Process according to claim 53 wherein the expressed retrovirus is detected by detection of reverse transcriptase activity.
55. Process according to claim 54 wherein the expressed retrovirus is detected by carrying out nucleic acid amplification reaction on RNA prepared from the biological sample, using as 3'primer a sequence complementary to known retroviral « primer binding sites » (pbs), and as 5'primer a nonspecific anchor sequence.
56. Process according to claim 53 wherein the presence of SAg activity is detected by contacting the biological sample containing MHC Class II+ cells with cells bearing one or more variable (V)ß Tcell receptor (TCR) chains and detecting preferential proliferation of a Vß subset.
57. Process according to claim 56 wherein the cells bearing Tcell receptors are Tcell hybridoma bearing defined human Vp domains.
58. Process for detecting SAg activity of an expressed human retrovirus associated with human autoimmune disease or of a portion of said retrovirus comprising : i) transfecting expressed retroviral DNA or portions thereof into MHC Class II+ antigen presenting cells under conditions in which the DNA is expressed, ii) contacting the transfectants with cells bearing one or more defined (V)ß Tcell receptor chains, and iii) determining whether the transfectant is capable of inducing preferential proliferation of a Vß subset, the capacity to induce preferential proliferation being indicative of SAg activity within the transfected DNA or portion thereof.
59. Process for isolating and characterising a human retrovirus, particularly a human endogenous retrovirus (HERV), said retrovirus having SAg activity and being involved in human autoimmune disease, comprising the following steps : i) isolation of the 5'RU5 ends of expressed putative retroviral genomes using nucleic acid amplification, the 3'primer being complementary to known « primer binding sites » (pbs); ii) isolation of the 3'Rpoly (A) ends corresponding to the 5'RU5 ends, by use of primers specific for the R regions isolated in step i) : iii) amplification of the conserved RTRNase H region within the pol gene by using degenerate primers corresponding to the conserved region; iv) amplification of the 5'moiety of the putative retroviral genome by using primers specific for the different U5 regions isolated in step i) in conjunction with a primer specific for the 3'end of the central pol region isolated in step iii) ; v) amplification of the 3'moiety of the putative retroviral genome using primers specific for the central pol region isolated in step iii) in conjunction with primers specific for the poly (A) signals present in the 3'Rpoly (A) sequences isolated in step ii) ; vi) confirmation of the presence of an intact retroviral genome by amplification using primers specific for its predicted U5 and U3 regions.
60. Process according to claim 59 further comprising a step vii) of detecting SAg activity associated with the retrovirus, or portions thereof, said detection being carried out according to claim 58.
61. Transgenic animal including in its genome nonhuman cells according to claim 35.
Description:
Methods for Diagnosis and Therapy of Autoimmune Disease, such as Insulin Dependent Diabetes Mellitus, involving Retroviral Superantigens.

The present invention relates to methods for the diagnosis of human autoimmune disease, for example Insulin Dependent Diabetes Mellitus (IDDM), and to methods for identifying substances which can be used in the therapy and prevention of such diseases. The invention further relates to novel human retroviruses involved in autoimmune disease and having superantigen activity, as well as to their expression products.

For some autoimmune diseases such as IDDM, Multiple Sclerosis, arthritis and others, it is known that a combination of genetic, environmental and possibly exogenous infectious factors may be important in precipitating disease. However, the precise roles of each of these factors remains incompletely elucidated.

For example, for IDDM, the Major Histocompatibility Complex (MHC) Class II genotype is one of the strongest genetic factors determining disease susceptibility (Vyse, T. J. and Todd J. A., 1996) although the respective zoles of the different MHC Class II+ cell types in promoting disease has not yet been clarified.

Furthermore, IDDM shows temporal, epidemic-like variations and the clinical disease exhibits preferential seasonal onset (Karvonen et al., 1993).

Recently, Conrad et al. (1994) provided evidence for superantigen involvement in IDDM aetiology and postulated that viruses may be the modifying agent responsible for the presence of superantigen on diabetic islets.

Genetic background also has an important influence in multiple sclerosis. In addition, Perron et al (Perron et al, 1997) have recently identified a retrovirus which can be isolated from cells of multiple sclerosis patients. Whether the retrovirus contributes as a causative agent of multiple sclerosis or as a link in the pathogenic process, or whether it is merely an epiphenomenon, has not been identified. No superantigen activity of the retrovirus has been identified.

It is an aim of the present invention to identify agents implicated in the pathogenesis of human autoimmune diseases, such as IDDM, and on the basis of these agents to provide reliable diagnostic procedures and therapeutic or prophylactic substances and compositions.

These cbjectives are met by the provision, according to the invention, of diagnostic procedures involving the detection of expressed retroviruses having superantigen (SAg) function, these retroviruses being directly involved in the pathogenesis of human autoimmune disease by activation of autoreactive T- cells. Compounds and compositions capable of blocking SAg function or production are also provided as therapeutic and prophylactic agents in the treatment of autoimmune disease.

The present invention is based on the discovery, by the present inventors that superantigens (SAgs) encoded by retroviruses, particularly endogenous retroviruses, play a major role in the pathogenesis of autoimmune disease, very likely by activating autoreactive T-cells.

Superantigens (SAgs) (Choi et al, 1989; White et al, 1989) are microbial proteins able to mediate interactions between MHC Class II-and polyclonal T- cells resulting in reciprocal activation (Acha-Orbea et al, 1991; Choi et al, 1991; Fleischer and Schrezenmeier, 1988). Their function is restricted by only two absolute requirements : the presence of MHC Class II on the surface of the presenting cells and the expression of one or more defined Variable (V)- (3 T cell receptor (TCR) chain (s) on T cells.

The potential role of SAgs in human diseases is ill-defined. Bacterial SAgs have been proposed to be associated with the pathogenesis of autoimmune disease (White et al, 1989). However, although pathogen disease associations have been described, none of these have as yet implicated a pathogen-encoded SAg (Howell et al, 1991; Paliard et al, 1991). A SAg-like activity resembling the one encoded by MMTV has been reported to be associated with herpesvirus infections (Dobrescu et al, 1995; Sutkowski et al, 1996). However, in none of these two systems has it been demonstrated that the SAg activity is actually encoded by the infectious agent.

SAg activity has been reported in patients having Type I diabetes (Conrad et al 1994). However, the origin of the Sag activity is not identified.

In the framework of the present invention, the inventors have identified the source of SAg activity in IDDM patients as being a novel endogenous retrovirus, (HERV) designated IDDKK1. 2-22. This retrovirus is related to, but distinct from mouse mammary tumor virus (MMTV). It is ubiquitous in the human genome but is only expressed in diabetic individuals, possibly in response to a particular environmental stimulus. The HERV encodes superantigen (SAg) activity within the env gene. Expression of the SAg gives rise to preferential expansion of Vp-7 T-cell receptor positive T-cells, some of which are very likely to be autoreactive. Thus the expression of self-SAg leads to systemic activation of a sub-set of T-lymphocytes, among which autoreactive T-cells, will in turn give rise to organ-specific autoimmune disease.

The involvement of retroviral SAg, particularly endogenous retroviral SAg in autoimmune disease is unexpected. Indeed, endogenous retroviruses (HERV) form an integral part of the human genome. If expressed from birth, any autoreactive T-cells activated by expression of a retroviral SAg should be deleted as part of the normal development of the immune system (thymic deletion). However, in the case of autoimmune diseases such as diabetes, the expression of the retrovirus, and hence of the encoded SAg, occurs only later in life, leading to the proliferation of autoreactive T-cells.

To identify the microbial agent responsible for SAg activity in diabetes, the present inventors have developed a novel primer-extension technique. This method can be used to isolate and identify, in a sample of polyadenylated RNA, any expressed, previously unidentified retroviral RNA, particularly retroviruses having SAg activity and being involved in human autoimmune disease. This strategy relies on the following three characteristic features of functional retroviruses. First, retroviral genomes contain a primer binding site (PBS) near their 5'end. Cellular tRNAs anneal to the PBS and serve as primers for Reverse Transcriptase (reviewed by Whitcomb and Hughes, 1992). Second, the R (repeat) sequence is repeated at the 5'and 3'ends of the viral RNA (Temin, 1981).

Third, the RT-RNAse H region of the pol gene is the most conserved sequence among different retroelements (McClure et al., 1988 ; Xiong and Eickbusch, 1990). The method comprises the following steps : i) isolation of the 5'R-U5 ends of expressed putative retroviral genomes using nucleic acid amplification, the 3'primer being complementary to known « primer binding sites » (pbs). ii) isolation of the 3'R-poly (A) ends corresponding to the 5'R-U5 ends, by use of primers specific for the R regions isolated in step i). iii) amplification of the conserved RT-RNase H region within the pol gene by using degenerate primers corresponding to the conserved region. iv) amplification of the 5'moiety of the putative retroviral genome by using primers specific for the different U5 regions isolated in step i) in conjunction with a primer specific for the 3'end of the central pol region isolated in step iii). v) amplification of the 3'moiety of the putative retroviral genome using primers specific for the central pol region isolated in step iii) in conjunction with primers specific for the poly (A) signals present in the 3'R-poly (A) sequences isolated in step ii). vi) confirmation of the presence of an intact retroviral genome by amplification using primers specific for its predicted U5 and U3 regions.

Once an expressed retrovirus has been identified, its SAg activity can be tested by contacting a biological sample containing MHC Class II+ cells expressing the putative Sag activity, with cells bearing one or more variable (V)-ß T-cell receptor (TCR) chains and detecting preferential proliferation of a Vp subset.

The techniques developed by the inventors to elucidate Sag involvement in IDDM, can be used to identify the possible involvement of expressed retrovirus and encoded SAg activity in other autoimmune diseases. The characterisation of the retrovirus and its SAg can then be made, and the particular Vus-T cell receptor chain activation associated with the SAg can be identified. A given autoimmune disease can thus be defined by reference to a characterised retroviral Sag specifically associated with the disease, and to the V-specificity or specificities. In certain autoimmune diseases, such as multiple sclerosis, it is known that T-cells with different V specificities can be involved in the recognition of the same immunodominant autoantigen, M. B. P. (Wucherpfennig K. W. et al, Science 1990,25,1016-1019). Once this « profile » has been determined, specific diagnostic, therapeutic and prophylactic tools can be elaborated for each autoimmune disease involving retroviral SAg-stimulation of autoreactive T-cells.

The present invention involves, in a first embodiment, methods of diagnosis of autoimmune disease based on the specific expression, in autoimmune patients, of retroviruses having Sag activity.

The methods of diagnosis of the present invention are advantageous in so far as they are highly specific, distinguishing between expressed and non-expressed viral nucleic acid, and can thus be reliably used even when the pathological agent is a ubiquitous endogenous retrovirus. They can be carried out on easily accessible biological samples, such as blood or plasma, without extensive pre-treatment. The diagnostic methods of the invention detect disease-specific expression of the retrovirus and can thus be applied before appearance of clinical symptoms, for example on genetically predisposed individuals. This allows suitable therapy to be initiated before autoimmune destruction of a particular target tissue occurs.

In the context of the present invention, the following terms encompass the following meanings : * a « human autoimmune disease » is defined as a polygenic disease characterised by the selective destruction of defined tissues mediated by the immune system. Epidemiological and genetic evidence also suggests the involvement of environmental factors.

* a « human endogenous retrovirus » (HERV) is a retrovirus which is present in the form of proviral DNA integrated into the genome of all normal cells and is transmitted by Mendelian inhertance patterns.

Such proviruses are products of rare infection and integration events of the retrovirus under consideration into germ cells of the ancestors of the host. Most endogenous retroviruses are transcription- ally silent or defective, but may be activated under certain conditions. Expression of the HERV may range from transcription of selected viral genes to production of complete viral particles, which may be infectious or non-infectious. Indeed, variants of HERV viruses may arise which are capable of an exogenous viral replication cycle, although direct experimental evidence for an exogenous life cycle is still missing. Thus, in some cases, endogenous retroviruses may also be present as exogenous retroviruses. These variants are included in the term « HERV » for the purposes of the invention. In the context of the invention, « human endogenous retrovirus » includes proviral DNA corresponding to a full retrovirus as represented schematically in Fig.

2A, comprising two LTR's, gag, pol and env, and further includes remnants or « scars » of such a full retrovirus which have arisen as a results of deletions in the retroviral DNA. Such remnants include fragments of the structure depicted in Fig.

2A, and have a minimal size of one LTR. Typically, the HERVs have at least one LTR, preferably two, and all or part of gag, pol or env.

* a Superantigen is a substance, normally a protein, of microbial origin that binds to major histocompatibility complex (MHC) Class II molecules and stimulates T-cell, via interaction with the Vp domain of the T-cell receptor (TCR). SAgs have the particular characteristic of being able to interact with a large proportion of the T-cell repertoire, i. e. all the members of a given Vp subset or « family », or even with more than one V subset, rather than with single, molecular clones from distinct Vp families as is the case with a conventional (MHC-restricted) antigen. The superantigen is said to have a mitogenic effect that is MHC Class II dependent but MHC-unrestricted. SAgs require cells that express MHC Class II for stimulation of T-cells to occur.

« SAg activity » signifies a capacity to stimulate T- cells in an MHC-dependent but MHC-unrestricted manner. In the context of the invention, SAg activity can be detected in a functional assay by measuring either IL-2 release by activated T-cells, or proliferation of activated T-cells.

* a retrovirus having SAg activity is said to be « associated with » a given autoimmune disease when expressed retroviral RNA can be found specifically in biological samples of autoimmune patients (ie the expressed retroviral RNA is not found in individuals free of autoimmune disease). Preferably « associated with » further signifies in this context that retroviral SAg activation of a Vp subset gives rise directly or indirectly to proliferation of autoreactive T-cells targeting tissue characteristic of the autoimmune disease. Blockage of SAg activity thus normally prevents generation of autoreactive T- cells. Disease « association » with Sag can also be defined immunologically or genetically immunological association means that a particular disease-associated HLA haplotype is permissive for Sag, whereas resistant haplotypes are permissive for Sag inhibition. Genetic association implies a polymorphism in either the expression pattern of Sag or in the amino acid sequence of Sag, with Sag alleles exhibiting different degree of susceptibility to the disease. cells which « functionally express » Sag are cells which express Sag in a manner suitable for giving rise to MHC-dependent, MHC-unrestricted T-cell stimulation in vitro or in vivo. This requires that the cell be MHC II or that it has been made MHC II+ by induction by agents such as IFN-y.

More particularly, in a first embodiment, the present invention relates to a process for the diagnosis of a human autoimmune disease, including pre- symptomatic diagnosis, said human autoimmune disease being associated with human retrovirus having Superantigen (SAg) activity, comprising specifically detecting in a biological sample of human origin at least one of the following : I : the mRNA of an expressed human retrovirus known to have Superantigen (SAg) activity, or fragments of such expressed retroviral mRNA, said retrovirus being associated with a given autoimmune disease, or II : protein expressed by said retrovirus, or III : antibodies specific to the proteins expressed by said retrovirus, or IV : SAg activity specifically associated with the autoimmune disease.

Thus, the diagnosis of a given autoimmune disease can be made, according to the invention, by one or more of four methods (I to IV), each involving the detection of a specific aspect of the expression of a SAg- encoding retrovirus known to be associated with the autoimmune disease, particularly an endogenous retrovirus. Detection of any of the species (I) to (IV) as listed above is indicative of the presence of the autoimmune disease specifically associated with the endogenous retrovirus under consideration or of imminent onset of the disease.

Each of the four possible methods I to IV of diagnosis of human autoimmune disease will be described in detail below.

According to method I, the autoimmune disease is diagnosed by specifically detecting in a biological sample the mRNA of an expressed human retrovirus known to have SAg activity.

Specific detection of retroviral expressed mRNA is preferably carried out using nucleic acid amplification with viral specific primers which discriminate between proviral DNA and expressed RNA template. This is of particular importance when the retrovirus associated with the autoimmune disease is an endogenous retrovirus. Indeed in such cases, the proviral DNA is present in all human cells, whether or not the autoimmune disease is present. False positives would be obtained if a detection method were used which does not distinguish between proviral DNA and transcribe mRNA.

The biological sample to be used for specific mRNA detection according to the invention may be any body fluid or tissue but is preferably plasma or blood.

Normally, total RNA is extracted from the sample using conventional techniques. DNAse treatment may be carried out to reduce contaminating cellular DNA.

By performing the amplification on total RNA samples, the effects of contaminating DNA are reduced but not eliminated, even after treatment by DNAse. The method of the present invention allows selective amplification of expressed viral RNA transcripts using at least one m-RNA specific primer, for example a poly- A specific primer, even in the presence of contaminating viral DNA in the sample. The poly-A specific primer is specific for the poly-A signaals present in the R-poly (A) sequences and the 3'extremity of the retrovirus (see for example Figure 2A step 5 and Figure 2C).

It has surprisingly been found that a poly-A- specific primer having from four to 25 T's for example 5 or 20 T's is optimal for the purposes of the present invention.

The mRNA specific amplification requires a reverse transcriptase (RT) step, for which the poly A- specific primer is also be used.

The second primer in the PCR step is generally complementary to the U3 region. When the amplification product has a size of about 300 to 500 nucleotides, the conditions applied for the amplification (PCR) step are normally the following : i) reverse transcriptase : 50°C 30 minutes ii) amplification : 940C 2 minutes (for a total 94°C 30 secondes of 10 cycles) 68°C 30 secondes -1.3°C each cycle 68°C 45 secondes iii) amplification 94'C 30 secondes (for a total 55°C 30 secondes 25 cycles) 68°C 45 secondes The amplified material is subjected to gel electrophoresis and hybridised with suitable probes, for example generated from the U3 region.

By perfcrming the mRNA specific detection of the invention, the presence of a given expressed retrovirus can be reliably determined in a biological sample. For endogenous retroviruses expression generally indicates onset of the disease process. This can be detected well before the apparition of any clinical symptoms. The diagnosis of the invention can thus be used to detect onset of the disease process, enabling treatment to be administered before irreversible autoimmune attack occurs.

The invention also encompasses pro-viral specific detection of retroviral DNA, and simultaneous detection of both expressed retroviral m-RNA and proviral DNA.

Details of these methods are given in Figure 2D and 2E, and associated legends. Specific proviral DNA detection can be used on healthy biological samples to confirm the endogenous nature of the retrovirus. the assay detecting both retroviral mRNA and proviral DNA can be used as an internal standard.

According to a preferred embodiment of the invention, the autoimmune disease detected is IDDM. The present inventors have identified, a human endogenous retrovirus associated with IDDM. This novel retrovirus (called IDDMK1 2-22) has SAg activity encoded in the NH2 terminal portion of the env gene, causing preferential proliferation of V7-TCR chain bearing T-cells.

IDDMK1 2-22 comprises the 5'LTR, 3'LTR and env- encoding sequences shown in Figures 7A, 7B and 7C respectively, and further comprises gag-encoding sequences. The SAg portion of the env protein occurs within the sequences shown in Figure 7D or 7G, particularly 7G.

Diagnosis of IDDM by specific detection of expressed retroviral RNA is carried out using a polyA specific probe of the type : 5'TTTTTGAGTCCCCTTAGTATTTATT 3' or similar sequence specifically hybridising to the polyA region of IDDMK1. 2-22 type retroviruses, having at least 90% sequence identity with the IDDMK1 2-22 and having SAg activity.

According to a second embodiment (II) of the invention, the human autoimmune disease associated with a retroviral SAg is diagnosed by specifically detecting protein expressed by the retrovirus, particularly gag, pol or env. In the case of endogenous retroviruses, the expressed proteins may be slightly different from the expected products as a result of read-through phenomena and possibly reading-frame shifts. Preferably, the expressed protein is detected in the biological sample, such as blood or plasma, using antibodies, particularly monoclonal antibodies, specific for the said protein. A Western-like procedure is particularly preferred, but other antibody-based recognition assays may be used.

In the case of IDDM, a preferred diagnostic method comprises the detection of a protein encoded by the env gene, as shown in Figure 7C, 7D or 7G, or the pol protein shown in Figure 7H, or the IDDMK1 2-22 GAG protein. Alternatively, proteins having at least approximately 90 % homology with these proteins, or proteins arising from read-through of internal stop codons, possibly with frame-shift, particularly a-1 frame shift, occurring immediately after the internal stop codon. Fragments of any of these proteins having at least 6, and preferably at least 10 amino acids, for example 6-20, or 10-15 amino acids, may also be detected. Preferred proteins for this type of diagnostic assay are those having SAg activity. It is also possible to detect retroviral particles when produced.

According to a third embodiment (III) of the invention, the autoimmune disease is diagnosed by detecting in a biological sample, antibodies specific for the protein expressed by the associated retrovirus.

Detection of antibodies specific for these proteins is normally carried out by use of the corresponding retroviral protein or fragments thereof having at least 6 amino-acids, preferably at least 10, for example 6-25 amino acids. The proteins are typically Gag, Pol or Env or fragments thereof and may or may not have superantigen activity. The retroviral proteins used in the detection of the specific antibodies may be recombinant proteins obtained by introducing viral DNA encoding the appropriate part of the retrovirus into eukaryotic cell and the conditions allowing the DNA to be expressed and recovering the said protein.

In the context of the present invention, the terms"antibodies specific for retroviral proteins" signifies that the antibodies show no significant cross reaction with any other proteins likely to occur in the biological sample. Generally, such antibodies specifically bind to an epitope which occurs exclusively on the retroviral protein in question. The antibodies may recognize the retroviral protein having SAg activity as presented by the M. H. C class II molecule.

Detection of specific antibodies may be carried out using conventional techniques such as sandwich assays, etc. Western blotting or other antibody-based recognition system may be used.

According to the fourth embodiment of the invention, the autoimmune disease is diagnosed by detecting, in a biological sample, SAg activity specifically associated with the autoimmune disease.

This is done by carrying out a functional assay in which a biological fluid sample containing MHC class II+ cells, for example Antigen Presenting Cells (APC) such as dendritic cells is contacted with cells bearing one or more variable P-T-receptor chains and detecting preferential proliferation of the V subset characteristic of said autoimmune disease. Typically, this method of diagnosis is combined with one or more of the methods (I), (II), (III) as described earlier to maximise specificity.

The biological sample according to this variant of the invention is typically blood and necessarily contains MHC class II+ cells such as B-lymphocytes, monocytes, macrophages or dendritic cells which have the capacity to bind the superantigen and enable it to elicit its superantigen activity. MHC class II content of the biological sample may be boosted by addition of agents such as IFN-gamma.

The biological fluid sample is contacted with cells bearing the V-T receptors belonging to a variety of different families or subsets in order to detect which of the Vß subsets is stimulated by the putative SAg, for example V-ß2,3,7,8,9 13 and 17. Within any one V-ß family it is advantageous to use V-ß chains having junctional diversity in order to confirm superantigen activity rather than nominal antigen activity.

The cells bearing the V-P receptor chains may be either an unselected population of T-cells or T-cell hybridoma. If unselected T-cells are used, the diagnostic process is normally carried out in the following manner : the biological sample containing MHC Class II+ cells is contacted with the T-cells for approximately 3 days. A growth factor such as Interleukin 2 (IL-2) which selectively amplifies activated T-cells is then added. Enrichment of a particular V-P family or families is measured using monoclonal antibodies against the TCR-p-chain. Only amplified cells are thus detected. The monoclonal antibodies are generally conjugated with a detectable marker such as a fluorochrome. The assay can be made T- cell specific by use of a second antibody, anti CD3, specifically recognizing the CD3-receptor.

T-cell hybridoma bearing defined T-cell receptor may also be used in the functional or cell-based assay for SAg activity. An example of commercially available cells of this type are given in B. Fleischer et al.

Infect. Immun. 64,987-994,1996. Such cell-lines are available from Immunotech, Marseille, France. According to this variant, activation of a particular family of V-P hybridoma leads to release of IL-2. IL2 release is therefore measured as read-out using conventional techniques. A specific example of this procedure for diabetes is illustrated in Figure 9. The basic methodology is adapted for other autoimmune diseases by employing T-cell receptor cells of the appropriate type for that disease.

For diabetes, detection of SAg activity will normally lead to preferential proliferation of the V-ß7 subset. For other autoimmune diseases, other V- (3 subsets may be proliferated.

According to another aspect of the present invention, there is provided human endogenous retroviruses having superantigen activity and being associated with human auto immune disease. Such retroviruses which may be of the HERV-K family, or otherwise, are obtainable from RNA prepared from a biological sample of human origin, by carrying out the following steps : i) isolation of the 5'R-U5 ends of expressed putative retroviral genomes using nucleic acid amplification, the 3'primer being complementary to known « primer binding sites » (pbs) ; ii) isolation of the 3'R-poly (A) ends corresponding to the 5'R-U5 ends, by use of primers specific for the R regions isolated in step i) ; iii) amplification of the conserved RT-RNase H region within the pol gene by using degenerate primers corresponding to the conserved region; iv) amplification of the 5'moiety of the putative retroviral genome by using primers specific for the different U5 regions isolated in step i) in conjunction with a primer specific for the 3'end of the central pol region isolated in step iii) ; v) amplification of the 3'moiety of the putative retroviral genome using primers specific for the central pol region isolated in step iii) in conjunction with primers specific for the poly (A) signals present in the 3'R-poly (A) sequences isolated in step ii) ; vi) confirmation of the presence of an intact retroviral genome by amplification using primers specific for its predicted U5 and U3 regions.

A preferred human endogenous retrovirus of the invention is IDDMK 1,2 22 comprising each of the sequences illustrated in figures 7A, 7B, 7C or sequences having at least 90 % identity with these sequences, and further comprising GAG-encoding sequences, and sequences encoding POL as shown in figure 7H. This retrovirus has a size of approximately of 8.5 kb, has SAg activity encoded within the Env region as shown in figure 7C and 7E and gives rise to V- (37 specific proliferation.

The invention also relates to proviral DNA of a retrovirus having superantigen activity and being associated with an autoimmune disease. Such proviral DNA is naturally found integrated into the human genome. The proviral DNA may be obtained from a biological sample of human origin by : i) obtaining retroviral RNA according to the method of claim 13, and further, ii) generating a series of DNA probes from the retroviral RNA obtained in i); iii) hybridising under stringent conditions, the probes on a genomic human DNA library; iv) isolation of the genomic sequences hybridising with the probes.

The invention also relates to nucleic acid molecules (RNA, DNA or cDNA) comprising fragments of the retroviral RNA or DNA described above, having at least 20 nucleotides and preferably at least 40. The fragments may be specific for a given retrovirus, specific signifying a homology of less than 20 % with other human or non-human retroviruses.

Preferred nucleic acid molecules of the invention encode SAg activity particularly SAg activity, responsible for the proliferation of autoreactive T- cells. If the region of the viral genome encoding the SAg activity is unknown, the particular region may be identified by : i) transfecting expressed retroviral DNA or portions thereof into MHC Class II+ antigen presenting cells under conditions in which the viral DNA is expressed, ii) contacting the MHC class II+ transfectants with cells bearing one or more defined (V)-ß T-cell receptor chains, and iii) determining whether the transfectant is capable of inducing preferential proliferation of a Vß subset, the capacity to induce preferential proliferation being indicative of SAg activity within the transfected DNA or portion thereof. Proliferation may be measured by determination of 3H-thymidine incorporation (see Examples methods and materials).

The nucleic acid molecule encoding SAg activity may be derived from an endogenous human retrovirus. It typically corresponds to an open reading frame of the retrovirus and may contain at least one internal stop codon or may be a synthetic mutant in which 1 or 2 nucleotides have been added or deleted to remove the stop codon and modify the reading frame.

Preferably, the nucleic acid of the invention comprises or consists of all or part of the env gene (encoding the envelope glycoprotein) of an endogenous human retrovirus associated with autoimmune disease.

The env-encoded protein is particularly likely to have SAg activity, as exemplified by the IDDM HERV.

Synthetic or recombinant nucleic acids corresponding to the env genes or fragments thereof are also within the scope of the invention.

The nucleic acid molecules of the invention may comprise ribozymes or antisense molecules to the retrovirus involved in autoimmune disease.

The invention also relates to nucleic acid molecules capable of hybridizing in stringent conditions with retroviral DNA or RNA. Typical stringent conditions are those where the combination of temperature and salt concentration chosen to be approximately 12-20°C below the Tm (melting temperature) of the hybrid under study.

Such nucleic acid molecules may be labelled with conventional labelling means to act as probes or, alternatively, may be used as primers in nucleic acid amplification reactions.

Preferred nucleic acid molecules of the invention are illustrated in figures 7A, 7B, 7C, 7D, 7E, 7G and also encompass nucleic acid sequences encoding the POL protein shown in figure 7H, and the GAG protein.

Sequences exhibiting at least 90 % homology with any of the afore-mentioned sequences are also comprised within the invention or fragments of any of these sequences having at least 20 and preferably at least 30 nucleotides.

The Env encoding sequence shown in figure 7C is particularly preferred, as well as the nucleic acid encoding the Env/F-S SAg protein shown in figures 7G and 7E. A preferred nucleic acid molecule is a molecule encoding the Env/F-S Sag protein wherein the first internal stop codon (shown underlined in figure 7C), is mutated by insertion of an extra T (at position 517 in Figure 7G underlined) to eliminate premature translational stop, the resulting sequence being then in the correct reading frame to encode the COOH terminal extension (shown underlined in Figure 7G).

This protein arises naturally from read-through together with a-1 frame shift, but this process is inefficient. The synthetic T'-inserted cDNA provides an efficient way of producing the SAg molecule shown in Figure 7G. The single reading frame in this « synthetic » molecule thus corresponds to two different reading frames separated by a stop codon in the natural molecule. Nucleic acid molecules encoding an HERV env and including minus 1, plus 1 frameshifts and termination suppression (0 frame) are thus particularly preferred embodiments of the invention.

The invention further relates to proteins expressed by human endogenous retroviruses having SAg activity and being associated with human autoimmune disease. Peptides or fragments of these proteins having at least 6 and preferably at least 10 aminoacids, for example 6-50 or 10-30 amino acids, are also included within the scope of the invention. Such proteins may be Gag, Pol or Env proteins or may be encoded by any Open Reading Frame situated elsewhere in the viral genome.

These proteins may or may not present SAg activity.

Particularly preferred proteins of the invention have SAg activity. Examples of SAg proteins of the invention are proteins encoded by the env gene of HERV, for example that shown in Figure 7G.

The proteins having SAg activity may naturally result from a premature translational stop and possibly also from a translational frameshift. Endogenous retroviral ORFs typically contain a number of internal stop codons, which often render the HERV defective. It has been discovered by the present inventors that, in some cases, retroviral expression products having SAg activity result from read-through transcription of the ORF, possibly also accompanied by a reading frame shift. Consequently, the proteins exhibiting SAg activity are not, in these cases, the expected expression products of the retrovirus.

It may therefore be deduced that open reading frames of retroviruses associated with human autoimmune disease which contain at least one internal translational stop codon are among potential candidates for SAg activity. The proteins produced by premature translational stop may have an additional carboxy- terminal extension resulting from translational frame shift, for example-1 or-2 or +1 or +2 translational frame shift. Such a protein is illustrated in figure 7G. Further preferred proteins of the invention are the proteins encoded by synthetic cDNA, corresponding to the in-frame fusion of two normally different reading frames, together with mutation of the internal stop codon. These artificial open-reading frames are made by inserting or deleting one or two nucleotides in the coding sequence at the site where frame-shift occurs naturally, thus « correcting » the reading frame and enabling efficient production of a protein which is naturally only produced very inefficiently.

Other proteins of the invention are those comprising the aminoacid sequences shown in figure 7D, 7F, 7H or an aminoacid sequence having at least 80 % and preferably at least 90 % homology with the illustrated sequences or fragments of these sequences having at least 6 and preferably at least 10 aminoacids. The proteins of the invention may be made by synthetic or recombinant techniques.

The invention also relates to antibodies capable of specifically recognizing a protein according to the invention. These antibodies are preferably monoclonal.

Preferred antibodies are those which specifically recognize a retroviral protein having SAg activity and which have the capacity to block SAg activity. The capacity of the antibody to block SAg activity may be tested by introducing the antibody under test into an assay system comprising : i) MHC Class II cells expressing retroviral protein having SAg activity and ii) cells bearing Vus-T cell receptor chains of the family or families specifically stimulated by the HERV SAg expressed by the MHC Class II+ cells, and determining the capacity of the substance under test to diminish or block Vp-specific stimulation by the HERV Sag.

The steps described below involve the use of Sag- expressing transfectant cells such as those described in the examples, to inhibit the effect of Sag in vitro and in vivo. The example applies to the Sag expressed by the IDDM-associated HERV, as well as to other Sags, encoded by HERV associated with other autoimmune diseases, such as multiple sclerosis, and previously identified as Sag by a functional T cell activation assay as described earlier.

Mabs directed against the Sag protein (or portion of it) are generated by standard procedures used to generate antibodies against cell surface antigens. Mice are immunised with mouse cells expressing both Sag and MHC class II (such as a Sag-transfected mouse B cell line described in the examples below). After fusion with hybridoma cell lines, supernatants are screened for the presence of anti-Sag antibodies on microtiter plates for reactivity to Sag transfectants cells, with non-transfected cells as negative controls. Only Mabs with reactivity specific for Sag expressing cells are selected.

All such Mabs, either as culture supernatants or as ascites fluid, are then tested for their ability to block the Sag activity, as assayed by the T cell assay in the presence of Sag-expressing human MHC class II positive transfectants, as described in Example 4 below. A preferred version of this assay makes use of Vp-specific hybridomas as T cell targets for read out.

Controls are blocking of the same assay by anti-HLA-DR Mabs, which is known to inhibit the Sag effect on T cell activation. Mabs capable of efficiently blocking the Vp-specific Sag effect, when tested at several dilutions, are selected as anti-Sag blocking Mabs.

As well as monoclonal antibodies capable of inhibiting IDDM Sag, this generation and selection of anti-Sag blocking Mabs can be achieved in the case of any HERV-encoded Sag associated with other autoimmune diseases, once such a HERV-encoded Sag has been demonstrated.

Sufficient numbers of anti-Sag Mabs are screened in the functional assay to identify anti-Sag Mabs with optimal Sag blocking activity, in terms of T cell activation (see for example Figure 9). Selected Sag blocking Mabs are then converted into their « humanised » counterpart by standard CDR grafting methodology (a procedure performed for a fee under contract by numerous companies). A humanised anti-Sag blocking Mab, directed against the IDDM associated Sag or against any Sag encoded by another HERV associated with autoimmunity, can then be tested clinically in patients. In the case of IDDM, early diagnosed patients are selected and protection against progessive requirement for insulin therapy is followed as an index of efficacy. In the case of other autoimmune diseases, efficacy of the anti-Sag Mab is followed with reference to the relevant clinical parameters.

The invention also relates to cells transfected with and expressing human endogenous retrovirus having SAg activity and being associated with a human autoimmune disease. The cells may be preferably human cells other than the naturally occuring cells from auto-immune patients and may also include other type of eukaryotic cells such as monkey, mouse or other higher eukaryotes. The cells may be established cell-lines and are preferably MHC class II+, or MHC II-inducible, such as ß-lymphocytes and monocytes. Non-human higher eukaryotic cell-lines (e. g. mouse) stably transfected with the HERV Sags of the invention (as exemplified in Example 6 below) have been found to specifically stimulate in vitro human vp-T cells of the specificity normally associated with the HERV Sag in vivo. The stimulation is coreceptor independent (CD4 and CD8).

This specific T-cell stimulation can also be observed in vivo upon injection of the transfectants into non- human animals. A transgenic animal model for the human autoimmune disease is therefore technically feasible.

The transgenic animal is made according to conventional techniques and includes in its genome, nucleic acid encoding the HERV Sags of the invention.

A further important aspect of the invention relates to the identification of substances capable of blocking or inhibiting SAg activity. These substances are used in prophylactic and therapeutic treatment of autoimmune diseases involving retroviral SAg activity.

The invention thus concerns methods for treating or preventing autoimmune disease, for example IDDM, by administering effective amounts of substances capable of blocking Sag activity associated with expression of a human endogenous retrovirus. The substances may be antibodies, proteins, peptides, derivatives of the HERV, derivatives of the Sag or small chemical molecules. The invention also relates to pharmaceutical compositions comprising these substances in association with physiological acceptable carriers, and to methods for the preparation of medicaments for use in therapy or prevention of autoimmune disease using these substances.

Further, this aspect of the invention includes a process for identifying substances capable of blocking or inhibiting SAg activity of an endogenous retrovirus associated with autoimmune disease, comprising introducing the substance under test into an assay system comprising : i) MHC Class II+ cells functionally expressing retroviral protein having SAg activity and; ii) cells bearing Vus-T cell receptor chains of the family or families specifically stimulated by the HERV SAg expressed by the MHC Class II+ cells, and determining the capacity of the substance under test to diminish or block Vß-specific stimulation by the HERV SAg, The cells bearing the P-T cell receptors and the MHC Class II+ cells may be those described earlier. Read- out is IL-2 release.

The substances tested for inhibition or blockage of Sag activity in such screening procedures may be proteins, peptides, antibodies, small molecules, synthetic or naturally occurring, derivatives of the retroviruses themselves, etc... Small molecules may be tested in large amounts using combinatorial chemistry libraries.

The screening procedure may include an additional preliminary step for selecting substances capable of binding to retroviral protein having SAg activity. This additional screening step comprises contacting the substances under test, optionally labelled with detectable marker with the retroviral protein having SAg activity and detecting binding.

The Sags of the invention or a portion thereof may be used for the identification of low molecular weight inhibitor molecules as drug candidates.

The rational is that because HERV encoded Sags are the product of ancient infectious agents, they are not indispensable to humans and can thus be inhibited without adverse side effects.

Inhibitors of Sag, as potential drug candidates, are preferably identified by a two step process : In the first step, compatible with large scale, high throughput, screening of collections ( « libraries ») of small molecular weight molecules, the recombinant Sag protein (or portion of it) is used in a screening assay for molecules capable of simply binding to the Sag protein (= « ligands »). Such high throughput screening assays are routinely performed by companies such as Novalon Inc or Scriptgen Inc, and are based either on competition for binding of peptides to the target protein or on changes in protein conformation induced by binding of a ligand to the target protein. Such primary high throughput screening for high affinity ligands capable of binding to a target recombinant protein are available commercially, under contract, from such companies as Novalon or Scriptgen. This screening method requires that a HERV protein with Sag activity, and knowledge of such an activity, be available.

In the second step, any low molecular weight molecule identified as described above as capable of binding to the Sag protein, is tested in the functional Sag assay consisting of human MHC class II positive Sag transfectants and responding Vp-specific T cells (preferably hybridomas), as described herein. Positive control for Sag inhibition is an anti-HLA-DR Mab, known to inhibit the Sag effect. All candidate molecules are thus tested, at different concentrations, for a quantitative assessment their anti-Sag inhibitory efficacy.

This example can apply to the Sag encoded by the IDDM-associated HERV described herein, as well as to any other Sag discovered to be encoded by another HERV associated with another autoimmune disease.

This screening procedure relies upon the availability of a Sag and of a Sag functional assay according to the invention, but it otherwise relies on commercially available steps. Compounds exhibiting anti-Sag inhibitory effects are then tested for obvious toxicity and pharmacokinetics assays, in order to determine if they represent valuable drug candidates.

Once a. substance or a composition of substances has been identified which is capable of blocking or inhibiting SAg activity, its mode of action may be identified particularly its capacity to block transcription or translation of SAg encoding sequences.

This capacity can be tested by carrying out a process comprising the following steps : i) contacting the substance under test with cells expressing retroviral protein having SAg activity, as previously defined, and ii) detecting loss of SAg protein expression using SAg protein markers such as specific, labelled anti- SAg antibodies.

The antibodies used in such a detection process are of the type described earlier.

The invention also relates to a kit for screening substances capable of blocking SAg activity of an endogenous retrovirus associated with an autoimmune disease, or of blocking transcription or translation of the retroviral SAg protein. The kit comprises : -MHC Class II+ cells transformed with and expressing retroviral SAg according to the invention; -cells bearing Vp T-cell receptor chains of the family or families specifically stimulated by the HERV SAg ; -means to detect specific Vp stimulation by HERV SAg; -optionally, labelled antibodies specifically binding to the retroviral SAg.

According to a further important aspect of the invention, there is provided a protein or peptide derived from an autoimmune related retroviral SAg as previously defined wherein the protein is modified so as to be essentially devoid of SAg activity, thereby no longer being capable of significantly activating auto- reactive T-cells. Such modified proteins are however capable of generating an immune response against SAg, the immune response involving either antibodies and/or T-cells responses. The immunogenic properties of the modified proteins are thus conserved with respect with the authentic SAg.

Such modified immunogenic proteins may be obtained by a number of conventional treatments of the SAg protein, for example by denaturation, by truncation or by mutation involving deletion, insertion or replacement of aminoacids. Modified SAg proteins being essentially devoid of SAg activity but capable of generating an immune response against SAg include the truncations of the SAg protein, either at the amino or carboxyterminal, and may involve truncations of about 5-30 aminoacids at either terminal. A preferred example with respect to the IDDMK 1.2-22 SAg encoded by the Env gene illustrated in Figure 7, particularly in figure 7E and figure 7G, are amino and carboxy terminal truncations of the protein shown in figure 7G, for example truncations of 5,10,15,20,25 or 30 amino acids. An example of a C-terminal truncation of the IDDMK 1.2-22 SAg protein is the protein shown in figure 7D, involving a truncation of 28 amino acids. The modified protein may be obtained by recombinant or synthetic techniques, or by modifying naturally occuring SAg proteins, for example by physical or chemical treatment.

These proteins are used in the framework of the invention as vaccines, both prophylactic and therapeutic, against autoimmune disease associated with retroviral SAg. The vaccines of the invention comprise an immunogenically effective amount of the immunogenic protein in association with a pharmaceutically acceptable carried and optionally an adjuvant. The use of these vaccine compositions is particularly advantageous in association with the early diagnosis of the autoimmune disease using the method of the invention. The invention also includes the use of the immunogenic proteins in the preparation of a medicament for prophylactic or therapeutic vaccination against autoimmune diseases.

The rational behind this prospective immunisation technique is that because HERV encoded Sags are the product of ancient infectious agents, they are not indispensable to humans and can thus be inhibited without adverse side effects.

Identification of suitable anti-sag vaccine proteins or peptides can be made in the following way.

Modified forms of the original active Sag protein, including truncated or mutated forms, or even specific peptides derived from the Sag protein, are first tested in the functional Sag assays described above to confirm that they have lost all Sag activity (in terms of T cell activation). These modified forms of Sag are then used to immunise mice (or humans) by standard procedures and with appropriate adjuvants. Extent and efficacy of immunisation is measured, including circulating anti-Sag antibodies. In a preferred example, eliciting a B cell immune response, by selecting B cell epitopes from the Sag protein as immunogen, is deliberately aimed at.

Successfully immunised animals are then tested for the effect of Sag in vivo by a standard assay, namely the injection of MHC class II positive Sag transfectants (such as the transfectants described in the examples below), known to induce in vivo a Vp- specific T cell activation. Successful immunisation against a Sag protein is expected to result in a reduction or in a block of the in vivo Sag-induced T cell activation and proliferation in effectively immunised individuals. This procedure is referred to as anti-Sag vaccination. Immunisation against Sag can be performed in humans, for diabetes, preferably initially in the case of early diagnosed IDDM patients. Efficacy of this novel « vaccination » procedure is monitored by clinical outcome and by reduction of the expected requirements for insulin therapy. In the case of other Sags, encoded by HERV associated with autoimmune diseases other than diabetes, the clinical outcome is monitored accordingly.

The vaccines of the invention can be prepared as injectables, e. g. liquid solutions or suspensions.

Solid forms for solution in, or suspension in, a liquid prior to injection also can be prepared. Optionally, the preparation also can be emulsified. The active antigenic ingredient or ingredients can be mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient. Examples of suitable excipients are water, saline, dextrose, glycerol, ethanol, or the like, and combinations thereof. In addition, if desired, the vaccine can contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants such as aluminium hydroxide or muramyl dipeptide or variations thereof. In the case of peptides, coupling to larger molecules (e. g. KLH or tetanus toxoid) sometimes enhances immunogenicity. The vaccines are conventionally administered parenterally, by injection, for example, either subcutaneously or intramuscularly. Additional formulations which are suitable for other modes of administration includes suppositories and, in some cases, oral formulations.

The vaccines of the invention also include nucleic acid vaccines comprising nucleic acid molecules encoding the human retroviral Sag or modified forms of the SAg known to be immunogenic but no longer active as SAgs. The nucleic acid vaccines, particularly DNA vaccines, are usually administered in association with a pharmaceutically acceptable carrier as an intra- muscular injection.

The invention also relates to use of substances inhibiting either the retroviral function or the SAg function of the associated retroviruses, or Sag synthesis, in therapy for autoimmune diseases. These substances may be identified by the screening procedures described herein.

The invention further relates to methods for treatment or prevention of autoimmune diseases comprising administering an effective amount of a substance capable of inhibiting retroviral function or a substance capable of inhibiting SAg activity or synthesis.

An examples of compounds inhibiting retroviral function is AZT. Examples of compounds or substances capable of inhibiting SAg activity are antibodies to Sag, or ribozymes or antisense molecules to the SAg- encoding nucleic acid, or small molecules identifiedby virtue of their ability to inhibit SAg.

The invention also relates to a an exploratory process for detecting human autoimmune disease associated with expression of unidentified human retrovirus Superantigen (SAg), said process comprising at least one of the following steps : i) detecting the presence of any expressed retrovirus in a biological sample of human origin; ii) detecting the presence of SAg activity in a biological sample of human origin containing MHC Class II cells.

This process can be used as a preliminary indication of the involvement of retroviral superantigens in autoimmune disease.

Different aspects of the invention are illustrated in the figures.

Figure 1. Leukocytes from IDDM-patients release Reverse Transcriptase (RT) activity.

(A) Supernatants derived from cultured islets isolated from two patients (Conrad et al., 1994) were assayed for RT-activity, using a half-logarithmic dilution series of purified murine leukemia virus (MLV) RT as a standard (Pyra et al., 1994). Results are expressed as mean +/-1 SD. Islets and spleen cells from non- diabetic organ donors were cultured either alone, in the presence or absence of mitogen (-/+), or together in mixed allogeneic cultures (time as days in culture prior to collection of the supernatant is indicated below the bars).

(B) Islets and spleen cells from three non diabetic organ donors, from the two patients with acute-onset IDDM, and two patients with chronic IDDM (Conrad et al., 1994) were cultured for 1 week and supernatants were analyzed for the presence of RT-activity. Results are expressed as mean +/-1 SD for at least three individual measurements.

Figure 2A. Isolation of a single full length retroviral genome, IDDMK1, 222, with a six step procedure.

1) cPBS primers (Lys1, 2, Lys3, Pro, Trp) were used to perform a 5'RACE 2) the eight 5'R-U5 sequences obtained in 1) were used to perform a 3'RACE with primers annealing in the R 3) the conserved RT-RNAse H region was amplified with degenerate primers 4) the 5' moiety (the predicted size for full length HERV-K- retroviruses is 3.6 kb was amplified by PCR using primers specific for the eight 5'R-U5 sequences in conjunction with a primer specific for the 3'of the central pol region obtained in step 3. The primer specific for the K1r222 5'consistently yielded a fragment of this size, 5) the 3' (the predicted size for HERV-K-retroviruses is 5 kb) was amplified by PCR using a primer specific for the 5'of the central pol region isolated in step 3 and primers specific for the poly (A) signals present in the 3'R-poly (A) sequences obtained in step 2. The PCR reaction using a primer specific for the 3'clone Ka, 222 (amplified in step 4) consistently yielded a fragment potentially representing an intact 3'HERV-K moiety of 5 kb, 6) the presence of an intact 8.6 kb retroviral genome containing the overlapping 5'and 3'moieties isolated in steps 4 and 5 was confirmed by PCR using primers specific for its predicted U5 and U3 regions.

Figure 2B. Consensus features of retroviral 5'end sequences (termed STRs). These consensus features are valid for retroviruses with a polyadenylation signal in the R (repeat) region. The R region is characterized by the AATAAA or ATTAAA polyadenylation signal (bold) followed by 13 to 20 nucleotides and the dinucleotide CA or GA (bold) at the 3'end of the R region. The beginning of U5 region is defined by a GT-or T-rich sequence (underlined). The 3'end of the U5 region is in all known retroviruses defined by the dinucleotide CA, followed by one, two or three nucleotides and the primer-binding site (PB)- (N) stands for nucleotide, the suffixes x, y, and z for an undefined number.

Figure 2C. Schematic representation of mRNA-specific PCR of IDDMK1 2-22 using a poly (A)-specific probe (Rc- T (4)). Details of this technique are given in the « Experimental Procedure » Section of the Examples.

This procedure results in a Reverse-Transcriptase- dependent amplification of retroviral genomes. The products generated can be diminished below background by RNAse treatment.

Figure 2D. Schematic representation of IDDMK1 2-22 Provirus-specific PCR. The procedure specifically amplifies proviral 5'and 3'LTRs (long terminal repeats).

The primers used in an RT-control are substituted with either U5-primers 1) 5'ATC CAA CAA CCA Tga Tgg Ag 3'or 2) 5'TCT Cgt Aag gTg CAA Atg Aag 3'at 0.3 uM final concentration in conjunction with the U3-primers using either 3) gTA Aag gAT CAA gTg Ctg TgC 3'or 4) 5'CTT TAC AAA gCA gTA Ttg Ctg C 3'at 0.3 uM final concentration. 0.75 pi of Taq-Pwo-polymerase mix (Boehriner Mannheim, Expand High Fidelity PCR System) are used with a thermocycler profile corresponding to the one described for mRNA-specific RT-PCR and omitting the RT step. Hybridization is performed with the probe and the methods corresponding those used for mRNA-specific RT- PCR.

Sequence identity is confirmed by sequencing according to standard procedures.

Figure 2E. IDDMK1. 2-22 RNA-and Provirus-specific PCR.

This procedure will result in amplification products independentely of the presence or absence of RT- reactions and reflects the total retroviral RNA-and DNA-templates present in a given sample.

The same conditions as in the proviral specific PCR are used with U3 primers 1) 5'AAC ACT gCg AAA ggC CgC Agg 3'or 2) 5'Agg TAT TgT CCA Agg TTT CTC C 3'in conjunction with R (repeat) primers 3) 5'CTT TAC AAA A gCA gTA TTg Ctg C 3'or 4) 5'gTA Aag gAT CAA gTg Ctg TgC 3'. Cycling conditions and primer concentrations are identical to those described for proviral specific PCR.

Figure 2F. IDDMK1, 222 is an endogenous retrovirus found in the plasma of IDDM patients at disease onset but not in the plasma of healthy controls.

PCR primers pairs were designed that are either specific for the U3-R-or for the U3-R-poly (A)-region of IDDMK1, 222 (see Experimental Procedures). The U3-R primer pair amplified both viral RNA and DNA, whereas the U3-R-poly (A) primer pair amplified selectively viral RNA. The amplified material was hybridized with probes generated with the molecularly cloned U3-R region of IDDMK1, 222. Signals in the first and third rows correspond to amplification of contaminating DNA present in the plasma of IDDM patients (left hand columns, 1-10) and controls (right hand columns, 1-10) and were as expected RT-independent. In contrast, signals in the second row resulted from the amplification of viral RNA present only in IDDM patients (left hand columns, 1-10) but not in the non diabetic controls (right hand columns, 1-10). This was supported by the absence of amplification products in reactions lacking RT (fourth row, right and left hand clumns, 1-10). In addition the signal could be diminished below background by RNAse treatment (data not shown). In the fifth row the genomic DNA from IDDM patients and controls was amplified with the U3-R- specific primers. The primer pair specific for the U3- R-poly (A), in turn, did not result in amplification of genomic DNA (data not shown).

Figure 3. Phylogenetic trees of coding and non-coding regions place IDDMK1, 222 in the HERV-K10 family of HERVs.

(A) IDDMK1 222 SU-ENV is most closely related to HERV- K10, and is also related to the B-type retroviruses MMTV and JSRV.

(B) The phylogenetic analysis of the RT region shows that IDDMK1 222 belongs to the HERV-K10 family and is more closely related to B-type retroviruses such as MMTV than to D-type retroviruses such as Simian Mason Pfizer (SMP) or Spumaviridae (SFV). Abbreviations used: SRV-2, Simian retrovirus; JSRV, Jaagsiekte Sheep retrovirus; SFV; Simian foamy virus).

(C) The non-coding LTR region was used to construct a phylogenetic tree of the HERV-K family. K1,21 and K1 24 (see above) were isolated only as subgenomic or truncated transcripts. K1,21 is related to KC4, while K1,24 and IDDMK1 222 are related to the K10/K18 subfamily. Within this family, K1, 24 is closely related to K10, whereas IDDMK1,222 appears to be more distant.

Figure 4. The pol-env-U3-R region of IDDMK1, 222 exerts an MHC class II dependent but not MHC restricted mitogenic effect upon transfection in monocytes.

(A). IDDMKi22 is expected to generate two singly spliced subgenomic RNAs, one encoding ENV, and one comprising the U3-R region. The episomal expression vector was engineered to carry a proximal SD downstream of the promoter (pPOL-ENV-U3). Thus, the two naturally expected subgenomic RNAs can also be generated.

(B) Monocytic cell lines do not express MHC class II surface proteins in the absence of induction by Interferon-g (INF-g), (reviewed by Mach et al., 1996).

The monocyte cell line THP1 was transiently transfected with pPOL-ENV-U3 or with the expression vector alone (pVECTOR). Mitomycin C treated transfectants, either induced with INF-g for 48 h or non-induced (+/-INF-g, indicated below the x-axis) were cultured with MHC- compatible T cells at different responder : stimulator ratios as indicated below the graphs (T : APC). 3H- Thymidine incorporation was measured during the last 18 h of a 72 h culture and is given on the y-axis as n x 103 cpm. Results are presented as mean +/-1 SD.

(C) The MHC class II transactivator CIITA mediates INF- g inducible MHC class II expression (reviewed by Mach et al., 1996). An integrative and stable THP1-CIITA transfectant (THP1-CIITA) was transfected with pVECTOR or pPOL-ENV-UR and was used in functional assays identical to those described in Figure 4B.

(D) Peripheral blood lymphocytes (PBL) from healthy, MHC-unrelated donors (donors I, II and III indicated below the x-axis) were cultured with retroviral (pPOL- ENV-U3) and control transfectants (pVECTOR) at T : non -T ratios as indicated below the graphs (T : APC).

Figure 5. IDDMK1, 222 mediates a Vb 7-specific SAG- effect.

106 T cells/ml were cultured for 3 days with Mitomycin- treated pPOL-ENV-U3 and pVECTOR transfectants at T : non-T ratios as indicated. Twenty U/ml of recombinant IL-2 were then added to the cultures and FACS analysis performed after 3 to 4 days of expansion (Conrad et al., 1994).

(A) THP1 cells were transfected with pPOL-ENV-U3, the stimulated and expanded T cells were stained with anti- CD3 monoclonal antibodies and an isotype control after 7 days of coculture.

(B) T cells stimulated by THP1 transfected with the vector (pVECTOR) alone were stained with anti-CD3 monoclonal antibodies and the anti Vb 7-specific antibody 3G5.

(C) THP1 cells were transfected with pPOL-ENV-U3, the stimulated T cells were stained with anti-CD3 monoclonal antibodies and the anti Vb 7-antibody 3G5.

Tablel. IDDMK1, 222 mediates a Vb 7-specific SAG-effect.

The B lymphoblastoid cell line Raji was stably transfected with either pPOL-ENV-U3 or pVECTOR, and used in functional assays (equivalent to Figure 5) 2 weeks after selection. The monocytic cell line THP1 was cultured for 48 hours after transfection with the same constructs. The percentages of double positive (CD3 and Vb-7, Vb-8,-12) T cells are indicated that were obtained after 1 week of coculture with the respective transfectants (pPOL-ENV-U3 or pVECTOR).

Figure 6. The N-terminal env moiety of IDDMK1, 222 mediates the SAG-effect.

(A). Based on the construct pPOL-ENV-U3 different deletional mutants were generated that comprised 1) pPOL: the pol gene; 2) pPOL-ENV/TR: the poJ-. and the N-terminal moiety of the env-gene; 3) pCI-ENV/TR: the N-terminal moiety of env-gene alone.

(B). PBL from MHC unrelated donors were cocultured with Mitomycin C treated THP1 cells as described in Figure 4. The individual transfectants are indicated with the names of the constructs above the bars. (1) pVECTOR, 2) pPOL, 3) pPOL-ENV-U3,4) pPOL-ENV/TR, 5) pCI-neo, 6) pCI-ENV/TR). One of at least three independent 3H- Thymidine incorporation experiments with allogeneic T cells stimulated by the individual transfectants is shown. The ratio between T cells and transfectants is indicated below the bars (T : APC).

Figure 7A. IDDMK1. 222-5'LTR.

This figure shows the sequence of the 5'LTR (U3 RU5) of the IDDMK1 222-provirus.

Figure 7B. IDDMK1, 222-3'LTR.

This figure shows the sequence of the 3'LTR (U3 RU5) of the IDDMK1. 222 provirus.

Figure 7C. IDDMK1 222-env.

This figure shows the full nucleotide sequence of the env coding region, starting with the ATG initiation codon at position 59 (as shown in Figure 7D).

The first internal stop codon TAG at position 518 is underlined corresponding to the codon where, following a-1 frame shift, translation stops to give rise to the protein illustrated in Figure 7D. The second internal stop codon TAG at position 601 (in frame with the earlier TAG) is also underlined.

Translational stop at this codon gives rise to the IDDMK1 222-ENV/FS (SAG) protein illustrated in Figure 7G. The nucleic acid coding for the IDDMK1 222- env/fs (SAG) protein is also shown in Figure 7E.

Figure 7D. The nucleotide and deduced amino acid sequence of IDDMK1 222-SAG.

The minimal stimulatory sequence corresponding to the insert of pCI-ENV/TR comprises a C-terminally truncated protein of 153 amino acids. There is only one ORF with a stop codon at position 518. The first potential start codon in a favorable context is at position 59. Two potential N-linked glycosilation sites are present at positions 106, and 182 respectively. The degree of homology with other retroviral ENV proteins is shown in Figure 3A. No significant homology was detected with the SAG of MMTV or with autoantigens known to be important in IDDM.

Figure 7E. IDDMK1. 222-env/fs-sag.

Wild-type Nucleotide sequence coding for the 181 amino acid IDDMK1.222-ENV/FS-SAG protein shown in Figure 7G. To give rise to the SAg protein shown in figure 7G, translation of this nucleotide sequence involves a read-through of the first stop codon at position 518 followed immediately by a-1 frame shift.

Figure 7F. IDDMK1 222-ENV.

Deduced amino acid sequence encoded by the full env coding region (as shown in Figure 7B), without frame shift.

The underlined « Z » is the stop site stop site 153 amino acid protein shown in Figure 7D.

Figure 7G. Recombinant IDDMK1. 222 ENV/FS (SAG).

With respect to wild-type IDDMK1. 222 env an insertion of a T at position 517 (underlined) results in a predicted protein corresponding to the one expected to be generated by IDDMK1. 222 ENV/FS. The additional predicted C terminal amino acids that characterize ENV-FS are underlined. This protein has marked SAg activity.

Figure 7H. IDDMK1 222 POL.

Deduced amino acid sequence of the POL protein of IDDMK1 222.

Figures 8A to 8G illustrate candidate 5'STRs isolated in the first step of the six-step procedure (illustrated in Figure 2A) to isolate putative retroviral genomes from IDDM patients.

Figure 9. Functional assay for the presence of Vß7- IDDM-SAG in PBL.

PBL (peripheral blood lymphocytes) are isolated from 10ml of Heparine-blood (Vacutainer) from IDDM patients or controls with Ficoll-Hypaque (Pharmacia).

5 x 106 PBL are incubated with or without 103 U/ml recombinant human INF-y (Gibco-BRL) for 48 hours.

100 ug/ml Mitomycin C (Calbiochem) are added to inactivate for 10 cells for 1 hour at 37°C, and extensive washing is performed.

Culture with T cell hybridomas bearing human Vus-2,-3, -7,-8,-9,-13 and-17 at stimulator : responder ratios of 1 : 1 and 1 : 3 in 96 round bottom wells.

TCR-crosslinking with anti-CD3 antibodies (OKT3) is used as a positive control for each individual T hybridoma.

IL-2 release into the supernatant is measured with the indicator cell line CTLL2 according to standard procedures.

Results are expressed as percentage of maximal stimulation obtained with TCR crosslinking in the same experiments.

A selectively induced TCR-crosslinking and IL-release of Vß7 is interpreted as being compatible with the presence of IDDM-SAG in PBL from the individual analysed.

EXAMPLES In two patients with type I diabetes, a dominant pancreatic enrichment of one Vb-family, Vb 7, has been observed (Conrad et al., 1994). The same dominant enrichment of Vb 7 could be mimicked by stimulating T cells of diverse haplotypes with surface membrane preparations derived from the pancreatic inflammatory lesions but not with membranes from MHC-matched healthy control islets. This was taken as evidence for the presence of a surface membrane-associated SAG (Conrad et al., 1994).

In the framework of the present invention, the hypothesis that this SAG is of endogenous retroviral origin has been tested. Below it is shown that the SAG identified in these two patients is encoded by a human endogenous retrovirus related to MMTV. Expression of this endogenous SAG in IDDM suggests a general model according to which self SAG-driven and systemic activation of autoreactive T cells leads to organ- specific autoimmune disease.

Example 1. Cultured leukocytes from inflammatory b-cell lesions of IDDM-patients release Reverse Transcriptase activity Expression of cellular retroelements may be associated with measurable Reverse Transcriptase-activity (RT) (Heidmann et al., 1991). An RT-assay detected up to a hundredfold increase in RT-activity in supernatants from short-term cultures of freshly isolated pancreatic islets derived from two patients (Figure 1A), (Conrad et al., 1994; Pyra et al., 1994). No RT-activity above background levels was detected in medium controls, indicating that the RT-activity could not be accounted for by a contamination of the synthetic media and sera with animal retroviruses. We can also exclude the possibility that the RT-activity represents cellular polymerases released into the supernatant by dying cells. Indeed, no RT-activity can be detected in cultures from non-diabetic controls under conditions in which cell death is strongly enhanced, namely mitogen treated peripheral blood lymphocytes (PBL), splenocytes and cocultures of islets with allogeneic T cells.

Moreover, the IDDM-derived islets were cultured for 5 days, whereas control cultures were sequentially analysed for up to 4 weeks. Finally the absence of RT- activity in the supernatants of the mitogen-treated control PBL also excluded the possibility that the RT- activity detected with the IDDM islets was simply due to non-specific cell activation. Both, the islets and the inflammatory infiltration represented potential sources for the enzymatic activity. As shown in Figure 1B, supernatants from cultured spleen cells from the patients contained more RT-activity than the inflammatory b-cell lesions. Moreover, the RT-activity disappeared together with the local inflammatory lesion in two patients with chronic and long-standing disease, but it persisted in cultured spleen cells from the same patient (Figure 1B). This was interpreted as being compatible with the leukocytes as the most likely source of this RT-activity.

Example 2. Isolation of a full length retroviral genome, IDDMK1, 222, from supernatants of IDDM islets A strategy to isolate putative retroviral genomes from polyadenylated RNA extracted from the supernatants of IDDM islets was developed (Figure 2A). This strategy relies on the following three characteristic features of functional retroviruses. First, retroviral genomes contain a primer binding site (PBS) near their 5'end.

Cellular tRNAs anneal to the PBS and serve as primers for Reverse Transcriptase (reviewed by Whitcomb and Hughes, 1992). Second, the R (repeat) sequence is repeated at the 5'and 3'ends of the viral RNA (Temin, 1981). Third, the RT-RNAse H region of the pol gene is the most conserved sequence among different retroelements (McClure et al., 1988 ; Xiong and Eickbusch, 1990). These three features were exploited in a six step procedure as follows.

1) To isolate the 5'ends (5'R-U5) of putative retroviral RNA genomes, a 5'RACE procedure was performed with primers complementary to known PBS sequences (cPBS primers) (Weissmahr et al., 1997). Most retroviruses known have a primer binding site (PBS) complementary to one of only four individual 3'ends of <BR> <BR> tRNAs : tRNAPr° tRNALY53 tRNALysl. 2 and tRNA P.

Accordingly, sequence-specific primers complementary to the four PBSs were used to derive cDNA (Weissmahr, 1995). The amplification products resulting from anchored PCR and of 100-700 bp in size were sequenced and analyzed for the presence of consensus sequences typically found in retroviral 5'R-U5s (Weissmahr, 1995).

Eight different candidate 5'R-U5 sequences (5'kil, 2- 1,-4,-10,-16,-17,-22,-26 and-27) were obtained with the cPBS-Lysinel, 2 primer. All eight sequences contained features typical of the 5'ends of retroviral genomes (Temin, 1981). These include the presence at the expected positions of i) a PBS region, ii) conserved and correctly spaced upstream regulatory sequences, such as a poly (A) addition signal and site, and the downstream GT-or T-rich elements (Wahle and Keller, 1996), iii) a putative 5'end specific U5 region and iv) a putative R region. Of the eight 5'R- U5 sequences isolated, three (5'K1, 2-l,-4, and-22) were identified on the basis of sequence homology as belonging to previously identified families of human endogenous retroviruses (HERVs) that are closely related to mouse mammary tumour viruses (MMTV), namely HERV-K (C4) (Tassabehji et al., 1994), HERV-K10 and HERV-K18 (Ono, 1986a; Ono et al., 1986b). The remaining five sequences exhibited only a distant relationship with HERV-K retroviruses.

2) A repeat (R) region conserved in the 5'R-U5 and the 3'U3-R-poly (A) is essential for retroviral first strand DNA synthesis to proceed to completion (Whitcomb and Hughes, 1992). Primers specific for the R region-sequence obtained for individual 5'R-U5s were used to prime the cDNA synthesized with oligo (dT), (Weissmahr, 1995). Products resulting from anchored PCR were sequenced and analyzed for the presence of a conserved R region followed by a poly (A)-tail. The eight 3'R-poly (A) ends (3'K,, 2-"-4,-10,-16,-17,- 22,-26 and-27) corresponding to the eight different 5'R-U5 regions identified in step 1 were isolated by means of a 3'RACE procedure using primers specific for the R regions. In each case, the isolated sequences contained the expected R region followed by a poly (A) tail.

3) The conserved RT-RNase H region within the pol gene was next amplified by PCR using degenerate primers (Medstrand and Blomberg, 1993). 15 individual subclones were sequenced and all exhibited approximately 95% similarity at the protein level to the RT-RNase H region of the HERV-K family.

4) The 5'moiety (from the U5 region at the 5'end to the pol gene) of the putative retroviral genome was amplified by PCR using primers specific for the eight different U5 regions present in the 5'R-U5 sequences (isolate in step 1) in conjunction with a primer specific for the 3'end of the central pol region (isolated in step 3). The expected size of the PCR product corresponding to the 5'moiety of full length HERV-K retroviruses is 3.6 kb (Ono et al., 1986b). Only the PCR reaction using the primer specific for the K1,222 5'end clone consistently yielded a fragment of this size. Sequence analysis of several independent clones confirmed that this 3.6 kb fragment contains the R-U5-PBS region followed by coding regions corresponding to the gag and pol genes, and thus indeed represents the 5'moiety of an intact retroviral genome.

5) The 3'moiety (from the pol gene to the 3'end) of the putative retroviral genome was amplified by PCR using a primer specific for the 5'end of the central pol region (isolated in step 3) and primers specific for the poly (A) signals present in the 3'R-poly (A) sequences (isolated in step 2). The expected size of the PCR product corresponding to the 3'moiety of full length HERV-K-retroviruses is 5 kb (Ono et al., 1986b).

The PCR reaction using a primer specific for the 3'end clone Kl, 222, which is the one that should correspond to the 3'end of the retrovirus from which the 3.6 kb 5' moiety was amplified in step 4, consistently yielded a fragment potentially representing an intact 3 moiety of 5 kb. Sequence analysis of several independent clones confirmed that this 5 kb fragment indeed contains coding regions corresponding to the pol and env genes followed by the expected U3-R-poly (A) region.

6) Finally, the presence of an intact 8.6 kb retroviral genome containing the overlapping 5'and 3' moieties isolated in steps 4 and 5 was confirmed by PCR using primers specific for its predicted U5 and U3 regions.

The full length retroviral genome that was isolated was called IDDMK,, 222, where IDDM refers to the tissue source, K1, 2 refers to Lysinel, 2 cPBS primer and 22 represents the serial number of the clone. IDDMK1, 222 was determined to be novel retrovirus on the basis of two criteria. First, it has a unique pattern of restriction enzyme cleavage sites that is distinct from that of other known viruses. Second, its nucleotide and amino acid sequences in non-coding and coding regions diverge from other known retroviruses by at least 5-10 %.

IDDMK1, 222 was the only full length virus identified in these experiments, suggesting that it is the only functional retrovirus specifically associated with the supernatants of the cultured IDDM islets. PCR reactions using primers specific for the other 5'R-U5- PBS and 3'U3-R-poly (A) clones isolated in steps 1 and 2 did not yield fragments of the size expected for intact retroviral genomes in steps 4 and 5. In particular, primers specific for the 5'and 3'ends corresponding to the ubiquitous HERV-K10 virus did not amplify fragments corresponding to complete genomes, although this virus is known to be released as full length genome associated with viral particles from several cell lines and tissues (Tonjes et al., 1996). Our inability to detect full length HERV-K10 genomes in the IDDM islet supernatant is unlikely to be due to a technical problem because it could be amplified very efficiently from both genomic DNA and a size selected cDNA library prepared from a B-lymphoblastoid cell line (data not shown). It is more likely that HERV-K10 is not released in significant amounts by the cultured IDDM islets.

Finally, i) we confirmed by RNA-specific PCR that sequences identical, or highly similar, to the 3'U3-R- poly (A) of IDDMK were present in RT-positive but not in RT-negative samples analysed; ii) in a preliminary epidemiological study we detected by PCR sequences identical, or highly similar, to the 3'U3-R-poly (A) of IDDMK1, 2 only in the plasma of 10 recent onset IDDM patients but not in the plasma of 10 age-matched non diabetic controls (Figure 2F); and iii) we confirmed by PCR the presence of sequences identical, or highly similar to the U3-R region of IDDMK1,2 in genomic DNA of IDDM patients (n = 10) and non diabetic controls (n = 10) (Figure 2F). In summary, these data indicate that IDDMK1,2 is an endogenous retrovirus that is released from leukocytes in IDDM patients but not in non diabetic controls. Example 3. IDDMK1,222 is a novel member of the MMTV- related family of HERV-K, and is related to HERV-K10 To evaluate the relationship between IDDMK1, 222 and other known retroviruses we derived phylogenetic trees for subregions exhibiting different degrees of conservation (Galtier et al., 1996; Saitou and Nei, 1987; Thompson et al., 1994). The three regions chosen for this analysis were the RT region of the pol gene (Figure 3B), the outer region (SU, surface) of the env gene (Figure 3A) and the U3 region of the LTR (Figure 3C). The RT and SU regions were selected to construct interspecies phylogenetic trees because they represent, respectively, the most highly conserved and the most variable of the protein coding regions (McClure et al., 1988). The U3 region of the LTR was chosen to construct an intraspecies tree of the family to which IDDMK1, 222 belongs because LTR sequences are conserved in size and sequence only within a given species, and the U3 region accounts for most of the intraspecies differences (Temin, 1981). As shown in Figure 3A, the ENV polyprotein of IDDMK,, 222 is most closely related to that of HERV-K10. Both proteins are related to those of MMTV and Jaagsiekte sheep retrovirus (JSRV). The same is essentially true for the RT-subregion of the POL polyprotein, where IDDMK1,222 and HERVK10 are most closely related to the B-type retrovirus MMTV (Figure 3B). Figure 3C illustrates, that Kl, 21 is related to HERV-K (C4), while K4 24 and IDDMK1, 222 are related to the K10/K18 subfamily. Within this family, K1,24 is closely related to K10, whereas IDDMK1, 222 appears to be more distant.

Example 4. IDDMK1, 222 encodes a V57-specific SAG The strategy used to identify a putative SAG- function encoded by IDDMK1, 222 was dictated by 1) predictions based on the biology of the MMTV-SAG, 2) general requirements for a protein-protein interaction between a SAG and MHC class II molecules and 3) intracellular trafficking mechanisms used by proteins encoded by retroviruses. The prototypical retroviral SAG of MMTV is a type II transmembrane protein that is encoded within the U3 of the 3'LTR (reviewed by Acha- Orbea and McDonald, 1995). It is targeted into the MHC class II peptide loading compartment and exported to the cell surface. On the basis of potential splice donor (SD) and acceptor sites (SA) present in its sequence, IDDMK1, 222 is expected to generate two subgenomic mRNAs, one encoding ENV and a second transcript comprising the U3-R region (Figure 4A).

Based on these criteria we produced an episomal expression construct (pPOL-ENV-U3) with a 5'SD positioned upstream of the truncated pol, env and U3- regions (Figure 4A). It is expected that both of the putative subgenomic mRNAs can be generated from this construct (Figure 4A).

Retroviral-and control-transfectants of monocyte-and B lymphocyte-cell lines were generated and tested for their ability to stimulate MHC compatible and allogeneic T cell lines in a Vp7- specific manner. Monocytes do not express measurable MHC class II surface proteins in the absence of induction by Interferon-y (INF-y); the MHC class II transactivator CIITA mediates INF-y-inducible MHC class II expression (reviewed by Mach et al., 1996). As shown in Figure 4A, transient monocyte (THP1, U937) transfectants induced with INF-g and expressing the truncated IDDMK1, 222 genome (pPOL-ENV-U3) stimulated in a dose-dependent fashion T cell lines from MHC- compatible donors essentially to the same extent. The mitogenic effect was dependent on the presence of MHC class II, since INF-g-mediated MHC class II expression specifically induced the stimulatory capacity of retroviral-as compared to control-transfectants (Figure 4B). The use of THP1 cells rendered constitutively MHC class II positive by transfection with CIITA resulted in a stimulation comparable to INF- g-induction, suggesting that the INF-g-induced and CIITA-dependent MHC class II expression was indeed responsible for this functional difference (Figure 4C).

The mitogenic effect is not MHC-restricted, since a response exceeding allostimulation was observed when PBL from several different MHC-disparate donors were tested for proliferative responses to monocytes transfected with pPOL-ENV-U3 (Figure 4D). In essence, these functional data suggest that the truncated IDDMK1, 222 (pPOL-ENV-U3) genome is responsible for a mitogenic effect that is MHC class II-dependent but not MHC-restricted.

Experiments were performed in bulk-cultures using TCR-V (3-specific stimulation and expansion as a readout.

Retroviral THP1 transfectants induce a more than 15 fold increase in the number of the Vus-7 family but not of the two control families tested (V8, Vbl2) after specific stimulation and subsequent amplification (Figure 5, Table 1). This was verified by using two different V (3-7-specific monoclonal antibodies, 3G5 and 20E. A comparable effect was also observed when PBL from MHC-disparate donors were tested. This was interpreted as evidence for the presence a Vus-7- specific SAG.

The monocytic cell lines were at least 3 times more efficient in terms of specific TCR Vb-7 amplification as compared to the most efficient B lymphoblastoid cell line (Table 1). This difference could not be explained by variations in the level of MHC class II expression or by the individual MHC haplotypes present. On the other hand, it may be due to differential expression of costimulatory molecules or secretion of cytokines. In conclusion, by all criteria known to date, IDDMK1, 222 encodes a mitogenic activity having all features of a Vb-7-specific SAG.

TABLE 1 : IDDMK1 222 mediates a V (37-specific SAG-effect TRANSE1 : CTANT Vß-FAMILY Vß-7 Vß-8 Vß-12 Raji-pPOL-ENV-U37% 5%2.5% Raji-pVECTOR 2% THP1-pPOL-ENV-U316% 5.3%2.8% THP1-pVECTOR1% 5. 8%3% Example 5. The SAG function is mediated by the N- terminal moiety of the env protein A series of deletional mutants were generated that contained either the truncated pol-env-U3 region (pPOL-ENV-U3), the truncated pol gene alone (pPOL), or the truncated pol gene followed by the env gene truncated downstream of the premature stop codon found in all clones (pPOL-ENV/TR), (Figure 6A). In addition, a C-terminally truncated env gene was generated as an individual expression unit (pCI-ENV/TR). As shown in Figure 6B, by excluding the env-coding region the SAG- function is selectively lost (pPOL). If, however, the truncated env gene is included (pPOL-ENV/TR), the stimulatory capacity is restored to levels comparable to pPOL-ENV-U3. In addition, expression of the truncated env gene alone (pCI-ENV/TR) is sufficient for function. These findings demonstrate that the SAG function is mediated by the N-terminal moiety of the env gene comprising 153 amino acids. The nucleotide and predicted amino acid sequences of the minimal stimulatory region are shown in Figure 7. As shown in Figure 3A, this predicted protein resembles the N- terminal ENV proteins of related HERVs (HERV-K10), and those of the B-type retroviruses (MMTV, JSRV). However, there is no significant sequence homology with either MMTV-SAG, other SAGs, or autoantigens known to be important in IDDM.

Here, evidence is provided showing that a human endogenous retrovirus, IDDMK1, 222, is released from leukocytes in patients with acute onset type I diabetes. In preliminary experiments IDDMK1, 222 RNA sequences were detectable in the plasma of IDDM patients at disease onset but not in the plasma of age- matched healthy controls. This novel human retrovirus is related to MMTV and encodes a SAG with functional characteristics similar to the one encoded by MMTV. In contrast to MMTV, however the IDDM-associated SAG is encoded within the retroviral env gene rather than within the 3'LTR. It has the same TCR Vp7-specificity with the SAG originally identified in the IDDM patients. This SAG is thus likely to be the cause of the Vb7-enriched repertoire of islet-infiltrating T lymphocytes.

IDDMKC, 222 as a member of the HERV-K class of endogenous retroviruses HERV-K genomes exist in two different forms, type I genomes which are largely splice deficient and type II genomes which generate three subgenomic mRNAs (Tonjes et al., 1996 ; Ono, 1986). A 292 bp insert at the pol- env boundary with clustered nucleotide changes downstream of the splice acceptor site are present in type II but not in type I genomes (Tonjes et al., 1996). The insert affects both, the env and pol gene: i) type II genomes have a stop codon between env and pol which is missing in type I genomes and ii) have a considerably longer N terminal env region. The 292 bp insert and the clustered nucleotide changes have been proposed to be responsible for the efficient splicing of type II genomes (Tonjes et al., 1996). IDDMK1, 222 is missing the 292 bp insert but has two in frame stop codons between env and pol and the clustered nucleotide changes downstream of the SA typical of those found in type II genomes. In terms of splice efficiency, IDDMK1t222 may be in an intermediate position between type I and II genomes. This and the altered N terminal sequences in IDDMK1t222 with respect to type II genomes may affect SAG expression in vivo.

However, as shown in Figure 4, the 3'terminal moiety (POL-ENV-U3) of the IDDMK1, 222 genome mediates the SAG function in vitro. Moreover, it is known from MMTV that the SAG function in vivo may be present at levels where the respective protein remains undetectable (Winslow et al., 1992; reviewed by Acha-Orbea and MacDonald, 1995).

The model: human self SAGs as activators of autoreactive T cells in type I diabetes A model is proposed according to which induction of self SAGs in systemic and professional APCs, outside the pancreas, leads to autoimmunity in genetically susceptible individuals. The model implies two steps, the first is systemic, the second organ-specific. The initial event is a systemic, polyclonal activation of a Vb-restricted T cell subset, triggered by the expression of an endogenous retroviral SAG in professional MHC class II+APCs. In a second step, autoreactive T cells within the subset of SAG-activated T lymphocytes initiate organ-specific tissue destruction. The evidence presented here, however, does not rule out that the release of the IDDMK1, 222 RNA sequences in vivo and the SAG function associated with IDDM in these patients are the consequence rather than the cause of the inflammation.

The expression of self SAGs can in principle be modulated by two variables: physiological endogenous stimuli or environmental stimuli. A possible physiological stimulus might be steroid hormones. HERV- K10 expression is steroid-inducible in vitro and this is possibly the result of hormone response elements (HRE) present in its LTR (Ono et al., 1987). IDDMK1, 222 and HERV-K10 share the same putative HRE in their respective LTRs (Ono et al., 1987), (Figure 3). Steroid inducibility of IDDMK1, 222 could therefore also occur in vivo, in analogy to the well documented example of the transcriptional control by steroid hormones of the MMTV promoter (reviewed by Acha-Orbea and Mac Donald, 1995). Infectious agents are of major importance when considering environmental factors. Examples include the cellular SAGs that are expressed by herpesvirus- infected monocytes and B-lymphocytes (Dobrescu et al., 1995; Sutkowski et al., 1996). In both cases, HERVs have not been excluded as a potential source of the SAG-activity. It is thus conceivable that SAGs are being selectively expressed in response to ubiquitous pathogens such as herpesviridae (reviewed by Roizman, 1996). In fact, HERVs are induced by a variety of environmental stresses, and some of them behave as hepatic acute-phase genes (reviewed by Wilkinson et al., 1994).

The experimental evidence presented suggests that the RT-activity, the IDDMK1222 RNA sequences and in consequence the SAG may derive from leukocytes rather than from the pancreatic b-cells. This may indicate that expression of the retroviral SAG is induced preferentially in systemically circulating professional MHC class II+ APCs. The highest rate of IDDM coincides with puberty (10-14 years) in both sexes (Bruno et al., 1993). Infections with ubiquitous viruses (reviewed by Roizman, 1996) may act synergistically with an increase in the circulating levels of steroids to enhance expression of the SAG in professional APCs.

Autoreactive T cells can be readily demonstrated in the mature repertoire of healthy individuals (Pette et al., 1990). However, in order to able to migrate to the target tissue these T cells have to be activated (reviewed by Steinman, 1995). These considerations lead us to the hypothesis that among the Vb7+-T cells activated by IDDMK1 222-SAG, some are autoreactive and migrate to the target tissue were b-cell specific death ensues. Once b-cells die, cellular antigens are liberated and the immune response perpetuated through determinant spreading (reviewed by McDevitt, 1996).

The concept of IDDMK1,222-sag as autoimmune gene Known genes conferring susceptibility to autoimmune diseases are host-derived, stably inherited Mendelian traits and contribute in a cumulative fashion to the familial clustering of the disease without causing disease per se (reviewed by Todd, 1996). IDDMK1, 222 should be viewed as mobile genetic element with the potential to move within the host genome due to multiple mechanisms, including retrotransposition, homologous recombination, gene conversion and capture, resulting in multiple copies of individual HERVs (reviewed by Preston and Dougherty, 1996; Wain-Hobson, 1996). This renders family studies dealing with searches for HERV-disease association difficult. It should be noted, however, that there is little or no plus/minus genetic polymorphism in different humans at the HERV-K loci and as yet no evidence for mobility.

Interestingly, an IDDMK1, 222-related HLA-DQ-LTR is associated with susceptibility to IDDM, possibly due to cosegregation with the HLA (Figure 3C), (Badenhoop et al., 1996). In addition, infectious transmission cannot be excluded, as is the case for two closely related virus groups containing endogenous and exogenous variants: MMTV and JSRV (Figure 4A and 4B), (reviewed by Acha-Orbea and McDonald, 1995; York et al., 1992).

In summary, this candidate autoimmune-gene has distinctly different features from classical, disease- associated susceptibility genes. It has the potential of being transmitted as either an inherited trait or as an infectious agent. Moreover, this gene has no apparent essential function for the host but it may have instead an inducible and intriguing potential to directly cause disease whenever expressed in genetically susceptible individuals.

Example 6. Developement of an animal model to document and study the Sag effect in vivo Several mouse cell lines, in particular a B lymphocytes line (A20) and a monocyte line (WEHI-3) were stably transfected with the IDDM Sag cDNA (corresponding to the minimal region encoding a. a. 1 to 153 of the env protein of IDDM1,2,22, as described above). The B cell lines express mouse MHC class II molecules constitutively. In the case of monocyte lines, the transfectants are induced to express mouse MHC class II molecules by treatment with mouse interferon gamma (100-1000 units of mouse interferon (Genzyme) per ml for 48 hrs).

These MHC class II positive Sag transfectants were capable of stimulating (in vitro) human T lymphocytes of the V (37 specificity, and not Vß8 or V (312 as negative controls. This demonstrates that the IDDM Sag can function when expressed on MHC class II positive mouse cells. These Sag-expressing, MHC class II positive, mouse transfectants are used to immunise mice against the Sag protein and to generate anti Sag monoclonal antibodies, using as control the homologous untransfected cell lines.

This Sag effect lead to the stimulation of Vß7- specific T lymphocytes of both the CD4 and the CD8 type. This observation indicates that the IDDM Sag functions in T cell activation in a manner that is independent of the co-receptors CD4 and CD8. This situation is different from what is observed in the case of the mouse MMTV Sag, where only CD4 T lymphocytes are stimulated.

The same MHC class II positive mouse stable Sag transfectants (A 20, B lymphocytes and WEHI-3, monocytes), expressing the minimal functional region of IDDM Sag defined above (and corresponding to a. a. 1 to 153 of the env protein of IDDM1,2,22) specifically stimulated mouse T lymphocytes of the Vß4 and the VP10 specificity. (These are the most highly related mouse Vb sequences, from a structural point of view, to human Vu7).

Again, both CD4 and CD8 mouse T lymphocytes were activated, indicating a Sag mediated activation that is independent of the CD4 and CD8 co-receptors.

More importantly, injection of the same stable Sag transfectants into mice (either in the bind foot path or in the tail vein) lead to in vivo activation of T lymphocytes, again with the same Vp specificity observed upon in vitro mouse T cell activation by the IDDM Sag. T cell activation and Vp specificity in response to the injection of Sag transfectants was monitored by analysis of T lymphocytes in draining lymph nodes and in the spleen.

The ability to induce Vp-specific T lymphocyte activation in vivo in mice following injection of MHC class II positive transfectants expressing IDDM Sag indicates that the biological effect of IDDM Sag can now be monitored in an in vivo animal model. This allows the testing in vivo, not only of a Sag biological effect, but also of potential inhibitors of the effect of Sag, such as anti-Sag antibodies, including monoclonal anti-Sag antibodies, and small molecular weight inhibitors of Sag (first identified as inhibitors of Sag in in vitro cell based assays).

Finally, this in vivo model of the biological effect of Sag allows to test the effect of prior immunisation of animals with the Sag protein (or derivatives thereof) on the biological effect of Sag in vivo. This model provides a test of the possibility of a protective vaccination against IDDM Sag in vivo.

Transgenic mice carrying the IDDM Sag gene have been obtained. The Sag gene is under the control of a tetracycline operator element (consisting of a heptameric repeat of the Tn motive linked to a minimal promoter). These transgenic mice have been crossed with two other transgenic mice carrying the tetracycline transactivator gene (TTA) under the control of the CMV promoter. One transgenic (CMV-TTA) induces the tet transactivator upon withdrawal of tetracycline, while the other (CMV-RTTA) induces the tet transactivator in the presence of tetracycline. These double transgenic mice permit the deliberate, selective and controlled expression of Sag in vivo, allowing the subsequent study of immunopathological consequences of Sag expression.

Exactly the same steps can be followed (= Sag- expressing mouse cells and Sag expression in vivo) to establish animal models of the effect of other Sags encoded by other HERVs in the context of other autoimmune diseases, such as multiple sclerosis or rheumatoid arthritis.

Experimental Procedures Patients The the islets and spleens from patients with acute onset-and chronic IDDM and non diabetic organ donors were provided by the Pittsburgh Transplant Institute (Conrad et al., 1994).

The plasma and genomic DNA from patients and controls for the epidemiological study were isolated by the Diabetes Register in Turin, Italy (Bruno et al., 1993). The samples were collected within 1 month after the clinical diagnosis from patients, aged from 0-29 years (Bruno et al., 1993).

RT assays RT assays were performed as described (Pyra et al., 1994).

Isolation of full length retroviral genomes A description of the criteria used to identify unknown retroviral 5'R-U5s and 3'R-poly (As) has been published (Weissmahr et al., 1997).

I. Primers sequences for the 3'moiety of the putative retroviral genomes ; abbreviations are according to Eur.

J. Biochem. (1985). 150,1-5.

A. RT region RT la 5'YAAATggMgWAYgYTAACAgACT3' RT lb 5'YAAATggMgWAYgYTAACTgACT3' RT 2a-nested 5'CgTCTAgAgCCYTCTCCggCYATgATCCCg3' RT 2b-nested 5'CgTCTAgAgCCYTCTCCggCYATgATCCCA3' B. 3'U3-R-Poly (As): all primers have an identical 5'- anchor: 5'TgCgCCAgCAATgTATCCATg3'+ sequence-specific part #1K1,2-1 5'gggTggCAgTgCATCATAggT3' #4K1,2-45'gggAgAgggTCAgCAgCAgACA3' #K1,2-10 5'gACAgCAAgCCAgTgATAAgCA3' #K1,2-165'ggAACAgggACTCTCTgCA3'<BR> #K1,2-175'gggAAgggTAAggAAgTgTg3' #K1,2-22 5'ggTgTTTCTCCTgAgggAg3' #K1,2-265'gAAgAATggCCAACAgAAgCT3' #K1,2-27 5'gggAAACAAggAgTgTgAgT3' common, secondary anchor primer: 3'U3-R-poly (As) common 5'CATgTATATgCggCCgCTgCgCCAgCAATgTATCC ATgg3' II. Primer sequences for the 5'moiety of the genome: A. RT-region RT 1 5'TATCTTTCgTTTCTgCAgCAC3' RT 2 5'TAACTggTTgAAgAgCTCgACC3' B. 5'-R-U5 R-U5-1 5'ATACTAAggggACTCAgAggC3' R-U5-2 5'CAgAggCTggTgggATCCTCCATATgC3' The PCR conditions were as follows: lx 94° C 2 min; 45° C 5 min ; 68° C 30 min; lOx 94° C 15 sec ; 45° C 30 sec + 1° C/cycle; 680 C 3 min 30 sec; 25x: 94° C 15 sec; 55° C 30 sec ; 68° C 3 min 30 sec + 20 sec/cycle.

Primers were used at 300 nM final concentration, dNTPs at 200mM, with 52 U/ml of Taq-Pwo polymerase-mix (Boehringer Mannheim). One vol% of first-round PCR was subjected to a nested PCR. Size selected and purified amplification products were blunted, EcoRI adapted and subcloned into EcoRI-digested 1ZAPII-arms. After two rounds of hybridisation 20 individual clones were rescued as plasmids. Eleven clones were selected for further analysis based on a conserved restriction pattern. An equivalent procedure was followed for the 5' moiety of the genome. Sequencing was performed on an automatic sequencer (ABI, Perkin Elmer) using subgenomic clones.

Epidemiological study. RNA-PCR. Three ml of blood was collected in EDTA tubes (Vacutainer) and further processed within 6 hours. Samples were subjected twice to centrifugation, for 4 x 103 G, 10 min at 4°C. Total RNA was extracted from 560 ml of plasma (QIAamp ; Qiagen). Four vol % of total RNA was used for a single tube RT-PCR using thermostable AMV, Taq and Pwo (Boehringer Mannheim). Reactions contained at a final concentration: di-Na salts of dNTPs at 0.2 mM ; DTT at 5 mM; 10 U recombinant RNAsin (Promega); 1.5 mM MgCl2 ; R- poly (A) primer 5'TTT TTg AgT CCC CTT AgT ATT TAT T 3' ; U3 primer 5'Agg TAT TgT CCA Agg TTT CTC C 3', both at 0.3 mM. RT was performed at 50°C for 30 min directly followed by 94° C 2 min ; 940 C 30 sec, 68° C 30 sec,- 1.30 C each cycle, 68° C 45 sec for a total of 10 cycles ; 940 C 30 sec, 55° C 30 sec, 68° C 45 sec for a total of 25 cycles. The amplified material (487 bp) was subjected to agarose gel electrophoresis followed by alkaline transfer and hybridisation with probes generated from the IDDMK1, 222 U3-R-region. Genomic PCR.

100 ng of genomic DNA was subjected to PCR. Reactions contained at a final concentration: dNTPs at 200 mM ; 1.5 mM MgCl2; 2.6 U of Taq-Pwo (Boehringer Mannheim); U3-primer 5'Agg TAT TgT CCA Agg TTT CTC C 3' ; R- primers either 5'CTT TAC AAA gCA gTA TTg CTg C 3, or 5'gTA AAg gAT CAA gTg CTg TgC 3'at 300 nM. The amplified products were 300 and 395 bp in size, respectively. The cycling profile was as follows: 94° C 2 min ; 940 C 15 sec, 68° C 30 sec,-1.3° C each cycle, 72° C 45 sec for a total of 10 cycles; 94° C 15 sec, 55° C 30 sec, 720 C 45 sec for a total of 25 cycles.

Sequence alignment and phylogenetic trees Sequences were aligned with CLUSTAL W (Thompson et al., 1994). Alignments were checked and manually corrected with the SEA VIEW multiple sequence alignment editor (Galtier et al., 1996). Phylogenetic trees were computed from multiple alignments using the"neighbour joining"method (Saitou and Nei, 1987).

Expression Constructs. pPOL-ENV-U3: a SacI-NotI fragment derived from 11 IDDMK1, 222 clones was ligated with 1) a BamHI-SacI adapter containing a consensus SD and 2) with a NotI-XbaI adapter and 3) was subcloned into BamHI-XbaI digested plDR2-arms, selected for by two rounds of screening and plasmids rescued. At least five independent clones were used for transfections. pPOL: pPOL-ENV-U3 was digested with KpnI-NotI, blunted and religated. pPOL-ENV/TR: a stimulatory clone was digested with XbaI and religated. pCI-ENV/TR: 1 ng of pPOL-ENV-U3 was amplified with the primers 5'gAC TAA gCT TAA gAA CCC ATC AgA gAT gC 3'and 5'AgA CTg gAT CCg TTA AgT CgC TAT CgA CAg C 3'. The amplified products were subcloned into pCI-neo (Promega).

Cells and cell lines. Monocytic cell lines: THP1, U937. B-lymphoblastoid cell lines: Raji, BOLETH, SCHU and WT 51. T cells of molecularly MHC-typed blood donors were generated by positive selection with anti- CD3 coated immunomagnetic beads (Milan-Analytika).

Transfections. Transient transfectants were used for functional assays 48 hours after transfection; stable transfectants were selected for 2 weeks in progressive concentration of Hygromycin B to a final concentration of 250 mg/ml for lymphoblastoid lines, and 50 mg/ml for monocytic cell lines.

Functional assays. Transfectants were treated with Mitomycin C (Calbiochem) at 100 mg/ml per 107 cells for 1 hour at 37° C and washed extensively.

Proliferation assays. 105 CD3-beads-selected, MHC compatible T cells or Ficoll-Paque-isolated allogeneic PBL were cultured with transfectants at stimulator: responder ratios of 1: 1 ; 1: 3 and 1: 10 for 48 and 72 hours in 96 round-bottom wells at 37° C. 3H-Thymidine was then added at lmCi/well and incorporation measured after 18 hours incubation at 37° C. FACS analysis and antibodies used were as described ; after 3 days of specific stimulation, at T: non-T ratios of 1: 1 for syngeneic, and 10: 3 for allogeneic stimulations, the T cells were further expanded in 20 U/ml recombinant IL-2 for 6 days before flow cytometric analysis (Conrad et al., 1994).

References Acha-Orbea, H., Shakow, A. N., Scarpellino, L., Kolb, E., Muller, V., Vessaz-Shaw, A., Fuchs, R., Blochlinger, K., Rollini, P., Billotte, J., Sarafidou, M., MacDonald, H. R., and Diggelmann, H. (1991). Clonal deletion of Vbl4-bearing T cells in mice transgenic for mouse mammary tumor virus. Nature 350,207-211.

Acha-Orbea, H., and MacDonald, H. R. (1995).

Superantigens of mouse mammary tumor virus. Annu. Rev.

Immunol. 13,459-486.

Badenhoop, K., Tonjes, R. R., Rau., H., Donner, H., Rieker, W., Braun, J., Herwig, J., Mytilineos, J., Kurth, R., and Usadel, K. H. (1996). Endogenous retroviral Long Terminal Repeats of the HLA-DQ region are associated with susceptibility to Insulin-dependent diabetes mellitus. Human. Immunol. 50,103-110.

Brocke, S., Gaur, A., Piercy, C., Gautam, A., Gijbels, K., Fathman, C. G., and Steinman, L. (1993). Induction of relapsing paralysis in experimental autoimmune encephalomyelitis by bacterial superantigen. Nature 365,642-644.

Bruno, G., Merletti, F., Vuolo, A., Pisu, E., Giorio, M., and Pagano, G. F. (1993). Sex differences in incidence of IDDM in age-group 15-29 yr. Diabetes Care 16,133-136.

Choi, Y., Kotzin, B., Herron, L., Callahan, J., Marrack, P., and Kappler, J. (1989). Interaction of staphylococcus aureus toxin"superantigens"with human T cells. Science 86,8941-8945.

Choi, Y., Kappler, J. W., and Marrack, P. (1991). A superantigen encoded in the open reading frame of the 3'long terminal repeat of mouse mammary tumor virus.

Nature 350,203-207.

Cole, B. C., and Griffiths, M. M. (1993). Triggering and exacerbation of autoimmune arthritis by the mycoplasma arthritidis superantigen MAM. Arthritis Rheum. 36,994- 1002.

Conrad, B., Weidmann, E., Trucco, G., Rudert, W. A., Behboo, R., Ricordi, C., Rodriquez-Rilo, H., Finegold, D., and Trucco, M. (1994). Evidence for superantigen involvement in insulin-dependent diabetes mellitus aetiology. Nature 371,351-355.

Dobrescu, D., Ursea, B., Pope, M., Asch, A. S., and Posnett, D. N. (1995). Enhanced HIV-1 replication in Vpl2 T cells due to human cytomegolovirus in monocytes: evidence for a putative herpesvirus superantigen. Cell 82,753-763.

Fleischer, B., and Schrezenmeier, H. (1988). T cell stimulation by Staphylococcal Enterotoxins. J. Exp.

Med. 167,1697-1707.

Galtier, N., Gouy, M., and Gautier, C. (1996). SeaView and Phylo-win, two graphic tools for sequence alignment and molecular phylogeny. Comput. Applic. Biosci., In Press.

Heidmann, 0., and Heidmann, T. (1991).

Retrotransposition of a mouse IAP sequence tagged with an indicator gene. Cell 64,159-170.

Held, W., Waanders, G. A., Shakov, A. N., Scarpellino, L., Acha-Orbea, H., and MacDonald, H. R. (1993).

Superantigen-induced immune stimulation amplifies mouse mammary tumor virus infection and allows virus transmission. Cell 74,529-540.

Howell, M. D., Diveley, J. P., Lundeen, K. A., Esty, A., Winters, S. T., Carlo, D. J., and Brostoff, S. W. (1991).

Limited T-cell b-chain heterogeneity among interleukin 2 receptor-positive synovial T cells suggests a role for superantigen in rheumatoid arthritis. Proc. Natl.

Acad. Sci. USA 88,10921-10925.

Karvonen, M., Tuomilehto, J., Libman, I., LaPorte, R.

(1993). A review of the recent epidemiological data on the worldwide incidence of type 1 (insulin-dependent) diabetes mellitus. Diabetologia 36,883-892.

Lo, D., Reilly, C. R., Scott, B., Liblau, R., McDevitt, H. O., and Burkly, L. C. (1993). Antigen-presenting cells in adoptively transferred and spontaneous autoimmune diabetes. Eur. J. Immunol. 23,1693-1698.

Mackay, C. R.. (1993). Homing of naive, memory and effector lymphocytes. Curr. Op. Immunol. 5,423-427.

Mach, B., Steimle, V., Martinez-Soria, E., and Reith, W. (1996). Regulation of MHC class II genes: lessons from disease. Annu. Rev. Immunol. 14,301-331.

McClure, M. A., Johnson, M. S., and Doolittle, R. F.

(1988). Sequence comparisons of retroviral proteins: relative rates of change and general phylogeny. Proc.

Natl. Acad. Sci. USA 85,2469-2473.

Medstrand, P., and Blomberg, J. (1993).

Characterization of novel reverse transcriptase encoding human endogenous retroviral sequences similar to type A and type B retroviruses: differential transcription in normal human tissues. J. Virol. 67, 6778-6787. Oldstone, M. B. A. (1990). Molecular mimicry and autoimmune disease. Cell 50,819-820.

Ono, M. (1986a). Molecular cloning and Long Terminal Repeat sequences of human endogenous retrovirus genes related to types A and B retrovirus genes. J. Virol.

58,937-944.

Ono, M., Yasunaga, T., Miyata, T., and Ushikubo, H.

(1986b). Nucleotide sequence of human endogenous retrovirus genome related to mouse mammary tumor virus genome. J. Virol. 60,589-598.

Ono, M., Kawakami, M., and Ushikubo, H. (1987).

Stimulation of expression of the human endogenous retrovirus genome by female steroid hormones in human breast cancer cell line T47D. J. Virol. 61,2059-2062.

Paliard, X., West, S. G., Lafferty, J. A., Clements, J. R., Kappler, J. W., Marrack, P., and Kotzin, B. L.

(1991). Evidence for the effects of a superantigen in rheumatoid arthritis. Science 253,325-329.

Perron, H. et al (1997). P. N. A. S. 94,7583-7588.

Pette, M., Fujiita, K., Wilkinson, D., Altmann, D. M., Trowsdale, J., Giegrich, G., Hinkkanen, A., Epplen, J. T., Kappos, L., and Weckerle, H. (1990). Myelin autoreactivity in multiple sclerosis: recognition of myelin basic protein in the context of HLA-DR2 products by T lymphocytes of multiple sclerosis patients and healthy donors. Proc. Natl. Acad. Sci. USA 87,7968- 7972.

Preston, B. D., and Dougherty, J. P. (1996). Mechanisms of retroviral mutation. Trends Microbiol. 4,16-21.

Pyra, H., Boni, J., and Schupbach, J. (1994).

Ultrasensitive retrovirus detection by a reverse transcriptase assay based on product enhancement. Proc.

Natl. Acad. Sci. USA 91,1 544-1548.

Roizman, B. (1996). Herpesviridae. In Fields Virology, B. N. Fields et al., eds. (Philadelphia: Lippincott-Raven Publishers), pp. 2221-2230.

Saitou, N., and Nei, M. (1987). The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol. Biol. Evol. 4,406-425.

Stegall, M. D., Lafferty, K. J., Kam, I., and Gill, R. G.

(1996). Evidence of recurrent autoimmunity in human allogeneic islet transplantation. Transplantation 61, 1272-1274.

Steinman, L. (1995). Escape from"horror autotoxicus" : pathogenesis and treatment of autoimmune disease. Cell 80,7-10.

Sutkowski, N., Palkama, T., Ciurli, C., Sekaly, R. P., Thorley-Lawson, D. A., and Huber, B. T. (1996). An Epstein-Barr virus-associated superantigen. J. Exp.

Med. 184,971-980.

Tassabehji, M., Strachan, T., Anderson, M., Campbell, R. D., Collier, S., and Lako, M. (1994). Identification of a novel family of human endogenous retroviruses and characterization of one family member, HERV-K (C4), located in the complement C4 gene cluster. Nucl. Acids.

Res. 22,5211-5217.

Temin, H. M. (1981). Structure, variation and synthesis of retrovirus Long Terminal Repeat. Cell 27,1-3.

Thompson, J. D., Higgins, D. G., and Gibson, T. J.

(1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position specific gap penalties and weight matric choice. Nucl. Acids. Res. 22,4673-4680.

Tisch R., and McDevitt, H. (1996). Insulin-dependent diabetes mellitus. Cell. 85,291-297.

Tyden, G., Finn, P. R., Sundkvist, G., and Bolinder, J.

(1996). Recurrence of autoimmune diabetes mellitus in recipients of cadaveric pancreatic grafts. N. Engl. J.

Med. 335,860-863.

Tonjes, R. R., Lower, R., Boller, K., Denner, K., Hasenmaier, B., Kirsch, H., Konig, H., Korbmacher, C., Limbach, C., Lugert, R., Phelps, R. C., Scherer, J., Thelen, K., Lower, J., and Kurth, R. (1996). HERV-K: the biologically most active human endogenous retrovirus family. J. AIDS. Hum. Retrovirol. 13,261- 267.

Vyse, T. J., and Todd, J. A. (1996). Genetic analysis of autoimmune diseases. Cell 85,311-318.

Wahle, E., and Keller, W. (1996). The biochemistry of polyadenylation. TIBS 21,247-250.

Wain-Hobson, S. ( 1996). Running the gamut of retroviral variation. Trends Microbiol. 4,135-141.

Weissmahr, R. N., Boni, J., and Schupbach, J. (1997).

Reverse Transcriptase activity in chicken embryo fibroblast culture supernatants is associated with particles containing endogenous avian retrovirus EAV-O RNA. J. Virol. 71,3005-3012.

Whitcomb, J. M., and Hughes, S. H. (1992). Retroviral reverse transcription and integration: progress and problems. Annu. Rev. Cell. Biol. 8,275-306.

White, J., Herman, A., Pullen, A. M., Kubo, R., Kappler, J. W., and Marrack, P. (1989). The Vb-specific superantigen staphylococcal enterotoxin B: stimulation of mature T cells and clonal deletion in neonatal mice.

Cell 56,27-35.

Wilkinson, D. A., et al. (1994). Endogenous human retroviruses. In The Retroviridae, J. A. Levy, ed. (New York: Plenum Press), pp. 465-535.

Winslow, G. M., Scherer, M. T., Kappler, J. W., and Marrack, P. (1992). Detection and biochemical characterization of the mouse mammary tumor virus 7 superantigen (Mls-la). Cell 71,719-730.

Wucherpfennig, K. W., and Strominger, J. L. (1995a).

Selective binding of self peptides to disease- associated Major Histocompatibility Complex (MHC) molecules: a mechanism for MHC-linked susceptibility to human autoimmune diseases. J. Exp. Med. 181,1597-1601.

Wucherpfennig, K. W., and Strominger, J. L. (1995b).

Molecular mimicry in T cell-mediated autoimmunity: viral peptides activate human T cell clones specific for myelin basic protein. Cell 80,695-705.

Xiong, Y., and Eickbusch, T. (1990). Origin and evolution of retroelements based upon their reverse transcriptase sequences. EMBO J. 9,3353-3362.

York, D. F., Vigne, R., Verwoerd, D. W., and Guerat, G.

(1992). Nucleotide sequence of the Jaagsiekte retrovirus, an exogenous and endogenous type D and B retrovirus of sheep and goats. J. Virol. 66,4930-4939.