Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
CHARACTERIZATION AND DETECTION OF SEQUENCES ASSOCIATED WITH AUTOIMMUNE DISEASES
Document Type and Number:
WIPO Patent Application WO/1989/004875
Kind Code:
A2
Abstract:
DNA sequences and corresponding amino acid sequences from the HLA class II beta region of the human genome that are associated with insulin-dependent diabetes mellitus (IDDM) and Pemphigus vulgaris (PV) have been identified. Specifically, marker DNA sequences which detect either directly or indirectly the identity of the codon encoding for the amino acid at position 57 of the DQ-beta protein sequence are disclosed as well as sequences from the DR-beta region. These sequences may be used to generate DNA hybridization probes and antibodies for assays to detect a person's susceptibility to autoimmune diseases, such as IDDM and PV. Such antibodies and peptides encoded by said DNA sequences can be used therapeutically or prophylactically.

Inventors:
ERLICH HENRY A (US)
HORN GLENN T (US)
Application Number:
PCT/US1988/004067
Publication Date:
June 01, 1989
Filing Date:
November 14, 1988
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
CETUS CORP (US)
International Classes:
A61K38/00; A61K39/395; G01N33/53; A61P37/00; C07K14/74; C07K16/28; C12N15/02; C12N15/09; C12Q1/68; C12Q1/686; C12Q1/6881; C12Q1/6883; G01N33/564; (IPC1-7): C12Q1/68; C12N15/00
Domestic Patent References:
WO1986007464A11986-12-18
WO1988002783A11988-04-21
Foreign References:
EP0237362A11987-09-16
EP0084796A21983-08-03
Other References:
Journal of Clinical Investigation, vol. 79, April 1987 (US) B. Michelsen et al.: "Molecular cloning of a polymorphic DNA endonuclease fragment associates insulin-dependent diabetes mellitus with HLA-dq" pages 1144-1152
Immunogetics, vol. 24, 1986 Springer-Verlag (DE) B. Owerbach et al.: "Characterization of three HLA-DR beta genes isolated from an HLA-DR 3/4 insulin-dependent diabetic patient", pages 41-46
Science, vol. 239, no. 4843, 26 February 1988 (Washington, DC, US) A.A. Sinha et al.: "A newly characterized HLA DQ beta allele associated with Pemphigus vulgaris", pages 1026-1029
Proceedings of the National Academy of Sciences of the USA, vol. 85, May 1988 (Washington, DC, US) S.J. Scharf et al.: "HLA class II allelic variation and susceptibility to Pemphigus vulgaris" pages 3504-3508
Human Immunology, vol. 21, 1988, Am. Soc. for Histocompatability and Immunogenetics (US) G.T. Horn et al.: "Sequence analysis of HLA class II genes from insulin-dependent diabetic individuals", pages 249-263
Human Immunology, vol. 22, 1988, Am. Soc. for Histocompatability and Immunogenetics (US) S.J. Scharf et al.: "Sequence analysis of the HLA-DR beta and HLA-DQ beta loci from three Pemphigus vulgaris patients" pages 61-69
See also references of EP 0439458A1
Download PDF:
Claims:
WHAT IS CLAIMED IS:
1. A marker DRbetaI DNA sequence from the HLA class II beta genes associated with insulindependent diabetes mellitus and with DR4associated susceptibility to Pemphigus vulgaris.
2. The marker DRbetaI DNA sequence of claim 1 which is GACATCCTGGAAGACGAGCGG, or the DNA strand complementary thereto.
3. The marker DRbetaI DNA sequence of claim 1 associated with DR4, Dw4associated susceptibility to insulindependent diabetes mellitus.
4. The marker DRbetaI DNA sequence of claim 3 which is GGAGCAGAAGCGGGCCGCG, or the DNA strand complementary thereto.
5. Marker DQbeta DNA sequences from the HLA class II beta genes associated with DRw6associated susceptibility to Pemphigus vulgaris.
6. The marker DQbeta DNA sequences of claim 5 which comprise one or more nucleotide sequences from the second exon of the DQB1.3 allele from about codon 20 to about codon 80.
7. A marker DNA sequence from the HLA DQbeta allele associated with susceptibility to insulindependent diabetes mellitus, wherein said sequence can be used to detect either directly or indirectly the identity of the codon at position 57 of the DQbeta protein sequence.
8. The marker DNA sequence of claim 7 wherein said codon at position 57 is selected from the group consisting of codons for alanine, valine, and aspartate.
9. The marker DNA sequence of claim 8 wherein said DNA sequence is selected from the group consisting of: (a) GGGCCGCCTGCCGCC, (b) GGGCTGCCTGCCGCC, (c) GGGCGGCCTGTTGCC, (d) GGGCCGCCTGACGCC, and (e) GGGCGGCCTGATGCC, or the DNA strands which are complementary thereto.
10. The marker DNA sequence of claim "7 wherein said DNA sequence is used to detect indirectly a second DNA sequence comprising the codon at position 57 wherein said codon57 is selected from the group consisting of codons for alanine, valine and aspartate.
11. The marker DNA sequence of claim 10 wherein said DNA sequence is selected from the group consisting of: (a) GTGGGGGTGTATCGGGCG, (b) GTGGGGGAGTTCCGGGCG, (c) GTGGAGGTGTACCGGGCG, and (d ) GTGGGGGTGTACCGGGCA, or the DNA strands which are complementary thereto.
12. A process for detecting the presence or absence of sequences associated with suscept bility to insulindependent diabetes mellitus in a DNA sample comprising: (a) treating the sample to expose the DNA therein to hybridization; (b) affixing the treated sample to a membrane; (c) treating the membrane under hybridization conditions with a labeled sequencespecific oligonucleotide probe capable of hybridizing with one or more of the DNA sequences selected from the group consisting of: CD GACATCCTGGAAGACGAGCGG, (2) GGGCCGCCTGCCGCC, (3) GGGCTGCCTGCCGCC, and (4) GGGCGGCCTGTTGCC, or with the DNA strands complementary thereto; and (d) detecting whether the probe has hybridized to any DNA in the sample.
13. An antibody that binds to a peptide containing an epitope comprising an amino acid residue corresponding to position 57 of a DQbeta protein, and said antibody having crossreactivity with a homologous peptide sequence encoded by a human persistent viral pathogen, wherein said amino acid residue is selected from the group consisting of alanine and valine.
14. The antibody of claim 13 wherein the peptide is selected from the group consisting of: (a) Gly Pro Pro Ala Ala, (b) Gly Leu Pro Ala Ala, and (c) Gly Arg Pro Val Ala.
15. The antibody of claim 13 wherein the viral pathogen is selected from the group of viruses consisting of EpsteinBarr virus, rubella virus, Coxsackie virus, cytomegalovirus, and reovirus.
16. A process for detecting the presence or absence of sequences associated with susceptibility to insulindependent diabetes mellitus in a protein sample comprising: (a) incubating the sample in the presence of one or more of the antibodies of claim 13 that are labeled with a detectable moiety; and (b) detecting the moiety.
17. The process of claim 16 wherein before, during, or after incubating with the labeled antibody the sample is incubated in the presence of a monoclonal antibody that is immobilized to a solid support and binds to one or more of the amino acid sequences selected from the group consisting of: (a) Gly Pro Pro Ala Ala, (b) Gly Leu Pro Ala Ala, and (c) Gly Arg Pro Val Ala.
18. A process for identifying haplotypes associated with susceptibility to insul ndependent diabetes mellitus in a serum samp! e comprising: (a) incubating the sample in the presence of one or more of the peptides selected from the group consisting of Gly Pro Pro Ala Ala, Gly Leu Pro Ala Ala, and Gly Arg Pro Val Ala; (b) detecting the presence of immune complexes formed between said peptide and an antibody present in said serum sample; and (c) determining from the results of step (b) whether a susceptible haplotype is present.
19. The prophylactic and/or therapeutic use of amino acid sequences and/or antibodies thereto wherein said amino acid sequences are encoded by DNA sequences from the HLA class II beta genes selected from the group consisting of marker DRbetaI DNA sequences associated with insulindependent diabetes mellitus and with DR4associated susceptibility to Pemphigus vulgaris; marker DQbeta DNA sequences associated with DRw6associated susceptibility to Pemphigus vulgaris; marker DR3 betaIll DNA sequences associated with insul ndependent diabetes mellitus; and marker DNA sequences from the DQbeta allele associated with suscept bility to insulindependent diabetes mellitus wherein said sequence contains the codon for position 57 of the DQ beta protein sequenced.
Description:
CHARACTERIZATION AND DETECTION OF SEQUENCES ASSOCIATED WITH AUTOIMMUNE DISEASES

This invention relates to HLA class II beta genes and proteins associated with autoimmune diseases and methods for their diagnostic detection. Specifically, the autoimmune diseases on which this invention focuses are insulin-dependent diabetes mellitus (IDDM) and Pemphigus vulgaris ( PV) .

A variety of autoimmune diseases have been associated with serologically defined variants of the human leukocyte antigen (HLA) class III antigens*. The HLA region, located on the short arm of chromosome 6, encodes many different glycoproteins that have been classified into two categories. The first category, class I products, encoded by the HLA-A, -B, and -C loci, are on the surface of all nucleated cells and function as targets in cytolytic T-cell recognition. The second category, class II products, encoded by the HLA-D region, are involved in cooperation and interaction between cells of the immune system. These class II products appear to be encoded by at least three distinct loci, DR, DQ and DP, each with its distinct alpha and beta chains. The class II loci of. the human major histocompatibility complex (MHC) encode highly polymorphic cell- surface glycoproteins ( acrophage and beta-cell transmembrane glycoproteins). For review article, see Giles et al . Adv. in

Immunol . 37..1-71 (1985). The polymorphism in class II antigens is localized to the NH2~terminal outer domain and is encoded by the second exon. The class II polymorphic residues have been postulated to interact with -the T-cell receptor or with foreign antigen or both

Sette et al., Nature, 328:395-399 (1987) with recognition of the antigen peptide fragments in association with a specific class II product leading to T cell activation and consequent stimulation of antibody production by beta lymphocytes Marx et al . , Science, 23^:613-614 (1987).

Insulin-dependent diabetes mellitus (IDDM), a chronic autoimmune disease also known as Type I diabetes, is a familial disorder of glucose metabolism susceptibility associated with specific

allelic variants of the human leukocyte antigens (HLA). The dysfunctional regulation of glucose metabolism occurring in IDDM patients results from the i munologically mediated destruction of the insulin-producing islet cells of the pancreas, the beta cells. The development of IDDM can be divided into six stages, beginning with genetic susceptibility and ending with complete destruction of beta- cells. G. Eisenbarth, N. Eng. J. Med., 314:1360-1368 (1986). Donaich et al., Annu. Rev. Med., 3 :13-20 (1983). . More than 90% of all IDDM patients carry the DR3 and/or DR4 antigen, and individuals with both DR3 and DR4 are at greater risk than individuals who have homozygous DR3/3 or DR4/4 genotypes. 1. Raffel and J. Rotter, Clinical Diabetes, 3_:50-54 (1985); Svejgaard et al., Immunol. Rev., 7O_:193-218 (1983); L. Ryder et al., Ann. Rev. Genet., 21:169-187 (1981). Pemphigus derived from the Greek pemphix meaning blister or pustule is the name applied to a distinctive group of chronic or acute skin diseases characterized by successive crops of itching bullae. Pemphigus vul aris (PV) is a rare relapsing disease manifested by suprabasal , intraepi dermal bullae of the skin and mucus membranes, which is invariably fatal if untreated; however, remission has been obtained by the use of corticosteroid hormones and immunosuppressive drugs. PV, an autoimmune disease, has been strongly associated with the HLA serotypes DR4 and DRw6 Brautbar et al . , Tissue Antigens, .16_:238-241 (1980) with less than 5% of PV patients possessing neither marker. Disease associations with two different haplotypes can be interpreted to mean (1) the two haplotypes share a common all el e or epϊtope, or, alternatively, that (2) different alleles on the two haplotypes are capable of conferring disease suscepti il ty.

Molecular analysis of the HLA class II genes has revealed that the HLA serotypes are genetically heterogeneous, and that, in particular, the DR4 haplotype consists of five different DR-beta-I allelic sequences corresponding to the five mixed lymphocyte culture (MLC) defined types -Dw4, DwlO, Dwl3, Dwl4 and Dwl5 Gregersen et al . , PNAS (USA) 83_:2642-2646 (1986) and three different DQ-beta allelic sequences corresponding to the DQ-beta-3.1, DQ-beta-3.2 and DQ-blank

types Erlich et al., The Molecular Analysis of Histocompatibility Antigens, pp. 93-109 (Schacter et al., eds., 1987).. Virtually all of the extensive polymorphism characteristic of the class II loci has been localized to the second exon. Sequence analysis of coding sequence polymorphisms in the

DR-beta loci revealed that the sequence or epitope in the DR4 DR-beta- I chain that distinguishes DwlO from the other DR4 subtypes is shared by the DR-beta-I chain of the DRw6 haplotype. Gorski et al., Nature, 322:67-70 (1986). Recently, restriction fragment length polymorphisms (RFLPs) which subdivide the DR4 and DRw6 haplotypes were obtained by using a HLA-DQ-beta cDNA probe; such RFLPs have been reported to be even more highly associated with PV than are the serologic markers, Szafer et al., Proc. Nat! . Acad. Sci. USA, 84_:6542-6545 (1987).

Of all the immunologically defined polymorphisms, the HLA- DR-beta region has been found to be most strongly associated with IDDM. Therefore, restriction fragments of the HLA class II-beta DNA have been analyzed for use as genetic markers of insulin-dependent diabetes mellitus. D. Owerbach et al., Diabetes, j33_:958-964 (1984); 0. Cohen-Haguenauer et al., PNAS (USA), 82_:3335-3339 (1985); D. Stetler et al., PNAS (USA), 82_:8100-8104 (1985).

Arnheim et al., PNAS (USA), 82_:6970-6974 (Oct. 1985), examined DNA polymorphisms within the HLA class II loci associated with susceptibility to IDDM by using genomic blot-hybridization analysis with DQ-beta and DR-beta cDNA probes. Described therein is a DQ-beta subdivision of the DR4 haplotype wherein one DR4 variant had a Rsal restriction fragment length polymorphism (RFLP) of 1.8 kb and another had a Rsal RFLP of 1.5 kb. The DQ-beta-related 1.5 kb Rsal fragment was reported to identify a number of non-DR4 IDDM individuals as well as 90% of all IDDM DR4 individuals. Other investigators using other restriction enzymes (e.g.,

Ba HI, Hindlll) have reported RFLP subdivisions of the DR4 haplotype using DQ-beta probes. Hoi beck et al., Immunogeneti s (1986) 24:251- 258; Henson et al., Immunogenetics, (1987) 5_:152-160) . Hoi beck et

al., id. , found that the RFLP subsets of DR4, designated DQw3.1 and DQw3.2 are distinguishable by the reactivity of their expressed products with a specific monoclonal antibody TA10. Kim et al., PNAS (USA), 82_:8139-8142 (1985); Tait et al., Tissue Antigens, (1986) ,28_:65-71. The DQw3.1 subtype correlates with the serologic specificity TA10 + , whereas DQw3.2 correlates with TA10 " .

European Patent Publication No. 237,362 discloses the cloning and sequencing of the Rsal 1.5 kb (DQw3.2) and the Rsal 1.8 kb

(DQw3.1) variants of DR4 haplotypes and illustrates the differences in the sequences thereof. (Such differences are shown herein in Tables

III and IV.)

WO 86/07464 discloses a specific DQ-beta2 allelic variant, DQw3.2, as a specific genomic marker associated with IDDM, and provides two methods of identifying individuals at increased risk of diabetes. The first method involves the use of a labeled probe to detect the DQw3.2 all el e, whereas the second method involves the serologic detection of the DQw3.2 allele.

Erlich et al., Perspectives in Immunogenetics and Histocompatibility, Vol. 7:93-106 (Schacter et al., eds., 1987), reported the protein translation sequences for the DQw3.1 and 3.2 variants.

Michelson et al., J. Clin. Invest., 79_:1144-1152 (April 1987), reported the nucleotide sequence for the DQw3.1 variant.

Acha-Orbea et al., PNAS (USA), 84 , (8) : 2435 -2435 (1987) reported on differences in the H-2 I-A region of control mice and diabetes-susceptible NOD (non-obese diabetic) mice. Normal mice have an aspartate residue at position 57 of said region whereas NOD mice have a neutral serine residue at that position. The human -HLA-DQ-beta region is analogous to the H-2 I-A region of the mouse. Yoon et al . , Diabetes Care, 8_ (suppl . l):39-44 (Sept. -Oct.

1985), presents a review of the evidence for viruses as a trigger for IDDM in animals and humans. See also, Bodansky et al . , Lancet, (1986), ri_:1351-1353; Kagnoff et al., J. Exp. Med., (1984), 160:1544-

1557; McChesney et al . , Ann. Rev. Immunol. (1987), 5_:279-304; Oldstone et al., jjι_ Notkins et al. (eds.), Concepts in Viral Pathogenesis (1986); Schwi mback et al . , J. Exp. Med. (1987), _1_____§. :173 ~ 181 i Sil er et al., Disease Markers (1985), 3_:155-168; and Srinivasappa et al. , J. Virol., 5^:397-401.

Roudier et al . , Abstract from the American Rheumatism

Association (Western Region) meeting in San Diego, CA, Nov. 5-7, 1987, reported that the. HLA Dw4 DR-beta-I chain and an Epstein-Barr virus (EBV) glycoprotein share a hexapeptide. Todd et al., Nature, 29_:599-604 (Oct. 15, 1987) discusses the contribution of the HLA DQ-beta gene to susceptibility and resistance to IDDM. The authors conclude that "the structure of the DQ molecle, in particular residue 57 of the beta-chain, specifies the autoimmune response against the insulin-producing islet cells." Many HLA DR-beta sequences have been published previously.

The sequence AspIleLeuGluAspGluArg was reported by Gregersen et al . , PNAS (USA), 83_:2642-2646 (1986) as part of a study of the diversity of DR-beta genes from -HLA DR4 haplotypes. No mention was made of an association thereof with diabetes. In addition, J. Gorski and B. Mach, Nature, 322:67-70 (1986) reported on HLA-DR polymorphism within a group including the haplotypes DR3, DR5 and DRwδ. The nucleotide sequences found in the polymorphic regions at the beta-I locus were not discussed regarding association with diabetes. The first publ cation on HLA sequences from diabetics is that by D. Owerbach et al., Immunogenetics, 24_:41-46 (1986). This paper is based on .the study on a HLA-DR-beta gene library from _one IDDM patient. The analysis of class II polymorphism and disease suscept bility requires the comparison of many sequences derived from patients and HLA-matched controls. Allelic variation in the class II antigens is restricted to the outer domain encoded by the second exon of the protein. Serologic methods for detecting HLA class II gene polymorphism are not capable of detecting much of the variation detectable by DNA methods.

Allelic variations may be detected independently of restriction site polymorphism by using sequence-specific synthetic oligonucleotide probes. Conner et al., PNAS (USA), j50_:278 (1983). This technique has been applied to study the polymorphism of HLA DR- beta using Southern blotting. Angel ini et al . , PNAS (USA), 83:4489- 4493 (1986).

A further refinement of the technique using sequence- specific ol gonucleotide probes involves amplifying the nucleic acid sample being analyzed using selected primers, four nucleotide tri phosphates, and an appropriate enzyme such- as DNA polymerase, followed by detecting the nucleotide variation in sequence using the probes in a dot blot format, as described in European Publication No. 237,362, supra. A temperature cycling process wherein a thermostable enzyme is added only once in the amplification process is described in European Patent Publication No. 258,017, which also discloses thermostable enzymes, purified or recombinant, which can be used in the amplification process.

There is a need in the art for subdivision of the serologic markers HLA DR3, DR4 and DRw6 to obtain more informative and more precisely defined markers for susceptibility to the autoimmune diseases IDDM and PV. Further, there is a need in the art to identify susceptibility conferring haplotypes which are neither DR3, DR4 nor DRw6.

Previously, the distinction between the IDDM associated DQ- beta variants, DQw3.1 and DQw3.2, of the DR4 haplotype has been made by RFLP or by the use of antibodies. This invention in one aspect relates to methods to identify such DQ-beta variants.

Accordingly, the present invention provides a marker DR- beta-I DNA sequence from the HLA class II beta genes associated with insulin-dependent diabetes mellitus (IDDM) and with IDDM and with DR4- associated susceptibil ty to Pemphigus vulgaris (PV).

Specifically, in one aspect, the present invention provides a marker DR-beta-I DNA sequence associated with IDDM and with DR4- associated susceptibility to Pemphigus vulgaris which is

GACATCCTGGAAGACGAGCGG, or the DNA strand complementary thereto. Further, the invention provides for the amino acid sequence Asp lie Leu Glu Asp Glu Arg which is encoded by said DNA sequence.

The present invention also provides for marker DR-beta-I DNA sequences associated with DR4, Dw4-associated susceptibility- to IDDM wherein one such sequence is GGAGCAGAAGCGGGCCGCG, or the DNA strand complementary thereto. Further, the invention concerns amino acid sequences encoded by said marker DR-beta-I DNA sequences associated with DR4, Dw4-associated susceptibility to IDDM and antibodies to said amino acid sequences.

The invention further provides for marker DQ-beta DNA sequences from the HLA class II beta genes associated with DRw6- associated susceptibility to Pemphigus vulgaris. Specifically, such DQ-beta DNA sequences comprise one or more nucleotide sequences from the second exon of the unique DQB1.3 allele from about codon 20 to about codon 80. Further, the invention concerns amino acid sequences encoded by such DQB1.3 DNA sequences and antibodies to said amino acid sequences.

In another aspect, the invention provides a marker DNA sequence from the HLA DQ-beta allele associated with susceptibility to insulin-dependent diabetes mellitus (IDDM), wherein said sequence can be used to detect either directly or indirectly the identity of the codon at position 57 of the DQ-beta protein sequence and marker DNA sequences wherein said codon at position 57 is selected from the group consisting of codons for alanine, valine and aspartate. Said marker DNA sequence is preferably selected from the group consisting of:

(a) GGGCCGCCTGCCGCC,

(b) GGGCTGCCTGCCGCC,

(c) GGGCGGCCTGTTGCC, (d) GGGCCGCCTGACGCC, and

(e) GGGCGGCCTGATGCC, or the DNA strands which are complementary thereto.

This invention further relates to oligonucleotide probes specific for the 1.5 kb (DQ-beta-3.2) variant and for the 1.8 kb (DQ- beta-3.1) variant DQ-beta subdivisions of the DR4 haplotype.

Specifically, such an oligonucleotide probe specific for the 1.5 kb (DQ-beta-3.2) variant is designated GH74, is a 19-mer and has the sequence: CCGCTGGGGCCGCCTGCCG.

An oligonucleotide probe, designated GH92 specific for the 1.8 kb (DQ-beta-3.1) RFLP is also a 19-mer and has the following sequence: CGTGGAGGTGTACCGGGCG. Under appropriate hybridization and washing conditions, these oligonucleotides, labeled, for example, with

3Z P by kinasing, or with non-radioisotopic molecule reporters, such as biotin or an enzyme as, for example, horseradish peroxidase, specifically identify the DQ-beta-3.2 and DQ-beta-3.1 variants. This invention further relates to the diagnostic use of such probes.

Further, the invention provides for marker DNA sequences wherein said DNA sequences are used to detect indirectly a second DNA sequence comprising the codon at position 57 wherein said codon57 is selected from the group consisting of codons for alanine, valine and aspartate. Said marker DNA sequences used to detect indirectly said second DNA seqence, are preferably selected from the group consisting of:

(a) GTGGGGGTGTATCGGGCG, (b) GTGGGGGAGTTCCGGGCG,

(c) GTGGAGGTGTACCGGGCG, and

(d) GTGGGGGT6TACCGGGCA, or the DNA strands which are complementary thereto.

Further, the invention provides for allele-spec fic oligonucleotide (ASO) probes that can be used to detect indirectly the identity of codon 57 in the DQ-beta locus wherein such probes are preferably selected from the group consisting of:

(1) a 19-mer designated GH61 from the DQ-beta-B region of the DR3 haplotype having the nucleotide sequence: CGGCAGGCAGCCCCAGCAG;

(2) a 19-mer designated GH66 from the DQalpha region of the DR3 haplotype having the nucleotide sequence: TGTTTGCCTGTTCTCAGAC; and

(3) a 21-mer designated GH70 from the DQ-beta-A region of the DR3 haplotype having the nucleotide sequence:

GATGCTTCTGCTCACAAGACG.

The invention also relates to a process for detecting the presence or absence of sequences associated with susceptibility to insulin-dependent diabetes mellitus and/or Pemphigus vulgaris in a DNA sample comprising:

(a) treating the sample to expose the DNA therein to hybridization; (b) affixing the treated sample to a membrane;

(c) treating the membrane under hybridization conditions with a labeled sequence-specific oligonucleotide probe capable of hybridizing with one or more of the DNA sequences selected from the group consisting of: (1) GACATCCTGGAAGACGAGCGG,

(2) GGGCCGCCTGCCGCC,

(3) GGGCTGCCTGCCGCC, and

(4) GGGCGGCCTGTTGCC, or with the DNA strands complementary thereto; and (d) detecting whether the probe has hybridized to any DNA in the sample.

The invention also concerns an antibody that binds to a peptide segment containing an epitope comprising an amino acid residue corresponding to position 57 of a DQ-beta protein, wherein said antibody may have cross-reactivity with a homologous peptide sequence encoded by a human persistent viral pathogen, and wherein said amino acid residue is selected from the group consisting of alanine and valine. Said antibody preferably binds to a peptide selected from the group consisting of: (a) Gly Pro Pro Ala Ala,

(b) Gly Leu Pro Ala Ala, and

(c) Gly Arg Pro Val Ala.

Further, said viral pathogen is preferably selected from the group of viruses consisting of Epstein-Barr virus, rubella virus, Coxsackie virus, cytomegalovirus, and reovirus.

Still further, the invention relates to a process for detecting the presence or absence of sequences associated with susceptibility to insulin-dependent diabetes mellitus in a protein sample comprising: (a) incubating the sample in the presence of one or more of the antibodies that bind to a peptide selected from the group consisting of:

Gly Pro Pro Ala Ala, Gly Leu Pro Ala Ala, or Gly Arg Pro Val Ala; wherein said antibodies are labeled with a detectable moiety; and

(b) detecting the moiety. The antibodies can be polyclonal or monoclonal. Said process for detecting IDDM associated sequences includes those processes wherein before, during, or after incubating with the labeled antibody, the sample is incubated in the presence of a monoclonal antibody that is immobilized to a solid support and binds to one or more of the amino acid sequences selected from the group consisting of:

(a) Gly Pro Pro Ala Ala, (b) Gly Leu Pro Ala Ala, and

(c) Gly Arg Pro Val Ala.

The invention further concerns a process for identifying haplotypes associated with susceptibil ty to insulin-dependent diabetes mellitus in a serum sample comprising: (a) incubating the sample in the presence of one or more of the peptides selected from the group consisting of Gly Pro Pro Ala Ala, Gly Leu Pro Ala Ala, and Gly Arg Pro Val Ala;

(b) 'detecting the presence of immune complexes formed between said peptide and an antibody present in said serum sample; and (c) determining from the results of step (b) whether a susceptible haplotype is present. The peptides used in said process can be labeled with a detectable moiety, and the detection can be by enzyme reaction, fluorescence or luminescence emission.

The invention further concerns the prophylactic and therapeutic use of the above-referenced antibodies and peptides.

In another aspect, the invention provides a kit for detecting the presence or absence of sequences associated with susceptibility to insulin-dependent diabetes mellitus or Pemphigus vulgaris in a DNA sample, which kit comprises, in packaged form, a multi container unit having one container for each labeled sequence- specific DNA probe capable of hybridizing with one or more of the DNA sequences identified above or with the DNA strands complementary thereto.

In a final aspect, the invention provides a kit for detecting the presence or absence of amino acid sequences associated with susceptibility to insulin-dependent diabetes mellitus or to Pemphigus vul aris in a protein sample, which kit comprises, in packaged form, a multi container unit having a container for each antibody labeled with a detectable moiety that binds to one or more of the amino acid sequences identified above.

As mentioned above, genetic susceptibility to IDDM has been correlated in both family and population studies with the presence of the serologic markers HLA DR3 and DR4. The highest risk for IDDM is associated with HLA DR3,4 heterozygotes, suggesting that the susceptible alleles associated with these two DR types may be different and that two doses may be required for high risk to IDDM. Previous restriction fragment length polymorphism analysis has subdivided DR3 and DR4 into two subsets each.

Similarly, as mentioned above, genetic susceptibility to PV has been correlated with the presence of the serologic markers HLA, DR4 and DRw6. Previous restriction fragment length polymorphism has subdivided the DR4 and DRw6 haplotypes. Molecular analyses of the HLA genes herein has resulted in further subdivision of the HLA DR3, DR4 and DRw6 serological types and in the generation of novel, more informative, and more precisely defined genetic markers for susceptibility to IDDM and PV. The molecular techniques hen in reveal not only that the number of class

II loci is unexpectedly large, but also that the allelic variation at these loci is greater than the polymorphic series defined by serological typing and can be more precisely localized.

Figure 1 illustrates the sequence of the DR-beta fragment PCR (polymerase chain reaction) amplified according to Example III. The sequences of the PCR primers are shown by long arrows, which also indicate the direction of extension by the polymerase. The broken lines show the BamHI and Pstl recognition sequences used to generate restriction sites for cloning. The start of the second exon sequence is shown by the short arrow and the region of the fragment corresponding to the DwlO sequence specific oligonucleotide GH78 is shown by the bracketed segment. Digestion with BamHI and Pstl produces a 248 bp fragment for cloning. The sequence shown here represents a prototype DRB1 allele from the DR4 DwlO haplotype. Figure 2 illustrates the amino acid sequences for the HLA

DR-beta second exon from three Pemphigus vulgaris (PV) (see. Example III) and from DR-beta prototypes (Gregersen et al . , PNAS (USA), 83:2642-2646 (1986)) using the standard one-letter amino acid code (see Table VIII, infra) . The entire amino acid sequence of the DR- beta-I allele from a DR4 Dw4 haplotype is shown whereas the other sequences are aligned with it using a dash to indicate homology and letters to indicate polymorphic amino acids. The DR type (and the Dw type where appropriate) and the DR-beta locus assignment are shown at the right end of each sequence. "I" refers to the DR-beta-I locus, "III" for the DR-beta-III locus which encodes the DRw52 specificity, and H IV" for the DR-beta-IV locus, which encodes -the DRw53 specificity. The fragments obtained by PCR cloning are smaller than the prototype sequences, which are derived from cDNA clones.

Figure 3 illustrates the amino acid sequences from the HLA DR-beta second exon of three PV patients according to the conventions of Figure 2 wherein the sequences are aligned with DR4 DQ-beta-3.1 and DQ-beta-3.2. prototype sequences from cDNA clones. Michelsen et al., J. Clin. Invest., ~ ~ l- ~~ -- ' 5Z (1987); -Gregersen et al . , supra. At the right of each sequence is the DR type and DQ-beta type. The DQ-

beta sequences obtained by PCR cloning are shorter than the prototype sequences.

Figure 4 illustrates the alignment of HLA-DQ-beta protein sequences. DNA sequences of DQ-beta alleles were translated to the standard one-letter amino acid code (Table VIII, infra) and aligned to show patterns of homology. A dash indicates homology with the equivalent amino acid in the prototypic DQ-beta (DR4) allele. A blank indicates that the sequence was not determined. Location of the PCR amplification primers are shown on the bottom. Note that the PCR amplification procedure only determines the sequence between the oligonucleotide primers Scharf et al . , Science, 233 :1076-1078 (1986). The source of each sequence is designated on the left of each line, and its DR serologic type is shown on the right. An asterisk after the DR type indicates that the allele was determined from a patient with IDDM. The Hul29 sequence was determined from a PV patient. On the far right is the designation of the allele, corresponding when possible with the DQw typing of the haplotype. The DCB4 sequence was from Larhammar et al., PNAS (USA), 80_:7313-7317 (1983); CMCC and MMCC are from Horn et al . , 1988, PNAS (USA) 85_:6012- 6016; DQB37 from Michelsen et al., J. Clin. Invest., 79:1144-1152 (1987); KT3 from Gregersen et al., PNAS (USA), 83_:2642-2646 (1986); WT49 from Boss et al., PNAS (USA), 81.:5199-5203 (1984); BURK from Karr et al., J. Iππunol., 137:2886-2890 (1986); LG2 from Bull et al., PNAS (USA), 82_:3405-3409 (1985); DQBS4 and DQBS5 from Tsukamoto et al., Iirniunogenetics, _25_:343-346 (1987); AZH, BGE, and PGF from Lee et al . , Immunogenetics, 26_:85-91 1987); and the related DX-beta sequence was from Okada et al . , PNAS (USA), j52_:3410-3414 (1985). The PCR sequence illustrated for PGF matched the published cDNA. The DQB alleles reported for the two IDDM patients, DC and JR, were the only DQB3.1 alleles observed in 34 DR4 patients.

Figure 5 illustrates the alignment of HLA-DR-beta protein sequences. DNA sequences of the DR-beta alleles were translated to the standard one-letter amino acid code (Table VIII, nfra) and aligned to show patterns of homology. The conventions used are the same as explained in the description for Figure 4, supra . See also Example I, infra.

Allelic sequence variations reported herein have been compared with conventional HLA classifications, and a new nomenclature based on the correspondence between current DQw serological specificities and sequence patterns is used. Specifically, the DNA- defined types DQB1, DQB2 and DQB3 designate sequences derived from the serologically defined DQwl, DQw2 and DQw3 haplotypes respectively, whereas DQB4 designates those sequences derived from DQ (blank) haplotypes, which are apparently homogeneous. Employed in such nomenclature is the convention of using Greek letters for the genetic loci and for the protein products (e.g., the DQ-beta locus encoding the DQ-beta chain) and of using Roman capital letters followed by a number to designate the specific allelic sequence variants (e.g., the DQB2 allele). Sequence variants which subdivide such types are designated by a subtype number (e.g., DQB1.2 or DQA1.3). However, the designation of DQalpha allelic variants do not always correspond to the DQw specificity; for example, the DQA4 type is associated with both the DQw2 and DQw3 haplotypes. That is because the DQw2 and DQw3 specificities appear to be determined by polymorphic epi topes on the beta-chain, independently of allelic variation on the alpha-chain.

"Posititively associated" with an autoimmune disease is a term used herein to mean that the frequency of a marker is increased in patients with the disease relative to controls (individuals without the disease). The converse meaning applies to the term "negatively associated" with an autoimmune disease, that is, the frequency of the marker is decreased in patients relative to controls.

The terms "ol gonucleotide" as used herein is defined as a molecule comprised of two or more deoxyribonucleo tides or r bonucleotides, preferably more than three. Its exact size will depend on many factors, which in turn depend on the ultimate function or use of the oligonucleotide. The oligonucleotide may be derived synthetically or by cloning.

The term "sequence-specific ol gonucleotides" (SSOs) refers to oligonucleotides which will hybridize to one of the specific DNA

sequences identified herein, which are regions of the loci where allelic variations may occur. Such oligonucleotides have sequences spanning one or more of the DNA regions being detected and are specific for one or more of the regions being detected. One sequence- specific oligonucleotide is employed for each sequence to be detected, as described further hereinbelow.

The term "monoclonal antibodies" as used herein refers to an immuno globulin composition produced by a clonal population (or clone) derived through mitosis from a single antibody- producing cell. Unless otherwise indicated, the term is not intended to be limited to antibodies of any particular mammalian species or isotype or to antibodies prepared in any given manner. The term is intended to include whole antibody molecules as well as antigen- binding fragments (e.g., Fab, F(ab') 2 , Fc, Fc 1 ). An "antibody-producing cell line" is a clonal population or clone derived through mitosis of a single antibody-producing cell- capable of stable growth in .vitro for many generations. The term "cell line" refers to individual cells, harvested cells, and cultures containing cells so long- as they are derived from cells of the cell line referred to. Preferably the cell lines remain viable for at least about six months and maintain the ability to produce the specified monoclonal antibody through at least about 50 passages.

As used herein, the term "incubation" means contacting antibodies and antigens under conditions (e.g., proper pH, temperature, time, medium, etc.) that allow for the formation of antigen/antibody complexes also referred to as immune complexes. Also as used herein, "separating" refers to any method, usually washing, of separating a composition from a test support or immobilized antibody, such that any unbound antigen or antibody in the composition are removed and any immune complexes on the support remain intact. The selection of the appropriate incubation and separation techniques is within the skill of the art.

HLA class II DR-beta genes have been isolated Υrom HLA- typed IDDM patients and HLA-matched controls and have been sequenced,

resulting in regions of specific nucleotide and amino acid sequence which occur in various combinations and which are associated with IDDM. These specific sequences can be used in DNA or -protein diagnostic procedures to determine genetic susceptibility to IDDM. Four variant DR-beta sequences A-D found to be associated with IDDM are shown below. In each case, DNA sequences seen in the diabetic genomes produce an alteration in one to three amino acid residues (underlined) of the DR-beta protein. The amino acids normally found in these positions are shown in parentheses. Sequences A-C are from the DR-beta-III region whereas sequence D is from the DR- beta-I region. Sequence D encompasses the "I-DE" (isoleucine, aspartic acid, glutamic acid at positions 68, 71 and 72) epitope discussed infra.

A. ...Gl uLeuArgLysSerGl u.. ...GAGCTGCGTAAGTCTGAG

...CTCGACGCATTCAGACTC

(Val ,Ser,Leu,Pro,Asp,Ala)

B. ...GluGluPheLeuArgPhe..

...GAGGAGTTCCTGCGCTTC.... ...CTCCTCAAGGACGCGAAG

(Tyr,Asn,Ser,Asp)

C. ...ProValAlaGluSerTrp... ...CCTGTCGCCGAGTCCTGG... ...CCGCAGCGGCTCAGGACC... (Asp.Ser) (Tyr)

D. ...AspIleLeuGluAspGluArg... ...GACATCCTGGAAGACGAGCGG... ...CTGTAGGACCTTCTGCTCGCC...

(Leu,Phe) (Gin,Arg,Glu) (Lys,Arg,Ala) Table I below shows the IDDM susceptibility and DR-beta variation within the DR3 and DR4 haplotypes. Table II shows the correlation between the haplotypes and sequences A-D identified above. Sequences A, B -and C are correlated with B8, DR3 vs. non B8, DR3 haplotypes.

European Patent Publication No. 237,362, discussed above, discloses a general method of analyzing allelic sequence variation by using allele-specific oligonucleotide (ASO) probes to hybridize to PCR amplified DNA in a dot blot format. That publication lists some specific DNA and protein translation sequences derived from PCR cloning of several HLA class II (e.g., DR-beta, DQ-alpha, and DQ-beta) genes from a variety of HLA- typed individuals, either IDDM patients or controls. Some of such DNA sequences, identified by PCR and detectable by PCR/dot blot/ASO analysis, can function as useful markers for disease susceptibility or differential diagnosis. One such informative set of DNA and translation sequences are the DQ-beta sequences shown below in Tables III and IV, whicih respectively list the. DNA and amino acid translation sequences for a number of allelic variants in the HLA-DQ-beta region. The designations DR4 DR4 1 and DR4" therein are equivalent respectively to the terms DQB3.2, DQB3.1 and DQB4 (blank).

TABLE III

HLA-DQ-beta (segment A):

20 25 30

GlyThrGluArgValArgGlyValThrArgHisIleTyr GGGACGGAGCGCGTGCGGGGTGTGACCAGACACATCTAT DRl TTA T DR4 ' TCT T DR2,4 TCT— A -- DR6 C T DR8 C T T DR4" A TCT G AG DR3 ,7

DXB : A C G T

HLA-DQ-beta (segments B and D):

+— segment D— + + segment B + 45 50 -55 57 60

Val GlyVal TyrArgAlaVal ThrProGl nGlyArgProVal AlaGl uTyrTrpAsn GTGGGGGTGTACCGGGCAGTGACGCCGCAGGGGCGGCCTGTTGCCGAGTACTGGAAC DRl C— G~ A DR2

- G — DR6 T G T C CC DR4 A --G T -C AC DR4 ' T G T T— AC T DR8 A— T G T--T T CC— DR3,7

DXB : — T A— T— A — G CGA--T AGCA-C-AG— C HLA-DQ-beta (segment C):

66 70 75

GluValLeuGluGlyAlaArgAlaSerValAspArgVal GAAGTCCTGGAGGGGGCCCGGGCGTCGGTGGACAGGGTG DRl A GA-T- C DR2 A--A- GA-T C DR4,6 and DR4'

--CA- —A— A- CC DR8

—CA A— AAA G DR3 ,7

--CT--T CA— AG CG A

TABLE IV

Alignment of HLA-DQ-beta Protein Sequence

Exon- 2:

20 40 60 80

DCBPG DFVYQFKGMCYFTNGTERVRLVTRYIYNREEYARFDSDVGVYRAVTPLGPPAAEYWNSQK EVLERTRAELDTVCRHNYQLELRTTLQRR DQB3.2(4)

JoanP - - — DQB3.2(4,3)

MMCC4 - - - DQB3.2(4)

NIN Y - — E D— DQB3.1(4')

ARC G - RLD DI--ED--SV— (DR8)

10 DO 96 G -E RLD -DI--ED— SV-- (DR8,7)

DQBS3 — -F- --L— G - RLD DI--ED--SV- - DQB4(*)(4")

DCBP1 S-S VV -EF L--L DI — --AV-R (DR3.6)

D096 S-S IV --EF L— L — DI — — AV-R (DR7.8)

CMCC3 S-S IV EF L— L- — DI — K— AV-R— (DR3)

15 DCBS1 — — S-S V EF L--L— -DI — K— AV-R (DR3)

DQBC — L G— H V Q-R-V GA-SV-R EVAY-GI (DRl)

DCBP2 L G— H V Q-R-V- - GA— SV-R EVAY-GI (DRl, 6)

TAB Y DV - — Q-R-D- DI (DR8)

DQBH — F —Q-R-D G EVAF-GI (DR2)

20 PC I- ' — - —Q-R-D G— (DR2)

CMCC6 - - — H - --Q-R-V— EVGY-GI (DR6)

DXB __L G-A G EFQ— E-RSI -D— Y-DF-QE— AV-K EA Q

++

A

25 blank

Specificall , sequences for two variants of the serologically defined DR4 halotype listed as DR4 (DQB3.2) and DR4' (DQB3.1) in Tables III and IV are particularly informative. This DQ- beta subdivision of the DR4 haplotype was correlated with a DQ-beta Rsal RFLP in which one DR4 variant had a Rsal fragment of 1.8 kb and another had a Rsal fragment of 1.5 kb (Arnheim et al., supra). Among DR4 individuals the 1.5 kb variant (DR4) was found to be positively associated with IDDM whereas the 1.8 kb fragment (DR4 1 ) was found to be negatively associated with IDDM as indicated by Table A below.

TABLE A

Rel at ve IDD Controls Risk

15/57 7.9 10/57 10.8

5/57 0.47

Relative Risk = (# patients with marker) (# controls without marker) (# patients without marker) (# controls with marker)

The DR4 (DQB3.2) and DR4' (DQB3.1) sequences differ by five nucleotide substitutions which result in three different amino acid changes. Two of the changes (Gly to Glu at codon 45 and Ala to Asp at codon 57) are non-conservative and have major charge differences. The presence of a valine or alanine at codon 57 relative to aspartic acid is considered to be positively associated with susceptibility to IDDM.

One of the most significant differences between the DQ-beta alleles seen in the DR4 haplotypes, designated DQB3.1 and DQB3.2, is found at position 57. (The numbering used herein corresponds to the amino acids in the processed protein.) The allele in -the DQB3.1 haplotype, which is not associated with IDDM susceptibility, has an aspartate residue at position 57, whereas the allele in the DQB3.2 haplotype, which is strongly associated with IDDM susceptibility, has an alanine residue at position 57. Further, the DR3 haplotype which

is positively associated with IDDM, has an alanine at position 57. The DR2 haplotype, negatively associated with IDDM, also has an aspartate residue at position 57.

Also, it was found that of two DQ-beta alleles in the DRw6 haplotypes, the allele positively associated with IDDM has a valine at position 57, whereas the allele negatively associated with IDDM has again an aspartate at position 57. Still further, it was found that the DRl haplotype which is moderately associated with IDDM has a valine at position 57 in the DQ-beta locus. Based on such observations, it was determined that the pattern of amino acid variation at position 57 parallels the pattern of susceptibility to IDDM. Alleles containing alanine (hydrophobic residue) at position 57 are most highly associated with IDDM, alleles containing valine (hydrophobic residue) at position 57 being moderately associated with IDDM, and alleles containing aspartate (charged residue) at 57 being negatively associated with IDDM.

A major exception to this pattern is the DQB2 allele of the DR7 haplotype which has an alanine at position 57 as does the DR3 haplotype but unlike DR3, is not associated with IDDM susceptibility. Such a pattern extends to other genes also. For example, an allele of the DR-beta-III gene within- the DR3 haplotype, correlated with increa-sed IDDM susceptibility, has a valine at position 57, whereas most of the other DR-beta alleles have aspartate at position 57. Thus, it was determined that the hydrophobic valine at position 57 of the DR-beta-III allele is associated with IDDM susceptibility as well as the hydrophobic alanine in the DQ-beta allele.

Table V below summarizes the sequences found around position 57 in a number of genes. An asterisk in Table V indicates those haplotypes associated with greatest IDDM susceptibility.

23 TABLE V (Cont'd)

I-E-beta: Position 57 *

... GRPDAEN ...

DP-beta:

GRPAAEY — DE— E—

— DED-

Table V indicates that three amino acid sequences encoded by DQ-beta alleles are associated with IDDM susceptibility. These sequences and the nucleotide sequences encoding them are listed in Table VI below. Also listed therein are the nucleotide and amino acid translation sequences about position 57 indicative of alleles negatively associated with IDDM susceptibility. Codon 57 is underlined therein.

TABLE VI

IDDM Allele Sequence Association

DQ-beta (DR4): Gly Pro Pro Ala Ala ++

(DQw3.2) GGG CCG CCT J3CC_ GCC

DQ-beta (DR3) Gly Leu Pro Ala Ala ++

GGG CTG CCT _GCC_ GCC DQ-beta (DRl, 6): Gly Arg Pro Val Ala

GGG CGG CCT J2TT GCC

DQ-beta (DR4): Gly Pro Pro Asp Ala (DQw3.1) GGG CCG CCT GAC GCC

DQ-beta (DR2): Gly Arg Pro Asp Ala GGG CGG CCT GAT GCC

Tlje sequences in Table VI are considered to be marker DNA sequences from HLA DQ-beta alleles that can be used to directly detect the identity of the codon at position 57 that encodes for alanine, valine or aspartate.

Although sequences around position 57 in the DQ-beta protein are those most positively associated with IDDM susceptibility, marker sequences directly hybridizing thereto may not be optimal for inclusion in oligonucleotide probes. Since the DQ-beta alleles differ in other areas and are closely associated with other loci, such as DQalpha alleles, the identity of the amino acid at position 57 of the DQ-beta alleles can be determined by indirect ASO analysis through use of one or more oligonucleotide probes that hybridize to other regions of the DQ-beta, DQ-alpha or other HLA D regions. Further, such identification can be achieved by using one or more probes wherein one probe hybridizes to the region around position 57 and wherein one or more other probes hybridize to another HLA region in linkage disequ librium therewith.

For example, the DR3 haplotype is strongly associated with IDDM susceptibility and contains an alanine at position 57 in DQ- beta. Although this region can be detected directly with an ASO probe, such as GH74, the G/C-rich nature of that segment of DQ-beta leads to less than optimal probes because of the base mismatching potential associated therewith. The GH70 ASO probe, however, is specific for a region about 90 bp upstream from codon 57 in an area that is not as G/C-rich and thus avoids the base mismatching problem of a direct probe for the position 57 segment. Thus, the binding of the GH70 ASO probe identifies indirectly the DQ-beta allele wherein codon 57 encodes alanine. Probes even farther away from the DQ-beta codon 57 position can also be used. For example, the GH66 ASO probe is specific for the DR3 allele of the DQalpha locus, located about 12 kbp away from DQ- beta. However, as the DR3 allele of DQalpha locus has been shown to be consistently linked with- the DR3 allele of the DQ-beta locus, the binding of the GH66 probe identifies not only the DQalpha DR3 allele, but also the DQ-beta DR3 allele. The use of the DQalpha locus also provides for discrimination between the IDDM susceptible DR3 haplotype and the less .susceptible DR7 haplotype.

Susceptibility to other autoimmune diseases may also be related to codon 57 polymorphism. The DRw6 susceptibility to Pemphigus vulgaris is associated with a rare DQ-beta allele (DQB1.3) which differs from the non-susceptible alleles DQB1.2 and DQB1.1 only by a charge variation at position 57 and is correlated with the Dw9 DRw6 subtype. Similarly, a DP-beta allele found thus far only in celiac disease patients differs from an allele found in a homozygous typing cells (HTC) by an Ala-Asp substitution at position 57. Celiac disease is a digestive disorder characterized by a malabsorption syndrome affecting both children and adults precipitated by the ingestion of gluten-containing foods; its etiology is unknown but a hereditary factor has been implicated.

Described in Example III is the sequencing of the polymorphic second exon of the DR-beta-I, DR-beta-II and DQ-beta loci from three PV patients to discern any possible disease association with specific polymorphic class II epitopes. In the DQ-beta loci, 3 of 4 DR4 haplotypes contained the DQB3 present on 60-80% of control DR4 haplotypes, and one of the fo" haplotypes contained the DQB3.1 allele, present on about 20-40% of - control DR4 haplotypes. Erlich, et al., The Molecular Analysis of Histocompatibility Antigens, pp. 93-109 (Schacter et al., eds., 1987); Arnheim et al., PNAS (USA), (1985) 82_: 6970- 6974; Kim et al., PNAS (USA), (1985) 82_ : 8139-8143. The two DR5 haplotypes also contained the DQB3.1 allelic sequence variant present on all .control DR5 haplotypes. Thus, the distribution of DQ-beta alleles was essentially the same in patients, and in DR-matched controls. In this small .sample, all three patients were DQB3.1/DQB3.2 heterozygotes.

In the DR-beta-I locus, however, a potentially interesting pattern could be discerned. All three PV patients contained a DR4 haplotype with a DR-beta-I allelic sequence variant associated with the MLC-defined subtype, DwlO. In the U.S. population the frequency of the DwlO subtype among DR4 haplotypes is estimated to be approximately 10%. Hansen et al., Brit. Med. Bull. (1987) 43_:203 216. This observation was confirmed using oligonucleotide probe- rather than sequence analysis with virtually 100% of DR4 PV patients

containing the DwlO epitope. Since this epitope is associated with the MLC-defined type, DwlO, it is likely to be recognized by the T- cell receptor. These results suggest that for DR4 associated susceptibil ty the amino acid residues isoleucine, aspartic acid and glutamic acid at positions 68, 71, and 72 of the DR4 DR-beta-I chain play a role in PV autoimmunity. Such residues define an epitope herein identified as the "I-DE" epitope. See Example III.

As indicated in Example IV, the same "I-DE" epitope around codon 70 is also present on a subset of DRw6 haplotypes, but this epitope was not shown to be positively associated with PV in such DRw6 haplotypes. As the "I-DE" shared epitope cannot account for the DRw6 suscepti ility, further research was performed to find sequences conferring PV susceptibility within the DRw6 haplotype by determining the sequences of the DQ-beta alleles of two DR5/DRw6 PV patients by methods exemplified in Example III. Both patient DRw6 haplotypes were found to contain a previously unknown DQ-beta allele, which was designated DQB1.3 (see HU129 sequence of Figure 4). The DQB1.3 allele differs from the DRl DQ-beta allele, DQB1.1 by only a valine to aspartic acid substitution at position 57. Analogously, it differs from the rare DR2 AZH DQ-beta allele, DQB1.2, by only a serine to aspartic acid substitution at position 57. At the nucleotide level, the DQB1.3 allele is identical in the region around position 57 to the DR2 Dwl2 (DQB1.5) allele, DQ-beta, and to the most common DRw6 DQ-beta allele DQB1.6. To determine -the frequency of the DQB1.3 allele among PV patient and control DRw6 haplotypes, pairs of sequence-specific oligonucleotide (SSO) probes were used to identify both the DRl-like DQ-beta framework sequence and the sequence around codon 57. Exemplary SSO probes are as follows: (1) GH69 a 21-mer which identifies a DRl-like DQ-beta framework and has the nucleotide sequence GATGTGTCTGGTCACACCCCG;

(-2) GH60 a 19-mer which identifies a DRw6-like framework and has the- nucleotide sequence TCTTGTAACCAGACACATC;

(3) CRX03 a 19-mer which identifies a sequence about codon 57 wherein codon 57 encodes for aspartic acid and has the nucleotide sequence TCGGCGTCAGGCCGCCCCT; and

(4) CRX02 a 19-mer which identifies a sequence around codon 57 wherein codon 57 encodes for valine and has the nucleotide sequence

TCGGCAACAGGCCGCCCCT.

Using the above-designated probes, a pattern of hybridization can be used to identify and distinguish specific alleles. The following chart exemplifies the use of such a method wherein a plus sign (+) indicates hybridization of the SSO probe to the target DNA sample.

Probes

GH69 GH80 CRX03 CRX02

Al 1 el es Dw91.3 - + +

1.1 + +

Dwlδ 1.6 + +

Dwl91.7 + +

This approach revealed that 11 of the 13 DRwδ patient haplotypes (85%) contained the DQB1.3 allele, whereas only one of the 13 control DRw6 haplotypes (8%) contained the DQB1.3 allele. The other two DRw6 patients had DQ-beta alleles that had the DRl-like framework (identified by GH69) but failed to hybridize with either the CRX03 or the CRX02 probes. ' The findings indicate that the DRw6 associated PV susceptibility could be conferred by the rare DQ-beta allele DQB1.3 that differs from the common DQ-beta allele DQB1.1 by only one residue. Such a single charge difference of the polymorphic residue at position 57 of the DQ-beta chain associated with the DRw6 associated PV susceptibility correlates with that found for the DR4 and DR3 associated susceptibility for IDDM. However, in the case of

PV, it is clear that it is the allele rather than the epitope around position 57 that confers susceptibility because the most common DRw6 DQ-beta allele that is, DQB1.6, which is not associated with PV, has the same sequence around position 57. Because the DQB1.6 allele differs at other regions of its sequence (see Figure 4), it is possible to differentiate the DQB1.6 common allele from the rare DQB1.3 allele by the use of pairs of SSO probes, for example, as indicated immediately above, the GH69, GH80, CRX03, and CRX02 probes or ones substantially similar thereto. The conclusion that the novel DQ-beta allele accounts for the DRw6 associated susceptibility to PV is consistent with DQ-beta RFLP analysis of Sfazer et al., supra (1987).

Although the pattern of DQ-beta allelic variation clearly implicates position 57 in autoimmune predisposition, this region does not appear to be the only class II epitope within susceptibility conferring haplotypes which contribute to autoimmune disease. No class II sequences have been found to be uniquely associated with

IDDM. That observation suggests that "normal" class II alleles confer susceptibil ty, or that the susceptibility genes reside elsewhere in the MHC. Given the estimates of penetrance and concordance (50% for monozygetic twins and 25% for HLA identical sibs) for IDDM, Henson et al., Mol. Biol. Med. (1986), 3_:129-136, it is not surprising that some unaffected individuals contain putative class II susceptibility genes.

It appears that some environmental "triggering" agent, such as viral infection, is required for the disease to develop in susceptible individuals. The homology between DQ-beta alleles and rubella, a virus implicated in IDDM pathogenesis suggests a viral triggering mechanism.

Viruses have evolved mechanisms to evade their hosts immune defenses (McChesney et al., supra; Srinivasappa et al., supra), and some of these mechanisms appear to involve mimicry of vital MHC epitopes. ϋomology does exist between the HLA-DQ proteins and human viral pathogens. See Example VI. Table VII summarizes the major homologies observed to the Epstein-Barr virus (EBV), the genome for which has been completely s.equenced. Baer et al , Nature, 310:207-211.

PPAAEY 73713 R3 1374 B0LF1

* ' (repeated 12 times as part of the 3072 bp 'IRΓ repeat) ** (directly repeated 6 times)

Only one homology was seen between the peptide and potential epitope centered at position 57 in the DQB1.4 (DR2) allele and the entire EBV genome. However, six matches were seen to the peptide from the DRB2 (DR3) allele, and 21 were seen to the GPPAA peptide from the DQB3.2 (DR4) allele. Many of the later homologies were found in repeated segments of the EBV genome, including one segment of five amino acids directly repeated six times. In addition, the El envelope protein of rubella (Nakhasi et al., J. Biol. Chem., (1986), 261:16616- 16621), a virus implicated in IDDM pathogenesis (Rubenstein et al . , Diabetes, (1982), 3_1_: 1088- 1091), contains the GPPAA peptide at position 261. Exposure of a fetus to the rubella virus leads to a congenital infection, and a high risk for diabetes. Based on the homology between viral pathogens (such as EBV and rubella) and the DQ- beta peptide Gly Pro Pro Ala Glu, such region could also serve as a target for molecular mimicry Oldstone et al., supra with an immune

response to an infecting virus possibly leading to an attack on self cells.

Further, viruses other than EBV and rubella have been implicated with IDDM such as Coxsackie and cytomegalovirus (CMV) (Yoon et al . , supra) .

The viruses are likely mimicking HLA class II genes, and in particular the HLA DQ-beta genes around codon 57, delaying or modifying the onset of an effective immune response. If the immune system actually responds to the mimicked HLA epitope, then normal regulation of the immune system could be perturbed, possible leading to autoimmune disease.

Autoimmune disease could result from a series of factors: 1) inheritance of HLA alleles which are being mimicked by viruses, 2) infection by a virus mimicking the host's HLA class II alleles, 3) an immune response by the host to the mimicked epi topes, 4) perturbation of immune regulation of autoimmune responses, 5) development of an autoimmune response, and 6) progressive tissue destruction leading to an autoimmune disease.

An example of this mechanism could involve a person inheriting the DQB3.2 allele correlating with increased risk to IDDM followed by infection by the rubella virus, which is also correlated with IDDM, and which contains an epitope in its El envelope protein which specifically mimics a portion of the DQB3.2 protein. After infection by the virus, the person may elicit an immune response involving antibodies or T cells directed against the El protein, and cross-reactive to the DQB3.2 protein. This antibody or T cell response interferes with the normal function of the HLA DQ protein, leading to an autoimmune response and IDDM. The target of the final autoimmunity may be determined by the location of the viral infection, in this case the beta cells of the islets of Langer ans. Increased degrees of risk could be ascertained as an individual is shown to have the DQB3.2- allele, followed by infection by the rubella virus, followed by the appearance of anti-rubella antibodies cross-reactive to the HLA DQ-beta protein.

Diagnostic tests to determine whether an individual with alleles associated with an autoimmune susceptibility has been infected by a virus correspondingly associated with susceptibility to an autoimmune disease (based on the virus having homology to segments of the HLA class II alleles) coul be used to determine an effective prophylactic or therapeutic treatment plan for such individual. For example, vaccines, immunotoxins, immuno-antigens, peptides corresponding to the epitope of the mimicked region, or anti-idiotype antibodies could then be used to prevent or reverse an immune response against epitopes presented by the pathogenic organisms.

An example of such a diagnostic test could be detecting an immune response (i.e., antibody production or reactive immune cells) as reported in Schwinnbeck et al . , J . Exp . Med . , 166:173-181 (1987), where antibodies against a portion of the class I HLA molecule B27 are detected by attaching a synthetic peptide to a solid support, treating it with a dilution of the test serum, washing the solid support, and then testing ' whether any antibodies are retained. Other examples of such a diagnostic test could be either by directly probing for the viral DNA genome, as is done with the HIV (AIDS) virus, in Kwok et al., J. Virol., _61_:1690-1694 (1987), or by indirectly assaying for antibodies arising from infection, as in agglutination tests for CMV antibodies.

The DR4, DwlO DR-beta-I allele is associated with susceptibility to both PV and to IDDM. That allele contains the nucleotide sequence which encodes the I-DE amino acid sequence in the third hypervariable region (HV3; segment D in Figure 5) around positions 68-72 of the DR-beta-I chain. Another DR4 DR-beta-I allele, Dw4, also polymorphic in the same hypervariable region (HV3; segment D in Figure 5), has the sequence GGAGCAGAAGCGGGCCGCG around positions 68-72, and is also associated with IDDM. Thus, most DR4+ IDDM patients are either DR4, Dw4 or DR4, DwlO, indicating that both the DQ-beta allelic variants (see above) and the DR-beta-I allelic variants contribute to autoimmunity. The Dw4 variant can be distinguished from the other DR4, DR-beta-I alleles by sequence specific oligonucleotide (SSO) analysis.

The DR4 haplotype, Dw4 subtype is also associated with rheumatoid arthritis (RA), as is DR4, Dwl4. Roudier et al., Abstract from American Rheumatism Association (Western Region) Meeting in San Diego, CA, Nov. 5-7, 1987, page 15. A hexapeptide from the HV3 region of the DR-beta-I chain at amino acids 69-74, that distinguishes Dw4 from the other DR4, DR-beta-I alleles is shared by the Epstein-Barr virus (EBV) open reading frame BALF4 and may serve as a target for molecular mimicry.

The above-mentioned DNA sequences may be detected by DNA hybridization probe technology. In one example, which is not exclusive, the sample suspected of containing the genetic marker is spotted directly on a series of membranes and each membrane is hybridized with a different labeled oligonucleotide probe that is specific for the particular sequence variation. One procedure for spotting the sample on a membrane is described by Kafotos et al . , Nucleic Acids Research, 7_:1541-1552 (1979).

Briefly, the DNA sample affixed to the membrane may be pretreated with a prehybridization solution containing sodium dodecyl sulfate, Ficoll, serum albumin and various salts prior to the probe being added. Then, a labeled oligonucleotide probe that is specific to each sequence to be detected is added to a hybridization solution similar to the prehybridization solution. The hybridization solution is applied to the membrane and the membrane is subjected to hybridization conditions that will depend on the probe type and length, type and concentration of ingredients, etc? Generally ? hybridization is carried out at about 25-75°C, preferably 35 to 65°C, fdr 0.25-50 hours, preferably less than three hours. The greater the stringency of conditions, the greater the required complementarity for hybridization between the probe and sample. If the background level is high, stringency may be increased accordingly. The stringency can also be incorporated in the wash.

After the hybridization the sample is washed of unhybridized probe using any suitable means such as by washing one or more times with varying concentrations of standard saline phosphate EDTA (SSPE)

(180 mM NaCl, 10 M Na 2 HP0 4 and 1 M EDTA, pH 7.4) solutions at 25-75°C for about 10 minutes to one hour, depending on the temperature. The label is then detected by using any appropriate detection techniques.

The sequence-specific oligonucleotide that may be employed herein is an oligonucleotide that may be prepared using any suitable method, such as, for example, the organic synthesis of a nucleic acid from nucleoside derivatives. This synthesis may be performed in solution or on a solid support. One type of organic synthesis is the phosphotriester method, which has been utilized to prepare gene fragments or short genes. In the phosphotriester method, oligonucleotides are prepared that can then be joined together to form longer nucleic acids. For a description of this method, see Narang, S. A., et al., Meth. Enzymol., 68, 90 (1979) and U.S. Patent No. 4,356,270. The patent describes the synthesis and cloning of the somatostatin gene.

A second type of organic, synthesis is the phosphodiester method, which has been utilized to prepare tRNA gene. See Brown, E. L., et al., Meth. Enzymol., 68, 109 (1979) for a description of this method. As in the phosphotriester method, the phosphodiester method involves synthesis of oligonucleotides that are subsequently joined together to form the desired nucleic acid.

Automated embodiments of these methods may also be employed. In one such automated embodiment diethyl phosphora idites are used as starting materials and may be synthesized as described by Beaucage et al., Tetrahedron Letters, 22_:1859-1862 (1981). One method for synthesizing oligonucleotides on a modified solid support is described in U.S. Patent No. 4,458,066. It is also possible to use a primer which has been isolated from a biological source (such as a restriction endonuclease digest). The sequence-specific oligonucleotide must encompass the region of the sequence which spans the nucleotide variation being detected an-d must be specific for the nucleotide variation being detected. For example, four oligonucleotides may be prepared, each of vhich contains the nucleotide sequence site characteristic of each of

the four DNA sequences herein. Each ol gonucleotide would be hybridized to duplicates of the same sample to determine whether the sample contains one or more of the regions of the locus where allelic variations may occur which are characteristic of IDDM or PV. The length of the sequence-specific oligonucleotide will depend on many factors, including the source of oligonucleotide and the nucleotide composition. For purposes herein, the oligonucleotide typically contains 15-25 nucleotides, although it may contain more or fewer nucleotides. While ol gonucleotides which are at least 19-mers in length may enhance specificity and/or sensitivity, probes which are less than 19-mers, e.g., 16-mers, show more sequence-specific discrimination, presumably because a single mismatch is more destabilizing. If amplification of the sample is carried out as described below prior to detection with the probe, amplification increases specificity so that a longer probe length is less critical, and hybridization and washing temperatures can be lowered for the same salt concentration. Therefore, in such as case it is preferred to use probes which are less than 19-mers.

Where the sample is first placed on the membra'ne and then detected with the oligonucleotide, the ol gonucleotide must be labeled with a suitable label moiety, which may be detected by spectroscopic, photochemical, biochemical, immunochemical or chemical means. Immunochemical means include antibodies which are capable of forming a complex with the oligonucleotide under suitable conditions, and biochemical means include polypeptides or lectins capable of forming a complex with the oligonucleotide under the appropriate conditions. Examples include fluorescent dyes, electron-dense reagents, enzymes capable of depositing insoluble reaction products or being detected chronogenically, such as alkaline phosphatase, a radioactive label

3? such as P, or biotin. If biotin is employed, a spacer arm may be utilized to attach it to the oligonucleotide. Preferably, the labels used are non-radioactive.

In a "reverse" dot blot format, a labeled sequence-specific oligonucleotide probe capable of hybridizing with one of the DNA

sequences is spotted on (affixed to) the membrane under prehybridization conditions as described above. The sample is then added to the pretreated membrane under hybridization conditions as described above. Then the labeled oligonucleotide or a fragment thereof is released from the membrane in such a way that a detection means can be used to determine if a sequence in the sample hybridized to the labeled oligonucleotide. The release may take place, for example, by adding a restriction enzyme to the membrane which recognizes a restriction site in the probe. This procedure, known as oligomer restriction, is described more fully in EP Patent Publication 164,054 published December 11, 1985.

Alternatively, a sequence specific oligonucleotide immobilized to the membrane could bind or "capture" a target DNA strand ( PCR-ampl ified). This "captured" strand could be detected by a second lableed probe. The second oligonucleotide probe could be either locus-specific or allele-specific.

In an alternative method for detecting the DNA sequences herein, the sample to be analyzed is first amplified using DNA polymerase, four nucleotide tri phosphates and two primers, by a process termed the polymerase chain reaction. Briefly, this amplification process involves the steps of:

(a) treating a DNA sample suspected of containing one or more of the four IDDM genetic marker sequences, together or sequentially, with four different nucleotide tri phosphates, an agent for polymerization of the nucleotide tri phosphates, and one deoxyribonucleotide primer for each strand of each DNA suspected of containing the IDDM or PV genetic markers under hybridizing conditions, such that for each DNA strand containing each different genetic marker to be detected, an extension product of each primer is synthesized which is complementary to each DNA strand, wherein said primer(s) are selected so as to be substantially complementary to each DNA strand , containing each different genetic marker, such that the extension product synthesized from one primer, when it is separated from its complement, can serve as a template for synthesis of the extension product of the other primer;

(b) treating the sample under denaturing conditions to separate the primer extension products from their templates if the sequence (s) to be detected are present; and

(c) treating the sample, together or sequentially, with said four nucleotide tri phosphates, an agent for polymerization of the nucleotide tri phosphates, and oligonucleotide primers such that a primer extension product is synthesized using each of the single strands produced in step (b) as a template, wherein steps (b) and (c) are repeated a sufficient number of times to result in detectable amplification of the nucleic acid containing the sequence(s) if present.

The sample is then affixed to a membrane and detected with a sequence-specific probe as described above. Preferably, steps (b) and (c) are repeated at least five times, and more preferably 15-30 times if the sample contains human genomic DNA. If the sample comprises cells, preferably they are heated before step (a) to expose the DNA therein to the reagents. This step avoids extraction of the DNA prior to reagent addition.

In a "reverse" dot blot format, at least one of the primers and/or at least one of the four nucleotide tri phosphates used in the ampl fication chain reaction is labeled with a detectable label, so that the resulting amplified sequence is labeled. These labeled moieties may be present initially in the reaction mixture or added during a later cycle. Then an unlabeled sequence-specific oligonucleotide capable of hybridizing with the amplified sequence(s), if the sequence (s) is/are present, is spotted on (affixed to) the membrane under prehybridization conditions as described above. The amplified sample is then added to the pretreated membrane under hybridization conditions as described above. Finally, detection means are used to determine if an amplified sequence in the DNA sample has hybridized to the oligonucleotide affixed to the membrane.

Hybridization will occur only if the membrane-bound sequence containing the variation is present in the amplification product.

As indicated above, variations of this method include use of an unlabeled PCR target, an unlabeled immobilized allele-specific probe and a labeled oligonucleotide probe in a sandwich assay.

The amplification method provides for improved specificity and sensitivity of the probe; an interpretable signal can be obtained with a 0.04 icrogram sample in six hours. Also, if the amount of sample spotted on a membrane is increased to 0.1-0.5 micrograms, non- isotopically labeled oligonucleotides may be utilized in the amplification process rather than the radioactive probes used in previous methods. Finally, as mentioned above, the amplification process is applicable to use of sequence-specific oligonucleotides less than 19-mers in size, thus allowing use of more discriminatory sequence-specific oligonucleotides.

In a variation of the amplification procedure, a thermostable enzyme, such as one purified from Thermus aquaticus, may be utilized as the DNA polymerase in a temperature -eye led chain reaction. The thermostable enzyme refers to an enzyme which is stable to heat and is heat resistant and catalyzes (facilitates) combination of the nucleotides in the proper manner to form the primer extension products that are complementary to each DNA strand.

In this latter variation of the technique, the primers and nucleotide triphosphates are added to the sample, the mixture is heated and then cooled, and then the enzyme is added, the mixture is then heated to about 90-100°C to denature the DNA and then cooled to about 35-40°C, and the cycles are repeated until the desired amount of amplification takes place. This process may also be automated. The amplification process using the thermostable enzyme is described ore fully in European Patent Publication 258,017, published March 2, 1988.

The invention herein also contemplates a kit format which comprises a packaged multi container unit having containers for each labeled sequence-specific DNA probe. The kit may optionally contain a means to detect the label (such as an avi din-enzyme conjugate and enzyme substrate and chromogen if the label is biotin). In addition, the kit may include a container that has a positive control for the

probe containing one or more DNA strands with the sequence to be detected and a negative control for the probe that does not contain the DNA strands having any of the sequences to be detected.

One method for detecting the amino acid sequences in a protein sample that are associated with IDDM or PV involves the use of an immunoassay employing one or more antibodies that bind to one or more of the four amino acid sequences. While the antibodies may be polyclonal or monoclonal, monoclonal antibodies are preferred in view of their specificity and affinity for the antigen. Polyclonal antibodies may be prepared by well-known methods which involve synthesizing a peptide containing one or more of the amino acid sequences associated with IDDM or PV, purifying the peptide, attaching a carrier protein to the peptide by standard techniques, and injecting a host such as a rabbit, rat, goat, mouse, etc. with the peptide. The sera are extracted from the host by known methods and screened to obtain polyclonal antibodies which are specific to the peptide immunogen. The peptide may be synthesized by the solid phase synthesis method described by Merrifield, R. B., Adv. Enzymol. Re! at. Areas Mol. Biol., 3_:221-296 (1969) and in "The Chemistry of Polypeptides" (P. G. Katsoyannis, ed.), pp. 336-361, Plenum, New York- (1973), the disclosures of which are incorporated herein by reference. The peptide is then purified and may be conjugated to keyhold limpet hemocyanin (KLH) or bovine serum albumin (BSA). This may be accomplished via a sulfhydryl group, if the peptide contains a cysteine residue, using a heterobi functional crosslinking reagent such as N-maleimido-6-am no caproyl ester of 1- hydroxy-2-nitrobenzene-4-sulfonic acid sodium salt.

The monoclonal antibody will normally be of rodent or human origin because of the availability of murine, rat, and human tumor cell lines that may be used to produce immortal hybrid cell lines that secrete monoclonal antibody. The antibody may be of any isotype, but preferably an IgG, IgM or IgA, most preferably an IgG2a.

The murine monoclonal antibodies may be produced by immunizing the host with the peptide mentioned above. The host may be

inoculated intraperitoneally with an immunogenic amount of the peptide and then boosted with similar amounts of the immunogenic peptide. Spleens or jmphoid tissue is collected from the immunized mice a few days after the final boost and a cell suspension is prepared therefrom for use in the fusion.

Hybridomas may be prepared from the splenocytes or lymphoid tissue and a tumor (myeloma) partner using the general somatic cell hybridization technique of Koehler, B. and Milstein, C, Nature, 56_:495-497 (1975) and of Koehler, B. et al., Eur. J. Immunol., 6_:511- 519 (1976). Preferred myeloma cells for this purpose are those which fuse efficiently, support stable, high-level expression of antibody by the selected antibody- producing cells, and are sensitive to a medium such as HAT medium. Among these, preferred myeloma cell lines are murine myeloma lines, such as those derived from MOPC-21 and MOPC-11 mouse tumors available from the Sal k Institute, Cell Distribution Center, San Diego, California, USA, or P3X63-Ag8.653 (653) and Sp2/0- Agl4 (SP2/0) myeloma lines available from the American Type Culture Collection, Rockville, MD, USA, under ATCC CRL Nos. 1580 and 1581, respectively. Basically, the technique involves fusing the appropriate tumor cells and splenocytes or lymphoid tissue using a fusogen such as polyethylene glycol. After the fusion the cells are separated from the fusion medium and grown on a selective growth medium, such as HAT medium, to eliminate unhybridized parent cells and to select only those hybridomas that are resistant to the medium and immortal. The hybridomas may be expanded, if desired, and supernatants may be assayed by conventional immunoassay procedures (e.g., radioimmunoassay, enzyme immunoassay, or fluorescence immunoassay) using the immunizing agent as antigen. Positive clones may be characterized further to determine whether they meet the criteria of the antibodies of the invention. For example, the antigen-binding ability of the antibodies may be evaluated in v i tro by immunoblots, ELISAs and antigen neutralizing tests.

A preferred procedure for making a hybrid cell line that secretes human antibodies against the amino acid genetic markers is somatic cell hybridization using a mouse x human parent hybrid cell line and a human cell line producing sufficiently high levels of such antibodies. The human cell line may be obtained from volunteers immunized with the peptide(s) described above. The human cell line may be transformed with Epstein-Barr virus (EBV) as described, for example, by Foung, et al., J. Immunol. Methods, 70_:83-90 (1984).

When EBV transformation is employed, the most successful approaches have been either to pre-select the population of B cells to be transformed or to post-select the antigen-specific transformed populations by panning or rosetting techniques, as described by

Kozbar, et al . , Scan. J. Iππunol., _10_:187-194 (1979) and Steinitz, et al., J. Cl n. Lab. Immun., 2_:l-7 (1979). Recently EBV transformation has been combined with cell fusion to generate human monoclonal antibodies (see, e.g., Foung et . al., J. Immun. Meth., 70:83-90

(1984)), due to instability of immunoglobulin secretion by hybridomas when compared to EBV l phobias to id cell lines, and higher frequencies of rescue of the antigen-specific populations. EBV most frequently infects and transforms IgM-bearing B cells, but B cells secreting other classes of Ig can also be made into long-term lines using the

EBV fusion technique, as described by Brown and Miller, J. Immunol.,

128:24-29 (1982).

The cell lines which produce the monoclonal antibodies may be grown in vitro in suitable culture medium such as Iscove's medium, Dulbecco's Modified Eagle's Medium, or RPMI-1640 medium from Gibco, Grand Island, NY, or vn_ vivo in syngeneic or immunodeficient laboratory animals. If desired, the antibody may be separated from the culture medium or body fluid, as the case may be, by conventional techniques such as ammonium sulfate precipitation, hydroxyapatite chromatography, ion exchange chromatography, affinity chromatography, electrophoresis, micro filtration, and ultracentrifugation.

The antibodies herein may be used to detect the presence or absence of the amino acid sequences associated with IDDM in white

blood cells expressing the HLA class I antigens. The cells may be incubated in the presence of the antibody, and the presence or absence and/or degree of reaction (antibody- peptide binding) can be determined by any of a variety of methods used to determine or quantitate antibody/ antigen interactions (e.g., fluorescence, enzyme-linked immunoassay (ELISA), and cell killing using antibody and complement by standard methods). The antibody employed is preferably a monoclonal antibody.

For use in solid phase immunoassays, the antibodies employed in the present invention can be immobilized on any appropriate solid test support by any appropriate technique. The solid test support can be any suitable insoluble carrier material for the binding of antibodies in immunoassays. Many such materials are known in the art, including, but not limited to, nitrocellulose sheets or filters; agarose, resin, plastic (e.g., PVC or polystyrene) latex, or metal beads; plastic vessels; and the like. Many methods of immobilizing antibodies are also known in the art. See, e.g., Silman' et al . , Ann. Rev. Biochem., 35_:873 (1966);- Mel rose, Rev. Pure & App. Chem., 21 ^ :83 (1971); Cuatrecafas, et al . , Meth. Eπz.ym., Vol. 22 (1971). Such methods include covalent coupling, direct adsorption, physical entrapment, and attachment to a protein-coated surface. In the latter method, the surface is first coated with a water- insoluble protein such as zein, collagen, fibrinogen, keratin, glutelin, etc. The antibody is attached by simply contacting the protein-coated surface with an aqueous solution, of the antibody and allowing it to dry.

Any combination of support and binding technique which leaves the antibody immunoreactive, yet sufficiently immobilizes the antibody so that it can be retained with any bound antigen during a washing, can be employed in the present invention. A preferred solid test support is a plastic bead.

In the sandwich immunoassay, a labeled antibody is employed to measure the amount of antigen bound by the immobilized monoclonal antibody. The label can be any type that allows for the detection of the antibody when bound to a support. Generally, the label directly

or indirectly results in a signal which is measurable and related to the amount of label present in the sample. For example, directly measurable labels can include radio! abels (e.g., 25 I, ~ S, 14 C, etc.). A preferred directly measurable label is an enzyme, conjugated to the antibody, which produces a color reaction in the presence of the appropriate substrate (e.g., horseradish peroxidase/o- phenylenedi amine). An example of an indirectly measurable label would be antibody that has been biotinylated. The presence of this label is measured by contacting it with a solution containing a labeled avidin complex, whereby the avidin becomes bound to the biotinylated antibody. The label associated with the avidin is then measured. A preferred example of an indirect label is the avidin/biotin system employing an enzyme conjugated to the avidin, the enzyme producing a color reaction as described above. It is to be understood, however, that the term "label" is used in its broadest sense and can include, for example, employing "labeled" antibodies where the label is a xenotypic or isotypic difference from the immobilized antibody, so that the presence of "labeled" antibodies is detectable by incubation with an anti -xenotypic or anti-isotypic antibody carrying a directly detectable label.

Whatever label is selected, it results in a signal which can be measured and is related to the amount of label in a sample. Common signals are radiation levels (when radioisotopes are used), optical density (e.g., when enzyme color reactions are used), and fluorescence (when fluorescent compounds are used). It is preferred to employ a nonradioactive signal, such as optical density (or color intensity) produced by an enzyme reaction. Numerous enzyme/substrate combinations are known in the immunoassay art which can produce a suitable signal. See, e.g., U.S. Patent Nos. 4,323,647 and 4,190,496. For diagnostic use, the antibodies will typically be distributed in multi container kit form. These kits will typically contain the antibody(ies) in labeled or unlabeled form in suitable containers, any detectable ligand reactive with unlabeled antibody if it is used, reagents for the incubations and washings if necessary, reagents for detecting the label moiety to be detected, such as

substrates or derivatizing agents depending on the nature of the label, product inserts and instructions, and a positive control associated with IDDM or PV, such as a cell containing the HLA class II antigens associated with IDDM or PV. The antibodies in the kit may be affinity purified if they are polyclonal.

The following examples illustrate various embodiments of the invention and are not intended to be limiting in any respect. In the examples all parts and percentages are by weight if solid and by volume if liquid, and all temperatures are in degrees Centigrade, unless otherwise indicated.

EXAMPLE I

This example illustrates how four DR-beta sequences associated with IDDM were identified.

I. Analysis of HLA-DR-beta Sequences Several HLA class II beta genes were isolated from clinical blood samples of diverse HLA-typed IDDM individuals (from University of Pittsburgh clinic and from cell lines from IDDM patients available from the Human Genetic Mutant Cell Repository, Camden, NJ) and non- diabetic controls (homozygous typing cells) using cloning methods. In one such method, which is a standard method, human genomic DNA was isolated from the patient samples using essentially the method of Maniatis et al . , Molecular Cloning: A Laboratory Manual (1982), 280- 281 or prepared from the buffy coat fraction, which is composed primarily of peripheral blood lymphocytes, as described by Saiki et al., Bio/Technology, 3_:1° 08 - 1012 (1985). This DNA was then cloned as full genomic libraries into bacteriophage vectors, as described in Maniatis, supra, pp. 269-294. Individual clones for the HLA-DR-beta genes were selected by hybridization to radioactive cDNA probes (Maniatis et al . , pp. 309-328) and characterized by restriction mapping. ,See U.S. Patent No. 4,582,788 issued April 15, 1986. Individual clones from IDDM patients were assigned to DR-typed haplotypes by comparing the clone restriction map with the RFLP

segregation pattern within the patients' family. Finally, small fragments of these clones representing the variable second exon were subcloned (Maniatis, pp. 390-402) into the M13mpl0 cloning vector, which is publicly available from Boehringer-Mannheim. In an alternative procedure for cloning the genes, amplification of the relevant portion (the second exon) of the gene was carried out as described below.

A total of 1 microgra of each isolated human genomic DNA was amplified in an initial 100 microliter reaction volume containing 10 microliters of a solution containing 100 mM Tris'HCl buffer (pH 7.5), 500 mM NaCl, and 100 mM MgCl 2 , 10 microliters of 10 micromillimeters of primer GH46, 10 microliters of 10 m cromill meters of primer GH50, 15 microliters of 40 mM dNTP (contains 10 mM each of dATP, dCTP, dGTP and TTP) , and 45 microliters of water. Primers GH46 and GH50 have the following sequences:

5 ' -CCGGATCCTTCGTGTCCCCACAGCACG-3 ' (GH46) 5 '-CTCCCCAACCCCGTAGTTGTGTCTGCA-3 ' (GH50)

These primers, having non-homologous sequences to act as linker/primers, were prepared as follows: A. Automated Synthesis Procedures: The di ethyl phosphora idites, synthesized according to Beaucage and Caruthers (Tetrahedron Letters (1981) 22_:1859-1862) were sequentially condensed to a nucleoside derivatized controlled pore glass support using a Biosearch SAM-1. The procedure included detritylation with tr chloroacetic acid in dichloromethane, condensation using benzotriazole as activating proton donor, and capping with acetic anhydride and dimethyl ami nopyri dine in tetrahydrofuran and pyridine. Cycle time was approximately 30 minutes. Yields at each step were essentially quantitative and were determined by collection and spectroscopic examination of the dimethoxytrityl alcohol released during detritylation.

B. Ol godeoxyribonucleotide Deprotection and Purification Procedures: The solid support was removed from the column and exposed to 1 ml concentrated ammonium hydroxide at room temperature for four

hours in a closed tube. The support was then removed by filtration and the solution containing the partially protected ol igodeoxynucleotide was brought to 55°C for five hours. Ammonia was removed and the residue was applied to a preparative polyacrylamide gel. Electrophoresis was carried out at 30 volts/cm for 90 minutes after which the band containing the product was identified by UV shadowing of a fluorescent plate. The band was excised and eluted with 1 ml distilled water overnight at 4°C. This solution was applied to an Altech RP18 column and eluted with a 7-13% gradient of acetonitrile in 1% ammonium acetate buffer at pH 6.0. The elution was monitored by UV absorbance at 260 nm and the appropriate fraction collected, quantitated by UV absorbance in a fixed volume and evaporated to dryness at room temperature in a vacuum centrifuge.

C. Characterization of Oligodeoxyribonucleotides: Test aliquots of the purified oligonucleotides were 32 P labeled with polynucleotide kinase and q- P-ATP. The labeled compounds were examined by autoradiography of 14-20% pol acrylamide gels after electrophoresis for 45 minutes at 50 volts/cm. This procedure verifies the molecular weight. Base composition was determined by digestion of the oligodeoxyribonucleotide to nucleosides by use of venom diesterase and bacterial alkaline phosphatase and subsequent separation and quantitation of the derived nucleosides using a reverse phase HPLC column and a 10% acetonitrile, 1% ammonium acetate mobile phase. The above reaction mixtures were held in a heat block set at

95°C for 10 minutes to denature the DNA. Then each DNA sample underwent 28 cycles of amplification, where each cycle was composed of four steps:

(1) spinning " the sample briefly (10-20 seconds) in microcentrifuge to pellet condensation and transfer the denatured material immediately to a heat block set at 30°C for two minutes to allow primers and genomic DNA to anneal,

(2) adding 2 microliters of a solution prepared by mixing 39 microliters of the Klenow fragment of E. coli DNA Polymerase I (New England Biolabs, 5 units/micro! iters), 39 microliters of a salt mixture of 100 mM Tris buffer (pH 7.5), 500 mM NaCl and 100 mM MgCl 2 , and 312 microliters of water,

(3) allowing the reaction to proceed for two minutes at 30°C, and

(4) transferring the samples to the 95°C heat block for two minutes to denature the newly synthesized DNA, except this reaction was not carried out at the last cycle.

Then the mixtures were stored at -20°C. The following cloning procedure was used for the amplified products.

The reaction mixture was sub-cloned into M13mpl0 by first digesting in 50 microliters of a buffer containing 50 mM NaCl, 10 mM Tris-HCl, pH 7.8, 10 mM MgCl 2 , 20 units Pstl, and 26 units Hindi II at 37°C for 90 minutes. The reaction was stopped by freezing. The volume was adjusted to 110 microliters with a buffer containing Tris-HCl and EDTA and loaded onto a 1 ml BioGel P-4 spin dialysis column. One fraction was collected and ethanol precipitated. The ethanol pellet was resuspended in 15 microliters water and adjusted to 20 microliters volume containing 50 mM Tris'HCl, pH 7.8, 10 mM MgCl 2 , 0.5 mM ATP, 10 mM dithiothreitol, 0.5 micrograms of M13mpl0 vector digested with Pstl and Hindlll and 400 units ligase. This mixture was incubated for three hours at 16°C. Ten microliters of ligation reaction mixture containing Molt

4 DNA was transformed into E. coli strain JM103 competent cells, which are publicly available from BRL in Bethesda, MD. The procedure followed for preparing the transformed strain is described in Messing, J. (1981) Third Cleveland Symposium on Macro ol ecu! es: Recombinant DNA, ed. A. Walton, Elsevier, Amsterdam, 143-153.

About 40 different alleles from these two cloning procedures were sequenced. In some of the sequences determined four areas of specific DNA and protein sequence were found to occur in various

combinations and to be associated with IDDM. The DNA sequences seen in each of these segments in the genomes of IDDM patients produced an alteration in one to three amino acid residues of the DR-beta protein. These four variable segments of the DR-beta second exon, found in sequences obtained from many diabetic sources, and labeled A- D, are identified above. The regions which can be used for devising probes used for detecting such sequences are identified in Figure 5, where the amino acid abbreviations are shown in Table VIII.

II. Preparation of Primers for Detection

Ol gonucleotides designated GH46 and GH50 complementary to opposite strands of the conserved 5' and 3' ends of the DR-beta second exon were used as primers. The primers are identified in the section above.

III. Expected Amplification Reaction

One microgram of DNA from each DNA sample to be tested (10 microliters of 100 micrograms/millimeter DNA) may be amplified in an initial 100 micro! iter reaction volume containing 10 microliters of a solution containing 100 mM Tris buffer (pH 7.5), 500 mM NaCl, and 100 mM MgCl 2 . 10 microliters of 10 micromillimeter of primer GH46, 10 microliters of 10 micromillimeters of primer GH50, 15 microliters of 40 mM dNTP (contains 10 M each of dATP, dCTP, dGTP and TTP) , 10 microliters DMS0, and 45 microliters of water. Each reaction mixture is held in a heat block set at'95°C for 10 minutes to denature the DNA. Then each DNA sample undergoes 30 cycles of amplification where each cycle is- composed of four steps:

(1) spinning the sample briefly (10-20 seconds) in microcentrifuge to pellet condensation and transfer the denatured material immediately to a heat block set at 37°C for two minutes to allow primers and genomic DNA to anneal,

(2) adding 2 microliters of a solution prepared by mixing 39 microliters of the Klenow fragment of E. coli DNA Polymerase I (New England Biolabs, 5 units/micro! iters, 39 micro! iters of a salt mixture of 100 M Tris buffer (pH 7.5), 500 mM NaCl and 100 mM MgCl 2 , and 312 microliters of water,

(3) allowing the reaction to proceed for two minutes at 37°C, and

(4) transferring the samples to the 95°C heat block for two minutes to--denature the newly synthesized DNA, except this reaction was not carried out at the last cycle.

The final reaction volume is 150 microliters, and the reaction mixture is stored at -20°C.

IV. Expected Synthesis and Phosphorylation of Oligodeoxyribonucleo¬ tide Probes Two of four labeled DNA probes, designated GH54 (V— S) and

GH78 (I — DE), from Regions C and D, respectively, are employed.

These two probes are synthesized according to the procedures described above for preparing primers for cloning. The probes are labeled by contacting 10 pmole thereof with 4 units of T4 polynucleotide kinase (New England Biolabs) and about 40 pmole gamma" 32 P-ATP (New England Nuclear, about 7000 Ci/mmole) in a 40 microliter reaction volume containing 70 mM Tris buffer (pH 7.6), 10 mM MgCl 2 , 1.5 mM spermine, 100 mM dithiothreitol and water for 60

minutes at 37°C. The total volume is then adjusted to 100 microliters with 25 mM EDTA and purified according to the procedure of Maniatis et al., Molecular Cloning (1982), 466-467, over a 1 ml Bio Gel P-4 (BioRad) spin dialysis column equilibrated with Tris-EDTA (TE) buffer (10 mM Tris buffer, 0.1 mM EDTA, pH 8.0).

V. Expected Dot Blot Hybridizations Five microliters of each of the 150 microliters amplified samples from Section III was diluted with 195 microliters 0.4 N NaOH, 25 M EDTA and spotted onto three replicate Genatran 45 (PLASC0) nylon filters by first wetting the filter with water, placing it in a Bio- Dot (BioRad) apparatus for preparing dot blots which holds the filter in place, applying the samples, and rinsing each well with 0.4 ml of 20 x SSPE (3.6 M NaCl, 200 mM NaH 2 P0 4 , 20 mM EDTA), as disclosed by Reed and Mann, Nucleic Acids Research, 1_3_, 7202-7221 (1985). The filters are then removed, rinsed in 20 x SSPE, and baked for 30 minutes at 80°C in a vacuum oven. After baking, each filter is then contacted with 6 ml of a hybridization solution consisting of 5 x SSPE, 5 x Denhardt's solution (1 x = 0.02% pol vinylpyr roll " done, 0.02% Ficoll, 0.02% bovine serum

albumin, 0.2 mM Tris'HCl, 0.2 mM EDTA, pH 8.0) and 0.5% SDS and incubated for 60 minutes at 55°C. Then 5 microliters each of the probes is added to the hybridization solution and the filters are incubated for 60 minutes at 55°C. Finally, each hybridized filter is washed under stringent conditions. The genotypes are expected to be readily apparent after 90 minutes of autoradiography. The probes are expected to have reasonable specificity for the portions of the allele being detected in genomic DNA samples.

EXAMPLE II

Peptides to the amino acid sequences disclosed may be prepared as described above and used as immunogens to generate antibodies thereto, useful in immunoassays for detecting the amino acid sequence(s) in protein samples.

EXAMPLE HI .

To explore the possibility that both the DR4 and DRwδ haplotypes which are associated with Pemphigus vulgaris (PV) contain a common epitope indicative of PV susceptibility in the DR-beta-I chain, the nucleotide sequences of the polymorphic second exon of the DR-beta and DQ-beta loci from three PV patients were determined. As the HLA- DR serotypes of the three PV patients were DR4/4, DR4/5 and DR4/5, only the issue of DR4 associated PV susceptibility was explored in this example. The sequence analysis was carried out on M13 clones containing specific polymerase chain reaction (PCR) amplified fragments. See Saiki et al., Science, 230:1350-1354 (1985); and Scharf et al . , Science, 233:1076-1078 (1986) for methodology associated with PCR amplification, cloning and sequence analysis.

Sample Preparation and Ampl fication Procedures Blood samples from three Pemphigus vulgaris patients were provided by Dr. Bruce Wintroub, UCSF (California). 0.5 ml of whole blood was lysed by the addition o-f

1.5 ml of 10 mM Tris, pH 7.5, 10 mM EDTA, 100 mM NaCl, 40 mM

dithiothreitol , and 200 micrograms /mi Hi meter Proteinase K and incubated for 16 hours at 55°C. The samples were phenol extracted, phenol -CHC13 and CHC13 extracted and ethanol precipitated overnight at -20°C. The precipitated DNA was pelleted by centrifugation, washed with 70% ethanol, dried and resuspended in 100 microliters 10 mM Tris, 0.1 mM EDTA containing 100 microgram/millimeter RNAse A and incubated at 37°C for 15 minutes. The DNA samples were re-extracted with phenol -CHC13 and CHC13 to inactivate the RNAse A and ethanol precipitated, washed, and dried as described above. One microgram of intact genomic DNA was amplied by polymerase chain reaction (PCR) (Scharf et al., _id_. ) with the following changes in the reaction conditions: A.) The HLA DR-beta region genes were amplified by using 1 micromillimeter of the PCR primers GH46.and GH50 (See Figure 1 for description of primers and the HLA DR-beta target fragment); 1 unit of cloned ~^_ coli DNA polymerase 1 large fragment (Klenow fragment) was added for 20 cycles of amplification; an additional five cycles of amplification was run on the samples using 4 units of Klenow fragment B.) The HLA DQ-beta region genes amplified by using 1 micromillimeter of PCR primers GH28 and GH29: GH28 (C " CGGATCCGCATGTGCTACTTCACCAACG)

GH29 (GAGCTGCAGGTAGTTGTGTCTGCACAC) .

One unit of Klenow fragment was added for 20 cycles of amplification; an additional 8 cycles of amplification were carried out using 2 units of Klenow fragment per cycle. The DQ-beta primers produce a 238 base- pair fragment. The DR-beta primers produced a 272 bp fragment (see Figure 1). 1/10 of the PCR reactions were run on -a 4% NuSieve, 0.5% SeaKem (FMC) agarose gel and transferred to Genatran 45 nylon membrane (Scharf et al., jd_. ). The filter was prehybridized in 10 ml 5x SSPE, 4x Denhardt's and 0.5% sodium dodecyl sulfate (SDS) for 15 minutes at 37°C. The filter was hybridized with the addition of 0.1 pmole of (gamma" 32 P) ATP labeled DR-beta specific oligomer GH 22 for 16 hours at 37°C. The filter was washed in 4x SSPE, 0.1% SDS for 2.5 minutes at 30°C and for 1.5 minutes at 37°C and exposed for 16 hours at -70°C with one intensifying screen (DuPont Cronex Lightning Plus). The filter was stripped of probe by incubation for five minutes in boiling

O.lx SSPE, 0.1% SDS, dried and prehybridized in 10 ml 6x SSPE, lOx Denhardt's and 0.22 SDS at 42° C. 0.2 pmole of ( gamma " 32 P) ATP labeled DR4 DwlO sequence specific olig-omer GH78 was added to the prehybridization solution and incubated for 16 hours at 42° C. After hybridization the filter was washed in 1 x SSPE, 0.1% SDS for 10 minutes at 37°C and autoradiograpbhed.

Cloning and Segencing of HLA DR-beta and DQ-beta PCR Products. One half of each PCR reaction was ethanol precipitated, resuspended and loaded onto a 4% NuSieve, 0.5% SeaKem gel and electrophoresed at 20V- cm for one hour. Slices between 265 and 280 base-pairs for DR-beta, and between 220 and 250 base-pairs for DQ-beta, were removed from the gel and the DNA was electroeluted from the gel in 200 micro! iter 0.5x TBE buffer. The electroeluted DNA samples were dialyzed, digested with 60 units of BamHI and Pstl (New England BioLabs) for three hours at 37°C, phenol, phenol -CHC1 3 and CHC1 3 extracted and ethanol precipitated. One-fourth of the digested PCR DNA was ligated to 200 micrograms Of BamHI/ Pstl digested, dephosphor lated Ml3mpl0 and transformed into E. coli JM103 and plated onto selective media Scharf et al., jd_. ; Messing, Methods in Enzy ology, 101, pp. 20-78 (Wu et al . eds , 1983). The positive clones were identified by in -situ plaque filter hybridization using a nick-translated DR-beta cDNA probe and plaque purified, and DNA from the purified phage clones was sequenced by the chain termination method. Sanger et al., PNAS (USA), 74_:5463- 5468 (1977).

Analysis. Southern blot analysis of the PCR amplification products and of a DR-3 homozygous typing cell DNA (included as a general DR- beta amplification and hybridization control) using a DR-beta cDNA probe indicated that all four samples contained roughly equal amounts of amplified DR-beta fragment. The filter was then stripped of the nick-translated DR-beta probe and reprobed with GH78, an oligonucleotode probe specific for the DR4 subtype DwlO (see Figure 1). The GH78 sequence specific oligonucleotide (SSO) hybridizes specifically to DwlO sequences and to

none of the other DR4 subtypes (for example, Dw4, Dwl4, Dwl3 and Dwl5). The DNA amplified from all three PV patients hybridized to GH78 but not to the DR-3 homozygous control .

The amplified DNA frapents for the HLA DR-beta and DQ-beta loci were cloned and sequenced to examine in more detail polymorphisms at the level of coding sequences. Figure 2 shows amino acid alignments derived from nucleotide sequence data for HLA DR-beta loci. The DR4 DwlO subtype prototype sequence is distinguished from the DR4 Dw4 prototype sequence by the substitution of isoleucine, aspartic acid, and glutamic acid (the "I-DE" epitope) for leucine, gluta ine, and lysine at amino acid positions 68, 71, and 72, respectively. The sequence alignments show that the DR-beta-I sequences from the DR4 alleles of these three patients also contain this "I-DE" epitope). The DR-beta-I sequences from the DR5 alleles from the two DR4/5 patients (patients II and III) are identical to the DR-beta-I sequence from the DR5 prototype. Likewise, the DR-beta-III sequences from the DR5 alleles from these two DR4/5 patients are identical to the DR5 DR-beta-III prototype sequence. Patient I, who is DR4/4, has " one DwlO and one Dwl3 halotype, and the DR-beta-IV sequence from this patient is identical to the DR4-beta-IV prototype sequence. The amino acid alignments derived from the nucleotide sequence data for the HLA DQ-beta loci are shown in Figure 3. Patient I subtypes as DQB3.2 for one DR4 allele and DQB3.1 for the other DR4 allele. Patients II and III (both DR4/5) are also heterozygous for their DQ-beta loci; the DR4 allele is DQB3.2 and the DQB3.1 locus is from the DR5 haplotype. Given the DR serotypes, this heterozygosity at the DQ-beta locus is not surprising since both DQB3.1 and DQB3.2 are associated with the DR4 haplotype and DQB3.1 is associated with the DR5 halotype. The DQB3.1 and DQB3.2 sequences from all three patients are identical with their respective prototype sequences.

EXAMPLE IV

The same sequence near codon 70 described above in Example III that distinguishes DR4-DwlO from the other DR4-Dw subtypes is also present on a subset of DRw6 haplotypes. The pattern of sequence

polymorphism suggested that this shared "epitope" could be responsible for the DR4 and DRw6 disease association with PV.

DNA samples from Israeli patients, DR-matched controls, and Austrian patients were analyzed with a panel of DR-beta and DQ-beta sequence-specific oligonucleotide probes using PCR amplified DNA in a dot blot format as described above. In the analysis with DR-beta-I oligonucleotide probes, essentially all of the patient DR4 haplotypes (24/24 Israeli patients; 10/14 non-Israeli patients) had the DwlO associated epitope versus a smaller proportion of the control DR4 haplotypes (15/25 Israeli controls; 1/19 non-Israeli controls). However, the proporton of DRw6 DR-beta-I alleles that contain that epitope was lower in DRw6 PV patients (4/14) relative to controls (8/13). Therefore, it was concluded that if the DR4 susceptibility to PV can be attributed to a specific DR-beta-I allele, then the DRw6 susceptibility must be accounted for by a different sequence.

EXAMPLE V

PCR Amplification of DQ-beta Genomic Sequences. To obtain the results shown in Figure 4, 1 micro gram of genomic DNA from each of the various cell lines shown in the left-hand side of Figure 4 was amplified by PCR (Saiki et al., Nature, 324:163-166 (1986)) for 28 cycles with the Klenow fragment of _E_ ; _ coli DNA polymerase. The amplification primers GH28 and GH29 (see Example IV) were used to amplify a 238-bp segment of the DQ-beta gene. The degree and specificity of amplification were monitored by agarose gel electrophoresis, blotting, and hybridization with 32 P-labeled DQ-beta cDNA probes (Scharf et al., supra). The sources of the IDDM patients are described in Arnheim et al . , supra.

PCR Cloning into M13mpl0. The amplified DNA was extracted once with an equal volume of TE-saturated phenol, followed by two extractions with a phenol /chloroform mixture. The DNA was then diluted to 2 ml with sterile water, and dialyzed and concentrated by centrifugation through a Centricon 10 column (Am con) at 5000 x g for 60 minutes at room temperature. In the case of DQ-beta amplification,

where the PCR primers produce a number of non-HLA amplification products, the target band was first cut out of an agarose gel and electroeluted at 100 volts for one hour in 0.5xTBE. The DNA was diluted to 100 microliters and digested for two hours at 37°C with 40 units each of BamHI and Pstl. After digestion, the DNA was again phenol/chloroform extracted, dialyzed, and concentrated. It was then li gated into the M13mpl0 vector by a modification of the PCR-cloning procedure (Scharf et al., supra). The M13 plaques were initially screened for inserts by hybridization with either DQ-alpha or DQ-beta cDNA probes, and occasionally with allele-specific oligonucleotide (ASO) probes (Saiki et al . , supra). These clones were also screened for the amplification products of the DXal pha or DX-beta genes by the use of ASO probes. The single-strand phage DNA was isolated by standard techniques, and sequenced by the dideoxynucleotide primer extension method Smith, Meth. Enzymol . , _65_:560-580 (1980).

• EXAMPLE VI

Viral Sequence Homology

The sequence of the 172282-bp genome of strain B95-8 of Epstein-Barr virus (Baer et al . , supra) was translated by computer into protein sequences for all six of the forward and reverse phases. These translations were then searched for exact matches of five or more amino acids with polypeptide sequences centered at position 57 of the DQ-beta alleles (see Table VII). The location and size of the open reading frame (ORF) for each match was then determined, and correlated with -the major ORF's and transcriptional segments of the EBV genome (Baer et al , supra). One method of judging the significance of these homologies is by estimating their chance of occurring at random. In the six translation phases of EBV, the amino acid residues near position 57 of the DQ-beta alleles occur at fractions: A = 0.0804, D = 0.0258, E = 0.0337, G = 0.1089, L = 0.0890, P = 0.1089, R = 0.0962, and Y = 0.0141. By multiplying these frequencies ' together, the chances of random occurrence for the polypeptide epi topes are: "RPDAE" = 1/1,370,000, "GLPAA" = 1/147,000, "PAAEY" = 1/2,990,000, "GPPAA" = 1/120,000 and "PPAAEY" =

1/27,400,000. Thus, since there are 344,560 residues in all six of the translations, we would expect several random occurrences for the "GPPAA" and "GLPAA" epi topes, but none for the others. The excess number of matches and their correlation with major ORF's and with repeated segments of the genome contribute to their significance. By statistical probabilities alone, the six-residue match "PPAAEY" is particularly unlikely to have occurred at random.

Other modifications of the above described embodiments of the invention that are obvious to those skilled in the area of molecular and clinical biology and diagnostics and related disciplines are intended to be within the scope of the following claims.