Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
MARKERS RELATED TO AGE-RELATED MACULAR DEGENERATION AND USES THEREFOR
Document Type and Number:
WIPO Patent Application WO/2012/082912
Kind Code:
A2
Abstract:
Methods are provided of screening for age-related macular degeneration (AMD), including a risk of a subject developing AMD or a risk of a subject progressing to an advanced form of AMD. The methods can include analyzing a sample obtained from the subject for the presence of at least one single nucleotide polymorphism (SNP) selected from the group consisting of rs4711751, rs6982567, rsl999930, rsl3278062, rsl912795, rs2270637, rsl2040406, rsl367068, rsl079982, rsl443179, rs7720497, and/or rs61800454.

Inventors:
SEDDON JOHANNA M (US)
DALY MARK (US)
Application Number:
PCT/US2011/064936
Publication Date:
June 21, 2012
Filing Date:
December 14, 2011
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
TUFTS MEDICAL CT INC (US)
GEN HOSPITAL CORP (US)
SEDDON JOHANNA M (US)
DALY MARK (US)
International Classes:
C12Q1/68; C12N15/09; G01N33/50
Attorney, Agent or Firm:
CULVERWELL, James A. et al. (State Street Financial CenterOne Lincoln Stree, Boston MA, US)
Download PDF:
Claims:
CLAIMS

1. A method of screening for age-related macular degeneration (AMD) in a human subject, the method comprising:

determining a risk of a subject developing AMD by analyzing a sample obtained from the subject for the presence of at least one single nucleotide polymorphism (SNP) in the alpha chain of type X collagen (COL10A1) gene region, the at least one SNP comprising rsl999930 or a proxy for rsl999930, the presence of a SNP being indicative of an increased risk of the subject developing AMD.

2. A method of screening for age-related macular degeneration (AMD) in a human subject, the method comprising:

determining a risk of AMD progression in a subject by analyzing a sample obtained from the subject for the presence of at least one single nucleotide polymorphism (SNP) in the alpha chain of type X collagen (COL10A1) gene region, the at least one SNP comprising rsl999930 or a proxy for rsl999930, the presence of a SNP being indicative of an increased risk of the subject developing an advanced form of AMD.

3. A method of screening for age-related macular degeneration (AMD) in a human subject, the method comprising:

determining a risk of a subject developing AMD by analyzing a sample obtained from the subject for the presence of at least one single nucleotide polymorphism (SNP) in the vascular endothelial growth factor A (VEGFA) gene region, the at least one SNP comprising rs4711751 or a proxy for rs4711751, the presence of a SNP being indicative of an increased risk of the subject developing AMD.

4. A method of screening for age-related macular degeneration (AMD) in a human subject, the method comprising: determining a risk of AMD progression in a subject by analyzing a sample obtained from the subject for the presence of at least one single nucleotide polymorphism (SNP) in the vascular endothelial growth factor A (VEGFA) gene region, the at least one SNP comprising rs4711751 or a proxy for rs4711751, the presence of a SNP being indicative of an increased risk of the subject developing an advanced form of AMD.

5. A method of screening for age-related macular degeneration (AMD) in a human subject, the method comprising:

determining a risk of a subject developing AMD by analyzing a sample obtained from the subject for the presence of at least one single nucleotide polymorphism (SNP) selected from the group consisting of rs4711751, rsl999930, rsl3278062, rsl912795, rs2270637, rs6982567, rsl2040406, rsl367068, rsl079982, rsl443179, rs7720497, rs61800454, or a proxy therefor, the presence of a SNP being indicative of an increased risk of the subject developing AMD or developing an advanced form of AMD.

6. A method of screening for age-related macular degeneration (AMD) in a human subject, the method comprising:

determining a risk of AMD progression in a subject by analyzing a sample obtained from the subject for the presence of at least one single nucleotide polymorphism (SNP) identified in Tables 3-9, or a proxy therefor, the presence of a SNP being indicative of an increased risk of the subject developing AMD or developing an advanced form of AMD.

7. The method of claim 1 wherein the analyzing comprises (i) combining a nucleic acid sample from the subject with one or more polynucleotide probes capable of hybridizing selectively to a COL10A1 gene allele, or a proxy therefor, and (ii) detecting the presence or absence of hybridization.

8. The method of claim 6 wherein the analyzing comprises (i) combining a nucleic acid sample from the subject with one or more polynucleotide probes capable of hybridizing selectively to a VEGFA gene allele, or a proxy therefor, and (ii) detecting the presence or absence of hybridization.

9. The method of claim 7 or 8 wherein the probes are oligonucleotides capable of priming polynucleotide synthesis in an amplification reaction.

10. The method of any one of claims 1-6, wherein the subject is asymptomatic at the time of screening.

11. The method of any one of claims 1-6, wherein the SNP is detected in a

haplotype comprising the SNP.

12. The method of any one of claims 1-6, comprising screening for a specific subtype of AMD.

13. The method of claim 12, wherein the subtype is selected from the group consisting of early AMD, geographic atrophy, exudative AMD (CNV or neovascular disease), and combinations thereof.

14. The method of any one of claims 1-6, wherein the presence of at least one SNP is determined using a microarray.

15. The method of any one of claims 1-6, wherein the presence of at least one SNP is determined by sequencing.

16. The method of any one of claims 1-8, wherein the proxy is in linkage

disequilibrium with the SNP.

17. A diagnostic system comprising:

an array of polynucleotides comprising one or more of SEQ ID NOS: l-15, the polynucleotides comprising at least six or more contiguous nucleotides, and the polynucleotides comprising an allelic polymorphism, an array reader, an image processor, a database having AMD allelic data records and patient information records, a processor, and an information output,

wherein the system compiles and processes patient data and outputs information relating to the statistical probability of the patient developing AMD.

18. A method of using the diagnostic system of claim 17, comprising contacting a subject sample or portion thereof to the diagnostic array under high stringency hybridization conditions; inputting patient information into the system; and obtaining from the system information relating to the statistical probability of the patient developing AMD.

19. A method for diagnosing risk of AMD or severe forms of AMD in a human subject, the method comprising combining genetic risk with behavioral risk, wherein the genetic risk is determined by detecting in a sample obtained from a subject the presence or absence of a single nucleotide polymorphism SNP listed in Tables 3, 4, 5, 6, 7, 8, 9, or 10, or proxy therefor, wherein the presence of the allele is indicative of an increased risk of the subject developing AMD or a severe form of AMD.

20. The method of claim 19, wherein a behavioral risk is assessed by

determining if the subject exhibits a behavior or trait selected from the group consisting of: obesity, smoking, vitamin and dietary supplement intake, use of alcohol or drugs, poor diet, a sedentary lifestyle, medical history of heart disease or other vascular disease, and medical history of kidney or liver disease.

Description:
Attorney Docket No. TMC-467PC

MARKERS RELATED TO AGE-RELATED MACULAR DEGENERATION AND USES THEREFOR

REFERENCE TO RELATED APPLICATIONS

[0001] This application claims priority to and the benefit of U.S. Provisional Application No. 61/422,905, filed on December 14, 2010; U.S. Provisional Application No. 61/444,482, filed on February 18, 2011 ; and U.S. Provisional Application No. 61/529,817, filed on August 31 , 201 1. The entire disclosure of each of the above-identified applications is incorporated by reference.

STATEMENT REGARDING FEDERALLY FUNDED RESEARCH

[0002] This invention was made with government support under grant number ROl EY1 1309 awarded by the National Institutes of Health and the National Eye Institute. The government has certain rights in the invention.

BACKGROUND

[0003] Age-related macular degeneration (AMD) is the most common geriatric eye disorder leading to blindness. Macular degeneration is responsible for visual handicap in what is estimated conservatively to be approximately 16 million individuals worldwide. Among the elderly, the overall prevalence is estimated between 5.7% and 30% depending on the definition of early AMD, and its differentiation from features of normal aging, a distinction that remains poorly understood.

[0004] Histopathologically, the hallmark of early neovascular AMD is accumulation of extracellular drusen and basal laminar deposit (abnormal material located between the plasma membrane and basal lamina of the retinal pigment epithelium) and basal linear deposit (material located between the basal lamina of the retinal pigment epithelium and the inner collageneous zone of Bruch's membrane). The end stage of AMD is characterized by a complete degeneration of the neurosensory retina and of the underlying retinal pigment epithelium in the macular area. Advanced stages of AMD can be subdivided into geographic atrophy and exudative AMD. Geographic atrophy is characterized by progressive atrophy of the retinal pigment epithelium. In exudative AMD the key phenomenon is the occurrence of choroidal neovascularisation (CNV). Eyes with CNV have varying degrees of reduced visual acuity, depending on location, size, type and age of the neovascular lesion. The development of choroidal neovascular membranes can be considered a late complication in the natural course of the disease possibly due to tissue disruption (Bruch's membrane) and decompensation of the underlying longstanding processes of AMD.

[0005] Many pathophysiological aspects as well as vascular and environmental risk factors are associated with a progression of the disease. Family, twin, segregation, and case-control studies all suggested an involvement of genetic factors in the etiology of AMD prior to the discovery of various genes associated with AMD.

[0006] Knowledge is growing about the extent of heritability, number of genes involved, and mechanisms underlying phenotypic heterogeneity. The search for genes and markers related to AMD faces challenges— onset is late in life, and there is usually only one generation available for studies. The parents of patients are often deceased, and their children are too young to manifest the disease. Generally, the heredity of late-onset diseases has been difficult to estimate because of the uncertainties of the diagnosis in previous generations and the inability to diagnose AMD among the children of an affected individual. Even in the absence of the ambiguities in the diagnosis of AMD in previous generations, the late onset of the condition itself, natural death rates, and small family sizes result in underestimation of genetic forms of AMD, and in overestimation of rates of sporadic disease.

Moreover, the phenotypic variability is considerable, and it is conceivable that the currently used diagnostic entity of AMD in fact represents a spectrum of underlying conditions with various genetic and environmental factors involved.

|0007] There remains a strong need for improved methods of diagnosing or prognosticating AMD or a susceptibility to AMD in subjects, as well as for evaluating and developing new methods of treatment. SUMMARY

[0008] The application relates, in part, to the identification of numerous genetic markers which are associated with the presence or progression of age-related macular degeneration (AMD) in an individual. More specifically, methods are provided for diagnosing a risk of an individual developing AMD or progressing to advanced forms of AMD (e.g., geographic atrophy and/or wet AMD) using these genetic markers.

[0009] For example, in one aspect the invention provides a method of screening for age-related macular degeneration (AMD) in a human subject. The method can include determining a risk of AMD progression in the subject by analyzing a sample obtained from the subject for the presence in the subject's genome of at least one single nucleotide polymorphism (SNP) identified in Tables 3-10, or a proxy therefor. In some embodiments, a proxy is a marker that is in linkage disequilibrium with a particular SNP or marker of interest. The presence of a SNP indicates that the subject has an increased risk of developing AMD or developing an advanced form of AMD. The markers can be used individually or in combination when screening a subject. Preferred SNPs include, but are not limited to, rs471 1751 (VEGF), rsl 999930 (COL10A1/FRK), rsl3278062 (TNFRSF10A), rsl 912795 (B3GALTL), rs2270637 (SLC 18A1), rs6982567 (GDF6), rs 12040406 and rs 1367068 (CD55), rs 1079982 (CARD 10), rsl443179 (INTU), rs7720497 (ADAMTS 16), and rs61800454 (TMCOl). In some embodiments, the presence of a particular SNP indicates the subject has an increased risk of developing AMD. In some

embodiments, the presence of a particular SNP indicates the subject has an increased risk of developing an advanced form of AMD, such as geographic atrophy and/or wet AMD, which also is referred to as neovascular disease, choroidal

neovascularisation (CNV), and exudative AMD.

[0010] Various techniques can be used for analyzing a sample to determine the presence of a SNP in the subject's genome. For example, in some embodiments, the method of screening can include the steps of (i) combining a nucleic acid sample from the subject with one or more polynucleotide probes capable of hybridizing selectively to a particular SNP (e.g., any SNP identified in Tables 3-10) or gene allele, or a proxy therefor, and (ii) detecting the presence or absence of hybridization. The probes can be oligonucleotides capable of priming

polynucleotide synthesis in an amplification reaction, such as PCR or real time PCR. In some embodiments, the presence of at least one SNP is determined using a microarray. In various embodiments, the presence of at least one SNP is determined by sequencing a portion of the patient's genome.

[0011] In some embodiments, the patient is asymptomatic at the time of screening for AMD, and in some embodiments, the patient displays one or more AMD like symptoms at the time of screening.

[0012] In some embodiments, the method includes detecting a haplotypes that includes a particular SNP (e.g., any SNP listed in Tables 3-10).

[0013] In some embodiments, the method includes screening for a specific subtype of AMD, such as, for example, early AMD, geographic atrophy, wet AMD, neovascular disease, choroidal neovascularisation (CNV), exudative AMD, and combinations thereof.

[0014] The invention also provides, in part, a diagnostic system. The diagnostic system can include an array of polynucleotides comprising one or more of SEQ ID NOS: l-15, or any reference sequences corresponding to the SNPs identified in Tables 2-10. The polynucleotides can include at least six or more contiguous nucleotides, and the polynucleotides can include an allelic polymorphism or SNP. The system also can include an array reader, an image processor, a database having AMD allelic data records and patient information records, a processor, and an information output. The system compiles and processes patient data and outputs information relating to the statistical probability of the patient developing AMD.

[0015] The system can be used for various methods, including contacting a subject sample or portion thereof to the diagnostic array under high stringency hybridization conditions; inputting patient information into the system; and obtaining from the system information relating to the statistical probability of the patient developing AMD.

[0016] Further provided are methods for diagnosing risk of AMD or severe forms of AMD in a human subject. The method includes combining genetic risk with behavioral risk, wherein the genetic risk is determined by detecting in a sample obtained from a subject the presence or absence of a single nucleotide polymorphism SNP listed in Tables 3, 4, 5, 6, 7, 8, 9, or 10, or proxy therefor, wherein the presence of the allele is indicative of an increased risk of the subject developing AMD or a severe form of AMD. In various embodiments, behavioral risk is assessed by determining if the subject exhibits a behavior or trait selected from: obesity, smoking, vitamin and dietary supplement intake, use of alcohol or drugs, poor diet, a sedentary lifestyle, medical history of heart disease or other vascular disease, and medical history of kidney or liver disease.

BRIEF DESCRIPTION OF DRAWINGS

[0017] The figures are not necessarily to scale, emphasis instead generally being placed upon illustrative principles. The figures are to be considered illustrative in aspects and are not intended to limit the invention, the scope of which is defined only by the claims.

[0018] FIGS. 1 and 2 are nucleic acid sequences of VEGFA and GDF6 SNPs, respectively, in accordance with an illustrative embodiment.

[0019] FIGS. 3A and 3B are graphs showing a preliminary x 2 association analys in accordance with an illustrative embodiment.

[0020] FIG. 4 is a graph showing 80% power to detect a biallelic CNV, in accordance with an illustrative embodiment.

[0021] FIGS. 5.1 and 5.2 are nucleic acid sequences of various SNPs, in accordance with an illustrative embodiment.

(0022] FIGS. 6a-d show the FRK/COL 1 OA 1 region and VEGFA region, and association with AMD, in accordance with an illustrative embodiment.

[0023] FIG. 7 shows distribution of genetic ancestry estimated by EIGENSOFT, accordance with an illustrative embodiment.

[0024] FIG. 8 shows quantile-quantile (Q;Q) plots, in accordance with an illustrative embodiment.

[0025] FIG. 9 shows a Manhattan-Plot, in accordance with an illustrative embodiment. DETAILED DESCRIPTION

[0026] The present invention relates, in part, to the discovery that particular alleles at polymorphic sites associated with genes, including alpha chain of type X collagen (COL10A1), vascular endothelial growth factor A (VEGFA) and

growth/differentiation factor 6 (GDF6) are useful as markers for AMD etiology, for determining susceptibility to AMD, and for predicting or monitoring disease progression or severity, e.g., to determine a treatment course and/or to titrate dosages of therapeutic agents. More specifically, and by non-limiting example, the single nucleotide polymorphisms (SNPs) rs471 1751 in the VEGFA gene and rs6982567 in the GDF6 gene can be used as markers for AMD etiology, for determining susceptibility to AMD, and for predicting disease progression or severity, and for distinguishing risk of geographic atrophy, the advanced dry type of AMD from the advanced wet form of AMD. In addition, Tables 3, 4, 5, 6, 7, 8, 9, and 10 list additional polymorphisms that are also useful as such markers.

Furthermore, genes and/or markers in linkage disequilibrium with these SNPs provide additional such markers.

[0027] As used herein, "gene" is a term used to describe a genetic element that gives rise to expression products (e.g., pre-mRNA, mRNA and polypeptides). A gene can include regulatory elements, exons and sequences that otherwise appear to have only structural features, e.g., introns and untranslated regions.

[0028] The genetic markers disclosed herein are particular "alleles" at

"polymorphic sites" associated with various genes, including VEGFA, GFD6, and any markers identified in tables 3-10. A nucleotide position at which more than one nucleotide can be present in a population (either a natural population or a synthetic population, e.g., a library of synthetic molecules), is referred to herein as a

"polymorphic site". Where a polymorphic site is a single nucleotide in length, the site is referred to as a single nucleotide polymorphism ("SNP"). If at a particular chromosomal location, for example, one member of a population has an adenine and another member of the population has a thymine at the same genomic position, then this position is a polymorphic site, and, more specifically, the polymorphic site is a SNP. Polymorphic sites can allow for differences in sequences based on substitutions, insertions or deletions. Each version of the sequence with respect to the polymorphic site is referred to herein as an "allele" of the polymorphic site. Thus, in the previous example, the SNP allows for both an adenine allele and a thymine allele.

[0029] A genetic marker is "associated" with a genetic element or phenotypic trait, for example, if the marker is co-present with the genetic element or phenotypic trait at a frequency that is higher than would be predicted by random assortment of alleles (based on the allele frequencies of the particular population). Association also indicates physical association, e.g., proximity in the genome or presence in a haplotype block, of a marker and a genetic element.

[0030] A reference sequence is typically referred to for a particular genetic element, e.g. , a gene. The reference sequence, often chosen as the most frequently occurring allele, is referred to as a "wild type" allele or the "major allele"). Alleles that are more common or less common in individuals with a disease/trait compared to individuals without the disease/trait, with a certain level of statistical significance, are referred to as the variant alleles. The corresponding genotype is referred to as a genetic variant.

[0031] Some variant alleles can include changes that affect a polypeptide or protein, e.g., the polypeptide encoded by a variant allele. These sequence differences, when compared to a reference nucleotide sequence, can include, for example, the insertion or deletion of a single nucleotide, or of more than one nucleotide, resulting in a frame shift; the change of at least one nucleotide, resulting in a change in the encoded amino acid; the change of at least one nucleotide, resulting in the generation of a premature stop codon; the deletion of several nucleotides, resulting in a deletion of one or more amino acids encoded by the nucleotides; the insertion of one or several nucleotides, such as by unequal recombination or gene conversion, resulting in an interruption of the coding sequence of a reading frame; duplication of all or a part of a sequence; transposition; or a rearrangement of a nucleotide sequence.

[0032] Alternatively, a polymorphism associated with AMD or a susceptibility to AMD can be a synonymous change in one or more nucleotides (i.e., a change that does not result in a change to a codon of a complement pathway gene). Such a polymorphism can, for example, alter splice sites, affect the stability or transport of mRNA, or otherwise affect the transcription or translation of the polypeptide. The polypeptide encoded by the reference nucleotide sequence is the "reference" polypeptide with a particular reference amino acid sequence, and- polypeptides encoded by variant alleles are referred to as "variant" polypeptides with variant amino acid sequences.

[0033] A haplotype is a combination or set of genetic markers, e.g., particular alleles at polymorphic sites, such as, e.g., SNPs and/or microsatellites. The haplotypes described herein are associated with AMD and/or a susceptibility to AMD. Detection of the presence or absence of the haplotypes herein, therefore, is indicative of AMD, is indicative of a susceptibility to AMD, is indicative of a factor related to progression from early to intermediate or late stages of AMD, is indicative of progression from intermediate to late stages of AMD, or is indicative of a lack of AMD. Detecting haplotypes, therefore, can be accomplished by methods known in the art for detecting sequences at polymorphic sites.

[0034] "Linkage" refers to a higher than expected statistical association of genotypes and/or phenotypes with each other. Linkage disequilibrium ("LD") refers to a non-random assortment of two genetic elements. If a particular genetic element (e.g., an allele at a polymorphic site), for example, occurs in a population at a frequency of 0.25 and another occurs at a frequency of 0.25, then the predicted occurrence of a person's having both elements is 0.125, assuming a random distribution of the elements. If, however, it is discovered that the two elements occur together at a frequency higher than 0.125, then the elements are said to be in LD since they tend to be inherited together at a higher frequency than what their independent allele frequencies would predict. Roughly speaking, LD is generally correlated with the frequency of recombination events between the two elements. Allele frequencies can be determined in a population, for example, by genotyping individuals in a population and determining the occurrence of each allele in the population. For populations of diploid individuals, e.g., human populations, individuals will typically have two alleles for each genetic element (e.g., a marker or gene).

[0035] The invention is also directed to markers identified in a "haplotype block" or "LD block". These blocks are defined either by their physical proximity to a genetic element, e.g. , a VEGFA, GDF6, or the other markers provided herein, or by their "genetic distance" from the element. Markers and haplotypes identified in these blocks, because of their association with AMD and VEGFA, GDF6, or the markers identified herein, are encompassed by the invention. One of skill in the art will appreciate regions of chromosomes that recombine infrequently and regions of chromosomes that are "hotspots", e.g. , exhibiting frequent recombination events, are descriptive of LD blocks. Regions of infrequent recombination events bounded by hotspots will form a block that will be maintained during cell division. Thus, identification of a marker associated with a phenotype, wherein the marker is contained within an LD block, identifies the block as associated with the phenotype. Any marker identified within the block can therefore be used to indicate the phenotype.

[0036] Additional markers that are in LD with the markers of the invention or haplotypes are referred to herein as "surrogate" markers (i.e., "proxy" markers). Such a surrogate is a marker for another marker or another surrogate marker.

Surrogate markers are themselves markers and are indicative of the presence of another marker, which is in turn indicative of either another marker or an associated phenotype.

[0037] Susceptibility for developing AMD includes an asymptomatic patient showing increased risk to develop AMD, and a patient having early or intermediate stages of AMD indicating a progression toward more advanced forms of AMD and expected visual loss. Susceptibility for not developing AMD includes an asymptomatic patient having at least one wild type allele, or a non-risk genotype, or a protective genotype, or a non-risk allele, or a protective allele, or a non-risk haplotype, or a protective haplotype indicates a lack of a predisposition for developing AMD.

[0038] Genetic markers (e.g., SNPs) can be detected in nucleic acids (e.g., DNA or mRNA) in any suitable sample source obtained or taken from an individual, including blood, saliva, feces, bone, epithelial cells, endothelial cells, blood cells, and other bodily fluids, cells, and/or tissues.

[0039] Table 10 lists representative markers which are associated with AMD. These markers, or markers in linkage disequilibrium with these markers (e.g., R squared = 0.2 or higher), can be used as markers for AMD etiology, for determining susceptibility to AMD, and for predicting disease progression or severity, and for distinguishing risk of geographic atrophy, the advanced dry type of AMD from the advanced wet form of AMD.

Diagnostic Gene Array

[0040] In one aspect, the invention comprises an array of gene fragments, particularly nucleic acids including one or more SNPs given as SEQ ID NOS: l-15 and/or sequences including the SNPs identified in Tables 3, 4, 5, 6, 7, 8, 9, or 10 and probes for detecting the allele at the SNPs of one or more of SEQ ID NOS: l-15 and/or sequences including the SNPs identified in Tables 3, 4, 5, 6, 7, 8, 9, and 10. Polynucleotide arrays provide a high throughput technique that can assay a large number of polynucleotide sequences in a single sample. This technology can be used, for example, as a diagnostic tool to assess the risk potential of developing AMD using the SNPs and probes of the invention. Polynucleotide arrays (for example, DNA or RNA arrays), include regions of usually different sequence polynucleotides arranged in a predetermined configuration on a substrate, at defined x and y coordinates. These regions (sometimes referenced as "features") are positioned at respective locations ("addresses") on the substrate. The arrays, when exposed to a sample, will exhibit an observed binding pattern. This binding pattern can be detected upon interrogating the array. For example, all polynucleotide targets (for example, DNA) in the sample can be labeled with a suitable label (such as a fluorescent compound), and the fluorescence pattern on the array accurately observed following exposure to the sample. Assuming that the different sequence polynucleotides were correctly deposited in accordance with the predetermined configuration, then the observed binding pattern will be indicative of the presence and/or concentration of one or more polynucleotide components of the sample.

[0041] Arrays can be fabricated by depositing previously obtained biopolymers onto a substrate, or by in situ synthesis methods. The substrate can be any supporting material to which polynucleotide probes can be attached, including but not limited to glass, nitrocellulose, silicon, and nylon. Polynucleotides can be bound to the substrate by either covalent bonds or by non-specific interactions, such as hydrophobic interactions. The in situ fabrication methods include those described in U.S. Pat. No. 5,449,754 for synthesizing peptide arrays, and in U.S. Pat. No.

6,180,351 and WO 98/41531 and the references cited therein for synthesizing polynucleotide arrays. Further details of fabricating biopolymer arrays are described in U.S. Pat. No. 6,242,266; U.S. Pat. No. 6,232,072; U.S. Pat. No. 6,180,351 ; U.S. Pat. No. 6,171,797; EP No. 0 799 897; PCT No. WO 97/29212; PCT No. WO 97/27317; EP No. 0 785 280; PCT No. WO 97/02357; U.S. Pat. Nos. 5,593,839; 5,578,832; EP No. 0 728 520; U.S. Pat. No. 5,599,695; EP No. 0 721 016; U.S. Pat. No. 5,556,752; PCT No. WO 95/22058; and U.S. Pat. No. 5,631,734. Other techniques for fabricating biopolymer arrays include known light directed synthesis techniques. Commercially available polynucleotide arrays, such as Affymetrix GeneChip™, can also be used. Use of the GeneChip™, to detect gene expression is described, for example, in Lockhart et al., Nat. BiotechnoL, 14: 1675, 1996; Chee et al., Science, 274:610, 1996; Hacia et al, Nat. Gen., 14:441, 1996; and Kozal et al., Nat. Med., 2:753, 1996. Other types of arrays are known in the art, and are sufficient for developing an AMD diagnostic array of the present invention. [0042] To create the arrays, single-stranded polynucleotide probes can be spotted onto a substrate in a two-dimensional matrix or array. Each single-stranded polynucleotide probe can comprise at least 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, or 30 or more contiguous nucleotides selected from the nucleotide sequences shown in SEQ ID NO: 1-15 and/or sequences including the SNPs identified in Tables 3, 4, 5, 6, 7, 8, 9, and 10, or the complement thereof. Preferred arrays comprise at least one single-stranded polynucleotide probe comprising at least 6, 7, 8, 9, 10, 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, or 30 or more contiguous nucleotides selected from the nucleotide sequences shown in SEQ ID NO: 1 -15, and/or sequences including the SNPs identified in Tables 3, 4, 5, 6, 7, 8, 9, and 10, or the complement thereof.

[0043] Tissue samples from a subject can be treated to form single-stranded polynucleotides, for example by heating or by chemical denaturation, as is known in the art. The single-stranded polynucleotides in the tissue sample can then be labeled and hybridized to the polynucleotide probes on the array. Detectable labels that can be used include but are not limited to radiolabels, biotinylated labels, fluorophors, and chemiluminescent labels. Double stranded polynucleotides, comprising the labeled sample polynucleotides bound to polynucleotide probes, can be detected once the unbound portion of the sample is washed away. Detection can be visual or with computer assistance. Preferably, after the array has been exposed to a sample, the array is read with a reading apparatus (such as an array "scanner") that detects the signals (such as a fluorescence pattern) from the array features. Such a reader preferably would have a very fine resolution (for example, in the range of five to twenty microns) for an array .having closely spaced features.

[0044] The signal image resulting from reading the array can then be digitally processed to evaluate which regions (pixels) of read data belong to a given feature as well as to calculate the total signal strength associated with each of the features. The foregoing steps, separately or collectively, are referred to as "feature extraction" (U.S. Pat No. 7,206,438). Using any of the feature extraction techniques so described, detection of hybridization of a patient derived polynucleotide sample with one of the AMD markers on the array given as SEQ ID NO: 1-15 and/or sequences including the SNPs identified in Tables 3, 4, 5, 6, 7, 8, 9, and 10 identifies that subject as having or not having a genetic risk factor for AMD, as described above.

System for Analyzing Patient Data

[0045] In another aspect, the invention provides a system for compiling and processing patient data, and presenting a risk profile for developing AMD or for the progression to late stages. A computer aided medical data exchange system is preferred. The system is designed to provide high-quality medical care to a patient by facilitating the management of data available to care providers. The care providers will typically include physicians, surgeons, nurses, clinicians, various specialists, and so forth. It should be noted, however, that while general reference is made to a clinician in the present context, the care providers may also include clerical staff, insurance companies, teachers and students, and so forth. The system provides an interface, which allows the clinicians to exchange data with a data processing system. The data processing system is linked to an integrated knowledge base and a database.

[0046) The database may be software-based, and includes data access tools for drawing information from the various resources as described below, or coordinating or translating the access of such information. In general, the database will unify raw data into a useable form. Any suitable form may be employed, and multiple forms may be employed, where desired, including hypertext markup language (HTML) extended markup language (XML), Digital Imaging and Communications in Medicine (DICOM), Health Level Seven™ (HL7), and so forth. In the present context, the integrated knowledge base is considered to include any and all types of available medical data that can be processed by the data processing system and made available to the clinicians for providing the desired medical care. In general, data within the resources and knowledge base are digitized and stored to make the data available for extraction and analysis by the database and the data processing system. Even where more conventional data gathering resources are employed, the data is placed in a form that permits it to be identified and manipulated in the various types of analyses performed by the data processing system. [0047] The integrated knowledge base is intended to include one or more repositories of medical-related data in a broad sense, as well as interfaces and translators between the repositories, and processing capabilities for carrying out desired operations on the data, including analysis, diagnosis, reporting, display and other functions. The data itself may relate to patient-specific characteristics as well as to non-patient specific information, as for classes of persons, machines, systems and so forth. Moreover, the repositories may include devoted systems for storing the data, or memory devices that are part of disparate systems, such as imaging systems. As noted above, the repositories and processing resources making up the integrated knowledge base may be expandable and may be physically resident at any number of locations, typically linked by dedicated or open network links. Furthermore, the data contained in the integrated knowledge base may include both clinical data (e.g., data relating specifically to a patient condition) and non-clinical data. Examples of preferred clinical data include patient medical histories, patient serum, plasma, and/or other biomarkers such as blood levels of certain other nutrients, fats, female and male hormones, etc., and cellular antioxidant levels, and the identification of past or current environmental, lifestyle and other factors that predispose a patient to develop AMD. These include but are not limited to various risk factors such as obesity, smoking, vitamin and dietary supplement intake, use of alcohol or drugs, poor diet, a sedentary lifestyle, medical history of heart disease or other vascular disease, and/or medical history of kidney or liver disease. Non-clinical data may include more general information about the patient, such as residential address, data relating to an insurance carrier, and names and addresses or phone numbers of significant or recent practitioners who have seen or cared for the patient, including primary care physicians, specialists, and so forth.

[0048] The flow of information can include a wide range of types and vehicles for information exchange. In general, the patient can interface with clinicians through conventional clinical visits, as well as remotely by telephone, electronic mail, forms, and so forth. The patient can also interact with elements of the resources via a range of patient data acquisition interfaces that can include conventional patient history forms, interfaces for imaging systems, systems for collecting and analyzing tissue samples, body fluids, and so forth. Interaction between the clinicians and the interface can take any suitable form, depending upon the nature of the interface. Thus, the clinicians can interact with the data processing system through conventional input devices such as keyboards, computer mice, touch screens, portable or remote input and reporting devices. The links between the interface, data processing system, the knowledge base, the database and the resources typically include computer data exchange interconnections, network connections, local area networks, wide area networks, dedicated networks, virtual private network, and so forth.

[0049] In general, the resources can be patient-specific or patient-related, that is, collected from direct access either physically or remotely (e.g., via computer link) from a patient. The resource data can also be population-specific so as to permit analysis of specific patient risks and conditions based upon comparisons to known population characteristics. It should be noted that the resources can generally be thought of as processes for generating data. While many of the systems and resources will themselves contain data, these resources are controllable and can be prescribed to the extent that they can be used to generate data as needed for appropriate treatment of the patient. Exemplary controllable and prescribable resources include, for example, a variety of data collection systems designed to detect physiological parameters of patients based upon sensed signals. Such electrical resources can include, for example, electroencephalography resources (EEG), electrocardiography resources (ECG), electromyography resources (EMG), electrical impedance tomography resources (EIT), nerve conduction test resources, electronystagmography resources (ENG), and combinations of such resources. Various imaging resources also can be controlled and prescribed as necessary. Exemplary eye tests include, for example, electrophysiologic tests,

elcetroretinograms, electrooculagrams, retinal angiography, retinal photography, ultrasonography, optical coherence tomography, and other imaging modalities such as autofluorescence. A number of modalities of such resources are currently available, such as, for example, X-ray imaging systems, magnetic resonance (MR) imaging systems, computed tomography (CT) imaging systems, positron emission tomography (PET) systems, fluorography systems, sonography systems, infrared imaging systems, nuclear imaging systems, thermoacoustic systems, and so forth. Imaging systems can draw information from other imaging systems, electrical resources can interface with imaging systems for direct exchange of information (such as for timing or coordination of image data generation, and so forth).

[0050] In addition to such electrical and highly automated systems, various resources of a clinical and laboratory nature can be accessible. Such resources may include blood, urine, saliva and other fluid analysis resources, including

gastrointestinal, reproductive, urological, nephrological (kidney function), and cerebrospinal fluid analysis system. Such resources can further include polymerase (PCR) chain reaction analysis systems, genetic marker analysis systems, radioimmunoassay systems, chromatography and similar chemical analysis systems, receptor assay systems and combinations of such systems. Histologic resources, somewhat similarly, can be included, such as tissue analysis systems, cytology and tissue typing systems and so forth. Other histologic resources can include immunocytochemistry and histopathological analysis systems. Similarly, electron and other microscopy systems, in situ hybridization systems, and so forth can constitute the exemplary histologic resources. Pharmacokinetic resources can include such systems as therapeutic drug monitoring systems, receptor

characterization and measurement systems, and so forth. Again, while such data exchange can be thought of passing through the data processing system, direct exchange between the various resources can also be implemented.

[0051] Use of the present system involves a clinician obtaining a patient sample, and evaluation of the presence of a genetic marker in that patient indicating a predisposition (or not) for AMD or its progression, such as one or more of SEQ ID NO: 1-15, and/or sequences including the SNPs identified in Tables 3, 4, 5, 6, 7, 8, 9, and 10 alone or in combination with other known risk factors. The clinician or their assistant also obtains appropriate clinical and non-clinical patient information, and inputs it into the system. The system then compiles and processes the data, and provides output information that includes a risk profile for the patient, of developing AMD and/or progressing to advanced forms of AMD.

[0052] The present invention thus provides for certain polynucleotide sequences that have been correlated to AMD. These polynucleotides are useful as diagnostics, and are preferably used to fabricate an array, useful for screening patient samples. The array, in a currently most preferred embodiment, is used as part of a laboratory information management system, to store and process additional patient information in addition to the patient's genomic profile. As described herein, the system provides an assessment of the patient's risk for developing AMD, risk for disease progression, and likelihood of disease prevention based on patient controllable factors.

Kits

[0053] The invention relates in part to kits and systems useful for performing the diagnostic methods described herein. The methods described herein can be performed by, for example, diagnostic laboratories, service providers, experimental laboratories, and individuals. The kits can be useful in these settings, among others.

[0054] Kits include reagents and materials for obtaining genetic material and assaying one or more markers in a sample from an individual, analyzing the results, diagnosing whether the individual is susceptible to or at risk for developing AMD, monitoring disease progression, and/or determining an appropriate treatment course. For example, in some embodiments, the kit can include a needle, syringe, vial, cotton swap or other apparatus for obtaining and/or containing a sample from an individual. In some embodiments, the kit can include at least one reagent which is used specifically to detect a marker disclosed herein. That is, suitable reagents and techniques readily can be selected by one of skill in the art for inclusion in a kit for detecting or quantifying a marker of interest.

[0055] For example, where the marker is a nucleic acid (e.g., DNA or RNA), the kit includes reagents appropriate for detecting nucleic acids using, for example, PCR, hybridization techniques, and microarrays.

[0056] Where appropriate, the kit includes: extraction buffers or reagents, amplification buffers or reagents, reaction buffers or reagents, hybridization buffers or reagents, immunodetection buffers or reagents, labeling buffers or reagents, and detection means. The kit can include all or part of the nucleic acids of SEQ ID NOS: 1-15 and/or a nucleic acid including a SNP identified in Tables 3, 4, 5, 6, 7, 8, 9, and 10, or a nucleic acid molecule complementary thereto.

[0057] Kits can also include a control, which can be a control sample, a reference sample, an internal standard, or previously generated empirical data. The control may correspond to a known allele, e.g., a wild type and/or a variant allele. In addition, a control may be provided for each marker or the control may be a reference (e.g., a wild type and/or variant sequence).

[0058] Kits can include one or more containers for each individual reagent. Kits can further include instructions for performing the methods described herein and/or interpreting the results, in accordance with any regulatory requirements. In addition, software can be included in the kit for analyzing the results. Preferably, the kits are packaged in a container suitable for commercial distribution, sale, and/or use.

[0059] The following examples are provided for illustration, not limitation.

Example 1

Discovery of genetic variants associated with AMD:

[0060] Age-related macular degeneration (AMD), the leading cause of late onset blindness, arises from retinal damage associated with accumulation of drusen and subsequent atrophy or neovascularization that leads to loss of central vision. The results of a genome-wide association study (GWAS) of 979 advanced AMD cases and 1709 controls using the Affymetrix 6.0 platform with replication in seven additional cohorts (totaling 4337 unrelated cases and unrelated 2077 controls) are presented. These data were combined with the data from the Michigan/Penn/Mayo (MPM) GWAS, which was obtained from a public database, to increase sample size. The Michigan/Penn/Mayo (MPM) GWAS implicated different genes. Analyses of the raw genetic data implicated associated variants in the reference single nucleotide polymorphisms listed in Tables 3, 4, and 5, including the VEGFA gene (discovery P = 2.66e-05) and the GDF6 gene (discovery P = 6.14e-06). In Tables 3 and 5, for example, the effective allele (EA) and odds ratio (OR) are given for each polymorphism. For example, if T is the effective allele and the OR is 1.2, then the T allele is associated with a 20% higher risk compared to the other allele. If C is the effective allele, and the OR is 0.80, then the C allele is associated with a 20% lower risk compared to the other allele.

[0061] In Tables 3-10, the column headers include: SNP, single nucleotide polymorphism; GENE, gene of interest within or near putative interval; Chr, chromosome; BP or POS, base-pair position; EA, effective allele; OR, odds ratio; Al, minor allele; A2, major allele; Meta P, P value for the association between the minor allele and AMD; Z, weighted average and direction of minor allele signal; and P, P value.

[0062] Age-related macular degeneration (AMD) is a common, late-onset disorder that is modified by covariates including smoking and BMI, and has a 3-6 fold higher recurrence ratio in siblings than in the general population. The burden of AMD is clinically significant, causes visual loss, and reduces quality of life. Among individuals age 75 or older, approximately one in four have some sign of this disease, while about one in 15 have the advanced form with visual loss.

[0063] Described herein is a study involving 979 cases of advanced AMD in the discovery phase with multiple stages of replication. Samples (e.g., blood samples) were genotyped on the Affymetrix 6.0, platform which contains probes for 906,000 SNPs and an additional 946,000 SNP-invariant probes to enhance copy number variation (CNV) analysis and captures 82% of the variation at an r 2 > 0.8 for Europeans in the 3.1 million SNPs of HapMap phase 2. These data were combined with data with raw genetic data from a public database and conducted imputation using the HapMap phase 3 and the raw genetic data from the publicly available 1000 genomes project. Analyses of the resultant dataset uncovered several new AMD susceptibility loci for AMD. Significant, replicated associations include variations in VEGFA and GFD6, thus revealing novel markers associated with AMD pathogenesis. Additional associated markers include the SNPs listed in Tables 3, 4, 5, 6, 7, 8, 9, and 10.

METHODS

[0064] Briefly, a genome-wide association (GWAS) was combined with the MPM results as described more detail below. SNPs were imputed based on the HapMap 3 SNP database, and also imputed results based on pilot data from another public database called "the 1000 Genomes project". Using this dataset as the discovery sample, VEGFA SNP rs471 1751 was found to be in linkage disequilibrium with AMD, with a p value of p = 2.66e-05. Another marker, GDF6 SNP rs6982567 also was found to be in linkage disequilibrium with AMD, with a p value of p = 6.14e-6. These data were sent to other groups for replication using TaqMan® or Sequenom® assays, and the association was confirmed. Additional markers which are candidates for genetic variants associated with AMD are listed in Tables 3, 4, and 5. Study Sample Descriptions

[0065] The methods employed in this study conformed to the tenets of the Declaration of Helsinki, received approval from Institutional Review Boards, and informed consent was signed by all participants. Some methods have been described in detail previously. (Neale, et al., "Genome-wide association study of advanced age-related macular degeneration identifies a role of the hepatic lipase gene (LIPC)." Proc Natl Acad Sci U S A 107, 7395-400 (2010); Fagerness, et al., "Variation near complement factor I is associated with risk of advanced AMD." Eur J Hum Genet 17, 100-4 (2009); Mailer, et al., "Variation in complement factor 3 is associated with risk of age-related macular degeneration." Nat Genet 39, 1200-1 (2007); Mailer, et al., "Common variation in three genes, including a noncoding variant in CFH, strongly influences risk of age-related macular degeneration." Nat Genet 38, 1055-9 (2006)) Cases had geographic atrophy or neovascular disease based on fundus photography and ocular examination (Clinical Age-Related Maculopathy Grading System (CARMS) stages 4 and 5). (Seddon, et al., "Evaluation of the clinical age- related maculopathy staging system." Ophthalmology 1 13, 260-6 (2006))

[0066] Controls were unrelated to cases, 60 years of age or older, and were defined as individuals without macular degeneration, categorized as CARMS stage 1, based on fundus photography and ocular examination. Subjects were derived from ongoing AMD study protocols as described previously.

[0067] Tufts/MGH Subjects included in the current GWAS were derived from ongoing AMD study protocols as described previously. (Neale, et al., "Genome- wide association study of advanced age-related macular degeneration identifies a role of the hepatic lipase gene (LIPC)." Proc Natl Acad Sci USA 107, 7395-400 (2010); Mailer, et al., "Common variation in three genes, including a noncoding variant in CFH, strongly influences risk of age-related macular degeneration." Nat Genet 38, 1055-9 (2006); Seddon, et al., "Progression of age-related macular degeneration: association with body mass index, waist circumference, and waist-hip ratio." Arch Ophthalmol 121 , 785-92 (2003); Seddon, et al., "A genomewide scan for age-related macular degeneration provides evidence for linkage to several chromosomal regions." Am J Hum Genet 73, 780-90 (2003); Seddon, et al., "The US twin study of age-related macular degeneration: relative roles of genetic and environmental influences." Arch Ophthalmol 123, 321 -7 (2005); "A randomized, placebo-controlled, clinical trial of high-dose supplementation with vitamins C and E, beta carotene, and zinc for age-related macular degeneration and vision loss: AREDS report no. 8." Arch Ophthalmol 1 19, 1417-36 (2001)) MMAP Subjects included in the current GWAS were obtained from dbGaP

(http://dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=DUC&viewj 3df&stacc=phs00018 2.v2.pl) and described previously. (Chen, W., et al., "Genetic variants near TIMP3 and high-density lipoprotein-associated loci influence susceptibility to age-related macular degeneration." Proc Natl Acad Sci U S A 107, 7401-6 (2010)) Shared controls from GAIN Schizophrenia Study were obtained from dbGap

(http://dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=DUC&vi ew_pdf&stacc=phs00002 1. v2.pl) and described (Manolio, T.A., et al., "New models of collaboration in genome-wide association studies: the Genetic Association Information Network." Nat Genet 39, 1045-51 (2007)) The datasets of Tufts/MGH replication, MIGEN controls, Johns Hopkins University (JHU), Columbia University (COL),

Washington University (Wash-U), and Hopital Intercommunal de Creteil (FR- CRET) were included in a previous study. (Neale, et al., "Genome-wide association study of advanced age-related macular degeneration identifies a role of the hepatic lipase gene (LIPC)." Proc Natl Acad Sci U S A 107, 7395-400 (2010) The datasets Centre for Eye Research Australia (AUS), Genentech, Decode (Iceland) and Rotterdam (ROT) applied the same criteria for the diagnosis and IRB approved protocols of their samples.

Genotyping using Genome-wide panels

[0068] The GWAS genotyping of Tufts/MGH samples and MIGEN samples were performed at the Broad and National Center for Research Resources (NCRR) Center for Genotyping and Analysis using the Affymetrix SNP 6.0 GeneChip (909622 SNPs). (Korn, et al., "Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs." Nat Genet 40, 1253-60 (2008)) Shared controls from GAIN study were also genotyped by using the Affymetrix SNP 6.0 GeneChip. MMAP samples were genotyped by Illumina HumanCNV370vl Bead Array (ILMN 370, 370404 SNPs). (Chen, W., et al., "Genetic variants near ΤΓΜΡ3 and high-density lipoprotein-associated loci influence susceptibility to age-related macular degeneration." Proc Natl Acad Sci U S A 107, 7401-6 (2010))

Other Replication Genotyping

[0069] Samples from Hopital Intercommunal de Creteil (FR-CRET) and Tufts- replication were genotyped at the Broad Institute Center by the Sequenom iPLEX assay (http://www.sequenom.com/Genetic-Analysis/Applications/iPLEX - Genotyping/iPLEX-Overview.aspx). Samples from Wash-U and AUS were genotyped by the Sequenom iPLEX assay at each respective site. Samples from JHU and COL were genotyped by the TaqMan assay using the ABI PRISM

7900 Sequence Detection System (ABI, Foster City, CA, USA)

(https://products.appliedbiosystems.com/ab/en/US/adirect/ ab?cmd=catNavigate2&c atID=601283).

Quality control

[0070] Quality control procedures for the genotype data of Tufts/MGH and MMAP have been described in detail. (Neale, et al., "Genome-wide association study of advanced age-related macular degeneration identifies a role of the hepatic lipase gene (LIPC)." Proc Natl Acad Sci U S A 107, 7395-400 (2010); Chen, W., et al., "Genetic variants near TIMP3 and high-density lipoprotein-associated loci influence susceptibility to age-related macular degeneration." Proc Natl Acad Sci U S A 107, 7401 -6 (2010)). Briefly, individuals with call rates <0.95 then SNPs with call rates <0.98, Hardy-Weinberg equilibrium P <10 "6 , and MAF <0.01 were excluded. Potential relatedness between individuals was identified through a Genome-wide identity-by-state (IBS) matrix using PLINK. (Purcell, et al., "PLINK: a tool set for whole-genome association and population-based linkage analyses." Am J Hum Genet 81, 559-75 (2007)) IBS was estimated for each pair of individuals and one individual from each duplicate or related pair (pihat>0.2) was removed.

Ancestry outliers were identified based on principle components analysis (PCA) using EIGENSOFT (FIG. 7). (Price, et al., "Principal components analysis corrects for stratification in genome-wide association studies." Nat Genet 38, 904-9 (2006).) Imputation and Statistical Analysis

[0071] Stringent quality control checks described in Table 13 were applied on each of the data sets contribute to TMMG samples. We next used BEAGLE version 3.0 (Browning, et al., "A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals." Am J Hum Genet 84, 210-23 (2009); Browning, et al., "Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering." Am J Hum Genet 81 , 1084-97 (2007)) to infer genotypes using the phased CEU and TSI samples (566 haplotypes) of the 1000 Genomes project as a reference. The imputations were performed separately for those cases and controls genotyped on platforms with AFFY 6.0 (more SNPs) and those genotyped with ILMN 370 (less SNPs). For inclusion of data we utilized only imputed genotypes with imputation quality scores >0.6 where the score is defined as the ratio-of-variances (empirical/asymptotic) of each genotype. This score is equivalent to the RSQR HAT value by MACH and the information content (INFO) measure by PLINK. Since the imputation accuracy are relative low for SNPs with low minor allele frequency (MAF), we only included imputed genotypes of common variants (MAF>0.01) in the analysis. PLINK was used as the primary association test for the imputed genotypes coded by the genotype probabilities for each SNP. The eigenvalue scores with nominal siginificant (p<0.05) association to case/control status (first seven PCAs, PCA1 1 and PCA16) and the original genotyping platform were adjusted as covariates in the association test. The P-value for the combined analysis was derived from the sum of weighted average Z score by the Stouffer's Z- score method as previously described. Neale, et al., "Genome-wide association study of advanced age-related macular degeneration identifies a role of the hepatic lipase gene (LIPC)." Proc Natl Acad Sci U S A 107, 7395-400 (2010) The Z score was weighted by the effective sample size of each independent replication cohort if the ratio between cases and controls was equal to 1 based on actual samples listed in Table 12. Heterogeneity of the association between SNP and disease was evaluated by the Cochran's Q-test.

[0072] FIG. 7 shows distribution of genetic ancestry along PCI and PC2 estimated by EIGENSOFT, colored by case (red) / control (blue) status and displayed by the original genotyping platforms, AFFY 6.0 (circle) / ILMN 370 (cross) in all TMMG samples before (left plot) and after (right plot) excluding outliers (PC2>0.05).

[0073] FIG. 8 shows quantile-quantile (Q;Q) plots. We plotted our genome-wide association findings from the cleaned TMMG dataset in Quantile-Quantile (Q:Q) plots. The Q:Q plot on the left represents the strong associations of the CFH, ARMS2/HTRA1, C2/CFB, C3, CFI and LIPC regions that has been previously associated. The Q.Q plot on the right represents the association results of SNPs after excluding these previous associated regions.

[0074] FIG. 9 shows a Manhattan-Plot. The log(p-values) of association results from the cleaned TMMG dataset were plotted for SNPs on each chromosome. SNPs with P<5xl0-7 were colored in red and the representative genes for each associated region were labeled.

[0075] We genotyped 1242 cases and 492 controls of European ancestry, diagnosed based on fundus photography and ocular examination, 1 188 controls from the Myocardial Infarction Genetics Consortium (MIGen), 1378 controls from the GAIN Schizophrenia Study and 1355 cases / 1076 controls from the Michigan, Mayo, APvEDS, Pennsylvania (MMAP) Cohort Study. After thorough quality control analyses, the merged dataset of Tufts/MMAP/MIGen/G ΑΓΝ (TMMG) contained 6728 samples, of which 4300 were genotyped by Affymetrix SNP 6.0 GeneChip and 2428 were genotyped by Illumina HumanCNV370vl Bead Array. The TMMG dataset genotyped by AFFY 6.0 (644,413 SNPs passing quality control checks) was imputed using the phased CEU and TSI samples (566 haplotypes) of the 1000 Genomes project as a reference. (Chen, W., et al., "Genetic variants near TIMP3 and high-density lipoprotein-associated loci influence susceptibility to age- related macular degeneration." Proc Natl Acad Sci U S A 107, 7401-6 (2010)) Separate imputation was performed on the TMMG dataset genotyped on the ILMN 370 (329,315 SNPs passing quality control checks) using the same method. A consensus set of 6,036,699 high quality SNPs from each imputed dataset was analyzed using a generalized linear model controlling for genetic ancestry based on principal component analysis. We observed little statistical inflation in the association statistic after removing known associated loci (see FIGS. 8a, 8b, X gC = 1.047). There were highly statistically significant association signals at SNPs in six previously published loci, including ARMS2/HTRA 1 (rsl0490924, p=2.5xl0 "143 ) , CFH (rsl061170, p=1.6xl0 ~136 ) and (rsl410996, ρ=7.6χ1(Τ 133 ), CFB (rs641 153 ,p=7.8 xlO 23 ), C3 (rs2230199, p=2.6xl 0 19 ), C2 (rs9332739, ρ=7.6χ10· 12 ), CFI (rsl0033900, p=8.7xl0- 12 ), and LIPC (rsl532085, p=3.2xl0 "7 ) (FIG. 9).

Genentech of the Roche group sample replication, (f) WASH-U represents Washington University sample replication; (g) AUS represents the Centre for Eye Research Australia sample replication; (h) Rotterdam represents the Rotterdam study sample replication.

Table 12: Age-related macular degeneration grade, gender and age information for samples.

sample replication, Iceland represents deCODE genetics sample replication and Rotterdam represents the Rotterdam study sample replication. AMD Grading System: grade 1 represents individuals with no drusen or a few small drusen, 4 represents individuals with central or non-central geographic atrophy ("advanced dry type"), and 5 represents individuals with neovascular disease ("advanced wet type").

Table 13: The evolution of sample size as a function of the quality control process.

Individuals who did not cluster with the majority of the sample using a principle components population stratification analysis were removed before imputation. Resultant SNPs from imputation were filtered by quality score>0.6 and Minor allele frequency>0.01.

(a) TMMG; (b) ICELAND; (c)COL;(d) JHU; (e) Genentech; (f) Wash-U; (g) AUS; (h) ROT. (: # Effective allele (EA)-frequency and odds ratio based on this SNP for each locus.

Table 15: Association results of some published candidate SNPs not showing significant evidence of association in TMMG.

[0076] Additional methods for marker discovery and validation also were used. Briefly, the Tufts/MGH replication dataset was comprised of DNA samples from unrelated Caucasian individuals not included in the GWAS, including 868 advanced AMD cases and 410 examined controls who were identified from the same Tufts cohorts, and 379 unexamined MGH controls.

[0077] The GWAS genotyping and the Tufts/MGH follow-up replication genotyping were performed at the Broad and National Center for Research

Resources (NCRR) Center for Genotyping and Analysis using the Affymetrix SNP 6.0 GeneChip and the Sequenom MassARRAY system for iPLEX assays, respectively. Initially, a primary dataset of 1 ,057 cases and 558 was examined controls and studied 906,000 genotyped SNPs and 946,000 CNVs using the Affymetrix 6.0 GeneChip which passed quality control filters. Then 43,562 SNPs were removed for low call rate, 4,708 were removed for failing Hardy-Weinberg test at 10-3, and 8,332 SNPs were removed because of failing a differential missing test between cases and controls at 10-3. Finally, 126,050 SNPs were removed for having allele frequency less than 1 %, similar to other studies using this methodology. Thus, 726,970 SNPs were evaluated in this study in the discovery phase. 73 individuals were removed for lower than expected call rate, resulting in 1 ,006 cases and 536 controls. All quality control steps were performed using PLINK. A preliminary χ2 association analysis was conducted to determine the extent to which population stratification and other biases were affecting the samples and observed a lambda of ~1.05, indicating that the samples were generally well matched for population ancestry, with some minor inflation remaining (explanation and visual

representation see Figure 3A & 3B). MIGEN shared controls were added, which were genotyped on the same Affymetrix 6.0 GeneChip® product, and population stratification analyses were conducted using multi-dimensional scaling in PLINK. These analyses identified 27 cases, 12 AMD controls and 223 MIGEN controls for a total of 262 individuals which were outliers in the principal component analysis. The final genomic control lambda for the logistic regression included seven significant (for prediction of phenotype status) principal components as covariates and was 1.036 for 632,932 SNPs. This dataset was used for our official GWAS analysis. [0078] SNPs with P<10-3 were evaluated from the GWAS discovery sample (n= 720 SNPs excluding previously associated regions) in the MPM GWAS. The exchange of top hits enabled us to use the two scans as primary replication efforts which enhanced the power of each study. Genotyping was performed of all SNPs with combined P<10-4 using Sequenom iPLEX™ at the Broad NCRR Genotyping Center using our Tufts/MGH replication sample. Focusing on sites which continued to show association with P<10-4 after this local replication, a third stage of replication was performed with collaborators in Iceland (deCode Genetics database). For this study, P values were calculated for the combined imputed dataset and for all of our top hits comparing AMD to controls (N=130). Other groups were asked to check these SNPs in their GWAS data— including Iceland (deCode Genetics database) and Genentech— and they sent their data for these SNPs, which were then added to our analyses. To validate the discovery of these SNPs, other groups also were asked to genotype SNPs of interest in their samples using either TaqMan or Sequenom as part of the replication. These data were received as well and combined values were calculated based on the frequencies of the alleles in the various AMD groups-the total advanced AMD case group, as well as the different advanced phenotypes, called geographic atrophy and neovascular AMD. P values for association between these various alleles, genotypes and different AMD case groups were calculated. SNPs associated with geographic atrophy and neovascular disease were studied and these groups were compared to each other, to determine which are associated with one advanced subtype versus the other. 20 SNPs were identified in this comparison (Table 5), and these SNPs were also sent to the same groups noted above, for replication.

[0079] To augment the control set, a subset of controls (n = 1409) from the Myocardial Infarction Genetics (MIGEN) Project was used. Briefly, MIGEN controls are ascertained across Europe, for absence of an MI event. These controls are unscreened for AMD, and so the utility of including them was assessed by examining the previously reported associations in the literature. Specifically, an assessment as to whether the loci at CFH, ARMS2, CFI, C3, CF/B2 showed more significant association to AMD upon expansion of the control sample was performed. The inclusion of these shared controls yielded a dramatic increase in the lambda (2.2). Multi-dimensional scaling was applied based on all pair-wise identity- by-state comparisons for all individuals. The first multi-dimensional scaling component separated out completely the shared controls from the initial dataset (Figure 3A). American populations can be matched to European populations (as long the European populations are diverse), so this complete delineation between the shared controls and the original dataset was due to technical bias between the two datasets. Moving the call rate threshold from 95% to 99% dramatically reduced the lambda (1.22), but still, apparent population stratification effects persisted. Multidimensional scaling was again applied to the IBS matrix, examining the first 10 axes of variation. The first axis of variation no longer classified the cases and controls. The second axis of variation identified a handful of individuals who were apparently either demonstrating high levels of technical bias or were from a different ancestral background (Figure 3B). Finally, the axes of variation were examined to determine whether they significantly predicted case or control status across the genome at an average P-value less than 0.05. Doing so yielded 7 axes of variation and a lambda of 1.036, comparable to the initial study lambda, with an expanded sample size.

Table 1. Age-related macular degeneration grade, gender and age information for samples.

[0080] Tufts/MGH Affy represents the genome-wide association scan using the Affymetrix 6.0 platform from Tufts Medical Center, Tufts University School of Medicine, without the MIGEN controls included; Tufts/MGH Replication represents the follow up replication pool at MGH/Tufts; UM ILMN represents the genome- wide association scan using the Illumina 322 platform from the University of Michigan; JHU represents the Johns Hopkins University sample replication, and NY represents the Columbia University sample replication. AMD Grading System: grade 1 represents individuals with no drusen or a few small drusen, 4 represents individuals with central or non-central geographic atrophy ("advanced dry type"), and 5 represents individuals with neovascular disease ("advanced wet type").

Table 2. The evolution of sample size as a function of the quality control process.

[0081] Each step represents a cleaning stage. The initial sample represents all samples genotyped. The initial dataset cleaning encompasses HWE, call rate, differential missingness between cases and controls, and minor allele frequency threshold. Adding in shared controls, the call rate and MAF thresholds were reapplied. For the final stage, call rate of 99% was required as was the removal of individuals who did not cluster with the majority of the sample.

RESULTS

[0082] Case and Control Sample Development. The initial study consisted of 1 ,057 unrelated cases with geographic atrophy or neovascular AMD, and 558 unrelated controls without AMD who were phenotyped based on clinical examination and ocular photography, and identified from studies of genetic- epidemiology of macular degeneration at Tufts Medical Center. The AMD grade in the worst eye was used in the analyses. All individuals were Caucasian from European ancestry (further details about the original and replication study populations can be found in METHODS and Table 1).

[0083] To enhance the power of this study, unrelated control resources that were genotyped on the same platform in the same lab were included, and additional stringent quality control to ensure the technical and population compatibility of these datasets was conducted (METHODS and Table 2). The final genotyped sample consisted of 979 cases and 1,709 controls. Using a logistic regression analysis including population structure covariates, genomic control inflation factors were comparable between the initial, similarly ascertained sample and the expanded sample, suggesting that potential population differences have been controlled appropriately (979 cases to 536 controls lamba = 1.051 ; 979 cases to 1 ,709 controls lambda = 1.036). Because these additional controls were unscreened for AMD status and may include individuals who have or might later develop AMD, their impact on established associations was determined. The most compelling previously reported associated regions in AMD: CFH on chromosome 1, CFI on chromosome 4, BFIC2 on chromosome 6, ARMS2IHTRA 1 on chromosome 10, and C3 on chromosome 19 were examined. 159 SNPs that were in LD with the most positively associated variant reported in the literature were examined. Of these, 137 showed an improvement in the χ 2 , with the addition of these controls. The average ratio of the initial study's cleaned χ 2 to final study's cleaned χ 2 was 1.82- nearly identical to the expected improvement in χ 2 based on theoretical power calculations of 1.84. As predicted, the addition of a significant number of unselected controls increased the power of this study substantially.

[0084] Genome-Wide Association Discovery Phase. Using a case-control analysis as implemented in PLINK, no SNPs in regions not already reported as being associated with AMD achieved genome-wide significance of 5 x 10 '8 as defined by Pe'er et al. Several SNPs of interest in regions without previously reported association with P-values between 10 "4 to 10 "6 were identified in the discovery scan (Tables 3, 4, and 5), including rs4711751 (VEGFA) with p = 2.66e- 5, and rs6982567 (GDF6) with p = 6.14e-6, as discussed in more detail below.

[0085] Replication Phases. To evaluate the top results from novel regions identified by the scan, several stages of replication analysis were performed. For all SNPs with p < 10° in the genome-wide association scan, results were obtained from the Michigan, Penn, and Mayo scan, selecting only their advanced cases versus controls, and combined the study results as equally weighted-Z scores given the similar sample sizes. From this combined analysis, SNPs with p < 10 "4 or higher in our independent local replication sample of advanced cases and controls from Tufts University School of Medicine and Massachusetts General Hospital (Tufts/MGH) were genotyped, who were unrelated to the individuals in our original scan. Not all SNPs could be imputed perfectly in the Michigan scan, given the different sizes and types of genotyping platforms used (Affymetrix 6.0 with 906,000 SNPS and Illumina with 320,000 SNPS). Therefore, a subset of strongly associated SNPs from the scan alone were selected to be genotyped in the local replication sample. After these steps, a subset of promising SNPs were distributed to collaborators at Iceland (DeCode database) and Genentech, for replication in independent samples. A tally of these P-values for the discovery and local replication stages are presented in Tables 3, 4, and 5. Additional identified SNPs are presented in Tables 6, 7, 8, 9, and 10.

[0086) Results of Combined Scan and Replication Analysis. A SNP on chromosome 6, rs471 1751 (VEGFA), showed significant association with p = 2.66e- 5, and a SNP on chromosome 8, rs6982567 (GDF6), showed significant association with a p = 6.14e-6. In addition, a different VEGFA SNP, rs943080, is reportedly in LD with this SNP (paper forthcoming) and is about 1950 bp from rs6982567. The genome-wide association study results disclosed herein revealed numerous additional SNPs that are associated with AMD (Tables 3, 4, 5, 6, 7, 8, 9, and 10). The nucleic acid sequence corresponding to each reference SNP (rs) number listed in Tables 3, 4, 5, 6, 7, 8, 9, and 10 is incorporated by references herein.

[0087] Thus, rs471 1751, rs6982567, a SNP listed in Tables 3, 4, 5, 6, 7, 8, 9, and 10, and/or a marker in linkage disequilibrium with one of these SNPs can be used in accordance with the present invention as markers for AMD etiology, for determining susceptibility to AMD, and for predicting disease progression or severity, and for distinguishing risk of geographic atrophy, the advanced dry type of AMD from the advanced wet form of AMD. In addition, any marker in LD with one of these markers can be used as a surrogate marker for AMD etiology, for determining susceptibility to AMD, and for predicting disease progression or severity.

[0088] Excluding previously published genetic regions associated with AMD, we detected a region on 6q21 -q22.3 (FIG 6a) containing 30 SNPs with p<5xl0 "7 in the TMMG sample. FIGS. 6a-d show the FRK/COL10A 1 region and association with AMD. FIG. 6a shows observed association in the 500-kb region surrounding the FRK/COL10A1 locus in meta-analysis of TMMG datasets. The represented SNP (rsl999930) for this region of P=3.4xl0 '7 was shown by small purple diamond (see arrow). In the combined analysis including all 8 cohorts this SNP was associated with AMD at P= 6.8 xl0 -8 (large purple diamond; see arrow). FIG. 6b shows Forest plot for rsl 999930 association across 8 cohorts. FIG 6c shows observed association in the 500-kb region surrounding the VEGFA (rs471 1751) locus in meta-analysis of TMMG datasets. In the combined analysis including all 6 cohorts this SNP was associated with AMD at P= 2.0 xl0 -11 (large purple diamond; see arrow). FIG. 6d shows Forest plot for rs471 1751 association across 6 cohorts.

[0089] Since all of these SNPs are in a tight LD block (r 2 >0.8), we chose to investigate the association in this region through rsl999930. The minor T allele frequency of rsl 999930 was 26.0% in cases and 30.5% in controls (Table 1 1) for the TMMG sample, with an odds ratio (OR) of 0.81, and 95% confidence interval (CI) 0.77-0.84. To confirm this new locus for AMD, we tested rsl 999930 in a total of 4269 independent cases and 50,938 independent controls of European ancestry from Johns Hopkins University (JHU), Columbia University (COL), Genentech, Decode, Washington University (Wash-U), Centre for Eye Research Australia (AUS), Rotterdam (ROT), and Hopital Intercommunal de Creteil (FR-CRET) (Table 12). Frequency and risk associated with the minor allele T of rsl 999930 in each replication cohort were all in the same direction as in TMMG (Figure 6b).

[0090] Combining the test statics of all independent replication cohorts weighted by their sample size using Stouffer's Z-score method, this association was confirmed (OR = 0.91, P = 0.0057). The results were very consistent across datasets with no significant evidence for heterogeneity under the Cochran's Q-test for our samples {Q =0.09, 1 2 =44[0 - 75]). In the combined analysis of all the samples, the T allele of rsl999930 significantly (p=6.8xl0 8 ) reduced the risk of AMD (OR=0.87 [0.83 - 0.92]). This associated region represented by rsl999930 contains the genes COL J OA J (encoding the alpha chain of type X collagen) and FRK (encoding fyn- related kinase).

[0091] We also tested other unreported loci for AMD with p-value<5xlO ~5 in the TMMG meta-analysis (Table 14) and several previously reported loci (Table 1 1) with suggestive association results. The risk variants in TIMP3 (rs9621532, p=6xl0 " 14 ) and HDL pathway genes LIPC (rsl0468017, p=5.3xl0 "9 ), CETP (rs3764261, p=9.6xl0 "9 ) were genome-wide significant in our combined analysis and a previously suggestive association in ABCA1 (rsl883025, p=9.5xl0 ~7 ) was still noteworthy. Another locus near C4BPA/CD55 gene was suggested from the TMMG analysis (P=4.6xl0 7 ), however, the combined p-value was 9.9x10 "5 .

[0092] Among the other previously unreported loci, the T allele of one candidate SNP (rs471 1751) near VEGFA, was associated with increased risk of AMD

(OR=1.21 [1.16 - 1.27], p=1.5xl0 "5 ) in the TMMG meta-analysis and the results were very consistent in direct genotyping replication in 3277 cases and 42091 controls (OR=1.20 (1.15 - 1.24), p=2.9xl0 "7 ). This SNP reached genome-wide significance (OR=1.20 [1.17 - 1.24], ρ=2.0χ1θ") in the combined analysis including all replication cohorts (FIGS. 6c, 6d). This novel association with VEGFA was also found in a parallel meta-analysis on a SNP (rs943080, R 2 =l , D'=l) in LD with rs471 1751. Our newly identified SNPs is 3' downstream of VEGFA and more than 90kb away from the SNP in VEGFA promoter region (rs2010963), which was reported to be associated with AMD previously (Table 15). The rs2010963 allele is in very low LD with rs471 1751 (R 2 = 0.015,D' = 0.138); therefore the association we identified in VEGFA was in a novel region and not likely due to LD with SNPs in the VEGFA promoter region. Of note, the previously reported rs2010963 SNP showed little evidence of association in our TMMG meta-analysis (p=0.26).

[0093] VEGFA which is a member of the vascular endothelial growth factor family increases vascular permeability, angiogenesis, cell growth and migration of endothelial cells. VEGFA has been a major candidate for AMD risk and it has been hypothesized that activation of this gene may induce pathologic angiogenesis under the retinal epithelial (RPE). Interestingly, Rajpar et al. described creating a knock-in mouse for COL10A1 p.Asn617Lys (possible human SNP rs61745148) which reduced the VEGF expression in hypertrophic chondrocytes leading to a significant reduction in the recruitment of osteoclasts to the vascular invasion front. (Rajpar, M.H., et al., "Targeted induction of endoplasmic reticulum stress induces cartilage pathology." PLoS Genet 5, el 000691 (2009))

[0094] Furthermore, hypoxia-inducible factor-2ct (HIF-2a, encoded by EPAS1) was shown to enhance promoter activities of COLIOAI,

MMP13 and VEGFA through specific binding to the respective hypoxia-responsive elements. (Saito, T. et al., "Transcriptional regulation of endochondral ossification by HIF-2alpha during skeletal growth and osteoarthritis development." Nat Med 16, 678-86 (2010))Hypoxia is known to increase VEGF transcription, translation, and mRNA stability because VEGFA is extremely sensitive to oxygen levels. VEGFA signaling of the Akt pathway can be antagonized by transpondin-1 (TSP-1) which can modulate the remodeling of the microvascular network of the developing retina. FRK has been shown to have negative function on the stimulation of microvascular survival by mediating the downstream signaling of TSP1 and the TSP receptor {CD36). (Sun, J. et al., "Thrombospondin- 1 modulates VEGF-A-mediated Akt signaling and capillary survival in the developing retina." Am J Physiol Heart Circ Physiol 296, HI 344-51 (2009)) It is quite possible that this SNP or set of SNPs in the region directly affect the expression of VEGF through either or both COLIOAI and FRK signaling.

[0095] We also investigated the specific association with geographic atrophy (GA) and neovascular (NV) subtypes of AMD in our TMMG samples respectively.

Association signals on CFH, C2, CFB, C3, CFI and ARMS2/HTRA 1 were also highly significant for both GA and NV compared to controls. The minor allele (T) of rsl999930 had a similar effect size for GA (OR=0.78 [0.69 - 0.89], P=1.0xl0 -4 ) and NV (OR=0.82 [0.75 - 0.90], P=4.1xl0 -5 ). The risk allele (T) of rs471 1751 also had a similar magnitude of effect on GA (OR=1.23 [1.08 - 1.40], P=2.0xl0 "3 ) and NV (OR=1.20 [1.09 - 1.32], P=2.5xl0 -4 ).

[0096] We found two novel associated loci near FRK/COL10A1 and VEGFA, and confirmed associations for ten previously published AMD loci in our combined analysis. The genetic loci associated with AMD suggest that the disease process may be explained in part by pathological activation of the alternative complement pathway (CFH, C2, CFB, C3, CFI), the imbalance of HDL cholesterol metabolism (LIPC, CETP,JBCAl) and possibly angiogenesis (VEGFA) induced by dysfunction or degradation of extracellular matrix (COL1OA1, FRK, ARMS2, TIMP3).

[0097] The use of headings and sections in the application is not meant to limit the invention; each section can apply to any aspect, embodiment, or feature of the invention.

[0098] Throughout the application, wtiere compositions are described as having, including, or comprising specific components, or where processes are described as having, including or comprising specific process steps, it is contemplated that compositions of the present teachings also consist essentially of, or consist of, the recited components, and that the processes of the present teachings also consist essentially of, or consist of, the recited process steps.

[0099] In the application, where an element or component is said to be included in and/or selected from a list of recited elements or components, it should be understood that the element or component can be any one of the recited elements or components and can be selected from a group consisting of two or more of the recited elements or components. Further, it should be understood that elements and/or features of a composition, an apparatus, or a method described herein can be combined in a variety of ways without departing from the spirit and scope of the present teachings, whether explicit or implicit herein.

[0100] The use of the terms "include," "includes," "including," "have," "has," or "having" should be generally understood as open-ended and non-limiting unless specifically stated otherwise.

[0101] The use of the singular herein includes the plural (and vice versa) unless specifically stated otherwise. Moreover, the singular forms "a," "an," and "the" include plural forms unless the context clearly dictates otherwise. In addition, where the use of the term "about" is before a quantitative value, the present teachings also include the specific quantitative value itself, unless specifically stated otherwise. [0102] It should be understood that the order of steps or order for performing certain actions is immaterial so long as the present teachings remain operable. Moreover, two or more steps or actions may be conducted simultaneously.

[0103] Where a range or list of values is provided, each intervening value between the upper and lower limits of that range or list of values is individually contemplated and is encompassed within the invention as if each value were specifically enumerated herein. In addition, smaller ranges between and including the upper and lower limits of a given range are contemplated and encompassed within the invention. The listing of exemplary values or ranges is not a disclaimer of other values or ranges between and including the upper and lower limits of a given range.

[0104] The aspects, embodiments, features, and examples of the invention are to be considered illustrative in all respects and are not intended to limit the invention, the scope of which is defined only by the claims. Other embodiments,

modifications, and usages will be apparent to those skilled in the art without departing from the spirit and scope of the claimed invention.

[0105] What is claimed is:

Table 3

Table 3

Table 3

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 4

Table 5

Table ' 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 6

Table 7

Table 7

Table 7

Table 7

Table 7

Table 7

Table 8

Table 8

Table 8

Table 9

GENE SNP CHR:BP GENE

HTRA1 I rs10490924 10: 124204438 HTRA1

CFH !rs1061 170 1 :194925860 CFH

CFH irs1410996 1 :194963556 CFH

CFB |rs641 153 6:32022159 CFB

C3 irs2230199 19:6669387 C3

C2 Irs9332739 6:3201 1783 C2

TIMP3 jrs9621532 22:3141451 1 TIMP3

LIPC 'rs10468017 15:56465804 LIPC

CFI |rs10033900 4:1 10878516 CFI

COL10A1 ,DSE,FRK,TSPY Ί COLIOAIDSE.FRKJSPY irs12204816 6:116568331

L1.TSPYL4 I L1.TSPYL4

VEGFA |rs471 1751 6:43936560 VEGFA

jCETP jrs3764261 16:55550825 CETP

IFRK frs 1999930 6:116493827 FRK

ABCA1 |rs1883025 9: 106704122 ABCA1

FRK/COL10A1 •rs12196141 6:116596243 FRK/COL10A1

HCG27(0) Irs9366769 6:31277268 HCG27(0)

LIPC jrs493258 15:56475172 LIPC

COL8A1 frs13095226 3: 100878962 COL8A1

TSHZ3(+ 107.3kb) rs2052572 19:36639376 TSHZ3(+107.3kb)

NT5DC1(0) |rs509859 6: 1 16529937 NT5DC1 (0)

FILIP1 L(0)|C3orf26(0) rs7626245 3:101053451 FILIP1 L(0)|C3orf26(0)

CNTNAP4 irs8053796 16:74921678 CNTNAP4

MYOM2(-1494kb) rs722782 ;8:506479 MYOM2(-1494kb)

FAM1 ~ 35B(0) ~ rs10103808 8:139212254 FAM135B(0)

OTOL1(+488.4kb) ;rs4256145 3:163192835 OTOL1 (+488.4kb)

IRF4(-675.1 kb) 'rs9328048 6: 1324870 IRF4(-675.1 kb)

INTU |rs1443179 4:128495772 INTU

MEIS2(+20.97kb) ; chr15:35201758 15:35201758 MEIS2(+20.97kb)

CTSD rs5591 1 157 1 1 : 1762440 CTSD

CDH12(-477.3kb) rs2883171 5:21309600 CDH12(-477.3kb)

CNTNAP4 rs6564324 16:74929641 CNTNAP4

ZFAT(+315.9kb)|KHDRBS ZFAT(+315.9kb)|KHDRBS irs 13253938 8:136110326

3(-428.6kb) 3(-428.6kb)

CDH9(-704.3kb) ichr5:26212150 5:26212150 CDH9(-704.3kb) Table 9

GENE SNP CHR:BP GENE

TNFRSF10A(+0.332kb)|C TNFRSF10A(+0.332kb)|C rs 13278062 8:23138916

HMP7(-18.18kb) HMP7(-18.18kb)

PCDH15(+149.1 kb) rs61856267 10:56380194 PCDH15(+149.1kb)

GDF6 rs6982567 8:96819457 GDF6

MRPL19(+489.6kb) rs1851808 2:76232489 MRPL19(+489.6kb)

C1orf1 16,C4BPA,C4BPB, | C1orf116,C4BPA,C4BPB,

CD55.CR1 ,CR2,DAF,PFK rs 12040406 1 :205515927 CD55,CR1 ,CR2,DAF,PFK

FB2.YOD1 FB2.YOD1

TKT rs 12632671 3:53233464 TKT

MOXD1 (- MOXD1 (- rs728371 6: 132523980

134.9kb)|CTGF(+209.8kb) 134.9kb)|CTGF(+209.8kb)

GANC(O) rs12908430 15:40375218 GANC(O)

MFGE8(+54.89kb)|HAPLN MFGE8(+54.89kb)|HAPLN

'3(+72.78kb)|ACAN(+92.971 rs11854658 15:87312556 3(+72.78kb)|ACAN(+92.97 kb)|ABHD2(-1 19.9kb) kb)|ABHD2(-119.9kb)

TMCO1 (0) chM : 163986967 1 : 163986967 TMCO1 (0)

SYNGAP1 (0) rs9461856 6:33503177 SYNGAP1(0)

USP31(+23kb)|UBFD1 (- USP31 (+23kb)|UBFD1 (-

385.3kb)|SCNN1 B(- 385.3kb)|SCNN1 B(- rs4967980 16:23091095

130kb)|SCNN1 G(- 130kb)|SCNN1 G(-

10.45kb)|COG7(-216.2kb) 10.45kb)|COG7(-216.2kb)

RREB1 rs11755724 6:7063989 RREB1

WDR35(+ 148.1 kb)|TTC32( WDR35(+ 148.1 kb)|TTC32(

+236.3kb)|SDC1 (- +236.3kb)|SDC1 (-

62.54kb)|PUM2(- rs6531212 2:20201501 62.54kb)|PUM2(-

1 10.4kb)|MATN3(+125.6k 1 10.4kb)|MATN3(+125.6k b)|LAPTM4A(+86.58kb) b)|LAPTM4A(+86.58kb)

PSMD7(-598.4kb) rs2127740 16:72289810 PSMD7(-598.4kb)

TMC05 rs16965939 15:35992085 TMC05

MAPK10 rs28621471 4:86877248 MAPK10

TBX3(+404kb) rs11067403 12: 1 14010367 TBX3(+404kb)

B3GALTL(0) rs1 ~ 912795 13:30736688 B3GALTL(0)

GCNT4(+22.83kb)|HMGC GCNT4(+22.83kb)|HMGC rs 12520598 15:74385314

R(-283.5kb) i R(-283.5kb) Table 9

Table 9

Table 9

Table 9

Table 9

Table 9

Table 9