Current Understanding of DNA Methylation and Age-related Disease

: DNA methylation involves the covalent transfer of a methyl group to the C-5 position of the cytosine ring on a DNA strand. DNA methylation is both heritable and modifiable and can affect gene expression. In recent years, epigenome-wide association studies using high-throughput technologies have associated variation in DNA methylation levels with normal and pathological aging processes in human populations. DNA methylation patterns have been used to construct epigenetic clocks which can serve as potential biomarkers of age-related diseases. Age acceleration, as determined using these epigenetic clocks, has been strongly linked to common diseases including cancer, neurodegenerative diseases, metabolic diseases, and cardiovascular diseases. Identification of these robust associations between (EWASs) associations DNA methylation and age-related diseases.

methylation in aging processes and recent advances in epigenome-wide association studies (EWASs) reporting associations between DNA methylation and age-related diseases.

Epigenetics
The definition of epigenetics has been modified over decades. In the mid-twentieth century, epigenetics was defined as the development of a complex organism from a zygote and all processes within [1,2]. With advancing knowledge of the various mechanisms of cellular development and evolution, the definition of epigenetics has altered. Today, epigenetics is more narrowly defined as structural and chemical changes of DNA and associated regulatory proteins (eg. histones), excluding changes to the nucleotide sequences [1,2].
Epigenetic modifications may alter the accessibility to DNA and chromatin. These may involve the recruitment or exclusion of various factors that regulate gene transcription. There are multiple epigenetic mechanisms known to affect these changes. One mechanism, proposed in two independent papers by Riggs [3] and Holliday and Pugh [4], outlined a molecular model in which the cytosine rings of DNA are methylated and demethylated, providing an on/off switch for gene expression [2,5]. DNA methylation also provided an explanation for the heritability of gene expression through somatic cell divisions [5].
Another epigenetic mechanism contributing to the regulation of gene expression involves histone variants and modifications, which alter the chromatin structure and subsequent access of transcription machinery. Histones can undergo several modifications including acetylation, methylation, and phosphorylation. Such modifications are followed by nucleosome remodelling which loosens the chromatin to allow binding of transcription machinery (euchromatin) or tightening of the chromatin to prevent binding (heterochromatin) [1].
More recently, several RNAs have been associated with epigenetic modifications and gene silencing. For example, long non-coding RNAs (lncRNAs) are known for their gene silencing effects in imprinting and X-chromosome inactivation [6]. LncRNAs recruit polycomb complexes which modify histones, and RNA-binding proteins to prevent histone deacetylation or promoter association [6]. Additionally, subclasses of small non-coding RNAs (sncRNAs) have been associated with gene silencing [7]. Piwi-interacting RNAs (piRNA) are an example of sncRNAs that repress transposon expression through de novo methylation [6]. There are also studies suggesting that small-interfering RNA (siRNA) target TATA-box sequences, blocking the initiation of transcription without modifying histones or DNA [6]. Together, these mechanisms demonstrate the complex cooperation of different epigenetic mechanisms for the regulation of gene expression.

DNA Methylation
DNA methylation is a well-studied epigenetic mechanism in plants and animals. Methylation involves the covalent transfer of a methyl group to the C-5 position of the cytosine ring on a DNA strand [8,9]. In somatic mammalian cells, 98% of DNA methylation occurs at symmetric cytosine/guanine dinucleotides (CpGs), whereas only 75% of DNA methylation in embryonic stem cells (ESCs) occurs on CpGs [8]. A significant proportion of DNA methylation has been detected in non-CG contexts, such as CHG or CHH (where H can be A, T, or C) [10,11].
DNA methylation has been shown to play a role in normal development, genomic imprinting, Xchromosome inactivation, chromosome stability, suppression of repetitive element transcription and regulation of gene transcription. Dysfunction of DNA methylation processes is strongly linked to many diseases, especially cancer [5,8].
DNA methylation is regulated by DNA methyltransferases. In humans there are four -DNMT1, DNMT2, DNMT3a, and DNMT3b -which serve a variety of functions [8,12]. DNMT1 is responsible for the maintenance of DNA methylation patterns in daughter strands during DNA replication and prefers hemimethylated DNA [8]. DNMT2 methylates cytosine-38 in the anticodon loop of the tRNA for aspartic acid, but does not methylate DNA [13]. DNMT3a and DNMT3b prefer unmethylated CpG dinucleotides and are responsible for de novo methylation during development. DNMT3a, DNMT3b, and DNMT1 cooperatively function to initiate and maintain DNA methylation. In addition, DNMT3L is a DNA methyltransferase-like protein which, although lacks the conserved catalytic domain that is common to the DNA methyltransferases, serves an important role in DNA methylation. DNMT3L has been shown to establish maternal methylation imprints, stimulate DNMT3a activity [14], and assist the DNMTs in binding the methyl group donor [8]. Each of these DNMTs are essential for normal mammalian development [15].
Although DNA methylation is chemically and genetically stable, it is a reversible modification that can occur actively or passively. Passive demethylation occurs when there is a lack of maintenance machinery, such as DNMT1, which results in the dilution of methylation during replication [16,17]. In contrast, active demethylation has several proposed mechanisms. However, most have been discredited due to lack of supporting evidence [17][18][19][20][21]. Two currently accepted demethylation mechanisms involve TET-mediated oxidation of 5-methylcytosine (5mC) to 5hydroxymethylcytosine (5hmC), which is further oxidated to 5-formylcytosine (5fC) and 5carboxlcytosine (5caC) [22]. These oxidated states may be diluted through replication, a process similar to passive demethylation [17]. Alternatively, 5fC and 5caC may be excised by thymine DNA glycosylase (TDG) and subsequently replaced by an unmodified cytosine through base excision repair (BER) [23]. Active DNA demethylation has been implicated in several biological pathways. This includes pre-implantation and primordial germ cell development [24,25], maintenance and differentiation of mouse embryonic stem cells [26], neuronal functions [27,28], DNA repair and genomic instability [17].

Regulation of Gene Expression by DNA Methylation
Initially it was thought that the sole function of DNA methylation was to silence gene expression. As more recent studies have emerged, it has become evident that the effects of DNA methylation vary depending upon the position in the genome at which it occurs [5,29]. For example, methylation of CpGs within transcription start sites (TSS) has been strongly linked to the prevention of transcription initiation [30,31]. Additionally, inappropriate methylation of CpG islands in, or near, transcription start sites can lead to the silencing of tumour suppressing genes [32]. In contrast, methylation within the gene body has been positively correlated with gene expression [33] and may even stimulate elongation and splicing [5]. Centromeres and other repeat regions can also be methylated and has been shown to improve chromosomal and genome stability [34]. Furthermore, somatic and cancer cells become unviable with a total loss of methylation [35,36], yet this effect has not been observed in ESCs [37].
Repressed genes are usually characterised by methylated CpGs within the promoter region, including CpG islands (CGI) [5,38,39]. However, methylation of promoter CGIs is not a characteristic of all repressed genes. It is usually genes that are permanently repressed that have methylated promoter CGIs. Imprinted genes, genes on the inactive X-chromosome and genes that would otherwise be inappropriately expressed in somatic cells -such as those exclusively expressed in germ cells -are examples of this circumstance [5,30]. However, it is still unclear why only a minority of CGIs are methylated. A summary of the role of DNA methylation in gene expression is shown in Figure 1 below.
The role of DNA methylation on gene silencing remains a subject of debate [40]. Regardless, it has been shown that in some circumstances gene transcription cannot be initiated when promoter CGIs are methylated once nucleosomes have formed [5,39]. Increasing evidence indicates that silencing precedes methylation therefore suggesting that DNA methylation has a maintenance role. This was demonstrated in a number of experiments including genes in the inactivated Xchromosome, and genes implicated in cancer [41,42]. Further proof of this mechanism began to be unravelled by Ooi et al. (2007) who showed that de novo methylation in cells that express DNMT3L required DNMT3a, DNMT3L, and nucleosomes. Since active transcription start sites do not have nucleosomes, de novo methylation cannot occur, therefore the TSS, and the gene, have to be inactivated prior to methylation [43]. Additionally, nucleosomes in active genes are marked by the histone modifications H3K4me2 or H3K4me3 and the histone variant H2A.Z, which are resistant to methylation [43]. This model predicts that higher levels of gene expression decrease the likelihood of de novo methylation with evidence of monoallelic methylation on the least expressed allele supporting this prediction [43]. However, these observations have only been studied in cells that express DNMT3L and it is not known whether similar processes occur in cells not expressing DNMT3L. The mechanism by which DNA methylation within the gene promoter prevents binding of transcription factors occurs through chromatin remodelling [44]. Methylated CpGs allow methyl-CpG-binding domain proteins to recruit repressor complexes and modify histones, to form either heterochromatin or euchromatin [44].
CpG sites within the gene body are more scarce but can be extensively methylated [5]. A strong positive correlation exists between active transcription and DNA methylation in gene bodies within the active X-chromosome [45]. Recently it has also been suggested that DNMT3B-mediated gene body methylation represses intragenic promoters [46], so that rather than being a causative activating mechanism, methylation restricts the activation of cryptic promoters within transcribed genes. Furthermore, a role in splicing regulation has been suggested for methylation as exons are more heavily methylated than introns with a transition occurring at exon-intron boundaries as well as being increased in the nucleosomes [39].

Epigenome-wide Association Study Designs and Technologies
Epigenome-wide association studies (EWAS) are a commonly used approach for detecting epigenetic changes, mainly DNA methylation, that may be occurring on the genome as a consequence of, or contributing to, a phenotype [47]. EWAS can identify DNA methylation signatures in either case vs. control designs or explore associations with continuous phenotypic traits, allowing insight into potential relationships between the methylome and a phenotype of interest [47]. These studies have promising potential for understanding disease mechanisms and the identification of biomarkers.
Whole-genome bisulphite sequencing (WGBS) remains the current gold standard method for mapping DNA methylation status [48]. However, it is expensive to perform, and parallelization is a major limitation [48,49]. In light of this, some capture techniques have been developed, such as reduced representation bisulfite sequencing (RRBS), which allows a subset of the genome to be targeted thus reducing costs. However, array-based technologies have gained popularity for their user-friendly, cost effective, time efficient, and reliable results. Until recently the Human 450K Methylation BeadChip was the most widely used array due to its high-throughput capabilities and wide coverage of most genes as well as upstream and downstream regions. However, the 450K design does not include some regulatory regions that have been shown to have an effect on transcription levels and phenotypic variation [48]. Illumina have recently improved upon this by introducing the Infinium MethylationEPIC BeadChip (ie. ~850K), which covers >90% of the original CpGs on the 450k BeadChip with an additional 350,000 CpGs in enhancer regions. However, an important limitation of array-based approaches remains in that they are not able to assay DNA methylation at single-base resolution or discern allele-specificity.

Figure 1
Regulation of gene expression by DNA methylation. Effects of DNA methylation on gene transcription are location-dependent. For example, methylation in the promoter region is observed in repressed genes. Similarly, methylated CpGs on, or near, transcription sites prevent transcription initiation. However, gene body methylation is strongly correlated with gene expression and has been implicated in elongation. Additionally, it has been speculated that gene body methylation may have a role in splicing since exons are more heavily methylated than introns, and there is a transition at exon-intron boundaries.
The choice of EWAS design used is dependent on the aims of the investigation. The simplest approach is the cross-sectional case-control study. This is particularly useful for comparing the epigenome of a group that express a phenotype of interest to those that do not [47,50,51]. Gopalan, et al. [52] used a cross-sectional study design to map and compare DNA methylation patterns across African and European populations to determine whether the patterns are replicated across groups of different genetic and environmental backgrounds. Cross-sectional studies are advantageous due to the large amount of readily available cohorts with DNA samples. However, cross-sectional studies are limited by a number of factors. The epigenome is highly variable, due to genetic factors, but can also be influenced by environmental factors giving rise to inter-individual differences which influence the results. It is also difficult to ascertain whether epigenetic variability is the cause or consequence of the phenotype.
Monozygotic twins are genotypically identical but are not always phenotypically identical. Thus, twin studies are ideal for investigating the effect of the epigenome on phenotype and gene expression [53,54]. Interestingly, one twin study revealed that epigenetic differences naturally arise in individuals as they age, correlating with differential gene expressions in monozygotic twins, demonstrating the robust effect of the epigenome [54]. In addition, twin studies are an ideal EWAS design as it eliminates many confounding factors, particularly environmental and genetic, that are present in non-twin studies. This is especially true for twins raised in similar environments. However, studies investigating causative factors of disease require a longitudinal approach, limiting sample size and resources for appropriate twin studies [55].
Longitudinal and replication studies involve monitoring a study population across a determined time course to track changes in monitored information (phenotypic, genomic, epigenomic, etc), with the goal of inferring associations for phenotypes of interest and potential traits that influence them over time. Such study designs have been used to track changes in DNA methylation associated with healthy aging [38], memory retention [56], diabetic research [57], and cancer research [58], among many other age-related diseases. Ideally, longitudinal studies monitor disease-free, or phenotypic-negative, subjects over many years and observe any phenotypic changes or disease-onset along with DNA methylation changes to identify potential biomarkers [55]. These studies eliminate inter-individual differences that confound results of other EWAS designs and allow observation of changes, rather than identification of differences, in DNA methylation patterns. However, longitudinal studies are expensive, time consuming, and recruiting subjects is often difficult.
As the application of epigenome-wide association studies become broader, more recent studies have been designed using a combination of approaches. This is aimed at reducing, or eliminating, confounding factors to identify more robust associations. Another consideration for EWAS is sample source. Most studies use whole blood, blood components, or saliva, as these are the least invasively obtained and have a wide range of applications including the identification of disease biomarkers in some tissue-specific diseases [59]. Other studies have used other tissue-types, such as brain [60], and cancer tumours [59] where blood is an unsuitable sample. The invasive nature of the procedures required to obtain these samples make these types of studies difficult to conduct. Until less invasive procedures are identified to obtain reliable methylation data, EWAS will be mostly confined to the use of whole-blood, or blood components, and saliva.
As a result, currently, most studies of epigenomics are conducted using mixed cell populations. While this has been very successful, there are limitations in the field that cannot be overcome until epigenomic changes can be observed in single-cell resolution. Recent studies suggest that the use of heterogenous tissues without consideration of cell-subtypes increases false positives [61]. Therefore, deconvolution of signals from mixed cell populations have been recently introduced in epigenetic data analysis. These aim to determine the proportion of cell-subtypes within a sample to reduce false positive associations [62,63]. An alternative to deconvolution of the data sets is the use of single-cell sequencing, a recent advancement in the omics field. It allows the interrogation of all CpG sites in the genome, potentially providing more insightful data into the mechanisms of epigenetic changes and their effect on phenotypic variability in comparison to the current bulk cell approach [64].

Epigenetic Clocks -Predictors of Age
The epigenetic drift and epigenetic clock phenomena have provided the basis for epigenetic and aging studies [65,66,67]. Epigenetic drift can be defined as the general decrease in DNA methylation and the increase in variability of methylation patterns with age [67,68]. This has been observed in monozygotic twins, where twin pairs who have similar epigenomes at birth, become more discordant with age [54]. These changes may be in response to environmental influences and some have been significantly linked to phenotypic variability as well as risk of disease. Alternatively, the epigenetic clock describes the changes in DNA methylation status in specific CpG sites across the genomes that are associated with age [67][68][69].
Although there have been several epigenetic predictors of age developed, there are currently only two models that are commonly used. The Hannum [70] and Horvath [59] models were developed independently and use different methods of calculating an individual's epigenetic age. The Hannum model was developed based on DNA methylation measurements using Illumina 450K array data from whole blood samples of more than 650 individuals from a mixed population [70]. The model identified 71 sites that were highly predictive of chronological age that had a 96% correlation with a margin of error of 3.9 years [70]. In comparison, the predictive model developed by Horvath involved over 7,800 samples from 51 different cell and tissue types. This model identified 353 CpG sites with robust correlations with chronological age. Horvath developed a multi-tissue predictor with a 96% correlation with a margin of error of 3.6 years [59]. Interestingly, epigenetic age estimates determined independently using both models are correlated despite only sharing 6 CpG sites in common [71]. Since epigenetic clocks were derived, they have been used to determine whether epigenetic age is associated with chronological age, and to infer implications on aging.
In addition to developing a predictive model of aging, Horvath also arrived at several other conclusions related to DNA methylation age and aging. He found that certain tissue types produced high error margins when using the predictive model, including breast tissue, skeletal muscle tissue, cardiac tissue, dermal fibroblasts, and uterine endometrium, although reasons for these discrepancies are unclear [59]. The results of this study also demonstrated that the DNA methylation age of both embryonic and induced pluripotent stem cells is near zero. Following from this, Horvath determined that the aging rate of the methylome follows a logarithmic increase until adulthood, then becoming linear thereafter [59].
The study by Horvath also tested the multi-tissue predictor on cancer tissues which exhibited extreme age acceleration [59]. These models have been used in many studies to determine whether lifestyle factors affect age acceleration [51], and whether age acceleration is correlated to disease onset and mortality [50,71].
Other confounders include the changes that occur in blood cell composition with age. Since the majority of epigenetic and aging studies are conducted using whole blood samples, it is important to determine whether the change in blood cell composition affects DNA methylation age. Horvath addressed this issue and concluded that while there is a correlation between DNA methylation ages with the abundance of some blood cell types, the correlation is a confounding effect of chronological age [59]. Additionally, DNA methylation age is not a reflection of changes in blood cell type composition, but rather the intrinsic changes of the methylome [59]. More recent studies have now focussed on the use of two more measures of epigenetic age, namely the intrinsic and extrinsic epigenetic age acceleration (IEAA and EEAA). IEAA measures the epigenetic age in blood independent of blood cell counts affected by chronological age [59,71,72]. In contrast, EEAA uses whole blood samples to measure epigenetic aging of immune cells and changes to blood cell composition [72]. Evidently, the use of IEAA and EEAA are only applicable to blood samples. For all other tissue and cell types, the universal measure of age acceleration is used, which is simply the average epigenetic age compared to chronological age.

Age-associated Diseases and DNA Methylation
An aged epigenome is accompanied by changes in DNA methylation which can adversely affect transcriptional processes and downstream protein structure and function [38,54,69,70,73]. Studies examining methylome aging rates and the increase in epigenetic drift revealed that these mechanisms have effects on the risk of age-related disease onset [54,70]. It has been proposed that this increase in disease risk could be due to gradual deregulation of epigenetic mechanisms and regulatory mechanisms. Consequently, this deregulation contributes to the loss of phenotypic plasticity in response to environmental stimuli, promoting aging [74].
A recent EWAS by Zhang, et al. [75] illustrated the role of DNA methylation in mortality. The study investigated DNA methylation in deceased individuals and compared these to a healthy cohort and a validation cohort. From this study, 58 CpG probes were discovered to be strongly associated with mortality and were subsequently replicated in the other cohorts. These CpGs were located across 38 genes, 14 intergenic regions on 19 chromosomes. The genes identified have been linked to a multitude of diseases including diabetes, cardiovascular diseases, cancers, neuropsychiatric disorders, and HIV. Interestingly, none of the CpGs identified were previously associated with age, although the majority were associated with smoking exposure. A risk score was also determined for mortality (independent of cause), cardiovascular diseases, and cancer, based on DNA methylation and identified 10 CpGs to be strong predictors of mortality [75].
Although hypomethylation is usually observed in an aging epigenome, hypermethylation primarily occurs in promoters of genes associated with cancer, such as tumour suppressors [69,74]. This is also observed for CpG islands in genes associated with T-cell immune responses and Tcell differentiation, resulting in decreased expression [76] with hypermethylation linked to the repression of gene expression (model in Figure 2). This likely contributes to the high incidence of cancer and the decline in immunocompetence and response to infection in aging individuals. It is also notable that epigenetic drift in stem cells hinders the repair and regeneration of tissues [77,78], particularly in the gastrointestinal tract and the regeneration of leukocytes. In human somatic cells, the deterioration of methylomes have been observed in central nervous system tissues, specifically in genes related to nervous system development, neuron differentiation, and neurogenesis, eventually leading to neurodegenerative disorders [79]. Hypomethylation of Alu and LINE1 elements during aging has also been related to a significant decline in lung function in the elderly [80]. In addition, CD8+ T cells have been identified to have an increased methylome variance associated with age that has been linked to impaired immune responses [76].
Lifestyle factors such as diet, exercise, and cigarette smoking are known to heavily influence the epigenome [51,81]. Exercise affects chromatin dynamics and delays aging while smoking can alter DNA methylation patterns, change the chromatin structure and increase the risk of developing age-related diseases. Diet has also been shown to reverse global and specific DNA methylation in loci correlated with metabolic pathways [81]. Remodelling of genome-wide methylation patterns have been observed as a result of dietary restriction, which delays age-associated changes [82]. Moreover, DNA methylation profiles of calorie-restricted mice and monkeys have showed a lower epigenetic age compared to chronological age [83]. Interestingly, a consistent high nutrient intake appeared to induce age-related methylation in the liver. Some compound in fruits and vegetables has also been shown to have an effect on reducing neurodegenerative and cardiovascular disease as well as chronic inflammation, through mediation of epigenetic processes [74,81].
The CpG site in FTO, a gene associated to fat mass and obesity, has been found to have lower methylation levels in type 2 diabetes mellitus patients when compared to healthy individuals [84]. The insulin gene promoter was also identified to be more hypermethylated in type 2 diabetic patients and negatively correlated with insulin gene expression. In addition, hypermethylation was found to be positively correlated with HbA1c levels [85]. More recently, examination of age acceleration related changes to lifestyle factors including diet, exercise, and education. The study used IEAA and EEAA measures and concluded that a healthy diet, adequate exercise, and higher education were correlated with slower age acceleration, although these effects were not identified to be significant [51].

Epigenetic Therapies
The link between epigenetic processes, aging, and age-related diseases has introduced new therapeutic avenues of pursuit. Current epigenetic drugs are generally designed to interfere with histone-modifying enzymes, DNA methyltransferases, and the enzymes that interpret chromatin modifications [86]. DNA methyltransferase inhibitors and histone deacetyltransferase inhibitors are currently two classes of epigenetic drugs commercially available, with several other classes undergoing pre-clinical and clinical phase trials [87].
The mechanism of action of methylation inhibitors, the first class of epigenetic drugs developed, involves inhibition of DNMT1 functions. One example is 5-azacytidine, a cytosine analog that integrates into the DNA strand and forms an irreversible bond with DNMT1, preventing methylation of the daughter strand during replication [88]. A similar drug, zebularine, has been shown to reverse methylation-silencing of genes in mice [89]. More recently, DNA methylation inhibitors have recently been shown to mimic a viral defence pathway by upregulating interferon response in ovarian cancer cells [90].
Antisense oligonucleotides are another means of methylation inhibition. These oligonucleotides bind to the 3' untranslated region (3' UTR) of DNMT1 to inhibit the translation of the DNMT1 transcript and have been shown to induce the re-expression of a tumour suppressing gene [91]. In addition to targeting replicating DNA strands and the DNMT1 gene, these smallmolecule drugs target the active site of the DNMT1 enzyme, preventing its binding to the target gene [92].
Although the focus to date for many epigenetic drugs has been cancer, the uses of these drugs are currently being explored for other diseases, particularly the neurodegenerative disorders, Alzheimer's Disease and Parkinson's Disease [93,94]. Epigenetic drugs present several complex issues as their targets have diverse functions, which further complicate the drug development process [95]. However, new potential drug targets continue to be identified as differentially methylated regions between disease-affected groups and healthy controls are analysed as is occurring in Alzheimer's Disease and atherosclerosis [96]. Furthermore, studies such as these improve our understanding of these mechanisms in healthy human aging and human longevity.

Figure 2
Changes in DNA methylation patterns in a gene with age. DNA methylation patterns have been observed to change with physiological aging and in various agerelated diseases. Given an active gene at an early age, CpG sites within gene promoters are typically unmethylated, while gene body CpG sites are methylated. (A) With age, global hypomethylation is usually observed which has been associated with genome instability, hence further promoting aging processes. Lifestyle factors, particularly diet, has been shown to delay age-associated DNA methylation changes and activate DNMTs. (B) Hypermethylation in promoter regions are seen in genes associated with cancer, particularly tumour suppressors, and T-cell immune responses and differentiation. Epigenetic therapies, such as 5-azacytidine, can prevent tumorigenesis by integrating into the DNA sequence of tumour suppressor genes, binding to DNMT1, and preventing inappropriate gene silencing.

Conclusions and Future Directions
The mechanisms of DNA methylation are now well documented, including the enzymes responsible for its initiation, maintenance, and regulation. In light of this, focus has now turned to understanding the effects of DNA methylation on gene expression. Studies have shown that methylation of CpG sites in promoter regions are strongly linked to the suppression of transcription, and that methylation of CpG sites within gene bodies are positively correlated to transcription. It has also been observed that methylation of CpG islands is characteristic of some permanently repressed genes, but not all repressed genes have methylated CpG islands. Although these associations are robust and have been demonstrated in numerous studies, the intrinsic mechanisms of these effects are still unclear.
Studies of DNA methylation have included correlating levels at particular genomic loci with phenotypic traits. Recent advancements in array-based technologies have enhanced these studies on a genome wide basis via the use of EWAS. EWAS can utilise a single sample, case-control, longitudinal, twin design or can combine multiple approaches to achieve specific aims. The application of EWAS have been the key to realising the effects of methylation on gene expression, identification of relevant CpG sites to aging, changes to methylation with age, and associations between methylation and age-related diseases. However, current EWAS methods using large cell populations provide only an average of the changes occurring to the methylome. Therefore, progression of epigenomic studies toward single-cell methods to observe specific methylation changes will likely provide significant advances to epigenomic research on stem and cancer cells.
Current epigenetic predictors of age have stemmed from EWAS. Several models have been developed which use different sets of CpGs to determine an epigenetic age. This can then be compared to chronological age to determine any age acceleration. Currently, the Hannum and Horvath models are the most often used for these determinations due to their high correlation and low margins of error. While the Horvath model can be used for multiple cell and tissue types, the Hannum model is more robust when using blood samples. Many studies have concluded strong links between DNA methylation, age acceleration, and increased risk of disease and mortality. While robust, the biological mechanisms behind these associations remain unknown. Although the association between DNA methylation and age-related diseases have introduced new therapeutic avenues, a major challenge exists to overcome target specificity and is the current focus of investigation.