Abstract
DNA methylation (DNAm) patterns in peripheral blood have been shown to be associated with aging related health outcomes. We perform an epigenome-wide screening to identify CpGs related to frailty, defined by a frailty index (FI), in a large population-based cohort of older adults from Germany, the ESTHER study. Sixty-five CpGs are identified as frailty related methylation loci. Using LASSO regression, 20 CpGs are selected to derive a DNAm based algorithm for predicting frailty, the epigenetic frailty risk score (eFRS). The eFRS exhibits strong associations with frailty at baseline and after up to five-years of follow-up independently of established frailty risk factors. These associations are confirmed in another independent population-based cohort study, the KORA-Age study, conducted in older adults. In conclusion, we identify 65 CpGs as frailty-related loci, of which 20 CpGs are used to calculate the eFRS with predictive performance for frailty over long-term follow-up.
Similar content being viewed by others
Introduction
Frailty is defined as a state of elevated vulnerability to poor resolution of homeostasis1. It is frequently observed in older adult populations2,3 and increases the risk of developing negative health outcomes including falls, physical limitations, hospitalization, and mortality1,4,5. A growing body of research has reported that this syndrome results from a multidimensional interplay of genetic, biological, psychosocial, and environmental factors6,7,8.
Without a gold standard measurement for frailty, various models were proposed to measure and define frailty in the past years4,9,10,11, and they were based on questionnaires, performance measures, routine data, or a combination of any of these. Two principal instruments were widely accepted in clinical practice12, the performance and questionnaire-based approach suggested by Fried et al.4 and the “Frailty index (FI)-approach” suggested by Rockwood et al.10, which can use any kind of data as long as the included items are frailty-related and fulfill specific validated criteria. FI has been shown to be a valid predictor of morbidity and mortality in many different population-based cohorts13,14.
Epigenetic modifications have been recognized to play a major role in aging15,16 and aging-related conditions, such as frailty17,18. Recently, various DNA methylation (DNAm)-based aging algorithms have been developed19,20,21,22,23 and have also been shown to be associated with frailty24,25. However, none of these algorithms was specifically derived for predicting frailty.
Here, we followed a previously proposed three-phase procedure21 to derive and validate DNAm signature-based frailty risk score in a large population-based cohort study of older adults. Replication was performed in another independent population-based cohort. Frailty was defined by a frailty index (FI) based on the concept of deficit accumulation10,26.
Results
Characteristics of the study populations
Table 1 shows the baseline characteristics of the participants. In ESTHER, the distributions of age, sex, body mass index, smoking status, and alcohol consumption were similar among the three subsets (all P-values > 0.05). The mean age was approximately 62 years, and a slight majority of participants were women. The majority of participants were overweight or obese (approximately 7 out of 10). Approximately half of them had never smoked, and approximately one out of six participants was still smoking at the time of enrollment. In KORA-age, the mean age was about 76 years and half of the participants were females. The education level (27% with ≥12 years of school education), body mass index levels (80% overweight or obese), and the level of alcohol consumption were higher than those in ESTHER.
Table 2 presents the distribution of FI at baseline and various follow-ups. FI and the proportion of frail participants increased with follow-up in ESTHER and KORA-age. In KORA-age, the levels of FI were higher due to the inclusion of older participants.
Identification of frailty-related CpGs
In the discovery phase, conducted in subset I, 2220 CpGs passed the genome-wide significance threshold (FDR < 0.05). 65 CpGs located at 47 genes across 21 chromosomes were successfully replicated in the validation phase in subset II and were deemed as frailty-related methylation loci. When more comprehensively adjusted for further potential confounders using model 2, 53 CpGs were associated with FI, of which 43 CpGs were inversely associated with FI, with a decrease in FI (95% CI) per one standard deviation (SD) increase in methylation ranging from 0.92% units (0.06–1.77) to 3.07% units (1.05–5.08) (Supplementary Data 1).
Screening the literature for CpGs that were previously reported to be associated with frailty identified 15 CpGs from three studies25,27,28. Eight of them showed statistically significant associations with frailty in subset III (Supplementary Table 1).
Construction of eFRS
Using LASSO regression, the number of CpGs was further reduced from 65 to 20 because many CpGs were correlated with each other. The eFRS was constructed with these 20 selected CpGs using the equation: \({{{{\rm{e}}}}{FRS}}=0.204-0.209\times{{{\rm{cg}}}}00921350-0.100\times {{{\rm{cg}}}}01234420-0.016\times {{{\rm{cg}}}}02867102-0.293\times {{{\rm{cg}}}}03725309-0.146\times {{{\rm{cg}}}}04955914-0.084\times {{{\rm{cg}}}}07312601+0.158\times {{{\rm{cg}}}}07349348+0.137\times {{{\rm{cg}}}}08463758+0.248\times {{{\rm{cg}}}}10408430-0.101 \\ \times {{{\rm{cg}}}}11700584-0.049\times {{{\rm{cg}}}}12510708+0.064\times {{{\rm{cg}}}}13570972-0.057 \times {{{\rm{cg}}}}15058210-0.180\times {{{\rm{cg}}}}15380836-0.144\\ \times {{{\rm{cg}}}}17860366+0.315\times {{{\rm{cg}}}}17971578-0.075\times {{{\rm{cg}}}}18791730-0.176\times {{{\rm{cg}}}}19267254-0.025\times 21656937-0.077{{{\rm{cg}}}}\times {{{\rm{cg}}}}23458887\).
Functional annotation of sets of CpGs
We performed a literature search in PubMed to obtain information on the genes that contain the 65 frailty-related CpGs (Supplementary Data 2). The 20 CpGs included in eFRS is annotated to 17 Genes. These genes are involved in various frailty-related outcomes, including different types of cancer (i.e., HDAC4, CASP9, NFE2L3, RILP, STK40, HAO2, SNX20, MRTO4, EMILIN3, P4HA3), cardiovascular disease (i.e., HDAC4, CASP9, SARS), diabetes mellitus (i.e., RPL36AL, SARS), and Alzheimer’s Disease (i.e., RPL36AL).
Supplementary Fig. 1 presents pathway enrichment and PPI network analysis of target genes of frailty-related CpGs. The enrichment heatmap (Supplementary Fig. 1A) shows that pathways of these genes include cellular macromolecule biosynthetic process, non-small cell lung cancer, viral carcinogenesis, export from the cell, and viral process. Supplementary Fig. 1B shows the relationships between these enriched terms, where each node symbolized an enriched term and a similarity > 0.3 are connected by edges. With the application of MCODE algorithm29, three modules (regulation of kinase activity, positive regulation of kinase activity, and positive regulation of transferase activity) in the PPI network were seen (Supplementary Fig. 1C).
Supplementary Data 3 shows the results of the mQTL analysis on the CpGs included in eFRS. Altogether, we identified 3, 3, and 55 mQTLs where genetic variation was significantly associated (P < 1 × 10−7) with the loci cg02867102, cg07312601, and cg11700584, respectively.
Association of eFRS with FI at baseline and each follow-up
Figure 1 shows the correlation matrix of age, eFRS and FI at baseline and various follow-up times in the two validation subsets. In ESTHER, eFRS was more strongly related to chronological age than FI at baseline (Spearman correlation coefficients, rSp, 0.443 and 0.267, respectively), but the correlation of FI with age increased with increasing length of follow-up (up to rSp = 0.397 at 11-year follow-up). Correlation coefficients between eFRS at baseline and FI at baseline and the various follow-up times were all in the range of 0.2–0.3, and correlation coefficients between the FI at various points of time were all in the range of 0.6–0.9. Essentially very similar correlations between eFRS and FI at baseline and the various follow-ups were seen with Pearson’s correlation coefficients (rSp similarly ranged from 0.2 to 0.3). Approximately 6%, 9%, 9%, 9%, and 5% variation of frailty of at baseline, 2-, 5-, 8-, and 11-year follow-up can be explained by eFRS. Furthermore, we observed a significant correlation of eFRS with AccAgeGrim (rSp = 0.566, P < 0.001). Replicated analyses in KORA-age showed consistent patterns and similar correlations. eFRS showed a slightly stronger correlation with age (rSp = 0.427) than FI at baseline (rSp = 0.411). Correlation coefficients between eFRS and FI at baseline and the two follow-ups similarly ranged between 0.2 and 0.3. About 6%, 7%, and 7% variation of frailty at baseline, 4-, and 8-year follow-up can be explained by the eFRS. All correlations in ESTHER and KORA-age were highly statistically significant (P < 0.001).
Supplementary Table 2 provides covariate-adjusted associations of eFRS at baseline with FI at baseline and subsequent follow-up rounds in the two validation subsets. In ESTHER, a one SD increase in eFRS was associated with an increase in the FI by approximately 2 percent units (range from 1.55 to 2.16 percent units). The associations were fairly stable across follow-up rounds and various adjustment levels, even though they did no longer reach statistical significance for FI measured at the 11-year follow-up, given the smaller number of participants who were still included in this follow-up round. In KORA-age, very similar, albeit slightly weaker associations were observed between eFRS and FI at baseline and subsequent two follow-up times (a one SD increase in eFRS was associated with 1.27-1.97 percent unit increments of FI).
Supplementary Fig. 2 illustrates multivariable-adjusted ORs (95% CIs) for the association of FRS with being pre-frail or frail at baseline and each follow-up in the two validation subsets. In ESTHER, the associations of pre-frailty/frailty with highest (vs. lowest) quartiles of eFRS were statistically significant at 2-year follow-up and 5-year follow-up with ORs 3.53 (95% CI = 1.66–7.52) and 2.86 (95% CI = 1.18–6.92), respectively. Similarly, when assessing ORs as per SD of eFRS, eFRS were strongly associated with being pre-frail or frail at baseline (OR = 1.38, 95% CI = 1.05–1.82), 2-year follow-up (OR = 1.67, 95% CI = 1.26–2.23) and 5-year follow-up (OR = 1.38, 95% CI = 1.01–1.87). Patterns were less consistent and associations were not statistically significant with pre-frailty/frailty status at 8- and 11-year follow-up. In KORA-age, strong associations of eFRS with being pre-frail or frail at baseline were seen with multivariable-adjusted ORs for eFRS quartiles 3 and 4 compared to quartile 1 as 1.99 (95% CI = 1.23–3.21) and 2.77 (95% CI = 1.61–4.75), respectively. A one SD increase of eFRS was significantly associated with a 40% increased odds of being pre-frail or frail at baseline. When restricting the analysis to those who were non-frail at baseline, similar associations were seen between eFRS at baseline and risk of being pre-frail or frail at multiple follow-ups in both ESTHER and KORA-age (Supplementary Fig. 3).
Associations of eFRS with being frail at baseline and various follow-up times in the two validation subsets are shown in Fig. 2. In ESTHER, participants in eFRS quartile 4 were at strongly increased risk of being frail at baseline (OR = 7.98, 95% CI = 2.27–28.07), 2-year follow-up (OR = 2.93, 95% CI = 1.13–7.57), and 5-year follow-up (OR = 2.92, 95% CI = 1.12–7.67), compared to participants with eFRS in quartile 1. Multivariable adjusted ORs (95% CIs) of being frail at baseline, 2-year follow-up, 5-year follow-up were 1.94 (1.31–2.89), 1.64 (1.15–2.35), and 1.48 (1.07–2.04) per one SD increase of eFRS, respectively. In KORA-age, the associations with being frail were generally weaker but statistically significant associations were still observed between eFRS and frailty at 4-year follow-up. Similar associations of eFRS with being frail at follow-ups were seen in both ESTHER and KORA-age when the analysis was restricted to participants who were non-frail or pre-frail at baseline (Fig. 3).
We further conducted sensitivity analyses with models that adjusted for smoking status using the Maas 13-CpGs model rather than self-reported smoking status in ESTHER. The positive associations were highly consistent between both types of models (Supplementary Table 3).
Predictive performance of eFRS for frailty at baseline and follow-ups
Table 3 displays the individual and joint predictive performance of age, sex, and eFRS for being frail in the two validation subsets. In ESTHER, eFRS presented comparable predictive performances at baseline and follow-ups with the combination of age and sex. When adding FRS to models including age and sex, the predictive performance for prediction of FI at baseline and 5-year follow-up was significantly improved (from 0.629 to 0.711 at baseline and from 0.650 to 0.680 at 5-year follow-up). In the substantially older cohort of KORA-age, the predictive performance of age and sex were generally much higher and adding eFRS to models including age and sex only slightly increased predictive performance.
Association of AccAgeGrim with FI at baseline and each follow-up
AccAgeGrim showed similar associations with FI at baseline and subsequent follow-up times in ESTHER as eFRS (a one SD increase in AccAgeGrim was associated with 1.53–2.17% unit increments of FI, Supplementary Table 4). Similar associations of AccAgeGrim with being pre-frail or frail were also seen at baseline, 2-year follow-up, and 5-year follow-up (Supplementary Fig. 4). However, a significant relationship between AccAgeGrim with being frail was only seen at 5-year follow-up (Supplementary Fig. 5).
Discussion
In this large-scale EWAS conducted in a population-based cohort of older adults, we identified 65 frailty-related CpGs located at 47 genes across 21 chromosomes based on DNA from whole blood, 20 of which were selected to construct the eFRS. To our knowledge, this is the first EWAS-derived eFRS, and it was found to be significantly associated with frailty at multiple points of time during long-term follow-up. These findings were validated in samples that did not overlap with samples from which the eFRS was derived and were also confirmed in an independent cohort, which demonstrated the ability of the eFRS to predict both prevalence and longer-term incidence of frailty.
To our knowledge, few studies have specifically assessed the relationship between DNAm patterns and frailty17,18. Only one previous EWAS on frailty defined according to the Fried criteria among 70-year-old people has been conducted and identified one CpG (cg18314882 on chromosome 8 in the MAF1 gene)25. However, the single CpG is not among the CpGs identified in our study. The differences in identified CpGs within corresponding gene clusters between the two studies might be due to the different measurements for frailty. Bellizzi et al. 17 defined frailty status using cluster analysis and reported that global DNA methylation was lower in frail individuals than in the non-frail participants. This is in line with our study, which observed that 43 out of 53 CpGs were inversely associated with FI in an adjusted model. Another study by Collerton et al. 18 using the Fried frailty definition in a cohort with 85 years old participants found that the genome-wide methylation was not associated with frailty status. Recently, several DNAm-based algorithms, i.e. Hannum’s blood-specific clock19 and GrimAge23, have been derived to estimate ‘epigenetic age’ and were suggested to closely correlate with frailty24,25,30.
The identified frailty-related CpGs highlight several genes or genetic regions and might promote further investigation of the biological mechanism of frailty. Glycolytic glyceraldehyde-3-phosphate dehydrogenase (GAPDH) contains two of the frailty-related CpGs (cg00252813 and cg02519286) and has been found to be likely related to the pathogeneses of amyotrophic lateral sclerosis and Huntington’s disease31,32, various forms of cancers33, and neurodegenerative disorders33. Three CpGs (cg01406381, cg21766592, and cg25607249) are located at Solute Carrier Family 1 Member 5 (SLC1A5), a high-affinity l-glutamine transporter that was highly expressed in several cancer types34,35,36. The remaining CpGs and correspondent genes, such as SARS, SCRN1, PIK3CD, RUNX1, NCAPH, HDAC4, and VAC14, were also observed to be associated with multiple diseases. The roles of the identified CpGs in the development and/or progression of diseases may explain the association of eFRS with frailty.
We derived a DNAm-based algorithm for predicting frailty, measured by the FI, and this eFRS showed robust predictive performance for frailty in a comprehensive validation chain. Of the 20 CpGs included in the eFRS, 8 CpGs map to intergenic regions with unknown function, and the other 12 CpGs are annotated to genes involved in common chronic diseases, including coronary artery disease, stroke, type 2 diabetes mellitus, and multiple types of cancers37,38,39,40. Frailty has been shown to be strongly associated with a broad range of adverse health outcomes, such as worsening mobility4, falls38, fracture41, and mortality42. The shared linkage with morbidity may therefore explain the association between eFRS and frailty. Moreover, the associations of 12 CpGs and common chronic diseases also support the potential capacity of eFRS for predicting adverse health outcomes and even fatal outcomes. Given that frailty may potentially be preventable up to some possible point of no return, the potential predictive performance of eFRS might be helpful for designing, implementing, and evaluating interventions aimed to prevent or slow down the development of frailty.
Several accurate composite algorithms of chronological age or lifespan have been built based on DNAm, including the Mortality Risk Score (MRscore) by Zhang et al.21, PhenoAge by Levine et al.22, and GrimAge by Lu et al.23. These DNAm-based algorithms were demonstrated to be robust predictors of mortality, lifespan, and healthspan21,22,23. In our study, a high correlation between eFRS and GrimAge was observed. Such correlation is not unexpected given that GrimAge includes DNAm-based surrogate biomarkers for health-related plasma proteins and smoking pack-years23 and the majority of the CpGs used to construct eFRS are annotated to health-related conditions. When comparing CpGs included in eFRS and these aging algorithms, three overlapped with GrimAge (cg02867102, cg07312601, and cg11700584). Moreover, cg02867102 and cg11700584 were significant signals in previous EWASs on smoking and aging43,44,45, which points to the major role of smoking in adverse health outcomes in old age including frailty.
The current study has several strengths including the large-scale random samples from the general population, the long-term follow-up with repeated measurements of frailty at multiple points of time during follow-up, and replication in a completely independent cohort. However, it is also necessary to consider the limitations of the present study when interpreting the results. First, the deficits for the measurement of FI were self-reported which might have led to potential reporting bias. However, in the ESTHER cohort, self-reported diseases which account for a large share of the self-reported deficits were found to be in high agreement with medical records in careful validation steps carried out at each follow-up. Second, the recruitment of the participants was conducted during a voluntary health check-up. Therefore, the participants of the ESTHER cohort might not be a fully representative sample of the general population. Nevertheless, the prevalence of risk factors and chronic diseases has been found to be comparable to those observed in the corresponding age range in a representative health survey from Germany which took place at the same time as ESTHER baseline recruitment did46. Third, in the discovery phase, only the CpGs included in both the 450K and EPIC array were included in the EWAS. Potential frailty-related CpGs exclusively covered by the EPIC array might have been missed.
In conclusion, in this EWAS on frailty conducted in a large population-based cohort, we identified 65 frailty-related CpGs and derived an epigenetic algorithm of frailty based on 20 CpGs. The DNAm-based eFRS was demonstrated to be strongly associated with long-term frailty and validated both internally in an independent subset and in an external population-based cohort. Further studies should investigate the associations of eFRS with additional health outcomes and its potential use for earlier detection of frailty risk and designing, monitoring, and evaluating prevention measures.
Methods
Study population and study design
The epigenome-wide association study (EWAS), including derivation and internal validation, is based on the ESTHER study, an ongoing prospective, population-based cohort study of older adults conducted in the federal state of Saarland, Germany. Details of the study design and population have been reported previously21,47. Briefly, men and women aged 50–75 undergoing a general health check-up in Saarland, a small federal state in southwestern Germany, from 2000 to 2002 were eligible for participation. At the time of recruitment, these general health exams were routinely offered every 2 years to people aged 35 years and older by their general practitioners (GPs). Overall, 9940 adults aged 50–75 years were recruited by their GPs and were followed by participant and GP questionnaires after 2, 5, 8, and 11 years During baseline enrollment and each follow-up, standardized questionnaires for participants and their GPs were used to collect extensive basic data on sociodemographic characteristics, risk factors, lifestyle factors, and medical history. Moreover, whole blood samples were collected at baseline from which DNA was extracted. The ESTHER cohort was found to be representative of the older German population with respect to major sociodemographic, lifestyle, and medical characteristics46.
Three subsets were randomly selected in three different rounds of methylation analysis from the ESTHER cohort for epigenome-wide DNA methylation data measurements. Subsets I and II included 998 and 741 randomly selected subjects for whom DNAm measurements were performed in August 2018 and July 2019 for various projects47,48. Subset III has a nested case-control design for mortality-related methylation signatures and 548 participants were randomly selected as the subcohort irrespective of death status21. Subset I was used as a discovery panel in the epigenome-wide screening for CpG sites related to frailty at baseline. Subset II was utilized as the first internal validation panel to further select CpG sites to construct a methylation-based epigenetic frailty risk score (eFRS). The associations of eFRS with FI at baseline and during 11 years of follow-up were further validated in subset III.
Replication in an independent cohort was performed in the KORA-Age study, a population-based cohort study conducted in the region of Augsburg, Southern Germany, whose study population and design have been described in detail previously49. In 2008 and 2009, 4123 participants aged ≥65 years from four population representative surveys conducted between 1984 and 2001 were enrolled and completed the baseline assessments. Then, an age- and sex-stratified random sub-sample (n = 1079) additionally completed medical examinations and was followed up in 2012 and 2016. Methylation data were available for 1010 participants from this random sub-samples (subset IV in the current analysis). The baseline, 4- and 8-year follow-up data of these 1010 participants were used to validate the associations of eFRS with FI in an independent cohort.
The ESTHER study was approved by the ethics committees of the medical faculty of Heidelberg University and of the medical board of the state of Saarland. The KORA-Age study was approved by the Ethics Committee of the Bavarian Medical Association (EK No. 08064). All ESTHER and KORA-Age participants provided written informed consent.
DNA methylation assessment
DNAm profiles of subsets I and II from ESTHER, and subset IV from KORA-Age were assessed with the Infinium Methylation EPIC BeadChip kit (EPIC, Illumina, Inc., San Diego, CA, USA), and DNAm profiles of subset III from ESTHER were determined with the earlier introduced Infinium Human Methylation450K BeadChip Assay (450K, Illumina, Inc., San Diego, CA, USA). Details of the methylation analysis in the ESTHER study have been reported previously48. Genome-wide DNAm profiling was conducted by the Genomics and Proteomics Core Facility of the German Cancer Research Center according to the manufacturer’s protocol. In data pre-processing, signals of probes with detection P-value > 0.01, missing values >10%, and probes targeting the X and Y chromosomes were excluded. Only the CpGs that are covered by both the 450K array and EPIC array were included in the EWAS. In the KORA-Age study, data quality control and pre-processing were conducted following the CPACOR pipeline50. Probes with detection P-value > 0.01 and missing values >5% were removed. Quantile normalization was then performed following a coherent approach as described by Lehne et al.50. In addition, leukocyte composition was estimated in ESTHER and KORA-Age using the algorithms of Houseman et al.51 for adjustment.
Frailty assessment
Following a standard procedure26,52, frailty in ESTHER and KORA-Age was assessed using a frailty index (FI), which is defined as the proportion of presented deficits of a predefined list of all deficits. As previously described, 31 and 33 deficits were selected for the assessment of FI in ESTHER48 and KORA-Age53, respectively. The lists of deficits used to define the FI in ESTHER and KORA-Age are presented in Supplementary Tables 5 and 6, respectively. Distributions of FI and proportions of the status of deficits included in the FI calculation at baseline and each follow-up in ESTHER are shown in Supplementary Fig. 6 and Supplementary Data 4, respectively. With reference to previous studies13,54, participants were deemed frail if their FI was ≥0.250, pre-frail if their FI was >0.100 and <0.250, and non-frail if their FI was ≤0.100.
Three-phase procedure to construct eFRS
The DNAm-based eFRS was developed using a three-phase process. A flowchart of the three-phase procedure is presented in Fig. 4. In the discovery phase, an epigenome-wide screening for frailty-related CpGs was carried out in subset I with baseline FI as a dependent variable using linear mixed regression models. The linear mixed regression models included methylation β-values as explanatory variables and adjustment for leukocyte composition as a fixed effect and batch as a random effect. After correcting for multiple testing using the Benjamini–Hochberg method55, CpGs that reached genome-wide significance [false discovery rate (FDR) < 0.05] were validated in subset II. Similar to the discovery phase, linear mixed regression models were conducted with baseline FI as the dependent variable and additionally adjusted for age and sex. Again, CpGs with FDRs <0.05 were selected and were deemed as frailty-related loci. Then, we applied LASSO regression with a regularization parameter chosen by ten-fold cross-validation following the ‘one standard error’ rule to select candidates among identified CpGs and construct eFRS.
Functional annotation of sets of CpGs
Frailty-related CpGs were annotated to genes with the information provided in the Illumina manifest file (http://emea.support.illumina.com/array/array_kits/infinium-methylationepic-beadchip-kit/downloads.html#), which is based on the University of California Santa Cruz (UCSC) and RefGene. To analyze the underlying roles of these genes, we used the Metascape online tool (https://metascape.org)56 to perform Gene Ontology (GO) analysis, the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis, and protein-protein interaction (PPI) network. Kappa scores were used as the similarity metric when performing hierarchical clustering on the enriched terms and sub-trees with a similarity of >0.3 were considered a cluster. In order to link genetic variants to variations of CpGs included in eFRS, methylation quantitative trait loci (mQTL) analysis was applied for the preliminary association analysis of single nucleotide polymorphism (SNP) sites with CpG sites. SNP-DNAm site pairs with a maximum distance of 1 Mb were tested. The analysis of mQTL was performed using the online tool mQTLdb (http://www.mqtldb.org)57.
Statistical analysis
Linear mixed regression and LASSO regression were performed to identify frailty-related CpGs and the eFRS in subsets I and II as aforementioned. The correlations of chronological age, eFRS, and FI at baseline and various follow-ups were assessed using Spearman correlation coefficients in subset III and subset IV. We also evaluated the correlation of eFRS and DNAm-based algorithms of aging in subset III, the AccAgeGrim (GrimAge age acceleration)23. The associations of eFRS with FI at baseline and various follow-ups were assessed in the two validation subsets by two linear mixed regression models that included age, sex, and leukocyte proportions as fixed effects, and batch as random effects (model 1). In further analyses, smoking status (never smoker, former smoker, current smoker) and alcohol consumption (grams per day) were additionally included as fixed effects (model 2). By categorizing baseline and various follow-up frailty statuses into two groups (non-frail versus pre-frail and frail; and non-frail and pre-frail versus frail), the associations were also estimated using a logistic regression model adjusting for all variables in model 2. In addition to logistic regression models including eFRS as a continuous variable, we also categorized eFRS according to quartiles and run logistic regression models including eFRS as a categorical variable. FDR was also applied for multiple comparisons among results at baseline and various follow-ups and P < 0.05 was considered statistical significance after multiple testing.
To assess potential additional variance from passive smoking, we conducted sensitivity analyses with additional logistic regression models that controlled for smoking status using a DNAm-based proxy (the Maas 13-CpGs model)58 rather than self-reported smoking status. Next, we assessed the association of AccAgeGrim with FI at baseline and various follow-ups.
Furthermore, we conducted subgroup analyses for the associations of the eFRS with frailty in which only non-frail participants at baseline were included for the outcome being pre-frail or frail, and only non-frail or pre-frail participants at baseline were included for the outcome being frail using model 2 for adjustment as described above.
We also systematically screened PubMed for previously reported CpGs associated with frailty and assessed the associations of these CpGs with frailty by linear-mixed regression models using model 2 for adjustment as described above.
The LASSO regression analyses were conducted using R programming (R Foundation of Statistical Computing, Vienna, Austria, version 4.0.1) package ‘glmnet (version 4.1-4)’59. All the other statistical analyses in the ESTHER study were carried out in SAS 9.4 (SAS Institute, Cary, NC) and the analyses in the KORA-age study were conducted in R (version 4.0.1).
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
All relevant data supporting the key findings of this study are available within the article and its supplementary information files. Due to ethical and legal restrictions, individual-level data of the two cohorts (ESTHER and KORA-age) cannot be made publicly available. Data are available upon request to X.Li (li.xiangwei@foxmail.com) and are subject to local rules and regulations. This includes submitting a proposal to the management team, where upon approval, analysis needs to be done on a local server with protected access, complying with General Data Protection Regulation. Requests will be responded to in 60 working days. Annotations of genes of frailty-related CpGs were downloaded from http://emea.support.illumina.com/array/array_kits/infinium-methylationepic-beadchip-kit/downloads.html#. The underlying roles of these genes are available at https://metascape.org56. Results of mQTL were downloaded from the online tool mQTLdb (http://www.mqtldb.org)57. The original files generated from the three websites have been deposited at https://figshare.com/articles/dataset/mQTLdb_Metascape_Annotations/20468985.
Code availability
SAS codes for statistical analysis are available upon request.
References
Xue, Q. L. The frailty syndrome: definition and natural history. Clin. Geriatr. Med. 27, 1–15 (2011).
Santos-Eggimann, B., Cuenoud, P., Spagnoli, J. & Junod, J. Prevalence of frailty in middle-aged and older community-dwelling Europeans living in 10 countries. J. Gerontol. A Biol. Sci. Med. Sci. 64, 675–681 (2009).
Gale, C. R., Cooper, C. & Sayer, A. A. Prevalence of frailty and disability: findings from the English Longitudinal Study of Ageing. Age Ageing 44, 162–165 (2015).
Fried, L. P. et al. Frailty in older adults: evidence for a phenotype. J. Gerontol. A Biol. Sci. Med. Sci. 56, M146–M156 (2001).
Strandberg, T. E. & Pitkala, K. H. Frailty in elderly people. Lancet 369, 1328–1329 (2007).
Bergman, H. et al. Frailty: an emerging research and clinical paradigm—issues and controversies. J. Gerontol. A Biol. Sci. Med. Sci. 62, 731–737 (2007).
Fulop, T. et al. Aging, frailty and age-related diseases. Biogerontology 11, 547–563 (2010).
Landi, F. et al. Disability, more than multimorbidity, was predictive of mortality among older persons aged 80 years and older. J. Clin. Epidemiol. 63, 752–759 (2010).
Lachs, M. S. et al. A simple procedure for general screening for functional disability in elderly patients. Ann. Intern. Med. 112, 699–706 (1990).
Rockwood, K. & Mitnitski, A. Frailty in relation to the accumulation of deficits. J. Gerontol. A Biol. Sci. Med. Sci. 62, 722–727 (2007).
Gobbens, R. J., van Assen, M. A., Luijkx, K. G., Wijnen-Sponselee, M. T. & Schols, J. M. The Tilburg Frailty Indicator: psychometric properties. J. Am. Med. Dir. Assoc. 11, 344–355 (2010).
Clegg, A., Young, J., Iliffe, S., Rikkert, M. O. & Rockwood, K. Frailty in elderly people. Lancet 381, 752–762 (2013).
Fan, J. et al. Frailty index and all-cause and cause-specific mortality in Chinese adults: a prospective cohort study. Lancet Public Health 5, e650–e660 (2020).
Kundi, H., Coskun, N. & Yesiltepe, M. Association of entirely claims-based frailty indices with long-term outcomes in patients with acute myocardial infarction, heart failure, or pneumonia: a nationwide cohort study in Turkey. Lancet Reg. Health Eur. 10, 100183 (2021).
Calvanese, V., Lara, E., Kahn, A. & Fraga, M. F. The role of epigenetics in aging and age-related diseases. Ageing Res. Rev. 8, 268–276 (2009).
Talens, R. P. et al. Epigenetic variation during the adult lifespan: cross-sectional and longitudinal data on monozygotic twin pairs. Aging Cell 11, 694–703 (2012).
Bellizzi, D., D’Aquila, P., Giordano, M., Montesanto, A. & Passarino, G. Global DNA methylation levels are modulated by mitochondrial DNA variants. Epigenomics 4, 17–27 (2012).
Collerton, J. et al. Acquisition of aberrant DNA methylation is associated with frailty in the very old: findings from the Newcastle 85+ Study. Biogerontology 15, 317–328 (2014).
Hannum, G. et al. Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol. Cell 49, 359–367 (2013).
Horvath, S. DNA methylation age of human tissues and cell types. Genome Biol. 14, R115 (2013).
Zhang, Y. et al. DNA methylation signatures in peripheral blood strongly predict all-cause mortality. Nat. Commun. 8, 14617 (2017).
Levine, M. E. et al. An epigenetic biomarker of aging for lifespan and healthspan. Aging (Albany, NY) 10, 573–591 (2018).
Lu, A. T. et al. DNA methylation GrimAge strongly predicts lifespan and healthspan. Aging (Albany, NY) 11, 303–327 (2019).
Breitling, L. P. et al. Frailty is associated with the epigenetic clock but not with telomere length in a German cohort. Clin. Epigenet. 8, 21 (2016).
Gale, C. R., Marioni, R. E., Harris, S. E., Starr, J. M. & Deary, I. J. DNA methylation and the epigenetic clock in relation to physical frailty in older people: the Lothian Birth Cohort 1936. Clin. Epigenet. 10, 101 (2018).
Mitnitski, A. B., Mogilner, A. J. & Rockwood, K. Accumulation of deficits as a proxy measure of aging. Sci. World J. 1, 323–336 (2001).
Gao, X. et al. Tobacco smoking and smoking-related DNA methylation are associated with the development of frailty among older adults. Epigenetics 12, 149–156 (2017).
Kim, S. et al. DNA methylation associated with healthy aging of elderly twins. Geroscience 40, 469–484 (2018).
Bader, G. D. & Hogue, C. W. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinforma. 4, 2 (2003).
McCrory, C. et al. GrimAge outperforms other epigenetic clocks in the prediction of age-related clinical phenotypes and all-cause mortality. J. Gerontol. A Biol. Sci. Med. Sci. https://doi.org/10.1093/gerona/glaa286 (2020).
Pierce, A. et al. GAPDH is conformationally and functionally altered in association with oxidative stress in mouse models of amyotrophic lateral sclerosis. J. Mol. Biol. 382, 1195–1210 (2008).
Lazarev, V. F. et al. GAPDH binders as potential drugs for the therapy of polyglutamine diseases: design of a new screening assay. FEBS Lett. 589, 581–587 (2015).
Butera, G. et al. Regulation of autophagy by nuclear GAPDH and its aggregates in cancer and neurodegenerative disorders. Int. J. Mol. Sci. 20, https://doi.org/10.3390/ijms20092062 (2019).
Larriba, S. et al. ATB(0)/SLC1A5 gene. Fine localisation and exclusion of association with the intestinal phenotype of cystic fibrosis. Eur. J. Hum. Genet. 9, 860–866 (2001).
Hassanein, M. et al. SLC1A5 mediates glutamine transport required for lung cancer cell growth and survival. Clin. Cancer Res. 19, 560–570 (2013).
Huang, F. et al. Upregulated SLC1A5 promotes cell growth and survival in colorectal cancer. Int. J. Clin. Exp. Pathol. 7, 6006–6014 (2014).
Shibata, H. et al. Genome-wide association study of schizophrenia using microsatellite markers in the Japanese population. Psychiatr. Genet. 23, 117–123 (2013).
Kojima, G. Frailty as a predictor of future falls among community-dwelling older people: a systematic review and meta-analysis. J. Am. Med. Dir. Assoc. 16, 1027–1033 (2015).
Steenaard, R. V. et al. Tobacco smoking is associated with methylation of genes related to coronary artery disease. Clin. Epigenet. 7, 54 (2015).
Wang, Z. et al. Low mutation and neoantigen burden and fewer effector tumor infiltrating lymphocytes correlate with breast cancer metastasization to lymph nodes. Sci. Rep. 9, 253 (2019).
Johnson, R. L. et al. Frailty index is associated with periprosthetic fracture and mortality after total knee arthroplasty. Orthopedics 42, 335–343 (2019).
Kojima, G., Iliffe, S. & Walters, K. Frailty index as a predictor of mortality: a systematic review and meta-analysis. Age Ageing 47, 193–200 (2018).
Gao, X., Jia, M., Zhang, Y., Breitling, L. P. & Brenner, H. DNA methylation changes of whole blood cells in response to active smoking exposure in adults: a systematic review of DNA methylation studies. Clin. Epigenet. 7, 113 (2015).
Joehanes, R. et al. Epigenetic signatures of cigarette smoking. Circ. Cardiovasc. Genet. 9, 436–447 (2016).
Demanelis, K. et al. Cadmium exposure and age-associated DNA methylation changes in non-smoking women from northern Thailand. Environ. Epigenet. 3, dvx006 (2017).
Low, M. et al. [Epidemiological investigations of the chances of preventing, recognizing early and optimally treating chronic diseases in an elderly population (ESTHER study)]. Dtsch. Med. Wochenschr. 129, 2643–2647 (2004).
Gao, X. et al. The associations of DNA methylation alterations in oxidative stress-related genes with cancer incidence and mortality outcomes: a population-based cohort study. Clin. Epigenet. 11, 14 (2019).
Li, X. et al. Comparative validation of three DNA methylation algorithms of ageing and a frailty index in relation to mortality: results from the ESTHER cohort study. EBioMedicine 74, 103686 (2021).
Peters, A. et al. Multimorbidity and successful aging: the population-based KORA-Age study. Z. Gerontol. Geriatr. 44(Suppl. 2), 41–54 (2011).
Lehne, B. et al. A coherent approach for analysis of the Illumina HumanMethylation450 BeadChip improves data quality and performance in epigenome-wide association studies. Genome Biol. 16, 37 (2015).
Houseman, E. A. et al. DNA methylation arrays as surrogate measures of cell mixture distribution. BMC Bioinforma. 13, 86 (2012).
Searle, S. D., Mitnitski, A., Gahbauer, E. A., Gill, T. M. & Rockwood, K. A standard procedure for creating a frailty index. BMC Geriatr. 8, 24 (2008).
Stephan, A. J. et al. The times we are born into and our lifestyle choices determine our health trajectories in older age—results from the KORA-Age study. Prev. Med. 133, 106025 (2020).
Blodgett, J., Theou, O., Kirkland, S., Andreou, P. & Rockwood, K. Frailty in NHANES: comparing the frailty index and phenotype. Arch. Gerontol. Geriatr. 60, 464–470 (2015).
Ferreira, J. A. The Benjamini–Hochberg method in the case of discrete test statistics. Int. J. Biostat. 3, Article 11, https://doi.org/10.2202/1557-4679.1065 (2007).
Zhou, Y. et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10, 1523 (2019).
Gaunt, T. R. et al. Systematic identification of genetic influences on methylation across the human life course. Genome Biol. 17, 61 (2016).
Maas, S. C. E. et al. Validated inference of smoking habits from blood with a finite DNA methylation marker set. Eur. J. Epidemiol. 34, 1055–1074 (2019).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Acknowledgements
The ESTHER study was funded by grants from the Baden-Württemberg state Ministry of Science, Research and Arts (Stuttgart, Germany), the Federal Ministry of Education and Research (Berlin, Germany), the Federal Ministry of Family Affairs, Senior Citizens, Women and Youth (Berlin, Germany), and the Saarland State Ministry of Health, Social Affairs, Women and the Family (Saarbrücken, Germany). The work of Xiangwei Li was supported by a grant from Fondazione Cariplo (Bando Ricerca Malattie invecchiamento, #2017-0653). The KORA study was initiated and financed by the Helmholtz Zentrum München—German Research Center for Environmental Health, which is funded by the German Federal Ministry of Education and Research (BMBF) and by the State of Bavaria. The KORA-Age project was financed by the German Federal Ministry of Education and Research (BMBF FKZ 01ET0713 and 01ET1003A) as part of the ‘Health in old age’ program. Furthermore, KORA research was supported within the Munich Center of Health Sciences (MC-Health), Ludwig-Maximilians-Universität, as part of LMUinnovativ. The authors thank the study participants and their general practitioners as well as laboratory and administrative staff of the ESTHER study team. The authors gratefully acknowledge the contributions of DKFZ Genomics and Proteomics Core Facility in the processing of DNA samples and performing laboratory work. The authors thank contributions of KORA: methylation analysis and initial quality control were performed at the Core Facility Genotyping (Genome Analysis Center (GAC), Helmholtz Zentrum München) under the supervision of Nadine Lindemann, Dr. Jennifer Kriebel, and Dr. Eva Reischl. Further methylation data quality control and preprocessing were performed by Rory Wilson, Department of Molecular Epidemiology, Institute of Epidemiology, Helmholtz Zentrum München.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
Conception and design: X.L., H.B. Development of methodology: X.L., H.B. Acquisition of data (acquired and managed patients, provided facilities, etc.): B.H., B.S., H.B., E.G., A.P., M.W., B.T. Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): X.L., B.S., T.D. Writing of the manuscript: X.L., H.B. Critical review and revision of the manuscript: all authors Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): X.L., T.D. Study supervision: H.B.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Jonathan Mak and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Li, X., Delerue, T., Schöttker, B. et al. Derivation and validation of an epigenetic frailty risk score in population-based cohorts of older adults. Nat Commun 13, 5269 (2022). https://doi.org/10.1038/s41467-022-32893-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-32893-x
- Springer Nature Limited