Abstract
The healthy lifestyle index (HLI), defined as the unweighted sum of individual lifestyle components, was used to investigate the combined role of lifestyle factors on health-related outcomes. We introduced weighted outcome-specific versions of the HLI, where individual lifestyle components were weighted according to their associations with disease outcomes. Within the European Prospective Investigation into Cancer and Nutrition (EPIC), we examined the association between the standard and the outcome-specific HLIs and the risk of T2D, CVD, cancer, and all-cause premature mortality. Estimates of the hazard ratios (HRs), the Harrell’s C-index and the population attributable fractions (PAFs) were compared. For T2D, the HR for 1-SD increase of the standard and T2D-specific HLI were 0.66 (95% CI: 0.64, 0.67) and 0.43 (0.42, 0.44), respectively, and the C-index were 0.63 (0.62, 0.64) and 0.72 (0.72, 0.73). Similar, yet less pronounced differences in HR and C-index were observed for standard and outcome-specific estimates for cancer, CVD and all-cause mortality. PAF estimates for mortality before age 80 were 57% (55%, 58%) and 33% (32%, 34%) for standard and mortality-specific HLI, respectively. The use of outcome-specific HLI could improve the assessment of the role of lifestyle factors on disease outcomes, thus enhancing the definition of public health recommendations.
Similar content being viewed by others
Introduction
Lifestyle behaviors encompass multiple exposure factors, such as smoking habits, alcohol consumption, physical activity, adiposity, dietary habits and sleep1,2. Unhealthy lifestyle behaviors are associated with an increased risk of several chronic diseases3,4,5,6,7,8,9 and all-cause mortality10. This was first established in studies of individual lifestyle components, whereby summary measures such as the Mediterranean Diet Score for diet11,12,13, the number of pack-years for smoking habits14,15 and body mass index (BMI) for adiposity16, were associated with health-related outcomes. In parallel, individual lifestyle factors have been combined into versions of the healthy lifestyle index (HLI) to study the combined effects of individual lifestyle components on health and provide a holistic assessment on the role of lifestyle5,6,9,10,17,18. The HLI was mostly defined as the sum of individual scores expressing exposure to one particular lifestyle component, and was associated with mortality2,6,19 and the risk of type 2 diabetes (T2D)17,20,21,22, cardiovascular diseases (CVD)3,5, cancers18,23,24,25,26,27,28 and multi-morbidity8,29.
Although alternative versions have been proposed, e.g. based on principal component analysis19, the standard version of the HLI gives equal weight to each lifestyle component score, which implicitly assumes that all components have the same relationship with a given disease outcome. This strategy could yield biased assessments of the lifestyle-outcome relationships, particularly for outcomes that are predominantly associated with one lifestyle component. In this study we introduced and examined outcome-specific HLIs that used outcome-specific weights reflecting the strength of the association between each component and the outcome.
Within the European Prospective Investigation into Cancer and nutrition (EPIC)30, we compared results of analyses based on the standard and the outcome-specific HLIs in relation to the risk of T2D, CVD, overall cancer and all-cause mortality. We focused our evaluation on three standard epidemiological quantities: the hazard-ratio (HR), Harrell’s C-index, and population attributable fractions (PAFs) to reflect the strength of association, the discriminatory power, and the public health burden, respectively. This empirical comparison was complemented with a theoretical study of unweighted and weighted composite scores under simple linear causal models.
Methods
Study population
EPIC is an ongoing multicentric prospective study originally designed to study the relationship between diet and cancer risk30. EPIC recruited over 500,000 men and women between 1992 and 2000 from 23 centers in 10 European countries. In our analyses we excluded participants from centres lacking information on occurrence or date of diagnosis of T2D or CVD (France, Norway, Greece and Malmö; n = 168,382), participants with no follow-up for mortality (n = 1746) or no information on lifestyle (n = 934), participants with missing information on the incidence of T2D, CVD, and/or cancer during follow-up (n = 63,842), participants with prevalent T2D, CVD, or cancer at recruitment (n = 23,864), and, for sake of simplicity, participants with missing information on any of the five variables used in the definition of HLI (n = 5786), defined as complete-case analysis.
Health-related outcomes
Data on vital status and incidence of T2D and CVD (coded using the 10th Edition of the International Classification of Diseases, ICD-10), and cancer (coded according to the International Classification of Diseases for Oncology, ICD-O-3) were collected by each participating centre, from inclusion in the study to a center- and outcome-specific last date of ascertainment30,31,32. Dates of death were collected using record linkage with cancer registries, boards of health and death indices, or through active follow-up. Incident T2D cases, defined as E11 (ICD-10), were ascertained by a combination of self-report, linkage to primary care registers, secondary care registers, medication use (drug registers), hospital admissions, and mortality data31. CVD endpoints, defined as a composite of ischemic heart diseases (I20-I25), atrial fibrillation (I48), and cerebrovascular disease (I60-I69), were ascertained by different methods depending on the follow-up procedures by centre, using active follow-up through questionnaires or linkage with morbidity and hospital registries, or both32. Incident first primary cancer cases (excluding non-melanoma skin cancers) were identified through a combination of center-specific methods, including health insurance records, cancer and pathology registries and active follow-up through study participants and their next-of-kin. Follow-up for each participant and event of interest began upon inclusion in the study and ended upon the occurrence of the event, loss to follow-up, or the last date of ascertainment, whichever came first.
Assessment of lifestyle exposures at baseline
BMI (kg/m2) was derived from measured height and weight in all centers, except Oxford where it was self-reported30. A validated index capturing all physical activity domains (Cambridge Index) was computed from physical activity during recreational activities and at work33. Diet, including alcohol intake, was assessed using validated country- or center-specific dietary questionnaires designed to capture habitual consumption over the year preceding the study recruitment30. To measure adherence to a healthy diet, we computed the modified relative Mediterranean Diet Score (mrMDS), a version of the original Mediterranean Diet Score incorporating vegetable oil instead of olive oil12. To avoid redundancy with the alcohol component in the HLI, our version of mrMDS omitted alcohol intake. The remaining eight mrMDS components were measured in grams per 1000 kcal to express dietary intake as energy density12. All dietary components were divided into country-specific tertiles and scores 0 to 2 were summed up, resulting in a final mrMDS ranging from 0 to 16 with increasing scores for healthier diets. Information on smoking status was obtained using lifestyle questionnaires30, as was information on variables used for adjustments in our models, including educational attainment, menopausal status in women and the use of hormones in post-menopausal women.
Healthy lifestyle indices
Following the previous definition of the HLI used in a study of multi-morbidity in EPIC8, we considered HLIs that combined information on participants’ exposure to smoking, alcohol intake, diet, physical activity and adiposity. To facilitate the comparison of performance between the standard and outcome-specific HLIs, we used a binary scoring with 0/1 values reflecting unhealthy/healthy behavior for each component8, as displayed in Table 1. The standard HLI, ranging from 0 (unhealthiest behavior) to 5 (healthiest behavior), was defined as
To more accurately reflect the potential heterogenous relationships of each component with specific disease outcomes, outcome-specific HLIs were constructed using the same categorical scoring system. Data-driven weights were derived from the parameters of the main effects (\({w}_{k}\)) and of the interaction terms (\({\gamma }_{l}\)) in outcome-specific adjusted Cox models, implementing a forward selection procedure in EPIC to select relevant interaction terms among the lifestyle components. The outcome-specific HLI was defined as
with weights \({w}_{k}^{*}\) and \({\gamma }_{l}^{*}\) corresponding to scaled versions of \({w}_{k}\) and \({\gamma }_{l}\) so that outcome-specific HLIs had unit variance and larger values correspond to healthier profiles.
We also considered a more comprehensive scoring system for each variable ranging from 0 (unhealthiest) to 4 (healthiest behavior), as displayed in Table 1, again following a previous definition of the HLI used in EPIC8.
Cox models
In all our analyses, Cox models used age as the main time scale and were stratified by study center, sex, and age at recruitment in 5-year categories. They were adjusted for education level (no schooling, primary, secondary, and university or more), height (continuous), and energy intake from non-alcoholic sources (kcal/day), and, for women, menopausal status (pre-menopausal, peri-menopausal, post-menopausal, surgical) and use of postmenopausal hormones (never, ever, unknown). For each outcome, one Cox model was constructed with all five score variables as the main exposures to derive the outcome-specific weights and the outcome-specific HLIs. Then, Cox models were constructed by considering, in turn, each version of the HLI as the main exposure. The HLI was consistently modelled in continuous using a linear term on the log-hazard-rate scale.
Evaluation criteria
HR estimates and discriminatory power
For each event, HR estimates and corresponding 95% confidence intervals (CIs) were computed for a 1-standard deviation (SD) increase of the different versions of the HLI. They allowed the comparison of the estimated association between overall adherence to a healthy lifestyle and the event being studied, depending on the version of the HLI being used. To further illustrate how risk stratification may be hindered when using an unweighted rather than an outcome-specific weighted HLI, we considered the 25 = 32 lifestyle profiles corresponding to each possible combination of the five binary scores Smoking(0–1), Alcohol(0–1), Diet(0–1), PA(0–1) and Adipo(0–1). Setting the unhealthiest profile {Smoking(0–1) = 0, Alcohol(0–1) = 0, Diet(0–1) = 0, PA(0–1) = 0, Adipo(0–1) = 0} as the reference, we compared the HR estimates for the other 31 profiles produced by Cox models utilizing the standard and outcome-specific HLIs, respectively.
More generally, the discriminatory power of models based on the different versions of the HLI was quantified using Harrell’s C-index. HRs and Harrell’s C-indices were primarily computed in the full EPIC study population. For models based on outcome-specific HLIs, this amounted to evaluating them on the data used for their construction, which could create bias if overfitting was present. Cross-validation was applied to assess this bias: the EPIC study population was randomly split into (i) a training sample (75% of the total sample) where the outcome-specific weights were estimated, and (ii) a test sample (the remaining 25% of the total sample) where HRs and Harrell’s C-indices were computed. This process was repeated 10 times to prevent possible dependency on a single split34. HRs and Harrell’s C-indices were averaged over these 10 repetitions and compared to the values obtained on the total EPIC study population to assess the bias magnitude.
Population attributable fractions
For each specific outcome, we computed PAFs at age a, defined as
Here, \(P\left(Y<a\right)\) is the event risk before age a in the EPIC study population and \(P\left({Y}^{\left(max\right)}<a\right)\) is the hypothetical event risk before age a in the counterfactual EPIC study population where, for all participants, all five lifestyle summary variables would have been set to their maximal possible values, while all other variables used for adjustment or stratification would have been set to their actual value observed in EPIC. Under technical conditions35, \(PAF\left(a\right)\) coincides with the proportion \(P\left({Y}^{\left(max\right)}>a| Y<a\right)\) of events before age a that would have been prevented had all EPIC participants adhered to the “healthiest” behavior regarding all five lifestyle components. Absolute risks \(P\left(Y<a\right)\) and counterfactual absolute risks \(P\left({Y}^{\left(max\right)}<a\right)\) were estimated by averaging the individual risk predictions in the EPIC study population, and in the counterfactual populations, respectively. Non-parametric bootstrap based on 100 bootstrapped samples was used to estimate the corresponding 95% CI.
All analyses were performed using the R software, version 4.1.2. Given the nature of the weights used in the definition of the outcome-specific HLIs, models utilizing individual lifestyle scores would achieve similar discriminatory power and produce similar PAF estimates when compared to models based on outcome-specific HLIs.
Ethics
The EPIC study was conducted according to the Declaration of Helsinki and approved by the ethics committee at the International Agency for Research on Cancer (IARC) on 12 January 1995 and on 10 May 2017 (re-evaluation). Written informed consent was obtained from all subjects involved in the study.
Results
Study population
The final study population comprised 256,769 EPIC participants (Fig. 1), including 99,098 men (38.6%) and 157,671 women (61.4%) (Table 2). Average follow-up time and total number of incident events were 16.3 (SD = 3.4) years and 25,191 for all-cause mortality, 10.9 (2.3) years and 11,763 for T2D, 11.5 (2.8) years and 11,766 for CVD, and 14.3 (4.0) years and 34,159 for cancer.
HLIs based on binary scores
Outcome-specific HLIs
Weights of the main terms used in the definition of the outcome-specific HLIs were all positive, except the one of Alcohol(0–1) for the T2D- and CVD- specific HLIs (Table 3). Smoking had the strongest impact on the all-cause mortality, CVD- and cancer- specific HLIs, while adiposity had the strongest impact on the T2D-specific HLI. Alcohol had the weakest impact on the all-cause mortality specific HLI, while it was diet for the T2D- and CVD- specific HLI, and physical activity for the cancer-specific HLI. Several interaction terms were selected in the outcome-specific HLIs. For example, Alcohol(0–1) * Diet(0–1) was selected with a negative weight in the simplified death- and CVD-specific HLIs (Table 3).
Supplementary Figure S1 presents the empirical distributions of the outcome-specific HLIs and, for comparison, of the standard HLI (after scaling it to a unit standard deviation). The distribution of the standard HLI was approximately symmetrical and centered around 3, while the distributions of all the outcome-specific HLIs were more skewed, with most values in the top range of the distributions.
HRs, risk stratification and discriminatory power
As displayed in Table 4, outcome-specific HLIs were more strongly associated with risk of the corresponding outcome than the standard HLI. For T2D for example, the HR was 0.71 (0.70, 0.72) and 0.60 (0.59, 0.61) per 1-SD increment of the standard and outcome-specific HLI, respectively. HR estimates for the 32 lifestyle profiles corresponding to each possible combination of the five binary scores highlighted that different lifestyle profiles leading to the same value of the standard HLI could be associated with sizably different hazard ratios (Fig. 2). Among all five lifestyle profiles leading to a standard HLI value of 1, the profile {Smoking(0–1) = 1, Alcohol(0–1) = 0, Diet(0–1) = 0, PA(0–1) = 0, Adipo(0–1) = 0} had a much lower HR for all-cause mortality, and to a lesser extent, CVD and cancer, compared to the other four profiles, while the profile {Smoking(0–1) = 0, Alcohol(0–1) = 0, Diet(0–1) = 0, PA(0–1) = 0, Adipo(0–1) = 1} had the lowest HR for T2D, thus mirroring the respective impacts of the lifestyle components on the different outcome-specific HLIs.
Models based on outcome-specific HLIs achieved a larger discriminatory power compared to those based on the standard HLI. For T2D, Harrell’s C-index was 0.67 (0.66, 0.68) and 0.61 (0.60, 0.62) for the models based on the outcome-specific and standard HLIs, respectively. Cross-validated estimates were similar to those computed on the total EPIC study population (Supplementary Table 2), suggesting low to null bias due to over-fitting in our main analysis.
Population attributable fractions
PAFs were consistently larger in analyses based on the standard HLI compared to those based on outcome-specific HLIs. The proportion of deaths by the age of 80 that would have been prevented had the whole population adhered to the “healthiest” lifestyle habits was estimated to be 33% (31%, 34%) and 23% (22%, 24%) when using the standard HLI and the mortality-specific HLI, respectively (Fig. 3). Similar patterns were observed for the other three outcomes (Figs. 3).
HLIs based on categorical scores
Overall, the “healthiest” categories of each individual component received the largest weights in the outcome-specific HLIs (Supplementary Table 2). Consistent with what we observed when using binary scores, smoking had the strongest impact on the all-cause mortality, CVD- and cancer- specific HLIs, while adiposity had the strongest impact on the T2D-specific HLI. The distribution of the standard HLI was approximately symmetrical around its mean value (Supplementary Fig. 2). The distributions of the all-cause mortality-, CVD- and cancer-specific HLIs were more skewed, with most values in the top range of the distributions, while the T2D-specific HLI had a multi-modal distribution. Patterns of differences in HR, C-index and PAF estimated from analyses based on standard and outcome-specific HLIs were similar, yet more pronounced, when utilizing categorical scores compared to the simplified setting of binary scores presented above. For T2D, the HR was 0.66 (0.64, 0.67) and 0.43 (0.42, 0.44) per 1-SD increment of the standard and outcome-specific HLIs, respectively, while the corresponding C-index were 0.63 (0.62, 0.64) and 0.72(0.72, 0.73). The proportion of deaths by the age of 80 that would have been prevented had the whole population adhered to the “healthiest” lifestyle habits was estimated to be 57% (55%, 58%) and 33% (32%, 34%) when considering the standard and death-specific HLI, respectively (Fig. 4).
Theoretical study of linear causal models
Our theoretical study under a linear causal model presented in the Supplementary Material showed that the regression parameter of the scaled weighted composite score was always larger than that of the scaled unweighted composite score, unless the weighted and unweighted composite scores coincided. Considering the analog of the PAF, we showed that analyses utilizing the unweighted composite score yielded either downward or upward biased estimates, while analyses based on the weighted composite score yielded unbiased estimates.
Discussion
In this study, we introduced a novel HLI to account for the magnitude of the relationships between individual lifestyle components and specific disease outcomes, using data-driven weights. The standard and the outcome-specific versions of the HLI were extensively compared by estimating the HR, the C-index and the PAF in a range of scenarios, involving the risk of cancer, T2D, CVD and premature mortality. Two strategies to operationalize the HLI were also investigated, involving in turn, binary indicators or categorical scores for the five components.
In our study, the discriminatory power was consistently larger for models based on the outcome-specific HLI than the standard HLI, sometimes to a large extent as in the case of T2D. The reason for this limitation of the standard HLI was clearly illustrated in Fig. 2, when considering binary indicators. As the standard HLI assumes that all lifestyle components are equally associated with the risk of disease, different lifestyle patterns with the same number of unhealthy components necessarily lead to the same predicted disease hazard rate in analyses based on the standard HLI. Conversely, our analyses utilizing the outcome-specific HLI reflected the disease hazard rate heterogeneity across these lifestyle patterns with the same number of unhealthy components. This limitation of the standard HLI in terms of discriminatory power highlights that it might be a suboptimal analytical choice for risk stratification and/or risk prediction36, especially in situations where a given lifestyle component is strongly linked to the outcome under consideration, such as BMI in the case of T2D.
Most previous studies on the HLI used the standard HLI to address etiological questions, specifically to estimate disease-specific HRs to quantify the impact of adhering to healthy lifestyle habits, and disease-specific PAFs to measure the public health burden attributable to unhealthy lifestyles8,18,19,22,24. In our study, we observed consistently weaker HR estimates for the standard HLI than the outcome-specific HLI, sometimes to a large extent, as for T2D. These results suggest that analyses utilizing outcome-specific HLIs are more likely to detect associations, particularly for diseases weakly associated with lifestyle habits. Conversely, PAF estimates were consistently larger when using the standard HLI. Estimating weaker HRs and larger PAFs with the standard HLI than the outcome-specific HLI may seem paradoxical, however our results from the theoretical study of linear causal models and the inspection of the empirical distributions of the standard and the outcome-specific HLIs displayed in Figure S1 might help clarify this apparent paradox. According to the binary version of standard HLI, 59% of the EPIC study population had a standard HLI lower or equal to 3 units, i.e., more than 2 standard-deviations below the maximum HLI of 5 units. As a result, the health benefits for this large proportion of participants, had they adhered to the healthiest possible lifestyle, led to large PAF estimates. On the other hand, according to the, say, death-specific HLI, 65% of the study population had an HLI value within one standard-deviation of the maximum HLI. As a result, the benefit in premature mortality had they adhered to the healthiest possible lifestyle was less remarkable, thus explaining the lower PAF estimates. In essence, analyses of the outcome-specific HLI mimics closely an analytical strategy where individual lifestyle components are evaluated jointly within the same model, and therefore yield similar PAF estimates. Thus, our results highlight that analyses based on standard HLI could lead to biased assessments of the public health burden attributable to unhealthy lifestyle. As mentioned in our theoretical study of linear causal models, it could be argued that utilizing standard HLIs might produce approximately valid estimates of PAFs of a latent variable, e.g., reflecting health-consciousness. Yet, the validity of this approach, particularly whether the standard HLI is a better proxy than weighted HLIs for this latent variable, would need further assessment.
The etiology of chronic diseases is complex, and some level of simplification via summary quantities is welcome in epidemiological research. To paraphrase Box’s aphorism, “all summarizations are wrong but some are useful”37. To be useful, a summarization should produce approximately valid results. The validity of results in analyses based on the standard HLI could be assessed by comparing them to results of an outcome-specific HLI or the individual lifestyle components. If results are similar, the standard HLI could be appropriate as it does not rely on data-driven weights and it could facilitate the comparison of findings across studies and across health-related outcomes. However, the premise that standard HLIs would facilitate comparison across studies might be tempered in view of the myriad of versions of standard HLIs proposed in the literature5,6,9,10,17,18,38.
Multiple lifestyle factors influence an individual’s health, but some are more critical than others, which should be reflected in public health recommendations. Towards this aim, the “healthiest” lifestyle profiles could be defined as the combinations of individual lifestyle behaviors associated with lowest risk of disease, longest life expectancy, or longest life expectancy free of a chronic disease9. The development and validation of an HLI using weights derived from meta-analyzed associations with disease risk, mortality or a composite outcome reflecting mortality and common chronic diseases would help the characterization and promotion of these healthiest profiles.
In line with previous versions of the standard HLI8, the HLI considered in this study was based on five individual components: smoking habits, alcohol intake, diet, physical activity, and adiposity. Refined statistical methods, e.g. using splines, could be used to combine the individual components into an outcome-specific HLI. Also, working with a refined categorization of these five components, including more descriptors, such as more refined information on smoking intensity or adiposity or a broader spectrum of dietary exposures, and/or including information on other lifestyle factors, such as quality of sleep39,40 or stress, might lead to more accurate assessments of the relationship between lifestyle and health-related outcomes. The evaluation conducted in this study relied on the EPIC cohort, where the study populations in the various countries were generally more health-conscious than their source populations. We could not account for other major chronic diseases that could affect observed associations with our outcomes of interest because of a lack of such information in EPIC. For example, chronic obstructive pulmonary disease (COPD) frequently co-occurs with CVD and share tobacco smoking as a main risk factor41. These potential limitations were acknowledged, yet they were unlikely to affect the main conclusions of the study, which were corroborated by the evidence of our theoretical results under simple linear causal models.
Conclusions
The assessment of the relationship between a holistic composite score reflecting adherence to healthy lifestyle behaviors and the risk of disease and mortality could be improved by utilizing outcome-specific versions of the HLI. Standard HLIs can lead to biased assessments of the public health burden attributable to unhealthy lifestyles. The development and validation of data-driven HLIs best predicting the occurrence of disease outcomes is instrumental for the assessment of lifestyle and disease risk associations and the generation of accurate public health recommendations.
Data availability
The EPIC data is not publicly available, but access requests can be submitted to the Steering Committee (https://epic.iarc.fr/access/index.php).
References
Muller, D. C. et al. Modifiable causes of premature death in middle-age in Western Europe: results from the EPIC cohort study. BMC Med. 14, 87 (2016).
Ibsen, D. B. et al. Modifiable Lifestyle recommendations and mortality in Denmark: A cohort study. Am J Prev Med. 60, 792–801 (2021).
Stampfer, M. J., Hu, F. B., Manson, J. E., Rimm, E. B. & Willett, W. C. Primary prevention of coronary heart disease in women through diet and lifestyle. N Engl J Med. 343, 16–22 (2000).
Board I of M (US) and NRC (US) NCP, Curry SJ, Byers T, Hewitt M. Lifestyle Behaviors Contributing to the Burden of Cancer [Internet]. National Academies Press (US); 2003 [cited 2019 Oct 17]. Available from: https://www.ncbi.nlm.nih.gov/books/NBK223925/
Chiuve, S. E. et al. Primary prevention of stroke by healthy lifestyle. Circulation. 118, 947–954 (2008).
Loef, M. & Walach, H. The combined effects of healthy lifestyle behaviors on all cause mortality: A systematic review and meta-analysis. Prev Med. 55, 163–170 (2012).
Soerjomataram, I. et al. Cancers related to lifestyle and environmental factors in France in 2015. Eur J Cancer Oxf Engl 2018(105), 103–113 (1990).
Freisling, H. et al. Lifestyle factors and risk of multimorbidity of cancer and cardiometabolic diseases: A multinational cohort study. BMC Med. 18, 5 (2020).
Li, Y. et al. Healthy lifestyle and life expectancy free of cancer, cardiovascular disease, and type 2 diabetes: Prospective cohort study. BMJ. 368, l6669 (2020).
Li, Y. et al. Impact of healthy lifestyle factors on life expectancies in the US population. Circulation. 138, 345–355 (2018).
Panagiotakos, D. B., Pitsavos, C. & Stefanadis, C. Dietary patterns: a Mediterranean diet score and its relation to clinical and biological markers of cardiovascular disease risk. Nutr. Metab. Cardiovasc. Dis. NMCD. 16, 559–568 (2006).
Buckland, G. et al. Adherence to the Mediterranean diet and risk of coronary heart disease in the Spanish EPIC Cohort Study. Am J Epidemiol. 170, 1518–1529 (2009).
Couto, E. et al. Mediterranean dietary pattern and cancer risk in the EPIC cohort. Br. J. Cancer. 104, 1493–1499 (2011).
Tindle, H. A. et al. Lifetime smoking history and risk of lung cancer: Results from the framingham heart study. J. Natl. Cancer Inst. 110, 1201–1207 (2018).
Pleasants, R. A., Rivera, M. P., Tilley, S. L. & Bhatt, S. P. Both duration and pack-years of tobacco smoking should be used for clinical practice and research. Ann. Am. Thorac. Soc. 17, 804–806 (2020).
Arnold, M. et al. Cancers in France in 2015 attributable to high body mass index. Cancer Epidemiol. 52, 15–19 (2018).
Hu, F. B. et al. Diet, lifestyle, and the risk of type 2 diabetes mellitus in women. N Engl. J. Med. 345, 790–797 (2001).
McKenzie, F. et al. Healthy lifestyle and risk of cancer in the European prospective investigation into cancer and nutrition cohort study. Medicine (Baltimore). 95, e2850 (2016).
Hu, P. et al. Effect of healthy lifestyle index and lifestyle patterns on the risk of mortality: A community-based cohort study. Front Med. https://doi.org/10.3389/fmed.2022.920760 (2022).
Mozaffarian, D. et al. Lifestyle risk factors and new-onset diabetes mellitus in older adults. Arch. Intern. Med. 169, 798–807 (2009).
Lv, J. et al. Adherence to a healthy lifestyle and the risk of type 2 diabetes in Chinese adults. Int. J. Epidemiol. 46, 1410–1420 (2017).
Rajaobelina, K. et al. Population attributable fractions of the main type 2 diabetes mellitus risk factors in women: Findings from the French E3N cohort. J. Diabetes. 11, 242–253 (2019).
Aleksandrova, K. et al. Combined impact of healthy lifestyle factors on colorectal cancer: A large European cohort study. BMC Med. 12, 168 (2014).
Naudin, S. et al. Healthy lifestyle and the risk of pancreatic cancer in the EPIC study. Eur. J. Epidemiol. 35, 975–986 (2020).
Chen, S. L. F. et al. Combined lifestyle behaviors and the incidence of common cancer types in the norwegian women and cancer study (NOWAC). Clin. Epidemiol. 13, 721–734 (2021).
Li, Q. et al. The associations of healthy lifestyle index with breast cancer incidence and mortality in a population-based study. Breast Cancer Tokyo Jpn. 29, 957–966 (2022).
Peila, R. et al. Healthy lifestyle index and risk of pancreatic cancer in the Women’s Health Initiative. Cancer Causes Control CCC. 33, 737–747 (2022).
Meer, R., van de Pol, J., van den Brandt, P. A. & Schouten, L. J. The association of healthy lifestyle index score and the risk of renal cell cancer in the Netherlands cohort study. BMC Cancer. 23, 156 (2023).
Liu, G. et al. Adherence to a healthy lifestyle in association with microvascular complications among adults with type 2 diabetes. JAMA Netw Open. 6, e2252239 (2023).
Riboli, E. et al. European Prospective Investigation into Cancer and Nutrition (EPIC): Study populations and data collection. Public Health Nutr. 5, 1113–1124 (2002).
InterAct Consortium et al. Design and cohort description of the InterAct Project: an examination of the interaction of genetic and lifestyle factors on the incidence of type 2 diabetes in the EPIC Study. Diabetologia. 54, 2272–2282 (2011).
Danesh, J. et al. EPIC-Heart: the cardiovascular component of a prospective study of nutritional, lifestyle and biological factors in 520,000 middle-aged participants from 10 European countries. Eur J Epidemiol. 22, 129–141 (2007).
Ekelund, U. et al. Physical activity and gain in abdominal adiposity and body weight: prospective cohort study in 288,498 men and women. Am J Clin Nutr. 93, 826–835 (2011).
Kuhn, M. & Johnson, K. Applied Predictive Modeling (Springer, 2013). https://doi.org/10.1007/978-1-4614-6849-3.
Causality, P. J. Models, reasoning, and inference 2nd edn. (Cambridge University Press, 2009).
Liao, J., Muniz-Terrera, G., Scholes, S., Hao, Y. & Chen, Y. Lifestyle index for mortality prediction using multiple ageing cohorts in the USA UK and Europe. Sci Rep. 8, 6644 (2018).
Box, G. E. P. Robustness in the Strategy of Scientific Model Building. In Robustness Stat (eds Launer, R. L. & Wilkinson, G. N.) 201–236 (Academic Press, 1979).
Schulze, M. B. & Hu, F. B. Epidemiology of Diabetes. In Handb Epidemiol (eds Ahrens, W. & Pigeot, I.) 1–49 (Springer, 2019).
Song, Z. et al. Association of healthy lifestyle including a healthy sleep pattern with incident type 2 diabetes mellitus among individuals with hypertension. Cardiovasc. Diabetol. 20, 239 (2021).
Guasch-Ferré, M. et al. Healthy lifestyle score including sleep duration and cardiovascular disease risk. Am. J. Prev. Med. 63, 33–42 (2022).
Rabe, K. F., Hurst, J. R. & Suissa, S. Cardiovascular disease and COPD: Dangerous liaisons?. Eur. Respir. Rev. 27, 180057 (2018).
Acknowledgements
The authors thank the EPIC participants and PIs for their cooperation in providing invaluable data. We also thank Bertrand Hemon, Corinne Casagrande and Carine Biessy (International Agency for Research on Cancer) for managing the EPIC data. The authors additional thank the government of Asturias (Spain) and all EPIC centers in the 9 European countries for their ongoing support to the EPIC Study.
Disclaimer
Where authors are identified as personnel of the International Agency for Research on Cancer/World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy, or views of the International Agency for Research on Cancer/World Health Organization.
Funding
This study was partially supported by the grant LIBERTY (AAP SHS-E-SP 2020, PI: P Ferrari) from the French Institut National du Cancer (INCa). The coordination of EPIC is financially supported by International Agency for Research on Cancer (IARC) and by the Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, which has additional infrastructure support provided by the NIHR Imperial Biomedical Research Centre (BRC). The national cohorts are supported by Danish Cancer Society (Denmark); Ligue Contre le Cancer, Institut Gustave Roussy, Mutuelle Générale de l’Education Nationale, Institut National de la Santé et de la Recherche Médi‐ cale (INSERM) (France); German Cancer Aid, German Cancer Research Center (DKFZ), German Institute of Human Nutrition Potsdam‐Rehbruecke (DIfE), Federal Ministry of Education and Research (BMBF) (Germany); Associazione Italiana per la Ricerca sul Cancro‐AIRC‐Italy, Compagnia di SanPaolo and National Research Council (Italy); Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF), Statistics Netherlands (The Netherlands); Health Research Fund (FIS) ‐ Instituto de Salud Carlos III (ISCIII), Regional Govern‐ ments of Andalucía, Asturias, Basque Country, Murcia and Navarra, and the Catalan Institute of Oncology ‐ ICO (Spain); Swedish Cancer Society, Swedish Research Council and County Councils of Skåne and Västerbotten (Sweden); Cancer Research UK (14,136 to EPIC‐Norfolk; C8221/A29017 to EPIC‐Oxford), Medical Research Council (1,000,143 to EPIC‐Norfolk; MR/M012190/1 to EPIC‐Oxford) (UK). IDIBELL acknowledges support from the Generalitat de Catalunya through the CERCA Program. The funders were not involved in designing the study; collecting, analysing, and interpreting results; or writing and submitting the manuscript for publication.
Author information
Authors and Affiliations
Contributions
The authors’ responsibilities were as follows: PF and VV conceived, designed, and supervised the research. VV analysed the data. PF, HF, KM and VV were responsible for drafting the manuscript. AON, CCD, AT, AKE, RK, VK, MBS, GM, GT, VS, RT, LM, JWGD, YTS, THN, KBB, TMS, JRQ, MRB, CB, AAA, LC, MG, BS, AW, AKH, MJG, EW and MJ provided the original data, information on the respective populations, and advice on the study design, analysis, and interpretation of the results. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Viallon, V., Freisling, H., Matta, K. et al. On the use of the healthy lifestyle index to investigate specific disease outcomes. Sci Rep 14, 16330 (2024). https://doi.org/10.1038/s41598-024-66772-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-66772-w
- Springer Nature Limited