On the use of the healthy lifestyle index to investigate specific disease outcomes

Viallon, Vivian; Freisling, Heinz; Matta, Komodo; Nannsen, Anne Østergaard; Dahm, Christina C.; Tjønneland, Anne; Eriksen, Anne Kirstine; Kaaks, Rudolf; Katzke, Verena A.; Schulze, Matthias B.; Masala, Giovanna; Tagliabue, Giovanna; Simeon, Vittorio; Tumino, Rosario; Milani, Lorenzo; Derksen, Jeroen W. G.; van der Schouw, Yvonne T.; Nøst, Therese Haugdahl; Borch, Kristin Benjaminsen; Sandanger, Torkjel M.; Quirós, J. Ramón; Rodriguez-Barranco, Miguel; Bonet, Catalina; Aizpurua-Atxega, Amaia; Cirera, Lluís; Guevara, Marcela; Sundström, Björn; Winkvist, Anna; Heath, Alicia K.; Gunter, Marc J.; Weiderpass, Elisabete; Johansson, Mattias; Ferrari, Pietro

doi:10.1038/s41598-024-66772-w

On the use of the healthy lifestyle index to investigate specific disease outcomes

Article
Open access
Published: 15 July 2024

Volume 14, article number 16330, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

On the use of the healthy lifestyle index to investigate specific disease outcomes

Download PDF

Vivian Viallon¹,
Heinz Freisling¹,
Komodo Matta¹,
Anne Østergaard Nannsen²,
Christina C. Dahm²,
Anne Tjønneland^3,4,
Anne Kirstine Eriksen³,
Rudolf Kaaks⁵,
Verena A. Katzke⁵,
Matthias B. Schulze^6,7,8,
Giovanna Masala⁹,
Giovanna Tagliabue¹⁰,
Vittorio Simeon¹¹,
Rosario Tumino¹²,
Lorenzo Milani¹³,
Jeroen W. G. Derksen¹⁴,
Yvonne T. van der Schouw¹⁴,
Therese Haugdahl Nøst^15,16,
Kristin Benjaminsen Borch¹⁶,
Torkjel M. Sandanger¹⁶,
J. Ramón Quirós¹⁷,
Miguel Rodriguez-Barranco^18,19,20,
Catalina Bonet^21,22,
Amaia Aizpurua-Atxega^23,24,
Lluís Cirera²⁵^nAff20,
Marcela Guevara^20,26,27,
Björn Sundström²⁸,
Anna Winkvist^29,30,
Alicia K. Heath³¹,
Marc J. Gunter³¹,
Elisabete Weiderpass¹,
Mattias Johansson¹ &
…
Pietro Ferrari¹

944 Accesses
28 Altmetric
3 Mentions
Explore all metrics

Abstract

The healthy lifestyle index (HLI), defined as the unweighted sum of individual lifestyle components, was used to investigate the combined role of lifestyle factors on health-related outcomes. We introduced weighted outcome-specific versions of the HLI, where individual lifestyle components were weighted according to their associations with disease outcomes. Within the European Prospective Investigation into Cancer and Nutrition (EPIC), we examined the association between the standard and the outcome-specific HLIs and the risk of T2D, CVD, cancer, and all-cause premature mortality. Estimates of the hazard ratios (HRs), the Harrell’s C-index and the population attributable fractions (PAFs) were compared. For T2D, the HR for 1-SD increase of the standard and T2D-specific HLI were 0.66 (95% CI: 0.64, 0.67) and 0.43 (0.42, 0.44), respectively, and the C-index were 0.63 (0.62, 0.64) and 0.72 (0.72, 0.73). Similar, yet less pronounced differences in HR and C-index were observed for standard and outcome-specific estimates for cancer, CVD and all-cause mortality. PAF estimates for mortality before age 80 were 57% (55%, 58%) and 33% (32%, 34%) for standard and mortality-specific HLI, respectively. The use of outcome-specific HLI could improve the assessment of the role of lifestyle factors on disease outcomes, thus enhancing the definition of public health recommendations.

Combined lifestyle factors, incident cancer, and cancer mortality: a systematic review and meta-analysis of prospective cohort studies

Article Open access 10 February 2020

Cardiovascular health and cancer mortality: evidence from US NHANES and UK Biobank cohort studies

Article Open access 05 September 2024

Healthy lifestyle change and all-cause and cancer mortality in the European Prospective Investigation into Cancer and Nutrition cohort

Article Open access 29 May 2024

Introduction

Lifestyle behaviors encompass multiple exposure factors, such as smoking habits, alcohol consumption, physical activity, adiposity, dietary habits and sleep^1,2. Unhealthy lifestyle behaviors are associated with an increased risk of several chronic diseases^{3,4,5,6,7,8,9} and all-cause mortality¹⁰. This was first established in studies of individual lifestyle components, whereby summary measures such as the Mediterranean Diet Score for diet^11,12,13, the number of pack-years for smoking habits^14,15 and body mass index (BMI) for adiposity¹⁶, were associated with health-related outcomes. In parallel, individual lifestyle factors have been combined into versions of the healthy lifestyle index (HLI) to study the combined effects of individual lifestyle components on health and provide a holistic assessment on the role of lifestyle^{5,6,9,10,17,18}. The HLI was mostly defined as the sum of individual scores expressing exposure to one particular lifestyle component, and was associated with mortality^2,6,19 and the risk of type 2 diabetes (T2D)^17,20,21,22, cardiovascular diseases (CVD)^3,5, cancers^{18,23,24,25,26,27,28} and multi-morbidity^8,29.

Although alternative versions have been proposed, e.g. based on principal component analysis¹⁹, the standard version of the HLI gives equal weight to each lifestyle component score, which implicitly assumes that all components have the same relationship with a given disease outcome. This strategy could yield biased assessments of the lifestyle-outcome relationships, particularly for outcomes that are predominantly associated with one lifestyle component. In this study we introduced and examined outcome-specific HLIs that used outcome-specific weights reflecting the strength of the association between each component and the outcome.

Within the European Prospective Investigation into Cancer and nutrition (EPIC)³⁰, we compared results of analyses based on the standard and the outcome-specific HLIs in relation to the risk of T2D, CVD, overall cancer and all-cause mortality. We focused our evaluation on three standard epidemiological quantities: the hazard-ratio (HR), Harrell’s C-index, and population attributable fractions (PAFs) to reflect the strength of association, the discriminatory power, and the public health burden, respectively. This empirical comparison was complemented with a theoretical study of unweighted and weighted composite scores under simple linear causal models.

Methods

Study population

EPIC is an ongoing multicentric prospective study originally designed to study the relationship between diet and cancer risk³⁰. EPIC recruited over 500,000 men and women between 1992 and 2000 from 23 centers in 10 European countries. In our analyses we excluded participants from centres lacking information on occurrence or date of diagnosis of T2D or CVD (France, Norway, Greece and Malmö; n = 168,382), participants with no follow-up for mortality (n = 1746) or no information on lifestyle (n = 934), participants with missing information on the incidence of T2D, CVD, and/or cancer during follow-up (n = 63,842), participants with prevalent T2D, CVD, or cancer at recruitment (n = 23,864), and, for sake of simplicity, participants with missing information on any of the five variables used in the definition of HLI (n = 5786), defined as complete-case analysis.

Health-related outcomes

Data on vital status and incidence of T2D and CVD (coded using the 10th Edition of the International Classification of Diseases, ICD-10), and cancer (coded according to the International Classification of Diseases for Oncology, ICD-O-3) were collected by each participating centre, from inclusion in the study to a center- and outcome-specific last date of ascertainment^30,31,32. Dates of death were collected using record linkage with cancer registries, boards of health and death indices, or through active follow-up. Incident T2D cases, defined as E11 (ICD-10), were ascertained by a combination of self-report, linkage to primary care registers, secondary care registers, medication use (drug registers), hospital admissions, and mortality data³¹. CVD endpoints, defined as a composite of ischemic heart diseases (I20-I25), atrial fibrillation (I48), and cerebrovascular disease (I60-I69), were ascertained by different methods depending on the follow-up procedures by centre, using active follow-up through questionnaires or linkage with morbidity and hospital registries, or both³². Incident first primary cancer cases (excluding non-melanoma skin cancers) were identified through a combination of center-specific methods, including health insurance records, cancer and pathology registries and active follow-up through study participants and their next-of-kin. Follow-up for each participant and event of interest began upon inclusion in the study and ended upon the occurrence of the event, loss to follow-up, or the last date of ascertainment, whichever came first.

Assessment of lifestyle exposures at baseline

BMI (kg/m2) was derived from measured height and weight in all centers, except Oxford where it was self-reported³⁰. A validated index capturing all physical activity domains (Cambridge Index) was computed from physical activity during recreational activities and at work³³. Diet, including alcohol intake, was assessed using validated country- or center-specific dietary questionnaires designed to capture habitual consumption over the year preceding the study recruitment³⁰. To measure adherence to a healthy diet, we computed the modified relative Mediterranean Diet Score (mrMDS), a version of the original Mediterranean Diet Score incorporating vegetable oil instead of olive oil¹². To avoid redundancy with the alcohol component in the HLI, our version of mrMDS omitted alcohol intake. The remaining eight mrMDS components were measured in grams per 1000 kcal to express dietary intake as energy density¹². All dietary components were divided into country-specific tertiles and scores 0 to 2 were summed up, resulting in a final mrMDS ranging from 0 to 16 with increasing scores for healthier diets. Information on smoking status was obtained using lifestyle questionnaires³⁰, as was information on variables used for adjustments in our models, including educational attainment, menopausal status in women and the use of hormones in post-menopausal women.

Healthy lifestyle indices

Following the previous definition of the HLI used in a study of multi-morbidity in EPIC⁸, we considered HLIs that combined information on participants’ exposure to smoking, alcohol intake, diet, physical activity and adiposity. To facilitate the comparison of performance between the standard and outcome-specific HLIs, we used a binary scoring with 0/1 values reflecting unhealthy/healthy behavior for each component⁸, as displayed in Table 1. The standard HLI, ranging from 0 (unhealthiest behavior) to 5 (healthiest behavior), was defined as

$$standard\;HLI = Smoking^{{(0,1)}} + Alcohol^{{(0,1)}} + Diet^{{(0,1)}} + PA^{{(0,1)}} + Adipo^{{(0,1)}} .$$

Table 1 Binary and categorical scores used for the computation of the standard and outcome-specific HLIs, following a previous definition of the HLI⁸.

Full size table

To more accurately reflect the potential heterogenous relationships of each component with specific disease outcomes, outcome-specific HLIs were constructed using the same categorical scoring system. Data-driven weights were derived from the parameters of the main effects (${w}_{k}$) and of the interaction terms (${\gamma }_{l}$) in outcome-specific adjusted Cox models, implementing a forward selection procedure in EPIC to select relevant interaction terms among the lifestyle components. The outcome-specific HLI was defined as

$$Outcome-specific HLI{ } = \mathop \sum \limits_{{{\text{k}} = 1}}^{5} w_{k}^{*} *\left( {Summary\; Variable} \right)_{k} + \sum \gamma_{l}^{*} *\left( {Interaction \;Term} \right)_{l}$$

with weights ${w}_{k}^{*}$ and ${\gamma }_{l}^{*}$ corresponding to scaled versions of ${w}_{k}$ and ${\gamma }_{l}$ so that outcome-specific HLIs had unit variance and larger values correspond to healthier profiles.

We also considered a more comprehensive scoring system for each variable ranging from 0 (unhealthiest) to 4 (healthiest behavior), as displayed in Table 1, again following a previous definition of the HLI used in EPIC⁸.

Cox models

In all our analyses, Cox models used age as the main time scale and were stratified by study center, sex, and age at recruitment in 5-year categories. They were adjusted for education level (no schooling, primary, secondary, and university or more), height (continuous), and energy intake from non-alcoholic sources (kcal/day), and, for women, menopausal status (pre-menopausal, peri-menopausal, post-menopausal, surgical) and use of postmenopausal hormones (never, ever, unknown). For each outcome, one Cox model was constructed with all five score variables as the main exposures to derive the outcome-specific weights and the outcome-specific HLIs. Then, Cox models were constructed by considering, in turn, each version of the HLI as the main exposure. The HLI was consistently modelled in continuous using a linear term on the log-hazard-rate scale.

Evaluation criteria

HR estimates and discriminatory power

For each event, HR estimates and corresponding 95% confidence intervals (CIs) were computed for a 1-standard deviation (SD) increase of the different versions of the HLI. They allowed the comparison of the estimated association between overall adherence to a healthy lifestyle and the event being studied, depending on the version of the HLI being used. To further illustrate how risk stratification may be hindered when using an unweighted rather than an outcome-specific weighted HLI, we considered the 2⁵ = 32 lifestyle profiles corresponding to each possible combination of the five binary scores Smoking^(0–1), Alcohol^(0–1), Diet^(0–1), PA^(0–1) and Adipo^(0–1). Setting the unhealthiest profile {Smoking^(0–1) = 0, Alcohol^(0–1) = 0, Diet^(0–1) = 0, PA^(0–1) = 0, Adipo^(0–1) = 0} as the reference, we compared the HR estimates for the other 31 profiles produced by Cox models utilizing the standard and outcome-specific HLIs, respectively.

More generally, the discriminatory power of models based on the different versions of the HLI was quantified using Harrell’s C-index. HRs and Harrell’s C-indices were primarily computed in the full EPIC study population. For models based on outcome-specific HLIs, this amounted to evaluating them on the data used for their construction, which could create bias if overfitting was present. Cross-validation was applied to assess this bias: the EPIC study population was randomly split into (i) a training sample (75% of the total sample) where the outcome-specific weights were estimated, and (ii) a test sample (the remaining 25% of the total sample) where HRs and Harrell’s C-indices were computed. This process was repeated 10 times to prevent possible dependency on a single split³⁴. HRs and Harrell’s C-indices were averaged over these 10 repetitions and compared to the values obtained on the total EPIC study population to assess the bias magnitude.

Population attributable fractions

For each specific outcome, we computed PAFs at age a, defined as

$$PAF\left(a\right)= \frac{P\left(Y<a\right)-P\left({Y}^{\left(max\right)}<a\right)}{P\left(Y<a\right)}$$

(1)

Here, $P\left(Y<a\right)$ is the event risk before age a in the EPIC study population and $P\left({Y}^{\left(max\right)}<a\right)$ is the hypothetical event risk before age a in the counterfactual EPIC study population where, for all participants, all five lifestyle summary variables would have been set to their maximal possible values, while all other variables used for adjustment or stratification would have been set to their actual value observed in EPIC. Under technical conditions³⁵, $PAF\left(a\right)$ coincides with the proportion $P\left({Y}^{\left(max\right)}>a| Y<a\right)$ of events before age a that would have been prevented had all EPIC participants adhered to the “healthiest” behavior regarding all five lifestyle components. Absolute risks $P\left(Y<a\right)$ and counterfactual absolute risks $P\left({Y}^{\left(max\right)}<a\right)$ were estimated by averaging the individual risk predictions in the EPIC study population, and in the counterfactual populations, respectively. Non-parametric bootstrap based on 100 bootstrapped samples was used to estimate the corresponding 95% CI.

All analyses were performed using the R software, version 4.1.2. Given the nature of the weights used in the definition of the outcome-specific HLIs, models utilizing individual lifestyle scores would achieve similar discriminatory power and produce similar PAF estimates when compared to models based on outcome-specific HLIs.

Ethics

The EPIC study was conducted according to the Declaration of Helsinki and approved by the ethics committee at the International Agency for Research on Cancer (IARC) on 12 January 1995 and on 10 May 2017 (re-evaluation). Written informed consent was obtained from all subjects involved in the study.

Results

Study population

The final study population comprised 256,769 EPIC participants (Fig. 1), including 99,098 men (38.6%) and 157,671 women (61.4%) (Table 2). Average follow-up time and total number of incident events were 16.3 (SD = 3.4) years and 25,191 for all-cause mortality, 10.9 (2.3) years and 11,763 for T2D, 11.5 (2.8) years and 11,766 for CVD, and 14.3 (4.0) years and 34,159 for cancer.

Table 2 Main characteristics of the EPIC study population.

Full size table

HLIs based on binary scores

Outcome-specific HLIs

Weights of the main terms used in the definition of the outcome-specific HLIs were all positive, except the one of Alcohol^(0–1) for the T2D- and CVD- specific HLIs (Table 3). Smoking had the strongest impact on the all-cause mortality, CVD- and cancer- specific HLIs, while adiposity had the strongest impact on the T2D-specific HLI. Alcohol had the weakest impact on the all-cause mortality specific HLI, while it was diet for the T2D- and CVD- specific HLI, and physical activity for the cancer-specific HLI. Several interaction terms were selected in the outcome-specific HLIs. For example, Alcohol^(0–1) * Diet^(0–1) was selected with a negative weight in the simplified death- and CVD-specific HLIs (Table 3).

Table 3 Weights used in the construction of the outcome-specific HLI based on binary scores for the 5 individual lifestyle components.

Full size table

Supplementary Figure S1 presents the empirical distributions of the outcome-specific HLIs and, for comparison, of the standard HLI (after scaling it to a unit standard deviation). The distribution of the standard HLI was approximately symmetrical and centered around 3, while the distributions of all the outcome-specific HLIs were more skewed, with most values in the top range of the distributions.

HRs, risk stratification and discriminatory power

As displayed in Table 4, outcome-specific HLIs were more strongly associated with risk of the corresponding outcome than the standard HLI. For T2D for example, the HR was 0.71 (0.70, 0.72) and 0.60 (0.59, 0.61) per 1-SD increment of the standard and outcome-specific HLI, respectively. HR estimates for the 32 lifestyle profiles corresponding to each possible combination of the five binary scores highlighted that different lifestyle profiles leading to the same value of the standard HLI could be associated with sizably different hazard ratios (Fig. 2). Among all five lifestyle profiles leading to a standard HLI value of 1, the profile {Smoking^(0–1) = 1, Alcohol^(0–1) = 0, Diet^(0–1) = 0, PA^(0–1) = 0, Adipo^(0–1) = 0} had a much lower HR for all-cause mortality, and to a lesser extent, CVD and cancer, compared to the other four profiles, while the profile {Smoking^(0–1) = 0, Alcohol^(0–1) = 0, Diet^(0–1) = 0, PA^(0–1) = 0, Adipo^(0–1) = 1} had the lowest HR for T2D, thus mirroring the respective impacts of the lifestyle components on the different outcome-specific HLIs.

Table 4 HR (for a 1-SD increase) and Harrell’s C-index of the standard and outcome-specific HLIs for all-cause mortality, T2D, CVD and cancer.

Full size table

Models based on outcome-specific HLIs achieved a larger discriminatory power compared to those based on the standard HLI. For T2D, Harrell’s C-index was 0.67 (0.66, 0.68) and 0.61 (0.60, 0.62) for the models based on the outcome-specific and standard HLIs, respectively. Cross-validated estimates were similar to those computed on the total EPIC study population (Supplementary Table 2), suggesting low to null bias due to over-fitting in our main analysis.

Population attributable fractions

PAFs were consistently larger in analyses based on the standard HLI compared to those based on outcome-specific HLIs. The proportion of deaths by the age of 80 that would have been prevented had the whole population adhered to the “healthiest” lifestyle habits was estimated to be 33% (31%, 34%) and 23% (22%, 24%) when using the standard HLI and the mortality-specific HLI, respectively (Fig. 3). Similar patterns were observed for the other three outcomes (Figs. 3).

HLIs based on categorical scores

Overall, the “healthiest” categories of each individual component received the largest weights in the outcome-specific HLIs (Supplementary Table 2). Consistent with what we observed when using binary scores, smoking had the strongest impact on the all-cause mortality, CVD- and cancer- specific HLIs, while adiposity had the strongest impact on the T2D-specific HLI. The distribution of the standard HLI was approximately symmetrical around its mean value (Supplementary Fig. 2). The distributions of the all-cause mortality-, CVD- and cancer-specific HLIs were more skewed, with most values in the top range of the distributions, while the T2D-specific HLI had a multi-modal distribution. Patterns of differences in HR, C-index and PAF estimated from analyses based on standard and outcome-specific HLIs were similar, yet more pronounced, when utilizing categorical scores compared to the simplified setting of binary scores presented above. For T2D, the HR was 0.66 (0.64, 0.67) and 0.43 (0.42, 0.44) per 1-SD increment of the standard and outcome-specific HLIs, respectively, while the corresponding C-index were 0.63 (0.62, 0.64) and 0.72(0.72, 0.73). The proportion of deaths by the age of 80 that would have been prevented had the whole population adhered to the “healthiest” lifestyle habits was estimated to be 57% (55%, 58%) and 33% (32%, 34%) when considering the standard and death-specific HLI, respectively (Fig. 4).

Theoretical study of linear causal models

Our theoretical study under a linear causal model presented in the Supplementary Material showed that the regression parameter of the scaled weighted composite score was always larger than that of the scaled unweighted composite score, unless the weighted and unweighted composite scores coincided. Considering the analog of the PAF, we showed that analyses utilizing the unweighted composite score yielded either downward or upward biased estimates, while analyses based on the weighted composite score yielded unbiased estimates.

Discussion

In this study, we introduced a novel HLI to account for the magnitude of the relationships between individual lifestyle components and specific disease outcomes, using data-driven weights. The standard and the outcome-specific versions of the HLI were extensively compared by estimating the HR, the C-index and the PAF in a range of scenarios, involving the risk of cancer, T2D, CVD and premature mortality. Two strategies to operationalize the HLI were also investigated, involving in turn, binary indicators or categorical scores for the five components.

In our study, the discriminatory power was consistently larger for models based on the outcome-specific HLI than the standard HLI, sometimes to a large extent as in the case of T2D. The reason for this limitation of the standard HLI was clearly illustrated in Fig. 2, when considering binary indicators. As the standard HLI assumes that all lifestyle components are equally associated with the risk of disease, different lifestyle patterns with the same number of unhealthy components necessarily lead to the same predicted disease hazard rate in analyses based on the standard HLI. Conversely, our analyses utilizing the outcome-specific HLI reflected the disease hazard rate heterogeneity across these lifestyle patterns with the same number of unhealthy components. This limitation of the standard HLI in terms of discriminatory power highlights that it might be a suboptimal analytical choice for risk stratification and/or risk prediction³⁶, especially in situations where a given lifestyle component is strongly linked to the outcome under consideration, such as BMI in the case of T2D.

Most previous studies on the HLI used the standard HLI to address etiological questions, specifically to estimate disease-specific HRs to quantify the impact of adhering to healthy lifestyle habits, and disease-specific PAFs to measure the public health burden attributable to unhealthy lifestyles^{8,18,19,22,24}. In our study, we observed consistently weaker HR estimates for the standard HLI than the outcome-specific HLI, sometimes to a large extent, as for T2D. These results suggest that analyses utilizing outcome-specific HLIs are more likely to detect associations, particularly for diseases weakly associated with lifestyle habits. Conversely, PAF estimates were consistently larger when using the standard HLI. Estimating weaker HRs and larger PAFs with the standard HLI than the outcome-specific HLI may seem paradoxical, however our results from the theoretical study of linear causal models and the inspection of the empirical distributions of the standard and the outcome-specific HLIs displayed in Figure S1 might help clarify this apparent paradox. According to the binary version of standard HLI, 59% of the EPIC study population had a standard HLI lower or equal to 3 units, i.e., more than 2 standard-deviations below the maximum HLI of 5 units. As a result, the health benefits for this large proportion of participants, had they adhered to the healthiest possible lifestyle, led to large PAF estimates. On the other hand, according to the, say, death-specific HLI, 65% of the study population had an HLI value within one standard-deviation of the maximum HLI. As a result, the benefit in premature mortality had they adhered to the healthiest possible lifestyle was less remarkable, thus explaining the lower PAF estimates. In essence, analyses of the outcome-specific HLI mimics closely an analytical strategy where individual lifestyle components are evaluated jointly within the same model, and therefore yield similar PAF estimates. Thus, our results highlight that analyses based on standard HLI could lead to biased assessments of the public health burden attributable to unhealthy lifestyle. As mentioned in our theoretical study of linear causal models, it could be argued that utilizing standard HLIs might produce approximately valid estimates of PAFs of a latent variable, e.g., reflecting health-consciousness. Yet, the validity of this approach, particularly whether the standard HLI is a better proxy than weighted HLIs for this latent variable, would need further assessment.

The etiology of chronic diseases is complex, and some level of simplification via summary quantities is welcome in epidemiological research. To paraphrase Box’s aphorism, “all summarizations are wrong but some are useful”³⁷. To be useful, a summarization should produce approximately valid results. The validity of results in analyses based on the standard HLI could be assessed by comparing them to results of an outcome-specific HLI or the individual lifestyle components. If results are similar, the standard HLI could be appropriate as it does not rely on data-driven weights and it could facilitate the comparison of findings across studies and across health-related outcomes. However, the premise that standard HLIs would facilitate comparison across studies might be tempered in view of the myriad of versions of standard HLIs proposed in the literature^{5,6,9,10,17,18,38}.

Multiple lifestyle factors influence an individual’s health, but some are more critical than others, which should be reflected in public health recommendations. Towards this aim, the “healthiest” lifestyle profiles could be defined as the combinations of individual lifestyle behaviors associated with lowest risk of disease, longest life expectancy, or longest life expectancy free of a chronic disease⁹. The development and validation of an HLI using weights derived from meta-analyzed associations with disease risk, mortality or a composite outcome reflecting mortality and common chronic diseases would help the characterization and promotion of these healthiest profiles.

In line with previous versions of the standard HLI⁸, the HLI considered in this study was based on five individual components: smoking habits, alcohol intake, diet, physical activity, and adiposity. Refined statistical methods, e.g. using splines, could be used to combine the individual components into an outcome-specific HLI. Also, working with a refined categorization of these five components, including more descriptors, such as more refined information on smoking intensity or adiposity or a broader spectrum of dietary exposures, and/or including information on other lifestyle factors, such as quality of sleep^39,40 or stress, might lead to more accurate assessments of the relationship between lifestyle and health-related outcomes. The evaluation conducted in this study relied on the EPIC cohort, where the study populations in the various countries were generally more health-conscious than their source populations. We could not account for other major chronic diseases that could affect observed associations with our outcomes of interest because of a lack of such information in EPIC. For example, chronic obstructive pulmonary disease (COPD) frequently co-occurs with CVD and share tobacco smoking as a main risk factor⁴¹. These potential limitations were acknowledged, yet they were unlikely to affect the main conclusions of the study, which were corroborated by the evidence of our theoretical results under simple linear causal models.

Conclusions

The assessment of the relationship between a holistic composite score reflecting adherence to healthy lifestyle behaviors and the risk of disease and mortality could be improved by utilizing outcome-specific versions of the HLI. Standard HLIs can lead to biased assessments of the public health burden attributable to unhealthy lifestyles. The development and validation of data-driven HLIs best predicting the occurrence of disease outcomes is instrumental for the assessment of lifestyle and disease risk associations and the generation of accurate public health recommendations.

Data availability

The EPIC data is not publicly available, but access requests can be submitted to the Steering Committee (https://epic.iarc.fr/access/index.php).

References

Muller, D. C. et al. Modifiable causes of premature death in middle-age in Western Europe: results from the EPIC cohort study. BMC Med. 14, 87 (2016).
Article PubMed PubMed Central Google Scholar
Ibsen, D. B. et al. Modifiable Lifestyle recommendations and mortality in Denmark: A cohort study. Am J Prev Med. 60, 792–801 (2021).
Article PubMed Google Scholar
Stampfer, M. J., Hu, F. B., Manson, J. E., Rimm, E. B. & Willett, W. C. Primary prevention of coronary heart disease in women through diet and lifestyle. N Engl J Med. 343, 16–22 (2000).
Article CAS PubMed Google Scholar
Board I of M (US) and NRC (US) NCP, Curry SJ, Byers T, Hewitt M. Lifestyle Behaviors Contributing to the Burden of Cancer [Internet]. National Academies Press (US); 2003 [cited 2019 Oct 17]. Available from: https://www.ncbi.nlm.nih.gov/books/NBK223925/
Chiuve, S. E. et al. Primary prevention of stroke by healthy lifestyle. Circulation. 118, 947–954 (2008).
Article PubMed PubMed Central Google Scholar
Loef, M. & Walach, H. The combined effects of healthy lifestyle behaviors on all cause mortality: A systematic review and meta-analysis. Prev Med. 55, 163–170 (2012).
Article PubMed Google Scholar
Soerjomataram, I. et al. Cancers related to lifestyle and environmental factors in France in 2015. Eur J Cancer Oxf Engl 2018(105), 103–113 (1990).
Google Scholar
Freisling, H. et al. Lifestyle factors and risk of multimorbidity of cancer and cardiometabolic diseases: A multinational cohort study. BMC Med. 18, 5 (2020).
Article CAS PubMed PubMed Central Google Scholar
Li, Y. et al. Healthy lifestyle and life expectancy free of cancer, cardiovascular disease, and type 2 diabetes: Prospective cohort study. BMJ. 368, l6669 (2020).
Article PubMed PubMed Central Google Scholar
Li, Y. et al. Impact of healthy lifestyle factors on life expectancies in the US population. Circulation. 138, 345–355 (2018).
Article PubMed PubMed Central Google Scholar
Panagiotakos, D. B., Pitsavos, C. & Stefanadis, C. Dietary patterns: a Mediterranean diet score and its relation to clinical and biological markers of cardiovascular disease risk. Nutr. Metab. Cardiovasc. Dis. NMCD. 16, 559–568 (2006).
Article PubMed Google Scholar
Buckland, G. et al. Adherence to the Mediterranean diet and risk of coronary heart disease in the Spanish EPIC Cohort Study. Am J Epidemiol. 170, 1518–1529 (2009).
Article PubMed Google Scholar
Couto, E. et al. Mediterranean dietary pattern and cancer risk in the EPIC cohort. Br. J. Cancer. 104, 1493–1499 (2011).
Article CAS PubMed PubMed Central Google Scholar
Tindle, H. A. et al. Lifetime smoking history and risk of lung cancer: Results from the framingham heart study. J. Natl. Cancer Inst. 110, 1201–1207 (2018).
PubMed PubMed Central Google Scholar
Pleasants, R. A., Rivera, M. P., Tilley, S. L. & Bhatt, S. P. Both duration and pack-years of tobacco smoking should be used for clinical practice and research. Ann. Am. Thorac. Soc. 17, 804–806 (2020).
Article PubMed PubMed Central Google Scholar
Arnold, M. et al. Cancers in France in 2015 attributable to high body mass index. Cancer Epidemiol. 52, 15–19 (2018).
Article PubMed Google Scholar
Hu, F. B. et al. Diet, lifestyle, and the risk of type 2 diabetes mellitus in women. N Engl. J. Med. 345, 790–797 (2001).
Article CAS PubMed Google Scholar
McKenzie, F. et al. Healthy lifestyle and risk of cancer in the European prospective investigation into cancer and nutrition cohort study. Medicine (Baltimore). 95, e2850 (2016).
Article PubMed PubMed Central Google Scholar
Hu, P. et al. Effect of healthy lifestyle index and lifestyle patterns on the risk of mortality: A community-based cohort study. Front Med. https://doi.org/10.3389/fmed.2022.920760 (2022).
Article PubMed PubMed Central Google Scholar
Mozaffarian, D. et al. Lifestyle risk factors and new-onset diabetes mellitus in older adults. Arch. Intern. Med. 169, 798–807 (2009).
Article PubMed PubMed Central Google Scholar
Lv, J. et al. Adherence to a healthy lifestyle and the risk of type 2 diabetes in Chinese adults. Int. J. Epidemiol. 46, 1410–1420 (2017).
Article PubMed PubMed Central Google Scholar
Rajaobelina, K. et al. Population attributable fractions of the main type 2 diabetes mellitus risk factors in women: Findings from the French E3N cohort. J. Diabetes. 11, 242–253 (2019).
Article CAS PubMed Google Scholar
Aleksandrova, K. et al. Combined impact of healthy lifestyle factors on colorectal cancer: A large European cohort study. BMC Med. 12, 168 (2014).
Article PubMed PubMed Central Google Scholar
Naudin, S. et al. Healthy lifestyle and the risk of pancreatic cancer in the EPIC study. Eur. J. Epidemiol. 35, 975–986 (2020).
Article PubMed Google Scholar
Chen, S. L. F. et al. Combined lifestyle behaviors and the incidence of common cancer types in the norwegian women and cancer study (NOWAC). Clin. Epidemiol. 13, 721–734 (2021).
Article PubMed PubMed Central Google Scholar
Li, Q. et al. The associations of healthy lifestyle index with breast cancer incidence and mortality in a population-based study. Breast Cancer Tokyo Jpn. 29, 957–966 (2022).
Article Google Scholar
Peila, R. et al. Healthy lifestyle index and risk of pancreatic cancer in the Women’s Health Initiative. Cancer Causes Control CCC. 33, 737–747 (2022).
Article PubMed PubMed Central Google Scholar
Meer, R., van de Pol, J., van den Brandt, P. A. & Schouten, L. J. The association of healthy lifestyle index score and the risk of renal cell cancer in the Netherlands cohort study. BMC Cancer. 23, 156 (2023).
Article PubMed PubMed Central Google Scholar
Liu, G. et al. Adherence to a healthy lifestyle in association with microvascular complications among adults with type 2 diabetes. JAMA Netw Open. 6, e2252239 (2023).
Article PubMed PubMed Central Google Scholar
Riboli, E. et al. European Prospective Investigation into Cancer and Nutrition (EPIC): Study populations and data collection. Public Health Nutr. 5, 1113–1124 (2002).
Article CAS PubMed Google Scholar
InterAct Consortium et al. Design and cohort description of the InterAct Project: an examination of the interaction of genetic and lifestyle factors on the incidence of type 2 diabetes in the EPIC Study. Diabetologia. 54, 2272–2282 (2011).
Article Google Scholar
Danesh, J. et al. EPIC-Heart: the cardiovascular component of a prospective study of nutritional, lifestyle and biological factors in 520,000 middle-aged participants from 10 European countries. Eur J Epidemiol. 22, 129–141 (2007).
Article PubMed Google Scholar
Ekelund, U. et al. Physical activity and gain in abdominal adiposity and body weight: prospective cohort study in 288,498 men and women. Am J Clin Nutr. 93, 826–835 (2011).
Article CAS PubMed Google Scholar
Kuhn, M. & Johnson, K. Applied Predictive Modeling (Springer, 2013). https://doi.org/10.1007/978-1-4614-6849-3.
Book Google Scholar
Causality, P. J. Models, reasoning, and inference 2nd edn. (Cambridge University Press, 2009).
Google Scholar
Liao, J., Muniz-Terrera, G., Scholes, S., Hao, Y. & Chen, Y. Lifestyle index for mortality prediction using multiple ageing cohorts in the USA UK and Europe. Sci Rep. 8, 6644 (2018).
Article ADS PubMed PubMed Central Google Scholar
Box, G. E. P. Robustness in the Strategy of Scientific Model Building. In Robustness Stat (eds Launer, R. L. & Wilkinson, G. N.) 201–236 (Academic Press, 1979).
Chapter Google Scholar
Schulze, M. B. & Hu, F. B. Epidemiology of Diabetes. In Handb Epidemiol (eds Ahrens, W. & Pigeot, I.) 1–49 (Springer, 2019).
Google Scholar
Song, Z. et al. Association of healthy lifestyle including a healthy sleep pattern with incident type 2 diabetes mellitus among individuals with hypertension. Cardiovasc. Diabetol. 20, 239 (2021).
Article CAS PubMed PubMed Central Google Scholar
Guasch-Ferré, M. et al. Healthy lifestyle score including sleep duration and cardiovascular disease risk. Am. J. Prev. Med. 63, 33–42 (2022).
Article PubMed PubMed Central Google Scholar
Rabe, K. F., Hurst, J. R. & Suissa, S. Cardiovascular disease and COPD: Dangerous liaisons?. Eur. Respir. Rev. 27, 180057 (2018).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors thank the EPIC participants and PIs for their cooperation in providing invaluable data. We also thank Bertrand Hemon, Corinne Casagrande and Carine Biessy (International Agency for Research on Cancer) for managing the EPIC data. The authors additional thank the government of Asturias (Spain) and all EPIC centers in the 9 European countries for their ongoing support to the EPIC Study.

Disclaimer

Where authors are identified as personnel of the International Agency for Research on Cancer/World Health Organization, the authors alone are responsible for the views expressed in this article and they do not necessarily represent the decisions, policy, or views of the International Agency for Research on Cancer/World Health Organization.

Funding

This study was partially supported by the grant LIBERTY (AAP SHS-E-SP 2020, PI: P Ferrari) from the French Institut National du Cancer (INCa). The coordination of EPIC is financially supported by International Agency for Research on Cancer (IARC) and by the Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, which has additional infrastructure support provided by the NIHR Imperial Biomedical Research Centre (BRC). The national cohorts are supported by Danish Cancer Society (Denmark); Ligue Contre le Cancer, Institut Gustave Roussy, Mutuelle Générale de l’Education Nationale, Institut National de la Santé et de la Recherche Médi‐ cale (INSERM) (France); German Cancer Aid, German Cancer Research Center (DKFZ), German Institute of Human Nutrition Potsdam‐Rehbruecke (DIfE), Federal Ministry of Education and Research (BMBF) (Germany); Associazione Italiana per la Ricerca sul Cancro‐AIRC‐Italy, Compagnia di SanPaolo and National Research Council (Italy); Dutch Ministry of Public Health, Welfare and Sports (VWS), Netherlands Cancer Registry (NKR), LK Research Funds, Dutch Prevention Funds, Dutch ZON (Zorg Onderzoek Nederland), World Cancer Research Fund (WCRF), Statistics Netherlands (The Netherlands); Health Research Fund (FIS) ‐ Instituto de Salud Carlos III (ISCIII), Regional Govern‐ ments of Andalucía, Asturias, Basque Country, Murcia and Navarra, and the Catalan Institute of Oncology ‐ ICO (Spain); Swedish Cancer Society, Swedish Research Council and County Councils of Skåne and Västerbotten (Sweden); Cancer Research UK (14,136 to EPIC‐Norfolk; C8221/A29017 to EPIC‐Oxford), Medical Research Council (1,000,143 to EPIC‐Norfolk; MR/M012190/1 to EPIC‐Oxford) (UK). IDIBELL acknowledges support from the Generalitat de Catalunya through the CERCA Program. The funders were not involved in designing the study; collecting, analysing, and interpreting results; or writing and submitting the manuscript for publication.

Author information

Lluís Cirera
Present address: Centro de Investigación Biomédica en Red de Epidemiología y Salud Pública (CIBERESP), Madrid, Spain

Authors and Affiliations

International Agency for Research On Cancer (IARC-WHO), Lyon, France
Vivian Viallon, Heinz Freisling, Komodo Matta, Elisabete Weiderpass, Mattias Johansson & Pietro Ferrari
Department of Public Health, Aarhus University, Aarhus, Denmark
Anne Østergaard Nannsen & Christina C. Dahm
Danish Cancer Society Research Center, Copenhagen, Denmark
Anne Tjønneland & Anne Kirstine Eriksen
Department of Public Health, University of Copenhagen, Copenhagen, Denmark
Anne Tjønneland
Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), 69120, Heidelberg, Germany
Rudolf Kaaks & Verena A. Katzke
Department of Molecular Epidemiology, German Institute of Human Nutrition Potsdam-Rehbruecke, Nuthetal, Germany
Matthias B. Schulze
German Center for Diabetes Research (DZD), Neuherberg, Germany
Matthias B. Schulze
Institute of Nutritional Science, University of Potsdam, Nuthetal, Germany
Matthias B. Schulze
Institute for Cancer Research, Prevention and Clinical Network (ISPRO), Florence, Italy
Giovanna Masala
Cancer Registry Unit, Fondazione IRCCS Istituto Nazionale Dei Tumori, Milan, Italy
Giovanna Tagliabue
Unit of Medical Statistics, University “L. Vanvitelli”, Naples, Italy
Vittorio Simeon
Hyblean Association for Epidemiological Research, AIRE–ONLUS, Ragusa, Italy
Rosario Tumino
Unit of Cancer Epidemiology, Città Della Salute E Della Scienza University-Hospital, and Center for Cancer Prevention (CPO), Turin, Italy
Lorenzo Milani
Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
Jeroen W. G. Derksen & Yvonne T. van der Schouw
K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
Therese Haugdahl Nøst
Department of Community Medicine, UiT The Arctic University of Norway, Tromsø, Norway
Therese Haugdahl Nøst, Kristin Benjaminsen Borch & Torkjel M. Sandanger
Public Health Directorate, Asturias, Spain
J. Ramón Quirós
Escuela Andaluza de Salud Pública (EASP), Granada, Spain
Miguel Rodriguez-Barranco
Instituto de Investigación Biosanitaria Ibs.GRANADA, Granada, Spain
Miguel Rodriguez-Barranco
Centro de Investigación Biomédica en Red de Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
Miguel Rodriguez-Barranco & Marcela Guevara
Unit of Nutrition and Cancer, Catalan Institute of Oncology - ICO, L’Hospitalet de Llobregat, Barcelona, Spain
Catalina Bonet
Nutrition and Cancer Group; Epidemiology, Public Health, Cancer Prevention and Palliative Care Program, Bellvitge Biomedical Research Institute - IDIBELL, L’Hospitalet de Llobregat, Barcelona, Spain
Catalina Bonet
Sub Directorate for Public Health and Addictions of Gipuzkoa, Ministry of Health of the Basque Government, San Sebastián, Spain
Amaia Aizpurua-Atxega
Epidemiology of Chronic and Communicable Diseases Group, Biodonostia Health Research Institute, San Sebastián, Spain
Amaia Aizpurua-Atxega
Department of Epidemiology, Murcia Regional Health Council, IMIB-Arrixaca, Murcia, Spain
Lluís Cirera
Instituto de Salud Pública y Laboral de Navarra, 31003, Pamplona, Spain
Marcela Guevara
Navarra Institute for Health Research (IdiSNA), 31008, Pamplona, Spain
Marcela Guevara
Department of Public Health and Clinical Medicine, Umeå University, Umeå, Sweden
Björn Sundström
Department of Public Health and Clinical Medicine, Sustainable Health, Umeå University, Umeå, Sweden
Anna Winkvist
Department of Internal Medicine and Clinical Nutrition, The Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
Anna Winkvist
Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK
Alicia K. Heath & Marc J. Gunter

Authors

Vivian Viallon
View author publications
You can also search for this author in PubMed Google Scholar
Heinz Freisling
View author publications
You can also search for this author in PubMed Google Scholar
Komodo Matta
View author publications
You can also search for this author in PubMed Google Scholar
Anne Østergaard Nannsen
View author publications
You can also search for this author in PubMed Google Scholar
Christina C. Dahm
View author publications
You can also search for this author in PubMed Google Scholar
Anne Tjønneland
View author publications
You can also search for this author in PubMed Google Scholar
Anne Kirstine Eriksen
View author publications
You can also search for this author in PubMed Google Scholar
Rudolf Kaaks
View author publications
You can also search for this author in PubMed Google Scholar
Verena A. Katzke
View author publications
You can also search for this author in PubMed Google Scholar
Matthias B. Schulze
View author publications
You can also search for this author in PubMed Google Scholar
Giovanna Masala
View author publications
You can also search for this author in PubMed Google Scholar
Giovanna Tagliabue
View author publications
You can also search for this author in PubMed Google Scholar
Vittorio Simeon
View author publications
You can also search for this author in PubMed Google Scholar
Rosario Tumino
View author publications
You can also search for this author in PubMed Google Scholar
Lorenzo Milani
View author publications
You can also search for this author in PubMed Google Scholar
Jeroen W. G. Derksen
View author publications
You can also search for this author in PubMed Google Scholar
Yvonne T. van der Schouw
View author publications
You can also search for this author in PubMed Google Scholar
Therese Haugdahl Nøst
View author publications
You can also search for this author in PubMed Google Scholar
Kristin Benjaminsen Borch
View author publications
You can also search for this author in PubMed Google Scholar
Torkjel M. Sandanger
View author publications
You can also search for this author in PubMed Google Scholar
J. Ramón Quirós
View author publications
You can also search for this author in PubMed Google Scholar
Miguel Rodriguez-Barranco
View author publications
You can also search for this author in PubMed Google Scholar
Catalina Bonet
View author publications
You can also search for this author in PubMed Google Scholar
Amaia Aizpurua-Atxega
View author publications
You can also search for this author in PubMed Google Scholar
Lluís Cirera
View author publications
You can also search for this author in PubMed Google Scholar
Marcela Guevara
View author publications
You can also search for this author in PubMed Google Scholar
Björn Sundström
View author publications
You can also search for this author in PubMed Google Scholar
Anna Winkvist
View author publications
You can also search for this author in PubMed Google Scholar
Alicia K. Heath
View author publications
You can also search for this author in PubMed Google Scholar
Marc J. Gunter
View author publications
You can also search for this author in PubMed Google Scholar
Elisabete Weiderpass
View author publications
You can also search for this author in PubMed Google Scholar
Mattias Johansson
View author publications
You can also search for this author in PubMed Google Scholar
Pietro Ferrari
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors’ responsibilities were as follows: PF and VV conceived, designed, and supervised the research. VV analysed the data. PF, HF, KM and VV were responsible for drafting the manuscript. AON, CCD, AT, AKE, RK, VK, MBS, GM, GT, VS, RT, LM, JWGD, YTS, THN, KBB, TMS, JRQ, MRB, CB, AAA, LC, MG, BS, AW, AKH, MJG, EW and MJ provided the original data, information on the respective populations, and advice on the study design, analysis, and interpretation of the results. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Vivian Viallon.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Viallon, V., Freisling, H., Matta, K. et al. On the use of the healthy lifestyle index to investigate specific disease outcomes. Sci Rep 14, 16330 (2024). https://doi.org/10.1038/s41598-024-66772-w

Download citation

Received: 24 October 2023
Accepted: 03 July 2024
Published: 15 July 2024
DOI: https://doi.org/10.1038/s41598-024-66772-w
Springer Nature Limited

On the use of the healthy lifestyle index to investigate specific disease outcomes

Abstract

Similar content being viewed by others

Combined lifestyle factors, incident cancer, and cancer mortality: a systematic review and meta-analysis of prospective cohort studies

Cardiovascular health and cancer mortality: evidence from US NHANES and UK Biobank cohort studies

Healthy lifestyle change and all-cause and cancer mortality in the European Prospective Investigation into Cancer and Nutrition cohort

Introduction

Methods

Study population

Health-related outcomes

Assessment of lifestyle exposures at baseline

Healthy lifestyle indices

Cox models

Evaluation criteria

HR estimates and discriminatory power

Population attributable fractions

Ethics

Results

Study population

HLIs based on binary scores

Outcome-specific HLIs

HRs, risk stratification and discriminatory power

Population attributable fractions

HLIs based on categorical scores

Theoretical study of linear causal models

Discussion

Conclusions

Data availability

References

Acknowledgements

Disclaimer

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation