Pre-existing and machine learning-based models for cardiovascular risk prediction

Cho, Sang-Yeong; Kim, Sun-Hwa; Kang, Si-Hyuck; Lee, Kyong Joon; Choi, Dongjun; Kang, Seungjin; Park, Sang Jun; Kim, Tackeun; Yoon, Chang-Hwan; Youn, Tae-Jin; Chae, In-Ho

doi:10.1038/s41598-021-88257-w

Pre-existing and machine learning-based models for cardiovascular risk prediction

Article
Open access
Published: 26 April 2021

Volume 11, article number 8886, (2021)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Pre-existing and machine learning-based models for cardiovascular risk prediction

Download PDF

Sang-Yeong Cho¹,
Sun-Hwa Kim²,
Si-Hyuck Kang^2,3,
Kyong Joon Lee⁴,
Dongjun Choi⁴,
Seungjin Kang⁵,
Sang Jun Park⁶,
Tackeun Kim⁷,
Chang-Hwan Yoon^2,3,
Tae-Jin Youn^2,3 &
…
In-Ho Chae^2,3

8083 Accesses
33 Citations
Explore all metrics

Abstract

Predicting the risk of cardiovascular disease is the key to primary prevention. Machine learning has attracted attention in analyzing increasingly large, complex healthcare data. We assessed discrimination and calibration of pre-existing cardiovascular risk prediction models and developed machine learning-based prediction algorithms. This study included 222,998 Korean adults aged 40–79 years, naïve to lipid-lowering therapy, had no history of cardiovascular disease. Pre-existing models showed moderate to good discrimination in predicting future cardiovascular events (C-statistics 0.70–0.80). Pooled cohort equation (PCE) specifically showed C-statistics of 0.738. Among other machine learning models such as logistic regression, treebag, random forest, and adaboost, the neural network model showed the greatest C-statistic (0.751), which was significantly higher than that for PCE. It also showed improved agreement between the predicted risk and observed outcomes (Hosmer–Lemeshow χ² = 86.1, P < 0.001) than PCE for whites did (Hosmer–Lemeshow χ² = 171.1, P < 0.001). Similar improvements were observed for Framingham risk score, systematic coronary risk evaluation, and QRISK3. This study demonstrated that machine learning-based algorithms could improve performance in cardiovascular risk prediction over contemporary cardiovascular risk models in statin-naïve healthy Korean adults without cardiovascular disease. The model can be easily adopted for risk assessment and clinical decision making.

Machine learning methodologies versus cardiovascular risk scores, in predicting disease risk

Article Open access 29 December 2018

Machine learning and atherosclerotic cardiovascular disease risk prediction in a multi-ethnic population

Article Open access 23 September 2020

Using machine learning-based algorithms to construct cardiovascular risk prediction models for Taiwanese adults based on traditional and novel risk factors

Article Open access 22 July 2024

Discover the latest articles, news and stories from top researchers in related subjects.

Introduction

Cardiovascular disease (CVD) is the leading cause of illness and death worldwide^{1, 2}. Several risk assessment tools have been proposed to accurately predict the risk of CVD, among which the Framingham risk score (FRS), pooled cohort equation (PCE), systematic coronary risk evaluation (SCORE), and QRISK3 are widely used^3,4,5,6. The individual assessment of cardiovascular risk is a fundamental step for CVD prevention. Contemporary guidelines for primary prevention highly recommend the use of risk calculators to assess the risk of individuals and guide the intensity of preventive interventions^7,8,9,10. However, there is still room for improvement in their accuracy: the area under the curve (AUC) has been shown to be between 0.65 and 0.85^11,12,13. In addition, the overestimation of CVD risk, as well as underestimation, have been reported for specific individuals and population subgroups.

Recent years have seen remarkable advances in the application of machine learning (ML) in healthcare and medical research, thanks to high-performance computers¹⁴. ML is a method of data analysis that automates model building based on patterns and inferences with no prior explicit instructions. The increasing volume and complexity of healthcare information call for the application of big data analytics. ML methods have been increasingly applied in imaging interpretation and shown promising results^{15, 16}. They can also be used to develop prediction models from existing data to yield highly accurate results¹⁷.

This study was designed to assess the calibration and discrimination of pre-existing CVD risk algorithms among Korean adults naïve to cholesterol-lowering therapy. In addition, we developed ML-based risk prediction models and compared their performance with that of contemporary algorithms.

Related research

Several studies were conducted to verify the pre-existing CVD risk model. The Copenhagen study compared PCE and SCORE¹². The discrimination function was considered good with C-statistics ranging from 0.71 to 0.85 for PCE and 0.69–0.84 for SCORE. The predicted/observed event ratio was 1.2 for PCE and 5.0 for SCORE, which raises an issue of overestimation. A recent study based on individual-level meta-analysis showed simple recalibration of the pre-existing risk models may help¹¹. C-index was shown to range from 0.7010 to 0.7605. To date, only limited number of studies have applied ML techniques for cardiovascular risk prediction in the general population. A study from the Multi-Ethnic Study of Atherosclerosis (MESA) cohort used the random survival forest technique to identify the importance of subclinical disease markers, such as tissue necrosis factor-α receptor, coronary artery calcium score, and carotid ultrasound findings for cardiovascular outcomes¹⁷. Another study from the MESA cohort utilized support vector machine algorithms, which showed markedly improved discrimination over the PCE model using same parameters of PCE model¹⁸. Weng et al. also showed improved risk prediction by using ML algorithms from a prospective cohort of 378,256 patients, in which 22 more variables were used in addition to the 8 parameters from PCE¹⁹.

However, there is still controversy regarding the role of ML for clinical prediction. A meta-analysis of 71 studies demonstrated no definite evidence of superior performance of ML over logistic regression²⁰. The authors claimed that model validation procedures are often not sound or not well reported, and that it hampers a fair model comparison. Hot debates followed^21,22,23. Despite general optimism about the impact of artificial intelligence, experts think there are still substantial barriers in the real world such as lack of expertise and inadequate regulation²⁴.

Cardiovascular risk prediction is one of the fields that improved risk prediction algorithm can benefit the largest population at risk. Conventional cardiovascular risk calculators are basically based on logistic regression. In this study, we tested multiple ML models and sought to evaluate how much they can improve performance. The advantage of the new model was validated using multiple metrics including discrimination, calibration, and decision curve analysis.

Results

Characteristics of the study population

The PCE cohort was the main analysis cohort, in which 222,998 individuals with no previous history of atherosclerotic CVD were included (Fig. 1). Their mean age was 58.0 years, 58.1% were men, 5.5% had diabetes mellitus, and 21.1% were receiving antihypertensive treatment (Table 1). During the 5-year follow-up, 7819 subjects experienced atherosclerotic CVD events (event rate: 3.51%) (Supplementary Table S1).

Table 1 Baseline profiles of the study population.

Full size table

The FRS, SCORE, and QRISK3 cohorts had 180,305, 166,824, and 196,970 individuals, respectively, who matched the target population of each scoring system. Although the risk profiles did not differ largely across the cohorts, there were several distinctions such as no atrial fibrillation in the PCE cohort and no diabetes or chronic kidney disease in the SCORE cohort. Study endpoints were also defined separately in each cohort according to each system. Accordingly, 5-year event rates varied from 0.30% in the SCORE cohort-where only cardiac death was counted to 3.51% in the PCE cohort where hard atherosclerotic CVD was counted.

Performance of pre-existing risk prediction models

Figure 2A,B shows the discrimination and calibration of the pre-existing models in each corresponding cohort. All models showed moderate to good discriminatory function with c-statistics between 0.70 and 0.80. In the PCE cohort, the equations for whites outperformed the ones for African Americans (C-statistics [95% confidence intervals (CIs)], 0.741 [0.735–0.747] and 0.732 [0.726–0.737]; p < 0.001). Calibration was plotted for the incidence rate per 1000 person-years against the 10-year predicted risk. PCE showed the best calibration: PCE for whites underestimated the risk in the lower 3 deciles, while overestimation occurred in deciles 7 through 10. FRS, SCORE, and QRISK3 were shown to overestimate the risk compared to the observed incidence rates.

Performance of machine learning algorithms to the pooled cohort equation cohort

ML-based algorithms were applied to the PCE cohort. The performance of the ML-based algorithms are detailed in Table 2, and graphically shown in Fig. 3. The Brier score was between 0.030 and 0.032 across PCE and ML-based models. The neural network and logistic regression showed significantly improved discrimination compared to PCE for whites. The neural network exhibited the highest C-statistics (0.751 [95% CIs 0.740‒0.761]), which was significantly greater than that of any other models except logistic regression (Supplementary Table S2). The difference in C-statistics between the neural network and logistic regression was marginal (p = 0.071). A sensitivity analysis was performed with the neural network using 8 variables (age, sex, systolic pressure, total cholesterol, high-density lipoprotein cholesterol, smoking status, history of diabetes, and antihypertensive medication use), which also showed significantly improved discrimination compared to PCE.

Table 2 Performance of machine-learning based risk prediction models in the test set of the pooled cohort equation cohort.

Full size table

Calibration was improved with logistic regression, AdaBoost, and the neural network. The Hosmer–Lemeshow χ² values were 171.1, 15.3, 19.9, and 86.1 for PCE for whites, logistic regression, AdaBoost, and the neural network, respectively. Decision-curve analysis showed that ML-based algorithms provided an incremental net benefit across a range of thresholds (Fig. 4). The net benefit values at a threshold of 5% were shown to be 0.0072, 0.0079, 0.0074, and 0.0078 for PCE for whites, logistic regression, AdaBoost, and the neural network, respectively. At this particular cutoff, the neural network-based model would lead to 6 more treatments per 10,000 patients at the same number of unnecessary treatments compared to PCE for whites.

Performance of machine learning algorithms in other cohorts

Logistic regression and the neural network were also applied to the remaining cohorts (FRS, SCORE, and QRISK3 cohorts) (Supplementary Table S3). Logistic regression and the neural network showed significantly higher C-statistics than FRS, and logistic regression showed significantly higher C-statistics than SCORE. No ML algorithms outperformed the pre-existing prediction model in the QRISK3 cohort.

Discussion

In this study, we found that pre-existing risk models showed acceptable performance in predicting cardiovascular risk in real-world Korean adults who were free from CVD and naïve to statin therapy. However, they were mostly shown to overestimate individual risk and to have moderate to good discrimination. On the other hand, models using ML techniques were shown to improve cardiovascular risk prediction. Algorithms using logistic regression, AdaBoost, and the neural network showed significantly higher discrimination and better calibration than pre-existing calculators.

Prevention is the most effective way to reduce the impact of CVD²⁵. Current guidelines recommend that the assessment of CVD risk should be the start of cardiovascular risk-reducing strategies. The Third Report of the Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults guidelines recommended the use of the FRS²⁶. European guidelines recommend risk assessment via the SCORE system^{9, 10}, the United States’ guidelines advocate for the PCE^{4, 7}, and QRISK has been endorsed by the National Institute for Health and Clinical Excellence in the United Kingdom. Risk prediction is considered the key component in deciding treatment strategies. The American College of Cardiology /American Heart Association (ACC/AHA) guidelines for high blood pressure recommend medical treatment for primary prevention if a patient with hypertension (defined as ≥ 130/80 mm Hg) has an estimated 10-year atherosclerotic CVD risk of ≥ 10%⁸. Similarly, statin therapy should be considered in adults with a 10-year atherosclerotic CVD risk of ≥ 7.5% according to the ACC/AHA guidelines on blood cholesterol⁷.

The performance of risk prediction models has been validated by a number of studies^{11,12,13, 27, 28}. Similarly, our study demonstrated the competency of risk prediction algorithms in the real world. All pre-existing models showed C-statistics of 0.70‒0.80 for their dedicated endpoints. PCE showed relatively good agreement between the predicted risk and observed event rates, while FRS, SCORE, and QRISK3 were shown to overestimate risk in this study population. Previous studies on the Korean population have also shown that the accuracy of preexisting models was fairly good^{29, 30}.

Our study showed that several ML techniques including the neural network led to improved cardiovascular risk discrimination and calibration as well as net benefit. The AUC of the neural network was + 0.13 compared to that of PCE for whites while calibration was significantly better. In addition, the improved performance also resulted in net clinical benefit: better classifying the patients who require blood pressure-lowering or lipid-lowering therapy. An artificial neural network solves a problem through the learning process by controlling the strengths of connections between complexly intertwined neurons. The learning process is similar to human learning, memory, and inference. Its advantages include identifying arbitrary nonlinear multiparametric discriminant functions. In this manner, neural networks enable the learning of highly complex functions and accurate predictions for complex decision-making problems³¹.

Although ML-based models were shown to have better prediction capabilities, there may be criticism regarding the performance of ML-based algorithms. Firstly, ML-based algorithms typically use large numbers of variables, some of which are not routinely recorded in clinical practice. Conventional risk prediction models have been developed to be broadly used cost-effectively, and therefore, use only a small number of essential variables. However, our sensitivity analysis showed that even after limiting the number of variables, ML-based algorithms still showed better performance than conventional models. Secondly, although there was an improvement, the absolute degree of improvement was small. The neural network model showed significantly increased C-statistics compared to PCE, but the absolute increase was no greater than + 1.3%. Although statistically significant, it is reasonable to assume that this was only a modest improvement. However, ML, especially the artificial neural network, is expected to provide better data interpretation and risk prediction as the volume of medical information exponentially increases.

This study has several limitations. Firstly, only 5-year follow-up data were available in the present study. Most risk prediction models aim to predict 10-year outcomes. However, the use of population-based data allowed for a large sample of statin-naïve healthy adults without CVD. Most contemporary prospective studies are not free from potential bias associated with statin use, which may cause an effect modification. Secondly, the study is not free from selection bias since the study population was chosen from the recipients of the general health screening program. However, the national insurance system covers 97% of Korean residents. The health screening program included 51.2% of the recipients in 2009 and 54.1% in 2010 according to the national statistics³². Thirdly, there is a potential risk of misclassification bias as many covariates and outcomes were defined using claims information³³. For example, the status of blood pressure-lowering treatment may have changed during the follow-up duration, which was not considered in the model.

Pre-existing risk prediction models, such as the FRS, SCORE, PCE, and QRISK3, showed good performance in statin-naïve healthy Korean adults without CVD. This study suggests that ML-based cardiovascular risk prediction algorithms offer improved discrimination and calibration over contemporary models. Future studies are required to test the feasibility and usefulness of our models in the real-world clinical practice.

Methods

The data reported in this article are available to other researchers via application to the National Health Insurance Sharing Service (https://nhiss.nhis.or.kr/) for purposes of reproducing the results or replicating the procedure.

Data source and study individuals

The study subjects were extracted from the National Health Insurance Service-Health Screening (NHIS-HEALS) cohort from Korea. The cohort design and profiles have been reported previously³⁴. In brief, the insurance system covers 97% of Korean residents. General health screening programs are provided to all insured adults aged 40 years or older every 2 years for the prevention and early detection of major diseases. The National Health Insurance Service-Health Screening cohort includes 514,866 individuals who participated in health screening programs from 2002 to 2015.

Individuals who participated in the health screening program between 2009 and 2010 were chosen for this study. This selection of time period was to ensure a complete 5-year follow-up because the screening program started including fasting serum lipid levels (total cholesterol, triglycerides, high-density lipoprotein cholesterol, and low-density lipoprotein cholesterol) in 2009. Follow-up data until December 2015 are provided for the cohort. In line with the target population of contemporary scoring systems, selection criteria included (1) age between 40 and 79 years, (2) no previous diagnosis of CVDs, such as myocardial infarction, ischemic stroke, and congestive heart failure, (3) Those with angina who received coronary revascularization therapy, such as percutaneous coronary intervention and coronary artery bypass surgery were excluded. (4) In addition, to avoid bias caused by statin therapy, individuals who had been receiving a statin before the screening or started statin therapy during the study period before the obtaining of the study outcomes were also excluded.

Next, 4 separate cohorts were built following the intended target population and outcome definitions of each scoring system: the FRS, PCE, SCORE, and QRISK3 cohorts (Fig. 3). The definitions of the cohort population and study outcomes are detailed in Supplementary Table S4. The PCE cohort was the main target of analysis and results from the FRS, SCORE, and QRISK3 cohorts were provided for sensitivity analyses. The Seoul National University Bundang Hospital’s institutional review board determined that our study was exempt from review (X-1708-417-911). The present study was performed in accordance with the Declaration of Helsinki and the need for informed consent was waived.

Risk factor variables and risk score calculations

Sixteen variables were selected as risk factors: 8 variables that were commonly used in the established risk prediction models, and 8 variables used in only QRISK3. The 8 common variables included age, sex, systolic blood pressure, total cholesterol, high-density lipoprotein cholesterol, smoking status, history of diabetes, and antihypertensive medication use. Demographic characteristics such as age and sex were extracted from the enrolment status database. Systolic blood pressure, total cholesterol level, and high-density lipoprotein cholesterol level were derived from the results of the health screening program. Smoking status and the amount of smoking were identified using self-report questionnaires. Histories of diabetes and hypertension medication use were identified using previous claims data from 2002 until the date of enrollment. The 8 variables from the QRISK3 algorithm were steroid use, body mass index (kg/m²), atrial fibrillation/flutter, migraine, systemic lupus erythematosus, rheumatic arthritis, atypical antipsychotic use, and chronic kidney disease (Supplementary Table S5). Erectile dysfunction and schizophrenia, which are also used in the QRISK3 algorithm, were not included in this study because as the accuracy of the former has not been validated and the latter was not available from the NHIS-HEALS cohort due to privacy issues. No imputations were applied for continuous variables (age, systolic blood pressure, total cholesterol, high-density lipoprotein cholesterol, and body mass index), and subjects with any missing values and outliers were removed from the cohort.

Four types risk prediction scores were calculated with equation-based methods using patients’ baseline data: the FRS, PCE, SCORE, and QRISK3 (Supplementary Table S4)^3,4,5,6. PCE was originally developed to obtain 10-year cardiovascular risk. The predicted risk at 5 years was calculated using parameters that were published previously by Muntner et al.²⁷ Because Asian ethnicity is not represented in the PCE, both the equations (one for whites and the other for African-Americans) were calculated. Similarly, two risk calculators of the SCORE (one for low-risk populations, and the other for high-risk populations) were studied.

Outcome

The study endpoints were defined separately in each cohort following the definitions of each algorithm (Supplementary Table S5). The PCE cohort was the main study cohort, where the endpoint was first hard atherosclerotic CVD (defined as cardiac death, non-fatal myocardial infarction, and fatal or nonfatal stroke). Mortality was determined from the National Death Index by linking identification codes to the corresponding individual. Cardiac death was defined as death due to cardiovascular etiology. Nonfatal myocardial infarction and ischemic stroke were determined with claims records. Myocardial infarction was defined by discharge diagnosis codes I21 and I23 of the International Classification of Diseases, 10th Revision (ICD‐10). Stroke was defined as a discharge diagnosis (ICD-10-code, I63) of patients who needed hospitalization and underwent brain imaging, such as computed tomography and magnetic resonance imaging. Individuals were followed up until death from any cause or until the end of the cohort study (December 2015). Endpoint definitions of the FRS, SCORE, and QRISK3 cohorts are summarized in Supplementary Table S6.

Machine-learning algorithms

ML-based prediction models were developed to assess the participants’ 5-year risk for atherosclerotic CVD. Each cohort was partitioned into training/validation and test datasets in a 7:3 ratio using permutation. During the learning phase, the training/validation dataset was again divided into training and validation sets in an 8:2 ratio. The low overall event rate of CVD in the dataset posed the potential risk of biased predictions and misleading accuracy. Random oversampling was performed to develop a more balanced datasets during the training stage. We also obtained Cox-proportional hazard ratio to evaluate the association between 16 variables and endpoint in the PCE cohort (Supplementary Table S7). The predicted probability was a number between 0 and 1. Receiver operating characteristic curves were constructed, and the optimal cutoff value was determined by calculating Youden’s index for each model. Logistic regression and three other types of ML algorithms, including TreeBag, random forest, and neural networks, were pre-planned. One ML algorithm (AdaBoost) was added during the analysis. Logistic regression, which is also considered as an ML algorithm, uses a linear equation with independent predictors to predict a value³⁵. TreeBag and random forest are algorithms that combine a multitude of decision trees via bagging^{36, 37}. While random forest improves variance by reducing the correlation between trees, TreeBag uses random selection of variables for the best split at each node. AdaBoost combines weak learners into a weighted sum that represents the final output³⁸. Neural networks are statistical learning algorithms mimicking the biological neuron system³⁹. All ML algorithms were built using the R program. Supplementary Methods S1 section further elaborates on the machine learning techniques. The detailed architecture used in the neural networks is also described in Supplementary Methods and Supplementary Figure S1. The number of hidden layers and neurons in the layers were chosen empirically using the training/validation set. The consistency of the models was confirmed using fivefold cross-validation. The main models were constructed using the 16 baseline variables. A sensitivity analysis was done with models using the 8 variables that are commonly used in pre-existing prediction models.

Statistical analysis

Analyses were performed separately in each cohort. Clinical characteristics are presented as numbers and percentages for categorical variables and means ± standard deviation for continuous variables. The performance of the contemporary and ML-based risk prediction models was assessed with respect to discrimination, calibration, and net benefit. Discrimination and calibration are the most commonly used parameters in risk prediction models. The overall performance was assessed using the Brier score, which was calculated as the squared differences between actual binary outcomes and predicted probabilities⁴⁰. A lower score represented higher accuracy.

C-statistics and the 95% CIs were provided, to estimate the discrimination of the models. Delong’s test was used to compare two correlated C-statistics⁴¹. Predicted and observed event rates were compared for each model. Predictive accuracy, sensitivity, specificity, positive predictive values, negative predictive values, and F1 score were calculated, as shown below.

$${\text{Predictive accuracy}} = \frac{TP + TN}{{TP + FP + FN + TN}}$$

$${\text{Sensitivity}} = \frac{TP}{{TP + FN}}$$

$${\text{Specificity}} = { }\frac{TN}{{TN + FP}}$$

$${\text{Positive predictive value}} = \frac{TP}{{TP + FP}}$$

$${\text{Negative predictive value}} = \frac{TN}{{TN + FN}}$$

$${\text{F}}1{\text{ score}} = 2 \times \frac{{\left( {Senstivity \times Positive predictive value} \right)}}{{\left( {Senstivity + Positive predictive value} \right)}},$$

where TP indicates true positive, TN indicates true negative, FP indicates false positive and TN indicates true negative.

The goodness-of-fit (calibration) of the models was tested with the modified Hosmer–Lemeshow χ² statistic⁴². Study subjects were divided into deciles based on their predicted risk. For pre-existing prediction models, the observed incidence rate per 1000 person-years was compared against the predicted 10-year risk in each cohort. Incidence rates per 1000 person-years were calculated by dividing the number of events that occurred during the follow-up period. Calibration of the ML-based algorithms and PCE was determined using the predicted and observed numbers of events at 5 years in the PCE cohort.

Decision-curve analysis was used to quantify the clinical usefulness of each prediction model in the PCE cohort⁴³. A threshold probability indicates the relative weight of the harms of a false positive at which a patient would opt for treatment expecting its benefit. The net benefit of a model was calculated as the difference between the proportion of true positives and the proportion of false positives weighted by the odds of the selected threshold. Then net benefit was plotted across different threshold probabilities. A model that provides higher net benefit at a particular threshold is preferred. The net benefit was presented at cutoffs of 3.75% and 5%, which correspond to 7.5% and 10% thresholds, respectively, in blood cholesterol and high blood pressure guidelines^{7, 8}.

Two-sided P values of less than 0.05 were considered statistically significant. All statistical analyses were performed with R programming version 3.3.3 (R Foundation for Statistical Computing, Vienna, Austria).

Abbreviations

ACC/AHA:: American College of Cardiology /American Heart Association
AUC:: Area under curve
CIs:: Confidence intervals
CVD:: Cardiovascular disease
FRS:: Framingham risk score
ICD‐10:: International Classification of Diseases, 10th Revision
MESA:: Multi-Ethnic Study of Atherosclerosis
ML:: Machine learning
NHIS-HEALS:: National Health Insurance Service-Health Screening
PCE:: Pooled cohort equation
SCORE:: Systematic coronary risk evaluation

References

Benjamin Emelia, J. et al. Heart disease and stroke statistics—2019 update: A report from the American Heart Association. Circulation 139, e56–e66. https://doi.org/10.1161/CIR.0000000000000659 (2019).
Roth, G. A. et al. Global, regional, and national burden of cardiovascular diseases for 10 causes, 1990 to 2015. J. Am. Coll. Cardiol. 70, 1–25. https://doi.org/10.1016/j.jacc.2017.04.052 (2017).
Article PubMed PubMed Central Google Scholar
D’Agostino, R. B. Sr. et al. General cardiovascular risk profile for use in primary care: The Framingham Heart Study. Circulation 117, 743–753. https://doi.org/10.1161/CIRCULATIONAHA.107.699579 (2008).
Article PubMed Google Scholar
Goff David, C. et al. 2013 ACC/AHA guideline on the assessment of cardiovascular risk. Circulation 129, S49–S73. https://doi.org/10.1161/01.cir.0000437741.48606.98 (2014).
Conroy, R. M. et al. Estimation of ten-year risk of fatal cardiovascular disease in Europe: the SCORE project. Eur. Heart J. 24, 987–1003. https://doi.org/10.1016/S0195-668X(03)00114-3 (2003).
Article CAS PubMed Google Scholar
Hippisley-Cox, J., Coupland, C. & Brindle, P. Development and validation of QRISK3 risk prediction algorithms to estimate future risk of cardiovascular disease: prospective cohort study. BMJ 357, j2099. https://doi.org/10.1136/bmj.j2099 (2017).
Article PubMed PubMed Central Google Scholar
Grundy Scott, M. et al. 2018 AHA/ACC/AACVPR/AAPA/ABC/ACPM/ADA/AGS/APhA/ASPC/NLA/PCNA guideline on the management of blood cholesterol. Circulation 0, CIR.0000000000000625. https://doi.org/10.1161/CIR.0000000000000625.
Whelton Paul, K. et al. ACC/AHA/AAPA/ABC/ACPM/AGS/APhA/ASH/ASPC/NMA/PCNA guideline for the prevention, detection, evaluation, and management of high blood pressure in adults: Executive summary: a report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Hypertension 71(1269–1324), 2018. https://doi.org/10.1161/HYP.0000000000000066 (2017).
Article CAS Google Scholar
Catapano, A. L. et al. 2016 ESC/EAS guidelines for the management of dyslipidaemias. Eur. Heart J. 37, 2999–3058. https://doi.org/10.1093/eurheartj/ehw272 (2016).
Article PubMed Google Scholar
Zanchetti, A. et al. 2018 ESC/ESH Guidelines for the management of arterial hypertension. Eur. Heart J. 39, 3021–3104. https://doi.org/10.1093/eurheartj/ehy339 (2018).
Article PubMed Google Scholar
Pennells, L. et al. Equalization of four cardiovascular risk algorithms after systematic recalibration: Individual-participant meta-analysis of 86 prospective studies. Eur. Heart J. 40, 621–631. https://doi.org/10.1093/eurheartj/ehy653 (2019).
Article PubMed Google Scholar
Mortensen, M. B., Nordestgaard, B. G., Afzal, S. & Falk, E. ACC/AHA guidelines superior to ESC/EAS guidelines for primary prevention with statins in non-diabetic Europeans: The Copenhagen General Population Study. Eur. Heart J. 38, 586–594. https://doi.org/10.1093/eurheartj/ehw426 (2017).
Article CAS PubMed Google Scholar
Kavousi, M. et al. Comparison of application of the ACC/AHA guidelines, adult treatment panel III guidelines, and European society of cardiology guidelines for cardiovascular disease prevention in a European cohort comparison of guidelines for CVD prevention comparison of guidelines for CVD prevention. JAMA 311, 1416–1423. https://doi.org/10.1001/jama.2014.2632 (2014).
Article CAS PubMed Google Scholar
Char, D. S., Shah, N. H. & Magnus, D. Implementing machine learning in health care: Addressing ethical challenges. N. Engl. J. Med. 378, 981–983. https://doi.org/10.1056/NEJMp1714229 (2018).
Article PubMed PubMed Central Google Scholar
Chilamkurthy, S. et al. Deep learning algorithms for detection of critical findings in head CT scans: A retrospective study. The Lancet 392, 2388–2396. https://doi.org/10.1016/S0140-6736(18)31645-3 (2018).
Article Google Scholar
Gulshan, V. et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316, 2402–2410. https://doi.org/10.1001/jama.2016.17216 (2016).
Article PubMed Google Scholar
Ambale-Venkatesh, B. et al. Cardiovascular event prediction by machine learning: The multi-ethnic study of atherosclerosis. Circ. Res. 121, 1092–1101. https://doi.org/10.1161/CIRCRESAHA.117.311312 (2017).
Article CAS PubMed PubMed Central Google Scholar
Kakadiaris Ioannis, A. et al. Machine learning outperforms ACC/AHA CVD risk calculator in MESA. J. Am. Heart Assoc. 7, e009476. https://doi.org/10.1161/JAHA.118.009476 (2018).
Article CAS PubMed PubMed Central Google Scholar
Weng, S. F., Reps, J., Kai, J., Garibaldi, J. M. & Qureshi, N. Can machine-learning improve cardiovascular risk prediction using routine clinical data?. PLoS ONE 12, e0174944. https://doi.org/10.1371/journal.pone.0174944 (2017).
Article CAS PubMed PubMed Central Google Scholar
Christodoulou, E. et al. A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. J. Clin. Epidemiol. 110, 12–22. https://doi.org/10.1016/j.jclinepi.2019.02.004 (2019).
Article PubMed Google Scholar
Bian, J., Buchan, I., Guo, Y. & Prosperi, M. Statistical thinking, machine learning. J. Clin. Epidemiol. 116, 136–137. https://doi.org/10.1016/j.jclinepi.2019.08.003 (2019).
Article PubMed Google Scholar
Van Calster, B., Verbakel, J. Y., Christodoulou, E., Steyerberg, E. W. & Collins, G. S. Statistics versus machine learning: Definitions are interesting (but understanding, methodology, and reporting are more important). J. Clin. Epidemiol. 116, 137–138. https://doi.org/10.1016/j.jclinepi.2019.08.002 (2019).
Article PubMed Google Scholar
Kompa, B., Snoek, J. & Beam, A. L. Second opinion needed: communicating uncertainty in medical machine learning. NPJ Digit. Med. 4, 4. https://doi.org/10.1038/s41746-020-00367-3 (2021).
Article PubMed PubMed Central Google Scholar
Morgenstern, J. D. et al. “AI’s gonna have an impact on everything in society, so it has to have an impact on public health”: A fundamental qualitative descriptive study of the implications of artificial intelligence for public health. BMC Public Health 21, 40. https://doi.org/10.1186/s12889-020-10030-x (2021).
Article PubMed PubMed Central Google Scholar
McConnachie, A. et al. Long-term impact on healthcare resource utilization of statin treatment, and its cost effectiveness in the primary prevention of cardiovascular disease: A record linkage study. Eur. Heart J. 35, 290–298. https://doi.org/10.1093/eurheartj/eht232 (2014).
Article CAS PubMed Google Scholar
Grundy Scott, M. et al. Implications of recent clinical trials for the national cholesterol education program adult treatment panel III guidelines. Circulation 110, 227–239. https://doi.org/10.1161/01.CIR.0000133317.49796.0E (2004).
Muntner, P. et al. Validation of the atherosclerotic cardiovascular disease pooled cohort risk equations cardiovascular disease risk equations cardiovascular disease risk equations. JAMA 311, 1406–1415. https://doi.org/10.1001/jama.2014.2630 (2014).
Article CAS PubMed PubMed Central Google Scholar
DeFilippis, A. P. et al. An analysis of calibration and discrimination among multiple cardiovascular risk scores in a modern multiethnic cohort calibration and discrimination among CVD risk scores. Ann. Intern. Med. 162, 266–275. https://doi.org/10.7326/M14-1281 (2015).
Article PubMed PubMed Central Google Scholar
Jee, S. H. et al. A coronary heart disease prediction model: the Korean Heart Study. BMJ Open 4, e005025. https://doi.org/10.1136/bmjopen-2014-005025 (2014).
Article PubMed PubMed Central Google Scholar
Jung, K. J. et al. The ACC/AHA 2013 pooled cohort equations compared to a Korean risk prediction model for atherosclerotic cardiovascular disease. Atherosclerosis 242, 367–375. https://doi.org/10.1016/j.atherosclerosis.2015.07.033 (2015).
Article CAS PubMed Google Scholar
Esteva, A. et al. A guide to deep learning in healthcare. Nat. Med. 25, 24–29. https://doi.org/10.1038/s41591-018-0316-z (2019).
Article CAS PubMed Google Scholar
Kosis.kr. (n.d.). Korean Statistical Information Service. [online] Available at: http://kosis.kr/index/index.do.
Funk, M. J. & Landi, S. N. Misclassification in administrative claims data: quantifying the impact on treatment effect estimates. Curr. Epidemiol. Rep. 1, 175–185. https://doi.org/10.1007/s40471-014-0027-z (2014).
Article PubMed PubMed Central Google Scholar
Seong, S. C. et al. Cohort profile: The National Health Insurance Service-National Health Screening Cohort (NHIS-HEALS) in Korea. BMJ Open 7, e016640–e016640. https://doi.org/10.1136/bmjopen-2017-016640 (2017).
Article PubMed PubMed Central Google Scholar
Cox, D. R. The regression analysis of binary sequences. J. Roy. Stat. Soc. 20, 215–242 (1958).
MathSciNet MATH Google Scholar
Breiman, L. Bagging predictors. Mach. Learn. 24, 123–140. https://doi.org/10.1023/a:1018054314350 (1996).
Article MATH Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32. https://doi.org/10.1023/A:1010933404324 (2001).
Article MATH Google Scholar
Freund, Y. & Schapire, R. E. in Proceedings of the Thirteenth International Conference on International Conference on Machine Learning 148–156 (Morgan Kaufmann Publishers Inc., Bari, Italy, 1996).
Hastie, T., Tibshirani, R. & Friedman, J. in The Elements of Statistical Learning: Data Mining, Inference, and Prediction (eds Trevor Hastie, Robert Tibshirani, & Jerome Friedman) 389–416 (Springer New York, 2009).
Gerds, T. A., Cai, T. & Schumacher, M. The performance of risk prediction models. Biom. J. 50, 457–479. https://doi.org/10.1002/bimj.200810443 (2008).
Article MathSciNet PubMed MATH Google Scholar
DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 44, 837–845. https://doi.org/10.2307/2531595 (1988).
Article CAS PubMed MATH Google Scholar
Lemeshow, S. & Hosmer, D. W. Jr. A review of goodness of fit statistics for use in the development of logistic regression models. Am. J. Epidemiol. 115, 92–106. https://doi.org/10.1093/oxfordjournals.aje.a113284 (1982).
Article CAS PubMed Google Scholar
Vickers, A. J. & Elkin, E. B. Decision curve analysis: A novel method for evaluating prediction models. Med. Decis. Mak. 26, 565–574. https://doi.org/10.1177/0272989X06295361 (2006).
Article Google Scholar

Download references

Funding

This work was supported by the Basic Science Research Program through the National Research Foundation of Korea (Grant number 2019R1C1C1006611).

Author information

Authors and Affiliations

Department of Cardiology, Gyeongsang National University School of Medicine and Gyeongsang National University Changwon Hospital, Changwon, Korea
Sang-Yeong Cho
Cardiovascular Center, Internal Medicine, Seoul National University Bundang Hospital, 82, Gumi-Ro 173 Beon-Gil, Bundang-Gu, Seongnam-si, 13620, Gyeonggi-Do, Korea
Sun-Hwa Kim, Si-Hyuck Kang, Chang-Hwan Yoon, Tae-Jin Youn & In-Ho Chae
Department of Internal Medicine, Seoul National University, Seoul, Korea
Si-Hyuck Kang, Chang-Hwan Yoon, Tae-Jin Youn & In-Ho Chae
Department of Radiology, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Korea
Kyong Joon Lee & Dongjun Choi
Office of eHealth Research and Businesses, Seoul National University Bundang Hospital, Seongnam-si, Korea
Seungjin Kang
Department of Ophthalmology, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Korea
Sang Jun Park
Department of Neurosurgery, Seoul National University Bundang Hospital, Seoul National University College of Medicine, Seongnam-si, Korea
Tackeun Kim

Authors

Sang-Yeong Cho
View author publications
You can also search for this author in PubMed Google Scholar
Sun-Hwa Kim
View author publications
You can also search for this author in PubMed Google Scholar
Si-Hyuck Kang
View author publications
You can also search for this author in PubMed Google Scholar
Kyong Joon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Dongjun Choi
View author publications
You can also search for this author in PubMed Google Scholar
Seungjin Kang
View author publications
You can also search for this author in PubMed Google Scholar
Sang Jun Park
View author publications
You can also search for this author in PubMed Google Scholar
Tackeun Kim
View author publications
You can also search for this author in PubMed Google Scholar
Chang-Hwan Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Tae-Jin Youn
View author publications
You can also search for this author in PubMed Google Scholar
In-Ho Chae
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.H.K. had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analyses. S.Y.C., S.H.K. and S.H.K. contributed concept and design. S.Y.C., S.H.K., S.H.K., K.L., D.C. and S.K. contributed acquisition, analysis, and interpretation of data. S.Y.C. and S.H.K. drafted the manuscript. K.L., S.P., T.K., C.H.Y., T.J.Y. and I.H.C. contributed to critical revision of the manuscript and study supervision. All authors gave final approval and agreed to be accountable for all aspects of work ensuring integrity and accuracy.

Corresponding author

Correspondence to Si-Hyuck Kang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cho, SY., Kim, SH., Kang, SH. et al. Pre-existing and machine learning-based models for cardiovascular risk prediction. Sci Rep 11, 8886 (2021). https://doi.org/10.1038/s41598-021-88257-w

Download citation

Received: 05 April 2020
Accepted: 30 March 2021
Published: 26 April 2021
DOI: https://doi.org/10.1038/s41598-021-88257-w
Springer Nature Limited

This article is cited by

Artificial intelligence in the risk prediction models of cardiovascular disease and development of an independent validation screening tool: a systematic review
- Yue Cai
- Yu-Qing Cai
- Guang-Wei Zhang
BMC Medicine (2024)
Ensemble machine learning for predicting in-hospital mortality in Asian women with ST-elevation myocardial infarction (STEMI)
- Sazzli Kasim
- Putri Nur Fatin Amir Rudin
- Nurulain Ibrahim
Scientific Reports (2024)
Comparing the performance of machine learning and conventional models for predicting atherosclerotic cardiovascular disease in a general Chinese population
- Zihao Fan
- Zhi Du
- Yingxian Sun
BMC Medical Informatics and Decision Making (2023)
Machine learning reveals sex-specific associations between cardiovascular risk factors and incident atherosclerotic cardiovascular disease
- Soongu Kwak
- Hyun-Jung Lee
- Yong-Jin Kim
Scientific Reports (2023)
Machine Learning in Cardiovascular Risk Prediction and Precision Preventive Approaches
- Nitesh Gautam
- Joshua Mueller
- Subhi J. Al’Aref
Current Atherosclerosis Reports (2023)

Pre-existing and machine learning-based models for cardiovascular risk prediction

Abstract

Similar content being viewed by others

Explore related subjects

Introduction

Related research

Results

Characteristics of the study population

Performance of pre-existing risk prediction models

Performance of machine learning algorithms to the pooled cohort equation cohort

Performance of machine learning algorithms in other cohorts

Discussion

Methods

Data source and study individuals

Risk factor variables and risk score calculations

Outcome

Machine-learning algorithms

Statistical analysis

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation