Early sepsis mortality prediction model based on interpretable machine learning approach: development and validation study

Wang, Yiping; Gao, Zhihong; Zhang, Yang; Lu, Zhongqiu; Sun, Fangyuan

doi:10.1007/s11739-024-03732-2

Early sepsis mortality prediction model based on interpretable machine learning approach: development and validation study

EM - ORIGINAL
Open access
Published: 14 August 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Internal and Emergency Medicine Aims and scope Submit manuscript

Early sepsis mortality prediction model based on interpretable machine learning approach: development and validation study

Download PDF

Yiping Wang¹,
Zhihong Gao²,
Yang Zhang²,
Zhongqiu Lu¹^na1 &
…
Fangyuan Sun²^na1

526 Accesses
Explore all metrics

Abstract

Sepsis triggers a harmful immune response due to infection, causing high mortality. Predicting sepsis outcomes early is vital. Despite machine learning’s (ML) use in medical research, local validation within the Medical Information Mart for Intensive Care IV (MIMIC-IV) database is lacking. We aimed to devise a prognostic model, leveraging MIMIC-IV data, to predict sepsis mortality and validate it in a Chinese teaching hospital. MIMIC-IV provided patient data, split into training and internal validation sets. Four ML models logistic regression (LR), support vector machine (SVM), deep neural networks (DNN), and extreme gradient boosting (XGBoost) were employed. Shapley additive interpretation offered early and interpretable mortality predictions. Area under the ROC curve (AUROC) gaged predictive performance. Results were cross verified in a Chinese teaching hospital. The study included 27,134 sepsis patients from MIMIC-IV and 487 from China. After comparing, 52 clinical indicators were selected for ML model development. All models exhibited excellent discriminative ability. XGBoost surpassed others, with AUROC of 0.873 internally and 0.844 externally. XGBoost outperformed other ML models (LR: 0.829; SVM: 0.830; DNN: 0.837) and clinical scores (Simplified Acute Physiology Score II: 0.728; Sequential Organ Failure Assessment: 0.728; Oxford Acute Severity of Illness Score: 0.738; Glasgow Coma Scale: 0.691). XGBoost’s hospital mortality prediction achieved AUROC 0.873, sensitivity 0.818, accuracy 0.777, specificity 0.768, and F1 score 0.551. We crafted an interpretable model for sepsis death risk prediction. ML algorithms surpassed traditional scores for sepsis mortality forecast. Validation in a Chinese teaching hospital echoed these findings.

Predicting sepsis in-hospital mortality with machine learning: a multi-center study using clinical and inflammatory biomarkers

Article Open access 06 March 2024

A generalizable and interpretable model for mortality risk stratification of sepsis patients in intensive care unit

Article Open access 15 September 2023

Transferability and interpretability of the sepsis prediction models in the intensive care unit

Article Open access 29 December 2022

Discover the latest articles, news and stories from top researchers in related subjects.

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Sepsis, a disordered immune reaction triggered by the body’s reaction to infection, has the potential to result in significant mortality rates [1]. Every year, millions of individuals worldwide are impacted by sepsis and septic shock, which pose significant challenges to healthcare [2]. The occurrence and death rates worldwide are increasing [3]. Policymakers consider the high expenses of intensive care unit (ICU) care and the frequent readmissions of sepsis survivors as a significant concern [1]. Hence, it is crucial to forecast the outcomes of sepsis patients promptly and precisely [4].

At present, various clinical scores such as the Sequential Organ Failure Assessment (SOFA), quick Sequential (Sepsis-related) Organ Failure Assessment (qSOFA) [5], Acute Physiology and Chronic Health Assessment Scoring System II (APACHE II), Oxford Acute Severity of Illness Score (OASIS) and Simplified Acute Physiology Score II (SAPS II) aid physicians in assessing the seriousness of sepsis and forecasting the likelihood of unfavorable incidents. Generally, the score rises as the patient’s likelihood of dying increases. Typically, these traditional measures of seriousness rely on a complex statistical model involving multiple variables and were designed for patients who are critically unwell in general, rather than specifically for sepsis. Simultaneously, these measures lack the accuracy needed for individual assessment, exhibiting considerable inaccuracies when applied to patient data that deviates from the mean [6]. It is practically impossible for humans to organize and interpret the vast amount of data collected from a patient in an ICU within the necessary timeframe, particularly when it comes to predicting the patient’s prognosis. Due to the inadequate effectiveness of current severity scores measures, several novel models have been created to forecast the likelihood of mortality among septic patients admitted to the ICU.

Over the past few years, there has been a gradual integration of machine learning (ML) into medical research [7]. ML has the ability to acquire knowledge from extensive medical data, investigate relationships within a dataset [8], and construct a relevant medical model capable of efficiently and precisely forecasting novel data [9].

The utilization of the Medical Information Mart for Intensive Care IV (MIMIC-IV) dataset in this research offers a chance to create a novel and enhanced model for predicting mortality, surpassing the previous studies that relied on the Mart for Intensive Care III (MIMIC-III) dataset. Our objective was to construct a risk forecasting algorithm utilizing the MIMIC-IV dataset to detect sepsis patients with a heightened likelihood of mortality upon admission to the hospital [10] and validate.

Methods

In accordance with the Sepsis-3 definition, this research examined sepsis patients from the MIMIC-IV database, documenting vital signs, laboratory examinations, intervention therapies, and additional information [11]. In addition, we employed the Shapley additive explanation (SHAP) technique to elucidate the forecasting model [12]. This not only enables accurate prognosis prediction for patients but also offers a plausible justification for the prediction [11]. We have externally validated our model using a Chinese teaching hospital database, obtaining consistent outcomes.

Data source

The research was carried out utilizing the MIMIC-IV dataset (version 1.0). MIMIC-IV comprises of 26 tables in a relational database. The MIMIC-IV dataset offers a possible benefit due to its increased magnitude and more up-to-date information in comparison to MIMIC-III [13, 14]. The Beth Israel Deaconess Medical Center has admitted more than 70,000 ICU patients between the years 2008 and 2019, and this database holds their data [15]. Authorized users were granted approval by the Massachusetts Institute of Technology (located in Cambridge, MA, USA) and the Institutional Review Board of BIDMC (based in Boston, MA, USA) to utilize the MIMIC-IV database [16]. Upon the successful fulfillment of the web-based training course offered by the National Institutes of Health (NIH) and the completion of the Protecting Human Research Participants examination [17] (Certification Number no.37920768), Yiping Wang was granted permission to retrieve data from the database. After receiving approval from the Hospital Ethics Committee (Issuing Number KY2022-R138) [18], external validation took place at the First Affiliated Hospital of Wenzhou Medical University (Wenzhou, Zhejiang, China).

Patients and definitions

The screening process in this study employed the sepsis criteria established in 2016, known as Sepsis-3, to identify patients with sepsis. We initially chose patients who satisfied either of the following two conditions within a time frame ranging from 24 h prior to ICU admission to 24 h after: 1. By meticulously documenting infection-related diagnoses using the International Classification of Diseases, ninth revision, Clinical Modification (ICD-9-CM) codes, as provided by Angus et al. [19]; 2.Meeting the criteria for a suspected infection as outlined by Seymour CW et al. [20]. We then recognized subsequent organ malfunction as a sudden alteration in SOFA score of at least 2 points during the initial 24 h of being in the ICU. We only included the initial ICU admission for patients who had been admitted to the ICU two or more times during a single hospital stay [21]. Exclusion occurred for patients with a missing rate of predictor variable exceeding 25% in their records. The following criteria were used to exclude participants: (1) readmissions; (2) pregnant individuals; (3) ICU stays less than 24 h or longer than 100 days; (4) individuals under the age of 18 and (5) those with no access or incomplete data records. The external validation data were gathered from January 01, 2021, to June 30, 2022, following the identical criteria for inclusion and exclusion.

Data collection and variable extraction

With reference to clinical expertise, published literature, and data records in the MIMIC-IV database [11], we gathered the subsequent seven categories of information. (1) Demographic details of the patient, such as gender, age, body mass index (BMI) and more; (2) The following should be assessed Within 24 h of being admitted to the ICU: essential signs including heart rate, average arterial pressure, respiratory rate, and Pao2/fio2 Ratio (P/F); (3) laboratory test outcomes within 24 h after admission to the ICU, such as creatinine, hemoglobin, albumin, lactate, and more; (4) treatment status within 24 h after entering the ICU, such as whether mechanical ventilation was performed, renal replacement treatment, etc.; (5) basic illnesses like hypertension, diabetes, chronic obstructive pulmonary disease (COPD), coronary kidney disease (CKD) and others; (6) outcome: ICU in-hospital mortality and (7) A variety of clinical scores within 24 h after ICU admission, including SOFA, SAPS II, and GCS [22]. For variables with multiple measurements in a day, we will in addition record the trend of each variable over time. We set different window lengths for each variable, take the average value in each window, and subtract the value of each window before the last window from the value of the last window, and finally divide it by the total number of windows to obtain the mean value of the variable’s trend over the day. To address anomalies in the data, we completely eliminate the apparent outliers. Multivariable imputations were employed to handle missing data. The estimators of the tree are set to 50 with PMM (candidate = 5) strategy, when comes to each field we assume the following Gaussian distribution and Poisson distribution. Additional file 3 included variables summary.

Model establishment and evaluation

This research employed a combination of four ML models, namely logistic regression (LR), support vector machine (SVM), deep neural networks (DNN) and extreme gradient boosting (XGB). Considering the class imbalance, we adopted the way of Youden’s J statistic [23] to locate the optimal threshold of all the models. Additional file 1 included a concise overview of these fundamental ML models. We used the XGBoost model (Version 1.6.1). We utilized SHAP values to elucidate our initial predictive model and identify the risk factors associated with mortality in septic patients. The SHAP method offers a consistent and locally precise attribute value for each feature, making it a comprehensive approach to interpret the outcomes of any ML model. After computing the SHAP values for every feature in each sample, we identified the top 20 variables by considering the average SHAP values [12]. SHAP values are used to explain the output of ML models. They represent the contribution of each feature to the prediction made by the model. In other words, SHAP values help us understand how each input variable influences the model’s decision. Finally, a validation from an external source was conducted to confirm if comparable outcomes could be witnessed in a Chinese medical facility. Python version 3.9 was utilized for all computations and assessments, encompassing evaluation metrics like AUROC, specificity, sensitivity, and accuracy.

The training process involved the following steps: first, data from the MIMIC-IV database was utilized, and 70% of the cases were randomly selected as the training set, while the remaining 30% were assigned to the internal validation set [24]. Second, these trained models were used to predict the probabilities for the original samples in the internal validation set. Finally, the trained model was capable of predicting a new sample with new variables. Subsequently, the trained ML model was employed to forecast our hospital data, considering our local database as the external validation set.

Statistical analysis

Measurement data that follows a normal distribution are typically represented as X ± S and are compared between groups using a two-independent-sample t test [25]. Data that are not normally distributed is represented as M (P25, P75) and is compared using the Mann–Whitney U test. The χ² test was used to compare the groups by expressing the enumeration data in terms of rate and percentage [26]. The threshold for statistical significance was established at a P-value of less than 0.05. The SPSS software (version 26.0, Chicago, USA) was utilized for conducting statistical analyses [27].

Results

Baseline characteristics of study samples

Figure 1 depicts the procedure for screening the patient. In the MIMIC-IV database, a total of 76,943 ICU admissions were identified during the initial search. The definition of sepsis was met by a grand total of 34,677 patients. 27,134 eligible patients were included after the exclusion criteria were applied. Table 1 displays the initial clinical and laboratory information of the patients. According to Table 1, a total of 15,900 individuals (58.6%) were identified as male, while the average age of the participants included in the study was 66.6 ± 15.7 years. Of the patients in the MIMIC-IV database, 83.4% (22635) of patients survived. Out of the total 27,134 participants, 4,499 individuals in the study group experienced a mortality rate of 16.6%. According to Table 1, it is evident that individuals who did not survive sepsis tended to be older (65.9 ± 15.6 vs. 70.0 ± 15.1; P < 0.01), exhibited reduced systolic blood pressure (113 [105–123] vs. 108 [101 − 120]; P < 0.01), and had a higher SOFA score (6 [4–8] vs. 9 [6–12]; P < 0.01). The MIMIC-IV database indicates that mortality is linked to elevated levels of white blood cells, creatinine, Lactate, anion gap, as well as increased heart rate and respiratory rate. Additional file 2 presents a comparison of the baseline characteristics and clinical outcomes observed in the local external validation set.

Table 1 Baseline characteristics and comparison between survivors and non-survivors in MIMIC database

Full size table

Evaluation of machine learning algorithm

To predict mortality in sepsis patients, we employed ML techniques including XGBoost, LR, SVM, and DNN, using all variables as input. The models were trained using the training set obtained from MIMIC-IV. Table 2 displays that the XGBoost model, utilizing all accessible variables, achieved an impressive AUROC of 0.873 [95% CI 0.866–0.880] during the internal validation set and 0.844 (95% CI 0.810–0.878) during the external validation set [15]. Figure 2A displayed that XGBoost demonstrated the highest AUC in the testing dataset (LR, AUC = 0.820; SVM, AUC = 0.831; and DNN, AUC = 0.821). Table 3 displays that the XGBoost model achieved a sensitivity of 0.818, a specificity of 0.768, an accuracy of 0.777, and an F1 score of 0.551. In predicting patient mortality (Fig. 2B), the XGBoost model outperformed the clinical scores SAPS II, SOFA, OASIS and GCS (SAPS II, AUC = 0.728; SOFA, AUC = 0.728; OASIS, AUC = 0.738; GCS, AUC = 0.691). External validation was performed at the First Affiliated Hospital of Wenzhou Medical University, China. Figure 2C and D displayed an AUROC of 0.844 for the XGBoost model during external validation. It is evident that the XGBoost model performed exceptionally well in both the internal and external validation processes.

Table 2 Receive operating characteristics curve analysis

Full size table

Table 3 Performance of each model for prediction

Full size table

Explanation of risk factors

Figure 3A displays the evaluated SHAP values of the XGBoost model. In Fig. 3A, the XGBoost model displays the 20 most significant variables, arranged in order of importance. The X-axis of the SHAP value served as a consistent measure for assessing the impact of a feature on the response model. The significance of the prediction model is indicated by the ranking of its features on the Y-axis. In each feature importance row, all patient attributes related to the outcome are visualized using differently colored dots, with red dots representing high values and blue dots representing low values [11]. The SHAP values surpass zero for particular characteristics, indicating an elevated likelihood of fatality. A feature with a higher SHAP value indicates an increased risk of death for the patient. The likelihood of death rises with higher values of the following factors: Age, BUN, Anion gap, LDH, RR, PT, ALP, 24 h ventilation, HR, and average increase in Creatinine. To determine the most influential factors in the prediction model, we illustrated the SHAP summary chart of XGBoost. Figure 3A displays the representation of the individual feature’ contribution to the internal validation set on the right side. Bars of greater length indicate higher significance in relation to the SHAP value. The interpretability of the ML model is demonstrated by Fig. 3B and C, which depict two representative samples. Risk factors and protective factors are represented by the features in red and blue. Figure 3B illustrates a high-risk instance, where the red features contribute to an increase in the risk value of the instance, surpassing the average value calculated by the prediction model [28]. In Fig. 3B, the patient was forecasted to perish due to decreased clinical scores (GCS), advanced age (83 years old), reduced albumin levels (1.8 g/dl), prolonged utilization of mechanical ventilation, along with consistently high PT (17.9 s) and creatinine (average increase). The instance has a low level of risk, as the blue features decrease the risk value of the instance below the average value. In Fig. 3C, the patient was anticipated to be living due to elevated clinical scores (GCS) and being younger (51 years old).

Discussion

This study developed and validated a ML model to predict mortality in sepsis patients. The model’s performance was assessed on two study groups from the MIMIC-IV database and a Chinese teaching hospital, demonstrating robustness across different datasets. Compared to previous studies, our approach utilized the more comprehensive MIMIC-IV dataset, covering a wider time span and a larger number of hospital admissions.

We employed multiple machines learning techniques and found that the XGBoost model outperformed LR, SVM, and DNN, as well as conventional clinical scores. The model achieved an impressive AUROC of 0.873 and showed high specificity, accuracy, and sensitivity in both internal and external validation. Our ML model demonstrated exceptional reliability and prospective generalization ability in both the internal and external validation sets, surpassing the prediction performance of conventional methods significantly. Additionally, medical professionals can utilize the SHAP dependency graph to promptly make informed choices that facilitate suitable medical intervention and enhance the efficient allocation of healthcare resources [11]. If a patient is at a high risk of mortality, it is imperative to administer intensive medical intervention [28]. Expanding these methods to patients in other medical or surgical departments could enhance their utility in predicting complications and mortality, thus aiding in ICU admission decisions.

With the flexibility to incorporate diverse clinical criteria and data attributes [29], ML can construct predictive models for various medical scenarios, including the prognosis of cardiac arrest in septic patients [30, 31], forecasting multiple organ function in elderly individuals [32, 33], predicting the risk of premature mortality in critically ill patients [29], and anticipating readmission outcomes for those in intensive care [34]. Presently, numerous associated prototypes employ artificial intelligence (AI) to forecast sepsis [9, 35], thereby augmenting doctors’ capacity for medical decision-making concerning patients afflicted with sepsis [15]. Hu et al. aimed to develop and validate an interpretable machine-learning model using data from 8817 sepsis patients in the MIMIC-IV database [36]. Leveraging information from modern electronic medical records, ML techniques can be utilized to predict the early occurrence of sepsis [9], as demonstrated by Nemati et al.

Despite the numerous studies and models available in the literature regarding mortality prediction in sepsis patients, local validation of such algorithms based on the MIMIC database was lacking. Our approach addresses limitations of existing severity scores, offering an early and accurate tool for mortality prediction in sepsis patients. This model’s accessibility to readily available admission variables can aid clinicians in making timely interventions [4, 37]. To the best of our understanding, this study is one of the earliest attempts to predict premature death in sepsis patients using a comprehensive public database and a locally validated database [15, 38, 39]. We are able to modify these ML models to incorporate real time change and provide clinical decision support to the clinical team.

Our study also had several limitations. Initially, considering the aspect of model development, we have utilized certain dynamic time series data. To improve the accuracy of sepsis outcome predictions, we plan to construct a dynamic model in future and include additional features [40]. Second, Furthermore, due to the retrospective nature, single-center data could induce selection bias, warranting multi-center investigations for broader validation [41]. To enhance the model’s universality, more diverse data sources are needed, despite Chinese hospital validation. In future, once verified through a local database and with ongoing parameter adjustments, the model will be able to provide enhanced support to clinicians when making clinical decisions.

Conclusion

In conclusion, machine learning holds substantial promise in sepsis. Our developed model for early sepsis mortality prediction using machine learning demonstrated impressive performance, as validated externally in a Chinese teaching hospital, yielding consistent results.

Data availability

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation. The extracted data and programming code for model development in this study are available from the corresponding author on reasonable request. Data of the MIMIC- IV are available on website [42]. It is an open database.

Abbreviations

SOFA:: Sequential organ failure assessment
SAPS II:: Simplified acute physiology score II
GCS:: Glasgow coma scale
APACHE II:: Acute physiology and chronic health assessment scoring system II
OASIS:: Oxford acute severity of illness score
qSOFA:: Quick sequential (sepsis-related) organ failure assessment
BMI:: Body mass index
INR:: International normalized ratio
CHD:: Coronary heart disease
CKD:: Coronary kidney disease
COPD:: Chronic obstructive pulmonary disease
SE:: Sensitivity
SP:: Specificity
AC:: Accuracy
AUROC:: Area under the receiver operating characteristic curve
ALP:: Alkaline phosphatase
PT:: Prothrombin time
LDH:: Lactate dehydrogenase

References

Wang Y, Sun F, Hong G, Lu Z (2021) Thyroid hormone levels as a predictor marker predict the prognosis of patients with sepsis. Am J Emerg Med 45:42–47
Article PubMed Google Scholar
Evans L, Rhodes A, Alhazzani W, Antonelli M, Coopersmith CM, French C, Machado FR, McIntyre L, Ostermann M, Prescott HC et al (2021) Surviving sepsis campaign: international guidelines for management of sepsis and septic shock 2021. Crit Care Med 49(11):e1063–e1143
Article PubMed Google Scholar
Vincent JL, Jones G, David S, Olariu E, Cadwell KK (2019) Frequency and mortality of septic shock in Europe and North America: a systematic review and meta-analysis. Crit Care 23(1):196
Article PubMed PubMed Central Google Scholar
Rivers E, Nguyen B, Havstad S, Ressler J, Muzzin A, Knoblich B, Peterson E, Tomlanovich M (2001) Early goal-directed therapy collaborative G: Early goal-directed therapy in the treatment of severe sepsis and septic shock. N Engl J Med 345(19):1368–1377
Article CAS PubMed Google Scholar
Rhodes A, Evans LE, Alhazzani W, Levy MM, Antonelli M, Ferrer R, Kumar A, Sevransky JE, Sprung CL, Nunnally ME et al (2017) Surviving sepsis campaign: international guidelines for management of sepsis and septic shock: 2016. Intensive Care Med 43(3):304–377
Article PubMed Google Scholar
Zeng Z, Yao S, Zheng J, Gong X (2021) Development and validation of a novel blending machine learning model for hospital mortality prediction in ICU patients with Sepsis. BioData Min 14(1):40
Article PubMed PubMed Central Google Scholar
Islam MM, Nasrin T, Walther BA, Wu CC, Yang HC, Li YC (2019) Prediction of sepsis patients using machine learning approach: a meta-analysis. Comput Methods Programs Biomed 170:1–9
Article PubMed Google Scholar
Chen JH, Asch SM (2017) Machine learning and prediction in medicine-beyond the peak of inflated expectations. N Engl J Med 376(26):2507–2509
Article PubMed PubMed Central Google Scholar
Nemati S, Holder A, Razmi F, Stanley MD, Clifford GD, Buchman TG (2018) An interpretable machine learning model for accurate prediction of sepsis in the ICU. Crit Care Med 46(4):547–553
Article PubMed PubMed Central Google Scholar
Johnson AE, Stone DJ, Celi LA, Pollard TJ (2018) The MIMIC code repository: enabling reproducibility in critical care research. J Am Med Inform Assoc 25(1):32–39
Article PubMed Google Scholar
Liu C, Liu X, Mao Z, Hu P, Li X, Hu J, Hong Q, Geng X, Chi K, Zhou F et al (2021) Interpretable machine learning model for early prediction of mortality in ICU patients with rhabdomyolysis. Med Sci Sports Exerc 53(9):1826–1834
Article CAS PubMed Google Scholar
Park JY, Hsu TC, Hu JR, Chen CY, Hsu WT, Lee M, Ho J, Lee CC (2022) Predicting sepsis mortality in a population-based national database: machine learning approach. J Med Internet Res 24(4):e29982
Article PubMed PubMed Central Google Scholar
Su Y, Guo C, Zhou S, Li C, Ding N (2022) Early predicting 30-day mortality in sepsis in MIMIC-III by an artificial neural networks model. Eur J Med Res 27(1):294
Article CAS PubMed PubMed Central Google Scholar
Zhu Y, He Z, Jin Y, Zhu S, Xu W, Li B, Nie C, Liu G, Lyu J, Han S (2023) Serum anion gap level predicts all-cause mortality in septic patients: a retrospective study based on the MIMIC III database. J Intensive Care Med 38(4):349–357
Article PubMed Google Scholar
Liu W, Tao G, Zhang Y, Xiao W, Zhang J, Liu Y, Lu Z, Hua T, Yang M (2021) A simple weaning model based on interpretable machine learning algorithm for patients with sepsis: a research of MIMIC-IV and eICU databases. Front Med (Lausanne) 8:814566
Article PubMed Google Scholar
Li Q, Wang J, Liu G, Xu M, Qin Y, Han Q, Liu H, Wang X, Wang Z, Yang K et al (2018) Prompt admission to intensive care is associated with improved survival in patients with severe sepsis and/or septic shock. J Int Med Res 46(10):4071–4081
Article PubMed PubMed Central Google Scholar
Hu C, Li Y, Wang F, Peng Z (2022) Application of machine learning for clinical subphenotype identification in sepsis. Infect Dis Ther 11(5):1949–1964
Article PubMed PubMed Central Google Scholar
Qin X, Zhang W, Zhu X, Hu X, Zhou W (2021) Early fresh frozen plasma transfusion: is it associated with improved outcomes of patients with sepsis? Front Med (Lausanne) 8:754859
Article PubMed Google Scholar
Angus DC, Linde-Zwirble WT, Lidicker J, Clermont G, Carcillo J, Pinsky MR (2001) Epidemiology of severe sepsis in the United States: analysis of incidence, outcome, and associated costs of care. Crit Care Med 29(7):1303–1310
Article CAS PubMed Google Scholar
Seymour CW, Liu VX, Iwashyna TJ, Brunkhorst FM, Rea TD, Scherag A, Rubenfeld G, Kahn JM, Shankar-Hari M, Singer M et al (2016) Assessment of clinical criteria for sepsis: for the third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA 315(8):762–774
Article CAS PubMed PubMed Central Google Scholar
Chen H, Zhu Z, Zhao C, Guo Y, Chen D, Wei Y, Jin J (2020) Central venous pressure measurement is associated with improved outcomes in septic patients: an analysis of the MIMIC-III database. Crit Care 24(1):433
Article PubMed PubMed Central Google Scholar
Wang J, Chen Z, Yang H, Li H, Chen R, Yu J (2022) Relationship between the hemoglobin-to-red cell distribution width ratio and all-cause mortality in septic patients with atrial fibrillation: based on propensity score matching method. J Cardiovasc Dev Dis 9(11):400
CAS PubMed PubMed Central Google Scholar
Ilhan Topcu D, Can Cubukcu H (2022) Optimization of patient-based real-time quality control based on the Youden index. Clin Chim Acta 534:50–56
Article CAS PubMed Google Scholar
Shan N, Xu X, Bao X, Qiu S (2022) Fast fault diagnosis in industrial embedded systems based on compressed sensing and deep kernel extreme learning machines. Sensors (Basel) 22(11):3997
Article PubMed Google Scholar
Ma Y, Tian T, Wang T, Wang J, Guan H, Yuan J, Song L, Yang W, Qiao S (2022) Predictive value of plasma big endothelin-1 in adverse events of patients with coronary artery restenosis and diabetes mellitus: beyond traditional and angiographic risk factors. Front Cardiovasc Med 9:854107
Article CAS PubMed PubMed Central Google Scholar
Ho KF, Swindell R, Brammer CV (2008) Dose intensity comparison between weekly and 3-weekly Cisplatin delivered concurrently with radical radiotherapy for head and neck cancer: a retrospective comparison from new cross hospital, Wolverhampton. UK Acta Oncol 47(8):1513–1518
Article CAS PubMed Google Scholar
Charatcharoenwitthaya P, Kuljiratitikal K, Aksornchanya O, Chaiyasoot K, Bandidniyamanon W, Charatcharoenwitthaya N (2021) Moderate-intensity aerobic vs resistance exercise and dietary modification in patients with nonalcoholic fatty liver disease: a randomized clinical trial. Clin Transl Gastroenterol 12(3):e00316
Article PubMed PubMed Central Google Scholar
Li X, Xu X, Xie F, Xu X, Sun Y, Liu X, Jia X, Kang Y, Xie L, Wang F et al (2020) A time-phased machine learning model for real-time prediction of sepsis in critical care. Crit Care Med 48(10):e884–e888
Article PubMed Google Scholar
Wardi G, Carlile M, Holder A, Shashikumar S, Hayden SR, Nemati S (2021) Predicting progression to septic shock in the emergency department using an externally generalizable machine-learning algorithm. Ann Emerg Med 77(4):395–406
Article PubMed PubMed Central Google Scholar
Kwon JM, Lee Y, Lee Y, Lee S, Park J (2018) An algorithm based on deep learning for predicting in-hospital cardiac arrest. J Am Heart Assoc. https://doi.org/10.1161/JAHA.118.008678
Article PubMed PubMed Central Google Scholar
Seki T, Tamura T, Suzuki M (2019) Group S-KS: outcome prediction of out-of-hospital cardiac arrest with presumed cardiac aetiology using an advanced machine learning technique. Resuscitation 141:128–135
Article PubMed Google Scholar
Mollura M, Lehman LH, Mark RG, Barbieri R (2021) A novel artificial intelligence based intensive care unit monitoring system: using physiological waveforms to identify sepsis. Philos Trans A Math Phys Eng Sci 379(2212):20200252
PubMed PubMed Central Google Scholar
Green M, Bjork J, Forberg J, Ekelund U, Edenbrandt L, Ohlsson M (2006) Comparison between neural networks and multiple logistic regression to predict acute coronary syndrome in the emergency room. Artif Intell Med 38(3):305–318
Article PubMed Google Scholar
Jiang X, Wang Y, Pan Y, Zhang W (2022) Prediction models for sepsis-associated thrombocytopenia risk in intensive care units based on a machine learning algorithm. Front Med (Lausanne) 9:837382
Article PubMed Google Scholar
Tsien CL, Fraser HS, Long WJ, Kennedy RL (1998) Using classification tree and logistic regression methods to diagnose myocardial infarction. Stud Health Technol Inform 52(Pt 1):493–497
PubMed Google Scholar
Hu C, Li L, Huang W, Wu T, Xu Q, Liu J, Hu B (2022) Interpretable machine learning for early prediction of prognosis in sepsis: a discovery and validation study. Infect Dis Ther 11(3):1117–1132
Article PubMed PubMed Central Google Scholar
Singer M, Deutschman CS, Seymour CW, Shankar-Hari M, Annane D, Bauer M, Bellomo R, Bernard GR, Chiche JD, Coopersmith CM et al (2016) The third international consensus definitions for sepsis and septic shock (sepsis-3). JAMA 315(8):801–810
Article CAS PubMed PubMed Central Google Scholar
Yoon JH, Pinsky MR, Clermont G (2022) Artificial intelligence in critical care medicine. Crit Care 26(1):75
Article PubMed PubMed Central Google Scholar
Mao Q, Jay M, Hoffman JL, Calvert J, Barton C, Shimabukuro D, Shieh L, Chettipally U, Fletcher G, Kerem Y et al (2018) Multicentre validation of a sepsis prediction algorithm using only vital sign data in the emergency department, general ward and ICU. BMJ Open 8(1):e017833
Article PubMed PubMed Central Google Scholar
Li D, Gao J, Hong N, Wang H, Su L, Liu C, He J, Jiang H, Wang Q, Long Y et al (2021) A clinical prediction model to predict heparin treatment outcomes and provide dosage recommendations: development and validation study. J Med Internet Res 23(5):e27118
Article PubMed PubMed Central Google Scholar
Ryu YH, Kim SY, Kim TU, Lee SJ, Park SJ, Jung HY, Hyun JK (2022) Prediction of poststroke depression based on the outcomes of machine learning algorithms. J Clin Med 11(8):2264
Article PubMed PubMed Central Google Scholar
MIMIC- IV database. URL: https://mimic.physionet.org

Download references

Acknowledgements

We thank all patients who participated in our study and the MIMIC-IV program for access to the database.

Funding

This work was supported by the Key R&D Program Projects of Zhejiang Province. (2021C03072) and the Science and Technology Program of Wenzhou (Grant No. Y20210718).

Author information

Zhongqiu Lu and Fangyuan Sun have been contributed equally.

Authors and Affiliations

Department of Emergency, The First Affiliated Hospital of WenZhou Medical University, Wenzhou, 325000, China
Yiping Wang & Zhongqiu Lu
Department of Computer Technology and Information Management, The First Affiliated Hospital of WenZhou Medical University, Wenzhou, 325000, China
Zhihong Gao, Yang Zhang & Fangyuan Sun

Authors

Yiping Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhihong Gao
View author publications
You can also search for this author in PubMed Google Scholar
Yang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhongqiu Lu
View author publications
You can also search for this author in PubMed Google Scholar
Fangyuan Sun
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

ZQL: conceived and designed the study. YPW: writing the original draft; FYS and YZ: software and validation; ZHG: helped with data collection and assembly. Every writer participated in the analysis of the data, the creation of the draft, and the thorough review of the manuscript. They all accepted responsibility for every aspect of the project and gave their approval to the final version.

Corresponding authors

Correspondence to Zhongqiu Lu or Fangyuan Sun.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Ethical approval

All procedures performed in the present study were in accordance with the principles outlined in the 1964 Helsinki Declaration and its later amendments. The study was approved by the First Affiliated Hospital Ethics Committee (Issuing Number:KY2022-R138). The need for informed consent was waived by the First Affiliated Hospital of Wenzhou Medical University Ethics Committee in Clinical Research for this retrospective study.

Consent to participate

The Massachusetts Institute of Technology (Cambridge, MA, USA) and the Institutional Review Board of BIDMC (Boston, MA, USA) approved the use of the MIMIC-IV database for authorized users. This database is a public de-identified database thus informed consent and approval of the Institutional Review Board was waived.

Consent for publication

Not applicable.

Provenance and peer review

Not commissioned, externally peer reviewed.

Human and Animal Rights Statement

All procedures performed in the present study were in accordance with the principles outlined in the 1964 Helsinki Declaration and its later amendments. The study was approved by the First Affiliated Hospital Ethics Committee (Issuing Number:KY2022- R138). The need for informed consent was waived by the First Affiliated Hospital of Wenzhou Medical University Ethics Committee in Clinical Research for this retrospective study.

Informed consent

The Massachusetts Institute of Technology (Cambridge, MA, USA) and the Institutional Review Board of BIDMC (Boston, MA, USA) approved the use of the MIMIC-IV database for authorized users. This database is a public de-identified database thus informed consent and approval of the Institutional Review Board was waived.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, Y., Gao, Z., Zhang, Y. et al. Early sepsis mortality prediction model based on interpretable machine learning approach: development and validation study. Intern Emerg Med (2024). https://doi.org/10.1007/s11739-024-03732-2

Download citation

Received: 05 September 2023
Accepted: 27 July 2024
Published: 14 August 2024
DOI: https://doi.org/10.1007/s11739-024-03732-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Early sepsis mortality prediction model based on interpretable machine learning approach: development and validation study

Abstract

Similar content being viewed by others

Predicting sepsis in-hospital mortality with machine learning: a multi-center study using clinical and inflammatory biomarkers

A generalizable and interpretable model for mortality risk stratification of sepsis patients in intensive care unit

Transferability and interpretability of the sepsis prediction models in the intensive care unit

Explore related subjects

Introduction

Methods

Data source

Patients and definitions

Data collection and variable extraction

Model establishment and evaluation

Statistical analysis

Results

Baseline characteristics of study samples

Evaluation of machine learning algorithm

Explanation of risk factors

Discussion

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Ethical approval

Consent to participate

Consent for publication

Provenance and peer review

Human and Animal Rights Statement

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation