One-year mortality of colorectal cancer patients: development and validation of a prediction model using linked national electronic data

Cowling, Thomas E.; Bellot, Alexis; Boyle, Jemma; Walker, Kate; Kuryba, Angela; Galbraith, Sarah; Aggarwal, Ajay; Braun, Michael; Sharples, Linda D.; van der Meulen, Jan

doi:10.1038/s41416-020-01034-w

Clinical Study

One-year mortality of colorectal cancer patients: development and validation of a prediction model using linked national electronic data

Article
Open access
Published: 24 August 2020

Volume 123, pages 1474–1480, (2020)
Cite this article

Download PDF

You have full access to this open access article

British Journal of Cancer Submit manuscript

One-year mortality of colorectal cancer patients: development and validation of a prediction model using linked national electronic data

Download PDF

Thomas E. Cowling ORCID: orcid.org/0000-0003-1524-4393^1,2,
Alexis Bellot^3,4,
Jemma Boyle^1,2,
Kate Walker^1,2,
Angela Kuryba¹,
Sarah Galbraith⁵,
Ajay Aggarwal⁶,
Michael Braun⁷,
Linda D. Sharples⁸ &
…
Jan van der Meulen^1,2

1643 Accesses
5 Citations
3 Altmetric
Explore all metrics

Abstract

Background

The existing literature does not provide a prediction model for mortality of all colorectal cancer patients using contemporary national hospital data. We developed and validated such a model to predict colorectal cancer death within 90, 180 and 365 days after diagnosis.

Methods

Cohort study using linked national cancer and death records. The development population included 27,480 patients diagnosed in England in 2015. The test populations were diagnosed in England in 2016 (n = 26,411) and Wales in 2015–2016 (n = 3814). Predictors were age, gender, socioeconomic status, referral source, performance status, tumour site, TNM stage and treatment intent. Cox regression models were assessed using Brier scores, c-indices and calibration plots.

Results

In the development population, 7.4, 11.7 and 17.9% of patients died from colorectal cancer within 90, 180 and 365 days after diagnosis. T4 versus T1 tumour stage had the largest adjusted association with the outcome (HR 4.67; 95% CI: 3.59–6.09). C-indices were 0.873–0.890 (England) and 0.856–0.873 (Wales) in the test populations, indicating excellent separation of predicted risks by outcome status. Models were generally well calibrated.

Conclusions

The model was valid for predicting short-term colorectal cancer mortality. It can provide personalised information to support clinical practice and research.

Prediction of 30-day, 90-day, and 1-year mortality after colorectal cancer surgery using a data-driven approach

Article Open access 29 February 2024

Developing prediction models for short-term mortality after surgery for colorectal cancer using a Danish national quality assurance database

Article 18 July 2022

Prognostic nomogram to predict the overall survival of patients with early-onset colorectal cancer: a population-based analysis

Article Open access 29 July 2021

Background

In 2018, colorectal cancer was the third most incident cancer and caused the second largest number of cancer deaths in high-income countries.^1,2 It is a heterogeneous disease with varied presentations and large differences in prognosis. Considering the cancer stage alone, 1-year net survival for localised and metastatic cancer varies from 96 to 55%, respectively, in the United States.³

Clinical prediction models combine multiple prognostic factors to estimate individualised risks of outcomes for each patient.^4,5 These risk predictions have many uses. In colorectal cancer research, prediction models have been used to examine prognosis in clinical trials,⁶ to control for confounding in observational studies,⁷ and to assess the added prognostic value of biomarkers.⁸ In clinical practice, they may be used to inform treatment decisions and to communicate prognosis to patients, in line with the aims of personalised medicine and shared decision-making.⁹

In the absence of high-quality prediction models, clinicians’ predictions of cancer survival may be inaccurate, non-transparent, and difficult to explain to patients.^10,11,12 Existing models for colorectal cancer mortality have focused on selected populations recruited to clinical trials (such as stage III and IV groups),^6,13,14 risks after surgery or chemotherapy,^15,16 or long-term survival using primary care data.¹⁷ A recent systematic review¹⁸ did not identify any models to predict mortality for all colorectal cancer patients using contemporary national hospital data.

In this study, our objective was to develop and validate a prediction model for death from colorectal cancer within 3, 6 and 12 months after diagnosis. To do this, we analysed national electronic hospital records linked to official mortality data from England and Wales.

Methods

Study populations

The National Bowel Cancer Audit collects data on adults (aged 18 years or over) newly diagnosed with colorectal cancer (International Classification of Diseases 10th Revision (ICD-10) codes: C18-20¹⁹) in England and Wales.²⁰ These data are entered into electronic record systems by hospital staff and later combined into a pooled national dataset by the National Health Service (NHS). We analysed data for patients whose date of diagnosis was from January 2015 to December 2016.

We defined one population to develop the prediction model and two separate populations to test the performance of this model. The eligible population used for model development included patients who were diagnosed in England in 2015 (n = 28,505 patients). The first test population included patients who were diagnosed in England in 2016 (n = 28,216 patients). The second test population included patients who were diagnosed in Wales in 2015 or 2016 (n = 3861 patients).

Outcome

The outcome was death from colorectal cancer as identified from official death records provided by the Office for National Statistics.²¹ We defined death from colorectal cancer using relevant ICD-10 codes recorded as the ‘underlying cause of death’ (see Supplement S1). The underlying cause is ‘the disease or injury, which initiated the train of morbid events leading directly to death’.²²

Time to death was defined as the number of days between the date of diagnosis (as recorded in the National Bowel Cancer Audit dataset) and the date of death from colorectal cancer (as recorded in Office for National Statistics mortality data). The date of diagnosis was ‘the date when cancer was confirmed or diagnosis agreed’, which is typically the date of the pathology report that confirmed cancer. Patients who died from other causes were censored on the date of death. Patients alive as of 1 January 2018 were censored on that date, providing at least 365 days of follow-up for all patients.

Records from the National Bowel Cancer Audit and Office for National Statistics datasets were combined using deterministic linkage based on each patient’s unique NHS number, date of birth, gender and postcode. From the 60,582 eligible patients (in both development and test sets), the final sample size was 57,705 patients (27,480 in the development population; 26,411 in the England test population; and 3814 in the Wales test population). Supplement S2 provides the sample flow chart. Distributions of variables were similar for the linked and unlinked patients (Supplement S3).

Predictor variables

We used ten variables from the National Bowel Cancer Audit dataset as predictor variables: age, gender, socioeconomic status, source of referral, performance status, tumour site, TNM (tumour, node, metastasis) stage at diagnosis and treatment intent. All variables were recorded in electronic data systems around the time of the first meeting between clinicians to discuss patients’ treatment after diagnosis. We selected these predictors a priori to include variables recorded around the time of diagnosis that had relatively complete data (≥80% of values nonmissing).

Patient age was coded as a continuous variable defined as the number of complete years between the dates of birth and diagnosis. Gender was male/female. Socioeconomic status was defined as the national rank of a patient’s area of residence according to the Index of Multiple Deprivation;²³ the mean population size of these areas was 1500.²³ To aid interpretation, these ranks were linearly rescaled to have a median of zero and lower and upper quartiles of −1 and +1, respectively.²⁴

The source of referral for investigation of suspected cancer had five categories: emergency hospital admission, urgent care/emergency department visit, primary care, national screening programme and ‘other’ (e.g. a separate outpatient clinic). Performance status was defined by five categories of the Eastern Cooperative Oncology Group score (ranging from ‘fully active’ to ‘completely disabled’).²⁵ Tumour site was one of nine ICD-10 codes indexed under C18-20. T, N and M stages of the cancer were defined by the TNM Classification of Malignant Tumours 5th Edition.²⁶ The treatment intent had three categories: curative, non-curative and no active cancer treatment.

All ten predictor variables were defined using the National Bowel Cancer Audit dataset. The original (incomplete) data were used to calculate descriptive statistics for each variable. To account for missing values of predictors, we used multiple imputation with chained equations to generate 40 complete datasets (see Supplement S4 for details). All analysis of associations between the outcome and predictors was done using these 40 imputed datasets. We pooled model estimates and performance measures across the datasets to produce the final results.²⁷

Statistical analysis

We used Cox proportional hazards regression²⁸ to estimate associations between predictor variables and the hazard of colorectal cancer death. Deaths from other causes were treated as censoring events. All predictors entered the regression model simultaneously. We fitted linear associations with the outcome for age and socioeconomic status, as nonlinear transformations fitted by a multivariable fractional polynomial algorithm^29,30,31 were well approximated by linear relationships.

We assessed model performance at 90, 180 and 365 days after diagnosis. Overall model performance was measured using Brier scores.³² These scores were calculated from the mean squared differences between predicted probabilities of colorectal cancer death within a given time period and the observed death status. We scaled these scores from 0–100% (0% if non-informative and 100% if perfect).³³

To assess discrimination, we calculated the c-index.³⁴ This indicates the proportion of all pairs of patients whose survival times could be ordered such that the patient with the lower predicted risk of colorectal cancer death survived longer.²⁴ C-indices equal one for perfect models and 0.5 for random predictions. To assess model calibration, we plotted the predicted risks of colorectal cancer death against the actual observed risks, using the loess smoother to estimate the calibration curve.²⁴

We assessed the internal validity of the model using 10-fold cross-validation and calculated mean values of the performance measures across the ten folds. We tested the performance of the model in two other populations: patients diagnosed in England in 2016 and in Wales in either 2015 or 2016.

Sensitivity analyses

Three sensitivity analyses tested the specification of the model and its performance, as detailed in Supplement S5. These added interaction terms between key predictors, added a comorbidity score and the number of unplanned admissions in the past year as predictor variables, and assessed whether censoring of surviving patients at 365 days affected the associations estimated.

Data preparation was done using Stata (v15) and R (v3.5) was used for all statistical analysis.

Results

In the population used to develop the prediction model, the percentages of patients who died from colorectal cancer were 7.4% (within 90 days), 11.7% (180 days) and 17.9% (365 days). These percentages were similar in the England test population but slightly greater in the Wales test population (Table 1). The Wales population had greater percentages of patients who were referred for diagnostic investigations after an emergency admission (29.0% vs. 13.0% in the development population) and who had metastases (25.6% vs. 22.1%). Most patients in each population were treated with curative intent (73.3–74.1%) (Table 2).

Table 1 Descriptive statistics for the outcome variable and follow-up time.

Full size table

Table 2 Descriptive statistics for predictor variables.

Full size table

Missing values were most common for the performance status of the patient (16.8%) and the T and N-stages of the cancer (19.2% and 17.0%; Table 2). Data fields were complete across all variables for 61.5% of patients. These patients were more likely to be treated with curative intent (76.6% vs. 67.5%) and to survive until the end of follow-up (70.6% vs. 61.5%) than patients who had at least one predictor variable with a missing value (Supplement S6).

After multiple imputation of missing values, risks of colorectal cancer death were greatest for patients who had metastatic disease, had a treatment plan with non-curative intent or no active cancer treatment, or had an unfavourable performance status (Table 3). The risk of cancer death within 365 days was more than 50% for three patient groups: patients in the two worst performance status categories (50.3% and 58.3%) and patients with a non-curative treatment intent (51.9%).

Table 3 Univariable and multivariable associations between the outcome and predictor variables in the development population, estimated using Cox regression.

Full size table

In the multivariable model including all predictor variables, the greatest relative difference in the hazard of colorectal cancer death was between the T4 and T1 stages (hazard ratio (HR) = 4.67; 95% confidence interval (CI): 3.59–6.09). Compared to patients with a curative treatment intent, the hazard of colorectal cancer death was 3.85 times greater for patients whose treatment plan was non-curative (HR = 3.85, 95% CI: 3.60–4.11) or did not include active cancer treatment (HR = 3.85, 95% CI: 3.52–4.21). Outcomes were similar between the non-curative and no active cancer treatment groups (HR = 1.00, 95% CI: 0.92–1.09) (Table 3).

Predicted probabilities of colorectal cancer death varied widely within treatment intent categories. In the England test population, the 10th and 90th percentiles of predicted risks within 365 days were 1.7% and 12.9% for patients treated with curative intent, 23.8% and 88.8% for patients with a non-curative intent and 16.3% and 89.6% for patients with no active cancer treatment planned.

Model performance

The probabilities of colorectal cancer death predicted by the model were well calibrated with the observed proportions of patients that died, in both England and Wales test populations (Fig. 1).

**Fig. 1: Calibration plots for predicted probabilities of colorectal cancer death within 90, 180 and 365 days after diagnosis, in the England and Wales test populations.**

The model typically predicted very low risks of colorectal cancer death for patients who did not experience this outcome (Fig. 2). The predicted risks were generally much greater for patients who did die from colorectal cancer, particularly for the 365-day outcome period. As a result, the predicted probabilities of colorectal cancer death were well separated between patients who did and did not have this outcome (Fig. 2). This was reflected in large values of the c-index, ranging from 0.873 to 0.890 and 0.856 to 0.873 in the England and Wales test populations, respectively (Table 4).

**Fig. 2: Boxplots comparing predicted probabilities of colorectal cancer death by outcome status within 90, 180 and 365 days after diagnosis, in the England and Wales test populations.**

Table 4 Overall model performance and discrimination in the development and test populations.

Full size table

The overall performance of the model as measured by the scaled Brier score was best for the 365-day period, followed by the 180 then 90-day periods (Table 4). For the 365-day period in the England test population, the Brier score was improved by 40.0% compared to if the overall risk of colorectal cancer death had been used as the predicted probability for all patients, indicating a large improvement in prediction ability when using the model (versus no model).

Sensitivity analyses

In the sensitivity analyses, interaction terms between patient age, M-stage and treatment intent did not improve model performance (maximum absolute difference in c-index or Brier score vs. main analysis = 0.001). Results were also similar when each patient’s history of comorbidities and unplanned hospitalisations were added as predictors (maximum absolute difference = 0.002). When patients who were alive 365 days after diagnosis were censored at this timepoint, predictor effects were similar to those in the main analysis (range of relative differences in HRs: 0.97–1.08).

Discussion

The model developed was valid for predicting death from colorectal cancer within 3, 6, and 12 months after diagnosis in England and Wales. The model discriminated very well between patients who did and did not die from colorectal cancer, such that the former group typically had much higher predicted probabilities of death. These predictions were well calibrated with observed outcomes. The T-stage of the tumour had the largest adjusted association with the risk of death, followed by the treatment intent and performance status of the patient.

No single variable alone had a high positive predictive value for colorectal cancer death. For example, just over half of patients (51%) who did not have a curative treatment intent died within 365 days. Predicted risks of death varied widely across patients who did not have a curative intent. This wide variation also existed for patients who did have a curative treatment plan.

Strengths and limitations

We used large, national datasets to develop a new model and examine its temporal and geographic validity in whole populations from two countries. The data used for predictor variables were entered as part of routine care processes and therefore represent information available to clinicians in practice around the time of decision-making. We used cause of death information from official death records to distinguish colorectal cancer deaths from other deaths, and we were able to measure these outcomes for at least 1 year after diagnosis for all patients. Although the patients in the test sets were similar to those in the development set, the differences in the type of referrals and TNM stages between England and Wales provided a reasonable test of external validity.

The model would likely be improved if further information about the cancer was available, such as the sites of any metastases or possibly molecular data, as well as additional characteristics of patients (such as frailty) and their cancer care. This may help to predict greater probabilities of colorectal cancer death for patients who experienced this outcome. Some uncertainty in prognosis may reflect the biological development of cancer and the possibility of treatment-related complications.

Detailed assessment of patients’ overall morbidity, particularly for older patients, could be used to contextualise predictions of cancer mortality in terms of overall life expectancy. However, the overall risk of dying from causes other than colorectal cancer within 1 year after diagnosis was only 4%, so other causes of mortality in this period may be less relevant to treatment decisions for most patients.

Differences in data collection or population characteristics may limit the generalisability of the model to other countries. Estimates of 1-year survival for colorectal cancer can differ markedly between high-income countries, such as 78% in England and 84% in Sweden in 2010–2012.³⁵ The model may need to be recalibrated when used elsewhere if the survival differences are unexplained by differences in the distributions of predictors. However, despite survival in Wales being somewhat worse than in England in the current study, model calibration remained acceptable. Most predictors used have standard international definitions. We rescaled the measure of socioeconomic status so that it might approximate similarly rescaled measures in other settings.

In order to avoid the possibility that any racial biases in access to treatment are reinforced by the prediction model, we did not consider patient ethnicity as a predictor.³⁶ This is in line with most clinical prediction models.³⁷ Prognostic factors such as lymphovascular invasion, surgical margin status and definitive treatment were not included in the model as they are typically unknown around the time of diagnosis and were not relevant to all patients (some of whom do not receive surgery).

Missing data will have biased results if data were ‘missing not at random’, which multiple imputation cannot address. The extent of this bias cannot be ascertained from observed data, but each predictor had less than 20% of values missing, thus reducing the potential bias. National Bowel Cancer Audit records could not be linked to Office for National Statistics death records for 4.4% of eligible patients; distributions of predictor variables were similar between the linked and unlinked groups of patients but some bias due to linkage problems cannot be ruled out.

The 5th edition of the TNM system used in the analysis has been superseded by the 8th edition in the U.K., which will affect the N-stage of some (but relatively few) patients.

Relation to existing literature

A previous study¹⁷ used primary care records and cancer registry data to develop a prediction model for longer-term survival (1, 5 and 10-year) of colorectal cancer patients in England. This model did not include several variables that are routinely recorded in clinical team meetings shortly after diagnosis such as the referral source, performance status, separate TNM stages and treatment intent. The c-index of 0.873 attained by our model for predicting 365-day cancer mortality in England is much greater than that reported for one-year mortality (from all causes) in the previous study (0.795 for men and 0.807 for women¹⁷). This indicates a large increase in performance (closer to the perfect c-index of 1), especially as c-indices are relatively insensitive to improvements in model fit.³⁸

A systematic review¹⁸ reported several prediction models developed for mortality in subgroups of colorectal cancer patients, such as patients with stage III⁶ or metastatic^13,14 cancer, or for posttreatment mortality.^15,16 None of these models were developed to predict mortality for all colorectal cancer patients using contemporary national hospital data. A previous study⁷ by our group used linked National Bowel Cancer Audit and Office for National Statistics death records to develop a risk-adjustment model for 90-day postoperative mortality. This model used similar predictors to the model presented here and showed good discrimination (c-index = 0.799) and calibration; the c-index may have been lower in this surgical cohort partly due to the population being more homogeneous.

Implications for research and practice

The predictor information used in the model is recorded electronically as part of routine practice in England and Wales, typically during clinical team meetings where patient care is planned. Patients’ risks of death within 3, 6 and 12 months could be automatically calculated in these meetings without additional data entry. Supplement S7 gives the formula for calculating predicted probabilities of colorectal cancer death within 90, 180 and 365 days after diagnosis.

The external validity of the model should be tested further before being used outside of England and Wales, possibly in combination with well-established methods for updating prediction models when used in new settings.³⁹ Ideally, the effects of the model on decision-making and patient outcomes would also be evaluated in future research (though such impact studies are rare⁴⁰).

The model’s predictions could be used to provide accurate prognostic information to patients, so that they can make informed decisions together with clinicians. The risk predictions may also help to prioritise patients for specialist palliative care services,^41,42 given the wide range of predicted risks for patients without a curative treatment intent. The predictions also varied widely for those with a curative intent, which may help to inform the intensity of related treatment. Finally, the model could also be relevant to various clinical, epidemiological and biomarker studies.

References

Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R. L., Torre, L. A. & Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68, 394–424 (2018).
PubMed Google Scholar
Ferlay, J., Ervik, M., Lam, F., Colombet, M., Mery, L., Piñeros, M. et al. in Global Cancer Observatory: Cancer Today. (International Agency for Research on Cancer, Lyon, 2018).
National Cancer Institute. Cancer Query System: SEER Survival Statistics. https://seer.cancer.gov/canques/survival.html (2020).
Moons, K. G., Royston, P., Vergouwe, Y., Grobbee, D. E. & Altman, D. G. Prognosis and prognostic research: what, why, and how? BMJ https://doi.org/10.1136/bmj.b375 (2009).
Steyerberg, E. W., Moons, K. G., van der Windt, D. A., Hayden, J. A., Perel, P., Schroter, S. et al. Prognosis Research Strategy (PROGRESS) 3: prognostic model research. PLoS Med. https://doi.org/10.1371/journal.pmed.1001381 (2013)
Renfro, L. A., Grothey, A., Xue, Y., Saltz, L. B., Andre, T., Twelves, C. et al. ACCENT-based web calculators to predict recurrence and overall survival in stage III colon cancer. J. Natl Cancer Inst. https://doi.org/10.1093/jnci/dju333 (2014).
Walker, K., Finan, P. J. & van der Meulen, J. H. Model for risk adjustment of postoperative mortality in patients with colorectal cancer. Br. J. Surg. 102, 269–280 (2015).
Article CAS PubMed Google Scholar
Dienstmann, R., Mason, M. J., Sinicrope, F. A., Phipps, A. I., Tejpar, S., Nesbakken, A. et al. Prediction of overall survival in stage II and III colon cancer beyond TNM system: a retrospective, pooled biomarker study. Ann. Oncol. 28, 1023–1031 (2017).
Article CAS PubMed PubMed Central Google Scholar
Steyerberg, E. W. Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating. 2nd edn. (Springer, Cham, 2019).
Glare, P., Virik, K., Jones, M., Hudson, M., Eychmuller, S., Simes, J. et al. A systematic review of physicians’ survival predictions in terminally ill cancer patients. BMJ 327, 195–198 (2003).
Article PubMed PubMed Central Google Scholar
Chow, E., Harth, T., Hruby, G., Finkelstein, J., Wu, J. & Danjoux, C. How accurate are physicians’ clinical predictions of survival and the available prognostic tools in estimating survival times in terminally ill cancer patients? A systematic review. Clin. Oncol. (R. Coll. Radio.) 13, 209–218 (2001).
CAS Google Scholar
Cheon, S., Agarwal, A., Popovic, M., Milakovic, M., Lam, M., Fu, W. et al. The accuracy of clinicians’ predictions of survival in advanced cancer: a review. Ann. Palliat. Med. 5, 22–29 (2016).
PubMed Google Scholar
Renfro, L. A., Goldberg, R. M., Grothey, A., Sobrero, A., Adams, R., Seymour, M. T. et al. Clinical calculator for early mortality in metastatic colorectal cancer: an analysis of patients from 28 clinical trials in the aide et Recherche en Cancerologie Digestive Database. J. Clin. Oncol. 35, 1929–1937 (2017).
Article CAS PubMed PubMed Central Google Scholar
Sjoquist, K. M., Renfro, L. A., Simes, R. J., Tebbutt, N. C., Clarke, S., Seymour, M. T. et al. Personalizing survival predictions in advanced colorectal cancer: the ARCAD nomogram project. J. Natl Cancer Inst. 110, 638–648 (2018).
Article PubMed Google Scholar
Cheung, W. Y., Renfro, L. A., Kerr, D., de Gramont, A., Saltz, L. B., Grothey, A. et al. Determinants of early mortality among 37,568 patients with colon cancer who participated in 25 clinical trials from the adjuvant colon cancer endpoints database. J. Clin. Oncol. 34, 1182–1189 (2016).
Article CAS PubMed PubMed Central Google Scholar
Weiser, M. R., Gonen, M., Chou, J. F., Kattan, M. W. & Schrag, D. Predicting survival after curative colectomy for cancer: individualizing colon cancer staging. J. Clin. Oncol. 29, 4796–4802 (2011).
Article PubMed PubMed Central Google Scholar
Hippisley-Cox, J. & Coupland C. Development and validation of risk prediction equations to estimate survival in patients with colorectal cancer: cohort study. BMJ https://doi.org/10.1136/bmj.j2497 (2017).
Mahar, A. L., Compton, C., Halabi, S., Hess, K. R., Weiser, M. R. & Groome, P. A. Personalizing prognosis in colorectal cancer: A systematic review of the quality and nature of clinical prognostic tools for survival outcomes. J. Surg. Oncol. 116, 969–982 (2017).
Article PubMed PubMed Central Google Scholar
World Health Organization. International Statistical Classification of Diseases and Related Health Problems 10th Revision. https://icd.who.int/browse10/2016/en (2020).
Boyle J., Braun M., Hill J., Kuryba A., van der Meulen J., Walker K. et al. National Bowel Cancer Audit Annual Report 2018. https://www.nboca.org.uk/content/uploads/2018/12/NBOCA-annual-report2018.pdf (2018).
Office for National Statistics. Deaths. https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths (2020).
Office for National Statistics. User guide to mortality statistics - Cause of death coding. https://www.ons.gov.uk/peoplepopulationandcommunity/birthsdeathsandmarriages/deaths/methodologies/userguidetomortalitystatisticsjuly2017#cause-of-death-coding (2020).
Ministry of Housing, Communities & Local Government,. English indices of deprivation. https://www.gov.uk/government/collections/english-indices-of-deprivation (2020).
Harrell F. E. Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis. 2nd edn. (Springer, Cham, 2015).
Eastern Cooperative Oncology Group. ECOG Performance Status. https://ecog-acrin.org/resources/ecog-performance-status (2020).
International Union Against Cancer (UICC). TNM classification of malignant tumours. 5th edn. (John Wiley & Sons, New York, 1997)..
Marshall, A., Altman, D. G., Holder, R. L. & Royston, P. Combining estimates of interest in prognostic modelling studies after multiple imputation: current practice and guidelines. BMC Med. Res. Methodol. 9, 57 (2009).
Article PubMed PubMed Central Google Scholar
Cox, D. R. Regression models and life-tables. J. R. Stat. Soc. Ser. B Stat. Methodol. 34, 187–202 (1972).
Google Scholar
Ambler, G. & Royston, P. Fractional polynomial model selection procedures: Investigation of type I error rate. J. Stat. Comput. Simul. 69, 89–108 (2001).
Article Google Scholar
Royston, P. & Altman, D. G. Regression using fractional polynomials of continuous covariates—-parsimonious parametric modeling. J. R. Stat. Soc. Ser. C. Appl. Stat. 43, 429–467 (1994).
Google Scholar
Royston, P., Ambler, G. & Sauerbrei, W. The use of fractional polynomials to model continuous risk variables in epidemiology. Int. J. Epidemiol. 28, 964–974 (1999).
Article CAS PubMed Google Scholar
Brier, G. W. Verification of forecasts expressed in terms of probability. Monthly Weather Rev. 78, 1–3 (1950).
Article Google Scholar
Steyerberg, E. W., Vickers, A. J., Cook, N. R., Gerds, T., Gonen, M., Obuchowski, N. et al. Assessing the performance of prediction models: a framework for traditional and novel measures. Epidemiology 21, 128–138 (2010).
Article PubMed PubMed Central Google Scholar
Wolbers, M., Blanche, P., Koller, M. T., Witteman, J. C. & Gerds, T. A. Concordance for prognostic models with competing risks. Biostatistics 15, 526–539 (2014).
Article PubMed PubMed Central Google Scholar
Benitez Majano, S., Di Girolamo, C., Rachet, B., Maringe, C., Guren, M. G., Glimelius, B. et al. Surgical treatment and survival from colorectal cancer in Denmark, England, Norway, and Sweden: a population-based study. Lancet Oncol. 20, 74–87 (2019).
Article PubMed PubMed Central Google Scholar
Vyas D. A., Eisenstein L. G., Jones D. S. Hidden in plain sight—reconsidering the use of race correction in clinical algorithms. N. Engl. J. Med. https://doi.org/10.1056/NEJMms2004740 (2020)
Paulus, J. K., Wessler, B. S., Lundquist, C. M. & Kent, D. M. Effects of race are rarely included in clinical prediction models for cardiovascular disease. J. Gen. Intern. Med. 33, 1429–1430 (2018).
Article PubMed PubMed Central Google Scholar
Cook, N. R. Use and misuse of the receiver operating characteristic curve in risk prediction. Circulation 115, 928–935 (2007).
Article PubMed Google Scholar
Su, T. L., Jaki, T., Hickey, G. L., Buchan, I. & Sperrin, M. A review of statistical updating methods for clinical prediction models. Stat. Methods Med. Res. 27, 185–197 (2018).
Article PubMed Google Scholar
Moons, K. G., Altman, D. G., Reitsma, J. B., Ioannidis, J. P., Macaskill, P., Steyerberg, E. W. et al. Transparent reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann. Intern. Med. https://doi.org/10.7326/M14-0698 (2015).
National Institute for Health and Care Excellence. End of life care for adults: quality standard [QS13]. https://www.nice.org.uk/guidance/qs13 (2017).
Hui, D., Mori, M., Watanabe, S. M., Caraceni, A., Strasser, F., Saarto, T. et al. Referral criteria for outpatient specialty palliative cancer care: an international consensus. Lancet Oncol. https://doi.org/10.1016/S1470-2045(16)30577-0 (2016).

Download references

Author information

Authors and Affiliations

Clinical Effectiveness Unit, Royal College of Surgeons of England, London, UK
Thomas E. Cowling, Jemma Boyle, Kate Walker, Angela Kuryba & Jan van der Meulen
Department of Health Services Research and Policy, London School of Hygiene & Tropical Medicine, London, UK
Thomas E. Cowling, Jemma Boyle, Kate Walker & Jan van der Meulen
Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
Alexis Bellot
Alan Turing Institute, London, UK
Alexis Bellot
Department of Palliative Care, Addenbrooke’s Hospital, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
Sarah Galbraith
Department of Clinical Oncology, Guy’s Hospital, Guy’s and St Thomas’ NHS Foundation Trust, London, UK
Ajay Aggarwal
Department of Medical Oncology, The Christie NHS Foundation Trust, Manchester, UK
Michael Braun
Department of Medical Statistics, London School of Hygiene & Tropical Medicine, London, UK
Linda D. Sharples

Authors

Thomas E. Cowling
View author publications
You can also search for this author in PubMed Google Scholar
Alexis Bellot
View author publications
You can also search for this author in PubMed Google Scholar
Jemma Boyle
View author publications
You can also search for this author in PubMed Google Scholar
Kate Walker
View author publications
You can also search for this author in PubMed Google Scholar
Angela Kuryba
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Galbraith
View author publications
You can also search for this author in PubMed Google Scholar
Ajay Aggarwal
View author publications
You can also search for this author in PubMed Google Scholar
Michael Braun
View author publications
You can also search for this author in PubMed Google Scholar
Linda D. Sharples
View author publications
You can also search for this author in PubMed Google Scholar
Jan van der Meulen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.E.C., J.B. and J.v.d.M. conceived the study. T.E.C. conducted the data analysis. T.E.C., A.B., J.B., K.W., A.K., S.G., A.A., M.B., L.D.S. and J.v.d.M. contributed to the design of the study, the interpretation of results, revisions of manuscripts drafted by T.E.C., and approved the final version to be published.

Corresponding author

Correspondence to Thomas E. Cowling.

Ethics declarations

Ethics approval and consent to participate

As the National Bowel Cancer Audit involves analysis of data for service evaluation, it is exempt from UK National Research Ethics Committee approval. Section 251 approval was obtained from the Ethics and Confidentiality Committee for the collection of personal health data without the consent of patients. The study was performed in accordance with the Declaration of Helsinki.

Data availability

The data used in this study are available from NHS Digital and Public Health England’s Office for Data Release but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. We do not have permission to share the patient-level records used in our analysis.

Competing interests

The authors declare no competing interests.

Funding information

T.E.C. was supported by the Medical Research Council (MR/S020470/1). The National Bowel Cancer Audit is commissioned by the Healthcare Quality Improvement Partnership (HQIP) as part of the National Clinical Audit and Patient Outcomes Programme, and funded by NHS England and the Welsh Government (www.hqip.org.uk/national-programmes). Neither HQIP nor the funders had any involvement in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the article for publication.

Additional information

Note This work is published under the standard license to publish agreement. After 12 months the work will become freely available and the license terms will switch to a Creative Commons Attribution 4.0 International (CC BY 4.0).

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cowling, T.E., Bellot, A., Boyle, J. et al. One-year mortality of colorectal cancer patients: development and validation of a prediction model using linked national electronic data. Br J Cancer 123, 1474–1480 (2020). https://doi.org/10.1038/s41416-020-01034-w

Download citation

Received: 01 May 2020
Revised: 29 July 2020
Accepted: 06 August 2020
Published: 24 August 2020
Issue Date: 10 November 2020
DOI: https://doi.org/10.1038/s41416-020-01034-w
Springer Nature Limited

This article is cited by

Survival analysis in pT1-3 and paracolic lymph-node invasion colorectal cancer: the prognostic role of positive paracolic lymph-node ratio for adjuvant chemotherapy
- Xiaochuang Feng
- Weilin Liao
- Dechang Diao
Clinical and Translational Oncology (2024)

Associated content

Clinical Studies

Series 19 April 2021

One-year mortality of colorectal cancer patients: development and validation of a prediction model using linked national electronic data

Abstract

Background

Methods

Results

Conclusions

Similar content being viewed by others

Prediction of 30-day, 90-day, and 1-year mortality after colorectal cancer surgery using a data-driven approach

Developing prediction models for short-term mortality after surgery for colorectal cancer using a Danish national quality assurance database

Prognostic nomogram to predict the overall survival of patients with early-onset colorectal cancer: a population-based analysis

Background

Methods

Study populations

Outcome

Predictor variables

Statistical analysis

Sensitivity analyses

Results

Model performance

Sensitivity analyses

Discussion

Strengths and limitations

Relation to existing literature

Implications for research and practice

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Data availability

Competing interests

Funding information

Additional information

Supplementary information

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Survival analysis in pT1-3 and paracolic lymph-node invasion colorectal cancer: the prognostic role of positive paracolic lymph-node ratio for adjuvant chemotherapy

Search

Navigation