Abstract
The present work aimed to establish a new model to accurately estimate overall survival (OS) as well as cancer-specific survival (CSS) of osteosarcoma. Osteosarcoma cases were collected from the Surveillance, Epidemiology, and End Results (SEER) database between 2004 and 2017 and randomized as training or validation sets. Then, the OS- and CSS-related variables were discovered through multivariate Cox regression analysis to develop new nomograms to predict the 1-, 3- and 5-year OS and CSS. Besides, consistency index (C-index), decision curve analysis (DCA), along with calibration curve were adopted for assessing the predicting ability of our constructed nomograms after calibrating for 1-, 3- and 5-year OS and CSS. Altogether, 1727 osteosarcoma cases were enrolled in the present study and randomly divided as training (n = 1149, 70%) or validation (n = 576, 30%) set. As shown by univariate as well as multivariate Cox regression analyses, age, grade, T stage, M stage, surgery, chemotherapy, and histological type were identified to be the adverse factors to independently predict OS and CSS among the osteosarcoma cases. Besides, based on results of multivariate Cox regression analysis, we constructed the OS and CSS prediction nomograms. The C-index in training set was 0.806 (95% CI 0.769–0.836) for OS nomogram and 0.807 (95% CI 0.769–0.836) for CSS nomogram. In the meantime, C-index value in validation set was 0.818 (95% CI 0.789–0.847) for OS nomogram, while 0.804 (95% CI 0.773–0.835) for CSS nomogram. Besides, those calibration curves regarding the 3- and 5-year CSS of our constructed nomogram were highly consistent between the predicted values and the measurements in the training set as well as the external validation set. Our constructed nomogram outperformed the TNM stage in prediction. Our constructed nomogram is facile, creditable, and feasible; it efficiently predicts OS and CSS for osteosarcoma cases and can assist clinicians in assessing the prognosis for individuals and making decisions.
Similar content being viewed by others
Introduction
Osteosarcoma is among the frequently occurring primary bone cancers, which particularly affects adolescents aged below 24 years and has an estimated annual prevalence of 0.34/100,000. Osteosarcoma is marked by its aggressiveness1,2, which manifests in the early lung metastasis and fast local invasion, resulting in high relapse and mortality rates3. After the emergence of adjuvant chemotherapy as well as limb salvage surgery, patients’ survival rate increased by 50% to about 70% in the most recent two decades4,5. Nonetheless, most osteosarcomas lead to disease- and treatment-associated morbidities or mortalities. Identification of the high-risk patients early is very important so as to offer a suitable clinical option or adjuvant therapy. Osteosarcoma can have a unique challenge in a clinical setting; as a result, it is urgently needed to establish the prognostic approaches to precisely estimate survival rates from osteosarcoma.
Currently, the TNM classification system is commonly employed to predict osteosarcoma prognosis6. Typically, the TNM classification system, released by the American Joint Commission on Cancer (AJCC), has been extensively employed to classify cancer patient survivals according to tumor invasion (T), regional lymph node (N) as well as distant metastasis (M)7,8. However, it is limited to evaluate cancer prognosis using the TNM classification system alone, which may not comprehensively assess the clinicopathological variables, including sex, age, race, or additional factors and is commonly employed to predict prognosis of osteosarcoma in extremities. It might not suitable to be used in axial location. Nomogram represents the statistical approach to determine the clinical event probabilities through taking into account those pre-weight values of all factors9,10. Recently, nomogram is extensively utilized for predicting diverse cancer survival11,12,13,14,15. In recent years, the massive data of cancer patients based on open-accessed data and bioinformatics methods make it possible for us to explore the independent risk factors for cancer prognosis16,17,18. The publically accessible SEER database includes cancer patient data across 18 registered sites that cover about almost 28% USA population19,20. This work was conducted to construct a creditable nomogram for predicting overall survival (OS) as well as cancer-specific survival (CSS) of osteosarcoma cases21; thus, assisting clinicians in providing better-customized treatment options to reduce the rate of metastasis and improve the survival rate of patients.
Materials and methods
Patients selection
The osteosarcoma cases in this work were selected from the SEER database, and their corresponding anonymous clinical data were extracted accordingly. The SEER*Stat software designed by the National Cancer Institute (version 8.3.6, https://seer.cancer.gov/seerstat/) was utilized, which covered the SEER 18 Regs custom data containing more therapeutic fields and the Nov 2018 Sub database (covering 2004–2016 data).
In this work, osteosarcoma cases conforming to the following inclusion criteria were selected: (1) Those with the diagnosis of osteosarcoma as the primary malignant tumor from 1983 to 2014 based on the International Classification of Diseases for Oncology [ICD-O] 9180–9187, 9192–919422; (2) Those whose osteosarcomas were confirmed histologically; (3) Those with osteosarcoma in extremities (long/short bones in the four extremities) or at the axial location (skull, ribs, spine, and pelvis); (4) Those whose histological type was determined; (5) Those with estimated survival time or identified cause of mortality after they were diagnosed.
Exclusion criteria
Patients whose survival time was unavailable or unclear were excluded from this study.
The patient clinicopathological characteristics, such as age, gender, race, grade, histological type, tumor site, size, surgery, stage of surgery, chemotherapy, radiotherapy, and survival time, were harvested. As for age, the cases were divided as 0–24, 25–59 and > 59 years groups. The races were classified as black, white, or other (Alaskan Native/American Indian, Pacific/Asian Islander). The tumor sites were classified into an extremity (long/short bones in four extremities) or the axial location (skull, ribs, spine, and pelvis). Tumors were classified into three different sizes (≤ 89 mm, small; 89–139 mm, intermediate; and > 140 mm, large). Also, the low-grade tumors covered the moderately- and well-differentiated grades (namely, ICD-O-3 Grades I and II), whereas the high-grade tumors mainly included the poorly-differentiated and undifferentiated grades (namely, ICD-O-3 Grades III and IV). Meanwhile, surgery, chemotherapy, or radiotherapy were classified into yes or no.
Statistical analyses
The specific processes of prediction model building and nomogram construction were as follow: Firstly, all patients were randomized as training or validation set in the ratio of 7:3. The development of the nomogram was performed using the training cohort, whereas the validation cohort was responsible for the validation. Secondly, the hazard ratios (HRs) and the 95% confidence intervals (CIs) were determined using univariate as well as multivariate Cox regression models and were used for assessing the contribution of every variable to OS or CSS independently. Thirdly, we introduced the significant variables identified from univariate analysis in the multivariate analysis to develop the nomograms for predicting 1-, 3- and 5-year OS and CSS. Additionally, another nomogram was also constructed on the basis of the TNM stage. Thereafter, the MedCalc software, version 15.2.0(MedCalc Software, Mariakerke, Belgium) was utilized for generating receiver operating characteristic (ROC) curves of these two nomograms, and the respective areas under the curve (AUC) values were determined. Moreover, C-index, as well as the calibration curve (1000 bootstraps resamples), were employed to evaluate the nomogram performance. Generally, C-index is between 0.5 and 1.0, with 0.5 indicating random chance, whereas 1.0 suggesting perfect discrimination. Also, net benefits of our constructed nomograms were evaluated by using decision curve analysis (DCA).
The R package (version 3.5.3, https://www.r-project.org/) and SPSS, version 24.0 (SPSS, Chicago, USA) were employed for statistical analysis. A difference of *P ≤ 0.05 (two-sided) indicated statistical significance.
Results
Demographic and pathologic characteristics
Overall, 1725 cases were included into this work and randomized as training (n = 1149) or validation (n = 576) set. Figure 1 presents the patient inclusion process. Table 1 displays the demographic and pathological data of osteosarcoma cases. According to our results, patients aged 0–24 years (64.7%) showed the greatest morbidity of osteosarcoma, and most of the osteosarcoma cases were from white races (74.0%) and males (53.9%). As for the tumor site, osteosarcoma was most commonly located at the lower extremity (58.87%), followed by the primary axial location (28.81%). Besides, most patients were categorized into M0 stage (78.3%) and grade IV (51.4%). Besides, 86.6% of cases underwent surgical resection, while 79.8% underwent chemotherapy. Difference between the two data sets was not significant.
Identification of OS and CSS-related factors based on training set
For identifying those prognosis-related factors, univariate as well as multivariate Cox regression analysis based on training set was conducted. Univariate Cox regression suggested that age at diagnosis, T stage, N stage, M stage, Grade, histological type, chemotherapy, surgery, and radiotherapy showed significant correlations with OS, whereas age, Grade, T stage, N stage, M stage, histological type, surgery, and radiotherapy showed tight correlations with CSS. Then, these above-screened variables were incorporated into multivariate analysis, which suggested that age, Grade, T stage, M stage histological type, and surgery independently predicted OS and CSS (Tables 2, 3).
Nomograms establishment and performance evaluation
Different nomograms regarding OS and CSS were constructed based on multivariate Cox regression results (Fig. 2). Each variable was assigned a point according to HRs. The overall score of every variable was added, and its corresponding location in the total point scale was determined so as to obtain the probabilities of CSS at 1, 3 and 5 years. Disease grade contributed most to survival, as demonstrated in the CSS nomogram.
Thereafter, the time-dependent ROC curves regarding OS and CSS were analyzed. They displayed profoundly increased AUC values of our constructed nomograms (OS 0.801, 95% CI 0774–0.828; CSS 0.786, 95% CI 0.755–0.817) compared with those in the TNM stage (OS 0.683, 95% CI 0.650–0.717; CSS 0.718, 95% CI 0.682–0.754) of training set (Table 4 and Fig. 3A,B). For comparing the consistency between the estimated and real survival, we utilized C-index for verifying our constructed nomograms in the training set. Resultantly, C-indices of the constructed OS and CSS prediction nomograms (OS, C-index = 0.806; CSS, C-index = 0.807) increased in comparison to those in the TNM stage (OS, C-index = 0.686; CSS, C-index = 0.735). Table 4 shows a consistent trend detected in the validation set. This similarity in results in both study sets suggests the accuracy of the model based on our nomograms. Also, we calibrated the 3- and 5-year OS and CSS prediction nomograms in both sets (Fig. 4), which approached the optimal curve, displaying that values predicted by our nomograms were highly consistent with the real measurements in both sets.
Clinical applications
Besides, net benefits were also calculated by DCA for evaluating the clinical effectiveness of our constructed nomogram. Resultantly, the constructed nomograms showed increased clinical net benefits compared with those in the TNM stage in the wide range of OS (10–50%) (Fig. 5A,B).
Discussion
Several prognosis-related factors may have a certain influence on the survival of osteosarcoma; however, the prognosis-related factors are not integrated previously. One individual prognosis index can lead to limited accuracy in predicting the prognosis for a patient. Nomogram has been the frequently used statistical approach to achieve high robustness and precision in predicting the probability of a patient’s overall survival. Kim et al.23. established a nomogram for predicting the metastasis risk among the nonmetastatic osteosarcoma cases that was validated to outperform the tumor necrosis rate or traditional AJCC classification system alone in predicting a distant metastasis (DM) of the tumor. Xia et al.24 established a nomogram for better estimating the prognosis for osteosarcoma cases that received surgery. Findings in the above studies could not be validated, so they had limited applicability in other populations because of the possible bias.
Kim and colleagues established a nomogram for predicting metastasis in patients with extremity osteosarcoma at Enneking stage IIB based on medical records from 91 cases that underwent surgery. Nonetheless, their study had a small sample size, so larger populations are required to validate the generalizability of their nomogram. In this study, the integrated and facile prognosis nomograms were constructed based on data collected from 1725 SEER-derived osteosarcoma patients so as to determine the OS and CSS at 3 and 5 years. Furthermore, our nomogram showed C-indices of 0.808 and 0.806, which were higher than most of the other nomograms for osteosarcoma.
Variables incorporated into the constructed nomogram were classified into 2 types of factors: (1) Clinical factors (such as age, race, grade, tumor size, tumor site, histological subtype, TNM stage), (2) Treatment-related factors (including surgery, chemotherapy, and/or radiotherapy). In the present work, most of our cases were under 24 years of age and occupied 56.7% and 56.8% in training and validation sets, respectively.
As depicted in Table 1, characteristics shown by many patients are: white, males, receiving surgery, had a tumor located at the extremities, and received chemotherapy. These findings corroborate those in previous studies. For the accurate selection of prognostic factors, univariate as well as multivariate Cox analyses were conducted for identifying factors to independently predict OS and CSS. According to our findings, age, tumor size, tumor site, TNM stage and grade, showed negative correlations with OS and CSS among the osteosarcoma cases, showing conformity with earlier findings25,26,27,28. M stage represents a factor that independently affects osteosarcoma patient prognosis. It is well known that distant metastasis of malignant tumors indicates a poor prognosis, which has been unanimously recognized by scholars23. However, due to the small number of baseline and outcome variables of N1 in the N staging, this variable could not become an independent prognostic factor influencing postoperative outcomes of patients with quadruped osteosarcoma when the multivariate COX regression model was fitted.
The TNM classification system has been commonly used to estimate the survival of osteosarcoma, but it just predicts the restricted osteosarcoma risk. In the present work, the practical nomograms were successfully established using 13 variables, including age, sex, race, grade, tumor size, tumor site, histological type, TNM stage, surgery, radiotherapy, and chemotherapy. As revealed by our results, our constructed nomograms outperformed the traditional TNM classification system in predicting patient survival (C-index: 0.806 vs. 0.686, 0.807 vs. 0.735), suggesting poor evaluative prognosis of the single TNM classification system.
Remarkably, our nomograms suggested that appropriate treatment must be given to extending patient survival. Amputation was adopted as the major treatment for high-grade osteosarcoma prior to the 1970s since adjuvant chemotherapy had not emerged at that time; this had severely affected the patient life quality and survival probability. Thanks to the emergence of adjuvant chemotherapy as well as limb salvage surgery, patient survival has risen by 50% to about 70%. In the current study, for an 18-year-old black patient who had high-grade osteosarcoma at T3N1M0 stage in the extremity (tumor size, 10.0 cm), receiving surgery and chemoradiotherapy could improve OS and CSS of the patient from 20 to 72%.
Compared to previous articles on osteosarcoma based on the SEER database, we have established a 1-year, 3-year, and 5-year overall survival (OS) and cause-specific survival (CSS) nomogram using the SEER database. By scoring various risk factors (age, race, gender, metastasis, pathological stage, surgery, chemotherapy), we comprehensively assess the survival outcomes of patients at 1 year, 3 years, and 5 years. We have also evaluated the clinical utility of our approach through net benefits analysis. However, this study suffered from some limitations. Firstly, a certain bias might have crept in due to its retrospective nature. Secondly, our nomograms were constructed on the basis of the large sample size, and the model was validated internally, but external validation was lacking. It was difficult to design an external validation study since osteosarcoma is not commonly seen. Yet, our constructed nomograms were able to effectively and precisely predict the survival for individual osteosarcoma cases.
Conclusions
In this work, our constructed nomograms displayed great predicting ability. Findings in this study suggested that age, tumor site, tumor size, tumor grade, surgery, and TNM stage are factors that can independently estimate OS and CSS of osteosarcoma cases. Such factors are incorporated to establish nomograms to predict the survival of osteosarcoma cases. Our study presents a reliable and accurate method for predicting the survival of osteosarcoma patients. Additionally, the nomograms we developed can be effectively utilized to forecast the 1-, 3-, and 5-year overall survival (OS) and cancer-specific survival (CSS) rates for individual osteosarcoma cases. This valuable tool assists surgeons and clinicians in evaluating the likelihood of survival and determining the risk of mortality for each patient.
Data availability
The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.
References
Ritter, J. & Bielack, S. S. Osteosarcoma. Ann. Oncol. 21, vii320–vii325. https://doi.org/10.1093/annonc/mdq276 (2010).
Siegel, R. L., Miller, K. D. & Jemal, A. Cancer statistics. CA Cancer J. Clin. 69, 7–34. https://doi.org/10.3322/caac.21708 (2019).
Jeon, D. G. & Song, W. S. How can survival be improved in localized osteosarcoma?. Expert. Rev. Anticancer Ther. 10, 1313–1325. https://doi.org/10.1586/era.10.79 (2010).
Wang, S. D. et al. BMPR2 and HIF1-α overexpression in resected osteosarcoma correlates with distant metastasis and patient survival. Chin. J. Cancer Res. 29, 447–454. https://doi.org/10.21147/j.issn.1000-9604.2017.05.09 (2017).
Li, Y. J. et al. Prognostic value of the C-reactive protein to albumin ratio: A novel inflammation-based prognostic indicator in osteosarcoma. Onco Targets Ther. 10, 5255–5261. https://doi.org/10.2147/OTT.S140560 (2017).
Edge, S. B. & Compton, C. C. The American Joint Committee on Cancer: The 7th edition of the AJCC cancer staging manual and the future of TNM. Ann. Surg. Oncol. 17, 1471–1474. https://doi.org/10.1245/s10434-010-0985-4 (2010).
Amin, M. B. et al. The Eighth Edition AJCC Cancer Staging Manual: Continuing to build a bridge from a population-based to a more “personalized” approach to cancer staging. CA Cancer J. Clin. 67, 93–99. https://doi.org/10.3322/caac.21388 (2017).
Balachandran, V. P., Gonen, M., Smith, J. J. & DeMatteo, R. P. Nomograms in oncology: More than meets the eye. Lancet Oncol. 16, e173–e180. https://doi.org/10.1016/S1470-2045(14)71116-7 (2015).
Liu, X. C. et al. Systemic prognostic score and nomogram based on inflammatory, nutritional and tumor markers predict cancer-specific survival in stage II–III gastric cancer patients with adjuvant chemotherapy. Clin. Nutr. 38, 1853–1860. https://doi.org/10.1016/j.clnu.2018.07.015 (2019).
Zhao, F. et al. Sites of synchronous distant metastases and prognosis in prostate cancer patients with bone metastases at initial diagnosis: A population-based study of 16,643 patients. Clin. Transl. Med. 8, 30–40. https://doi.org/10.1186/s40169-019-0247-4 (2019).
Li, Y. et al. Nomograms for predicting long-term overall survival and cancer-specific survival in patients with major salivary gland cancer: A population-based study. Oncotarget 8, 24469–24482. https://doi.org/10.18632/oncotarget.14905 (2017).
Ju, J. et al. Nomograms predicting long-term overall survival and cancer-specific survival in head and neck squamous cell carcinoma patients. Oncotarget 7, 51059–51068. https://doi.org/10.18632/oncotarget.10595 (2016).
Liu, J. J. et al. Development and external validation of a prognostic nomogram for gastric cancer using the national cancer registry. Oncotarget 7, 35853–35864. https://doi.org/10.18632/oncotarget.8221 (2021).
Hyder, O. et al. A nomogram to predict long-term survival after resection for intrahepatic cholangiocarcinoma: An Eastern and Western experience. JAMA Surg. 149, 432–438. https://doi.org/10.1001/jamasurg.2013.5168 (2014).
Nieder, C., Mehta, M. P., Geinitz, H. & Grosu, A. L. Prognostic and predictive factors in patients with brain metastases from solid tumors: A review of published nomograms. Crit. Rev. Oncol. Hematol. 126, 13–18. https://doi.org/10.1016/j.critrevonc.2018.03.018 (2018).
Ren, X. H. et al. ENAM gene associated with T classification and inhibits proliferation in renal clear cell carcinoma. Aging 13, 7035–7051. https://doi.org/10.18632/aging.202558 (2021).
Zhang, D. et al. Prognostic role of DNA damage response genes mutations and their association with the sensitivity of olaparib in prostate cancer patients. Cancer Control J. Moffitt Cancer Cent. 29, 10732748221129452. https://doi.org/10.1177/10732748221129451 (2022).
Yu, L. et al. Multi-omics analysis reveals the interaction between the complement system and the coagulation cascade in the development of endometriosis. Sci. Rep. 11, 11926–11937. https://doi.org/10.1038/s41598-021-90112-x (2021).
Hankey, B. F., Ries, L. A. & Edwards, B. K. The surveillance, epidemiology, and end results program: A national resource. Cancer Epidemiol. Biomark. Prev. 8, 1117–1121 (1999).
Mirabello, L., Troisi, R. J. & Savage, S. A. Osteosarcoma incidence and survival rates from 1973 to 2004: Data from the Surveillance, Epidemiology, and End Results Program. Cancer 115, 1531–1543. https://doi.org/10.1002/cncr.24121 (2009).
Liu, Y. Q. et al. AC010973.2 promotes cell proliferation and is one of six stemness-related genes that predict overall survival of renal clear cell carcinoma. Sci. Rep. 12, 4272–4282. https://doi.org/10.1038/s41598-022-07070-1 (2022).
Kim, M. S. et al. Prognostic nomogram for predicting the 5-year probability of developing metastasis after neo-adjuvant chemotherapy and definitive surgery for AJCC stage II extremity osteosarcoma. Ann. Oncol. 20, 955–960. https://doi.org/10.1093/annonc/mdn723 (2009).
Xia, W. K. et al. Prognostic performance of pre-treatment NLR and PLR in patients suffering from osteosarcoma. World J. Surg. Oncol. 14, 127–135. https://doi.org/10.1186/s12957-016-0889-2 (2014).
Kim, S. H. et al. Postoperative nomogram to predict the probability of metastasis in Enneking stage IIB extremity osteosarcoma. BMC Cancer 14, 666–675. https://doi.org/10.1186/1471-2407-14-666 (2014).
Faisham, W. I. et al. Prognostic factors and survival rate of osteosarcoma: A single-institution study. Asia Pac. J. Clin. Oncol. 13, e104–e110. https://doi.org/10.1111/ajco.12346 (2017).
Song, W. S. et al. Prognosis of extremity osteosarcoma in patients aged 40–60 years: A cohort/case controlled study at a single institute. Eur. J. Surg. Oncol. 36, 483–488. https://doi.org/10.1016/j.ejso.2010.03.006 (2010).
Lin, L., Deng, S. Y., Zhang, F. T., Liang, Y. Z. & Huang, Z. H. The extremity localized classic osteosarcomas have better survival than the axial non-classics. World J. Surg. Oncol. 16, 39–46. https://doi.org/10.1186/s12957-018-1344-3 (2018).
Zheng, W. P. et al. Nomogram application to predict overall and cancer-specific survival in osteosarcoma. Cancer Manag. Res. 10, 5439–5450. https://doi.org/10.2147/CMAR.S177945 (2018).
Acknowledgements
The present study was supported by Medical Science and Technology Project of Henan Province (Grant no. SBGJ202103101, LHGJ20200759 and LHGJ202220862) and the Medical Science Research Project of Henan Province (Grant no. 222102310107) and the Science and Technology Project of Henan Province (Grant no. 232102311058 and 232102310175).
Author information
Authors and Affiliations
Contributions
Y.Y. wrote the main manuscript. S.W. and J.L. prepared all figures and tables. J.G. performed the statistical analysis. H.G. revised the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yu, Y., Wang, S., Liu, J. et al. Development and validation of a nomogram to predict long-term cancer-specific survival for patients with osteosarcoma. Sci Rep 13, 10230 (2023). https://doi.org/10.1038/s41598-023-37391-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-37391-8
- Springer Nature Limited