Abstract
Disulfidptosis a new cell death mode, which can cause the death of Hepatocellular Carcinoma (HCC) cells. However, the significance of disulfidptosis-related Long non-coding RNAs (DRLs) in the prognosis and immunotherapy of HCC remains unclear. Based on The Cancer Genome Atlas (TCGA) database, we used Least Absolute Shrinkage and Selection Operator (LASSO) and Cox regression model to construct DRL Prognostic Signature (DRLPS)-based risk scores and performed Gene Expression Omnibus outside validation. Survival analysis was performed and a nomogram was constructed. Moreover, we performed functional enrichment annotation, immune infiltration and drug sensitivity analyses. Five DRLs (AL590705.3, AC072054.1, AC069307.1, AC107959.3 and ZNF232-AS1) were identified to construct prognostic signature. DRLPS-based risk scores exhibited better predictive efficacy of survival than conventional clinical features. The nomogram showed high congruence between the predicted survival and observed survival. Gene set were mainly enriched in cell proliferation, differentiation and growth function related pathways. Immune cell infiltration in the low-risk group was significantly higher than that in the high-risk group. Additionally, the high-risk group exhibited higher sensitivity to Afatinib, Fulvestrant, Gefitinib, Osimertinib, Sapitinib, and Taselisib. In conclusion, our study highlighted the potential utility of the constructed DRLPS in the prognosis prediction of HCC patients, which demonstrated promising clinical application value.
Similar content being viewed by others
Introduction
According to Global Cancer Statistics1, in 2020, there were 905,677 new cases of liver cancer and 830,180 new deaths, accounting for 4.7% of all cancer cases and 8.3% of all cancer-related deaths respectively, and ranking as the third cause of cancer-related mortality. Live cancer mainly includes hepatocellular carcinoma (HCC), intrahepatic cholangiocarcinoma, and mixed liver cancer, among which HCC is the most commonly diagnosed primary cancer of the liver, comprising 75–85% of cases of all liver cancer cases. Presently, although hepatic resection, liver transplantation, radiotherapy, chemotherapy and targeted therapy have improved the clinical prognosis for some HCC patients, the prognosis of HCC remains poor, with a post-operative recurrence rate of up to 70%2. Therefore, exploring novel prognostic models and potential therapeutic targets is of great clinical significance.
In March 2023, Liu et al.3 first reported in the journal Nat Cell Bio a new form of programmed cell death, which may serve as a new targeted therapeutic approach. The study found that cell death still occurred after the knock-down of genes that regulated apoptosis and iron death, but was inhibited by the use of the thiol-based reducing reagent Tris (2-Carboxyethyl) Phosphine (TCEP). Further analysis revealed that SLC7A11, which was highly expressed in cancer cells, mediated the uptake of cystine and consequently the synthesis of glutathione to counteract the oxidative effect caused by high expression. However, under glucose starvation, the supply of nicotinamide adenine dinucleotide phosphate (NADPH) generated through the pentose phosphate pathway was insufficient, and would lead to the inhibition of the cysteine reduction to cysteine pathway and the increase of intermolecular disulphide bonds of actin, which induced disulphide stress and activated the Rac-WAVE regulatory complex (WRC)-actin-related protein (Arp2/3) signaling axis, triggering cytoskeletal disorders and cell death. In this process, excessive cystine intake combined with glucose starvation may induce cell death. Due to such stress characteristics of disulfide compounds, this form of cell death was termed disulfidptosis. It reported that GYS1, NDUFS1, OXSM, LRPPRC, NDUFA11, NUBPL, NCKAP1, RPN1, SLC3A2, and SLC7A11 were closely associated with disulfidptosis.
Long non-coding RNAs (lncRNAs) refer to a type of non-coding RNAs more than 200 nucleotides in length, which are distributed in the nucleus and cytoplasm and can regulate the biological behavior of cancer cells by binding to DNA, RNA and protein, such as apoptosis, proliferation and metastasis. Numerous studies have reported that lncRNAs are closely associated with the occurrence and development of hepatocellular carcinoma. Zhou et al.4 found that lncRNA PART1 promoted HCC proliferation, migration and invasion by regulating miRNA-149-5p/MAP2K1 Axis. Wei et al.5 found that lncRNA PAARH inhibited HCC apoptosis and promoted HCC migration, invasion and tumor growth by upregulating HOTTIP and activating HIF-1α/VEGF pathway. In addition6, lncRNAs are found to be closely associated with programmed cell death. A study found that the prognostic model based on ferroptosis-related lncRNAs (POLH-AS139384, MKLN1-AS, AC090772.3, etc.) could accurately predict the prognosis of HCC patients. The above studies have demonstrated the potential critical role of lncRNAs in the treatment and prediction of HCC, but the role of DRL in HCC remains to be elucidated.
At present, in studies on the modes of cell death of HCC, Ferroptosis-Related lncRNA7, Cuproptosis-Related lncRNA8, pyroptosis-related lncRNA9, necroptosis-related lncRNAs10 all provide valuable prognostic biomarkers and therapeutic targets for HCC. However, DRLs remain unclear, so comprehensive evaluation of DRLs and construction of prognostic markers are essential for accurate treatment of clinical diagnosis, such as monitoring of prognostic markers, early detection of disease changes, and adoption of appropriate therapeutic measures to improve the survival and life quality of patients. Therefore, in this study, we used R to perform a comprehensive analysis of TCGA-LIHC-RNA-seq data and clinical data, identifying prognostic-related DRLs in HCC and constructing a prognostic risk-scoring model. In addition, a nomogram was constructed based on DRLPS risk scores to assess OS in HCC patients. Finally, we evaluated immune infiltration and drug sensitivity in high- and low-risk groups. These results were expected to provide references for the prognosis, immunotherapy and its mechanism in HCC.
Results
Identification of DRLs in TCGA-LIHC
Based on TCGA-LIHC-RNA-seq data, 50 normal tissue samples, 374 HCC samples, and 16,876 lncRNA expression data (Supplementary Table S1) were extracted. Currently, only Liu et al.3 reported that GYS1, NDUFS1, OXSM, LRPPRC, NDUFA11, NUBPL, NCKAP1, RPN1, SLC3A2, and SLC7A11 were closely associated with disulfidptosis. Excluding normal tissue samples, 362 DRLs (Supplementary Table S2) were obtained by co-expression analysis (Fig. 1) and were intersected with survival data (survival time, survival status) of TCGA-LIHC to obtain 317 lncRNAs and 370 HCC samples (Supplementary Table S3).
Construction of DRLPS in the training set
The 370 samples were grouped into training and testing sets in a 1:1 ratio using the random function in R software, and were tested for intra-group comparability. The clinicopathological features of the two sets of samples were shown in Supplementary Table S7, with all p-values > 0.05, indicating statistical comparability. 24 prognosis-related lncRNAs were identified by univariate Cox regression (p < 0.05) (Fig. 2A). Among them, AC069307.1 (HR: 0.342, 95%-CI: 0.141–0.833) was protective lncRNA; the rest (AC074117.1, AL590705.3, WAC-AS1, CYTOR, AC116351.1, ZEB1-AS1, AC124798.1, AC026356.1, BCAR3-AS1, LINC00205, LINC01836, MIR4435-2HG, AC072054.1, PRKAR1B- AS2, NRAV, SNHG14, HMGN3-AS1, AC107959.3, ZNF232-AS1, AC016747.1, AC131009.3, RNASEH1-AS1, LINC01011) were all risk lncRNAs. Lasso regression with ten-fold cross-validation was conducted, identifying 10 candidate lncRNAs (AC074117.1, AL590705.3, ZEB1-AS1, BCAR3-AS1, MIR4435-2HG, AC072054.1, AC069307.1, AC107959.3, ZNF232-AS1, AC131009.3) (Fig. 2B, C). The candidate lncRNAs were further screened by multivariate Cox regression analysis for lncRNAs with the greatest prognostic value, and 5 DRLs were ultimately selected to construct the optimal prognostic model. DRLPS-based risk Score = AL590705.3Expi* (0.619) + AC072054.1Expi* (0.644) + AC069307.1Expi* (− 1.302) + AC107959.3Expi* (0.493) + ZNF232-AS1Expi* (0.498).
Validation of the prognostic value of DRLPS
Based on the median value (0.903) of risk score in the training set, the samples were divided into high-risk and low-risk groups. The results of PCA indicated that lncRNAs in the model was able to distinguish high-risk groups from low-risk groups (Supplementary information Fig. S1D). The risk score plot (Supplementary information Fig. S1G), the scatter plot of OS status (Supplementary information Fig. S1H), and heat map of the expression of 5 DRLs in the high- and low-risk groups (Supplementary information Fig. S1I), showed that the mortality rate gradually increased as the risk scores increased. K–M survival analysis (Supplementary information Fig. S1A) showed that OS in the low-risk group was higher than in the high-risk groups. Area under the curve (AUC) is an indicator for assessing model performance, with a range of 0.5–1. AUC < 0.5 indicates that the model has no application value, while 0.5 < AUC < 1 indicates that the model has certain clinical predictive value, and higher value indicates better predictive ability. Our results showed that AUC were 0.714, 0.805, and 0.822 for 1-, 3-, and 5-year survival, respectively (Supplementary information Fig. S1B), suggesting that the model had high prognostic value in the prognosis of HCC patients.
Compared with traditional clinical features (Supplementary information Fig. S1C), it was found that the risk model (AUC = 0.805) showed superior predictive efficacy than other clinical features, such as Age (AUC = 0.506), Gender (AUC = 0.435), Grade (AUC = 0.566), and Stage (AUC = 0.655). Univariate Cox regression revealed that the risk model was significantly associated with OS (P < 0.001; HR = 1.411, 95%-CI: 1.224–1.627) (Supplementary information Fig. S4E), and multivariate Cox regression analysis confirmed it as an independent prognostic factor in HCC patients (P < 0.001; HR = 1.343,95%-CI: 1.145–1.577) (Supplementary information Fig. S1F).
Validation of DRLPS based on the testing set and external dataset
The testing set was applied to validate the robustness of the risk model constructed in the training set. Based on the median value in the training set, the testing set was divided into high- and low-risk groups. PCA analysis (Supplementary information Fig. S2D) demonstrated reliable clustering ability of risk score. K–M survival curves (P = 0.002) (Supplementary information Fig. S2A), Time-ROC curves (1-years AUC = 0.689; 3-years AUC = 0.635; 5-years AUC = 0.643) (Supplementary information Fig. S2B), and the risk score (AUC = 0.635) (Supplementary information Fig. S2C) had better predictive efficacy than Age (AUC = 0.502), Gender (AUC = 0.452), and Grade (AUC = 0.498). The risk score plot, scatter plot of OS status and heat map of the expression of 5 DRLs (Supplementary information Fig. S2G–I), together with univariate (P = 0.003; HR = 1.277, 95%-CI: 1.087–1.499) (Supplementary information Fig. S2E) and multivariate Cox regression analyses (P = 0.014; HR = 1.264, 95%-CI: 1.048–1.523) (Supplementary information Fig. S2F) all suggested that the results in the training set were robust. Then, to further validation the performance of DRLPS, we used 115 patient samples from an external dataset. We found that all the validation results were consistent with the Train set results, specifically, based on the median value in the GSE76427 dataset, PCA could clearly differentiate between high and low risk groups (Supplementary Fig. S3D), indicating good clustering reliability. K–M survival curves (P = 0.005) indicated that the survival of the low-risk group was significantly better than that of the high-risk group (Supplementary Fig. S3A). Both multivariate (P < 0.001, HR = 1.677, 95% CI: 1.244–2.261) (Supplementary Fig. S3F) and univariate (P < 0.001, HR = 1.715, 95% CI: 1.258–2.338) (Supplementary Fig. S3E) analyses showed that the predictive ability of the risk model could be independent of other clinical features. Time-ROC curves (1-year AUC = 0.699; 3-year AUC = 0.692; 5-year AUC = 0.798) (Supplementary Fig . S3B) and the risk score (AUC = 0.692) (Supplementary Fig. S3C) were consistent with the results in the training set. Risk curves (Supplementary Fig. S3G), scatter plots (Supplementary Fig. S3H), and DRL expression heat maps (Supplementary Fig. S3I) suggested mortality increased with risk.
Establishment of a predictive nomogram for HCC patients
Considering the universality of the risk model, a nomogram was developed using DRLPS-based risk score of 370 LIHC samples in conjunction with clinical features (Gerder, Grade, Age, TMN Stage) (Fig. 3A). The calculated consistency index (C-index) was 0.745 (95%-CI: 0.689–0.8), indicating high accuracy of the nomogram in predicting OS of HCC patients (Fig. 3B).
Go and KEGG enrichment analysis
To further investigate the potential biological functions between high-risk and low-risk groups, we performed a functional enrichment analysis of GO and KEGG pathways for mRNAs that were co-expressed with the OS-related lncRNAs in the 370 LIHC samples. There are 211 DEGs(|log2FC|> 1 and P < 0.05) between low and high-risk groups (Supplementary Table S4). In the GO functional enrichment analysis (Fig. 4A), the top 6 enriched biological processes (BP) were mitotic nuclear division (GO:0140014), mitotic sister chromatid segregation (GO:0000070), sister chromatid segregation (GO:0000819), nuclear division (GO:0000280), nuclear chromosome segregation (GO:0098813), and chromosome segregation (GO:0007059). The top 6 enriched Molecular Function (MF) were microtubule binding (GO:0008017), tubulin binding (GO:0015631), microtubule motor activity (GO:0003777), protein serine kinase activity (GO:0106310), cytoskeletal motor activity (GO:0003774), and fatty acid binding(GO:0005504). The top 6 enriched Cellular Component (CC) were spindleGO:0005819condensed chromosome, centromeric region (GO:0000779), chromosomal region (GO:0098687), chromosome, centromeric region (GO:0000775), kinetochore (GO:0000776), and condensed chromosome (GO:0000793) (Supplementary Table S5).
In the KEGG pathway enrichment analysis (Fig. 4B), the top 6 enriched pathways were Cell cycle, Oocyte meiosis, Central carbon metabolism in cancer, Progesterone-mediated oocyte maturation, Motor proteins, and Cellular senescence (Supplementary Table S6).
Immune infiltration and drug sensitivity analysis
CIBERSORT algorithm was used to evaluate the abundance of 22 different immune cell types between high and low risk group in 370 LIHC samples (Fig. 5A). It was found that the infiltration abundance of M0 Macrophages in the high-risk group was significantly higher than that in the low-risk group (Fig. 5B). Through ssGESA analysis, we quantified the enrichment scores of immune-related functions in the two risk groups. The results showed that B-cells, cytolytic activity, mast cells, neutrophils, NK cells, T helper cells, Tumor-Infiltrating T Lymphocytes (TIL), Type II interferon responses (Type II IFN Response, IFN-γ) in the low-risk group were significantly higher than those in the high-risk group (Fig. 5C), which suggested an impaired anti-tumor immune function in the high-risk group. Finally, based on GDSC database, drug sensitivity analyses were performed on both high- and low-risk groups. The 50% inhibition concentration (IC50) is used to reflect anti-tumor drug sensitivity. Drug sensitivity increases as IC50 value decreases, with lower IC50 values indicating higher sensitivity and better therapeutic efficacy. It was found that the high-risk group exhibited higher sensitivity to Afatinib, Fulvestrant, Gefitinib, Osimertinib, Sapitinib, and Taselisib (Fig. 5D), while the low-risk group exhibited higher sensitivity to Cytarabine, Oxaliplatin, Axitinib, Irinotecan, NU7441, Sorafenib (Fig. 5E).
Discussion
HCC is the most representative primary cancer of liver with high morbidity and mortality11. Currently, surgical therapy and chemotherapy remain the major options for HCC patients. However, the insidious pathogenesis, complex etiology, and strong invasiveness lead to poor clinical prognosis for most patients. Recently reported disulfidptosis may serve as a targeted therapeutic approach for tumors. The high expression of SLC7A11 in cancer cells results in excessive uptake of cystine into the cell, the reduction process of which consumes large amounts of NADPH (glucose metabolite) and induces cell-specific death by inhibiting glucose transporter. lncRNAs can play a regulatory role in programmed cell death through gene transcription and post-transcriptional modification, and act as potential therapeutic targets and biomarkers for the prediction and diagnosis of cancer patients, e.g., ferroptosis-related lncRNAs signature12, and cuproptosis-related lncRNAs signature13 in the prognosis of HCC. However, it is not yet clear the value and significance of DRLs in the prediction of prognosis of HCC patients. Therefore, this study constructed a DRLPS based on machine learning and explored the biological pathways, immune infiltration and chemotherapy drugs in HCC patients with the aim of improving the OS.
To the best of our knowledge, this study is the first to report the value and significance of DRLPS in the prognosis, immune infiltration and drug sensitivity in HCC patients. Upon analysis, we identified a total of 362 DRLs, among which 24 DRLs were found to have potential prognostic value by univariate Cox analysis. Among them, AC069307.1 was an important protective factor, but has not been reported in any study. It was worth noting that our co-expression analysis was very similar to the weighted gene co-expression network analysis by Zhang et al.14. Overall, Zhang et al., used a global approach to construct co-expression network of oral squamous cell carcinoma (OSCC) and predicted the pathogenesis of OSCC and gene clusters of tumorigenesis, which was also a novel perspective and provided multiple core targets. However, the difference is that we focused only on a risk model consisting of specific lncRNAs. Despite the different types of cancers studied by us and Zhang et al., it has been shown that both techniques may be used for a wide range of other cancers, such as breast cancer, melanoma, and others. Using Lasso regression with ten-fold cross-validation to avoid overfitting and multivariate Cox regression analysis, 5 DRLs were finally selected to construct prognostic signature, including AL590705.3, AC072054.1, AC069307.1, AC107959.3, and ZNF232-AS1. Studies reported9,15 that AL590705.3 was closely associated with DNA damage repair and pyroptosis, and could be used as a biomarker for the prediction and diagnosis of gastric cancer and HCC. Notably, AC072054.1 and ZNF232-AS1 have not been reported in previous studies and may be new markers. Huang et al.16 reported that AC107959.3 was one of the important prognostic targets for predicting hepatitis virus-positive HCC patients. Based on the median value of risk score, HCC patients were divided into high-risk and low-risk groups. PCA was used to visualize distinction between the high- and low-risk groups. K–M analysis showed that OS in the high-risk group was significantly lower than that in the low-risk group. Multivariate Cox regression showed that the constructed risk score model was a prognostic factor independent of other clinical features (Age, Gender, Grade, and Stage). The area under the ROC curve indicated good predictive efficacy of the DRLPS-based risk model, which was further validated by the testing set and outside validation set. Finally, the constructed nomogram showed a high degree of congruence between predicted 1-, 3- and 5-year OS and observed OS, indicating that the model could be used for clinical diagnosis of HCC patients. Although earlier studies reported some biomarkers and prognostic analyses for HCC, which were similar to our study, however, their shortcomings were quite obvious: they did not use more methods to investigate the dimensionality reduction, accuracy, and generalization ability of the model17. In this study, we used multiple machine learning approaches to improve the accuracy of biomarkers as well as their generalization ability in clinical research. In addition, early studies focused on the construction of Ferroptosis, apoptosis, and Cuproptosis prognostic markers, while Disulfidptosis is a glucose-dependent mode of death, based on which the DRLPs we constructed may provide great clinical treatment windows and new treatment ideas. For example, in interference models, the DRLs regulate the disulfidptosis-related mRNA or inhibit glucose transporter proteins by competitively binding to miRNAs, thus exerting anti-tumor effects.
Subsequently, to further reveal the differential biological functions and pathways between high- and low-risk groups, we performed GO function and KEGG pathway analyses. The results demonstrated that the top 6 biological processes were all associated with mitosis. The main proliferation mode of cancer cells is mitosis, and mitotic blockade has become an important breakthrough point in cancer-related clinical research, where attempts have been made to inhibit the proliferation, migration and invasiveness cancer cells through RNA interference technology or small molecule inhibitors. This result further suggests that lncRNAs in the model may be potential biomarkers for therapy. KEGG showed that DEGs were mainly enriched in cell cycle, oocyte meiosis, central carbon metabolism in cancer, progesterone-mediated oocyte maturation, motor proteins and cellular senescence pathway. Notably, cell cycle was the most significantly enriched pathway and an important cancer pathway, regulating cell growth, proliferation and differentiation. The four phases (G1-M) are mainly regulated by the conserved cyclin-dependent kinase (CDK)-cyclin protein complex, and are subject to cell cycle checkpoints18. Disruption of the cell cycle may lead to cell cycle arrest, DNA damage, abnormal spindle assembly, etc., causing abnormal cell proliferation and promoting the occurrence and development of tumor. Currently, commonly used chemotherapy drugs are designed to directly damage DNA and thus slow down cancer progression, but DNA repair mechanisms and cell cycle checkpoints are the main factors of drug resistance.
At present, immunotherapy is one of the main treatments for HCC patients, but the development of immunotherapy vastly depends on the recognition of immune microenvironment of the tumor19. To this end, we further analyzed immune infiltration in high- and low-risk groups. It was found that the infiltration abundance of M0 Macrophages in the high-risk group was significantly higher than that in the low-risk group. Study found that M0 macrophages in tumors prevent T cells from attacking tumor cells and secrete growth factors that encourage tumor angiogenesis20. Study reported that high infiltration of M0 macrophages in HCC patients showed a poor overall OS21. This result is consistent with the study of Jiang et al.22, which analyzed the association of risk scores constructed by hypoxia associated genes and immune infiltration, and found that HCC patients in the low-risk group had significantly higher M0 macrophages than those in the high-risk group. Although we found differential levels of immune cell infiltration, it was not clear how immune cell function differed between the high- and low-risk groups. Our study further identified differences in B cells, cytolytic activity, mast cells, neutrophils, NK cells, T helper cells, TIL, Type II IFN Reponse. B cells are the main effector cells of humoral immunity and are able to promote T cell response by secreting immunoglobulins, which may directly lyse tumor cells to suppress tumor progression23. According to Garnelo et al.24, tumor-infiltrating B cells and T cells were observed in close contact with each other in tumor tissues, and the functional interactions between them contributed to better prognosis of HCC patients. In addition, the immune functions of cytolytic activity25,26, mast cells27,28, neutrophils29,30, NK cells31,32, T helper cells33,34, TIL35,36, and Type II IFN Reponse37,38 were significantly higher in the low-risk group than those in the high-risk group, suggesting that the low-risk group has better immune microenvironment and prognosis than the high-risk group. Intriguingly, the function of neutrophils in the low-risk group was significantly better than that in the high-risk group, which was similar to the immune infiltration results of Yuan et al.39. They constructed a prognostic model for copper metabolism-related lncRNAs in HCC and subsequently found in immune-related analyses that Neutrophils were significantly higher in the low-risk group than in the high-risk group, which they did not explain. Neutrophils normally suppress the immune system's response to cancer and their elevation is associated with a poor prognosis of cancer. Further analyses revealed that neutrophils are able to kill tumor cells by stimulating the production of Reactive Oxygen Species, which further damage DNA and proteins and cause cell death. Finally, we identified 12 FDA-approved drugs. Among them, Afatinib40, Fulvestrant41, Gefitinib42, Osimertinib43, Cytarabine44, Oxaliplatin45, Axitinib46, Irinotecan47, Sorafenib48, NU744149, and Sapitinib50 have been extensively reported in numerous studies for their value in HCC, while Taselisib (GDC-0032) has not yet been reported in terms of its association with HCC. Taselisib is a novel51, oral P13K selective inhibitor that is currently in clinical trial and has been reported in studies of breast cancer model52, head and neck squamous carcinomas model53 and pancreatic ductal adenocarcinoma54. In conclusion, abovementioned findings all suggested that our constructed prognostic DRLPS could accurately predict the prognosis of HCC patients, as well as provide new therapeutic strategies.
While our study reported for the first time the prognostic and clinical therapeutic value of DRLPS in HCC, there are several limitations. Firstly, since disulfidptosis is a newly discovered form of programmed cell death, the specific mechanism of relevant lncRNAs in disulfidptosis remains to be further explored, which will be our main research orientation afterwards. In addition, since our study was mainly based on integrative bioinformatics and public databases for analyses, the lack of experimental validation is the biggest regret of this study.
Conclusion
Our study constructed a DRLPS by disulfidptosis-related mRNA-lncRNA coexpression that had good predictive value for the prognosis of HCC patients and may help to improve the prognosis of HCC patients and provide novel approach to more personalized treatments of immunotherapy, but further experimental validation is still needed. In the future, the results of immune infiltration can be further validated by transcriptomics, immunohistochemistry and flow cytometry, and the more specific mechanism of marker lncRNA in HCC can be validated by qPCR, dual-luciferase report assay, RNA in situ Hybridization, RNA Affinity Purification, CHIP-Seq, knockdown or overexpression experiments.
Materials and methods
Data acquisition and processing
Figure 6 presents the research flow chart of our study design. TCGA-LIHC-RNA-seq data (which contains mRNA and lncRNA expression) and clinical data were downloaded from The Cancer Genome Atlas (TCGA) website (https://portal.gdc.cancer.gov/). Subsequently, based on the genome annotation file (GENCODE V36) provided by the R-loopBase online database (https://rloopbase.nju.edu.cn/), Strawberry Perl (Version 5.30.0.1) was used to obtain lncRNA and mRNA lists in the TCGA-LIHC-RNA-seq data and extracted expression for subsequent analysis. DRGs were obtained from previous literature3. DRGs expression in TCGA-LIHC-mRNA was extracted using Strawberry Perl. mRNA-lncRNA coexpression analysis of extracted DRGs and TCGA-LIHC-lncRNA expression was performed with “limma” R package to identify DRLs (|Pearson correlation coefficient (PCC)|> 0.4 and P < 0.001).
Construction of DRLPS in the training set
DRLs and clinical data (survival time and survival status) were integrated using the “limma” R package. The samples (n = 1) were randomly divided into training and testing sets in a 1:1 ratio using the R software Caret package. The training set was used to train the supervised model, which adjusted the parameters according to the data in the training set and optimized the loss function to minimize the gap between the predicted and true values. The “survival” R package was used to perform univariate Cox regression analysis (P < 0.05) on the training set to identify prognostic DRLs. The core assumption of Cox regression analysis is the Proportional Hazards Assumption, i.e. Hazard Ratio (HR) maintains a constant ratio, independent of time. LASSO regression was then applied to these prognostic candidates with the “glment” R package. LASSO is a commonly used regularization in many regression analysis methods for variable selection and shrinkage in Cox’s proportional hazards model. The core principle of Lasso regression is to drive some of the regression coefficients to be strictly equal to zero by minimizing the sum of squares of the residuals under the condition that the sum of the absolute values of the regression coefficients is less than a constant, so as to reduce the complexity of the model and improve the performance and prediction accuracy. In addition, Lasso uses the sum of absolute values of the regression coefficients as a penalty function. Based on 1-SE (Standard Error) criterion, the model is trained with different penalty parameters λ, and performance is evaluated using cross-validation to find out the optimal λ. This process aims to improve the generalization ability of the model while maintaining the model fitting ability55,56,57. Therefore, the candidate DRLs were consequently identified through the optimal penalty parameter λ via the 1−SE (standard error) criterion. Finally, multivariate Cox analysis was conducted to determine the optimal DRLPS for HCC patients based on the minimum Akaike information criterion (AIC) value. AIC is a statistic used in model selection to measure the balance between the goodness of fit and the complexity of the model, with a smaller value indicating a better model. DRLPS-based risk scores were calculated using the following formula: \(RiskScore = \mathop \sum \limits_{i = 1}^{N} \left( {Expi*Ci} \right)\), where N is the number of lncRNAs, Expi is the expression level of each DRL, and Ci is the LASSO regression coefficient.
Evaluation and validation of DRLPS
Based on the median value of the risk score in the training set, the patients in the training and testing sets were divided into low-risk and high-risk groups. Testing set was used to examine the efficacy of the prognostic model. Testing set is obtained by differentiating the patient samples of the entire TGCA-LIHC cohort by 5:5 when the machine learning is used. It is mainly used to measure the performance of the trained model, without changing its parameters and effects, and verify whether the model was over-fitting or under-fitting, and its generalization ability when actually used. Subsequently, to further evaluate the performance of DRLPS, we used the GSE76427 dataset downloaded from the GEO database as an outside validation set. GSE76427 contained 115 HCC patient samples obtained from the GEO database, and the clinical information in it included fustat, fustime, age, gender, and stage to support our validation of DRLPS. K–M survival curve analysis, risk curves, univariate and multivariate independent prognostic analysis, ROC curve, and Principal components analysis (PCA) were performed using the “limma”, “survival”, “survivor”, “survminer”, “survivalROC”, and “timeROC” R packages to assess the prognostic model. The testing set was applied to validate this model.
GO and KEGG enrichment analysis
ID annotation, GO and KEGG enrichment analyses were performed by “org.Hs.eg.db” and “clusterProfiler” R package with the screening criteria of |log2(fold change) |> 1 and FDR < 0.05.
Immune infiltration analysis
The CIBERSORT (http://CIBERSORT.stanford.edu/) algorithm was used to evaluate the infiltration abundance of 22 immune cells between the high-risk and low-risk groups58. Obtaining immune.gmt from http://www.gsea-msigdb.org/gsea/index.jsp, ssGSEA was used to assess immune function in the high-risk and low-risk groups.
Drug sensitivity analysis
Expression data and pan-cancer drug sensitivity data were obtained from the Genomics of Drug Sensitivity in Cancer database (www.cancerRxgene.org), and drug sensitivity analyses were performed on the high and low risk groups using the “oncoPredict” R package.
Statistical analysis
R software (Version 4.2.3; the R Foundation, St. Louis, MO, USA) was used to calculate PCC to analyze the correlation between DRGs and DRLs. The chi-squared test was used to examine differences in clinical features. Univariate and multivariate Cox regression analysis and LASSO regression were performed to construct the optimal DRLPS for HCC. Univariate and multivariate Cox regression analysis was used to calculate Hazard Ratio (HR) with 95% confidence interval. CIBERSORT and ssGSEA algorithms was used to assess immune infiltration and function. P value < 0.05 was reckoned as statistically significant.
Data availability
The original contributions presented in the study are included in the article/ Supplementary Material. Further inquiries can be directed to the corresponding authors.
References
Sung, H. et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA-Cancer J. Clin. 71, 209 (2021).
Vagefi, P. A., Dodge, J. L., Yao, F. Y. & Roberts, J. P. Potential role of the donor in hepatocellular carcinoma recurrence after liver transplantation. Liver Transplant 21, 187 (2015).
Liu, X. et al. Actin cytoskeleton vulnerability to disulfide stress mediates disulfidptosis. Nat. Cell Biol. 25, 404 (2023).
Zhou, C. et al. Long non-coding RNA PART1 promotes proliferation, migration and invasion of hepatocellular carcinoma cells via miR-149-5p/MAP2K1 axis. Cancer Manag. Res. 12, 3771 (2020).
Wei, H. et al. Long non-coding RNA PAARH promotes hepatocellular carcinoma progression and angiogenesis via upregulating HOTTIP and activating HIF-1alpha/VEGF signaling. Cell Death Dis. 13, 102 (2022).
Yang, X. et al. Ferroptosis-related long non-coding RNA signature predicts the prognosis of hepatocellular carcinoma. Aging Albany N.Y. 14, 4069 (2022).
Xu, Z. et al. Construction of a ferroptosis-related nine-lncRNA signature for predicting prognosis and immune response in hepatocellular carcinoma. Front. Immunol. 12, 719175 (2021).
Chen, Q. et al. Cuproptosis-related LncRNA signature for predicting prognosis of hepatocellular carcinoma: A comprehensive analysis. Dis. Mark. 2022, 3265212 (2022).
Cheng, Z., Han, J., Jiang, F., Chen, W. & Ma, X. Prognostic pyroptosis-related lncRNA signature predicts the efficacy of immunotherapy in hepatocellular carcinoma. Biochem. Biophys. Rep. 32, 101389 (2022).
Peng, Y. et al. Construction and validation of a necroptosis-related lncRNAs prognosis signature of hepatocellular carcinoma. Front. Genet. 13, 916024 (2022).
Wang, H., Chen, X. & Calvisi, D. F. Hepatocellular carcinoma (HCC): The most promising therapeutic targets in the preclinical arena based on tumor biology characteristics. Expert Opin. Ther. Target 25, 645 (2021).
Liang, J. et al. Development and validation of ferroptosis-related lncRNAs signature for hepatocellular carcinoma. PEERJ 9, e11627 (2021).
Liu, C. et al. Comprehensive analysis of cuproptosis-related lncRNAs in immune infiltration and prognosis in hepatocellular carcinoma. BMC Bioinform. 24, 4 (2023).
Zhang, X. et al. Application of weighted gene co-expression network analysis to identify key modules and hub genes in oral squamous cell carcinoma tumorigenesis. Oncotargets Ther. 11, 6001 (2018).
Zhao, Z. et al. The DNA damage repair-related lncRNAs signature predicts the prognosis and immunotherapy response in gastric cancer. Front. Immunol. 14, 1117255 (2023).
Huang, Z. L., Li, W., Chen, Q. F., Wu, P. H. & Shen, L. J. Eight key long non-coding RNAs predict hepatitis virus positive hepatocellular carcinoma as prognostic targets. World J. Gastro Oncol. 11, 983 (2019).
Liu, G. M., Zeng, H. D., Zhang, C. Y. & Xu, J. W. Identification of a six-gene signature predicting overall survival for hepatocellular carcinoma. Cancer Cell Int. 19, 138 (2019).
Tomas-Loba, A. et al. p38gamma is essential for cell cycle progression and liver tumorigenesis. Nature 568, 557 (2019).
Li, D. et al. The m6A/m5C/m1A regulated gene signature predicts the prognosis and correlates with the immune status of hepatocellular carcinoma. Front. Immunol. 13, 918140 (2022).
Vinnakota, K. et al. M2-like macrophages induce colon cancer cell invasion via matrix metalloproteinases. J. Cell Physiol. 232, 3468 (2017).
Zhang, Y., Zou, J. & Chen, R. An M0 macrophage-related prognostic model for hepatocellular carcinoma. BMC Cancer 22, 791 (2022).
Jiang, H. Y., Ning, G., Wang, Y. S. & Lv, W. B. Ahypoxia-related signature enhances the prediction of the prognosis in hepatocellular carcinoma patients and correlates with sorafenib treatment response. Am. J. Transl. Res. 12, 7762 (2020).
Ryuma, T. et al. B cell and B cell-related pathways for novel cancer treatments. Cancer Treat. Rev. 73, 10 (2018).
Garnelo, M. et al. Interaction between tumour-infiltrating B cells and T cells controls the progression of hepatocellular carcinoma. GUT 66, 342 (2017).
Takahashi, H. et al. Immune cytolytic activity for comprehensive understanding of immune landscape in hepatocellular carcinoma. Cancers 12, 1221 (2020).
Chieochansin, T., Thepmalee, C., Grainok, J., Junking, M. & Yenchitsomanus, P. T. Cytolytic activity of effector t-lymphocytes against hepatocellular carcinoma is improved by dendritic cells pulsed with pooled tumor antigens. Sci. Rep.-UK 9, 17668 (2019).
Zhang, H., Sun, L. & Hu, X. Mast cells resting-related prognostic signature in hepatocellular carcinoma. J. Oncol. 2021, 4614257 (2021).
Rohr-Udilova, N. et al. Morphometric analysis of mast cells in tumor predicts recurrence of hepatocellular carcinoma after liver transplantation. Hepatol. Commun. 5, 1939 (2021).
Linde, I. L. et al. Neutrophil-activating therapy for the treatment of cancer. Cancer Cell 41, 356 (2023).
Geh, D. et al. Neutrophils as potential therapeutic targets in hepatocellular carcinoma. Nat. Rev. Gastro Hepat. 19, 257 (2022).
Sun, H. et al. Human CD96 correlates to natural killer cell exhaustion and predicts the prognosis of human hepatocellular carcinoma. Hepatology 70, 168 (2019).
Yu, M. et al. Development of GPC3-specific chimeric antigen receptor-engineered natural killer cells for the treatment of hepatocellular carcinoma. Mol. Ther. 26, 366 (2018).
Sung, C. Y., Lee, N. P. & El-Nezami, H. Regulation of T helper 17 by bacteria: An approach for the treatment of hepatocellular carcinoma. Int. J. Hepatol. 2012, 439024 (2012).
Kuang, D. M. et al. Activated monocytes in peritumoral stroma of hepatocellular carcinoma promote expansion of memory T helper 17 cells. Hepatology 51, 154 (2010).
Hiraoka, N. Tumor-infiltrating lymphocytes and hepatocellular carcinoma: Molecular biology. Int. J. Clin. Oncol. 15, 544 (2010).
Zheng, X., Jin, W., Wang, S. & Ding, H. Progression on the roles and mechanisms of tumor-infiltrating T lymphocytes in patients with hepatocellular carcinoma. Front. Immunol. 12, 729705 (2021).
Li, H. et al. Interaction between polymorphisms of IFN-gamma and MICA correlated with hepatocellular carcinoma. Med. Sci. Monitor 22, 549 (2016).
Li, J. et al. IL-17 promotes hepatocellular carcinoma through inhibiting apoptosis induced by IFN-gamma. Biochem. Biophys. Res. Commun. 522, 525 (2020).
Yuan, W. et al. Identification of a cuproptosis and copper metabolism gene-related lncRNAs prognostic signature associated with clinical and immunological characteristics of hepatocellular carcinoma. Front. Oncol. 13, 1153353 (2023).
Chen, J. et al. Ethoxy-erianin phosphate and afatinib synergistically inhibit liver tumor growth and angiogenesis via regulating VEGF and EGFR signaling pathways. Toxicol. Appl. Pharm. 438, 115911 (2022).
Wang, C. J. et al. Inhibition of hepatocellular carcinoma by fulvestrant involves the estrogen receptor alpha and Wnt pathways in vitro and in patients. Mol. Med. Rep. 10, 3125 (2014).
Tong, Y. et al. Inhibitory effects of genistein in combination with gefitinib on the hepatocellular carcinoma Hep3B cell line. Exp. Ther. Med. 18, 3793 (2019).
Huang, Q., He, S. & Zhan, D. Osimertinib is a dual inhibitor of hepatocellular carcinoma and angiogenesis in an EGFR-independent manner, and synergizes with venetoclax. J. Cancer Res. Clin. 149, 12 (2023).
Liao, Z. Y., Zhang, S. H. & Zhen, Y. S. Synergistic effects of geldanamycin and antitumor drugs. Yao Xue Xue Bao 36, 569 (2001).
Ma, L. et al. LSD1-demethylated LINC01134 confers oxaliplatin resistance through SP1-induced p62 transcription in HCC. Hepatology 74, 3213 (2021).
Li, W. et al. Knockdown of LINC00467 contributed to Axitinib sensitivity in hepatocellular carcinoma through miR-509-3p/PDGFRA axis. Gene Ther. 28, 634 (2021).
Emami, J. et al. PLGA-PEG-RA-based polymeric micelles for tumor targeted delivery of irinotecan. Pharm. Dev. Technol. 23, 41 (2018).
Huang, A., Yang, X. R., Chung, W. Y., Dennison, A. R. & Zhou, J. Targeted therapy for hepatocellular carcinoma. Signal Transduct. Target 5, 146 (2020).
Yang, C. et al. NU7441 enhances the radiosensitivity of liver cancer cells. Cell Physiol. Biochem. 38, 1897 (2016).
Yang, Y. et al. The 3D genomic landscape of differential response to EGFR/HER2 inhibition in endocrine-resistant breast cancer cells. BBA-Gene Regul. Mech. 1863, 194631 (2020).
You, H. et al. Taselisib moderates neuropathic pain through PI3K/AKT signaling pathway in a rat model of chronic constriction injury. Brain Res. Bull. 199, 110671 (2023).
Hoeflich, K. P. et al. The PI3K inhibitor taselisib overcomes letrozole resistance in a breast cancer model expressing aromatase. Genes Cancer 7, 73 (2016).
Zumsteg, Z. S. et al. Taselisib (GDC-0032), a potent beta-sparing small molecule inhibitor of PI3K, radiosensitizes head and neck squamous carcinomas containing activating PIK3CA alterations. Clin. Cancer Res. 22, 2009 (2016).
Xie, P., Tan, S. Y., Li, H. F., Tang, H. D. & Zhou, J. H. Transcriptome data-based status of PI3K/AKT/mTOR pathway indicates heterogeneity and immune modulation in patients with pancreatic ductal adenocarcinoma. J. Gene Med. 2023, e3570 (2023).
Wang, L., Shen, J. & Thall, P. F. A modified adaptive lasso for identifying interactions in the cox model with the heredity constraint. Stat. Prob. Lett. 93, 126 (2014).
Yin, Q. Y., Li, J. L. & Zhang, C. X. Ensembling variable selectors by stability selection for the cox model. Comput. Intel. Neurosci. 2017, 2747431 (2017).
Tibshirani, R. The lasso method for variable selection in the Cox model. Stat. Med. 16, 385 (1997).
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12, 453 (2015).
Acknowledgements
We sincerely acknowledge The Cancer Genome Atlas (TCGA) for providing transcriptomic and clinicopathological data.
Funding
This work was supported by grants from the National Natural Science Foundation of China (32171130 to PS), and East China Normal University Interdisciplinary Advancement Project (04-222351 to PS).
Author information
Authors and Affiliations
Contributions
L.P., C.P. and X.Z. analyzed, interpreted the data and wrote the manuscript. P.S. and L.X. designed the study, wrote the research outline and critically reviewed and revised the paper. Y.S. and P.G. processed the data in the softwares. D.W., X.L. and B.W. contributed to data acquisition, collation and processing in the softwares. All authors contributed to the article and approved the submitted version.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pu, L., Sun, Y., Pu, C. et al. Machine learning-based disulfidptosis-related lncRNA signature predicts prognosis, immune infiltration and drug sensitivity in hepatocellular carcinoma. Sci Rep 14, 4354 (2024). https://doi.org/10.1038/s41598-024-54115-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-54115-8
- Springer Nature Limited