Abstract
Our research endeavors are directed towards unraveling the stem cell characteristics of lower-grade glioma patients, with the ultimate goal of formulating personalized treatment strategies. We computed enrichment stemness scores and performed consensus clustering to categorize phenotypes. Subsequently, we constructed a prognostic risk model using weighted gene correlation network analysis (WGCNA), random survival forest regression analysis as well as full subset regression analysis. To validate the expression differences of key genes, we employed experimental methods such as quantitative Polymerase Chain Reaction (qPCR) and assessed cell line proliferation, migration, and invasion. Three subtypes were assigned to patients diagnosed with LGG. Notably, Cluster 2 (C2), exhibiting the poorest survival outcomes, manifested characteristics indicative of the subtype characterized by immunosuppression. This was marked by elevated levels of M1 macrophages, activated mast cells, along with higher immune and stromal scores. Four hub genes—CDCA8, ORC1, DLGAP5, and SMC4—were identified and validated through cell experiments and qPCR. Subsequently, these validated genes were utilized to construct a stemness risk signature. Which revealed that Lower-Grade Glioma (LGG) patients with lower scores were more inclined to demonstrate favorable responses to immune therapy. Our study illuminates the stemness characteristics of gliomas, which lays the foundation for developing therapeutic approaches targeting CSCs and enhancing the efficacy of current immunotherapies. By identifying the stemness subtype and its correlation with prognosis and TME patterns in glioma patients, we aim to advance the development of personalized treatments, enhancing the ability to predict and improve overall patient prognosis.
Similar content being viewed by others
Introduction
Glioma stands out as one of the most prevalent intracranial tumors, presenting a formidable challenge in terms of treatment and cure. Lower-grade glioma, which is classified into two grades (grade II and grade III), accounts for 10–20% of all primary brain tumors1. While the outlook is considerably more favorable compared to high-grade gliomas, the median survival is only 5.6 to 13.3 years, depending on the type of pathology2. The goal of patients is often to extend overall survival with quality of life. The more favorable prognosis of lower-grade glioma compared to high-grade gliomas is uniquely challenging for both doctors and patients, requiring adjustments in the treatment approach to prevent tumor recurrence and high-level transformation3. Thus, individualized treatment is more needed in lower-grade gliomas.
CSCs are a subgroup of glioma cells that have the characteristics of unrestricted proliferation, multidirectional differentiation, and inducing tumor drug resistance4. Because CSCs have extensive proliferative capacity5,6, they can form diffuse metastatic tumors and affect the progression, metastasis, and treatment resistance of glioma7, making them a promising therapeutic target8.
In addition, the tumor microenvironment (TME) will continuously communicate with CSCs through intercellular crosstalk9. Several CSC biomarkers have been identified to contribute to the development, progression, and prognosis of glioma10,11,12. Research indicates that glioma-associated CSCs secrete higher levels of TGF-β compared to other types of tumors13. CSCs have the ability to release interleukin-6 (IL-6), triggering the activation of STAT3 signaling in tumor cells and promoting tumor growth14. CSCs also can exhibit elevated expression levels of the epidermal growth factor receptor, leading to increased proliferation and survival15. Therefore, exploring the interaction between CSCs with genetic heterogenicity and TME characteristics will help to determine the targeted treatment strategy for glioma, upgrade patient outcomes and extend survival periods.
To enhance our understanding of the complexities inherent in the stemness landscape of glioma, we performed a comprehensive analysis based on an extensive collection of shared sets comprising 26 genes related to stem cells. The single-sample gene set enrichment analysis (ssGSEA) was used to facilitate the evaluation of the stem cell phenotype through the expression of all genes extracted from a substantial number of glioma samples. Utilizing unsupervised clustering techniques, we identified three distinct stemness subtypes with varying degrees of stemness. Proceeding from this, we pinpointed genes exhibiting a high level of correlation with the aforementioned stemness subtypes, as well as prognosis, using WGCNA.
After conducting all subset regression analysis, and performing single-factor Cox analysis, we developed a stemness-related prognostic signature. Subsequently, we explored its associations with TME characteristics, prognosis, immunotherapy efficacy, and the effectiveness of chemotherapy in lower-grade glioma. Our study successfully established a stem cell risk scoring system, providing a valuable tool for evaluating the glioma stem cell landscape.
Materials and methods
LGG tissue specimen collection
Between December 2021 and June 2022, we obtained five LGG tissue specimens and six normal peritumoral tissue specimens from the Department of Neurosurgery at Zhongnan Hospital of Wuhan University. All enrolled patients had a verified LGG diagnosis through pathology, gave informed consent, and the study received ethical approval from the committee.
LGG cohorts consolidation and preprocessing
We gathered 644 LGG samples from three different cohorts In aggregate, including TCGA-LGG, CGGA, and GSE43378 (grade II and grade III), along with their corresponding clinical and survival annotations. To ensure sample comparability, we acquired data for TCGA-LGG using the Cancer Genome Atlas (TCGA) database (https://genomecancer.ucsc.edu/). And transformed it into transcripts per million (TPM) form, accounting for differences in gene length16. As for the remaining five glioma-associated data cohorts, CGGA17,18,19,20, GSE4337821, Gravendeel22, GSE10785023 and Rembrandt24, we retrieved them from the GlioVis database (http://gliovis.bioinfo.cnio.es/)25 with the samples belonging to high-grade gliomas proposed. And preprocessed them via the Robust Multichip Average algorithm26. The method sva (3.44.0) was applied to mitigate the batch effect caused by different sequencing platforms, equipment, and reagents with the combat function27.
Collection and grouping of stem cell characteristics of LGG stem cell subtypes
To comprehensively identify LGG stemness subtypes, we gathered an impressive compilation of 26 sets which includ stem cell-related genes from the highly esteemed Internet basic tool StemChecker, which is accessible at http://stemchecker.sysbiolab.eu/28. Subsequently, utilizing the GSVA (Gene Set Variation Analysis) R package (v1.44.5)29, all LGG sample’ enrichment scores of stem cell-related gene sets were calculated. This was achieved through the strategic implementation of the ssGSEA algorithm. As an integral next step in our study, the ssGSEA scores obtained were then subjected to unsupervised consensus clustering on LGG samples. This was accomplished via the ConsensusClusterPlus R package (v1.60.0)30, which was executed using the K-means clustering method based on Euclidean distance. In order to ensure optimal accuracy and reliability, the clustering analysis was repeated a staggering 1000 times. Ultimately, the consensusClusterPlus R package and factoextra R package31 were implemented in order to expertly and effectively determine the ideal number of clusters.
Tumor microenvironment (TME) infiltration exploration
To thoroughly explore the intricacies of TME infiltrations in LGG samples, The well-known CIBERSORT deconvolution algorithm32 is utilized for assess the relative proportion of common immune cells based on the standardized large sample gene expression profile. With the "CIBERSORT" R package being utilized in conjunction with 1000 permutations and the gene signature of LM22 leukocyte. In this way, the TME fractions in each LGG sample were expertly and effectively quantified. In addition, a rigorous screening process was executed on samples with P < 0.05 as the cut-off of CIBERSORT analysis. In order to ensure maximal accuracy and dependability. Moreover, the incredibly powerful and highly regarded ESTIMATE algorithm33 was leveraged to estimate the stromal and immune scores for research dataset. This was accomplished through the R package estimate, the algorithm that can estimate the immune and mechanism level of the sample through RNA-seq data, and then evaluate the purity of the tumor.
Predicting immunotherapy response and chemotherapeutic sensitivity
To predict chemosensitivity in LGG samples, we used oncoPredict (V0.2)34, an R package based on Genomics of Drug Sensitivity in Cancer (GDSC) (https://www.cancerRxgene.org/)35, to compute the number of half-maximum inhibitory concentration (IC50) of various chemotherapeutic agents using ridge regression36. To assess prediction accuracy, tenfold cross-validation was employed. Additionally, we estimated immunotherapeutic responses in LGG patients via Tumor Immune Dysfunction and Exclusion (TIDE), a network of scientific research tools (http://tide.dfci.harvard.edu/).
WGCNA for key module identification
To construct an expression matrix from TCGA-LGG data, we used the GoodSamplesGenes method and sample network approach. We applied a statistical significance criteria value of Z.Ku = 2.5 (Z.ku = (ku-mean(k))/(sqrt(var(k))) that produced exceptional results37. Using the WGCNA R package (V1.71), we generated co-expression networks. Via the branch cutting approach, all genes were split into gene modules with min-ClusterSize = 30 and deepSplit = 2 set as crucial parameters38. We selected modules with high correlation (greater than 0.6) by assessing the variances in module eigengenes. To identify key modules associated with cluster (selected disease characteristics is cluster C1 or cluster C2), we evaluated gene significance (GS) and calculated module membership (MM) based on all genes’ value of average GS. Finally, we chose the module considered the most pertinent and crucial, as elucidated in the aforementioned description.
Calculation of the mRNA stemness index (mRNAsi)
We use the OCLR algorithm based on machine learning of a class of logistic regression, which is trained with embryonic stem cells and their differentiated progenitor cells as data set samples, to evaluate the mRNAsi of each LGG sample, following the methodology proposed by Malta et al.39. This method is effective in predicting the stemness of cancer, and the OCLR approach has been previously used for this purpose.
Development and validation of a prognostic stemness model
We used single Cox regression analysis (P < 0.05) to identify hub genes which were statistically associated with overall survival (OS) data in the TCGA dataset(TCGA-LGG, CGGA and gse43378)40. RandomForestSRC R package (v3.1.1) was utilized for the construction of random survival forest prognosis model to narrow the range of key genes41. The criterion for inclusion of genes in the next analysis is that the relative importance is greater than 0.25. After listing all the combinations, we obtained four genes through full subset regression with R package leaps (V3.1) and constructed a signature named stemness-risks core. The risk scores were calculated following the formula: stemness Risk score = , 'n' represents the number of genes in the signature and the value of Expi is the tpm expression profile values of the corresponding genes. We verified the accuracy of survival evaluation via the roc curve of stem cell-related risk score in MMD1(Gravendeel, GSE107850 and Rembrandt)42. We also drew the KM curve of three data sets in TCGA (TCGA-LGG, CGGA, GSE43378) separately. We also verified the protein levels of each gene were retrieved from the Human Protein Atlas (HPA) databasee43.
Nomogram construction and verification
The nomogram was developed based on the prognostic stemness model, and cross-validation was performed to prevent overfitting44. The accuracy of the nomogram was assessed through the examination of a calibration curve., where the 45° line represented the highest degree of prediction potential. Validation of the feasibility and value of clinical applications of nomograms based on decision analysis curves, which was performed using the R package rmda (V1.6)45,46.
Hub gene functional exploration
According to the difference in central gene expression, we conducted an enrichment analysis to determine the GO and KEGG pathway with significant differences and enrichment in the C1 and C2 groups of LGG. The c2.cp.kegg.v7.4.symbols.gmt gene set was annotated, and pathways were considered significantly enriched if they met the following criteria: a gene size (n) of 20%, FDR of 25%, |ES|> 6, and P < 0.05. The standard significantly enriched by KEGG and GO pathways was used in the analysis47. To investigate the differences in pathways and marker genomes differences between the lower and high stem cell related risk score groups of LGG patients, all 644 patients were first distinguished into two groups based on their stemness-risk scores, with the median value used as the threshold. Next, We conducted gene cluster enrichment analysis (GSEA) through the clusterProfiler R package48.
Transcription-level expression validation by RT-qPCR
RT-qPCR is a molecular biology technique used to measure the expression level of RNA49. In this technique, RNA is initially reverse transcribed into complementary DNA (cDNA) through the action of the reverse transcriptase enzyme with the SuperRT RT reagent Kit (Vazyme, China), and then the cDNA is amplified using quantitative polymerase chain reaction (qPCR). Real-time monitoring of the amplification process is carried out using fluorescent dyes, enabling the quantification of the initial amount of RNA. The 2(-Ct) technique is commonly used to analyze the data generated from RT-qPCR, where Ct values are normalized to a housekeeping gene and expressed relative to a control sample. In this study, RNA was extracted from both LGG tumors and adjacent normal peritumoral tissues using RNAiso Plus (Takara, Japan). Correspondingly, the quality of RNA was evaluated before performing RT-qPCR using specific primers and commercial kits50. The primer sequence is shown in supplementary Table S1.
Cell Culture
The HS683 human glioma cell line was sourced from Wuhan Procell Life Science and Technology Co., Ltd. (Procell, China). Cultivation involved the use of Dulbecco’s Modified Eagle’s Medium (DMEM, Servicebio, China) supplemented with 10% fetal bovine serum (FBS, Gibco, USA) and 1% antibiotics mix (Servicebio, China). Cultures were maintained at 5% CO2 and 37 °C, with regular passaging every 1–2 days to ensure exponential growth.
Cell transfection
For transient transfection of siRNAs into glioma cells, the RNATransMate system (Sangon Biotech, China) was employed following the manufacturer’s instructions. The siRNAs were chemically synthesized by Sangon Biotech (China). The siRNA sequences for these genes are shown in supplementary Table S2.
Lentiviral infection of glioma cells
We used a 6-well plate under a biosafety cabinet to seed well-conditioned HS683 cells in the logarithmic growth phase, and continuously cultured them in an incubator until the cell density reached approximately 50%, at which point the infection was performed. Fresh culture medium containing polybrene was prepared in advance. Then, the IDH R132H mutant lentivirus purchased from Heyuan company was used for the infection. At the beginning of the infection, the old medium in each well of the 6-well plate was aspirated, and then the prepared polybrene-containing medium was slowly added along the sidewall of each well. The plate was incubated horizontally in the incubator for 24 h. The next day, the medium was replaced with polybrene-free medium, and the plate was incubated horizontally in the incubator for another 48 h. Finally, the optimal concentration of puromycin was used to select the target cells. A small amount of the selected target cell samples was used for protein extraction, and qPCR was performed to detect the stable knockdown effect. After verification, the cells were expanded using puromycin-containing medium or used for subsequent experiments.
Knockdown of target genes in mutant strains
Under a biosafety cabinet, seed the well-conditioned, selected IDH1 mutant HS683 cells into a 6-well plate. When the cell density reaches 50–60%, perform the transfection operation with the following system: Solution A consists of 250 μL Opti-MEM and 7.5 μL Lipomaster3000, while Solution B consists of 250 μL Opti-MEM and 4 μL siRNA. After preparation, gently mix with a pipette, add Solution B to Solution A dropwise, mix well, and let it stand at room temperature for 10 min before adding it to the 6-well plate. Replace the medium 24 h later.
Cell counting kit‐8 (CCK-8) assay
HS683 cells and HS683 IDH mutant cells were seeded in 96‐well plates at a density of 10^4 cells per well. The control and siRNA groups of the HS683 cell lines were cultured for 24, 48, and 72 h, respectively, with each group having three duplicate wells. Absorbance at 450 nm was measured after incubation for 1–2 h at 37 °C and 5% CO2.
Wound-healing assay
After a 6-h period post-transfection, cells underwent harvesting, centrifugation, and resuspension in serum-free culture. Adjusting the cell density to 5 × 105 cells per well, a scratch was introduced into the cell layers when they reached 90% confluence using a 200 μl sterile pipette tip. Following the removal of the cell culture medium, suspended cells, and debris, each well was refilled with serum-free medium and left to incubate for 24–48 h. Subsequent observations and photographic documentation focused on the cell migration area. The scratch test was then employed to evaluate differences in cell healing ability based on the migration area.
Transwell assay
Matrigel (Corning, USA) was thawed overnight at 4 °C. Subsequently, 100 μl of diluted Matrigel was added to the chamber. The upper chamber received 200 μl of serum-free medium, while the lower chambers were supplemented with 500 μl of 10% FBS DMEM. A total of 3 × 104 harvested cells were seeded in the upper chambers and incubated for an additional 48 h. Following incubation, the invading chamber was removed, and cells on the polycarbonate membrane were fixed with 4% paraformaldehyde and stained with 0.1% crystal violet. Three random fields were selected, and the count of invaded cells was conducted under a microscope. The experiments were replicated in triplicate.
Statistical analysis
All data are completed by GraphPad Prism 5 software and R software (version 4.2.0). Differential expression was evaluated using the Wilcoxon signed-rank test. We use Student's t-test to analyze continuous variables. The statistically significant cut-off was set at 0.05.
Institutional review board statement
This study strictly followed the principles of the Helsinki Declaration, and all normal and glioma samples used in the study were approved by the Ethics Committee of Central South Hospital of Wuhan University (No. 2019048).
Informed consent statement
Patients who provided samples have signed the informed consent form.
Results
Stem cell landscape and characteristics of LGG
Figure 1 reflects the process of this article, illustrating the steps we took to construct and validate the stem cell-related risk prediction model and the associated influencing factors. In this study, we developed patterns of stemness enrichment and constructed signatures indicating the association between stemness and risk (Fig. 1). The scores reflecting the enrichment related to stemness in LGG samples were quantified using the ssGSEA algorithm, and 15 gene sets were screened out using univariate Cox analysis (P < 0.05) to depict a prognostic stemness network. Among the above 15 gene sets, all but Hs iPsc Shats are risk factors for OS (Fig. 2A). And LGG patients were categorized into three distinct clusters (Fig. 2B–E). Cluster 2 LGG patients had worse OS, whereas Cluster 1 LGG patients had the best OS (log-rank P = 7.5e−10, Fig. 2F). And Cluster2 was enriched by stemness gene sets, which were found to decrease patients' OS, such as Hs SC Shats, Hs SC Palmer, and HS ESC Bhattacharya. The enrichment of all gene sets with prognostic significance in C1 is lower than that in C2, except for the protective gene set Hs iPsc Shats (Fig. 3A,B). This observation further corroborates the earlier conclusion.
Additional understanding of the TME landscape was gained ESTIMATE and CIBERSORT analyses, comparing TME components and stromal and immune scores (Fig. 3C,D). Within the three stemness clusters, Cluster 2 demonstrated an immunosuppressive subtype marked by elevated levels of macrophages M1 and activated mast cells, along with increased immune and stromal scores. On the other hand, Cluster 1 displayed higher antitumor TME components, such as B cells naive and plasma cells, and lower immune and stromal scores.
Molecular characteristics, sensitivity to chemotherapy and responsiveness to immunotherapy vary across stemness subtypes
Currently, the conventional treatment strategy for colorectal cancer patients is surgery plus systemic chemotherapy. To evaluate chemosensitivity, we used a prediction algorithm to estimate the IC50 value of several chemotherapeutic drugs and compared them among stem clusters. As shown in Fig. 4A, the IC50 estimates of Bortezomab, Daporinad, Topotecan, and Staurosporine in Cluster 2 were significantly lower, indicating that this subtype may be more sensitive to these drugs. Additionally, Cluster 3 was found to be more sensitive to temozolomide. Using the TIDE algorithm, we evaluated the immune response of the three subtypes. As the TME environment results suggest, Cluster 1 had higher levels of B cells and other killer cell infiltration with a significantly higher proportion of benefits from immunotherapy than the other clusters. However, the activation of immunosuppressive mast cells and the decrease of M1 macrophages and CD8 T cells may be the reasons for the highest non-response ratio in Cluster 2. As shown in Fig. 4C, the Cluster 1, which has the best prognosis, mostly consists of IDH-mutant types, whereas the Cluster 2, which has the worst prognosis, primarily consists of IDH-wildtype. This is consistent with previous research findings. Similarly, the 1p19q co-deletion, which often indicates a better prognosis, is also mostly found in the Cluster 1. In contrast, the Cluster 2, which has the worst prognosis, only rarely exhibits the 1p19q co-deletion.
Utilizing WGCNA to identify the module and hub genes associated with stemness Cluster
The difference in immune therapy response and survival outcomes between the C1 and C2 groups of LGG patients is the greatest. To identify characteristic genes of this subtype in TCGA, we used WGCNA analysis. The optimal soft threshold power β was determined as 5, ensuring scale-free network constructions with an unscaled R2 value of 0.9 (shown in Fig. 5A).
We set a minimum of 300 genes for each module and observed that Eight modules were formed by genes with comparable expression profiles with a clustering dendrogram (Fig. 5B). Among these modules, The green module exhibits a robust positive correlation with gene expression in the Cluster 2 subtype (ME = 0.76, P = 3e−121), while demonstrating the strongest negative correlation with the Cluster 1 subtype. (ME = − 0.67, P = 1e−83) (Fig. 5C). The green module was determined to be the central node and identified 1123 Key genes for subsequent exploration, on the basis of qualification standards of GS > 0.4 and MM > 0.8 (Fig. 5D).
And the GO analysis showed enrichment in DNA-templated DNA replication, chromosome segregation, and DNA replication. Additionally (Fig. 5E), the KEGG analysis revealed that the turquoise module was mainly involved in cell cycle regulation, DNA replication, and the Fanconi anemia pathway (Fig. 5F).
Constructing a prognostic stemness signature and validation
We performed a single Cox regression analysis and identified 73 genes that were substantially linked with OS (P < 0.05). Next, we filtered out genes that were not important for OS using random forest survival analysis. resulting in the selection of 8 genes with relative importance > 0.25 (Fig. 6A). The ROC curve showed that the number of the area under the curve (AUC) in three years was greater than 0.89, which indicated the high accuracy of the selected genes as prognostic markers (Fig. 6B).
The 8 genes were assembled into a set for analysis, and Fig. 6C shows that SMC4, CDCA8, DLGAP5, and ORC1 were selected. We developed this formula to calculate the risk value for each sample: Risk score = (0.065 * expression of SMC4) + (0.040 * expression of CDCA8) + (− 0.032 * expression of DLGAP5) + (− 0.047 * expression of ORC1). Using this score, the stemness-risk score was calculated for each LGG patient. The ROC curve analysis showed that 1-year AUC = 0.73, 2-year AUC = 0.76, and 3-year AUC = 0.78 (MMD1) (Fig. 6D).
The stemness-risk signature's ability to predict overall survival (OS) was validated in other cohorts, including CGGA, TCGA-LGG, and GSE43378. The Kaplan–Meier survival curves demonstrated that patients with high stemness-risk scores had shorter OS time compared to those with lower stemness-risk scores (CGGA: log-rank test, P = 1.5E-5; TCGA-LGG: P = 1.4E−22; GSE43378: log-rank test, P = 0.04) (Fig. 6E–G). Stemness cluster C2 had the highest stemness-risk score, while cluster C1 had the lowest (Fig. 6H). The relationship between cluster, each group's stemness risk score, and status was consistent with previous findings (Fig. 6I).
Correlation between risk characteristics and tumor microenvironment
The heatmap (Fig. 7A) and Spearman correlation diagram (Fig. 7B) showed that macrophage B cells naive, monocyte and mast cell activated had a higher degree in the lower-risk group, indicating that this group may be more responsive to immunotherapy. Conversely, the high stemness-risk group was influenced by T cell CD4 memory rest, M2 macrophages, and M1 macrophages, indicating that immune function was suppressed. The immune score was significantly higher in the high-risk group compared to the lower-risk group. Figure 7C and D show the mean distribution of TME in the two groups. The most important immune response in the lower stemness risk group was T-cell regulation, suggesting differentiation of immune function toward antitumor, while the high stemness risk group had higher levels of immunosuppressive cells, especially M2 macrophages.
Confirmation of the expression of hub genes
The results of qPCR indicated that OCR1 (P < 0.05; Fig. 8A), DLGAP5 (P < 0.05; Fig. 8B), SMC4 (P < 0.05; Fig. 8C), and CDCA8 (P < 0.05; Fig. 8D) , compared to normal tissue, it is highly expressed in glioma tissue. We also investigated the Spearman correlation of mRNA expression levels for the four hub genes. We found that all four genes are strongly positively correlated, which is consistent with our research findings(Fig. 8E–J) . Furthermore, we confirmed the translation expression level of the hub genes using the HPA database, and the prognostic biomarkers were found to be stained strongly or moderately.: SMC4 (Fig. 9A), OCR1 (Fig. 9B), CDCA8 (Fig. 9C), and DLGAP5 (Fig. 9D), indicating that these hub genes were translated more in glioma samples.
Investigating the impact of hub genes on HS683 human glioma cell proliferation and migration
To ascertain the involvement of hub genes in HS683 human glioma cell proliferation and migration, we conducted additional experimental investigations focusing on prognostic markers. Subsequently, we implemented loss-of-function experiments to silence four hub genes in HS683 cells, aiming to elucidate the role of hub genes in LGG progression. The efficacy of siRNA knockdown in HS683 cells was validated using qRT-PCR (Fig. 10A). A significant reduction in cell viability within 72 h was observed through the CCK-8 assay after the knockdown of CDCA8, DLGAP5, ORC1, and SMC4 (Fig. 10B). Furthermore, transwell invasion and scratch assay results demonstrated a substantial decrease in HS683 cell migration and invasion following the knockdown of CDCA8, DLGAP5, ORC1, and SMC4 (Fig. 11). After verifying the significant IDH1 mutation in the HS683 cell line through qPCR,, the aforementioned in vitro experiments were also validated in the IDH mutant HS683 cell line.(Figs. 12, 13). Whether in IDH-mutant HS683 cell lines or IDH-wildtype HS683 cell lines, knocking down the four hub genes significantly reduced the invasive growth capability of glioma cells, further proving that these four genes are oncogenes.
Clinical application of risk score
Our previous findings have indicated that stemness Cluster C2 is more responsive to certain chemical drugs but less responsive to immunotherapy (as shown in Fig. 4B). Additionally, our observations have shown that the high stemness-risk group is more sensitive to drugs that are effective on C2, such as Bortezomab, Daporinad, Topotecan, and Staurosporine (as depicted in Fig. 14A). Afterward, we analyzed the correlation between the immunotherapy response and stemness-risk model using the TIDE analysis. The results shown in Fig. 14B indicate that patients who positively responded to treatment had significantly lower stemness-risk scores compared to non-responders (Wilcoxon test, P < 0.001).
We endeavored to construct a nomogram that would provide doctors with a practical and effective tool for prognosis decision-making. Our univariate Cox analysis revealed that both the risk score (P = 5.8E-21) and age (P = 1.5E-19) were significantly associated with the overall survival (OS) of LGG patients (as illustrated in Fig. 14C). Subsequently, we constructed a nomogram using the stemness risk score and age, as illustrated in Fig. 14D.
Discussion
For decades, numerous publications have described the interactions of cancer cells and the immune system. This is also applicable to other components of the TME. However, only recently has research begun to reveal the specific relationship between CSCs and Immune cells reflecting the regulation of immune function in the TME, leading to the development of rational therapeutic strategies to exploit the CSC-immune axis. This study presents the first systematic bioinformatics analysis revealing the molecular subtyping of stemness features in a large cohort of LGG patients, discussing the prognosis, TME infiltration patterns, and treatment response of LGG patients with different subtypes, and describing the gene expression associated with CSC subtypes. These findings offer new strategies for guiding more effective patient-specific treatments.
With the unsupervised clustering identification predicated on 26 stemness gene sets' ssGSEA scores for three different stemness subtypes, the C1 subtype is characterized by a high enrichment level of CSCs gene sets which support good prognosis, and an anti-tumor TME pattern such as enriched immature B cells and plasma cell infiltration, which makes it more sensitive to immunotherapy. In contrast, cluster 2 shows a high enrichment of CSCs gene sets which indicates poor prognosis. Cluster C2 showed higher infiltration of M1 and M0 macrophages and stromal scores, indicating that C2 is an immunosuppressive phenotype that may be less responsive to immunotherapy. In order to explore the genomic characteristics of each cell subtype, we conducted a comprehensive WGCNA analysis to screen modules associated with stem cell properties. Among them, the green module has the highest positive correlation with cluster C3 and the highest negative correlation with cluster C2. It is considered the most critical module and will be included in the future study. GO analysis showed that the primary biological process of the green module was the cell cycle. Previous research results show that dysregulation of stem cell cycle regulation may lead to normal tissue carcinogenesis, and in the progression of glioma, disruption of the cell cycle can also lead to the continuous deteriorating of glioma51,52. Then, through subset regression and random forest survival analysis, we identified prognostic hub genes within the green module and constructed a four-gene stemness signature based on these hub genes to quantitatively evaluate the prognostic risk of samples. Then we concluded that the C2 stemness subtype exhibited a higher risk score than the other two groups which means a poor prognosis.
Our analysis revealed that LGG patients with lower-risk characteristics have antitumor immunity, high infiltration of monocytes, and both activated and quiescent mast cells. On the other hand, high-risk LGG patients have an abundance of immune-suppressive M1/M2 macrophages and CD4 memory T cells. Among them, M2 macrophage has the highest infiltration content in the high-risk group, indicating their role in enhancing tumor invasiveness and mediating immunosuppression. Impaired CD4+ T effector memory cell function was associated with the proliferation of myeloid-derived suppressor cells (MDSCs) in glioma patients53. MDSCs can suppress other immune cells in the tumor microenvironment by inducing the expression of microRNA-101 to promote the CSC phenotype54. glioma CSCs produce cytokines, including colony-stimulating factor, TGFβ, and macrophage-inhibitory cytokine, to promote M2 macrophage polarization and MDSC recruitment55,56,57, leading to immune suppression and native M2 phenotype polarization. In conclusion, CSCs promote M2 macrophage polarization and MDSC recruitment by affecting the normal expression of MHC-I molecule and the release of immune-suppressive cytokines, facilitating the formation of a tumor immune-suppressive microenvironment58.
In terms of chemotherapy, in addition to temozolomide, which is commonly used for glioma patients59, we have found that several drugs, including bortezomib, daporinad, topotecan, and staurosporine, show sensitivity to high-risk gliomas. If delivered directly into the tumor through infusion, they will be delivered directly to the brain tumor, providing multiple possibilities for the effective treatment of glioma60. In terms of immunotherapy guidance, we conducted TIDE analysis, and LGG patients with lower stemness risk scores often have higher immune therapy responses. This confirms the predictive validity of our stemness model.
The OCLR method, developed for the mRNAsi using datasets of pluripotent stem cells and their progenitor cells61,62, plays a crucial role in our analysis. This index evaluates the activity of CSCs and malignant cell dedifferentiation in approximately 644 LGG samples from TCGA63. Similar to previous research results, our research shows that LGG patients who had higher mRNAsi values have better prognoses64,65 due to their strong abilities of self-renewal, metastasis, and treatment resistance, supported by increasing evidence. However, there is considerable heterogeneity in the markers and models of CSCs among different cancers66. In our study, we utilized a multitude of CSC gene sets to identify three distinct stemness clusters. We subsequently developed a stemness-related prognostic model that exhibited a strong negative correlation with mRNAsi, with a correlation coefficient of -0.71. We also observed that the high-risk stemness group was enriched with markers related to the cell cycle, oxidative phosphorylation, local adhesion, pancreatic cancer, homologous recombination, and progesterone-mediated cancer cell maturation, as determined by GSEA analysis. Interestingly, defects in the control of the cell cycle can lead to tumorigenesis through metabolic remodeling and immune escape.67,68,69.
Of the four stemness model genes identified in this study, SMC4 regulates the cell cycle and impacts the proliferation, migration, and invasion of glioma70,71. CDCA8 is a cell cycle regulator and tumor promoter that produces a marked effect in various malignant tumors72. For example, it regulates the migration of glioma cells and inhibits cell apoptosis73. Knockdown of DLGAP5 can significantly inhibit cell proliferation and simultaneously induce G2/M phase cell cycle arrest and cell apoptosis74. Chen et al. reported that tumor samples with lower expression of ORC1 would reduce the expression of Bcl-2, block the cell cycle, and increase the apoptosis rate75.
This study has some limitations. Although it is based on multiple datasets, it lacks further experimental verification and detailed data on immunotherapy and chemotherapy. Follow-up studies should carry out more experiments to clarify the potential molecular mechanism of stem cells acting on LGG. Furthermore, we need to use more of our clinical data to verify it soon. The specific mechanism target of CSCs is not clear, and further mechanism-related experiments are necessary to verify the TME-related conclusions to better serve the treatment progress of LGG.
Conclusion
Our study illuminates the stemness characteristics of gliomas, which lays the foundation for developing therapeutic approaches targeting CSCs and enhancing the efficacy of current immunotherapies. By identifying the stemness subtype and its correlation with prognosis and TME patterns in glioma patients, we aim to advance the development of personalized treatments, enhancing the ability to predict and improve overall patient prognosis.
Data availability
All the website addresses of the databases used in this article are provided in the text.
References
Ye, S. et al. Identification of an immune-related prognostic signature for glioblastoma by comprehensive bioinformatics and experimental analyses. Cells 11(19), 3000 (2022).
Lombardi, G. et al. Clinical management of diffuse lower-grade gliomas. Cancers 12(10), 3008 (2020).
Garcia, C. R. et al. Comprehensive evaluation of treatment and outcomes of lower-grade diffuse gliomas. PLoS ONE 13(9), e203639 (2018).
Li, J. et al. Hypoxic glioma stem cell-derived exosomes containing Linc01060 promote progression of glioma by regulating the MZF1/c-Myc/HIF1alpha axis. Cancer Res. 81(1), 114–128 (2021).
Wang, X. et al. Sequential fate-switches in stem-like cells drive the tumorigenic trajectory from human neural stem cells to malignant glioma. Cell Res. 31(6), 684–702 (2021).
Pardal, R., Clarke, M. F. & Morrison, S. J. Applying the principles of stem-cell biology to cancer. Nat. Rev. Cancer 3(12), 895–902 (2003).
Boyd, N. H. et al. Glioma stem cells and their roles within the hypoxic tumor microenvironment. Theranostics 11(2), 665–683 (2021).
Liu, K. et al. Hypoxia-induced GLT8D1 promotes glioma stem cell maintenance by inhibiting CD133 degradation through N-linked glycosylation. Cell Death Differ. 29(9), 1834–1849 (2022).
Chen, C. et al. Intracavity generation of glioma stem cell-specific CAR macrophages primes locoregional immunity for postoperative glioblastoma therapy. Sci. Transl. Med. 14(656), n1128 (2022).
Dalerba, P. et al. Phenotypic characterization of human colorectal cancer stem cells. Proc. Natl. Acad. Sci. USA 104(24), 10158–10163 (2007).
Abbasian, M. et al. The most reliable surface marker for the identification of colorectal cancer stem-like cells: A systematic review and meta-analysis. J. Cell Physiol. 234(6), 8192–8202 (2019).
Lottaz, C. et al. Transcriptional profiles of CD133+ and CD133− glioblastoma-derived cancer stem cell lines suggest different cells of origin. Cancer Res. 70(5), 2030–2040 (2010).
Han, J. et al. TGF-beta signaling and its targeting for glioma treatment. Am. J. Cancer Res. 5(3), 945–955 (2015).
Alvero, A. B. et al. Stem-like ovarian cancer cells can serve as tumor vascular progenitors. Stem Cells 27(10), 2405–2413 (2009).
Furnari, F. B. et al. Malignant astrocytic glioma: Genetics, biology, and paths to treatment. Genes Dev. 21(21), 2683–2710 (2007).
Wagner, G. P., Kin, K. & Lynch, V. J. Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory Biosci. 131(4), 281–285 (2012).
Zhao, Z. et al. Chinese glioma genome atlas (CGGA): A comprehensive resource with functional genomic data from chinese glioma patients. Genomics Proteomics Bioinformatics 19(1), 1–12 (2021).
Zhang, K. et al. Clinical management and survival outcomes of patients with different molecular subtypes of diffuse gliomas in China (2011–2017): A multicenter retrospective study from CGGA. Cancer Biol. Med. 19(10), 1460–1476 (2022).
Wang, Y. et al. Localizing seizure-susceptible brain regions associated with lower-grade gliomas using voxel-based lesion-symptom mapping. Neuro Oncol. 17(2), 282–288 (2015).
Liu, X. et al. A radiomic signature as a non-invasive predictor of progression-free survival in patients with lowerer-grade gliomas. Neuroimage Clin. 20, 1070–1077 (2018).
Kawaguchi, A. et al. Gene expression signature-based prognostic risk score in patients with glioblastoma. Cancer Sci. 104(9), 1205–1210 (2013).
Gravendeel, L. A. et al. Intrinsic gene expression profiles of gliomas are a better predictor of survival than histology. Cancer Res. 69(23), 9065–9072 (2009).
Gao, Y. et al. Expression-based intrinsic glioma subtypes are prognostic in lower-grade gliomas of the EORTC22033-26033 clinical trial. Eur. J. Cancer 94, 168–178 (2018).
Madhavan, S. et al. Rembrandt: Helping personalized medicine become a reality through integrative translational research. Mol. Cancer Res. 7(2), 157–167 (2009).
Bowman, R. L. et al. GlioVis data portal for visualization and analysis of brain tumor expression datasets. Neuro Oncol. 19(1), 139–141 (2017).
Gautier, L. et al. affy–analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20(3), 307–315 (2004).
Leek, J. T. et al. The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics 28(6), 882–883 (2012).
Pinto, J. P. et al. StemChecker: A web-based tool to discover and explore stemness signatures in gene sets. Nucleic Acids Res. 43(W1), W72–W77 (2015).
Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: Gene set variation analysis for microarray and RNA-seq data. BMC Bioinformatics 14, 7 (2013).
Wilkerson, M. D. & Hayes, D. N. ConsensusClusterPlus: A class discovery tool with confidence assessments and item tracking. Bioinformatics 26(12), 1572–1573 (2010).
Nowak, B. et al. Reproduction indicators related to litter size and reproduction cycle length among sows of breeds considered maternal and paternal components kept on medium-size farms. Animals 10(7), 1164 (2020).
Newman, A. M. et al. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 12(5), 453–457 (2015).
Yoshihara, K. et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 4, 2612 (2013).
Maeser, D., Gruener, R. F. & Huang, R. S. oncoPredict: An R package for predicting in vivo or cancer patient drug response and biomarkers from cell line screening data. Brief. Bioinform. https://doi.org/10.1093/bib/bbab260 (2021).
Yang, W. et al. Genomics of drug sensitivity in cancer (GDSC): A resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 41(1), D955–D961 (2013).
Geeleher, P., Cox, N. J. & Huang, R. S. Clinical drug response can be predicted using baseline gene expression levels and in vitro drug sensitivity in cell lines. Genome Biol. 15(3), R47 (2014).
Langfelder, P. & Horvath, S. WGCNA: An R package for weighted correlation network analysis. BMC Bioinform. 9, 559 (2008).
Ivliev, A. E., ’t Hoen, P. A. & Sergeeva, M. G. Coexpression network analysis identifies transcriptional modules related to proastrocytic differentiation and sprouty signaling in glioma. Cancer Res. 70(24), 10060–10070 (2010).
Malta, T. M. et al. Machine learning identifies stemness features associated with oncogenic dedifferentiation. Cell 173(2), 338–354 (2018).
Koletsi, D. & Pandis, N. Survival analysis, part 3: Cox regression. Am. J. Orthod. Dentofac. Orthop. 152(5), 722–723 (2017).
Rigatti, S. J. Random forest. J. Insur. Med. 47(1), 31–39 (2017).
Cao, R. & Lopez-de-Ullibarri, I. ROC curves for the statistical analysis of microarray data. Methods Mol. Biol. 1986, 245–253 (2019).
Colwill, K. & Graslund, S. A roadmap to generate renewable protein binders to the human proteome. Nat. Methods 8(7), 551–558 (2011).
Park, S. Y. Nomogram: An analogue tool to deliver digital knowledge. J. Thorac. Cardiovasc. Surg. 155(4), 1793 (2018).
Vickers, A. J. & Elkin, E. B. Decision curve analysis: A novel method for evaluating prediction models. Med. Decis. Making 26(6), 565–574 (2006).
Vickers, A. J. et al. Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers. BMC Med. Inform. Decis. Mak. 8, 53 (2008).
Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102(43), 15545–15550 (2005).
Yu, G. et al. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 16(5), 284–287 (2012).
Rocha, D. et al. Gene expression analysis in bacteria by RT-qPCR. Methods Mol. Biol. 2065, 119–137 (2020).
Du, S. H. et al. Application of RT-qPCR in the study of forensic pathology. Fa Yi Xue Za Zhi 33(5), 526–531 (2017).
Ren, D. et al. FAM84B promotes the proliferation of glioma cells through the cell cycle pathways. World J. Surg. Oncol. 20(1), 368 (2022).
Mens, M. & Ghanbari, M. Cell cycle regulation of stem cells by microRNAs. Stem Cell Rev. Rep. 14(3), 309–322 (2018).
Dubinski, D. et al. CD4+ T effector memory cell dysfunction is associated with the accumulation of granulocytic myeloid-derived suppressor cells in glioblastoma patients. Neuro Oncol. 18(6), 807–818 (2016).
Cui, T. X. et al. Myeloid-derived suppressor cells enhance stemness of cancer cells by inducing microRNA101 and suppressing the corepressor CtBP2. Immunity 39(3), 611–621 (2013).
Najafi, M., Farhood, B. & Mortezaee, K. Cancer stem cells (CSCs) in cancer progression and therapy. J. Cell Physiol. 234(6), 8381–8395 (2019).
Wang, Y. et al. M2 macrophage microvesicle-inspired nanovehicles improve accessibility to cancer cells and cancer stem cells in tumors. J. Nanobiotechnology 19(1), 397 (2021).
Erin, N. et al. Tumor microenvironment and epithelial mesenchymal transition as targets to overcome tumor multidrug resistance. Drug Resist. Updat. 53, 100715 (2020).
Di Tomaso, T. et al. Immunobiological characterization of cancer stem cells isolated from glioblastoma patients. Clin. Cancer Res 16(3), 800–813 (2010).
Chua, J., Nafziger, E. & Leung, D. Evidence-based practice: Temozolomide beyond glioblastoma. Curr. Oncol. Rep. 21(4), 30 (2019).
Tang, J. H. et al. Bortezomib inhibits growth and sensitizes glioma to temozolomide (TMZ) via down-regulating the FOXM1-Survivin axis. Cancer Commun. 39(1), 81 (2019).
Daily, K. et al. Molecular, phenotypic, and sample-associated data to describe pluripotent stem cell lines and derivatives. Sci. Data 4, 170030 (2017).
Salomonis, N. et al. Integrated genomic analysis of diverse induced pluripotent stem cells from the progenitor cell biology consortium. Stem Cell Rep. 7(1), 110–125 (2016).
Barbee, R. W. & Trippodo, N. C. The contribution of atrial natriuretic factor to acute volume natriuresis in rats. Am. J. Physiol. 253(6 Pt 2), F1129–F1135 (1987).
Wang, W. et al. A novel cancer stemness-related signature for predicting prognosis in patients with colon adenocarcinoma. Stem Cells Int. 2021, 7036059 (2021).
Lu, Y. et al. Characteristic analysis of featured genes associated with stemness indices in colorectal cancer. Front. Mol. Biosci. 7, 563922 (2020).
Magee, J. A., Piskounova, E. & Morrison, S. J. Cancer stem cells: Impact, heterogeneity, and uncertainty. Cancer Cell 21(3), 283–296 (2012).
Phan, T. G. & Croucher, P. I. The dormant cancer cell life cycle. Nat. Rev. Cancer 20(7), 398–411 (2020).
Liu, J., Peng, Y. & Wei, W. Cell cycle on the crossroad of tumorigenesis and cancer therapy. Trends Cell Biol. 32(1), 30–44 (2022).
Icard, P. et al. Interconnection between metabolism and cell cycle in cancer. Trends Biochem. Sci. 44(6), 490–501 (2019).
Wei-Shan, H., Amit, V. C. & Clarke, D. J. Cell cycle regulation of condensin Smc4. Oncotarget 10(3), 263–276 (2019).
You, A. et al. MiR-433-3p restrains the proliferation, migration and invasion of glioma cells via targeting SMC4J]. Brain Res. 1767, 147563 (2021).
Gu, J. et al. CDCA8/SNAI2 complex activates CD44 to promote proliferation and invasion of pancreatic ductal adenocarcinoma. Cancers 14(21), 5434 (2022).
Wang, X. et al. Double-targeting CDCA8 and E2F1 inhibits the growth and migration of malignant glioma. Cell Death Dis. 12(2), 146 (2021).
Zhang, H. et al. Knockdown of DLGAP5 suppresses cell proliferation, induces G(2)/M phase arrest and apoptosis in ovarian cancer. Exp. Ther. Med. 22(5), 1245 (2021).
Chen, X. et al. Up-regulated lncRNA XIST contributes to progression of cervical cancer via regulating miR-140-5p and ORC1. Cancer Cell Int. 19, 45 (2019).
Acknowledgements
We express our gratitude to Jincao Chen at Zhongnan Hospital, Wuhan University, for the site support and technical guidance.
Funding
This research was funded by echnology Innovation Special Major Project of Hubei Province 2022BAC003.
Author information
Authors and Affiliations
Contributions
All authors reviewed the references disclosed from the literature search. S.Y. and B.W. the bioinformatics section and writing work of the article, B.Y. and Y.Y. assisted in completing the experimental part of the article. Y.L. and M.F. assisted in constructing the IDH mutant cell line and conducting in vitro experiments for validation. W.W. and X.L. provided writing guidance for the article. C.L. and W.Z. supervised and promoted the entire article. All authors commented on the draft manuscript. All authors approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ye, S., Yang, B., Yang, L. et al. Stemness subtypes in lower-grade glioma with prognostic biomarkers, tumor microenvironment, and treatment response. Sci Rep 14, 14758 (2024). https://doi.org/10.1038/s41598-024-65717-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-65717-7
- Springer Nature Limited