Abstract
Background
Colon cancer features strong heterogeneity and invasiveness, with high incidence and mortality rates. Recently, RNA modifications involving m6A, m5C, and m1A play a vital part in tumorigenesis and immune cell infiltration. However, integrated analysis among various RNA modifications in colon cancer has not been performed.
Methods
RNA-seq profiling, clinical data and mutation data were obtained from The Cancer Genome Atlas and Gene Expression Omnibus. We first explored the mutation status and expression levels of m6A/m5C/m1A regulators in colon cancer. Then, different m6A/m5C/m1A clusters and gene clusters were identified by consensus clustering analysis. We further constructed and validated a scoring system, which could be utilized to accurately assess the risk of individuals and guide personalized immunotherapy. Finally, m6A/m5C/m1A regulators were validated by immunohistochemical staining and RT-qPCR.
Results
In our study, three m6A/m5C/m1A clusters and gene clusters were identified. Most importantly, we constructed a m6A/m5C/m1A scoring system to assess the clinical risk of the individuals. Besides, the prognostic value of the score was validated with three independent cohorts. Moreover, the level of the immunophenoscore of the low m6A/m5C/m1A score group increased significantly with CTLA-4/PD-1 immunotherapy. Finally, we validated that the mRNA and protein expression of VIRMA and DNMT3B increased in colon cancer tissues.
Conclusions
We constructed and validated a stable and powerful m6A/m5C/m1A score signature to assess the survival outcomes and immune infiltration characteristics of colon cancer patients, which further guides optimization of personalized treatment, making it valuable for clinical translation and implementation.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Colon cancer is gastrointestinal tumor featuring obvious heterogeneity and aggressiveness (Tauriello et al. 2017). According to global cancer statistics, there are 1,148,515 new individuals and 576,858 deaths related to colon cancer, and colon cancer ranks fifth in incidence and mortality rate among all cancers (Sung et al. 2021). Although there has been accelerated progress in diagnosis and treatment technology, including colonoscopy screening and surgical procedures, the 5-year survival rates of patients with colon cancer remain low (Liu et al. 2022b). Many colon cancers are already in advanced stages when they are detected. The main reasons for this are the high cost of colon cancer screening and the lack of sensitive screening indicators (He et al. 2022). Therefore, constructing a clinical signature to describe the characteristics of colon cancer and predict the overall survival of patients is imperative to further develop individualized treatment approaches.
There are different types of epigenetic variations, including chromatin reshaping, noncoding RNA (long noncoding RNAs and microRNAs) effects, and RNA modification, which are involved in key signaling pathways and can act as clinically relevant biomarkers in cancers (Jung et al. 2020). Currently, more than 170 kinds of modification types have been found in intracellular RNA, including mRNAs, tRNAs, rRNAs, and ncRNAs (Shi et al. 2020). These modifications contribute to pre-mRNA splicing, export, and stability. There are different kinds of RNA modifications, including N6-methyladenosine (m6A), 5-methylcytosine (m5C), and N1-methyladenosine (m1A) (Xie et al. 2020). These modifications are mediated by regulators including writers, readers, and erasers. The most well characterized RNA modification is m6A. The methyltransferase-like METTL3/METTL14 complex (writers) plays a catalytic role in the deposition of m6A on mRNA (Wang et al. 2016a; Wang et al. 2016b). M6A can be reversed by two different RNA demethylases: FTO and ALKBH5 (Jia et al. 2011; Zheng et al. 2013). In addition, m6A is recognized by various reader proteins, including the YTH domain families (Li et al. 2014). Similar to DNA, RNA modifications could occur at position 5 of cytidine residues, which is known as m5C (Edelheit et al. 2013). The functions of m5C vary in various RNA types. In tRNA, m5C modulates the stability and organization of RNA, contributing to translation accuracy. However, in rRNA, loss of methylcytosine maintains translational readthrough of stop codons. Most importantly, in mRNA, the methyltransferases of m5C are the NSUN family proteins (Yang et al. 2017). The Aly/REF export factor (ALYREF) serves as a reader, maintaining the nuclear export of mature mRNAs (Bohnsack et al. 2019). RNA demethylases of m5C are members of the tet methylcytosine dioxygenase family. Recently, there has been an increased focus on m1A, generated by methylation of adenosine at position 1 (Dominissini et al. 2016). The TRM6/61 complex is a methyltransferase catalyzing m1A formation (Chujo and Suzuki 2012; Vilardo et al. 2012). Similar to m6A, YTH family members can regulate m1A by binding to mRNA sites (Dai et al. 2018). In addition, m1A is erased from tRNAs by ALKBH1 and ALKBH3 and mRNA only by ALKBH3 (Liu et al. 2016). There is a close correlation between m1A and cell proliferation in gastrointestinal cancers. Consequently, m6A, m5C, and m1A regulators in colon cancer are worth further exploration.
However, previous studies have only investigated single RNA modifications, whose established signatures have some limitations owing to various modifications interacting with each other and working together in colon cancer (Qi et al. 2022). Therefore, we assessed cross-talk among various RNA modifications to construct a novel scoring system that evaluates the survival of patients, further paving the way for optimizing personalized treatment. In our work, we first investigated mutation traits of 48 m6A/m5C/m1A regulators in colon cancer. Next, we identified m6A/m5C/m1A clusters using unsupervised cluster analysis. Survival analysis, immune infiltration analysis, and biological function exploration were performed among different m6A/m5C/m1A clusters. Then, we used differential analysis and univariate Cox analysis to obtain the prognostic differentially expressed genes (DEGs). Moreover, gene clusters were constructed, and the corresponding analysis was also performed in different gene clusters. Considering the differential expression and complexity of m6A/m5C/m1A regulators, we constructed a scoring system and validated the prognostic value of the score with three independent cohorts. Subsequently, we analyzed survival, immune infiltration, tumor mutation status, clinical characteristic and the immunotherapy response. Finally, we validated m6A/m5C/m1A regulators at the protein and mRNA expression levels, which further identified that VIRMA and DNMT3B played vital roles in colon cancer.
Materials and methods
Data collection
RNA-seq transcription profiles, clinical data and mutation data were obtained from TCGA (https://portal.gdc.cancer.gov/). 473 colon cancer samples and 41 normal tissue samples were obtained. Gene expression profiles of GSE39582 dataset were obtained from GEO (https://www.ncbi.nlm.nih.gov/geo/). GSE39582 dataset contains 566 cancer samples and 19 normal samples, follow-up survival analysis and model validation are not required for normal samples, so they should be excluded. To process gene expression data from the TCGA and GSE39582, which were measured by different platforms, the Fragments Per Kilobase of exon model per Million mapped fragments (FPKM) format data were downloaded using the “fpkm” function in R to convert them to the transcripts per kilobase million (TPM) for the next analysis (Wagner et al. 2012). For validating the score efficacy, the following three GEO cohorts were employed: GSE29621 (n = 65), GSE17536 (n = 177), and GSE17537 (n = 55). The above cohorts with complete patient information including clinicopathological parameters and follow-up records. Then, the copy number variation (CNV) of colon cancer patients was abstracted from UCSC Xena (http://xena.ucsc.edu/) (Goldman et al. 2020). We collected 23 m6A, 15 m5C, and 10 m1A regulators from the previous literature (Haruehanroengra et al. 2020; Song et al. 2022). Detailed information on the m6A/m5C/m1A regulators is shown in Supplemental Table 1.
M6A/m5C/m1A regulators in the cancer genome map
To understand the functional mechanism of m6A/m5C/m1A regulators more clearly, we first constructed a functional pattern map of m6A/m5C/m1A regulators in RNA metabolism. Next, the mutation trait, CNV data, gene location, expression levels, and interaction relationships of 48 m6A/m5C/m1A regulators in colon cancer were analyzed with the following R packages: maftools, RCircos, limma, reshape2, ggpubr, igraph, psych, and RColorBrewer.
Survival analysis of m6A/m5C/m1A regulators in colon cancer patients
The predictive values of m6A/m5C/m1A regulators in colon cancer were assessed using Kaplan–Meier (K-M) and univariate Cox regression analyses. The R packages “survival” and “survminer” were used in the process. The results from these analyses are shown with the K–M curve.
Consensus clustering analysis of m6A/m5C/m1A regulators and corresponding functional analysis
Cluster clustering analysis is a method of subsampling from a set of sample data (e.g., a microarray) and identifying clusters with a specified number of clusters (k). For each k, pairwise consistency values, i.e., the proportion of occurrences of two samples in the same subsample to the same cluster, are calculated and stored in a symmetric consensus matrix. We performed unsupervised clustering analysis with the R package “ConsensusClusterPlus” (Wilkerson and Hayes 2010). The optimal classification was identified based on the cumulative distribution function (CDF) curve and the correlations between clusters. The overall survival (OS) of m6A/m5C/m1A clusters was assessed by K–M curves. Moreover, to analyze the levels of immune cells in tumor samples among m6A/m5C/m1A clusters, we performed single-sample gene set enrichment analysis (ssGSEA) with the R package “GSEABase” (Hanzelmann et al. 2013). The heatmap was constructed with the R package “pheatmap”, and the differences were identified using the R package “limma”. To further analyze the function of m6A/m5C/m1A clusters in biological pathways, we utilized gene set variation analysis (GSVA) (Foroutan et al. 2018).
Analysis of DEGs among m6A/m5C/m1A clusters
We first obtained m6A/m5C/m1A regulator-related DEGs between different m6A/m5C/m1A clusters with the R package “limma, VennDiagram” (Ritchie et al. 2015), and the significance filtering criterion was an adjusted p value < 0.001. To explore potential molecular functions and associated biological pathways, we analyzed m6A/m5C/m1A regulator-related DEGs with GO and KEGG analysis by the R packages “clusterProfiler” and “enrichplot” (Yu et al. 2012). The above results were visualized with bubble charts. Next, we further utilized univariate Cox analysis to obtain the prognostic DEGs. Then, the consensus clustering method was used to identify proper gene clusters, and a K–M curve was constructed to assess the OS for patients in different gene clusters. Finally, we also analyzed the levels of m6A/m5C/m1A regulators among different gene clusters and generated a heatmap to explore the relationship between clinical characteristics and gene clusters.
Construction and validation of the m6A/m5C/m1A scoring system by principal component analysis (PCA)
Based on the above results, we obtained the prognostic DEGs associated with m6A/m5C/m1A clusters. Then, we used PCA to analyze the prognostic DEGs, thus yielding the m6A/m5C/m1A score system. The m6A/m5C/m1A score was calculated with the following formula: m6A/m5C/m1Ascore = (PC1i + PC2 i), where i represents the levels of m6A/m5C/m1A-related genes (Sotiriou et al. 2006; Zeng et al. 2019). Principal components 1 and 2 were obtained to construct the m6A/m5C/m1A score (Sotiriou et al. 2006). A Sankey diagram was constructed to explore the relationship among m6A/m5C/m1A clusters, gene clusters, m6A/m5C/m1A scores, and fustats. Moreover, the Kaplan‒Meier curve was constructed and validated to assess the OS for the m6A/m5C/m1A score, and correlation analysis between the m6A/m5C/m1A scores and immune cells was explored with the R package “corrplot”. We also investigated the levels of the m6A/m5C/m1A score in different m6A/m5C/m1A clusters and gene clusters with the Kruskal–Wallis test.
Differences in clinical traits and immunotherapy among the m6A/m5C/m1A score groups
We first analyzed the mutation characteristics of colon cancer patients in different m6A/m5C/m1A score groups using the waterfall chart with the R package “maftools”. Moreover, K–M analysis was performed to investigate the differences in survival outcomes and clinical characteristics (age, sex, stage) between different score groups. Considering the clinical value of tumor mutation burden (TMB) in providing a theoretical direction for immunotherapy (Mariathasan et al. 2018), stratified survival analysis was used to explore the correlation between TMB and the score. In terms of the response to anti-CTLA-4 and anti-PD-1, immunophenoscore (IPS) is a reliable predictor (Charoentong et al. 2017). There were four sample types, including CTLA-4 (pos/neg)/PD-1 (pos/neg), according to the IPS. Pos and neg indicate positive expression and negative expression, respectively. Therefore, we explored the IPS in the m6A/m5C/m1A score groups with the R package “ggpubr”. Finally, we also analyzed the m6A/m5C/m1A score in samples with different MSI types.
Validation of m6A/m5C/m1A regulators with the human protein atlas and real-time quantitative PCR (RT-qPCR)
The data, including immunohistochemical staining data for colon cancer and normal tissues, were obtained from the HPA (http://www.proteinatlas.org/) (Liu et al. 2022a). We first validated differentially expressed m6A/m5C/m1A regulators at the protein expression level. There were four degrees of staining: high, medium, low, and not detected. Finally, we further validated hub m6A/m5C/m1A regulators at the mRNA expression level with RT-qPCR. TRIzol reagent (TaKaRa) was used to extract RNA based on the corresponding protocol. Reverse transcription was then performed with a TaqMan cDNA synthesis kit (TaKaRa). The SYBR Green PCR Kit (TaKaRa) was used for RT‒qPCR. The detailed experimental procedure has been described previously (Zeng et al. 2022). The main sequences are shown in Supplemental Table 2. The 2−ΔΔCt algorithm was performed to calculate gene expression levels. Ten pairs of colon cancer and normal colon tissues were provided by Wujin Hospital Affiliated with Jiangsu University.
Statistical analysis
Statistical analysis was performed using R 3.6.3 software or GraphPad Prism 9.0. The Wilcoxon and Kruskal–Wallis test were used for different group comparisons. Log-rank tests were employed in the Kaplan–Meier analysis. Spearman’s test was used in the immune cell infiltration analysis. A p value < 0.05 was considered statistically significant. *p < 0.05, **p < 0.01, ***p < 0.001. All experiments were repeated three times independently.
Results
The expression and variation of m6A/m5C/m1A regulators in colon cancer
A flow chart of our work was shown in Fig. 1. In our study, we first obtained 48 m6A/m5C/m1A regulators from a previous study. We constructed a functional pattern map of m6A/m5C/m1A regulators in RNA metabolism, which helped us understand the functional mechanism of m6A/m5C/m1A regulators more clearly (Fig. 2A). From the pattern diagram, we concluded that m6A/m5C/m1A regulators interacted with each other and jointly participated in RNA metabolism, including the splicing, exportation, stabilization, degradation, and translation of various RNAs. Therefore, we studied the value of all RNA regulators, including m6A, m5C, and m1A regulators, rather than individual RNA regulators. After obtaining the corresponding mutation data, we found that 162 (36.24%) of the 447 samples showed mutations in m6A/m5C/m1A regulators (Fig. 2B). ZC3H13, YTHDC2, TET1, TET3, VIRMA, and RBM15 had high mutation rates (> 5%). The top three mutation types were as follows: missense mutation, multihit mutation and frame shift deletion. Moreover, CNV analysis implied that ZC3H13, TET1, TRMT6, ALKBH1, IGFBP2, METTL16, RBM15, LRPPRC and HNRNPC showed extensive copy number loss, while the rest of the m6A/m5C/m1A regulators showed widespread CNV amplification (Fig. 2C). The circos plot showed the location of CNVs for the m6A/m5C/m1A regulators on the chromosome (Fig. 2D). In addition, we further investigated the levels of m6A/m5C/m1A regulators in colon cancer and found that most m6A/m5C/m1A regulators had higher expression levels in colon cancer tissue versus normal tissue independent of CNV, and 12 m6A/m5C/m1A regulators that showed a statistically significant difference between normal and tumor tissue (Fig. 2E). Finally, the correlations among 48 m6A/m5C/m1A regulators and their prognostic value in colon cancer were found (Fig. 2F). The above results indicated that the complexity of m6A/m5C/m1A regulators and tumor heterogeneity played vital roles in the occurrence and progression of colon cancer.
The overall survival curve of m6A/m5C/m1A regulators in colon cancer patients
To explore the predictive value of m6A/m5C/m1A regulators in colon cancer patients, we performed Kaplan‒Meier analysis. We found that low levels of m6A/m5C/m1A regulators, including ALKBH5, IGFBP3, DNMT3A, FTO, HNRNPC, METTL3, NSUN4, NSUN5, NSUN6, TRMT6, WTAP, and ZC3H13, indicated a favorable prognosis in colon cancer patients (Fig. 3), while high levels of m6A/m5C/m1A regulators, including NOP2, NSUN2, NSUN7, RBM15B, TET2, TRMT61B, ALYREF, LRPPRC, YTHDC2, and YTHDF1, indicated a favorable prognosis in colon cancer patients (Figure S1).
Identification of m6A/m5C/m1A clusters
Three m6A/m5C/m1A clusters were classified with unsupervised clustering analysis. According to the relative area change of the CDF curve and the correlations between clusters, matrix k = 3 was the optimal number of clusters (Fig. 4A–C). Finally, three clusters were identified: m6A/m5C/m1A cluster A (n = 230), m6A/m5C/m1A cluster B (n = 504), and m6A/m5C/m1A cluster C (n = 233). According to the survival analysis, Fig. 4D shows that m6A/m5C/m1A cluster C had a better survival outcome, while m6A/m5C/m1A cluster A had a poor survival outcome (p = 0.029). Moreover, we further explored the differential levels of immune cells among the clusters. The results showed that the levels of most immune cells were elevated in the three m6A/m5C/m1A clusters and that the m6A/m5C/m1A cluster A had higher levels of immune cell infiltration than the other clusters (Fig. 4E). We also performed ssGSEA of immune functions and supplemented with the levels of other immune infiltrating cells in the three m6A/m5C/m1A clusters (Fig. S2). The above results better explained why m6A/ m5C/m1A cluster A leads to poorer survival outcome. In addition, a correlation heatmap, including the expression of m6A/m5C/m1A regulators, project, patient clinical characteristics (survival time, status, grade, stage, and age) and m6A/m5C/m1A clusters, was plotted. The results indicated that the expression of m6A/m5C/m1A regulators was higher in clusters B and C (Fig. 4F). Finally, to assess the biological functions of m6A/m5C/m1A regulators, we performed GSVA. The heatmap indicated that cluster A was mainly enriched in complement and coagulation cascades, cytokine receptor interaction, cell adhesion molecules cams, leukocyte transendothelial migration, arachidonic acid metabolism, and neuroactive ligand receptor interaction; cluster B was mainly enriched in homologous recombination, glycosylphosphatidylinositol GPI anchor biosynthesis, and lysine degradation; and cluster C was significantly enriched in basal transcription factors (Fig. 4G–I).
Identification of m6A/m5C/m1A-related gene clusters
To further explore the clinical and biological traits of m6A/m5C/m1A clusters, a total of 580 DEGs were identified from the overlap across different m6A/m5C/m1A clusters in the Venn diagram (Fig. 5A). Then, GO results indicated that the DEGs mainly enriched in ribonucleoprotein complex biogenesis of biological process, nuclear envelope of cellular component, and single-stranded RNA binding of molecular function (Fig. 5B). KEGG results indicated that the DEGs mainly enriched in nucleocytoplasmic transport of the signaling pathway (Fig. 5C). Next, we further analyzed the DEGs of m6A/m5C/m1A clusters, and univariate Cox analysis was performed to obtain the prognostic DEGs. Then, the consensus clustering method was also used to identify proper gene clusters (k = 3), and there were three types of gene clusters: m6A/m5C/m1A-related gene cluster A (n = 304), cluster B (n = 238), and cluster C (n = 426) (Fig. 5D). Based on survival analysis, Fig. 5E shows that gene cluster A had a better survival outcome, while gene cluster B had a poorer survival outcome (p < 0.001). Figure 5F reveals that the three gene clusters had different clinical characteristics, and patients in gene cluster C showed an advanced clinical stage. Differential expression analysis indicated that the levels of most m6A/m5C/m1A regulators were elevated in the three gene clusters and that gene cluster C had higher levels of m6A/m5C/m1A regulators than the other gene clusters (Fig. 5G).
Construction and validation of the m6A/m5C/m1A scoring system
Although m6A/m5C/m1A regulators play a vital role in regulating prognosis and immune infiltration, due to the differential expression and complexity of m6A/m5C/m1A regulators, the above analysis referred to these regulators on a broad scale rather than at the individual level. We constructed an m6A/m5C/m1A score system to evaluate the clinical value of m6A/m5C/m1A regulators in individuals. The alluvial diagram showed the relationship among clusters, gene clusters, score groups, and fustats, which indicated that m6A/m5C/m1A cluster B/C and gene cluster A/C mainly corresponded to low m6A/m5C/m1A scores (Fig. 6A). Figure 6B shows that the low m6A/m5C/m1A score group had a better survival outcome, while the high score group had a poorer survival outcome (p < 0.001). The above results were consistent with the previous survival outcomes observed for m6A/m5C/m1A clusters and gene clusters. Immune infiltration analysis indicated that the infiltration levels of activated CD4 T cells, eosinophils, plasmacytoid dendritic cells, and types 2T helper cells were positively connective with the m6A/m5C/m1A score, while the infiltration levels of the remaining immune cells were negatively connective with the m6A/m5C/m1A score (Fig. 6C). Higher m6A/m5C/m1A scores were observed in cluster B/C and gene cluster B (p < 0.001) (Fig. 6D, E). Next, the prognostic value of the score was validated in three independent cohorts. The results showed that the low m6A/m5C/m1A score group had a better survival outcome, while the high score group had a poorer survival outcome (GSE29621 set, p = 0.004; GSE17536 set, p = 0.013; GSE17537 set, p = 0.033). The validation results were consistent with the previous construction result (Fig. 6F–H).
Comprehensive analysis of tumor mutations and clinical relevance in different m6A/m5C/m1A score groups
To further assess the clinical value of the m6A/m5C/m1A score, we first performed tumor mutation analysis in different m6A/m5C/m1A score groups. The results indicated that the high score group had higher mutation rates (100% vs. 94.37%). The top five mutated genes in the two score groups were APC, TP53, TTN, KRAS, and PIK3CA, and the top three mutation types were missense mutations, nonsense mutations, and multihit mutations (Fig. 7A, B). Previous studies have illustrated that there is a close connection between TMB and immunotherapy responses. Therefore, we performed survival analysis of TMB, which revealed that patients had a better prognosis in the high TMB group (p = 0.033) (Fig. 7C). We further investigated the relationship between the m6A/m5C/m1A score and TMB, and the results indicated that the best survival outcome was observed in the low score and the low TMB group (p < 0.001) (Fig. 7D). In addition, we investigated the prognostic value of the score in groups stratified according to different clinical characteristics (age, sex, stage), and the low score group had better survival outcomes in these subgroup analyses (Fig. S3). As shown in Fig. 7E, the death rate of colon cancer patients was higher in the high score group, and rank test results indicated that the m6A/m5C/m1A score was higher in dead patients (Fig. 7F).
Immunotherapy analysis of colon cancer patients in the m6A/m5C/m1A score groups
Immunotherapy is playing an increasingly main role in the current treatment of colon cancer, especially immuno-checkpoint inhibitor (ICI) treatment, including anti-CTLA-4/anti-PD-L1. Therefore, we first identified the expression levels of PD-L1/CTLA-4 in different score groups, and the results implied that PD-L1/CTLA-4 had higher expression levels in the high score group (Fig. 8E, F). IPS is a novel immunotherapeutic factor, and MSI status is strongly correlated with the response to immunotherapy. We further assessed IPS and MSI status in different score groups to predict immunotherapy response. The IPS in the low score group increased significantly compared to that in the high score group in CTLA-4 (pos/neg)/PD-1 (pos/neg) (Fig. 8A–D). Moreover, we explored the clinical significance of the m6A/m5C/m1A score in groups with different MSI types (MSS, MSI-H, MSI-L). As seen in Fig. 8G, Among patients with high regulator scores, the subgroups with MSS and MSI-L status had higher response rates. Higher m6A/m5C/m1A score were shown for those with MSS compared with those with other MSI types (Fig. 8H).
Validation of m6A/m5C/m1A regulators at the protein and mRNA expression levels
To investigate the key biological function of m6A/m5C/m1A regulators in colon cancer, we first assessed the protein expression levels of m6A/m5C/m1A regulators in colon cancer tissues and normal tissue with immunohistochemical staining data from the HPA. The results indicated that VIRMA and DNMT3B were significantly higher in colon cancer tissue, which was consistent with the previous analysis (Fig. 9). However, the protein levels of YTHDF2 and TET3 were different from the results of the previous analysis. The rest of the regulators showed no difference in tumor and normal tissues. YTHDF3 protein expression did not correspond to the staining outcome. Next, we further explored the mRNA expression levels of VIRMA and DNMT3B in 10 pairs of colon cancer and normal tissues with RT-qPCR. The results indicated that VIRMA and DNMT3B had higher mRNA expression levels in colon cancer tissue (Fig. 10), which was consistent with the protein level results. Therefore, we identified that VIRMA and DNMT3B could play vital roles in colon cancer.
Discussion
Previous reports have found that patients with colon cancer have a high mortality rate, mainly due to limitations of current clinical scoring systems and a lack of individualized treatment (Liu et al. 2022b). At the molecular level, inter- and intratumor heterogeneity contribute to a poor prognosis in patients. Epigenetics play a crucial role in the heterogeneity of colon cancer. Recently, as an important component of epigenetics, RNA modifications, including m6A, m5C, and m1A, have been demonstrated to be involved in various biological processes (Jung et al. 2020). RNA modifications interact with each other and play a joint role in the progression of cancers. However, many studies have focused solely on single RNA modifications, and different RNA modifications have also not been well studied in the tumor microenvironment and immune infiltration. Consequently, exploring overlapping RNA modifications, including m6A, m5C, and m1A, may help elucidate the features of the tumor microenvironment and immune infiltration, aiding in the development of an accurate and convenient clinical scoring system to help diagnose and individualize the treatment of patients with colon cancer.
In our study, we first analyzed the mutation traits, CNV status, gene location, expression levels, and interaction relationships of 48 m6A/m5C/m1A regulators in colon cancer. The waterfall chart showed that ZC3H13 had the highest mutation rate compared with the other regulators. Previous studies have also demonstrated that ZC3H13 may be the main regulator of the Ras-ERK signaling pathway (Zhu et al. 2019). Moreover, CNV mutation analysis implied that the majority of m6A/m5C/m1A regulators showed widespread CNV amplification and that TRMT6 had the highest rate of copy number loss. Gene copy number abnormalities may reveal therapeutic targets or markers of drug resistance in colon cancer (Pos et al. 2021). TRMT6/TRMT61A has been identified to increase m1A methylation of tRNA to increase PPARδ translation, which ultimately drives hepatic cancer stem cell self-renewal and tumorigenesis (Wang et al. 2021). In addition, we further investigated the expression levels of m6A/m5C/m1A regulators in colon cancer and found that there were 12 m6A/m5C/m1A regulators that showed a statistically significant difference between normal and tumor tissues. The above results revealed that m6A/m5C/m1A are involved in the occurrence and progression of colon cancer.
Considering the different biological properties of these regulators, three different clusters were identified with unsupervised cluster analysis. M6A/m5C/m1A cluster A had a poorer survival outcome than the other clusters. However, m6A/m5C/m1A cluster A had higher levels of immune cell infiltration than the other clusters. The main reason may be metabolic reprogramming. Colon cancer cells secrete various factors, including cytokines and chemokines, to reshape their microenvironment, which contributes to the reprogramming of the surrounding immune cells. Eventually, the microenvironment promotes tumor growth and metastasis. For example, myeloid-derived suppressor cells (MDSCs) are mainly enriched in m6A/m5C/m1A cluster C. MDSCs, as immunosuppressive cells, have angiogenic potential and can promote the growth and development of tumors (Motz and Coukos 2011).
To further explore the clinical and biological traits of the m6A/m5C/m1A clusters, three types of gene clusters were identified. The advantage of secondary clustering was that the number of differential genes could be expanded, the differential genes associated with prognosis could be screened, and laid the foundation for building a scoring system. Then, GO results indicated that the DEGs mainly enriched in ribonucleoprotein complex biogenesis, nuclear envelope, and single stranded RNA binding, while KEGG results indicated that the DEGs mainly enriched in nucleocytoplasmic transport, which further validated the role of RNA methylation in the tumor microenvironment.
Although m6A/m5C/m1A regulators played a vital role in regulating prognosis and immune infiltration, due to the differential expression and complexity of m6A/m5C/m1A regulators, the above analysis referred to the regulators as a whole rather than individual regulators. We constructed a m6A/m5C/m1A scoring system to evaluate the clinical value and treatment effect of m6A/m5C/m1A regulators in individuals. First, the m6A/m5C/m1A score could act as a significant predictive factor for colon cancer. As we expected, m6A/m5C/m1A cluster B/C and gene cluster A/C mainly corresponded to low m6A/m5C/m1A scores, which indicated a better prognosis. Next, the prognostic value of the score was validated in three independent cohorts (GSE29621 set, GSE17536 set, GSE17537 set). The results showed that the low m6A/m5C/m1A score group had a better survival outcome, while the high score group had a poorer survival outcome. The validation results were consistent with the previous construction result. Moreover, immune infiltration analysis indicated that the infiltration of activated CD4 T cells, eosinophils, plasmacytoid dendritic cells, and type 2T helper cells was positively related to the m6A/m5C/m1A score, and the score may identify the tumor immunophenotype and guide the use of targeted immunotherapy. In addition, TMB has recently been shown to correlate with clinical outcomes in a variety of cancers, such as melanoma, lung cancer, and colorectal cancer. Several studies have shown that high TMB is effective in predicting objective remission rates and progression-free survival, but TMB has limited ability to predict overall survival (Li et al. 2020). Our work identified that there was a close correlation between TMB and the m6A/m5C/m1A score, and a consolidated forecasting signature including TMB and the score could provide better survival prediction, which further guides immunotherapy. Immunotherapy has quickly replaced other traditional treatment modalities for a number of solid tumors, including colorectal cancer (Ganesh et al. 2019). Following the first success of melanoma treatment, immune checkpoint inhibitors, including anti-PD-L1 and anti-CTLA-4, now play a more critical role in immunotherapy. We found that there was a significant difference in the expression of PD-L1 and CTLA-4 in the different score groups. The IPS indicated better treatment results in the low score group regardless of the status of PD-1 and CTLA-4, which provides a guideline for immunotherapy.
Finally, to identify the key biological function of m6A/m5C/m1A regulators in colon cancer, we performed immunohistochemical staining and RT-qPCR to screen two key biomarkers (VIRMA and DNMT3B). Previous reports have revealed that VIRMA-directed m6A modification promotes NSCLC progression through m6A-dependent degradation of DAPK3 and that VIRMA could become a new therapeutic target for NSCLC (Xu et al. 2021). Moreover, VIRMA downregulation attenuated the aggressive phenotype of prostate cancer cells by reducing the stability and abundance of oncogenic lncRNAs through an overall reduction in m6A levels (Barros-Silva et al. 2020). However, the role of VIRMA in colon cancer still needs to be further explored. Moreover, low miR-203 expression in colorectal cancer leads to ABCG2 promoter methylation and significantly reduced expression by attenuating the inhibition of DNMT3B (To et al. 2017), which was consistent with our results.
We constructed and validated an accurate and simple scoring signature to assess the survival outcomes and immune infiltration characteristics of colon cancer patients. There are some limitations to our study. First, all sample data in this study are retrospective and require further validation with data from a multicenter and prospective study. Second, some clinical and molecular feature datasets are very inadequate, which may mask potential associations between the m6A/m5C/m1A score and certain variables. Third, the role of many regulators in colon cancer is unknown and needs to be further validated with in vivo and in vitro experiments.
Conclusion
We constructed and validated a stable and powerful m6A/m5C/m1A score signature to assess the survival outcomes and immune infiltration characteristics of colon cancer patients, which further guides the optimization of personalized treatment, making it valuable for clinical translation and implementation.
Data availability
All transcriptome and expression profile data were obtained from The Cancer Genome Atlas and Gene Expression Omnibus. These data are publicly available.
References
Barros-Silva D et al (2020) VIRMA-dependent N6-methyladenosine modifications regulate the expression of long non-coding RNAs CCAT1 and CCAT2 in prostate cancer. Cancers (Basel). https://doi.org/10.3390/cancers12040771
Bohnsack KE, Hobartner C, Bohnsack MT (2019) Eukaryotic 5-methylcytosine (m(5)C) RNA methyltransferases: mechanisms, cellular functions, and links to disease. Genes (Basel). https://doi.org/10.3390/genes10020102
Charoentong P et al (2017) Pan-cancer immunogenomic analyses reveal genotype-immunophenotype relationships and predictors of response to checkpoint blockade. Cell Rep 18:248–262. https://doi.org/10.1016/j.celrep.2016.12.019
Chujo T, Suzuki T (2012) Trmt61B is a methyltransferase responsible for 1-methyladenosine at position 58 of human mitochondrial tRNAs. RNA 18:2269–2276. https://doi.org/10.1261/rna.035600.112
Dai X, Wang T, Gonzalez G, Wang Y (2018) Identification of YTH domain-containing proteins as the readers for N1-methyladenosine in RNA. Anal Chem 90:6380–6384. https://doi.org/10.1021/acs.analchem.8b01703
Dominissini D et al (2016) The dynamic N(1)-methyladenosine methylome in eukaryotic messenger RNA. Nature 530:441–446. https://doi.org/10.1038/nature16998
Edelheit S, Schwartz S, Mumbach MR, Wurtzel O, Sorek R (2013) Transcriptome-wide mapping of 5-methylcytidine RNA modifications in bacteria, archaea, and yeast reveals m5C within archaeal mRNAs. PLoS Genet 9:e1003602. https://doi.org/10.1371/journal.pgen.1003602
Foroutan M, Bhuva DD, Lyu R, Horan K, Cursons J, Davis MJ (2018) Single sample scoring of molecular phenotypes. BMC Bioinf 19:404. https://doi.org/10.1186/s12859-018-2435-4
Ganesh K, Stadler ZK, Cercek A, Mendelsohn RB, Shia J, Segal NH, Diaz LA Jr (2019) Immunotherapy in colorectal cancer: rationale, challenges and potential. Nat Rev Gastroenterol Hepatol 16:361–375. https://doi.org/10.1038/s41575-019-0126-x
Goldman MJ et al (2020) Visualizing and interpreting cancer genomics data via the Xena platform. Nat Biotechnol 38:675–678. https://doi.org/10.1038/s41587-020-0546-8
Hanzelmann S, Castelo R, Guinney J (2013) GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinf 14:7. https://doi.org/10.1186/1471-2105-14-7
Haruehanroengra P, Zheng YY, Zhou Y, Huang Y, Sheng J (2020) RNA modifications and cancer. RNA Biol 17:1560–1575. https://doi.org/10.1080/15476286.2020.1722449
He R, Man C, Huang J, He L, Wang X, Lang Y, Fan Y (2022) Identification of RNA methylation-related lncRNAs signature for predicting hot and cold tumors and prognosis in colon cancer. Front Genet 13:870945. https://doi.org/10.3389/fgene.2022.870945
Jia G et al (2011) N6-methyladenosine in nuclear RNA is a major substrate of the obesity-associated FTO. Nat Chem Biol 7:885–887. https://doi.org/10.1038/nchembio.687
Jung G, Hernandez-Illan E, Moreira L, Balaguer F, Goel A (2020) Epigenetics of colorectal cancer: biomarker and therapeutic potential. Nat Rev Gastroenterol Hepatol 17:111–130. https://doi.org/10.1038/s41575-019-0230-y
Li F, Zhao D, Wu J, Shi Y (2014) Structure of the YTH domain of human YTHDF2 in complex with an m(6)A mononucleotide reveals an aromatic cage for m(6)A recognition. Cell Res 24:1490–1492. https://doi.org/10.1038/cr.2014.153
Li R, Han D, Shi J, Han Y, Tan P, Zhang R, Li J (2020) Choosing tumor mutational burden wisely for immunotherapy: a hard road to explore. Biochim Biophys Acta Rev Cancer 1874:188420. https://doi.org/10.1016/j.bbcan.2020.188420
Liu F et al (2016) ALKBH1-mediated tRNA demethylation regulates translation. Cell 167:1897. https://doi.org/10.1016/j.cell.2016.11.045
Liu J et al (2022a) A comprehensive analysis and validation of cuproptosis-associated genes across cancers: overall survival, the tumor microenvironment, stemness scores, and drug sensitivity. Front Genet 13:939956. https://doi.org/10.3389/fgene.2022.939956
Liu Z et al (2022b) Machine learning-based integration develops an immune-derived lncRNA signature for improving outcomes in colorectal cancer. Nat Commun 13:816. https://doi.org/10.1038/s41467-022-28421-6
Mariathasan S et al (2018) TGFbeta attenuates tumour response to PD-L1 blockade by contributing to exclusion of T cells. Nature 554:544–548. https://doi.org/10.1038/nature25501
Motz GT, Coukos G (2011) The parallel lives of angiogenesis and immunosuppression: cancer and other tales. Nat Rev Immunol 11:702–711. https://doi.org/10.1038/nri3064
Pos O, Radvanszky J, Buglyo G, Pos Z, Rusnakova D, Nagy B, Szemes T (2021) DNA copy number variation: main characteristics, evolutionary significance, and pathological aspects. Biomed J 44:548–559. https://doi.org/10.1016/j.bj.2021.02.003
Qi L et al (2022) Cross-talk of multiple types of RNA modification regulators uncovers the tumor microenvironment and immune infiltrates in soft tissue sarcoma. Front Immunol 13:921223. https://doi.org/10.3389/fimmu.2022.921223
Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK (2015) limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43:e47. https://doi.org/10.1093/nar/gkv007
Shi H, Chai P, Jia R, Fan X (2020) Novel insight into the regulatory roles of diverse RNA modifications: re-defining the bridge between transcription and translation. Mol Cancer 19:78. https://doi.org/10.1186/s12943-020-01194-6
Song W, Ren J, Xiang R, Yuan W, Fu T (2022) cross-talk between m(6)A- and m(5)C-related lncRNAs to construct a novel signature and predict the immune landscape of colorectal cancer patients. Front Immunol 13:740960. https://doi.org/10.3389/fimmu.2022.740960
Sotiriou C et al (2006) Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. J Natl Cancer Inst 98:262–272. https://doi.org/10.1093/jnci/djj052
Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, Bray F (2021) Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71:209–249. https://doi.org/10.3322/caac.21660
Tauriello DV, Calon A, Lonardo E, Batlle E (2017) Determinants of metastatic competency in colorectal cancer. Mol Oncol 11:97–119. https://doi.org/10.1002/1878-0261.12018
To KK, Leung WW, Ng SS (2017) A novel miR-203-DNMT3b-ABCG2 regulatory pathway predisposing colorectal cancer development. Mol Carcinog 56:464–477. https://doi.org/10.1002/mc.22508
Vilardo E, Nachbagauer C, Buzet A, Taschner A, Holzmann J, Rossmanith W (2012) A subcomplex of human mitochondrial RNase P is a bifunctional methyltransferase–extensive moonlighting in mitochondrial tRNA biogenesis. Nucleic Acids Res 40:11583–11593. https://doi.org/10.1093/nar/gks910
Wagner GP, Kin K, Lynch VJ (2012) Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory Biosci 131:281–285. https://doi.org/10.1007/s12064-012-0162-3
Wang P, Doxtader KA, Nam Y (2016a) Structural basis for cooperative function of Mettl3 and Mettl14 methyltransferases. Mol Cell 63:306–317. https://doi.org/10.1016/j.molcel.2016.05.041
Wang X et al (2016b) Structural basis of N(6)-adenosine methylation by the METTL3-METTL14 complex. Nature 534:575–578. https://doi.org/10.1038/nature18298
Wang Y et al (2021) N(1)-methyladenosine methylation in tRNA drives liver tumourigenesis by regulating cholesterol metabolism. Nat Commun 12:6314. https://doi.org/10.1038/s41467-021-26718-6
Wilkerson MD, Hayes DN (2010) ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinformatics 26:1572–1573. https://doi.org/10.1093/bioinformatics/btq170
Xie S et al (2020) Emerging roles of RNA methylation in gastrointestinal cancers. Cancer Cell Int 20:585. https://doi.org/10.1186/s12935-020-01679-w
Xu Y et al (2021) VIRMA contributes to non-small cell lung cancer progression via N(6)-methyladenosine-dependent DAPK3 post-transcriptional modification. Cancer Lett 522:142–154. https://doi.org/10.1016/j.canlet.2021.08.027
Yang X et al (2017) 5-methylcytosine promotes mRNA export—NSUN2 as the methyltransferase and ALYREF as an m(5)C reader. Cell Res 27:606–625. https://doi.org/10.1038/cr.2017.55
Yu G, Wang LG, Han Y, He QY (2012) clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16:284–287. https://doi.org/10.1089/omi.2011.0118
Zeng D et al (2019) Tumor microenvironment characterization in gastric cancer identifies prognostic and immunotherapeutically relevant gene signatures. Cancer Immunol Res 7:737–750. https://doi.org/10.1158/2326-6066.CIR-18-0436
Zeng C et al (2022) DPEP1 promotes drug resistance in colon cancer cells by forming a positive feedback loop with ASCL2. Cancer Med. https://doi.org/10.1002/cam4.4926
Zheng G et al (2013) ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. Mol Cell 49:18–29. https://doi.org/10.1016/j.molcel.2012.10.015
Zhu D, Zhou J, Zhao J, Jiang G, Zhang X, Zhang Y, Dong M (2019) ZC3H13 suppresses colorectal cancer proliferation and invasion via inactivating Ras-ERK signaling. J Cell Physiol 234:8899–8907. https://doi.org/10.1002/jcp.27551
Acknowledgements
The authors are grateful for the help of all colleagues and the public database.
Funding
This study was supported by Changzhou High-Level Medical Talents Training Project (No: 2016CZBJ054) and Clinical Medical Science and Technology Development Fund by the Changzhou Sci & Tech Program (No: CJ20210015 and CJ20220006).
Author information
Authors and Affiliations
Contributions
JL and WL: design of the study, data analysis, and manuscript writing. XL and YL: data collection and formal analysis. JL and XL: experimental validation. JL and MD: project administration, supervision, and validation. The final manuscript received the approval of all the authors.
Corresponding author
Ethics declarations
Conflict of interest
The authors state that the research has no conflicts of interest.
Ethical approval
The ethics commission of Wujin Hospital affiliated with Jiangsu University gave its approval to this study. Informed consent was acquired from patients.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
432_2023_5033_MOESM1_ESM.tif
Supplementary file1 Fig. S1: The overall survival curves of colon cancer patients treated with m6A/m5C/m1A regulators. (A-G) High levels of 7 m6A/m5C/m1A regulators were associated with a favorable prognosis. Red and blue represent high and low expression, respectively (TIF 1768 KB)
432_2023_5033_MOESM2_ESM.tif
Supplementary file2 Fig. S2: The levels of tumor-infiltrating immune cells and immune functions in different m6A/m5C/m1A clusters (TIF 2353 KB)
432_2023_5033_MOESM3_ESM.tif
Supplementary file3 Fig. S3: (A-F) Prognostic value of the m6A/m5C/m1A score in groups stratified by different clinical characteristics (age, sex, stage) (TIF 1254 KB)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Liu, J., Dou, M., Liu, X. et al. A novel m6A/m5C/m1A score signature to evaluate prognosis and its immunotherapy value in colon cancer patients. J Cancer Res Clin Oncol 149, 11995–12012 (2023). https://doi.org/10.1007/s00432-023-05033-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00432-023-05033-1