Multi-staged gene expression profiling reveals potential genes and the critical pathways in kidney cancer

Khouja, Hamed Ishaq; Ashankyty, Ibraheem Mohammed; Bajrai, Leena Hussein; Kumar, P. K. Praveen; Kamal, Mohammad Amjad; Firoz, Ahmad; Mobashir, Mohammad

doi:10.1038/s41598-022-11143-6

Article
Open access
Published: 04 May 2022

Volume 12, article number 7240, (2022)

Scientific Reports

Hamed Ishaq Khouja¹,
Ibraheem Mohammed Ashankyty¹,
Leena Hussein Bajrai^2,3,
P. K. Praveen Kumar⁴,
Mohammad Amjad Kamal^5,6,7,
Ahmad Firoz⁸ &
…
Mohammad Mobashir⁹

Abstract

Cancer is among the most complex diseases, and renal cell carcinoma is the sixth-leading cause of cancer death. To understand complex diseases such as cancer, diabetes and kidney diseases, high-throughput data are generated at large scale, and this has helped advance research and diagnostics. However, extracting meaningful information from such large datasets for a comprehensive and detailed understanding of cell phenotypes and disease pathophysiology remains a non-trivial challenge, and the molecular events leading to disease onset and progression are not well understood. With this goal, we collected publicly available gene expression datasets covering two stages (I and II) of renal cell carcinoma and, furthermore, used the TCGA and cBioPortal databases to assess clinical relevance. In this work, we applied a computational approach to identify the differentially expressed genes and their networks for the enriched pathways. Based on our results, we conclude that the most dominantly altered pathways in renal cell carcinoma include the PI3K-Akt, Foxo, endocytosis, MAPK, tight junction, and cytokine-cytokine receptor interaction pathways, and that the major sources of alteration for these pathways are MAP3K13, CHAF1A, FDX1, ARHGAP26, ITGBL1, C10orf118, MTO1, LAMP2, STAMBP, DLC1, NSMAF, YY1, TPGS2, SCARB2, PRSS23, SYNJ1, CNPPD1, PPP2R5E. In terms of clinical significance, a large number of differentially expressed genes appear to play critical roles in survival.

Introduction

Renal cell carcinoma (RCC) is the most common type of kidney cancer in adults, responsible for approximately 90–95% of cases, and it is one of the leading causes of cancer death. It occurs more often in men than in women, with a ratio of 1.5:1. RCC originates in the lining of the proximal convoluted tubule, part of the very small tubes in the kidney that transport primary urine^1,2. High-throughput data are generated at large scale in order to understand complex diseases like cancer, and this has aided research and diagnostic advancement^3,4,5,6. However, extracting useful knowledge from such vast datasets for a complete and detailed understanding of cell phenotypes and disease pathophysiology remains a difficult task, and the molecular events that contribute to disease initiation and progression are still poorly understood^7,8,9. The post-genomics period has produced a huge amount of "big data" in the biological sciences, which has led to a multitude of interdisciplinary applications in recent decades^5,10. A number of biological databases house various types of datasets. TCGA, Oncomine, Nephroseq, and GEO (Gene Expression Omnibus) are the most widely used databases in the biological sciences¹¹. These databases, mainly GEO, store a vast number of datasets related to cancer, diabetes, and other biological problems^{8,12,13,14,15,16}.

The identification of pathogenetically distinct tumour types poses a significant challenge in the treatment of complex diseases (especially cancer)^17,18,19. Improved tumor classification helps improve therapeutic approaches^20,21. In target-specific therapy, enhanced classification can maximise effectiveness while reducing toxicity. A variety of tools and approaches have previously been used to access biological datasets from these databases. For the molecular classification of cancer, Golub et al.²² divided cancer classification into two challenges: class discovery and class prediction.

A number of oncogenes and tumour suppressor genes that are changed in RCC, resulting in pathway dysregulation, need to be identified and investigated further^23,24,25. Copy number, gene sequencing, expression pattern, and methylation in primary RCC are all possible avenues for achieving this goal. With continued breakthroughs in omics technology, the application of molecular markers for early diagnosis and prognosis deserves further attention^{1,2,26,27,28,29,30}.

We selected an RCC dataset with samples from two stages (I and II), profiled on two Affymetrix platforms (U133A and U133B), to understand how gene expression patterns vary and how altered expression leads to changes in the inferred functions as the tumor moves from stage I to stage II. Cancer stages help describe where a cancer is located, how far it has spread, and whether it is affecting other parts of the body^31,32,33. Healthy tissue usually contains many different types of cells grouped together. If the cancer looks similar to healthy tissue and contains different cell groupings, it is called a differentiated or low-grade tumor; when the cancerous tissue looks very different from healthy tissue, it is termed a poorly differentiated or high-grade tumor. The cancer’s grade may help the clinician predict how quickly the cancer will spread. In general, the lower the tumor’s grade, the better the prognosis. Different types of cancer have different methods for assigning a grade^{7,34,35,36,37}. In general, most cancers are very hard to detect at an early stage, so our main focus was on exploring alterations in gene expression patterns and their functional consequences; to avoid bias, we also incorporated the TCGA dataset, which has samples from all grades.

Here, we selected a dataset from the Gene Expression Omnibus (GEO) in which the samples are human and cover two tumor stages (I and II). We organized the samples as stage I normal versus tumor and stage II normal versus tumor for the Affymetrix platforms U133A and U133B, and analyzed the tumor samples against their respective controls (normal samples of the same stage) for gene expression alterations and evolved functions with the increase in tumor percentage. Based on our work, we conclude that, irrespective of tumor stage, the PI3K-Akt, Foxo, endocytosis, MAPK, tight junction, and cytokine-cytokine receptor interaction pathways are altered, and that the major sources of alteration for these pathways are MAP3K13, CHAF1A, FDX1, ARHGAP26, ITGBL1, C10orf118, MTO1, LAMP2, STAMBP, DLC1, NSMAF, YY1, TPGS2, SCARB2, PRSS23, SYNJ1, CNPPD1, PPP2R5E. In addition, we studied the clinical significance and observe that a large number of differentially expressed genes appear to play critical roles in survival, such as ARHGAP6, TGM4, CD248, SLC13A3, EPO, PARD6A, CLCA2, UBE2S, ERAL1, FGFR1, MRVI1, DYNC1I2, CDCA7.

Results

In the first step, we selected the raw expression dataset of interest, GSE6344^30,38, organized the samples as stage I normal versus tumor and stage II normal versus tumor for the Affymetrix platforms U133A and U133B, and processed it through normalization to log2 values for all mapped genes, as shown in the workflow (Fig. 1a). This dataset contains 40 samples (5 normal and 5 tumor for each of stages I and II on each of the U133A and U133B platforms). For differential gene expression analysis, we compared the tumor samples with the normal samples of the respective stage and platform, which gives four DEG lists.

Gene expression profiling and the associated functions for varying tumor percentages

The initial goal of this study was to understand the gene expression patterns of normal versus tumor samples across the different stages. For this purpose, the total numbers of DEGs and of up- and down-regulated genes were calculated (Fig. 1b); the number of down-regulated genes is higher than the number of up-regulated genes in all four DEG lists (Fig. 1c). For the U133A dataset, we observe a very high number of DEGs, with 1147 genes shared between stages I and II, compared with 606 shared genes for U133B; the numbers of stage I-specific and stage II-specific genes are also high on both platforms. The enriched pathways follow a similar trend to the DEG distribution, as shown in Fig. 1d (p-values < 0.05), even after applying a strict p-value cut-off, as shown in Fig. 1e (p-values < 0.001). Most of the genes shared between the different stages and platforms are shown with their fold changes; these genes are known to be associated with critical pathways that are important in multiple types of cancer (Fig. 1f). In addition, we mapped the known associations between all these genes (from Fig. 1f): the DEG lists were combined into a single list, and these genes were mapped using the network database, as shown in Fig. 1g. Figure 1g presents the network of DEGs and their connectivity, in which four smaller clusters are connected by a core cluster of the SYNJ1, MAPT, YY1, NSMAF, and FNBP4 genes; the highly connected genes include SYNJ1, LAMP2, SCARB2, FDX1, HDLBP, CHAF1A, MAPT, and FNBP4.
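The shared versus stage-specific DEG counts reported here come down to set operations on two gene lists. The following is a minimal Python sketch of that partition, not the authors' code (they report using MATLAB and a Venn web server); the function name `venn_two` is hypothetical and the gene lists are toy examples rather than the paper's DEG lists.

```python
def venn_two(stage1_degs, stage2_degs):
    """Partition two DEG lists into stage I only, shared, and stage II only."""
    s1, s2 = set(stage1_degs), set(stage2_degs)
    return {
        "stage_I_only": s1 - s2,
        "shared": s1 & s2,
        "stage_II_only": s2 - s1,
    }

# Toy gene lists for illustration only (not the paper's DEG lists).
stage1 = ["SYNJ1", "LAMP2", "YY1", "FDX1"]
stage2 = ["SYNJ1", "LAMP2", "DLC1"]
parts = venn_two(stage1, stage2)
print({name: sorted(genes) for name, genes in parts.items()})
```

Applied to the real lists, `len(parts["shared"])` would correspond to the shared-gene counts quoted for each platform.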

Top-ranked enriched pathways for the respective DEGs list

After analyzing the number of DEGs and the enriched pathways, we examined the enriched pathways and the genes that are altered in the different RCC tumor stages (Table 1). We observe that the MAPK, cytokine, Akt, Wnt, hippo, Hif1, and metabolic signaling pathways are among the top-ranked, frequently altered pathways, and that their potential sources of alteration are MAP3K13, CHAF1A, FDX1, ARHGAP26, ITGBL1, C10orf118, MTO1, LAMP2, STAMBP, DLC1, NSMAF, YY1, TPGS2, SCARB2, PRSS23, SYNJ1, CNPPD1, PPP2R5E. These genes and pathways are known to play roles, directly or indirectly, in cancer.
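The text does not state which statistic the authors' own code uses to decide that a pathway is enriched. One common choice for this kind of over-representation analysis is the hypergeometric (one-sided Fisher) test, and the sketch below assumes that test purely for illustration. The function name and all counts are hypothetical.

```python
from math import comb

def hypergeom_enrichment_p(n_universe, n_pathway, n_degs, n_overlap):
    """P(X >= n_overlap) when n_degs genes are drawn without replacement
    from n_universe genes, of which n_pathway belong to the pathway."""
    upper = min(n_pathway, n_degs)
    total = comb(n_universe, n_degs)
    tail = sum(
        comb(n_pathway, k) * comb(n_universe - n_pathway, n_degs - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total

# Invented counts: 20000 measured genes, a 100-gene pathway,
# 500 DEGs, 10 of which fall in the pathway.
p_value = hypergeom_enrichment_p(20000, 100, 500, 10)
print(f"enrichment p-value: {p_value:.3g}")
```

A pathway would then be reported at the cut-offs used in Fig. 1d,e (p < 0.05 or p < 0.001) if this were the statistic in use.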

Table 1 Enriched pathways grouped either common or specific to the conditions.

Network-level understanding of the DEGs

Based on the Venn diagram of the enriched pathways, we prepared five groups of commonly enriched pathways and matched the genes from all four DEG lists (normal versus tumor in stages I and II for the U133A and U133B datasets) to these pathway lists. Fig. 2 shows the networks for stage I of the U133A dataset and stages I and II of the U133B dataset. The networks shown are for those DEGs that match the different pathway lists obtained from the Venn diagram. The major pathways are highlighted at the top of the figure, and the tumor stage is indicated on the left. Since most of the networks for stage II of the U133A dataset were densely connected, for these networks we present the top 30 genes by connectivity within the network (Fig. 3). Here, we also show the connectivity of the genes for those networks where the connections are not clearly visible. The lists of genes and pathways used for the network-level analysis are provided in Supplementary Table S1.
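Selecting the top 30 genes by connectivity can be read as ranking nodes by degree in an undirected network. The sketch below assumes that reading; it is a Python illustration rather than the authors' MATLAB code, the function name `top_connected` is hypothetical, and the edge list is invented for the example (it is not taken from FunCoup).

```python
def top_connected(edges, k=30):
    """Rank genes by degree (number of distinct neighbours) in an undirected network."""
    neighbours = {}
    for a, b in edges:
        if a == b:
            continue  # ignore self-loops
        neighbours.setdefault(a, set()).add(b)
        neighbours.setdefault(b, set()).add(a)
    degree = {gene: len(linked) for gene, linked in neighbours.items()}
    # Sort by descending degree, then by name so ties are deterministic.
    return sorted(degree.items(), key=lambda item: (-item[1], item[0]))[:k]

# Invented edges for illustration only.
toy_edges = [
    ("SYNJ1", "MAPT"),
    ("SYNJ1", "YY1"),
    ("SYNJ1", "NSMAF"),
    ("MAPT", "FNBP4"),
    ("YY1", "NSMAF"),
]
print(top_connected(toy_edges, k=2))
```

Storing neighbours as sets means duplicate edges do not inflate a gene's degree.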

Clinical significance of the differentially expressed genes

Additionally, we selected the top-ranked genes (15 up-regulated and 15 down-regulated, based on fold change) and analyzed patient survival (Kaplan–Meier plots) for the TCGA kidney renal clear cell carcinoma dataset (source data from GDAC Firehose), which contains 538 samples^5,36,39. We observe that most of the top-ranked genes (from the selected 30 DEGs), mainly the up-regulated genes, are highly significant for patient survival (Fig. 4). This figure also shows the mutations in these top-ranked DEGs for clear cell renal cell carcinoma in the TCGA database. A few genes (ERBB4, SLC13A3, TGM4, and FGFR1) are mutated at a very high rate, as shown in Fig. 4a,b. Further, we selected a different dataset (GSE68417⁴⁰), which contains adjacent normal, low-grade, and high-grade samples, and compared the differentially expressed genes and the enriched pathways with each other (Fig. 5a). This shows that the DEGs of adjacent normal versus low-grade tumor samples share the majority of the DEGs of adjacent normal versus high-grade tumor samples, and that both lists share few DEGs with the low-grade versus high-grade DEG list. As expected, there were no shared enriched pathways at all, because only a few genes have a fold change ≥ +1.5 (up-regulated) or ≤ −1.5 (down-regulated) in the low-grade versus high-grade comparison. Kaplan–Meier plots show that a large number of differentially expressed genes appear to be potentially significant in terms of survival; some of the selected genes are ARHGAP6, TGM4, CD248, SLC13A3, EPO, PARD6A, CLCA2, UBE2S, ERAL1, FGFR1, MRVI1, DYNC1I2, CDCA7 (additional data shown in Supplementary Figs. S1–S6). Moreover, Fig. 5b lists the genes with their respective p-values from survival analysis; only clinically significant genes are shown, and the overall pathways associated with these genes and further specific associations are shown in Fig. 5c. Additionally, the expression levels (RNA and protein) are shown in Supplementary data S7. We checked the expression of these clinically relevant genes using the Human Protein Atlas, where most of these genes are expressed in RCC and act as biomarkers; only TGM4 and GGN were not expressed.
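For readers unfamiliar with how a Kaplan–Meier curve is built, the following is a minimal sketch of the product-limit estimator in Python. It is an illustration under stated assumptions, not the pipeline used for Fig. 4: the function name is hypothetical, the follow-up times are invented, and the significance test that usually accompanies such plots (commonly a log-rank test) is not implemented here.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct event time.

    times:  follow-up durations (any consistent unit)
    events: 1 if death was observed at that time, 0 if the patient was censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Gather every patient whose follow-up ends at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= removed  # deaths and censored patients both leave the risk set
    return curve

# Invented follow-up data: four patients, one censored at t = 2.
print(kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1]))
```

To compare expression groups, one would typically split patients by a gene's expression (for example at the median, which is an assumption here) and compute one curve per group.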

Discussion

Renal cell carcinoma is one of the most common cancers, and it is one of the leading causes of cancer death^14,15,41. In terms of therapy and diagnosis, therapeutic and clinical outcomes differ between individuals even when their clinical and pathological characteristics (tumor type, grade, and stage) are closely similar. Despite tremendous efforts to identify molecular biomarkers (prognostic and predictive) with improved precision compared to clinical and pathological predictors, only a few molecular tests have been introduced into oncological practice²⁹. It is therefore important to understand and unravel the different levels of diversity in cancer (such as gene expression pattern, epigenetics, and protein expression)^42,43. We gathered the previously published dataset for this purpose and conducted a detailed study ranging from gene expression profiling to functional changes, including networks mapped from the human protein network database.

Our work leads to the conclusion that, irrespective of tumor stage, the PI3K-Akt, Foxo, endocytosis, MAPK, tight junction, and cytokine-cytokine receptor interaction pathways are altered, and that the major sources of alteration for these pathways are MAP3K13, CHAF1A, FDX1, ARHGAP26, ITGBL1, C10orf118, MTO1, LAMP2, STAMBP, DLC1, NSMAF, YY1, TPGS2, SCARB2, PRSS23, SYNJ1, CNPPD1, PPP2R5E. Networks of DEGs for the enriched pathways show that a large number of genes from a few specific pathways are altered, such as the Ras signaling pathways (Fig. 2c,h,m), the immune system, Wnt, and hippo pathways (Fig. 2d,i,n), and the Akt pathways (Fig. 2a,f,k). Here, we observe that the critical pathways altered in RCC are Wnt, hippo, regulation of actin cytoskeleton, ECM, infection and inflammation, metabolic, and further cancer-related pathways. From the mapped network, we observe that the highly connected genes point to the potential pathways; in other words, the top-ranked genes by connectivity belong to pathways that are directly or indirectly associated with RCC or other types of cancer.

In terms of clinical significance, we looked at the mutation rate of the top-ranked genes (based on fold change) and at patient survival in relation to changes in gene expression, with Kaplan–Meier plots indicating clinical significance. We conclude that a large number of differentially expressed genes tend to be potentially important in terms of survival, with ARHGAP6, TGM4, CD248, SLC13A3, EPO, PARD6A, CLCA2, UBE2S, ERAL1, FGFR1, MRVI1, DYNC1I2, CDCA7 among the genes chosen. Using publicly available datasets, we investigated gene expression profiles in renal cell carcinoma. Previous work has focused on selected genes and pathways. Here, we investigated the list of critical pathways and the genes that appear to be clinically highly significant in renal cell carcinoma. These clinically significant genes lead to potential alterations in the PI3K-Akt, Foxo, endocytosis, MAPK, tight junction, and cytokine-cytokine receptor interaction pathways. Our work will help in diagnosing renal cell carcinoma patients because we present the differentially expressed genes, their inferred pathways, and the clinical impact of the selected genes. Since our findings take an overall perspective that includes clinical relevance, this study will also help future diagnostics.

This work also differs from previous studies in that we explored stages I and II of RCC and further explored the clinical relevance. Healthy tissue usually contains many different types of cells grouped together. If the cancer looks similar to healthy tissue and contains different cell groupings, it is called a differentiated or low-grade tumor; when the cancerous tissue looks very different from healthy tissue, it is termed a poorly differentiated or high-grade tumor. The cancer’s grade may help the clinician predict how quickly the cancer will spread. In general, the lower the tumor’s grade, the better the prognosis. Different types of cancer have different methods for assigning a grade^{7,34,35,36,37}, and the tumor stage can help describe the severity, the speed of tumor propagation, and its impact on other organs^31,32,33. In general, most cancers are very hard to detect at an early stage, so our main focus was on exploring alterations in gene expression patterns and their functional consequences; to avoid bias, we also incorporated the TCGA dataset, which has samples from all grades. Further, we investigated the expression of these clinically relevant genes using the Human Protein Atlas (https://www.proteinatlas.org/)^{44,45,46,47,48}. We observe that most of these genes are expressed in RCC and act as biomarkers; only TGM4 and GGN were not expressed. This study will be an important step towards understanding early-stage tumor propagation and will also be helpful from a clinical perspective.

Conclusions

Based on our findings, we conclude that PI3K-Akt, Foxo, endocytosis, MAPK, tight junction, and cytokine-cytokine receptor interaction pathways are among the most commonly altered pathways in renal cell carcinoma, and that MAP3K13, CHAF1A, FDX1, ARHGAP26, ITGBL1, C10orf118, MTO1, LAMP2, STAMBP, DLC1, NSMAF, YY1, TPGS2, SCARB2, PRSS23, SYNJ1, CNPPD1, and PPP2R5E are the major sources of alteration for these pathways. Wnt, hippo, regulation of actin cytoskeleton, ECM, infection and inflammation, metabolic, and other cancer-related pathways are among the most important pathways altered in RCC. ARHGAP6, TGM4, CD248, SLC13A3, EPO, PARD6A, CLCA2, UBE2S, ERAL1, FGFR1, MRVI1, DYNC1I2, and CDCA7 are some of the genes selected after survival analysis.

Methods

The GSE6344 dataset, which contains gene expression samples for stage I and stage II kidney tumors, was used for this study^30,38. In the first step, we selected the raw expression dataset GSE6344 and processed it through normalisation to log2 values for all mapped genes, as shown in the workflow (Fig. 1a). The 40 samples in this dataset comprise 5 normal and 5 tumor samples for each of stages I and II on each of the U133A and U133B platforms. We compared the tumor samples with the normal samples of the respective stage and platform for differential gene expression analysis, yielding four DEG lists.

In short, the basic steps of the entire study are raw file processing, intensity calculation, and normalization. For normalization^49,50,51, GCRMA^{52,53,54,55,56}, RMA, and EB are the most commonly used approaches. Here, we used EB for raw intensity normalization. After normalization, we proceeded to our goal, which is to understand the gene expression patterns^14,57 and their inferred functions^57,58.
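The text gives no detail on the EB normalization that was actually applied, so it is not reproduced here. As a generic illustration of what "normalization and log2 values" does to raw intensities, the sketch below uses a different and plainly simpler technique: a log2 transform followed by quantile normalization, which is the normalization step used within RMA. The function name is hypothetical, the intensities are invented, and ties are not handled specially.

```python
from math import log2

def log2_quantile_normalize(columns):
    """columns: list of per-array intensity lists (same probes, same order).

    Returns log2 values forced onto a common distribution across arrays.
    """
    logged = [[log2(v) for v in col] for col in columns]
    n_probes = len(logged[0])
    # The mean of the k-th smallest value across arrays defines the reference.
    sorted_cols = [sorted(col) for col in logged]
    reference = [
        sum(col[k] for col in sorted_cols) / len(sorted_cols)
        for k in range(n_probes)
    ]
    normalized = []
    for col in logged:
        order = sorted(range(n_probes), key=lambda i: col[i])
        out = [0.0] * n_probes
        for rank, probe_index in enumerate(order):
            out[probe_index] = reference[rank]
        normalized.append(out)
    return normalized

# Two invented arrays with three probes each.
print(log2_quantile_normalize([[2, 4, 8], [16, 4, 8]]))
```

After this step every array shares the same set of values, and only the ranking of probes within each array differs.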

To prepare and analyze the DEG lists, we used our own in-house code. The samples were placed into two groups, normal and tumor. Fold changes and p-values were calculated, and genes were selected as differentially expressed using thresholds of ± 2 for fold change and 0.05 for p-value. The KEGG database^59,60,61 was then used for pathway analysis, for which we also used our own code⁶². In summary, MATLAB2017 functions (e.g., mattest) were applied for differential gene expression prediction and statistical analysis, and the KEGG⁶¹ database was used for pathway analysis^62,63,64,65.
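The selection rule described above (a fold-change threshold combined with a p-value threshold) can be sketched as follows. This is an illustration in Python, not the authors' MATLAB code. Two assumptions are made and flagged in the comments: the ± 2 fold-change threshold is read as a two-fold change on the linear scale (|log2 fold change| ≥ 1), and an exact permutation test stands in for `mattest`, which is a two-sample t-test. All function names and expression values are hypothetical.

```python
from itertools import combinations
from math import log2
from statistics import mean

def permutation_p(normal, tumor):
    """Two-sided exact permutation p-value for a difference in group means.
    (Stand-in for a two-sample t-test; this swap is an assumption.)"""
    pooled = list(normal) + list(tumor)
    n = len(normal)
    total = sum(pooled)
    observed = abs(mean(tumor) - mean(normal))
    hits = 0
    count = 0
    for idx in combinations(range(len(pooled)), n):
        group_sum = sum(pooled[i] for i in idx)
        m1 = group_sum / n
        m2 = (total - group_sum) / (len(pooled) - n)
        if abs(m2 - m1) >= observed - 1e-9:  # tolerance for float rounding
            hits += 1
        count += 1
    return hits / count

def call_degs(expr, fold_threshold=2.0, p_threshold=0.05):
    """expr maps gene -> (normal log2 values, tumor log2 values)."""
    degs = {}
    for gene, (normal, tumor) in expr.items():
        log2_fc = mean(tumor) - mean(normal)
        p = permutation_p(normal, tumor)
        # Assumption: "fold change of +/- 2" means |log2 FC| >= log2(2) = 1.
        if abs(log2_fc) >= log2(fold_threshold) and p < p_threshold:
            direction = "up" if log2_fc > 0 else "down"
            degs[gene] = {"direction": direction, "log2_fc": log2_fc, "p": p}
    return degs

# Invented log2 expression values, 5 normal and 5 tumor samples per gene.
toy_expr = {
    "GENE_UP": ([5.0, 5.1, 4.9, 5.2, 5.0], [7.0, 7.2, 6.9, 7.1, 7.3]),
    "GENE_FLAT": ([6.0, 6.1, 5.9, 6.2, 6.0], [6.1, 6.0, 6.0, 6.1, 5.9]),
    "GENE_DOWN": ([9.0, 9.2, 8.9, 9.1, 9.3], [7.0, 7.1, 6.9, 7.2, 7.0]),
}
toy_degs = call_degs(toy_expr)
print({gene: info["direction"] for gene, info in toy_degs.items()})
```

With 5 versus 5 samples there are only 252 possible group splits, so the smallest two-sided permutation p-value is 2/252; a t-test, as in the original MATLAB workflow, does not have this floor.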

To generate the DEG networks, FunCoup2.0⁶⁶ was used throughout the work, and Cytoscape⁶⁷ and its applications⁶⁸ were used for network visualization, to understand the networks and the connectivity of the genes within the DEG networks^69,70. The FunCoup network database predicts four different classes of functional coupling or association: protein complexes, protein–protein physical interactions, metabolic pathways, and signaling pathways⁶⁶. Most of our coding and calculations were done in MATLAB^62,63,64,65; MATLAB 2017b code and command-line code were used for figure plotting and during analysis. For the network-level analysis, such as the number of connections per gene and the number of pathways each gene belongs to, the code was written, and the results were plotted, in MATLAB^64,65. For Venn diagram plotting, a freely available web server (http://bioinformatics.psb.ugent.be/webtools/Venn/) was used^72,73,74.
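One of the network-level quantities mentioned above, the number of pathways each DEG belongs to, amounts to inverting a pathway-to-genes mapping. A minimal sketch in Python follows (the authors used MATLAB); the function name is hypothetical and the pathway and gene names are placeholders, not KEGG annotations.

```python
from collections import defaultdict

def pathways_per_gene(pathway_to_genes, degs):
    """Map each DEG to the sorted list of pathways that contain it."""
    membership = defaultdict(set)
    for pathway, genes in pathway_to_genes.items():
        for gene in genes:
            if gene in degs:
                membership[gene].add(pathway)
    return {gene: sorted(paths) for gene, paths in membership.items()}

# Placeholder annotations for illustration only.
toy_pathways = {
    "pathway_A": ["GENE1", "GENE2"],
    "pathway_B": ["GENE2", "GENE3"],
}
toy_membership = pathways_per_gene(toy_pathways, {"GENE2", "GENE3"})
print({gene: len(paths) for gene, paths in toy_membership.items()})
```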

Data availability

We used publicly available datasets (the main data source), which are freely accessible and are cited in the Methods section with the appropriate references. The analysis details are provided in the supplementary data.

References

Cairns, P. Renal cell carcinoma. Cancer Biomark 9, 461–473 (2010).
Hsieh, J. J. et al. Renal cell carcinoma. Nat. Rev. Dis. Primers. 3, 1–19 (2017).
Swanton, C. Cancer evolution: The final frontier of precision medicine?. Ann. Oncol. 25, 549–551 (2014).
Hiley, C., de Bruin, E. C., McGranahan, N. & Swanton, C. Deciphering intratumor heterogeneity and temporal acquisition of driver events to refine precision medicine. Genome Biol. 15, 453 (2014).
Werner, H. M. J., Mills, G. B. & Ram, P. T. Cancer systems biology: a peek into the future of patient care?. Nat. Rev. Clin. Oncol 11, 167–176 (2014).
Wang, E. Understanding genomic alterations in cancer genomes using an integrative network approach. Cancer Lett. 340, 261–269 (2013).
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: The next generation. Cell 144, 646–674 (2011).
Wang, Y. et al. Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet 365, 671–679 (2005).
Gaulton, K. J. et al. Genetic fine mapping and genomic annotation. Nat. Genet. 47, 1415–1425 (2015).
Hornberg, J. J., Bruggeman, F. J., Westerhoff, H. V. & Lankelma, J. Cancer: A systems biology disease. Biosystems 83, 81–90 (2006).
Yuan, Y. et al. Assessing the clinical utility of cancer genomic and proteomic data across tumor types. Nat. Biotechnol. 1–11 (2014). https://doi.org/10.1038/nbt.2940
Li, B. & Li, J. Z. A general framework for analyzing tumor subclonality using SNP array and DNA sequencing data. Genome Biol. 15, 473 (2014). https://doi.org/10.1186/s13059-014-0473-4
Rybak, A. P., Bristow, R. G. & Kapoor, A. Prostate cancer stem cells: deciphering the origins and pathways involved in prostate tumorigenesis and aggression. Oncotarget 6, 1900–1919 (2015).
Lapointe, J. et al. Gene expression profiling identifies clinically relevant subtypes of prostate cancer. Proc. Natl. Acad. Sci. U.S.A. 101, 811–816 (2004).
Roth, R. B. et al. Gene expression analyses reveal molecular relationships among 20 regions of the human CNS. Neurogenetics 7, 67–80 (2006).
Ko, J.-H. et al. Expression profiling of ion channel genes predicts clinical outcome in breast cancer. Mol. Cancer 12, 106 (2013).
Aparicio, S. & Mardis, E. Tumor heterogeneity: next-generation sequencing enhances the view from the pathologist's microscope. Genome Biol. 15, 463 (2014). https://doi.org/10.1186/s13059-014-0463-6
Navin, N. E. Tumor evolution in response to chemotherapy: Phenotype versus genotype. Cell Rep. 6, 417–419 (2014).
Strandmann, von, E. P., Reinartz, S., Wager, U. & Müller, R. Tumor-host cell interactions in ovarian cancer: Pathways to therapy failure. Trends Cancer 3, 137–148 (2017).
Yap, T. A., Swanton, C. & de Bono, J. S. Personalization of prostate cancer prevention and therapy: Are clinically qualified biomarkers in the horizon?. EPMA J. 3, 3 (2012).
Jia, Z. et al. Diagnosis of prostate cancer using differentially expressed genes in stroma. Can. Res. 71, 2476–2487 (2011).
Golub, T. R. Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999).
Vogelstein, B. & Kinzler, K. W. Cancer genes and the pathways they control. Nat. Med. 10, 789–799 (2004).
Murai, M. & Oya, M. Renal cell carcinoma: Etiology, incidence and epidemiology. Curr. Opin. Urol. 14, 229–233 (2004).
Terris, M., Klaassen, Z. & Kabaria, R. Renal cell carcinoma: Links and risks. IJNRD 45. https://doi.org/10.2147/IJNRD.S75916 (2016).
Tiwari, P., Kumar, L., Singh, G., Seth, A. & Thulkar, S. Renal cell cancer: Clinicopathological profile and survival outcomes. Indian J. Med. Paediatr. Oncol. 39, 23 (2018).
Navai, N. & Wood, C. G. Environmental and modifiable risk factors in renal cell carcinoma. Urol. oncol. 30, 220–224 (2012).
Maruschke, M. et al. Expression profiling of metastatic renal cell carcinoma using gene set enrichment analysis. Int. J. Urol. 21, 46–51 (2013).
Gerlinger, M. et al. Genomic architecture and evolution of clear cell renal cell carcinomas defined by multiregion sequencing. Nat. Genet. 46, 225–233 (2014).
Tun, H. W. et al. Pathway signature and cellular differentiation in clear cell renal cell carcinoma. PLoS ONE 5, e10696 (2010).
Cheung, K. J. & Ewald, A. J. A collective route to metastasis: Seeding by tumor cell clusters. Science 352, 167–169 (2016).
Jackson, T., Koh, G. Y. & Zheng, X. A continuous model of angiogenesis: Initiation, extension, and maturation of new blood vessels modulated by vascular endothelial growth factor, angiopoietins, platelet-derived growth factor-B, and pericytes. DCDS-B 18, 1109–1154 (2013).
Reis, P. P. et al. A gene signature in histologically normal surgical margins is predictive of oral carcinoma recurrence. BMC Cancer 11, 437–511 (2011).
Fraser, M. et al. Genomic hallmarks of localized, non-indolent prostate cancer. Nature 1–22. https://doi.org/10.1038/nature20788 (2017).
Penault-Llorca, F. & Radosevic-Robin, N. Biomarkers of residual disease after neoadjuvant therapy for breast cancer. Nat. Rev. Clin. Oncol. 1–17. https://doi.org/10.1038/nrclinonc.2016.1 (2016).
Liu, J. et al. An integrated TCGA pan-cancer clinical data resource to drive high-quality survival outcome analytics. Cell 173, 400-416.e11 (2018).
Suzuki, H. et al. Mutational landscape and clonal architecture in grade II and III gliomas. Nat. Genet. 1–14 (2015). https://doi.org/10.1038/ng.3273
Gumz, M. L. et al. Secreted frizzled-related protein 1 loss contributes to tumor phenotype of clear cell renal cell carcinoma. Clin. Cancer Res. 13, 4740–4749 (2007).
Ellrott, K. et al. Scalable open science approach for mutation calling of tumor exomes using multiple genomic pipelines. Cell Syst. 6, 271-281.e7 (2018).
Thibodeau, B. J. et al. Characterization of clear cell renal cell carcinoma by gene expression pro. Urol. Oncol. 1–9 (2015). https://doi.org/10.1016/j.urolonc.2015.11.001
Ross, D. T. et al. Systematic variation in gene expression patterns in human cancer cell lines. Nat. Genet. 24, 227–235 (2000).
Swanton, C. Intratumor heterogeneity: Evolution through space and time. Can. Res. 72, 4875–4882 (2012).
Zhang, J. et al. Intratumor heterogeneity in localized lung adenocarcinomas delineated by multiregion sequencing. Science 346, 256–259 (2014).
Sjöstedt, E. et al. An atlas of the protein-coding genes in the human, pig, and mouse brain. Science 367 (2020).
Uhlén, M. et al. A human protein atlas for normal and cancer tissues based on antibody proteomics. Mol. Cell. Proteomics 4, 1920–1932 (2005).
Uhlén, M. et al. A genome-wide transcriptomic analysis of protein-coding genes in human blood cells. Science 366, (2019).
Uhlén, M. et al. A pathology atlas of the human cancer transcriptome. Science 357, (2017).
Uhlén, M. et al. Proteomics. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Quackenbush, J. Microarray data normalization and transformation. Nat. Genet. 32, 496–501 (2002).
Simon, R. Microarray-based expression profiling and informatics. Curr. Opin. Biotechnol. 19, 26–29 (2008).
Ideker, T., Thorsson, V., Siegel, A. F. & Hood, L. E. Testing for differentially-expressed genes by maximum-likelihood analysis of microarray data. J. Comput. Biol. 7, 805–817 (2000).
Reimers, M. Making informed choices about microarray data analysis. PLoS Comput. Biol. 6, e1000786 (2010).
Chen, K.-H. et al. Gene selection for cancer identification: A decision tree model empowered by particle swarm optimization algorithm. BMC Bioinf. 15, 1–10 (2014).
Bild, A. H. et al. An integration of complementary strategies for gene-expression analysis to reveal novel therapeutic opportunities for breast cancer. Breast Cancer Res. 11, R55 (2009).
Salomonis, N. et al. GenMAPP 2: New features and resources for pathway analysis. BMC Bioinformatics 8, 217 (2007).
Girke, T. Microarray analysis. 1–42 https://faculty.ucr.edu/~tgirke/HTML_Presentations/Manuals/Microarray/arrayBasics.pdf (2011).
Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545 (2005).
Mi, H., Poudel, S., Muruganujan, A., Casagrande, J. T. & Thomas, P. D. PANTHER version 10: Expanded protein families and functions, and analysis tools. Nucleic Acids Res. 44, D336–D342 (2016).
Kanehisa, M. et al. KEGG for linking genomes to life and the environment. Nucleic Acids Res. 36, D480–D484 (2007).
Kanehisa, M., Goto, S., Furumichi, M., Tanabe, M. & Hirakawa, M. KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res. 38, D355–D360 (2009).
Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res. 40, D109–D114 (2011).
Eldakhakhny, B. M., Al Sadoun, H., Choudhry, H. & Mobashir, M. In-silico study of immune system associated genes in case of type-2 diabetes with insulin action and resistance, and/or obesity. Front. Endocrinol. 12, 1–10 (2021).
Warsi, M. K., Kamal, M. A., Baeshen, M. N., Izhari, M. A. & Mobashir, M. Comparative study of gene expression profiling unravels functions associated with pathogenesis of dengue infection. Curr. Pharmaceut. Des. 26(41), 5293–5299 https://doi.org/10.2174/1381612826666201106093148 (2020).
Kamal, M. A. et al. Gene expression profiling and clinical relevance unravel the role hypoxia and immune signaling genes and pathways in breast cancer: Role of hypoxia and immune signaling genes in breast cancer. jimsa 1, (2020).
Krishnamoorthy, P. K. P. et al. Informatics in Medicine Unlocked. Inf. Med. Unlocked 20, 100422 (2020).
Alexeyenko, A. & Sonnhammer, E. L. L. Global networks of functional coupling in eukaryotes from comprehensive data integration. Genome Res. 19, 1107–1116 (2009).
Okawa, S., Angarica, V. E., Lemischka, I., Moore, K. & del Sol, A. A differential network analysis approach for lineage specifier prediction in stem cell subpopulations. npj Syst Biol Appl 1–8 (2015). https://doi.org/10.1038/npjsba.2015.12
Shannon, P. et al. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Mobashir, M., Schraven, B. & Beyer, T. Simulated evolution of signal transduction networks. PLoS ONE 7(12), e50905. https://doi.org/10.1371/journal.pone.0050905 (2012).
Mobashir, M., Madhusudhan, T., Isermann, B., Beyer, T. & Schraven, B. Negative interactions and feedback regulations are required for transient cellular response. Sci. Rep. 4, 3718. https://doi.org/10.1038/srep03718 (2014).
Kanehisa, M. The KEGG resource for deciphering the genome. Nucleic Acids Res. 32, D277–D280 (2004).
Helmi, N., Alammari, D. & Mobashir, M. Role of potential COVID-19 immune system associated genes and the potential pathways linkage with type-2 diabetes. Comb. Chem. High Throughput Screen. https://doi.org/10.2174/1386207324666210804124416 (2021).
Bajrai, L. H. et al. Understanding the role of potential pathways and its components including hypoxia and immune system in case of oral cancer. Sci. Rep. 11(1), 19576. https://doi.org/10.1038/s41598-021-98031-7 (2021).
Bajrai, L. H. et al. Gene Expression Profiling of Early Acute Febrile Stage of Dengue Infection and Its Comparative Analysis With Streptococcus pneumoniae Infection. Front. Cell. Infect. Microbiol. 11, 707905. https://doi.org/10.3389/fcimb.2021.707905 (2021).

Acknowledgements

HIK, IMA, LHB, PKPK, MAK, and MM designed the experiment, performed the calculations, analyzed the results, and wrote the manuscript. This work was supported by the Deanship of Scientific Research (DSR) at King Abdulaziz University, Jeddah, Saudi Arabia, under grant no. 422-800.

Funding

This work was funded by the Deanship of Scientific Research (DSR) at King Abdulaziz University, Jeddah, Saudi Arabia, under grant no. 422-800. The funders had no role in the design of the study; in the collection, analysis, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Author information

Authors and Affiliations

Department of Medical Laboratory Technology, Faculty of Applied Medical Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
Hamed Ishaq Khouja & Ibraheem Mohammed Ashankyty
Special Infectious Agents Unit-BSL3, King Fahad Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia
Leena Hussein Bajrai
Biochemistry Department, Sciences College, King Abdulaziz University, Jeddah, Saudi Arabia
Leena Hussein Bajrai
Department of Biotechnology, Sri Venkateswara College of Engineering, Sriperumbudur, 602105, India
P. K. Praveen Kumar
West China School of Nursing/Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610041, Sichuan, China
Mohammad Amjad Kamal
King Fahd Medical Research Center, King Abdulaziz University, P. O. Box 80216, Jeddah, 21589, Saudi Arabia
Mohammad Amjad Kamal
Enzymoics, Novel Global Community Educational Foundation, 7 Peterlee Place, Hebersham, NSW, 2770, Australia
Mohammad Amjad Kamal
Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, Kingdom of Saudi Arabia
Ahmad Firoz
SciLifeLab, Department of Oncology and Pathology, Karolinska Institutet, Box 1031, 171 21, Stockholm, Sweden
Mohammad Mobashir

Contributions

H.I.K., I.M.A., L.H.B., P.K.P.K., M.A.K., A.F., and M.M. designed the experiment, performed the calculations, analyzed the results, and wrote the manuscript.

Corresponding authors

Correspondence to Hamed Ishaq Khouja or Mohammad Mobashir.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Supplementary Table S1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Khouja, H.I., Ashankyty, I.M., Bajrai, L.H. et al. Multi-staged gene expression profiling reveals potential genes and the critical pathways in kidney cancer. Sci Rep 12, 7240 (2022). https://doi.org/10.1038/s41598-022-11143-6

Received: 13 May 2021
Accepted: 11 October 2021
Published: 04 May 2022
DOI: https://doi.org/10.1038/s41598-022-11143-6
Springer Nature Limited
