Abstract
Background
Colon cancer with liver metastases (CCLM) characterized by genetic heterogeneity is an evolutionary process leading to variations in response to selective pressure, but the underlying evolutionary models still remains unclear.
Methods
Total of 30 samples, including primary tumor and two to four matched liver metastases from 8 treatment-naïve patients with CCLM were collected, and subjected to whole-exome DNA sequencing. PyClone was used to calculate intra and inter-tumor heterogeneity, LICHeE was used to reconstruct the cancer phylogeny trees and investigate the subclonal composition.
Results
The genetic differences were observed between primary and metastatic lesions, as well as among multiple metastases in all patients. The natural history models of colorectal cancer in each case were identified, including parallel, linear, and branching evolution. Liver metastases could originate from primary lesions or other metastases. Pathway and process enrichment analysis also showed obvious heterogeneity and enhancement of several molecular functions.
Conclusions
Our data reveal the genetic and heterogeneity between primary and metastatic lesions, as well as among multiple metastases and provide genomic evidence for clonal heterogeneity for CCLM.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Despite incessant improvement of surgery, radiotherapy and chemotherapy, colon cancer (CC) still is one of the most common causes of fatalities in both men and women worldwide [1]. Distant metastasis is a lethal consequence of cancer progression in CC patients, liver is the main metastatic site. Recent investigations have highlighted CC with liver metastasis (CCLM) is a genetic evolutionary process characterized by genetic heterogeneity, leading to variations in response to selective pressure and treatment outcome [2,3,4,5].
It is widely appreciated that the majority of CRC develop as a consequence of the progressive accumulation of genetic alterations in evolving clones of tumor cells, which also plays an essential role in the metastatic process of CRC [6]. Thus, modeled mCRC evolution may provide the evolution-based categorization of genomic alterations and the inferred mutational landscape. The natural history of tumor progression follows four evolutionary pathways: parallel evolution displaying similar genotypic and phenotypic modifications under similar environments, sequentially acquired driver mutations over time called linear evolution, genetically distinct with many standing lineages supporting the branch evolution, mutations in genes fortuitously relapses undergoing neutral evolution [7, 8]. In addition, studies find that the cellular heterogeneity and the constant evolution of CRC complicates the design of effective treatment regimens [9]. The influence of genetic heterogeneity and evolution on the response to neoadjuvant treatment has been analyzed, the results revealed substantial changes in subclones, and selective modification and subclonal populations were enriched after treatment [10].
The evolutionary process of tumor generates subpopulation cells termed subclones, which share subsets of mutations, their divergence result in genetic and molecular heterogeneity [2, 3]. Due to selective pressure, mutationally distinct subclones were selected by advantage; each of those subclones possesses a characteristic clinical property (e.g., drug sensitivity, growth rate, or metastatic potential) [11]. For CCLM, a range of subclones are competing and positively selected according to their ability to metastasize and adapt to the liver environment. Thus, subclonal cluster analysis may shed light on the characteristics of molecular biology of CCLM [2, 12,13,14]. Hence, understanding the clonal composition and phylogenetic tree may help us understand the biology and evolution of CCLM, as well as guide the design of combinatorial therapies. Nevertheless, the heterogeneity and evolution of CCLM remains unknown.
In the present study, we retrospectively collected matched tissues of primary and multiple liver metastases from treatment-naïve patients with CCLM. Subsequently, we performed genomic profiling of 30 samples (i.e., eight primary tumors and 22 CCLM samples) from eight patients. The objective of this study was to describe the genetic heterogeneity, reconstruct the cancer evolution history and analyze the function of subclones.
Materials and methods
Patients
Thirty samples including matched 8 primary and 22 CCLM samples were collected from 8 CC patients admitted to Shandong Cancer Hospital affiliated to Shandong First Medical University and Shandong Academy of Medical Sciences from January 2013 to November 2019. The CCLM patients were diagnosed according to the tumor specimens’ histological examination and the 8th Edition of the AJCC Staging System. All patients didn’t receive any anti-tumor treatment before sampling. This study was approved by Ethics Committee of Shandong Cancer Hospital affiliated to Shandong First Medical University and Shandong Academy of Medical Sciences. Informed consent was obtained from all individuals.
DNA extraction
Genomic DNA was extracted from tumor samples according to the following procedures. A 4-μm section of a hematoxylin and eosin–stained slide of a FFPE sample underwent a pathologist review to ensure each sample at least had the area of 1cm2, nucleated cellularity of 80% and tumor content of 20%. Ten unstained FFPE sections (total 40 μm) were used, generating 0.5–2 μg of DNA using the QIAGEN QIAamp DNA Mini Kit (QIAGEN) according to the manufacturer's instructions.
Whole-exome sequencing
WES libraries were constructed using KAPA hyper prep kit and 50–500 ng of double stranded DNA was fragmented to ~ 250 bp by sonication. Subsequent library construction was done using KAPA Hyper Prep Kit (KAPA Biosystems) for end repair, d A addition and adapter ligation was performed, followed by PCR amplification and Qubit™ dsDNA HS Assay quantification (Thermo Fisher Scientific). Samples yielding < 40 ng of extracted DNA or < 500 ng of pre-capture library were excluded for further sequencing. The SureSelect Human All Exon V6 from Agilent was used to target all exons. Hybridization capture followed the manufacture’s protocol Hybridization capture of DNA libraries using xGen® Lockdown® Probes and Reagents (Integrated DNA Technologies, Ver.4). Post-capture libraries were mixed together, denatured and diluted to 1.01 ~ 1.1 nM and subsequently sequenced on Novaseq 6000 with paired-end reads 2 × 150 bp by following the manufacturer’s protocols. (Illumina, Inc.)
Data analysis
WES sequencing reads after exclusion of low-quality reads were mapped to the UCSC hg19 reference sequence with BWA (version 0.7.15, http://bio-bwa.sourceforge.net/). PCR duplicates were removed by Picard (version 2.0.1, http://broadinstitute.github.io/picard/), and recalibrated by the Base Recalibrator tool from GATK (version 4.0.6.0, https://software.broadinstitute.org/gatk/). Somatic variants were detected using Mutect (version 2) on exome data of tumor. Annotation of variants was performed by Annovar (version 2017/07/17) [15], on Refseq gene models (version 2017/06/01). Known germline variants were filtered from dbSNP (version 147), database of the 1000 Genomes, NHLBI Exome Sequencing Project (ESP6500), Exome Aggregation Consortium (EXAC), and Genome Aggregation Database (gnomAD). A stringent downstream filter comprised of the following criteria was used to obtain high quality somatic variants: a minimum of 30X coverage; Variant Allele Fraction (VAF) > = 1% and at least 5 variant supporting reads in the tumor sample, strand bias < = 0.95; after that, mutations in the non-coding regions (3’UTR, 5’UTR, Intron, gene intergenic etc.) were removed.
For each tumor, SCNAs were inferred by CNVkit (version 0.9.5, https://cnvkit.readthedocs.io/en/stable/pipeline.html) using Circular Binary Segmentation algorithm with default parameters. Segment-level ratios were calculated and log2 transformed. A log2 ratio cutoff of ± 0.8 was used to define SCNA amplification and deletion [16]. As there’s no matched normal could be obtained, we used pooled normal as control to detect CNV [17]. Gene rearrangements different chromosomes and long indels located in the same chromosome were analyzed by pipelines published in 2019 by OrigiMed lnc. [18]. Mutation landscape was visualized by R package maftools. TMB was defined as the number of somatic mutations (including base substitutions and indels) in the coding region. To reduce sampling noise, synonymous alterations were also counted [19]. To calculate the TMB, the total number of mutations counted was divided by the size of the coding sequence region of the Agilent SureSelect Human All Exon V6. HLA-I genotyping was assessed by WES and patients were considered fully heterozygous at HLA-I if they had six different HLA-I alleles [20]. To uncover the immune-associated gene profile of primary and metastases, all immunologically relevant genes were extracted from the ImmPort (https://www.ImmPort.org/).
Heterogeneity analysis was performed with PyClone (version 0.13.0, default parameter) [21]. For inputs to PyClone, reference, variant read depths and copy number at each locus was taken from the output of previous analysis. PyClone was run with 10,000 iterations and a burn-in of 1000, as suggested by the authors.
Reconstruction of clonal evolution
For construction of the phylogenetic tree, multi-sample tumor phylogenies and tumor subclone decomposition were analyzed using LICHeE (Lineage Inference for Cancer Heterogeneity and Evolution) [22]. LICHeE is a method that use SSNVs from multiple samples of individual cancer patient to reconstruct cancer cell lineages and the evolutionary network to illustrate evolutionary timing relationships between each cluster pair, where each node corresponds to an SSNV cluster (the root represents the germ line) and each edge between two nodes, denotes that parent node could be an evolutionary predecessor of child node (SSNVs in parent cluster could have ‘happened before’ SSNVs in child cluster). The LICHeE algorithm was implemented in Java. It is the open source and freely available online (LICHeE Github Repository. http://viq854.github.io/lichee/).
Pathway analysis and statistical analysis
The pathway/ontology enrichment analysis of mutations genes was performed using the Metascape (http://metascape.org). The statistical tests were conducted at a two-sided level of significance of 0.05 in R environment. Immunohistochemical analysis for Ki67 and cleaved-caspase-3 was performed using paraffin sections. Immunohistochemical staining was carried out for cleaved-caspase-3 (1:200; Servicebio, GB11009-1) and Ki67 (1:600; Servicebio, GB121141).
Results
Workflow
In current study, we sequenced 30 unique bulk tumor samples (matched 8 primary tumor samples and 22 liver metastatic samples) from 8 patients with CCLM, the primary and metastatic tumors were resected at the same period and no patient received chemoradiotherapy pre-operation. All tumors were microsatellite stable, and other characteristics information about the patients and samples are provided in Table 1. The collected samples were sequenced using whole-exome sequencing, and the workflow was illustrated in Fig. 1: firstly, samples from each of the eight patients were shown (Fig. 1a), and the specific locations of the primary and metastasis loci were shown in Table 1. Next, the HE stain of the primary and multiple metastases were underwent review, followed by next-generation whole-exome DNA sequencing, and the CC08 HE stains were depicted in Fig. 1b as an example. Finally, a series of data analyses were completed, including somatic mutations illustration; intra- and inter-tumor heterogeneity illustration by PyClone, evolution mode reconstruction as well as biological function assay.
Diversity between primary and metastases in treatment-naive CCLM
An individual mutation profile was generated for each patient to present the genetic homogeneity and heterogeneity between primary and metastatic lesions (supplement materials S1). As shown in Fig. 2a and supplement materials S1, “Private” and “Shared” stood for heterogeneity, whereas “Ubiquitous” stood for homogeneity, evidently, mutations differed significantly between primary lesions and metastases, as well as among multiple metastases in all eight patients.
Moreover, the mutation information of the top 50 genes with the highest frequency, as well as the clinically important genes including KRAS, NRAS, BRAF, and PIK3CA were summarized and illuminated in Fig. 2b., to further illuminate the characteristics of mutated genes in each patient. The mutation frequency of PIK3CA was 43% (cases CC03, CC07, and CC08), two types of mutations were observed in which H1047R was dominant; whereas KRAS mutation was detected in only two patients (i.e., missense mutation in CC01 and multi-hit mutation in CC08). Notably, the mutation of either PIK3CA or KRAS in the primary lesion did not variate but was missing in metastatic foci (Table S1). Unexpectedly, no mutations of NRAS and BRAF were detected in all of the enrolled samples, which might attribute to the small patient sample size.
The type of alterations exhibited diversity between primary lesions and multiple metastases as evidence from the landscape. For instance, OBSCN mutations in case CC03, were multi-hit in primary but nonsense in metastases. The difference of tumor mutational burden (TMB) between samples is obvious in CC01, CC03, CC04, CC05 and CC06, the metastases had a higher TMB, but no significant difference was observed between primary and metastasis (Fig. S1a). Additionally, heterozygous HLA-I genotypes were detected in all of patients (30/30, 100%). Next, all immunologically relevant genes were extracted from the ImmPort. The mutations were picked out based on the immune-associated genes reflecting different statuses in the immune context of microenvironment (Fig. S2).
Finally, the mutation gene loci were diagramed for the top 10 most frequently mutated genes and the clinically important genes of KRAS and PIK3CA, which were depicted by lollipops (Fig. 2c and Fig. S1b). Subsequently, we found different mutation loci between the primary and metastases in NEB, OBSCN and B4GALNT4. These results indicate the intricate heterogeneity between primary and metastases, even when they shared the same mutation genes, the mutation form or loci varied greatly.
Intra- and inter- tumor heterogeneity in treatment-naive CCLM
Intra-tumor and inter-tumor heterogeneity can act as a substrate for genetic diversity and tumor evolution. Pyclone, a software to predict subclone numbers and infer clonal structures, was employed to evaluate the heterogeneity of CCLM. Nine clusters were identified across all samples in case CC01, and different cluster included diverse mutations, for example, cluster 0 obtained 102 somatic mutations, and cluster 1 included another 81 mutation genes. The cellular prevalence displayed differently across the primary and matched multiple metastases, the variation trend was obvious in cluster 1 (mean 0.260–0.536), but seemed unchanged across samples in cluster 3 (mean 0.318–0.320) (Fig. 3a, b). Consistently, the similar phenomenon was also observed in the other seven patients (Fig. S3 a-b), indicating the intra- and inter-tumor heterogeneity.
Next, a circus plot was applied to demonstrate the relationship between the identified mutation genes in all clusters. As shown in Fig. 3c, although patients shared fewer genes with each other (purple links), they exhibited more functional overlaps (blue links). This finding implied that mutations might be involved in different biological processes. In addition, we subsequently conducted gene set enrichment analysis of biological functions for the inferred clusters. Accordingly, two cohorts were hierarchically clustered based on Kappa-statistical similarities among their gene memberships (Fig. 3d). We hypothesized that these two cohorts may be linked to different prognoses. Further study involving more cases is required to confirm our findings.
Three different phylogenetic evolution model of CCLM
We also sought to elucidate the evolutionary process of CCLM. Therefore, the Lineage Inference for Cancer Heterogeneity and Evolution (LICHeE) method was used to analyze the evolutionary model of eight patients. Three different phylogenetic models were identified (i.e., parallel, linear, and branching evolution) (Fig. 4). As shown in Fig. 4a, sibling clades deriving from a single ancestral genotype were found in two cases (i.e., CC04 and CC06), indicating parallel evolution in primary lesions and different metastases. For cases CC04 and CC06, there were no novel mutations detected in the liver metastases; all mutations evolved from the primary lesions. Moreover, a linear evolution model was noted in case CC03. Both liver metastases lost 36 mutations compared with the primary lesion, whereas one metastasis gained 45 novel mutations (Fig. 4b left). According to the LICHeE algorithm, we concluded that L1 first metastasized from the primary lesion, followed by L2 (Fig. 4b right).
Another model revealed mixed lineages evolution in the rest cases, displaying diverse subclones in primary and metastases (Fig. 4c). Compared to primary sites, all the metastatic sites developed novel mutations to some extent, these mutations were formed during tumor progression, some of which acquired metastatic potential. For example, in case CC01, mutations were grouped into 5 subclones, in which subclone with 33 mutations was found in both primary and metastatic sites. During tumor progression, L2 and L3 first moved to liver after primary site developed subclone with 94 mutations, followed by developing shared 31 mutations respectively, while primary site developed other 36 mutations, and L1 transferred to liver.
Based on the same procedure, we speculated the evolutionary process from the phylogenetic trees in case CC02 (Fig. 4c). L2 was derived from L1 from subclone7, and evolved into a unique subclone 6 at the liver metastases, which is also consistent with a metastasis-seeding metastasis model. For cases CC05 and CC07, we investigated two possible scenarios according to the phylogenetic trees. For example, in case CC05, the L2 was derived from subclone 8, which could be acquired from the primary lesion or the L3 metastasis. There were four possibilities in case CC08: L1 and L3 could emerge from the primary lesion or the L2 metastasis in subclone 1.
Overall, these data indicated that the ancestral genotype may not be present in all samples. The proportions of subclones also exhibited diversity, and the dominant genotype was also distinguishable between samples. Thus, the evolutionary models of CCLM were complex and the subclones in primary lesions and multiple metastases were diverse, leading to tumor heterogeneity in a single patient.
The different functional effects of subclones
Cancer is a disease of clonal evolution. The selected subclone might lead to cell proliferation, invasion, and metastasis. The subclone in a metastasis might possess the ability to induce different features compared with the genotype of the primary lesion. Hence, pathway and process enrichment analysis was carried out for six patient with different subclones in primary and metastatic lesions using Metascape (Fig. 5a) [23]. As illustrated in the heatmap, several enriched pathways were frequently observed in primary lesions and metastases from all eight patients. These included ATP-dependent activity, calcium ion binding, actin binding, and cell adhesion molecule binding; the latter was observed at almost all sites in these eight patients. In addition, immunohistochemically (IHC) analysis of cell proliferation and apoptosis was performed to further examine biological differences between primary and metastatic lesions in the same case (Fig. 5b, Table 2, Fig. S3, S4). According to IHC staining for cleaved-caspase-3, similar results were obtained for the primary lesions and metastases. However, in case CC03, the staining intensity for the primary lesion was weak and the positive area was 50%. For CC03.L1, the staining intensity was weak and the positive area was 80%. For CC03.L2, the staining intensity was moderate and the positive area was > 90%. Compared with CASP3, Ki-67 exhibit broader heterogeneity in cases CC01, CC05, CC08, CC02, CC03, and CC07.
Discussion
CCLM is a heterogeneous disease with similar phenotype, but different genotypes, prognoses, and responses to chemotherapy. Current tumor treatment strategies are based on the linear evolution model, so if evolution models were diverse, then the treatment strategy should subsequently alter. Nevertheless, few studies have investigated the evolutionary processes responsible for these phenomena. Based on our data, tumor heterogeneity from matched primary lesions and multiple liver metastases of treatment-naïve patients was assessed, all samples displayed intra- and inter-tumor heterogeneity considerably, and the cancer phylogeny tree was also inferred, which implied different evolution models and metastatic route from primary tumor to liver in accord with primary-seeding or metastasis-seeding metastasis models.
It is crucial to understand the driver mutation genes relationship between primary lesions and metastases. Nevertheless, the overlap appears to be more ubiquitous than previously thought, especially in the mostly frequently mutation genes. As demonstrated in Fig. 2b, there were a large number of overlaps in driver gene mutations between primary and metastatic lesions, indicating the top 50 genes in Fig. 2b were mainly homogeneity. This is because, gene mutation empowers the selective advantages of tumor cell, constituting to the regulation of the major cellular functions including growth, death and genome stability and driving tumorigenesis. These mutations with the highest frequency can be inherited to the metastatic lesions without loss throughout the evolution, leading to the overlap in the Fig. 2b. We called these overlapped mutation as “trunk mutation” or “base mutation”, contributing to the genetic homogeneity. Moreover, novel mutations might also generate during or after metastasis, which we called “branch mutation”, and empowered heterogeneity between primary and metastatic lesions. The branch mutations usually have a lower mutation frequency since they occur much later than the trunk mutation, but they determine the evolutionary mode and genetic feature of CCLM. However, other situations should be also taken into consideration. Multiple subclones may exist in the primary lesion, which attribute to multiple clonal origins or new subclone generation during tumor growth. Different metastases might originate from different subclones of primary lesion. For example, in case 1, the top 50 mutations in primary lesion were totally inherited to CC01L2 and CC01L3, whereas a portion of them were missing in CC01L1, implying alternative evolutionary branch in CC01L1. Previous study has also proved that the genetic divergence characterizes the mode of evolution across diverse solid tumor types [24]. It is feasible to distinguish tumors driven by strong positive subclonal selection from those evolving neutrally or under weak selection, revealing different modes of evolution both within and between solid tumor types. This quite differed from the current study. They developed a classifier by simulating spatial tumor growth under different evolutionary modes, whereas the current genetic model was derived from tumor heterogeneity in matched primary lesions and multiple liver metastases.
We suggest that the biological differences between primary and metastatic lesions in a single case may be the result of the accumulation of genetic mutations, which are carried by different subclones in the evolutionary process. In our study, the biological differences between primary and metastatic lesions were indeed observed in some cases such as in case 3, which possessed the linear evolution mode. Nevertheless, these biological differences seemed not obvious in most cases. We believe that "clonal evolution" was confined to the genetic variation in metastatic foci but not to the biological feature. As described above, trunk mutations regulate the major cellular functions including growth, death and genome stability in both primary and metastatic lesion, Nevertheless, the branch mutations empower the heterogeneity, as well determine the evolutionary mode and genetic feature of CCLM, which may help explain why the biological differences were not such obvious.
Currently, drug therapies for CCLM have limited efficacy, thus, targeting the most frequently enriched pathways in metastatic subclones introduced by evolution may be possible to improve the efficacy. Several studies have demonstrated the evolution models in the development of cancer, but few studies have been performed using treatment-naïve samples of CCLM, due to the difficulty in collecting samples from patients. In this case, our research attempted to examine several aspects of treatment-naïve multiple synchronous CCLM, including the functional effects of subclones. The cell adhesion molecule binding was frequently observed in subclones from all samples in our data (Fig. 5), moreover, studies have shown that monoclonal antibodies involved in cell-to-cell regulation of adhesion have been extensively studied as a promising treatment for gastrointestinal cancer [25, 26], as such, it may be a potential treatment for CCLM, but its causal role should be clarified in future investigations. Moreover, mismatch repair deficient and TMB are the crucial features for cancer because of the immune checkpoint inhibitors treatment [27, 28]. Therefore, we analyzed the immunologic microenvironments related genes, MMR, TMB and HLA-I homozygosity, which may also be relevant to immune checkpoint blockade [20, 29, 30]. In our study, the MMR and HLA-I status displayed fully coherence, the immunologic microenvironments related genes were not very clearly distinguishable, whereas TMB exhibited heterogeneity in primary lesions and metastases exhibited. Hence, it is preferable to develop a more standardized approach for the treatment of CCLM patients with immune checkpoint inhibitors.
Several limitations of the present study should be acknowledged. Firstly, a major limitation of this study is the lack of matched non-tumor tissue from each patient. Nevertheless, in the absence of control, Mutect2 can use a single sample as input; also, a pooled standard sample can be used as normal. Mutect, a method developed by Broad Institute (Cambridge, MA, USA) for the reliable and accurate identification of somatic point mutations using NGS data, has been applied successively identify the sites carrying somatic mutations with high confidence. Secondly, we performed only whole-exome sequencing without RNA sequencing to investigate the evolution of the tumor immune microenvironment.
Collectively, our research demonstrated the genetic and functional similarities and differences between primary and metastatic lesions, as well as among multiple metastases, based on which three natural history models of CCLM were identified including parallel, linear, and branching, revealing liver metastasis originated not only from the primary but also from another metastatic lesion. Our results provide the genomic evidence for the metastatic heterogeneity and evolution of CCLM, which should be considered for its therapeutic decision making in future.
References
Bray F, Ferlay J, Soerjomataram I, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394–424.
Teng S, Li YE, Yang M, et al. Tissue-specific transcription reprogramming promotes liver metastasis of colorectal cancer. Cell Res. 2020;30:34–49.
Dang HX, Krasnick BA, White BS, et al. The clonal evolution of metastatic colorectal cancer. Sci Adv. 2020; 6 eaay9691.
Greaves M, Maley CC. Clonal evolution in cancer. Nature. 2012;481:306–13.
Merlo LM, Pepper JW, Reid BJ, et al. Cancer as an evolutionary and ecological process. Nat Rev Cancer. 2006;6:924–35.
Grady WM, Carethers JM. Genomic and epigenetic instability in colorectal cancer pathogenesis. Gastroenterology. 2008;135:1079–99.
Fearon ER, Vogelstein B. A genetic model for colorectal tumorigenesis. Cell. 1990;61:759–67.
Gerlinger M, Rowan AJ, Horswell S, et al. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. N Engl J Med. 2012;366:883–92.
Bozic I, Reiter JG, Allen B, et al. Evolutionary dynamics of cancer in response to targeted combination therapy. Elife. 2013;2: e00747.
Yang J, Lin Y, Huang Y, et al. Genome landscapes of rectal cancer before and after preoperative chemoradiotherapy. Theranostics. 2019;9:6856–66.
Fan J, Lee HO, Lee S, et al. Linking transcriptional and genetic tumor heterogeneity through allele analysis of single-cell RNA-seq data. Genome Res. 2018;28:1217–27.
Seretis F, Seretis C, Youssef H, et al. Colorectal cancer: seed and soil hypothesis revisited. Anticancer Res. 2014;34:2087–94.
Langley RR, Fidler IJ. The seed and soil hypothesis revisited–the role of tumor-stroma interactions in metastasis to different organs. Int J Cancer. 2011;128:2527–35.
Paget S. The distribution of secondary growths in cancer of the breast. 1889. Cancer Metastasis Rev. 1989; 8:98–101.
Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38: e164.
Bambury RM, Bhatt AS, Riester M, et al. DNA copy number analysis of metastatic urothelial carcinoma with comparison to primary tumors. BMC Cancer. 2015;15:242.
Talevich E, Shain AH, Botton T, et al. CNVkit: Genome-wide copy number detection and visualization from targeted DNA sequencing. PLoS Comput Biol. 2016;12: e1004873.
Cao J, Chen L, Li H, et al. An accurate and comprehensive clinical sequencing assay for cancer targeted and immunotherapies. Oncologist. 2019;24:e1294–302.
Chalmers ZR, Connelly CF, Fabrizio D, et al. Analysis of 100,000 human cancer genomes reveals the landscape of tumor mutational burden. Genome Med. 2017;9:34.
Chowell D, Morris LGT, Grigg CM, et al. Patient HLA class I genotype influences cancer response to checkpoint blockade immunotherapy. Science. 2018;359:582–7.
Roth A, Khattra J, Yap D, et al. PyClone: statistical inference of clonal population structure in cancer. Nat Methods. 2014;11:396–8.
Popic V, Salari R, Hajirasouliha I, et al. Fast and scalable inference of multi-sample cancer lineages. Genome Biol. 2015;16:91.
Cao CH, Liu R, Lin XR, et al. LRP1B mutation is associated with tumor HPV status and promotes poor disease outcomes with a higher mutation count in HPV-related cervical carcinoma and head & neck squamous cell carcinoma. Int J Biol Sci. 2021;17:1744–56.
Sun R, Hu Z, Sottoriva A, et al. Between-region genetic divergence reflects the mode and tempo of tumor evolution. Nat Genet. 2017;49:1015–24.
Sagiv E, Arber N. The novel oncogene CD24 and its arising role in the carcinogenesis of the GI tract: from research to therapy. Expert Rev Gastroenterol Hepatol. 2008;2:125–33.
Eyvazi S, Kazemi B, Dastmalchi S, et al. Involvement of CD24 in multiple cancer related pathways makes it an interesting new target for cancer therapy. Curr Cancer Drug Targets. 2018;18:328–36.
Diaz LA Jr, Shiu KK, Kim TW, et al. Pembrolizumab versus chemotherapy for microsatellite instability-high or mismatch repair-deficient metastatic colorectal cancer (KEYNOTE-177): final analysis of a randomised, open-label, phase 3 study. Lancet Oncol. 2022;23:659–70.
Shimada Y, Okuda S, Watanabe Y, et al. Histopathological characteristics and artificial intelligence for predicting tumor mutational burden-high colorectal cancer. J Gastroenterol. 2021;56:547–59.
Chowell D, Krishna C, Pierini F, et al. Evolutionary divergence of HLA class I genotype impacts efficacy of cancer immunotherapy. Nat Med. 2019;25:1715–20.
Abed A, Calapre L, Lo J, et al. Prognostic value of HLA-I homozygosity in patients with non-small cell lung cancer treated with single agent immunotherapy. J Immunother Cancer. 2020;8: e001620.
Chen T, Chen X, Zhang S, et al. The genome sequence archive family: toward explosive data growth and diverse data types. Genomics Proteomics Bioinformatics. 2021;19:578–83.
Members C-N, Partners. Database Resources of the National Genomics Data Center, China National Center for Bioinformation in 2022. Nucleic Acids Res. 2022; 50:D27-D38.
Funding
This work was funded by Joint Funding Project of Shandong Provincial Natural Science Foundation (ZR201911040232); National Natural Science Foundation of China (81972014); National Natural Science Foundation of China (82003233); Jinan Science and Technology Plan Project (202019163); Science and Technology Project of traditional Chinese Medicine in Shandong Province (2021M013).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare that they have no conflicts of interest.
Ethical approval and consent to participate
This study was reviewed and approved by the Ethics Committee of the Shandong Cancer Hospital Affiliated to Shandong First Medical University and Shandong Academy of Medical Sciences (ID: 2020005006), with full respect to the Helsinki Declaration (1964).
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
535_2023_1989_MOESM2_ESM.jpg
Supplementary file2 Comparison of somatic mutations between primary and metastases. (a) Comparison of TMB between primary and metastases. (b) The rest eight gene and KRAS, PIK3CA lollipop mutation diagrams of primary (pointing up) and metastases (pointing down). Different color patches represent different domains, and y-axes means the number of mutations in primary and metastases, note that multiple mutation in a gene may occur from some case.(JPG 1245 KB)
535_2023_1989_MOESM3_ESM.jpg
Supplementary file3 The immune-associated gene profile of primary and metastases. The mutations were picked out based on the immune-associated genes including antigen processing and presentation, antimicrobials, BCR signaling, pathway, chemokine receptors, chemokines, cytokine receptors, cytokines, natural killer cell cytotoxicity, TCR signaling pathway. (JPG 1131 KB)
535_2023_1989_MOESM4_ESM.jpg
Supplementary file4 Intra and inter-tumor heterogeneity and subclones functional analysis. (a-b) The rest seven cases intra- and inter-tumor heterogeneity plots. The violin plots are depicted at the left, and cellular prevalence of each cluster across the different samples are at right. (JPG 1863 KB)
535_2023_1989_MOESM6_ESM.jpg
Supplementary file6 Immunohistochemical analysis showed expression of active cleaved-caspase-3 of primary and paired metastases of CC05, CC08, CC02, CC03, CC07. (JPG 2367 KB)
535_2023_1989_MOESM7_ESM.jpg
Supplementary file7 Immunohistochemical analysis showed expression of active ki-67 of primary and paired metastases of CC05, CC08, CC02, CC03, CC07. (JPG 2111 KB)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Chen, G., Zhu, W., Liu, Y. et al. The clonal heterogeneity of colon cancer with liver metastases. J Gastroenterol 58, 642–655 (2023). https://doi.org/10.1007/s00535-023-01989-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00535-023-01989-6