XPC deficiency increases risk of hematologic malignancies through mutator phenotype and characteristic mutational signature

Yurchenko, Andrey A.; Padioleau, Ismael; Matkarimov, Bakhyt T.; Soulier, Jean; Sarasin, Alain; Nikolaev, Sergey

doi:10.1038/s41467-020-19633-9

XPC deficiency increases risk of hematologic malignancies through mutator phenotype and characteristic mutational signature

Article
Open access
Published: 17 November 2020

Volume 11, article number 5834, (2020)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue

XPC deficiency increases risk of hematologic malignancies through mutator phenotype and characteristic mutational signature

Download PDF

3928 Accesses
16 Citations
7 Altmetric
Explore all metrics

Abstract

Recent studies demonstrated a dramatically increased risk of leukemia in patients with a rare genetic disorder, Xeroderma Pigmentosum group C (XP-C), characterized by constitutive deficiency of global genome nucleotide excision repair (GG-NER). The genetic mechanisms of non-skin cancers in XP-C patients remain unexplored. In this study, we analyze a unique collection of internal XP-C tumor genomes including 6 leukemias and 2 sarcomas. We observe a specific mutational pattern and an average of 25-fold increase of mutation rates in XP-C versus sporadic leukemia which we presume leads to its elevated incidence and early appearance. We describe a strong mutational asymmetry with respect to transcription and the direction of replication in XP-C tumors suggesting association of mutagenesis with bulky purine DNA lesions of probably endogenous origin. These findings suggest existence of a balance between formation and repair of bulky DNA lesions by GG-NER in human body cells which is disrupted in XP-C patients.

Genomic mutation landscape of skin cancers from DNA repair-deficient xeroderma pigmentosum patients

Article Open access 04 May 2023

Increased risk of internal tumors in DNA repair-deficient xeroderma pigmentosum patients: analysis of four international cohorts

Article Open access 04 March 2022

Early-onset gynecological tumors in DNA repair-deficient xeroderma pigmentosum group C patients: a case series

Article Open access 11 August 2023

Introduction

Xeroderma Pigmentosum (XP) is a group of rare recessive genetic disorders, which includes seven complementation groups (A−G) characterized by constitutive deficiency of Nucleotide Excision Repair (NER) pathway, and XP variant (loss of polymerase η¹). NER serves as a primary pathway for repairing various helix-distorting DNA adducts. The NER is subdivided into global genome (GG-NER) and transcription-coupled (TC-NER) sub-pathways that preferentially operate genome-wide and on the transcribed DNA strand of genes respectively. XP patients demonstrate striking tumor-prone phenotype with near 10,000-times increased risk of non-melanoma skin cancer and 2000-times risk of melanoma due to the inability of cells to efficiently repair the major UV photoproducts^2,3. XP complementation group C (XP-C) characterized by GG-NER deficiency (but with an unaffected TC-NER) is one of the most tumor susceptible subtypes of XP⁴. Moreover, it was hypothesized that XP patients may harbor 10−20 times increased risk to some types of internal tumors including leukemia, sarcomas⁵ and thyroid nodules^6,7.

Two recent studies reported a more than a thousand-fold increased risk of hematological malignancies in independent cohorts of XP-C patients^8,9, which demonstrated mainly myelodysplastic syndrome with secondary acute myeloid leukemia manifestation. The genetic mechanism of increased risk of internal tumors in XP patients is not well understood.

Experiments with animal XP-C models demonstrated high incidence of liver and lung cancer¹⁰ as well as 30-fold increase of spontaneous mutation rate in Hprt gene in T-lymphocytes of 1-year-old mice¹¹. Induction of oxidative stress has been shown to further increase the somatic mutagenesis in Xpc^−/−-deficient mice with steady accumulation with age¹². A similar tumor-prone phenotype was observed in Ddb2/Xpe-deficient mice with impaired GG-NER pathway: these animals developed broad spectrum of tumors with particularly high incidence of hematopoietic neoplasms¹³.

In this work we perform whole-genome sequencing (WGS) of a unique collection of internal tumors from XP-C patients to demonstrate that the constitutive GG-NER deficiency causes mutator phenotype rendering susceptibility to hematological malignancies. A particular genomic mutational signature explains the majority of mutations in the studied XP-C leukemias and sarcomas. Observed mutational profiles indicate that this mutational process is associated with lesions formed from purine bases. This work explores mutational patterns and their mechanisms in XP-C patients beyond cutaneous malignancies genome-wide.

Results

XP-C leukemia is characterized by mutator phenotype

We sequenced whole genomes of six myeloid leukemia, one uterine rhabdomyosarcoma and one breast sarcoma along with paired normal tissues from unrelated patients, representing XP-C, the most frequent group of XP in Northern Africa and Europe¹⁴ and created a catalog of 202,467 somatic mutations (Table 1 and Supplementary Table 1). Seven out of eight samples harbored a founder c.1643_1644 delTG mutation characteristic of the given XP-C population¹⁴ (Table 1). The patients developed internal tumors early in life, between 12 and 30 years of age (median age of tumor diagnosis—24 years). XP-C cancers contained somatic copy number aberrations (SCNAs) and mutations which are characteristic for corresponding types of sporadic malignancies: mutations in TP53 and deletions of chromosomes 5 and 7 in leukemia, biallelic loss of CDKN2A in breast cancer and highly unstable genome of rhabdomyosarcoma (Supplementary Table 1). We compared frequency of TP53 mutations and common chromosomal aberrations (5q and 7q deletions) between XP-C leukemia and adult de novo acute myeloid leukemia cohort¹⁵ (AML); and found that TP53 was mutated significantly more often in our dataset (5/6 cases and 15/200 cases for XP-C and sporadic AML respectively, P = 2.963e−05, odds ratio = 58.66, 95% CI = 6.04−2872.04; Fisher’s exact test, two-sided). Together with significantly higher proportions of 5q and 7q deletions in XP-C leukemia (P = 1.024e−06 for 5q and P = 0.002985 for 7q deletions, Fisher’s exact test, two-sided), this may indicate that the studied leukemia cases are close to TP53 mutated with complex karyotype subgroup according to Papaemmanuil et al.¹⁶.

Table 1 Description of the studied XP-C tumors.

Full size table

We identified 14.5−31.2 (mean 24.6)-fold increase in the number of somatic mutations in XP-C leukemia samples relative to the sporadic myeloid neoplasms (Mann–Whitney U test, two-sided, P = 5.8e−05) and the absence of such an effect for XP-C sarcomas (Fig. 1a). This effect was consistent for single base substitutions (SBS), small indels (ID) and double base substitutions (DBS, Fig. 1a).

**Fig. 1: Mutational load and profiles of XP-C and 190 tissue-matched sporadic cancers.**

XPC deficiency underlies characteristic mutational process

The genomic mutational profiles in XP-C tumors were similar between each other irrespectively of the tumor type (average pairwise Cosine similarity of 0.964 (from 0.886 to 0.998)) (Fig. 1b, c, Supplementary Fig. 1 and Supplementary Data 1) but were different from tissue-matched sporadic tumors (Fig. 1b, c). The distinct grouping of XP-C tumors based on mutational profiles was further confirmed in the context of 190 sporadic tissue-matched cancers by multidimension scaling analysis (Fig. 1b). The mutational patterns of indels were dominated by single nucleotide deletions of C:G and T:A bases in homopolymer stretches and dinucleotide deletions in repeats (Supplementary Fig. 1b). The dinucleotide substitutions were not overrepresented by specific classes and demonstrated a broad range of contexts (Supplementary Fig. 1c).

To better understand the mutational processes operating in XP-C cancers, we extracted mutational signatures from XP-C and sporadic tissue-matched tumors with the non-negative matrix factorization approach¹⁷ (NMF). Seven signatures were extracted from this dataset (Supplementary Fig. 2a, b) and one of them, Signature “C” explained on average 83.1% of mutations in the XP-C samples (57% in breast sarcoma, 88.9% in rhabdomyosarcoma and 84.1−88.7% in leukemia) while in sporadic tumors only small contribution (average 9.7%, range 0−34.3%) of signature “C” was observed (Fig. 2a, b and Supplementary Fig. 2c, d).

**Fig. 2: Mutational profiles of XP-C tumors in the context of known mutational signatures.**

These seven extracted signatures (A−G) together with original XP-C mutational profiles were compared with COSMIC mutational signatures¹⁸ and mutational profiles of organoids from human XPC and mouse Ercc1 knockouts¹⁹ using unsupervised clustering. This analysis revealed that the XP-C tumor mutational profiles and their NMF-derived mutational Signature “C” had the highest similarity to the COSMIC Signature 8 (cosine similarity of 0.87−0.92, and 0.86 respectively) and formed a cluster together with XPC and Ercc1 organoid knockouts (Fig. 2c and Supplementary Fig. 2e). At the same time the Signature “C” was different from Signature 8 by strong transcriptional asymmetry, increased mutations from C and decreased mutations from T (1.24- and 1.43-fold respectively) specifically in excess of VpCpT > D and NpCpT > T (where V designates A,C,T and D–A,G,T; Fig. 2a).

Mutational asymmetries in XP-C tumors

A mutational process associated with XPC deficiency is expected to demonstrate asymmetry between the transcribed and untranscribed strands of a gene²⁰ (transcriptional bias: TRB). This may be associated with excess of unrepaired bulky lesions on the untranscribed strand due to impaired GG-NER while on the transcribed strand such lesions would be effectively repaired by TC-NER²¹. Indeed, transcriptional strand bias in XP-C was strong and highly significant for all six classes of nucleotide substitutions grouped by the reference and mutated nucleotide, while in tissue-matched sporadic cancers it was weak or absent (Fig. 3a−c, e and Supplementary Fig. 3a−c). Moreover, the strongest transcriptional bias was detected in highly expressed genes of XP-C tumors, reaching 7.34-fold (Wilcoxon signed-rank test, two-sided, P = 2.91e−11) in XP-C leukemia (Fig. 3c and Supplementary Fig. 3d).

**Fig. 3: Strong transcriptional bias (TRB) is a specific feature of XP-C tumors.**

These effects could be explained by either excess of mutations from damaged pyrimidines or decrease of mutations from damaged purines on the transcribed (noncoding) strand. Both phenomena were previously described (see Haradhvala et al.²¹) and refer to transcription-coupled damage (TCD) or transcription-coupled repair (TCR). In case of TCD the increase of mutation rates in gene as compared to intergenic region should be observed (TCD in liver cancer analysis in Haradhvala et al.²¹) while in case of TCR we can expect the decrease of mutation rates in gene as compared to intergenic regions. In order to discriminate between these two possibilities, a comparison between mutation rates in intergenic and genic regions separately for purines and pyrimidines can be performed. To validate the suspected effect of TC-NER (decrease of mutations from purines on the transcribed strand), we performed two analysis. First, we compared relative signature contributions on the transcribed and untranscribed strands of genes and observed strong depletion of the predominant in XP-C leukemia Signature “C” as well as increase of typical for sporadic leukemia Signatures “A” and “E” on the transcribed strand of genes (Fig. 3d). Second, we compared mutation rates separately on transcribed and untranscribed strands of genes with proximal intergenic regions and observed a strong and significant effect compatible with the decrease of mutations from purines on the transcribed strand (average 1.64-fold, Wilcoxon signed-rank test, two-sided, P = 1.694e−13) while there was no significant difference of mutations from purines between intergenic regions and untranscribed strand (P = 0.4437; conventional mutation representation depicts decrease of mutations from pyrimidines on the untranscribed strand; Fig. 3f and Supplementary Fig. 3e). In line with that, we observed no difference between mutations from purines on untranscribed strand and intergenic regions at different replication times, while signature of repair of mutations from purines on transcribed strand was observed and was the strongest in early-replicating regions which are usually associated with active gene transcription^20,22 (Fig. 4a and Supplementary Fig. 4a). Similarly to SBS, transcriptional bias in DBS and ID indicated that the primary damage is on purine bases, specifically in CpC > ApD and single nucleotide deletion of C:G nucleotides (Fig. 3b and Supplementary Fig. 3b, c).

**Fig. 4: Genomic landscape of mutagenesis in XP-C internal tumors.**

Recent report suggested that bulky DNA lesions on the lagging strand during DNA replication are more frequently converted into mutations than on the leading strand probably due to more frequent error-prone bypass by translesion synthesis (TLS) polymerases^21,23. Indeed, we found a strong replicational bias (average 1.38-fold of all six mutational classes in XP-C leukemia, Wilcoxon signed-rank test, two-sided, P = 2.91e−11) compatible with preferential bypass of purine DNA lesions by error-prone TLS polymerases on the lagging strand (Fig. 4b and Supplementary Fig. 4c) in XPC-deficient tumors.

TLS polymerases that are recruited to bypass a bulky lesion can also insert incorrect bases opposite to undamaged nucleotides near the lesion^24,25. Indeed, in all eight XP-C tumors, we observed statistically significant excess of clustered events as compared to the random distribution (Fig. 4c and Supplementary Fig. 5). In diploid genome regions of XP-C leukemia 0.3% of SBS formed 140 short clusters with distance between mutations inferior to 16 bp and mean of 7 bp (Fig. 4c and Supplementary Fig. 5). Moreover, 6.56-fold more mutations, which occurred within a distance of 16 bp from each other, were colocalized on the same sequencing reads, indicating that clustered mutations affect the same allele and may be interconnected (Wilcoxon signed-rank test, two-sided, P = 0.031). These results are compatible with the hypothesis of the existence of bulky DNA lesions that enter the S-phase and get bypassed by error-prone translesion DNA synthesis polymerases²³ in XPC-deficient cells, while in XPC-proficient cells majority of these lesions may be repaired prior to replication in error-free manner.

Due to the absence of GG-NER we expected to observe strong difference in terms of mutation rates between transcribed and untranscribed strands, particularly in open chromatin and early-replicating regions known to be actively transcribed while we expected no difference between untranscribed strand of genes and intergenic regions in heterochromatic regions²⁰. In XP-C leukemia mutation load in regions of open chromatin was strongly depleted in early-replicating regions and regions with active histone marks (H3K27ac (2.83-fold), H3K36me3 (8.45-fold), H3K4me1 (2.72-fold)) for transcribed strands of genes (Fig. 4a, d). Similar but weaker trends were observed when only untranscribed strands of genes and intergenic regions were analyzed (Fig. 4a, d and Supplementary Fig. 4a). Mutation load was also enriched on the untranscribed strand of genes and intergenic regions with repressive histone marks (H3K27me3 (1.26- and 1.09-fold), H3K9me3 (1.28- and 1.25-fold)) and in late replicating regions associated with heterochromatin (Fig. 4a, d). The observed patterns further confirm effectiveness of TC-NER on transcribed strand of genes in euchromatic regions while prove GG-NER being dysfunctional on both intergenic regions and untranscribed strands of genes all over the genome in XP-C samples. To assess the relative mutation rates in different chromatin state regions, we compared XP-C leukemia samples and sporadic myeloid neoplasms. The analysis revealed more homogeneous mutation load across the different states in XP-C leukemia in comparison with sporadic leukemia as well as elevated mutation rates in heterochromatic regions relative to genic and regulatory elements (Supplementary Fig. 4b).

To further validate the mutational consequences of XPC deficiency, we compared the mutational landscape of cutaneous squamous cell carcinomas (cSCC) from XP-C patients and sporadic tumors²⁰. All cSCC tumors, independently of XP-C mutational status presented the typical UV-light induced signature (C > T mutations at YpC sites (where Y designates C or T), 85.6%, Supplementary Fig. 6a), which arises due to the bulky lesions on pyrimidines. However, in XP-C cSCCs there was remarkably more pronounced decrease of mutations from pyrimidines on the transcribed strand relative to untrascribed strand and intergenic regions, as well as much stronger transcriptional bias in highly expressed genes (Supplementary Fig. 6b, c). Moreover, XP-C cSCC demonstrated stronger difference than sporadic cSCC between mutation rate on the transcribed strand of genes on the one side, and untranscribed strand of genes and intergenic regions on the other (Supplementary Fig. 6b, c, d). These differences were particularly strong in transcriptionally active early-replicating regions (Supplementary Fig. 6d). In the case of XP-C internal tumors the observed patterns were similar with the only difference that the mutational profiles are compatible with mutations from purines (Figs. 3c, f and 4a).

The majority of mutations precede copy number alterations

In order to assess the timing of somatic mutations in XP-C tumors, we selected the regions of somatic copy number alterations (SCNAs) where one allele was duplicated. We quantified the number of mutations that occurred before and after SCNA²⁶ based on variant allele frequencies (n = 2307 mutations in four copy neutral LOH and four copy gains; Supplementary Table 2 and Supplementary Fig. 7). On average 75% of mutations occurred before SCNAs suggesting that they may have accumulated in progenitor cells before tumorigenesis or early in tumor development (Wilcoxon signed-rank test, two-sided, P = 0.03906; Fig. 5a). Therefore, the observed mutational burden and signature in XP-C tumor genomes may partially represent mutagenesis associated with lesion accumulation during the lifetime of normal body cells (Fig. 5b).

**Fig. 5: Accumulation of DNA lesions and mutations observed in XP-C tumors.**

Discussion

This described mutator phenotype may explain the increased risk of internal cancers in general and particularly for hematological malignancies in XP-C patients, which may be associated with relatively high rate of blood stem cell divisions²⁷. Our results are in line with recent reports in human and mice showing that attenuated NER at germinal level is associated with increased risk of lymphoma and sarcoma^28,29.

The derived XP-C cancers Signature “C” has the highest similarity to COSMIC Signature 8, which was originally extracted from sporadic tumors with the most elevated (but not usually exceeding 35%) fraction in sarcoma, medulloblastoma, lymphoma, chronic lymphocytic leukemia and breast cancer¹⁸. While in some works, it was attributed to homology-repair deficiency^30,31, recently in organoid models Signature 8 was associated with the nucleotide-excision repair deficiency¹⁹. Comparison of the mutational profiles and NMF-extracted Signature “C” from XP-C internal tumors with the mutational profiles of human XPC and mouse Ercc1 knockouts demonstrated high similarity between them highlighting the dysfunctional NER as the genetic basis of their common mutational process. Our work provides evidence that COSMIC Signature 8 is likely to result from mutagenesis associated with bulky lesions primarily repaired by NER and can be considered as a marker of attenuated NER function.

Taken together our results and previous reports demonstrate that NER deficiency in different tissue types and in in vitro models unmasks a unique mutational process of similar etiology. A broad spectrum of nucleotide substitutions and deletions in XP-C context suggests the existence of a compendium of different bulky lesions induced by one or more genotoxins in DNA of somatic cells. The studied patients were diagnosed as XP-C at early age (median: 3 years) and were well protected from environmental mutagens during their life; therefore, the observed mutagenesis could be caused by endogenous genotoxins which DNA lesions are almost fully repaired in XPC-proficient cells (Fig. 5b).

Future studies on the identification of the nature of this mutational process and its link with particular genotoxins (for example, free radicals, aldehydes, food mutagens) producing bulky lesions may result in the elaboration of preventive measures for XP patients. Except for the breast sarcoma sample from Comorian Archipelago with IVS12 mutation, our dataset mainly represents an XP-C population of the Northern African origin and single XPC mutation (delTG) urging the importance of expanding the investigation of internal tumorigenesis and underlying mutagenesis in different XP populations.

Methods

Studied samples

Patients from the study were diagnosed with Xeroderma Pigmentosum at early age (median: 3.5 years; range 1.5−9 years). Primary fibroblasts from sun-unexposed skin were used to determine the DNA repair deficiency by unscheduled DNA synthesis following UV-C irradiation³². The XP genetic defect was characterized by complementation assay using recombinant retroviruses expressing wild-type DNA repair genes³³. The absence of the XPC protein was shown by Western blots³⁴. The XPC mutation was determined by Sanger sequencing or whole-exome sequencing. Informed signed consents were obtained from patients and/or their parents in accordance with the Declaration of Helsinki and the French law. This study was approved by the French Agency of Biomedicine (Paris, France), by the Ethics Committee from the CPP of Universitary Bordeaux Hospital (Bordeaux, France) and by the Institutional Review Board of the University Institute of Hematology (IUH: Saint-Louis Hospital, Paris). For patients with leukemia (n = 6), tumoral bone marrow or peripheral blood mononucleated cells were separated on Fycoll-Hypaque. Cultured skin fibroblast cells were used as non-hematopoietic DNA controls in five out of six patients. In the additional patient, bone marrow CD34+, CD14+ and CD3+ cells were sorted with magnetic beads; CD34+ CD14+ cells represented the leukemic fraction while CD3+ T-lymphocytes, non-leukemic fraction was used as a control. DNA from solid tumors (SA002T2 and SA007T3) was extracted from FFPE blocks after examination and dissection by a pathologist. Tumor DNA was extracted from parts of FFPE containing more than 90% of tumor cells. Germline DNA was extracted from the non-tumoral part of FFPE (Supplementary Table 1).

Genome sequencing and data processing

The genomes were sequenced using BGISEQ-500 or Illumina Hiseq 2500 (SA008T6) sequencers according to the manufacturer protocols to the mean coverage after deduplication equal to 45× for tumor and 30× for normal DNA (Supplementary Table 1) using 100 bp paired-end reads. Reads were mapped using BWA-MEM (v0.7.12) software³⁵ to the GRCh37 human reference genome and then used the standard GATK best practice pipeline³⁶ to process the samples and call somatic genetic variants. PCR duplicates were removed and base quality score recalibrated using GATK³⁷ (v4.0.10.1), MarkDuplicates and BaseRecalibrator tools. Somatic SNVs and INDELs were called and filtered using GATK tools Mutect2, FilterMutectCalls and FilterByOrientationBias and annotated with oncotator³⁸ (v1.9.9.0). SCNAs calling was done with FACETS³⁹ (v 0.5.14). Quality controls of fastq and mapping were done with FASTQC⁴⁰ (v0.11.7), samtools⁴¹ (v1.9), GATK HSmetrics, mosdepth⁴² (v0.2.5) and multiqc⁴³ (v1.5). All processing steps were combined in a pipeline built with snakemake⁴⁴ (v5.4.0).

The cSCC from the work of Zheng et al.²⁰ were downloaded as SRA files from the database of Genotypes and Phenotypes (dbGaP). The dataset was processed and filtered in the same way as XP-C leukemia samples.

Filtration of somatic variants

For XP-C leukemia samples from bone marrow biopsies, we used additional filtration of the PASS variants which included requirement of at least one read on the both strands (F1R2.split (‘,’).1 > 0 && F2R1.split (‘,’).1 > 0 filters in GATK) and the variant allele frequency (VAF) minimal threshold equal to 0.05.

To avoid contamination of true variants by FFPE sequencing artifacts, we used more stringent criteria for breast sarcoma (SA007T3) and rhabdomyosarcoma (SA002T2) samples which included at least 2 and 1 reads from each strand and minimal VAF equal to 0.3 and 0.4 for breast cancer and rhabdomyosarcoma samples respectively. These thresholds were chosen empirically taking into account the high purity/ploidy of the samples (Supplementary Table 1) and VAF of FFPE artefacts which can vary between 0.01 and 0.15 ⁴⁵.

Additionally, all used VCF files were filtered based on the alignability map of human genome⁴⁶ from UCSC browser⁴⁷ (https://genome.ucsc.edu/cgi-bin/hgFileUi?db=hg19&g=wgEncodeMapability) with the length of K-mer equal to 75 bp (wgEncodeCrgMapabilityAlign75mer, mutations overlapped regions with score <1 were filtered out) and UCSC Browser blacklisted regions (Duke and DAC).

Mutational signatures analysis

To convert the VCF files into a catalog of mutational matrices, we used the MutationalPatterns software v.1.11.0 ⁴⁸. Profiling of the mutational matrices of indels and double nucleotides substitutions was performed with SigProfilerMatrixGenerator v.1.0 software⁴⁹.

For comparison with XP-C tumors we used 190 tissue-matched whole cancer genomes from the ICGC PCAWG collection⁵⁰ which included cancers from the following projects: Chronic Myeloid Disorders—UK (n = 57), Acute Myeloid Leukaemia—KR (n = 8), Breast Cancer TCGA US (n = 91), Sarcoma—TCGA US (n = 34). We used only high-quality variants and additionally filtered out mutations in low-mappability and blacklisted regions of the human genome.

To construct the multidimension scaling plot (MDS), we computed pairwise Cosine similarity distance between all pairs of the samples using MutationalPatterns package⁴⁸ and then processed the matrix of distances between the samples in prcomp() function in R.

To perform non-negative Matrix Factorization approach and extract de novo mutational signatures, we used the XP-C samples along with tissue-matched dataset of PCAWG samples (n = 190) in NMF framework realized in MutationalPatterns R package⁴⁸ with 500 initialization runs. After examination of the diagnostic plots (Supplementary Fig. 2a), we choose K = 7 (with RSS at inflation point, according to Hatchins et al.⁵¹) to extract mutational signatures (Supplementary Fig. 2b) and then assigned them to the known mutational signatures based on the Cosine similarity (Fig. 2c and Supplementary Fig. 2e). Choosing of lower (K = 4) or higher factorization rank (K = 9) did not influence significantly the extracted Signature “C” and its proportion in samples.

To quantify the contribution of the NMF-derived mutational signatures (A−G) in XP-C tumors and tissue-matched PCAWG cancers, we used the quadratic programming-based algorithm⁵² realized in SigsPack R package⁵³ (Fig. 2b). To better understand and quantify the contribution of the NMF-derived mutational signatures in XP-C dataset, we additionally used bootstrapping (n = 10,000) on substitution classes to receive the confidence intervals of each signature contribution (Supplementary Fig. 2d).

Transcriptional strand bias analysis

Transcriptional strand bias (TRB) was quantified for each sample and six mutational classes using MutationalPatterns package⁴⁸. The function computed inequality between mutations from pyrimidines (C > A,T,G; T > A,C,G) to mutations from purines (G > A,C,T; A > C,G,T) for genes located on the sense and antisense strands of DNA relative to the reference human genome. Inequality in the number of mutations from purines and pyrimidines was considered as evidence of transcriptional bias and statistical significance was assessed using Poisson test.

To compute tissue-specific TRB between genes expressed at low and high level, we used RPKM values of RNA-seq from Epigenetic Roadmap Project⁵⁴ (E028 for breast sarcoma, E050 for leukemia, E100 for rhabdomyosarcoma). For each gene mutations were separated as located on transcribed or untranscribed strands and genes were divided into bins by the level of expression (RPKMs: 0−0.1, 0.1−1, 1−10, 10−20,000 for leukemia; 0−0.1, 0.1−20,000 for breast sarcoma and rhabdomyosarcoma). The significance for each bin was assessed using Poisson test, two-sided (single samples of breast sarcoma and rhabdomyosarcoma) or Wilcoxon signed-rank test, two-sided (leukemia, n = 6) and then for visualization the number of mutations was normalized by the total length of genes in each bin.

Following the hypothesis that majority of mutations were caused by purine DNA lesions, we were able to compute strand-specific mutation densities around transcription start sites (TSSs). Transcribed and untranscribed strands of genes as well as 5′ adjacent to TSS intergenic regions were treated separately. TSSs of all annotated genes (GENECODE v30 ⁵⁵) were retrieved using BEDTools v2.29.0 ⁵⁶ and then regions located ±50 kb of TSSs were split into 1-kb intervals. The 1-kb intervals that overlapped with other intergenic or genic intervals (represented mainly by overlapped or closely located genes) were removed. This approach rendered 237 Mbp of 5′ proximal to TSS intergenic regions and 151 Mbp of genic regions.

Replication timing

We used repliseq data from 12 cell lines^57,58 to calculate consensus replication timing regions. For each 1-kb regions we calculated the standard deviation between all the cell lines and removed all regions with standard deviation higher than 15. For the rest of consistent regions across different cell lines, we calculated the mean values and used them during analysis. The genome was divided into five bins (10−25, 25−40, 40−55, 55−70, 70−85) according to the replication timing values and mutational density was calculated for each bin adjusting for the length of each region. We computed dependence of mutational density on replication timing independently for genic and intergenic regions separating mutations on transcribed strand and untranscribed strands.

Epigenetic marks and mutational density

To infer relationship between mutation density and intensity of various epigenetic marks (methylation, H3K27ac, H3K27me3, H3K36me3, H3K4me1, H3K9me3), we downloaded bigwig files of the Roadmap Epigenomics Project⁵⁴ and converted them to wig and then bed files (tissue E050). The mean intensity of each mark was calculated for 1-kb nonoverlapping windows across autosomes with BEDOPS v2.4.37 (bedmap) software⁵⁹. We used only genomic windows with high alignability (equal to 1) along at least 90% of a window. Mark intensities were normalized to 1−100 range. For each window we split mark intensities into 5 quantiles (cut2() function in R⁶⁰) and calculated the relative mutation density of each mark for intergenic regions, transcribed and untranscibed strands of genes.

The ChromHMM Expanded 18-state models of chromatin states (E050) were downloaded as bed file⁵⁴ and all the windows with the highest alignability spanning less than 90% of the window were filtered out. Then we calculated relative mutation density for each sample and chromatin state for XP-C leukemia and sporadic myeloid neoplasms.

Replicational strand bias

We used data from Okazaki-seq experiments data⁶¹ for GM06990 and HeLa cell lines to infer the regions of genome preferentially replicating as lagging or leading strand relative to the reference human genome. 1 kbp genomic regions for which values representing the direction of replication fork differed between cell lines >0.4 were removed. We calculated the ratio of the densities between mutations from pyrimidines (C, T) and purines (G, A) for each bin (−1 to 0.5, −0.5 to 0, 0 to 0.5, 0.5 to 1) of the preferential replication direction (negative values correspond to genomic regions where reference strand is replicated as lagging strand; and positive values—as leading) similar to the methodology of Seplyarskiy et al.⁶².

Clustered mutations

To evaluate the distribution of mutations across the genome for the presence of clustered mutations in our dataset, we performed Monte Carlo simulations for the intermutation distances distribution of random mutations for ranges between 2 and 10,000 bp for each studied sample. We developed a mathematical model of the Monte Carlo method for random mutations generation based on the following statements: (1) positions of mutations are random and uniformly distributed along the genome; (2) random positions are selected from the same set of genomic intervals as original somatic mutations; (3) the number and nucleotide context spectrum of randomly generated mutations exactly matches somatic mutations in the corresponding sample. As follows, our simulations are based on the discrete homogeneous Poisson point process. The Monte Carlo simulations were performed using Java programming language, discrete random positions were generated with standard Java Random class (Supplementary code). Data analysis was carried out with MathWorks MATLAB. We randomly assigned mutations giving their trinucleotide (3 bp) contexts and repeated the procedure 30,000 times for each sample (Supplementary Fig. 5).

To compute statistics for the distances between neighbors for randomly placed mutations within mappability sections for chromosomes and whole genome, we used the following algorithm:

1: input: G ► mappable sections of genome

2: input: S ► desired statistics of nucleotide contexts

3: input: N ► total number of simulations

4: input: D ► maximal allowed distance between mutations

5: output: M ← {Ø} ► empty set for randomly generated mutations

6: output: O ← {Ø} ► empty set for distance statistics

7: repeat N times

8: while size of M is less than size of S

9: select random position p inside G

10: determine nucleotide context x for p

11: if count of x in M is less than in S

12: append p to M

13: end if

14: end while

15: sort M

16: for every position p in M except last

17: compute distance d between p and next position in M

18: if d < = D

19: append d into O

20: end if

21: end for

22: output M

23: output O

24: end repeat

We next verified that random mutations at small distances produced by random generations followed the Poisson distribution. Then, the means for simulated distributions were compared with the observed intermutation distances for XP-C leukemia samples (n = 6) using Wilcoxon signed-rank, two-sided test in 5 bp overlapping (1 bp step) windows to define the length of clusters (for 2−10,000 bp intervals). Resulted P values were corrected with Bonferroni approach. Significant enrichment of clustered mutations at short distances remained when simulations were performed without taking into account the context of mutations or in 5-bp context of mutations; or when only euploid parts of the genomes were taken into account. Four exomes of XP-C samples were independently sequenced on Illumina Hiseq 2500 with ~100× sequence coverage. Out of six clusters that overlapped exonic regions all six were validated. Additionally, we assessed the number of mutations located on the same read or different reads for clusters up to 16 bp located in diploid genomic regions.

Relative number of mutations before and after SCNAs

To infer the relative number of mutations that occurred before and after SCNA, we followed the previously described methodology²⁶ and identified SCNA of two classes in our dataset: copy gain or cnLOH (Supplementary Table 2). In these SCNA regions taking into account tumor purity and ploidy of the regions, we determined the conservative variant allele frequency (VAF) thresholds to separate the mutations that occurred before and after SCNA given their VAF. The number of mutations was then normalized per haploid copy of a genomic segment.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Experimental data generated in this study have been deposited to the European Genome-Phenome Archive (EGA), the accession number is EGAS00001004511. The PCAWG data referenced in the study (consensus VCF files with SNVs and INDELs) are available in a public repository from the https://dcc.icgc.org/repositories website. Genomic dataset of cSCC used in this study is available in the dbGaP database under accession code phs000830.v1.p1. All the other data supporting the findings of this study are available within the article and its supplementary information files and from the corresponding author upon reasonable request. A reporting summary for this article is available as a Supplementary Information file.

Code availability

All software used is published and/or in the public domain. Custom Java code for the clustered mutation analysis is available as the Supplementary code.

References

Lehmann, A. R., McGibbon, D. & Stefanini, M. Xeroderma pigmentosum. Orphanet J. Rare Dis. 6, 1–6 (2011).
Article Google Scholar
Kraemer, K. H., Lee, M. M., Andrews, A. D. & Lambert, W. C. The role of sunlight and DNA repair in melanoma and nonmelanoma skin cancer: the Xeroderma Pigmentosum paradigm. Arch. Dermatol. https://doi.org/10.1001/archderm.1994.01690080084012 (1994).
Bradford, P. T. et al. Cancer and neurologic degeneration in xeroderma pigmentosum: long term follow-up characterises the role of DNA repair. J. Med. Genet. https://doi.org/10.1136/jmg.2010.083022 (2011).
Sethi, M. et al. Patients with xeroderma pigmentosum complementation groups C, e and v do not have abnormal sunburn reactions. Br. J. Dermatol. 169, 1279–1287 (2013).
Article CAS Google Scholar
Kraemer, K. H. Xeroderma pigmentosum. Cutaneous, ocular, and neurologic abnormalities in 830 published cases. Arch. Dermatol. https://doi.org/10.1136/jmg.2010.083022 (1987).
Hadj-Rabia, S. et al. Unexpected extradermatological findings in 31 patients with xeroderma pigmentosum type C. Br. J. Dermatol. 168, 1109–1113 (2013).
Article CAS Google Scholar
Jerbi, M. et al. Clinical, genealogical and molecular investigation of the xeroderma pigmentosum type C complementation group in Tunisia. Br. J. Dermatol. 174, 439–443 (2016).
Article CAS Google Scholar
Sarasin, A. et al. Familial predisposition to TP53/complex karyotype MDS and leukemia in DNA repair-deficient xeroderma pigmentosum. Blood https://doi.org/10.1182/blood-2019-01-895698 (2019).
Oetjen, K. A. et al. Predisposition to hematologic malignancies in patients with xeroderma pigmentosum. Haematologica haematol. 2019, 223370 (2019).
Google Scholar
Melis, J. P. M. et al. Mouse models for Xeroderma Pigmentosum group A and group C show divergent cancer phenotypes. Cancer Res. 68, 1347–1353 (2008).
Article CAS Google Scholar
Wijnhoven, S. W. P. et al. Age-dependent spontaneous mutagenesis in Xpc mice defective in nucleotide excision repair. Oncogene 19, 5034–5037 (2000).
Article CAS Google Scholar
Melis, J. P. M. et al. Slow accumulation of mutations in Xpc-/- mice upon induction of oxidative stress. DNA Repair https://doi.org/10.1016/j.dnarep.2013.08.019 (2013).
Yoon, T. et al. Tumor-prone phenotype of the DDB2-deficient mice. Oncogene 24, 469–478 (2005).
Article CAS Google Scholar
Soufir, N. et al. A prevalent mutation with founder effect in Xeroderma Pigmentosum group C from North Africa. J. Invest. Dermatol. 130, 1537–1542 (2010).
Article CAS Google Scholar
Ley, T. J. et al. Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia. N. Engl. J. Med. https://doi.org/10.1056/NEJMoa1301689 (2013).
Papaemmanuil, E. et al. Genomic classification and prognosis in acute myeloid leukemia. N. Engl. J. Med. 374, 2209–2221 (2016).
Article CAS Google Scholar
Alexandrov, L. B., Nik-Zainal, S., Wedge, D. C., Campbell, P. J. & Stratton, M. R. Deciphering signatures of mutational processes operative in human cancer. Cell Rep. 3, 246–259 (2013).
Article CAS Google Scholar
Alexandrov, L. B. et al. Signatures of mutational processes in human cancer. Nature 500, 415–421 (2013).
Article CAS Google Scholar
Jager, M. et al. Deficiency of nucleotide excision repair is associated with mutational signature observed in cancer. Genome Res. https://doi.org/10.1101/gr.246223.118 (2019).
Zheng, C. L. et al. Transcription restores DNA repair to heterochromatin, determining regional mutation rates in cancer genomes. Cell Rep. 9, 1228–1234 (2014).
Article CAS Google Scholar
Haradhvala, N. J. et al. Mutational strand asymmetries in cancer genomes reveal mechanisms of DNA damage and repair. Cell 164, 538–549 (2016).
Article CAS Google Scholar
Cowie, D. A., Nazarethi, J. & Story, D. A. Chromatin organization is a major influence on regional mutation rates in human cancer cells. Anaesth. Intensive Care 42, 310–314 (2014).
Article CAS Google Scholar
Seplyarskiy, V. B. et al. Error-prone bypass of DNA lesions during lagging-strand replication is a common source of germline and cancer mutations. Nat. Genet. https://doi.org/10.1038/s41588-018-0285-7 (2019).
Stone, J. E., Lujan, S. A., Kunkel, T. A. & Kunkel, T. A. DNA polymerase zeta generates clustered mutations during bypass of endogenous DNA lesions in Saccharomyces cerevisiae. Environ. Mol. Mutagen. https://doi.org/10.1002/em.21728 (2012).
Matsuda, T. et al. Error rate and specificity of human and murine DNA polymerase η. J. Mol. Biol. https://doi.org/10.1006/jmbi.2001.4937 (2001).
Jolly, C. & Van Loo, P. Timing somatic events in the evolution of cancer. Genome Biol. 19, 1–9 (2018).
Article Google Scholar
Tomasetti, C. Stem cell divisions, somatic mutations, cancer etiology, and cancer prevention. Int. Encycl. Public HEal. 80, 381–388 (2016).
Google Scholar
Chan, S. H. et al. Germline mutations in cancer predisposition genes are frequent in sporadic sarcomas. Sci. Rep. 7, 1–8 (2017).
Article Google Scholar
Hyka-Nouspikel, N. & Nouspikel, T. Nucleotide excision repair and B lymphoma: somatic hypermutation is not the only culprit. Cell Cycle https://doi.org/10.4161/cc.10.14.16496 (2011).
Waszak, S. M. et al. Spectrum and prevalence of genetic predisposition in medulloblastoma: a retrospective genetic study and prospective validation in a clinical trial cohort. Lancet Oncol. 19, 785–798 (2018).
Article CAS Google Scholar
Ma, X. et al. Pan-cancer genome and transcriptome analyses of 1,699 paediatric leukaemias and solid tumours. Nature 555, 371–376 (2018).
Article ADS CAS Google Scholar
Sarasin, A. et al. Prenatal diagnosis in a subset of trichothiodystrophy patients defective in DNA repair. Br. J. Dermatol. https://doi.org/10.1111/j.1365-2133.1992.tb14845.x (1992).
Arnaudeau-Bégard, C. et al. Genetic correction of DNA repair-deficient/cancer-prone xeroderma pigmentosum group C keratinocytes. Hum. Gene Ther. https://doi.org/10.1089/104303403766682241 (2003).
Cartault, F. et al. A new XPC gene splicing mutation has lead to the highest worldwide prevalence of xeroderma pigmentosum in black Mahori patients. DNA Repair https://doi.org/10.1016/j.dnarep.2011.03.005 (2011).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows−Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS Google Scholar
Van der Auwera, G. A. et al. GATK best practices. Curr. Protoc. Bioinform. https://doi.org/10.1002/0471250953 (2002).
Depristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat. Genet. https://doi.org/10.1038/ng.806 (2011).
Ramos, A. H. et al. Oncotator: cancer variant annotation tool. Hum. Mutat. https://doi.org/10.1002/humu.22771 (2015).
Shen, R. & Seshan, V. E. FACETS: Allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing. Nucleic Acids Res. 44, e131 (2016).
Article Google Scholar
Andrews, S. FASTQC A. Quality Control tool for High Throughput Sequence Data. (Babraham Bioinformatics, 2015).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article Google Scholar
Pedersen, B. S. & Quinlan, A. R. Mosdepth: quick coverage calculation for genomes and exomes. Bioinformatics https://doi.org/10.1093/bioinformatics/btx699 (2018).
Ewels, P., Magnusson, M., Lundin, S. & Käller, M. MultiQC: summarize analysis results for multiple tools and samples in a single report. Bioinformatics https://doi.org/10.1093/bioinformatics/btw354 (2016).
Köster, J. & Rahmann, S. Snakemake-a scalable bioinformatics workflow engine. Bioinformatics https://doi.org/10.1093/bioinformatics/bts480 (2012).
Robbe, P. et al. Clinical whole-genome sequencing from routine formalin-fixed, paraffin-embedded specimens: pilot study for the 100,000 Genomes Project. Genet. Med. https://doi.org/10.1038/gim.2017.241 (2018).
Derrien, T. et al. Fast computation and applications of genome mappability. PLoS ONE https://doi.org/10.1371/journal.pone.0030377 (2012).
Kent, W. J. et al. The Human Genome Browser at UCSC. Genome Res. https://doi.org/10.1101/gr.229102 (2002).
Blokzijl, F., Janssen, R., van Boxtel, R. & Cuppen, E. MutationalPatterns: Comprehensive genome-wide analysis of mutational processes. Genome Med. https://doi.org/10.1186/s13073-018-0539-0 (2018).
Bergstrom, E. N. et al. SigProfilerMatrixGenerator: a tool for visualizing and exploring patterns of small mutational events. BMC Genomics 20, 1–12 (2019).
Article Google Scholar
Campbell, P. J. et al. Pan-cancer analysis of whole genomes. Nature https://doi.org/10.1038/s41586-020-1969-6 (2020).
Hutchins, L. N., Murphy, S. M., Singh, P. & Graber, J. H. Position-dependent motif characterization using non-negative matrix factorization. Bioinformatics https://doi.org/10.1093/bioinformatics/btn526 (2008).
Huang, X., Wojtowicz, D. & Przytycka, T. M. Detecting presence of mutational signatures in cancer with confidence. Bioinformatics 34, 330–337 (2018).
Article CAS Google Scholar
Schumann, F. et al. SigsPack, a package for cancer mutational signatures. BMC Bioinform. 20, 1–9 (2019).
Article CAS Google Scholar
Kundaje, A. et al. Roadmap Epigenomics Consortium: integrative analysis of 111 reference human epigenomes. Nature https://doi.org/10.1038/nature14248 (2015).
Frankish, A. et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. https://doi.org/10.1093/nar/gky955 (2019).
Quinlan, A. R. BEDTools: The Swiss-Army tool for genome feature analysis. Curr. Protoc. Bioinform. https://doi.org/10.1002/0471250953.bi1112s47 (2014).
Hansen, R. S. et al. Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc. Natl Acad. Sci. USA https://doi.org/10.1073/pnas.0912402107 (2010).
Thurman, R. E., Day, N., Noble, W. S. & Stamatoyannopoulos, J. A. Identification of higher-order functional domains in the human ENCODE regions. Genome Res. https://doi.org/10.1101/gr.6081407 (2007).
Neph, S. et al. BEDOPS: High-performance genomic feature operations. Bioinformatics https://doi.org/10.1093/bioinformatics/bts277 (2012).
R Development Core Team, R. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing https://doi.org/10.1007/978-3-540-74686-7 (2011).
Petryk, N. et al. Replication landscape of the human genome. Nat. Commun. 7, 1–13 (2016).
Article Google Scholar
Seplyarskiy, V. B. et al. APOBEC-induced mutations in human cancers are strongly enriched on the lagging DNA strand during replication https://doi.org/10.1101/gr.197046.115 (2016).

Download references

Acknowledgements

S.N. was supported by grant Foundation ARC 2017, Foundation Gustave Roussy and Swiss Cancer League KFC-3985-08-2016. B.T.M. was supported by MES RK grant AP05134722 and NU ORAU grant 091019CRP2111. The authors would like to thank Dr. Patricia Kannouche and Dr. V.B. Seplyarskiy for fruitful discussions and participation, and Dr. F. Rajabi, Dr. Catherine Genestie and Dr. Samuel Quentin for DNA extraction and providing samples. The authors are also very thankful to Dr. C. Genestie (IGR, Villejuif, France), Dr. Z. Tata and Dr. S. Duquenne and Dr. F. Cartault for giving us or for manipulating biopsies of tumors and Xiaole Xu (BGI) for the management of sequencing.

Author information

Authors and Affiliations

INSERM U981, Gustave Roussy Cancer Campus, Université Paris Saclay, Villejuif, France
Andrey A. Yurchenko, Ismael Padioleau & Sergey Nikolaev
National Laboratory Astana, Nazarbayev University, 010000, Astana, Kazakhstan
Bakhyt T. Matkarimov
University of Paris, INSERM U944 and CNRS UMR7212, Institut de Recherche Saint-Louis, F-75010, Paris, France
Jean Soulier
CNRS UMR9019 Genome Integrity and Cancers, Institut Gustave Roussy, Université Paris-Saclay, Villejuif, France
Alain Sarasin

Authors

Andrey A. Yurchenko
View author publications
You can also search for this author in PubMed Google Scholar
Ismael Padioleau
View author publications
You can also search for this author in PubMed Google Scholar
Bakhyt T. Matkarimov
View author publications
You can also search for this author in PubMed Google Scholar
Jean Soulier
View author publications
You can also search for this author in PubMed Google Scholar
Alain Sarasin
View author publications
You can also search for this author in PubMed Google Scholar
Sergey Nikolaev
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.N., A.S. and A.A.Y. designed the study. A.S. and J.S. collected the samples. A.A.Y. performed the data analysis and prepared figures. B.T.M. participated in the data analysis. I.P. performed data preprocessing. A.A.Y. and S.N. drafted manuscript. A.S. and J.S. commented manuscript. All authors contributed to the final version of the paper.

Corresponding author

Correspondence to Sergey Nikolaev.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Charles Mullighan, Steven Rozen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Software 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yurchenko, A.A., Padioleau, I., Matkarimov, B.T. et al. XPC deficiency increases risk of hematologic malignancies through mutator phenotype and characteristic mutational signature. Nat Commun 11, 5834 (2020). https://doi.org/10.1038/s41467-020-19633-9

Download citation

Received: 19 March 2020
Accepted: 07 October 2020
Published: 17 November 2020
DOI: https://doi.org/10.1038/s41467-020-19633-9
Springer Nature Limited

This article is cited by

DNA mismatch and damage patterns revealed by single-molecule sequencing
- Mei Hong Liu
- Benjamin M. Costa
- Gilad D. Evrony
Nature (2024)
Genomic mutation landscape of skin cancers from DNA repair-deficient xeroderma pigmentosum patients
- Andrey A. Yurchenko
- Fatemeh Rajabi
- Sergey I. Nikolaev
Nature Communications (2023)
Early-onset gynecological tumors in DNA repair-deficient xeroderma pigmentosum group C patients: a case series
- Andrey. A. Yurchenko
- Brice Fresneau
- Sergey I. Nikolaev
Communications Medicine (2023)
Pathological consequences of DNA damage in the kidney
- Juan I. Garaycoechea
- Catherine Quinlan
- Martijn S. Luijsterburg
Nature Reviews Nephrology (2023)
Increased risk of internal tumors in DNA repair-deficient xeroderma pigmentosum patients: analysis of four international cohorts
- Sergey Nikolaev
- Andrey A. Yurchenko
- Alain Sarasin
Orphanet Journal of Rare Diseases (2022)

XPC deficiency increases risk of hematologic malignancies through mutator phenotype and characteristic mutational signature

Abstract

Similar content being viewed by others

Introduction

Results

XP-C leukemia is characterized by mutator phenotype

XPC deficiency underlies characteristic mutational process

Mutational asymmetries in XP-C tumors

The majority of mutations precede copy number alterations

Discussion

Methods

Studied samples

Genome sequencing and data processing

Filtration of somatic variants

Mutational signatures analysis

Transcriptional strand bias analysis

Replication timing

Epigenetic marks and mutational density

Replicational strand bias

Clustered mutations

Relative number of mutations before and after SCNAs

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation