Main

Somatic mutations that accumulate during the human lifespan are emerging as potential contributors to several disorders beyond their known role in cancer1. The most prominent example is that of clonal hematopoiesis (CH), a condition in which de novo somatic mutations in hematopoietic stem cells (HSCs) provide a selective advantage that leads to the clonal outgrowth of the mutant cell2,3. CH-associated mutations affect a limited set of genes—most frequently the epigenetic regulatory genes DNMT3A and TET2—that are also commonly mutated in hematological neoplasia. Accordingly, CH is associated with a higher risk of developing hematological cancer4,5,6,7,8. In addition, CH is also emerging as a shared risk factor for a growing list of nonhematological disorders4,6,7,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23. Atherosclerotic cardiovascular disease (CVD) is the nonhematological condition most strongly linked to CH2,3,4,9,13,19,24. However, whether accelerated atherosclerosis is a cause or a consequence of CH remains a matter of debate. While sequencing studies in humans and experimental studies in mice support a direct causal contribution of some CH-related mutations to atherosclerosis by exacerbating inflammatory responses4,9,13,19,25,26, mathematical models based on human and mouse data suggest that the association between CH and CVD may instead reflect reverse causality27. Specifically, it has been posited that CH could simply be a direct consequence of increased HSC proliferation in individuals with atherosclerosis, resulting in accelerated expansion of existing mutant hematopoietic clones27,28,29. This reverse causality hypothesis could also apply to other CH-related associations, thereby explaining why CH is associated with a very diverse range of conditions4,6,7,9,10,11,12,13,14,15,16,17,18,30,31,32. The uncertain nature of the relationship between CH and atherosclerosis has particularly important clinical implications, as the specific atherogenic effects of CH mutations on leukocytes are being discussed as potential targets for the development of personalized strategies to reduce or prevent CVD in individuals with CH3. Such approaches would not be effective if the association between CH and atherosclerosis is merely due to the effects of atherosclerosis on the dynamics of CH.

Resolving the controversy surrounding the directionality of the CH–atherosclerosis association requires longitudinal analyses that combine deep genotyping of CH-related genes with deep phenotyping of study participants. Such longitudinal analyses were not feasible in earlier studies, which mainly relied on cross-sectional whole-exome sequencing (WES) datasets at a single time point4,6,7,9,13,16. To address this issue, here we take advantage of the Progression of Early Subclinical Atherosclerosis (PESA) study33, a longitudinal cohort of healthy middle-aged individuals with serially collected blood DNA samples and deep cardiovascular and metabolic phenotyping, including extensive imaging-based assessment of subclinical atherosclerosis burden in multiple vascular territories and at multiple time points. Leveraging this resource, we used a high-sensitivity sequencing approach to investigate CH and its dynamics in a longitudinal manner, aiming to elucidate the interplay between CH and atherosclerosis.

Results

CH in healthy middle-aged individuals

Previous analyses of WES or whole-genome sequencing (WGS) datasets suggested that CH is relatively uncommon in middle-aged individuals, with frequencies ranging approximately from 2% to 3% in individuals aged between 40 and 55 years, compared with >10% in individuals older than 65 (refs. 4,6,7,8,34). However, these previous observations were limited by the low sensitivity of somatic mutation calling based on WES or WGS data, which hampers the detection of small mutant clones (for example those present with variant allele fraction (VAF) <5%, that is, 10% of mutant blood cells, assuming monoallelic mutations)34. Therefore, to elucidate the prevalence and characteristics of CH in this segment of the population, we performed high-sensitivity targeted sequencing of 54 CH-related genes (median depth 3,712×; Extended Data Fig. 1a) in blood samples collected at enrollment from 3,692 participants in PESA (40–55 years old at the time of enrollment, median 45; see Methods for details). We predefined a minimum VAF threshold of 0.2% to identify CH mutations, as our sequencing depth allowed us to detect CH variants at this VAF with a sensitivity greater than 90% (Methods and Extended Data Fig. 1b). Using this threshold and a previously described filtering and curation strategy35,36 (Methods and Supplementary Table 1), we identified a total of 1,172 CH-related somatic mutations in 46 genes, with 1,079 variants (92%) occurring in 33 genes linked to myeloid CH (typically referred to as CH of indeterminate potential or CHIP) and 93 variants in 13 genes linked to lymphoid CH8,37 (Extended Data Fig. 1a and Supplementary Table 2). Approximately 25% of individuals carried at least one CH-related mutation detectable with our sequencing approach. Mutations in DNMT3A were most frequent (657 mutations, driving CH in 14.8% of individuals) followed by mutations in TET2 (153 mutations, driving CH in 3.9% of individuals), consistent with previous studies (Fig. 1a and Extended Data Fig. 1c)4,6,8,34,38. The basic features of CH mutations in the study population aligned well with those reported in previous WES or WGS analyses4,6,34,38 (Extended Data Fig. 1d and Supplementary Table 2), with more than one-half of the single nucleotide variants corresponding to a C > T substitution, a mutational signature characteristic of aging and CH (Extended Data Fig. 1e).

Fig. 1: Prevalence and characteristics of CH in middle-aged individuals.
figure 1

We performed deep targeted sequencing to identify somatic mutations in a custom panel of 54 CH-related genes in 3,692 individuals from the PESA cohort. a, The number of CH driver mutations identified per gene. The values above the bars indicate the percentage of mutations affecting each specific gene. b, The CH prevalence across quartiles of age. c, The number of mutations per individual across quartiles of age. d, The association between advancing age (stratified as quartiles) and CH (analyzed separately as driven by mutations in DNMT3A, TET2 or other genes) based on multivariate logistic regression analyses adjusted for sex. The bars indicate 95% confidence intervals centered in the mean value (square). e, The distribution of mutant clone size in the study population, assessed as VAF. The dashed line shows the 2% VAF threshold most typically used to identify CH. The box shows the 25th (Q1), 50th (median) and 75th (Q3) percentiles of the data. The whiskers represent Q1 − 1.5 × IQR at the minimum and Q3 + 1.5 × IQR at the maximum. f, The prevalence of CH with VAF ≥2% across quartiles of age. g, The association between gene-specific CH and female sex, based on multivariate logistic regression analyses adjusted for age. The bars indicate 95% confidence intervals centered in the mean value (square). h, The CH prevalence across quartiles of age stratified by sex. In b, f and h, CH status in individuals carrying more than one mutation was defined on the basis of the mutation with the highest VAF.

The prevalence of CH mutations in this middle-aged population increased with advancing age (Fig. 1b). After adjustment for sex, each additional year of age was independently associated with a 9% higher relative risk of carrying detectable CH mutations (odds ratio (OR) 1.09, 95% confidence interval (CI) 1.07–1.11, P < 0.001), and the prevalence of CH was 33.5% in the top quartile of age, which included individuals between ∼50 and ∼55 years of age (Fig. 1b). Most mutation carriers (76%) had only one detectable mutation, although the frequency of carrying more than one mutation increased with age (Fig. 1c). Age was markedly associated with DNMT3A-mutant CH (OR 1.09, 95% CI 1.07–1.12, P < 0.001), TET2-mutant CH (OR 1.10, 95% CI 1.05–1.14, P < 0.001) and CH driven by mutations in other genes (OR 1.11, 95% CI 1.08–1.14, P < 0.001), although the increase in CH prevalence with aging differed between driver genes, with DNMT3A-mutant CH rising in prevalence earlier in life than CH driven by mutations in other genes (Fig. 1d).

As expected, the mutant hematopoietic clones identified in our study were generally of much smaller size than those identified in previous WES or WGS analyses4,6,34,38 (median VAF 0.92%, range 0.2–39.9%; Fig. 1e, Extended Data Fig. 1f and Supplementary Table 2). In total, 78.8% of the detected mutations had a VAF smaller than 2%, the minimum clone size threshold used in most previous CH studies4,5,6,7,8,10,11,12,13,14,15,16,17, and only ∼4% of them had a VAF greater than 10%, which identified clones with significant impact on atherosclerotic CVD in previous large studies4,7,9,13,16,39. CH mutations over the typical 2% VAF threshold were detected in 6.0% of the study population and in ∼9% of those between ∼50 and ∼55 years of age (Fig. 1f). Computational downsampling of sequencing depth suggests that this elevated prevalence of CH compared with previous WGS/WES studies4,6,7,8,34 reflects the better sensitivity provided by our targeted sequencing approach (Extended Data Fig. 1g,h).

Consistent with recent reports6,40, we found that DNMT3A-mutant CH was associated with female sex after adjusting for age, with women exhibiting a 64% higher risk of carrying detectable DNMT3A mutations than men (Fig. 1g). In total, 17.8% of women in the study population carried DNMT3A mutations, whereas 13.2% of men carried such mutations, and this higher frequency of DNMT3A mutations in women was evident across all quartiles of age (Fig. 1h). No significant sex-related differences were observed in the prevalence of CH driven by mutations in genes other than DNMT3A (Fig. 1g,h).

Clinical characteristics of CH mutation carriers

Having determined with high resolution the landscape of CH-related mutations in our study population, we next aimed to examine the pathophysiological effects of CH in these middle-aged individuals. Table 1 presents the demographic and clinical characteristics of the study population, stratified by CH status. The characteristics of individuals with clones with VAF ≥2% or gene-specific forms of CH are listed in Extended Data Table 1 and Supplementary Table 3, respectively. Carrying CH-related mutations was associated with higher absolute blood cell counts in all hematopoietic lineages in multivariate regression analysis adjusted for age and sex (Extended Data Table 2). However, this effect was mild overall, with 3.9% and 7.2% relative increases in median counts of circulating leukocytes in those carrying CH-related mutations with VAF <2% and ≥2%, respectively (Extended Data Table 3). No other associations with major hematological parameters were observed. Overall, these findings corroborate the known relationship between CH and blood cell counts6,7,13,34. CH mutation carriers also tended to exhibit higher blood pressure and higher levels of glycated hemoglobin HbA1C (Table 1), consistent with some previous reports6,13,41, but these associations were not statistically significant after adjustment for age and sex (Supplementary Table 4).

Table 1 Baseline characteristics of the study population stratified by the presence of CH mutations

No cross-sectional association between CH and atherosclerosis

We next investigated the cross-sectional relationship between CH and subclinical atherosclerosis burden assessed at enrollment through noninvasive vascular imaging across multiple vascular beds, including three-dimensional vascular ultrasound imaging (3DVUS) of carotid and femoral arteries, and computed tomography (CT)-based determination of coronary artery calcium scores (CACS). Although PESA participants who carried CH mutations exhibited greater subclinical atherosclerosis burden (Extended Data Fig. 2a–d), no significant association was observed after adjusting for age and sex, or age, sex and conventional cardiovascular risk factors (Extended Data Table 4). Similar results were obtained in gene-specific analyses (Supplementary Table 5).

CH is associated with de novo femoral atherosclerosis

We next took advantage of the longitudinal nature of the PESA study and evaluated whether individuals who carried CH mutations at enrollment exhibited a higher risk of developing de novo atherosclerosis over subsequent years (Fig. 2a). To do this, we leveraged the availability of 3DVUS imaging data for the carotid and femoral arteries, collected at enrollment and the 3-year and 6-year follow-up imaging examinations in PESA (Fig. 2b). This imaging modality allows for the direct detection of atherosclerotic plaques, including the small plaques that are characteristic of early atherosclerosis and can be expected to develop in this timespan42,43. CH mutation carriers exhibited markedly higher rates of de novo femoral atherosclerosis development at the 3-year follow-up, which correlated with mutant clone size (Fig. 2c). Similar results were observed when restricting this analysis to mutations in CHIP genes or specific frequently mutated genes (Fig. 2c), or when evaluating de novo atherosclerosis development using imaging data from the 6-year follow-up (Fig. 2d).

Fig. 2: Longitudinal assessment of the effects of CH on de novo femoral atherosclerosis development.
figure 2

We investigated the association between CH at baseline and de novo development of femoral atherosclerosis ∼3 years and ∼6 years after enrollment among PESA participants who initially lacked detectable atherosclerosis in this region. 3DVUS imaging was used to determine the presence of femoral atherosclerosis. a, Summary of study design. b, Representative images from femoral atherosclerosis burden, assessed by 3DVUS, in an individual exhibiting de novo femoral atherosclerosis development (that is, absence of detectable atherosclerosis at the baseline evaluation (left) and plaque development at follow-up (right); scale bar, 5 mm). c,d, Incidence of de novo femoral atherosclerosis development at the 3-year follow-up (c, n = 2,347) or the 6-year follow-up (d, n = 2,214) in individuals free of CH (no CH) and in individuals exhibiting CH with VAF 0.2–2% or ≥2%. Incidence of de novo femoral atherosclerosis is also shown for myeloid CH (CHIP) or CH driven by specific CHIP genes. Statistical significance in the analyses of CH and CHIP was evaluated using univariate logistic regression models (P for trends are shown). In gene-specific analyses, statistical significance was examined through proportion tests relative to the non-CHIP carriers group (*P < 0.05, **P ≤ 0.01, ***P ≤ 0.001). e,f, The association between CH or CHIP and de novo femoral atherosclerosis development at the 3-year follow-up (e, n = 2,347) and 6-year follow-up (f, n = 2,214) based on multivariate logistic regression analyses. Statistical models were adjusted for age, sex, smoking, lipid-lowering treatment and the AUC of SBP, fasting glucose, LDL-C, BMI, CACS and global atherosclerotic plaque volume assessed by 3DVUS imaging; the bars indicate 95% confidence intervals centered in the mean value (square).

To account for the possible effects of the cumulative exposure to risk factors or the systemic burden of atherosclerosis27,28,29, we next evaluated the effects of CH in statistical models adjusted for the area under the curve (AUC) of conventional CV risk factors, CACS (assessed by CT) and global atherosclerotic plaque volume (assessed by 3DVUS), in addition to age and sex (see Methods for details). Using this approach, CH was independently associated with a 1.5-fold higher risk of developing de novo femoral atherosclerosis at the 3-year follow-up among participants who lacked detectable atherosclerosis in this vascular region at baseline (Fig. 2e). The magnitude of this effect was dependent on mutant clone size, with smaller clones (VAF 0.2–2%) conferring a 1.4-fold higher risk of developing de novo femoral atherosclerosis, which did not reach statistical significance (P = 0.072), and larger clones leading to a 2.1-fold higher risk, an effect comparable to that of traditional modifiable risk factors such as dyslipidemia or smoking (Extended Data Fig. 3a). These associations were independent of blood cell counts (Extended Data Fig. 3b). The effect of CH was primarily driven by mutations in genes related to myeloid CH or CHIP, which were associated with 1.6-fold greater risk of de novo femoral atherosclerosis (Fig. 2e and Extended Data Fig. 3c,d). In gene-specific analyses, CH driven by mutations in DNMT3A, TET2 and other genes were all associated with higher rates of de novo femoral atherosclerosis (Fig. 2e). The effect of CH mutations remained consistent after correction for multiple hypothesis testing (Supplementary Table 6). Similar, albeit milder, results were also obtained when evaluating femoral atherosclerosis development at the 6-year follow-up in PESA (Fig. 2f). No association was observed between CH and atherosclerosis development in the carotid artery (Supplementary Fig. 1), a region where atherosclerosis seems to develop at later ages based on previous analyses in PESA and other cohorts with multiterritorial vascular imaging data42,44,45.

No effect of subclinical atherosclerosis on CH dynamics

Finally, we determined the longitudinal dynamics of CH in healthy middle-aged participants in PESA to examine its potential modulation by the presence or extent of atherosclerosis. To do this, we sequenced blood samples collected at baseline and ∼6 years after enrollment from 718 CH mutation carriers (Fig. 3a). We then used the serially determined VAF measurements as a surrogate for clone size to calculate the annual relative expansion rates (AER) of CH mutations in blood (see details in Methods). Among 602 CH-related mutations that were detected at both time points with comparable sequencing depth and quality (Supplementary Table 7), the median AER was 6.5% per year. CH-related mutations in DNMT3A (381 variants) and TET2 (80 variants) expanded at median AER of 7.0% per year and 7.6% per year, respectively (Fig. 3b). Overall, mutations in epigenetic regulatory genes showed numerically slower expansion rates than genes encoding splicing regulators or involved in the DNA damage response (Supplementary Fig. 2), consistent with previous findings46,47,48. As a control, we also quantified the expansion of 271 nondeleterious somatic variants in CH-related genes identified at both time points in 223 individuals not carrying CH driver mutations. As expected, these neutral mutations generally did not show clear evidence of expansion within the timeframe of the study (median AER: 1.4% per year) (Fig. 3b).

Fig. 3: Longitudinal assessment of the dynamics of CH in middle-aged individuals.
figure 3

We monitored the expansion of 602 CH-related mutations through serial sequencing of blood samples from 494 individuals at baseline and ∼6 years after enrollment. Then, we investigated the association between subclinical atherosclerosis burden or related traits at baseline and the AER of the mutant hematopoietic clones. a, Summary of study design. b, The AER of mutations in any CH gene (n = 602) or specifically in DNMT3A (n = 381), TET2 (n = 80) or other genes (n = 141), compared with nondeleterious variants in CH-related genes (n = 271). Statistical significance was determined by two-sided Mann–Whitney–Wilcoxon tests (**P ≤ 0.01, ****P ≤ 0.0001). c, The AER of CH-related mutations stratified by age quartiles (n = 602). d, AER of CH-related mutations stratified by sex (n = 602). e, The correlation between AER of CH-related mutations and baseline variant allele frequency (VAF). The P value for two-sided Pearson correlation is indicated. fi, The AER of CH-related mutations in individuals with no detectable atherosclerosis and across tertiles of plaque burden, based on CACS (f, n = 600), global plaque volume (g, n = 578), carotid plaque volume (h, n = 597) or femoral plaque volume (i, n = 581) measured by 3DVUS. j,k, The AER of CH-related mutations stratified by conventional modifiable risk factors (j, n = 602 for all variables, except for obesity, n = 601) and quartiles of hsCRP (k, n = 600). Statistical significance in c, d and fk was determined by mixed-effects models adjusted for baseline VAF (d) or sex and baseline VAF (c and fk). The boxes in bd and fk represent the 25th (Q1), 50th (median) and 75th (Q3) percentiles of the data. The whiskers represent Q1 − 1.5 × IQR at the minimum and Q3 + 1.5 × IQR at the maximum.

The AER of CH-related mutations was comparable across quartiles of age, consistent with previous findings (Fig. 3c)48. In contrast, a trend to faster expansion was observed in men (Fig. 3d) and among variants with lower baseline VAF (Fig. 3e). Importantly, the expansion of CH variants was not affected by the presence or the extent of subclinical atherosclerosis at baseline, assessed through various noninvasive imaging modalities (Fig. 3f–i and Table 2). Similarly, the cumulative burden of atherosclerosis over the timeframe of this longitudinal analysis, estimated as the AUC of several vascular imaging parameters, had no significant effect on the dynamics of CH (Table 2). In sensitivity analyses, we obtained comparable results when defining mutant cell expansion with two alternative mathematical approaches used in other studies of CH dynamics, namely the logarithm of the ratio of VAFs at both time points49 and the exponential growth rate50 (Table 2). Similarly, neither high-sensitivity C-reactive protein (hsCRP), a biomarker of systemic inflammation, nor several conventional cardiovascular risk factors, such as dyslipidemia, diabetes, smoking or obesity, were associated with significant changes in AER or other indicators of clonal expansion (Fig. 3j,k and Extended Data Table 5). Hypertension was associated with higher AER (P = 0.048; Fig. 3j), but this relationship was not statistically significant when evaluating other indicators of CH dynamics or the effect of baseline or cumulative systolic blood pressure (SBP) (Extended Data Table 5).

Table 2 No effect of subclinical atherosclerosis on the expansion of mutant hematopoietic clones

Discussion

CH has emerged in recent years as a potential shared driver of blood cancer and several nonhematological conditions, most notably atherosclerotic CVD2,3. However, the accumulating reports of positive associations between CH and a diverse range of conditions and outcomes4,6,7,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,30,32 also lend support to the possibility that CH is simply a marker of a shared pathophysiological feature of these conditions, rather than a causal factor. The association between CH and atherosclerosis, in particular, has become a matter of active debate27,29, hindering the translation of research findings into new strategies for the prevention of CVD in CH mutation carriers. In this context, our longitudinal assessment of the interplay between CH, atherosclerosis and related traits provides important new insights into the relationship between CH and atherosclerosis.

Most importantly, our findings shed light on the directionality of the association between CH and atherosclerosis. Previous research in both humans and mice has suggested that certain CH mutations directly contribute to the development of atherosclerotic CVD4,9,13,25,26. In contrast, recent mathematical models based on human and mouse data have proposed that the association between CH and CVD may instead reflect reverse causality, whereby CH would simply be a direct consequence of increased HSC proliferation in individuals with atherosclerosis, resulting in accelerated kinetics of mutant cell expansion27,29. This reverse causality hypothesis is also applicable to the association between CH and other conditions, such as diabetes4,23,41, known to increase HSC proliferation51. The direction of these associations remains uncertain, partly due to the fact that most of them were identified through post hoc analyses of cross-sectional sequencing datasets4,7,9,10,11,12,13,14,15,16,17,18,30,31,32, which provide only a static ‘snapshot’ of CH at a given time and, therefore, very limited insight into the dynamics of this condition and its association with disorders that develop subclinically over the course of years. In this context, our analysis of longitudinal sequencing and imaging data provides novel insights into the relationship between CH and atherosclerosis, demonstrating that: (1) carrying CH mutations at a given time confers a greater risk of developing femoral atherosclerosis over subsequent years, and (2) the presence or extent of subclinical atherosclerosis has no significant effect on the expansion of mutant hematopoietic clones. These findings argue strongly against the possibility that the association between CH and atherosclerosis is due to reverse causality, thus providing an essential prior step for the development of clinical trials that evaluate the efficacy of personalized preventive care strategies tailored to the effects of specific CH mutations. Our findings, however, do not rule out that certain cardiometabolic conditions may accelerate CH dynamics to some extent. Indeed, we found that hypertensive individuals exhibit a non-statistically significant trend to faster expansion of mutant cells, an observation that demands further exploration given the emerging evidence suggesting an impact of hypertension on the bone marrow niche and hematopoiesis52,53,54. In addition, we cannot rule out the possibility that clinically overt CVD (for example, acute ischemic heart disease) modifies the dynamics of CH. Although this possibility would not affect the primary conclusions of our study, it deserves further investigation as it may help understand the regulation and effects of CH in high-risk CVD patients. Overall, further studies are warranted to dissect the factors that determine the dynamics of CH and explore the underlying regulatory mechanisms.

Our study also provides a large high-sensitivity assessment of the prevalence of CH and the magnitude of its connection to atherosclerosis. Previous human genetic evidence linking CH to atherosclerotic CVD was mainly obtained by repurposing WES datasets4,7,9,13,16,39, which were not intended initially to investigate somatic variants and have limited sensitivity to detect CH due to their modest sequencing depth. A previous report suggested that the typical depth of WES analyses only provides robust ability to identify mutant clones with VAF >5%, (that is, carriers of >10% mutant blood cells, if carrying monoallelic mutations)34, which is consistent with our estimations based on downsampling of sequencing depth in our study population. Furthermore, slight differences in variant interpretation and filtering parameters can be a source of major variability when using WES data to detect CH, as illustrated by recent studies by independent groups, who reached substantially different conclusions about the prevalence of CH and the strength of its association with atherosclerotic CVD despite analyzing WES data from the same cohort7,8,16. Here, through high-sensitivity sequencing of almost 3,700 seemingly healthy individuals, we demonstrate that CH mutations are prevalent in middle-aged individuals, being detectable in ∼25% of healthy individuals between 40 and 55 years of age, and in ∼34% of those aged 50–55. Even when applying the commonly used 2% VAF threshold for CH detection, we found that 6% of the study population (age range 40–55 years old) and 9% of those aged 50–55 exhibit CH, indicating an approximately three times higher prevalence of CH in middle-aged individuals than previously reported4,6,7,8,34. Thus, our findings suggest that CH contributes to atherosclerosis development in a much greater proportion of the population than previously believed. Importantly, the vast majority of mutant clones in our study population were small, with 79% of the mutations exhibiting a VAF <2% and 96% of them exhibiting a VAF <10% at baseline. Yet, carrying clones in this size range was still pathophysiologically relevant, as it was associated with a higher risk of de novo femoral atherosclerosis development, particularly among individuals with clones expanded to VAF ≥2%. These findings contribute to clarify the threshold of VAF that defines pathophysiologically relevant CH, as large WES-based studies previously concluded that clones with VAF <10% had minimal impact on subclinical atherosclerosis, atherosclerotic CVD or CVD-related mortality6,7,9,16,39, in contrast to smaller studies using more sensitive targeted sequencing approaches19.

Our results also provide important insight into gene-specific effects of CH. While the effect of somatic TET2 mutations on atherosclerotic CVD is supported by several sequencing studies9,13,16, the effect of DNMT3A mutations has remained controversial, with previous WES-based studies reporting a minimal, almost null effect of these mutations on atherosclerotic CVD7,13,16. In this context, while additional studies are warranted, our gene-specific analysis reinforces the association between somatic DNMT3A mutations and atherosclerotic disease, consistent with experiments in mouse models55 and recent targeted sequencing studies56, and in contrast to WES-based analyses7,13,16. As DNMT3A mutations are the most frequent mutations linked to CH, these findings underscore the potential clinical significance of this condition.

Several limitations must be considered when interpreting our findings. First, the study population consisted entirely of Caucasian individuals residing in Madrid, Spain, which may limit the generalizability of our results to other racial and ethnic groups. Second, our findings related to the effects of CH on de novo femoral atherosclerosis cannot be directly extrapolated to other vascular beds; in this context, future studies with other imaging modalities and in other age ranges are warranted. Third, CH in this middle-aged population was markedly dominated by DNMT3A mutations, which limits our statistical power to assess the effects of CH driven by mutations in other genes; thus, gene-specific analyses in this work should be considered exploratory. Finally, our study was focused on examining the effects of atherosclerosis on the expansion of existing mutant hematopoietic clones; therefore, we cannot draw any conclusions regarding the potential effects of atherosclerosis or related traits on de novo somatic mutagenesis. Testing this possibility, which to our knowledge has not been postulated so far, would require more sensitive sequencing techniques, such as duplex sequencing.

In summary, our study elucidates the directionality of the association between CH and atherosclerosis in humans. These results provide critical new insight into the effects and regulation of CH, which should help translate our growing knowledge of this condition into personalized strategies for managing the risk of atherosclerotic CVD and, potentially, that of other age-related disorders.

Methods

Study population

The study population included 3,692 participants in the PESA study (NCT01410318) with available blood DNA samples, multimodal vascular imaging data at multiple time points and written informed consent for DNA sequencing analyses. PESA is an ongoing observational prospective cohort study that is examining several imaging, biological and behavioral parameters potentially related to the presence and progression of early subclinical atherosclerosis in 4,184 apparently healthy middle-aged individuals. PESA participants were between 40 and 55 years old at enrollment (between June 2010 and February 2014), and have undergone three study visits so far (baseline and ∼3- and ∼6-year follow-up). All study visits included a clinical interview, physical examination, collection of blood samples and assessment of subclinical atherosclerosis by noninvasive imaging techniques, among other measurements33. Exclusion criteria included known CVD, cancer or immunological disorders, morbid obesity, chronic kidney disease, the presence of any disease expected to decrease life expectancy, and any condition that could affect adherence to the study procedures. The study protocol was approved by the Ethics Committee of Instituto de Salud Carlos III (CEI PI_52/2019), and all participants provided written informed consent.

Assessment of burden and progression of subclinical atherosclerosis and related traits

Atherosclerosis burden in the study population was assessed by noninvasive vascular imaging tests, in accordance with standard protocols in PESA, including 3DVUS-based assessment of the femoral and carotid arteries and noncontrast CT to determine CACS33,42,43,44. Individual-level 3DVUS data were collected using VPQ software v13 (Philips Healthcare) and CM2020 software (Philips Research Analysis). Atherosclerosis by 3DVUS was analyzed separately in carotid and femoral arteries as well as in an integrated manner (that is, global atherosclerotic plaque volume). Presence of subclinical atherosclerosis at baseline was defined as CACS >0 or plaque volume by 3DVUS >0 mm3 (categorical variables). The extent of subclinical atherosclerosis at baseline was estimated on the basis of CACS and plaque volumes, either as continuous variables or categorized as 0 and tertiles. Representative 3DVUS images across tertiles of atherosclerotic plaque volume can be found in Extended Data Fig. 4. In longitudinal analyses, de novo atherosclerosis development was defined as transitioning from absence of detectable atherosclerosis by 3DVUS at baseline to detectable atherosclerosis with plaque volume >0 mm3 at follow-up. Limited sensitivity and statistical power precluded an accurate examination of the effects of CH on the progression of prevalent atherosclerotic plaques. Modifiable risk factors and related traits were defined as continuous or categorical variables and evaluated on the basis of analyses of fasting blood samples and questionnaires. Diabetes was defined as exhibiting plasma fasting glucose ≥126 mg dl−1 or being treated with insulin or hypoglycemic medication. Hypertension was defined as exhibiting SBP ≥140 mmHg or diastolic blood pressure (DBP) ≥90 mmHg, or being treated with antihypertensive medication. Dyslipidemia was defined as exhibiting total cholesterol ≥240 mg dl−1, low-density lipoprotein cholesterol (LDL-C) ≥160 mg dl−1 or high-density lipoprotein cholesterol (HDL-C) <40 mg dl−1, or being treated with lipid-lowering drugs. Obesity was defined as exhibiting a body mass index (BMI) ≥30 kg m−2. Low plasma HDL-C was defined as exhibiting plasma HDL-C <40 mg dl−1 in males or <50 mg dl−1 in females. Hematological analyses included counts of leukocytes and main leukocyte subsets (neutrophils, lymphocytes, monocytes, eosinophils and basophils), red blood cell count, hemoglobin, hematocrit, red blood cell distribution width and platelet counts.

High-sensitivity targeted sequencing to identify CH-related somatic mutations

Sample preparation and high-sensitivity DNA sequencing, as well as somatic variant calling and annotation, were performed as previously described35,36. In brief, a custom gene panel was designed to identify somatic mutations in 54 well-established CH-related genes (Extended Data Fig. 1 and Supplementary Table 1), and unique dual indexes and unique molecular identifiers were used to detect polymerase chain reaction and sequencing errors and, therefore, to improve identification of small mutant clones. After confirming the quality of blood DNA, DNA libraries were generated (KAPA Hyper plus kit, Roche) and genes of interest were captured using xGen Hybridization Capture reagents (Integrated DNA Technologies). Then, DNA libraries were paired-end sequenced on a HiSeq 4000 or a NovaSeq 6000 sequencing system (Illumina). Raw sequencing reads were mapped to human genome build GRCh38, and somatic variants were identified by using Genome Analysis Toolkit (GATK) Mutect2. We removed common sequencing artifacts according to several Mutect2 filters (multiallelic somatic calls and variants with deficient mapping or base quality) as well as those flagged by the position filter. Germline mutations were excluded, and we only considered variants with ‘high’ or ‘moderate’ impact according to Variant Effect Predictor, reference read depth ≥300, alternative depth ≥4 and evidence of the variant on both forward and reverse strands (F1R2 and F2R1 ≥2). We also filtered out variants preceded or followed by homopolymer regions, indels with ≥3 consecutive repeated nucleotides within the insertion or deletion, and those SNVs located in a ≥5-nucleotide-long homopolymer, except if found at multiple sequencing time points in the same individual. Candidate CH driver mutations were then identified on the basis of prespecified inclusion criteria (Supplementary Table 1), the presence of ≥3 times in the Catalogue of Somatic Mutations in Cancer (COSMIC, https://cancer.sanger.ac.uk/cosmic) in hematopoietic samples from at least three different studies, in silico pathogenicity predictors and previous publications using a similar deep sequencing strategy. A specific approach was required to identify mutations in U2AF157 because of an erroneous duplication in the region of this gene in the GRCh38reference genome58.

We set a minimum VAF threshold of 0.2% to identify CH mutations, as our sequencing depth allowed us to detect CH variants at this VAF with a sensitivity greater than 90% (Extended Data Fig. 1b). In brief, we used a binomial distribution to estimate the sensitivity for the detection of somatic variants with a certain VAF at different sequencing depths, requiring a minimum alternative allele count of four reads. CH-related variants over the 0.2% VAF threshold were significantly associated with age (Supplementary Table 8), an essential feature of true CH variants8.

Assessment of expansion rates of mutant hematopoietic clones

Blood samples collected ∼6 years after enrollment from 718 CH mutation carriers were sequenced as described above. Among the 931 CH-related variants with VAF ≥0.2% identified in these individuals at baseline, 261 variants were not detected at follow-up. These variants were excluded from further analysis, as we were unable to ascertain if the apparent disappearance of these mutant clones is due to biological or technical reasons. We also excluded 68 additional variants for which sequencing depths differed substantially between both time points (ratio of baseline depth to follow-up depth exceeding the 90% percentile of the distribution). After applying this filtering strategy, 602 CH-related mutations in 494 individuals were detected at both time points supported by reads in both the forward and reverse strand (Supplementary Table 7). The AER of each of these mutations was calculated as the relative change in VAF over the follow-up period in years \((\Delta {{\mathrm{VAF}}} / {{\mathrm{VA}}}{{{{\mathrm{F}}}}}_{{\rm{baseline}}})/{{\mathrm{time}}}\times 100\). As a sensitivity analysis, we also calculated two additional indicators of the expansion rate of mutant hematopoietic clones based on previous literature: the logarithmic ratio of VAFs, that is log(VAF/VAFbaseline)/time × 100 (ref. 49), and the exponential growth rate, that is, VAF = VAFbaseline × (clonal growth + 1)time; clonal growth = ((VAF/VAFbaseline)(1/(time)) − 1) × 100 (ref. 50). Time refers to the follow-up period measured in years. These three metrics showed a perfect positive correlation with each other, with Spearman’s rank correlation coefficients equal to 1.00 (Supplementary Fig. 3).

As a control, blood samples from 223 individuals not carrying CH-related mutations at baseline were also sequenced ∼6 years after enrollment to estimate the expansion of nondeleterious somatic variants in CH-related genes, as a surrogate of the expansion related to stochastic neutral drift. We considered nondeleterious somatic variants those with impact ‘low’ (unlikely to change protein behavior, for example, synonymous variants) or ‘modifier‘ (typically noncoding variants). Likely germline variants and common sequencing artifacts were filtered out, and only variants supported by reads on both the forward and reverse strands at both time points were included in the analysis, consistent with our approach to investigate the expansion of CH driver variants.

Statistical analysis

All statistical analyses were performed using RStudio (v2022.07.2 + 576) with R version 4.2.2, considering a statistical significance level of 0.05. Adjustments for multiple hypothesis testing were not performed unless otherwise indicated. Variables were expressed as frequency (%), mean and standard deviation, or median and interquartile range (IQR), as appropriate. CH was investigated as a composite of mutations in any CH-related gene, unless otherwise stated. Specific analyses evaluated the effects of mutations in DNMT3A, TET2 and other genes separately. Differences in baseline characteristics between CH mutation carriers and individuals with no detectable CH mutation were evaluated through independent two-tailed t-tests for continuous variables exhibiting a normal distribution, Wilcoxon rank tests for continuous variables with a nonnormal distribution and χ2 tests for categorical variables. Associations between CH and traits of interest were evaluated using univariate or multivariate linear or logistic regression models, adjusted for age and sex or age, sex, LDL-C, SBP, BMI, fasting glucose levels, lipid-lowering treatment and smoking, as specified in figure legends. Some specific analyses were adjusted for age, sex and cardiovascular risk factors defined as categorical variables, as indicated in figure legends. Analyses of the association between CH and de novo development of atherosclerosis were adjusted by the AUCs of LDL-C, SBP, BMI and fasting glucose to consider the cumulative exposure to this cardiovascular risk factors during follow-up. These analyses were also adjusted for the AUC of CACS and global plaque volume to account for the possibility that the possible cumulative effects of systemic atherosclerosis on CH27 confound its relationship with de novo atherosclerosis development in specific regions. In multivariate analyses, only individuals with information for all the covariates included in the model were considered. In analyses concerning specific types of CH (for example, driven by mutations in a given gene or over a certain VAF), individuals not carrying detectable CH mutations were used as the reference category. In individuals carrying more than one CH mutation, all mutations were considered separately in gene-specific analyses, unless otherwise stated. In longitudinal analyses of CH dynamics, differences in the AER of gene-specific mutations compared with the expansion of nondeleterious variants were evaluated using two-tailed Mann–Whitney–Wilcoxon tests. The association between variables of interest and mutant clone expansion was evaluated through mixed-effects models (R package lme4 v1.1.34). Based on an estimation of statistical power (using the G*power software v3.1.9.7 and the R package pwrss v0.3.1), we had >80% power to detect as statistically significant (α = 0.05) any effect of the presence of subclinical atherosclerosis leading to an absolute increase in AER greater than 3.2%, an effect size substantially smaller than that expected to result from the increased HSC proliferation reported in human individuals with atherosclerosis27. To prevent potential biases, the analysis of imaging data, the sequencing of DNA samples, the bioinformatic processing of sequencing data, the curation of somatic variants to identify CH driver mutations and the statistical analysis of associations were conducted independently by different investigators who were blinded to other data.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.