Abstract
Multiple myeloma (MM) is an incurable malignancy of plasma cells. Epidemiological studies indicate a substantial heritable component, but the underlying mechanisms remain unclear. Here, in a genome-wide association study totaling 10,906 cases and 366,221 controls, we identify 35 MM risk loci, 12 of which are novel. Through functional fine-mapping and Mendelian randomization, we uncover two causal mechanisms for inherited MM risk: longer telomeres; and elevated levels of B-cell maturation antigen (BCMA) and interleukin-5 receptor alpha (IL5RA) in plasma. The largest increase in BCMA and IL5RA levels is mediated by the risk variant rs34562254-A at TNFRSF13B. While individuals with loss-of-function variants in TNFRSF13B develop B-cell immunodeficiency, rs34562254-A exerts a gain-of-function effect, increasing MM risk through amplified B-cell responses. Our results represent an analysis of genetic MM predisposition, highlighting causal mechanisms contributing to MM development.
Similar content being viewed by others
Introduction
Multiple myeloma (MM) is one of the most common blood malignancies. It is defined by uncontrolled, clonal growth of plasma cells (Supplementary Fig. 1). Clinically, MM leads to bone marrow failure, bone lesions, and hypercalcemia and remains essentially incurable. It is preceded by monoclonal gammopathy of unknown significance (MGUS), a common condition (~3% of >50 year-olds) that progresses to MM at an annual rate of 1%.
First-degree relatives of MM and MGUS cases have a two- to four-fold higher risk for MM, as well as an increased risk for other B-cell malignancies and some solid tumors1,2,3,4. Genome-wide association studies (GWAS) have identified DNA sequence variants at 25 loci influencing MM risk. However, much of the heritable risk remains unexplained5,6,7,8,9,10, and the biological mechanisms involved are largely uncharacterized11.
In the present study, we conducted a genome-wide association study totaling 10,906 MM cases and 366,221 controls. We identify 35 MM risk loci, 12 of which are novel. By integrating expression quantitative locus (eQTL), chromatin accessibility (ATAC-sequencing), and ultra-high-resolution chromatin configuration analysis (micro-C), we identify causal variants and high-confidence target genes. Using Mendelian Randomization analysis, we uncover two causal mechanisms for inherited MM risk: longer telomeres; and elevated levels of B-cell maturation antigen (BCMA) and interleukin-5 receptor alpha (IL5RA) in plasma. Moreover, we find that the largest increase in BCMA and IL5RA levels is mediated by the risk variant rs34562254-A at TNFRSF13B and that there is an antagonistic relationship between risk of B-cell immunodeficiency and risk of MM for this locus. Our results represent a comprehensive analysis of genetic MM predisposition, highlighting central biological mechanisms contributing to MM development.
Results
Genetic architecture of MM risk
To characterize the germline genetic architecture of MM, we performed a meta-analysis of ten GWAS datasets5,6,7,8,9,10 totaling 10,906 cases and 366,221 controls (Fig. 1a, Supplementary Data 1 and Supplementary Fig. 2). We identified 30 significant (Pmeta < 5 × 10−8) and two suggestive associations (Pmeta < 5 × 10−7), including nine novel significant associations (Supplementary Data 2). Approximate conditional analysis revealed three additional linkage disequilibrium (LD)-independent associations, yielding 12 novel associations (Fig. 1b and Table 1). We replicated all known associations except a previously reported borderline signal at 22q13.1/TOM1 (rs138745; Pmeta = 0.001)7. The two suggestive associations correspond to previously reported signals at 7q31.33/POT1 and 6p22.3/JARID2 (Pmeta = 7.1 × 10−8 and 1.2 × 10−7, respectively)5,6. MM plasma cell-specific transcriptome-wide association study (TWAS) and methylome-wide association study (MWAS) did not identify additional loci but provided support for 11 of the GWAS loci (Supplementary Data 3-4). Using linkage disequilibrium adjusted kinships (LDAK), we estimated the heritability ascribable to all common variation at 15.6% ( ± 4.7). Using LD score regression, we detected enrichment of risk variants in regions of accessible chromatin in plasma cells and B-cells (Fig. 1c), indicating that altered gene regulation in these cell types mediates MM risk. We also noted enrichment in activating histone marks of MM cell lines (Supplementary Fig. 3).
MM can be classified into hyperdiploid and non-hyperdiploid subtypes, the latter being primarily composed of cases with immunoglobulin heavy-chain (IGH) translocations, t(11;14), t(4;14) and t(14;16), which lead to over-expression of oncogenes, CCND1, MMSET and MAF respectively, through juxtaposition with the IGH locus. Previous work has found relationships between the risk loci at 11q13.3/CCND1 and 5q15/ELL2 with t(11;14) and hyperdiploid MM, respectively12,13. For newly discovered loci, we found no evidence for additional subtype-specific associations (Supplementary Data 5).
To examine the genetic overlap with other diseases, we analyzed 6234 MGUS cases and 720,279 controls. We observed a strong, positive correlation with MGUS effect sizes for MM lead variants (Pearson r2 = 0.58, P = 4.2 × 10−7; Fig. 1d and Supplementary Data 6), consistent with risk variants exerting their effects early in clonal evolution. Using the GWAS catalog14, we also identified pleiotropy (r2 > 0.8 between lead variants) with other B-cell neoplasias (8 signals), solid tumors (6 signals), autoimmune diseases (6 signals), and immunological traits (16 signals; Fig. 1e and Supplementary Data 7).
To assess the collective impact of all risk alleles, we calculated polygenic risk scores based on effect sizes and allele frequencies in our study population and the five super-populations in the 1000 Genomes compendium (European, American, African, East Asian, and South Asian). All identified risk variants were polymorphic in all super-populations, except the low-frequency variants at 6p22.3/JARID2, 7q36.1/SMARCD3, and 13q13.1/BRCA2 which were not polymorphic in East Asians (Supplementary Data 8). Consistent with the higher incidence of MM among individuals of African or African-American ancestry, we observed the highest polygenic risk scores in the AMR super-populations (Supplementary Fig. 4; median 2.24 relative to our study population), due to a higher prevalence of several risk alleles (e.g., 3p22.1/ULK4, 16p11.2/RNF40, 10q24.33/STN1, 19p13.3/NFIC, and 2p23.3/DTNB-AS1; Supplementary Data 8).
Identification of target genes
To identify target genes, we considered genes overlapping a region defined by the variants in high LD (r2 > 0.8) with the lead variant at each locus. Additionally, we considered genes with chromatin looping interactions with these regions, as determined by Micro-C analysis in MM cell lines. Among 371 genes in total, we prioritized target genes based on (i) potentially pathogenic coding variants, (ii) variants in long non-coding RNAs, (iii) expression quantitative trait loci (eQTLs) in the B-cell lineage, and (iv) TWAS signals (Supplementary Data 3-9, 10). To identify putative causal variants underlying the effects on gene expression, we performed massively parallel reporter assays (MPRA) in three MM cell lines. We also incorporated published MPRA data11, luciferase assays, and epigenomic annotations. Using conservative criteria (Online Methods), we identified 17 high-LD variants with transcriptional activity. Notably, 16 of these mapped to transcription start sites or enhancers (Table 2 and Supplementary Data 9).
In total, we identified 35 target genes (Fig. 2). Several of these were further supported by DepMap essentiality in MM or lymphoid cells (Supplementary Data 11), a Mendelian cancer predisposition syndrome (BRCA2, CDKN2A, POT1, and RFWD3)15,16, a congenital B-cell immunodeficiency (TNFRSF13B and WAC)17,18, or recurrent somatic genetic lesions in MM (IRF4, MYC, PRDM1, JARID2, MXI1, TNFRSF13B, and POT1)19,20,21,22,23. We also noted enrichment of target gene expression in the B-cell lineage (Supplementary Fig. 5). A more detailed description of all target genes is provided in Supplementary Notes.
Biological pathways involved in MM predisposition
Pathway analysis showed that the set of target genes is enriched for genes involved in B-cell development, chromatin organization, and telomere maintenance (Supplementary Data 12). For example, SAMSN1 encodes a regulator of B-cell activation, and SAMSN1 deletions have been reported in MM-prone mice24. TNFRSF13B regulates B-cell homeostasis25,26,27,28,29,30,31. ELL2 drives immunoglobulin (Ig) synthesis in plasma cells32,33. PRDM1 and ATG5 are essential for plasma cell survival34. Several other target genes interact with the MYC-IRF4 pathway, which plays a key role in B-cell and plasma cell development (Supplementary Fig. 6)23,35. These findings, and the enrichment of MM risk variants in accessible chromatin of plasma cells and B-cells (Fig. 1c), suggest that dysregulation of the germinal center and post-germinal center reaction is critical to MM predisposition.
Target genes involved in chromatin organization, cell cycle regulation, and DNA repair include CDKN2A, RFWD315, NFIC36, JARID237, SMARCD311, HMGN438, and CEP12039. Notably, the 13q13.1 association represents a pathogenic truncating variant in BRCA2 (Lys3326Ter)40.
Longer telomeres mediate genetic MM risk
Three target genes have well-known roles in telomere maintenance: TERC encodes the telomerase RNA component, POT1 and STN1 subunits of the shelterin complex. Given that leukocyte telomere length (LTL) is a marker for chromosomal instability41,42,43,44,45, we assessed the pleiotropy between MM and LTL using data on 472,174 individuals from UK Biobank46. Using colocalization analysis, we found evidence of shared causal variants for increased MM risk and increased LTL at the TERC, POT1, and STN1 loci (posterior probability, PP, of shared variant > 0.8; Fig. 3a and Supplementary Data 13). Additionally, using LDAK47, we found a positive genetic correlation between MM and LTL (Rg = 0.23, P = 1.87 × 10−5).
To examine the causal effect of LTL on MM risk, we performed a two-sample Mendelian randomization analysis using four methods48,49,50. Increased LTL was consistently associated with increased MM risk (inverse variance weight random effects model P-value, PIVW-RE = 2.07 × 10−4; Fig. 3b and Supplementary Data 14, 15), with the Steiger test confirming that this was the likely causal direction (Supplementary Data 16). These data support that a subset of risk variants increase MM risk by increasing telomere length, plausibly affecting replicative lifespan and/or chromosomal stability and thereby the risk of neoplastic transformation51.
Elevated plasma BCMA and IL5RA levels mediate genetic MM risk
To identify additional mechanisms underlying MM predisposition, we searched for shared effects of risk variants on B-cell and plasma cell development. These processes mainly take place in lymph glands and bone marrow. Since population-scale data is lacking for these tissues, we reasoned that shared mechanisms could be detectable indirectly through effects on circulating levels of proteins derived from these processes in peripheral blood.
Accordingly, we examined the effects of MM risk variants on the levels of 2931 plasma proteins using Olink data for 46,665 UK BioBank individuals. Across nine risk loci, we identified trans-protein quantitative trait loci (trans-pQTLs) for 21 proteins (Supplementary Data 17). Mendelian randomization analysis incorporating a Steiger test for directionality supported a causal relationship between increased levels of B-cell maturation antigen (BCMA; PIVW = 5.6 × 10−6) and interleukin-5 receptor subunit alpha (IL5RA; PIVW = 9.0 × 10−13) and increased MM risk (Fig. 3c-d). Both associations were replicated in SomaScan data for 36,177 Icelanders (Supplementary Data 16,18)52. Nine risk loci (ATG5/PRDM1, CCHCR1, ELL2, MXI1, NFIC, RNF40, SMARCD3, TNFRSF13B and WAC) showed significant association with BCMA and/or IL5RA. Colocalization analysis confirmed a shared variant with MM risk (PP > 0.8) at seven of these (ELL2, MXI1, NFIC, RNF40, SMARCD3, TNFRSF13B and WAC; Fig. 3a and Supplementary Data 19).
The BCMA receptor is expressed on plasma cells and mature B-cells. It binds B-cell activating factor (BAFF) and is a target for MM immunotherapy53. Its soluble form is produced by cleavage of the BCMA extracellular domain by γ-secretase54. Several studies have linked soluble BCMA levels to plasma cell pool size. For example, the plasma BCMA level decreases in MM patients after treatment, and patients with MGUS show lower levels than patients with fully developed MM55,56,57,58,59. IL5RA is also expressed in plasma cells and B-cells (Supplementary Fig. 7). IL5 stimulation promotes plasma cell differentiation and has been implicated in immunogenic MM cell death60,61. These results indicate that a second subset of MM risk variants exert their effects through increased BCMA and IL5RA levels, plausibly reflecting an expanded plasma cell and mature B-cell pool. These risk variants are distinct from those influencing telomere length (Fig. 3a).
The TNFRSF13B risk variant predisposes for MM through a gain-of-function effect
To gain insight into the molecular basis of the elevated BCMA and IL5RA levels, we focused on the TNFRSF13B locus. The TNFRSF13B variant rs34562254-A is one of the most statistically significant MM risk variants. It is associated with the largest increase in BCMA and IL5RA levels (P = 1.4 × 10−97, β = 0.23 for BCMA; P = 4.9 × 10−63, β = 0.19 for IL5RA for rs34562254-A; Supplementary Data 17). In addition, we and others have demonstrated an association between rs34562254-A and higher IgG levels62,63,64.
TNFRSF13B encodes the TACI receptor, a central regulator of B-cell responses and Ig class-switching. Individuals who carry rare loss-of-function variants in TNFRSF13B are predisposed to common variable immunodeficiency (CVID), a condition defined by low IgG and IgA levels due to stalled development of mature B-cells and plasma cells17. The most common CVID variants in TNFRSF13B are the Cys104Arg and Ala181Glu missense variants, which abolish TACI signalling65. We found associations between Cys104Arg and Ala181Glu and lower BCMA and IL5RA levels in the UK Biobank Olink data (Fig. 4a and Supplementary Data 22). The opposite effects on BCMA, IL5RA, and IgG levels shown by rs34562254-A indicate that this MM risk variant has a gain-of-function effect.
Searching for putative causal variants, we noted that rs34562254 is a missense variant (Pro251Leu) that is predicted to be benign66,67. However, we noted an association between rs34562254-A and increased TNFRSF13B expression in B-cells (Supplementary Data 10) and, congruent with this, two variants in high LD (rs4273077 and rs4792800; r2 = 0.90 and 0.92 with rs34562254) showed transcriptional effects in both MPRA datasets (Fig. 4b-c, Table 2 and Supplementary Data 9). Further, both rs4273077 and rs4792800 map to enhancers in TNFRSF13B intron 3, and rs4792800 displays a chromatin looping interaction with the transcription start site (Fig. 4b and Supplementary Data 9). Finally, dual-sgRNA CRISPR/Cas9 deletion of the rs4273077- and rs4792800-harboring regions in Raji cells led to the downregulation of both of the two main TNFRSF13B transcript isoforms (Fig. 4d), further supporting a regulatory role of these regions. These data indicate that the TNFRSF13B MM risk allele exerts a gain-of-function effect leading to increased MM risk.
Discussion
We report a comprehensive analysis of the germline genetic architecture of MM. By bringing together all major GWASs to date, we identify 12 new risk loci. Through functional fine-mapping, we identify high-confidence target genes and central biological pathways (Fig. 5 and Supplementary Notes). Our data support that MM risk variants act early in clonal evolution by predisposing for MGUS rather than for progression from MGUS to MM.
Furthermore, we identify two central mechanisms mediating inherited MM risk: increased LTL and increased BCMA and IL5RA levels. These findings are consistent with a model where dysregulation of telomere maintenance and B-cell and plasma cell development constitute central mechanisms in MM predisposition, each influenced by a distinct subset of risk loci (Fig. 3a). Our initial analysis of the TNFRSF13B risk locus suggests that the increase in BCMA and IL5RA levels reflects a gain-of-function effect leading to increased MM risk through amplified B-cell responses (Fig. 4a).
In conclusion, our study provides insights into genetic MM predisposition, highlighting central biological mechanisms that lead to MM.
Methods
Ethics
Collection of patient samples and clinico-pathological information was undertaken with informed consent and ethical approvals in accordance with the Declaration of Helsinki: for the Myeloma-IX68,69 trial by the Medical Research Council Leukaemia Data Monitoring and Ethics committee (MREC 02/8/95, ISRCTN68454111), the Myeloma-XI70 trial by the Oxfordshire Research Ethics Committee (MREC 17/09/09, ISRCTN49407852), HOVON65/GMMG-HD4 (ISRCTN 644552890; METC 13/01/2015), HOVON87/NMSG18 (EudraCTnr 2007-004007-34, METC 20/11/2008), HOVON95/EMN02 (EudraCTnr 2009-017903-28, METC 04/11/10), University of Heidelberg Ethical Commission (229/2003, S-337/2009, AFmu-119/2010), University of Arkansas for Medical Sciences Institutional Review Board (IRB 202077), Lund University Ethical Review Board (2022-01414-02), the Norwegian REK 2014/97, the Danish Ethical Review Board (no. H-16032570), and the National Bioethics Committee of Iceland (VSN 17-143).
Data reporting
No statistical methods were used to predetermine sample sizes. Experiments were not randomized, and the investigators were not blinded.
Genome-wide association study
We performed a meta-analysis of ten GWAS data sets from nine previously published studies (German, US, UKOnco, UK, Netherlands, Sweden, Norway, Denmark, and Iceland)5,6,7,8,9,10 and the UK Biobank (UKBB), totalling 10,906 cases and 366,221 controls, all population–based cohorts with European Ancestry. Published studies: The nine GWAS comprised Swedish (2338 cases, 11,971 controls), UK (2282 cases, 5197 controls), German (1508 cases, 2107 controls), Danish (940 cases, 91,744 controls), UKOnco (878 cases, 7083 controls), US (780 cases, 1857 controls), Netherlands (555 cases, 2669 controls), Icelandic (598 cases, 313,882 controls), and Norwegian (500 cases, 4696 controls) and. UK Biobank study71: 527 cases of MM and 1417 age and sex-matched controls were identified, and genotypes downloaded. The association between variant genotype and MM was performed under an additive model in SNPTESTv2.5. The diagnosis of MM (ICD-10 C90.0) was established in accordance with World Health Organization guidelines. All samples from patients for genotyping were obtained before treatment or at presentation.
We examined the relationship between genotype and MM in each GWAS, assuming a log-additive model72. Meta-analysis under a fixed-effects inverse-variance weighted model was performed using META v1.773. Variants in the meta-analysis only included those with an imputation quality score (info) > 0.8 and MAF > 0.005 (8.1 million variants after filtering). The I2 statistic was calculated to quantify between-study heterogeneity, and variants with an I2 > 75% were excluded. There was no evidence of genomic inflation (λ = 1, Supplementary Fig. 1). To define known risk loci, we compiled a list of previously reported genome-wide significant association signals for MM (i.e., P < 5 × 10−8). Genome-wide complex trait analysis was used to perform approximate conditional and joint association analysis (COJO)74 to identify independent risk loci. To estimate LD, we used a reference sample of unrelated individuals from a combined dataset of UK10K75 and European individuals from the 1000Genomes Project76, excluding variants with low imputation quality (INFO < 0.8) and deviation from HWE (P < 1 × 10−6). Associations at Pconditional < 5×10−8 within a 1 Mb region of primary associations were considered novel secondary associations.
Transcriptome-wide association study
We retrieved previously published expression data generated on CD138-purified plasma cells from 183 UK (MRC Myeloma IX trial, GSE21349), 658 German (E-MTAB-2299), and 608 US cohorts (GSE2658, GSE31161)77. RNA was profiled using Affymetrix Human Genome U133 2.0 Plus Arrays. Gene expression models were generated using the PredictDB pipeline78 for a total of 1449 participants. Elastic net model building was done independently for each dataset. Models were computed using genotype and expression data, and covariate factors were estimated using PEER79. For the UK dataset, 30 PEER factors were used; for the US and German data sets, 60 PEER factors were used, as recommended by the GTEx protocol. Transcriptome-wide association tests were performed for each dataset individually using S-PrediXcan80 with summary statistics from the GWAS meta-analysis. To combine S-PrediXcan results from the different data sets, we used S-MultiXcan81.
Methylome-wide association studies
Illumina 450 K methylation array data was obtained from 379 of the UK cohort (MRC Myeloma XI trial). The EZ DNA Methylation kit (Zymo Research) was briefly used for bisulfite conversion of genomic DNA. Tumour DNA methylation was profiled using Illumina Infinium HumanMethylation450 arrays. Raw data were exported from Genome Studio (Illumina). Quality checking and normalization of raw methylation data on 378 cases was performed using the ChIP Analysis Methylation Pipeline (ChAMP). The BMIQ method was used to perform normalization. Elastic net model building was performed using genotype and expression data and covariate factors estimated using PEER, where 60 PEER factors were according to the GTEx protocol. Methylome-wide association tests were then performed for the dataset using S-PrediXcan with summary statistics from the GWAS meta-analysis. We annotated CG islands with the nearest gene and considered a Bonferroni-corrected P-value of 2 × 10−6 (i.e., 0.05/25,000 genes) as significant.
Estimation of heritability and partitioned heritability
To estimate the narrow-sense heritability of MM risk, we used LDAK v5.2, applying BLD-LDAK and LDAK-Thin models47. Variants were harmonized to HapMap3 with 1000 Genomes EUR, MAF > 0.01. Transformation of observed scale heritability estimates of MM to the liability scale was carried out, assuming a lifetime risk of 1% for MM. To estimate cell type-specific partitioned heritability based on chromatin accessibility, we used LD-scores based on ATAC-seq data for sorted blood cells available for LDSC12, extended with LD-scores for myeloid, plasmacytoid dendritic cells, and plasma cells computed from published ATAC-seq data8 (NCBI Gene Expression Omnibus accession no. GSE119453; European Genome-phenome Archive accession no. EGAS00001005394 and EGAD00001007814).
ChIP-mentation and ATAC-seq data
We carried out ChIPmentation and ATAC-seq to annotate regulatory elements in KMS11 cells82,83,84. ChIPmentation was carried out for histone marks H3K27Ac, H3K27me3, H3K4me1, H3K4me3, H3K36me3 and H3K9me3 in KMS11, L363, JJN3 and MOLP-8 cell lines. ChIPmentation reads were trimmed and aligned to hg19/GRCh37 using Bowtie2. Duplicate reads were marked and removed using Picard. ChromHMM was used to infer chromatin states, training the model on four cell lines. Genome-wide signal tracks were binarized, including input controls. A 12-state model was assigned to the states85,86,87. We also annotated variants using GM12878 and Bone Marrow mesenchymal stem cell ChromHMM tracks, using Roadmap Epigenomics data88,89.
Cell culture
KMS11, KMS12-BM, L363, MOLP8, MM.1 S, U266B1, and Raji cells were obtained from ATCC, cultured under recommended conditions, and tested for mycoplasma.
Variant set enrichment analysis in ChIP-seq data
To examine enrichment in binding across risk loci, we adapted the method of Cowper-Sal lari et al.90. Briefly, for each risk locus, a region of strong LD (defined as r2 > 0.8 and D′ > 0.8) was determined, and these variants were considered the associated variant set (AVS). ChIP-seq peak data for six histone marks from KMS11, L363, MOLP8, and JJN3 cell lines were generated in-house. For each mark, the overlap of the variants in the AVS and the binding sites was assessed to generate a mapping tally. A null distribution was produced by randomly selecting variants with the same characteristics as the risk-associated variants, and the null mapping tally was calculated. This process was repeated 10,000 times, and P-values were calculated as the proportion of permutations where the null mapping tally was greater or equal to the AVS mapping tally. An enrichment score was calculated by normalizing the tallies to the median of the null distribution. Thus, the enrichment score is the number of standard deviations of the AVS mapping tally from the median of the null distribution tallies.
Association data for MGUS
Summary statistics for the MM variants were obtained from a meta-analysis of 6234 MGUS cases and 720,297 controls from Iceland (4092 cases and 298,673 controls), UK Biobank (1150 cases; 427,714 controls) and the German Cancer Research Center (992 cases; 2910 controls). Cases were defined based on ICD-10 code D47.2. The Icelandic samples were genotyped using Illumina microarrays UK Biobank samples using Affymetrix UK BiLEVE / UK Biobank Axiom chips. Genotypes were long-range phased and imputed using population-specific reference sets91 (whole-genome sequencing data for 63,118 Icelandic and 150,119 UK Biobank individuals). The association between variants and MGUS was tested using logistic regression assuming an additive model92. Association results for individual cohorts were meta-analyzed using inverse variance weighted meta-analysis.
Pleiotropy analysis
We used the GWAS catalog (accessed November 2023) to identify pleiotropy with other diseases, as well as with hematological and immunological traits. We considered highly correlated associations (r2 > 0.8 between the GWAS catalog and MM lead variants).
Micro-C analysis
To detect chromatin looping interactions between risk variants and target genes, we carried out Micro-C analysis as per refs. 93,94 with the following modifications: Cells were fixed at a density of 106 cells per ml in 3 mM disuccinimydyl glutarate (DSG) for 20 mins at room temperature (RT). After adding formaldehyde to a final concentration of 1%, cells were further incubated at room temperature for 10 mins. Reactions were quenched by adding glycine to a final concentration of 660 mM with incubation for 5 mins at RT. Fixed cells were digested with MNase (Worthington) optimized for each cell line and batch, with incubation for 10 mins at 37 °C, 1000 rpm. The reaction was quenched by EGTA at a final concentration of 12.5 mM, with incubation for 10 mins at 65 °C, 1000 rpm. End repair and biotin labeling was performed by incubating 106 MNase-digested cells using 30U of T4 PNK (NEB) at 37 °C for 15 mins, 1000 rpm. 35 U Large Klenow Fragment (NEB) was added and incubated at 37 °C for 15 mins. After biotin 14-dATP (Jena NU-809-BIOX), biotin 11-dCTP (Jena NU-835-BIO14), dTTP and dGTP were added (final concentration 66uM each), samples were incubated at 25 °C for 45 mins, 1000 rpm. Reactions were stopped using 40 mM EDTA and heating to 65 °C for 20 min. Ligation was performed with 10,000 U T4 DNA ligase, 23 °C for 3 hours, 1000 rpm. Biotin ends were excised using 200U Exonuclease III at 37 °C for 10 mins, 1000 rpm. Sequencing was conducted using a NovaSeq (Illumina). The Juicer (Aidenlab) pipeline was used to generate Hi-C maps from raw fastq files, and Mustache and FitHiC2 (Ay-lab) were used to call significant interactions.
Expression quantitative locus (eQTL) data sets
To identify cis-eQTLs in plasma cells, we analyzed gene expression profiles of CD138+ cells isolated from bone marrow aspirates from MM patients harvested using immunomagnetic beads. First, we used Affymetrix microarray data for 1445 subjects, including 183 UK Myeloma IX trial patients (a study aimed at comparing two bisphosphonates in the treatment of MM; Medical Research Council Leukaemia Data Monitoring and Ethics committee, no. MREC 02/8/95, ISRCTN68454111)95, 658 German GMMG patients, and 604 patients treated at the University of Arkansas for Medical Sciences Myeloma Center, USA6. Second, we used 185 RNA-seq samples from Lund University (Lund, Sweden)96. Third, we used 758 RNA-seq samples with DNA copy-number covariates from the CoMMpass study97. Fourth, we used 309 RNA-seq samples from the Dana Farber Cancer Institute (Boston, USA)98. For the first two data sets, paired SNP microarray genotypes were available. For the third and fourth data sets, only RNA-seq data were available, limiting eQTL analysis to risk alleles with these coding proxies. Additionally, we used mRNA-sequencing data for 28 sorted immune cell populations from 416 individuals from the ImmunExUT compendium99. For B cells, we used eQTL data for 758 Icelanders generated by isolating B-cells from peripheral blood through negative selection using magnetic beads (StemCell Technologies 19674). We used eQTL data from deCODE Genetics (RNA-seq for 17,848 Icelanders) for whole blood.
Selection of target genes and putative causal variants
To identify target genes underlying the MM risk associations, we considered genes overlapping a region defined by the variants in high LD (r2 > 0.8) with each lead variant. Additionally, we considered genes displaying chromatin looping interactions with these regions, as determined by the Micro-C data. Among 371 admissible genes in total, we prioritized genes as probable target genes if they: (i) contained a potentially pathogenic coding variants correlated (r2 > 0.8) with the MM lead variant; (ii) contained a variant in the expressed sequence of a long non-coding RNAs correlated (r2 > 0.8) with the MM lead variant; (iii) had expression quantitative trait loci (eQTLs) in plasma cells or another B-cell population (r2 > 0.8 between the eQTL and MM lead variant); or (iv) a Bonferroni-significant TWAS signal within 1 Mb of the MM lead variant. We considered potentially pathogenic variants as frameshift, stop-gain, stop-loss, and splice variants; computationally predicted-pathogenic missense variants, and missense variants in functionally well-characterized protein domains. At loci where no effect on expression could be identified in the B-cell lineage, we accepted eQTLs in other hematologic cell populations. At loci, where no gene fulfilled any of our criteria, we prioritized the closest gene.
We searched for putative causal gene-regulatory variants to obtain mechanistic support for the identified effects on gene expression. In the newly generated MPRA data (KMS11, RPMI-8226, and L363 cells), we nominated variants with false discovery rate (FDR) < 5% in at least two cell lines and absolute log2 fold-change >0.2 in at least one of these. In the published MPRA data (L363 and MOLP8 cells), we nominated variants with FDR < 5% in both cell lines and absolute log2 fold-change >0.2 in at least one of them. In addition, we nominated variants with significant effects in luciferase assays. We only considered effects in the same direction as the corresponding eQTL/TWAS signal.
To obtain further support a functional impact of the gene itself, we examined effects of CRISPR/Cas9 and shRNA knockdown on MM and lymphoid cell line growth using data from the Dependency Map (DepMap; https://www.depmap.org; version 23Q2), associations with human Mendelian cancer predisposition syndrome, congenital B-cell immunodeficiencies, and occurrence of recurrent somatic mutations in MM.
Massive parallel reporter assays
In addition to making use of our published MPRA data for the L363 and MOLP8 MM cell lines for 21 risk loci11, we generated an expanded MPRA dataset for 23 of the risk loci using the KMS11, RPMI8226, and L363 MM cell lines100. Single-base pair variants in LD (r2 ≥ 0.4) of the lead variant were included at each locus. Candidate regulatory sequences (CRS) were designed in the forward and reverse direction for reference (ref) and alternate (alt) alleles. Variants were centered in a 200-bp region. 230-bp oligos were synthesised (Agilent) with the CRS between 15 bp adapters- ACTGGCCGCTTGACG**CRS**CACTGCGGCTCCTGC. Two rounds of PCR were used to add a minimal promoter (primers 5BC-AG-f01v2 and 5BC-AG-r01v2; Supplementary Data 23) and a 15 bp random barcode. Amplified fragments were cloned by Gibson assembly into the SbfI/AgeI site of the pLS-SceI vector (Addgene no. 137725) before transformation into electrocompetent E.coli for plasmid amplification. pLS-SceI was a gift from Nadav Ahituv (Addgene no. 137725). Sanger sequencing was used to confirm successful construction. The purified plasmid was sequenced (Mi-seq) with custom primers (pLSmP-ass-seq-R1v2 and PLSmP-ass-seq-R2v2; Supplementary Data 23). The association function in the MPRAflow100 pipeline was used to map unique barcodes for each CRS. A lentivirus library was generated by transfecting HEK293T cells with the plasmid library. After two days, the supernatant was collected and concentrated, and this lentiviral library was used to transduce KMS11, RPMI-8226, and L363 cells in triplicate. After three days, DNA and RNA were harvested, plasmid RNA reverse transcribed, and plasmid DNA and cDNA amplified by PCR, further adding adapters for final NovaSeq sequencing (Illumina). MPRAflow100 was used to count barcodes and log2 DNA/RNA ratios for each CRS. Activity of ref vs alt allele was calculated using MPRAnalyze101, with CRS direction, barcode, and replicate as covariates. Primers were used as published except those referenced in Supplementary Data 23, which were used to accommodate novel adaptor sequences.
The pre-existing MPRA data for L363 and MOLP8 cells are described in ref. 11. In short; we screened 1039 variants in high LD (r2 > 0.8) with MM lead variants. For each one, we designed twelve 120 bp oligonucleotide sequences corresponding to reference and alternative alleles in six genomic contexts (both strands × three sliding windows with the variant at −20, 0, and +20 bp from the center). Sequences were coupled to a reporter gene with random 20 bp sequence barcodes 3′ of its open reading frame. Following transfection, the transcriptional activity of each construct was measured by determining the barcode representation in reporter mRNA relative to DNA, calculated using MPRAscore102. Plasmid sequencing identified 1.73 × 106 unique barcodes tagging 12,378 (99.2%) of the 12,468 designed oligonucleotides. F
Luciferase reporter assays
For loci not evaluated by MPRA, we performed luciferase assays. A region surrounding the variant (120 bp or 250 bp) was cloned into luciferase reporter constructs (pGL4.23[luc2/minP] or pGL3 basic; Promega). Constructs and renilla control vectors were transfected by nucleofection with Amaxa (Lonza), using kit V program X-01 for KMS-11 cells; or the Neon electroporation system (Life Technologies) using 2 pulses at 1250 V, 10 ms for Raji; 1 pulse 20 ms 1550 V for U266B1 and 1400 V, 3 pulses, 10 ms for L363. Cells were harvested after 20–24 h incubation at 37 °C, and 5% CO2 and luciferase activity were quantified (DualGlo, Promega E1960). Two technical replicates of each of the three biological replicates were normalized to the renilla control. Biological replicates were mean-centred, and a change in transcriptional activity was calculated as the difference in normalised reads between the reference and alternate alleles. Significance was calculated with a two-sided, paired t-test.
Additional gene expression data sets
To test for enrichment of target gene expression in hematopoietic cell types, we used bulk RNA-seq data for sorted blood cell populations103, and pseudo-bulked single-cell mRNA-seq data for 35,882 mononuclear blood and bone marrow cells104.
Protein quantitative locus (pQTL) analysis
Plasma samples collected from 46,665 UK Biobank participants of European descent were analyzed using Olink (UK Biobank application no. 65851)105. The Olink platform consists of 2941 immunoassays targeting 2925 proteins. The measurements were quantile-normalized and adjusted for age, sex, and sample age. Association testing was performed using a linear mixed model106. LD score regression was used to account for inflation in test statistics due to cryptic relatedness and stratification107. P-values were computed using a likelihood-ratio test, and the significance threshold was set to 1.8 × 10−9. 24,824 sentinel trans-pQTLs were discovered after recursive conditional analysis to dissect secondary pQTLs and LD-based clumping105. We used SomaScan v4 data for 35,892 Icelanders47 for replication, representing 4907 aptamer-based assays targeting 4719 proteins. The same pipeline was used to derive the lead trans-pQTLs as described for the Olink data. To assess whether the MM variants affect the levels of plasma proteins measured using the Olink platform, we searched for pQTL lead variants that co-localize (r2 > 0.8) with MM risk variants and found pQTLs for 21 proteins (Supplementary Data 17). Six of the proteins associated with more than one MM variant, suggesting a potential causal relationship with MM. To test this, we performed Mendelian Randomization analysis between each of these proteins and MM. Variant effects on proteins coded by IL5RA and BCMA showed significant association with MM risk (PIVW = 5.6 × 10−5 and PIVW = 9.0 × 10−13, respectively) and were therefore investigated further.
Mendelian randomisation analysis
Two-sample Mendelian Randomisation (2S-MR) was used to examine the causal relationship between leukocyte telomere length (LTL) and pQTLs (exposures) with MM risk (outcome) using the TwoSampleMR package108,109. Association data for LTL were obtained from ref. 46. For each variant, effect estimates, and standard errors were retrieved. Variants were considered potential instruments if they were associated at P < 5 × 10−8, minor allele frequency > 0.01. To avoid co-linearity, correlated variants were excluded (r2 ≥ 0.01). For each variant, causal effect estimates were generated as odds ratios per one standard deviation unit increase in LTL (ORSD), with 95% confidence intervals (CIs), using the Wald ratio (Supplementary Data 20). Causal effects were also estimated using a random-effects inverse weighted variance (IVW-RE) model, which assumes each variant identifies a different causal effect. To assess robustness, we compared causal estimates and associated P-values using weighted median (WME) and weighted mode-based (WMBE) methods (Supplementary Data 15). Directional pleiotropy was assessed using MR-Egger regression, and the Steiger test was used to infer the direction of causal effect for exposures (Supplementary Data 16). For this, we estimated the PVE using Cancer Research UK lifetime risk estimates for MM. A leave-one-out strategy under the IVW-RE model was employed to assess the potential impact of outlying and pleiotropic variants (Supplementary Data 21).
Bayesian test for colocalisation
To test if pleiotropic associations reflect shared variants, we performed colocalization using analysis using Coloc110 across 1 Mb genomic regions of either side of lead variants of interest. Coloc enumerates four possible configurations of causal variants for two traits, calculating support for each model based on a Bayes factor. Adopting default prior probabilities, a posterior probability ≥ 0.80 was considered as supporting a specific model.
CRISPR/Cas9 deletion of variant-harboring regions
To delete the rs4273077 and 4792800-harboring regions in TNFRSF13B, we used dual-sgRNA CRISPR/Cas9 genome editing. sgRNA pairs were selected using CRISPOR (crispor.org; Supplementary Data 24) and cloned into the pSpCas9(BB)-2A-GFP PX458 vector (Addgene no. 48138). Cloned sgRNA pairs were co-transfected (ThermoFisher Neon) into Raji cells. After 24 hours, GFP-positive cells were isolated by fluorescence-activated cell sorting. RNA was extracted (RNeasy plus micro kit; Qiagen) and reverse-transcribed. Using TaqMan™ Fast Advanced Master Mix (Applied BioSystems) and PrimeTime qPCR assays (IDT), we quantified the mRNA levels of the two main TNFRSF13B transcript isoforms with ATCB and GAPDH as controls (Supplementary Data 25). To verify deletion efficiency, the targeted regions were PCR-amplified from genomic DNA and analyzed on 2% agarose gels (Supplementary Data 24 and Supplementary Fig. 8).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Genotyping data have been deposited in Gene Expression Omnibus (GEO) with accession codes GSE21349, GSE19784, GSE24080, GSE2658, and GSE15695[https://www.ncbi.nlm.nih.gov/geo/]; in the European Genome-phenome Archive (EGA) with accession code EGAD50000000422 [https://ega-archive.org/studies/EGAS50000000292] in the European Bioinformatics Institute (EMBL-EBI) ArrayExpress repository with accession code E-MTAB-362 and E-TABM-1138[https://www.ebi.ac.uk/biostudies/arrayexpress/]; and the database of Genotypes and Phenotypes (dbGaP) with accession code phs000207.v1.p1[https://www.ncbi.nlm.nih.gov/gap/]. Summary-level GWAS data are available through EGA under accession numbers EGA50000000280, EGAS50000000292, EGAZ50000000827, and EGAZ50000000828[https://ega-archive.org/]. Expression data have been deposited in GEO with accession codes GSE21349, GSE2658, and GSE31161 [https://www.ncbi.nlm.nih.gov/geo/] and in EMBL-EBI ArrayExpress with accession code E-MTAB-2299[https://www.ebi.ac.uk/biostudies/arrayexpress/]. The accession number for the KMS11 ChIP-seq data is EGA: S00001002414[https://ega-archive.org/]. The GM12878 chromatin data is publicly available from UCSC. The sequencing data for the MPRA experiment have been deposited in the Sequence Read Archive, accession no. PRJNA679966. The ATAC-seq data for CD138+ MM plasma cells have been deposited in the EGA, accession no. EGAS00001005394 and EGAD00001007814[https://ega-archive.org/]. Publicly available eQTL data from the eQTLGen Consortium[http://www.eqtlgen.org] and gene expression data from the NCBI Gene Expression Omnibus (GEO) repository, accession numbers GSE111199, GSE24759, GSE15695, GSE4581, GSE19784, GSE26760, and GSE5900[https://www.ncbi.nlm.nih.gov/geo/]. Genotype data for the UK Biobank data and the proteomics data can be accessed at https://ukbiobank.dnanexus.com/landing. The UK Biobank Resource was used under application number 65851. The Icelandic genomic data and proteomics data have been described previously52. While these individual-level data cannot be shared by Icelandic law, we are open to collaborations, as we have been in the past. The remaining data are contained within the paper and Supplementary Files.
References
Pertesi, M. et al. Genetic predisposition for multiple myeloma. Leukemia 34, 697–708 (2020).
Halvarsson, B. M. et al. Direct evidence for a polygenic etiology in familial multiple myeloma. Blood Adv. 1, 619–623 (2017).
Altieri, A., Chen, B., Bermejo, J. L., Castro, F. & Hemminki, K. Familial risks and temporal incidence trends of multiple myeloma. Eur. J. Cancer 42, 1661–1670 (2006).
Frank, C. et al. Search for familial clustering of multiple myeloma with any cancer. Leukemia 30, 627–632 (2016).
Mitchell, J. S. et al. Genome-wide association study identifies multiple susceptibility loci for multiple myeloma. Nat. Commun. 7, 12050 (2016).
Went, M. et al. Identification of multiple risk loci and regulatory mechanisms influencing susceptibility to multiple myeloma. Nat. Commun. 9, 3707 (2018).
Swaminathan, B. et al. Variants in ELL2 influencing immunoglobulin levels associate with multiple myeloma. Nat. Commun. 6, 7213 (2015).
Broderick, P. et al. Common variation at 3p22.1 and 7p15.3 influences multiple myeloma risk. Nat. Genet 44, 58–61 (2011).
Chubb, D. et al. Common variation at 3q26.2, 6p21.33, 17p11.2 and 22q13.1 influences multiple myeloma risk. Nat. Genet. 45, 1221–1225 (2013).
Duran-Lozano, L. et al. Germline variants at SOHLH2 influence multiple myeloma risk. Blood Cancer J. 11, 76 (2021).
Ajore, R. et al. Functional dissection of inherited non-coding variation influencing multiple myeloma risk. Nat. Commun. 13, 151 (2022).
Weinhold, N. et al. The CCND1 c.870G>A polymorphism is a risk factor for t(11;14)(q13;q32) multiple myeloma. Nat. Genet. 45, 522–525 (2013).
Li, N. et al. Genetic predisposition to multiple myeloma at 5q15 is mediated by an ELL2 enhancer polymorphism. Cell Rep. 20, 2556–2564 (2017).
Welter, D. et al. The NHGRI GWAS catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 42, D1001–D1006 (2014).
Knies, K. et al. Biallelic mutations in the ubiquitin ligase RFWD3 cause Fanconi anemia. J. Clin. Invest 127, 3013–3027 (2017).
Speedy, H. E. et al. Germ line mutations in shelterin complex genes are associated with familial chronic lymphocytic leukemia. Blood 128, 2319–2326 (2016).
Castigli, E. et al. TACI is mutant in common variable immunodeficiency and IgA deficiency. Nat. Genet 37, 829–834 (2005).
Vanegas, S., Ramirez-Montano, D., Candelo, E., Shinawi, M. & Pachajoa, H. DeSanto-shinawi syndrome: first case in south America. Mol. Syndromol. 9, 154–158 (2018).
Walker, B. A. et al. Identification of novel mutational drivers reveals oncogene dependencies in multiple myeloma. Blood 132, 587–597 (2018).
Rustad, E. H. et al. Revealing the impact of structural variants in multiple myeloma. Blood Cancer Discov. 1, 258–273 (2020).
Maura, F. et al. Genomic classification and individualized prognosis in multiple myeloma. J. Clin. Oncol. 42, 1229–1240 (2024).
Lin, M. et al. Identification of novel fusion transcripts in multiple myeloma. J. Clin. Pathol. 71, 708–712 (2018).
Manier, S. et al. Genomic complexity of multiple myeloma and its clinical implications. Nat. Rev. Clin. Oncol. 14, 100–113 (2017).
Amend, S. R. et al. Whole genome sequence of multiple myeloma-prone C57BL/KaLwRij mouse strain suggests the origin of disease involves multiple Cell types. PLoS One 10, e0127828 (2015).
Moreaux, J. et al. APRIL and TACI interact with syndecan-1 on the surface of multiple myeloma cells to form an essential survival loop. Eur. J. Haematol. 83, 119–129 (2009).
Hengeveld, P. J. & Kersten, M. J. B-cell activating factor in the pathophysiology of multiple myeloma: a target for therapy? Blood Cancer J. 5, e282 (2015).
Ju, S. et al. Correlation of expression levels of BLyS and its receptors with multiple myeloma. Clin. Biochem. 42, 387–399 (2009).
Novak, A. J. et al. Expression of BCMA, TACI, and BAFF-R in multiple myeloma: a mechanism for growth and survival. Blood 103, 689–694 (2004).
Moreaux, J. et al. The level of TACI gene expression in myeloma cells is associated with a signature of microenvironment dependence versus a plasmablastic signature. Blood 106, 1021–1030 (2005).
Moreaux, J. et al. TACI expression is associated with a mature bone marrow plasma cell signature and C-MAF overexpression in human myeloma cell lines. Haematologica 92, 803–811 (2007).
Mackay, F. & Schneider, P. TACI, an enigmatic BAFF/APRIL receptor, with new unappreciated biochemical and biological properties. Cytokine Growth Factor Rev. 19, 263–276 (2008).
Martincic, K., Alkan, S. A., Cheatle, A., Borghesi, L. & Milcarek, C. Transcription elongation factor ELL2 directs immunoglobulin secretion in plasma cells by stimulating altered RNA processing. Nat. Immunol. 10, 1102–1109 (2009).
Park, K. S. et al. Transcription elongation factor ELL2 drives ig secretory-specific mRNA production and the unfolded protein response. J. Immunol. 193, 4663–4674 (2014).
Nutt, S. L., Hodgkin, P. D., Tarlinton, D. M. & Corcoran, L. M. The generation of antibody-secreting plasma cells. Nat. Rev. Immunol. 15, 160–171 (2015).
Shaffer, A. L. et al. IRF4 addiction in multiple myeloma. Nature 454, 226–231 (2008).
Pjanic, M. et al. Nuclear factor I genomic binding associates with chromatin boundaries. BMC Genom. 14, 99 (2013).
Su, C. L., Deng, T. R., Shang, Z. & Xiao, Y. JARID2 inhibits leukemia cell proliferation by regulating CCND1 expression. Int J. Hematol. 102, 76–85 (2015).
Birger, Y., Ito, Y., West, K. L., Landsman, D. & Bustin, M. HMGN4, a newly discovered nucleosome-binding protein encoded by an intronless gene. DNA Cell Biol. 20, 257–264 (2001).
Sharma, A., Gerard, S. F., Olieric, N. & Steinmetz, M. O. Cep120 promotes microtubule formation through a unique tubulin binding C2 domain. J. Struct. Biol. 203, 62–70 (2018).
Wang, Y. et al. Rare variants of large effect in BRCA2 and CHEK2 affect risk of lung cancer. Nat. Genet. 46, 736–741 (2014).
Sekulovic, S. et al. Prolonged self-renewal activity unmasks telomerase control of telomere homeostasis and function of mouse hematopoietic stem cells. Blood 118, 1766–1773 (2011).
Brümmendorf, T. H. & Balabanov, S. Telomere length dynamics in normal hematopoiesis and in disease states characterized by increased stem cell turnover. Leukemia 20, 1706–1716 (2006).
Allsopp, R. C., Cheshier, S. & Weissman, I. L. Telomere shortening accompanies increased cell cycle activity during serial transplantation of hematopoietic stem cells. J. Exp. Med. 193, 917–924 (2001).
Fiorini, E., Santoni, A. & Colla, S. Dysfunctional telomeres and hematological disorders. Differ. Res. Biol. Diversity 100, 1–11 (2018).
Yamaguchi, H. et al. Mutations in TERT, the gene for telomerase reverse transcriptase, in aplastic anemia. N. Engl. J. Med. 352, 1413–1424 (2005).
Codd, V. et al. Polygenic basis and biomedical consequences of telomere length variation. Nat. Genet. 53, 1425–1433 (2021).
Speed, D., Hemani, G., Johnson, M. R. & Balding, D. J. Improved heritability estimation from genome-wide SNPs. Am. J. Hum. Genet. 91, 1011–1021 (2012).
Yarmolinsky, J. et al. Causal inference in cancer epidemiology: what Is the role of mendelian randomization? Cancer epidemiology, biomarkers & prevention: a publication of the American association for cancer research, cosponsored by the american society of preventive. Oncology 27, 995–1010 (2018).
Smith, G. D. & Ebrahim, S. Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int. J. Epidemiol. 32, 1–22 (2003).
Lawlor, D. A., Harbord, R. M., Sterne, J. A. C., Timpson, N. & Davey Smith, G. Mendelian randomization: using genes as instruments for making causal inferences in epidemiology. Stat. Med. 27, 1133–1163 (2008).
Aviv, A., Anderson, J. J. & Shay, J. W. Mutations, cancer and the telomere length paradox. Trends cancer 3, 253–258 (2017).
Ferkingstad, E. et al. Large-scale integration of the plasma proteome with genetics and disease. Nat. Genet 53, 1712–1721 (2021).
O’Neill, C. & van de Donk, N. T-cell redirecting bispecific antibodies in multiple myeloma: current landscape and future directions. EJHaem 4, 811–822 (2023).
Cowan, A. J. et al. gamma-Secretase inhibitor in combination with BCMA chimeric antigen receptor T-cell immunotherapy for individuals with relapsed or refractory multiple myeloma: a phase 1, first-in-human trial. Lancet Oncol. 24, 811–822 (2023).
Visram, A. et al. Serum BCMA levels predict outcomes in MGUS and smoldering myeloma patients. Blood Cancer J. 11, 120 (2021).
Seipel, K. et al. sBCMA plasma level dynamics and anti-BCMA CAR-T-cell treatment in relapsed multiple myeloma. Curr. Issues Mol. Biol. 44, 1463–1471 (2022).
Girgis, S. et al. Effects of teclistamab and talquetamab on soluble BCMA levels in patients with relapsed/refractory multiple myeloma. Blood Adv. 7, 644–648 (2023).
Alomari, M., Kunacheewa, C. & Manasanch, E. E. The role of soluble B cell maturation antigen as a biomarker in multiple myeloma. Leuk. Lymphoma 64, 261–272 (2023).
Wiedemann, A. et al. Soluble B-cell maturation antigen as a monitoring marker for multiple myeloma. Pathol. Oncol. Res 29, 1611171 (2023).
Xu, C., Gao, M., Zhang, J. & Fu, Y. IL5RA as an immunogenic cell death-related predictor in progression and therapeutic response of multiple myeloma. Sci. Rep. 13, 8528 (2023).
Horikawa, K. & Takatsu, K. Interleukin-5 regulates genes involved in B-cell terminal maturation. Immunology 118, 497–508 (2006).
Jonsson, S. et al. Identification of sequence variants influencing immunoglobulin levels. Nat. Genet 49, 1182–1191 (2017).
Liao, M. et al. Genome-wide association study identifies common variants at TNFRSF13B associated with IgG level in a healthy Chinese male population. Genes Immun. 13, 509–513 (2012).
Osman, W. et al. Association of common variants in TNFRSF13B, TNFSF13, and ANXA3 with serum levels of non-albumin protein and immunoglobulin isotypes in Japanese. PLoS One 7, e32683 (2012).
Fried, A. J., Rauter, I., Dillon, S. R., Jabara, H. H. & Geha, R. S. Functional analysis of transmembrane activator and calcium-modulating cyclophilin ligand interactor (TACI) mutations associated with common variable immunodeficiency. J. Allergy Clin. Immunol. 128, 226–228 e1 (2011).
Kumar, P., Henikoff, S. & Ng, P. C. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat. Protoc. 4, 1073–1081 (2009).
Adzhubei, I., Jordan, D. M. & Sunyaev, S. R. Predicting functional effect of human missense mutations using PolyPhen-2. Curr. Protoc. Hum. Genet Chapter 7, Unit7 20 (2013).
Morgan, G. J. et al. Cyclophosphamide, thalidomide, and dexamethasone as induction therapy for newly diagnosed multiple myeloma patients destined for autologous stem-cell transplantation: MRC Myeloma IX randomized trial results. Haematologica 97, 442–450 (2012).
Morgan, G. J. et al. Long-term follow-up of MRC Myeloma IX trial: Survival outcomes with bisphosphonate and thalidomide treatment. Clin. Cancer Res. Off. J. Am. Assoc. Cancer Res. 19, 6030–6038 (2013).
Jackson, G. H. et al. Lenalidomide maintenance versus observation for patients with newly diagnosed multiple myeloma (Myeloma XI): a multicentre, open-label, randomised, phase 3 trial. Lancet Oncol. 20, 57–73 (2019).
Bycroft, C. et al. Genome-wide genetic data on ~500,000 UK biobank participants. bioRxiv https://doi.org/10.1101/166298 (2017).
Marchini, J., Howie, B., Myers, S., McVean, G. & Donnelly, P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat. Genet 39, 906–913 (2007).
Liu, J. Z. et al. Meta-analysis and imputation refines the association of 15q25 with smoking quantity. Nat. Genet. 42, 436–440 (2010).
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet 88, 76–82 (2011).
Huang, J. et al. Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel. Nat. Commun. 6, 8111 (2015).
Via, M., Gignoux, C. & Burchard, E. G. The 1000 Genomes project: new opportunities for research and social challenges. Genome Med 2, 3 (2010).
Weinhold, N. et al. The 7p15.3 (rs4487645) association for multiple myeloma shows strong allele-specific regulation of the MYC-interacting gene CDCA7L in malignant plasma cells. Haematologica 100, e110 (2015).
Gamazon, E. R. et al. SCAN: SNP and copy number annotation. Bioinformatics 26, 259–262 (2010).
Stegle, O., Parts, L., Piipari, M., Winn, J. & Durbin, R. Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses. Nat. Protoc. 7, 500–507 (2012).
Gamazon, E. R. et al. A gene-based association method for mapping traits using reference transcriptome data. Nat. Genet. 47, 1091–1098 (2015).
Barbeira, A. N. et al. Integrating predicted transcriptome from multiple tissues improves association detection. bioRxiv https://doi.org/10.1101/292649 (2018).
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013).
Schmidl, C., Rendeiro, A. F., Sheffield, N. C. & Bock, C. ChIPmentation: fast, robust, low-input ChIP-seq for histones and transcription factors. Nat. Methods 12, 963–965 (2015).
Rendeiro, A. F. et al. Chromatin accessibility maps of chronic lymphocytic leukaemia identify subtype-specific epigenome signatures and transcription regulatory networks. Nat. Commun. 7, 11938–11938 (2016).
Hoffman, M. M. et al. Integrative annotation of chromatin elements from ENCODE data. Nucleic Acids Res. 41, 827–841 (2013).
Fiziev, P. et al. Systematic epigenomic analysis reveals chromatin states associated with melanoma progression. Cell Rep. 19, 875–889 (2017).
Schoenfelder, S. et al. Polycomb repressive complex PRC1 spatially constrains the mouse embryonic stem cell genome. Nat. Genet 47, 1179–1186 (2015).
Satterlee, J. S. et al. Community resources and technologies developed through the NIH roadmap epigenomics program. Methods Mol. Biol. 1238, 27–49 (2015).
Bernstein, B. E. et al. The NIH roadmap epigenomics mapping consortium. Nat. Biotechnol. 28, 1045–1048 (2010).
Cowper-Sal lari, R. et al. Breast cancer risk-associated SNPs modulate the affinity of chromatin for FOXA1 and alter gene expression. Nat. Genet 44, 1191–1198 (2012).
Kong, A. et al. Detection of sharing by descent, long-range phasing and haplotype imputation. Nat. Genet 40, 1068–1075 (2008).
Gudbjartsson, D. F. et al. Large-scale whole-genome sequencing of the Icelandic population. Nat. Genet 47, 435–4 (2015).
Krietenstein, N. et al. Ultrastructural details of mammalian chromosome architecture. Mol. Cell 78, 554–565.e7 (2020).
Hsieh, T. S. et al. Resolving the 3D landscape of transcription-linked mammalian chromatin folding. Mol. Cell 78, 539–553.e8 (2020).
Morgan, G. J. et al. First-line treatment with zoledronic acid as compared with clodronic acid in multiple myeloma (MRC myeloma IX): a randomised controlled trial. Lancet 376, 1989–1999 (2010).
Ali, M. et al. The multiple myeloma risk allele at 5q15 lowers ELL2 expression and increases ribosomal gene expression. Nat. Commun. 9, 1649 (2018).
Manojlovic, Z. et al. Comprehensive molecular profiling of 718 multiple myelomas reveals significant differences in mutation frequencies between African and European descent cases. PLoS Genet 13, e1007087 (2017).
Samur, M. K. et al. Long intergenic non-coding RNAs have an independent impact on survival in multiple myeloma. Leukemia 32, 2626–2635 (2018).
Ota, M. et al. Dynamic landscape of immune cell-specific gene regulation in immune-mediated diseases. Cell 184, 3006–3021 e17 (2021).
Gordon, M. G. et al. lentiMPRA and MPRAflow for high-throughput functional characterization of gene regulatory elements. Nat. Protoc. 15, 2387–2412 (2020).
Ashuach, T. et al. MPRAnalyze: statistical framework for massively parallel reporter assays. Genome Biol. 20, 183 (2019).
Niroula, A., Ajore, R. & Nilsson, B. MPRAscore: robust and non-parametric analysis of massively parallel reporter assays. Bioinformatics 35, 5351–5353 (2019).
Ulirsch, J. C. et al. Interrogation of human hematopoiesis at single-cell and single-variant resolution. Nat. Genet. 51, 683–693 (2019).
Granja, J. M. et al. Single-cell multiomic analysis identifies regulatory programs in mixed-phenotype acute leukemia. Nat. Biotechnol. 37, 1458–1465 (2019).
Sun, B. B. et al. Genomic atlas of the human plasma proteome. Nature 558, 73–79 (2018).
Loh, P. R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet 47, 284–290 (2015).
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Hemani, G. et al. The MR-base platform supports systematic causal inference across the human phenome. eLife 7, e34408 (2018).
Hemani, G., Tilling, K. & Davey Smith, G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLoS Genet 13, e1007081 (2017).
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet 10, e1004383 (2014).
Acknowledgements
This work was supported by grants from the Swedish Research Council (2017-02023 and 2018-00424 to B.N.), the Swedish Cancer Society (200696 to B.N.), the Nordic Cancer Union (R217-A13329-18-S65 to B.N.), European Research Council (EU-MSCA-COFUND 754299 and 847583 CanFaster to B.N., L.D.L., N.U.D., Z.A.), Stiftelsen Borås Forsknings- och Utvecklingsfond mot Cancer (to B.N., U-H.M.), Myeloma UK (to R.S.), Cancer Research UK (C1298/A8362 to R.S.), European Commission Horizon 2020 (856620 to K.H.), the Dietmar Hopp Foundation (to H.G.), National Institute of Health Research and Institute of Cancer Research (to A.S.), Academy of Medical Sciences (SGL024/1013), and Mr. Ralph Stockwell. We are indebted to the patients who participated in the study.
Funding
Open access funding provided by Lund University.
Author information
Authors and Affiliations
Contributions
B.N. and R.H. designed the study. M.W., B.N., and L.D.L. performed bioinformatic analyses. G.H., G.T., L.S., A.S., P.L., M.T., M.P., L.E., N.W., T.O., A.N., A.L.A., and A.L.d.L.P. performed additional bioinformatic analyses. A.G., L.D.L., M.W., and N.U.D. performed the main laboratory experiments. C.C., R.A., M.P., and Z.A. contributed to the experiments. M.H., T.O., T.R., U.T., K.S., A.J.V., A.W., H.G., G.M., I.J., K.H., S.N.S., L.S., S.R., S.Y.K., M.K., U-H.M., T.J.L., P.Su., P.So., E.F., R.H., P.V., U.P-K., F.S., H.T., A.F., C.S., F.v.R, and G.H.E. contributed samples or association data. R.H., B.N., M.W., L.D.L., T.R., G.T., U.T., and G.H. drafted the manuscript. All authors contributed to the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
T.O., G.H.H., G.T., L.S., P.Su., E.F., G.H.E., I.J., T.R., U.T., S.N.S., and K.S. are employed by deCODE Genetics/Amgen Inc. The remaining authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Francesco Maura and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Went, M., Duran-Lozano, L., Halldorsson, G.H. et al. Deciphering the genetics and mechanisms of predisposition to multiple myeloma. Nat Commun 15, 6644 (2024). https://doi.org/10.1038/s41467-024-50932-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-50932-7
- Springer Nature Limited