Atypical splicing variants in PKD1 explain most undiagnosed typical familial ADPKD

Hort, Yvonne; Sullivan, Patricia; Wedd, Laura; Fowles, Lindsay; Stevanovski, Igor; Deveson, Ira; Simons, Cas; Mallett, Andrew; Patel, Chirag; Furlong, Timothy; Cowley, Mark J.; Shine, John; Mallawaarachchi, Amali

doi:10.1038/s41525-023-00362-z

npj Genomic Medicine, volume 8, article number 16 (2023)

Open access article, published 07 July 2023

Abstract

Autosomal dominant polycystic kidney disease (ADPKD) is the most common monogenic cause of kidney failure and is primarily associated with PKD1 or PKD2. Approximately 10% of patients remain undiagnosed after standard genetic testing. We aimed to utilise short and long-read genome sequencing and RNA studies to investigate undiagnosed families. Patients with a typical ADPKD phenotype who remained undiagnosed after genetic diagnostic testing were recruited. Probands underwent short-read genome sequencing, with analysis of PKD1 and PKD2 coding and non-coding regions followed by genome-wide analysis. Targeted RNA studies investigated variants suspected to impact splicing. Those still undiagnosed then underwent Oxford Nanopore Technologies long-read genome sequencing. From over 172 probands, 9 met inclusion criteria and consented. A genetic diagnosis was made in 8 of 9 (89%) families undiagnosed on prior genetic testing. Six had variants impacting splicing, five in non-coding regions of PKD1. Short-read genome sequencing identified novel branchpoint, AG-exclusion zone and missense variants generating cryptic splice sites and a deletion causing critical intron shortening. Long-read sequencing confirmed the diagnosis in one family. Most undiagnosed families with typical ADPKD have splice-impacting variants in PKD1. We describe a pragmatic method for diagnostic laboratories to assess PKD1 and PKD2 non-coding regions and validate suspected splicing variants through targeted RNA studies.

Introduction

Autosomal dominant polycystic kidney disease (ADPKD) is the most common monogenic cause of kidney failure, affecting approximately 1 in 1000 people¹. The condition is primarily caused by disease-causing variants in PKD1 and PKD2. Genetic diagnosis of ADPKD is technically challenging due to six pseudogenes that are >97% homologous in sequence to the genuine PKD1 gene¹. This sequence homology has driven the development of specific genetic diagnostic techniques to robustly sequence PKD1 that have focussed mainly on the analysis of the protein-coding regions of PKD1, PKD2 and then the wider exome. These techniques include long-range polymerase chain reaction (LR-PCR) and Sanger sequencing, targeted next-generation sequencing using probes specific to coding regions of cystic-related genes (tNGS), exome sequencing and genome sequencing with coding-based analysis^1,2,3,4,5,6. The diagnostic yield from these studies differs based on the clinical breadth of the cohort, ranging from approximately 60% in phenotypically broad cohorts to >90% in cohorts tightly selected for features typical of PKD1 and PKD2-mediated disease^1,3,4,5,7. Even with the most stringent coding-based analysis, at least 7% of ADPKD families are left without a genetic diagnosis⁷.

In recent years, there have been substantial advances in understanding the breadth of ADPKD gained through investigating genetically undiagnosed patients. New genes have been identified that contribute to the ADPKD spectrum, including GANAB, DNAJB11, IFT140, ALG5, ALG8 and ALG9^5,7,8. Despite these advances in disease knowledge and extensive coding-region focussed analysis, there remains a cohort of patients with a typical ADPKD phenotype who are without a genetic diagnosis. It is thus an open question as to whether this is due to technical limitations in identifying causative variants within PKD1 or PKD2 or the existence of an unknown ‘PKD3’ gene that is associated with a typical ADPKD phenotype.

We aimed to address this question by investigating a cohort selected to have a typical ADPKD phenotype, a positive family history and be undiagnosed on standard diagnostic genetic testing. We aimed to investigate whether these patients had variants in previously unidentified genes or, as other diseases suggest, novel variants in the most likely genes of interest – PKD1 and PKD2. To approach this challenge, we applied sequencing methods not previously extensively used in ADPKD, including short and long-read genome sequencing combined with targeted RNA sequencing.

We have previously shown that short-read genome sequencing is a robust diagnostic method in ADPKD, allowing the detection of single nucleotide, short indel and structural variants^1,4. However, though the whole genome is sequenced, current diagnostic laboratory protocols essentially limit analysis to protein-coding regions of the genome. This particularly biases against the detection of non-coding variants that may impact splicing. To date, there has been limited study of potential atypical splice-impacting variants in ADPKD and how best to predict their pathogenicity^9,10,11. Another challenge for diagnostic laboratories is in clarifying the pathogenicity of identified variants of uncertain significance (VUS). Even if non-coding variants are identified, pathogenicity confirmation of these variants typically requires functional analysis that is not routinely performed in diagnostic laboratories¹². Pathogenicity can also be clarified in some instances by phasing of variants (confirming which allele the variant is present on), which is not usually possible with short-read sequencing. More recently available long-read technologies, such as Oxford Nanopore Technologies (ONT), have been shown to inform phasing in other disease groups, but this has not been previously applied in ADPKD¹³. Our previous studies in genome sequencing diagnostics in ADPKD have informed this study, and patients left undiagnosed from these previous cohorts were assessed for suitability for this study^1,4. In this study, we report for the first time the combination of short and long-read genome sequencing with whole genome analysis and RNA studies to investigate ADPKD families without a diagnosis after standard diagnostic genetic testing.

Results

Over 172 patients were assessed for suitability for recruitment. This included 28 patients from a cohort of typical ADPKD who had undergone genome sequencing, 144 patients from a cohort of suspected ADPKD who had undergone diagnostic genome sequencing and a cohort of patients with typical and atypical PKD reviewed at multidisciplinary kidney genetics clinics from across Australia who had undergone diagnostic genetic testing^1,4,14. From this initial pool of over 172 probands, 9 families were recruited who met the study inclusion criteria (Fig. 1). Patients with atypical clinical features or no family history were deemed ineligible as they did not meet the inclusion criteria. Recruitment was restricted to those with a family history of ADPKD in order to target analysis toward inherited germline rather than mosaic variants. All patients had previously undergone standard diagnostic genetic testing via LR-PCR of PKD1 and PKD2 coding regions and massively parallel sequencing of this PCR-product (2 probands) or diagnostic genome sequencing with analysis targeted to coding regions of a cystic kidney disease gene panel (6 probands) or both (1 participant) (Supplementary Table 2). An additional seven patients met the inclusion criteria and had a VUS identified in PKD1 on initial diagnostic genetic testing but were not consented to further research analysis and therefore did not proceed to this study (Fig. 1).

Four of nine probands had ESKD (End Stage Kidney Disease), and all had enlarged kidney lengths with numerous kidney cysts on imaging (Table 1 & Supplementary Table 2). Five of nine probands had extra-renal features of ADPKD reported (Supplementary Table 2).

Table 1 Patient characteristics and overall results.

After genome sequencing and whole genome analysis, a genetic diagnosis (identification of a Pathogenic or Likely Pathogenic variant) was made in eight out of nine families, with all having disease-causing variants in PKD1 (Table 1 and Fig. 2). The remaining family (RPA007) had a VUS identified in PKD1. Six of the disease-causing variants were shown through RNA studies to impact splicing. Four of these splicing variants had been identified on the initial diagnostic testing (including one coding variant) but classified as of uncertain significance, with segregation studies not able to clarify pathogenicity (Supplementary Table 2).

**Fig. 2: Overview of variant types identified in *PKD1*.**

PKD1 intron 37 splice-impacting variants

Patient RBW403 had a clinical diagnosis of ADPKD made at 12 years of age in the context of a known diagnosis in his father, who reached ESKD at 44 years of age (Table 1 and Fig. 3A). Through this study, a novel variant was identified in intron 37 of PKD1 (c.11017-25A>G) that was predicted by the in silico splice prediction tool Introme to interrupt the splicing branchpoint (Fig. 3C, D). This variant was absent in control databases and had not previously been reported in ADPKD cohorts. It had been detected on both previously performed diagnostic tests (next-generation sequencing of LR-PCR amplicons targeted to PKD1 and PKD2 and then genome sequencing) but was predicted to be benign based on the in silico tools available at the time. Introme predicted multiple potential splicing impacts. The predominant interpretation was that this branchpoint variant would largely result in the skipping of exon 38, introducing a premature stop codon. An alternate interpretation was that the presence of a wildtype cryptic splice site 156 base pairs upstream of the c.11017-25A>G branchpoint variant would result in retention of 180 bp of intron 37 at a reduced frequency (Fig. 3D). RNA studies revealed evidence for both splicing outcomes, with the skipping of exon 38 being far more prevalent than intron retention. A review of control GTEx RNA data suggests low-level (5%) natural alternative splicing of exon 38 in kidney samples (Fig. 3B and Supplementary Table 3).

**Fig. 3: Disease-causing splicing variants in *PKD1* intron 37.**

Two additional patients in this cohort (RPA028 and RPA014, Table 1 and Fig. 3A) were identified to have different variants in intron 37 that were also predicted to interrupt the usual function of the exon 38 acceptor splice site (Fig. 3C, D). RPA028 had a different nucleotide substitution (PKD1 c.11017-25A>C), interrupting the same branchpoint as in RBW403. RPA014 had a variant 10 base pairs from the start of exon 38 (PKD1 c.11017-10C>A) that impacted the acceptor splice site through the inclusion of an ‘AG’ dinucleotide in the AG Exclusion Zone¹⁵. RNA studies in both patients demonstrated a similar impact to that seen in RBW403, with a combination of skipping of exon 38 and partial retention of intron 37 (Fig. 3C).
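The intron 37 variants above are written in HGVS coding-DNA notation, where a negative offset counts back from the first base of the following exon and a positive offset counts forward from the last base of the preceding exon. As a minimal sketch of how such names can be sorted into exonic, donor-side and acceptor-side classes before splice prediction, the snippet below parses simple substitutions only. The function name `classify_substitution` and the returned field names are illustrative assumptions and are not part of Introme or of the study pipeline.

```python
import re

# Matches simple coding-DNA substitutions such as c.1991C>T, c.7489+5G>A or c.11017-25A>G.
_HGVS_SUB = re.compile(r"^c\.(\d+)([+-]\d+)?([ACGT])>([ACGT])$")


def classify_substitution(hgvs: str) -> dict:
    """Classify a simple HGVS c. substitution (hypothetical helper, substitutions only)."""
    # Tolerate spaced forms such as 'c.11017-25 A > G' by removing all whitespace first.
    compact = re.sub(r"\s+", "", hgvs)
    match = _HGVS_SUB.match(compact)
    if match is None:
        raise ValueError(f"unsupported HGVS expression: {hgvs!r}")
    position, offset_text, ref, alt = match.groups()
    offset = int(offset_text) if offset_text else 0
    if offset == 0:
        region = "exonic"
    elif offset < 0:
        # Negative offsets lie in the intron upstream of the exon starting at `position`.
        region = "intronic, acceptor side"
    else:
        # Positive offsets lie in the intron downstream of the exon ending at `position`.
        region = "intronic, donor side"
    return {
        "exon_base": int(position),
        "offset": offset,
        "ref": ref,
        "alt": alt,
        "region": region,
    }


if __name__ == "__main__":
    for name in ("c.11017-25A>G", "c.11017-10C>A", "c.7489+5G>A", "c.1991C>T"):
        print(name, classify_substitution(name)["region"])
```

A classification like this only locates a variant relative to the exon boundary; whether it disrupts a branchpoint or the AG exclusion zone still depends on the in silico prediction and the RNA studies described in the text.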

Generation of novel cryptic donor splice site

Patient RPA019 had a clinical diagnosis of ADPKD made at 29 years of age during screening as a potential kidney donor for her affected brother (Table 1 and Supplementary Fig. 1A). Previous diagnostic testing had identified a missense variant classified as a VUS in exon 10 of PKD1 (c.1991C>T). RPA019 underwent genome sequencing and whole genome analysis, and no additional phenotype-relevant variants were identified. Introme predicted that the c.1991C>T variant would generate a new cryptic donor site and result in an in-frame deletion of 36 amino acids (Supplementary Fig. 1C). RNA studies supported this prediction (Supplementary Fig. 1B). The variant segregated to the proband’s affected mother and was absent in population datasets, though alternate amino acid substitutions at the same residue and substitutions at the same nucleotide are reported in population datasets. To our knowledge, this variant has not previously been reported in ADPKD cohorts.

Intronic deletion causing critical shortening of intron length

19F00138 and her sister (RBW401) both had a clinical diagnosis of ADPKD, with bilateral kidney enlargement, multiple kidney cysts and multi-generational family history of ADPKD (Supplementary Fig. 1D). Clinical short-read genome sequencing in 19F00138 had been non-diagnostic. Re-analysis of the genome sequencing data identified a 19 bp deletion within intron 31 of PKD1 that was predicted to result in shortening of the intron beneath its critical length and, therefore, intron retention¹⁶ (Supplementary Fig. 1E). This variant was segregated to RBW401 and RNA studies in her demonstrated retention of intron 31, creating a frameshifting insertion (Supplementary Fig. 1F). This variant has been reported previously in a patient with a de novo ADPKD phenotype¹⁷.

Extended donor splice site variant

RPA021 and her brother both had a clinical diagnosis of ADPKD, with both undergoing kidney transplantation in their 50s (Supplementary Fig. 2A, B). In our previous study¹, a variant of uncertain significance had been identified in intron 18 of PKD1 (c.7489+5G>A). Though suspicious for being disease-causing via disruption of the native splice site, there was insufficient evidence to confirm pathogenicity without support from functional studies. This variant has also been reported previously by our research group in an unrelated patient who was part of a cohort of patients who underwent clinical PKD testing via short-read genome sequencing⁴. The patient in this previous study (Pt D158) was not known to be related to RPA021 and did not share ethnicity⁴. RNA studies in RPA021 demonstrated that the c.7489+5G>A variant in PKD1 resulted in the retention of 93 base pairs of intron 18, introducing a premature stop codon (Supplementary Fig. 2C, D).
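The splicing outcomes reported in this section differ in how they affect the reading frame: a 36 amino acid deletion removes 108 nucleotides and stays in frame, while the 93 bp of retained intron 18 is also a multiple of three yet still introduces a premature stop codon. The sketch below illustrates that arithmetic. The retained sequence used in the example is invented purely for illustration (it is not PKD1 sequence), the helper names are hypothetical, and the scan assumes the retained segment begins at a codon boundary.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def frame_effect(net_change_nt: int) -> str:
    """Report whether a net gain or loss of nucleotides in the mRNA preserves the reading frame."""
    return "in-frame" if net_change_nt % 3 == 0 else "frameshift"


def first_in_frame_stop(retained: str):
    """Return the 0-based index of the first in-frame stop codon, or None if there is none.

    Assumption: the retained sequence starts at a codon boundary, so codons are read
    from index 0 in steps of three.
    """
    retained = retained.upper()
    for start in range(0, len(retained) - 2, 3):
        if retained[start:start + 3] in STOP_CODONS:
            return start
    return None


if __name__ == "__main__":
    # 36 amino acids removed by the cryptic donor site corresponds to 36 * 3 nucleotides.
    print("36 aa deletion:", frame_effect(-36 * 3))
    # 93 bp of retained intron keeps the frame by length alone.
    print("93 bp retention:", frame_effect(93))
    # Invented sequence: an out-of-frame TGA at index 1 is ignored; the in-frame TGA is reported.
    print("first in-frame stop at index:", first_in_frame_stop("GTGAGTCCCTGAGGC"))
```

The point of the second helper is that an insertion whose length is divisible by three can still truncate the protein if the retained intronic sequence carries a stop codon in the reading frame.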

Coding PKD1 variants

The RG_0044 family had a multi-generational history of ADPKD (Supplementary Fig. 3). Participant RG_0044.0048 had previously undergone diagnostic genetic testing via next-generation sequencing of LR-PCR amplicons targeted to PKD1 and PKD2, and no clinically significant variants had been identified. Genome sequencing and analysis identified a previously reported, likely pathogenic missense variant in PKD1 p.(Gly960Ser) that had not been identified on the previous diagnostic testing¹⁸. This variant was appropriately segregated to six affected and unaffected family members (Supplementary Fig. 3).

RPA017 had a clinical diagnosis of ADPKD made at 34 years of age (Fig. 4 and Supplementary Table 2). She was motivated to obtain a genetic diagnosis to inform in vitro fertilisation (IVF) and preimplantation genetic diagnosis (PGD). Her mother (RPA015) was diagnosed with ADPKD with stage 3a chronic kidney disease (CKD) in her 50s in the context of a diagnosis in her own mother. RPA017’s father (RPA016) had a normal kidney ultrasound at 72 years of age. Genome sequencing in RPA017 identified a nonsense variant in exon 31 of PKD1. However, segregation by LR-PCR and Sanger sequencing demonstrated that this variant was absent in her affected mother and unaffected father (paternity confirmed). Subsequent genome sequencing in RPA015 identified a nonsense variant in exon 5 of PKD2 that was absent in her affected daughter. Both variants met ACMG criteria for likely pathogenic, though the inheritance pattern in the family was unclear. In addition, PGD for PKD1 variants is contingent on multigenerational linkage studies. The information from short-read sequencing was not adequate to inform these linkage studies as the parent of origin of the PKD1 pathogenic allele was unknown. To clarify the inheritance in this family, ONT long-read sequencing was performed to facilitate variant phasing, which demonstrated that the PKD1 variant identified in RPA017 was present on the allele she inherited from her father. Sanger sequencing showed that the variant was absent in her father’s peripheral blood DNA, strongly suggesting this was a de novo variant in RPA017 (Fig. 4). Phasing also confirmed that RPA017 had not inherited the affected PKD2 allele from her mother. This information was used to inform linkage studies for PGD.
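The parent-of-origin reasoning used for RPA017 can be illustrated with a toy example: once long reads place the variant on one phased haplotype, the alleles at nearby heterozygous sites on that haplotype are compared with each parent's genotypes. This is a simplified sketch and not the analysis performed in the study (which used Longshot for phasing); the site names, alleles and the function `parent_of_origin` are all hypothetical.

```python
def parent_of_origin(haplotype: dict, father: dict, mother: dict) -> str:
    """Infer which parent could have transmitted a phased haplotype.

    haplotype: site -> allele observed on the variant-carrying haplotype of the child.
    father, mother: site -> pair of alleles (unphased genotype) at the same sites.
    """
    father_consistent = True
    mother_consistent = True
    informative_sites = 0
    for site, allele in haplotype.items():
        in_father = allele in father[site]
        in_mother = allele in mother[site]
        if in_father != in_mother:
            # Only one parent carries this allele, so the site discriminates between them.
            informative_sites += 1
        father_consistent = father_consistent and in_father
        mother_consistent = mother_consistent and in_mother
    if informative_sites == 0 or father_consistent == mother_consistent:
        # No discriminating site, or the data fit both parents or neither.
        return "undetermined"
    return "paternal" if father_consistent else "maternal"


if __name__ == "__main__":
    # Hypothetical alleles flanking the variant on the child's variant-carrying haplotype.
    child_haplotype = {"snp1": "A", "snp2": "T", "snp3": "G"}
    father_genotypes = {"snp1": ("A", "G"), "snp2": ("T", "T"), "snp3": ("G", "C")}
    mother_genotypes = {"snp1": ("G", "G"), "snp2": ("T", "C"), "snp3": ("C", "C")}
    print(parent_of_origin(child_haplotype, father_genotypes, mother_genotypes))
```

In the family described above, a paternal haplotype carrying a variant that is absent from the father's own DNA is what points to a de novo event rather than inheritance.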

**Fig. 4: Long read nanopore sequencing confirms de novo *PKD1* variant occurred on the paternal allele.**

Undiagnosed

Research analysis performed in RPA007 identified a VUS, p.(Gln2824Arg), in PKD1 that segregated to her affected mother. Whole genome analysis in RPA007 with both short and long-read genome sequencing did not identify any additional variants of interest. Copy number variant analysis was uninformative. No splicing impact was predicted by Introme. The PKD1 p.(Gln2824Arg) variant is absent in population databases, predicted pathogenic by in silico tools and has not been previously reported in PKD cohorts. This information alone is insufficient to clarify the pathogenicity of this variant.

Discussion

Pathogenic variants in PKD1 and PKD2 have been shown to be responsible for disease in most patients with a typical phenotype of ADPKD. However, genetic sequencing in ADPKD cohorts consistently results in approximately 10% of patients being left without a genetic diagnosis^4,5,7. This study shows that most of these undiagnosed families have splice-impacting variants in PKD1 that were uncertain or undetected on standard diagnostic genetic testing. We demonstrate the value of sequencing and, importantly, analysis of the protein-coding and non-coding regions of PKD1 and PKD2, combined with targeted RNA studies to confirm a genetic diagnosis. For the first time, we show the value of long-read sequencing in ADPKD to inform the phasing and inheritance of variants.

Utilising a genetic diagnosis to inform clinical care requires a definitive genetic result. Cascade testing can only be offered in families with a definitive genetic diagnosis, and this is the same for using genetic results to inform family planning. This highlights the value of improving diagnostic yield for families with ADPKD and the value of the results of this study, which demonstrates that a significant proportion of undiagnosed families have variants that affect gene splicing. There is increasing evidence for similar variants across other disease groups, where, for example, RNA-sequencing in a cohort of patients with undiagnosed muscle disease identified a diagnosis through aberrant splicing in 35% of patients¹⁹. RNA-sequencing often requires analysis of a tissue of interest rather than using blood RNA^12,19,20. In kidney disease, RNA extracted from kidney tissue or urothelial cells is technically more challenging to access¹². We demonstrate a practical approach for evaluating suspected splicing variants in ADPKD using RT-PCR of total RNA extracted from peripheral blood. This pragmatic, targeted approach provides functional evidence to classify VUS that result in substantial splicing defects without requiring access to kidney tissue and is achievable for a diagnostic laboratory to replicate²¹. Developing protocols for diagnostic genetic laboratories to identify and then confirm coding or non-coding aberrant splicing variants is key in improving current genomic diagnostic rates¹². The high diagnostic yield in our study highlights the importance of diagnostic laboratories analysing beyond the coding region in patients with a typical ADPKD phenotype by using robust in silico tools, such as Introme²². Importantly, we also show that RT-PCR can then be used to evaluate potential splicing variants identified through this broader analysis. The homologous PKD1-pseudogenes produce mRNA transcripts that are approximately 97% homologous to the 5′ regions of PKD1 mRNA; therefore, this RT-PCR method requires the use of unique primers to avoid inadvertently amplifying transcripts from the PKD1-pseudogenes²³.

A confirmed genetic diagnosis is increasingly becoming the standard of care for families with genetic disorders and is being utilised across all inherited kidney diseases, including ADPKD^1,4,24,25,26. A genetic diagnosis allows early definitive diagnosis in ADPKD, which can provide prognostic information and allow for early institution of treatments, including vigorous hypertension management and tolvaptan for those predicted to have more rapidly progressive disease⁴. Genetic diagnosis also allows for informing the selection of kidney donors and for family planning. In this cohort alone, several families utilised a confirmed genetic diagnosis to inform PGD of embryos. In many jurisdictions, such as Australia, New Zealand and the United Kingdom, national health service subsidies are available for ADPKD families for IVF and PGD, highlighting that providing genetic counselling to ADPKD families is an essential aspect of their care²⁷. ADPKD is one of the most common monogenic conditions screened for in IVF and PGD²⁸. There is also increasing evidence of nocturnal hypertension in children with ADPKD, suggesting that guidelines regarding the diagnosis of ADPKD in childhood may be modified in the future to recommend early intervention for these children²⁹. Imaging results can be variable in paediatric populations, whereas genetic diagnostics allows reliable, definitive diagnosis^29,30.

Classifying VUS is currently a challenge for diagnostic laboratories, particularly in ADPKD, where variants are often private to families and multiple samples from large pedigrees are typically not available to perform segregation studies to clarify pathogenicity. We also demonstrate the additional value of long-read sequencing in understanding the pathogenicity of variants in ADPKD. Long-read technologies have the additional advantage of allowing the phasing of variants, which has obvious applications in confirming bi-allelic inheritance in autosomal recessive disease. In autosomal dominant disorders, long-read sequencing allows the opportunity to identify the parental allele on which a de novo variant occurs. This is valuable in understanding inheritance in complex families, as we demonstrate in family RPA017. Another unique application is in informing linkage studies for couples undergoing PGD for de novo ADPKD. For couples undergoing PGD, detailed phasing studies are performed to identify accurate markers that are then used to ascertain affected vs unaffected embryos. For patients with PKD1-mediated disease, this requires samples from multiple generations of the affected family, as direct sequencing of PKD1 is hampered by the presence of the homologous pseudogenes. This makes PGD challenging to access for patients with de novo PKD1-disease, who are the only affected person in their family. In this situation, which impacts 10% of patients with ADPKD, long-read sequencing can provide important phasing information that can allow these patients to access PGD of embryos³¹. Long-read data may additionally be used to detect structural variants that are missed by short-read sequencing, although we did not detect any relevant events here¹³.

In order to maximise the study size, this cohort was collected from a larger pool of over 172 patients from across Australia who had undergone standard diagnostic genetic testing for PKD. Given the high yield of diagnostic testing in ADPKD and the strict inclusion criteria of this study, only 16 patients from this larger pool met the inclusion criteria, of which 9 consented to participation. Inclusion was deliberately limited to patients with a family history of ADPKD to focus on germline rather than mosaic variants, restricting the eligible pool of patients³². Our results are comparable to other disease groups and highlight that previously unrecognised or undetected splice variants may be causative in these families³³. Our smaller cohort size means there is value in applying this method to a larger cohort.
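To make the effect of the small cohort size concrete, the recruitment funnel and diagnostic yield can be worked through numerically. The sketch below uses only counts stated in the text (16 eligible, 9 consented, 8 diagnosed) and adds a Wilson score interval for the yield; the interval is an illustrative calculation added here, not a statistic reported by the authors, and the function name is an assumption.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.96):
    """Wilson score interval for a binomial proportion (z = 1.96 for roughly 95% coverage)."""
    p = successes / n
    denom = 1 + z ** 2 / n
    centre = (p + z ** 2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z ** 2 / (4 * n ** 2)) / denom
    return centre - half_width, centre + half_width


if __name__ == "__main__":
    eligible = 16    # met inclusion criteria
    consented = 9    # proceeded to the study
    diagnosed = 8    # Pathogenic or Likely Pathogenic variant identified

    print(f"not consented: {eligible - consented}")
    print(f"diagnostic yield: {diagnosed / consented:.0%}")
    low, high = wilson_interval(diagnosed, consented)
    print(f"Wilson interval: {low:.2f} to {high:.2f}")
```

With only nine families the interval is wide, which is consistent with the authors' remark that the method should be applied to a larger cohort.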

Our results show that for families with a typical ADPKD phenotype, variants are most likely to be found in PKD1 and PKD2 rather than other PKD-associated genes and that most of these variants are splice-impacting. We provide evidence of the value of diagnostic laboratories expanding the analysis to non-protein-coding regions to improve diagnostic yield in ADPKD – we achieved this through WGS, though other validated sequencing methodologies could also be utilised. Importantly, we describe an achievable method for assessing uncertain variants that are predicted to impact splicing in this common disorder. ADPKD is the most common inherited kidney disorder and contributes to approximately 10% of kidney failure cohorts⁴. Improving diagnostic rates allows for improved management through earlier institution of treatment and access to holistic care that includes genetic counselling. Importantly, improving the understanding of the underlying genetic basis for all families with ADPKD is a critical step in developing personalised therapies for this common genetic disease.

Methods

Enrolment and inclusion criteria

We enrolled patients with typical ADPKD clinical features and a family history of ADPKD who were without a genetic diagnosis after diagnostic sequencing of PKD1 and PKD2 or a larger cystic gene panel that included the PKD1 and PKD2 genes. Patients undiagnosed from our previous studies were assessed for suitability for this study (Fig. 1). In addition, patients were recruited from clinical sites across Australia. Family members were recruited as required and available. Ethics approval for the study was obtained from the RPAH Human Research Ethics Committee (HREC/18/RPAH/726). All participants provided written informed consent. The authors have received and archived written patient consent. All data included is de-identified.

Clinical, family and imaging data were obtained during the clinical review or review of medical records. Kidney lengths were based on ultrasound measurements as kidney ultrasound is the Medicare-funded imaging modality available for the assessment of ADPKD patients in Australia. Kidney function was estimated using the CKD-EPI equation.

Short-read genome sequencing

All probands underwent short-read genome sequencing using DNA extracted from peripheral blood samples. Genome sequencing was performed on the HiSeq X sequencing system (Illumina Inc., San Diego, CA, USA) after either PCR-based library preparation (Illumina HiSeq X TruSeq Nano DNA HT Sample Prep Kit) or PCR-free library preparation (KAPA Hyper PCR-free kit, Roche). The sequencing was performed within an ISO17025-accredited laboratory at the Kinghorn Centre for Clinical Genomics within the Garvan Institute. All samples were processed via a custom bioinformatics pipeline based on GATK best practice, which was optimised for the identification of germline variants^1,4. Reads were aligned to the GRCh37 reference sequence. Sequence variants were filtered using Seave³⁴. CNV and structural variant analysis was performed using ClinSV³⁵. Introme was used to assess for variants predicted to impact splicing²². Control PKD1 splice junction usage was obtained using GTEx V8, filtered to include only kidney samples³⁶. Initial variant analysis was targeted to the coding, intronic and promoter regions of PKD1 (NM_001009944.3) and PKD2 (NM_000297.4), with all variants (ranging from predicted high to low impact) manually reviewed. Analysis was then expanded to phenotype-driven whole genome analysis. Variants were classified according to American College of Medical Genetics (ACMG) Guidelines^30,37. Sanger sequencing (with prior LR-PCR amplification if within the PKD1-pseudogene homologous region) was performed to confirm all single nucleotide and short indel variants identified on genome sequencing and for family studies.

RNA studies

RNA functional studies were performed to assess variants predicted to impact splicing in PKD1. Total RNA was extracted from venous blood (Macherey-Nagel Nucleospin RNA Blood Kit) for RT-PCR studies. If the variant of interest was within the PKD1-pseudogene homologous region, amplification was performed with at least one of the primer pairs being unique to the PKD1 sequence in order to avoid amplifying PKD1-pseudogene transcripts (see Supplementary Table 1 for primer sequences). Sanger sequencing was performed on this PCR product. See Supplementary Methods for further details. RNA studies were performed within a research laboratory at the Garvan Institute.
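The primer requirement described above (at least one primer of a pair unique to PKD1 so that pseudogene transcripts are not amplified) can be expressed as a simple exact-match check. This is a minimal sketch on short invented sequences, not the primer design procedure used in the study; real primers are listed in Supplementary Table 1, and a practical check would also need to consider near-matches and 3′ end mismatches. All function names and sequences here are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def occurs_in(primer: str, template: str) -> bool:
    """True if the primer matches the template exactly on either strand."""
    primer = primer.upper()
    template = template.upper()
    return primer in template or reverse_complement(primer) in template


def pair_is_gene_specific(forward: str, reverse: str, gene: str, pseudogenes: list) -> bool:
    """Both primers must bind the gene, and at least one must be absent from every pseudogene."""
    both_bind_gene = occurs_in(forward, gene) and occurs_in(reverse, gene)
    unique_primers = [
        primer
        for primer in (forward, reverse)
        if not any(occurs_in(primer, pseudogene) for pseudogene in pseudogenes)
    ]
    return both_bind_gene and len(unique_primers) >= 1


if __name__ == "__main__":
    # Invented toy sequences: the pseudogene differs from the gene at a single base.
    gene = "ATGGCCTTAGGCATCCGATTGACCGTA"
    pseudogene = "ATGGCCTTAGGCATCCGATTGTCCGTA"
    forward = "ATGGCCTTAG"    # shared by gene and pseudogene
    reverse = "TACGGTCAATC"   # reverse complement of a gene-only segment
    print(pair_is_gene_specific(forward, reverse, gene, [pseudogene]))
```

In this toy case the forward primer binds both templates, but the reverse primer spans the single differing base, so the pair as a whole is specific to the gene.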

Long-read sequencing

In families who remained negative after short-read genome sequencing or for whom phasing could inform variant interpretation and classification, long-read sequencing was performed. High molecular weight DNA was sheared to ~20 kb fragment size using Covaris G-tubes. Sequencing libraries were prepared from ~1.5 to 5 µg of sheared DNA using native library prep kits (SQK-LSK110) and sequenced for 72 h on a PromethION (FLO-PRO002, R9.4.1) flow cell. Raw ONT sequencing data was converted to BLOW5 format with slow5tools (v0.3.0)³⁸, then base-called using Guppy (4.0.11 or later). Resulting FASTQ files were aligned to the hg38 reference genome using minimap2 (v2.14-r883)³⁹, and Longshot (v0.4.1)⁴⁰ was used to identify and phase variants within the PKD1 locus. Long-read sequencing was performed within a research laboratory at the Garvan Institute.

Data availability

Variants identified in the study have been submitted to ClinVar (ClinVar Accessions: SCV002756451–SCV002756459). Other data is available upon request. As per the patient’s informed consent, the long-read DNA sequencing data can be made available on request for institutional ethics-approved research groups.

Acknowledgements

This study has been funded through support from PKD Australia, PKD Foundation, RACP Jacquot Foundation, Cancer Australia and My Room, the NSW Office of Health and Medical Research funded Luminesce Alliance, The Sylvia and Charles Viertel Charitable Foundation and the Lewis Foundation.

Author information

Authors and Affiliations

Molecular Genetics of Inherited Kidney Disorders Laboratory, Garvan Institute of Medical Research, Sydney, Australia
Yvonne Hort, Laura Wedd, Timothy Furlong, John Shine & Amali Mallawaarachchi
Children’s Cancer Institute, Lowy Cancer Centre, UNSW Sydney, Kensington, NSW, Australia
Patricia Sullivan & Mark J. Cowley
School of Clinical Medicine, UNSW Medicine & Health, UNSW Sydney, Kensington, NSW, Australia
Patricia Sullivan & Mark J. Cowley
Centre for Population Genomics, Garvan Institute of Medical Research and UNSW Sydney, Sydney, NSW, Australia
Laura Wedd & Cas Simons
Genetic Health Queensland, Royal Brisbane and Women’s Hospital, Herston, QLD, Australia
Lindsay Fowles & Chirag Patel
Genomic Technologies, Garvan Institute of Medical Research, Sydney, Australia
Igor Stevanovski & Ira Deveson
Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children’s Research Institute, Sydney, Australia
Igor Stevanovski & Ira Deveson
Centre for Population Genomics, Murdoch Children’s Research Institute, Melbourne, VIC, Australia
Cas Simons
Department of Renal Medicine, Townsville University Hospital, Townsville, QLD, Australia
Andrew Mallett
Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
Andrew Mallett
College of Medicine and Dentistry, James Cook University, Townsville, QLD, Australia
Andrew Mallett
Clinical Genetics Service, Institute of Precision Medicine and Bioinformatics, Royal Prince Alfred Hospital, Sydney, Australia
Amali Mallawaarachchi

Contributions

Y.H. processed DNA and RNA samples, performed and interpreted the RNA studies, contributed to paper drafting and approved the final version; P.S. performed in-silico splicing analysis, interpreted RNA studies, contributed to paper drafting and approved the final version; L.W. performed variant interpretation, contributed to manuscript drafting and approved the final version; L.F., C.P. and AnMa recruited and consented participants, collected clinical data, interpreted final results, critically revised the draft and approved the final version; I.S. processed DNA samples, performed and interpreted long-read sequencing, critically revised the draft and approved the final version; I.D. performed and interpreted long-read sequencing, critically revised the draft and approved the final version; C.S. performed data processing and variant interpretation, critically revised the draft and approved the final version; T.F. contributed to study design, interpretation of clinical data and final results, critically revised the draft and approved the final version; M.C. contributed to study design, processing and interpretation of genomic data, interpretation of in silico splicing studies and RNA studies, contributed to manuscript drafting and approved the final version; J.S. contributed to study design, interpretation of RNA studies and final results, critically revised the draft and approved the final version; AmMa designed the study, recruited and consented participants, collected clinical data, performed variant processing and interpretation, interpreted final results, drafted the paper and approved the final version.

Corresponding author

Correspondence to Amali Mallawaarachchi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Supplementary Table 3: GTEx Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Hort, Y., Sullivan, P., Wedd, L. et al. Atypical splicing variants in PKD1 explain most undiagnosed typical familial ADPKD. npj Genom. Med. 8, 16 (2023). https://doi.org/10.1038/s41525-023-00362-z

Received: 20 December 2022
Accepted: 26 June 2023
Published: 07 July 2023
DOI: https://doi.org/10.1038/s41525-023-00362-z
Springer Nature Limited

Introduction