Abstract
Over 100 X-linked intellectual disability genes have been identified, with triplet repeat expansions at the FMR1 (FRAXA) and AFF2 (FRAXE) genes being the causative agent in two of them. The absence of FRAXE pathognomonic features hampers early recognition, delaying testing and molecular confirmation. Hence, our laboratory uses a multiplex PCR-based strategy to genotype both FRAXA and FRAXE. However, AFF2 expansions are missed giving rise to an uninformative result in around 20% of female samples. To rule out undetected expansions and confirm homozygosity Southern blot analysis is performed being labour- and resource-intensive. The aim of this study is to develop a timely and economic triplet-primed amplification (TP-PCR) screening strategy to size the AFF2 GCC repeat and accurately assess homozygosity as well as pinpoint multiplex-PCR false negatives in female samples. In order to achieve this, validation was performed in a cohort of 500 females with a previous uninformative FRAXE PCR result. Interestingly, the presence of a T > C SNP (rs868949662), contiguous to the GCC repetitive tract, allows triplet primer binding in two additional repeats, increasing the discrimination power of the TP-PCR assay in heterozygous and homozygous samples. Twelve alleles outside the normal range were recognized: eight intermediate and four premutated, which seems relevant considering the rarity of the AFF2 expansions. All genotypes are concordant with that obtained by Southern blotting, confirming this as a strict, reproducible and low-cost homozygosity screening strategy that enables the identification of small expanded alleles missed by the routine multiplex-PCR due to allele dropout. Overall, this assay is capable of spotting multiplex-PCR false negatives besides identifying alleles up to > 80 GCC repeats. Furthermore, the occurrence of intermediate repeat sizes with unexpected frequency, introduces new areas of clinical research in this cohort in understanding these less explored AFF2 repeat sizes and newly associated phenotypes.
Similar content being viewed by others
Introduction
The most prevalent form of non-syndromic X-linked intellectual disability (NS-XLID), the Fragile XE Mental Retardation Syndrome (MIM #309548), with an estimated frequency of 1/50 000 newborn males, is caused by a GCC expansion above 200 triplet repeats (full mutation) at the AFF2 gene 5′ UTR1,2,3,4,5,6. Normal individuals show alleles with 6–30 GCC repeats that are stable upon transmission. Alleles with 31–60 GCC repeats are considered intermediate or grey-zone and may vary slightly upon transmission. Alleles with repeats ranging from 61 to 200 GCC repeats are considered unstable and named premutations7,8. When the repeats expand to over 200 GCC repeats, the so-called full mutation, gene transcription is silenced due to methylation, causing intellectual disability9,10,11. AFF2 gene methylation is variable with reports showing that alleles with 130 GCCs can be methylated12. The GCC repeat can either expand or contract and is equally unstable when transmitted through both males and females. The absence of FRAXE pathognomonic features hampers early recognition, limiting testing and molecular confirmation5,13,14,15. In some patients the presence of borderline cognitive functioning hinders intellectual disability classification16,17. Several commercial PCR-based kits have been developed for FRAXA analysis, yet, to the best of our knowledge none exist for FRAXE testing, probably due to the scarcity of research reports as well as clinical cases18,19,20,21. Therefore, in some countries genetic testing for AFF2 is considered investigational. The absence of FRAXE pathognomonic features requires careful clinical assessment, and mandates genetic testing for early recognition and appropriate monitoring22. Hence, our laboratory uses a low-cost multiplex PCR-based strategy to determine simultaneous allele sizing of FRAXA and FRAXE loci in both males and females that fulfil clinical criteria for Fragile X Syndrome, idiopathic intellectual disability (FXS, MIM #300624)23. In the presence of an expanded allele, the PCR result is usually inconclusive as the amplification of expanded alleles is inefficient in high repeat sizes. Furthermore, around 20% of female samples are uninformative due to possible homozygosity (both alleles share the same number of repeats) for one or both loci. These observations prompted us to develop a rapid triplet-primed amplification (TP-PCR) screening strategy to size the AFF2 GCC repeat and accurately assess homozygosity in female samples. In the case of FMR1 a commercially available TP-PCR kit is used. Overall, this study allowed validation of a homozygosity screening strategy, spotting false negatives from the multiplex-PCR allowing the identification of FRAXE-related expansions with putative diagnostic utility.
Results
TP-PCR development and optimization
For the optimization of the TP-PCR assay conditions, we used four male and female samples, previously characterized by sequencing and Southern blot analyses (Supplementary Fig. S1). Taking advantage of the (GCC)2GCT(GCC)2C sequence located 52 nucleotides downstream of the unstable GCC tract, specific primer sequence and ligation conditions allowed the simultaneous amplification of the triplet repeat profile and the determination of the allele total length in one assay. The use of a (GCC)5 primer allows ligation to occur at (GCC)2GCT(GCC)2C, with SNP rs1333167094 “T” being unnoticed (Fig. 1). The TP-PCR electropherogram reflects the total repeat length as well as each GCC triplet, represented as a repetitive stutter pattern where the leftmost peak corresponds to repeat number 5 and each beyond that being represented as an individual peak until the end of the repeat tract is reached. The downstream peak corresponds to the distance between the forward primer and the sole GCC priming site 52 bp downstream of the end of the repeat. For instance, a peak of around 197 bp corresponds to 18 GCCs matching the thirteen (plus five triplets of the “CGG primer”) consecutive fragments differing by 3 bp (Fig. 1). Our TP-PCR assay is also able to discriminate alleles differing by a single repeat. In samples carrying larger alleles, a continuous series of uninterrupted triplet fragments is displayed, overlapping the total repeat size peak. Despite being visible triplets with an RFU greater than the height of the noise signal, only those with 30 RFUs or above are considered, allowing to accurately size expanded alleles up to 80 GCCs (Fig. 1C,ii). In cases where alleles are visible with a stuttering pattern past the established threshold of 80 repeats, further testing is mandatory following the classical/routine workflow (e.g. Southern blotting).
TP-PCR validation
Following optimization, a total of 500 female samples, previously classified as uninformative by multiplex-PCR (due to observed homoallelism), were analyzed. In our cohort, FRAXE alleles ranged between 11 to 107 GCC triplets, in which alleles with 15 repeats are the most frequent. As expected, a homozygous normal result was confirmed by TP-PCR in the vast majority of the samples tested (n = 483; 96.6%), showing 13 different genotypes (Table 1). The most common genotype was 15/15 which is expected because the 15 GCC allele is the most common in the European population. Allele distribution is similar to that obtained by Clark et al., with the 15 (n = 692), 18 (n = 84) and 16 (n = 76) GCC alleles being the most common, followed by those with 20 GCC (n = 29) suggesting a bimodal distribution (Table 1)8. The unique design of the “CGG primer” in the reverse sense, with a 5′ tail and (CGG)5C at its 3′ terminus, enables the annealing anywhere within the GCC tract as well as rs1333167094 “T” SNP overriding leading to primer ligation in a second distal site. This occurs because the non-complementary “T” is located exactly in the middle of the (CGG)5 primer’s sequence. In case of rs868949662 “C”, the primer is unable to bind due to the absence of ligation in 3′ end nucleotide of the primer. We speculate that the presence of a nucleotide interspersed at the GCC stretch would likely produce a partial peak dropout as a consequence of the absence of annealing as that interspersed nucleotide is located towards the primer 3′ end. As no partial or total allele dropout was observed in our cohort, we anticipate that interspersions in the AFF2 repetitive region are scarcer than the AGG interruption(s) in the FMR1 gene repetitive region.
Unexpected alleles
The majority of normal-sized alleles were identically sized by both multiplex-PCR and TP-PCR. The discrepant result observed in one sample, multiplex-PCR (15/15) and TP-PCR (15/27) might be explained by the preferential amplification of the smallest allele when using three primer pairs (multiplexing FMR1, AFF2 and ARX genes), as there is a 36 bp (12 GCCs) difference between the size of each allele. This bias towards the smallest allele seems to be occurring more frequently with AFF2 than FMR1 gene alleles (in our experience, observed even with singleplex PCR). In 6 samples a second expanded allele was observed and in 11 a second unexpected normal-sized allele could be depicted by the triplet repeat profile but not by the total repeat length fragment (Fig. 2). These results prompted sequencing of the repetitive region in those samples, which revealed that the presence of the SNP minor allele "C" at the end of the GCC region [NM_002025.3:c.-413T > C (rs868949662)], resulted in the extension of the repeat tract by three repeats. The nucleotide following the last GCC triplet is a “C”, hampering its recognition due to non-complementarity in the very last nucleotide (3′) of the primer (because of the last nucleotide, the primer needs to bind the "G" of the next triplet for amplification to occur). As a consequence, the "CGG primer” only binds to two extra repeats, without interfering in the total fragment length, and thus explaining the discrepant results. This SNP is described with a frequency of 0.021 (gnomAD), and contributes to the discriminatory power of the TP-PCR assay in heterozygous and homozygous samples carrying this variant. In samples where total length fragment shows a difference in two GCCs, when compared to TP-PCR profile, sequencing should be performed to confirm exact GCC number.
Southern blot confirmation
Southern blotting and hybridization were performed to validate TP-PCR results, determine the exact size of expanded alleles, exclude size mosaicism and confirm methylation status. Southern blot analysis using an AFF2-specific probe was performed in six samples: FRAXE full mutation male sample showing a methylated fragment smear of over 6 kb (> 400 GCCs), three female samples with premutations (with 80, 84 and 107 GCCs) and two samples with intermediate alleles (with 47 and 49 GCCs) (Supplementary Fig. S2).
Discussion
A new and accurate screening method for the analysis of the AFF2 gene repetitive region was developed, allowing the quantification of alleles up to 100 GCCs. Among the 500 female samples, seventeen showed alleles with distinct GCC sizes of which three premutations and two intermediate-sized alleles were confirmed by Southern blotting, the remaining were confirmed to have homozygous AFF2 alleles. The previous multiplex-PCR also failed to detect a second normal-sized allele in eleven samples (allele categorizing followed Murray et al. 1996)7. Overall, this TP-PCR assay allows a clear distinction between true homoallelism and the presence of an expanded allele and thus proves to be an accurate, reproducible, and low-cost homozygosity screening strategy. There is little information on FRAXE probably due to its rarity and/or mild phenotypic presentation that can be overlooked in the intellectually disabled population. In our cohort a 1:250 premutation carrier ratio was observed that otherwise would be overlooked as homozygous. This frequency seems quite high when compared to that of FRAXA and justifies per se the importance of such a study. AFF2 intermediate and premutated alleles have been associated with disease due to the production of abnormal mRNA levels, similarly to other neurodegenerative disorders caused by triplet repeat expansions24. Association between AFF2 and Parkinson’s disease clinical features such as gait disorder, visual hallucination and urinary incontinence have been observed, however studies with larger cohorts are needed to further understand the AFF2 role in the disease progression and manifestation25,26. It also highlights the importance of the molecular characterization notably in highly heterogeneous and broad conditions such as non-syndromic intellectual disability, for proper medical care and follow-up as well as accurate genetic counselling27. There is a relatively high degree of variation of the FRAXE repeat size in particular, and the extensive data available from this study opens areas of research into understanding phenotype associated with relatively unexplored repeat sizes. This could be particularly interesting for the lower repeat sizes occurring with high frequency at FRAXE in this population. As the data can be linked to the newly associated phenotypes, it will provide a resource for researchers worldwide.
Methods
Studies fully respect to the Declaration of Helsinki (1964 and amendments), the Oviedo Convention and its protocols (1997) and the Principles of good clinical practice in clinical research. Participants, legal guardians and parents/patient’s legal representatives signed informed consent for the use of DNA samples in intellectual disability research and biobanking. Genomic DNA samples had been extracted by Salting out method from peripheral blood collected in EDTA, dissolved in Tris–EDTA buffer (pH 7) and anonymously stored at 4 °C28. Samples were previously tested by routine FRAXA, FRAXE and ARX multiplex-PCR23. This study has been approved by DEFI—Departamento de Ensino, Formação e Investigação do Centro Hospitalar Universitário do Porto, (CHUPorto), as well as the Hospital’s Ethical Committee—N/REF.a 2018.179 (154-DEFI/153-CES) as part of Cecília Silva (CS) Master’s degree.
Triplet-repeat primed PCR (TP-PCR) assay
The AFF2 TP-PCR mix was prepared for a final reaction volume of 25 μL containing 1 × PCR Master Mix (Promega, Madison, WI, USA), 0.6 M Betaine (Sigma-Aldrich, St. Louis, MO, USA), 10% DMSO (Sigma-Aldrich), 0.5 × Q-Solution (Qiagen GmbH, Hilden, Germany), 0.2 mM 7-Deaza-dGTP (Roche, Basel, Switzerland), 0.2 pmol/μL forward and tail primers, 0.1 pmol/μL "GCC primer" and approximately 450 ng of gDNA. Primer sequences are as follows: (i) the “forward primer” NED labelled 5′-TGTGAGTGTGTAAGTGTGTGATGCTGCC-3′; (ii) the “CGG primer” with a nonspecific tail sequence composed by 21 nucleotides 5′-TACGCATCCCAGTTTGAGACG(CGG)5C-3′; (iii) and the “tail primer” 5′-TACGCATCCCAGTTTGAGACG-3′. Amplification cycle includes an initial denaturation step of 5 min at 98 °C, 15 cycles of denaturation at 98 °C for 60 s, annealing at 55 °C for 2 min and extension at 68 °C for 4 min, and 30 cycles at 98 °C for 60 s, 58 °C for 2 min and 68 °C for 4 min and with a final extension of 10 min at 68 °C. Five μL of TP-PCR product was added to 15 μL of a mixture (15:1) of formamide and GeneScan 500 ROX dye Size Standard (Applied Biosystems, Foster City, CA, USA), resolved on ABI PRISM 3130xl Genetic Analyser (Applied Biosystems) using POP-7 polymer and a 36 cm long 16-capillary array. Analyses were performed using GeneMapper Software version 4.0 (Applied Biosystems) with the NED fluorochrome peak detection threshold adjusted to 30 relative fluorescence units. A previously sequenced sample with 15 GCCs (around 188 bp in the established conditions) was used as control to determine the exact size in base pairs (bp).
TP-PCR results validation
Sanger sequencing
Symmetric amplification was performed using 5′-TGTGAGTGTGTAAGTGTGTGAT-3′ and 5′-GCCCGCGCACCCAGCGAC-3′ primers, followed by asymmetric PCR using BigDye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems) according to manufacturer’s instructions and standard conditions. Products were resolved on ABI PRISM 3130xl Genetic Analyser (Applied Biosystems) and the results analysed using SeqScape Software version 2.5 (Applied Biosystems).
Southern blotting and hybridization
Around 10 μg of gDNA was digested overnight at 37 °C using endonucleases NotI (methylation sensitive) and AflIII (New England Biolabs, Ipswich, MA, USA), followed by 1% agarose gel overnight electrophoresis, blotting for 3 h and overnight hybridization at 60 °C using AFF2-AJ31Dig1 probe (GeneLink, Hawthorne, NY, USA) 1:4 diluted in Dig Easy Hyb (Roche). Dig Wash and Block Buffer Set (Roche) and CDP-Star (Roche) were used for signal revelation. Signal detection was accomplished using a Fujifilm Luminescent Image Analyzer LAS-3000 v2.2 (Fujifilm, Tokyo, Japan). DIG-labeled, DNA Molecular Weight Markers II and III (Roche), were mixed (1:1) and used as a size standard.
References
Gecz, J., Gedeon, A. K., Sutherland, G. R. & Mulley, J. C. Identification of the gene fmr2, associated with fraxe mental retardation. Nat. Genet. 13, 105–108. https://doi.org/10.1038/ng0596-105 (1996).
Zhong, N. et al. A survey of fraxe allele sizes in three populations. Am. J. Med. Genet. 64, 415–419. https://doi.org/10.1002/(SICI)1096-8628(19960809)64:2%3c415::AID-AJMG36%3e3.0.CO;2-G (1996)
Gecz, J., Bielby, S., Sutherland, G. R. & Mulley, J. C. Gene structure and subcellular localization of fmr2, a member of a new family of putative transcription activators. Genomics 44, 201–213. https://doi.org/10.1006/geno.1997.4867 (1997).
Crawford, D. C. et al. Prevalence and phenotype consequence of fraxa and fraxe alleles in a large, ethnically diverse, special education-needs population. Am. J. Hum. Genet. 64, 495–507. https://doi.org/10.1086/302260 (1999).
Mulatinho, M. V., Llerena, J. C. & Pimentel, M. M. Fraxe mutation in mentally retarded patients using the oxe18 probe. Int. J. Mol. Med. 5, 67–69. https://doi.org/10.3892/ijmm.5.1.67 (2000).
Tejada, M. I. & Ibarluzea, N. Non-syndromic × linked intellectual disability: Current knowledge in light of the recent advances in molecular and functional studies. Clin. Genet. 97, 677–687. https://doi.org/10.1111/cge.13698 (2020).
Murray, A. et al. Population screening at the fraxa and fraxe loci: Molecular analyses of boys with learning difficulties and their mothers. Hum. Mol. Genet. 5, 727–735. https://doi.org/10.1093/hmg/5.6.727 (1996).
Clark, R. et al. The fraxa and fraxe allele repeat size of boys from the avon longitudinal study of parents and children (alspac). Wellcome Open Res. 4, 116. https://doi.org/10.12688/wellcomeopenres.15342.2 (2019).
Flynn, G. A. et al. Identification of the fraxe fragile site in two families ascertained for x linked mental retardation. J. Med. Genet. 30, 97–100. https://doi.org/10.1136/jmg.30.2.97 (1993).
Knight, S. J. et al. Trinucleotide repeat amplification and hypermethylation of a cpg island in fraxe mental retardation. Cell 74, 127–134. https://doi.org/10.1016/0092-8674(93)90300-f (1993).
Holinski-Feder, E. et al. Fraxe testing. Am. J. Hum. Genet. 59, 1168–1169 (1996).
Nelson D.L., Gu Y. The Neglected Fragile X Mutations FRAXE and FRAXF. In: Fry M., Usdin K. (eds) Human Nucleotide Expansion Disorders. Nucleic Acids and Molecular Biology, vol 19. https://doi.org/10.1007/3-540-33336-3_3 (Springer, Berlin, Heidelberg, 2006)
Knight, S. J. et al. A study of fraxe in mentally retarded individuals referred for fragile x syndrome (fraxa) testing in the united kingdom. Am. J. Hum. Genet. 58, 906–913 (1996).
Gecz, J. The fmr2 gene, fraxe and non-specific x-linked mental retardation: Clinical and molecular aspects. Ann. Hum. Genet. 64, 95–106. https://doi.org/10.1017/S0003480000007983 (2000).
Melko, M. et al. Functional characterization of the aff (af4/fmr2) family of rna-binding proteins: Insights into the molecular pathology of fraxe intellectual disability. Hum. Mol. Genet. 20, 1873–1885. https://doi.org/10.1093/hmg/ddr069 (2011).
Hamel, B. C. et al. Segregation of fraxe in a large family: Clinical, psychometric, cytogenetic, and molecular data. Am. J. Hum. Genet. 55, 923–931 (1994).
Carbonell, P. et al. Fraxe mutation analysis in three Spanish families. Am. J. Med. Genet. 64, 434–440. https://doi.org/10.1002/(SICI)1096-8628(19960809)64:2%3c434::AID-AJMG40%3e3.0.CO;2-D (1996)
Accardo, P. J. & Shapiro, L. R. Fraxa and fraxe: To test or not to test?. J. Pediatr. 132, 762–764. https://doi.org/10.1016/s0022-3476(98)70299-9 (1998).
Strelnikov, V., Nemtsova, M., Chesnokova, G., Kuleshov, N. & Zaletayev, D. A simple multiplex fraxa, fraxe, and fraxf pcr assay convenient for wide screening programs. Hum. Mutat. 13, 166–169. https://doi.org/10.1002/(SICI)1098-1004(1999)13:2%3c166:AID-HUMU10%3e3.0.CO;2-X (1999).
Patsalis, P. C. et al. Molecular screening of fragile x (fraxa) and fraxe mental retardation syndromes in the hellenic population of greece and cyprus: Incidence, genetic variation, and stability. Am. J. Med. Genet. 84, 184–190. (1999).
Santos, C. B., Costa Lima, M. A. & Pimentel, M. M. A new pcr assay useful for screening of fraxe/fmr2 mental impairment among males. Hum. Mutat. 18, 157–162. https://doi.org/10.1002/humu.1165 (2001).
Youings, S. A. et al. Fraxa and fraxe: The results of a 5 year survey. J. Med. Genet. 37, 415–421. https://doi.org/10.1136/jmg.37.6.415 (2000).
Jorge, P., Oliveira, B., Marques, I. & Santos, R. Development and validation of a multiplex-pcr assay for x-linked intellectual disability. BMC Med. Genet. 14, 80. https://doi.org/10.1186/1471-2350-14-80 (2013).
Darling, A. L. & Uversky, V. N. Intrinsic disorder in proteins with pathogenic repeat expansions. Molecules 22(12)–2027. https://doi.org/10.3390/molecules22122027 (2017).
Annesi, G. et al. Fraxe intermediate alleles are associated with Parkinson’s disease. Neurosci. Lett. 368, 21–24. https://doi.org/10.1016/j.neulet.2004.06.049 (2004).
Costa, A. et al. Intermediate alleles at the fraxa and fraxe loci in Parkinson’s disease. Parkinson. Relat. Disord. 17, 281–284. https://doi.org/10.1016/j.parkreldis.2010.12.013 (2011).
Verma, A. K., Khan, E., Bhagwat, S. R. & Kumar, A. Exploring the potential of small molecule-based therapeutic approaches for targeting trinucleotide repeat disorders. Mol. Neurobiol. 57, 566–584. https://doi.org/10.1007/s12035-019-01724-4 (2020).
Miller, S. A., Dykes, D. D. & Polesky, H. F. A simple salting out procedure for extracting dna from human nucleated cells. Nucleic Acids Res. 16, 1215. https://doi.org/10.1093/nar/16.3.1215 (1988).
Acknowledgements
Unit for Multidisciplinary Research in Biomedicine (UMIB) is supported by national funds, through the Foundation for Science and Technology (FCT) Portugal (Grant numbers UIDB/00215/2020, and UIDP/00215/2020) and ITR—Laboratory for Integrative and Translational Research in Population Health (LA/P/0064/2020).
Author information
Authors and Affiliations
Contributions
P.J. and I.M. conceived and planned the study, C.S. and N.M. conducted the experiments, B.R., N.M. and F.S. contributed to sample preparation, data analysis and interpretation, I.M. and P.J. supervised the project, N.M. and C.S. designed the figures, tables and drafted the manuscript with input from all authors, P.J. and R.S. aided in writing—review and editing. All authors provided critical feedback and helped shape the research, analysis and manuscript review.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Silva, C., Maia, N., Santos, F. et al. Development and validation in 500 female samples of a TP-PCR assay to identify AFF2 GCC expansions. Sci Rep 11, 14676 (2021). https://doi.org/10.1038/s41598-021-93473-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-93473-5
- Springer Nature Limited