Abstract
This study sought to determine the potential role of HBB haplotypes to predict beta-thalassemia in the Malaysian population. A total of 543 archived samples were selected for this study. Five tagging SNPs in the beta-globin gene (HBB; NG_000007.3) were analyzed for SNP-based and haplotype association using SHEsis online software. Single-SNP-based association analysis showed three SNPs have a statistically significant association with beta-thalassemia. When Bonferroni correction was applied, four SNPs were found statistically significant with beta-thalassemia; IVS2-74T>G (padj = 0.047), IVS2-16G>C (padj = 0.017), IVS2-666C>T (padj = 0.017) and 3’UTR + 314G>A (padj = 0.002). However, 3'UTR + 233G>C did not yield a significant association with padj value = 0.076. Further investigation using combined five SNPs for haplotype association analysis revealed three susceptible haplotypes with significant p values of which, haplotypes 1-2-2-1-1 (p = 6.49 × 10−7, OR = 10.371 [3.345–32.148]), 1-2-1-1-1 (p = 0.009, OR = 1.423 [1.095–1.850] and 1-1-1-1-1 (p = 1.39 × 10−4, OR = 10.221 [2.345–44.555]). Three haplotypes showed protective effect with significant p value of which, 2-2-1-1-1 (p = 0.006, OR = 0.668 [0.500–0.893]), 1-1-2-2-1 (p = 0.013, OR = 0.357 [0.153–0.830]) and 1-1-2-1-1 (p = 0.033, OR = 0.745 [0.567–0.977]). This study has identified the potential use of intragenic polymorphic markers in the HBB gene, which were significantly associated with beta-thalassemia. Combining these five SNPs defined a new haplotype model for beta-thalassemia and further evaluation for predicting severity in beta-thalassemia.
Similar content being viewed by others
Introduction
Haemoglobin disorders are the most common monogenic disease worldwide1,2. These inherited autosomal recessive disorders are classified according to the haemoglobin expression or synthesis. There are three main categories of haemoglobin disorders; haemoglobinopathy is the structurally abnormal haemoglobin, thalassemia is the quantitatively reduced haemoglobin3,4 and, hereditary persistence of foetal haemoglobin (HPFH) and δβ-thalassemia are characterized by increased levels of foetal haemoglobin (HbF) in adulthood. The prevalence of haemoglobin disorders was high in tropical and subtropical regions such as sub-Saharan Africa, Mediterranean, Middle East, Indian subcontinent, and Southeast Asia3. However, due to modernization, people from epidemic areas migrated to the non-epidemic area. Hence, haemoglobin disorders have become a significant health problem in 71% of 229 countries globally5.
The recent report by the Malaysian Thalassemia Registry (MTR) has recorded a total of 8681 thalassemia cases from 2007 until November 2018. Since the launch of the National Thalassemia Preventive and Control Program in 2004, healthcare facilities have been upgraded to provide better quality for patient management. Hence, the survival rate of patients with thalassemia in Malaysia has improved6. With several molecular studies have been done previously in the East and West Malaysia, genetic heterogeneity is more observed in multiracial population in Malaysia with a diverse spectrum of alpha (α-), beta (β-) and delta (δ-) globin genes mutations among the patients with thalassemia syndromes7,8,9,10,11. Beta-thalassemia is due to decreased beta-globin chain synthesis of which, caused by a mutation in the HBB gene. The HBB gene mapped on chromosome 11p15.4 with a region spanning from 5,225,464 to 5,229,395 bp on the reverse strand12. Therefore, the identification of nucleic acid variations in the HBB gene has improved our understanding of underlying causal mutations of beta-thalassemia in Malaysia.
The clinical presentation and molecular circumstantial of beta-thalassemia are highly heterogeneous and dependent on geographical and ethnic factors13. The evidence may be derived from genetic predisposition, which is unique to particular ethnic groups, and that could enable targeted molecular analysis being designed14,15. Genetic heterogeneity is more observed among different ethnicity in Malaysia with a diverse spectrum of HBB gene mutations. According to Elizabeth &Ann (2010), approximately 73% of Malay patients with beta-thalassemia were due to mutation at codon 26(A>G) or HbE (βE), IVS1-5(G>C) (severe β+) and IVS1-1(G>T) (β°). Whilst for Malaysian Chinese, 90% of beta-thalassemia cases were due to codon 41/42 (-TTCT) (β°), IVS2-654(C>T), -28(A>G), codon 17(A>T) and codon 71/72(+ A). In East Malaysia, especially in Sabah, 90% of the Kadazan-Dusun with beta-thalassemia were due to β-Filipino deletion (β°)14,15.
However, the genetic variant interaction in conferring the effect based on haplotype inference has yet to be explored and refined in beta-thalassemia among the Malaysian population. Deciphering the predisposing effect by the potential haplotype markers can promote the exposition of underlying mechanisms of thalassemia development. In this study, five single nucleotide polymorphisms (SNPs) within the HBB gene were evaluated to determine its significance and haplotype structure inference with beta-thalassemia in Malaysia, which was the first study conducted in Malaysia to the best of our knowledge.
Results
Single association analysis
In single-based association analysis, three tagging SNPs at IVS2-16G>C, IVS2-666C>T and 3’UTR + 314G>A showed a statistically significant association with beta-thalassemia with p value of 0.036, OR = 1.300 [1.017–1.66]; 0.032, OR = 0.765 [0.598–0.978] and 0.004, OR = 2.013 [1.238–3.272], respectively. However, tagging SNP at IVS2-74T>G and 3'UTR + 233G>C did not show statistically significant association with beta-thalassemia of which, the p value of 0.099 and 0.211. The minor allele of these two variants showed a trend towards protective effect based on odds ratios less than 1 (OR = 0.794 [0.604–1.044] and OR = 0.663 [0.347–1.267] respectively).
Table 1 depicts the genotypic association analysis of the five assigned SNPs. The most common genotype for IVS2-74T>G, IVS2-16G>C, IVS2-666C>T, 3’UTR + 233G>C and 3’UTR + 314G>A was TT (59.8% in case and 52.6% in control group), GC (49% in case and 49.1% in control group), CT (43% in case and 48.5% in control group), GG (94.4% in case and 91.5% in control group) and GG (82.3% in case and 90.4% in control group) respectively. Three tagging SNPs (IVS2-74T>G, 3'UTR + 233G>C, and 3'UTR + 314G>A) showed a high homozygosity rate in case and control groups. Meanwhile, a high heterozygosity rate was found in IVS2-16G>C and IVS2-666C>T in both groups.
All these SNPs were also tested for pairwise linkage disequilibrium (LD) across the HBB gene by the square of the correlation coefficient (r2). No deviation in Hardy–Weinberg equilibrium (HWE) for all five SNPs in the control and case group (0 < p < 1). After the Bonferroni correction, four SNPs were found significantly associated with beta-thalassemia; IVS2-74T>G (padj = 0.047), IVS2-16G>C (padj = 0.017), IVS2-666C>T (padj = 0.017) and 3’UTR + 314G>A (padj = 0.002).
Haplotype analysis
Captivated by this favorable data in single association analysis, further investigation was conducted using combined allele from IVS2-74T>G, IVS2-16G>C, IVS2-666C>T, 3'UTR + 233G>C and 3'UTRt + 314G>A of HBB gene in an attempt to evaluate the predisposing effect of HBB intragenic haplotype with beta-thalassemia. The naming system for the haplotype in this study is not related to the system used in the previous PCR–RFLP based haplotyping studies. Haplotype analysis revealed significant association for three haplotypes; 1-2-2-1-1, 1-2-1-1-1 and 1-1-1-1-1 with susceptibility effect towards beta-thalassemia of which, the p values were 6.49 × 10−7 (OR = 10.371 [3.345–32.148]), 0.009 (OR = 1.423 [1.095–1.850] and 1.39 × 10−4 (OR = 10.221 [2.345–44.555]) respectively. On the other hand, three haplotypes; 2-2-1-1-1, 1-1-2-2-1 and 1-1-2-1-1 significantly conferred an opposing manner of effect to beta-thalassemia with the p value 0.006 (OR = 0.668 [0.500–0.893]), 0.013 (OR = 0.357 [0.153–0.830]) and 0.033 (OR = 0.745 [0.567–0.977]) respectively. However, one haplotype with allele combinations 1-1-2-1-2 did not show any significant association with beta-thalassemia, where the p value was 0.899, yet the trend was towards susceptibility as depicted by OR = 1.041 [0.559–1.939]). The summary of these findings was tabulated in Table 2. Haplotype with the frequency < 0.03 in both controls and cases were automatically excluded from the analysis by the SHEsis online software.
Discussion
In this study, we explored a single-based and haplotype association of five intragenic HBB polymorphisms in beta-thalassemia cases from Malaysia. It was suggested that the association of intragenic SNPs might be useful for the diagnosis and delineation of the clinical heterogeneity of beta-thalassemia16. Furthermore, the intragenic SNPs could be useful marker for linkage analysis and in prenatal diagnosis it can improve the diagnostic errors of which, caused by recombination17.
From the analysis of single-based association, two intronic polymorphisms; IVS2-16G>C, and IVS2-666C>T, and one variant at 3' untranslated region to HBB gene assigned as 3'UTR + 314G>A were found significantly associated with beta-thalassemia. The substitution of C to T allele at position 666 of intron 2 with minor allele frequency (MAF) of 0.359 in case group and 0.423 in control group conferring protection in beta-thalassemia with the odds ratio of 0.765 (p = 0.032). However, we noted that the MAF for IVS2-666C>T from this study was higher when compared with global MAF in the ClinVar (0.286) database but lower compared to 1000 Genome Project (0.713)18. The genotype distribution of this intronic polymorphism revealed the heterozygote had yielded the highest frequency (48.5%) in the control group. Association study done by Akhavan-Niaki et al. (2011) reported that IVS2-666C>T was found to be linked to a mutation at codon 8(-AA) [HBB:c.25_26delAA], of which this β°-mutation was mainly described among the population from the Middle East and the Mediterranean. Hence, the authors suggested that IVS2-666C>T would be useful as a marker for codon 8 genotyping in prenatal diagnosis17.
Meanwhile, two other variants showed a significant susceptibility effect towards beta-thalassemia: IVS2-16G>C and 3'UTR + 314G>A. The MAF for IVS2-16G>C was 0.357 in the case group and 0.420 in the control group conferring susceptibility in beta-thalassemia with the odds ratio of 1.300 (p = 0.036). In comparison to global MAF from the ClinVar database (0.280), MAF findings for IVS2-16G>C in this study were noted higher but lower when compared to the 1000 Genomes Project (0.720)19. The untranslated region (UTR) is the sequence in the 3' region of a gene but not translated during protein synthesis and contains regulatory element for the gene expression20. A variant in the 3'UTR of the HBB gene, which is assigned as 3'UTR + 314 G>A was found to have a significant susceptibility effect towards beta-thalassemia with the odds ratio 2.013 (p = 0.004). The MAFs were found to be 0.092 in the case group and 0.048 in the control group. However, we noted that there was very limited report of this variant in the literature for further comparison. Overall, we noticed that the MAF for the three significant variants in this study were within the range of global MAF from other studies reported in the ClinVar database18,19. The different MAF value could be varied across diverse ethnic or population as well as study sample size21.
In an attempt to further evaluate the role of HBB haplotypes in beta-thalassemia in Malaysia, haplotype analysis revealed several susceptible and protective haplotypes22. The potential applications of haplotype-tagged SNPs have been widely described in the literature. Fields of application include, for example, disease association and pharmacogenetic studies23. Protective haplotype had been observed in association with breast density in fine mapping analysis24. In this study, we identified seven different haplotypes using the five intragenic HBB SNPs. A comparable finding was reported by Bilgen et al. (2011) for the haplotype analysis in the Turkish population. Likewise, the authors have also reported that SNP based haplotyping using five intragenic SNPs has successfully established the beta globin gene mutation related haplotypes16. In the earlier studies done by Fuchareon et al. (2001) and Sanguansermri et al. (2004) also have reported association of certain haplotype pattern with HbE and common beta-thalassemia mutation respectively by using PCR–RFLP method. To the best of our knowledge, no study was done so far to evaluate the important role of intragenic HBB SNPs in thalassemia syndrome in Southeast Asian region. In this study, we identified six significant haplotypes of which, have important role for beta-thalassemia. Noteworthy, individuals with haplotype that consists of all major alleles from our assigned HBB polymorphisms (1-1-1-1-1) might have a higher risk in developing beta-thalassemia. However, if the minor allele from IVS2-666C>T is substituted, the effect becomes a protective effect. This allele transition might reveal the protective role from the minor allele of IVS2-666C>T. The same effect is reflected in IVS2-16G>C. However, the protective effect from the minor allele of IVS2-16G>C was not strong enough to confer susceptibility for this haplotype.
Interesting to note that the combination of both minor alleles from IVS2-666C>T and 3'UTR + 233G>C with other dominant alleles projected higher protection, which elucidates the same protective role from 3'UTR + 233G>C. These synergist effects provide a better outcome for individuals with this haplotype 1-1-2-2-1. The same synergist effect was also observed for haplotype 2-2-1-1-1, which revealed the protective role from IVS2-74T>G and IVS2-16G>C. Likewise, the allele substitution for 3'UTR + 314G>A in haplotype 1-1-2-1-2 dropped the protective effect from haplotype 1-1-2-1-1. The susceptible effect might explain this from a minor allele of 3'UTR + 314G>A. This haplotype-based association analysis was carried out to provide a prediction of the predisposing effect and reveal the severity and possible prognosis using haplotype-tagged SNPs of HBB gene for beta-thalassemia. Thus, this model could be further developed for the improvement of clinical management of beta-thalassemia in Malaysia mainly based on the personalized haplotype profile.
In conclusion, the presented study the first study on intragenic polymorphic markers of the beta-globin gene involving the Malaysian population. Identification of susceptible and protective haplotype markers that conferred the significant association with beta-thalassemia in Malaysia can be further refined following the multi-ethnic background of the Malaysian population. The association data on a single genotype and haplotype might disclose the effect of HBB polymorphisms in beta-thalassemia that might provide an impact in the understanding of beta-thalassemia propensity. This study can be ascertained by larger sample size, and stratification by ethnicity should be deliberated since Malaysia is inhibited by various ethnicity.
Materials and methods
Study population
This cross-sectional study was conducted among the referral case for DNA analysis of thalassemia syndromes in the Institute for Medical Research (IMR), Kuala Lumpur. The study protocol was approved by the Medical Research Ethics Committee [MREC; NMRR-18-3977-43849 (IIR)] and UniSZA Human Research Ethical Committee [UniSZA/UHREC/2020/170]. Informed consent was obtained from each case prior blood collection was done. Protocol of this study was in accordance with the Declaration of Helsinki. A total of 543 (294 controls & 249 cases) archived cases from the year 2011 until 2014 were reviewed for this study. Only cases with valid Malaysian identity card numbers were included in this study. Cases with no sequencing results and no valid Malaysian identity card numbers were excluded from this study. These cases were molecularly ascertained via Sanger sequencing using 3730XL DNA Analyser (Applied Biosystem, Foster City, CA, USA) for the presence of HBB gene variation. Samples with heterozygous or compound heterozygous or homozygous state of HBB gene mutations were grouped as cases. Whilst controls were the samples without the known beta-globin gene mutation.
SNP genotyping
Genomic DNA was extracted from peripheral blood using a commercial DNA extraction kit (QIAGEN, Germany). The detection of the genotype for IVS2-74T>G (HBB:c.315 + 74T>G), IVS2-16G>C (HBB:c.315 + 16G>C), IVS2-666C>T (HBB:c.316-185C>T), 3'UTR + 233G>C (HBB:c.*233G>C) and 3'UTR + 314G>A (HBB:c.*314G>A) polymorphic site in the HBB gene was performed using a direct DNA sequencing technique in which the cycle sequencing used the BigDye® Terminator v3.1 cycle sequencing kit. Sequence analysis was performed on CLC Main Workbench 6 version 6.6.1 software (CLC Bio, Denmark).
Bioinformatics analysis
The SHEsis Online software (http://analysis.bio-x.cn/myAnalysis.php) was employed to assess the Hardy–Weinberg equilibrium, allele frequency, SNPs and haplotype association in which allelic and genotypic distribution were compared between case and control groups25. Bonferroni correction was used for multiple comparison correction. The odds ratios (ORs) value with a 95% confidence interval (95% CI) in which a p value of 0.05 was considered as significant.
References
Old J. Hemoglobinopathies and Thalassemias. Academic Press; :1–44. https://www.sciencedirect.com/science/article/pii/B9780123838346000756?via%3Dihub. Accessed September 5, 2019.
Williams, T. N. & Weatherall, D. J. World distribution, population genetics, and health burden of the hemoglobinopathies. Cold Spring Harb. Perspect. Med. 2(9), a011692. https://doi.org/10.1101/cshperspect.a011692 (2012).
De, S. V. et al. β-thalassemia distribution in the old world: an ancient disease seen from a historical standpoint. Mediterr. J. Hematol. Infect. Dis. https://doi.org/10.4084/mjhid.2017.018 (2017).
Greene, D. N., Vaugn, C. P., Crews, B. O. & Agarwal, A. M. Advances in detection of hemoglobinopathies. Clin Chim Acta. https://doi.org/10.1016/j.cca.2014.10.006 (2014).
Modell, B., Darlison, M. Global Epidemiology of Haemoglobin Disorders and Derived Service Indicators. Vol 86. World Health Organization; 2008:480–487. https://doi.org/10.2471/BLT.06.036673
Mohd, I. H.Malaysian Thalassemia Registry Report 2018. 1st ed. Medical Development Division, Ministry of Health M. Malaysian Thalassaemia Registry. Vol 19.; 2019.
Tan, J. A. M. A. et al. Molecular defects in the beta-globin gene identified in different ethnic groups/populations during prenatal diagnosis for beta-thalassemia: a Malaysian experience. Clin. Exp. Med. 4(3), 142–147. https://doi.org/10.1007/s10238-004-0048-x (2004).
Rosline, H., Ahmed, S. A., Al-Joudi, F. S., Rapiaah, M., Naing, N. N. & Adam, N. A. M. Thalassemia among blood donors at the Hospital Universiti Sains Malaysia. Southeast Asian J. Trop. Med. Public Health. 2006;37(3):549–552. http://www.ncbi.nlm.nih.gov/pubmed/17120978. Accessed August 22, 2019.
Tan, J. A. M. A. A. et al. Transfusion-dependent Thalassemia in northern Sarawak: a molecular study to identify different genotypes in the multi-ethnic groups and the importance of genomic sequencing in unstudied populations. Public Health Genom. https://doi.org/10.1159/000368342 (2018).
Teh, L. K. et al. Molecular basis of transfusion dependent beta-thalassemia major patients in Sabah. J. Hum. Genet. 59(3), 119–123. https://doi.org/10.1038/jhg.2013.131 (2014).
Mohd Yatim, N. F. et al. Molecular characterization of α- and β-thalassaemia among Malay patients. Int. J. Mol. Sci. https://doi.org/10.3390/ijms15058835 (2014).
HBB Gene - GeneCards | HBB Protein | HBB Antibody. https://www.genecards.org/cgi-bin/carddisp.pl?gene=HBB. Accessed June 8, 2020.
Taghavifar, F., Hamid, M. & Shariati, G. Gene expression in blood from an individual with β-thalassemia: an RNA sequence analysis. Mol. Genet. Genomic. Med. 7(7), e740. https://doi.org/10.1002/mgg3.740 (2019).
Tan, J.-A.M.A.A.M.A. et al. High prevalence of alpha- and beta-thalassemia in the kadazandusuns in east Malaysia: Challenges in providing effective health care for an indigenous group. J. Biomed. Biotechnol. https://doi.org/10.1155/2010/706872 (2010).
Elizabeth, G., Ann, M. T. J. A. Genotype-phenotype diversity of beta-thalassemia in Malaysia: Treatment options and emerging therapies. Med. J. Malaysia (2010).
Bilgen, T., Arikan, Y., Canatan, D., Yeşilipek, A. & Keser, I. The association between intragenic SNP haplotypes and mutations of the beta globin gene in a Turkish population. Blood Cells Mol. Dis. https://doi.org/10.1016/j.bcmd.2011.01.004 (2011).
Akhavan-Niaki, H., Seresti, S. S., Asghari, B. & Banihashemi, A. IVSII-666 of human beta-globin gene: a polymorphic marker linked to codon 8(-AA) mutation. Genet. Test Mol. Biomark. https://doi.org/10.1089/gtmb.2010.0242 (2011).
National Center for Biotechnology Information. VCV000036316.3—ClinVar—NCBI. https://www.ncbi.nlm.nih.gov/clinvar/variation/36316/#id_first. Accessed June 14, 2020.
National Center for Biotechnology Information. VCV000256345.3—ClinVar—NCBI. https://www.ncbi.nlm.nih.gov/clinvar/variation/256345/. Accessed June 11, 2020.
Barrett, L. W., Fletcher, S., Wilton, S. D. Untranslated gene regions and other non-coding elements (2013). https://doi.org/10.1007/978-3-0348-0679-4_1
Shamsuddin, A. A., Ahmad, A. & Wan Taib, W. R. Haplotype analysis of leptin gene polymorphisms in obesity among Malays in Terengga Haplotype analysis of leptin gene polymorphisms in obesity among Malays in Terengganu. . Med. J. Malaysia. 73, 281 (2018).
Wan Taib, W. R. Study of Association of TGF-ß1 polymorphism with breast density in a tertiary medical center of Malaysia. Med. Health. https://doi.org/10.17576/mh.2017.1202.15 (2017).
Ouyang, C. & Krontiris, T. G. Identification and functional significance of SNPs underlying conserved haplotype frameworks across ethnic populations. Pharmacogenet. Genomics. https://doi.org/10.1097/01.fpc.0000220569.82842.9b (2006).
Taib, W. R. W. et al. Suggestive evidence of protective haplotype within TGF-B1 gene region in breast density utilizing fine mapping analysis. Meta Gene. https://doi.org/10.1016/j.mgene.2019.100631 (2020).
Yong, Y., He, L., Yy, S. & He, L. SHEsis, a powerful software platform for analyses of linkage disequilibrium, haplotype construction, and genetic association at polymorphism loci. Cell Res. 15, 97. https://doi.org/10.1038/sj.cr.7310101 (2005).
Acknowledgements
The authors would like to thank IMR and MTR for their cooperation and participation in this study. We also would like to thank the Director General of Health, Malaysia, for his permission to publish this article.
Author information
Authors and Affiliations
Contributions
N.A. and W.W.T. wrote the main manuscript text. N.A., N.K., and W.W.T. run the investigation, revised the manuscript for important intellectual content. N.A, N.K., W.W.T., H.A.N.A., W.N.W.A.J., E.E. and H.I. contributed to all aspects of the investigation, including methodological design, data collection and analysis, interpretation of the results. All authors reviewed and revised the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Aziz, NA., Taib, WR.W., Kharolazaman, NK. et al. Evidence of new intragenic HBB haplotypes model for the prediction of beta-thalassemia in the Malaysian population. Sci Rep 11, 16772 (2021). https://doi.org/10.1038/s41598-021-96018-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-96018-y
- Springer Nature Limited