Abstract
Colorectal cancer (CRC) is a common complex disease caused by the combination of genetic variants and environmental factors. Genome-wide association studies (GWAS) have been performed and reported some novel CRC susceptibility variants. The rs16892766 (8q23.3) polymorphism was first identified to be significantly associated with CRC in European ancestry. The following studies investigated this association in Chinese, Japanese, Romanian, Swedish, African American, European American and Croatian populations. These studies reported consistent and inconsistent results. Here, we reevaluated this association using the relatively large-scale samples from 13 studies (N = 59737, 26237 cases and 33500 controls) using a meta-analysis by searching the PubMed, Google Scholar and CRCgene databases. We observed no significant heterogeneity among the included studies. Our results showed significant association between rs16892766 polymorphism and CRC (P = 1.33E-35, OR = 1.23, 95% CI 1.20-1.27). Collectively, our analysis further supports previous findings that the rs16892766 polymorphism is significantly associated with CRC susceptibility. We believe that our findings will be very useful for future genetic studies on CRC.
Similar content being viewed by others
Introduction
Colorectal cancer (CRC), also called colon cancer or large bowel cancer, is the third most common form of cancer and the second leading cause of cancer-related death in the Western world and its lifetime risk in the United States is about 7%1. CRC is a common complex disease caused by the combination of genetic variants and environmental factors1. Genome-wide association studies (GWAS) are considered to be a new and power approach to detect the genetic variants of human complex diseases. Recently, GWAS have been performed and reported some novel CRC susceptibility variants2,3,4,5,6.
The rs16892766 (8q23.3) polymorphism was first identified to be significantly associated with CRC in European ancestry (P = 3.30E-18, the odds ratio (OR) = 1.25, 95% confidence interval (CI) 1.19-1.32, Minor allele = C)6. Based on the different genetic architecture, it is important to investigate whether rs16892766 polymorphism is associated with CRC risk in other ethnic populations. The following studies investigated this association in Chinese, Japanese, Romanian, Swedish, African American, European American and Croatian populations6,7,8,9,10,11,12,13,14. The results showed that rs16892766 was not polymorphic in Chinese and Japanese populations12,15,16. The other studies reported consistent and inconsistent results for the association between rs16892766 and CRC. Some studies reported significant association between rs16892766 and CRC (P < 0.05)6,8,9,13,14 and the other studies reported no association between rs16892766 and CRC (P > = 0.05)7,10,11,12.
Recent studies investigated the influence of rs16892766 in Lynch syndrome. Wijnen et al. genotyped the rs16892766 polymorphism in 675 individuals from 127 different families from the Dutch Lynch syndrome Registry whose mutation carrier status was known17. They found a significant association between CRC risk and rs16892766 (8q23.3). The possession of the C-allele was associated with an elevated risk of CRC in a dose-dependent fashion, with homozygosity for CC being associated with a 2.16-fold increased risk17. Talseth-Palmer et al. investigated whether the rs16892766 (8q23.3) acts as modifier of disease risk in patients with Lynch syndrome using 684 mutation-positive patients with Lynch syndrome from 298 Australian and Polish families18. They identified an association between rs16892766 on chromosome 8q23.3 and the risk of developing CRC and age of diagnosis was found in MLH1 mutation carriers18.
It is reported that meta-analysis method involves combining and analyzing quantitative evidence from related studies to produce results based on a whole body of research19. It is a quantitative, formal, epidemiological study design used to systematically assess previous research studies to derive conclusions about that body of research20. The motivation of a meta-analysis is to aggregate information in order to achieve a higher statistical power. Considering the important role of rs16892766 polymorphism in CRC risk and inconsistent results reported by previous studies, we reevaluated this association using the relatively large-scale samples from 13 studies (N = 59737, 26237 cases and 33500 controls) using meta-analysis method by searching the PubMed, Google Scholar and CRCgene databases21.
Methods
Literature search
We searched the PubMed database to select all possible studies with key words including ‘rs16892766’ and ‘colorectal cancer’ or ‘8q23.3’ and ‘colorectal cancer’. The literature search was updated on June 5, 2014. Meanwhile, we used the Google Scholar (http://scholar.google.com/) to query the articles citing the studies and all references in these studies identified by the PubMed. We selected only published articles written in English. Theodoratou et al. report the first comprehensive field synopsis and creation of a parallel publicly available and regularly updated database (CRCgene) that catalogs all genetic association studies on colorectal cancer (http://www.chs.med.ed.ac.uk/CRCgene/)21. They carried out meta-analyses to derive summary effect estimates for 92 polymorphisms in 64 different genes.
Inclusion criteria
We selected the studies meeting the following criteria: (1) the study was conducted by a case-control design; (2) the study evaluated the association between rs16892766 polymorphism and CRC; (3) the study provided the numbers of rs16892766 genotypes or (4) the study must provided sufficient data to calculate the numbers of rs16892766 genotypes or (5) the study provided an OR with 95% CI as well as the P value; or (6) the study must provided sufficient data to calculate the OR and 95% CI;
Data extraction
We extracted the following information from each study: (1) the name of the first author; (2) the year of publication; (3) the population and ethnicity; (4) the numbers of AD cases and controls; (5) the genotype numbers of rs16892766 polymorphism in cases and controls; (6) the numbers of rs16892766 genotypes or (7) to calculate the numbers of rs16892766 genotypes; (8) the OR with 95% CI or (9) to calculate the OR and 95% CI; All relevant calculations were completed using the program R (http://www.r-project.org/).
Genetic model
The rs16892766 polymorphism has two alleles including C and A. C is the minor allele. We assume that C is the high-risk allele and A is the lower-risk allele. We selected the additive genetic model for further meta-analysis. The additive model can be described as C allele versus A allele22.
Heterogeneity test
We evaluated the genetic heterogeneity among the studies included using Cochran's Q test, which approximately follows a X2 distribution with k-1 degrees of freedom (k stands for the number of studies for analysis). , which ranges from 0 to 100%, was also used23. I2 is a measure of heterogeneity and a statistic that indicates the percentage of variance in a meta-analysis that is attributable to study heterogeneity24. Low, moderate, large and extreme heterogeneity corresponded to 0–25%, 25–50%, 50–75% and 75–100%, respectively23. The significant levels for heterogeneity are defied to be with P < 0.01 and I2 > 50%.
Meta-analysis
If there is no significant heterogeneity among the included studies, the pooled OR is calculated by the fixed effect model (Mantel-Haenszel), otherwise the OR is calculated by random-effect model (DerSimonian-Laird). Z test is used to determine the significance of OR. All statistical tests for heterogeneity and meta-analysis were computed using R Package (http://cran.r-project.org/web/packages/meta/index.html).
Sensitivity and publication bias analyses
We evaluated the relative influence of each study by omitting each study at a time. Meanwhile, we used funnel plots to evaluate the potential publication bias25. Begg and Egger's tests were used to evaluate the asymmetry of the funnel plot25.
Results
Literature search
We selected 41 articles from PubMed and Google Scholar databases and two articles from the CRCgene database. Finally, 9 articles including 13 independent studies were included for our following analysis. More detailed information about the inclusion or exclusion of selected studies was described in Figure 1. The main characteristics of the included studies are described in Table 1, which included the name of the first author, the year of publication, the population or ethnicity, the numbers of AD cases and controls and the OR with 95% CI.
Heterogeneity test
We evaluated the genetic heterogeneity of rs16892766 polymorphism among the selected studies using additive model and as well as P value. We did not identify significant heterogeneity among these 13 studies using additive model (P = 0.8239 and I2 = 0%).
Meta-analysis
As described above, we identified no significant heterogeneity among these 13 studies. We then performed a meta-analysis. We calculated the overall OR by the fixed effect model. Our results showed significant association between rs16892766 polymorphism and CRC using additive model (P = 1.33E-35, OR = 1.23, 95% CI 1.20-1.27). In Figure 2, for each study, we list the name of the first author, the year of publication, the population or ethnicity, the OR with 95% CI and the weight in meta-analysis. Detailed results are described in Figure 2.
Sensitivity analysis and publication bias analysis
By excluding any one study, we identified that the association between rs16892766 polymorphism and CRC did not vary substantially. The funnel plots are symmetrical inverted funnels for models (Figure 3), which suggest no significant publication bias for the additive model (Begg's test, P = 0.2206 and Egger's test, P = 0.2206).
Discussion
Recent GWAS identified rs16892766 (8q23.3) polymorphism to be significantly associated with CRC in European ancestry6. The following studies investigated this association and reported consistent and inconsistent results. It is important to assess the genetic architecture of rs999737 polymorphism across different populations. Here, we reevaluated this association using the relatively large-scale samples from 13 studies by searching the PubMed, Google Scholar and CRCgene databases. We first evaluated the genetic heterogeneity of rs16892766 polymorphism among the selected studies. We did not identify significant heterogeneity among these 13 studies using additive model (P = 0.8239 and I2 = 0%). We then conducted a meta-analysis using fixed effect model. Our results showed significant association between rs16892766 polymorphism and CRC using additive model (P = 1.33E-35, OR = 1.23, 95% CI 1.20-1.27). Collectively, our analysis further supports previous findings that the rs16892766 polymorphism is significantly associated with CRC susceptibility. We believe that our findings will be very useful for future genetic studies on CRC.
Before our submission, we accessed the PubMed and Google Scholar databases using the key words ‘rs16892766’ and ‘meta’. We identified two articles8,21. Hutter et al. examined potential effect-modification between 10 loci and probable or established environmental risk factors for CRC in 7,016 CRC cases and 9,723 controls from nine cohort and case-control studies8. They used meta-analysis of an efficient empirical-Bayes estimator to detect potential multiplicative interactions between each of the SNPs and select major CRC risk factors8. The strongest statistical evidence for a gene-environment interaction across studies was for vegetable consumption and rs16892766, located on chromosome 8q23.3, near the EIF3H and UTP23 genes8. Theodoratou et al. carried out meta-analyses to derive summary effect estimates for 92 polymorphisms in 64 different genes and constructed the CRCgene database (http://www.chs.med.ed.ac.uk/CRCgene/)21.
Our study is different from previous studies8,21. Hutter et al. investigated the gene-environment interaction between each of the SNPs and select major CRC risk factors8. We accessed CRCgene databases and found two articles including three studies investigating rs16892766 polymorphism. Here, we conducted an updated analysis to reevaluate the association between rs16892766 polymorphism and CRC using the relatively large-scale samples by searching the PubMed and Google Scholar databases. We observed no significant heterogeneity among the included studies. Our results from this meta-analysis are consistent with the findings from CRC GWAS. Our results showed association between rs16892766 polymorphism and CRC (P = 1.33E-35, OR = 1.23, 95% CI 1.20-1.27), which is more significant than previous GWAS (P = 3.30E-18, OR = 1.25, 95% CI 1.19-1.32)6.
Pittman et al. generated a fine scale map of a 300 Kb region encompassing the rs16892766 association signal using 1,964 CRC cases and 2,081 controls26. A 22 kb genomic region of linkage disequlibrium (LD; Chr8:117,690,773–117,712,909) capturing rs16892766 provided the best evidence for the 8q23 CRC association signal26. Four most significantly associated SNPs-rs16892766, Novel 28, rs16888589 and rs11986063 are strongly correlated with one another (pairwise r2 > 0.75) and constitute a single risk haplotype26. Reporter gene studies demonstrated that the rs16888589, which was in high LD with rs16892766, acts as an allele-specific transcriptional repressor26. Chromosome conformation capture analysis showed that the genomic region harboring rs16888589 interacts with the promoter of gene for eukaryotic translation initiation factor 3, subunit H (EIF3H)26. EIF3H is located at 8q23 and identified to be a CRC susceptibility gene by previous GWAS6,27. Increased expression of EIF3H gene increases CRC growth and invasiveness thereby providing a biological mechanism for the 8q23.3 association26.
Despite these interesting results, our study has a limitation. Here, we investigated the association between rs16892766 and CRC using additive model. It is reported that most meta-analyses used an additive genetic model28. In general, this model performs well when the true underlying genetic model is uncertain28. It was also important to analyze the association between rs16892766 and CRC using dominant model (CC+CA versus AA) and recessive model (CC versus CA+AA)22. Exact genotype numbers of all studies used in our analysis are required for the dominant and recessive models. We attempted to obtain these genotype numbers but were not successful. Considering that the original genotype data are not publicly available for us, future replication studies using genotype data are required to replicate our findings.
References
Lindblom, A. et al. Colorectal cancer as a complex disease: defining at-risk subjects in the general population - a preventive strategy. Expert Rev Anticancer Ther 4, 377–385 (2004).
Broderick, P. et al. A genome-wide association study shows that common alleles of SMAD7 influence colorectal cancer risk. Nat. Genet. 39, 1315–1317 (2007).
Tenesa, A. et al. Genome-wide association scan identifies a colorectal cancer susceptibility locus on 11q23 and replicates risk loci at 8q24 and 18q21. Nat. Genet. 40, 631–637 (2008).
Peters, U. et al. Meta-analysis of new genome-wide association studies of colorectal cancer risk. Hum. Genet. 131, 217–234 (2012).
Peters, U. et al. Identification of Genetic Susceptibility Loci for Colorectal Tumors in a Genome-Wide Meta-analysis. Gastroenterology 144, 799–807 e724 (2013).
Tomlinson, I. P. et al. A genome-wide association study identifies colorectal cancer susceptibility loci on chromosomes 10p14 and 8q23.3. Nat Genet 40, 623–630 (2008).
Middeldorp, A. et al. Enrichment of low penetrance susceptibility loci in a Dutch familial colorectal cancer cohort. Cancer Epidemiol Biomarkers Prev 18, 3062–3067 (2009).
Hutter, C. M. et al. Characterization of gene-environment interactions for colorectal cancer susceptibility loci. Cancer Res 72, 2036–2044 (2012).
Wang, H. et al. Fine-mapping of genome-wide association study-identified risk loci for colorectal cancer in African Americans. Hum Mol Genet 22, 5048–5055 (2013).
Mates, I. N. et al. Association of common genetic variants with colorectal cancer risk in a Romanian sample. Chirurgia (Bucur) 105, 749–757 (2010).
Kirac, I. et al. SMAD7 variant rs4939827 is associated with colorectal cancer risk in Croatian population. PLoS One 8, e74042 (2013).
He, J. et al. Generalizability and epidemiologic characterization of eleven colorectal cancer GWAS hits in multiple populations. Cancer Epidemiol Biomarkers Prev 20, 70–81 (2011).
von Holst, S. et al. Association studies on 11 published colorectal cancer risk loci. Br J Cancer 103, 575–580 (2010).
Kupfer, S. S. et al. Genetic heterogeneity in colorectal cancer associations between African and European americans. Gastroenterology 139, 1677–1685, 1685 e1671–1678 (2010).
Li, F. X. et al. Single-nucleotide polymorphism associations for colorectal cancer in southern chinese population. Chin J Cancer Res 24, 29–35 (2012).
Thean, L. F. et al. Association of Caucasian-identified variants with colorectal cancer risk in Singapore Chinese. PLoS One 7, e42407 (2012).
Wijnen, J. T. et al. Chromosome 8q23.3 and 11q23.1 variants modify colorectal cancer risk in Lynch syndrome. Gastroenterology 136, 131–137 (2009).
Talseth-Palmer, B. A. et al. Colorectal cancer susceptibility loci on chromosome 8q23.3 and 11q23.1 as modifiers for disease expression in Lynch syndrome. J Med Genet 48, 279–284 (2011).
Riley, R. D., Lambert, P. C. & Abo-Zaid, G. Meta-analysis of individual participant data: rationale, conduct and reporting. BMJ 340, c221 (2010).
Haidich, A. B. Meta-analysis in medical research. Hippokratia 14, 29–37 (2010).
Theodoratou, E. et al. Systematic meta-analyses and field synopsis of genetic association studies in colorectal cancer. J Natl Cancer Inst 104, 1433–1457 (2012).
Lewis, C. M. & Knight, J. Introduction to genetic association studies. Cold Spring Harb Protoc 2012, 297–306 (2012).
Liu, G. et al. PICALM Gene rs3851179 Polymorphism Contributes to Alzheimer's Disease in an Asian Population. Neuromolecular Med 15, 384–388 (2013).
Higgins, J. P., Thompson, S. G., Deeks, J. J. & Altman, D. G. Measuring inconsistency in meta-analyses. BMJ 327, 557–560 (2003).
Jiang, Y. et al. Meta-analysis of 125 rheumatoid arthritis-related single nucleotide polymorphisms studied in the past two decades. PLoS One 7, e51571 (2012).
Pittman, A. M. et al. Allelic variation at the 8q23.3 colorectal cancer risk locus functions as a cis-acting regulator of EIF3H. PLoS Genet 6, e1001126 (2010).
Peters, U. et al. Meta-analysis of new genome-wide association studies of colorectal cancer risk. Hum Genet 131, 217–234 (2012).
Gogele, M. et al. Methods for meta-analyses of genome-wide association studies: critical assessment of empirical evidence. Am J Epidemiol 175, 739–749 (2012).
Acknowledgements
This work was supported by funding from the National Nature Science Foundation of China (grant numbers 81300945, 31200934, 81172265 and 31301938) and the National High-Tech Research and Development Project of China (grant number 2012AA022101).
Author information
Authors and Affiliations
Contributions
G.Y.L., Y.S.J. and M.Z.L. conceived and initiated the project, searched the PubMed database and extracted the information from each study. G.Y.L., B.K.Q., X.S.Q., Z.H.Y. and G.Y.W. analyzed the data. R.N.F. and L.C.Z. prepared the figures 1–3. G.Y.L., Y.S.J., M.Z.L. and Y.Q.Z. wrote the manuscript. All authors reviewed the manuscript and contributed to the final manuscript.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Liao, M., Wang, G., Quan, B. et al. Analyzing large-scale samples confirms the association between rs16892766 polymorphism and colorectal cancer susceptibility. Sci Rep 5, 7957 (2015). https://doi.org/10.1038/srep07957
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep07957
- Springer Nature Limited
This article is cited by
-
rs1990622 variant associates with Alzheimer’s disease and regulates TMEM106B expression in human brain tissues
BMC Medicine (2021)
-
Rs4878104 contributes to Alzheimer’s disease risk and regulates DAPK1 gene expression
Neurological Sciences (2017)
-
Colorectal cancer risk genes are functionally enriched in regulatory pathways
Scientific Reports (2016)
-
Genetic Variations in Phospholipase C-epsilon 1 (PLCE1) and Susceptibility to Colorectal Cancer Risk
Biochemical Genetics (2016)
-
Analyzing large-scale samples confirms the association between the rs1051730 polymorphism and lung cancer susceptibility
Scientific Reports (2015)