Introduction

Coronary artery disease (CAD) is the leading cause of death worldwide, which is caused by multiple interacting endogenous and exogenous factors. In recent years, genetic basis of CAD has gained considerable interest [1]; heritable factors accounted for 40%-60% in occurrence and development of CAD [2]. Genome- wide association studies (GWAS) some large-scale association analysis have identified many common, uncommon and functional variants for CAD [3, 4].

Collagens are a group of proteins characterized by a unique sequence whose every third amino acid is a glycine (Gly-Xaa-Yaa motif). There are at least 25 different types of collagens. The most abundant four found in mammals are Type I, II, III and IV collagens [5]. Among them, Type IV collagen is the main component comprising 50% of all basement membrane (BM) and is expressed at all tissues including the vasculature, renal glomerular and ocular structures [6, 7]. The vascular BM is an important component of the vasculature. BM surrounds vascular smooth muscle cells in the media making up the barrel of every blood vessel and capillary [8]. As is known, integrity and stability of blood vessels are critical to vascular system, and maintenance of vessel system can be successfully achieved by cooperation of many constituents of blood vessels such as endothelial cells, pericytes, and BM. However failure of this system results in serious consequences such as hemorrhage, edema, inflammation, and tissue ischemia. Type IV collagen, a major component of vascular BM, could provide physical barrier to both soluble molecules and migrating cells and could function as scaffold that allows interaction between pericytes and endothelial cells, thereby contributing to vessel stabilization and enables BM to with-stand mechanical stress [9].

COL4A1 gene is mapped on the telomeric region of 13q (13q34) and consists of 52 exons encodes a1 chain of type IV collagen [10]. In recent years, understanding of polymorphisms of the COL4A1 gene reached a certain degree. For example, Yamada Y et al. [11] identified a novel polymorphism in the COLA1 which is significantly associated with prevalence of myocardial infarction. A Genome-Wide Association Scan (GWAS ) study conducted by Kirill V et al. reported that COL4A1 gene is associated with arterial stiffness [12]. However, the relationship between polymorphisms of COL4A1 gene and CAD remains unclear.

In the present study, we aimed to clarify the association of COL4A1 gene polymorphisms with CAD in a Uygur population of China.

Methods

Ethical approval of the study protocol

This study was approved by the Ethics Committee of the First Affiliated Hospital of Xinjiang Medical University (Xinjiang, China). It was conducted according to the standards of the Declaration of Helsinki. Each participant gave written informed consent and explicitly provided permission for DNA analyses as well as collection of relevant clinical data.

Study population

Study population is comprised of 1095 Uygur people (726 men, 368 women) who lived in Xinjiang Uygur Autonomous Region of China. Among them, 471 subjects attended as inpatients at the First Affiliated Hospital of Xinjiang Medical University between January 2006 to 2011 and underwent coronary angiography, diagnosed with CAD (based on the presence of at least one significant coronary artery stenosis of > 50% luminal diameter). Data and information about traditional coronary risk factors (including hypertension, diabetes mellitus, and smoking) and other biochemical indices were collected from all participants. The diagnosis of hypertension was established if patients were on anti-hypertensive medication or if the mean of 3 measurements of systolic blood pressure (SBP) ≥ 140 mmHg and/or diastolic blood pressure (DBP) ≥ 90 mmHg. Diabetes mellitus was defined on the basis of the World Health Organization (WHO) criteria. Smoking was classified as smokers (including current or ex-smokers) or non smokers. Patients with congenital heart disease, multiple organ failure syndrome and drug users were excluded from this study.

The control subjects were comprised of 624 subjects selected from the Cardiovascular Risk Survey (CRS) [13, 14]. This Survey consists of 14,618 subjects and is a multiple-ethnic, community-based, cross-sectional study designed to investigate the prevalence, incidence, and risk factors for cardiovascular diseases in Han, Uygur, and Kazakh population in Xinjiang (west China) between June 2007 and March 2010. Individuals who had underwent heart bypass surgery, coronary stenting, had history of myocardial infarction, CAD, electrocardiographic signs of CAD, and relevant valvular abnormalities in echocardiograms were excluded from this group.

Biochemical analyses

5 mL of fasting venous blood was collected into tubes containing ethylene diamine tetraacetic acid (EDTA). The collected samples were centrifugated into plasma, serum and blood cells (including leukocytes). Genomic DNA was extracted from the peripheral leukocytes using standard phenol-chloroform method and stored at -80°C for future analysis. We used standard methods for chemical analysis(Dimension AR/AVL Clinical Chemistry System, Newark, NJ, USA) employed by the Clinical Laboratory Department of the First Affiliated Hospital of Xinjiang Medical University [15, 16]. Serum concentrations of total cholesterol (TC), triglycerides (TG), glucose, high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), blood urea nitrogen (BUN), creatinine (Cr) and uric acid were measured.

Genotyping of COL4A1 gene

Using Haploview 4.2 software and International HapMap Project website phase I &II data base (http://www.hapmap.org), we obtained two tag SNPs: SNP1 (rs605143) and SNP2 (rs565470) by using minor allele frequency (MAF) ≥ 0.05 and linkage disequilibrium patterns with r2 ≥ 0.8 as a cutoff. Genotyping in this present case–control study was confirmed by polymerase chain reaction (PCR)-restriction fragment length polymorphism (RFLP) analysis. Sequence information for use as a reference template was obtained from the Ensembl Genome Browser (Human, numberENSG00000187498). Sequencing primers were designed using Primer Premier 5.0 software, synthesis of the Premier was undertaken by Shanghai Genery Biological Technology Company Limited (Shanghai, China). PCR amplification was performed using 25 uL of 2*powder Taq PCR master mix (Beijing Biotech, Beijing, China), 50 ng of genomic DNA, 21 uL of distilled water, 1uL of each forward and reverse primer in a 50 μL final reaction volume. The thermal cycling conditions were as follows: an initial denaturation step at 95°C for 5 min; 30 cycles of 95°C for 30s, 60°C for 35 s and 72°C for 1 min was followed by a final extension step of 72°C for 10 min. Thermal cycling was performed using the GeneAmp 9700 system (Applied Biosystems) and. PCR products were digested by restriction enzyme (Fermentas, Beijing, China) in a 20 μL final reaction volume, along with 10 μL of PCR product, 5U of restriction enzyme, 9 uL of distilled water and 1uL Solution Buffer, incubated overnight at 37°C. The primer pair sequences, annealing temperatures, resulting fragments and restriction enzymes for the two SNPs are detailed in Table 1. Resulting fragments were separated on 3.0% agarose gel (Figure 1). Finally to ensure the results to be verified, we used sequenced genomic DNAs as positive controls in our assays.

Table 1 Primer sequences of each SNP
Figure 1
figure 1

Restriction fragment length polymorphism analysis for determination of genotype. A. For SNP1, the GG genotype shows two bands of 362 bp and 220 bp (1 and 2); The AG genotype shows three bands of 362 bp, 258 bp and 220 bp (3, 4, 6 and 7), The AA genotype shows three bands of 258 bp, 220 bp and 104bp (5). B. For SNP2, the TT genotype shows one band of 416 bp (1 and 4); The CT genotype shows two bands of 416 bp and 324 bp (2 and 3); The CC genotype shows one band of 324 bp (5).

Statistical analyses

All continuous variables were expressed as mean ± standard deviation (S.D), and the differences between CAD and control group were analyzed by using an independent-sample t-test. Differences in frequencies of smoking, drinking, hypertension, diabetes mellitus, and COL4A1 genotypes were analyzed using χ2 test or Fisher’s exact test while appropriate. Hardy–Weinberg equilibrium was assessed by χ2 analysis. Logistic regression analyses with effect ratios (odds ratio [OR] and 95% CI) were used to assess contribution of major risk factors. P value < 0.05 was considered statistically significant. All statistical analyses were performed by using SPSS17.0 software for Windows (SPSS Institute, Chicago, USA).

Result

Table 2 shows demographic and clinical characteristics of 1095 study subjects. For total, including men and women, BMI, systolic blood pressure, diastolic blood pressure, plasma concentration of glucose, triglyceride, uric acid and prevalence of conventional risk factors for CAD including hypertension and diabetes mellitus were significantly higher in subjects with CAD than in the controls, whereas plasma concentration of HDL cholesterol was higher in controls than in subjects with CAD. The plasma concentration of LDL cholesterol, total cholesterol were significantly higher in controls than in subjects with CAD. For woman, the BUN concentration was significantly higher in subjects with CAD than in controls. No significant differences were found in the following variables between CAD subjects and controls: age, drinking and plasma concentration of creatinine (Cr).

Table 2 Demographic and clinical characteristics of study participants

Table 3 shows distribution of genotypes and alleles of SNPs for COL4A1 gene. The genotype distributions for each of the SNPs were in good agreement with the predicted Hardy – Weinberg equilibrium values (data not shown). For total, distribution of rs605143 genotypes, dominant model (GG vs AA + AG), recessive model (AA vs GG + AG) and allele frequency showed significant difference between CAD and control subjects (p = 0.02, p = 0.014, p = 0.048 and p = 0.005). For men, distribution of rs605143 genotypes, dominant model (GG vs AA + AG) and allele frequency showed difference between CAD and control subjects (p = 0.03, p = 0.013 and p < 0.001). For total and men, A allele of rs605143 was significantly higher in controls than in subjects with CAD (total: 46.2%vs 40.1%; men: 47.2%vs 40.4%), the dominant model (GG vs AA + AG) of rs605143 was significantly higher in subjects with CAD than in controls (total: 37.2% vs 30.0%; man: 37.0%vs 28.2%). For total, the recessive model (AA vs GG + AG) of rs605143 was significantly lower in CAD subjects than in controls (17.4%vs 22.4%). There was no significant difference between CAD and control subjects in women for distribution of rs605143 genotypes, dominant model (GG vs AA + AG), recessive model (AA vs GG + AG) and allelic distribution (p > 0.05).

Table 3 Genotype and Allele distributions in patients with CAD and control participants

For total and men, distribution of rs565470 genotypes, dominant model (TT vs CC + CT), recessive model (CC vs TT + CT) and allele frequency showed difference between CAD and control subjects (all P < 0.001). C allele of rs605143 was significantly higher in CAD patients than in control groups (total: 40.7% vs 28.9%; men: 44.8% vs 29.4%), the dominant model (TT vs CC + CT) of rs565470 was significantly higher in controls than in CAD patients (total: 49.4% vs 35.90%; men: 48.0% vs31.8%). The recessive model (CC vs TT + CT) of rs565470 was significantly lower in CAD subjects than in controls (total: 82.8% vs 92.8%; men: 78.6% vs 93.3). There was no significant difference between CAD and control subjects in women for distribution of rs565470 genotypes, dominant model (TT vs CC + CT), recessive model (CC vs TT + CT) and allelic distribution (p > 0.05).

Table 4 shows multivariable logistic regression analysis combining genotypes with following variables: plasma concentration of TG, TC, HDL, LDL, incidence of hypertension, diabetes and smoking which were the major confounding factors for CAD. For total and men (Table 4), after multivariate adjustment, rs605143 remain significantly associated with CAD in dominant model (for total: OR = 1.369, 95% confidence interval [CI]: 1.021-1.834, p = 0.036; for men: OR = 1.583,95% confidence interval [CI]: 1.099-2.281, p = 0.014). For total, after multivariate adjustment, rs605143 remains significantly associated with CAD (OR = 0.661, 95% confidence interval [CI]: 0.465-0.942, p = 0.022) in recessive model (data not shown).

Table 4 Multiple logistic regression analysis for CAD patients and control subjects (rs605143)

For total and men (Table 5), after multivariate adjustment, rs565470 remain significantly associated with CAD in recessive model (for total: OR = 1.993, 95% confidence interval [CI]: 1.277-3.112, p = 0.002; for men: OR = 2.506, 95% confidence interval [CI]: 1.475-4.258, p = 0.001) and in dominant model (for total: OR = 0.708, 95% confidence interval [CI]: 1.021-1.834, p = 0.016; for men: OR = 0.616, 95% confidence interval [CI]: 0.434-0.873, p = 0.006; data not shown for dominant model).

Table 5 Multiple logistic regression analysis for CAD patients and control subjects (rs565470)

Discussion

We found that variation in COL4A1 gene is associated with CAD in a Uygur population of China. After multivariate adjustment, the associations between COL4A1 gene polymorphisms with CAD were not modified. This was the first study to investigate the common allelic variants in COL4A1 gene and its association with CAD in Uygur population of China.

Type IV Collagen was derived from glomerular basement membrane by Kefalides in 1966 [17]. Collagen type IV is major structural component of BM. Thus it is important for integrity and functions of BM [10]. There are 6 different types of type IV collagen α-chains (α1-α6). Each of the six chains of collagen IV has three domains: There is a short 7S domain at the N-terminal; along with collagenous domain occupying the midsection of the molecule, which contains the classic Gly-Xaa-Yaa repeated amino acid sequence and a non-collagenous domain (NC1) is positioned at the C-terminal. These six different collagen type IV alpha chains (α1- α6), form only three sets of triple helical molecules called protomers, which are designated as α1.α1.α2, α3.α4.α4, α5.α5.α6 [4, 1820], among them α1.α1.α2 is very crucial for the protein.

COL4A1 gene which encodes α1 chain of type IV collagen is a new gene identified in the CARDIoGRAM Consortium [4]. Mutations of COL4A1 gene have been initially reported in a mouse bearing a heterozygous that results in in-frame deletion of exon 40, this mutation causes abnormal synthesis of the protein and the protein cannot be properly secreted outside the cell, finally leading mouse prone to brain hemorrhage at their birth [21]. In addition, Gould DB et al. [22] also observed that with in-frame deletion of exon 40 may predispose adult mouse to vascular fragility and act in concert with birth trauma to cause cerebral hemorrhage. The first description of a pathologic COL4A1 mutation in humans was reported in 2005, Gould and co-workers indicated that mutation in COL4A1 was related to congenital autosomal dominant porencephaly [21]. Since then, several researchers reported that mutation in COL4A1 gene results in human hemorrhagic stroke [22, 23]. Besides, mutant mice also showed very tortuous retinal vasculature, perinatal haemorrhage, porencephaly and these conditions were also seen in a French family with familial infantile hemiparesis. Cassandre Labelle-Dumais et al. [24] demonstrates that mutations in COL4A1 gene are a major cause of Muscle-eye-brain disease (MEB) and Walker Warburg Syndrome (WWS), characterized by ocular dysgenesis, neuronal migration defects, and congenital muscular dystrophy, this finding could develop an assay to test the functional significance of putative COL4A1 mutations.

Research about the relationship between polymorphisms of COL4A1 gene and cardiovascular diseases was first reported by Yamada Y et al. [12], they demonstrated that the A → C (Gln1334His) polymorphism (rs3742207) of COL4A1 is associated with prevalence of MI in Japanese people, with C allele protecting against this condition. However, the underlying molecular mechanism is not clear. Tom Van Agtmael et al. [25] indicated that animals with a COL4A1 missense mutation resulting in a complex vascular phenotype including defects in maintenance of vascular tone, endothelial cell function and blood pressure regulation. One genome-wide association study (GWAS) for vascular stiffness measures reported a strong replicated association of SNP (rs3742207) in COL4A1 with arterial stiffness, this study suggesting previously unrecognized cell-matrix interactions may exert an important role in regulating arterial stiffness, but the mechanism to regulate arterial stiffness is not yet clear; further work is needed to elucidate these mechanisms [12].

In our study, we found that polymorphisms of COL4A1 were associated with risk of CAD in a Uygur population. There was significant difference in genotype distribution of rs605143 and rs565470 between CAD patients and control subjects. For rs605143, compared with women, for men, frequency of A allele is higher in control subjects than in CAD patients. There is no difference of A allele between CAD patients and control subjects in women. This result indicated that A allele of rs605143 is a protective factor for CAD in male patients. For total and men, the dominant model (GG vs AG + AA) was significantly higher in CAD patients than in control subjects, after multivariate adjustment of confounding factors such as plasma concentration of TG, TC, HDL, LDL, incidence of hypertension, diabetes and smoking for CAD, the significant difference was retained. This indicated that the G allele might be protecting against for CAD.

For rs565470, compared with women, for men, frequency of C allele is higher in CAD patients than in control subjects. There is no difference of C allele between CAD patients and control subjects in women. This result indicated that C allele of rs565470 is a risk factor for CAD in male patients. For total and men, the recessive model (CC vs TT + CT) was significantly higher in CAD patients than in control subjects, after multivariate adjustment of confounding factors such as plasma concentration of TG, TC, HDL, LDL, incidence of hypertension, diabetes and smoking for CAD, the significant difference was retained. This indicated that the C allele might be a risk factor for CAD.

In conclusion, polymorphisms of COL4A1 gene were associated with CAD in a Uygur population in China. Additional studies will need to be undertaken in order to clarify the underlying molecular mechanism which associates polymorphism of COL4A1 gene with CAD.