Abstract
To facilitate the pseudochromosomes assembly and gene cloning in rapeseed, we developed a reference genetic population/map (named BnaZNF2) from two sequenced cultivars, Zhongshuang11 and No.73290, those exhibit significant differences in many traits, particularly yield components. The BnaZNF2 genetic map exhibited perfect collinearity with the physical map of B. napus, indicating its high quality. Comparative mapping revealed several genomic rearrangements between B. napus and B. rapa or B. oleracea. A total of eight and 16 QTLs were identified for pod number and seed number per pod, respectively and of which three and five QTLs are identical to previously identified ones, whereas the other five and 11 are novel. Two new major QTL respectively for pod number and seed number per pod, qPN.A06-1 and qSN.A06-1 (R2 = 22.8% and 32.1%), were colocalised with opposite effects and only qPN.A06-1 was confirmed and narrowed by regional association analysis to 180 kb including only 33 annotated genes. Conditional QTL analysis and subsequent NILs test indicated that tight linkage, rather than pleiotropy, was the genetic causation of their colocalisation. Our study demonstrates potential of this reference genetic population/map for precise QTL mapping and as a base for positional gene cloning in rapeseed.
Similar content being viewed by others
Introduction
Rapeseed, Brassica napus L. (AACC genome, 2n = 38), is mainly cultivated for the production of edible oil and is the second largest oil crop globally, after soybean. Rapeseed originated from recent (≈0.01 million years) allopolyploidy between the ancestors of two diploid species, B. rapa (AA genome, 2n = 20) and B. oleracea (CC genome, 2n = 18)1. Yield is the most important target for the genetic improvement of most crops, including rapeseed. Yield is also the most complex trait in crops. Yield is multiply determined by its component traits2, each of which is quantitatively inherited and controlled by many quantitative trait loci (QTLs). In addition, negative correlations are usually observed among yield components, which has traditionally been considered the genetic trade-off among competitive demands, i.e., antagonistic pleiotropy3. Among the three yield components of rapeseed, pod number is most highly correlated with seed yield, followed by seed number per pod, while the correlation of seed weight with yield is relatively low2, suggesting that the pod number and seed number per pod are the major contributors to the yield in rapeseed.
Nearly ten linkage QTL mapping studies and one association analysis study involving pod number and/or seed number per pod in rapeseed have been reported2,4,5,6,7,8,9,10,11,12, but the genetic and molecular bases of both traits remain ambiguous. First, the linkage maps used in these studies involved markers with unknown sequence information and the genomic sequence of rapeseed was unknown, preventing the integration of these QTLs in the reference physical map and the identification of the underlying genes. Second, the genomic sequences of the parental cultivars used in these studies were unknown, complicating the development of molecular markers and the screening of candidate genes. More importantly, only a few major QTLs for pod number and/or seed number per pod have been identified and none have been fine-mapped or further cloned.
Recent innovations in genome sequencing technology and bioinformatics have enabled the sequencing of European winter type rapeseed cultivar Darmor-bzh13 and Chinese semi-winter type cultivar Zhongshuang11 (http://www.brassica.info/resource/sequencing.php) by our own and several other institutes. Approximately a dozen linkage maps have been developed using different genetic populations of rapeseed (http://www.brassica.info/resource/maps/published-data.php); however, the number of sequence-based markers in these linkage maps was insufficient and none of these genetic populations were derived from Zhongshuang11. To accurately anchor and orientate the 2730 sequence scaffolds of Zhongshuang11 (http://www.ncbi.nlm.nih.gov/genome/genomes/203), a skeleton linkage map of sequence-based markers must be constructed using a reference population derived from itself. In addition, tens of rapeseed cultivars have also been resequenced (http://www.brassica.info/resource/sequencing.php)14. These sequenced cultivars may represent a unique genetic resource for the development of reference populations for genetic study in rapeseed.
Among the sequenced cultivars, Zhongshuang11 (de novo sequencing) and No.73290 (resequencing) exhibit significant differences in many traits related to agronomy, plant/root architecture and yield (particularly pod number and seed number per pod)15, suggesting that they are ideal parents for reference population construction and genetic study in rapeseed. The main objectives of the current study were the following: (1) construction of a skeleton linkage map with sequence-based polymerase chain reaction (PCR) markers using the reference population derived from Zhongshuang11 and No.73290; (2) validation of the quality of reference skeleton linkage map by comparative mapping with the physical maps of B. napus and B. rapa/B. oleracea; (3) mapping and comparison of QTLs for pod number and seed number per pod using the reference skeleton linkage map; (4) validation and fine mapping of the major QTLs by regional association analysis; and (5) dissection of the genetic basis of the colocalised QTLs for pod number and seed number per pod by conditional QTL analysis.
Results
Construction of a skeleton linkage map using the BnaZNF2 population
Of the 9351 screened primer pairs, i.e. 6504 simple sequence repeat (SSR), 2592 single nucleotide polymorphism (SNP), 179 sequence tagged site (STS) and 76 Insertion/Deletion (InDel), 961 (10.3%) primer pairs displayed clear polymorphism between the two parental cultivars Zhongshuang11 and No.73290, producing 1038 sequence-based PCR markers in the BnaZNF2 population (Supplementary Table S1).
All 1038 markers were subjected to Joinmap 4.0 software for linkage analysis, which resulted in a linkage map of 19 linkage groups and 803 markers (710 SSR, 48 SNP, 31 InDel and 14 STS) (Fig. 1; Table 1). Of these, 499 SSR markers (prefixed by “BoSF”, “BrSF” and “BnSF”), 31 SNP (prefixed by “ns”) markers and all 31 InDel markers (prefixed by “BnID”) were newly developed and mapped (Supplementary Table S2). The 19 linkage groups were successfully assigned to the 19 chromosomes of the A (A01-A10) and C (C01–C09) genomes, respectively. The length of the 19 linkage groups ranged from 32.1 (C07) to 148.7 (C03) cM, with a sum and mean of 1763.2 and 92.8 cM, respectively. It should be noted that A05, A08, A10, C07 and C09 were typically shorter than the other 14 linkage groups. The marker density of the 19 linkage groups ranged from 4.01 (C07) to 1.45 (A09) cM, with an average of 2.19 cM.
Only 40 (5.0% of the total) markers exhibited significant (P ≤ 0.001) segregation distortion, of which 5 and 35 skewed to Zhongshuang11 and No.73290, respectively (Table 1). Interestingly, the distorted markers tended to be concentrated on/near the two ends of the linkage groups. The distorted markers on A06 bottom, A09 top and C02 bottom skewed to Zhongshuang11, whereas those on A01 top, A04 bottom and A09 bottom skewed to No.73290 (Supplementary Table S3).
Validation of the quality of BnaZNF2 linkage map by comparative mapping with the physical maps of B. napus and B. rapa/B. oleracea
The availability of the pseudochromosomes of B. napus13, B. rapa16 and B. oleracea17 facilitated the comparison of genetic and physical maps in Brassica. Comparative mapping between the BnaZNF2 linkage map and the physical maps of B. napus and B. rapa/B. oleracea was based on the markers aligned to their pseudochromosomes and the results are displayed using dot-plots (Fig. 2).
A total of 26, 17 and 733 markers were successfully aligned to the pseudochromosomes of B. napus, B. rapa/B. oleracea and both, respectively, while the remaining 27 failed (Supplementary Table S4). Nearly all markers mapped on each of the 19 linkage groups were collinear with the corresponding pseudochromosomes of B. napus. This collinearity was also supported by the high correlation between the genetic and physical positions of the aligned markers, with coefficients ranging from 0.884 (C05) to 0.997 (A03) for the different chromosomes (mean = 0.963) (Table 2). These results strongly indicate the high quality of the current BnaZNF2 linkage map. High collinearity was also observed between most linkage groups and the corresponding pseudochromosomes of B. rapa/B. oleracea, with the exception of C05 and C06. The discontinuous collinearity observed for C05 and C06 was also supported by the intermediate correlation between the genetic and physical positions of the aligned markers, with r = 0.586 and 0.640, respectively. A detailed comparison of the physical positions of these aligned markers revealed several obvious genomic rearrangements between the pseudochromosomes (including C05 and C06) of B. napus and B. rapa/B. oleracea: translocation in one location in A01, A02 and C01 and in two locations in C05 and C06; inversion in one location in C02 and C06; and reshuffling in A05 and C04 (Fig. 2; Supplementary Table S4).
Phenotypic variation and correlation of pod number and seed number per pod in the parents and linkage population
The two parents (Zhongshuang11 and No.73290) differed significantly in pod number and seed number per pod in all four investigated environments (Table 3). The pod number of No.73290 was approximately 50% greater than that of Zhongshuang11, while the seed number per pod of Zhongshuang11 was nearly twice that of No.73290. Interestingly, the pod number of branch inflorescence was nearly twice that of main inflorescence in both the parents and populations. A normal or near-normal distribution was observed in the populations for both traits in all four investigated environments (Fig. 3), indicating a quantitative inheritance suitable for QTL identification. In addition, the populations exhibited transgressive segregation of both traits but with a small degree (Table 3), indicating that the favourable alleles were mainly distributed on one of the two parents.
The pod number of the whole plant was significantly positively correlated with those of the main and branch inflorescence with moderate and high coefficients, respectively (Table 4), suggesting that the pod number of the whole plant is mainly determined by that of the branch followed by the main inflorescence. The pod numbers of the main and branch inflorescence were also significantly positively correlated with moderate coefficient, suggesting that the genetic determination of the pod number of the main and branch inflorescence differs. The seed number per pod of the main and branch inflorescence were significantly positively correlated with high coefficient, indicating similar genetic control. As expected, the pod number and seed number per pod were significantly negatively correlated for the main and branch inflorescence as well as the whole plant, suggesting competition among sinks for assimilates. However, these correlation coefficients were all small, suggesting that the final seed yield of rapeseed (B. napus L.) could be improved by increasing its component traits, such as pod number and/or seed number per pod.
Analysis of variance (ANOVA) revealed that genotype, environment and the genotype × environment interaction all have significant effects on both the pod number and seed number per pod of the main and branch inflorescence as well as whole plant (Supplementary Table S5). The broad-sense heritability of the pod number was moderate as follows: main inflorescence (0.65)> whole plant (0.49)> branch inflorescence (0.45) (Table 4). Whereas, the broad-sense heritability of the seed number per pod was equally high for the main (0.83) and branch (0.82) inflorescence as well as whole plant (0.83).
Linkage mapping and integration of the QTLs for pod number and seed number per pod
A total of 69 QTLs (51 and 18 at significant and suggestive levels, respectively) were detected for pod number and seed number per pod in five experiments (Supplementary Table S6A). After deleting seven non-reproducible suggestive QTLs, 16 and 46 QTLs were identified for pod number and seed number per pod, respectively, which were distributed on seven (A01, A03, A05, A06, A09, C02 and C06) and nine (A01-A03, A05-07, A09, C02 and C06) of the 19 linkage groups (Supplementary Table S6B). Where the confidence intervals of these identified QTLs in different experiments overlapped, they were integrated into a single consensus QTL for each trait (Supplementary Table S6C). Eight and 16 consensus QTLs were obtained for pod number and seed number per pod, respectively and five and 13 of these were reproducible (Table 5).
Interestingly, the numbers of identified QTLs and consensus QTLs for seed number per pod were both much larger than those for pod number, consistent with the much higher heritability of seed number per pod compared to pod number (Table 4). In addition, the Zhongshuang11 alleles in the majority of both the identified and consensus QTLs decreased the pod number but increased the seed number per pod (Table 5; Supplementary Table S6C), consistent with the higher seed number per pod and lower pod number of Zhongshuang11 compared to No.73290 (Table 3). The consensus QTLs for pod number and seed number per pod explained 2.3–22.8% and 2.7–32.1% of the phenotypic variance, respectively. The additive/dominant effect of the consensus QTLs for pod number and seed number per pod ranged from −9.65/−7.48 to 7.29/9.17 and from −1.73/−1.05 to 1.97/1.59, respectively. Two consensus QTLs on the A06 linkage group (qPN.A06-1 and qSN.A06-1) were repeatedly detected in four and all five experiments, respectively and displayed a large effect (R2 = 22.8% and 32.1%); thus, they can be treated as major QTLs (Fig. 4). Interestingly, the confidence intervals of three pairs of consensus QTLs for pod number and seed number per pod (qPN.A05-1/qSN.A05-1, qPN.A06-1/qSN.A06-1 and qPN.C02-1/qSN.C02-1) overlapped well (Table 5). Of these, only qPN.A06-1 and qSN.A06-1 exhibited an opposite additive/dominant effect in the same experiments, explaining the moderate genetically negative correlation between pod number and seed number per pod (Table 4).
Physical map-based comparison of currently and previously detected QTLs for pod number and seed number per pod in rapeseed
The availability of the pseudochromosomes of B. napus13 also enabled the physical map-based comparison of currently and previously detected QTLs for pod number and seed number per pod in rapeseed. The corresponding genomic intervals of these QTLs were determined by BLAST/e-PCR analysis using the associated markers (within confidence intervals) with available probe/primer sequences against the physical map of B. napus.
The genomic regions of the majority (≈80%) of these QTL could be determined, of which approximately a quarter overlapped (Table 6; Supplementary Table S7). In detail, qPN.A03-1, qPN.A03-2, qPN.C06-1, qSN.A01-1, qSN.A09-1, qSN.C02-1, qSN.C06-1 and qSN.C06-2 identified in the current study corresponded to Sil/dm2_N3, qPN.A3-2, PNNP-C6, qSN.A1-1, qSN-LP1-A9a/qSN-OP2-A9, qSN.C2-1, qSN.C6-4 and qSN.C6-1, respectively, identified in the previous studies2,5,8,9. In addition, PNLP-A3 and qPN-LP2-A3b, PNNP-A5 and qPN.A5-2, qPN.A6-1 and PNLP-A6, Sil/dm2_N7 and qPN.A7-1, NPP1_LG4 and qPN.A7-2, qPN-LP1-A9 and Sil/dm2_N9, qPN-LP3-C3a and qPN.C3-1, qSNA1 and qSN.A1-7, qSN-OP3-A2/qSN-OP1-A2a/qSN-LP3-A2 and SNLP-A2, qSN.A3-1 and SNLP-A3, qSN.A5-1 and qPN-OP1-A5, S/Sil_N5 and qSN.A5-2, qSN.A9-1 and SNNP-A9, qSN-OP1-C1 and S/Sil_N11, qSN-OP3-C6b/qSN-LP1-C6 and qSNC6, qSS.N19 and qSN.C9-3, which were identified in previous studies2,4,5,9,11,12, were also determined to be identical.
Fine mapping of the major QTLs for pod number and seed number per pod by regional association analysis
To further fine-map the two colocalised major QTLs for pod number and seed number per pod, regional association analysis was conducted. The corresponding genomic region of the target major QTL was identified by the alignment of the primer-pairs sequences of the molecular markers within the QTL interval and reference genomic sequences of B. napus13 and B. rapa16. A total of 103 putative single-locus SSR markers (Supplementary Table S8) within the corresponding ~1.7 Mb genomic region of B. rapa18 were synthesised and used to screen polymorphism among several mini-core collection of the association population15. Of these, 35 SSR markers that amplified only one main band and exhibited obvious polymorphism were selected for further regional association analysis.
A large range of phenotypic variation was observed for both pod number (32–106) and seed number per pod (7.6–28.0) among the 576 inbred lines of the association population. Based on population structure and family relatedness, regional association analysis was conducted using mixed linear mode (MLM) by TASSEL3.0 software. Interestingly, a significant (P < 0.001) association signal was only observed for pod number and not for seed number per pod (Fig. 5). Of the six SSR markers associated with pod number, BrSF46–177 displayed the strongest association signal (P = 2.2E-08 and R2 = 8.7%); this signal was very close (13 kb) to the peak marker (BrSF46–167) of qPN.A06-1 identified by linkage analysis (Fig. 1; Table 5). Therefore, the confidence interval of qPN.A06-1 was defined by genomic region between BrSF46-136 and BrSF46-238, which corresponds to 180 kb (from 22,607 to 22,787 kb) including only 33 annotated genes in the reference genome of B. napus (Table 7). Because the extent of linkage disequilibrium (LD) decay for the target QTL region was estimated to be 193 kb (Fig. 6), it was impossible to further fine-map it using the current association population. In addition, the –log(P) values of the association of the six SSR markers with pod number and seed number per pod were not accordant (Fig. 5). These results also suggested that the colocalisation of qPN.A06-1 and qSN.A06-1 was not likely caused by pleiotropy.
Dissection of the genetic basis of the colocalised QTLs for pod number and seed number per pod by conditional QTL analysis
To determine the genetic basis (pleiotropy vs. tight-linkage) of the colocalisation of qPN.A06-1 and qSN.A06-1 in the same experiments, conditional QTL analysis was performed in both the linkage and association populations (Table 8). Regardless of whether pod number was conditioned by seed number per pod or seed number per pod was conditioned by pod number, the LOD values and R2 of the two major QTLs, as well as the –log(P) values and R2 of the six associated markers, exhibited only a small variation. These results suggest that tight linkage rather than pleiotropy is more likely the genetic basis of the colocalisation of qPN.A06-1 and qSN.A06-1.
Discussion
In the current study, an F2 population from two sequenced rapeseed cultivars was developed. Among the reported genetic populations in rapeseed5,19,20,21,22,23,24, this is the first for which parental genomic information is available, which will greatly facilitate subsequent studies. In addition, Zhongshuang11 and No.73290 exhibit large differences in many traits related to agronomy, plant/root architecture and yield15. More importantly, many QTLs with relatively large effects have been identified in this population, including for pod length, seed weight15, pod number, seed number per pod (Table 5) and flowering time (Shi et al. unpublished data). Due to the heterozygosity of the current BnaZNF2 population, the corresponding recombinant inbred lines population (named BnaZNRIL) was also developed in our lab. All major QTLs identified from the BnaZNF2 population have been fine-mapped in our laboratory and the functional validation of the candidate genes is in progress. Therefore, the BnaZNF2/BnaZNRIL population is an ideal reference population for QTL mapping and map-based gene cloning in rapeseed.
To facilitate pseudochromosome assembly of Zhongshuang11, a skeleton linkage map of 803 sequence-based PCR markers was constructed using the BnaZNF2 population (Table 1). A total of 561 markers (499 SSR, 31 SNP and all 31 InDel) on the BnaZNF2 linkage map were developed in this study (Supplementary Table S2), increasing the number of sequence-based PCR markers in rapeseed14. To our knowledge, this is the first rapeseed linkage map comprising only sequence-based PCR markers. In addition, the current linkage map mainly (88.4%) comprises SSR markers and has more SSR markers (710) than other published linkage maps of B. napus (http://www.brassica.info/resource/maps/published-data.php). These favourable characteristics will facilitate the easy transfer of these markers to other genetic populations and the comparison/integration of different linkage maps in B. napus as well as in other Brassica species25,26.
The existence of short linkage groups is commonly observed in published linkage maps of B. napus5,9,12,27,28,29,30, but the detailed causation (insufficient number of markers, poor polymorphism of parents and low frequency of recombination) has not been investigated. Although a total of 9351 primer-pairs were used to screen the polymorphic markers for map construction (Supplementary Table S1), five short linkage groups were also identified in the current BnaZNF2 linkage map (Table 1). Of these, four linkage groups (A08, A10, C07 and C09; A05 was the exception) were nearly the same short length in a high-density genetic map constructed using the BnaZNRIL population and the Brassica 60K Illumina Infinium SNP array (Shi et al. unpublished data); therefore, they were unlikely to be caused by an insufficient number of markers. Because its genomic coverage ratio was not low, the short length of the C09 linkage group was obviously due to its low recombination frequency among all 19 linkage groups. For the A08, A10 and C07 linkage groups, their short length is likely due to the low polymorphism between parents because their genomic coverage ratios were very low, while the recombination frequencies were not low. In fact, the A08 and A10 linkage groups are also short in more than half of the published linkage maps of B. napus5,9,12,27,28,29,31. Thus, a lack of genetic diversity is likely a common characteristic of the two chromosomes in rapeseed, which may indicate a new direction for breaking the genetic bottleneck in rapeseed cultivars.
Interestingly, most of the distorted markers tended to cluster on both ends of the linkage groups and skew to the same parent (Supplementary Table S3). This clustering is not likely due to chance and suggests the presence of a segregation distortion region (SDR) in rapeseed, which can be explained by factors such as the selection of gametophytes and sporophytes, homeologous recombination and residual heterozygosity in parents32.
The major part of the constructed BnaZNF2 linkage map and physical map of B. napus were collinear (Fig. 2; Table 2), suggesting its high quality and suitability for rapeseed genomic studies such as pseudochromosome assembly and comparative mapping. The inconsistency between the genetic and physical distance of a few tightly-linked markers (Supplementary Table S4) may be due to genotyping errors, misanchored scaffolds or distorted markers. This is the first comparative study of the genetic and physical map of rapeseed. The BnaZNF2 skeleton linkage map-based comparison of the physical maps of B. napus and B. rapa or B. oleracea revealed several genomic rearrangements (including inversion, translocation and reshuffling) between these species. These rearrangements should occur very recently, after the formation of B. napus from the progenitors of the two diploid species B. rapa and B. oleracea. This finding is consistent with the reported genomic rearrangements in the resynthesised Brassica polyploids, including B. napus33,34,35,36.
In the reported linkage2,4,5,6,7,8,9,11,12 and association10 mapping studies of pod number and/or seed number per pod, more than 70 and 80 QTLs have been detected for pod number and seed number per pod, respectively, distributed on 16 (excluding A04, C04 and C07) and 17 (excluding C04 and C07) of the 19 linkage groups (Supplementary Table S7). Most of these QTLs exhibited a moderate effect and only two on A02 (qSNP2) and C01 (qSNP11b) for pod number, three on A01 (qSNA1), C03 (cqSS.N13) and C09 (qSS.C9) for seed number per pod and one on C01 (qSNP11a/qSS11) for both traits could be considered major QTLs6,7,11. In the current linkage and regional association analysis study, a total of eight and 16 QTLs were identified for pod number and seed number per pod, respectively, which were distributed on seven (A01, A03, A05, A06, A09, C02 and C06) and nine (A01-A03, A05-07, A09, C02 and C06) of the 19 linkage groups (Table 5). One and two thirds of them, respectively, exhibited a relatively large (R2 ≥ 10%) or moderate (R2 < 10%) effect and only two each for pod number and seed number per pod should be considered major QTLs. The recent completion of sequencing and assembling of rapeseed genome enabled the first physical map-based comparative QTL analysis in the current study, which undoubtedly increased the accuracy of comparison. Only three QTLs for pod number and five QTLs for seed number per pod identified in the current study had been reported previously (Table 6), whereas the remaining five and 11 are novel (Supplementary Table S7). In addition, seven pairs of QTLs for pod number and nine pairs of QTLs for seed number per pod, those had been reported in the previous studies2,4,5,9,11,12, are also identical. The aforementioned 24 pairs of QTLs common to the current and previous studies should be potential targets for marker-assisted selection to improve yield in rapeseed. More importantly, the major QTLs identified across these studies will be important targets of map-based gene cloning for elucidating the molecular basis of yield in rapeseed. These results demonstrate that both pod number and seed number per pod are controlled by a large number of loci that are distributed on nearly all of the 19 linkage groups and mostly exhibit moderate effects, which strongly suggests the complexity of the genetic architecture of both traits. This complexity is understandable because the pod number per plant and seed number per pod are affected by many biological/developmental processes, including inflorescence/flower/ovule differentiation, fertilization and pod/seed development37,38,39, each of which involves many genes40,41.
Several major QTLs for pod number and/or seed number per pod had been identified in previous QTL mapping studies6,7,11,12, but none have been fine-mapped and further cloned. The traditional/classical fine mapping strategy is based on screening of recombinant individuals from a large-scale segregation population of near-isogenic lines (NILs) developed from several rounds of successive backcrossing and/or self-crossing, which is time-consuming and labour intensive42. To directly fine-map the two colocalised major QTL (qPN.A06-1 and qSN.A06-1) identified in the current study, a regional association strategy was proposed by using the high resolution of the natural population15. Using this strategy, qPN.A06-1 was successfully narrowed to a much smaller (approximately 1/7) interval of 180 kb including only 33 annotated genes (Fig. 5). This result suggests that regional association analysis is an effective and highly efficient strategy for direct fine mapping of target major QTLs identified by preliminary linkage analysis. According to the detailed annotation of the 33 genes (Table 7), a homeodomain-like superfamily protein gene GSBRNA2T00001826001 was likely to be the candidate43, because the two parents Zhongshuang11 and No.73290 showed significant difference in flowering time and flower organ number. To further confirm the candidate gene(s), comparative sequencing of the 180 kb genomic region between the two parents is in progress. However, qSN.A06-1 was not validated by regional association analysis using the same association population and markers. This failure may have occurred because the genetic variation responsible for the major QTL of seed number per pod is rare in our association population or the expression of this major QTL depends on the genetic background44. These results suggest that the use of regional association analysis for fine mapping of the target major QTL depends on the type of variants (rare or common) and/or genetic background. These results also suggest that the fine mapping of qSN.A06-1 have to use the traditional/classical NIL strategy.
Resource limitation force an organism to allocate energy to processes in a competitive manner45. In seed plants, the major trade-off between the number of fruits per inflorescence/plant and the number of seeds per fruit, which was traditionally explained by antagonistic pleiotropy, can be quantitated/reflected by the negative coefficients of genetic correlation3. For rapeseed, the negative correlation between pod number and seed number per pod has been commonly observed, including in the present study and is consistent with the opposite additive-effect direction of most of the colocalised QTLs for both traits2,6,9,46. However, the genetic basis of these colocalised QTLs with the opposite effect for both traits has not previously been investigated and remains unknown. The existence of two colocalised major QTL with opposite direction for pod number and seed number per pod in the current study provided an ideal example to test the hypothesis of antagonistic pleiotropy. The genetic basis of their colocalisation was then investigated by further regional association analysis (Fig. 5) followed by conditional QTL analysis (Table 8). The results of both analyses suggested that their colocalisation was caused by tight-linkage rather than pleiotropy, which was also confirmed by the subsequent measurement of both traits on the NILs that were recombinant at the target major QTL region (Shi et al. unpublished data). This is the first study that provides solid evidences to clearly demonstrate a negative correlation between yield component traits in rapeseed as well as other crops. The existence of two tightly linked major QTLs for pod number and seed number per pod also indicates the retention of a large linkage drag during the breeding of the elite rapeseed cultivar Zhongshuang11. Therefore, the strategy described in this paper will be effective in increasing the pod number of Zhongshuang11 without decreasing the seed number per pod via the introgression of qPN.A6 through backcross and marker-assisted selection47.
Methods
Population construction, field experiment and trait measurement
The reference genetic population (named BnaZNF2) used for the linkage analysis included 184 F2 individuals derived from Zhongshuang11 and No.73290. The natural population used for the association analysis was described in our previous study and comprises 576 global inbred lines, including the two parents of the linkage population15.
The F2 individuals were planted in Wuhan from Oct. 2008 to May 2009 (code W09F2). The F2:3 families were planted in Wuhan from Oct. 2009 to May 2010 (code W10F2:3) and Oct. 2010 to May 2011 (code W11F2:3) as well as in Xining from April to Aug. 2011 (code X11F2:3). The F2:4 families were planted in Xining from April to Aug. 2011 (code X11F2:4). The association population was planted in Wuhan from Oct. 2011 to May 2012 (code W12AP). The planting of the F2:3, F2:4 and association populations followed a randomised complete block design with two to three replications. Each block contained two rows, with 33.3 cm between rows and 16.7 cm between individual plants. The seeds were sown by hand and the field management followed standard agricultural practice. In each block, 10 representative individuals from the middle of each row were harvested by hand at maturity.
Pod number was measured as the number of effective pods on the main inflorescence, branch inflorescence and whole plant, respectively (PNm, PNb and PNw). Seed number per pod was calculated as the number of well-filled seeds on the main inflorescence, branch inflorescence and whole plant, respectively, divided by the corresponding pod number (SNPPm, SNPPb and SNPPw).
Molecular marker analysis and linkage map construction
Three groups of molecular markers were first used to screen polymorphism between the two parents of the linkage population and the single-locus markers were then used to screen the mini-core collection of the association population. The first group, mainly consisting of SSR and STS markers, was downloaded from the public database of Brassica (http://www.brassica.info/resource/markers.php; http://brassica.nbi.ac.uk/IMSORB/). The second group, consisting of SSR markers (prefixed “BoSF”, “BrSF” and “BnSF”), was developed by our lab from the sequence scaffolds of B. rapa, B. oleracea and B. napus18. The third group consisted of SNP (prefixed “snap” and “ns”) and InDel (prefixed “BnID”) markers and was developed in our lab from the re-sequencing of several B. oleracea cultivars48 and No.7329014. For markers detected at more than one polymorphic locus, a lowercase alphabetic letter was added to distinguish the different loci. PCR, electrophoresis and silver staining were performed as previously described18.
The genetic linkage map was constructed using the software JoinMap 4.0 (http://www.kyazma.nl/index.php/mc.JoinMap) using a threshold for goodness-of-fit of ≤ 5, a recombination frequency of < 0.4 and minimum logarithm of odds (LOD) score of 2.0. All genetic distances were expressed in centimorgans (cM) as derived by the Kosambi function. Double-crossover events were examined and the original scores were rechecked for potential scoring errors.
Linkage groups were assigned to the corresponding chromosomes (A01–A10; C01–C09) based on the common markers in the reported linkage maps of B. napus (http://www.brassica.info/CropStore/maps.php)5,20,22,49,50,51,52.
QTL mapping and integration
QTL mapping was conducted using the composite interval mapping (CIM) procedure53 incorporated in WinQTLCart v2.5 software (http://statgen.ncsu.edu/qtlcart/WQTLCart.htm). The walk speed, number of control markers, window size and regression method were set to 1 cM, 5, 10 cM and forward regression, respectively. The default minimum distance between QTLs (5 cM) and minimum LOD from top to valley (1) were used to define a QTL peak in an experiment. The experiment-wise LOD threshold was determined by permutation analysis54 with 1000 repetitions. The LOD score corresponding to P = 0.05 (3.0–4.6) was used to identify significant QTLs. To avoid missing QTLs with a relatively small effect, a lower LOD score, corresponding to P = 0.10 (2.6–4.1), was adopted for the presence of suggestive QTLs. Both significant QTLs and reproducible suggestive QTLs were admitted55 and named as identified QTLs.
Because QTLs detected in different experiments and mapped to the same region of a chromosome might be several estimates of the position of a single QTL56, integration of these reproducible QTLs was conducted.
Linkage disequilibrium and regional association analysis
Linkage disequilibrium (LD) was estimated as the correlation coefficient R2 between all pairs of markers using the standalone software TASSEL v3.057. Rare alleles with a frequency of <0.05 were treated as missing data. The background level of LD was defined as the 95% quantile of the R2 value among unlinked markers58. The LD decay was calculated using the linked markers as previously described58.
The structure of the association population was inferred using the software STRUCTURE v2.3.459. The iteration number and K-value (the putative number of genetic groups) were set to 3 and from 1 to 10, respectively. Both the length of the burnin period and number of MCMC (Markov Chain Monte Carlo) replications after burnin were set to 10000. The most likely K-value was determined by posterior probability [LnP(D)] and an ad hoc statistic Δk based on the rate of change in LnP(D) between successive k60.
The relative kinship of the 576 inbred lines of the association population was calculated using SPAGeDi v1.4 software61. All negative kinship values between individuals were set to zero.
The regional association analysis was conducted using single-locus SSR markers within the confidence interval of the target major QTL in the association population of 576 inbred lines. Based on the population structure and relative kinship (Q and K matrix), the calculation was performed with a mixed linear model (MLM) incorporated into the TASSEL v3.0 software. The threshold of the significant marker-trait association was set to P ≤ 0.001.
Conditional QTL analysis
To determine the genetic basis of the colocalisation of QTLs for different traits62, conditional QTL analysis was performed using the conditional phenotypic values. The conditional phenotypic values y (T1|T2) were obtained by the mixed model approach for the conditional analysis of quantitative traits63 using QGAStation 1.0 (http://ibi.zju.edu.cn/software/qga/index.htm), where T1|T2 indicates that trait 1 is conditioned by trait 2.
Statistical analysis
Broad-sense heritability was calculated as h2 = σg2/(σg2 + σge2/n + σe2/nr), where σg2, σge2 and σe2 are the variances of genotype, genotype by environment and error, respectively, while n and r are the number of environments and replicates, respectively. The genetic correlation was calculated as rG = covGxy/(σGx2 × σGy2)1/2, where covGxy is the genotypic covariance and σGx2 and σGy2 are the variances of the pairwise traits. The significance of each genetic correlation was determined using a t-test of the correlation coefficients64. The components of variance/covariance and the coefficients of correlation were estimated using the PROC GLM or CORR procedure, respectively, incorporated into SAS software version 8.1. The Excel statistical functions CHISQ.TEST and T.TEST were used to obtain the significance level (Px2-test and Pt-test) of the degree-of-fit and differences, respectively.
Additional Information
How to cite this article: Shi, J. et al. Linkage and regional association analysis reveal two new tightly-linked major-QTLs for pod number and seed number per pod in rapeseed (Brassica napus L.). Sci. Rep. 5, 14481; doi: 10.1038/srep14481 (2015).
References
U, N. Genome analysis in Brassica with special reference to the experimental formation of B. napus and peculiar mode of fertilization. Jap. J. Bot. 7, 389–452 (1935).
Shi, J. et al. Unraveling the complex trait of crop yield with quantitative trait loci mapping in Brassica napus. Genetics 182, 851–861 (2009).
Gardner, K. M. & Latta, R. G. Shared quantitative trait loci underlying the genetic correlation between continuous traits. Mol. Ecol. 16, 4195–4209 (2007).
Yi, B., Chen, W., Ma, C., Fu, T. & Tu, J. Mapping of quantitative trait loci for yield and yield components in Brassica napus L. Acta Agronomica Sinica 32, 676–682 (2006).
Radoev, M., Becker, H. C. & Ecke, W. Genetic analysis of heterosis for yield and yield components in rapeseed (Brassica napus L.) by quantitative trait locus mapping. Genetics 179, 1547–1558 (2008).
Wang, F. & Guan, C. Molecular mapping and identification of quantitative trait loci for yield components in rapeseed (Brasscia napus L.). Hereditas 32, 271–277 (2010).
Zhang, L., Li, S., Chen, L. & Yang, G. Identification and mapping of a major dominant quantitative trait locus controlling seeds per silique as a single Mendelian factor in Brassica napus L. Theor. Appl. Genet. 125, 695–705 (2012).
Shi, T. et al. QTL for yield traits and their association with functional genes in response to phosphorus deficiency in Brassica napus. Plos One 8, e54559 (2013).
Ding, G. et al. Quantitative trait loci for seed yield and yield-related traits and their responses to reduced phosphorus supply in Brassica napus. Ann. Bot. 109, 747–759 (2012).
Cai, D. et al. Association mapping of six yield-related traits in rapeseed (Brassica napus L.). Theor. Appl. Genet. 127, 85–96 (2014).
Qi, L. et al. Interpreting the genetic basis of silique traits in Brassica napus using a joint QTL network. Plant Breeding 133, 52–60 (2014).
Zhang, L. et al. Genetic and correlation analysis of silique-traits in Brassica napus L. by quantitative trait locus mapping. Theor. Appl. Genet. 122, 21–31 (2011).
Chalhoub, B. et al. Early allopolyploid evolution in the post-Neolithic Brassica napus oilseed genome. Science 345, 950–953 (2014).
Huang, S. et al. Identification of genome-wide single nucleotide polymorphisms in allopolyploid crop Brassica napus. BMC Genomics 14, 717 (2013).
Li, N., Shi, J., Wang, X., Liu, G. & Wang, H. A combined linkage and regional association mapping validation and fine mapping of two major pleiotropic QTLs for seed weight and silique length in rapeseed (Brassica napus L.). BMC Plant Biol. 14, 114 (2014).
Wang, X. et al. The genome of the mesopolyploid crop species Brassica rapa. Nat. Genet. 43, 1035–1039 (2011).
Liu, S. et al. The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes. Nat. Commun. 5, 3930 (2014).
Shi, J. et al. Genome-wide microsatellite characterization and marker development in the sequenced Brassica crop species. DNA Res. 21, 53–68 (2014).
Qiu, D. et al. A comparative linkage map of oilseed rape and its use for QTL analysis of seed oil and erucic acid content. Theor. Appl. Genet. 114, 67–80 (2006).
Chen, W. et al. Detection of QTL for six yield-related traits in oilseed rape (Brassica napus L.) using DH and immortalized F(2) populations. Theor. Appl. Genet. 115, 849–858 (2007).
Delourme, R. et al. Genetic control of oil content in oilseed rape (Brassica napus L.). Theor. Appl. Genet. 113, 1331–1345 (2006).
Basunanda, P. et al. Comparative mapping of quantitative trait loci involved in heterosis for seedling and yield traits in oilseed rape (Brassica napus L.). Theor. Appl. Genet. 120, 271–281 (2010).
Zhao, J. Y., Becker, H. C., Zhang, D. Q., Zhang, Y. F. & Ecke, W. Oil content in a European × Chinese rapeseed population: QTL with additive and epistatic effects and their genotype-environment interactions. Crop Sci. 45, 51–59 (2005).
Xiao, S. et al. Generation and mapping of SCAR and CAPS markers linked to the seed coat color gene in Brassica napus using a genome-walking technique. Genome 50, 611–618 (2007).
Marquez-Lema, A., Velasco, L. & Perez-Vich, B. Transferability, amplification quality and genome specificity of microsatellites in Brassica carinata and related species. J. Appl. Genet. 51, 123–131 (2010).
Ramchiary, N. et al. Genic microsatellite markers in Brassica rapa: development, characterization, mapping and their utility in other cultivated and wild Brassica relatives. DNA Res. 18, 305–320 (2011).
Delourme, R. et al. High-density SNP-based genetic map development and linkage disequilibrium assessment in Brassica napus L. BMC Genomics 14, 120 (2013).
Wang, J. et al. Integration of linkage maps for the Amphidiploid Brassica napus and comparative mapping with Arabidopsis and Brassica rapa. BMC Genomics 12, 101 (2011).
Long, Y. et al. Epigenetic QTL mapping in Brassica napus. Genetics 189, 1093–1102 (2011).
Raman, H. et al. A consensus map of rapeseed (Brassica napus L.) based on diversity array technology markers: applications in genetic dissection of qualitative and quantitative traits. BMC Genomics 14, 277 (2013).
Parkin, I. A. et al. Segmental structure of the Brassica napus genome based on comparative analysis with Arabidopsis thaliana. Genetics 171, 765–781 (2005).
Song, X., Sun, X. & Zhang, T. Segregation distrotion and its effect on genetic mapping in plants. Chin. Agric. Biotechnol. 14, 286–292 (2006).
Pires, J. C. et al. Flowering time divergence and genomic rearrangements in resynthesized Brassica polyploids (Brassicaceae). Biol. J. Linn. Soc. 82, 675–688 (2004).
Lukens, L. N. et al. Genome redundancy and plasticity within ancient and recent Brassica crop species. Biol. J. Linn. Soc. 82, 665–674 (2004).
Gaeta, R. T., Pires, J. C., Iniguez-Luy, F., Leon, E. & Osborn, T. C. Genomic changes in resynthesized Brassica napus and their effect on gene expression and phenotype. Plant Cell 19, 3403–3417 (2007).
Szadkowski, E. et al. Polyploid formation pathways have an impact on genetic rearrangements in resynthesized Brassica napus. New Phytol. 191, 884–894 (2011).
Faraji, A. Flower formation and pod/flower ratio in canola (Brassica napus L.) affected by assimilates supply around flowering. Int. J. Plant Prod. 4, 271–280 (2010).
Bouttier, C. & Morgan, D. G. Ovule development and determination of seed number per pod in oilseed rape (Brassica napus L.). J. Exp. Bot. 43, 709–714 (1992).
Pechan, P. M. Ovule fertilization and seed number per pod determination in oilseed rape (Brassica napus). Ann. Bot. 61, 201–207 (1988).
Wang, Y. & Li, J. Molecular basis of plant architecture. Annu. Rev. Plant Biol. 59, 253–279 (2008).
Xing, Y. & Zhang, Q. Genetic and molecular bases of rice yield. Annu. Rev. Plant Biol. 61, 421–442 (2010).
Yang, Q., Zhang, D. & Xu, M. A sequential quantitative trait locus fine-mapping strategy using recombinant-derived progeny. J. Integr. Plant. Biol. 54, 228–237 (2012).
Wang, H. et al. Biological functions of HD-Zip transcription factors. Hereditas 35, 1179–1188 (2013).
Gibson, G. Rare and common variants: twenty arguments. Nat. Rev. Genet. 13, 135–145 (2012).
Obeso, J. R. The costs of reproduction in plants. New Phytol. 155, 321–348 (2002).
Zhang, F. et al. Yield and agronomic traits of winter rapeseed cultivars registered in China from 2001 to 2010. Chin. J. Oil Crop Sci. 34, 239–244 (2012).
Cao, Z. et al. Analysis of QTLs for erucic acid and oil content in seeds on A8 chromosome and the linkage drag between the alleles for the two traits in Brassica napus. J. Genet. Genomics 37, 231–240 (2010).
Wang, W. et al. Construction and analysis of a high-density genetic linkage map in cabbage (Brassica oleracea L. var. capitata). BMC Genomics 13, 523 (2012).
Kaur, S. et al. Genetic map construction and QTL mapping of resistance to blackleg (Leptosphaeria maculans) disease in Australian canola (Brassica napus L.) cultivars. Theor. Appl. Genet. 120, 71–83 (2009).
Xu, J. et al. Construction of an integrated genetic linkage map for the A genome of Brassica napus using SSR markers derived from sequenced BACs in B. rapa. BMC Genomics 11, 594 (2010).
Cheng, X. et al. Development and genetic mapping of microsatellite markers from genome survey sequences in Brassica napus. Theor. Appl. Genet. 118, 1121–1131 (2009).
Li, H. et al. Development and genetic mapping of microsatellite markers from whole genome shotgun sequences in Brassica oleracea. Mol. Breeding 28, 585–596 (2011).
Zeng, Z. B. Precesion mapping of quantitative trait loci. Genetics 136, 1457–1468 (1994).
Churchill, G. A. & Doerge, R. W. Empirical threshold values for quantitative trait mapping. Genetics 138, 963–971 (1994).
Long, Y. et al. Flowering time quantitative trait loci analysis of oilseed brassica in multiple environments and genomewide alignment with Arabidopsis. Genetics 177, 2433–2444 (2007).
Goffinet, B. & Gerber, S. Quantitative trait loci: a meta-analysis. Genetics 155, 463–473 (2000).
Bradbury, P. J. et al. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635 (2007).
Mather, K. A. et al. The extent of linkage disequilibrium in rice (Oryza sativa L.). Genetics 177, 2223–2232 (2007).
Hubisz, M. J., Falush, D., Stephens, M. & Pritchard, J. K. Inferring weak population structure with the assistance of sample group information. Mol. Ecol. Resour. 9, 1322–1332 (2009).
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
Hardy, O. J. & Vekemans, X. SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol. Ecol. Notes 2, 618–620 (2002).
Wagner, G. P. & Zhang, J. The pleiotropic structure of the genotype-phenotype map: the evolvability of complex organisms. Nat. Rev. Genet. 12, 204–213 (2011).
Zhu, J. Analysis of conditional genetic effects and variance components in developmental genetics. Genetics 141, 1633–1639 (1995).
Kong, F. L. Quantitative genetics in plants. (China Agricultural University Press, Beijing, 2006).
Acknowledgements
This research was supported by the National Basic Research and Development Program (2015CB150202 and 2011CB109305), Natural Science Foundation (31101181), Rapeseed Industry Technology System (CARS-13), Agricultural Science and Technology Innovation Project (CAAS-ASTIP-2013-OCRI), Hubei Agricultural Science and Technology Innovation Center and Core Research Budget of the Non-profit Governmental Research Institution (1610172014001) of China.
Author information
Authors and Affiliations
Contributions
J.S. and H.W. designed the study. J.S. and J.Z. performed the marker analysis in the F2 population. J.S. and Y.Y. performed the marker analysis in the association population. J.S. and J.Y. performed the regional association analysis of the target major QTL region. S.H. and R.L. performed the SSR/InDel marker development. X.W. and G.L. performed the field experiments and traits investigation. J.S. analysed all the data and wrote the manuscript. All authors have read and approved the final manuscript.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Electronic supplementary material
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Shi, J., Zhan, J., Yang, Y. et al. Linkage and regional association analysis reveal two new tightly-linked major-QTLs for pod number and seed number per pod in rapeseed (Brassica napus L.). Sci Rep 5, 14481 (2015). https://doi.org/10.1038/srep14481
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep14481
- Springer Nature Limited
This article is cited by
-
Meta-analysis of seed weight QTLome using a consensus and highly dense genetic map in Brassica napus L.
Theoretical and Applied Genetics (2023)
-
Linkage and association mapping of ovule number per ovary (ON) in oilseed rape (Brassica napus L.)
Molecular Breeding (2023)
-
Genetic analysis and QTL mapping for silique density in rapeseed (Brassica napus L.)
Theoretical and Applied Genetics (2023)
-
Genomic selection and genetic architecture of agronomic traits during modern rapeseed breeding
Nature Genetics (2022)
-
Evaluation of the effect of polyethylene glycol (PEG) on germination and morphological characteristics of bread wheat
Cereal Research Communications (2022)