Multi-trait multi-environment quantitative trait loci mapping for a sugarcane commercial cross provides insights on the inheritance of important traits

Margarido, G. R. A.; Pastina, M. M.; Souza, A. P.; Garcia, A. A. F.

doi:10.1007/s11032-015-0366-6

Multi-trait multi-environment quantitative trait loci mapping for a sugarcane commercial cross provides insights on the inheritance of important traits

Open access
Published: 09 August 2015

Volume 35, article number 175, (2015)
Cite this article

Download PDF

You have full access to this open access article

Molecular Breeding Aims and scope Submit manuscript

Multi-trait multi-environment quantitative trait loci mapping for a sugarcane commercial cross provides insights on the inheritance of important traits

Download PDF

G. R. A. Margarido¹,
M. M. Pastina²,
A. P. Souza³ &
…
A. A. F. Garcia ORCID: orcid.org/0000-0003-0634-3277¹

2419 Accesses
16 Citations
Explore all metrics

Abstract

Breeding trials typically consist of phenotypic observations for various traits evaluated in multiple environments. For sugarcane in particular, repeated measures are obtained for plant crop and one or more ratoons, such that joint analysis through mixed models for modeling heterogeneous genetic (co)variances between traits, locations and harvests is appropriate. This modeling approach also enables us to include molecular marker information, aiding in understanding the genetic architecture of quantitative traits. Our work aims at detecting QTL and QTL by environment interactions by fitting mixed models with multiple QTLs, with appropriate modeling of multi-trait multi-environment data for outcrossing species. We evaluated 100 individuals from a biparental cross at two locations and three years for fiber content, sugar content (POL) and tonnes of cane per hectare (TCH). We detected 13 QTLs exhibiting QTL by location, QTL by harvest or the three-way interaction. Overall, 11 of the 13 effects presented some degree of pleiotropy, affecting at least two traits. Furthermore, these QTLs always affected fiber and TCH in the same direction, whereas POL was affected in the opposite way. There was no evidence in favor of the linked QTL over the pleiotropic QTL hypothesis for any detected genome position. These results provide valuable insights into the genetic basis of quantitative variation in sugarcane and the genetic relation between traits.

Marker-trait association study for sucrose and yield contributing traits in sugarcane (Saccharum spp. hybrid)

Article 15 March 2015

Association Mapping for Sugarcane Quality Traits at Three Harvest Times

Article 26 September 2021

QTL analysis in multiple sorghum mapping populations facilitates dissection of the genetic control of agronomic and yield-related traits in sorghum [Sorghum bicolor (Moench)]

Article 08 February 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Plant breeding is an essential activity to agriculture, affording short-term new elite cultivars and long-term potential for increased yields and response to adversities (Moose and Mumm 2008). To select both broadly stable genotypes and those adapted to specific environmental conditions, it is paramount to have information about genotype by environment interaction (G × E) and, particularly for marker-assisted selection (MAS), QTL × E interaction (Eeuwijk et al. 2001; Smith et al. 2001; Verbyla et al. 2003; Boer et al. 2007). Likewise, knowledge of the genetic correlations between traits can provide clues as to the possibility of breaking undesirable correlations between agronomically important traits, therefore playing an important role when designing breeding strategies (Jiang and Zeng 1995; Welham et al. 2010).

To achieve such goals, evaluation trials are generally conducted in several locations, ideally with contrasting environmental features, and throughout many (consecutive) years. For species that can be vegetatively propagated, such as sugarcane, repeated measures can be obtained for clones kept in the field, for both plant crop and ratoons (Smith et al. 2007). The ability to evaluate the same genotypes in multiple sites and years makes such data naturally suited for joint analysis. In particular, appropriate modeling of genetic and residual (co)variance matrices alleviates the need to make unrealistic assumptions about the distribution of errors and affords unbiased estimates of model effects, such as QTL effects (Balzarini 2001). In this context of modeling G × E interaction and correlation between traits, mixed models are a natural statistical approach to use (Smith et al. 2005).

It can be observed from the breeding field trial evaluation literature that few studies jointly analyze multi-trait multi-environment (MTME) data (Malosetti et al. 2008). The most likely reason for this is the difficulty in analyzing the data, particularly in interpreting the results, as well as the lack of adequate and easily accessible statistical methodology and software for such goals. Traditionally, data of this type have been analyzed by fitting separate ordinary or somehow naïve models for each trait and environment combination, multi-environment models for each trait or multi-trait models individually for each environment (Sun et al. 2012; Bonneau et al. 2013; Freeman et al. 2013; Lopes et al. 2013; Liu et al. 2014), followed by informal comparisons, generally graphical, or ad hoc meta-analysis to derive conclusions about G × E and/or correlation between traits (Piepho 2000). The situation is particularly evident for QTL studies: only a handful of examples utilize MTME modeling for QTL mapping, even for well-studied species for which inbred-derived populations are available (Malosetti et al. 2006, 2008; Singh et al. 2012; Alimi et al. 2013).

Furthermore, another important aspect is that most QTL mapping studies disregard the occurrence of epistasis. This phenomenon has occasionally been overlooked, considered rare and less important than other genetic effects, but collective evidence attests its prevalence and importance (Garcia et al. 2008; Phillips 2008; Manolio et al. 2009; Eichler et al. 2010). QTL mapping can benefit from the inclusion of epistatic interactions directly in the search process, such as that by the multiple interval mapping (MIM) model (Kao and Zeng 1997; Kao et al. 1999). This practice increases statistical power for QTL detection, removes biases from QTL effect and position estimates and yields breeding values that can be directly leveraged by breeding programs through MAS (Zeng et al. 1999; Collard and Mackill 2008). The MIM method can be interpreted as a model (variable) selection procedure and, as such, can be readily incorporated into the mixed model context through a least-squares approximation (Haley and Knott 1992; Broman and Sen 2009).

The situation for sugarcane is not unlike that encountered for most diploid species, albeit with a few extra obstacles. Most studies make use of two linkage maps, one for each parent, constructed based on markers segregating on a 1:1 fashion. QTL mapping has been conducted through single-marker, Interval Mapping (IM) (Lander and Botstein 1989) or Composite Interval Mapping (CIM) analyses (Zeng 1993, 1994), with methodologies devised for backcross progenies, and only for single trait–environment combinations (Jordan et al. 2004; Wei et al. 2006; Raboin et al. 2008; Pinto et al. 2010; Costet et al. 2012; Nibouche et al. 2012; Singh et al. 2013). A few exceptions exist, with some studies making use of markers with a 3:1 segregation pattern or higher dosages (Aitken et al. 2008; Piperidis et al. 2008). To our knowledge, Pastina et al. (2012) described the most realistic model for QTL mapping in sugarcane to date, based on mixed models for multi-environment data. However, these authors did not include multitrait data.

In this paper, we present a QTL mapping analysis of a sugarcane segregating progeny, evaluated over multiple locations and years. Our goals were to appropriately model the MTME structure of the observations, to extend the multi-trait MIM model, in a mixed model context, to outcrossing species, i.e., progenies derived from non-inbred heterozygous parents, and to study the genetic architecture of important agricultural traits related to bioethanol and sucrose production.

Materials and methods

Plant material and genetic linkage map

The genetic markers utilized in this work have been previously described by Garcia et al. (2006) and Oliveira et al. (2007), and the field trial data analyzed herein have been previously addressed by Pastina et al. (2012). Briefly, we evaluated a progeny of 100 individuals obtained by a cross between Brazilian pre-commercial cultivars SP80-180 and SP80-4966 in field trials conducted in two locations (Piracicaba and Jaú, State of São Paulo, Brazil), for three consecutive harvest years (2003 through 2005). Measured traits included cane yield in Mg ${\rm ha}^{-1}$ (tonnes of cane per hectare, or TCH), sugar yield in Mg ${\rm ha}^{-1}$ (tonnes of sugar per hectare, or TSH), percent sucrose content (POL) and percent fiber content. We removed trait TSH from the analysis because of its extremely high correlation with TCH, which caused numerical problems during model fitting (data not shown). The experimental design consisted of an augmented randomized complete block design with two replicates. We separated genotypes into three groups and included four commercial checks in each of them.

These data naturally lend themselves to an MTME-based analysis because we evaluated multiple traits in multiple environments (site × harvest combinations), thus resulting in genetic and environmental correlations between traits and between environments. To have models based on more realistic assumptions, all these correlations need to be considered.

Genotypic data were available and consisted of restriction fragment length polymorphism and simple sequence repeat single-dose markers (SDMs, i.e., markers present in at most one copy) coded as dominant markers, such that only markers with 1:1 and 3:1 segregation patterns were present (Wu et al. 1992). The first situation arises when the SDM is present in a single parent, and the latter exists when the marker band is present in both parents. We applied a Chi-square test to each marker to test for segregation distortion and discarded strongly deviating markers after Bonferroni correction at a genomewise error rate of 0.05. A multipoint linkage map had previously been obtained for this population through OneMap (Wu et al. 2002; Margarido et al. 2007), with a total of 317 markers distributed over 96 linkage groups (LGs), jointly covering 2468.14 centiMorgans (cM) with the Kosambi mapping function (Kosambi 1944). This map is presented as supplementary material in Pastina et al. (2012). The number of markers per LG varied between 2 and 14. Based on shared loci, 91 out of the 96 LGs had been assembled into 11 putative homology groups (HGs), each containing from 2 to 23 LGs (Pastina et al. 2012). The other 424 markers did not map to any linkage group, and we included them in the analysis as single markers.

For notation purposes, following the convention from Wu et al. (2002), we denoted SDM loci segregating exclusively for parent SP80-180 (SP80-4966) by type $D_1 (D_2)$ and markers informative for both parents by type C. Note that some LGs had a mixture of all marker types, whereas others were composed solely of $D_1$ (or $D_2$) markers.

QTL model for noninbred populations

In a similar manner to Pastina et al. (2012) and Gazaffi et al. (2014), we consider two diploid non-inbred individuals, denoted as P and Q. For a genetic marker m, the two alleles of individual P can be denoted as $P^{1}_m$ and $P_m^{2}$, with a similar definition for both alleles of individual Q. Figure 1 shows a cross between two such individuals, for two adjacent markers m and $m+1$, and an intervening QTL with alleles $P^{1}$ & $P^{2}$ and $Q^{1}$ & $Q^{2}$ (Lin et al. 2003). Despite sugarcane being a polyploid species, in practice, we only consider two alleles because the molecular markers we used are all presence/absence dominant markers.

At any given marker or QTL locus, a diploid organism can hold up to four different alleles, which allows for a maximum of three orthogonal contrasts to be defined. For QTL alleles $P^{1}$ & $P^{2}$ and $Q^{1}$ & $Q^{2}$, the following three orthogonal contrasts have a clear genetic interpretation:

$$\begin{aligned} c_{P}= & + P^1Q^1 + P^1Q^2 - P^2Q^1 - P^2Q^2 \\ c_{Q}= & + P^1Q^1 - P^1Q^2 + P^2Q^1 - P^2Q^2 \\ c_{PQ}= & + P^1Q^1 - P^1Q^2 - P^2Q^1 + P^2Q^2 \end{aligned}$$

The first contrast compares $P^{1}$ and $P^{2}$, i.e., it corresponds to the additive effect between both alleles from individual P. Similarly, the second contrast refers to the additive effect between alleles from individual Q. Finally, the third contrast tests deviations from additivity, or dominance.

Given a linkage map and a segregating progeny, conditional QTL genotype probabilities can be estimated and a mixture model devised to fit these contrasts to phenotypic data, allowing the estimation and testing of additive and dominant effects (Lander and Botstein 1989; Zeng et al. 1999). This method, however, is computationally expensive and can make generalizations to multiple QTLs and MTME data impracticable.

A commonly used approximation, initially proposed by Haley and Knott (1992), defines a linear regression model with the mathematical expectations of the effects as covariates, which makes model fitting possible via the usual least-squares estimator, available as part of many statistical packages. Both theoretical and empirical works show that such approximation generally performs almost as well as the exact model (Broman 2001). In the mixed model context, these mathematical expectations are named genetic predictors.

In the eventual scenario where a linkage group does not contain sufficient marker information for the four QTL genotypes to be identifiable (e.g., having only $D_1$-type markers), collinearity issues arise between the three contrasts such that only one or two of them can be fitted, depending on the situation (Belsley et al. 1980). As an example, consider the case where a linkage group presents markers exclusively of type $D_{1}$: This prevents recombination on individual Q to be identified, so only the additive effect between $P^{1}$ and $P^{2}$ can be estimated. Our model took these into consideration by only fitting effects without collinearity.

Phenotypic model fitting

According to the experimental design implemented in the field, we fitted the following model to the phenotypic data:

$$\underline{y}_{istjkr} = \mu _{stjkr} + \underline{G}_{istjk} + \underline{\varepsilon }_{istjkr}$$

where $\underline{y}_{istjkr}$ is the phenotype of the $r{\rm th}$ replicate (block) of $i{\rm th}$ genotype in group s, for trait t in site j and harvest k; $\mu _{stjkr}$ is the mean of block r within group s, for trait t in site j and harvest k; $\underline{G}_{istjk}$ is the genetic effect of genotype i of group s, for trait t in site j and harvest k; and $\underline{\varepsilon }_{istjkr}$ is the non-genetic effect. Underlined terms represent random effects. This model will henceforth be called “genetic effects model”.

According to their origin, we separated genotypes into two groups, $n = n_g + n_c$, where $n_g$ is the number of genotypes in the progeny (clones) ($i = 1, \ldots , n_g$) and $n_c$ is the number of checks ($i = n_g+1, \ldots , n_g+n_c$). The model for $\underline{G}_{istjk}$ is given by:

$$\underline{G}_{istjk} = \left\{ \begin{array} {lll} \underline{g}_{ite} & \quad i=1, \ldots , n_g\\ c_{istjk} & \quad i=n_g+1, \ldots , n_g+n_c \end{array} \right.$$

where $\underline{g}_{ite}$ is a random variable for the genetic effect of genotype i for trait t in environment e and $c_{istjk}$ represents a fixed effect for check i in group s for trait t in site j and harvest k. Note that, for the random effect $\underline{g}_{ite}$, we combined site and harvest into an environment variable due to software limitations.

Main fixed effects of blocks, groups, traits, sites and harvests, as well as their possible interactions, were not of direct interest; thus, they were all included in the term $\mu _{stjkr}$ rather being modeled individually, simply to control for their presence. Consequently, we could obtain unbiased estimates for the effects of interest, particularly $\underline{G}_{istjk}$ (Verbyla et al. 2003; Boer et al. 2007).

The vector of genetic effects ${{\underline{\varvec{{g}}}}} = (g_{1111}, \ldots , g_{ITJK})$ follows a multivariate normal distribution, ${{\underline{\varvec{{g}}}}} \sim N(\mathbf 0 , {\varvec{G}} \otimes {\varvec{I}}_{n_{g}})$, where $\otimes$ represents the Kronecker (direct) product between two matrices and ${\varvec{I}}_{n_{g}}$ is an identity matrix of size $n_{g}$. Note that the effect of group s is not included in this random variable. As a first step in the model fitting process, we examined several different structures for matrix ${\varvec{G}}$ (Table 1). Two main classes of models can be distinguished: models 1 through 4 combine traits and environments factorially into different “traits”, in a broader sense of the term, and fit a single matrix ${\varvec{G}}$ to these new “traits”. Models 5 through 10, on the other hand, fit two individual component matrices, denoted ${\varvec{G}}^{{\rm trait}}$ and ${\varvec{G}}^{{\rm env}}$, respectively, for traits and environments. For the latter models, it follows that ${\varvec{G}} = {\varvec{G}}^{{\rm trait}} \otimes {\varvec{G}}^{{\rm env}}$ (Smith et al. 2007). Due to software limitations, we could decompose the (co)variance matrix into only two component matrices, which required the combination of sites and harvests into environments and forcefully prevented a more refined model to be fitted. This and all subsequent steps were performed in Genstat version 16 (VSN International 2013).

In Table 1, model 1 corresponds to a diagonal matrix, i.e., a model in which a different genetic variance is assigned to each “trait”, with all correlations being equal to zero. Model 2 also fits a different variance for each class, but includes a uniform genetic correlation. The first-order factor analytic model 3 is a multiplicative model that allows heterogeneity in both variances and covariances, i.e., approximates a fully unstructured model, while using a smaller number of parameters (Piepho 2000; Eeuwijk et al. 2001). Finally, the unstructured model 4 fits an individual (co)variance term for each trait–environment combination. Models 5 through 10 use these same structures, separately for each component matrix, in several distinct combinations. Note that all these models allow heterogeneous variances for the various traits. Additionally, the genetic effects model includes a different mean for each trait. This modeling strategy takes into account the fact that phenotypic traits are measured in different scales (i.e., data do not need to be standardized). We compared all models based on their AIC (Akaike Information Criterion) and BIC (Bayesian Information Criterion) values, where a smaller value corresponds to a better model (Burnham and Anderson 2004). Notably, we also evaluated other component matrix combinations (such as a ${\rm DIAG}\otimes {\rm DIAG}$, for example), but these resulted in poor fits to the data and are thus not shown.

Table 1 Genetic (co)variance matrix (${\varvec{G}}$): evaluated models

Full size table

Taking into consideration the fact that the three studied variables are measured in different scales and hence have different orders of magnitude for their corresponding (co)variances, we also examined adequate models for the non-genetic residual error term $\underline{\varepsilon }_{istjkr}$. We accomplished this by testing several (co)variance structures for its associated matrix ${\varvec{R}}$, in a manner similar to that described for ${\varvec{G}}$. In particular, we evaluated models with a single matrix for factorial combinations of traits, sites and harvests, as well as models with three component matrices (trait $\otimes$ site $\otimes$ harvest), which was in this case allowed by the software. In the first group of models, we compared the fit of a diagonal model with different variances, compound symmetry with heterogeneous variances, first-order factor analytic and the fully unstructured model. In the latter group, we compared twenty-four distinct combinations of appropriate component matrices, in this case including an auto-regressive model for harvest years. We employed the same criteria for model comparison as for the ${\varvec{G}}$ matrix, namely AIC and BIC.

QTL model

After selecting the best-fit model for the experimental design, we included genotypic information for the QTL searching process. Adapting the multiple interval mapping (MIM) methodology described by Kao and Zeng (1997) and Kao et al. (1999) to a mixed model framework, i.e., using a least-squares approximation rather than the usual mixture model approach, the QTL mapping model is expressed by:

$$\begin{aligned} \underline{y}_{istjkr}= & \,\mu _{stjkr} + \sum _{w=1}^{m} \left( x_{p_{iw}} \alpha _{p_{tjkw}} + x_{q_{iw}} \alpha _{q_{tjkw}} + x_{pq_{iw}} \delta _{pq_{tjkw}} \right) \\&+ \underline{G}^{*}_{istjk} + \underline{\varepsilon }_{istjkr} \end{aligned}$$

where $\alpha _{p_{tjkw}}, \alpha _{q_{tjkw}}$ and $\delta _{pq_{tjkw}}$ are the additive genetic predictor effects for parents P and Q and the dominance genetic predictor effect, respectively, specific for each trait × site × harvest combination, for the QTL in genomic position w. The term $\underline{G}^{*}_{istjk}$ refers to the residual genetic variation, not explained by QTL, thus marked with an asterisk to be differentiated from the genetic term previously described for the genetics effect model. The (co)variance matrix used for $\underline{G}^{*}_{istjk}$ was the one selected for the model without genotypic data. Henceforth, we refer to the MIM strategy more broadly as a model searching scheme.

The significance of (fixed) QTL effects was tested through the Wald test, with the null hypothesis $H_0$ defined by:

$$H_0:\left\{ \begin{array} {lllllllll} \alpha _{p_{111w}}&=& \alpha _{p_{112w}} & = & \ldots & = & \alpha _{p_{TJKw}}&=& 0 \\ \alpha _{q_{111w}}&=& \alpha _{q_{112w}} & = & \ldots & = & \alpha _{q_{TJKw}}&=& 0 \\ \delta _{{pq}_{111w}}&=& \delta _{{pq}_{112w}} & = & \ldots & = & \delta _{{pq}_{TJKw}}&=& 0 \end{array} \right.$$

The above hypothesis tests for the presence of at least one effect different from zero, i.e., if the locus at hand affects the expression of at least one trait, in at least one site and harvest combination.

QTL search

As originally proposed, the MIM methodology allows for several model searching strategies (Kao and Zeng 1997; Kao et al. 1999; Zeng et al. 1999). In the present study, we opted for sequential forward searches, with intervening significance checks and refining steps.

In detail, starting from the genetic effects model, we sequentially conducted one-dimensional searches for QTL and kept positions with significant effects in the model. During these searches, we jointly tested for QTL and QTL × E interaction effects and later tested for QTL × E alone (see section on “Refining steps”). To correct for multiple testing, we employed a p value cutoff of 0.001. We initially conducted searches for pseudo-markers positioned every 1 cM on all linkage groups and subsequently for single markers. For each genomic position, we calculated the condition indexes of the genetic predictors matrix and removed non-informative contrasts, i.e., contrasts for which the condition index was greater than an empirically chosen threshold of 3.5 (Belsley et al. 1980).

We iteratively repeated the above scheme through searches for linkage groups and unlinked markers until no more significant effects could be detected. Inclusion of each new effect in the model explains away part of the phenotypic variance, thus decreasing residual variation and increasing statistical power for detecting QTL effects (Zeng 1993, 1994).

Refining steps

Having performed the aforementioned forward searches, we turned to some final fine-tuning steps. The first consisted of dropping one term from the model at a time and discarding effects with p value >0.05. This is important because the QTL found at the initial search rounds may no longer be significant after the inclusion of other QTLs in the model.

Next, we tested for QTL × site, QTL × harvest and QTL × site × harvest interactions, individually for each QTL. We excluded non-significant interaction terms from the model, provided that there were no significant higher-order interaction effects involving the term at hand.

Following the MIM strategy, QTL positions were refined by first constructing a p value profile for each QTL, followed by choosing the most likely position, i.e., the one resulting in the smallest p value. We conducted this step iteratively until no further QTL had its position altered.

It is not formally possible to test for the competing hypotheses of pleiotropy versus linkage in the mixed model context when QTLs are modeled as fixed effects, for two reasons. First, because the two models are not nested, a p value evaluation is not feasible. Second, the AIC and BIC criteria should only be used to compare models with different random terms or structures, with a common fixed part, which is not the case for the current scenario. Hence, we took an ad hoc approach to removing any given QTL from the model and adding an effect for each trait separately, for a window of adjacent positions. We then compared the profiles for each trait, checking whether peaks were found at the same genomic position.

Finally, we used the final model to estimate QTL effects for each trait, in each site and harvest year. We then used the corresponding standard deviations to test the significance of each effect. Individual effects were deemed significant when $\left| {\rm effect}\right| \ge 2 \times {\rm standard \ \ deviation}$, as proposed by Malosetti et al. (2008).

Results

Genetic effects model

Comparison of the examined (co)variance structures for matrix ${\varvec{G}}$ made it evident that models with an unstructured matrix for environments resulted in better fits, in general, according to the BIC criterion (models 8 through 10 in Table 2). The AIC selection criterion suggested that the best model for ${\varvec{G}}$ was the fully unstructured model for trait–environment combinations (model 4), which contained a total of 171 parameters. On the other hand, BIC selected the model combining an unstructured matrix ${\varvec{G}}^{{\rm trait}}$ for traits and a separate unstructured matrix ${\varvec{G}}^{{\rm env}}$ for environments (model 10). The latter model seems to be a good balance between parsimony and goodness of fit, as it allows for a fairly complex (co)variance structure without requiring a great number of parameters. Indeed, because for this dataset BIC imposes a heavier penalty on the number of parameters, simpler models are expected. Given that our ultimate goal was to map QTLs, we decided to use the model selected by BIC because a simpler model results in smaller runtime for fitting and is thus more amenable to QTL mapping, which is particularly important in the MIM context.

Table 2 Models for the genetic (co)variance matrix (M = T × E, where T = 3 is the number of traits and E = 6 is the number of environments) and corresponding AIC and BIC values

Full size table

When we tested various different (co)variance structures for the ${\varvec{R}}$ matrix, the best model was a first-order factor analytic for trait–site–harvest combinations, according to the BIC (data not shown). However, this model was excessively slow to fit, such that performing several rounds of QTL searching proved to be infeasible. Hence, for all further analyses, we used a diagonal model, with 18 different variances (one for each trait × site × harvest combination) and no covariances. For comparison, we ran a few initial QTL searches with both models and obtained vastly similar results, showing that the simpler model did not have a detrimental effect on QTL mapping.

QTL mapping

All detected QTLs are summarized in Fig. 2. Significant effects are displayed separately for each trait, site and harvest, with positive effects highlighted in green, negative effects in red and non-significant effects represented by the symbol 0. For each trait, the presence of two lines indicates that effects are different for both locations, and three columns likewise indicate different effects across harvest years. A single line (column) represents a QTL that does not interact with sites (harvests). Note that QTL numbering follows the order in which they were detected throughout the analysis, not the position along the genome as informed by the numbering of linkage groups.

We detected a total of 13 QTLs. Out of these 13 significant positions, eight were detected on a linkage group, while five were found on single, unmapped markers (Online Resource 1).

Interestingly, all QTLs interacted with sites and/or harvest years; hence, no effects were consistent across all environmental conditions evaluated (Online Resource 1). From the total of 13 terms, three interacted only with sites, specifically QTL V (marker ESTB45m6D1), XI (LG10) and XII (marker ESTB64m3C). On the other hand, QTL IV (LG72) exhibited only QTL × harvest interaction. QTL VIII (LG7) interacted both with sites and with harvests, but the three-way interaction was not significant. Finally, the remaining eight effects showed a significant three-way interaction, i.e., displayed an oscillating effect across site × harvest combinations (Online Resource 1). These results also make it apparent that very few detected effects were expressed in a more stable manner, as we observed many effects for only a single site and harvest. Nonetheless, it is interesting to observe that we did not observe any crossover interaction, as the effects for any given trait, when significant, were always consistently positive or negative across sites and harvests.

It is also important to emphasize that we determined individual significances according to the criterion $\left| {\rm effect}\right| \ge 2 \times {\rm standard \ \ deviation}$, as indicated by a colored background in Fig. 2, while we jointly tested interactions with environments through the Wald test, by dropping the appropriate effect from the model and checking the p value, as represented by the presence or absence of multiple effect lines or columns in those figures. This is the reason why Fig. 2 only displays effects for some of the traits, indicating that effects for the other traits were not individually significant.

Figure 2 also shows that most QTLs had pleiotropic effects, as eleven of the 13 QTLs simultaneously affected at least two traits. Of even more interest is the observation that fiber and TCH were always affected in the same direction, while sugar content was influenced in the opposite direction. Although the expression pattern of each QTL was dissimilar from the others with regard to expression in various environments, they affected the three traits in directions that agreed with phenotypic correlations, i.e., a moderate positive correlation of 0.3907 ($p < 0.0001$) between TCH and fiber and almost no correlation between sugar content and the other variables (correlation of 0.0659 between POL and TCH, with $p = 0.0232$, and a correlation of $-0.0053$ between POL and fiber, with $p = 0.8553$).

For QTLs detected on linkage groups, profiles of the $-{\rm log}_{10}(p{\rm -value})$ statistic are shown in Fig. 3, both for the joint analysis of the three traits and for separate analysis of each individual trait. Visual inspection of these profiles did not provide evidence in favor of the linked QTL hypothesis over pleiotropy, for any of the detected QTLs. For some QTLs, e.g., QTL XI on LG10, peaks for all traits were found to be very close to each other. For others, such as QTL III on LG49, a significant peak was observed exclusively for one of the traits. Finally, as observed for QTL I on LG92, the linkage group was too small to allow a clear distinction of peaks.

Discussion

Breeding programs typically leverage data collected for many traits, in multiple locations and along several years. Consequently, genetic and residual (co)variances are expected to be different across traits and environments, which in turn makes this type of data particularly suited for mixed model analysis. Proper modeling of such features decreases type I error probability, increasing the reliability of results (Piepho 2005). In this work, the selected genetic effects model explored the product of two unstructured matrices, separately for traits and environments.

Malosetti et al. (2008), working with a maize MTME dataset, reported that using a direct product of component matrices resulted in a good fit to the data, while considerably reducing the number of parameters. On the other hand, Malosetti et al. (2006, 2008) argue that unstructured models can be problematic when the number of traits or environments gets moderately large, due to numerical issues when fitting the model. Notwithstanding, the number of parameters to be estimated was relatively small for a study of this magnitude, with three traits and six environments, such that QTL searches were feasible even with these more complex models. This strategy reduced the number of parameters from 171 to 26 (Table 2), improving the fit of the genetic effects model and facilitating subsequent steps. Eeuwijk et al. (2001) state that unstructured models can be adequate when there are differences in the gene pools responsible for performance at each environment.

Deciding whether each effect in a model should be fixed or random depends on the nature of observations, objectives of the work and even preferences of each researcher (Boer et al. 2007). Because the 100 genotypes herein used for QTL mapping were sampled from a segregating progeny, we had no specific interest in any of them, but rather in the estimation of genetic variance as a whole. For this reason, we treated genotypes as random effects. We also saw no particular appeal in studying the average performance of these genotypes across environments, such that we included sites and harvests as fixed effects. Literature on QTL mapping with mixed models shows that different authors take QTLs as fixed or random according to their goals (Piepho 2000; Verbyla et al. 2003; Boer et al. 2007). Molecular markers are included as random effects usually when the goal is simply to control for residual genetic variation, as done by Wang et al. (1999), or to model QTL × E interaction, as done by Piepho (2000) and Verbyla et al. (2003). Conversely, fixed genetic predictors represent positions of specific interest in the genome, whose effects are to be estimated, as done by Piepho (2000) and Boer et al. (2007), among others. We considered QTLs as fixed effects, which allowed for the estimation of specific QTL effects for each site and harvest, given that the latter were also included as fixed. Such an approach is more appropriate to study QTL × E interaction than the stability of QTL expression (Piepho 2000). Due to the absence of shrinkage of fixed effects, it is important to emphasize that QTL effect estimates are overestimated and excessively optimistic (Boer et al. 2007), not to mention the well-known overestimation of QTL effects when the progeny sample size is small (Beavis 1994).

There are two contrasting approaches to analyzing G × E interaction from multi-environment data: one-stage analysis uses individual plot data as input and fits a statistical model simultaneously to all environments, while under the two-stage scheme, separate models are initially fit to each environment, from which BLUEs (best linear unbiased estimates) are obtained for each genotype to compose a genotype × environment table of means. The second step then consists of using this table, weighted or not, to model the G × E interaction (Smith et al. 2005; Welham et al. 2010). The first approach yields maximum statistical power and eliminates biases in effect estimation, while the latter speeds up analyses and allows a much greater amount of data to be handled, at the cost of (potentially) biased results and reduced power, and hence should be seen as an approximation. The present work had 4032 available data points, a reasonably small number that enabled a single-stage analysis to be conducted.

Analysis for individual traits did not provide, in any linkage group, strong evidence in favor of the linked QTL hypothesis. Therefore, we kept positions unchanged and estimated final effects jointly for all traits. According to the $\left| {\rm effect}\right| \ge 2 \times {\rm standard \ deviation}$ criterion (Fig. 2), the 13 detected effects exhibited different pleiotropy patterns. Only four of them expressed some influence over the three traits, seven had effects over two traits, and two terms affected a single trait. Pastina et al. (2012), utilizing univariate mixed models for QTL mapping with the same data, found a significant effect for TCH on marker ESTB64m3C, which herein only influenced fiber (QTL XII). However, closer inspection of QTL effects reveals that this marker also marginally influenced TCH, particularly on the first site (city of Piracicaba) (Online Resource 1). This apparent discrepancy possibly indicates that the individual significance criterion may not adequately reflect the joint significance tested by the Wald procedure. We noticed consistency between results for QTL II on LG55, which influenced fiber in both works, and for QTL IV on linkage group LG72, which influenced TCH. Indeed, Fig. 3 shows that the most pronounced peak in this linkage group happened exactly for TCH. It is interesting to note that despite the relatively large number of QTLs found in each study, there was limited overlap between them. In reality, only the three aforementioned effects were detected in common. It is possible that the multivariate analysis misses QTLs with (moderate) effects on a single trait, while univariate mapping may fail to identify QTLs with modest effects in each of the traits. This makes it evident that the task of finding QTL is not trivial and still deserves further investigation. In particular, it is crucial to consider the particularities of the QTL mapping study at hand, such as progeny sample size, statistical model employed and search strategy, when deriving conclusions about the genetic architecture of any given trait.

In terms of the genetic correlation between traits, $84.6\,\%$ of the effects (11 out of 13) influenced at least two of the traits, that is, most QTLs were pleiotropic to some degree. Furthermore, it is remarkable that all QTL exhibited the same pattern of signs of effects. In other words, all pleiotropic QTL contributed in the direction of a positive genetic correlation between fiber and TCH, but negative between fiber and POL and between POL and TCH. Any deviations happened only for minor effects, which were not statistically significant. QTL effects followed the sign of phenotypic correlation between fiber and TCH. On the other hand, POL was phenotypically uncorrelated with fiber and only marginally correlated with TCH, which does not agree with the genetic correlations. Notwithstanding, these negative genetic correlations reflect what breeders usually observe in breeding practice, that is, sugar content is genetically negatively correlated with yield-related traits (Jackson 2005). This important information at least partially explains the correlation between these traits. If QTL contributing against the phenotypic correlation had been found, they would make it feasible for MAS to partially break the correlation, through the selection of genotypes with increased value for all traits. Evans (2002) proved theoretically that pleiotropic QTLs opposing the phenotypic correlation are more easily detected, since hypothesis testing has greater power in that situation. The fact that most QTLs herein described presented the same correlation pattern provides evidence that there really are no effects of a different nature in the evaluated progeny.

Some causes of the QTL × E interaction could be investigated. The detected QTL exhibited some type of interaction, which can have important implications for MAS. Specifically, it would not be feasible to select for markers associated with QTLs with the same effect in both sites and with unaltered effects across years, which might be linked to stably expressed genes with major effects. In fact, it would be beneficial to select specific markers for each location, but whose effects would potentially oscillate as a function of different conditions throughout the years. These results apparently contrast with those from Pastina et al. (2012), where fewer interactions with sites were detected. However, even though we found few QTLs with fairly consistent effects, it is interesting to note that we did not observe variations in the signs of effects, neither across years within a given site nor between sites for a given year. Thus, selection for any QTL would not negatively affect genotype performance in the other location. Moreover, we did observe many QTLs that showed significant interaction with sites and harvests, but with highly similar effects at the six environments for isolated traits (Online Resource 1). When this happened, however, effects for the other traits oscillated considerably, which probably caused the Wald test for interactions to be significant, as this procedure jointly tests all traits. Univariate analyses would likely flag such QTLs as not interacting with environments, for a subset of the traits, which may partly explain differences in the extent of identified QTL × E between both studies.

From a breeding perspective, these observations thus collectively indicate that there is potential for selection of QTLs with somewhat broader effects, but that most of the effort has to be focused on QTLs for specific locations. In any case, such narrow-effect QTLs would not have detrimental effects in other (similar) environments. The nature of effect inconsistencies we observed would not alter selection procedures, as we did not detect crossover interactions, but suggests that selection response could be strongly influenced by particular crop conditions, hence limiting the efficiency of MAS. Because the goal of most breeding programs is to simultaneously improve various agronomically important traits, the main advantage of employing multi-trait analyses lies in depicting the pleiotropy patterns of QTL, which point to the inability of MAS to break undesirable genetic correlations.

Pastina et al. (2012) described the use of mixed models for QTL mapping in outcrossing progenies based on multi-environment data, but restricted their analyses to a single trait and used an interval mapping (IM) approach (Lander and Botstein 1989) to identify putative QTLs, which were then fitted in a multiple-QTL model to test hypotheses and estimate QTL effects. More recently, Gazaffi et al. (2014) proposed a fixed effects model for QTL mapping in full-sib progenies that allows the segregation pattern of each QTL to be evaluated. Nonetheless, it is based on the CIM method and does not allow the analysis of multiple traits or environments. Our model aims to provide a complete framework for the QTL analysis of MTME data from full-sib progenies, with explicit modeling of genetic and residual (co)variances, more realistic model searching strategies and also the inclusion of epistatic interactions.

A notable advantage of the MIM method is that it may include epistatic terms directly in the QTL searching process (Kao and Zeng 1997; Kao et al. 1999). In the least-squares approximation context, genetic predictors for epistatic effects can be obtained by simply multiplying the appropriate genetic predictors, as they are orthogonal. Because of the restricted progeny size available in this study, we conducted tentative, exploratory searches for epistasis between the QTLs detected in our MTME dataset. We detected eight QTL × QTL interaction terms involving nine of the 13 QTLs. Interestingly, these epistatic interactions all displayed the same patterns of pleiotropy as the detected QTL. Additionally, as observed for QTL, epistatic terms extensively interacted with sites and/or harvests, such that breeding values would have to be calculated specifically for each environment, should this information be used for selection. Podlich et al. (2004) and Cooper et al. (2005) showed that epistatic interactions between QTL and the genetic background, in combination with QTL × E interaction, can be important for MAS.

Some authors have performed searches for epistatic effects through digenic approaches and analysis of variance or regression models (Hoarau et al. 2002; Aitken et al. 2006, 2008), but the lack of residual genetic variation control usually results in high false positive rates (Wang et al. 1999). The latter authors stated that just as the inclusion of cofactors in the CIM model successfully controls the influence of residual genetic variation in QTL mapping in the absence of epistasis, the inclusion of interacting markers linked to epistatic QTL increases power, accuracy and precision of QTL mapping. Our MIM approach allows these advantages to be achieved through the inclusion of epistasis as fixed or random effects in the model.

This is the only work, to the best of our knowledge, to make use of a multivariate mixed model for pleiotropic QTL searching with joint modeling of QTL × E interaction in sugarcane. Such a methodology has as its main advantages increased statistical power and reduced rates of false positives, which, collectively, make conclusions more reliable (Piepho 2005; Malosetti et al. 2008). Single-stage analysis with direct modeling of genetic (co)variances reduces biases and makes QTL effect estimates valid for MAS (Welham et al. 2010). Moreover, fitting multiple QTLs in a single joint model allows for breeding value estimates to be obtained, which can subsequently be leveraged by breeding programs. Even so, there is substantial room for improvement of the employed methodology, and extensive future efforts are still required. First, the mapping model hereby used was developed for diploid species, for which it is not possible for alleles to be present in multiple dosages, such that our conclusions are also approximations. It is necessary to develop molecular markers that allow precise estimates of the number of alleles to be obtained. To that end, there is ongoing work aiming at high-throughput SNP genotyping of polyploid species such as sugarcane (Serang et al. 2012; Garcia et al. 2013). Next, genetic mapping methodology for polyploids must be further developed to take these markers into account. Furthermore, QTL mapping models capable of estimating quantitative effects of multiple doses must be devised. Finally, although the integration of markers segregating 1:1 and 3:1 in a single map is an advancement in comparison with the two-way pseudo-testcross, we only used SDMs to construct the linkage map and discarded markers with larger copy numbers, thus reducing genome coverage. Lower marker saturation can be noted by the small number of markers per group, the small length of linkage groups and the fact that many linkage groups could not be integrated, as a result of the exclusive presence of $D_{1}$ or $D_{2}$ markers.

Nonetheless, despite the inherent limitations of the dataset, one-stage MTME analysis in sugarcane extracted unbiased information with high power from the data. Modeling of (co)variances through mixed models dismissed the need for unrealistic assumptions. Finally, the biological foundations of MIM culminated in easily interpretable results with potential application to breeding programs, especially through the enhanced understanding of the genetic architecture of important agronomic traits, as well as the possibility of estimating breeding values directly from the QTL model. The coupling of these advantages makes this methodology genuinely appropriate for handling data of this nature.

References

Aitken KS, Jackson PA, McIntyre CL (2006) Quantitative trait loci identified for sugar related traits in a sugarcane (Saccharum spp.) cultivar x Saccharum officinarum population. Theor Appl Genet 112:1306–1317
Article CAS PubMed Google Scholar
Aitken KS, Hermann S, Karno K, Bonnett GD, McIntyre CL, Jackson PA (2008) Genetic control of yield related stalk traits in sugarcane. Theor Appl Genet 117:1191–1203
Article CAS PubMed Google Scholar
Alimi NA, Bink MCAM, Dieleman JA, Magán JJ, Wubs AM, Palloix A, van Eeuwijk FA (2013) Multi-trait and multi-environment QTL analyses of yield and a set of physiological traits in pepper. Theor Appl Genet 126:2597–2625
Article CAS PubMed Google Scholar
Balzarini M (2001) Applications of mixed models in plant breeding. In: Kang MS (ed) Quantitative genetics, genomics and plant breeding. CABI Publishing, New York, pp 353–363
Chapter Google Scholar
Beavis WD (1994) The power and deceit of QTL experiments: lessons from comparative QTL studies. In: Proceedings of the forty-ninth annual corn and sorghum industry research conference. American Seed Trait Association, Washington, pp 250–266
Belsley DA, Kuh E, Welsch RE (1980) Regression diagnostics: identifying influential data and sources of collinearity. Wiley, New York
Book Google Scholar
Boer MP, Wright D, Feng L, Podlich DW, Luo L, Cooper M, van Eeuwijk FA (2007) A mixed-model quantitative trait loci (QTL) analysis for multiple-environment trial data using environmental covariables for QTL-by-environment interactions, with an example in maize. Genetics 177:1801–1813
Article PubMed Central PubMed Google Scholar
Bonneau J, Taylor J, Parent B, Bennett D, Reynolds M, Feuillet C, Langridge P, Mather D (2013) Multi-environment analysis and improved mapping of a yield-related QTL on chromosome 3B of wheat. Theor Appl Genet 126:747–761
Article CAS PubMed Google Scholar
Broman KW (2001) Review of statistical methods for QTL mapping in experimental crosses. Lab Anim 30:44–52
CAS Google Scholar
Broman KW, Sen Ś (2009) A guide to QTL mapping with R/qtl. Springer, New York
Book Google Scholar
Burnham KP, Anderson DR (2004) Multimodel inference: understanding AIC and BIC in model selection. Sociol Method Res 33:261–304
Article Google Scholar
Collard BCY, Mackill DJ (2008) Marker-assisted selection: an approach for precision plant breeding in the twenty-first century. Philos Trans R Soc B Biol Sci 363:557–572
Article CAS Google Scholar
Cooper M, Podlich DW, Smith OS (2005) Gene-to-phenotype models and complex trait genetics. Aust J Agric Res 56:895–918
Article Google Scholar
Costet L, Raboin LM, Payet M, D’Hont A, Nibouche S (2012) A major quantitative trait allele for resistance to the Sugarcane yellow leaf virus (Luteoviridae). Plant Breed 131:637–640
Article CAS Google Scholar
Eichler EA, Flint J, Gibson G, Kong A, Leal SM, Moore JH, Nadeau JH (2010) Missing heritability and strategies for finding the underlying causes of complex disease. Nat Rev Genet 11:446–450
Article PubMed Central CAS PubMed Google Scholar
Evans DM (2002) The power of multivariate quantitative-trait loci linkage analysis is influenced by the correlation between variables. Am J Hum Genet 70:1599–1602
Article PubMed Central CAS PubMed Google Scholar
Freeman JS, Potts BM, Downes GM, Pilbeam D, Thavamanikumar S, Vaillancourt RE (2013) Stability of quantitative trait loci for growth and wood properties across multiple pedigrees and environments in Eucalyptus globulus. New Phytol 198:1121–1134
Article CAS PubMed Google Scholar
Garcia AAF, Kido EA, Meza AN, Souza HMB, Pinto LR, Pastina MM, Leite CS, da Silva JAG, Ulian EC, Figueira A et al (2006) Development of an integrated genetic map of a sugarcane (Saccharum spp.) commercial cross, based on a maximum-likelihood approach for estimation of linkage and linkage phases. Theor Appl Genet 112:298–314
Article CAS PubMed Google Scholar
Garcia AAF, Wang S, Melchinger AE, Zeng ZB (2008) Quantitative trait loci mapping and the genetic basis of heterosis in maize and rice. Genetics 180:1707–1724
Article PubMed Central PubMed Google Scholar
Garcia AAF, Mollinari M, Marconi TG, Serang OR, Silva RR, Vieira MLC, Vicentini R, Costa EA, Mancini MC, Garcia MOS et al (2013) SNP genotyping allows an in-depth characterisation of the genome of sugarcane and other complex autopolyploids. Sci Rep 3:3399
Article PubMed Central PubMed Google Scholar
Gazaffi R, Margarido GRA, Pastina MM, Mollinari M, Garcia AAF (2014) A model for quantitative trait loci mapping, linkage phase and segregation pattern estimation for a full-sib progeny. Tree Genet Genomes 10:791–801
Article Google Scholar
Haley CS, Knott SS (1992) A simple regression method for mapping quantitative trait loci in line crosses using flanking markers. Heredity 69:315–324
Article CAS PubMed Google Scholar
Hoarau JY, Grivet L, Offmann B, Raboin LM, Diorflar JP, Payet J, Hellmann M, D’Hont A, Glaszmann JC (2002) Genetic dissection of a modern sugarcane cultivar (Saccharum spp.). II. Detection of QTLs for yield components. Theor Appl Genet 105:1027–1037
Article PubMed Google Scholar
Jackson PA (2005) Breeding for improved sugar content in sugarcane. Field Crop Res 92:277–290
Article Google Scholar
Jiang C, Zeng ZB (1995) Multiple trait analysis of genetic mapping for quantitative trait loci. Genetics 140:1111–1127
PubMed Central CAS PubMed Google Scholar
Jordan DR, Casu RE, Besse P, Carroll BC, Berding N, McIntyre CL (2004) Markers associated with stalk number and suckering in sugarcane colocate with tillering and rhizomatousness QTLs in sorghum. Genome 47:988–993
Article CAS PubMed Google Scholar
Kao CH, Zeng ZB (1997) General formulas for obtaining the MLEs and the asymptotic variance-covariance matrix in mapping quantitative trait loci when using the EM algorithm. Biometrics 53:653–665
Article CAS PubMed Google Scholar
Kao CH, Zeng ZB, Teasdale RD (1999) Multiple interval mapping for quantitative trait loci. Genetics 152:1203–1216
PubMed Central CAS PubMed Google Scholar
Kosambi DD (1944) The estimation of map distances from recombination values. Ann Eugen 12:172–175
Article Google Scholar
Lander ES, Botstein D (1989) Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185–199
PubMed Central CAS PubMed Google Scholar
Lin M, Lou XY, Chang M, Wu R (2003) A general statistical framework for mapping quantitative trait loci in nonmodel systems: issue for characterizing linkage phases. Genetics 165:901–913
PubMed Central CAS PubMed Google Scholar
Liu Y, Wang L, Sun C, Zhang Z, Zheng Y, Qiu F (2014) Genetic analysis and major QTL detection for maize kernel size and weigh in multi-environments. Theor Appl Genet 127:1019–1037
Article CAS PubMed Google Scholar
Lopes MS, Reynolds MP, McIntyre CL, Mathews KL, Jalal Kamali MR, Mossad M, Feltaous Y, Tahir ISA, Chatrath R, Ogbonnaya F et al (2013) QTL for yield and associated traits in the Seri/Babax population grown across several environments in Mexico, in the West Asia, North Africa, and South Asia regions. Theor Appl Genet 126:971–984
Article PubMed Google Scholar
Malosetti M, Boer MP, Bink MCAM, Eeuwijk FA van (2006) Multi-trait QTL analysis based on mixed models with parsimonious covariance matrices. In: Proceedings of the world congress on genetics applied to livestock production, 8th edn. WCGALP, Belo Horizonte, pp 25–04
Malosetti M, Ribaut JM, Vargas M, Crossa J, van Eeuwijk FA (2008) A multi-trait multi-environment QTL mixed model with an application to drought and nitrogen stress trials in maize (Zea mays L.). Euphytica 161:241–257
Article Google Scholar
Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A et al (2009) Finding the missing heritability of complex diseases. Nature 461:747–753
Article PubMed Central CAS PubMed Google Scholar
Margarido GRA, Souza AP, Garcia AAF (2007) OneMap: software for genetic mapping in outcrossing species. Hereditas 144:78–79
Article CAS PubMed Google Scholar
Moose SP, Mumm RH (2008) Molecular plant breeding as the foundation for 21st century crop improvement. Plant Physiol 147:969–977
Article PubMed Central CAS PubMed Google Scholar
Nibouche S, Raboin LM, Hoarau JY, D’Hont A, Costet L (2012) Quantitative trait loci for sugarcane resistance to the spotted stem borer Chilo sacchariphagus. Mol Breed 29:129–135
Article Google Scholar
Oliveira KM, Pinto LR, Marconi TG, Margarido GRA, Pastina MM, Teixeira LHM, Figueira AV, Ulian EC, Garcia AAF, Souza AP (2007) Functional integrated genetic linkage map based on EST-markers for a sugarcane (Saccharum spp.) commercial cross. Mol Breed 20:189–208
Article CAS Google Scholar
Pastina MM, Malosetti M, Gazaffi R, Mollinari M, Margarido GRA, Oliveira KM, Pinto LR, Souza AP, van Eeuwijk FA, Garcia AAF (2012) A mixed model QTL analysis for sugarcane multiple-harvest-location trial data. Theor Appl Genet 124:835–849
Article PubMed Central CAS PubMed Google Scholar
Phillips PC (2008) Epistasis—the essential role of gene interactions in the structure and evolution of genetic systems. Nat Rev Genet 9:855–867
Article PubMed Central CAS PubMed Google Scholar
Piepho HP (2000) A mixed-model approach to mapping quantitative trait loci in barley on the basis of multiple environment data. Genetics 156:2043–2050
PubMed Central CAS PubMed Google Scholar
Piepho HP (2005) Statistical tests for QTL and QTl-by-environment effects in segregating populations derived from line crosses. Theor Appl Genet 110:561–566
Article CAS PubMed Google Scholar
Pinto LR, Garcia AAF, Pastina MM, Teixeira LHM, Bressiani JA, Ulian EC, Bidoia MAP, Souza AP (2010) Analysis of genomic and functional RFLP derived markers associated with sucrose content, fiber and yield QTLs in a sugarcane (Saccharum spp.) commercial cross. Euphytica 172:313–327
Article CAS Google Scholar
Piperidis N, Jackson PA, D’Hont A, Besse P, Hoarau JY, Courtois B, Aitken KS, McIntyre CL (2008) Comparative genetics in sugarcane enables structured map enhancement and validation of marker-trait associations. Mol Breed 21:233–247
Article Google Scholar
Podlich DW, Winkler CR, Cooper M (2004) Mapping as you go: an effective approach for marker-assisted selection of complex traits. Crop Sci 44:1560–1571
Article Google Scholar
Raboin LM, Pauquet J, Butterfield M, D’Hont A, Glaszmann JC (2008) Analysis of genome-wide linkage disequilibrium in the highly polyploid sugarcane. Theor Appl Genet 116:701–714
Article CAS PubMed Google Scholar
Serang OR, Mollinari M, Garcia AAF (2012) Efficient exact maximum a posteriori computation for bayesian SNP genotyping in polyploids. PLoS One 7:e30906
Article PubMed Central CAS PubMed Google Scholar
Singh S, Hernandez MV, Crossa J, Singh PK, Bains NS, Singh K, Sharma I (2012) Multi-trait and multi-environment QTL analyses for resistance to wheat diseases. PLoS One 7:e38008
Article PubMed Central CAS PubMed Google Scholar
Singh RK, Singh SP, Tiwari DK, Srivastava S, Singh SB, Sharma ML, Singh R, Mohapatra T, Singh NK (2013) Genetic mapping and QTL analysis for sugar yield-related traits in sugarcane. Euphytica 191:333–353
Article CAS Google Scholar
Smith AB, Cullis BR, Thompson R (2001) Analyzing variety by environment data using multiplicative mixed models and adjustments for spatial field trend. Biometrics 57:1138–1147
Article CAS PubMed Google Scholar
Smith AB, Cullis BR, Thompson R (2005) The analysis of crop cultivar breeding and evaluation trials: an overview of current mixed model approaches. J Agric Sci 143:449–462
Article Google Scholar
Smith AB, Stringer JK, Wei X, Cullis BR (2007) Varietal selection for perennial crops where data relate to multiple harvests from a series of field trials. Euphytica 157:253–266
Article Google Scholar
Sun FD, Zhang JH, Wang SF, Gong WK, Shi YZ, Liu AY, Li JW, Gong JW, Shang HH, Yuan YL (2012) QTL mapping for fiber quality traits across multiple generations and environments in upland cotton. Mol Breed 30:569–582
Article Google Scholar
van Eeuwijk FA, Cooper M, DeLacy IH, Ceccarelli S, Grando S (2001) Some vocabulary and grammar for the analysis of multi-environment trials, as applied to the analysis of FPB and PPB trials. Euphytica 122:477–490
Article Google Scholar
Verbyla AP, Eckerman PJ, Thompson R, Cullis BR (2003) The analysis of quantitative trait loci in multi-environment trials using a multiplicative mixed model. Aust J Agric Res 54:1395–1408
Article CAS Google Scholar
VSN International (2013) GenStat for Windows 16th Edition. VSN International, Hemel Hempstead
Wang DL, Zhu J, Li ZK, Paterson AH (1999) Mapping QTLs with epistatic effects and QTL × environment interactions by mixed linear model approaches. Theor Appl Genet 99:1255–1264
Article Google Scholar
Wei X, Jackson PA, McIntyre CL, Aitken KS, Croft B (2006) Associations between DNA markers and resistance to diseases in sugarcane and effects of population substructure. Theor Appl Genet 114:155–164
Article CAS PubMed Google Scholar
Welham SJ, Gogel BJ, Smith AB, Thompson R, Cullis BR (2010) A comparison of analysis methods for late-stage variety evaluation trials. Aust NZ J Stat 52:125–149
Article Google Scholar
Wu KK, Burnquist W, Sorrells ME, Tew TL, Moore PH, Tanksley SD (1992) The detection and estimation of linkage in polyploids using single-dose restriction fragments. Theor Appl Genet 83:294–300
Article CAS PubMed Google Scholar
Wu R, Ma CX, Painter I, Zeng ZB (2002) Simultaneous maximum likelihood estimation of linkage and linkage phases in outcrossing species. Theor Popul Biol 61:349–363
Article PubMed Google Scholar
Zeng ZB (1993) Theoretical basis of separation of multiple linked gene effects on mapping quantitative trait loci. Proc Natl Acad Sci USA 90:10972–10976
Article PubMed Central CAS PubMed Google Scholar
Zeng ZB (1994) Precision mapping of quantitative trait loci. Genetics 136:1457–1468
PubMed Central CAS PubMed Google Scholar
Zeng ZB, Kao CH, Basten CJ (1999) Estimating the genetic architecture of quantitative traits. Genet Res 74:279–289
Article CAS PubMed Google Scholar

Download references

Acknowledgments

We thank M. Malosetti and F. A. van Eeuwijk for reading an earlier version of this manuscript and providing valuable feedback. This research was supported by Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP, grants 2007/02775-9 and 2008/52197-4) from Brazil. A. A. F. Garcia and A. P. Souza are recipients of a research fellowship from CNPq.

Author information

Authors and Affiliations

Departamento de Genética, Escola Superior de Agricultura “Luiz de Queiroz” (ESALQ), Universidade de São Paulo (USP), CP 83, Piracicaba, SP, 13418-900, Brazil
G. R. A. Margarido & A. A. F. Garcia
Embrapa Milho e Sorgo, CP 285, Sete Lagoas, MG, 35701-970, Brazil
M. M. Pastina
Centro de Biologia Molecular e Engenharia Genética (CBMEG), Departamento de Genética e Evolução, Universidade Estadual de Campinas (UNICAMP), Cidade Universitária Zeferino Vaz, CP6010, Campinas, SP, 13083-875, Brazil
A. P. Souza

Authors

G. R. A. Margarido
View author publications
You can also search for this author in PubMed Google Scholar
M. M. Pastina
View author publications
You can also search for this author in PubMed Google Scholar
A. P. Souza
View author publications
You can also search for this author in PubMed Google Scholar
A. A. F. Garcia
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A. A. F. Garcia.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 34 KB)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Margarido, G.R.A., Pastina, M.M., Souza, A.P. et al. Multi-trait multi-environment quantitative trait loci mapping for a sugarcane commercial cross provides insights on the inheritance of important traits. Mol Breeding 35, 175 (2015). https://doi.org/10.1007/s11032-015-0366-6

Download citation

Received: 21 August 2014
Accepted: 29 July 2015
Published: 09 August 2015
DOI: https://doi.org/10.1007/s11032-015-0366-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Multi-trait multi-environment quantitative trait loci mapping for a sugarcane commercial cross provides insights on the inheritance of important traits

Abstract

Similar content being viewed by others

Marker-trait association study for sucrose and yield contributing traits in sugarcane (Saccharum spp. hybrid)

Association Mapping for Sugarcane Quality Traits at Three Harvest Times

QTL analysis in multiple sorghum mapping populations facilitates dissection of the genetic control of agronomic and yield-related traits in sorghum [Sorghum bicolor (Moench)]

Introduction