Correction to: Genet Sel Evol (2018) 50:53 https://doi.org/10.1186/s12711-018-0426-6
After publication of original article [1], the authors noticed that there was an error.
In the section “Quadratic forms of estimated breeding values” we wrote, based on Taylor series approximations, that:
-
“In the remainder of this paper, we assume that the expectation of a ratio of quadratic forms is equal to the ratio of the expectations. The “Appendix” shows that this holds when the number of individuals included in the statistics is large (several hundred or more) or when they are not structured into very large sibships. Otherwise, as shown in the “Appendix”, both the true regression coefficient \( b = cov\left( {{\hat{\mathbf{u}}}_{p} ,{\mathbf{u}}} \right)/var\left( {{\hat{\mathbf{u}}}_{p} } \right) \) and its estimator \( \hat{b} = cov\left( {{\hat{\mathbf{u}}}_{p} ,{\hat{\mathbf{u}}}_{w} } \right)/var\left( {{\hat{\mathbf{u}}}_{p} } \right) \) have an expectation less than 1, even when the model is perfect and the EBV have the right dispersion.”
This statement is wrong, because a further expansion of the Taylor series (until its 3rd term) shows that the expectation of both \( b \) and \( \hat{b} \) is 1. The correct paragraph should be:
-
“In the remainder of this paper, we assume that the expectation of a ratio of quadratic forms is equal to the ratio of the expectations. The “Correction” below shows that this holds (up to the second order of the Taylor series expansion) when the BLUP properties hold, as assumed throughout the paper.”
Moreover, the Abstract reads “Contrary to common belief, the regression of true on estimated breeding values is (on expectation) lower than 1 for small or related validation sets, due to family structures.” whereas it should read “Up to the second term of a Taylor series expansion, the regression of true on estimated breeding values is (on expectation) equal to 1, even for small or related validation sets.”
Further proof (equivalent to the Appendix) is below. According to [2] the second-order Taylor series expansion of a ratio is \( E\left( {\frac{X}{Y}} \right) \approx \frac{E\left( X \right)}{E\left( Y \right)} - \frac{{Cov\left( {X,Y} \right)}}{{E\left( Y \right)^{2} }} + \frac{Var\left( Y \right)E\left( X \right)}{{E\left( Y \right)^{3} }} \). Consider the bias in the estimation of the ratio \( b = \frac{{cov\left( {{\hat{\mathbf{u}}}_{p} ,{\mathbf{u}}} \right)}}{{var\left( {{\hat{\mathbf{u}}}_{p} } \right)}} \) using the estimator \( \hat{b} = \frac{{cov\left( {{\hat{\mathbf{u}}}_{p} ,{\hat{\mathbf{u}}}_{w} } \right)}}{{var\left( {{\hat{\mathbf{u}}}_{p} } \right)}} \). In our case:
Therefore, we expand \( E\left( {\frac{X}{Y}} \right) \) for \( = \frac{1}{n}\left( {\widehat{{\mathbf{u}}}_{p} - \overline{{\widehat{{\mathbf{u}}}}} _{p} } \right)^{\prime } \left( {\widehat{{\mathbf{u}}}_{w} - \overline{{\widehat{{\mathbf{u}}}}} _{w} } \right) = \frac{1}{n}\left( {\widehat{{\mathbf{u}}}^{\prime } _{p} {\mathbf{S^{\prime}S\widehat{u}}}_{w} {\text{ }}} \right) = \frac{1}{n}\left( {\widehat{{\mathbf{u}}}_{p} {{\mathbf{S\widehat{{u}}}}_{w}} } \right), \) where \( {\mathbf{S}} = {\mathbf{I}} - \frac{1}{n}{\mathbf{J}} \) and \( Y = \frac{1}{n}\left( {\widehat{{\mathbf{u}}}_{p} - \overline{{\widehat{{\mathbf{u}}}}} _{p} } \right)^{\prime } \left( {\widehat{{\mathbf{u}}}_{p} - \widehat{{\mathbf{u}}}_{p} } \right) = \frac{1}{n}\left( {\widehat{{\mathbf{u}}}_{p} {\mathbf{S\widehat{u}}}_{p} } \right). \) The expression for the covariance of bilinear forms (under normality) is:
(\( {\mathbf{C}} \) is the covariance matrix across \( {\mathbf{x}}_{i} \); chapter 2 Equation 58 in [3]). Applied to our case, this yields:
The terms linked to the means disappear, as before, because they have the form \( \mu \mathbf{1}^{\prime}{\mathbf{S^{\prime}}}\left( {{\mathbf{G}} - {\mathbf{C}}_{p}^{uu} } \right){\mathbf{S1}}\mu \), which has a value of 0. Then, we have:
which are all the elements needed. To simplify notation, consider \( {\mathbf{K}} = {\mathbf{G}} - {\mathbf{C}}_{p}^{uu} \). Putting all together gives:
which cancels out. Therefore, up to the 3rd order approximation, the bias in the estimation of \( b = \frac{{cov\left( {{\hat{\mathbf{u}}}_{p} ,{\mathbf{u}}} \right)}}{{var\left( {{\hat{\mathbf{u}}}_{p} } \right)}} \) using the estimator \( \hat{b} = \frac{{cov\left( {{\hat{\mathbf{u}}}_{p} ,{\hat{\mathbf{u}}}_{w} } \right)}}{{var\left( {{\hat{\mathbf{u}}}_{p} } \right)}} \) is 0. In addition, this shows that the expected value of \( b = \frac{{cov\left( {{\hat{\mathbf{u}}}_{p} ,{\mathbf{u}}} \right)}}{{var\left( {{\hat{\mathbf{u}}}_{p} } \right)}} \) is also 1 since the same derivation as above holds with \( {\mathbf{u}} \) in the place of \( {\mathbf{u}}_{w} \).
References
Legarra A, Reverter A. Semi-parametric estimates of population accuracy and bias of predictions of breeding values and future phenotypes using the LR method. Genet Sel Evol. 2018;50:53. https://doi.org/10.1186/s12711-018-0426-6.
Stuart A, Ord K. Kendall’s advanced theory of statistics. London: Arnold Hodder Headline Group; 1998.
Searle S. Linear models. New York: John Wiley; 1971.
Acknowledgements
The authors thank Peter Sullivan (Lactanet, Guelph, ON, Canada) for pointing out that a second order expansion should give a ratio of 1.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Legarra, A., Reverter, A. Correction to: Semi-parametric estimates of population accuracy and bias of predictions of breeding values and future phenotypes using the LR method. Genet Sel Evol 51, 69 (2019). https://doi.org/10.1186/s12711-019-0511-5
Published:
DOI: https://doi.org/10.1186/s12711-019-0511-5