Abstract
We present a lattice QCD determination of light quark masses with three sea-quark flavours (\(N_\mathrm{f}=2+1\)). Bare quark masses are known from PCAC relations in the framework of CLS lattice computations with a non-perturbatively improved Wilson-Clover action and a tree-level Symanzik improved gauge action. They are fully non-perturbatively improved, including the recently computed Symanzik counter-term \(b_{\mathrm{A}} - b_{\mathrm{P}}\). The mass renormalisation at hadronic scales and the renormalisation group running over a wide range of scales are known non-perturbatively in the Schrödinger functional scheme. In the present paper we perform detailed extrapolations to the physical point, obtaining (for the four-flavour theory) \(m_{\mathrm{u}/\mathrm{d}}(2~\mathrm{GeV})= 3.54(12)(9)~\mathrm{MeV}\) and \(m_{\mathrm{s}}(2~\mathrm{GeV}) = 95.7(2.5)(2.4)~\mathrm{MeV}\) in the \(\overline{\mathrm{MS}}\) scheme. For the mass ratio we have \(m_{\mathrm{s}}/m_{\mathrm{u}/\mathrm{d}}= 27.0(1.0)(0.4)\). The RGI values in the three-flavour theory are \(M_{\mathrm{u}/\mathrm{d}}= 4.70(15)(12)~\mathrm{MeV}\) and \(M_{\mathrm{s}}= 127.0(3.1)(3.2)~\mathrm{MeV}\).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The lattice regularisation of QCD provides a well-defined procedure for the determination of the fundamental parameters of the theory (i.e. the gauge coupling and the quark masses) from first principles. The aim of the present work is the determination of the three lightest quark masses (i.e. those of the up, down and strange flavours), in a framework in which the up and down quarks are degenerate and all heavier flavours (i.e. charm and above), if present in the theory, would be quenched (valence) degrees of freedom. This is known as \(N_\mathrm{f}=2+1\) lattice QCD. Moreover, QED effects are ignored.
The three-flavour theory adopted in this paper is presumably sufficient for determining light quark masses due to the decoupling of heavier quarks [1,2,3,4]. Indeed, lattice world averages of light quark masses \(m_{\mathrm{u}/\mathrm{d}}\), \(m_{\mathrm{s}}\) do not show a significant dependence on the number of flavours at low energies for \(N_\mathrm{f}\ge 2\) within present-day errors [5]. This also holds for the more accurately known renormalisation group independent ratio \(m_{\mathrm{u}/\mathrm{d}}/m_{\mathrm{s}}\). Recently, heavy-flavour decoupling has been substantiated also non-perturbatively [6].
This paper is based on large-scale \(N_\mathrm{f}=2+1\) flavour ensembles produced by the Coordinated Lattice Simulation (CLS) effort [7, 8]. The simulations employ a tree-level Symanzik-improved gauge action and a non-perturbatively improved Wilson fermion action; see references [9,10,11,12]. The sea quark content is made of a doublet of light degenerate quarks \(m_{\mathrm {q},1} = m_{\mathrm {q},2}\), plus a heavier one \(m_{\mathrm {q},3}\). At the physical point \(m_{\mathrm {q},1}, m_{\mathrm {q},2} = m_{\mathrm{u}/\mathrm{d}}\equiv \tfrac{1}{2}(m_{\mathrm{u}}+m_{\mathrm{d}})\) and \(m_{\mathrm {q},3} = m_{\mathrm{s}}\).
The bare quark masses produced by CLS [7, 8] need to be combined with renormalisation and improvement coefficients in order to obtain renormalised quantities with \(\mathrm{O}(a^2)\) discretisation effects. We use ALPHA collaboration results for the quark mass renormalisation and Renormalisation Group (RG) running [13] in the Schrödinger functional (SF) scheme. Symanzik improvement is implemented for the removal of discretisation effects from correlation functions, leaving us with \(\mathrm{O}(a^2)\) uncertainties in the bulk and \(\mathrm{O}(g_0^4 a)\) ones at the time boundaries. We find that correlation functions extrapolations to the continuum limit are compatible with an \(\mathrm{O}(a^2)\) overall behaviour. The counter-terms required for the improvement of the axial current are known from refs. [14,15,16]. The present work has combined all these elements, obtaining estimates of the up/down and strange quark masses, as well as their ratio. These are expressed as renormalisation scheme-independent and scale-independent quantities, known as Renormalisation Group Invariant (RGI) quark masses. Of course we also give the same results in the \(\overline{\mathrm{MS}}\) scheme at scale \(\mu = 2\) GeV.
The bare dimensionless parameters of the lattice theory are the strong coupling \(g_0^2 \equiv 6/\beta \) and the quark masses expressed in lattice units \(a m_{\mathrm {q},1} = a m_{\mathrm {q},2}\) and \(a m_{\mathrm {q},3}\), with a the lattice spacing. They can be varied freely in simulations. Having chosen a specific discretisation of the QCD action, its parameters must be calibrated so that three “input” hadronic quantities (one for each bare mass parameter and one for the lattice spacing) attain their physical values. Other physical quantities can subsequently be predicted. Such input quantities are typically very well-known from experiment, but they also need to be precisely computed on the lattice; examples are ground state hadron masses and decay constants \((m_{\pi },f_{\pi },m_\mathrm{K},f_\mathrm{K},\ldots )\). Since the majority of numerical large-scale simulations do not yet include the small strong-isospin breaking and electromagnetic effects, the physical input quantities have to be corrected accordingly. Following Ref. [8], we use the values of Ref. [17]
The calibration of the lattice spacing, referred to as scale setting, usually singles out a dimensionful quantity as reference scale \(f_\mathrm{ref}\,[\mathrm{MeV}]\). Its dimensionless counterpart \(af_\mathrm{ref}\) is computed on the lattice for fixed values of the bare coupling at the point where the physical spectrum, such as \([am_{\pi }/(af_\mathrm{ref})]_{g_0^2}\equiv m_{\pi }/f_\mathrm{ref}\), is reproduced in the bare parameter space \((g_0^2,am_{\mathrm{q},i})\) of the lattice theory. In this way, the lattice spacings \(a(g_0^2)=[af_\mathrm{ref}]_{g_0^2}/f_\mathrm{ref}\), and consequently all computed observables, are obtained in physical units. When simulations approach the point of physical mass parameters while the lattice spacing is lowered, computational demands rapidly increase. In the present work results are obtained at non-zero lattice spacings and at quark masses which correspond to unphysical meson and decay constant values. Thus our data need to be extrapolated to the continuum limit and extra/interpolated to the physical quark mass values. This is achieved with a joint chiral and continuum extrapolation. The present work pays particular attention to these extrapolations and interpolations and the ensuing sources of systematic error.
So far we did not specify the reference scale \(f_\mathrm{ref}\). In Ref. [8] the three-flavor symmetric combination \(f_{\pi \mathrm K}^{{\scriptscriptstyle \mathrm phys}}=\tfrac{2}{3}(f_\mathrm{K}^{{\scriptscriptstyle \mathrm phys}}+\tfrac{1}{2}f_{\pi }^{{\scriptscriptstyle \mathrm phys}})=147.6(5)~\mathrm{MeV}\), (obtained from the physical input of Eqs. (1.1)) was used for calibration, and for the determination of the hadronic gradient flow scale \(t_0\) [18].Footnote 1 An artificial (theoretical) hadronic scale with mass dimension \(-2\), \(t_0\) is precisely computable with small systematic effects [19, 20], and thus well-suited as intermediate scale on the lattice. Its physical value determined from CLS ensembles [8] reads
at fixed
where the first error of \(\sqrt{8t_{0}^{{\scriptscriptstyle \mathrm phys}}}\) is statistical and the second systematic.
The theoretical framework of our work is explained in Sect. 2. The definitions of bare current quark masses, their renormalisation parameters and the \(\mathrm{O}(a)\)-improvement counter-terms are provided in standard ALPHA-collaboration fashion. There is also a fairly detailed exposition of how the so-called “chiral trajectory” (a line of constant physics – LCP) is traced by \(N_\mathrm{f}=2+1\) CLS simulations. In Sect. 3 we outline the computations leading to renormalised current quark masses as functions of the pion squared mass. These are computed in the SF renormalisation scheme at a hadronic (low energy) scale. In Sect. 4 we perform the combined chiral and continuum limit extrapolations in order to obtain estimates of the physical up/down and strange quark masses. Details of the ansätze we have used are provided in Appendix A and in Appendix B. Our final results are gathered in Sect. 5. Preliminary results have been presented in [21].
2 Theoretical framework
We review our strategy for computing light quark masses with improved Wilson fermions. In what follows equations are often written for a general number of flavours \(N_\mathrm{f}\). In practice \(N_\mathrm{f}= 2+1\). Flavours 1 and 2 indicate the lighter fermion fields, which are degenerate; at the physical point their mass is the average up/down quark mass. Flavour 3 stands for the heavier fermion, corresponding to the strange quark at the physical point.
2.1 Quark masses, renormalisation, and improvement
The starting point is the definition of bare correlation functions on a lattice with spacing is a and physical extension \(L^3 \times T\):
where the pseudoscalar density and axial current are
The indices \(i,j =1,2,3\) label quark flavours, which are always distinct (\( i \ne j \)).
The bare current (or PCAC) quark mass is defined via the axial Ward identity at zero momentum and a plateau average between suitable initial and final time-slices \(t_{\mathrm{i}} < t_{\mathrm{f}}\),
with the source \(P^{ji}\) positioned either at \(y_0=a\) or \(y_0=T-a\).Footnote 2 The mass-independent improvement coefficient \(c_\mathrm{A}\) is determined non-perturbatively [14]. The average of two renormalised quark masses is then expressed in terms of the PCAC mass \( m_{ij}\) as follows:
where \(M_\mathrm{q}\equiv \mathrm{diag}(m_{\mathrm {q},1},m_{\mathrm {q},2}, \cdots , m_{\mathrm {q},N_\mathrm{f}})\) is the matrix of the sea quark subtracted masses, characteristic of Wilson fermions. Given the bare mass parameter \(m_{0,i} \equiv (1/\kappa _i - 8)/(2a)\), with \(\kappa _i\) the Wilson hopping parameter, these are defined as
where \(m_\mathrm {cr}\sim 1/a\) is an additive mass renormalisation arising from the loss of chiral symmetry by the regularisation and \(\kappa _{\mathrm{cr}}\) is the critical (chiral) point. The average of two subtracted masses is then denoted by \( m_{\mathrm {q},ij} \equiv \tfrac{1}{2}(m_{\mathrm {q},i}+m_{\mathrm {q},j})\) in Eq. (2.5).
The axial current normalisation \(Z_\mathrm {A}(g_0^2)\) is scale-independent, whereas the current quark mass renormalisation parameter \(1/Z_\mathrm {P}(g_0^2,a\mu )\) depends on the renormalisation scale \(\mu \). The renormalisation condition imposed on the pseudoscalar density operator \(P^{ji}\) defines the renormalisation scheme for the quark masses. The schemes used in the present work (SF and \(\overline{\mathrm{MS}}\)) are mass-independent. Pertinent details will be discussed in latter sections.
The improvement coefficients \(b_\mathrm{A}-b_\mathrm{P}\) and \(\bar{b}_{\text {A}} - \bar{b}_{\text {P}}\) of Eq. (2.5) cancel \(\mathrm{O}(a)\) mass-dependent cutoff effects; they are functions of the bare gauge coupling \(g_0^2\). The corresponding counter-terms of Eq. (2.5) contain the subtracted masses \(am_{\mathrm {q},ij}\) and \(\mathrm{Tr}[aM_\mathrm{q}]\), which require knowledge on the critical mass \(m_\mathrm {cr}\). This can be avoided by substituting these masses with current quark masses and their sum. Their relationship is [22],
where \(Z(g_0^2)\equiv Z_\mathrm{P}/(Z_\mathrm{S}Z_\mathrm{A})\) and \(r_\text {m}(g_0^2)\) are finite normalisations. \(Z_\mathrm{S}\) is the renormalisation parameter of the non-singlet scalar density \(S^{ij} \equiv {{\bar{\psi }}}^i \psi ^j\) and \(r_{\mathrm {m}}/Z_\mathrm{S}\) is the renormalisation parameter of the singlet scalar density, which indirectly defines \(r_{\mathrm {m}}\); cf. Ref. [22]. In the above we neglect \(\mathrm{O}(a)\) terms, as they only contribute to \(\mathrm{O}(a^2)\) in the b-counter-terms of Eq. (2.5). Substituting \(am_{\mathrm {q},ij}\rightarrow am_{ij}\) in the latter expression, we obtain
where we define
To leading order in perturbation theory the difference \(b_\mathrm{A}-b_\mathrm{P}\) is \(\mathrm{O}(g_0^2)\) and equals \(\tilde{b}_{\mathrm {A}}-\tilde{b}_{\mathrm {P}}\). However, non-perturbative estimates are likely to differ significantly, especially in the range of couplings \(g_0\) considered here (\(1.56 \lesssim g_0^2 \lesssim 1.76\)). We will employ non-perturbative estimates of \(b_\mathrm{A}-b_\mathrm{P}\) and Z; cf. Ref. [23]. The term multiplying \( M_{\mathrm{sum}}\) contains \((1-r_{\mathrm {m}})/r_{\mathrm {m}}\) and \((\bar{b}_{\text {A}} - \bar{b}_{\text {P}})\). In perturbation theory \(r_{\mathrm {m}}= 1 + 0.001158\,C_\mathrm{F}\,N_\mathrm{f}\,g_0^4\) [24, 25], \((1-r_{\mathrm {m}})/r_{\mathrm {m}}\sim \mathrm{O}(g_0^4)\) and \((\bar{b}_{\text {A}} - \bar{b}_{P}) \sim \mathrm{O}(g_0^4)\) [22]. A first non-perturbative study of the coefficients \(\bar{b}_{\text {A}}\) and \(\bar{b}_{\text {P}}\) produced noisy results with 100% errors [26]. Given the lack of robust non-perturbative results and the fact that the term in curly brackets is \(\mathrm{O}(g_0^4)\) in perturbation theory, it will be dropped in what follows.
Once the quark mass averages \(m_{12\mathrm R}\) and \(m_{13\mathrm R}\) are computed say, in the SF scheme at a scale \(\mu _{\mathrm{had}}\), the three renormalised quark masses can be determined. Since we are working in the isospin limit (\(m_{\mathrm {q},1} = m_{\mathrm {q},2}\)), the lighter quark mass is given by \(m_{12\mathrm R}\). Then one can isolate \(m_{13\mathrm R}\) from the ratio \(m_{13\mathrm R}/m_{12\mathrm R}\) in which, as seen from Eq. (2.8), the \( M_{\mathrm{sum}}\) counter-term cancels out.
The ALPHA Collaboration is devoting considerable resources to the determination of the non-perturbative evolution of the renormalised QCD parameters (strong coupling and quark masses) between a hadronic and a perturbative energy scale (\(\mu _{\text {had}} \le \mu \le \mu _{\text {pt}}\)). Quark masses are renormalised at \(\mu _{\text {had}} \sim \mathrm{O}(\varLambda _{\text {QCD}})\) and evolved to \(\mu _{\text {pt}} \sim \mathrm{O}(M_{\text {W}})\) [13, 27,28,29,30,31,32,33,34,35,36] in the SF scheme [37, 38]. Both renormalisation and RG-running are done non-perturbatively. At \(\mu _{\text {pt}}\) perturbation theory is believed to be reliably controlled and we may safely switch to the conventionally preferred, albeit inherently perturbative \(\overline{\mathrm{MS}}\) scheme.
We will be quoting results also for the scheme- and scale-independent renormalisation group invariant (RGI) quark masses \(M_{12}\) and \(M_{13}\) (corresponding to the current masses \(m_{12}\) and \(m_{13}\)) as well as the physical RGI quark masses \(M_{\mathrm{u}/\mathrm{d}}\) and \(M_{\mathrm{s}}\) derived from them. They are conventionally defined in massless schemes [39] by
for each quark flavour i. In our opinion, \(M_i\) is better suited for comparisons either to experimental results or other theoretical determinations. Equation (2.10) is formally exact and independent of perturbation theory as long as the renormalised parameters \((g_{\mathrm{R}}, m_{i\mathrm R})\) and the continuum renormalisation group functions (i.e. the Callan–Symanzik \(\beta \)-function and the mass anomalous dimension \(\tau \)) are known non-perturbatively with satisfacory accuracy [13, 33,34,35,36]. Their computation in the SF scheme with \(N_\mathrm{f}=3\) massless quarks has been carried out in Ref. [13].
Our determination of the renormalised quark masses is based on the bare current mass averages \(m_{ij\mathrm R}\); cf. Eqs. (2.5) and (2.8). The analogue of these expressions for the RGI mass averages is given by
Note that the ratio \(M/m_{\mathrm{R}}(\mu _{\text {had}})\) is flavour-independent; cf. Eq. (2.10). In Ref. [13] it has been computed in the SF scheme for the \(N_\mathrm{f}=3\) massless flavours at \(\mu _{\text {had}} = 233(8)~\mathrm{MeV}\).
2.2 The chiral trajectory and scale setting
Our aim is to stay on a line of constant Physics within systematic uncertainties of \(\mathrm{O}(a^2)\), as we vary the bare parameters of the theory (i.e. the gauge coupling \(g_0\) and the \(N_\mathrm{f}= 2+1\) quark masses). In particular, if the improved bare gauge coupling
is kept fixed in the simulations, so does the lattice spacing, with any fluctuations being attributed to \(\mathrm{O}(a^2)\)-effects. The problem is that \(b_\mathrm{g}(g_0^2)\) is only known to one-loop order in perturbation theory [40, 41]; \(b_\mathrm{g}^{\mathrm{PT}}=0.012N_\mathrm{f}g_0^2\). Thus, following refs. [42, 43], we vary the quark masses at fixed \(g_0^2\), ensuring that the trace of the quark mass matrix remains constant:
In this way the improved bare gauge coupling \({\tilde{g}}_0^2\) is kept constant at fixed \(\beta \) for any \(b_g\).Footnote 3
This requirement leads to an unusual but unambiguous approach to the physical point, shown in the \((M_{12},M_{13})\)-plane in the left panel of Fig. 1. Initially, one starts at the symmetric point (\(am_{\mathrm {q},1}=am_{\mathrm {q},2}=am_{\mathrm {q},3}=am_{\mathrm {q}}^{{\scriptscriptstyle \mathrm sym}}\)) for some fixed coupling \(\beta =6/g_0^2\), and tunes the mass parameter of the simulation in such a way that \(\mathrm{Tr}[M_\mathrm{q}]=\mathrm{Tr}[M_\mathrm{q}]_{{\scriptscriptstyle \mathrm phys}}\) to a good approximation.Footnote 4 This is achieved by varying \(am_{\mathrm {q}}^{{\scriptscriptstyle \mathrm sym}}\) until \((m_\mathrm{K}^2+\tfrac{1}{2}m_{\pi }^2)/f_\mathrm{ref}\) takes its physical value. Since it is proportional to \(\mathrm{Tr}[M_\mathrm{q}]\) at leading order in chiral perturbation theory (\(\chi \mathrm {PT}\)), it suffices as tuning observable. In subsequent simulations, one successively lifts the mass-degeneracy towards the physical point by decreasing the light quark masses while maintaining the constant-trace condition. By doing so the physical strange quark mass is approached from below as in Fig. 1 (left panel). We call this procedure “the determination of the chiral trajectory”.
Note that the improved renormalised quark mass matrix \(M_{\mathrm{R}}\) is given by [22]
Since the \(d_m\)-counter-term is proportional to squared bare masses, a constant \(\mathrm{Tr}[M_\mathrm{q}]\) does not correspond to a constant \(\mathrm{Tr}[M_{\mathrm{R}}]\); the latter requirement is violated by \(\mathrm{O}(a)\) effects. This is an undesirable feature, as it implies that the chiral trajectory is not a line of constant-physics. In practice these violations have been monitored in Ref. [8] (Fig. 4, lowest lhs panel), where \(\mathrm{Tr}[M_{\mathrm{R}}]\) has been computed, at constant \(\mathrm{Tr}[M_q]\), from the current quark masses with 1-loop perturbative Symanzik b-coefficients. The violations appear to be bigger than what one would expect from \(\mathrm{O}(a)\) effects.
These considerations have led the authors of Ref. [8] to redefine the chiral trajectory in terms of \(\phi _4 =\) const., where
and \(t_0 \) is the gluonic quantity of the Wilson flow [18]; it has mass dimension \(-2\). Here \(m_{\pi }\) and \(m_\mathrm{K}\) are the lightest and strange pseudoscalar mesons respectively; at the physical point these are the pion \(m_{\pi }^{\scriptscriptstyle \mathrm phys}\) and kaon \(m_\mathrm{K}^{\scriptscriptstyle \mathrm phys}\). Keeping \(\phi _4\) constant is a Symanzik-improved constant physics condition. But \(\phi _4\) is proportional to the sum of the three quark masses only in leading-order (LO) chiral perturbation theory (\(\chi \)PT). Thus, the improved bare coupling \({\tilde{g}}_0^2\) now suffers from \(\mathrm{O}(a m_q \mathrm{Tr}[M_q ])\) discretisation effects due to higher-order \(\chi \)PT contributions. In practice, these turn out to be small, as can be seen from Ref. [8] (Fig. 4, lowest rhs panel), where \(\mathrm{Tr}[M_{\mathrm{R}}]\) has been computed, at constant \(\phi _4\). The violations appear to be at most 1% and thus the variation of the \(\mathrm{O}(a)\) \(b_g\)-term in \({\tilde{g}}_0^2\) can be ignored.
Obviously, one must also ensure, through careful tuning, that the chosen \(\phi _4 =\) const. trajectory passes through the point corresponding to physical up/down and strange renormalised masses (i.e. quark masses that correspond to the physical pseudoscalar mesons \(m_{\pi }^{\scriptscriptstyle \mathrm phys}\) and \(m_\mathrm{K}^{\scriptscriptstyle \mathrm phys}\)). This is done by driving \(\phi _4\) to its physical value \(\phi _4^{\scriptscriptstyle \mathrm phys}= 8 t_0 [ (m_\mathrm{K}^{\scriptscriptstyle \mathrm phys})^2 \, + \, (m_{\pi }^{\scriptscriptstyle \mathrm phys})^2/2 ]\) through mass shifts [8]. The aim is to express the computed quantities of interest (in our case the quark masses) as functions of
with \(\phi _4\) held fixed at \(\phi _4^{\scriptscriptstyle \mathrm phys}\), and eventually extrapolate them to \(\phi _2^{\scriptscriptstyle \mathrm phys}= 8 t_0 (m_{\pi }^{\scriptscriptstyle \mathrm phys})^2\).
The determination of the redefined chiral trajectory is not straightforward. One needs to know the value of \(\phi _4^\mathrm{{\scriptscriptstyle \mathrm phys}}\). The latter is obtained from \(t_0\) and the pseudoscalar masses (corrected for isospin-breaking effects) quoted in Eq. (1.1). But since the value of \(t_0\) is only approximately known, one starts with an initial guess \({\tilde{t}}_0\), which provides an initial guess \({{\tilde{\phi }}}_4\). At each \(\beta \), the symmetric point with degenerate masses (\(\kappa _1 = \kappa _3\)) is tuned so that the computed \(t_0/a^2\), \(a m_{\pi }\) and \(a m_\mathrm{K}\) combine as in Eq. (2.15) to give a value close to \({{\tilde{\phi }}}_4\). The other ensembles at the same \(\beta \) have been obtained by decreasing the degenerate (lightest) quark mass \(m_{\mathrm{q},1} = m_{\mathrm{q},2}\), while increasing the heavier mass \(m_{\mathrm{q},3}\) so as to keep \(\mathrm{Tr}[M_\mathrm{q}]\) constant. Thus they do not correspond exactly to the same \({{\tilde{\phi }}}_4\). Small corrections of the subtracted bare quark masses (or hopping parameters) are introduced, using a Taylor expansion discussed in Sect. IV of Ref. [8], in order to shift \(\phi _4\) to the reference value \({{\tilde{\phi }}}_4\) and correct analogously the measured PCAC quark masses and other quantities of interest such as the decay constants. The procedure is repeated for each \(\beta \) and the same value \({{\tilde{\phi }}}_4\) at the starting symmetric point.
All shifted quantities are now known at \({{\tilde{\phi }}}_4\) as functions of \(\phi _2\). Defining the combination of decay constants
the dimensionless \(\sqrt{8 {\tilde{t}}_0} f_{\pi \mathrm{K}}\) is computed for all \(\phi _2\) and extrapolated to \({{\tilde{\phi }}}_2 = 8 {\tilde{t}}_0 (m_\pi ^{{\scriptscriptstyle \mathrm phys}})^2\). The extrapolated \(\sqrt{{\tilde{t}}_0} f_{\pi \mathrm{K}}\) , combined with the experimentally known \(f_{\pi \mathrm{K}}^{{\scriptscriptstyle \mathrm phys}}\), gives a better estimate of \({\tilde{t}}_0\), and thus of \({{\tilde{\phi }}}_4\). As described in Sect. V of Ref. [8], this procedure can be recursively repeated and eventually the physical value of \(t_0\) is fixed through \(f_{\pi \mathrm{K}}^{{\scriptscriptstyle \mathrm phys}}\); the value in Eq. (1.2) from Ref. [8] leads to
The main message is that once PCAC quark masses are shifted onto the chiral trajectory defined by the constant \(\phi _4^{{\scriptscriptstyle \mathrm phys}}\), they only depend on a single variable, namely \(\phi _2\).
In analogy to the definitions (2.15) and (2.16), we also define rescaled dimensionless bare current quark masses and their renormalised counterparts at scale \(\mu _{\mathrm{had}}\)
The redefined chiral trajectory is shown in Fig. 1 (right panel), where the light-light and heavy-light dimensionless mass averages (\(\phi _{12 \mathrm R}\) and \(\phi _{13 \mathrm R}\) respectively) are plotted as functions of \(\phi _2\). Extrapolating in \(\phi _2\) to \(\phi _2^{{\scriptscriptstyle \mathrm phys}}\) amounts to the simultaneous approach of the light and heavy quark masses to the corresponding physical up/down and strange values. All other physical quantities are then also at the physical point. Section 4 is dedicated to these extrapolations.
3 Quark mass computations
We base our determination of quark masses on the CLS ensembles for \(N_\mathrm{f}= 2+1\) QCD, listed in Table 1. The bare gauge action is the Lüscher–Weisz one, with tree-level coefficients [11]. The bare quark action is the Wilson, Symanzik-improved [10] one. The Clover term coefficient \(c_\mathrm{sw}\) has been tuned non-perturbatively in Ref. [12]. Boundary conditions are periodic in space and open in time, as detailed in Ref. [45].
For details on the generation of these ensembles see Ref. [7]. As seen in Table 1, results have been obtained at four lattice spacings in the range \(0.05 \lesssim a/\mathrm{fm}\lesssim 0.086\). For each lattice coupling \(\beta = 6/g_0^2\), gauge field ensembles have been generated for a fewFootnote 5 values of the Wilson hopping parameters \(\kappa _1 = \kappa _2\) and \(\kappa _3\). The light pseudoscalar meson (pion) varies between 200 and 420 MeV. The heaviest value corresponds to the symmetric point where the three quark masses and the pseudoscalar mesons are degenerate. The strange meson (kaon) varies between 420 and 470 MeV. Given that our lightest pseudoscalars are relatively heavy (200 MeV), the chiral limit ought to be taken with care.
The bare correlation functions \(f_{\mathrm {P}}^{ij}, f_{\mathrm {A}}^{ij}\) of Eqs. (2.1) are estimated with stochastic sources located on time slice \(y_0\), with either \(y_0 = a\) or \(y_0 = T-a\). From them the current quark masses \(m_{12}, m_{13}\) are computed as in Eq. (2.4), with the \(\mathrm{O}(a)\)-improvement coefficient \(c_{\mathrm{A}}\) determined non-perturbatively in Ref. [14]. The exact procedure to select the plateaux range in the presence of open boundary conditions has been explained in Refs. [7, 44, 46].
Having obtained the bare current quark masses \(m_{12}\), \(m_{13}\) at four values of the coupling \(g_0^2\), we construct the renormalised dimensionless quantities \(m_{12\mathrm R}(\mu _{\text {had}})\) and \(m_{13\mathrm R}(\mu _{\text {had}})\); cf. Eq. (2.8). For this we need the ratio \(Z_{\mathrm{A}}(g_0^2)/Z_{\mathrm{P}}(g_0^2,\mu _{\mathrm{had}})\) and the Symanzik b-counter-terms. Results for the axial current normalisation \(Z_{\mathrm{A}}(g_0^2)\) are available in Ref. [47], from a separate computation based on the chirally rotated Schrödinger Functional setup of Refs. [48,49,50]. The computation of \(Z_{\mathrm{P}}(g_0^2,\mu _{\mathrm{had}})\) in the SF scheme, for \(\mu _{\mathrm{had}} = 233(8)~\mathrm{MeV}\), was carried out in Ref. [13] for a theory with \(N_\mathrm{f}=3\) massless quarks and the lattice action of the present work. The \(Z_{\mathrm{P}}\) results, shown in Eqs. (5.2) and (5.3) of Ref. [13], are in a range of inverse gauge couplings which covers the \(\beta \in [3.40,3.85]\) interval of the large volume simulations of Ref. [8], from which our bare dimensionless PCAC masses are extracted.
Besides the ratio \(Z_{\mathrm{A}}/Z_{\mathrm{P}}\), we also need the improvement coefficient \((\tilde{b}_{\mathrm {A}}-\tilde{b}_{\mathrm {P}})\), which multiplies the \(\mathrm{O}(a)\) counter-term proportional to \(a m_{ij}\) in Eq. (2.8). To leading order in perturbation theory \(\tilde{b}_{\mathrm {A}}-\tilde{b}_{\mathrm {P}}= -0.0012 g_0^2\). Non-perturbative estimates based on a coordinate-space renormalisation scheme have been provided for \(N_\mathrm{f}=2+1\) lattice QCD in Ref. [26]. More accurate non-perturbative results have been subsequently obtained by the ALPHA Collaboration, using suitable combinations of valence current quark masses, measured on ensembles with \(N_\mathrm{f}=3\) nearly-chiral sea quark masses in small physical volumes [16, 23]. Also these simulations have been carried out in an inverse coupling range that spans the interval \(\beta \in [3.40,3.85]\) of the large volume CLS results of Ref. [8]. They are expressed in the form of ratios \(R_{\mathrm{AP}}\) and \(R_{\mathrm{Z}}\), from which \((b_\mathrm {A}- b_\mathrm {P})\) and Z are estimated; thus \((\tilde{b}_{\mathrm {A}}-\tilde{b}_{\mathrm {P}}) = R_{\mathrm{AP}}/R_{\mathrm{Z}}\). In Ref. [23], results are quoted for two values of constant Physics, dubbed LCP-0 and LPC-1. In LCP-0, \(R_{\mathrm{AP}}\) and \(R_{\mathrm{Z}}\) are obtained with all masses in the chiral limit. In LCP-1, one valence flavour is in the chiral limit (so it is equal to the sea quark mass), while a second one is held fixed to a non-zero value. The physical volumes are always kept fixed. In Ref. [23], Eqs. (5.1), (5.2) and (5.3) refer to LCP-0 results, while those in Eqs. (5.1), (5.4) and (5.5) refer to LCP-1; differences are due to \(\mathrm{O}(a)\) discretisation effects.
We have opted to use the LCP-0 values of \(\tilde{b}_{\mathrm {A}}-\tilde{b}_{\mathrm {P}}\) in the present work. The covariance matrices of the fit parameters of \(R_{\mathrm{AP}}\) as well as those of \(R_{\mathrm{Z}}\) are provided in Ref. [23]. We assume that the covariance matrix between fit parameters of \(R_{\mathrm{AP}}\) and \(R_{\mathrm{Z}}\) is nil. This is justified a posteriori, by repeating the analysis with LCP-1 values, as a means to estimate the magnitude of systematic errors arising from our choice. Moreover, we have also compared our LCP-0 results to those obtained from different fit functions, used in the preliminary analysis of Ref. [16], as well as from the perturbative estimate \(\tilde{b}_{\mathrm {A}}-\tilde{b}_{\mathrm {P}}\). We find that the contribution arising from such variations is below \(\sim 1\%\) of the total error on renormalised quark masses at the physical point.
As discussed in Sect. 2, the complicated Symanzik counter-term in curly brackets, multiplying \(aM_{\mathrm{sum}}\) in Eq. (2.8), is \(\mathrm{O}(g_0^4 a)\) in perturbation theory. As there are no robust non-perturbative estimates of its magnitude at present, we will drop this term, assuming that the \(\mathrm{O}(g_0^4 a)\) effects it would remove are subdominant compared to \(\mathrm{O}(a^2)\) uncertainties.
As already explained in Sect. 2.2, our analysis is based on the rescaled dimensionless quantities defined in Eqs. (2.15), (2.16), and (2.20). At each \(\beta \) value and for each gauge field configuration, we have results for \(t_0/a^2\), \(a m_{12}\) and \(a m_{13}\) from Refs. [7, 8], from which \(\phi _{12}\) and \(\phi _{13}\) are obtained. The error analysis is carried out using the Gamma method approach [51,52,53,54] and automatic differentiation for error propagation, using the library described in Ref. [55]. This takes into account all the existing errors and correlations in the data and ancillary quantities (renormalisation constants, improvement coefficients, etc.), and estimates autocorrelation functions (including exponential tails) to rescale the uncertainties correspondingly. Following [8], the estimate of the exponential autocorrelation times \(\tau _{\mathrm{\scriptscriptstyle exp}}\) used in the analysis is the one quoted in [7], viz.,
We have checked that without attaching exponential tails statistical errors are 40–70% smaller in our final results. The full analysis has been crosschecked by an independent code based on (appropriately) binned jackknife error estimation. Note that one of the strengths of data analysis based on the Gamma-method is that each Monte Carlo ensemble is treated independently, and the final statistical uncertainty is determined as a sum in quadratures of the statistical fluctuations for each ensemble. This allows to trace back which fraction of the statistical variance comes from each ensemble or ancillary quantities, such as renormalisation constants (see References 5–7 in [55]). This feature will be exploited in the error budgets provided below.
The starting values for \(\phi _{12\mathrm R}\) and \(\phi _{13 \mathrm R}\) on which the analysis is based are shown in Table 2, where renormalised quark masses are in the SF scheme at a scale \(\mu _{\mathrm{\scriptscriptstyle had}} = 233(8)~\mathrm{MeV}\). By suitably fitting these quantities as functions of \(\phi _2\), and extrapolating to \(\phi _2^{{\scriptscriptstyle \mathrm phys}}\), we obtain the results for physical up/down and strange quarks at scale \(\mu _{\mathrm{had}}\), as detailed in Sect. 4. Only then do we convert them to the RGI masses, by multiplying them with the RG-running factor [13]
with the error added in quadrature; cf. Eq. (2.11).
Before presenting our chiral fits in Sect. 4, we conclude this section with a comment on finite-volume effects. Current quark masses are not expected to be affected by finite-volume corrections, since their values are fixed by Ward identities. On the other hand, meson masses, decay constants, and the ratio \(t_0/a^2\) are expected to suffer from such effects. This can be directly checked in the ensembles H200 and N202, obtained at \(\beta = 3.55\) with degenerate masses and corresponding to volumes of about 2 fm and 3 fm respectively. A glance at the relevant entries of Table II of Ref. [8] shows that quark masses do not change as the volume is varied, while meson masses and decay constants vary by about 2.5%, which corresponds to differences of about \(2-3.5\sigma \).Footnote 6 Standard \(\mathrm{SU}(3)\) \(\chi \)PT NLO formulae are available for masses and decay constants [56]; \(t_0/a^2\) does not suffer from finite-volume effects up to NNLO corrections [20]. In particular, the \(\chi \)PT-predicted effects for meson masses are below the percent level, since the lattice spatial size in units of the inverse lightest pseudoscalar meson mass is in the range [3.9, 5.8]. On the other hand, by directly comparing the values in Table 2 obtained at the same lattice spacing and sea quark masses but different volumes (cf. Ref. [44]), it is seen that the finite-volume effects on \(t_0/a^2\) and \(m_\pi ^2\) are comparable and come with opposite signs. As a result, they largely cancel in \(\phi _2\), the variable in which chiral fits are performed. Decay constants, which generally suffer from larger finite-volume effects than meson masses, enter our computation indirectly only – firstly through NLO terms in chiral fits, where the finite-volume correction is sub-leading, and secondly through the physical value of \(\sqrt{8t_0}\) determined in [8], where these corrections have already been taken into account. We therefore expect that the quantities most affected by finite-volume effects are the rescaled current quark masses \(\phi _{12},\phi _{13}\), due to the presence of \(\sqrt{8t_0}/a\) in their definition. As mentioned above, these are much smaller than our statistical uncertainty, cf. Table 2. In the rest of our analysis we will therefore neglect this source of uncertainty.
4 Extrapolations to physical quark masses
Having obtained the dimensionless renormalised current mass combinations \(\phi _{12 \mathrm R}\) and \(\phi _{13 \mathrm R}\) at each \(\beta \) as functions of \(\phi _2\), we now proceed with the determination of the physical values \(\phi _{\mathrm{ud}}\) and \(\phi _{\mathrm{s}}\). This is done in fairly standard fashion through fits and extrapolations. To begin with, we note that the two lighter degenerate quark masses are simply given by \(\phi _{12}\), whereas the heavier strange one is obtained from the differenceFootnote 7
It is then possible to perform simultaneous fits of \(\phi _{12}\) and \(\phi _{\mathrm{h}}\) as functions of \(\phi _2\) and the lattice spacing, subsequently extrapolating the results to \(\phi _2^{{\scriptscriptstyle \mathrm phys}}\) of Eq. (2.19) and the continuum limit, so as to obtain \(\phi _{\mathrm{ud}}\) and \(\phi _{\mathrm{s}}\). Variants of this method consist in simultaneous fits and extrapolations of either \(\phi _{\mathrm{13}}\) or \(\phi _{\mathrm{12}}\) on one hand and their ratio \(\phi _{\mathrm{12}}/\phi _{\mathrm{13}}\) on the other. These turn out to be advantageous, as does a certain combination of ratios involving \(\phi _{12}\), \(\phi _{13}\), \(\phi _2\), and \(\phi _4\), for reasons discussed below. We recall in passing that in the ratio \(\phi _{12}/\phi _{13}\) all renormalisation factors cancel.
We use fits based in chiral perturbation theory (\(\chi \)PT fits) which are expected to model the data well close to the chiral limit \(\phi _2 = 0\). Recall that we have performed \(N_\mathrm{f}= 2+1\) simulations on a chiral trajectory; starting from a symmetric point where all quark masses are degenerate, we increase the mass of the heavy quark while decreasing that of the light one, until the physical point is reached. Since both masses are varying, it is natural to use \(\mathrm SU(3)_L \otimes SU(3)_R\) chiral perturbation theory, which bears explicit dependence on both masses. This works when all three quark masses in the simulations are light enough for say, NLO \(\chi \)PT with three flavours to provide reliable fits. In Ref. [57] it is stated that this is the case for their data, obtained with domain wall fermions, as long as the average quark mass satisfies \(a m_{\mathrm{avg}} < 0.01\). As seen in Table 2 of Ref. [8], our PCAC dimensionless quark masses \(a m_{12}\) and \(a m_{13}\) also satisfy this empirical constraint. The real test comes about a posteriori, when the \(\mathrm SU(3)_L \otimes SU(3)_R\) NLO ansätze are seen to fit our results well.
In Appendix A and Appendix B, ansätze for NLO \(\chi \)PT and discretisation effects are adapted to our specific parametrisation in terms of \(\phi _2\) and \(\phi _4\). For the current quark masses these are
where \(\phi _K = (2 \phi _4 - \phi _2)/2\). The constants \(p_1, p_2, p_3\) and K are related to standard \(\chi \)PT parameters in Eqs. (A.8)-(A.11), whereas the chiral logarithms \({\mathcal {L}}_2\) and \({\mathcal {L}}_\eta \) are defined in Eq. (A.12). For justification of the ansatz used for the discretisation effects, see comments after Eqs. (B.8) and (B.9). We stress again that \(\phi _{12}\) and \(\phi _{13}\) are functions of \(\phi _2\) only, \(\phi _4\) being held constant. They have common fit parameters \(p_1\), \(p_2\) and \(p_3\).
Using the above expressions and consistently neglecting higher orders in the continuum \(\chi \)PT terms, we obtain the ratio of PCAC masses (cf. Eqs. (A.13) and (B.11))
As discussed in Appendix B, the form of the cutoff effects respects the constraint \(\phi _{12}/\phi _{13}=1\) at the symmetric point \(m_{\mathrm{q},1} = m_{\mathrm{q},3}\), which is exact at all lattice spacings by construction.
For the combination defined in Eq. (A.14), we have
An alternative to NLO \(\chi \)PT fits is the use of power series, based simply on Taylor expansions around the symmetric point \(m_{\mathrm{q},1} = m_{\mathrm{q},2} = m_{\mathrm{q},3}\), for which \(\phi _2^{\scriptscriptstyle \mathrm{sym}} = 2\phi _4^{\scriptscriptstyle \mathrm{phys}}/3\):
Note that imposing the constraint \(\phi _{12}=\phi _{13}\) at the symmetric point implies that \(s_0\) and \(S_0\) are common fit parameters. These expansions are expected to give reliable results in the higher end of the \(\phi _2\) range, underperforming close to the chiral limit. They are thus complementary to the chiral fits, which are better suited for the small-mass regime. In this sense the two approaches may provide a handle to estimate the systematic uncertainties due to these fits and extrapolations.
We explore various fit variants, in order to unravel the presence of potentially significant systematic effects. They are encoded as follows:
Fitted quantities and ansätze:
[chi12] Fit of \(\phi _{12}\) data only, using the \(\chi \)PT ansatz.
[chi13] Fit of \(\phi _{13}\) data only, using the \(\chi \)PT ansatz.
[tay12] Fit of \(\phi _{12}\) data only, using the Taylor expansion ansatz.
[tay13] Fit of \(\phi _{13}\) data only, using the Taylor expansion ansatz.
[chipc] Combined fit to \(\phi _{12}\) and \(\phi _{13}\), using \(\chi \)PT.
[chirc] Combined fit to \(\phi _{13}\) and \(\phi _{12}/\phi _{13}\), using \(\chi \)PT.
[chirr] Combined fit to the ratio \(\phi _{12}/\phi _{13}\) and the combination \(2\phi _{13}/\phi _K+\phi _{12}/\phi _2\) using \(\chi \)PT.
[tchir] Combined fit to \(\phi _{13}\) and the ratio \(\phi _{12}/\phi _{13}\), using the Taylor expansion for \(\phi _{13}\) and \(\chi \)PT for \(\phi _{12}/\phi _{13}\).
Discretisation effects:
[a1] Fits with terms \(\propto a^2/t_0\) only.
[a2] Fits with terms \(\propto a^2/t_0\) and \(\propto \phi _2 a^2/t_0\).
Cuts on pseudoscalar meson masses:
[420] Fit all available data, including the symmetric point; i.e. data satisfies \(m_\pi \lesssim 420~\mathrm{MeV}\).
[360] Fit excluding the symmetric point; i.e. data satisfies \(m_\pi \lesssim 360~\mathrm{MeV}\).
[300] Fit only points for which \(m_\pi \le 300~\mathrm{MeV}\).
Any given fit will thus be labelled as [xxxxx][yy][zzz], using the above tags.
The results obtained with the various fit methods at the physical point (cf. Eq. (2.19)) are expressed in physical units by dividing them out by \(\sqrt{8t_0^{\scriptscriptstyle {\mathrm{phys}}}}\). Multiplication by the factor of Eq. (3.2) subsequently gives the RGI mass estimates shown in Figs. 2 and 3. We comment on the various fit ansätze:
Independent fits of \(\varvec{\phi }_{12}\) and \(\varvec{\phi }_{13}\): comparing light quark masses \(M_{\mathrm{u}/\mathrm{d}}\) (upper panel of Fig. 2) from [chi12][a1] and [chi12][a2] we find that they are sensitive to the presence of a discretisation term \(\propto a^2/t_0\), albeit within \(\sim 1-2\sigma \). This difference is attenuated when the more stringent mass cutoff [chi12][300] is enforced, mainly because the error increases as less points are fitted. The same qualitative conclusions are true for the Taylor expansion fits [tay12] of the light quark mass. On the other hand, the lower panel of Fig. 2 shows that the average quark mass \(M_{13}^{{\scriptscriptstyle \mathrm phys}}\) is not sensitive to the details of the fit ansätze. This is not surprising, given that our simulations have been performed in a region of rather heavy pions 220 MeV \(\le m_\pi \le \) 420 MeV, with data covering the physical point \(M_{13}^{{\scriptscriptstyle \mathrm phys}}\), while \(M_{\mathrm{u}/\mathrm{d}}\) requires long extrapolations. The conclusion is that independent fits are reliable for \(\phi _{13}\) but less so for \(\phi _{12}\), and so we discard their results.
Combined fits to \(\varvec{\phi }_{12}\) and \(\varvec{\phi }_{13}\): Fig. 3 shows that the fits [chipc][a1] and [chipc][a2] give results which are sensitive to the ansatz employed for the cutoff effects. This is more pronounced for \(M_{\mathrm{u}/\mathrm{d}}\) and the ratio \(M_{\mathrm{s}}/M_{\mathrm{u}/\mathrm{d}}\), but persists also for \(M_{\mathrm{s}}\). Moreover, fits [chipc][a1][420] and [chipc][a1][360] display visible differences when compared to fits of the [chipc][a2] variety; the latter agree with results obtained from different fit ansätze. For these reason we have also discarded results from this analysis.
Combined fits to \(\varvec{\phi }_{13}\) and \(\varvec{\phi }\)-ratios: As previously explained, we have explored three ansätze, namely [chirc], [chirr], and [tchir]. In all cases Fig. 3 shows that there is no significant dependence of the results from the details of these fits, except for a very slight fluctuation of the [tchir][a1][420] results for \(M_{\mathrm{s}}\). Preferring to err on the side of caution, we also discard [tchir] fits.
A few general points concerning the fit analysis deserve to be highlighted:
In all our fits the \(\chi ^2\)/dof is well below 1. This is partly because our data are correlated – both from the fact that there are common renormalisation factors and improvement coefficients, and because we are including the contribution to the \(\chi ^2\) from the fluctuations of the meson masses (horizontal errors). Therefore, while the goodness-of-fit is in general satisfactory, we will refrain from quoting the corresponding p-values, since they are not really meaningful.
Unsurprisingly, the inclusion of a second discretisation term \(\propto \phi _2(a^2/t_0)\) in the fits contributes to an increase of the error. This term is often compatible with zero, and almost always so within \(\sim 2\sigma \), suggesting that fits [a1] are safe. As stated previously, exceptions are fits [chi12] and [chipc], where inclusion of this term has a strong effect.
Within large uncertainties, the coefficients of the leading cutoff effects (i.e. those \(\propto a^2/t_0\)) depend on the fitted observable, and are larger for \(\phi _{13}\) than for \(\phi _{12}\).
The power-series fits [tay12] and [tay13] behave remarkably well. Results from [tay12] vanish within errors in the chiral limit, except for fits going up to the symmetric point, which are sometimes incompatible with naught by 2–3\(~\sigma \). This is evidence that our data are not precise enough to capture the impact of chiral logs. Fits [tay13] to \(\phi _{lh}\) are very stable, and impressively better than those obtained with the \(\chi \)PT ansatz. Indeed, if one considers fits [texp1] and [texp2], which are safest from the point of view of error estimation, all the fits considered provide compatible results for \(M_{13}\) within one sigma. Notice, furthermore, that the constant terms of [tay12] and [tay13] are generally in good agreement, signalling the consistency of the approach. It is also interesting to note that the coefficient of the quadratic term is very small and always compatible with zero within 1\(\sigma \) (save for two cases where it vanishes within 2\(\sigma \)).
Fits [chirc] and [chirr] appear to be the stablest.
NLO \(\chi \)PT appears to be suffering around and above \(400~\mathrm{MeV}\).
5 Final results and discussion
Following the analysis of Sect. 4, we quote as final results those obtained from the following procedure:
The central values are those of a combined fit to the ratio \(\phi _{12}/\phi _{13}\) and the quantity \(2\phi _{13}/\phi _K+\phi _{12}/\phi _2\), using NLO \(\chi \)PT, with pseudoscalar meson masses less than 360 MeV and a discretisation term proportional to \(a^2/t_0\) (i.e., fit [chirr][a1][360]). The error from this fit will appear as the first uncertainty in the results below. The fit is illustrated in Fig. 4.
We estimate systematic errors from the spread of central values of all other [chirr] and [chirc] fits, for all pion mass cutoffs, and for both [a1] and [a2]. The spread is intended to be the difference between the central value, obtained as described in the previous item, and the most distant central value of all other [chirr] and [chirc] fits. This is the second error of the results below. Recall that [chirc] are combined fits to \(\phi _{13}\) and the ratio \(\phi _{12}/\phi _{13}\), using NLO \(\chi \)PT.
Discard other fits, including [chipc], considered too unstable.
All results have been obtained using the Symanzik \({\tilde{b}}\)-parameters computed in the LCP-0 case (see discussion in Sect. 3). Using LPC-1 results instead, has very marginal effects on the error.
In Sect. 3 we have also argued that for the quantities under consideration finite volume effects are negligible.
The resulting RGI masses are
The quark mass ratio is obtained from
Dependence on renormalisation is only implicit, from the joint fit with \(\phi _{13}\). The same procedures as above yield
The above results for RGI masses refer to the \(N_\mathrm{f}=2+1\) theory.
It is customary in phenomenological studies to report light quark masses measured in the \(N_\mathrm{f}=2+1\) lattice theory in the \(\overline{\mathrm{MS}}\) scheme at 2 GeV, referred to the more physical QCD with four flavours. This entails using \(N_\mathrm{f}=3\) perturbative RG-running from 2 GeV down to the charm threshold, followed by \(N_\mathrm{f}=4\) perturbative RG-running back to 2 GeV; see for example Ref. [5]. We use 4-loop perturbative RG-running and the value for the \(\varLambda _{\mathrm{QCD}}^{\overline{\mathrm{MS}}}\) parameter computed by the ALPHA Collaboration in Ref. [34] to obtainFootnote 8
The mass ratio is obviously the same as in Eq. (5.3). We note in passing that switching to the four-flavour theory has a very small effect on \(\overline{\mathrm{MS}}\) results, since at \(2~\mathrm{GeV}\) the matching factor is \(m_{\mathrm{R}}(N_\mathrm{f}=4)/m_{\mathrm{R}}(N_\mathrm{f}=3) =1.002\).
The error budget for our computation is summarised in Table 3 and Fig. 5. Uncertainties are completely dominated by our chiral fits. We have separated these errors into two contributions; see first two lines of Table 3. The first error is that of our best fit [chirr][a1][360], and includes the statistical errors as well as the error from combined fits in \(\phi _2\) and a. The second uncertainty is the one arising upon varying the fit ansätze and their \(\phi _2\) range. All other errors are clearly seen to be subdominant. It is worth noting that, as expected, the largest contribution to the uncertainty comes from the ensembles with the lightest sea pion masses, especially the one with the finest lattice spacing. It is then clear that decreasing our errors would require more chiral ensembles, and more extensive simulations at light masses.
The current FLAG 2019 [5] world averages from \(N_\mathrm{f}=2+1\) simulations, in the \(\overline{\mathrm{MS}}\) scheme, reportedly quoted for the \(N_\mathrm{f}=4\) theory as explained above, are:
The strange mass estimate is based on the results of Refs. [58,59,60,61,62,63], while the up/down one is based on Refs. [58,59,60,61, 64]. For the quark mass ratio, based on Refs. [58,59,60, 63], FLAG quotes
Our results for the strange and light quark masses agree with those of FLAG within \(1.7\sigma \) and \(1.2\sigma \) respectively and thus exhibit good compatibility albeit with bigger errors.
Data Availability Statement
This manuscript has no associated data or the data will not be deposited. [Authors’ comment: Several Terabytes of raw and processed data are stored in the several HPC systems used in the project. Data can be made available by the authors on request, in compliance with the Open Data policies of the respective institutions.]
Notes
The gradient flow scale \(t_0\) is defined by the implicit relation \(\{t^2 \langle E(t) \rangle \}_{t=t_0}=0.3\), for the finite Yang–Mills energy density E(t) at flow time t; see Section 6 of Ref. [7].
In our simulations we average correlation functions with the source at \(y_0=a\) and (time-reversed) correlation functions with the source at \(y_0=T-a\). Since bare quantities are computed on lattices with open boundary conditions in time, we do not use translation invariance for the source position.
In Ref. [44] (cf. Sect. 5.3.2) it was estimated that when, in some ensembles, \(\mathrm{Tr}[ M_\mathrm{q}]\) is not constant, the resulting effect on the shift of the lattice spacing is about 6 per mille. This estimate was based on the 1-loop value of \(b_g\), \(b_\mathrm{A}-b_\mathrm{P}\) and Z.
In practise one only tunes the bare quark mass \(am_{0}\), since \(am_\mathrm {cr}\) is unknown a priori, but constant at fixed \(\beta \).
We note in passing that for the ensemble with \(\beta = 3.46\) we only have results for degenerate quark masses.
The ensemble H200 is only used in this context in the present work. Since at \(\beta = 3.55\) we have results at two larger volumes (N202/203/200 and D200), we do not use H200 results in our analysis.
Henceforth all quark masses will be renormalised. In order to simplify the notation, we shall drop the subscript R from \(\phi _{12 \mathrm R}\), \(\phi _{13 \mathrm R}\) in this section, in Appendix A and in Appendix B.
In converting our results to \(\overline{\mathrm{MS}}\) we have taken into account the uncertainty in the matching factor coming from the error on \(\varLambda _{\mathrm{QCD}}^{\overline{\mathrm{MS}}}\), as well as the covariance of the latter with our determination of \(M_i\).
References
K. Symanzik, Infrared singularities and small distance behavior analysis. Commun. Math. Phys. 34, 7 (1973). https://doi.org/10.1007/BF01646540
T. Appelquist, J. Carazzone, Infrared singularities and massive fields. Phys. Rev. D 11, 2856 (1975). https://doi.org/10.1103/PhysRevD.11.2856
B.A. Ovrut, H.J. Schnitzer, Gauge theories with minimal subtraction and the decoupling theorem. Nucl. Phys. B 179, 381 (1981). https://doi.org/10.1016/0550-3213(81)90011-0
W. Bernreuther, W. Wetzel, Decoupling of heavy quarks in the minimal subtraction scheme. Nucl. Phys. B 197, 228 (1982). https://doi.org/10.1016/0550-3213(82)90288-7, https://doi.org/10.1016/S0550-3213(97)00811-0
S. Aoki et al. (Flavour Lattice Averaging Group), FLAG review (2019). arXiv:1902.08191
ALPHA collaboration, F. Knechtli, T. Korzec, B. Leder, G. Moir, Power corrections from decoupling of the charm quark. Phys. Lett. B 774, 649 (2017). https://doi.org/10.1016/j.physletb.2017.10.025. arXiv:1706.04982
M. Bruno et al., Simulation of QCD with \(\text{ N }_{f} =\) 2 \(+\) 1 flavors of non-perturbatively improved Wilson fermions. JHEP 02, 043 (2015). https://doi.org/10.1007/JHEP02(2015)043. arXiv:1411.3982
M. Bruno, T. Korzec, S. Schaefer, Setting the scale for the CLS \(2 + 1\) flavor ensembles. Phys. Rev. D 95, 074504 (2017). https://doi.org/10.1103/PhysRevD.95.074504. arXiv:1608.08900
K.G. Wilson, Confinement of quarks. Phys. Rev. D 10, 2445 (1974)
B. Sheikholeslami, R. Wohlert, Improved continuum limit lattice action for QCD with Wilson fermions. Nucl. Phys. B 259, 572 (1985). https://doi.org/10.1016/0550-3213(85)90002-1
M. Lüscher, P. Weisz, On-shell improved lattice gauge theories. Commun. Math. Phys. 97, 59 (1985). https://doi.org/10.1007/BF01206178
J. Bulava, S. Schaefer, Improvement of \(N_f=3\) lattice QCD with Wilson fermions and tree-level improved gauge action. Nucl. Phys. B 874, 188 (2013). https://doi.org/10.1016/j.nuclphysb.2013.05.019. arXiv:1304.7093
ALPHA collaboration, I. Campos, P. Fritzsch, C. Pena, D. Preti, A. Ramos, A. Vladikas, Non-perturbative quark mass renormalisation and running in \(N_f=3\) QCD. Eur. Phys. J. C 78, 387 (2018). https://doi.org/10.1140/epjc/s10052-018-5870-5. arXiv:1802.05243
ALPHA collaboration, J. Bulava, M. Della Morte, J. Heitger, C. Wittemeier, Non-perturbative improvement of the axial current in \(N_f =3\) lattice QCD with Wilson fermions and tree-level improved gauge action. Nucl. Phys. B 896, 555 (2015). https://doi.org/10.1016/j.nuclphysb.2015.05.003. arXiv:1502.04999
J. Bulava, M. Della Morte, J. Heitger, C. Wittemeier, Nonperturbative renormalization of the axial current in \(N_f = 3\) lattice QCD with Wilson fermions and a tree-level improved gauge action. Phys. Rev. D 93, 114513 (2016). https://doi.org/10.1103/PhysRevD.93.114513. arXiv:1604.05827
ALPHA collaboration, G.M. de Divitiis, M. Firrotta, J. Heitger, C.C. Köster, A. Vladikas, Non-perturbative determination of improvement \(b\)-coefficients in \(N_f=3\). EPJ Web Conf. 175, 10008 (2018). https://doi.org/10.1051/epjconf/201817510008. arXiv:1710.07020
S. Aoki et al., Review of lattice results concerning low-energy particle physics. Eur. Phys. J. C 77, 112 (2017). https://doi.org/10.1140/epjc/s10052-016-4509-7. arXiv:1607.00299
M. Lüscher, Properties and uses of the Wilson flow in lattice QCD. JHEP 08, 071 (2010). https://doi.org/10.1007/JHEP08(2010)071, https://doi.org/10.1007/JHEP03(2014)092. arXiv:1006.4518
R. Sommer, Scale setting in lattice QCD. PoS LATTICE2013, 015 (2014). https://doi.org/10.22323/1.187.0015. arXiv:1401.3270
O. Bär, M. Golterman, Chiral perturbation theory for gradient flow observables. Phys. Rev. D 89, 034505 (2014). https://doi.org/10.1103/PhysRevD.89.099905,https://doi.org/10.1103/PhysRevD.89.034505. arXiv:1312.4999
ALPHA collaboration, M. Bruno, I. Campos, J. Koponen, C. Pena, D. Preti, A. Ramos et al., Light and strange quark masses from \(N_f=2+1\) simulations with Wilson fermions, PoS. LATTICE2018, 220 (2019). https://doi.org/10.22323/1.334.0220. arXiv:1903.04094
T. Bhattacharya, R. Gupta, W. Lee, S.R. Sharpe, J.M. Wu, Improved bilinears in lattice QCD with non-degenerate quarks. Phys. Rev. D 73, 034504 (2006). https://doi.org/10.1103/PhysRevD.73.034504. arXiv:hep-lat/0511014
ALPHA collaboration, G. M. Divitiis, P. Fritzsch, J. Heitger, C. C. Köster, S. Kuberski, A. Vladikas, Non-perturbative determination of improvement coefficients \(b_{{\rm m}}\) and \(b_{{\rm A}} - b_{{\rm P}}\) and normalisation factor \(Z_{{\rm m}} Z_{{\rm P}}/Z_{{\rm A}}\) with \(N_{{\rm f}}= 3\) Wilson fermions. Eur. Phys. J. C 79, 797 (2019). https://doi.org/10.1140/epjc/s10052-019-7287-1. arXiv:1906.03445
M. Constantinou, M. Hadjiantonis, H. Panagopoulos, Renormalization of flavor singlet and nonsinglet fermion bilinear operators. PoS LATTICE2014, 298 (2014). https://doi.org/10.22323/1.214.0298. arXiv:1411.6990
RQCD collaboration, G. S. Bali, E. E. Scholz, J. Simeth, W. Södner, Lattice simulations with \(N_f=2+1\) improved Wilson fermions at a fixed strange quark mass. Phys. Rev. D 94, 074501 (2016). https://doi.org/10.1103/PhysRevD.94.074501. arXiv:1606.09039
P. Korcyl, G.S. Bali, Non-perturbative determination of improvement coefficients using coordinate space correlators in \(N_f=2+1\) lattice QCD. Phys. Rev. D 95, 014505 (2017). https://doi.org/10.1103/PhysRevD.95.014505. arXiv:1607.07090
M. Lüscher, R. Sommer, U. Wolff, P. Weisz, Computation of the running coupling in the SU(2) Yang–Mills theory. Nucl. Phys. B 389, 247 (1993). https://doi.org/10.1016/0550-3213(93)90292-W. arXiv:hep-lat/9207010
M. Lüscher, R. Sommer, P. Weisz, U. Wolff, A precise determination of the running coupling in the SU(3) Yang–Mills theory. Nucl. Phys. B 413, 481 (1994). https://doi.org/10.1016/0550-3213(94)90629-7. arXiv:hep-lat/9309005
ALPHA collaboration, M. Della Morte, R. Frezzotti, J. Heitger, J. Rolf, R. Sommer, U. Wolff, Computation of the strong coupling in QCD with two dynamical flavors. Nucl. Phys. B 713, 378 (2005). https://doi.org/10.1016/j.nuclphysb.2005.02.013. arXiv:hep-lat/0411025
S. Capitani, M. Lüscher, R. Sommer, H. Wittig, Non-perturbative quark mass renormalization in quenched lattice QCD. Nucl. Phys. B 544, 669 (1999). https://doi.org/10.1016/S0550-3213(00)00163-2,https://doi.org/10.1016/S0550-3213(98)00857-8. arXiv:hep-lat/9810063
ALPHA collaboration, M. Della Morte, R. Hoffmann, F. Knechtli, J. Rolf, R. Sommer, I. Wetzorke et al., Non-perturbative quark mass renormalization in two-flavor QCD. Nucl. Phys. B 729, 117 (2005). https://doi.org/10.1016/j.nuclphysb.2005.09.028. arXiv:hep-lat/0507035
P. Fritzsch, A. Ramos, The gradient flow coupling in the Schrödinger functional. JHEP 10, 008 (2013). https://doi.org/10.1007/JHEP10(2013)008. arXiv:1301.4388
ALPHA collaboration, M. Dalla Brida, P. Fritzsch, T. Korzec, A. Ramos, S. Sint, R. Sommer, Slow running of the gradient flow coupling from 200 MeV to 4 GeV in \(N_{{\rm f}}=3\) QCD. Phys. Rev. D 95, 014507 (2017). https://doi.org/10.1103/PhysRevD.95.014507. arXiv:1607.06423
ALPHA collaboration, M. Bruno, M. Dalla Brida, P. Fritzsch, T. Korzec, A. Ramos, S. Schaefer et al., QCD coupling from a nonperturbative determination of the three-flavor \(\Lambda \) parameter. Phys. Rev. Lett. 119, 102001 (2017). https://doi.org/10.1103/PhysRevLett.119.102001. arXiv:1706.03821
ALPHA collaboration, M. Dalla Brida, P. Fritzsch, T. Korzec, A. Ramos, S. Sint, R. Sommer, Determination of the QCD \(\Lambda \)-parameter and the accuracy of perturbation theory at high energies. Phys. Rev. Lett. 117, 182001 (2016). https://doi.org/10.1103/PhysRevLett.117.182001. arXiv:1604.06193
ALPHA collaboration, M. Dalla Brida, P. Fritzsch, T. Korzec, A. Ramos, S. Sint, R. Sommer, A non-perturbative exploration of the high energy regime in \(N_{\rm f}=3\) QCD. Eur. Phys. J. C 78, 372 (2018). https://doi.org/10.1140/epjc/s10052-018-5838-5. arXiv:1803.10230
M. Lüscher, R. Narayanan, P. Weisz, U. Wolff, The Schrödinger functional: a renormalizable probe for nonabelian gauge theories. Nucl. Phys. B 384, 168 (1992). https://doi.org/10.1016/0550-3213(92)90466-O. arXiv:hep-lat/9207009
S. Sint, On the Schrödinger functional in QCD. Nucl. Phys. B 421, 135 (1994). https://doi.org/10.1016/0550-3213(94)90228-3. arXiv:hep-lat/9312079
J. Gasser, H. Leutwyler, Quark masses. Phys. Rep. 87, 77 (1982). https://doi.org/10.1016/0370-1573(82)90035-7
S. Sint, R. Sommer, The running coupling from the QCD Schrödinger functional: a one loop analysis. Nucl. Phys. B 465, 71 (1996). https://doi.org/10.1016/0550-3213(96)00020-X. arXiv:hep-lat/9508012
S. Sint, P. Weisz, Further results on \(O(a)\) improved lattice QCD to one loop order of perturbation theory. Nucl. Phys. B 502, 251 (1997). https://doi.org/10.1016/S0550-3213(97)00372-6. arXiv:hep-lat/9704001
W. Bietenholz et al., Tuning the strange quark mass in lattice simulations. Phys. Lett. B 690, 436 (2010). https://doi.org/10.1016/j.physletb.2010.05.067. arXiv:1003.1114
W. Bietenholz et al., Flavour blindness and patterns of flavour symmetry breaking in lattice simulations of up, down and strange quarks. Phys. Rev. D 84, 054509 (2011). https://doi.org/10.1103/PhysRevD.84.054509. arXiv:1102.5300
M. Bruno. The energy scale of the 3-flavour \(\Lambda \) parameter. Ph.D. thesis, Humboldt University, Berlin (2015). https://doi.org/10.18452/17516
M. Lüscher, S. Schaefer, Lattice QCD with open boundary conditions and twisted-mass reweighting. Comput. Phys. Commun. 184, 519 (2013). https://doi.org/10.1016/j.cpc.2012.10.003. arXiv:1206.2809
M. Bruno, P. Korcyl, T. Korzec, S. Lottini, S. Schaefer, On the extraction of spectral quantities with open boundary conditions. PoS LATTICE2014, 089 (2014). https://doi.org/10.22323/1.214.0089. arXiv:1411.5207
M. Dalla Brida, T. Korzec, S. Sint, P. Vilaseca, High precision renormalization of the flavour non-singlet Noether currents in lattice QCD with Wilson quarks. Eur. Phys. J. C79, 23 (2019). https://doi.org/10.1140/epjc/s10052-018-6514-5. arXiv:1808.09236
S. Sint, The chirally rotated Schrödinger functional with Wilson fermions and automatic \(O(a)\) improvement. Nucl. Phys. B 847, 491 (2011). https://doi.org/10.1016/j.nuclphysb.2011.02.002. arXiv:1008.4857
S. Sint, B. Leder, Testing universality and automatic \(O(a)\) improvement in massless lattice QCD with Wilson quarks. PoS LATTICE2010, 265 (2010). arXiv:1012.2500
M. Dalla Brida, S. Sint, A dynamical study of the chirally rotated Schrödinger functional in QCD. PoS LATTICE2014, 280 (2014). arXiv:1412.8022
ALPHA collaboration, U. Wolff, Monte Carlo errors with less errors. Comput. Phys. Commun. 156, 143 (2004). https://doi.org/10.1016/S0010-4655(03)00467-3,https://doi.org/10.1016/j.cpc.2006.12.001. arXiv:hep-lat/0306017
ALPHA collaboration, S. Schaefer, R. Sommer, F. Virotta, Critical slowing down and error analysis in lattice QCD simulations. Nucl. Phys. B 845, 93 (2011). https://doi.org/10.1016/j.nuclphysb.2010.11.020. arXiv:1009.5228
F. Virotta, Critical slowing down and error analysis of lattice QCD simulations. Ph.D. thesis, Humboldt-Universität zu Berlin, Mathematisch-Naturwissenschaftliche Fakultät I (2012)
ALPHA collaboration, H. Simma, R. Sommer, F. Virotta, General error computation in lattice gauge theory. ALPHA Collaboration internal notes (2012–2014)
A. Ramos, Automatic differentiation for error analysis of Monte Carlo data. Comput. Phys. Commun. 238, 19 (2019). https://doi.org/10.1016/j.cpc.2018.12.020. arXiv:1809.01289
G. Colangelo, S. Dürr, C. Haefeli, Finite volume effects for meson masses and decay constants. Nucl. Phys. B 721, 136 (2005). https://doi.org/10.1016/j.nuclphysb.2005.05.015. arXiv:hep-lat/0503014
RBC-UKQCD collaboration, C. Allton et al., Physical results from 2+1 flavor domain wall QCD and SU(2) chiral perturbation theory. Phys. Rev. D 78, 114509 (2008). https://doi.org/10.1103/PhysRevD.78.114509. arXiv:0804.0473
RBC, UKQCD collaboration, T. Blum et al., Domain wall QCD with physical quark masses. Phys. Rev. D 93, 074505 (2016). https://doi.org/10.1103/PhysRevD.93.074505. arXiv:1411.7017
S. Dürr, Z. Fodor, C. Hoelbling, S.D. Katz, S. Krieg, T. Kurth et al., Lattice QCD at the physical point: light quark masses. Phys. Lett. B 701, 265 (2011). https://doi.org/10.1016/j.physletb.2011.05.053. arXiv:1011.2403
S. Dürr, Z. Fodor, C. Hoelbling, S. Katz, S. Krieg et al., Lattice QCD at the physical point: simulation and analysis details. JHEP 1108, 148 (2011). https://doi.org/10.1007/JHEP08(2011)148. arXiv:1011.2711
C. McNeile, C.T.H. Davies, E. Follana, K. Hornbostel, G.P. Lepage, High-precision c and b masses, and QCD coupling from current–current correlators in lattice and continuum QCD. Phys. Rev. D 82, 034512 (2010). https://doi.org/10.1103/PhysRevD.82.034512. arXiv:1004.4285
Y. Maezawa, P. Petreczky, Quark masses and strong coupling constant in \(2+1\) flavor QCD. Phys. Rev. D 94, 034507 (2016). https://doi.org/10.1103/PhysRevD.94.034507. arXiv:1606.08798
MILC collaboration, A. Bazavov et al., MILC results for light pseudoscalars. PoS CD09, 007 (2009). https://doi.org/10.22323/1.086.0007. arXiv:0910.2966
A. Bazavov et al., Staggered chiral perturbation theory in the two-flavor case and SU(2) analysis of the MILC data. PoS LATTICE2010, 083 (2010)
G. Rupak, N. Shoresh, Chiral perturbation theory for the Wilson lattice action. Phys. Rev. D 66, 054503 (2002). https://doi.org/10.1103/PhysRevD.66.054503. arXiv:hep-lat/0201019
O. Bär, G. Rupak, N. Shoresh, Chiral perturbation theory at \(O(a^2)\) for lattice QCD. Phys. Rev. D 70, 034508 (2004). https://doi.org/10.1103/PhysRevD.70.034508. arXiv:hep-lat/0306021
Acknowledgements
We thank T. Blum, G. Herdoíza, L. Lellouch and A. Portelli for useful discussions. J.K., C.P., and A.V. thank CERN for its hospitality. A.V. also thanks BNL for its hospitality. C.P. and D.P. thankfully acknowledge support through the Spanish projects FPA2015-68541-P (MINECO/FEDER) and PGC2018-094857-B-I00, the Centro de Excelencia Severo Ochoa Programme SEV-2016-0597, and the EU H2020-MSCAITN-2018-813942 (EuroPLEx). We acknowledge PRACE for awarding us access to resource FERMI based in Italy at CINECA, Bologna, and to resource SuperMUC based in Germany at LRZ, Munich. Computer resources were also provided by the INFN (MARCONI cluster at CINECA, Bologna); a grant from the Swiss National Supercomputing Centre (CSCS) under project ID s38; the Gauss Centre for Supercomputing (GCS) – through the John von Neumann Institute for Computing (NIC) – on the GCS share of the supercomputer JUQUEEN at Jülich Supercomputing Centre (JSC); Altamira, provided by IFCA at the University of Cantabria; the FinisTerraeII machine provided by CESGA (Galicia Supercomputing Centre); and a dedicated HPC cluster at CERN. We are grateful for the technical support received by all the computer centers.
Author information
Authors and Affiliations
Corresponding author
Appendices
Chiral perturbation theory expansions
We adapt standard \(\chi \)PT expressions to our specific parametrisation of the data, stemming from our choice of chiral trajectory. We start, for example, from Eqs. (B5) and (B6) of Ref. [57], which are NLO chiral expansions of the light and strange pseudoscalar mesons \(m_{\pi }\) and \(m_{\mathrm{K}}\) in terms of light and strange quark masses \(m_1 = m_2\) and \(m_3\). These series are inverted, so that quark masses are functions of meson masses. The PCAC quark mass combinations \(m_{12} = (m_1 + m_2)/2\) and \(m_{13} = (m_1 + m_3)/2\) are then formed and everything is re-expressed in terms of the dimensionless quark masses \(\phi _{\mathrm{12}}\), \(\phi _{\mathrm{13}}\) and the dimensionless quantities \(\phi _2\) and \(\phi _4\), so that we arrive at
and
where \(B_0, f_0, L_k (k=4,5,6,8)\) are standard \(\chi \)PT parameters and
The NLO LECs \(L_i\) and \(B_0\) are implicitly renormalised at scale \(\varLambda _\chi \). It is also useful to consider the ratio
which does not require renormalisation. Note that at the symmetric point (\(\phi _2 = 2 \phi _4 /3\)) the current quark masses of Eqs. (A.1) and (A.2) respect the constraint \(\phi _{12} = \phi _{13}\), while the ratio (A.4) is exactly 1. Note that the sum of ratios
has the remarkable advantages of depending on just one combination of NLO LECs, and of being free of polynomial dependence on \(\phi _2\).
We next rewrite Eqs. (A.1) and (A.2) in forms which are suitable for combined fits, with common coefficients for \(\phi _{12}\) and \(\phi _{13}\), obtaining
where the coefficients \(p_1, p_2\), and \(p_3\) relate to LECs as follows:
We also define
with \(f_{\mathrm{\pi K}}\) given by Eq. (2.17). The chiral logarithms are
The following points should be kept in mind:
We are using only configurations along the \(\phi _4=\mathrm{constant}\) chiral trajectory. Terms proportional to \(\phi _4\) are thus reabsorbed into constant fit terms.
Our expressions are linear in fit parameters, rather than non-linear factors in which LECs appear explicitly. Determination of LECs is beyond the scope of the present work.
By replacing \(f_0^2\) by \(f_{\pi K}^2\) in the above definition of K, the coefficients of chiral logarithms are completely fixed relative to the LO value; cf. Eqs. (A.1) and (A.2). This eliminates one fit parameter, pushing its effect to NNLO LECs. In practice, the fact that terms with \(\phi _4\) are reabsorbed into the LO terms nullifies the effect in some fits, e.g., those for \(\phi _{12}\) and \(\phi _{13}\). A second advantage of this choice is that the resulting ansätze are fully linear in the fit parameters. See also Eq. (2.5) in [8] and comments therein on the reasons that \(f_0 \approx f_{\pi K}\) and for preferring \(f_{\pi K}\) to \(f_0\).
We conveniently set the renormalisation scale to \(\varLambda _\chi =1/\sqrt{8t_0} \simeq 476~\mathrm{MeV}\), simplifying the chiral logs. There is no need to reabsorb \(\ln (8t_0\varLambda _\chi ^2)\) terms in fit parameters. This is an unconventional choice, as common practice consists in providing results for LECs at \(\varLambda _\chi =m_\rho \) or \(\varLambda _\chi =4\pi f_0\). Consequently, NLO LECs eventually obtained with our methodology may only be compared to results in the literature after some extra work.
Using the above expressions and consistently neglecting higher mass orders, we obtain for the ratio (A.4) of PCAC masses
For the combination (A.5) we have
With \(\phi _4\) held constant, the quantities of Eqs. (A.6), (A.7), (A.13), and (A.14) are functions of \(\phi _2\) only. We use these expressions to fit our data, after adding \(\mathrm{O}(a^2)\) terms which model leading discretisation effects that have been neglected throughout this “Appendix”.
Discretisation effects
In order to parametrise the discretisation effects of the quantities we fit, we first examine \(\phi _{ij}\); cf. Eqs. (2.5) and (2.20). It can be written in the very general form
where \(\phi _{ij}^{\text {cont}}\) is the continuum quantity and the function f contains the discretisation effects which in general depend on the lattice spacing a, the quark masses \(m_i\), \(m_j\), and the trace of the mass matrix \(\mathrm{Tr}[M_\mathrm{q}]\). As we have discussed in Sect. 2, we will ignore \(\mathrm{O}(g_0^4 \mathrm{Tr}[M_{\mathrm{sum}}])\) discretisation effects and only consider the influence of \(\mathrm{O}(a^2)\) uncertainties. Also \(\phi _{ij}\) has to be symmetric with respect to the exchange of quarks, \(i \leftrightarrow j\). We can thus parametrise f as follows:
A further simplification is brought about by neglecting the dependence of \(c_0, \ldots , c_7\) on the bare coupling \(g_0^2\).
Next we write the function f in terms of \(\phi _2\), recalling that a constant \(\phi _4\) constrains the relation between the heavier (strange) and light quark masses. This is done by first expressing the current quark masses on the rhs of the above equation in terms of \(\phi _{12}\) and \(\phi _{13}\), followed by using their LO \(\chi \mathrm {PT}\) relations to \(\phi _2\) and \(\phi _4\). In particular, with \(\beta _0 \equiv 1/(2 B_0 \sqrt{8 t_0})\), we see from Eqs. (A.6), (A.7) that to LO:
Inserting the above LO expressions in Eq. (B.2) we obtain, after some straightforward algebra, that for two light quarks the discretisation function has the form
where \(C_k\) (with \(k = 0, 1, \dots \)) depend on the constants \(\beta _0\), \(\phi _4\), and the coefficients \(c_l\), suitably rescaled by factors of \(8 t_0\) (with \(l = 0, 1, \dots \)). Similarly, for the heavier and a light quark we obtain
Note that, although in general coefficients \(C_n\) and \({\widetilde{C}}_n\) are not the same, in the case of \(m_3 = m_1\) (symmetric point) \(f_{13} = f_{12}\) trivially.
The very fact that we have used LO \(\chi \)PT to obtain the last two expressions (cf. Eqs. (B.3)–(B.7)) allows us to drop \(\mathrm{O}(a^2 \phi _2^2)\) contributions of \(f_{12}\) and \(f_{13}\). Moreover, standard power-counting schemes in Wilson \(\chi \)PT [65, 66] suggest that terms of \(\mathrm{O}(a^2)\) enter at the same order as \(\mathrm{O}(m_\pi ^2)\), which would imply that the terms of \(\mathrm{O}(a^2 \phi _2)\) should also be dropped. We will nevertheless keep this term and explore its impact.
For the ratio of \(\phi _{12}\) and \(\phi _{13}\) we have that
We write the discretisation functions \(f_{12}\) and \(f_{13}\) as in Eqs. (B.8) and (B.9) and then express coefficients \(C_0, C_1, \widetilde{C}_0, \widetilde{C}_1, \dots \) in terms of the original coefficients \(c_i\) of Eq. (B.2). After some algebra we end up with
The coefficients \(D_0, D_1, D_2, \dots \) depend on the \(c_i\)’s. The factor \(2 \phi _4 - \phi _2\) in the discretisation term vanishes at the symmetric point \(\phi _2= 2 \phi _4/3\). This confirms that at the symmetric point the ratio \(\phi _{12}/\phi _{13}\) is 1 by construction, for any lattice spacing. In analogy to the arguments exposed above for \(f_{12}\) and \(f_{13}\), we drop the \(D_2 \phi _2^2\) term in our fits. Moreover, the variation of the denominator \((2\phi _4 - \phi _2)^2\) is relatively mild, ranging between \(\sim 2\) and \(\sim 4.6\) as \(\phi _2\) varies between \(\sim 0.1\) and \(\sim 0.8\) in our simulations. To simplify matters, we reabsorb this \(\mathrm{O}(1)\) term in re-definitions of \(D_0\) and \(D_1\).
Finally, for the combination of Eq. (A.14), we straightforwardly parametrise the discretisation errors in a way analogous to \(f_{12}\) and \(f_{13}\); see Eq. (4.5).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Funded by SCOAP3
About this article
Cite this article
Bruno, M., Campos, I., Fritzsch, P. et al. Light quark masses in \({N_\mathrm{f}=2+1}\) lattice QCD with Wilson fermions. Eur. Phys. J. C 80, 169 (2020). https://doi.org/10.1140/epjc/s10052-020-7698-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-020-7698-z