Abstract
We derive chiral Ward identities for lattice QCD with Wilson quarks and \(N_{\mathrm{f}}\ge 3\) flavours, on small lattices with Schrödinger functional boundary conditions and vanishingly small quark masses. These identities relate the axial variation of the non-singlet pseudoscalar density to the scalar one, thus enabling the non-perturbative determination of the scale-independent ratio \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) of the renormalisation parameters of these operators. We obtain results for \(N_{\mathrm{f}}=3\) QCD with tree-level Symanzik-improved gluons and Wilson-Clover quarks, for bare gauge couplings which cover the typical range of large-volume \(N_{\mathrm{f}}= 2+1\) simulations with Wilson fermions at lattice spacings below \(0.1\,\)fm. The precision of our results varies from 0.3 to 1%, except for the coarsest lattice, where it is 2%. We discuss how the \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) ratio can be used in the non-perturbative calculations of \({\mathrm {O}}(a)\) improved renormalised quark masses.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Lattice QCD with Wilson fermions is a long-established regularisation. The fermionic action satisfies most desirable properties, namely strict locality, lack of fermion doublers, and preservation of flavour symmetry in a straightforward way. Well-known shortcomings are the presence of discretisation effects linear in the lattice spacing and, most importantly, the loss of chiral symmetry. The first problem is solved by applying the Symanzik-improvement programme (see for instance Ref. [1] for a review and Ref. [2] for more details). Chiral symmetry is recovered in the continuum, at the cost of having to deal with complicated renormalisation properties for most quantities of interest (cf. Ref. [3] and references therein; for a review see also Ref. [4]). A frequently cited example of these complications is the power divergence \(m_{\mathrm{crit}} \sim 1/a\), which must be subtracted from bare quark masses before they are renormalised multiplicatively. Other examples are the normalisation parameter \(Z_{\mathrm {A}}\) of the axial current and the ratio \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) of the non-singlet scalar and pseudoscalar density renormalisation parameters. In a regularisation scheme which respects chiral symmetry, these quantities are strictly equal to unity at finite values of the UV cutoff. With Wilson fermions these quantities are scale-independent finite functions of the gauge coupling, which tend to unity as we approach the continuum limit. In principle they are determined by requiring that chiral Ward identities at non-vanishing lattice spacing tend to their formal counter-parts in the continuum limit. The scope of this paper is to provide a method for the determination of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) based on Ward identities on physically small lattices with Schrödinger functional boundary conditions and realising a line of constant physics (LCP) in parameter space. Results are obtained for \(N_{\mathrm{f}}=3\) dynamical quarks.
The general idea behind using chiral Ward identities in order to evaluate \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) for Wilson fermions appeared in Ref. [3].Footnote 1 It has been put to practice with quenched, unimproved Wilson fermions in Ref. [5] and subsequently with tree-level Symanzik-improved ones in Ref. [6]. The chiral Ward identities in question were obtained for large-volume lattices with periodic boundary conditions and non-chiral quark masses. Ratios of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) were calculated at fixed gauge coupling for several quark masses and extrapolated to the chiral limit. A second-generation of calculations was not based on Ward identities but obtained by computing \(Z_{\mathrm {S}}\) and \(Z_{\mathrm {P}}\) in the RI/MOM scheme [7]. Again these calculations are performed at finite quark masses, followed by chiral extrapolations. A well known problem in this approach is that the \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) ratio thus obtained differs from the Ward identity one by “Goldstone pole contaminations” at the IR end of a renormalisation window. This problem was first identified in Ref. [7], and subsequently discussed in Refs. [8,9,10,11] (and reviewed in Ref. [4]), while the discussion specific to the difference between Ward identity and RI/MOM determinations of the ratio \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) is found in Ref. [10]. Although the problem is greatly attenuated by the RI/SMOM variant of this method [12], the requirement of a reliable renormalisation window is inherent in these approaches.
In the present work we revisit the Ward identity method, with an important novelty: lattices with small physical volumes and Schrödinger functional boundary conditions are used, with quark flavours degenerate in mass and (almost) at the chiral limit. In doing so, we follow closely the method introduced in Ref. [13] (and originally applied in the quenched approximation in that work) for the non-perturbative determination of the scale independent normalisation parameter \(Z_{\mathrm {A}}\) of the axial vector current. Updates and optimisations of these computations can be found in refs. [14, 15] for two- and three-flavour QCD, respectively. Ward identities are imposed at constant physics to ensure a removal of \({\mathrm {O}}(a)\) effects in on-shell quantities and, at the same time, smoothly vanishing \({\mathrm {O}}(a^2)\) effects as the bare coupling is varied. It must be stressed that the chiral Ward identities adopted in these works to determine \(Z_{\mathrm {A}}\) are valid for \(N_{\mathrm{f}}\ge 2\) quark flavours, while the ones we introduce in the present work for the determination of \(Z_{\mathrm {S}}/(Z_{\mathrm {P}} Z_{\mathrm {A}})\) are valid for \(N_{\mathrm{f}}\ge 3\).
We note in passing that, based on the chirally rotated Schrödinger functional construction of Ref. [16], a more recent method for the non-perturbative computation of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) has been mentioned in Ref. [17].
This paper is organised as follows: in Sect. 2 (Sect. 2.1) we formally derive chiral Ward identities for continuum QCD, which relate correlation functions of non-singlet pseudoscalar and scalar composite operators (densities). The former are correlation functions with two operator insertions at two distinct space-time points (an axial current and a pseudoscalar density) in the presence of a generic external source operator. The latter involve a single insertion of the scalar operator. Subsequently (Sect. 2.2), we rewrite the same Ward identities in the lattice-regularised QCD with Wilson fermions. The external source consists of two standard Schrödinger functional boundary sources, each placed at a temporal boundary. The loss of chiral symmetry by Wilson fermions is taken into account by the renormalisation constants \(Z_{\mathrm {P}}\) and \(Z_{\mathrm {S}}\) of the pseudoscalar and scalar densities and the normalisation of the axial current, \(Z_{\mathrm {A}}\). In the chiral limit, these Ward identities hold up to \(\mathrm{O}(a^2)\) discretisation effects. We also discuss the corrections arising in practical simulations, which slightly deviate from the chiral limit; these are \(\mathrm{O}(am,a^2)\). Finally, in Sect. 2.3 we re-express these Ward identities in terms of traces of valence quark propagators, which multiply factored-out traces of generators of the \(SU(N_{\mathrm{f}})\) flavour group.
Section 3 takes an even closer look at these Ward identities. We distinguish several equivalence classes, each consisting of identities with different flavour structure, which reduce to the same relations between correlation functions, giving the same \(Z_{\mathrm {S}}/(Z_{\mathrm {P}} Z_{\mathrm {A}})\) result. Ward identities belonging to different equivalence classes provide \(Z_{\mathrm {S}}/(Z_{\mathrm {P}} Z_{\mathrm {A}})\) estimates which differ by \(\mathrm{O}(am,a^2)\) effects. If we neglect these effects, we can combine identities from different equivalence classes, ending up with new relations between correlation functions (true up to \(\mathrm{O}(am,a^2)\) errors). Thus we can explore to what extent different equivalence classes provide independent estimates of \(Z_{\mathrm {S}}/(Z_{\mathrm {P}} Z_{\mathrm {A}})\). Some of these estimates are expected to be noisier than others, as they are obtained using both quark-connected and quark-disconnected correlation functions.
In Sect. 4 we present our results for QCD with \(N_{\mathrm{f}}= 3\) dynamical flavours, where the lattice gauge action is tree-level Symanzik-improved and the fermion action is non-perturbatively Wilson–Clover improved. Our simulations are performed with degenerate mass flavours lying close to the chiral limit. The non-perturbative determination of the ratio \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) is carried out along a line of constant physics in parameter space. In practice, this requirement is met by ensuring a volume of almost constant spatial extent \(L\sim 1.2\,\)fm in physical units, with Schrödinger functional boundary conditions. The ratio between temporal and spatial extent T/L is also kept fixed. This implies that any remaining intrinsic ambiguities in \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) of \(\mathrm{O}(a^2)\) or higher (in the \(\mathrm{O}(a)\) improved setup adopted here) disappear smoothly towards the continuum limit. The gauge couplings of our simulations span a range typical for the computations performed by the CLS (Coordinated Lattice Simulations) effort in QCD with \(N_{\mathrm{f}}=2+1\) flavours of non-perturbatively improved Wilson fermions [18,19,20,21]. Our \(Z_{\mathrm {S}}/(Z_{\mathrm {P}} Z_{\mathrm {A}})\) results are divided out by \(Z_{\mathrm {A}}\), estimated in Ref. [22]. Our \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) estimates are subsequently extrapolated to the chiral limit at fixed \(g_0^2\). Results are obtained from several Ward identities; they differ by discretisation effects. Thus it is possible to create ratios of the different \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) determinations, and plot them against (powers of) the lattice spacing, confirming the expected scaling behaviour. The statistically and systematically most precise \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) determination is parameterised as a continuous function of \(g_0^2\), which is our final answer. This is compared to two other determinations: one is based on ratios of PCAC quark masses with different flavours, employing essentially the same small-volume Schrödinger functional setup [23]; the other is based on the relation between bare current quark masses and bare subtracted quark masses, computed on large volumes with open boundary conditions [20].
Finally, in Sect. 5 we discuss how \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) can be used in quark mass determinations along the lines proposed in Ref. [24], but performing the mass renormalisation in the Schrödinger functional scheme and the renormalisation group running non-perturbatively, between renormalisation scales \(\mu _{{\mathrm {had}}} \sim \Lambda _{{\mathrm {QCD}}}\) and \(\mu _{\mathrm {PT}} \sim M_{{\mathrm {W}}}\). Such a calculation is subjected to different systematics than the standard ALPHA-CLS method, recently applied in Ref. [25].
Work in progress culminating to this paper had been reported in Refs. [26, 27].
2 Chiral Ward identities for \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\)
In this Section we will derive chiral Ward identities which relate correlation functions of non-singlet scalar and pseudoscalar composite operators (densities). These enable us to compute non-perturbatively the ratio \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\), which determines the relative normalisation of these scalar and pseudoscalar densities when the regularisation (Wilson fermion action) breaks chiral symmetry. First we will derive the pertinent chiral Ward identities in the formal continuum theory. Subsequently, we will show their lattice analogues with Schrödinger functional boundary conditions. The resulting Ward identity computation of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) follows very closely that of \(Z_{\mathrm {A}}\), described in refs. [13,14,15].
Our notation is pretty standard. Definitions of composite operators of dimension-3, axial transformations and Schrödinger functional (SF) boundary operators are collected in Appendix A. Conventions concerning the \(su(N_{\mathrm{f}})\) flavour algebra are to be found in Appendix B. The lattice spacing is denoted by a, the (squared) gauge coupling by \(g_0^2\), and the inverse lattice coupling by \(\beta \equiv 6/g_0^2\). Bare current (PCAC) and subtracted masses are defined in Appendix C.
2.1 Formal chiral Ward identities in the continuum
Under the small axial variations (A.8) of the fermion fields the formal, continuum QCD action in Euclidean space-time transforms as follows:
The fermion mass matrix is denoted by M. We work in the flavour symmetric (isospin) limit, so all quark masses m are degenerate. In the last expression we have integrated by parts the term with the axial current. Chiral Ward identities are obtained by considering that under the change of field variables defined in Eqs. (A.6), the expectation value of any composite operator \({\mathcal {O}}\) (and products of them) is invariant. In the limit of small axial variations this leads to:
We now take the axial variations to be non zero only in a space-time region R with a smooth boundary \(\partial R\) (i.e., for \(x \in R\), \(\epsilon ^a(x) \ne 0\); otherwise \(\epsilon ^a(x) = 0\)). The above expression reduces to
We consider a product of composite operators \({\mathcal {O}}{=} P^b(y) {\mathcal {O}}_{\mathrm{ext}}\), where \(y \in R\) and \( {\mathcal {O}}_{\mathrm{ext}}\) is defined outside the region R. This implies that \(\delta _{\mathrm {A}} {\mathcal {O}}= [\delta _{\mathrm {A}} P^b(y)] {\mathcal {O}}_{\mathrm{ext}}\). The pseudoscalar density \(P^b(x)\) transforms as follows:
At this stage we impose that \(\epsilon ^c(x) = \epsilon \delta ^{ac}\); i.e., it is a constant phase \(\epsilon \) in a fixed direction a in flavour space, so that Ward identities become expressions reflecting global chiral symmetry. Moreover, in order to sidestep a number of complications,Footnote 2 we chose \(a \ne b\), so that the last term on the r.h.s. of Eq. (4) drops out.Footnote 3 Putting everything together, we obtain
We note in passing that the first term is a surface term:
As done in Ref. [13] for \(Z_{\mathrm {A}}\), we chose the region R to be the space-time volume between the hyper-planes at \(y_0-t\) and \(y_0+t\).Footnote 4 Boundary conditions in space are periodic, implying \(\int _R dx_0 d^3 x \partial _k \langle A_k \cdots \rangle = 0\). The Ward identity becomes
It is convenient to introduce a spatial integration over \(\mathbf{y}\):
The second line of the l.h.s. contains a contact term, arising when \(r \equiv |x-y| \rightarrow 0\). The operator product is expressed in terms of an OPE (recall that \(a \ne b\))
where [D] is the operator dimension and the Wilson coefficients \(C_k\) contain logarithms. The most divergent term in the OPE, taking into account the various symmetry properties of the operator product, is proportional to \(S^e(x)\). The contribution to the space-time volume integral \(2m \int _R \cdots \) of a small four-sphere of centre x and radius a (or a small four-cube of size a) is then \(\sim ~m~\int _0^a dr~r^3~r^{D-6} \langle \cdots \rangle \sim m~a^{D-2}~ \langle \cdots \rangle \) and thus the leading term in the OPE contributes \(\mathrm{O}(am)\). In the lattice regularisation this implies that the contact term contributes an \(\mathrm{O}(am)\) discretisation effect to the Ward identity, even in a Symanzik-improved setup.
2.2 Lattice Ward identities with Schrödinger functional boundary conditions
We now adapt the previous formal manipulations to the lattice regularisation with Schrödinger functional boundary conditions. The external source for the Ward identity correlation functions is chosen to be a tensor in flavour space \({\mathcal {O}}^{ad}_{\mathrm{ext}}\):
with \({\mathcal {O}}^{\prime a}\) and \({\mathcal {O}}^d\) defined in Eqs. (A.9). With this source and in lattice notation the Ward identity (8) becomes (with \(b \ne c\)):
In this expression, repeated flavour indices e are summed, as usual. The weight factor is \(w(x_0) = 1/2\) for \(x_0 = y_0 \pm t\) and \(w(x_0) = 1\) otherwise. It is introduced in order to implement the trapezoidal rule for discretising integrals. The mass m is the current quark mass defined in Eq. (C.5); recall that we work with degenerate masses.
Assuming that we work in the chiral limit (or with nearly vanishing quark masses, so that \(\mathrm{O}(am)\) effects may be safely neglected), the above Ward identity is valid up to \(\mathrm{O}(a^2)\) dicretisation errors in lattice QCD with Wilson quarks. Chiral symmetry breaking implies the (re)normalisation and improvement properties summarised in Appendix C. The Symanzik b-coefficients appearing in Eqs. (C.2)–(C.4) multiply the subtracted quark mass \(m_{\mathrm {q}}\) or the quark mass matrix \(M_{\mathrm {q}}\). When working in or close to the chiral limit, as is the case in our simulations, we may safely drop these terms. Putting everything together we obtain Ward identity (11). The renormalisation factors of the external sources \({\mathcal {O}}^{\prime a}\) and \({\mathcal {O}}^d\) are not taken into consideration, as they cancel out on both sides of the identity. Note that the term proportional to the current quark mass m may also be dropped in the chiral limit. In practice, since we are always working with masses that are not strictly zero, it turns out that it is advantageous to keep this term; see Ref. [15] and Sect. 4.1.
Equation (11) can be solved for \(Z_{\mathrm {S}}/(Z_{\mathrm {P}} Z_{\mathrm {A}})\). With \(Z_{\mathrm {A}}\) known either from other PCAC Ward identities [13,14,15] or from the chirally rotated Schrödinger functional formalism [22], we can thus obtain \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\).
2.3 Lattice Ward identities, Wick contractions, and flavour factors
Ward identity (11) relates expectation values of four composite operators on the l.h.s. to those of three composite operators on the r.h.s.; with a slight abuse of terminology, we call these four- and three-point correlation functions, respectively. We express these correlation functions, with Schrödinger functional boundary fields, in terms of traces of quark propagators. In standard ALPHA notation [28], \([\psi (y) \, \bar{\psi }(x)]_{\mathrm {F}}\) denotes a quark propagator in a fixed background gauge field configuration, where x and y are space-time points in the bulk of the lattice. Propagators from the \(x_0=0\) boundary to the bulk are \([\zeta (\mathbf{v}) \bar{\psi }(y)]_{{\mathrm {F}}}\) (with \(\mathbf{v}\) a point at the \(x_0=0\) boundary), while those from the \(x_0=T\) boundary to the bulk are \([\zeta ^\prime ({\mathbf{v}^\prime }) \bar{\psi }(y)]_{{\mathrm {F}}}\) (with \(\mathbf{v}^\prime \) a point at \(x_0=T\)). Boundary-to-boundary propagators are \([\zeta ^\prime ({\mathbf{v}^\prime }) \bar{\zeta }(\mathbf{u})]_{{\mathrm {F}}}\). For proper definitions see Ref. [28]. Note that, since we are working in the \(su(N_{\mathrm{f}})\)-symmetric limit, all masses are degenerate and quark propagators of different flavours are indistinguishable.Footnote 5
Performing the Wick contractions, we write the three-point correlation function of Eq. (11) as
where \(T^{aed} \equiv \mathrm{Tr}(T^aT^eT^d)\) are traces of three flavour \(su(N_{\mathrm{f}})\) generators and \(F_{{\mathrm {S;1}}}(y_0),F_{{\mathrm {S;2}}}(y_0)\) are expectation values of traces of quark propagators with a scalar insertion. The exact expressions can be found in Table 1. Note that traces \(\mathrm{Tr}\) act in flavour space, traces \(\,\hbox {tr}\,\) act in spin-colour space, and \(\langle \cdots \rangle \) denote averages over gauge field configurations. In Fig. 1 we show the quark-line diagrams corresponding to the spin-colour traces in the above equation. Any Wick contraction between fermion fields at the same point in the bulk \([\psi (y), \bar{\psi }(y)]_{\mathrm{F}}\), or between boundary fields at the same time-slice (e.g. \([\zeta (\mathbf{v}) \bar{\zeta }(\mathbf{u})]_{\mathrm{F}}\)) gives rise to a quark-disconnected diagram,Footnote 6 multiplied by the trace of an \(su(N_{\mathrm{f}})\) generator. As this trace is zero, such diagrams do not contribute to the three-point correlation function. An example of such a diagram is shown in Fig. 1.
In Appendix D we combine the usual \(\gamma _5\)-Hermiticity property of quark propagators, charge conjugation invariance of the lattice theory, and the trace properties of Eq. (B.4), to cast the r.h.s. of Eq. (12) into a single real term, and obtain for the r.h.s. of the Ward identity (11):
Next we concentrate on the l.h.s. of Eq. (11). For simplicity we drop, for the moment, the term proportional to the quark mass. The l.h.s. consists of boundary-to-boundary correlation functions with two insertions of dimension-3 operators in the bulk, which can be cast in the general form
Upon performing the Wick contractions, each correlation function is expressed as the sum of 9 terms. They are products of traces of flavour matrices (denoted as \(T_k^{abcd}\)) and traces of loops of quark propagators averaged over gauge field configurations (denoted as \(F_{{\mathrm {AP}};k}(x_0,y_0)\)). The former traces are defined as:
while the latter ones are also given in Table 1.
The spin-colour trace diagrams are shown in Fig. 2. We see that there are six quark-connceted diagrams, and three quark-disconnected ones. The condition \(b \ne c\) implies that \(T_9F_{{\mathrm {AP}};9}(x_0,y_0) =0\), due to the vanishing of \({\mathrm {Tr}}(T^cT^b)\). From Eq. (B.2) we see that \(T_k^{abcd}\) for \(k=7,8\) are real.
Once more we combine \(\gamma _5\)-Hermiticity, charge conjugation invariance, and Eq. (B.5), to obtain for the l.h.s. of the Ward identity (11):
Note that correlation functions \(F_{{\mathrm {AP}};k}\) are real for \(k=1, \ldots , 9\). See Appendix D for more details. We will use a somewhat more compact notation, defining
Collecting Eqs. (13), (21), and (22), we write the Ward identity (11) in the chiral limit as:
In order to keep the equation simple, we have not shown the mass-dependent terms with two pseudoscalar density insertions, appearing in Eq. (11). These terms are included in the numerical analysis, which is carried out close to, but not strictly at the chiral limit. The reader should have no difficulty convincing himself that they are exactly analogous to \(F_{{\mathrm {AP}};k}(y_0+t,y_0)\) and \( F_{{\mathrm {AP}};k}(y_0-t,y_0)\) appearing above. Their net effect is to add extra mass-dependent contributions to the \(\varDelta _k(y_0,t)\) functions. From now on, the \(\varDelta _k(y_0,t)\) functions are meant to include these contributions, proportional to the quark mass. Consequently, the uncertainty on the r.h.s. of Eq. (23) becomes \(\mathrm{O}(am,a^2)\).
It is interesting to compare the Ward identities we have derived here to the one introduced in Ref. [13] for the determination of \(Z_{\mathrm {A}}\). The former are valid for \(N_{\mathrm{f}}\ge 3\), while the latter for \(N_{\mathrm{f}}\ge 2\). The Ward identity of Ref. [13] involves correlation functions with two axial current insertions in the bulk. In our case we have more complicated contributions, consisting of time-differences of correlation functions with one axial current and one pseudoscalar density insertion.
3 Determination of \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\) from Ward identities
Ward identity (23) is a master equation, from which a plethora of relations arise for specific choices of flavour indices a, b, c, d. In what follows, each of them will be distinguished by the label WI(abcd). Not all of them are suitable for the determination of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\). The following constraints need to be imposed:
- (i)
\(b \ne c\); this ensures the suppression of the scalar term in Eq. (4);
- (ii)
\(d^{bce} \ne 0\) and \(d^{ade} \ne 0\), so that the r.h.s. of Eq. (23) does not vanish. Note that once b, c are fixed, property A in Appendix B ensures that \(d^{bce} \ne 0\) for a single value of e. Thus the summation over e on the r.h.s. of our master equation is trivial and the requirement \(d^{bce} d^{ade} \ne 0\) is satisfied for at most a single value of e;
- (iii)
\(f^{bce} = 0\) for the choice of indices b, c, e for which \(d^{bce} \ne 0\); \(f^{ade} = 0\) for the choice of indices a, d, e for which \(d^{ade} \ne 0\). This follows from property B in Appendix B.
In spite of these constraints, a lot of freedom remains in the choice of flavour indices, resulting in many Ward identities. They are relations between the correlation functions of the master equation, which can be solved for \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\). These Ward identities can be grouped into different equivalence classes. Each class consists of several identities WI(abcd) with different flavour indices \(a,\ldots ,d\), but identical flavour factors \(\mathrm{Re}\,(T_k)\) (\(k=1,3,5,7,8\)), and thus the same Eq. (23). Therefore, the same \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\) estimate is obtained from all Ward identities of the same equivalence class. Estimates of \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\) from Ward identities of different classes differ by discretisation effects.
The combinations of conditions (i)–(iii) simmer down to the choice of flavour indices (a, b, c, d), with \(b \ne c\), such that \(d^{bce} d^{ade}\ne 0\). We systematically investigated the choices of flavour indices which fulfill these conditions with a computer algebra program and grouped them into the equivalence classes which are tabulated in Table 2. These results depend on the \(su(N_{\mathrm{f}})\) Gell–Mann matrix definitions of Appendix B. Some interesting observations are:
There are pairs of equivalence classes that have the same number of elements. Examples are WI(1245) paired to WI(1425), WI(1144) paired to WI(1414) etc. These pairs of classes are separated by a single horizontal line in Table 2. Class WI(1468) does not have a partner.
The flavour factors \(\mathrm{Re}\,(T_k)\) for \((k = 1,3,5)\), \(T_7\), and \(T_8\) of paired classes have closely related numerical values; see Table 3. We will see below how this leads to useful relations between certain \(\varDelta _k\) functions.
The quark disconnected traces \(\varDelta _7\) and \(\varDelta _8\) do not contribute to the equivalence classes of the top half of Table 2 (separated by a triple line from the bottom half).
In Table 3 we collect the flavour factors \(\mathrm{Re}\,(T_k)\) (\(k=1,3,5\)), \(T_7\), and \(T_8\) for each class. Depending on the choice of flavour indices a,b,c,d, some of these flavour factors vanish. This simplifies the resulting Ward identity. Also here the top part of the Table (separated by a double line from the bottom half) lists the Ward identities without \(\varDelta _7\)- and \(\varDelta _8\)-type contributions.
There are two possible ways of using the 11 Ward identities of Table 3. A first approach would be to determine \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\) from each of the 11 variants of Eq. (23). In principle these determinations differ by \(\mathrm{O}(am,a^2)\) effects and that should provide a handle for a good control of the related systematics. However, in practice the different \(Z_{{\mathrm {S}}}/(Z_{{\mathrm {P}}}Z_{{\mathrm {A}}})\) results are all obtained from the same configuration ensembles and are thus strongly correlated. Moreover, paired Ward identities (in the sense discussed above; cf. Table 2) have very similar relations between their \(\varDelta _k\)-terms and this also leads to very similar Z-ratios.
A second approach would be to combine these Ward identities in order to first obtain relations between the various \(\varDelta _k\)-terms. These would be true up to \(\mathrm{O}(am,a^2)\) at fixed gauge coupling, and once established, would simplify the equation(s) relating \(Z_{{\mathrm {S}}}/(Z_{{\mathrm {P}}}Z_{{\mathrm {A}}})\) to the \(\varDelta _k\)’s. In this spirit we proceed as follows:
- (i)
Starting from Ward identities without quark disconnected contributions (i.e., with \(\mathrm{Re}\,(T_7) = \mathrm{Re}\,(T_8)=0\); top part of Table 3), we combine the pair WI(1245) and WI(1425) to obtain:
$$\begin{aligned}&\varDelta _1(y_0,t) = \varDelta _5(y_0,t) +\mathrm{O}(am,a^2) \,, \end{aligned}$$(24)$$\begin{aligned}&Z_{\mathrm {A}} Z_{\mathrm {P}} a^3 \big [ \varDelta _1(y_0,t) - \varDelta _3(y_0,t) \big ] \nonumber \\&\quad = - Z_{\mathrm {S}} \mathrm{Re}\,\big [ F_{{\mathrm {S;1}}}(y_0) \big ] + \mathrm{O}(am,a^2) \,. \end{aligned}$$(25)Note that by combining the pair WI(1486) and WI(1846) we also obtain the above expressions, so this pair does not provide extra information.
- (ii)
WI(1468), which has no partner, is written, in terms of the \(\varDelta \)’s defined in Eq. (22), as:
$$\begin{aligned} \begin{aligned}&Z_{\mathrm {A}} Z_{\mathrm {P}} a^3 \big [ \varDelta _1(y_0,t) - 2 \varDelta _3(y_0,t) + \varDelta _5(y_0,t) \big ] \\&\quad = -2 \, Z_{\mathrm {S}} \mathrm{Re}\,\big [ F_{{\mathrm {S;1}}}(y_0) \big ] + \mathrm{O}(am,a^2) \,. \end{aligned} \end{aligned}$$(26)This on its own determines the ratio \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\). Note that combined with Eq. (24), it gives us Eq. (25). Our conclusion is that all Ward identities with \(\mathrm{Re}\,(T_7) = \mathrm{Re}\,(T_8)=0\) reduce to the equality \(\varDelta _1 = \varDelta _5\) (i.e., diagrams \(F_{{\mathrm {AP}};1}\) and \(F_{{\mathrm {AP}};5}\) of Fig. 2 are related) and a single Ward identity, from which \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\) may be computed.
- (iii)
Passing to Ward identities with quark-disconnected contributions (bottom part of Table 3), we combine the pair WI(1188) and WI(1818) to obtain:
$$\begin{aligned}&\varDelta _7(y_0,t) = \varDelta _8(y_0,t) +\mathrm{O}(am,a^2) \,, \end{aligned}$$(27)$$\begin{aligned}&Z_{\mathrm {A}} Z_{{\mathrm {P}}} a^3 \big [ 2 \varDelta _1(y_0,t) + \varDelta _3(y_0,t) + 3 \varDelta _7(y_0,t) \big ] \nonumber \\&\quad = -2 Z_{{\mathrm {S}}} \mathrm{Re}\,\big [ F_{{\mathrm {S;1}}}(y_0) \big ] + \mathrm{O}(am,a^2) \,, \end{aligned}$$(28) - (iv)
Similarly, the pair WI(1144) and WI(1414) combine to give
$$\begin{aligned}&\varDelta _1(y_0,t) + 2 \varDelta _7(y_0,t) \nonumber \\&\quad =\varDelta _5(y_0,t) + 2 \varDelta _8(y_0,t) + \mathrm{O}(am,a^2) \,, \end{aligned}$$(29)$$\begin{aligned}&Z_{\mathrm {A}} Z_{\mathrm {P}} a^3 2 \big [ \varDelta _1(y_0,t) + \varDelta _3(y_0,t) + 2 \varDelta _7(y_0,t) \big ] \nonumber \\&\quad = -2 Z_{\mathrm {S}} \mathrm{Re}\,\big [ F_{{\mathrm {S;1}}}(y_0) \big ] + \mathrm{O}(am,a^2) \,. \end{aligned}$$(30)Eq. (29) carries no new information, as it is a combination of Eqs. (24) and (27).
- (v)
If we now combine Eqs. (28) and (30), we obtain again Eq. (25) and the new relation
$$\begin{aligned} \varDelta _3(y_0,t) = - \varDelta _7(y_0,t) +\mathrm{O}(am,a^2) \,. \end{aligned}$$(31)
The bottom line is that, up to \(\mathrm{O}(am,a^2)\) discretisation effects, the 11 Ward identities corresponding to the entries of Table 3 are not all independent. They can be combined to give three relations between the functions \(\varDelta _k\), which depend on traces of valence quark propagators, without references to flavour traces; these are Eqs. (24), (27), and (31).Footnote 7 The extent to which these relations are fulfilled at non-zero lattice spacing is an indicator of the size of discretisation effects. Moreover, if we take them at face value, the remaining Ward identities (25), (26), (28), and (30) reduce to a single expression. Any of them can be used to provide estimates of the ratio \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\). We expect Eqs. (28), and (30) to be noisier, as they involve quark-disconnected diagrams. Eq. (25) seems promising, as it only involves \(\varDelta _1\) and \(\varDelta _3\), but it cannot be excluded a priori that Eq. (28) turns out to be better behaved. This can only be decided by numerical investigation.
Of course, these considerations do not exhaust all possibilities. Any linear combination of the Ward identities considered above, possibly combined with the relations (24), (27), (31), can be used for the computation of \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\). For example, the linear combination \({\mathrm{L}}_1\equiv \) [WI(1245)−WI(1425)], combined with Eq. (24) gives:
The determination of \(Z_{{\mathrm {S}}}/(Z_{{\mathrm {P}}}Z_{{\mathrm {A}}})\) from the above depends only on quark-connected diagrams. Similarly, the linear combination \({\mathrm{L}}_2 \equiv \) [12WI(1818)−8WI(1414)] gives:
which yields a \(Z_{\mathrm {S}}/(Z_{\mathrm {P}}Z_{\mathrm {A}})\) estimate from quark-connected and quark-disconnected diagrams. The last two expressions will be used in the following for numerical crosschecks.
4 Numerical setup and results
We investigate the proposed Ward identities on lattices with tree-level Symanzik improved gluons and Wilson-Clover quarks. The action coincides with the one used by CLS [18, 20, 21]. We employ Schrödinger functional boundary conditions in time, which enable us to simulate at quark masses close to the chiral point and control systematic effects related to the massless renormalisation framework. The details of this aspect are discussed in Sect. 4.1. Similar to the procedure in [15], we construct boundary-to-boundary three- and four-point functions with pseudoscalar Schrödinger functional wall sources and use wavefunctions at the boundaries as explained in [29]. The statistical error analysis is performed using a python implementation of the \(\Gamma \)-method [30] (exploiting information from the autocorrelation function) with automatic differentiation [31].
The gauge ensembles used in this study are detailed in Table 4. They coincide with the ones used in [23] but for the ensemble C1k1. These are essentially the ensembles used in [15, 29] plus the ensembles A1k3, A1k4, B1k4, C1k1, D1k2 and D1k4, which were added to improve the chiral fits. For the two ensembles E1k1 and E1k2 the number of molecular dynamics units was increased by factor of more than 4. The ensembles with volume \(L^3\times T\) described above are designed to lie on a line of constant physics (LCP), where the spatial extent of \(L \approx 1.2\,\mathrm{fm}\) and \(T/L\approx 3/2\) are almost constant. The Ward identity conditions which fix the ratio \(Z_{{\mathrm {S}}}/(Z_{{\mathrm {P}}} Z_{{\mathrm {A}}})\) are imposed at constant physics, i.e., we require that all length scales in the correlation functions, which define a given condition formulated through one of the foregoing Ward identities, are kept fixed in physical units. Once this requirement is satisfied, only the lattice spacing a changes as \(g_0\) is varied. Consequently, renormalisation constants (as well as their ratios) extracted from different constant physics conditions are expected to rapidly approach an almost unique function of \(g_0\) as \(g_0\rightarrow 0\). For a more general discussion of the constant physics idea in a similar context see, e.g., Ref. [32].
The initial tuning of this LCP was done based on the (universal) 2-loop beta-function as explained in Ref. [29]. Thus the volume of the lattices varies by \(\approx 10\)% over the range of couplings considered. However, using the results of Ref. [19], we verified that this deviation is proportional to the lattice spacing a and thus contributes to our quantity of interest only as a higher-order ambiguity.Footnote 8
The simulations in this work suffer from critical slowing down of the topological charge for smaller lattice spacings. This phenomenon, often dubbed “topology freezing”, could give unreliable results due to an insufficient sampling of topological sectors. We circumvent this problem by reweighting all data to the trivial topological sector \(Q = 0\) at the cost of decreasing the effective number of configurations; see [29, 34] for a discussion. Furthermore we increase the statistical uncertainties by attaching a tail to the integrated autocorrelation functions as proposed in [35]. As measure for \(\tau _{{\mathrm {exp}}}\), the autocorrelation time of the slowest mode in the simulation, we use the integrated autocorrelation time of the squared topological charge \(Q^2\) extracted from the longest Monte Carlo chain for each value of \(\beta \). The \(\tau _{{\mathrm {exp}}}\)-values for the individual ensembles can be found in Table 4.
In order to solve the Ward identity for \(Z_{{\mathrm {S}}}/Z_{{\mathrm {P}}}\) we need non-perturbative knowledge of the non-singlet axial current renormalisation constant \(Z_{\mathrm {A}}\) and the \({\mathrm {O}}(a)\) improvement coefficient \(c_{\mathrm {A}}\). The constant \(Z_{\mathrm {A}}\) was calculated on a subset of the gauge configurations in this work, Ref. [15], as well as in the chirally rotated Schrödinger functional, Ref. [22], which is a completely different determination. We prefer the results from the latter because of their smaller statistical uncertainties. The errors of \(Z_{\mathrm {A}}\) are accounted for in quadrature when solving for \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) in our Ward identity expressions. For \(c_{\mathrm {A}}\) we use the results of [29], without error, following standard practice.
In principle the ratio we would like to determine, as well as all correlation functions involved, depend on the \({\mathrm {O}}(a)\) improved coupling \(\tilde{g}_0^2=g_0^2[1+ab_{\mathrm {g}}{\mathrm {tr}}\,M_{\mathrm {q}}/N_{\mathrm {f}}]\), where the coefficient \(b_{\mathrm {g}}\) is only known at 1-loop perturbation theory [2]. This issue is of no relevance here, as all normalisation conditions are imposed at zero quark mass. However, this should be kept in mind when using results obtained here in a different setting with non-vanishing sea quark masses.
In order to study the scaling behaviour of some of our results, we need the lattice spacings in physical units at the bare couplings used in this work. In Ref. [19], such values are provided for couplings close to those in Table 4; these enable us to extract the lattice spacings at our gauge couplings using a polynomial interpolation.
As additional cross checks we investigate the non-perturbative validity of the identities (24), (27) and (31). The results can be found in Appendix E.
4.1 Chiral extrapolation
From the plethora of possible renormalisation conditions listed in Sect. 3, we single out a class labeled WI(1468) to which only quark connected diagrams contribute and for which the statistical precision is best. We detail the analysis for this specific choice, but the same steps also apply to any other identity discussed in the following.
In order to obtain \(Z_{{\mathrm {S}}}/Z_{{\mathrm {P}}}\) at vanishing quark mass, we extra- or interpolate the data at fixed bare coupling to the chiral point. For this procedure we employ the \({\mathrm {O}}(a)\) improved PCAC mass, which we average over the central third of the temporal extent of the lattice, similarly to what was done in Ref. [23]. This choice keeps the plateau length approximately constant in physical units. For the insertion times in the master Eq. (23), we chose \(y_0=T/2\) and \(t=T/6\) rounded up to the closest integer.Footnote 9 The idea behind this choice is to place the operators as far away from the temporal boundaries as possible, so as to suppress boundary induced cutoff effects, while keeping the individual operators apart from each other, thus avoiding contact terms.
In Fig. 3 we show the chiral extrapolation of our preferred determination WI(1468), at \(\beta =3.676\), where quark masses cover a large range in lattice units. We compare results obtained from the Ward identity with and without the mass term [i.e., the term with two pseudoscalar insertions in Eq. (11)]. We see that in the “massive” case our results display a linear behaviour in the whole mass range. In addition statistical uncertainties are smaller and the data show an almost flat dependence on am, resulting to a more reliable chiral extrapolation. Therefore, we obtain \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) in the chiral limit by fitting linearly the results of the “massive” case. For this fit we employ orthogonal distance regression [36] which takes into account not only errors in the dependent, but also in the independent variable. The error obtained from this procedure for the chirally extrapolated \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) is in general larger compared to the one obtained from a standard least squares fit. Results for the individual ensembles as well as the chiral extrapolations are summarised in Table 5, which will be discussed in Sect. 4.2.
4.2 Scaling
In Table 3 we have listed 11 classes of distinct Ward identities; each of them is a different relation between correlation function differences \(\varDelta _k\) \((k=1,3,5,7,8)\) and \(F_{{\mathrm {S}};1}\), from which \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) may be obtained. In Fig. 4 we show these determinations in the chiral limit as functions of the gauge coupling \(g_0^2\). It is evident, as argued in Sect. 3, that there are very strong correlations between results obtained on the same configuration ensembles from “similar” Ward identity classes, as grouped in Table 2.
We are thus led to select, from the plethora of Ward identities, four representative determinations of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\). Two of these involve only quark connected diagrams. These are WI(1245) and the linear combination \({\mathrm {L}}_1\), leading to Eq. (32). The other two determinations involve both quark connected and disconnected diagrams and are therefore numerically more challenging. Here we chose WI(4488), and the linear combination \({\mathrm {L}}_2\), leading to Eq. (33). The results for each ensemble and in the chiral limit are shown in Table 5.
To evaluate the relative cutoff effects among our different results, we form ratios of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\), obtained from each of the four determinations described above, to \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) from our preferred identity WI(1468). We investigate the lattice spacing dependence of each of these four ratios which, in our Symanzik-improved setup, consists of powers of \(a^2\) and higher. The ratios are known to tend to unity in the continuum limit. We therefore fit them with polynomials in the lattice spacing, constrained to be 1 at the origin. Results are displayed in Fig. 5. The top panel of the figure displays results from the first two determinations, without quark disconnected contributions.
The deviations from 1 in the ratio WI(1245)/WI(1468) are very mild and can be described by a single term quadratic in the lattice spacing with \(\chi ^2/\text {d.o.f}=0.474\). For the ratio \({\mathrm {L}}_1/\)WI(1468) the deviation from 1 as well as the statistical uncertainties are larger. A glance at Fig. 5 should convince the reader that the data cannot be described by a single-parameter fit with a quadratic term. Fitting with \(1 + c_2 a^2 + c_3 a^3\) results to \(c_2 = -9.3(4.3)\), \(c_3 = 303 (71)\) and \(\chi ^2/\text {d.o.f}=0.138\). A one-parameter fit with a term proportional to \(a^3\) gives \(c_3 = 169(22)\) with \(\chi ^2/\text {d.o.f}=0.775\); this is the curve shown in Fig. 5. The bottom panel of Fig. 5 displays results from the determinations with quark disconnected contributions. Again it is obvious that none of the data displays a pure \(a^2\)-dependence. Fitting the ratio WI(4488)/WI(1468) with \(1 + c_2 a^2 + c_3 a^3\) results to \(c_2 = -26(28)\), \(c_3 = 911 (410)\) and \(\chi ^2/\text {d.o.f}=0.494\); note that \(c_2\) is compatible with zero. Fitting by \(1 + c_3 a^3\) gives \(c_3 = 567(131)\) and \(\chi ^2/\text {d.o.f}=0.511\); this is the fit shown in the Figure. For the ratio \({\mathrm {L}}_2/\)WI(1468) we again fit with two parameters, one quadratic and one cubic in the lattice spacing, obtaining \(c_2 = -7.8(4.6)\), \(c_3 = 211(68)\) and \(\chi ^2/\text {d.o.f}=1.719\). The relatively large value for \(\chi ^2/\text {d.o.f}\) can be traced to the data point at the coarsest lattice spacing. All four cases conform with the theoretical expectation of \({\mathrm {O}}(a^2)\) ambiguities or higher. We did not find any evidence for \({\mathrm {O}}(a)\) cutoff effects; trying to fit an additional term proportional to a gives coefficients which are zero within errors.
4.3 Interpolation formula
To facilitate the use of our \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) results in large volume simulations, we provide an interpolation formula for lattice spacings \(0.04\,\)fm\(\,\lesssim a \lesssim 0.1\,\)fm. Having tried several fit ansätze, we opt for a Padé interpolation constrained by the 1-loop value [37] of the form
with the covariance matrix
and \(\chi ^2/\text {d.o.f.}=0.169\).
As the functional form in the non-perturbative coupling region is in principle unknown, we investigated the significance of systematic effects by also experimenting with alternative forms of interpolating functions (such as higher-order Padés, exponentials and polynomials), constrained to monotonically approach the 1-loop perturbation theory result. However, among those describing our results reliably (as signaled by an acceptable \(\chi ^2/\text {d.o.f.}\)) practically coincide with the interpolation (34) in the fitted range of couplings, so that the associated systematic errors are negligible compared to the statistical ones. Therefore, we only account for systematic uncertainties when extrapolating with Eq. (34) to values slightly outside the fitted range by adding a systematic error of 50% of the size of the statistical one in quadrature. This prescription is applied at \(\beta =3.85\), which corresponds to the finest lattice spacing simulated by the CLS effort.
The WI(1468) results with the interpolation are shown in Fig. 6, where they are also compared to the prediction of 1-loop perturbation theory. The vertical dashed lines mark the bare couplings used in CLS simulations, to which we want to interpolate our results. Results for \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) at the \(g_0^2\)-values used in \(N_{\mathrm {f}} = 2 + 1\) CLS simulations are given in Table 6.
4.4 Comparison with previous works
We are not aware of any direct determinations of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) in our specific setup, but we can compare our findings, using existing results for the quark mass renormalisation constant \(Z \equiv Z_{\mathrm {P}}/(Z_{\mathrm {S}} Z_{\mathrm {A}})\). The idea is to compute \(Z_{\mathrm {S}}/Z_{\mathrm {P}}=(Z Z_{\mathrm {A}})^{-1}\), with Z from either Ref. [20] or Ref. [23], and \(Z_{\mathrm {A}}\) from Ref. [22]. In Ref. [20], Z has been computed on large-volume CLS ensembles, from the relation between PCAC quark masses \(m_{ij}\) and subtracted quark masses \(m_{{\mathrm {q}},ij}\) (see Sect. 5 and Appendix C for these mass definitions). The Z-results in Ref. [23] were obtained on almost the same gauge ensembles used in this workFootnote 10 at small volumes and nearly-chiral sea quark masses. The method of Ref. [23] is based on suitable combinations of renormalised quark masses, defined both through the PCAC relation and the subtracted bare mass, evaluated in the \({\mathrm {O}}(a)\) improved theory with non-degenerate valence quarks, including all necessary counterterms. Results are quoted for two different lines of constant physics labeled LCP-0 and LCP-1, which differ by the values at which the quark masses in the valence sector are kept fixed as \(g_0\) is varied.
We compute the ratio of \(1/(Z Z_{\mathrm {A}})\) from Refs. [20] and [23] to \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) from our preferred WI(1468). We investigate the lattice spacing dependence of this ratio, which consists of powers of \(a^2\) and higher, and tends to unity in the continuum limit. The results are plotted in Fig. 7. Polynomial fits are performed on the LCP-0 and LCP-1 ratios, excluding the data of the coarsest ensembles, which display poor scaling behaviour and large errors. A two-parameter fit of the form \(1+ c_2 a^2 + c_3 a^3\) results to \(\chi ^2/\text {d.o.f} = 0.281\), \(c_2=-2.5(3.7)\) and \(c_3=242(57)\) for LCP-0, and \(\chi ^2/\text {d.o.f} = 0.166\), \(c_2=1.5(2.8)\) and \(c_3=148(45)\) for LCP-1, in both cases \(c_2\) is consistent with zero. We thus prefer to plot the results as functions of \(a^3\) in Fig. 7, where we also show a one-parameter fit of the form \(1+ c_3 a^3\); for this ansatz we obtain \(\chi ^2/\text {d.o.f} = 0.300\), \(c_3 = 206(14)\) for LCP-0 and \(\chi ^2/\text {d.o.f} = 0.170\), \(c_3 = 169(12)\) for LCP-1.Footnote 11 We interpret this as confirmation that the two methods are compatible w.r.t. the expected lattice spacing ambiguities and that the effects of \(\mathrm{O}(a^2)\) are sub-dominant compared to the next higher order.
Let us briefly comment on the possible benefits of the respective results on \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) collected in Table 6, originating from the different approaches underlying Ref. [23] and this work. First, one observes comparable uncertainties between the two. While the method of that reference involves combinations of simpler and thus typically less noisy correlation functions (i.e., with only one operator insertion in the bulk) as well as an accurate computation of the valence quark mass dependence prior to the chiral extrapolations, our estimates on \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) from the more direct Ward identity approach followed here exhibit an overall flatter and, at larger couplings, less steep \(g_0^2\)-dependence. This points to generically smaller cutoff effects so that continuum extrapolations of quantities where it enters may be expected to become better controlled and more precise in the long run, because they are also less affected by unpleasantly significant admixtures of higher-order cutoff effects.
The results for Z presented in [20], stemming from large-volume calculations on a subset of the CLS ensembles, are only available at two values of the bare coupling, which do not coincide with the couplings investigated in this work. In order to compare with our results we make use of the interpolation formula Eq. (34). Although the estimates for Z from Ref. [20] are only available at two values of the bare coupling and we hence do not attempt a fit in this case, we notice that they are compatible with LCP-0.
In summary, comparison with earlier works is consistent with the expectation that all ambiguities between different determinations of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) show a scaling according to \(\mathrm{O}(a^2)\) or higher. However, the size of these ambiguities is quite large and may still have a relevant impact on applications as described in the next Section.
5 Application: quark mass computations with Wilson fermions
We will now discuss a method of computing quark masses with Wilson fermions which uses the ratio \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\).
First we review the well-established “PCAC quark mass method”. It is the conventional ALPHA Collaboration approach, which relies on the PCAC definition of quark masses \(m_{ij}\) of Eq. (C.7). These bare current masses are computed on large physical volumesFootnote 12 and for a range of couplings typical of hadronic, low-energy scales \(\mu _{{\mathrm {had}}} \sim \Lambda _{{\mathrm {QCD}}}\). Although we keep our notation as general as possible, for concreteness we consider a theory with \(N_{\mathrm{f}}= 2+1\) dynamical fermions; i.e. the two lightest flavours are degenerate in mass while the third flavour is heavier (\(m_{\mathrm{q,1}} = m_{\mathrm{q,2}} < m_{\mathrm{q,3}}\)).
We see from Eq. (C.8) that the renormalised light mass is given by
The ratio of the heavy to light renormalised masses is also derived from the above expression:
Knowing the renormalised light mass from Eq. (35), and the ratio of the heavy and light renormalised masses from Eq. (36), the up/down and strange masses are obtained [19, 25]. So in principle this method requires:
- 1.
The axial current normalisation \(Z_{\mathrm {A}}(g_0^2)\) and the renormalisation constant \(Z_{\mathrm {P}}(g_0^2,\mu _{{\mathrm {had}}})\) of the non-singlet pseudoscalar density; the latter carries the renormalisation scheme and scale dependence of the continuum quark mass. In our \(N_{\mathrm{f}}=3\) setup, these may be found in Refs. [22] and [42], respectively.
- 2.
The Symanzik-improvement coefficients \((b_{\mathrm{A}}- b_{\mathrm{P}})\) and \((\bar{b}_{{\mathrm {A}}} - \bar{b}_{{\mathrm {P}}})\). Non-perturbative \((b_{\mathrm{A}}- b_{\mathrm{P}})\)-estimates in our setup may be found in Ref. [23]. Note that in perturbation theory \((\bar{b}_{{\mathrm {A}}} - \bar{b}_{{\mathrm {P}}}) \sim {\mathrm{O}}(g_0^4)\), so that the term proportional to this coefficient is habitually dropped.
- 3.
It is also noteworthy that Eq. (36) does not require knowledge of \(\kappa _{{\mathrm {crit}}}\), which is however needed in \(m_{{\mathrm {q}},12}\) and \(\hbox {Tr}(M_{{\mathrm {q}}})\) in Eq. (35). We shall return to this point in Sect. 5.1.
Based on the results of Ref. [43] for Symanzik-improved quark masses with Wilson fermions, an alternative approach, known as the “ratio-difference method”, has been proposed in Ref. [24]. The renormalised quark mass difference is given by
Knowing the renormalised mass difference from Eq. (37), and the ratio of the heavy and light renormalised masses from Eq. (36), the up/down and strange masses are obtained. So in principle this method requires:
- 1.
The renormalisation constant \(Z_{\mathrm {S}}(g_0^2,\mu _{{\mathrm {had}}})\) of the non-singlet scalar density, which carries the renormalisation scheme and scale dependence of the continuum quark mass.
- 2.
The Symanzik-improvement coefficients \((b_{\mathrm{A}}- b_{\mathrm{P}})\), \(b_m\) and \(\bar{b}_m\). Non-perturbative estimates of the \(b_m\)-coefficient in this setup may be found in Ref. [23].Footnote 13 Since \(\bar{b}_m \sim {\mathrm{O}}(g_0^4)\), the term proportional to \(\hbox {Tr}(M_{\mathrm {q}})\) is habitually dropped.
- 3.
The critical hopping parameter \(\kappa _{\mathrm {crit}}\) is needed in \(m_{{\mathrm {q}},13}\) and \(\hbox {Tr}(M_{\mathrm {q}})\) in Eq. (37). We shall return to this point in Sect. 5.1.
We have outlined the basic idea behind the PCAC quark mass method and the ratio-difference method, listing the renormalisation parameters and improvement coefficients required by each one. The most crucial difference is that in the PCAC quark mass method all bare masses are given in terms of the current masses \(m_{12}\) and \(m_{13}\), which are renormalised by \(Z_{\mathrm {P}}^{-1} Z_{\mathrm {A}}\), while in the ratio-difference method the bare mass difference is the exactly known \([m_{{\mathrm {q}},3} - m_{{\mathrm {q}},1}]\), which is renormalised by \(Z_{\mathrm {S}}^{-1}\). It is not possible to determine \(Z_{{\mathrm {S}}}\) with a Schrödinger functional renormalisation condition analogous to that introduced in Ref. [44] for \(Z_{{\mathrm {P}}}\). The latter involves correlation functions with a pseudoscalar source at the boundary (see Eq. (A.9)) and the pseudoscalar scalar operator at the bulk. If we place a scalar operator at the bulk, keeping the pseudoscalar boundary source, the correlation function vanishes due to parity. Nor is it possible to have a scalar source at the boundary and the scalar density at the bulk, since this would result in the product \(P_+ P_-\) of the projection operators of the boundary quarks and the vanishing of the correlation function. An option would be to impose a renormalisation condition on the correlation function \(\langle {\mathcal {O}}^{\prime a} \, S^b(x) \, {\mathcal {O}}^c \rangle \), with the two pseudoscalar boundary sources \({\mathcal {O}}^{\prime a}\) and \({\mathcal {O}}^c\) and the scalar operator \(S^b\) in the bulk. This would be an acceptable intermediate scheme of the Schrödinger functional variety, but different than the one introduced in Ref. [44] for \(Z_{{\mathrm {P}}}\). Thus, the renormalised quark masses \(m_{1{\mathrm{R}}},m_{3{\mathrm{R}}}\) obtained by combining Eqs. (35) and (36) (PCAC quark mass method with \(Z_{{\mathrm {P}}}\)) would be in a different scheme than those obtained from Eqs. (37) and (36) (difference-ratio method with \(Z_{{\mathrm {S}}}\)). Only results obtained for the scheme-independent renormalisation group invariant (RGI) masses from the two methods would be comparable. This comparison would be very useful but cumbersome, as it requires the computation from scratch of the step scaling function in the new intermediate scheme, from ratios of \(Z_{{\mathrm {S}}}\)’s at fixed renormalised coupling and two different renormalisation scales, and for a range of couplings.
Given the above considerations, we are led to define the scalar operator renormalisation parameter through:
This is our definition of the Schrödinger functional renormalisation scheme for the scalar non-singlet operator. The \(Z_{{\mathrm {S}}}/Z_{{\mathrm {P}}}\)-ratio on the r.h.s. is scale independent, being determined from Ward identities. Clearly, scalar and pseudoscalar densities have the same renormalisation group running properties (i.e., the same anomalous dimensions, the same step scaling functions in the continuum, etc.). So knowledge of the \(Z_{{\mathrm {S}}}/Z_{{\mathrm {P}}}\) ratio enables us to obtain the light and heavy quark masses in the usual Schrödinger functional scheme [44], but with a different method based on mass differences (and \(Z_{{\mathrm {S}}}\)) combined with scale-independent PCAC mass ratios. The novel renormalisation and improvement patterns provide an important handle for the control and reduction of systematic effects related to the non-perturbative determination of renormalisation parameters and discretisation errors.Footnote 14 What is common in both methods is the renormalisation group running that takes us non-perturbatively from renormalised masses at low energy scales \(\mu _{{\mathrm {had}}}\) to masses at large, perturbative scales \(\mu _{\mathrm {PT}} \sim M_{{\mathrm {W}}}\), as described in Ref. [44]. For recent results on the running of quark masses in \(N_{\mathrm{f}}= 3\) QCD see Ref. [42].
5.1 Subtracted masses, PCAC masses, and redefined Symanzik counterterms
We will close this section by reviewing how, in both methods, we can circumvent the need to use \(\kappa _{\mathrm {crit}}\) in the Symanzik counterterms of Eqs. (35) and (37), which feature subtracted masses \(am_{{\mathrm {q}},ij}\) and \(\mathrm{Tr}[aM_{\mathrm{q}}]\). This can be avoided by substituting these subtracted masses with current quark masses. Their relation is given by [43],
where \(Z(g_0^2)\equiv Z_{{\mathrm {P}}}/(Z_{{\mathrm {S}}} Z_{{\mathrm {A}}})\) and \(r_m \equiv Z_{{\mathrm {S}}}/Z_{{\mathrm {S}}^0}\) are finite normalisations (\(Z_{{\mathrm {S}}^0}\) is the renormalisation parameter of the singlet scalar density). In the above we neglect \(\mathrm{O}(a)\) terms, as they only contribute to \(\mathrm{O}(a^2)\) in the b-counterterms of Eqs. (35) and (37). Substituting \(am_{{\mathrm {q}},ij}\rightarrow am_{ij}\) in these expressions, we obtain respectively
and
where we define
Thus, \(am_{{\mathrm {q}},ij}\) and \(\kappa _{{\mathrm {crit}}}\) in Eqs. (35) and (37) have been traded off for \(m_{ij}\), Z, and \(r_m\). Accurate non-perturbative estimates of Z, \((b_{\mathrm{A}}-b_{\mathrm{P}})\), and \(b_m\) in our \(N_{\mathrm{f}}=3\) setup have been reported in Ref. [23]. The term multiplying \(M_{\mathrm{sum}}\) contains \((1-r_{m})/r_m\) and \((\bar{b}_{{\mathrm {A}}} - \bar{b}_{{\mathrm {P}}})\). To leading order in perturbation theory \(r_m = 1 + 0.001158\,C_{\mathrm{F}}\,N_{\mathrm{f}}\,g_0^4\) [20, 45]; thus \((1-r_m)/r_m \sim \mathrm{O}(g_0^4)\). A first non-perturbative study of the coefficients \(\bar{b}_{\mathrm {A}}\), \(\bar{b}_{\mathrm {P}}\), and \(\bar{b}_m\) produced noisy results with 100% errors [46]. Since in perturbation theory \((\bar{b}_{\mathrm {A}}- \bar{b}_{\mathrm {P}}), \bar{b}_m \sim \mathrm{O}(g_0^4)\) [43], the terms proportional to \(M_{\mathrm{sum}}\) are habitually dropped.
For completeness we also discuss a slightly different way to write the \(b_m\)-counterterm of the renormalised quark mass difference of Eq. (37), in close analogy to what is done in Ref. [24]. The term in question is written as follows:
We arrive at the second expression using Eq. (39) and introducing the PCAC mass \(m_{33^\prime }\), which consists of two degenerate but distinct heavy valence flavours. Neglecting the term proportional to \(M_{\mathrm{sum}}\) in Eq. (44), we conclude that in this approximation the difference-ratio method is based on Eqs. (36) and (37), which depend on the exactly known subtracted quark mass difference \([m_{{\mathrm {q}},3} - m_{{\mathrm {q}},1}]\) and suitable PCAC quark mass ratios, but not on subtracted quark mass averages \(m_{{\mathrm {q}},ij}\) and \(\kappa _{\mathrm {crit}}\).
6 Conclusions
In the present study we have addressed, for the first time within the finite-volume Schrödinger functional setup, the non-perturbative determination of the ratio of the scalar to pseudoscalar non-singlet renormalisation constants \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) in Wilson’s lattice QCD, exploiting suitable massive chiral Ward identities. We have shown that in lattice QCD with three flavours of Wilson-Clover quarks (with non-perturbative \(c_{\mathrm{sw}}\) [47]) and tree-level Symanzik-improved gauge action, the Ward identities are restored up to \(\mathrm{O}(a^2)\) at finite lattice spacing. In order to ensure a smooth dependence of the renormalisation constant ratio on the bare gauge coupling, we have enforced a constant physics condition by working with an approximately fixed physical volume of spatial extent \(L \approx 1.2\,\mathrm{fm}\) and \(T/L \approx 3/2\).
Our main results are the parameterisation of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) in Eq. (34), valid for bare couplings \(1.55\lesssim g_0^2\lesssim 1.85\) (i.e., lattice spacings \(0.042\,\mathrm{fm}\lesssim a\lesssim 0.105\,\mathrm{fm}\)), as well as the values for \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\), given in Table 6, at the bare couplings typically employed in the large-volume \(N_{\mathrm {f}}=2+1\) CLS ensembles [18,19,20,21]. On the technical level, we had to treat properly the topology freezing encountered in our simulations, principally at the finest lattice spacing, which may prevent a trustworthy estimation of the statistical error. The operator character of Ward identities ensures their validity in sectors of fixed topological charge. Thus we have projected the correlation functions entering the Ward identities onto the trivial topological sector throughout our analysis.
Several checks have been performed, in order to guarantee the stability of the analysis and a careful assessment of the statistical as well as the systematic errors. In particular, we have verified that results on \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) from the different classes of Ward identities at our disposal are perfectly consistent with each other as expected, i.e., up to ambiguities of \({\mathrm {O}}(a^2)\) or even higher. Among the various estimators for \([Z_{\mathrm {S}}/Z_{\mathrm {P}}](g_0^2)\), our preferred choice, advocated in Eq. (34), was guided by the structural simplicity of the underlying chiral Ward identity, its numerical precision, and its robustness against systematic effects.
Since the range of couplings covered in this work matches those of the large-volume gauge field configurations generated by CLS with the same lattice action, our result for \([Z_{\mathrm {S}}/Z_{\mathrm {P}}](g_0^2)\), combined with the scale dependent renormalisation factor \(Z_{\mathrm {P}}\) from [42], can be used in the computation of quark masses as outlined in Sect. 5. Work in this direction, extending the \((2+1)\)-flavour computations of light, strange and charm quark masses on the CLS ensembles reported in refs. [25, 41], is in progress.
Data Availability Statement
This manuscript has no associated data or the data will not be deposited. [Authors’ comment: The data on which the conclusions of this article rely are reported in the tables to ensure the reproduction of our results. Further underlying datasets generated during and/or analysed during the current study and supporting the data in the tables are available from the corresponding author on reasonable request.]
Notes
In practice, distinct chiral Ward identities are used for the computation of the ratio \(Z_{\mathrm {S}}/(Z_{\mathrm {P}} Z_{\mathrm {A}})\) and \(Z_{\mathrm {A}}\); the two results are subsequently multiplied to give \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\).
With Wilson fermions, the singlet scalar operator \(\bar{\psi }(x) \psi (x)\) mixes with the identity operator, introducing the complication of power divergences. Moreover, Wick contractions of the fermion fields of this operator generate quark-disconnected diagrams.
Here we are working with the algebra \(su(N_{\mathrm{f}})\) for \(N_{\mathrm{f}}\ge 3\); for \(N_{\mathrm{f}}= 2\) we have that \(d^{abe} = 0\) and the r.h.s. of Eq. (4) is trivial.
This choice of hyperplanes is made for simplicity. A more general choice, \(y_0-t_-\) and \(y_0+t_+\), with \(t_-\ne t_+\) and \(t_-\),\(t_+ >0\), is also acceptable.
The notation for fermion fields is somewhat ambiguous: for example, while in this Subsection \(\psi (x), \zeta (\mathbf{v}), \zeta ^\prime ({\mathbf{v}^\prime })\) etc. stand for fields of a single flavour, in Appendix A the same quantities denote column vectors in flavour space. This ambiguity is fairly standard and should not create confusion.
It is common practice to refer to these diagrams simply as disconnected. Since from a strict field-theoretic point of view they are connected (with multitudes of gluon lines, some of which contain fermion loops), the term quark-disconnected is more appropriate (valence-quark-disconnected would be even more accurate, but far too long). In the literature, quark-connected and quark-disconnected are sometimes referred to as one- and two-boundary diagrams.
A more explicit quantitative investigation of violations of the constant physical volume requirement by our Schrödinger functional ensembles, demonstrating that it affects the Ward identity determination of improvement coefficients and normalisation factors only beyond the order we are actually interested in, will be reported in [33].
As discussed in Ref. [15], the temporal extent of our lattices is odd, so there is no central time-slice.
We additionally use ensemble C1k1.
Since we neglect correlations between our results and those of Ref. [23], the error in their ratio is probably overestimated. This explains the small values of \(\chi ^2/\text {d.o.f}\).
The ALPHA Collaboration has performed these calculations for quenched QCD with Schrödinger functional boundary conditions; see Ref. [38]. The CLS effort determined quark masses for \(N_{\mathrm{f}}=2\) QCD with periodic boundary conditions [39, 40] and for \(N_{\mathrm{f}}=2+1\) QCD with open boundary conditions [25, 41].
In perturbation theory \(2b_m = -1 + {\mathrm{O}}(g_0^2)\) and the non-perturbative estimates of Ref. [23] are also numerically sizeable. Thus this Symanzik counterterm is expected to remove large \(\mathrm{O}(a)\) effects, especially in future computations of heavy flavour quark masses (charm etc.).
This could be crucial in computations of heavier quark masses (charm etc.), where the discretisation errors become dominant.
See Ref. [28] for their definitions.
The Dirac matrix conventions used in the present work are those of Appendix A of Ref. [2]. The charge conjugation conventions are those of Appendix B of the same reference.
References
M. Lüscher, Advanced lattice QCD, Proceedings, Summer School in Theoretical Physics, Les Houches. arXiv: hep-lat/9802029
M. Lüscher, S. Sint, R. Sommer, P. Weisz, Chiral symmetry and \({\rm O}(a)\) improvement in lattice QCD. Nucl. Phys. B 478, 365 (1996). [arXiv: hep-lat/9605038]
M. Bochicchio, L. Maiani, G. Martinelli, G.C. Rossi, M. Testa, Chiral Symmetry on the Lattice with Wilson Fermions. Nucl. Phys. B 262, 331 (1985)
A. Vladikas, Three Topics in Renormalization and Improvement, Proceedings, Summer School in Theoretical Physics, Les Houches. arXiv: 1103.1323
L. Maiani, G. Martinelli, M.L. Paciello, B. Taglienti, Scalar densities and baryon mass differences in lattice QCD wth Wilson fermions. Nucl. Phys. B 293, 420 (1987)
G. Martinelli, S. Petrarca, C.T. Sachrajda, A. Vladikas, Nonperturbative renormalization of two quark operators with an improved lattice fermion action. Phys. Lett. B 311, 241 (1993). [Erratum: Phys. Lett. B 317, 660 (1993)]
G. Martinelli, C. Pittori, C.T. Sachrajda, M. Testa, A. Vladikas, A General method for nonperturbative renormalization of lattice operators. Nucl. Phys. B 445, 81 (1995). arXiv: hep-lat/9411010
J.-R. Cudell, A. Le Yaouanc, C. Pittori, Pseudoscalar vertex, Goldstone boson and quark masses on the lattice. Phys. Lett. B 454, 105 (1999). arXiv: hep-lat/9810058
J.-R. Cudell, A. Le Yaouanc, C. Pittori, Large pion pole in \(Z_{{\rm S}}^{{\rm MOM}}/Z_{{\rm P}}^{{\rm MOM}}\) from from Wilson action data. Phys. Lett. B 516, 92 (2001). arXiv: hep-lat/0101009
L. Giusti, A. Vladikas, RI / MOM renormalization window and Goldstone pole contamination. Phys. Lett. B 488, 303 (2000). arXiv:hep-lat/0005026
M. Papinutto, New lattice approaches to non-leptonic Kaon decays, Ph.D. thesis (2001)
C. Sturm, Y. Aoki, N.H. Christ, T. Izubuchi, C.T.C. Sachrajda, A. Soni, Renormalization of quark bilinear operators in a momentum-subtraction scheme with a nonexceptional subtraction point. Phys. Rev. D 80, 014501 (2009). arXiv:0901.2599
M. Lüscher, S. Sint, R. Sommer, H. Wittig, Nonperturbative determination of the axial current normalization constant in \({\rm O}(a)\) improved lattice QCD. Nucl. Phys. B 491, 344 (1997). arXiv:hep-lat/9611015
M. Della Morte, R. Hoffmann, F. Knechtli, R. Sommer, U. Wolff, Non-perturbative renormalization of the axial current with dynamical Wilson fermions. JHEP 0507, 007 (2005). arXiv:hep-lat/0505026
J. Bulava, M. Della Morte, J. Heitger, C. Wittemeier, Nonperturbative renormalization of the axial current in \(N_{{\rm f}}=3\) lattice QCD with Wilson fermions and a tree-level improved gauge action. Phys. Rev. D 93, 114513 (2016). arXiv:1604.05827
S. Sint, The Chirally rotated Schrödinger functional with Wilson fermions and automatic \({\rm O}(a)\) improvement. Nucl. Phys. B 847, 491 (2011). arXiv:1008.4857
M. Dalla Brida, S. Sint, P. Vilaseca, The chirally rotated Schrödinger functional: theoretical expectations and perturbative tests. JHEP 08, 102 (2016). arXiv:1603.00046
M. Bruno et al., Simulation of QCD with \(N_{\rm f} = 2 + 1\) flavors of non-perturbatively improved Wilson fermions. JHEP 02, 043 (2015). arXiv:1411.3982
M. Bruno, T. Korzec, S. Schaefer, Setting the scale for the CLS \(2 + 1\) flavor ensembles. Phys. Rev. D 95, 074504 (2017). arXiv:1608.08900
G.S. Bali, E.E. Scholz, J. Simeth, W. Söldner, Lattice simulations with \(N_{{\rm f}}=2+1\) improved Wilson fermions at a fixed strange quark mass. Phys. Rev. D 94, 074501 (2016). arXiv:1606.09039
D. Mohler, S. Schaefer, J. Simeth, CLS 2+1 flavor simulations at physical light- and strange-quark masses. EPJ Web Conf. 175, 02010 (2018). arXiv:1712.04884
M. Dalla Brida, T. Korzec, S. Sint, P. Vilaseca, High precision renormalization of the flavour non-singlet Noether currents in lattice QCD with Wilson quarks. Eur. Phys. J. C 79, 23 (2019). arXiv:1808.09236
G.M. de Divitiis, P. Fritzsch, J. Heitger, C.C. Köster, S. Kuberski, A. Vladikas, Non-perturbative determination of improvement coefficients \(b_{{\rm m}}\) and \(b_{{\rm A}}-b_{{\rm P}}\) and normalisation factor \(Z_{{\rm m}}Z_{{\rm P}}/Z_{{\rm A}}\) with \(N_{{\rm f}}= 3\) Wilson fermions. Eur. Phys. J. C 79, 797 (2019). arXiv:1906.03445
S. Dürr, Z. Fodor, C. Hoelbling, S. Katz, S. Krieg, T. Kurth et al., Lattice QCD at the physical point: Simulation and analysis details. JHEP 08, 148 (2011). arXiv:1011.2711
M. Bruno, I. Campos, P. Fritzsch, J. Koponen, C. Pena, D. Preti et al., Light quark masses in \(N_{\rm f} = 2+1\) lattice QCD with Wilson fermions. Eur. Phys. J. C 80, 169 (2020). arXiv:1911.08025
J. Heitger, F. Joswig, A. Vladikas, C. Wittemeier, Non-perturbative determination of \(c_{{\rm V}}, Z_{{\rm V}}\) and \(Z_{{\rm S}}/Z_{{\rm P}}\) in \(N_{{\rm f}}=3\) lattice QCD. EPJ Web Conf. 175, 10004 (2018). arXiv:1711.03924
J. Heitger, F. Joswig, A. Vladikas, \(Z_{{\rm S}}/Z_{{\rm P}}\) from three-flavour lattice QCD. PoS LATTICE2018, 217 (2018). arXiv:1810.03509
M. Lüscher, P. Weisz, \({\rm O}(a)\) improvement of the axial current in lattice QCD to one loop order of perturbation theory. Nucl. Phys. B 479, 429 (1996). arXiv:hep-lat/9606016
J. Bulava, M. Della Morte, J. Heitger, C. Wittemeier, Non-perturbative improvement of the axial current in \(N_{\rm f}=3\) lattice QCD with Wilson fermions and tree-level improved gauge action. Nucl. Phys. B 896, 555 (2015). arXiv:1502.04999
U. Wolff, Monte Carlo errors with less errors. Comput. Phys. Commun. 156, 143 (2004). arXiv:hep-lat/0306017. [Erratum: Comput. Phys. Commun. 176, 383 (2007)]
A. Ramos, Automatic differentiation for error analysis of Monte Carlo data. Comput. Phys. Commun. 238, 19 (2019). arXiv:1809.01289
P. Fritzsch, J. Heitger, N. Tantalo, Non-perturbative improvement of quark mass renormalization in two-flavour lattice QCD. JHEP 08, 074 (2010). arXiv:1004.3978
J. Heitger, F. Joswig, The renormalized \({\rm O}(a)\) improved vector current in three-flavour lattice QCD with Wilson quarks, in preparation
P. Fritzsch, A. Ramos, F. Stollenwerk, Critical slowing down and the gradient flow coupling in the Schrödinger functional. PoS Lattice 2013, 461 (2014). arXiv:1311.7304
S. Schaefer, R. Sommer, F. Virotta, Critical slowing down and error analysis in lattice QCD simulations. Nucl. Phys. B 845, 93 (2011). arXiv:1009.5228
P. T. Boggs, J. E. Rogers, Orthogonal distance regression, tech. rep., National Institute of Standards and Technology, Gaithersburg, MD (1989). https://doi.org/10.6028/NIST.IR.89-4197
M. Constantinou, V. Lubicz, H. Panagopoulos, F. Stylianou, \({\rm O}(a^2)\) corrections to the one-loop propagator and bilinears of clover fermions with Symanzik improved gluons. JHEP 10, 064 (2009). arXiv:0907.0381
J. Garden, J. Heitger, R. Sommer, H. Wittig, Precision computation of the strange quark’s mass in quenched QCD. Nucl. Phys. B 571, 237 (2000). arXiv:hep-lat/9906013
P. Fritzsch, F. Knechtli, B. Leder, M. Marinkovic, S. Schaefer, R. Sommer et al., The strange quark mass and Lambda parameter of two flavor QCD. Nucl. Phys. B 865, 397 (2012). arXiv:1205.5380
J. Heitger, G.M. von Hippel, S. Schaefer, F. Virotta, Charm quark mass and D-meson decay constants from two-flavour lattice QCD. PoS LATTICE2013, 475 (2014). arXiv:1312.7693
J. Heitger, F. Joswig, S. Kuberski, Towards the determination of the charm quark mass on \(N_{{\rm f}}=2+1\) CLS ensembles. PoS LATTICE2019, 092 (2019). arXiv:1909.05328
I. Campos, P. Fritzsch, C. Pena, D. Preti, A. Ramos, A. Vladikas, Non-perturbative quark mass renormalisation and running in \(N_{\rm f}=3\) QCD. Eur. Phys. J. C 78, 387 (2018).arXiv:1802.05243
T. Bhattacharya, R. Gupta, W. Lee, S.R. Sharpe, J.M. Wu, Improved bilinears in lattice QCD with non-degenerate quarks. Phys. Rev. D 73, 034504 (2006). arXiv:hep-lat/0511014
S. Capitani, M. Lüscher, R. Sommer, H. Wittig, Non-perturbative quark mass renormalization in quenched lattice QCD. Nucl. Phys. B 544, 669 (1999). arXiv:hep-lat/9810063. [Erratum: Nucl. Phys. B 582, 762 (2000)]
M. Constantinou, M. Hadjiantonis, H. Panagopoulos, Renormalization of flavor singlet and nonsinglet fermion bilinear operators. PoS LATTICE2014, 298 (2014). arXiv:1411.6990
P. Korcyl, G.S. Bali, Non-perturbative determination of improvement coefficients using coordinate space correlators in \(N_{\rm f}=2+1\) lattice QCD. Phys. Rev. D 95, 014505 (2017). arXiv:1607.07090
J. Bulava, S. Schaefer, Improvement of \(N_{\rm f}=3\) lattice QCD with Wilson fermions and tree-level improved gauge action. Nucl. Phys. B 874, 188 (2013). arXiv:1304.7093
Acknowledgements
We thank Stefan Sint, Christian Wittemeier, Carl Christian Köster and Simon Kuberski for helpful discussions and especially Carl for his valuable contributions in extending the set of ensembles used in our computations. A. V. wishes to thank the Particle Physics Theory Group at WWU Münster and Trinity College Dublin for their hospitality. This work is supported by the Deutsche Forschungsgemeinschaft (DFG) through the Research Training Group GRK 2149: Strong and Weak Interactions – from Hadrons to Dark Matter (F. J. and J. H.). We acknowledge the computer resources provided by the Zentrum für Informationsverarbeitung of the University of Münster (PALMA & PALMA II HPC clusters) and thank its staff for support.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: Basic definitions
We define non-singlet vector and axial vector currents in QCD with \(N_{\mathrm{f}}\) quarks as
with \(a=1,\dots ,(N_{\mathrm{f}}^2-1)\) an \(SU(N_{\mathrm{f}})\) flavour index. See Appendix B for our conventions regarding \(SU(N_{\mathrm{f}})\) groups and \(su(N_{\mathrm{f}})\) Lie algebras. Analogously, non-singlet scalar and pseudoscalar densities are given by
Axial transformations of the fermion fields are defined as:
Small axial field variations are obtained by expanding the above up to \(\mathrm{O}(\epsilon )\):
Note that in general these transformations are defined to be local (i.e., \(\epsilon ^a\) depends on space-time). Their global counterparts are related to symmetries of the continuum theory (vector and chiral).
In the Schrödinger functional framework, standard zero-momentum sources are defined as followsFootnote 15:
where \(\zeta \) and \(\zeta ^\prime \) are the quark fields at the Schrödinger functional boundaries \(x_0\) = 0 and \(x_0 = T\) , respectively.
Appendix B: Properties of \(su(N_{\mathrm{f}})\) Lie algebra generators
Our conventions for the \(su(N_{\mathrm{f}})\) Lie Algebra are those of Appendix A.3. of Ref. [2]. In general, the anti-Hermitean generators of the algebra satisfy
We work in the fundamental representation, with the generators normalised so that
The anticommutator of these generators is given by
where \(I_{N_{\mathrm{f}}}\) is the dimension-\(N_{\mathrm{f}}\) unit matrix. The structure constants \(f^{abc}\) are real and totally antisymmetric tensors, while \(d^{abc}\) are real and totally symmetric. Two useful identities are
For \(N_{\mathrm{f}}= 2\) we have \(T^a = \tau ^a/(2{\mathrm {i}})\) (\(\tau ^a\) are the Pauli matrices), \(f^{abc} = \epsilon ^{abc}\) (the Levi-Civita symbol) and \(d^{abc} = 0\).
For \(N_{\mathrm{f}}= 3\) we have \(T^a = \lambda ^a/(2{\mathrm {i}})\) (\(\lambda ^a\) are the Gell-Mann matrices). The non-vanishing structure constants are
and their anti-symmetric counterparts. The non-vanishing symmetric constants are
and their symmetric counterparts.
Two useful properties are straightforward consequences of Eqs. (B.6) and (B.7):
Property A: For any pair of indices a, b, there is at most one value of a third index c for which \(d^{abc} \ne 0\).
Property B: There is no combination of flavour indices a, b, c for which \(f^{abc} \ne 0\) and \(d^{abc} \ne 0\). In other words, when \(f^{abc} \ne 0\), then \(d^{abc} = 0\), and when \(d^{abc} \ne 0\), then \(f^{abc} = 0\).
Appendix C: Renormalisation and improvement
All operators of interest are flavour non-singlets and, unless otherwise stated, quark masses are degenerate. For Wilson fermions, with \(\mathrm{O}(a)\) Symanzik improvement, we know that the improved current
is correctly normalised a follows:
The renormalised and Symanzik-improved scalar and pseudoscalar densities are given by
with \(a m_{\mathrm {q}} = 1/(2\kappa ) - 1/(2 \kappa _{{\mathrm {crit}}})\) the subtracted bare mass; here \(\kappa \) is the Wilson hopping parameter and \(\kappa _{{\mathrm {crit}}}\) its critical value (chiral limit). The mass matrix of subtracted quark masses is denoted by \(M_{\mathrm {q}}\). The current (bare) quark mass, which appears in the chiral Ward identities of the present paper, is defined by the PCAC relation
The renormalised quark mass \(m_{{\mathrm {R}}}\) is given in terms of the current mass m by
For two distinct flavours i, j, the subtracted quark masses are \(a m_{{{\mathrm {q}}},i} = 1/(2\kappa _i) - 1/(2 \kappa _{{\mathrm {crit}}})\) and similarly for \(a m_{{{\mathrm {q}}},j}\). The PCAC mass is defined as
and the renormalised quark mass average is expressed in terms of \(m_{ij}\) as follows:
where \(m_{{\mathrm {q}},ij} \equiv (m_{{\mathrm {q}},i} + m_{{\mathrm {q}},j})/2\). This reduces to Eq. (C.6) for two degenerate masses \(m_{{\mathrm {q}},i} = m_{{\mathrm {q}},j}\).
In practice for the divergence of the improved axial current we use \(\partial _\mu (A_{\mathrm{I}})^a_\mu \equiv \tilde{\partial }_\mu A^a_\mu + a c_{\mathrm{A}} \partial _\mu ^*\partial _\mu P^a \), where \(\tilde{\partial }_\mu \) denotes the average of the usual forward and backward derivatives defined as \(a \partial _\mu f(x) \equiv f(x+a\hat{\mu }) - f(x)\) and \(a \partial _\mu ^*f(x) \equiv f(x) - f(x-a\hat{\mu })\).
Appendix D: Charge conjugation, \(\gamma _5\)-Hermiticity, and correlation functions
Wilson quark propagators in lattices with Schrödinger functional boundary conditions, on a fixed background gauge field, are standard ones, denoted as \([\psi (y) \, \bar{\psi }(x)]_{\mathrm {F}}\), or boundary-to-bulk ones like \([\zeta (\mathbf{v}) \bar{\psi }(x)]_{{\mathrm {F}}}\).Footnote 16 They all obey the \(\gamma _5\)-Hermiticity property; e.g.
Under charge conjugation,Footnote 17 the quark bilinear operators of interest transform as follows:
with \([T^a]^T\) the transpose of \([T^a]\). The time-boundary operators \(\bar{\zeta }(\mathbf{u}) \gamma _5 T^a \zeta (\mathbf{v})\) and \(\bar{\zeta }^\prime (\mathbf{u}^\prime ) \gamma _5 T^a \zeta ^\prime (\mathbf{v}^\prime )\) satisfy analogous properties. Note that in Eqs. (D.1), Wick-contracted fermion fields are same-flavour functions, while in Eqs. (D.2) they are vectors in flavour space.
We now concentrate on the r.h.s. of WI (11), and in particular on Eq. (12) and the traces \(F_{{\mathrm {S;1}}}\) and \(F_{{\mathrm {S;2}}}\) of Table 1. Using the \(\gamma _5\)-Hermiticity properties of Eqs. (D.1), it can be easily shown that \(F_{{\mathrm {S}};2}(y_0) = F_{{\mathrm {S}};1}(y_0)^\dagger \). On the other hand, the traces of three flavour matrices \(T^{dea}\) and \(T^{aed}\) are given by Eq. (B.4). Putting everything together, the r.h.s. of the Ward identity (11) becomes
Next we apply charge conjugation to the correlation function \(\langle {\mathcal {O}}^{\prime a} \,\, S^e(y) \,\,{\mathcal {O}}^d \rangle \). We see from Eq. (D.2) that the transformation only affects the flavour matrices; instead of \(\mathrm{Tr}( T^a T^e T^d)\) we have \(\mathrm{Tr}( T^{aT} T^{eT} T^{dT}) = \mathrm{Tr}( T^d T^e T^a)\) and instead of \(\mathrm{Tr}( T^d T^e T^a)\) we have \(\mathrm{Tr}( T^{dT} T^{eT} T^{aT}) = \mathrm{Tr}( T^a T^e T^d)\). Thus, under a charge conjugation transformation,
This should be equal to the original expression (D.3), because charge conjugation leaves QCD correlation functions unaffected. Comparing the last two equations we see that this can only be true if \( \mathrm{Im}\,\big \{ F_{{\mathrm {S (1)}}}(y_0) \big \}\) vanishes. This proves Eq. (13).
Having shown that the r.h.s. of WI (11) is real, the l.h.s. must also be real. As a crosscheck we show this explicitly. The l.h.s. correlation function is given by Eq. (14), with the traces of flavour matrices given by Eqs. (15)–(20) and the 9 terms \(F_{{\mathrm {AP}};k}\) listed in Table 1. Taking the Hermitean conjugate of these terms we find that the one-boundary ones are related pairwise by complex conjugation,
Hermitean conjugation also implies that the quark-disconnected contributions are real:
From these properties it immediately follows that the l.h.s. of the WI is real.
However we want to go a step further and show the reality of the traces \(F_{{\mathrm {AP}};1}, \ldots , F_{{\mathrm {AP}};9}\). For the one-boundary contributions, Eqs. (D.5) imply that
with (cf. Eq. (B.5)):
Applying charge conjugation to the 4-point correlation function \(\langle \mathcal {O}^{\prime a} \,A_0^b(x) \, P^c(y) \,\mathcal {O}^d\rangle \), we find that \(F_{{\mathrm {AP}};1} \rightarrow F_{{\mathrm {AP}};1}\), \(F_{{\mathrm {AP}};2} \rightarrow F_{{\mathrm {AP}};2}\), and \(T_1^{abcd} \leftrightarrow T_2^{abcd}\). Thus under charge conjugation Eq. (D.7) transforms as follows:
But applying Eq. (B.5) to \(T_2^{abcd}\) (cf. also Eq. (D.8)) we see that \(\mathrm{Re}\,(T_2^{abcd}) = \mathrm{Re}\,(T_1^{abcd})\) and \(\mathrm{Im}\,(T_2^{abcd}) = -\mathrm{Im}\,(T_1^{abcd})\). Thus, under charge conjugation
Comparing this result to Eq. (D.7) and recalling that QCD correlation functions remain invariant under charge conjugation, we deduce that \(\mathrm{Im}\,(F_{{\mathrm {AP}};1})=0\). Analogously, \(F_{{\mathrm {AP}};2}, \ldots , F_{{\mathrm {AP}};6}\) are also real. Concerning one-boundary contributions, traces \(T_7^{abcd}, T_8^{abcd} , T_9^{abcd}\) are easily seen to be real from Eq. (B.2). The reality of \(F_{{\mathrm {AP}};7}, F_{{\mathrm {AP}};8}, F_{{\mathrm {AP}};9}\) then follows immediately from Eqs. (D.6). This completes our proof that also the l.h.s. of WI (11) is real.
Appendix E: Non-perturbative checks
As additional validation of our method we want to make sure that the relations (24), (27) and (31) which relate different diagrams to one another are fulfilled up to ambiguities of \({\mathrm {O}}(a^2)\). After making sure that the identities are valid at tree-level of perturbation theory we evaluate them non-perturbatively on our ensembles. The analysis is analogous to the one for the ratio \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\). After evaluating the identities on each lattice for a given value of \(\beta \), we perform an extra- or interpolation to the chiral point linear in the current quark mass. The values presented here are the results at the chiral point obtained from this procedure. The clearest evidence comes from identity (24) which we can rewrite as
In the top part of Fig. 8 we present the results which show the expected scaling towards the continuum.
The identities (27) and (31) are more complicated to verify as they involve quark disconnected contributions. We can rewrite the identities as follows
The numerical results are presented in the bottom part of Fig. 8. In this case the statistical uncertainties are orders of magnitudes larger and grow towards the continuum limit. A possible explanation of this is that the \(\varDelta _i\) involved here are vanishing at tree-level in perturbation theory. Despite the large uncertainties our data still suggest that the identities are fulfilled up to the expected ambiguities in the lattice spacing.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Funded by SCOAP3
About this article
Cite this article
Heitger, J., Joswig, F. & Vladikas, A. Ward identity determination of \(Z_{\mathrm {S}}/Z_{\mathrm {P}}\) for \(N_{\mathrm {f}}=3\) lattice QCD in a Schrödinger functional setup. Eur. Phys. J. C 80, 765 (2020). https://doi.org/10.1140/epjc/s10052-020-8266-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-020-8266-2