Semileptonic form factors for $B\rightarrow D^*\ell \nu $ at nonzero recoil from $2+1$-flavor lattice QCD

We present the first unquenched lattice-QCD calculation of the form factors for the decay $B\rightarrow D^*\ell \nu $ at nonzero recoil. Our analysis includes 15 MILC ensembles with $N_f=2+1$ flavors of asqtad sea quarks, with a strange quark mass close to its physical mass. The lattice spacings range from $a\approx 0.15$ fm down to 0.045 fm, while the ratio between the light- and the strange-quark masses ranges from 0.05 to 0.4. The valence b and c quarks are treated using the Wilson-clover action with the Fermilab interpretation, whereas the light sector employs asqtad staggered fermions. We extrapolate our results to the physical point in the continuum limit using rooted staggered heavy-light meson chiral perturbation theory. Then we apply a model-independent parametrization to extend the form factors to the full kinematic range. With this parametrization we perform a joint lattice-QCD/experiment fit using several experimental datasets to determine the CKM matrix element $|V_{cb}|$. We obtain $\left| V_{cb}\right| = (38.40 \pm 0.68_{\text {th}} \pm 0.34_{\text {exp}} \pm 0.18_{\text {EM}})\times 10^{-3}$. The first error is theoretical, the second comes from experiment and the last one includes electromagnetic and electroweak uncertainties, with an overall $\chi ^2\text {/dof} = 126/84$, which illustrates the tensions between the experimental data sets, and between theory and experiment. This result is in agreement with previous exclusive determinations, but the tension with the inclusive determination remains. Finally, we integrate the differential decay rate obtained solely from lattice data to predict $R(D^*) = 0.265 \pm 0.013$, which confirms the current tension between theory and experiment.

1 Introduction

High precision tests of the standard model (SM) offer exciting possibilities for discovering new physics. In particular, the flavor sector of the SM is very rich in phenomena that can be used to explore physics beyond the standard model (BSM). Most flavor physics revolves around the Cabibbo–Kobayashi–Maskawa (CKM) matrix, which relates the mass and flavor eigenstates of the quarks. Since it is a basis transformation, the CKM matrix is constrained by unitarity, so violations of this rule could indicate the influence of new physics. Weak processes that are loop-suppressed in the SM may also expose new physics. To determine CKM matrix elements to high precision and to perform precision tests of the SM in measurements of rare decay processes, it is essential to know the strong-interaction environment in which these processes occur.

Among the CKM matrix elements, $\left| V_{cb}\right| $ has arguably been one of the most perplexing. There is a long-standing tension between the determination of this element via exclusive and inclusive decays. The operator product expansion (OPE) is used to analyze inclusive decay experiments measuring semileptonic decays $B \rightarrow X_c\ell \nu $, where $X_c$ represents any charmed hadron or combination of hadrons with a single c quark. On the other hand, exclusive decay experiments focus on decays with a specific charmed hadron in the final state, for example, $B \rightarrow D\ell \nu $ or $B \rightarrow D^*\ell \nu $. We expect both types of experiments to yield consistent results for $|V_{cb}|$; however, there is a $\sim 3\sigma $ discrepancy between the inclusive and exclusive determinations [1, 2]. Since contributions from new physics are unlikely to explain these differences [3, 4], the disagreement presents an obstacle to higher precision tests of the SM.

We turn now to the determination of $|V_{cb}|$ from the exclusive decay $B \rightarrow D^*\ell \nu $, which has an interesting history. As is detailed below in Eq. (7), to determine $|V_{cb}|$, a measurement of the differential decay rate $d\varGamma /dw$ is needed and the form factors must be computed by theory. Using lattice QCD, it has been possible to determine the key form factor at zero recoil. However, the differential decay rate vanishes at that point because of kinematic factors, so it is necessary to use the differential decay rate at nonzero recoil to extrapolate the form factors to zero recoil. Since the 1990s, there have been two parametrizations of the form factors, one by Boyd, Grinstein, and Lebed (BGL) [5,6,7], and the other by Caprini, Lellouch, and Neubert (CLN) [8]. The CLEO and BaBar experiments [9,10,11], for example, relied on the CLN parametrization to analyze the dependence of their data on the recoil parameter. This was also true for earlier reviews from the Heavy Flavor Averaging Group (HFLAV) [12].

The situation changed in 2017 with the publication of unfolded data from a $B \rightarrow D^*\ell \nu $ experiment by the Belle Collaboration [13]. This data release was quickly followed by several theoretical analyses [14,15,16,17] comparing the effect of the choice of parametrization on $|V_{cb}|$. They found that the CLN parametrization [8], at least as it is usually employed to extrapolate the experimental data to the zero-recoil point (see, for instance, Sec. V.A of Ref. [18]), did not provide a good description of the experimental data, whereas the BGL parametrization [5,6,7] described the data properly and yielded exclusive determinations of $|V_{cb}|$ that were compatible with the inclusive ones [14,15,16,17]. However, more recent analyses by the Belle Collaboration using the much larger untagged dataset [18] and by the BaBar Collaboration performing a new analysis of their old data [19] contradicted this picture and reinforced the long-standing tension between the inclusive and the exclusive determinations. Newer theoretical analyses using Belle’s untagged dataset also found agreement between CLN and BGL results [20, 21]. These analyses also conclude that CLN is still a useful parametrization given the current errors in the experimental measurements. Unfortunately, previous unquenched lattice-QCD calculations of the $B\rightarrow D^*$ form factor [22, 23] cannot provide constraints on the shape, because they are limited to zero recoil. Hence, a precise first-principles calculation of the form factors involved in the exclusive process, performed for a range of nonzero-recoil momenta, could be extremely helpful.

Another motivation to study this process is the existing tension between experimental measurements and SM predictions of several lepton-flavor-universality-violating (LFUV) observables in B-meson semileptonic decays [24]. The ratios of branching fractions of the semitauonic and other semileptonic $B\rightarrow D^{(*)}$ transitions

$$\begin{aligned} R(D^{(*)})\equiv \frac{{{\mathcal {B}}}(B\rightarrow D^{(*)}\tau \nu _\tau )}{{{\mathcal {B}}}(B\rightarrow D^{(*)}\ell \nu _\ell )},\quad \ell =e,\mu \end{aligned}$$

(1)

disagree with the SM at the $\sim 3\sigma $ level when R(D) and $R(D^*)$ are taken together [1]. Although the last HFLAV average [1] shows a large discrepancy between theory and experiment, the most recent measurements from the BaBar, Belle, and LHCb Collaborations find $R(D^*)$ to be closer to SM expectations [25,26,27,28,29]. However, a complete lattice-QCD calculation of $R(D^*)$ is still lacking. In view of current tensions in these observables, the limitations of the available theoretical predictions, and the future improvements on the experimental side expected from LHCb and Belle II forthcoming data, an independent theoretical calculation with a tight control of systematic errors that could help to either confirm or reduce the tension is urgently needed.

In this work, we use lattice QCD to address these two points, i.e., tensions between exclusive and inclusive determinations of $|V_{cb}|$, and between the SM theoretical prediction and experiment for $R(D^*)$. Although lattice QCD has previously been used to extract $|V_{cb}|$ from experimental data for $B\rightarrow D^*\ell \nu $, the relevant decay amplitude has always been computed at zero recoil [22, 30,31,32], except for an early study in the quenched approximation [33]. Here we compute the form factors that contribute to the $B\rightarrow D^*\ell \nu $ decay for nonzero values of the recoil parameter in full QCD with $2+1$ flavors of dynamical sea quarks and extrapolate their behavior to the large recoil region. Instead of using the standard procedure of extrapolating experimental results to zero recoil and then extracting $\left| V_{cb}\right| $ using the calculated value of the form factor at zero recoil, we do a joint fit of lattice and experimental data where $\left| V_{cb}\right| $ is one of the free parameters. Once the decay amplitude is determined, we integrate over the whole kinematic space to find the branching ratios with and without a $\tau $ in the decay. We calculate $R(D^*)$ and compare our results with existing experimental determinations. Preliminary reports of this analysis were presented in Refs. [34,35,36,37,38]. In keeping with previous work on the same ensembles [39,40,41,42], this analysis was blinded until a systematic error budget was finalized. The final results were then frozen, apart from unblinding.

This article is organized as follows. In Sect. 2, we introduce the formalism and present the form factors that take part in our calculation. Section 3 gives details on the ensembles available and also describes the analysis of the lattice data up to the chiral-continuum extrapolation. In Sect. 4, we discuss the systematic errors, and Sect. 5 shows the z expansion and the joint fit with experimental data that leads to our final results for $\left| V_{cb}\right| $ and $R(D^*)$. Section 6 presents our conclusions. Appendix A includes details on the fit function employed in the chiral-continuum extrapolation. Appendix B outlines the calculation of the matching factors and their errors. Appendix C explains in detail how the $\kappa $ tuning correction for the heavy quarks was calculated. Finally, Appendix D provides a guide to ancillary files containing complete results for the form factors, with a full correlation matrix.

2 Form factor definitions

In this section, we set the definitions and notation used in the following sections for the form factors, ratio of correlators, currents, and renormalization factors, among others.

2.1 Form factors in the continuum

The $B\rightarrow D^*\ell \nu $ process is mediated by the axial ${\mathcal {A}}^\mu = {\bar{c}}\gamma ^\mu \gamma _5b$ and the vector ${\mathcal {V}}^\mu = {\bar{c}}\gamma ^\mu b$ electroweak currents. The transition matrix elements for this process are usually decomposed into form factors inspired by the heavy quark effective theory (HQET) [43]:

$$\begin{aligned} \frac{\langle D^*(p_{D^*},\varepsilon ) |{\mathcal {A}}^\mu | B(p_B)\rangle }{\sqrt{M_{D^*}M_B}}&= i\varepsilon ^*_\nu \Big [g^{\mu \nu }(w+1)\,h_{A_1}(w) \nonumber \\&\quad - v_B^\nu (v_B^\mu \,h_{A_2}(w) + v_{D^*}^\mu \,h_{A_3}(w))\Big ], \end{aligned}$$

(2)

$$\begin{aligned} \frac{\langle D^*(p_{D^*},\varepsilon ) |{\mathcal {V}}^\mu | B(p_B)\rangle }{\sqrt{M_{D^*}M_B}}&= \epsilon ^{\mu \nu }_{\rho \sigma }\varepsilon ^*_\nu v_B^\rho v_{D^*}^\sigma \,h_V(w), \end{aligned}$$

(3)

where $\varepsilon $ is the polarization vector of the $D^*$ meson, $M_Y$ is the mass of the $Y=B$, $D^*$ meson and $p_Y$ its respective momentum. From the four-velocities $v_Y=p_Y/M_Y$, one can define the recoil parameter $w=v_B\cdot v_{D^*}$.

To express the differential decay rate, it is convenient to introduce helicity amplitudes according to the polarization of the off-shell W boson [44]:

$$\begin{aligned} H_\pm (w)&= (w+1)\left[ h_{A_1}(w) \mp \sqrt{\frac{w-1}{w+1}} h_V(w)\right] , \end{aligned}$$

(4)

$$\begin{aligned} H_0(w)&= y (w+1) \Big \{(w-r) h_{A_1}(w) \nonumber \\&\quad - (w-1)\left[ r h_{A_2}(w) + h_{A_3}(w) \right] \Big \}, \end{aligned}$$

(5)

$$\begin{aligned} H_S(w)&= y \sqrt{w^2-1} \Big [(w+1) h_{A_1}(w) \nonumber \\&\quad - (1-wr) h_{A_2}(w) - (w-r) h_{A_3}(w)\Big ], \end{aligned}$$

(6)

where $r=M_{D^*}/M_B$ and $y^2(1 - 2wr + r^2)=1$. The differential decay rate for $B^-\rightarrow D^{0*}\ell ^-{{\bar{\nu }}}$ is then

$$\begin{aligned} \frac{d\varGamma }{dw}&= |V_{cb}|^2 |\eta _\text {EW}|^2 \frac{G_F^2 M_B^5}{16\pi ^3} \left( 1 - \frac{m_\ell ^2}{q^2}\right) ^2 r^3(w^2-1)^{1/2} \nonumber \\&\quad \times \left\{ \frac{1}{3y^2} \left( 1 + \frac{m_\ell ^2}{2q^2} \right) \left[ |H_+|^2 + |H_-|^2 + |H_0|^2 \right] \right. \nonumber \\&\left. \quad + \frac{m_\ell ^2}{2M_B^2} |H_S|^2 \right\} , \end{aligned}$$

(7)

where $\eta _\text {EW}$ is a short-distance electroweak correction [45], $G_F$ is the Fermi constant determined from muon decay, and $m_\ell $ is the charged lepton mass. Note that the scalar helicity amplitude’s contribution is suppressed by $(m_\ell /M_B)^2$. In practice, it is neglected for semielectronic and semimuonic decays. The process ${\bar{B}}^0\rightarrow D^{+ *}\ell ^-{\bar{\nu }}$ needs an extra factor $(1+\alpha \pi )$ on the right-hand side of Eq. (7) in order to account for the Coulomb attraction among the charged decay products [46,47,48]. Other electromagnetic effects, including structure-dependent corrections, are smaller, of order $\alpha /\pi $ instead of $\alpha \pi $ [46,47,48]. For the determination of $|V_{cb}|$, the full angular information of the decay chain $B\rightarrow D^*\ell \nu \;(D^*\rightarrow D\pi )$ is used, as discussed in Sect. 5.2.
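Eqs. (4)–(7) translate directly into a few lines of code. The following is a minimal illustrative sketch, not the analysis code used in this work. The function names, the `form_factors` callable, and the single `norm` argument (standing in for $|V_{cb}|^2|\eta _\text {EW}|^2G_F^2$) are assumptions made for the example. It also uses the kinematic identity $q^2 = M_B^2(1-2wr+r^2)$.

```python
import math

def helicity_amplitudes(w, r, hA1, hA2, hA3, hV):
    """Eqs. (4)-(6): return (H_+, H_-, H_0, H_S) at recoil w, with r = M_D*/M_B."""
    y = 1.0 / math.sqrt(1.0 - 2.0 * w * r + r * r)  # from y^2 (1 - 2wr + r^2) = 1
    root = math.sqrt((w - 1.0) / (w + 1.0))
    H_plus = (w + 1.0) * (hA1 - root * hV)
    H_minus = (w + 1.0) * (hA1 + root * hV)
    H_zero = y * (w + 1.0) * ((w - r) * hA1 - (w - 1.0) * (r * hA2 + hA3))
    H_scalar = y * math.sqrt(w * w - 1.0) * (
        (w + 1.0) * hA1 - (1.0 - w * r) * hA2 - (w - r) * hA3
    )
    return H_plus, H_minus, H_zero, H_scalar

def dgamma_dw(w, r, M_B, m_lep, form_factors, norm=1.0):
    """Eq. (7). form_factors is a hypothetical callable w -> (hA1, hA2, hA3, hV);
    norm stands in for |V_cb|^2 |eta_EW|^2 G_F^2."""
    inv_y2 = 1.0 - 2.0 * w * r + r * r
    q2 = M_B**2 * inv_y2
    Hp, Hm, H0, Hs = helicity_amplitudes(w, r, *form_factors(w))
    phase = (1.0 - m_lep**2 / q2) ** 2 * r**3 * math.sqrt(w * w - 1.0)
    vector_part = inv_y2 / 3.0 * (1.0 + m_lep**2 / (2.0 * q2)) * (Hp**2 + Hm**2 + H0**2)
    scalar_part = m_lep**2 / (2.0 * M_B**2) * Hs**2
    return norm * M_B**5 / (16.0 * math.pi**3) * phase * (vector_part + scalar_part)
```

With constant toy form factors, the sketch reproduces two features noted in the text: the rate vanishes at $w=1$, and the scalar amplitude contributes only through the $m_\ell ^2$ term.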

As noted in the introduction, experimental measurements of the ratio of branching fractions in Eq. (1) are in tension with the SM. To date, the $B\rightarrow D^*$ form factors have been estimated with HQET, QCD sum rules, and input from experiment. With the results presented below, however, we can compute this ratio directly from (lattice) QCD:

$$\begin{aligned} {\mathcal {B}}(B\rightarrow D^*\ell \nu ) = \tau _B \int _1^{w_{\text {Max},\ell }} dw\,\frac{d\varGamma }{dw}, \end{aligned}$$

(8)

where $w_{\text {Max},\ell } = (1+r^2 - m_\ell ^2/M_B^2)/2r$. In $R(D^*)$, the B-meson lifetime $\tau _B$ drops out, so we form it from the ratio of partial widths.
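The kinematic endpoint and the integral in Eq. (8) can be sketched numerically as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the integration uses a simple midpoint rule rather than whatever quadrature the analysis employs, and the `rate` callables are expected to return $d\varGamma /dw$ up to a common normalization that cancels in the ratio.

```python
def q_squared(w, r, M_B):
    """Lepton-pair invariant mass squared, q^2 = M_B^2 (1 - 2 w r + r^2)."""
    return M_B**2 * (1.0 - 2.0 * w * r + r * r)

def w_max(r, M_B, m_lep):
    """Upper limit of Eq. (8): the recoil at which q^2 = m_lep^2."""
    return (1.0 + r * r - (m_lep / M_B) ** 2) / (2.0 * r)

def integrate_rate(rate, r, M_B, m_lep, n_steps=2000):
    """Midpoint-rule estimate of the integral of rate(w) from 1 to w_max."""
    upper = w_max(r, M_B, m_lep)
    step = (upper - 1.0) / n_steps
    return step * sum(rate(1.0 + (k + 0.5) * step) for k in range(n_steps))

def lfu_ratio(rate_tau, rate_light, r, M_B, m_tau, m_light=0.0, n_steps=2000):
    """Ratio of partial widths as in Eq. (1); the B lifetime drops out."""
    numerator = integrate_rate(rate_tau, r, M_B, m_tau, n_steps)
    denominator = integrate_rate(rate_light, r, M_B, m_light, n_steps)
    return numerator / denominator
```

For approximate meson and lepton masses $M_B\approx 5.279$ GeV, $M_{D^*}\approx 2.010$ GeV, and $m_\tau \approx 1.777$ GeV (rounded inputs chosen for the example, not values quoted in this section), `w_max` gives roughly 1.50 for a massless lepton and roughly 1.35 for the $\tau $, which shows how much phase space the semitauonic mode loses.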

Many papers in the literature introduce a decay amplitude ${\mathcal {F}}(w)$, defined by

$$\begin{aligned} |{\mathcal {F}}(w)|^2 = \frac{1-2wr+r^2}{w+1} \frac{|H_+|^2 + |H_-|^2 + |H_0|^2}{(5w+1)(1-r)^2 -8rw(w-1)}, \nonumber \\ \end{aligned}$$

(9)

and refer to ${\mathcal {F}}$ as a “form factor.” For example, experimental results are often reported as $|V_{cb}|^2|\eta _\text {EW}|^2|{\mathcal {F}}(w)|^2$. At zero recoil, ${\mathcal {F}}(1)=h_{A_1}(1)$. Thus, previous work in lattice QCD on this decay has focused on this single number, rather than the four independent functions, $h_{A_1} (w)$, $h_{V} (w)$, $h_{A_2} (w)$, and $h_{A_3} (w)$, computed in this work.
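The zero-recoil relation can be verified directly from Eqs. (4), (5), and (9). At $w=1$ the terms proportional to $h_V$, $h_{A_2}$, and $h_{A_3}$ vanish and $y=1/(1-r)$, so that

$$\begin{aligned} H_\pm (1)&= H_0(1) = 2h_{A_1}(1), \nonumber \\ |{\mathcal {F}}(1)|^2&= \frac{(1-r)^2}{2}\, \frac{12\,h_{A_1}^2(1)}{6(1-r)^2} = h_{A_1}^2(1). \end{aligned}$$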

2.2 Extracting the form factors from lattice matrix elements

Our heavy quarks (b, c) are simulated using the Fermilab action [49], as discussed in Sect. 3.1. In this framework, the lattice currents for the quark transition $y\rightarrow x$ are

$$\begin{aligned} V_{xy}^\mu&= {\bar{\varPsi }}_x\gamma ^\mu \varPsi _y, \end{aligned}$$

(10)

$$\begin{aligned} A_{xy}^\mu&= {\bar{\varPsi }}_x\gamma ^\mu \gamma _5\varPsi _y, \end{aligned}$$

(11)

where x, y indicate the flavors c, b and $\varPsi $ is the Fermilab-improved field,

$$\begin{aligned} \varPsi = (1+d_1{\varvec{\gamma }}\cdot {\varvec{D}}_\text {lat})\psi . \end{aligned}$$

(12)

In this expression, the original heavy-quark field $\psi $ is rotated in order to reduce the discretization errors. In particular, the coefficient $d_1$ must be calculated for each value of the quark mass in order to remove the O(a) terms.

The lattice current $J_{xy}^\mu $ is related to its equivalent in the continuum ${\mathcal {J}}_{xy}^\mu $ through the renormalization factors,

$$\begin{aligned} {\mathcal {J}}_{xy}^\mu \, \dot{=} \, Z_{J_{xy}^\mu } J_{xy}^\mu , \end{aligned}$$

(13)

where the $\dot{=}$ symbol means that both sides of the equation have the same matrix elements. In practice, the renormalization factors are only calculated approximately up to some order in a and $\alpha _s$. In this work, we use a technique called mostly nonperturbative renormalization [50, 51] that eliminates most of the nonperturbative dependence of the renormalization factors by defining the factors,

$$\begin{aligned} \rho ^2_{J^\mu } = \frac{Z_{J_{cb}^\mu }Z_{J_{bc}^\mu }}{Z_{V^4_{cc}}Z_{V^4_{bb}}}. \end{aligned}$$

(14)

When taking appropriate ratios of three-point correlators, the dominant, nonperturbative contribution to the renormalization of the currents, collected in the flavor-diagonal renormalization factors $Z_{V^4_{xx}}$, cancels. The remaining matching factors $\rho _{J^\mu }$ are amenable to a perturbative calculation [50]. We compute these matching factors to one loop in perturbation theory, with the full $m_ca$ dependence at zero recoil, but $m_ca=0$ at nonzero recoil. Of course, these simplifications introduce a variety of errors that must be kept under control. The truncation in the perturbative expansion is expected to be small, because $\alpha _s$ ranges in our case from 0.20 to 0.35. In addition, the coefficients of the expansion are small due to several cancelations [50]. The errors coming from the other two approximations are estimated in Appendix B and taken into account accordingly.

Table 1 List of ensembles used in this work. The columns, from left to right, list the approximate lattice spacing, the scale-setting parameter $r_1/a$ in lattice units, the ratio $am_l/am_s$ between the light- and the strange-quark masses, the spatial length of the lattice in fm, the mass of the lightest pseudoscalar meson $M_\pi ^P$ in MeV, the dimensionless factor $M_\pi ^P L$, the dimensions of the lattice in lattice units, the total sample size expressed as the number of sources $\times $ the number of configurations, and the tadpole-improvement factor $u_0$ obtained from the average plaquette

Not only does the use of ratios reduce the error in the calculation of the matching factors, but it also reduces the statistical fluctuations from the correlators. We set up the calculation in the rest frame of the B meson while the $D^*$ meson carries a momentum ${\varvec{p}}$, which determines the recoil w. The first ratio is the nonzero-recoil version of the double ratio [30],

$$\begin{aligned} R^2_{A_1} = \frac{\langle D^*({\varvec{p}}_\bot )|A_j|B({\varvec{0}})\rangle \langle B({\varvec{0}})| A_j |D^*({\varvec{p}}_\bot )\rangle }{\langle D^*({\varvec{0}})|V^4|D^*({\varvec{0}})\rangle \langle B({\varvec{0}})| V^4 |B({\varvec{0}})\rangle } , \end{aligned}$$

(15)

where the $\bot $ symbol in the momentum ${\varvec{p}}_\bot $ indicates that the polarization of the $D^*$ is aligned with the current and perpendicular to the momentum (i.e., transverse polarization). A parallel symbol $\parallel $ is used for longitudinal polarization. This double ratio yields $h_{A_1}$, which is the only form factor that survives at zero recoil.

The following single ratios

$$\begin{aligned} X_V&= \frac{\langle D^*({\varvec{p}}_\bot )| V_j |B({\varvec{0}})\rangle }{\langle D^*({\varvec{p}}_\bot )| A_j |B({\varvec{0}})\rangle } , \end{aligned}$$

(16)

$$\begin{aligned} X_0&= \frac{\langle D^*({\varvec{p}}_\parallel )| A^4 |B({\varvec{0}})\rangle }{\langle D^*({\varvec{p}}_\bot )| A_j |B({\varvec{0}})\rangle } , \end{aligned}$$

(17)

$$\begin{aligned} X_1&= \frac{\langle D^*({\varvec{p}}_\parallel )| A_j |B({\varvec{0}})\rangle }{\langle D^*({\varvec{p}}_\bot )| A_j |B({\varvec{0}})\rangle } , \end{aligned}$$

(18)

yield the remaining form factors. Last, the ratio [52, 53]

$$\begin{aligned} x_f = \frac{\langle D^*({\varvec{p}}_{(\alpha )})| V_j |D^*({\varvec{0}})\rangle }{\langle D^*({\varvec{p}}_{(\alpha )})| V^4 |D^*({\varvec{0}})\rangle } \end{aligned}$$

(19)

yields the recoil parameter w and involves only the flavor-diagonal transition $D^*\rightarrow D^*$. Here $(\alpha )=\bot ,\parallel $ is the polarization, and must be the same in numerator and denominator to achieve the right cancelation of form factors.

The ratio in Eq. (19) yields the three-velocity,

$$\begin{aligned} x_f = \frac{v_{D^*}}{w+1}, \end{aligned}$$

(20)

from which it is straightforward to calculate w. In the B-meson rest frame (${\varvec{v}}_B={\varvec{0}}$),

$$\begin{aligned} w = \frac{1 + x_f^2}{1 - x_f^2}. \end{aligned}$$

(21)
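Eq. (21) follows from Eq. (20) because, in the B-meson rest frame, $|{\varvec{v}}_{D^*}|^2 = w^2-1$, so that $x_f^2 = (w-1)/(w+1)$. A short sketch of this conversion is given below. The function names are hypothetical, and the second helper assumes an exact continuum dispersion relation, which lattice data satisfy only up to discretization effects.

```python
import math

def recoil_from_xf_sq(xf_sq):
    """Eq. (21): recoil parameter w from the squared ratio x_f^2."""
    return (1.0 + xf_sq) / (1.0 - xf_sq)

def xf_sq_from_momentum(p_sq, mass):
    """Expected x_f^2 for a D* of given mass and squared momentum p_sq,
    assuming the continuum relation w = sqrt(1 + p^2 / M^2)."""
    w = math.sqrt(1.0 + p_sq / mass**2)
    return (w - 1.0) / (w + 1.0)  # equals |v|^2 / (w + 1)^2 with |v|^2 = w^2 - 1
```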

The ratio $X_1$ defined in Eq. (18) can be used to extract $h_{A_3}(w)$ as

$$\begin{aligned} X_1(w) = w - \frac{(w^2-1)h_{A_3}(w)}{(w+1)h_{A_1}(w)}. \end{aligned}$$

(22)

The matching factors for these two ratios, $x_f$ and $X_1$, are $\rho = 1 + O(\alpha _s^2)$, and thus no renormalization is required at LO. In contrast, the remaining ratios require several nontrivial matching factors,

$$\begin{aligned} X_0 (w)&= \frac{\rho _{A_j}}{\rho _{A^4}} \sqrt{w^2-1} \left( 1 - \frac{h_{A_2}(w) + wh_{A_3}(w)}{(w+1)h_{A_1}(w)}\right) , \end{aligned}$$

(23)

$$\begin{aligned} X_V (w)&= \frac{\rho _{A_j}}{\rho _{V_j}} \frac{\sqrt{w-1}}{\sqrt{w+1}}\frac{h_V(w)}{h_{A_1}(w)}, \end{aligned}$$

(24)

$$\begin{aligned} R_{A_1}(w)&= \frac{w+1}{2}\frac{h_{A_1}(w)}{\rho _{A_j}}. \end{aligned}$$

(25)

From these equations, it is quite easy to extract all form factors as a function of the ratios defined in Eqs. (15)–(19),

$$\begin{aligned} h_{A_1}(w)&= \rho _{A_j}\frac{2R_{A_1}}{w+1}, \end{aligned}$$

(26)

$$\begin{aligned} h_{A_2}(w)&= \rho _{A_j}\frac{2R_{A_1}}{w^2-1}\left( wX_1 - \sqrt{w^2-1}\frac{\rho _{A^4}}{\rho _{A_j}}X_0 - 1\right) , \end{aligned}$$

(27)

$$\begin{aligned} h_{A_3}(w)&= \rho _{A_j}\frac{2R_{A_1}}{w^2-1}(w - X_1), \end{aligned}$$

(28)

$$\begin{aligned} h_V (w)&= \rho _{A_j}\frac{2R_{A_1}}{\sqrt{w^2-1}}\frac{\rho _{V_j}}{\rho _{A_j}}X_V. \end{aligned}$$

(29)

These expressions determine the four form factors up to discretization and matching errors.
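The inversion in Eqs. (26)–(29) can be checked numerically against the forward relations in Eqs. (22)–(25). The sketch below is illustrative only: the function and argument names are assumptions, the matching factors default to 1, and the expressions are valid only for $w>1$ because of the $1/(w^2-1)$ factors.

```python
import math

def ratios_from_form_factors(w, hA1, hA2, hA3, hV, rho_Aj=1.0, rho_A4=1.0, rho_Vj=1.0):
    """Forward relations, Eqs. (22)-(25): return (R_A1, X_0, X_1, X_V)."""
    R_A1 = 0.5 * (w + 1.0) * hA1 / rho_Aj
    X_1 = w - (w * w - 1.0) * hA3 / ((w + 1.0) * hA1)
    X_0 = (rho_Aj / rho_A4) * math.sqrt(w * w - 1.0) * (
        1.0 - (hA2 + w * hA3) / ((w + 1.0) * hA1)
    )
    X_V = (rho_Aj / rho_Vj) * math.sqrt((w - 1.0) / (w + 1.0)) * hV / hA1
    return R_A1, X_0, X_1, X_V

def form_factors_from_ratios(w, R_A1, X_0, X_1, X_V, rho_Aj=1.0, rho_A4=1.0, rho_Vj=1.0):
    """Inverse relations, Eqs. (26)-(29): return (hA1, hA2, hA3, hV) for w > 1."""
    prefactor = 2.0 * rho_Aj * R_A1
    root = math.sqrt(w * w - 1.0)
    hA1 = prefactor / (w + 1.0)
    hA2 = prefactor / (w * w - 1.0) * (w * X_1 - root * (rho_A4 / rho_Aj) * X_0 - 1.0)
    hA3 = prefactor / (w * w - 1.0) * (w - X_1)
    hV = prefactor / root * (rho_Vj / rho_Aj) * X_V
    return hA1, hA2, hA3, hV
```

Feeding the output of the first function into the second returns the original form factors, which is a convenient consistency check on the signs and factors of $\rho $ in Eqs. (26)–(29).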

Table 2 Parameters used on each ensemble to generate the propagators for the valence heavy quarks c and b. The approximate lattice spacing and the masses of the sea quarks (light and strange) in the first two columns identify the ensemble. The remaining columns show the clover term coefficient $c_{\textrm{SW}}$, the bare hopping parameter $\kappa $, the rotation parameter $d_1$ of the Fermilab action, and the values of the available source/sink Euclidean-time separations T in lattice units for the computed three-point correlators. The prime on $\kappa $ indicates that this is the value used in the simulation, as opposed to the physical (tuned) value; see Appendix C

3 Analysis

3.1 Lattice setup

In this analysis, we use 15 ensembles of gauge-field configurations, generated by the MILC Collaboration [54,55,56]. These ensembles include three flavors of asqtad-improved staggered sea quarks at five different lattice spacings, ranging from 0.15 fm in the coarsest case to 0.045 fm in the finest case. The mass of the strange sea quark is tuned to be close to its physical value, while the two light sea-quark masses are set equal, and cover a range of values that correspond to pion masses from $M_\pi \approx 560$ MeV to $M_\pi \approx 180$ MeV. The simulation parameters of all ensembles employed in this analysis are given in Table 1, while Fig. 1 provides a visual summary of the range of lattice spacings, sea-quark light-to-strange-mass ratios, and number of statistical samples.

In the light sector, we use the same value for the masses of the valence and sea quarks. The heavy quarks employ the clover action with the Fermilab interpretation, and since the regularization used for the light quarks has a different Dirac structure, we promote the staggered propagators to “naive” ones, so we can apply the standard Dirac spin algebra and combine them with the Wilson-like heavy quark propagators to construct heavy-light mesons [57]. The heavy quark masses are tuned so that the kinetic masses of the $D_s$ and the $B_s$ mesons are equal to their physical values (see Appendix C of Ref. [22]). In Table 2, we gather the parameters we used to calculate the heavy quark propagators for each ensemble. The simulation values chosen for the heavy quark masses to generate the meson correlators are close to, but not exactly the same as, our best-tuned values, which were determined a posteriori. Hence we apply a correction to the form factors to account for this slight mistuning, as described in Appendix C.

3.2 Correlation functions

The two- and three-point correlation functions are calculated using four sources, equally-spaced in time, except for the case of the coarsest ensemble that employs 24 sources. The sources are randomly shifted in space and time from one configuration to another in order to reduce correlations between successive gauge-field configurations within the same ensemble. A standard blocking analysis of the correlator data, ranging from block size 1 to block size 8, reveals that the autocorrelations in our ensembles are negligible and that the errors in the correlator points stay approximately constant as we increase the block size, in line with our previous analyses that employed the same gauge configurations, fermion formulations, and source set-up [22, 39,40,41,42, 53, 58, 59]. Therefore, we do not block the data in this work, and the correlators are processed through a single-elimination jackknife.
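The blocking test and the single-elimination jackknife mentioned above can be sketched as follows. This is a generic textbook implementation for a scalar observable, not the analysis code; the function names are hypothetical, and a real analysis would apply the same resampling to full correlator vectors.

```python
import math

def block_average(samples, block_size):
    """Average consecutive samples in blocks; leftover samples at the end are dropped."""
    n_blocks = len(samples) // block_size
    return [
        sum(samples[b * block_size:(b + 1) * block_size]) / block_size
        for b in range(n_blocks)
    ]

def jackknife(samples, estimator):
    """Single-elimination jackknife: return (estimate, error) for estimator(samples)."""
    n = len(samples)
    # One replica per omitted sample.
    replicas = [estimator(samples[:i] + samples[i + 1:]) for i in range(n)]
    replica_mean = sum(replicas) / n
    variance = (n - 1) / n * sum((rep - replica_mean) ** 2 for rep in replicas)
    return estimator(samples), math.sqrt(variance)
```

A blocking study then amounts to calling `jackknife(block_average(data, b), estimator)` for increasing `b` and checking that the error stays approximately constant, which is the behavior reported for these ensembles.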

Two previous analyses with the asqtad ensembles [31, 60] found that blocking the configurations by 4 or 8 was necessary in order to suppress autocorrelations. However, these analyses refer either to global observables (the topological susceptibility), or did not use the randomization procedure for the sources, which greatly reduces the autocorrelations in our data.

Given that one of our ensembles has a very fine lattice spacing $a\approx 0.045$ fm, one might be worried about topology freezing and its effect on the final results for the form factors. We did not perform a topology freezing analysis in the asqtad ensembles, but we expect the behavior to be similar to that of the HISQ ensembles. Based on Refs. [61, 62], we expect topology freezing to introduce a negligible bias in the chiral-continuum limit of the form factors.

The correlation functions described in the following two subsections contain the desired ground-state matrix elements, energies and form factors, but they also include contributions from excited states, which we must remove. For this purpose, we use two different kinds of interpolating operators per source: a local operator d and a smeared operator based on the Richardson 1S wave function [63, 64], and therefore we fix the configuration to Coulomb gauge. For each meson, the radius of the smearing operator is the same in physical units for all ensembles. We refer the reader to Ref. [64] for further details. The smeared operator increases the overlap with the ground state, allowing for a more precise determination of the lowest energy level and its overlap factors. The inclusion of a local operator gives us a useful handle on the excited states. Further, to quantify excited-state contributions and obtain robust estimates of the associated uncertainties, we use Bayesian constraints with Gaussian priors and fit functions that include varying numbers of excited states [65].

To implement the Bayesian constraints, we follow the procedure of Appendix B of Ref. [58]. We minimize the augmented $\chi ^2_{\textrm{aug}}$, as defined in Eq. (B3) of Ref. [58], but for the goodness of fit use the data-only $\chi ^2$ (evaluated at the minimum of $\chi ^2_{\textrm{aug}}$) and subtract the number of parameters from the number of data. Below we refer to this $\chi ^2$ and counting of degrees of freedom as the deaugmented $\chi ^2/\textrm{dof}$. In our experience, the p value calculated with a $\chi ^2$ and a number of dof as defined in Eq. (B5) of Ref. [58] is a good indicator of goodness of fit, but it has not been proven rigorously to follow a uniform distribution. When calculating p values, we further process the $\chi ^2$/dof ratio to take into account finite sample size [66].
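The bookkeeping described above can be illustrated with a short sketch. It assumes the standard construction in which $\chi ^2_{\textrm{aug}}$ is the data $\chi ^2$ plus a Gaussian penalty for each prior. Eqs. (B3) and (B5) of Ref. [58] are not reproduced here, so the exact form used in the analysis may differ, and the finite-sample correction of Ref. [66] is omitted. All function names are hypothetical.

```python
def chi2_data(residuals, inv_cov):
    """Data-only chi^2 = r^T C^{-1} r, with inv_cov given as a nested list."""
    n = len(residuals)
    return sum(
        residuals[i] * inv_cov[i][j] * residuals[j]
        for i in range(n)
        for j in range(n)
    )

def chi2_prior(params, prior_means, prior_widths):
    """Gaussian prior penalty, summed over all fit parameters (assumed form)."""
    return sum(
        ((p - mean) / width) ** 2
        for p, mean, width in zip(params, prior_means, prior_widths)
    )

def chi2_augmented(residuals, inv_cov, params, prior_means, prior_widths):
    """Quantity minimized by the fit (assumed form): data chi^2 plus prior penalty."""
    return chi2_data(residuals, inv_cov) + chi2_prior(params, prior_means, prior_widths)

def deaugmented_chi2_per_dof(residuals, inv_cov, n_params):
    """Goodness of fit quoted in the text: the data-only chi^2, evaluated at the
    minimum of the augmented chi^2, divided by (number of data - number of parameters)."""
    dof = len(residuals) - n_params
    return chi2_data(residuals, inv_cov) / dof
```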

3.3 Two-point functions

The $D^*$ and B-meson two-point functions are needed to extract the overlap factors and the energy states, for these are required inputs for the ratio fits. The two-point functions are constructed using interpolating operators ${\mathcal {O}}_{Y_a}({\varvec{p}}, t)$, where $Y=\{B,D^*\}$ is the meson of interest, $a=\{d,1S\}$ represents the smearing (point and Richardson), t is the time and ${\varvec{p}}$ is the spatial momentum. These operators are constructed with the same quantum numbers as a pseudoscalar for $Y=B$ and a vector for $Y=D^*$. In terms of the interpolating operators, the two-point correlators are then

$$\begin{aligned} C^{\text {2pt}}_{Y_a\rightarrow Y_b}({\varvec{p}}, t) = \left\langle {\mathcal {O}}_{Y_b}({\varvec{p}},t) \,{\mathcal {O}}_{Y_a}^\dagger ({\varvec{p}}, 0)\right\rangle . \end{aligned}$$

(30)

Inserting a complete set of states between the interpolating operators, we obtain the spectral decomposition:

$$\begin{aligned} C^{\text {2pt}}_{Y_a\rightarrow Y_b}({\varvec{p}}, t)&= \sum _n s_n(t)\frac{\sqrt{Z_{Y_a,n}({\varvec{p}})\, Z_{Y_b,n}({\varvec{p}})}}{2E_n({\varvec{p}})}\nonumber \\&\quad \times \left( e^{-E_n({\varvec{p}})t} + e^{-E_n({\varvec{p}})(L_t-t)}\right) , \end{aligned}$$

(31)

with $\sqrt{Z_{Y_{a,b}}({\varvec{p}})}$ the overlap factors, $L_t$ the temporal extent of our lattice and $s_n(t)$ the extra sign that arises due to the presence of particles with the opposite parity in the staggered regularization for the fermions,

$$\begin{aligned} s_n(t) = \left\{ \begin{matrix} 1 &{} \text {Correct parity} \\ -(-1)^{t} &{} \text {Opposite parity} \end{matrix}\right. . \end{aligned}$$

(32)

Most correlators are available in four different configurations according to the smearing of the source and the sink: d-d, d-1S, 1S-d and 1S-1S. In the case of the $D^*$ meson, ten different momenta are available, namely, (0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 1, 1), (2, 0, 0), (2, 1, 0), (2, 2, 0), (2, 2, 1), (3, 0, 0), and (4, 0, 0) in $2\pi /L$ units. Of these, only (0, 0, 0), (1, 0, 0) and (2, 0, 0) are used to calculate the form factors; the rest allow us to calculate the dispersion relation of the $D^*$. For (1, 0, 0) and (2, 0, 0), two different orientations of the momenta are considered, namely, parallel and perpendicular to the $D^*$ polarization.

The outline of the analysis of the two-point functions, explained in detail in the following subsections, is as follows. First the zero-momentum correlators are fit using phenomenological guidance for the prior central values. For the ground state, we set a prior similar to the physical mass of the mesons, and the excited states differ by $\varDelta E=0.5$ GeV. The prior widths are large enough to accommodate significant departures from these assumptions. These choices for the central value and widths of the priors are such that they have no influence on the fit result for the ground states. In fact, we consider several variations of the energy priors to verify that their only function is to guarantee the stability of the fits without influencing the ground-state fit parameters. The results of the zero-momentum fits are used to construct priors for the dispersion-relation fits. In particular, the ground-state energies are expected to follow the continuum dispersion relation, and the overlap factors for the local operators should be approximately constant, barring, in both cases, discretization effects. Using data for a variety of momenta, we fit the ground-state energies to a dispersion-relation expression that includes discretization terms, see Eq. (36). The resulting fit is used to calculate a prior for the energy of the ground state of the two nonzero-momentum correlators.

3.3.1 Two-point function fits

For the two-point function fits, we employ the form

$$\begin{aligned} C^{\text {2pt}}_{Y_a\rightarrow Y_b}(t)&= \sum _{i=0,1} (-1)^{i(t+1)} {\mathcal {Z}}_{i,a}{\mathcal {Z}}_{i,b}\left( e^{-E_i t} + e^{-E_i (L_t-t)}\right) \nonumber \\&\quad + \sum _{i=2}^{2N-1} (-1)^{i(t+1)} {\mathcal {Z}}^2_{i,ab}\left( e^{-E_i t} + e^{-E_i (L_t-t)}\right) , \end{aligned}$$

(33)

where the state oscillates in time for odd i, but not for even i. This kind of fit is denoted as $N+N$, meaning we include N nonoscillating and N oscillating states. Both oscillating and nonoscillating excited states are fitted as the logarithm of the energy difference $\varDelta E_i = E_i - E_{i-2}$ in order to avoid the collapse of two energy levels. In the higher states, we never interrelate energies of oscillating and nonoscillating states, i.e., the fitted $\varDelta E_i$ always refer to the difference between two states of the same type. The overlap factors in Eq. (31) are included in the fit function via

$$\begin{aligned} {\mathcal {Z}}_j = \sqrt{Z_j / 2E_j}. \end{aligned}$$

(34)

For the ${\mathcal {Z}}$ factors of the ground states, we also use a logarithm, forbidding the possibility ${\mathcal {Z}}\le 0$.
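As a minimal sketch of the parametrization described above, the $N+N$ fit function of Eq. (33) can be written as follows. The function and argument names are illustrative and are not taken from the analysis code; we assume integer $t$ in lattice units, ground-state energies passed directly, excited-state splittings passed as $\log \varDelta E_i$, and ground-state overlaps passed as $\log {\mathcal {Z}}$.

```python
import math

def build_energies(E_ground, log_dE):
    """Return [E_0, E_1, E_2, ...] from the two ground-state energies and
    the logarithms of the splittings dE_i = E_i - E_{i-2} (i = 2, 3, ...)."""
    energies = list(E_ground)  # E_0 (nonoscillating), E_1 (oscillating)
    for x in log_dE:
        # exp(x) > 0, so a level can never collapse onto the one below it
        energies.append(energies[-2] + math.exp(x))
    return energies

def two_point(t, L_t, E_ground, log_dE, log_Z_a, log_Z_b, Z2_excited):
    """Evaluate the right-hand side of Eq. (33) at integer time t.

    log_Z_a, log_Z_b : logs of the ground-state overlaps for smearings a, b
    Z2_excited       : amplitudes Z^2_{i,ab} for i = 2, ..., 2N-1
    """
    energies = build_energies(E_ground, log_dE)
    total = 0.0
    for i, E in enumerate(energies):
        sign = (-1) ** (i * (t + 1))  # odd i oscillates in t, even i does not
        if i < 2:
            amp = math.exp(log_Z_a[i]) * math.exp(log_Z_b[i])
        else:
            amp = Z2_excited[i - 2]
        total += sign * amp * (math.exp(-E * t) + math.exp(-E * (L_t - t)))
    return total
```

Fitting the logarithm of each splitting keeps the levels ordered within the oscillating and nonoscillating towers separately, which is the purpose stated in the text.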

We perform joint fits of all available correlators for a given combination of meson and momentum. That gives us three correlators corresponding to the d-d, 1S-1S, and the crossed average between the d-1S and the 1S-d operators. In the cases where we distinguish between different orientations of the polarization of the $D^*$ meson with respect to its momentum, the total number of correlators increases to six. The fitter uses the covariance matrix of the whole set of data, where the fit parameters are constrained with Gaussian priors. The prior central values for the energy levels in the fit functions for the zero-momentum correlators are guided by the experimental values for the meson mass in question and an empirical analysis of the data. The prior width of the physical (oscillating) ground state is chosen to be 140 MeV (520 MeV). In the fit functions for the nonzero-momentum correlators used for the dispersion relation, the ground-state energy prior central values are set equal to $\sqrt{M^2 + {\varvec{p}}^2}$, where M is the posterior ground state energy from the fit to the corresponding zero-momentum correlator. The width of the prior is enlarged to encompass the expected discretization errors $O(\alpha _s a^2p^2)$. The prior central value for the energy difference between two neighboring oscillating or nonoscillating states is taken to be 0.5 GeV. Their widths vary with the ensemble, but they are always larger than 0.2 GeV. The fit functions for the nonzero-momentum correlators employed in the three-point function analysis, namely momenta $2\pi (1,0,0)/L$ and $2\pi (2,0,0)/L$, use the dispersion-relation results as priors for the ground-state energy.

The energy levels are constrained to be the same across smearings, but the overlap factors are different, and they are represented with different parameters. For the ground states of the crossed average, we do not fit the ${\mathcal {Z}}_j$ amplitudes, but we impose the exact constraint

$$\begin{aligned} {\mathcal {Z}}^2_{d,1S} = {\mathcal {Z}}_{1S}{\mathcal {Z}}_d. \end{aligned}$$

(35)

The ${\mathcal {Z}}_j$ amplitudes of the excited states of the crossed average are treated as separate fit parameters as they may describe a mix of excited states. Our ${\mathcal {Z}}_j$ amplitudes are also allowed to depend on the orientation of the momentum, when applicable, and Eq. (35) applies independently to each orientation. The priors for the ${\mathcal {Z}}_j$ factors of the ground states follow a log-normal distribution. Their central values are estimated following an empirical examination of the data, and their widths are large enough to accommodate significant departures from those original choices, roughly within one order of magnitude. In particular, the width of the physical ground state amplitude prior is set to 0.5 for all ensembles, and the width of the posterior is usually 20 times smaller. The prior is enlarged for the nonzero momentum correlators by a factor $\approx (1+2\alpha _s p^2)$. In the case of the oscillating ground state, the width of the amplitude is set to 1.2 for the zero momentum correlators and 2.0 for the nonzero momentum ones, with a typical posterior width of 0.5. In contrast, we use a Gaussian distribution for the excited-state priors. The width is fixed to be 3.5 for all ensembles, whereas the width of the resulting posteriors is typically an order of magnitude smaller. We test thoroughly that the fit results for the ground states are largely unaffected by the choice of priors and prior widths, as long as the fit remains stable.

The fit ranges are chosen following a systematic procedure: $t_{\text {Max}}$ is chosen such that the correlator points for $t<t_{\text {Max}}$ have fractional errors smaller than $\approx 20$–30%. In this way, the covariance matrix is not contaminated by excessive noise, but the value of $t_{\text {Max}}$ becomes ensemble dependent. However, the correlator fits are generally insensitive to variations of $t_{\text {Max}}$ within this constraint. In contrast, $t_{\text {Min}}$ is chosen to have the same value in physical units for all ensembles and momenta. We do this because we expect the degree to which excited states influence the fit to depend on their physical separation from the ground state. We apply the following four criteria to select the best $t_{\text {Min}}$ value: (1) when including all ensembles and momenta, the p value must follow a sufficiently flat distribution for 15 ensembles, (2) the $2+2$ and the $3+3$ fits must agree on the nonoscillating ground-state energy and overlap factors, (3) the fit result must be stable under small variations of the fit range, and (4) the product ${\mathcal {Z}}_d\sqrt{2E} = Z_d$ for the overlap factor of the ground state should be approximately independent of the momentum, barring discretization effects. When these conditions are all fulfilled, we consider that the systematic errors due to the omission of still further excited states have been included in the statistical fit error. This usually leaves us with a small range of possible values for $t_{\text {Min}}$; among those, we choose the one that best complies with all these conditions. The selected values are listed in Table 3. An example of the level of agreement that is reached between our $2+2$ and $3+3$ two-point correlator fits is shown in Table 4.
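The $t_{\text {Max}}$ criterion stated at the beginning of the preceding paragraph can be illustrated with a short sketch. The threshold of 0.25 is an assumed value inside the quoted 20–30% window, and the function name and inputs are hypothetical; the sketch only expresses the rule that every point with $t<t_{\text {Max}}$ must have a small fractional error.

```python
def choose_t_max(means, errors, max_frac_err=0.25):
    """Return the largest t_max such that every time slice t < t_max has
    |error/mean| below max_frac_err. Index t labels the time slice."""
    for t, (mean, err) in enumerate(zip(means, errors)):
        # Stop at the first slice that is too noisy (or has a vanishing mean)
        if mean == 0 or abs(err / mean) >= max_frac_err:
            return t
    return len(means)
```

Because the signal-to-noise ratio of the correlators differs from ensemble to ensemble, a rule of this kind naturally yields an ensemble-dependent $t_{\text {Max}}$, as noted in the text.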

Previous experience [22, 39, 41, 42, 53, 58], which also applies to this study, has shown that it is better to impose the four criteria introduced above on a set of fits, rather than choosing, on a case-by-case basis, the fit with the smallest $\chi ^2/$dof, the smallest error, or some other notion of “best” fit. The case-by-case approach amplifies meaningless statistical fluctuations, which can introduce problems in subsequent steps of the analysis (here, the chiral-continuum extrapolation). Figure 2 shows the stability of 1 + 1, 2 + 2, and 3 + 3 fits on ensembles at four lattice spacings, denoting the common $t_\text {Min}$. It illustrates that we could have chosen smaller values of $t_\text {Min}$ on some ensembles, if we had adopted ensemble-by-ensemble criteria. Hence, our common $t_\text {Min}$ value is conservatively chosen.

Table 3 Fit ranges in physical units for the two-point function fits used in the analysis of the form factors. As the number of included states increases, $t_{\text {Min}}$ is reduced to include information from the rapidly decaying excited states in the fit. For the coarsest ensembles and momentum $2\pi (4,0,0)/L$, the fit ranges do not yield enough points to perform a fit. In those cases the $2\pi (4,0,0)/L$ point is simply dropped


Table 4 Results for the $D^*$-meson two-point correlator fits on the $a\approx 0.12$ fm, $m_l=0.14m_s$ ensemble. We compare the nonoscillating ground-state energy and overlap factors for the $2+2$ and $3+3$ state fits. We include here all fits that distinguish between the different orientations of the momentum. In the analysis we use the $3+3$ state fit result


3.3.2 The dispersion relation

The calculation of the dispersion relation serves two purposes: first, we can estimate a good prior for the two-point functions that enter in the analysis of the form factors; second, by checking the size of the deviations from the continuum dispersion relation, we can test whether the discretization errors due to the heavy quarks are under control. The dispersion relation which includes discretization effects can be written as

$$\begin{aligned} a^2E^2({\varvec{p}})&= (aM_1)^2 + \frac{M_1}{M_2}(a{\varvec{p}})^2 \nonumber \\&\quad + \frac{1}{4}\left[ \frac{1}{(aM_2)^2} - \frac{aM_1}{\left( aM_4\right) ^3}\right] (a^2{\varvec{p}}^2)^2 \nonumber \\&\quad - \frac{aM_1w_4}{3}\sum _{i=1}^3(ap_i)^4 + O(p_i^6), \end{aligned}$$

(36)

where $M_1$ is the rest mass, $M_2$ is the kinetic mass, and $M_4$ is a further mass-like quantity. A key observation of Ref. [49] is that the matching of the relativistic Wilson action via HQET or NRQCD to continuum QCD removes discretization effects that grow uncontrollably with aM. In Eq. (36) discretization effects are described by the coefficients of the $(a{\varvec{p}})^n$ terms, parameterized by $w_4$, $M_1$, $M_2$ and $M_4$, for which explicit expressions are given in Ref. [49]. We tune the kinetic mass $M_2$ to match the experimentally observed mass, according to the nonrelativistic interpretation of the clover action [49].^{Footnote 1}
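A minimal sketch of how Eq. (36) can be evaluated for a given lattice momentum is shown below. The function name and arguments are illustrative; we assume the momentum is given as integer components $n_i$ with $ap_i = 2\pi n_i/L$, where $L$ is the spatial extent in lattice units, and that terms of $O(p_i^6)$ are dropped.

```python
import math

def lattice_energy_squared(n, L, aM1, aM2, aM4, w4):
    """Right-hand side of Eq. (36), truncated at O(p^4).

    n   : integer momentum components (n_x, n_y, n_z)
    L   : spatial lattice extent in lattice units
    aM1 : rest mass, aM2 : kinetic mass, aM4 : further mass-like quantity
    w4  : coefficient of the rotation-breaking sum of (a p_i)^4
    """
    ap = [2.0 * math.pi * ni / L for ni in n]
    ap2 = sum(x * x for x in ap)   # (a p)^2
    ap4 = sum(x ** 4 for x in ap)  # sum_i (a p_i)^4
    return (aM1 ** 2
            + (aM1 / aM2) * ap2
            + 0.25 * (1.0 / aM2 ** 2 - aM1 / aM4 ** 3) * ap2 ** 2
            - (aM1 * w4 / 3.0) * ap4)
```

Setting $M_1=M_2=M_4$ and $w_4=0$ makes the bracket in the third term vanish, so the expression reduces to the continuum form $E^2 = M^2 + {\varvec{p}}^2$, which provides a simple check of the implementation.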

The leading $O(a^2)$ discretization effects are due to $M_1/M_2\sim 1$. Expectations for this ratio can be inferred from perturbation theory for the quark masses [68] and by tracing contributions to the binding energy [69, 70]. On this basis, we expect $M_1/M_2$ to be $1+O(\alpha _s,(am_{0c})^2)$, and we would like to test whether the leading deviation from the continuum dispersion relation, $E^2={\varvec{p}}^2+M^2$, grows as $O(\alpha _s a^2p^2)$. We can check whether our nonzero momentum fits show deviations of order $O(\alpha _s a^2p^2)$ from the continuum dispersion relation, and we can also fit the energies from our correlator fits to Eq. (36), considering the coefficients in front of the powers of momenta as fit parameters. These results are used to guide the prior central values for the ground-state energies of the two-point correlators with nonzero momentum that are part of the three-point analysis which yields the form factors. In order to make this prior independent of the form-factor data, we exclude the $p=2\pi (1,0,0)/L$ and $2\pi (2,0,0)/L$ momenta from the dispersion-relation calculation. As explained in Sect. 3.3, data for different polarizations of the $D^*$ meson are available only for $p=2\pi (1,0,0)/L$ and $2\pi (2,0,0)/L$. Therefore, these are the only momenta for which we obtain the form factors at nonzero recoil.

Table 5 Results for the $D^*$-meson two-point correlator fits on the $a=0.12$ fm, $m_l=0.14m_s$ ensemble. We compare the nonoscillating ground-state energy and overlap factors for the $2+2$ and $3+3$ state fits. We include here all fits that did not distinguish between the different orientations of the momentum. Momentum $2\pi (4,0,0)/L$ did not have enough degrees of freedom left for the $3+3$ state fit, and hence it is not shown. In the analysis we use the $3+3$ state fit result


In Table 5, we show the results of our two-point correlator fits that enter in the dispersion-relation fit for a particular ensemble. There is good agreement between the $2+2$ and the $3+3$ state fits, indicating that the systematic error from the omission of higher states is negligible. Results for other ensembles show a similar behavior. Figure 3 compares the continuum dispersion relation with our data. The data points show small discretization errors, which tells us that, indeed, these errors are under control.

3.4 Three-point functions

With our previously defined interpolating operators, we can also construct three-point correlators by sandwiching a current between two meson states,

$$\begin{aligned} C^{J^\mu }_{X_a\rightarrow Y_b}({\varvec{p}}, t) = \left\langle {\mathcal {O}}_{Y_b}({\varvec{0}}, T)J^\mu ({\varvec{p}}, t)\,{\mathcal {O}}_{X_a}^\dagger (-{\varvec{p}},0)\right\rangle . \end{aligned}$$

(37)

Using the same notation as in Eq. (31), we can write the spectral decomposition of the three-point correlators for a particular source-sink separation T as

$$\begin{aligned} C^{J^\mu }_{X_a\rightarrow Y_b}({\varvec{p}}, t)&= \sum _{n,m} s_n(t)\,s_m(T - t)\sqrt{Z_{Y_b,n}({\varvec{p}})}\frac{e^{-E_n({\varvec{p}})t}}{2E_n({\varvec{p}})} \nonumber \\&\quad \times \left\langle Y_b, n, {\varvec{p}}|J^\mu | X_a, m, {\varvec{0}}\right\rangle \nonumber \\&\quad \times \sqrt{Z_{X_a,m}({\varvec{0}})}\frac{e^{-M_m(T-t)}}{2M_m}, \end{aligned}$$

(38)

where we choose $t < T \ll L_t$, such that wraparound terms with $t\rightarrow L_t - t$ and $T - t \rightarrow L_t - (T - t)$ in the exponent are completely negligible, at most $\sim 10^{-16}$.

In our three-point functions, we always use a Richardson 1S smearing for the B meson, but the $D^*$ meson operator is either 1S-smeared or point d. This gives a variety of possibilities for constructing ratios of correlators. For $x_f$ we use

$$\begin{aligned} x_f({\varvec{p}}, t, T) = \frac{C^{V_j}_{D^*_{1S}\rightarrow D^*_a}({\varvec{p}}_{\bot ,\parallel },t,T)}{C^{V_4}_{D^*_{1S} \rightarrow D^*_a}({\varvec{p}}_{\bot ,\parallel },t,T)}, \end{aligned}$$

(39)

where $a = d,1S$ and the orientation of the momentum can be arbitrary, as long as it is the same for the correlator in the numerator and denominator. These combinations cancel the leading overlap factors and exponentials. The same cancelation can be achieved in $X_V$,

$$\begin{aligned} X_V({\varvec{p}}, t, T) = \frac{C^{V_j}_{B_{1S}\rightarrow D^*_a}({\varvec{p}}_\bot ,t,T)}{C^{A_j}_{B_{1S}\rightarrow D^*_a}({\varvec{p}}_\bot ,t,T)}. \end{aligned}$$

(40)

In these two ratios we can find the desired matrix element in the limit $t \gg 0$ and $T - t \gg 0$, with $t < T$. The double ratio and the other two single ratios can be expressed in the same way

$$\begin{aligned} X_0 ({\varvec{p}}, t, T)&= \frac{C^{A_4}_{B_{1S}\rightarrow D^*_a}({\varvec{p}}_\parallel ,t,T)}{C^{A_j}_{B_{1S}\rightarrow D^*_a}({\varvec{p}}_\bot ,t,T)} \sqrt{\frac{Z_{D^*,a}(p_\bot )}{Z_{D^*,a}(p_\parallel )}}, \end{aligned}$$

(41)

$$\begin{aligned} X_1 ({\varvec{p}}, t, T)&= \frac{C^{A_j}_{B_{1S}\rightarrow D^*_a}({\varvec{p}}_\parallel ,t,T)}{C^{A_j}_{B_{1S}\rightarrow D^*_a}({\varvec{p}}_\bot ,t,T)} \sqrt{\frac{Z_{D^*,a}(p_\bot )}{Z_{D^*,a}(p_\parallel )}}, \end{aligned}$$

(42)

$$\begin{aligned} R_{A_1}({\varvec{p}}, t, T)&= \frac{C^{A_j}_{B_{1S}\rightarrow D^*_a}({\varvec{p}}_\bot ,t,T) \, C^{A_j}_{D^*_a\rightarrow B_{1S}}({\varvec{p}}_\bot ,t,T)}{C^{V^4}_{D^*_a\rightarrow D^*_{1S}}({\varvec{0}},t,T) \, C^{ V^4 }_{B_{1S}\rightarrow B_{1S}}({\varvec{0}},t,T)} \nonumber \\&\quad \times \frac{Z_{D^*,a}(p_\bot )}{\sqrt{Z_{D^*,a}(0)\,Z_{D^*,1S}(0)}}\nonumber \\&\quad \times \frac{M^2_{D^*}}{E^2_{D^*}({\varvec{p}})} e^{-(E_{D^*}({\varvec{p}}) - M_{D^*})T}, \end{aligned}$$

(43)

but in this case the computed ratios depend on extra factors that must be removed before extracting the matrix elements. The overlap factors are removed per jackknife bin using the results of the two-point correlator fits. In this way we can propagate correlations from one fit to the other. The $M_{D^*}/E_{D^*} = 1/w$ factor in Eq. (43) is removed using the value of the recoil parameter, as extracted from Eq. (21) per jackknife bin.

The double ratio $R_{A_1}$ deserves further comment. First, we reanalyze the zero-momentum correlators [22] using the criteria given above. That also implies the double ratio $R_{A_1}({\varvec{p}}={\varvec{0}})$ is constructed only for the $a=1S$ smearing. Second, we did not generate three-point functions at nonzero momentum of the form $C^{A_j}_{D^*_a\rightarrow B_{1S}}({\varvec{p}}_\bot ,t,T)$, so we use the time reversal operation ${\mathcal {T}}$ to obtain the missing correlator,

$$\begin{aligned} C^{A_j}_{B_{1S}\rightarrow D^*_a}({\varvec{p}}_\bot ,t,T) \xrightarrow {{\mathcal {T}}} C^{A_j}_{D^*_a\rightarrow B_{1S}}({\varvec{p}}_\bot ,T-t,T). \end{aligned}$$

(44)

3.4.1 Three-point function fits

The three-point functions are also affected by the oscillating states introduced by the staggered regularization. So are the ratios constructed with such three-point correlators, but the dependence on the oscillating states is not as clean as in the case of the two-point functions. Our ratios do not show any noticeable oscillatory behavior in source and sink, but states that oscillate at both ends introduce a nonnegligible overall shift on the ratio central value that depends on the sink time T as $(-1)^T$. In order to remove this contribution, we smooth the data following Refs. [22, 31, 53, 73], namely, we calculate the three-point correlators at two different values of the sink time T, and then we compute the following weighted average to suppress this unwanted shift in most ratios:

$$\begin{aligned}&{\bar{R}}(t,T)\equiv \frac{1}{2}R(t,T) + \frac{1}{4}R(t,T+1) + \frac{1}{4}R(t+1,T+1), \nonumber \\&\text {with}\quad R=X_0,X_1,X_V,x_f\text { and }R_{A_1}({\varvec{p}}=0). \end{aligned}$$

(45)

The contribution of the oscillating shift is then greatly suppressed.
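The effect of the average in Eq. (45) can be checked with a small sketch. Here `R` is any callable returning the ratio at $(t,T)$; the toy ratio with a constant plateau plus a $(-1)^T$ shift is purely illustrative and is not data from this analysis.

```python
def smoothed_ratio(R, t, T):
    """Weighted average of Eq. (45) over neighboring (t, T) values."""
    return 0.5 * R(t, T) + 0.25 * R(t, T + 1) + 0.25 * R(t + 1, T + 1)

def toy_ratio(t, T, K=1.0, shift=0.1):
    """Illustrative ratio: plateau K plus a shift that flips sign with T."""
    return K + shift * (-1) ** T
```

For a contamination of the form $c\,(-1)^T$, the weights give $c\,(-1)^T(\tfrac{1}{2}-\tfrac{1}{4}-\tfrac{1}{4})=0$, so the shift drops out of the average, while a $t$- and $T$-independent plateau is left unchanged.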

The double ratio at nonzero momentum, $R_{A_1}({\varvec{p}}\ne 0)$, requires the explicit removal of the sink-dependent exponentials in order to avoid bias,

$$\begin{aligned}&{\bar{R}}_{A_1}({\varvec{p}}\ne 0,t,T)\nonumber \\ {}&\quad \equiv \frac{1}{2}R_{A_1}({\varvec{p}},t, T )e^{(E_{D^*}({\varvec{p}}) - M_{D^*})T} \nonumber \\&\qquad + \frac{1}{4}R_{A_1}({\varvec{p}},t, T+1)e^{(E_{D^*}({\varvec{p}}) - M_{D^*})(T+1)} \nonumber \\&\qquad + \frac{1}{4}R_{A_1}({\varvec{p}},t+1,T+1)e^{(E_{D^*}({\varvec{p}}) - M_{D^*})(T+1)}. \end{aligned}$$

(46)

These exponentials are removed using the energy and mass values coming from the two-point correlator fits per jackknife bin. The ratio averages defined in Eqs. (45) and (46) suppress the contributions from the unwanted oscillations to a fraction of the statistical errors. Therefore, we henceforth employ only the averaged ratios in our analysis and omit the bar for simplicity. The data are then processed through a single elimination jackknife, and the extra overlap factors and exponentials are removed by using the values obtained in the two-point correlator fits per jackknife bin. Then the ratios are fitted to the functional form:

$$\begin{aligned} R({\varvec{p}}, t, T)&= K\Big (1 + A_1 e^{-\varDelta E^1_{X}t} + A_2 e^{-\varDelta E^2_{X}t} \nonumber \\&\quad + B_1 e^{-\varDelta E^1_{Y}(T-t)} + B_2 e^{-\varDelta E^2_{Y}(T-t)}\Big ), \end{aligned}$$

(47)

where K is the matrix element we want to extract, and the extra terms take into account the presence of excited states, assuming their contribution is small. The labels X, Y represent mesons at source and sink, respectively, and $\varDelta E^j_{X,Y}$ represents the energy difference between the ground state and the $j^{\text {th}}$ excited state. The second excited states at source and sink (included in the $A_2$ and $B_2$ terms) are necessary to remove systematic errors due to unaccounted excited states. In order to check this point, we computed the ratio $x_f$ with different polarizations of the $D^*$ meson. We expect the extracted matrix elements from different polarizations to agree, except for discretization effects that should be reduced as the lattice spacing decreases. Nonetheless, our results show a difference between the analysis with a single excited state at source and sink and the analysis with two excited states at each end. The addition of extra excited states not only increases the error, as expected, but also brings the central values calculated with different polarizations closer. Overall there is a large reduction in the difference between the cases with polarization parallel and perpendicular to the momentum. This behavior depends only mildly on the lattice spacing, as can be checked in Fig. 4.
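The functional form of Eq. (47) can be sketched as follows. The names are illustrative; the lists `A`, `dE_X` describe the excited states at the source and `B`, `dE_Y` those at the sink, so that passing one or two entries per list reproduces the fits with a single or with two excited states at each end.

```python
import math

def ratio_model(t, T, K, A, B, dE_X, dE_Y):
    """Evaluate Eq. (47): K times one plus excited-state contamination.

    A, dE_X : amplitudes and energy gaps of excited states at the source
    B, dE_Y : amplitudes and energy gaps of excited states at the sink
    """
    source = sum(a * math.exp(-dE * t) for a, dE in zip(A, dE_X))
    sink = sum(b * math.exp(-dE * (T - t)) for b, dE in zip(B, dE_Y))
    return K * (1.0 + source + sink)
```

In a simultaneous fit to the two smearings, `K`, `dE_X` and `dE_Y` would be shared while each smearing keeps its own `A` and `B`.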

The two available sets of correlators with smearing operators $a=d$, 1S are fit simultaneously, so that they share $\varDelta E$ and K, but each smearing has its own $A_{1,2}$ and $B_{1,2}$. We employ a loose prior for K with a central value roughly set by the ratios at $\approx T/2$, where the excited states are more suppressed, and with a width large enough to accommodate significant variations: in the case of the double ratio $R_{A_1}$ the width of the prior is set to 0.1, whereas the other ratios use 0.05. In all cases, the width of the prior encompasses all available correlator points for the single ratios. Except for the double ratio, the excited states at source and sink carry different signs, hence we ensure the prior covers the central value of the matrix element. Typically, the posterior is almost an order of magnitude narrower than the prior, although in the least precise cases the posterior width is $\approx 60\%$ of the prior. The priors for the $\varDelta E$ of the first (second) excited states are taken from the two-point function fits with $3+3$ states, but we increase the error by a factor of three (eight) to allow $\varDelta E$ to differ from the two-point correlator values. This increase takes into account the fact that the $t_{\text {Min}}$ in the ratio fits is much smaller than in the two-point function fits, and the excited-state pattern might be different as well. The priors for $A_{1,2}$ and $B_{1,2}$ are taken to be 0(2) and 0(1) in the smeared and point source cases respectively.^{Footnote 2} As stated above, our priors are conservative enough that significant variations of their central values and/or widths do not result in a relevant change in the posterior for the matrix element.

The fit ranges are chosen following criteria similar to the two-point function case. We use the same value of $t_{\text {Min}}$ and $t_{\text {Max}}$ in physical units for all ensembles and all ratios, except for the double ratio, where we use $t_{\text {Max}}=T-t_{\text {Min}}$ to account for the fact that the states on source and sink are exactly the same. In this case, we take the same $t_{\text {Min}}$ as for the rest of the ratios. We choose the fits that show stability over small variations of the fit range and result in a reasonably flat distribution for the p values.

3.5 Calculation of the recoil parameter

As in previous work [52, 53], we use the ratio $x_f$ to define the recoil parameter, following Eq. (21). The disadvantage of this method is that it introduces systematic errors due to the renormalization of the currents. One could also use the continuum dispersion relation to define the recoil parameter:

$$\begin{aligned} w = \sqrt{\frac{M_{D^*}^2 + {\varvec{p}}^2}{M_{D^*}^2}}, \end{aligned}$$

(48)

where ${\varvec{p}}$ is the three-momentum of the $D^*$ meson, and the mass is either $M_1$ or $M_2$. The different choices for the definition of the mass are expected to result in slightly different discretization errors that are resolved in our chiral-continuum extrapolation, so the choice should not affect the final results. As shown in Fig. 5, the error in the $x_f$ method encompasses the differences in the rest- and the kinetic-mass versions of Eq. (48). In this work, we take a conservative approach and define w via Eq. (21), but we note that all choices lead to results for the form factors, $|V_{cb}|$, and $R(D^*)$ that are compatible within errors.
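For illustration, the dispersion-relation definition of Eq. (48) can be evaluated as in the following sketch. The function name is hypothetical; we assume lattice units, with $ap_i = 2\pi n_i/L$ and `aM` standing for either $aM_1$ or $aM_2$.

```python
import math

def recoil_from_dispersion(n, L, aM):
    """Recoil parameter w of Eq. (48) for integer momentum components n.

    n  : integer momentum components (n_x, n_y, n_z)
    L  : spatial lattice extent in lattice units
    aM : D* mass in lattice units (rest mass or kinetic mass)
    """
    ap2 = sum((2.0 * math.pi * ni / L) ** 2 for ni in n)
    return math.sqrt((aM ** 2 + ap2) / aM ** 2)
```

Evaluating the function once with the rest mass and once with the kinetic mass gives the two dispersion-relation variants that are compared with the $x_f$ determination.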

3.6 Current renormalization and blinding

As outlined in Sect. 2.2, the ratios described in Eqs. (39)–(43) are constructed in such a way that the flavor-diagonal renormalization factors $Z_{V^4_{cc}}Z_{V^4_{bb}}$ from Eq. (14) cancel out. Hence, what remains is only the computation of the different matching factors $\rho _X$ that enter in the ratios. These factors can be calculated using perturbation theory, but the calculation becomes cumbersome for $w>1$. In this article, we use the approximation $am_{2c}\rightarrow 0$, where $am_{2c}$ is the charm kinetic mass, which removes the dependence on w, because a light quark cannot modify the dynamics of a heavy quark in the heavy quark limit (see Appendix C and Refs. [50, 74]). Then we incorporate errors coming from these approximations. The w dependence introduces an error proportional to $w-1$, and the $am_{2c}\rightarrow 0$ approximation increases the error by $O(\alpha _s am_{2c})$.

The calculation of the axial matching factor $\rho _{A_j}(1)$ then follows exactly the procedure of Ref. [22]. The other ratios require further calculations that are detailed in Appendix B. The resulting values for all matching factors are gathered in Table 6, where the errors shown are from the VEGAS integration. The errors associated with the approximations we make to obtain those factors are discussed in Appendix B. They are added to those in Table 6 before carrying out the chiral-continuum extrapolation.

Table 6 Matching factors calculated at one loop in perturbation theory for all ensembles and form factors. The errors shown in the table do not include the systematic errors coming from the approximations employed in the calculation of the matching factors. They are nonetheless included in the chiral-continuum extrapolation. These errors and more details on the calculation are described in Appendix B. The last two columns include the values of $\alpha _V$ and $m_{2c}a$ used in Appendix B to estimate the matching errors


We also calculate the matching factors for the ensembles involved in the heavy-quark (HQ) mistuning corrections. This is a departure from Ref. [53], where the matching factors are applied after the HQ-mistuning correction. In this way, we completely separate the HQ-mistuning corrections from the matching errors.

During the analysis, we blinded the form-factor data by multiplying the matching factor $\rho _{A_j}$ by an unknown, random number close to 1. The ratios $\rho _{A_4}/\rho _{A_j}$ and $\rho _{V_j}/\rho _{A_j}$ are left unchanged. Via the correlator ratios and Eqs. (26)–(29), all form-factor values on all ensembles are thus multiplied by a common factor. All stages of the analysis were tested either through independent fits performed by two co-authors or by using independent methods or codes. Only after the full analysis was complete, including construction of the systematic error budget (Sect. 4) and z expansion (Sect. 5), was the blinding factor removed from $\rho _{A_j}$, and the analysis scripts were rerun to extract the unblinded results for the form factors.

3.7 Heavy-quark-mass adjustment

The bare masses for the b and c quarks are tuned such that the kinetic masses of the $B_s$ and $D_s$ meson on each ensemble are equal to their physical values. Nonetheless, the tuning procedure has errors that must be taken into account. The procedure is explained in Appendix C and outlined here. As we generate configurations, values for the heavy-quark masses that give approximately the correct meson masses can be estimated. These initial values are employed to compute the two- and three-point functions that we analyze in this work. At the end of the data generation, the much larger statistical sample allows for a more precise determination of the b and c quark masses by using the procedure detailed in Ref. [22]. We must then correct for this mismatch.

The correction is calculated nonperturbatively by studying the effect of a varying heavy-quark mass in a single ensemble on all correlation functions. Since the heavy-quark mismatch is small, a linear fit in $1/m_Q$ for each flavor $Q=c$, b and form factor is usually sufficient. Once the functions that describe the evolution of the form factors with the quark masses are known, we can apply the correction to all other ensembles. Table 20 in Appendix C gathers all our adjustments. The error in the correction comes from the error in the heavy-quark-mass tuning procedure and from the error of the linear fit to find the heavy-quark mass dependence.
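The logic of the adjustment can be sketched as follows, under simplifying assumptions: an unweighted straight-line fit in $1/m_Q$ (the actual analysis may weight the points and propagate errors, which is omitted here), and hypothetical function names. The slope obtained on the single ensemble with varied heavy-quark masses is then used to shift a form factor from the simulated mass to the tuned one.

```python
def linear_fit(x, y):
    """Unweighted least-squares fit y = intercept + slope * x."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar

def mass_corrected(h_sim, m_sim, m_tuned, slope):
    """Shift a form factor computed at heavy-quark mass m_sim to the tuned
    mass m_tuned, assuming linear dependence on 1/m_Q with the given slope."""
    return h_sim + slope * (1.0 / m_tuned - 1.0 / m_sim)
```

Here `x` would hold the values of $1/m_Q$ and `y` the corresponding form-factor values on the ensemble used to map out the mass dependence; one such fit is needed for each flavor and form factor.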

3.8 Chiral-continuum extrapolation

After applying the renormalization factors and the heavy-quark-mistuning corrections as described in previous subsections, the resulting form factors still need to be corrected for the fact that they are calculated at nonzero lattice spacing and nonphysical values for the light-quark masses. An extrapolation to both the continuum limit and the physical value of the light-quark masses is thus necessary to extract values that can be used in a physical calculation. The extrapolation should be based on an appropriate effective field theory (EFT) description of lattice QCD. The relevant EFT for the calculation at hand is rooted staggered chiral perturbation theory (rS$\chi $PT), which describes how the form factors behave as the lattice spacing and the light-quark masses approach the desired limits, extended to include heavy-light observables [75]. The unquenched MILC configurations generated with 2 + 1 flavors of improved staggered fermions make use of the fourth-root procedure for eliminating the unwanted four-fold degeneracy of staggered quarks. At nonzero lattice spacing, this procedure has small violations of unitarity [76,77,78,79,80] and locality [81]. Nevertheless, a careful treatment of the continuum limit, in which all assumptions are made explicit, argues that lattice QCD with rooted staggered quarks reproduces the desired local theory of QCD as $a\rightarrow 0$ [82, 83]. When coupled with other analytical and numerical evidence (see Refs. [84,85,86] for reviews), this gives us confidence that the rooting procedure is indeed correct in the continuum limit. We then use the following functions obtained in $\text {SU}(3)$ rS$\chi $PT to lowest nontrivial order in the heavy-quark expansion to fit the different form factors:

$$\begin{aligned} h_Y(a, m, m_s, w)&= \Bigg (K_Y + \frac{\chi _Y(\varLambda _\chi )}{m_c^{k_Y}} + f_Y^w + f_Y^\text {NLO} \nonumber \\&\quad + f_Y^\text {NNLO}\Bigg ) \times \left( 1 + f_Y^\text {HQ}\right) , \end{aligned}$$

(49)

where $Y=A_1$, $A_2$, $A_3$, and V; $K_{A_2}=0$ but $K_Y=1$ otherwise; $k_{A_1}=2$ but $k_Y=1$ otherwise. These expressions contain the correct dependence in $\chi $PT on the light- and strange-quark masses, the lattice spacing, and the recoil parameter w at next-to-leading order (NLO). This result expands on the one in Ref. [87] by adding the missing recoil dependence in the relevant places. The terms

$$\begin{aligned} f_Y^{\text {NLO}}&= \frac{g^2_{D^*D\pi }}{48\pi ^2 f_\pi ^2 r_1^2} \, \textrm{logs}^Y_\text {SU(3)}(a, m, m_s, w, \varLambda _\chi ) \nonumber \\&\quad + c_{m_1,Y}x_l + c_{a_1,Y}x_{a^2}, \end{aligned}$$

(50)

introduce nonanalytic dependence on the light and the strange-quark masses through the chiral logarithms $\textrm{logs}^Y_\text {SU(3)}$. Those terms also include the leading taste-breaking discretization effects from the light-quark sector. The explicit expression of the logarithms for each form factor is given in Appendix A and includes a dependence on the recoil parameter w. The coefficient of the chiral logarithms comes from $\chi $PT and it is known, but the current determinations of the coupling $g_{D^*D\pi }$ are not very accurate, hence we fit the coupling with a Gaussian prior $0.53\pm 0.08$, compatible with experimental data [88,89,90] and lattice-QCD results [91,92,93,94,95,96]. We fix the pion decay constant appearing in the chiral logs in Eq. (50) and elsewhere in the fit function to the three-flavor FLAG 2019 average with the error increased by the estimated 0.7% charm sea-quark contribution, $f_\pi =130.2\pm 1.2$ MeV [97].

The other terms in Eq. (50) introduce analytic NLO corrections in the light-quark masses through $x_l=2B_0m/(8\pi ^2f_\pi ^2)$ and in the lattice spacing through $x_{a^2} = [a/(4\pi f_\pi r_1^2)]^2$, where $B_0$ is the low-energy-constant (LEC) of $\chi $PT that relates the light- and strange-quark masses with the meson masses. The value of $B_0$ for each lattice spacing is the same as in the earlier analysis at zero recoil [22], which uses exactly the same ensembles, and is given in Appendix A. We take into account truncation errors by including the term

$$\begin{aligned} f_Y^{\text {NNLO}} = c_{c,Y} x_l x_{a^2} + c_{m_2,Y} x_l^2 + c_{a_2,Y} x_{a^2}^2, \end{aligned}$$

(51)

which describes the dependence on the light-quark masses and the lattice spacing a at next-to-next-to-leading order (NNLO), not including logarithmic terms. According to $\chi $PT power-counting, these analytical terms are expected to have coefficients of O(1), so we take them as fit parameters with priors $0\pm 1$. We do not include analytical terms in the strange-quark mass because we do not have data at different values of $m_s$ and nonzero recoil, and our $h_{A_1}(1)$ result using only zero-recoil data agrees within errors with our previous result from Ref. [22]. Also, $\chi $PT predicts a much milder dependence on $m_s$ than on the light-quark masses.

We allow a simple NLO analytical dependence on w to describe the behavior in our small-recoil range through the term

$$\begin{aligned} f_Y^w = - \rho _Y^2(w-1) + \kappa _Y(w-1)^2, \end{aligned}$$

(52)

where the fit parameters $\rho _Y$ and $\kappa _Y$ are related to the slope and curvature of the form factor $h_Y$ respectively. We can reasonably expect the slope of the form factors to be roughly 1, but in order to accommodate substantial deviations from this value, we set the prior of $\rho _Y$ to $1\pm 2$. The priors of $\kappa _Y$ are chosen to be $0\pm 3$, and the posteriors are compatible with zero within a fraction of a sigma.

The constant term $\chi _Y(\varLambda _\chi )$ in Eq. (49) is a LEC of the chiral effective theory, and it is suppressed for $h_{A_1}$ by a factor of $1/m_c^2$ due to Luke’s theorem [98], whereas the other form factors receive contributions of order $O(1/m_c)$ in the heavy-quark power counting. The dependence of this LEC on the chiral scale $\varLambda _\chi $ cancels against the dependence of the nonanalytical terms in Eq. (50). We set the prior of this LEC to $0\pm 1$, except for $h_{A_1}$ where we use $0.0\pm 0.2$ to reflect the suppression due to Luke’s theorem.

The last term in Eq. (49) accounts for the heavy quark discretization errors,

$$\begin{aligned} f_Y^\text {HQ} = \beta ^{\alpha _s a}_Y \alpha _s a\varLambda _{\text {QCD}} + \beta _Y^{a^2} a^2\varLambda ^2_{\text {QCD}} + \beta _Y^{a^3} a^3\varLambda ^3_{\text {QCD}}, \end{aligned}$$

(53)

with $\beta ^p_Y$ the coefficient of the term of order O(p) corresponding to the form factor $h_Y$, and $\varLambda _\text {QCD}=0.6$ GeV for normalization purposes. In previous articles,^{Footnote 3} we have employed the universal functions described in Refs. [49, 67]. But in those cases there was only one heavy meson. Here we need to deal with the B and the $D^*$, and our data are not accurate enough to distinguish the different terms described by the universal functions. In order to avoid terms that mimic the effect of others, we consider it a better strategy to implement generic discretization error terms, which would account for the same dependence as the universal functions. A side effect of this approach is that the heavy- and NLO light-quark discretization effects become mixed together through terms with the same dependence on the lattice spacing. To avoid this, we drop the $O(a^2)$ term from Eq. (53), which has the same dependence as the $O(a^2)$ term already present in Eq. (50), and we enlarge the prior of the latter assuming that both corrections are independent, i.e., with a quadrature sum. The priors for the $\beta ^p_Y$ coefficients are set to $0\pm 1$, but the $O(a^2)$ coefficients of Eqs. (50) and (53) have different normalizations. For this reason, the final prior for $c_{a_2,Y}$ becomes $0.0\pm 6.1$. This approach does not allow us to distinguish cleanly the origin of the discretization errors, but it accounts for the correct dependence and size of discretization errors. However, absorbing the $O(a^2)$ term from Eq. (53) in Eq. (50) may have further effects, since the correction in Eq. (53) is applied to the chiral-continuum fit function in Eq. (49). It is possible that this procedure does not account for discretization effects in the shapes of the form factors, which would give rise to higher order terms of the form $a^2(w-1)$ and $a^2(w-1)^2$. We test for such effects by performing two alternate chiral-continuum fits. 
In the first, we add the terms $x_{a^2}(w-1)$ and $x_{a^2}(w-1)^2$ to Eq. (52), where the priors for the coefficients of these terms are chosen as 0(1). We find that the results of this fit differ by at most $0.1\sigma $ in their central values from our base fit, while the statistical fit uncertainty is unchanged. In the second variation, we keep the $O(a^2)$ term in Eq. (53). In this case, the central values are consistent with those of our base fit within $0.25\sigma $, again with an unchanged uncertainty. Hence, we conclude that such discretization effects are already accounted for in our base fit.

Since each ensemble is statistically independent from the others, there are no correlations among them. On the other hand, we keep track of correlations both between different form factors within the same ensemble and within the same form factor calculated at different momenta, by combining the jackknife data of all form factors into a large, block-diagonal dataset. Our large statistics allow us to resolve the full covariance matrix without resorting to thinning procedures or singular-value-decomposition cuts on its eigenvalues. Nonetheless, we use the shrinkage procedure described in Refs. [99,100,101,102] to ensure the small eigenvalues of the covariance matrix have the correct behavior, and we find that our results do not change with respect to the analysis without shrinkage.

The systematic errors coming from the heavy-quark mistuning corrections and those introduced by the matching factors are built into our chiral-continuum extrapolation by constructing the combined covariance matrix,

$$\begin{aligned} C_{ij} = C_{ij}^{\textrm{stat}} + \delta _i^{(\rho )}\delta _j^{(\rho )} + \delta _i^{(\kappa )}\delta _j^{(\kappa )}, \end{aligned}$$

(54)

where the first term includes the statistical covariance, and the second and the third ones account for the matching-factor and heavy-quark-mass mistuning correction errors, respectively. The i, j indices run over all form factors, ensembles, and momenta. With $\delta ^{(\rho ,\kappa )}_i$ we represent either the shift in the $i^\text {th}$ datum due to a correction to the heavy-quark mass ($\kappa $) or the propagated error of the form factor from the errors in the matching factors ($\rho $) as calculated in Eqs. (B.49a)–(B.49d). As a result, the systematic errors introduce new correlations between all data points. In fact, Eq. (54) assumes the worst-case scenario that the matching systematic errors and the errors coming from the heavy-quark mistuning are 100% correlated.
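As a rough illustration of Eq. (54), the following sketch adds two rank-one systematic terms to a statistical covariance matrix. The function name `combined_covariance` and all numerical inputs are hypothetical and chosen only to show the mechanics; they are not values from the analysis.

```python
def combined_covariance(c_stat, delta_rho, delta_kappa):
    """Eq. (54): C_ij = C_stat_ij + d_rho_i * d_rho_j + d_kappa_i * d_kappa_j."""
    n = len(c_stat)
    return [
        [
            c_stat[i][j]
            + delta_rho[i] * delta_rho[j]
            + delta_kappa[i] * delta_kappa[j]
            for j in range(n)
        ]
        for i in range(n)
    ]


# Toy input: three data points with uncorrelated statistical errors (hypothetical).
c_stat = [[0.0004, 0.0, 0.0], [0.0, 0.0009, 0.0], [0.0, 0.0, 0.0016]]
delta_rho = [0.01, 0.01, 0.02]       # matching-factor errors (hypothetical)
delta_kappa = [0.005, -0.002, 0.0]   # heavy-quark mistuning shifts (hypothetical)

cov = combined_covariance(c_stat, delta_rho, delta_kappa)
```

Even though the toy statistical matrix is diagonal, the off-diagonal entries of `cov` are nonzero, which is how the systematic terms correlate all data points.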

The extrapolation results for the four form factors are shown in Fig. 6. As one can see, $h_{A_1}$, which is protected by Luke’s theorem, receives small corrections from 1 at $w=1$. The other form factors do not enjoy this privilege, and for them the plots show large corrections from the HQET limit. Figure 6 also shows the result of the previous Fermilab-MILC calculation at zero recoil $w=1$ for comparison [22]. The agreement is good, although the errors have increased, mainly because of more conservative choices in this work: the data at nonzero recoil require an extra excited state at source and sink in the ratio calculations, which results in larger errors. For consistency, we employed the same approach at zero recoil as well. The deaugmented $\chi ^2$/dof of the chiral-continuum extrapolation is 85.2/95.

4 Systematic errors

This section provides specific information on our estimates of every source of systematic error in the determination of the form factors $h_X$. Even though only $h_{A_1}$ contributes to the decay amplitude at zero recoil, all form factors are nonzero at $w=1$, and their errors need not be suppressed at small recoil. Even if the errors of $h_V$, $h_{A_2}$, and $h_{A_3}$ become large, however, their contribution to the decay amplitude and, hence, the resulting uncertainty in the decay amplitude is still suppressed at small recoil.

Some general features of the uncertainties in the form factors can be understood via HQET. The form factor $h_{A_1}$ is protected by Luke’s theorem [98] and, indeed, we find HQET corrections of a few percent. The form factors $h_V$ and $h_{A_3}$, which are not protected by Luke’s theorem, receive HQET corrections at the $\sim 30\%$ level. The form factor $h_{A_2}$ starts in HQET with terms of order $\alpha _s$ and $1/m_c$, which is roughly consistent with our data, $h_{A_2}\sim -\frac{1}{2}$. Figure 7 shows the error budget for the different form factors in the continuum as a function of the recoil parameter. The relative uncertainty in each form factor follows the same pattern as the HQET corrections: small for $h_{A_1}$, moderate for $h_V$ and $h_{A_3}$, and large for $h_{A_2}$. In the last case, the relative uncertainty is large, because the overall value of $h_{A_2}$ is smaller than the others.

Our chiral-continuum extrapolation ansatz to NNLO incorporates errors from statistics, choices in the chiral-continuum extrapolation, discretization effects, $O(am_c \alpha _s)$ matching errors, and heavy-quark parameter mistuning. Thus, they are all entangled in the fit, and it is not straightforward to extract each particular contribution. In addition, our treatment of the heavy-quark discretization errors includes a term identical to one of gluon and light-quark discretization errors. We can, however, roughly estimate each contribution by making modifications to the fit. In this spirit, we define the statistical contribution to the error as the error obtained in a NLO fit without mistuning correction or matching-factor errors included. We have a specific way to deal with the matching factors, which is explained below. The contribution coming from the chiral-continuum extrapolations is estimated by comparing the fit errors with and without NNLO terms.

There are more contributions to the final error that have been taken into account: light-quark mass mistuning, scale setting, isospin effects, and finite-volume effects. The final error is taken to be the quadrature sum of these uncertainties with that of the chiral-continuum extrapolation error, which (again) includes statistical, chiral-continuum extrapolation, discretization, heavy-quark mistuning, and matching errors, as shown in Table 7. In the rest of this section, we discuss each source of uncertainty one by one, explaining how they enter this error budget.

Table 7 Error budget for all form factors at $w=1.11$. The first row shows the combined error coming from our chiral-continuum fit, which encompasses the statistical errors, the matching errors, the systematics due to our chiral-continuum extrapolation, errors coming from HQ-mistuning corrections, and discretization errors. The next several rows show estimates of the individual contributions (in parentheses as a reminder that they are contained in the first row). Since the terms that describe the discretization errors come from both the (N)NLO terms in the extrapolation and the HQ discretization terms, there is an overlap between the statistical errors (determined as those of a chiral-continuum extrapolation at NLO without any matching or heavy-quark mistuning errors taken into account) and the discretization errors, and the sum in quadrature of the numbers in parentheses does not equal the first row. The remaining rows show other contributions, which are added to the first row in quadrature to obtain the total error in the last row. Dashes represent terms so small that they were not included in the final computation of the error

4.1 Statistics and stability of the correlator fits

In principle, the determination of masses, energies, and form factors depends on choices made in fitting the two- and three-point correlation functions, but we argue that the associated uncertainties are encompassed in the statistical component of the first line of Table 7. We have analyzed the two-point functions with both 2 + 2 and 3 + 3 states in the fit. Only when the two results agree within statistical errors do we select a particular fitting range. In this way, the influence of excited states is reduced below the statistical uncertainty of the B masses and $D^*$ energies. For the three-point-function ratios, we find that excited states play a more important role, as can be seen for the example of $x_f$ in Fig. 4. When fitting the three-point functions, we therefore include extra states at the source and sink in order to control this potential source of systematic error.

Bias can arise from the choice of fitting ranges. To avoid the problems that can come from choosing different fitting ranges for different ensembles, we impose the same $t_\text {Min}$ in physical units for all two-point correlator fits. Our $t_\text {Max}$ is chosen differently and varies from ensemble to ensemble, but the impact of a different $t_\text {Max}$ is much smaller, because these points have much larger errors. We refer the reader to Sect. 3.3.1, where all the details are explained. For the form-factor ratio fits, we employ the same range in physical units for all ensembles and most ratios. For the double ratio $R_{A_1}$ and $x_f$, where the same pattern of states is expected at source and sink, we use a symmetric fit range with $t_\text {Max}=T-t_\text {Min}$. Our fits also take into account all correlations between the data points fitted, and, as pointed out in Sect. 3.2, we find that autocorrelations in our data are negligible.

For each correlator fit, we compute a p value from the deaugmented $\chi ^2$ and number of degrees of freedom, as explained in Sect. 3.2. We then verify that these p values follow an approximately uniform distribution.
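The check described above can be sketched as follows. This is a minimal illustration, assuming the p value is the upper-tail probability of a standard $\chi ^2$ distribution evaluated at the quoted $\chi ^2$ and degrees of freedom. The Kolmogorov-Smirnov distance used to quantify uniformity is our own choice for the example; the text does not specify which test was applied. The function names are hypothetical.

```python
import math


def chi2_p_value(chi2, dof):
    """Upper-tail probability of a chi-squared distribution with `dof` degrees of freedom."""
    a, x = 0.5 * dof, 0.5 * chi2
    if x <= 0.0:
        return 1.0
    # Power series for the regularized lower incomplete gamma function P(a, x);
    # adequate for the moderate chi2 values of a sketch like this one.
    term = 1.0 / a
    total = term
    n = 0
    while abs(term) > 1e-16 * abs(total):
        n += 1
        term *= x / (a + n)
        total += term
    lower = total * math.exp(-x + a * math.log(x) - math.lgamma(a))
    return max(0.0, 1.0 - lower)


def ks_statistic_uniform(p_values):
    """Kolmogorov-Smirnov distance between a sample and the uniform distribution on [0, 1]."""
    ps = sorted(p_values)
    n = len(ps)
    return max(max((i + 1) / n - p, p - i / n) for i, p in enumerate(ps))


# The chiral-continuum fit quoted in the text has chi2/dof = 85.2/95.
p_chiral_continuum = chi2_p_value(85.2, 95)
```

A small KS distance for the collection of correlator-fit p values indicates that they are compatible with a uniform distribution.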

4.2 Stability of the chiral-continuum extrapolation

To assess the stability of the chiral-continuum extrapolation, we repeat the fit for several different functional forms. Compared with the base fit, we omit, in turn, the NNLO terms, data at $w>1.10$, the coarsest ensemble, the finest ensemble, and heavy-quark discretization terms of order $a^3$. The results are very stable under modifications, as shown in Fig. 8. The most dramatic changes occur when we remove the $w>1.10$ data, leading to shifts of around one standard deviation in the worst case. Obviously, the large-recoil extrapolation is affected when removing data at larger recoil, but we find that the z expansion, discussed below in Sect. 5.1, stabilizes the final results for $|V_{cb}|$ and $R(D^*)$ in this respect. Table 8 shows that the quality of fit remains good, with $\chi ^2/\text {dof}\lesssim 1$, in all cases.

Table 8 Values of $\chi ^2$/dof for several variations in the chiral-continuum extrapolation

4.3 Discretization errors

The improved action in the ensemble simulations has light-quark and gluon discretization errors of order $\alpha _s a^2$. Simple power-counting arguments suggest that the discretization errors range from $\sim 0.5\%$ in the finest ensemble to $\sim 10\%$ in the coarsest one. The chiral-continuum extrapolation, however, describes such terms via the lattice-spacing dependence of the chiral logarithms and the analytical terms proportional to $a^2$. Moreover, the form factors do not seem to be sensitive to the lattice spacing. Hence, we expect the chiral-continuum extrapolation to take all these errors into account, and no further systematic uncertainties are added to the final result.

In order to take into account the discretization errors coming from the lattice treatment of the heavy quarks, we include extra terms in the chiral-continuum extrapolation as shown in Eq. (53). These terms are motivated by the HQET description of cutoff effects [50, 103], which uses HQET to derive the mismatch between the lattice gauge theory at hand and continuum QCD. The result is a set of functions that depend on the heavy-quark mass, and that can account for discretization effects of different sizes (in our case, order $\alpha _sa$, $a^2$ and $a^3$).

In our analysis, we would like to introduce these functions for both the B and the $D^*$ mesons. Our data do not, however, distinguish between the contributions of the two mesons, so we instead use a single generic term for both mesons. Equation (53) shows the terms used in the end: $\alpha _sa$, $a^2$, and $a^3$. Since the $a^2$ term is already included in the light-quark discretization errors, it would be superfluous to include it again here. The downside of this approach is that it is impossible for us to disentangle light- and heavy-quark discretization errors in the error budget, and as such, we report them together.

One can estimate the size of these individual effects from variations of the chiral-continuum extrapolation with and without the terms in Eq. (53), and also removing the $O(a^2)$ term coming from NLO corrections in Eq. (50). Heavy- and light-quark discretization errors turn out to be the largest contribution to the total error in our analysis, and the inclusion of the terms listed in Eq. (53) is key in order to account for the heavy-quark systematic errors.

4.4 Matching errors

The matching factors are calculated at one-loop order in perturbation theory. Appendix B explains how we estimate the uncertainties, listed in Eqs. (B.49a)–(B.49d). They are included in the chiral-continuum extrapolation through Eq. (54). We can estimate the error introduced by the uncertainty in the matching up to order $am_c\alpha _s$ by removing the contribution of the matching factors to Eq. (54). The effect of higher-order contributions is estimated by including an overall factor $(1 + r^{h_X}_2 \alpha _s^2 + r^{h_X}_3 \alpha _s^3)$ multiplying Eq. (49) in the fit and checking the shift in the central value of the form factor. The priors for the $r^{h_X}_{2,3}$ coefficients are set to 0(1); the posteriors have central values and widths close to those of the priors. We see no impact from including $O(\alpha _s^3)$ terms, but the $O(\alpha _s^2)$ terms contribute to the final error at the subpercent level. We collect all observed differences in the corresponding line in Table 7.

4.5 Heavy quark mistuning

The form factors are adjusted for the differences between the simulated masses of the heavy quarks and the physical ones before the chiral-continuum extrapolation. The correction procedure is detailed in Appendix C. The largest correction is about $1\sigma $, but in general the correction is negligible. Equation (54) includes the contribution of the mistuning in the chiral-continuum extrapolation; therefore, we do not need to add any further error. Switching off these corrections gives small variations in the results of our chiral-continuum extrapolation, as shown in Table 7 and Fig. 7.

4.6 Light quark mistuning

The endpoint for the light quark masses in the chiral-continuum extrapolation is set to $r_1m_l=0.003612(126)$ [104]. We can determine the uncertainty in the form factors coming from a mistuning in the light quark mass by varying $r_1m_l$ within $1\sigma $ and monitoring its effect on the form factors. The resulting uncertainty is shown in Table 7 and Fig. 7.

4.7 Scale setting

In order to determine the relative lattice spacing, we use the distance scale $r_1/a$ defined from the force between static quarks [105, 106], which has been extensively computed [54]. Absolute scale setting is taken from the chiral $f_\pi $ analysis of the MILC Collaboration [64], leading to $r_1=0.3117(22)~\text {fm}$. The form factors are dimensionless, so uncertainties from scale setting appear indirectly through the tuning of the heavy-quark masses, the setting of the light-meson masses in the chiral logarithms, and in the approach to the continuum limit.

We estimate the systematic error associated with $r_1/a$ and $r_1$ by propagating their errors to the final result. We find that the form factors change only slightly when we vary $r_1$ or $r_1/a$ by $\pm 1\sigma $, and we include an extra error associated with this variation, as shown in Table 7 and Fig. 7.

4.8 Isospin effects

The whole calculation of the form factors has been done assuming isospin symmetry. The main effect of isospin breaking is to modify the endpoint of the chiral extrapolation through a change in the pion mass. This effect could bring the endpoint of the extrapolation closer to the $D\pi $-threshold cusp described by the chiral logs. We estimate the errors introduced by this approximation by varying the endpoint of the extrapolation in the pion mass from $m_{\pi ^0}$ to $m_{\pi ^+}$, by modifying the value of $r_1m_l$ from 0.003612 to 0.004065. While the pion-mass difference is mainly due to isospin-breaking QED effects, here we are using it as a proxy for the valence-quark mass difference, to which we do not have direct access. Following the resulting difference, we assign an error ranging from 0.0% to 0.5%, depending on the form factor and the value of the recoil parameter, as shown in Fig. 7 and Table 7. This increase in the error has no impact on the final result for the form factors.

As an alternative way of estimating these effects, we have also tried to move the endpoint of the extrapolation to isospin symmetric points with $m_l = m_u$ and $m_l = m_d$. The difference of the values of the form factors between these two endpoints overestimates the isospin breaking errors, because it includes sea-quark effects that cancel out at first order. The estimate of the isospin breaking errors is larger with this method, but it is still negligible. Hence we can safely assume that isospin effects are insignificant at our current level of precision.

4.9 Finite-volume effects

To estimate finite-volume effects in our heavy-light $\chi $PT description of the form factors, we replace the loop integrals by discrete sums. Following Refs. [87, 107], we estimate the correction to the integrals in the formulas appearing in $B\rightarrow D^*$ at zero recoil to be smaller than $0.01\%$. This comes from the fact that the contribution of the chiral logarithms to the form factors is quite small. We have not calculated the corrections at nonzero recoil, and one also expects an increase in the error close to the cusp of the chiral logs. Given that $M_\pi L > 4$ on most ensembles, and $M_\pi L\ge 3.7$ always, there is no reason to expect such a large increase in the error as to make the finite-volume corrections sizable. Hence, we do not assign any additional error due to them.

5 Determination of $|V_{cb}|$ and $R(D^*)$

After calculating the form factors, we can reconstruct the decay amplitude using Eq. (9) and use experimental data to extract $|V_{cb}|$. Similarly, the form factors lead directly to $R(D^*)$ via Eqs. (1), (7), and (8). There is a problem, however: the form factors are obtained only at small values of the recoil parameter, and an extrapolation to large w with the chiral-continuum fit formula would greatly increase the error. To bring the large-w behavior under control, we use a standard, model-independent parametrization based on unitarity and analyticity to extrapolate the form factors to the large-recoil region.

Historically the CLN parametrization [8] has been widely used for this process. However, recent developments have called into question the reliability of CLN fits, given the high accuracy of the latest experiments and calculations [14, 16]. Apart from using outdated data to derive the coefficients of the expansions, the main criticism of CLN in its most common usage is the lack of error estimates for theoretical ingredients. Even though the original CLN article [8] includes equations defining the covariance matrix of the slope and the curvature of the reference form factor, the final expressions omit this information. Finally, the strong unitarity constraints, based on heavy-quark symmetry, play an important role in the CLN parametrization. Instead of introducing these constraints as an additional assumption, we would rather perform a fit without imposing them at the outset and then use them after the fits as a consistency check.

To perform the z expansion we therefore use the completely general BGL parametrization [5,6,7]. Nonetheless, we compare our results with an updated version of CLN in Sect. 5.4.2. For a review on the status of the parametrizations for heavy-to-heavy decays, see Ref. [108].

5.1 z expansion with the BGL parametrization

The z expansion is based on a conformal map that takes w to a variable z, which remains small over the physical region for the decay process, namely,

$$\begin{aligned} z = \frac{\sqrt{w+1} - \sqrt{2N}}{\sqrt{w+1} + \sqrt{2N}}. \end{aligned}$$

(55)

The value $N=1$ is most commonly used, because it fixes the point $z=0$ at zero recoil, but a symmetric range has been advocated, with the claim that it reduces errors in the expansion [7, 8]. With $N=1$, the maximum recoil point for massless leptons $w_\text {Max}\approx 1.503$ becomes $z_\text {Max}\approx 0.056$, so indeed $z\ll 1$ in any case, and any reasonable expansion in z with coefficients of order 1 converges with a few terms. The conformal map given in Eq. (55) also pushes the branch cut in the w plane onto the unit circle, $|z|=1$, and subthreshold poles onto the real axis near $z=-1$. Since the nearest threshold is very far ($|z|\sim 1$) from the valid kinematic range ($0\le z\lesssim 0.056$) and higher-energy thresholds even farther, there is no advantage in using an alternative parametrization, such as the one proposed in Ref. [109], that would take care of the behavior at such values of z.
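The size of z over the physical range can be checked numerically with a few lines. The sketch below implements Eq. (55) directly; the function name `z_of_w` is our own.

```python
import math


def z_of_w(w, n=1.0):
    """Conformal map of Eq. (55) from the recoil parameter w to z."""
    s, s0 = math.sqrt(w + 1.0), math.sqrt(2.0 * n)
    return (s - s0) / (s + s0)


print(z_of_w(1.0))    # zero recoil maps to z = 0 for N = 1
print(z_of_w(1.503))  # close to the quoted z_Max of about 0.056
```

Choosing a value of N different from 1 shifts the point where $z=0$ away from zero recoil, which is how a symmetric z range would be obtained.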

The BGL parametrization does not apply directly to the $h_X$ form factors, but to combinations with definite spin-parity [108]:

$$\begin{aligned} g&= \frac{h_V}{M_B \sqrt{r}}, \end{aligned}$$

(56)

$$\begin{aligned} f&= M_B \sqrt{r}(1+w) h_{A_1}, \end{aligned}$$

(57)

$$\begin{aligned} {\mathcal {F}}_1&= M_B^2 \sqrt{r} (1+w) \Big [(w-r)h_{A_1} \nonumber \\&\quad - (w-1)\left( rh_{A_2} + h_{A_3}\right) \Big ], \end{aligned}$$

(58)

$$\begin{aligned} {\mathcal {F}}_2&= \frac{1}{\sqrt{r}} \Big [(1+w)h_{A_1} + (rw-1)h_{A_2} + (r-w)h_{A_3}\Big ], \end{aligned}$$

(59)

where $r = M_{D^*}/M_B$. These form factors are proportional to the helicity amplitudes $H_- - H_+$, $H_+ + H_-$, $H_0$, and $H_S$, respectively [cf. Eqs. (4)–(6)]. Thus, the form factor ${\mathcal {F}}_2$ is important only with massive leptons, in particular in the determination of $R(D^*)$.
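For concreteness, Eqs. (56)–(59) can be transcribed as a short function. This is only a sketch: the meson masses below are approximate values assumed for illustration (the analysis takes its inputs from Table 10), and the form-factor values passed in the example are hypothetical.

```python
import math

# Approximate meson masses in GeV, assumed for illustration only.
M_B = 5.280
M_DSTAR = 2.010
R = M_DSTAR / M_B


def bgl_form_factors(w, h_a1, h_a2, h_a3, h_v, m_b=M_B, r=R):
    """Return (g, f, F1, F2) of Eqs. (56)-(59) from the HQET-basis form factors at recoil w."""
    sr = math.sqrt(r)
    g = h_v / (m_b * sr)
    f = m_b * sr * (1.0 + w) * h_a1
    f1 = m_b**2 * sr * (1.0 + w) * ((w - r) * h_a1 - (w - 1.0) * (r * h_a2 + h_a3))
    f2 = ((1.0 + w) * h_a1 + (r * w - 1.0) * h_a2 + (r - w) * h_a3) / sr
    return g, f, f1, f2


# Hypothetical inputs at zero recoil, not results of the analysis.
g, f, f1, f2 = bgl_form_factors(1.0, h_a1=0.9, h_a2=-0.5, h_a3=1.2, h_v=1.2)
```

At $w=1$ the term multiplying $h_{A_2}$ and $h_{A_3}$ in ${\mathcal {F}}_1$ vanishes, so ${\mathcal {F}}_1 = M_B(1-r)f$ follows directly from Eqs. (57) and (58), whatever values are assigned to the $h_X$.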

The BGL parametrization expresses the dependence of the form factors on z as

$$\begin{aligned} f_i(z) = \frac{1}{P_i(z)\,\phi _i(z)}\sum _{j=0}^\infty a_{i,j} z^j, \end{aligned}$$

(60)

where the functions $P_i(z)$ are called Blaschke factors, and the $\phi _i$ are known as outer functions. As discussed below, wise choices of $\phi _i(z)$ make the coefficients of the expansion $a_{i,j}$ of order 1 and ensure rapid convergence of the series. The Blaschke factors are given by

$$\begin{aligned} P_i(z) = \prod _p\frac{z-z_p}{1 - zz_p}, \end{aligned}$$

(61)

with

$$\begin{aligned} z_p(M_p, N) = \frac{\sqrt{\left( 1 + r\right) ^2 - \frac{M_p^2}{M_B^2}} - \sqrt{4Nr}}{\sqrt{\left( 1 + r\right) ^2 - \frac{M_p^2}{M_B^2}} + \sqrt{4Nr}}. \end{aligned}$$

(62)

They include the explicit poles with mass $M_p$ below the $BD^*$ threshold and with the appropriate quantum numbers. Table 9 shows the poles we use for the BGL form factors. Although some analyses employ four $1^-$ resonances, the fourth one is very far from $z>0$ and its value is uncertain. We therefore follow Ref. [15] and use only three.
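Equations (61) and (62) translate into a few lines of code. In the sketch below, the meson masses are approximate values assumed for illustration, and the pole masses are round placeholder numbers below the $BD^*$ threshold; they are not the entries of Table 9.

```python
import math

M_B = 5.280       # GeV, approximate (assumed)
M_DSTAR = 2.010   # GeV, approximate (assumed)
R = M_DSTAR / M_B


def z_pole(m_pole, n=1.0, m_b=M_B, r=R):
    """Position z_p of a subthreshold pole of mass m_pole, Eq. (62)."""
    root = math.sqrt((1.0 + r) ** 2 - (m_pole / m_b) ** 2)
    shift = math.sqrt(4.0 * n * r)
    return (root - shift) / (root + shift)


def blaschke(z, pole_masses, n=1.0):
    """Blaschke factor of Eq. (61): product over poles of (z - z_p) / (1 - z * z_p)."""
    result = 1.0
    for m_pole in pole_masses:
        zp = z_pole(m_pole, n)
        result *= (z - zp) / (1.0 - z * zp)
    return result


# Placeholder pole masses in GeV (hypothetical, not taken from Table 9).
poles = [6.3, 6.9, 7.0]
p_mid = blaschke(0.03, poles)
```

Each factor vanishes at its own $z_p$ and has unit modulus on the circle $|z|=1$, so dividing by $P_i(z)$ removes the poles without changing the normalization on the unit circle.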

The z-expansion coefficients of the BGL form factors are then defined via

$$\begin{aligned} g&= \frac{1}{P_{1^-}(z)\,\phi _g (z)}\sum _{j=0}^\infty a_j z^j, \end{aligned}$$

(63)

$$\begin{aligned} f&= \frac{1}{P_{1^+}(z)\,\phi _f (z)}\sum _{j=0}^\infty b_j z^j, \end{aligned}$$

(64)

$$\begin{aligned} {\mathcal {F}}_1&= \frac{1}{P_{1^+}(z)\,\phi _{{\mathcal {F}}_1}(z)}\sum _{j=0}^\infty c_j z^j, \end{aligned}$$

(65)

$$\begin{aligned} {\mathcal {F}}_2&= \frac{1}{P_{0^-}(z)\,\phi _{{\mathcal {F}}_2}(z)}\sum _{j=0}^\infty d_j z^j, \end{aligned}$$

(66)

where the subscripts of the Blaschke factors denote the $J^P$ of the $\ell \nu $ final state.

Table 9 Poles for the Blaschke factors, taken from Ref. [15] and references therein. For $J^P=1^-$, $0^-$ ($1^+$), the first two (first only) resonances are well determined from either a lattice calculation or experimental measurements. All other masses are based on model estimates

Setting $N=1$, we choose the outer functions to be [5,6,7]

$$\begin{aligned} \phi _g&= 16r^2 \sqrt{\frac{n_I}{3\pi {\tilde{\chi }}^T_{1^-}(0)}} \frac{(1+z)^2(1-z)^{-\frac{1}{2}}}{\left[ (1+r)(1-z) + 2\sqrt{r}(1+z)\right] ^4}, \end{aligned}$$

(67)

$$\begin{aligned} \phi _f&= \frac{4r}{M^2_B}\sqrt{\frac{n_I}{3\pi {\chi }^T_{1^+}(0)}} \frac{(1+z) (1-z)^{ \frac{3}{2}}}{\left[ (1+r)(1-z) + 2\sqrt{r}(1+z)\right] ^4}, \end{aligned}$$

(68)

$$\begin{aligned} \phi _{{\mathcal {F}}_1}&= \frac{4r}{M^3_B}\sqrt{\frac{n_I}{6\pi {\chi }^T_{1^+}(0)}} \frac{(1+z) (1-z)^{ \frac{5}{2}}}{\left[ (1+r)(1-z) + 2\sqrt{r}(1+z)\right] ^5}, \end{aligned}$$

(69)

$$\begin{aligned} \phi _{{\mathcal {F}}_2}&= 8\sqrt{2}r^2 \sqrt{\frac{n_I}{ \pi {\tilde{\chi }}^L_{1^+}(0)}} \frac{(1+z)^2(1-z)^{-\frac{1}{2}}}{\left[ (1+r)(1-z) + 2\sqrt{r}(1+z)\right] ^4}, \end{aligned}$$

(70)

where the undefined symbols under the square root are given in Table 10, along with the numerical values we use for $M_B$ and $M_{D^*}$. In testing the effect of the fourth $1^-$ pole and various other choices for the pole positions, we find that although the numerical values of the z-expansion coefficients depend on the details, the final curves for the form factors are largely independent of such choices.

With these outer functions, the coefficients of the expansion satisfy the following unitarity constraints [5,6,7],

$$\begin{aligned} \sum _{j=0}^\infty a_j^2 \lesssim 1,\quad \sum _{j=0}^\infty \left( b_j^2 + c_j^2\right) \lesssim 1,\quad \sum _{j=0}^\infty d_j^2 \lesssim 1, \end{aligned}$$

(71)

where the $\lesssim $ symbols reflect the fact that the values for the $\chi $ factors in Table 10 are not exact, so the bounds are not precisely known. These constraints provide information from unitarity and analyticity, which can be used together with the output of the chiral-continuum extrapolation.

In this analysis, we do not impose the unitarity constraints, Eq. (71), on the BGL coefficients, but we check that the final results comply with the constraints within errors. We note that uncertainties in the values of $\chi _{J^P}^{T,L}$ provided in Table 10 complicate strict unitarity constraints on the z expansion coefficients. Still, we find that an implementation of the unitarity constraints using hard cutoffs (following, for instance, Ref. [110]) leaves the fit results essentially unchanged.
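An after-the-fact check of Eq. (71) on a truncated set of coefficients could look like the sketch below. The function names and the coefficient values are hypothetical; the bound of 1 is only approximate, as noted above, and the sketch ignores the uncertainties of the coefficients.

```python
def unitarity_sums(a, b, c, d):
    """Left-hand sides of Eq. (71) for truncated coefficient lists a_j, b_j, c_j, d_j."""
    def sum_sq(coeffs):
        return sum(x * x for x in coeffs)

    return sum_sq(a), sum_sq(b) + sum_sq(c), sum_sq(d)


def satisfies_unitarity(a, b, c, d, bound=1.0):
    """True if every truncated sum stays below the approximate bound."""
    return all(s <= bound for s in unitarity_sums(a, b, c, d))


# Hypothetical coefficients, chosen only to exercise the functions.
ok = satisfies_unitarity([0.03, -0.1], [0.01, 0.0], [0.002, 0.0], [0.05, -0.2])
```

Because the b and c coefficients belong to the same $1^+$ channel, their squares enter a single sum, which is why the function returns three numbers rather than four.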

Table 10 Inputs for the outer functions, taken from Ref. [15] and references therein. The $\chi $ parameters are calculated in perturbative QCD through $O(\alpha ^2_s)$, and depend on charm and bottom quark mass inputs, see Ref. [111]

There are two kinematic relations between the form factors, one at zero recoil and another at maximum recoil,

$$\begin{aligned} {\mathcal {F}}_1(1)&= M_B(1-r)f(1), \end{aligned}$$

(72)

$$\begin{aligned} {\mathcal {F}}_2(w_\text {Max})&= \frac{1+r}{M_B^2(1+w_\text {Max})(1-r)r}{\mathcal {F}}_1(w_\text {Max}). \end{aligned}$$

(73)

These constraints follow trivially from the HQET basis of form factors (the $h_X$), but the BGL parametrization does not automatically impose them.^{Footnote 4} Equation (72) is straightforward to implement for the BGL parametrization with $N=1$ in Eq. (55), since it amounts to a relationship between the $b_0$ and the $c_0$ coefficients of the expansion,

$$\begin{aligned} \frac{1-r}{\sqrt{2}(1+\sqrt{r})^2} b_0 = c_0. \end{aligned}$$

(74)

On the other hand, Eq. (73) can be imposed by adding an extra data point that enforces the constraint. Alternatively, we can remove $d_0$ and write it as a function of the remaining $d_j$ and $c_j$. The two approaches give compatible results. In our final value, we choose not to impose the second constraint. The results of the chiral-continuum extrapolation trivially build in both constraints, so we would expect any well-behaved expansion to keep this property, even when extrapolating to the whole recoil range. Indeed, the zero-recoil constraint is satisfied to very high accuracy, even if we do not impose it. This happens because the z expansion is quite constrained by the lattice-QCD values. For the maximum-recoil constraint, we just check for compatibility within errors.
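As a numerical illustration of Eq. (74), the following minimal sketch evaluates the ratio $c_0/b_0$ and the maximum recoil. It assumes $r=M_{D^*}/M_B$, uses rounded meson masses chosen only for illustration, and takes $w_\text{Max}$ at $q^2=0$ (massless charged lepton); the function names are hypothetical.

```python
import math

# Assumed, rounded meson masses in GeV (illustrative inputs only).
M_B = 5.280
M_DSTAR = 2.010


def c0_over_b0(r: float) -> float:
    """Ratio c_0 / b_0 implied by the zero-recoil constraint, Eq. (74)."""
    return (1.0 - r) / (math.sqrt(2.0) * (1.0 + math.sqrt(r)) ** 2)


def w_max(r: float) -> float:
    """Maximum recoil for a massless charged lepton, where q^2 = 0."""
    return (1.0 + r * r) / (2.0 * r)


r = M_DSTAR / M_B  # assumed definition of the mass ratio r
print(f"r = {r:.4f}, c0/b0 = {c0_over_b0(r):.4f}, w_max = {w_max(r):.3f}")
```

With these inputs the constraint fixes $c_0$ to roughly one sixth of $b_0$, which is why $c_0$ is not an independent fit parameter.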

5.1.1 Synthetic data

The output of the chiral-continuum extrapolation is not a set of points, but a set of functions that express the form factors at any value of the recoil parameter. In order to make the results amenable to a BGL fit with experimental data, we use synthetic data, together with their covariance, based on selected central values of the chiral-continuum fit, evaluated at zero lattice spacing and physical quark masses. The selected values with correlations are included in the ancillary files, as explained in Appendix D. We choose data points at three w values, $\{1.03, 1.10, 1.17\}$, as representative of the span of our lattice-QCD data, but we have checked that varying these values does not change significantly the curves generated for the form factors from the z expansion, as long as there are no w values too close to $w=1$. This robust behavior is not surprising, since the full covariance matrix is available, and hence the same amount of information is provided. The covariance matrix of the lattice data is well defined for all recoil values, but at $w=1$ the kinematic constraint given in Eq. (72) is exactly satisfied. Therefore, the form factors are no longer independent as $w\rightarrow 1$, and the covariance matrix becomes singular. The singularity can be avoided by choosing any value slightly larger than $w=1$. Also, we cannot push w much higher than 1.17 without going outside the region where lattice-QCD data are available, because the uncertainty grows rapidly. We have selected three points per form factor because in the continuum limit there are only twelve free independent functions in our chiral-continuum extrapolation, three per form factor. Adding more points does not increase the accuracy of the z-expansion fits, because the new points are not independent.
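The statement that additional synthetic points carry no new information can be illustrated with a toy model. The sketch below assumes a function with only two free parameters, $f(w)=p_0+p_1(w-1)$, and a hypothetical parameter covariance; it is not the chiral-continuum fit function. Two synthetic points give an invertible covariance matrix, whereas a third point makes it singular. In the analysis above the analogous count is three parameters, and hence three points, per form factor.

```python
def value_covariance(points, param_cov):
    """Covariance of f(w) = p0 + p1*(w - 1) at the given w values."""
    basis = [[1.0, w - 1.0] for w in points]
    n = len(points)
    cov = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            cov[i][j] = sum(
                basis[i][a] * param_cov[a][b] * basis[j][b]
                for a in range(2)
                for b in range(2)
            )
    return cov


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    m = [row[:] for row in matrix]
    n = len(m)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda k: abs(m[k][col]))
        # Treat tiny pivots as exact zeros; adequate for this toy's O(0.01) entries.
        if abs(m[pivot][col]) < 1e-14:
            return 0.0
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for k in range(col + 1, n):
            factor = m[k][col] / m[col][col]
            for c in range(col, n):
                m[k][c] -= factor * m[col][c]
    return det


PARAM_COV = [[0.010, 0.002], [0.002, 0.040]]  # hypothetical parameter covariance
det_two = determinant(value_covariance([1.03, 1.17], PARAM_COV))
det_three = determinant(value_covariance([1.03, 1.10, 1.17], PARAM_COV))
print(det_two, det_three)
```

The vanishing determinant for three points is the toy analogue of the singular covariance matrix discussed above.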

We first carry out BGL fits to our lattice-QCD form factors. The constant and linear coefficients are well determined by the data, and no prior constraints are used for them. The quadratic and cubic coefficients are constrained with priors 0(1) to stabilize the fit. Keeping terms up to quadratic (linear) order in z and imposing the kinematic relation given in Eq. (72) leaves just one (three) degree(s) of freedom. Since unitarity constrains the size of the coefficients, we can include cubic terms with the unitarity-inspired priors to check the stability of the results against truncation effects. Table 11 gathers the coefficients for these three versions of the z expansion, with the $\chi ^2/\text {dof}$ and unitarity sums. All fits satisfy the unitarity constraints within errors. The unitarity sums are computed by taking the median of the distributions obtained from the Gaussian posteriors, along with the confidence levels, $\pm \, 34.1\%$, for the uncertainties. We note that the distributions of the squares are not Gaussian when the errors on the posteriors are large. This is the case for coefficients that are not well determined by the underlying lattice data. The kinematic constraint at $z=0$, Eq. (72), is satisfied to very high accuracy, even when it is not imposed. On the other hand, the constraint at $z_\text {Max}$, Eq. (73), is satisfied only within approximately one standard deviation, unless, of course, it is imposed.
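The procedure for the unitarity sums can be sketched with a small Monte Carlo. The example below assumes uncorrelated Gaussian posteriors (correlations between coefficients are ignored here for simplicity) and uses hypothetical coefficient values, not those of Table 11; the function name is ours.

```python
import random
import statistics


def unitarity_sum_summary(means, sigmas, n_samples=20000, seed=1):
    """Median and central 68.2% interval of sum_j a_j^2 for Gaussian a_j."""
    rng = random.Random(seed)
    sums = sorted(
        sum(rng.gauss(m, s) ** 2 for m, s in zip(means, sigmas))
        for _ in range(n_samples)
    )

    def quantile(q):
        return sums[min(int(q * n_samples), n_samples - 1)]

    median = statistics.median(sums)
    return median, quantile(0.159), quantile(0.841)


# Hypothetical posteriors: two well-determined coefficients and one
# prior-dominated coefficient with a 0(1) posterior.
median, low, high = unitarity_sum_summary([0.03, -0.10, 0.0], [0.001, 0.05, 1.0])
print(f"sum = {median:.2f} +{high - median:.2f} -{median - low:.2f}")
```

The upper error is much larger than the lower one, which reflects the non-Gaussian distribution of the squares when a coefficient is poorly determined.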

Table 11 Results of linear, quadratic, and unitarity-constrained cubic z expansions using only lattice-QCD data. The coefficient $c_0$ is fixed by the constraint given in Eq. (72), and it is shown for convenience

The cubic coefficients ($a_3$, $b_3$, $c_3$, $d_3$) shown in Table 11 are not well determined by our lattice data, resulting in unitarity sums with central values $>1$ that are still consistent with unitarity within errors. However, the cubic fit provides useful information on truncation effects. We see that the lower-order coefficients in the cubic fit are in very good agreement with the coefficients in the quadratic fit. Further, we find that the cubic coefficients have little effect on the decay rate and the form factors, as expected, since $|z|\ll 1$. In contrast, and as Table 11 shows, the linear expansion leads to coefficients with nearly the same central values as in the quadratic and cubic fits, but with slightly smaller errors, suggesting an underestimation of the truncation error. We conclude that the errors coming from the truncation of the series in Eqs. (63)–(66) are already included in the uncertainties on the coefficients of the quadratic fit, which we choose for the main result of the z expansion. The full correlation matrix of the quadratic fit, which can be used to reconstruct the output of the z expansion, is included in the ancillary files in binary format, as explained in Appendix D. Form factors from this information can be used in phenomenology with no assumption about the presence of new physics. Below we discuss z fits incorporating shape information from experiment, which are more precise but possibly contaminated by new physics in the light semileptonic channel.

Experimentalists [18, 19] usually adjust the order of the z expansion to allow unconstrained fits to the form factors without violating unitarity. Following this criterion we find that we can remove the $a_2$ and the $d_2$ coefficients, and obtain the same fit results as in our quadratic fit without including any 0(1) priors for the higher order coefficients. This ensures that our utilization of priors for some coefficients indeed does not influence the fit results, and that their only function is to stabilize the fit.

5.1.2 Functional method

A functional method can also be used to fit the result of the chiral-continuum extrapolation to the BGL parametrization [39]. The method exploits the fact that the chiral-continuum fit functions are linear in the fit parameters. The covariance in the fit parameters is then easily converted to the covariance of the values of the resulting fitted form factors (at zero lattice spacing and physical quark masses) at any pair of recoil parameters $(w,w')$. Through the BGL parametrization, this covariance is then converted to a covariance in the form-factor values at any z pair, $(z,z')$. The method has the esthetic property that information from the best-fit continuum form factors is spread over the entire physical region in z, rather than at a few arbitrarily chosen discrete points.
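Written out, and assuming that a continuum form factor takes the linear form $F(w)=\sum _i p_i\,\phi _i(w)$ with fit parameters $p_i$ and known functions $\phi _i$ (our notation for illustration, not necessarily that of Ref. [39]), the covariance propagation described above reads

$$\begin{aligned} \text {Cov}\left[ F(w),F(w')\right] = \sum _{i,j}\phi _i(w)\,C_{ij}\,\phi _j(w'),\quad C_{ij}=\text {Cov}\left[ p_i,p_j\right] . \end{aligned}$$

Because z is a monotonic function of w, evaluating this expression at $w(z)$ and $w(z')$ gives the covariance at any pair $(z,z')$, up to the linear, w-dependent change of basis to the BGL form factors.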

We have compared form factors from the functional approach with those from the synthetic data. They show no discernible difference in the form factors, implying that the systematic errors associated with the choice of synthetic data from the chiral-continuum extrapolation are very small. Since the functional fits do not provide any new insight, and they make it difficult to combine data from several sources, we focus on the synthetic-data results in the rest of the paper.

5.2 Determination of $|V_{cb}|$

The lattice-QCD form factors can be used in conjunction with experimental data to perform a joint fit to the BGL parametrization, with an additional fit parameter for the relative normalization, which is nothing but $|V_{cb}|$. In these fits, the low-recoil behavior is determined by lattice QCD, and the large-recoil behavior by experiment. As experimental input, we use the 2018 raw dataset from Belle [18] and the synthetic data generated from the 2019 BaBar analysis [19]. These data are combined with the lattice-QCD synthetic data. We do not use Belle’s 2017 tagged dataset [13], because it is still unpublished.

Experiments extract the fully differential decay rate, not only with respect to the recoil parameter, but also to all angular variables in the decay chain $B\rightarrow D^*\ell \nu $, $D^*\rightarrow D\pi $ [14, 18, 112],

$$\begin{aligned}&\frac{d\varGamma }{dw\,d\cos {\theta _v}\,d\cos {\theta _\ell }\,d\chi }\nonumber \\&\quad = \left| V_{cb}\right| ^2 \left| \eta _\text {EW}\right| ^2 \frac{3G_F^2M_B^5}{1024\pi ^4} r^3 \sqrt{w^2-1}(1-2wr+r^2) \nonumber \\&\qquad \times \left[ \left( 1-\cos {\theta _\ell }\right) ^2\sin ^2{\theta _v}H^2_+(w) + \left( 1+\cos {\theta _\ell }\right) ^2 \right. \nonumber \\&\qquad \times \left. \sin ^2{\theta _v}H^2_-(w) + 4\sin ^2{\theta _\ell }\cos ^2{\theta _v}H_0^2(w)\right. \nonumber \\&\qquad - 2\sin ^2{\theta _\ell }\sin ^2{\theta _v}\cos {2\chi }H_+(w)H_-(w) - 4\sin {\theta _\ell }\nonumber \\&\qquad \times \left( 1-\cos {\theta _\ell }\right) \sin {\theta _v}\cos {\theta _v}\cos {\chi }H_+(w)H_0(w) \nonumber \\&\qquad \left. + 4\sin {\theta _\ell }\left( 1+\cos {\theta _\ell }\right) \sin {\theta _v}\cos {\theta _v}\cos {\chi }\right. \nonumber \\&\qquad \times \left. H_-(w)H_0(w) \right] {\mathcal {B}}(D^*\rightarrow D\pi ), \end{aligned}$$

(75)

where ${\mathcal {B}}(D^*\rightarrow D\pi )$ is the branching fraction of the daughter $D^*$ decay; further, $\theta _v$, $\theta _\ell $, and $\chi $ are the polar angle of the D in the $D^*$ rest frame, the polar angle of the charged lepton in the rest frame of the virtual W meson, and the angle between the $\ell \nu $ and $D\pi $ planes, respectively. As with Eq. (7), for neutral $B^0$ decays the right-hand side of Eq. (75) should have an additional factor $(1+\alpha \pi )$ for the Coulomb attraction in the final state (see for example, Refs. [113, 114]). Other electromagnetic corrections are expected to be smaller, and we will neglect them, keeping only $|\eta _\text {EW}|^2$ and the Coulomb factor.^{Footnote 5} Following our previous estimates of EM effects [22, 53], as well as the HFLAV procedure [1, 115], we use $\eta _{\text {EW}} = 1.0066(50)$ in our calculation.

Belle marginalizes on one variable at a time, integrating (binning) the rest. BaBar’s method consists of a full, four-dimensional analysis without integrating over any variable. The collaboration claims such an analysis is needed to achieve correct results [19]. Nonetheless, both the Belle and BaBar Collaborations give compatible final values of $|V_{cb}|$ in their respective publications [18, 19].
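As a sketch of the marginalization onto w, and as a check on the normalization of Eq. (75), one can integrate over the full angular ranges, assumed here to be $\cos \theta _v,\cos \theta _\ell \in [-1,1]$ and $\chi \in [0,2\pi ]$. The interference terms drop out because $\cos \chi $ and $\cos 2\chi $ integrate to zero, and each squared helicity amplitude picks up the same factor $64\pi /9$, so that

$$\begin{aligned} \frac{d\varGamma }{dw}&= \left| V_{cb}\right| ^2 \left| \eta _\text {EW}\right| ^2 \frac{G_F^2M_B^5}{48\pi ^3} r^3 \sqrt{w^2-1}(1-2wr+r^2) \nonumber \\&\quad \times \left[ H^2_+(w) + H^2_-(w) + H_0^2(w)\right] {\mathcal {B}}(D^*\rightarrow D\pi ). \end{aligned}$$

The prefactor follows from $3/(1024\pi ^4)\times 64\pi /9 = 1/(48\pi ^3)$.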

For Belle, we integrate Eq. (75) in the required bins, and the BGL expressions are introduced in the integrated results. Also, we multiply the right-hand side of Eq. (75) by the Coulomb factor $(1+\alpha \pi )$, because these data are for neutral B mesons only. We perform a combined fit to both the electron and muon modes, instead of averaging them.^{Footnote 6} For BaBar, we fit the lattice-QCD synthetic data with those in Ref. [19]. BaBar publishes results for a BGL fit to their data that includes both neutral and charged B-meson decays. According to Ref. [117], $35.1\%$ of the decays in this data set correspond to $B^0$ decays, and the remaining $64.9\%$ correspond to $B^\pm $ decays. These fractions imply that a Coulomb factor of $(1+0.351\alpha \pi )$ should be applied to the $B^0$ + $B^\pm $ BaBar dataset.
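The size of these Coulomb factors, and their effect on $|V_{cb}|$, can be estimated with a few lines of code. The sketch below assumes the low-energy value of the fine-structure constant and uses hypothetical function names; since the rate is proportional to $|V_{cb}|^2$ times the Coulomb factor, a fixed measured rate implies that $|V_{cb}|$ scales with the inverse square root of that factor.

```python
import math

ALPHA = 1.0 / 137.036  # fine-structure constant (assumed low-energy value)


def coulomb_factor(neutral_fraction: float) -> float:
    """Rate enhancement 1 + f * alpha * pi for a sample with B0 fraction f."""
    return 1.0 + neutral_fraction * ALPHA * math.pi


def vcb_rescaling(neutral_fraction: float) -> float:
    """Factor by which |V_cb| changes when the Coulomb factor is included."""
    return 1.0 / math.sqrt(coulomb_factor(neutral_fraction))


for label, fraction in [("Belle (B0 only)", 1.0), ("BaBar (35.1% B0)", 0.351)]:
    print(f"{label}: factor = {coulomb_factor(fraction):.5f}, "
          f"|V_cb| rescaling = {vcb_rescaling(fraction):.5f}")
```

The correction lowers $|V_{cb}|$ by about one percent for a purely neutral sample and by less than half a percent for the mixed BaBar sample.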

BaBar uses the previous Fermilab-MILC result of $h_{A_1}$ at zero recoil to extract $|V_{cb}|$ [19, 22]. In order to minimize the influence of the lattice-QCD results for $h_{A_1}(1)$ from Ref. [22] in the joint fit, we create synthetic data from Ref. [19] for $|\eta _{EW}|^2|V_{cb}|^2|{\mathcal {F}}(w)|^2$ at five recoil values away from $w=1$, as shown in Fig. 9. Since the BaBar Collaboration employs a linear fit for all form factors, we exhaust the number of degrees of freedom with just five points. Each one of these synthetic data points has a larger impact than each single data point from Belle’s untagged dataset, because the BaBar synthetic data inherit their precision from the precision of the underlying, full dataset. In the end, the green band for Belle in Fig. 9 is noticeably narrower than BaBar’s, so it is expected that the Belle untagged dataset has a larger impact on the final results for $|V_{cb}|$ and $R(D^*)$.

The kinematic constraint in Eq. (72) is included in these fits, and although there are no direct experimental measurements that determine ${\mathcal {F}}_2$, experiments also have an impact on the $d_j$ coefficients through the correlations between this and other form factors. Our preferred fits, coming from quadratic z expansions, are shown in Fig. 9 and Table 12. As in the lattice-QCD-only fits, the fits in Table 12 include 0(1) priors for the quadratic and higher (if applicable) coefficients of each form factor, while leaving the rest of the coefficients unconstrained. Fig. 9 shows that the mean of the lattice estimate falls below the experimental curves, but the errors are large enough that the difference remains at $\approx 2\sigma $. The correlations between the different lattice synthetic data points determine the slope of the decay amplitude very precisely, forcing it to be noticeably larger than what we obtain from our fits to experimental data. The full correlation matrix is provided in the ancillary files, as described in Appendix D. Form factors from this information can be used in phenomenology, under the assumption that only the $\tau $ couples to new physics.

Our final result for $|V_{cb}|$ is obtained from the quadratic BGL fit to the lattice-QCD form factors and both experimental datasets (see the column labeled “Lattice+both” in Table 12), which yields

$$\begin{aligned} \left| V_{cb}\right| = (38.40 \pm 0.78)\times 10^{-3}, \end{aligned}$$

(76)

and a $\chi ^2/\text {dof} =126/84$. This relatively large $\chi ^2/\text {dof}$ indicates tensions among the datasets: a combined fit of Belle and BaBar data, using lattice-QCD input only for normalization, results in a large $\chi ^2/\text {dof}$ of 104/76. It is therefore to be expected that the combined fit would result in a similarly large $\chi ^2/\text {dof}$. Further, we note that our fit to the lattice-QCD form factors only has $\chi ^2/\text {dof} < 1$, as shown in Table 12, which also lists the results of joint fits of lattice-QCD form factors with each experimental dataset separately, as well as with the combined Belle and BaBar data. We find that all joint fits of lattice-QCD and experimental data have $\chi ^2/\text {dof} > 1$, including the one leading to Eq. (76), but the central values of $|V_{cb}|$ do not differ by more than approximately one standard deviation among these fits, and the sizes of the errors are similar. We also see a general agreement in the coefficients of the expansion, particularly in the important low-order ones.
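To put the quoted $\chi ^2/\text {dof}$ values on a common scale, they can be converted into approximate p-values. The sketch below uses the Wilson-Hilferty normal approximation to the $\chi ^2$ distribution rather than the exact survival function, and it assumes Gaussian, correctly estimated errors; the function name is hypothetical.

```python
import math


def chi2_p_value_wh(chi2: float, dof: int) -> float:
    """Approximate upper-tail p-value (Wilson-Hilferty normal approximation)."""
    k = float(dof)
    mean = 1.0 - 2.0 / (9.0 * k)
    sigma = math.sqrt(2.0 / (9.0 * k))
    z = ((chi2 / k) ** (1.0 / 3.0) - mean) / sigma
    return 0.5 * math.erfc(z / math.sqrt(2.0))


for chi2, dof in [(126.0, 84), (104.0, 76)]:
    print(f"chi2/dof = {chi2:.0f}/{dof}: p ~ {chi2_p_value_wh(chi2, dof):.4f}")
```

Under these assumptions both fits have p-values at the percent level or below, in line with the tensions described above.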

Table 12 Quadratic z expansion results. The second column shows results from a fit only to synthetic lattice-QCD data (the same as the “quadratic” column in Table 11), the third, from a joint fit to lattice QCD plus BaBar’s synthetic data, the fourth, from lattice QCD plus Belle’s untagged dataset, the fifth, lattice QCD plus both experiments, and the last, a combined fit of all experimental data using the value of $h_{A_1}(1)$ extracted from our chiral-continuum extrapolation as normalization. The coefficient $c_0$ is fixed by the constraint given in Eq. (72), and it is shown for convenience

Because most previous inclusive and exclusive determinations of $\left| V_{cb}\right| $ omit the Coulomb factor, we also perform the BGL fits without it; the results are collected in Table 13.

Table 13 $\left| V_{cb}\right| $ results for our different BGL fits without including the Coulomb factor

Compared to the results for $\left| V_{cb}\right| $ in Table 12, the central values are shifted by the respective Coulomb factors. They are consistent with previous exclusive determinations, for example $|V_{cb}|_\text {excl}=(39.9\pm 0.9)\times 10^{-3}$ from the PDG [2]. The long-standing tension with inclusive determinations thus remains: $|V_{cb}|_\text {incl}=(42.2\pm 0.8)\times 10^{-3}$ [2].

Since the Belle data are binned in different variables, there is a normalization constraint between the different bins, assuming that they contain the same underlying data. Then only 37 of the 40 bins are truly independent for each mode [116], because the sum of all bins for a particular variable should give the same total number of events. Such constraints should be reflected as zero eigenmodes, or – with rounding errors – very small eigenvalues in the $40\times 40$ statistical correlation matrices. The correlation matrices provided in Ref. [18] are constructed using Monte Carlo simulations, and do not resolve these constraints due to the underlying approximations. We therefore investigate the effect of removing the last bin for each one of the angular variables and reconstructing its value from the total normalization. We find that this procedure correctly introduces the anticipated constraints between the bins, while the values of the reconstructed last bins are compatible with those given in Ref. [18]. Hence, the expected zero eigenvalues in the statistical correlation matrices are recovered. With this procedure our combined Belle + lattice-QCD BGL fit does not yield any significant changes in the final values and uncertainties for $|V_{cb}|$ and $R(D^*)$, but we observe a substantial decrease in $\chi ^2/\text {dof}$ from 111/79 to 96/73. In the case of our joint fit, which includes Belle, BaBar and lattice-QCD data, the $\chi ^2/\text {dof}$ decreases from 126/84 to 109/78. Nevertheless, the results for $|V_{cb}|$ and $R(D^*)$ quoted in this work use the Belle data and correlation matrices as given in Ref. [18].
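The origin of the zero eigenmodes can be seen in a toy example. The sketch below assumes a single variable whose bin counts follow a multinomial distribution with a fixed total; the bin probabilities are hypothetical and the matrices are not those of Ref. [18]. The covariance matrix then annihilates the vector $(1,1,\ldots ,1)$, which is the zero eigenmode associated with the fixed normalization.

```python
def multinomial_covariance(probabilities, n_events):
    """Covariance of bin counts when the total number of events is fixed."""
    size = len(probabilities)
    return [
        [
            n_events
            * probabilities[i]
            * ((1.0 if i == j else 0.0) - probabilities[j])
            for j in range(size)
        ]
        for i in range(size)
    ]


def apply_to_ones(matrix):
    """Product of the matrix with the vector (1, 1, ..., 1)."""
    return [sum(row) for row in matrix]


toy_cov = multinomial_covariance([0.1, 0.2, 0.3, 0.4], n_events=1000)
print(apply_to_ones(toy_cov))  # every entry vanishes up to rounding
```

A covariance matrix estimated from finite Monte Carlo samples need not reproduce this null vector exactly, which is the situation described above.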

The BGL fit to the BaBar data [19] includes fewer coefficients than our BGL fit to the lattice-QCD form factors. We test for the presence of truncation errors by performing BGL fits to the BaBar data including higher coefficients with priors of 0.0(5). This increases the errors in the BaBar data points, most likely because the extra coefficients are completely uncorrelated with the rest of the BaBar data. Because the joint fit to all data is currently dominated by the Belle and lattice-QCD data, the addition of extra coefficients in the BaBar expansion does not change our final results for $|V_{cb}|$ and $R(D^*)$ in a meaningful way. Hence, for our final results quoted in this work, the synthetic data points from Ref. [19] are generated without adding extra coefficients.

In the Belle and BaBar analyses the number of coefficients in the BGL z expansion is limited to exclude those that cannot be properly determined by their data, thus avoiding apparent unitarity violations. This procedure, however, does not account for possible truncation errors. Repeating the fits in Table 12 without the $c_3$ and $d_2$ coefficients yields similar results with reduced errors and much smaller sums for the unitarity constraints.

5.3 Determination of $R(D^*)$

From the fit results in Table 12 we can calculate $R(D^*)$ through direct integration of the differential decay rate over the whole kinematic range. In Fig. 10, we show the differential decay rate as a function of the recoil parameter extracted using lattice-only data (red and brown curves), compared with that of our joint fit. The curves below (maroon and blue) show the differential decay rate for the $\tau $ case. Our final result for $R(D^*)$ from our purely lattice-QCD calculation is

$$\begin{aligned} R(D^*)_{\text {Lat}} = 0.265 \pm 0.013 . \end{aligned}$$

(77)

If we assume that new physics effects are visible only at large lepton masses (i.e., the $\tau $), we can use our joint fit of the lattice and light-lepton experimental data to obtain a more precise SM value of $R(D^*)$. We note that in our joint fit, the curve corresponding to light leptons is determined mainly from experiment, and the one corresponding to the $\tau $ comes mainly from the lattice data. In that case, we obtain

$$\begin{aligned} R(D^*)_{\text {Lat+Exp}} = 0.2484(13), \end{aligned}$$

(78)

where the Coulomb factor is included. Its removal changes neither the central value nor the error significantly. We emphasize, however, that Eq. (77) is the SM prediction, relying only on lattice QCD, while Eq. (78) is also based on the shape information coming from experimental data. In any case, the correlated difference between the two results is $1.3\sigma $. Our values also agree with previous theoretical determinations [20, 21, 118,119,120]. We note that more recent experimental measurements have found $R(D^*)$ to be consistently smaller than before, hence reducing the tension between theory and experiment [1]. The current status of the R(D)-$R(D^*)$ determinations is summarized in Fig. 11.

5.4 Tests

5.4.1 Imposing the constraint at maximum recoil

As we explained above, our preferred analysis does not impose the kinematic constraint in Eq. (73), which is trivially satisfied in the HQET basis of form factors (the $h_X$) used in our chiral-continuum extrapolation. However, the BGL expansion does not naturally incorporate it. Maximum recoil is far from the region where lattice data are available, and there are no experimental data available for this decay with a heavy lepton $\ell = \tau $. Thus, to the extent that the BGL expansion does not match the HQET-basis form factors precisely, we expect small deviations from Eq. (73) in the BGL fit. Such deviations are tolerable because small violations of the constraint do not have any physical consequences, as long as they are within errors. Figure 10 shows that our fits, nonetheless, satisfy the maximum-recoil constraint to within approximately $1\sigma $.

Imposing the constraint in the fit model, we find new values for $|V_{cb}|=38.36(78)\times 10^{-3}$ and $R(D^*)=0.274(10)$, which are compatible with the values obtained in our preferred analysis. It is not surprising that the constraint does not alter the value of $|V_{cb}|$. After all, the CKM matrix element is extracted mainly from the behavior of the form factors at small recoil and does not entail the form factor ${\mathcal {F}}_2$. The error on $R(D^*)$, on the other hand, is slightly reduced by the constraint.

A comparison of the BGL coefficients of the constrained analysis with those of our preferred one is shown in Table 14. We do not find significant changes, and the coefficients in both analyses are compatible with each other within $1\sigma $, although the differences increase with the order of the coefficients. This behavior is expected, as the low-order coefficients are well determined by the data, and the higher-order coefficients become more relevant at maximum recoil. The new information does not improve the quality of the fit in a significant way. This is particularly clear in the pure lattice fit, where the $\chi ^2$ almost doubles, but the number of degrees of freedom increases just from three in the unconstrained fit to four in the constrained one. It appears that the constraint introduces small tensions with the BGL expansion. Because of this, and since both the unconstrained and the constrained fit give compatible results with only small differences, we choose the unconstrained fit as our preferred result.

Table 14 Comparison of results of the z expansion fits with and without the kinematic constraint given in Eq. (73). The largest differences appear in the higher-order coefficients. The coefficient $c_0$ is fixed by the constraint given in Eq. (72), and it is shown for convenience. The Coulomb factors are included in the fits with experimental data input

5.4.2 The $z$ expansion with an improved CLN parametrization

For the sake of completeness, we offer an alternative analysis, replacing the BGL parametrization with CLN. In the CLN parametrization, the form factor $h_{A_1}$ is expressed as a polynomial in z, and the other form factors appear as ratios with respect to $h_{A_1}$:

$$\begin{aligned} R_0&= \frac{1}{1+r}\left( w+1 + w\frac{rh_{A_2}-h_{A_3}}{h_{A_1}} - \frac{h_{A_2}-rh_{A_3}}{h_{A_1}}\right) , \end{aligned}$$

(79)

$$\begin{aligned} R_1&= \frac{h_V}{h_{A_1}}, \end{aligned}$$

(80)

$$\begin{aligned} R_2&= \frac{rh_{A_2}+h_{A_3}}{h_{A_1}}. \end{aligned}$$

(81)

We include a few improvements to address the weak points of CLN. First, we extract the full covariance matrix relating the parameters $\rho ^2_{A_1}$ and $c_{A_1}$ from the original article [8] using the data given for the one-sigma ellipsoids. With the full covariance matrix, we can account for the strong correlations between $\rho ^2_{A_1}$, $c_{A_1}$ and $d_{A_1}$, which allows for small variations of the fixed relations often used in CLN fits. Second, we use updated results for the expansions in $w-1$ of the ratios $R_j$. The form factors to be fit are

$$\begin{aligned} h_{A_1}(z)&= h_{A_1}(1)\Big [1 - 8\rho _{A_1}^2z + \left( 64c_{A_1} - 16\rho _{A_1}^2\right) z^2 \nonumber \\&\quad + \left( 512d_{A_1} + 256c_{A_1} - 24\rho _{A_1}^2\right) z^3\Big ], \end{aligned}$$

(82)

$$\begin{aligned} R_0(w)&= 1.25(35) - 0.183(77)(w-1) \nonumber \\ {}&\quad + 0.063(23)(w-1)^2, \end{aligned}$$

(83)

$$\begin{aligned} R_1(w)&= 1.28(36) - 0.101(51)(w-1) \nonumber \\ {}&\quad + 0.066(24)(w-1)^2, \end{aligned}$$

(84)

$$\begin{aligned} R_2(w)&= 0.740(44) + 0.128(38)(w-1)\nonumber \\ {}&\quad - 0.079(19)(w-1)^2, \end{aligned}$$

(85)

where fitting $R_0$ to experimental data requires measurements of the $\tau $ final state. The full correlation matrix of $\rho ^2_{A_1}$ and $c_{A_1}$ is given in Table 15, and we follow Ref. [8] to calculate $d_{A_1}$. In the following, we refer to the CLN parametrization as “improved” when using Eqs. (82)–(85) and “base” when using the coefficients of the original paper [8].
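For readers who wish to evaluate the improved CLN form, Eq. (82) and the central values of Eq. (84) can be coded directly. The sketch below assumes the standard conformal variable $z=(\sqrt{w+1}-\sqrt{2})/(\sqrt{w+1}+\sqrt{2})$, which vanishes at zero recoil; the function names and the numerical inputs in the usage line are hypothetical placeholders, not fit results.

```python
import math


def z_of_w(w: float) -> float:
    """Conformal variable z for recoil w (z = 0 at zero recoil, w = 1)."""
    root = math.sqrt(w + 1.0)
    return (root - math.sqrt(2.0)) / (root + math.sqrt(2.0))


def h_a1_cln(w, h_a1_1, rho2, c, d):
    """Cubic z polynomial for h_A1, Eq. (82)."""
    z = z_of_w(w)
    return h_a1_1 * (
        1.0
        - 8.0 * rho2 * z
        + (64.0 * c - 16.0 * rho2) * z**2
        + (512.0 * d + 256.0 * c - 24.0 * rho2) * z**3
    )


def r1_central(w):
    """Central value of R_1(w) from Eq. (84), uncertainties ignored."""
    x = w - 1.0
    return 1.28 - 0.101 * x + 0.066 * x * x


# Placeholder inputs for illustration only.
print(z_of_w(1.5), h_a1_cln(1.1, 0.9, 1.2, 1.0, -0.5), r1_central(1.1))
```

Since $z\lesssim 0.06$ over the physical range, the higher-order terms in Eq. (82) are strongly suppressed, which is why the fixed relations among $\rho ^2_{A_1}$, $c_{A_1}$ and $d_{A_1}$ matter mostly at large recoil.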

Fitting our lattice data with the improved CLN parametrization yields a result for $R(D^*)$ that is compatible with Eq. (77), but the $\chi ^2$/dof of the fit increases spectacularly to $\chi ^2/\text {dof}=25.7/1$. A base CLN fit is similarly bad, with $\chi ^2/\text {dof}=31.9/7$. The combined fit using lattice and Belle data again yields a compatible $|V_{cb}|$ and is also very bad, with $\chi ^2/\text {dof}=133.5/80$. Here too the base and the improved versions of CLN are equally incompatible with the lattice-QCD form factors and Belle data. The CLN fits involving only lattice-QCD data violate the kinematic constraint in Eq. (73) by $2.7\sigma $. The combined CLN fits satisfy it at around $1\sigma $. In light of these issues, the BGL parametrization provides a much superior z expansion, so we have chosen it in our main analysis.^{Footnote 7}

One can wonder why the improved CLN fit performs so poorly. One possible point of tension between our data and the improved CLN ansatz is the relationship between the slope, the curvature and the cubic coefficient in $h_{A_1}$, which is much more constrained than in the BGL parametrization. We compare our fit results to only lattice data for both parametrizations by calculating a Taylor expansion around $z=0$ up to cubic order of our BGL result for $h_{A_1}\propto f$, and we present in Fig. 12 contour plots of the CLN priors, the improved CLN fit result, and the BGL fit result. Our BGL results for $\rho _{A_1}^2$, $c_{A_1}$ and $d_{A_1}$ are compatible with our improved CLN results within one sigma, suggesting that the tensions with the improved CLN parametrization come from the other form factors.

Figures 13 and 14 show the results for $h_{A_1}$ and $R_{0,1,2}$ in the BGL and the improved CLN cases, along with the lattice data used in the fit. In general, the BGL fit shows a much better agreement with our lattice data, and the ratios $R_{0,2}$ are poorly fitted with the improved CLN ansatz. Also, the improved CLN fit violates the constraint given by Eq. (73). Imposing the constraint does not improve the fit quality, or reduce the tensions in the $R_{0,2}$ ratios.

Table 15 Correlation matrix relating $\rho ^2_{A_1}$ and $c_{A_1}$ in the CLN parametrization

Following Ref. [108], we advocate either a revision or a deprecation of the CLN parametrization in favor of a truly model-independent parametrization, such as BGL. Even if we had found a good quality of fit with CLN, we would still prefer the BGL results on theoretical grounds. The main reason not to use the CLN parametrization in a first-principles analysis like this one is to test the theoretical assumptions that make CLN different from BGL. CLN imposes constraints through crossing symmetry from the physical region of the cross channel and constraints from heavy-quark effective theory. The BGL parametrization does not. With the latter we are able to verify ex post facto the reliability of these constraints.

Other limitations of the base CLN, such as the need to update its inputs and to treat the errors and correlations of the CLN coefficients more carefully, have been addressed in a few recent papers [119,120,121]. The HQET parametrization discussed in those works also includes corrections of order $1/m_c^2$. We refer the reader to those papers for further discussion.

5.4.3 Comparison with LCSR

We can also test the validity of the light cone sum rules (LCSR), often employed to constrain the form factors at maximum recoil. To this end, we take the latest results from Ref. [122]. They present results for the form factors in a different notation. For the reader’s convenience, we provide the conversion formulae:

$$\begin{aligned} V\,(w_{\text {Max}})&= 0.69(13), \quad V\,(w) = h_V(w)\frac{1+r}{2\sqrt{r}}, \end{aligned}$$

(86)

$$\begin{aligned} A_1(w_{\text {Max}})&= 0.60( 9), \quad A_1(w) = h_{A_1}(w) \frac{(1+w)\sqrt{r}}{1+r}, \end{aligned}$$

(87)

$$\begin{aligned} A_2(w_{\text {Max}})&= 0.51( 9), \nonumber \\ A_2(w)&= \left( rh_{A_2}(w) + h_{A_3}(w)\right) \frac{1+r}{2\sqrt{r}}. \end{aligned}$$

(88)

Our lattice-QCD only results for the aforementioned form factors are:

$$\begin{aligned} V\,(w_{\text {Max}})&= 0.65(10), \end{aligned}$$

(89)

$$\begin{aligned} A_1(w_{\text {Max}})&= 0.608(71), \end{aligned}$$

(90)

$$\begin{aligned} A_2(w_{\text {Max}})&= 0.71(11). \end{aligned}$$

(91)

The agreement is excellent for $A_1$ and V, and the $A_2$ form factor also agrees within $1.4\sigma $.
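The level of agreement can be quantified with a simple pull, assuming that the LCSR and lattice-QCD uncertainties are uncorrelated and Gaussian. The sketch below uses the numbers quoted in Eqs. (86)–(91); the function name is ours.

```python
import math


def pull(value_a, error_a, value_b, error_b):
    """Difference in units of the combined (uncorrelated) standard deviation."""
    return (value_a - value_b) / math.hypot(error_a, error_b)


# (LCSR value, LCSR error, lattice value, lattice error) at w_Max
comparisons = {
    "V": (0.69, 0.13, 0.65, 0.10),
    "A1": (0.60, 0.09, 0.608, 0.071),
    "A2": (0.51, 0.09, 0.71, 0.11),
}
for name, numbers in comparisons.items():
    print(f"{name}: {pull(*numbers):+.2f} sigma")
```

The pulls for V and $A_1$ are well below one standard deviation, while that for $A_2$ is about $1.4\sigma $ in magnitude.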

6 Discussion and outlook

Using the first unquenched lattice-QCD calculation of the form factors describing the decay $B\rightarrow D^*\ell \nu $ at nonzero recoil, together with 2018 Belle [18] and 2019 BaBar [19] measurements, we obtain the following results for the CKM matrix element $|V_{cb}|$:

$$\begin{aligned} |V_{cb}| = (38.40 \pm 0.78) \times 10^{-3}, \end{aligned}$$

(92)

which includes the Coulomb correction for neutral B-meson decays. Omitting this correction leads to $|V_{cb}|=(38.74 \pm 0.78) \times 10^{-3}$, which is the result that should be compared with inclusive decays. The quoted uncertainty is from experiment and theory together. As discussed in Sect. 5.2, this result comes from a fit exhibiting tension among the datasets, $\chi ^2/\text {dof}=126/84$. The tension motivates better experimental measurements and lattice-QCD calculations, but as a $\sim 2\sigma $ effect it is inconclusive. In order to disentangle the contributions from experiment and theory, we run a BGL fit as described in Sect. 5.1, but assuming very small errors on the synthetic lattice-QCD data, and then we run analogous fits assuming very small experimental errors. By looking at how the final error changes, we estimate each contribution, finding $0.34\times 10^{-3}$ from experiment, $0.67\times 10^{-3}$ from lattice QCD, $0.10\times 10^{-3}$ from the truncation of the BGL expansion, and $0.18\times 10^{-3}$ due to EW+EM effects. These partial values are an approximation to guide how much improvement we can expect from future calculations and experiments. The final error in Eq. (92) is an accurate estimate of the full uncertainty.
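As a consistency check on this breakdown, and assuming that the partial contributions are uncorrelated so that they add in quadrature, the sum reproduces the total error in Eq. (92), and the lattice-QCD and truncation pieces combine into a single theoretical error. In units of $10^{-3}$,

$$\begin{aligned} \sqrt{0.34^2 + 0.67^2 + 0.10^2 + 0.18^2} \approx 0.78,\quad \sqrt{0.67^2 + 0.10^2} \approx 0.68. \end{aligned}$$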

Belle II is expected to deliver experimental data for this decay, although until there is a better understanding of its detector performance and the systematics of the experiment, it is not clear how much improvement on the error for $|V_{cb}|$ could come from these high-statistics data [123]. An improved result can be obtained from this or any other forthcoming data, in combination with the synthetic data points and covariance matrix that fully describe the output of our chiral-continuum extrapolation, given in the ancillary files as described in Appendix D.

The main result of this article is the behavior of the form factors parametrizing semileptonic $B\rightarrow D^*\ell \nu $ decays at small, but nonzero recoil. It allows us to perform a much more robust analysis of $|V_{cb}|$ than when using just the value of the decay amplitude at zero recoil. We find excellent agreement between our current and previous results, both from $B\rightarrow D^*\ell \nu $ at zero recoil [22] and $B\rightarrow D\ell \nu $ at nonzero recoil [53]. Our result also agrees with other recent $B\rightarrow D^{(*)}\ell \nu $ exclusive determinations [32, 124]. It is also compatible with the recent extraction based on $B_s\rightarrow D_s^{(*)}\ell \nu $ decays measured by LHCb [125], and with form factors recently calculated by HPQCD [23, 126], but with much smaller errors.

Our result for $|V_{cb}|$ does not change the current status of the inclusive-exclusive puzzle. The inclusive determination was recently updated in [127], slightly reducing the uncertainty, and preliminary results from Belle based on a data-driven approach [128], and compatible with the current inclusive world average [1], have been presented [129]. On the other hand, it has been argued that it is very challenging for BSM physics to accommodate such a tension [3, 4]. A further reduction in errors is necessary to extract conclusions. Inclusive calculations of $|V_{cb}|$ might also benefit in the longer term from recent ideas in lattice QCD [130, 131], which might lead to calculations with very different systematics.

During the implementation of the joint experimental-lattice fits we found issues in both experimental datasets used. The BaBar Collaboration does not provide unfolded data, and their z expansion uses only five coefficients to describe three form factors, as opposed to the nine coefficients we use for the same form factors in our lattice-data-only fit. As a result, we are concerned that the truncation errors in the z expansion could be underestimated in the synthetic data generated from BaBar fits. However, the $|V_{cb}|$ and $R(D^*)$ fits are dominated by the Belle and lattice-QCD data, and the effect of including BaBar data is just a small reduction in the total error in Eq. (92). The Belle Collaboration provides unfolded data, but as pointed out in Ref. [116], the statistical correlation matrices given in Ref. [18] seem inconsistent. We also checked that this potential problem has no significant impact on our results for either $|V_{cb}|$ or $R(D^*)$ (with form factors from the $|V_{cb}|$ fit). Nevertheless, an improvement in the presentation of the data from both collaborations would be very welcome.

Another benefit of the knowledge of all form factors at nonzero recoil is the possibility of calculating $R(D^*)$ from first principles. Our result

$$\begin{aligned} R(D^*)_{\text {Lat}} = 0.265 \pm 0.013 \end{aligned}$$

(93)

reaches a similar precision to that of the $B \rightarrow D\ell \nu $ analysis for R(D). Even though our calculation of $R(D^*)$ involves integrals of extrapolated quantities with large errors, the combined error is relatively small due to the large correlations between ${\mathcal {B}}(B\rightarrow D^*\tau \nu )$ and ${\mathcal {B}}(B\rightarrow D^*\ell \nu )$. In this case, the form factor that enters only in the determination of the decay to a $\tau $ was computed with lattice input only. We have also calculated a more precise value using form factors from the $|V_{cb}|$ fit, $R(D^*)_{\text {Lat+Exp}} = 0.2484(13)$. Our preferred SM value is the one given in Eq. (93), which comes exclusively from lattice QCD and avoids any experimental decay-rate input.
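To put the two quoted values of $R(D^*)$ side by side, the following sketch computes their relative uncertainties and the shift between them. It is plain arithmetic on the numbers given above. The variable names are illustrative, and because both determinations share the same lattice-QCD input they are correlated, so the shift expressed in units of the lattice-only error is only a rough indicator and not a significance.

```python
# Values quoted in the text: Eq. (93) and R(D*)_{Lat+Exp} = 0.2484(13).
r_lat, err_lat = 0.265, 0.013      # lattice QCD only
r_comb, err_comb = 0.2484, 0.0013  # lattice QCD + experiment

rel_lat = err_lat / r_lat      # relative uncertainty, lattice only
rel_comb = err_comb / r_comb   # relative uncertainty, lattice + experiment
shift = r_lat - r_comb         # difference of central values

print(f"lattice only:         {100 * rel_lat:.1f}% relative error")
print(f"lattice + experiment: {100 * rel_comb:.2f}% relative error")
# Rough indicator only: the two determinations are correlated.
print(f"shift / lattice-only error: {shift / err_lat:.1f}")
```

The lattice-only value carries a relative error of about 4.9%, against about 0.5% for the combined determination, and the two central values differ by roughly 1.3 times the lattice-only error.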

The result in Eq. (93) confirms previous theoretical estimates of $R(D^*)$, as well as the current tension between the SM and experiment in the R(D)-$R(D^*)$ plane. Recent experimental determinations of $R(D^*)$ tend to reduce the tension, however. In fact, before Belle published results from their untagged dataset [29], the tension was as large as $4\sigma $, but the newest analysis has reduced it to $3\sigma $, and remaining tensions come mainly because of the influence of the BaBar R(D) result [132]. An updated measurement of R(D) could cast some light on the current tensions. Also, future high-precision experimental measurements from Belle II and LHCb are bound to become critical to determine whether these quantities will agree with the SM in the end.

Together with more precise experimental measurements, lattice-QCD form factors with a smaller uncertainty will also be crucial for shedding light on this theory-experiment tension, as well as on the exclusive–inclusive tension in the determination of $|V_{cb}|$. We expect to reduce the uncertainties in the form factors at nonzero recoil in future work, the first of which is already in progress at the time of this publication. The main sources of uncertainty in this work come from statistics and the quark discretization errors. An improvement in this area will require a modification in both the light and the heavy-quark actions to allow for smaller systematics. Chiral-continuum-extrapolation errors can be reduced by using a better discretization for light quarks and physical pion masses. Another area where we can reduce errors is the renormalization, by using a nonperturbative calculation of the renormalization factors. Further validation of our lattice-QCD result will come with independent analyses currently in progress by other lattice-QCD collaborations [133]. Similar improvements, with an expected reduction of errors, can be applied to our calculation of $B\rightarrow D\ell \nu $ form factors at nonzero recoil [53]. A correlated analysis of both decays will allow a correlated determination of R(D) and $R(D^*)$ that could provide tighter theoretical constraints.

In this work, we also assess the impact of using an improved CLN parametrization to describe the shape of the form factors. Instead of using the fixed coefficients published in Ref. [8], we employ the full covariance matrix that relates the slope and curvature coefficients of the reference form factor using data from the original CLN paper [8], and pass this information as priors to a CLN fit. We also compute the errors in the cubic coefficient, and update the values of the ratios with respect to the reference form factor with values coming from one of the latest HQET calculations [15], assuming a 20% error on each coefficient. Our updated CLN parametrization gives a very similar central value and error bar, compared with that of the BGL parametrization, but the quality of fit decreases greatly when the lattice-QCD data are included. CLN is very restrictive with the shape of certain form factors, and because the lattice-QCD data have relatively small errors, they introduce serious constraints in the parametrization. Our findings reinforce the current consensus of the community [108] to abandon CLN in favor of the more flexible and rigorous BGL parametrization. The impact of using improved HQE parametrizations, such as the one in Refs. [119,120,121], should be nevertheless investigated.

Data Availability Statement

This manuscript has data included as electronic supplementary material. The online version of this article contains supplementary material, which is available to authorized users.

Change history

16 January 2023
An Erratum to this paper has been published: https://doi.org/10.1140/epjc/s10052-022-11153-8

Notes

In the Fermilab formulation, one can adjust $M_1=M_2$ by introducing an asymmetry parameter into the lattice action, but this is not necessary for valence quarks. Adjusting $M_4=M_2$ requires a more improved action [67].
We observe that the local operators have a smaller overlap with the excited states, and we reduce the width of their corresponding priors for $A_{1,2}$ and $B_{1,2}$ accordingly.
See, for instance, Ref. [64].
It is interesting to note that the CLN parametrization does include the constraint at zero recoil given in Eq. (72). The constraint at maximum recoil, Eq. (73), is not imposed in CLN, and indeed does not hold unless the original CLN expressions are modified.
Both the Belle and BaBar experiments use the PHOTOS package to account for low-energy EM radiation; to the best of our knowledge, the EM interactions between charged particles in the final state, described by the Coulomb factor, are not included in PHOTOS.
The systematic correlation matrices given in Ref. [18] do not include off-diagonal blocks for the correlated systematic errors between electron and muon modes. They can be reconstructed from the given data [116], but we do not attempt such a reconstruction in our analysis.
The fits discussed include the Coulomb factor. Its removal does not change the $\chi ^2/\text {dof}$ significantly.
Matching at the next order in $1/m_Q$ is possible in principle but cumbersome beyond the tree level.
These conventions also hold in Minkowski space when using the metric $g^{\mu \nu }=\textrm{diag}(-1,1,1,1)$ [103]. NB: in Minkowski space, $\mu \in \{0,1,2,3\}$; in Euclidean space, $g^{\mu \nu }=\delta ^{\mu \nu }$ and $\mu \in \{1,2,3,4\}$; $x^4=ix^0$.
We overlooked this factor in previous papers [52, 53, 136].

References

Y.S. Amhis et al., Eur. Phys. J. C 81(3), 226 (2021). https://doi.org/10.1140/epjc/s10052-020-8156-7
P.A. Zyla et al., PTEP 2020(8), 083C01 (2020). https://doi.org/10.1093/ptep/ptaa104
A. Crivellin, S. Pokorski, Phys. Rev. Lett. 114(1), 011802 (2015). https://doi.org/10.1103/PhysRevLett.114.011802
M. Jung, D.M. Straub, JHEP 01, 009 (2019). https://doi.org/10.1007/JHEP01(2019)009
C.G. Boyd, B. Grinstein, R.F. Lebed, Nucl. Phys. B 461, 493 (1996). https://doi.org/10.1016/0550-3213(95)00653-2
C.G. Boyd, B. Grinstein, R.F. Lebed, Phys. Lett. B 353, 306 (1995). https://doi.org/10.1016/0370-2693(95)00480-9
C.G. Boyd, B. Grinstein, R.F. Lebed, Phys. Rev. D 56, 6895 (1997). https://doi.org/10.1103/PhysRevD.56.6895
I. Caprini, L. Lellouch, M. Neubert, Nucl. Phys. B 530, 153 (1998). https://doi.org/10.1016/S0550-3213(98)00350-2
B. Aubert et al., Phys. Rev. Lett. 100, 231803 (2008). https://doi.org/10.1103/PhysRevLett.100.231803
N.E. Adam et al., Phys. Rev. D 67, 032001 (2003). https://doi.org/10.1103/PhysRevD.67.032001
B. Aubert et al., Phys. Rev. D 79, 012002 (2009). https://doi.org/10.1103/PhysRevD.79.012002
Y. Amhis, et al. (2012). arXiv:1207.1158
A. Abdesselam, et al. (2017). arXiv:1702.01521
D. Bigi, P. Gambino, S. Schacht, Phys. Lett. B 769, 441 (2017). https://doi.org/10.1016/j.physletb.2017.04.022
D. Bigi, P. Gambino, S. Schacht, JHEP 11, 061 (2017). https://doi.org/10.1007/JHEP11(2017)061
B. Grinstein, A. Kobach, Phys. Lett. B 771, 359 (2017). https://doi.org/10.1016/j.physletb.2017.05.078
S. Jaiswal, S. Nandi, S.K. Patra, JHEP 12, 060 (2017). https://doi.org/10.1007/JHEP12(2017)060
E. Waheed et al., Phys. Rev. D 100(5), 052007 (2019). https://doi.org/10.1103/PhysRevD.100.052007
J. Lees et al., Phys. Rev. Lett. 123(9), 091801 (2019). https://doi.org/10.1103/PhysRevLett.123.091801
P. Gambino, M. Jung, S. Schacht, Phys. Lett. B 795, 386 (2019). https://doi.org/10.1016/j.physletb.2019.06.039
S. Jaiswal, S. Nandi, S.K. Patra, JHEP 06, 165 (2020). https://doi.org/10.1007/JHEP06(2020)165
J.A. Bailey et al., Phys. Rev. D 89(11), 114504 (2014). https://doi.org/10.1103/PhysRevD.89.114504
E. McLean, C.T.H. Davies, A.T. Lytle, J. Koponen, Phys. Rev. D 99(11), 114512 (2019). https://doi.org/10.1103/PhysRevD.99.114512
S. Bifani, S. Descotes-Genon, A. Romero Vidal, M.H. Schune, J. Phys. G 46(2), 023001 (2019). https://doi.org/10.1088/1361-6471/aaf5de
S. Hirose et al., Phys. Rev. Lett. 118(21), 211801 (2017). https://doi.org/10.1103/PhysRevLett.118.211801
S. Hirose et al., Phys. Rev. D 97(1), 012004 (2018). https://doi.org/10.1103/PhysRevD.97.012004
R. Aaij et al., Phys. Rev. Lett. 120(17), 171802 (2018). https://doi.org/10.1103/PhysRevLett.120.171802
R. Aaij et al., Phys. Rev. D 97(7), 072013 (2018). https://doi.org/10.1103/PhysRevD.97.072013
G. Caria et al., Phys. Rev. Lett. 124(16), 161803 (2020). https://doi.org/10.1103/PhysRevLett.124.161803
S. Hashimoto, A.S. Kronfeld, P.B. Mackenzie, S.M. Ryan, J.N. Simone, Phys. Rev. D 66, 014503 (2002). https://doi.org/10.1103/PhysRevD.66.014503
C. Bernard et al., Phys. Rev. D 79, 014506 (2009). https://doi.org/10.1103/PhysRevD.79.014506
J. Harrison, C. Davies, M. Wingate, Phys. Rev. D 97(5), 054502 (2018). https://doi.org/10.1103/PhysRevD.97.054502
G.M. de Divitiis, R. Petronzio, N. Tantalo, Nucl. Phys. B 807, 373 (2009). https://doi.org/10.1016/j.nuclphysb.2008.09.013
S.W. Qiu, C. DeTar, A.X. El-Khadra, A.S. Kronfeld, J. Laiho, R.S. Van de Water, PoS LATTICE2013, 385 (2014). https://doi.org/10.22323/1.187.0385
A. Vaquero Avilés-Casco, C. DeTar, D. Du, A. El-Khadra, A.S. Kronfeld, J. Laiho, R.S. Van de Water, EPJ Web Conf 175, 13003 (2018). https://doi.org/10.1051/epjconf/201817513003
A.V. Avilés-Casco, C. DeTar, A.X. El-Khadra, A.S. Kronfeld, J. Laiho, R.S. Van de Water, PoS LATTICE2018, 282 (2019). https://doi.org/10.22323/1.334.0282
A. Vaquero, C. DeTar, A.X. El-Khadra, A.S. Kronfeld, J. Laiho, R.S. Van de Water, in 17th Conference on Flavor Physics and CP Violation (2019). arXiv:1906.01019
A.V. Avilés-Casco, C. DeTar, A.X. El-Khadra, A.S. Kronfeld, J. Laiho, R.S. Van de Water, PoS LATTICE2019, 049 (2019). https://doi.org/10.22323/1.363.0049
J.A. Bailey et al., Phys. Rev. D 92(1), 014024 (2015). https://doi.org/10.1103/PhysRevD.92.014024
J.A. Bailey et al., Phys. Rev. Lett. 115(15), 152002 (2015). https://doi.org/10.1103/PhysRevLett.115.152002
J.A. Bailey et al., Phys. Rev. D 93(2), 025026 (2016). https://doi.org/10.1103/PhysRevD.93.025026
A. Bazavov et al., Phys. Rev. D 100(3), 034501 (2019). https://doi.org/10.1103/PhysRevD.100.034501
A.F. Falk, M. Neubert, Phys. Rev. D 47, 2965 (1993). https://doi.org/10.1103/PhysRevD.47.2965
J.G. Körner, G.A. Schuler, Z. Phys. C 46, 93 (1990). https://doi.org/10.1007/BF02440838
A. Sirlin, Nucl. Phys. B 196, 83 (1982). https://doi.org/10.1016/0550-3213(82)90303-0
E.S. Ginsberg, Phys. Rev. 171, 1675 (1968). https://doi.org/10.1103/PhysRev.171.1675. https://doi.org/10.1103/PhysRev.174.2169.3 [Erratum: Phys. Rev. 174, 2169 (1968)]
E.S. Ginsberg, Phys. Rev. 162, 1570 (1967). https://doi.org/10.1103/physrev.187.2280.2. https://doi.org/10.1103/PhysRev.162.1570 [Erratum: Phys. Rev. 187, 2280 (1969)]
D. Atwood, W.J. Marciano, Phys. Rev. D 41, 1736 (1990). https://doi.org/10.1103/PhysRevD.41.1736
A.X. El-Khadra, A.S. Kronfeld, P.B. Mackenzie, Phys. Rev. D 55, 3933 (1997). https://doi.org/10.1103/PhysRevD.55.3933
J. Harada, S. Hashimoto, A.S. Kronfeld, T. Onogi, Phys. Rev. D 65, 094514 (2002). https://doi.org/10.1103/PhysRevD.65.094514
A.X. El-Khadra, A.S. Kronfeld, P.B. Mackenzie, S.M. Ryan, J.N. Simone, Phys. Rev. D 64, 014502 (2001). https://doi.org/10.1103/PhysRevD.64.014502
J.A. Bailey et al., Phys. Rev. D 85, 114502 (2012). https://doi.org/10.1103/PhysRevD.85.114502 [Erratum: Phys. Rev. D 86, 039904 (2012)]
J.A. Bailey et al., Phys. Rev. D 92(3), 034506 (2015). https://doi.org/10.1103/PhysRevD.92.034506
A. Bazavov et al., Rev. Mod. Phys. 82, 1349 (2010). https://doi.org/10.1103/RevModPhys.82.1349
C. Aubin, C. Bernard, C. DeTar, J. Osborn, S. Gottlieb, E.B. Gregory, D. Toussaint, U.M. Heller, J.E. Hetrick, R. Sugar, Phys. Rev. D 70, 094505 (2004). https://doi.org/10.1103/PhysRevD.70.094505
C.W. Bernard, T. Burch, K. Orginos, D. Toussaint, T.A. DeGrand, C.E. Detar, S. Datta, S.A. Gottlieb, U.M. Heller, R. Sugar, Phys. Rev. D 64, 054506 (2001). https://doi.org/10.1103/PhysRevD.64.054506
M. Wingate, J. Shigemitsu, C.T.H. Davies, G.P. Lepage, H.D. Trottier, Phys. Rev. D 67, 054505 (2003). https://doi.org/10.1103/PhysRevD.67.054505
A. Bazavov et al., Phys. Rev. D 93(11), 113016 (2016). https://doi.org/10.1103/PhysRevD.93.113016
A. Bazavov et al., Phys. Rev. D 97(3), 034513 (2018). https://doi.org/10.1103/PhysRevD.97.034513
A. Bazavov et al., Phys. Rev. D 81, 114501 (2010). https://doi.org/10.1103/PhysRevD.81.114501
R. Li et al., PoS LATTICE2018, 269 (2019). https://doi.org/10.22323/1.334.0269
A. Bazavov et al., Phys. Rev. D 98(7), 074512 (2018). https://doi.org/10.1103/PhysRevD.98.074512
J.L. Richardson, Phys. Lett. B 82, 272 (1979). https://doi.org/10.1016/0370-2693(79)90753-6
A. Bazavov et al., Phys. Rev. D 85, 114506 (2012). https://doi.org/10.1103/PhysRevD.85.114506
G.P. Lepage, B. Clark, C.T.H. Davies, K. Hornbostel, P.B. Mackenzie, C. Morningstar, H. Trottier, Nucl. Phys. B Proc. Suppl. 106, 12 (2002). https://doi.org/10.1016/S0920-5632(01)01638-3
D. Toussaint, W. Freeman, (2008). arXiv:0808.2211
M.B. Oktay, A.S. Kronfeld, Phys. Rev. D 78, 014504 (2008). https://doi.org/10.1103/PhysRevD.78.014504
B.P.G. Mertens, A.S. Kronfeld, A.X. El-Khadra, Phys. Rev. D 58, 034505 (1998). https://doi.org/10.1103/PhysRevD.58.034505
A.S. Kronfeld, Nucl. Phys. B Proc. Suppl. 53, 401 (1997). https://doi.org/10.1016/S0920-5632(96)00671-8
C. Bernard et al., Phys. Rev. D 83, 034503 (2011). https://doi.org/10.1103/PhysRevD.83.034503
S.J. Brodsky, G.P. Lepage, P.B. Mackenzie, Phys. Rev. D 28, 228 (1983). https://doi.org/10.1103/PhysRevD.28.228
G.P. Lepage, P.B. Mackenzie, Phys. Rev. D 48, 2250 (1993). https://doi.org/10.1103/PhysRevD.48.2250
J.A. Bailey et al., Phys. Rev. D 79, 054507 (2009). https://doi.org/10.1103/PhysRevD.79.054507
J. Harada, S. Hashimoto, K.I. Ishikawa, A.S. Kronfeld, T. Onogi, N. Yamada, Phys. Rev. D 65, 094513 (2002). https://doi.org/10.1103/PhysRevD.71.019903. https://doi.org/10.1103/PhysRevD.65.094513 [Erratum: Phys. Rev. D 71, 019903 (2005)]
C. Aubin, C. Bernard, Phys. Rev. D 73, 014515 (2006). https://doi.org/10.1103/PhysRevD.73.014515
S. Prelovsek, Phys. Rev. D 73, 014506 (2006). https://doi.org/10.1103/PhysRevD.73.014506
C.W. Bernard, C.E. DeTar, Z. Fu, S. Prelovsek, PoS LAT2006, 173 (2006). https://doi.org/10.22323/1.032.0173
C. Bernard, Phys. Rev. D 73, 114503 (2006). https://doi.org/10.1103/PhysRevD.73.114503
C. Bernard, C.E. DeTar, Z. Fu, S. Prelovsek, Phys. Rev. D 76, 094504 (2007). https://doi.org/10.1103/PhysRevD.76.094504
C. Aubin, J. Laiho, R.S. Van de Water, Phys. Rev. D 77, 114501 (2008). https://doi.org/10.1103/PhysRevD.77.114501
C. Bernard, M. Golterman, Y. Shamir, Phys. Rev. D 73, 114511 (2006). https://doi.org/10.1103/PhysRevD.73.114511
Y. Shamir, Phys. Rev. D 75, 054503 (2007). https://doi.org/10.1103/PhysRevD.75.054503
Y. Shamir, Phys. Rev. D 71, 034509 (2005). https://doi.org/10.1103/PhysRevD.71.034509
S. Dürr, PoS LAT2005, 021 (2006). https://doi.org/10.22323/1.020.0021
S.R. Sharpe, PoS LAT2006, 022 (2006). https://doi.org/10.22323/1.032.0022
A.S. Kronfeld, PoS 016, LATTICE2007 (2007). https://doi.org/10.22323/1.042.0016
J. Laiho, R.S. Van de Water, Phys. Rev. D 73, 054501 (2006). https://doi.org/10.1103/PhysRevD.73.054501
A. Anastassov et al., Phys. Rev. D 65, 032003 (2002). https://doi.org/10.1103/PhysRevD.65.032003
J.P. Lees et al., Phys. Rev. D 88(5), 052003 (2013). https://doi.org/10.1103/PhysRevD.88.079902. https://doi.org/10.1103/PhysRevD.88.052003 [Erratum: Phys. Rev. D 88, no. 7, 079902 (2013)]
J.P. Lees et al., Phys. Rev. Lett. 111(11), 111801 (2013). https://doi.org/10.1103/PhysRevLett.111.111801, https://doi.org/10.1103/PhysRevLett.111.169902
W. Detmold, C.J.D. Lin, S. Meinel, Phys. Rev. Lett. 108, 172003 (2012). https://doi.org/10.1103/PhysRevLett.108.172003
K.U. Can, G. Erkol, M. Oka, A. Ozpineci, T.T. Takahashi, Phys. Lett. B 719, 103 (2013). https://doi.org/10.1016/j.physletb.2012.12.050
D. Becirevic, F. Sanfilippo, Phys. Lett. B 721, 94 (2013). https://doi.org/10.1016/j.physletb.2013.03.004
J.M. Flynn, P. Fritzsch, T. Kawanai, C. Lehner, B. Samways, C.T. Sachrajda, R.S. Van de Water, O. Witzel, Phys. Rev. D 93(1), 014510 (2016). https://doi.org/10.1103/PhysRevD.93.014510
F. Bernardoni, J. Bulava, M. Donnellan, R. Sommer, Phys. Lett. B 740, 278 (2015). https://doi.org/10.1016/j.physletb.2014.11.051
W. Detmold, C.J.D. Lin, S. Meinel, Phys. Rev. D 85, 114508 (2012). https://doi.org/10.1103/PhysRevD.85.114508
S. Aoki et al., Eur. Phys. J. C 80(2), 113 (2020). https://doi.org/10.1140/epjc/s10052-019-7354-7
M.E. Luke, Phys. Lett. B 252, 447 (1990). https://doi.org/10.1016/0370-2693(90)90568-Q
O. Ledoit, M. Wolf, J. Empir. Finance 10(5), 603 (2003). https://doi.org/10.1016/S0927-5398(03)00007-0
J. Schaefer, K. Strimmer, Statistical Applications in Genetics and Molecular Biology 4, 32 (2005)
O. Ledoit, M. Wolf, (264) (2017). https://doi.org/10.5167/uzh-139880. http://hdl.handle.net/10419/173422. Accessed 27 May 2021
J.N. Simone. Improved data covariance estimation techniques in lattice QCD (2017). https://cafpe.ugr.es/lattice2017/indico/session/102/contribution/386.html. talk at https://cafpe.ugr.es/lattice2017/. 35th International Symposium on Lattice Field Theory
A.S. Kronfeld, Phys. Rev. D 62, 014505 (2000). https://doi.org/10.1103/PhysRevD.62.014505
A. Bazavov et al., PoS LATTICE2010, 074 (2010). https://doi.org/10.22323/1.105.0074
R. Sommer, Nucl. Phys. B 411, 839 (1994). https://doi.org/10.1016/0550-3213(94)90473-1
C.W. Bernard, T. Burch, K. Orginos, D. Toussaint, T.A. DeGrand, C.E. DeTar, S.A. Gottlieb, U.M. Heller, J.E. Hetrick, B. Sugar, Phys. Rev. D 62, 034503 (2000). https://doi.org/10.1103/PhysRevD.62.034503
D. Arndt, C.J.D. Lin, Phys. Rev. D 70, 014503 (2004). https://doi.org/10.1103/PhysRevD.70.014503
P. Gambino et al., Eur. Phys. J. C 80(10), 966 (2020). https://doi.org/10.1140/epjc/s10052-020-08490-x
C. Bourrely, I. Caprini, L. Lellouch, Phys. Rev. D 79, 013008 (2009). https://doi.org/10.1103/PhysRevD.82.099902 [Erratum: Phys. Rev. D 82, 099902 (2010)]
D. Ferlewicz, P. Urquijo, E. Waheed, Phys. Rev. D 103(7), 073005 (2021). https://doi.org/10.1103/PhysRevD.103.073005
D. Bigi, P. Gambino, Phys. Rev. D 94(9), 094008 (2016). https://doi.org/10.1103/PhysRevD.94.094008
W. Dungel et al., Phys. Rev. D 82, 112007 (2010). https://doi.org/10.1103/PhysRevD.82.112007
S. de Boer, T. Kitahara, I. Nisandzic, Phys. Rev. Lett. 120(26), 261804 (2018). https://doi.org/10.1103/PhysRevLett.120.261804
S. Calí, S. Klaver, M. Rotondo, B. Sciascia, Eur. Phys. J. C 79(9), 744 (2019). https://doi.org/10.1140/epjc/s10052-019-7254-x
Y. Amhis et al., Eur. Phys. J. C 77(12), 895 (2017). https://doi.org/10.1140/epjc/s10052-017-5058-4
C. Bobeth, M. Bordone, N. Gubernari, M. Jung, D. van Dyk, Eur. Phys. J. C 81(11), 984 (2021). https://doi.org/10.1140/epjc/s10052-021-09724-2
J.P. Lees et al., Phys. Rev. Lett. 116(4), 041801 (2016). https://doi.org/10.1103/PhysRevLett.116.041801
S. Fajfer, J.F. Kamenik, I. Nisandzic, Phys. Rev. D 85, 094025 (2012). https://doi.org/10.1103/PhysRevD.85.094025
F.U. Bernlochner, Z. Ligeti, M. Papucci, D.J. Robinson, Phys. Rev. D 95(11), 115008 (2017). https://doi.org/10.1103/PhysRevD.95.115008 [Erratum: Phys. Rev. D 97, 059902 (2018)]
M. Bordone, M. Jung, D. van Dyk, Eur. Phys. J. C 80(2), 74 (2020). https://doi.org/10.1140/epjc/s10052-020-7616-4
M. Bordone, N. Gubernari, D. van Dyk, M. Jung, Eur. Phys. J. C 80(4), 347 (2020). https://doi.org/10.1140/epjc/s10052-020-7850-9
N. Gubernari, A. Kokulu, D. van Dyk, JHEP 01, 150 (2019). https://doi.org/10.1007/JHEP01(2019)150
W. Altmannshofer, et al., PTEP 2019(12), 123C01 (2019). https://doi.org/10.1093/ptep/ptz106 [Erratum: PTEP 2020, 029201 (2020)]
H. Na, C.M. Bouchard, G.P. Lepage, C. Monahan, J. Shigemitsu, Phys. Rev. D 92(5), 054510 (2015). https://doi.org/10.1103/PhysRevD.93.119906. [Erratum: Phys. Rev. D 93, 119906 (2016)]
R. Aaij et al., Phys. Rev. D 101(7), 072004 (2020). https://doi.org/10.1103/PhysRevD.101.072004
E. McLean, C.T.H. Davies, J. Koponen, A.T. Lytle, Phys. Rev. D 101(7), 074513 (2020). https://doi.org/10.1103/PhysRevD.101.074513
M. Bordone, B. Capdevila, P. Gambino, Phys. Lett. B 822, 136679 (2021). https://doi.org/10.1016/j.physletb.2021.136679
M. Fael, T. Mannel, K.K. Vos, JHEP 02, 177 (2019). https://doi.org/10.1007/JHEP02(2019)177
F. Bernlochner, M. Fael, K. Olschewsky, E. Persson, R. van Tonder, K.K. Vos, M. Welsch, JHEP 10, 068 (2022). https://doi.org/10.1007/JHEP10(2022)068
S. Hashimoto, PTEP 2017(5), 053B03 (2017). https://doi.org/10.1093/ptep/ptx052
P. Gambino, S. Hashimoto, Phys. Rev. Lett. 125(3), 032001 (2020). https://doi.org/10.1103/PhysRevLett.125.032001
J. Lees et al., Phys. Rev. Lett. 109, 101802 (2012). https://doi.org/10.1103/PhysRevLett.109.101802
T. Kaneko, Y. Aoki, G. Bailas, B. Colquhoun, H. Fukaya, S. Hashimoto, J. Koponen, PoS LATTICE2019, 139 (2019). https://doi.org/10.22323/1.363.0139
C.K. Chow, M.B. Wise, Phys. Rev. D 48, 5202 (1993). https://doi.org/10.1103/PhysRevD.48.5202
C. Aubin, C. Bernard, Phys. Rev. D 68, 034014 (2003). https://doi.org/10.1103/PhysRevD.68.034014
J.A. Bailey et al., Phys. Rev. Lett. 109, 071802 (2012). https://doi.org/10.1103/PhysRevLett.109.071802
P. Lepage, C. Gohlke, D. Hackett. gplepage/gvar: gvar version 11.9.2 (2021). https://doi.org/10.5281/zenodo.4695132


Acknowledgements

We thank Claude Bernard for early contributions to this project, in particular for laying the foundations for the chiral-continuum extrapolations employed in this analysis. We thank Biplab Dey, Danny van Dyk, Paolo Gambino, Martin Jung, Laurent Lellouch, William Marciano, Christoph Schwanda, Phillip Urquijo, and Eiasha Waheed for useful discussions. We thank Biplab Dey especially for providing additional information on the BaBar $B\rightarrow D^*\ell \nu $ analysis [19], and Martin Jung for his careful reading of the manuscript. We thank Jon Bailey and Chris Bouchard for generating some of the correlator data. Computations for this work were carried out with resources provided by the USQCD Collaboration, the National Energy Research Scientific Computing Center, and the Argonne Leadership Computing Facility, which are funded by the Office of Science of the U.S. Department of Energy; and with resources provided by the National Institute for Computational Science and the Texas Advanced Computing Center, which are funded through the National Science Foundation’s Teragrid/XSEDE Program. This work was supported in part by the U.S. Department of Energy under Awards No. DE-FG02-13ER41976 (D.T.), No. DE-SC0009998 (J.L.), No. DE-SC0010120 (S.G.), and No. DE-SC0015655 (A.X.K., Z.G.); by the U.S. National Science Foundation under Grants No. PHY17-19626 (C.D., A.V.), and No. PHY14-17805 (J.L.); by SRA (Spain) under Grant No. PID2019-106087GB-C21/10.13039/501100011033 (E.G.); by the Consejería de Economía, Innovación, Ciencia y Empleo, Junta de Andalucía (Spain) under Grants No. FQM-101, A-FQM-467-UGR18, and P18-FR-4314 (FEDER) (E.G.); by the Fermilab Distinguished Scholars Program (A.X.K.). This document was prepared by the Fermilab Lattice and MILC Collaborations using the resources of the Fermi National Accelerator Laboratory (Fermilab), a U.S. Department of Energy, Office of Science, HEP User Facility. Fermilab is managed by Fermi Research Alliance, LLC (FRA), acting under Contract No. DE-AC02-07CH11359.

Author information

Authors and Affiliations

Department of Computational Mathematics, Science and Engineering, and Department of Physics and Astronomy, Michigan State University, East Lansing, MI, 48824, USA
A. Bazavov
Department of Physics and Astronomy, University of Utah, Salt Lake City, UT, 84112, USA
C. E. DeTar & A. Vaquero
Department of Physics, Syracuse University, Syracuse, NY, 13244, USA
D. Du & J. Laiho
Department of Physics, University of Illinois, Urbana, IL, 61801, USA
A. X. El-Khadra & Z. Gelzer
Illinois Center for Advanced Studies of the Universe, University of Illinois, Urbana, IL, 61801, USA
A. X. El-Khadra
Departamento de Física Teórica y del Cosmos, Universidad de Granada, 18071, Granada, Spain
E. Gámiz
Department of Physics, Indiana University, Bloomington, IN, 47405, USA
S. Gottlieb
American Physical Society, Ridge, NY, 11961, USA
U. M. Heller
Fermi National Accelerator Laboratory, Batavia, IL, 60510, USA
A. S. Kronfeld, P. B. Mackenzie, J. N. Simone & R. S. Van de Water
Department of Physics, University of California, Santa Barbara, CA, 93106, USA
R. Sugar
Department of Physics, University of Arizona, Tucson, AZ, 85721, USA
D. Toussaint

Authors

A. Bazavov
C. E. DeTar
D. Du
A. X. El-Khadra
E. Gámiz
Z. Gelzer
S. Gottlieb
U. M. Heller
View author publications
You can also search for this author in PubMed Google Scholar
A. S. Kronfeld
View author publications
You can also search for this author in PubMed Google Scholar
J. Laiho
View author publications
You can also search for this author in PubMed Google Scholar
P. B. Mackenzie
View author publications
You can also search for this author in PubMed Google Scholar
J. N. Simone
View author publications
You can also search for this author in PubMed Google Scholar
R. Sugar
View author publications
You can also search for this author in PubMed Google Scholar
D. Toussaint
View author publications
You can also search for this author in PubMed Google Scholar
R. S. Van de Water
View author publications
You can also search for this author in PubMed Google Scholar
A. Vaquero
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A. Vaquero.

Additional information

The original online version of this article was revised. In Table 14 of this article, the data in the column “Lattice QCD + Experiment Unconstrained”, rows a1 and c0, were incorrect. The correct values are -0.147(31) (a1) and 0.002092(37) (c0). The data in the column “Lattice QCD + Experiment Constrained”, rows a0 and c0, were also incorrect. The correct values are 0.0320(10) (a0) and 0.002094(37) (c0).

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Information 1.

Supplementary Information 2.

Appendices

Appendix A: The chiral logs in our chiral-continuum extrapolation

In this Appendix, we give further details on the chiral-continuum extrapolation. In particular, we discuss the staggered version of the chiral logs corresponding to the $B\rightarrow D^*\ell \nu $ case at nonzero recoil.

The chiral logs employed in Eq. (50) are derived following Refs. [87, 134]. Since in this analysis we do not use partially quenched ensembles, the expressions employed are those of full QCD with 2 + 1 flavors. Here we reproduce the expression of the logs for the different form factors for the sake of completeness,

$$\begin{aligned} \mathrm{logs^Y_{SU(3)}}&= \frac{1}{16}\sum _{\Theta }\left( 2{\bar{F}}_{\pi _\Theta }^Y + {\bar{F}}_{K_\Theta }^Y\right) - \frac{1}{2}{\bar{F}}^Y_{\pi _I} + \frac{1}{6}{\bar{F}}^Y_{\eta _I} \nonumber \\&\quad + \sum _{\varXi =V,A} a^2\delta '_\varXi \Bigg (\frac{m^2_{S_\varXi } - m^2_{\pi _\varXi }}{(m^2_{\eta _\varXi } - m^2_{\pi _\varXi })(m^2_{\pi _\varXi } - m^2_{\eta '_\varXi })}{\bar{F}}^Y_{\pi _\varXi } \nonumber \\&\quad + \frac{m^2_{\eta _\varXi } - m^2_{S_\varXi }}{(m^2_{\eta _\varXi } - m^2_{\eta '_\varXi })(m^2_{\eta _\varXi } - m^2_{\pi _\varXi })}{\bar{F}}^Y_{\eta _\varXi } \nonumber \\&\quad + \frac{m^2_{S_\varXi } - m^2_{\eta '_\varXi }}{(m^2_{\eta _\varXi } - m^2_{\eta '_\varXi })(m^2_{\eta '_\varXi } - m^2_{\pi _\varXi })}{\bar{F}}^Y_{\eta '_\varXi }\Bigg ). \end{aligned}$$

(A.1)

In the expression above $Y=A_{1,2,3}$, and for the vector form factor $\mathrm{logs^V_{SU(3)}}=\mathrm{logs^{A_1}_{SU(3)}}$. The index $\Theta $ runs over all pseudoscalar meson fields of the effective theory, whereas $\varXi $ labels vector and axial counterparts. Explicit expressions for the mass of a pseudoscalar meson P with taste $\varXi $, $m_{P_\varXi }$, in terms of the parameters of the rooted staggered theory can be found in Ref. [135]. The hairpin parameters, which come from $\chi $PT disconnected diagrams, are denoted by $\delta '_{V,A}$. Both vector and axial hairpin parameters are defined for the lightest coarse ensemble ($a\approx 0.12$ fm) to be $\delta '_A = -0.28(6)$ and $\delta '_V=0.00(7)$, and their values on the other ensembles are obtained by rescaling these numbers, assuming that the hairpin parameters scale as the root mean square of the taste splittings. The taste splittings along with the tree-level LEC $B_0$ depend solely on the lattice spacing and are given in Table 16. The values for the hairpin parameters, as well as those quoted in Table 16, were determined from fits to light-quark quantities by the MILC Collaboration in Ref. [54]. Our results are insensitive to the errors in the parameters quoted in the table, as well as to the errors in the hairpin parameters, since the chiral logs are a very small contribution to our fit function in the region where we have data.

Table 16 Taste splittings and $B_0$ LEC used to compute the meson masses at tree level in $\chi $PT


The functions appearing in Eq. (A.1) are given by

$$\begin{aligned} {\bar{F}}^Y_j \equiv F^Y(w,m_j,-\varDelta ^{(c)}/m_j), \end{aligned}$$

(A.2)

where $\varDelta ^{(c)}$ is the D-$D^*$ mass splitting that we take from the PDG [2], assuming a charged meson. The error from the PDG uncertainty, even if enlarged to cover a neutral-meson mass difference, also has a negligible impact on the final fit. The $F^Y$ functions are defined as

$$\begin{aligned} F^{A_1}(w,m,x)&= -2\Bigg [I_1(w,m,x) - \frac{1}{2}I_3(w,m,x) \nonumber \\&\quad + (w+1)I_1(w,m,0)\nonumber \\&\quad + (w^2-1)I_2(w,m,0) \nonumber \\&\quad - \frac{5}{2}I_3(w,m,0)\Bigg ], \end{aligned}$$

(A.3)

$$\begin{aligned} F^{A_2}(w,m,x)&= -2\bigg [I_1(w,m,x) + (w+1)I_2(w,m,x) \nonumber \\&\quad - I_1(w,m,0) - (w+1)I_2(w,m,0)\bigg ], \end{aligned}$$

(A.4)

$$\begin{aligned} F^{A_3}(w,m,x)&= -2\Bigg [-(w+1)I_2(w,m,x) \nonumber \\&\quad - \frac{1}{2}I_3(w,m,x) + (w+2) I_1(w,m,0) \nonumber \\&\quad + w(w+1)I_2(w,m,0) - \frac{5}{2}I_3(w,m,0)\Bigg ], \end{aligned}$$

(A.5)

with

$$\begin{aligned} I_j(w,m,x)&= - \Bigg [m^2 x E_j(w) + m^2x^2 \textrm{ln}\left( \frac{m^2}{\varLambda _\chi ^2}\right) G_j(w) \nonumber \\&\quad + m^2x^2F_j(w,x)\Bigg ]. \end{aligned}$$

(A.6)

The functions $E_j(w)$ and $G_j(w)$ are

$$\begin{aligned} E_1(w)&= \frac{\pi }{w+1}, \quad G_1(w) = \frac{r(w)-w}{2(w^2-1)}, \end{aligned}$$

(A.7)

$$\begin{aligned} E_2(w)&= \frac{-\pi }{(w+1)^2}, \quad G_2(w) = \frac{w^2 + 2 - 3wr(w)}{2(w^2-1)^2}, \end{aligned}$$

(A.8)

$$\begin{aligned} E_3(w)&= \pi , \quad G_3(w) = -1. \end{aligned}$$

(A.9)

Here r(w) is

$$\begin{aligned} r(w) = \frac{1}{\sqrt{w^2-1}}\ln (w + \sqrt{w^2-1}). \end{aligned}$$

(A.10)
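The closed-form pieces of Eqs. (A.7)–(A.10) translate directly into code. The following Python sketch evaluates $r(w)$, $E_j(w)$ and $G_j(w)$; the function names and the explicit treatment of $w=1$ (where a Taylor expansion gives $r\rightarrow 1$, $G_1\rightarrow -1/3$ and $G_2\rightarrow 1/5$) are illustrative assumptions rather than part of the analysis code.

```python
import math

def r(w: float) -> float:
    """r(w) of Eq. (A.10) for w >= 1; the w -> 1 limit is 1."""
    if w == 1.0:
        return 1.0
    s = math.sqrt(w * w - 1.0)
    return math.log(w + s) / s

def E(j: int, w: float) -> float:
    """E_j(w) of Eqs. (A.7)-(A.9)."""
    if j == 1:
        return math.pi / (w + 1.0)
    if j == 2:
        return -math.pi / (w + 1.0) ** 2
    if j == 3:
        return math.pi
    raise ValueError("j must be 1, 2 or 3")

def G(j: int, w: float) -> float:
    """G_j(w) of Eqs. (A.7)-(A.9); zero-recoil limits are inserted by hand."""
    if j == 3:
        return -1.0
    if j == 1:
        # Limit from expanding r(w) around w = 1 (assumption of this sketch).
        return -1.0 / 3.0 if w == 1.0 else (r(w) - w) / (2.0 * (w * w - 1.0))
    if j == 2:
        if w == 1.0:
            return 0.2
        return (w * w + 2.0 - 3.0 * w * r(w)) / (2.0 * (w * w - 1.0) ** 2)
    raise ValueError("j must be 1, 2 or 3")
```

Very close to $w=1$ the numerators of $G_1$ and $G_2$ suffer from cancellations, so a series expansion would be the safer choice there.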

The $F_j$ functions in general cannot be expressed in closed form, and are defined as integrals,

(A.11)

(A.12)

(A.13)

where

$$\begin{aligned} a = \frac{x\cos \theta }{\sqrt{1+w\sin 2\theta }}\,. \end{aligned}$$

(A.14)

Appendix B: Estimation of the matching factors and their errors

In this Appendix, we derive the renormalization factors for the vector and axial-vector currents using heavy-quark effective theory (HQET) as an intermediary between lattice gauge theory and continuum QCD [50, 103]. We have calculated the renormalization factors at one loop in perturbation theory, sometimes using an expedient approximation described below. These one-loop results are shown in Table 6. Here, we use the notation $p_B=p$, $v_B=v$, $p_{D^*}=p'$, and $v_{D^*}=v'$.

Appendix B.1: HQET matching

The HQET description of lattice-QCD currents is [50]

$$\begin{aligned} V^\mu&\doteq {\bar{C}}_{V_\parallel }^\text {LGT}(w)v^\mu {\bar{c}}_{v'}b_v + {\bar{C}}_{V_\perp }^\text {LGT}(w){\bar{c}}_{v'}i\gamma ^\mu _\perp b_v \nonumber \\&\quad + {\bar{C}}_{V_{v'}}^\text {LGT}(w){v'_\perp \!}^\mu {\bar{c}}_{v'}b_v , \end{aligned}$$

(B.15)

$$\begin{aligned} A^\mu&\doteq {\bar{C}}_{A_\perp }^\text {LGT}(w){\bar{c}}_{v'}i\gamma ^\mu _\perp \gamma ^5b_v - {\bar{C}}_{A_\parallel }^\text {LGT}(w)v^\mu {\bar{c}}_{v'}\gamma ^5b_v \nonumber \\&\quad - {\bar{C}}_{A_{v'}}^\text {LGT}(w){v'_\perp \!}^\mu {\bar{c}}_{v'}\gamma ^5b_v , \end{aligned}$$

(B.16)

where $\doteq $ means “has the same matrix elements as”. The currents on the left-hand side are lattice operators, while the bilinears on the right-hand side are HQET operators (e.g., built from fields satisfying ). Similarly, HQET can be used to describe continuum-QCD currents.

$$\begin{aligned} {\mathscr {V}}^\mu&\doteq {\bar{C}}_{V_\parallel }(w)v^\mu {\bar{c}}_{v'}b_v + {\bar{C}}_{V_\perp }(w){\bar{c}}_{v'}i\gamma ^\mu _\perp b_v \nonumber \\&\quad + {\bar{C}}_{V_{v'}}(w){v'_\perp \!}^\mu {\bar{c}}_{v'}b_v , \end{aligned}$$

(B.17)

$$\begin{aligned} {\mathscr {A}}^\mu&\doteq {\bar{C}}_{A_\perp }(w){\bar{c}}_{v'}i\gamma ^\mu _\perp \gamma ^5b_v - {\bar{C}}_{A_\parallel }(w)v^\mu {\bar{c}}_{v'}\gamma ^5b_v \nonumber \\&\quad - {\bar{C}}_{A_{v'}}(w){v'_\perp \!}^\mu {\bar{c}}_{v'}\gamma ^5b_v . \end{aligned}$$

(B.18)

The only difference between the lattice and the continuum currents lies in the short-distance coefficients; for Wilson-like fermions, as used here,

$$\begin{aligned} \lim _{a\rightarrow 0} {\bar{C}}_J^\text {LGT}(w) = {\bar{C}}_J(w). \end{aligned}$$

(B.19)

Equations (B.15)–(B.18) are valid at zeroth order in $1/m_Q$, $Q=c,b$, which suffices here.^{Footnote 8}

To continue with the analysis, it is advantageous to write explicitly the velocity and polarization vectors of our mesons. Assuming the B meson to be at rest, and the $D^*$ to have momentum $p'$ along the z axis:

$$\begin{aligned} v&= (0,0,0,i) = p/M_B , \end{aligned}$$

(B.20)

$$\begin{aligned} v'&= (0,0,\sqrt{w^2-1},iw) = p'/M_{D^*}, \end{aligned}$$

(B.21)

$$\begin{aligned} v'_\perp&= (0,0,\sqrt{w^2-1},0) , \end{aligned}$$

(B.22)

$$\begin{aligned} {\hat{v}}'_\perp&\equiv v'_\perp /(v'_\perp \cdot v'_\perp )^{1/2} = (0,0,1,0) , \end{aligned}$$

(B.23)

$$\begin{aligned} \epsilon ^\pm&= ({\varvec{\epsilon }}^\pm ,0,0) , \end{aligned}$$

(B.24)

$$\begin{aligned} \epsilon ^3&= (0,0,w,i\sqrt{w^2-1}) , \end{aligned}$$

(B.25)

$$\begin{aligned} \epsilon ^s&= v', \end{aligned}$$

(B.26)

where ${\varvec{\epsilon }}^\pm $ are two-component unit vectors in the xy plane. For momenta in other directions, such as ${\varvec{v}}'\propto (1,1,0)$ or $(1,-1,1)$, one can rotate the spatial components accordingly. Note that we pick ${\hat{v}}'_\perp $, and then we deduce $\epsilon ^\pm $.

In this notation, $v\cdot v=-1$, $v'\cdot v'=-1$, and $v\cdot v'=-w$.^{Footnote 9} The polarization vectors satisfy ${\bar{\epsilon }}^m\cdot \epsilon ^n=g^{mn}$ for $(m,n)\in \{+,-,3,s\}$ with $g^{mn}=\textrm{diag}(1,1,1,-1)$. The bar on a polarization vector means to complex-conjugate the spatial components, which arises if (as usual) $\epsilon ^\pm $ corresponds to $D^*$ helicity $\pm 1$. (We use linear polarizations with real ${\varvec{\epsilon }}$ but keep the ± notation.) The polarization vectors satisfy $v'\cdot \epsilon ^{\pm ,3}=0$, and also $v\cdot \epsilon ^{\pm }=0$.
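These identities can be checked numerically. The short Python sketch below builds the vectors of Eqs. (B.20)–(B.26) as tuples of complex numbers and contracts them with a plain sum of products (no complex conjugation); the helper names `dot` and `kinematic_vectors`, and the choice of the x and y axes for the two linear transverse polarizations, are illustrative assumptions.

```python
import math

def dot(a, b):
    """Contraction used in this appendix: plain sum of products, no conjugation."""
    return sum(x * y for x, y in zip(a, b))

def kinematic_vectors(w):
    """Velocity and polarization vectors of Eqs. (B.20)-(B.26) for recoil w >= 1."""
    s = math.sqrt(w * w - 1.0)
    return {
        "v": (0, 0, 0, 1j),          # B meson at rest
        "vp": (0, 0, s, 1j * w),     # D* moving along z
        "vp_perp": (0, 0, s, 0),
        "eps_x": (1, 0, 0, 0),       # linear transverse polarizations (assumed axes)
        "eps_y": (0, 1, 0, 0),
        "eps_3": (0, 0, w, 1j * s),  # longitudinal polarization
    }

k = kinematic_vectors(1.1)
print(dot(k["v"], k["v"]), dot(k["vp"], k["vp"]), dot(k["v"], k["vp"]))
```

Up to rounding, the three printed contractions are $-1$, $-1$ and $-w$, and the contraction of $v'$ with $\epsilon ^3$ vanishes.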

These vectors can be used to isolate parts of the currents with different matching factors:

(B.27)

(B.28)

(B.29)

and similarly for ${\mathscr {V}}$ (without the superscript “LGT”) and for A and ${\mathscr {A}}$ with ${\bar{c}}_{v'}\rightarrow {\bar{c}}_{v'}\gamma ^5$.
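As a sketch of how such projections arise, assume that the transverse components are defined by $\gamma ^\mu _\perp = \gamma ^\mu + v^\mu \, v\cdot \gamma $ and ${v'_\perp }^\mu = v'^\mu + v^\mu \, v\cdot v'$. This is an assumption, chosen so that $v\cdot \gamma _\perp = v\cdot v'_\perp = 0$ when $v\cdot v=-1$ and so that Eq. (B.22) is reproduced. Contracting Eq. (B.15) with $v$, ${\hat{v}}'_\perp $ and $\epsilon ^\pm $, and using $v\cdot \epsilon ^\pm = v'_\perp \cdot \epsilon ^\pm = 0$, then gives

$$\begin{aligned} v\cdot V&\doteq -{\bar{C}}_{V_\parallel }^\text {LGT}(w)\,{\bar{c}}_{v'}b_v , \\ {\hat{v}}'_\perp \cdot V&\doteq {\bar{C}}_{V_\perp }^\text {LGT}(w)\,{\bar{c}}_{v'}i{\hat{v}}'_\perp \cdot \gamma \, b_v + \sqrt{w^2-1}\,{\bar{C}}_{V_{v'}}^\text {LGT}(w)\,{\bar{c}}_{v'}b_v , \\ \epsilon ^\pm \cdot V&\doteq {\bar{C}}_{V_\perp }^\text {LGT}(w)\,{\bar{c}}_{v'}i\epsilon ^\pm \cdot \gamma \, b_v , \end{aligned}$$

so that each contraction involves a different combination of short-distance coefficients.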

In HQET, the matrix elements can be worked out with the “trace formalism” [43]:

$$\begin{aligned} \langle D^*|{\bar{c}}_{v'}\varGamma b_v|B\rangle = \sqrt{M_{D^*}M_B}\, \textrm{tr}\left[ \bar{{\mathscr {M}}}_{D^*}\varGamma {\mathscr {M}}_B\right] \xi (w) , \end{aligned}$$

(B.30)

where $\xi (w)$ is a form factor known as the “Isgur-Wise function”, and

(B.31)

and similarly for $\bar{{\mathscr {M}}}_D$ and ${\mathscr {M}}_{D^*}$. Then

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_{D^*}{\mathscr {M}}_B\right]&= 0 , \end{aligned}$$

(B.32a)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_{D^*}i\gamma ^\mu _\perp {\mathscr {M}}_B\right]&= \varepsilon ^{\mu \nu \alpha \beta } {\bar{\epsilon }}_\nu v'_\alpha v_\beta , \end{aligned}$$

(B.32b)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_{D^*}(-\gamma ^5){\mathscr {M}}_B\right]&= i{\bar{\epsilon }}\cdot v ,\end{aligned}$$

(B.32c)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_{D^*}i\gamma ^\mu _\perp \gamma ^5{\mathscr {M}}_B\right]&= -i\left[ (w+1){\bar{\epsilon }}^\mu _\perp + {\bar{\epsilon }}\cdot v\, {v'_\perp }^\mu \right] , \end{aligned}$$

(B.32d)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_D{\mathscr {M}}_B\right]&= w+1 , \end{aligned}$$

(B.32e)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_Di\gamma ^\mu _\perp {\mathscr {M}}_B\right]&= {v'_\perp }^\mu , \end{aligned}$$

(B.32f)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_D(-\gamma ^5){\mathscr {M}}_B\right]&= 0 , \end{aligned}$$

(B.32g)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_Di\gamma ^\mu _\perp \gamma ^5{\mathscr {M}}_B\right]&= 0 , \end{aligned}$$

(B.32h)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_{D^*}{\mathscr {M}}_{B^*}\right]&= (w+1){\bar{\epsilon }}'\cdot \epsilon + {\bar{\epsilon }}'\cdot v\,\epsilon \cdot v' , \end{aligned}$$

(B.32i)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_{D^*}i\gamma ^\mu _\perp {\mathscr {M}}_{B^*}\right]&= v_\perp ^{\prime \,\mu } {\bar{\epsilon }}'\cdot \epsilon - \epsilon _\perp ^\mu {\bar{\epsilon }}'\cdot v - {\bar{\epsilon }}^{\prime \,\mu }_\perp \epsilon \cdot v', \end{aligned}$$

(B.32j)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_{D^*}(-\gamma ^5){\mathscr {M}}_{B^*}\right]&= \varepsilon ^{\nu \rho \alpha \beta }{\bar{\epsilon }}'_\nu \epsilon _\rho v'_\alpha v_\beta , \end{aligned}$$

(B.32k)

$$\begin{aligned} \textrm{tr}\left[ \bar{{\mathscr {M}}}_{D^*}i\gamma ^\mu _\perp \gamma ^5{\mathscr {M}}_{B^*}\right]&= (\delta ^\mu _{\,\tau }+v^\mu v_\tau ) \varepsilon ^{\tau \nu \rho \alpha }{\bar{\epsilon }}'_\nu (v+v')_\alpha \epsilon _\rho . \end{aligned}$$

(B.32l)

With these results, physical combinations of the decompositions in Eqs. (B.15)–(B.18) can be more easily tracked. Note that the sign convention for $\varepsilon $ cancels in ratios introduced below.

These expressions can be used to relate matrix elements of continuum and LGT currents to each other, via HQET and Eqs. (B.15)–(B.18). Thus, $Z_VV$ and $Z_AA$ have the same matrix elements as ${\mathscr {V}}$ and ${\mathscr {A}}$ if one chooses matching factors such that

$$\begin{aligned} {\bar{Z}}_{V_\parallel }(w) v \cdot V&\doteq v \cdot {\mathscr {V}} , \end{aligned}$$

(B.33a)

$$\begin{aligned} {\bar{Z}}_{V_{v'}} (w){\hat{v}}'_\perp \cdot V&\doteq {\hat{v}}'_\perp \cdot {\mathscr {V}} , \end{aligned}$$

(B.33b)

$$\begin{aligned} {\bar{Z}}_{V_\perp } (w)\epsilon ^\pm \cdot V&\doteq \epsilon ^\pm \cdot {\mathscr {V}} , \end{aligned}$$

(B.33c)

$$\begin{aligned} {\bar{Z}}_{A_\parallel }(w) v \cdot A&\doteq v \cdot {\mathscr {A}} , \end{aligned}$$

(B.33d)

$$\begin{aligned} {\bar{Z}}_{A_{v'}} (w){\hat{v}}'_\perp \cdot A&\doteq {\hat{v}}'_\perp \cdot {\mathscr {A}} , \end{aligned}$$

(B.33e)

$$\begin{aligned} {\bar{Z}}_{A_\perp } (w)\epsilon ^\pm \cdot A&\doteq \epsilon ^\pm \cdot {\mathscr {A}} . \end{aligned}$$

(B.33f)

From these requirements one finds

$$\begin{aligned} {\bar{Z}}_{V_\parallel } (w)&= \frac{{\bar{C}}_{V_\parallel }(w)}{{\bar{C}}^\text {LGT}_{V_\parallel }(w)} , \end{aligned}$$

(B.34a)

$$\begin{aligned} {\bar{Z}}_{V_{v'}} (w)&= \frac{{\bar{C}}_{V_\perp }(w)+(w+1){\bar{C}}_{V_{v'}}}{{\bar{C}}^\text {LGT}_{V_\perp }(w)+(w+1){\bar{C}}^\text {LGT}_{V_{v'}}} , \end{aligned}$$

(B.34b)

$$\begin{aligned} {\bar{Z}}_{V_\perp } (w)&= \frac{{\bar{C}}_{V_\perp } (w)}{{\bar{C}}^\text {LGT}_{V_\perp } (w)} , \end{aligned}$$

(B.34c)

$$\begin{aligned} {\bar{Z}}_{A_\parallel } (w)&= \frac{{\bar{C}}_{A_\parallel }(w)}{{\bar{C}}^\text {LGT}_{A_\parallel }(w)} , \end{aligned}$$

(B.34d)

$$\begin{aligned} {\bar{Z}}_{A_{v'}} (w)&= \frac{{\bar{C}}_{A_\perp }(w)+(w-1){\bar{C}}_{A_{v'}}}{{\bar{C}}^\text {LGT}_{A_\perp }(w)+(w-1){\bar{C}}^\text {LGT}_{A_{v'}}} , \end{aligned}$$

(B.34e)

$$\begin{aligned} {\bar{Z}}_{A_\perp } (w)&= \frac{{\bar{C}}_{A_\perp } (w)}{{\bar{C}}^\text {LGT}_{A_\perp } (w)} , \end{aligned}$$

(B.34f).

In anticipation of one-loop perturbative calculations, it is convenient to define

$$\begin{aligned} {\bar{\rho }}_J(w) = \frac{{\bar{Z}}_J(w)}{{\bar{Z}}_{V_\parallel ,{\bar{b}}b}^{1/2}(1) {\bar{Z}}_{V_\parallel ,{\bar{c}}c}^{1/2}(1)} \end{aligned}$$

(B.35)

to cancel conventional field-normalization factors as well as potentially large tadpole diagrams. The denominators can be computed nonperturbatively. With a quantitative method to compute matrix elements, one can obtain the matching factors. For example, one can use quark states and expand them in perturbative QCD.

Appendix B.2: Useful ratios

We start with the original double ratio

$$\begin{aligned} |R_{A_1}(1)|^2 = \frac{ \langle D^*({\varvec{0}},\epsilon )|\epsilon \!\cdot \!A|B({\varvec{0}})\rangle \, \langle B ({\varvec{0}})|{\bar{\epsilon }}\!\cdot \!A|D^*({\varvec{0}},\epsilon )\rangle }{ \langle D^*({\varvec{0}},\epsilon )|v\cdot V|D^*({\varvec{0}},\epsilon )\rangle \langle B ({\varvec{0}})|v\cdot V|B ({\varvec{0}})\rangle } , \nonumber \\ \end{aligned}$$

(B.36)

which requires the matching factor

$$\begin{aligned} {\bar{\rho }}^2_{A_\perp }(1) = \frac{{\bar{Z}}^2_{A_\perp }(1)}{{\bar{Z}}_{V_\parallel ,{\bar{b}}b}(1){\bar{Z}}_{V_\parallel ,{\bar{c}}c}(1)}. \end{aligned}$$

(B.37)

To obtain the w dependence of all four form factors, we define several further ratios:

$$\begin{aligned} Q_{A_1}(w)&= \frac{\langle D^*({\varvec{p}},\epsilon )|\epsilon ^\pm \!\cdot \!A|B({\varvec{0}})\rangle }{ \langle D^*({\varvec{0}},\epsilon )|\epsilon ^\pm \!\cdot \!A|B({\varvec{0}})\rangle }, \end{aligned}$$

(B.38a)

$$\begin{aligned} X_{0} (w)&= \frac{\langle D^*({\varvec{p}},\epsilon )|(-v\!\cdot \!A)|B({\varvec{0}})\rangle }{ \langle D^*({\varvec{p}},\epsilon )|\epsilon ^\pm \!\cdot \!A|B({\varvec{0}})\rangle }, \end{aligned}$$

(B.38b)

$$\begin{aligned} X_{1} (w)&= \frac{\langle D^*({\varvec{p}},\epsilon )|{\hat{v}}'_\perp \!\cdot \!A|B({\varvec{0}})\rangle }{ \langle D^*({\varvec{p}},\epsilon )|\epsilon ^\pm \!\cdot \!A|B({\varvec{0}})\rangle }, \end{aligned}$$

(B.38c)

$$\begin{aligned} X_{V} (w)&= \frac{\langle D^*({\varvec{p}},\epsilon )|\epsilon ^\pm \!\cdot \!V|B({\varvec{0}})\rangle }{ \langle D^*({\varvec{p}},\epsilon )|\epsilon ^\pm \!\cdot \!A|B({\varvec{0}})\rangle }, \end{aligned}$$

(B.38d)

which require, respectively, the matching factors

$$\begin{aligned} \frac{{\bar{Z}}_{A_\perp } (w)}{{\bar{Z}}_{A_\perp }(1)}&= \frac{{\bar{\rho }}_{A_\perp } (w)}{{\bar{\rho }}_{A_\perp }(1)} , \end{aligned}$$

(B.39a)

$$\begin{aligned} \frac{{\bar{Z}}_{A_\parallel }(w)}{{\bar{Z}}_{A_\perp }(w)}&= \frac{{\bar{\rho }}_{A_\parallel }(w)}{{\bar{\rho }}_{A_\perp }(w)} , \end{aligned}$$

(B.39b)

$$\begin{aligned} \frac{{\bar{Z}}_{A_{v'}} (w)}{{\bar{Z}}_{A_\perp }(w)}&= \frac{{\bar{\rho }}_{A_{v'}} (w)}{{\bar{\rho }}_{A_\perp }(w)} , \end{aligned}$$

(B.39c)

$$\begin{aligned} \frac{{\bar{Z}}_{V_\perp } (w)}{{\bar{Z}}_{A_\perp }(w)}&= \frac{{\bar{\rho }}_{V_\perp } (w)}{{\bar{\rho }}_{A_\perp }(w)} , \end{aligned}$$

(B.39d)

Note that the form factor $A_1(q^2)\propto h_{A_1}(w)$, defined in Eq. (87), comes directly from $R_{A_1}Q_{A_1}(w)$, and $V(q^2)\propto h_V(w)$, defined in Eq. (86), comes directly from $R_{A_1}Q_{A_1}(w)X_V(w)$, while the helicity amplitudes $H_0$ and $H_s$ are linear combinations of $R_{A_1}Q_{A_1}(w)X_0(w)$ and $R_{A_1}Q_{A_1}(w)X_1(w)$. The helicity amplitudes $H_\pm $ come from $A_1$ and V, but it is presumably more convenient (or just as convenient) to keep the axial and vector parts separate.

For the dynamic velocity, consider the vector-current matrix element

$$\begin{aligned}&\langle D^*({\varvec{p}},\epsilon ')|{\mathscr {V}}^\mu |D^*({\varvec{0}},\epsilon )\rangle \nonumber \\ {}&\quad = \quad {\bar{\epsilon }}'\cdot \epsilon (p'+p)^\mu f_1(w) \nonumber \\&\qquad + \text {terms proportional to } v'\cdot \epsilon , v\cdot \epsilon ' . \end{aligned}$$

(B.40)

If we take the same transverse polarization, $\epsilon '=\epsilon =\epsilon ^\pm $, for the initial and final $D^*$s, then

$$\begin{aligned} \langle D^*({\varvec{p}})|(-v \cdot {\mathscr {V}})|D^*({\varvec{0}})\rangle&= M_{D^*} (w + 1) f_1(w) , \end{aligned}$$

(B.41)

$$\begin{aligned} \langle D^*({\varvec{p}})|{\hat{v}}'_\perp \cdot {\mathscr {V}} |D^*({\varvec{0}})\rangle&= M_{D^*} \sqrt{w^2-1} f_1(w) , \end{aligned}$$

(B.42)

where ${\hat{v}}'_\perp $, defined in Eq. (B.23), is a unit vector in the direction of ${\varvec{p}}$, such as (0, 0, 1) or $(1,1,0)/\sqrt{2}$. On the right-hand side of the second equation, $\sqrt{w^2-1}$ is nothing but $|v'_\perp |$, the magnitude of $v'_\perp $; that is, $v'_\perp =\sqrt{w^2-1}{\hat{v}}'_\perp $. Because ${\mathscr {V}}$ is properly normalized, it measures the flavor charge, so the form factor satisfies $f_1(1)=1$.

Using the trace formalism, it is easy to show

$$\begin{aligned} \langle D^*({\varvec{p}})|(-v \cdot V)|D^*({\varvec{0}})\rangle&= M_{D^*} (w + 1) {\bar{C}}_{V_\parallel }^\text {LGT}(w)\xi (w) , \end{aligned}$$

(B.43)

$$\begin{aligned} \langle D^*({\varvec{p}})|{\hat{v}}'_\perp \cdot V |D^*({\varvec{0}})\rangle&= M_{D^*} \sqrt{w^2-1} \nonumber \\& \left[ {\bar{C}}_{V_\perp }^\text {LGT}(w) + (w+1){\bar{C}}_{V_{v'}}^\text {LGT}(w)\right] \xi (w) . \end{aligned}$$

(B.44)

Taking the ratio, the form factor drops out:

$$\begin{aligned}&\frac{\langle D^*({\varvec{p}})|{\hat{v}}'_\perp \cdot V|D^*({\varvec{0}})\rangle }{\langle D^*({\varvec{p}})|(-v\cdot V)|D^*({\varvec{0}})\rangle } \nonumber \\&\quad = \frac{|v'_\perp |}{w+1} \frac{{\bar{C}}_{V_\perp }^\text {LGT}(w) + (w+1){\bar{C}}_{V_{v'}}^\text {LGT}(w)}{{\bar{C}}_{V_\parallel }^\text {LGT}(w)}, \end{aligned}$$

(B.45)

but a matching factor remains.^{Footnote 10} The analogous equations for ${\mathscr {V}}$ show that $f_1(w)={\bar{C}}_{V_\parallel }(w)\xi (w)$ and ${\bar{C}}_{V_\parallel }(w)={\bar{C}}_{V_\perp }(w) + (w+1){\bar{C}}_{V_{v'}}(w)$.

Appendix B.3: Expedient approximation

Calculating the full w dependence of the ${\bar{Z}}_J$ at the one-loop level is, in general, very cumbersome. From Ref. [50], however, we have

$$\begin{aligned} \lim _{m_ca\rightarrow 0} {\bar{Z}}_{J_\parallel }(w)&= Z_{J_\parallel } , \end{aligned}$$

(B.46)

$$\begin{aligned} \lim _{m_ca\rightarrow 0} {\bar{Z}}_{J_\perp } (w)&= Z_{J_\perp } , \end{aligned}$$

(B.47)

$$\begin{aligned} \lim _{m_ca\rightarrow 0} {\bar{Z}}_{J_\perp } {\bar{C}}^\text {LGT}_{J_{v'}}(w)&= Z_{J_\perp } {\bar{C}}_{J_{v'}}(w) \nonumber \\&\Rightarrow \lim _{m_ca\rightarrow 0} {\bar{Z}}_{J_{v'}} (w) = Z_{J_\perp } . \end{aligned}$$

(B.48)

Then we can approximate,

$$\begin{aligned} \frac{{\bar{\rho }}_{A_\perp } (w)}{{\bar{\rho }}_{A_\perp }(1)}&\approx 1 \pm \alpha _V(q^*) \rho ^{[1]}_\text {max}(w-1)m_{2c}a , \end{aligned}$$

(B.49a)

$$\begin{aligned} \frac{{\bar{\rho }}_{A_\parallel }(w)}{{\bar{\rho }}_{A_\perp }(w)}&\approx \frac{\rho _{A_\parallel }}{\rho _{A_\perp }} \pm \alpha _V(q^*) \rho ^{[1]}_\text {max}m_{2c}a , \end{aligned}$$

(B.49b)

$$\begin{aligned} \frac{{\bar{\rho }}_{A_{v'}} (w)}{{\bar{\rho }}_{A_\perp }(w)}&\approx 1 \pm \alpha _V(q^*) \rho ^{[1]}_\text {max}m_{2c}a , \end{aligned}$$

(B.49c)

$$\begin{aligned} \frac{{\bar{\rho }}_{V_\perp } (w)}{{\bar{\rho }}_{A_\perp }(w)}&\approx \frac{\rho _{V_\perp } }{\rho _{A_\perp }} \pm \alpha _V(q^*) \rho ^{[1]}_\text {max}m_{2c}a , \end{aligned}$$

(B.49d)

where $\rho ^{[1]}_\text {max} = 0.352$ is the largest one-loop coefficient that we find among the computable one-loop coefficients. For lack of a better choice, we set $q^*=2/a$, as in other papers.
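The size of the uncertainty terms in Eqs. (B.49a)–(B.49d) is simple to evaluate once $\alpha _V(q^*)$ and $m_{2c}a$ are known for an ensemble. The Python sketch below does this arithmetic; the function names are illustrative, and the numerical inputs in the usage lines are hypothetical placeholders, not values from this analysis.

```python
RHO_MAX = 0.352  # largest one-loop coefficient quoted in the text

def ratio_uncertainty(alpha_v: float, m2c_a: float) -> float:
    """Half-width alpha_V(q*) * rho_max * m_2c a of Eqs. (B.49b)-(B.49d)."""
    return alpha_v * RHO_MAX * m2c_a

def recoil_ratio_uncertainty(alpha_v: float, m2c_a: float, w: float) -> float:
    """Half-width of Eq. (B.49a), which carries an extra factor (w - 1)."""
    return ratio_uncertainty(alpha_v, m2c_a) * (w - 1.0)

# Hypothetical inputs, for illustration only.
alpha_v, m2c_a = 0.3, 0.5
print(ratio_uncertainty(alpha_v, m2c_a))
print(recoil_ratio_uncertainty(alpha_v, m2c_a, w=1.1))
```

The extra factor of $w-1$ makes the uncertainty on the ratio in Eq. (B.49a) vanish at zero recoil, where that ratio is exactly one by construction.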

In the limit $m_ca\rightarrow 0$, the matching factor in the velocity tends to 1. We could use an uncertainty like that in Eq. (B.49c), but that seems to be an unnecessary complication. A little algebra shows that the mismatch in w is very small:

$$\begin{aligned} w \approx w^\text {LGT} \pm \alpha _V(q^*) \rho ^{[1]}_\text {max}(w^2-1)m_{2c}a, \end{aligned}$$

(B.50)

while the mismatch in z is

$$\begin{aligned} z \approx z^\text {LGT} \left[ 1 \pm \alpha _V(q^*) 2\rho ^{[1]}_\text {max}\frac{1+z}{1-z}m_{2c}a\right] . \end{aligned}$$

(B.51)

Appendix C: Heavy quark mistuning corrections procedure

Table 17 Simulation values $\kappa '_c$ and $\kappa '_b$ employed in the calculation of the form factors compared with their more precisely tuned values $\kappa _c$ and $\kappa _b$ [22]. The first error in $\kappa _b$ and $\kappa _c$ includes statistical and fitting contributions; the second one comes from scale setting


Table 18 Ensemble and kappa parameters used to calculate the mistuning correction [22]. The values in bold are the uncorrected simulation values for this ensemble. Not all combinations are available, since we change only one quark mass at a time while keeping the other quark mass fixed to its uncorrected value


We tune the heavy-quark masses ($\kappa _b$ and $\kappa _c$) for clover quarks in the Fermilab interpretation using the procedure described in Ref. [22]. In brief, in a mass-independent scheme for the lattice scale, set by $r_1 = 0.3117(22)$ fm [64], we tune the heavy-quark masses so that the kinetic masses $M_2$ of the $B_s$ and $D_s$ mesons agree with their experimental values [64, 70]. Historically, the tuning was done in two stages. A preliminary, lower-statistics study set the heavy-quark masses used in the simulation to generate the two- and three-point correlators in the present study. A subsequent, higher-statistics study was then possible, and it gave slightly different kappa values, as shown in Table 17. We denote the simulation values with $\kappa _c^\prime $ and $\kappa _b^\prime $ and the refined values with $\kappa _c$ and $\kappa _b$. As in Ref. [22], we proceed to correct our results for the slight mistuning and, in the process, estimate the systematic error associated with the correction.

Table 19 Results of the $\kappa $ correction fits


Table 20 Effect of the mistuning adjustments in the form factors and the recoil parameter of the $a\approx 0.12$ fm ensemble used to compute the correction. Only the final error is shown in the table. The error for w seems to decrease with the correction, but the relevant quantity is $w-1$, for which the error either stays the same or increases. The correction (and its error) is highly correlated with $w-1$, as it is directly proportional to it


The correction procedure is based on a single $a\approx 0.12$ fm ensemble. A full set of two- and three-point correlation functions was calculated at slightly shifted values of $\kappa _b^\prime $ and $\kappa _c^\prime $, as shown in Table 18. The shifted values were chosen close to both the corrected and simulation kappa values. With these data, we then estimate the derivatives of the form factors and the recoil parameter with respect to the heavy-quark masses. By expressing these results in dimensionless terms, we can extrapolate them to other ensembles with different light-quark masses and lattice spacings. As a departure from Ref. [53], we perform the correction after the renormalization factors have been applied.

The first step is to construct the combinations of ratios we need to correct. Clever combinations can remove a dependence on the recoil parameter or vanish at zero recoil. We can use this information to improve the precision of the corrections. In this case, we define the following quantities:

$$\begin{aligned} A&= \,(1-x_f^2) R_{A_1}, \end{aligned}$$

(C.52)

$$\begin{aligned} V&= \,\frac{X_V}{x_f}, \end{aligned}$$

(C.53)

$$\begin{aligned} B_0&= \,\frac{X_0}{x_f}, \end{aligned}$$

(C.54)

$$\begin{aligned} B_1&= \,\frac{X_1 - 1}{2x_f^2}, \end{aligned}$$

(C.55)

$$\begin{aligned} C_1&= \,\frac{X_1 + 1}{2}. \end{aligned}$$

(C.56)

At zero recoil we expect $C_1\rightarrow 1$, since $X_1\rightarrow 1$ in that limit. Although $x_f$ vanishes when $w\rightarrow 1$, the combinations are designed to give a finite value at zero recoil. In fact, it is easy to reconstruct the form factors using these building blocks,

$$\begin{aligned} h_{A_1}&= \,A, \end{aligned}$$

(C.57)

$$\begin{aligned} h_V&= \,AV, \end{aligned}$$

(C.58)

$$\begin{aligned} h_{A_2}&= \,A(C_1 + B_1 - B_0), \end{aligned}$$

(C.59)

$$\begin{aligned} h_{A_3}&= \,A(C_1 - B_1). \end{aligned}$$

(C.60)
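Equations (C.57)–(C.60) amount to a few multiplications, which the following Python sketch spells out. The container `Blocks` and the function `form_factors` are illustrative names, and the numbers in the usage lines are hypothetical placeholders rather than lattice results.

```python
from dataclasses import dataclass

@dataclass
class Blocks:
    """Building blocks of Eqs. (C.52)-(C.56) at one value of the recoil."""
    A: float
    V: float
    B0: float
    B1: float
    C1: float

def form_factors(b: Blocks) -> dict:
    """Reconstruct the HQET form factors following Eqs. (C.57)-(C.60)."""
    return {
        "hA1": b.A,
        "hV": b.A * b.V,
        "hA2": b.A * (b.C1 + b.B1 - b.B0),
        "hA3": b.A * (b.C1 - b.B1),
    }

# Hypothetical inputs, for illustration only.
print(form_factors(Blocks(A=0.9, V=1.3, B0=0.2, B1=0.1, C1=1.0)))
```

Because every form factor is proportional to A, a correction to A propagates multiplicatively to all four, while corrections to the other blocks affect only the ratios $h_X/h_{A_1}$.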

The last quantity corrected is the recoil parameter, defined as in Eq. (21), with a trivial dependence on $\kappa _b$.

We estimate the derivative by taking finite differences between the observables in Eqs. (C.52)–(C.56) calculated in the standard run and one of the correction runs listed in Table 18. The derivative is taken with respect to $\xi _\alpha = 1/am_{2\alpha }$ with $\alpha =b,c$, where the kinetic mass $am_{2\alpha }$ is defined as

$$\begin{aligned} \frac{1}{am_{2\alpha }} = \frac{2}{am_{0\alpha }(2+am_{0\alpha })} + \frac{1}{1+am_{0\alpha }}, \end{aligned}$$

(C.61)

and the bare quark mass is calculated using the tadpole-improved, tree-level formula

$$\begin{aligned} am_{0\alpha } = \frac{1}{u_0}\left( \frac{1}{2\kappa _\alpha } - \frac{1}{2\kappa _{cr}}\right) . \end{aligned}$$

(C.62)

Here $u_0$ is the tadpole parameter, and $\kappa _{cr}$ is the value of $\kappa $ at which the clover-quark pion becomes massless. We calculate the derivatives of each observable for each available value of the recoil parameter. When computing the correction for the b quark, since the $\kappa '_b=0.860$ simulation value is so close to the tuned value of $\kappa _b$, we do not use the data coming from $\kappa '_b=0.820$.
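The chain from $\kappa _\alpha $ to $\xi _\alpha = 1/am_{2\alpha }$ in Eqs. (C.61) and (C.62) can be sketched in a few lines of Python. The function names are illustrative, and the values of $\kappa $, $\kappa _{cr}$ and $u_0$ in the usage line are hypothetical placeholders, not parameters of any ensemble used here.

```python
def bare_mass(kappa: float, kappa_cr: float, u0: float) -> float:
    """Tadpole-improved tree-level bare mass a*m_0 of Eq. (C.62)."""
    return (1.0 / (2.0 * kappa) - 1.0 / (2.0 * kappa_cr)) / u0

def inverse_kinetic_mass(am0: float) -> float:
    """1/(a*m_2) as a function of a*m_0, Eq. (C.61)."""
    return 2.0 / (am0 * (2.0 + am0)) + 1.0 / (1.0 + am0)

def xi(kappa: float, kappa_cr: float, u0: float) -> float:
    """xi = 1/(a*m_2), the variable with respect to which derivatives are taken."""
    return inverse_kinetic_mass(bare_mass(kappa, kappa_cr, u0))

# Hypothetical inputs, for illustration only.
print(xi(kappa=0.1, kappa_cr=0.125, u0=1.0))
```

For small $am_0$ the first term of Eq. (C.61) dominates and $1/am_2\approx 1/am_0$, while a smaller $\kappa $ (heavier quark) gives a smaller $\xi $.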

In general, the calculated derivatives are small for all recoil values, and the errors are large enough that we can consider a linear dependence on $w-1$ for all derivatives. Nonetheless, we can exploit some good properties of our observables in order to reduce the number of free parameters. For instance, $dC_1/d\xi _\alpha (w=1) = 0$, since $C_1=1$ in that limit, so we can set the constant term to zero. Also, the derivatives of $V,B_{0,1}$ change very little with $w-1$ compared with their errors. Thus, we can safely assume these derivatives behave as constants in the small recoil range we are considering. In fact, these quantities are derived from ratios of form factors $h_X/h_{A_1}$, which should depend only weakly on w. These simplifications are not only welcome but necessary: for $V,B_{0,1},C_1$ we have only two values of the recoil parameter available, so a one-parameter fit leaves one degree of freedom. The observable A is not a ratio per se, but it is calculated at zero recoil as well, and a clear dependence on w arises. Finally, it is obvious that $dw/d\xi _c(w=1) = 0$, and we can apply the same treatment as for $C_1$. Summarizing, we fit our data to the expressions,

$$\begin{aligned} \frac{dw}{d\xi _c}&= \,a_{w, c} (w-1), \end{aligned}$$

(C.63)

$$\begin{aligned} \frac{dA}{d\xi _c}&= \,a_{A, c} (w-1) + b_{A,c}, \end{aligned}$$

(C.64)

$$\begin{aligned} \frac{dV}{d\xi _c}&= \,b_{V, c}, \end{aligned}$$

(C.65)

$$\begin{aligned} \frac{dB_0}{d\xi _c}&= \,b_{B_0,c}, \end{aligned}$$

(C.66)

$$\begin{aligned} \frac{dB_1}{d\xi _c}&= \,b_{B_1,c}, \end{aligned}$$

(C.67)

$$\begin{aligned} \frac{dC_1}{d\xi _c}&= \,a_{C_1,c} (w-1), \end{aligned}$$

(C.68)

$$\begin{aligned} \frac{dw}{d\xi _b}&= \,0, \end{aligned}$$

(C.69)

$$\begin{aligned} \frac{dA}{d\xi _b}&= \,a_{A, b} (w-1) + b_{A,b}, \end{aligned}$$

(C.70)

$$\begin{aligned} \frac{dV}{d\xi _b}&= \,b_{V, b}, \end{aligned}$$

(C.71)

$$\begin{aligned} \frac{dB_0}{d\xi _b}&= \,b_{B_0,b}, \end{aligned}$$

(C.72)

$$\begin{aligned} \frac{dB_1}{d\xi _b}&= \,b_{B_1,b},\end{aligned}$$

(C.73)

$$\begin{aligned} \frac{dC_1}{d\xi _b}&= \,a_{C_1,b} (w-1). \end{aligned}$$

(C.74)

All fits are unconstrained and result in a p value exceeding 0.5. Our final corrections are gathered in Table 19. These values are used to correct the form factors for all ensembles. As an example, we show in Table 20 how this tuning modifies the calculated form factors for the ensemble used in the correction.
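The logic of the correction can be sketched as follows: a finite difference between the standard run and a shifted run estimates the derivative, the derivative is modeled as $a(w-1)+b$ as in Eqs. (C.63)–(C.74), and the observable is then shifted to the tuned mass. The first-order (linear) shift in $\xi $ used in `corrected` is an assumption of this sketch, as are all function names and the numbers in the usage lines.

```python
def finite_difference(obs_sim, obs_shifted, xi_sim, xi_shifted):
    """Estimate dO/dxi from the standard run and one shifted-kappa run."""
    return (obs_shifted - obs_sim) / (xi_shifted - xi_sim)

def derivative_model(w, a=0.0, b=0.0):
    """Fit form a*(w - 1) + b; set a = 0 or b = 0 to match Eqs. (C.63)-(C.74)."""
    return a * (w - 1.0) + b

def corrected(obs_sim, w, xi_sim, xi_tuned, a=0.0, b=0.0):
    """Assumed first-order shift of an observable from simulated to tuned xi."""
    return obs_sim + derivative_model(w, a, b) * (xi_tuned - xi_sim)

# Hypothetical inputs, for illustration only.
slope = finite_difference(obs_sim=1.000, obs_shifted=1.004, xi_sim=0.50, xi_shifted=0.52)
print(slope)
print(corrected(obs_sim=1.0, w=1.1, xi_sim=0.50, xi_tuned=0.52, a=2.0, b=0.1))
```

Because the derivatives are expressed in terms of the dimensionless variable $\xi _\alpha $, the same fitted coefficients can be reused on ensembles with other lattice spacings.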

Appendix D: Full covariance matrix of the chiral-continuum extrapolation results

Future work with the lattice-QCD form factors computed here can start with the synthetic data generated for the z expansion. We provide these data for the form factors in the BGL and HQET notation, along with their correlation matrix, in the ancillary files included in this paper. We also include the values and correlation matrix of the BGL z expansion coefficients resulting from our preferred fits from Sects. 5.1.1 and 5.2, using only lattice-QCD data (the quadratic fit in Table 11) and lattice QCD plus experimental data (the last column in Table 12). Our BGL lattice-QCD-only fit can be reproduced by fitting the included synthetic data in the BGL notation to the BGL expansion described in Sect. 5.1, and using the constraint at zero recoil given in Eq. (72) to remove $c_0$.

Appendix D.1: Reading the data

The data are provided in Python format using the gvar package [137]. The synthetic data points can be read from the file SynthData.PyDat using the following code:

After execution, data holds a dictionary whose keys are in the format F(w), where F $=$ g, f, F1 or F2 for the g, f, ${\mathcal {F}}_1$ and ${\mathcal {F}}_2$ form factors in the BGL notation, or F $=$ hA1, hA2, hA3 or V for the $h_{A_i}$ and $h_V$ form factors in the HQET notation, and w is the recoil parameter, $w = 1.03, 1.10, 1.17$. The $h_X$ form factors are also available at zero recoil, but the extra data point at $w = 1.00$ is not independent and should be used only if one of the other available points is removed. The last line stores the correlations between the different synthetic data points in a dictionary called corr. One can access any correlation by invoking the pair corr[F(w1),G(w2)], where F and G are the two relevant form factors, and w1 and w2 are the recoil parameters at which the correlation should be evaluated.
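As an illustration of the key convention, the sketch below builds `F(w)` keys and looks up one correlation. The exact string form of the keys (here `hA1(1.10)`, with two decimals) is an assumption and should be checked against `data.keys()`. The helper names and the numerical entries of the stand-in `corr` dictionary are placeholders, not values from the ancillary files.

```python
def synth_key(form_factor, w):
    """Build a key of the form F(w), assuming w is written with two decimals."""
    return f"{form_factor}({w:.2f})"


def correlation(corr, f1, w1, f2, w2):
    """Return corr[F(w1), G(w2)] from a doubly indexed correlation dictionary."""
    return corr[synth_key(f1, w1), synth_key(f2, w2)]


# Stand-in for the dictionary returned by gvar.evalcorr (placeholder numbers).
corr = {
    ("g(1.03)", "g(1.03)"): 1.0,
    ("g(1.03)", "f(1.10)"): 0.25,
    ("f(1.10)", "g(1.03)"): 0.25,
    ("f(1.10)", "f(1.10)"): 1.0,
}

print(synth_key("hA1", 1.10))
print(correlation(corr, "g", 1.03, "f", 1.10))
```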

The z-fit results are stored in the file FitResults.PyDat and can be read following the same procedure. The dictionary keys in this case are in the format FIT_xj, where FIT can be either LQCD for the lattice-QCD-only fit, or LQCDEXP for the fit including lattice-QCD and experimental data, and xj = a0, a1... are the different coefficients of the z expansion $a_0$, $a_1$... Our final result for $|\eta _{\text {EW}}|^2|V_{cb}|^2$ is stored under the key LQCDEXP_eVcb, and the results for $R(D^*)$ are stored as LQCD_RDst and LQCDEXP_RDst.
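The `FIT_xj` convention can be exercised in the same spirit. In the sketch below, `results` stands in for the dictionary read from FitResults.PyDat. Its numerical entries are placeholders, not our fit results, and `collect_coefficients` is a hypothetical helper introduced for illustration. It scans the keys rather than counting up from `j = 0`, so a coefficient that is absent from the file does not cut the series short.

```python
import re


def collect_coefficients(results, fit, letter):
    """Return {j: value} for every stored key of the form FIT_xj of one fit."""
    pattern = re.compile(rf"^{re.escape(fit)}_{re.escape(letter)}(\d+)$")
    found = {}
    for key, value in results.items():
        match = pattern.match(key)
        if match:
            found[int(match.group(1))] = value
    return dict(sorted(found.items()))


# Stand-in for the loaded fit results (placeholder numbers only).
results = {
    "LQCD_a0": 1.0,
    "LQCD_a1": 2.0,
    "LQCD_a2": 3.0,
    "LQCD_RDst": 0.0,
    "LQCDEXP_a0": 4.0,
    "LQCDEXP_eVcb": 0.0,
}

print(collect_coefficients(results, "LQCD", "a"))
print(collect_coefficients(results, "LQCDEXP", "a"))
```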

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Funded by SCOAP³. SCOAP³ supports the goals of the International Year of Basic Sciences for Sustainable Development.

About this article

Cite this article

Bazavov, A., DeTar, C.E., Du, D. et al. Semileptonic form factors for $B\rightarrow D^*\ell \nu $ at nonzero recoil from $2+1$-flavor lattice QCD. Eur. Phys. J. C 82, 1141 (2022). https://doi.org/10.1140/epjc/s10052-022-10984-9

Received: 13 May 2022
Accepted: 31 October 2022
Published: 16 December 2022
DOI: https://doi.org/10.1140/epjc/s10052-022-10984-9
