Abstract
We establish an estimate related to the \(H^1-BMO\) duality in the dyadic setting. Specifically, for any Hilbert space \({\mathcal {H}}\) over \({\mathbb {R}}\) and any functions \(\varphi ,\,\psi :{\mathbb {R}}\rightarrow {\mathcal {H}}\), we have
The constant \(\sqrt{2}\) is the best possible. The proof rests on the existence of a certain special function enjoying appropriate size and concavity requirements.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
A classical result of Fefferman states that the class BMO, the space of functions of bounded mean oscillation, is the dual space of \(H^1\). This fact, originally proved in [2] in the analytic setting, was later extended to the probabilistic context by Getoor and Sharpe [3]. The purpose of this paper is to establish a certain sharp estimate related to this duality in the one-dimensional context.
We start with some background and notation. Throughout, for brevity, the symbol \({\mathcal {I}}\) will stand for the interval [0, 1]. Let \((h_n)_{n\ge 0}\) be the Haar system on [0, 1], i.e.,
and so on. Let \({\mathcal {H}}\) be a separable real Hilbert space with scalar product \(\cdot \) and norm \(|\cdot |\). For any dyadic subinterval \(I\subseteq {\mathcal {I}}\) and an integrable function \(\varphi :{\mathcal {I}}\rightarrow {\mathcal {H}}\), we will write \(\langle \varphi \rangle _I\) for the average of \(\varphi \) over I: that is, \(\langle \varphi \rangle _I=\frac{1}{|I|}\int _I \varphi \text{ d }u\). Furthermore, for any such \(\varphi \) and any nonnegative integer n, we will use the notation
for the projection of \(\varphi \) on the subspace generated by the first \(n+1\) Haar functions (\(I_k\) denotes the support of \(h_k\)). The dyadic square function of \(\varphi \) is
where the summation runs over all nonnegative integers n such that \(x\in I_n\). The Hardy space \(H^1({\mathcal {I}})\) consists of those \(\varphi \) on \({\mathcal {I}}\), for which the norm \(\Vert \varphi \Vert _{H^1}=\Vert S(\varphi )\Vert _{L^1}\) is finite. All the above definitions extend obviously to the case when \({\mathcal {I}}\) is replaced by an arbitrary dyadic subinterval of \({\mathbb {R}}\). Actually, one easily defines \(S(\varphi )\) and \(H^1\) when the underlying space is the entire line \({\mathbb {R}}\). To this end, consider in (1.1) the sum over all dyadic intervals I contained in \({\mathbb {R}}\), and replace \(I_n\) with I and \(h_n\) with \(h_I=\chi _{I^-}-\chi _{I^+}\), where \(I^-\), \(I^+\) are the left and right halves of I. The definition of \(H^1\) remains unchanged.
A function \(\psi :{\mathcal {I}}\rightarrow {\mathcal {H}}\) belongs to the (dyadic) class BMO if
where the supremum is taken over all dyadic subintervals I of \({\mathcal {I}}\). This space, introduced by John and Nirenberg in [6], plays a fundamental role in harmonic analysis and probability theory. It is well-known that it often serves as a convenient substitute for \(L^\infty \): many important operators—e.g., singular integrals, area functions—are not bounded on \(L^\infty \), but send \(L^\infty \) into BMO instead (cf. [4]). In addition, BMO behaves nicely from the viewpoint of interpolation, see, e.g., [1, 4]. Our motivation comes from another property, already mentioned above: this space is dual to \(H^1\). Specifically, it follows from the above result of Fefferman that for \({\mathcal {H}}={\mathbb {R}}\), there is a finite constant C such that for any \(\varphi \in {\mathcal {H}}^1\) and any \(\psi \in BMO\) with \(\langle \psi \rangle _{\mathcal {I}}=0\),
We will identify the optimal constant in this estimate, in the context of general Hilbert spaces \({\mathcal {H}}\). Here is our main result.
Theorem 1.1
For any Hilbert space \({\mathcal {H}}\), the inequality (1.2) holds with the constant \(C=\sqrt{2}\). The constant is the best possible already for \({\mathcal {H}}={\mathbb {R}}\).
Obviously, this result is valid if we replace the interval \({\mathcal {I}}\) by an arbitrary dyadic subinterval of \({\mathbb {R}}\). Actually, by a standard limiting argument, it also remains true if \({\mathcal {I}}\) is replaced with the real line \({\mathbb {R}}\). There is a natural question about the optimal constant in the higher-dimensional setting. Unfortunately, although our argumentation seems to work up to some point, we have not managed to push all the relevant calculations through.
We would also like to mention two related works from the literature. First, Getoor and Sharpe [3] obtained the probabilistic version of the above inequality (with the same constant \(\sqrt{2}\)) for continuous, real-valued martingales. That result can be deduced from Theorem 1.1: namely, one can rephrase the estimate (1.2) in the language of the so-called dyadic martingales which, by approximation, lead to the continuous case. The reverse implication does not seem to be valid. We would also like to refer the reader to the paper [11] by Slavin and Volberg, which contains the proof of a certain version of (1.2) in the context of Triebel–Lizorkin spaces.
A few words about the proof are in order. Our approach will exploit the so-called Bellman function method, which reduces the study of (1.2) to the existence of a certain special function. The efficiency of this technique in the study of various sharp BMO estimates has been confirmed in numerous papers: we refer the interested reader to the works [5, 7,8,9,10, 12] (the complete list is much longer). We should stress, however, that the estimate (1.2) is four-dimensional, i.e., the corresponding Bellman function involves four variables. Hence, it does not fall into scope of the contexts studied in the aforementioned papers, which mostly handled two-dimensional problems.
The remaining part is organized as follows. The general version of our approach is described in detail in the next section. The final part of the paper is devoted to the proof of Theorem 1.1.
2 On the method of proof
Fix a Hilbert space \({\mathcal {H}}\) and consider the four-dimensional domain
Next, let \(V:{\mathcal {H}}\times [0,\infty )\times {\mathcal {H}}\rightarrow {\mathbb {R}}\) be a given function and suppose that we want to establish the estimate
for all simple functions \(\varphi ,\,\psi :{\mathcal {I}}\rightarrow {\mathcal {H}}\) satisfying \(||\psi ||_{BMO}\le 1\) and \(\langle \psi \rangle _{\mathcal {I}}=0\). Here by simplicity we mean that \(\varphi \), \(\psi \) can be written as finite sums of the form \(\sum _{k=0}^m a_kh_{k}\) for some integer m and some coefficients \(a_0\), \(a_1\), \(\ldots \), \(a_m\in {\mathcal {H}}\). This assumption implies in particular that the integral appearing in (2.1) makes sense. To study this problem, consider the class \({\mathcal {U}}(V)\) consisting of all \(U:{\mathscr {D}}\rightarrow (-\infty ,\infty ]\) satisfying the following three requirements:
and the further condition that for any \((x,s,y,z)\in {\mathscr {D}}\) and any \(d,\,e\in {\mathcal {H}}\), \(f\in {\mathbb {R}}\) satisfying \((x_\pm ,\sqrt{s^2+|d|^2},y_\pm ,z_\pm ):=(x\pm d,\sqrt{s^2+|d|^2},y\pm e,z\pm f)\in {\mathscr {D}}\),
So, the functions from \({\mathcal {U}}(V)\) cannot be too big nor too small, and they must satisfy a mid-concavity-type property. The connection between the class \({\mathcal {U}}(V)\) and the estimate (2.1) is studied in the two theorems below.
Theorem 2.1
If the class \({\mathcal {U}}(V)\) is nonempty, then (2.1) holds true for all simple \(\varphi ,\,\psi \) satisfying \(\Vert \psi \Vert _{BMO}\le 1\) and \(\langle \psi \rangle _{\mathcal {I}}=0\).
Proof
Fix \(\varphi \) and \(\psi \). The first step of our analysis is to prove that the sequence
is nonincreasing. (Let us comment here that \((|\psi |^2)_n\) is the projection of \(|\psi |^2\) on the subspace generated by the first \(n+1\) Haar functions.) Observe that \(0\le (|\psi |^2)_n-|\psi _n|^2\le 1\), where the left estimate follows from Schwarz’ inequality, while the right is due to \(\Vert \psi \Vert _{BMO}\le 1\). So, in particular, the integrals in (2.5) are well-defined: the point \(\big (\varphi _n,S(\varphi _{n}),\psi _n,(|\psi |^2)_n\big )\) belongs to the domain of U. Fix \(n\ge 1\), denote by I the support of \(h_n\), and let \(I^-\), \(I^+\) be the left and the right halves of I. Observe that \(\varphi _{n-1}\), \(S(\varphi _{n-1})\), \(\psi _{n-1}\), and \((|\psi |^2)_{n-1}\) are constant on I; denote the corresponding values by x, s, y, and z, respectively. Furthermore, there exist \(d,\,e\in {\mathcal {H}}\) and \(f\in {\mathbb {R}}\) such that \(\varphi _n \equiv x\pm d\), \(S(\varphi _n)\equiv \sqrt{s^2+|d|^2}\), \(\psi _n\equiv y\pm e\), and \((|\psi |^2)_n\equiv z+f\) on \(I^\pm \), respectively. Therefore,
which, by (2.4), does not exceed
Since \((\varphi _n,S(\varphi _n), \psi _n,(|\psi |^2)_n)\) and \((\varphi _{n-1},S(\varphi _{n-1}),\psi _{n-1},(|\psi |^2)_{n-1})\) coincide on \({\mathcal {I}}\setminus I\), the monotonicity of the sequence (2.5) follows. Now, recall that \(\varphi \), \(\psi \) are simple, which implies that there is m such that \(\varphi _m=\varphi \), \(S(\varphi _m)=S(\varphi )\), \(\psi _m=\psi \), and \((|\psi |^2)_m=|\psi |^2\). Combining this with (2.2) and (2.3), we obtain
This completes the proof. \(\square \)
The beautiful feature of the method is that the implication of the above theorem can be reversed. For \((y,z)\in {\mathcal {H}}\times [0,\infty )\) such that \(|y|^2\le z\le |y|^2+1\), let \({\mathcal {M}}(y,z)\) denote the class of all simple functions \(\psi :{\mathcal {I}}\rightarrow {\mathcal {H}}\) satisfying \(\Vert \psi \Vert _{BMO}\le 1\), \(\langle \psi \rangle _{\mathcal {I}}=y\), and \(\langle |\psi |^2\rangle _{\mathcal {I}}=z\). The class \({\mathcal {M}}(y,z)\) is nonempty: for example, it contains the function \(\psi =(y-e)\chi _{[0,1/2)}+(y+e)\chi _{[1/2,1)}\), where \(e\in {\mathcal {H}}\) is a vector satisfying \(|e|^2=z-|y|^2\). Define \(U^0:{\mathscr {D}}\rightarrow (-\infty ,\infty ]\) by
where the supremum is taken over all simple \(\varphi \) with \(\langle \varphi \rangle _{\mathcal {I}}=x\) and all \(\psi \in {\mathcal {M}}(y,z)\). Considering the constant function \(\varphi \equiv x\) and the above “two-point” function \(\psi =(y-e)\chi _{[0,1/2)}+(y+e)\chi _{[1/2,1)}\in {\mathcal {M}}(y,z)\), we derive that
Theorem 2.2
If (2.1) holds for all simple \(\varphi ,\,\psi :{\mathcal {I}}\rightarrow {\mathcal {H}}\) satisfying \(||\psi ||_{BMO}\le 1\), then the class \({\mathcal {U}}(V)\) is nonempty and \(U^0\) is its least element.
Proof
Let us first handle the minimality of \(U^0\). Pick \(U \in {\mathcal {U}}(V)\), \((x,s,y,z)\in {\mathscr {D}}\) and an arbitrary pair \(\varphi \), \(\psi \) as in the definition of \(U^0(x,s,y,z)\). Repeating the arguments in (2.6), one gets
So, taking the supremum over \(\varphi \) and \(\psi \), we get \(U^0(x,s,y,z)\le U(x,s,y,z)\) and therefore \(U^0\) is indeed the smallest element.
Now we check that \(U^0\) belongs to the class \({\mathcal {U}}(V)\). The majorization (2.2) is the direct consequence of (2.8): if \(z=y^2\), then \(e=0\) and hence the bound follows. The condition (2.3) is also easy: by (2.1), for any \(\varphi \) of average x and any \(\psi \in {\mathcal {M}}(0,z)\), we have
Taking the supremum over all such \(\varphi \) and \(\psi \), we get \(U^0(x,|x|,0,z)\le 0\). To prove (2.4), fix \(x,\,s,\,y,\,z,\,d,\,e,\,f\) as in the statement of this condition. Pick two simple functions \(\varphi _\pm :{\mathbb {R}}\rightarrow {\mathcal {H}}\) satisfying \(\langle \varphi _\pm \rangle _{\mathcal {I}}=x_\pm \), and two functions \(\psi _\pm \in {\mathcal {M}}(y_\pm ,z_\pm )\). Next, splice these objects by the formula
Then \(\varphi \) has the average x since
Similarly, we have
and hence \(\psi \in {\mathcal {M}}(y,z)\) (the inequality \(\Vert \psi \Vert _{BMO}\le 1\) is directly inherited from the analogous estimate for \(\psi _\pm \)). Finally, we easily check that
Therefore, by the very definition of \(U^0\),
Taking the supremum over all \(\varphi _\pm \) and \(\psi _\pm \) as above yields (2.4). \(\square \)
3 Proof of Theorem 1.1
3.1 Proof of (1.2)
The desired estimate is of the form (2.1) with \(V(x,s,y)=x\cdot y-\sqrt{2}s\), and by a straightforward approximation, it is enough to establish it for simple functions only. By virtue of Theorem 2.1, the estimate will be proved if we find an element of \({\mathcal {U}}(V)\). Consider the function \(U:{\mathscr {D}}\rightarrow {\mathbb {R}}\) given by
Some steps which lead to the discovery of this object can be found in the next subsection. Let us verify that the function enjoys all the required properties. The majorization (2.2) is equivalent to \(z-|y|^2\ge 0\), which is guaranteed in the definition of the domain \({\mathscr {D}}\). The condition (2.3), after straightforward manipulations, transforms into \(s(z-2)\le 0\), which follows from the estimates \(s\ge 0\) and \(z\in [0,1]\) (the latter being the consequence of \((x,|x|,0,z)\in {\mathscr {D}}\)). It remains to establish the concavity-type condition (2.4). Note that
and hence (2.4) can be rewritten in the form
We have \(z-|y|^2\le 1\) (since \((x,s,y,z)\in {\mathscr {D}}\)) and \(d\cdot e\le |d||e|\), so it is enough to show that
Consider the left-hand side above as a quadratic function of |e|; the discriminant of this function is equal to
Therefore (3.1) holds and hence \(U\in {\mathcal {U}}(V)\). This completes the proof of (1.2).
3.2 Sharpness
The examples, which show that the constant \(\sqrt{2}\) is optimal in (1.2), have very complicated structure and their analysis is quite involved. Most of these technicalities an be avoided, by an appropriate use of Theorem 2.1, as we shall see below. Throughout this subsection, we let \({\mathcal {H}}={\mathbb {R}}\) and assume that the inequality (1.2) holds with some constant C. This estimate is of the form (2.1), with \(V(x,s,y)=xy-Cs\). Recall the definition (2.7) of the associated special function \(U^0\):
the supremum taken over all simple \(\varphi \) of average x and all \(\psi \in {\mathcal {M}}(y,z)\). We start with some homogeneity-type properties of \(U^0\) (which also give some hint how the function U used in the previous subsection was discovered).
Lemma 3.1
For any \(\lambda >0\) and \(a\in {\mathbb {R}}\), we have
and
Consequently, \(U^0(x,s,y,z)=xy+s\zeta (z-y^2)\) for some function \(\zeta :[0,1]\rightarrow {\mathbb {R}}\).
Proof
Fix an arbitrary \(\varphi \) of average x and any \(\psi \in {\mathcal {M}}(y,z)\). Then \(\lambda \varphi \) is also simple and has average \(\lambda x\), so by the definition of \(U^0(\lambda x,\lambda s,y,z)\),
Taking the supremum over all \(\varphi \) and \(\psi \) as above, we get \(U^0(\lambda x,\lambda s,y,z)\ge \lambda U^0(x,s,y,z)\). Putting \({\bar{x}}=\lambda x\), \({\bar{s}}=\lambda s\), and \(\bar{\lambda }=\lambda ^{-1}\), we get the reverse bound (with x, s, \(\lambda \) replaced by \({\bar{x}}\), \({\bar{s}}\), \({\bar{\lambda }}\)). The identity (3.3) is shown similarly: for any \(a\in {\mathbb {R}}\) and any simple \(\varphi \) of average x, we have \(\langle \varphi +a\rangle _{\mathcal {I}}=x+a\) and hence for any \(\psi \in {\mathcal {M}}(y,z)\),
Consequently, we get \(U^0(x+a,\lambda s,y,z)\ge ay+U^0(x,s,y,z)\) and the reverse bound follows by replacing a with \(-a\). The proof of the identity (3.4) is the same and is left to the reader. To show the final claim, we use (3.3) (with \(a=-x\)), (3.4) (with \(a=-y\)), and then (3.2) (with \(\lambda =s^{-1}\)) to obtain
Thus we let \(\zeta (u)=U^0(0,1,0,u)\). It remains to show that \(\zeta \) is finite (recall that the elements of \({\mathcal {U}}(V)\), in general, are allowed to take infinite values). But this is simple: applying (3.5) to \(x=s=1\), \(y=0\), and \(z=u\) gives \(\zeta (u)=U^0(1,1,0,u)\), which, by (2.3), is nonpositive. \(\square \)
We are ready to prove that \(C\ge \sqrt{2}\). Fix \(\varepsilon \in {\mathbb {R}}\) and apply the inequality (2.4) to \((x,s,y,z)=(1,1,0,1)\) and \((d,e,f)=(\sqrt{2}\varepsilon ,\varepsilon ,0)\) to obtain
Now it follows from (2.4) that the function \(z\mapsto U^0(1-\sqrt{2}\varepsilon ,\sqrt{1+2\varepsilon ^2},-\varepsilon ,z)\) is midpoint concave on \([\varepsilon ^2,1+\varepsilon ^2]\). Since it is bounded from below (by (2.8), we have \(U^0(x,s,y,z)\ge xy-Cs\) for all \((x,s,y,z)\in {\mathscr {D}}\)), it is merely concave and therefore
However, the condition (2.2) implies
and by (3.4), we get
Plugging these two observations into (3.7) gives
Replacing \(\varepsilon \) with \(-\varepsilon \), we see that we also have
Combining the last two inequalities with (3.6) gives
Now observe that \((1-\varepsilon ^2)\sqrt{1+2\varepsilon ^2}=1+o(\varepsilon ^2)\) as \(\varepsilon \rightarrow 0\). Therefore, moving the term \((1-\varepsilon ^2)\sqrt{1+2\varepsilon ^2}\zeta (1)\) to the left, dividing by \(\varepsilon ^2\) and letting \(\varepsilon \rightarrow 0\) gives \(0\ge \sqrt{2}-C\), which is equivalent to the desired bound \(C\ge \sqrt{2}\).
References
Bennett, C., DeVore, R.A., Sharpley, R.: Weak-\(L^{\infty }\) and \(BMO\). Ann. Math. 113, 601–611 (1981)
Fefferman, C.: Characterization of bounded mean oscillation. Bull. Am. Math. Soc. 77, 587–588 (1971)
Getoor, R.K., Sharpe, M.J.: Conformal martingales. Invent. Math. 16, 271–308 (1972)
Grafakos, L.: Classical Fourier Analysis. Graduate Texts in Mathematics, 2nd edn. Springer, New York (2008)
Ivanisvili, P., Osipov, N., Stolyarov, D., Vasyunin, V., Zatitskiy, P.: Bellman function for extremal problems in \(BMO\). Trans. Am. Math. Soc. 368, 3415–3468 (2016)
John, F., Nirenberg, L.: On functions of bounded mean oscillation. Comm. Pure Appl. Math. 14, 415–426 (1961)
Osękowski, A.: Sharp maximal estimates for \(BMO\) martingales. Osaka J. Math. 52, 1125–1143 (2015)
Osękowski, A.: Weak-\(\text{L}^\infty \) inequalities for \(BMO\) functions. New York J. Math. 21, 699–713 (2015)
Slavin, L., Vasyunin, V.: Sharp results in the integral-form John–Nirenberg inequality. Trans. Am. Math. Soc. 363, 4135–4169 (2011)
Slavin, L., Vasyunin, V.: Sharp \(L^ p\) estimates on \(BMO\). Indiana Math. J. 61, 1051–1110 (2012)
Slavin, L., Volberg, A.: Bellman Function and the \(H^1-BMO\) Duality. In: Gavosto, E.A., Korten, M.K., Moore, C.N., Torres, R.H. (eds.) Harmonic Analysis, Partial Differential Equations, and Related Topics. Contemporary Mathematics, vol. 428, pp. 113–126. American Mathematical Society, Providence, RI (2007)
Vasyunin, V., Volberg, A.: Sharp constants in the classical weak form of the John–Nirenberg inequality. Proc. Lond. Math. Soc. 108, 1417–1434 (2014)
Acknowledgements
The author would like to thank the anonymous referee for the careful reading of the first version of the paper and several helpful suggestions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The research was supported by the National Science Centre Poland, Grant DEC-2014/14/E/ST1/00532.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Osękowski, A. A note on \(\varvec{H^1-BMO}\) duality. Arch. Math. 114, 687–697 (2020). https://doi.org/10.1007/s00013-020-01456-1
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00013-020-01456-1